BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 035980
         (857 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225435510|ref|XP_002285548.1| PREDICTED: uncharacterized protein LOC100246702 [Vitis vinifera]
          Length = 864

 Score = 1267 bits (3279), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 616/858 (71%), Positives = 709/858 (82%), Gaps = 9/858 (1%)

Query: 5   FVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
            V+F F   G  LGK+CTN  +   SH+FRYEL  S N++WK E+  H+HL  TDDSAWS
Sbjct: 11  IVVFAFVLCGCVLGKECTNVPTQLSSHSFRYELLASNNESWKAEMFQHYHLIHTDDSAWS 70

Query: 63  SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
           +L+P K+L ++ DE SWA++YR +KN  G +   NFLKE+SLHDV LD  S+  RAQQTN
Sbjct: 71  NLLPRKLLREE-DEFSWAMMYRNMKNYDGSN--SNFLKEMSLHDVRLDSDSLHGRAQQTN 127

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
           L+YLL+LDVD LVWSFRKTA L TPG  YGGWE P  ELRGHFVGHY+SASAQMWASTHN
Sbjct: 128 LDYLLILDVDRLVWSFRKTAGLSTPGLPYGGWEAPNVELRGHFVGHYMSASAQMWASTHN 187

Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
            T+KEKMS VV +L+ CQ K+GTGYLSAFP+ELFD FEA+KPVWAPYYTIHKILAGLLDQ
Sbjct: 188 DTLKEKMSAVVSALATCQEKMGTGYLSAFPSELFDRFEAIKPVWAPYYTIHKILAGLLDQ 247

Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
           Y  A N+QALKM TWMVE+FY RVQ VITMYS+ERHW SLNEETGGMNDVLYRLYSIT D
Sbjct: 248 YTFAGNSQALKMMTWMVEHFYKRVQNVITMYSLERHWLSLNEETGGMNDVLYRLYSITGD 307

Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
            KHL+LAHLFDKPCFLG LA+QAD +S FHANTHIP+VIGSQMRYEVTGDPLYK IGTFF
Sbjct: 308 QKHLVLAHLFDKPCFLGLLAVQADSISGFHANTHIPVVIGSQMRYEVTGDPLYKAIGTFF 367

Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
           MDIVN+SHSYATGGTS  EFW DPKRLA TL  ENEE+CTTYNMLKVSRHLFRWTKE+ Y
Sbjct: 368 MDIVNSSHSYATGGTSVGEFWSDPKRLASTLQRENEESCTTYNMLKVSRHLFRWTKEVVY 427

Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
           ADYYERALTNGVLSIQRGT+PGVMIYMLPLGRG SKARS HGWGTKF+SFWCCYGTGIES
Sbjct: 428 ADYYERALTNGVLSIQRGTDPGVMIYMLPLGRGDSKARSYHGWGTKFDSFWCCYGTGIES 487

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
           FSKLGDSIYFEEEG  P +YIIQYISSS DWKSG +VLNQKVDP+VSWDPYLR TLTF+ 
Sbjct: 488 FSKLGDSIYFEEEGKSPEVYIIQYISSSLDWKSGQIVLNQKVDPVVSWDPYLRTTLTFTP 547

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
           K+  GQ S++NLR+PVW  S+GA+AS+N Q+LP+P P +FLS T  WS  DKLT+QLP+ 
Sbjct: 548 KEGAGQSSTINLRIPVWASSSGAKASINAQDLPVPAPSSFLSLTRNWSPGDKLTLQLPIR 607

Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
           LRTEAI+DDRP+YASIQAIL+GPYLLAG TS +WDIKTG+A SLS  I+PIP S N++LV
Sbjct: 608 LRTEAIKDDRPKYASIQAILYGPYLLAGLTSDDWDIKTGSATSLSDWITPIPASDNSRLV 667

Query: 663 TFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKS 722
           + +QESGNS+FV SNSNQSITME+FP  GTDA+LHATFRL+LKDA+     S  + IGKS
Sbjct: 668 SLSQESGNSSFVFSNSNQSITMEKFPEEGTDASLHATFRLVLKDATSLKVLSPKDAIGKS 727

Query: 723 VMLEPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCF 781
           VMLEP D PGM +VQQG    L ++ S    GS  F LVAGLD ++ TVSLE+E++K C+
Sbjct: 728 VMLEPIDLPGMVVVQQGTNQNLGIANSAAGKGSL-FHLVAGLDGKDGTVSLESESQKDCY 786

Query: 782 VSSGVNFEPGASLKL--LCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLA 839
           V SG+++  G S+KL  L  + S D  FN+A SF+++ GIS+YHPISFVAKG +RNFLL 
Sbjct: 787 VYSGIDYNSGTSIKLKSLSESGSSDEDFNKATSFILKEGISQYHPISFVAKGMKRNFLLT 846

Query: 840 PLLSFRDEAYTVYFNIQD 857
           PLL  RDE+YTVYFNIQD
Sbjct: 847 PLLGLRDESYTVYFNIQD 864


>gi|224053368|ref|XP_002297785.1| predicted protein [Populus trichocarpa]
 gi|222845043|gb|EEE82590.1| predicted protein [Populus trichocarpa]
          Length = 858

 Score = 1264 bits (3271), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 604/855 (70%), Positives = 711/855 (83%), Gaps = 7/855 (0%)

Query: 5   FVLFFFFCFGLALGKQCTNQ-SPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
            V+    C G    K+CTN  +   SH FRY L +S N+TWKEE+ +H+HLTPTDDSAW+
Sbjct: 7   LVVLSMLC-GFGTSKECTNTPTQLSSHTFRYALLSSENETWKEEMFAHYHLTPTDDSAWA 65

Query: 63  SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
           +L+P KIL  ++DE SWA++YR +K+P      GNFLKEVSLH+V LD SS+ W+AQQTN
Sbjct: 66  NLLPRKIL-REEDEYSWAMMYRNLKSP--LKSSGNFLKEVSLHNVRLDPSSIHWQAQQTN 122

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
           LEYLLMLDVDSLVWSFRKTA L TPG AYGGWE P  ELRGHFVGHYLSASAQMWASTHN
Sbjct: 123 LEYLLMLDVDSLVWSFRKTAGLSTPGTAYGGWEAPNCELRGHFVGHYLSASAQMWASTHN 182

Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
             ++++MS VV +LS CQ K+G+GYLSAFP+ELFD FEA+KPVWAPYYTIHKILAGLLDQ
Sbjct: 183 DILEKQMSAVVSALSSCQEKMGSGYLSAFPSELFDRFEAIKPVWAPYYTIHKILAGLLDQ 242

Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
           Y  ADNAQALKM  WMV+YFYNRV+ VIT +SVERH+ SLNEETGGMNDVLY+L+SIT D
Sbjct: 243 YTFADNAQALKMVKWMVDYFYNRVRNVITNFSVERHYQSLNEETGGMNDVLYKLFSITGD 302

Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
           PKHL+LAHLFDKPCFLG LA+QA+ +S FHANTHIPIVIG+QMRYE+TGDPLYK IGTFF
Sbjct: 303 PKHLVLAHLFDKPCFLGLLAVQAEDISGFHANTHIPIVIGAQMRYEITGDPLYKDIGTFF 362

Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
           MDIVN+SHSYATGGTS  EFW DPKRLA TL +ENEE+CTTYNMLKVSRHLFRWTKE+AY
Sbjct: 363 MDIVNSSHSYATGGTSVSEFWSDPKRLASTLQTENEESCTTYNMLKVSRHLFRWTKEMAY 422

Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
           ADYYERALTNGVL IQRGTEPGVMIYMLP   G SK +S HGWGT +++FWCCYGTGIES
Sbjct: 423 ADYYERALTNGVLGIQRGTEPGVMIYMLPQHPGSSKGKSYHGWGTLYDTFWCCYGTGIES 482

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
           FSKLGDSIYFEEEG  PGLYIIQYISSS DWKSG +++NQKVDP+VS DPYLR+T TFS 
Sbjct: 483 FSKLGDSIYFEEEGEAPGLYIIQYISSSLDWKSGQIMINQKVDPVVSSDPYLRVTFTFSP 542

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
            +   Q S+LNLR+PVWT+ +GA A++N Q+L +P PG+FLS   +WS  DKL++QLP+S
Sbjct: 543 NKGSSQASTLNLRIPVWTHLDGATATINSQSLAIPAPGSFLSVNRKWSSGDKLSLQLPIS 602

Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
           LRTEAIQDDR +YASIQAIL+GPYLLAGHTSG+W++K G+A SLS  I+PIP S+N QLV
Sbjct: 603 LRTEAIQDDRHQYASIQAILYGPYLLAGHTSGDWNLKAGSAGSLSDSITPIPASYNEQLV 662

Query: 663 TFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKS 722
           +F+Q+SGNSTFV++NSNQSITMEE P SGTDA L ATFR++  D+S S    +N+VI KS
Sbjct: 663 SFSQDSGNSTFVLTNSNQSITMEEHPKSGTDACLQATFRIVFNDSSSSEVLGINDVIDKS 722

Query: 723 VMLEPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCF 781
           VMLEPFD PGM LVQQGK+  L V+ S  + GSS F +V GLD ++ TVSLE+ +++GC+
Sbjct: 723 VMLEPFDLPGMLLVQQGKDSSLAVTNSAADDGSSIFHVVLGLDGKDGTVSLESGSQEGCY 782

Query: 782 VSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPL 841
           + SGVN++ G S+KL C   S D GFN+ ASF+M  G+SEYHPISFVA+G +RNFLLAPL
Sbjct: 783 IYSGVNYKSGQSMKLSCKLGSSDPGFNQGASFVMNKGLSEYHPISFVAEGDKRNFLLAPL 842

Query: 842 LSFRDEAYTVYFNIQ 856
            S RDE YT+YFNIQ
Sbjct: 843 HSLRDEFYTIYFNIQ 857


>gi|224075776|ref|XP_002304762.1| predicted protein [Populus trichocarpa]
 gi|222842194|gb|EEE79741.1| predicted protein [Populus trichocarpa]
          Length = 858

 Score = 1251 bits (3237), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 608/848 (71%), Positives = 714/848 (84%), Gaps = 10/848 (1%)

Query: 13  FGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWSSLIPSKIL 70
           FG++  K+CTN  +   SH+FRYEL +S N+TWKEE+  H+HL PTDDSAWSSL+P KIL
Sbjct: 16  FGIS--KECTNIPTQLSSHSFRYELLSSQNETWKEEMFEHYHLIPTDDSAWSSLLPRKIL 73

Query: 71  GDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLD 130
            ++ DE SW ++YR +K+P      GNFL E+SLH+V LD SS+ W+AQQTNLEYLLMLD
Sbjct: 74  REE-DEHSWEMMYRNLKSP--LKSSGNFLNEMSLHNVRLDPSSIHWKAQQTNLEYLLMLD 130

Query: 131 VDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMS 190
           V++LVWSFRKTA   TPGKAYGGWE P SELRGHFVGHYLSASAQMWASTHN T+K+KMS
Sbjct: 131 VNNLVWSFRKTAGSSTPGKAYGGWEKPDSELRGHFVGHYLSASAQMWASTHNETLKKKMS 190

Query: 191 TVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQ 250
            VV +LS CQ K+GTGYLSAFP+ELFD FEA+KPVWAPYYTIHKILAGLLDQY LADNAQ
Sbjct: 191 AVVSALSACQVKMGTGYLSAFPSELFDRFEAIKPVWAPYYTIHKILAGLLDQYTLADNAQ 250

Query: 251 ALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAH 310
           ALKM  WMV+YFYNRV+ VIT YSVERH+ SLNEETGGMNDVLY+L+SIT DPKHL+LAH
Sbjct: 251 ALKMVKWMVDYFYNRVRNVITNYSVERHYLSLNEETGGMNDVLYKLFSITGDPKHLVLAH 310

Query: 311 LFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASH 370
           LFDKPCFLG LA+QAD +S FHANTHIP+VIG+QMRYE+TGDPLYK IG FFMD+VN+SH
Sbjct: 311 LFDKPCFLGLLAVQADDISGFHANTHIPVVIGAQMRYEITGDPLYKDIGAFFMDVVNSSH 370

Query: 371 SYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERAL 430
           SYATGGTS  EFW DPKRLA TL +ENEE+CTTYNMLKVSRHLFRWTKE+AYADYYERAL
Sbjct: 371 SYATGGTSVSEFWSDPKRLASTLQTENEESCTTYNMLKVSRHLFRWTKEMAYADYYERAL 430

Query: 431 TNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSI 490
           TNGVL IQRGTEPGVMIYMLP   G SKA+S HGWGT ++SFWCCYGTGIESFSKLGDSI
Sbjct: 431 TNGVLGIQRGTEPGVMIYMLPQYPGSSKAKSYHGWGTSYDSFWCCYGTGIESFSKLGDSI 490

Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
           YF EEG  PGLYIIQYISSS DWKSG +VLNQKVDPIVS DPYLR+TLTFS K+   Q S
Sbjct: 491 YF-EEGEAPGLYIIQYISSSLDWKSGQIVLNQKVDPIVSSDPYLRVTLTFSPKKGTSQAS 549

Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
           +L LR+P+WT S GA A++N Q+L LP PG+FLS   +W  +DKLT+Q+P+SLRTEAI+D
Sbjct: 550 TLYLRIPIWTNSEGATATINSQSLRLPAPGSFLSVNRKWRSSDKLTLQIPISLRTEAIKD 609

Query: 611 DRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGN 670
           +R EYAS+QAIL+GPYLLAGHTSG+W++K+G+  SLS  I+PIP S+N QLV+F+QESG 
Sbjct: 610 ERHEYASVQAILYGPYLLAGHTSGDWNLKSGSGNSLSDSITPIPGSYNGQLVSFSQESGI 669

Query: 671 STFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDF 730
           STFV++NSNQSI+ME+ P SGTDA+L ATFRL+ KD+S S  SS+ +VIGKSVMLEPF  
Sbjct: 670 STFVLTNSNQSISMEKLPESGTDASLQATFRLVFKDSSSSKLSSVKDVIGKSVMLEPFHL 729

Query: 731 PGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFE 789
           PGM LVQQGK+    ++ S  + GSS FR+V+GLD ++ TVSLE+  + GC+V SGV+++
Sbjct: 730 PGMLLVQQGKDRSFTLTNSADDDGSSIFRVVSGLDGKDGTVSLESGIQNGCYVYSGVDYK 789

Query: 790 PGASLKLLC-STESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEA 848
            G S+KL C S  S D GFN+ ASF+M  G+S+YHPISFVAKG +RNFLLAPL S RDE+
Sbjct: 790 SGQSMKLSCKSGSSSDTGFNQGASFVMNKGLSQYHPISFVAKGDKRNFLLAPLHSLRDES 849

Query: 849 YTVYFNIQ 856
           YT+YFNIQ
Sbjct: 850 YTIYFNIQ 857


>gi|359478753|ref|XP_002283032.2| PREDICTED: uncharacterized protein LOC100250068 [Vitis vinifera]
          Length = 874

 Score = 1211 bits (3133), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 587/849 (69%), Positives = 691/849 (81%), Gaps = 9/849 (1%)

Query: 14  GLALGKQCTNQ-SPYDSHAFRYELT-STNKTWKEEVLSHF-HLTPTDDSAWSSLIPSKIL 70
           G  LGK+CTN  SP  SH  RYEL  S N++ K E L+H+ +L  TD S W + +P K L
Sbjct: 20  GCGLGKKCTNSGSPLSSHTLRYELLFSKNESRKAEALAHYSNLIRTDGSGWLTSLPRKAL 79

Query: 71  GDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLD 130
             ++DE S A+ Y+ +K+  G +    FLKE SLHDV L   S+ WRAQQTNLEYLLMLD
Sbjct: 80  -REEDEFSRAMKYQTMKSYDGSN--SKFLKEFSLHDVRLGSDSLHWRAQQTNLEYLLMLD 136

Query: 131 VDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMS 190
            D LVWSFR+TA LPTP   YGGWE+P  ELRGHFVGHYLSASAQMWASTHN ++KEKMS
Sbjct: 137 ADRLVWSFRRTAGLPTPCSPYGGWESPDGELRGHFVGHYLSASAQMWASTHNESLKEKMS 196

Query: 191 TVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQ 250
            VV +L ECQ K+GTGYLSAFP+ELFD FEAL+ VWAPYYTIHKILAGLLDQY L  NAQ
Sbjct: 197 AVVCALGECQKKMGTGYLSAFPSELFDRFEALEEVWAPYYTIHKILAGLLDQYTLGGNAQ 256

Query: 251 ALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAH 310
           ALKM TWMVEYFYNRVQ VI+ YS+ERHW SLNEETGGMND LY LY IT D KH +LAH
Sbjct: 257 ALKMVTWMVEYFYNRVQNVISSYSIERHWLSLNEETGGMNDFLYNLYRITGDQKHFVLAH 316

Query: 311 LFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASH 370
           LFDKPCFLG LA+QAD +S FHANTHIPIV+G+QMRYE+TGDPLYK IG FF+D VN+SH
Sbjct: 317 LFDKPCFLGLLAMQADDISGFHANTHIPIVVGAQMRYEITGDPLYKTIGAFFIDTVNSSH 376

Query: 371 SYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERAL 430
           SYATGGTS  EFW DPKR+A TL +EN E+CTTYNMLKVSR+LFRWTKE+AYADYYERAL
Sbjct: 377 SYATGGTSVDEFWSDPKRMATTLQTENAESCTTYNMLKVSRNLFRWTKEVAYADYYERAL 436

Query: 431 TNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSI 490
           TNG+LSIQRGT+PGVM+YMLPLG G SKARS HGWGTKF+SFWCCYGTGIESFSKLGDSI
Sbjct: 437 TNGILSIQRGTDPGVMLYMLPLGHGNSKARSYHGWGTKFHSFWCCYGTGIESFSKLGDSI 496

Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK--QEVGQ 548
           YFEEEG VPGLYIIQYISSS DWKSG VVLNQKVD +VSWDPYLR+TLTFS K  Q  GQ
Sbjct: 497 YFEEEGEVPGLYIIQYISSSLDWKSGQVVLNQKVDTVVSWDPYLRITLTFSPKKMQGAGQ 556

Query: 549 LSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
            S++NLR+PVW YS+GA+A++N Q LP+P P +FLS   +WS +DKLT+QLP++LRTEAI
Sbjct: 557 SSAINLRIPVWAYSSGAKAAVNAQALPVPAPNSFLSFRRKWSPDDKLTLQLPIALRTEAI 616

Query: 609 QDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQES 668
           +DDRP+YA +QAIL+GPYLL G T+ +WDI+T  A SLS  I+PIP S N+ L++ +QES
Sbjct: 617 KDDRPKYACLQAILYGPYLLVGLTNNDWDIQTDLAASLSDWITPIPASHNSHLISLSQES 676

Query: 669 GNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPF 728
           GNS+F  +NSNQS+TME +P SGTDA+L+ATFRLIL+D++ S  SS  + IGK VMLEP 
Sbjct: 677 GNSSFAFTNSNQSLTMERYPESGTDASLNATFRLILEDSTSSKISSPKDAIGKFVMLEPI 736

Query: 729 DFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVN 787
           +FPGM +VQ+G  + L ++ S   +GSS F LVAGLD ++ TVSLE++ +KGCFV S VN
Sbjct: 737 NFPGMAVVQRGTNESLGITNSASVVGSSLFHLVAGLDGKDGTVSLESKTQKGCFVYSDVN 796

Query: 788 FEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDE 847
           ++ G+++KL C   S D  FN+A SF ++ GISEYHPISFVAKG RR++LLAPLLS RDE
Sbjct: 797 YDSGSAIKLKCKLASSDVVFNQATSFTLKHGISEYHPISFVAKGLRRDYLLAPLLSLRDE 856

Query: 848 AYTVYFNIQ 856
           +YTVYFNIQ
Sbjct: 857 SYTVYFNIQ 865


>gi|356541912|ref|XP_003539416.1| PREDICTED: uncharacterized protein LOC100783150 [Glycine max]
          Length = 854

 Score = 1181 bits (3055), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/857 (67%), Positives = 688/857 (80%), Gaps = 12/857 (1%)

Query: 5   FVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWSS 63
           F L      G    K+CTN  P  SH FRYEL  STN TWK EV+ H+HLTPTD++AW+ 
Sbjct: 6   FALVAILLCGCDAAKECTN-IPTQSHTFRYELLMSTNATWKAEVMDHYHLTPTDETAWAD 64

Query: 64  LIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNL 123
           L+P K+L +Q ++  W ++YRKIKN G F     FLKEV L DV L + S+  RAQQTNL
Sbjct: 65  LLPRKLLSEQ-NQHDWGVMYRKIKNMGVFKSGEGFLKEVPLQDVRLHKDSIHGRAQQTNL 123

Query: 124 EYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNA 183
           EYLLMLDVDSL+WSFRKTA+L TPG  YGGWE P  ELRGHFVGHYLSASA MWAST N 
Sbjct: 124 EYLLMLDVDSLIWSFRKTAALSTPGTPYGGWEGPEVELRGHFVGHYLSASALMWASTQND 183

Query: 184 TIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQY 243
           T+K+KMS++V  LS CQ KIGTGYLSAFP+E FD FEA++PVWAPYYTIHKILAGLLDQ+
Sbjct: 184 TLKQKMSSLVAGLSACQEKIGTGYLSAFPSEFFDRFEAVQPVWAPYYTIHKILAGLLDQH 243

Query: 244 VLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDP 303
             A N QALKM TWMV+YFYNRVQ VIT Y+V RH+ S+NEETGGMNDVLYRLYSIT D 
Sbjct: 244 TFAGNPQALKMVTWMVDYFYNRVQNVITKYTVNRHYQSMNEETGGMNDVLYRLYSITGDS 303

Query: 304 KHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFM 363
           KHL+LAHLFDKPCFLG LA+QA+ ++  HANTHIPIV+GSQMRYE+TGDPLYK IGTFFM
Sbjct: 304 KHLVLAHLFDKPCFLGLLAVQANDIADLHANTHIPIVVGSQMRYEITGDPLYKQIGTFFM 363

Query: 364 DIVNASHSYATGGTSAREFWWDPKRLADTL-GSENEETCTTYNMLKVSRHLFRWTKEIAY 422
           D+VN+SHSYATGGTS REFW DPKR+AD L  +ENEE+CTTYNMLKVSRHLFRWTKE++Y
Sbjct: 364 DLVNSSHSYATGGTSVREFWSDPKRIADNLRTTENEESCTTYNMLKVSRHLFRWTKEVSY 423

Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
           ADYYERALTNGVLSIQRGT+PGVMIYMLPLG  VSKAR+ H WGT+F+SFWCCYGTGIES
Sbjct: 424 ADYYERALTNGVLSIQRGTDPGVMIYMLPLGFAVSKARTGHSWGTQFDSFWCCYGTGIES 483

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
           FSKLGDSIYFEEEG  P LYIIQYISSSF+WKSG ++LNQ V P  S DPYLR+T TFS 
Sbjct: 484 FSKLGDSIYFEEEGKDPTLYIIQYISSSFNWKSGKILLNQTVVPASSSDPYLRVTFTFSP 543

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
            +    LS+LN R+P WT  +GA+  LNGQ L LP PGN+LS T +WS +DKLT+QLPL+
Sbjct: 544 VEVTNTLSTLNFRLPSWTLLDGAKGILNGQTLSLPNPGNYLSITRQWSASDKLTLQLPLT 603

Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTS-GEWDIKTGTARSLSALISPIPPSFNAQL 661
           +RTEAI+DDRPEYAS+QAIL+GPYLLAGHT+ G+W++K G     +  I+PIP S+N+QL
Sbjct: 604 VRTEAIKDDRPEYASVQAILYGPYLLAGHTTGGDWNLKAGANN--ADWITPIPASYNSQL 661

Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGK 721
           V+F ++   STFV++NSNQS++M++ P  GTD AL ATFR++L+++S S FS L +   +
Sbjct: 662 VSFFRDFEGSTFVLANSNQSVSMQKLPEFGTDLALQATFRIVLEESS-SKFSKLADANDR 720

Query: 722 SVMLEPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGC 780
           SVMLEPFD PGM ++ QG    L+  +S +   S+ F LV GLD RNETVSLE+++ KGC
Sbjct: 721 SVMLEPFDLPGMNVIHQGAGKPLLTVDSSQGGPSAVFLLVPGLDGRNETVSLESQSNKGC 780

Query: 781 FVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAP 840
           +V SG++  P A +KL C ++S DA FN+AASF+   G+S+Y+PISFVAKGA RNFLL P
Sbjct: 781 YVYSGMS--PSAGVKLSCKSDS-DATFNQAASFVALQGLSQYNPISFVAKGANRNFLLQP 837

Query: 841 LLSFRDEAYTVYFNIQD 857
           LLSFRDE YTVYFNIQD
Sbjct: 838 LLSFRDEHYTVYFNIQD 854


>gi|356541181|ref|XP_003539059.1| PREDICTED: uncharacterized protein LOC100781521 [Glycine max]
          Length = 854

 Score = 1176 bits (3042), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 575/859 (66%), Positives = 684/859 (79%), Gaps = 12/859 (1%)

Query: 3   FGFVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAW 61
           F FV       G    K+CTN  P  SH FRYEL  S N TWK EV+ H+HLTPTD++ W
Sbjct: 4   FVFVFVAILLCGCVAAKECTN-IPTQSHTFRYELLMSKNATWKAEVMDHYHLTPTDETVW 62

Query: 62  SSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQT 121
           + L+P K L +Q ++  W ++YRKIKN G F     FLKEV L DV L + S+  RAQQT
Sbjct: 63  ADLLPRKFLSEQ-NQHDWGVMYRKIKNMGVFKSGEGFLKEVPLQDVRLHKDSIHARAQQT 121

Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH 181
           NLEYLLMLDVDSL+WSFRKTA L TPG  YGGWE P  ELRGHFVGHYLSASA MWAST 
Sbjct: 122 NLEYLLMLDVDSLIWSFRKTAGLSTPGTPYGGWEGPEVELRGHFVGHYLSASALMWASTQ 181

Query: 182 NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLD 241
           N T+K+KMS++V  LS CQ KIGTGYLSAFP+E FD FE ++PVWAPYYTIHKILAGLLD
Sbjct: 182 NDTLKQKMSSLVAGLSACQEKIGTGYLSAFPSEFFDRFETVQPVWAPYYTIHKILAGLLD 241

Query: 242 QYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITH 301
           Q+  A N QALKM TWMV+YFYNRVQ VIT Y+V RH+ SLNEETGGMNDVLYRLYSIT 
Sbjct: 242 QHTFAGNPQALKMVTWMVDYFYNRVQNVITKYTVNRHYESLNEETGGMNDVLYRLYSITG 301

Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTF 361
           D KHL+LAHLFDKPCFLG LA+QA+ +++FHANTHIP+V+GSQMRYE+TGDPLYK IGTF
Sbjct: 302 DSKHLVLAHLFDKPCFLGLLAMQANDIANFHANTHIPVVVGSQMRYEITGDPLYKQIGTF 361

Query: 362 FMDIVNASHSYATGGTSAREFWWDPKRLADTL-GSENEETCTTYNMLKVSRHLFRWTKEI 420
           FMD+VN+SHSYATGGTS  EFW DPKR+AD L  +ENEE+CTTYNMLKVSRHLFRWTKE+
Sbjct: 362 FMDLVNSSHSYATGGTSVSEFWSDPKRIADNLRTTENEESCTTYNMLKVSRHLFRWTKEV 421

Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
           +YADYYERALTNGVLSIQRGT+PGVMIYMLPLG  VSKAR+ H WGT+F+SFWCCYGTGI
Sbjct: 422 SYADYYERALTNGVLSIQRGTDPGVMIYMLPLGFAVSKARTGHSWGTQFDSFWCCYGTGI 481

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
           ESFSKLGDSIYFEEEG  P LYIIQYI SSF+WKSG ++LNQ V P+ S DPYLR+T TF
Sbjct: 482 ESFSKLGDSIYFEEEGKDPTLYIIQYIPSSFNWKSGKILLNQTVVPVASSDPYLRVTFTF 541

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
           S  +    LS+LN R+P WT  +GA+  LNGQ L LP PG +LS T +WS +DKLT+QLP
Sbjct: 542 SPVEVTNTLSTLNFRLPSWTLLDGAKGILNGQTLSLPNPGKYLSVTRQWSGSDKLTLQLP 601

Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS-GEWDIKTGTARSLSALISPIPPSFNA 659
           L++RTEAI+DDRPEYAS+QAIL+GPYLLAGHT+ G+WD+K G     +  I+PIP S+N+
Sbjct: 602 LTVRTEAIKDDRPEYASVQAILYGPYLLAGHTTGGDWDLKAGANN--ADWITPIPASYNS 659

Query: 660 QLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVI 719
           QLV+F ++   STFV++NSN+S++M++ P  GTD  L ATFR++LKD+S S FS+L +  
Sbjct: 660 QLVSFFRDFEGSTFVLTNSNKSVSMQKLPEYGTDLTLQATFRIVLKDSS-SKFSTLADAN 718

Query: 720 GKSVMLEPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRK 778
            +SVMLEPFDFPGM ++ QG    L++++S     SS F LV GLD RNETVSLE+++ K
Sbjct: 719 DRSVMLEPFDFPGMNVIHQGAGKPLLIADSSHGGPSSVFLLVPGLDGRNETVSLESQSNK 778

Query: 779 GCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLL 838
           GC+V SG++  P + +KL C ++S DA FN+A SF+   G+S+Y+PISFVAKG  RNFLL
Sbjct: 779 GCYVYSGMS--PSSGVKLSCKSDS-DATFNKATSFVALQGLSQYNPISFVAKGTNRNFLL 835

Query: 839 APLLSFRDEAYTVYFNIQD 857
            PLLSFRDE YTVYFNIQD
Sbjct: 836 QPLLSFRDEHYTVYFNIQD 854


>gi|449448754|ref|XP_004142130.1| PREDICTED: uncharacterized protein LOC101207833 [Cucumis sativus]
          Length = 868

 Score = 1171 bits (3030), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 571/840 (67%), Positives = 681/840 (81%), Gaps = 6/840 (0%)

Query: 19  KQCTNQ-SPYDSHAFRYELTST-NKTWKEEVLSHFHLTPTDDSAWSSLIPSKILGDQKDE 76
           K+CTN  +   SH FRYEL S+ N TWK+E+ SH+HLTPTDD AWS+L+P K+L  +++E
Sbjct: 28  KECTNTPTQLGSHTFRYELLSSGNVTWKKELFSHYHLTPTDDFAWSNLLPRKML-KEENE 86

Query: 77  VSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVW 136
            +W ++YR++KN  G  +PG  LKE+SLHDV LD +S+   AQ TNL+YLLMLDVD L+W
Sbjct: 87  YNWEMMYRQMKNKDGLRIPGGMLKEISLHDVRLDPNSLHGTAQTTNLKYLLMLDVDRLLW 146

Query: 137 SFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSL 196
           SFRKTA LPTPG+ Y GWE    ELRGHFVGHYLSASAQMWAST N+ +KEKMS +V  L
Sbjct: 147 SFRKTAGLPTPGEPYVGWEKSDCELRGHFVGHYLSASAQMWASTGNSVLKEKMSALVSGL 206

Query: 197 SECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMAT 256
           + CQ+K+GTGYLSAFP+E FD FEA++PVWAPYYTIHKILAGLLDQY  A N+QALKM T
Sbjct: 207 ATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAGLLDQYTFAGNSQALKMVT 266

Query: 257 WMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPC 316
           WMVEYFYNRVQ VI  Y+VERH+ SLNEETGGMNDVLYRLY IT + KHLLLAHLFDKPC
Sbjct: 267 WMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYRITGNTKHLLLAHLFDKPC 326

Query: 317 FLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG 376
           FLG LA+QA+ +S FH NTHIPIV+GSQMRYEVTGDPLYK I T+FMDIVN+SHSYATGG
Sbjct: 327 FLGLLAVQAEDISGFHVNTHIPIVVGSQMRYEVTGDPLYKEISTYFMDIVNSSHSYATGG 386

Query: 377 TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
           TS  EFW DPKRLAD LG+E EE+CTTYNMLKVSR+LF+WTKEIAYADYYERALTNGVLS
Sbjct: 387 TSVHEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTKEIAYADYYERALTNGVLS 446

Query: 437 IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
           IQRGT+PGVMIYMLPLG G SKA S HGWGT F SFWCCYGTGIESFSKLGDSIYFEEE 
Sbjct: 447 IQRGTDPGVMIYMLPLGSGSSKAISYHGWGTPFESFWCCYGTGIESFSKLGDSIYFEEEL 506

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
             P LY+IQYISSS DWKSG+V+LNQ VDPI S DP LRMTLTFS K      S++NLR+
Sbjct: 507 QTPTLYVIQYISSSLDWKSGNVLLNQTVDPIHSEDPKLRMTLTFSPKVGSVHSSTINLRI 566

Query: 557 PVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
           P WT ++GA+  LNGQ+L     GNF S T  WS  +KL+++LP++LRTEAI DDR EYA
Sbjct: 567 PSWTSASGAKVVLNGQSLGNNINGNFKSVTNSWSSGNKLSLELPINLRTEAIDDDRSEYA 626

Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMS 676
           S++AILFGPYLLA +++G+W+IKT  A SLS  I+ +P ++N  LVTF+Q SG ++F ++
Sbjct: 627 SVKAILFGPYLLAAYSNGDWEIKTQQADSLSDWITHVPSAYNTFLVTFSQASGKTSFALT 686

Query: 677 NSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLV- 735
           NSNQSITME++P  GTD+A+HATFRLI+ D S +  + L +VIGK VMLEPF FPGM++ 
Sbjct: 687 NSNQSITMEKYPGQGTDSAVHATFRLIIDDPS-AKVTELQDVIGKRVMLEPFSFPGMVLG 745

Query: 736 QQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLK 795
            +GK++ L ++++  E  SS F LV GLD +N TVSL + + +GCFV SGVN+E GA LK
Sbjct: 746 NKGKDERLEIADANSEGHSSDFYLVEGLDGKNGTVSLASIDNEGCFVYSGVNYESGAQLK 805

Query: 796 LLCSTE-SLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFN 854
           L C ++ SLD GF+ A+SF++E G S+YHPISFV KG  RNFLLAPLLSF DE+YTVYFN
Sbjct: 806 LSCKSKLSLDDGFDEASSFLLESGASQYHPISFVTKGMTRNFLLAPLLSFVDESYTVYFN 865


>gi|15239944|ref|NP_196799.1| uncharacterized protein [Arabidopsis thaliana]
 gi|7630051|emb|CAB88259.1| putative protein [Arabidopsis thaliana]
 gi|26451123|dbj|BAC42665.1| unknown protein [Arabidopsis thaliana]
 gi|332004451|gb|AED91834.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 861

 Score = 1130 bits (2923), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 546/846 (64%), Positives = 658/846 (77%), Gaps = 10/846 (1%)

Query: 15  LALGKQCTNQ-SPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWSSLIPSKILGD 72
           +++ K+CTN  +   SH FR EL  S N+T K E+ SH+HLTP DDSAWSSL+P K+L +
Sbjct: 21  VSVAKECTNTPTQLSSHTFRSELLQSKNETLKTELFSHYHLTPADDSAWSSLLPRKMLKE 80

Query: 73  QKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVD 132
           + DE +W +LYRK K+       GNFLK+VSLHDV LD  S  WRAQQTNLEYLLMLDVD
Sbjct: 81  EADEFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPDSFHWRAQQTNLEYLLMLDVD 137

Query: 133 SLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTV 192
            L WSFRK A L  PG  YGGWE P SELRGHFVGHYLSA+A MWASTHN T+KEKMS +
Sbjct: 138 GLAWSFRKEAGLDAPGDYYGGWERPDSELRGHFVGHYLSATAYMWASTHNDTLKEKMSAL 197

Query: 193 VFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQAL 252
           V +LSECQ K GTGYLSAFP+  FD FEA+ PVWAPYYTIHKILAGL+DQY LA N+QAL
Sbjct: 198 VSALSECQQKSGTGYLSAFPSSFFDRFEAITPVWAPYYTIHKILAGLVDQYKLAGNSQAL 257

Query: 253 KMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF 312
           KMAT M +YFY RV+ VI  YSVERHW SLNEETGGMNDVLY+LYSIT D K+LLLAHLF
Sbjct: 258 KMATGMADYFYGRVRNVIRKYSVERHWQSLNEETGGMNDVLYQLYSITGDSKYLLLAHLF 317

Query: 313 DKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSY 372
           DKPCFLG LA+QAD +S FHANTHIPIV+GSQ RYE+TGD L+K I  FFMDI NASHSY
Sbjct: 318 DKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEISMFFMDIFNASHSY 377

Query: 373 ATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
           ATGGTS  EFW DPKR+A  L +ENEE+CTTYNMLKVSR+LFRWTKE++YADYYERALTN
Sbjct: 378 ATGGTSVSEFWQDPKRMATALQTENEESCTTYNMLKVSRNLFRWTKEVSYADYYERALTN 437

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
           GVL IQRGT+PG+MIYMLPLG+GVSKA + HGWGT ++SFWCCYGTGIESFSKLGDSIYF
Sbjct: 438 GVLGIQRGTQPGLMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYGTGIESFSKLGDSIYF 497

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF-SSKQEVGQLSS 551
           +E+G  P LY+ QYISSS DWKS  + ++QKV+P+VSWDPY+R+T T  SSK  V + S+
Sbjct: 498 QEDGATPALYVTQYISSSLDWKSAGLSISQKVNPVVSWDPYMRVTFTLSSSKVGVAKEST 557

Query: 552 LNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDD 611
           LNLR+PVWT S GA+ SLNG+ L +P  GNFLS  ++W   D++T++LP+S+RTEAI+DD
Sbjct: 558 LNLRIPVWTNSIGAKVSLNGRPLNVPTSGNFLSIKQKWKSGDQVTMELPMSIRTEAIKDD 617

Query: 612 RPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNS 671
           RPEYAS+QAIL+GPYLLAGHTS +W I T         I+PIP + N+ LVT +Q+SGN 
Sbjct: 618 RPEYASLQAILYGPYLLAGHTSRDWSITTQAKP--GKWITPIPETQNSYLVTLSQQSGNV 675

Query: 672 TFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFP 731
           ++V SNSNQ+ITM   P  GT  A+ ATFRL+  D S    S    +IG+ VMLEPFDFP
Sbjct: 676 SYVFSNSNQTITMRVSPEPGTQDAVAATFRLV-TDNSKPRISGPEGLIGRLVMLEPFDFP 734

Query: 732 GMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEP 790
           GM+V+Q  +  L V + SP + G+S FRLV+GLD +  +VSL  E++KGCFV S    + 
Sbjct: 735 GMIVKQATDSSLTVQASSPSDKGASSFRLVSGLDGKLGSVSLRLESKKGCFVYSDQTLKQ 794

Query: 791 GASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYT 850
           G  L+L C +++ D  F  AASF ++ G+ +Y+P+SFV  G +RNF+L+PL S RDE Y 
Sbjct: 795 GTKLRLECGSDATDEKFKEAASFSLKTGMHQYNPMSFVMSGTQRNFVLSPLFSLRDETYN 854

Query: 851 VYFNIQ 856
           VYF++Q
Sbjct: 855 VYFSVQ 860


>gi|356557388|ref|XP_003546998.1| PREDICTED: uncharacterized protein LOC100815634 [Glycine max]
          Length = 841

 Score = 1119 bits (2895), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 556/862 (64%), Positives = 670/862 (77%), Gaps = 28/862 (3%)

Query: 1   MNFGFVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDS 59
           M F F       +G A GK+CTN     SH FRY+L TSTN+TW   ++SH HLT  DD 
Sbjct: 1   MAFLFAFVAIVVWGCAAGKECTNNDA-QSHTFRYQLSTSTNETW--NIMSHNHLTTKDDH 57

Query: 60  AWSSLIPSKILGDQKDEVSWAL-LYRKIKNPGGF---DLPGNFLKEVSLHDVWLDQSSVL 115
             + L+P K+L   K+E    L + RKI+  G       P  FLK VSLHDV L+Q S+ 
Sbjct: 58  LLADLLPRKLL---KEENQRNLDMLRKIEKVGVLKPPQQPQGFLKPVSLHDVRLNQGSIH 114

Query: 116 WRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQ 175
            +AQ+TNLEYLLML+VD L+WSFRKTA LPTPG  YGGWE+P  ELRGHFVGHYLSASA 
Sbjct: 115 AQAQRTNLEYLLMLNVDRLLWSFRKTAGLPTPGTPYGGWEDPKMELRGHFVGHYLSASAL 174

Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
           MWASTHN ++K+KMS +V +LS CQ KIGTGYLSAFP+E FD  EA K VWAPYYT HKI
Sbjct: 175 MWASTHNDSLKKKMSALVANLSICQEKIGTGYLSAFPSEFFDRLEATKYVWAPYYTTHKI 234

Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYR 295
           LAGLLDQ+ +A+N QALKM TWMV+YFYNRVQ VIT +S+ RH+ SLNEETGGMNDVLY+
Sbjct: 235 LAGLLDQHSIAENPQALKMVTWMVDYFYNRVQNVITKFSISRHYQSLNEETGGMNDVLYK 294

Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY 355
           LYSIT DP+HLLLAHLFDKPCFLG LA++A+ ++HFHANTHIP+++GSQMRYEVTGDPLY
Sbjct: 295 LYSITGDPRHLLLAHLFDKPCFLGLLAVKANDIAHFHANTHIPVIVGSQMRYEVTGDPLY 354

Query: 356 KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLF 414
           K IGT FMD+VN+SH+YATGGTS  EFW DPKR+ADTL S +NEE+CTTYNMLKVSRHLF
Sbjct: 355 KEIGTLFMDLVNSSHTYATGGTSVNEFWSDPKRMADTLESTDNEESCTTYNMLKVSRHLF 414

Query: 415 RWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC 474
            WTK+++YADYYERALTNGVLSIQRGTEPGVMIYMLP GRGVSKA++  GWGTKF+SFWC
Sbjct: 415 TWTKKVSYADYYERALTNGVLSIQRGTEPGVMIYMLPQGRGVSKAKTYFGWGTKFDSFWC 474

Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
           CYGTGIESFSKLGDSIYFEE+G  P LYIIQYISS F+WKSG ++LNQ V P  SWDP+L
Sbjct: 475 CYGTGIESFSKLGDSIYFEEQGENPTLYIIQYISSLFNWKSGQIILNQTVVPPASWDPFL 534

Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
           R++ TFS  ++ G LS+LN R+P   + NG +  LN + L LP PGNFLS T +W+  DK
Sbjct: 535 RVSFTFSPAKKTGALSTLNFRLPTRMHKNGEKGILNNETLTLPGPGNFLSITRKWNAGDK 594

Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIP 654
           L++QLPL+LR EAI+DDR +YASIQAIL+GPYLLAGHT+G+W+IKT    S++  I+PIP
Sbjct: 595 LSLQLPLTLRAEAIKDDRTKYASIQAILYGPYLLAGHTTGDWNIKTAANASIADWITPIP 654

Query: 655 PSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSS 714
            S+N  L  F+Q   NSTFV++NSNQS+ +++ P  GTD+AL ATFR+I +  S + F++
Sbjct: 655 ASYNIHLFYFSQAFANSTFVLTNSNQSLAVKKVPEPGTDSALGATFRVI-QGKSSTKFTT 713

Query: 715 LNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEA 774
           L + IGKSVMLEPFD PGM        + + S  P    SS F +V GLD R ET+SLE+
Sbjct: 714 LTDAIGKSVMLEPFDHPGM--------QALPSGGP----SSVFVVVPGLDGRKETISLES 761

Query: 775 ENRKGCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARR 834
           ++  GCFV SG+    G  +KL C T S DA FN+AASF+ + GIS+Y+PISFVAKG  R
Sbjct: 762 KSHNGCFVHSGL--RSGRGVKLSCKTTS-DATFNQAASFIAKRGISKYNPISFVAKGENR 818

Query: 835 NFLLAPLLSFRDEAYTVYFNIQ 856
           NFLL PLL+FRDE+YTVYFNI+
Sbjct: 819 NFLLEPLLAFRDESYTVYFNIK 840


>gi|297746368|emb|CBI16424.3| unnamed protein product [Vitis vinifera]
          Length = 741

 Score = 1117 bits (2889), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 528/732 (72%), Positives = 616/732 (84%), Gaps = 3/732 (0%)

Query: 128 MLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKE 187
           MLD D LVWSFR+TA LPTP   YGGWE+P  ELRGHFVGHYLSASAQMWASTHN ++KE
Sbjct: 1   MLDADRLVWSFRRTAGLPTPCSPYGGWESPDGELRGHFVGHYLSASAQMWASTHNESLKE 60

Query: 188 KMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLAD 247
           KMS VV +L ECQ K+GTGYLSAFP+ELFD FEAL+ VWAPYYTIHKILAGLLDQY L  
Sbjct: 61  KMSAVVCALGECQKKMGTGYLSAFPSELFDRFEALEEVWAPYYTIHKILAGLLDQYTLGG 120

Query: 248 NAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLL 307
           NAQALKM TWMVEYFYNRVQ VI+ YS+ERHW SLNEETGGMND LY LY IT D KH +
Sbjct: 121 NAQALKMVTWMVEYFYNRVQNVISSYSIERHWLSLNEETGGMNDFLYNLYRITGDQKHFV 180

Query: 308 LAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVN 367
           LAHLFDKPCFLG LA+QAD +S FHANTHIPIV+G+QMRYE+TGDPLYK IG FF+D VN
Sbjct: 181 LAHLFDKPCFLGLLAMQADDISGFHANTHIPIVVGAQMRYEITGDPLYKTIGAFFIDTVN 240

Query: 368 ASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           +SHSYATGGTS  EFW DPKR+A TL +EN E+CTTYNMLKVSR+LFRWTKE+AYADYYE
Sbjct: 241 SSHSYATGGTSVDEFWSDPKRMATTLQTENAESCTTYNMLKVSRNLFRWTKEVAYADYYE 300

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLG 487
           RALTNG+LSIQRGT+PGVM+YMLPLG G SKARS HGWGTKF+SFWCCYGTGIESFSKLG
Sbjct: 301 RALTNGILSIQRGTDPGVMLYMLPLGHGNSKARSYHGWGTKFHSFWCCYGTGIESFSKLG 360

Query: 488 DSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK--QE 545
           DSIYFEEEG VPGLYIIQYISSS DWKSG VVLNQKVD +VSWDPYLR+TLTFS K  Q 
Sbjct: 361 DSIYFEEEGEVPGLYIIQYISSSLDWKSGQVVLNQKVDTVVSWDPYLRITLTFSPKKMQG 420

Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
            GQ S++NLR+PVW YS+GA+A++N Q LP+P P +FLS   +WS +DKLT+QLP++LRT
Sbjct: 421 AGQSSAINLRIPVWAYSSGAKAAVNAQALPVPAPNSFLSFRRKWSPDDKLTLQLPIALRT 480

Query: 606 EAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFT 665
           EAI+DDRP+YA +QAIL+GPYLL G T+ +WDI+T  A SLS  I+PIP S N+ L++ +
Sbjct: 481 EAIKDDRPKYACLQAILYGPYLLVGLTNNDWDIQTDLAASLSDWITPIPASHNSHLISLS 540

Query: 666 QESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVML 725
           QESGNS+F  +NSNQS+TME +P SGTDA+L+ATFRLIL+D++ S  SS  + IGK VML
Sbjct: 541 QESGNSSFAFTNSNQSLTMERYPESGTDASLNATFRLILEDSTSSKISSPKDAIGKFVML 600

Query: 726 EPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSS 784
           EP +FPGM +VQ+G  + L ++ S   +GSS F LVAGLD ++ TVSLE++ +KGCFV S
Sbjct: 601 EPINFPGMAVVQRGTNESLGITNSASVVGSSLFHLVAGLDGKDGTVSLESKTQKGCFVYS 660

Query: 785 GVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSF 844
            VN++ G+++KL C   S D  FN+A SF ++ GISEYHPISFVAKG RR++LLAPLLS 
Sbjct: 661 DVNYDSGSAIKLKCKLASSDVVFNQATSFTLKHGISEYHPISFVAKGLRRDYLLAPLLSL 720

Query: 845 RDEAYTVYFNIQ 856
           RDE+YTVYFNIQ
Sbjct: 721 RDESYTVYFNIQ 732


>gi|297807309|ref|XP_002871538.1| hypothetical protein ARALYDRAFT_350453 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317375|gb|EFH47797.1| hypothetical protein ARALYDRAFT_350453 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 860

 Score = 1116 bits (2887), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 542/860 (63%), Positives = 658/860 (76%), Gaps = 11/860 (1%)

Query: 1   MNFGFVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDD 58
           +    +LF  F   + + K+CT+  +   SH  R EL  S N+T K E+ SH+HLTPTDD
Sbjct: 7   ITIALLLFTSFVL-VCVAKECTDIPTKLSSHTLRSELLQSQNETLKTELSSHYHLTPTDD 65

Query: 59  SAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRA 118
           +AWS+L+P K+L ++ D+ +W +LYRK K+       GNFLK+VSLHDV LD SS  WRA
Sbjct: 66  AAWSTLLPRKMLKEETDDFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPSSFHWRA 122

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
           QQTNLEYLLML+VD L +SFRK A L  PG  YGGWE P SELRGHFVGHYLSA+A MWA
Sbjct: 123 QQTNLEYLLMLNVDGLAYSFRKVAGLDAPGVPYGGWEKPDSELRGHFVGHYLSATAYMWA 182

Query: 179 STHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAG 238
           STHN T+K KMS +V +L+ECQ K GTGYLSAFP+  FD FEA+  VWAPYYTIHKILAG
Sbjct: 183 STHNDTLKTKMSALVSALAECQQKSGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAG 242

Query: 239 LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYS 298
           L+DQY LA N QALKMAT M +YFY RVQ VI  YSVERHW SLNEETGGMNDVLY+LYS
Sbjct: 243 LVDQYKLAGNTQALKMATGMADYFYGRVQNVIRKYSVERHWLSLNEETGGMNDVLYQLYS 302

Query: 299 ITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLI 358
           IT D K+L LAHLFDKPCFLG LA+QAD +S FHANTHIPIV+GSQ RYE+TGD L+K I
Sbjct: 303 ITRDSKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEI 362

Query: 359 GTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTK 418
             FFMDIVNASHSYATGGTS +EFW DPKR+A TL +ENEE+CTTYNMLKVSR+LFRWTK
Sbjct: 363 SMFFMDIVNASHSYATGGTSVKEFWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWTK 422

Query: 419 EIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGT 478
           E++YADYYERALTNGVL IQRGT+PG MIYMLPLG+GVSKA + HGWGT ++SFWCCYGT
Sbjct: 423 EVSYADYYERALTNGVLGIQRGTQPGRMIYMLPLGQGVSKAVTYHGWGTPYDSFWCCYGT 482

Query: 479 GIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL 538
           GIESFSKLGDSIYF+E+G  P LY+ QYISSS DWKS  ++L+QKV+P+VSWDPY+R+T 
Sbjct: 483 GIESFSKLGDSIYFQEDGASPALYVTQYISSSLDWKSAGLLLSQKVNPVVSWDPYMRVTF 542

Query: 539 TF-SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTI 597
           T  SSK  V + S+LNLR+PVWT S GA+ SLNG+ L +P  GNFLS  + W   D++T+
Sbjct: 543 TLSSSKVGVAKKSTLNLRIPVWTNSIGAKVSLNGKPLKVPTSGNFLSIKQNWKSGDQVTM 602

Query: 598 QLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSF 657
           +LP+S+RTEAI+DDRPEYAS+QAIL+GPYLLAGHTS +W I   T       I+PIP ++
Sbjct: 603 ELPMSIRTEAIKDDRPEYASLQAILYGPYLLAGHTSRDWSIT--TQAKAGNWITPIPETY 660

Query: 658 NAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNN 717
           N+ LVT +Q+SGN ++V+SN+NQ+ITM   P  GT  A+ ATFRL+  D S    S    
Sbjct: 661 NSHLVTLSQQSGNISYVLSNTNQTITMRVSPELGTQDAVAATFRLV-TDNSKPRISGPEA 719

Query: 718 VIGKSVMLEPFDFPGMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAEN 776
           +IG  VMLEPFDFPGM+V+Q  +  L V + SP + G+S FRLV+G+D +  +VSL  E+
Sbjct: 720 LIGSLVMLEPFDFPGMIVKQATDSSLTVQASSPSDKGASSFRLVSGVDGKPGSVSLRLES 779

Query: 777 RKGCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNF 836
             GCFV S    + G  LKL C   + D  F  AASF +  G+++Y+P+SFV  G +RNF
Sbjct: 780 NNGCFVYSDQTLKQGTKLKLECGPVATDEKFKEAASFKLNTGMNQYNPMSFVMSGTQRNF 839

Query: 837 LLAPLLSFRDEAYTVYFNIQ 856
           +L+PL S RDE Y VYF++Q
Sbjct: 840 VLSPLFSLRDETYNVYFSVQ 859


>gi|297811349|ref|XP_002873558.1| hypothetical protein ARALYDRAFT_488069 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319395|gb|EFH49817.1| hypothetical protein ARALYDRAFT_488069 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 860

 Score = 1116 bits (2887), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 542/856 (63%), Positives = 657/856 (76%), Gaps = 14/856 (1%)

Query: 5   FVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
           +  F   C    + K+CT+  +   SH    EL  S NKT K E+ SH+HLTPTDD+AWS
Sbjct: 14  YTSFLLVC----VAKECTDIPTKLSSHTLNSELLQSHNKTLKTELFSHYHLTPTDDAAWS 69

Query: 63  SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
           +L+P K+L ++ DE +W +LYRK K+       GNFLK+VSLHDV LD +S  WRAQQTN
Sbjct: 70  TLLPRKMLKEETDEFAWTMLYRKFKDSNSV---GNFLKDVSLHDVRLDPNSFHWRAQQTN 126

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
           LEYLLMLDVD L +SFRK A L   G  YGGWE P SELRGHFVGHYLSA+A MWASTHN
Sbjct: 127 LEYLLMLDVDGLAYSFRKVAGLDASGVPYGGWEKPDSELRGHFVGHYLSATAHMWASTHN 186

Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
            T+K KMS +V +L+ECQ K GTGYLSAFP+  FD FEA+  VWAPYYTIHKILAGL+DQ
Sbjct: 187 DTLKAKMSALVSALAECQQKSGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAGLVDQ 246

Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
           Y LA N QALKMAT M +YFY RV+ VIT YSVERH+ SLNEETGGMNDVLY+LYSIT D
Sbjct: 247 YKLAGNIQALKMATGMADYFYGRVRNVITKYSVERHYQSLNEETGGMNDVLYQLYSITRD 306

Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
            K+L LAHLFDKPCFLG LA+QAD +S FHANTHIPIV+GSQ RYE+TGD L+K I  FF
Sbjct: 307 SKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEISMFF 366

Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
           MDI+NASHSYATGGTS REFW DPKR+A TL +ENEE+CTTYNMLKVSR+LFRWTKE++Y
Sbjct: 367 MDIINASHSYATGGTSVREFWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWTKEVSY 426

Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
           ADYYERALTNGVL IQRGT+PG MIYMLPLG+GVSKA + HGWGT ++SFWCCYGTGIES
Sbjct: 427 ADYYERALTNGVLGIQRGTQPGRMIYMLPLGQGVSKAVTYHGWGTPYDSFWCCYGTGIES 486

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF-S 541
           FSKLGDSIYF+E+G  P LY+ QYISSS DWKS  ++L+QKV+P+VSWDPY+R+T T  S
Sbjct: 487 FSKLGDSIYFQEDGASPALYVTQYISSSLDWKSAGLLLSQKVNPVVSWDPYMRVTFTLSS 546

Query: 542 SKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
           SK  V + S+LNLR+PVWT S GA+ SLNG+ L +P  GNFLS  + W   D++T++LP+
Sbjct: 547 SKVGVAKKSTLNLRIPVWTNSIGAKVSLNGKPLKVPTSGNFLSIKQNWKSGDQVTMELPM 606

Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQL 661
           S+RTEAI+DDRPEYAS+QAIL+GPYLLAGHTS +W I   T       I+PIP ++N+ L
Sbjct: 607 SIRTEAIKDDRPEYASLQAILYGPYLLAGHTSRDWSIT--TQAKAGNWITPIPETYNSHL 664

Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGK 721
           VT +Q+SGN ++V+SN+NQ+ITM   P  GT  A+ ATFRL+  D S    S L  +IG 
Sbjct: 665 VTLSQQSGNISYVLSNTNQTITMRVSPELGTQDAVAATFRLV-TDNSKPQISGLEALIGS 723

Query: 722 SVMLEPFDFPGMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGC 780
            VMLEPFDFPGM+V+Q  +  L V + SP + G+S FRLV+G+D +  +VSL  E+  GC
Sbjct: 724 LVMLEPFDFPGMIVKQTTDSSLTVQASSPSDKGASSFRLVSGVDGKPGSVSLRLESNNGC 783

Query: 781 FVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAP 840
           FV S    + G  LKL C   + D  F +AASF + IG+++Y+P+SFV  G +RNF+L+P
Sbjct: 784 FVYSDQTLKQGTKLKLECGPVATDEKFKQAASFKLNIGMNQYNPMSFVMSGTQRNFVLSP 843

Query: 841 LLSFRDEAYTVYFNIQ 856
           L S RDE Y VYF++Q
Sbjct: 844 LFSLRDETYNVYFSVQ 859


>gi|297746357|emb|CBI16413.3| unnamed protein product [Vitis vinifera]
          Length = 767

 Score = 1113 bits (2878), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 536/720 (74%), Positives = 608/720 (84%), Gaps = 5/720 (0%)

Query: 5   FVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
            V+F F   G  LGK+CTN  +   SH+FRYEL  S N++WK E+  H+HL  TDDSAWS
Sbjct: 11  IVVFAFVLCGCVLGKECTNVPTQLSSHSFRYELLASNNESWKAEMFQHYHLIHTDDSAWS 70

Query: 63  SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
           +L+P K+L ++ DE SWA++YR +KN  G +   NFLKE+SLHDV LD  S+  RAQQTN
Sbjct: 71  NLLPRKLLREE-DEFSWAMMYRNMKNYDGSN--SNFLKEMSLHDVRLDSDSLHGRAQQTN 127

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
           L+YLL+LDVD LVWSFRKTA L TPG  YGGWE P  ELRGHFVGHY+SASAQMWASTHN
Sbjct: 128 LDYLLILDVDRLVWSFRKTAGLSTPGLPYGGWEAPNVELRGHFVGHYMSASAQMWASTHN 187

Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
            T+KEKMS VV +L+ CQ K+GTGYLSAFP+ELFD FEA+KPVWAPYYTIHKILAGLLDQ
Sbjct: 188 DTLKEKMSAVVSALATCQEKMGTGYLSAFPSELFDRFEAIKPVWAPYYTIHKILAGLLDQ 247

Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
           Y  A N+QALKM TWMVE+FY RVQ VITMYS+ERHW SLNEETGGMNDVLYRLYSIT D
Sbjct: 248 YTFAGNSQALKMMTWMVEHFYKRVQNVITMYSLERHWLSLNEETGGMNDVLYRLYSITGD 307

Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
            KHL+LAHLFDKPCFLG LA+QAD +S FHANTHIP+VIGSQMRYEVTGDPLYK IGTFF
Sbjct: 308 QKHLVLAHLFDKPCFLGLLAVQADSISGFHANTHIPVVIGSQMRYEVTGDPLYKAIGTFF 367

Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
           MDIVN+SHSYATGGTS  EFW DPKRLA TL  ENEE+CTTYNMLKVSRHLFRWTKE+ Y
Sbjct: 368 MDIVNSSHSYATGGTSVGEFWSDPKRLASTLQRENEESCTTYNMLKVSRHLFRWTKEVVY 427

Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
           ADYYERALTNGVLSIQRGT+PGVMIYMLPLGRG SKARS HGWGTKF+SFWCCYGTGIES
Sbjct: 428 ADYYERALTNGVLSIQRGTDPGVMIYMLPLGRGDSKARSYHGWGTKFDSFWCCYGTGIES 487

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
           FSKLGDSIYFEEEG  P +YIIQYISSS DWKSG +VLNQKVDP+VSWDPYLR TLTF+ 
Sbjct: 488 FSKLGDSIYFEEEGKSPEVYIIQYISSSLDWKSGQIVLNQKVDPVVSWDPYLRTTLTFTP 547

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
           K+  GQ S++NLR+PVW  S+GA+AS+N Q+LP+P P +FLS T  WS  DKLT+QLP+ 
Sbjct: 548 KEGAGQSSTINLRIPVWASSSGAKASINAQDLPVPAPSSFLSLTRNWSPGDKLTLQLPIR 607

Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
           LRTEAI+DDRP+YASIQAIL+GPYLLAG TS +WDIKTG+A SLS  I+PIP S N++LV
Sbjct: 608 LRTEAIKDDRPKYASIQAILYGPYLLAGLTSDDWDIKTGSATSLSDWITPIPASDNSRLV 667

Query: 663 TFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKS 722
           + +QESGNS+FV SNSNQSITME+FP  GTDA+LHATFRL+LKDA+     S  + IGKS
Sbjct: 668 SLSQESGNSSFVFSNSNQSITMEKFPEEGTDASLHATFRLVLKDATSLKVLSPKDAIGKS 727



 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/112 (41%), Positives = 62/112 (55%), Gaps = 19/112 (16%)

Query: 750 KEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNF----EPGASLKLLCSTESLDA 805
           +E G+S F         N+++++E    +G   S    F    +   SLK+L   +++  
Sbjct: 671 QESGNSSFVF----SNSNQSITMEKFPEEGTDASLHATFRLVLKDATSLKVLSPKDAIGK 726

Query: 806 GFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNIQD 857
                       GIS+YHPISFVAKG +RNFLL PLL  RDE+YTVYFNIQD
Sbjct: 727 S-----------GISQYHPISFVAKGMKRNFLLTPLLGLRDESYTVYFNIQD 767


>gi|297807305|ref|XP_002871536.1| hypothetical protein ARALYDRAFT_909245 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317373|gb|EFH47795.1| hypothetical protein ARALYDRAFT_909245 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 862

 Score = 1109 bits (2869), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 542/859 (63%), Positives = 660/859 (76%), Gaps = 13/859 (1%)

Query: 5   FVLFFFFCFGL-ALGKQCTNQ-SPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAW 61
            VL  +  F L  + K+CTN  +   SH FR EL  S N+T K E+ SH+HLTPTDD+AW
Sbjct: 9   IVLLLYTSFVLVCVAKECTNTPTQLSSHTFRSELLQSKNETLKTELFSHYHLTPTDDAAW 68

Query: 62  SSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQT 121
           S+L+P K+L ++ DE +W +LYR  K+       GNFLKEVSLHDV LD +S   RAQQT
Sbjct: 69  STLLPRKMLKEEADEFAWTMLYRTFKDSNS---SGNFLKEVSLHDVRLDPNSFHGRAQQT 125

Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH 181
           NLEYLLMLDVD L WSFRK A L  PG  YGGWE P SELRGHFVGHYLSA+A MWASTH
Sbjct: 126 NLEYLLMLDVDGLAWSFRKEAGLDAPGDHYGGWEKPDSELRGHFVGHYLSATAYMWASTH 185

Query: 182 NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLD 241
           N T+KEKMS +V +LSECQ K GTGYLSAFP+  FD FEA+ PVWAPYYTIHKI+AGL+D
Sbjct: 186 NDTLKEKMSALVSALSECQQKSGTGYLSAFPSSFFDRFEAITPVWAPYYTIHKIIAGLVD 245

Query: 242 QYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITH 301
           QY LA N+QAL+MAT M +YFY RV+ VI  YSVERHW SLNEETGGMND+LY+LYSIT 
Sbjct: 246 QYKLAGNSQALQMATGMADYFYGRVRNVIRKYSVERHWQSLNEETGGMNDILYQLYSITG 305

Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTF 361
           D K+LLLAHLFDKPCFLG LA+QAD +S FH+NTHIPIV+GSQ RYE+TGDPL+K I  F
Sbjct: 306 DSKYLLLAHLFDKPCFLGVLAIQADDISGFHSNTHIPIVVGSQQRYEITGDPLHKEISIF 365

Query: 362 FMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIA 421
           FMDIVNASHSYATGGTS  EFW +PKR+A TL +ENEE+CTTYNMLKVSR+LFRWTKE++
Sbjct: 366 FMDIVNASHSYATGGTSVSEFWQNPKRMATTLQTENEESCTTYNMLKVSRNLFRWTKEVS 425

Query: 422 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
           YADYYERALTNGVL IQRGT+PG+MIYMLPLG+GVSKA + HGWGT ++SFWCCYGTGIE
Sbjct: 426 YADYYERALTNGVLGIQRGTQPGLMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYGTGIE 485

Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF- 540
           SFSKLGDSIYF+E+   P LY+ QYISSS DWKS  + L+QKV+P+VSWDPY+R+T +F 
Sbjct: 486 SFSKLGDSIYFQEDDVSPALYVTQYISSSLDWKSAGLSLSQKVNPVVSWDPYMRVTFSFS 545

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP--PPGNFLSATERWSYNDKLTIQ 598
           SSK  + + S+LNLR+PVWT S GA+ SLNGQ+L +P     NFLS  + W   D+LT++
Sbjct: 546 SSKGGMAKESTLNLRIPVWTNSVGAKISLNGQSLKVPNFRTRNFLSIKQNWKSGDQLTME 605

Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFN 658
           LPLS+RTEAI+DDR EY+S+QAIL+GPYLLAGHTS +W I   T       I+PIP + N
Sbjct: 606 LPLSIRTEAIKDDRQEYSSLQAILYGPYLLAGHTSRDWSIT--TQAKAGKWITPIPETQN 663

Query: 659 AQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNV 718
           + LVT +Q+SG+ ++V SNSNQ+ITM   P  GT  A+ ATFRL+  D S    S    +
Sbjct: 664 SYLVTLSQQSGDISYVFSNSNQTITMRVSPEPGTQDAVAATFRLV-TDNSKPRISGPEAL 722

Query: 719 IGKSVMLEPFDFPGMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAENR 777
           IG  V LEPFDFPGM+V+Q  +  L V + SP + G+S FRLV+G+D +  +VSL  E++
Sbjct: 723 IGSLVKLEPFDFPGMIVKQATDSSLTVQASSPSDKGASSFRLVSGVDGKPGSVSLRLESK 782

Query: 778 KGCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFL 837
           KGCFV S    + G  L+L C + + D  F  AASF ++ G+++Y+P+SFV  G +RNF+
Sbjct: 783 KGCFVYSDQTLKQGTKLRLECGSAATDEKFKEAASFKLKTGMNQYNPMSFVMSGTQRNFV 842

Query: 838 LAPLLSFRDEAYTVYFNIQ 856
           L+PL S RDE Y VYF++Q
Sbjct: 843 LSPLFSLRDETYNVYFSVQ 861


>gi|30684197|ref|NP_196800.2| uncharacterized protein [Arabidopsis thaliana]
 gi|28393685|gb|AAO42255.1| unknown protein [Arabidopsis thaliana]
 gi|332004452|gb|AED91835.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 865

 Score = 1107 bits (2862), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 543/856 (63%), Positives = 657/856 (76%), Gaps = 14/856 (1%)

Query: 5   FVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
           +  F   C    L K+CT+  +   SH  R EL  S N   K E  SH+HLTPTDDSAWS
Sbjct: 19  YTSFLLVC----LAKECTDIPTKLSSHTLRSELLQSQNANLKSEEFSHYHLTPTDDSAWS 74

Query: 63  SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
           +L+P K+L ++ D+ +W +LYRK K+       GNFLK+VSLHDV LD SS  WRAQQTN
Sbjct: 75  TLLPRKMLKEETDDFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPSSFHWRAQQTN 131

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
           LEYLLMLDVD L ++FRK A L  PG  YGGWE P SELRGHFVGHYLSA+A MWASTHN
Sbjct: 132 LEYLLMLDVDGLAYNFRKEAGLNAPGVPYGGWEKPDSELRGHFVGHYLSATAYMWASTHN 191

Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
            T+K KM+ +V +L+ECQ K GTGYLSAFP+  FD FEA+  VWAPYYTIHKILAGL+DQ
Sbjct: 192 ETLKAKMTALVSALAECQQKYGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAGLVDQ 251

Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
           Y LA N QALKMAT M +YFY RVQ VI  YSVERHW SLNEETGGMNDVLY+LYSIT D
Sbjct: 252 YKLAGNTQALKMATGMADYFYGRVQNVIKKYSVERHWLSLNEETGGMNDVLYQLYSITRD 311

Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
            K+L LAHLFDKPCFLG LA+QAD +S FHANTHIPIV+GSQ RYE+TGD L+K I  FF
Sbjct: 312 SKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEIPMFF 371

Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
           MDIVNASHSYATGGTS +EFW DPKR+A TL +ENEE+CTTYNMLKVSR+LFRWTKE++Y
Sbjct: 372 MDIVNASHSYATGGTSVKEFWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWTKEVSY 431

Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
           ADYYERALTNGVL IQRGT+PG MIYMLPLG+GVSKA + HGWGT ++SFWCCYGTGIES
Sbjct: 432 ADYYERALTNGVLGIQRGTDPGRMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYGTGIES 491

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF-S 541
           FSKLGDSIYF+E+G  P LY+ QYISSS DWKS  + ++QKV+P+VSWDPY+R+T T  S
Sbjct: 492 FSKLGDSIYFQEDGATPALYVTQYISSSLDWKSAGLSISQKVNPVVSWDPYMRVTFTLSS 551

Query: 542 SKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
           SK  V + S+LNLR+PVWT S GA+ SLNG+ L +P  GNFLS  ++W   D++T++LP+
Sbjct: 552 SKVGVAKESTLNLRIPVWTNSIGAKVSLNGRPLNVPTSGNFLSIKQKWKSGDQVTMELPM 611

Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQL 661
           S+RTEAI+DDRPEYAS+QAIL+GPYLLAGHTS +W I   T       I+PIP + N+ L
Sbjct: 612 SIRTEAIKDDRPEYASLQAILYGPYLLAGHTSMDWSIT--TQAKAGNWITPIPETLNSHL 669

Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGK 721
           VT +Q+SGN ++V+SNSNQ+I M+  P  GT  A+ ATFRL+  D S    SS   +IG 
Sbjct: 670 VTLSQQSGNISYVLSNSNQTIIMKVSPEPGTQDAVSATFRLVTDD-SKHPISSPEGLIGS 728

Query: 722 SVMLEPFDFPGMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGC 780
            VMLEPFDFPGM+V+Q  +  L V + SP + GSS FRLV+GLD +  +VSL  E++KGC
Sbjct: 729 LVMLEPFDFPGMIVKQATDSSLTVQASSPSDKGSSSFRLVSGLDGKPGSVSLSLESKKGC 788

Query: 781 FVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAP 840
           FV S    + G  L+L C + + D  F +AASF ++ G+++Y+P+SFV  G +RNF+L+P
Sbjct: 789 FVYSDQTLKQGTKLRLECGSAATDEKFKQAASFSLKTGMNQYNPMSFVMSGTQRNFVLSP 848

Query: 841 LLSFRDEAYTVYFNIQ 856
           L S RDE Y VYF++Q
Sbjct: 849 LFSLRDETYNVYFSVQ 864


>gi|7630052|emb|CAB88260.1| putative protein [Arabidopsis thaliana]
          Length = 860

 Score = 1106 bits (2861), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 543/856 (63%), Positives = 657/856 (76%), Gaps = 14/856 (1%)

Query: 5   FVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
           +  F   C    L K+CT+  +   SH  R EL  S N   K E  SH+HLTPTDDSAWS
Sbjct: 14  YTSFLLVC----LAKECTDIPTKLSSHTLRSELLQSQNANLKSEEFSHYHLTPTDDSAWS 69

Query: 63  SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
           +L+P K+L ++ D+ +W +LYRK K+       GNFLK+VSLHDV LD SS  WRAQQTN
Sbjct: 70  TLLPRKMLKEETDDFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPSSFHWRAQQTN 126

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
           LEYLLMLDVD L ++FRK A L  PG  YGGWE P SELRGHFVGHYLSA+A MWASTHN
Sbjct: 127 LEYLLMLDVDGLAYNFRKEAGLNAPGVPYGGWEKPDSELRGHFVGHYLSATAYMWASTHN 186

Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
            T+K KM+ +V +L+ECQ K GTGYLSAFP+  FD FEA+  VWAPYYTIHKILAGL+DQ
Sbjct: 187 ETLKAKMTALVSALAECQQKYGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAGLVDQ 246

Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
           Y LA N QALKMAT M +YFY RVQ VI  YSVERHW SLNEETGGMNDVLY+LYSIT D
Sbjct: 247 YKLAGNTQALKMATGMADYFYGRVQNVIKKYSVERHWLSLNEETGGMNDVLYQLYSITRD 306

Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
            K+L LAHLFDKPCFLG LA+QAD +S FHANTHIPIV+GSQ RYE+TGD L+K I  FF
Sbjct: 307 SKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEIPMFF 366

Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
           MDIVNASHSYATGGTS +EFW DPKR+A TL +ENEE+CTTYNMLKVSR+LFRWTKE++Y
Sbjct: 367 MDIVNASHSYATGGTSVKEFWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWTKEVSY 426

Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
           ADYYERALTNGVL IQRGT+PG MIYMLPLG+GVSKA + HGWGT ++SFWCCYGTGIES
Sbjct: 427 ADYYERALTNGVLGIQRGTDPGRMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYGTGIES 486

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF-S 541
           FSKLGDSIYF+E+G  P LY+ QYISSS DWKS  + ++QKV+P+VSWDPY+R+T T  S
Sbjct: 487 FSKLGDSIYFQEDGATPALYVTQYISSSLDWKSAGLSISQKVNPVVSWDPYMRVTFTLSS 546

Query: 542 SKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
           SK  V + S+LNLR+PVWT S GA+ SLNG+ L +P  GNFLS  ++W   D++T++LP+
Sbjct: 547 SKVGVAKESTLNLRIPVWTNSIGAKVSLNGRPLNVPTSGNFLSIKQKWKSGDQVTMELPM 606

Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQL 661
           S+RTEAI+DDRPEYAS+QAIL+GPYLLAGHTS +W I   T       I+PIP + N+ L
Sbjct: 607 SIRTEAIKDDRPEYASLQAILYGPYLLAGHTSMDWSIT--TQAKAGNWITPIPETLNSHL 664

Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGK 721
           VT +Q+SGN ++V+SNSNQ+I M+  P  GT  A+ ATFRL+  D S    SS   +IG 
Sbjct: 665 VTLSQQSGNISYVLSNSNQTIIMKVSPEPGTQDAVSATFRLVTDD-SKHPISSPEGLIGS 723

Query: 722 SVMLEPFDFPGMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGC 780
            VMLEPFDFPGM+V+Q  +  L V + SP + GSS FRLV+GLD +  +VSL  E++KGC
Sbjct: 724 LVMLEPFDFPGMIVKQATDSSLTVQASSPSDKGSSSFRLVSGLDGKPGSVSLSLESKKGC 783

Query: 781 FVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAP 840
           FV S    + G  L+L C + + D  F +AASF ++ G+++Y+P+SFV  G +RNF+L+P
Sbjct: 784 FVYSDQTLKQGTKLRLECGSAATDEKFKQAASFSLKTGMNQYNPMSFVMSGTQRNFVLSP 843

Query: 841 LLSFRDEAYTVYFNIQ 856
           L S RDE Y VYF++Q
Sbjct: 844 LFSLRDETYNVYFSVQ 859


>gi|357139358|ref|XP_003571249.1| PREDICTED: uncharacterized protein LOC100841742 [Brachypodium
           distachyon]
          Length = 883

 Score =  992 bits (2564), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 479/822 (58%), Positives = 616/822 (74%), Gaps = 29/822 (3%)

Query: 52  HLTPTDDSAWSSLIPSKILGDQ--------KDEVSWALLYRKIKNPGGFDLPGN------ 97
           HL PTD+SAW +L+P ++L           ++   W +LYRK++  G   + G       
Sbjct: 71  HLIPTDESAWMALMPRRLLAGGAGGNGAPPREAFDWLMLYRKLRGGGDGAIDGPAAAAAG 130

Query: 98  -FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
            FL E SLHDV L   +V W+AQQTNLEYLL+LD D LVWSFR  A LP  G  YGGWE 
Sbjct: 131 PFLSEASLHDVRLQPGTVYWQAQQTNLEYLLLLDADRLVWSFRTQAGLPATGTPYGGWEG 190

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
           P  ELRGHFVGHYL+A+A+MWASTHN T++ KMS+V+ +L +CQ K+G GYLSAFPTE F
Sbjct: 191 PSVELRGHFVGHYLTAAAKMWASTHNDTLRTKMSSVIDTLYDCQKKMGMGYLSAFPTEFF 250

Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
           D  EAL  VWAPYYTIHKI+ GLLDQY +A +++AL+M   M +YF  RV+ VI  YS+E
Sbjct: 251 DRAEALTTVWAPYYTIHKIMQGLLDQYTVAGSSKALEMVVGMADYFSGRVKNVIQKYSIE 310

Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
           RHW SLNEETGGMNDVLY+LY+IT+D KHL LAHLFDKPCFLG LA+QAD +S FH+NTH
Sbjct: 311 RHWASLNEETGGMNDVLYQLYAITNDLKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTH 370

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
           IP+VIG+QMRYEVTGD LYK I + FMD++N+SHSYATGGTSA EFW+DPKRLA TL +E
Sbjct: 371 IPVVIGAQMRYEVTGDVLYKQIASSFMDMINSSHSYATGGTSAGEFWYDPKRLAATLSTE 430

Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
           NEE+CTTYNMLKVSR+LFRWTKEI+YADYYERAL NGVLSIQRGT+PGVMIYMLP   G 
Sbjct: 431 NEESCTTYNMLKVSRNLFRWTKEISYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGR 490

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
           SKA   HGWGT ++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+F+WK+ 
Sbjct: 491 SKAVGYHGWGTLYDSFWCCYGTGIESFSKLGDSIYFEEKGHAPALNIIQYIPSTFNWKTA 550

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
            + + Q+++ + S DPYLR++L+ S+K   GQ ++LN+R+P WT +NG +A+L G++L L
Sbjct: 551 GLTVTQQLESLSSSDPYLRVSLSVSAK---GQSATLNVRIPTWTSANGTKATLTGKDLGL 607

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
             PG  LS +++W+ ++ L++Q P+SLRTEAI+DDRP+YAS+QAILFGP++LAG +SG+W
Sbjct: 608 VTPGTLLSISKQWNSDEHLSLQFPISLRTEAIKDDRPQYASLQAILFGPFVLAGLSSGDW 667

Query: 637 DIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAA 695
           D K  +A  +S  I+ +P S+N+QL+TFTQES   TFV+S+SN S+TM+E P + GTD A
Sbjct: 668 DAKASSA--VSDWITAVPSSYNSQLMTFTQESNGKTFVLSSSNGSLTMQERPSIDGTDTA 725

Query: 696 LHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSS 755
           +HATFR+  +D++    +    + G  V +EPFD PG ++         ++ S ++  +S
Sbjct: 726 VHATFRVHSQDSTSQQGTYNAALKGTPVQIEPFDLPGTVITNN------LTFSAQKSSAS 779

Query: 756 GFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASF 813
            F +V GLD +  +VSLE   + GCF+ SG ++  G  +++ C  S +S+   F +AASF
Sbjct: 780 FFDIVPGLDGKPNSVSLELGTKSGCFMVSGADYSAGTKIQVSCKSSLQSIGGIFEQAASF 839

Query: 814 MMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
           +    + +YHPISFVAKG RRNFLL PL S RDE YTVYFN+
Sbjct: 840 VQATPLRQYHPISFVAKGVRRNFLLEPLYSLRDEFYTVYFNL 881


>gi|242060854|ref|XP_002451716.1| hypothetical protein SORBIDRAFT_04g006520 [Sorghum bicolor]
 gi|241931547|gb|EES04692.1| hypothetical protein SORBIDRAFT_04g006520 [Sorghum bicolor]
          Length = 888

 Score =  988 bits (2554), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 486/814 (59%), Positives = 613/814 (75%), Gaps = 18/814 (2%)

Query: 52  HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLP-------GNFLKEVSL 104
           HLTPTD+S W SL+P + L  +++   W +LYRK++       P       G FL + SL
Sbjct: 81  HLTPTDESTWMSLMPRRAL-RREEAFDWLMLYRKLRGATAGGAPRRPGVAAGTFLSDASL 139

Query: 105 HDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGH 164
           HDV L+  S+ WRAQQTNLEYLL+LDVD LVWSFRK A L  PG  YGGWE P  ELRGH
Sbjct: 140 HDVRLEPGSLYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPDVELRGH 199

Query: 165 FVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKP 224
           FVGHYLSA+A+MWASTHN T+  KMS+V+ +LS+CQ K+GTGYLSAFPTE FD  EA+KP
Sbjct: 200 FVGHYLSATAKMWASTHNDTLNAKMSSVIDALSDCQKKMGTGYLSAFPTEFFDRVEAIKP 259

Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
           VWAPYYTIHKI+ GLLDQY +A N++AL M   M  YF +RV+ VI  YS+ERHW SLNE
Sbjct: 260 VWAPYYTIHKIMQGLLDQYTVAGNSKALDMVVNMANYFSDRVKNVIQKYSIERHWESLNE 319

Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
           ETGGMNDVLY+LY+IT+D KHL LAHLFDKPCFLG LA+QAD +S FH+NTHIP+VIG+Q
Sbjct: 320 ETGGMNDVLYQLYTITNDLKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQ 379

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
           MRYEVTGDPLYK I +FFMD +N+SHSYATGGTSA EFW DPK LA TL +ENEE+CTTY
Sbjct: 380 MRYEVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKHLAGTLSTENEESCTTY 439

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           NMLK+SR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP   G SKA S H 
Sbjct: 440 NMLKISRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHS 499

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
           WGTK++SFWCCYGTGIESFSKLGDSIYFEE+ ++P L IIQYI S++DWK+  +++ QKV
Sbjct: 500 WGTKYDSFWCCYGTGIESFSKLGDSIYFEEKEDLPALNIIQYIPSTYDWKAAGLIVTQKV 559

Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLS 584
           + + S D YL+++L+ S+K + GQ + LN+R+P WT+++GA A+LN ++L    PG+FLS
Sbjct: 560 NTLSSSDQYLQISLSISAKTK-GQTAKLNVRIPSWTFADGAGATLNDKDLGSISPGSFLS 618

Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTAR 644
            T++W+ +D L ++ P+ LRTEAI+DDRPEYAS+QA+LFGP++LAG ++G+WD K G   
Sbjct: 619 ITKQWNSDDHLALRFPIRLRTEAIKDDRPEYASLQAVLFGPFVLAGLSTGDWDAKAGNGS 678

Query: 645 SLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATFRLI 703
           ++S  I+ +PP+ N+QLVTF+Q S   TFV+S++N ++TM+E P V GTD A+HATFR  
Sbjct: 679 AISDWITAVPPAHNSQLVTFSQVSNGKTFVLSSANGTLTMQERPEVDGTDTAIHATFRAH 738

Query: 704 LKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGL 763
            +D++  +        G S+++EPFD PG ++         ++ S ++     F LV GL
Sbjct: 739 PQDSTELHDIYRTIAKGASILIEPFDLPGTVITNN------LTLSAQKSTDCLFNLVPGL 792

Query: 764 DKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIGISE 821
           D    +VSLE   R GCF+ +G N+  G  +++ C  S ES+     +AASF     + +
Sbjct: 793 DGNPNSVSLELGTRPGCFLVTGTNYSAGTKIQVSCKSSLESIGGILEQAASFSQTDPLRQ 852

Query: 822 YHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
           YHPISFVAKG  RNFLL PL S RDE YTVYFNI
Sbjct: 853 YHPISFVAKGMTRNFLLEPLYSLRDEFYTVYFNI 886


>gi|219885159|gb|ACL52954.1| unknown [Zea mays]
          Length = 879

 Score =  985 bits (2547), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 500/874 (57%), Positives = 631/874 (72%), Gaps = 30/874 (3%)

Query: 4   GFVLFFFFCFGL--ALGKQCTNQSP-YDSHAFRYELT---STNKTWKEEVLSHF------ 51
           G V+      G   A GK CTN  P   SH  R           T  + ++ H       
Sbjct: 14  GIVVVMLLAAGFRGAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQ 73

Query: 52  HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPG----NFLKEVSLHDV 107
           HLTPTD+S W SL+P + L  +++   W +LYR+++  GG   PG     FL E SLHDV
Sbjct: 74  HLTPTDESTWMSLMPRRAL-RREEAFDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDV 132

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG 167
            L+  S+ WRAQQTNLEYLL+LDVD LVWSFRK A L  PG  YGGWE P  +LRGHFVG
Sbjct: 133 RLEPGSMYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVG 192

Query: 168 HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWA 227
           HYLSA+A+MWASTHN T+  KMS+VV +L +CQ K+GTGYLSAFP++ FD  EA+K VWA
Sbjct: 193 HYLSATAKMWASTHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWA 252

Query: 228 PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
           PYYTIHKI+ GLLDQY +A N+ AL M   M  YF +RV+ VI  YS+ERHW SLNEETG
Sbjct: 253 PYYTIHKIMQGLLDQYTVAGNSMALDMVIKMANYFSDRVKNVIQNYSIERHWESLNEETG 312

Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
           GMNDVLY+LY+ITHD KHL LAHLFDKPCFLG LA+QAD +S FH+NTHIP+VIG+QMRY
Sbjct: 313 GMNDVLYQLYTITHDMKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRY 372

Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNML 407
           EVTGDPLYK I +FFMD +N+SHSYATGGTSA EFW DPKRLA TL +ENEE+CTTYNML
Sbjct: 373 EVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTYNML 432

Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
           KVSR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP   G SKA S HGWGT
Sbjct: 433 KVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHGWGT 492

Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI 527
           K++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+  + + Q++  +
Sbjct: 493 KYDSFWCCYGTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQQIKTL 552

Query: 528 VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATE 587
            S D YL+++ + S+    GQ +++N R+P WT+++GA A+LNG++L    PG+FLS T+
Sbjct: 553 SSSDQYLQISFSISANTS-GQTANINFRIPSWTFADGAGATLNGKDLGSISPGSFLSITK 611

Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLS 647
           +W+ +D L +  P+ LRTEAI+DDR EYAS+QA+LFGP++LAG ++G+WD K G   ++S
Sbjct: 612 QWNSDDHLALHFPIRLRTEAIKDDRLEYASLQAVLFGPFVLAGLSTGDWDAKAGNGSAIS 671

Query: 648 ALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATFRLILKD 706
             I+ +PP+ N+QLVTFTQ S    FV+S++N ++TM+E P V GTDAA+HATFR   ++
Sbjct: 672 DWIAAVPPAHNSQLVTFTQVSNGKAFVLSSANGTLTMQERPEVDGTDAAVHATFRAHPQE 731

Query: 707 AS--LSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLD 764
            S  L +  S   + G S++LEPFD PG ++         ++ S ++   S F +V GLD
Sbjct: 732 DSTELHDIYS-TTLTGTSILLEPFDLPGTVITNN------LTLSAQKSSDSLFNIVPGLD 784

Query: 765 KRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIGISEY 822
               +VSLE   + GCF+ +G N+  G  +++ C  S ES+     +AASF     + +Y
Sbjct: 785 GNPNSVSLELGTKPGCFLVTGTNYSAGTRIEVNCKSSLESIGGILEQAASFSQTDPLRQY 844

Query: 823 HPISFVAKGARRNFLLAPLLSFRDEAYTVYFNIQ 856
           HPISFVAKG  RNFLL PL S RDE YTVYFN++
Sbjct: 845 HPISFVAKGVARNFLLEPLYSLRDEFYTVYFNVR 878


>gi|326495110|dbj|BAJ85651.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 868

 Score =  985 bits (2547), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 485/815 (59%), Positives = 612/815 (75%), Gaps = 21/815 (2%)

Query: 52  HLTPTDDSAWSSLIPSKILGD------QKDEVSWALLYRKIKN-PGGFDLP-GNFLKEVS 103
           HLTPTD+SAW  L+P + L         ++   W +LYR+++      D P G FL E S
Sbjct: 62  HLTPTDESAWMELMPRRSLSGGGGSTPPREAFDWLMLYRRLRGGAAAVDGPAGPFLSEAS 121

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           LHDV L   ++ W+AQQTNLEYLL+LD D LVWSFR  A L   G  YGGWE P  ELRG
Sbjct: 122 LHDVRLQPGTIYWQAQQTNLEYLLLLDTDRLVWSFRTQAGLTATGTPYGGWEGPNVELRG 181

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK 223
           HFVGHYLSA+A+MWASTHN T++ KMS+VV  L +CQ K+GTGYLSAFP+E FD  EAL 
Sbjct: 182 HFVGHYLSATAKMWASTHNDTLRAKMSSVVDVLYDCQKKMGTGYLSAFPSEFFDRAEALT 241

Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
            VWAPYYTIHK++ GLLDQY +A N++AL+M   M  YF +RV+ +I  YS+ERHW SLN
Sbjct: 242 TVWAPYYTIHKVMQGLLDQYTVAGNSKALEMVVGMANYFSDRVKNIIQKYSIERHWASLN 301

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGS 343
           EETGGMNDVLY+LY+IT D KHL LAHLFDKPCFLG LALQAD +S FH+NTHIP+V+G+
Sbjct: 302 EETGGMNDVLYQLYTITDDLKHLTLAHLFDKPCFLGLLALQADSISGFHSNTHIPVVVGA 361

Query: 344 QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTT 403
           QMRYEVTGD LYK I T FMD++N+SHSYATGGTSA EFW DPKRLA TL +EN E+CTT
Sbjct: 362 QMRYEVTGDVLYKQIATSFMDMINSSHSYATGGTSAGEFWSDPKRLAATLSTENAESCTT 421

Query: 404 YNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH 463
           YNMLKVSR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP   G SKA S H
Sbjct: 422 YNMLKVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVSYH 481

Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
           GWGTK++SFWCCYGTGIESFSKLGDSIYFEE+G  P L IIQYI S+F+WK+  V + Q+
Sbjct: 482 GWGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGETPALSIIQYIPSTFNWKTAGVTVTQQ 541

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
           ++P+ S D  ++++L+FS K   GQ ++LN+R+P WT ++GA+A+LN ++L    PG+ L
Sbjct: 542 LEPLSSPDMNVQVSLSFSGKN--GQSATLNVRIPTWTSASGAKATLNDKDLGSVTPGSLL 599

Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTA 643
           S T++W+ ND L++Q P++LRTEAI+DDRPEYAS+QAILFGP++LAG +S + D KTG+A
Sbjct: 600 SVTKQWNSNDHLSLQFPIALRTEAIKDDRPEYASLQAILFGPFVLAGLSSSDCDAKTGSA 659

Query: 644 RSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATFRL 702
             +S  I+ +P S N+QL+TFTQES   TFV+S+SN S+TM+E P V GTD A+HATFR+
Sbjct: 660 --VSDWITAVPSSHNSQLMTFTQESSGKTFVLSSSNGSLTMQERPTVDGTDTAIHATFRV 717

Query: 703 ILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAG 762
             +D +  + +    +   SV++EPFD PG  +     ++L +S + K  GS  F +V+G
Sbjct: 718 HPQDTARLHGTYGATLQDTSVLIEPFDMPGTAI----ANDLTLS-TQKSTGSL-FNIVSG 771

Query: 763 LDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIGIS 820
           LD +  +VSLE   + GCF+ SG ++  G  +++ C  S +S+   F +AASF     + 
Sbjct: 772 LDGKPNSVSLELGTKPGCFLVSGADYSAGTKIQVSCKSSIQSIGGIFEQAASFAQAAPLR 831

Query: 821 EYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
           +YHPISFVAKG +RNFLL PL S RDE YT YFN+
Sbjct: 832 QYHPISFVAKGVQRNFLLEPLYSLRDEFYTAYFNL 866


>gi|226497412|ref|NP_001145969.1| uncharacterized protein LOC100279496 precursor [Zea mays]
 gi|223945575|gb|ACN26871.1| unknown [Zea mays]
          Length = 879

 Score =  985 bits (2546), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 500/874 (57%), Positives = 631/874 (72%), Gaps = 30/874 (3%)

Query: 4   GFVLFFFFCFGL--ALGKQCTNQSP-YDSHAFRYELT---STNKTWKEEVLSHF------ 51
           G V+      G   A GK CTN  P   SH  R           T  + ++ H       
Sbjct: 14  GIVVVMLLAAGFRGAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQ 73

Query: 52  HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPG----NFLKEVSLHDV 107
           HLTPTD+S W SL+P + L  +++   W +LYR+++  GG   PG     FL E SLHDV
Sbjct: 74  HLTPTDESTWMSLMPRRAL-RREEAFDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDV 132

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG 167
            L+  S+ WRAQQTNLEYLL+LDVD LVWSFRK A L  PG  YGGWE P  +LRGHFVG
Sbjct: 133 RLEPGSMYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVG 192

Query: 168 HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWA 227
           HYLSA+A+MWASTHN T+  KMS+VV +L +CQ K+GTGYLSAFP++ FD  EA+K VWA
Sbjct: 193 HYLSATAKMWASTHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWA 252

Query: 228 PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
           PYYTIHKI+ GLLDQY +A N+ AL M   M  YF +RV+ VI  YS+ERHW SLNEETG
Sbjct: 253 PYYTIHKIMQGLLDQYTVAGNSMALDMVIKMANYFSDRVKNVIQNYSIERHWESLNEETG 312

Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
           GMNDVLY+LY+ITHD KHL LAHLFDKPCFLG LA+QAD +S FH+NTHIP+VIG+QMRY
Sbjct: 313 GMNDVLYQLYTITHDMKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRY 372

Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNML 407
           EVTGDPLYK I +FFMD +N+SHSYATGGTSA EFW DPKRLA TL +ENEE+CTTYNML
Sbjct: 373 EVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTYNML 432

Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
           KVSR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP   G SKA S HGWGT
Sbjct: 433 KVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHGWGT 492

Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI 527
           K++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+  + + Q++  +
Sbjct: 493 KYDSFWCCYGTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQQIKTL 552

Query: 528 VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATE 587
            S D YL+++ + S+    GQ +++N R+P WT+++GA A+LNG++L    PG+FLS T+
Sbjct: 553 SSSDQYLQISFSISANTS-GQTANINFRIPSWTFADGAGATLNGKDLGSISPGSFLSITK 611

Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLS 647
           +W+ +D L +  P+ LRTEAI+DDR EYAS+QA+LFGP++LAG ++G+WD K G   ++S
Sbjct: 612 QWNSDDHLALHFPIRLRTEAIKDDRLEYASLQAVLFGPFVLAGLSTGDWDAKAGNGSAIS 671

Query: 648 ALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATFRLILKD 706
             I+ +PP+ N+QLVTFTQ S    FV+S++N ++TM+E P V GTDAA+HATFR   ++
Sbjct: 672 DWIAAVPPAHNSQLVTFTQVSNGKAFVLSSANGTLTMQERPEVDGTDAAIHATFRAHPQE 731

Query: 707 AS--LSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLD 764
            S  L +  S   + G S++LEPFD PG ++         ++ S ++   S F +V GLD
Sbjct: 732 DSTELHDIYS-TTLTGTSILLEPFDLPGTVITNN------LTLSAQKSSDSLFNIVPGLD 784

Query: 765 KRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIGISEY 822
               +VSLE   + GCF+ +G N+  G  +++ C  S ES+     +AASF     + +Y
Sbjct: 785 GNPNSVSLELGTKPGCFLVTGTNYSAGTRIEVNCKSSLESIGGILEQAASFSQTDPLRQY 844

Query: 823 HPISFVAKGARRNFLLAPLLSFRDEAYTVYFNIQ 856
           HPISFVAKG  RNFLL PL S RDE YTVYFN++
Sbjct: 845 HPISFVAKGVARNFLLEPLYSLRDEFYTVYFNVR 878


>gi|115444811|ref|NP_001046185.1| Os02g0195500 [Oryza sativa Japonica Group]
 gi|49388119|dbj|BAD25250.1| unknown protein [Oryza sativa Japonica Group]
 gi|113535716|dbj|BAF08099.1| Os02g0195500 [Oryza sativa Japonica Group]
 gi|125581152|gb|EAZ22083.1| hypothetical protein OsJ_05746 [Oryza sativa Japonica Group]
          Length = 891

 Score =  979 bits (2532), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 482/817 (58%), Positives = 619/817 (75%), Gaps = 20/817 (2%)

Query: 52  HLTPTDDSAWSSLIPSKILGD-----QKDEVSWALLYRKIKNPGGFDLPGN-----FLKE 101
           HLTPTD+S W SL+P ++L       ++D   W +LYR ++  G             L E
Sbjct: 80  HLTPTDESTWMSLMPRRLLASPVSSPRRDAFDWLMLYRNLRGSGSGAGAIAASGGALLAE 139

Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
            SLHDV L   +V W+AQQTNLEYLL+LDVD LVWSFR  A LP  G  YGGWE P  EL
Sbjct: 140 ASLHDVRLQPGTVYWQAQQTNLEYLLLLDVDRLVWSFRTQAGLPASGAPYGGWEGPGVEL 199

Query: 162 RGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA 221
           RGHFVGHYLSA+A+MWASTHN T+  KMS+VV +L +CQ K+G+GYLSAFP+E FD  E+
Sbjct: 200 RGHFVGHYLSATAKMWASTHNDTLLAKMSSVVDALHDCQKKMGSGYLSAFPSEFFDRVES 259

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
           +K VWAPYYTIHKI+ GLLDQY +A N++AL +   M  YF +RV+ VI  YS+ERHW S
Sbjct: 260 IKAVWAPYYTIHKIMQGLLDQYTVAGNSKALDLVVGMANYFSDRVKNVIQKYSIERHWAS 319

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
           LNEE+GGMNDVLY+LY+IT+D KHL LAHLFDKPCFLG LA+QAD +S FH+NTHIP+VI
Sbjct: 320 LNEESGGMNDVLYQLYTITNDQKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVI 379

Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
           G+QMRYEVTGD LYK I TFFMD +N+SHSYATGGTSA EFW +PKRLADTL +ENEE+C
Sbjct: 380 GAQMRYEVTGDLLYKQIATFFMDTINSSHSYATGGTSAGEFWTNPKRLADTLSTENEESC 439

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
           TTYNMLKVSR+LFRWTKE++YADYYERAL NGVLSIQRGT+PGVMIYMLP   G SKA S
Sbjct: 440 TTYNMLKVSRNLFRWTKELSYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVS 499

Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
            HGWGTK++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+  + +N
Sbjct: 500 YHGWGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDRPVLNIIQYIPSAYNWKAAGLTVN 559

Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
           Q++ PI S D +L+++L+ S+K   GQ ++LN+R+P WT +NGA+A+LN  +L L  PG+
Sbjct: 560 QQLKPISSLDMFLQVSLSTSAKTN-GQSATLNVRIPSWTSANGAKATLNDNDLGLMSPGS 618

Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG 641
           FLS +++W+ +D L++Q P++LRTEAI+DDRPEYAS+QAILFGP++LAG ++G+W+ + G
Sbjct: 619 FLSISKQWNSDDHLSLQFPITLRTEAIKDDRPEYASLQAILFGPFVLAGLSTGDWNAEAG 678

Query: 642 TARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATF 700
              ++S  ISP+P S+N+QLVTFTQES   TFV+S++N S+TM+E P V GTD A+HATF
Sbjct: 679 NTSAISDWISPVPSSYNSQLVTFTQESSGKTFVLSSANGSLTMQERPTVDGTDTAIHATF 738

Query: 701 RLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLV 760
           R+  +D++    +    + G SV +EPFD PG ++         +++S ++   S F +V
Sbjct: 739 RVHPQDSAGQLDTQGATLKGTSVQIEPFDLPGTVITNN------LTQSAQKSSDSLFNIV 792

Query: 761 AGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIG 818
            GLD    +VSLE   + GCF+  GV++  G  +++ C  S  S++  F +AASF+    
Sbjct: 793 PGLDGNPNSVSLELGTKPGCFLVIGVDYSVGTKIQVSCKSSLPSINGIFEQAASFVQAAP 852

Query: 819 ISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
           + +YHPISF+AKG +RNFLL PL S RDE YTVYFN+
Sbjct: 853 LRQYHPISFIAKGVKRNFLLEPLYSLRDEFYTVYFNL 889


>gi|125538467|gb|EAY84862.1| hypothetical protein OsI_06226 [Oryza sativa Indica Group]
          Length = 891

 Score =  979 bits (2531), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 480/817 (58%), Positives = 619/817 (75%), Gaps = 20/817 (2%)

Query: 52  HLTPTDDSAWSSLIPSKILGD-----QKDEVSWALLYRKIKNPGGFDLPGN-----FLKE 101
           HLTPTD+S W SL+P ++L       ++D   W +LYR ++  G             L E
Sbjct: 80  HLTPTDESTWMSLMPRRLLASPASSPRRDAFDWLMLYRNLRGSGSGAGAIAASGGALLAE 139

Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
            SLHDV L   +V W+AQQTNLEYLL+LDVD LVWSFR  A LP  G  YGGWE P  EL
Sbjct: 140 ASLHDVRLQPGTVYWQAQQTNLEYLLLLDVDRLVWSFRTQAGLPASGAPYGGWEGPGVEL 199

Query: 162 RGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA 221
           RGHFVGHYLSA+A+MWASTHN T++ KMS+VV +L +CQ K+G+GYLSAFP+E FD  E+
Sbjct: 200 RGHFVGHYLSATAKMWASTHNDTLQAKMSSVVDALHDCQKKMGSGYLSAFPSEFFDRVES 259

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
           +K VWAPYYTIHKI+ GLLDQY +A N++AL +   M  YF +RV+ VI  YS+ERHW S
Sbjct: 260 IKAVWAPYYTIHKIMQGLLDQYTVAGNSKALDLVVGMANYFSDRVKNVIQKYSIERHWAS 319

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
           LNEE+GGMNDVLY+LY+IT+D KHL LAHLFDKPCFLG LA+QAD +S FH+NTHIP+VI
Sbjct: 320 LNEESGGMNDVLYQLYTITNDQKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVI 379

Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
           G+QMRYEVTGD LYK I TFFMD +N+SHSYATGGTSA EFW +PKRLADTL +ENEE+C
Sbjct: 380 GAQMRYEVTGDLLYKQIATFFMDTINSSHSYATGGTSAGEFWTNPKRLADTLSTENEESC 439

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
           TTYNMLKVSR+LFRWTKE++YADYYERAL NGVLSIQRGT+PGVMIYMLP   G SKA S
Sbjct: 440 TTYNMLKVSRNLFRWTKELSYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVS 499

Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
            HGWGTK++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+  + +N
Sbjct: 500 YHGWGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDRPVLNIIQYIPSAYNWKAAGLTVN 559

Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
           Q++ PI S D +L+++L+ S+K   GQ ++LN+R+P WT +NGA+A+LN  +L L  PG+
Sbjct: 560 QQLKPISSLDMFLQVSLSTSAKTN-GQSATLNVRIPSWTSANGAKATLNDNDLGLMSPGS 618

Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG 641
           FLS +++W+ +D L++Q P++LRTEAI+DDRPEYAS+QAILFGP++LAG ++G+W+ + G
Sbjct: 619 FLSISKQWNSDDHLSLQFPITLRTEAIKDDRPEYASLQAILFGPFVLAGLSTGDWNAEAG 678

Query: 642 TARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATF 700
              ++S  ISP+P S+N+QLVTFTQES   TFV+S++N S+ M+E P V GTD A+HATF
Sbjct: 679 NTSAISDWISPVPSSYNSQLVTFTQESSGKTFVLSSANGSLAMQERPTVDGTDTAIHATF 738

Query: 701 RLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLV 760
           R+  +D++    +    + G SV +EPFD PG ++         +++S ++   S F +V
Sbjct: 739 RVHPQDSAGQLDTQGATLKGTSVQIEPFDLPGTVITNN------LTQSAQKSSDSLFNIV 792

Query: 761 AGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIG 818
            GLD    +VSLE   + GCF+ +GV++  G  +++ C  S  S++  F +A SF+    
Sbjct: 793 PGLDGNPNSVSLELGTKPGCFLVTGVDYSVGTKIQVSCKSSLPSINGIFEQATSFVQAAP 852

Query: 819 ISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
           + +YHPISF+AKG +RNFLL PL S RDE YTVYFN+
Sbjct: 853 LRQYHPISFIAKGVKRNFLLEPLYSLRDEFYTVYFNL 889


>gi|357123866|ref|XP_003563628.1| PREDICTED: uncharacterized protein LOC100829886 [Brachypodium
           distachyon]
          Length = 850

 Score =  951 bits (2458), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 499/865 (57%), Positives = 616/865 (71%), Gaps = 41/865 (4%)

Query: 15  LALGKQCTN-QSPYDSHAFRYELTS--TNKTWKEEVL--SHFHLTPTDDSAWSSLIPSKI 69
           +A+ K+CTN  +   SH  R  L    + + W+   L   H H++PTD++ W  L     
Sbjct: 1   MAVAKECTNVPTQLSSHTVRARLQGDPSAEEWRLRALFHDHAHVSPTDEATWMDLRAPLA 60

Query: 70  LGDQKDEVSWALLYRKIKNPGGFDLPGN---FLKEVSLHDVWLD--QSSVLWRAQQTNLE 124
                +E  WA+LYR +K             FL+EV L DV LD  + +V  RAQQTNLE
Sbjct: 61  SSAATEESGWAMLYRALKGSASGGSASAAAGFLEEVPLQDVRLDMEEDAVYGRAQQTNLE 120

Query: 125 YLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNAT 184
           YLL+LDVD L+WSFR  A LP PGK YGGWE    ELRGHFVGHYLSA+A+ WASTHN T
Sbjct: 121 YLLLLDVDRLLWSFRTQAGLPAPGKPYGGWEGADVELRGHFVGHYLSAAAKTWASTHNGT 180

Query: 185 IKEKMSTVVFSLSECQNKI----GTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLL 240
           +  KMS VV +L ECQ       G GYLSAFP E FD FEA++PVWAPYYT+HKI+ GLL
Sbjct: 181 LAAKMSAVVDALHECQQAAAANGGNGYLSAFPAEFFDRFEAIQPVWAPYYTVHKIMQGLL 240

Query: 241 DQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSIT 300
           DQ+ +A N +AL MA  M  YF  RV+ VI  + +ERHW SLNEETGGMNDVLY+LY+IT
Sbjct: 241 DQHTVAGNGKALAMAVAMAGYFGGRVRSVIQRHGIERHWTSLNEETGGMNDVLYQLYTIT 300

Query: 301 HDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGT 360
           +D +HL+LAHLFDKPCFLG LA+QAD L+ FHANTHIP+V+G QMRYEVTGDPLYK I T
Sbjct: 301 NDQRHLVLAHLFDKPCFLGLLAVQADSLTGFHANTHIPVVVGGQMRYEVTGDPLYKEIST 360

Query: 361 FFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEI 420
           FFMDIVN SHSYATGGTS  EFW DPKRLA TL +ENEE+CTTYNMLKVSRHLFRWTKEI
Sbjct: 361 FFMDIVNTSHSYATGGTSVSEFWSDPKRLASTLTTENEESCTTYNMLKVSRHLFRWTKEI 420

Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
           AYADYYERAL NGVLSIQRG +PGVMIYMLP G G SKA S HGWGT+++SFWCCYGTGI
Sbjct: 421 AYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAVSYHGWGTQYDSFWCCYGTGI 480

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
           ESFSKLGD+IYFEE+G+ P LY++QYI S F+WKS  + + Q++ P+ S D YL+++L+ 
Sbjct: 481 ESFSKLGDTIYFEEKGSKPTLYVVQYIPSIFNWKSAGLTVTQRLKPLSSSDQYLQVSLSI 540

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
           S+K   GQ +++N+R+P W  +NGA+A+LN + L L  PG FL+ T++W+  D LT+QLP
Sbjct: 541 SAKTN-GQYATVNVRIPSWASANGAKATLNDKYLQLGSPGTFLTVTKQWNSGDHLTLQLP 599

Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG-TARSLSALISPIPPSFNA 659
           ++LRTEAI+DDR E+AS+QA+LFGP+LLAG ++G+WD KTG  A ++S  ISP+P S+++
Sbjct: 600 INLRTEAIKDDRAEFASLQAVLFGPFLLAGLSTGDWDAKTGAAAAAISDWISPVPSSYSS 659

Query: 660 QLVTFTQESGNSTFVMSNSN-QSITMEEFPV-SGTDAALHATFRLILKDASLSNFSSLNN 717
           QLVT TQESG STFV+S  N  S+ M+  P   GT+AA+H TFRL+ +    S   + N 
Sbjct: 660 QLVTLTQESGGSTFVLSTVNGTSLAMQPRPEGGGTEAAVHGTFRLVPQ--GFSPPPTTNR 717

Query: 718 VIG-----KSVMLEPFDFPGMLVQQGKEDEL-VVSESPKEMGSSGFRLVAGLDKRNETVS 771
             G      S M+EPFD PGM +     D L VV    K  GS  F +V GLD +  +VS
Sbjct: 718 RHGAPTNLASAMIEPFDLPGMAIT----DALTVVRSEEKSSGSLLFNVVPGLDGKPGSVS 773

Query: 772 LEAENRKGCFVSSGVNFEPGASLKLLCSTESLDAGFNR-AASFMMEIGISEYHPISFVAK 830
           LE   R GCFV +      GA +++ C      AGF++ AASF     +  YHPISFVA+
Sbjct: 774 LELGTRPGCFVVTA-----GAKVQVGCG-----AGFSQAAASFARAEPLRRYHPISFVAR 823

Query: 831 GARRNFLLAPLLSFRDEAYTVYFNI 855
           GARR FLL PL + RDE YTVYFN+
Sbjct: 824 GARRGFLLEPLFTLRDEFYTVYFNL 848


>gi|242096362|ref|XP_002438671.1| hypothetical protein SORBIDRAFT_10g024070 [Sorghum bicolor]
 gi|241916894|gb|EER90038.1| hypothetical protein SORBIDRAFT_10g024070 [Sorghum bicolor]
          Length = 887

 Score =  945 bits (2442), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 502/881 (56%), Positives = 618/881 (70%), Gaps = 64/881 (7%)

Query: 17  LGKQCTN-QSPYDSHAFRYELTST--NKTWKEEVLSHFHLTPTDDSAWSSLIPSKILGDQ 73
           + K+CTN  +   SH  R  L ++     W+   L H HL PTD++AW  L+P    G  
Sbjct: 27  MAKECTNIPTELSSHTVRARLQASPGAAEWRWRELFHEHLNPTDEAAWMDLMPPPPRGGL 86

Query: 74  KDE---------------VSWALLYRKIKNP----------GGFDLPGNFLKEVSLHDVW 108
           +                 + W +LYR +K             G    G FL+EVSLHDV 
Sbjct: 87  QTAAAADAGHHHHQEEEELDWVMLYRSLKGQQVVVGGAVPASGAAAAGPFLEEVSLHDVR 146

Query: 109 LD---QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHF 165
           LD     +   RAQ+TNLEYLL+LDVD LVWSFR  A+LP PG+ YGGWE P SELRGHF
Sbjct: 147 LDPDGDDAAYGRAQRTNLEYLLLLDVDRLVWSFRSQAALPAPGEPYGGWEKPDSELRGHF 206

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
           VGHYLSA+A+MWASTHN T+  KMS VV +L ECQ   GTGYLSAFP E FD FEA+KPV
Sbjct: 207 VGHYLSATAKMWASTHNGTLAGKMSAVVDALDECQRAAGTGYLSAFPAEFFDRFEAIKPV 266

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
           WAPYYTIHKI+ GLLDQ+V+A N +AL M   M +YF  RV+ VI  YS+ERHW SLNEE
Sbjct: 267 WAPYYTIHKIMQGLLDQHVVAGNGKALGMVVAMADYFAGRVRNVIRRYSIERHWTSLNEE 326

Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
           TGGMNDVLY+LY+ITHD +HL+LAHLFDKPCFLG LA+QAD LS+FHANTHIP+VIG QM
Sbjct: 327 TGGMNDVLYQLYTITHDQRHLVLAHLFDKPCFLGLLAVQADSLSNFHANTHIPVVIGGQM 386

Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
           RYEVTGDPLYK I TFFMD VN+SH+YATGGTS  EFW DPKRLA+ L +E EE+CTTYN
Sbjct: 387 RYEVTGDPLYKEIATFFMDTVNSSHAYATGGTSVSEFWSDPKRLAEALTTETEESCTTYN 446

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
           MLKVSRHLFRWTKE+AYADYYERAL NGVLSIQRG +PGVMIYMLP G G SKA+S HGW
Sbjct: 447 MLKVSRHLFRWTKEVAYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAKSYHGW 506

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
           GT+  SFWCCYGTGIESFSKLGDSIYFEE+G  P LYI+Q+I S+F+W++  + + QK+ 
Sbjct: 507 GTQNESFWCCYGTGIESFSKLGDSIYFEEKGQKPALYIVQFIPSTFNWRTTGLTVTQKLM 566

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
           P+ SWD YL+++ + S+K + GQ ++LN+R+P WT  NGA+A+LN ++L L  PG FL+ 
Sbjct: 567 PLSSWDQYLQVSFSISAKTD-GQFATLNVRIPSWTSLNGAKATLNDKDLQLASPGTFLTV 625

Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG-TAR 644
           +++W   D+L +QLP+ LRTEAI+DDRPEYASIQA+LFGP+LLAG T+GEWD KTG  A 
Sbjct: 626 SKQWGSGDQLLLQLPIHLRTEAIKDDRPEYASIQAVLFGPFLLAGLTTGEWDAKTGAAAA 685

Query: 645 SLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP--VSGTDAALHATFRL 702
           + +  I+P+PP  N+QLVT  QESG   FV+S  N S+TM+E P    GTDAA+HATFRL
Sbjct: 686 AATDWITPVPPGSNSQLVTLAQESGGKAFVLSAVNGSLTMQERPKDSGGTDAAVHATFRL 745

Query: 703 ILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSG--FRLV 760
           + +  + +           +  LEP D PGM+V     D L VS       SSG  F +V
Sbjct: 746 VPQGTNST----------AAATLEPLDMPGMVVT----DTLTVSAEK----SSGALFNVV 787

Query: 761 AGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTESLDAG------FNRAASFM 814
            GL     +VSLE  +R GCF+ +G +   G  +++ C+      G      F +AASF 
Sbjct: 788 PGLAGAPGSVSLELGSRPGCFLVAGGS---GEKVQVGCTGGVKKHGNGGGDWFRQAASFA 844

Query: 815 MEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
               +  YHP+SF A+G RR+FLL PL + RDE YT+YFN+
Sbjct: 845 RAEPMRRYHPMSFAARGVRRSFLLEPLFTLRDEFYTIYFNL 885


>gi|51090917|dbj|BAD35522.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|51090951|dbj|BAD35554.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 883

 Score =  926 bits (2393), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 491/874 (56%), Positives = 609/874 (69%), Gaps = 52/874 (5%)

Query: 19  KQCTN-QSPYDSHAFRYELTSTNKT---WKEEVLSHFHLTPTDDSAWSSLIPSKILGDQK 74
           K+CTN  +   SH  R  L S++     W+EE     HL PTD++AW  L+P  +     
Sbjct: 23  KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMP--LAAASA 80

Query: 75  DEVSWALLYRKIKNPGGFDLPGN-----------FLKEVSLHDVWLDQSS----VLWRAQ 119
            E  WA+LYR +K   G  + G+           FL+EVSLHDV LD       V  RAQ
Sbjct: 81  SEFDWAMLYRSLK---GAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQ 137

Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAS 179
           QTNLEYLL+L+VD LVWSFR  A LP PGK YGGWE P  ELRGHFVGHYLSA+A+MWAS
Sbjct: 138 QTNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVGHYLSAAAKMWAS 197

Query: 180 THNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGL 239
           THN T+  KM+ VV +L +CQ   GTGYLSAFP E FD FEA++PVWAPYYTIH I+ GL
Sbjct: 198 THNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIH-IMQGL 256

Query: 240 LDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSI 299
           LDQ+ +A N +AL M   M +YF  RV+ VI  Y++ERHW SLNEETGGMNDVLY+LY+I
Sbjct: 257 LDQHTVAGNGKALGMVVAMADYFAGRVRSVIQRYTIERHWTSLNEETGGMNDVLYQLYTI 316

Query: 300 THDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIG 359
           T D +HL+LAHLFDKPCFLG LA+QAD LS FHANTHIP+VIG QMRYEVTGDPLYK I 
Sbjct: 317 TKDQRHLVLAHLFDKPCFLGLLAVQADSLSGFHANTHIPVVIGGQMRYEVTGDPLYKEIA 376

Query: 360 TFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
           TFFMDIVN+SHSYATGGTS  EFW +PK LA+ L +E EE+CTTYNMLKVSRHLFRWTKE
Sbjct: 377 TFFMDIVNSSHSYATGGTSVSEFWSNPKHLAEALTTETEESCTTYNMLKVSRHLFRWTKE 436

Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTG 479
           IAYADYYERAL NGVLSIQRG +PGVMIYMLP G G SKA S HGWGT++NSFWCCYGTG
Sbjct: 437 IAYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAVSYHGWGTQYNSFWCCYGTG 496

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
           IESFSKLGDSIYFE++G+ PGLYIIQYI S+F+W++  + + Q+V P+ S D YL+++L+
Sbjct: 497 IESFSKLGDSIYFEQKGDKPGLYIIQYIPSTFNWRTAGLTVTQQVKPLSSSDQYLQVSLS 556

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERW-SYNDKLTIQ 598
            S+ +  GQ ++LN+R+P WT  NGA+A+LN ++L L  PG FL+ +++W S +D L +Q
Sbjct: 557 ISAAKTNGQYATLNVRIPSWTSMNGAKATLNDKDLQLASPGTFLTISKQWDSGDDHLLLQ 616

Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWD-IKTGTARSLSALISPIPPSF 657
            P++LRTEAI+DDRP+ AS+ AILFGP+LLAG T+G+WD    G A + S  I+P+P S+
Sbjct: 617 FPINLRTEAIKDDRPQVASLNAILFGPFLLAGLTTGDWDAKTGGAATAASDWITPVPASY 676

Query: 658 NAQLVTFTQESGNSTFVMSNSNQ-SITMEEFP--VSGTDAALHATFRLI--------LKD 706
           N+QLVT TQESG  T ++S  N  S+ M E P    GTDAA+ ATFR++         + 
Sbjct: 677 NSQLVTLTQESGGKTMLLSTVNDTSLAMLERPEGAGGTDAAVRATFRVVPPGSRAELRQR 736

Query: 707 ASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKR 766
           A          +   +  +EPF  PG  V  G    L V  +     S+ F +  GLD +
Sbjct: 737 AGAGAGEGAARLKVAAATIEPFGLPGTAVSNG----LAVVRAGNS-SSTLFNVAPGLDGK 791

Query: 767 NETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTE-----SLDAGFNRAASFMMEIGISE 821
             +VSLE  ++ GCF+ +G     GA + + C T      +  AGF +AASF     +  
Sbjct: 792 PGSVSLELGSKPGCFLVAGA----GAKVHVGCRTRGGAAAAAAAGFEQAASFAQAEPLRR 847

Query: 822 YHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
           YH ISF A G RR+FLL PL + RDE YT+YFN+
Sbjct: 848 YHAISFFASGVRRSFLLEPLFTLRDEFYTIYFNL 881


>gi|357472931|ref|XP_003606750.1| hypothetical protein MTR_4g065200 [Medicago truncatula]
 gi|355507805|gb|AES88947.1| hypothetical protein MTR_4g065200 [Medicago truncatula]
          Length = 646

 Score =  896 bits (2316), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 437/675 (64%), Positives = 516/675 (76%), Gaps = 34/675 (5%)

Query: 3   FGFVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAW 61
           F F+      FG   GK+C N  P  SH FRYEL  S N+TWK+EV+SH+HLTPTD+SAW
Sbjct: 4   FVFMFMAIMLFGCVAGKECMNNLP-QSHTFRYELWASKNETWKKEVMSHYHLTPTDESAW 62

Query: 62  SSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQT 121
           + L+P K+L ++ ++  WA  YR++KN      P  FLKEV L DV L + S+  +AQ+T
Sbjct: 63  ADLLPRKLLSEE-NQRDWAAKYREMKNADLSKPPVGFLKEVPLGDVRLLEGSIHAQAQKT 121

Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH 181
           NLEYLLMLDVDSL+WSFRKTA LPTPG  YGGWE+P  ELRGHFVGHYLSASA MWAST 
Sbjct: 122 NLEYLLMLDVDSLIWSFRKTAGLPTPGTPYGGWEDPSIELRGHFVGHYLSASALMWASTK 181

Query: 182 NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLD 241
           N  + EKMS +V  LS CQ KIGTGYLSAFPTELFD  EAL+  WAPYYTIHKILAGLLD
Sbjct: 182 NDNLNEKMSALVSGLSACQEKIGTGYLSAFPTELFDRVEALQYAWAPYYTIHKILAGLLD 241

Query: 242 QYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITH 301
           QY +  N QALKM TWMV+YFYNRV  VI   +V  H+ SLNEE GGMNDVLYRLYSIT 
Sbjct: 242 QYTIGGNPQALKMVTWMVDYFYNRVMNVIQKLTVNGHYQSLNEEAGGMNDVLYRLYSITR 301

Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTF 361
           D KHL+LAHLFDKPCFLG LA+QA+ +++FHANTHIPIV+GSQ+RYEVTGDPLYK IG F
Sbjct: 302 DSKHLVLAHLFDKPCFLGVLAVQANDIANFHANTHIPIVVGSQLRYEVTGDPLYKDIGAF 361

Query: 362 FMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI 420
           FMDIVN+SH+YATGGTS REFW DPKR+AD L S ENEE+CTTYNMLKVSRHLFRWTKE+
Sbjct: 362 FMDIVNSSHTYATGGTSVREFWNDPKRIADNLKSTENEESCTTYNMLKVSRHLFRWTKEV 421

Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
           +YADYYERALTNGVLSIQRGT+PGVMIYMLPLG GVSKA++  GWG  FN+FWCCYGTGI
Sbjct: 422 SYADYYERALTNGVLSIQRGTDPGVMIYMLPLGLGVSKAKTDKGWGNPFNTFWCCYGTGI 481

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
           ESFSKLGDSIYFEEEG+ P LYIIQYISSSF+WKSG ++L Q V P  S DPYLR+T TF
Sbjct: 482 ESFSKLGDSIYFEEEGHNPSLYIIQYISSSFNWKSGKILLTQTVVPAASSDPYLRVTFTF 541

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
           S  +  G  S+LN R+P W++++GA+A LN + L LP P                     
Sbjct: 542 SPNETTGTSSTLNFRVPSWSHADGAKAILNSETLSLPAP--------------------- 580

Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQ 660
                    DDRPE+AS+QAIL+GPYLLAGHT+  WDIK  T ++++  I+PIP ++++Q
Sbjct: 581 ---------DDRPEFASLQAILYGPYLLAGHTTSIWDIKGVTNKAVADWITPIPSNYSSQ 631

Query: 661 LVTFTQESGNSTFVM 675
           LV F  ++  +  ++
Sbjct: 632 LVFFIHKTSTNQLLL 646


>gi|218198543|gb|EEC80970.1| hypothetical protein OsI_23693 [Oryza sativa Indica Group]
          Length = 905

 Score =  882 bits (2278), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 480/900 (53%), Positives = 598/900 (66%), Gaps = 82/900 (9%)

Query: 19  KQCTN-QSPYDSHAFRYELTSTNKT---WKEEVLSHFHLTPTDDSAWSSLIPSKILGDQK 74
           K+CTN  +   SH  R  L S++     W+EE     HL PTD++AW  L+P  +     
Sbjct: 23  KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMP--LAAASA 80

Query: 75  DEVSWALLYRKIKNPGGFDLPGN-----------FLKEVSLHDVWLDQSS----VLWRAQ 119
            E  WA+LYR +K   G  + G+           FL+EVSLHDV LD       V  RAQ
Sbjct: 81  SEFDWAMLYRSLK---GAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQ 137

Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAS 179
           QTNLEYLL+L+VD LVWSFR  A LP PGK YGGWE P  ELRGHFVGHYLSA+A+MWAS
Sbjct: 138 QTNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVGHYLSAAAKMWAS 197

Query: 180 THNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK----- 234
           THN T+  KM+ VV +L +CQ   GTGYLSAFP E FD FEA++PVWAPYYTIHK     
Sbjct: 198 THNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKARNAT 257

Query: 235 ---------------------ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMY 273
                                I+ GLLDQ+ +A N +AL M   M +YF  RV+ VI  Y
Sbjct: 258 QSICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGKALGMVVAMADYFAGRVRSVIQRY 317

Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
           ++ERHW SLNEETGGMNDVLY+L +     +       F + CFLG LA+QAD LS FHA
Sbjct: 318 TIERHWTSLNEETGGMNDVLYQLKT-----EAFGAGSSFRQACFLGLLAVQADSLSGFHA 372

Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
           NTHIP+VIG QMRYEVTGDPLYK I TFFMDIVN+SHSYATGGTS  EFW +PK LA+ L
Sbjct: 373 NTHIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVSEFWSNPKHLAEAL 432

Query: 394 GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG 453
            +E EE+CTTYNMLKVSRHLFRWTKEIAYADYYERAL NGVLSIQRG +PGVMIYMLP G
Sbjct: 433 TTETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMIYMLPQG 492

Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
            G SKA S HGWGT++NSFWCCYGTGIESFSKLGDSIYFE++G+ PGLYIIQYI S+F+W
Sbjct: 493 PGRSKAVSYHGWGTQYNSFWCCYGTGIESFSKLGDSIYFEQKGDKPGLYIIQYIPSTFNW 552

Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
           ++  + + Q+V P+ S D YL+++L+ S+ +  GQ ++LN+R+P WT  NGA+A+LN ++
Sbjct: 553 RTAGLTVTQQVKPLSSSDQYLQVSLSISAAKTNGQYATLNVRIPSWTSMNGAKATLNDKD 612

Query: 574 LPLPPPGNFLSATERW-SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
           L L  PG FL+ +++W S +D L +Q P++LRTEAI+DDRP+ AS+ AILFGP+LLAG T
Sbjct: 613 LQLASPGTFLTISKQWDSGDDHLLLQFPINLRTEAIKDDRPQVASLNAILFGPFLLAGLT 672

Query: 633 SGEWD-IKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQ-SITMEEFP-- 688
           +G+WD    G A + S  I+P+P S+N+QLVT TQESG  T ++S  N  S+ M E P  
Sbjct: 673 TGDWDAKTGGAATAASDWITPVPASYNSQLVTLTQESGGKTMLLSTVNDTSLAMLERPEG 732

Query: 689 VSGTDAALHATFRLI--------LKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKE 740
             GTDAA+ ATFR++         + A          +   +  +EPF  PG  V  G  
Sbjct: 733 AGGTDAAVRATFRVVPPGSRAELRQRAGAGAGEGAARLKVAAATIEPFGLPGTAVSNG-- 790

Query: 741 DELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCST 800
             L V  +     S+ F +V GLD +  +VSLE  ++ GCF+ +G     GA + + C T
Sbjct: 791 --LAVVRAGNS-SSTLFNVVPGLDGKPGSVSLELGSKPGCFLVAGA----GAKVHVGCRT 843

Query: 801 E-----SLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
                 +  AGF +AASF     +  YH ISF A G RR+FLL PL + RDE YT+YFN+
Sbjct: 844 RGGAAAAAAAGFEQAASFAQAEPLRRYHAISFFASGVRRSFLLEPLFTLRDEFYTIYFNL 903


>gi|255544804|ref|XP_002513463.1| conserved hypothetical protein [Ricinus communis]
 gi|223547371|gb|EEF48866.1| conserved hypothetical protein [Ricinus communis]
          Length = 759

 Score =  850 bits (2195), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 412/625 (65%), Positives = 491/625 (78%), Gaps = 36/625 (5%)

Query: 233 HKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDV 292
           H +LAGLLDQY+ ADNAQALKM  WMVEYFYNRVQ VIT YSVERH+ SLNEETGGMNDV
Sbjct: 169 HFVLAGLLDQYIFADNAQALKMVNWMVEYFYNRVQNVITKYSVERHFLSLNEETGGMNDV 228

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
           LY+L+SIT +PKHL+LAHLFDKPCFLG LA+Q                            
Sbjct: 229 LYKLFSITGEPKHLVLAHLFDKPCFLGLLAVQE--------------------------- 261

Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
                IGTFFMDIVN+SH+YATGGTS  EFW DPKRLA TL  + EE+CTTYNMLKVSRH
Sbjct: 262 -----IGTFFMDIVNSSHTYATGGTSDYEFWSDPKRLASTLNDQTEESCTTYNMLKVSRH 316

Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
           LFRWTKE+AYADYYERALTNGVL IQRGTEPGVMIY+LP   G SKAR+ H WGT  +SF
Sbjct: 317 LFRWTKEMAYADYYERALTNGVLGIQRGTEPGVMIYLLPQNPGGSKARTIHKWGTPDDSF 376

Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
           WCCYGTGIESFSKLGDSIYFEE   +PGLY+IQYISSS DWK G +VLNQKVDPI SWDP
Sbjct: 377 WCCYGTGIESFSKLGDSIYFEEGSQIPGLYVIQYISSSLDWKLGQIVLNQKVDPIFSWDP 436

Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYN 592
           +LR+T TF   Q   Q S+LNLR+P+WT+S+  +A++N Q+LP+PPPGNFLS T  WS +
Sbjct: 437 FLRVTFTFD--QGASQSSTLNLRIPIWTHSDDVKATINAQSLPVPPPGNFLSVTGSWSSS 494

Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
           DKL +QLP+ LRTEAI+DDRPEYASIQAILFGPYLLAGH+SG+WD+K+ +A+SLS  I+ 
Sbjct: 495 DKLFLQLPIILRTEAIKDDRPEYASIQAILFGPYLLAGHSSGDWDLKSESAKSLSDWITA 554

Query: 653 IPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNF 712
           IP ++N+ LV+F+Q+SG+S F ++NSNQS+TME FP  GTD ++HATFRLIL D+S S  
Sbjct: 555 IPATYNSHLVSFSQDSGDSVFALTNSNQSLTMEIFPQPGTDDSVHATFRLILNDSSSSEL 614

Query: 713 SSLNNVIGKSVMLEPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVS 771
           ++  + +GK VMLEPF+ PGM LVQQGKE  L V  +    GSS FRLV+GLD ++ +VS
Sbjct: 615 ANFEDAVGKLVMLEPFNLPGMLLVQQGKEVSLAVGYTDGSDGSSLFRLVSGLDGKDGSVS 674

Query: 772 LEAENRKGCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKG 831
           LE+ + + CFV SGV+++ G +LKL C   S +  FN+ ASFM+  GIS YHPISFVAKG
Sbjct: 675 LESVSNENCFVFSGVDYKSGTALKLSCKKSS-ETKFNQGASFMVNKGISHYHPISFVAKG 733

Query: 832 ARRNFLLAPLLSFRDEAYTVYFNIQ 856
           A+RNFLL+PL SFRDE+YT+YFNIQ
Sbjct: 734 AKRNFLLSPLFSFRDESYTIYFNIQ 758



 Score =  196 bits (499), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 100/172 (58%), Positives = 123/172 (71%), Gaps = 12/172 (6%)

Query: 4   GFVLFFFFCF-------GLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLT 54
           GFV+F            G  + K+CTN  +   SH FRY L +S N++ K+E+ +H+HLT
Sbjct: 3   GFVVFELLVLVAASVLCGFGMSKECTNIPTQLSSHTFRYALLSSNNESLKQEMFAHYHLT 62

Query: 55  PTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSV 114
           PTDDS WSSL+P K+L  ++DE  WA++Y+K+K+P      GNFLKEVSLH+V LD  S 
Sbjct: 63  PTDDSVWSSLLPRKML-KEEDEFDWAMMYKKLKSP--LQSSGNFLKEVSLHNVRLDLGSF 119

Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
            WRAQQTNLEYLLML++D LVWSFRKTA LPTPG AYGGWE P  ELRGHFV
Sbjct: 120 HWRAQQTNLEYLLMLNLDRLVWSFRKTAGLPTPGTAYGGWEAPNVELRGHFV 171


>gi|357472921|ref|XP_003606745.1| hypothetical protein MTR_4g065150 [Medicago truncatula]
 gi|355507800|gb|AES88942.1| hypothetical protein MTR_4g065150 [Medicago truncatula]
          Length = 617

 Score =  834 bits (2154), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 400/606 (66%), Positives = 488/606 (80%), Gaps = 16/606 (2%)

Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
           M TWMV+YFY+RV  VI+ Y+V RH+ SLNEETGGMNDVLY+LYS+T D KHLLLAHLFD
Sbjct: 1   MVTWMVDYFYDRVVNVISKYTVNRHYQSLNEETGGMNDVLYKLYSVTGDSKHLLLAHLFD 60

Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYA 373
           KPCFLG LA+QA+ ++ FHANTHIPIV+GSQMRYEVTGDPLY+ IG+FFMDIVN+SHSYA
Sbjct: 61  KPCFLGLLAVQANDIADFHANTHIPIVVGSQMRYEVTGDPLYREIGSFFMDIVNSSHSYA 120

Query: 374 TGGTSAREFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
           TGGTS REFW +PKR+AD LG+ ENEE+CTTYNMLKVSRHLFRWTKE+ YADYYERALTN
Sbjct: 121 TGGTSVREFWSNPKRIADNLGTTENEESCTTYNMLKVSRHLFRWTKEVTYADYYERALTN 180

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
           GVL IQRGT+PGVMIYMLPLG GVSKA++ H WG  F++FWCCYGTGIESFSKLGDSIYF
Sbjct: 181 GVLGIQRGTDPGVMIYMLPLGIGVSKAKTGHSWGNPFDTFWCCYGTGIESFSKLGDSIYF 240

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
           EEEGN P LYIIQYISSSF+WKSG  +L Q V P  S DPYLR+T TFSS ++ G  S+L
Sbjct: 241 EEEGNSPSLYIIQYISSSFNWKSGKTLLTQTVVPAASSDPYLRVTFTFSSNEKTGTSSTL 300

Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
           N R+P W++++GA+A LN + L LP PGNFLS T +WS  DKLT+QLPL +RTEAI+DDR
Sbjct: 301 NFRVPSWSHADGAKAILNSEALSLPAPGNFLSITRQWSAGDKLTLQLPLIIRTEAIKDDR 360

Query: 613 PEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNST 672
           PEYAS+QAIL+GPYLLAGHT+  WDIK  T ++++  I+PIP S+N+QLV+F+Q+   ST
Sbjct: 361 PEYASVQAILYGPYLLAGHTTRNWDIKADTNKAVADWITPIPSSYNSQLVSFSQDFDQST 420

Query: 673 FVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPG 732
           FV++NSNQS+TM++ P  GTD AL ATFRLILK A           + K+VMLEP D PG
Sbjct: 421 FVITNSNQSLTMQKSPEPGTDVALQATFRLILKGA-----------VSKTVMLEPIDLPG 469

Query: 733 MLVQQGKEDE-LVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPG 791
           M+V   + D+ L+V +S     SS F +V GLD RN+T+SL++++ K C+V S  +   G
Sbjct: 470 MIVSHQEPDQPLIVVDSSLGGPSSVFLVVPGLDGRNQTISLQSQSNKDCYVYS--DMSSG 527

Query: 792 ASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTV 851
           + +KL C ++S +A FN+AASF+   G+ +YHPISFVAKG  +NFLL PL +FRDE YTV
Sbjct: 528 SGVKLRCKSDS-EASFNQAASFVSGKGLRQYHPISFVAKGGNQNFLLEPLFNFRDEHYTV 586

Query: 852 YFNIQD 857
           YFNIQ+
Sbjct: 587 YFNIQE 592


>gi|242096364|ref|XP_002438672.1| hypothetical protein SORBIDRAFT_10g024080 [Sorghum bicolor]
 gi|241916895|gb|EER90039.1| hypothetical protein SORBIDRAFT_10g024080 [Sorghum bicolor]
          Length = 933

 Score =  833 bits (2151), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/879 (49%), Positives = 571/879 (64%), Gaps = 82/879 (9%)

Query: 52  HLTPTDDSAWSSLIPSKILGDQKD------EVSWALLYRKIKNPGGFD--------LPGN 97
           HLTPT+++ W +L+P ++ G          E  W  LYR +   GG D         PG 
Sbjct: 55  HLTPTEEATWMALLPRRLRGGGGGGARARAEFDWLALYRSLTRGGGPDDDADAGKPGPGE 114

Query: 98  FLKEVSLHDVWL----------------DQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKT 141
            L   SLHDV L                  +++ W+AQQTNLEYLL LD D L W+FR+ 
Sbjct: 115 LLTPASLHDVRLHGDDDDDDRVLTGSSSSSAAMYWQAQQTNLEYLLYLDPDRLTWTFRRQ 174

Query: 142 ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN 201
           A LPT G  YGGWE P  +LRGHF GHYLSASA MWA+THN+T++E+M+ VV  L +CQ 
Sbjct: 175 AGLPTVGDPYGGWEAPGGQLRGHFTGHYLSASAHMWAATHNSTLRERMTRVVDILYDCQK 234

Query: 202 KIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEY 261
           K+GTGYL+A+P  +FD +E L   W+PYYTIHKI+ GLLDQY+LA N + L +  WM +Y
Sbjct: 235 KMGTGYLAAYPETMFDLYEQLDEAWSPYYTIHKIMQGLLDQYMLASNKKGLDVVVWMTDY 294

Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
           F NRV+ +I  Y+++RHW ++NEETGG NDV+Y+LY+IT + KHL +AHLFDKPCFLG L
Sbjct: 295 FSNRVKNLIQKYTIQRHWEAMNEETGGFNDVMYQLYTITKNQKHLTMAHLFDKPCFLGPL 354

Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
            L  D +S  H NTH+P++IG+Q RYEV GD LYK I T+  D+VN+SH++ATGGTS  E
Sbjct: 355 GLHKDDISGLHVNTHLPVIIGTQKRYEVVGDHLYKDISTYLFDVVNSSHTFATGGTSTME 414

Query: 382 FWWDPKRLADTLG-SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRG 440
            W DPKRL D +  S NEETC TYN LKVSR+LFRWTKE  YAD+YER L NG++  QRG
Sbjct: 415 HWHDPKRLVDEIKISSNEETCATYNFLKVSRNLFRWTKEAKYADHYERLLINGIMGNQRG 474

Query: 441 TEPGVMIYMLPLGRGVSKA-----------RSTHGWGTKFNSFWCCYGTGIESFSKLGDS 489
           T+PGVM+Y LP+G G SK+           ++  GWG   ++FWCCYGTGIESFSKLGDS
Sbjct: 475 TQPGVMLYFLPMGPGRSKSVSGLSPSGLPPKNPGGWGGPNDTFWCCYGTGIESFSKLGDS 534

Query: 490 IYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL 549
           IYF EEG  PGLYIIQYI S+FDWK+  + +NQ+  P++S DP+ +++LTFS+K +  QL
Sbjct: 535 IYFLEEGEAPGLYIIQYIPSTFDWKATGLTVNQQAKPLLSTDPFFKVSLTFSAKGDA-QL 593

Query: 550 SSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN-----FLSATERWSYNDKLTIQLPLSLR 604
           + +++R+P WT ++G  A+LNGQ L L   GN     FL+ T+ W+  D LT+Q P++LR
Sbjct: 594 AKVSVRIPSWTSTDGTTATLNGQKLNLTSTGNSTNGGFLTVTKLWA-EDTLTLQFPITLR 652

Query: 605 TEAIQDDRPEYASIQAILFGPYLLAGHTSGE-----------------WDIKTGTARSLS 647
           TEAI+DDRPEYASIQA+LFGP+LLAG T G+                 W++   +A +++
Sbjct: 653 TEAIKDDRPEYASIQAVLFGPHLLAGLTHGKLPVTDSNHSNDGLTPSIWEVNATSATAVT 712

Query: 648 ALISPIPP-SFNAQLVTFTQESGNSTFVMSNS--NQSITMEEFPVSGTDAALHATFRLIL 704
             ++P+P  + N+QLVT TQ +G  T V+S S  +  + M+E P  GTDA +HATFR + 
Sbjct: 713 DWVTPLPSETLNSQLVTLTQTAGGRTLVLSVSIADAKLEMQEQPAPGTDACVHATFR-VY 771

Query: 705 KDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLD 764
             A  S+  SL  + G +V +EPFD PGM V  G    L+    P     + F  V GLD
Sbjct: 772 GQAGSSSSESLLPMQGPNVTIEPFDRPGMAVTNG----LLAVGRPAGGRDTLFNAVPGLD 827

Query: 765 KRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--------STESLDAGFNRAASFMME 816
               +VSLE   R GCFV++       A+ +++C        S     A   RAASF+  
Sbjct: 828 GAPGSVSLELATRPGCFVATAPAAGANAATQVVCRGNKNNGGSASGDGAALRRAASFVRA 887

Query: 817 IGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
             +  Y+P+SF A+G  RNFLL PL S +DE YTVYF++
Sbjct: 888 APLRRYNPLSFAARGTARNFLLEPLRSLQDEFYTVYFSL 926


>gi|125556053|gb|EAZ01659.1| hypothetical protein OsI_23694 [Oryza sativa Indica Group]
          Length = 898

 Score =  833 bits (2151), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/853 (50%), Positives = 560/853 (65%), Gaps = 62/853 (7%)

Query: 52  HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQ 111
           HL   +++ W  L+P +     +DE+ W  LYR I   GG + P  FL   SLHDV +D 
Sbjct: 56  HLNQAEEATWMGLLPRR--AGPRDELDWLALYRSITRGGGGE-PAGFLSPASLHDVRVDP 112

Query: 112 --SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHY 169
             +++ W+ QQTNLEYLL LD D L W+FR+ A LP  G+ YGGWE P  +LRGHF GHY
Sbjct: 113 YGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPIVGEPYGGWEAPDGQLRGHFTGHY 172

Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPY 229
           LSA+A MWASTHN  ++EKM+ VV  L  CQ K+ TGYLSA+P  +FD+++ L   W+PY
Sbjct: 173 LSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDAYDELAEAWSPY 232

Query: 230 YTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGM 289
           YTIHKI+ GLLDQY LA N + L++  WM +YF  RV+K+I  YS++RHW ++NEETGG 
Sbjct: 233 YTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSIQRHWEAINEETGGF 292

Query: 290 NDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEV 349
           NDV+Y+LY+IT + KHL +AHLFDKPCFLG L L  D +S  H NTH+P+++G+Q RYEV
Sbjct: 293 NDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNTHVPVIVGAQKRYEV 352

Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG-SENEETCTTYNMLK 408
            GD LYK I TFF D+VN+SH++ATGGTS  E W DPKRL D +  S NEETC TYN+LK
Sbjct: 353 VGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWHDPKRLVDEIKISSNEETCATYNLLK 412

Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA--------- 459
           VSR+LFRWTKE  Y D+YER L NG++  QRG EPGVMIY LP+G G SK+         
Sbjct: 413 VSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGPGRSKSISGMPTSGL 472

Query: 460 --RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
             ++  GWG    +FWCCYGTGIESFSKLGDSIYF EEG +PGLYIIQYI S+FDWK+  
Sbjct: 473 PPKNPGGWGNANATFWCCYGTGIESFSKLGDSIYFLEEGEIPGLYIIQYIPSTFDWKAAG 532

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
           + + Q+  P+ S D +  +++  SSK +  + +++N+R+P WT  +GA A+LNGQ L L 
Sbjct: 533 LTVKQQAKPLSSTDSHFEVSIFISSKGDA-RPANVNVRIPSWTSVDGAIATLNGQKLNLT 591

Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWD 637
             G+FLS T+ W  +D L+++ P++LRTE I+DDRPEY+SIQA+LFGP+LLAG T G   
Sbjct: 592 SAGDFLSVTKLWG-DDTLSLKFPITLRTEPIKDDRPEYSSIQAVLFGPHLLAGLTHGNQT 650

Query: 638 IKTGTARSLSAL-------------------ISPIPPSFNAQLVTFTQESGN----STFV 674
           +KT +  S S L                   ++P+  S N+QLVT TQ  G+    + FV
Sbjct: 651 VKT-SNDSNSGLTPGVWEVNATHAAAAVAGWVTPVSQSLNSQLVTLTQRDGDAQAAAAFV 709

Query: 675 MSNS--NQSITMEEFPVSGTDAALHATFRLILKDASLSNF-SSLNNVIGKSVMLEPFDFP 731
           +S S  + ++TM+E PV+G+DA +HATFR     +  S   ++   + G++V LEPFD P
Sbjct: 710 LSVSIADGALTMQESPVAGSDACVHATFRAYHSPSGASAIDAATGRLQGRNVALEPFDRP 769

Query: 732 GMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVN-FEP 790
           GM V     D L V    +   ++ F  VAGLD    TVSLE   R GCFV++    +  
Sbjct: 770 GMAVT----DALSVG---RPGPATRFNAVAGLDGLPGTVSLELATRPGCFVAAPTTAYLA 822

Query: 791 GASLKLLCSTESL--------DAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLL 842
           GA  ++ C   +         D  F RAASF     +  YHP+SF A G  RNFLL PL 
Sbjct: 823 GAKAQVSCRKPTAAGGGEDDDDTAFRRAASFTQAAPLRLYHPLSFSATGTDRNFLLEPLQ 882

Query: 843 SFRDEAYTVYFNI 855
           S +DE YTVYFN+
Sbjct: 883 SLQDEFYTVYFNV 895


>gi|326520888|dbj|BAJ92807.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 683

 Score =  830 bits (2145), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 415/694 (59%), Positives = 507/694 (73%), Gaps = 27/694 (3%)

Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKI---GTGYLSAFPTELFDSFEALKPVWAPYYTI 232
           MWASTHN T+  KMS VV +L  CQ      G GYLSAFP E FD FEA+KPVWAPYYTI
Sbjct: 1   MWASTHNGTLAGKMSAVVDALHACQQAPANGGAGYLSAFPAEFFDRFEAIKPVWAPYYTI 60

Query: 233 HKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDV 292
           HKI+ GLLDQY +A N +AL M   M  YF  RV+ VI  +S+ERHW SLNEETGGMNDV
Sbjct: 61  HKIMQGLLDQYTVAGNGKALAMVVAMAGYFGERVRSVIQRHSIERHWTSLNEETGGMNDV 120

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
           LY+LY+IT+D +HL+LAHLFDKPCFLG LA+QAD LS FHANTHIPIV+G QMRYEVTGD
Sbjct: 121 LYQLYAITNDQRHLVLAHLFDKPCFLGLLAVQADSLSDFHANTHIPIVVGGQMRYEVTGD 180

Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
           PLYK I TFFM++VN+SHSYATGGTS  EFW+DPKRLA+TL +ENEE+CTTYNMLKVSRH
Sbjct: 181 PLYKEIATFFMNVVNSSHSYATGGTSVSEFWFDPKRLAETLTTENEESCTTYNMLKVSRH 240

Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
           LFRWTKEIAYADYYERAL NGV SIQRG +PGVMIYMLP G G SKA S HGWGT+++SF
Sbjct: 241 LFRWTKEIAYADYYERALINGVQSIQRGRDPGVMIYMLPQGPGRSKALSYHGWGTQYDSF 300

Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
           WCCYGTGIESFSKLGDSIYFEE+G  P LY++QYI S+F+W+S  + + Q + P+ S D 
Sbjct: 301 WCCYGTGIESFSKLGDSIYFEEKGGKPALYLVQYIPSTFNWRSVGLTVTQTLKPLSSSDQ 360

Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYN 592
            L+++L+ S+K   GQ +++N+R+P W  SNGA+A+LNG++L +  PG FLS T++W   
Sbjct: 361 NLQVSLSISAKTN-GQYATVNVRIPSWASSNGAKATLNGKDLTMASPGTFLSVTKQWGGG 419

Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
           D L +QLP+ LRTEAI+DDRPEYAS+QA+LFGP+LLAG T+G+WD KTG   ++S  I+ 
Sbjct: 420 DHLALQLPIRLRTEAIKDDRPEYASLQAVLFGPFLLAGLTTGDWDAKTGGG-AISEWITA 478

Query: 653 IPPSFNAQLVTFTQESGNSTFVMS----NSNQSITMEEFPV-SGTDAALHATFRLILKDA 707
           IP ++N+QLVT TQESGNST V+S        S+TM+  P   GTDAA+HATFRL+ +  
Sbjct: 479 IPATYNSQLVTLTQESGNSTLVLSLLSTAKATSLTMQPRPEGGGTDAAVHATFRLVTQGQ 538

Query: 708 SLSNFSSLNNVIG-----KSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAG 762
                    +         S ++EPFD PGM V         ++ S ++  SS F +V G
Sbjct: 539 GTPPMGERRHATNATAALASAVIEPFDMPGMAVTNS------LTLSAEKGPSSLFNVVPG 592

Query: 763 LDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTESLDAGFNR-AASFMMEIGISE 821
           LD +  +VSLE   R GCF+ +      GA   +         GF+R AASF     +  
Sbjct: 593 LDGQPGSVSLELGARPGCFLVTA-----GAKANVQVGCGGGGTGFSRQAASFARAEPLRR 647

Query: 822 YHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
           YHPISF AKGARR+FLL PL + RDE YTVYFN+
Sbjct: 648 YHPISFAAKGARRSFLLEPLFTLRDEFYTVYFNL 681


>gi|125597849|gb|EAZ37629.1| hypothetical protein OsJ_21963 [Oryza sativa Japonica Group]
          Length = 902

 Score =  827 bits (2135), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 429/856 (50%), Positives = 558/856 (65%), Gaps = 65/856 (7%)

Query: 52  HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDL---PGNFLKEVSLHDVW 108
           HL   +++ W  L+P +     +DE+ W  LYR I   GG D+   P  FL   SLHDV 
Sbjct: 57  HLNQAEEATWMGLLPRR--AGPRDELDWLALYRSITRGGG-DVGGEPAGFLSPASLHDVR 113

Query: 109 LDQ--SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
           +D   +++ W+ QQTNLEYLL LD D L W+FR+ A LPT G+ YGGWE P  +LRGHF 
Sbjct: 114 VDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWEAPDGQLRGHFT 173

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVW 226
           GHYLSA+A MWASTHN  ++EKM+ VV  L  CQ K+ TGYLSA+P  +FD+++ L   W
Sbjct: 174 GHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDAYDELAEAW 233

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
           +PYYTIHKI+ GLLDQY LA N + L++  WM +YF  RV+K+I  YS++RHW ++NEET
Sbjct: 234 SPYYTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSIQRHWEAINEET 293

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GG NDV+Y+LY+IT + KHL +AHLFDKPCFLG L L  D +S  H NTH+P+++G+Q R
Sbjct: 294 GGFNDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNTHVPVIVGAQKR 353

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG-SENEETCTTYN 405
           YEV GD LYK I TFF D+VN+SH++ATGGTS  E W DPKRL D +  S NEETC TYN
Sbjct: 354 YEVVGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWHDPKRLVDEIKISSNEETCATYN 413

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA------ 459
           +LKVSR+LFRWTKE  Y D+YER L NG++  QRG EPGVMIY LP+G G SK+      
Sbjct: 414 LLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGPGRSKSISGMPT 473

Query: 460 -----RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
                ++  GWG    +FWCCYGTGIESFSKLGDSIYF EEG +PGLYIIQYI S+FDWK
Sbjct: 474 SGLPPKNPGGWGNANATFWCCYGTGIESFSKLGDSIYFLEEGEIPGLYIIQYIPSTFDWK 533

Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
           +  + + Q+  P+ S D +  +++  SSK +  + +++N+R+P WT  +GA A+LNGQ L
Sbjct: 534 AAGLTVKQQAKPLSSTDSHFEVSIFISSKGDA-RPANVNVRIPSWTSVDGAIATLNGQKL 592

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
            L   G+FLS T+ W  +D L+++ P++LRTE I+DDRPEY+SIQA+LFGP+LLAG T G
Sbjct: 593 NLTSAGDFLSVTKLWG-DDTLSLKFPITLRTEPIKDDRPEYSSIQAVLFGPHLLAGLTHG 651

Query: 635 EWDIKTGTARSLSALISPI-------------------PPSFNAQLVTFTQESGN----S 671
              +KT +  S S L   +                     S N+QLVT TQ  G+    +
Sbjct: 652 NQTVKT-SNDSNSGLTPGVWEVNATHAAAAVAVWVTPVSQSLNSQLVTLTQRDGDAQAAA 710

Query: 672 TFVMSNS--NQSITMEEFPVSGTDAALHATFRLILKDASLSNF-SSLNNVIGKSVMLEPF 728
            FV+S S  + ++TM+E PV+G+DA +HATFR     +  S   ++   + G+ V LEPF
Sbjct: 711 AFVLSVSIADGALTMQESPVAGSDACVHATFRAYQSPSGASAIDAATGRLQGRDVALEPF 770

Query: 729 DFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVN- 787
           D PGM V     D L V    +   ++ F  VAGLD    TVSLE   R GCFV++    
Sbjct: 771 DRPGMAVT----DALSVG---RPGPATRFNAVAGLDGLPGTVSLELATRPGCFVAAPTTA 823

Query: 788 FEPGASLKLLCSTESL--------DAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLA 839
           +  GA  ++ C   +         D  F RAASF     +  YHP+SF A G  RNFLL 
Sbjct: 824 YLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASFTQAAPLRLYHPLSFSATGTDRNFLLE 883

Query: 840 PLLSFRDEAYTVYFNI 855
           PL S +DE YTVYFN+
Sbjct: 884 PLQSLQDEFYTVYFNV 899


>gi|51090918|dbj|BAD35523.1| unknown protein [Oryza sativa Japonica Group]
 gi|51090952|dbj|BAD35555.1| unknown protein [Oryza sativa Japonica Group]
          Length = 902

 Score =  827 bits (2135), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 429/856 (50%), Positives = 558/856 (65%), Gaps = 65/856 (7%)

Query: 52  HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDL---PGNFLKEVSLHDVW 108
           HL   +++ W  L+P +     +DE+ W  LYR I   GG D+   P  FL   SLHDV 
Sbjct: 57  HLNQAEEATWMGLLPRR--AGPRDELDWLALYRSITRGGG-DVGGEPAGFLSPASLHDVR 113

Query: 109 LDQ--SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
           +D   +++ W+ QQTNLEYLL LD D L W+FR+ A LPT G+ YGGWE P  +LRGHF 
Sbjct: 114 VDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWEAPDGQLRGHFT 173

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVW 226
           GHYLSA+A MWASTHN  ++EKM+ VV  L  CQ K+ TGYLSA+P  +FD+++ L   W
Sbjct: 174 GHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDAYDELAEAW 233

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
           +PYYTIHKI+ GLLDQY LA N + L++  WM +YF  RV+K+I  YS++RHW ++NEET
Sbjct: 234 SPYYTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSIQRHWEAINEET 293

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GG NDV+Y+LY+IT + KHL +AHLFDKPCFLG L L  D +S  H NTH+P+++G+Q R
Sbjct: 294 GGFNDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNTHVPVIVGAQKR 353

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG-SENEETCTTYN 405
           YEV GD LYK I TFF D+VN+SH++ATGGTS  E W DPKRL D +  S NEETC TYN
Sbjct: 354 YEVVGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWHDPKRLVDEIKISSNEETCATYN 413

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA------ 459
           +LKVSR+LFRWTKE  Y D+YER L NG++  QRG EPGVMIY LP+G G SK+      
Sbjct: 414 LLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGPGRSKSISGMPT 473

Query: 460 -----RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
                ++  GWG    +FWCCYGTGIESFSKLGDSIYF EEG +PGLYIIQYI S+FDWK
Sbjct: 474 SGLPPKNPGGWGNANATFWCCYGTGIESFSKLGDSIYFLEEGEIPGLYIIQYIPSTFDWK 533

Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
           +  + + Q+  P+ S D +  +++  SSK +  + +++N+R+P WT  +GA A+LNGQ L
Sbjct: 534 AAGLTVKQQAKPLSSTDSHFEVSIFISSKGDA-RPANVNVRIPSWTSVDGAIATLNGQKL 592

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
            L   G+FLS T+ W  +D L+++ P++LRTE I+DDRPEY+SIQA+LFGP+LLAG T G
Sbjct: 593 NLTSAGDFLSVTKLWG-DDTLSLKFPITLRTEPIKDDRPEYSSIQAVLFGPHLLAGLTHG 651

Query: 635 EWDIKTGTARSLSALISPI-------------------PPSFNAQLVTFTQESGN----S 671
              +KT +  S S L   +                     S N+QLVT TQ  G+    +
Sbjct: 652 NQTVKT-SNDSNSGLTPGVWEVNATHAAAAVAVWVTPVSQSLNSQLVTLTQRDGDAQAAA 710

Query: 672 TFVMSNS--NQSITMEEFPVSGTDAALHATFRLILKDASLSNF-SSLNNVIGKSVMLEPF 728
            FV+S S  + ++TM+E PV+G+DA +HATFR     +  S   ++   + G+ V LEPF
Sbjct: 711 AFVLSVSIADGALTMQESPVAGSDACVHATFRAYHSPSGASAIDAATGRLQGRDVALEPF 770

Query: 729 DFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVN- 787
           D PGM V     D L V    +   ++ F  VAGLD    TVSLE   R GCFV++    
Sbjct: 771 DRPGMAVT----DALSVG---RPGPATRFNAVAGLDGLPGTVSLELATRPGCFVAAPTTA 823

Query: 788 FEPGASLKLLCSTESL--------DAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLA 839
           +  GA  ++ C   +         D  F RAASF     +  YHP+SF A G  RNFLL 
Sbjct: 824 YLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASFTQAAPLRLYHPLSFSATGTDRNFLLE 883

Query: 840 PLLSFRDEAYTVYFNI 855
           PL S +DE YTVYFN+
Sbjct: 884 PLQSLQDEFYTVYFNV 899


>gi|168021740|ref|XP_001763399.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685534|gb|EDQ71929.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 757

 Score =  822 bits (2124), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 414/767 (53%), Positives = 539/767 (70%), Gaps = 20/767 (2%)

Query: 98  FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
            LK+VSLH V L   S  + AQ TNL+YLL LDVD+++WSFRK ++L  PG+ YGGWE+P
Sbjct: 1   LLKDVSLHKVRLGADSPQFMAQNTNLQYLLELDVDNMMWSFRKVSNLNAPGQPYGGWESP 60

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
            SELRGHFVGHYLSASA MWASTHN  + EKM+ ++ +L ECQ  IGTGYLSAFP+E FD
Sbjct: 61  ASELRGHFVGHYLSASALMWASTHNEVLHEKMNALLGALKECQMSIGTGYLSAFPSEFFD 120

Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
            FEA++ VWAPYYTIHKI+AGLLDQY+LA +  AL M   M  YFY RV+ VI  +++ER
Sbjct: 121 RFEAIEYVWAPYYTIHKIMAGLLDQYLLAGSKDALDMVVEMANYFYKRVKTVIEKFTIER 180

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
           HW SLNEETGGMNDVLYRLY++T D KHL LAHLFDKPCFLG LALQAD+LS FH+NTHI
Sbjct: 181 HWRSLNEETGGMNDVLYRLYTVTGDNKHLELAHLFDKPCFLGPLALQADHLSGFHSNTHI 240

Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
           PIV+G+QMRYEVT D +Y+ I  +FM IVN+SHSYATGGTS  EFW D  R  DTL +EN
Sbjct: 241 PIVVGAQMRYEVTSDLIYRSIAEYFMGIVNSSHSYATGGTSVSEFWTDSMRQGDTLHTEN 300

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
           +ETCTTYNMLK++R LFRWTK+I Y DYY+RAL NG+L  QRG +PGVMIYMLP+G GVS
Sbjct: 301 QETCTTYNMLKIARTLFRWTKDIKYMDYYDRALINGILGTQRGQQPGVMIYMLPMGPGVS 360

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
           K RS HGWG KFNSFWCCYGT IESF+KLGDSIYFE++G +P +Y+ Q++SS F W S  
Sbjct: 361 KGRSYHGWGNKFNSFWCCYGTAIESFAKLGDSIYFEDDGEIPSVYVAQFVSSDFVWDSAG 420

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEV--GQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
           +VL+Q + P+ +    L +T +FS    V   Q + +++R+P W    G +A LNGQ + 
Sbjct: 421 LVLHQSLKPLNAEQSILEVTFSFSHATIVRASQDAVIHVRLPSWV--RGCRAHLNGQEIE 478

Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
              PG FLS    WS +D+L + LP+SL  E IQDDR +Y+++ AI++GP+++AG ++G+
Sbjct: 479 SLIPGKFLSIARAWSSDDELVLLLPMSLGLEKIQDDRAQYSALHAIMYGPFVMAGLSTGD 538

Query: 636 WDIKTGTARSLSALISPIPPSFNAQLVTFTQ-----ESGNSTFVMSNSNQSITMEEFPVS 690
           W  K G   +L+  + P+P ++++QL TF+Q     E   S ++  N+  +I M   P  
Sbjct: 539 W--KLGHKENLTQWVYPVPAAYHSQLSTFSQFHVNGEYSGSLYLACNNGTAI-MRYAPED 595

Query: 691 GTDAALHATFRLILKDASLSNFSSLNNVIGKS-VMLEPFDFPGMLVQQGKEDELVVSESP 749
           GTD    +TFR+        N+S L+    K  V LE F  PG+ +Q   ED+ + +  P
Sbjct: 596 GTDECGLSTFRV---SDPFGNYSQLSAGDDKRLVSLELFSQPGIFLQHNGEDKPISTGPP 652

Query: 750 KEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLL-CSTESLDAGFN 808
                S F  + GL  ++ TVS EA ++ GCF+SS  +         L C T   D   N
Sbjct: 653 SW---SVFFYLPGLTGKSGTVSFEAVDKPGCFLSSSFSGSSVLGGVFLRCKTSRNDNTLN 709

Query: 809 RAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
             ++F +++G++ YHP+SF+A+G  RNFLLAPL S RDE+YT+YF++
Sbjct: 710 AFSTFDVQMGVAAYHPVSFIAEGQHRNFLLAPLNSLRDESYTIYFDM 756


>gi|293331149|ref|NP_001170532.1| uncharacterized protein LOC100384546 precursor [Zea mays]
 gi|238005884|gb|ACR33977.1| unknown [Zea mays]
 gi|413954824|gb|AFW87473.1| hypothetical protein ZEAMMB73_711416 [Zea mays]
          Length = 902

 Score =  820 bits (2117), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/853 (50%), Positives = 555/853 (65%), Gaps = 62/853 (7%)

Query: 52  HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDL-------PGNFLKEVSL 104
           HLTPT+++ W SL+P ++ G  + E  W  LYR +    G D        P   L   SL
Sbjct: 57  HLTPTEEATWMSLLPRRLRGGGRAEFDWLALYRSLTRGDGPDGGAGKAAGPEGLLSPASL 116

Query: 105 HDVWLDQ----SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISE 160
           HDV L      SS+ WRAQQTNLEYLL LD D L W+FR+ A LPT G  YGGWE P  +
Sbjct: 117 HDVRLHGDGSLSSMYWRAQQTNLEYLLYLDPDRLTWTFRQQAGLPTVGDPYGGWEAPDGQ 176

Query: 161 LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFE 220
           LRGHFVGHYLSASA  WA+THN T++E+M+ VV  L  CQ K+GTGYLSA+P  +FD +E
Sbjct: 177 LRGHFVGHYLSASAHAWAATHNGTLRERMARVVDILHACQKKMGTGYLSAYPETMFDLYE 236

Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWY 280
            L   W+PYYT HKI+ GLLDQY LA N + L +   M +YF NRV+ ++ +++++RHW 
Sbjct: 237 QLDEAWSPYYTTHKIMQGLLDQYTLASNEKGLDVVLRMADYFSNRVKNLVQIHTIQRHWE 296

Query: 281 SLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV 340
           ++NEETGG NDV+Y+LY+IT D KHL +AHLFDKPCFLG L L  D +S  H NTH+P++
Sbjct: 297 AMNEETGGFNDVMYQLYTITRDQKHLTMAHLFDKPCFLGPLGLHKDDISGLHVNTHLPVL 356

Query: 341 IGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG-SENEE 399
           +G+Q RYEV GD LYK I T+  D+VN+SH++ATGGTS  E W DPKRL D +  S NEE
Sbjct: 357 VGAQKRYEVVGDRLYKDISTYLFDVVNSSHTFATGGTSTMEHWHDPKRLVDEIKISSNEE 416

Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
           TC TYN LKVSR+LFRWTKE  YAD+YER L NG++  QRGT+PGVM+Y LP+G G SK+
Sbjct: 417 TCATYNFLKVSRNLFRWTKEAKYADHYERLLINGIMGNQRGTQPGVMLYFLPMGPGRSKS 476

Query: 460 RSTH-----------GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
            S             GWG   ++FWCCYGTGIESFSKLGDSIYF EEG+ PGLYIIQYI 
Sbjct: 477 VSGQSPSGLPPKNPGGWGGPNDTFWCCYGTGIESFSKLGDSIYFLEEGDTPGLYIIQYIP 536

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
           S+FDWK+  + +NQ+  P++S DP+ +++LT S+K+   Q + +++R+P WT ++GA A 
Sbjct: 537 STFDWKATGLTVNQRAKPLLSTDPFFKVSLTISAKRGARQ-AKVSVRIPSWTTTDGATAI 595

Query: 569 LNGQNLPLPPPGN-----FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
           LNGQ L L P GN     FL+ T+ W+ ND LT+  P++LRTEAI+DDRPEYASIQA+LF
Sbjct: 596 LNGQKLNLTPTGNSTNGGFLTITKLWA-NDTLTLHFPITLRTEAIKDDRPEYASIQAVLF 654

Query: 624 GPYLLAGHTSGE-----------------WDIKTGTARSLSALISPI-PPSFNAQLVTFT 665
           GP+LLAG T G+                 W++    A S++  ++P+   + N+QLVT  
Sbjct: 655 GPHLLAGLTHGKLPVTDSSHSNDGLTAGIWEVDATGAASVAGWVTPLHSETLNSQLVTLK 714

Query: 666 QESGNSTFVMSNS--NQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSV 723
           Q  G  T V+S S  +  + M+E P  GTDA +HATFR   +    S       + G +V
Sbjct: 715 QSIGGRTLVLSVSIADAKLEMQEQPAPGTDACVHATFRAYGQAGGSSQL-----LRGPNV 769

Query: 724 MLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVS 783
            +EPFD PGM V  G      ++   +    + F  V GLD    +VSLE   R G FV+
Sbjct: 770 TIEPFDRPGMAVTNG------LAVGCRGGRDTLFNAVPGLDGAPGSVSLELATRPGWFVA 823

Query: 784 SG-VNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLL 842
           +        A+ +++C      A F RAASF     +  YHP+SF A+G  RNFLL PL 
Sbjct: 824 TAPTAMHANATTQVVCRANKGGAAFRRAASFARAPPLRRYHPLSFAARGTARNFLLEPLR 883

Query: 843 SFRDEAYTVYFNI 855
           S +DE YTVYF++
Sbjct: 884 SLQDEFYTVYFSL 896


>gi|297606169|ref|NP_001058067.2| Os06g0612900 [Oryza sativa Japonica Group]
 gi|255677223|dbj|BAF19981.2| Os06g0612900 [Oryza sativa Japonica Group]
          Length = 717

 Score =  773 bits (1995), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 404/724 (55%), Positives = 504/724 (69%), Gaps = 53/724 (7%)

Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK- 234
           MWASTHN T+  KM+ VV +L +CQ   GTGYLSAFP E FD FEA++PVWAPYYTIHK 
Sbjct: 1   MWASTHNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKA 60

Query: 235 -------------------------ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
                                    I+ GLLDQ+ +A N +AL M   M +YF  RV+ V
Sbjct: 61  RNATQSICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGKALGMVVAMADYFAGRVRSV 120

Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
           I  Y++ERHW SLNEETGGMNDVLY+LY+IT D +HL+LAHLFDKPCFLG LA+QAD LS
Sbjct: 121 IQRYTIERHWTSLNEETGGMNDVLYQLYTITKDQRHLVLAHLFDKPCFLGLLAVQADSLS 180

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
            FHANTHIP+VIG QMRYEVTGDPLYK I TFFMDIVN+SHSYATGGTS  EFW +PK L
Sbjct: 181 GFHANTHIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVSEFWSNPKHL 240

Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
           A+ L +E EE+CTTYNMLKVSRHLFRWTKEIAYADYYERAL NGVLSIQRG +PGVMIYM
Sbjct: 241 AEALTTETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMIYM 300

Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
           LP G G SKA S HGWGT++NSFWCCYGTGIESFSKLGDSIYFE++G+ PGLYIIQYI S
Sbjct: 301 LPQGPGRSKAVSYHGWGTQYNSFWCCYGTGIESFSKLGDSIYFEQKGDKPGLYIIQYIPS 360

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
           +F+W++  + + Q+V P+ S D YL+++L+ S+ +  GQ ++LN+R+P WT  NGA+A+L
Sbjct: 361 TFNWRTAGLTVTQQVKPLSSSDQYLQVSLSISAAKTNGQYATLNVRIPSWTSMNGAKATL 420

Query: 570 NGQNLPLPPPGNFLSATERW-SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
           N ++L L  PG FL+ +++W S +D L +Q P++LRTEAI+DDRP+ AS+ AILFGP+LL
Sbjct: 421 NDKDLQLASPGTFLTISKQWDSGDDHLLLQFPINLRTEAIKDDRPQVASLNAILFGPFLL 480

Query: 629 AGHTSGEWD-IKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQ-SITMEE 686
           AG T+G+WD    G A + S  I+P+P S+N+QLVT TQESG  T ++S  N  S+ M E
Sbjct: 481 AGLTTGDWDAKTGGAATAASDWITPVPASYNSQLVTLTQESGGKTMLLSTVNDTSLAMLE 540

Query: 687 FP--VSGTDAALHATFRLI--------LKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQ 736
            P    GTDAA+ ATFR++         + A          +   +  +EPF  PG  V 
Sbjct: 541 RPEGAGGTDAAVRATFRVVPPGSRAELRQRAGAGAGEGAARLKVAAATIEPFGLPGTAVS 600

Query: 737 QGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKL 796
            G    L V  +     S+ F +  GLD +  +VSLE  ++ GCF+ +G     GA + +
Sbjct: 601 NG----LAVVRAGNS-SSTLFNVAPGLDGKPGSVSLELGSKPGCFLVAGA----GAKVHV 651

Query: 797 LCSTE-----SLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTV 851
            C T      +  AGF +AASF     +  YH ISF A G RR+FLL PL + RDE YT+
Sbjct: 652 GCRTRGGAAAAAAAGFEQAASFAQAEPLRRYHAISFFASGVRRSFLLEPLFTLRDEFYTI 711

Query: 852 YFNI 855
           YFN+
Sbjct: 712 YFNL 715


>gi|302818405|ref|XP_002990876.1| hypothetical protein SELMODRAFT_20509 [Selaginella moellendorffii]
 gi|300141437|gb|EFJ08149.1| hypothetical protein SELMODRAFT_20509 [Selaginella moellendorffii]
          Length = 755

 Score =  773 bits (1995), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 416/775 (53%), Positives = 523/775 (67%), Gaps = 37/775 (4%)

Query: 98  FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
           FL+ VSLHDV L   S    AQQTNL+YLLMLDVD+LV+SFR TA L   G AYGGWE P
Sbjct: 1   FLEAVSLHDVRLLPDSWQAIAQQTNLDYLLMLDVDNLVYSFRTTAGLNASGSAYGGWELP 60

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
            SELRGHFVGHYLSASA  WASTHN TI E M+ VV +L+ECQ KIGTGYLSAFPT LFD
Sbjct: 61  TSELRGHFVGHYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSAFPTSLFD 120

Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
            FEAL+ VWAPYYTIHKI+AGLLDQY  A N+ A +M   M +YF +RV++VI  YS+ER
Sbjct: 121 RFEALESVWAPYYTIHKIMAGLLDQYTYAANSLAFEMLLGMTDYFGSRVERVIEKYSIER 180

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
           HW SLNEETGGMNDVLYR+Y IT D KHL LAHLFDKPCFLG LA++AD +S FHANTHI
Sbjct: 181 HWQSLNEETGGMNDVLYRVYQITGDAKHLKLAHLFDKPCFLGLLAVRADSISGFHANTHI 240

Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
           PIVIG+Q+RYEV GD LYK +  +FM IV++SH+YATGGTSA EFW DP RL DTLG+EN
Sbjct: 241 PIVIGAQLRYEVVGDKLYKDLSEYFMKIVSSSHTYATGGTSAGEFWSDPSRLGDTLGTEN 300

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
           EE+CTTYNMLKV+R+LFRWTK++ YAD+YERAL NGVL+IQRG EPGVMIYMLPL  G S
Sbjct: 301 EESCTTYNMLKVARNLFRWTKQMHYADFYERALINGVLTIQRGKEPGVMIYMLPLAPGSS 360

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEE-GNVPGLYIIQYISSSFDWKSG 516
           KA S HGWGT F+SFWCCYGT IESFSKLGDSIYF +E  + P LY+IQY+SS   W + 
Sbjct: 361 KATSYHGWGTPFSSFWCCYGTAIESFSKLGDSIYFTDEVQDTPQLYVIQYLSSKVLWTAA 420

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEV-GQLS--SLNLRMPVWTYSNGAQASLNGQN 573
            + ++Q+V  + S DP   MT+TF+  Q V G+ S   L++R+P W  S  ++  LNG  
Sbjct: 421 GLSVDQRVYHMTSTDPV--MTVTFNFTQLVLGKTSEAKLSVRVPYWAQS--SRCLLNGLE 476

Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           L    PG F   +  W   DKL+      LR E IQD+R +Y+S+ AI +GPYLLAG + 
Sbjct: 477 LQNLTPGTFFDVSREWKTGDKLSFTFSAMLRLEKIQDERSKYSSLYAIYYGPYLLAGMSD 536

Query: 634 GEWDIKTGTARSLSALISPIPPSFNAQLVTFTQ-ESGNSTFVMSNSNQSITMEEFPVSGT 692
           G + + +    + S  I P+    ++ L +FTQ + G   ++ ++S+ +++M   P  G+
Sbjct: 537 GNYKLGSVNVSTPSRWIKPVR---DSNLFSFTQLQQGKLQYLAASSDGALSMISKPQHGS 593

Query: 693 DAALHATFRLIL-------KDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQ-GKEDELV 744
           + A  ATFRL L       +   + + +SL  ++ + V LE  + PG  V   G ED + 
Sbjct: 594 EEAPLATFRLKLLPSLKTIEKFQVKDVTSL--LLDREVSLELLNRPGRFVTHFGIEDGVR 651

Query: 745 VSESPKEMGSSG---FRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTE 801
           ++        S    F+L + L      +S EA   +GCF+ +      G  + L C   
Sbjct: 652 LTNGKSSGFPSSSSVFKLRSALSGHPGEISFEASGIQGCFLVA-----QGRDITLECER- 705

Query: 802 SLDAGFNR-AASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
                FN+ AASF +  G + YHP+SF A G    +L+ PL S+ DE Y VYF +
Sbjct: 706 -----FNKMAASFGVTAGRASYHPMSFEAYGDNDTYLMFPLSSYSDEKYAVYFEV 755


>gi|302785087|ref|XP_002974315.1| hypothetical protein SELMODRAFT_30650 [Selaginella moellendorffii]
 gi|300157913|gb|EFJ24537.1| hypothetical protein SELMODRAFT_30650 [Selaginella moellendorffii]
          Length = 755

 Score =  770 bits (1989), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 416/775 (53%), Positives = 519/775 (66%), Gaps = 37/775 (4%)

Query: 98  FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
           FL  VSLHDV L   S    AQQTNL+YLLMLDVD+LV+SFR TA L   G AYGGWE P
Sbjct: 1   FLGAVSLHDVRLLPDSWQAIAQQTNLDYLLMLDVDNLVYSFRTTAGLNASGSAYGGWELP 60

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
            SELRGHFVGHYLSASA  WASTHN TI E M+ VV +L+ECQ KIGTGYLSAFPT LFD
Sbjct: 61  TSELRGHFVGHYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSAFPTSLFD 120

Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
            FEAL+ VWAPYYTIHKI+AGLLDQY  A N+ A +M   M +YF +RV+ VI  YS+ER
Sbjct: 121 RFEALESVWAPYYTIHKIMAGLLDQYTYAANSLAFEMLLGMTDYFGSRVEMVIEKYSIER 180

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
           HW SLNEETGGMNDVLYR+Y IT D KHL LAHLFDKPCFLG LA++AD +S FHANTHI
Sbjct: 181 HWQSLNEETGGMNDVLYRIYQITGDAKHLKLAHLFDKPCFLGLLAVRADSISGFHANTHI 240

Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
           PIVIG+Q+RYEV GD LYK +  +FM IV++SH+YATGGTS+ EFW +P RL DTLG+EN
Sbjct: 241 PIVIGAQLRYEVVGDKLYKDLSEYFMKIVSSSHTYATGGTSSGEFWSNPNRLGDTLGTEN 300

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
           EE+CTTYNMLKV+R+LFRWTK++ YAD+YERAL NGVL+IQRG EPGVMIYMLPL  G S
Sbjct: 301 EESCTTYNMLKVARNLFRWTKQMHYADFYERALINGVLTIQRGKEPGVMIYMLPLAPGSS 360

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEE-GNVPGLYIIQYISSSFDWKSG 516
           KA+S HGWGT F SFWCCYGT IESFSKLGDSIYF  E  + P LY+IQY+SS   W + 
Sbjct: 361 KAKSYHGWGTPFTSFWCCYGTAIESFSKLGDSIYFTNEVQDTPQLYVIQYLSSKVLWTAA 420

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEV-GQLS--SLNLRMPVWTYSNGAQASLNGQN 573
            + L+Q+V  + S DP   MT+TF+  Q V G+ S   L++R+P W  S  ++  LNG  
Sbjct: 421 GLSLDQRVYHMTSTDPV--MTVTFNFTQLVLGKTSEAKLSVRVPYWAQS--SRCLLNGLE 476

Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           L    PG F   +  W   DKL+      LR E IQD+R +Y+S+ AI +GPYLLAG + 
Sbjct: 477 LQNLTPGTFFDVSREWKTGDKLSFTFSAMLRLEKIQDERSKYSSLYAIYYGPYLLAGMSD 536

Query: 634 GEWDIKTGTARSLSALISPIPPSFNAQLVTFTQ-ESGNSTFVMSNSNQSITMEEFPVSGT 692
           G + + +    + S  I P+  S    L +FTQ + G   ++ ++S+ +++M   P  G+
Sbjct: 537 GNYKLGSVNVSTPSRWIKPVRDS---NLFSFTQLQQGKLQYLAASSDGALSMISKPQHGS 593

Query: 693 DAALHATFRLIL-------KDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQ-GKEDELV 744
           + A  ATFRL L       +   + + +SL  ++ + V LE  + PG  V   G ED + 
Sbjct: 594 EEASLATFRLKLLPSLKTIEKIQVKDVTSL--LLDREVSLELLNRPGRFVTYFGIEDGVR 651

Query: 745 VSESPKEMGSSG---FRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTE 801
           ++        S    F+L + L      +S EA   +GCF+ +      G  + L C   
Sbjct: 652 LTNGKSSGFPSSSSVFKLRSALSGHPGEISFEASGIQGCFLVA-----QGRDITLECER- 705

Query: 802 SLDAGFNR-AASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
                FN+ AASF +  G + YHP+SF A G    +L+ PL S+ DE Y VYF +
Sbjct: 706 -----FNKMAASFGVTTGRASYHPMSFEAYGGNDTYLMFPLSSYSDEKYAVYFEV 755


>gi|302788790|ref|XP_002976164.1| hypothetical protein SELMODRAFT_104205 [Selaginella moellendorffii]
 gi|300156440|gb|EFJ23069.1| hypothetical protein SELMODRAFT_104205 [Selaginella moellendorffii]
          Length = 797

 Score =  757 bits (1954), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 390/788 (49%), Positives = 518/788 (65%), Gaps = 41/788 (5%)

Query: 97  NFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
           + L+  SLH V +D  S+  + QQTNLEYLLMLDVDSL +SFR  + LPT G  YGGWE 
Sbjct: 21  HLLEGSSLHKVRIDADSLQGKGQQTNLEYLLMLDVDSLAYSFRNNSGLPTKGVPYGGWEA 80

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
           P  ELRGHFVGHYLSA+A+MWASTHN  +K +M  +V  L ECQ KIGTGYLSAFP  LF
Sbjct: 81  PDQELRGHFVGHYLSATAKMWASTHNEELKRRMDHLVDILDECQQKIGTGYLSAFPLNLF 140

Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
             FE  +PVWAPYYTIHKI+AGLLDQY  A N +AL+M  WM +YF  RV+  I  YS++
Sbjct: 141 TRFETYRPVWAPYYTIHKIMAGLLDQYTEAGNMKALRMVIWMAQYFSKRVENYIEKYSIQ 200

Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
            H+ +LNEETGGMNDVLY LY IT DP+HL LAHLFDKPCFLG LALQ D LS FHANTH
Sbjct: 201 AHFQALNEETGGMNDVLYDLYKITGDPQHLKLAHLFDKPCFLGPLALQQDTLSGFHANTH 260

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
           IPI+IG+Q RYE+TGD + K + TFFMD VN+SH + TGGTS  EFW DP R+A +LG +
Sbjct: 261 IPILIGAQKRYELTGDQVSKELVTFFMDAVNSSHRFVTGGTSDNEFWKDPNRMASSLGKD 320

Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
            EE+C++YNMLK++R+LFRWTKE +Y DYYER + NGVL+IQRG EPGVMIYMLP+G G+
Sbjct: 321 VEESCSSYNMLKIARNLFRWTKEASYMDYYERLILNGVLTIQRG-EPGVMIYMLPMGPGM 379

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG----------NVPGLYIIQY 506
           +K  ST GWG  F+SFWCCYGTGIESFSK GDSIYFE+ G           +P LY+ Q+
Sbjct: 380 AKTSSTMGWGDPFDSFWCCYGTGIESFSKFGDSIYFEDYGVRDENPGAQRPIPALYVAQF 439

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV---------GQLSSLNLRMP 557
           + S+ +W S  ++L Q V P+ S+DP + +T+      +            +++L +R+P
Sbjct: 440 VPSTLEWDSAGLILKQTVKPLTSFDPVMEVTIHLHENPKATIEETSPYHKLINTLYVRIP 499

Query: 558 VWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
            W  S G +A  N +   +  PG+FL+    W   D+LT + P  +R E IQDDR E+ S
Sbjct: 500 SWVAS-GYEAYFNDEPQDI-TPGSFLAIQREWKAGDRLTFKFPAEVRLEHIQDDREEHQS 557

Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN 677
           +  I+FGP++LAG + GE+D+      S S  I+P+ PS N  L TF        + + +
Sbjct: 558 LNGIMFGPFVLAGLSHGEFDLGPVDTSSPSDWITPVNPSDNDLLYTFRM----GDYQLGH 613

Query: 678 SNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLV-Q 736
            ++++T++    +GTD    ATF++I   +     S  + ++G+ V LE  D PG ++  
Sbjct: 614 KHRTVTIDSASTNGTDWDFQATFKVISSSSPSLAASKHSGLVGRVVSLELMDQPGRIIAH 673

Query: 737 QGKEDELVVSESPKEMGSS--------GFRLVAGLDKRNETVSLEAENRKGCFVSSGVNF 788
            G    LVV ++ +   S+        GF++V GL   +  VS E+++  GC++    ++
Sbjct: 674 SGINKNLVVVDTSQFADSTNYLSQANLGFKVVPGL-ASDRLVSFESQDLPGCYIYVD-DW 731

Query: 789 EPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKG-ARRNFLLAPLLSFRDE 847
              A LK  C ++  D GF+  ASF +  G+  YHP+SFVA     RNFLL P L++RDE
Sbjct: 732 RVPAQLK--CRSKEND-GFDAKASFKVSQGLRSYHPLSFVATSQGLRNFLLFPQLAYRDE 788

Query: 848 AYTVYFNI 855
            Y +YF++
Sbjct: 789 HYAIYFDM 796


>gi|302769588|ref|XP_002968213.1| hypothetical protein SELMODRAFT_89765 [Selaginella moellendorffii]
 gi|300163857|gb|EFJ30467.1| hypothetical protein SELMODRAFT_89765 [Selaginella moellendorffii]
          Length = 797

 Score =  756 bits (1951), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 390/788 (49%), Positives = 517/788 (65%), Gaps = 41/788 (5%)

Query: 97  NFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
           + L+  SLH V +D  S+  + QQTNLEYLLMLDVDSL +SFR  + LPT G  YGGWE 
Sbjct: 21  HLLEGSSLHKVRIDADSLQGKGQQTNLEYLLMLDVDSLAYSFRNNSGLPTKGVPYGGWEA 80

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
           P  ELRGHFVGHYLSA+A+MWASTHN  +K +M  +V  L ECQ KIGTGYLSAFP  LF
Sbjct: 81  PDQELRGHFVGHYLSATAKMWASTHNEELKRRMDHLVDILDECQQKIGTGYLSAFPLNLF 140

Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
             FE  +PVWAPYYTIHKI+AGLLDQY  A N +AL+M  WM +YF  RV+  I  YS++
Sbjct: 141 TRFETYRPVWAPYYTIHKIMAGLLDQYTEAGNMKALRMVIWMAQYFSKRVENYIEKYSIQ 200

Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
            H+ +LNEETGGMNDVLY LY IT DP+HL LAHLFDKPCFLG LALQ D LS FHANTH
Sbjct: 201 AHFQALNEETGGMNDVLYDLYKITGDPQHLKLAHLFDKPCFLGPLALQQDTLSGFHANTH 260

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
           IPI+IG+Q RYE+TGD + K + TFFMD VN+SH + TGGTS  EFW DP R+A +LG +
Sbjct: 261 IPILIGAQKRYELTGDQVSKELVTFFMDAVNSSHRFVTGGTSDNEFWKDPNRMASSLGKD 320

Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
            EE+C++YNMLK++R+LFRWTK+ +Y DYYER + NGVL+IQRG EPGVMIYMLP+G G+
Sbjct: 321 VEESCSSYNMLKIARNLFRWTKDASYMDYYERLILNGVLTIQRG-EPGVMIYMLPMGPGM 379

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG----------NVPGLYIIQY 506
           +K  ST GWG  F+SFWCCYGTGIESFSK GDSIYFE+ G           +P LY+ Q+
Sbjct: 380 AKTSSTMGWGDPFDSFWCCYGTGIESFSKFGDSIYFEDYGVRDENPGAQRPIPALYVAQF 439

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV---------GQLSSLNLRMP 557
           + S+ +W S  ++L Q V P+ S+DP + +T+      +            +++L +R+P
Sbjct: 440 VPSTLEWDSAGLILKQTVKPLTSFDPVMEVTIHLHENPKATIEETSPYHKLINTLYVRIP 499

Query: 558 VWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
            W  S G +A  N +   +  PG+FL+    W   DKLT + P  +R E IQDDR E+ S
Sbjct: 500 SWVAS-GYEAYFNDEPQDI-TPGSFLAIQREWKAGDKLTFKFPAEVRLEHIQDDREEHQS 557

Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN 677
           +  I+FGP++LAG + GE+D+      S S  I+P+ PS N  L TF        + + +
Sbjct: 558 LNGIMFGPFVLAGLSHGEFDLGPVDTSSPSDWITPVNPSDNDLLYTFRM----GDYQLGH 613

Query: 678 SNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLV-Q 736
            ++++T++    +GTD    ATF++I   +     S  + ++G+ V LE  D PG ++  
Sbjct: 614 KHRTVTLDSASTNGTDWDFEATFKVISSSSPSLAASKHSGLVGRVVSLELLDQPGRIIAH 673

Query: 737 QGKEDELVVSESPKEMGSS--------GFRLVAGLDKRNETVSLEAENRKGCFVSSGVNF 788
            G    LVV ++ +   S+        GF++V GL   +  VS E+++  GC++    ++
Sbjct: 674 SGINKNLVVVDTSQFADSTNYLSQANLGFKVVPGL-ASDRLVSFESQDLPGCYIYVD-DW 731

Query: 789 EPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKG-ARRNFLLAPLLSFRDE 847
              A LK  C ++  D GF+  ASF    G+  YHP+SFVA     RNFLL P L++RDE
Sbjct: 732 RVPAQLK--CRSKEND-GFDAKASFKASQGLRSYHPLSFVATSQGLRNFLLFPQLAYRDE 788

Query: 848 AYTVYFNI 855
            Y +YF++
Sbjct: 789 HYAIYFDM 796


>gi|357472933|ref|XP_003606751.1| hypothetical protein MTR_4g065220 [Medicago truncatula]
 gi|355507806|gb|AES88948.1| hypothetical protein MTR_4g065220 [Medicago truncatula]
          Length = 593

 Score =  715 bits (1845), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/677 (55%), Positives = 456/677 (67%), Gaps = 94/677 (13%)

Query: 189 MSTVVFSLSECQNKIGTGYLSAFPTELF-DSFEALKPVWAPYYTIHKIL------AGLLD 241
           MS +V  LS CQ K   G        +F    + L+  WAPYYTIHK+          LD
Sbjct: 1   MSALVSGLSACQEKNWNGISVCISNRVFLIELKNLEYAWAPYYTIHKLFDFDRSWLAFLD 60

Query: 242 QYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITH 301
           QY +A N Q LKM TWMV+YFYNRV  VI  ++V RH+ SLNEE GGMND+LYRLYS+T 
Sbjct: 61  QYTIAGNPQGLKMVTWMVDYFYNRVMNVIQKFTVNRHYQSLNEEAGGMNDLLYRLYSLTR 120

Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTF 361
           DPKHL LAHLFDKPCFLG LA+Q + ++ FHANTHIPIV+G+Q+RYE+TGD  YK IG +
Sbjct: 121 DPKHLELAHLFDKPCFLGVLAVQGNDIADFHANTHIPIVVGAQLRYELTGDLHYKDIGQY 180

Query: 362 FMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI 420
           FMDIVN+SH+YATGGTS  EFW +PKR+AD L S E EE+C+TYNMLKVSRHLFRWTKE+
Sbjct: 181 FMDIVNSSHAYATGGTSVGEFWRNPKRIADNLKSAETEESCSTYNMLKVSRHLFRWTKEV 240

Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
            YADYYERALTNGVLSIQRGT+PGVMIYMLPLG GVSKA++   WGT F+SFWCCYGTGI
Sbjct: 241 TYADYYERALTNGVLSIQRGTDPGVMIYMLPLGLGVSKAQTYWKWGTPFDSFWCCYGTGI 300

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
           ESFSKLGDSIYFEEEG    LYIIQYISSSF+W SG                        
Sbjct: 301 ESFSKLGDSIYFEEEGKHRSLYIIQYISSSFNWNSG------------------------ 336

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
                +G  S+LN R+P WT +NGA+A LN + LPLP P                     
Sbjct: 337 ---TAIGTSSTLNFRIPSWTLANGAKALLNSETLPLPAP--------------------- 372

Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQ 660
                    DDRPE+AS+QAIL+GPYLLAGHT+  W             I+PIP ++++Q
Sbjct: 373 ---------DDRPEFASLQAILYGPYLLAGHTT-NW-------------ITPIPSNYSSQ 409

Query: 661 LVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIG 720
           LV+++Q+   ST V++NS QS+TME  P  GT+ A HATFRLI KDA            G
Sbjct: 410 LVSYSQDINKSTLVITNSKQSLTMEILPGPGTENAPHATFRLIPKDAD-----------G 458

Query: 721 KSVMLEPFDFPGMLV-QQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKG 779
           K+VMLEPFD PGM V  QG E  L++ +S     SS F +V GLD RN+T+SLE+++ K 
Sbjct: 459 KTVMLEPFDLPGMTVSHQGPEKPLIIVDSSHGGPSSVFLVVPGLDGRNQTISLESQSNKD 518

Query: 780 CFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLA 839
           C+V S  +   G+ +KL+C + S +  FN+A SF+   G+ +Y+PISFVAKGA +NFLL 
Sbjct: 519 CYVHS--DMSAGSGVKLVCKSAS-ETSFNQANSFVSGKGLRQYNPISFVAKGANQNFLLE 575

Query: 840 PLLSFRDEAYTVYFNIQ 856
           PL +FRDE YTVYFN+Q
Sbjct: 576 PLFNFRDEHYTVYFNLQ 592


>gi|449522353|ref|XP_004168191.1| PREDICTED: uncharacterized protein LOC101224273 [Cucumis sativus]
          Length = 495

 Score =  666 bits (1719), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/494 (66%), Positives = 397/494 (80%), Gaps = 4/494 (0%)

Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
           MDIVN+SHSYATGGTS  EFW DPKRLAD LG+E EE+CTTYNMLKVSR+LF+WTKEIAY
Sbjct: 1   MDIVNSSHSYATGGTSVHEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTKEIAY 60

Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
           ADYYERALTNGVLSIQRGT+PGVMIYMLPLG G SKA S HGWGT F SFWCCYGTGIES
Sbjct: 61  ADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKAISYHGWGTPFESFWCCYGTGIES 120

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
           FSKLGDSIYFEEE   P LY+IQYISSS DWKSG+V+LNQ VDPI S DP LRMTLTFS 
Sbjct: 121 FSKLGDSIYFEEELQTPTLYVIQYISSSLDWKSGNVLLNQTVDPIHSEDPKLRMTLTFSP 180

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
           K  V   S++NLR+P WT ++GA+  LNGQ+L     GNF S T  WS  +KL+++LP++
Sbjct: 181 KGSV-HSSTINLRIPSWTSASGAKVVLNGQSLGNNINGNFKSVTNSWSSGNKLSLELPIN 239

Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
           LRTEAI DDR EYAS++AILFGPYLLA +++G+W+IKT  A SLS  I+ +P ++N  LV
Sbjct: 240 LRTEAIDDDRSEYASVKAILFGPYLLAAYSNGDWEIKTQQADSLSDWITHVPSAYNTFLV 299

Query: 663 TFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKS 722
           TF+Q SG ++F ++NSNQSITME++P  GTD+A+HATFRLI+ D S +  + L +VIGK 
Sbjct: 300 TFSQASGKTSFALTNSNQSITMEKYPGQGTDSAVHATFRLIIDDPS-AKVTELQDVIGKR 358

Query: 723 VMLEPFDFPGMLV-QQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCF 781
           VMLEPF FPGM++  +GK++ L ++++  E  SS F LV GLD +N TVSL + + +GCF
Sbjct: 359 VMLEPFSFPGMVLGNKGKDERLEIADANSEGHSSDFYLVEGLDGKNGTVSLASIDNEGCF 418

Query: 782 VSSGVNFEPGASLKLLCSTE-SLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAP 840
           V SGVN+E GA LKL C ++ SLD GF+ A+SF++E G S+YHPISFV KG  RNFLLAP
Sbjct: 419 VYSGVNYESGAQLKLSCKSKLSLDDGFDEASSFLLESGASQYHPISFVTKGMTRNFLLAP 478

Query: 841 LLSFRDEAYTVYFN 854
           LLSF DE+YTVYFN
Sbjct: 479 LLSFVDESYTVYFN 492


>gi|125556048|gb|EAZ01654.1| hypothetical protein OsI_23690 [Oryza sativa Indica Group]
          Length = 466

 Score =  612 bits (1578), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 295/461 (63%), Positives = 356/461 (77%), Gaps = 26/461 (5%)

Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK- 234
           MWASTHN T+  KM+ VV +L +CQ   GTGYLSAFP E FD FEA++PVWAPYYTIHK 
Sbjct: 1   MWASTHNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKA 60

Query: 235 -------------------------ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
                                    I+ GLLDQ+ +A N +AL M   M +YF  RV+ V
Sbjct: 61  RNATQSICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGRALGMVVAMADYFAGRVRSV 120

Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
           I  Y++ERHW SLNEETGGMNDVLY+LY+IT D +HL+LAHLFDKPCFLG LA+QAD LS
Sbjct: 121 IQRYTIERHWTSLNEETGGMNDVLYQLYTITKDQRHLVLAHLFDKPCFLGLLAVQADSLS 180

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
            FHANTHIP+VIG QMRYEVTGDPLYK I TFFMDIVN+SHSYATGGTS  EFW +PK L
Sbjct: 181 GFHANTHIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVSEFWSNPKHL 240

Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
           A+ L +E EE+CTTYNMLKVSRHLFRWTKEIAYADYYERAL NGVLSIQRG +PGVMIYM
Sbjct: 241 AEALTTETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMIYM 300

Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
           LP G G SKA S HGWGT++NSFWCCYGTGIESFSKLGDSIYFE++G+ PGLYIIQYI S
Sbjct: 301 LPQGPGRSKAVSYHGWGTQYNSFWCCYGTGIESFSKLGDSIYFEQKGDKPGLYIIQYIPS 360

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
           +F+W++  + + Q+V P+ S D YL+++L+ S+ +  GQ ++LN+R+P WT  NGA+A+L
Sbjct: 361 TFNWRTAGLTVTQQVKPLSSSDQYLQVSLSISAAKTNGQYATLNVRIPSWTSMNGAKATL 420

Query: 570 NGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
           N ++L L  PG FL+ +++W   D L +Q P++LRTEAI+D
Sbjct: 421 NDKDLQLASPGTFLTISKQWDSGDHLLLQFPINLRTEAIKD 461


>gi|413926260|gb|AFW66192.1| hypothetical protein ZEAMMB73_605676 [Zea mays]
 gi|413952504|gb|AFW85153.1| hypothetical protein ZEAMMB73_422486 [Zea mays]
          Length = 510

 Score =  592 bits (1525), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 291/517 (56%), Positives = 381/517 (73%), Gaps = 13/517 (2%)

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
           MRYEVTGDPLYK I +FFMD +N+SHSYATGGTSA EFW DPKRLA TL +ENEE+CTTY
Sbjct: 1   MRYEVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTY 60

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           NMLKVSR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP   G SKA S HG
Sbjct: 61  NMLKVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHG 120

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
           WGTK++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+  + + Q++
Sbjct: 121 WGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQQI 180

Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLS 584
             + S D YL+++ + S+    GQ +++N R+P WT+++GA A+LNG++L    PG+FLS
Sbjct: 181 KTLSSSDQYLQISFSISANTS-GQTANINFRIPSWTFADGAGATLNGKDLGSISPGSFLS 239

Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTAR 644
            T++W+ +D L +  P+ LRTEAI+DDR EYAS+QA+LFGP++LAG ++G+WD K G   
Sbjct: 240 ITKQWNSDDHLALHFPIRLRTEAIKDDRLEYASLQAVLFGPFVLAGLSTGDWDAKAGNGS 299

Query: 645 SLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATFRLI 703
           ++S  I+ +PP+ N+QLVTFTQ S    FV+S++N ++TM+E P V GTDAA+HATFR  
Sbjct: 300 AISDWIAAVPPAHNSQLVTFTQVSNGKAFVLSSANGTLTMQERPEVDGTDAAIHATFRAH 359

Query: 704 LKDAS--LSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVA 761
            ++ S  L +  S   + G S++LEPFD PG ++         ++ S ++   S F +V 
Sbjct: 360 PQEDSTELHDIYS-TTLTGTSILLEPFDLPGTVITNN------LTLSAQKSSDSLFNIVP 412

Query: 762 GLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIGI 819
           GLD    +VSLE   + GCF+ +G N+  G  +++ C  S ES+     +AASF     +
Sbjct: 413 GLDGNPNSVSLELGTKPGCFLVTGTNYSAGTRIEVNCKSSLESIGGILEQAASFSQTDPL 472

Query: 820 SEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNIQ 856
            +YHPISFVAKG  RNFLL PL S RDE YTVYFN++
Sbjct: 473 RQYHPISFVAKGVARNFLLEPLYSLRDEFYTVYFNVR 509


>gi|413954825|gb|AFW87474.1| hypothetical protein ZEAMMB73_309562 [Zea mays]
          Length = 483

 Score =  521 bits (1341), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 273/501 (54%), Positives = 355/501 (70%), Gaps = 28/501 (5%)

Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
           MD VN+SH+YATGGTS  EFW +PKRLA+ L +E EE+CTTYNMLKVSRHLFRWTKEIAY
Sbjct: 1   MDTVNSSHAYATGGTSVSEFWSNPKRLAEALTTETEESCTTYNMLKVSRHLFRWTKEIAY 60

Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
           ADYYERAL NGVLSIQRG +PGVMIYMLP G G SKA+S HGWGT++ SFWCCYGTGIES
Sbjct: 61  ADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAKSYHGWGTQYESFWCCYGTGIES 120

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
           FSKLGDSIYFEE G  P LY++Q+I S+F W++  + + Q++ P+ S D YL+++ + S+
Sbjct: 121 FSKLGDSIYFEERGERPALYVVQFIPSTFSWRTAGLTVAQQLMPLSSSDQYLQVSFSVSA 180

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
           K   GQ ++LN+R+P WT  NGA+A+LNG++L L  PG FL+ +++W   D+L++QLP+ 
Sbjct: 181 KTTNGQFATLNVRIPSWTSLNGAKATLNGKHLELASPGTFLTISKQWGSGDQLSLQLPIH 240

Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG-TARSLSALISPIPPSFNAQL 661
           LRTEAI+DDRPEYASIQA+LFGP+LLAG T+G+WD KTG    + S  I+P+P   N+QL
Sbjct: 241 LRTEAIKDDRPEYASIQAVLFGPFLLAGLTTGDWDAKTGAADAAASDWITPVPVESNSQL 300

Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPV--SGTDAALHATFRLILKDASLSNFSSLNNVI 719
           VT  QESG   FV+S  N S+TM + P    GT+AA+HATFRL+ +  + +         
Sbjct: 301 VTLAQESGGEAFVLSALNGSLTMLQRPKDGGGTEAAVHATFRLVPQGGAGAG-------- 352

Query: 720 GKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKG 779
             + MLEP D PGM+V     D L V+ + K  G++ F +V GL     +VSLE  +R G
Sbjct: 353 -AAAMLEPLDMPGMVV----TDRLTVA-AEKSSGAA-FNVVPGLAGAPGSVSLELASRPG 405

Query: 780 CFVSSGVNFEPGASLKLLCSTESLD-----AGFNRAASFMMEIGISEYHPISFVAKGARR 834
           CF+  G     G  +++ C+  +       A F R+ASF     +  YHP+SF A+G RR
Sbjct: 406 CFLVGG-----GEKVQVGCAGGAQQKRGDGAWFRRSASFARGEPLRRYHPMSFAARGVRR 460

Query: 835 NFLLAPLLSFRDEAYTVYFNI 855
           +FLL PL + RDE YTVYFN+
Sbjct: 461 SFLLEPLFTLRDEFYTVYFNL 481


>gi|449531121|ref|XP_004172536.1| PREDICTED: uncharacterized LOC101224273, partial [Cucumis sativus]
          Length = 366

 Score =  500 bits (1287), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 240/340 (70%), Positives = 281/340 (82%), Gaps = 3/340 (0%)

Query: 19  KQCTNQ-SPYDSHAFRYELTST-NKTWKEEVLSHFHLTPTDDSAWSSLIPSKILGDQKDE 76
           K+CTN  +   SH FRYEL S+ N TWK+E+ SH+HLTPTDD AWS+L+P K+L ++ +E
Sbjct: 28  KECTNTPTQLGSHTFRYELLSSGNVTWKKELFSHYHLTPTDDFAWSNLLPRKMLKEE-NE 86

Query: 77  VSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVW 136
            +W ++YR++KN  G  +PG  LKE+SLHDV LD +S+   AQ TNL+YLLMLDVD L+W
Sbjct: 87  YNWEMMYRQMKNKDGLRIPGGMLKEISLHDVRLDPNSLHGTAQTTNLKYLLMLDVDRLLW 146

Query: 137 SFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSL 196
           SFRKTA LPTPG+ Y GWE    ELRGHFVGHYLSASAQMWAST N+ +KEKMS +V  L
Sbjct: 147 SFRKTAGLPTPGEPYVGWEKSDCELRGHFVGHYLSASAQMWASTGNSVLKEKMSALVSGL 206

Query: 197 SECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMAT 256
           + CQ+K+GTGYLSAFP+E FD FEA++PVWAPYYTIHKILAGLLDQY  A N+QALKM T
Sbjct: 207 ATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAGLLDQYTFAGNSQALKMVT 266

Query: 257 WMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPC 316
           WMVEYFYNRVQ VI  Y+VERH+ SLNEETGGMNDVLYRLY IT + KHLLLAHLFDKPC
Sbjct: 267 WMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYRITGNTKHLLLAHLFDKPC 326

Query: 317 FLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYK 356
           FLG LA+QA+ +S FH NTHIPIV+GSQMRYEVTGDPLYK
Sbjct: 327 FLGLLAVQAEDISGFHVNTHIPIVVGSQMRYEVTGDPLYK 366


>gi|159491176|ref|XP_001703549.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280473|gb|EDP06231.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1485

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 296/875 (33%), Positives = 416/875 (47%), Gaps = 180/875 (20%)

Query: 117  RAQQTNLEYLL-MLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFVGHYLSASA 174
            R ++ N +YLL MLD D L+W FRK A LPTPG+ Y G WE+P  ELRGHFVGHYLSA +
Sbjct: 557  RYERINSKYLLDMLDADRLLWVFRKNAGLPTPGEPYVGSWEDPNCELRGHFVGHYLSALS 616

Query: 175  QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK 234
              WA T N+  K ++  +V  L + Q K+GTGYLSAFPT  FD  E+L+ VWAPYYTIHK
Sbjct: 617  LAWAGTGNSAFKTRLDLMVSELGKVQEKLGTGYLSAFPTSWFDRVESLQAVWAPYYTIHK 676

Query: 235  ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVL 293
            I+AGL+D + LA +  AL MAT MV+Y +NR Q VI+     +HW  + E E GGMN++L
Sbjct: 677  IIAGLVDAHELAGHPSALTMATRMVDYHWNRTQAVISKKGA-KHWQKVLEFEYGGMNEIL 735

Query: 294  YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
            YRLY IT    H   A LFDK  FLG +A   D L   HANTH+  ++G    YE TG+P
Sbjct: 736  YRLYLITGKDDHRDFASLFDKTVFLGHMAAHDDVLYDLHANTHLAQIVGFAAGYEATGNP 795

Query: 354  LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
              +     F +IV   H YATGGTS  E WW  +        +  ETCT YNMLK++R L
Sbjct: 796  KLRTAVNNFFEIVVQHHGYATGGTSVFERWWGRRGRGPRNALKTHETCTQYNMLKIARQL 855

Query: 414  FRWTKEIAYADYYERALTNGVLSIQR-------------------GTEP----------- 443
            F WT ++ YAD+YERA+ NG+  + R                   G +P           
Sbjct: 856  FMWTGDVYYADHYERAMVNGMWGVARLPADELPENGAAGAGGVDKGGQPVSPYTRFHDDE 915

Query: 444  ----------------------GVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
                                  GV +Y+LP+G G SK+ + H WG  F+SFWCCYGT IE
Sbjct: 916  WMDYISFSKPKPEWNASDAAGPGVYLYLLPMGHGNSKSDNLHHWGFPFHSFWCCYGTIIE 975

Query: 482  SFSKLGDSIYF-------------EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
            S++KL DSI+F             E+ G        ++  +  D  +       K+ P +
Sbjct: 976  SYAKLADSIFFKWVRVRDMSPESDEDAGAKTAKKRTRHDVNPSDGSASGAKGAVKLPPRL 1035

Query: 529  SWDPYLRMTLTFSSKQEVGQLS----SLNLRMPVWTYSNGAQASLNGQ---NLPLPP-PG 580
              + ++   L+ +S       +    +L LR+P W    G    LNGQ     P  P P 
Sbjct: 1036 YLNQFVSSRLSKASSTTASGPTDGVFTLMLRIPAWARDGGVLLELNGQAFNGCPGAPLPD 1095

Query: 581  NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKT 640
            ++   T +W   D L++++ L       QD R EY S++A++ GPY++AG     W+   
Sbjct: 1096 SYCRITRKWQARDVLSVRVALRWWFSPAQDAREEYRSLKAVMMGPYMMAG-----WN--- 1147

Query: 641  GTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATF 700
                      S +    +AQ++      G+S     +S+ S+       +G  ++L +  
Sbjct: 1148 ----------SSLHLRHDAQILYIEDADGSS----GHSHGSL-------AGAFSSLRSMM 1186

Query: 701  RLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSES-PKEMGSSGFR- 758
            RL   D+            G ++ LE   +P   +     D +V+    P+E  S  F  
Sbjct: 1187 RLGAADS------------GSALSLEAMSYPNHYLAHDHTDVIVLQPGPPREDASHPFAP 1234

Query: 759  -------LVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGAS------------------ 793
                   +  GLD   +TVS EA  R G FV++     PG S                  
Sbjct: 1235 CSRAMWMMRPGLDGAADTVSFEAVARPGWFVTAAR--PPGESAAAAKDSPVTCVDANEVD 1292

Query: 794  ---------------LKLLC--------STESL---------DAGFNRAASFMMEIGISE 821
                            ++LC         TE            A +   ASF +   +  
Sbjct: 1293 CTAAVPDGCGTNAFLARVLCRKSCRSCLGTEQALRLRQQVPGSAVYAATASFRLAPPVRR 1352

Query: 822  YHPI-SFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
             +P  + V  G+ R++L+APL +  DE Y+ YFN+
Sbjct: 1353 AYPAGAHVLAGSNRHYLIAPLGNLVDERYSAYFNV 1387



 Score =  109 bits (273), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 67/213 (31%), Positives = 109/213 (51%), Gaps = 36/213 (16%)

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV---- 498
           PGV IY+LPLG G SK+ + H WG  F+SFWCCYGT IES++KL DSIYF+E        
Sbjct: 195 PGVFIYLLPLGTGQSKSDNIHHWGFPFHSFWCCYGTVIESYAKLADSIYFKEMSPANPES 254

Query: 499 -----------PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVG 547
                      P LY+ Q +SS   W   ++ +  + D + +  P     LT  S +  G
Sbjct: 255 RAHDKAGVRLPPRLYVNQLVSSKATWAEMNLRVTMQAD-MFTPGPAAVAQLTLDSTKAPG 313

Query: 548 QLS------SLNLRMPVWTYSN----------GAQASLNGQ---NLPLP-PPGNFLSATE 587
             +      +L +R+P W   +          GA   +NGQ   + P P   G++ +   
Sbjct: 314 PGTHDLGTFTLMVRVPEWLAPDRHGGVAQGGSGASIEVNGQLWTSCPGPVKAGSYCALMR 373

Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
           RW+  D ++++LP+  R +++ ++R ++  +++
Sbjct: 374 RWASGDGVSLRLPMRWRLQSLAENRAQHQGLKS 406



 Score =  106 bits (265), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 58/140 (41%), Positives = 77/140 (55%), Gaps = 22/140 (15%)

Query: 305 HLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMD 364
           H+  A LF+KP F   +    D L + HANTH+  V G    Y                D
Sbjct: 2   HMEFAQLFNKPFFRKPMEAGNDMLMNLHANTHLAQVAGFAEEY----------------D 45

Query: 365 IVNASHSYATGGTSAREFWWDPKRLADTL-----GSENEETCTTYNMLKVSRHLFRWTKE 419
            V+    +ATGG++  EFW  P  LAD++     G E +ETCT YN+LK++R LFRWT +
Sbjct: 46  TVD-KRVFATGGSTDHEFWQAPDELADSVLTQKHGVETQETCTQYNILKIARSLFRWTGD 104

Query: 420 IAYADYYERALTNGVLSIQR 439
           + YAD+YERAL NG+L   R
Sbjct: 105 VRYADFYERALVNGILGTAR 124


>gi|218198541|gb|EEC80968.1| hypothetical protein OsI_23691 [Oryza sativa Indica Group]
          Length = 759

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 242/518 (46%), Positives = 317/518 (61%), Gaps = 57/518 (11%)

Query: 385 DPKRLADTLG-SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
           DPKRL D +  S NEETC TYN+LKVSR+LFRWTKE  Y D+YER L NG++  QRG EP
Sbjct: 249 DPKRLVDEIKISSNEETCATYNLLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEP 308

Query: 444 GVMIYMLPLGRGVSKA-----------RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
           GVMIY LP+G G SK+           ++  GWG    +FWCCYGTGIESFSKLGDSIYF
Sbjct: 309 GVMIYFLPMGPGRSKSISGMPTSGLPPKNPGGWGNANATFWCCYGTGIESFSKLGDSIYF 368

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
            EEG +PGLYIIQYI S+FDWK+  + + Q+  P+ S D +  +++  SSK +  + +++
Sbjct: 369 LEEGEIPGLYIIQYIPSTFDWKAAGLTVKQQAKPLSSTDSHFEVSIFISSKGDA-RPANV 427

Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
           N+R+P WT  +GA A+LNGQ L L   G+FLS T+ W  +D L+++ P++LRTE I+DDR
Sbjct: 428 NVRIPSWTSVDGAIATLNGQKLNLTSAGDFLSVTKLWG-DDTLSLKFPITLRTEPIKDDR 486

Query: 613 PEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPI------------------- 653
           PEY+SIQA+LFGP+LLAG T G   +KT +  S S L   +                   
Sbjct: 487 PEYSSIQAVLFGPHLLAGLTHGNQTVKT-SNDSNSGLTPGVWEVNATHAAAAVAVWVTPV 545

Query: 654 PPSFNAQLVTFTQESGN----STFVMSNS--NQSITMEEFPVSGTDAALHATFRLILKDA 707
             S N+QLVT TQ  G+    + FV+S S  + ++TM+E PV+G+DA +HATFR     +
Sbjct: 546 SQSLNSQLVTLTQRDGDAQAAAAFVLSVSIADGALTMQESPVAGSDACVHATFRAYHSPS 605

Query: 708 SLSNF-SSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKR 766
             S   ++   + G+ V LEPFD PGM V     D L V    +   ++ F  VAGLD  
Sbjct: 606 GASAIDAATGRLQGRDVALEPFDRPGMAVT----DALSVG---RPGPATRFNAVAGLDGL 658

Query: 767 NETVSLEAENRKGCFVSSGVN-FEPGASLKLLCSTESL--------DAGFNRAASFMMEI 817
             TVSLE   R GCFV++    +  GA  ++ C   +         D  F RAASF    
Sbjct: 659 PGTVSLELATRPGCFVAAPTTAYLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASFTQAA 718

Query: 818 GISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
            +  YHP+SF A G  RNFLL PL S +DE YTVYFN+
Sbjct: 719 PLRLYHPLSFSATGTDRNFLLEPLQSLQDEFYTVYFNV 756



 Score =  204 bits (520), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 98/190 (51%), Positives = 127/190 (66%), Gaps = 8/190 (4%)

Query: 52  HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDL---PGNFLKEVSLHDVW 108
           HL   +++ W  L+P +     +DE+ W  LYR I   GG D+   P  FL   SLHDV 
Sbjct: 57  HLNQAEEATWMGLLPRR--AGPRDELDWLALYRSITR-GGGDVGGEPAGFLSPASLHDVR 113

Query: 109 LDQ--SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
           +D   +++ W+ QQTNLEYLL LD D L W+FR+ A LPT G+ YGGWE P  +LRGHF 
Sbjct: 114 VDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWEAPDGQLRGHFT 173

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVW 226
           GHYLSA+A MWASTHN  ++EKM+ VV  L  CQ K+ TGYLSA+P  +FD+++ L   W
Sbjct: 174 GHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDAYDELAEAW 233

Query: 227 APYYTIHKIL 236
           +PYYTIHK +
Sbjct: 234 SPYYTIHKFI 243


>gi|384252025|gb|EIE25502.1| DUF1680-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 648

 Score =  417 bits (1071), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 247/645 (38%), Positives = 359/645 (55%), Gaps = 52/645 (8%)

Query: 97  NFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWE 155
           + ++   L  + L++ S+  +A   N +Y+L L+ D L+ +FR  A LP+  + + G WE
Sbjct: 20  DIIQPFPLDQITLERDSLFDKALALNTDYMLQLNADQLLHTFRLNAGLPSSAQPFTGSWE 79

Query: 156 NPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL 215
           +P  E+RG F+GHYLSA + +   T N  I+ +++ ++  L + Q  +  GYLSAFP E 
Sbjct: 80  DPSCEVRGQFMGHYLSACSMLVNHTGNGKIESRLTYIIDELRKVQIALSGGYLSAFPEEH 139

Query: 216 FDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSV 275
           F   ++L+ VWAP+Y IHKI+AGLLD +       AL+M     E+F      V+     
Sbjct: 140 FVRLQSLQTVWAPFYVIHKIMAGLLDAHNFLGYDVALEMVKDEAEHFTRYYNDVVATNGT 199

Query: 276 ERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHAN 334
           E HW  + E E GGMN+VL+ LY +T DP+H+ LA  F KP F   L    D L   HAN
Sbjct: 200 E-HWLRMLEVEFGGMNEVLFNLYDVTGDPEHIRLAEAFTKPKFFEPLLQNTDPLPGLHAN 258

Query: 335 THIPIVIGSQMRYE-VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
           TH+  V G   R+E  + D  Y  +  FF  IV   HS+ATGG +  E+W  P++LAD++
Sbjct: 259 THLAQVNGFAARFEKASHDGSYAAVTNFF-SIVTRGHSFATGGNNDHEYWGPPRQLADSI 317

Query: 394 ---GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR--------GTE 442
               +E EETCT YNMLK++R+LFRWT    +ADYYERA+ NG+L  QR         + 
Sbjct: 318 LLHATETEETCTQYNMLKIARYLFRWTGAPVFADYYERAILNGLLGTQRMPADYSPHTSR 377

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
           PGV+IY+LP+G G +K  ST GWG   +SFWCCYG+ +ESFSKL DSI+F  + +   L 
Sbjct: 378 PGVVIYLLPMGSGQTKGGSTRGWGDPLHSFWCCYGSSVESFSKLADSIFFYRQAHSSCLT 437

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL----TFSSKQEVGQLS-------- 550
           +  Y +  +   S          P+V     L+ +     T S+   V  LS        
Sbjct: 438 LHAYPAHFYTSAS-------LASPLVGLSVQLQASFFQGTTASANITVAPLSAAAHDSTA 490

Query: 551 --SLNLRMPVWTYSNGAQASLNGQN------LPLPPPGNFLSATERWSYNDKLTIQLPLS 602
             +L LR+P W  S+G +  +NGQ+         P  G+F +   R++  DK+T+ LP+S
Sbjct: 491 EVTLKLRIPSWAVSSGVRVEVNGQSWADCAPAAGPQAGSFCTVRRRFAAGDKVTLALPMS 550

Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
           +R E +QDDRPEY+S  AI+ GP L+AG T+G   I+    R ++ L++ I     A L+
Sbjct: 551 IRAERVQDDRPEYSSQHAIMMGPLLMAGITNGSRSIQ-ADPRKVADLLTDISSQGLASLI 609

Query: 663 TFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLI-LKD 706
                 G+    + +    +  E  P+ G   AL +TFRL+ LKD
Sbjct: 610 I----PGDLPLHIRHEGAMLRAE--PMKGP-YALDSTFRLLGLKD 647


>gi|390957656|ref|YP_006421413.1| hypothetical protein Terro_1782 [Terriglobus roseus DSM 18391]
 gi|390412574|gb|AFL88078.1| hypothetical protein Terro_1782 [Terriglobus roseus DSM 18391]
          Length = 635

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 221/552 (40%), Positives = 292/552 (52%), Gaps = 43/552 (7%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG-HYLSASAQ 175
           R+   N +YL  L VD L+ SFR TA + +  K YGGWE P  ELRGHF G HYLSA A 
Sbjct: 60  RSADVNEKYLDSLQVDRLLHSFRLTAGITSSAKPYGGWEIPNGELRGHFAGGHYLSAVAF 119

Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
             A   N T++EK + +V  L+ CQ   G GYLSA+P ELF      K VWAP+YT HKI
Sbjct: 120 ASAGAGNTTLREKGNALVAGLAACQKANGNGYLSAYPPELFQRLALGKQVWAPFYTYHKI 179

Query: 236 LAGLLDQYVLADNAQALK----MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMND 291
           +AGL+D Y    N  ALK    MA W   YF +       M   +R    L  E GGMN+
Sbjct: 180 MAGLVDMYTQTGNEDALKVAEGMAGWSSAYFAD-------MSDAQRQGI-LRIEYGGMNE 231

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTG 351
           VL  LYS+T   ++L  A  F++P FL  LA   D L   HANT IP +IG+   YE TG
Sbjct: 232 VLVNLYSLTGKERYLSQARKFEQPTFLDPLAAHRDELQGLHANTSIPKIIGAARMYEATG 291

Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK-RLADTLGSENEETCTTYNMLKVS 410
           D  Y+ I ++F+D V ++H+YA G TS  E W  P   LA +L  +N E C  YN++K+ 
Sbjct: 292 DRRYQEIASYFLDDVLSAHTYAIGNTSDDEHWRTPAGSLAGSLSLKNAECCVAYNLMKLE 351

Query: 411 RHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFN 470
           RHL  WT +  + D YER L N  L  Q     G+  Y  PL  G  +      +G+   
Sbjct: 352 RHLSAWTGDARWMDAYERTLFNARLGTQDAA--GLKQYFFPLAAGYWRV-----YGSPEE 404

Query: 471 SFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSW 530
           SFWCC GTG E F+K GDSIYF     V   Y+ Q+I+S   WK     L Q+       
Sbjct: 405 SFWCCTGTGAEDFAKFGDSIYFHANDTV---YVNQFIASVLTWKEKGFTLRQETS--FPS 459

Query: 531 DPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERW 589
           +   R+T+  +  QE     S+ +R+P W  ++G   ++N + L     PG++L     W
Sbjct: 460 ESQTRLTIQTAQPQE----RSIAIRIPSWI-ADGGFVAVNDKRLEAFAEPGSYLVIRRTW 514

Query: 590 SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH-----TSGEWDIKT--GT 642
              D +T+ LP++LR E +    P   +  A L+GP +LAG      TSG   I T  GT
Sbjct: 515 HAGDTVTVHLPMALREEPL----PGSPNTAAALYGPLVLAGTLGDGPTSGPTKILTGRGT 570

Query: 643 ARSLSALISPIP 654
           A       +P+P
Sbjct: 571 APEGVPAAAPLP 582


>gi|225872906|ref|YP_002754363.1| Tat pathway signal sequence domain-containing protein
           [Acidobacterium capsulatum ATCC 51196]
 gi|225794208|gb|ACO34298.1| Tat pathway signal sequence domain protein [Acidobacterium
           capsulatum ATCC 51196]
          Length = 644

 Score =  347 bits (891), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 209/527 (39%), Positives = 287/527 (54%), Gaps = 39/527 (7%)

Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG-HY 169
           +  VL  A + N +YL ++  D L+ +FR TA LPT  +  GGWE P  ELRGHF G HY
Sbjct: 66  RDGVLKNALEINRQYLYLVPNDRLLHTFRLTAGLPTSAEPLGGWEAPDCELRGHFAGGHY 125

Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPY 229
           LSA A M+AST +  IK K   +V  L++CQ     GYLSAFP   FD     + VWAP+
Sbjct: 126 LSACALMYASTGDEKIKAKGDALVAELAKCQQP--DGYLSAFPASFFDRLRHYQKVWAPF 183

Query: 230 YTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
           YT HKI+AG LD YV   N QAL    +MA W +EY      K I     +R    L  E
Sbjct: 184 YTYHKIMAGHLDMYVHTGNQQALETCKRMADWAIEY-----TKPIPADQWQR---MLLVE 235

Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
            GGMN+V + LY++T + K+  L   F+       LA + D+L+  HANT+IP VIG+  
Sbjct: 236 QGGMNEVSFNLYAVTGEKKYRDLGFRFEHKLIFDPLAKREDHLAGNHANTNIPKVIGAAR 295

Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
            YEV  D  Y  I  FF   V + H+YATGGTS  EFW  P  LA+ LG   EE C +YN
Sbjct: 296 GYEVADDKRYHTIAEFFWGAVTSQHAYATGGTSDGEFWHKPGTLAEHLGPAAEECCCSYN 355

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
           M+K+SRHL+ WT +    DYYER + N  +  Q     G+++Y + L  G  K      +
Sbjct: 356 MMKLSRHLYGWTGDPRIFDYYERLMYNVRIGTQ--DPKGMLMYYVSLKPGYWKT-----F 408

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
           GT F++FWCC GTG+E +SK+ DSIYF +  N+   Y+  +  S   W   +V L Q+ +
Sbjct: 409 GTPFDAFWCCTGTGVEEYSKVNDSIYFHDAKNI---YVNLFAGSEVQWPEKNVSLVQETN 465

Query: 526 -PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFL 583
            P       L    T + + +      L +R+P W  +NG    +NGQ   +   P ++ 
Sbjct: 466 FP-------LEEATTLTVRAQKPSAFGLKIRVPYWA-TNGFTIHINGQPQSVEAKPESYA 517

Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +    W   D + + +P+SL    I    P+   +QA+L+GP +LAG
Sbjct: 518 TLHRTWHDGDTIKVSMPMSLHISPI----PDSPDVQAVLYGPLVLAG 560


>gi|413926259|gb|AFW66191.1| hypothetical protein ZEAMMB73_605676 [Zea mays]
 gi|413952505|gb|AFW85154.1| hypothetical protein ZEAMMB73_422486 [Zea mays]
          Length = 250

 Score =  344 bits (882), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 160/239 (66%), Positives = 196/239 (82%), Gaps = 1/239 (0%)

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
           MRYEVTGDPLYK I +FFMD +N+SHSYATGGTSA EFW DPKRLA TL +ENEE+CTTY
Sbjct: 1   MRYEVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTY 60

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           NMLKVSR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP   G SKA S HG
Sbjct: 61  NMLKVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHG 120

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
           WGTK++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+  + + Q++
Sbjct: 121 WGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQQI 180

Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
             + S D YL+++ + S+    GQ +++N R+P WT+++GA A+LNG++L    PG  +
Sbjct: 181 KTLSSSDQYLQISFSISANTS-GQTANINFRIPSWTFADGAGATLNGKDLGSISPGKIV 238


>gi|116620365|ref|YP_822521.1| hypothetical protein Acid_1242 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116223527|gb|ABJ82236.1| protein of unknown function DUF1680 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 664

 Score =  336 bits (862), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 215/540 (39%), Positives = 290/540 (53%), Gaps = 53/540 (9%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE-----------NPISELRGHFV 166
           A + N  Y+  L  D L+ +FR  A LP+  +  GGWE           N   ELRGHFV
Sbjct: 82  AAEWNRGYMNRLPADRLLHAFRLNAGLPSSAQPLGGWEIYVEPTPGKRINSEGELRGHFV 141

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSAFPTELFDSFEALKPV 225
           GH+LSASAQ++AS  +   K K   +V  L++CQ K+G +GYLSAFP E FD  +A KPV
Sbjct: 142 GHFLSASAQLYASMGDKDAKAKADYIVAELAKCQQKLGPSGYLSAFPIEWFDRLDARKPV 201

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALK----MATWMVEYFYNRVQKVITMYSVERHWYS 281
           WAP+YTIHKI+AG+ D Y LA N QAL+    M+ W  E+         T    E H   
Sbjct: 202 WAPFYTIHKIMAGMFDMYTLAGNQQALQVLEGMSNWADEW---------TASKSEAHMQD 252

Query: 282 -LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV 340
            L  E GGMN+VLY L ++T + +       F K  F   LAL+ D L+  H NTHIP V
Sbjct: 253 ILRTEYGGMNEVLYNLAAVTGNDRWAKAGDRFTKKEFFNPLALRNDALTGLHVNTHIPQV 312

Query: 341 IGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW-DPKRLADTLGSE--N 397
           IG+  RYE++ D  +  +  +F   V  + SY T GTS  E W   P+ LA  L      
Sbjct: 313 IGAAARYEISSDMRFHDVADYFWYEVVTARSYVTEGTSNGEGWLTQPRMLAAELKRSVAT 372

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVL-SIQRGTEPGVMIYMLPLGRGV 456
            E C +YNMLK++RHL+ W  + AY DYYERAL N  L +IQ  T  G   Y L L  G 
Sbjct: 373 AECCCSYNMLKLTRHLYGWKPDPAYFDYYERALFNHRLGTIQPKT--GYTQYYLSLTPGA 430

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
            K      + T+  SFWCC G+G+E +SKL DSIY+ +     GL +  +I S  +W+  
Sbjct: 431 WKT-----FNTEDKSFWCCTGSGVEEYSKLNDSIYWHD---AEGLTVNLFIPSELNWEEK 482

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
              L Q+      +      TLT ++ +      ++ LR+P WT S  A   +NG+ + +
Sbjct: 483 GFRLRQE----TKFPEQQSTTLTVTAAKSAPM--AMRLRIPAWTKS--AAVKINGRAVDV 534

Query: 577 -PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
            P PG++L+ T  W   DK+ + LP+ L  E + DD       QA L+GP +LAG    E
Sbjct: 535 TPTPGSYLTLTRPWKAGDKIEMTLPMHLSVEYMPDD----PKTQAFLYGPIVLAGDLGAE 590


>gi|302844990|ref|XP_002954034.1| hypothetical protein VOLCADRAFT_106211 [Volvox carteri f.
           nagariensis]
 gi|300260533|gb|EFJ44751.1| hypothetical protein VOLCADRAFT_106211 [Volvox carteri f.
           nagariensis]
          Length = 1160

 Score =  328 bits (842), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 177/361 (49%), Positives = 225/361 (62%), Gaps = 21/361 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLL-MLDVDSLVWSFRKTASLPTPGKAY-GGWEN 156
           ++  +L DV L  +S   R ++ N +YLL MLD D L+WSFRKTA LPTPG+ Y   WE+
Sbjct: 30  IEPFALSDVRLLDTSHQIRYERLNAKYLLEMLDPDRLLWSFRKTAGLPTPGQPYIASWED 89

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSAFPTEL 215
           P  ELRGHFVGHYLSA +  +AST N     +++ +V  L + Q  +G  GYLSAFP+E 
Sbjct: 90  PGCELRGHFVGHYLSALSLAYASTGNIAFHTRLALMVSELGKVQQALGLGGYLSAFPSEF 149

Query: 216 FDSFEALKPVWAPYYTI-----------HKILAGLLDQYVLADNAQALKMATWMVEYFYN 264
           FD  EALKPVWAPYYTI           HKI+AGL+D Y L    +AL MA+ MV Y +N
Sbjct: 150 FDRVEALKPVWAPYYTIPIAPFPDTTQIHKIIAGLVDAYELGGQKEALAMASRMVAYHWN 209

Query: 265 RVQKVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
           R Q +I     E HW   LN E GGMN++LYR++ IT DP HL  A LF+KP F+  +  
Sbjct: 210 RTQALIASKGRE-HWNGVLNCEFGGMNEILYRMHRITKDPTHLEFARLFEKPFFMKPMVN 268

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
             D L   HANTH+  V G    Y+  GD   +     F DIV   HS+ATGG++  EFW
Sbjct: 269 NFDILESLHANTHLAQVAGFAEAYDTVGDEAARNATRNFFDIVTTHHSFATGGSNDHEFW 328

Query: 384 WDPKRLADTL-----GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
             P R+AD++       E +ETCT YN+LK++R LFRWT  +AYAD+YERAL NG+L   
Sbjct: 329 QAPDRMADSVIKQKDAVETQETCTQYNILKIARSLFRWTGNVAYADFYERALLNGILGTA 388

Query: 439 R 439
           R
Sbjct: 389 R 389



 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 73/223 (32%), Positives = 118/223 (52%), Gaps = 34/223 (15%)

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE----EEGN- 497
           PGV +Y+ PLG G SK+ + H WG  ++SFWCCYGT +ES +KL DSIYF+    ++G  
Sbjct: 486 PGVFLYLTPLGTGQSKSDNIHHWGFPYHSFWCCYGTVVESHAKLADSIYFKDMNPQQGGP 545

Query: 498 --------VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF---SSKQEV 546
                    P LYI Q + S   W    + +  + D + +  P     + F   S+    
Sbjct: 546 SDPSAPKLPPRLYINQLVPSKVTWHELGLRITTEAD-MFAPGPAATAQIRFDPLSAAAAG 604

Query: 547 GQLS---SLNLRMPVWTYSNGAQAS----------LNGQ---NLP-LPPPGNFLSATERW 589
            QLS   +L +R+P W     A  +          +NGQ   + P  P PG++   T +W
Sbjct: 605 SQLSAMFTLMVRVPEWAAREAASGTAGRGRGISIGVNGQSWTSCPGAPVPGSYCQVTRQW 664

Query: 590 SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
           S  D ++++LP+    + + ++RP+Y+ +QA++ GP+++AG T
Sbjct: 665 STGDVVSLRLPMRWWLKPLPENRPQYSGLQAVMMGPFVMAGIT 707


>gi|383316642|ref|YP_005377484.1| hypothetical protein [Frateuria aurantia DSM 6220]
 gi|379043746|gb|AFC85802.1| hypothetical protein Fraau_1370 [Frateuria aurantia DSM 6220]
          Length = 651

 Score =  328 bits (840), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 202/544 (37%), Positives = 284/544 (52%), Gaps = 42/544 (7%)

Query: 103 SLHDVWLDQSSV----LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           SL    LDQ ++       A   N  YL  L VD L  +F + A LP+  +  GGWE+P 
Sbjct: 58  SLQAFALDQVTLSPGPFAEAAAINARYLHQLPVDRLAHNFLRQAGLPSTAQPLGGWESPE 117

Query: 159 SELRGHFVG-HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
            ELRGHF G H+LSA+A +WA+T + T+K++   +V  L+ CQ     GYLSAFP   F+
Sbjct: 118 CELRGHFCGGHWLSAAALVWATTADRTLKQRADELVAILARCQRS--DGYLSAFPDSFFE 175

Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMAT----WMVEYFYNRVQKVITMY 273
                + VWAP+YT+HKIL G LD Y+ A N QAL +AT    W V +   R    +   
Sbjct: 176 RLSHGQKVWAPFYTLHKILCGHLDMYMHAGNQQALDIATGLGDWTVHWLNGRSDAQMNEI 235

Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
                   L  E GGMND L  LY+IT + ++L  AH FD+   L  LA   D L   H+
Sbjct: 236 --------LRTEYGGMNDALCELYAITGNGRYLDAAHRFDQASLLDPLAAHRDELKGLHS 287

Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD-PKRLADT 392
           NT +P +IG+  RYE+TG+  Y+ +  F  + ++ +  YA GG+S  EFW + P  L D 
Sbjct: 288 NTQLPKIIGAARRYELTGEQRYRRMAEFGWETISGTRCYANGGSSNDEFWNNGPDDLHDQ 347

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
           LG    E C  YN+LK++RH++ WT +    DYYER L N  L  Q     G+ +Y  PL
Sbjct: 348 LGVAAAECCVAYNLLKLTRHVYGWTGDPRAFDYYERNLYNARLGTQ--DPAGMKLYYYPL 405

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
             G  K      + +  +SFWCC GTG E F++  DSIYF   G    LY+  YI+S   
Sbjct: 406 APGSYKY-----FNSPLHSFWCCTGTGAEEFARFNDSIYFHTPGE---LYVNLYIASRLK 457

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
           W    + L+Q            ++ LT  ++  +      NLR+P WT +   Q  +N Q
Sbjct: 458 WAEQGLTLSQLTRFPEQDVSDFKLQLTAPARLRI------NLRIPSWT-AGAPQLWINDQ 510

Query: 573 NLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
              +   PG++LS    W   D L +QLP+ L+ + +  D  ++    A+L+GP  LA  
Sbjct: 511 LQNVSALPGSYLSIERMWHDKDHLRLQLPMQLKMQPLPGDDAQF----ALLYGPITLAAE 566

Query: 632 TSGE 635
             G+
Sbjct: 567 LPGD 570


>gi|427385118|ref|ZP_18881623.1| hypothetical protein HMPREF9447_02656 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727286|gb|EKU90146.1| hypothetical protein HMPREF9447_02656 [Bacteroides oleiciplenus YIT
           12058]
          Length = 629

 Score =  325 bits (833), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 200/547 (36%), Positives = 287/547 (52%), Gaps = 30/547 (5%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
           RA + +  +L   DV+  + +FR TA L T  +  GGWE+   ELRGH  GH LSA + M
Sbjct: 60  RAMEVDQRWLKEADVNRFLHAFRVTAGLATGAQNLGGWESLDCELRGHTTGHLLSALSLM 119

Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
           +AST +   + K + +V  L+ECQ  +G  GYLSAFP    D     + VWAP+YT+HK+
Sbjct: 120 YASTGDEQYRTKGAELVKGLAECQQTLGKNGYLSAFPEYFIDRAIKEEIVWAPFYTLHKV 179

Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYR 295
            AGLLDQY L  N QAL + T M ++ YN++ K +T   ++     LN E GGM +  Y 
Sbjct: 180 YAGLLDQYTLCGNQQALDVLTGMCDWAYNKL-KPLTPTQLQG---MLNSEFGGMPETFYN 235

Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY 355
           LY++T + +H  LA +F     L  LA + D L+  H NT IP V+G    YE+TG+P  
Sbjct: 236 LYALTGNARHKELAEMFYHNSILDPLAARRDSLAGIHVNTQIPKVLGEARGYEMTGNPQS 295

Query: 356 KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
             I  FF + V   H+Y TGG S +E +  P  L+D L     ETC TYNMLK++RHLF 
Sbjct: 296 ATIANFFWEAVVGDHTYVTGGNSDKEIFSKPGILSDQLSENTTETCNTYNMLKLTRHLFT 355

Query: 416 WTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCC 475
           W    A ADYYERAL N +LS Q   E G + Y   L  G  K      +   F    CC
Sbjct: 356 WDASPARADYYERALYNHILSSQN-PETGGVTYYHTLHPGSCKK-----FHYPFRDNTCC 409

Query: 476 YGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR 535
            GTG E+ +K G++IY+ +  +  GLY+  +I+S  +WK   + + Q+ +    +     
Sbjct: 410 VGTGYENHAKYGEAIYY-KTADQSGLYVNLFIASVLNWKEKDLTVRQETN----YPDEAS 464

Query: 536 MTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDK 594
             +T ++  E G      LR P W   +G    +NG+   +   PG+++     W   D 
Sbjct: 465 TRITIAAAPEAGIQMPFMLRYPSWAV-DGVTIKVNGKKQHVKKAPGSYIHIDRTWRQGDV 523

Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWD--------IKTGTARSL 646
           +T+++P+SL  E + D + +     AIL+GP +LA       D           G  R +
Sbjct: 524 ITMEMPMSLHIEYMPDTKEK----GAILYGPIVLAAELGKTEDPAQNPAVPTLAGDFRKI 579

Query: 647 SALISPI 653
              I P+
Sbjct: 580 EQCIKPV 586


>gi|262407449|ref|ZP_06083997.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|262354257|gb|EEZ03349.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
          Length = 642

 Score =  316 bits (809), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV  L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 69  WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSAFP EL +     K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QALK  T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 189 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 244

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   K 
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 304

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+ +  L     ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 364

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + L Q+ + P          
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFP-------KEE 468

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S  A+  +NG+ + +   PG++++ T  W  ND++
Sbjct: 469 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 526

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ +  EA     P+  +  A+L+GP +LAG    E
Sbjct: 527 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 562


>gi|336404833|ref|ZP_08585521.1| hypothetical protein HMPREF0127_02834 [Bacteroides sp. 1_1_30]
 gi|335940654|gb|EGN02520.1| hypothetical protein HMPREF0127_02834 [Bacteroides sp. 1_1_30]
          Length = 640

 Score =  316 bits (809), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV  L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 67  WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 126

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSAFP EL +     K VWAP+YT+HK+ +
Sbjct: 127 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 186

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QALK  T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 187 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 242

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   K 
Sbjct: 243 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 302

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+ +  L     ETC TYNMLK+SRHLF WT
Sbjct: 303 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 362

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 363 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 416

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + L Q+ + P          
Sbjct: 417 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFPK-------EE 466

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S  A+  +NG+ + +   PG++++ T  W  ND++
Sbjct: 467 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 524

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ +  EA     P+  +  A+L+GP +LAG    E
Sbjct: 525 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 560


>gi|294810816|ref|ZP_06769462.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
 gi|294442004|gb|EFG10825.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 642

 Score =  316 bits (809), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV  L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 69  WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSAFP EL +     K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QALK  T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 189 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 244

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   K 
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 304

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+ +  L     ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 364

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + L Q+ + P          
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFP-------KEE 468

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S  A+  +NG+ + +   PG++++ T  W  ND++
Sbjct: 469 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 526

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ +  EA     P+  +  A+L+GP +LAG    E
Sbjct: 527 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 562


>gi|345512074|ref|ZP_08791613.1| hypothetical protein BSAG_00984 [Bacteroides sp. D1]
 gi|229443482|gb|EEO49273.1| hypothetical protein BSAG_00984 [Bacteroides sp. D1]
          Length = 640

 Score =  315 bits (808), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV  L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 67  WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 126

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSAFP EL +     K VWAP+YT+HK+ +
Sbjct: 127 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 186

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QALK  T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 187 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 242

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   K 
Sbjct: 243 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 302

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+ +  L     ETC TYNMLK+SRHLF WT
Sbjct: 303 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 362

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 363 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 416

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + L Q+ + P          
Sbjct: 417 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFP-------KEE 466

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S  A+  +NG+ + +   PG++++ T  W  ND++
Sbjct: 467 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 524

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ +  EA     P+  +  A+L+GP +LAG    E
Sbjct: 525 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 560


>gi|294646892|ref|ZP_06724513.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|292637837|gb|EFF56234.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
          Length = 640

 Score =  315 bits (808), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV  L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 67  WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 126

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSAFP EL +     K VWAP+YT+HK+ +
Sbjct: 127 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 186

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QALK  T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 187 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 242

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   K 
Sbjct: 243 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 302

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+ +  L     ETC TYNMLK+SRHLF WT
Sbjct: 303 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 362

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 363 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 416

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + L Q+ + P          
Sbjct: 417 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFP-------KEE 466

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S  A+  +NG+ + +   PG++++ T  W  ND++
Sbjct: 467 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 524

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ +  EA     P+  +  A+L+GP +LAG    E
Sbjct: 525 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 560


>gi|423212948|ref|ZP_17199477.1| hypothetical protein HMPREF1074_01009 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392694204|gb|EIY87432.1| hypothetical protein HMPREF1074_01009 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 642

 Score =  315 bits (808), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 198/519 (38%), Positives = 284/519 (54%), Gaps = 33/519 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV  L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 69  WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSAFP EL +     K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QALK  T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 189 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 244

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   K 
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 304

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK  +  L     ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKNFSKHLTGYTGETCCTYNMLKLSRHLFCWT 364

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   V L Q+ +      P    T
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGVTLLQETE-----FPKEETT 470

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
           L  + + E    +++ LR P W  S  A+  +NG+ + +   PG++++ T  W  ND+++
Sbjct: 471 L-LTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRIS 527

Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
              P+ +  EA     P+  +  A+L+GP +LAG    E
Sbjct: 528 ATYPMQIELEAT----PDNPNKVALLYGPLVLAGERGTE 562


>gi|298483785|ref|ZP_07001958.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           D22]
 gi|298270079|gb|EFI11667.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           D22]
          Length = 642

 Score =  315 bits (807), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 69  WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSAFP EL +     K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QALK  T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 189 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 244

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   K 
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 304

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+ +  L     ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 364

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + L Q+   P          
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETGFP-------KEE 468

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S  A+  +NG+ + +   PG++++ T  W  ND++
Sbjct: 469 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 526

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ +  EA     P+  +  A+L+GP +LAG    E
Sbjct: 527 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 562


>gi|330995449|ref|ZP_08319354.1| hypothetical protein HMPREF9442_00414 [Paraprevotella xylaniphila
           YIT 11841]
 gi|329575517|gb|EGG57055.1| hypothetical protein HMPREF9442_00414 [Paraprevotella xylaniphila
           YIT 11841]
          Length = 618

 Score =  315 bits (807), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 204/588 (34%), Positives = 295/588 (50%), Gaps = 57/588 (9%)

Query: 85  KIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL 144
           K+++P   +L     K+V L   W+ Q   L      ++ YL  ++ D L+ +FR TA L
Sbjct: 24  KVESPSVVELRPFSGKDVELEASWIKQREDL------DVAYLQSVEADRLLHNFRVTAGL 77

Query: 145 PTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG 204
           P+  K   GWE+P   LRGHF GHYLSA + +     +    +++  +V  L +CQ   G
Sbjct: 78  PSLAKPLEGWESPGVGLRGHFTGHYLSALSVLAERYGDGWASQRLEYMVDELYKCQQAHG 137

Query: 205 TGYLSAFPTELFDSFEA-LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFY 263
            GYLSAFP + F++ E     VWAPYYT+HKIL GLLD Y    N +A  M   +  Y  
Sbjct: 138 NGYLSAFPEKDFETLETRFTGVWAPYYTLHKILQGLLDAYTKTGNRKAYGMVEALAGYVE 197

Query: 264 NRVQKVITMYSVERHWYSL----NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
            R+ K ++   +ER  Y++      E G MN+ LY LY I+ +P+HL LA  FD   FL 
Sbjct: 198 GRMAK-LSPERIERMMYTVEANPQNEAGAMNEALYELYGISGNPRHLALAACFDPAWFLE 256

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            L    D L+  HANTHI +V G   RYEVTG+  YK     F DI+   H+Y  G +S 
Sbjct: 257 PLVRNEDILAGLHANTHIVLVNGFARRYEVTGEEKYKKAAMQFWDILQRGHAYVNGTSSG 316

Query: 380 ------------REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
                        E W +P  L +TL  E  E+C T+N  K+S +LF WT +  YAD Y 
Sbjct: 317 PRPVVTTRTSLTAEHWGEPGHLCNTLTREIAESCVTHNTQKLSAYLFGWTGDPCYADAYM 376

Query: 428 RALTNGVLSIQ-RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKL 486
               NG L +Q R T  G  +Y LPLG   +K         K N F+CC G+  E+F+KL
Sbjct: 377 NTFYNGALPVQSRST--GAYVYHLPLGSPRNKKY------LKDNDFFCCSGSCAEAFAKL 428

Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK----VDPIVSWDPYLRMTLTFSS 542
              IY+ ++  V   ++  Y+ S   W S  V L Q     + PI  +   +R  ++F  
Sbjct: 429 NSGIYYHDDSAV---FVNLYVPSELHWTSKKVELEQTGGFPLQPIADFTVSVRRPVSF-- 483

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPL 601
                   +LNL +P W  + G    +NG+   +P  P +FL  + RW+  D++ +    
Sbjct: 484 --------TLNLFVPAW--AEGTVVYVNGEKQDMPVRPSSFLRISRRWADGDRVRMDFRY 533

Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSAL 649
           + R +++    P+  ++ A+ +GP LLA  T  E  +K      L  L
Sbjct: 534 AFRLQSM----PDKENMFAVFYGPMLLAFETRSEVILKGSKDEVLQGL 577


>gi|160883345|ref|ZP_02064348.1| hypothetical protein BACOVA_01314 [Bacteroides ovatus ATCC 8483]
 gi|156111329|gb|EDO13074.1| hypothetical protein BACOVA_01314 [Bacteroides ovatus ATCC 8483]
          Length = 643

 Score =  315 bits (806), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 194/520 (37%), Positives = 288/520 (55%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A ++
Sbjct: 69  WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALIY 128

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSAFP EL +     K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QALK+ T M ++ YN+++ +    + E     +  E GG+N+  Y LY
Sbjct: 189 GLIDQYLYADNLQALKVVTKMGDWAYNKLKPL----TEETRKLMIRNEFGGINESFYNLY 244

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   + 
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 304

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+L+  L     ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 364

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGAHKLYS-----TKENSFWCCVG 418

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + + Q+ + P          
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 468

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S   +  +NG+ + +   PG+++  T  W   D++
Sbjct: 469 TTRFTLRTENPVRTTIYLRYPSW--SKDVKVLVNGKKISVKQKPGSYIVITREWKDGDQI 526

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ ++ EA  D+ P+ A   A+L+GP +LAG    E
Sbjct: 527 SATYPMQIKLEATPDN-PDKA---ALLYGPLVLAGERGTE 562


>gi|116625830|ref|YP_827986.1| hypothetical protein Acid_6783 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116228992|gb|ABJ87701.1| protein of unknown function DUF1680 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 675

 Score =  314 bits (805), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 205/570 (35%), Positives = 290/570 (50%), Gaps = 59/570 (10%)

Query: 84  RKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTAS 143
           RKI  P     P        +  V L   S    +Q+ N  Y+  L  D L+ +FR  A 
Sbjct: 55  RKIVTPRAEPFP--------MPQVRLLPGSAYHDSQEWNRGYMERLAADRLLHTFRANAG 106

Query: 144 LPT-PGKAYGGWENP-----ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLS 197
           LP    K  GGWE P      SELRGHF GH+LSASAQ+ ++  +   + K   +V  ++
Sbjct: 107 LPVGSAKPLGGWEQPENGQRSSELRGHFAGHFLSASAQL-SANGDKNAQSKGDFMVAEMA 165

Query: 198 ECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALK---- 253
            CQ K+G  YLSAFPT  +D     + VWAP+YTIHKI+AG+ D Y LA N QAL+    
Sbjct: 166 RCQQKLGGKYLSAFPTTWWDRLGKGERVWAPFYTIHKIMAGMFDMYSLAGNQQALEVLEG 225

Query: 254 MATWMVEYFYNR----VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLA 309
           MA W  E+   +    +Q+++T+            E GG+ + LYRL + T   +   + 
Sbjct: 226 MAAWADEWTAPKAAEHMQQILTI------------EFGGIAETLYRLAAATDQDRWGRVG 273

Query: 310 HLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNAS 369
             F K  FL  LA + D L   H NTHIP V+ +  RY+++GD  +  +  +F   V  +
Sbjct: 274 DRFQKKSFLNPLAARRDELRGLHVNTHIPQVMAAARRYDLSGDMRFHDVADYFFSEVAGA 333

Query: 370 HSYATGGTSAREFWWDPKRLADT---LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYY 426
            +Y TGGTS  E W  P R   T   L     E C  YNMLK++RHL+ W  + +Y DYY
Sbjct: 334 RTYVTGGTSNAEAWLAPPRRLATELKLSVNTAECCCAYNMLKLARHLYSWDPKPSYFDYY 393

Query: 427 ERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKL 486
           E  L N  +   R  + G+  Y L L  G  K      + T+  +FWCC G+G+E +SKL
Sbjct: 394 EHLLLNHRIGTIR-PKVGLTQYYLSLTPGAWKT-----FNTEDQTFWCCTGSGVEEYSKL 447

Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV 546
            DSIY+ +     GLY+  +ISS  DW      L Q      S  P   +T+T +   ++
Sbjct: 448 NDSIYWRDG---EGLYVNLFISSELDWAERGFKLRQATQYPAS--PSTALTVTAARAGDL 502

Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRT 605
               ++ LR+P W  S      LNG+ L     PG++L     W   D++ ++LP+ L  
Sbjct: 503 ----AIRLRIPGWLQS-APSVKLNGKALDASAAPGSYLVLKRNWKVGDRIDMELPMRLHV 557

Query: 606 EAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +A+ DD     ++QA L+GP +LAG   GE
Sbjct: 558 QAMPDD----PAMQAFLYGPLVLAGDLGGE 583


>gi|383123868|ref|ZP_09944538.1| hypothetical protein BSIG_4114 [Bacteroides sp. 1_1_6]
 gi|251838901|gb|EES66986.1| hypothetical protein BSIG_4114 [Bacteroides sp. 1_1_6]
          Length = 641

 Score =  314 bits (805), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 195/519 (37%), Positives = 285/519 (54%), Gaps = 33/519 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  LDV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 69  WMTSLDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSA+P EL +     K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYS 188

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QAL + T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 189 GLIDQYLYADNQQALSVVTKMGDWAYNKLKPL----SEETRRLMIRNEFGGINESFYNLY 244

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   K 
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKK 304

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+ +  L     ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWT 364

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + L Q+ D     +   R+T
Sbjct: 419 SGFENHAKYGEAIYYH---NDKGIYVNLFIPSQVTWKEKGLTLLQETD--FPKEETTRLT 473

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
           L    + E  + +++ LR P W  S   +  +NG+ + +   PG++++ T  W   D++ 
Sbjct: 474 L----RAEKPRHTTIYLRYPSW--SKNVKVLVNGKKVSVKQKPGSYIAITREWKDGDRIA 527

Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
              P+ +  EA     P+  +  A+L+GP +LAG    E
Sbjct: 528 ATYPMQIELEAT----PDNPNKVALLYGPLVLAGERGTE 562


>gi|29345547|ref|NP_809050.1| hypothetical protein BT_0137 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29337439|gb|AAO75244.1| Acetyl-CoA carboxylase-like protein [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 641

 Score =  314 bits (805), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 195/519 (37%), Positives = 285/519 (54%), Gaps = 33/519 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  LDV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 69  WMTSLDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSA+P EL +     K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYS 188

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QAL + T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 189 GLIDQYLYADNQQALSVVTKMGDWAYNKLKPL----SEETRRLMIRNEFGGINESFYNLY 244

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   K 
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKK 304

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+ +  L     ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWT 364

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + L Q+ D     +   R+T
Sbjct: 419 SGFENHAKYGEAIYYH---NDKGIYVNLFIPSQVTWKEKGLTLLQETD--FPKEETTRLT 473

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
           L    + E  + +++ LR P W  S   +  +NG+ + +   PG++++ T  W   D++ 
Sbjct: 474 L----RAEKPRHTTIYLRYPSW--SKNVKVLVNGKKVSVKQKPGSYIAITREWKDGDRIA 527

Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
              P+ +  EA     P+  +  A+L+GP +LAG    E
Sbjct: 528 ATYPMQIELEAT----PDNPNKVALLYGPLVLAGERGTE 562


>gi|298384470|ref|ZP_06994030.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           1_1_14]
 gi|298262749|gb|EFI05613.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           1_1_14]
          Length = 641

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 195/519 (37%), Positives = 285/519 (54%), Gaps = 33/519 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  LDV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 69  WMTSLDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSA+P EL +     K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYS 188

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QAL + T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 189 GLIDQYLYADNQQALSVVTKMGDWAYNKLKPL----SEETRRLMIRNEFGGINESFYNLY 244

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   K 
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKK 304

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+ +  L     ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWT 364

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + L Q+ D     +   R+T
Sbjct: 419 SGFENHAKYGEAIYYH---NDKGIYVNLFIPSQVTWKEKGLTLLQETD--FPKEETTRLT 473

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
           L    + E  + +++ LR P W  S   +  +NG+ + +   PG++++ T  W   D++ 
Sbjct: 474 L----RAEKPRHTTIYLRYPSW--SKNVKVLVNGKKVSVKQKPGSYIAITREWKDGDRIA 527

Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
              P+ +  EA     P+  +  A+L+GP +LAG    E
Sbjct: 528 ATYPMQIELEAT----PDNPNKVALLYGPLVLAGERGTE 562


>gi|423287825|ref|ZP_17266676.1| hypothetical protein HMPREF1069_01719 [Bacteroides ovatus
           CL02T12C04]
 gi|392671840|gb|EIY65311.1| hypothetical protein HMPREF1069_01719 [Bacteroides ovatus
           CL02T12C04]
          Length = 643

 Score =  313 bits (802), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 194/520 (37%), Positives = 287/520 (55%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A ++
Sbjct: 69  WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALIY 128

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSAFP EL +     K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QALK+ T M ++ YN+++ +    + E     +  E GG+N+  Y LY
Sbjct: 189 GLIDQYLYADNLQALKVVTKMGDWAYNKLKSL----TEETRKLMIRNEFGGINESFYNLY 244

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   + 
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARSYELTRNETSRK 304

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+L+  L     ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 364

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 418

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + + Q+ + P          
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 468

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S   +  +NG+ + +   PG+++  T  W   D++
Sbjct: 469 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVLVNGKKISVKQKPGSYIVITREWKDGDQI 526

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ ++ EA  D+ P  A   A+L+GP +LAG    E
Sbjct: 527 SATYPMQIKLEATPDN-PNKA---ALLYGPLVLAGERGTE 562


>gi|329957171|ref|ZP_08297738.1| hypothetical protein HMPREF9445_02614 [Bacteroides clarus YIT
           12056]
 gi|328523439|gb|EGF50538.1| hypothetical protein HMPREF9445_02614 [Bacteroides clarus YIT
           12056]
          Length = 694

 Score =  313 bits (801), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 190/519 (36%), Positives = 281/519 (54%), Gaps = 33/519 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASL-------PTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV+ L+ SFR  A +           K YGGWE+   ELRGH  GH LSA   M+
Sbjct: 121 WMTSIDVNRLIHSFRTNAGIWAGREGGYVTVKKYGGWESLDCELRGHTTGHLLSAYGLMY 180

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L + Q+ +G GYLSAFP EL +     + VWAP+YT+HK+ +
Sbjct: 181 AATGSEIFKLKGDSIVTELGKVQDALGNGYLSAFPEELINRNIKGQSVWAPWYTLHKLFS 240

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADNAQAL + T M ++ Y++++ +    S E     +  E GG+N+  Y LY
Sbjct: 241 GLIDQYLYADNAQALAVVTKMGDWAYDKLKPL----SEETRRRMIRNEFGGINESFYNLY 296

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           ++T D ++  LAH F     +  L  Q D L   H NT IP V+     YE+TGD   K 
Sbjct: 297 AVTGDERYRWLAHFFYHNDVIDPLKEQNDDLGTKHTNTFIPKVLAEARNYELTGDKDSKA 356

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++D KR +  L     ETC TYNMLK+SRHLF W 
Sbjct: 357 LSDFFWHTMIDHHTFAPGCSSQKEHYFDTKRFSHFLNGYTGETCCTYNMLKLSRHLFCWQ 416

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            +   ADYYERAL N +L  Q+  + G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 417 PDARIADYYERALYNHILG-QQDPQTGMVCYFLPLLSGAHKVYS-----TKENSFWCCVG 470

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           +G E+ +K G+ IY+    +  G+YI  +I S   WK   + L Q+     ++       
Sbjct: 471 SGFENHAKYGEGIYYR---SAAGIYINLFIPSVVRWKEKGITLKQE----TAFPAGEATV 523

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
           LT  + + V   +++ LR P W  S      +NG+ + +   PG++++    W   D++ 
Sbjct: 524 LTVEADRPV--RTTVYLRYPSW--SEKVTVRVNGKKVQVKRKPGSYIALNRLWQNGDRIE 579

Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
              P+ +  E   D+ P+     A+L+GP +LAG    E
Sbjct: 580 AAYPMRVHLETTPDN-PQKG---ALLYGPLVLAGERGTE 614


>gi|270296104|ref|ZP_06202304.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|423303646|ref|ZP_17281645.1| hypothetical protein HMPREF1072_00585 [Bacteroides uniformis
           CL03T00C23]
 gi|423307631|ref|ZP_17285621.1| hypothetical protein HMPREF1073_00371 [Bacteroides uniformis
           CL03T12C37]
 gi|270273508|gb|EFA19370.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|392688010|gb|EIY81301.1| hypothetical protein HMPREF1072_00585 [Bacteroides uniformis
           CL03T00C23]
 gi|392689500|gb|EIY82777.1| hypothetical protein HMPREF1073_00371 [Bacteroides uniformis
           CL03T12C37]
          Length = 641

 Score =  313 bits (801), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 188/488 (38%), Positives = 277/488 (56%), Gaps = 26/488 (5%)

Query: 149 KAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL 208
           K  GGWE+   ELRGH  GH LSA A M+AST +   K K  ++V  L+E Q  +G GYL
Sbjct: 99  KKLGGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYL 158

Query: 209 SAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
           SA+P EL +       VWAP+YT+HK+ +GL+DQY+ ADN  AL++ T M ++ YN++ K
Sbjct: 159 SAYPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYADNKPALEVVTRMGDWAYNKL-K 217

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
            +   + +R    +  E GG+N+  Y LY+IT D ++  LA  F     +  L  Q D L
Sbjct: 218 PLDEATRKR---MIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDL 274

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
              H NT IP V+     YE+T D   + +  FF   +   H++A G +S +E ++DP++
Sbjct: 275 GTKHTNTFIPKVLAEARNYELTQDNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQ 334

Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
           L+  L     ETC TYNMLK+SRHLF WT +   ADYYERAL N +L  Q+  E G++ Y
Sbjct: 335 LSKHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSY 393

Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
            LPL  G  K  S     T+ NSFWCC G+G ES +K G++IY   E    G+Y+  +I 
Sbjct: 394 FLPLLSGSHKVYS-----TRENSFWCCVGSGFESHAKYGEAIYCHNE---KGIYVNLFIP 445

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
           S  +WK+  + L Q+      +      TLT  + + V   +++ LR P W  S G + +
Sbjct: 446 SEVNWKAKGITLRQE----TGFPAEENTTLTIQTDKPV--TTTIYLRYPSW--SEGVKVN 497

Query: 569 LNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           +NG+ + +   PG++++ T +W   D++    P+SL+ E   D+ P+     A+L+GP +
Sbjct: 498 VNGKKVSVKQKPGSYIAVTRQWKDGDRIEANYPMSLQLETTSDN-PQKG---ALLYGPLV 553

Query: 628 LAGHTSGE 635
           LAG    E
Sbjct: 554 LAGELGTE 561


>gi|423313782|ref|ZP_17291717.1| hypothetical protein HMPREF1058_02329 [Bacteroides vulgatus
           CL09T03C04]
 gi|392684317|gb|EIY77645.1| hypothetical protein HMPREF1058_02329 [Bacteroides vulgatus
           CL09T03C04]
          Length = 640

 Score =  312 bits (800), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 187/519 (36%), Positives = 284/519 (54%), Gaps = 33/519 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  ++VD L+ SFR  A +           K  GGWE+   ELRGH  GH LSA   M+
Sbjct: 67  WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYGLMY 126

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K+K  ++V  L+E Q  +G GYLSA+P EL +       VWAP+YT+HK+ +
Sbjct: 127 AATGSEQFKQKGDSLVNGLAEVQTALGNGYLSAYPEELINRNICGTSVWAPWYTLHKLFS 186

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ +DN +AL++   M ++ Y++++ +      +     +  E GG+N+  Y LY
Sbjct: 187 GLIDQYLYSDNQKALEVVVRMADWAYHKLKPLDETTRQK----MIRNEFGGVNESFYNLY 242

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D +H  LA  F     +  L    D L   H NT IP VI     YE+T D   + 
Sbjct: 243 AITGDERHRWLAQFFYHNEVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTEDENSRK 302

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DP R +  +     ETC TYNMLK+SRHLF WT
Sbjct: 303 LSDFFWHTMIDHHTFAPGCSSDKEHYFDPARFSKHVSGYTGETCCTYNMLKLSRHLFCWT 362

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + A ADYYERAL N +L  Q+  + G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 363 ADAAIADYYERALYNHILG-QQDPQTGMVTYFLPLLSGSHKVYS-----TKENSFWCCVG 416

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           +G E+ +K G++IY+    N  G+Y+  +I S  +W+   + L Q+ D    +       
Sbjct: 417 SGFENHAKYGEAIYYH---NDKGIYVNLFIPSVVNWRKKGLTLRQETD----FPAEETTV 469

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
           LT  ++  V   +++ LR P W  S G +  +NG+ + +   PG++++ T  W   D++T
Sbjct: 470 LTIRAQNPVE--TTVYLRYPSW--SKGVKVFVNGKKIAVKQKPGSYIAITRLWKDGDRIT 525

Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
              P+ LR E   D+ P+     A+++GP +LAG    E
Sbjct: 526 ADYPMCLRVETTPDN-PQKG---ALVYGPVVLAGKRGTE 560


>gi|299146414|ref|ZP_07039482.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           3_1_23]
 gi|298516905|gb|EFI40786.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           3_1_23]
          Length = 642

 Score =  311 bits (798), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 196/520 (37%), Positives = 283/520 (54%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV  L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 69  WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +  GYLSAFP EL +     K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN QALK  T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 189 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 244

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   K 
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 304

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+ +  L     ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 364

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + L Q+ + P          
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFP-------KEE 468

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F  + E    +++ LR P W  S  A+  +NG+ + +    G++++ T  W  ND++
Sbjct: 469 TTRFIIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKSGSYIAITRDWKDNDRI 526

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ +  EA     P+  +  A+L+GP +LAG    E
Sbjct: 527 SATYPMQIELEAT----PDNPNKVALLYGPLVLAGERGTE 562


>gi|255692201|ref|ZP_05415876.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides
           finegoldii DSM 17565]
 gi|260622065|gb|EEX44936.1| hypothetical protein BACFIN_07304 [Bacteroides finegoldii DSM
           17565]
          Length = 644

 Score =  311 bits (798), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 193/520 (37%), Positives = 286/520 (55%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA   M+
Sbjct: 70  WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L E QN +  GYLSA+P EL +     K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN +AL + T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 190 GLIDQYLYADNKKALIIVTRMGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           SIT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   + 
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+L+  L     ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + + Q+ + P          
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S   +  +NG+ + +   PG++++ T  W  +D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVLVNGKKISVKQKPGSYIAITREWKDDDQI 527

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ ++ EA  D+ P  A   A+L+GP +LAG    E
Sbjct: 528 SATYPMQIKLEATPDN-PNKA---ALLYGPLVLAGERGTE 563


>gi|319643216|ref|ZP_07997844.1| hypothetical protein HMPREF9011_03445 [Bacteroides sp. 3_1_40A]
 gi|345520493|ref|ZP_08799881.1| hypothetical protein BSFG_01473 [Bacteroides sp. 4_3_47FAA]
 gi|254835017|gb|EET15326.1| hypothetical protein BSFG_01473 [Bacteroides sp. 4_3_47FAA]
 gi|317385120|gb|EFV66071.1| hypothetical protein HMPREF9011_03445 [Bacteroides sp. 3_1_40A]
          Length = 640

 Score =  311 bits (797), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 187/519 (36%), Positives = 284/519 (54%), Gaps = 33/519 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  ++VD L+ SFR  A +           K  GGWE+   ELRGH  GH LSA   M+
Sbjct: 67  WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYGLMY 126

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K+K  ++V  L+E Q  +G GYLSA+P EL +       VWAP+YT+HK+ +
Sbjct: 127 AATGSEQFKQKGDSLVNGLAEVQTALGNGYLSAYPEELINRNIRGTSVWAPWYTLHKLFS 186

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ +DN +AL++   M ++ Y++++ +      +     +  E GG+N+  Y LY
Sbjct: 187 GLIDQYLYSDNQKALEVVVRMADWAYHKLKPLDETTRQK----MIRNEFGGVNESFYNLY 242

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           +IT D +H  LA  F     +  L    D L   H NT IP VI     YE+T D   + 
Sbjct: 243 AITGDERHRWLAQFFYHNEVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTEDENSRK 302

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DP R +  +     ETC TYNMLK+SRHLF WT
Sbjct: 303 LSDFFWHTMIDHHTFAPGCSSDKEHYFDPARFSKHVSGYTGETCCTYNMLKLSRHLFCWT 362

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + A ADYYERAL N +L  Q+  + G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 363 ADAAIADYYERALYNHILG-QQDPQTGMVTYFLPLLSGSHKVYS-----TKENSFWCCVG 416

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           +G E+ +K G++IY+    N  G+Y+  +I S  +W+   + L Q+ D    +       
Sbjct: 417 SGFENHAKYGEAIYYH---NDKGIYVNLFIPSVVNWREKGLTLRQETD----FPAEETTV 469

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
           LT  ++  V   +++ LR P W  S G +  +NG+ + +   PG++++ T  W   D++T
Sbjct: 470 LTIRAQNPVE--TTVYLRYPSW--SKGVKVFVNGKKIAVKQKPGSYIAITRLWKDGDRIT 525

Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
              P+ LR E   D+ P+     A+++GP +LAG    E
Sbjct: 526 ADYPMCLRVETTPDN-PQKG---ALVYGPVVLAGKRGTE 560


>gi|424790951|ref|ZP_18217449.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
           pv. graminis ART-Xtg29]
 gi|422797791|gb|EKU25992.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
           pv. graminis ART-Xtg29]
          Length = 651

 Score =  311 bits (797), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 183/525 (34%), Positives = 278/525 (52%), Gaps = 45/525 (8%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG-HYLSASAQ 175
           +A+  +  YL+ +  D L+ +FR  A L +  +  GGWE+P  E+RGHF G HYLSA A 
Sbjct: 74  QARDRDRRYLMSIPNDRLLHTFRLVAGLDSQAEPLGGWESPHCEIRGHFAGGHYLSACAL 133

Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
           ++A+T +A +K+K   +V  L+ CQ     GY+ A+P+  +D     + VW P YT HKI
Sbjct: 134 LYAATGDAALKDKADALVAELARCQR--ADGYIGAYPSSFYDRLGRHEEVWVPIYTAHKI 191

Query: 236 LAGLLDQYVLADNAQALKMA--------TWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
           LAG LD    A NAQAL+ A         WM  +   + Q+++ +            E G
Sbjct: 192 LAGHLDMARHAGNAQALRTAQRFADWLGAWMDGFDDAQWQRILGV------------EFG 239

Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
           G++  L  LY ++ D K+   A  +++   L  LA Q D L+  HANT IP ++ +   Y
Sbjct: 240 GVHASLLELYLLSGDAKYQRWATRYEQASLLEPLAQQRDALAGLHANTQIPKIVAAARAY 299

Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNML 407
           E+ G P  + I  FF   V+  H+Y TGG S  E +  P   A  L   + E C +YNML
Sbjct: 300 EIDGAPRQRQIAEFFWRTVSGHHAYCTGGVSDYEMFGKPDHFAGHLSGHSHECCCSYNML 359

Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
           K++RHL+ W  + A  DYYER L N  L  Q   E G+M+Y +P+  G  K      + T
Sbjct: 360 KLTRHLYTWQPDAALMDYYERVLFNARLGTQ--DEAGMMMYFVPMDAGYWKL-----YNT 412

Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI 527
            F SFWCC GTG+E F+K  DSIYF ++    GL +  +I+S  DW    + + Q+    
Sbjct: 413 PFASFWCCTGTGVEEFAKSNDSIYFRDDA---GLTVNLFIASQLDWAERGLRVVQR---- 465

Query: 528 VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSAT 586
             +       L F  K+   Q  +L LR+P W  + G +  +NG+   +   PG++L+  
Sbjct: 466 TRFPQQEGTALEFQCKRP--QQMTLRLRIPYWA-TQGVRLRINGKAQAVKATPGSYLALE 522

Query: 587 ERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
            R++  D++ + LP++L    +    P+  S+QA+++GP +LA  
Sbjct: 523 RRFADGDRIELDLPMALHAAPL----PDEPSLQAMMYGPLVLAAQ 563


>gi|423222645|ref|ZP_17209115.1| hypothetical protein HMPREF1062_01301 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392641932|gb|EIY35705.1| hypothetical protein HMPREF1062_01301 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 641

 Score =  311 bits (796), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 185/488 (37%), Positives = 278/488 (56%), Gaps = 26/488 (5%)

Query: 149 KAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL 208
           K  GGWE+   ELRGH  GH LSA A M+AST +   K K  ++V  L+E Q  +G GYL
Sbjct: 99  KKLGGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYL 158

Query: 209 SAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
           SA+P EL +       VWAP+YT+HK+ +GL+DQY+  DN QAL++ T M ++ YN++ K
Sbjct: 159 SAYPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALEVVTRMGDWAYNKL-K 217

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
            +   + +R    +  E GG+N+  Y LY+IT D ++  LA  F     +  L  Q D L
Sbjct: 218 PLDEPTRKR---MIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDL 274

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
              H NT IP V+     YE+T D   + +  FF   +   H++A G +S +E ++DP++
Sbjct: 275 GTKHTNTFIPKVLAEARNYELTQDNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQ 334

Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
           L+  L     ETC TYNMLK+SRHLF WT +   ADYYERAL N +L  Q+  E G++ Y
Sbjct: 335 LSKHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSY 393

Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
            LPL  G  K  S     T+ NSFWCC G+G E+ +K G++IY+    N  G+Y+  +I 
Sbjct: 394 FLPLLSGSHKVYS-----TRENSFWCCVGSGFENHAKYGEAIYYH---NDQGIYVNLFIP 445

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
           S  +WK+  + L Q+     ++       LT  + + V   +++ LR P W  S   + +
Sbjct: 446 SEVNWKAKRITLRQE----TAFPAAENTALTIQTDKPV--TTTIYLRYPSW--SKNVKVN 497

Query: 569 LNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           +NG+ + +   PG++++ T +W   D++    P+SL+ E   D+ P+     A+L+GP +
Sbjct: 498 VNGKKVSVKQKPGSYIAVTRQWKDGDRIEANYPMSLQLETTPDN-PQKG---ALLYGPLV 553

Query: 628 LAGHTSGE 635
           LAG +  E
Sbjct: 554 LAGESGTE 561


>gi|224539132|ref|ZP_03679671.1| hypothetical protein BACCELL_04034 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224519254|gb|EEF88359.1| hypothetical protein BACCELL_04034 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 641

 Score =  310 bits (795), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 186/488 (38%), Positives = 279/488 (57%), Gaps = 26/488 (5%)

Query: 149 KAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL 208
           K  GGWE+   ELRGH  GH LSA A M+AST +   K K  ++V  L+E Q  +G GYL
Sbjct: 99  KKLGGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYL 158

Query: 209 SAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
           SA+P EL +       VWAP+YT+HK+ +GL+DQY+  DN QAL++ T M ++ YN++ K
Sbjct: 159 SAYPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALEVVTRMGDWAYNKL-K 217

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
            +   + +R    +  E GG+N+  Y LY+IT D ++  LA  F     +  L  Q D L
Sbjct: 218 PLDEPTRKR---MIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDL 274

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
              H NT IP V+     YE+T D   + +  FF   +   H++A G +S +E ++DP++
Sbjct: 275 GTKHTNTFIPKVLAEARNYELTQDNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQ 334

Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
           L+  L     ETC TYNMLK+SRHLF WT +   ADYYERAL N +L  Q+  E G++ Y
Sbjct: 335 LSKHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSY 393

Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
            LPL  G  K  S     T+ NSFWCC G+G E+ +K G++IY+    N  G+Y+  +I 
Sbjct: 394 FLPLLSGSHKVYS-----TRENSFWCCVGSGFENHAKYGEAIYYH---NDQGIYVNLFIP 445

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
           S  +WK+  + L+Q+    V  +      LT  + + V   +++ LR P W  S   + +
Sbjct: 446 SEVNWKAKGITLHQETAFPVEEN----TALTIQTDKPV--TTTIYLRYPSW--SKNVKVN 497

Query: 569 LNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           +NG+ + +   PG++++ T +W   D++    P+SL+ E   D+ P+     A+L+GP +
Sbjct: 498 VNGKKVSVKQKPGSYIAVTRQWKDGDRIEANYPMSLQLETTPDN-PQKG---ALLYGPLV 553

Query: 628 LAGHTSGE 635
           LAG +  E
Sbjct: 554 LAGESGTE 561


>gi|345512540|ref|ZP_08792066.1| hypothetical protein BSEG_01611 [Bacteroides dorei 5_1_36/D4]
 gi|423229086|ref|ZP_17215491.1| hypothetical protein HMPREF1063_01311 [Bacteroides dorei
           CL02T00C15]
 gi|423244926|ref|ZP_17226000.1| hypothetical protein HMPREF1064_02206 [Bacteroides dorei
           CL02T12C06]
 gi|345456387|gb|EEO45470.2| hypothetical protein BSEG_01611 [Bacteroides dorei 5_1_36/D4]
 gi|392634839|gb|EIY28751.1| hypothetical protein HMPREF1063_01311 [Bacteroides dorei
           CL02T00C15]
 gi|392640967|gb|EIY34758.1| hypothetical protein HMPREF1064_02206 [Bacteroides dorei
           CL02T12C06]
          Length = 646

 Score =  310 bits (795), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 192/545 (35%), Positives = 294/545 (53%), Gaps = 34/545 (6%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
           +K   L DV L  S       + ++ ++  ++VD L+ SFR  A +           K  
Sbjct: 48  VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 106

Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
           GGWE+   ELRGH  GH LSA   M+A+T +   + K  ++V  L+E QN +G GYLSA+
Sbjct: 107 GGWESLDCELRGHTTGHLLSAYGLMYAATGSKLFRHKGDSLVSGLAEVQNALGNGYLSAY 166

Query: 212 PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
           P EL +       VWAP+YT+HK+ +GL+DQY+ +DN +AL++   M ++ Y++++ +  
Sbjct: 167 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVIRMADWAYHKLKPLDE 226

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
               +     +  E GG+N+  Y LY+IT D +H  LA  F     +  L    D L   
Sbjct: 227 TTRQK----MIRNEFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDLGTK 282

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           H NT IP VI     YE+T D   + +  FF   +   H++A G +S +E ++DP R + 
Sbjct: 283 HTNTFIPKVIAEARNYELTEDENSRKLSDFFWHTMIDHHTFAPGCSSDKEHYFDPARFSK 342

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
            +     ETC TYNMLK+SRHLF WT + A ADYYERAL N +L  Q+  + G++ Y LP
Sbjct: 343 HVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTYFLP 401

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           L  G  K  S     TK NSFWCC G+G E+ +K G++IY+    N  G+Y+  +I S  
Sbjct: 402 LLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYH---NDKGIYVNLFIPSVV 453

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           +W+   + L Q+ D    +       LT  ++  V   +++ LR P W  S   + ++NG
Sbjct: 454 NWQEKGLTLRQETD----FPAEETTVLTIGTQSPVE--TTVYLRYPSW--SKEVKVAVNG 505

Query: 572 QNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           + + +   PG++++ T  W   D++T   P+ LR E   D+ P+     A+++GP +LAG
Sbjct: 506 KKVAVKQKPGSYIAITRLWKDGDRITADYPMRLRVETTPDN-PQKG---ALVYGPVVLAG 561

Query: 631 HTSGE 635
               E
Sbjct: 562 ERGTE 566


>gi|265752243|ref|ZP_06088036.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|263237035|gb|EEZ22505.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
          Length = 640

 Score =  310 bits (795), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 192/545 (35%), Positives = 294/545 (53%), Gaps = 34/545 (6%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
           +K   L DV L  S       + ++ ++  ++VD L+ SFR  A +           K  
Sbjct: 42  VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 100

Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
           GGWE+   ELRGH  GH LSA   M+A+T +   + K  ++V  L+E QN +G GYLSA+
Sbjct: 101 GGWESLDCELRGHTTGHLLSAYGLMYAATGSKLFRHKGDSLVSGLAEVQNALGNGYLSAY 160

Query: 212 PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
           P EL +       VWAP+YT+HK+ +GL+DQY+ +DN +AL++   M ++ Y++++ +  
Sbjct: 161 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVIRMADWAYHKLKPLDE 220

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
               +     +  E GG+N+  Y LY+IT D +H  LA  F     +  L    D L   
Sbjct: 221 TTRQK----MIRNEFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDLGTK 276

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           H NT IP VI     YE+T D   + +  FF   +   H++A G +S +E ++DP R + 
Sbjct: 277 HTNTFIPKVIAEARNYELTEDENSRKLSDFFWHTMIDHHTFAPGCSSDKEHYFDPARFSK 336

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
            +     ETC TYNMLK+SRHLF WT + A ADYYERAL N +L  Q+  + G++ Y LP
Sbjct: 337 HVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTYFLP 395

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           L  G  K  S     TK NSFWCC G+G E+ +K G++IY+    N  G+Y+  +I S  
Sbjct: 396 LLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYH---NDKGIYVNLFIPSVV 447

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           +W+   + L Q+ D    +       LT  ++  V   +++ LR P W  S   + ++NG
Sbjct: 448 NWQEKGLTLRQETD----FPAEETTVLTIGTQSPVE--TTVYLRYPSW--SKEVKVAVNG 499

Query: 572 QNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           + + +   PG++++ T  W   D++T   P+ LR E   D+ P+     A+++GP +LAG
Sbjct: 500 KKVAVKQKPGSYIAITRLWKDGDRITADYPMRLRVETTPDN-PQKG---ALVYGPVVLAG 555

Query: 631 HTSGE 635
               E
Sbjct: 556 ERGTE 560


>gi|383115004|ref|ZP_09935763.1| hypothetical protein BSGG_0819 [Bacteroides sp. D2]
 gi|313693284|gb|EFS30119.1| hypothetical protein BSGG_0819 [Bacteroides sp. D2]
          Length = 643

 Score =  310 bits (794), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 192/520 (36%), Positives = 285/520 (54%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA   M+
Sbjct: 70  WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L E QN +  GYLSA+P EL +     K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN +AL + T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 190 GLIDQYLYADNKKALTIVTRMGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           SIT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   + 
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+L+  L     ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + + Q+ + P          
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S   + S+NG+ + +    G++++ T  W   D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVSVNGKKISVKQKSGSYIAITREWKDGDQI 527

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ ++ E   D+ P+ A   A+L+GP +LAG    E
Sbjct: 528 SATYPMQIKLETTPDN-PDKA---ALLYGPLVLAGERGTE 563


>gi|237722400|ref|ZP_04552881.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448210|gb|EEO54001.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
          Length = 644

 Score =  310 bits (794), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 192/520 (36%), Positives = 285/520 (54%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA   M+
Sbjct: 70  WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L E QN +  GYLSA+P EL +     K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN +AL + T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 190 GLIDQYLYADNKKALTIVTRMGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           SIT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   + 
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+L+  L     ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + + Q+ + P          
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S   + S+NG+ + +    G++++ T  W   D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVSVNGKKISVKQKSGSYIAITREWKDGDQI 527

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ ++ E   D+ P+ A   A+L+GP +LAG    E
Sbjct: 528 SATYPMQIKLETTPDN-PDKA---ALLYGPLVLAGERGTE 563


>gi|150002728|ref|YP_001297472.1| hypothetical protein BVU_0120 [Bacteroides vulgatus ATCC 8482]
 gi|294776982|ref|ZP_06742443.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
 gi|149931152|gb|ABR37850.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
 gi|294449230|gb|EFG17769.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
          Length = 640

 Score =  310 bits (793), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 192/545 (35%), Positives = 294/545 (53%), Gaps = 34/545 (6%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
           +K   L DV L  S       + ++ ++  ++V+ L+ SFR  A +           K  
Sbjct: 42  VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVNRLLHSFRTNAGVFAGREGGYMTVKKL 100

Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
           GGWE+   ELRGH  GH LSA   M+A+T +   K+K  ++V  L+E Q  +G GYLSA+
Sbjct: 101 GGWESLDCELRGHTTGHLLSAYGLMYAATGSEQFKQKGDSLVNGLAEVQTALGNGYLSAY 160

Query: 212 PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
           P EL +       VWAP+YT+HK+ +GL+DQY+ +DN +AL++   M ++ Y++++ +  
Sbjct: 161 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVIRMADWAYHKLKPLDE 220

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
               +     +  E GG+N+  Y LY+IT D +H  LA  F     +  L    D L   
Sbjct: 221 TTRQK----MIRNEFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDLGTK 276

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           H NT IP VI     YE+T D   + +  FF   +   H++A G +S +E ++DP R + 
Sbjct: 277 HTNTFIPKVIAEARNYELTEDENSRKLSDFFWHTMIDHHTFAPGCSSDKEHYFDPARFSK 336

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
            +     ETC TYNMLK+SRHLF WT + A ADYYERAL N +L  Q+  + G++ Y LP
Sbjct: 337 HVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTYFLP 395

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           L  G  K  S     TK NSFWCC G+G E+ +K G++IY+    N  G+Y+  +I S  
Sbjct: 396 LLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYH---NDKGIYVNLFIPSVV 447

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           +W+   + L Q+ D    +       LT  ++  V   +++ LR P W  S G +  +NG
Sbjct: 448 NWREKGLTLRQETD----FPAEETTVLTIRAQNPVE--TTVYLRYPSW--SKGVKVFVNG 499

Query: 572 QNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           + + +   PG++++ T  W   D++T   P+ LR E   D+ P+     A+++GP +LAG
Sbjct: 500 KKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQKG---ALVYGPVVLAG 555

Query: 631 HTSGE 635
               E
Sbjct: 556 KRGTE 560


>gi|293369447|ref|ZP_06616030.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
 gi|292635445|gb|EFF53954.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
          Length = 644

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 191/520 (36%), Positives = 285/520 (54%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA   M+
Sbjct: 70  WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L E QN +  GYLSA+P EL +     K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN +AL + T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 190 GLIDQYLYADNKKALTIVTRMGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           SIT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   + 
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DP++L+  L     ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPRKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + + Q+ + P          
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S   + S+NG+ + +    G++++ T  W   D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVSVNGKKISVKQKSGSYIAITREWKDGDQI 527

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ ++ E   D+ P+ A   A+L+GP +LAG    E
Sbjct: 528 SATYPMQIKLETTPDN-PDKA---ALLYGPLVLAGERGTE 563


>gi|212690961|ref|ZP_03299089.1| hypothetical protein BACDOR_00451 [Bacteroides dorei DSM 17855]
 gi|212666193|gb|EEB26765.1| hypothetical protein BACDOR_00451 [Bacteroides dorei DSM 17855]
          Length = 646

 Score =  309 bits (791), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 187/516 (36%), Positives = 285/516 (55%), Gaps = 37/516 (7%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  ++VD L+ SFR  A +           K  GGWE+   ELRGH  GH LSA   M+
Sbjct: 73  WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYGLMY 132

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E QN +G GYLSA+P EL +       VWAP+YT+HK+ +
Sbjct: 133 AATGSELFKHKGDSLVSGLAEVQNALGNGYLSAYPEELINRNIRGTSVWAPWYTLHKLFS 192

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKV--ITMYSVERHWYSLNEETGGMNDVLYR 295
           GL+DQY+ +DN +AL++ T M ++ Y++++ +  +T   + R+      E GG+N+  Y 
Sbjct: 193 GLIDQYLYSDNQKALEIVTRMADWAYHKLKPLDEVTRRKMIRN------EFGGINESFYN 246

Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY 355
           LY+IT D ++  LA  F     +  L    D L   H NT IP V+     YE+T D   
Sbjct: 247 LYAITGDERYRWLARFFYHNEVIDPLKELRDDLGTKHTNTFIPKVLAEARNYELTEDEDS 306

Query: 356 KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
           + +  FF   +   H++A G +S +E ++DP   +  +     ETC TYNMLK+SRHLF 
Sbjct: 307 RKLSGFFWHTMIDRHTFAPGCSSDKEHYFDPDHFSKHISGYTGETCCTYNMLKLSRHLFC 366

Query: 416 WTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCC 475
           WT + A ADYYERAL N +L  Q+    G++ Y LPL  G  K  S     TK NSFWCC
Sbjct: 367 WTADAAVADYYERALYNHILG-QQDPHTGMVTYFLPLLSGSHKVYS-----TKENSFWCC 420

Query: 476 YGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR 535
            G+G E+ +K G++IY+    N  G+Y+  +I S  +W+   + L Q+ D    +     
Sbjct: 421 VGSGFENHAKYGEAIYYH---NDKGIYVNLFIPSVVNWREKGLTLRQETD----FPAEET 473

Query: 536 MTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDK 594
             LT  ++  V   +++ LR P W  S G +  +NG+ + +   PG++++ T  W   D+
Sbjct: 474 TVLTIGAQNPVE--TTVYLRYPSW--SKGVKVFVNGKKIAVKQKPGSYIAITRLWKDGDR 529

Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +T   P+ LR E   D+ P+     A+++GP +LAG
Sbjct: 530 ITADYPMCLRVETTPDN-PQKG---ALIYGPLVLAG 561


>gi|423295661|ref|ZP_17273788.1| hypothetical protein HMPREF1070_02453 [Bacteroides ovatus
           CL03T12C18]
 gi|392672370|gb|EIY65839.1| hypothetical protein HMPREF1070_02453 [Bacteroides ovatus
           CL03T12C18]
          Length = 644

 Score =  308 bits (789), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 191/520 (36%), Positives = 285/520 (54%), Gaps = 35/520 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA   M+
Sbjct: 70  WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L E QN +  GYLSA+P EL +     K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN +AL + T + ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 190 GLIDQYLYADNKKALTIVTRVGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           SIT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   + 
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+L+  L     ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + + Q+ + P          
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S   + S+NG+ + +    G++++ T  W   D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVSVNGKKISVKQKSGSYIAITREWKDGDQI 527

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +   P+ ++ E   D+ P+ A   A+L+GP +LAG    E
Sbjct: 528 SATYPMQIKLETTPDN-PDKA---ALLYGPLVLAGERGTE 563


>gi|433678837|ref|ZP_20510648.1| hypothetical protein BN444_02893 [Xanthomonas translucens pv.
           translucens DSM 18974]
 gi|430816044|emb|CCP41169.1| hypothetical protein BN444_02893 [Xanthomonas translucens pv.
           translucens DSM 18974]
          Length = 648

 Score =  308 bits (789), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 185/518 (35%), Positives = 277/518 (53%), Gaps = 31/518 (5%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG-HYLSASAQ 175
           +A++ N  YL+ +    L+ +FR  A L +  +  GGWE+P  ELRGHF G HYLSA A 
Sbjct: 71  QARERNRRYLMSIPNARLLHNFRLVAGLSSDAEPLGGWESPKCELRGHFAGGHYLSACAL 130

Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
           ++A+T +A +K+K   +V  L+ CQ +   GYL A+P   +      + VW P YT HKI
Sbjct: 131 LYAATSDAALKDKADALVAELARCQRQ--DGYLGAYPAAFYARLRRGEDVWVPLYTAHKI 188

Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLY 294
           LAG LD    A NAQAL+ A    ++    +         +  W + L  E GG+ + L 
Sbjct: 189 LAGHLDMARHAGNAQALRSAQRFADWLGAWMDGCD-----DAQWQHILGVEFGGVQESLL 243

Query: 295 RLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL 354
            LY ++ DPK+   A  + +P  L  LA Q D L+  HANT IP ++ +   YE+ G+P 
Sbjct: 244 ELYLLSGDPKYQRWAARYAQPALLEPLAQQRDALAGLHANTQIPKIVAAARAYEIGGEPR 303

Query: 355 YKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLF 414
            + I  FF   V+  H+Y TGGTS  E +  P   A  L   + E C +YNMLK++RHL+
Sbjct: 304 QRDIAAFFWRTVSGHHAYCTGGTSDYELFGKPDHFAGRLSGHSHECCCSYNMLKLTRHLY 363

Query: 415 RWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC 474
            W  + A  DYYER L N  L  Q   E G+++Y +P+  G  K      + T F SFWC
Sbjct: 364 TWQPDAALMDYYERVLFNARLGTQ--DEAGMLMYFVPMDAGYWKL-----YNTPFASFWC 416

Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
           C GTG+E F+K  DSIYF    +  GL +  +I+S  DW    + + Q+      +    
Sbjct: 417 CTGTGVEEFAKSNDSIYFR---DAAGLTVNLFIASQLDWPERGLRVVQR----TRFPQQE 469

Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYND 593
              L F  K+   Q  +L LR+P W  + G +  +NG+   +   PG++L+   R++  D
Sbjct: 470 GTALEFQCKRP--QQMTLRLRIPYWA-TQGVRLRINGKAQAIKATPGSYLALQRRFADGD 526

Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
           ++ + LP++L    +    P+  S+QA+++GP +LA  
Sbjct: 527 RIELDLPMALHAAPL----PDEPSLQAMMYGPLVLAAQ 560


>gi|336415976|ref|ZP_08596314.1| hypothetical protein HMPREF1017_03422 [Bacteroides ovatus
           3_8_47FAA]
 gi|335939879|gb|EGN01751.1| hypothetical protein HMPREF1017_03422 [Bacteroides ovatus
           3_8_47FAA]
          Length = 644

 Score =  307 bits (787), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 191/515 (37%), Positives = 284/515 (55%), Gaps = 35/515 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           ++  +DV+ L+ SFR  A +           K  GGWE+   ELRGH  GH LSA   M+
Sbjct: 70  WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L E QN +  GYLSA+P EL +     K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ ADN +AL + T M ++ YN+++ +    S E     +  E GG+N+  Y LY
Sbjct: 190 GLIDQYLYADNKKALIIVTRMGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           SIT D ++  LA  F     +  L    D L   H NT IP VI     YE+T +   + 
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DPK+L+  L     ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            + + ADYYERAL N +L  Q+  E G++ Y LPL  G  K  S     TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
           +G E+ +K G++IY+    N  G+Y+  +I S   WK   + + Q+ + P          
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
           T  F+ + E    +++ LR P W  S   + S+NG+ + +    G++++ T  W   D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVSVNGKKIFVKQKSGSYIAITREWKDGDQI 527

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +   P+ ++ E   D+ P+ A   A+L+GP +LAG
Sbjct: 528 SATYPMQIKLETTPDN-PDKA---ALLYGPLVLAG 558


>gi|237712552|ref|ZP_04543033.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|229453873|gb|EEO59594.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
          Length = 640

 Score =  307 bits (786), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 192/542 (35%), Positives = 294/542 (54%), Gaps = 38/542 (7%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
           +K   L DV L  S       + ++ ++  ++VD L+ SFR  A +           K  
Sbjct: 42  VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 100

Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
           GGWE+   ELRGH  GH LSA   M+A+T +   K K  ++V  L+E QN +G GYLSA+
Sbjct: 101 GGWESLDCELRGHTTGHLLSAYGLMYAATGSELFKHKGDSLVSGLAEVQNALGNGYLSAY 160

Query: 212 PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV-- 269
           P EL +       VWAP+YT+HK+ +GL+DQY+ +DN +AL++ T M ++ Y++++ +  
Sbjct: 161 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEIVTRMADWAYHKLKPLDE 220

Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
           +T   + R+      E GG+N+  Y LY+IT D ++  LA  F     +  L    D L 
Sbjct: 221 VTRRKMIRN------EFGGINESFYNLYAITGDERYRWLARFFYHNEVIDPLKELRDDLG 274

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
             H NT IP V+     YE+T D   + +  FF   +   H++A G +S +E ++DP   
Sbjct: 275 TKHTNTFIPKVLAEARNYELTEDEDSRKLSGFFWHTMIDRHTFAPGCSSDKEHYFDPDHF 334

Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
           +  +     ETC TYNMLK+S HLF WT + A ADYYERAL N +L  Q+    G++ Y 
Sbjct: 335 SKHISGYTGETCCTYNMLKLSSHLFCWTADAAVADYYERALYNHILG-QQDPHTGMVTYF 393

Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
           LPL  G  K  S     TK NSFWCC G+G E+ +K G++IY+    N  G+Y+  +I S
Sbjct: 394 LPLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYH---NDKGIYVNLFIPS 445

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             +W+   + L Q+ D    +       LT  ++  V   +++ LR P W  S G +  +
Sbjct: 446 VVNWREKGLTLRQETD----FPAEETTVLTIGAQNPVE--TTVYLRYPSW--SKGVKVFV 497

Query: 570 NGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
           NG+ + +   PG++++ T  W   D++T   P+ LR E   D+ P+     A+++GP +L
Sbjct: 498 NGKKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQKG---ALIYGPLVL 553

Query: 629 AG 630
           AG
Sbjct: 554 AG 555


>gi|427386207|ref|ZP_18882404.1| hypothetical protein HMPREF9447_03437 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726247|gb|EKU89112.1| hypothetical protein HMPREF9447_03437 [Bacteroides oleiciplenus YIT
           12058]
          Length = 641

 Score =  306 bits (784), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 179/488 (36%), Positives = 276/488 (56%), Gaps = 26/488 (5%)

Query: 149 KAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL 208
           K  GGWE+   E+RGH  GH LSA A M+A++ +   K K  ++V  L+E Q+ +G GYL
Sbjct: 99  KKLGGWESLDCEIRGHTTGHLLSAYALMYAASGSEIFKLKGDSLVSGLAEVQDALGNGYL 158

Query: 209 SAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
           SA+P EL +       VWAP+YT+HK+ +GL+DQY+  DN QALK+ T M ++ YN+++ 
Sbjct: 159 SAYPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALKVVTRMGDWAYNKLKP 218

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           +      E     +  E GG+N+  Y LY+IT D ++  LA+ F     +  L  Q D L
Sbjct: 219 L----DEETRKRMIRNEFGGVNESFYNLYAITGDERYHWLANFFYHNDVIDPLKEQRDDL 274

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
              H NT IP V+     YE+T +   + +  FF   + A H++A G +S +E ++DP++
Sbjct: 275 GTKHTNTFIPKVLAEARNYELTQNAESRTLTDFFWHTMIAHHTFAPGCSSDKEHYFDPQQ 334

Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
            +  L     ETC TYNMLK+SRHLF WT + + ADYYERAL N +L  Q+  E G+  Y
Sbjct: 335 FSKHLTGYTGETCCTYNMLKLSRHLFCWTGDASIADYYERALYNHILG-QQDPETGMFSY 393

Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
            LPL  G  K  S     T+ NSFWCC G+G E+ +K G++IY++ E    G+Y+  +I 
Sbjct: 394 FLPLLSGSHKVYS-----TQENSFWCCVGSGFENHAKYGEAIYYQNE---KGIYVNLFIP 445

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
           S  +WK   + + Q+ +    +       L+  +K+ V   +++ LR P W  S     S
Sbjct: 446 SEVNWKEKGMTIRQETN----FPAEETTILSIHAKEPVK--TTVYLRYPSW--SKKVTVS 497

Query: 569 LNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           +NG+ + +   PG++++ T +W   DK+    P+ ++ E   D+ P+     A+++GP +
Sbjct: 498 VNGKKVSVKQKPGSYIAVTRQWKDGDKIEANYPMEIQLETTPDN-PQKG---ALVYGPLV 553

Query: 628 LAGHTSGE 635
           LAG    E
Sbjct: 554 LAGELGTE 561


>gi|440732599|ref|ZP_20912422.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
           DAR61454]
 gi|440368630|gb|ELQ05659.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
           DAR61454]
          Length = 652

 Score =  306 bits (783), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 184/518 (35%), Positives = 276/518 (53%), Gaps = 31/518 (5%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG-HYLSASAQ 175
           +A++ N  YL+ +    L+ +FR  A L +  +  GGWE+P  ELRGHF G HYLSA A 
Sbjct: 75  QARERNRRYLMSIPNARLLHNFRLVAGLSSDAEPLGGWESPKCELRGHFAGGHYLSACAL 134

Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
           ++A+T +A +K+K   +V  L+ CQ +   GYL A+P   +      + VW P YT HKI
Sbjct: 135 LYAATGDAALKDKADALVAELARCQRQ--DGYLGAYPAAFYARLRRGEDVWVPLYTAHKI 192

Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLY 294
           LAG LD    A NAQAL+ A    ++    +         +  W + L  E GG+ + L 
Sbjct: 193 LAGHLDMARHAGNAQALRSAQRFADWLGAWMDGCD-----DAQWQHILGVEFGGVQESLL 247

Query: 295 RLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL 354
            LY ++ DPK+   A  + +P  L  LA Q D L+  HANT IP ++ +   YE+  DP 
Sbjct: 248 ELYLLSGDPKYQRWAARYAQPALLEPLAQQRDALAGLHANTQIPKIVAAARAYEIGRDPR 307

Query: 355 YKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLF 414
            + +  FF   V+  H+Y TGGTS  E +  P   A  L   + E C +YNMLK++RHL+
Sbjct: 308 QRDVAAFFWRTVSGHHAYCTGGTSDYELFGKPDHFAGRLSGHSHECCCSYNMLKLTRHLY 367

Query: 415 RWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC 474
            W  + A  DYYER L N  L  Q   E G+++Y +P+  G  K      + T F SFWC
Sbjct: 368 TWQPDAALMDYYERVLFNARLGTQ--DEAGMLMYFVPMDAGYWKL-----YNTPFASFWC 420

Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
           C GTG+E F+K  DSIYF    +  GL +  +I+S  DW    + + Q+      +    
Sbjct: 421 CTGTGVEEFAKSNDSIYFR---DAAGLTVNLFIASQLDWPERGLRVVQR----TRFPQQE 473

Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYND 593
              L F  K+   Q  +L LR+P W  + G +  +NG+   +   PG++L+   R++  D
Sbjct: 474 GTALVFQCKRP--QQMTLRLRIPYWA-TQGVRLRINGKAQAIKATPGSYLALQRRFADGD 530

Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
           ++ + LP++L    +    P+  S+QA+++GP +LA  
Sbjct: 531 RIELDLPMALHAAPL----PDEPSLQAMMYGPLVLAAQ 564


>gi|395774802|ref|ZP_10455317.1| protein [Streptomyces acidiscabies 84-104]
          Length = 818

 Score =  306 bits (783), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 198/525 (37%), Positives = 273/525 (52%), Gaps = 33/525 (6%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
           Q+ N  YL  +D+D L+ +FR    LP+  +   GWE P  ELRGH  GH LS  A   A
Sbjct: 43  QRRNTAYLRFVDLDRLLHTFRLNVGLPSTAQPCSGWEGPNVELRGHSTGHLLSGLALTHA 102

Query: 179 STHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYTIH 233
           +T +  +++K   +V +L+ECQ          GYLSAFP   FD  EA   VWAPYYT+H
Sbjct: 103 NTGDTELRDKGRRLVAALAECQAASPAAGFNAGYLSAFPESFFDRLEAGTGVWAPYYTLH 162

Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
           KI+AGL+DQY L+ N QAL +     ++   R   +    S ER    L+ E GGMNDVL
Sbjct: 163 KIMAGLVDQYRLSGNEQALDVVLRKGDWVDRRTAGL----SYERMQRVLDTEFGGMNDVL 218

Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
             L+ IT D + L +A  F        LA   D L+  HANT IP ++G+   +E   D 
Sbjct: 219 ADLHEITGDARWLAVAERFTHARVFDPLARGEDRLAGLHANTQIPKMVGALRMWEEGLDV 278

Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
            Y+ IG  F  IV   H+Y  GG S  E + +P  +A  L     E C +YNMLK++R L
Sbjct: 279 RYRTIGENFWRIVTGHHTYVIGGNSNGEAFHEPDVIAGQLSDSTCENCNSYNMLKLTRLL 338

Query: 414 -FRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARST-----HGWG 466
            F         DYYERAL N +L  Q  G+E G  IY   L  G +K + +       + 
Sbjct: 339 HFHAPGRTDLLDYYERALFNQMLGEQDPGSEHGYNIYYTGLAPGSAKRQPSFMSPEDAYS 398

Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDP 526
           T + +F C +GTG+E+ +K  D+IY  +E     L +  +I S  DWK+  +   Q    
Sbjct: 399 TDYTNFSCDHGTGMETHAKFADTIYTHDEQR---LLVNLFIPSEVDWKAKGITWRQTTR- 454

Query: 527 IVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLSA 585
           +   D     TLT ++ Q      +L +R+P W  + GA+  LNG+ LP  P PG + + 
Sbjct: 455 LPDQDT---ATLTVTAGQ---ARHALVVRVPGW--ARGARVRLNGRTLPDRPAPGTWFTL 506

Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
              W   D++ + LPL    EA  DD PE   +QA+L GP +LAG
Sbjct: 507 DRAWRRGDRVDVTLPLRTTVEATPDD-PE---VQAVLHGPVVLAG 547


>gi|423239921|ref|ZP_17221036.1| hypothetical protein HMPREF1065_01659 [Bacteroides dorei
           CL03T12C01]
 gi|392644910|gb|EIY38644.1| hypothetical protein HMPREF1065_01659 [Bacteroides dorei
           CL03T12C01]
          Length = 646

 Score =  305 bits (782), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 192/542 (35%), Positives = 293/542 (54%), Gaps = 38/542 (7%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
           +K   L DV L  S       + ++ ++  ++VD L+ SFR  A +           K  
Sbjct: 48  VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 106

Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
           GGWE+   ELRGH  GH LSA   M+A+T +   K K  ++V  L E QN +G GYLSA+
Sbjct: 107 GGWESLDCELRGHTTGHLLSAYGLMYAATGSELFKHKGDSLVSGLVEVQNALGNGYLSAY 166

Query: 212 PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV-- 269
           P EL +       VWAP+YT+HK+ +GL+DQY+ +DN +AL++ T M ++ Y++++ +  
Sbjct: 167 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEIVTRMADWAYHKLKPLDE 226

Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
           +T   + R+      E GG+N+  Y LY+IT D ++  LA  F     +  L    D L 
Sbjct: 227 VTRRKMIRN------EFGGINESFYNLYAITGDERYRWLARFFYHNEVIDPLKELRDDLG 280

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
             H NT IP V+     YE+T D   + +  FF   +   H++A G +S +E ++DP   
Sbjct: 281 TKHTNTFIPKVLAEARNYELTEDEDSRKLSGFFWHTMIDRHTFAPGCSSDKEHYFDPDHF 340

Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
           +  +     ETC TYNMLK+S HLF WT + A ADYYERAL N +L  Q+    G++ Y 
Sbjct: 341 SKHISGYTGETCCTYNMLKLSSHLFCWTADAAVADYYERALYNHILG-QQDPHTGMVTYF 399

Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
           LPL  G  K  S     TK NSFWCC G+G E+ +K G++IY+    N  G+Y+  +I S
Sbjct: 400 LPLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYH---NDKGIYVNLFIPS 451

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             +W+   + L Q+ D    +       LT  ++  V   +++ LR P W  S G +  +
Sbjct: 452 VVNWREKGLTLRQETD----FPAEETTVLTIGAQNPVE--TTVYLRYPSW--SKGVKVFV 503

Query: 570 NGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
           NG+ + +   PG++++ T  W   D++T   P+ LR E   D+ P+     A+++GP +L
Sbjct: 504 NGKKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQKG---ALIYGPLVL 559

Query: 629 AG 630
           AG
Sbjct: 560 AG 561


>gi|332880745|ref|ZP_08448418.1| hypothetical protein HMPREF9074_04196 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|357045883|ref|ZP_09107513.1| hypothetical protein HMPREF9441_01526 [Paraprevotella clara YIT
           11840]
 gi|332681379|gb|EGJ54303.1| hypothetical protein HMPREF9074_04196 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|355530889|gb|EHH00292.1| hypothetical protein HMPREF9441_01526 [Paraprevotella clara YIT
           11840]
          Length = 618

 Score =  304 bits (778), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 213/621 (34%), Positives = 301/621 (48%), Gaps = 67/621 (10%)

Query: 105 HDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGH 164
           HDV L  S V  R +  N  +L  L+ D L+ +FR  A LP+  K   GWE+P   LRGH
Sbjct: 39  HDVELASSWVKQR-EDLNTAFLRSLEPDRLLHNFRVNAGLPSVAKPLEGWESPGVGLRGH 97

Query: 165 FVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-LK 223
           FVGHYLSA + +     +A +   +  VV  +  CQ   G GYLSAFP    +  E    
Sbjct: 98  FVGHYLSAVSALVERYEDAGLARNLEKVVEGMYACQQAHGNGYLSAFPETDIEVLETRFT 157

Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
            VWAPYYT+HKI+ GLLD Y+   N +A  M   +  Y   R+ K +   +V R  Y+ +
Sbjct: 158 GVWAPYYTLHKIMQGLLDVYLRTGNEKAYAMVEGLAGYVDRRMSK-LDPATVARMMYTAD 216

Query: 284 ----EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
                E GGMN+VLY+LY ++  P++L LA LFD   FL  L    D LS  HANTHI +
Sbjct: 217 ANPQNEMGGMNEVLYQLYCVSGKPRYLELASLFDPSWFLEPLVRNEDILSGLHANTHIAL 276

Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA------------REFWWDPK 387
           V G   RYE TG+  Y      F +++   H+Y  G +S              E W +P 
Sbjct: 277 VNGFARRYESTGEECYGKSVANFWNMLMHFHAYVNGTSSGPRPNVTTETSLTAEHWGEPC 336

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ-RGTEPGVM 446
            L +TL     E+C T+N  +++  LF WT    YAD Y     N VL +Q R T  G  
Sbjct: 337 HLCNTLTKGIAESCVTHNTQRLNASLFSWTGNPCYADVYMNMFYNAVLPVQSRST--GAY 394

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
           +Y LPLG    KA          N F CC G+  E+F+KL + IY+ ++  V   Y+  Y
Sbjct: 395 VYHLPLGSPRHKAYMAD------NDFKCCSGSCAEAFAKLNNGIYYHDDSAV---YVNLY 445

Query: 507 ISSSFDWKSGHVVLNQK----VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
           + S   W    V L Q     V+PIV +   +R  + F           LNL +P WT  
Sbjct: 446 VPSKVHWADKKVGLEQAGGFPVEPIVDFTVSVRRPVDF----------VLNLFIPAWT-- 493

Query: 563 NGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
           +GA   +NG+   +P  P +FL  + RW+  D++ I+   + R +++    P+  ++ A+
Sbjct: 494 DGAVVYVNGEKQEMPVRPSSFLKLSRRWADGDRVRIEFRYAFRLQSM----PDKENMLAV 549

Query: 622 LFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQS 681
            +GP LLA  T  E  +K      L+ L      SF         +S +  FV+ N  + 
Sbjct: 550 FYGPMLLAFETRDEVILKGNKDEILAGL------SF--------ADSESGRFVLKNGERE 595

Query: 682 ITMEE-FPVSGTDAALHATFR 701
             +   F V      ++AT R
Sbjct: 596 FRLRPLFDVDKESYGVYATIR 616


>gi|326204047|ref|ZP_08193908.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
           2782]
 gi|325985814|gb|EGD46649.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
           2782]
          Length = 743

 Score =  302 bits (774), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 189/518 (36%), Positives = 271/518 (52%), Gaps = 31/518 (5%)

Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
           L  A +  +EYL   D D L+  F  T  L    + Y GWEN  +E+RGH +GHYL+A A
Sbjct: 11  LVNAFKKEIEYLEAFDCDKLLSCFYITKGLTPKAENYRGWEN--TEIRGHTMGHYLTALA 68

Query: 175 QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK 234
           Q +++T+++ I E++  ++  LS CQ    +GYLSAFP E FD  E  KP+W P+YT+HK
Sbjct: 69  QAYSATNDSKIYERLQYLMKELSLCQ--FESGYLSAFPEEFFDRVENRKPIWVPWYTMHK 126

Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLY 294
           I+ GL+  Y LA    ALK+ + + E+ ++R  K    ++ E H   L  E GGMND +Y
Sbjct: 127 IITGLISVYKLAKIETALKIVSRLGEWVFSRTDK----WTPEIHANVLAVEYGGMNDCMY 182

Query: 295 RLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP- 353
            LY I+ + KH   AH+FD+      +    D L++ HANT IP  +G+  RY   G+  
Sbjct: 183 ELYKISGNEKHCTAAHMFDEIELFKEIHDGKDILNNRHANTTIPKFLGALNRYLAIGEEE 242

Query: 354 -LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
             Y      F  IV  +HSY TGG S  E + +P  L     S N ETC TYNMLK++R 
Sbjct: 243 QFYLDTCKEFWSIVTNNHSYVTGGNSEWEHFGEPGILDAERTSTNCETCNTYNMLKMTRE 302

Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
           LF+ T    YAD+YE   TN +LS Q   + G+ +Y  P+  G  K      +G  F  F
Sbjct: 303 LFKITGNKKYADFYENTFTNAILSSQ-NPDTGMTMYFQPMETGYFKV-----YGKPFEHF 356

Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
           WCC GTG+E+F+KL +SIYF EE     LY+  Y S+  +W+   V L Q  D I   D 
Sbjct: 357 WCCTGTGMENFTKLNNSIYFYEEDR---LYVNMYYSTELNWEEKGVKLTQNSD-IPGTD- 411

Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYN 592
                  F+ K E G   +L +R+P W  + G + ++N           +      W  N
Sbjct: 412 ----RAGFTIKAETGAEFTLCMRIPTW--AKGVKINVNNNLSIFTEERGYALIHRTWKDN 465

Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           D + I   +  +   +    P+  +  A  +GP +L+ 
Sbjct: 466 DTVEIIFKIEPQLSTL----PDNPNAVAFTYGPVVLSA 499


>gi|325106457|ref|YP_004276111.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324975305|gb|ADY54289.1| protein of unknown function DUF1680 [Pedobacter saltans DSM 12145]
          Length = 648

 Score =  302 bits (774), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 194/551 (35%), Positives = 293/551 (53%), Gaps = 41/551 (7%)

Query: 101 EVSLHDVWLDQSSVLWRAQQTNLE----YLLMLDVDSLVWSFRKTASLPTPG-------K 149
           +V ++   L    +L  A + N+E    +L+ LDV+ L+ SFR TA + +         K
Sbjct: 39  DVKVYSFDLKDVRLLPSAFRDNMERDSKWLMSLDVNRLLHSFRNTAGVFSSKEGGYMTIK 98

Query: 150 AYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQ---NKIG-T 205
             GGWE+   +LRGH  GH +SA + ++AST +   K K  ++V  L+E Q    K+G  
Sbjct: 99  KLGGWESLDCDLRGHTTGHIMSALSYLYASTGDERYKIKSDSIVNGLAEVQYALTKVGQN 158

Query: 206 GYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
           G++SAFP    +   A + +WAP+YT+HKI AGL+DQY+   N +AL + T    + Y  
Sbjct: 159 GFISAFPENFINRNIAGQSIWAPWYTLHKIYAGLIDQYLYCGNEKALDIMTKAASWAY-- 216

Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
            QK++ + + E+    L  E GG N+  Y LY+IT +P+HL LA  F     L  LA + 
Sbjct: 217 -QKLMPL-TEEQRATMLRNEFGGTNEAFYNLYAITGNPEHLKLAEFFYHNAVLDPLAERK 274

Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
             L   HANT IP +IG    YE+  D   K + TFF D V    +Y TGG S +E +  
Sbjct: 275 SDLYFKHANTFIPKLIGEARNYELNADKRSKDVATFFWDEVVNHQTYCTGGNSHKEKFIH 334

Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
             ++++ L    +ETC + NMLK++RHLF W     YAD+YERAL N +L  Q+  + G+
Sbjct: 335 TDKVSENLTGYTQETCNSNNMLKLTRHLFSWDANPKYADFYERALYNHILG-QQDPQTGM 393

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
           + Y LPL  G  K  ST       NSFWCC GTG E+ +K G++IY+    N   LY+  
Sbjct: 394 VAYFLPLLPGSYKVYSTAE-----NSFWCCVGTGFENHAKYGEAIYYHNNTN---LYVNL 445

Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
           +I S   W    V L Q+   +      +++T+  +  Q+     +LNLR P W  ++G 
Sbjct: 446 FIPSELTWNEKGVKLKQET--VFPESDLVKLTVQTAKSQKF----ALNLRYPYW--ASGV 497

Query: 566 QASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           Q  +NG+ + +   P +++     W   D++ I+ P+SL      D+        A+++G
Sbjct: 498 QVKINGKAVKVKQVPSSYIVIDRTWKNGDQIIIKYPMSLHLAEANDN----VDKAAVMYG 553

Query: 625 PYLLAGHTSGE 635
           P +LAG    E
Sbjct: 554 PLVLAGMMGTE 564


>gi|326203856|ref|ZP_08193718.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
           2782]
 gi|325985954|gb|EGD46788.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
           2782]
          Length = 854

 Score =  301 bits (771), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 187/547 (34%), Positives = 281/547 (51%), Gaps = 41/547 (7%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           +++V++ D +L        A    + YL  +D + L+  +R+TA L T    YGGWEN  
Sbjct: 43  MEQVNITDTYLAN------AFNKEISYLQSIDPNRLLVGYRQTAGLSTSYSKYGGWEN-- 94

Query: 159 SELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVVFSLSECQNKIGTGYLSAFPT 213
           + L+GH +GHY+SA AQ + +T      NA +K+++  ++  L +CQNK G GY+ A   
Sbjct: 95  TPLKGHTLGHYMSALAQAYKNTKSNATVNADMKKRIDLIISELQQCQNKRGDGYIYAETP 154

Query: 214 ELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
           E F+  E  A   +WAP+YT+HKI++GL+  Y L  N  AL +A+ + ++ YNRV     
Sbjct: 155 EQFNVVEGKATGTLWAPWYTMHKIMSGLISIYELEGNPTALTVASKLGDWIYNRVN---- 210

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
            +        L  E GGMND L  LY +T    HL  A  F++P  L  +A   + L+  
Sbjct: 211 AWDSATQAKVLGVEYGGMNDCLIELYKLTGKSNHLAAAKKFEEPSLLNTIASGNNVLAGK 270

Query: 332 HANTHIPIVIGSQMRYEVTG--DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
           HANT IP  IG+  RY   G  +  Y      F ++V   H+Y TGG S  E +    +L
Sbjct: 271 HANTTIPKFIGAINRYRTLGTSEASYLTAAQQFWNMVIRDHTYVTGGNSQWEAFRAAGKL 330

Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
                  N ETC +YNMLK++R LF+ T ++ YAD+YER+  N +L+ Q   E G+  Y 
Sbjct: 331 DQYRDEVNNETCNSYNMLKLTRELFQVTGDVKYADFYERSFINEILASQN-PETGMTTYF 389

Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
            P+G G  K  S       F++FWCC GTG+E+F+KL DSIYF    N   LY+  YISS
Sbjct: 390 KPMGTGYFKVFS-----KPFDNFWCCTGTGMENFTKLNDSIYFN---NGSDLYVNMYISS 441

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN-GAQAS 568
           + +W    + L QK D  +S       T+TF+          +  R P W  ++      
Sbjct: 442 TLNWSEKGLSLTQKADVPLS------DTVTFTIDSAPSSEVKIKFRSPYWVAADKKVTVK 495

Query: 569 LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
           +NG ++       +L  +  W   DKL + +P  ++     D++    ++ A  +GP +L
Sbjct: 496 VNGSSVNASVVNGYLDVSRVWKVGDKLELTIPAEVQISRCTDNQ----NVAAFTYGPVVL 551

Query: 629 AGHTSGE 635
                 E
Sbjct: 552 CAGLGNE 558


>gi|332663228|ref|YP_004446016.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332332042|gb|AEE49143.1| protein of unknown function DUF1680 [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 791

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 197/561 (35%), Positives = 300/561 (53%), Gaps = 43/561 (7%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L D+ L   S  + A + +  YLL ++ D L+  F   A LPT    YGGWE+    L G
Sbjct: 50  LEDLRLLPGSAFYNAMEKDAAYLLKIESDRLLHRFYANAGLPTKAPVYGGWES--EGLSG 107

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-- 221
           H +GHYLSA A M+A + +    E+++ +V  L+ CQ    TGY+ A P E  DS  A  
Sbjct: 108 HTLGHYLSACALMYAGSKDEKYLERVNYLVQELARCQVARKTGYVGAIPKE--DSIFAQV 165

Query: 222 -----------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
                      L   W+P+YTIHK++AGL D Y+  +N QAL++   M ++  + V K+ 
Sbjct: 166 ARGDIRSSGFDLNGGWSPWYTIHKVMAGLADAYLYTNNDQALQVLRGMSDWTASVVDKL- 224

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
              +  +    L  E GGMN++L  +Y+ T + K+L L++ F     +  L+ + D L  
Sbjct: 225 ---NDPQRQKMLKCEYGGMNEILANVYAFTGEKKYLDLSYKFYDDFVMEPLSKKIDPLPG 281

Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
            H+NT++P  IGS  +YE+TG+   + I +FF + +  +H+Y  GG S  E+  D  +L 
Sbjct: 282 KHSNTNVPKAIGSARQYELTGNTRDQTIASFFWETMVHNHTYVIGGNSNYEYCGDAGKLN 341

Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
           D L     ETC TYNMLK++RHLF W      ADYYERAL N +L+ Q   E G+M Y +
Sbjct: 342 DRLSDNTCETCNTYNMLKLTRHLFCWQPSAELADYYERALYNHILASQH-PETGMMTYFV 400

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE-EEGNVPGLYIIQYISS 509
           PL  G  K  S      +F++F CC G+G+E+  K  +SIY+  ++GN   LY+  +I S
Sbjct: 401 PLRMGSKKEFS-----NEFHTFTCCVGSGMENHVKYTESIYYRGQDGN--SLYLNLFIPS 453

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             +WK   + L Q+       D  + ++ T +  Q++    +LNLR P W  ++  Q  +
Sbjct: 454 ELNWKERGLTLRQETK--FPQDGKVTLSFTCAKSQKL----ALNLRRPWWMKADW-QIKV 506

Query: 570 NGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
           NG+ + P+     +     RW   DKL +++P+ L TE++    P+  +  A L+GP +L
Sbjct: 507 NGKAVQPVAGTNGYYVLNRRWKNGDKLELEMPMQLYTESM----PDNPNRIAFLYGPLVL 562

Query: 629 AGHTSGEW-DIKTGTARSLSA 648
           AG    +  D   GT   LSA
Sbjct: 563 AGQLGDKMPDPVYGTPVLLSA 583


>gi|325281981|ref|YP_004254523.1| hypothetical protein Odosp_3391 [Odoribacter splanchnicus DSM
           20712]
 gi|324313790|gb|ADY34343.1| protein of unknown function DUF1680 [Odoribacter splanchnicus DSM
           20712]
          Length = 782

 Score =  297 bits (761), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 196/566 (34%), Positives = 292/566 (51%), Gaps = 45/566 (7%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           LK+V L D      S    A   N  ++L +D+D L+ +F K A L   G++YG WE+  
Sbjct: 45  LKDVRLLD------SPFKNAMDRNAAWMLEMDMDRLLSNFLKNAGLEPKGESYGSWES-- 96

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
             + GH +GHYLSA AQ +AST +   K+++  +V  L  CQ     G++   P    +F
Sbjct: 97  MGIAGHTLGHYLSAVAQQYASTGDERFKQRVDYIVHELDSCQQYFVNGFIGGMPGGDRVF 156

Query: 217 DSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
              +          L  +W P+Y  HK + GL D Y+LA N  A K+   + +Y  +   
Sbjct: 157 KQVKKGIIRSAGFDLNGLWVPWYNEHKTMMGLNDAYLLAGNKTAKKVLVNLADYLVD--- 213

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            V+   + E+    LN E GGMN+ L ++Y++T D K+L  ++ F     +  LA   D 
Sbjct: 214 -VLAGLTDEQVQTMLNCEFGGMNEALAQVYALTGDKKYLDASYRFYHRRLMEPLAEGKDI 272

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L   H+NT IP +IGS  +YE+TG+P  + I  FF   +   HSYA GG S+ E+   P 
Sbjct: 273 LPGLHSNTQIPKIIGSARQYELTGNPKDERIAEFFWTTMVNHHSYANGGNSSGEYLSTPD 332

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
           +L D L     ETC TYNMLK+SRHL+ WT +  Y D+YE+AL N +L+ Q   E G+  
Sbjct: 333 KLNDRLTHSTCETCNTYNMLKLSRHLYEWTGDPKYLDFYEKALYNHILASQH-PETGMTC 391

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y +PL  G  K      +  K+NSF CC G+G E+ SK G +IY     +   L++  YI
Sbjct: 392 YFVPLAMGTRKD-----FCDKYNSFTCCMGSGFENHSKYGGAIY-SHGSDDRSLFVNLYI 445

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S   WK   +    KV     +    R+TL     +   Q  +LNLR PVW    G   
Sbjct: 446 PSVLTWKEKGL----KVRLETVYPENGRVTLKVVEGER--QPLALNLRYPVWA-GEGIVV 498

Query: 568 SLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
            +NG    +   PG+F++   +W   D++ + +P++L T+ +    P+ A  +A+ +GP 
Sbjct: 499 KVNGTKQKITSKPGSFVTLERKWKAGDRIELNIPMNLYTKEM----PDNADRRAVFYGPT 554

Query: 627 LLAGHTSGEWDIKTGTARSLSALISP 652
           LLAG   GE +I+    R +   +SP
Sbjct: 555 LLAG-ALGEKEIE--PIRGVPVFVSP 577


>gi|427386203|ref|ZP_18882400.1| hypothetical protein HMPREF9447_03433 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726590|gb|EKU89454.1| hypothetical protein HMPREF9447_03433 [Bacteroides oleiciplenus YIT
           12058]
          Length = 616

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 188/560 (33%), Positives = 287/560 (51%), Gaps = 51/560 (9%)

Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
           +EV+L   W+ Q       ++ N+ +L  LD D L+ +FR TA LP+  +   GWE+P  
Sbjct: 37  EEVTLKSSWIKQR------EELNITFLKSLDPDRLLHNFRVTAGLPSNAEPLEGWESPKI 90

Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSF 219
            LRGHFVGHYLSA + +     +  + E++  ++  L +CQ   G  YLSAFP + FD+ 
Sbjct: 91  GLRGHFVGHYLSAVSSLVEKYKDLELVERLRYMIDELCKCQQSFGNSYLSAFPDKDFDAL 150

Query: 220 EA-LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
           EA    VWAPYYT +K++ GLLD Y    N +A  M   M  Y  NR+ K ++  ++E+ 
Sbjct: 151 EAKFTGVWAPYYTYNKVMQGLLDAYTHTGNQKAYDMLLDMAAYVDNRMSK-LSGETIEKM 209

Query: 279 WYSLN----EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHAN 334
            Y+++     E G MN+VLY+LY I+ +PKHL LA +FD+  F+  LA   D LS  H+N
Sbjct: 210 LYTVDANPQNEPGAMNEVLYKLYKISRNPKHLALAEIFDRNWFITPLAENKDILSGLHSN 269

Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA------------REF 382
           TH+ +V G   RY +TG+  Y    T F D++ + H YA G +S              E 
Sbjct: 270 THLVLVNGFAQRYSITGESKYYAASTNFWDMLISQHVYANGTSSGPRPNATTRTSVTAEH 329

Query: 383 WWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
           W  P  L +TL  E  E+C ++N  K++  +F WT    YAD Y     N VL+ Q    
Sbjct: 330 WGVPGHLCNTLTKEIAESCVSHNTQKLTSSIFTWTAAPKYADAYMNTFYNAVLASQ-SAH 388

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
            G  +Y LPLG   +K         K N F CC G+  E++S+L   IY+ ++     L+
Sbjct: 389 TGAYMYHLPLGSPRNKKY------LKDNDFACCSGSSAEAYSRLNSGIYYHDDS---ALW 439

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
           +  ++ S  +WK  +V L Q  +    +     +  T S+K++VG   +L L +P W  +
Sbjct: 440 VNLFVPSEVNWKEKNVRLEQNGN----FPKDTNICFTISTKKKVG--FALKLFIPSW--A 491

Query: 563 NGAQASLNGQNLPLPP-PGNFLSATERWSYND--KLTIQLPLSLRTEAIQDDRPEYASIQ 619
             A+  +NG+   +   P +++     W   D  KL       L+T       P+   + 
Sbjct: 492 KNAEVYINGEKQEIETFPSSYIDLNRNWRDKDEVKLIFHYDFHLKT------MPDNKDVL 545

Query: 620 AILFGPYLLAGHTSGEWDIK 639
           ++ +GP LLA  +  E  +K
Sbjct: 546 SLFYGPMLLAFESDEEVILK 565


>gi|337746495|ref|YP_004640657.1| hypothetical protein KNP414_02226 [Paenibacillus mucilaginosus
           KNP414]
 gi|336297684|gb|AEI40787.1| hypothetical protein KNP414_02226 [Paenibacillus mucilaginosus
           KNP414]
          Length = 749

 Score =  296 bits (759), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 203/571 (35%), Positives = 294/571 (51%), Gaps = 64/571 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           LH V + +S  L  A + N  YLL L+ D L+  FR+ A L      Y GWE+    + G
Sbjct: 8   LHKVRI-ESGPLKHAMELNASYLLNLEADRLLSRFREYAGLAPKAPHYEGWES--RGISG 64

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H +GHYLS  A M+AST    +  +++ VV  L +CQ   G+G++S  P   ELF   +A
Sbjct: 65  HTLGHYLSGCALMYASTGREELLSRVNYVVEELQQCQRADGSGFISGIPRGKELFQEVKA 124

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYF----YN 264
                    L   W P YT+HK+ AGL D Y+LA + +AL    K+  W+ + F    + 
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLAGSRKALEIEIKLGLWLDDVFSGLSHE 184

Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
           +VQ+V            L+ E GGMN+VL  L   + D + L LA  F     LG +A +
Sbjct: 185 QVQRV------------LHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAER 232

Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
            D L   HANT IP +IG+  +YEVTG+  Y  I  FF D V   HSY  GG S  E + 
Sbjct: 233 KDTLGGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHFG 292

Query: 385 DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
           +P +L D LG    ETC TYNMLK++RHLF+W    AYADYYERA+ N +L  Q+  + G
Sbjct: 293 EPDKLNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILGSQQPVD-G 351

Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
            + Y + L  G  K+     + +++  F CC G+G+ES S  G +IYF    N   L++ 
Sbjct: 352 RVCYFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSLYGSAIYFH---NGSALFVN 403

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL----TFSSKQEVGQLSSLNLRMPVWT 560
           Q++ S+ +W+   V L Q+     +    LR+      TF+ K          +R P W 
Sbjct: 404 QFVPSTVEWEEQGVRLTQETAFPENGRGVLRIRTAKPGTFAVK----------VRYPSWA 453

Query: 561 YSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
              G    +NGQ +     PG +++    W   D L    P++LR E++ D+ P+     
Sbjct: 454 -EPGISVKVNGQAVSADARPGGYVTVEREWQDGDTLEYDFPMTLRIESMPDN-PDRI--- 508

Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSLSALI 650
           A+L+GP +LAG   G  D      R+L++++
Sbjct: 509 ALLYGPLVLAGDL-GAIDAPQDGERALASVL 538


>gi|302548275|ref|ZP_07300617.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Streptomyces
           hygroscopicus ATCC 53653]
 gi|302465893|gb|EFL28986.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Streptomyces
           himastatinicus ATCC 53653]
          Length = 849

 Score =  296 bits (758), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 189/526 (35%), Positives = 279/526 (53%), Gaps = 34/526 (6%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
           Q  N  YL  +D++ L+ +FR    + +  +  GGWE+P +ELRGH  GH LS  A  +A
Sbjct: 72  QSRNTAYLRFVDINRLLHTFRLNVGIASSAQPCGGWESPTTELRGHSTGHLLSGLALTYA 131

Query: 179 STHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYTIH 233
           +T +  + +K   +V +L+ CQ K       TGYLSAFP   FD  EA   VWAPYYTIH
Sbjct: 132 NTGDTALLDKSRKLVSALAACQAKSPAAGYRTGYLSAFPENFFDRLEAGSGVWAPYYTIH 191

Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
           KI+AGL+DQY LA NA+AL+       +   R  ++    S ++    L  E GGMNDVL
Sbjct: 192 KIMAGLVDQYRLAGNAEALETVLRQAAWVDTRTARL----SYDQMQRVLETEYGGMNDVL 247

Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
             L++IT D + L +A  F        L+   D L+  HANT IP ++G+   +E   D 
Sbjct: 248 ADLHAITGDSRWLRVAERFTHARVFDPLSRNEDRLAGLHANTQIPKMVGALRLWEEGLDS 307

Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
            Y+ IG  F  IV   H+Y  GG S  E + +P  +A  L     E C +YNMLK++R +
Sbjct: 308 RYRTIGENFWKIVTDHHTYVIGGNSNGEAFHEPDAIAAQLSGSCCENCNSYNMLKLARLI 367

Query: 414 -FRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARST------HGW 465
            F   +     DYYER L N +L  Q   +  G  IY   L  G  K + +      + +
Sbjct: 368 HFHAPERTDLLDYYERTLFNQMLGEQDPDSAHGFNIYYTGLAPGSFKQQPSFMGPDPNQY 427

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
            T +++F C +G+G+E+ +K  D+IY   + +   L +  +I S   W+   +   Q   
Sbjct: 428 STDYDNFSCDHGSGMETHAKFADTIYTRGDRS---LLVNLFIPSELRWQEKGITWRQ--- 481

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLS 584
               +      TLT SS       +SL LR+ + ++++GA+A+LNG  LP  P PG++L 
Sbjct: 482 -TTGFPDQQTTTLTVSSGG-----ASLELRVRIPSWASGARAALNGATLPDQPKPGSWLI 535

Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
              +W   D++ + LP+ LR +   DD P+   IQA+L+GP +LAG
Sbjct: 536 IDRQWKTGDRVEVTLPMKLRLDPTPDD-PD---IQAVLYGPVVLAG 577


>gi|386723005|ref|YP_006189331.1| hypothetical protein B2K_12670 [Paenibacillus mucilaginosus K02]
 gi|384090130|gb|AFH61566.1| hypothetical protein B2K_12670 [Paenibacillus mucilaginosus K02]
          Length = 749

 Score =  296 bits (757), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 202/571 (35%), Positives = 294/571 (51%), Gaps = 64/571 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           LH V + +S  L  A + N  YLL L+ D L+  FR+ A L      Y GWE+    + G
Sbjct: 8   LHKVRI-ESGPLKHAMELNASYLLNLEADRLLSRFREYAGLAPKAPHYEGWES--RGISG 64

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H +GHYLS  A M+AST    +  +++ VV  L +CQ   G+G++S  P   ELF+  +A
Sbjct: 65  HTLGHYLSGCALMYASTGREELLSRVNYVVEELEQCQRADGSGFISGIPRGKELFEEVKA 124

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYF----YN 264
                    L   W P YT+HK+ AGL D Y+L  + +AL    K+  W+ + F    + 
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLTGSRKALEIEIKLGLWLDDVFSGLSHE 184

Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
           +VQ+V            L+ E GGMN+VL  L   + D + L LA  F     LG +A +
Sbjct: 185 QVQRV------------LHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAER 232

Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
            D L   HANT IP +IG+  +YEVTG+  Y  I  FF D V   HSY  GG S  E + 
Sbjct: 233 KDTLGGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHFG 292

Query: 385 DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
           +P +L D LG    ETC TYNMLK++RHLF+W    AYADYYERA+ N +L+ Q+  + G
Sbjct: 293 EPDKLNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILASQQPVD-G 351

Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
            + Y + L  G  K+     + +++  F CC G+G+ES S  G +IYF        L++ 
Sbjct: 352 RVCYFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSLYGSAIYFHSGST---LFVN 403

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL----TFSSKQEVGQLSSLNLRMPVWT 560
           Q++ S+ DW+   V L Q+     +    LR+      TF+ K          +R P W 
Sbjct: 404 QFVPSTVDWEEQGVRLTQETSFPENGRGVLRIRTAKPGTFAVK----------VRYPSWA 453

Query: 561 YSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
              G    +NGQ +     PG +++    W   D L    P++LR E++ D+ P+     
Sbjct: 454 -EPGISVKVNGQAVSADARPGGYVTVEREWQDGDTLEYDFPMTLRIESMPDN-PDRI--- 508

Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSLSALI 650
           A+L+GP +LAG   G  D      R+L++++
Sbjct: 509 ALLYGPLVLAGDL-GAIDAPQDGERALASVL 538


>gi|375148455|ref|YP_005010896.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361062501|gb|AEW01493.1| protein of unknown function DUF1680 [Niastella koreensis GR20-10]
          Length = 786

 Score =  296 bits (757), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 193/529 (36%), Positives = 283/529 (53%), Gaps = 43/529 (8%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
           +A + ++ YL +++ D L+  FR+ A L   G+ YGGWE+  S L GH +GHYLSA A  
Sbjct: 59  KAMEADVRYLQVIEPDRLLADFREHAGLKPKGEHYGGWEH--SGLAGHTLGHYLSACAMH 116

Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-------------LK 223
           +A++H+     K++ +V  L+ECQ K   GY+ A P E  DS  A             L 
Sbjct: 117 YAASHDKQFLGKVNYIVDELAECQPK-RNGYVGAIPKE--DSMWAEVEKGNIHSRGFDLN 173

Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
             W+P+YT+HKI+AGLLD Y+  DN +AL + T M ++  + ++ +    S++R  +   
Sbjct: 174 GAWSPWYTVHKIMAGLLDAYLYCDNKKALAVETGMADWTAHLLRNLPDS-SLQRMLFC-- 230

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGS 343
            E GGMNDVL   Y++T + K+L L++ F     L  LALQ D L   H+NT IP VIG 
Sbjct: 231 -EYGGMNDVLNNTYALTGEKKYLDLSYKFHDKRILDSLALQKDILPGKHSNTQIPKVIGC 289

Query: 344 QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTT 403
             RYE+T     K IG FF   V   H+YA GG S  E+     +L +TL     ETC T
Sbjct: 290 IRRYELTAGEKDKTIGDFFWQTVVNDHTYAPGGNSNYEYLGPAGQLNETLTDNTMETCNT 349

Query: 404 YNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH 463
           YNMLK++RHLF      +  DYYERAL N +LS Q  +  G+M Y +PL  G  K  S  
Sbjct: 350 YNMLKLTRHLFALQPTASLMDYYERALYNHILSSQDHST-GMMCYFVPLRMGTQKEFS-- 406

Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
                FN+F CC G+G+E+  K G++IY+  +G    LY+  +I+S   WK   VV+ Q+
Sbjct: 407 ---DSFNTFTCCVGSGMENHVKYGETIYY--QGADGSLYVNLFIASRLTWKEKGVVVEQQ 461

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPG--N 581
               +    Y+R+ +  +         +L +R P W    G   ++NG+      PG   
Sbjct: 462 TQ--LPESNYIRLAIKAARPVAF----TLRIRNPYWA-KQGVWIAVNGKEQTNLQPGADG 514

Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           + + T  W   D + ++  L L T ++    P+  +  AI +GP +LAG
Sbjct: 515 YFTITRTWKTGDAVIVKPSLQLYTRSM----PDNPNRLAIFYGPLVLAG 559


>gi|379720404|ref|YP_005312535.1| hypothetical protein PM3016_2500 [Paenibacillus mucilaginosus 3016]
 gi|378569076|gb|AFC29386.1| hypothetical protein PM3016_2500 [Paenibacillus mucilaginosus 3016]
          Length = 749

 Score =  295 bits (755), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 202/571 (35%), Positives = 295/571 (51%), Gaps = 64/571 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           LH V + +S  L  A + N  YLL L+ D L+  FR+ A L      Y GWE+    + G
Sbjct: 8   LHKVRI-ESGPLKHAMELNASYLLNLEADRLLSRFREYAGLEPKAPHYEGWES--RGISG 64

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H +GHYLS  A M+AST    +  +++ VV  L +CQ   G+G++S  P   ELF   +A
Sbjct: 65  HTLGHYLSGCALMYASTGREELLSRVNYVVEELQQCQRADGSGFISGIPRGKELFQEVKA 124

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYF----YN 264
                    L   W P YT+HK+ AGL D Y+LA + +AL    K+  W+ + F    + 
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLAGSRKALEIEIKLGLWLDDVFSGLSHE 184

Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
           +VQ+V            L+ E GGMN+VL  L   + D + L LA  F     LG +A +
Sbjct: 185 QVQRV------------LHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAER 232

Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
            D L   HANT IP +IG+  +YEVTG+  Y  I  FF D V   HSY  GG S  E + 
Sbjct: 233 KDTLGGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHFG 292

Query: 385 DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
           +P +L D LG    ETC TYNMLK++RHLF+W    AYADYYERA+ N +L+ Q+  + G
Sbjct: 293 EPDKLNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILASQQPVD-G 351

Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
            + Y + L  G  K+     + +++  F CC G+G+ES S  G +IYF    +   L++ 
Sbjct: 352 RVCYFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSLYGSAIYFH---SGSALFVN 403

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL----TFSSKQEVGQLSSLNLRMPVWT 560
           Q++ S+ +W+   V L Q+     +    LR+      TF+ K          +R P W 
Sbjct: 404 QFVPSTVEWEEQGVRLTQETAFPENGRGVLRIRTAKPGTFAVK----------VRYPSWA 453

Query: 561 YSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
              G    +NGQ +     PG +++    W   D L    P++LR E++ D+ P+     
Sbjct: 454 -EPGISVKVNGQAVSADARPGGYVTVEREWQDGDTLEYDFPMTLRIESMPDN-PDRI--- 508

Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSLSALI 650
           A+L+GP +LAG   G  D      R+L++++
Sbjct: 509 ALLYGPLVLAGDL-GAIDAPQDGERALASVL 538


>gi|393783247|ref|ZP_10371422.1| hypothetical protein HMPREF1071_02290 [Bacteroides salyersiae
           CL02T12C01]
 gi|392669526|gb|EIY63014.1| hypothetical protein HMPREF1071_02290 [Bacteroides salyersiae
           CL02T12C01]
          Length = 1022

 Score =  295 bits (754), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 191/549 (34%), Positives = 279/549 (50%), Gaps = 49/549 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLL-MLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
           LK++ L D      S    A   + ++L+  L  D  +  F   A LPT G  YGGWEN 
Sbjct: 54  LKQIRLLD------SPFKTAMNADRKWLMETLKPDRFLHRFHANAGLPTKGTIYGGWEN- 106

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE--L 215
            ++  G   GHY+SA + ++A+T    IK ++   +  L  CQ+K GTGY+ A P E  L
Sbjct: 107 -TDQSGFSFGHYISALSMLYATTGEEDIKIRLDYCISELKRCQDKRGTGYVGAIPNEDKL 165

Query: 216 FDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRV 266
           +D             L  VW P+Y +HK+ +GL+D Y+  +N  A  +   + ++  ++ 
Sbjct: 166 WDDVSKGIIDGRNFNLNNVWVPWYNLHKLWSGLIDAYIFGENETAKTIVIALTDWACDKF 225

Query: 267 QKVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
           + +      E  W + L  E GGMND LY +Y+IT D +HL +A+ F     L  L+ + 
Sbjct: 226 KDL-----TEEQWQNILTCEHGGMNDALYNVYAITGDTRHLEIANKFYHKKVLDPLSKRK 280

Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
           + L+  HANT IP VIG    YE+TG+  +  I ++F   V   HSY  GG S  E + +
Sbjct: 281 NELAGLHANTQIPKVIGISRSYELTGNQDHHTISSYFWHTVTHEHSYCIGGNSNYEHFVE 340

Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
           P +L+  L ++  ETC TYNMLK++RHLF W       D+YERAL N +L+ Q   E G+
Sbjct: 341 PGKLSGELSNKTTETCNTYNMLKLTRHLFAWNPSAELMDFYERALYNHILASQ-NPETGM 399

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
           + Y +PL      A S   +    N+FWCC GTG E+  K  + IY   E     LYI  
Sbjct: 400 VCYCVPLA-----ANSQKNYCNAENNFWCCVGTGFENHVKYAEQIYSHNENE---LYINL 451

Query: 506 YISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
           YI S  DW   ++ L Q  + P            T +  + V Q  + ++R P W  S G
Sbjct: 452 YIPSELDWSEKNMKLKQTNNFP-------DTDNTTITITETVPQTLTFHVRFPNWVQS-G 503

Query: 565 AQASLNG-QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
               +NG + +    PG+++S T  W  NDK+ I LP +L  E +  D+ +     A L 
Sbjct: 504 YSIKINGTEQVFNSTPGSYVSITREWKTNDKIEINLPKTLTKEQLLGDKYK----TAFLN 559

Query: 624 GPYLLAGHT 632
           GP +LAG T
Sbjct: 560 GPIVLAGKT 568


>gi|29827685|ref|NP_822319.1| protein [Streptomyces avermitilis MA-4680]
 gi|29604785|dbj|BAC68854.1| putative secreted protein [Streptomyces avermitilis MA-4680]
          Length = 854

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 189/526 (35%), Positives = 272/526 (51%), Gaps = 34/526 (6%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
           Q+ N  YL  +D+D L+ +FR    LP+  +  GGWE P  ELRGH  GH LS  A   A
Sbjct: 77  QRRNSAYLRFVDIDRLLHTFRTNVGLPSDAEPCGGWEGPGVELRGHSTGHLLSGLALAHA 136

Query: 179 STHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYTIH 233
           ST    +++K   +V +L+ECQ+       GTGYLSAFP   FD  EA   VWAPYYTIH
Sbjct: 137 STGEEALRDKGRRLVAALAECQSAAPAAGFGTGYLSAFPESFFDRLEAGSGVWAPYYTIH 196

Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
           KI+AGL++QY L    QAL++      +   R  K+    S E+    L  E GGMNDVL
Sbjct: 197 KIMAGLVEQYRLVGVGQALEVVLRQARWVDERTAKL----SYEQMQRVLETEFGGMNDVL 252

Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
             L+++T DP+ L +A  F        LA   D L+  HANT IP ++G+   +E     
Sbjct: 253 ADLHALTGDPRWLDVAERFTHARVFDPLAGNQDKLAGLHANTQIPKMVGALRLWEEGRAD 312

Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
            Y+ +   F  IV   H+Y  GG S  E + +P  +A  L     E C +YNMLK++R L
Sbjct: 313 RYRTVAENFWQIVTDHHTYVIGGNSNGEAFHEPDVIAGQLSDNTCENCNSYNMLKLTRLL 372

Query: 414 -FRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARST------HGW 465
            F         DYYER L N +L  Q   +E G  IY   L  G  K + +        +
Sbjct: 373 HFHAPDRTDLLDYYERTLLNQMLGEQDPDSEHGFAIYYTGLAPGSFKRQPSFMGPDPDVY 432

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
            T +++F C +GTG+E+ +K  D++Y  +  +   L +  ++ S   W++  +   Q   
Sbjct: 433 STDYDNFSCDHGTGMETPAKFADTVYSHDGRS---LRVNLFVPSEVVWRAKGISWRQ--- 486

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLS 584
               +      TLT SS +   +L    +R+P W  + GA+A+LNG+ LP  P PG++L+
Sbjct: 487 -TTRFPDRSSTTLTVSSGRAAHRLL---IRVPSW--AAGARATLNGRALPDRPQPGSWLA 540

Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
               W   D++ + LP+    EA  DD      +QA++ GP +LAG
Sbjct: 541 LERVWRTGDRVEVSLPMRTAVEATPDD----PDVQAVVHGPVVLAG 582


>gi|440694505|ref|ZP_20877120.1| hypothetical protein STRTUCAR8_01091 [Streptomyces turgidiscabies
           Car8]
 gi|440283503|gb|ELP70762.1| hypothetical protein STRTUCAR8_01091 [Streptomyces turgidiscabies
           Car8]
          Length = 747

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 191/553 (34%), Positives = 280/553 (50%), Gaps = 48/553 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENP 157
           L +V+L D       V  R +   LE+      D ++  FR  A L T G +  GGWE  
Sbjct: 90  LDQVALGD------GVFRRKRDLMLEFARSYPADRILAVFRANAGLDTRGAQPPGGWETA 143

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT---------GYL 208
              LRGHF GH+L+  AQ +A T  A +K K+  +V +L ECQ  +           G+L
Sbjct: 144 DGNLRGHFGGHFLTLVAQAYADTREAALKTKLDYLVTALGECQQALADHGSPRPSHPGFL 203

Query: 209 SAFPTE---LFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
           +A+P     L +S+     +WAPYYT HKI+ G LD + L  N QAL +A+ M ++ ++R
Sbjct: 204 AAYPETQFILLESYTTYPTIWAPYYTCHKIMRGFLDAHTLTGNQQALTIASKMGDWVHSR 263

Query: 266 VQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
           + + +    ++R W   +  E GGMN+VL  LY++T   +HL  A  FD    L   A  
Sbjct: 264 LSR-LPQAQLDRMWSIYIAGEYGGMNEVLADLYALTGRAEHLAAARCFDNTALLDACADN 322

Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
            D L   HAN HIP   G    ++ TG+  Y      F  +V    +Y+ GGT   E + 
Sbjct: 323 RDILDGRHANQHIPQFTGYIRLFDHTGEAEYATAARNFWGMVAGPRTYSLGGTGQGEMFR 382

Query: 385 DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
               +A TLG  N ETC TYNMLK+SR LF  T + AY DYYE+ LTN +L+ +R     
Sbjct: 383 ARNAIAATLGDNNAETCATYNMLKLSRQLFFHTPDPAYMDYYEKGLTNHILASRRDARST 442

Query: 445 V---MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGNVPG 500
           V   + Y + +G GV +           N+  CC GTG+E+ +K  DS+YF   +GN   
Sbjct: 443 VSPEVTYFVGMGPGVVREYD--------NTGTCCGGTGMENHTKYQDSVYFRSADGNA-- 492

Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
           LY+  Y++S+  W    +V++Q  D    +      TLTF   +E G    L LR+P W 
Sbjct: 493 LYVNLYLASTLRWPERGLVIDQTSD----FPGEGVRTLTF---REGGGSLDLKLRVPSWA 545

Query: 561 YSNGAQASLNG-QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
            + G   ++NG        PG++L+ +  W   D++T+  P  LR E   DD     ++Q
Sbjct: 546 -TGGFTVTVNGVPQQTAAVPGSYLTLSRNWQRGDRITVSAPYRLRIERALDD----PTVQ 600

Query: 620 AILFGPYLLAGHT 632
           ++ +GP LL   +
Sbjct: 601 SLFYGPVLLVARS 613


>gi|220928663|ref|YP_002505572.1| hypothetical protein Ccel_1236 [Clostridium cellulolyticum H10]
 gi|110588920|gb|ABG76968.1| CBM22- and dockerin-containing enzyme [Clostridium cellulolyticum
           H10]
 gi|219998991|gb|ACL75592.1| protein of unknown function DUF1680 [Clostridium cellulolyticum
           H10]
          Length = 955

 Score =  293 bits (749), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 186/542 (34%), Positives = 271/542 (50%), Gaps = 35/542 (6%)

Query: 98  FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
            LK+  +  V +  +  +  A    + YL  +D + L+  F+KTA L T    YGGWEN 
Sbjct: 34  LLKQFDMEQVKITDTYYV-NALNKEVAYLQAIDPNRLLVGFKKTAGLSTTYSYYGGWENN 92

Query: 158 ISELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVVFSLSECQNKIGTGYLSAFP 212
            + ++GH +GHY+SA AQ + +T      NA +K ++  ++  L  CQNK G GYL A P
Sbjct: 93  -TLIQGHTMGHYMSALAQAYKNTKSDPTVNADLKSRIDLIISELQACQNKNGNGYLFATP 151

Query: 213 TELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
              FD  E  A    W P+YT+HKI++GLLD Y    N  AL +AT +  + Y RV    
Sbjct: 152 ATQFDVVEGKASGSSWVPWYTMHKIMSGLLDIYKFGGNQTALTIATNLGNWIYKRVN--- 208

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
             +        L  E GGMND LY LY +T +  HL  AH FD+      +A   + L  
Sbjct: 209 -AWDSATQSRVLGVEYGGMNDCLYELYKLTGNGNHLTAAHKFDENSLFNTIAAGTNVLPG 267

Query: 331 FHANTHIPIVIGSQMRYEVTG--DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
            HANT IP  IG+  RY   G  +  Y      F  IV   H+Y TGG S  E + D  +
Sbjct: 268 KHANTTIPKFIGALNRYSTLGTSESSYLKAAQQFWAIVLKDHTYVTGGNSEDERFRDAGK 327

Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
           L     + N ETC   NMLK+++ LF+ T ++ YADYYE AL N +++ Q   E G+  Y
Sbjct: 328 LDAYRDNVNNETCNVNNMLKLTKELFKATGDVKYADYYENALINEIMASQN-PETGMATY 386

Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
              +G G  K  S     ++FN FWCC GTG+E+F+KL DS+Y+    N   LY+  Y+S
Sbjct: 387 FKAMGTGYFKVFS-----SQFNHFWCCTGTGMENFTKLNDSLYYN---NGSDLYVNMYLS 438

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS-NGAQA 567
           S+ +W    + L Q+ +  +S    +  T+  +S  EV     +  R P W  +      
Sbjct: 439 STLNWSEKGLSLTQQANLPLS--DKVTFTINSASSSEV----KIKFRSPAWIAAGQNITV 492

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
            +NG  + +     +L  +  W   D + + LP  +R   + D      +  A  +GP +
Sbjct: 493 KVNGTPINVDKANGYLDVSRVWQTGDTVELTLPTEVRVSRLTDS----PNTVAFTYGPVV 548

Query: 628 LA 629
           L+
Sbjct: 549 LS 550


>gi|345011855|ref|YP_004814209.1| hypothetical protein [Streptomyces violaceusniger Tu 4113]
 gi|344038204|gb|AEM83929.1| protein of unknown function DUF1680 [Streptomyces violaceusniger Tu
           4113]
          Length = 849

 Score =  292 bits (748), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 188/531 (35%), Positives = 274/531 (51%), Gaps = 34/531 (6%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
           Q  N  YL  +D+D L+ +FR    L +  +  GGWE+P +ELRGH  GH LS  A  +A
Sbjct: 72  QSRNTAYLRFVDIDRLLHTFRLNVGLSSAAQPCGGWESPTTELRGHSTGHLLSGLALTYA 131

Query: 179 STHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYTIH 233
           +T +   ++K   +V +L+ CQ +      G GYLSAFP   FD  EA   VWAPYYTIH
Sbjct: 132 ATGDTAPRDKGRALVSALAACQARSPAAGYGQGYLSAFPESFFDRLEAGTGVWAPYYTIH 191

Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
           KI+AGL+DQY LA NA+AL+       +   R  K+    S ++    L  E GGMNDVL
Sbjct: 192 KIMAGLVDQYRLAGNAEALQTVLRQAAWVDTRTGKL----SYDQMQRVLQTEFGGMNDVL 247

Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
             L+ IT D + L +A  F        LA   D L+  HANT IP ++G+   +E   D 
Sbjct: 248 ADLHEITGDSRWLKVAERFTHARVFDPLARNEDRLAGLHANTQIPKMVGAMRLWEEGLDS 307

Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
            Y+ IG  F  IV   H+Y  GG S  E + +P  +A  L     E C +YNMLK++R +
Sbjct: 308 RYRTIGENFWKIVTDHHTYVIGGNSNGEAFHEPDAIAAQLSDNACENCNSYNMLKLTRLI 367

Query: 414 -FRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARST------HGW 465
            F   +     DYYER L N +L  Q   +  G  IY   L  G  K + +      + +
Sbjct: 368 HFHAPERTDLLDYYERTLLNQMLGEQDPDSAHGFNIYYTGLAPGSFKQQPSFMGTDPNQY 427

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
            T +++F C +G+G+E+ +K  D+IY   + +   L +  +I S   W+   +   Q   
Sbjct: 428 STDYDNFSCDHGSGMETQAKFADTIYTYADRS---LLVNLFIPSELRWQDKGITWRQ--- 481

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLS 584
               +      TLT +S       +SL LR+ + +++ GA+A+LNG  L   P PG++L 
Sbjct: 482 -TTGFPDQQTTTLTVASGG-----ASLELRVRIPSWAAGARATLNGTTLADRPEPGSWLI 535

Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
              +W   D++ + LP+ L  +   DD      +QA+L+GP +LAG   G 
Sbjct: 536 IDRQWRTGDRVEVTLPMKLTFDPTPDD----PDVQAVLYGPVVLAGAYGGR 582


>gi|300785310|ref|YP_003765601.1| hypothetical protein AMED_3413 [Amycolatopsis mediterranei U32]
 gi|384148599|ref|YP_005531415.1| hypothetical protein RAM_17360 [Amycolatopsis mediterranei S699]
 gi|399537193|ref|YP_006549855.1| hypothetical protein AMES_3374 [Amycolatopsis mediterranei S699]
 gi|299794824|gb|ADJ45199.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
 gi|340526753|gb|AEK41958.1| hypothetical protein RAM_17360 [Amycolatopsis mediterranei S699]
 gi|398317963|gb|AFO76910.1| hypothetical protein AMES_3374 [Amycolatopsis mediterranei S699]
          Length = 740

 Score =  292 bits (747), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 196/533 (36%), Positives = 265/533 (49%), Gaps = 37/533 (6%)

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
           L Y   +D D L+ +FR  A L +  +  GGWE+P +ELRGH  GH LS  AQ +A+T +
Sbjct: 68  LAYFRFVDADRLLHTFRLNAGLASSAQPCGGWESPGTELRGHSTGHLLSGLAQAYANTGD 127

Query: 183 ATIKEKMSTVVFSLSECQ-----NKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
              K K   +V +L+ CQ          GYLSAFP   FD  E+ + VWAPYYT+HKI+A
Sbjct: 128 TAHKTKGDYLVNALAACQAAAPGRGFHAGYLSAFPENFFDRLESGQSVWAPYYTLHKIMA 187

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GLLDQY+LA N QAL +      +   R   +    SV +   +L  E GGM +VL  LY
Sbjct: 188 GLLDQYLLAGNQQALDVLLRKAAWTKTRTDPL----SVTQMQAALRTEFGGMPEVLTNLY 243

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
            +T D  HL  A  FD    L  LA   D LS FHANT IP ++G+   Y  TG   Y+ 
Sbjct: 244 QVTGDANHLATAQRFDHAQILDPLAANQDRLSGFHANTQIPKILGAIREYHATGTTRYRD 303

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           I   F  IV   H+Y  GG S  E++  P  +A  L     E C TYNMLK++R LF   
Sbjct: 304 IAVNFWRIVLDHHTYVIGGNSDGEYFQAPDAIASQLSDTTCEVCNTYNMLKLTRQLFFTN 363

Query: 418 KEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCY 476
               Y DYYE AL N +L  Q   +  G + Y  PL  G  K      +   ++ F C +
Sbjct: 364 PAPEYMDYYELALFNQILGEQDPDSSHGFVTYYTPLRAGGIKT-----YANDYDDFTCDH 418

Query: 477 GTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM 536
           GTG+ES +K  DS+YF        LY+  +I+S   W    + + Q      ++      
Sbjct: 419 GTGMESQTKFADSVYFFTGET---LYVNLFIASVLTWPGRGITVRQD----TTFPASSGT 471

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLT 596
            LT      +    +L LR+P WT  +GA   +NG     P PG+F +    W+  D + 
Sbjct: 472 KLTIGGSGHI----ALKLRIPKWT--SGAVVKVNGVAQGSPSPGSFCTIDRTWAAGDVVD 525

Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG-----HTSGEWDIKTGTAR 644
           + +P SL      DD    AS+ A  +G  +LAG     + S    ++TGT R
Sbjct: 526 VSVPASLTFPRANDD----ASVGAAKYGAIVLAGQYGSTNLSALPTLQTGTVR 574


>gi|427385120|ref|ZP_18881625.1| hypothetical protein HMPREF9447_02658 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727288|gb|EKU90148.1| hypothetical protein HMPREF9447_02658 [Bacteroides oleiciplenus YIT
           12058]
          Length = 778

 Score =  291 bits (746), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 199/599 (33%), Positives = 300/599 (50%), Gaps = 58/599 (9%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A   N E+LL L  D L+  FR  A L   G+ YGGWE+    + GH +GHYLSA A M+
Sbjct: 55  AMDKNGEWLLDLSPDRLLHRFRLNAGLTPKGEIYGGWES--RGVSGHTLGHYLSACAMMY 112

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-------------LKP 224
           A++ +   KE++  +V  L+ECQ+   TGY+   P E  D   A             L  
Sbjct: 113 AASGDKRFKERVDYIVKELAECQDARKTGYVGGIPDE--DKIWAEVSSGDIRSQGFDLNG 170

Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWY 280
            W P+YT+HK+ AGL+D Y  A + QA     K++ W V  F +         S E    
Sbjct: 171 GWVPWYTLHKLWAGLIDAYRYAGSEQAKEVGTKLSDWAVRSFGD--------LSEEDFQK 222

Query: 281 SLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV 340
            L  E GGMN+    +Y+IT +  +L LA  F     L  L  Q D L   H+NT +P +
Sbjct: 223 MLACEFGGMNESFADMYAITGNESYLKLARQFYHKAILDPLKEQRDELEGKHSNTQVPKI 282

Query: 341 IGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEET 400
           IG    YE+TGD     I TF+ D +   H+Y  GG S  E    P  L D L     ET
Sbjct: 283 IGEARLYELTGDKDMHTIATFYWDRIVNHHTYVNGGNSNYEHLGKPDCLNDRLSPFTSET 342

Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
           C TYNMLK+++HLF W  + AY DYYE+AL N +L+ Q   + G++ Y +PL  G  K  
Sbjct: 343 CNTYNMLKLTKHLFSWDPQAAYMDYYEQALYNHILASQN-PDDGMVCYSVPLESGTKKEF 401

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
           S     T+F+SFWCC  +GIE+  K  +S++F+   +  GL++  +I +S +WK   + +
Sbjct: 402 S-----TRFDSFWCCVASGIENHVKYAESVFFQSVKD-GGLFVNLFIPTSLNWKEKGMEV 455

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PP 579
             K++  +  D  ++++    SK+       L++R P W  + G + +LNG+   +   P
Sbjct: 456 --KLETQLPADNKVQISFKGKSKE-----FPLHIRYPRWA-TQGIKVTLNGKEEKVTGTP 507

Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH-TSGE--- 635
           G++ +    W  + +L I++P+ L T ++    P+ A    I +GP LLA    +GE   
Sbjct: 508 GSYFTLQGEWDTDTQLVIEIPMELYTVSM----PDNADRMGIFYGPVLLAAPLGTGELQA 563

Query: 636 WDIKT--GTARSLSALISPIPP---SFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPV 689
           +DI        S+   I+P+P    +F A      Q      + +     ++  + FPV
Sbjct: 564 YDIPCFISDTESIVQSIAPVPDKPLTFTANTTANAQLLLVPFYTIHGQKHAVYFDRFPV 622


>gi|345851934|ref|ZP_08804893.1| secreted protein [Streptomyces zinciresistens K42]
 gi|345636594|gb|EGX58142.1| secreted protein [Streptomyces zinciresistens K42]
          Length = 867

 Score =  291 bits (745), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 205/570 (35%), Positives = 280/570 (49%), Gaps = 54/570 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L    L +V L +S  L   ++T+  YLL +D D L+ +FR TA LP+  +  GGWE P 
Sbjct: 63  LDAFGLSEVRLLESPFLANMRRTS-AYLLFVDADRLLHTFRLTAGLPSSAQPCGGWEAPD 121

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPT 213
            +LRGH  GH LSA AQ  A T      EK   +V +L+ECQ          GYLSAFP 
Sbjct: 122 VQLRGHTTGHLLSALAQAHAHTGERAYAEKGRALVAALAECQRAAPAAGFTRGYLSAFPE 181

Query: 214 ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWM----VEYFYNR 265
            +F   EA    WAPYYT+HKI+AGLLDQY+LA + QAL    +MA W         Y +
Sbjct: 182 SVFARLEAGGKPWAPYYTLHKIMAGLLDQYLLAGDRQALDVLREMAAWAEARTAPLPYPQ 241

Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
           +Q V+ +            E GGMNDVL RLY  T DP HL  A  FD       LA   
Sbjct: 242 MQNVLRV------------EFGGMNDVLMRLYLETGDPAHLRTARRFDHEDLYAPLAAGR 289

Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
           D L+  HANT I  ++G+   YE TGD  Y  I   F   V   HSYA GG S +E +  
Sbjct: 290 DELAGRHANTEIAKIVGTVPSYEATGDTRYLDIADTFWTTVVRHHSYAIGGNSNQELFGP 349

Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQR-GTEP 443
           P  +   L     E C +YNMLK+ R LF    + A Y D+YE  L N +L  Q   +  
Sbjct: 350 PDEIVSRLSDVTCENCNSYNMLKLGRGLFLHRPDRAGYMDHYEWTLYNQMLGEQDPASAH 409

Query: 444 GVMIYMLPLGRGVSKARSTHGWG-------TKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
           G + Y   L  G S+     G G       + +++F C +GTG+E+ +K  DS+YF   G
Sbjct: 410 GFVTYYTGLWAG-SRREPKAGLGSAPGSYSSDYDNFSCDHGTGLETHTKFADSVYFRSRG 468

Query: 497 ---NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
               VP LY+  +I S   W+   V + QK     S+    R  LT  + +      +L 
Sbjct: 469 TRDGVPSLYVNLFIPSEVRWRQTGVTVRQK----TSYPSEGRTRLTVVAGR---ARFALR 521

Query: 554 LRMPVWTYSNGAQASL--NGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
           +R+P W    G +A L  NG+ +     PG + +    W   D + + LP       +  
Sbjct: 522 IRIPSWVAGTGREAVLEVNGRGVAARLRPGTYATVERTWHTGDTVDLTLP----RRPVWT 577

Query: 611 DRPEYASIQAILFGPYLLAGHTSGEWDIKT 640
             P+   ++++ +GP +LAG   G+ D+ T
Sbjct: 578 AAPDNPQVRSVSYGPLVLAGEY-GDDDLAT 606


>gi|251798261|ref|YP_003012992.1| hypothetical protein Pjdr2_4282 [Paenibacillus sp. JDR-2]
 gi|247545887|gb|ACT02906.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
          Length = 758

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 182/515 (35%), Positives = 267/515 (51%), Gaps = 30/515 (5%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A Q  L+YL   DVD L+  FR+T+ L      Y GWEN  +E+RGH +GHYL+A +Q +
Sbjct: 28  AFQKELDYLRSYDVDRLLAGFRETSGLQPKADKYPGWEN--TEIRGHTLGHYLTAVSQAY 85

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A T ++ + EK+  +V  L+E Q +   GYLSAFP  LFD+ E  KP W P+YT+HKI+A
Sbjct: 86  AQTQDSGLLEKLKYLVAELAEAQQE--NGYLSAFPETLFDNVENRKPAWVPWYTMHKIIA 143

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+  Y      QA ++ + + ++  +R       +S E     L  E GGMND +Y LY
Sbjct: 144 GLIAVYQATKLQQAYEVVSRLGDWVADRA----CSWSEELQATVLAVEYGGMNDCMYDLY 199

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL--Y 355
            +T +  HL  AH FD+      L    D L   HANT IP  IG+  RY   G+    Y
Sbjct: 200 KLTGNNLHLEAAHKFDEISLFEALREGKDVLKGKHANTMIPKFIGALNRYLTLGESERGY 259

Query: 356 KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
                 F D V   HSY TGG S  E + +P  L         ETC +YNMLK+++ LF+
Sbjct: 260 LEAAVNFWDTVVYHHSYLTGGNSECEHFGEPDILDGKRSDVTCETCNSYNMLKLTKELFK 319

Query: 416 WTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCC 475
            T+   YAD+YER   N +LS Q   E G+ +Y  P+  G  K  S     + F  FWCC
Sbjct: 320 LTQNSKYADFYERTYINAILSSQ-NPETGMTMYFQPMATGYFKIYS-----SPFEHFWCC 373

Query: 476 YGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR 535
            GTG+ESF+KL DSIYF  + N   LY+ Q+ SS  DW     V+ Q         P+  
Sbjct: 374 TGTGMESFTKLNDSIYFHLDHN---LYVNQFYSSRLDWTEQQTVVTQTTSL-----PHSD 425

Query: 536 MTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKL 595
           + + F+   +  +  ++++R+P W  +      LNG+ +P      ++     W   D +
Sbjct: 426 L-VHFTVGTDSPKRLAIHIRVPSWA-AGEVDILLNGETVPASVQQQYVVLDRIWKDGDTI 483

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
             ++P+ +   ++    P+   +  + +GP +L+ 
Sbjct: 484 EARIPMKVSFSSL----PDAPHVIGLQYGPIVLSA 514


>gi|376260753|ref|YP_005147473.1| hypothetical protein [Clostridium sp. BNL1100]
 gi|373944747|gb|AEY65668.1| hypothetical protein Clo1100_1435 [Clostridium sp. BNL1100]
          Length = 743

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 185/518 (35%), Positives = 268/518 (51%), Gaps = 31/518 (5%)

Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
           L  A +  +EYL   D D L+  F KT  L    K Y GWE+  +E+RGH +GHYL+A A
Sbjct: 11  LVNAFKKEIEYLESFDCDKLLSCFYKTKGLAPKAKNYHGWED--TEIRGHTMGHYLTALA 68

Query: 175 QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK 234
           Q +++T+++ I E++  ++  LS CQ    +GYLSAFP E FD  E  KPVW P+YT+HK
Sbjct: 69  QAYSATNDSKIYERLQYLLKELSLCQ--FESGYLSAFPEEFFDRVENRKPVWVPWYTMHK 126

Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLY 294
           I+ GL+  Y L     AL + + + ++ ++R  K    ++ E H   L  E GGMND LY
Sbjct: 127 IITGLISVYKLTKIETALNIVSGLGDWVFSRTDK----WTPEIHANVLAVEYGGMNDCLY 182

Query: 295 RLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP- 353
            LY IT + KH   AH+FD+      +    D L++ HANT IP  +G+  R+   G+  
Sbjct: 183 ELYKITGNEKHSAAAHMFDEIELFKEIHDGKDILNNRHANTTIPKFLGALNRFLAIGEEE 242

Query: 354 -LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
             Y      F  IV  +HSY TGG S  E + +P  L     S N ETC TYNMLK++R 
Sbjct: 243 QFYLDTCKEFWSIVTNNHSYVTGGNSEWEHFGEPNILDAERTSTNCETCNTYNMLKMTRV 302

Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
           LF+ T +  YAD+YE    N +LS Q   + G+ +Y  P+  G  K      +   F  F
Sbjct: 303 LFKITGDKKYADFYENTFINAILSSQ-NPDTGMTMYFQPMATGYFKV-----YSKPFEHF 356

Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
           WCC GTG+E+F+KL +SIYF EE     LY+  Y S+  +W+   V + Q  D I   D 
Sbjct: 357 WCCTGTGMENFTKLNNSIYFHEEDR---LYVNMYYSTLLNWEEKCVRITQNSD-IPGTD- 411

Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYN 592
                 +F  + E     +L LR+P W  +     ++N           +      W  N
Sbjct: 412 ----RASFIIEAETETEFTLCLRIPTW--AKDVNINVNKNPSLFTEERGYALINRTWKDN 465

Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           D  T+++   +  E +    P+  +  A  +GP +L+ 
Sbjct: 466 D--TVEINFKIEPELVS--LPDNPNAVAFTYGPVVLSA 499


>gi|374984433|ref|YP_004959928.1| secreted protein [Streptomyces bingchenggensis BCW-1]
 gi|297155085|gb|ADI04797.1| secreted protein [Streptomyces bingchenggensis BCW-1]
          Length = 875

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 185/526 (35%), Positives = 274/526 (52%), Gaps = 34/526 (6%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
           Q  N  YL  +D+D L+ +FR    L +  +  GGWE+P +ELRGH  GH LS  A  +A
Sbjct: 99  QSRNTAYLRYVDIDRLLHTFRLNVGLASSAQPCGGWESPTTELRGHSTGHLLSGLALSYA 158

Query: 179 STHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYTIH 233
           +T +  + +K   +V +L+ CQ K      G GYLSAFP   FD  E+   VWAPYYTIH
Sbjct: 159 NTGDTALLDKGRKLVSALAACQAKSPAAGYGQGYLSAFPENFFDRLESGSGVWAPYYTIH 218

Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
           KI+AGL+DQ+ LA NA+AL +      +   R  K+      ++    L  E GGMN+VL
Sbjct: 219 KIMAGLVDQHRLAGNAEALDVVERQAAWVDTRTGKL----GYDQMQRVLQTEFGGMNEVL 274

Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
             L++IT D + L +A  F        LA   D L+  HANT IP ++G+   +E   + 
Sbjct: 275 ADLHAITGDTRWLRVAERFTHARVFDPLARNEDQLAGLHANTQIPKMVGALRLWEQGLNS 334

Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
            Y+ IG  F  IV   H+Y  GG S  E + +P  +A  L +   E C +YNMLK++R +
Sbjct: 335 RYRTIGENFWKIVTDHHTYVIGGNSNGEAFHEPDAIAAQLSNNCCENCNSYNMLKLTRLI 394

Query: 414 -FRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARST------HGW 465
            F         DYYER L N +L  Q   +  G  IY   L  G  K + +      + +
Sbjct: 395 HFHAPDRTDLLDYYERTLFNQMLGEQDPDSAHGFNIYYTGLAPGAFKQQPSFMGTDPNQY 454

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
            T +N+F C +G+G+E+ +K  D+IY   + +   L +  +I S   W+   +   Q   
Sbjct: 455 STDYNNFSCDHGSGMETQAKFADTIYTYADRS---LLVNLFIPSELRWQEKAITWRQN-- 509

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLS 584
               +      TLT +S       +SL LR+ +  ++ GA+A+LNG  LP  P PG++L 
Sbjct: 510 --TGFPDQQTTTLTVASGA-----ASLELRVRIPAWATGARAALNGTTLPDQPKPGSWLV 562

Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
               W   D++ + LP++L+ +   DD      +QA+L+GP +LAG
Sbjct: 563 IDRSWKAGDRVDVTLPMALKLDPTPDD----PDVQAVLYGPVVLAG 604


>gi|383779461|ref|YP_005464027.1| hypothetical protein AMIS_42910 [Actinoplanes missouriensis 431]
 gi|381372693|dbj|BAL89511.1| hypothetical protein AMIS_42910 [Actinoplanes missouriensis 431]
          Length = 777

 Score =  290 bits (743), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 199/554 (35%), Positives = 283/554 (51%), Gaps = 43/554 (7%)

Query: 96  GNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGW 154
           GN   E     V L  S +L   Q   + YL  +DV+ +++ FR    L T G A  GGW
Sbjct: 48  GNAASEFMPGQVRLTASRLL-DNQNRTMNYLRFVDVNRMLYVFRANHRLSTAGAAANGGW 106

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQ--NKIG---TGYLS 209
           + P    R H  GH+L+A AQ +A T + T ++K   +V  L++CQ  N +     GYLS
Sbjct: 107 DAPNFPFRSHMQGHFLTAWAQAYAYTGDTTCRDKADYMVAELAKCQANNAVAGFNAGYLS 166

Query: 210 AFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNR 265
            FP    D+ E+ KP+   YY IHK LAGLLD + L  N QA    LK+A W V++   R
Sbjct: 167 GFPESDLDAVESGKPIAVSYYCIHKTLAGLLDVWRLIGNTQAKDVLLKLAGW-VDWRTGR 225

Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
           +       S  +   +L  E GGMN+VL  LY  T D + L +A  FD       LA   
Sbjct: 226 L-------SYSQMQTTLQTEFGGMNEVLANLYQQTGDARWLRVAQRFDHAAIFDPLAANR 278

Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
           D L+  HANT+IP  +G+   ++ TG   Y+ I     +I   +H+YA GG S  E +  
Sbjct: 279 DELNGKHANTNIPKWVGAIREFKATGTTRYRDIAGNAWNITVGAHTYAIGGNSQAEHFKA 338

Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEP- 443
           P  +A  L ++  E C TYNMLK++R L++     A Y D+YE AL N ++  Q   +  
Sbjct: 339 PNAIAGYLTNDTCEQCNTYNMLKLTRELWQLDPNRAGYFDFYENALYNHLIGAQNPADSH 398

Query: 444 GVMIYMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
           G + Y  PL     RGV  A     W T +NSFWCC GTGIE+ +KL DSIYF       
Sbjct: 399 GHITYFTPLKAGGRRGVGPAWGGGTWSTDYNSFWCCQGTGIETNTKLMDSIYFRGG---T 455

Query: 500 GLYIIQYISSSFDW-KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            L +  Y+ S+ +W + G  V      P+         T TF+    V     +  R+P 
Sbjct: 456 TLTVNLYVPSTLNWSERGLTVTQTTAYPVGD-------TSTFTLSGSVSGSWGIRFRIPA 508

Query: 559 WTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
           W  + GA  ++NG N  +   PG++ + T  W+  D +T++LP+ +  +A  D+    A 
Sbjct: 509 W--AAGATIAVNGANQNITVTPGSYATVTRTWADGDTITVRLPMRVIIKAANDN----AD 562

Query: 618 IQAILFGPYLLAGH 631
           IQAI +GP +LAG+
Sbjct: 563 IQAITYGPSVLAGN 576


>gi|256394133|ref|YP_003115697.1| hypothetical protein Caci_4996 [Catenulispora acidiphila DSM 44928]
 gi|256360359|gb|ACU73856.1| protein of unknown function DUF1680 [Catenulispora acidiphila DSM
           44928]
          Length = 846

 Score =  290 bits (742), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 199/519 (38%), Positives = 260/519 (50%), Gaps = 38/519 (7%)

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
           L YL  +D D L++ FR T  + T     GGWE+P  ELRGH  GH +SA AQ +AST +
Sbjct: 84  LAYLRFVDPDRLLYMFRTTVGIATSASPCGGWEDPTEELRGHSTGHIMSALAQAYASTGD 143

Query: 183 ATIKEKMSTVVFSLSECQNKIG-----TGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           +T+K K    V SL+ CQ         TGYLSAFP   FD  E+ + VWAPYYTIHKI+A
Sbjct: 144 STLKSKGDYFVSSLAACQAASPAAGFHTGYLSAFPESFFDRLESGQSVWAPYYTIHKIMA 203

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GLLDQY++A N QAL +   M  +   R   +    S  +    L  E GGM +VL  LY
Sbjct: 204 GLLDQYLVAGNTQALTVLKGMAAWVKTRTDPL----SHSQMQAVLQTEFGGMPEVLAHLY 259

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
            +T D   L  A  FD       LA   D L+ FHANT +P +IG+   Y  TG   Y  
Sbjct: 260 QVTGDANTLTAAQRFDHAQIEDPLAAGTDQLAGFHANTQVPKIIGALREYLATGTARYLT 319

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL-FRW 416
           I   F  I    H Y  GG S  E++  P  +A  L +   E C TYN LK+SR L F  
Sbjct: 320 IAQNFWAITTGHHMYEIGGFSNGEYFQTPNAIASQLSNTTCEVCVTYNELKLSRGLFFTD 379

Query: 417 TKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCC 475
               AY DYYER L N VL  Q   +  G + Y  PL  G  K  S       +N F C 
Sbjct: 380 PTRAAYLDYYERGLFNTVLGQQDPASSHGFVCYYTPLQPGGYKTYS-----NDYNDFTCD 434

Query: 476 YGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYL 534
           +GTG+ES +K  DSIYF    N   LY+  +I+S   W    + + Q    P  S     
Sbjct: 435 HGTGMESNTKYADSIYFY---NGETLYVNLFIASQLAWPGRAITVRQDTTFPAASSS--- 488

Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG--QNLPLPPPGNFLSATERWSYN 592
           R+T+T       G + +L +R+P W   +G    +NG  QNL    PG +L+    W+  
Sbjct: 489 RLTIT-----GAGHI-ALKIRVPSW--CSGMTVKVNGTLQNL-TATPGTYLTIDRTWASG 539

Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
           D + + LP  L      DD    +++Q + +G  +LAG 
Sbjct: 540 DVVDLALPAKLTFVPAPDD----STVQVVKYGGIVLAGQ 574


>gi|334364979|ref|ZP_08513951.1| conserved hypothetical protein [Alistipes sp. HGB5]
 gi|313158812|gb|EFR58195.1| conserved hypothetical protein [Alistipes sp. HGB5]
          Length = 778

 Score =  290 bits (741), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 191/540 (35%), Positives = 285/540 (52%), Gaps = 38/540 (7%)

Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
           V L+DV +     L  AQ+ +  +L  +D D  +  FR  A L      YGGWE+  +  
Sbjct: 45  VPLNDVRITGGPFL-HAQEMDRRWLDSMDPDRYLSGFRSEAGLEPKAPRYGGWES--AGC 101

Query: 162 RGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE--LFDSF 219
            GH  GH+LSA+A M+A+T +  + +K++  +  L+ECQ K GTG L+ F     LF   
Sbjct: 102 SGHGFGHFLSAAAMMYAATGDRALLDKINYSIDGLAECQQKEGTGLLAGFERSRALFAEL 161

Query: 220 EA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
           E          L   W P+YT+HK+ AGL+D      NA+AL +     ++    V K+ 
Sbjct: 162 ERGDIRSQGFDLNGGWVPFYTLHKMYAGLVDVCRYTPNAKALTVLVRFADWLDGLVAKL- 220

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
              S E+    L  E GG+ + L  +Y +T + K+L LA  FD    L  LA   D L  
Sbjct: 221 ---SDEQMDKILICEHGGITESLADIYVLTGERKYLELARRFDHREILRPLAAGVDSLPG 277

Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
            HANT IP ++G+   YE +GD  Y+ I  +F   V   HSYA GG S  E +  P  LA
Sbjct: 278 KHANTQIPKIVGAVREYECSGDERYRRIADYFWHRVVGFHSYAIGGNSEYEHFGAPGMLA 337

Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
           + L     ETC TYNMLK+++HL++    +  ADYYERAL N +L+ Q   + G++ YM 
Sbjct: 338 NRLSDGTCETCNTYNMLKLTKHLYQLDPTVRRADYYERALYNQILASQ-NPDDGMVCYMS 396

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           P+G G  K     G+   F+SFWCC G+G+E+ ++ G+ IYF +      LY+  YI S+
Sbjct: 397 PMGSGHRK-----GFCLPFDSFWCCVGSGMENHARYGEFIYFTDARE--NLYVNLYIPST 449

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
            DWKS  V + Q  D   S +  LR+ ++ +      Q   LNLR P W  + G + ++N
Sbjct: 450 LDWKSRGVKVEQLTDFPCSDEVRLRVEMSGA------QRFVLNLRYPEWA-AEGYELTVN 502

Query: 571 GQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           G+ +     PG+++S   +W   D++   L  SL +E I  D    ++++A  +GP +L+
Sbjct: 503 GRPVKQKAKPGSYISVNRKWRSGDEVRFVLRQSLHSEPIPGD----STLRAYFYGPVVLS 558


>gi|456393067|gb|EMF58410.1| putative glycosylase [Streptomyces bottropensis ATCC 25435]
          Length = 714

 Score =  290 bits (741), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 188/530 (35%), Positives = 270/530 (50%), Gaps = 44/530 (8%)

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMWASTH 181
           L Y      D ++  FR  A L T G +  GGWE     LRGH+ GH+L+  AQ +A T 
Sbjct: 75  LNYARSYPADRILAVFRANAGLDTRGARPPGGWETSDGNLRGHYGGHFLTLVAQAYADTR 134

Query: 182 NATIKEKMSTVVFSLSECQNKIGT---------GYLSAFPTE---LFDSFEALKPVWAPY 229
            A +K K+  +V +L ECQ  +           G+L+A+P     L +S+     +WAPY
Sbjct: 135 EAALKSKLDQLVGALGECQAALAERGSPRPSHPGFLAAYPETQFILLESYATYPTIWAPY 194

Query: 230 YTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETGG 288
           YT HKI+ GLLD + LA NAQAL + + M ++ ++R+   +    +ER W   +  E GG
Sbjct: 195 YTCHKIMRGLLDAHTLAGNAQALTIVSRMGDWVHSRL-GALPRAQLERMWSLYIAGEYGG 253

Query: 289 MNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYE 348
           MN+VL  LY++T   +HL  A  FD    L   A   D L   HAN HIP   G    ++
Sbjct: 254 MNEVLADLYALTGKAEHLAAARCFDNTALLDACAQDRDILDGRHANQHIPQFTGYLRLFD 313

Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLK 408
            TG+  Y      F  +V    +Y+ GGT   E +     +A TL  +N ETC TYNMLK
Sbjct: 314 ETGEERYAEAARNFWGMVAGPRTYSLGGTGQGEMFKARGAIAATLDDKNAETCATYNMLK 373

Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGT----EPGVMIYMLPLGRGVSKARSTHG 464
           +SRHLF    + A  DYYER LTN +L+ +R T     P V  Y + +G GV +      
Sbjct: 374 LSRHLFFREPDAARMDYYERGLTNHILASRRDTASTSSPEV-TYFVGMGPGVVREYG--- 429

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
                N+  CC GTG+E+ +K  DS+YF   +GN   LY+  Y++S+  W    +V+ Q 
Sbjct: 430 -----NTGTCCGGTGMENHTKYQDSVYFRSADGNA--LYVNLYLASTLRWPERGLVVEQ- 481

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNF 582
                ++      TLTF   +EV     L LR+P W  + G   ++NG    +   PG++
Sbjct: 482 ---TSAYPAEGVRTLTF---REVRGTLDLRLRVPSWA-TGGFTVTVNGVRQQVEATPGSY 534

Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
           L+ +  W   D++ I  P  LR E   DD     ++Q++ FGP LL   +
Sbjct: 535 LTLSRNWRRGDRVGISAPYRLRVERALDD----PTVQSVFFGPLLLVAQS 580


>gi|374991816|ref|YP_004967311.1| secreted protein [Streptomyces bingchenggensis BCW-1]
 gi|297162468|gb|ADI12180.1| secreted protein [Streptomyces bingchenggensis BCW-1]
          Length = 858

 Score =  289 bits (740), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 199/552 (36%), Positives = 277/552 (50%), Gaps = 41/552 (7%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L    L  V L +S  L   ++T L YL  +D + L+ +FR    LP+  +  GGWE P 
Sbjct: 55  LAPFPLSAVRLLESPFLANMRRT-LAYLRFVDPERLLHTFRLNVQLPSTAQPCGGWEAPN 113

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPT 213
             LRGH  GH LSA A   A T   T  +K   +V +L+ECQ         TGYLSAFP 
Sbjct: 114 VLLRGHSTGHLLSALAFAHAHTGEQTYADKARGIVAALAECQAASPGAGYRTGYLSAFPE 173

Query: 214 ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV--IT 271
            +FD  EA    WAPYYTIHKI+AGLLDQ+ L+ N QAL++   M  +  +R   +   T
Sbjct: 174 RIFDELEAGGKPWAPYYTIHKIMAGLLDQHRLSGNDQALEVLRGMAAWVDSRTAPLDEAT 233

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
           M  +      L  E GGMN+VL  LY +T DP HL  A  FD     G L    D L   
Sbjct: 234 MQRL------LGVEFGGMNEVLAGLYLVTGDPVHLRTARRFDHQSLYGPLDEGRDELDGR 287

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           HANT I  ++G+   Y  TGDP Y  I   F DIV   HSY  GG S +EF+  P ++  
Sbjct: 288 HANTEIAKIVGAAEEYRATGDPRYLRIARNFWDIVVRDHSYVIGGNSNQEFFGPPGQIVS 347

Query: 392 TLGSENEETCTTYNMLKVSRHLF-RWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYM 449
            L  +  E C +YNMLK+ R LF       AY D+YE  L N +L  Q   ++ G + Y 
Sbjct: 348 RLSEDTCENCNSYNMLKIGRQLFLHEPGRAAYMDHYEWTLYNQMLGEQDPDSDHGFVTYY 407

Query: 450 LPLGRGVSKARSTHGWGT-------KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
             L  G S+ +   G G+        +++F C +GTG+E+ +K  D+IYF +E +   LY
Sbjct: 408 TGLWAG-SRRQPKGGLGSAPGSYSGDYDNFSCDHGTGMETHTKFADTIYFRDE-HAGALY 465

Query: 503 IIQYISSSFDW-KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
           +  +I S   W + G  ++ +   P       +R+T+      E G   +L +R+P W  
Sbjct: 466 VNLFIPSEVTWAERGFRLVQRSGYPDTD---TVRLTVA-----EGGGRLALKVRVPGWLA 517

Query: 562 SNGAQASLNGQNLPL---PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
             G +A +     P+   P PG +L+   RW   D + +  P  L    +    P+   I
Sbjct: 518 DAGPRARVLVAGRPVDATPVPGRYLTLDRRWRTGDTVELTFPREL----VWRPAPDNPHI 573

Query: 619 QAILFGPYLLAG 630
           +A+ +GP +LAG
Sbjct: 574 KAVSYGPLVLAG 585


>gi|333382563|ref|ZP_08474231.1| hypothetical protein HMPREF9455_02397 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332828505|gb|EGK01205.1| hypothetical protein HMPREF9455_02397 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 644

 Score =  289 bits (740), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 192/545 (35%), Positives = 284/545 (52%), Gaps = 45/545 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
           LK+V L D    Q+       +   +++L L VD L+ SFR TA +           K  
Sbjct: 46  LKDVRLLDSPFRQN------MERESKWILSLGVDRLLHSFRNTAGVYAGREGGYMTIKKL 99

Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI----GTGY 207
           GGWE+   ELRGH +GH +S  A ++AST +   K K  ++V  L+E Q+ +      GY
Sbjct: 100 GGWESLDCELRGHSIGHIMSGLAYLYASTGDERYKIKADSLVAGLAEVQDILIENGQKGY 159

Query: 208 LSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
           +SA+P  L +   A K VWAP+YT+HK+ AGL+DQY+  DN +AL +      + Y   Q
Sbjct: 160 ISAYPENLINRNIAGKSVWAPWYTLHKVYAGLIDQYLYCDNKEALDIMKEAASWAY---Q 216

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
           K++ + S E+    L  E GG+N+  Y LY+IT +P+H   A  F     +  LA     
Sbjct: 217 KLMPL-SEEQRALMLRNEFGGVNEAFYNLYAITGNPEHKKSAEFFYHADVIDPLAEHKAD 275

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L   HANT IP VIG    YE+      K I  FF + V    +Y TGG S +E +    
Sbjct: 276 LYFKHANTFIPKVIGEARNYELHNSERSKDIANFFWNTVIDHQTYCTGGNSHKEKFIHSD 335

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
            ++  L    +ETC T NMLK++RHLF W     YADYYERAL N +L  Q+  + G++ 
Sbjct: 336 SISKNLTGYTQETCNTNNMLKLTRHLFCWDANAKYADYYERALYNHILG-QQDPQSGMVA 394

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y LP+  G  K  S     T  NSFWCC GTG E+ +K G++IY+ +     GLY+  +I
Sbjct: 395 YFLPMLPGAHKVYS-----TPENSFWCCVGTGFENHAKYGEAIYYHDNN---GLYVNLFI 446

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S   WK   + + Q+     ++     + LT ++ +++     + LR P WT  +  + 
Sbjct: 447 PSELTWKEKGIKIKQE----TAFPEEGNICLTVTTDKDIKM--PVYLRYPSWT--SNVEV 498

Query: 568 SLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLR-TEAIQDDRPEYASIQAILFGP 625
            +NG+   +   P  +++    W   DK+ +  P+ L  TE   +D P+ A   AI++GP
Sbjct: 499 KVNGKKTKIKQSPSGYITIDRTWKNGDKIEVHYPMHLYLTET--NDNPDKA---AIMYGP 553

Query: 626 YLLAG 630
            +LAG
Sbjct: 554 LVLAG 558


>gi|376260258|ref|YP_005146978.1| hypothetical protein [Clostridium sp. BNL1100]
 gi|373944252|gb|AEY65173.1| hypothetical protein Clo1100_0916 [Clostridium sp. BNL1100]
          Length = 952

 Score =  289 bits (739), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 180/517 (34%), Positives = 262/517 (50%), Gaps = 34/517 (6%)

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH- 181
           + YL  +D + L+  F+K A L T    YGGWEN  + ++GH +GHY+SA AQ + +T  
Sbjct: 58  VAYLRAIDPNRLLVGFKKAAGLSTTYSYYGGWENN-TLIQGHTMGHYMSALAQAYKNTKS 116

Query: 182 ----NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFE--ALKPVWAPYYTIHKI 235
               NA +K ++  ++  L  CQNK G GYL A P   FD  E  A    W P+YT+HKI
Sbjct: 117 DATVNADLKSRIDLIISELQACQNKNGNGYLFATPVTQFDVVEGKASGSSWVPWYTMHKI 176

Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYR 295
           ++GLLD Y    N  AL +AT +  + Y RV      +        L  E GGMND LY 
Sbjct: 177 MSGLLDVYKFEGNQTALTIATNLGNWIYKRVNA----WDSATQSKVLGVEYGGMNDCLYE 232

Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTG--DP 353
           LY +T +  HL  AH FD+      +A   + L   HANT IP  IG+  RY   G  + 
Sbjct: 233 LYKLTGNSNHLTAAHKFDETSLFNTIAAGTNVLPGKHANTTIPKFIGALNRYRTLGTTES 292

Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
            Y      F +IV   H+Y TGG S  E +    +L     + N ETC   NMLK++R L
Sbjct: 293 SYLTAAQQFWNIVLKDHTYVTGGNSEDEHFRAAGKLDAYRDNVNNETCNVNNMLKLTREL 352

Query: 414 FRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW 473
           F+ T ++ YADYYE AL N +++ Q   E G+  Y   +G G  K  S     ++F+ FW
Sbjct: 353 FKVTGDVKYADYYENALINEIMASQN-PETGMATYFKAMGTGYFKVFS-----SQFDHFW 406

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC GTG+E+F+KL DS+Y+    N   LY+  Y+SS  +W    + L Q+ +  +S    
Sbjct: 407 CCTGTGMENFTKLNDSLYYN---NGSDLYVNMYLSSILNWSEKGLSLTQQANLPLS--DK 461

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYS-NGAQASLNGQNLPLPPPGNFLSATERWSYN 592
           +  T+  +   EV     +  R P W  +   A   +NG ++ +     +L  +  W   
Sbjct: 462 VTFTINSAPSSEV----KIKFRSPSWIAAGQTATVKVNGTSINIAKVNGYLDVSRVWQAG 517

Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           D + + LP  +R   + D+     +  A  +GP +L+
Sbjct: 518 DTVELTLPTEVRVSRLTDN----PNAVAFTYGPVVLS 550


>gi|315506549|ref|YP_004085436.1| hypothetical protein ML5_5828 [Micromonospora sp. L5]
 gi|315413168|gb|ADU11285.1| protein of unknown function DUF1680 [Micromonospora sp. L5]
          Length = 917

 Score =  289 bits (739), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 190/532 (35%), Positives = 270/532 (50%), Gaps = 42/532 (7%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFVGHYLSASAQMW 177
           Q   + YL  +DV+ L+++FR    L T G A  GGW+ P    R H  GH+L+A AQ W
Sbjct: 71  QNRTVAYLRFVDVNRLLYNFRANHRLSTGGAAANGGWDAPNFPFRTHMQGHFLTAWAQAW 130

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEA--LKPVWAPYY 230
           A   + T ++K  T+V  L+ CQ   G      GYLS FP   F + EA  L     PYY
Sbjct: 131 AVLGDTTCRDKALTMVAELARCQANNGAAGFSAGYLSGFPESDFTALEARTLSNGNVPYY 190

Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            IHK LAGLLD + L  + QA    L +A W+        Q+   + S +     L  E 
Sbjct: 191 CIHKTLAGLLDVWRLIGSTQARDVLLALAGWVD-------QRTGRLTSAQMQ-AMLGTEF 242

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGMN VL  LY  T D + L +A  FD       LA  +D L+  HANT +P  IG+   
Sbjct: 243 GGMNAVLTDLYQQTGDGRWLTVAQRFDHAAVFNPLAANSDQLNGLHANTQVPKWIGAARE 302

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           Y+ TG   Y+ I      I   +H+YA GG S  E +  P  +A  L ++  E C TYNM
Sbjct: 303 YKATGVTRYRDIAANAWAITVGAHTYAIGGNSQAEHFRAPNAIAGYLRNDTCEACNTYNM 362

Query: 407 LKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKAR 460
           LK++R L++   + +AYAD+YERAL N ++  Q   +  G + Y  PL     RGV  A 
Sbjct: 363 LKLTRELWQLDPDRVAYADFYERALLNHMIGQQNPADAHGHVTYFTPLNPGGRRGVGPAW 422

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
               W T +NSFWCC GTG+E+ + L D+IYF    N   L +  ++ S   W    + +
Sbjct: 423 GGGTWSTDYNSFWCCQGTGLETNTTLADAIYFH---NGTTLTVNLFVPSVLTWSQRGITV 479

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-P 579
            Q     V        T T +    V    ++ +R+P WT  +GA  S+NG    +   P
Sbjct: 480 TQATSYPVG------DTTTLTVTGSVAGSWTMRIRIPAWT--SGASVSVNGVAAGIAATP 531

Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
           G++   T  W+  D +T++LP+ + T A  DD    A++QA+ +GP +L+G+
Sbjct: 532 GSYAVLTRAWTSGDTVTVRLPMRVTTVAANDD----AAVQAVTYGPVVLSGN 579


>gi|427384240|ref|ZP_18880745.1| hypothetical protein HMPREF9447_01778 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727501|gb|EKU90360.1| hypothetical protein HMPREF9447_01778 [Bacteroides oleiciplenus YIT
           12058]
          Length = 777

 Score =  288 bits (738), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 192/546 (35%), Positives = 283/546 (51%), Gaps = 43/546 (7%)

Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
           K   + DV L +S  L  A   N +++  LD+D L+ +FRK A+L    + Y  WE+   
Sbjct: 37  KYFGIQDVRLLESPFL-HAMNQNEQWMKELDLDRLLSNFRKNANLRPKAEPYDSWES--M 93

Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFD 217
            + GH +GH L+A +Q +A+T + T K K+  VV  L  CQ     G++   P   ++F 
Sbjct: 94  GIAGHTLGHLLTAMSQHYAATGDETFKTKIDYVVNELDSCQMNFVNGFIGGMPGGDKVFK 153

Query: 218 SFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
             +          L  +W P+Y  HK + GL D Y+LA N  A K+   + +Y  +    
Sbjct: 154 EVKKGIIRSMGFDLNGIWVPWYNEHKTMMGLNDAYLLAGNETAKKVLINLSDYLAD---- 209

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           VI   + E+    LN E GGMN+   ++Y++T D K+L  ++ F        LA   D L
Sbjct: 210 VIAPLNEEQMQTMLNCEYGGMNEAFAQVYALTGDEKYLDASYAFYHKRLQDKLAEGIDAL 269

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
              H+NT IP +IGS  +YE+TG+   + I  F  + +   HSYA GG S  E+   P +
Sbjct: 270 QGLHSNTQIPKLIGSARQYELTGNQRDEKIARFSWETIVLHHSYANGGNSMGEYLSVPDK 329

Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
           L+D LGS   ETC TYNMLK++ HL+ WT ++ Y DYYERAL N +L+ Q   E G + Y
Sbjct: 330 LSDRLGSNTCETCNTYNMLKLTGHLYEWTNDVQYLDYYERALYNHILASQH-PETGNVCY 388

Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ--- 505
            L LG G  K     G+G++ N+F CC G+G E+ SK G +IY      VPG  +I    
Sbjct: 389 FLSLGMGTHK-----GFGSRHNNFSCCMGSGFENHSKYGGTIY----SYVPGKEMININL 439

Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
           YI S   WK   + L    D        + + L  +SKQ +    ++NLR P W  +   
Sbjct: 440 YIPSVLTWKEKSLKLRMTTD--YPEHGKIVIKLEETSKQSL----TINLRRPAWA-TGDV 492

Query: 566 QASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
              +NG    +   PG+F+S   RW  ND + + LP+ L T ++    P+ A  +A+ +G
Sbjct: 493 VVRINGSKQKVGNTPGSFISLHHRWKKNDVIELILPMPLYTVSM----PDNADRRAVFYG 548

Query: 625 PYLLAG 630
           P +LAG
Sbjct: 549 PTILAG 554


>gi|302867043|ref|YP_003835680.1| hypothetical protein Micau_2566 [Micromonospora aurantiaca ATCC
           27029]
 gi|302569902|gb|ADL46104.1| protein of unknown function DUF1680 [Micromonospora aurantiaca ATCC
           27029]
          Length = 917

 Score =  288 bits (738), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 190/532 (35%), Positives = 270/532 (50%), Gaps = 42/532 (7%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFVGHYLSASAQMW 177
           Q   + YL  +DV+ L+++FR    L T G A  GGW+ P    R H  GH+L+A AQ W
Sbjct: 71  QNRTVAYLRFVDVNRLLYNFRANHRLSTGGAAANGGWDAPNFPFRTHMQGHFLTAWAQAW 130

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEA--LKPVWAPYY 230
           A   + T ++K  T+V  L+ CQ   G      GYLS FP   F + EA  L     PYY
Sbjct: 131 AVLGDTTCRDKALTMVAELARCQANNGAAGFSAGYLSGFPESDFTALEARTLSNGNVPYY 190

Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            IHK LAGLLD + L  + QA    L +A W+        Q+   + S +     L  E 
Sbjct: 191 CIHKTLAGLLDVWRLIGSTQARDVLLALAGWVD-------QRTGRLTSAQMQ-AMLGTEF 242

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGMN VL  LY  T D + L +A  FD       LA  +D L+  HANT +P  IG+   
Sbjct: 243 GGMNAVLTDLYQQTGDGRWLTVAQRFDHAAVFNPLAANSDQLNGLHANTQVPKWIGAARE 302

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           Y+ TG   Y+ I      I   +H+YA GG S  E +  P  +A  L ++  E C TYNM
Sbjct: 303 YKATGVTRYRDIAANAWAITVGAHTYAIGGNSQAEHFRAPNAIAGYLRNDTCEACNTYNM 362

Query: 407 LKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKAR 460
           LK++R L++   + +AYAD+YERAL N ++  Q   +  G + Y  PL     RGV  A 
Sbjct: 363 LKLTRELWQLDPDRVAYADFYERALLNHMIGQQNPADAHGHVTYFTPLNPGGRRGVGPAW 422

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
               W T +NSFWCC GTG+E+ + L D+IYF    N   L +  ++ S   W    + +
Sbjct: 423 GGGTWSTDYNSFWCCQGTGLETNTTLADAIYFH---NGTTLTVNLFVPSVLTWSQRGITV 479

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-P 579
            Q     V        T T +    V    ++ +R+P WT  +GA  S+NG    +   P
Sbjct: 480 TQATSYPVG------DTTTLTVTGSVAGSWTMRIRIPAWT--SGASVSVNGVAAGIAATP 531

Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
           G++   T  W+  D +T++LP+ + T A  DD    A++QA+ +GP +L+G+
Sbjct: 532 GSYAVLTRAWTSGDTVTVRLPMRVTTVAANDD----AAVQAVTYGPVVLSGN 579


>gi|189464178|ref|ZP_03012963.1| hypothetical protein BACINT_00515 [Bacteroides intestinalis DSM
           17393]
 gi|189437968|gb|EDV06953.1| hypothetical protein BACINT_00515 [Bacteroides intestinalis DSM
           17393]
          Length = 777

 Score =  288 bits (738), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 197/571 (34%), Positives = 293/571 (51%), Gaps = 47/571 (8%)

Query: 75  DEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSL 134
           D+V +A  Y+    P   D+     +  S+ DV L  S  L  A   N +++  LD+D L
Sbjct: 16  DQVGFAQNYK----PAVKDVISPKTRYFSIQDVRLLDSPFL-HAMNQNEQWMKELDLDRL 70

Query: 135 VWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVF 194
           + +FRK A+L    + YG WE+    + GH +GH L+A +Q +A+T + T K K+  VV 
Sbjct: 71  LSNFRKNANLKPKAEPYGSWES--MGIAGHTLGHLLTAMSQHYAATGDETFKAKIDYVVN 128

Query: 195 SLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVWAPYYTIHKILAGLLDQY 243
            L  CQ     G++   P   ++F   +          L  +W P+Y  HK + GL D Y
Sbjct: 129 ELDSCQMNFVNGFIGGMPGGDKVFKEVKKGIIRSMGFDLNGIWVPWYNEHKTMMGLNDAY 188

Query: 244 VLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDP 303
           +LA N  A K+   + +Y  +    VI   S E+    LN E GGMN+   ++Y++T D 
Sbjct: 189 LLAGNETAKKVLINLSDYLAD----VIAPLSEEQMQTMLNCEYGGMNEAFAQMYALTGDK 244

Query: 304 KHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFM 363
           K L  ++ F        LA   D L   H+NT IP +IGS  +YE+TG+   + I  F  
Sbjct: 245 KFLDASYAFYHKRLQDKLAEGVDVLQGLHSNTQIPKLIGSARQYELTGNHRDEEIARFSW 304

Query: 364 DIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYA 423
           + +   HSYA GG S  E+   P +L + LG+   ETC TYNMLK++ HL+ WT ++ Y 
Sbjct: 305 ETIVHHHSYANGGNSMGEYLSVPDKLNNRLGTNTCETCNTYNMLKLTAHLYEWTNDVQYL 364

Query: 424 DYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESF 483
           DYYERAL N +L+ Q   E G + Y L LG G  K     G+G++ N+F CC G+G E+ 
Sbjct: 365 DYYERALYNHILASQH-PETGNVCYFLSLGMGTHK-----GFGSRHNNFSCCMGSGFENH 418

Query: 484 SKLGDSIYFEEEGNVPG---LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
           SK G +IY      VPG   + I  YI S   WK   + L    D        +++  T 
Sbjct: 419 SKYGGAIY----SYVPGKEMMNINLYIPSVLTWKEKSLKLRMTTDYPEHGKVVIKLEET- 473

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQL 599
            SK+ +    ++NLR PVW   + A   +NG    +   PG+F+S   +W  ND + + L
Sbjct: 474 -SKEPL----TINLRRPVWAAGDVA-IRINGSKQKVESVPGSFISLHRKWKKNDVIELIL 527

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           P+ L T ++    P+    +A+ +GP +LAG
Sbjct: 528 PMPLYTVSM----PDNVDRRAVFYGPTILAG 554


>gi|33113961|gb|AAP94583.1| putative protein [Zea mays]
          Length = 786

 Score =  288 bits (738), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 134/205 (65%), Positives = 160/205 (78%)

Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSF 219
           +L GHFVGHYL A+A+MWASTHN T+  KMS +V +L +CQ K+G GYLSAFP+E F   
Sbjct: 475 QLWGHFVGHYLGATAKMWASTHNDTLNAKMSYIVNALYDCQKKMGIGYLSAFPSEFFVWV 534

Query: 220 EALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
           EA+  VWAPYYTIHKI+ GLLDQY +A N+ AL M   MV YF +RV+ VI  YS+E HW
Sbjct: 535 EAITSVWAPYYTIHKIMQGLLDQYTVAGNSVALVMVVKMVNYFSDRVKNVIQNYSIETHW 594

Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
            SLNE+TGGMNDV Y+LY+I +D KHL LA LFDKPCFLG LA Q D +S FH+NT IP+
Sbjct: 595 ESLNEKTGGMNDVFYQLYTIMNDTKHLTLAPLFDKPCFLGLLAGQDDSISGFHSNTRIPV 654

Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMD 364
            IG+QMRY+VTGDPLYK I +FFMD
Sbjct: 655 AIGAQMRYKVTGDPLYKQIASFFMD 679


>gi|332880466|ref|ZP_08448140.1| hypothetical protein HMPREF9074_03915 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|357046164|ref|ZP_09107794.1| hypothetical protein HMPREF9441_01810 [Paraprevotella clara YIT
           11840]
 gi|332681454|gb|EGJ54377.1| hypothetical protein HMPREF9074_03915 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|355531170|gb|EHH00573.1| hypothetical protein HMPREF9441_01810 [Paraprevotella clara YIT
           11840]
          Length = 641

 Score =  288 bits (737), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 182/514 (35%), Positives = 269/514 (52%), Gaps = 33/514 (6%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
           +++ +  D L+  FR TA +           K  GGWE+   ELRGH  GH LSA A M+
Sbjct: 67  WMVSIGADRLLHGFRTTAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHVLSALALMY 126

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A+T +   K K  ++V  L+E Q     GYLSA+P EL +     + VWAP+YT+HK+ +
Sbjct: 127 AATGSDVFKMKGDSLVAGLAEVQAAGTGGYLSAYPEELINRNIRGESVWAPWYTLHKLFS 186

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GL+DQY+ A NAQAL +   M ++ Y +++ +      E     +  E GG+N+  Y LY
Sbjct: 187 GLIDQYLYARNAQALDVVRKMGDWAYGKLRPL----PEEMRRKMIRNEFGGINESFYNLY 242

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
           ++T D ++  LA  F     +  L  Q D L   H NT IP V+     YE+TGD   K 
Sbjct: 243 ALTGDERYRWLAGFFYHNDVIDPLKEQRDDLGTKHTNTFIPKVLAEARNYELTGDGDSKA 302

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           +  FF   +   H++A G +S +E ++DP   +  +     ETC TYNMLK+SRHLF W 
Sbjct: 303 LSEFFWHTMIGRHTFAPGCSSDKEHYFDPDEFSKHISGYTGETCCTYNMLKLSRHLFCWE 362

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
                ADYYERAL N +L  Q+    G++ Y LPL  G  K  S     T  NSFWCC G
Sbjct: 363 ASPEVADYYERALYNHILG-QQDPATGMVSYFLPLQSGTHKVYS-----TPENSFWCCVG 416

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           +G ES +K  +SIY+  E     LY+  +I S   WK   + L Q+       +   R+T
Sbjct: 417 SGFESHAKYAESIYYRGEDC---LYVNLFIPSELAWKEKGLNLRQETR--FPEEETTRLT 471

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLT 596
           L   + + +    ++ LR P W  S      +NG+++ +   PG++++   RW   D++ 
Sbjct: 472 LALETPRRL----AVKLRYPSW--SGRPTVRVNGKSVRVKQHPGSYITLDRRWEDGDRIE 525

Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +  P+ L  E +    P+     A+L+GP +LAG
Sbjct: 526 VTYPMRLAMERM----PDNPHKGALLYGPIVLAG 555


>gi|429199615|ref|ZP_19191363.1| hypothetical protein STRIP9103_08616 [Streptomyces ipomoeae 91-03]
 gi|428664699|gb|EKX63974.1| hypothetical protein STRIP9103_08616 [Streptomyces ipomoeae 91-03]
          Length = 655

 Score =  287 bits (735), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 184/539 (34%), Positives = 273/539 (50%), Gaps = 40/539 (7%)

Query: 113 SVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLS 171
            V  R +   LEY      D ++  FR  A L T G +  GGWE     LRGH+ GH+L+
Sbjct: 6   GVFRRKRDLMLEYARSYPADRILAVFRANAGLDTRGARPPGGWETSDGNLRGHYGGHFLT 65

Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT---------GYLSAFPTE---LFDSF 219
             AQ +A T  A +K K+  +V +L+ECQ  +           G+L+A+P     L +S+
Sbjct: 66  LVAQAYADTREAALKAKLDYLVGALAECQRTLAERGNPRPSHPGFLAAYPETQFILLESY 125

Query: 220 EALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
                +WAPYYT HKI+ GLLD + LA NA+AL +A+ M ++ ++R+ + +    ++R W
Sbjct: 126 TTYPTIWAPYYTCHKIMRGLLDAHTLAGNAEALTVASKMGDWVHSRLGR-LPKAQLDRMW 184

Query: 280 -YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
              +  E GGMN+V+  LY++T   +HL  A  FD    L   A   D L   HAN HIP
Sbjct: 185 SIYIAGEYGGMNEVMADLYALTGRAEHLAAARCFDNTALLDACAEDRDILDGRHANQHIP 244

Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
              G    ++ TG+  Y      F  +V    +Y+ GGT   E +     +A TL  +N 
Sbjct: 245 QFTGYLRMFDHTGEERYADAARNFWGMVAGHRTYSLGGTGQGEMFRARDAVAATLDDKNA 304

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ---RGTEPGVMIYMLPLGRG 455
           ETC TYNMLK+SR LF    + AY D+YER LTN +L+ +   R T+   + Y + +G G
Sbjct: 305 ETCATYNMLKLSRQLFFRDPDPAYMDHYERGLTNHILASRRDARSTDGPEVTYFVGMGPG 364

Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
           V +     G         CC GTG+E+ +K  DS+YF    +   LY+  Y++S+  W  
Sbjct: 365 VVREYGNIG--------TCCGGTGMENHTKYQDSVYF-RSADGGALYVNLYLASTLRWPE 415

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
             +V+ Q  D    +      TLTF   +E G    L LR+P W  + G   ++NG    
Sbjct: 416 RGIVVEQTSD----FPAEGVRTLTF---REGGGTLDLKLRIPSWA-TEGVTVTVNGVRQR 467

Query: 576 LPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           +   PG +L+ +  W   D++ I  P  LR E   DD     ++Q++  GP LL   ++
Sbjct: 468 VEAVPGTYLTLSRSWQRGDRVAISTPYRLRIERALDD----PAVQSVFHGPVLLVARSA 522


>gi|399029634|ref|ZP_10730435.1| hypothetical protein PMI10_02273 [Flavobacterium sp. CF136]
 gi|398072450|gb|EJL63666.1| hypothetical protein PMI10_02273 [Flavobacterium sp. CF136]
          Length = 642

 Score =  286 bits (733), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 171/486 (35%), Positives = 263/486 (54%), Gaps = 28/486 (5%)

Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSA 210
           GGWE+   +LRGH  GH LS  A ++A+T     K K  ++V  L E Q  +   GYLSA
Sbjct: 102 GGWESLDCDLRGHSTGHILSGLALLYAATGEKMYKIKADSLVTGLDEVQKVLNQNGYLSA 161

Query: 211 FPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
           FP  L D   A K VWAP+YT HK+ +GL+DQY+  D+  AL++   M ++ Y +++ + 
Sbjct: 162 FPQNLIDRAIAGKSVWAPWYTQHKLFSGLMDQYLYCDSEPALEIVKGMADWAYEKLKSLT 221

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
                E     L  E GGMND  Y LY IT + K+  LA  F     L  L  + D L+ 
Sbjct: 222 N----EERKRMLRNEFGGMNDSFYALYEITAESKYKFLAEFFYHEDALDPLLNKTDNLNK 277

Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
            HANT+IP +IG    YE+ G    + I  FF + V   H++ TG  S +E +++P  L+
Sbjct: 278 KHANTYIPKLIGISRDYELEGGSKNREIPEFFWNTVVNHHTFVTGSNSDKEKFFEPDHLS 337

Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
           + L     E+C  YNMLK++RHL+    +I Y DYYE+AL N +L  Q+  + G++ Y L
Sbjct: 338 EHLSGFTGESCNVYNMLKLTRHLYGVNPQIKYVDYYEKALYNHILG-QQDPKTGMVAYFL 396

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           P+  G  K  S     T  NSFWCC G+G E+ +K G+ IY+ ++    GLY+  +I S 
Sbjct: 397 PMMPGAHKVYS-----TPENSFWCCVGSGFENQAKYGEFIYYHDK----GLYVNLFIPSE 447

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
            +WK   +++ Q+     S+      TLT S+K  V     +++R P W  + GA+  +N
Sbjct: 448 LNWKEKGIIVKQE----TSFPNVGSTTLTLSTKNPVSM--PISIRYPSW--AAGAEVKVN 499

Query: 571 GQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           G+   +   PG++++   +WS  D++ +   + ++        P+  ++ A+ +GP +LA
Sbjct: 500 GKKQIINVKPGSYITLERKWSDGDRIEVSFGIQIKLAPT----PDNPNVVAVTYGPIVLA 555

Query: 630 GHTSGE 635
           G    E
Sbjct: 556 GEMGTE 561


>gi|374983575|ref|YP_004959070.1| putative glycosylase [Streptomyces bingchenggensis BCW-1]
 gi|297154227|gb|ADI03939.1| putative glycosylase [Streptomyces bingchenggensis BCW-1]
          Length = 713

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 189/538 (35%), Positives = 273/538 (50%), Gaps = 42/538 (7%)

Query: 114 VLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSA 172
           V  R +   L Y      D ++  FR  A L T G +  GGWE     LRGH+ GH+L+ 
Sbjct: 65  VFRRKRDLMLGYARSYPADRILAVFRANAGLDTRGARPPGGWETSDGNLRGHYGGHFLTL 124

Query: 173 SAQMWASTHNATIKEKMSTVVFSLSECQNKIGT---------GYLSAFPTE---LFDSFE 220
            AQ +A T  A +K K+  +V +L ECQ  +           GYL+A+P     L +S+ 
Sbjct: 125 IAQAYADTREAALKTKLDYLVGALGECQKALADHGSPIPSHPGYLAAYPETQFILLESYT 184

Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW- 279
               +WAPYYT HKI+ GLLD + L  N QAL++A+ M ++ ++R+   +    +ER W 
Sbjct: 185 TYPTIWAPYYTCHKIMRGLLDAHTLGGNQQALQIASGMGDWVHSRLGH-LPAAQLERMWS 243

Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
             +  E GGMN+VL  LY++T   +HL  A  FD    L   A   D L   HAN HIP 
Sbjct: 244 IYIAGEYGGMNEVLADLYALTGRAEHLAAARCFDNTALLKACAENRDILEGRHANQHIPQ 303

Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE 399
             G    ++ T    Y      F  +V  S  Y+ GGT   E +     +A TL  +N E
Sbjct: 304 FTGYLRLFDHTAKQEYSSAARNFWGMVTGSRMYSLGGTGQGEMFRARGAIAATLDDKNAE 363

Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR---GTEPGVMIYMLPLGRGV 456
           TC TYNMLK++R LF    + AY DYYER LTN +L+ +R    T+   + Y + +G GV
Sbjct: 364 TCATYNMLKLTRQLFFHQPDPAYMDYYERGLTNHILASRRDAAATDSPEVTYFVGMGPGV 423

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGNVPGLYIIQYISSSFDWKS 515
            +           N+  CC GTG+E+ +K  DS+YF   +GN   LY+  Y++S+  W  
Sbjct: 424 RREFD--------NTGTCCGGTGMENHTKYQDSVYFRSADGNA--LYVNLYLASTLRWPE 473

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG-QNL 574
              V+ Q  D    +      TLTF  ++  G+L  L LR+P W  + G   ++NG +  
Sbjct: 474 RGFVIEQSSD----FPAEGVRTLTF--REGSGRL-DLRLRVPAWA-TAGFTVTVNGVRQR 525

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
               PG++LS +  W   D++ I  P SLR E   DD     ++Q++ +GP LL   +
Sbjct: 526 AEAEPGSYLSLSRDWRPGDRVRISAPNSLRIERALDD----PTVQSVFYGPVLLTAQS 579


>gi|256376951|ref|YP_003100611.1| hypothetical protein Amir_2836 [Actinosynnema mirum DSM 43827]
 gi|255921254|gb|ACU36765.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
          Length = 614

 Score =  286 bits (731), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 190/521 (36%), Positives = 265/521 (50%), Gaps = 34/521 (6%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
           R +     YL  LD D L+ +FR+   L +     GGWE+P +ELRGH  GH LSA AQ 
Sbjct: 66  RNESRTHAYLKFLDPDRLLHTFRRNVGLASGATPCGGWESPTTELRGHSTGHVLSALAQA 125

Query: 177 WASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYT 231
             ST +   K K   +V  L+ CQ++       TGYLSAFP    D  EA + VWAPYYT
Sbjct: 126 HTSTGDTAFKTKSDYLVAGLAACQDRAAAAGFNTGYLSAFPESFIDRVEARQQVWAPYYT 185

Query: 232 IHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMND 291
           +HKILAGLLD + L  +AQAL + T    +   R  ++    +  +    L  E GGMN+
Sbjct: 186 LHKILAGLLDAHQLTGSAQALTVLTRKAAWVAWRNGRL----TQAQRQAMLGTEFGGMNE 241

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTG 351
           VL  LY +T DP HL  A  FD       LA   D LS FHANT IP  +G+   Y  TG
Sbjct: 242 VLANLYQLTGDPLHLTAARYFDHAQVFDPLAAGRDALSGFHANTQIPKALGAIREYHATG 301

Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSR 411
           +  Y+ I   F + V  +H+YA GG S  E++ +P R+A  L     E C T+NMLK++R
Sbjct: 302 ETRYRDIARNFWNFVVGAHTYAIGGNSNGEYFKNPGRIASELSDSTCECCNTHNMLKLTR 361

Query: 412 HLFRWTK-EIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARSTHGWGTKF 469
            LFR         D++E+AL N +L  Q   +  G   Y +PL  G  +  S       +
Sbjct: 362 QLFRTEPGRPELFDFHEKALYNHLLGAQNPDSAHGHHSYYVPLRAGGQRTFS-----NDY 416

Query: 470 NSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVS 529
             F CC+GTG+E+ +K  DSIYF        L++  +I S+  W    + + Q  D    
Sbjct: 417 QDFTCCHGTGMETNTKHRDSIYFHGGET---LWVNLFIPSTLTWPGRGITVRQ--DTGFP 471

Query: 530 WDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERW 589
                ++T+T S + +      L LR+P W  + GA+  LNG  +    PG +      W
Sbjct: 472 DTASTKLTITGSGRVD------LRLRVPAW--ATGARLRLNGAPV-AATPGGYARIDRTW 522

Query: 590 SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +  D + + LP++L  E+  DD     + Q +  GP +LAG
Sbjct: 523 ASGDTVELTLPMALTRESAPDD----PAAQVVKHGPIVLAG 559


>gi|289773961|ref|ZP_06533339.1| secreted protein [Streptomyces lividans TK24]
 gi|289704160|gb|EFD71589.1| secreted protein [Streptomyces lividans TK24]
          Length = 854

 Score =  286 bits (731), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 202/562 (35%), Positives = 274/562 (48%), Gaps = 43/562 (7%)

Query: 91  GFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA 150
           G   PG  L+   L  V L  S  L   ++T   YL  +D D L+ +FR    LP+  + 
Sbjct: 43  GAHRPGPLLEPFPLSAVRLLDSPFLANMRRT-CAYLRFVDPDRLLHTFRLNVGLPSAAEP 101

Query: 151 YGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT----- 205
            GGWE P  +LRGH  GH LSA AQ  A T      +K   +V +L+ECQ          
Sbjct: 102 CGGWEAPDVQLRGHTTGHLLSALAQAHAGTGETAYADKARLLVSALAECQRAAPAAGFHR 161

Query: 206 GYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEY 261
           GYLSAFP  +FD  EA    WAPYYT+HKI+AGLLDQY L+ N +A    L+MA W    
Sbjct: 162 GYLSAFPESVFDQLEAGGKPWAPYYTLHKIMAGLLDQYRLSGNREAFDVLLEMAAW---- 217

Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
                +      S ER    L  E GGMNDVL RL+  T DP HL  A  FD       L
Sbjct: 218 ----TEARTAPLSRERMQSVLKVEFGGMNDVLARLHLETGDPVHLRTARRFDHDELYAPL 273

Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
           A   D L+  HANT I  V+G+   YE TGD  Y  I   F   V   HSYA GG S +E
Sbjct: 274 AAGRDELAGRHANTEIAKVVGAVPAYEATGDRRYLDIADTFWTTVVRHHSYAIGGNSNQE 333

Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQR- 439
            +  P  +A  L     E C +YNMLK+ R LFR   E   Y D+YE  L N +L+ Q  
Sbjct: 334 LFGPPDEIASRLSEVTCENCNSYNMLKLGRDLFRHDPERTEYLDHYEWTLYNQMLAEQDP 393

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGT-------KFNSFWCCYGTGIESFSKLGDSIYF 492
            +  G + Y   L  G S+     G G+        +++F C +GTG+E+ +K  D++YF
Sbjct: 394 DSAHGFVTYYTGLWAG-SRREPKGGLGSAPGSYSGDYDNFSCDHGTGLETHTKFADTVYF 452

Query: 493 EEEG-NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
              G   P L++  ++ S   W    V L Q  D + + D   R+T+T    +      +
Sbjct: 453 RTPGTRRPALHVNLFVPSEVCWDDLGVTLRQDTD-MPTGD-RTRLTVTGGEAR-----FA 505

Query: 552 LNLRMPVWTYSNGAQASL--NGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
           L +R+P W  +   +A L  NG+       PG + + T  W   D++ + LP       +
Sbjct: 506 LRIRVPGWLAAGDGRAGLTVNGRRTGGRLEPGTYTTVTRHWRTGDRVELVLPRV----PV 561

Query: 609 QDDRPEYASIQAILFGPYLLAG 630
               P+   ++A+ +GP +LAG
Sbjct: 562 WRPAPDNPQVKAVSYGPLVLAG 583


>gi|399071242|ref|ZP_10749941.1| hypothetical protein PMI01_00976 [Caulobacter sp. AP07]
 gi|398043612|gb|EJL36503.1| hypothetical protein PMI01_00976 [Caulobacter sp. AP07]
          Length = 789

 Score =  285 bits (729), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 187/532 (35%), Positives = 269/532 (50%), Gaps = 44/532 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A + N  YLL L  D  + +F   A LP  G+ YGGWE+    + GH +GHY+SA   M+
Sbjct: 53  AVEVNRAYLLRLSPDRFLHNFMTFAGLPAKGEIYGGWES--DTIAGHTLGHYVSALVVMY 110

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE-----LFDSFEALKPV------- 225
             T +   + +   +V  L+  Q K G GY+ A   +     + D  E    V       
Sbjct: 111 EQTGDVECRRRADYIVGELARAQAKRGDGYIGALQRKRKDGTVVDGEEIFAEVMKGDIRS 170

Query: 226 --------WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
                   W+P YT+HK  AGLLD +    N QAL +A  +  YF    ++V    + E+
Sbjct: 171 GGFDLNGSWSPLYTVHKTFAGLLDVHRAWGNQQALDVAVGLGGYF----ERVFAALNDEQ 226

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
               L  E GG+N+    LY+ T D + L++A        L  L  Q D L++FHANT +
Sbjct: 227 MQTLLGCEYGGLNESYAELYARTGDRRWLVVAERIYDRKVLDPLVAQQDKLANFHANTQV 286

Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
           P +IG    YE+TG P       FF + V   HSY  GG + RE++ +P  +A  +  + 
Sbjct: 287 PKLIGLGRLYELTGKPQDAAAARFFWNTVTQHHSYVIGGNADREYFAEPDTIAAHISEQT 346

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
            E C TYNMLK++R L+ W  E A  DYYERA  N V++ Q   + G   YM PL  G  
Sbjct: 347 CEHCNTYNMLKLTRQLYSWRPEGALFDYYERAHLNHVMAAQ-NPKTGGFTYMTPLLTGAD 405

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
           +  ST+    + ++FWCC GTG+ES +K G+SI++E EG    L +  YI +   WK+  
Sbjct: 406 RGYSTN----EDDAFWCCVGTGMESHAKHGESIFWEGEG---ALLVNLYIPAEAQWKARG 458

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
             L  ++D    ++P  R+TL   +K   G+  ++ LR+P W  S  A+ S+NGQ +   
Sbjct: 459 AAL--RLDTRYPFEPESRLTLAKLAKP--GRF-TIALRVPAWAGSE-AKVSVNGQVVTPE 512

Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
             G +     RW   D + I LPL LR EA   D    AS  A++ GP +LA
Sbjct: 513 MAGGYALVDRRWREGDVVAITLPLGLRLEATPGD----ASTVAVVRGPMVLA 560


>gi|21231831|ref|NP_637748.1| hypothetical protein XCC2394 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66768042|ref|YP_242804.1| hypothetical protein XC_1718 [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|21113547|gb|AAM41672.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
 gi|66573374|gb|AAY48784.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 791

 Score =  285 bits (729), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 203/612 (33%), Positives = 297/612 (48%), Gaps = 63/612 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           ++ V L  V L   S+   A  TN  YL+ L  D L+ +F   A L     AYGGWE   
Sbjct: 49  IRAVPLAQVRL-MPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWE--A 105

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
             + GH +GHYLSA A M A T +A  + + S +V  L+ CQ   G GY++ F       
Sbjct: 106 DTIAGHTLGHYLSALALMHAQTDDAQCRTRASYLVAELARCQAHAGDGYVAGFTRKNAAG 165

Query: 214 ------ELFDSFE--ALKPV-------WAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
                 E+FD  +   ++P+       WAP YT HK+ AGLLD +V  DNAQAL++A  +
Sbjct: 166 QIESGREVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVGL 225

Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
             Y    +Q V ++    +    L+ E GG+N+    L+  T D + L LA        L
Sbjct: 226 AGY----LQAVFSVLDDAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHAVL 281

Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
             L  Q D L H H+NT+IP +IG    YEVTGD        FF + V   HSY  GG  
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGNG 341

Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
            RE++  P  +A  L  +  E C++YNMLK++RHL++W  + AY DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSIARFLTEQTCEHCSSYNMLKLTRHLYQWGPQAAYFDYYERTLLNHVMA-Q 400

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
           +    G+  YM P+  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+E+    
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWEDG--- 452

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            G+ I  Y+ S     +G  +      P       LR+    ++++      +L+LR+P 
Sbjct: 453 QGVAINLYVPSRVRNAAGLDMTLHSALP-AQGSVSLRIDAAPAAQR------TLSLRVPG 505

Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
           W  +   Q  LNG  +       +L  T  W   D L + L + LR EA  DD P + S 
Sbjct: 506 WAAAPVLQ--LNGAVVDAAAVDGYLRVTRIWHPGDTLNLSLQMPLRLEATPDD-PAWVS- 561

Query: 619 QAILFGPYLLA---GHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
             +L GP +LA   G  +  W  KT        ++  + P+           +G  ++V 
Sbjct: 562 --VLRGPLVLAADLGDAATPWSGKTLALIGGDEVLQQLQPA-----------AGQGSYVY 608

Query: 676 SNSNQSITMEEF 687
           S+  Q      F
Sbjct: 609 SDGAQQWRFSPF 620


>gi|374324035|ref|YP_005077164.1| hypothetical protein HPL003_21035 [Paenibacillus terrae HPL-003]
 gi|357203044|gb|AET60941.1| hypothetical protein HPL003_21035 [Paenibacillus terrae HPL-003]
          Length = 767

 Score =  285 bits (728), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 190/569 (33%), Positives = 288/569 (50%), Gaps = 39/569 (6%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENP 157
           +KE   HDV L++ S    A    L+Y+  +D D ++++FR TA++ T G +   GW+ P
Sbjct: 191 VKEFKGHDVRLEKESEFGAAMDRFLQYVRSVDDDQMLYNFRATAAVDTKGAQPMTGWDAP 250

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI------GTGYLSAF 211
              L+GH  GHYLSA A  + +T ++ +  K+  +V  L +CQ  +      G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYNATEDSALLGKIQYMVAELGKCQTALSEQAGYGRGFLSAY 310

Query: 212 PTELFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
             E F+  E       +WAPYYT+HKI+AGLLD Y LA   +AL++   +  + +NR+ +
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALEICDKLGHWLHNRLSR 370

Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            +    + + W   +  E GGMN+VL +LY+IT    +L+ A  FD       +    D 
Sbjct: 371 -LPREQLHKMWSLYIAGEFGGMNEVLAKLYAITSHEHYLITAKYFDNEKLFLPMKENVDT 429

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L + HAN HIP VIG+   +EV G+  Y  I   F  +V   H Y+ GG    E + +P 
Sbjct: 430 LGNMHANQHIPQVIGALKLFEVAGEKAYFKIAENFWTMVTQRHIYSIGGAGETEMFREPD 489

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP-GVM 446
            +A  L  +  ETC +YNMLK+++ LF++     Y DYYE+AL N +L+ +   +  G  
Sbjct: 490 AIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEGGS 549

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
            Y +PL  G  K   TH          CC+GTG+E+  K  ++IYF +E     LY+  Y
Sbjct: 550 TYFMPLAPGSIKKFDTH-------ENTCCHGTGLENHFKYQEAIYFYDEDR---LYVNLY 599

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
           I S  DW    + L QK D       +  +        E G  ++L  R+P W  S   Q
Sbjct: 600 IPSQLDWSEQGLSLIQKRDQSSLEKAHFYI--------EGGTETTLMFRIPDWV-SEPVQ 650

Query: 567 ASLNGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
             +NG+    L     +L   + W   D++ + LP SLR  +  +D     +  ++ +GP
Sbjct: 651 VKINGEPCRDLEYEHGYLKLRKVWK-EDEIELTLPRSLRLASAPNDH----TFMSLTYGP 705

Query: 626 YLLAGHTSGEWDIKTGTARSLSALISPIP 654
           Y+LA   SGE D  + T      L   IP
Sbjct: 706 YVLAA-ISGEQDYISWTYSEQEFLEQIIP 733


>gi|374322441|ref|YP_005075570.1| hypothetical protein HPL003_12970 [Paenibacillus terrae HPL-003]
 gi|357201450|gb|AET59347.1| hypothetical protein HPL003_12970 [Paenibacillus terrae HPL-003]
          Length = 774

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 194/530 (36%), Positives = 263/530 (49%), Gaps = 46/530 (8%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
           +A + N  YLL L  D L+  FR+ A L T    Y GWE     + GH +GHYLSA + M
Sbjct: 28  QAMELNRSYLLELQPDRLLARFREYAGLSTKAPQYEGWE--AMSISGHTLGHYLSACSMM 85

Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPV 225
           +AST +   KE    +   L  CQ   G GY+S  P   ELF+   A         L   
Sbjct: 86  YASTGDNRFKEIAHYITDELDVCQEAHGDGYVSGIPGGKELFEEVSAGNIRSKGFDLNGA 145

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYS 281
           WAP YT+HK+ AGL D Y L    +AL    K+A W+          ++T  S E+    
Sbjct: 146 WAPLYTLHKLFAGLRDAYHLTGCNKALLVERKLADWL--------GGILTPMSDEQMQQM 197

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
           +  E GGMN+VL  LY+ T +  +L LA  F     L  L+ Q D L   HANT IP +I
Sbjct: 198 MFCEYGGMNEVLADLYADTGEESYLRLAECFWHKLVLDPLSSQEDCLQGIHANTQIPKLI 257

Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
           G    YE+T D   +    FF D V   HSY  GG S  E++  P  L D +G    ETC
Sbjct: 258 GLAKEYELTNDTKRRATVEFFWDRVVDHHSYVIGGNSFGEYFGAPGGLNDRIGPHTTETC 317

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
            TYNMLK++ HLF+W      AD+YER L N +L+ Q     GV  Y L L  G  K   
Sbjct: 318 NTYNMLKLTSHLFQWNVSAKEADFYERGLFNHILASQDPVHGGV-TYFLSLAMGGHKH-- 374

Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
              + +KF+ F CC GTG+E+ +  G  IYF +      LY+ Q+I+S+ +WK   V L 
Sbjct: 375 ---FESKFDDFTCCVGTGMENHASYGSGIYFHDHDK---LYVNQFIASTLEWKDTGVTLK 428

Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPG 580
           Q      S+      TL     Q    +  L +R P W    G    +NG+    +  PG
Sbjct: 429 QS----TSYPDTDHTTLEIQCDQPAKFM--LLVRYPYWA-EKGITIRVNGKEQSVVSEPG 481

Query: 581 NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +F+S    W   D + + +P+SLR E + D+ P+ A   A+++GP +LAG
Sbjct: 482 SFVSIARTWIDGDVVEVTIPMSLRLEQMPDN-PDRA---AVMYGPLVLAG 527


>gi|436837799|ref|YP_007323015.1| hypothetical protein FAES_4423 [Fibrella aestuarina BUZ 2]
 gi|384069212|emb|CCH02422.1| hypothetical protein FAES_4423 [Fibrella aestuarina BUZ 2]
          Length = 781

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 187/534 (35%), Positives = 272/534 (50%), Gaps = 56/534 (10%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A + +  +LL L  D L+  FR  A L      YGGWE+  S L GH +GHYLSA A  +
Sbjct: 58  AMEADTRFLLNLQPDRLLAQFRAHAGLAPKAAKYGGWES--SGLAGHSLGHYLSALALQY 115

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-------------LKP 224
           A+T++    ++++ +V  L++CQ    TGY+ A P E  D+  A             L  
Sbjct: 116 AATNDPEYLKRVNYIVDELADCQRARKTGYVGAIPRE--DTVFAEVAQGNIRSRGFDLNG 173

Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALK----MATWMVEYFYN----RVQKVITMYSVE 276
            W+P+YT+HK++AGLLD Y+ A N +AL     MA W  E   N    +VQK++      
Sbjct: 174 AWSPWYTVHKVMAGLLDAYLYAHNDKALAVTVGMADWTGETLKNLTDEQVQKMLLC---- 229

Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
                   E GGMNDVL  +Y++T + K+L L++ F     L  LA Q D L   HANT 
Sbjct: 230 --------EYGGMNDVLANIYALTGNKKYLDLSYKFHDRVVLDSLAHQKDILPGRHANTQ 281

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
           +P +IG+  RYE+TG      +  FF   V   H+YA GG S  E+   P +L D L   
Sbjct: 282 VPKLIGTIRRYELTGSQPDLAMSDFFWKTVVNHHTYAPGGNSNYEYLSTPDQLTDKLTDN 341

Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
             ETC T+NMLK++RHLF      AY DYYERAL N +L+ Q   + G++ Y +PL  G 
Sbjct: 342 TMETCNTHNMLKLTRHLFALQPNAAYMDYYERALYNHILASQH-HKTGMVCYFVPLRMGT 400

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
            K      +  +   F CC GTG+E+  K G+SI+F  +G    L++  +I S  +W   
Sbjct: 401 RKH-----FSDEEEDFTCCVGTGMENHVKYGESIFF--KGADQSLFVNLFIPSELNWAEK 453

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
            + L    +  +  DP +R+T+      ++     + LR P W  +   Q  +NG+    
Sbjct: 454 GLRLTLNAN--LPADPTVRLTVQADKPTKL----PIRLRKPYW-LAGPMQVRVNGKAATS 506

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
                ++   +RW   D + + LP SLR   +    P+  + QA  +GP LLAG
Sbjct: 507 TVQDGYVVIDQRWKTGDVVELTLPASLRAMPM----PDNIARQAFFYGPVLLAG 556


>gi|298246853|ref|ZP_06970658.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
           44963]
 gi|297549512|gb|EFH83378.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
           44963]
          Length = 600

 Score =  282 bits (722), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 182/537 (33%), Positives = 279/537 (51%), Gaps = 36/537 (6%)

Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL----PTPGKAYGGWENPISELRGHFV 166
           Q   L +  + N  Y+L L   +L+ +    A L      P   + GWE+P  +LRGHF+
Sbjct: 16  QPGPLKKRAELNRAYMLSLKSTNLLQNHYGEAGLWNPPQQPTDCHRGWESPTCQLRGHFL 75

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVW 226
           GH+LSA+A++ AST +  IK K   +V  L+ CQ ++   ++ + P +  D     K VW
Sbjct: 76  GHWLSAAARLVASTGDTEIKGKADFIVAELARCQQEMEGEWIGSIPEKYLDWIARGKRVW 135

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
           AP+YT+HK L GL D Y +  N QAL +     ++F+    +    +S E+    L+ ET
Sbjct: 136 APHYTLHKTLMGLYDMYEIGQNEQALDILIHWADWFH----RWTGQFSREQMDDILDVET 191

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGM +V   LY +T+  +HL L   +D+      L    D L++ HANT IP V G+   
Sbjct: 192 GGMLEVWANLYGVTNRQEHLDLIRRYDRSRLFDRLLAGEDVLTYMHANTTIPEVHGAARA 251

Query: 347 YEVTGDPLYK-LIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
           +EVTG+  ++ ++  ++   V     + TGG ++ E W  P +L   LG EN+E CT YN
Sbjct: 252 WEVTGEQRWRDIVEAYWRLAVTDRGYFCTGGQTSDEVWCPPHQLGGQLGPENQEHCTVYN 311

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
           +++++ +LFRWT ++ YADYYER   NG+L+ Q+  + G++ Y LPL  G +K      W
Sbjct: 312 LMRLANYLFRWTGDVVYADYYERNFYNGILA-QQNAQTGMVAYYLPLETGGTKV-----W 365

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH----VVLN 521
           GT  N FWCC+GT +++ +     IYF    N  GL + QYI S   W        V L 
Sbjct: 366 GTPTNDFWCCHGTLVQAQASHTRDIYFT---NDEGLVVSQYIPSRLQWHHDGSEVIVTLE 422

Query: 522 QKVDPIVSWDP---YLRMT----LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
            K   + +        R T     T S   E     +L LR+P W  ++    ++NG+  
Sbjct: 423 SKAHNVYALKAPREQPRQTSHPEYTLSVNCEQPTEYTLTLRLPWWL-ADEPMITINGERQ 481

Query: 575 PLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
            +P  P ++      W +NDKLTI LP +L+   +    P  + + A + GP +LAG
Sbjct: 482 RVPHTPSSYYHIRRTW-HNDKLTILLPKALQIVPL----PGASDMMAFMDGPIVLAG 533


>gi|21218915|ref|NP_624694.1| hypothetical protein SCO0371 [Streptomyces coelicolor A3(2)]
 gi|5881940|emb|CAB55733.1| putative secreted protein [Streptomyces coelicolor A3(2)]
          Length = 869

 Score =  282 bits (721), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 201/562 (35%), Positives = 273/562 (48%), Gaps = 43/562 (7%)

Query: 91  GFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA 150
           G   PG  L+   L  V L  S  L   ++T   YL  +D D L+ +FR    LP+  + 
Sbjct: 58  GAHRPGPLLEPFPLSAVRLLDSPFLANMRRT-CAYLRFVDPDRLLHTFRLNVGLPSAAEP 116

Query: 151 YGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT----- 205
            GGWE P  +LRGH  GH LSA AQ  A T      +K   +V +L+ECQ          
Sbjct: 117 CGGWEAPDVQLRGHTTGHLLSALAQAHAGTGETAYADKARLLVSALAECQRAAPAAGFHR 176

Query: 206 GYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEY 261
           GYLSAFP  +FD  EA    WAPYYT+HKI+AGLLDQY L+ N +A    L+MA W    
Sbjct: 177 GYLSAFPESVFDQLEAGGKPWAPYYTLHKIMAGLLDQYRLSGNREAFDVLLEMAAW---- 232

Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
                +      S ER    L  E GGMNDVL RL+  T DP HL  A  FD       L
Sbjct: 233 ----TEARTAPLSRERMQSVLKVEFGGMNDVLARLHLETGDPVHLRTARRFDHDELYAPL 288

Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
           A   D L+  HANT I  V+G+   YE TGD  Y  I   F   V   HSYA GG S +E
Sbjct: 289 AAGRDELAGRHANTEIAKVVGAVPAYEATGDRRYLDIADTFWTTVVRHHSYAIGGNSNQE 348

Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQR- 439
            +  P  +A  L     E C +YNMLK+ R LFR   E   Y D+YE  L N +L+ Q  
Sbjct: 349 LFGPPDEIASRLSEVTCENCNSYNMLKLGRDLFRHDPERTEYLDHYEWTLYNQMLAEQDP 408

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGT-------KFNSFWCCYGTGIESFSKLGDSIYF 492
            +  G + Y   L  G S+     G G+        +++F C +GTG+E+ +K  D++YF
Sbjct: 409 DSAHGFVTYYTGLWAG-SRREPKGGLGSAPGSYSGDYDNFSCDHGTGLETHTKFADTVYF 467

Query: 493 EEEG-NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
              G   P L++  ++ S   W    V L Q  D + + D   R+T+T    +      +
Sbjct: 468 RTPGTRRPALHVNLFVPSEVCWDDLGVTLRQDTD-MPTGD-RTRLTVTGGEAR-----FA 520

Query: 552 LNLRMPVWTYSNGAQASL--NGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
           L +R+  W  +   +A L  NG+       PG + + T  W   D++ + LP       +
Sbjct: 521 LRIRVAGWLAAGDGRAGLTVNGRRTGGRLEPGTYTTVTRHWRTGDRVELVLPRV----PV 576

Query: 609 QDDRPEYASIQAILFGPYLLAG 630
               P+   ++A+ +GP +LAG
Sbjct: 577 WRPAPDNPQVKAVSYGPLVLAG 598


>gi|188991168|ref|YP_001903178.1| hypothetical protein xccb100_1772 [Xanthomonas campestris pv.
           campestris str. B100]
 gi|167732928|emb|CAP51124.1| Putative secreted protein [Xanthomonas campestris pv. campestris]
          Length = 791

 Score =  282 bits (721), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 201/612 (32%), Positives = 296/612 (48%), Gaps = 63/612 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGWE   
Sbjct: 49  IRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWE--A 105

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE---- 214
             + GH +GHYLSA A M A T +A  + + S +V  L+ CQ  +G GY++ F  +    
Sbjct: 106 DTIAGHTLGHYLSALALMHAQTDDAHCRTRASYLVAELARCQAHVGDGYVAGFTRKNAAG 165

Query: 215 -------LFDSFE--ALKPV-------WAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
                  +FD  +   ++P+       WAP YT HK+ AGLLD +V  DNAQAL++A  +
Sbjct: 166 QIESGRAVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVAL 225

Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
             Y    +Q +       +    L+ E GG+N+    L+  T D + L LA        L
Sbjct: 226 AGY----LQGIFAALDDTQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHTVL 281

Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
             L  Q D L H H+NT+IP +IG    YEVTGD        FF + V   HSY  GG  
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGNG 341

Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
            RE++  P  ++  L  +  E C++YNMLK++RHL++W  + AY DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCSSYNMLKLTRHLYQWGPQAAYFDYYERTLLNHVMA-Q 400

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
           +    G+  YM P+  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+E+    
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWEDG--- 452

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            G+ I  Y+ S     +G  +      P       LR+    ++++      +L+LR+P 
Sbjct: 453 QGVAINLYVPSRVRNAAGLDMTLHSALP-AQGSVSLRIDAAPAAQR------TLSLRVPG 505

Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
           W  +   Q  LNG  +       +L  T  W   D L + L + LR EA  DD P + S 
Sbjct: 506 WAAAPVLQ--LNGAVVDAAAVDGYLRVTRTWHPGDTLNLSLQMPLRLEATPDD-PAWVS- 561

Query: 619 QAILFGPYLLA---GHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
             +L GP +LA   G  +  W  KT        ++  + P+           +G  ++V 
Sbjct: 562 --VLRGPLVLAADLGDAATPWSGKTPALIGGDEVLQQLQPA-----------AGQGSYVY 608

Query: 676 SNSNQSITMEEF 687
           S+  Q      F
Sbjct: 609 SDGAQQWRFSPF 620


>gi|333381736|ref|ZP_08473415.1| hypothetical protein HMPREF9455_01581 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332829665|gb|EGK02311.1| hypothetical protein HMPREF9455_01581 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 775

 Score =  282 bits (721), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 194/555 (34%), Positives = 285/555 (51%), Gaps = 49/555 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           LK  SL DV L  SS    A   + ++LL  + D  +  FR  + L      YGGWE+  
Sbjct: 35  LKPFSLSDVRL-TSSPFMSAMSLDEKWLLSFEPDRFLSGFRSESGLQPKAPKYGGWES-- 91

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSAFP----- 212
             + G   GHYLSA + M+AST N  + +++   +  L  CQ   G  G ++AFP     
Sbjct: 92  QGVAGQTFGHYLSALSMMYASTGNEQLNDRIKYSINELDSCQQAFGMNGIVAAFPRAKGL 151

Query: 213 ----------TELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYF 262
                     TE FD    L   W P Y++HK+ AGL+D Y    N QA K+   + +  
Sbjct: 152 FTEISTGDIRTEGFD----LNGGWVPLYSMHKLFAGLIDVYEYTGNKQAYKIYINLAD-- 205

Query: 263 YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLA 322
              V K+++  S E+    L  E GG+N+ L  +Y++T + K+L LA   +    L  L+
Sbjct: 206 --GVDKMLSGLSDEQIQKILICEHGGINESLAEVYALTGNKKYLNLATRLNHKAVLDPLS 263

Query: 323 LQADYLSHFHANTHIPIVIGSQMRYEVTG-DPLYKLIGTFFMDIVNASHSYATGGTSARE 381
              D L+  HANT IP VIG    YE+TG D L+K    FF + V  SHSY  GG S  E
Sbjct: 264 KGVDELAGKHANTQIPKVIGVIREYELTGNDDLFK-TAEFFWNTVVHSHSYVIGGNSEAE 322

Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
            +    R  D +  +  E C TYNMLK+++HLF    +I  ADYYERAL N +L+ Q   
Sbjct: 323 HFGVAGRTYDRITDKTCENCNTYNMLKLTKHLFSLQPDIQKADYYERALYNQILASQ-NP 381

Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
           + G++ YM PL  G     S  G+ T F+SFWCC GTG+E+ ++ G+ IYF ++     L
Sbjct: 382 QDGMVCYMSPLAAG-----SRRGFSTPFDSFWCCVGTGLENHARYGEFIYFSDKDK--NL 434

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
           +I  +I S  DWK  ++V+ Q  +   S       T+ +  K +  Q  ++N+R P+W  
Sbjct: 435 FINLFIPSKLDWKDRNMVIEQITNFPES------DTVRYKIKAKKTQEFTVNIRYPLWA- 487

Query: 562 SNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
            +G    +NG+ + +   PGN++  T +W  ND +   LP  L +EA   D     +++A
Sbjct: 488 QDGFSLFVNGKRVEINSSPGNYIQLTRKWKNNDDICYVLPKRLLSEAALGD----TNLRA 543

Query: 621 ILFGPYLLAGHTSGE 635
            L+GP +L+     E
Sbjct: 544 YLYGPIVLSAVLDNE 558


>gi|330997549|ref|ZP_08321396.1| hypothetical protein HMPREF9442_02496 [Paraprevotella xylaniphila
           YIT 11841]
 gi|329570407|gb|EGG52138.1| hypothetical protein HMPREF9442_02496 [Paraprevotella xylaniphila
           YIT 11841]
          Length = 622

 Score =  282 bits (721), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 182/549 (33%), Positives = 273/549 (49%), Gaps = 48/549 (8%)

Query: 94  LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG----- 148
           LPG F   +    VW+                 + + VD L+  FR TA +         
Sbjct: 37  LPGRFRDNMMRDSVWM-----------------VSIGVDRLLHGFRTTAGIFAGREGGYM 79

Query: 149 --KAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTG 206
             K  GGWE+   ELRGH  GH+LSA + M+A+T +   K K  ++V  L+E Q  +G G
Sbjct: 80  TVKKLGGWESLDCELRGHTTGHFLSALSLMYAATGSEVFKLKGDSLVAGLAEVQVALGNG 139

Query: 207 YLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRV 266
           YLSAFP EL +       VWAP+YT+HKI +GL+DQY+ A N QAL++   M ++ Y ++
Sbjct: 140 YLSAFPEELINRNIRATSVWAPWYTLHKIFSGLIDQYLYAGNTQALEVVRKMGDWAYAKL 199

Query: 267 QKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
           + +    S E     +  E GG+N+  Y LY++T D ++  LA  F     +  L  Q D
Sbjct: 200 KPL----SEETRRKMIRNEFGGVNESFYNLYALTGDERYKWLAGFFYHNEVIDPLKAQKD 255

Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
            L   H NT IP V+     YE+TGD   K +  FF   +   H++A G +S +E ++  
Sbjct: 256 DLGTKHTNTFIPKVLAEARNYELTGDADSKALSEFFWHTMIDRHTFAPGCSSDKEHYFPT 315

Query: 387 KRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
            +    +     ETC TYNMLK+SRHLF W      ADYYERAL N +L  Q+    G++
Sbjct: 316 DKFTAHISGYTGETCCTYNMLKLSRHLFCWDASPEVADYYERALYNHILG-QQDPASGMV 374

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
            Y LPL  G  +  S     T  NSFWCC G+G E+ +K  ++IY+ +     G+++  +
Sbjct: 375 AYFLPLQTGTHRVYS-----TPENSFWCCVGSGFENHAKYAEAIYYHDRD---GIFVNLF 426

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
           I S   W+   +VL Q  D     +  +  T+     +++    ++ LR P W+ S  + 
Sbjct: 427 IPSEVKWREKGLVLRQ--DTRFPEEGKVTFTVGLDEPKQL----TVRLRYPSWS-SEVSV 479

Query: 567 ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
                +      PG+++  + RW   D++     + LR E      P+     A+L+GP 
Sbjct: 480 KVNGKKVKVRQKPGSYILLSRRWKDGDRIEADYAMGLRLERT----PDGTERGALLYGPV 535

Query: 627 LLAGHTSGE 635
           +LAG    E
Sbjct: 536 VLAGELGTE 544


>gi|256377207|ref|YP_003100867.1| hypothetical protein Amir_3107 [Actinosynnema mirum DSM 43827]
 gi|255921510|gb|ACU37021.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
          Length = 771

 Score =  281 bits (720), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 202/586 (34%), Positives = 287/586 (48%), Gaps = 60/586 (10%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPISELRGHFV 166
           W+D        Q   L YL  +D D L+++FR    L T G A   GWE P    R H  
Sbjct: 62  WMDN-------QNRALSYLRFVDPDRLLYNFRANHRLSTAGAAPLAGWEAPDFPFRTHSQ 114

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
           GH+L+A AQ WA   + T +++ + +V  L++CQ          GYLS FP    D+ EA
Sbjct: 115 GHFLTAWAQAWAVLGDTTSRDRANHLVAELAKCQANNAAAGFTAGYLSGFPESDLDALEA 174

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVER 277
             P    YY +HK LAGLLD +    + QA    L+ A W V++   R+ +  TM  V  
Sbjct: 175 GTPKAVSYYALHKTLAGLLDVWRHLGSTQARDVLLRFAGW-VDWRTARLSQA-TMQRV-- 230

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
               L  E GGMN VL  LY  T D + L  A  FD       LA   D L+  HANT +
Sbjct: 231 ----LATEFGGMNAVLADLYQQTGDARWLATAQRFDHAAAFDPLAANQDRLNGLHANTQV 286

Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
           P  IG+   Y+ TG   Y+ I T   +I  A+H+Y  GG S  E +  P  +A  L ++ 
Sbjct: 287 PKWIGAAREYKATGTTRYRDIATNAWNITVAAHTYVIGGNSQAEHFRAPNAIAAHLATDT 346

Query: 398 EETCTTYNMLKVSRHLFRWTKE---IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG 453
            E C TYNMLK++R L  W  E    AY D+YERAL N ++  Q   +  G + Y   L 
Sbjct: 347 AEACNTYNMLKLTREL--WLLEPTKAAYFDFYERALLNHLIGQQNPADAHGHICYFTGLN 404

Query: 454 RGVSKARSTHGWG-----TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
            G  + R+   WG     T +++FWCC GTGIE+ +KL DSIYF +      L +  Y  
Sbjct: 405 PGHRRGRTGPAWGGGTWSTDYSTFWCCQGTGIETNTKLADSIYFRDGTT---LTVNLYTP 461

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
           S+  W    + + Q      S    L +T + S         ++ LR+P WT  +GA  +
Sbjct: 462 STLTWSERGITVTQSTTYPASDTTTLTVTGSASGSW------TMRLRIPAWT--SGATVA 513

Query: 569 LNG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
           +NG  QN+    PG++ S T  W+ +D +T++LP+ + T       P+  ++ A+ +GP 
Sbjct: 514 VNGTPQNV-AAAPGSYASLTRSWTSDDTVTLRLPMRVTTAPA----PDNPNVVAVTYGPV 568

Query: 627 LLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNST 672
           +LAG      +  + T  +L AL        +   +TFT  SG ST
Sbjct: 569 VLAG------NFGSTTLSALPALDVASITRTSTTALTFTARSGGST 608


>gi|329849035|ref|ZP_08264063.1| hypothetical protein ABI_21080 [Asticcacaulis biprosthecum C19]
 gi|328844098|gb|EGF93667.1| hypothetical protein ABI_21080 [Asticcacaulis biprosthecum C19]
          Length = 773

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 183/536 (34%), Positives = 264/536 (49%), Gaps = 44/536 (8%)

Query: 115 LWR-AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSAS 173
           +WR A   N  YLL L+ D L+ +F K+A L   G  YGGWEN    + GH +GHYL+A 
Sbjct: 44  VWRDAVDANGHYLLSLEPDRLLHNFHKSAGLAPKGDIYGGWEN--MGIAGHSLGHYLTAL 101

Query: 174 AQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV-------- 225
              +A T +   K K+   V  ++  Q   G GY+     E     +  K V        
Sbjct: 102 GLAYAQTRDPAYKAKLDYTVSEMAIIQKAHGDGYIGGTTVERDGKLQDGKIVYEEVRKHV 161

Query: 226 -----------WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYS 274
                      W P YT HK+ AGLLD +  A+N QALK+A  M +Y       V+   S
Sbjct: 162 ITSHGFDLNGGWVPLYTWHKVHAGLLDAHRYANNGQALKIAIGMSDYLIG----VLGDLS 217

Query: 275 VERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHAN 334
            E     L  E GG+N+    +Y  T D ++L  A        L  LA + D L   HAN
Sbjct: 218 DEEMQKVLAAEHGGLNETYAEMYVRTGDKRYLDTARRIYHKAVLTPLAQRRDELEGKHAN 277

Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
           T IP +IG    YEVTGD  Y    ++F D V   HSY  GG SA E +  P +L+  L 
Sbjct: 278 TQIPKLIGLARLYEVTGDKAYGDTASYFWDRVIHHHSYVIGGNSAGEHFGAPDKLSGRLD 337

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
            +  E+C TYNMLK++RHL++W  + A+ DYYERA  N +L+ Q   + G  +Y +PL  
Sbjct: 338 DKTCESCNTYNMLKLTRHLYQWQPDAAWFDYYERAHLNHILAHQ-DPQTGAFVYFVPLAS 396

Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
           G  +  S     T   SFWCC G+G+ES +K GDSI++ + G    +Y   +I S   W 
Sbjct: 397 GSQRLYS-----TPDTSFWCCVGSGMESHAKHGDSIWWRQAGGGDTVYANLFIPSELSWT 451

Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
                +    D I+  +P     +TF+   +     +L +R+P W  ++G + S+NG+N 
Sbjct: 452 DKATKIALSGD-ILKGEP-----VTFTVTPQGTADFTLAIRVPKW--ADGPRLSVNGKNT 503

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           PL     ++     W   D + + LP +L+ E +    P+   + A + GP ++AG
Sbjct: 504 PLLVKNGYVRVRRAWKAGDTVVLTLPHALKVETM----PDNPRLAAFIKGPMVMAG 555


>gi|294667526|ref|ZP_06732741.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292602646|gb|EFF46082.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 791

 Score =  281 bits (718), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 205/627 (32%), Positives = 294/627 (46%), Gaps = 86/627 (13%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A QTN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRLTPSLFL-DALQTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   +NAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCENAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q V       +   +L+ E GG+N+    L+  T D + L LA     
Sbjct: 222 AVALAGY----LQGVFAALDDAQLQKALSCEFGGLNESFVELHVQTGDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L+H H+NT+IP +IG    YEVTGDP       FF   V   H+Y  
Sbjct: 278 HAVLDPLIAQRDALAHQHSNTNIPKLIGLAREYEVTGDPASGAAARFFWHTVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RHL++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSG-----HVVLNQKVDPIVSWD---PYLRMTLTFSSKQEV 546
                G+YI  Y+ S+    +G     H  L ++    +  D   P  RM          
Sbjct: 452 G---QGVYINLYVPSTVRDAAGLNMTLHSALPEQGSASLRIDGAPPAQRM---------- 498

Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
                L LR+P W  +   +  LNGQ +       +L  T  W   D L +   + LR E
Sbjct: 499 -----LALRVPGW--AQQPRLRLNGQPVDGSASDGYLRLTRVWQPGDTLQLSFDMPLRLE 551

Query: 607 AIQDDRPEYASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQ 660
           A  DD P + S   +L GP +LA   G  +  W  KT T    + +   + P+P      
Sbjct: 552 ATPDD-PAWVS---VLHGPLVLAVDLGDAAKPWSGKTPTLIGGQDILQRLQPVP------ 601

Query: 661 LVTFTQESGNSTFVMSNSNQSITMEEF 687
                   G + F  S+  Q   +  F
Sbjct: 602 --------GKTAFTYSDGAQQWQLSPF 620


>gi|371778346|ref|ZP_09484668.1| hypothetical protein AnHS1_13085 [Anaerophaga sp. HS1]
          Length = 796

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 188/546 (34%), Positives = 275/546 (50%), Gaps = 46/546 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L+EV L D           A + N + LL  + D L+  FR+ A L    + YGGWE   
Sbjct: 50  LEEVELLD------GPFLEASKLNEKILLNYEPDRLLAHFREQAHLKPKAQHYGGWEG-- 101

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELF 216
             L GH +GHYLSA + M+ +T N    ++++ +V  L   Q   G GYL AF    ++F
Sbjct: 102 ESLTGHSLGHYLSACSMMYKTTGNEEFLKRVNYIVNELDTVQKAHGDGYLGAFDNGKKIF 161

Query: 217 DSFEA----------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRV 266
           +   A          L  +WAP YT HKI+AGL+D Y L  N +AL++     + F + +
Sbjct: 162 EEEIANGNIRSAGFDLNGIWAPIYTQHKIMAGLMDAYKLCGNKKALEVE----QKFADWL 217

Query: 267 QKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
             ++   S E     L+ E GG+N+    L+++T + ++L +A LF     L  LA   D
Sbjct: 218 GSIVENLSHEEIQKMLHCEHGGINEAYAELFAVTGNERYLKIARLFHHEAVLDPLAKGID 277

Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
            L   HANT IP +IG    YE+TGD   +    FF + V   HSY TGG    E++  P
Sbjct: 278 ILPGHHANTQIPKIIGLSRLYELTGDTTDRKTAQFFWERVVYHHSYVTGGNGDHEYFGPP 337

Query: 387 KRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
             L++ L S   ETC  YNMLK+S HLF+W  E   ADYYERAL N +LS Q   + G +
Sbjct: 338 DTLSNRLSSNTTETCNVYNMLKLSNHLFKWEAEAEVADYYERALFNHILSSQH-PQSGHV 396

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
           IY L L  G  K      +   F  F CC GTG+E+ +K   +IYF    N   L++ Q+
Sbjct: 397 IYNLSLEMGGHKH-----YQNPF-GFTCCVGTGMENHAKYPKNIYFH---NDRELFVSQF 447

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
           I+S  +WK   + L Q       +    + +  F  ++ V  +  L +R P W    G  
Sbjct: 448 IASRLNWKEKGLKLTQN----TRYPDEQKTSFIFECEKPVDLI--LQIRYPYWA-EKGMI 500

Query: 567 ASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
            ++NG+ +     P +F++    W   DK+ +  P SLR EA+ D++       A+++GP
Sbjct: 501 VTVNGKKVSYSQKPQSFVAIHREWKTGDKVEVSFPFSLRLEAMPDNKDRV----ALMYGP 556

Query: 626 YLLAGH 631
            +LAG 
Sbjct: 557 LVLAGQ 562


>gi|381203003|ref|ZP_09910112.1| hypothetical protein SyanX_20925 [Sphingobium yanoikuyae XLDN2-5]
          Length = 790

 Score =  280 bits (716), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 179/533 (33%), Positives = 266/533 (49%), Gaps = 44/533 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A + N  YLL L+ D L+ +FRK A L   G  YGGWEN    + GH +GHYL+A A M 
Sbjct: 51  AVEGNRRYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWEN--DTIAGHTLGHYLTALALMH 108

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------------- 221
           A T +A    + + ++  L+ECQ   G GY++ F     D  E                 
Sbjct: 109 AQTGDAECARRAAYIIAELAECQAAAGDGYVAGFTRRRDDVIEDGRLIFPEIMRGDIRSA 168

Query: 222 ---LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
              L   W P+Y  HK+ AGL D      N+QA  +A  +  Y    +  V       + 
Sbjct: 169 GFDLNGCWVPFYNWHKLFAGLFDAESHLGNSQARGVALALAAY----IDGVFAKLDDAQV 224

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
              L+ E GG+N+    L++ T DP+ L LA        L  LA + + L   HANT IP
Sbjct: 225 QQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDPLAQRQNSLPWIHANTQIP 284

Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
            +IG    +E+TG+    +   FF + V   +SY  GG + RE++ DP  ++  +  +  
Sbjct: 285 KLIGLARLHEITGNAADAIAANFFWETVVGQYSYVIGGNADREYFPDPGTISKHITEQTC 344

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +YNMLK++RHL+ W  E    DYYERA  N +L+ Q     G+  YM+PL  G  +
Sbjct: 345 ESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQ-NPATGMFAYMVPLMSGSHR 403

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ-YISSSFDWKSGH 517
                 W   F+ FWCC G+G+ES +K G+SI++E+      + I   YI S  DW +  
Sbjct: 404 V-----WSEPFDDFWCCVGSGMESHAKHGESIWWEDADRPADMLIANLYIPSEADWAARG 458

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
             L  +++    +D ++   L+       G+  +L LR+P W    GA+ ++NG  LP P
Sbjct: 459 AKL--RIESGYPFDGHI--ALSIPKLARAGRF-TLALRIPGWC--QGARVAVNGTPLPAP 511

Query: 578 PPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
              +  +  +R W   D++T+ LP++LR EA  DD    A   A+L GP +LA
Sbjct: 512 RIADGYALIDRKWKAGDQVTLDLPMALRIEATPDD----ARTIALLHGPVVLA 560


>gi|374321589|ref|YP_005074718.1| hypothetical protein HPL003_08660 [Paenibacillus terrae HPL-003]
 gi|357200598|gb|AET58495.1| hypothetical protein HPL003_08660 [Paenibacillus terrae HPL-003]
          Length = 755

 Score =  280 bits (715), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 205/598 (34%), Positives = 305/598 (51%), Gaps = 58/598 (9%)

Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
           K   LH V +D S  L+ A + N  YLL L+ D L+  FR+ A L      Y GWE    
Sbjct: 6   KAFDLHKVSID-SGPLYHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--AR 62

Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFD 217
            + GH +GHYLS  A M+AST +  + E+++ VV  L  CQN  G GY+S  P   ELF+
Sbjct: 63  GISGHTLGHYLSGCALMFASTGDERLLERVNYVVNELEICQNNHGNGYISGIPRGKELFE 122

Query: 218 SFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFY- 263
             +A         L   W P YT+HK+ AGL D ++LA + +AL+M      W+ + F  
Sbjct: 123 EVKAGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLARHPKALQMEIKLGDWLEDVFKG 182

Query: 264 ---NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF 320
              ++VQ+V            L+ E GGMN+VL  L   + + + L LA  F     L  
Sbjct: 183 LNDDQVQQV------------LHCEFGGMNEVLTDLAEHSGEERFLRLAERFYHGEVLND 230

Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR 380
           LA   D L+  HANT IP +IG+  +YE+TG P Y  +  FF + V   HSY  GG S  
Sbjct: 231 LADSRDTLAGRHANTQIPKIIGAARQYEMTGKPQYADLSRFFWERVVHKHSYVIGGNSYN 290

Query: 381 EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRG 440
           E + +P +L D LG    ETC TYNMLK++RH+F W    AYADYYERA+ N +L+ Q+ 
Sbjct: 291 EHFGEPGKLNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQP 350

Query: 441 TEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
            + G + Y + L  G  K+     + ++++ F CC G+G+ES S  G +IYF     +  
Sbjct: 351 VD-GRVCYFVSLEMGGHKS-----FNSQYDDFTCCVGSGMESHSMYGTAIYFHTPETI-- 402

Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
            Y+ QY+ S+  W+   V L Q+      +    R TL   SK+   +L ++ LR P W 
Sbjct: 403 -YVNQYVPSTVTWEEMDVQLKQE----TLFPQNGRGTLRVISKEP--KLFTIKLRCPHWA 455

Query: 561 YSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
              G    +NG+       P +++     W+  D +   +P+++R E +    P+     
Sbjct: 456 -EQGMMIKINGEEYATEACPTSYVVIEREWNDADTIEYDIPMTVRIEEM----PDNPRRI 510

Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN 677
           A ++GP +LAG   G    K+   R L++++     S   +L+    E   +TF M++
Sbjct: 511 AFMYGPLVLAGDL-GPVTPKSNEERLLASVLIGAADSLTTKLIADGNEP--NTFRMND 565


>gi|390452646|ref|ZP_10238174.1| hypothetical protein PpeoK3_01345 [Paenibacillus peoriae KCTC 3763]
          Length = 767

 Score =  279 bits (714), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 185/552 (33%), Positives = 283/552 (51%), Gaps = 39/552 (7%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENP 157
           +KE     V L++ S    A    L+++  ++ D ++++FR+ A++ T G +   GW+ P
Sbjct: 191 VKEFKGQKVSLERESEFEAAMNRFLQFVRSVNDDQMLYNFREAAAIDTKGAQPMTGWDAP 250

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI------GTGYLSAF 211
              L+GH  GHYLSA A  + +T ++ +  K+  +V  L +CQ  +      G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYNATEDSALLGKIQYMVVELGKCQTALSEQAGYGRGFLSAY 310

Query: 212 PTELFDSFE---ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
             E F+  E       +WAPYYT+HKI+AGLLD Y LA   +AL +   +  + +NR+ +
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALDICDKLGHWLHNRLGR 370

Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            +    + + W   +  E GGMN+VL +LY+IT +  +L+ A  FD       +    D 
Sbjct: 371 -LPREQLHKMWSLYIAGEFGGMNEVLAKLYAITGNKNYLMTAKYFDNEKLFLPMKENVDT 429

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L + HAN HIP VIG+   +EV GD  Y  I   F  +V  SH Y  GGT   E + +P 
Sbjct: 430 LGNTHANQHIPQVIGALKLFEVAGDEAYFNIAENFWTMVTQSHIYPIGGTGETEMFREPD 489

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP-GVM 446
            +A  L  +  ETC +YNMLK+++ LF++     Y DYYE+AL N +L+ +   +  G  
Sbjct: 490 AIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEGGS 549

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
            Y +PL  G  K   TH          CC+GTG+E+  K  ++IYF +E     LY+  Y
Sbjct: 550 TYFMPLAPGSIKKFDTH-------ENTCCHGTGLENHFKYQEAIYFHDEDR---LYVNLY 599

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
           I S  DW    + L QK D     D    +        E    ++L  R+P W  S   Q
Sbjct: 600 IPSRLDWSDQGLSLVQKRDS----DGLETVRFYIEGVPE----TTLMFRIPDWI-SEPVQ 650

Query: 567 ASLNGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
             +NG+    L     +L   + W   D++ + LP SLR      D P+  +++++ +GP
Sbjct: 651 VKINGEPCRDLEYEDGYLKLRKVWK-KDEIELTLPCSLRLA----DAPDDHTLKSLAYGP 705

Query: 626 YLLAGHTSGEWD 637
           Y+LA   SGE D
Sbjct: 706 YVLAA-ISGEQD 716


>gi|325919533|ref|ZP_08181551.1| hypothetical protein XGA_0490 [Xanthomonas gardneri ATCC 19865]
 gi|325549987|gb|EGD20823.1| hypothetical protein XGA_0490 [Xanthomonas gardneri ATCC 19865]
          Length = 791

 Score =  279 bits (714), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 196/575 (34%), Positives = 276/575 (48%), Gaps = 65/575 (11%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A QTN  YL+ L+ D L+ +F   A L     AYGGW
Sbjct: 46  PGS-IRAVPLAQVRLTPSLFL-DALQTNRRYLMRLEPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAHYLVAELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DNAQAL++
Sbjct: 162 NAAGKIESGRAVFDELKKGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q V +     +    L+ E GG+N+    L+  T D + L LA     
Sbjct: 222 AVGLAGY----LQAVFSALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P   +  L  +  E C +YNMLK++RHL++W  +  + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSTSKFLTEQTCEHCASYNMLKLTRHLYQWGPQAEFFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM P+  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL------TFSSKQEVGQ 548
                G+Y+  Y+ SS             V      D  LR T+      +        +
Sbjct: 452 G---QGVYVNLYVPSS-------------VRDAAGLDMTLRSTMPEQGSASLRVDAAPAE 495

Query: 549 LSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
             +L LR+P W  S   Q  LNGQ +       +L  T  W   D L +   + LR EA 
Sbjct: 496 QRTLALRVPGWAQSPVLQ--LNGQPVGAAVSDGYLRITRVWRAGDTLDLSFEMPLRLEAA 553

Query: 609 QDDRPEYASIQAILFGPYLLA---GHTSGEWDIKT 640
            DD P + S   +L GP +LA   G  +  W  KT
Sbjct: 554 ADD-PAWVS---VLRGPLVLAADLGDAAKPWSGKT 584


>gi|374372949|ref|ZP_09630610.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
 gi|373235025|gb|EHP54817.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
          Length = 653

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 184/541 (34%), Positives = 278/541 (51%), Gaps = 39/541 (7%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
           L EV L D    ++  + R Q     +LL + + SL+ SF   A +           K Y
Sbjct: 57  LSEVKLLDSRFKEN--MLREQH----WLLAISLKSLLHSFYTNAGMYDANEGGYDEIKKY 110

Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSA 210
            GWE+   ELRGH  GH LS  A M+AST     K K  T++ +L+  Q  +   GY+SA
Sbjct: 111 AGWESMDCELRGHSTGHILSGLALMYASTGEQIYKSKGDTIIKALAAIQKTLNQNGYISA 170

Query: 211 FPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
           FP E  +     + VWAP+YT+HKILAG+LDQY+  +N QAL +A     + Y ++  + 
Sbjct: 171 FPQEFINRNIRGEKVWAPWYTLHKILAGVLDQYLYCNNDQALDIAKNFSAWAYKKLHPL- 229

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
              +  +    L  E GGMN+V + LY+IT D K   L + F     L  L    D L  
Sbjct: 230 ---TAGQRTLMLRNEFGGMNEVFFNLYAITGDEKDKWLGNFFYDNRMLDPLKAGIDNLKG 286

Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
            HANT+IP ++G    YE+ G+     +  FF   V   HS+ATG  S RE ++ P  ++
Sbjct: 287 AHANTYIPKLLGVTRDYEIEGNAGGDAVVRFFWQRVTTHHSFATGSNSDREHFFQPDAIS 346

Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
             L     E+C  YNMLK++RHL+  +  + YADYYE+AL N +L  Q+    G++ Y L
Sbjct: 347 THLTGYTGESCNVYNMLKLTRHLYIHSGNVKYADYYEKALFNHILG-QQDPATGMIAYFL 405

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           P+  G  K  ST       +SFWCC GTG E+ +K G+ IY+  + +   LYI  +I S 
Sbjct: 406 PMLPGAHKVYSTPD-----SSFWCCVGTGFENQAKYGEGIYYHTQND---LYINLFIPSD 457

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
            +WK     L Q+       D  ++ T+  + +  +    ++N+R P W  +     ++N
Sbjct: 458 LNWKEKSFRLMQQTK--FPEDGNMKFTIDEAPEFPL----TINIRYPDWV-AGRPTITIN 510

Query: 571 GQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           G+++ +    + ++S    W  ND++ +   + LRT    D+     S+ AI +GP +LA
Sbjct: 511 GRSIKIEQAADSYISIKRIWKKNDRIEVNYRMQLRTIPANDN----PSVAAIAYGPVVLA 566

Query: 630 G 630
           G
Sbjct: 567 G 567


>gi|443291943|ref|ZP_21031037.1| Conserved secreted hypothetical protein [Micromonospora lupini str.
           Lupac 08]
 gi|385885131|emb|CCH19144.1| Conserved secreted hypothetical protein [Micromonospora lupini str.
           Lupac 08]
          Length = 778

 Score =  279 bits (713), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 194/576 (33%), Positives = 277/576 (48%), Gaps = 55/576 (9%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
           WLD        Q   L YL  +DVD ++++FR    L T G A  GGW+ P    R H  
Sbjct: 65  WLDN-------QNRTLNYLRFVDVDRMLYNFRANHRLSTNGAATNGGWDAPNFPFRTHMQ 117

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
           GH+L+A AQ +A   + T ++K + +V  L++CQ        G GYLS FP   F + EA
Sbjct: 118 GHFLTAWAQAYAVLGDTTCRDKANYMVAELAKCQANNGAAGFGAGYLSGFPESDFSALEA 177

Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
             L     PYY IHK LAGLLD +    N QA  +   +  +   R  ++    S  +  
Sbjct: 178 RTLSNGNVPYYCIHKTLAGLLDVWRYTGNTQARTVLLALAGWVDTRTSRL----SSSQMQ 233

Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
             L  E GGMNDVL  +Y +T D + L  A  FD       LA   D L+  HANT +P 
Sbjct: 234 SMLGTEFGGMNDVLTEIYQMTGDSRWLTTAQRFDHASVFNPLANNQDQLNGLHANTQVPK 293

Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE 399
            +G+   ++ TG   Y+ I +   +I   +H+Y  GG S  E +  P  +A  L ++  E
Sbjct: 294 WVGAAREFKATGTTRYRDIASNAWNITVRAHTYVIGGNSQAEHFRAPNAIAGYLSNDTCE 353

Query: 400 TCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQRGTE-PGVMIYMLPLG---- 453
            C TYNMLK++R L+        Y DYYERA  N ++  Q   +  G + Y  PL     
Sbjct: 354 QCNTYNMLKLTRELWLLDPSRTDYFDYYERATINHLIGAQNPADSKGHITYFTPLKPGGR 413

Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
           RGV  A     W T +NSFWCC GTG+E  +KL DSIYF        L +  ++ S  +W
Sbjct: 414 RGVGPAWGGGTWSTDYNSFWCCQGTGVEINTKLMDSIYFYSG---TTLTVNLFVPSELNW 470

Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
               + + Q     VS    L +  T S         S+ +R+P WT  NGA  S+NG  
Sbjct: 471 SQRGITVTQSTTYPVSDTTTLTLGGTMSGSW------SVRVRIPAWT--NGATVSVNGVE 522

Query: 574 LPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
             +   PG++ + T  W+  D +T++LP+ +  +   D+    +SI A+ +GP +LAG+ 
Sbjct: 523 QSVATTPGSYATVTRTWAAGDTITVRLPMRVVVQPTNDN----SSIAAVTYGPSVLAGNY 578

Query: 633 SGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQES 668
                       +LSA     PP+ N   +  T  S
Sbjct: 579 GNS---------TLSA-----PPALNVSSIARTSTS 600


>gi|307110572|gb|EFN58808.1| hypothetical protein CHLNCDRAFT_56904 [Chlorella variabilis]
          Length = 937

 Score =  278 bits (712), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 149/322 (46%), Positives = 197/322 (61%), Gaps = 7/322 (2%)

Query: 124 EYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNA 183
           +YLL L+ D L+++FRK A LPTPG +YGGWE   SE+RG F+GHY+SA A     T   
Sbjct: 51  QYLLALEPDRLLFNFRKNAGLPTPGASYGGWEWSESEVRGQFIGHYMSAVAFAALHTGRT 110

Query: 184 TIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQY 243
              ++   +V  L + Q+  G GYLSAFP   FD  EAL+PVWAPYY IHKI+AGLLDQ+
Sbjct: 111 EFYDRSKLMVHELKKVQDAFGNGYLSAFPESHFDRLEALQPVWAPYYVIHKIMAGLLDQH 170

Query: 244 VLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWY-SLNEETGGMNDVLYRLYSITHD 302
            LA   +ALKMA  M  YF  R Q+V    + E +WY  L  E GGMN+VLY L+++T D
Sbjct: 171 QLAGTDEALKMAEQMASYFCGRAQRV-RENNGEDYWYRCLENEFGGMNEVLYNLFAVTAD 229

Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
             H   AH FDKP F   L    D L   HANTH+  V G   RYE  GD         F
Sbjct: 230 DHHAECAHWFDKPVFYRPLVEGTDPLPGLHANTHLAQVQGFAARYEHLGDEEAMAAVRNF 289

Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN-----EETCTTYNMLKVSRHLFRWT 417
             ++   H+++TGG++  E W +   LA+ + + +     EE+CT YN+LK++R+LFR T
Sbjct: 290 FALILQHHTFSTGGSNWYERWGNEDSLAEAINNTDASRITEESCTQYNILKLARYLFRHT 349

Query: 418 KEIAYADYYERALTNGVLSIQR 439
            + A AD+YERA+ N V+ IQ+
Sbjct: 350 GDPALADFYERAILNDVIGIQK 371



 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 102/242 (42%), Gaps = 63/242 (26%)

Query: 424 DYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESF 483
           D Y  A  N V    +   PGV IY LPLG G  K      WGT +++FWCCYGT +ESF
Sbjct: 441 DPYAAAHANSV----QPAGPGVYIYYLPLGVGHDK-----NWGTPWDTFWCCYGTAVESF 491

Query: 484 SKLGDSIYFEE---------------EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
           S L  SIYF+                  ++P L++ Q +SSS  W+              
Sbjct: 492 SSLAGSIYFKHMPGTAPSASSSGPTAAEDLPQLFVNQMVSSSVHWRE------------- 538

Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN--------------- 573
                L +  + +  +   Q   LN R+P W   +     +NG+                
Sbjct: 539 -----LGVEGSANGDKPQAQF-VLNWRVPGWAKGDEVMLRVNGKEYLECAQGAAAAAHDA 592

Query: 574 LPLPPP-----GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
           L   PP       F S    WS  D +   +P+ + TE + D R    S++AI+ GP+++
Sbjct: 593 LGFQPPQFGAGARFCSLGSTWSDGDVVEADMPMWVVTEDLNDSRKAMQSLKAIMMGPFVM 652

Query: 629 AG 630
           AG
Sbjct: 653 AG 654


>gi|310639749|ref|YP_003944507.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus polymyxa
           SC2]
 gi|386038950|ref|YP_005957904.1| hypothetical protein PPM_0260 [Paenibacillus polymyxa M1]
 gi|309244699|gb|ADO54266.1| Acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus polymyxa
           SC2]
 gi|343094988|emb|CCC83197.1| DUF1680 domain containing protein [Paenibacillus polymyxa M1]
          Length = 751

 Score =  278 bits (711), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 190/540 (35%), Positives = 281/540 (52%), Gaps = 39/540 (7%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
            LH V +D S  L+ A + N  YLL L+ D L+  FR+ A L      Y GWE     + 
Sbjct: 7   DLHKVSID-SGPLYHAMELNTTYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--ARGIS 63

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
           GH +GHYLS  A M+AST +  + E+++ V+  L  CQN  G GY+S  P   E+F+  +
Sbjct: 64  GHTLGHYLSGCALMFASTGDKRLLERVNYVIDELEICQNSHGNGYISGIPRGKEIFEEVK 123

Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
           A         L   W P YT+HK+ AGL D ++LA + +AL M   + ++    ++ V  
Sbjct: 124 AGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLAHHPKALAMEIQLGDW----LEDVFQ 179

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
             S E+    L+ E GGMN+VL  L   + + + L LA  F     L  LA   D L+  
Sbjct: 180 GLSDEQVQQVLHCEFGGMNEVLTDLAEHSGEKRFLNLAERFYHGEVLNDLADSRDTLAGR 239

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           HANT IP +IG+  ++EVTG PLY  +  FF D V   HSY  GG S  E + +P +L D
Sbjct: 240 HANTQIPKIIGAARQFEVTGKPLYADLSRFFWDRVVHKHSYVIGGNSYNEHFGEPGKLND 299

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
            LG    ETC TYNMLK++RH+F W    AYADYYERA+ N +L+ Q+  + G + Y + 
Sbjct: 300 RLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-GRVCYFVS 358

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           L  G  K+     + +++  F CC G+G+ES S  G +IYF     +   Y+ QY+ S+ 
Sbjct: 359 LEMGGHKS-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTANTI---YVNQYVPSTV 410

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
            W   ++ L Q+      +    R TL   SK+   +  ++ LR P W    G +  +NG
Sbjct: 411 TWDEMNIQLKQE----TLFPQNGRGTLHLISKEP--KFFTIKLRCPHWA-EQGMKIKING 463

Query: 572 QNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +       P +++     W   D +   +P+++R E +    P+     A ++GP +LAG
Sbjct: 464 EEYAAEACPTSYIVIEREWKDGDTVEYDIPMTVRVEEM----PDNPRRIAFMYGPLVLAG 519


>gi|384428325|ref|YP_005637684.1| hypothetical protein XCR_2693 [Xanthomonas campestris pv. raphani
           756C]
 gi|341937427|gb|AEL07566.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
           756C]
          Length = 791

 Score =  278 bits (711), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 194/565 (34%), Positives = 279/565 (49%), Gaps = 52/565 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGWE   
Sbjct: 49  IRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWE--A 105

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE---- 214
             + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +    
Sbjct: 106 DTIAGHTLGHYLSALALMHAQTDDAQCRTRARYLVAELARCQAHAGDGYVAGFTRKNAAG 165

Query: 215 -------LFDSFE--ALKPV-------WAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
                  +FD  +   ++P+       WAP YT HK+ AGLLD +V  DNAQAL++A  +
Sbjct: 166 QIESGRAVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVAL 225

Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
             Y    +Q +       +    L+ E GG+N+    L+  T   + L LA         
Sbjct: 226 AGY----LQGIFAALDDTQLQKVLSCEFGGLNESFVELHVRTGHAQWLALAQRLHHHAVF 281

Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
             L  Q D L H H+NT+IP +IG    YEVTGD        FF + V   HSY  GG  
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGNG 341

Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
            RE++  P  ++  L  +  E C++YNMLK++RHL+RW  + AY DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCSSYNMLKLTRHLYRWGPQAAYFDYYERTLLNHVMA-Q 400

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
           +    G+  YM P+  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+E+    
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWEDG--- 452

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            G+ I  Y+ S     +G  +      P       LR+    ++++      +L+LR+P 
Sbjct: 453 QGVAINLYVPSRVRNAAGLDMTLHSALP-AQGSVSLRIDAAPAAQR------TLSLRVPG 505

Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
           W  +   Q  LNG  +   P   +L  T  W   D L + L + LR EA  DD P + S 
Sbjct: 506 WAATPVLQ--LNGAVVDAAPVDGYLRVTRIWHPGDTLDLSLHMPLRLEATPDD-PAWVS- 561

Query: 619 QAILFGPYLLA---GHTSGEWDIKT 640
             +L GP +LA   G  +  W  KT
Sbjct: 562 --LLRGPLVLAADLGDAATPWSGKT 584


>gi|374313035|ref|YP_005059465.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358755045|gb|AEU38435.1| protein of unknown function DUF1680 [Granulicella mallensis
           MP5ACTX8]
          Length = 798

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 195/594 (32%), Positives = 285/594 (47%), Gaps = 61/594 (10%)

Query: 100 KEVSLHDVWLDQSSV------LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGG 153
           ++V L  V L  SSV      L RAQ  + +YLL L  + ++   R+ A+L    + YGG
Sbjct: 28  QKVQLKAVPLPFSSVRLTGGPLKRAQDLDAQYLLDLQPERMLARLRQRANLAPKAEGYGG 87

Query: 154 WENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-- 211
           W+    +L GH  GHYLSA + M+A+T +   K +    V  L   QN  G GY+ A   
Sbjct: 88  WDGDGRQLTGHIAGHYLSAISMMYATTGDVRFKNRADDFVTELQNIQNAQGDGYIGALLD 147

Query: 212 --------------PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQAL----K 253
                           E+      L  +W+P+Y  HK+ AGL D Y L  N +AL    K
Sbjct: 148 AKGVDGKVRFQDLSKGEIHSGGFDLNGLWSPWYVEHKLFAGLRDAYHLTGNRKALDVEIK 207

Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
            A W         + ++   S E+    L  E GGMN+VL  LY+ T+DP+ L L+  F+
Sbjct: 208 FAGW--------AETIVGHLSDEQLQRMLATEFGGMNEVLADLYADTNDPRWLKLSDKFE 259

Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYA 373
               +  L+   D L+  HANT IP +IG   RY  TGD        FF D V+  HS+A
Sbjct: 260 HHAIVDPLSRGQDILAGKHANTQIPKMIGELARYVYTGDETDGKAAMFFFDEVSEHHSFA 319

Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
           TGG    E++  P ++ D +     E+C  YNM+K++R LF    +  YAD+ ERA  N 
Sbjct: 320 TGGDGKNEYFGQPDKMNDMIDGRTAESCAAYNMIKMARDLFSLDPQARYADFIERADLNA 379

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
           +L  Q   E G + YM+P+GRGV      H +  KF SF CC G+ +E+ +     IY  
Sbjct: 380 ILGGQ-DPEDGRVSYMVPVGRGVQ-----HEYQDKFESFTCCVGSQMETHAFHAYGIY-S 432

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
           E GN   L++ QY  ++ DW S  + L    +  +     L++T   S K +V    ++ 
Sbjct: 433 ESGNK--LWVSQYDPTTVDWASQGMKLEMVTNLPMGDSAALKIT---SGKTKV---FTIA 484

Query: 554 LRMPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
           LR P W  + G    +NG+ L     P  ++    +W   D + I LP +LR EA+    
Sbjct: 485 LRRPYWVGA-GFSVKVNGETLQNTSTPDTYIEINRKWKVGDTVEIVLPKTLRKEAL---- 539

Query: 613 PEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQ 666
           P+  +  AI++GP +LAG      D+    +R  S     + P     L+T  Q
Sbjct: 540 PDNPNRMAIMWGPLVLAG------DLGPEVSRRHSGGQGGVAPEPAPALITAEQ 587


>gi|325927064|ref|ZP_08188334.1| hypothetical protein XPE_2335 [Xanthomonas perforans 91-118]
 gi|325542563|gb|EGD14035.1| hypothetical protein XPE_2335 [Xanthomonas perforans 91-118]
          Length = 791

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 197/619 (31%), Positives = 293/619 (47%), Gaps = 70/619 (11%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q + +     +    L+ E GG+N+    L+  T+D + L LA     
Sbjct: 222 AVGLAGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L+H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RHL++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
                G+Y+  Y+ S     +G   L+  +   +       + +  +  ++     +L L
Sbjct: 452 G---QGVYVNLYVPSMVHDAAG---LDMTLHSALPEQGSASLRIDAAPAEQ----RTLAL 501

Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           R+P W      Q  LNGQ +       +L  T  W   D L++   + LR EA  DD P 
Sbjct: 502 RVPGWAQQPRLQ--LNGQPVDTAASDGYLRITRVWQRGDTLSLAFDMPLRLEATSDD-PA 558

Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQLVTFTQES 668
           + S   +L GP +LA   G  +  W  KT      + +   + P+P              
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKTPALIGGQDILQRLQPVP-------------- 601

Query: 669 GNSTFVMSNSNQSITMEEF 687
           G + FV ++  Q   +  F
Sbjct: 602 GKTAFVYNDGVQQWQLSPF 620


>gi|427384529|ref|ZP_18881034.1| hypothetical protein HMPREF9447_02067 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727790|gb|EKU90649.1| hypothetical protein HMPREF9447_02067 [Bacteroides oleiciplenus YIT
           12058]
          Length = 777

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 188/539 (34%), Positives = 267/539 (49%), Gaps = 55/539 (10%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A++    YLL L+ D  +  FR  A L      Y GWE+    + G  +GHY+SA A  +
Sbjct: 51  AEEKEATYLLELEPDRFLSGFRSEAGLVPKAPKYEGWES--LGVAGQTLGHYMSACAMYY 108

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVW 226
           A++ +    +K+  ++  L  CQ   G GYL+A P   ++F    A         L   W
Sbjct: 109 ATSGDERFLQKLEYIINELDSCQQANGNGYLAATPGGKKIFAEVSAGNIYSQGFDLNGGW 168

Query: 227 APYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFY----NRVQKVITMYSVERH 278
            P Y +HK+LAGL+D Y  A + QAL    K+A WM   FY    +++QKV+        
Sbjct: 169 VPLYVMHKVLAGLIDAYQYARSEQALRIAEKLADWMYGTFYHLTEDQMQKVLAC------ 222

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLGFLALQADYLSHFHANTHI 337
                 E GGMN+ L  LY+ T + K LLLA  FD     +  LA+  D L   HANT +
Sbjct: 223 ------EFGGMNEALANLYAYTKNDKFLLLAQRFDNHKAIMDSLAIGVDDLEGKHANTQV 276

Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
           P +IG+   YE+TG      I +FF   V  +HSY  GG S  E +  P++L + L + N
Sbjct: 277 PKMIGAARLYELTGSKRDSSIASFFWHTVVDNHSYVNGGNSDGEHFGTPRKLNERLSTSN 336

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
            ETC TYNMLK++RHLF W     Y+ YYERA+ N +L+ Q   + G+  Y  PL  G  
Sbjct: 337 TETCNTYNMLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGK 395

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
           K     G+ + F SF CC G+G+E+  K GD IY   EG+   L++  +I S   W +  
Sbjct: 396 K-----GYLSPFQSFCCCSGSGMENHVKYGDFIY--SEGSDSSLFVNLFIPSRLTWTARD 448

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
           +++ Q  D      P    T+  + K E+ Q     LR P W  S      +NG+++ L 
Sbjct: 449 LIVTQDTDI-----PSSNKTV-LTVKTEMPQSVVFRLRYPEWAES--MSLKVNGKSVSLK 500

Query: 578 PPG-NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
             G N++S    W  NDKL I   +   T A+ D+         + +GP LLAG    E
Sbjct: 501 ASGNNYVSIEREWKDNDKLEITFGIKFYTVAMPDNEKRV----GLFYGPVLLAGELGQE 555


>gi|294624781|ref|ZP_06703443.1| secreted protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB
           11122]
 gi|292600913|gb|EFF44988.1| secreted protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB
           11122]
          Length = 791

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 194/569 (34%), Positives = 277/569 (48%), Gaps = 53/569 (9%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+F + V L  V L  S  L  A  TN  YL+ L+ D L+ +F   A L     AYGGW
Sbjct: 46  PGSF-RAVPLAQVRLTPSLFL-DALHTNRRYLMRLEPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVAELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD             L   WAP YT HK+ AGLLD +   DNAQAL++
Sbjct: 162 NAAGKIESGRAVFDELRRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q +       +    L+ E GG+N+    L+  T D + L LA     
Sbjct: 222 AVSLAGY----LQGIFAALDDAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  +  +  E C +YNMLK++RHL++W  +  + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFVTEQTCEHCASYNMLKLTRHLYQWGPQAEFFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           L+ Q+    G+  YM P+  G ++A     W + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 LA-QQHPRTGMFTYMTPMLAGEARA-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
                G+Y+  Y+ SS    +G  +  +   P       LR+ +  + ++       L L
Sbjct: 452 G---QGVYVNLYVPSSVRDAAGLDMTLRSTMPEQG-SASLRIDVAPAEQR------MLAL 501

Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           R+P W  S   Q  LNGQ +       +L     W   D LT+   + LR EA  DD P 
Sbjct: 502 RLPGWAQSPRLQ--LNGQPVDTTVNEGYLRIARFWRAGDTLTLSFEMPLRLEATTDD-PA 558

Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
           + S   +L GP +LA   G  +  W  KT
Sbjct: 559 WVS---VLRGPLVLAADLGAAAKPWSGKT 584


>gi|398384929|ref|ZP_10542957.1| hypothetical protein PMI04_02662 [Sphingobium sp. AP49]
 gi|397722209|gb|EJK82754.1| hypothetical protein PMI04_02662 [Sphingobium sp. AP49]
          Length = 802

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 178/533 (33%), Positives = 266/533 (49%), Gaps = 44/533 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A + N  YLL L+ D L+ +FRK A L   G  YGGWEN    + GH +GHYL+A A M 
Sbjct: 63  AVEGNRLYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWEN--DTIAGHTLGHYLTALALMH 120

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------------- 221
           A T +A    + + ++  L+ CQ   G GY++ F     D  E                 
Sbjct: 121 AQTGDAECARRAAYIIDELAACQAAAGDGYVAGFTRRRDDVIEDGRLIFPEIMRGDIRSA 180

Query: 222 ---LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
              L   W P+Y  HK+ AGL D      N+QA  +A  +  Y    +  V       + 
Sbjct: 181 GFDLNGCWVPFYNWHKLFAGLFDAEAHLGNSQARGVALALAAY----IDGVFAKLDDAQV 236

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
              L+ E GG+N+    L++ T DP+ L LA        L  LA + + L   HANT IP
Sbjct: 237 QQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDPLAQRQNSLPWIHANTQIP 296

Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
            +IG    +E+TG+    +   FF + V   +SY  GG + RE++ DP  ++  +  +  
Sbjct: 297 KLIGLARLHEITGNAADAIAANFFWETVVGQYSYVIGGNADREYFPDPGTISKHITEQTC 356

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +YNMLK++RHL+ W  E    DYYERA  N +L+ Q     G+  YM+PL  G  +
Sbjct: 357 ESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQNPAT-GMFAYMVPLMSGSHR 415

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ-YISSSFDWKSGH 517
                 W   F+ FWCC G+G+ES +K G+SI++E+      + I   YI S  DW +  
Sbjct: 416 V-----WSEPFDDFWCCVGSGMESHAKHGESIWWEDTDRPADMLIANLYIPSEADWAARG 470

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
             L  +++    +D ++   L+  +    G+  +L LR+P W    GA+ ++NG  LP P
Sbjct: 471 AKL--RIETGYPFDGHI--ALSIPTLARAGRF-TLALRIPGW--CQGARVAVNGTPLPTP 523

Query: 578 PPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
              +  +  +R W   D++T+ LP++LR EA  DD    A   A+L GP +LA
Sbjct: 524 RIVDGYALIDRKWKAGDQVTLDLPMALRVEATPDD----ARTIALLHGPVVLA 572


>gi|78048280|ref|YP_364455.1| hypothetical protein XCV2724 [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|78036710|emb|CAJ24403.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 791

 Score =  276 bits (706), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 197/619 (31%), Positives = 293/619 (47%), Gaps = 70/619 (11%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q + +     +    L+ E GG+N+    L+  T+D + L LA     
Sbjct: 222 AVSLAGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L+H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RHL++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
                G+Y+  Y+ S     +G   L+  +   +       + +  +  ++     +L L
Sbjct: 452 G---QGVYVNLYVPSMVHDAAG---LDMTLHSALPEQGSASLRIDAAPAEQ----RTLAL 501

Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           R+P W      Q  LNGQ +       +L  T  W   D L++   + LR EA  DD P 
Sbjct: 502 RVPGWAQQPRLQ--LNGQPVDTAASDGYLRITRVWQRGDTLSLAFDMPLRLEATSDD-PA 558

Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQLVTFTQES 668
           + S   +L GP +LA   G  +  W  KT      + +   + P+P              
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKTPALIGGQDILQRLQPVP-------------- 601

Query: 669 GNSTFVMSNSNQSITMEEF 687
           G + FV ++  Q   +  F
Sbjct: 602 GKTAFVYNDGVQQWQLSPF 620


>gi|300785876|ref|YP_003766167.1| hypothetical protein AMED_3987 [Amycolatopsis mediterranei U32]
 gi|384149186|ref|YP_005532002.1| hypothetical protein RAM_20325 [Amycolatopsis mediterranei S699]
 gi|399537759|ref|YP_006550421.1| hypothetical protein AMES_3940 [Amycolatopsis mediterranei S699]
 gi|299795390|gb|ADJ45765.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
 gi|340527340|gb|AEK42545.1| hypothetical protein RAM_20325 [Amycolatopsis mediterranei S699]
 gi|398318529|gb|AFO77476.1| hypothetical protein AMES_3940 [Amycolatopsis mediterranei S699]
          Length = 775

 Score =  276 bits (706), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 194/548 (35%), Positives = 273/548 (49%), Gaps = 51/548 (9%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
           WLD        Q     YL  +DV+ L++ FR    L T G A  GGW+ P    R H  
Sbjct: 67  WLDN-------QNRTQNYLRFVDVNRLLYVFRANHRLSTGGAATNGGWDAPSFPFRSHVQ 119

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEA 221
           GH+L+A AQ+WA T + T ++K +T+V  L++CQ   G      GYLS FP   FD+ EA
Sbjct: 120 GHFLTAWAQLWAVTGDTTSRDKATTMVAELAKCQANNGAAGFSAGYLSGFPEADFDNLEA 179

Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSV 275
             L     PYY IHK +AGLLD +    + QA    L +A W        V +     S 
Sbjct: 180 GRLSNGNVPYYCIHKTMAGLLDVWRYIGSTQARDVLLNLAGW--------VDRRTARLST 231

Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
            +    LN E GGMNDVL  LY  T D + L  A  FD       LA   D L+  HANT
Sbjct: 232 SQLQSVLNTEFGGMNDVLADLYQYTGDARWLTAAQRFDHAAVFDPLAANRDQLNGLHANT 291

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
            +P  IG+   Y+ TG   Y+ I T   +I   +H+YA GG S  E +  P  +A  L  
Sbjct: 292 QVPKWIGAAREYKATGTTRYRDIATNAWNITVGAHTYAIGGNSQAEHFRAPNAIAAYLNQ 351

Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEP-GVMIYMLPLG 453
           +  E+C TYNMLK++R L     + A  ADYYERAL N ++  Q   +  G + Y   L 
Sbjct: 352 DTCESCNTYNMLKLTRELIALYPDRADLADYYERALLNQMIGQQNPADSHGHITYFSSLN 411

Query: 454 ----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
               RG+  A     W T ++SFWCC GTG+E+ +KL DSIYF  +     L +  ++ S
Sbjct: 412 PGGRRGLGPAWGGGTWSTDYDSFWCCQGTGLETQTKLADSIYFYNDTT---LTVNLFLPS 468

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
              W    + + Q      S       T T +    V    ++ +R+P WT   GA  S+
Sbjct: 469 VLTWTQRGITVTQTTSFPAS------DTSTLTVTGSVSGTWAMRIRIPGWT--TGATISV 520

Query: 570 NG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           NG  QN+    PG++ + +  W+  D +T++LP+ +  +A      + A++ A+ +GP +
Sbjct: 521 NGVAQNVAT-TPGSYATLSRSWASGDAVTVRLPMKVALKAAN----DNANVAAVTYGPVV 575

Query: 628 LAGHTSGE 635
           LAG+ SG 
Sbjct: 576 LAGNYSGS 583


>gi|289661682|ref|ZP_06483263.1| putative secreted protein, partial [Xanthomonas campestris pv.
           vasculorum NCPPB 702]
          Length = 756

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 195/619 (31%), Positives = 294/619 (47%), Gaps = 70/619 (11%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L   S+   A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRL-MPSLFLDALNTNRRYLMRLQPDRLLHNFVLYAGLDPQAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q + +     +    L+ E GG+N+    L+  T+D + L LA     
Sbjct: 222 AVGLAGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L+H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLVTQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RHL++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRSGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
                G+++  Y+ S+    +G   L+  +   +       + +  +  ++     +L L
Sbjct: 452 G---QGVFVNLYVPSTVRDAAG---LDMTLHSALPEQGSASLRIDAAPAEQ----RTLAL 501

Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           R+P W      Q  LNGQ +       +L  T  W   D L++   + LR EA  DD P 
Sbjct: 502 RVPGWAQQPRLQ--LNGQPVDSAASDGYLRITRVWQRGDTLSLAFDMPLRLEATPDD-PA 558

Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQLVTFTQES 668
           + S   +L GP +LA   G  +  W  KT      + +   + P+P              
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSSKTPALIGGQDILQRLQPVP-------------- 601

Query: 669 GNSTFVMSNSNQSITMEEF 687
           G + FV ++  Q   +  F
Sbjct: 602 GKTAFVYNDGAQQWQLSPF 620


>gi|408393860|gb|EKJ73118.1| hypothetical protein FPSE_06731 [Fusarium pseudograminearum CS3096]
          Length = 623

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 200/560 (35%), Positives = 273/560 (48%), Gaps = 44/560 (7%)

Query: 89  PGGFDLPGNF-LKEVSLHDV-WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT 146
           P   DL   F L +VSL D  W+D        Q   + YLL +D D L++ FRK   L T
Sbjct: 25  PKVSDLADAFELSDVSLTDSRWMDN-------QGRTVNYLLSIDPDRLLYVFRKNHGLDT 77

Query: 147 PGKAY-GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN---K 202
            G A  GGW+ P    R H  GH+LSA +  +A+  N     + S  V  L++CQ    K
Sbjct: 78  KGAAKNGGWDAPDFPFRSHVQGHFLSAWSNCYATLGNKECGSRASYFVKELAKCQANNAK 137

Query: 203 IG--TGYLSAFPTELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
           +G  +GYLS FP       E   L     PYY IHK LAGLLD Y    +  A  +   +
Sbjct: 138 VGFTSGYLSGFPESEITKVEDRTLSSGNVPYYAIHKTLAGLLDVYRRVGDNDAKTVMLSL 197

Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
             +   R  K+    S  +    +  E GGMN+VL  +   T D K L +A  FD     
Sbjct: 198 ASWVDARTGKL----SYAKMQQMMQTEFGGMNEVLADIAYYTQDNKWLKVAQRFDHAAIF 253

Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
             L    D LS  HANT +P  IG+   Y+V+GD  Y  IG    D+    H+YA GG S
Sbjct: 254 DPLQNNVDKLSGLHANTQVPKWIGALREYKVSGDKKYLDIGRNAWDLTVHKHTYAIGGNS 313

Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSI 437
             E + +P  +A  L  +  E C TYNMLK++R L+     + +Y DYYE AL N +L  
Sbjct: 314 QAEHFREPNAIAKYLTKDTCEACNTYNMLKLTRELWALNPTDASYFDYYENALMNHLLGQ 373

Query: 438 QRGTEP-GVMIYMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
           Q   +  G + Y  PL     RGV  A     W T +NSFWCC G+GIE+ +KL DSIYF
Sbjct: 374 QNPKDSHGHVTYFTPLTPGGRRGVGPAWGGGTWSTDYNSFWCCQGSGIETNTKLMDSIYF 433

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
             +     LY+  +  S  +W    V + Q  +    +      TL    K     L+  
Sbjct: 434 HTKDT---LYVNLFTPSKLNWSQQGVSIIQTTE----YPQKDSSTLQIGGKAGTWTLA-- 484

Query: 553 NLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDD 611
            +R+P WT  + A   +NGQ++ +   PG +   T  W+  DK+TI LP+SLRT A  D+
Sbjct: 485 -VRIPSWT--SKASIQVNGQSVNVNTTPGKYALVTRNWNSGDKVTITLPMSLRTIAANDN 541

Query: 612 RPEYASIQAILFGPYLLAGH 631
               + + A+ FGP +LA +
Sbjct: 542 ----SQVAAVAFGPVILAAN 557


>gi|312621677|ref|YP_004023290.1| hypothetical protein Calkro_0576 [Caldicellulosiruptor
           kronotskyensis 2002]
 gi|312202144|gb|ADQ45471.1| protein of unknown function DUF1680 [Caldicellulosiruptor
           kronotskyensis 2002]
          Length = 588

 Score =  275 bits (703), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 178/580 (30%), Positives = 297/580 (51%), Gaps = 43/580 (7%)

Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT----PGKAYGGWENPISELRGHFVG 167
            S  +R  + N  Y+L L  ++L+ +F   + L +    P   +GGWE+P  +LRGHF+G
Sbjct: 18  ESEFYRRFEINRNYMLSLKTENLLQNFYLESGLVSWSFLPQDIHGGWESPTCQLRGHFLG 77

Query: 168 HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWA 227
           H+LSA+A+++A+  +  IK K   ++  L +CQ + G  ++ + P + F+     K VWA
Sbjct: 78  HWLSAAAKIYANFGDEEIKGKADYIINELEKCQRENGGEWVGSIPEKYFEWMARGKYVWA 137

Query: 228 PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
           P+YT+HK   GL+D Y  A N +AL++A     +FY    +    +S E+    L+ ETG
Sbjct: 138 PHYTVHKTFMGLVDMYKYASNQKALEIADKWANWFY----RWSGQFSREKMDDILDYETG 193

Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
           GM ++   LY IT D K+  L   + +      L +  D L+  HANT IP + G+   +
Sbjct: 194 GMLEIWAELYDITKDSKYKDLMERYYRGRLFDRLLMGEDVLTGKHANTTIPEIHGAARVW 253

Query: 348 EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           E+TG+  + K++ +++ + V+    + TGG +  E W   +++ + LG+ N+E C  YNM
Sbjct: 254 EITGEEKFRKIVESYWKEAVDERGYFCTGGQTLGEVWTPKQKIKNYLGTTNQEHCVVYNM 313

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
           ++++  LFRWT +  Y+DY ER + NG+ + QR  + G++ Y LPL  G  K      WG
Sbjct: 314 IRLAEFLFRWTGDKRYSDYIERNIYNGLFAQQR-LKDGMVTYYLPLMPGSQKR-----WG 367

Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDP 526
           T  N FWCC+GT +++ +   D IY++ +    G+ I Q+I SS  WK      + K + 
Sbjct: 368 TPTNDFWCCHGTLVQAHTIYNDLIYYKSQN---GIVISQFIPSSVTWK------DDKGND 418

Query: 527 IVSWDPYLRMTLTFSSKQEVGQLS-----------SLNLRMPVWTYSNGAQASLNGQNLP 575
           I     + R   +F+   E  ++             L +R P W  +   +  +NG +  
Sbjct: 419 ITITQYFERKHGSFAYTAEKDEIYIEIQCKSPVEFELAIRKPWW--AKKVEIEINGNSYY 476

Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
                 ++  T+RW+ N+K+ I    ++ T ++ DD P+     A + GP +LAG     
Sbjct: 477 AADDSPYIQLTQRWN-NEKIKITFYKAVETCSMPDD-PQQV---AFMIGPVVLAGLCERR 531

Query: 636 WDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
             I  G  R +  +I PI       L+  TQ      F +
Sbjct: 532 RKIYIG-ERKIEEIIVPIDKRGYGPLLYTTQGQIEDIFFL 570


>gi|390456441|ref|ZP_10241969.1| hypothetical protein PpeoK3_20683 [Paenibacillus peoriae KCTC 3763]
          Length = 759

 Score =  275 bits (703), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 176/544 (32%), Positives = 284/544 (52%), Gaps = 38/544 (6%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT-PGKAYGGWENP 157
           L ++S   V L+  S+L  AQ   L++LL ++ D ++++FRK A L T    A  GW++ 
Sbjct: 185 LHDISTQKVHLEGPSLLKTAQNRRLQFLLTVNDDQMLYNFRKAAGLDTLNAPAMIGWDSD 244

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQ------NKIGTGYLSAF 211
            S L+GH  GHYLSA A  +AST N  I++K++ ++  L++ Q      ++   G+LSA+
Sbjct: 245 DSLLKGHTTGHYLSALALCYASTGNERIRQKLAYLIDELNKVQLAFEADDRYHYGFLSAY 304

Query: 212 PTELFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
             E FD  E       +WAPYYT+HKI AGLLD Y +A    AL +A  + ++ YNR+  
Sbjct: 305 SEEQFDLLEVYTRYPEIWAPYYTLHKIFAGLLDSYHIAGIELALVIADKVGDWIYNRLS- 363

Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
           V+    +++ W   +  E GG+N+ L  LY+ T    H+  A LFD       +    D 
Sbjct: 364 VLPQEQLKKMWGLYIAGEYGGINESLAELYTYTQKEHHIAAAKLFDNDRLFFPMEQHVDA 423

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L   HAN HIP ++G+   +E TG+  Y  I  FF + V  +H Y+ GGT   E +  P 
Sbjct: 424 LGGMHANQHIPQIVGAFKIFEATGEQKYYDIAKFFWESVVNAHIYSIGGTGEGEMFKQPY 483

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
           ++   L     ETC +YNMLK+++ L+ +  ++ Y DYYER + N +LS       G   
Sbjct: 484 QIGAHLTEHTAETCASYNMLKLTKQLYVYENDVKYMDYYERTMINHILSSTDHECLGAST 543

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y +P   G  K           NS  CC+GTG+E+  K  ++I+FE   +   LY+  ++
Sbjct: 544 YFMPTSSGGQKGYDEE------NS--CCHGTGLENHFKYAEAIFFE---DADSLYVNLFV 592

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRM-TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
            S+ + ++  + + Q V  I + +  + + TLT          ++L +R+P W +     
Sbjct: 593 PSALNDEAKGLQVVQSVPEIFNGEVEIHIETLT---------RTNLRVRIPYW-HQGEVT 642

Query: 567 ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
           A +N   +       +L  +++W+  D++T++    LR E      P+ A I ++ FGPY
Sbjct: 643 AFVNHTKVNTVEENGYLVLSQKWNKGDQVTMKFTPRLRLERT----PDKADIASLAFGPY 698

Query: 627 LLAG 630
           +LA 
Sbjct: 699 ILAA 702


>gi|322433089|ref|YP_004210338.1| hypothetical protein AciX9_4244 [Granulicella tundricola MP5ACTX9]
 gi|321165316|gb|ADW71020.1| protein of unknown function DUF1680 [Granulicella tundricola
           MP5ACTX9]
          Length = 800

 Score =  275 bits (702), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 185/564 (32%), Positives = 275/564 (48%), Gaps = 42/564 (7%)

Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
           L +AQ  + +YLL L  + ++   R+ A L    + YGGW+ P  +L GH  GHYLSA +
Sbjct: 49  LKKAQDLDAQYLLELQPERMLAFLRQRAGLEAKAQGYGGWDGPGRQLTGHIAGHYLSAIS 108

Query: 175 QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF----------------PTELFDS 218
            M+A+T +   KE+    V  L   QN  G GY+ A                   E+   
Sbjct: 109 MMYATTGDVRFKERADEFVAELQTIQNAQGDGYIGALLDAKGVDGKVKFQDLSKGEIKSG 168

Query: 219 FEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
              L  +W+P+Y  HK+ AGL D Y L  +  AL++       F   V+ ++   + ++ 
Sbjct: 169 GFDLDGLWSPWYVEHKLFAGLRDAYHLTGDRTALEVEI----EFAGWVEGILKNLNEDQI 224

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
              L  E GGMN+VL  LY+ T+D + + L+  F+    +  L+   D L+  HANT+IP
Sbjct: 225 QRMLATEFGGMNEVLADLYADTNDTRWMKLSDKFEHHAIVDPLSQGQDILAGKHANTNIP 284

Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
            +IG   RYE TGD        FF D V+  HS+ATGG    E++  P ++ D +     
Sbjct: 285 KMIGELARYEYTGDEKDGKAANFFFDEVSLHHSFATGGDGKNEYFGQPDKMNDMIDGRTA 344

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C  YNM+K++R LF    +  YAD+ ERA  N +L  Q   + G + YM+P+GRGV  
Sbjct: 345 ESCAAYNMIKMARTLFSLDPQARYADFVERADLNAILGGQ-DPDDGRVSYMVPVGRGVQ- 402

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
               H +  KF SF CC G+ +E+ +     IY  E GN   L++ QY  ++ DW S  V
Sbjct: 403 ----HEYQNKFESFTCCVGSQMETHAFHAYGIY-NESGN--KLWVSQYDPTTVDWASQGV 455

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LP 577
            L    D  +     L+MT   S      ++ +L LR P W  S G    +NG  L  + 
Sbjct: 456 KLEMVTDLPMGDTATLKMTSGQS------KVFTLALRRPYWATS-GFAVKVNGVLLKNVS 508

Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWD 637
            P  ++    RW   D + + LP +LR E +    P+  +  AI++GP +LAG    E  
Sbjct: 509 GPDTYIEINRRWKVGDAVEVVLPKTLRKEPL----PDNPNRMAIMWGPLVLAGDLGPEVS 564

Query: 638 -IKTGTARSLSALISPIPPSFNAQ 660
             + G   S SA+    P    A+
Sbjct: 565 RRRNGGEGSASAVPEAAPALITAE 588


>gi|427411824|ref|ZP_18902026.1| hypothetical protein HMPREF9718_04500 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710114|gb|EKU73137.1| hypothetical protein HMPREF9718_04500 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 802

 Score =  275 bits (702), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 178/533 (33%), Positives = 264/533 (49%), Gaps = 44/533 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A + N  YLL L+ D L+ +FRK A L   G  YGGWEN    + GH +GHYL+A A M 
Sbjct: 63  AVEGNRLYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWEN--DTIAGHTLGHYLTALALMH 120

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------------- 221
           A T +A    + + ++  L+ CQ   G GY++ F     D  E                 
Sbjct: 121 AQTGDAECARRAAYIIDELAACQAAAGDGYVAGFTRRRDDVIEDGRLIFPEIMRGDIRSA 180

Query: 222 ---LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
              L   W P+Y  HK+ AGL D      N+QA  +A  +  Y    +  V       + 
Sbjct: 181 GFDLNGCWVPFYNWHKLFAGLFDAETHLGNSQARGVALALAAY----IDGVFAKLDDAQV 236

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
              L+ E GG+N+    L++ T DP+ L LA        L  LA + + L   HANT IP
Sbjct: 237 QQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDPLAQRQNSLPWIHANTQIP 296

Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
            +IG    +E+TG+    +   FF + V   +SY  GG + RE++ DP  ++  +  +  
Sbjct: 297 KLIGLARLHEITGNAADAIAANFFWETVVGQYSYVIGGNADREYFPDPGTISKHITEQTC 356

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +YNMLK++RHL+ W  E    DYYERA  N +L+ Q     G+  YM+PL  G  +
Sbjct: 357 ESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQNPAT-GMFAYMVPLMSGSHR 415

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ-YISSSFDWKSGH 517
                 W   F+ FWCC G+G+ES +K G+SI++E+      + I   YI S  DW +  
Sbjct: 416 V-----WSEPFDDFWCCVGSGMESHAKHGESIWWEDADRPADMLIANLYIPSEADWAARG 470

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
             L  +++    +D ++   L+       G+  +L LR+P W    GA+ ++NG  LP P
Sbjct: 471 AKL--RIETGYPFDGHI--ALSIPKLARAGRF-TLALRIPGW--CQGARIAVNGTPLPAP 523

Query: 578 PPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
              +  +   R W   D++T+ LP++LR EA  DD    A   A+L GP +LA
Sbjct: 524 RIADGYALIGRKWKAGDQVTLDLPMALRVEATPDD----ARTIALLHGPVVLA 572


>gi|375308750|ref|ZP_09774033.1| hypothetical protein WG8_2558 [Paenibacillus sp. Aloe-11]
 gi|375079377|gb|EHS57602.1| hypothetical protein WG8_2558 [Paenibacillus sp. Aloe-11]
          Length = 770

 Score =  275 bits (702), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 183/548 (33%), Positives = 282/548 (51%), Gaps = 44/548 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENP 157
           +KE +   V L++ S    A    L+++  ++ D ++++FR+ A++ T G +   GW+ P
Sbjct: 191 VKEFTGPKVSLERESEFAAAMNRFLQFVRSVNDDQMLYNFREAAAIDTKGAQPMTGWDAP 250

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI------GTGYLSAF 211
              L+GH  GHYLSA A  + +T ++ +  K+  +V  L +CQ  +      G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYHATEDSALLGKIQYMVAELGKCQTALSEQAGYGRGFLSAY 310

Query: 212 PTELFDSFE---ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
             E F+  E       +WAPYYT+HKI+AGLLD Y LA   +AL +   +  + ++R+ +
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALDICDKLGHWLHSRLSR 370

Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            +    + + W   +  E GGMN+ L +LY+IT +  +L+ A  FD       +    D 
Sbjct: 371 -LPREQLHKMWSLYIAGEFGGMNEALAKLYAITGNENYLMTAKYFDNAKLFLPMKENVDT 429

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L + HAN HIP VIG+   +EV GD  Y  I   F  +V  SH Y  GGT   E + +P 
Sbjct: 430 LGNMHANQHIPQVIGALKLFEVAGDKAYFNIAENFWTMVTQSHIYPIGGTGETEMFREPD 489

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP-GVM 446
            +A  L  +  ETC +YNMLK+++ LF++     Y DYYE+AL N +L+ +   +  G  
Sbjct: 490 AIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEGGS 549

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
            Y +PL  G  K   TH          CC+GTG+E+  K  ++IYF +E     LY+  Y
Sbjct: 550 TYFMPLAPGSIKKFDTH-------ENTCCHGTGLENHFKYQEAIYFHDEDR---LYVNLY 599

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL-TFSSKQEVGQLSSLNLRMPVWTYSNGA 565
           I S  DW    + L QK D         R  L T     E G  ++L  R+P W  S   
Sbjct: 600 IPSRLDWSEQGISLMQKRD---------RDGLETVRFYIEGGPETTLMFRIPDWV-SEPV 649

Query: 566 QASLNGQNLP---LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
           Q  +NG  +P   L     +L   + W   D++ + LP SLR      D P+  +++++ 
Sbjct: 650 QVKING--VPCRDLEYEHGYLKLRKVWK-KDEIELTLPCSLRLA----DAPDDHTLKSLT 702

Query: 623 FGPYLLAG 630
           +GPY+LA 
Sbjct: 703 YGPYVLAA 710


>gi|325915124|ref|ZP_08177450.1| hypothetical protein XVE_1336 [Xanthomonas vesicatoria ATCC 35937]
 gi|325538646|gb|EGD10316.1| hypothetical protein XVE_1336 [Xanthomonas vesicatoria ATCC 35937]
          Length = 791

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 188/569 (33%), Positives = 275/569 (48%), Gaps = 53/569 (9%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG  ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGR-MRAVPLAQVRLTPSLFL-DALNTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A    + + +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCATRAAYLVSELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +    NAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKKGKIDSAPFYLNGSWAPLYTWHKLFAGLLDVHAHCGNAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q +    +  +    L+ E GG+N+    L+  T D + L LA     
Sbjct: 222 AVGLAGY----LQGIFAALNDAQLQQVLSCEFGGLNESFVELHVQTDDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              +  L  Q D L H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVIDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWQTVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RHL++W  +  + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAVHFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+E+
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWED 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
                G+++  Y+ S+    +G  +  +   P        R  +T           +L L
Sbjct: 452 G---QGVFVNLYVPSTVRDAAGFALSLRSTLPE-------RGEVTLQIDAAPAAARTLAL 501

Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           R+P W  +   Q  +NGQ   L P   +L     W+  D +++QL + LR E   DD P 
Sbjct: 502 RVPGWAGAFTLQ--VNGQLQTLQPVDGYLRIERVWAAGDTVSLQLGMPLRLEPTSDD-PA 558

Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
           +     ++ GP +LA   G  +  WD  T
Sbjct: 559 WV---VVMRGPLVLAADLGDAATPWDNTT 584


>gi|451851952|gb|EMD65250.1| hypothetical protein COCSADRAFT_141970 [Cochliobolus sativus
           ND90Pr]
          Length = 620

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 189/538 (35%), Positives = 280/538 (52%), Gaps = 41/538 (7%)

Query: 112 SSVLWRAQQT-NLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHY 169
           S+  W+  +   L YL  ++VD L+++FR T  L T G +  GGW+ P    R H  GHY
Sbjct: 45  SNSRWKDNENRTLNYLKFVNVDRLLYNFRATHKLSTNGAQPNGGWDAPNFPFRSHVQGHY 104

Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEALKP 224
           L+A    +A+  ++T K++ +  V  L++CQ   G      GYLS FP   F + EA K 
Sbjct: 105 LTAWVNCYATLRDSTCKDRAAYFVQELAKCQANNGVAGFSPGYLSGFPESEFAALEAGKL 164

Query: 225 VWA--PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
                PYY +HK +AGLLD + +  + +A  +   +  +   R +K+    S  +    L
Sbjct: 165 TGGNVPYYAVHKTMAGLLDAWRIIGDQKARDVLLALAGWVDGRTKKL----STAQMQTML 220

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
             E GGMNDVL  +Y +T + + L +A  FD       LA + D LS  HANT +P  IG
Sbjct: 221 GTEFGGMNDVLAEIYQLTGNKQWLTVAQRFDHAKVFDPLANKQDQLSGNHANTQVPKWIG 280

Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCT 402
           +   Y+ TG   Y  I     D    +H+YA GG S  E +  P ++++ L ++  E C 
Sbjct: 281 AAREYKSTGTKRYLDIARNAWDFTINAHTYAIGGNSQAEHFRPPNQISNFLTNDTAEQCN 340

Query: 403 TYNMLKVSRHLFRWTKE---IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPL----GR 454
           TYNMLK++R L  WT +     Y DYYERAL N +L  Q   +  G + Y  PL     R
Sbjct: 341 TYNMLKLTRDL--WTTDPTSTKYFDYYERALINHLLGAQNAADNHGHITYFTPLRSGGRR 398

Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
           GV  A     W T +NSFWCC GT +E+ +KL DSIYF +      LY+  +  S+ DWK
Sbjct: 399 GVGPAWGGGTWSTDYNSFWCCQGTALETNTKLMDSIYFYDNS---ALYVNLFTPSTLDWK 455

Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
             +V + Q     +     L++T T +         ++ +R+P WT  +GA  SLNGQ  
Sbjct: 456 QRNVKITQVTTFPIGDTTTLKVTGTGN--------WAMKIRIPSWT--SGATISLNGQAS 505

Query: 575 PLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
            +   PG++ + +  W   D +T++LP+ LRT A      + A+I AI +GP +L+G+
Sbjct: 506 GVAANPGSYATLSRNWVSGDTVTVKLPMKLRTVAAN----DNANIAAIAYGPTILSGN 559


>gi|373958137|ref|ZP_09618097.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
           18603]
 gi|373894737|gb|EHQ30634.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
           18603]
          Length = 789

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 194/541 (35%), Positives = 281/541 (51%), Gaps = 43/541 (7%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L DV L +S    +A + +  YLL ++ D L+  FR  + L   GK YGGWE+  S L G
Sbjct: 52  LQDVRLLESP-FKQAMEKDAAYLLSVEPDRLLSGFRSHSGLTPKGKMYGGWES--SGLAG 108

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-- 221
           H +GHYLSA +  +AS+ N    E+++ +V  L ECQ    TGY+ A P E  D+  A  
Sbjct: 109 HTLGHYLSAISMQYASSRNPQFLERVNYIVKELKECQVARKTGYIGAIPKE--DTIWAEI 166

Query: 222 -----------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
                      L   W+P+YT+HK++AGLLD Y+  +NA+AL +   M ++    +Q + 
Sbjct: 167 KKGDIRSRGFDLNGGWSPWYTVHKVMAGLLDAYLYCNNAEALNICKGMGDWTGELLQNL- 225

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
              + E+    L  E GGM + L  LY+IT +  +L  ++ F     L  L+   D L  
Sbjct: 226 ---NDEQIQSMLLCEYGGMAETLVNLYAITGNKAYLATSYKFYDKRILNPLSENKDILPG 282

Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
            H+NT IP VI S  RYE+TG+   + I   F +I+   HSYATGG S  E+  +P +L 
Sbjct: 283 KHSNTQIPKVIASARRYELTGEKKDEDISVNFWNIITKDHSYATGGNSNYEYLSEPDKLN 342

Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
           D L     ETC TYNMLK++RHLF      A  DYYE+AL N +L+ Q   + G+M Y +
Sbjct: 343 DKLTENTTETCNTYNMLKLTRHLFSVNPSAALMDYYEKALYNHILASQNHDD-GMMCYFV 401

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           PL  G  K  S     + F++F CC G+G+E+  K  +SIY+   GN   LY+  +I S 
Sbjct: 402 PLRMGGKKEYS-----SPFDTFTCCVGSGMENHVKYNESIYY--RGNDGSLYVNLFIPSV 454

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
             WK   + L Q+ +   S       T   +S + V    +L +R P W  +      +N
Sbjct: 455 LTWKEKGITLTQQNNFPAS----DVTTFVINSTKPVN--FALKIRKPKW--AGNCLIKVN 506

Query: 571 GQ-NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           G+  +       +L     W  NDK+    P S+ TEAI    P+  + +A+ +GP LLA
Sbjct: 507 GKAGITTTNEQGYLVINRLWKNNDKIEFVTPESIYTEAI----PDNINRKALFYGPVLLA 562

Query: 630 G 630
           G
Sbjct: 563 G 563


>gi|345302361|ref|YP_004824263.1| hypothetical protein Rhom172_0482 [Rhodothermus marinus
           SG0.5JP17-172]
 gi|345111594|gb|AEN72426.1| protein of unknown function DUF1680 [Rhodothermus marinus
           SG0.5JP17-172]
          Length = 641

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 183/528 (34%), Positives = 267/528 (50%), Gaps = 42/528 (7%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A Q ++ YL  LD D L+  FR+ A L      YGGWE+    + GH +GHYLSA +  +
Sbjct: 56  AMQRDVAYLFELDPDRLLSRFRRFAGLEPKAPEYGGWES--QGISGHTLGHYLSALSMYY 113

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFE-ALKPV 225
           A+T +   + ++  +V  L+E Q   G GY+ A P            E++ +   +L   
Sbjct: 114 AATGDEKARARIDYIVSELAEVQRAHGNGYVGAIPEGDRLWAEIARGEIWQAEPFSLNGA 173

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS-LNE 284
           W P+YT+HKI  GL+D Y    N QAL++ T + ++ Y   + +         W   L  
Sbjct: 174 WVPWYTMHKIFQGLIDAYWYGGNEQALEVVTRLADWAYETTKNLTPA-----QWQQMLRT 228

Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
           E GGMN+ L  LYSIT +PKH  L+  F     L  LA     L+  HANT IP VIG  
Sbjct: 229 EHGGMNEALANLYSITGNPKHRELSQKFYHAAVLSPLARGIPNLTGLHANTQIPKVIGVV 288

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
            +YE+ G    + +  FF + V   H+Y  GG S  E +     LA+ LG    ETC TY
Sbjct: 289 RQYELIGSDSLRAVAEFFWEEVVQHHTYVIGGNSQNEHFGPRDSLANRLGEGTAETCNTY 348

Query: 405 NMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH 463
           NML+++RHLF    E + Y D+YERAL N +L+ Q   + G+  Y + L  G  K     
Sbjct: 349 NMLRLTRHLFALHPEKVRYVDFYERALYNHILASQ-DPKHGMFTYYMSLRPGHFKT---- 403

Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
            + T  NSFWCC GTG+E+  K  + IYF    N   LY+  +I S  +W+   + L  +
Sbjct: 404 -YATPENSFWCCVGTGMENHVKYNEFIYFY---NGDTLYVNLFIPSELNWERRALRLRLE 459

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNF 582
                ++    R+ L F    EV Q   + +R P W   +  +  +NG+   +   PG++
Sbjct: 460 ----TAFPESNRVRLDFDP--EVPQRLVVKVRHPSWA-QDALEVRINGEVQSVTSRPGSY 512

Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           L+    W   D++ I LP+ LR E + D+   +    AIL+GP +LAG
Sbjct: 513 LTLARLWQPGDEVEITLPMRLRVETMPDNPDRF----AILYGPIVLAG 556


>gi|386847956|ref|YP_006265969.1| Glucan endo-1,3-beta-glucosidase [Actinoplanes sp. SE50/110]
 gi|359835460|gb|AEV83901.1| Glucan endo-1,3-beta-glucosidase [Actinoplanes sp. SE50/110]
          Length = 765

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 192/580 (33%), Positives = 283/580 (48%), Gaps = 46/580 (7%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFVGHYLSASAQMW 177
           Q   L YL  +D D L+++FR      T G A  GGW+ P    R H  GH+L+A AQ W
Sbjct: 65  QTRTLNYLRFVDADRLLYNFRANHGRSTGGAAANGGWDAPDFPFRTHVQGHFLTAWAQAW 124

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA--LKPVWAPYYTIHKI 235
           A+  + T +++ + +V  L++CQ     GYLS FP   F + EA  L     PYY +HK 
Sbjct: 125 AALGDTTCRDRANYMVAELAKCQ--AANGYLSGFPESDFTALEAGTLSNGNVPYYCVHKT 182

Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYR 295
           LAGLLD + L    QA  +   +  +   R  ++ T     +    L  E GGMN+VL  
Sbjct: 183 LAGLLDVWRLIGGTQARDVLLRLAGWVDTRTARLTT----SQMQAMLGTEFGGMNEVLAD 238

Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY 355
           +Y  T D + L  A  FD       LA  AD L+  HANT +P  +G+   Y+ TG   Y
Sbjct: 239 IYQQTGDGRWLATAQRFDHAAVFTPLAAGADQLNGLHANTQVPKWVGAVREYKATGTTRY 298

Query: 356 KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
           + IG    +I   +H+YA GG S  E +  P  +A  L ++  E C +YNMLK++R L+ 
Sbjct: 299 RDIGLNAWNITTGAHTYAIGGNSQAEHFRAPNAIAGYLTNDTCEHCNSYNMLKLTRELWL 358

Query: 416 WTKE-IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPL----GRGVSKARSTHGWGTKF 469
              +  AY D+YERAL N ++  Q   +  G + Y  PL     RGV  A     W T +
Sbjct: 359 TDPDRAAYFDFYERALLNHLIGAQNPADSHGHITYFTPLRPGGRRGVGPAWGGGTWSTDY 418

Query: 470 NSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVS 529
            SFWCC GTG+E+ +KL +SIYF        L +  +  S   W    + + Q     VS
Sbjct: 419 ASFWCCQGTGVETNTKLMESIYFFSGTT---LTVNLFTPSVLSWAERGITVTQATAYPVS 475

Query: 530 WDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATER 588
               L ++ T S         S+ +R+P WT   GA  ++NG    +   PG + + T  
Sbjct: 476 DTTTLTVSGTPSGTW------SIRVRIPGWT--TGATLAVNGVAQGVGATPGGYATVTRA 527

Query: 589 WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSA 648
           W+  D LT++LP+ +  +   D+     ++QAI +GP +L G+  G          +LSA
Sbjct: 528 WAAGDVLTVRLPMRVIMQPAADN----PAVQAITYGPVVLCGNYGGT---------TLSA 574

Query: 649 LISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP 688
                 PS N   +  T  SG+  F  + +  ++++  FP
Sbjct: 575 -----HPSLNVSSIART-GSGSLAFTATANGATVSLGPFP 608


>gi|255075873|ref|XP_002501611.1| predicted protein [Micromonas sp. RCC299]
 gi|226516875|gb|ACO62869.1| predicted protein [Micromonas sp. RCC299]
          Length = 1214

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 206/694 (29%), Positives = 298/694 (42%), Gaps = 164/694 (23%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYL-LMLDVDSLVWSFRKTASLPT------- 146
           P N L    +H   LD       AQ+ N  YL  ++D   L+ +FR  A LP        
Sbjct: 180 PANVLHGAGVH---LD-------AQRLNARYLTAVVDPRRLLANFRVVAGLPPETIPDRH 229

Query: 147 --------------------PGKAYGGWENPISELRGHFVGHYLSA-------------- 172
                               PG     WE P  ELRGHF GHYLSA              
Sbjct: 230 PTETVAPYCDVGSGLSYAEHPGAC---WEAPDCELRGHFAGHYLSALAFVAAGAGDRPNT 286

Query: 173 -----------SAQMWASTHNATI------KEKMSTVVFSLSECQNKIGT--GYLSAFPT 213
                      S   + + H + +      +E +   V  L+  Q   GT  GY+SAFP 
Sbjct: 287 SPDRTSSSDHLSDPEYVTGHQSDVATARHAREMLDRFVDGLATAQASSGTSAGYVSAFPE 346

Query: 214 ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMY 273
           E+ D   A+   WAPYYT+HKI  GL+D +V+A NA+AL +   +      RV  +I   
Sbjct: 347 EVLDRQGAVGGAWAPYYTLHKIGQGLMDAHVVAGNAKALDVLKGLANAVLTRVMGLIQQR 406

Query: 274 SVERHWY---------SLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
               HW+         +   E+GG N++ +RLY +T +  ++ LA LFD P FLG +   
Sbjct: 407 GAS-HWFGGALEYSKAAFGAESGGFNELAWRLYQLTGNGDYVTLASLFDHPTFLGRMRAG 465

Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
            D L+  HAN H PI +G+  RYE+TGD   +     F++++  + SYATGGT   E W 
Sbjct: 466 GDGLTREHANFHEPIAMGAYSRYEITGDTESRRAFRNFIELLRDTRSYATGGTCDGERWQ 525

Query: 385 DPKRLADTL-GSENEETCTTYNMLKVSRHL---FRWTKEIAYADYYERALTNGVLSIQRG 440
            P RL   +  +E +ETCT  N  +++      F   +   +ADY ERA  +G + +QR 
Sbjct: 526 APGRLERIIVSTETQETCTQVNFERLANAAVASFGEAEARDWADYSERASLHGPVGLQR- 584

Query: 441 TEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY--FEEEGNV 498
            +PG ++Y  PLG GVSK RS HGWG    +FWCCYGTG+E+ ++L D ++   E    V
Sbjct: 585 -KPGELLYTTPLGVGVSKGRSGHGWGRPDAAFWCCYGTGVEALARLQDGVFWRLEAGATV 643

Query: 499 PG-----------LYIIQYISSSF-DWKSGHVVLNQKVDPIVSWDPY------------- 533
           PG           +YI +  +S+   W    V     VDP     P              
Sbjct: 644 PGDDTSSTTATDVVYIARVTTSAVATWDEKGVTTRVSVDPFNVGGPVQREGGRDGRRRRG 703

Query: 534 --------LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPG----- 580
                   + +T+    + E    +S+ +++P W    G++ +LNG+ +     G     
Sbjct: 704 TAGFFASAVAITVHAEGRNEP---TSIRVKLPRWA-GGGSRITLNGERVRCENGGDSSSS 759

Query: 581 -----------------NFLSATERWSYNDKLTIQLPLSLRTEAI--QDDRPEY------ 615
                             +   T  W   D L    P+ +R E +   D  P +      
Sbjct: 760 EDSDSDSDSDSDSDSDSGWCDVTRVWRKTDLLRASFPIVVRAEPLLGSDLTPGFGTGSNQ 819

Query: 616 -----ASIQAILFGPYLLAGHTSGEWDIKTGTAR 644
                 +  AI+ GPY+LA    G W    G  R
Sbjct: 820 RLDGKGARHAIVAGPYVLAALGPGAWIADLGVKR 853


>gi|346725400|ref|YP_004852069.1| hypothetical protein XACM_2511 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346650147|gb|AEO42771.1| hypothetical protein XACM_2511 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 791

 Score =  273 bits (697), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 197/619 (31%), Positives = 292/619 (47%), Gaps = 70/619 (11%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DNAQAL++
Sbjct: 162 DAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q + +     +    L+ E GG+N+    L+  T D + L LA     
Sbjct: 222 AMGLAGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTDDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L+H H+NT+IP +IG    YEVTG+        FF   V   H+Y  
Sbjct: 278 HAVLDPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGNAASGAAARFFWHTVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RHL++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRSGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
                G+Y+  Y+ S     +G   L+  +   +       + +  +  ++     +L L
Sbjct: 452 G---QGVYVNLYVPSMVHDAAG---LDMTLHSALPEQGSASLRIDAAPAEQ----RTLAL 501

Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           R+P W      Q  LNGQ +       +L  T  W   D L++   + LR EA  DD P 
Sbjct: 502 RVPGWAKQPRLQ--LNGQPVDSTVSDGYLRITRTWQRGDTLSLAFDMPLRLEATPDD-PA 558

Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQLVTFTQES 668
           + S   +L GP +LA   G  S  W  KT      + +   + P+P              
Sbjct: 559 WVS---VLRGPLVLAVDLGDASKPWSGKTPALIGGQDILQRLQPVP-------------- 601

Query: 669 GNSTFVMSNSNQSITMEEF 687
           G + FV ++  Q   +  F
Sbjct: 602 GKTAFVYNDGVQQWQLSPF 620


>gi|116182754|ref|XP_001221226.1| hypothetical protein CHGG_02005 [Chaetomium globosum CBS 148.51]
 gi|88186302|gb|EAQ93770.1| hypothetical protein CHGG_02005 [Chaetomium globosum CBS 148.51]
          Length = 797

 Score =  273 bits (697), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 198/607 (32%), Positives = 301/607 (49%), Gaps = 59/607 (9%)

Query: 99  LKEVSL-HDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWEN 156
           L  +SL +  W+D        Q   + YL  +DV+ L+++FR    L T G  A GGW+ 
Sbjct: 34  LSTISLTNSRWMDN-------QNRTVSYLKWVDVNRLLYNFRANHRLSTQGASANGGWDA 86

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAF 211
           P    R H  GHYL+A A  +AS  +   +++ +  V  L++CQ   G      GYLS F
Sbjct: 87  PNFPFRTHAQGHYLTAWAFCYASLRDTECRDRAAYFVAELAKCQKNNGAAGFSAGYLSGF 146

Query: 212 PTELFDSFEA--LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
           P   F + EA  L     PYY IHK +AGLLD +    +  A  +   +  +  +R  K+
Sbjct: 147 PESEFAALEARTLNNGNVPYYAIHKTMAGLLDVWRHLGDTNARDVLLALAGWVDSRTGKL 206

Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
               S ++    L  E GGMNDVL  L+  T D + L +A  FD       LA   D L+
Sbjct: 207 ----SYQQMQSMLGTEFGGMNDVLADLHKQTKDERWLKVAQRFDHAAVFDPLAAGRDQLN 262

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
             HANT +P  IG+ + Y+ TG   Y+ I     ++   +H+YA GG S  E +  P  +
Sbjct: 263 GLHANTQVPKWIGAALEYKATGSTRYRDIAKNAWELTVGAHTYAIGGNSQAEHFRPPNAI 322

Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRW-TKEIAYADYYERALTNGVLSIQR-GTEPGVMI 447
           A  L  +  E C TYNML+++R L+       AY D+YERAL N +L  Q   +  G + 
Sbjct: 323 AGYLQKDTAEACNTYNMLRLTRELWPLDAASTAYFDFYERALLNHLLGQQDPASHHGHVT 382

Query: 448 YMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
           Y  PL     RGV  A     W T ++SFWCC GT +E+ +KL DSIYF +E     L++
Sbjct: 383 YFTPLNPGGRRGVGPAWGGGTWSTDYDSFWCCQGTALETNTKLMDSIYFHDEA---ALFV 439

Query: 504 IQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
             +  S   W + +V + Q  D P          T T +   + G+   L +R+P WT +
Sbjct: 440 NLFTPSVLKWAAQNVTVTQATDFPAGD-------TTTLTIGGQPGESWDLFVRIPSWT-T 491

Query: 563 NGAQASLNGQNLPLP-PPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
           + A+ S+NG+   +   PG +    +R W   DK+T++LP++LRT    D+     ++ A
Sbjct: 492 DQAEISVNGEKANIDTKPGTYAVIQDRAWKAGDKVTVRLPMTLRTVPANDN----PNVAA 547

Query: 621 ILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQ 680
           + +GP +L+G          G+A SLS++     P+ +   V   + SG  TF  +   +
Sbjct: 548 VAYGPVVLSG--------DYGSA-SLSSM-----PTLSLDSVR-REGSGGLTFTATAGGK 592

Query: 681 SITMEEF 687
           ++ ++ F
Sbjct: 593 TVKLKPF 599


>gi|390456178|ref|ZP_10241706.1| hypothetical protein PpeoK3_19346 [Paenibacillus peoriae KCTC 3763]
          Length = 753

 Score =  272 bits (695), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 190/540 (35%), Positives = 275/540 (50%), Gaps = 39/540 (7%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
            LH V +D S  L  A + N  YLL L+ D L+  FR+ A L      Y GWE     + 
Sbjct: 9   DLHKVSID-SGPLCHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--ARGIS 65

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
           GH +GHYLS  + M+AST +  + E+++ V+  L  CQN  G GY+S  P   E+F+  +
Sbjct: 66  GHTLGHYLSGCSLMYASTGDERLLERVNYVIDELEICQNSHGNGYISGIPRGKEIFEEVK 125

Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
           A         L   W P YT+HK+ AGL D Y+L  + +AL M   + ++    ++ V  
Sbjct: 126 AGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLVHHPKALPMEIKLGDW----LEDVFR 181

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
               E+    L+ E GGMN+VL  L   + + + L LA  F     L  LA   D L+  
Sbjct: 182 GLDDEQMQRVLHCEFGGMNEVLTDLAEHSGEERFLKLAERFYHGEVLNDLADSRDTLAGR 241

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           HANT IP +IG+  +YEVTG P Y  +  FF D V   HSY  GG S  E + +P +L D
Sbjct: 242 HANTQIPKIIGAARQYEVTGKPHYADLSRFFWDRVVHKHSYVIGGNSYNEHFGEPGKLND 301

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
            LG    ETC TYNMLK++RH+F W    AYADYYERA+ N +L+ Q+  + G + Y + 
Sbjct: 302 RLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-GRVCYFVS 360

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           L  G  K+     + +++  F CC G+G+ES S  G +IYF     +   Y+ QY+ S+ 
Sbjct: 361 LEMGGHKS-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTPQTI---YVNQYVPSTV 412

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
            W    V L Q+      +    R TL   SK+   Q  ++ LR P W    G    +NG
Sbjct: 413 TWDEMDVQLKQE----TLFPQTGRGTLCVISKKP--QSFTIKLRCPYWA-EQGMIIKING 465

Query: 572 QNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +       P +++     W   D +   +P+++R E +    P+     A ++GP +LAG
Sbjct: 466 EAFAAEACPTSYVVIEREWKDGDTVEYDIPMTVRIEEM----PDNPRRIAFMYGPLVLAG 521


>gi|418517157|ref|ZP_13083324.1| hypothetical protein MOU_10158 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|410706214|gb|EKQ64677.1| hypothetical protein MOU_10158 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
          Length = 791

 Score =  272 bits (695), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 190/569 (33%), Positives = 275/569 (48%), Gaps = 53/569 (9%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +   + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q +       +    L+ E GG+N+    L+  T D + L LA     
Sbjct: 222 AVGLAGY----LQGIFAALDAAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RHL++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAKLFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
                G+Y+  Y+ S+    +G   LN  +   +       + +  +   +     +L L
Sbjct: 452 G---QGVYVNLYVPSTVRDAAG---LNMTLHSALPEQGSASLRIDGAPPAQ----RTLAL 501

Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           R+P WT     Q  LNGQ +       +L  T  W   D L++   + LR E+  DD P 
Sbjct: 502 RVPGWTQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PA 558

Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
           + S   +L GP +LA   G  +  W  KT
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKT 584


>gi|384418897|ref|YP_005628257.1| hypothetical protein XOC_1936 [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353461810|gb|AEQ96089.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 791

 Score =  271 bits (694), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 191/574 (33%), Positives = 279/574 (48%), Gaps = 63/574 (10%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRIRAGYLVSELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DN QAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q + +     +    L+ E GG+N+    L+  T+D + L LA     
Sbjct: 222 AVGLAGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDTASGAAARFFWHTVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RH+++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHVYQWGPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM P+  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSG-----HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL 549
                G+YI  Y+ S+    +G     H  L ++   +      LR+     +++     
Sbjct: 452 G---QGVYINLYVPSTVRDAAGLDMTLHSALPEQGSAL------LRIDAAPPAQR----- 497

Query: 550 SSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +L LR+P W      Q  LNGQ +       +L  T  W   D L++   + LR EA  
Sbjct: 498 -TLALRVPGWAQQPRLQ--LNGQPVDTAASDGYLRITRVWQRGDTLSLSFDMPLRLEATP 554

Query: 610 DDRPEYASIQAILFGPYLLA---GHTSGEWDIKT 640
           DD P + S   +L GP +LA   G  +  W  KT
Sbjct: 555 DD-PAWVS---VLRGPLVLAVDLGDAAKPWSGKT 584


>gi|46113732|ref|XP_383116.1| hypothetical protein FG02940.1 [Gibberella zeae PH-1]
          Length = 1393

 Score =  271 bits (694), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 196/563 (34%), Positives = 272/563 (48%), Gaps = 50/563 (8%)

Query: 89  PGGFDLPGNF-LKEVSLHDV-WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT 146
           P   DL   F L +VSL D  W+D        Q   + YLL +D D L++ FRK   L T
Sbjct: 25  PKVNDLADAFELSDVSLTDSRWMDN-------QGRTVNYLLSIDPDRLLYVFRKNHGLDT 77

Query: 147 PGKAY-GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK--- 202
            G    GGW+ P    R H  GH+L+A +  +A+  N     + S  V  L++CQ K   
Sbjct: 78  KGATKNGGWDAPDFPFRSHVQGHFLTAWSNCYATLGNKECGSRASYFVKELAKCQAKNAK 137

Query: 203 --IGTGYLSAFPTELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
               +GYLS FP       E   L     PYY IHK LAGLLD Y    +  A  +   +
Sbjct: 138 AGFTSGYLSGFPESEIAKVENRTLNNGNVPYYAIHKTLAGLLDVYRRVGDNDAKAVMLSL 197

Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
             +   R  K+    S  +    +  E GGMN+VL  +   T D K L +A  FD     
Sbjct: 198 AGWVDTRTGKL----SYAQMQQMMQTEFGGMNEVLADIAYYTQDNKWLKVAQRFDHAAIF 253

Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
             L    D LS  HANT +P  IG+   Y+V+GD  Y  IG    D+    H+YA GG S
Sbjct: 254 DPLQNNVDKLSGLHANTQVPKWIGALREYKVSGDKKYLDIGRNAWDLTVHKHTYAIGGNS 313

Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSI 437
             E + DP  +A  L S+  E C TYNMLK++R L+     + +Y D+YE AL N +L  
Sbjct: 314 QAEHFRDPDAIAKYLTSDTCEACNTYNMLKLTRELWALDPSDASYFDFYENALMNHLLGQ 373

Query: 438 QRGTEP-GVMIYMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
           Q   +  G + Y  PL     RGV  A     W T +NSFWCC G+GIE+ +KL DSIYF
Sbjct: 374 QNPKDNHGHVTYFTPLNPGGRRGVGPAWGGGTWSTDYNSFWCCQGSGIETNTKLMDSIYF 433

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-- 550
             +     LY+  +  S  +W    V + Q  +               SS  ++G  +  
Sbjct: 434 HTKDT---LYVNLFTPSKLNWSQQQVSIIQTTE----------YPQKDSSTLQIGGKAGT 480

Query: 551 -SLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
            +L +R+P WT  + A   +NGQ++ +   PG +      W+  DK+T+ LP+SLRT A 
Sbjct: 481 WTLAVRIPSWT--SKASIQVNGQSVNVNATPGKYALVKRNWNSGDKVTVTLPMSLRTIAA 538

Query: 609 QDDRPEYASIQAILFGPYLLAGH 631
            D+    + + A+ FGP +LA +
Sbjct: 539 NDN----SQVAAVAFGPVILAAN 557


>gi|393718114|ref|ZP_10338041.1| hypothetical protein SechA1_00115 [Sphingomonas echinoides ATCC
           14820]
          Length = 789

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 186/544 (34%), Positives = 269/544 (49%), Gaps = 51/544 (9%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A + N  YLL L  D L+ +FR  A L   G+ YGGWE+    + GH +GHY+SA   + 
Sbjct: 54  AVEVNRAYLLRLSADRLLHNFRAYAGLKPKGEVYGGWES--DTIAGHTLGHYMSALVLLH 111

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE-----LFDSFEA----------- 221
             T +A  K +   +V  L++ Q   G GY+ A   +     + D+ E            
Sbjct: 112 EQTGDAQAKRRADYIVDELADAQAARGNGYIGAMQRKRKDGTVVDAIEIFPEIIKGDIRS 171

Query: 222 ----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
               L   W+P+YT+HK+ AGLLD +    NA+AL +A     YF    + V       +
Sbjct: 172 GGFDLNGAWSPFYTVHKLFAGLLDIHASWGNAKALSVAIAFAGYF----EPVFAALDDAQ 227

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTH 336
               L  E GG+N+    L++ T D K L +A  L+D+       A Q D L++FHANT 
Sbjct: 228 MQTMLGTEYGGLNESFAELFARTKDRKWLAIAERLYDRKVLDPLTAGQ-DKLANFHANTQ 286

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
           +P +IG    +E+TG+P       FF   V   HSY  GG + RE++ +P  ++  +  +
Sbjct: 287 VPKLIGLARIHELTGEPAKAAAPRFFWQAVTKHHSYVIGGNADREYFSEPDSISRHITEQ 346

Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
             E C TYNMLK++R L+ W  + A  DYYERA  N V++ Q     G   YM PL  G 
Sbjct: 347 TCEHCNTYNMLKLTRQLYSWQPDGALFDYYERAHLNHVMAAQDPKTAG-FTYMTPLLTGA 405

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
            +  ST    +  ++FWCC GTG+ES +K G+SI++E EG    L +  YI +   W++ 
Sbjct: 406 VRGYST----SADDAFWCCVGTGMESHAKHGESIFWEGEG---ALLVNLYIPADATWRAR 458

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
              L   +D    ++P    TLT +     G+  ++ LR+P W  +  A   +NGQ +  
Sbjct: 459 GATLT--LDTRYPFEPT--STLTLTQLARPGRF-AIALRVPGWA-AGKAVVRVNGQPVTP 512

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ-DDRPEYASIQAILFGPYLLA---GHT 632
                +     RW   D + I LPL LR EA   DDR       AIL GP +LA   G T
Sbjct: 513 SFASGYAIVERRWKAGDSVAITLPLELRIEATPGDDR-----TVAILRGPMVLAADLGTT 567

Query: 633 SGEW 636
            G+W
Sbjct: 568 EGDW 571


>gi|375306379|ref|ZP_09771677.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus sp.
           Aloe-11]
 gi|375081632|gb|EHS59842.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus sp.
           Aloe-11]
          Length = 753

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 192/548 (35%), Positives = 278/548 (50%), Gaps = 55/548 (10%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
            LH V +D S  L+ A + N  YLL L+ D L+  FR+ A L      Y GWE     + 
Sbjct: 9   DLHKVSID-SGPLYHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--ARGIS 65

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
           GH +GHYLS  + M+A+T +  + E++S V+  L  CQN  G GY+S  P   E+F+  +
Sbjct: 66  GHTLGHYLSGCSLMYAATGDERLLERVSYVIDELEICQNNHGNGYISGIPRGKEIFEEVK 125

Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFY---- 263
           A         L   W P YT+HK+ AGL D ++LA + +AL    K+  W+ + F     
Sbjct: 126 AGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLAHHPKALPIEIKLGAWLEDVFRGLDD 185

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
            ++Q+V            L+ E GGMN+VL  L   + + + L LA  F     L  LA 
Sbjct: 186 EQMQRV------------LHCEFGGMNEVLTDLAEHSGEERFLKLAERFYHGEVLNDLAD 233

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
             D L+  HANT IP +IG+  +YEVTG P Y  +  FF D V   HSY  GG S  E +
Sbjct: 234 SRDTLAGRHANTQIPKIIGAARQYEVTGKPHYADLSRFFWDRVVHKHSYVIGGNSYNEHF 293

Query: 384 WDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
            +P +L D LG    ETC TYNMLK++RH+F W    AYADYYERA+ N +L+ Q+  + 
Sbjct: 294 GEPGKLNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD- 352

Query: 444 GVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
           G + Y + L  G  K      + +++  F CC G+G+ES S  G +IYF     +   Y+
Sbjct: 353 GRVCYFVSLEMGGHKT-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTPQTI---YV 404

Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
            QY+ S+  W    V L Q+      +    R TL   SK+   Q  ++ LR P W    
Sbjct: 405 NQYVPSTVTWDDMDVQLKQE----TLFPQTGRGTLRVISKKP--QSFTIKLRCPHWA-EQ 457

Query: 564 GAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
           G    +NG+       P +++     W   D +   +P+++R E +    P+     A +
Sbjct: 458 GMIIKINGEAFTAEACPTSYVVIEREWKDGDTVEYDIPMTVRIEEM----PDNPRRIAFM 513

Query: 623 FGPYLLAG 630
           +GP +LAG
Sbjct: 514 YGPLVLAG 521


>gi|367031082|ref|XP_003664824.1| hypothetical protein MYCTH_55581 [Myceliophthora thermophila ATCC
           42464]
 gi|347012095|gb|AEO59579.1| hypothetical protein MYCTH_55581 [Myceliophthora thermophila ATCC
           42464]
          Length = 608

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 194/593 (32%), Positives = 287/593 (48%), Gaps = 54/593 (9%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
           Q   + YL  +DVD L+++FR    L T G +  GGW+ P    R H  GH+L+A +  +
Sbjct: 26  QNRTVTYLKWVDVDRLLYNFRANHGLSTQGARQNGGWDAPDFPFRTHVQGHFLTAWSHCY 85

Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA--LKPVWAPYY 230
           AS  +   +++ +  V  L++CQ        G GYLS FP   FD+ EA  L     PYY
Sbjct: 86  ASLRDDACRDRATYFVAELAKCQANNDAVGFGAGYLSGFPESEFDALEARTLSNGNVPYY 145

Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
            IHK +AGLLD +    +  A  +   +  +  +R  ++    S E+    L  E GGMN
Sbjct: 146 AIHKTMAGLLDVWRHVGDTTARDVLLALAGWVDSRTGRL----SYEQMQAVLGTEFGGMN 201

Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           DVL  L   T DP+ L +A  FD       LA + D L   HANT +P  IG+ + Y+ T
Sbjct: 202 DVLTELSLQTGDPRWLEVAQRFDHAAVFDPLASRQDRLDGLHANTQVPKWIGAVLEYKAT 261

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
           G   Y+ I     +    +HSYA GG S  E + +P  +A  L  +  E C TYNML+++
Sbjct: 262 GTARYRDIAANAWNFTVGAHSYAIGGNSQAEHFHEPDAIAKYLLEDTAEACNTYNMLRLT 321

Query: 411 RHLFRWT-KEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKARSTHG 464
           R L+       AY D+YERAL N +L  Q   +P G + Y  PL     RGV  A     
Sbjct: 322 RELWMLDPASTAYFDFYERALLNHLLGQQNPADPHGHVTYFTPLNPGGRRGVGPAWGGGT 381

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFE------EEGNVPGLYIIQYISSSFDWKSGHV 518
           W T ++SFWCC GT +E+ +KL DSIY+       ++     L++  +  S   W    V
Sbjct: 382 WSTDYDSFWCCQGTALETNTKLMDSIYWHDDDDDADDDGAANLWVNLFTPSVLRWTERGV 441

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
            L Q+       D     T+T +   E      +++R+P WT S GA+  +NG+   +  
Sbjct: 442 TLTQETAFPAGSD-----TITLTVGGEPTGGWDMHVRIPSWTTS-GAEVLVNGEKAGVAA 495

Query: 579 --PGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
             PG ++S   R W   D +T++LP++LRT A  D+      + A+ +GP +L+G     
Sbjct: 496 AVPGTYVSIRGRDWKAGDVVTVRLPMTLRTVAANDN----PGVAALAYGPVVLSG----- 546

Query: 636 WDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNS-TFVMSNSNQSITMEEF 687
            D  + +  SL  L           L +  +  GN   F  +   QS+T+  F
Sbjct: 547 -DYGSASLASLPTL----------DLDSVRRAKGNGLVFTATADGQSVTLGPF 588


>gi|84624616|ref|YP_451988.1| hypothetical protein XOO_2959 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|84368556|dbj|BAE69714.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 791

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 200/627 (31%), Positives = 291/627 (46%), Gaps = 86/627 (13%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DN QAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q + +     +    L+ E GG+N+    L+  T+D + L LA     
Sbjct: 222 AVGLAGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++ H+++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKCLTEQTCEHCASYNMLKLTCHVYQWCPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM P+  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSG-----HVVLNQKVDPIVSWD---PYLRMTLTFSSKQEV 546
                G+YI  Y+ S+    +G     H  L ++    +  D   P  RM          
Sbjct: 452 G---QGVYINLYVPSTVRDAAGLDMTLHSALPEQGSASLRIDAAPPEQRM---------- 498

Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
                L LR+P W      Q  LNGQ +       +L  T  W   D L++   + LR E
Sbjct: 499 -----LALRVPGWAQQPRLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLE 551

Query: 607 AIQDDRPEYASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQ 660
           A  DD P + S   +L GP +LA   G  +  W  KT      + +   + P+P      
Sbjct: 552 ATPDD-PAWVS---VLRGPLVLAVDLGDAAKPWSGKTPALIGGQDILQRLQPVP------ 601

Query: 661 LVTFTQESGNSTFVMSNSNQSITMEEF 687
                   GN+ FV ++  Q   +  F
Sbjct: 602 --------GNTAFVYNDGLQQWQLSPF 620


>gi|330467876|ref|YP_004405619.1| hypothetical protein VAB18032_19585 [Verrucosispora maris
           AB-18-032]
 gi|328810847|gb|AEB45019.1| hypothetical protein VAB18032_19585 [Verrucosispora maris
           AB-18-032]
          Length = 913

 Score =  271 bits (692), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 206/628 (32%), Positives = 296/628 (47%), Gaps = 78/628 (12%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPISELRGHFV 166
           WLD        Q   L YL  +DV+ L+++FR    L T G A  GGWE P    R H  
Sbjct: 63  WLDN-------QNRTLNYLRFVDVNRLLYNFRANHRLSTAGAAALGGWEAPTFPFRTHSQ 115

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
           GH+L+A + MWA   + T ++K + +V  L++CQ          GYL  +P   F + EA
Sbjct: 116 GHFLTAWSHMWAVLGDTTCRDKANYMVAELAKCQANNAAAGFNPGYLCGYPESDFTAVEA 175

Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSV 275
             L     PYYTIHK L GLLD +    N QA    L +A W V++   R+     M ++
Sbjct: 176 RTLNNGNVPYYTIHKTLVGLLDVWRHIGNNQARDVLLALAGW-VDWRTGRLSSA-QMQAM 233

Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
                 L  E GGMN VL  LY  T D + L +A  FD       LA   D L+  HANT
Sbjct: 234 ------LGTEFGGMNAVLTDLYQQTGDARWLTVAQRFDHAAVFNPLAANQDQLNGLHANT 287

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
            IP  IG+   ++ TG   Y+ I +   ++   + +YA GG S  E +  P  ++  L +
Sbjct: 288 QIPKWIGAAREFKATGTTRYRDIASNAWNLTVNTRTYAIGGNSQAEHFRAPNAISGYLRN 347

Query: 396 ENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG 453
           +  E C TYNMLK++R L+      +AY D+YERAL N ++  Q   +  G + Y  PL 
Sbjct: 348 DTCEHCNTYNMLKLTRELWLLDPNRVAYFDFYERALLNHLIGAQNPADNHGHITYFTPLQ 407

Query: 454 ----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
               RGV  A     W T +NSFWCC GTG+E+ + L DSIYF    N   L +  ++ S
Sbjct: 408 PGGRRGVGPAWGGGTWSTDYNSFWCCQGTGLENNTTLMDSIYFH---NGSTLTVNLFMPS 464

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             +W    + + Q      S       T T +    VG   ++ +R+P WT    A  S+
Sbjct: 465 VLNWSQRGITVTQSTSYPAS------DTSTLTVTGTVGGSWTMRIRIPAWTQD--ATVSV 516

Query: 570 NG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           NG  QN+    PG + S T  W+  D +T++LP+ +  E   D+     S+ A+ +GP +
Sbjct: 517 NGTVQNIAT-TPGTYASLTRTWTSGDTVTVRLPMRVVVEPTNDN----PSVVALTYGPAV 571

Query: 628 LAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEF 687
           L+G+             +LSAL     P+     VT T  +   TF  + +N  + +  F
Sbjct: 572 LSGNYGNT---------ALSAL-----PALATASVTRTSSTA-LTFTATANNTQVNLLPF 616

Query: 688 ------------PVSGTDAALHATFRLI 703
                          G+     ATFRL+
Sbjct: 617 YDAHGHNYTVYWSSGGSSGPAQATFRLV 644


>gi|375308065|ref|ZP_09773352.1| hypothetical protein WG8_1877 [Paenibacillus sp. Aloe-11]
 gi|375080396|gb|EHS58617.1| hypothetical protein WG8_1877 [Paenibacillus sp. Aloe-11]
          Length = 759

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 177/544 (32%), Positives = 282/544 (51%), Gaps = 38/544 (6%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT-PGKAYGGWENP 157
           L  +S   V L+  S+L  AQ   L++LL ++ D ++++FRK ASL T    A  GW++ 
Sbjct: 185 LHGISTQKVHLEGPSLLKSAQNRRLQFLLTVNDDQMLYNFRKAASLDTLNAPAMIGWDSD 244

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQ------NKIGTGYLSAF 211
            S L+GH  GHYLSA A  +AST N  I +K++ +V  L++ Q      ++   G+LSA+
Sbjct: 245 ESLLKGHTTGHYLSALALCYASTGNERIHQKLAYLVDELNKVQLAFEADDRYHYGFLSAY 304

Query: 212 PTELFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
             E FD  E       +WAPYYT+HKILAGLLD Y +A    AL +A  + ++ YNR+  
Sbjct: 305 SEEQFDLLEVYTRYPEIWAPYYTLHKILAGLLDSYHIAGIELALAIADKVGDWIYNRLS- 363

Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
           V+    +++ W   +  E GG+N+ L  L++ T    H+  A LFD       +  Q D 
Sbjct: 364 VLPHEQLKKMWGLYIAGEFGGINESLAELFTYTQKEHHIAAAKLFDNDRLFFPMEQQVDA 423

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L   HAN HIP ++G+   +E TG+  Y  I  FF + V  +H Y+ GGT   E +  P 
Sbjct: 424 LGAMHANQHIPQIVGAFKIFEATGEQKYYDIAKFFWESVVNAHIYSIGGTGEGEMFKQPH 483

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
           ++   L     ETC +YN+LK+++ L+ +  +  Y DYYER + N +LS       G   
Sbjct: 484 KIGTHLTEHTAETCASYNLLKLTKQLYVYENDAKYMDYYERTMLNHILSSTDHECLGAST 543

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y +P   G  K           NS  CC+GTG+E+  K  ++I+FE   +V  LY+  ++
Sbjct: 544 YFMPTSPGGQKGYDEE------NS--CCHGTGLENHFKYAEAIFFE---DVDSLYVNLFV 592

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRM-TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
            ++ + +   + + Q V  I + +  + + TLT          ++L +R+P W +     
Sbjct: 593 PAALNDEGKGLQVVQSVPEIFNGEVEIHIETLT---------RTNLRVRIPYW-HQGEIT 642

Query: 567 ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
             +N   +       +L  ++ W+  D++T++    LR E      P+ A I ++ FGPY
Sbjct: 643 TFVNHTKVNTIEENGYLVLSQEWNKGDQVTMKFTPRLRLEHT----PDKADIASLAFGPY 698

Query: 627 LLAG 630
           +LA 
Sbjct: 699 ILAA 702


>gi|58582735|ref|YP_201751.1| hypothetical protein XOO3112 [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|188577523|ref|YP_001914452.1| hypothetical protein PXO_01470 [Xanthomonas oryzae pv. oryzae
           PXO99A]
 gi|58427329|gb|AAW76366.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|188521975|gb|ACD59920.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
           PXO99A]
          Length = 783

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 200/627 (31%), Positives = 291/627 (46%), Gaps = 86/627 (13%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 38  PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 95

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +
Sbjct: 96  E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRK 153

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DN QAL++
Sbjct: 154 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQV 213

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q + +     +    L+ E GG+N+    L+  T+D + L LA     
Sbjct: 214 AVGLAGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 269

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 270 HAVLDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 329

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++ H+++W  +    DYYER L N V
Sbjct: 330 GGNGDREYFQQPDSISKCLTEQTCEHCASYNMLKLTCHVYQWGPQAELFDYYERTLLNHV 389

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM P+  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 390 MA-QQHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 443

Query: 495 EGNVPGLYIIQYISSSFDWKSG-----HVVLNQKVDPIVSWD---PYLRMTLTFSSKQEV 546
                G+YI  Y+ S+    +G     H  L ++    +  D   P  RM          
Sbjct: 444 G---QGVYINLYVPSTVRDAAGLDMTLHSALPEQGSASLRIDAAPPEQRM---------- 490

Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
                L LR+P W      Q  LNGQ +       +L  T  W   D L++   + LR E
Sbjct: 491 -----LALRVPGWAQQPRLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLE 543

Query: 607 AIQDDRPEYASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQ 660
           A  DD P + S   +L GP +LA   G  +  W  KT      + +   + P+P      
Sbjct: 544 ATPDD-PAWVS---VLRGPLVLAVDLGDAAKPWSGKTPALIGGQDILQRLQPVP------ 593

Query: 661 LVTFTQESGNSTFVMSNSNQSITMEEF 687
                   GN+ FV ++  Q   +  F
Sbjct: 594 --------GNTAFVYNDGLQQWQLSPF 612


>gi|342872240|gb|EGU74628.1| hypothetical protein FOXB_14856 [Fusarium oxysporum Fo5176]
          Length = 616

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 197/557 (35%), Positives = 269/557 (48%), Gaps = 59/557 (10%)

Query: 99  LKEVSLHDV-WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWEN 156
           L +VSL D  W+D        Q   L YLL +D D L++ FRK   + T G +  GGW+ 
Sbjct: 34  LTQVSLTDSRWMDN-------QNRTLNYLLSVDPDRLLYVFRKNHGVDTKGAQTNGGWDA 86

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAF 211
           P    R H  GH+LSA  Q +AS        + +  V  L++CQ          GYLS F
Sbjct: 87  PDFPFRSHVQGHFLSAWTQCYASAGVKECGSRATYFVQELAKCQANNAKAGFNKGYLSGF 146

Query: 212 PTELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWM----VEY 261
           P       E   L     PYY IHK LAGLLD Y    +  A    L +A+W+     + 
Sbjct: 147 PESDITKVEDRTLNNGNVPYYAIHKTLAGLLDVYRRLGDQTAKDTMLSLASWVDTRTSKL 206

Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
            YN++Q +            L  E GGMN+VL  +   T D K L +A  FD       L
Sbjct: 207 SYNQMQSM------------LQTEFGGMNEVLADIAFYTKDAKWLKVAQRFDHAVIFDPL 254

Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
               D LS  HANT +P  IG+   Y+V GD  Y  IG    ++V   H+YA GG S  E
Sbjct: 255 QQNVDKLSGLHANTQLPKWIGALREYKVGGDKKYLDIGRNAWNMVVNKHTYAIGGNSQAE 314

Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQR- 439
            +  P  +A  L  +  E C +YNMLK++R L+     + +Y D+YE+AL N +L  Q  
Sbjct: 315 HFRAPDAIAGFLTDDTCEACNSYNMLKLTRELWALNPTDASYFDFYEKALLNHLLGQQDP 374

Query: 440 GTEPGVMIYMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEE 495
            ++ G + Y  PL     RGV  A     W T +NSFWCC GTG+E+ +KL DSIYF   
Sbjct: 375 SSDHGHVTYFTPLKAGGRRGVGPAWGGGTWSTDYNSFWCCQGTGVETNTKLMDSIYFHTS 434

Query: 496 GNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
                LY+  +  S  +W    V + Q  D   S       T TF    +  +  +L +R
Sbjct: 435 DT---LYVNLFTPSKLNWSQKKVSVTQTTDFPES------DTSTFKISGDTSEW-TLAVR 484

Query: 556 MPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           +P WT  + A   +NGQ   +   PG +     +W   D +T+QLP+SL T A  DD+  
Sbjct: 485 IPSWT--SKASIKVNGQAANVAVQPGKYALIKRQWKSGDTVTVQLPMSLHTVAANDDQ-- 540

Query: 615 YASIQAILFGPYLLAGH 631
             ++ AI FGP +LAG+
Sbjct: 541 --TLGAIAFGPVILAGN 555


>gi|386837867|ref|YP_006242925.1| hypothetical protein SHJG_1777 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|374098168|gb|AEY87052.1| hypothetical protein SHJG_1777 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|451791159|gb|AGF61208.1| hypothetical protein SHJGH_1542 [Streptomyces hygroscopicus subsp.
           jinggangensis TL01]
          Length = 769

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 185/556 (33%), Positives = 273/556 (49%), Gaps = 47/556 (8%)

Query: 94  LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYG 152
           LP +F +       WLD        Q     YL  +DVD L+++FR    L T G  A G
Sbjct: 48  LPFDFGQVRLTASRWLDN-------QDRAAAYLRFVDVDRLLYNFRANHRLSTGGASATG 100

Query: 153 GWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGY 207
           GW+ P    R H  GH+L+A AQ++A T +A  ++K   +V  L++CQ        G GY
Sbjct: 101 GWDAPTFPFRSHVQGHFLTAWAQLYAVTGDAVARDKALYMVAELAKCQANNGAAGFGAGY 160

Query: 208 LSAFPTELFDSFEA--LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
           LS +P   F + EA  L+    PYYT+HK ++GLLD +    + QA  +   +  +   R
Sbjct: 161 LSGYPESDFTALEAGTLRNGNVPYYTVHKTMSGLLDVWRHLGSTQARDVLLALAGWVDAR 220

Query: 266 VQKVIT--MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
             ++ T  M +V      L  E GGMN VL  LY  T D + L +A  FD       LA 
Sbjct: 221 TGRLTTAQMQAV------LGTEFGGMNAVLADLYQQTGDARWLTVAQRFDHAAVFDPLAA 274

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
             D L+  HANT +P  IG+   Y+ TG   Y+ I T   +    SH+YA GG S  E +
Sbjct: 275 NQDALAGLHANTQVPKWIGAVRAYKATGITRYRDIATNAWNHCVGSHTYAIGGNSQAEHF 334

Query: 384 WDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTE 442
             P  +A  L  +  E+C + NML ++R LF  T + +A  DYYE+A  N ++  Q   +
Sbjct: 335 RAPNAIAAYLADDTCESCNSVNMLTLTRELFTLTPDRVALFDYYEQAWLNHIIGNQNPAD 394

Query: 443 P-GVMIYMLPL----GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
           P G + Y  PL     RGV  A     W T + +FWCC GTG+E  ++L DS+YF     
Sbjct: 395 PHGHITYFTPLRPGGRRGVGPAWGGGTWSTDYTTFWCCQGTGVEIHTRLMDSVYFHSGTT 454

Query: 498 VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
              L +  ++ S   W    + + Q      S    LR+T       +VG   ++ +R+P
Sbjct: 455 ---LTVNMFVPSVLTWTQRGITVTQTTSYPASDTTTLRVT------GDVGGTWAMRVRIP 505

Query: 558 VWTYSNGAQASLNG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
            WT   GA  S+NG  QN+P    G++ +    W+  D +T++LP+        D+    
Sbjct: 506 GWT--TGASVSVNGVVQNIP-AATGSYATLDRAWASGDTVTVRLPMRTALRPANDN---- 558

Query: 616 ASIQAILFGPYLLAGH 631
            ++ A+ +GP +LAG+
Sbjct: 559 PNVSAVTYGPVVLAGN 574


>gi|381170950|ref|ZP_09880102.1| Tat (twin-arginine translocation) pathway signal sequence domain
           protein [Xanthomonas citri pv. mangiferaeindicae LMG
           941]
 gi|380688673|emb|CCG36589.1| Tat (twin-arginine translocation) pathway signal sequence domain
           protein [Xanthomonas citri pv. mangiferaeindicae LMG
           941]
          Length = 791

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 189/569 (33%), Positives = 277/569 (48%), Gaps = 53/569 (9%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +   + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q + ++    +    L+ E GG+N+    L+  T D + L LA     
Sbjct: 222 AVDLAGY----LQGIFSVLDDTQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L+H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLIAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHAVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RHL++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
                G+Y+  Y+ S+    +G   LN  +   +       + +  +   +     +L L
Sbjct: 452 G---QGVYVNLYVPSTVRDAAG---LNMTLHSALPEQGSASLRIDGAPPAQ----RTLAL 501

Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           R+P W      Q  LNGQ +       +L  T  W   D L++   + LR E+  DD P 
Sbjct: 502 RVPGWAQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PA 558

Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
           + S   +L GP +LA   G  +  W  KT
Sbjct: 559 WVS---VLRGPLVLAADLGDAAKPWSGKT 584


>gi|390993493|ref|ZP_10263643.1| TAT (twin-arginine translocation) pathway signal sequence domain
           protein [Xanthomonas axonopodis pv. punicae str. LMG
           859]
 gi|372551771|emb|CCF70618.1| TAT (twin-arginine translocation) pathway signal sequence domain
           protein [Xanthomonas axonopodis pv. punicae str. LMG
           859]
          Length = 791

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 190/569 (33%), Positives = 274/569 (48%), Gaps = 53/569 (9%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +   + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDTQCRTRTRYLVSELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q V       +    L+ E GG+N+    L+  T D + L LA     
Sbjct: 222 AVALAGY----LQGVFAALEDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RHL++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
                G+Y+  Y+ S+    +G   LN  +   +       + +  +   +     +L L
Sbjct: 452 G---QGVYVNLYVPSTVRDAAG---LNMTLHSALPEQGSASLRIDGAPPAQ----RTLAL 501

Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           R+P W      Q  LNGQ +       +L  T  W   D L++   + LR E+  DD P 
Sbjct: 502 RVPGWAQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PA 558

Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
           + S   +L GP +LA   G  +  W  KT
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKT 584


>gi|418520534|ref|ZP_13086583.1| hypothetical protein WS7_05828 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410703915|gb|EKQ62403.1| hypothetical protein WS7_05828 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 791

 Score =  269 bits (687), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 190/569 (33%), Positives = 274/569 (48%), Gaps = 53/569 (9%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +   + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDTQCRTRTRYLVSELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPSPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q V       +    L+ E GG+N+    L+  T D + L LA     
Sbjct: 222 AVALAGY----LQGVFAALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RHL++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
                G+Y+  Y+ S+    +G   LN  +   +       + +  +   +     +L L
Sbjct: 452 G---QGVYVNLYVPSTVRDAAG---LNMTLHSALPKQGSASLRIDGAPPAQ----RTLAL 501

Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           R+P W      Q  LNGQ +       +L  T  W   D L++   + LR E+  DD P 
Sbjct: 502 RVPGWAQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PA 558

Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
           + S   +L GP +LA   G  +  W  KT
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKT 584


>gi|115399582|ref|XP_001215378.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114192261|gb|EAU33961.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 614

 Score =  269 bits (687), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 196/552 (35%), Positives = 266/552 (48%), Gaps = 49/552 (8%)

Query: 99  LKEVSLHD-VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWEN 156
           L E+SL D  +LD        Q+  L YL  +D + L+ +FR    L T G  A GGW+ 
Sbjct: 31  LSELSLGDGRFLDN-------QERTLSYLKFVDTERLLLNFRANHKLDTKGAVANGGWDA 83

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAF 211
           P    R H  GH+L+A AQ +A   +   +E+ +  V  L++CQ         TGYLS F
Sbjct: 84  PTFPFRTHVQGHFLTAWAQCYAVLGDTDCQERATYFVSELAKCQANNEAAGFKTGYLSGF 143

Query: 212 PTELFDSFEA--LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
           P   FD+ EA  L     PYY IHK LAGLLD + L  +  A  +   +  +   R   +
Sbjct: 144 PESDFDALEAGTLNNGNVPYYNIHKTLAGLLDVWRLVGDTTARDVLLALAGWVDTRTSAL 203

Query: 270 --ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
               M SV      L  E GGMNDVL  LY  T D K L  A  FD       LA   D 
Sbjct: 204 SEAQMQSV------LGTEFGGMNDVLADLYHQTSDEKWLKTAQRFDHAAVFDPLAANEDQ 257

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L+  HANT +P  IG+   Y+ TGD  Y  I      I   +H+YA G  S  E +  P 
Sbjct: 258 LNGLHANTQVPKWIGAVREYKATGDTRYLDIARNAWTITVNAHTYAIGANSQAEHFHAPN 317

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEP-GV 445
            +A  L S+  E C +YNMLK++R L+    E   Y D+YE AL N +L  Q   +  G 
Sbjct: 318 AIAQYLDSDTAEACNSYNMLKLTRELWTLDPENTTYFDFYENALLNHLLGQQNPADSHGH 377

Query: 446 MIYMLPL----GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
           + Y   L     RGV  A     W T ++SFWCC GT +E+ +KL DSI+F  +     L
Sbjct: 378 ITYFTSLNPGGNRGVGPAWGGGTWSTDYDSFWCCQGTALETNTKLMDSIFFHSDS---AL 434

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
           Y+ Q+I S   W    V + Q     VS      +TL      +      L +R+P WT 
Sbjct: 435 YVNQFIPSVLTWSEKGVKVTQSTTFPVS----DTITLDIDGNGDW----ELYVRIPSWT- 485

Query: 562 SNGAQASLNGQNLP--LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
            + A  ++NG+ +      PG++      W+  DK+ IQLP+ LRT    DD     S+ 
Sbjct: 486 -SNAAITINGEQVTDVDVSPGSYAKIARTWASGDKVQIQLPMHLRTVPANDD----PSLM 540

Query: 620 AILFGPYLLAGH 631
           AI +GP +L+G+
Sbjct: 541 AIAYGPVILSGN 552


>gi|308067040|ref|YP_003868645.1| hypothetical protein PPE_00225 [Paenibacillus polymyxa E681]
 gi|305856319|gb|ADM68107.1| DUF1680 domain containing protein [Paenibacillus polymyxa E681]
          Length = 752

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 201/598 (33%), Positives = 301/598 (50%), Gaps = 58/598 (9%)

Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
           K   LH V +D S  L  A + N  YLL L+ D L+  FR+ A L      Y GWE    
Sbjct: 4   KAFDLHKVRID-SGPLLHAMELNTAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--AR 60

Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFD 217
            + GH +GHYLS  A M+AST +  + E+++ VV  L  CQN  G GY+S  P   E+F+
Sbjct: 61  GISGHTLGHYLSGCALMFASTGDERLLERVNYVVDELEICQNSHGNGYISGIPRGKEIFE 120

Query: 218 SFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFY- 263
             +A         L   W P YT+HK+ AGL D ++ A + +AL    K+  W+ +    
Sbjct: 121 EVKAGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLPAHHPKALSIEIKLGNWLEDVLQG 180

Query: 264 ---NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF 320
              ++VQ+V            L+ E GGMN+VL  L   + + + L LA  F     L  
Sbjct: 181 LDDDQVQQV------------LHCEFGGMNEVLTDLAEHSGEERFLSLAERFYHGEVLND 228

Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR 380
           LA   D L+  HANT IP +IG+  ++E+TG P Y  +  FF D V   HSY  GG S  
Sbjct: 229 LADSQDTLAGRHANTQIPKIIGAARQFEMTGKPQYADLSRFFWDRVVHKHSYVIGGNSYN 288

Query: 381 EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRG 440
           E + +P +L D LG    ETC TYNMLK++RH+F W    AYADYYERA+ N +L+ Q+ 
Sbjct: 289 EHFGEPGKLNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQP 348

Query: 441 TEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
            + G + Y + L  G  K+     + +++  F CC G+G+ES S  G +IYF     +  
Sbjct: 349 VD-GRVCYFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTPETI-- 400

Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
            Y+ QY+ S+  W    V L Q  D +   +   R TL   SK+   +  ++ LR P W 
Sbjct: 401 -YVNQYVPSTVTWDEMGVQLKQ--DTLFPQNG--RGTLRVISKEP--KSFAIKLRCPHWA 453

Query: 561 YSNGAQASLNGQN-LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
              G    +NG+  +    P +++     WS  D +   +P+++R E +    P+     
Sbjct: 454 -EQGMMIKINGEKYVTEACPTSYVVMEREWSNGDTIEYDIPMTVRVEEM----PDNPRRV 508

Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN 677
           A ++GP +LAG   G  + ++     L++++     S   +L+    E   +TF M++
Sbjct: 509 AFMYGPLVLAGDL-GPVEQESNEEHLLASVLIGSADSLTTKLIADGNEP--NTFHMTD 563


>gi|373954098|ref|ZP_09614058.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
           18603]
 gi|373890698|gb|EHQ26595.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
           18603]
          Length = 787

 Score =  268 bits (685), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 190/550 (34%), Positives = 282/550 (51%), Gaps = 59/550 (10%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
           +L DV L  +S   +A + +  YLL ++ D L+  FR  + L   GK Y GWE+  S L 
Sbjct: 49  NLKDVKL-LNSPFKQAMEVDAAYLLSIEPDRLLSGFRAHSGLKPKGKMYEGWES--SGLA 105

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA- 221
           GH +GHYLSA +  +A+T +    ++++ +V  L ECQ    TGY+ A P E  D+  A 
Sbjct: 106 GHTLGHYLSAISMHYAATRDPEFLKRVNYIVKELGECQVARKTGYVGAIPKE--DTVWAE 163

Query: 222 ------------LKPVWAPYYTIHKILAGLLDQYVLADNAQALK----MATWMVEYFYN- 264
                       L   W+P+YT+HK++AGLLD ++  ++ QAL     MA W  E   N 
Sbjct: 164 VAKGDIRSRGFDLNGGWSPWYTVHKVMAGLLDAFLYCNSTQALHVCKGMADWTGETLKNL 223

Query: 265 ---RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
              ++QK++              E GGM + L  LY+I  + K+L L++ F     L  L
Sbjct: 224 DDEKLQKMLLC------------EYGGMAETLVNLYAINGNKKYLDLSYKFYDKRILDPL 271

Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
           A Q D L   H+NT IP +I S  RYE+ GD   K I  FF + +  +HSYATGG S  E
Sbjct: 272 ANQQDILPGKHSNTQIPKIIASARRYELNGDKKDKAIAEFFWETIVNNHSYATGGNSNYE 331

Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
           +  +P +L D L     ETC TYNMLK++RHLF         DYYE+AL N +L+ Q   
Sbjct: 332 YLSEPNKLNDKLTENTTETCNTYNMLKLTRHLFALEPSAKLMDYYEKALYNHILASQ-NH 390

Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
           E G+M Y +PL  G  K  S     + F++F CC G+G+E+  K  +SIYF   G    L
Sbjct: 391 ETGMMCYFVPLRMGGKKEYS-----SPFDTFTCCVGSGMENHVKYNESIYF--RGADGSL 443

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
           Y+  +I S  +WK   + + Q+ +   S         T +         ++ +R P W  
Sbjct: 444 YVNLFIPSVLNWKEKGLSITQESNLPQS------DKTTLTVTTLKPVAMAIRVRKPKW-- 495

Query: 562 SNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
           ++     +NG+   +      +L    +W  NDK+   +P ++ TEA+    P+ A+ +A
Sbjct: 496 ADNTTVGVNGKKQQVTADAQGYLVINRKWKNNDKIEFIMPENIHTEAM----PDNANRRA 551

Query: 621 ILFGPYLLAG 630
           + +GP LLAG
Sbjct: 552 VFYGPVLLAG 561


>gi|336319285|ref|YP_004599253.1| hypothetical protein Celgi_0157 [[Cellvibrio] gilvus ATCC 13127]
 gi|336102866|gb|AEI10685.1| protein of unknown function DUF1680 [[Cellvibrio] gilvus ATCC
           13127]
          Length = 1577

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 194/579 (33%), Positives = 284/579 (49%), Gaps = 66/579 (11%)

Query: 93  DLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWS-FRKTASLPTPGKAY 151
           DL  + L++  L D++L   + L  A     EYLL L  +  ++  +R     PT    Y
Sbjct: 362 DLTEHALQDSGLEDLYL-TDAYLTNAAAKEHEYLLSLSSEKFLYEWYRNVGLTPTTTSGY 420

Query: 152 GGWE-NPISELRGHFVGHYLSASAQMWASTHNAT----IKEKMSTVVFSLSECQNKIGT- 205
           GGWE + ++  RGH  GHY+SA +Q +++T +AT    + E++   V  L+  Q+     
Sbjct: 421 GGWERSDVTNFRGHAFGHYMSALSQSYSATADATTKAALLEQVEDAVAGLTLVQDTYAAA 480

Query: 206 -----GYLSAFPTELFDSFEAL----KPVWAPYYTIHKILAGLLD--QYVL-ADNAQALK 253
                GY+SAFP    D+ +        V  P+Y +HK+LAGLLD   YV  A  AQAL 
Sbjct: 481 HPASAGYVSAFPESALDAVDGTGTTTDKVLVPWYNLHKVLAGLLDIHDYVGGATGAQALD 540

Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
           +A+   EY Y R+ ++     + R  Y      GGMND LYRLY +T DP     A  FD
Sbjct: 541 IASQFGEYTYQRISRLTDRTRMLRTEY------GGMNDALYRLYDLTDDPHVKTAAEAFD 594

Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEV-TGD---------------PLYKL 357
           +      LA   D L+  HANT IP +IG+  RY V T D               P Y  
Sbjct: 595 ETALFTQLAAGQDVLNGKHANTTIPKLIGALKRYTVFTSDADRLASLTEAERAQLPTYLA 654

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRL-------ADTLGSENEETCTTYNMLKVS 410
               F  I    H+YATG  S  E + DP  L        +T  ++  ETC  YNMLK+S
Sbjct: 655 AAEEFWQITVDHHTYATGSNSQSEHFHDPDSLHEFATQQGETGNAQTSETCNEYNMLKLS 714

Query: 411 RHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFN 470
           R LF+ TK++ YA YYE    N VL+ Q   + G+  Y  P+  G  +      +   + 
Sbjct: 715 RELFKLTKDVKYAHYYENTFINTVLASQN-PDTGMTTYFQPMAAGYDRI-----YSMPYT 768

Query: 471 SFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSW 530
            FWCC GTG+ESFSKLGDS+YF +  +V   Y+  + SS FD+   ++ L Q+ D  +  
Sbjct: 769 EFWCCTGTGMESFSKLGDSMYFTDRRSV---YVTMFFSSRFDYAEQNLRLTQEAD--LPS 823

Query: 531 DPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWS 590
           D  +   +      +V   ++L LR+P W     A  ++NG+ +  P         E  +
Sbjct: 824 DDTVTFRVAAIDGDQVADGTTLRLRVPQW-IDGAATLTVNGEAV-TPQVVRGFVVLEGVA 881

Query: 591 YNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
             D +T ++P+ ++  A  D+ P +A   A  +GP +L+
Sbjct: 882 AGDVITYRMPMKVQAHAAPDN-PTWA---AFSYGPVVLS 916


>gi|429858822|gb|ELA33628.1| secreted protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 623

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 187/542 (34%), Positives = 273/542 (50%), Gaps = 52/542 (9%)

Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTP-GKAYGGWENPISELRGHFVGHYL 170
           S  L+  Q   L YL  +DV+ L+++FRK   L T   +A GGW+ P    R HF GH+L
Sbjct: 51  SGRLFDNQARTLTYLKWVDVERLLYNFRKNHGLSTNNAQANGGWDAPDFPFRTHFQGHFL 110

Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE--ALK 223
           +A A  +A  H+   K++ +     L +CQ         TGYLS FP     + E  +L 
Sbjct: 111 NAWAFCYAQLHDTECKDRATYFAAELKKCQANNANVGFNTGYLSGFPESEITAVEDRSLS 170

Query: 224 PVWAPYYTIHKILAGLLD--QYVLADNAQA--LKMATWMV----EYFYNRVQKVITMYSV 275
               PYY IHK +AGLLD  +++   NA+   L+MA W+     +  Y ++Q +++    
Sbjct: 171 NGNVPYYAIHKTMAGLLDVWRHIGDTNARDVLLEMAAWVDLRTGKLTYAQMQNMMST--- 227

Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
                    E GGMN+V+  ++  T D + L +A  FD       LA   D L+  HANT
Sbjct: 228 ---------EFGGMNEVMADIFHQTGDQRWLTVAQRFDHAAIFDPLASNQDSLNGLHANT 278

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
            +P  IG+   Y+ TG   Y+ I     +I  ++HSYA GG S  E +  P  +A  L S
Sbjct: 279 QVPKWIGASREYKATGTSRYQDIARNAWNITVSAHSYAIGGNSQAEHFRLPNAIAGFLNS 338

Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEP-GVMIYMLPLG 453
           +  E C TYNMLK++R L+        Y D+YERAL N +L  Q  ++  G + Y  PL 
Sbjct: 339 DTCEACNTYNMLKLTRELWLTNPSATHYFDFYERALLNHLLGQQDPSDSHGHITYFTPLN 398

Query: 454 ----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
               RGV  A     W T ++SFWCC GTG+E+ +KL DSIYF +      LY+  ++ S
Sbjct: 399 PGGRRGVGPAWGGGTWSTDYDSFWCCQGTGLETNTKLMDSIYFYDNS---ALYVNLFVPS 455

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
              W    V + Q  D       + R   T       GQ  +L +R+P WT  +GAQ ++
Sbjct: 456 VLRWTQRGVTVTQTTD-------FPRGDTTTLKVSGSGQW-TLRVRIPSWT--SGAQVTV 505

Query: 570 NGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           NGQ +     G + +    W+  D + + LP+ L+T A  D+     SI A+ FGP +L+
Sbjct: 506 NGQAV-TATSGAYAAIDRTWADGDTVVVTLPMKLQTIAANDN----PSIAALAFGPVILS 560

Query: 630 GH 631
           G+
Sbjct: 561 GN 562


>gi|290955577|ref|YP_003486759.1| hypothetical protein SCAB_10131 [Streptomyces scabiei 87.22]
 gi|260645103|emb|CBG68189.1| putative secreted protein [Streptomyces scabiei 87.22]
          Length = 786

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 196/585 (33%), Positives = 285/585 (48%), Gaps = 59/585 (10%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPISELRGHFV 166
           WLD        Q     YL  +DVD L+++FR T  L T G    GGW+ P    R H  
Sbjct: 81  WLDN-------QNRTQNYLRFIDVDRLLYNFRATHKLSTNGATPNGGWDAPNFGFRTHIQ 133

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
           GH+L+A AQ++A T + T ++K + +V  L++CQ         TGYLS +P   F + E 
Sbjct: 134 GHFLTAWAQLYAVTGDTTCRDKATRMVAELAKCQANNSAAGFNTGYLSGYPESNFTALEQ 193

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRV--QKVITMYSV 275
                  YYTIHK L GLLD + L  + QA    L +A W V++   R+  Q++ TM  +
Sbjct: 194 GTSGEVLYYTIHKTLTGLLDVWRLIGSTQARDVLLALAGW-VDWRTGRLTGQQMQTMLRI 252

Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
           E          GGMN VL  LY  T D + L +A  FD       LA   D L+  HANT
Sbjct: 253 E---------FGGMNTVLTDLYQQTGDARWLTVAQRFDHAAVFDPLAANQDKLNGLHANT 303

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
            +P  IG+   Y+ TG   Y+ I T   +I  A+H+YA GG S  E +  P  +A  L +
Sbjct: 304 QVPKWIGAAREYKATGTTRYRDIATNAWNITVAAHTYAIGGNSQAEHFRAPNAIAGFLNN 363

Query: 396 ENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLG 453
           +  E+C T NML ++R L+    + +   DYYERA  N ++  Q    + G + Y  PL 
Sbjct: 364 DTCESCNTVNMLTLTRELYTLDPDRVELFDYYERAWLNQMIGQQNPADDHGHVTYFTPLK 423

Query: 454 ----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
               RGV  A     W T + SFWCC GTG+E  ++L DSIYF    N   L +  ++ S
Sbjct: 424 PGGRRGVGPALGGGTWSTDYGSFWCCQGTGLEMHTRLMDSIYFH---NDTTLTVNMFVPS 480

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
              W    + + Q      S    L++T + S         ++ +R+P WT   GA  S+
Sbjct: 481 VLTWTERGITVTQTTTYPTSDTTTLQVTGSVSGTW------AMRIRIPGWT--TGAAVSV 532

Query: 570 NG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           NG  QN+    PG++ +    W+  D +T++LP+ +      D+    A++ AI +GP +
Sbjct: 533 NGVAQNIT-TTPGSYATLNRSWTSGDTVTVRLPMRIGIRPANDN----ANVAAITYGPVV 587

Query: 628 LAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNST 672
           L+G      +    T  SL AL +      ++  + FT  S  ST
Sbjct: 588 LSG------NYGDSTLSSLPALTTSSIKRTSSSSLAFTATSSGST 626


>gi|268316049|ref|YP_003289768.1| hypothetical protein Rmar_0478 [Rhodothermus marinus DSM 4252]
 gi|262333583|gb|ACY47380.1| protein of unknown function DUF1680 [Rhodothermus marinus DSM 4252]
          Length = 641

 Score =  267 bits (683), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 180/528 (34%), Positives = 266/528 (50%), Gaps = 42/528 (7%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A Q ++ YL  LD D L+  FR+ A L      YGGWE+    + GH +GHYLSA +  +
Sbjct: 56  AMQRDVAYLFELDPDRLLSRFRRFAGLEPKAPEYGGWES--QGISGHTLGHYLSALSMYY 113

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFE-ALKPV 225
           A+T +   + ++  +V  L+E Q   G GY+ A P            E++ +   +L   
Sbjct: 114 AATGDEKARARIDYIVSELAEVQRAHGNGYVGAIPEGDRLWAEIARGEIWQAEPFSLNGA 173

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS-LNE 284
           W P+YT+HKI  GL+D Y    + QAL++ T + ++ Y   + +         W   L  
Sbjct: 174 WVPWYTMHKIFQGLIDAYWYGGSEQALEVVTRLADWAYETTKNLTPA-----QWQQMLRT 228

Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
           E GGMN+ L  LYSIT +PKH  L+  F     L  L+     L+  HANT IP VIG  
Sbjct: 229 EHGGMNEALANLYSITGNPKHRELSEKFYHAAVLSPLSRGIPNLTGLHANTQIPKVIGVV 288

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
            +YE+ G    + +  FF + V   H+Y  GG S  E +     LA+ LG    ETC TY
Sbjct: 289 RQYELIGSDSLRAVAEFFWEEVVQHHTYVIGGNSQNEHFGPRDSLANRLGEGTAETCNTY 348

Query: 405 NMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH 463
           NML+++RHLF    E + Y D+YERAL N +L+ Q   + G+  Y + L  G  K     
Sbjct: 349 NMLRLTRHLFALHPEKVRYVDFYERALYNHILASQ-DPKRGMFTYYMSLRPGHFKT---- 403

Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
            + T  +SFWCC GTG+E+  K  + IYF    N   LY+  +I S  +W+   + L  +
Sbjct: 404 -YATPEHSFWCCVGTGMENHVKYNEFIYFY---NGDTLYVNLFIPSELNWERRALRLRLE 459

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNF 582
                ++    R+ L F    EV Q   + +R P W   +     +NG+   +   PG++
Sbjct: 460 ----TAFPESNRVRLDFDP--EVPQRLVVKVRHPSWA-QDALDVRINGEVQSVTSRPGSY 512

Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           L+    W   D++ I LP+ LR E + D+   +    AIL+GP +LAG
Sbjct: 513 LTLARVWQPGDEVEITLPMRLRVETMPDNPDRF----AILYGPIVLAG 556


>gi|402080566|gb|EJT75711.1| hypothetical protein GGTG_05643 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 640

 Score =  267 bits (683), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 186/536 (34%), Positives = 263/536 (49%), Gaps = 47/536 (8%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
           Q   L Y+  ++VD L+++FR    + T G ++  GW+ P    R HF GH+L+A AQ +
Sbjct: 67  QDRALTYIKSVNVDRLLYNFRANHRVSTNGAQSNKGWDAPDFPFRTHFQGHFLTAWAQCY 126

Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE--ALKPVWAPYY 230
           A+  +AT ++  +  V  L++CQN         GYLS FP    D  E   L     PYY
Sbjct: 127 ATLGDATCRDHANYFVAELAKCQNNNAAAGFKAGYLSGFPESEIDKVEQRTLSNGNVPYY 186

Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            IHK +AGLLD + +  + QA    L+MA W        V       S ++    L  E 
Sbjct: 187 AIHKTMAGLLDVWRVMGSTQARDVLLRMAGW--------VDTRTAALSYQQMQNMLGTEF 238

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGMN+VL  ++  T D + +  A  FD       LA   D LS  HANT +P  IG+   
Sbjct: 239 GGMNEVLADVFHQTGDARWIKTARRFDHAAVFDPLAQGQDRLSGLHANTQVPKWIGAARE 298

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           Y+ T +  Y+ +     +   A+H+YA GG S  E +  P  +A  L  +  E C +YNM
Sbjct: 299 YKATKEERYRTVARAAWNFTVAAHTYAIGGNSQSEHFRSPNAIAGYLAKDTAEACNSYNM 358

Query: 407 LKVSRHLFRWTKE---IAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLG----RGVSK 458
           LK++R L  W  +    AY D+YERAL N +L  Q   +  G + Y  PL     RGV  
Sbjct: 359 LKLTREL--WLADPSAAAYFDFYERALLNHMLGQQDPRSAHGHVTYFTPLNPGGRRGVGP 416

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KSG 516
           A     + T ++SFWCC GTGIE+ +KL DSIYF    +   LY+  +ISSS  W  K G
Sbjct: 417 AWGGGTYSTDYDSFWCCQGTGIETNTKLMDSIYFRGRDDAT-LYVNLFISSSVKWTQKGG 475

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP- 575
            VV      P          T T       G   +L +R+P W  +  A  ++NGQ +  
Sbjct: 476 VVVTQTTTFPKSD-------TTTLDVSGAGGGRWTLAVRVPSWV-AGQAVITVNGQAVQG 527

Query: 576 -LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
               PG + S T  W   DK+ ++LP+ L T A  DD      + A+ +GP +L+G
Sbjct: 528 VSTAPGTYASITRDWQAGDKVVVRLPMRLYTIAANDD----MGLVAVAYGPAVLSG 579


>gi|21243263|ref|NP_642845.1| hypothetical protein XAC2530 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21108798|gb|AAM37381.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 791

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 189/569 (33%), Positives = 274/569 (48%), Gaps = 53/569 (9%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L  S  L  A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +   + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   +NAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCENAQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q V       +    L+ E GG+N+    L+  T D + L LA     
Sbjct: 222 AVALAGY----LQGVFAALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RHL++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
                G+Y+  Y+ S+    +G   LN  +   +       + +  +   +     +L L
Sbjct: 452 G---QGVYVNLYVPSTVRDAAG---LNMTLHSALPEQGSASLRIDGAPPAQ----RTLAL 501

Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           R+P W      Q  LNGQ +       +L  T  W   D L++   + LR E+  DD P 
Sbjct: 502 RVPGWAQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PA 558

Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
           + S   +L GP +LA   G  +  W  KT
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKT 584


>gi|290954983|ref|YP_003486165.1| hypothetical protein SCAB_3871 [Streptomyces scabiei 87.22]
 gi|260644509|emb|CBG67594.1| putative secreted protein [Streptomyces scabiei 87.22]
          Length = 768

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 202/601 (33%), Positives = 289/601 (48%), Gaps = 68/601 (11%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
           WLD        Q     YL  +DVD L+++FR    L T G A  GGW+ P    R H  
Sbjct: 62  WLDN-------QDRTRNYLRFVDVDRLLYNFRANHRLSTAGAAATGGWDAPTFPFRTHVQ 114

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-----TGYLSAFPTELFDSFE- 220
           GH+L+A AQ++A T + T ++K + +V  L++CQ   G     TGYLS +P   F + E 
Sbjct: 115 GHFLTAWAQLYAVTGDTTCRDKATRMVAELAKCQANNGAAGFNTGYLSGYPESDFTALEQ 174

Query: 221 -ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRV--QKVITMY 273
             L     PYYTIHK LAGLLD +    + QA    L +A W V++   R+  Q++  M 
Sbjct: 175 RTLSNGNVPYYTIHKTLAGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRLTGQQMQAM- 232

Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
                   L  E GGMN VL  LY  T D + L  A  FD       LA   D LS  HA
Sbjct: 233 --------LQTEFGGMNAVLTDLYQQTGDARWLTAARRFDHAAVFDPLASNQDRLSGLHA 284

Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
           NT +P  IG+   Y+ TG   Y+ I T    I  A+H+YA GG S  E +  P  +A  L
Sbjct: 285 NTQVPKWIGAAREYKATGTTRYRDIATNAWSITVAAHTYAIGGNSQAEHFRAPNAIAGFL 344

Query: 394 GSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLP 451
             +  E+C T+NML ++R LF       A  DYYERA  N ++  Q    + G + Y  P
Sbjct: 345 NQDTCESCNTFNMLVLTRELFALDPNRAALFDYYERAWLNQMIGQQNPADDHGHVTYFTP 404

Query: 452 L----GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           L     RGV  A     W T + +FWCC GTG+E  ++L DS+Y+  +     L +  ++
Sbjct: 405 LRPGGRRGVGPAWGGGTWSTDYGTFWCCQGTGLEMHTRLMDSVYYRSDTT---LIVNMFV 461

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S   W    + + Q  D        LR+T        VG   ++ LR+P WT  +GA  
Sbjct: 462 PSVLTWSERGITVTQTTDYPAGDTTTLRVT------GSVGGTWAMRLRIPGWT--SGATI 513

Query: 568 SLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
           S+NG    +   PG++ + T  W+  D +T++LP+ +    +     + A+I AI +GP 
Sbjct: 514 SVNGTAQDIATTPGSYATLTRSWTSGDTVTVRLPMRI----VMRAANDNANIAAITYGPV 569

Query: 627 LLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEE 686
           +L+G                SAL S  PPS     +T T  +G+  F  + +  ++ +  
Sbjct: 570 VLSGDYGD------------SALGS--PPSLKTSSITRT-STGSLAFTATANGSTVGLGP 614

Query: 687 F 687
           F
Sbjct: 615 F 615


>gi|407923357|gb|EKG16430.1| Six-hairpin glycosidase-like protein [Macrophomina phaseolina MS6]
          Length = 612

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 178/531 (33%), Positives = 269/531 (50%), Gaps = 39/531 (7%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFVGHYLSASAQMW 177
           Q+    YL  +D+D L++++R T  L T G A  GGW+ P    R H  GH+L+A  Q W
Sbjct: 43  QERTRTYLKFVDLDRLLYNYRATHGLSTNGAASNGGWDAPDFPFRSHAQGHFLTAWVQCW 102

Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA--LKPVWAPYY 230
           ++T +   +++       L +CQ          GYLS FP   FD+ E   L     PYY
Sbjct: 103 STTGDTECRDRAVQFTAELLKCQENNEAAGFTAGYLSGFPESEFDALEGRTLSNGNVPYY 162

Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
            +HK++AGLLD +    +  A  +   +  +   R +  I+   ++R    L  E GGM+
Sbjct: 163 VVHKLMAGLLDVWRGIGDLTARDVLLALAGWVDARTEN-ISYGDMQR---ILQTEFGGMS 218

Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           +VL  +Y  + D + L +A  F+    L  LA   D L+  HANT +P  IG+   Y+ T
Sbjct: 219 EVLADIYYQSGDSRWLTVAQRFEHAAVLTPLANNRDQLNGLHANTQVPKWIGAAREYKAT 278

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
           G+  Y  I     DI   +H+YA GG S  E +  P  +A  L ++  E+C +YNMLK++
Sbjct: 279 GNTTYYDIARNAWDITVRAHTYAIGGNSQAEHFRPPNAIAGYLTADTAESCNSYNMLKLT 338

Query: 411 RHLFRWTKE---IAYADYYERALTNGVLSIQRGTEP-GVMIY---MLPLG-RGVSKARST 462
           R L  WT E    AY DYYER L N ++  Q   +P G + Y   + P G RGV  A   
Sbjct: 339 REL--WTTEPSSSAYFDYYERTLMNHLVGQQDPEDPHGHVTYFNSLQPGGVRGVGPAWGG 396

Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
             W T ++SFWCC GTG+E+ +KL DSIYF  +G+   LY+  +  S  DW+   V + Q
Sbjct: 397 GTWSTDYDSFWCCQGTGVETNTKLMDSIYF-RDGDSSALYVNLFAPSVLDWRQRAVTVTQ 455

Query: 523 KVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PG 580
               P+         TL  +       ++   +R+P WT  +GA+  +NG++  +   PG
Sbjct: 456 TTSFPVTD-----NTTLQVAGAAGAWDMA---IRIPDWT--SGAEILVNGESANVAAEPG 505

Query: 581 NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
            + + +  W+  D +T+ LP+  R     DD     SI A+ +GP +L G+
Sbjct: 506 TYATISRDWASGDTVTVTLPMGFRLVPANDD----TSIAALAYGPVILCGN 552


>gi|289668636|ref|ZP_06489711.1| putative secreted protein [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 793

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 200/630 (31%), Positives = 289/630 (45%), Gaps = 92/630 (14%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG+ ++ V L  V L   S+   A  TN  YL+ L  D L+ +F   A L     AYGGW
Sbjct: 46  PGS-VRAVPLAQVRL-MPSLFLDALNTNRRYLMRLQPDRLLHNFVLYAGLDPQAPAYGGW 103

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A M A T +A  + +   +V  L+ CQ   G GY++ F  +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRK 161

Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
                      +FD  +          L   WAP YT HK+ AGLLD +   DN QAL++
Sbjct: 162 NAAGKIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNVQALQV 221

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A  +  Y    +Q + +     +    L+ E GG+N+    L+  T D + L LA     
Sbjct: 222 AVSLAGY----LQGIFSALDDAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHH 277

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
              L  L  Q D L H H+NT+IP +IG    YEVTGD        FF   V   H+Y  
Sbjct: 278 HAVLDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG   RE++  P  ++  L  +  E C +YNMLK++RH+++W  +    DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHVYQWGPQAELFDYYERTLLNHV 397

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           ++ Q+    G+  YM PL  G ++     GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS---- 550
                G+YI  Y+ S+    +G                 L MTL  S+  E G  S    
Sbjct: 452 G---QGVYINLYVPSTVRDAAG-----------------LDMTL-HSALPEQGSASLRID 490

Query: 551 -------SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
                  +L LR+P W      Q  LNGQ +       +L  T  W   D L++   + L
Sbjct: 491 AAPPAQRTLALRVPGWVQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPL 548

Query: 604 RTEAIQDDRPEYASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSF 657
           R E   DD P + S   +L GP +LA   G  +  W  K+      + +   + P+P   
Sbjct: 549 RLETTPDD-PAWVS---VLRGPLVLAVDLGDAAKPWSGKSPALIGGQDILQRLQPVP--- 601

Query: 658 NAQLVTFTQESGNSTFVMSNSNQSITMEEF 687
                      G + F  S+  Q   +  F
Sbjct: 602 -----------GKNAFTYSDGAQQWQLSPF 620


>gi|423223548|ref|ZP_17210017.1| hypothetical protein HMPREF1062_02203 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638305|gb|EIY32149.1| hypothetical protein HMPREF1062_02203 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 777

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 184/531 (34%), Positives = 259/531 (48%), Gaps = 39/531 (7%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A++    YLL L+ D  +  FR  A L      Y GWE+    + G  +GHYLSA A  +
Sbjct: 51  AEEKETAYLLELEPDRFLSGFRSEAGLVPKAPKYEGWES--LGVAGQTLGHYLSACAMYY 108

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVW 226
           A++ +    +++   +  L  CQ   G GYL+A P    +F    A         L   W
Sbjct: 109 ATSGDERFLQRLEYTINELDSCQQANGDGYLAATPDGKRIFKEVSAGKIYSQGFDLNGGW 168

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            P Y +HK+LAGL+D Y  A N +AL +A  +  + Y   Q +    + E+    L  E 
Sbjct: 169 VPLYVMHKVLAGLIDTYQYAHNERALAVAEKLANWMYGTFQHL----TEEQMQKVLACEF 224

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
           GGMN+ L  LY+ T + K L LA  FD     +  LA+  D L   HANT +P +IG+  
Sbjct: 225 GGMNEALANLYACTKNEKFLALAQRFDNHKAIMDSLAVGVDDLEGKHANTQVPKIIGAAR 284

Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
            YE+TG      I +FF   V  +HSY  GG S  E +  P +L + L + N ETC TYN
Sbjct: 285 LYELTGSKRDSAIASFFWHTVVQNHSYVNGGNSDGEHFGTPGQLNERLSTSNTETCNTYN 344

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
           MLK++RHLF W     Y+ YYERA+ N +L+ Q   + G+  Y  PL  G  K     G+
Sbjct: 345 MLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGKK-----GY 398

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
            + F SF CC G+G+E+  K GD IY   EG+   L++  +I S  +W    +++ Q  D
Sbjct: 399 LSPFQSFCCCSGSGMENHVKYGDFIY--SEGSDSSLWVNLFIPSQLNWTDRKMIVTQDTD 456

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
            I S D   +  LT   K E  Q     LR P W  S   +  +NG ++      N   +
Sbjct: 457 -IPSSD---KTVLTV--KTEKPQSVIFRLRYPEWAES--MRIRVNGSSVSFEASNNSYVS 508

Query: 586 TER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
            ER W  NDK+ I   +   T ++ D+         I +GP LLAG    E
Sbjct: 509 IEREWKDNDKIEITFKIKFYTVSMPDNEKRV----GIFYGPVLLAGELGTE 555


>gi|354583886|ref|ZP_09002783.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
 gi|353197148|gb|EHB62641.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
          Length = 778

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 185/547 (33%), Positives = 271/547 (49%), Gaps = 47/547 (8%)

Query: 102 VSLHDVWLDQ--------SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGG 153
           V L D+W D           +   +Q+T   YLL LDVD L+    + ASL      YGG
Sbjct: 3   VKLVDLWGDDIMPKTELLEGIFKESQETGKGYLLHLDVDRLMAPCYEAASLEPKKPRYGG 62

Query: 154 WEN-PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP 212
           WE  PI+   GH +GH+LSA+A M  +T +  + +K+   V  L+  Q+    GY+S FP
Sbjct: 63  WEETPIA---GHSIGHWLSAAAAMIDATSDEELLKKLVYAVNELAYVQSHDKDGYVSGFP 119

Query: 213 TELFD-----SFE----ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFY 263
            + FD      FE    +L   W P+Y++HKI AGL+D Y L    QAL++   + ++  
Sbjct: 120 RDCFDIVFTGDFEVHNFSLAGSWVPWYSLHKIFAGLIDAYRLTGIEQALEVVIRLADW-- 177

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
              +K     + E+    L  E GGMND +  LY +T++  +L LA  F     L  LA 
Sbjct: 178 --AKKGTDRLTDEQFQRMLICEHGGMNDTMADLYRLTNNHAYLELAIRFCHRAILEPLAR 235

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
             D L   HANT IP VIG+   YE+TGD  Y+    FF   V  + SY  GG S  E +
Sbjct: 236 GVDELEGKHANTQIPKVIGAAKLYEITGDDFYRKAAEFFWKEVTRNRSYIIGGNSIFEHF 295

Query: 384 WDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
               +  + LG E  ETC TYNMLK++ HLF W+++  Y D+YERAL N +L+ Q   + 
Sbjct: 296 RAANQ--EKLGVETAETCNTYNMLKLTDHLFGWSQDAEYMDFYERALYNHILASQ-DPDT 352

Query: 444 GVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
           G+ +Y +    G  K      +GT  +SFWCC GTG+E+ ++    IY         +Y+
Sbjct: 353 GMKMYFVSTEPGHFKV-----YGTAEHSFWCCTGTGMENPARYTHEIY---HATSNAIYV 404

Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
             +I+S   +    VV+ Q+ +      P    T     + +      L +R+P WT + 
Sbjct: 405 NLFIASKATFDDHQVVIRQETEF-----PKQSRTRLIIEEAKAAHF-KLRIRIPQWT-AG 457

Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
              A +NG  +       +L+    W+  D + + LP+ LR    +DD    A    IL+
Sbjct: 458 AVTAVVNGSEIYADAEPGYLNIERDWNAGDTIEVTLPMELRLYHAKDD----AKKVGILY 513

Query: 624 GPYLLAG 630
           GP +LAG
Sbjct: 514 GPIVLAG 520


>gi|383644433|ref|ZP_09956839.1| hypothetical protein SeloA3_13744 [Sphingomonas elodea ATCC 31461]
          Length = 746

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 188/571 (32%), Positives = 274/571 (47%), Gaps = 67/571 (11%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A + N   LL L+ D L+ +FRK A L   GK YGGWE+    + GH +GHYL+A   MW
Sbjct: 14  AVEVNHRALLQLEPDRLLHNFRKYAGLEPKGKLYGGWES--DTIAGHTLGHYLTALVLMW 71

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA----------------FPTELFDSFEA 221
             T +  ++ +   +V  L+E Q K GTGY+ A                FP  +    ++
Sbjct: 72  QQTGDPEMRRRADYIVAELAEAQAKRGTGYVGALGRKRKDGTIVDGEEIFPEIMRGEIKS 131

Query: 222 ----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
               L   W+P YT+HK+ AGLLD +    NAQAL++   +  YF    +KV    +  +
Sbjct: 132 GGFDLNGSWSPLYTVHKVFAGLLDVHAGWGNAQALQVTLGLAGYF----EKVFAALNDAQ 187

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
               L  E GG+N+    LY+ T D + +++A        LG L    D L++FHANT +
Sbjct: 188 MQQMLGCEYGGLNESYAELYARTRDARWMVVAKRLYDDRVLGPLKAGEDKLANFHANTQV 247

Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
           P +IG    +E+TGD        FF + V   HSY  GG + RE++  P  +A  +  + 
Sbjct: 248 PKLIGLARIHELTGDAGDATAARFFWERVTGHHSYVIGGNADREYFSAPDSIAQHITDQT 307

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
            E C TYNMLK++ HLF W       DYYERA  N V++ Q   + G   YM PL  G  
Sbjct: 308 CEHCNTYNMLKLTSHLFAWQPNGVLFDYYERAHLNHVMAAQ-NPKTGGFTYMTPLMSGAE 366

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
           +  S        ++FWCC G+G+ES +K G++ +++ EG    L +  YI +  DWK+  
Sbjct: 367 RQYSQ----PNEDAFWCCIGSGLESHAKHGEAAFWQGEG---ALLVNLYIPAEIDWKA-- 417

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVWTYSNGAQASLNGQNLPL 576
               QK   ++        T T   +Q       ++ LR+P W     A  ++NG+    
Sbjct: 418 ----QKAKLVLDTAYPFEGTATLKVEQLARAARFAIALRVPGWAEGK-AVVTVNGK---- 468

Query: 577 PPPGN------FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
             PG+      +      W  +D + I LP++LR EA   D     S  A+L GP +LAG
Sbjct: 469 --PGDAVFDRGYAIVARSWKRDDTIAISLPMALRLEAAPGDD----STVAVLRGPMVLAG 522

Query: 631 H---TSGEWDIK----TGTARSLSALISPIP 654
               TS  W+       GT   L A  +P P
Sbjct: 523 DLGPTSTPWNAGDPALVGT--DLLAAFTPAP 551


>gi|383779543|ref|YP_005464109.1| hypothetical protein AMIS_43730 [Actinoplanes missouriensis 431]
 gi|381372775|dbj|BAL89593.1| hypothetical protein AMIS_43730 [Actinoplanes missouriensis 431]
          Length = 799

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 180/537 (33%), Positives = 261/537 (48%), Gaps = 41/537 (7%)

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
           + YL  +D+D ++  FR TA LP+  +  GGWE P  +LRGH  GH LS  AQ      +
Sbjct: 61  VAYLRFVDLDRMLHMFRVTAGLPSAAEPLGGWEAPTVQLRGHTTGHLLSGLAQAAYHLDD 120

Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
             +K + + +V  L  CQ     GYLSAFP  +FD  EA K  WAPYYTIHKI AGLLDQ
Sbjct: 121 RDLKARSAALVDGLKACQAP--NGYLSAFPETIFDQLEAGKNPWAPYYTIHKIFAGLLDQ 178

Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
           + L  N  AL +A  M ++  +RV K+    + E+    L+ E GGMN+    LY +T +
Sbjct: 179 HRLLGNTTALDVARRMADWVGSRVSKL----TREQMQKVLHVEFGGMNESFVNLYRVTGE 234

Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
             HL LA  FD       L+ + D L+  HANT IP V+G+   Y+ TG   ++ I T+F
Sbjct: 235 AAHLELARAFDHDEIFVPLSEKRDTLAGRHANTDIPKVVGAAAMYQATGSDYHRTIATYF 294

Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRW-TKEIA 421
            D V   HSY  GG S  EF+  P ++   LG    E C TYNMLK++  L+        
Sbjct: 295 WDQVVRHHSYVIGGNSNAEFFGPPGQVVSQLGENTCENCNTYNMLKLTERLYAIDPSRTD 354

Query: 422 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG----------WGTKFNS 471
           Y DY+E AL N +L  Q   +P      +    G+S   S  G          + + + +
Sbjct: 355 YLDYHEWALINQMLGEQ---DPDSAHGNVTYYTGLSSTASRKGKEGLVSDPGSYSSDYGN 411

Query: 472 FWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWD 531
           F C +G+G+E+ +K  + IY         L +  +I S   ++   + +N          
Sbjct: 412 FSCDHGSGLETHTKFAEPIYDTSRDT---LSVKLFIPSETTFRGAKIQINTMF------- 461

Query: 532 PYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSY 591
           PY R T+        G   +L +R+P W      +  +NG+ +P   PG F +    W  
Sbjct: 462 PY-RETVRLRV-DGTGAPFTLRVRIPSWVRDPALR--VNGKPVPA-HPGRFATIRRVWRR 516

Query: 592 NDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH--TSGEWDIKTGTARSL 646
            D +T+ LP   R        P+  ++ A+ +GP +LAG     G   + T   R+L
Sbjct: 517 GDVVTLHLPFRTRWLPA----PDNPAVHALTYGPLVLAGRYGAQGPATLPTADPRTL 569


>gi|374992736|ref|YP_004968231.1| hypothetical protein SBI_09982 [Streptomyces bingchenggensis BCW-1]
 gi|297163388|gb|ADI13100.1| hypothetical protein SBI_09982 [Streptomyces bingchenggensis BCW-1]
          Length = 733

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 189/544 (34%), Positives = 274/544 (50%), Gaps = 51/544 (9%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
           WLD        +     YL  +D D L+++FR    LPT G A  GGW+ P    R H  
Sbjct: 18  WLDN-------ENRTRNYLRFVDADRLLYNFRANHRLPTNGAASNGGWDGPTFPFRTHVQ 70

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEA 221
           GH+L+A AQ++A T + T ++K + +V  L++CQ   G      GYLS FP   F + EA
Sbjct: 71  GHFLTAWAQVYAVTGDTTCRDKAAYMVAELAKCQANNGAAGFNGGYLSGFPESDFSALEA 130

Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSV 275
             L     PYY IHKILAGLLD +    + QA    L +A W V++   R+       S 
Sbjct: 131 GTLSNGNVPYYVIHKILAGLLDVWRHMGSTQARDMLLSLAGW-VDWRTGRL-------SG 182

Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
           ++   +L  E GGMN VL  LY  T D + L  A  FD       LA   D L+  HANT
Sbjct: 183 QQMQSTLGTEFGGMNAVLSDLYLQTSDSRWLTTAQRFDHGAVFDPLASNQDRLNGLHANT 242

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
            +P  IG+   Y+ TG   Y+ I T   +I   +H+Y  GG S  E +  P  +A  L  
Sbjct: 243 QVPKWIGAAREYKATGTTRYRDIATNAWNICVNAHTYVIGGNSQAEHFRPPNAIAAYLNQ 302

Query: 396 ENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG 453
           +  E+C TYNML ++R LF    + +A  DYYERA  N ++  Q   +  G + Y  PL 
Sbjct: 303 DACESCNTYNMLTLTRELFTLDPDRVALFDYYERAWLNQMIGQQNPADNHGHVTYFTPLN 362

Query: 454 ----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
               RGV  A     W T ++SFWCC GTG+E  +KL DS+YF  +     L +  ++ S
Sbjct: 363 PGGRRGVGPAWGGGTWSTDYDSFWCCQGTGLEMHTKLMDSVYFSSD---TTLIVNLFVPS 419

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             +W    + + Q     VS    L++T   S         ++ +R+P WT   GA  S+
Sbjct: 420 VLNWSQRGITVTQTTSYPVSDTTTLQVTGNLSGTW------AMRIRIPSWTA--GATISV 471

Query: 570 NG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           NG  QN+    PG++ + T  W+  D +T++LP+ +    I     + A++ A+ +GP +
Sbjct: 472 NGTTQNIT-TTPGSYATLTRSWTSGDTVTVRLPMRI----IMRAANDNANVAAVTYGPVV 526

Query: 628 LAGH 631
           L+G+
Sbjct: 527 LSGN 530


>gi|339021543|ref|ZP_08645591.1| hypothetical protein ATPR_1899 [Acetobacter tropicalis NBRC 101654]
 gi|338751393|dbj|GAA08895.1| hypothetical protein ATPR_1899 [Acetobacter tropicalis NBRC 101654]
          Length = 799

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 192/572 (33%), Positives = 278/572 (48%), Gaps = 67/572 (11%)

Query: 88  NPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTP 147
            PGG    G  +  V L DV L  S  L  A ++N  YLL L  D L+ +FR+ A LP  
Sbjct: 34  GPGGVG-AGESVTPVPLQDVRLLPSHWL-DAVESNRAYLLSLSADRLLHNFRRQAGLPPK 91

Query: 148 GKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGY 207
           G+ YGGWEN    + GH +GHYLSA A M+A T +   + +++ +V  L+  Q+K G GY
Sbjct: 92  GEVYGGWEN--DTIAGHTLGHYLSALALMYAQTGDTECRRRVAYIVQELAIVQDKWGDGY 149

Query: 208 LSAFPTE-----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLAD 247
           ++ F  +           +F   E          L   W+P Y IHK  AGL D      
Sbjct: 150 VAGFTRKEKDGTITDGKVIFAEMEKGDIRSGGFDLNGAWSPLYNIHKTFAGLFDAQTYCQ 209

Query: 248 NAQALKMATWM---VEYFYNRV-----QKVITMYSVERHWYSLNEETGGMNDVLYRLYSI 299
           +  AL +A  +    E FY+++     QKV+T             E GG+N+    L + 
Sbjct: 210 DPNALAVAVKLGGFFEAFYSKLTDAQLQKVLTC------------EYGGLNESFAELAAR 257

Query: 300 THDPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLI 358
           T D K L LA   +D+P     +A   D L++ HANT IP +IG     EV+ D  +++ 
Sbjct: 258 TGDAKWLRLAKRTYDRPVLDPLMARHDD-LANRHANTQIPKLIGLGRIAEVSRDAHWQVG 316

Query: 359 GTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTK 418
             FF   V   HSY  GG + RE++ +P  ++  +  +  E C TYNMLK++R L+ W  
Sbjct: 317 PRFFWQAVTQHHSYVIGGNADREYFSEPDTISQHITEQTCEHCNTYNMLKLTRQLYTWQP 376

Query: 419 EIAYADYYERALTNGVLSIQRGTEPGVMIYMLP-LGRGVSKARSTHGWGTKFNSFWCCYG 477
           + A  DYYERA  N VL+     + G+  YM P +  GV +      W T  +SFWCC G
Sbjct: 377 DSALFDYYERAHLNHVLAAH-DPQTGMFTYMTPTITAGVRE------WSTPTDSFWCCVG 429

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           TG+ES +K G+SI++E       L++  YI S   W   +V    K        PY    
Sbjct: 430 TGMESHAKHGESIWWE---GAETLFVNLYIPSRVQWARKNVSWRMKTR-----YPYDGQV 481

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTI 597
                  +  +  +L LR+P W   +    ++NGQ++   P G +L     W   D + +
Sbjct: 482 TLKVEDVKAPEPFALALRVPGWVKGD-LSLTVNGQSVSATPSGGYLMLNRTWHAGDTVAL 540

Query: 598 QLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
            LPL+LRTEA      E   + ++L GP +LA
Sbjct: 541 TLPLALRTEAPV----EAPHLVSLLHGPMVLA 568


>gi|285018715|ref|YP_003376426.1| hypothetical protein XALc_1948 [Xanthomonas albilineans GPE PC73]
 gi|283473933|emb|CBA16434.1| conserved hypothetical protein [Xanthomonas albilineans GPE PC73]
          Length = 810

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 204/647 (31%), Positives = 308/647 (47%), Gaps = 61/647 (9%)

Query: 57  DDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLW 116
           D +A ++L PS+    Q   ++ A    +         PG  ++ + L  V L + S+  
Sbjct: 21  DHAAGAALDPSRRRFLQWSALAMAAGLLRFPQDAAASTPGR-VQALPLRQVTL-KPSLFL 78

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
            + QTN  YLL L+ D L+ +F + A LP  G  YGGWE     + GH +GHYLSA ++M
Sbjct: 79  DSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGAVYGGWEG--DTIAGHTLGHYLSALSKM 136

Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDS--FEALKPV--------- 225
            A T +++++ ++  +V  L+  Q +   GY+  F T   D+   E  K V         
Sbjct: 137 HAQTRDSSLRTRIDYIVAELARAQAQDPDGYVGGF-TRKNDNGKIEGGKAVLEDLRRGII 195

Query: 226 ----------WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSV 275
                     W+P YT HK+ AGLLD + L  NAQAL +   +  YF      V      
Sbjct: 196 KGGKFNLNGSWSPLYTQHKLFAGLLDAHALGGNAQALTVLVKVAGYFAG----VFDALDH 251

Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
            +    L+ E GG+N+    L + T   + + +         +  LA   D L H HANT
Sbjct: 252 AQMQTLLDTEFGGLNESFIELGARTGQERWIAIGKRLRHEKIIDPLAAGHDVLPHIHANT 311

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
            +P  IG   ++EV GD        FF + V A +SY  GG S RE++ +P  +A  L  
Sbjct: 312 QVPKFIGEARQFEVAGDADAAAAARFFWETVTAHYSYVIGGNSDREYFQEPDSIAGFLTE 371

Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
           +  E C +YNMLK++RHL++WT +  Y DYYER L N  ++ Q     G+  YM P+  G
Sbjct: 372 QTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAAQHPAT-GMFTYMTPMISG 430

Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
             +     G+  KF+SFWCC G+G+E+ ++ GD+IY+++E     LY+  YI S  DW  
Sbjct: 431 GER-----GFSEKFDSFWCCVGSGMEAHAQFGDAIYWQDEA---ALYVNLYIPSRLDWSE 482

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
             + L  ++D  V  +  +R+ +  +  +   +L    LR+P W   +     LNG+ L 
Sbjct: 483 RDLAL--ELDSGVPENGKVRLQVLRAGARAPRRLL---LRVPAWCQGS-YTLRLNGKPLR 536

Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA---GHT 632
             P   +L+    W   D + ++L   LR E    D PE      ++ GP  LA   G  
Sbjct: 537 RTPIDGYLALERDWRSGDVIELELATPLRLEHAAGD-PESV---VVMRGPLALAADLGPV 592

Query: 633 SGEWDIK----TGTARSLSALIS-PIPPSFNAQLVTFTQESGNSTFV 674
           S  +D        TA  L+  +  P P  F   L + TQ  G  TFV
Sbjct: 593 STPYDAPDPALVATADPLAGFVELPQPGHF---LASDTQPPG-LTFV 635


>gi|224536588|ref|ZP_03677127.1| hypothetical protein BACCELL_01463 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521844|gb|EEF90949.1| hypothetical protein BACCELL_01463 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 777

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 183/526 (34%), Positives = 258/526 (49%), Gaps = 39/526 (7%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A++    YLL L+ D  +  FR  A L      Y GWE+    + G  +GHYLSA A  +
Sbjct: 51  AEEKETAYLLELEPDRFLSGFRSEAGLVPKAPKYEGWES--LGVAGQTLGHYLSACAMYY 108

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVW 226
           A++ +    +++   +  L  CQ   G GYL+A P    +F    A         L   W
Sbjct: 109 ATSGDERFLQRLEYTINELDSCQQANGDGYLAATPDGKRIFKEVSAGKIYSQGFDLNGGW 168

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            P Y +HK+LAGL+D Y  A N +AL +A  +  + Y   Q +    + E+    L  E 
Sbjct: 169 VPLYVMHKVLAGLIDTYQYAHNERALVVAEKLANWMYGTFQHL----TEEQMQKVLACEF 224

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
           GGMN+ L  LY+ T + K L LA  FD     +  LA+  D L   HANT +P +IG+  
Sbjct: 225 GGMNEALANLYACTKNEKFLALAQRFDNHKAIMDSLAVGVDDLEGKHANTQVPKIIGAAR 284

Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
            YE+TG      I +FF   V  +HSY  GG S  E +  P +L + L + N ETC TYN
Sbjct: 285 LYELTGSKRDSAIASFFWHTVVQNHSYVNGGNSDGEHFGTPGQLNERLSTSNTETCNTYN 344

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
           MLK++RHLF W     Y+ YYERA+ N +L+ Q   + G+  Y  PL  G  K     G+
Sbjct: 345 MLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGKK-----GY 398

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
            + F SF CC G+G+E+  K GD IY   EG+   L++  +I S  +W    +++ Q  D
Sbjct: 399 LSPFQSFCCCSGSGMENHVKYGDFIY--SEGSDSSLWVNLFIPSQLNWTDRKMIVTQDTD 456

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
            I S D   +  LT   K E  Q     LR P W  S   +  +NG ++      N   +
Sbjct: 457 -IPSSD---KTVLTV--KTEKSQSVIFRLRYPEWAES--MRIKVNGSSVSFEASNNSYVS 508

Query: 586 TER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
            ER W  NDK+ I   +   T ++ D+         I +GP LLAG
Sbjct: 509 IEREWKDNDKIEITFKIKFYTVSMPDNEKRV----GIFYGPVLLAG 550


>gi|86196151|gb|EAQ70789.1| hypothetical protein MGCH7_ch7g196 [Magnaporthe oryzae 70-15]
 gi|440463815|gb|ELQ33359.1| hypothetical protein OOU_Y34scaffold00969g44 [Magnaporthe oryzae
           Y34]
 gi|440485206|gb|ELQ65183.1| hypothetical protein OOW_P131scaffold00516g8 [Magnaporthe oryzae
           P131]
          Length = 633

 Score =  266 bits (679), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 183/535 (34%), Positives = 262/535 (48%), Gaps = 40/535 (7%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
           Q   L Y+  +D++ L+++FR    + T G +A GGW+ P    R H  GH+L+A A  +
Sbjct: 53  QDRTLTYIKFVDLNRLLYNFRANHGVSTNGAQANGGWDAPDFPFRSHIQGHFLTAWANCY 112

Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE--ALKPVWAPYY 230
           A   +   + +    V  L++CQ+         GYLS FP     + E   L     PYY
Sbjct: 113 AVLKDQECRSRAEQFVEELAKCQDNNAAAGFQAGYLSGFPESDITAVEQRTLTNGNVPYY 172

Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
            IHK +AGLLD +    + +A  +   M  +   R  ++    S  +    +  E GGM+
Sbjct: 173 AIHKTMAGLLDVWRNVGSTKAKDVLVKMAGWVDTRTARL----SYAQMQSMMGTEFGGMS 228

Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           +VL  ++  T D + L +A  FD    L  LA   D L   HANT +P  IG+   Y+ T
Sbjct: 229 EVLADMFHQTGDERWLTVARRFDHAAVLDPLARSQDSLDGLHANTQVPKWIGAAREYKAT 288

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
            D  Y  I     D    +H+YA GG S  E +  P  +A  L  +  E C TYNMLK++
Sbjct: 289 KDQRYLDIARNAWDFTVEAHTYAIGGNSQSEHFRPPNAIAGYLLHDTAEACNTYNMLKLT 348

Query: 411 RHLFR-----WTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLG----RGVSKAR 460
           R LF         + A  D+YERAL N +L  Q  G   G + Y  PL     RGV  A 
Sbjct: 349 RELFMHDAAPGMNDTAKFDFYERALLNHLLGQQDPGDGHGHVTYFTPLNPGGRRGVGPAW 408

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KSGHV 518
               W T + SFWCC GTGIE+ +KL DSIYF    N   LY+  +I SS  W  + G V
Sbjct: 409 GGGTWSTDYESFWCCQGTGIETNTKLMDSIYFRSRDN-NALYVNLFIPSSVQWSDRDGVV 467

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP--- 575
           V  +   P+         TLT S     G   +L++R+P W  + GA+ S+NGQ +    
Sbjct: 468 VTQETEFPLGD-----ATTLTVSGAG--GGRWTLSVRIPSWV-AGGAEVSVNGQKVGGDV 519

Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
              PG + + T  W+  DK+T++LP+ L T A  DD     ++ A+ +GP +L+G
Sbjct: 520 RTTPGGYAAITREWAVGDKVTVRLPMKLHTVAANDD----PTLVALAYGPAILSG 570


>gi|169596765|ref|XP_001791806.1| hypothetical protein SNOG_01152 [Phaeosphaeria nodorum SN15]
 gi|111069681|gb|EAT90801.1| hypothetical protein SNOG_01152 [Phaeosphaeria nodorum SN15]
          Length = 620

 Score =  265 bits (678), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 192/557 (34%), Positives = 278/557 (49%), Gaps = 60/557 (10%)

Query: 99  LKEVSLHDV-WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWEN 156
           L +VSL +  W D        +   L YL  ++VD L+++FR T  L T G +  GGW+ 
Sbjct: 39  LSQVSLSNSRWKDN-------ENRTLNYLKAVNVDRLLYNFRATHKLSTNGAQPNGGWDA 91

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-----TGYLSAF 211
           P    R H  GHYL+A    +A+  +   K + S  V  L++CQ   G     TGYLS F
Sbjct: 92  PNFPFRSHAQGHYLTAWVHCYATLRDNECKNRASYFVQELAKCQANNGAAQFSTGYLSGF 151

Query: 212 PTELFDSFEA--LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
           P   F + EA  LK    PYY +HK +AGLLD + +  + +A  +   +  +   R +K+
Sbjct: 152 PESEFVALEAGQLKGGNVPYYAVHKTMAGLLDAWRIIGDTKARDVLLALAGWVDGRTKKL 211

Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
               S  +    L  E GGMNDVL  +Y +T + + L +A  FD       LA   D LS
Sbjct: 212 ----SSSQMQTMLGTEFGGMNDVLAAIYQLTGNQQWLTVAQRFDHASQFDPLANNQDRLS 267

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
             HANT +P  IG+   Y+ TG   Y  I     D    +H+YA GG S  E +  P ++
Sbjct: 268 GNHANTQVPKWIGAAREYKSTGTKRYLDIAKNAWDFTINAHTYAIGGNSQAEHFRPPNQI 327

Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKE---IAYADYYERALTNGVLSIQRGTEP-GV 445
           ++ L ++  E C TYNMLK++R L  WT +     Y DYYERAL N +L  Q  T+  G 
Sbjct: 328 SNFLTNDTAEQCNTYNMLKLTRDL--WTTDPSSTKYFDYYERALINHLLGAQNPTDNHGH 385

Query: 446 MIYMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
           + Y  PL     RG+  A     W T +NSFWCC GT +E+ +KL DSIYF +      L
Sbjct: 386 ITYFTPLKSGGRRGIGPAWGGGTWSTDYNSFWCCQGTALETNTKLMDSIYFYDSS---AL 442

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS------SLNLR 555
           Y+  +  S+ DWK   V ++Q                TF +              ++ +R
Sbjct: 443 YVNLFTPSTLDWKQRSVKISQ--------------VTTFPASDTTTLTVTGTGNWAMKIR 488

Query: 556 MPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           +P WT  +GA  S+N Q   +   PG++ + +  W   D +T++LP+ LRT A      +
Sbjct: 489 IPSWT--SGATISINRQASGVAANPGSYATLSRDWKSGDIVTVKLPMKLRTVAAN----D 542

Query: 615 YASIQAILFGPYLLAGH 631
            A+I A+ FGP +L+G+
Sbjct: 543 NANIAAVAFGPVILSGN 559


>gi|389647349|ref|XP_003721306.1| hypothetical protein MGG_09030 [Magnaporthe oryzae 70-15]
 gi|351638698|gb|EHA46563.1| hypothetical protein MGG_09030 [Magnaporthe oryzae 70-15]
          Length = 680

 Score =  265 bits (678), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 183/535 (34%), Positives = 262/535 (48%), Gaps = 40/535 (7%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
           Q   L Y+  +D++ L+++FR    + T G +A GGW+ P    R H  GH+L+A A  +
Sbjct: 100 QDRTLTYIKFVDLNRLLYNFRANHGVSTNGAQANGGWDAPDFPFRSHIQGHFLTAWANCY 159

Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE--ALKPVWAPYY 230
           A   +   + +    V  L++CQ+         GYLS FP     + E   L     PYY
Sbjct: 160 AVLKDQECRSRAEQFVEELAKCQDNNAAAGFQAGYLSGFPESDITAVEQRTLTNGNVPYY 219

Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
            IHK +AGLLD +    + +A  +   M  +   R  ++    S  +    +  E GGM+
Sbjct: 220 AIHKTMAGLLDVWRNVGSTKAKDVLVKMAGWVDTRTARL----SYAQMQSMMGTEFGGMS 275

Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           +VL  ++  T D + L +A  FD    L  LA   D L   HANT +P  IG+   Y+ T
Sbjct: 276 EVLADMFHQTGDERWLTVARRFDHAAVLDPLARSQDSLDGLHANTQVPKWIGAAREYKAT 335

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
            D  Y  I     D    +H+YA GG S  E +  P  +A  L  +  E C TYNMLK++
Sbjct: 336 KDQRYLDIARNAWDFTVEAHTYAIGGNSQSEHFRPPNAIAGYLLHDTAEACNTYNMLKLT 395

Query: 411 RHLFR-----WTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLG----RGVSKAR 460
           R LF         + A  D+YERAL N +L  Q  G   G + Y  PL     RGV  A 
Sbjct: 396 RELFMHDAAPGMNDTAKFDFYERALLNHLLGQQDPGDGHGHVTYFTPLNPGGRRGVGPAW 455

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KSGHV 518
               W T + SFWCC GTGIE+ +KL DSIYF    N   LY+  +I SS  W  + G V
Sbjct: 456 GGGTWSTDYESFWCCQGTGIETNTKLMDSIYFRSRDN-NALYVNLFIPSSVQWSDRDGVV 514

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP--- 575
           V  +   P+         TLT S     G   +L++R+P W  + GA+ S+NGQ +    
Sbjct: 515 VTQETEFPLGD-----ATTLTVSGAG--GGRWTLSVRIPSWV-AGGAEVSVNGQKVGGDV 566

Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
              PG + + T  W+  DK+T++LP+ L T A  DD     ++ A+ +GP +L+G
Sbjct: 567 RTTPGGYAAITREWAVGDKVTVRLPMKLHTVAANDD----PTLVALAYGPAILSG 617


>gi|114047478|ref|YP_738028.1| hypothetical protein Shewmr7_1982 [Shewanella sp. MR-7]
 gi|113888920|gb|ABI42971.1| protein of unknown function DUF1680 [Shewanella sp. MR-7]
          Length = 795

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 178/554 (32%), Positives = 276/554 (49%), Gaps = 54/554 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L  + L+DV L     L  AQQT+L Y++ +D + L+  +RK A + T    Y  WEN  
Sbjct: 28  LTPIPLNDVRLTAGPFL-HAQQTDLAYIMSMDPERLLAPYRKEAGIATTADNYPNWEN-- 84

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
           + L GH  GHYLSA A M+A+T +  + E+++ +V  L +CQ   G GY+   P   +L+
Sbjct: 85  TGLDGHIGGHYLSALALMYAATGDQAVLERLNYMVAELEKCQQAHGNGYVGGVPHGDKLW 144

Query: 217 DSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFY 263
               A         L   W P+Y +HK+ AGL D Y+   N  A KM    A WM++   
Sbjct: 145 QQVAAGHIEADLFTLNQSWVPWYNLHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDLSR 204

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
           N   + + +         L  E GG+N+ L  +YSIT   K+L LA+ +     L  L  
Sbjct: 205 NLTDEQLQLM--------LRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQ 256

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
             D L+  HANT IP ++G     E++ +  +     +F   V    + + GG S RE +
Sbjct: 257 HQDKLTRLHANTQIPKIVGVARIAELSHNKAWLESADYFWQQVVHQRTVSIGGNSVREHF 316

Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
              +  +  L S E  ETC TYNMLK+S+ L+   +++ Y DYYERAL N +LS Q   +
Sbjct: 317 HPSEDFSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-PQ 375

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
            G ++Y  P+     +      + +   S WCC G+GIE+ +K G+ IY EE+ N   L+
Sbjct: 376 TGGLVYFTPM-----RPDHYRVYSSAQESMWCCVGSGIENHAKYGELIYAEEDNN---LF 427

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS--SLNLRMPVWT 560
           +  ++ S  +WK+  + L+QK           +     +S+  + Q +  +LNLR P W 
Sbjct: 428 VNLFVDSEVNWKAKGISLSQKT----------QFPDDNTSQMIIHQEADFTLNLRYPTWA 477

Query: 561 YSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
             +    S+NG+     P  G ++  T  W   D +TI LP+ +  E + D    Y    
Sbjct: 478 KGD-VTVSINGEPQRFTPTQGQYIPLTRHWRKGDSVTITLPMDISLEQLPDKTAYY---- 532

Query: 620 AILFGPYLLAGHTS 633
           ++L+GP +LA  T+
Sbjct: 533 SVLYGPIVLAAKTA 546


>gi|325106128|ref|YP_004275782.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324974976|gb|ADY53960.1| protein of unknown function DUF1680 [Pedobacter saltans DSM 12145]
          Length = 782

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 194/632 (30%), Positives = 285/632 (45%), Gaps = 67/632 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L+EV L D       +   A+Q +L+Y+L +D+D L+  + + A L    K+YG WEN  
Sbjct: 32  LQEVKLLD------GIFKNAEQVDLKYILSMDMDKLLAPYLREAGLSEKAKSYGNWEN-- 83

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
           S L GH  GHYLSA + M+AST N  I +++   +  L  CQ+  G GYL   P      
Sbjct: 84  SGLDGHIGGHYLSALSLMYASTKNPDINKRIDYYLSELKRCQDANGDGYLGGVPDGKAMW 143

Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
                 ++  +  +L   W P Y IHK+ AGL D +V   N  A    +K+  W    F 
Sbjct: 144 RDISDGKIDAATFSLNKKWVPLYNIHKVFAGLYDAWVYTGNNTAKDMFIKLCDWATTTFG 203

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
           N  ++ I           L  E GG+N+     Y +T   K++ LA  F     L  L  
Sbjct: 204 NLNEQQIQQM--------LKSEHGGINESFADAYKLTGQQKYMDLALKFSHKAILDPLRN 255

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
           Q D L+  HANT IP VIG +   E+     +    TFF D V    + A GG S RE +
Sbjct: 256 QEDKLTGIHANTQIPKVIGFEKISEIEHKDDWHKAATFFWDNVVYKRTVAIGGNSVREHF 315

Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
                    +   E  ETC TYNM+K+S+ L+  + E  Y DY E+AL N +LS Q   E
Sbjct: 316 HPINNFMPMIEDIEGPETCNTYNMIKLSKALYNQSGETKYIDYIEKALYNHILSSQH-PE 374

Query: 443 PGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
            G  +Y  P+       R  H   +     S WCC G+G+E+ +K G+ IY     N   
Sbjct: 375 KGGFVYFTPM-------RPNHYRVYSQPETSMWCCVGSGLENHAKYGEFIYAH---NDKD 424

Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
           L++  +I S  DWK   + + Q  +     +  +++T      +   +  ++N+R+P W 
Sbjct: 425 LFVNLFIPSELDWKEKKIKITQTTNFPEEGNTSIKLT------EIKNENFNINIRIPNWA 478

Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
             N     +NG+ +     G +++  ++W   D++ I LPLS R E + D  P YAS   
Sbjct: 479 SENDISVKINGKQIQPIVEGKYITLNKKWKKGDEINIDLPLSNRIEQMPDGLP-YAS--- 534

Query: 621 ILFGPYLLAGHT------------SGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQES 668
           I +GP LLA  T            S    I  G    LS     I    +  L   T++S
Sbjct: 535 IFYGPILLAAKTDTIDLKGLFADDSRGGHIAKGKQLPLSTAPQFIVEKKDDILKNLTKQS 594

Query: 669 GNSTFVMSNSNQSITMEEFPVSGTDAALHATF 700
            N  F  +N   S  +E  P        +A +
Sbjct: 595 NNLIFKSANIKYSKNLELVPFYKVHDTRYAVY 626


>gi|329847073|ref|ZP_08262101.1| hypothetical protein ABI_01350 [Asticcacaulis biprosthecum C19]
 gi|328842136|gb|EGF91705.1| hypothetical protein ABI_01350 [Asticcacaulis biprosthecum C19]
          Length = 800

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 183/566 (32%), Positives = 277/566 (48%), Gaps = 51/566 (9%)

Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
           V L DV L  S  L  A + N +YL+ L  D ++ ++ K A LP  G+ YGGWE+    +
Sbjct: 46  VPLSDVRLLPSPFL-TAVEANTKYLMFLSPDRMLHNYHKFAGLPVKGEIYGGWES--DTI 102

Query: 162 RGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF---------- 211
            G  +GHYLSA + ++A T +A  + ++  ++  L++ Q   G GY + F          
Sbjct: 103 AGEALGHYLSALSLLYAQTGHAEARTRIEYIIAELAKVQAAHGDGYAAGFMRKRKDASIV 162

Query: 212 -PTELFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEY 261
              E+F    A         L   W P+Y  HK+ AGL+D    A     + +A  +  Y
Sbjct: 163 DGKEIFAEIMAGDIRSAGFDLNGCWVPFYNWHKLFAGLMDAQTYAGIDAGIPVAVALGGY 222

Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
               ++KV    + E+    L+ E GG+N+    LY+ T DP+ L LA        L  L
Sbjct: 223 ----IEKVFAALNDEQVQKVLDCEHGGINESFAELYTRTKDPRWLALAERIYHHRILDPL 278

Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
               D L++ HANT +P ++G    YE+TG P Y+   +FF D V   HS+A GG + RE
Sbjct: 279 TAGEDKLANNHANTQVPKLVGLARLYEITGKPGYRKASSFFWDRVVNHHSFAIGGNADRE 338

Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
           ++++P  +A  +  +  E+C TYNMLK++RHL+ WT   A+ DYYERA  N +++ Q   
Sbjct: 339 YFFEPDTIAKHITEQTCESCNTYNMLKLTRHLYAWTPNAAWFDYYERAHLNHIMAHQN-P 397

Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
           E G+  YM+PL  G  +  S     T  +SFWCC  +GIES SK GDSIY++ +     L
Sbjct: 398 ETGMFAYMVPLMSGTGREYS-----TPEDSFWCCVLSGIESHSKHGDSIYWQSDDT---L 449

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVWT 560
           ++  +I S   W       N+    + +  PY    + F   Q  G  + ++ +R+P W 
Sbjct: 450 FVNLFIPSKLTW-------NKAAFELTTQYPY-DSRVAFKVTQSSGAKAFTVAVRIPGWA 501

Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
            S+     +NG+         +      W   D +T+ LPL LR E    D      + A
Sbjct: 502 KSH--TLLVNGKPALAAIDKGYALIRRTWKAGDVVTLDLPLELRFEGTAGDD----KVVA 555

Query: 621 ILFGPYLLAGHTSGEWDIKTGTARSL 646
           +L GP +LA       D   G A +L
Sbjct: 556 LLRGPMVLAADLGAIEDSWQGDAPAL 581


>gi|313204495|ref|YP_004043152.1| hypothetical protein Palpr_2030 [Paludibacter propionicigenes WB4]
 gi|312443811|gb|ADQ80167.1| protein of unknown function DUF1680 [Paludibacter propionicigenes
           WB4]
          Length = 788

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 177/539 (32%), Positives = 268/539 (49%), Gaps = 38/539 (7%)

Query: 106 DVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHF 165
           DV L +S     A+  ++ YLL LD D L+  + K   L    + Y  WEN  + L GH 
Sbjct: 38  DVRLTESP-FKHAEDMDINYLLGLDADRLMAPYLKGGGLTPKAENYPNWEN--TGLDGHI 94

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE--- 220
            GHYLSA + M+A+T N  IKE++   +  L   Q+  G GYL   P   +++D  +   
Sbjct: 95  GGHYLSALSYMYAATGNTRIKERLDYSLNELKRAQDAAGDGYLGGTPNGRKIWDEIKKGT 154

Query: 221 ------ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYS 274
                  L   W P Y IHK  AGL D Y+   +  A  M   + ++ YN V  +     
Sbjct: 155 INASSFGLNGGWVPLYNIHKTYAGLRDAYLQGGSLLAKDMLIKLTDWMYNTVSGLTDAQV 214

Query: 275 VERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHAN 334
            E     L  E GG+N+V   + SIT + K+L LAH F     L  L    D L+  HAN
Sbjct: 215 QEM----LKSEHGGLNEVFADVASITGNKKYLELAHKFSHQTLLQLLLQHQDKLTGMHAN 270

Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
           T IP VIG +   ++ G+  +    +FF   V  + S + GG S RE +           
Sbjct: 271 TQIPKVIGFKRIADLEGNKDWSDAASFFWKTVVDNRSVSIGGNSVREHFHPSDNFTSMFE 330

Query: 395 SEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG 453
           SE   ETC TYNML++++ LF+ + E ++ DYYERAL N +LS Q   + G  +Y  P+ 
Sbjct: 331 SEQGPETCNTYNMLRLTKLLFQTSGEASFMDYYERALYNHILSTQDPIQGG-FVYFTPMR 389

Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
            G  +      +     SFWCC G+G+E+ ++ G+ IY  ++ +   LY+  +I S   W
Sbjct: 390 AGHYRV-----YSQPQTSFWCCVGSGLENHARYGEMIYGFKDND---LYVNLFIPSVLTW 441

Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
           K+ ++ + Q+ +    +       +   +K+    L +L++R P W   N  + S+NGQ+
Sbjct: 442 KAKNIRIEQQNN----FAKQEAADIIVDAKKTA--LFTLHIRKPEWVKDNDLKVSVNGQS 495

Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
            P+     +LS T  WS  DK+ ++LP+ LR     D+  EY    + L+GPY+LA  T
Sbjct: 496 TPVTIKDGYLSITRNWSKGDKVHLELPMQLRAVTTPDNAQEY----SFLYGPYVLAAKT 550


>gi|325679069|ref|ZP_08158663.1| hypothetical protein CUS_6624 [Ruminococcus albus 8]
 gi|324109193|gb|EGC03415.1| hypothetical protein CUS_6624 [Ruminococcus albus 8]
          Length = 791

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 193/538 (35%), Positives = 270/538 (50%), Gaps = 62/538 (11%)

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPISELRGHFVGHYLSASAQMWASTH 181
           + YLL  D D L+  FR+TA L   G   Y GWE+ +  + GH VGHY++A AQ +AS  
Sbjct: 29  IAYLLSFDTDRLLAGFRETAGLDMRGAVRYSGWEDDL--IGGHCVGHYMTAVAQAYASLQ 86

Query: 182 NATIKE----KMS-TVVFSLSECQNKIGTGYLSAFPTEL---------FDSFEA-----L 222
               +     K++ T    L ECQ  +GTG++  F  ++         FD+ E      +
Sbjct: 87  EGDSRRDALYKLAVTTTDGLKECQQALGTGFI--FGAKIIDKNNVEAQFDNVEKNLSNIM 144

Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
              W PYYT+HKILAG +D Y L     A  +A+ + ++ Y RV +    +S E     L
Sbjct: 145 TQAWVPYYTLHKILAGAIDIYRLTGYENAKTVASRLGDWVYRRVSR----WSEETQRTVL 200

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLGFLALQADYLSHFHANTHIPIVI 341
             E GGMND LY LY++T   +H + AH FD+ P F    A   + L++ HANT IP  +
Sbjct: 201 GIEYGGMNDCLYELYAVTGKEEHAIAAHCFDEVPLFENVYAGTENALNNKHANTTIPKFL 260

Query: 342 GSQMRYE------VTGDPL----YKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           G+  RY       V G+ +    Y      F D+V   HSY TGG S  E +     L  
Sbjct: 261 GALKRYAILDGRTVNGETVDAGRYLGYAERFWDMVVQKHSYITGGNSEWEHFGCDYVLDA 320

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
              + N ETC TYNMLK+SR LF  T E  YADYYE    N +LS Q   E G+  Y  P
Sbjct: 321 ERTNANCETCNTYNMLKLSRLLFEITGEKKYADYYENTFINAILSSQN-PETGMSTYFQP 379

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           +  G  K  S     T +  FWCC G+G+E+F+KLGDSIYF  EGN   L + QYISSS 
Sbjct: 380 MASGYFKVYS-----TPYTKFWCCTGSGMENFTKLGDSIYF-TEGNA--LIVNQYISSSA 431

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           +W    V + Q  D I + D     T  F    + G   SL LR+P W   + A  +++G
Sbjct: 432 EWSEKGVKVEQMTD-IPNSD-----TAKFMIHGKGG--ISLKLRLPDWLAGD-AVITVDG 482

Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           +       G +   +   +    + I+LP+ +R  ++ D++  Y       +GP +L+
Sbjct: 483 KAYDADINGGYAEVS-GIADGSVVEIKLPMEVRAHSLPDNKNTY----GFRYGPIVLS 535


>gi|379719928|ref|YP_005312059.1| hypothetical protein PM3016_2010 [Paenibacillus mucilaginosus 3016]
 gi|378568600|gb|AFC28910.1| hypothetical protein PM3016_2010 [Paenibacillus mucilaginosus 3016]
          Length = 641

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 181/531 (34%), Positives = 265/531 (49%), Gaps = 55/531 (10%)

Query: 153 GWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP 212
           GWE+P  ELRGH +GH+LSA+A ++  T +  +K K   +V  L+ CQ   G  +L+AFP
Sbjct: 76  GWESPTCELRGHIMGHWLSAAATIYGQTQDGLVKAKADYIVAELARCQEANGGEWLAAFP 135

Query: 213 TELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITM 272
                     K VWAP+YTIHK+L GL D Y LA +A AL++ T M  +FY    +    
Sbjct: 136 ESYMHRIARGKYVWAPHYTIHKLLMGLYDMYRLAGSAAALELMTNMAAWFY----RWTDG 191

Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
           ++ E     L+ ETGGM +    LY +T    HL L   +D+  F   L    D L++ H
Sbjct: 192 FTREEMDDLLDLETGGMLETWADLYGVTGSGAHLELVRRYDRRRFFDALLEGRDVLTNKH 251

Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSY-ATGGTSAREFWWDPKRLAD 391
           ANT IP ++G+   +EVTG+  Y+ I   F     +   Y ATG     E W     +A 
Sbjct: 252 ANTQIPEILGAARAWEVTGEERYRRIVEAFWRCAVSERGYTATGAGDNGELWMPQGEMAA 311

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
            LG+  +E C  YNM+++++ L RWT + AYADY+ER   NGVL+ Q G E G++ Y + 
Sbjct: 312 RLGA-GQEHCCNYNMMRLAQVLLRWTGDPAYADYWERRFVNGVLAHQHG-ETGMISYFIG 369

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           LG G  K      WGT    FWCC+GT +++ +     I+ EEE    GL + Q++ S  
Sbjct: 370 LGAGSRKT-----WGTPTGHFWCCHGTLMQANASYEGQIFMEEE---DGLAVCQWLPSKL 421

Query: 512 DWKSGHVVLNQKV--------DPIVSWD---------------PYLR-----MTLTFSSK 543
           +++ G   +  ++        +P+ SW                P  R       LTF ++
Sbjct: 422 EYEIGGTAIRLRIEQDGQHGLEPLSSWSVTGMTAITRADLPPIPVHRPDRFMYRLTFEAE 481

Query: 544 QEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP---PPGNFLSATERWSYNDKLTIQLP 600
           + V     L +R+P W  S     ++NG+  PL     P  F+     W   D +T++LP
Sbjct: 482 RAV--TFKLRMRLP-WWLSGEPVITVNGEA-PLQGELKPSTFVELEREWKSGDTITVELP 537

Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALIS 651
             L+ EA+    P      A L GP +LAG T+ E  I TG       L++
Sbjct: 538 KGLKAEAL----PGEPGTVAFLDGPIVLAGLTAEE-RILTGNLEQPETLLA 583


>gi|427384528|ref|ZP_18881033.1| hypothetical protein HMPREF9447_02066 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727789|gb|EKU90648.1| hypothetical protein HMPREF9447_02066 [Bacteroides oleiciplenus YIT
           12058]
          Length = 1145

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 180/561 (32%), Positives = 278/561 (49%), Gaps = 39/561 (6%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQQ + ++LL LD D L+  F K A LP  G+ YGGWE      RG     Y+SA A MW
Sbjct: 421 AQQLDAKWLLSLDPDRLLHRFHKNAGLPPKGENYGGWEEHRGGGRGLGH--YMSACAMMW 478

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------LKPVWAP 228
           AST     K++   V+  L  CQ   GTGY+ +    ++              L     P
Sbjct: 479 ASTGEPEFKQRTDYVINELERCQKARGTGYIGSVEDSIWTQVGRGDIRSTGFDLNGGIVP 538

Query: 229 YYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGG 288
           ++ +HK+ AGL D Y+   N +A  +   + ++ Y +   +    + E+    L  E GG
Sbjct: 539 WFILHKLFAGLYDIYIYTGNEKAKTVLVNLCDWAYRQFGNL----NDEQWQKMLACEHGG 594

Query: 289 MNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYE 348
           M +VL  +YSI  D K+L ++H FD   F   L+ Q D L+  HANT IP V+G + R++
Sbjct: 595 MLEVLANVYSIVGDKKYLDMSHWFDHKQFFSPLSHQVDSLAGLHANTQIPKVVGLERRHQ 654

Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLK 408
           +T     K+   FF + V  +H+Y  GG    E +     L++ L     ETC TYNMLK
Sbjct: 655 LTHSEEDKVKSHFFWETVVKNHTYCIGGNGDGEHFGPKGILSNRLSDRTAETCNTYNMLK 714

Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
           +++ L   T +  Y DYYE+AL N +L+ Q   E G+  Y +PL  G  K     G+ + 
Sbjct: 715 LTKMLLAETGDTKYGDYYEKALYNHILASQ-NPETGMTTYYVPLVAGGKK-----GYSSA 768

Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
           F +F CC GTG E+ ++ G++IYF+   N   L +  YI S+  W+   + + Q+     
Sbjct: 769 FETFTCCVGTGFENHARYGEAIYFKGRKN--NLLVNLYIPSALTWEETGITIRQE----G 822

Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATE 587
           +++   ++  T +S +   + +SL  RMP WT +   +  +NG+ +  P  PG +L  T 
Sbjct: 823 AYEKNGKVKFTINSSKP--KKASLFFRMPYWTTAK-TEVKVNGRKIDNPVIPGMYLEITG 879

Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLS 647
            W  ND + I   + + TE      P+  +  AI +GP +LAG    +   K    + + 
Sbjct: 880 EWKKNDIIEIHFDMPVYTEPT----PDNPNRLAIKYGPLVLAGKLGNK---KIDPVKDIP 932

Query: 648 ALISPIPPSFNAQLVTFTQES 668
            LI    P  N  +   +Q+S
Sbjct: 933 VLIVDDKP-VNEWVSRISQDS 952


>gi|337745980|ref|YP_004640142.1| hypothetical protein KNP414_01710 [Paenibacillus mucilaginosus
           KNP414]
 gi|336297169|gb|AEI40272.1| protein of unknown function DUF1680 [Paenibacillus mucilaginosus
           KNP414]
          Length = 636

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 181/531 (34%), Positives = 265/531 (49%), Gaps = 55/531 (10%)

Query: 153 GWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP 212
           GWE+P  ELRGH +GH+LSA+A ++  T +  +K K   +V  L+ CQ   G  +L+AFP
Sbjct: 71  GWESPTCELRGHIMGHWLSAAATIYGQTQDGLVKAKADYIVAELARCQEANGGEWLAAFP 130

Query: 213 TELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITM 272
                     K VWAP+YTIHK+L GL D Y LA +A AL++ T M  +FY    +    
Sbjct: 131 ESYMHRIARGKYVWAPHYTIHKLLMGLYDMYRLAGSAAALELMTNMAAWFY----RWTDG 186

Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
           ++ E     L+ ETGGM +    LY +T    HL L   +D+  F   L    D L++ H
Sbjct: 187 FTREEMDDLLDLETGGMLETWADLYGVTGSGAHLELVRRYDRRRFFDALLEGRDVLTNKH 246

Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSY-ATGGTSAREFWWDPKRLAD 391
           ANT IP ++G+   +EVTG+  Y+ I   F     +   Y ATG     E W     +A 
Sbjct: 247 ANTQIPEILGAARAWEVTGEERYRRIVEAFWRCAVSERGYTATGAGDNGELWMPQGEMAA 306

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
            LG+  +E C  YNM+++++ L RWT + AYADY+ER   NGVL+ Q G E G++ Y + 
Sbjct: 307 RLGA-GQEHCCNYNMMRLAQVLLRWTGDPAYADYWERRFVNGVLAHQHG-ETGMISYFIG 364

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           LG G  K      WGT    FWCC+GT +++ +     I+ EEE    GL + Q++ S  
Sbjct: 365 LGAGSRKT-----WGTPTGHFWCCHGTLMQANASYEGQIFMEEE---DGLAVCQWLPSKL 416

Query: 512 DWKSGHVVLNQKV--------DPIVSWD---------------PYLR-----MTLTFSSK 543
           +++ G   +  ++        +P+ SW                P  R       LTF ++
Sbjct: 417 EYEIGGTAIRLRIEQDGQHGLEPLSSWSVTGMTAITRADLPPIPVHRPDRFMYRLTFEAE 476

Query: 544 QEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP---PPGNFLSATERWSYNDKLTIQLP 600
           + V     L +R+P W  S     ++NG+  PL     P  F+     W   D +T++LP
Sbjct: 477 RAV--TFKLRMRLP-WWLSGEPVITVNGEA-PLQGELKPSTFVELEREWKSGDTITVELP 532

Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALIS 651
             L+ EA+    P      A L GP +LAG T+ E  I TG       L++
Sbjct: 533 KGLKAEAL----PGEPGTVAFLDGPIVLAGLTAEE-RILTGNLEQPETLLA 578


>gi|117920524|ref|YP_869716.1| acetyl-CoA carboxylase, biotin carboxylase [Shewanella sp. ANA-3]
 gi|117612856|gb|ABK48310.1| acetyl-CoA carboxylase, biotin carboxylase [Shewanella sp. ANA-3]
          Length = 795

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 178/554 (32%), Positives = 276/554 (49%), Gaps = 54/554 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L  + L+DV L     L  AQQT+L Y++ +D + L+  +RK A + T    Y  WEN  
Sbjct: 28  LTPIPLNDVRLTAGPFL-HAQQTDLAYIMSMDPERLLAPYRKAAGIATTADNYPNWEN-- 84

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
           + L GH  GHYLSA A M+A+T +  +  +++ +V  L +CQ   G GY+   P   +L+
Sbjct: 85  TGLDGHIGGHYLSALALMYAATGDQAVLSRLNYMVAELEKCQQAHGNGYVGGVPHGDKLW 144

Query: 217 DSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFY 263
               A         L   W P+Y +HK+ AGL D Y+   N  A KM    A WM++   
Sbjct: 145 QQVAAGHIEADLFTLNQSWVPWYNVHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDLSR 204

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
           N         S E+    L  E GG+N+ L  +YSIT   K+L LA+ +     L  L  
Sbjct: 205 N--------LSDEQLQLMLRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQ 256

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
             D L+  HANT IP ++G     E++ +  +     +F   V    + + GG S RE++
Sbjct: 257 HQDKLTGLHANTQIPKIVGVARIAELSNNKEWLESADYFWQQVVHQRTVSIGGNSVREYF 316

Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
              +  +  L S E  ETC TYNMLK+S+ L+   +++ Y DYYERAL N +LS Q   +
Sbjct: 317 HPSEDFSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-PQ 375

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
            G ++Y  P+     +      + +   S WCC G+GIE+ +K G+ IY EE+ N   L+
Sbjct: 376 TGGLVYFTPM-----RPDHYRVYSSAQESMWCCVGSGIENHAKYGELIYAEEDNN---LF 427

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS--SLNLRMPVWT 560
           +  ++ S   WK+  + L+QK           +     +S+  + Q +  +LNLR P W 
Sbjct: 428 VNLFVDSEVHWKAKGISLSQKT----------QFPDDNTSQMIIHQEADFTLNLRYPTWA 477

Query: 561 YSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
                  S+NG+     P  G ++  T  W   D +TI LP+ +  E +    P+ ++  
Sbjct: 478 -KGEVTVSINGEPQRFTPTQGQYIPLTRHWRKGDSVTITLPMDISLEQL----PDKSAYY 532

Query: 620 AILFGPYLLAGHTS 633
           ++L+GP +LA  T+
Sbjct: 533 SVLYGPIVLAAKTA 546


>gi|383641951|ref|ZP_09954357.1| hypothetical protein SchaN1_14318 [Streptomyces chartreusis NRRL
           12338]
          Length = 768

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 187/545 (34%), Positives = 269/545 (49%), Gaps = 53/545 (9%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
           WLD        Q     YL  +DVD L+++FR    L T G A  GGW+ P    R H  
Sbjct: 62  WLDN-------QDRTRNYLRFVDVDRLLYNFRANHRLSTNGAAANGGWDAPDFPFRTHVQ 114

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFE- 220
           GH+L+A AQ++A T + T ++K +T+V  L++CQ    T     GYLS +P   F + E 
Sbjct: 115 GHFLTAWAQLYAVTGDTTCRDKATTMVAELAKCQANNSTAGFNAGYLSGYPESDFTALEQ 174

Query: 221 -ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRV--QKVITMY 273
             L     PYYTIHK L GLLD +    + QA    L +A W V++   R+  Q++  M 
Sbjct: 175 RTLSNGNVPYYTIHKTLVGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRLSGQQMQAM- 232

Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
                   L  E GGMN VL  LY  T D + L +A  FD       LA   D LS  HA
Sbjct: 233 --------LQTEFGGMNTVLTDLYQQTGDARWLTVARRFDHAAVFDPLAAGQDQLSGLHA 284

Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
           NT +P  IG+   Y+ TG   Y+ I T   +I   SH+YA GG S  E +  P  +A  L
Sbjct: 285 NTQVPKWIGAAREYKATGTTRYRDIATNAWNICVNSHTYAIGGNSQAEHFRAPNAIAGFL 344

Query: 394 GSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLP 451
             +  E+C T+NML ++R LF      +A  DYYERA  N ++  Q    + G + Y  P
Sbjct: 345 NKDTCESCNTFNMLTLTRELFALDPNRVALFDYYERAWLNQMIGQQNPADDHGHVTYFTP 404

Query: 452 LG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           L     RGV  A     W T + +FWCC GTG+E  ++L DSIYF  +     L +  ++
Sbjct: 405 LNPGGRRGVGPAWGGGTWSTDYGTFWCCQGTGLEMHTRLMDSIYFRSDNT---LIVNMFV 461

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S  +W    + + Q      S    L +T   S         ++ +R+P WT   GA  
Sbjct: 462 PSVLNWSERGITVTQTTSYPNSDTTTLHVTGNASGTW------AMRIRIPSWT--TGATV 513

Query: 568 SLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
           S+NG    +   PG++ + +  W+  D +T++LP+ +    I     + A++ AI +GP 
Sbjct: 514 SVNGVAQTITTTPGSYATLSRSWASGDTVTVRLPMRV----IMRAANDNANVAAITYGPV 569

Query: 627 LLAGH 631
           +L+G+
Sbjct: 570 VLSGN 574


>gi|418466296|ref|ZP_13037222.1| secreted protein [Streptomyces coelicoflavus ZG0656]
 gi|371553101|gb|EHN80323.1| secreted protein [Streptomyces coelicoflavus ZG0656]
          Length = 773

 Score =  262 bits (670), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 179/538 (33%), Positives = 259/538 (48%), Gaps = 40/538 (7%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
           WLD        Q   L YL  +DVD L+ +FR    L T G A  GGWE P    R H  
Sbjct: 64  WLDN-------QSRTLSYLRFVDVDRLLHNFRANHRLSTNGAAATGGWEAPDFPFRSHVQ 116

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
           GH+L+A AQ +A T +   ++K   +V  L++CQ        GTGYLS +P   F + E+
Sbjct: 117 GHFLTAWAQAYAVTGDTACRDKALYMVAELAKCQANNGAAGFGTGYLSGYPESDFAALES 176

Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
             L     PYYTIHK LAGLL+ + L  + +A  +   +  +   R  ++    S  R  
Sbjct: 177 GTLNNGNVPYYTIHKTLAGLLEVWRLLGSTRARDVLLALAGWVDRRTGRL----STTRMQ 232

Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
             L  E GGMN VL  L   T D + L +A  FD       LA   D L+  HANT +P 
Sbjct: 233 AVLGTEFGGMNAVLTDLCQQTGDTRWLAVAQRFDHAAVFDPLAANQDRLAGLHANTQVPK 292

Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE 399
            IG+   Y+ TG   Y+ I T   ++   +H+YA GG S  E +  P  +A  L ++  E
Sbjct: 293 WIGAVREYKATGSTRYRDIATNAWNMCVTTHTYAVGGNSQAEHFRPPNAIAAHLANDTCE 352

Query: 400 TCTTYNMLKVSRHLFRWTKEIAYA-DYYERALTNGVLSIQRGTEP-GVMIYMLPLG---- 453
           +C T NML ++R LF  + + A   DYYE+A  N ++  Q   +P G + Y  PL     
Sbjct: 353 SCNTVNMLGLTRELFALSPDRAELFDYYEQAWLNHMIGQQNPADPHGHVTYFTPLKPGGR 412

Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
           RGV  A     W T + +FWCC GTG+E  ++L DS+YF + G    L +  ++ S   W
Sbjct: 413 RGVGPAWGGGTWSTDYTTFWCCQGTGLEMHTRLMDSVYFHDGGTT--LTVNLFVPSVLTW 470

Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG-Q 572
               + + Q      S    LR+T       +     ++ +R+P WT   GA  S+NG +
Sbjct: 471 AERGITVTQSTSYPASDTTTLRIT------GDAAGTWAMRVRIPGWT--TGAVVSVNGVR 522

Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
                 PG + +    W   D +T++LP+        DD     ++ A+  GP +L+G
Sbjct: 523 QHVTAAPGTYATLDRAWDSGDTVTVRLPMRTVVRPANDD----PAVGAVTHGPVVLSG 576


>gi|251795999|ref|YP_003010730.1| hypothetical protein Pjdr2_1987 [Paenibacillus sp. JDR-2]
 gi|247543625|gb|ACT00644.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
          Length = 626

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 178/523 (34%), Positives = 251/523 (47%), Gaps = 58/523 (11%)

Query: 147 PGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTG 206
           P   + GWE+   ELRGH +GH+LSA+AQ++A T +A +K K   +V  L  CQ   G  
Sbjct: 65  PEHWHWGWESVTCELRGHIMGHWLSAAAQIYAQTSDALVKAKADYIVEELVRCQEANGGE 124

Query: 207 YLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRV 266
           +L+AFP            VWAP+YTIHK+L GL D Y +A N QAL++   + ++FY   
Sbjct: 125 WLAAFPESYMHRIAKGSFVWAPHYTIHKLLMGLYDMYAIAGNEQALRVMRGIADWFY--- 181

Query: 267 QKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
            K    +S E     L+ ETGGM +V   LY IT + KHL L   +D+  F   L    D
Sbjct: 182 -KWTGNFSQEEMDELLDLETGGMLEVWADLYGITKEDKHLNLVKRYDRRRFFDALLEGQD 240

Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSY-ATGGTSAREFWWD 385
            L++ HANT IP ++G+   +EVTG+  Y+ I   F  +      Y ATG     E W  
Sbjct: 241 VLTNKHANTQIPEILGAARAWEVTGEDRYRRIVEAFWRLAVTDRGYVATGAGDNGELWMP 300

Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
              +   LG   +E C  YNM++++  L RWT + AYADY+ER   NGVL+ Q G + G+
Sbjct: 301 RGEMGSRLGV-GQEHCCNYNMMRLAHVLLRWTGDPAYADYWERRFYNGVLAHQHG-DTGM 358

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
           + Y L +G G  K+     WGT    FWCC+GT +++ +     I+ E+E    G+ I Q
Sbjct: 359 ISYFLGMGAGSKKS-----WGTPTQHFWCCHGTLMQANAAYESQIFMEDEN---GIAICQ 410

Query: 506 YISSSF-------------------------DWKSGHVVLNQKVD--PIVSWDPYLRMTL 538
           +I S                           +W    +    KVD  PI    P  R   
Sbjct: 411 WIPSELQLSRADGNLRIRIEQDGQYGVYPLNNWSVKGMTAITKVDMPPIPEHRPD-RFVY 469

Query: 539 TFSSKQEVGQLSSLNLRMPVWTYS------NGAQASLNGQNLPLPPPGNFLSATERWSYN 592
           T +   E      L LR+P W         NG+Q   N        P ++ +    WS  
Sbjct: 470 TVTIGLEHASTFELKLRLPWWLSGPPVIRVNGSQVEQNEAK-----PSSYTAIAREWSNG 524

Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           D +T++LP +L  E +  D   Y    A   GP ++AG T  E
Sbjct: 525 DVVTVELPKTLTMEPLPGDTGTY----AFFDGPIVMAGLTEEE 563


>gi|113970330|ref|YP_734123.1| hypothetical protein Shewmr4_1993 [Shewanella sp. MR-4]
 gi|113885014|gb|ABI39066.1| protein of unknown function DUF1680 [Shewanella sp. MR-4]
          Length = 795

 Score =  262 bits (669), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 177/554 (31%), Positives = 276/554 (49%), Gaps = 54/554 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L  + L+DV L     L  AQQT+L Y++ +D + L+  +RK A + T    Y  WEN  
Sbjct: 28  LTPIPLNDVRLTAGPFL-HAQQTDLAYIMSMDPERLLAPYRKEAGIATTADNYPNWEN-- 84

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
           + L GH  GHYLSA A M+A+T +  + E+++ +V  L +CQ   G GY+   P   +L+
Sbjct: 85  TGLDGHIGGHYLSALALMYAATGDQAVLERLNYMVAELEKCQQAHGNGYVGGVPHGDKLW 144

Query: 217 DSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFY 263
               A         L   W P+Y +HK+ AGL D Y+   N  A KM    A WM++   
Sbjct: 145 QQVAAGHIEADLFTLNQSWVPWYNLHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDLSR 204

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
           N   + + +         L  E GG+N+ L  +YSIT   K+L LA+ +     L  L  
Sbjct: 205 NLTDEQLQLM--------LRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQ 256

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
             + L+  HANT IP ++G     E++ +  +     +F   V    + + GG S RE +
Sbjct: 257 HQEKLTGLHANTQIPKIVGVARIAELSHNKAWLESADYFWQQVVHQRTVSIGGNSVREHF 316

Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
              +  +  L S E  ETC TYNMLK+S+ L+   +++ Y DYYERAL N +LS Q   +
Sbjct: 317 HPSEDFSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-PQ 375

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
            G ++Y  P+     +      + +   S WCC G+GIE+ +K G+ IY EE+ N   L+
Sbjct: 376 TGGLVYFTPM-----RPDHYRVYSSAQESMWCCVGSGIENHAKYGELIYAEEDNN---LF 427

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS--SLNLRMPVWT 560
           +  ++ S  +WK+  + L+QK           +     +S+  + Q +  +LNLR P W 
Sbjct: 428 VNLFVDSEVNWKAKGISLSQKT----------QFPDDNTSQMIIHQEADFTLNLRYPTWA 477

Query: 561 YSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
             +    S+NG+     P  G ++  T  W   D +TI LP+ +  E + D    Y    
Sbjct: 478 KGD-VTVSINGEPQRFTPTQGQYIPLTRHWRKGDSVTITLPMDISLEQLPDKTAYY---- 532

Query: 620 AILFGPYLLAGHTS 633
           ++L+GP +LA  T+
Sbjct: 533 SVLYGPIVLAAKTA 546


>gi|433676676|ref|ZP_20508761.1| hypothetical protein BN444_00795 [Xanthomonas translucens pv.
           translucens DSM 18974]
 gi|430818203|emb|CCP39076.1| hypothetical protein BN444_00795 [Xanthomonas translucens pv.
           translucens DSM 18974]
          Length = 807

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 176/551 (31%), Positives = 268/551 (48%), Gaps = 51/551 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           LK+V+L      + S+   + QTN  YLL L+ D L+ +F + A LP  G+ YGGWE   
Sbjct: 65  LKQVTL------KPSLFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGEVYGGWEG-- 116

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE---- 214
             + GH +GHYLSA A+M A T +A +++++  +V  L+  Q K   GY+     +    
Sbjct: 117 DTIAGHTLGHYLSALAKMHAQTRDAALRQRIDYIVAELARAQAKDPDGYVGGLTRKNDKG 176

Query: 215 -------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
                  +F+             L   W+P YT+HK+ AGLLD + LA NAQAL++   +
Sbjct: 177 AIDNGKLVFEEVRRGIIKGSKFNLNGSWSPLYTVHKLFAGLLDAHELAGNAQALQVLLPL 236

Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
             Y       V       +    L+ E GG+N+    L + T DP+ + L         +
Sbjct: 237 AGYLGG----VFDALDHAQMQALLDTEFGGLNESYIELGARTGDPRWIALGKRLRHEKVI 292

Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
              A   D L H HANT +P  IG   ++EV GD        FF + V   +SY  GG +
Sbjct: 293 DPAAAGRDELPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTGHYSYVIGGNA 352

Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
            RE++ +P  +A  L  +  E C +YNMLK++RHL++WT +  Y DYYER L N  ++ Q
Sbjct: 353 DREYFQEPDTIAAFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAAQ 412

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
                G+  YM P+  G  +     G+  KF+SFWCC G+G+E+ ++ GDSIY+++  + 
Sbjct: 413 H-PATGMFTYMTPMIGGGER-----GFSDKFDSFWCCVGSGMEAHAQFGDSIYWQDAAS- 465

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             LY+  YI S+ DW    + L  ++D  V  +  +R+ L  +  +   +L         
Sbjct: 466 --LYVNLYIPSTLDWPERDLAL--ELDSGVPDNGKVRLQLRCAGARTPRRLLLRLP---A 518

Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
           W    G    LNG+         +L+   RW   D + + L + LR E    D    A  
Sbjct: 519 WC-QGGYTLRLNGKAQRGTAADGYLALERRWRSGDMIELDLAMPLRLEHAAGD----ADT 573

Query: 619 QAILFGPYLLA 629
             ++ GP  LA
Sbjct: 574 VVVMRGPLALA 584


>gi|238059692|ref|ZP_04604401.1| secreted protein [Micromonospora sp. ATCC 39149]
 gi|237881503|gb|EEP70331.1| secreted protein [Micromonospora sp. ATCC 39149]
          Length = 740

 Score =  261 bits (666), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 181/539 (33%), Positives = 268/539 (49%), Gaps = 41/539 (7%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
           WLD        Q   L YL  +DVD L+++FR    L T G A  GGW+ P    R H  
Sbjct: 28  WLDN-------QNRTLSYLRFVDVDRLLYNFRANHRLSTNGAASNGGWDAPSFPFRTHVQ 80

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEA 221
           GH+L+A AQ +A   + T ++K + +V  L++CQ   G      GYLS FP   F + EA
Sbjct: 81  GHFLTAWAQAYAVLGDTTCRDKANYMVAELAKCQANNGAAGFTAGYLSGFPESDFTALEA 140

Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
             L     PYY IHK L GLLD +    N QA  +   +  +   R  ++    S  +  
Sbjct: 141 RTLSNGNVPYYCIHKTLLGLLDVWRYIGNTQARSVLLALAGWVDTRTARL----SSSQMQ 196

Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
             L  E GGMN+ L  LY  T D + L +A  FD       LA  +D L+  HANT +P 
Sbjct: 197 AMLGTEFGGMNEALADLYQQTGDGRWLTVAQRFDHAAVFNPLAANSDQLNGLHANTQVPK 256

Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE 399
            IG+   Y+ TG   Y+ I +   ++   +H+YA GG S  E +  P  +A  L ++  E
Sbjct: 257 WIGAAREYKATGTTRYRDIASNAWNMTVNAHTYAIGGNSQAEHFRAPNAIAGYLTNDTCE 316

Query: 400 TCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG---- 453
            C T NMLK++R L+     + AY DY+ERAL N V+  Q   +  G + Y  PL     
Sbjct: 317 HCNTVNMLKLTRELWLIDPNQAAYFDYFERALANHVIGAQNPADGHGHVTYFTPLKPGGR 376

Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
           RGV  A     W T ++SFWCC GTGIE  ++L DSIYF    N   L +  +  S+ +W
Sbjct: 377 RGVGPAWGGGTWSTDYDSFWCCQGTGIEINTRLMDSIYFH---NGTTLTVNLFAPSTLNW 433

Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
               + + Q  +  V     L ++ T S         S+ +R+P W  ++GA  ++NG  
Sbjct: 434 SQRGITVTQSTNYPVGDTTTLTLSGTMSGSW------SIRVRIPAW--ASGATIAVNGAT 485

Query: 574 LPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
             +   PG++ + T  W+  D +T++LP+ +    +     + A++ A+ +GP +L G+
Sbjct: 486 QSVATTPGSYATVTRTWASGDTITVRLPMRV----VLSPANDNAAVAAVTYGPMVLCGN 540


>gi|399074049|ref|ZP_10750795.1| hypothetical protein PMI01_01867 [Caulobacter sp. AP07]
 gi|398040822|gb|EJL33912.1| hypothetical protein PMI01_01867 [Caulobacter sp. AP07]
          Length = 775

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 177/573 (30%), Positives = 273/573 (47%), Gaps = 48/573 (8%)

Query: 77  VSWALLYRKIKNPGGFDLPGNFLKE-VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLV 135
            S A+ +    +PG     G  + E V    V L + S+  +AQ  N  YL+ L  D L+
Sbjct: 14  ASSAMAFVGAASPGLAAPAGRVVAEPVPARHVAL-KPSIFQQAQAANRAYLVSLSADRLL 72

Query: 136 WSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFS 195
            +F + A L      YGGWE     + GH +GHYL+A A   A T +  + ++++ +V  
Sbjct: 73  HNFHQGAGLSVKAPVYGGWE--AQSIAGHTLGHYLTACALQVAGTGDPVLSDRLTYIVAE 130

Query: 196 LSECQNKIGTGYL----------SAFPTELFDSFE---------ALKPVWAPYYTIHKIL 236
           L+  Q   G GY+          +A   ++F+            +L   W P YT HK+ 
Sbjct: 131 LARVQAAHGDGYVGGTTRWGQSDAAGGKQVFEELRRGDIRASRFSLNDGWVPIYTWHKVH 190

Query: 237 AGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRL 296
           AGLLD + LA   +AL +A  +  YF      ++   S  +    L  E GG+N+     
Sbjct: 191 AGLLDAHRLAGTPRALAVAVGLAGYFAT----IVEGLSDAQVQQILITEHGGINEAYAET 246

Query: 297 YSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYK 356
           Y++T D + L +A        L  +A   D L+  HANT IP VIG    YEV GDP   
Sbjct: 247 YALTGDERWLKVARRLRHKAVLDPIAEGRDELAGLHANTQIPKVIGLARLYEVGGDPAEA 306

Query: 357 LIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRW 416
               FF  +V  +HSY  GG S RE +  P  +A  +     E C TYNMLK++R L+ W
Sbjct: 307 RAARFFHQVVTENHSYVIGGNSDREHFGKPNEIARHMAETTCEACNTYNMLKLTRRLWSW 366

Query: 417 TKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCY 476
               A  DYYERA  N +++ QR ++ G+ +Y +P+  G  ++ S     T  +SFWCC 
Sbjct: 367 APNGALFDYYERAQLNHIMAHQRPSD-GMFVYFMPMAAGGRRSYS-----TPEDSFWCCV 420

Query: 477 GTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM 536
           G+G+ES +K  DSI++   G+   LY+  ++ S  D   G   ++  +D     +  +R+
Sbjct: 421 GSGMESHAKHADSIWW-RGGDT--LYLNLFLPSRLDLPDGDFAID--LDTRYPAEGLVRL 475

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLT 596
           ++  +   E      + LR+P W  +      +NG  +  P    +     RW   D++ 
Sbjct: 476 SVVRAPSAE----REIALRLPAWCAA--PLVKVNGAAIGRPGRDGYARLKRRWKAGDRIE 529

Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           + LP+ LR E   DD     ++ A + GP +LA
Sbjct: 530 LVLPMHLRAEPTPDD----PNLVAFVSGPLVLA 558


>gi|440730056|ref|ZP_20910155.1| hypothetical protein A989_02030 [Xanthomonas translucens DAR61454]
 gi|440379682|gb|ELQ16270.1| hypothetical protein A989_02030 [Xanthomonas translucens DAR61454]
          Length = 807

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 179/576 (31%), Positives = 277/576 (48%), Gaps = 53/576 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           LK+V+L      + S+   + QTN  YLL L+ D L+ +F + A LP  G+ YGGWE   
Sbjct: 65  LKQVTL------KPSLFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGEVYGGWEG-- 116

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE---- 214
             + GH +GHYLSA A+M A T +A +++++  +V  L+  Q K   GY+     +    
Sbjct: 117 DTIAGHTLGHYLSALAKMHAQTRDAALRQRIDYIVAELARAQAKDPDGYVGGLTRKNDKG 176

Query: 215 -------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
                  +F+             L   W+P YT+HK+ AGLLD + LA NAQAL++   +
Sbjct: 177 AIDNGKLVFEEVRRGIIKGSKFNLNGSWSPLYTVHKLFAGLLDAHALAGNAQALQVLLPL 236

Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
             Y       V       +    L+ E GG+N+    L + T DP+ + L         +
Sbjct: 237 AGYLGG----VFDALDHAQMQTLLDTEFGGLNESYIELGARTGDPRWIALGKRLRHEKVI 292

Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
              A   D L H HANT +P  IG   ++EV GD        FF + V   +SY  GG +
Sbjct: 293 DPAAAGRDELPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTGHYSYVIGGNA 352

Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
            RE++ +P  +A  L  +  E C +YNMLK++RHL++WT +  Y DYYER L N  ++ Q
Sbjct: 353 DREYFQEPDTIAAFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAAQ 412

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
                G+  YM P+  G  +     G+  KF+SFWCC G+G+E+ ++ GDSIY++   + 
Sbjct: 413 H-PATGMFTYMTPMISGGER-----GFSDKFDSFWCCVGSGMEAHAQFGDSIYWQ---DA 463

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             LY+  YI S+ DW    + L  ++D  V  +  +R+ L     +  G  +   L + +
Sbjct: 464 VSLYVNLYIPSTLDWPERDLTL--ELDSGVPDNGKVRLQL-----RRAGARTPRRLLLRL 516

Query: 559 WTYSNGAQA-SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
             +  GA    +NG++        +L+   +W   D + + L + LR E    D    A 
Sbjct: 517 PAWCQGAYTLRVNGKSQRGTAADGYLALERQWRSGDVIELDLAMPLRLEHAAGD----AD 572

Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPI 653
              ++ GP  LA       D       +L A   P+
Sbjct: 573 TVVVMRGPLALAADLGPVADPYDAPDPALVAAADPL 608


>gi|332185145|ref|ZP_08386894.1| hypothetical protein SUS17_217 [Sphingomonas sp. S17]
 gi|332014869|gb|EGI56925.1| hypothetical protein SUS17_217 [Sphingomonas sp. S17]
          Length = 782

 Score =  259 bits (662), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 188/554 (33%), Positives = 268/554 (48%), Gaps = 55/554 (9%)

Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYL 170
           + S+   A +TN  YL  LD D L+ +FR  A L      YGGWE+    + GH +GHY+
Sbjct: 39  RPSIYATAVETNRRYLYRLDPDRLLHNFRLYAGLKPKAPIYGGWES--DTIAGHTLGHYM 96

Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSF 219
           SA    W  T +  ++ +   +V  L+E Q K GTGY+ A              E+F   
Sbjct: 97  SALVLTWQQTGDTEMRRRADYIVSELAEAQAKRGTGYVGALGRKRADGTIVDGEEIFHEI 156

Query: 220 EA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
            A         L   W+P YT+HK+ AGLLD +    NAQAL +A  +  YF     +V 
Sbjct: 157 MAGKIKSGGFDLNGSWSPLYTVHKLFAGLLDIHGGWGNAQALDVAVKLGGYF----ARVF 212

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
                 R    L  E GG+N+    LY  T D + L LA        L  L    D L++
Sbjct: 213 AALDDARLQDVLGCEYGGLNESFAELYQRTGDRQWLALAERIYDNKVLDPLVAGKDQLAN 272

Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
            HANT +P +IG    +E+T  P       FF + V   HSY  GG + RE++ +P  +A
Sbjct: 273 LHANTQVPKLIGLARIHEITAAPAPAAGARFFWENVTGHHSYVIGGNADREYFSEPDTIA 332

Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
             +  +  E C +YNMLK++RHL+ W  +    DYYERA  N V++ Q     G   YM 
Sbjct: 333 RHITEQTCEHCNSYNMLKLTRHLYGWQPDGRLFDYYERAHLNHVMAAQHPVHAG-FTYMT 391

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           PL  G+++  ST     K ++FWCC G+G+ES +K G+SI++ + G+   L++  YI + 
Sbjct: 392 PLMTGMAREFST----DKDDAFWCCVGSGMESHAKHGESIFW-QGGDT--LFVNLYIPAE 444

Query: 511 FDW-KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG-AQAS 568
             W K G VV      P+          L FS     G+   + LR+P W  +NG A   
Sbjct: 445 ARWDKRGAVVTLDTAYPMDG-----AAKLAFSRLDRAGRF-PVALRVPGW--ANGQAAVE 496

Query: 569 LNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
           +NGQ  P+ P     +     RW   D + I+LPL LR E    D     S+ A++ GP 
Sbjct: 497 VNGQ--PVTPVFERGYAVVDRRWKTGDTVAIRLPLDLRVEPTPGDD----SVVAVVRGPM 550

Query: 627 LLA---GHTSGEWD 637
           ++A   G T+  WD
Sbjct: 551 VMAADLGPTTTPWD 564


>gi|407790778|ref|ZP_11137869.1| Acetyl-CoA carboxylase, biotin carboxylase [Gallaecimonas
           xiamenensis 3-C-1]
 gi|407202325|gb|EKE72317.1| Acetyl-CoA carboxylase, biotin carboxylase [Gallaecimonas
           xiamenensis 3-C-1]
          Length = 780

 Score =  259 bits (661), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 185/556 (33%), Positives = 269/556 (48%), Gaps = 57/556 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L+ + L +V L   S   +AQ TN  YL  LD D L+  FR  A LP P   YG WE   
Sbjct: 20  LETLPLQEVRL-LPSPFKQAQDTNRHYLDSLDPDRLLAPFRAEAGLPQPKPGYGNWE--A 76

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
             L GH  GHYLSA + M+AST +  +  ++  ++  L +CQ+K+GTGY+   P      
Sbjct: 77  DGLGGHMGGHYLSALSLMYASTGDPALLARLQYMLDELKKCQDKLGTGYIGGVPGGSALW 136

Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKM-------ATWMVE 260
                 ++      L   W P+Y +HK+ AGL D Y    +AQAL M         W+VE
Sbjct: 137 QQIHQGDIQADLFTLNQKWVPWYNLHKLYAGLRDAYRYTGSAQALAMWIKLSDWTDWLVE 196

Query: 261 YFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF 320
              +   + +           L  E GGMN+V   LY IT   K+L LA  F +   L  
Sbjct: 197 GLSDEQMQAM-----------LVTEYGGMNEVFADLYEITGQDKYLQLAKRFSQQQLLQP 245

Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR 380
           LA   D L+  HANT IP VIG +   +V+GD        +F   V    + A GG S R
Sbjct: 246 LAHGQDQLNGLHANTQIPKVIGFERIAQVSGDRAMGAAADYFWHQVVEQRTVAIGGNSVR 305

Query: 381 EFWWDPKRLADTLGSENE--ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
           E  + PK    ++  E E  ETC +YNMLK++R L++    + Y  YYERAL N +L+ Q
Sbjct: 306 EH-FHPKDDFSSMVEEVEGPETCNSYNMLKLARLLYQRQGGLDYLAYYERALYNHILASQ 364

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
              + G ++Y  P+     +      +     + WCC G+GIES SK G  IY  ++   
Sbjct: 365 H-PDDGGLVYFTPM-----RPNHYRVYSQADKAMWCCVGSGIESHSKYGAMIYATDQS-- 416

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             LYI  +I S  DW    V L+  +D     D  + +T   +S         L +R P 
Sbjct: 417 -ALYINLFIPSRLDWTEKGVKLS--LDTRFPDDDSVFITFEQASS------LPLKIRYPS 467

Query: 559 WTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
           W  +   +  +NG    +   PG +LS   +W   D+++++LP++L  E +    P+ ++
Sbjct: 468 WVKAGQLELRVNGTPRAVTAKPGQYLSLAGQWQKGDQISLKLPMALSLEQM----PDQSN 523

Query: 618 IQAILFGPYLLAGHTS 633
             A+LFGP +LA  T+
Sbjct: 524 YYAVLFGPIVLAAKTN 539


>gi|347528202|ref|YP_004834949.1| hypothetical protein SLG_18170 [Sphingobium sp. SYK-6]
 gi|345136883|dbj|BAK66492.1| hypothetical protein SLG_18170 [Sphingobium sp. SYK-6]
          Length = 805

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 176/536 (32%), Positives = 257/536 (47%), Gaps = 47/536 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A   N  YLL L+ D L+ +F   A L   G+AYGGWE     + GH +GHY++A A M 
Sbjct: 61  AVDANRRYLLQLEPDRLLHNFLVHAGLEPKGEAYGGWEG--DTIAGHTLGHYMTALALMH 118

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV------------ 225
           A T +A    +   +V  L   Q   G GY++ F     D  E  K +            
Sbjct: 119 AQTGDAECARRALYIVDELERAQKASGDGYVAGFTRRNGDVVEDGKAIFPEIMAGDIRSA 178

Query: 226 -------WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
                  W P+Y  HK+ AGL D      + +A+ +A  +  Y    ++KV       + 
Sbjct: 179 GFDLNGCWVPFYNWHKLYAGLFDIQTWIGSDKAIPIAVSLSGY----IEKVFASLDDTQL 234

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
              L+ E GG+N+    L+  T DP+ L LA        L  L+   + L   HANT IP
Sbjct: 235 QTVLDCEHGGINESFAELHVRTGDPRWLALAERIRHRKVLDPLSRGENSLPWIHANTQIP 294

Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
            VIG    +E+TG   + +   +F D V   +SY  GG + RE++ DP  ++  +  +  
Sbjct: 295 KVIGLARLHEITGRADHAIAARYFWDTVVHRYSYVIGGNADREYFPDPDTVSRHITEQTC 354

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C TYNMLK++RHL+ W  E +  DYYERA  N +L+ QR T+ G+  YM+PL  G  +
Sbjct: 355 ESCNTYNMLKLTRHLYAWRPEASLFDYYERAHINHILAQQR-TDNGMFAYMVPLMSGTHR 413

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV---PGLYIIQYISSSFDWKS 515
           A     W   F+SFWCC G+GIES SK G+SI++EE+        L    YI S   W +
Sbjct: 414 A-----WSDPFDSFWCCVGSGIESHSKHGESIWWEEDDQRRAGEALVANLYIPSRTQWSA 468

Query: 516 -GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
            G  ++ +   P   +D  + + LT  +K       +L LR+P W   +     +NG+  
Sbjct: 469 RGATLVMETAYP---FDGEIDIALTELAKPGT---FTLALRIPAWC--DEPAVLINGKAW 520

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
              P   +++    W   D + + LP+ LR E   DD     S  A L GP +LA 
Sbjct: 521 KATPADGYIAIKRPWKRGDSIRLSLPMKLRMEPTPDD----PSTVAFLRGPVVLAA 572


>gi|347738800|ref|ZP_08870212.1| hypothetical protein AZA_89687 [Azospirillum amazonense Y2]
 gi|346918071|gb|EGY00199.1| hypothetical protein AZA_89687 [Azospirillum amazonense Y2]
          Length = 804

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 174/531 (32%), Positives = 253/531 (47%), Gaps = 44/531 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A   NL YL  L+ D L+ +FR  A L   G AYGGWE     + GH +GHYLSA + M 
Sbjct: 53  AVDANLAYLHSLEADRLLHNFRSGAGLQPKGAAYGGWEG--DTIAGHTLGHYLSALSLMH 110

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV------------ 225
           A T +A  K ++  +V  L+ECQ   G GY++ F  +  D  E  K V            
Sbjct: 111 AQTGDAECKRRVDYIVAELAECQKAQGDGYVAGFTRKRGDIVEDGKVVFDELRRGEIRSA 170

Query: 226 -------WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
                  W P Y  HK+  GL D   L  N QAL +   +  Y    + +V +  + E+ 
Sbjct: 171 GFDLNGCWVPLYNWHKLYTGLFDAQTLCGNTQALDVGVKLGGY----IDEVFSHLNDEQV 226

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
              L+ E GG+N+    LY+ T D + LLLA        L  L+   D L++ HANT IP
Sbjct: 227 QKVLDCEHGGINESFAELYARTGDRRWLLLAERLYHAKVLVPLSEGRDELANIHANTQIP 286

Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
            +IG     E+TG   +     FF   V  +HSY  GG + RE++ +P+ ++  +  +  
Sbjct: 287 KLIGLARLAELTGSERHAKASAFFWQTVTTNHSYVIGGNADREYFQEPRSISRHITEQTC 346

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E C +YNMLK++R L+    +  Y D+YERA  N VL+ Q+    G+  YM PL  G   
Sbjct: 347 EGCNSYNMLKLTRLLYARQADAHYFDFYERAHLNHVLA-QQNPATGMFTYMTPLMSG--- 402

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
             S   + T    FWCC GTG+ES +K G+S+Y+        L +  YI S+  W     
Sbjct: 403 --SAREFSTPTEDFWCCVGTGMESHAKHGESVYWRR--GAEDLAVNLYIPSTLTWGERGA 458

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
           V    VD    +     + LT  + +      +++ R+P W    GA  ++NG+   L  
Sbjct: 459 V----VDLDTRYPEAETVLLTLKALKRPATF-AVSFRIPAW--CTGATLAVNGKPQDLVV 511

Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
              +      W   D + ++LP++LR E+  DD    A   A L GP +LA
Sbjct: 512 QNGYAVVRREWKAGDAVALRLPMALRLESTNDD----ADTVAFLHGPLVLA 558


>gi|408527846|emb|CCK26020.1| secreted protein [Streptomyces davawensis JCM 4913]
          Length = 731

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 182/540 (33%), Positives = 263/540 (48%), Gaps = 47/540 (8%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
           WLD        Q     YL  +DVD L+++FR    L T G A  GGW+ P    R H  
Sbjct: 27  WLDN-------QNRTGNYLRFVDVDRLLYNFRANHKLSTNGAAANGGWDAPDFPFRTHIQ 79

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
           GH+L+A AQ++A T + T ++K + +V  L++CQ          GYLS +P   F + E 
Sbjct: 80  GHFLTAWAQLYAVTGDTTCRDKATYMVAELAKCQANNSAAGFSPGYLSGYPEANFTALEQ 139

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVER 277
                  YYTIHK LAGLLD +    + QA    L +A W V++   R+       + E+
Sbjct: 140 GTKGDVLYYTIHKTLAGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRL-------TSEQ 191

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
               L  E GGMN VL  L+  T D + L +A  FD       LA   D L+  HANT +
Sbjct: 192 MQNMLRIEFGGMNAVLTDLHVRTGDARWLAVAQRFDHAAVFDPLAANQDKLNGLHANTQV 251

Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
           P  IG+   Y+ TG   Y+ I T   +I   SH+YA GG S  E +  P  +A  L  + 
Sbjct: 252 PKWIGAAREYKATGTTRYRDIATNAWNITLDSHTYAIGGNSQAEHFRAPHAIAGFLNKDT 311

Query: 398 EETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLG-- 453
            E+C T+NML ++R LF    +  A  DYYERA  N ++  Q    + G + Y  PL   
Sbjct: 312 CESCNTFNMLVLTRELFELDPDRAALFDYYERAWLNQMIGQQNPADDHGHVTYFTPLNPG 371

Query: 454 --RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
             RGV  A     W T + +FWCC GTG+E  ++L DSIY+  +     L +  ++ S  
Sbjct: 372 GRRGVGPAWGGGTWSTDYGTFWCCQGTGLEMNTRLMDSIYYRRDDT---LIVNLFVPSVL 428

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
            W    + + Q      S       T T       G   ++ +R+P WT   GA  S+NG
Sbjct: 429 TWPERGITVTQTTSYPNS------DTTTLKVTGNAGGTWAMRIRIPSWT--TGASISVNG 480

Query: 572 QNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
               +   PG++ + +  WS  D +T++LP+ +   A  DD P   ++ A+ +GP +L+G
Sbjct: 481 VAQTVATTPGSYATLSRAWSSGDTVTVRLPMRIILRA-ADDNP---NVTAVTYGPVVLSG 536


>gi|297203356|ref|ZP_06920753.1| secreted protein [Streptomyces sviceus ATCC 29083]
 gi|297148382|gb|EDY55480.2| secreted protein [Streptomyces sviceus ATCC 29083]
          Length = 723

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 182/545 (33%), Positives = 263/545 (48%), Gaps = 53/545 (9%)

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENPISELRGHFV 166
           WLD        Q     YL  +DVD L+++FR    L T G  A GGW+ P    R H  
Sbjct: 17  WLDN-------QNRTQNYLRFVDVDRLLYNFRANHRLSTNGAVATGGWDAPDFPFRTHVQ 69

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE- 220
           GH+L+A AQ++A + +   ++K + +V  L++CQ          GYLS +P   F + E 
Sbjct: 70  GHFLTAWAQLYAVSGDTVCRDKATYMVAELAKCQANNSAAGFSAGYLSGYPESDFTALEQ 129

Query: 221 -ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRV--QKVITMY 273
             L     PYYTIHK LAGLLD +    + QA    L +A W V++   R+  Q++ TM 
Sbjct: 130 RTLSNGNVPYYTIHKTLAGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRLSGQQMQTM- 187

Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
                   L  E GGMN VL  LY  T D + L  A  FD       LA   D LS  HA
Sbjct: 188 --------LQTEFGGMNTVLTDLYQQTGDARWLTAARRFDHAAVFDPLASGQDQLSGLHA 239

Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
           NT +P  IG+   Y+ TG   Y+ I T   +    +H+YA GG S  E +  P  +A  L
Sbjct: 240 NTQVPKWIGAAREYKATGTTRYRDIATNAWNFTVNAHTYAIGGNSQAEHFRAPNAIAGYL 299

Query: 394 GSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLP 451
             +  E+C T NML ++R LF       A  DYYE+A  N ++  Q   +  G + Y  P
Sbjct: 300 NKDTCESCNTVNMLTLTRELFALDPNRAALFDYYEQAWLNQMIGQQNPADGHGHVTYFTP 359

Query: 452 LG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           L     RGV  A     W T + +FWCC GTG+E  ++L DS+YF  +     L +  ++
Sbjct: 360 LNPGGRRGVGPAWGGGTWSTDYGTFWCCQGTGLEMHTRLMDSLYFRSDDT---LIVNLFV 416

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S  +W    + + Q      S       T T      V    ++ +R+P WT   GA  
Sbjct: 417 PSVLNWSERGITVTQTTSYPNS------DTTTLQVTGNVSGTWAMRIRIPGWTA--GATI 468

Query: 568 SLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
           S+NG    +   PG++ + T  W+  D +T++LP+ +   A  D+     ++ AI +GP 
Sbjct: 469 SVNGTRQDITTTPGSYATLTRSWTSGDTVTVRLPMRVVMRAANDN----PNVAAITYGPV 524

Query: 627 LLAGH 631
           +L+G+
Sbjct: 525 VLSGN 529


>gi|408357216|ref|YP_006845747.1| hypothetical protein AXY_18530 [Amphibacillus xylanus NBRC 15112]
 gi|407727987|dbj|BAM47985.1| hypothetical protein AXY_18530 [Amphibacillus xylanus NBRC 15112]
          Length = 755

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 184/545 (33%), Positives = 268/545 (49%), Gaps = 46/545 (8%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           +  V L+Q  +   +QQ   EYLL LD+D L+    +          YGGWE+   E+ G
Sbjct: 1   MDQVQLNQG-MFKESQQKGKEYLLYLDIDRLIAPCYEAVGQEPRAPRYGGWES--MEIAG 57

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFE--- 220
           H +GH+LSA++ M+  T +  +K K+   +  L+  Q     GY+S FP + FD      
Sbjct: 58  HSIGHWLSAASLMYNVTGDLLLKHKIDYAIDELAHVQAFDPEGYVSGFPRDCFDEVFTGE 117

Query: 221 ------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVI 270
                  L   W P+Y+IHKI AGL+D Y LA N +A    +K++ W          + +
Sbjct: 118 FRVDNFGLGGSWVPWYSIHKIYAGLVDAYRLASNEKAKTVLVKLSNW--------ADQGL 169

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
           +  + E+    L  E GGMN+ +  +Y IT D + L LA  F+    L  L    D L+ 
Sbjct: 170 SKLNDEQFQRMLICEFGGMNETMADVYEITGDKRFLKLAERFNHKAVLDPLIEGIDDLAG 229

Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
            HANT IP VIG+   Y++TG   Y+ +  FF D V    SYA GG S  E +       
Sbjct: 230 KHANTQIPKVIGAAKLYDMTGKEEYQKLSRFFWDQVVYHRSYAFGGNSNAEHFGPVD--T 287

Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
           + LG  + ETC TYNMLK++ HLF W  +  Y DYYE AL N +L  Q   E G+  Y +
Sbjct: 288 EPLGIISTETCNTYNMLKLTEHLFDWQPDSRYMDYYENALYNHILGSQ-DPESGMKSYFI 346

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           P   G  K      + +  NSFWCC G+G+E+ ++   +IY  +      LY+  +I S+
Sbjct: 347 PTEPGHFKV-----YCSPDNSFWCCTGSGMENPARYTKNIYTRK---ADSLYVNLFIPST 398

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
                  +   Q+ D      PY   T+ F+ K+  G+  ++ LR P W     A   +N
Sbjct: 399 LTIAEKDLQFIQETDF-----PY-DETVHFTVKEGNGERLTVYLRKPNWLAGEMA-LQIN 451

Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           G+ + L     +     +W  ND +T QLP+ LRT   + D+PE    +A  +GP LLAG
Sbjct: 452 GEPVALELVNGYYEIDRKWYKNDTVTFQLPMGLRTYTAK-DQPEK---KAFFYGPILLAG 507

Query: 631 HTSGE 635
               E
Sbjct: 508 RLGRE 512


>gi|385677991|ref|ZP_10051919.1| hypothetical protein AATC3_18830 [Amycolatopsis sp. ATCC 39116]
          Length = 886

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 184/582 (31%), Positives = 288/582 (49%), Gaps = 55/582 (9%)

Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAS 179
           +  + YL  +D D L+  FR TA LP+  +  GGWE P  +LRGH  GH LS  A   A+
Sbjct: 54  ERTVAYLRFVDADRLLHMFRVTAGLPSTAEPCGGWEAPDIQLRGHTTGHLLSGLALAAAN 113

Query: 180 THNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEALKPVWAPYYTIHK 234
           T +  +  K +++V +L+ECQ          GYLSAFP   F   EA K VWAPYYTIHK
Sbjct: 114 TGDTELAAKGASIVAALAECQAAAPAAGFTEGYLSAFPERAFADLEAGKVVWAPYYTIHK 173

Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLY 294
           I+AGLLDQY L  N QAL +   M  +   R+  +    + E     L+ E GGMN+ L 
Sbjct: 174 IMAGLLDQYRLLGNRQALDVLLGMARWARARMANL----TREAQQKVLHTEFGGMNETLA 229

Query: 295 RLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL 354
            L  +T D +HL  A LFD       L+ + D L+  HANT I  ++G+ + ++ TG+  
Sbjct: 230 SLALVTGDRQHLETAKLFDHDEIFVPLSQRRDTLAGRHANTDIAKIVGAAVEWDATGEEY 289

Query: 355 YKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLF 414
           Y+ I T+F D V   H+Y  GG +  EF+  P ++   LG    E C +YNMLK+SR LF
Sbjct: 290 YRTIATYFWDQVVHHHTYVIGGNANAEFFGPPDQIVSQLGENTCENCNSYNMLKLSRLLF 349

Query: 415 -RWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARSTHG-------W 465
            R      Y DY E  L N +L  Q   +  G + Y   L  G  + +   G       +
Sbjct: 350 LRDPSRTDYLDYSEWTLLNQMLGEQDPDSAHGFVTYYTGLVPGAQR-KGKEGVVSDPGTY 408

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
            + + +F C +GTG+E+  K  ++IY+  +    GL++ Q+I S  D+    + L  +  
Sbjct: 409 SSDYGNFTCDHGTGLETHVKYAENIYYAADD---GLWVNQFIPSEVDYGGVRIRLETE-- 463

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
               +D  +R+ ++ +         +L +R+P W  +  A+  +NG+ +    PG F   
Sbjct: 464 --YPYDETVRLHVSGAGA------FALRVRIPSW--ATHARLFVNGEAM-RAEPGRFAVV 512

Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARS 645
             RW   D + ++LP++++        P+  ++ A+ +GP +LA               S
Sbjct: 513 GRRWRDGDVVELRLPMTVQWRPA----PDNPAVHALTYGPLVLAARHGD----------S 558

Query: 646 LSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEF 687
           + A+I  + P       +  +E G + F +   ++ + +  F
Sbjct: 559 VPAVIPTVDPR------SLRREPGRAEFSVQAGDRRLRLSPF 594


>gi|255936447|ref|XP_002559250.1| Pc13g08250 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211583870|emb|CAP91894.1| Pc13g08250 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 627

 Score =  256 bits (653), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 185/535 (34%), Positives = 264/535 (49%), Gaps = 49/535 (9%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTP-GKAYGGWENPISELRGHFVGHYLSASAQMW 177
           Q   L+YL  +DVD L++ FR T  L T      GGW+ P    R H  GH+LSA AQ +
Sbjct: 58  QDRTLKYLKEIDVDRLLYVFRATHGLSTQQATPNGGWDAPDFPFRSHVQGHFLSAWAQCY 117

Query: 178 ASTHNATIKEKMSTVVFSLSECQ--NK-IG--TGYLSAFPTELFDSFE--ALKPVWAPYY 230
           A   + T  ++       L++CQ  NK +G   GY+S FP   F   E   L     PYY
Sbjct: 118 AVLRDQTCYDRAIYFAAELAKCQANNKAVGFTDGYVSGFPESEFAKLENDTLTNGNVPYY 177

Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            +HK LAGLLD + L ++  +    L +A+W        V K    +S       L  E 
Sbjct: 178 AVHKTLAGLLDIWRLTNDTTSRDILLSLASW--------VDKRTEPFSYAAMQKLLQTEF 229

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGMN+V+  +Y  T D + L +A  FD       LA   D L   HANT +P  IG+  +
Sbjct: 230 GGMNEVMADIYHQTGDERWLTVAQRFDHAVIFDPLAANKDELDGLHANTQVPKWIGAARQ 289

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           Y+ TG+  Y  I     +I   SH+YA GG S  E +  P  +A  L ++  E C +YNM
Sbjct: 290 YKATGESRYLDIARNAWEINVKSHTYAIGGNSQAEHFRAPNAIAAYLTNDTCEACNSYNM 349

Query: 407 LKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTE-PGVMIYMLPLG----RGVSKAR 460
           LK++R L+    +  AY D+YE +L N +L  Q   +  G + Y  PL     RGV  A 
Sbjct: 350 LKLTRELWLLDSDNSAYFDFYENSLLNHLLGQQDPHDHHGHITYFTPLNAGGRRGVGPAW 409

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
               W T ++SFWCC GT +E+ +KL DSIYF  +     L+I  ++SS   W    + L
Sbjct: 410 GGGTWSTDYDSFWCCQGTALETNTKLMDSIYFYNDST---LFINLFMSSVLKWPEMGITL 466

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLS--SLNLRMPVWTYSNGAQASLNGQNLP--L 576
            Q     V            +SK EV      ++N+R+P W  S  A+ +LNG+ L    
Sbjct: 467 KQSTTYPVG----------DTSKLEVSGSGAWTMNIRIPAWASS--AELTLNGEALSDVK 514

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
             PG +   +  W+  D + I+ P++LRT A  D+    +S+ AI +GP +L G+
Sbjct: 515 AAPGKYAQISRTWADGDVIEIRFPMTLRTVAANDN----SSMVAIAYGPTVLCGN 565


>gi|393782435|ref|ZP_10370619.1| hypothetical protein HMPREF1071_01487 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673263|gb|EIY66726.1| hypothetical protein HMPREF1071_01487 [Bacteroides salyersiae
           CL02T12C01]
          Length = 781

 Score =  256 bits (653), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 181/565 (32%), Positives = 278/565 (49%), Gaps = 43/565 (7%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A + + ++L+ L  D  +  F + A        Y GWE+  S   G   GHYLSA + ++
Sbjct: 62  AMEADRKWLMSLQPDRFLHRFHENAGFTPKAPMYDGWED--SSQSGFSFGHYLSAMSMLY 119

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELF-DSFEA----LKPVW 226
           A+T +  +  ++   +  + +CQ  IGTGY++A P       EL  D  E     +   W
Sbjct: 120 AATGDNELLGRIEYSINEIRKCQLAIGTGYVAAIPDGDRLWNELVADKIEPGGSWINGFW 179

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL-NEE 285
           AP+Y +HK+ +G +D Y+      A  +A  + ++  ++ + +      +  W  + + E
Sbjct: 180 APWYNLHKLWSGFIDVYLYTGVETAKTVAIELTDWACDKFRDM-----TDDQWQRMISCE 234

Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
           TGGMND LY +Y+IT + ++L LA  F     +  L+ Q D L+  HANT IP V G   
Sbjct: 235 TGGMNDALYNMYAITGNLRYLQLADKFYHYSVMEPLSQQRDELNGLHANTQIPKVTGIAR 294

Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
            YE+ G    K I TFF + V   H+Y  GG S  E +  P  L   L  +  ETC TYN
Sbjct: 295 SYELRGREKDKTIATFFWNTVLKKHTYCIGGNSNYEHFGKPGELF--LSDKTTETCNTYN 352

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
           MLK++ HLF W  +  Y DYYERAL N +L+ Q   E G+++Y LPL     K  ST   
Sbjct: 353 MLKLTGHLFAWEPKAEYMDYYERALYNHILASQ-NHETGMVVYSLPLAYASFKEFSTPE- 410

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
               +SFWCC GTG E+  K  + IY E E +   LYI  +++S  +W+   +++ Q+ +
Sbjct: 411 ----HSFWCCVGTGFENHVKYAEGIYSESEND---LYINLFVASRLNWRRKGMIIEQQTE 463

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLS 584
              S    L +    S      Q  +L++R P W  + G    +N +   +   PG+++S
Sbjct: 464 FPESDKSSLILRCAKS------QTLTLHIRYPQWA-TTGYTIKVNDKIQEIEKKPGSYIS 516

Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTAR 644
               W   DK+ I++P SL  E +  D  ++    A L GP +LAG    +        +
Sbjct: 517 LNRLWKDGDKIEIEMPKSLHKEVLPGDEHKF----AFLNGPIVLAGEMDLDERKIVFLEK 572

Query: 645 SLSALISPIPPSFNAQLVTFTQESG 669
             S L   I PS N    +F  ++G
Sbjct: 573 KDSELRDWIQPS-NRTKTSFITKTG 596


>gi|150003078|ref|YP_001297822.1| hypothetical protein BVU_0490 [Bacteroides vulgatus ATCC 8482]
 gi|149931502|gb|ABR38200.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
          Length = 783

 Score =  256 bits (653), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 170/543 (31%), Positives = 269/543 (49%), Gaps = 47/543 (8%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           + DV L  +S    A+  ++ YLL +D D L+  + K A L    + Y  WEN  + L G
Sbjct: 33  VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
           H  GHYLSA + M+A+T N  IK ++  ++  L  CQ+  G GYL   P   +++   E 
Sbjct: 90  HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149

Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
                    L   W P Y IHKI AGL D  +  D+ +A    +K+  WM+        +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMI--------R 201

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           +++  S E+    L  E GG+N+    + +IT D ++L LAH F     L  L  Q D L
Sbjct: 202 LVSKLSDEQIQEMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
           +  HANT IP VIG +   ++ G+  +     +F + V    S   GG S RE +     
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321

Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
            +  L SE   ETC TYNML++++ L+  + ++ + DYYERAL N +LS Q   + G  +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG-FV 380

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  P+  G  +      +     SFWCC G+G+E+ ++ G+ IY  ++ N   LY+  +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S+  W       + +++   ++      TL  S ++   + + L  R+P WT     + 
Sbjct: 433 PSTLRWG------DTQIEQQTAFPDEEGSTLVISPEKGKKEFTLL-FRIPEWTKPEALRL 485

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           S+NG+   +     ++S    WS  DK+ ++LP+ LR  A+ D    Y    +IL+GP +
Sbjct: 486 SVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541

Query: 628 LAG 630
           LA 
Sbjct: 542 LAA 544


>gi|357032903|ref|ZP_09094838.1| tat twin-arginine translocation pathway signal sequence domain
           protein [Gluconobacter morbifer G707]
 gi|356413894|gb|EHH67546.1| tat twin-arginine translocation pathway signal sequence domain
           protein [Gluconobacter morbifer G707]
          Length = 790

 Score =  255 bits (652), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 176/543 (32%), Positives = 275/543 (50%), Gaps = 65/543 (11%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A + N  YLL L+ D L+ +FRK A LP  G  YGGWE+    + GH +GHYLSA A M+
Sbjct: 57  AVERNRIYLLSLEADRLLHNFRKQAGLPPKGALYGGWES--DTIAGHTLGHYLSALALMY 114

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE-----------LFDSFEA----- 221
           A T +A  +E+++ +V  L   Q + G GY++ F  +           +F   EA     
Sbjct: 115 AQTDDAACRERVAYIVQELVVVQKQWGDGYVAGFTRKEKNGALVDGKRIFAEIEAGDIRS 174

Query: 222 ----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEY---FYNRV-----QKV 269
               L   W+P Y IHK  AGLLD ++     QAL +A  + ++   F+ ++     QKV
Sbjct: 175 SGFDLNGAWSPLYNIHKTFAGLLDAHIYCHCDQALNVAVGLGQFLKAFFGKLTDAQMQKV 234

Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAH-LFDKPCFLGFLALQADYL 328
           +T             E GG+N+    L + T D + L LA+ ++D+P  L  L  + D L
Sbjct: 235 LTC------------EYGGLNESFAELAARTGDEEWLRLAYRIYDRP-VLDPLMEERDDL 281

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
           ++ HANT IP ++G     EV+ +  +     FF   V   HSY  GG + RE++ +P  
Sbjct: 282 ANRHANTQIPKLVGLARIAEVSQNRHWMTGPQFFWKAVTRHHSYVIGGNADREYFSEPDT 341

Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
           ++  +  +  E C TYNMLK++R  +    + A  DYYERA  N +L+     + G+  Y
Sbjct: 342 ISQHITEQTCEHCNTYNMLKLTRQCYASNPQAALFDYYERAHLNHILAAH-DPQTGMFTY 400

Query: 449 MLP-LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           M P +  GV +      W T   SFWCC GTG+ES +K GDSI+++ E     L++  YI
Sbjct: 401 MTPTITAGVRE------WSTPTESFWCCVGTGMESHAKHGDSIWWQREET---LFVNLYI 451

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S   W    V  + K++     D  + + L   +     +L+   LR+P W      Q 
Sbjct: 452 PSRMVWDRKDV--SWKMETGYPHDGRVSLLLEDLNSPVAFRLA---LRVPGWV-REPIQV 505

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           ++NG+++P  P   ++    +WS  D + + LP+++RTE+  DD    + +  +L GP +
Sbjct: 506 AVNGRDVPATPSDGYIVLDRKWSAGDHVVLDLPMTVRTESPVDD----SKLVTVLRGPMV 561

Query: 628 LAG 630
           +A 
Sbjct: 562 MAA 564


>gi|325836901|ref|ZP_08166283.1| hypothetical protein HMPREF9402_1694 [Turicibacter sp. HGF1]
 gi|325491107|gb|EGC93399.1| hypothetical protein HMPREF9402_1694 [Turicibacter sp. HGF1]
          Length = 763

 Score =  255 bits (652), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 181/536 (33%), Positives = 278/536 (51%), Gaps = 46/536 (8%)

Query: 107 VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
           V L++ S+   +Q    +YLL LDV+ L+    + AS   P  +YGGWE+   E++GH +
Sbjct: 6   VRLEKDSLFEISQAVGKQYLLDLDVERLLAPIYEGASQIPPKPSYGGWES--LEIKGHSI 63

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF---------- 216
           GHYLSA A M+ +T +  +KE+M  ++ + S  Q     GYL  F +  F          
Sbjct: 64  GHYLSALACMYEATKDLELKERMDYIIETFSLLQR--ADGYLGGFLSTPFEQVFTGEFHV 121

Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
           D F +L   W P+Y+IHKI AGL+D Y +  N +AL +   + ++ Y   +    + S E
Sbjct: 122 DHF-SLSHYWVPWYSIHKIYAGLMDAYQIGKNVEALNILKKLADWAYEGSR----LMSDE 176

Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
           +    L  E GGMN+V+  LY IT D ++L LA  F +   +  LA   D L   HANT 
Sbjct: 177 QFQRMLICEYGGMNEVMAELYEITQDERYLYLAKRFTQHLIMDPLAAGVDDLQGRHANTQ 236

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
           IP V+G+   YEVTGD  Y  +  FF + V    SY  GG S+ E +       + L  E
Sbjct: 237 IPKVLGAAKLYEVTGDDYYYRVAKFFFETVVLHRSYVIGGNSSGEHFGPSD--TEPLSRE 294

Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
             ETC TYNM+K++++LF+WTK+  Y D+ ERA  N +L+ Q     G  IY      G 
Sbjct: 295 AAETCNTYNMIKLAKYLFKWTKDSKYIDFIERATYNHILASQ-DPHTGCKIYFTSNYPGH 353

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
            K      +GTK +SFWCC GTG+E+  +    I+F+E+ +    Y+  +++SSF  +  
Sbjct: 354 FKV-----YGTKEDSFWCCTGTGMENPGRYTHHIFFKEDED---FYVNLFMASSFVKEDE 405

Query: 517 HVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQL-SSLNLRMPVWTYSNGAQASLNGQNL 574
            + +  + D PI +      + L F   +E  QL  ++ +R+P W  +   +    GQ+ 
Sbjct: 406 QLKVVLQTDFPISN-----VVKLVF---EEANQLFLNVKIRVPYWL-NAPIEVRFKGQSY 456

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
                G +L  ++ +  +D++ I LP+ L  E +  D P      A ++GP +LA 
Sbjct: 457 EANGQG-YLMISDTFHADDEIEIVLPMGLH-EYVSMDDPHKV---AFMYGPVVLAA 507


>gi|319640591|ref|ZP_07995310.1| hypothetical protein HMPREF9011_00907 [Bacteroides sp. 3_1_40A]
 gi|345517952|ref|ZP_08797412.1| acetyl-CoA carboxylase [Bacteroides sp. 4_3_47FAA]
 gi|254835150|gb|EET15459.1| acetyl-CoA carboxylase [Bacteroides sp. 4_3_47FAA]
 gi|317387761|gb|EFV68621.1| hypothetical protein HMPREF9011_00907 [Bacteroides sp. 3_1_40A]
          Length = 783

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 170/543 (31%), Positives = 269/543 (49%), Gaps = 47/543 (8%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           + DV L  +S    A+  ++ YLL +D D L+  + K A L    + Y  WEN  + L G
Sbjct: 33  VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
           H  GHYLSA + M+A+T N  IK ++  ++  L  CQ+  G GYL   P   +++   E 
Sbjct: 90  HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149

Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
                    L   W P Y IHKI AGL D  +  D+ +A    +K+  WM+        +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMI--------R 201

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           +++  S E+    L  E GG+N+    + +IT D ++L LAH F     L  L  Q D L
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
           +  HANT IP VIG +   ++ G+  +     +F + V    S   GG S RE +     
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321

Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
            +  L SE   ETC TYNML++++ L+  + ++ + DYYERAL N +LS Q   + G  +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG-FV 380

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  P+  G  +      +     SFWCC G+G+E+ ++ G+ IY  ++ N   LY+  +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S+  W       + +++   ++      TL  S ++   + + L  R+P WT     + 
Sbjct: 433 PSTLRWG------DTQIEQQTAFPDEEGSTLVISPEKGKKEFTLL-FRIPEWTKPEALRL 485

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           S+NG+   +     ++S    WS  DK+ ++LP+ LR  A+ D    Y    +IL+GP +
Sbjct: 486 SVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541

Query: 628 LAG 630
           LA 
Sbjct: 542 LAA 544


>gi|15614440|ref|NP_242743.1| hypothetical protein BH1877 [Bacillus halodurans C-125]
 gi|10174495|dbj|BAB05596.1| BH1877 [Bacillus halodurans C-125]
          Length = 758

 Score =  255 bits (651), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 168/526 (31%), Positives = 270/526 (51%), Gaps = 35/526 (6%)

Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLS 171
             + + +Q+   + +L LD+D L+  + + A+LP   ++YGGWE    E+RGH +GH+LS
Sbjct: 11  KGLFYNSQKKGNDVILALDIDRLLAPYYEAANLPPKKRSYGGWEE--REIRGHSLGHWLS 68

Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQNKIG--TGYLSAFPTELFD-SFEA----LKP 224
           A+A M+ +T +  + E++   V  L+  Q+ +G   G   A   E+F   F+     +  
Sbjct: 69  AAAAMYETTGDKALLERIDRAVQELATIQDDVGYVGGVKRAHFDEMFSGEFQVGHFNIAG 128

Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
            W P+Y +HK+ AGL+D + L  ++ AL + T + ++     +K     + ++    L  
Sbjct: 129 TWVPWYNLHKLFAGLIDVHQLTGHSLALTVVTKLADW----AKKGTDQLTDDQFQRMLIC 184

Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
           E GGMN+ +  LY++T    +L LA  F     L  LA   D L   HANT IP VIG+ 
Sbjct: 185 EHGGMNEAMADLYTLTGHKDYLQLAIRFCHWAVLEPLANGIDELEGKHANTQIPKVIGAA 244

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
             +E+TGD  Y+ I  FF   V    SY  GG S  E +    +  +TLG E  ETC TY
Sbjct: 245 KLFEITGDDTYRAIAEFFWRQVTNDRSYIIGGNSNSEHFGPANK--ETLGVETAETCNTY 302

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           NMLK++ HLFRW +     DYYE+AL N +L+ Q   + G+  Y + L  G  K  S+  
Sbjct: 303 NMLKLTEHLFRWNRSSQLMDYYEKALYNHILASQ-DPDSGMKTYFVSLQPGHFKVYSSLE 361

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
                 SFWCC+GTG+E+ ++   +IY  ++ ++   Y+  +++S    K   V + Q+ 
Sbjct: 362 -----ESFWCCFGTGLENPARYTRTIYDRDDRHI---YVNLFMASEIHLKDLQVQIRQET 413

Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLS 584
           +    +    R  LTF     V     L++R+P W  +    A +NG+        ++L+
Sbjct: 414 N----FPETDRTKLTFVKADGVS--IKLHIRVPEWV-AGPVTARINGKETFSESGADYLT 466

Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
               W   D++ + LP+ LR    +DD  +      I++GP +LAG
Sbjct: 467 IEREWQKGDEIEVHLPMELRIYEAKDDSHKV----GIMYGPIVLAG 508


>gi|423313734|ref|ZP_17291670.1| hypothetical protein HMPREF1058_02282 [Bacteroides vulgatus
           CL09T03C04]
 gi|392684669|gb|EIY77993.1| hypothetical protein HMPREF1058_02282 [Bacteroides vulgatus
           CL09T03C04]
          Length = 783

 Score =  255 bits (651), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 170/543 (31%), Positives = 269/543 (49%), Gaps = 47/543 (8%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           + DV L  +S    A+  ++ YLL +D D L+  + K A L    + Y  WEN  + L G
Sbjct: 33  VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
           H  GHYLSA + M+A+T N  IK ++  ++  L  CQ+  G GYL   P   +++   E 
Sbjct: 90  HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149

Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
                    L   W P Y IHKI AGL D  +  D+ +A    +K+  WM+        +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMI--------R 201

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           +++  S E+    L  E GG+N+    + +IT D ++L LAH F     L  L  Q D L
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
           +  HANT IP VIG +   ++ G+  +     +F + V    S   GG S RE +     
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321

Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
            +  L SE   ETC TYNML++++ L+  + ++ + DYYERAL N +LS Q   + G  +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG-FV 380

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  P+  G  +      +     SFWCC G+G+E+ ++ G+ IY  ++ N   LY+  +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S+  W       + +++   ++      TL  S ++   + + L  R+P WT     + 
Sbjct: 433 PSTLRWG------DTQIEQQTAFPDEEGSTLVISPEKGKKEFTLL-FRIPEWTKPEALRL 485

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           S+NG+   +     ++S    WS  DK+ ++LP+ LR  A+ D    Y    +IL+GP +
Sbjct: 486 SVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541

Query: 628 LAG 630
           LA 
Sbjct: 542 LAA 544


>gi|333380462|ref|ZP_08472153.1| hypothetical protein HMPREF9455_00319 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826457|gb|EGJ99286.1| hypothetical protein HMPREF9455_00319 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 790

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 177/547 (32%), Positives = 271/547 (49%), Gaps = 41/547 (7%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
           SL DV L   S    A+  + +YLL L  D L+  F + + L    ++Y  WEN  + L 
Sbjct: 29  SLKDVRL-LDSPFKHAEDLDKQYLLELKADRLLSPFLRESGLTPKAESYTNWEN--TGLD 85

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELF 216
           GH  GHYLSA + M+AST +  IKE++  +V  L  CQ+    GY+   P       E+ 
Sbjct: 86  GHIGGHYLSALSLMYASTGDKQIKERLDYMVSELKRCQDANDNGYIGGVPGGKAIWEEVA 145

Query: 217 DS------FEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
           +       F+ L   W P Y IHK  AGL D Y+ A++  A +M   M ++  N V K+ 
Sbjct: 146 NGNIRAGGFD-LNGKWVPLYNIHKTYAGLRDAYLYANSDMAKEMLIKMTDWAINLVSKL- 203

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
              S E+    L  E GG+N+    + +IT D K+L LAH F     L  L    D L+ 
Sbjct: 204 ---SEEQIQDMLRSEHGGLNETFADVAAITGDKKYLKLAHQFSHQLVLNPLLNHEDKLTG 260

Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
            HANT IP V+G +   +V G+  +     FF + V    S + GG S  E +      +
Sbjct: 261 MHANTQIPKVLGFKRIADVEGNESWSEASRFFWETVVEHRSVSIGGNSVGEHFNPTNDFS 320

Query: 391 DTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
             + S E  ETC TYNML++S+ L++ +++  Y DYYERAL N +LS Q   E G  +Y 
Sbjct: 321 RVIKSIEGPETCNTYNMLRLSKMLYQTSQDEKYMDYYERALYNHILSTQ-NPEQGGFVYF 379

Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
             +  G  +      +     SFWCC G+GIE+ +K G+ IY   +     LY+  +I S
Sbjct: 380 TQMRPGHYRV-----YSQPQTSFWCCVGSGIENHAKYGEMIYAHTDNE---LYVNLFIPS 431

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             +WK     + Q+     S+    +  L  + ++      +L LR PVW    G + S+
Sbjct: 432 RLNWKEKKTEIIQE----NSFPDEAKTQLIINPEKTAA--FTLKLRYPVWVKKWGLKVSV 485

Query: 570 NGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
           NG++ P+   P +++S   +W   DK+ +++P+ +  E +    P+ ++  +I +GP  L
Sbjct: 486 NGKDYPVSQDPASYISIDRKWKKGDKVVVEMPMRITVEQL----PDKSNYYSIFYGPVTL 541

Query: 629 AGHTSGE 635
           A  T  E
Sbjct: 542 AAKTGTE 548


>gi|445497812|ref|ZP_21464667.1| hypothetical protein Jab_2c14210 [Janthinobacterium sp. HH01]
 gi|444787807|gb|ELX09355.1| hypothetical protein Jab_2c14210 [Janthinobacterium sp. HH01]
          Length = 789

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 178/548 (32%), Positives = 266/548 (48%), Gaps = 41/548 (7%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L+   L DV L  S  L  AQ+T+L YLL ++ D L+  F + A LP    +YG WE+  
Sbjct: 29  LQLFPLADVRLGDSPFL-EAQRTDLHYLLEMEPDRLLAPFLREAGLPPKQPSYGNWES-- 85

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
           + L GH  GHYLSA A M+AST +  +  +++  V  L  CQ + G GY+   P      
Sbjct: 86  TGLDGHLGGHYLSALALMYASTGDEEVLRRLNYFVAELKRCQERNGNGYIGGIPDGSAAW 145

Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
                 EL     ++   W P+Y +HK+ AGL D Y  A NA A  M   M ++      
Sbjct: 146 QAIARGELHVDNFSVNGKWVPWYNLHKVYAGLRDAYAYAGNADARAMLVSMSDW----AL 201

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
           ++ +  S E+    L  E GGMN+VL  +  +T   K++ LA  F     L  L    D 
Sbjct: 202 ELTSHLSEEQMQAMLRSEHGGMNEVLADVAQMTGQKKYMDLAVRFSHQAILRPLEEGKDQ 261

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L+  HANT IP VIG +   ++TG   ++    FF   V    + A GG S +E + D +
Sbjct: 262 LTGLHANTQIPKVIGFKHIGDMTGRRDWQQAAQFFWQTVRDHRTVAIGGNSVKEHFHDDR 321

Query: 388 RLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
                +   E  ETC TYNMLK++  LF    + +Y DYYERAL N +LS QR  + G  
Sbjct: 322 DFLPMVDEVEGPETCNTYNMLKLTELLFLGDAKGSYTDYYERALYNHILSSQR-PDSGGF 380

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
           +Y  P+     +      +     + WCC G+GIES +K G+ IY         LY+  +
Sbjct: 381 VYFTPM-----RPNHYRVYSQVDKAMWCCVGSGIESHAKYGEFIYAHRGDQ---LYVNLF 432

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
           I S+ +W+S  V + Q  +     D   R T+T    +      ++ +R P W      +
Sbjct: 433 IPSTLNWRSQGVTITQ-ANRFPDED---RSTITVQGSKAF----TMKIRYPEWVARGALR 484

Query: 567 ASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
            ++NG+ +P     + ++S    W   DK+ IQLP+    E +    P+ ++  A+L GP
Sbjct: 485 ITVNGKPVPADAGADRYVSLRRIWRDGDKVDIQLPMKTHLEQM----PDKSNYYAVLHGP 540

Query: 626 YLLAGHTS 633
            +LA  T+
Sbjct: 541 IVLAAKTN 548


>gi|402300545|ref|ZP_10820034.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus alcalophilus
           ATCC 27647]
 gi|401724312|gb|EJS97686.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus alcalophilus
           ATCC 27647]
          Length = 761

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 169/519 (32%), Positives = 272/519 (52%), Gaps = 41/519 (7%)

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
           ++YLL LD+D LV  F + ASL    + YGGWE   + + GH +GH+LSA+A M+ +T N
Sbjct: 19  MDYLLFLDIDRLVAPFYEAASLAPKKQRYGGWEE--TGISGHSLGHWLSAAAYMYRNTMN 76

Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD-----SFEA----LKPVWAPYYTIH 233
             +K+K++  +  L   Q+     ++  FP+  F+     +FE     L   W P+Y++H
Sbjct: 77  RALKDKINKAIDELEYIQSVHDRNFIGGFPSTCFEKVFTGNFEVDHFTLAGHWVPWYSMH 136

Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
           K+ AGL+D Y L  N +AL + T + ++    V+      +  +    L  E GGMNDV+
Sbjct: 137 KLFAGLIDVYKLVKNEKALSVVTKLADW----VESGTVRLTEAQFQKMLICEHGGMNDVM 192

Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
             LY +T +  +L LA  F +   L  L+ + D L   HANT IP VIG+   Y++T + 
Sbjct: 193 AELYLLTQNQTYLQLAIRFCEQQILEPLSNRRDLLEGKHANTQIPKVIGAAKLYDITKEE 252

Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD-TLGSENEETCTTYNMLKVSRH 412
            YK   TFF   V    SY  GG S  E +    R++D TLG +  ETC TYNMLK++ H
Sbjct: 253 KYKTAATFFWQEVTRVRSYIIGGNSINEHF---GRVSDETLGVQTTETCNTYNMLKLTAH 309

Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
           LF W ++  Y D+YERAL N +L+ Q   + G+  Y +    G  K      + +  +SF
Sbjct: 310 LFLWEQKSEYYDFYERALYNHILASQ-DPDSGMKAYFVSTEPGHFKV-----YHSPEDSF 363

Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
           WCC GTG+E+ ++  + IY++ +     L++  +I+S    +   + L  + D   S   
Sbjct: 364 WCCTGTGMENPTRYSEHIYYQRDDE---LFVNLFIASQLQLEEKELRLKLETDFPHSGRV 420

Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS-LNGQNLPLPPPGNFLSATERWSY 591
            L++      ++  G+  S++LR+P W   NG  +  +N +   L     +++ + RW  
Sbjct: 421 QLKV------EEGDGRFLSIHLRIPYWI--NGKVSIFVNKKQTFLTDKKGYVTLSRRWKA 472

Query: 592 NDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
            D++ +  PL L +   +DD     +    ++GP +LAG
Sbjct: 473 GDRVEVDFPLGLHSYIAKDD----PNKVGFMYGPIVLAG 507


>gi|373955475|ref|ZP_09615435.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
           18603]
 gi|373892075|gb|EHQ27972.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
           18603]
          Length = 782

 Score =  254 bits (649), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 171/545 (31%), Positives = 272/545 (49%), Gaps = 44/545 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           LK   L +V L    +   A+  +L+Y++ L  D L+  + + A L    ++Y  WEN  
Sbjct: 24  LKTFRLQEVKL-LPGIFNDAENADLKYMMQLSPDKLLAPYLREAGLKPKAESYTNWEN-- 80

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
           S L GH  GHYLSA A M+AST +    ++++ ++  L  CQ+K G GY+   P   EL+
Sbjct: 81  SGLDGHIGGHYLSALAMMYASTGDKQALDRLNYMIAELKICQDKNGNGYVGGVPGSKELW 140

Query: 217 DSF-----EALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
            +       A+   W P+Y IHK  AGL D Y  A N  A    +K A W V        
Sbjct: 141 AAVMQGDVGAINKKWVPFYNIHKTFAGLRDAYTYAGNETAKVMLIKFADWFV-------- 192

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            + T  + ++    L  E GG+N+VL  +Y++T D K+L  A+ F     L  L    D 
Sbjct: 193 MIATSITPQKMQEMLKTEHGGVNEVLADVYALTGDKKYLTAAYSFSHQAILEPLEQGQDK 252

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L++ HANT IP VIG +   +VT D  Y     FF   V    + A GG S RE +    
Sbjct: 253 LNNLHANTQIPKVIGFKRISDVTADSNYNKAAQFFWQTVVQHRTVAIGGNSVREHFNPSN 312

Query: 388 RLADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
             +  + +E   ETC TYNMLK++  L+     ++Y DYYERAL N +LS +R    G  
Sbjct: 313 DFSSMITTEQGPETCNTYNMLKLTEDLYLSDPRVSYIDYYERALYNHILSTER--PGGGF 370

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
           +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY  ++ NV   ++  +
Sbjct: 371 VYFTPMRPGHYRV-----YSQPQTSMWCCVGSGMENHAKYGEMIYAHDQNNV---FVNLF 422

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
           I S+ +WK   +VL Q  +    +    + ++T ++ +  G   ++N+R P W ++   +
Sbjct: 423 IPSTLNWKQKGLVLTQHTN----FPEEEKTSITINAVRP-GAF-AINIRYPSWVHTGALK 476

Query: 567 ASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
            ++NG  + +    + ++S    W   D + + LP+   TE +    P+  + +A+L GP
Sbjct: 477 VTVNGTPIKVSAKSSAYVSINRVWKKGDVIGVTLPMQTTTEQL----PDGLNYEAVLHGP 532

Query: 626 YLLAG 630
            +LA 
Sbjct: 533 IVLAA 537


>gi|294775898|ref|ZP_06741397.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
 gi|294450267|gb|EFG18768.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
          Length = 783

 Score =  254 bits (649), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 172/543 (31%), Positives = 267/543 (49%), Gaps = 47/543 (8%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           + DV L  +S    A+  ++ YLL +D D L+  + K A L    + Y  WEN  + L G
Sbjct: 33  VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
           H  GHYLSA + M+A+T N  IK ++  ++  L  CQ+  G GYL   P   +++   E 
Sbjct: 90  HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIED 149

Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
                    L   W P Y IHKI AGL D  +   N +A    +K+  WM+        +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTGNKEAKEMLVKLTDWMI--------R 201

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           +++  S E+    L  E GG+N+    + +IT D ++L LAH F     L  L  Q D L
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
           +  HANT IP VIG +   ++ G+  +     +F + V    S   GG S RE +     
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321

Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
            +  L SE   ETC TYNML++++ L+  + +  + DYYERAL N +LS Q   + G  +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHFMDYYERALYNHILSTQDPVQGG-FV 380

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  P+  G  +      +     SFWCC G+G+E+ ++ G+ IY  ++ N   LY+  +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S+  W  G + + Q+     ++      TL  S ++   + + L  R+P WT       
Sbjct: 433 PSTLRW--GDIQIEQQ----TAFPDEEETTLVISPEKGKKEFTLL-FRIPEWTKPEALCL 485

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           S+NG+   +     ++S    WS  DK+ ++LP+ LR  A+ D    Y    +IL+GP +
Sbjct: 486 SVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541

Query: 628 LAG 630
           LA 
Sbjct: 542 LAA 544


>gi|452750721|ref|ZP_21950468.1| Putative glycosyl hydrolase of unknown function (DUF1680) [alpha
           proteobacterium JLT2015]
 gi|451961915|gb|EMD84324.1| Putative glycosyl hydrolase of unknown function (DUF1680) [alpha
           proteobacterium JLT2015]
          Length = 744

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 181/577 (31%), Positives = 270/577 (46%), Gaps = 51/577 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A + N EYL+ LD D L+ ++R +A L   G  YGGWE+    + GH +GHYLSA A   
Sbjct: 9   AVERNREYLMSLDPDRLLHNYRTSAGLAPKGDVYGGWES--DTIAGHTLGHYLSALALTH 66

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----ELFDSFEA----------- 221
           A T +     + + +V  L+  Q   G GY++ F       E+ D  E            
Sbjct: 67  AQTGDEESCRRANYIVGELATVQAAHGDGYVAGFTRKRPDGEIVDGKEIFPEIMAGDIRS 126

Query: 222 ----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
               L   W P Y  HK+  GL D   L  N  AL +A  + +Y    + ++      E+
Sbjct: 127 AGFDLNGCWVPLYNWHKLYTGLYDVADLCGNRTALPIAVALGDY----IDRMFAALDDEQ 182

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
               L  E GG+N+    LY+ T + + L L         L  L    D L++FHANT +
Sbjct: 183 VQTVLACEYGGLNESFAELYARTGERRWLRLGERIYDNKVLDPLTRGEDRLANFHANTQV 242

Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
           P +IG    YE+T  P       FF D V   HSY  GG + RE++ +P  ++  +  + 
Sbjct: 243 PKLIGLARLYELTSKPAQGAAAEFFWDTVTKRHSYVIGGNADREYFSEPNSISKHITEQT 302

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
            E C +YNMLK++RHL+ W    A  D+YERA  N +LS Q+  E G   YM PL  G +
Sbjct: 303 CEHCNSYNMLKLTRHLYSWRPRSALFDFYERAHLNHILS-QQHPETGGFSYMTPLMSGTA 361

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KS 515
           +  S  G     ++FWCC GTG+ES +K GDSI+++ +     L +  YI ++ +W  + 
Sbjct: 362 REYSEPG----KDAFWCCVGTGMESHAKHGDSIFWQGDD---ALIVNLYIPAAANWRPRG 414

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
             V L  +     S        LTF+   + G+   + LR+P W  S      +NG+ + 
Sbjct: 415 ASVRLETRYPEEGS------ANLTFTELAKPGRF-PVALRVPAWAES--VDVRVNGKAVA 465

Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
                 +++ + RW   D+L I +P+ LR E   DD      + A+L GP +LA      
Sbjct: 466 AKVEDGYVTVSRRWQAGDRLAIAMPMRLRIEPTADD----PDMIALLRGPMVLAADLGPA 521

Query: 636 WDIKTGTARSL--SALISPIPPSFNAQLVTFTQESGN 670
            +   G A +L  S L++   P   +     TQ  G 
Sbjct: 522 EEEFDGAAPALVGSDLLAKFVPEAGSATAFATQGIGR 558


>gi|404254065|ref|ZP_10958033.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas sp. PAMC
           26621]
          Length = 646

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 192/614 (31%), Positives = 287/614 (46%), Gaps = 59/614 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE-NP 157
           L+   L DV L +   L  AQ+    YLL LD D ++ +FR  A L      YGGWE +P
Sbjct: 46  LQPFDLADVDLGEGPFL-HAQRKTEAYLLSLDPDRMLHAFRVNAGLKPKAAVYGGWESDP 104

Query: 158 I---SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           I      +GH +GHYLSA A  + ST     ++++  +   L+ CQ+   +G + AFP  
Sbjct: 105 IWADINCQGHTLGHYLSACALAYRSTRKPAFRQRIDHIARELAACQDAAKSGLVCAFPKG 164

Query: 215 ---LFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
              +            P+YT+HK+ AGL D  +LAD+A++    L++A W V        
Sbjct: 165 PALVAAHLRGDAITGVPWYTLHKVFAGLRDATLLADSAESRAVLLRLADWAV-------- 216

Query: 268 KVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
            V T    +  + ++ E E GGMN+V   LY +T +P +  +A  F     L  LA   D
Sbjct: 217 -VATRPLSDAQFETMLETEHGGMNEVFADLYLMTGNPDYRTMAERFSHKALLTPLAAGRD 275

Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
            L   HANT +P ++G Q  +E TG P Y     FF   V  + S+ATGG    E ++  
Sbjct: 276 QLDGLHANTQLPKIVGFQRVFEATGTPHYHEAAAFFWRTVALTRSFATGGHGDNEHFFPM 335

Query: 387 KRL-ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
                    ++  ETC  +NMLK++R LF    +  YADYYER L NG+L+ Q   + G+
Sbjct: 336 AEFDKHVFSAKGSETCGQHNMLKLTRALFLQDPQAEYADYYERTLYNGILASQ-DPDTGM 394

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
           + Y      G  K      + T  +SFWCC GTG+E+  K  DSIYF ++     LY+  
Sbjct: 395 VTYFQGARPGYMKL-----YHTPEHSFWCCTGTGMENHVKYRDSIYFHDD---KALYVNL 446

Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--- 562
           ++ S+  W+   V L Q+        P   +  T     +V    +L LR P W+ S   
Sbjct: 447 FVPSAVRWREKGVALRQETR--FPDAPTTTLHWTVERPTDV----TLQLRHPRWSRSAIV 500

Query: 563 --NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
             NG +A+ +        PG+++     W   D  T++L L++  E + D  P    I A
Sbjct: 501 LVNGVEAARSDT------PGSYVKLARTWHSGD--TVELRLAM--EVVPDQAPAAPDIVA 550

Query: 621 ILFGPYLLAGHTSGEWDIKTGTARSLSALISPIP-PSFNAQLVTFTQESGNSTFVMSNSN 679
             +GP +LAG    E     G A     +++      +NA LVT     GN   + +   
Sbjct: 551 FSYGPMVLAGVLGRE-----GLAPGADVIVNERKYGEYNAGLVTVPTLVGNPATLAAQVR 605

Query: 680 QSITMEEFPVSGTD 693
           ++    EF +   D
Sbjct: 606 KADGPLEFTIPAAD 619


>gi|336425130|ref|ZP_08605160.1| hypothetical protein HMPREF0994_01166 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336013039|gb|EGN42928.1| hypothetical protein HMPREF0994_01166 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 628

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 175/549 (31%), Positives = 272/549 (49%), Gaps = 62/549 (11%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPGKA----YGGWENPISELRGHFVGHYLSASAQMWAST 180
           Y++ L+   L+ +F   +   T  +A    +GGWE P  +LRGHF+GH+LSA+A  + +T
Sbjct: 32  YMMHLENRFLLLNFNLESGRDTSAEAIEGMHGGWEFPTCQLRGHFLGHWLSAAAMHYHAT 91

Query: 181 HNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLL 240
            +  +K K  T+V  L+ECQ + G  + +  P +        K VWAP+YTIHK+  GLL
Sbjct: 92  GDRELKAKADTLVEELAECQKENGGKWAAPIPEKYLYRIAEGKQVWAPHYTIHKVFMGLL 151

Query: 241 DQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSIT 300
           D Y  A NA AL++A    ++FY+  +     +S +     L+ ETGGM ++  +LY+IT
Sbjct: 152 DMYEYAGNAIALEIAENFADWFYDWTKD----FSRDEMDDILDFETGGMLEIWVQLYAIT 207

Query: 301 HDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGT 360
              K+  L   + +      L    D L++ HANT IP +IG    Y+VTGD  ++ I  
Sbjct: 208 GKDKYAALMERYYRGRLFDPLLKGEDVLTNMHANTTIPEIIGCARAYDVTGDEKWRKIAE 267

Query: 361 FFMDI-VNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
            + D+ V     YATGG +  E W   K+L   LG + +E CT YNM++++  LFRW+ +
Sbjct: 268 NYWDLAVTQRGQYATGGQTCGEIWSPKKKLGARLGLKGQEHCTVYNMIRLAGFLFRWSLD 327

Query: 420 IAYADYYERALTNGVLS-------IQRG-TEP----GVMIYMLPLGRGVSKARSTHGWGT 467
            AY DY E+ L NG+++       +  G T P    G++ Y LP+  G  K     GW +
Sbjct: 328 PAYLDYQEKLLYNGLMAQAYWQSNLSHGFTSPYPSKGLLTYFLPMQAGGRK-----GWSS 382

Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS--SFDWKSGHVVLNQKVD 525
           K   F+CC+GT +++ +     IY++ E +   LYI QY+ S  SF      V + QK D
Sbjct: 383 KTGDFFCCHGTLVQANAAFNRGIYYQSEDS---LYICQYLDSQVSFSVNDSRVTILQKAD 439

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLS------------------------SLNLRMPVWTY 561
           P+        +  T S++Q V + +                        +L LR+P W  
Sbjct: 440 PLTGSS---HLASTSSARQSVLEDTRKYPSQPDCLVPCLKMELEKETEMTLQLRIPGWLA 496

Query: 562 SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
                   + +         F+     W   D + I LP +++T  +    PE  +  A 
Sbjct: 497 GEAVILINDTEVYRSNDSCLFVPLKRVWKDGDIIRILLPKAVKTFPL----PEDENTVAF 552

Query: 622 LFGPYLLAG 630
           L+GP +LAG
Sbjct: 553 LYGPVVLAG 561


>gi|302872476|ref|YP_003841112.1| hypothetical protein COB47_1852 [Caldicellulosiruptor obsidiansis
           OB47]
 gi|302575335|gb|ADL43126.1| protein of unknown function DUF1680 [Caldicellulosiruptor
           obsidiansis OB47]
          Length = 587

 Score =  253 bits (647), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 172/544 (31%), Positives = 271/544 (49%), Gaps = 46/544 (8%)

Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH 181
           NL     L+   + WSF        P   +GGWE+P  +LRGHF+GH+LSA+A+++AS  
Sbjct: 39  NLLQNFYLESGIMSWSF-------LPQDIHGGWESPTCQLRGHFLGHWLSAAARIYASFG 91

Query: 182 NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLD 241
           +  IK K   +V  L  CQ + G  ++ + P + F+     K VWAP+YT+HK   GL+D
Sbjct: 92  DEEIKGKADYIVDELERCQKENGGEWVGSIPEKYFEWMARGKWVWAPHYTVHKTFMGLVD 151

Query: 242 QYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITH 301
            Y    N +AL++A     +FY    +    +S E+    L+ ETGGM ++   LY+IT 
Sbjct: 152 MYKYTSNQKALEIADRWANWFY----RWSGQFSREKMDDILDYETGGMLEIWAELYNITK 207

Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY-KLIGT 360
           D K+  L   + +      L    D L+  HANT IP + G+   +EVTG+  + K++ +
Sbjct: 208 DSKYKELMERYYRGRLFDRLLNGEDVLTGRHANTTIPEIHGAARVWEVTGEEKFRKIVES 267

Query: 361 FFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEI 420
           ++ + V     + TGG +  E W    R+ + LG  N+E C  YNM++++  LFRWT + 
Sbjct: 268 YWREAVEERGYFCTGGQTLGEVWTPKHRIRNYLGPTNQEHCVVYNMIRLAEFLFRWTGDK 327

Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
            Y+DY ER + NG+ + QR  + G++ Y LPL  G  K      WGT  N FWCC+GT +
Sbjct: 328 KYSDYIERNIYNGLFAQQR-LKDGMVTYFLPLMPGSQKR-----WGTPTNDFWCCHGTLV 381

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
           ++ +   D IY++      G+ I Q+I S   WK      + K + I     Y R   +F
Sbjct: 382 QAHTIYNDIIYYKTPN---GVVISQFIPSFVTWK------DDKGNGITIKQYYGRRQESF 432

Query: 541 SSKQEVGQLS-----------SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERW 589
           +   E  ++             L +R P W  +   + ++N          +++  T RW
Sbjct: 433 AYTAEKDEICIEVQCKDPIEFELAIRKPWW--AKKIEVAVNEDLNYGVDDSSYIKLTRRW 490

Query: 590 SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSAL 649
           + +DK+ I    ++ T  + DD P+     A + GP +LAG       I     R +  +
Sbjct: 491 N-SDKIKITFYKTVETCPMPDD-PQQV---AFMVGPVVLAGLCERRRKIYI-NGRKIEEV 544

Query: 650 ISPI 653
           I PI
Sbjct: 545 IVPI 548


>gi|302422424|ref|XP_003009042.1| secreted protein [Verticillium albo-atrum VaMs.102]
 gi|261352188|gb|EEY14616.1| secreted protein [Verticillium albo-atrum VaMs.102]
          Length = 635

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 181/532 (34%), Positives = 263/532 (49%), Gaps = 43/532 (8%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
           Q   L Y+  +DVD L++ FR+T  LP  G +  GGW+ P    R HF GH+L+A +  W
Sbjct: 65  QDRTLNYIKFVDVDRLLYVFRQTHGLPLQGAQPNGGWDAPDFPFRSHFQGHFLNAWSYCW 124

Query: 178 ASTHNATIKEKMSTVVFSLSECQ---NKIG--TGYLSAFPTELFDSFE--ALKPVWAPYY 230
           A   +   +++ S     L++CQ   +K G   GYLS FP    ++ E   L     PYY
Sbjct: 125 AVLRDEACRDRASYFATELAKCQGNNDKAGFNPGYLSGFPESEIEAVEKRTLSNGNVPYY 184

Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
           +IHK +AGLLD +    +  A  +   M  +   R  K+    S  +    ++ E GGMN
Sbjct: 185 SIHKTMAGLLDVWRHIGDETARDVLLGMAGWVDLRTGKL----SYSQMQTMMSTEFGGMN 240

Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           +V+  ++  T D + L +A  FD       LA   D L+  HANT +P  IG+   Y+ T
Sbjct: 241 EVMADIFHQTGDERWLTVAQRFDHASVFDPLAGNRDSLNGLHANTQVPKWIGAAREYKAT 300

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
           G   Y  I     +I   +H+YA G  S  E +  P  +A  L  +  E C TYNMLK++
Sbjct: 301 GTTRYSDIAHNAWNITVQAHTYAIGANSQSEHFRPPNAIASYLDEDTAEACNTYNMLKLT 360

Query: 411 RHLFRWTKEIA---YADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKARST 462
           R L  W  + +   Y D+YE+AL N  +  Q  +   G + Y   L     RGV  A   
Sbjct: 361 REL--WVMDPSNSKYFDFYEQALINHAIGQQDPSSAHGHVTYFTSLNPGGHRGVGPAWGG 418

Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
             W T + + WCC GT +E+ +KL DSIYF +E +   LY+  Y  S  +W    V + Q
Sbjct: 419 GTWSTDYGTAWCCQGTALETNTKLMDSIYFYDESS---LYVNLYAPSRLNWTQRKVTVLQ 475

Query: 523 KVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP--LPPP 579
           + D P       L+ T T + K   G    L LR+P+W  S GA  ++NGQ L      P
Sbjct: 476 ETDFP-------LQETSTLTVKG--GGDWDLRLRIPIW--SKGATIAINGQALDGVETVP 524

Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
           G + +    W   D +TI LP++L T +  DD P   S+ A+ +GP +LA +
Sbjct: 525 GTYATIKRSWGEEDIVTITLPMALHTISA-DDEP---SVAALAYGPVVLAAN 572


>gi|16126789|ref|NP_421353.1| hypothetical protein CC_2550 [Caulobacter crescentus CB15]
 gi|221235569|ref|YP_002518006.1| membrane-bound glycosyl hydrolase [Caulobacter crescentus NA1000]
 gi|13424115|gb|AAK24521.1| conserved hypothetical protein [Caulobacter crescentus CB15]
 gi|220964742|gb|ACL96098.1| membrane-bound glycosyl hydrolase [Caulobacter crescentus NA1000]
          Length = 786

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 172/539 (31%), Positives = 260/539 (48%), Gaps = 47/539 (8%)

Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYL 170
           + S+  +AQ  N  YL+ L  D L+ +F   A LP     YGGWE     + GH +GHYL
Sbjct: 57  KPSIFAQAQGANRAYLVSLQPDRLLHNFHLGAGLPVKAPVYGGWE--AQSIAGHTLGHYL 114

Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLS-------AFPTELFDSFEALK 223
           SA A   A+  +  + ++++  V  L+  Q   G GY+        A P      FE L+
Sbjct: 115 SACALQVANDGDPVLSQRLAYTVAQLARVQAAHGDGYVGGTTRWGQADPVGGKAVFEELR 174

Query: 224 ------------PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
                         W P YT HKI AGLLD + LA    AL +A  +  Y       ++ 
Sbjct: 175 RGDIRANRFSLNDGWVPIYTWHKIHAGLLDAHRLAATPGALDVALGLAGYLAT----ILE 230

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
             + ++    L  E GG+ +     Y++T DP+ L +A        +  LA   D L+  
Sbjct: 231 GLNDDQVQAILVAEHGGLCEAYAETYALTGDPRWLNIARRLRHRELVDPLAQGRDELAGL 290

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           HANT IP +IG    YEV GDP       FF   V   HSYA GG S RE +  P  +A 
Sbjct: 291 HANTQIPKIIGLARLYEVAGDPAEARTARFFHQTVTRRHSYAIGGNSDREHFGPPDAIAT 350

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
            L     E C +YNMLK++R L+ W  + A  D YERA  N +++ QR ++ G+ +Y +P
Sbjct: 351 RLSETTCEACNSYNMLKLTRRLWSWAPDGALFDDYERAQLNHIMAHQRPSD-GMFVYFMP 409

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           +  G  ++ S     T  +SFWCC G+G+ES +K  DSI++        LY+  +I+S  
Sbjct: 410 MAAGGRRSYS-----TPEDSFWCCVGSGMESHAKHADSIWWRGGQT---LYLNLFIASRL 461

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           D       ++  +D        + +T+T + +     L  + LR+P W  +   + S+NG
Sbjct: 462 DLPGDDFAID--LDTAFPQSGQVDLTVTRAPR----GLREIALRLPAWCAA--PRLSVNG 513

Query: 572 QNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
              P+   G+ +   + RW   D++T+ LP+++R E   DD     ++ A L GP +LA
Sbjct: 514 APTPIQTRGDGYARLSRRWKAGDRVTLMLPMAVRAEPTPDD----PNLVAFLSGPLVLA 568


>gi|312135764|ref|YP_004003102.1| hypothetical protein Calow_1766 [Caldicellulosiruptor owensensis
           OL]
 gi|311775815|gb|ADQ05302.1| protein of unknown function DUF1680 [Caldicellulosiruptor
           owensensis OL]
          Length = 587

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 168/550 (30%), Positives = 279/550 (50%), Gaps = 43/550 (7%)

Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPT----PGKAYGGWENPISELRGHFVGHYLSASAQ 175
           + N  Y+L L  ++L+ +F   + + +    P   +GGWE+P  +LRGHF+GH+LSA+A+
Sbjct: 26  KLNRSYMLSLKTENLLQNFYLESGIMSWSFLPQDIHGGWESPTCQLRGHFLGHWLSAAAR 85

Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
           ++A+  +  IK K   +V  L  CQ + G  ++ + P + F+     K VWAP+YT+HK 
Sbjct: 86  IYANFGDEEIKGKADYIVDELERCQKENGGEWVGSIPEKYFEWMARGKWVWAPHYTVHKT 145

Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYR 295
             GL+D Y    N +AL++      +FY    +    +S E+    L+ ETGGM ++   
Sbjct: 146 FMGLVDMYKYTSNQKALEIVDRWANWFY----RWSGQFSREKMDDILDYETGGMLEIWAE 201

Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY 355
           LY+IT D K+  L   + +      L    D L+  HANT IP + G+   +EVTG+  +
Sbjct: 202 LYNITKDIKYRDLMERYYRGRLFDRLLNGEDVLTGRHANTTIPEIHGAARVWEVTGEEKF 261

Query: 356 -KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLF 414
            K++ +++ + V     + TGG +  E W   +++ + LG  N+E C  YNM++++  LF
Sbjct: 262 RKIVESYWREAVEERGYFCTGGQTLGEVWTPKQKIKNYLGPTNQEHCVVYNMIRLAEFLF 321

Query: 415 RWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC 474
           RWT +  Y+DY ER + NG+ + QR  + G++ Y LPL  G  K      WGT  N FWC
Sbjct: 322 RWTGDKRYSDYIERNIYNGLFAQQR-LKDGMVTYFLPLMPGSQKR-----WGTPTNDFWC 375

Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
           C+GT +++ +   D IY++ +    G+ I Q+I S   WK      + K + I     Y 
Sbjct: 376 CHGTLVQAHTIYNDIIYYKGQN---GIVISQFIPSFVTWK------DDKGNDITIKQYYG 426

Query: 535 RMTLTFSSKQEVGQLS-----------SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
           R   +F+   +  ++             L +R P W      + ++N          +++
Sbjct: 427 RRQESFAYTAKKDEICIEIQCKNPIEFELAIRKPWWAMK--IEVAVNEDLYYSIDDSSYI 484

Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTA 643
              +RW+ NDK+ I    ++ T  + DD P+     A + GP +LAG       I T   
Sbjct: 485 QLMQRWN-NDKVKITFYKTVETCPMPDD-PQQV---AFMIGPVVLAGLCENRKKI-TING 538

Query: 644 RSLSALISPI 653
           + +  +I PI
Sbjct: 539 KEIKDVIIPI 548


>gi|302670053|ref|YP_003830013.1| glycoside hydrolase [Butyrivibrio proteoclasticus B316]
 gi|302394526|gb|ADL33431.1| glycoside hydrolase [Butyrivibrio proteoclasticus B316]
          Length = 780

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 204/625 (32%), Positives = 299/625 (47%), Gaps = 58/625 (9%)

Query: 98  FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
            LKE  L  V ++       A   ++ YL  LD + L+  F + A L      Y GWEN 
Sbjct: 1   MLKEFDLTQVCVNDEYCA-NALNKDVAYLKSLDPERLLAGFYENAGLTPKKIRYSGWENM 59

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEK-----MSTVVFSLSECQNK--------IG 204
           +  + GH +GHYL+A+AQ +A+       +K     + T+V  L ECQ           G
Sbjct: 60  L--IGGHTLGHYLTAAAQGYANPGTRKEDKKALFDIIKTLVDGLLECQEHSQGKKGFVFG 117

Query: 205 TGYLSAFPTEL-FDSFE-----ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
              + +   EL FD  E      +   W P+YT+HKIL GL+  +V      ALK+A  +
Sbjct: 118 AIIMDSNNVELQFDHVEHGRTNIITESWVPWYTMHKILDGLVSTFVFTGYEPALKVAEGI 177

Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
            ++ YNR     + +S E H   L+ E GGMND LY+LY +T   +HL  AH FD+    
Sbjct: 178 GDWTYNRA----SGWSEETHKTVLSIEYGGMNDALYKLYRLTGKKEHLEAAHAFDEEELF 233

Query: 319 GFLAL-QADYLSHFHANTHIPIVIGSQMRYEVTGDPL--YKLIGTFFMDIVNASHSYATG 375
             +A   A+ L++ HANT IP  +G+  RY   GD    Y      F D+V   H+YATG
Sbjct: 234 KKVATGDANVLNNRHANTTIPKFLGALQRYMTLGDVAGEYLTYVQKFWDMVVERHTYATG 293

Query: 376 GTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVL 435
           G S  E + +   L     + N ETC TYNMLK+SR LFR T +  YADYYE    N +L
Sbjct: 294 GNSEWEHFGEDFVLDAERTNCNNETCNTYNMLKMSRDLFRITGDKKYADYYENTFINAIL 353

Query: 436 SIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEE 495
           S Q   E G+ +Y  P+  G  K      +GT F+ FWCC GTG+E+F+KL DSIYF ++
Sbjct: 354 SSQN-PESGMTMYFQPMATGYYKV-----YGTPFDKFWCCTGTGMENFTKLNDSIYFLDD 407

Query: 496 GNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
            +V    +  YISS        + L QK     S  P     L F+   E    + L  R
Sbjct: 408 ESV---IVNMYISSVVCDSKKKLTLTQK-----SLIPKGNTAL-FTINLEEPVKTKLRFR 458

Query: 556 MPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
           +P W  +   +A  +G+       G F   T   ++ND   I++   + T  +    P+ 
Sbjct: 459 VPDWAVNATCKALSSGKTYQAEADGYF---TVEETFNDGDQIEISFEMHT--VVKRLPDC 513

Query: 616 ASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
            ++ A  +GP LL+     E  I   T   ++     IP +  A     T + G+ +  +
Sbjct: 514 ENVFAFKYGPVLLSADLGCENMIDGTTGVDVT-----IPTNKIAGKEYLTVQDGSVSDYI 568

Query: 676 SNSNQSITMEE----FPVSGTDAAL 696
           ++ ++ +  +     F ++GTD  L
Sbjct: 569 ADIDKHLIRKGDELCFTLTGTDREL 593


>gi|293375008|ref|ZP_06621302.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
 gi|292646370|gb|EFF64386.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
          Length = 763

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 180/536 (33%), Positives = 277/536 (51%), Gaps = 46/536 (8%)

Query: 107 VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
           V L++ S+   +Q    +YLL LDV+ L+    + AS   P  +YGGWE+   E++GH +
Sbjct: 6   VRLEKDSLFEISQAVGKQYLLDLDVERLLAPIYEGASQIPPKPSYGGWES--LEIKGHSI 63

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF---------- 216
           GHYLSA   M+ +T +  +KE+M  ++ + S  Q     GYL  F +  F          
Sbjct: 64  GHYLSALTCMYEATKDLELKERMDYIIETFSLLQR--ADGYLGGFLSTPFEQVFTGEFHV 121

Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
           D F +L   W P+Y+IHKI AGL+D Y +  N +AL +   + ++ Y   +    + S E
Sbjct: 122 DHF-SLSHYWVPWYSIHKIYAGLMDAYQIGKNVEALNILKKLADWAYEGSR----LMSDE 176

Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
           +    L  E GGMN+V+  LY IT D ++L LA  F +   +  LA   D L   HANT 
Sbjct: 177 QFQRMLICEYGGMNEVMAELYEITQDERYLYLAKRFTQHLIMDPLAAGVDDLQGRHANTQ 236

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
           IP V+G+   YEVTGD  Y  +  FF + V    SY  GG S+ E +       + L  E
Sbjct: 237 IPKVLGAAKLYEVTGDDYYYRVAKFFFETVVLHRSYVIGGNSSGEHFGPSD--TEALSRE 294

Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
             ETC TYNM+K++++LF+WTK+  Y D+ ERA  N +L+ Q     G  IY      G 
Sbjct: 295 AAETCNTYNMIKLAKYLFKWTKDSKYIDFIERATYNHILASQ-DPHTGCKIYFTSNYPGH 353

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
            K      +GTK +SFWCC GTG+E+  +    I+F+E+ +    Y+  +++SSF  +  
Sbjct: 354 FKV-----YGTKEDSFWCCTGTGMENPGRYTHHIFFKEDED---FYVNLFMASSFVKEDE 405

Query: 517 HVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQL-SSLNLRMPVWTYSNGAQASLNGQNL 574
            + +  + D PI +      + L F   +E  QL  ++ +R+P W  +   +    GQ+ 
Sbjct: 406 QLKVVLQTDFPISN-----VVKLVF---EEANQLFLNVKIRVPYWL-NAPIEVRFKGQSY 456

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
                G +L  ++ +  +D++ I LP+ L  E +  D P      A ++GP +LA 
Sbjct: 457 EGNGQG-YLMISDTFHADDEIEIVLPMGLH-EYVSMDDPHKV---AFMYGPVVLAA 507


>gi|336321977|ref|YP_004601945.1| hypothetical protein Celgi_2884 [[Cellvibrio] gilvus ATCC 13127]
 gi|336105558|gb|AEI13377.1| protein of unknown function DUF1680 [[Cellvibrio] gilvus ATCC
           13127]
          Length = 781

 Score =  253 bits (646), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 194/581 (33%), Positives = 270/581 (46%), Gaps = 73/581 (12%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWE-- 155
           L EVSL +      SV  RAQQ  ++      VD ++  FR+ A+L   G  A GGWE  
Sbjct: 91  LTEVSLGE------SVFTRAQQQMVDLARAYPVDRVLVVFRRNANLDVRGASAPGGWEEL 144

Query: 156 NPISE---------------------LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVF 194
            P  +                     LRGH+ GH+LS  A  +A+T +  I +K+   V 
Sbjct: 145 GPAPDEQRWGPAEYVRGQNTRGAGGLLRGHYGGHFLSMLAMAYATTGDQAILDKVDDFVD 204

Query: 195 SLSECQNKIGT-------GYLSAFPTELFDSFEALKP---VWAPYYTIHKILAGLLDQYV 244
            L EC+  +         G+L+A+    F + EA  P   +WAP+YT HKILAGL+D Y 
Sbjct: 205 GLEECRAALAATGKYSHPGFLAAYGEWQFSALEAYAPYGEIWAPWYTCHKILAGLIDAYR 264

Query: 245 LADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDP 303
              +A AL++A  +  + + R+    T   +ER W   +  E GGMND L  LY+++   
Sbjct: 265 YTGSALALQLAEGLGRWTHARL-SACTPEQLERMWGIYIGGEAGGMNDALVDLYTLSAAA 323

Query: 304 KH---LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGT 360
                L  A LFD    +   A   D L+  HAN HIP  +G       TGD  Y     
Sbjct: 324 DRDDFLAAAALFDLRSLVTACAQDRDTLNGKHANMHIPTFVGYAKLGAWTGDATYTAATR 383

Query: 361 FFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEI 420
            F  ++     YA GGT   E W     +A  +G  N E+C  YNMLKV+R LF   ++ 
Sbjct: 384 NFFGMIVPGRMYAHGGTGEGEMWGPANTVAGDIGPRNAESCAAYNMLKVARTLFFEQQDP 443

Query: 421 AYADYYERALTNGVLSIQR---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
           AY DYYER + N +L  +R    T     +YM P+G G  K       GT      CC G
Sbjct: 444 AYMDYYERTVLNHILGGKRDQASTTSPQNLYMFPVGPGARKEYGNGNIGT------CCGG 497

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           TG+ES  K  DSI+F    +   L++  Y+ S   W S  + + Q+ D           T
Sbjct: 498 TGLESPVKYQDSIWFRSADD-SALWVNLYVPSELRWTSRGLRIVQEGDYPND------ET 550

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYS-----NGAQASLNGQNLPLPPPGNFLSATERWSYN 592
           +T    +  G+L  L LR+P W  S     NGA  +          PG +LS    W+  
Sbjct: 551 VTLRIAEGAGEL-DLRLRVPAWATSFVVAVNGATVASTAAGTAT--PGTYLSVDRTWAAG 607

Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           D++TI L L LR E    DRP+   IQ++  GP +L+  +S
Sbjct: 608 DQVTITLALPLRAEPTI-DRPD---IQSLQRGPVVLSALSS 644


>gi|390943351|ref|YP_006407112.1| hypothetical protein Belba_1756 [Belliella baltica DSM 15883]
 gi|390416779|gb|AFL84357.1| hypothetical protein Belba_1756 [Belliella baltica DSM 15883]
          Length = 785

 Score =  253 bits (645), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 183/611 (29%), Positives = 298/611 (48%), Gaps = 58/611 (9%)

Query: 109 LDQSSVL----WRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGH 164
           LDQ  +L      AQ+ + +Y+L +DVD L+  + K A +    + YG WE+  + L GH
Sbjct: 32  LDQVRLLDSPFKNAQEVDKKYILEMDVDRLLAPYMKDAGIEWIAENYGNWED--TGLDGH 89

Query: 165 FVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE-- 220
             GHYLSA + M+AST +  IK ++  ++  L   Q+K   GY+   P   ++++     
Sbjct: 90  IGGHYLSALSMMYASTGDIEIKSRLDYMIEQLKLAQDKNANGYIGGVPNGQKIWEEIRVG 149

Query: 221 -------ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMY 273
                  +L   W P Y IHKI AGL D Y++A  A A  M   + ++FY+  +     +
Sbjct: 150 NIKAGSFSLNDRWVPLYNIHKIYAGLKDAYLIAGIADAKPMLIALSDWFYDLTEG----F 205

Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
           S  +    L  E GG+N+V   + ++T +PK+L LA        L  L+ + D L+  HA
Sbjct: 206 SEAQFQEILISEHGGLNEVFADVSAMTGNPKYLELAKKMSHNLILDPLSKRQDNLTGMHA 265

Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
           NT IP VIG Q   +++ +  +    T+F + V    S + GG S RE +      +  L
Sbjct: 266 NTQIPKVIGFQRIAQLSDEAKWNNSATYFWENVTNQRSVSIGGNSVREHFHPKDDFSPML 325

Query: 394 GSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
            S+   ETC TYNM+++S  LF  + +  Y DYYERAL N +LS Q  T+ G  +Y  P+
Sbjct: 326 SSDQGPETCNTYNMMRLSEKLFESSPDRKYIDYYERALYNHILSSQHPTKGG-FVYFTPM 384

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
                + +    +     +FWCC G+G+E+ +K G  IY  +E     L++  +I+S   
Sbjct: 385 -----RPQHYRVYSQPHENFWCCVGSGLENHAKYGQVIYAHKEDE---LFVNLFIASELS 436

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
           W+   + L QK D   S       TL F  K +  +   L +R P W      +  +NG+
Sbjct: 437 WEEKGIKLTQKTDFPFS----ESTTLQFDHKGK--KEFKLKIRYPDWVKGGAMEVKVNGK 490

Query: 573 NLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
           + P+      ++    +W   D++++ LP+S + E + D  P +AS    + GP +LA  
Sbjct: 491 SFPISLSKDGYVVIDRKWKSKDQVSVTLPMSTKVEYLADGSP-WAS---FVHGPIVLAAE 546

Query: 632 TSGEWDIK---TGTARSLSALISPIPPSFNAQLVTFTQE-------SGNSTFVMS----- 676
           T G+ D+K      +R        + P F   ++  T+E       + N  F ++     
Sbjct: 547 T-GKEDLKGVFADDSRMGHVASGKMIPIFQTAILKQTEEKISPAKSNDNFNFYLAENQFH 605

Query: 677 NSNQSITMEEF 687
           N N+S+++  F
Sbjct: 606 NQNESVSLVPF 616


>gi|399025507|ref|ZP_10727503.1| hypothetical protein PMI13_03476 [Chryseobacterium sp. CF314]
 gi|398077884|gb|EJL68831.1| hypothetical protein PMI13_03476 [Chryseobacterium sp. CF314]
          Length = 791

 Score =  253 bits (645), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 169/536 (31%), Positives = 260/536 (48%), Gaps = 38/536 (7%)

Query: 113 SVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSA 172
           SV  +A QT+ +Y+L +D D L+  + K A L      Y  WEN  + L GH  GHY+SA
Sbjct: 37  SVFSKAMQTDEKYILSMDADRLLAPYLKEAGLKPKKANYPNWEN--TGLDGHIGGHYISA 94

Query: 173 SAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEA 221
            A M+AST +A +K+++  ++  L  CQN    GYLS  P             +  +   
Sbjct: 95  LALMYASTGDAKVKQRLDYMIDELERCQNLSENGYLSGVPNGKKIWKEIAGGNIRAATFG 154

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
           L   W P Y IHKI +GL D Y  AD+ +A KM   + ++    V  V++   ++     
Sbjct: 155 LNDRWVPLYNIHKIYSGLRDAYWYADSGKAKKMLIRLTDWMVGEVS-VLSDAQIQN---M 210

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
           L  E GG+N+V   +Y IT +PK+L LAH F     L  L    D  +  HANT IP VI
Sbjct: 211 LRSEHGGLNEVFADVYDITKNPKYLRLAHRFSHLAILNPLLNGEDKFTGIHANTQIPKVI 270

Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEET 400
           G +   ++  +  +     FF   V    S   GG S  E +      +  + S E  ET
Sbjct: 271 GFKRIADLENNKEWSNAADFFWINVTQKRSAVIGGNSVSEHFNPINDFSGMIKSIEGPET 330

Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
           C TYNMLK+S+ L+    + +Y DYYERAL N +LS Q   E G  +Y  P+  G  +  
Sbjct: 331 CNTYNMLKLSKELYATNPKSSYIDYYERALYNHILSTQ-NPEKGGFVYFTPMRPGHYRV- 388

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
               +     SFWCC G+G+E+ +K G+ IY   + +   LY+  +I S   W    +VL
Sbjct: 389 ----YSQPETSFWCCVGSGMENHAKYGEMIYAHSDED---LYVNLFIPSILKWSEKKMVL 441

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPG 580
            Q+ +         ++     SK ++    ++ LR P W+ ++    S+N +N+ +P   
Sbjct: 442 RQENN--FPESASTKLIFDVVSKSDI----NMKLRAPEWSDASQITISVNHKNINVPIDA 495

Query: 581 N-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
             + S   +W   D + +++P+ L  E +    P+++   A  +GP +LA     E
Sbjct: 496 EGYFSVKRKWKKGDVIEMKMPMHLSAEQL----PDHSDYFAFKYGPIVLAAKYGKE 547


>gi|388259955|ref|ZP_10137121.1| hypothetical protein O59_001162 [Cellvibrio sp. BR]
 gi|387936316|gb|EIK42881.1| hypothetical protein O59_001162 [Cellvibrio sp. BR]
          Length = 803

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 174/553 (31%), Positives = 268/553 (48%), Gaps = 56/553 (10%)

Query: 106 DVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHF 165
           DV L  S  L +AQ TN +YL+ LD + L+  FR+ A LP   + YG WE+  + L GH 
Sbjct: 31  DVQLLDSPFL-QAQNTNKDYLMALDTEKLLAPFRREAGLPFK-ETYGNWES--TGLDGHM 86

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELF--- 216
            GHY++A A ++A+T +  + ++++ V+  L +CQ+K+G+GY+   P      +E+    
Sbjct: 87  GGHYVTALALLYAATKDDVVLQRLNYVIAELKKCQDKLGSGYIGGIPDSNTMWSEIARGD 146

Query: 217 ---DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMY 273
              D+F +    W P+Y +HKI AGL D Y+ A N  A KM   + ++     +K+    
Sbjct: 147 IRADNF-STNERWVPWYNLHKIYAGLRDAYLYAGNEDAKKMLVRLSDWTIELTKKL---- 201

Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
           S E+    L  E GGMN+V   +  IT D K+L LA  F     L  L  Q D L+  HA
Sbjct: 202 SPEQMQTMLRTEHGGMNEVFVDVAEITGDKKYLKLAEAFSHQAILQPLEKQQDQLTGLHA 261

Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
           NT IP +IG +   + T +  +     FF   V    + A GG S +E + D       +
Sbjct: 262 NTQIPKIIGFKKVADATHNESWNKAAEFFWQTVVDKRTVAIGGNSVKEHFHDSHDFTAMI 321

Query: 394 GS-ENEETCTTYNMLKVSRHLFRWTKE--------------IAYADYYERALTNGVLSIQ 438
              E  ETC TYNMLK+++ LF  +++              + Y DYYERAL N +LS Q
Sbjct: 322 EDVEGPETCNTYNMLKLTQLLFLSSRDNSAADMKKSKNNPAMKYVDYYERALYNHILSSQ 381

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGN 497
                G++ +         K    H      +  WCC G+GIES SK  + IY  + +  
Sbjct: 382 HPQTGGLVYFTSMRPNHYRKYSQVH------DGMWCCVGSGIESHSKYAEFIYARDLDKK 435

Query: 498 VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
           +P +++  +I S   W    +   Q      +    L M        E  +   L LR P
Sbjct: 436 IPEVFLNLFIPSRMTWAEQGISFTQNTQFPDAETTELVM--------ETSKRFRLQLRYP 487

Query: 558 VWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
            W  +   Q  +NG+ + +   PG++++   RW   DK+ + LP+  R E +    P+ +
Sbjct: 488 RWVEAGQLQLRVNGKTVSVKQQPGDYIALERRWKKGDKVQLALPMKPRLEKL----PDGS 543

Query: 617 SIQAILFGPYLLA 629
           +  A+L GP +LA
Sbjct: 544 NYYAVLHGPIVLA 556


>gi|315498357|ref|YP_004087161.1| hypothetical protein Astex_1338 [Asticcacaulis excentricus CB 48]
 gi|315416369|gb|ADU13010.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
           48]
          Length = 797

 Score =  252 bits (643), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 180/593 (30%), Positives = 284/593 (47%), Gaps = 49/593 (8%)

Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
           + + L  V L  S  L  A + N  YLL L  D  ++++ K A +P  G+ YGGWE+   
Sbjct: 39  RPIPLTQVRLLPSPFL-EAVEANRRYLLFLSPDRFLYNYHKFAGMPVKGEIYGGWES--D 95

Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-------- 211
            + G  +GHYLSA + M A T +     ++  ++  L + Q   G GY++ F        
Sbjct: 96  TIAGEGLGHYLSALSLMHAQTGDNECVARIHYIISELEKVQAAHGDGYVAGFMRKRKDGS 155

Query: 212 ---PTELFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
                E+F    A         L   W P+Y  HK+ AGLLD        + + +A  + 
Sbjct: 156 IVDGKEIFPEIMAGDIRSAGFDLNGCWVPFYNWHKLFAGLLDAQAYCGVDRGIPVAEKLG 215

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
            Y    ++ V       +    L+ E GG+N+    LYS T++P+ L L+        L 
Sbjct: 216 GY----IEMVFAALDDAQTQKVLDCEHGGINESFAELYSRTNNPRWLKLSERLYHHRMLD 271

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            LA + D L++ HANT +P +IG    YE+T  P Y+   +FF + V   HS+  GG + 
Sbjct: 272 PLAAREDKLANNHANTQVPKLIGLARLYELTQKPQYQTASSFFWERVVNHHSFVIGGNAD 331

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
           RE++++P  ++  +  +  E+C TYNMLK++RHL+ W+ + A+ DYYERA  N +L+ Q 
Sbjct: 332 REYFFEPDTISAHITEQTCESCNTYNMLKLTRHLYSWSPKAAWFDYYERAHLNHMLAHQN 391

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
             + G+  YM+PL  G ++     G+  + NSFWCC  +GIE+ SK GDSIY+ +E    
Sbjct: 392 -PKTGMFTYMMPLMSGAAR-----GFSDEENSFWCCVLSGIETHSKHGDSIYWHQEKT-- 443

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            L++  +I S  +W             + +  PY        S+    +  ++ +R+P W
Sbjct: 444 -LFVNLFIPSKVNWAEQKAAFE-----LTTKYPYEGQVALKLSQLSGAKTFTVAVRIPGW 497

Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
             ++  Q  +NG+         +   T +W   D +T+ LPL LR E    D      + 
Sbjct: 498 AEASTLQ--VNGKPALAKMNDGYALITRKWRAGDVVTLDLPLKLRFETAAGDN----KVV 551

Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSL--SALISPIPPSFNAQLVTFTQESGN 670
           A+L GP +LA           G A +L  S LI    P   A+ V  ++ SG 
Sbjct: 552 ALLRGPMVLAADLGPADQPWGGDAPALVGSDLIGSFYPVSAAEAVYVSKGSGR 604


>gi|90020425|ref|YP_526252.1| Acetyl-CoA carboxylase, biotin carboxylase [Saccharophagus
           degradans 2-40]
 gi|89950025|gb|ABD80040.1| protein of unknown function DUF1680 [Saccharophagus degradans 2-40]
          Length = 803

 Score =  252 bits (643), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 171/534 (32%), Positives = 262/534 (49%), Gaps = 41/534 (7%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ  N+EY+L L  D L+  F K A LP   + YG WE+    L GH  GHYL+A +  +
Sbjct: 49  AQDKNVEYVLALQPDKLLAPFLKEAGLPVKAENYGNWES--QGLDGHIGGHYLTALSLAY 106

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE---------ALKPVW 226
           A+T +  + ++++ ++  L   QNK   GY+        L+D+           AL   W
Sbjct: 107 AATGDKRLLDRLNYMLNELERAQNKNSNGYIGGVRNGKALWDNIAKGDIRADLFALNDYW 166

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            P+Y +HKI AGL D Y+   + QA  M   + E+       +    + E+    L  E 
Sbjct: 167 VPWYNLHKIYAGLRDAYIYTGSEQAKAMLIGLGEW----TIALTADLNDEQIEKMLTTEY 222

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGMN+V   + +IT D ++L LA  F     L  L  + D L+  HANT IP V+G Q  
Sbjct: 223 GGMNEVFADMAAITGDKRYLSLAKQFSHKKILNPLLQKRDALNGLHANTQIPKVVGYQRV 282

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
            E+TGD  +     +F   V  + + A GG S RE + D +  A  +   E  ETC TYN
Sbjct: 283 AELTGDEEWHKAADYFWHHVVNNRTVAIGGNSVREHFHDSEDFAPMINDVEGPETCNTYN 342

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
           MLK+SR LF     + Y DY+ERAL N +LS Q   E G ++Y  P+     + +    +
Sbjct: 343 MLKLSRMLFSVNPSVDYVDYFERALYNHILSSQH-PETGGLVYFTPM-----RPQHYRMY 396

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
                + WCC G+GIE+  K G+ IY ++  N   LY+  +I+S+  W+   V L Q+  
Sbjct: 397 SQVDTAMWCCVGSGIENHVKYGEFIYAKQNNN---LYVNLFIASTLVWQEKGVHLTQE-- 451

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLS-----SLNLRMPVWTYSNGAQASLNGQNLPL-PPP 579
              ++    R TLT +   +V         ++++R P W  +      +NG+ + +    
Sbjct: 452 --NTFPDSNRTTLTVALDSKVKSSKKHAKFTMHIRYPRWAQAGKVVVKVNGKPINVKAKA 509

Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           G ++    RW   D + + LP+++  EA+ D    Y    A+L+GP +LA  T 
Sbjct: 510 GEYIEINRRWHNGDNVELSLPMNIALEALPDQSDYY----AVLYGPIVLAAKTQ 559


>gi|315499577|ref|YP_004088380.1| hypothetical protein Astex_2584 [Asticcacaulis excentricus CB 48]
 gi|315417589|gb|ADU14229.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
           48]
          Length = 791

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 170/540 (31%), Positives = 265/540 (49%), Gaps = 52/540 (9%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A   N  YLL ++ D L+ ++RK A L    + YGGWE     + GH +GHYLSA + M 
Sbjct: 56  AVDVNEAYLLSVNPDRLLHNYRKFAGLTPKAELYGGWER--DTIAGHSLGHYLSAISLMH 113

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP-----------TELFDSFEA----- 221
           A T NA +K + + ++  L+  Q   G GY++ F             E+F    A     
Sbjct: 114 AQTGNAALKLRAAYIIDELALVQGAHGDGYVAGFTRKRKDGRVVDGKEIFPELMAGDIRS 173

Query: 222 ----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
               L   W P Y  HK+ +GL D        +AL +A  +  Y    + KV    + ++
Sbjct: 174 AGFDLNGCWVPLYNWHKLYSGLFDAQTFCGYDKALTVAVGLGVY----IDKVFRALTDDQ 229

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
               LN E GG+ND    LY  T +P+ L LA        +  L    D L++ HANT +
Sbjct: 230 VQTVLNCEFGGLNDSFAELYRRTENPRWLALAQRLHHKRIIDPLTAGEDKLANNHANTQV 289

Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
           P ++G    +EVTG+   +   +FF + V   HSY  GG + RE++++P  ++  +    
Sbjct: 290 PKLLGEATLFEVTGNENNRKAASFFWERVVNHHSYVIGGNADREYFFEPDTISKHITEAT 349

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
            E C TYNMLK++RHL+ W  +  Y DY+ERA  N VL+ Q+  + G+  YM PL  G +
Sbjct: 350 CEHCNTYNMLKLTRHLYGWEPDARYFDYFERAHFNHVLA-QQNPKTGMFSYMTPLFTGAA 408

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KS 515
           +     G+    +++ CC+G+G+ES +K G+SI+++       L++  YI ++  W  K 
Sbjct: 409 R-----GFSDPVDNWTCCHGSGMESHAKHGESIFWQSSDT---LFVNLYIPATARWATKG 460

Query: 516 GHVVLNQKVDPIVSWDPYL-RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
            H+ L+          PY   +  + SS +   +   L LR+P W  +  A  +LN + +
Sbjct: 461 AHLRLDTGY-------PYDGNIVFSLSSLRRPTKF-KLALRVPAW--AKRADLTLNNKPV 510

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
                G +L     W+  D + + LPL LR EA +DD      + A+L GP +LA    G
Sbjct: 511 KATRDGGYLVIDRAWAVGDTVRLSLPLDLRFEATRDD----GKVVAVLRGPLVLAADLGG 566


>gi|392964292|ref|ZP_10329713.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
 gi|387847187|emb|CCH51757.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
          Length = 739

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 176/554 (31%), Positives = 272/554 (49%), Gaps = 53/554 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           ++  +L +V L  S    +AQ  +L+Y+L L+ D L+  +   A LP   + YG WE+  
Sbjct: 1   MQPFTLQEVRL-TSGPFKQAQDVDLKYILALNPDKLLAPYLIDAGLPLKAQRYGNWES-- 57

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
             L GH  GHYLSA A M+AST    +K+++  ++  L+ CQ K G GY+   P      
Sbjct: 58  VGLDGHIGGHYLSALAMMYASTGEPELKKRLDYMIGELARCQAKNGNGYVGGIPQGKVFW 117

Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
                 ++  S   L   W P Y IHK+ AGL D Y  A N QA ++   + ++F     
Sbjct: 118 DRIHKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDAYAYAGNGQAKQVLIGLGDWFV---- 173

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
           ++I   S E+    L  E GG+N+    LY +T+D K+L  A        L  L  Q D 
Sbjct: 174 ELIKPLSDEQIQQVLRTEHGGINETFADLYILTNDKKYLETAQRLSHRALLYPLLEQQDK 233

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L+  HANT IP VIG +    +TG   +     +F   V+ + S A GG S RE +    
Sbjct: 234 LTGLHANTQIPKVIGFEKIATLTGKTDWSEAAMYFWRNVSQTRSVAFGGNSVREHFNPTT 293

Query: 388 RLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
             +  L S +  ETC ++NML++S+ LF    +++Y D+YER L N +LS Q   E G  
Sbjct: 294 DFSQVLRSNQGPETCNSFNMLRLSKALFLDKNDVSYLDFYERTLYNHILSSQH-PEKGGF 352

Query: 447 IYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
           +Y  P+       R  H   +     S WCC G+G+E+ +K G+ IY     +   L++ 
Sbjct: 353 VYFTPI-------RPNHYRVYSQSETSMWCCVGSGLENHTKYGELIYSHSTND---LFVN 402

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS-- 562
            +I S+ +WK   V LNQ+ +      PY   T     +Q   Q+ S+ +R P W  +  
Sbjct: 403 LFIPSTLNWKEKGVRLNQRTNF-----PYENGT-ELVVQQAKPQVFSVQIRYPKWAENLE 456

Query: 563 ---NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
              NG Q ++NG+      P  +++ + +W   D +T++   S R E +    P+ ++  
Sbjct: 457 VLVNGKQQAVNGK------PSEYVAISRKWKAGDIITVRFKTSTRLEQL----PDGSNWA 506

Query: 620 AILFGPYLLAGHTS 633
           A + GP +LA  TS
Sbjct: 507 AFVHGPIVLAAKTS 520


>gi|395493738|ref|ZP_10425317.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas sp. PAMC
           26617]
          Length = 646

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 191/614 (31%), Positives = 286/614 (46%), Gaps = 59/614 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE-NP 157
           L+   L DV L +   L  AQ+    YLL LD D ++ +FR  A L      YGGWE +P
Sbjct: 46  LQPFDLADVDLGEGPFL-HAQRKTEAYLLSLDPDRMLHAFRVNAGLKPKAAVYGGWESDP 104

Query: 158 I---SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           I      +GH +GHYLSA A  + ST     ++++  +   L+ CQ+   +G + AFP  
Sbjct: 105 IWADINCQGHTLGHYLSACALAYRSTRKPAFRQRIDHIARELAACQDAARSGLVCAFPKG 164

Query: 215 ---LFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
              +            P+YT+HK+ AGL D  ++AD+A++    L++A W V        
Sbjct: 165 PALVAAHLRGDAITGVPWYTLHKVFAGLRDATLMADSAESRAVLLRLADWAV-------- 216

Query: 268 KVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
            V T    +  + ++ E E GGMN+V   LY +T +P +  +A  F     L  LA   D
Sbjct: 217 -VATRPLSDAQFETMLETEHGGMNEVFADLYLMTGNPDYRTMAERFSHKALLTPLAAGRD 275

Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
            L   HANT +P ++G Q  +E TG P Y     FF   V  + S+ATGG    E ++  
Sbjct: 276 QLDGLHANTQLPKIVGFQRVFEATGTPHYHEAAAFFWRTVALTRSFATGGHGDNEHFFPM 335

Query: 387 KRL-ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
                    ++  ETC  +NMLK++R LF    +  YADYYER L NG+L+ Q   + G+
Sbjct: 336 AEFDKHVFSAKGSETCGQHNMLKLTRALFLQDPQAEYADYYERTLYNGILASQ-DPDTGM 394

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
           + Y      G  K      + T  +SFWCC GTG+E+  K  DSIYF ++     LY+  
Sbjct: 395 VTYFQGARPGYMKL-----YHTPEHSFWCCTGTGMENHVKYRDSIYFHDDK---ALYVNL 446

Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--- 562
           ++ S+  W+   V L Q+        P   +  T     +V    +L LR P W+ S   
Sbjct: 447 FVPSAVRWREKGVALRQETR--FPDAPTTTLHWTVERPTDV----TLQLRHPRWSRSAIV 500

Query: 563 --NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
             NG +A+ +        PG+++     W   D  T++L L++  E + D  P    I A
Sbjct: 501 LVNGVEAARSDT------PGSYVKLARTWHSGD--TVELRLAM--EVVPDQAPAAPDIVA 550

Query: 621 ILFGPYLLAGHTSGEWDIKTGTARSLSALISPIP-PSFNAQLVTFTQESGNSTFVMSNSN 679
             +GP +LAG    E     G A     +I+      +NA  VT     GN   + +   
Sbjct: 551 FSYGPMVLAGVLGRE-----GLAPGADVIINERKYGEYNAGPVTVPTLVGNPATLAAQVR 605

Query: 680 QSITMEEFPVSGTD 693
           ++    EF +   D
Sbjct: 606 KADGPLEFTIPAAD 619


>gi|380512705|ref|ZP_09856112.1| hypothetical protein XsacN4_15862 [Xanthomonas sacchari NCPPB 4393]
          Length = 799

 Score =  251 bits (640), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 175/556 (31%), Positives = 266/556 (47%), Gaps = 49/556 (8%)

Query: 95  PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
           PG  ++ + L  V L + S+   + QTN  YLL L+ D L+ +F + A LP  G  YGGW
Sbjct: 51  PGR-VQALPLQQVTL-KPSLFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGAVYGGW 108

Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           E     + GH +GHYLSA A+M A T +  ++E++  +V  L+  Q +   GY+  F T 
Sbjct: 109 EG--DTIAGHTLGHYLSALAKMHAQTRDPVLRERIDYIVAELARAQAQDPDGYVGGF-TR 165

Query: 215 LFDSFEA---------------------LKPVWAPYYTIHKILAGLLDQYVLADNAQALK 253
             D  E                      L   W+P YT HK+ AGLLD + LA + QAL+
Sbjct: 166 KNDKGEIEGGKAVLEDVRRGIIKGSKFNLNGSWSPLYTQHKLFAGLLDAHALAGSKQALE 225

Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
           +   +  Y       V       +    L+ E GG+N+    L + T D + + +     
Sbjct: 226 VLLPLAAY----TAGVFDALDHAQMQTLLDTEFGGLNESYIELGARTGDARWVAIGKRLR 281

Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYA 373
               +   A   D L H HANT +P  IG   ++EV GD        FF + V A +SY 
Sbjct: 282 HEKVIDPAAAGRDELPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTAHYSYV 341

Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
            GG + RE++ +P  +A  L  +  E C +YNMLK++RHL++WT +  Y DYYER L N 
Sbjct: 342 IGGNADREYFQEPDTIAAFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNH 401

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
            ++ Q     G+  YM P+  G  +     G+  KF+SFWCC G+G+E+ ++ GD+IY++
Sbjct: 402 TMAAQHPAT-GMFTYMTPMISGGER-----GFSDKFDSFWCCVGSGMEAHAQFGDAIYWQ 455

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
              +   LY+  YI S  DW    + L  ++D  V  +  +R+ +  + ++   +L    
Sbjct: 456 ---DATSLYVNLYIPSRLDWTERDLAL--ELDSGVPDNGKVRLQVLRAGQRAPRRLL--- 507

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
           LR+P W     A   +NG          +L+    W   D + + L   LR E    D  
Sbjct: 508 LRVPAWCQGRYA-LRVNGSPARAALVDGYLTLERDWRAGDVIDLDLATPLRLEHAAGD-- 564

Query: 614 EYASIQAILFGPYLLA 629
             A    ++ GP  LA
Sbjct: 565 --ADTVVVMRGPLALA 578


>gi|192360871|ref|YP_001981311.1| hypothetical protein CJA_0803 [Cellvibrio japonicus Ueda107]
 gi|190687036|gb|ACE84714.1| conserved hypothetical protein [Cellvibrio japonicus Ueda107]
          Length = 802

 Score =  250 bits (639), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 178/539 (33%), Positives = 257/539 (47%), Gaps = 50/539 (9%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ TN +YL+ LDV+ L+  FR+ A LP   + YG WE+  + L GH  GHY+SA A  +
Sbjct: 49  AQNTNKQYLMALDVEKLLAPFRREAGLPYK-ETYGNWES--TGLDGHIGGHYISALALTY 105

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL------------FDSFEALKPV 225
           AST +  +  ++  V+  L +CQ+K G GYL+  P                D+F +    
Sbjct: 106 ASTGDPAVLARLEYVITELKKCQDKNGNGYLAGLPEGAGIWQEIARGDIRADNF-STNER 164

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
           W P+Y +HK  AGL D Y    N  A  M     E+ +   + +    S E+    L+ E
Sbjct: 165 WVPWYNLHKTFAGLRDAYRYTGNETAKAMLVAFSEWTWALTKDL----SDEQMQTLLHTE 220

Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
            GGMNDV   +  IT D ++L LA  F     L  L  + D L+  HANT IP VIG + 
Sbjct: 221 HGGMNDVFVDVADITGDKRYLHLAERFSHRAILQPLLEKRDALTGLHANTQIPKVIGFKR 280

Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
             +      ++    FF + V    S A GG S RE +         +   E  ETC TY
Sbjct: 281 VGDAEQLAEWQSAAEFFWETVVNKRSVAIGGNSVREHFHPQDNFHSMIEDVEGPETCNTY 340

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           NMLK++  LF       Y DYYERAL N +L  Q   + G  +Y  P+     +      
Sbjct: 341 NMLKLTEQLFLDNPLGKYGDYYERALYNHILGSQH-PQTGGFVYFTPM-----RPNHYRV 394

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEE--------EGNVPGLYIIQYISSSFDWKSG 516
           +    +  WCC G+G+ES SK  + IY             N+P +Y+  +I S  +WK  
Sbjct: 395 YSQVHDGMWCCVGSGLESHSKYAEFIYARGMKKSAGWFARNIPQVYVNLFIPSQLNWKET 454

Query: 517 HVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
            + L Q+   P V   P   + L  S +       +L+LR P W  ++  Q  +NG+   
Sbjct: 455 GIRLRQENQFPDV---PETSIVLESSGR------FTLHLRYPQWVEADTLQLRINGKVEK 505

Query: 576 L-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           +   PGN+L+   RW   DKL I+LP+    E++    P+ +S  A+L+GP +LA  T 
Sbjct: 506 ISSQPGNYLAIERRWKKGDKLDIRLPMKPHLESL----PDGSSYYAVLYGPIVLAAKTQ 560


>gi|296331240|ref|ZP_06873712.1| hypothetical protein BSU6633_09061 [Bacillus subtilis subsp.
           spizizenii ATCC 6633]
 gi|296151355|gb|EFG92232.1| hypothetical protein BSU6633_09061 [Bacillus subtilis subsp.
           spizizenii ATCC 6633]
          Length = 761

 Score =  250 bits (638), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 171/527 (32%), Positives = 264/527 (50%), Gaps = 39/527 (7%)

Query: 114 VLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSAS 173
           + + +Q    EYLL LDVD L+    +  S       YGGWE    E+ GH +GH+LSA+
Sbjct: 10  MFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAGHSIGHWLSAA 67

Query: 174 AQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------LKP 224
           + M+ ++ +  +K K    V  LS  Q     GY+S F    FD   +         L  
Sbjct: 68  SAMYQASGDEKLKRKAEYAVNELSHIQQFDEEGYISGFSRACFDEVFSGDFRVDHFSLGG 127

Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
            W P+Y++HK+ AGL+D Y L  N  AL++   + ++     +K +   + E+    L  
Sbjct: 128 SWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRMLIC 183

Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
           E GGMN+ +  LY +T +  +L LA  F     L  LA   D L   HANT IP VIG+ 
Sbjct: 184 EHGGMNEAMADLYILTKNKSYLDLAERFCHRAILQPLAEGKDELEGKHANTQIPKVIGAA 243

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
             Y++TG+  Y+    FF + V    SYA GG S  E +      ++ LG    ETC TY
Sbjct: 244 KLYDITGNEAYRNPALFFWEQVVYQRSYAIGGNSIGEHF--GAEGSEELGVTTAETCNTY 301

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           NMLK++ HLFRW  E  + DYYE AL N +LS Q   E G+  Y +    G  K      
Sbjct: 302 NMLKLTGHLFRWFHEARFTDYYENALYNHILSSQ-DPESGMKTYFVSTQPGHFKV----- 355

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
           + +  +SFWCC GTG+E+ ++   +IY  ++ +   LY+  +I S  + +   +++ Q+ 
Sbjct: 356 YCSPEDSFWCCTGTGMENPARYTQNIYHLDQDD---LYVNLFIPSQINVREKQMIITQE- 411

Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA-QASLNGQNLPLPPPGNFL 583
               S+    +  L    K+  G   +L +R+P WT  NG+ +A +NG+ +       +L
Sbjct: 412 ---TSFPAANKTKLVV--KKADGVPMTLQIRIPYWT--NGSLKAVVNGKRVQSVEKNGYL 464

Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +  + W+  D + I LP+ L     +DD  +      +++GP +LAG
Sbjct: 465 AIHKHWNTGDCIEIDLPMKLHIYQAKDDPKK----SVLMYGPVVLAG 507


>gi|398305096|ref|ZP_10508682.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus vallismortis
           DV1-F-3]
          Length = 762

 Score =  249 bits (637), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 168/528 (31%), Positives = 261/528 (49%), Gaps = 37/528 (7%)

Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLS 171
             + + +Q    EYLL LDVD L+    +  S       YGGWE    E+ GH VGH+LS
Sbjct: 8   KGMFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAGHSVGHWLS 65

Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------L 222
           A++ M+ ++ +  +K K +  V  LS  Q     GY+S F    FD   +         L
Sbjct: 66  AASAMYRASGDEELKRKTAYAVNELSHIQQFDQEGYVSGFSRACFDEVFSGDFRVDHFSL 125

Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
              W P+Y++HK+ AGL+D Y L  N  AL++   + ++     +K +   + E+    L
Sbjct: 126 GGSWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLNDEQFQRML 181

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
             E GGMN+ +  LY +T +  +L LA  F     L  LA   D L   HANT IP VIG
Sbjct: 182 ICEHGGMNEAMADLYMLTKNKAYLELAERFCHRAILQPLAEGKDELEGKHANTQIPKVIG 241

Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCT 402
           +   Y++TG+  Y+    FF + V    SYA GG S  E +      ++ LG    ETC 
Sbjct: 242 AAKLYDITGNEAYRNAALFFWEQVVYQRSYAIGGNSIGEHF--GAEGSEELGVTTAETCN 299

Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARST 462
           TYNMLK++ HLFRW +E  + DYYE AL N +L+ Q   + G+  Y +    G  K    
Sbjct: 300 TYNMLKLTAHLFRWFQESKFMDYYENALYNHILASQ-DPDSGMKTYFVSTQPGHFKV--- 355

Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
             + +  +SFWCC GTG+E+ ++    IY  +  +   LY+  +I S    +  H+++ Q
Sbjct: 356 --YCSPEDSFWCCTGTGMENPARYTKHIYHIDRDD---LYVNLFIPSQIHVREKHMLIAQ 410

Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNF 582
           +        P    T     K + G   +L++R+P W +  G +A++NG+ +       +
Sbjct: 411 ETSF-----PAAEQTRLMVKKAD-GVPMALHIRIPYWAHG-GLKAAVNGKRIQPVEKNGY 463

Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           L   + W+  D + + LP+ L     +DD  +      +++GP +LAG
Sbjct: 464 LVIHKHWNTGDCIEVDLPMKLHLYQAKDDPKK----NVLMYGPVVLAG 507


>gi|350267868|ref|YP_004879175.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus subtilis
           subsp. spizizenii TU-B-10]
 gi|349600755|gb|AEP88543.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus subtilis
           subsp. spizizenii TU-B-10]
          Length = 761

 Score =  249 bits (636), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 172/529 (32%), Positives = 267/529 (50%), Gaps = 39/529 (7%)

Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENPISELRGHFVGHYL 170
             + + +Q    EYLL LDVD L+    + A L TP K  YGGWE    E+ GH +GH+L
Sbjct: 8   KGMFYDSQMKGKEYLLFLDVDRLLAPCYE-AVLQTPKKPRYGGWE--AKEIAGHSIGHWL 64

Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA--------- 221
           SA++ M+ ++ +  +K K    V  LS  Q     GY+S F    FD   +         
Sbjct: 65  SAASAMYQASGDEELKRKAEYAVNELSHIQQFDEEGYVSGFSRACFDEVFSGDFRVDHFS 124

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
           L   W P+Y+IHK+ AGL+D Y L  N  AL++   + ++     +K +   + E+    
Sbjct: 125 LGGSWVPWYSIHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRM 180

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
           L  E GGMN+ +  L+ +T +  +L LA  F     L  LA   D L   HANT IP VI
Sbjct: 181 LICEHGGMNEAMADLFMLTKNKAYLELAERFCHRAILQPLAEGKDELEGKHANTQIPKVI 240

Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
           G+   Y++TG+  Y+    FF + V    SYA GG S  E +      ++ LG    ETC
Sbjct: 241 GAAKLYDITGNEAYRNAALFFWEQVVYQRSYAIGGNSIGEHF--GAEGSEELGVTTAETC 298

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
            TYNMLK++ HLFRW  E  + DYYE AL N +L+ Q   + G+  Y +    G  K   
Sbjct: 299 NTYNMLKLTGHLFRWFHEARFMDYYENALYNHILASQ-DPDSGMKTYFVSTQPGHFKV-- 355

Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
              + +  +SFWCC GTG+E+ ++    IY  ++ +   LY+  +I S  + +   +++ 
Sbjct: 356 ---YCSPEDSFWCCTGTGMENPARYTQHIYDIDQDD---LYVNLFIPSQINMQEKQLIIT 409

Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
           Q+     S+    +  L    K+  G   +L++R+P WT + G +A++NG+ +       
Sbjct: 410 QE----TSFPAAEKTRLVV--KKADGVPMTLHIRIPYWT-NGGLKAAVNGKRIQSVEKNG 462

Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +L   + W+  D + I LP+ L     +DD  +      +++GP +LAG
Sbjct: 463 YLVIHKHWNTGDCIEIDLPMKLHIYQAKDDPKK----SVLMYGPVVLAG 507


>gi|237708621|ref|ZP_04539102.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
 gi|229457321|gb|EEO63042.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
          Length = 783

 Score =  249 bits (635), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 170/544 (31%), Positives = 265/544 (48%), Gaps = 47/544 (8%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           + DV L  +S    A+  ++ YLL +D D L+  + K A L    + Y  WEN  + L G
Sbjct: 33  VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
           H  GHYLSA + M+A+T N  IK ++  ++  L  CQ+  G GYL   P   +++   E 
Sbjct: 90  HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149

Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
                    L   W P Y IHK+ AGL D  +   + +A    +K+  WM+        +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           +I+  S E+    L  E GG+N+    + +IT D ++L LAH F     L  L  Q D L
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
           +  HANT IP VIG +   ++ G+  +     +F + V    S   GG S RE +     
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321

Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
            +  L SE   ETC TYNML++++ L+  + +    DYYERAL N +LS Q   + G  +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDSVQGG-FV 380

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  P+  G  +      +     SFWCC G+G+E+ ++ G+ IY  ++ N   LY+  +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S+  W   H      ++   ++      TL  S ++   + + L  R+P WT     + 
Sbjct: 433 PSTLRWGDIH------IEQQTAFPDEEGTTLAVSPEKGEKEFTLL-FRVPEWTNPEALRL 485

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           S+NG+   +     ++S    WS  DK+ ++LP+ LR  A+ D    Y    +IL+GP +
Sbjct: 486 SVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541

Query: 628 LAGH 631
           LA  
Sbjct: 542 LAAQ 545


>gi|329847096|ref|ZP_08262124.1| tat twin-arginine translocation pathway signal sequence domain
           protein [Asticcacaulis biprosthecum C19]
 gi|328842159|gb|EGF91728.1| tat twin-arginine translocation pathway signal sequence domain
           protein [Asticcacaulis biprosthecum C19]
          Length = 795

 Score =  248 bits (634), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 174/556 (31%), Positives = 270/556 (48%), Gaps = 47/556 (8%)

Query: 94  LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGG 153
           LP      ++L DV L   S    A   N  YLL L+ D  + ++RK A L    + YGG
Sbjct: 36  LPQKRTTSLALGDVRL-LPSPFKTALDVNHTYLLTLEPDRFLHNYRKGAGLTPKAEKYGG 94

Query: 154 WENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP- 212
           WEN    + GH +GHYLSA + M+A T +AT+K + + V+  L+  Q   G GY++ F  
Sbjct: 95  WEN--DTIAGHSLGHYLSAISLMYAQTGDATLKARAAYVIDELALIQGMQGDGYVAGFTR 152

Query: 213 ----------TELFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALK 253
                      ELF   +A         L   W P Y  HK+  GL D        + + 
Sbjct: 153 KRPDGTIVDGKELFAEIKAGDIRSAGFDLNGCWVPLYNWHKLYTGLFDAQTFCGLNKGVV 212

Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
           +AT +  Y    +  V    + ++    LN E GG+N+    L++ T D + L LA    
Sbjct: 213 VATGLGHY----IDSVFAALNDDQVQQVLNCEFGGLNESFAELHARTGDARWLTLAERMH 268

Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYA 373
               L  +  + D L++ H+NT IP V+G    YE+TG   Y     FF + V   HSY 
Sbjct: 269 HNRVLDPMIKREDKLANIHSNTTIPKVLGLARLYEITGKADYHTASDFFWERVTGHHSYV 328

Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
            GG   RE++++P  ++  +     E C TYNML+++R L+ W  + +  DY+ERA  N 
Sbjct: 329 IGGNGDREYFFEPDTISRHITEATCEHCATYNMLRLTRFLYSWQPDASRFDYFERAHLNH 388

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
           VLS Q+  + G+  YM PL  G  +     G+    +++ CC+GTG+ES ++  +SI+++
Sbjct: 389 VLS-QQNPKTGMFSYMTPLFTGAER-----GFSDPVDNWTCCHGTGMESHARHAESIWWQ 442

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
                  L++  YI S+  W +    L  ++D    +D  +++ +T   +    +L+   
Sbjct: 443 SADT---LFVNLYIPSTAQWTTKGASL--RMDTGYPYDGGVKLAVTALRRPTRFKLA--- 494

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
           LR+P W  +  A  +LNG+       G +L     W   DK+ + LPL LR EA  D+  
Sbjct: 495 LRVPGWAKT--AAVTLNGKPAQAVRDGGYLVIDRVWQAGDKIALDLPLDLRLEATSDN-- 550

Query: 614 EYASIQAILFGPYLLA 629
               I A+L GP +LA
Sbjct: 551 --TGIVAVLRGPMVLA 564


>gi|345513549|ref|ZP_08793069.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
 gi|229437570|gb|EEO47647.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
          Length = 783

 Score =  248 bits (634), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 170/544 (31%), Positives = 265/544 (48%), Gaps = 47/544 (8%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           + DV L  +S    A+  ++ YLL +D D L+  + K A L    + Y  WEN  + L G
Sbjct: 33  VRDVRL-TASPFKHAEDMDIRYLLGMDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
           H  GHYLSA + M+A+T N  IK ++  ++  L  CQ+  G GYL   P   +++   E 
Sbjct: 90  HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149

Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
                    L   W P Y IHK+ AGL D  +   + +A    +K+  WM+        +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           +I+  S E+    L  E GG+N+    + +IT D ++L LAH F     L  L  Q D L
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
           +  HANT IP VIG +   ++ G+  +     +F + V    S   GG S RE +     
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321

Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
            +  L SE   ETC TYNML++++ L+  + +    DYYERAL N +LS Q   + G  +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  P+  G  +      +     SFWCC G+G+E+ ++ G+ IY  ++ N   LY+  +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S+  W   H      ++   ++      TL  S ++   + + L  R+P WT     + 
Sbjct: 433 PSTLRWGDIH------IEQQTAFPDEEGTTLAVSPEKGEKEFTLL-FRVPEWTNPEALRL 485

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           S+NG+   +     ++S    WS  DK+ ++LP+ LR  A+ D    Y    +IL+GP +
Sbjct: 486 SVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541

Query: 628 LAGH 631
           LA  
Sbjct: 542 LAAQ 545


>gi|346970201|gb|EGY13653.1| secreted protein [Verticillium dahliae VdLs.17]
          Length = 634

 Score =  248 bits (633), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 175/532 (32%), Positives = 259/532 (48%), Gaps = 43/532 (8%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
           Q   L Y+  +DVD L++ FR+T  LP  G +  GGW+ P    R HF GH+L+A +  W
Sbjct: 65  QDRTLSYIKFVDVDRLLYVFRQTHGLPLQGAQPNGGWDAPDFPFRSHFQGHFLNAWSYCW 124

Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE--ALKPVWAPYY 230
           A   +   +++ S     L++CQ          GYLS FP    ++ E   L     PYY
Sbjct: 125 AVLRDEECRDRASYFATELAKCQANNEQAGFNPGYLSGFPESEIEALEKRTLSNGNVPYY 184

Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
           +IHK +AGLLD +    +  A  +   M  +   R  K+    S  +    ++ E GGMN
Sbjct: 185 SIHKTMAGLLDVWRHIGDETARDVLLGMAGWVDLRTGKL----SYSQMQTMMSTEFGGMN 240

Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           +V+  ++  T D + L +A  FD       LA   D L+  HANT +P  IG+   Y+ T
Sbjct: 241 EVMADIFHQTGDERWLTVAQRFDHASVFDPLAGNRDSLNGLHANTQVPKWIGAAREYKAT 300

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
           G   Y  I     +I   +H+YA G  S  E +  P  +A  L  +  E C TYNMLK++
Sbjct: 301 GTTRYSDIARNAWNITVQAHTYAIGANSQSEHFRPPNAIASYLDEDTAEACNTYNMLKLT 360

Query: 411 RHLFRWTKEIA---YADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKARST 462
           R L  W  + +   Y D+YE+AL N  +  Q  +   G + Y   L     RGV  A   
Sbjct: 361 REL--WVMDPSNSKYFDFYEQALINHAIGQQDPSSAHGHVTYFTSLNPGGHRGVGPAWGG 418

Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
             W T + + WCC GT +E+ +KL DSIYF +E +   LY+  Y  S  +W    V + Q
Sbjct: 419 GTWSTDYGTAWCCQGTALETNTKLMDSIYFYDESS---LYVNLYAPSKLNWTQRKVTVLQ 475

Query: 523 KVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP--LPPP 579
           + + P       L+ T T + K   G    L +R+P+W  S GA  ++NGQ L      P
Sbjct: 476 ETEFP-------LQDTSTLTVKG--GGDWDLRVRIPMW--SKGATIAINGQALDGVEAAP 524

Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
           G + +    W   D +TI LP++L T +  D+     S+ A+ +GP +LA +
Sbjct: 525 GTYATIKRSWGEEDIVTITLPMALHTISANDE----PSVAALAYGPVVLAAN 572


>gi|423242461|ref|ZP_17223569.1| hypothetical protein HMPREF1065_04192 [Bacteroides dorei
           CL03T12C01]
 gi|392639254|gb|EIY33080.1| hypothetical protein HMPREF1065_04192 [Bacteroides dorei
           CL03T12C01]
          Length = 783

 Score =  248 bits (633), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 170/544 (31%), Positives = 265/544 (48%), Gaps = 47/544 (8%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           + DV L  +S    A+  ++ YLL +D D L+  + K A L    + Y  WEN  + L G
Sbjct: 33  VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
           H  GHYLSA + M+A+T N  IK ++  ++  L  CQ+  G GYL   P   +++   E 
Sbjct: 90  HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149

Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
                    L   W P Y IHK+ AGL D  +   + +A    +K+  WM+        +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           +I+  S E+    L  E GG+N+    + +IT D ++L LAH F     L  L  Q D L
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
           +  HANT IP VIG +   ++ G+  +     +F + V    S   GG S RE +     
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321

Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
            +  L SE   ETC TYNML++++ L+  + +    DYYERAL N +LS Q   + G  +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  P+  G  +      +     SFWCC G+G+E+ ++ G+ IY  ++ N   LY+  +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S+  W   H      ++   ++      TL  S ++   + + L  R+P WT     + 
Sbjct: 433 PSTLRWGDIH------IEQQTAFPDEEGTTLAVSPEKGEKEFTLL-FRVPEWTNPEALRL 485

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           S+NG+   +     ++S    WS  DK+ ++LP+ LR  A+ D    Y    +IL+GP +
Sbjct: 486 SVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541

Query: 628 LAGH 631
           LA  
Sbjct: 542 LAAQ 545


>gi|291544094|emb|CBL17203.1| Uncharacterized protein conserved in bacteria [Ruminococcus
           champanellensis 18P13]
          Length = 1075

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 192/619 (31%), Positives = 297/619 (47%), Gaps = 63/619 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENP 157
           +++ SL D+ +  +  +  A    +EYLL  D D L+  FR+ A L T G K Y GWEN 
Sbjct: 36  IEDFSLADLTMTDAYTV-NAFSKEVEYLLSFDTDRLLCGFRENAKLDTKGAKRYAGWENT 94

Query: 158 ISELRGHFVGHYLSASAQMW-----ASTHNATIKEKMSTVVFSLSECQ--NKIGTGYL-- 208
           +  + GH VGHYL+A AQ +      +   + ++ K+  ++  +  CQ  +K   G+L  
Sbjct: 95  L--IAGHSVGHYLTAVAQAYQNPTLTAAQRSALEGKIKALLDGMRVCQQNSKGKPGFLWA 152

Query: 209 ----SAFPTEL-FDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
               +A   E+ FD  E      +   W P+YT+HKI+ GL+D Y    N  A  +A+ +
Sbjct: 153 GQIKNANNVEVQFDLVEQGKTNIINESWVPWYTMHKIVQGLVDVYNATGNETAKTIASDL 212

Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCF- 317
            ++ YNR  K    +S + H   L+ E GGMND LY LY IT    H + AH FD+    
Sbjct: 213 GDWTYNRASK----WSAQTHNTVLSIEYGGMNDCLYELYEITGKDTHAVAAHYFDETNLH 268

Query: 318 LGFLALQADYLSHFHANTHIPIVIGSQMRY------EVTGDPL----YKLIGTFFMDIVN 367
              L    + L++ HANT IP  IG+  RY       V G+ +    Y      F D+V 
Sbjct: 269 EAVLKGGRNVLTNKHANTTIPKFIGALKRYIVLDGKTVNGEKIDASRYLEYAEAFWDMVT 328

Query: 368 ASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
             H+Y TGG S  E + +   L     + N ETC +YNMLK+SR LF+ T +  Y D+YE
Sbjct: 329 THHTYITGGNSEWEHFGEDDILDKERTNCNCETCNSYNMLKLSRELFKITGDRKYMDFYE 388

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLG 487
               N +LS Q   E G+  Y  P+  G  K  S     + ++SFWCC G+G+ESF+KLG
Sbjct: 389 GTYYNSILSSQN-PESGMTTYFQPMATGYFKVYS-----SPYDSFWCCTGSGMESFTKLG 442

Query: 488 DSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVG 547
           D++Y    GN   LY+  Y SS  +W+   V + Q  D  +      + T+  S     G
Sbjct: 443 DTMYM-HSGNT--LYVNMYQSSVLNWEDQKVKITQ--DSNIPESDTAKFTIDGS-----G 492

Query: 548 QLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
            L     R+P W  +     ++NG         ++   T  +   D +++ +P     E 
Sbjct: 493 SL-DFRFRIPSWK-AGKMTIAVNGTKYTYKTVNDYAQVTGDFKTGDVISVTIP----AEV 546

Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQE 667
           +  + P+  ++    +GP +L+     E   K+ T   ++    PI  S N   +T ++E
Sbjct: 547 VAYNLPDNKAVYGFKYGPVVLSAELGTENMEKSSTGMWVTIPKDPIGSSQN---ITISKE 603

Query: 668 SGNSTFVMSNSNQSITMEE 686
             + T  M+  N  +  ++
Sbjct: 604 GQSVTSFMAEINDHLVKDK 622


>gi|212691787|ref|ZP_03299915.1| hypothetical protein BACDOR_01282 [Bacteroides dorei DSM 17855]
 gi|212665688|gb|EEB26260.1| hypothetical protein BACDOR_01282 [Bacteroides dorei DSM 17855]
          Length = 783

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 170/544 (31%), Positives = 265/544 (48%), Gaps = 47/544 (8%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           + DV L  +S    A+  ++ YLL +D D L+  + K A L    + Y  WEN  + L G
Sbjct: 33  VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
           H  GHYLSA + M+A+T N  IK ++  ++  L  CQ+  G GYL   P   +++   E 
Sbjct: 90  HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149

Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
                    L   W P Y IHK+ AGL D  +   + +A    +K+  WM+        +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           +I+  S E+    L  E GG+N+    + +IT D ++L LAH F     L  L  Q D L
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
           +  HANT IP VIG +   ++ G+  +     +F + V    S   GG S RE +     
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321

Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
            +  L SE   ETC TYNML++++ L+  + +    DYYERAL N +LS Q   + G  +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  P+  G  +      +     SFWCC G+G+E+ ++ G+ IY  ++ N   LY+  +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S+  W   H      ++   ++      TL  S ++   + + L  R+P WT     + 
Sbjct: 433 PSTLRWGDIH------IEQQTAFPDEEGTTLAVSPEKGEKEFALL-FRVPEWTNPEALRL 485

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           S+NG+   +     ++S    WS  DK+ ++LP+ LR  A+ D    Y    +IL+GP +
Sbjct: 486 SVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541

Query: 628 LAGH 631
           LA  
Sbjct: 542 LAAQ 545


>gi|265755220|ref|ZP_06089990.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|423231114|ref|ZP_17217517.1| hypothetical protein HMPREF1063_03337 [Bacteroides dorei
           CL02T00C15]
 gi|423246788|ref|ZP_17227840.1| hypothetical protein HMPREF1064_04046 [Bacteroides dorei
           CL02T12C06]
 gi|263234362|gb|EEZ19952.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|392629229|gb|EIY23239.1| hypothetical protein HMPREF1063_03337 [Bacteroides dorei
           CL02T00C15]
 gi|392634665|gb|EIY28581.1| hypothetical protein HMPREF1064_04046 [Bacteroides dorei
           CL02T12C06]
          Length = 783

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 170/544 (31%), Positives = 265/544 (48%), Gaps = 47/544 (8%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           + DV L  +S    A+  ++ YLL +D D L+  + K A L    + Y  WEN  + L G
Sbjct: 33  VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
           H  GHYLSA + M+A+T N  IK ++  ++  L  CQ+  G GYL   P   +++   E 
Sbjct: 90  HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149

Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
                    L   W P Y IHK+ AGL D  +   + +A    +K+  WM+        +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           +I+  S E+    L  E GG+N+    + +IT D ++L LAH F     L  L  Q D L
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
           +  HANT IP VIG +   ++ G+  +     +F + V    S   GG S RE +     
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321

Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
            +  L SE   ETC TYNML++++ L+  + +    DYYERAL N +LS Q   + G  +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  P+  G  +      +     SFWCC G+G+E+ ++ G+ IY  ++ N   LY+  +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            S+  W   H      ++   ++      TL  S ++   + + L  R+P WT     + 
Sbjct: 433 PSTLRWGDIH------IEQQTAFPDEEGTTLAVSPEKGEKEFTLL-FRVPEWTNPEALRL 485

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           S+NG+   +     ++S    WS  DK+ ++LP+ LR  A+ D    Y    +IL+GP +
Sbjct: 486 SVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541

Query: 628 LAGH 631
           LA  
Sbjct: 542 LAAQ 545


>gi|317476834|ref|ZP_07936077.1| hypothetical protein HMPREF1016_03061 [Bacteroides eggerthii
           1_2_48FAA]
 gi|316907009|gb|EFV28720.1| hypothetical protein HMPREF1016_03061 [Bacteroides eggerthii
           1_2_48FAA]
          Length = 781

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 170/546 (31%), Positives = 270/546 (49%), Gaps = 43/546 (7%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L D+ L +S  L +AQQT+L Y++ ++ D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQDIKLLESPFL-QAQQTDLHYIMAMNPDRLLAPFLREAGLAPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF------- 216
           H  GHY+SA + M+A+T + T+  +++ ++  L   Q  +G G++   P  L        
Sbjct: 87  HIGGHYISALSMMYAATGDTTVYNRLNYMLNELHRAQQAVGNGFIGGTPGSLQLWKEIKE 146

Query: 217 -----DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
                +SF +L   W P Y IHK  AGL D Y+ A +  A +M   + ++    +  + +
Sbjct: 147 GNIRPESF-SLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIALTDW----MAGITS 201

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
             + ++    L  E GG+N++   +  IT D K+L LA  F     L  L    D+L+  
Sbjct: 202 GLTEQQMQDMLRSEHGGLNEIFADVADITGDKKYLELARRFSHKTLLEPLIGGEDHLTGM 261

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           HANT IP VIG +   ++T +  +     FF + V    S   GG S RE +        
Sbjct: 262 HANTQIPKVIGYKRIADLTQNDAWDQAARFFWNTVVNHRSVCIGGNSVREHFHPADNFTS 321

Query: 392 TLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
            L   +  ETC TYNML++++ LF+ + +I +ADYYERAL N +L+ Q+  + G  +Y  
Sbjct: 322 MLNDVQGPETCNTYNMLRLTKMLFQTSPDIRFADYYERALYNHILASQQPAKGG-FVYFT 380

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           P+  G  +      +     S WCC G+G+E+ +K G+ IY   E     LY+  +I S 
Sbjct: 381 PMRSGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAHAEDT---LYVNLFIPSR 432

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
             WK   + L Q+       +  +R  +  S+K+      SL  R P W  + GA  S+N
Sbjct: 433 LTWKEQKLTLVQESR--FPDEAQIRFRIEKSNKKTF----SLKFRYPSW--AKGASVSVN 484

Query: 571 GQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           G+   +   PG +L+   +W   D++T+ LP+ +  E I D    Y    A ++GP +LA
Sbjct: 485 GKVQDINAQPGEYLTVRRKWKAGDEITLNLPMQVTLEQIPDQEHFY----AFMYGPIVLA 540

Query: 630 GHTSGE 635
             T  E
Sbjct: 541 SPTGTE 546


>gi|217973327|ref|YP_002358078.1| hypothetical protein Sbal223_2153 [Shewanella baltica OS223]
 gi|217498462|gb|ACK46655.1| protein of unknown function DUF1680 [Shewanella baltica OS223]
          Length = 792

 Score =  247 bits (631), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 174/555 (31%), Positives = 270/555 (48%), Gaps = 57/555 (10%)

Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
           + L+DV +     L  AQQT+L Y++ +D + L+  +RK A + T  + Y  WE+  + L
Sbjct: 23  IPLNDVRITAGPFL-HAQQTDLHYIMSMDPERLLAPYRKDAGIATTAENYPNWED--TGL 79

Query: 162 RGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSF 219
            GH  GHYLSA A M+A+T +  +  +++ +V  L +CQ   G GYL   P   +L+   
Sbjct: 80  DGHIGGHYLSALALMYAATSDKAVLARLNYMVAELEKCQQAHGNGYLGGVPNSRKLWQQI 139

Query: 220 E---------ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
           E          L   W P+Y +HK+ +GL D ++  +N  A KM     ++  +   K+ 
Sbjct: 140 EQGKIEADLFTLNQAWVPWYNVHKVFSGLRDAHLYTNNPTAKKMLVHFADWMLHLSNKL- 198

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
              S E+    L  E GG+N+ L  +Y IT   K+L LA  +     L  L    D L+ 
Sbjct: 199 ---SDEQLQLMLRTEYGGLNETLADVYVITGQDKYLALAKRYTDQSLLQPLLHHEDKLTG 255

Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
            HANT IP ++G     E++ + ++     FF   V    + + GG S RE +      +
Sbjct: 256 LHANTQIPKIVGVARIAELSNNKVWLDSADFFWQQVVHKRTVSIGGNSVREHFHPSDDFS 315

Query: 391 DTLGS-ENEETCTTYNMLKVSRHLF------RWTKEIAYADYYERALTNGVLSIQRGTEP 443
             L S E  ETC TYNMLK+S+ L+          ++AY +YYERAL N +LS Q   E 
Sbjct: 316 SMLESAEGPETCNTYNMLKLSKLLYENKLLDENKADLAYIEYYERALYNHILSSQH-PEN 374

Query: 444 GVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
           G ++Y  P+       R  H   + +   S WCC G+GIE+ +K G+ IY  E  +    
Sbjct: 375 GGLVYFTPM-------RPDHYRVYSSAQQSMWCCVGSGIENHAKYGELIYASEGDD---F 424

Query: 502 YIIQYISSSFDWKSGHVVLNQKV---DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
           Y+  ++ S   W+   + L QK    D   S      +TL   ++       +LN+R P 
Sbjct: 425 YVNLFVDSEVHWQEKGITLTQKTLFPDANTS-----EITLDKDAQ------FALNVRYPQ 473

Query: 559 WTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
           W   N    S+NGQ        G ++    +W   DK++I LP+++  E I    P+ +S
Sbjct: 474 WVQHNDLTLSINGQAQKFNAVAGQYIKIKRQWHKGDKISITLPMTVTLEQI----PDRSS 529

Query: 618 IQAILFGPYLLAGHT 632
             ++L+GP +LA  T
Sbjct: 530 YYSVLYGPIVLAAKT 544


>gi|436835729|ref|YP_007320945.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
 gi|384067142|emb|CCH00352.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
          Length = 760

 Score =  247 bits (631), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 174/549 (31%), Positives = 266/549 (48%), Gaps = 43/549 (7%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           ++  +L DV L        AQ  +  Y+L L+ D L+  +   A LP     YG WE+  
Sbjct: 22  MQPFALQDVKL-TGGPFKNAQDVDQRYILALNPDKLLAPYLIDAGLPVKAPRYGNWES-- 78

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
           S L GH  GHYLSA A ++AST +A +K+++  +V  L++CQ K G GY+   P      
Sbjct: 79  SGLDGHIGGHYLSALAMLYASTGDAELKKRLDYMVDQLAQCQAKNGNGYVGGIPQGKVFW 138

Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
                 ++  S   L   W P Y IHK+ AGL D Y  A N QA ++   + ++F     
Sbjct: 139 ERIHKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDAYEYAGNQQAKQVLIGLGDWFV---- 194

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
           ++I   S E+    L  E GG+N+    LY +T+D K+L  A        L  L  + D 
Sbjct: 195 ELIKPLSDEQIQQVLRTEHGGINETFADLYILTNDKKYLETAQRISHRAILEPLLAKQDK 254

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L+  HANT IP VIG +    + G P +    T+F   V+   S A GG S RE +    
Sbjct: 255 LTGLHANTQIPKVIGFEKIAMLAGKPDWSDAATYFWQNVSQHRSVAFGGNSVREHFNPTT 314

Query: 388 RLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
             +  L S +  ETC ++NML++S+ LF    ++ Y D+YERAL N +LS Q   E G  
Sbjct: 315 DFSQVLRSNQGPETCNSFNMLRLSKALFLDKSDVTYLDFYERALYNHILSSQH-PEKGGF 373

Query: 447 IYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
           +Y  P+       R  H   +     S WCC G+GIE+ +K G+ IY     +   L++ 
Sbjct: 374 VYFTPI-------RPNHYRVYSQPETSMWCCVGSGIENHTKYGELIYSHSAND---LFVN 423

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            +I S+ +W   +V L Q+ +    +     + +  +  QE     SLN+R P W  +  
Sbjct: 424 LFIPSTVNWADKNVKLTQRTE--FPYKNESDLVIETTKPQEF----SLNIRYPKWAENLV 477

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
              +   Q +   P G +++   +W   DK+T++   S R E +    P+ ++  A + G
Sbjct: 478 VLVNGKAQAVADAPAG-YVAVARKWRAGDKVTVRFNTSTRLEQL----PDGSNWSAFVHG 532

Query: 625 PYLLAGHTS 633
           P +LA  TS
Sbjct: 533 PIVLAAKTS 541


>gi|218129947|ref|ZP_03458751.1| hypothetical protein BACEGG_01530 [Bacteroides eggerthii DSM 20697]
 gi|217988057|gb|EEC54382.1| hypothetical protein BACEGG_01530 [Bacteroides eggerthii DSM 20697]
          Length = 781

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 170/546 (31%), Positives = 270/546 (49%), Gaps = 43/546 (7%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L D+ L +S  L +AQQT+L Y++ ++ D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQDIKLLESPFL-QAQQTDLYYIMAMNPDRLLAPFLREAGLAPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF------- 216
           H  GHY+SA + M+A+T + T+  +++ ++  L   Q  +G G++   P  L        
Sbjct: 87  HIGGHYISALSMMYAATGDTTVYNRLNYMLNELHRAQQAVGNGFIGGTPGSLQLWKEIKE 146

Query: 217 -----DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
                +SF +L   W P Y IHK  AGL D Y+ A +  A +M   + ++    +  + +
Sbjct: 147 GSIRPESF-SLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIALTDW----MAGITS 201

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
             + ++    L  E GG+N++   +  IT D K+L LA  F     L  L    D+L+  
Sbjct: 202 GLTEQQMQDMLRSEHGGLNEIFADVADITGDKKYLELARRFSHKTLLEPLIGGEDHLTGM 261

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           HANT IP VIG +   ++T +  +     FF + V    S   GG S RE +        
Sbjct: 262 HANTQIPKVIGYKRIADLTQNDAWDQAARFFWNTVVNHRSVCIGGNSVREHFHPADNFTS 321

Query: 392 TLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
            L   +  ETC TYNML++++ LF+ + +I +ADYYERAL N +L+ Q+  + G  +Y  
Sbjct: 322 MLNDVQGPETCNTYNMLRLTKMLFQTSPDIRFADYYERALYNHILASQQPAKGG-FVYFT 380

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           P+  G  +      +     S WCC G+G+E+ +K G+ IY   E     LY+  +I S 
Sbjct: 381 PMRSGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAHAEDT---LYVNLFIPSR 432

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
             WK   + L Q+       +  +R  +  S+K+      SL  R P W  + GA  S+N
Sbjct: 433 LTWKEQKLTLVQESR--FPDEAQIRFRIEKSNKKTF----SLKFRYPSW--AKGASVSVN 484

Query: 571 GQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           G+   +   PG +L+   +W   D++T+ LP+ +  E I D    Y    A ++GP +LA
Sbjct: 485 GKVQDINAQPGEYLTVRRKWKAGDEITLNLPMQVTLEQIPDQEHFY----AFMYGPIVLA 540

Query: 630 GHTSGE 635
             T  E
Sbjct: 541 SPTGTE 546


>gi|224540696|ref|ZP_03681235.1| hypothetical protein BACCELL_05610 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224517692|gb|EEF86797.1| hypothetical protein BACCELL_05610 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 782

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 172/552 (31%), Positives = 271/552 (49%), Gaps = 44/552 (7%)

Query: 100 KEVS---LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
           +EVS   L DV L +S  L +AQQT+L Y++ ++ D L+  F + A L     +Y  WEN
Sbjct: 24  QEVSYFPLQDVKLLESPFL-QAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWEN 82

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TE 214
             + L GH  GHY+SA + M+A+T +  I  +++ ++  L   Q  +GTG++   P   +
Sbjct: 83  --TGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQ 140

Query: 215 LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
           L+   +A         L   W P Y IHK  AGL D Y+ A +  A +M   + ++  + 
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLVALTDWMID- 199

Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
              +    + ++    L  E GG+N+    +  IT D K+L LA  F     L  L    
Sbjct: 200 ---ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPLVKDE 256

Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
           D L+  HANT IP VIG +   ++  D  +     FF + V    S   GG S RE +  
Sbjct: 257 DRLTGMHANTQIPKVIGYKRIADLAQDQNWDHAARFFWNTVVNHRSVCIGGNSVREHFHP 316

Query: 386 PKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
                  L   +  ETC TYNML++++ L++ + +I +ADYYERAL N +L+ Q+ T+ G
Sbjct: 317 ADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNHILASQQPTKGG 376

Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
             +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY   +     LY+ 
Sbjct: 377 -FVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAHAKDT---LYVN 427

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            +I S   WK   + L Q+       +  +R  +  S K+      SL LR P W  + G
Sbjct: 428 LFIPSRLTWKDKKITLVQETR--FPDEEQIRFRVEKSKKKAF----SLKLRYPSW--AKG 479

Query: 565 AQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
           A  S+NG+       PG +L+   +W   D++T+ +P+ +  E I    P+  +  A ++
Sbjct: 480 ASVSVNGKVQETNAQPGEYLTIHRKWKAGDEITLNMPMQVALEQI----PDRENFYAFMY 535

Query: 624 GPYLLAGHTSGE 635
           GP +LA  T  E
Sbjct: 536 GPIVLASPTGTE 547


>gi|251798256|ref|YP_003012987.1| hypothetical protein Pjdr2_4277 [Paenibacillus sp. JDR-2]
 gi|247545882|gb|ACT02901.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
          Length = 605

 Score =  246 bits (629), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 172/529 (32%), Positives = 254/529 (48%), Gaps = 57/529 (10%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNAT 184
           Y+   D++ L+ +F+  A + +  +  GGWE P   LRGHFVGHYLSA A+     H+ T
Sbjct: 27  YIREFDLERLMHTFKINAGISSTAEPLGGWEAPDCGLRGHFVGHYLSACAKFAYGDHDGT 86

Query: 185 IKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD--SFEALKPVWAPYYTIHKILAGLLDQ 242
           +K     +V  +  C     +GYLSAF  E  D    E  + VWAPYYT+HKI+ GL+D 
Sbjct: 87  LKTMADEIVDVMQACAQP--SGYLSAFEEEKLDVLELEENRDVWAPYYTLHKIMQGLIDC 144

Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWY--------SLN--EETGGMNDV 292
           YV   N QAL++A  +  Y   R + +        HW          LN   E GG+ D 
Sbjct: 145 YVYLQNTQALELAVNLAHYIRRRFEYL-------SHWKIDGILRCTKLNPVNEFGGLGDS 197

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
           LY LY +T D   L LAHLFD+  +L  LA   D L   HANTH+P+++    RY++  +
Sbjct: 198 LYTLYELTGDAALLGLAHLFDRDYWLWPLAEGRDVLEDLHANTHLPMILACMHRYKIREE 257

Query: 353 PLYK---------LIGTFFMDIVNASHSYA--TGGTSAR-EFWWDPKRLADTLGSENEET 400
             YK         L+G  F +  N+S + A   GG S + E W     LAD L     E+
Sbjct: 258 DSYKKSALHFYDFLMGRTFANGNNSSKATAFIQGGVSEKAEHWGGYGELADALTGGESES 317

Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
           C  +N  K+   L  W+ EI Y D+ E    N +L+     + G+  Y  PLG    K  
Sbjct: 318 CCAHNTEKIVERLLEWSPEIGYLDHLESLKYNAILN-SASAKTGLSQYHQPLGTNAVKKF 376

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
           S       ++SFWCC G+GIE+ S+L  +I+F    N   + +  ++SS   WK   +V+
Sbjct: 377 S-----EPYHSFWCCTGSGIEAMSELQKNIWFR---NGNAILLNAFVSSKAAWKERGIVI 428

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPG 580
           +Q+     S+   L   L F + + V       LRM ++          N + + L    
Sbjct: 429 HQR----TSFPDSLISALHFETDEPV------ELRM-MFKEKAIKNIRFNDEGIHLQKEE 477

Query: 581 NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
            ++     +   D++ I++  SLR   +    P   +  A+L+G  LLA
Sbjct: 478 GYIVVERLFRNGDRMDIEIEASLRLIPL----PGSEAESALLYGNVLLA 522


>gi|423224675|ref|ZP_17211143.1| hypothetical protein HMPREF1062_03329 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392635115|gb|EIY29021.1| hypothetical protein HMPREF1062_03329 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 782

 Score =  246 bits (629), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 172/552 (31%), Positives = 271/552 (49%), Gaps = 44/552 (7%)

Query: 100 KEVS---LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
           +EVS   L DV L +S  L +AQQT+L Y++ ++ D L+  F + A L     +Y  WEN
Sbjct: 24  QEVSYFPLQDVKLLESPFL-QAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWEN 82

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TE 214
             + L GH  GHY+SA + M+A+T +  I  +++ ++  L   Q  +GTG++   P   +
Sbjct: 83  --TGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQ 140

Query: 215 LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
           L+   +A         L   W P Y IHK  AGL D Y+ A +  A +M   + ++  + 
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLVALTDWMID- 199

Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
              +    + ++    L  E GG+N+    +  IT D K+L LA  F     L  L    
Sbjct: 200 ---ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPLVKDE 256

Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
           D L+  HANT IP VIG +   ++  D  +     FF + V    S   GG S RE +  
Sbjct: 257 DCLTGMHANTQIPKVIGYKRIADLAQDQNWDHAARFFWNTVVNHRSVCIGGNSVREHFHP 316

Query: 386 PKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
                  L   +  ETC TYNML++++ L++ + +I +ADYYERAL N +L+ Q+ T+ G
Sbjct: 317 ADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNHILASQQPTKGG 376

Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
             +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY   +     LY+ 
Sbjct: 377 -FVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAHAKDT---LYVN 427

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            +I S   WK   + L Q+       +  +R  +  S K+      SL LR P W  + G
Sbjct: 428 LFIPSRLTWKEKKITLVQETR--FPDEEQIRFRVEKSKKKAF----SLKLRYPSW--AKG 479

Query: 565 AQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
           A  S+NG+       PG +L+   +W   D++T+ +P+ +  E I    P+  +  A ++
Sbjct: 480 ASVSVNGKVQETNAQPGEYLTIHRKWKAGDEITLNMPMQVALEQI----PDRENFYAFMY 535

Query: 624 GPYLLAGHTSGE 635
           GP +LA  T  E
Sbjct: 536 GPIVLASPTGTE 547


>gi|284036341|ref|YP_003386271.1| hypothetical protein Slin_1422 [Spirosoma linguale DSM 74]
 gi|283815634|gb|ADB37472.1| protein of unknown function DUF1680 [Spirosoma linguale DSM 74]
          Length = 760

 Score =  246 bits (628), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 169/531 (31%), Positives = 262/531 (49%), Gaps = 44/531 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ  +L+Y+L L+ + L+  +   A LP     YG WE+  S L GH  GHYLSA A M+
Sbjct: 40  AQDVDLKYILALNPNKLLAPYLIDAGLPEKAPRYGNWES--SGLDGHIGGHYLSALAMMY 97

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEALKPVW 226
           AST NA  K+++  +V  L++CQ K G GY+   P            ++  S   L   W
Sbjct: 98  ASTGNAETKKRLDYMVDQLAQCQAKNGNGYVGGIPQGKVFWERIHKGDIDGSSFGLNNTW 157

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            P Y IHK+ AGL D Y  A N QA ++   + ++F     ++I   S E+    L  E 
Sbjct: 158 VPLYNIHKLFAGLRDAYEYAGNQQAKQVLIGLGDWFV----ELIKPLSDEQIQQVLRTEH 213

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GG+N+    LY +T D K+L  A        L  L  + D L+  HANT IP VIG +  
Sbjct: 214 GGINETFADLYILTKDQKYLETAQRISHRAILDPLIDKQDKLTGLHANTQIPKVIGFEKI 273

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
             +TG   +     +F   V+ + S A GG S RE +      +  L S +  ETC ++N
Sbjct: 274 ATLTGKSDWSDAAQYFWQNVSQTRSVAFGGNSVREHFNPTTDFSQLLRSNQGPETCNSFN 333

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH-- 463
           ML++S+ LF    +++Y D+YER + N +LS Q   E G  +Y  P+       R  H  
Sbjct: 334 MLRLSKALFLDKNDVSYLDFYERTMYNHILSSQH-PEKGGFVYFTPI-------RPNHYR 385

Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
            +     S WCC G+GIE+ +K G+ IY     +   L++  +I S+ +W    + L Q+
Sbjct: 386 VYSQPETSMWCCVGSGIENHTKYGELIYSHSAND---LFVNLFIPSTVNWADKKLKLTQQ 442

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNF 582
                 +     + +  S  QE+    SLN+R P W  +   +  +NG+  P+   P ++
Sbjct: 443 TQ--FPYQNQSELIIETSRPQEL----SLNIRYPKW--AENLEVLVNGKAQPVTGKPASY 494

Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           ++   +W   DK+T++   + R E +    P+ ++  A + GP +LA  TS
Sbjct: 495 VAVNRKWKSGDKVTVRFKTTTRLEQL----PDGSNWAAFVNGPIVLAAKTS 541


>gi|300777572|ref|ZP_07087430.1| acetyl-CoA carboxylase [Chryseobacterium gleum ATCC 35910]
 gi|300503082|gb|EFK34222.1| acetyl-CoA carboxylase [Chryseobacterium gleum ATCC 35910]
          Length = 791

 Score =  246 bits (628), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 172/538 (31%), Positives = 259/538 (48%), Gaps = 42/538 (7%)

Query: 113 SVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSA 172
           SV  +A + + +YL+ L+ D L+  + K A L      Y  WEN  + L GH  GHY+SA
Sbjct: 37  SVFSKAMKADHKYLMALEPDRLLAPYLKEAGLKPKANNYPNWEN--TGLDGHIGGHYISA 94

Query: 173 SAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEA 221
            + M+AST +  I+E+++ ++  L  CQ     GY+S  P             +  S   
Sbjct: 95  LSLMYASTGDKAIQERINYMISELERCQKASPDGYISGIPNGKKIWKEIKQGNIRASGFG 154

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
           L   W P Y IHK+ +GL D Y  A N +A  M   + ++  N V  +    S E+    
Sbjct: 155 LNDRWVPLYNIHKLYSGLRDAYWYAKNEKAKAMLIKLTDWMANEVSNL----SDEQIQDM 210

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
           L  E GG+N+V   +Y ITHD K+L LAH F     L  L    D L+  HANT IP VI
Sbjct: 211 LRSEHGGLNEVFADVYEITHDQKYLKLAHRFSHQAILSPLLTGEDKLTGLHANTQIPKVI 270

Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEET 400
           G +   ++  +  +     FF   V    S   GG S  E +      +  + S E  ET
Sbjct: 271 GYKRIADLENNTSWSNAADFFWHNVTEKRSSVIGGNSVSEHFNPVNDFSSMIKSIEGPET 330

Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
           C TYNMLK+++ L+    E  Y DYYE+AL N +LS +   + G  +Y  P+  G  +  
Sbjct: 331 CNTYNMLKLTKELYATLPESYYIDYYEKALYNHILSTE-NHDHGGFVYFTPMRPGHYRVY 389

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
           S         SFWCC G+GIE+ +K G+ IY   + +   LY+  +I S+  WK  +VVL
Sbjct: 390 SQPQ-----TSFWCCVGSGIENHAKYGEMIYARSDKD---LYVNLFIPSTLTWKQQNVVL 441

Query: 521 NQKVDPIVSWDPYLRMTLTFSS--KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
            Q    + ++      TL F +  K E      L LR P WT  +  +  +NG+   +  
Sbjct: 442 RQ----VNNFPEAPETTLIFDAAGKSEF----DLKLRCPEWTTPSEVKILVNGKQERVQR 493

Query: 579 PGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
             + + + T++W   D + + LP+ L  E +    P++++  A  +GP +LA     E
Sbjct: 494 GSDGYFTLTKKWKKGDVVKMTLPMQLSAEQL----PDHSNYYAFKYGPVVLAAKYGTE 547


>gi|357046482|ref|ZP_09108109.1| hypothetical protein HMPREF9441_02134 [Paraprevotella clara YIT
           11840]
 gi|355530721|gb|EHH00127.1| hypothetical protein HMPREF9441_02134 [Paraprevotella clara YIT
           11840]
          Length = 762

 Score =  245 bits (625), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 171/538 (31%), Positives = 264/538 (49%), Gaps = 40/538 (7%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L+DV L QS     A+  ++ YLL LD D L+  + K A L      Y  WEN  + L G
Sbjct: 8   LNDVRLTQSP-FKHAEDLDVRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWEN--TGLDG 64

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT---------- 213
           H  GHY+SA + M+A+T +  IK+++  ++  L   Q+  G GYL   P           
Sbjct: 65  HIGGHYVSALSYMYAATGDEEIKQRLDYMLSELKRAQDAAGDGYLCGAPNGRKIWEAVSK 124

Query: 214 -ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITM 272
            ++  S   L   W P Y IHK  AGL D Y+LA + +A  M   + ++  N  + +   
Sbjct: 125 GDIQASSFGLNGGWVPLYNIHKTYAGLRDAYLLAGSKEARDMLVKLTDWMMNLTKDL--- 181

Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
            S E+    L  E GG+N+V   +  +T    +L LA  F     L  L    D L+  H
Sbjct: 182 -SDEQIQDMLRSEHGGLNEVFADVADLTGKDNYLQLARRFSHREILDPLLEHEDRLTGKH 240

Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
           ANT IP VIG +   ++ GD  +     FF + V    S + GG S RE +   +  +  
Sbjct: 241 ANTQIPKVIGYKRIADLQGDEGWDDAARFFWETVVERRSISIGGNSVREHFHPSEDFSSM 300

Query: 393 LGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
           L SE   ETC TYNML++++ L++ + ++ Y DYYERAL N +LS     + G  +Y  P
Sbjct: 301 LTSEQGPETCNTYNMLRLTKMLYQTSADVHYMDYYERALYNHILSTIDPVQGG-FVYFTP 359

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           +  G  +  S         SFWCC G+G+E+ +K G+ IY   E     LY+  +I S  
Sbjct: 360 MRSGHYRVYS-----QPQTSFWCCVGSGMENHAKYGEMIYGHSEDE---LYVNLFIPSVL 411

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
            W  G V + Q     ++  PY   T    S  +  +  ++  R+P WT  +  + ++NG
Sbjct: 412 QW--GKVRVEQ-----LTGFPYEEATTLHLSCGKAKEF-TVKFRVPEWTDVSQMELTVNG 463

Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
              P+   G +++ + +W+  D++ + LP+SLR  A+ D    Y    + ++GP +LA
Sbjct: 464 TAQPVSVSGGYVTVSRKWADGDEVRLTLPMSLRVAALPDGSDNY----SFMYGPIVLA 517


>gi|94494954|ref|ZP_01301535.1| hypothetical protein SKA58_00635 [Sphingomonas sp. SKA58]
 gi|94425220|gb|EAT10240.1| hypothetical protein SKA58_00635 [Sphingomonas sp. SKA58]
          Length = 665

 Score =  245 bits (625), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 181/549 (32%), Positives = 262/549 (47%), Gaps = 51/549 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           LK   + DV LD    L  AQ+    YLL L  D ++ +FR  A L      YGGWE+  
Sbjct: 64  LKPFDMADVTLDDGPFL-HAQRMTETYLLRLQPDRMLHNFRINAGLKPKAPVYGGWESEP 122

Query: 159 S----ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP-- 212
           +       GH +GHYLSA A  + ST +   K+++  +   L+ CQ    +G + AFP  
Sbjct: 123 TWAEINCHGHTLGHYLSACALAYRSTRDRRFKQRLDYIASELAACQKAAHSGLICAFPDG 182

Query: 213 TELFDSFEALKPVWA-PYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
             L  +    +P+   P+YT+HKI AGL D  +LAD+ +A    L++A W V        
Sbjct: 183 PALVAAHINGEPITGVPWYTLHKIYAGLRDAALLADSREAREVLLRLADWGV-------- 234

Query: 268 KVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
            V T    +  + + L  E GGMN++   LY++T   ++  LA  F     +  L    D
Sbjct: 235 -VATRPLSDAQFEAMLATEHGGMNEIYADLYAMTGKEEYRTLARRFSHKAVMEPLVAGKD 293

Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
            L   HANT +P ++G Q  YE TGD  Y     FF   V  + S+ATGG    E ++  
Sbjct: 294 LLDGMHANTQVPKIVGFQRVYEETGDDRYAKAADFFFRTVAHTRSFATGGHGDNEHFF-- 351

Query: 387 KRLAD----TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
             +AD       ++  ETC  +NMLK++R LF    +  YADYYER L NG+L+ Q   +
Sbjct: 352 -AMADFESHVFSAKGSETCCQHNMLKLARLLFMQDPQADYADYYERTLYNGILASQ-DPD 409

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
            G+  Y      G  K      + T  +SFWCC GTG+E+  K  DSIYF ++ +   LY
Sbjct: 410 SGMATYFQGARPGYMKL-----YHTPEDSFWCCTGTGMENHVKYRDSIYFHDDRS---LY 461

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
           +  ++ S+  W      L Q         P   +  T  +  E+    +L+LR P W  S
Sbjct: 462 VSLFLPSAVQWADKGARLEQATS--FPDTPSTSLKWTLRTPVEI----ALHLRHPRW--S 513

Query: 563 NGAQASLNGQN-LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
             A   +NG+  L    PG FL  T  W   D++ + L +    E+     P   +I A 
Sbjct: 514 PTATVRVNGREVLRSTAPGRFLEVTRLWRDGDRVELTLDMMPGVESA----PAAPNIVAF 569

Query: 622 LFGPYLLAG 630
            +GP +LAG
Sbjct: 570 TYGPLVLAG 578


>gi|332882274|ref|ZP_08449902.1| hypothetical protein HMPREF9074_05700 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|332679658|gb|EGJ52627.1| hypothetical protein HMPREF9074_05700 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
          Length = 786

 Score =  244 bits (624), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 170/538 (31%), Positives = 264/538 (49%), Gaps = 40/538 (7%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L+DV L QS     A+  ++ YLL LD D L+  + K A L      Y  WEN  + L G
Sbjct: 32  LNDVRLTQSP-FKHAEDLDVRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWEN--TGLDG 88

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT---------- 213
           H  GHY+SA + M+A+T +  IK+++  ++  L   Q+  G GYL   P           
Sbjct: 89  HIGGHYVSALSYMYAATGDEEIKQRLDYMLSELKRAQDAAGDGYLCGAPNGRKIWEAVSK 148

Query: 214 -ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITM 272
            ++  S   L   W P Y IHK  AGL D Y+LA + +A  M   + ++  N  + +   
Sbjct: 149 GDIQASSFGLNGGWVPLYNIHKTYAGLRDAYLLAGSKEARDMLVKLTDWMMNLTKDL--- 205

Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
            S E+    L  E GG+N+V   +  +T    +L LA  F     L  L    D L+  H
Sbjct: 206 -SDEQIQDMLRSEHGGLNEVFADVADLTGKDNYLQLARRFSHREILDPLLEHEDRLTGKH 264

Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
           ANT IP VIG +   ++ GD  +     FF + V    S + GG S RE +   +  +  
Sbjct: 265 ANTQIPKVIGYKRIADLQGDEGWDDAARFFWETVVERRSISIGGNSVREHFHPSEDFSSM 324

Query: 393 LGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
           L SE   ETC TYNML++++ L++ + ++ Y DYYERAL N +LS     + G  +Y  P
Sbjct: 325 LTSEQGPETCNTYNMLRLTKMLYQTSADVHYMDYYERALYNHILSTIDPVQGG-FVYFTP 383

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           +  G  +      +     SFWCC G+G+E+ +K G+ IY   E     LY+  +I S  
Sbjct: 384 MRSGHYRV-----YSQPQTSFWCCVGSGMENHAKYGEMIYGHSEDE---LYVNLFIPSVL 435

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
            W  G V + Q     ++  PY   T    S  +  +  ++  R+P WT  +  + ++NG
Sbjct: 436 QW--GKVRVEQ-----LTGFPYEEATTLHLSCGKAKEF-TVKFRVPEWTDVSQMELTVNG 487

Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
              P+   G +++ + +W+  D++ + LP+SLR  A+ D    Y    + ++GP +LA
Sbjct: 488 TAQPVSVSGGYVTVSRKWADGDEVRLTLPMSLRVAALPDGSDNY----SFMYGPIVLA 541


>gi|392554933|ref|ZP_10302070.1| Acetyl-CoA carboxylase, biotin carboxylase [Pseudoalteromonas
           undina NCIMB 2128]
          Length = 816

 Score =  244 bits (623), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 179/545 (32%), Positives = 267/545 (48%), Gaps = 46/545 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L++VSL       +S    AQQTN+ YLL L  D L+  + + A +     +YG WE+  
Sbjct: 51  LEQVSL------SASPFLHAQQTNVRYLLALHPDQLLAPYLREAGIEPKASSYGNWED-- 102

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
           S L GH  GHYLSA +  WA+T +  +K ++  ++  L   Q ++  GYL   P      
Sbjct: 103 SGLDGHIGGHYLSALSLAWAATGDEELKRRLDYMLNELQRAQ-QVNDGYLGGIPNGQAMW 161

Query: 214 -ELFD-----SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
            ++ D        +L   W P Y I KI  GL D Y++A + QA  M   + E+F N   
Sbjct: 162 QQIHDGNIKADLFSLNDRWVPLYNIDKIFHGLRDAYLIAGSEQAKTMLFGLGEWFLNLTS 221

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
           K+    S E+    L  E GG+N V   + +I +D ++L LA  F     +  L  + D 
Sbjct: 222 KL----SDEQIQQMLYSEYGGLNAVFADMATIGNDKRYLKLARQFTHHSIVDPLLKKQDK 277

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L+  HANT IP +IG     E + D  ++    +F   V    S A GG S RE + D K
Sbjct: 278 LTGLHANTQIPKIIGMLKVAETSDDEAWQQGADYFWQTVTKERSVAIGGNSVREHFHDKK 337

Query: 388 RLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
                +   E  ETC TYNM+K+S+ LF  T +  Y +YYERA  N +LS Q   E G +
Sbjct: 338 DFTAMVEDVEGPETCNTYNMMKLSKLLFLKTADTRYLEYYERATYNHILSSQH-PEHGGL 396

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
           +Y  P+  G  +  S+       +S WCC G+GIE+ SK G+ IY + + N   L++  +
Sbjct: 397 VYFTPMRPGHYRMYSSVQ-----DSMWCCVGSGIENHSKYGELIYSKNDDN---LWVNLF 448

Query: 507 ISSSFDW-KSGHVVLNQKVDPIVSWDPYLRMTLTFSS-KQEVGQLSSLNLRMPVWTYSNG 564
           ISS+ DW + G  V  Q   P  +      +TL F++  ++    + L++R P W  +  
Sbjct: 449 ISSTLDWQQQGLKVTQQSHFPDAN-----NVTLVFNTLDKKDNSPAQLHIRKPSWI-TGD 502

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
            Q  LNG+ +       + +    W   DKLT  L   L TE + D +  Y    A+L+G
Sbjct: 503 LQFKLNGKPINATAEQGYYAIKHDWHDGDKLTFTLAPKLYTEQLPDGQDYY----AVLYG 558

Query: 625 PYLLA 629
           P ++A
Sbjct: 559 PVVMA 563


>gi|146301615|ref|YP_001196206.1| hypothetical protein Fjoh_3876 [Flavobacterium johnsoniae UW101]
 gi|146156033|gb|ABQ06887.1| protein of unknown function DUF1680 [Flavobacterium johnsoniae
           UW101]
          Length = 765

 Score =  244 bits (623), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 195/645 (30%), Positives = 292/645 (45%), Gaps = 89/645 (13%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L+EV L D          +AQ  +L+Y+L L+ D L+  +   A LP     YG WE+  
Sbjct: 32  LQEVRLED------GPFKKAQDVDLKYILALNPDKLLAPYLIDAGLPVKSTRYGNWES-- 83

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
             L GH  GHYLSA + M+AST N  +K ++  ++  L+ CQ+K G GY+   P      
Sbjct: 84  LGLDGHIAGHYLSALSMMYASTGNPELKNRLDYMISELARCQDKNGNGYVGGIPQGKVFW 143

Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
                 ++  S   L   W P Y IHK+ AGL D Y    N QA    +K+  W +E   
Sbjct: 144 DRIHKGDIDGSSFGLNNTWVPIYNIHKLFAGLNDAYQYTGNQQAKEVLIKLGDWFIEMIK 203

Query: 264 ----NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
               +++QK+            L  E GG+N+    LY IT D K+L  A    +  FL 
Sbjct: 204 PLSDDQIQKI------------LKTEHGGINESFADLYLITKDKKYLETAQKISQKSFLE 251

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            L  + D L+  HANT IP VIG +    ++ D  +    TFF D V    S A GG S 
Sbjct: 252 SLIKKEDKLTGLHANTQIPKVIGFEKIASISADKEWSEAVTFFWDNVTQKRSVAFGGNSV 311

Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
            E +      +  L S E  ETC +YNM ++S+ LF   +E+ Y D+YER L N +LS Q
Sbjct: 312 SEHFNPVNDFSGMLKSNEGPETCNSYNMERLSKALFLEKQEMNYLDFYERTLYNHILSSQ 371

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIY--FEE 494
              E G  +Y  P+       R  H   +     S WCC G+G+E+ +K G+ IY  F+E
Sbjct: 372 H-PEKGGFVYFTPI-------RPNHYRVYSQPETSMWCCVGSGLENHTKYGELIYSHFDE 423

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT-LTFSSKQEVGQLSSLN 553
                 +++  +I+S+ +W    +V+ Q+        PY   T +  + K+   +   LN
Sbjct: 424 -----AVFVNLFIASTLNWNEKGIVIEQRTKF-----PYENSTEIVLNLKK--AKTFDLN 471

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
           +R P W  +     +   Q   L P G ++S   +W   D + I+       E +    P
Sbjct: 472 IRRPKWAENFRVFINDKEQKTELKPSG-YISLKRKWKSKDHVRIEFETKTHLEQL----P 526

Query: 614 EYASIQAILFGPYLLAGHTSGEW-------DIKTGTARSLSALISPIPPSF-----NAQL 661
           + ++  A + GP +LA  TS E        D + G   S   +  P+  ++      A  
Sbjct: 527 DGSNWSAFVNGPIVLAAKTSKEALDGLFADDSRMGHVASGKYM--PMDKAYALVGEKASY 584

Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKD 706
           V+  +E GN  F +     S+ +E F     DA     F+   KD
Sbjct: 585 VSRLKELGNMRFALD----SLELEPF-FELHDARYQMYFQTFTKD 624


>gi|291544618|emb|CBL17727.1| Uncharacterized protein conserved in bacteria [Ruminococcus
           champanellensis 18P13]
          Length = 597

 Score =  244 bits (622), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 173/552 (31%), Positives = 278/552 (50%), Gaps = 45/552 (8%)

Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTP---GKAYGGWENPISELRGHFVGHYLSASAQMWA 178
           N  YL+ L  ++L+ +F   A + T     + + GWE+P  +LRGHF+GH+LSA+A + A
Sbjct: 24  NRAYLMELKSENLLQNFLLEAGVRTDRDVTEMHLGWESPTCQLRGHFLGHWLSAAALLIA 83

Query: 179 STHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAG 238
              +  +K K+ T++ +L+ CQ   G  ++ + P + F+  +  + +W+P YT+HK L G
Sbjct: 84  QNQDRELKAKLDTIIDALARCQELNGGRWIGSIPEKYFEKLKKNEYIWSPQYTLHKTLLG 143

Query: 239 LLDQYVLADNAQALKM----ATWMVEYFYNRVQK-VITMYSVERHWYSLNEETGGMNDVL 293
           L    + A N  AL++    A W +E+    +QK    +YS E          GGM +V 
Sbjct: 144 LYHSALYAKNQVALEILGRAADWYLEWTEKMMQKNPHAVYSGEE---------GGMLEVW 194

Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
             LY +T D ++L LA  +  P   G LA   D LS+ HAN  IP   G+   YE+TGD 
Sbjct: 195 AGLYQLTEDERYLTLAQRYAHPSIFGRLADGEDPLSNCHANASIPWAHGAAKMYEITGDA 254

Query: 354 LY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
            + +L+  F+   V+   ++ TGG ++ EFW  P++L   LG   +E CT YNM++++ +
Sbjct: 255 AWLELVKRFWQCAVSDRDAFCTGGQNSGEFWIPPRKLGMFLGERTQEFCTVYNMVRLADY 314

Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
           LF +T    Y DY E  L NG L+ Q+    G+  Y LP+     KA S   WG+K   F
Sbjct: 315 LFCFTGAHEYLDYIENNLYNGFLA-QQNKYTGMPAYFLPM-----KAGSVKKWGSKTKDF 368

Query: 473 WCCYGTGIESFSKLGDSIYF-EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI---- 527
           WCC+GT +++ +      ++ ++E N   L + QYI+S   + + HV + Q VD      
Sbjct: 369 WCCHGTTVQAHTIYPQLCWYADKEQN--RLILAQYINSVCKF-NAHVTITQSVDMKYYND 425

Query: 528 -VSWDP-----YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
             S+D        R  +    K E  +  +L+LR+P W  +      +NGQ+  +     
Sbjct: 426 GASFDERDDSRMFRWYIKLHVKAEQPERFTLSLRIPAWV-AGELVILVNGQHAEVESVNG 484

Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG 641
           F      W  +D + +  P +L T ++    P+   + A   GP +LAG    +  I   
Sbjct: 485 FAELDRVWE-DDTVNLYFPAALTTCSL----PDMPQLLAFREGPIVLAGLCESDRGIYLA 539

Query: 642 TARSLSALISPI 653
                SAL +P+
Sbjct: 540 QNDPTSAL-TPV 550


>gi|408357351|ref|YP_006845882.1| hypothetical protein AXY_19880 [Amphibacillus xylanus NBRC 15112]
 gi|407728122|dbj|BAM48120.1| hypothetical protein AXY_19880 [Amphibacillus xylanus NBRC 15112]
          Length = 622

 Score =  243 bits (621), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 176/581 (30%), Positives = 282/581 (48%), Gaps = 72/581 (12%)

Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL----PTPGKAYGGWE 155
           K V++HD        L R +  N  YL+ L  D+L++++R  A        P  A+GGWE
Sbjct: 7   KNVTVHD------GDLKRREAANKSYLMSLTNDNLLFNYRVEAGRFHGREIPKDAHGGWE 60

Query: 156 NPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL 215
            P+ ++RGHF+GH+LSA+A  +  + +  +K K   +V  L+ECQ   G  ++   P + 
Sbjct: 61  TPVCQIRGHFLGHWLSAAALHYHQSGDLELKVKADLIVSELAECQKDNGGQWVGPIPEKY 120

Query: 216 FDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSV 275
                  K +WAP Y +HK+  GL+D Y    N QAL +A    ++F     K    ++ 
Sbjct: 121 LHWIAEGKNIWAPQYNLHKLFMGLIDMYSYTGNQQALDIADNFADWFVKWSGK----FTR 176

Query: 276 ERHWYSLNEETGGMNDVLYRLYSIT-HDPKHLLLAHLFDKPCFLGFLALQADYLSHFHAN 334
           E+    L+ ETGGM +V   L  IT HD    LL   + +  F   L  + D L++ HAN
Sbjct: 177 EQFDDILDVETGGMLEVWADLLEITGHDKYKFLLDRYYRQRLFQPLLEGK-DPLTNMHAN 235

Query: 335 THIPIVIGSQMRYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
           T IP V+G    YEVTGD  +  ++  ++   V    + ATGG ++ E W    ++   L
Sbjct: 236 TTIPEVLGCARAYEVTGDNRWLDIVKAYWNCAVTERGTLATGGNTSGEVWMPKMKIKARL 295

Query: 394 GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ-------RGTEP--- 443
           G +N+E CT YNM++++  LF+ TK+ AY  Y E  L NG+++          GT     
Sbjct: 296 GDKNQEHCTVYNMIRLADFLFQQTKDPAYGQYIEYNLYNGIMAQAYYQSYHVAGTGKNHP 355

Query: 444 --GVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
             G++ Y LP+  G+ K      W ++ NSF+CC+GT +++ + L   IY++++  +   
Sbjct: 356 WTGLLTYFLPMKAGLYKE-----WSSETNSFFCCHGTMVQANATLNRGIYYQDQDQI--- 407

Query: 502 YIIQYISSSFDWKSG--HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS-------- 551
           Y+ QY +S  +   G   V + Q  D +      L  + + + +Q + +++S        
Sbjct: 408 YVSQYFNSELETTIGSDRVRIKQSQDIMSG---SLLDSSSIAGQQRLSEITSIHENTPDF 464

Query: 552 ----------------LNLRMPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERWSYNDK 594
                           L LR+P W   + A   LNG+ +        F   T  WS  DK
Sbjct: 465 KKYDFTIQLDQKKTFTLGLRIPEWIMKD-ASIYLNGELIGKTNDSSAFYKLTREWSDGDK 523

Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           ++I  P+ +R   + DD     +  A  +GP +LAG T  E
Sbjct: 524 VSITFPIGIRFIQLPDD----LNTGAFRYGPDVLAGITEHE 560


>gi|305676227|ref|YP_003867899.1| hypothetical protein BSUW23_17775, partial [Bacillus subtilis
           subsp. spizizenii str. W23]
 gi|305414471|gb|ADM39590.1| hypothetical protein BSUW23_17775 [Bacillus subtilis subsp.
           spizizenii str. W23]
          Length = 497

 Score =  243 bits (621), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 166/508 (32%), Positives = 254/508 (50%), Gaps = 35/508 (6%)

Query: 114 VLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSAS 173
           + + +Q    EYLL LDVD L+    +  S       YGGWE    E+ GH +GH+LSA+
Sbjct: 10  MFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAGHSIGHWLSAA 67

Query: 174 AQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------LKP 224
           + M+ ++ +  +K K    V  LS  Q     GY+S F    FD   +         L  
Sbjct: 68  SAMYQASGDEKLKRKAEYAVNELSHIQQFDEEGYISGFSRACFDEVFSGDFRVDHFSLGG 127

Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
            W P+Y++HK+ AGL+D Y L  N  AL++   + ++     +K +   + E+    L  
Sbjct: 128 SWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRMLIC 183

Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
           E GGMN+ +  LY +T +  +L LA  F     L  LA   D L   HANT IP VIG+ 
Sbjct: 184 EHGGMNEAMADLYILTKNKSYLDLAERFCHRAILQPLAEGKDELEGKHANTQIPKVIGAA 243

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
             Y++TG+  Y+    FF + V    SYA GG S  E +      ++ LG    ETC TY
Sbjct: 244 KLYDITGNEAYRNPALFFWEQVVYQRSYAIGGNSIGEHF--GAEGSEELGVTTAETCNTY 301

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           NMLK++ HLFRW  E  + DYYE AL N +LS Q   E G+  Y +    G  K      
Sbjct: 302 NMLKLTGHLFRWFHEARFTDYYENALYNHILSSQ-DPESGMKTYFVSTQPGHFKV----- 355

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
           + +  +SFWCC GTG+E+ ++   +IY  ++ +   LY+  +I S  + +   +++ Q+ 
Sbjct: 356 YCSPEDSFWCCTGTGMENPARYTQNIYHLDQDD---LYVNLFIPSQINVREKQMIITQE- 411

Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA-QASLNGQNLPLPPPGNFL 583
               S+    +  L    K+  G   +L +R+P WT  NG+ +A +NG+ +       +L
Sbjct: 412 ---TSFPAANKTKLVV--KKADGVPMTLQIRIPYWT--NGSLKAVVNGKRVQSVEKNGYL 464

Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDD 611
           +  + W+  D + I LP+ L     +DD
Sbjct: 465 AIHKHWNTGDCIEIDLPMKLHIYQAKDD 492


>gi|317057297|ref|YP_004105764.1| hypothetical protein Rumal_2655 [Ruminococcus albus 7]
 gi|315449566|gb|ADU23130.1| protein of unknown function DUF1680 [Ruminococcus albus 7]
          Length = 602

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 179/572 (31%), Positives = 289/572 (50%), Gaps = 56/572 (9%)

Query: 139 RKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSE 198
           R+  S P   + + GWE+P  +LRGHF+GH++SA+A + AS  +A ++ K+  +V  L  
Sbjct: 51  RQVISEPEKAELHWGWESPACQLRGHFLGHWMSAAAMLSASDGDAELRAKLVKIVDELER 110

Query: 199 CQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KM 254
           CQ + G  ++ + P + F   E+ + +W+P YT+HK L GL+D Y  A   +AL    ++
Sbjct: 111 CQQRNGGKWVGSIPEKYFKLMESEEYIWSPQYTMHKTLMGLVDAYRFAGIQKALDIADRL 170

Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
           A W +E+  + V+K    ++V         E GGM +    LY +T+DPK+  L  ++ +
Sbjct: 171 ADWYIEWAAS-VEKTAP-FTV------FKGEQGGMLEEWCILYELTNDPKYRKLMDIYRE 222

Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLI-GTFFMDIVNASHSYA 373
                 L    + L+  HAN  IP+  G+   Y++TG+  +K+I   F+   V     +A
Sbjct: 223 NGLYHKLEQHREALTDDHANASIPLSHGAARMYDITGEERWKIITDEFWRQAVTERGMFA 282

Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
           T G ++ EFW  P  +   LG  ++E CT YNM++++  L+R T +  YADY ERAL NG
Sbjct: 283 TTGANSGEFWVPPHSMGSYLGDTDQEFCTVYNMVRLADFLYRRTGDTVYADYIERALYNG 342

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
            L+ Q+    G+  Y LPL  G  K      WG+K + FWCC+GT +++ +     I++ 
Sbjct: 343 FLA-QQNMHSGMPAYFLPLSSGSRKK-----WGSKRHDFWCCHGTMVQAQTLYPQLIWYT 396

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLN-------QKVDPIVSWD-----PYLRMTLTFS 541
           E+     L + QYI S  +   G   +        + ++  V +D        R ++ F 
Sbjct: 397 EDST---LTVAQYIPSEAELDIGGKKIKVSQCTELKNLNNQVFFDEDEGGEKSRWSIRFD 453

Query: 542 SKQEVGQLSSLNLRMPVWTYSNG-AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
            K +     +L LRMP W   NG  Q  ++G ++      N+L+ +  W +ND + + L 
Sbjct: 454 IKCDEPTFFTLWLRMPKWL--NGRPQLIIDGGSVQADIADNYLTISRTW-HNDTIQLLLI 510

Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQ 660
            +L TE +  D PE A   A+L GP +LAG T    D   G     SA     P SF  +
Sbjct: 511 PTLYTEPLA-DMPETA---ALLDGPIVLAGMT----DKDAGITGDFSA-----PESFLHR 557

Query: 661 LVTFTQES---GNSTFVMSNSNQSITMEEFPV 689
             T   ++     +T+V    NQ + +E  P+
Sbjct: 558 RTTHEYKTYVWKQNTYV--TRNQPVNIEFKPL 587


>gi|330996333|ref|ZP_08320217.1| hypothetical protein HMPREF9442_01300 [Paraprevotella xylaniphila
           YIT 11841]
 gi|329573383|gb|EGG54994.1| hypothetical protein HMPREF9442_01300 [Paraprevotella xylaniphila
           YIT 11841]
          Length = 811

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 169/538 (31%), Positives = 263/538 (48%), Gaps = 40/538 (7%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L+DV L Q      A+  ++ YLL LD D L+  + K A L      Y  WEN  + L G
Sbjct: 57  LNDVRLTQGP-FKHAEDLDIRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWEN--TGLDG 113

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
           H  GHY+SA A M+A+T N  IK+++  ++      Q+  G GYL   P   +++D+   
Sbjct: 114 HIGGHYVSALAYMYAATGNEEIKQRLDYMLSEWKRAQDAAGDGYLCGAPNGRKIWDAVSK 173

Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITM 272
                    L   W P Y IHK  AGL D YV+A  AQA  M   + ++  N  + +   
Sbjct: 174 GDIQASSFGLNGGWVPLYNIHKTYAGLRDAYVVAGCAQAKDMLVKLTDWMMNLTKDL--- 230

Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
            S E+    L  E GG+N+V   +  +T    ++ LA  F     L  L  Q D L+  H
Sbjct: 231 -SDEQIQDMLRSEHGGLNEVFADVADLTGKDGYMQLARRFSHREILDPLLKQEDQLTGKH 289

Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
           ANT IP VIG +   ++ GD  +     FF   V    S + GG S RE +   +  +  
Sbjct: 290 ANTQIPKVIGYKRIADLEGDESWDDAARFFWKTVVDQRSISIGGNSVREHFHPSEDFSSM 349

Query: 393 LGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
           L SE   ETC TYNML++++ L++ + +  Y DYYERAL N +LS     + G  +Y  P
Sbjct: 350 LTSEQGPETCNTYNMLRLTKMLYQTSADAHYMDYYERALYNHILSTIDPVQGG-FVYFTP 408

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           +  G  +  S         SFWCC G+G+E+ +K G+ IY     +   LY+  +I S  
Sbjct: 409 MRSGHYRVYS-----QPQTSFWCCVGSGMENHAKYGEMIYAHGGDD---LYVNLFIPSVL 460

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
            W  G V + Q+          LR++ + +      +  ++  R+P WT ++  + ++NG
Sbjct: 461 QW--GKVRVEQRTSFPYEEATTLRLSCSKA------KTFTVKFRVPEWTDASRMELTVNG 512

Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
              P+   G +++ + +W+  D++ + LP+SLR   + D    Y    + ++GP +LA
Sbjct: 513 TAQPVSVSGGYVAVSRKWTDGDEVRLTLPMSLRAVVLPDGSDNY----SFMYGPVVLA 566


>gi|406027774|ref|YP_006726606.1| hypothetical protein LBUCD034_2040 [Lactobacillus buchneri CD034]
 gi|405126263|gb|AFS01024.1| hypothetical protein LBUCD034_2040 [Lactobacillus buchneri CD034]
          Length = 803

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 185/564 (32%), Positives = 263/564 (46%), Gaps = 77/564 (13%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPIS-ELRGHFVGHYLSASA 174
           RAQQ  ++YLL LD    + +F + A + + G   Y GWE       RGHF GHYLSA +
Sbjct: 19  RAQQMTVKYLLALDPKRFLVTFDQVAGIDSGGVTGYQGWERTDGLNFRGHFFGHYLSALS 78

Query: 175 QMWASTHNATIKE----KMSTVVFSLSECQNKIG------TGYLSAFPTELFDSFEALK- 223
           Q   +T +  I++    K+   V  L   Q           GY+SAF     D  E  + 
Sbjct: 79  QAILATEDNAIRQQLLDKLRLGVNGLQSAQAAYAKKHPESAGYVSAFREVALDEVEGREV 138

Query: 224 ------PVWAPYYTIHKILAGLLDQYVLADN------AQALKMATWMVEYFYNRVQKVIT 271
                  V  P+Y +HK+LAGLL   V   N       +ALK A     Y + R+ ++  
Sbjct: 139 PKDEKENVLVPWYNLHKVLAGLLAVNVNLQNIDPLLSEKALKSAHQFGLYVFKRINQLAD 198

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
              +      L  E GGMND LY L+ +T D + L  A  FD+      LA   D L+  
Sbjct: 199 PTQM------LKIEYGGMNDALYELFDLTDDKRMLTAATYFDETTLFKQLAKGDDVLAGK 252

Query: 332 HANTHIPIVIGSQMRYEVTGD----------------PLYKLIGTFFMDIVNASHSYATG 375
           HANT IP +IG+  RYE   D                 +Y      F  IV   H+Y TG
Sbjct: 253 HANTTIPKLIGALHRYESLHDVKRADQYLSPEEKGSLNMYLKAAVNFWQIVIDDHTYVTG 312

Query: 376 GTSAREFWWDPKRL-ADTL---GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
           G S  E + +P +L  D +   G+   ETC TYNMLK+SR LFR T +  Y DYYE+  T
Sbjct: 313 GNSQSEHFHEPGQLFHDAVLEDGATTCETCNTYNMLKLSRELFRVTGDKKYLDYYEQTYT 372

Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY 491
           N +L  Q     G+M Y  P+  G +K      +   F+ FWCC GTGIESF+KLGDS Y
Sbjct: 373 NAILGSQ-NPNTGMMTYFQPMAAGYTKV-----YNRPFDEFWCCTGTGIESFTKLGDSYY 426

Query: 492 FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
           F        LY+  Y S+     S ++ + ++VD        + +T+     Q+     +
Sbjct: 427 FRSGDQ---LYLSLYFSNVLRLDSRNLQMTEQVDRKAG---KVHLTVVKIRSQDSAGTIN 480

Query: 552 LNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK---LTIQLPLSLRTEAI 608
           L LR P W   + A+ +++G +  +    +F      W  ++     T+ L + +  E +
Sbjct: 481 LKLRNPAWLVQS-AKLAVDGISQQMDQNADF------WEIDNAGPGTTVDLEMPMSLEMV 533

Query: 609 Q-DDRPEYASIQAILFGPYLLAGH 631
           Q  D P Y + +   +GPY+LAG 
Sbjct: 534 QTKDNPHYLAFK---YGPYVLAGQ 554


>gi|302897238|ref|XP_003047498.1| hypothetical protein NECHADRAFT_97856 [Nectria haematococca mpVI
           77-13-4]
 gi|256728428|gb|EEU41785.1| hypothetical protein NECHADRAFT_97856 [Nectria haematococca mpVI
           77-13-4]
          Length = 626

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 181/550 (32%), Positives = 261/550 (47%), Gaps = 42/550 (7%)

Query: 99  LKEVSLHDV-WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWEN 156
           L EV+L D  W+D        Q   L YLL +D D L++ FR    L T G +  GGW+ 
Sbjct: 42  LSEVTLTDSRWMDN-------QNRTLTYLLSVDPDRLLYVFRANHGLDTKGAQKNGGWDA 94

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAF 211
           P    R H  GH+L+A +Q +A+  N     + +     L +CQ          GYLS F
Sbjct: 95  PDFPFRSHIQGHFLTAWSQCYATLRNEECGSRATYFAKELGKCQANNEKANFTEGYLSGF 154

Query: 212 PTELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
           P     + E   L     PYY IHK LAGLLD + L  +  A  +   +  +   R +K+
Sbjct: 155 PESEITAVEKRTLNNGNVPYYAIHKTLAGLLDVHRLVGDEDAKDVMLALAGWVDTRTKKL 214

Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
               + ++    +  E GGMN+VL  +     D K L +A  FD       L    D LS
Sbjct: 215 ----TYDQMQAMMQTEFGGMNEVLADIAYYIGDKKWLEVAQRFDHATIFDPLEKGQDKLS 270

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
             HANT +P  IG+   Y+V+G   Y  IG    D+    H+YA GG S  E +  P  +
Sbjct: 271 GLHANTQVPKWIGAIREYKVSGLQKYLDIGRNAWDLTVHKHTYAIGGNSQAEHFRAPDAI 330

Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQRGTE-PGVMI 447
           A+ L ++  E C TYNMLK++R L+     + ++ D+YE AL N +L  Q   +  G + 
Sbjct: 331 AEYLDNDTCEACNTYNMLKLTRELWVMDPSDASFFDFYENALMNHLLGQQNPEDHHGHIT 390

Query: 448 YMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
           Y  PL     RGV  A     W T ++SFWCC G+GIE+ +KL DSIYF ++     LY+
Sbjct: 391 YFTPLNPGGRRGVGPAWGGGTWSTDYDSFWCCQGSGIETNTKLMDSIYFHDD---ETLYV 447

Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
             +  S  DW    + + Q  D      P    T      Q      ++ +R+P WT  +
Sbjct: 448 NLFTPSQLDWSDRKISITQSTDF-----PERDTTTLKVGNQGENNEWTMAIRVPSWT--S 500

Query: 564 GAQASLNGQNLPLP--PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
            A   +NG+ +       G +     +WS  D +T+ LP+SLRT A      + A+  AI
Sbjct: 501 KASIKINGEAVEGVDIESGKYAIIKRKWSSGDAVTVTLPMSLRTIAAN----DDAATAAI 556

Query: 622 LFGPYLLAGH 631
            FGP +L+ +
Sbjct: 557 AFGPVILSAN 566


>gi|319786479|ref|YP_004145954.1| hypothetical protein Psesu_0871 [Pseudoxanthomonas suwonensis 11-1]
 gi|317464991|gb|ADV26723.1| protein of unknown function DUF1680 [Pseudoxanthomonas suwonensis
           11-1]
          Length = 806

 Score =  243 bits (619), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 184/562 (32%), Positives = 275/562 (48%), Gaps = 63/562 (11%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L++V L D    +SS L      NL YL  LD D L+  FR  A LP+P   Y  WE+  
Sbjct: 40  LEDVRLGDGAFARSSAL------NLRYLAALDPDRLLAPFRIEAGLPSPAPKYPNWES-- 91

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF-- 216
             L GH  GHYLSA AQ  A+  +A ++ ++  +V +LS+ Q   G GY+   P      
Sbjct: 92  MGLDGHTAGHYLSALAQQ-AAQGSAGMRRRLDYMVAALSQVQAANGDGYVGGVPNGRVLW 150

Query: 217 -----DSFEA----LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
                  F+A    L+  W P+Y +HK  AGL D ++LA NAQA    ++ A W      
Sbjct: 151 NRIASGDFQAESFSLEGAWVPFYNLHKTYAGLRDAWLLAGNAQARDVLVRFADWAGALVA 210

Query: 264 N----RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           N    ++Q+V            L+ E GGMN+VL  +Y+IT D ++L LA  F     L 
Sbjct: 211 NLDDTQLQRV------------LDTEHGGMNEVLADVYAITGDRRYLALARRFSHRAILD 258

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            L  + D L   HANT IP VIG     E+ GD  +     FF + V    S A GG S 
Sbjct: 259 PLLRREDRLDGLHANTQIPKVIGFARIGELDGDVEWIEAAQFFWERVALHRSIAFGGNST 318

Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
           RE +      +  + S E  ETC +YNML+++  L R   +  +AD+YERAL N +LS Q
Sbjct: 319 REHFNPADDFSGMIASREGPETCNSYNMLRLTLLLERLRPDPRHADFYERALFNHILSTQ 378

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
              + G ++Y  P+     + R    +      FWCC G+G+E+  + G   Y  +E + 
Sbjct: 379 H-PDHGGLVYFTPI-----RPRHYRVYSQPQECFWCCVGSGMENHGRHGAFAYTHDESS- 431

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             L +  Y+ S   W+   +VL Q+      +    R  L  ++ +   Q+ +L LR P 
Sbjct: 432 --LRVNLYLDSELHWRERGLVLRQR----TRFPEEPRSVLEVATPRP--QVFALELRHPH 483

Query: 559 WTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
           W  +   +  LNG+  P+   P ++     +W   D++ ++LP+S R E++    P+ + 
Sbjct: 484 W-LAGPLRVKLNGRRWPVESSPSSYARIERQWQDGDRIEVELPMSTRIESL----PDGSD 538

Query: 618 IQAILFGPYLLAGHTSGEWDIK 639
             A++ GP +LA   SGE DI+
Sbjct: 539 WVAVMHGPLMLAAR-SGEEDIE 559


>gi|254444174|ref|ZP_05057650.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
 gi|198258482|gb|EDY82790.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
          Length = 788

 Score =  243 bits (619), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 171/533 (32%), Positives = 255/533 (47%), Gaps = 49/533 (9%)

Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAS 179
           + ++ Y+L  D D L+  F   A L    + YG WE+  S L GH  GH+LSA A +   
Sbjct: 47  EADVTYVLAHDPDRLLAPFLTAAGLEPKAEKYGNWES--SGLDGHSAGHFLSAYATLSLQ 104

Query: 180 THNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELF------DSFEALKPVWA 227
           + N  ++E++  ++  L+ CQ+ IGTGYL   P      T LF      D F +L   W 
Sbjct: 105 SDNPLLRERLDYMLDELTRCQDAIGTGYLGGVPNSQEFTTRLFAGEIKADRF-SLNGAWV 163

Query: 228 PYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
           P+Y +HK  AGL D +++AD+ +A    + +A W V              + E+    L 
Sbjct: 164 PWYNLHKTYAGLKDAWLVADSEKAKNILIALADWTV--------AATAKLTDEQMQEMLY 215

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGS 343
            E GGMN++   LY  T D ++L LA+ F     L  L    D L+ FHANT IP VIG 
Sbjct: 216 TEHGGMNEIFADLYLHTQDQRYLELAYRFTHHELLDPLLENQDKLTGFHANTQIPKVIGY 275

Query: 344 QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCT 402
           Q       D        FF D V    S + GG S RE +         L S E  ETC 
Sbjct: 276 QRTALAAQDEKLHQASQFFWDTVVNHRSVSIGGNSVREHFHPADDFRSMLESREGPETCN 335

Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARST 462
           T+NML+++  LF      A  DYYERAL N +LS Q   E G ++Y  P      + R  
Sbjct: 336 THNMLRLTTLLFEAEPTAALTDYYERALYNHILSAQH-PETGGLVYFTP-----QRPRHY 389

Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
             +    N+FWCC G+GIE+  +  + IY   +     L++  +++SS +W+   + L Q
Sbjct: 390 RVYSVPENAFWCCVGSGIENPGRYSEFIYAHTDD---ALFVNLFLASSLNWQEKGLRLTQ 446

Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN- 581
             +          +T+  + K+++    +L +R P WT ++  Q +LN + +      N 
Sbjct: 447 STN--FPQTASTELTIDQAPKKKL----TLKIRRPAWT-TDAFQITLNDKPVKTKTNANG 499

Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
           + S T +W   D L++ LP+ +  E I D  P Y    + L+GP +LA  T  
Sbjct: 500 YASLTRKWKTGDTLSVALPMQVHVEQIPDHSPFY----SFLYGPIVLAAKTDA 548


>gi|332185536|ref|ZP_08387284.1| tat (twin-arginine translocation) pathway signal sequence domain
           protein [Sphingomonas sp. S17]
 gi|332014514|gb|EGI56571.1| tat (twin-arginine translocation) pathway signal sequence domain
           protein [Sphingomonas sp. S17]
          Length = 639

 Score =  242 bits (618), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 178/561 (31%), Positives = 262/561 (46%), Gaps = 57/561 (10%)

Query: 90  GGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK 149
           G   LP   ++   + DV LD    L  AQ+    YL+ L  D L+ +FR  A L     
Sbjct: 33  GATRLPATVVQPFDMADVTLDGGPFL-HAQRMTEAYLMRLQPDRLLANFRANAGLKPKAP 91

Query: 150 AYGGWENPIS----ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT 205
           AYGGWE+          GH +GHYLSA A  + +T +   ++++  +   L+ CQ   G+
Sbjct: 92  AYGGWESEPEWADINCHGHTLGHYLSACALAYRATKDKRYRQRIDYIANELAACQKASGS 151

Query: 206 GYLSAFPT--ELFDSFEALKPVWA-PYYTIHKILAGLLDQYVLADNAQA----LKMATWM 258
           G + AFP    L  +    +P+   P+YT+HK+ AGL D   LAD+  +     ++A W 
Sbjct: 152 GLVCAFPKGPALVAAHLRGEPITGVPWYTLHKVYAGLRDSVQLADSEPSRGVLFRLADWG 211

Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
           V              S E+    L  E GGMN++   LY +T +  +  +A  F +   +
Sbjct: 212 V--------VATKPLSDEQFEKMLETEYGGMNEIYADLYFMTGNEDYRRVAERFSQKAIM 263

Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
             LA   DYL   HANT IP +IG Q  +E TGD  Y     FF   V  + ++ATGG  
Sbjct: 264 NPLAQGRDYLDGMHANTQIPKIIGFQRVFEATGDDKYHNAAAFFWRTVAHTRAFATGGHG 323

Query: 379 AREFWWDPKRLAD----TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
             E ++    +AD       ++  ETC  +NMLK++R LF       YADYYER L NG+
Sbjct: 324 DAEHFF---AMADFDKHVFSAKGSETCCQHNMLKLTRALFLRDPRAEYADYYERTLYNGI 380

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           L+ Q   + G+  Y      G  K      + T  +SFWCC GTG+E+  K  DSIYF +
Sbjct: 381 LASQ-DPDSGMATYFQGARPGYMKL-----YHTPEDSFWCCTGTGMENHVKYRDSIYFHD 434

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
           +     LY+  +I S+  W     VL Q      + +   R  L   ++       +L L
Sbjct: 435 DR---ALYVNLFIPSTVTWADKGAVLTQATTFPDAANTQFRWKLRQPTEL------TLKL 485

Query: 555 RMPVWTYS-----NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
           R P W+ +     NGA+ S + +      PG++   T  W   D + ++L +    E+  
Sbjct: 486 RHPKWSPTATLLVNGAEVSHSDK------PGSYAELTRTWKTGDTVEMRLVMEPAVESA- 538

Query: 610 DDRPEYASIQAILFGPYLLAG 630
              P    I A  +GP +LAG
Sbjct: 539 ---PAAPEIVAFTYGPLVLAG 556


>gi|295133987|ref|YP_003584663.1| glycosyl hydrolase [Zunongwangia profunda SM-A87]
 gi|294982002|gb|ADF52467.1| putative glycosyl hydrolase [Zunongwangia profunda SM-A87]
          Length = 794

 Score =  242 bits (618), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 168/563 (29%), Positives = 273/563 (48%), Gaps = 65/563 (11%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           LK+V LH      + +   A  T+L+Y+L ++ D L+  F + A L    ++Y  WEN  
Sbjct: 36  LKDVKLH------TGLFEEAMYTDLDYILQMEPDRLLAPFLREAGLQPKAESYPNWEN-- 87

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDS 218
           + L GH  GHYL+A AQM+AS  +    ++++ ++  L + Q+  G GY+   P    DS
Sbjct: 88  TGLDGHIGGHYLTALAQMYASAGSDEALQRLNYMIGELKKAQDANGNGYVGGIP----DS 143

Query: 219 FEALKPV---------------WAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMV 259
               K +               W P Y IHK  AGL D Y++A N +A +M      WM+
Sbjct: 144 ERIWKEISEGKINAGGFSLNGGWVPLYNIHKTYAGLRDAYLIAGNEEAKQMLIDLTDWMI 203

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           +   N  +  I           L  E GG+N+    +Y +T D K+L LA+ F +   L 
Sbjct: 204 DITANLSEAQIQEM--------LKSEHGGLNETFADVYKMTGDKKYLDLAYAFTQKQVLD 255

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            L  + D L+  HANT IP VIG +    +  +  Y    T+F + V  + + + GG S 
Sbjct: 256 PLEHEKDILNGMHANTQIPKVIGYETIAALDQNKDYHNAATYFWENVVNNRTVSIGGNSV 315

Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
           RE +      +  + S +  ETC TYNMLK+S  LF    E  Y D+YE+ L N +LS Q
Sbjct: 316 REHFHPADDFSSMINSVQGPETCNTYNMLKLSEKLFLANPEEKYIDFYEQGLYNHILSSQ 375

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
                G  +Y  P+  G  +      +     S WCC G+G+E+  K  + IY   +   
Sbjct: 376 HPE--GGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHGKYNEMIYAHSDD-- 426

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
             LY+  +I S  +W+  +  L Q+ D P          T +F  + +  Q  ++N R P
Sbjct: 427 -ALYVNLFIPSEVNWEDKNFKLIQETDFPNAE-------TASFKIETQKPQKLTINFRYP 478

Query: 558 VWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
            W    G    +N + +     PG+++S T +W  +D+++++LP+++ +E +    P+ +
Sbjct: 479 SWA-GEGFDVQVNDKKVKFDKKPGSYISITRKWEDDDQISMRLPMNITSERL----PDGS 533

Query: 617 SIQAILFGPYLLAGHTSGEWDIK 639
             +++ +GP +LA  T G+ D+K
Sbjct: 534 DYESLKYGPLVLAAKT-GKEDLK 555


>gi|198275797|ref|ZP_03208328.1| hypothetical protein BACPLE_01972 [Bacteroides plebeius DSM 17135]
 gi|198271426|gb|EDY95696.1| hypothetical protein BACPLE_01972 [Bacteroides plebeius DSM 17135]
          Length = 796

 Score =  242 bits (618), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 168/532 (31%), Positives = 256/532 (48%), Gaps = 43/532 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ+ NL+ L+  DVD L+  F K A LP   + +  W    + L GH  GHYLSA A  +
Sbjct: 48  AQELNLKVLMEYDVDRLLAPFLKEAGLPLKAEPFPNW----AGLDGHVGGHYLSAMAMNY 103

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSF-----EALKPVWAPYY 230
           A+T N   +++M  ++  L  CQ   G GY+   P   EL+        E++   WAP+Y
Sbjct: 104 AATGNEECRKRMEYMLGELKRCQESNGDGYIGGVPNGKELWADIKNGKVESIWKYWAPWY 163

Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
            +HKI AGL D ++   N +AL M   + ++  + V + ++   +E+    L  E GGM+
Sbjct: 164 NVHKIFAGLRDAWMYTGNKEALDMFLRLCDWGVS-VTEGLSDNQMEQ---MLANEFGGMD 219

Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           ++    Y IT   K+L  A  F        +    D L + HANT IP VIG Q   EV 
Sbjct: 220 EIFADAYQITGKKKYLTTAKRFSHRWLFDSMVAHKDNLDNIHANTQIPKVIGYQRIAEVC 279

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFW--WDPKRLADTLGSENEETCTTYNMLK 408
           GD  Y     FF +IV    S A GG S RE++   D  R +     E  E+C TYNMLK
Sbjct: 280 GDNQYMDAADFFWNIVACKRSLALGGNSRREYFSSMDDFR-SHVEDREGPESCNTYNMLK 338

Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH--GWG 466
           ++  LFR T +  Y D+YE+AL N +LS Q     G + +        + AR  H   + 
Sbjct: 339 LTEGLFRMTGKAVYVDFYEKALYNHILSTQHPKHGGYVYF--------TSARPAHYRVYS 390

Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDP 526
              ++ WCC GTG+E+  K G+ IY     +   L++  +ISS  +W+   V + Q+ + 
Sbjct: 391 KPNSAMWCCVGTGMENHGKYGEFIYTHSSDS---LFVNLFISSRLNWEQEKVTITQETN- 446

Query: 527 IVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP---GNFL 583
               +   R+T+   S +       L LR P W  + G +   NG+ + +       +++
Sbjct: 447 -FPDEETSRLTVKLKSGESCH--FKLLLRRPAWV-TEGYEVKCNGKVVDVSEKVAGSSYI 502

Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
               +W   DK+ + LP+ +R E +Q +        AI+ GP L+      E
Sbjct: 503 CIDRKWKDGDKVEVSLPMKMRLETLQGE----DDFVAIMRGPILMGASVGTE 550


>gi|220928430|ref|YP_002505339.1| hypothetical protein Ccel_0997 [Clostridium cellulolyticum H10]
 gi|219998758|gb|ACL75359.1| protein of unknown function DUF1680 [Clostridium cellulolyticum
           H10]
          Length = 597

 Score =  242 bits (617), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 168/538 (31%), Positives = 272/538 (50%), Gaps = 63/538 (11%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
           ++T  +Y+   D++ L+ +FRK A + +  +  GGWE+    LRGHFVGH+LSA ++   
Sbjct: 21  RETAKKYVNDFDINRLMHTFRKNAGIESLAEPLGGWESEECNLRGHFVGHFLSACSKFAF 80

Query: 179 STHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEAL--KPVWAPYYTIHKIL 236
           S ++  +K K   +V  ++EC ++   GYLSAF  E+ D  E    + VWAPYYT+HKIL
Sbjct: 81  SDNDDCLKTKADNIVKIMAECASE--NGYLSAFGEEMLDILETEEDRGVWAPYYTLHKIL 138

Query: 237 AGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS--------LN--EET 286
            GL+D Y+  +N  AL +A  +  Y   R +++        +W +        +N   E 
Sbjct: 139 QGLVDCYLFLNNKTALSLAVNLAHYIRRRFERL-------SYWKTDGILRCTRVNPVNEF 191

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GG+ DVLY LY IT D K   LA +F++  F+G LA   D L   HANTH+P+VI +  R
Sbjct: 192 GGIGDVLYSLYEITGDRKIFDLADIFNRDYFIGNLAADRDVLEDLHANTHLPMVISAIHR 251

Query: 347 YEVTGDPLYK---------LIGTFFMDIVNASH--SYATGGTSAR-EFWWDPKRLADTLG 394
           + +TG+  YK         L+G  F++  ++S   S+  G  S + E W     L ++L 
Sbjct: 252 FNLTGEYKYKHAAQNFYKYLLGRTFVNGNSSSKATSFKKGEVSEKSEHWGAHNHLENSLT 311

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
               E+C  +N  K+ + LF WT++  + ++ E    N VL+    T  G+  Y  P+G 
Sbjct: 312 GGESESCCAHNTEKIVQQLFAWTEDERFLEHLEILKYNAVLN-STSTVTGLSQYQQPMGT 370

Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
           GV K      +   F++FWCC GTGIE+ S++  +I+F+++     L +  +I+S+  W 
Sbjct: 371 GVKK-----NFSGLFDTFWCCTGTGIEAMSEIQKNIWFKDKDT---LLLNMFIASTVQWD 422

Query: 515 SGHVVLNQKV---DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
             +V + Q     D  VS        LT S+   V    +L LR      S      +NG
Sbjct: 423 EKNVKIVQNTAYPDNTVS-------VLTVSTSNPVS--FTLMLRK-----SQVKSVKING 468

Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           ++        ++     ++ ND + I++  SL    ++    +     A+++   LLA
Sbjct: 469 KSFNFIADNGYIYIKRIFNNNDTIEIEIDSSLHLIQLKGSENK----AAVMYDRILLA 522


>gi|334144880|ref|YP_004538089.1| acetyl-CoA carboxylase, biotin carboxylase [Novosphingobium sp.
           PP1Y]
 gi|333936763|emb|CCA90122.1| acetyl-CoA carboxylase, biotin carboxylase [Novosphingobium sp.
           PP1Y]
          Length = 651

 Score =  241 bits (616), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 177/549 (32%), Positives = 260/549 (47%), Gaps = 51/549 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN-- 156
           L+   L DV L++   L  AQ+    YLL L  D L+ +FR  A L      YGGWE+  
Sbjct: 50  LEPFDLSDVTLEEGPFL-HAQRLTEAYLLRLQPDRLLHNFRVNAGLAPRAAVYGGWESDE 108

Query: 157 --PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
                   GH +GHYLSA A  + ST++   K+++  +   L+ CQ   G+G + AFP  
Sbjct: 109 IWADINCHGHTLGHYLSACALAFRSTNDRRFKQRVDYIANELAACQKATGSGLVCAFPDG 168

Query: 215 ---LFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
              L       K    P+YT+HK+ AGL D  +LAD+  +    +++A W V        
Sbjct: 169 PALLTAHLRGDKITGVPWYTLHKVYAGLRDGALLADSTVSREVLIRLADWGV-------- 220

Query: 268 KVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
            V T    +  + + L  E GGMN+V   LY++T +  +  L+  F     +  L    D
Sbjct: 221 -VATRPLTDGQFETMLATEHGGMNEVYADLYAMTGNEDYRELSQRFSHKAVMDPLVQGRD 279

Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
            L   HANT +P ++G Q  YE+TGD  Y     FF   V  + S+ATGG    E ++  
Sbjct: 280 LLDGMHANTQVPKIVGFQRVYEITGDDRYAQAANFFFRTVAHTRSFATGGHGDNEHFF-- 337

Query: 387 KRLAD----TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
             +AD       ++  ETC  +NMLK++R LF       YADYYER L NG+L+ Q   +
Sbjct: 338 -AMADFDRHVFSAKGSETCCQHNMLKLARLLFMQDPNADYADYYERTLYNGILASQ-DPD 395

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
            G++ Y      G  K      + T  +SFWCC GTG+E+  K  DSIYF +E +   LY
Sbjct: 396 SGMVTYFQGARPGYMKL-----YHTPEHSFWCCTGTGMENHVKYRDSIYFHDERS---LY 447

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
           +  ++ SS  WK     L Q+        P   +     +  ++    +L LR P W  S
Sbjct: 448 VNLFVPSSVAWKEKGAELIQRT--AFPEKPTTGLQWKLRAPAKI----ALQLRHPRW--S 499

Query: 563 NGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
             A   +NGQ +      G+++     W   D++ +QL +    E+     P    I A 
Sbjct: 500 RTAVVRVNGQEVARSATAGSYVEVARTWKDGDRVELQLEMEPTVESA----PAAPDIVAF 555

Query: 622 LFGPYLLAG 630
            +GP +LAG
Sbjct: 556 TYGPIVLAG 564


>gi|302340651|ref|YP_003805857.1| hypothetical protein Spirs_4187 [Spirochaeta smaragdinae DSM 11293]
 gi|301637836|gb|ADK83263.1| protein of unknown function DUF1680 [Spirochaeta smaragdinae DSM
           11293]
          Length = 764

 Score =  241 bits (615), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 172/559 (30%), Positives = 266/559 (47%), Gaps = 39/559 (6%)

Query: 107 VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENPISELRGHF 165
           V L + SV    Q   +++L+  D D ++++FR  A + T G     GW+ P   LRGH 
Sbjct: 196 VMLKEGSVFCDEQDKMIQHLIDTDDDQMLYNFRVAAGVDTRGALPMTGWDAPSCNLRGHT 255

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----GTGYLSAFPTELFDSFE 220
            GHYLS+ A  W+ T    + +K+  ++ SLSECQN +       G+LSA+    FD  E
Sbjct: 256 TGHYLSSLALGWSVTKKTELMDKIVYLIESLSECQNALEERGCSKGFLSAYSERQFDLLE 315

Query: 221 ALKP---VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
              P   +WAPYYT+ KI++GL D Y LAD++ AL +   M ++ Y R+ + ++   +++
Sbjct: 316 TYTPYPTIWAPYYTLDKIMSGLYDCYSLADSSLALNILCKMGDWVYERLSR-LSRNQLDK 374

Query: 278 HW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
            W   +  E GGM  V+ +LY++T    +L  A+ FD       +    D L   HAN H
Sbjct: 375 MWSMYIAGEFGGMISVMVKLYTLTKKKTYLQTAYYFDNEKLFYPMQENIDTLKDMHANQH 434

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
           IP ++G+   YE  G   Y  I   F +IV ASH Y+ GG    E + +P  +   +  +
Sbjct: 435 IPQIMGAVELYEADGSGRYYDIAKNFWNIVTASHVYSIGGIGETEMFHEPNEIMTYITDK 494

Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
             E+C +YN+L+++  LF    E    D+YE  L N +LS       G   Y +PL  G 
Sbjct: 495 TAESCASYNILRLTGQLFALEPERRKMDFYETVLYNHILSSFSHKSDGGTTYFMPLRPGG 554

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
            K      + TK N+  CC+G+G+E+  +    IY     N   LYI  YI S+ +W+  
Sbjct: 555 HKE-----FNTKENT--CCHGSGLETRFRYVQDIY---ACNHDTLYINLYIPSAVEWE-- 602

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
               N +++   + D       TF          +L  R+P W          N +++  
Sbjct: 603 ----NFRIEQTTASDA----AGTFIFLIHSSGWRNLAFRIPHWAEDEYKVTINNQESVEE 654

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
                +      W   D++ I  P   R   + D +P YA    + +GPY+LA  +  E 
Sbjct: 655 MAQDGYFYLHRDWREGDRIEILTPYHFRKLPVPDGKP-YA---CMAYGPYILAALSDQEE 710

Query: 637 DIK----TGTARSLSALIS 651
            +     TG  R L+A IS
Sbjct: 711 YLPFPELTGDDRVLTASIS 729


>gi|431795908|ref|YP_007222812.1| hypothetical protein Echvi_0518 [Echinicola vietnamensis DSM 17526]
 gi|430786673|gb|AGA76802.1| hypothetical protein Echvi_0518 [Echinicola vietnamensis DSM 17526]
          Length = 784

 Score =  241 bits (614), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 166/527 (31%), Positives = 254/527 (48%), Gaps = 40/527 (7%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQQ ++ Y+  ++VD L+  +   A +      Y  WEN  + L GH  GHYLSA A M+
Sbjct: 46  AQQVDMTYMKAMEVDRLLAPYMLEAGVDWAADRYPNWEN--TGLDGHIGGHYLSALAMMY 103

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP-----------TELFDSFEALKPVW 226
           AST +A +K +M  +V  L+  Q K G GY+   P            E+     +L   W
Sbjct: 104 ASTGDAEMKRRMDYMVEQLAMAQAKNGNGYVGGIPGGMAMWEEIGQGEIDAGGFSLNQKW 163

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            P Y IHKI AGL D Y++  NAQA ++   + ++FY   + +      E+    L  E 
Sbjct: 164 VPLYNIHKIYAGLRDAYLIGGNAQAKEVLLDLTDWFYELTKGLTD----EQFQQMLVSEH 219

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GG+N+V   + +IT + K+L LA        L  L  Q D L+  HANT IP VIG Q R
Sbjct: 220 GGLNEVFADVAAITGEAKYLELAKKMSHEWLLEPLEEQEDKLTGMHANTQIPKVIGFQ-R 278

Query: 347 YEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
               GD   ++    FF   V  + + A GG S RE +      +  + S +  ETC TY
Sbjct: 279 VAQEGDLAEWQEAADFFWHTVVENRTVAIGGNSVREHFHPEDDFSPMVSSNQGPETCNTY 338

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           NML++S  LF    +  Y D++ER L N +LS Q   E G  +Y  P+     +      
Sbjct: 339 NMLRLSEQLFMSNPQAEYVDFFERGLYNHILSSQH-PEKGGFVYFTPM-----RPEHYRV 392

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
           +      FWCC G+G+E+ +K G+ IY   E     LYI  +I S  +W+   +VL Q  
Sbjct: 393 YSQPQQGFWCCVGSGLENHAKYGEFIYAHSEEE---LYINLFIPSELNWEEKGMVLTQTN 449

Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFL 583
           +     +P    T      +++     + LR P W      Q S+NG+   +   P +++
Sbjct: 450 N--FPEEPQSVFTFEMDKARKM----PVKLRYPSWVAEGALQVSVNGRPFEVNASPSSYI 503

Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +   +W   D+L ++LP+ ++ E +    P+ +   A ++GP +LA 
Sbjct: 504 TINRKWKDGDRLEVKLPMEMQWEQL----PDGSDWGAFVYGPIVLAA 546


>gi|268609237|ref|ZP_06142964.1| hypothetical protein RflaF_07037 [Ruminococcus flavefaciens FD-1]
          Length = 1082

 Score =  241 bits (614), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 180/576 (31%), Positives = 274/576 (47%), Gaps = 61/576 (10%)

Query: 96  GNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGW 154
           G+ + + S+ DV +        A +  ++YLL  D + L+  FR+ A L T G K YGGW
Sbjct: 37  GSRISDFSISDVKM-TDDYCTNAFEKEMKYLLSFDTERLLAGFRENAGLSTNGAKRYGGW 95

Query: 155 ENPISELRGHFVGHYLSASAQMW-----ASTHNATIKEKMSTVVFSLSECQN--KIGTGY 207
           EN  + + GH VGHYL+A AQ +      S     + ++M T++  +  CQ   +   G+
Sbjct: 96  EN--TNIAGHCVGHYLTALAQAYQNPNVTSDQKDALYKRMKTLIDGMQACQQHPRGKKGF 153

Query: 208 LSAFPT-------ELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMA 255
           L A P          FD  E  K       W P+YT+HK++AG++D Y     A A  + 
Sbjct: 154 LWAAPVPSDGNVERQFDRVEIGKANIFDDAWVPWYTMHKLIAGIVDVYNATQYAPAKDVG 213

Query: 256 TWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKP 315
           + + ++ YNR     + +S +     L+ E GGMND +Y LY IT    H   AH+FD+ 
Sbjct: 214 SALGDWVYNRC----SGWSQQTRNTVLSIEYGGMNDCMYDLYRITGKDSHAAAAHVFDED 269

Query: 316 CFLGFLALQA-DYLSHFHANTHIPIVIGSQMRY------EVTGDPL----YKLIGTFFMD 364
                ++    D L+  HANT IP  IG+  RY       V G  +    Y      F D
Sbjct: 270 ALFQKVSNGGRDVLNGRHANTTIPKFIGALKRYMVLDGKTVNGQKVDASAYLKYAENFWD 329

Query: 365 IVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYAD 424
           +V   H+Y TGG S  E +     L     + N ETC +YNMLK+SR LF+ T +  Y D
Sbjct: 330 MVTTHHTYITGGNSEWEHFGKDDILDAERTNCNCETCNSYNMLKLSRELFKITHDSKYMD 389

Query: 425 YYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFS 484
           +YE    N +LS Q   E G+  Y  P+  G  K  S     T+++ FWCC G+G+ESF+
Sbjct: 390 FYENTYYNSILSSQN-PETGMTTYFQPMATGYFKVYS-----TQWDKFWCCTGSGMESFT 443

Query: 485 KLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQ 544
           KLGD+IY  +  +   LY+  Y SS  +W   +V + Q  +  +     ++ T+  SS  
Sbjct: 444 KLGDTIYMHDNDS---LYVNFYQSSVINWAEKNVSITQ--ESTIPDGASVKFTIKGSSDL 498

Query: 545 EVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
           +      L  R+P W        S+NG          +   +  +S  D + + +P  +R
Sbjct: 499 D------LRFRIPDWI-DGTMGVSVNGTKYSYKTVNGYADVSGSFSNGDVIELTVPSKVR 551

Query: 605 TEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKT 640
              +    P+   +    +GP +L+    G+ D+KT
Sbjct: 552 AYPL----PDSPDVYGFKYGPLVLSAEL-GKDDMKT 582


>gi|383640258|ref|ZP_09952664.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas elodea
           ATCC 31461]
          Length = 652

 Score =  239 bits (611), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 179/549 (32%), Positives = 262/549 (47%), Gaps = 51/549 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE-NP 157
           L+   + DV L +   L  AQ+    YLL L+ D L+  FR  A L     AYGGWE +P
Sbjct: 51  LQPFDMADVTLGEGPFL-HAQRATEAYLLRLEPDRLLHQFRVNAGLEPKAPAYGGWESDP 109

Query: 158 I---SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
           +      +GH +GHYLSA A  + +T  A  ++++  +   L  CQ+   +G ++AFP  
Sbjct: 110 LWSDIHCQGHTLGHYLSACALAYRATGEARYRQRVDYIATELGACQDAAKSGLVTAFPKG 169

Query: 215 ---LFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
              +       K    P+YT+HK+ AGL D  +LAD+  A    L++A W V        
Sbjct: 170 AALVSAHLRGEKITGVPWYTLHKVYAGLRDGALLADSEPARATLLRLADWGVV-----AS 224

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
           + ++    E     L  E GGMN++   LY +T   ++  +A  F     L  LA   D+
Sbjct: 225 RPLSDAEFEA---MLETEHGGMNEIYADLYFMTGKEEYRAIARRFSHKALLAPLARAQDH 281

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L   HANT +P V+G Q  YE TGD  Y+    FF   V  + S+ATGG    E ++   
Sbjct: 282 LDGLHANTQVPKVVGFQRVYEATGDAAYRDAAAFFWKTVAQTRSFATGGHGDNEHFFA-- 339

Query: 388 RLAD----TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
            +AD       ++  ETC  +NMLK++R LF    + AYADYYER L NG+L+ Q   + 
Sbjct: 340 -MADFETHVFSAKGSETCCQHNMLKLTRALFLHDPDPAYADYYERTLYNGILASQ-DPDS 397

Query: 444 GVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
           G+  Y      G  K      + T  +SFWCC GTG+E+  K  DSIYF +      LY+
Sbjct: 398 GMATYFQGARPGYMKL-----YHTPEHSFWCCTGTGMENHVKYRDSIYFHDAST---LYV 449

Query: 504 IQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
             ++ S+  W+    VL Q+   P V        T T   + +     +L+LR P W  S
Sbjct: 450 NLFLPSTLRWRDKGAVLVQETRFPEVP-------TTTLRWRLDKPVDVTLSLRHPGW--S 500

Query: 563 NGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
             A   +NG+       PG+ ++    W   D + +QL +    E      P    + A 
Sbjct: 501 RTATVRVNGKVAARSVAPGSRIALPRNWRDGDVVELQLVMEPGVERA----PAAPDVVAF 556

Query: 622 LFGPYLLAG 630
            +GP +LAG
Sbjct: 557 TYGPLVLAG 565


>gi|379726800|ref|YP_005318985.1| hypothetical protein MPD5_0184 [Melissococcus plutonius DAT561]
 gi|376317703|dbj|BAL61490.1| hypothetical protein MPD5_0184 [Melissococcus plutonius DAT561]
          Length = 883

 Score =  239 bits (611), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 178/565 (31%), Positives = 264/565 (46%), Gaps = 70/565 (12%)

Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL-PTPGKAYGGWENPIS-ELRGHFVGH 168
           Q   + +AQ+  + YLL LDV   ++ F K A + P     Y GWE       RGHF GH
Sbjct: 12  QDPYIHKAQENVIHYLLSLDVQKFLFEFYKVAGMKPLTESGYQGWERSDQVNFRGHFFGH 71

Query: 169 YLSASAQMWASTHNATIKEKM----STVVFSLSECQNKIG------TGYLSAFPTELFDS 218
           +LSA A  + +     +K+K+     T +  L   Q           GY+SAF     D 
Sbjct: 72  FLSALALSYQAEKQPILKKKIHQQIKTAITGLKAVQKNYAKQHPEHAGYISAFKEVALDE 131

Query: 219 FEALKPV--------WAPYYTIHKILAGLLDQYVLADNA------QALKMATWMVEYFYN 264
            E  KPV           +Y +HKILAGLL+  +           +AL +A+W  +Y Y 
Sbjct: 132 VEG-KPVDPKEKENVLVSWYNLHKILAGLLEVNISLKEVDSQLSKEALFIASWFGDYIYK 190

Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
           R+  +     +      L  E GGMND LY L+ +T   +H + A  FD+      LA  
Sbjct: 191 RMMNLTDKNQM------LTIEYGGMNDALYCLFELTQKKEHAIAATYFDEDNLFNQLAND 244

Query: 325 ADYLSHFHANTHIPIVIGSQMRYEV----------TGDPLYKLIGTF-----FMDIVNAS 369
            + L   HANT IP +IG+  RY V          + +    L+  F     F  IV  +
Sbjct: 245 ENVLPGKHANTTIPKLIGALKRYMVFQSEDLSAWLSNEEKEHLMSYFKAAEKFWQIVVDN 304

Query: 370 HSYATGGTSAREFWWDPKRL----ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
           H+Y TGG S  E + +P  L        G    ETC T+NMLK++R L+  TK   Y DY
Sbjct: 305 HTYCTGGNSQSEHFHEPNELFYDSEIRQGDCTCETCNTHNMLKLTRKLYECTKNPKYLDY 364

Query: 426 YERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSK 485
           YE    N +L+ Q  ++ G+M+Y  P+G G +K      +   ++ FWCC GTGIESFSK
Sbjct: 365 YETTYINAILASQ-NSKTGMMMYFQPMGAGYNKV-----YNRPYDEFWCCSGTGIESFSK 418

Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
           L D+ YF+E      L++  Y S++   K  ++ + QK D     +  + + L   + + 
Sbjct: 419 LADTYYFKENNR---LFVNLYFSNTLKLKENNLKIIQKTD---RKNGNVTIDLKTLTDKN 472

Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           + Q   L LR+P W  +         + L   P   F   +E  + ND++ +++   L+ 
Sbjct: 473 IIQPLQLALRLPNW--AKQVTIKKGKKLLNYEPHLGFAYLSELVTANDQIILEMEQELQL 530

Query: 606 EAIQDDRPEYASIQAILFGPYLLAG 630
                D P+ A+  A  +GPY+LAG
Sbjct: 531 L----DTPDNANYIAFKYGPYILAG 551


>gi|430751026|ref|YP_007213934.1| hypothetical protein Theco_2852 [Thermobacillus composti KWC4]
 gi|430734991|gb|AGA58936.1| hypothetical protein Theco_2852 [Thermobacillus composti KWC4]
          Length = 621

 Score =  239 bits (611), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 164/563 (29%), Positives = 271/563 (48%), Gaps = 60/563 (10%)

Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFR----KTASLPTPGKAYGGWENPISELRGHFVGHYL 170
           L R +Q N  YL+ L+ DSL++++R    + +    P  A+GGWE+P+ +LRGHF+GH+L
Sbjct: 16  LRRREQANRAYLMKLNSDSLLFNYRLEAGRYSGREIPPWAHGGWESPVCQLRGHFLGHWL 75

Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYY 230
           SA+A  + +T +A +K K   ++  L+ECQ   G  +    P +      A K +WAP Y
Sbjct: 76  SAAAIHYHATGDAELKAKADGIIDELAECQKDNGGQWAGPIPEKYLHWIAAGKAIWAPQY 135

Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
            +HK+  GL+D +  A N +AL +A    ++F     +    ++ ++    L+ ETGGM 
Sbjct: 136 NLHKLFMGLVDSFQYAGNQKALDIADRFADWFVEWSGR----FTRDQFDDILDVETGGML 191

Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           +V   L  IT + K+  L   + +      L    D L++ HANT IP V+G    YEVT
Sbjct: 192 EVWADLLHITGNGKYKTLLERYYRGRLFQPLLEGKDPLTNMHANTTIPEVLGCARAYEVT 251

Query: 351 GDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
           GD  +  ++  ++   V      ATGG ++ E W    ++   LG +N+E CT YNM+++
Sbjct: 252 GDSRWMDVVKAYWNCAVTERGFLATGGQTSGEVWMPKMKMKARLGDKNQEHCTVYNMMRL 311

Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTE------------PGVMIYMLPLGRGVS 457
           +  LFR T +  YA Y E  L NGV++     E             G++ Y LP+  G+ 
Sbjct: 312 AEFLFRHTGDPGYAQYREYNLYNGVMAQTYYREYALNGNPHNHPGTGLLTYFLPMKAGLR 371

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF--DWKS 515
           K      W T+ +SF+CC+GT +++ +     IY+++  ++   YI QY +S    +   
Sbjct: 372 K-----DWSTETSSFFCCHGTMVQANAAWNRGIYYQDRDDI---YICQYFNSEMTTEING 423

Query: 516 GHVVLNQKVDPI-----------------------VSWDPYLRMTLTFSSKQEVGQLSSL 552
           G + + Q  DP+                        +  PY +    F  +  V Q  ++
Sbjct: 424 GELRIIQTQDPMNGNSMTSSNTAGYQSINEVAAIHENLPPYRK--YDFVIRTSVQQPFAI 481

Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
           + R+P W  S+      +  +        F      W   DK+++ LP+ +R   + DD 
Sbjct: 482 HFRIPEWIMSDAVLYVNDEFHGKTSDSTRFYPIRRVWRDGDKISVLLPIGIRFVPLPDDE 541

Query: 613 PEYASIQAILFGPYLLAGHTSGE 635
               +  A  +GP +LAG    E
Sbjct: 542 ----NTGAFRYGPEVLAGICDAE 560


>gi|410638732|ref|ZP_11349285.1| acetyl-CoA carboxylase, biotin carboxylase [Glaciecola lipolytica
           E3]
 gi|410141260|dbj|GAC16490.1| acetyl-CoA carboxylase, biotin carboxylase [Glaciecola lipolytica
           E3]
          Length = 818

 Score =  239 bits (609), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 168/528 (31%), Positives = 263/528 (49%), Gaps = 44/528 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQQTN+ YLL +  D L+  + + A L     +YG WEN  + L GH  GHYLSA +  W
Sbjct: 67  AQQTNVGYLLAIQPDKLLAPYLREAGLEPKVDSYGNWEN--TGLDGHIGGHYLSALSLAW 124

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE--LFDSFE---------ALKPVW 226
           A+T +  +K ++  ++  L + QN  G GYL   P    ++D  +         +L   W
Sbjct: 125 AATQDTELKRRLDYMLNELQKAQNANG-GYLGGIPNGKVMWDEIKQGNIKADLFSLNDRW 183

Query: 227 APYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
            P Y I KI  GL D Y++A++ QA    L +  WM++   N     ++   +++  YS 
Sbjct: 184 VPLYNIDKIFHGLRDAYLIANSEQAKTMLLSLGQWMLDVTNN-----LSDEQIQQMLYS- 237

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
             E GG+N+V   + +I+ D  +L LA  F     +  L    D L+  HANT IP +IG
Sbjct: 238 --EHGGLNEVFADMSTISGDKAYLELARKFSHKRIIDPLVAHKDELNGLHANTQIPKIIG 295

Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETC 401
           +    ++  D  +K    FF + V    S A GG S RE + D    +  +   E  ETC
Sbjct: 296 ALKVAQLNNDESWKEAARFFWETVTKQRSVAIGGNSVREHFHDAADFSPMVEDPEGPETC 355

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
            TYNM+K+S+ LF  T +  Y DYYERA  N +LS Q   E G ++Y   +  G  +  S
Sbjct: 356 NTYNMIKLSKLLFLQTADTRYLDYYERATYNHILSSQH-PEHGGLVYFTSMRPGHYRMYS 414

Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
           +       +S WCC G+GIE+ SK G+ IY     +V  L +  +ISS+  W    + L 
Sbjct: 415 SVQ-----DSMWCCVGSGIENHSKYGELIY---SHSVDNLSVNLFISSTLRWPEKGLKLT 466

Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
            +     S +  +++     +++++G+   LN+R P W +S+      NG+ +       
Sbjct: 467 LETQFPDSQNVVIKLHQL--AEKQMGEF-VLNIRKPAW-FSHDISMFKNGEKINYVENEG 522

Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           ++   + W   D+L+ +L   L TE + D +  Y    A+L+GP +LA
Sbjct: 523 YIQIQQNWQDGDELSFELAAGLSTEQLPDGQNYY----AVLYGPVVLA 566


>gi|312133546|ref|YP_004000885.1| protein [Bifidobacterium longum subsp. longum BBMN68]
 gi|322690281|ref|YP_004219851.1| hypothetical protein BLLJ_0089 [Bifidobacterium longum subsp.
           longum JCM 1217]
 gi|311772796|gb|ADQ02284.1| Hypothetical protein BBMN68_1283 [Bifidobacterium longum subsp.
           longum BBMN68]
 gi|320455137|dbj|BAJ65759.1| conserved hypothetical protein [Bifidobacterium longum subsp.
           longum JCM 1217]
          Length = 800

 Score =  238 bits (608), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 181/597 (30%), Positives = 268/597 (44%), Gaps = 82/597 (13%)

Query: 94  LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL---PTPGKA 150
           LPG  +    L +V +  +SV  RA++  L+Y     VD  +  FR  A+L       + 
Sbjct: 81  LPGWKVAPFPLRNVAITSNSVFDRAKEGMLDYARNYPVDRWLVCFRAQANLLPKDNTTQP 140

Query: 151 YGGWENPISE---------------------------LRGHFVGHYLSASAQMWASTHNA 183
            GGWEN  S                            LRGHF GH L   +Q +A T   
Sbjct: 141 SGGWENFPSGSLDKAVEQQWGDAEYTRGQNKNGADGLLRGHFAGHALHMLSQAYAETGEE 200

Query: 184 TIKEKMSTVVFSLSECQNKIGT------------GYLSAFPTELFDSFEALKP---VWAP 228
            I  K++  V  L EC++ +              G+L+A+    F + E   P   +WAP
Sbjct: 201 AILNKINEFVSGLKECRDSLREMKYNGKARYSHPGFLAAYGEWQFKALEEYAPYGEIWAP 260

Query: 229 YYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETG 287
           +YT HKILAGL+  Y  A NA AL +A  +  + Y R+ K  T   +++ W   +  E G
Sbjct: 261 WYTEHKILAGLIAAYEFAGNADALDLAEGIGHWTYARLSKC-TKTQLQKMWDIYIGGEYG 319

Query: 288 GMNDVLYRLYSITHDPKH---LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
           GMND L  LY+++ D      L  +  FD    +       D L++ HAN HIP  +G  
Sbjct: 320 GMNDSLVDLYNVSKDKDRSEFLKASAFFDTDKLIDNCGAGVDILNNLHANQHIPQFVGYA 379

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNA-------SHSYATGGTSAREFWWDPKRLADTLGSEN 397
               +    +       ++  V            YA GGT   E W     +A  +G  N
Sbjct: 380 KDAAMGDADIDADARARYLKAVEGYWGMIVPGRMYAHGGTGEGEMWGPAHTVAGDIGKRN 439

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ-RGTEPGVMI-----YMLP 451
            E+C  YNMLKV+R+LF   ++ AY DYYER + N +L  + R  + G  +     YM P
Sbjct: 440 AESCAAYNMLKVARYLFFIEQKPAYMDYYERTILNHILGGKSRDLDSGTALTPGNCYMYP 499

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           +     K       GT      CC GT +ES SK  DSIYF    N   LY+  + +S+ 
Sbjct: 500 VNPATQKEYGDGNIGT------CCGGTALESHSKYQDSIYFHSTDNKE-LYVNLFTASTL 552

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           DW    + L Q+ +     +    +++T + K  V    +  +R+P W  S GA+  +NG
Sbjct: 553 DWTDTGLKLAQETN--YPEEETSTISITAAPKSAV----TFRIRIPAW--SKGAKIEVNG 604

Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
           + +     G + +    W   DK+ + +PL LRTE+  DDR +   IQ + +GP +L
Sbjct: 605 KAIDGVTAGEYATVAGSWKVGDKIVVTIPLQLRTEST-DDRKD---IQTLFYGPTVL 657


>gi|338209455|ref|YP_004646426.1| hypothetical protein Runsl_5734 [Runella slithyformis DSM 19594]
 gi|336308918|gb|AEI52019.1| protein of unknown function DUF1680 [Runella slithyformis DSM
           19594]
          Length = 760

 Score =  238 bits (608), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 166/531 (31%), Positives = 257/531 (48%), Gaps = 44/531 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ  +L Y+L L+ D L+  +   A LP   + YG WE+  S L GH  GHYLSA A M+
Sbjct: 40  AQDVDLRYILSLNPDKLLAPYLIDAGLPLKAERYGNWES--SGLDGHIGGHYLSALAMMY 97

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEALKPVW 226
           AST NA +K+++  ++  L++CQ K G GY+   P            ++  S   L   W
Sbjct: 98  ASTGNAELKKRLDYMIDQLAQCQAKNGNGYVGGIPQGKVFWERIYKGDIDGSSFGLNNTW 157

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            P Y IHK+ AGL D Y    N QA ++   + ++F     ++I   S ++    L  E 
Sbjct: 158 VPLYNIHKLFAGLRDSYEFGGNQQAKQVLIGLGDWF----AELIRPLSDDQIQQILRTEH 213

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGMN+    LY +T + K+L  A        L  L  + D L+  HANT IP VIG +  
Sbjct: 214 GGMNEAFADLYILTKNQKYLETAQRISHRAILNPLVQKQDKLTGLHANTQIPKVIGFEKI 273

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
             +T +  +     +F   V+ + + A GG S RE +      +  L S +  ETC ++N
Sbjct: 274 AMLTENAKWSEAARYFWQNVSQTRTVAFGGNSVREHFNPTNDFSSMLKSNQGPETCNSFN 333

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH-- 463
           ML++S+ LF    + +Y D+YER L N +LS Q   + G  +Y  P+       R  H  
Sbjct: 334 MLRLSKALFLDKNDPSYLDFYERTLYNHILSSQH-PQKGGFVYFTPI-------RPNHYR 385

Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
            +     S WCC G+G+E+ +K  + IY     +   L++  +I S+  WK   + L Q 
Sbjct: 386 VYSQPETSMWCCVGSGLENHTKYSELIYSHSAND---LFVNLFIPSTLHWKEKSIQLTQA 442

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNF 582
            +      PY   +  F  K    Q  +LN+R P W  ++  +  +NG+  P    P N+
Sbjct: 443 TEF-----PYKNQS-EFVLKLAKSQAFTLNIRYPKW--ADDVEVMVNGKLYPTSAQPSNY 494

Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           +    +W   DKL+++   S   E +    P+ ++  A + GP +LA  TS
Sbjct: 495 IGIRRKWKTGDKLSVRFTTSTHLEYL----PDGSNWAAFVHGPIVLAAKTS 541


>gi|182415028|ref|YP_001820094.1| hypothetical protein Oter_3214 [Opitutus terrae PB90-1]
 gi|177842242|gb|ACB76494.1| protein of unknown function DUF1680 [Opitutus terrae PB90-1]
          Length = 844

 Score =  238 bits (607), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 179/554 (32%), Positives = 265/554 (47%), Gaps = 46/554 (8%)

Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
           + L  V L +    + A + N  YLL LD D L+  FR+ A LP   + YG WE+    L
Sbjct: 76  LPLASVRLLEGGPFFTAVKANRTYLLALDADRLLAPFRREAGLPALAQPYGNWES--GGL 133

Query: 162 RGHFVGHYLSASAQMWASTHN---ATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELF 216
            GH  GHYLSA A M A+ H+     ++ ++  +V  L  CQ+  G GY+   P   EL+
Sbjct: 134 DGHTAGHYLSALAHMIAAGHDTPEGELRRRLDHMVAELKACQDANGNGYVGGVPGSHELW 193

Query: 217 D-----SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
                    A+   W P+Y +HK  AGL D ++   N  A    +++  W V        
Sbjct: 194 QRVAAGDVTAVNRKWVPWYNLHKTFAGLRDAWLQTGNTTARDVLVRLGDWCV-------- 245

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            + +  + E+    L +E GGMN+VL  +Y+IT D K+L  A  F+    L  L    D 
Sbjct: 246 ALTSPLTDEQMQRMLAQEHGGMNEVLADIYAITGDKKYLTAAERFNHHAVLDPLEQHRDE 305

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L+  HANT IP V+G +    +TGD        FF + V    S A GG S  E + DP 
Sbjct: 306 LTGKHANTQIPKVVGLERIATLTGDKAADSGARFFWETVTQHRSVAFGGNSVSEHFNDPH 365

Query: 388 RL-ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
              A  +  E  ETC TYNML+++  LF    E AYADYYERAL N +L+      PG  
Sbjct: 366 NFHALLVHREGPETCNTYNMLRLTEGLFASAPEAAYADYYERALFNHILASINPDHPG-Y 424

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
           +Y  P+     +  S    G     FWCC GTG+E+  K G+ IY        G+++  +
Sbjct: 425 VYFTPIRPNHYRVYSQPDQG-----FWCCVGTGMENPGKYGEFIYARAHD---GVFVNLF 476

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
           I+S        + L Q+       D   ++TL  +  Q      +L++R P W  +    
Sbjct: 477 IASELTVAPLGLTLRQQT--AFPDDERSQLTLKLAQPQTF----TLHVRQPGWVAAGTFT 530

Query: 567 ASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
            ++NG+ + +   P ++++    W   D++ I+ P+    E + D  P Y    AIL GP
Sbjct: 531 LTVNGEPVAVTSAPSSYVTIHREWRDGDRVEIRFPMHTSIEGLPDGSPWY----AILRGP 586

Query: 626 YLLAGHTSGEWDIK 639
            +LA H +G W++K
Sbjct: 587 IVLA-HPAGTWELK 599


>gi|189466409|ref|ZP_03015194.1| hypothetical protein BACINT_02784 [Bacteroides intestinalis DSM
           17393]
 gi|189434673|gb|EDV03658.1| hypothetical protein BACINT_02784 [Bacteroides intestinalis DSM
           17393]
          Length = 789

 Score =  238 bits (607), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 174/563 (30%), Positives = 269/563 (47%), Gaps = 59/563 (10%)

Query: 100 KEVS---LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
           +EVS   L DV L +S  L +AQQT+L Y++ ++ D L+  F + A L     +Y  WEN
Sbjct: 24  QEVSYFPLQDVKLLESPFL-QAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWEN 82

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TE 214
             + L GH  GHY+SA + M+A+T +  I  +++ ++  L   Q  +GTG++   P   +
Sbjct: 83  --TGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLAELHRAQQAVGTGFIGGTPGSLQ 140

Query: 215 LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEY 261
           L+   +A         L   W P Y IHK  AGL D Y+ A +  A +M      WM++ 
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSNLAREMLIALTDWMID- 199

Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
                  +    + ++    L  E GG+N+    +  IT D K+L LA  F     L  L
Sbjct: 200 -------ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPL 252

Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYAT 374
               D L+  HANT IP VIG +   ++  D         +     FF + V    S   
Sbjct: 253 VKDEDRLTGMHANTQIPKVIGYKRIADLAQDDKDWNHASEWDHAARFFWNTVVNHRSVCI 312

Query: 375 GGTSAREFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
           GG S RE +         L   +  ETC TYNML++++ L++ + +I +ADYYERAL N 
Sbjct: 313 GGNSVREHFHPADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNH 372

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
           +L+ Q+  E G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY  
Sbjct: 373 ILASQQ-PEKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAH 426

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
                  LY+  +I S   W+   V L Q+       +  +R  +  S K+      SL 
Sbjct: 427 TNDT---LYVNLFIPSRLTWQEKKVTLVQETR--FPDEEQIRFRVEKSRKKAF----SLK 477

Query: 554 LRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
           LR P W  + GA  S+NG+       PG +L+   +W   D++T+ +P+ +  E I    
Sbjct: 478 LRYPSW--AKGASVSVNGKVQETNAQPGEYLTIHRKWKAGDEITLNMPMQVALEQI---- 531

Query: 613 PEYASIQAILFGPYLLAGHTSGE 635
           P+  +  A ++GP +LA  T  E
Sbjct: 532 PDRENFYAFMYGPIVLASPTGTE 554


>gi|419849455|ref|ZP_14372501.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
           35B]
 gi|419852148|ref|ZP_14375044.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
           2-2B]
 gi|386411767|gb|EIJ26479.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
           2-2B]
 gi|386411993|gb|EIJ26692.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
           35B]
          Length = 800

 Score =  238 bits (606), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 180/597 (30%), Positives = 269/597 (45%), Gaps = 82/597 (13%)

Query: 94  LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL---PTPGKA 150
           LPG  +    L +V +  +SV  RA++  L+Y     VD  +  FR  A+L       + 
Sbjct: 81  LPGWKVAPFPLRNVAITSNSVFDRAKEGMLDYARNYPVDRWLVCFRAQANLLPKDNTTQP 140

Query: 151 YGGWEN--------PISE-------------------LRGHFVGHYLSASAQMWASTHNA 183
            GGWEN         + +                   LRGHF GH L   +Q +A T   
Sbjct: 141 SGGWENFPNGSLDKAVEQQWGDAEYTRGQNKNGADGLLRGHFAGHALHMLSQAYAETGEE 200

Query: 184 TIKEKMSTVVFSLSECQNKIGT------------GYLSAFPTELFDSFEALKP---VWAP 228
            I  K++  V  L EC++ +              G+L+A+    F + E   P   +WAP
Sbjct: 201 AILNKINEFVSGLKECRDSLREMKYNGKARYSHPGFLAAYGEWQFKALEEYAPYGEIWAP 260

Query: 229 YYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETG 287
           +YT HKILAGL+  Y  A NA AL +A  +  + Y R+ K  T   +++ W   +  E G
Sbjct: 261 WYTEHKILAGLIAAYEFAGNADALDLAEGIGHWTYARLSKC-TKTQLQKMWDIYIGGEYG 319

Query: 288 GMNDVLYRLYSITHDPKH---LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
           GMND L  LY+++ D      L  +  FD    +       D L++ HAN HIP  +G  
Sbjct: 320 GMNDSLVDLYNVSKDKDRSEFLKASAFFDTDKLIDNCGAGVDILNNLHANQHIPQFVGYA 379

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNA-------SHSYATGGTSAREFWWDPKRLADTLGSEN 397
               +    +       ++  V            YA GGT   E W     +A  +G  N
Sbjct: 380 KDAAMGDADIDADARARYLKAVEGYWGMIVPGRMYAHGGTGEGEMWGPAHTVAGDIGKRN 439

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ-RGTEPGVMI-----YMLP 451
            E+C  YNMLKV+R+LF   ++ AY DYYER + N +L  + R  + G  +     YM P
Sbjct: 440 AESCAAYNMLKVARYLFFIEQKPAYMDYYERTILNHILGGKSRDLDSGTALTPGNCYMYP 499

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           +     K       GT      CC GT +ES SK  DSIYF    N   LY+  + +S+ 
Sbjct: 500 VNPATQKEYGDGNIGT------CCGGTALESHSKYQDSIYFHSTDNKE-LYVNLFTASTL 552

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           DW    + L Q+ +     +    +++T + K  V    +  +R+P W  S GA+  +NG
Sbjct: 553 DWTDTGLKLAQETN--YPEEETSTISITAAPKSAV----TFRIRIPAW--SKGAKIEVNG 604

Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
           + +     G + +    W   DK+ + +PL LRTE+  DDR +   IQ + +GP +L
Sbjct: 605 KAIDGVTAGEYATVAGSWKVGDKIVVTIPLQLRTEST-DDRKD---IQTLFYGPTVL 657


>gi|399030291|ref|ZP_10730797.1| hypothetical protein PMI10_02670 [Flavobacterium sp. CF136]
 gi|398071797|gb|EJL63044.1| hypothetical protein PMI10_02670 [Flavobacterium sp. CF136]
          Length = 771

 Score =  238 bits (606), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 173/556 (31%), Positives = 262/556 (47%), Gaps = 53/556 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           ++E  L ++ L  S     AQ  +L+YLL L+ D L+  +  +A +PT    YG WEN  
Sbjct: 34  MQEFKLQEIKL-TSGPFKNAQNVDLKYLLDLNPDRLLAPYLISAGIPTKADRYGNWENI- 91

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
             L GH  GHYL+A + M+AST N  IK ++  ++  L+ CQ K GTGY+   P      
Sbjct: 92  -GLDGHIGGHYLAALSMMYASTGNKEIKSRLDYMISELALCQEKDGTGYVGGIPEGKVFW 150

Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
                 ++  S   L   W P Y IHK+ AGL+D Y    N +A    +K+  W +E   
Sbjct: 151 DRIHKGDIDGSGFGLNNTWVPIYNIHKLFAGLIDAYNYTGNEKAKEIVIKLGDWFIE--- 207

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
                +I   S E+    L  E GG+N+    LYSIT + K+L  A    +   L  L  
Sbjct: 208 -----LIRPLSDEQIQKILKTEHGGINESFADLYSITKNKKYLETAEKLSQKAILDPLIK 262

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
           + D L+  HANT IP VIG +   +++ +  +     FF   V    + A GG S  E +
Sbjct: 263 KEDKLTGLHANTQIPKVIGFEKIGKLSDNKQWSDAAQFFWMNVTEKRTVAFGGNSVAEHF 322

Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
                 +  L S +  ETC +YNM ++S+ LF     ++Y D+YER L N +LS Q    
Sbjct: 323 NPINDFSGMLKSNQGPETCNSYNMERLSKALFLDKNNVSYLDFYERTLYNHILSSQEPNR 382

Query: 443 PGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
            G  +Y  P+       R  H   +     S WCC GTG+E+ SK G+ IY   E ++  
Sbjct: 383 GG-FVYFTPI-------RPNHYRVYSQPETSMWCCVGTGLENHSKYGELIYSHSERDI-- 432

Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
            ++  +I S+ +WK   + L Q       ++    + L   + +       LN+R P W 
Sbjct: 433 -FVNLFIPSTLNWKEKGIELEQTTK--FPYENNTEIVLKLKNPKSF----VLNIRYPKW- 484

Query: 561 YSNGAQASLNGQ-NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
            +   +  +NG+       P N++S   +W   DK+TI    S   E +    P+ ++  
Sbjct: 485 -ATNFEILVNGKLQKAEAKPTNYVSMARKWKSGDKITIAFKTSTHLEKL----PDGSNWA 539

Query: 620 AILFGPYLLAGHTSGE 635
           A + GP +LA  TS E
Sbjct: 540 AFVNGPIVLAAKTSTE 555


>gi|333378944|ref|ZP_08470671.1| hypothetical protein HMPREF9456_02266 [Dysgonomonas mossii DSM
           22836]
 gi|332885756|gb|EGK06002.1| hypothetical protein HMPREF9456_02266 [Dysgonomonas mossii DSM
           22836]
          Length = 787

 Score =  237 bits (605), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 171/553 (30%), Positives = 265/553 (47%), Gaps = 51/553 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           LK+++L D      S   RAQ  + +YLL LD D L+  F + A L    ++Y  WEN  
Sbjct: 31  LKDITLLD------SPFKRAQDLDKKYLLDLDADRLLAPFIREAGLQKKAESYTNWEN-- 82

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
           + L GH  GHY+SA A M+AST +  IK+++  ++  L  CQ++ G GY+   P    ++
Sbjct: 83  TGLDGHIGGHYVSALALMYASTGDQQIKDRLDYMISELKRCQDENGNGYIGGVPGGKAIW 142

Query: 217 DSFE---------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
           D             L   W P Y IHK  AGL D Y++A N  A    +KM  W V    
Sbjct: 143 DEIAKGDIQASGFGLNNRWVPLYNIHKTYAGLRDAYLIAGNETAKDMLIKMTDWAV---- 198

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
               K+++  S E+    L  E GG+N+    +  IT + K+L LAH F     L  L  
Sbjct: 199 ----KLVSNLSEEQIQDMLRSEHGGLNETFADVAVITQNEKYLKLAHQFSHQLILNPLLA 254

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
             D L+  HANT IP V+G +   ++ G+  +     FF + V    S   GG S RE +
Sbjct: 255 HEDKLTGLHANTQIPKVLGFKRIADIEGNESWSEASRFFWETVVEHRSVCIGGNSVREHF 314

Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
                 +  + S E  ETC TYNML++S+  ++ + +  Y DYYE+AL N +LS Q   +
Sbjct: 315 HPTNDFSSMITSNEGPETCNTYNMLRLSKMFYQTSLDKKYIDYYEKALYNHILSSQ-NPQ 373

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
            G ++Y   +  G  +      +     S WCC G+GIES +K G+ IY         LY
Sbjct: 374 TGGLVYFTQMRPGHYRV-----YSQPQTSMWCCVGSGIESHAKYGEMIYAHTSD---ALY 425

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
           +  +I S  +WK  +V + Q  D     +    +T+    K E     ++ +R P W   
Sbjct: 426 VNLFIPSLLNWKDRNVEIVQ--DNKFPDESKTEITVNPKKKSEF----TVYVRYPSWVEK 479

Query: 563 NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
              +  LNG+  P      ++     W   D+++++LP+++  E +    P+ ++  +  
Sbjct: 480 GTMKIKLNGKTYPGVEKDGYIGIKRTWQKGDRISVELPMTIVAEQL----PDKSNYYSFR 535

Query: 623 FGPYLLAGHTSGE 635
           +GP +LA  T  E
Sbjct: 536 YGPIVLAAKTGVE 548


>gi|226325822|ref|ZP_03801340.1| hypothetical protein COPCOM_03635 [Coprococcus comes ATCC 27758]
 gi|225205946|gb|EEG88300.1| hypothetical protein COPCOM_03635 [Coprococcus comes ATCC 27758]
          Length = 761

 Score =  237 bits (604), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 173/592 (29%), Positives = 282/592 (47%), Gaps = 49/592 (8%)

Query: 61  WSSLIPSK----------ILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLD 110
           W++  P+K          ++   + E S  + Y+K   P     P   +    L  V L 
Sbjct: 147 WNTYEPAKEEKKVVAVAGVIDGTEKEASAEIHYKKEIVP--VKGPKKKVGYFPLGQVRLK 204

Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPISELRGHFVGHY 169
           + ++ ++ Q+   EYLL +D D ++++FRK   L T G     GW+    +L+GH  GHY
Sbjct: 205 EGTLYYKYQKLMEEYLLGIDDDQMLYNFRKATGLDTKGAPPMTGWDEESCKLKGHTTGHY 264

Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQN------KIGTGYLSAFPTELFDSFEALK 223
           LS  A  +A+T N    +K++ +V  L +CQ+      K   G+LSA+  E FD  E   
Sbjct: 265 LSGIALAFAATGNLKFLDKVNYMVAELKKCQDAFAATGKYHRGFLSAYSEEQFDLLEVYT 324

Query: 224 P---VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW- 279
               +WAPYYT+ KI++GL D +VLA N  A ++   M ++ Y+R+ + +   ++++ W 
Sbjct: 325 KYPEIWAPYYTLDKIMSGLYDCHVLAGNETAKEILDLMGDWVYDRLSR-LPKETLDKMWA 383

Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
             +  E GGM   + ++Y +T    HL  A LF+       +  + D L   HAN HIP 
Sbjct: 384 MYIAGEFGGMLGTMVKVYELTGKENHLKAAKLFENEKLFYPMEEECDTLEDMHANQHIPQ 443

Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE 399
           +IG+   Y  TGD +Y  IG  F +IV   H+Y  GG    E +         L  +  E
Sbjct: 444 IIGAMDLYRATGDEIYWEIGKNFWNIVTGGHTYCIGGVGETEMFHRANTTCSYLTDKAAE 503

Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
           +C +YNML+++  LF +T+     DYY+  L N +L+       G   Y LPLG G  K 
Sbjct: 504 SCASYNMLRLTSQLFEYTRSGNLMDYYDNTLRNHILTSSSHKCDGGTTYFLPLGPGGRKE 563

Query: 460 RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
                +    NS  CC+GTG+ES  +  ++IY ++E     LYI   + S    ++G  +
Sbjct: 564 -----FFLSENS--CCHGTGMESRFRYMENIYAQDE---DALYINLLVDSVLTDENGKTM 613

Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPP 578
           +      + S D    M +     Q+      L + +P W   +    S+NG+ L     
Sbjct: 614 IE-----LQSVDEEGVMEIRCQKDQK----KVLKIHIPAWGQKD-FNVSVNGKVLANTAL 663

Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
              +L         D + ++LP+  R   + D++ + A +  + +GPY+LA 
Sbjct: 664 HDGYLVIDADPKAGDVIRLELPMEFR---VLDNKSDAAFVN-LAYGPYILAA 711


>gi|404450474|ref|ZP_11015456.1| hypothetical protein A33Q_14151 [Indibacter alkaliphilus LW1]
 gi|403763872|gb|EJZ24792.1| hypothetical protein A33Q_14151 [Indibacter alkaliphilus LW1]
          Length = 782

 Score =  236 bits (603), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 171/558 (30%), Positives = 273/558 (48%), Gaps = 53/558 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L++V L D      S   RAQ+ + +Y+L +DVD L+  + K A L      YG WEN  
Sbjct: 33  LRQVKLKD------SPFKRAQEVDKKYILEMDVDRLLAPYMKEAGLTWSADNYGNWEN-- 84

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
           + L GH  GHYLSA + M+AST +  I +++  ++  L   Q++ G GYLS  P   +++
Sbjct: 85  TGLDGHIGGHYLSALSLMFASTGDPEINKRLDYMLEQLKHAQDQSGDGYLSGVPYGRKIW 144

Query: 217 DSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
           +  ++         L   W P Y IHKI AGL D Y +     A  M   + ++F +   
Sbjct: 145 NELKSGKINAGNFSLNDRWVPLYNIHKIFAGLRDAYWIGGKEIAKPMLVSLSDWFLD--- 201

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            +   ++ ++    L  E GG+N+V   +  +T D K+L LA        L  L  + D 
Sbjct: 202 -LTDGFTEDQFQEMLISEHGGLNEVFADVAVMTGDSKYLSLAKKMSHNAILQPLKEEKDE 260

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L+  HANT IP VIG Q   +V+ D        FF   V    S + GG S RE +    
Sbjct: 261 LNGLHANTQIPKVIGFQRIAQVSKDQNLHQASDFFWKNVVYQRSVSIGGNSVREHFHPTS 320

Query: 388 RLADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
             +  L SE   ETC TYNM+++S  LF+   +  Y DYYERA+ N +LS Q   + G +
Sbjct: 321 DFSSMLSSEQGPETCNTYNMMRLSEMLFQLAPDRKYIDYYERAVFNHILSTQHPKKGGFV 380

Query: 447 IYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
            +        +  R  H   +     +FWCC G+G+E+ +K G +IY   + +   LY+ 
Sbjct: 381 YF--------TSMRPQHYRVYSQPHENFWCCVGSGLENHAKYGQAIYAYRKDD---LYLN 429

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT-LTFSSKQEVGQLS-SLNLRMPVWTYS 562
            +I+S  DW+   + L Q  D      PY   + +TFS K   G+ S +L +R P W   
Sbjct: 430 LFIASELDWEEKGIKLIQNTDF-----PYKDESEITFSHK---GKKSFNLKIRYPNWVKE 481

Query: 563 NGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
              + ++NG+ + +    + +++    W+  DK+ ++LP+  + E +    P+ ++  + 
Sbjct: 482 GMLEVTINGEQVEVSVDRHGYITLNREWTSKDKINLKLPMETKAERL----PDGSNWVSF 537

Query: 622 LFGPYLLAGHTSGEWDIK 639
             GP +L   T  + D+K
Sbjct: 538 SHGPIVLGAKTGAD-DLK 554


>gi|224537183|ref|ZP_03677722.1| hypothetical protein BACCELL_02060 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521238|gb|EEF90343.1| hypothetical protein BACCELL_02060 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 790

 Score =  236 bits (603), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 184/606 (30%), Positives = 280/606 (46%), Gaps = 65/606 (10%)

Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
           L  A+  N+E LL  D D L+  +RK A L    K Y  W+     L GH  GHYL+A A
Sbjct: 40  LKHARDLNIETLLKYDCDRLMAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMA 95

Query: 175 QMWASTHNATIKEKMSTVVFSLSECQN-------KIGTGYLSAFPTELF-------DSFE 220
            + A+T N   +++M  ++  ++EC         + G GY+   P             F 
Sbjct: 96  -INAATGNEECRKRMEYIISEIAECAEANSKNHPQWGIGYMGGMPNSQNIWNGFKDGDFR 154

Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVE 276
                WAP+Y +HK+ AGL D ++   N QA    L+   W +         + +  S E
Sbjct: 155 VYSGSWAPFYNLHKMYAGLRDAWLYCGNEQAKSLFLQFCNWAIH--------ITSGLSDE 206

Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
           +    L  E GGMN+VL   Y+ITH+ K+L  A  F        ++ + D L + HANT 
Sbjct: 207 QMERMLGNEHGGMNEVLADAYAITHEQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQ 266

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS- 395
           +P VIG +   E++G+  Y +  +FF DIV    S A GG S RE +       D +   
Sbjct: 267 VPKVIGFERISELSGNEDYHVASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDI 326

Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
           +  E+C T NMLK++  L R   E  YADYYE A  N +LS Q   E G  +Y  P    
Sbjct: 327 DGPESCNTNNMLKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTP---- 381

Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
            ++ R    +     + WCC GTG+E+  K G  IY    G+   L++  Y +S  DWK 
Sbjct: 382 -ARPRHYRNYSAPNEAMWCCVGTGMENHGKYGQFIY-THAGDA--LFVNLYAASQLDWKE 437

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
             + L Q+     +  PY   + T +  +  G   +L +R P W +    + S+NG+ + 
Sbjct: 438 RGITLRQE-----TAFPYSENS-TITIAEGKGTF-NLMVRYPGWVHPGEFKVSVNGKPVD 490

Query: 576 -LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
            +  P +++S   +W   D + I  P+      + ++ P+Y    A++ GP LL      
Sbjct: 491 IITGPSSYVSINRKWKKGDVVNINFPMHSSLRYLPNE-PQYV---ALMHGPILLG----- 541

Query: 635 EWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDA 694
              +KTGT  S+++LI+     F        Q    +  +++N   SI  +  PVSG   
Sbjct: 542 ---MKTGT-ESMASLIAD-DSRFGQYAGGPKQPIDKAPILINNDITSIPSQLTPVSG--K 594

Query: 695 ALHATF 700
            LH T 
Sbjct: 595 PLHFTL 600


>gi|302873208|ref|YP_003841841.1| hypothetical protein Clocel_0296 [Clostridium cellulovorans 743B]
 gi|307688627|ref|ZP_07631073.1| hypothetical protein Ccel74_10733 [Clostridium cellulovorans 743B]
 gi|302576065|gb|ADL50077.1| protein of unknown function DUF1680 [Clostridium cellulovorans
           743B]
          Length = 607

 Score =  236 bits (602), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 165/558 (29%), Positives = 265/558 (47%), Gaps = 50/558 (8%)

Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPG----------KAYGGWENPISELRGHFVGHYLS 171
           N  YL+ +    L+ +F   A +  PG          + + GW+ P  +LRGHF+GH+LS
Sbjct: 24  NRNYLINVKNQGLLQNFYLEAGIILPGLQVLHNPDTDEIHWGWDAPTCQLRGHFLGHWLS 83

Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYT 231
           A+A ++ S  +  +K K+  ++  L +CQ   G  ++   P + F   E    VW+P Y 
Sbjct: 84  AAASIFVSEQDHELKAKLDKIIDELIKCQELNGGEWIGPIPEKYFQKLENSHHVWSPQYV 143

Query: 232 IHKILAGLLDQYVLADNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
           +HK+L GL++ Y+  ++ +AL    K++ W +++  + + K        R  Y    E  
Sbjct: 144 MHKVLMGLMNSYIDTNSDKALAILDKLSNWYIKWTDDMLIK------NPRAIYG--GEEA 195

Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
           GM +V   +Y IT + K+L LA  +  P     L    D L++ HAN  IP   G+   Y
Sbjct: 196 GMLEVWITMYEITAEEKYLELAKKYSNPRIFRDLEAGRDTLTNCHANASIPWSHGAAKLY 255

Query: 348 EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           EVTGD  + K+   F+ + V     Y +GG  A E+W  P +L   L   N+E CT YNM
Sbjct: 256 EVTGDEKWRKITEAFWKNAVTDRGYYCSGGQGAGEYWTPPFKLGLFLSDSNQEFCTVYNM 315

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
           ++ + +L++WT + ++ADY E  L NG L+ Q+    G+  Y LPLG G  K      WG
Sbjct: 316 IRTASYLYKWTGDTSFADYIELNLYNGFLA-QQNKYTGMPTYFLPLGAGSKKK-----WG 369

Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--VVLNQKV 524
           T+   FWCC+GT +++ +     IYFE++     L + QYI S   W   +  + + Q+V
Sbjct: 370 TETRDFWCCHGTMVQAQTLYNSLIYFEDKER---LVVSQYIPSELKWNYNNTDITIQQRV 426

Query: 525 DPIVSWDPYL----------RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
           +     D             R +L F    E  +  +L+ R+P W     +    N +  
Sbjct: 427 NMKYYNDLAFFDERDESQMSRWSLKFQVAAEKNESFTLSFRVPKWVKELPSVTINNEKID 486

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
            L     +++    WS  D++ I  P  L    +    P+     A + GP +LAG    
Sbjct: 487 DLTVDEGYINIKREWS-QDEVLIYFPCRLEISPL----PDMPDTFAFMEGPIVLAGICDE 541

Query: 635 EWDIKTGTARSLSALISP 652
           E  +  G A   S ++ P
Sbjct: 542 ERRL-YGDADKPSEILMP 558


>gi|86142285|ref|ZP_01060795.1| hypothetical protein MED217_11584 [Leeuwenhoekiella blandensis
           MED217]
 gi|85831037|gb|EAQ49494.1| hypothetical protein MED217_11584 [Leeuwenhoekiella blandensis
           MED217]
          Length = 793

 Score =  236 bits (601), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 168/536 (31%), Positives = 258/536 (48%), Gaps = 41/536 (7%)

Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYL 170
           +S V   A  T+  Y+  LD D L+  F + A L     +Y  WEN  + L GH  GHY+
Sbjct: 37  ESGVFKEAALTDFNYIQALDADRLLAPFLREAGLEPKADSYTNWEN--TGLDGHTAGHYI 94

Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA------- 221
           SA +  +AST +   KE +   +  L   Q   G GY+   P    L+   +A       
Sbjct: 95  SALSMYYASTGDPKAKEMLEYALAELDRVQKSNGNGYIGGVPGSDALWAEIKAGKINAGS 154

Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
             L   W P Y IHK   GL D ++ A+  QA +M   + ++F +    +    S  +  
Sbjct: 155 FSLNDKWVPLYNIHKTFNGLKDAWIHAELPQAKRMLIELTDWFLD----ITADLSEAQIQ 210

Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
             L  E GG+N+V   +Y+IT D K+L LA  F +   L  LA   D L+  HANT IP 
Sbjct: 211 DMLRSEHGGLNEVFAEVYAITSDKKYLKLAEDFSQHALLKPLAANEDILTGMHANTQIPK 270

Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN-E 398
            IG +   ++     Y    + F D V    S + GG S RE +      +  + SE   
Sbjct: 271 FIGFERISQLEEAKDYHDAASNFFDNVTTRRSISIGGNSVREHFNPVDDFSSVVSSEQGP 330

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C TYNMLK+S+ LF  T E  Y D+YER L N +LS Q     G  +Y  P+  G  +
Sbjct: 331 ESCNTYNMLKLSKLLFEDTSEEHYIDFYERGLYNHILSSQNPD--GGFVYFTPIRPGHYR 388

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
                 +     SFWCC G+G+E+ +K  + IY ++E     LY+  +I S  +W+  + 
Sbjct: 389 V-----YSQPETSFWCCVGSGMENHTKYNELIYAKKEDK---LYVNLFIPSEVNWEEKNA 440

Query: 519 VLNQKVDPIVSWDPYLRMT-LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
            L QK +      P   +T L ++S+++    ++L LR P W  +   +  +N +   + 
Sbjct: 441 TLTQKTNF-----PEEALTELIWNSRKKTK--ATLMLRYPQWVNAGELKVYVNDKLEKID 493

Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
             PG+++S   +W   D++ ++LP+ L  E + DD   Y S++   +GP +LA  T
Sbjct: 494 ATPGSYVSLERKWKNGDRIKMELPMHLSLEELPDDSG-YVSVK---YGPIVLAAVT 545


>gi|332685731|ref|YP_004455505.1| hypothetical protein MPTP_0197 [Melissococcus plutonius ATCC 35311]
 gi|332369740|dbj|BAK20696.1| hypothetical protein MPTP_0197 [Melissococcus plutonius ATCC 35311]
          Length = 883

 Score =  236 bits (601), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 178/569 (31%), Positives = 264/569 (46%), Gaps = 78/569 (13%)

Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL-PTPGKAYGGWENPIS-ELRGHFVGH 168
           Q   + +AQ+  + YLL LDV   ++ F K A + P     Y GWE       RGHF GH
Sbjct: 12  QDPYIHKAQENVIHYLLSLDVQKFLFEFYKVAGMKPLTESGYQGWERSDQVNFRGHFFGH 71

Query: 169 YLSASAQMWASTHNATIKEKM----STVVFSLSECQNKIG------TGYLSAFPTELFDS 218
           +LSA A  + +     +K+K+     T +  L   Q           GY+SAF     D 
Sbjct: 72  FLSALALSYQAEKQPILKKKIHQQIKTAITGLKAIQKNYAKQHPEHAGYISAFKEVALDE 131

Query: 219 FEALKPV--------WAPYYTIHKILAGLLDQYVLADNA------QALKMATWMVEYFYN 264
            E  KPV          P+Y +HKILAGLL+  +           +AL +A+W  +Y Y 
Sbjct: 132 VEG-KPVDPKEKENVLVPWYNLHKILAGLLEVNISLKEVDSQLSKEALFIASWFGDYIYK 190

Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
           R+  +     +      L  E GGMND LY L+ +T   +H + A  FD+      LA  
Sbjct: 191 RMMNLTDKNQM------LTIEYGGMNDALYYLFELTQKKEHAIAATYFDEDNLFNQLAND 244

Query: 325 ADYLSHFHANTHIPIVIGSQMRYEV----------TGDPLYKLIGTF-----FMDIVNAS 369
            + L   HANT IP +IG+  RY V          + +    L+  F     F  IV  +
Sbjct: 245 ENVLPGKHANTTIPKLIGALKRYMVFQSEDLSAWLSNEEKEHLMSYFKAAENFWQIVVDN 304

Query: 370 HSYATGGTSAREFWWDPKRL----ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
           H+Y TGG S  E +  P  L        G    ETC T+NMLK++R L+  TK+  Y DY
Sbjct: 305 HTYCTGGNSQSEHFHGPNELFYDSEIRQGDCTCETCNTHNMLKLTRKLYECTKDPKYLDY 364

Query: 426 YERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSK 485
           YE    N +L+ Q  ++ G+M+Y  P+G G +K      +   ++ FWCC GTGIESFSK
Sbjct: 365 YETTYINAILASQ-NSKTGMMMYFQPMGAGYNKV-----YNRPYDEFWCCSGTGIESFSK 418

Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
           L D+ YF+E      L++  Y S++   K  ++ + QK D     +  + + L   + + 
Sbjct: 419 LADTYYFKENNR---LFVNLYFSNTLKLKENNLKIIQKTD---RKNGNVTIDLKTLTDKN 472

Query: 546 VGQLSSLNLRMPVW----TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
           + Q   L LR+P W    T   G +      +L        ++A      ND++ +++  
Sbjct: 473 IIQPLQLALRLPNWAKQVTIKKGKKLLNYKSHLGFAYLSGLVTA------NDQIILEMEQ 526

Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAG 630
            L+      D P+  +  A  +GPY+LAG
Sbjct: 527 ELQLL----DTPDNTNYIAFKYGPYILAG 551


>gi|328956144|ref|YP_004373477.1| hypothetical protein Corgl_1563 [Coriobacterium glomerans PW2]
 gi|328456468|gb|AEB07662.1| protein of unknown function DUF1680 [Coriobacterium glomerans PW2]
          Length = 751

 Score =  235 bits (600), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 184/545 (33%), Positives = 268/545 (49%), Gaps = 44/545 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           ++ ++L  V L   +    AQQ  L +L  +D D ++ +FR+ A + T G     GW+ P
Sbjct: 182 MRPINLTCVRLAPGTPAAAAQQRRLSFLKQVDDDQMLINFRRAAHMDTKGAPEMIGWDTP 241

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK------IGTGYLSAF 211
            S LRGH  GHYLSA A  WA+T + T+  K+S +V SL E Q        I  G+LSA+
Sbjct: 242 DSNLRGHTTGHYLSALALAWAATGDETVHSKLSYMVHSLGEVQAAFRGQPGIHEGFLSAY 301

Query: 212 PTELFDSFEALKP---VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
               FD  E   P   +WAPYYT+HKILAGLLD Y  A N QAL++A  +  + YNR+ +
Sbjct: 302 DESQFDLLERYTPYPEIWAPYYTLHKILAGLLDSYRYAGNRQALEIAIGVGHWVYNRLSQ 361

Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ-AD 326
           +  +  +++ W   +  E GGMN+ L  L +IT +   +  A  FD    + F ALQ  D
Sbjct: 362 LDPI-QLKKMWAMYIAGEFGGMNESLAMLGAITGEESFVKAARFFDNDKLI-FPALQKVD 419

Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
            L   HAN HIP VIG+   Y VT +  Y  +  FF   V A H YA GGT   E +  P
Sbjct: 420 ALGTLHANQHIPQVIGALSLYGVTHEESYYQVAEFFWHSVVAHHIYAFGGTGDGEMFQQP 479

Query: 387 KRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
             +A  +   + E+C +YNM+K++R L+ +        Y E  L N +LS       G  
Sbjct: 480 CEIAAKIDEFSAESCASYNMIKLTRDLYEYEPTADKMAYCENVLINHILSSTDHEGTGGS 539

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
            Y +    G  K   T       NS  CC+GTG+ES    G SIY++ EG    L +  Y
Sbjct: 540 TYFMETQPGARKGFDTE------NS--CCHGTGLESQFMYGQSIYYQGEGQ---LIVALY 588

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVWTYSNGA 565
           ++S        V     +D   +    +R+         +G+L   L LR P W  S+  
Sbjct: 589 LASHLKTDDTDVT----IDCDFNHPETVRIA--------IGRLEGKLVLRHPDW--SDRM 634

Query: 566 QASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
             S+NG    +     +++  +  +  D++T++L   LR     DD     +  AI +GP
Sbjct: 635 TVSINGAAARIAEKDGYVTVEDSLAPGDEITVRLNPELRLIPTPDD----PNRVAIGYGP 690

Query: 626 YLLAG 630
           ++LA 
Sbjct: 691 FVLAA 695


>gi|120435050|ref|YP_860736.1| hypothetical protein GFO_0692 [Gramella forsetii KT0803]
 gi|117577200|emb|CAL65669.1| conserved hypothetical protein, membrane or secreted [Gramella
           forsetii KT0803]
          Length = 796

 Score =  235 bits (600), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 184/587 (31%), Positives = 278/587 (47%), Gaps = 64/587 (10%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A   +LEY+L LD D L+  F K A L T  ++Y  WEN  + L GH  GHYL+A + M+
Sbjct: 52  AMLVDLEYILKLDPDRLLAPFLKEAGLETKVESYPNWEN--TGLDGHIGGHYLTALSLMY 109

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE---------ALKPVW 226
           A+T N  + E+++ ++  L + Q +   GY+   P   EL+             +L   W
Sbjct: 110 AATGNQEVLERLNYMLDELQKVQ-QANVGYIGGVPDSKELWQQISEGNINAGSFSLNDRW 168

Query: 227 APYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
            P Y IHK  AGL D Y +A   +A    + ++ WM+E        V +  S E+    L
Sbjct: 169 VPLYNIHKTYAGLRDAYQIAGIERAKTMLIDLSDWMLE--------VTSDLSEEQIQELL 220

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
             E GG+N+    +Y IT + K+L LA+ F +   L  L    D L+  HANT IP VIG
Sbjct: 221 ISEYGGLNETFADVYEITGEKKYLDLAYAFSQKELLKPLEDDQDVLTGMHANTQIPKVIG 280

Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS--ENEET 400
            Q    +  +  Y+   +FF D V    S A GG S RE  + PK    T+ S  +  ET
Sbjct: 281 FQTIAALNDNREYRDAASFFWDNVVNERSVAIGGNSVREH-FHPKDDFSTMMSSVQGPET 339

Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
           C TYNMLK+S  LF       Y DYYE+AL N +LS Q   E G  +Y  P+  G  +  
Sbjct: 340 CNTYNMLKLSEKLFLTEANEKYVDYYEQALYNHILSSQH-PEKGGFVYFTPMRPGHYRVY 398

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
           S         SFWCC G+G+E+  K  + IY   E     LY+  +I S  +W+   + L
Sbjct: 399 SQPE-----TSFWCCVGSGLENHGKYNEFIYAHTENE---LYVNLFIPSILNWEEKGLKL 450

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPP 579
            QK +     +   ++++     +E     +L LR P W  + G    +N + + L   P
Sbjct: 451 TQKTE--FPNEETSKISINLKEVEEF----TLMLRYPTW--AKGFNILVNQEKVELNNEP 502

Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW--- 636
           G+++S    W+  D++ +Q+P+++ +  + D    +    A+ +GP +L   T  E+   
Sbjct: 503 GSYVSIKREWTDGDEIELQIPMNISSVGLPDGSNNF----ALKYGPLVLGAKTGNEYMEG 558

Query: 637 ---------DIKTGTARSLSALISPIPPSFNAQLVTF-TQESGNSTF 673
                     I  G    LS     +  + NA LV + ++E G   F
Sbjct: 559 LFADASRGGHIAAGKKIPLSETPIFLADTKNADLVNYISKEEGELKF 605


>gi|380694971|ref|ZP_09859830.1| hypothetical protein BfaeM_13572 [Bacteroides faecis MAJ27]
          Length = 802

 Score =  235 bits (599), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 179/568 (31%), Positives = 266/568 (46%), Gaps = 65/568 (11%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
           SL DV L  SS   +AQQT+L Y+L LD D L   F + A L     +Y  WEN  + L 
Sbjct: 29  SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
           GH  GHYLSA + M+A+T +  I  +++ ++  L   Q  +GTG++   P   +L+   +
Sbjct: 86  GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145

Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQ 267
           A         L   W P Y IHK  AGL D Y+ A +  A +M      WM++       
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            + +  S  +    L  E GG+N+    +  IT D K+L LA  F     L  L    D 
Sbjct: 199 -ITSGLSDSQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPLIKDEDR 257

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSAR 380
           L+  HANT IP VIG +   EV+ D         +     FF + V    S   GG S R
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKDDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317

Query: 381 EFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI--------AYADYYERALT 431
           E +         L   +  ETC TYNML++++ L++ + ++         Y DYYERAL 
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377

Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY 491
           N +LS Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIY 431

Query: 492 FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
              +     LY+  +I S  +WK   V L Q+   +   D  + + +  +SK+++    +
Sbjct: 432 AHRQDT---LYVNLFIPSQLNWKEQGVTLTQET--LFPDDGKVTLRIDKASKKKL----T 482

Query: 552 LNLRMPVWTYSNGAQA-SLNGQNLPL---PPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
           L +R+P W  S+   A ++NGQ       P    +L    +W   D +T  LP+ +  E 
Sbjct: 483 LMIRIPGWAGSSKDYAITINGQKKKYAIRPGVSTYLPIHRKWKKGDVITFNLPMEVSLEQ 542

Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGE 635
           I D +  Y    A L+GP +LA  T  E
Sbjct: 543 IPDKKDYY----AFLYGPIVLAASTGTE 566


>gi|331702303|ref|YP_004399262.1| hypothetical protein Lbuc_1953 [Lactobacillus buchneri NRRL
           B-30929]
 gi|329129646|gb|AEB74199.1| protein of unknown function DUF1680 [Lactobacillus buchneri NRRL
           B-30929]
          Length = 803

 Score =  235 bits (599), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 196/637 (30%), Positives = 287/637 (45%), Gaps = 96/637 (15%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPIS-ELRGHFVGHYLSASAQ 175
           AQQ  ++YLL LD    + +F + A + + G   Y GWE       RGHF GHYLSA +Q
Sbjct: 20  AQQMTVKYLLALDPKRFLVTFDEVAGIDSGGVTGYQGWERTDGLNFRGHFFGHYLSALSQ 79

Query: 176 MWASTHNATIKE----KMSTVVFSLSECQNKIG------TGYLSAFPTELFDSFEALK-- 223
              +T    I++    K+   V  L   Q           GY+SAF     D  E  +  
Sbjct: 80  AILATEENDIRQQLLDKLRLGVNGLQSAQAAYAKSHPDSAGYVSAFREVALDEVEGREVP 139

Query: 224 -----PVWAPYYTIHKILAGLLDQYVLAD------NAQALKMATWMVEYFYNRVQKVITM 272
                 V  P+Y +HK+LAGLL   V         + +ALK+A     Y + R+ ++   
Sbjct: 140 KDEKENVLVPWYNLHKVLAGLLAVKVNLQGIDPLLSEKALKIAHQFGIYVFKRLNQLADP 199

Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
             +      L  E GGMND LY L+ +T D + L  A  FD+      LA   D L+  H
Sbjct: 200 TQM------LKIEYGGMNDALYELFDLTDDKRMLTAATYFDETALFKQLAEGDDVLAGKH 253

Query: 333 ANTHIPIVIGSQMRYEVTGD----------------PLYKLIGTFFMDIVNASHSYATGG 376
           ANT IP +IG+  RYE   D                 +Y      F  IV   H+Y TGG
Sbjct: 254 ANTTIPKLIGALHRYESLHDVKRADQYLSPEEKGSLNMYLKAAVNFWQIVVDDHTYVTGG 313

Query: 377 TSAREFWWDPKRL-ADTL---GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
            S  E + +P +L  D +   G+   ETC TYNMLK+SR LFR T +  Y DYYE+  TN
Sbjct: 314 NSQSEHFHEPGQLFHDAVLEDGATTCETCNTYNMLKLSRELFRVTGDKKYLDYYEQTYTN 373

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L  Q     G+M Y  P+  G +K      +   F+ FWCC GTGIE+F+KLGDS  F
Sbjct: 374 AILGSQ-NPNTGMMTYFQPMAAGYTKV-----YNRPFDEFWCCTGTGIENFTKLGDSYDF 427

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
                   LY+  Y S+     S ++ + ++VD        + +T+     Q+     +L
Sbjct: 428 MSGDQ---LYLSLYFSNVLRLDSNNLQMTEQVDRKTG---KVHLTVAKLRSQDSAGAINL 481

Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK-----LTIQLPLSLRTEA 607
            LR P W   + A+ +++G +  +    +F      W  ++      + +++P+SL+   
Sbjct: 482 KLRNPAWLVQS-AKLAVDGISQQVDQNADF------WEIDNAGPGTTVDLEIPMSLKMVQ 534

Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEW---DIKTGTARSLSALISPIP---------- 654
            +D+ P Y + +   +GPY+LAG         D   G    +S     +P          
Sbjct: 535 TKDN-PHYVAFK---YGPYVLAGQLGKHHINDDRPNGVLVRISTHDQAVPSTLTTGMDWH 590

Query: 655 ---PSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP 688
               S N+Q V  T E+ N+ F +   N S T+   P
Sbjct: 591 DWQQSLNSQAVVDT-ETTNTLFELKLPNTSETITFVP 626


>gi|326798346|ref|YP_004316165.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326549110|gb|ADZ77495.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
          Length = 1022

 Score =  234 bits (597), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 191/645 (29%), Positives = 293/645 (45%), Gaps = 104/645 (16%)

Query: 93  DLPGNFLKEVSLHDVWLD-----QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKT--ASLP 145
           D+P + L   +L  V L+       +     +   +  L   D +S ++ FR       P
Sbjct: 368 DIPSSKLAPFNLDQVSLEADAHGHKTKFIENRDKFINTLAATDPNSFLYMFRHAFGQKQP 427

Query: 146 TPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVV---FSLS 197
              +  G W++  ++LRGH  GHYL+A AQ +A T       A   EKM  +V   + LS
Sbjct: 428 EGARPLGVWDSQETKLRGHATGHYLTAIAQAYAGTGYDKALQAKFAEKMEYMVNTLYELS 487

Query: 198 ECQNKI---------------------------------------GTGYLSAFPTELFDS 218
           +   K                                        G G++SA+P + F  
Sbjct: 488 QLSGKPKEAGGIHVSDPTAVPYGPGKTEYDSDFSDEGIRTDYWNWGEGFISAYPPDQFIM 547

Query: 219 FE-------ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
            E           VWAPYYT+HKILAGL+D Y ++ N +AL++AT M ++ Y R+ K+ T
Sbjct: 548 LERGAKYGGQKNQVWAPYYTLHKILAGLMDVYEVSGNKKALEIATGMGDWVYARLSKLPT 607

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQ 324
              ++     +  E GGMN+V+ RLY IT+ P +L  A LFD    F G       LA  
Sbjct: 608 ETLIKMWNTYIAGEFGGMNEVMARLYRITNKPNYLKTAQLFDNIKMFYGDASHSHGLAKN 667

Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS------ 378
            D     HAN HIP ++GS   Y V+ +P+Y  I   F   V   + Y+ GG +      
Sbjct: 668 VDTFRGLHANQHIPQIVGSIEMYRVSNNPVYYSIADNFWYKVVNDYMYSIGGVAGARNPA 727

Query: 379 -AREFWWDPKRLAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
            A  F   P  L +   + G +N ETC TYNMLK++  LF + +     DYYER L N +
Sbjct: 728 NAECFISQPATLYENGFSAGGQN-ETCATYNMLKLTSDLFLFDQRPELMDYYERGLYNHI 786

Query: 435 LSIQRGTEPGVMIYMLPLGRG-VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
           L+      P    Y +PL  G + +  + H  G     F CC GT IES +KL +SIYF+
Sbjct: 787 LASVAEDSP-ANTYHVPLRPGSIKQFGNPHMTG-----FTCCNGTAIESSTKLQNSIYFK 840

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
            + N   LY+  +I S+ +W    + + Q  D     + + R+T+    K +      ++
Sbjct: 841 SKDN-DALYVNLFIPSTLEWAERKITVQQTTD--FPNEDHTRLTIKGGGKFD------MH 891

Query: 554 LRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
           +R+P W  + G    +NG++  L   PG++L  +  W   D + +Q+P     + + D +
Sbjct: 892 VRVPGWA-TKGFFVRVNGKDQKLEAKPGSYLKISRNWKDGDVVDLQMPFQFHLDPVMDQQ 950

Query: 613 PEYASIQAILFGPYLLAGH---TSGEWDIKTGTARSLSALISPIP 654
               +I ++ +GP LLA        +W   +  A  +S  I   P
Sbjct: 951 ----NIASLFYGPILLAAQEPEARKDWRTVSLDAEDISKSIKGDP 991


>gi|423223047|ref|ZP_17209516.1| hypothetical protein HMPREF1062_01702 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392640316|gb|EIY34118.1| hypothetical protein HMPREF1062_01702 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 790

 Score =  234 bits (596), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 183/606 (30%), Positives = 278/606 (45%), Gaps = 65/606 (10%)

Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
           L  A+  N+E LL  D D L+  +RK A L    K Y  W+     L GH  GHYL+A A
Sbjct: 40  LKHARDLNIETLLKYDCDRLMAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMA 95

Query: 175 QMWASTHNATIKEKMSTVVFSLSECQN-------KIGTGYLSAFPTELF-------DSFE 220
            + A+T N   +++M  ++  ++EC         + G GY+   P             F 
Sbjct: 96  -INAATGNEECRKRMEYIISEIAECAEANCKNHPQWGVGYMGGMPNSQNIWNGFKDGDFR 154

Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVE 276
                WAP+Y +HK+ AGL D ++   N QA    L+   W +         + +  S E
Sbjct: 155 VYSGSWAPFYNLHKMYAGLRDAWLYCGNEQAKSLFLQFCNWAIH--------ITSGLSDE 206

Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
           +    L  E GGMN+VL   Y+ITH+ K+L  A  F        ++ + D L + HANT 
Sbjct: 207 QMERMLGNEHGGMNEVLADAYAITHEQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQ 266

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS- 395
           +P VIG +   E++G+  Y +  +FF DIV    S A GG S RE +       D +   
Sbjct: 267 VPKVIGFERISELSGNEDYHVASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDI 326

Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
           +  E+C T NMLK++  L R   E  YADYYE A  N +LS Q   E G  +Y  P    
Sbjct: 327 DGPESCNTNNMLKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTP---- 381

Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
            ++ R    +     + WCC GTG+E+  K G  IY    G+   L++  Y +S  DWK 
Sbjct: 382 -ARPRHYRNYSAPNEAMWCCVGTGMENHGKYGQFIY-THAGDA--LFVNLYAASQLDWKE 437

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
             + L Q+     +  PY   + T +  +  G   +L +R P W +    + S+NG+   
Sbjct: 438 RGITLRQE-----TAFPYSENS-TITIAEGKGTF-NLMVRYPGWVHPGEFKVSVNGKPAD 490

Query: 576 -LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
            +  P +++S   +W   D + I  P+      + ++ P+Y    A++ GP LL      
Sbjct: 491 IITGPSSYVSINRKWKKGDVVNINFPMHSSLRYLPNE-PQYV---ALMHGPILLG----- 541

Query: 635 EWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDA 694
              +KTGT  S+++LI+     F        Q    +  +++N   SI  +  PV G   
Sbjct: 542 ---MKTGT-ESMASLIAD-DSRFGQYAGGPKQPIDKAPILINNDIASIPSQLTPVPGK-- 594

Query: 695 ALHATF 700
            LH T 
Sbjct: 595 PLHFTL 600


>gi|195643412|gb|ACG41174.1| hypothetical protein [Zea mays]
 gi|413926261|gb|AFW66193.1| hypothetical protein ZEAMMB73_983510 [Zea mays]
          Length = 262

 Score =  233 bits (594), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 132/248 (53%), Positives = 161/248 (64%), Gaps = 17/248 (6%)

Query: 4   GFVLFFFFCFGL--ALGKQCTNQSP-YDSHAFRYELT---STNKTWKEEVLSHF------ 51
           G V+      G   A GK CTN  P   SH  R           T  + ++ H       
Sbjct: 14  GIVVVMLLAAGFRGAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQ 73

Query: 52  HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPG----NFLKEVSLHDV 107
           HLTPTD+S W SL+P + L  +++   W +LYR+++  GG   PG     FL E SLHDV
Sbjct: 74  HLTPTDESTWMSLMPRRAL-RREEAFDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDV 132

Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG 167
            L+  S+ WRAQQTNLEYLL+LDVD LVWSFRK A L  PG  YGGWE P  +LRGHFVG
Sbjct: 133 RLEPGSMYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVG 192

Query: 168 HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWA 227
           HYLSA+A+MWASTHN T+  KMS+VV +L +CQ K+GTGYLSAFP++ FD  EA+K VWA
Sbjct: 193 HYLSATAKMWASTHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWA 252

Query: 228 PYYTIHKI 235
           PYYTIHK+
Sbjct: 253 PYYTIHKV 260


>gi|440700043|ref|ZP_20882328.1| Tat pathway signal sequence domain protein [Streptomyces
           turgidiscabies Car8]
 gi|440277439|gb|ELP65547.1| Tat pathway signal sequence domain protein [Streptomyces
           turgidiscabies Car8]
          Length = 934

 Score =  233 bits (594), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 158/462 (34%), Positives = 233/462 (50%), Gaps = 31/462 (6%)

Query: 206 GYLSAFPTELFDSFEALK-----PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F + E++       VWAPYYT HKIL GLLD Y+  D+++AL +A+ M +
Sbjct: 383 GFLAAYPETQFIALESMTSGDYTKVWAPYYTAHKILKGLLDAYLATDDSRALDLASGMCD 442

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + Y+R+ K+    +++R W   +  E GG+ + +  LY+IT+  +HL LA LFD    + 
Sbjct: 443 WMYSRLSKLPDA-TLQRMWGIFSSGEFGGIVETIVDLYTITNKAEHLALAKLFDLDTLID 501

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L+  HAN HIPI  G    Y+ TG+  Y      F  +V     Y  GGTS 
Sbjct: 502 ACAANTDTLNGLHANQHIPIFTGYVRLYDATGEARYLTAAKNFWGMVIPQRMYGIGGTST 561

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW     +A T+   N ETC  YN+LK+SR LF   ++  Y DYYERAL N VL  ++
Sbjct: 562 GEFWKARGVIAGTVSDTNAETCCAYNLLKLSRTLFFHEQDPKYMDYYERALYNQVLGSKQ 621

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  ++ Y + L  G  +   T   GT      CC GTG+ES +K  DS+YF+   
Sbjct: 622 DKADAEKPLVTYFIGLNPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYFKSA- 674

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
           +   LY+  Y  S+  W    V + Q  +    +      TLT           +L LR+
Sbjct: 675 DGGSLYVNLYSPSTLTWAEKGVTVTQTTE----YPKEQGTTLTIGGGSAA---FALRLRV 727

Query: 557 PVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
           P+W  + G Q ++NGQ +   P  G++ + +  W   D + I +P  LR E   DD    
Sbjct: 728 PLWA-TAGFQVTVNGQAVSGTPVAGSYFAVSRTWQSGDVVRISVPFRLRVEKALDD---- 782

Query: 616 ASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSF 657
            S+Q + +GP  L   ++    +  G  R+ SAL   + PS 
Sbjct: 783 PSLQTLFYGPVNLVARSASTSYLSVGLYRN-SALSGDLLPSL 823



 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 61/112 (54%), Gaps = 6/112 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           L+   L DV L Q  V    +Q  L++    DV+ L+  FR  A L T G  A GGWE  
Sbjct: 44  LRPFELKDVALGQG-VFASKRQLMLDHGRGYDVNRLLQVFRANAGLSTGGAVAPGGWEGL 102

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT 205
             E    LRGH+ GH+LS  +Q +AST +    ++++T+V +L++ +  + T
Sbjct: 103 DGEANGNLRGHYTGHFLSMLSQAYASTRDQAYADRIATMVGALTDVRAALRT 154


>gi|189467200|ref|ZP_03015985.1| hypothetical protein BACINT_03584 [Bacteroides intestinalis DSM
           17393]
 gi|189435464|gb|EDV04449.1| beta-lactamase [Bacteroides intestinalis DSM 17393]
          Length = 720

 Score =  233 bits (593), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 143/405 (35%), Positives = 225/405 (55%), Gaps = 26/405 (6%)

Query: 232 IHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMND 291
           +HK+ +GL+ QY+ ADN QAL++ T M  + YN++ K +   + +R    +  E GG+N+
Sbjct: 1   MHKLFSGLIYQYLYADNKQALEVVTRMGNWTYNKL-KPLDESTRKR---MIRNEFGGVNE 56

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTG 351
             Y LY+IT D ++  LA  F     +  L  Q D L   H NT IP V+     YE+T 
Sbjct: 57  SFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDLGTKHTNTFIPKVLTEARNYELTQ 116

Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSR 411
           D   + +  FF   +   H++A G +S +E ++DP++L+  L     ETC TYNMLK+SR
Sbjct: 117 DNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQLSKHLTGYTGETCCTYNMLKLSR 176

Query: 412 HLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNS 471
           HLF WT +   ADYYERAL N +L  Q+  E G++ Y LPL  G  K      + T+ NS
Sbjct: 177 HLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSYFLPLLSGSHKV-----YSTRENS 230

Query: 472 FWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWD 531
           FWCC G+G E+ +K G++IY+    N  G+Y+  +I S  +WK+  + L Q+     ++ 
Sbjct: 231 FWCCVGSGFENHAKYGEAIYYH---NDQGIYVNLFIPSEVNWKAKGITLRQE----TAFP 283

Query: 532 PYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWS 590
                 LT  + + V   +++ LR P W  S   + ++NG+ + +   PG+++  T +W 
Sbjct: 284 AEENTALTIQTDKPV--TTTIYLRYPSW--SKNVKVNVNGKKVSVKQKPGSYIPVTRQWK 339

Query: 591 YNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
             D++    P+SL+ E   D+ P+     A+L+GP +LAG +  E
Sbjct: 340 DGDRIEANYPMSLQLETTPDN-PQKG---ALLYGPLVLAGESGTE 380


>gi|383123086|ref|ZP_09943771.1| hypothetical protein BSIG_0174 [Bacteroides sp. 1_1_6]
 gi|251841821|gb|EES69901.1| hypothetical protein BSIG_0174 [Bacteroides sp. 1_1_6]
          Length = 802

 Score =  232 bits (591), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 178/568 (31%), Positives = 268/568 (47%), Gaps = 65/568 (11%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
           SL DV L  SS   +AQQT+L Y+L LD D L   F + A L     +Y  WEN  + L 
Sbjct: 29  SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
           GH  GHYLSA + M+A+T +  I  +++ ++  L   Q  +GTG++   P   +L+   +
Sbjct: 86  GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145

Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQ 267
           A         L   W P Y IHK  AGL D Y+ A +  A +M      WM++       
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            + +  S  +    L  E GG+N+    +  IT D K+L LA  F     L  L    D 
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFFHKVILDPLIKNEDR 257

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSAR 380
           L+  HANT IP VIG +   EV+ D         +     FF + V    S   GG S R
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKDDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317

Query: 381 EFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI--------AYADYYERALT 431
           E +         L   +  ETC TYNML++++ L++ + ++         Y DYYERAL 
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377

Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY 491
           N +LS Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIY 431

Query: 492 FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
             ++     LY+  +I S  +WK   V L Q+   +   D  + + +  ++K+ +    +
Sbjct: 432 AHQQDT---LYVNLFIPSQLNWKEQGVTLTQET--LFPDDEKVTLRIDKAAKKNL----T 482

Query: 552 LNLRMPVWT-YSNGAQASLNG-QNLPLPPPG--NFLSATERWSYNDKLTIQLPLSLRTEA 607
           L +R+P W   S G + ++NG ++L     G   +L    +W   D +T  LP+ +  E 
Sbjct: 483 LMIRIPEWAGNSKGYEITINGKKHLSDIQTGASTYLPIRRKWKKGDMITFHLPMKVSLEQ 542

Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGE 635
           I D +  Y    A L+GP +LA  T  E
Sbjct: 543 IPDKKDYY----AFLYGPIVLATSTGTE 566


>gi|189464752|ref|ZP_03013537.1| hypothetical protein BACINT_01096 [Bacteroides intestinalis DSM
           17393]
 gi|189437026|gb|EDV06011.1| hypothetical protein BACINT_01096 [Bacteroides intestinalis DSM
           17393]
          Length = 790

 Score =  232 bits (591), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 182/606 (30%), Positives = 278/606 (45%), Gaps = 65/606 (10%)

Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
           L  A+  N+E LL  D D L+  +RK A L    K Y  W+     L GH  GHYL+A A
Sbjct: 40  LKHARDLNIETLLKYDCDRLIAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMA 95

Query: 175 QMWASTHNATIKEKMSTVVFSLSECQN-------KIGTGYLSAFPTELF-------DSFE 220
            + A+T N   +++M  ++  ++EC         K G GY+   P             F 
Sbjct: 96  -INAATGNEECRKRMEYIINEIAECAEANYKNHPKWGVGYMGGMPNSQNIWSGFKNGDFR 154

Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVE 276
                WAP+Y +HK+ AGL D ++   N QA    L+   W ++        + +  S E
Sbjct: 155 VYSGSWAPFYNLHKMYAGLRDAWLYCGNEQAKTLFLQFCNWAID--------ITSGLSDE 206

Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
           +    L  E GGMN+VL   Y+IT + K+L  A  F        ++ + D L + HANT 
Sbjct: 207 QMERMLGNEHGGMNEVLADAYAITREQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQ 266

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS- 395
           +P VIG +   E++G+  Y +  +FF DIV    S A GG S RE +       D +   
Sbjct: 267 VPKVIGFERISELSGNEDYHMASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDI 326

Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
           +  E+C T N+LK++  L R   E  YADYYE A  N +LS Q   E G  +Y  P    
Sbjct: 327 DGPESCNTNNILKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTP---- 381

Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
            ++ R    +     + WCC GTG+E+  K G  IY    G+   L++  Y +S  DWK 
Sbjct: 382 -ARPRHYRNYSAPNEAMWCCVGTGMENHGKYGQFIY-THVGDA--LFVNLYAASQLDWKE 437

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
             + L Q+     +  PY   + T +  +  G   +L +R P W +    + S+NG+ + 
Sbjct: 438 RGITLRQE-----TAFPYSENS-TITIAEGKGTF-NLMVRYPGWVHPGEFKVSVNGKPVD 490

Query: 576 -LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
            +  P +++S   +W   D + I  P+      + ++ P+Y    A + GP LL      
Sbjct: 491 IITGPSSYVSINRKWKKGDVVNINFPMHSSLRYLPNE-PQYI---AFMHGPILLG----- 541

Query: 635 EWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDA 694
              +KTGT  S+++LI+     F        Q    +  +++N   SI  +  PV G   
Sbjct: 542 ---MKTGT-ESMASLIAD-DSRFGQYAGGPKQPIDKAPILINNDIASIPSQLTPVPGK-- 594

Query: 695 ALHATF 700
            LH T 
Sbjct: 595 PLHFTL 600


>gi|298384655|ref|ZP_06994215.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           1_1_14]
 gi|298262934|gb|EFI05798.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           1_1_14]
          Length = 802

 Score =  231 bits (590), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 177/568 (31%), Positives = 269/568 (47%), Gaps = 65/568 (11%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
           SL DV L  SS   +AQQT+L Y+L LD D L   F + A L     +Y  WEN  + L 
Sbjct: 29  SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
           GH  GHYLSA + M+A+T +  I  +++ ++  L   Q  +GTG++   P   +L+   +
Sbjct: 86  GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145

Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQ 267
           A         L   W P Y IHK  AGL D Y+ A +  A +M      WM++       
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            + +  S  +    L  E GG+N+    +  IT D K+L LA  F     L  L    D 
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKVILDRLIKNEDR 257

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSAR 380
           L+  HANT IP VIG +   EV+ +         +     FF + V    S   GG S R
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKNDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317

Query: 381 EFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI--------AYADYYERALT 431
           E +         L   +  ETC TYNML++++ L++ + ++         Y DYYERAL 
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377

Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY 491
           N +LS Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIY 431

Query: 492 FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
             ++     LY+  +I S  +WK   V L Q+   +   D  + + +  ++K+++    +
Sbjct: 432 AHQQDT---LYVNLFIPSQLNWKEQGVTLTQET--LFPDDEKVTLRIDKAAKKKL----T 482

Query: 552 LNLRMPVWT-YSNGAQASLNG-QNLPLPPPG--NFLSATERWSYNDKLTIQLPLSLRTEA 607
           L +R+P W   S G + ++NG ++L     G   +L    +W   D +T  LP+ +  E 
Sbjct: 483 LMIRIPEWAGNSKGYEITINGKKHLSDIQAGTSTYLPLRRKWKKGDVITFHLPMKVSLEQ 542

Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGE 635
           I D +  Y    A L+GP +LA  T  E
Sbjct: 543 IPDKKDYY----AFLYGPIVLATSTGTE 566


>gi|404451488|ref|ZP_11016452.1| hypothetical protein A33Q_19258 [Indibacter alkaliphilus LW1]
 gi|403762834|gb|EJZ23856.1| hypothetical protein A33Q_19258 [Indibacter alkaliphilus LW1]
          Length = 1019

 Score =  231 bits (590), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 194/626 (30%), Positives = 294/626 (46%), Gaps = 108/626 (17%)

Query: 95  PGNFLKEVSLHDVWL--DQSSVLWRAQQTNLEYLLML---DVDSLVWSFRKTASLPTPGK 149
           P   L+   LH + L  DQ+    +  +   ++LL L   D +S ++ FR     P P  
Sbjct: 369 PQQKLELFKLHQINLEEDQTGQKTKFIENRDKFLLTLAETDPNSFLYMFRHAFDQPQPEN 428

Query: 150 AY--GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEK--------MSTVVFSLSEC 199
           A   G W++  ++LRGH  GHYL+A AQ +AST    + ++        M  V++ LS+ 
Sbjct: 429 AVPLGVWDSQETKLRGHATGHYLTAIAQAYASTGYDEVLQQNFLDKMDYMVNVLYDLSKL 488

Query: 200 Q-NKI------------------------------------GTGYLSAFPTELFDSFEA- 221
             NK+                                    G GY+SA+P + F   E  
Sbjct: 489 SGNKVNGKGNEDPVLVPKGPGKSDFDSDLSDEGIRSDYWNWGKGYISAYPPDQFIMLEKG 548

Query: 222 ------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSV 275
                    +WAPYYT+HKILAGL+D Y ++ N +AL++A  M E+ Y R+   +   ++
Sbjct: 549 ATYGGQKNQIWAPYYTLHKILAGLIDIYKVSGNEKALEIAKGMGEWVYTRLD-ALPQETL 607

Query: 276 ERHWYS-LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADY 327
            + W + +  E GGMN+ +  LY IT DP+ L  A LFD    F G       LA   D 
Sbjct: 608 IKMWNTYIAGEFGGMNETMATLYEITQDPRFLKGAQLFDNIQMFFGDAEYSHGLAKNVDT 667

Query: 328 LSHFHANTHIPIVIGSQMRYEVTG-DPLYKLIGTFFMDIVNASHSYATGGTS-------A 379
               HAN HIP V+GS   Y V+  D  +++   ++   VN  + Y+ GG +       A
Sbjct: 668 FRGLHANQHIPQVVGSLEMYRVSAKDEYFRVADNYWFKAVN-DYMYSIGGVAGARNPANA 726

Query: 380 REFWWDPKRLAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
             F  +P  L +   + G +N ETC TYNMLK++ +LF + +     DY+ER L N +L+
Sbjct: 727 ECFIAEPATLYENGFSSGGQN-ETCATYNMLKLTGNLFLFEQRGELMDYFERGLYNHILA 785

Query: 437 IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE--E 494
                 P    Y +PL  G  K    H    K   F CC GT IES +KL  SIY++  E
Sbjct: 786 SVAEDSPA-NTYHVPLRPGSIK----HFGNAKMTGFTCCNGTSIESNTKLQQSIYYKSIE 840

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
           E  V   Y+  +I S+ DW+  ++ + Q      S+    +  L    + E      L+L
Sbjct: 841 ENAV---YVNLFIPSTLDWEERNIKIKQA----TSFPKEDKTQLLVEGEGEF----VLHL 889

Query: 555 RMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
           R+P W    G   S+NG+ + L   PG++++ +  W   DK+ +++P     + +  D+P
Sbjct: 890 RVPSWA-RKGYHVSINGKEIQLDVKPGSYIAISRFWEDGDKVDLRMPFDFYLDPVM-DQP 947

Query: 614 EYASIQAILFGPYLLAGHTSG---EW 636
             AS   + +GP LLA   S    EW
Sbjct: 948 NIAS---LFYGPILLAAQESDARKEW 970


>gi|317476510|ref|ZP_07935758.1| hypothetical protein HMPREF1016_02741 [Bacteroides eggerthii
           1_2_48FAA]
 gi|316907322|gb|EFV29028.1| hypothetical protein HMPREF1016_02741 [Bacteroides eggerthii
           1_2_48FAA]
          Length = 793

 Score =  231 bits (589), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 169/558 (30%), Positives = 258/558 (46%), Gaps = 62/558 (11%)

Query: 114 VLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSAS 173
            L  A+  N+  LL  + D L+  +RK A L    + Y  W+     L GH  GHYL+A 
Sbjct: 38  TLKSARDLNINTLLKYNCDRLLAPYRKEAGLTPKAECYPNWDG----LDGHVGGHYLTAM 93

Query: 174 AQMWASTHNATIKEKMSTVVFSLSECQN-------KIGTGYLSAFPTEL-------FDSF 219
           A + A+T N   +++M  ++  ++EC         + G GY+   P             F
Sbjct: 94  A-INAATGNEECRKRMEYIIKEIAECAEANRKNHPEWGVGYMGGMPNSQNIWSNFKKGDF 152

Query: 220 EALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSV 275
                 WAP+Y +HK+ AGL D ++   N QA    L+   W ++   N   K +     
Sbjct: 153 RVYSGSWAPFYNLHKMYAGLRDAWLYCGNEQAKDLFLQFCDWAIDVTSNLSDKQMEQM-- 210

Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
                 L  E GGMN+VL   Y+ITH+ K+L  A  F        L  + D L + HANT
Sbjct: 211 ------LGNEHGGMNEVLADAYAITHEQKYLDCAKRFSHKQLFTPLLQRQDCLDNLHANT 264

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
            +P  IG +   E++G+  Y +  +FF DIV    S A GG S RE +       D +  
Sbjct: 265 QVPKAIGFERISELSGNEDYHMASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFIND 324

Query: 396 -ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
            +  E+C T NMLK++ +L R   E  YADYYE A  N +LS Q     G  +Y  P   
Sbjct: 325 IDGPESCNTNNMLKLTENLHRRNPEARYADYYELATFNHILSTQHPKHGGY-VYFTP--- 380

Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
             ++ R    +     + WCC GTG+E+  K G  IY    G+   L++  Y +S  DWK
Sbjct: 381 --ARPRHYRNYSAPNEAMWCCVGTGMENHGKYGQFIY-THVGDA--LFVNLYAASQLDWK 435

Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
              + L Q+     S +  L +T       E     +L +R P W +    + S+NGQ++
Sbjct: 436 KRGITLRQETTFPYSENSTLTIT-------EGKGAFNLMVRYPEWVHPGEFKVSVNGQSV 488

Query: 575 P-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
             +  P +++S   +W   D + I  P+      + ++ P+Y    A ++GP LL     
Sbjct: 489 DVITGPSSYVSINRKWKKGDVVNISFPMHASLRYLPNE-PQYV---AFMYGPILLG---- 540

Query: 634 GEWDIKTGTARSLSALIS 651
               +KTGT  S+++LI+
Sbjct: 541 ----MKTGT-ESMTSLIA 553


>gi|262405235|ref|ZP_06081785.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|345508054|ref|ZP_08787694.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
 gi|229444700|gb|EEO50491.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
 gi|262356110|gb|EEZ05200.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
          Length = 801

 Score =  231 bits (589), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 191/692 (27%), Positives = 309/692 (44%), Gaps = 74/692 (10%)

Query: 98  FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
           +  E  + DV L    V   A++ N+E LL  DVD L+  +RK A L    K Y  W+  
Sbjct: 27  YKNEFPIADVKL-LDGVFKHARELNIEVLLKYDVDRLLAPYRKEAGLTERKKTYPNWDG- 84

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSEC--QNKIGT-----GYLSA 210
              L GH  GHYLSA +  +A+T N     +M  ++  L  C   N I       GY+  
Sbjct: 85  ---LDGHVAGHYLSAMSMNYAATGNKECGRRMEYMISELQLCLEANAINNTEWAIGYIGG 141

Query: 211 FPT--ELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMV 259
           FP    L+ +F+          WAP+Y +HK+ AGL D ++  +N QA    LK   W +
Sbjct: 142 FPNSKNLWSTFKKGDLRIYNSAWAPFYNLHKMYAGLRDAWLYCNNKQAKTLFLKFCDWAI 201

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
                    +    + E+    L  E GGMN++L   Y IT + K+L+ A  + +   L 
Sbjct: 202 S--------ITDDLNEEQMQTVLKMEYGGMNEILADAYQITGNKKYLVAAKRYSQNILLD 253

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            L+   D L + HANT IP  IG     E++GD  Y     F  + +  + S A GG S 
Sbjct: 254 PLSQGIDNLDNKHANTQIPKFIGFARIAELSGDTKYTNASRFSWETITGNRSLAFGGNSR 313

Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
           RE +      +D +   +  E+C +YNMLK++  LFR      YADYYER + N +LS Q
Sbjct: 314 REHFPSVTSCSDYINDVDGPESCNSYNMLKLTEDLFRMQPSAHYADYYERTVFNHILSTQ 373

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
              E G  +Y        ++ R    +     + WCC GTG+E+ SK    IY   + + 
Sbjct: 374 H-PEHGGYVYFTS-----ARPRHYRVYSAPNEAMWCCVGTGMENHSKYNQFIYTHSDDS- 426

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             L++  +I+S  +WK+  + L Q+ +    ++   ++T+T +S         L +R P 
Sbjct: 427 --LFVNLFIASELNWKNKKISLRQETN--FPYEERTKLTVTKASSP-----FKLMIRYPG 477

Query: 559 WTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
           W      + S+NG+++     P +++    +W+  D + ++LP+    E +    P   +
Sbjct: 478 WVDKGALKVSVNGKSMNYSALPSSYICIDRKWNKGDVVEVELPMRSTIEHL----PNVPN 533

Query: 618 IQAILFGPYLLAGHTSGEWDIK---TGTAR-------------SLSALISPIPPSFNAQL 661
             A + GP LL   T  E D++    G  R                 LI     +  ++L
Sbjct: 534 YIAFMHGPILLGAKTGTE-DLRGLIAGDGRWGQYPSGKLLPVDQAPILIVDDMENITSKL 592

Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNF-SSLNNVIG 720
           V    E  +    +  +N SI ++  P +    A +  + L L +     +  SL+ +  
Sbjct: 593 VPIKNEPLHFKANIKAAN-SIDIKLEPFANIHDARYMMYWLTLTNKGYQTYIDSLSTIEK 651

Query: 721 KSVMLEPF--DFPGMLVQQGKEDELVVSESPK 750
           + ++LE    DF     QQ + D  ++ E  +
Sbjct: 652 EKIILEKLTVDFVAPGEQQPETDHKILQEKSR 683


>gi|346226219|ref|ZP_08847361.1| Acetyl-CoA carboxylase, biotin carboxylase [Anaerophaga
           thermohalophila DSM 12881]
          Length = 795

 Score =  231 bits (589), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 179/608 (29%), Positives = 282/608 (46%), Gaps = 53/608 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A+  N +Y++  D D L+  F   A L      YG WE+  S L GHF GHYL++ + M 
Sbjct: 49  AEALNEQYVMAHDPDRLLAPFLIDAGLEPKAPGYGNWES--SGLNGHFGGHYLTSLSLMI 106

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE---------ALKPVW 226
           AST N   +E+++ ++  L+ CQ   G GY+   P   +++             +L   W
Sbjct: 107 ASTGNEEARERLNYMIDELARCQEANGNGYVGGVPGGQDMWAEIAKGNIDAGNFSLNGKW 166

Query: 227 APYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
            P Y IHK+ AGL D ++ A N +A    +K+  W ++        +    S ++    L
Sbjct: 167 VPLYNIHKLYAGLRDAWLYAGNEKAREILIKLTDWCID--------LTAALSDDQIQEML 218

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
             E GG+N+V   +Y IT D K+L LA  F     L  L    D L+  HANT IP VIG
Sbjct: 219 VSEHGGLNEVFADVYDITGDEKYLELARRFSHREILEPLLQHEDRLTGLHANTQIPKVIG 278

Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETC 401
                E+T D  +     FF + V  + +   GG S  E +      +  + S +  ETC
Sbjct: 279 YMRIAELTHDSAWIDASDFFWNTVVNNRTITIGGNSTHEHFHPVDDFSSMIESRQGPETC 338

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
            TYNMLK+S+HLF +  ++ Y DYYE+AL N +LS Q     G ++Y  P+     + R 
Sbjct: 339 NTYNMLKLSKHLFLYKNDLKYIDYYEQALYNHILSSQHPGHGG-LVYFTPM-----RPRH 392

Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
              +     +FWCC G+GIE+  K G+ IY  ++ +V   ++  +I S  +WK   + L 
Sbjct: 393 YRVYSNPEETFWCCVGSGIENHEKYGELIYAHDDEDV---FVNLFIPSELNWKEKGLKLV 449

Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PG 580
           QK +        LR+ L  S +  VG      +R P W      + ++NG ++      G
Sbjct: 450 QKNNFPDIEKSTLRVELDESDEFIVG------IRCPAWANPGEMEVTVNGNSVNGEAVSG 503

Query: 581 NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT-SGEWD-I 638
            +   + +W   D + + LP+    + + D  P Y S   ++ GP++L   T S + D +
Sbjct: 504 QYFLVSRKWDDGDVIEVHLPMHTFGKYLPDKSP-YLS---LMHGPFVLGAATDSTDLDGL 559

Query: 639 KTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEF----PVSGTDA 694
               +R       P+ P   A ++    E+     V+   +Q +T +      P S  D 
Sbjct: 560 IADDSRMGHIAHGPLYPLDEAPMLLIDGENWEKK-VIPVDDQPMTFKALGLIVPDSEDDL 618

Query: 695 ALHATFRL 702
            L   FR+
Sbjct: 619 VLEPFFRI 626


>gi|290958971|ref|YP_003490153.1| glycosylase [Streptomyces scabiei 87.22]
 gi|260648497|emb|CBG71608.1| putative secreted glycosylase [Streptomyces scabiei 87.22]
          Length = 936

 Score =  231 bits (588), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 159/465 (34%), Positives = 232/465 (49%), Gaps = 32/465 (6%)

Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F   E++       VWAPYYT HKIL GLLD Y+  D+ +AL +A+ + +
Sbjct: 384 GFLAAYPETQFIELESMTSGDYTRVWAPYYTAHKILRGLLDAYLHVDDERALDLASGLCD 443

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + Y+R+ K+    +++R W   +  E GG+ + +  LY+IT    HL LA LFD    + 
Sbjct: 444 WMYSRLSKLPDA-TLQRMWGIFSSGEYGGLVEAIVDLYAITGKADHLALARLFDLDKLID 502

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L   HAN HIPI  G    Y+VTG+  Y      F  +V     Y  GGTS 
Sbjct: 503 ACAANTDTLDGLHANQHIPIFTGLVRLYDVTGEARYLSAAKNFWGMVIPQRMYGIGGTST 562

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW     +A T+   N ETC  YN+LK+SR LF   ++  Y DYYERAL N VL  ++
Sbjct: 563 AEFWKARGAVAGTISDTNAETCCAYNLLKLSRSLFFHEQDPKYMDYYERALLNQVLGSKQ 622

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  ++ Y + L  G  +   T   GT      CC GTG+ES +K  DS+YF    
Sbjct: 623 DKADAEKPLVTYFIGLEPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYF-ARA 675

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
           +   LY+  Y +++ DW +  V + Q  D       Y R   T  +    G   ++ LR+
Sbjct: 676 DGSALYVNLYSAATLDWSAKGVTIAQSTD-------YPREQGTTITVGGGGAAFAMRLRV 728

Query: 557 PVWTYSNGAQASLNGQNLP-LPPPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPE 614
           P W  + G + ++NG  +   P PG++ +   R W   D + + +P  LRTE   DD+  
Sbjct: 729 PSWA-TAGFRVTVNGGVVDGTPDPGSYFTIPSRTWDDGDVVRVSIPFRLRTEKALDDQ-- 785

Query: 615 YASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNA 659
             S+Q + +GP  L G       +  G  R+ + L   + PS  A
Sbjct: 786 --SLQTLFYGPVNLVGRNRATSYLPVGLYRN-AGLSGDLLPSLTA 827



 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 58/113 (51%), Gaps = 6/113 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           ++   L DV L Q  +    ++  L++    DVD L+  FR  A L T G  A GGWE  
Sbjct: 45  VRPFELKDVTLGQG-LFAEKRRLMLDHGRGYDVDRLLQVFRANAGLSTKGAVAPGGWEGL 103

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTG 206
             E    LRGH+ GH+L+  AQ  A T +    +++  ++ +L+E +  + TG
Sbjct: 104 DGEANGNLRGHYTGHFLTMLAQAHAGTRDTVYSDRIRYMIGALAEVREALRTG 156


>gi|294646986|ref|ZP_06724603.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|294806386|ref|ZP_06765229.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
 gi|292637657|gb|EFF56058.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|294446401|gb|EFG15025.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 813

 Score =  231 bits (588), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 191/692 (27%), Positives = 309/692 (44%), Gaps = 74/692 (10%)

Query: 98  FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
           +  E  + DV L    V   A++ N+E LL  DVD L+  +RK A L    K Y  W+  
Sbjct: 39  YKNEFPIADVKL-LDGVFKHARELNIEVLLKYDVDRLLAPYRKEAGLTERKKTYPNWDG- 96

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSEC--QNKIGT-----GYLSA 210
              L GH  GHYLSA +  +A+T N     +M  ++  L  C   N I       GY+  
Sbjct: 97  ---LDGHVAGHYLSAMSMNYAATGNKECGRRMEYMISELQLCLEANAINNTEWAIGYIGG 153

Query: 211 FPT--ELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMV 259
           FP    L+ +F+          WAP+Y +HK+ AGL D ++  +N QA    LK   W +
Sbjct: 154 FPNSKNLWSTFKKGDLRIYNSAWAPFYNLHKMYAGLRDAWLYCNNKQAKTLFLKFCDWAI 213

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
                    +    + E+    L  E GGMN++L   Y IT + K+L+ A  + +   L 
Sbjct: 214 S--------ITDDLNEEQMQTVLKMEYGGMNEILADAYQITGNKKYLVAAKRYSQNILLD 265

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            L+   D L + HANT IP  IG     E++GD  Y     F  + +  + S A GG S 
Sbjct: 266 PLSQGIDNLDNKHANTQIPKFIGFARIAELSGDTKYTNASRFSWETITGNRSLAFGGNSR 325

Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
           RE +      +D +   +  E+C +YNMLK++  LFR      YADYYER + N +LS Q
Sbjct: 326 REHFPSVTSCSDYINDVDGPESCNSYNMLKLTEDLFRMQPSAHYADYYERTVFNHILSTQ 385

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
              E G  +Y        ++ R    +     + WCC GTG+E+ SK    IY   + + 
Sbjct: 386 H-PEHGGYVYFTS-----ARPRHYRVYSAPNEAMWCCVGTGMENHSKYNQFIYTHSDDS- 438

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             L++  +I+S  +WK+  + L Q+ +    ++   ++T+T +S         L +R P 
Sbjct: 439 --LFVNLFIASELNWKNKKISLRQETN--FPYEERTKLTVTKASSP-----FKLMIRYPG 489

Query: 559 WTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
           W      + S+NG+++     P +++    +W+  D + ++LP+    E +    P   +
Sbjct: 490 WVDKGALKVSVNGKSMNYSALPSSYICIDRKWNKGDVVEVELPMRSTIEHL----PNVPN 545

Query: 618 IQAILFGPYLLAGHTSGEWDIK---TGTAR-------------SLSALISPIPPSFNAQL 661
             A + GP LL   T  E D++    G  R                 LI     +  ++L
Sbjct: 546 YIAFMHGPILLGAKTGTE-DLRGLIAGDGRWGQYPSGKLLPVDQAPILIVDDMENITSKL 604

Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNF-SSLNNVIG 720
           V    E  +    +  +N SI ++  P +    A +  + L L +     +  SL+ +  
Sbjct: 605 VPIKNEPLHFKANIKAAN-SIDIKLEPFANIHDARYMMYWLTLTNKGYQTYIDSLSTIEK 663

Query: 721 KSVMLEPF--DFPGMLVQQGKEDELVVSESPK 750
           + ++LE    DF     QQ + D  ++ E  +
Sbjct: 664 EKIILEKLTVDFVAPGEQQPETDHKILQEKSR 695


>gi|29345759|ref|NP_809262.1| hypothetical protein BT_0349 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29337652|gb|AAO75456.1| Acetyl-CoA carboxylase, biotin carboxylase [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 802

 Score =  231 bits (588), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 177/568 (31%), Positives = 268/568 (47%), Gaps = 65/568 (11%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
           SL DV L  SS   +AQQT+L Y+L LD D L   F + A L     +Y  WEN  + L 
Sbjct: 29  SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
           GH  GHYLSA + M+A+T +  I  +++ ++  L   Q  +GTG++   P   +L+   +
Sbjct: 86  GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145

Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQ 267
           A         L   W P Y IHK  AGL D Y+ A +  A +M      WM++       
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            + +  S  +    L  E GG+N+    +  IT D K+L LA  F     L  L    D 
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKVILDPLIKNEDR 257

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSAR 380
           L+  HANT IP VIG +   EV+ +         +     FF + V    S   GG S R
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKNDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317

Query: 381 EFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI--------AYADYYERALT 431
           E +         L   +  ETC TYNML++++ L++ + ++         Y DYYERAL 
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377

Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY 491
           N +LS Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIY 431

Query: 492 FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
             ++     LY+  +I S  +WK   V L Q+   +   D  + + +  ++K+ +    +
Sbjct: 432 AHQQDT---LYVNLFIPSQLNWKEQGVTLTQET--LFPDDEKVTLRIDKAAKKNL----T 482

Query: 552 LNLRMPVWT-YSNGAQASLNG-QNLPLPPPG--NFLSATERWSYNDKLTIQLPLSLRTEA 607
           L +R+P W   S G + ++NG ++L     G   +L    +W   D +T  LP+ +  E 
Sbjct: 483 LMIRIPEWAGNSKGYEITINGKKHLSDIQTGASTYLPIRRKWKKGDMITFHLPMKVSLEQ 542

Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGE 635
           I D +  Y    A L+GP +LA  T  E
Sbjct: 543 IPDKKDYY----AFLYGPIVLATSTGTE 566


>gi|399033094|ref|ZP_10732120.1| hypothetical protein PMI10_04007 [Flavobacterium sp. CF136]
 gi|398068528|gb|EJL59944.1| hypothetical protein PMI10_04007 [Flavobacterium sp. CF136]
          Length = 1019

 Score =  231 bits (588), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 189/635 (29%), Positives = 290/635 (45%), Gaps = 101/635 (15%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--YGGWEN 156
           L +V L+D      +     +   L  L   D DS ++ FR       P +A   G W+ 
Sbjct: 376 LDQVVLNDNLDGHHTKFMENRDKFLTTLATTDPDSFLYMFRNAFGQEQPKEAEPLGVWDT 435

Query: 157 PISELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVVFSL----------SECQN 201
             ++LRGH  GHYL+A AQ +AST       A  K+KM  +V +L           E   
Sbjct: 436 QETKLRGHATGHYLTAIAQAYASTGYDKTLQANFKDKMEYMVNTLYDLEQLSGKPKEAGG 495

Query: 202 KI--------------------------------GTGYLSAFPTELFDSFE-------AL 222
           K                                 G G++SA+P + F   E         
Sbjct: 496 KFVSDPTAIPFGPGKTNYDSDLSAEGIRTDYWNWGKGFISAYPPDQFIMLENGATYGGQK 555

Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
             +WAPYYT+HKILAGL+D Y ++ N +AL+ A  M ++ Y R++K+ T   +      +
Sbjct: 556 TQIWAPYYTLHKILAGLMDVYEVSGNEKALETAKGMGDWVYARMKKLPTETLISMWNRYI 615

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
             E GGMN+ + RLY IT DP +L +A LFD    F G       LA   D     HAN 
Sbjct: 616 AGEFGGMNEAMARLYRITKDPHYLEVAQLFDNIKVFYGDANHSHGLAKNVDTFRGLHANQ 675

Query: 336 HIPIVIGSQMRYEVTGDP-LYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPK 387
           HIP ++G+   Y  +  P  Y++   F+   VN  + Y+ GG +       A  F   P 
Sbjct: 676 HIPQIMGALEMYRDSNTPDYYRVADNFWYKTVN-DYMYSIGGVAGARNPANAECFISQPA 734

Query: 388 RLAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
            + +   + G +N ETC TYNMLK++  LF + +     DYYER L N +LS      P 
Sbjct: 735 TIYENGFSSGGQN-ETCATYNMLKLTGDLFLYEQRGELMDYYERGLYNHILSSVAENSP- 792

Query: 445 VMIYMLPLGRG-VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
              Y +PL  G V +  + H  G     F CC GT IES +K  +SIYF+   N   LY+
Sbjct: 793 ANTYHVPLRPGSVKQFGNPHMTG-----FTCCNGTAIESNTKFQNSIYFKSADN-NSLYV 846

Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
             Y+ S+  W   ++ + Q  D     + + ++T+  + K +      L +R+P W  + 
Sbjct: 847 NLYVPSTLKWTEKNITVKQTTD--FPNEDFTKLTIKGNGKFD------LKVRVPHWA-TK 897

Query: 564 GAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
           G    +NG++  +   PG++L+  ++W   D + +++P     E + D +    +I ++ 
Sbjct: 898 GFFVKINGKSEKVKAQPGSYLTLNKKWKDGDVIELRMPFQFHLEPVMDQQ----NIASLF 953

Query: 623 FGPYLLAGHTS---GEWDIKTGTARSLSALISPIP 654
           +GP LLA   S    +W   T   + +S  I+  P
Sbjct: 954 YGPILLAAQESEPGKDWRKVTLDVKDISKSIAGDP 988


>gi|395803808|ref|ZP_10483051.1| hypothetical protein FF52_18073 [Flavobacterium sp. F52]
 gi|395434079|gb|EJG00030.1| hypothetical protein FF52_18073 [Flavobacterium sp. F52]
          Length = 760

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 171/560 (30%), Positives = 255/560 (45%), Gaps = 66/560 (11%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L EV L D           AQ  +L+Y+L LD D L+  +   + LP     YG WEN  
Sbjct: 27  LSEVKLKD------GPFKNAQDVDLKYILALDPDKLLAPYLLESRLPPKADRYGNWEN-- 78

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
             L GH  GHYLSA A M+ ST N  +K+++  ++  L+ CQ K G GY+   P      
Sbjct: 79  IGLDGHIGGHYLSALALMYKSTGNKELKDRLDYMLSELARCQAKNGNGYVGGIPQGKVFW 138

Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
                 ++  S   L   W P Y IHK+ AGL D Y    + QA    +K+  W +E   
Sbjct: 139 DRIHKGDIDGSSFGLNNTWVPIYNIHKLFAGLTDAYQYTGSEQAKDIVIKLGDWFIE--- 195

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
                +I   S E+    L  E GG+N+    LY IT D K+L  A        L  L  
Sbjct: 196 -----LIRPLSDEQIQKVLATEHGGINESFADLYIITKDKKYLETAEKLSHKALLNPLLQ 250

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
           + D L+  HANT IP V+G +    ++ +  +     FF + V    + A GG S  E +
Sbjct: 251 KEDKLTGLHANTQIPKVVGFEKIAALSDNKEWSDGVQFFWNNVTQKRTVAFGGNSVAEHF 310

Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
                 +  + S E  ETC +YNM ++++ LF    ++ Y D+YER L N +LS Q   E
Sbjct: 311 NPVNDFSGMVKSNEGPETCNSYNMERLAKALFLDKNDVHYLDFYERTLYNHILSSQH-PE 369

Query: 443 PGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
            G  +Y  P+       R  H   +     S WCC GTG+E+ +K G+ IY   + +   
Sbjct: 370 KGGFVYFTPI-------RPNHYRVYSQPQTSMWCCVGTGLENHTKYGELIYSHTQSD--- 419

Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
           L++  +I S   WK   V L Q  +      PY   T      ++     +LN+R P W 
Sbjct: 420 LFVNLFIPSVLKWKENGVELEQNTNF-----PYENQTELVLKLKKTKNF-ALNIRYPKWA 473

Query: 561 -----YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
                + NG +  +  Q      P  ++S +++W   DK+ ++   S+  E +    P+ 
Sbjct: 474 ENFEIFVNGKEQKIASQ------PSEYVSISKKWKTGDKIIVRFKTSIHLENL----PDG 523

Query: 616 ASIQAILFGPYLLAGHTSGE 635
           ++  A + GP +LA  TS E
Sbjct: 524 SNWSAFVKGPIVLAAKTSTE 543


>gi|431799831|ref|YP_007226735.1| hypothetical protein Echvi_4552 [Echinicola vietnamensis DSM 17526]
 gi|430790596|gb|AGA80725.1| putative glycosyl hydrolase of unknown function (DUF1680) [Echinicola
            vietnamensis DSM 17526]
          Length = 1042

 Score =  230 bits (586), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 196/665 (29%), Positives = 295/665 (44%), Gaps = 97/665 (14%)

Query: 99   LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY--GGWEN 156
            L  VSL       SS     +   +  L   + D  ++ FR       P  A   G W++
Sbjct: 398  LDAVSLETDIHGHSSKFIENRDKFISTLAGTNPDDFLYMFRNAFGQEQPAGAVPLGVWDS 457

Query: 157  PISELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVV---FSLSECQNKI----- 203
              ++LRGH  GHYL+A AQ +AST       A   +KM+ +V   ++LS+   K      
Sbjct: 458  QETKLRGHATGHYLTAIAQAYASTGYDTALQANFADKMAYMVNTLYNLSQMAGKPSAEAD 517

Query: 204  ----------------------------------GTGYLSAFPTELFDSFE-------AL 222
                                              G GY+SA+P + F   E         
Sbjct: 518  GHNADPTAVPMGPGKDFYDSDLSEEGIRTDYWNWGEGYISAYPPDQFIMLEHGAKYGGQK 577

Query: 223  KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
              VWAPYYT+HKILAGL+D Y ++ N +AL +A  M  +   R+ K+ T   +      +
Sbjct: 578  DQVWAPYYTLHKILAGLMDIYEVSGNEKALSVAKGMGTWVAARLDKLPTSTLISMWNTYI 637

Query: 283  NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
              E GGMN+ + RLY IT   ++L  A LFD    F G       LA   D     HAN 
Sbjct: 638  AGEFGGMNEAMARLYRITGSSRYLAAAKLFDNITVFYGNADHDHGLAKNVDTFRGLHANQ 697

Query: 336  HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPKR 388
            HIP ++G+   Y  T    Y  I   F  I    + Y+ GG +       A  F  +P  
Sbjct: 698  HIPQIMGALEMYRDTESAPYFHIADNFWHIATNDYMYSIGGVAGARTPANAECFTTEPAT 757

Query: 389  LAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
            L +   + G +N ETC TYNMLK+SR+LF + ++ AY DYYER L N +L+      P  
Sbjct: 758  LYEFGFSAGGQN-ETCATYNMLKLSRNLFLFQQDPAYMDYYERGLYNHILASVAKDSP-A 815

Query: 446  MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
              Y +PL  G  K         K   F CC GT IES +KL +SIYF+   +   LY+  
Sbjct: 816  NTYHVPLRPGSIKQFGN----PKMKGFTCCNGTAIESSTKLQNSIYFKSVDD-QSLYVNL 870

Query: 506  YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
            ++ S+  WK  ++ + Q        + + R+T+     Q  G+   L +R+P W  + G 
Sbjct: 871  FVPSTLHWKERNLTIVQST--AFPKEDHTRLTV-----QGKGKF-VLKIRVPQWA-TEGI 921

Query: 566  QASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
            + S+NG+   +   PG + +   +W   D + I +P     E + D +    +I ++ +G
Sbjct: 922  KVSINGKPAQVDAVPGTYATIQRKWKNGDTIDINIPFQFHLEPVMDQQ----NIASLFYG 977

Query: 625  PYLLAGHTS---GEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQS 681
            P LLA        EW   T  A+++ A I+  P +    +   T +    T+   +    
Sbjct: 978  PVLLAAQEEEPRKEWRKVTLNAKNIGATINGNPEALEFTIDGVTYKPFYETYGRHSVYLD 1037

Query: 682  ITMEE 686
            +T+E+
Sbjct: 1038 VTLED 1042


>gi|261407096|ref|YP_003243337.1| hypothetical protein GYMC10_3284 [Paenibacillus sp. Y412MC10]
 gi|261283559|gb|ACX65530.1| protein of unknown function DUF1680 [Paenibacillus sp. Y412MC10]
          Length = 622

 Score =  229 bits (585), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 171/572 (29%), Positives = 265/572 (46%), Gaps = 78/572 (13%)

Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASL----PTPGKAYGGWENPISELRGHFVGHYL 170
           L R ++ N  YL+ LD   L++++   A        P  A+GGWE P+ +LRGHF+GH+L
Sbjct: 16  LIRRERANRSYLMKLDSGHLLFNYHLEAGRFHGRTIPEGAHGGWETPVCQLRGHFLGHWL 75

Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYY 230
           S +A  +  + +  +K K+  +V  L ECQ   G  ++   P +      + K +WAP Y
Sbjct: 76  SGAALHYEESGDIELKAKLDAIVHELHECQRDNGGQWVGPIPEKYLHWIASGKSIWAPQY 135

Query: 231 TIHKILAGLLDQYVLADNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
             HKIL GL+D +  A N QAL    + A W VE+           ++ E+    L+ ET
Sbjct: 136 NCHKILMGLVDAWQYAGNRQALDIVDRFADWFVEW--------SGTFTREQFDDILDVET 187

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGM +V   L  IT   K+ +L   + +      L    D L++ HANT IP V+G    
Sbjct: 188 GGMLEVWADLLHITGADKYRVLLDRYYRSRLFQPLLEGKDPLTNMHANTTIPEVLGCARA 247

Query: 347 YEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
           YEVTGD  +  ++  ++   V    S ATGG +A E W    ++   LG +N+E CT YN
Sbjct: 248 YEVTGDDRWLSIVQAYWNCAVTERGSLATGGQTAGEVWMPKMKMKARLGDKNQEHCTVYN 307

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE------------PGVMIYMLPLG 453
           M++++  LFR + +  YA Y E  L NG+++     E             G++ Y LP+ 
Sbjct: 308 MIRLADFLFRQSGDPTYAQYIEYNLYNGIMAQAYYQEYGLTGSQHNYPRTGLLTYFLPMK 367

Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
            G+ K      W T+ +SF+CC+GT +++ +     IY+ ++G++  +YI QY  S  D 
Sbjct: 368 AGLRKE-----WSTETDSFFCCHGTMVQANAAWNMGIYY-QDGDI--VYISQYFDSELDA 419

Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSK----QEVGQLSSLN---------------- 553
                ++      IV     +  +L  SS     Q +   +S+N                
Sbjct: 420 SIAGTLIR-----IVQTQDKMSGSLLSSSNTAGYQAINDTASINENIPTFRKYDFIVSAA 474

Query: 554 --------LRMPVWTYSNGAQASLNG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
                    R+P W  + GA   +N   Q   L    NF      W   D ++I LP+ +
Sbjct: 475 APTTFTLRFRIPEWIMA-GASVYVNDVLQGTTLDSE-NFYDIHRAWKEGDTVSIMLPIGI 532

Query: 604 RTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           R   + DD        A  +GP +LAG    E
Sbjct: 533 RFVPLPDDE----RTGAFRYGPEVLAGLCESE 560


>gi|293370109|ref|ZP_06616674.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
 gi|292634837|gb|EFF53361.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
          Length = 800

 Score =  229 bits (585), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 172/567 (30%), Positives = 266/567 (46%), Gaps = 64/567 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L LD D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L+  Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A +M      WM++        
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L  + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   EV+ D         +     FF + V    S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAEVSQDDKTWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
             +     LY+  +I S   WK   ++L Q+       D  + + +  + K++     +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETR--FPDDDKVTLRIDEAPKKK----RTL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+     +     +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGEW 636
           D +  Y    A L+GP +LA  T  E+
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTEY 566


>gi|295085157|emb|CBK66680.1| Uncharacterized protein conserved in bacteria [Bacteroides
           xylanisolvens XB1A]
          Length = 800

 Score =  229 bits (584), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 171/567 (30%), Positives = 266/567 (46%), Gaps = 64/567 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L LD D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L+  Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A +M      WM++        
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L  + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V    S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
             +     LY+  +I S   WK   ++L Q+       D  + + +  + K++     +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETR--FPDDDKVTLRIDEAPKKK----RTL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+     +     +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGEW 636
           D +  Y    A L+GP +LA  T  E+
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTEY 566


>gi|419850639|ref|ZP_14373619.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
           35B]
 gi|419851584|ref|ZP_14374510.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
           2-2B]
 gi|386408481|gb|EIJ23391.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
           35B]
 gi|386413301|gb|EIJ27914.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
           2-2B]
          Length = 1834

 Score =  229 bits (584), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 186/610 (30%), Positives = 276/610 (45%), Gaps = 87/610 (14%)

Query: 85  KIKNPGGFDLP--GNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTA 142
           + +N GG D+    N+L E  + +V +     L  A +  +EYLL  + D L+  FR  A
Sbjct: 208 QTENGGGHDVQYLKNYLSEQGMENVTV-ADEYLQNAGKKEVEYLLSFEPDRLLVEFRAQA 266

Query: 143 SLPTPG-KAYGGWENPISELR------------GHFVGHYLSASAQMWAST-----HNAT 184
            L T G K YGGWEN   E R            GHFVGH++SA++Q   ST       A 
Sbjct: 267 GLDTKGAKNYGGWENGPDESRNPDGSSKPGRFTGHFVGHWISAASQAQRSTFATADQKAQ 326

Query: 185 IKEKMSTVVFSLSECQ------NKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAG 238
           +   ++ VV  + E Q      +    G+  AF   +  +      +  P+Y +HK+ AG
Sbjct: 327 LSANLTAVVKGIREAQEAYAKKDTANAGFFPAFSASVVPNGGG--GLIVPFYNLHKVEAG 384

Query: 239 LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYS 298
           ++  Y  + +A+  + A      F    + V+   S       L  E GGMND LY++  
Sbjct: 385 MVQAYDYSTDAETRETAKAAAVDF---AKWVVNWKSAHASTDMLRTEYGGMNDALYQVAE 441

Query: 299 ITH--DPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY-------- 347
           I    D + +L A HLFD+      LA   D L+  HANT IP + G+  RY        
Sbjct: 442 IADASDKQTVLTAAHLFDETALFQKLANGQDPLNGLHANTTIPKLTGAMQRYVAYTEDED 501

Query: 348 ---EVTGD------PLYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPKRLAD 391
               ++ D       LY      F DIV   H+Y  GG S       A E W D  +  D
Sbjct: 502 LYNSLSADERGELTSLYLKAAQNFFDIVVKDHTYVNGGNSQSEHFHVAGELWKDATQNGD 561

Query: 392 TLGS----ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             G        ETC  YNMLK++R LF+ TK+  Y++YYE    N +++ Q   E G+  
Sbjct: 562 QNGGYRNFSTVETCNEYNMLKLARILFQVTKDSKYSEYYEHTFINAIVASQN-PETGMTT 620

Query: 448 YMLPLGRGVSKARSTHG-------WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
           Y  P+  G  K     G       +G     +WCC GTGIE+F+KL DS YF +E NV  
Sbjct: 621 YFQPMKAGYPKVFGITGTDYDADWFGGAIGEYWCCQGTGIENFAKLNDSFYFTDENNV-- 678

Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
            Y+  + SS++     ++ + Q  +   + D    ++ T S        ++L LR+P W 
Sbjct: 679 -YVNMFWSSTYTDTRHNLTITQTANVPKTEDVTFEVSGTGS--------ANLKLRVPDWA 729

Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
            +NG +  ++G    L    N    T       K+T  LP  L+T    D++ ++ + Q 
Sbjct: 730 ITNGVKLVVDGTEQALTKDENGW-VTVAIKDGAKITYTLPAKLQTIDAADNK-DWVAFQ- 786

Query: 621 ILFGPYLLAG 630
             +GP +LAG
Sbjct: 787 --YGPVVLAG 794


>gi|427403045|ref|ZP_18894042.1| hypothetical protein HMPREF9710_03638 [Massilia timonae CCUG 45783]
 gi|425718056|gb|EKU81008.1| hypothetical protein HMPREF9710_03638 [Massilia timonae CCUG 45783]
          Length = 781

 Score =  229 bits (584), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 168/537 (31%), Positives = 249/537 (46%), Gaps = 45/537 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ TNL YL+ ++ D L+  F + A L     +YG WE+  + L GH  GHYLSA A M 
Sbjct: 38  AQTTNLNYLMAMEPDRLLAPFLREAGLQPRQPSYGNWES--TGLDGHMGGHYLSALALMH 95

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL------------FDSFEALKPV 225
           AST +     +++  V  L   Q   G GYL   P                D+F ++   
Sbjct: 96  ASTGDQEALRRLNYFVAELKRAQQANGDGYLGGIPGGRQAWRDIAAGKLEADNF-SVNGK 154

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
           W P+Y +HK+ AGL D Y  A N  A  M   + ++      K+    S E+    L  E
Sbjct: 155 WVPWYNLHKVYAGLRDAYRYAGNEDAKAMLVQLSDWALALSAKL----SPEQMQTMLRSE 210

Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
            GGMN++   +  +T + K+L LA  F     L  LA + D L+  HANT IP VIG + 
Sbjct: 211 HGGMNEIFVDVAEMTGERKYLDLALAFSHQAVLQPLARKQDQLTGLHANTQIPKVIGFKR 270

Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
             ++TG         FF   V    + A GG S +E +         +   E  ETC TY
Sbjct: 271 IADMTGRQDMGEAARFFWQTVVDKRTVAIGGNSVKEHFHSTDDFDPMVHEVEGPETCNTY 330

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           NMLK++  LFR  ++  Y+DYYERAL N +LS QR    G  +Y  P+     +  S   
Sbjct: 331 NMLKLTGMLFRSEQKGMYSDYYERALYNHILSSQR--PEGGFVYFTPMRPNHYRVYSQVD 388

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
            G      WCC G+GIES +K G+ IY  ++     L++  +++S+ DWK   V + Q  
Sbjct: 389 KG-----MWCCVGSGIESHAKYGEFIYARDKDT---LFVNLFVASTLDWKDKGVRVTQAT 440

Query: 525 D-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNF 582
             P        R+T+    +       ++ +R P W         +NG  + +   PG +
Sbjct: 441 TFPDAD---TTRLTVDGEGR------FTMKIRYPAWVAPGRMAVRVNGAEVKIDARPGGY 491

Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIK 639
            +    W   D++ ++LP++   E +    P  ++  A+L GP +LA  T    D K
Sbjct: 492 ATIARAWRKGDRVDVRLPMTTHLEQM----PGRSNYYAVLHGPVVLAARTRMVGDDK 544


>gi|427386394|ref|ZP_18882591.1| hypothetical protein HMPREF9447_03624 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726434|gb|EKU89299.1| hypothetical protein HMPREF9447_03624 [Bacteroides oleiciplenus YIT
           12058]
          Length = 792

 Score =  229 bits (583), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 184/625 (29%), Positives = 284/625 (45%), Gaps = 68/625 (10%)

Query: 113 SVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSA 172
           S   +AQQT+L Y+L ++ D L+  F + A L     +Y  WEN  + L GH  GHY+SA
Sbjct: 38  SPFLQAQQTDLHYILAMEPDRLLAPFLREAGLAPKAPSYTNWEN--TGLDGHIGGHYISA 95

Query: 173 SAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL---------------FD 217
            + M+A+T +  +  +++ ++  L   Q  +GTG++   P  L               FD
Sbjct: 96  LSMMYAATGDTAVYNRLNYMLDELHRAQQAVGTGFIGGTPGSLQLWKEIKEGNIRAGGFD 155

Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
               L   W P Y IHK  AGL D Y+ A +  A +M   + ++       +    + ++
Sbjct: 156 ----LNSKWVPLYNIHKTYAGLRDAYLYAGSDLAREMLIALTDWMIG----ITAGLTDQQ 207

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
               L  E GG+N+    + +IT D K+L LA  F     L  L    D L+  HANT I
Sbjct: 208 MQDMLRSEHGGLNETFADVAAITGDKKYLELARRFSHKVILDPLIKDEDRLTGMHANTQI 267

Query: 338 PIVIGSQMRYEVTGDP-------LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
           P VIG +   E++ D         +     FF + V    S   GG S RE +      +
Sbjct: 268 PKVIGYKRIAELSQDDNVWNHATEWDHAARFFWNTVVNHRSVCIGGNSVREHFHPANDFS 327

Query: 391 DTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
             L   E  ETC TYNML++++ L++ + +  +ADYYERAL N +L+ Q   + G  +Y 
Sbjct: 328 PMLNDIEGPETCNTYNMLRLTKMLYQDSPDSRFADYYERALYNHILASQE-PDKGGFVYF 386

Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
            P+  G  +      +     S WCC G+G+E+ +K G+ IY  ++     LY+  +I S
Sbjct: 387 TPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAHQKDT---LYVNLFIPS 438

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-YSNGAQAS 568
              WK   V L Q+     +    LR  +  +SK+      ++++R P W   S G    
Sbjct: 439 QLTWKEKGVSLVQETRFPDNGQVTLR--IDKASKKAF----TISIRQPEWADSSKGYNLK 492

Query: 569 LNGQNLPLPPPGN--FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
           +NG+        N  +LS   +W   D +T  LP+ ++ E I D    Y    A L+GP 
Sbjct: 493 VNGKEQSSATATNSGYLSVNRKWKKGDVVTFTLPMQIKMEQIPDKENYY----AFLYGPI 548

Query: 627 LLAGHTSGEW-------DIKTG-TARSLSALISPIPPSF-NAQLVTFTQESGNSTFVMSN 677
           +LA  T  E        D + G  A      +S IP    N + ++ +    NST +  N
Sbjct: 549 VLAASTGTEHLDGLYADDSRGGHIAHGKQIPVSEIPMLIGNPEAISQSLHKENSTQLAFN 608

Query: 678 SNQSITMEEFPVSGTDAALHATFRL 702
            +  +    +P SG    L   FRL
Sbjct: 609 YDGKV----YPASGKAMKLIPFFRL 629


>gi|322692034|ref|YP_004221604.1| cell surface protein [Bifidobacterium longum subsp. longum JCM
           1217]
 gi|320456890|dbj|BAJ67512.1| putative cell surface protein [Bifidobacterium longum subsp. longum
           JCM 1217]
          Length = 1984

 Score =  228 bits (582), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 185/610 (30%), Positives = 276/610 (45%), Gaps = 87/610 (14%)

Query: 85  KIKNPGGFDLP--GNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTA 142
           + +N GG D+    N+L E  + +V +     L  A +  +EYLL  + D L+  FR  A
Sbjct: 358 QTENGGGHDVQYLKNYLSEQGMENVTV-ADEYLQNAGKKEVEYLLSFEPDRLLVEFRAQA 416

Query: 143 SLPTPG-KAYGGWENPISELR------------GHFVGHYLSASAQMWAST-----HNAT 184
            L T G K YGGWEN   E R            GHFVGH++SA++Q   ST       A 
Sbjct: 417 GLDTKGAKNYGGWENGPDESRNPDGSSKPGRFTGHFVGHWISAASQAQRSTFATADQKAQ 476

Query: 185 IKEKMSTVVFSLSECQ------NKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAG 238
           +   ++ VV  + E Q      +    G+  AF   +  +      +  P+Y +HK+ AG
Sbjct: 477 LSANLTAVVKGIREAQEAYAKKDTANAGFFPAFSASVVPNGGG--GLIVPFYNLHKVEAG 534

Query: 239 LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYS 298
           ++  Y  + +A+  + A      F    + V+   S       L  E GGMND LY++  
Sbjct: 535 MVQAYDYSTDAETRETAKAAAVDF---AKWVVNWKSAHASTDMLRTEYGGMNDALYQVAE 591

Query: 299 ITH--DPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY-------- 347
           I    D + +L A HLFD+      LA   D L+  HANT IP + G+  RY        
Sbjct: 592 IADASDKQTVLTAAHLFDETALFQKLANGQDPLNGLHANTTIPKLTGAMQRYVAYTEDED 651

Query: 348 ---EVTGDPLYKLIGTF------FMDIVNASHSYATGGTS-------AREFWWDPKRLAD 391
               ++ D   KL   +      F DIV   H+Y  GG S       A E W D  +  D
Sbjct: 652 LYNSLSADERGKLTSLYLKAAQNFFDIVVKDHTYVNGGNSQSEHFHVAGELWKDATQNGD 711

Query: 392 TLGS----ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             G        ETC  YNMLK++R LF+ TK+  Y++YYE    N +++ Q   E G+  
Sbjct: 712 QNGGYRNFSTVETCNEYNMLKLARILFQVTKDSKYSEYYEHTFINAIVASQN-PETGMTT 770

Query: 448 YMLPLGRGVSKARSTHG-------WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
           Y  P+  G  K     G       +G     +WCC GTGIE+F+KL DS YF +E NV  
Sbjct: 771 YFQPMKAGYPKVFGITGTDYDADWFGGAIGEYWCCQGTGIENFAKLNDSFYFTDENNV-- 828

Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
            Y+  + SS++     ++ + Q  +   + D    ++ T S        ++L LR+P W 
Sbjct: 829 -YVNMFWSSTYTDTRHNLTITQTANVPKTEDVTFEVSGTGS--------ANLKLRVPDWA 879

Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
            +NG +  ++G    L    N    T       K+T  LP  L+     D++ ++ + Q 
Sbjct: 880 ITNGVKLVVDGTEQALTKDENGW-VTVAIKDGAKITYTLPAKLQAIDAADNK-DWVAFQ- 936

Query: 621 ILFGPYLLAG 630
             +GP +LAG
Sbjct: 937 --YGPVVLAG 944


>gi|255691978|ref|ZP_05415653.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides
           finegoldii DSM 17565]
 gi|260622387|gb|EEX45258.1| hypothetical protein BACFIN_07051 [Bacteroides finegoldii DSM
           17565]
          Length = 800

 Score =  228 bits (582), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 173/566 (30%), Positives = 269/566 (47%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L DV L  S  L +AQQT+L Y+L L+ D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQDVKLLDSPFL-QAQQTDLHYILALNPDRLLAPFLREAGLTPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  I  +++ ++  L   Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAIYNRLNYMLDELYRAQQAVGTGFIGGTPGSLQLWKEIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+   + QA +M      WM++        
Sbjct: 147 GNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYTGSDQARRMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S ++    L  E  G+N+    + +IT D K+L LA  F     L  L    D L
Sbjct: 199 ITSGLSDQQIQDMLRSEHSGLNETFADVAAITGDKKYLELARRFSHKIILDPLIKDKDRL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V  + S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAEEWDHAARFFWNTVVNNRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         +   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPADNFTSMINDVQGPETCNTYNMLRLTKMLYQNSHNPCNINEPDPNYINYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
            ++     LY+  +I S  +WK   V+L Q+       D  + + +  +SK++     +L
Sbjct: 433 HQKDT---LYVNLFIPSQLNWKEQGVILTQETR--FPDDNKVTLRIDKASKKQ----RTL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQNLPLP-PPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S+    S+NG+    P   GN +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSSNYSISINGKKETFPTKKGNQYLPLSRKWKKGDVITFNLPMKVTIEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565


>gi|325299889|ref|YP_004259806.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
 gi|324319442|gb|ADY37333.1| protein of unknown function DUF1680 [Bacteroides salanitronis DSM
           18170]
          Length = 797

 Score =  228 bits (582), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 172/537 (32%), Positives = 247/537 (45%), Gaps = 47/537 (8%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
           +A   N++ L   D D L+  + K A LP+  + +  WE     L GH  GHYLSA A  
Sbjct: 43  QACDLNVKTLKQYDTDRLLAPYLKEAGLPSKAEGFSNWEG----LDGHVGGHYLSALAIH 98

Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA-----LKPVWAPY 229
           +A+T +A  +++M  +V  L  CQ   G GY+   P    L+   +      +   W P+
Sbjct: 99  YAATGDAECRQRMDYMVSELKRCQEAHGNGYIGGVPDGERLWKEIQQGNVGLIWKYWVPW 158

Query: 230 YTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGM 289
           Y +HK  AGL D +    N +A +M   + ++       VI   S E+    L  E GGM
Sbjct: 159 YNLHKTYAGLRDAWAYGGNEEARQMFLDLCDWGLT----VIAPLSDEQMEQMLENEFGGM 214

Query: 290 NDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEV 349
           ++V    Y +T D K+L  A  F     L  +A   D L + HANT +P V+G Q   E+
Sbjct: 215 DEVYADAYEMTGDVKYLDAAKRFSHHWLLDSMAAGIDNLDNKHANTQVPKVVGYQRIAEL 274

Query: 350 TGD-------PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR-LADTLGSENEETC 401
           +          LY+    FF   V  + S A GG S RE +   +  L+     E  E+C
Sbjct: 275 SARSGHTEDAALYRKASEFFWQTVVETRSLALGGNSRREHFAPAEDCLSYVYDREGPESC 334

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
            T NMLK++  LFR   E  YADYYERA+ N +LS Q   E G  +Y  P       AR 
Sbjct: 335 NTNNMLKLTEGLFRLNPEARYADYYERAVLNHILSTQH-PEHGGYVYFTP-------ARP 386

Query: 462 TH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
            H   +    ++ WCC GTG+E+  K G+ IY   E     LY+  +I+S  DW    V 
Sbjct: 387 AHYRVYSAPNSAMWCCVGTGMENHGKYGELIYTHTENE---LYVNLFIASELDWAERGVR 443

Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP 579
           + Q+       +  +R+T+    + E      L +R P W  +   QA LNGQ+      
Sbjct: 444 IIQETK--FPDEESVRLTI----RTEKPMKFKLLIRHPHWCRTGAMQAVLNGQDYAAASV 497

Query: 580 GNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
            +     ER W   DK+ ++LP+S+  E +    P      AIL GP LL      E
Sbjct: 498 SSSYIEIERIWKDGDKVQLELPMSVSVEEL----PNVPQYIAILRGPVLLGARMGTE 550


>gi|371776971|ref|ZP_09483293.1| Acetyl-CoA carboxylase, biotin carboxylase [Anaerophaga sp. HS1]
          Length = 794

 Score =  228 bits (582), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 163/534 (30%), Positives = 255/534 (47%), Gaps = 48/534 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A++ N +Y++  D D ++  F   A L    + YG WE   S L GHF GHYL++ + M 
Sbjct: 49  AEELNEKYVMAHDPDRILAPFLIDAGLKPKAQGYGNWEG--SGLNGHFGGHYLTSLSLMI 106

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEALKPVW 226
           AST +   ++++  +V  L+ CQ   G GY+   P             +     +L   W
Sbjct: 107 ASTGSEEARKRLDYMVDQLARCQKANGNGYVGGIPGGQAMWAEIAKGNINAGNFSLNGKW 166

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            P Y IHK+ AGL D ++LA N +A ++   + ++F N + K +T   +++    L  E 
Sbjct: 167 VPLYNIHKLFAGLRDAWLLAQNKKAKEVLINLTDWFLN-LTKNLTDDQIQK---MLVSEH 222

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GG+N+V   +Y IT +  +L LA  F     L  L  Q D L+  HANT IP VIG    
Sbjct: 223 GGLNEVFADVYDITGNENYLKLARRFSHQAILRPLLQQKDQLTGLHANTQIPKVIGFMRI 282

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
            E+  D  +     FF + V  + + + GG S  E +      +  + S +  ETC TYN
Sbjct: 283 GELAHDTAWINAADFFWNTVVQNRTVSIGGNSTHEHFHAVDDFSSMIESRQGPETCNTYN 342

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
           MLK+S+ LF +  ++ Y DYYE+AL N +LS Q     G ++Y   +     + R    +
Sbjct: 343 MLKLSKQLFLFKNDLKYIDYYEQALYNHILSSQHPLHGG-LVYFTSM-----RPRHYRVY 396

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK-- 523
                +FWCC G+GIE+  K G+ IY  ++ NV   Y+  +I S   WK   + L Q+  
Sbjct: 397 SRPEQTFWCCVGSGIENHEKYGELIYAHDDENV---YVNLFIPSILHWKEKQLKLVQENH 453

Query: 524 ---VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-P 579
              +D I      +R+     ++  VG      +R P WT        +NG+       P
Sbjct: 454 FPDIDKIT-----IRVEPQRKTEFVVG------IRCPAWTRPEDMNVLVNGKAFKGKAIP 502

Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           G++      W  ND + + LP+    + + D  P Y S   ++ GP++LA  T 
Sbjct: 503 GHYFLIRRYWEKNDVIEVHLPMHTYGKFLPDGSP-YLS---LMHGPFVLAATTD 552


>gi|408369881|ref|ZP_11167661.1| hypothetical protein I215_03228 [Galbibacter sp. ck-I2-15]
 gi|407744935|gb|EKF56502.1| hypothetical protein I215_03228 [Galbibacter sp. ck-I2-15]
          Length = 1011

 Score =  228 bits (581), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 189/618 (30%), Positives = 281/618 (45%), Gaps = 99/618 (16%)

Query: 126 LLMLDVDSLVWSFRKT--ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAST-HN 182
           L   D DS ++ FR     S P   K  G W++  ++LRGH  GHYL+A AQ +AS+ ++
Sbjct: 395 LAKTDPDSFLYMFRNAFGVSQPQDAKPLGVWDSQETKLRGHATGHYLTAIAQAYASSSYD 454

Query: 183 ATIKE----KMSTVV---FSLSECQNKI-------------------------------- 203
             +KE    KM+ +V   + LS+   +                                 
Sbjct: 455 EQLKELFAQKMNYMVETLYDLSKLSGQPINSGGEHVSDPTKVPFGPGKTDYNSDLSEQGI 514

Query: 204 -------GTGYLSAFPTELFDSFEA-------LKPVWAPYYTIHKILAGLLDQYVLADNA 249
                  GTGY+SA+P + F   E+          +WAPYYT+HKILAGLLD Y ++ N 
Sbjct: 515 RNDYWNWGTGYISAYPPDQFIMLESGATYGGQNDQIWAPYYTLHKILAGLLDVYEISGNK 574

Query: 250 QALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLA 309
           +AL +A  M ++   R+ ++ T   +      +  E GGMN+V+ RLY +T    +L +A
Sbjct: 575 KALSVAQGMGDWVSARMVELPTSTLISMWNRYIAGEYGGMNEVMARLYRLTGTESYLKVA 634

Query: 310 HLFDK-PCFLG------FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
            LFD    F G       LA   D     H+N HIP ++G+   Y  T +  Y  I   F
Sbjct: 635 GLFDNIKMFYGDAQHTHGLAKNVDTFRGLHSNQHIPQIVGALEMYRDTDEVEYFKIADNF 694

Query: 363 MDIVNASHSYATGGTS-------AREFWWDPKRLAD---TLGSENEETCTTYNMLKVSRH 412
                  + Y+ GG +       A  F   P  L +   + G +N ETC TYNMLK++R 
Sbjct: 695 WFKATHDYMYSIGGVAGARNPANAECFPVQPATLYENGFSSGGQN-ETCATYNMLKLTRD 753

Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
           LF +  +    DYYER L N +L+      P    Y +PL  G  K    H        F
Sbjct: 754 LFFFEPKAQLMDYYERGLYNHILASVAKDSPA-NTYHVPLLPGSVK----HFGNPDMTGF 808

Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
            CC GT IES +KL +SIYF+ + N   LY+  +I S+  W   ++ + Q    + S+  
Sbjct: 809 TCCNGTAIESSTKLQNSIYFKGKDN-KSLYVNLFIPSTLHWTERNIEIQQ----VTSFPK 863

Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSY 591
               TL  + K        L LR+P W  +NG   S+NG+ + +   PG++LS   +W  
Sbjct: 864 EDNTTLKVTGKGRF----DLKLRVPNWA-TNGYHVSINGKEMDIQVTPGSYLSIDRKWKN 918

Query: 592 NDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG---EWDIKTGTARSLSA 648
            D + + +P   R E + D +    +I ++ +GP LLA         W   T  A  +  
Sbjct: 919 GDIIELSMPFDFRLEPVMDQQ----NIASLFYGPVLLAAQEESPLTHWRKVTFDAEQIGK 974

Query: 649 LISPIPPS--FNAQLVTF 664
            I   P +  FN + + F
Sbjct: 975 FIKGDPSTLEFNYKGIEF 992


>gi|409196987|ref|ZP_11225650.1| Acetyl-CoA carboxylase, biotin carboxylase [Marinilabilia
           salmonicolor JCM 21150]
          Length = 788

 Score =  228 bits (581), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 160/528 (30%), Positives = 252/528 (47%), Gaps = 38/528 (7%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A+Q N +Y+   D D L+  F   A L      YG WE   S L GH  GHYL++ A M 
Sbjct: 43  AEQLNEKYVFAHDPDRLLAPFLIDAGLEPKAPGYGNWEG--SGLNGHIGGHYLTSLALMV 100

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELFD-SFEA----LKPVW 226
           AST N   +E++  ++  L+ CQ   G GY+   P       E+   + +A    L   W
Sbjct: 101 ASTGNEEAQERLDYMIEELARCQEANGNGYVGGIPGGQPMWAEIAKGNIDAGGFSLNGKW 160

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            P Y IHK+ AGL D +  A   +AL++   + ++F +    V +  S E+    L  E 
Sbjct: 161 VPLYNIHKLFAGLHDAWKYAGKEKALEILIQLTDWFID----VNSGLSDEQIQEILVSEH 216

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GG+N+V   +Y IT + K+L LA  +     L  L    D L+  HANT IP V+G    
Sbjct: 217 GGLNEVFADVYDITGEDKYLTLARQYSHRSILEPLLNHEDKLTGLHANTQIPKVVGFMRV 276

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
            E+ GD  +     FF + V ++ +   GG S  E +      +  + S +  ETC TYN
Sbjct: 277 GELAGDSAWIDASDFFWNTVVSNRTITIGGNSTHEHFHPVDDFSSMVESRQGPETCNTYN 336

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
           MLK+S+ L+ +  ++ Y DYYE+AL N +LS Q   E G ++Y  P+     + +    +
Sbjct: 337 MLKLSKQLYLYKNDLRYVDYYEQALYNHILSSQH-PEHGGLVYFTPM-----RPQHYRVY 390

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
                +FWCC G+GIE+  K G+ IY   + +V   ++  +I S  +W+   + L QK +
Sbjct: 391 SNPEETFWCCVGSGIENHEKYGELIYAHSDDDV---FVNLFIPSELNWEEKGLKLTQKTN 447

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLS 584
              +    L++ L  +    +G      +R P W      + ++NG+       PG +  
Sbjct: 448 FPDNEQTTLKVELPEARSFTIG------IRYPQWMKEGEMKVTVNGKRARGGGAPGAYYQ 501

Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
               W   D++T+ L +    E + D+ P      +I  GP++LA  T
Sbjct: 502 VKREWQDGDEITVNLKMHTSGEYLPDNSP----FLSIKHGPFVLAAVT 545


>gi|359453850|ref|ZP_09243152.1| hypothetical protein P20495_1902 [Pseudoalteromonas sp. BSi20495]
 gi|358049097|dbj|GAA79401.1| hypothetical protein P20495_1902 [Pseudoalteromonas sp. BSi20495]
          Length = 816

 Score =  228 bits (581), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 165/526 (31%), Positives = 254/526 (48%), Gaps = 38/526 (7%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
            AQQTN+ YLL L  D L+  + + A +     +YG WE+  + L GH  GHYLS+ +  
Sbjct: 63  HAQQTNVRYLLALYPDQLLAPYLREAGIEQKAPSYGNWED--TGLDGHIGGHYLSSLSLA 120

Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELFD-----SFEALKPV 225
           WA+T +  +K ++  ++  L   Q ++  GYL   P       ++ D        +L   
Sbjct: 121 WAATGDEELKRRLDYMLNELQRAQ-QVNDGYLGGIPDGQAMWQQIHDGNIKADLFSLNDR 179

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
           W P Y I KI  GL D Y++A + QA  M   + E+F N   K+    S E+    L  E
Sbjct: 180 WVPLYNIDKIFHGLRDAYLIAGSEQAKTMLFDLGEWFLNLTAKL----SDEQIQQMLYSE 235

Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
            GG+N V   + +I +D ++L LA  F     +  L  + D L+  HANT IP +IG   
Sbjct: 236 YGGLNAVFADMATIGNDKRYLKLARQFTHNNIIDPLLEKQDKLTGLHANTQIPKIIGMLK 295

Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
             E + D  ++    +F   V    S A GG S  E + D       +   E  ETC TY
Sbjct: 296 VAEASDDKAWQQGADYFWQTVTKQRSVAIGGNSVSEHFHDKNDFTPMVEDVEGPETCNTY 355

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           NM+K+S+ LF  T +  Y +YYERA  N +LS Q   E G ++Y   +  G  +  S+  
Sbjct: 356 NMMKLSKLLFLKTADTRYLEYYERATYNHILSSQH-PEHGGLVYFTSMRPGHYRMYSSVQ 414

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW-KSGHVVLNQK 523
                +S WCC G+GIE+ SK G+ IY + + N   L++  +I S+ DW + G  V  Q 
Sbjct: 415 -----DSMWCCVGSGIENHSKYGEQIYSKNDDN---LWVNLFIPSTLDWQQQGLKVTQQS 466

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
           + P  +    + + +    K+ +   + L++R P W  ++  Q  LNG+ +       + 
Sbjct: 467 LFPDAN---NITLVINTLDKKHISS-AQLHIRKPSWV-TDELQFELNGKAINATAEQGYY 521

Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           +    W   D LT  L   L TE + D +  Y    A+L+GP ++A
Sbjct: 522 AIKHDWHDGDNLTFTLAPKLYTEQLPDGQDYY----AVLYGPVVMA 563


>gi|299146241|ref|ZP_07039309.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           3_1_23]
 gi|298516732|gb|EFI40613.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           3_1_23]
          Length = 800

 Score =  228 bits (581), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 171/566 (30%), Positives = 265/566 (46%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L LD D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L+  Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A +M      WM++        
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L  + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V    S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
             +     LY+  +I S   WK   ++L Q+       D  + + +  + K++     +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETR--FPDDDKVTLRIDEAPKKK----RTL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+     +     +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKIFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565


>gi|354580825|ref|ZP_08999729.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
 gi|353201153|gb|EHB66606.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
          Length = 623

 Score =  228 bits (580), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 162/557 (29%), Positives = 260/557 (46%), Gaps = 58/557 (10%)

Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASL----PTPGKAYGGWENPISELRGHFVGHYL 170
           L R ++ N  YL+ LD   L+++++  A        P  A+GGWE P+ +LRGHF+GH+L
Sbjct: 16  LIRRERANRSYLMKLDSGHLLFNYQLEAGRFHGRTIPEGAHGGWETPVCQLRGHFLGHWL 75

Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYY 230
           S +A  +  + +  +K K+  +V  L ECQ   G  ++   P +        K +WAP Y
Sbjct: 76  SGAAMHYEKSGDMELKAKLDAIVQELHECQRDNGGQWVGPIPEKYLHWIARGKSIWAPQY 135

Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
            +HKIL GL+D +  A N QAL +     ++F N        ++ E+    L+ ETGGM 
Sbjct: 136 NLHKILMGLVDAWQYAGNRQALDIVDRFADWFVNWS----GTFTREQFDDILDVETGGML 191

Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           +V   L  IT   K+ +L   + +      L    D L++ HANT IP V+G    YEVT
Sbjct: 192 EVWADLLHITGADKYRVLLERYYRSRLFQPLLEGKDPLTNMHANTTIPEVLGCARAYEVT 251

Query: 351 GDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
           GD  +  ++  ++   V    S ATGG +A E W    ++   LG +N+E CT YNM+++
Sbjct: 252 GDDRWLSIVQAYWKCAVTERGSLATGGQTAGEVWMPKMKMKARLGDKNQEHCTVYNMIRL 311

Query: 410 SRHLFRWTKEIAYADYYERALTNGVL------------SIQRGTEPGVMIYMLPLGRGVS 457
           +  LFR T + +YA Y E  L NG++            S  +    G++ Y LP+  G+ 
Sbjct: 312 AEFLFRQTGDPSYAQYIEYNLYNGIMAQAYYQEYGLTGSQHKHPHTGLLTYFLPMKAGLR 371

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF------ 511
           K      W T+ +SF+CC+GT +++ +     IY+ ++G +  +YI QY  S        
Sbjct: 372 KE-----WSTETDSFFCCHGTMVQANAAWNKGIYY-QDGEI--IYISQYFDSELRTSIDG 423

Query: 512 ---------DWKSGHVVLN------QKVDPIVSWD---PYLRMTLTFSSKQEVGQLSSLN 553
                    D  SG ++ +      Q ++   + +   P  R    F          +L 
Sbjct: 424 TDIQIVQTQDKMSGSLLSSSNTAGYQAINDTAATNENMPAFR-KYDFIVSTAAPTTFTLR 482

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
            R+P W  +  +    +          +F      W   D ++I LP+ +R   + DD  
Sbjct: 483 FRIPEWIMAEVSVYVNDRLQGTTRDSSSFYDIHRAWKEGDTVSIMLPIGIRFVPLPDDE- 541

Query: 614 EYASIQAILFGPYLLAG 630
                 A  +GP +LAG
Sbjct: 542 ---RTGAFRYGPEVLAG 555


>gi|423287556|ref|ZP_17266407.1| hypothetical protein HMPREF1069_01450 [Bacteroides ovatus
           CL02T12C04]
 gi|392672671|gb|EIY66138.1| hypothetical protein HMPREF1069_01450 [Bacteroides ovatus
           CL02T12C04]
          Length = 800

 Score =  228 bits (580), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 172/566 (30%), Positives = 262/566 (46%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L LD D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L+  Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A +M      WM++        
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L  + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHNLILDPLIKEEDKL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V    S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
             +     LY+  +I S   WK   ++L Q+          LR+      K+      +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRIDEAPKKKR------TL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+     +     +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWEKGDVITFHLPMKVSVEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565


>gi|395772531|ref|ZP_10453046.1| glycosylase [Streptomyces acidiscabies 84-104]
          Length = 828

 Score =  228 bits (580), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 156/464 (33%), Positives = 232/464 (50%), Gaps = 32/464 (6%)

Query: 206 GYLSAFPTELFDSFEAL-----KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F   E++       VWAPYYT HKIL GLLD Y    +A+AL +A  M +
Sbjct: 340 GFLAAYPETQFIQLESMTASDYSKVWAPYYTAHKILRGLLDAYAATGDARALDLAGGMAD 399

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + ++R+ K +   +++R W   +  E GG+ + L  LY +T   +HL LA LFD    + 
Sbjct: 400 WMHSRLSK-LPGATLQRMWGLFSSGEFGGIVEALCDLYDLTGKGEHLALARLFDLDRLID 458

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L   HAN HIPI  G    Y+ TG+  Y      F D+V     Y+ GGTS 
Sbjct: 459 ACAANTDVLDGLHANQHIPIFTGYLRLYDATGEERYLAAARNFWDMVVPHRMYSIGGTSD 518

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW     +A  +   + E+C  YNMLK+SR LF   ++  Y DYYERAL N VL  +R
Sbjct: 519 AEFWRARDVVAGAISGASAESCCAYNMLKLSRALFLHAQDAKYMDYYERALFNQVLGSKR 578

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  ++ Y L L  G  +   T   GT      CC GTG+ES +K  D++YF    
Sbjct: 579 DVADAEKPLVTYFLGLNPGHVR-DYTPKQGTT-----CCEGTGLESATKYQDTVYFVAA- 631

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
           +   LY+  +  S+ +W +  V + Q  D    ++    +T+        G L  + LR+
Sbjct: 632 DGSSLYVNLFSPSTLEWAAKGVRVVQ--DTAFPFEQGTTLTV------RGGGLFEMRLRV 683

Query: 557 PVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
           PVW   +G +  +NGQ +   P PG++   +  W   D + +++P  +R E   DD    
Sbjct: 684 PVWAV-DGFRVFVNGQAVSGSPMPGSYFGVSREWRDGDVVRVEVPFRMRVERTPDD---- 738

Query: 616 ASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNA 659
           +S+QA+ +GP  L   ++    +     R+ SAL   +  SF A
Sbjct: 739 SSVQAVFYGPVNLVARSASTSYLSVALYRN-SALSGDLVSSFTA 781



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/90 (37%), Positives = 51/90 (56%), Gaps = 5/90 (5%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENPISE----LRGHFVGHYLSAS 173
           +Q  L++    DV+ L+  FR  A L T G  A GGWE    E    LRGH+ GH+L+  
Sbjct: 26  RQLMLDHARGYDVNRLLQVFRANAGLATLGAVAPGGWEGLDGEANGNLRGHYTGHFLTML 85

Query: 174 AQMWASTHNATIKEKMSTVVFSLSECQNKI 203
           +Q +AST +    EK+ T+V +L+E +  +
Sbjct: 86  SQAYASTGDEVYAEKIRTIVGALTESREAL 115


>gi|336417295|ref|ZP_08597620.1| hypothetical protein HMPREF1017_04728 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936275|gb|EGM98208.1| hypothetical protein HMPREF1017_04728 [Bacteroides ovatus
           3_8_47FAA]
          Length = 800

 Score =  228 bits (580), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 171/566 (30%), Positives = 265/566 (46%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L LD D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L+  Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A +M      WM++        
Sbjct: 147 GKIHAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L  + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V    S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
             +     LY+  +I S   WK   ++L Q+       D  + + +  + K++     +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILRQETR--FPDDDKVTLRIDEAPKKK----RTL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+     +     +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWKKGDVITFNLPMRVSMEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565


>gi|336405535|ref|ZP_08586212.1| hypothetical protein HMPREF0127_03525 [Bacteroides sp. 1_1_30]
 gi|335937406|gb|EGM99306.1| hypothetical protein HMPREF0127_03525 [Bacteroides sp. 1_1_30]
          Length = 800

 Score =  227 bits (579), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 172/566 (30%), Positives = 262/566 (46%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L LD D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L+  Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A +M      WM++        
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLAHQMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L  + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V    S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
             +     LY+  +I S   WK   ++L Q+          LR+      K+      +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRIDEAPKKKR------TL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+     +     +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSKGYSISINGKRKMFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565


>gi|423299329|ref|ZP_17277354.1| hypothetical protein HMPREF1057_00495 [Bacteroides finegoldii
           CL09T03C10]
 gi|408473138|gb|EKJ91660.1| hypothetical protein HMPREF1057_00495 [Bacteroides finegoldii
           CL09T03C10]
          Length = 800

 Score =  227 bits (579), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 174/566 (30%), Positives = 272/566 (48%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L DV L  S  L +AQQT+L Y+L L+ D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQDVKLLDSPFL-QAQQTDLHYILALNPDRLLAPFLREAGLTPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  I  +++ ++  L   Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAIYNRLNYMLDELYRAQQAVGTGFIGGTPGSLQLWKEIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVL--ADNAQALKMA--TWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+   +D A+ + +A   WM++        
Sbjct: 147 GNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYTGSDRARLMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S ++    L  E GG+N+    + +IT D K+L LA  F     L  L    D L
Sbjct: 199 ITSGLSDQQIQDMLRSEHGGLNETFADVAAITGDKKYLELARRFSHKIILDPLIKDEDRL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V  + S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAEEWDHAARFFWNTVVNNRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         +   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPADNFTSMINDVQGPETCNTYNMLRLTKMLYQNSHNPCNINEPDPNYINYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
            ++     LY+  +I S  +WK   V+L Q+       D  + + +  +SK++     +L
Sbjct: 433 HQKDT---LYVNLFIPSQLNWKEQGVILTQETR--FPDDNKVTLRIDKASKKQ----RTL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQNLPLP-PPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S+    S+NG+    P   GN +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSSNYSISINGKKETFPTKKGNQYLPLSRKWKKGDVITFNLPMKVTIEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565


>gi|160883737|ref|ZP_02064740.1| hypothetical protein BACOVA_01709 [Bacteroides ovatus ATCC 8483]
 gi|423297720|ref|ZP_17275780.1| hypothetical protein HMPREF1070_04445 [Bacteroides ovatus
           CL03T12C18]
 gi|156110822|gb|EDO12567.1| hypothetical protein BACOVA_01709 [Bacteroides ovatus ATCC 8483]
 gi|392665078|gb|EIY58610.1| hypothetical protein HMPREF1070_04445 [Bacteroides ovatus
           CL03T12C18]
          Length = 800

 Score =  227 bits (579), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 172/566 (30%), Positives = 262/566 (46%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L LD D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L+  Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A +M      WM++        
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L  + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHNLILDPLIKEEDKL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V    S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
             +     LY+  +I S   WK   ++L Q+          LR+      K+      +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRINEAPKKKR------TL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+     +     +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWEKGDVITFHLPMKVSVEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565


>gi|295133234|ref|YP_003583910.1| hypothetical protein ZPR_1378 [Zunongwangia profunda SM-A87]
 gi|294981249|gb|ADF51714.1| putative secreted protein [Zunongwangia profunda SM-A87]
          Length = 1016

 Score =  227 bits (579), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 178/607 (29%), Positives = 271/607 (44%), Gaps = 94/607 (15%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKT--ASLPTPGKAYGGWEN 156
           L +VSL      Q++     +   +  L   + DS ++ FR       P   K  G W+ 
Sbjct: 373 LDQVSLESNTNGQNTKFIENRDKFINTLAQTNPDSFLYMFRNAFGQEQPVGAKPLGVWDT 432

Query: 157 PISELRGHFVGHYLSASAQMWAST--------HNATIKEKMSTVVFSLSECQNKI----- 203
             ++LRGH  GHYL+A AQ +AST        + A   E M   ++ LS+   K      
Sbjct: 433 QETKLRGHATGHYLTAIAQAYASTGYDKALQQNFADKMEYMVNTLYQLSQMSGKPAEEGG 492

Query: 204 ----------------------------------GTGYLSAFPTELFDSFE-------AL 222
                                             G G++SA+P + F   E         
Sbjct: 493 DFNANPTAVPMGPGKEIYSSDLSEEGIRTDYWNWGEGFISAYPPDQFIMLENGAVYGTEE 552

Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
             +WAPYYT+HKILAGL+D Y ++ N +AL +A  M ++ Y R+ ++ T   +      +
Sbjct: 553 TKIWAPYYTLHKILAGLMDIYEVSGNEKALAVAEGMGDWVYARLSELPTDTLISMWNRYI 612

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
             E GGMN+ + RLY IT    +L  A LFD    F G       LA   D     HAN 
Sbjct: 613 AGEFGGMNEAMARLYRITGKDTYLETARLFDNIKVFFGDANHSHGLAKNVDTFRGLHANQ 672

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPKR 388
           HIP ++G+   Y  +  P Y  +   F       + Y+ GG +       A  F   P  
Sbjct: 673 HIPQIVGALEMYRDSDKPEYFNVADNFWVKATNDYMYSIGGVAGARNPANAECFIAQPGT 732

Query: 389 LAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
           L +   + G +N ETC TYNMLK++R+LF + +     DYYER L N +L+      P  
Sbjct: 733 LYENGLSAGGQN-ETCATYNMLKLTRNLFLYEQRPELMDYYERGLYNHILASVAEDSP-A 790

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
             Y +PL  G  K+            F CC GT +ES +KL +SIYF+   N   LY+  
Sbjct: 791 NTYHVPLRPGSKKSFGN----PNMTGFTCCNGTALESSTKLQNSIYFKGADN-KALYVNL 845

Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
           Y+ S+  W   ++ L Q+ +     + + ++T+    K +      L LR+P W  +NG 
Sbjct: 846 YVPSTLHWHEKNIELTQETN--FPKEDHTKLTINGKGKFD------LKLRVPGWA-TNGF 896

Query: 566 QASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
              +NG++  +   PG +LS + +W   D + +Q+P     + I D +    +I ++ +G
Sbjct: 897 TVKINGKDQKVKATPGTYLSLSRKWKDGDTVELQMPFGFYLDPIMDQQ----NIASLFYG 952

Query: 625 PYLLAGH 631
           P LLA  
Sbjct: 953 PVLLAAQ 959


>gi|239627978|ref|ZP_04671009.1| secreted protein [Clostridiales bacterium 1_7_47_FAA]
 gi|239518124|gb|EEQ57990.1| secreted protein [Clostridiales bacterium 1_7_47FAA]
          Length = 822

 Score =  227 bits (578), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 161/566 (28%), Positives = 268/566 (47%), Gaps = 52/566 (9%)

Query: 101 EVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPIS 159
           EV    V L + +  W AQ+  + +LL +D D ++++FR  A L   G     GW+ P  
Sbjct: 225 EVPAGSVRLSEGTRFWDAQERMIRWLLSVDDDQMLYNFRSAAGLDVRGAGPMTGWDAPEC 284

Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----GTGYLSAFPTE 214
            L+GH  GHYLS  A   +      +K+K++ +V +L+ECQ  +       G+LSA+  +
Sbjct: 285 NLKGHTTGHYLSGLALACSVHGQPELKDKINYMVNALAECQKALEAKGCAKGFLSAYSEQ 344

Query: 215 LFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
            FD  E       +WAPYYT+ KI++GL D Y LA + +A  + T + ++ Y R+ + ++
Sbjct: 345 QFDLLEVYTRYPEIWAPYYTLDKIMSGLYDCYCLAGSKEAFHLLTGLGDWIYGRLSR-LS 403

Query: 272 MYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
              +++ W   +  E GGM  V+ RLY  T D ++   A  F        +    D L  
Sbjct: 404 RAQLDKMWSMYIAGEFGGMISVMVRLYRETGDGRYRRAALFFRNEKLFYPMEENVDTLKD 463

Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
            HAN HIP  IG+   Y+  G   Y  I   F  +V  SH Y+ GG    E + +P  +A
Sbjct: 464 MHANQHIPQAIGALELYKAGGGKRYLAIARNFWQMVVRSHEYSIGGVGETEMFHEPGDIA 523

Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
             +  ++ E+C +YN+++++  LF  + +    DYYE  L N +LS       G   Y +
Sbjct: 524 HYMTDKSAESCASYNLMRLTFGLFGLSPDSRKMDYYENVLYNHILSSASHKADGGTTYFM 583

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           P+  G  K      + T  N+  CC+GTG+ES  +   +IY   E +   +Y+  YI S 
Sbjct: 584 PVRPGGRKE-----FNTSENT--CCHGTGLESRFRYIRNIYAAGE-DKKEVYVNLYIPSE 635

Query: 511 FDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT--------- 560
            D + G  + L +       +       +TF+  ++ G+  ++ LR+P W          
Sbjct: 636 LDMEDGWKLKLEEDARTQGGY------RITFNGPKDGGE-RTVALRIPCWAGEDWDIRIH 688

Query: 561 --YSNGAQA---------SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
             +  GA+A         +   Q   +   G ++    +W  +D++ I+LP   R     
Sbjct: 689 TVHPEGAEADGLAKTDAVTEASQGFTVDSDG-YVRIRRQWMPDDRMEIRLPFRFRKLPA- 746

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
              P+ ++  ++ +GPY+LA    GE
Sbjct: 747 ---PDGSAYSSVAYGPYILAALNDGE 769


>gi|237722208|ref|ZP_04552689.1| acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
 gi|229448018|gb|EEO53809.1| acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
          Length = 800

 Score =  226 bits (577), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 172/566 (30%), Positives = 261/566 (46%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L LD D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L+  Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAVYSRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A +M      WM++        
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L    D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKDEDKL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V    S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNNTQEPDPNYVNYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
            ++     LY+  +I S   WK   + L Q+          LR+      K+      +L
Sbjct: 433 HQKDT---LYVNLFIPSQLTWKEQGITLTQETRFPDDGKVTLRIDEAHKKKR------TL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+     +     +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKIFVMGKGNQYLPLSRKWKKGDVVTFNLPMKVTMEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565


>gi|359776490|ref|ZP_09279799.1| hypothetical protein ARGLB_045_00070 [Arthrobacter globiformis NBRC
           12137]
 gi|359306199|dbj|GAB13628.1| hypothetical protein ARGLB_045_00070 [Arthrobacter globiformis NBRC
           12137]
          Length = 1025

 Score =  226 bits (577), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 151/433 (34%), Positives = 217/433 (50%), Gaps = 31/433 (7%)

Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F   E+        VWAPYYT HKIL GLLD Y      +AL +AT + +
Sbjct: 391 GFLAAYPETQFIELESRTTPDYFRVWAPYYTAHKILKGLLDAYTATAEPKALDLATGLCD 450

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + ++R+ K +T    +R W   +  E GG+ + +   Y  +  P+HL LA  FD    + 
Sbjct: 451 WMHSRLSK-LTPAVRQRMWGIFSSGEYGGVVEAILETYGHSGKPEHLELAKYFDLDSLID 509

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L+  HAN HIPI  G  + Y  TG+  Y      F  +V  +  ++ GGTS 
Sbjct: 510 ACAQDKDILAGLHANQHIPIFTGLVLMYNATGEERYLAAARNFWTMVVPTRMFSIGGTSQ 569

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW +  R+A TL + + E+C  YNMLK+SR LF   +  AY DYYERAL N VL  ++
Sbjct: 570 GEFWKERDRIAATLNATDAESCCAYNMLKLSRELFFREQNPAYMDYYERALFNQVLGSKQ 629

Query: 440 GTEPG---VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
             E     +  Y + L  G  +   T   GT      CC GTG+ES +K  DS+YF   G
Sbjct: 630 DKESAELPLATYFIGLQPGAVR-DFTPKQGTT-----CCEGTGLESATKYQDSVYF-TAG 682

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
           +   LY+  Y+ S+  W + +V + Q+     S+    R TL  +     GQ   L LR+
Sbjct: 683 DGSALYVNLYMPSTLRWAAKNVTVTQQ----TSYPFEQRTTLQVAGS---GQF-ELRLRV 734

Query: 557 PVWTYSNGAQASLNGQ-NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
           P W  + G    +NG        PG +LS    W   D + +++P +LR E   DD    
Sbjct: 735 PAWA-TAGFTVRVNGAVTEAAATPGTYLSIARAWKNGDTVDVEMPFTLRAERALDD---- 789

Query: 616 ASIQAILFGPYLL 628
            S+Q +++GP  L
Sbjct: 790 PSVQTLMYGPVHL 802



 Score = 49.3 bits (116), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 50/116 (43%), Gaps = 15/116 (12%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL-------PTPGKAY 151
           ++   L DV L    V  R ++  L +    D    V  FR  A L       P P    
Sbjct: 49  VRPFKLSDVSLG-PGVFARKRELILNFARGYDERRYVNVFRANAGLRPLDGVVPLPA--- 104

Query: 152 GGWENPISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
           GGWE    E    LRGHF GH++S  AQ +A T       K+  +V SL EC+  +
Sbjct: 105 GGWEGLDGEANGNLRGHFTGHHMSMLAQAYAGTGEEVFGTKLRNLVASLHECRQAL 160


>gi|332662487|ref|YP_004445275.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332331301|gb|AEE48402.1| protein of unknown function DUF1680 [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 793

 Score =  226 bits (577), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 169/557 (30%), Positives = 263/557 (47%), Gaps = 59/557 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L EVSL D           A+  N++ LL  D+D L+  +RK A LP    +Y  W+   
Sbjct: 32  LAEVSLLD------GPFKHARDLNIQTLLQYDIDRLLNPYRKEAGLPEKAASYPNWDG-- 83

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-------GTGYLSAF 211
             L GH  GHYLSA A M A+T NA  +++++ ++  L  CQ          G GYL   
Sbjct: 84  --LDGHVGGHYLSAMA-MNAATGNAECRKRLAYMLSELKACQEAHALKHPAWGIGYLGGV 140

Query: 212 P--TELFDSFE-----ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVE 260
           P   E++ +F+     AL+  W P+Y +HK+ +GL D ++   +  A    L    W + 
Sbjct: 141 PKSAEIWSTFKNGDFKALRAAWVPWYNVHKLYSGLRDAWLYTGDETAKTLFLDFCDWGIA 200

Query: 261 YFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF 320
              N  +    M S+      L+ E GGMN++    Y +T D K+L  A  F     L  
Sbjct: 201 ITANLSEA--QMQSM------LDIEHGGMNEIFADAYQMTGDEKYLKAAKGFSHQALLDP 252

Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR 380
           +++  D L + HANT +P  +G Q   E++ +  Y   G FF + V +  S A GG S R
Sbjct: 253 MSMGKDNLDNKHANTQVPKAVGFQRIAELSKEDKYAKAGRFFWETVTSKRSLALGGNSRR 312

Query: 381 EFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
           EF+       D +   E  E+C +YNMLK++  LFR      Y DYYER L N +LS Q 
Sbjct: 313 EFFPSIAAGRDFVHDVEGPESCNSYNMLKLTEELFRANPSGHYIDYYERTLYNHILSTQH 372

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
             E G  +Y  P     ++ R    +       WCC G+G+E+  K    IY +++ +  
Sbjct: 373 -PEHGGYVYFTP-----ARPRHYRVYSAPNQGMWCCVGSGMENHGKYNQLIYTQQKDS-- 424

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            L++  +I+S+ +W++  +VL Q+ +     +   ++T+T     E     +L +R P W
Sbjct: 425 -LFLNLFIASALNWRAKGIVLKQQTN--FPEEEQTKLTIT-----EGRARFTLMIRYPSW 476

Query: 560 TYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
             +   Q  +N + +     P  +++    W   D + I LP+    E +  + PEY   
Sbjct: 477 VQAGALQIRVNNKRVTYTTSPSAYVAIKRLWKKGDVVQIVLPMRNTLEHLT-NAPEYV-- 533

Query: 619 QAILFGPYLLAGHTSGE 635
            A+L GP LL   T  E
Sbjct: 534 -ALLHGPILLGAKTGTE 549


>gi|383112514|ref|ZP_09933306.1| hypothetical protein BSGG_0614 [Bacteroides sp. D2]
 gi|313693079|gb|EFS29914.1| hypothetical protein BSGG_0614 [Bacteroides sp. D2]
          Length = 800

 Score =  226 bits (577), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 176/566 (31%), Positives = 262/566 (46%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L L+ D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQNVKLLDSPFL-QAQQTDLHYILALNPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L   Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKDIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A KM      WM++        
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARKMLIDLTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L    D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKLILDPLIKDEDKL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V    S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKSWSHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNNTQEPDPNYVNYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
            +      LYI  +I S   WK   V L Q+          LR+      K+      +L
Sbjct: 433 HQRDT---LYINLFIPSQLTWKEQGVTLTQETRFPDDGKVTLRIDEAPKKKR------TL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQ-NLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+  + +   GN +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSKGYSISINGKRKIFIMAKGNQYLPLSRKWKKGDVITFNLPMRVSMEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565


>gi|374992692|ref|YP_004968187.1| hypothetical protein SBI_09938 [Streptomyces bingchenggensis BCW-1]
 gi|297163344|gb|ADI13056.1| hypothetical protein SBI_09938 [Streptomyces bingchenggensis BCW-1]
          Length = 769

 Score =  226 bits (577), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 170/541 (31%), Positives = 253/541 (46%), Gaps = 55/541 (10%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ T L+YLL LD D L+   R+ A LP   ++YG WE+  S L GH VGH LS +A M 
Sbjct: 19  AQATALDYLLSLDTDRLLAPLRREAGLPPVAESYGNWES--SGLDGHTVGHALSGAALMS 76

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF------------DSFEALKPV 225
           A T +   +  +  +V  + ECQ+ +GTGY+   P  +             DSFE L   
Sbjct: 77  AVTDDPRPRAMVDRLVQGVVECQDALGTGYVGGVPDGVRLWQRVAAGQVERDSFE-LGGA 135

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
           W P+Y +HK+ AGLLD Y    +  AL     + +++     +V      + H   L  E
Sbjct: 136 WVPWYNLHKLFAGLLDAYRHTGSEPALTAVRRLADWW----GRVAAGMDDDTHEAMLRTE 191

Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
            GGM +VL  L  +T   ++  LA  F     L  L    D L   HANT I  V+G Q 
Sbjct: 192 FGGMCEVLADLAEVTGTDRYAALARRFLDQSLLRPLCEHRDVLDGMHANTQIAKVVGYQR 251

Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
             EV  DP  +    FF   +    + + GG S RE        +  L S E  ETC TY
Sbjct: 252 LGEVVDDPGLRDAARFFWQAMTRHRTVSFGGNSVREHLHPRDDFSSALQSPEGPETCNTY 311

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLGRGVSKARSTH 463
           NMLK+SR LF    +    D+YERA  N +LS     +P G ++Y  P+  G  +  S  
Sbjct: 312 NMLKLSRALFLERPDTEVLDHYERATVNHILS---SLQPKGGLVYFTPVRPGHYRVVS-- 366

Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
              T  N FWCC GTG+E+ +K G+ +Y  E  +   L++  +I+S       ++VL Q 
Sbjct: 367 ---TPQNCFWCCVGTGLENHAKYGELVYTTEGDD---LFVNLFIASRLSRPEQNLVLEQT 420

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG---QNLPLP--- 577
                 +D  +R+ +  +    +     +++R+P W +    Q  +NG   ++ P P   
Sbjct: 421 G--TAPYDEEVRLVVRGAPATPL----PIHIRVPGW-HEGTPQIRINGAPPEDGPGPLTT 473

Query: 578 ------PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
                  P  ++    +W   D +T++L   +  E + D  P + S +   FGP +LA  
Sbjct: 474 RRAAGGQPLTYVRLERQWCEGDTVTMRLRPRISAELLPDGSP-WVSYR---FGPSVLAAE 529

Query: 632 T 632
           +
Sbjct: 530 S 530


>gi|386820708|ref|ZP_10107924.1| putative glycosyl hydrolase of unknown function (DUF1680)
           [Joostella marina DSM 19592]
 gi|386425814|gb|EIJ39644.1| putative glycosyl hydrolase of unknown function (DUF1680)
           [Joostella marina DSM 19592]
          Length = 1018

 Score =  226 bits (576), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 186/633 (29%), Positives = 279/633 (44%), Gaps = 97/633 (15%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--YGGWEN 156
           L EVSL        S     +   +  L   + D+ ++ FR T   P P  A   G W++
Sbjct: 375 LDEVSLDVDTHGHESKFIENRDKFISTLAQTNPDAFLYMFRNTFGQPQPDAAEPLGVWDS 434

Query: 157 PISELRGHFVGHYLSASAQMWAST--------HNATIKEKMSTVVFSLSECQNKI----- 203
             ++LRGH  GHYL+A AQ +AST        + A   E M   ++ L++          
Sbjct: 435 QETKLRGHATGHYLTAIAQAYASTGYDKSLQNNFADKMEYMVNTLYKLAQMSGNPKTKDG 494

Query: 204 ----------------------------------GTGYLSAFPTELFDSFE-------AL 222
                                             G G++SA+P + F   E         
Sbjct: 495 SYVANPTEVPPGPGKSNYDSDLSEDGIRTDYWNWGEGFISAYPPDQFIMLENGATYGGQQ 554

Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
             VWAPYYT+HKILAGLLD Y ++ N +AL++A  M  + Y R+ ++ T   +      +
Sbjct: 555 TQVWAPYYTLHKILAGLLDIYEVSGNKKALEVAEGMGSWVYARLNELPTETLISMWNRYI 614

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
             E GGMN+V+ RLY +T + K+L +A LFD    F G       LA   D     HAN 
Sbjct: 615 AGEFGGMNEVMARLYRLTDEEKYLQVAQLFDNIKVFYGDANHSNGLAKNVDTFRGLHANQ 674

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPKR 388
           HIP ++G+   Y  +    Y  I   F       + Y+ GG +       A  F   P  
Sbjct: 675 HIPQIVGAIEMYRDSNTAEYYRIADNFWFKSKNDYMYSIGGVAGARNPANAECFISQPAT 734

Query: 389 LAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
           + +   + G +N ETC TYNMLK++R+LF + +   Y DYYER L N +L+      P  
Sbjct: 735 IYENGLSAGGQN-ETCATYNMLKLTRNLFLFDQRAEYMDYYERGLYNHILASVAEKTPA- 792

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
             Y +PL  G  K    H        F CC GT IES +KL +SIYF+   N   LY+  
Sbjct: 793 NTYHVPLRPGSVK----HFGNPDMKGFTCCNGTAIESSTKLQNSIYFKSVEN-DALYVNL 847

Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
           Y+ S+  W    + + QK       + + ++T+  + K +      L +R+P W  + G 
Sbjct: 848 YVPSTLHWAEKKLTITQKT--AFPKEDFTQLTINGNGKFD------LKVRVPNWA-TKGF 898

Query: 566 QASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
              +NG+   +   PG++L+    W   D + +++P     E+I D +    +I ++ +G
Sbjct: 899 IVKINGKEEKVEAIPGSYLTLNRTWKDGDTVELKMPFQFHLESIMDQQ----NIASLFYG 954

Query: 625 PYLLAGHTS---GEWDIKTGTARSLSALISPIP 654
           P LL    S    EW   T     +   IS  P
Sbjct: 955 PILLVAQESEPRTEWRKVTFDKNEIGKDISGDP 987


>gi|302561993|ref|ZP_07314335.1| secreted protein [Streptomyces griseoflavus Tu4000]
 gi|302479611|gb|EFL42704.1| secreted protein [Streptomyces griseoflavus Tu4000]
          Length = 950

 Score =  226 bits (576), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 153/466 (32%), Positives = 229/466 (49%), Gaps = 38/466 (8%)

Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F + E++       VWAPYYT HKIL GLLD Y+  D+ +AL +A+ M +
Sbjct: 399 GFLAAYPETQFIALESMTGSDYTRVWAPYYTAHKILRGLLDAYLATDDERALDLASGMCD 458

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + + R+  V+   +++R W   +  E GG+ + +  L+++T  P+HL LA LFD    + 
Sbjct: 459 WMHARLS-VLPAATLQRMWGLFSSGEFGGIVEAVCDLHALTGRPEHLALARLFDLDRLID 517

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L   HAN HIP+  G    ++ TG+  Y      F  +V    +YA GGTS+
Sbjct: 518 ACAADTDVLEGLHANQHIPVFTGLVRLHDETGEQRYLTAAKNFWGMVVPHRTYAIGGTSS 577

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW     +A T+G    E+C  YNMLK+SR LF   ++ AY DYYER L N VL  ++
Sbjct: 578 GEFWKARGVIAGTIGDTTAESCCAYNMLKLSRALFFHEQDPAYMDYYERTLYNQVLGSKQ 637

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  ++ Y + L  G  +        T      CC GTG+ES +K  DS+YF +  
Sbjct: 638 DRPDAEKPLVTYFVGLTPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYFAKA- 690

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLR 555
           +   LY+  Y  S   W    V + Q       +      TLT       G+ S +L LR
Sbjct: 691 DGSALYVNLYSDSRLAWAEKGVTVTQS----TRYPEEQGSTLTIGG----GRASFTLLLR 742

Query: 556 MPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           +P W  + G + ++NG+ +P  P PG +   +  W   D + I +P  LR E   DD   
Sbjct: 743 VPSWA-TAGFRVTVNGRAVPGAPVPGRYFGVSRSWRDGDTVRISVPFRLRVEKAPDD--- 798

Query: 615 YASIQAILFGPYLLAGHTSGEWDIK------TGTARSLSALISPIP 654
              +QA+  GP  L     G   ++       G +  L   ++P+P
Sbjct: 799 -PGLQALFLGPVCLVARRPGPEPVRFGLYGNAGLSGDLLPSLTPVP 843



 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 6/110 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           ++   L DV L    V    ++  L++    DV+ L+  FR  A L T G  A GGWE  
Sbjct: 60  VRPFGLEDVTLG-PGVFAAKRRLMLDHARGYDVNRLLQVFRANAGLSTRGAVAPGGWEGL 118

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
             E    LRGH+ GH+L+  AQ   ST      +++ TVV +L E +  +
Sbjct: 119 DGEANGNLRGHYTGHFLTMLAQAHRSTGEQVFADRIDTVVGALVEVREAL 168


>gi|423213125|ref|ZP_17199654.1| hypothetical protein HMPREF1074_01186 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392694381|gb|EIY87609.1| hypothetical protein HMPREF1074_01186 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 800

 Score =  226 bits (575), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 173/566 (30%), Positives = 267/566 (47%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L LD D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L+  Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A +M      WM++        
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L  + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V    S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
             +     LY+  +I S   WK   + L Q+       D  + + +  + K++     +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGITLTQET--CFPDDGKVTLRIDEAPKKK----HTL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQ-NLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+  + +   GN +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFIMAKGNQYLPLSRKWKKGDVVTFHLPMKVSVEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565


>gi|443629445|ref|ZP_21113773.1| putative Secreted protein [Streptomyces viridochromogenes Tue57]
 gi|443337063|gb|ELS51377.1| putative Secreted protein [Streptomyces viridochromogenes Tue57]
          Length = 941

 Score =  226 bits (575), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 157/463 (33%), Positives = 228/463 (49%), Gaps = 31/463 (6%)

Query: 206 GYLSAFPTELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F   E+        VWAPYYT HKIL G+LD Y+  D+A+AL +A+ M +
Sbjct: 390 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGVLDAYLATDDARALDLASGMCD 449

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + Y+R+ K +   +++R W   +  E GG+ + +  L++IT   +HL LA LFD    + 
Sbjct: 450 WMYSRLSK-LPEATLQRMWGLFSSGEFGGIVEAVCDLHTITGKAEHLALAQLFDLDRLID 508

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L   HAN HIPI  G    Y+ TG+  Y      F  +V     Y  GGTS 
Sbjct: 509 NCAANTDILDGLHANQHIPIFTGYLRLYDATGEQRYLDAARNFWGMVVPHRMYGIGGTST 568

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW     +A T+ + N ETC  YNMLK+SR LF   ++  Y DYYERAL N VL  ++
Sbjct: 569 GEFWKARDVIAGTISATNAETCCAYNMLKLSRTLFFHEQQPKYMDYYERALFNQVLGSKQ 628

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  ++ Y + L  G  +   T   GT      CC GTG+ES +K  DS+YF+   
Sbjct: 629 DKADAEKPLVTYFIGLTPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYFKAA- 681

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
           +   LY+  Y  S   W    V + Q      ++      TLT           +L LR+
Sbjct: 682 DGSALYVNLYSPSRLAWAEKGVTVTQ----TTAFPREQGTTLTIGGGSAA---FALRLRV 734

Query: 557 PVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
           P W  + G + ++NG  +   P PG++ + +  W   D + I +P  LR E   DD    
Sbjct: 735 PSWA-TAGFRVTVNGSAVSGTPKPGSYFTVSRTWRSGDTVRISMPFRLRVEKAIDD---- 789

Query: 616 ASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFN 658
            S+Q + +GP  L G  S    ++ G  R+ + L   + PS  
Sbjct: 790 PSLQTLFYGPVNLVGRNSATSYLQLGLYRN-AGLSGDLLPSLT 831



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 58/110 (52%), Gaps = 6/110 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           ++  +L DV L +  +    +Q  L++    DV+ L+  FR  A L T G  A GGWE  
Sbjct: 51  VQPFALDDVAL-RPGLFADKRQLMLDHARGYDVNRLLQVFRANAGLSTGGAVAPGGWEGL 109

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
             E    LRGH+ GH+L+  +Q +A T      +++ T+V +L+E +  +
Sbjct: 110 DGEANGNLRGHYTGHFLTMLSQAYAGTGEQVFVDRIRTMVGALTEVREAL 159


>gi|344201935|ref|YP_004787078.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
 gi|343953857|gb|AEM69656.1| protein of unknown function DUF1680 [Muricauda ruestringensis DSM
           13258]
          Length = 1022

 Score =  226 bits (575), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 182/629 (28%), Positives = 285/629 (45%), Gaps = 97/629 (15%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKT--ASLPTPGKAYGGWEN 156
           L +VSL+     Q +     +   +  L+  + DS ++ FR       P   K  G W++
Sbjct: 379 LDQVSLNADAHGQQTKFIENRDKFINTLVQTNPDSFLYMFRNAFGQEQPEGAKPLGVWDS 438

Query: 157 PISELRGHFVGHYLSASAQMWASTH-----NATIKEKMS---TVVFSLSE---------- 198
             ++LRGH  GHYL+A AQ +AST       A   +KM+    V++ LS+          
Sbjct: 439 QETKLRGHATGHYLTAIAQAYASTGYDKALQANFADKMNYMVDVLYQLSQMSGQSAKAGG 498

Query: 199 ----------------------CQNKI-------GTGYLSAFPTELFDSFE-----ALKP 224
                                  +N I       G G++SA+P + F   E       +P
Sbjct: 499 EHVADPTAVPPGPGKSTYDSDLSENGIRTDYWNWGEGFISAYPPDQFIMLENGATYGTQP 558

Query: 225 --VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
             VWAPYYT+HKILAGL+D Y ++ N +AL++A  M ++ Y R+ ++ T   +      +
Sbjct: 559 TQVWAPYYTLHKILAGLMDIYEVSGNEKALEIAKGMGDWVYARLSQLPTDTLISMWNTYI 618

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
             E GGMN+ + RL  IT +P++L +A LFD    F G       LA   D     HAN 
Sbjct: 619 AGEFGGMNEAMARLDRITDEPRYLKVAQLFDNIKMFFGDAEHSHGLARNVDSFRGLHANQ 678

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG-------TSAREFWWDPKR 388
           HIP ++G+   Y  +  P Y  +   F       + Y+ GG       T+A  F   P  
Sbjct: 679 HIPQIVGALEIYRDSESPEYYQVADNFWYKAKNDYMYSIGGVAGARNPTNAECFIAQPAT 738

Query: 389 LAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
           L +   + G +N ETC TYNMLK++++LF + +     DYYER L N +L+      P  
Sbjct: 739 LYENGFSSGGQN-ETCATYNMLKLTKNLFLFDQRTELMDYYERGLYNHILASVAEDSP-A 796

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
             Y +PL  G  K        +    F CC GT +ES +KL +SIYF+ + N   LY+  
Sbjct: 797 NTYHVPLRPGSVKRFGN----SDMTGFTCCNGTALESSTKLQNSIYFKSQDNST-LYVNL 851

Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
           ++ S+  W    + + QK     ++       LT   K +      LN+R+P W  + G 
Sbjct: 852 FVPSTLKWAEKDITVEQK----TAFPKEDNTQLTIKGKGKF----DLNIRVPQWA-TKGF 902

Query: 566 QASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
              +NG+   +   PG +L+ + +W   D + +++P     + + D +    +I ++ +G
Sbjct: 903 FVKINGKEEKVEAKPGTYLTLSRKWKDGDVIDLKMPFQFHLDPVMDQQ----NIASLFYG 958

Query: 625 PYLLAGHT---SGEWDIKTGTARSLSALI 650
           P LL         EW   T  A  +   I
Sbjct: 959 PVLLVAQEPEPRNEWRKITLDAEDIGKTI 987


>gi|86140890|ref|ZP_01059449.1| putative secreted protein [Leeuwenhoekiella blandensis MED217]
 gi|85832832|gb|EAQ51281.1| putative secreted protein [Leeuwenhoekiella blandensis MED217]
          Length = 1004

 Score =  225 bits (574), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 181/607 (29%), Positives = 276/607 (45%), Gaps = 94/607 (15%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--YGGWEN 156
           L EV+L++  L   S     +   ++ L   + DS ++ FR       P  A   G W+ 
Sbjct: 360 LNEVNLNNTSLGDHSKFIENRNKFIDTLAQTNPDSFLYMFRNAFGQEQPEGATPLGVWDT 419

Query: 157 PISELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVV---FSLSECQNKI----- 203
             ++LRGH  GHYL+A AQ +AST          ++KM+ +V   + LS+   K      
Sbjct: 420 QETKLRGHATGHYLTAIAQAYASTGYDKALQKNFEDKMNYMVNTLYDLSQLSGKPKTEGG 479

Query: 204 ----------------------------------GTGYLSAFPTELFDSFE-------AL 222
                                             G G++SA+P + F   E         
Sbjct: 480 AYVEDPSSVPPGPGSTAYTSDLSEDGIRTDYWNWGKGFISAYPPDQFIMLEHGAKYGGQE 539

Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
             VWAPYYT+HKILAGL+D Y ++ N +AL++A  M  + + R+ K+ T   +      +
Sbjct: 540 TQVWAPYYTLHKILAGLIDVYEVSGNPKALQVAEGMAAWVHTRLSKLPTETLITMWNTYI 599

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
             E GG+N+ L  L+ IT   ++L  A LFD    F G       LA   D     HAN 
Sbjct: 600 AGELGGINESLAHLHRITGKSEYLETAKLFDNIKVFYGDAEHTHGLAKNVDTYRGLHANQ 659

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPKR 388
           HIP ++G+   Y  +  P Y  I   F       + Y+ GG +       A  F   P  
Sbjct: 660 HIPQIMGALELYRNSNSPEYYHIADNFWYKTKNDYMYSIGGVAGARNPANAECFVAQPAT 719

Query: 389 LAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
           L +   + G +N ETC TYNMLK++R LF + ++    DYYE+AL N +L+      P  
Sbjct: 720 LYENGLSAGGQN-ETCGTYNMLKLTRGLFFYNQQPELMDYYEQALYNQILASVAENSPA- 777

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
             Y +PL  G  K  S        + F CC GT IES +KL +SIYF+   N   LY+  
Sbjct: 778 NTYHIPLRPGSRKQFSN----ADMSGFTCCNGTAIESSTKLQNSIYFKSVDN-KALYVNL 832

Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
           ++ S+  WK   VV+ Q+       + + ++T+    K E      LNLR+P W  + G 
Sbjct: 833 FVPSTLTWKEQDVVITQETS--FPREDHTKLTVNGKGKFE------LNLRIPGWATA-GV 883

Query: 566 QASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           +  +NG+   +    G++LS   +W   D + +++P +   + I D      +I ++ +G
Sbjct: 884 ELKINGKTQKIAIEAGSYLSLDRKWKNGDTIELKMPFTFHLDPIMDQE----NIASLFYG 939

Query: 625 PYLLAGH 631
           P LLA  
Sbjct: 940 PVLLAAQ 946


>gi|298484121|ref|ZP_07002288.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           D22]
 gi|298269711|gb|EFI11305.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
           D22]
          Length = 776

 Score =  225 bits (574), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 173/566 (30%), Positives = 267/566 (47%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L LD D L+  F + A L     +Y  WEN  + L G
Sbjct: 6   LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 62

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L+  Q  +GTG++   P   +L+   +A
Sbjct: 63  HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 122

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A +M      WM++        
Sbjct: 123 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 174

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L  + D L
Sbjct: 175 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 234

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V    S   GG S RE
Sbjct: 235 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 294

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYNML++++ L++ +         +  Y +YYERAL N
Sbjct: 295 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 354

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 355 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 408

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
             +     LY+  +I S   WK   + L Q+       D  + + +  + K++     +L
Sbjct: 409 YRKDT---LYVNLFIPSQLTWKEQGITLTQET--CFPDDGKVTLRIDEAPKKK----RTL 459

Query: 553 NLRMPVW-TYSNGAQASLNGQ-NLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+  + +   GN +L  + +W   D +T  LP+ +  E I 
Sbjct: 460 MIRIPEWANQSKGYSVSINGKRKMFIMAKGNQYLPLSRKWKKGDVVTFHLPMKVSVEQIP 519

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 520 DKKDYY----AFLYGPIVLAASTGTE 541


>gi|126348374|emb|CAJ90096.1| conserved hypothetical protein [Streptomyces ambofaciens ATCC
           23877]
          Length = 942

 Score =  225 bits (574), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 152/438 (34%), Positives = 221/438 (50%), Gaps = 40/438 (9%)

Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F   E++       VWAPYYT HKIL GLLD ++   + +AL +A+ + +
Sbjct: 391 GFLAAYPETQFVELESMTGSDYTRVWAPYYTAHKILRGLLDAHLATGDGRALDLASGLCD 450

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + Y+R+ K +   +++R W   +  E GG+ + +  L+++T +  HL LA LFD    + 
Sbjct: 451 WMYSRLSK-LPAATLQRMWGLFSSGEFGGIVEAICDLHAVTGEAHHLALARLFDLDRLID 509

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L   HAN HIPI  G    ++ TG+  Y      F  +V     YA GGTS 
Sbjct: 510 ACAADDDVLDGLHANQHIPIFTGLVRLHDATGEERYLTAAKNFWGMVVPHRMYAIGGTST 569

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW     +A TLG+   E+C  YNMLK+SR LF   ++ AY DYYERAL N VL  ++
Sbjct: 570 GEFWQARDVIAGTLGATTAESCCAYNMLKLSRTLFFHEQDPAYMDYYERALYNQVLGSKQ 629

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF-EEE 495
                E  ++ Y + L  G  +        T      CC GTG+ES +K  DS+YF   +
Sbjct: 630 DAADAEKPLVTYFVGLTPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYFAAAD 683

Query: 496 GNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR---MTLTFSSKQEVGQLS-S 551
           GN   LY+  Y  S+  W    V + Q  D       Y R    TLT       G  S +
Sbjct: 684 GNA--LYVNLYSRSTLTWAERGVTVTQDTD-------YPREQGSTLTLGG----GSASFA 730

Query: 552 LNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
           L LR+P W  + G + ++NG  +P    PG++ + +  W   D + +++P  LR E   D
Sbjct: 731 LRLRVPAWA-TAGFRVTVNGHAVPGTATPGSYFTVSRTWRRGDTVRVRVPFRLRVEKALD 789

Query: 611 DRPEYASIQAILFGPYLL 628
           D     S+QA+  GP  L
Sbjct: 790 D----PSLQALFLGPVHL 803



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 56/110 (50%), Gaps = 6/110 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           ++   L DV L +  V    ++  L++    DVD L+  FR  A L T G  A GGWE  
Sbjct: 52  VRPFGLEDVTLGRG-VFADKRRLMLDHARGYDVDRLLQVFRANAGLSTLGAVAPGGWEGL 110

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
             E    LRGH+ GH+L+  AQ    T      E+++++V +L+E +  +
Sbjct: 111 DGEANGNLRGHYTGHFLTMLAQAHRGTGEEVFAERITSMVTALTEVRESL 160


>gi|262407626|ref|ZP_06084174.1| acetyl-CoA carboxylase [Bacteroides sp. 2_1_22]
 gi|294644495|ref|ZP_06722254.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|294808396|ref|ZP_06767149.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345511903|ref|ZP_08791442.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
 gi|262354434|gb|EEZ03526.1| acetyl-CoA carboxylase [Bacteroides sp. 2_1_22]
 gi|292640162|gb|EFF58421.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|294444324|gb|EFG13038.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345453983|gb|EEO49450.2| acetyl-CoA carboxylase [Bacteroides sp. D1]
          Length = 800

 Score =  225 bits (573), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 172/566 (30%), Positives = 267/566 (47%), Gaps = 64/566 (11%)

Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
           L +V L  S  L +AQQT+L Y+L LD D L+  F + A L     +Y  WEN  + L G
Sbjct: 30  LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86

Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
           H  GHYLSA + M+A+T +  +  +++ ++  L+  Q  +GTG++   P   +L+   +A
Sbjct: 87  HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
                    L   W P Y IHK  AGL D Y+ A +  A +M      WM++        
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198

Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
           + +  S E+    L  E GG+N+    +  IT D K+L LA  F     L  L  + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
           +  HANT IP VIG +   E++ D         +     FF + V    S   GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318

Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
            +         L   +  ETC TYN+L++++ L++ +         +  Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNILRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E+ +K G+ IY 
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
             +     LY+  +I S   WK   + L Q+       D  + + +  + K++     +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGITLTQET--CFPDDGKVTLRIDEAPKKK----RTL 483

Query: 553 NLRMPVW-TYSNGAQASLNGQ-NLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            +R+P W   S G   S+NG+  + +   GN +L  + +W   D +T  LP+ +  E I 
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFIMAKGNQYLPLSRKWKKGDVVTFHLPMKVSVEQIP 543

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
           D +  Y    A L+GP +LA  T  E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565


>gi|374712027|gb|AEZ64557.1| putative secreted protein [Streptomyces chromofuscus]
          Length = 933

 Score =  225 bits (573), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 152/468 (32%), Positives = 236/468 (50%), Gaps = 41/468 (8%)

Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F + E++       VWAPYYT HKIL GLLD ++  D+ +AL +A+ + +
Sbjct: 382 GFLAAYPETQFITLESMTSSDYGVVWAPYYTAHKILRGLLDAHLYTDDPRALDLASGLCD 441

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + Y+R+ + +   +++R W   +  E GG+ + +  L+++T  P+HL LA LFD    + 
Sbjct: 442 WMYSRLSR-LPASTLQRMWGIFSSGEFGGLVEAVCDLHALTGKPEHLALARLFDLDSLID 500

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L   HAN HIPI  G    ++ TG+  Y      F D+V  +  Y  GGTS 
Sbjct: 501 ACAANRDVLDGLHANQHIPIFTGLLRLHDATGEARYLAAAKNFWDMVVPTRMYGIGGTST 560

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW     +A T+ +   E+C  YNMLK+SR LF   ++  Y DYYERAL N VL  ++
Sbjct: 561 GEFWRGRGSVAGTISATTAESCCAYNMLKLSRLLFFHEQDPKYMDYYERALYNQVLGSKQ 620

Query: 440 GT---EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
            T   E  ++ Y + L  G  +  +     T      CC GTG+ES +K  DS+YF +  
Sbjct: 621 DTADAEKPLVTYFIGLTPGHVRDYTPKAGTT------CCEGTGMESATKYQDSVYFRKAD 674

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR---MTLTFSSKQEVGQLSSLN 553
           +   LY+  Y +S+  W    + + Q  D       Y R    TLT        +   L 
Sbjct: 675 DSV-LYVNLYSASTLTWAERGITVTQTTD-------YPREQGSTLTIGGGSAAFE---LR 723

Query: 554 LRMPVWTYSNGAQASLNG---QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
           LR+P W    G Q ++NG   Q  PL  PG++ + +  W   D + +++P  LR E   D
Sbjct: 724 LRVPSWA-DAGFQVTVNGTAVQGKPL--PGSYFAVSRTWRGGDIVRVRVPFRLRVEPTPD 780

Query: 611 DRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFN 658
           D     ++Q++  GP  L   ++    ++ G  R+ +AL   + P+  
Sbjct: 781 D----PALQSLFHGPVNLVARSASTSPLRFGLYRN-AALSGDLLPTLT 823



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 59/112 (52%), Gaps = 6/112 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           L+   L DV L    +    ++  L++    DVD L+  FR  A L T G  A GGWE  
Sbjct: 44  LRPFDLKDVTLG-PGIFATKRRFMLDHGRGYDVDRLLQVFRANAGLSTRGAVAPGGWEGL 102

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT 205
             E    LRGH+ GH+L+  AQ + ST +    +++ ++V +L+E ++ + T
Sbjct: 103 DGEANGNLRGHYTGHFLTMLAQSYGSTGDQVYADRIRSMVDALTEVRSALRT 154


>gi|256423606|ref|YP_003124259.1| hypothetical protein Cpin_4617 [Chitinophaga pinensis DSM 2588]
 gi|256038514|gb|ACU62058.1| protein of unknown function DUF1680 [Chitinophaga pinensis DSM
           2588]
          Length = 1025

 Score =  225 bits (573), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 187/611 (30%), Positives = 282/611 (46%), Gaps = 101/611 (16%)

Query: 123 LEYLLMLDVDSLVWSFRKT--ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAST 180
           +  L   D +S ++ FR       P   K    W++  ++LRGH  GHYL+A AQ +AST
Sbjct: 406 IRTLATTDPNSFLYMFRHAFGRQQPEGAKPLDVWDSQDTKLRGHATGHYLTAIAQAYAST 465

Query: 181 -HNATIKE----KMSTVVFSLSE------------------------------------- 198
            ++ T+++    KM+ +V +L E                                     
Sbjct: 466 GYDKTLQQNFEQKMAYMVNTLYELSLLSGNPKETGGVAVSDPTAVPYGPGKSGYDSDLSN 525

Query: 199 --CQNKI---GTGYLSAFPTELFDSFEA-------LKPVWAPYYTIHKILAGLLDQYVLA 246
              +N     G G++SA+P + F   E           +WAPYYT+HKILAGL+D Y ++
Sbjct: 526 EGIRNDYWNWGKGFISAYPPDQFIMLEKGAKYGGQKNQIWAPYYTLHKILAGLMDVYEVS 585

Query: 247 DNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKH 305
            N +AL +AT M ++ Y R+  V    ++ + W + +  E GGMN+ + RLY IT   ++
Sbjct: 586 GNQKALTVATGMGDWVYARLSHV-PQDTLIKMWNTYIAGEFGGMNEAMARLYLITGKQQY 644

Query: 306 LLLAHLFDK-PCFLG------FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP-LYKL 357
           L  A LFD    F G       LA   D     HAN HIP ++GS   Y  + +P  YK+
Sbjct: 645 LQTAQLFDNIRVFFGDTAHSHGLAKNVDIFRGLHANQHIPQIVGSIEMYRASNNPEYYKI 704

Query: 358 IGTFFMDIVNASHSYATGGTS-------AREFWWDPKRLAD---TLGSENEETCTTYNML 407
              F+   VN  + Y+ GG +       A  F   P  L +   + G +N ETC TYNML
Sbjct: 705 ADNFWYKAVN-DYMYSIGGVAGARNPANAECFISQPATLYENGFSSGGQN-ETCATYNML 762

Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
           K++  LF + +   + DYYERAL N +L+      P    Y +PL  G  K         
Sbjct: 763 KLTSDLFLFDQRAEFMDYYERALYNHILASVAKDNP-ANTYHVPLRPGAIKQFGN----P 817

Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI 527
               F CC GT IES +KL ++IYF+   N   LY+  YI S+  W   +V + Q  D  
Sbjct: 818 DMTGFTCCNGTAIESNTKLQNTIYFKSRDN-QALYVNLYIPSTLQWTERNVTIEQTTDFP 876

Query: 528 VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSAT 586
              D   R+T+  +     GQ   +N+R+P W  + G    +NG+   L   PG +L+  
Sbjct: 877 KEDD--TRLTIKGN-----GQF-DINVRVPGWA-TKGFFVKINGKEQALTAKPGTYLTIR 927

Query: 587 ERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA---GHTSGEWDIKTGTA 643
            +W   D + +++P     + + D +    +I ++ +GP LLA   G    +W   T  A
Sbjct: 928 RQWKDGDIIDLKMPFRFHLDPVMDQQ----NIASLFYGPILLAAQEGEARKDWRKITLNA 983

Query: 644 RSLSALISPIP 654
             +S  I   P
Sbjct: 984 DDISKSIKGDP 994


>gi|383641062|ref|ZP_09953468.1| glycosylase [Streptomyces chartreusis NRRL 12338]
          Length = 900

 Score =  224 bits (571), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 158/466 (33%), Positives = 228/466 (48%), Gaps = 38/466 (8%)

Query: 206 GYLSAFPTELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F   E+        VWAPYYT HKIL GLLD Y   D+ +AL +A+ M +
Sbjct: 349 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGLLDAYGATDDDRALDLASGMCD 408

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + ++R+ K +   +++R W   +  E GG+ + +  L++IT   +HL LA LFD    + 
Sbjct: 409 WMHSRLSK-LPESTLQRMWGIFSSGEFGGIVEAVCDLHTITGKAEHLALAQLFDLDRLID 467

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L   HAN HIPI  G    Y+ TG+  Y      F D+V     Y  GGTS 
Sbjct: 468 ACAANTDILDGLHANQHIPIFTGYVRLYDETGEERYLTSAKNFWDMVVPHRMYGIGGTST 527

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
           +EFW     +A T+ +   ETC  YNMLK+SR LF   ++  Y DYYERAL N VL  ++
Sbjct: 528 QEFWKARDVIAGTISATTAETCCAYNMLKLSRTLFFHEQDPKYMDYYERALYNQVLGSKQ 587

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  ++ Y + L  G  +   T   GT      CC GTG+ES +K  DS+YF  + 
Sbjct: 588 DKPDAEKPLVTYFIGLTPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYF-AKA 640

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLR 555
           +   LY+  Y  S+  W    V + Q       +      TL F      G+ S +L LR
Sbjct: 641 DGSALYVNLYSPSTLTWAEKGVTVTQ----TTGFPEEQGSTLAFGG----GRASFTLRLR 692

Query: 556 MPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           +P W  + G + ++NG+ +   P PGN+   +  W   D + I +P   R E   DD   
Sbjct: 693 VPSWA-TAGFRVTVNGRAVSGTPKPGNYFEVSRTWRAGDTVRIAMPFRTRVEKALDD--- 748

Query: 615 YASIQAILFGPYLLAGHTSGEWDIKTGTAR------SLSALISPIP 654
             S+Q +  GP  L    +    +K G  R       LS  ++P+P
Sbjct: 749 -PSLQTLFHGPVNLVARDAATEYLKVGLYRDAGLSGDLSHSLTPVP 793



 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 58/110 (52%), Gaps = 6/110 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           ++  +L DV L +  +    ++  L++    DV+ L+  FR  A LPT G  A GGWE  
Sbjct: 10  VQPFALEDVAL-RPGLFAEKRRLMLDHARGYDVNRLLQVFRANAGLPTGGAVAPGGWEGL 68

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
             E    LRGH+ GH+L+  AQ +  T      +++ T+V +L+E +  +
Sbjct: 69  DGEANGNLRGHYTGHFLTMLAQAYRGTKERVFADRIGTMVGALTEVRAAL 118


>gi|294675240|ref|YP_003575856.1| hypothetical protein PRU_2607 [Prevotella ruminicola 23]
 gi|294471633|gb|ADE81022.1| conserved hypothetical protein [Prevotella ruminicola 23]
          Length = 788

 Score =  224 bits (571), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 184/629 (29%), Positives = 296/629 (47%), Gaps = 63/629 (10%)

Query: 98  FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
           +  E  L DV L  +  L  A+  N+E LL  D D L+  + K A L   GK+Y  W+  
Sbjct: 17  YANEFPLGDVTL-LNGPLKHARDLNIETLLKYDNDRLLAPYLKEAGLTPKGKSYPNWDG- 74

Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-------GTGYLSA 210
              L GH  GHYL+A A + A+T +   +++M   +  L  C +         G GY+  
Sbjct: 75  ---LDGHVGGHYLTAMA-INAATGSQECRKRMEYWISELQACADANAKNHPDWGRGYVGG 130

Query: 211 FP--TELFDSFEA--LKP---VWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMV 259
            P    ++ +F+     P    W P+Y IHK+ AGL D +V   N QA K+      W +
Sbjct: 131 VPGSDRIWSNFKKGNFGPYFGAWVPFYNIHKMYAGLRDAWVYCGNEQAKKLFLGFCDWAI 190

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           +   N     +T   +ER   +L+ E GGMN+VL   Y+IT + K+L +A  F     L 
Sbjct: 191 DLTAN-----LTDAQMER---ALDTEHGGMNEVLADAYAITGEQKYLDVARRFSHRRLLN 242

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            L  + D L + HANT +P VIG +   E++GD  Y   G +F DIV    + A GG S 
Sbjct: 243 PLMQRRDVLDNMHANTQVPKVIGFERIAELSGDEAYHTAGAYFWDIVTGERTLAFGGNSR 302

Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
           RE +   +   D +   +  E+C T NMLK++  L R   E  YAD++E A  N +LS Q
Sbjct: 303 REHFPSREACQDFVQDIDGPESCNTNNMLKLTEDLHRRNPEARYADFFELATFNHILSTQ 362

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
              E G  +Y        ++ R    +     + WCC GTG+E+  K    IY    G+ 
Sbjct: 363 H-PEHGGYVYFTS-----ARPRHYRNYSAPNEAMWCCVGTGMENHGKYNQFIY-THSGDA 415

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             L++  +++S  +WK+  + L Q+      +    R+T+T SS  +  Q + + +R P 
Sbjct: 416 --LFVNLFVASELNWKAKGITLRQETS--FPYSENSRITITQSSNTK--QPTPIMVRYPG 469

Query: 559 WTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
           W         +NG+ + +   P ++++   +W   D + IQ P+    + +    P    
Sbjct: 470 WVKPGQFSVKVNGKPVSIVTGPSSYVAINRQWKKGDVIDIQFPMYNSVKYL----PNLPQ 525

Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN 677
             A++ GP +LA        +KTGT   L+ LI+    S   QL T  +   +   ++ N
Sbjct: 526 YIALMHGPIMLA--------MKTGT-EDLAHLIA--DDSRFGQLATGKKLPIDQAPILVN 574

Query: 678 SN-QSITMEEFPVSGTDAALHATFRLILK 705
            + +SI  +  P++G     + + +++ K
Sbjct: 575 KDVESIANQLQPIAGKPLHFNLSTKMVNK 603


>gi|297191370|ref|ZP_06908768.1| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
 gi|197720620|gb|EDY64528.1| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
          Length = 942

 Score =  224 bits (570), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 153/444 (34%), Positives = 228/444 (51%), Gaps = 36/444 (8%)

Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F + E++       VWAPYYT HKIL GLLD ++   + +AL +A+ M +
Sbjct: 393 GFLAAYPETQFITLESMTSPDYTVVWAPYYTAHKILKGLLDAHLSTGDVRALDLASGMCD 452

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + ++R+  ++   +  R W   +  E GGM + +  ++S+T   +HL LA +FD    + 
Sbjct: 453 WMHSRL-ALLPSATRRRMWGLFSSGEYGGMVEAVVDVHSLTGRAEHLELARMFDLDPLID 511

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D LS  HAN HIPI  G    ++ TG+  Y      F D+V  +  Y  GGTS 
Sbjct: 512 ACAENRDVLSGLHANQHIPIFTGLIRLHDATGEERYLTAARNFWDMVVPTRMYGIGGTST 571

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW D   +A TLG    ETC  +NMLK+SR LF   ++  YAD+YER L N +L  ++
Sbjct: 572 GEFWRDAGVIAGTLGDTTAETCCAHNMLKLSRLLFLHEQDPKYADHYERTLFNQILGSKQ 631

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  +M Y + L  G  +   T   GT      CC GTGIES +K  DS+YF    
Sbjct: 632 DLADAELPLMTYFIGLAPGAVR-DFTPKQGTT-----CCEGTGIESATKYQDSVYFRTR- 684

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM--TLTFSSKQEVGQLSSLNL 554
           +  GLY+  Y++S+ DW    V + Q           LR+  + TF           L+L
Sbjct: 685 DGSGLYVNLYMASTLDWTDRGVRVTQTTRFPYEQGSTLRIAGSGTF----------DLHL 734

Query: 555 RMPVWTYSNGAQASLNGQ-NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
           R+P W    G    +NG+ +     PG++L+ +  W   D + I +P +LRTE   DD  
Sbjct: 735 RVPHWA-DAGFFVRVNGRAHHGGAAPGSYLTVSRAWRDGDTVEISMPFTLRTEPALDDH- 792

Query: 614 EYASIQAILFGP-YLLAGHTSGEW 636
               +Q +++GP +L+A H   E+
Sbjct: 793 ---DVQCLMYGPVHLVARHEQREF 813



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENPISE----LRGHFVGHYLSASAQMW 177
           L++    DV  L+  FR  A L T G  A GGWE    E    LRGHF GH+LS  +Q +
Sbjct: 77  LDFGRSYDVHRLLQVFRANAGLSTRGAVAPGGWEGLDGEARGNLRGHFTGHFLSMLSQAY 136

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKI 203
            ST      +K+ T+V  L+EC+  +
Sbjct: 137 VSTREQVFADKIGTMVDGLAECREAL 162


>gi|295132897|ref|YP_003583573.1| glycosyl hydrolase [Zunongwangia profunda SM-A87]
 gi|294980912|gb|ADF51377.1| putative glycosyl hydrolase [Zunongwangia profunda SM-A87]
          Length = 797

 Score =  224 bits (570), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 170/538 (31%), Positives = 252/538 (46%), Gaps = 45/538 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A+  N+  LL  DVD L+  +RK A L     +Y  WE     L GH  GHYLSA A  +
Sbjct: 49  ARDLNMSVLLQYDVDRLLAPYRKEAGLEPRKPSYPNWEG----LDGHIGGHYLSALAMNY 104

Query: 178 ASTHNATIKEKMSTVVFSLSECQ-------NKIGTGYLSAFPTE--LFDSF-----EALK 223
           A+T N     +M+ ++  L ECQ        + G GY+  FP    L+ SF     E   
Sbjct: 105 AATDNQEFLARMNYMLKELRECQLANTKKHPEWGVGYVGGFPNSEALWSSFKKGNFEKYN 164

Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
             WAP+Y +HK+ AGL D ++ AD+ +A +M     ++     + +    S E+    LN
Sbjct: 165 SAWAPFYNLHKMYAGLRDAWLYADSEKAKEMFLDFCDWGITLTKDL----SHEQMQSVLN 220

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGS 343
            E GGM +V    Y IT + K+L  A  +     L  L+   D L + HANT IP  +G 
Sbjct: 221 MEHGGMPEVYADAYQITGEKKYLEAAKRYSHEQVLHPLSKGIDNLDNKHANTQIPKFVGF 280

Query: 344 QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN-EETCT 402
           +   EV GD  +   G++F + V  + S A GG S +E +       D +  ++  E+C 
Sbjct: 281 ERIAEVDGDEKFAKAGSYFWETVTKNRSLAFGGNSRKEHFPSTSASIDYINEDDGPESCN 340

Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARST 462
           +YNMLK++  LFR   E  YADYYER L N +LS Q   + G  +Y  P     ++ R  
Sbjct: 341 SYNMLKLTEDLFRVNPEAKYADYYERTLYNHILSTQH-PQHGGYVYFTP-----ARPRHY 394

Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
             +     + WCC GTG+E+  K    IY   +G+   LYI  +I S  +W+   V + Q
Sbjct: 395 RIYSAPEEAMWCCVGTGMENHGKYNQFIY-THQGD--SLYINLFIPSELNWEKQGVKIRQ 451

Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGN 581
           + +        L++T       E      L LR P W      +  +N + + L   P +
Sbjct: 452 ETNFPSEEGTSLKIT-------EGTAEFPLFLRYPGWIKEGEMKIKINSEEIELIGKPSS 504

Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIK 639
           ++     W   D + + LP+    E +  + P+Y    A   GP LL G  SG  D+K
Sbjct: 505 YVKIDRNWQKGDIVDVSLPMHNHMERLP-NVPQYV---AFFHGPILL-GAPSGSEDLK 557


>gi|429195121|ref|ZP_19187172.1| hypothetical protein STRIP9103_04852 [Streptomyces ipomoeae 91-03]
 gi|428669175|gb|EKX68147.1| hypothetical protein STRIP9103_04852 [Streptomyces ipomoeae 91-03]
          Length = 936

 Score =  223 bits (569), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 152/442 (34%), Positives = 224/442 (50%), Gaps = 37/442 (8%)

Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F   E++       VWAPYYT HKIL GLLD Y+  D+A+AL +A+ + +
Sbjct: 384 GFLAAYPETQFIELESMTSGDYTRVWAPYYTAHKILRGLLDAYLNVDDARALDLASGLCD 443

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + Y+R+ K+    +++R W   +  E GG+ + +  LY+IT   +HL LA LFD    + 
Sbjct: 444 WMYSRLSKLPDA-TLQRMWGIFSSGEFGGLVEAIVDLYTITGKAEHLALARLFDLDKLID 502

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L   HAN HIPI  G    Y+ TG+  Y      F  +V     Y  GGTS 
Sbjct: 503 ACAANTDTLDGLHANQHIPIFTGLARLYDATGEVRYLTAAKNFWGMVVPPRMYGIGGTST 562

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW     +A T+   N ETC  YN+LK+SR LF   ++  Y DYYERAL N VL  ++
Sbjct: 563 GEFWKARGVIAGTISDTNAETCCAYNLLKLSRTLFFHEQDPKYMDYYERALLNQVLGSKQ 622

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  ++ Y + L  G  +   T   GT      CC GTG+ES +K  DS+YF +  
Sbjct: 623 DKTDAEKPLVTYFIGLKPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYFTKA- 675

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS---LN 553
           +   LY+  Y +++ +W +  V + Q  D       Y R      S   +G  S+   L 
Sbjct: 676 DGSALYVNLYSATTLNWSAKGVTVTQTTD-------YPREQ---GSTITIGGGSAAFELR 725

Query: 554 LRMPVWTYSNGAQASLNGQNLP-LPPPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDD 611
           LR+P W  + G + ++NG  +   P  G++ + + R W   D + + +P  LR E   DD
Sbjct: 726 LRVPSWA-TAGFRVTVNGGAVSGTPTAGSYFTISSRTWRGGDVVRVTMPFRLRVEKALDD 784

Query: 612 RPEYASIQAILFGPYLLAGHTS 633
                S+Q + +GP  L G  +
Sbjct: 785 ----PSLQTLFYGPVNLVGRNT 802



 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 59/112 (52%), Gaps = 6/112 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           ++   L DV L Q  +    +Q  L++    DVD L+  FR  A L T G  A GGWE  
Sbjct: 45  VRPFELKDVTLGQG-LFAGKRQLMLDHGRGYDVDRLLQVFRANAGLSTKGAVAPGGWEGL 103

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT 205
             E    LRGH+ GH+L+  AQ +AST +    +K+  +V +L+E +  + T
Sbjct: 104 DGEANGNLRGHYTGHFLTTLAQAYASTADTVYADKIRYMVGALTEVRAALRT 155


>gi|423303007|ref|ZP_17281028.1| hypothetical protein HMPREF1057_04169 [Bacteroides finegoldii
           CL09T03C10]
 gi|408470336|gb|EKJ88871.1| hypothetical protein HMPREF1057_04169 [Bacteroides finegoldii
           CL09T03C10]
          Length = 801

 Score =  223 bits (569), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 172/540 (31%), Positives = 245/540 (45%), Gaps = 55/540 (10%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ  N   LL  DVD L+  F   A L    + +  W      L GH  GHYLSA A  +
Sbjct: 47  AQDLNRSVLLEYDVDRLLAPFLIEAGLEPKAEKFPNWPG----LDGHVAGHYLSAMAMNY 102

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
            +      K +M  ++  L  CQ   G GY+   P       E  K         WAP+Y
Sbjct: 103 RAGGGEEFKRRMEYILSELYRCQQANGDGYIGGIPNGKAGWKEIKKGNVGIIWKYWAPWY 162

Query: 231 TIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            +HK+ AGL D ++ AD+  A KM      W +         VI+  + E+    LN E 
Sbjct: 163 NLHKLYAGLRDAWLYADSELAKKMFLDYCDWGI--------GVISGLNDEQMEQMLNNEF 214

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGMN+V    Y I+ D K+L  A  F        +    D L + HANT +P  +G Q  
Sbjct: 215 GGMNEVFADAYQISGDTKYLDAAKRFSHKWLFESMRDGKDNLDNKHANTQVPKAVGYQRV 274

Query: 347 YEVT------GDPL-YKLIGTFFMDIVNASHSYATGGTSARE-FWWDPKRLADTLGSENE 398
            E++      GD + Y     FF   V A+ S A GG S RE F  D   L+     E  
Sbjct: 275 AELSVQAKRSGDAVDYTRAAYFFWQTVTANRSLAFGGNSRREHFPDDADYLSYVDDREGP 334

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C TYNML+++  LFR   + AYAD+YERAL N +LS Q     G  +Y  P       
Sbjct: 335 ESCNTYNMLRLTEGLFRKDPKAAYADFYERALFNHILSTQHPVHGGY-VYFTP------- 386

Query: 459 ARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
           AR  H   +     + WCC GTG+E+  K G+ IY     +   LY+  +ISS  +WK  
Sbjct: 387 ARPAHYRVYSAPNEAMWCCVGTGMENHGKYGEFIYAHTGDS---LYVNLFISSRLEWKKR 443

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
            + L Q      S+    +  LT ++K+       L +R P W        ++NG+++  
Sbjct: 444 RISLTQ----TTSFPDEGKTCLTITAKKSTK--FPLFVRKPGWVGDGKVIITVNGKSIET 497

Query: 577 PPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
               N + +   +W   D + +Q+P+++R E ++   PEY    AI+ GP LL  +   E
Sbjct: 498 TTAANSYYTINRKWKNGDVVEVQMPMNIRIEELK-HHPEYI---AIMRGPILLGANVGKE 553


>gi|160882548|ref|ZP_02063551.1| hypothetical protein BACOVA_00499 [Bacteroides ovatus ATCC 8483]
 gi|156112129|gb|EDO13874.1| hypothetical protein BACOVA_00499 [Bacteroides ovatus ATCC 8483]
          Length = 801

 Score =  223 bits (568), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 172/540 (31%), Positives = 246/540 (45%), Gaps = 55/540 (10%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ  N   LL  DVD L+  F   A L    + +  W      L GH  GHYLSA A  +
Sbjct: 47  AQDLNRSVLLEYDVDRLLAPFLIEAGLKPKAEKFPNWPG----LDGHVAGHYLSAMAMNY 102

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
            +      K +M  ++  L +CQ   G GY+   P       E  K         WAP+Y
Sbjct: 103 RAGDGEEFKRRMEYMLSELYKCQQANGDGYIGGIPNGKAGWKEIKKGNVGIIWKYWAPWY 162

Query: 231 TIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            +HK+ AGL D ++ AD+  A KM      W +         VI+  + E+    LN E 
Sbjct: 163 NLHKLYAGLRDAWLYADSELAKKMFLDYCDWGI--------GVISGLNDEQMEQMLNNEF 214

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGMN+V    Y I+ D K+L  A  F        +    D L + HANT +P  +G Q  
Sbjct: 215 GGMNEVFADAYQISGDTKYLDAAKRFSHKWLFESMRDGKDNLDNKHANTQVPKAVGYQRV 274

Query: 347 YEVT------GDPL-YKLIGTFFMDIVNASHSYATGGTSARE-FWWDPKRLADTLGSENE 398
            E++      GD + Y     FF   V A+ S A GG S RE F  D   L+     E  
Sbjct: 275 AELSVQAKRSGDAVDYTRAAYFFWQTVTANRSLAFGGNSRREHFPDDADYLSYVDDREGP 334

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C TYNML+++  LFR   + AYAD+YERAL N +LS Q     G  +Y  P       
Sbjct: 335 ESCNTYNMLRLTEGLFRKDPKAAYADFYERALFNHILSTQHPVHGGY-VYFTP------- 386

Query: 459 ARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
           AR  H   +     + WCC GTG+E+  K G+ IY     +   LY+  +ISS  +WK  
Sbjct: 387 ARPAHYRVYSAPNEAMWCCVGTGMENHGKYGEFIYAHTGDS---LYVNLFISSRLEWKKR 443

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
            + L Q      S+    +  LT ++K+       L +R P W        ++NG+++  
Sbjct: 444 RISLTQ----TTSFPNEGKTCLTITAKKSTK--FPLFVRKPGWVGDGKVIITVNGKSIET 497

Query: 577 PPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
               N + +   +W   D + +Q+P+++R E ++   PEY    AI+ GP LL  +   E
Sbjct: 498 TTAANSYYTINRKWKNGDVVEVQMPMNIRIEELK-HHPEYI---AIMRGPILLGANVGKE 553


>gi|302549595|ref|ZP_07301937.1| secreted protein [Streptomyces viridochromogenes DSM 40736]
 gi|302467213|gb|EFL30306.1| secreted protein [Streptomyces viridochromogenes DSM 40736]
          Length = 943

 Score =  223 bits (567), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 158/466 (33%), Positives = 228/466 (48%), Gaps = 38/466 (8%)

Query: 206 GYLSAFPTELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F   E+        VWAPYYT HKIL GLLD Y   D+ +AL +A+ M +
Sbjct: 392 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGLLDAYTATDDDRALDLASGMCD 451

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + ++R+ K +   +++R W   +  E GG+ + +  L+++T   +HL LA LFD    + 
Sbjct: 452 WMHSRLSK-LPESTLQRMWGIFSSGEFGGIVEAICDLHTLTGKAEHLALAQLFDLDRLIE 510

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L   HAN HIPI  G    Y+ TG+  Y      F D+V     Y  GGTS 
Sbjct: 511 ACAANTDILDGLHANQHIPIFTGYVRLYDETGEERYLRSAKNFWDMVVPHRMYGIGGTST 570

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
           +EFW     +A T+ +   ETC  YNMLK+SR LF   ++  Y DYYERAL N VL  ++
Sbjct: 571 QEFWKARDVIAGTISATTAETCCAYNMLKLSRTLFFHEQDPKYMDYYERALYNQVLGSKQ 630

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  ++ Y + L  G  +   T   GT      CC GTG+ES +K  DS+YF  + 
Sbjct: 631 DKPDVEKPLVTYFIGLTPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYF-AQA 683

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLR 555
           +   LY+  Y  S+  W    V + Q      S+      TLT       G+ S +L LR
Sbjct: 684 DGSALYVNLYSPSTLTWAEKGVTVTQS----TSFPREQGSTLTLGG----GRASFTLRLR 735

Query: 556 MPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           +P W  + G   ++NG+ +   P PG++   +  W   D + I +P   R E   DD   
Sbjct: 736 VPSWA-TAGFGVTVNGRAVSGTPRPGSYFDVSRTWRAGDTVRIAMPFRTRVEKALDD--- 791

Query: 615 YASIQAILFGPYLLAGHTSGEWDIKTGTAR------SLSALISPIP 654
             S+Q +  GP  L    S    +K G  R       LS  ++P+P
Sbjct: 792 -PSLQTLFHGPVNLVARDSATEYLKVGLYRDAGLSGDLSHSLTPVP 836



 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 56/110 (50%), Gaps = 6/110 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           ++   L DV L +  V    +Q  L++    DV+ L+  FR  A L T G  A GGWE  
Sbjct: 53  VRPFGLEDVSLGRG-VFADKRQLMLDHARGYDVNRLLQVFRANAGLATGGAVAPGGWEGL 111

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
             E    LRGH+ GH+L+  AQ + ST      +++  VV +L+E +  +
Sbjct: 112 DGEANGNLRGHYTGHFLTMLAQAYRSTKEQVFADRIGAVVGALTEVRAAL 161


>gi|336428272|ref|ZP_08608256.1| hypothetical protein HMPREF0994_04262 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336006508|gb|EGN36542.1| hypothetical protein HMPREF0994_04262 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 601

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 137/418 (32%), Positives = 212/418 (50%), Gaps = 18/418 (4%)

Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL-----PTPGKAYGGWENPISELRGHFV 166
            S + R  Q N + LL      L+ S+   A L       P   + GWE P SE+RGHFV
Sbjct: 17  DSEIRRRFQVNEDLLLRYQSKDLLRSYYFEAGLWKDNSENPKIEHWGWEGPTSEIRGHFV 76

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVW 226
           GH+LSA+A  +AS  N  +  +   ++  L  CQ   G  ++ A P +     E  +   
Sbjct: 77  GHWLSAAAITYASDGNRELLGRAEYMLDELERCQKANGGEWIGAIPEKQLRWTEEGRNFG 136

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            P Y +HKI+ GL+D YV A N +AL++     ++FY  V+ + T    +R    +  ET
Sbjct: 137 VPLYNLHKIIMGLIDMYVYAGNCKALEIVGHFADWFYRWVKDIPT----DRMDIIMETET 192

Query: 287 GGMNDVLYRLYSITHDPKH-LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
           GG+ +   RLY IT + K+ +L+     +P F   L    D L++ HANT IP ++G   
Sbjct: 193 GGILEEWCRLYEITGEEKYQVLMEKFLRRPLFHALLE-NKDVLTNMHANTTIPEILGIAR 251

Query: 346 RYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
            YEVTG+P Y K +  ++   V     + TGG ++ E W  P  + + LG  N+E C  Y
Sbjct: 252 MYEVTGNPEYLKAVKNYWSIAVTKRGGFVTGGQTSGEVWIPPFHIRERLGKLNQEHCAVY 311

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           NM++++  L+++T +I + +Y E  L NG+L+ Q+    G   Y LP+  G  K      
Sbjct: 312 NMMRLAEFLYQYTGDIEFENYRELNLYNGILA-QQNPNTGAAAYYLPMQAGSRKI----- 365

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
           W T+  SFWCC G+GI++ +  G  IY E +  +     I  + +S  W+    +  Q
Sbjct: 366 WSTEKKSFWCCCGSGIQAGASHGMGIYAENKNQIAVNQFIPSVLTSDRWERKVKITQQ 423


>gi|408533805|emb|CCK31979.1| secreted protein [Streptomyces davawensis JCM 4913]
          Length = 943

 Score =  219 bits (559), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 151/439 (34%), Positives = 218/439 (49%), Gaps = 32/439 (7%)

Query: 206 GYLSAFPTELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F   E+        VWAPYYT HKIL G+LD Y+  D+A+AL +A+ M +
Sbjct: 392 GFLAAYPETQFIDLESRTSSDYTKVWAPYYTAHKILRGVLDAYLATDDARALDLASGMAD 451

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + ++R+ K +   +++R W   +  E GG+ + +  L++IT   +HL LA LFD    + 
Sbjct: 452 WMHSRLSK-LPEATLQRMWGLFSSGEFGGIVEAICDLHAITGKAEHLALARLFDLDRLID 510

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L   HAN HIPI  G    Y+ TG+  Y      F  +V     Y  GGTS 
Sbjct: 511 SCAANTDILDGLHANQHIPIFTGYLRLYDATGEQRYLDAARNFWGMVVPHRMYGIGGTST 570

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW     +A T+ +   ETC  YN+LK+SR LF       Y DYYERAL N VL  ++
Sbjct: 571 GEFWKARDVIAGTISATTAETCCAYNLLKLSRTLFFHEPSPKYMDYYERALYNQVLGSKQ 630

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  ++ Y + L  G  +   T   GT      CC GTG+ES +K  DS+YF  + 
Sbjct: 631 DKPDAEKPLVTYFIGLTPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYFTTD- 683

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLR 555
           +   LY+  Y  S  +W    V + Q      ++      TLT       G  S  L LR
Sbjct: 684 DGSALYVNLYSPSRLNWADKGVTVTQA----TAFPQEQGTTLTIGG----GSASFELRLR 735

Query: 556 MPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           +P W  + G + ++NG+ +   P PG++ + +  W   D + I +P  LR E   DD   
Sbjct: 736 VPSWA-TAGFRVTVNGRAVSGTPAPGSYFAVSRTWRSGDTVRISMPFRLRAEKALDD--- 791

Query: 615 YASIQAILFGPYLLAGHTS 633
             S+Q + +GP  L G  S
Sbjct: 792 -PSLQTLCYGPVNLVGRNS 809



 Score = 59.3 bits (142), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 57/110 (51%), Gaps = 6/110 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT-PGKAYGGWENP 157
           +K  +L  V L Q  +    ++  L++    DVD L+  FR  A LPT    A GGWE  
Sbjct: 53  VKPFALDQVTLGQG-LFADKRELMLDHARGYDVDRLLQVFRANAGLPTGDAVAPGGWEGL 111

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
             E    LRGH+ GH+++  AQ WA T      +++ T++ +L+E +  +
Sbjct: 112 DGEANGNLRGHYTGHFMTMLAQAWAGTGEQVFADRLRTMIGALTEVRAAL 161


>gi|384109447|ref|ZP_10010323.1| hypothetical protein MSI_18910 [Treponema sp. JC4]
 gi|383868978|gb|EID84601.1| hypothetical protein MSI_18910 [Treponema sp. JC4]
          Length = 727

 Score =  219 bits (558), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 166/546 (30%), Positives = 254/546 (46%), Gaps = 68/546 (12%)

Query: 109 LDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGH 168
           L++ S+  ++Q+  LEY+L  + D ++    +          YGGWEN   +++GH +GH
Sbjct: 8   LEKDSLFEKSQRLGLEYVLEYEPDRMLAPCYRALGKNPCAINYGGWEN--RQIQGHMLGH 65

Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD-------SFE- 220
           YLSA +  +  T     KEK+   +  + E Q K   GY    P++ FD       +FE 
Sbjct: 66  YLSALSGFYYQTGKQDAKEKLDYTIDLIKELQRK--DGYFGGIPSDSFDKVFYSGGNFEV 123

Query: 221 ---ALKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFYN----RVQKV 269
              +L   W P+Y+IHKI AGL+D YV   N  AL    KMA W +    N     +QK+
Sbjct: 124 ERFSLAGWWVPWYSIHKIYAGLIDAYVYGGNEDALQIVFKMADWAINGTKNLSDSSIQKM 183

Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
           +T             E GGM  V   LY IT + K+L  A  +     +   + + D L 
Sbjct: 184 LTC------------EHGGMCKVFADLYGITGNKKYLSEAERWIHHEIIDPASKKEDKLQ 231

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
            +HANT IP  IG    YE+TG   Y+    FF + V  + SYA GG S  E +   +  
Sbjct: 232 GYHANTQIPKFIGIARLYELTGKSEYRTAAEFFFETVTKNRSYAIGGNSKGEHF--GREF 289

Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
            + L  +  ETC TYNML+++ H+F W K    AD+YE AL N +L+ Q   + G   Y 
Sbjct: 290 EEPLMRDTCETCNTYNMLELAEHIFAWNKTSDIADFYENALYNHILASQ-DPQTGAKTYF 348

Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
           + + +G  K   +H      N+ WCC GTG+E+ S+    I  + +     LYI  +I +
Sbjct: 349 VSMQQGFHKVYCSHD-----NAMWCCTGTGLENPSRYNRFIACDFD---DVLYINLFIPA 400

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
           + + + G  V   KV+    +D  +++ +    K+  G    L +R P W      +A  
Sbjct: 401 TVETEDGWKV---KVETDFPYDAAVKIKVLERGKENKG----LKVRKPGWADKMAEKAGE 453

Query: 570 NGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           +G        GN  S +E       + + LP+ L     +D    +    A+ +GP +LA
Sbjct: 454 DG----YIDFGNLSSESE-------IELSLPMKLSIYKAKDHSGNF----AVKYGPLVLA 498

Query: 630 GHTSGE 635
                E
Sbjct: 499 ADLGNE 504


>gi|238061684|ref|ZP_04606393.1| secreted protein [Micromonospora sp. ATCC 39149]
 gi|237883495|gb|EEP72323.1| secreted protein [Micromonospora sp. ATCC 39149]
          Length = 933

 Score =  219 bits (557), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 151/464 (32%), Positives = 229/464 (49%), Gaps = 36/464 (7%)

Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F + E++       VWAPYYT HKIL G+LD Y+   + +AL +AT M +
Sbjct: 382 GFLAAYPETQFITLESMTASDYAKVWAPYYTAHKILQGILDAYLNTGDERALDLATGMCD 441

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + ++R+ K +   +++R W   +  E GG+ + +  ++ IT  P HL LA LFD    + 
Sbjct: 442 WMHSRLSK-LPAATLQRMWGLFSSGEFGGIVETICDVHRITGSPNHLALARLFDLNSLID 500

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D ++  HAN HIPI  G    ++ TG+  Y      F  +V  +  Y+ GGTS 
Sbjct: 501 AAAAGTDTITGLHANQHIPIFTGLLRLHDETGEQRYLNAARNFWPMVVPTRMYSIGGTST 560

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            EFW +P  +A +L   N ETC  YN+LK+SR LF   ++  Y DYYERAL N +L  +R
Sbjct: 561 VEFWKEPGAIAGSLSDTNAETCCAYNLLKLSRTLFLHEQDPKYMDYYERALYNQILGSKR 620

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  ++ Y + L  G  +   T   GT      CC GTG+ES +K  D++Y  +  
Sbjct: 621 DLADAEKPLVTYFIGLVPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDTVYL-DTA 673

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS--LNL 554
           +   LY+  Y SS   W    + L Q            R     ++  +VG  ++  L L
Sbjct: 674 DGRALYVNLYSSSKLTWARRGITLTQTT----------RYPFEQNTTIKVGGNATFELRL 723

Query: 555 RMPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
           R+P W   +  +  +NG+  P    PG++     RW   D + + +P  LR E   DD  
Sbjct: 724 RVPGWVKGD-FKVYVNGRRAPGKATPGSYFPVARRWRAGDTVRVHIPFQLRVEKALDD-- 780

Query: 614 EYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSF 657
              S Q + +GP  L   ++    +K G  R+  AL   + PS 
Sbjct: 781 --PSTQTLFYGPVNLVARSASTNFLKIGLYRN-CALSGDLLPSL 821



 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 54/110 (49%), Gaps = 11/110 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           L EV+L D       V  R +   LE+    +VD L+  FR  A L T G  A  GWE  
Sbjct: 54  LGEVALRD------GVFARKRDLMLEHARGYNVDRLLQVFRANAGLDTLGAVAPSGWEGL 107

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
             E    LRGH+ GH+L+  AQ + ST +    +K+  +V +L E +  +
Sbjct: 108 DGEANGNLRGHYTGHFLTMLAQAYGSTGDKVFADKLKYMVGALVEARAAL 157


>gi|373463723|ref|ZP_09555310.1| hypothetical protein HMPREF9104_01016 [Lactobacillus kisonensis
           F0435]
 gi|371763942|gb|EHO52383.1| hypothetical protein HMPREF9104_01016 [Lactobacillus kisonensis
           F0435]
          Length = 747

 Score =  219 bits (557), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 165/575 (28%), Positives = 266/575 (46%), Gaps = 61/575 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENP 157
           +K VS ++V    +S L    + N+ ++L L  D L++++R  A L T G      WE+P
Sbjct: 22  MKPVSYYNVKYLPNSTLKEKFERNVNWMLSLTPDQLLYNYRINAGLDTKGATPLTVWESP 81

Query: 158 ISELRGHFVGHYLSASAQMWASTHNAT-------IKEKMSTVVFSLSECQNKIGT----- 205
               RGHF GHYLS +++ +   +N         +K++++ +V  L ECQ K  T     
Sbjct: 82  DWFFRGHFTGHYLSGASRSFVELNNMEDTKEANELKDRVNKIVDGLKECQEKFDTFEEFP 141

Query: 206 GYLSAFPTELFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYF 262
           GYL+A P++ FD  E L+     + PYY + K++ GL+D Y  A N  AL++   M  YF
Sbjct: 142 GYLAAEPSKRFDDVEKLRFNGNHYVPYYAVQKLMDGLMDAYEFAGNQTALELTMNMTHYF 201

Query: 263 YNRVQKVITMY---SVERHWYS------LNEETGGMNDVLYRLYSITHDPKHLL--LAHL 311
             R++++        ++  WY        ++E G M+  L RLY IT   +  +  LA  
Sbjct: 202 EKRMERLTPEQINAMIDTRWYQGKGHYVYHQEFGAMHRTLLRLYEITDKKQKDIFDLAQK 261

Query: 312 FDKPCFLGFLALQADYLSHF--HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNAS 369
           FD+  F   L    D L ++  HANT +    G    Y VTGD  YK     +M+ ++  
Sbjct: 262 FDRKWFRDMLINNDDELGYYSCHANTELVCAEGMLEYYHVTGDENYKKGVVNYMNWMHDG 321

Query: 370 HSYATGGTSAR-----------EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTK 418
           H   T G S R           E +  P+     L   N E+C ++++  +S  LF  TK
Sbjct: 322 HELPTKGISGRSAYPAPADYGSELYDYPEMFFKHLSMLNGESCCSHDLNFLSSELFADTK 381

Query: 419 EIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGT 478
           +    D YE    N +++ Q+  +  +  Y+  L    +  +     G     FWCC G+
Sbjct: 382 DATLLDDYEIRFINAIMA-QQNNDSAIAEYLYNLSVAPNSTKEYSHTG-----FWCCTGS 435

Query: 479 GIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL 538
           G E  S L D IY+ ++ ++   Y+ QY  S  D K   V + Q  D       +  +T+
Sbjct: 436 GTERHSTLVDGIYYTDKKDI---YVGQYFDSILDLKDQGVTVTQ--DSHYPEQHFAHITV 490

Query: 539 TFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
             +  QE     ++ LR+P W  S     S++G+N+   P   F++    W    ++T+ 
Sbjct: 491 EAAKSQEF----TVYLRVPKW--SRNTTISVDGENVDAEPKNGFVAIKRTWGKKAEITVN 544

Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
               LR + + D    +  + AI +GP LLA  T 
Sbjct: 545 FDFELRYQTLAD---RFNRV-AIYYGPILLAAQTK 575


>gi|345513939|ref|ZP_08793454.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
 gi|423241465|ref|ZP_17222578.1| hypothetical protein HMPREF1065_03201 [Bacteroides dorei
           CL03T12C01]
 gi|229435753|gb|EEO45830.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
 gi|392641358|gb|EIY35135.1| hypothetical protein HMPREF1065_03201 [Bacteroides dorei
           CL03T12C01]
          Length = 797

 Score =  219 bits (557), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 178/554 (32%), Positives = 258/554 (46%), Gaps = 58/554 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L EV L D +  ++  L +       YLL LDVD L+   R++  L   G  YGGWE   
Sbjct: 44  LSEVELTDSYFKKAMDLHKG------YLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 94

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----------GTGY 207
            +  G   GHY+SA A M+AST    + +K++ ++  L ECQ +              GY
Sbjct: 95  -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 153

Query: 208 LSAFPTE--LFDSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
           L        L    E  +P W        +Y IHKILAGL D YV A   QA  +   + 
Sbjct: 154 LQLLQGNVVLNQPDETGQP-WNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLA 212

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           ++    +  +    + +    +L+ E GGMN+V   +YSIT D K L  A  F+    + 
Sbjct: 213 DF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIY 268

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            +A   D L   HAN  IP  +G    YE + + +Y      F +IV   H+ A GG S 
Sbjct: 269 PIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSC 328

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E +  P   +  L   + ETC TYNMLK+SR LF    +  Y +YYE AL N +L+ Q 
Sbjct: 329 YERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQD 388

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
              PG + Y   L  G  K  S     T F+SFWCC GTG+E+ SK  +SIYF++     
Sbjct: 389 PDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTGMENHSKYAESIYFKDNQE-- 441

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLSSLNL-RMP 557
            L +  YI S   WK   + L        + D Y   + T + +  E+G  + + L R P
Sbjct: 442 -LLVNLYIPSRLHWKEKGLKL--------TLDTYFPESDTVTVRMDEIGSYTGMLLFRYP 492

Query: 558 VWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
            W  S  A   +NG+        G+++   +     D +T+    +L  +  +D+ P + 
Sbjct: 493 DWV-SGDAVVRINGKPAQTEAHKGSYIRLLDSVKSGDVITLVFTRNLYIDYAKDE-PHFG 550

Query: 617 SIQAILFGPYLLAG 630
           S   +++GP LLAG
Sbjct: 551 S---VMYGPILLAG 561


>gi|212695367|ref|ZP_03303495.1| hypothetical protein BACDOR_04914 [Bacteroides dorei DSM 17855]
 gi|212662096|gb|EEB22670.1| hypothetical protein BACDOR_04914 [Bacteroides dorei DSM 17855]
          Length = 807

 Score =  219 bits (557), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 178/554 (32%), Positives = 258/554 (46%), Gaps = 58/554 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L EV L D +  ++  L +       YLL LDVD L+   R++  L   G  YGGWE   
Sbjct: 54  LSEVELTDSYFKKAMDLHKG------YLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 104

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----------GTGY 207
            +  G   GHY+SA A M+AST    + +K++ ++  L ECQ +              GY
Sbjct: 105 -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 163

Query: 208 LSAFPTE--LFDSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
           L        L    E  +P W        +Y IHKILAGL D YV A   QA  +   + 
Sbjct: 164 LQLLQGNVVLNQPDETGQP-WNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLA 222

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           ++    +  +    + +    +L+ E GGMN+V   +YSIT D K L  A  F+    + 
Sbjct: 223 DF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIY 278

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            +A   D L   HAN  IP  +G    YE + + +Y      F +IV   H+ A GG S 
Sbjct: 279 PIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSC 338

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E +  P   +  L   + ETC TYNMLK+SR LF    +  Y +YYE AL N +L+ Q 
Sbjct: 339 YERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQD 398

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
              PG + Y   L  G  K  S     T F+SFWCC GTG+E+ SK  +SIYF++     
Sbjct: 399 PDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTGMENHSKYAESIYFKDNQE-- 451

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLSSLNL-RMP 557
            L +  YI S   WK   + L        + D Y   + T + +  E+G  + + L R P
Sbjct: 452 -LLVNLYIPSRLHWKEKGLKL--------TLDTYFPESDTVTVRMDEIGSYTGMLLFRYP 502

Query: 558 VWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
            W  S  A   +NG+        G+++   +     D +T+    +L  +  +D+ P + 
Sbjct: 503 DWV-SGDAVVRINGKPAQTEAHKGSYIRLLDSVKSGDVITLVFTRNLYIDYAKDE-PHFG 560

Query: 617 SIQAILFGPYLLAG 630
           S   +++GP LLAG
Sbjct: 561 S---VMYGPILLAG 571


>gi|265753023|ref|ZP_06088592.1| acetyl-CoA carboxylase [Bacteroides sp. 3_1_33FAA]
 gi|263236209|gb|EEZ21704.1| acetyl-CoA carboxylase [Bacteroides sp. 3_1_33FAA]
          Length = 797

 Score =  218 bits (556), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 178/554 (32%), Positives = 258/554 (46%), Gaps = 58/554 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L EV L D +  ++  L +       YLL LDVD L+   R++  L   G  YGGWE   
Sbjct: 44  LSEVELTDSYFKKAMDLHKG------YLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 94

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----------GTGY 207
            +  G   GHY+SA A M+AST    + +K++ ++  L ECQ +              GY
Sbjct: 95  -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 153

Query: 208 LSAFPTE--LFDSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
           L        L    E  +P W        +Y IHKILAGL D YV A   QA  +   + 
Sbjct: 154 LQLLQGNVVLNQPDETGQP-WNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLA 212

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           ++    +  +    + +    +L+ E GGMN+V   +YSIT D K L  A  F+    + 
Sbjct: 213 DF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIY 268

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            +A   D L   HAN  IP  +G    YE + + +Y      F +IV   H+ A GG S 
Sbjct: 269 PIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSC 328

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E +  P   +  L   + ETC TYNMLK+SR LF    +  Y +YYE AL N +L+ Q 
Sbjct: 329 YERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQD 388

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
              PG + Y   L  G  K  S     T F+SFWCC GTG+E+ SK  +SIYF++     
Sbjct: 389 PDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTGMENHSKYAESIYFKDNQE-- 441

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLS-SLNLRMP 557
            L +  YI S   WK   + L        + D Y   + T + +  E+G  + +L  R P
Sbjct: 442 -LLVNLYIPSRLHWKEKGLKL--------TLDTYFPESDTVTVRMDEIGSYTGTLLFRYP 492

Query: 558 VWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
            W  S  A   +NG+        G+++   +     D +T+    +L  +  +D+ P + 
Sbjct: 493 DWV-SGDAVVRINGEPAQTEAHKGSYIRLLDSVKSGDVITLVFTRNLYIDYAKDE-PHFG 550

Query: 617 SIQAILFGPYLLAG 630
           S   +++GP LLAG
Sbjct: 551 S---VMYGPILLAG 561


>gi|302539859|ref|ZP_07292201.1| putative secreted protein [Streptomyces hygroscopicus ATCC 53653]
 gi|302457477|gb|EFL20570.1| putative secreted protein [Streptomyces himastatinicus ATCC 53653]
          Length = 940

 Score =  218 bits (556), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 146/433 (33%), Positives = 214/433 (49%), Gaps = 36/433 (8%)

Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
           G+L+A+P   F + E++       VWAPYYT HKIL GLLD ++   +A+AL +A  M +
Sbjct: 389 GFLAAYPETQFITLESMTSGDYTVVWAPYYTAHKILRGLLDAHLATGDARALDLAMGMCD 448

Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           + Y+R+ K +   +++R W   +  E GG+ + +  LY+++   +HL LA LFD    + 
Sbjct: 449 WMYSRLSK-LPRSTLQRMWGIFSSGEFGGIVEAICDLYALSGKAQHLALARLFDLDKLID 507

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
             A   D L   HAN HIPI  G    Y+ T +  Y      F D+V  +  Y  GGTS 
Sbjct: 508 ACAAGDDTLDGLHANQHIPIFTGLVRLYDETEEERYLTAAKNFWDMVVPTRMYGIGGTSN 567

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
           REFW     +A TL     ETC  YNMLK+SR LF   ++ AY DYYERAL N VL  ++
Sbjct: 568 REFWGARGAIAKTLSDTTAETCCAYNMLKLSRMLFFHEQDPAYMDYYERALYNQVLGSKQ 627

Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
                E  ++ Y + L  G  +  +     T      CC GTG+ES +K  DS+YF+   
Sbjct: 628 DRADAEKPLVTYFIGLVPGHVRDYTPKAGTT------CCEGTGMESATKYQDSVYFKRAD 681

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR---MTLTFSSKQEVGQLSSLN 553
               LY+  Y  S+  W    + + Q          Y R    TLT   +        L 
Sbjct: 682 GT-ALYVNLYSPSTLTWAEKGITVTQSTG-------YPREQGSTLTVRGRTAA---FDLR 730

Query: 554 LRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
           LR+P W  ++G + ++NG+ +     PG++ S +  W   D + + +P  LR E   DD 
Sbjct: 731 LRVPAWA-TDGFRVTVNGRAVKGTWTPGSYASVSRTWRDGDTVRVDIPFRLRVEKALDD- 788

Query: 613 PEYASIQAILFGP 625
                +Q +  GP
Sbjct: 789 ---PRVQTLFHGP 798



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 56/110 (50%), Gaps = 6/110 (5%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
           L+  +  DV L ++SV    +Q  L++    DVD L+  FR  A L T G  A GGWE  
Sbjct: 50  LRPFNPEDVAL-RTSVFTAKRQLMLDFGRGYDVDRLLQVFRANAGLSTRGAVAPGGWEGL 108

Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
             E    LRGHF GH+L+  +Q +  T      +K+  +V +L E +  +
Sbjct: 109 DGEANGNLRGHFTGHFLTMLSQAYTGTGEKVYADKIRHMVGALDEVREAL 158


>gi|312131189|ref|YP_003998529.1| hypothetical protein Lbys_2513 [Leadbetterella byssophila DSM
           17132]
 gi|311907735|gb|ADQ18176.1| protein of unknown function DUF1680 [Leadbetterella byssophila DSM
           17132]
          Length = 737

 Score =  218 bits (556), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 170/555 (30%), Positives = 259/555 (46%), Gaps = 64/555 (11%)

Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
           + + L+ V L +  V   AQ  +L+Y+L LD D L+  +R  A L    + YG WE+  S
Sbjct: 18  QNIPLNQVKL-KEGVFKNAQDVDLKYILALDPDKLLAPYRIDAGLEKKAERYGNWES--S 74

Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT------ 213
            L GH  GHYLSA A ++AS+    +K+++  +V  L+ CQ K G GY+   P       
Sbjct: 75  GLDGHIGGHYLSALAMLYASSGEPELKKRLDYMVSELAACQKKNGNGYVGGIPQGKVFWE 134

Query: 214 -----ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMAT----WMVEYFYN 264
                ++  S   L   W P Y IHK+ AGL D Y    N +AL + T    WM+E F  
Sbjct: 135 RIGKGDIDGSSFGLNNTWVPLYNIHKLFAGLYDAYHFTGNNEALTVLTGLSDWMIELF-- 192

Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
                +T   VE+    L  E GG+N+    +YS T + K+L  A  F +  FL  +   
Sbjct: 193 ---SALTDEQVEK---VLRTEHGGLNEAFLDVYSATGEQKYLRAAERFTQKAFLQPMIEG 246

Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
            D L+  HANT IP ++G++   +VT +  +    ++F D V    S A GG S RE + 
Sbjct: 247 KDILTGLHANTQIPKMVGAEKISQVTKNQDWHKGASYFWDNVALHRSVAFGGNSYREHFH 306

Query: 385 DPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
           +  R    L + +  ETC +YNMLK+S+ L+  T +  Y D+YE+ L N +LS Q   E 
Sbjct: 307 ELDRFDKMLETNQGPETCNSYNMLKLSKALYESTGDNKYLDFYEKTLFNHILSSQH-PEK 365

Query: 444 GVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
           G  +Y  P+       R  H   +     S WCC GTG+E+ +K G+ I+    G    L
Sbjct: 366 GGFVYFTPI-------RPNHYRVYSQPETSMWCCVGTGLENHTKYGEMIFSRRAGV---L 415

Query: 502 YIIQYISSSFDWKSGH-VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
            +   I++  +   GH V L+ K        PY    +       V    ++  R+P W 
Sbjct: 416 QVNLLIAAKLE---GHSVTLDTKY-------PYENTAVL-----RVDGEKTVKWRIPAWM 460

Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
             +  + ++NG+ +       F        +      ++ LS + +  Q+  P      A
Sbjct: 461 --DEVKFTVNGKKVNPKMESGFA------VFTGLKKAEIHLSFQPKMGQEFLPNDQKWAA 512

Query: 621 ILFGPYLLAGHTSGE 635
             +GP +LA  TS E
Sbjct: 513 FTYGPLVLAAETSKE 527


>gi|315498334|ref|YP_004087138.1| hypothetical protein Astex_1314 [Asticcacaulis excentricus CB 48]
 gi|315416346|gb|ADU12987.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
           48]
          Length = 774

 Score =  218 bits (555), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 168/552 (30%), Positives = 255/552 (46%), Gaps = 79/552 (14%)

Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYL 170
           + S+   + + N  YLL L  D  + +FRK A L   G+ YGGWE     + GH +GHYL
Sbjct: 44  KPSIFLTSIEANQRYLLSLSPDRFLHNFRKGAGLEPKGEVYGGWE--ARGIAGHSLGHYL 101

Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA-------------------- 210
           S  + M+A T     +++ + V+  L   Q K   GY                       
Sbjct: 102 SGLSLMYAQTGKPEFRDRAAHVLSELKTIQAKHSDGYAGGTTVGRNGQEVDGKVVYEELR 161

Query: 211 ---FPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
                T  FD    L   W P YT HK+ AG LD +  A  A AL +AT + +Y    ++
Sbjct: 162 KGDIRTSGFD----LNGGWVPLYTYHKVFAGALDAHQYAGLADALIVATGLGDYLGTILE 217

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
            +      E     L  E GG+ +    LY+ T + + L L+        +  LA   D 
Sbjct: 218 SLSDAQIQE----ILRAEHGGLTESYAELYARTKNQRWLTLSQRLRHRAIVDPLAAGHDE 273

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L+  HANT IP ++GS   +E+T +     I  FF   V+  HSY  GG S  E +  P+
Sbjct: 274 LAGKHANTQIPKIVGSARLFELTQNADDARIARFFWQTVSRDHSYVIGGNSDHEHFGAPR 333

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
           +LA  L  +  E C +YNML+++RHL+ W+ + A  D+YER   N ++S Q+  + G+  
Sbjct: 334 QLASRLDQQTCEACNSYNMLRLTRHLYGWSGDAALFDFYERTHLNHIMS-QQDPQTGMFT 392

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGNVPGLYIIQY 506
           Y   L  G+ +  S        N FWCC G+G+ES SK G+SIY++  EG    LY    
Sbjct: 393 YFTGLASGLGRVHS-----DPTNDFWCCVGSGMESHSKHGESIYWKRGEGVAVNLYYAST 447

Query: 507 IS---SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS- 562
           ++   +  + ++   + +Q V           +T+  + K       +L+LR+P W  + 
Sbjct: 448 LNAPETQLEMETAFPLSDQVV-----------ITVHKAPK-------ALDLRVPGWCDTP 489

Query: 563 ----NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
               NG  A + GQ       G +L  T      D++ + L + +R EA+ DD    A +
Sbjct: 490 VLRVNGKAAGV-GQ-------GGYLRLTG-LKNGDRIELCLAMHVRVEAMPDD----AKL 536

Query: 619 QAILFGPYLLAG 630
            A L GP +LAG
Sbjct: 537 IAFLSGPLVLAG 548


>gi|224537186|ref|ZP_03677725.1| hypothetical protein BACCELL_02063 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521241|gb|EEF90346.1| hypothetical protein BACCELL_02063 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 805

 Score =  218 bits (554), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 161/536 (30%), Positives = 245/536 (45%), Gaps = 45/536 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A   N++ LL  D D L+  F + A LP   + YG WE     L GH  GHYL+A A  +
Sbjct: 44  ACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEK--DGLDGHIGGHYLTALAIHY 101

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
           A+T N   K++M  +V   +  Q   G G +  FP     + E  K         W  +Y
Sbjct: 102 AATGNLECKKRMDYMVSEFARVQQANGDGSICGFPNSKKFAEEIRKGNVGIVWNYWVAWY 161

Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            +HK  AGL D ++   N +A    LK   W V+   N   +      +ER    L+ E 
Sbjct: 162 NMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISNLDDR-----QMER---MLDNEF 213

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGMN+V    + +T +PK+L  A  F        +A + D L + HANT +P  +G Q  
Sbjct: 214 GGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMARRIDNLDNKHANTQVPKAVGYQRV 273

Query: 347 YEVTGD--PLYKLIGT---FFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEET 400
            E+     P Y    T   FF + V +  S + GG S  E + +  + +D +   +  E+
Sbjct: 274 AELNSKIAPDYNDFMTAAEFFWETVVSHRSLSLGGNSRGEHFPEAGKCSDYMHERQGPES 333

Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
           C T NMLK++  LFR   ++ YAD+YERA+ N +LS Q   E G  +Y  P      +  
Sbjct: 334 CNTNNMLKLTEGLFRMHPKVEYADFYERAMYNHILSTQH-PEHGGYVYFTPACPSHYRVY 392

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
           S  G      + WCC GTG+E+  K G  IY  +  +   LY+  +I S  +WK   + +
Sbjct: 393 SAPG-----KAMWCCVGTGMENHGKYGQFIYTHDMAD-NALYVNLFIPSELNWKEKKIKI 446

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-P 579
            Q+ D      P    T    +  +  Q   L +R P W      Q   NG +      P
Sbjct: 447 VQETDF-----PNEEGTTLTVNPSKATQFKLL-IRYPSWVEQGKMQVVCNGVDYAKSAQP 500

Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           G++++   +WS  D + ++ P++++ E +    P   +  +I+ GP LL   T  E
Sbjct: 501 GSYIAIDRQWSKGDVVEVKTPMTVKIEEL----PNVPNAISIMRGPILLGARTGTE 552


>gi|423223044|ref|ZP_17209513.1| hypothetical protein HMPREF1062_01699 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392640313|gb|EIY34115.1| hypothetical protein HMPREF1062_01699 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 805

 Score =  217 bits (553), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 161/536 (30%), Positives = 244/536 (45%), Gaps = 45/536 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A   N++ LL  D D L+  F + A LP   + YG WE     L GH  GHYL+A A  +
Sbjct: 44  ACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEK--DGLDGHIGGHYLTALAIHY 101

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
           A+T N   K++M  +V   +  Q   G G +  FP     + E  K         W  +Y
Sbjct: 102 AATGNLECKKRMDYMVSEFARVQQANGDGSICGFPNSKKFAEEIRKGNVGIVWNYWVAWY 161

Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            +HK  AGL D ++   N +A    LK   W V+   N   +      +ER    L+ E 
Sbjct: 162 NMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISNLDDR-----QMER---MLDNEF 213

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGMN+V    + +T +PK+L  A  F        +A   D L + HANT +P  +G Q  
Sbjct: 214 GGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMARHIDNLDNKHANTQVPKAVGYQRV 273

Query: 347 YEVTGD--PLYKLIGT---FFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEET 400
            E+     P Y    T   FF + V +  S + GG S  E + +  + +D +   +  E+
Sbjct: 274 AELNSKTAPDYNDFMTAAEFFWETVVSHRSLSLGGNSRGEHFPEAGKCSDYMHERQGPES 333

Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
           C T NMLK++  LFR   ++ YAD+YERA+ N +LS Q   E G  +Y  P      +  
Sbjct: 334 CNTNNMLKLTEGLFRMHPKVEYADFYERAMYNHILSTQH-PEHGGYVYFTPACPSHYRVY 392

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
           S  G      + WCC GTG+E+  K G  IY  +  +   LY+  +I S  +WK   + +
Sbjct: 393 SAPG-----KAMWCCVGTGMENHGKYGQFIYTHDMAD-NALYVNLFIPSELNWKEKKIKI 446

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-P 579
            Q+ D      P    T    +  +  Q   L +R P W      Q   NG +      P
Sbjct: 447 VQETDF-----PNEEGTTLTVNPSKATQFKLL-IRYPSWVEQGKMQVVCNGVDYAKSAQP 500

Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           G++++   +WS  D + ++ P++++ E +    P   +  +I+ GP LL   T  E
Sbjct: 501 GSYIAIDRQWSKGDVVEVKTPMTVKIEEL----PNVPNAISIMRGPILLGARTGTE 552


>gi|451820300|ref|YP_007456501.1| acetyl-CoA carboxylase, biotin carboxylase [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
 gi|451786279|gb|AGF57247.1| acetyl-CoA carboxylase, biotin carboxylase [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
          Length = 766

 Score =  216 bits (550), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 159/522 (30%), Positives = 252/522 (48%), Gaps = 38/522 (7%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           +Q    +Y+L LDVD  +    +   L    K Y GWE     + GH +GH++SA A  +
Sbjct: 24  SQDLGEKYILSLDVDRFLAPCYEAHGLEPKKKRYSGWE--ARAISGHSLGHFMSALAVTY 81

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP----TELFDSFEA----LKPVWAPY 229
            +T N  +K+ +   V  LS  Q   G GY+         E+ D        +   W P+
Sbjct: 82  QATGNEELKKILDYAVSELSHIQQVTGRGYIGGLVETPFVEIIDGTNIGKFDINGYWVPW 141

Query: 230 YTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGM 289
           Y+IHKI  GL+D Y LA+N++AL +    V  F +    ++   S E+    L  E GGM
Sbjct: 142 YSIHKIYKGLIDAYELAENSEALNV----VVNFADWAVSILNQMSDEQVQAMLECEHGGM 197

Query: 290 NDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG-SQMRYE 348
           N +  +LY  T +  +L  A  F     +  L    D L   HANT IP +IG +++  +
Sbjct: 198 NHIFAKLYGFTCNSIYLDTAVRFSHKAIVEPLEQCVDDLQGKHANTQIPKIIGIAEIYNQ 257

Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLK 408
                 YK    FF + V    SY  GG S +E +       ++LG +  E+C T+NML 
Sbjct: 258 EHAYEKYKTAAQFFWNTVVNRRSYVIGGNSLKEHF--EAIDMESLGIKTAESCNTHNMLL 315

Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
           +++ LF W    AY DYYE AL N ++  Q     G   Y   L  G  +  S     TK
Sbjct: 316 LTKLLFSWNHYSAYMDYYENALFNHIIGTQ-DCHTGNKTYFTSLLPGHYRIYS-----TK 369

Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
             ++WCC GTG+E+  K  ++IYF+E+ +   LY+  +ISS FDW++  + + Q+ +   
Sbjct: 370 DTAWWCCTGTGMENPGKYAEAIYFQEQDD---LYVNLFISSQFDWEAKGLTIRQESNLPY 426

Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATER 588
           S    L++      K E    +++N+R+P W  S    A +NG++  +     +L+ +  
Sbjct: 427 SDTVILKI---IEGKAE----ANINIRVPSWITSELV-AVVNGKDRFVQREKGYLTVSGA 478

Query: 589 WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           W   +++ I  P+++     +D+    A   A  +GP +LAG
Sbjct: 479 WDKGNEIRITFPMAVSKYTSKDN----AGKIAFTYGPVVLAG 516


>gi|307109022|gb|EFN57261.1| hypothetical protein CHLNCDRAFT_143813 [Chlorella variabilis]
          Length = 349

 Score =  216 bits (549), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 124/267 (46%), Positives = 156/267 (58%), Gaps = 10/267 (3%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
           SL DV L + S   R  + N EYLL L+ D L+++FRKTA LP PG +YGGWE    E+R
Sbjct: 27  SLADVQLARGSEYARNFEQNSEYLLALEPDRLLYNFRKTAGLPAPGASYGGWEWSGVEIR 86

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEAL 222
           GHFVGHYLSA A     +    ++E+   +V  L + Q+  GTGYLSAFP   FD  EAL
Sbjct: 87  GHFVGHYLSALALATLHSGRPELRERCGVMVSELKKVQDAAGTGYLSAFPESHFDRLEAL 146

Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
           +PV       HKILAGLLDQ+ L   A AL  A  M  +F  RV+ V+     + HW+ +
Sbjct: 147 QPV-------HKILAGLLDQHRLVGTAGALGAARRMASHFCARVRAVVAANGTD-HWHRV 198

Query: 283 NE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
            E E GGMN+ LY LY+IT  P+H   AH FDKP F   LA   D L   HANTH+  V 
Sbjct: 199 LEVEFGGMNEALYNLYAITKSPEHAECAHFFDKPAFFRPLAEGRDPLPGLHANTHMAQVP 258

Query: 342 GSQMRYEVTGDPLYKL-IGTFFMDIVN 367
           G   RYE+ GD   ++   TFF  ++ 
Sbjct: 259 GFTARYELLGDGEAQVAAATFFGTLLQ 285


>gi|427383714|ref|ZP_18880434.1| hypothetical protein HMPREF9447_01467 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728419|gb|EKU91277.1| hypothetical protein HMPREF9447_01467 [Bacteroides oleiciplenus YIT
           12058]
          Length = 791

 Score =  216 bits (549), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 161/528 (30%), Positives = 243/528 (46%), Gaps = 48/528 (9%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A   N++ LL  DVD L+  F K A L   G+++  WE     L GH  GHYLSA A  +
Sbjct: 46  ACDLNVQILLQYDVDRLLAPFLKEAGLQPKGESFPNWEG----LDGHVGGHYLSALAIHY 101

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
           A+T N   K++M  ++  L  CQ K   GY+   P  +    E  K         W P+Y
Sbjct: 102 AATGNVDCKKRMEYMISELKRCQQKHADGYVGGVPDGMKVWNEIKKGNVGIVWKYWVPWY 161

Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
            +HKI AGL D ++   N +A  M   + ++       +I   + E+    L  E GGM+
Sbjct: 162 NLHKIYAGLRDAWIYGGNEEARMMFLELCDWG----MTIIAPLNDEQMEQMLANEFGGMD 217

Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           +V    Y +T D K+L  A  F     L  +A Q D L + HANT +P V+G Q   E+ 
Sbjct: 218 EVYADAYQMTGDMKYLNTAKRFSHKWLLDSMAAQVDNLDNKHANTQVPKVVGYQRIAELG 277

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-----ENEETCTTYN 405
            D  Y++   +F + V  + S + GG S RE +      AD   S     E  E+C T N
Sbjct: 278 HDKKYEVATEYFWNTVVYNRSLSLGGNSRREHF----AAADDCKSYVEDREGPESCNTNN 333

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH-- 463
           MLK++  LFR   E  YAD+YERA+ N +LS Q   E G  +Y        + AR  H  
Sbjct: 334 MLKLTEGLFRMHPEARYADFYERAMYNHILSTQH-PEHGGYVYF-------TSARPAHYR 385

Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
            +    ++ WCC GTG+E+  K G+ IY     +   L++  +++S  +WK   + L Q+
Sbjct: 386 VYSAPNSAMWCCVGTGMENHGKYGEFIYTHAHDS---LFVNLFVASELNWKEKGITLIQE 442

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNF 582
                  +   R+T+      +      L +R P W   N  +    G++      P ++
Sbjct: 443 TR--FPDEESSRLTIRVKKPTKF----KLLVRHPWWADGNDMKVLCKGKDYASGSSPSSY 496

Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           +     W   D + I  P+ +  EA+    P  +   +I+ GP LL  
Sbjct: 497 IVIERTWKNGDVVDITTPMKVHIEAL----PNVSEYISIMRGPILLGA 540


>gi|312131938|ref|YP_003999278.1| hypothetical protein Lbys_3265 [Leadbetterella byssophila DSM
           17132]
 gi|311908484|gb|ADQ18925.1| protein of unknown function DUF1680 [Leadbetterella byssophila DSM
           17132]
          Length = 1004

 Score =  215 bits (548), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 184/613 (30%), Positives = 268/613 (43%), Gaps = 99/613 (16%)

Query: 123 LEYLLMLDVDSLVWSFRKT--ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAST 180
           ++ L   D +S ++ FR       P   K  G W++  ++LRGH  GHYL+A AQ +AST
Sbjct: 385 IQGLAKTDPNSFLYMFRHAFGQKQPEGAKPLGVWDSQNTKLRGHATGHYLTAIAQAYAST 444

Query: 181 H-----NATIKEKMSTVVFSLSECQNKIGT------------------------------ 205
                  A    KM  +V +L E     GT                              
Sbjct: 445 GYDKNLQANFAGKMDQLVNTLYELSRLSGTPKVQGGEAVADPTKVPMGPGKTEYDSDLTD 504

Query: 206 ------------GYLSAFPTELFDSFEA-------LKPVWAPYYTIHKILAGLLDQYVLA 246
                       GY+SA+P + F   E           VWAPYYT+HKILAGL+D Y ++
Sbjct: 505 EGIRTDYWNWGKGYISAYPPDQFIMLEQGAKYGGQKNQVWAPYYTLHKILAGLMDVYEVS 564

Query: 247 DNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKH 305
            N +AL +A  M E+ + R+   +   ++ + W + +  E GGMN+ + RL+ +T + K 
Sbjct: 565 GNKKALDVAVGMSEWVHARL-AALPQDTLIKMWNTYIAGEYGGMNESMARLFFLTKNEKF 623

Query: 306 LLLAHLFDK-PCFLG------FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLI 358
           L  A LFD    F G       LA   D     HAN HIP ++GS   Y V+ +P Y  I
Sbjct: 624 LKTAQLFDNIKMFYGDASHSHGLARNVDTFRGLHANQHIPQIVGSIEMYAVSQNPDYYFI 683

Query: 359 GTFFMDIVNASHSYATGGTS-------AREFWWDPKRLAD---TLGSENEETCTTYNMLK 408
              F     + + Y+ GG +       A  F   P  + +   + G +N ETC TYNMLK
Sbjct: 684 AENFWHRTVSDYMYSIGGVAGARNPANAECFIAQPATIYENGFSQGGQN-ETCATYNMLK 742

Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
           ++  LF + ++  Y DYYER L N +L+      P    Y +PL  G  K          
Sbjct: 743 LTSSLFMFDQKAEYMDYYERGLYNHILASVAKDSP-ANTYHVPLRPGSIKQFGN----PN 797

Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
              F CC GT IES +KL +SIYF+   N   LY+  +I S+ +W+   + + Q      
Sbjct: 798 MTGFTCCNGTAIESNTKLQNSIYFKSLDNST-LYVNLFIPSTLNWEEKGIKVVQTTSFPK 856

Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATE 587
                LR+        E      L +R+P W    G    +NG+   +   PG++   + 
Sbjct: 857 EDQTKLRI--------EGNGKFDLQVRVPGWA-KKGFVVKINGKKQKIKATPGSYAKISR 907

Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS---GEWDIKTGTAR 644
            W   D L I +P     + +  D+P  AS   + +GP LLA   +    EW   T  A+
Sbjct: 908 TWKNGDVLEITMPFEFHLDYVM-DQPNIAS---LFYGPVLLAAQETEARKEWRQVTFDAK 963

Query: 645 SLSALISPIPPSF 657
            LS  I   P + 
Sbjct: 964 DLSKNIKGNPETL 976


>gi|326801658|ref|YP_004319477.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326552422|gb|ADZ80807.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
          Length = 790

 Score =  214 bits (546), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 158/542 (29%), Positives = 246/542 (45%), Gaps = 54/542 (9%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ+T+L Y+L L+ D L+  + + A L     +YG WEN  + L GH  GHYLSA + M 
Sbjct: 51  AQETDLRYILALNPDRLLAPYLREAGLEPKASSYGNWEN--TGLDGHIGGHYLSALSLMA 108

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE---------ALKPVW 226
           A+T N  I+++++ ++  L  CQ++   GY+   P   ++++  +         +L   W
Sbjct: 109 AATGNHAIQDRLTYMLSELKRCQDQDSDGYVGGIPGGKQMWNDIKRGKIEAQSFSLNGKW 168

Query: 227 APYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
            P Y IHK+ AGL+D Y    N  A    LK+  W +  F     + I           L
Sbjct: 169 VPIYNIHKLFAGLIDAYRYTGNEHARQMVLKLGKWWLSVFGGLTDEQIQTI--------L 220

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
             E GG+N+V   L  I+ D K+L +A        L  L    D L+  HANT IP VIG
Sbjct: 221 RSEHGGINEVFADLAQISGDQKYLTMAKRLSHRAILQPLIAGKDELTGLHANTQIPKVIG 280

Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETC 401
            +    +     +     FF + V    + + GG S  E +         L S E  ETC
Sbjct: 281 FEKIAALADSMSWANAARFFWETVVEHRTVSIGGNSESEHFHALNSFGKMLSSREGPETC 340

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG----RGVS 457
            TYNM+K+S+ LF    +  + DYYERA  N +LS Q   E G  +Y  P+     R  S
Sbjct: 341 NTYNMMKLSKDLFLQGPDRKFIDYYERATYNHILSSQHPKEGG-FVYFTPMRPNHYRVYS 399

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
           +A++          FWCC G+G+E+  K G+ IY     +   LYI  +I S+  W+   
Sbjct: 400 QAQAC---------FWCCVGSGLENHGKYGELIYTHSGQD---LYINLFIPSTLKWQEQG 447

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
           + L Q+      ++    +T+  ++ +      S+ +R P W         +NG+ +   
Sbjct: 448 ISLTQRTR--FPYEQKSSVTIEVANPKTF----SVFIRKPKWLGKQPINLLVNGKQISYQ 501

Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWD 637
               +L    +W     +T  LP+ +  E +    P      +  +GP +LA     E D
Sbjct: 502 EDKGYLKINRKWVGQSIITFNLPMQINAELLPSGEPWV----SYTYGPIVLASKNGTE-D 556

Query: 638 IK 639
           +K
Sbjct: 557 LK 558


>gi|423230906|ref|ZP_17217310.1| hypothetical protein HMPREF1063_03130 [Bacteroides dorei
           CL02T00C15]
 gi|423244617|ref|ZP_17225692.1| hypothetical protein HMPREF1064_01898 [Bacteroides dorei
           CL02T12C06]
 gi|392630026|gb|EIY24028.1| hypothetical protein HMPREF1063_03130 [Bacteroides dorei
           CL02T00C15]
 gi|392641466|gb|EIY35242.1| hypothetical protein HMPREF1064_01898 [Bacteroides dorei
           CL02T12C06]
          Length = 797

 Score =  214 bits (546), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 177/554 (31%), Positives = 257/554 (46%), Gaps = 58/554 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L EV L D +  ++  L +       YLL LDVD L+   R++  L   G  YGGWE   
Sbjct: 44  LSEVELTDSYFKKAMDLHKG------YLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 94

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----------GTGY 207
            +  G   GHY+SA A M+AST    + +K++ ++  L ECQ +              GY
Sbjct: 95  -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 153

Query: 208 LSAFPTE--LFDSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
           L        L    E  +P W        +Y IHKILAGL D YV A   QA  +   + 
Sbjct: 154 LQLLQGNVVLNQPDETGQP-WNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLA 212

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           ++    +  +    + +    +L+ E GGMN+V   +YSIT D K L  A  F+    + 
Sbjct: 213 DF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIY 268

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            +A   D L   HAN  IP  +G    YE + + +Y      F +IV   H+ A GG S 
Sbjct: 269 PIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSC 328

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E +      +  L   + ETC TYNMLK+SR LF    +  Y +YYE AL N +L+ Q 
Sbjct: 329 YERFGVLGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQD 388

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
              PG + Y   L  G  K  S     T F+SFWCC GTG+E+ SK  +SIYF++     
Sbjct: 389 PDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTGMENHSKYAESIYFKDNQE-- 441

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLS-SLNLRMP 557
            L +  YI S   WK   + L        + D Y   + T + +  E+G  + +L  R P
Sbjct: 442 -LLVNLYIPSRLHWKEKGLKL--------TLDTYFPESDTVTVRMDEIGSYTGTLLFRYP 492

Query: 558 VWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
            W  S  A   +NG+        G+++   +     D +T+    +L  +  +D+ P + 
Sbjct: 493 DWV-SGDAVVRINGEPAQTEAHKGSYIRLLDSVKSGDVITLVFTRNLYIDYAKDE-PHFG 550

Query: 617 SIQAILFGPYLLAG 630
           S   +++GP LLAG
Sbjct: 551 S---VMYGPILLAG 561


>gi|237711613|ref|ZP_04542094.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
 gi|229454308|gb|EEO60029.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
          Length = 770

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 177/554 (31%), Positives = 257/554 (46%), Gaps = 58/554 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L EV L D +  ++  L +       YLL LDVD L+   R++  L   G  YGGWE   
Sbjct: 17  LSEVELTDSYFKKAMDLHKG------YLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 67

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----------GTGY 207
            +  G   GHY+SA A M+AST    + +K++ ++  L ECQ +              GY
Sbjct: 68  -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 126

Query: 208 LSAFPTE--LFDSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
           L        L    E  +P W        +Y IHKILAGL D YV A   QA  +   + 
Sbjct: 127 LQLLQGNVVLNQPDETGQP-WNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLA 185

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           ++    +  +    + +    +L+ E GGMN+V   +YSIT D K L  A  F+    + 
Sbjct: 186 DF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIY 241

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            +A   D L   HAN  IP  +G    YE + + +Y      F +IV   H+ A GG S 
Sbjct: 242 PIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSC 301

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E +      +  L   + ETC TYNMLK+SR LF    +  Y +YYE AL N +L+ Q 
Sbjct: 302 YERFGVLGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQD 361

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
              PG + Y   L  G  K  S     T F+SFWCC GTG+E+ SK  +SIYF++     
Sbjct: 362 PDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTGMENHSKYAESIYFKDNQE-- 414

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLS-SLNLRMP 557
            L +  YI S   WK   + L        + D Y   + T + +  E+G  + +L  R P
Sbjct: 415 -LLVNLYIPSRLHWKEKGLKL--------TLDTYFPESDTVTVRMDEIGSYTGTLLFRYP 465

Query: 558 VWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
            W  S  A   +NG+        G+++   +     D +T+    +L  +  +D+ P + 
Sbjct: 466 DWV-SGDAVVRINGEPAQTEAHKGSYIRLLDSVKSGDVITLVFTRNLYIDYAKDE-PHFG 523

Query: 617 SIQAILFGPYLLAG 630
           S   +++GP LLAG
Sbjct: 524 S---VMYGPILLAG 534


>gi|189464749|ref|ZP_03013534.1| hypothetical protein BACINT_01093 [Bacteroides intestinalis DSM
           17393]
 gi|189437023|gb|EDV06008.1| hypothetical protein BACINT_01093 [Bacteroides intestinalis DSM
           17393]
          Length = 805

 Score =  212 bits (539), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 159/536 (29%), Positives = 240/536 (44%), Gaps = 45/536 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           A   N++ LL  D D L+  F + A LP   + YG WE     L GH  GHYLSA A  +
Sbjct: 44  ACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEK--DGLDGHIGGHYLSALAIHY 101

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
           A+T N   K++M  +V   +  Q     G +  FP     + E  K         W  +Y
Sbjct: 102 AATGNQECKKRMDYMVSEFARVQQANDDGSICGFPNSKKFAEEIRKGNVGIVWNYWVAWY 161

Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            +HK  AGL D ++   N +A    LK   W V+   N   +      +ER    L+ E 
Sbjct: 162 NMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISNLDDR-----QMER---MLDNEF 213

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGMN+V    + +T +PK+L  A  F        +  + D L + HANT +P  +G Q  
Sbjct: 214 GGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMTRRIDNLDNKHANTQVPKAVGYQRV 273

Query: 347 YEVTGDPL-----YKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEET 400
            E+          +     FF + V    S + GG S  E + +  + +D +   +  E+
Sbjct: 274 AELNSKTASDYNEFMTAAEFFWETVVFHRSLSLGGNSRGEHFPEAGKCSDYMHERQGPES 333

Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
           C T NMLK++  LFR   ++ YAD+YERAL N +LS Q   E G  +Y  P      +  
Sbjct: 334 CNTNNMLKLTEGLFRIHPKVEYADFYERALYNHILSTQH-PEHGGYVYFTPACPSHYRVY 392

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
           S  G      + WCC GTG+E+  K G  IY  +  +   LY+  +I S  +WK   + +
Sbjct: 393 SAPG-----EAMWCCVGTGMENHGKYGQFIYTHDTVD-NALYVNLFIPSELNWKEKKIKI 446

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPP 579
            Q+ D      P    T    +  +  Q   L +R P W      Q   +G +      P
Sbjct: 447 VQETDF-----PNEEGTTLTVNPSKATQFKLL-IRYPSWVEQGKMQVVCDGVDYAKNAQP 500

Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           G++++   +WS  D + I+ P+++R E +    P   +  +I+ GP LL   T  E
Sbjct: 501 GSYIAIDRQWSKGDVVEIKTPMTVRIEEL----PNVPNAISIMRGPILLGARTGTE 552


>gi|393782713|ref|ZP_10370896.1| hypothetical protein HMPREF1071_01764 [Bacteroides salyersiae
           CL02T12C01]
 gi|392672940|gb|EIY66406.1| hypothetical protein HMPREF1071_01764 [Bacteroides salyersiae
           CL02T12C01]
          Length = 796

 Score =  208 bits (530), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 172/547 (31%), Positives = 249/547 (45%), Gaps = 47/547 (8%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
           SL DV L  S +   A   +  YLL LDVD L+   R+   L    + YGGWE       
Sbjct: 41  SLSDVKL-TSGIFKGAMDLHKGYLLSLDVDRLIPHVRRNVGLTGKNENYGGWETH----G 95

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN-----------KIGTGYLSAF 211
           G   GHY+SA A M+AST     ++++  ++  L ECQ            +   GY    
Sbjct: 96  GCTYGHYMSACAMMYASTGEKIFRDRLEYMMDELKECQQQTQDGWFISGERAKEGYRKLL 155

Query: 212 PTELF-DSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYN 264
             E+F +  +  K  W        +Y IHK+LAGL D Y+ A   +A ++   + ++   
Sbjct: 156 HGEVFLNRPDETKQPWNYNQNGNSWYCIHKVLAGLRDVYLYAGIQKAKEILMPLADF--- 212

Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
            +  +    + +    +L+ E GGMN+V   +Y+ T D K+L  A  F+    +  +A  
Sbjct: 213 -IADIALNSNKDLFQSTLSVEQGGMNEVFTDIYAFTGDYKYLETACRFNHINVIYPVANG 271

Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
            D L   HAN  IP  IG    Y      +Y+     F D+V  +H+ A GG S  E + 
Sbjct: 272 EDVLFGRHANDQIPKFIGVAKEYAYDTKEIYRKAAENFWDMVVNNHTLAIGGNSCYERFG 331

Query: 385 DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
            P   +  L   + ETC TYNMLK+SR LF    +  Y +YYE AL N +L+ Q     G
Sbjct: 332 MPGEESKRLDYSSAETCNTYNMLKLSRLLFMMNGDYKYLNYYEHALYNHILASQDPDMAG 391

Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
            + Y   L  G  K  S     T ++SFWCC GTG+E+ +K  +SIYF+   N   L I 
Sbjct: 392 CVTYYTSLLPGSFKQYS-----TPYDSFWCCVGTGMENHAKYAESIYFK---NGNSLLIN 443

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            YI S  +WK     L    D   S       T++     +     S+ LR P W   N 
Sbjct: 444 LYIPSELNWKEQGFRLRLDTDFPES------DTISVCVVDKGRFSGSVMLRYPEWVEGN- 496

Query: 565 AQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
            +  LNG+ + L      ++   +     D + I LP  L     +D+ P + S   I++
Sbjct: 497 PEMMLNGRPVKLEYGKKEYIRLPDSIKSGDTIKIVLPRKLSVRYAKDE-PHFGS---IMY 552

Query: 624 GPYLLAG 630
           GP LLAG
Sbjct: 553 GPILLAG 559


>gi|383777661|ref|YP_005462227.1| hypothetical protein AMIS_24910 [Actinoplanes missouriensis 431]
 gi|381370893|dbj|BAL87711.1| hypothetical protein AMIS_24910 [Actinoplanes missouriensis 431]
          Length = 939

 Score =  207 bits (527), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 150/490 (30%), Positives = 236/490 (48%), Gaps = 48/490 (9%)

Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSF 219
           ELRG+   +    +     +  +A+ ++  + V+  +         G+L+A+P   F   
Sbjct: 355 ELRGNLAWYRFDETEGT--TVADASGRDWDAAVITGVGGAPGPSHAGFLAAYPETQFVLL 412

Query: 220 EALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
           E L     +WAPYYT HKI+ GLLD + L  NA AL +   M E+ ++R+ K +    ++
Sbjct: 413 EQLTTYPAIWAPYYTCHKIMRGLLDAHTLGGNATALDVVRGMGEWAHSRLSK-LPREQLD 471

Query: 277 RHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
           R W   +  E GGMN+V+  L ++T +   L  A  FD    L       D L   HAN 
Sbjct: 472 RMWALYIAGEYGGMNEVMVDLATLTGNKTFLETARFFDNTKLLADCVADIDSLDGKHANQ 531

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL-G 394
           HIP  +G    YE   D  Y+     F D+V    +Y  GGT   E +     +A ++  
Sbjct: 532 HIPQFLGYLRLYENGADKTYRTAAANFFDMVVPHRTYMHGGTGQGEVFRKRDVIAGSIVN 591

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRG----TEPGVMIYML 450
           + N E+C  YNMLKV+R+LF    +  + DYYE+AL N +L+ +R     T+P ++ YM+
Sbjct: 592 TTNAESCAAYNMLKVARNLFSHAPDGRFMDYYEKALVNQILASRRDVDSTTDP-LVTYMV 650

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           P+G G  +     G+G   N   CC GTG+E+ +K  D+I+F        LY+  YI S+
Sbjct: 651 PVGPGARR-----GYG---NIGTCCGGTGLENHTKYQDTIWF-RSAKSDTLYVNLYIPST 701

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-------TYSN 563
            +W +  + + Q  D   S  P   +T+T S++ +      L LR+P W       T ++
Sbjct: 702 LNWAAKKLTVTQTGDYPRS--PETTLTITGSARLD------LRLRVPSWADDDFSVTVNS 753

Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
             Q    G++        ++S    W   D +T+  P  L  E   DD     S+QA+L+
Sbjct: 754 KIQRVRAGRD-------GYVSLDRHWRSGDTITVSSPYRLHVERALDD----PSLQALLY 802

Query: 624 GPYLLAGHTS 633
           GP  L   ++
Sbjct: 803 GPLALVAKST 812



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 49/90 (54%), Gaps = 1/90 (1%)

Query: 113 SVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLS 171
           S+    +   L Y    D D +V +FR  A L   G +  GGW++    LRGH+ GH++S
Sbjct: 79  SIFTEKRDRILAYARAYDADRIVSNFRTAAGLDNRGAQPPGGWDDATGNLRGHYSGHFIS 138

Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQN 201
             AQ WA T  A  KEK+  +V +L ECQ+
Sbjct: 139 MLAQAWADTGEAIFKEKLDYIVTALKECQD 168


>gi|393782709|ref|ZP_10370892.1| hypothetical protein HMPREF1071_01760 [Bacteroides salyersiae
           CL02T12C01]
 gi|392672936|gb|EIY66402.1| hypothetical protein HMPREF1071_01760 [Bacteroides salyersiae
           CL02T12C01]
          Length = 673

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 167/601 (27%), Positives = 259/601 (43%), Gaps = 100/601 (16%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-------- 150
            +   L +V L       R Q  + +Y+  L+ D  +  FR+ A +    K         
Sbjct: 34  FRSFGLDEVRLKDREFKLR-QNHDFDYIRTLEPDRYLSPFRRNAGIEVDSKGIPVDNTKH 92

Query: 151 YGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-------I 203
           Y GWE     L     GHYLSA + M+  T + T+  K++ ++  L+  Q         +
Sbjct: 93  YDGWEF----LGSSTFGHYLSAISMMYKVTGDTTLLHKINYIIDELNFIQRNPSYENENL 148

Query: 204 GTGYLSAFPTE---------LFDSFEALKP-----VWAP--------------------- 228
             G L AF  +            +++ L+        AP                     
Sbjct: 149 RHGALVAFDRDRHKHVREPNFLRTYDELRQGQVNLTSAPDNRGATVENVYFKTFYWLSGG 208

Query: 229 --YYTIHKILAGLLDQYVLADNAQALKM-------ATWMVEYFYNRVQKVITMYSVERHW 279
             +YT HKI AG+ D Y+   N +A K+       A W+ E         +T ++  R  
Sbjct: 209 LSWYTNHKIYAGIRDAYLYTGNPKAKKVFLSFCDWACWVTE--------KLTDHAFARML 260

Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-----PCFLGFLALQADYLSHFHAN 334
           YS   E G MN++L   Y+ + + K+L  A  F++     PC  G +   A+ +SH HAN
Sbjct: 261 YS---EHGAMNEMLTDAYAFSGERKYLDCAFRFNEQETMVPCIDGDIKKIAETISHTHAN 317

Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
             IP   G    +E TGD L+K+    F   V    S+ TGG S  E +  P  +   + 
Sbjct: 318 AQIPQFYGLIKEFEYTGDSLFKVAAENFFKYVTNYQSFVTGGNSEWEQFRAPGNIMAQVT 377

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
             + ETC TYNMLK+++ LF  T +  Y +Y ERAL N +L     ++PG   Y L L  
Sbjct: 378 RRSGETCNTYNMLKIAKGLFELTGDTLYLNYMERALYNHILPSIHTSQPGAFTYFLSLEP 437

Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
           G  K  S       ++S WCC GTG+E+ +K G+ IYF  E  V   Y+  +++S+  W+
Sbjct: 438 GYFKTFS-----RPYDSHWCCVGTGMENHAKYGEFIYFHHEKEV---YVNLFVASALCWE 489

Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
                +    D     D   R+       Q  G++++L +R+P W    G +  +NG+ +
Sbjct: 490 KEGFQMETITDFPYESDVRFRIL------QNKGRIATLKIRIPRWAKEVGVK--VNGKMI 541

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
                  +L   + W   D + + LP+ LR E +    P  +   A  +GP LLAG    
Sbjct: 542 KYKNRDGYLKLEKLWKIGDLVELTLPMYLRKEYV----PNCSDKFAFFYGPVLLAGRLGN 597

Query: 635 E 635
           E
Sbjct: 598 E 598


>gi|365852804|ref|ZP_09393150.1| hypothetical protein HMPREF9103_01934 [Lactobacillus parafarraginis
           F0439]
 gi|363714017|gb|EHL97570.1| hypothetical protein HMPREF9103_01934 [Lactobacillus parafarraginis
           F0439]
          Length = 728

 Score =  203 bits (517), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 158/578 (27%), Positives = 262/578 (45%), Gaps = 62/578 (10%)

Query: 97  NFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWE 155
           N +K VS ++V    +S L    + N+ ++L L  D L++++RK A L T G      WE
Sbjct: 3   NIMKPVSYYNVEYLPNSTLKEKFERNINWMLSLTPDQLLYNYRKNAGLDTKGATPLTVWE 62

Query: 156 NPISELRGHFVGHYLSASAQMWASTHN--------ATIKEKMSTVVFSLSECQNKIGT-- 205
           +P    RGHF GHYLS +++ +    N          +K ++  +V  L E Q+K+    
Sbjct: 63  SPDFFFRGHFTGHYLSGASKTFVELTNTDEKDPQAVELKNRVDLIVTGLKEVQDKLSETS 122

Query: 206 ---GYLSAFPTELFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
              GYL+A P + FD+ E L+     + PYY I K++ GL+D Y    N  AL++   + 
Sbjct: 123 EFPGYLAAEPEKRFDNLEKLRFNGNHYVPYYAIQKLMDGLMDAYQYTGNQTALQLVKNLT 182

Query: 260 EYFYNRVQKVI---TMYSVERHWYS------LNEETGGMNDVLYRLYSITHDPKHLL--L 308
            Y   R+ K+        ++  WY        ++E G M+  L RLY +T   +  +  L
Sbjct: 183 SYVEKRMAKLTPERISAMLDTRWYQGSGQYIFHQEFGAMHRTLLRLYELTGKKEQDVFDL 242

Query: 309 AHLFDKPCFLGFLALQADYLSHF--HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIV 366
           A  FD+  F   L    D L ++  H+NT +    G    Y VTGD  YK     +MD +
Sbjct: 243 AEKFDRKWFRDMLINNEDKLGYYSMHSNTELVCAEGMLEYYHVTGDDQYKKGVENYMDWM 302

Query: 367 NASHSYATGGTSAR-----------EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
           +  H   T G S R           E +  P+     L   N E+C ++++  +S  LF 
Sbjct: 303 HTGHELPTKGISGRSAYPAPADYGSELYDYPEMFFKHLSKLNGESCCSHDLNYLSSELFA 362

Query: 416 WTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCC 475
            TK+    + YE    N +++ Q+  +  +  Y+  L    +  +     G     FWCC
Sbjct: 363 DTKDPVLMNDYEIRFINAIMA-QQNNDSAIAEYLYNLSVAPNSVKHYDRGG-----FWCC 416

Query: 476 YGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR 535
            G+G E  S L D IY+++  ++   Y+ QY  S  + K   V + Q  D       +  
Sbjct: 417 VGSGTERHSTLVDGIYYQDNDDI---YVAQYFDSILNLKDQGVKVTQ--DAHYPDQHFAH 471

Query: 536 MTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKL 595
           +T+     ++     ++ +R+P W  S     +++G+ + + P   F++    WS   ++
Sbjct: 472 ITVETEQPKDF----TIYVRVPKW--SAETTITVDGKAVKVQPENGFVAIKRNWSKKSEI 525

Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           TI     LR + + D    +  I AI +GP LLA   +
Sbjct: 526 TINFDFQLRYQVLAD---RFNRI-AIYYGPILLAAQKA 559


>gi|302818287|ref|XP_002990817.1| hypothetical protein SELMODRAFT_429245 [Selaginella moellendorffii]
 gi|300141378|gb|EFJ08090.1| hypothetical protein SELMODRAFT_429245 [Selaginella moellendorffii]
          Length = 226

 Score =  201 bits (511), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 95/133 (71%), Positives = 108/133 (81%)

Query: 168 HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWA 227
           HYLSASA  WASTHN TI E M+ VV +L+ECQ KIGTGYLSAFPT LFD FEAL+ VWA
Sbjct: 25  HYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSAFPTSLFDRFEALESVWA 84

Query: 228 PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
           PYYTIHKI+AGLLDQY  A N+ A +M   M +YF +RV++VI  YS+ERHW SLNEETG
Sbjct: 85  PYYTIHKIMAGLLDQYTYAANSFAFEMLLGMTDYFGSRVERVIEKYSIERHWQSLNEETG 144

Query: 288 GMNDVLYRLYSIT 300
           GMNDVLYR+Y IT
Sbjct: 145 GMNDVLYRVYQIT 157


>gi|159491178|ref|XP_001703550.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280474|gb|EDP06232.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 226

 Score =  200 bits (509), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 108/197 (54%), Positives = 136/197 (69%), Gaps = 4/197 (2%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLL-MLDVDSLVWSFRKTASLPTPGKAY-GGWEN 156
           ++ + L DV L  +++  R ++ N +YLL ML+ D L+WSFRKT+ LPTPG  Y   WE+
Sbjct: 28  IEPLPLSDVRLLDTALQARYEKLNAKYLLDMLEPDRLLWSFRKTSGLPTPGTPYIASWED 87

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
           P  ELRGHFVGHYLSA +   A T N+  K ++  +V  L + Q K+GTGYLSAFPTE F
Sbjct: 88  PGCELRGHFVGHYLSALSLALAGTGNSAFKTRLDLMVSELGKVQEKLGTGYLSAFPTEFF 147

Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
           D  EALKPVWAPYYTIHKI+AGL+D + LA +  AL MAT MV+Y +NR Q VI     E
Sbjct: 148 DRVEALKPVWAPYYTIHKIIAGLVDAHELAGHPSALAMATRMVDYHWNRTQAVIAAKGRE 207

Query: 277 RHWYS-LNEETGGMNDV 292
            HW + LN E GGMN+V
Sbjct: 208 -HWNAVLNCEFGGMNEV 223


>gi|256378728|ref|YP_003102388.1| hypothetical protein Amir_4712 [Actinosynnema mirum DSM 43827]
 gi|255923031|gb|ACU38542.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
          Length = 881

 Score =  199 bits (506), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 188/645 (29%), Positives = 285/645 (44%), Gaps = 92/645 (14%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWEN- 156
           L+   L DV L    V  RA    L    +  VD ++  FR  A L T G    G WE+ 
Sbjct: 9   LEPFPLRDVEL-LDGVQSRAAGQMLHLARVFPVDRVLAVFRANAGLDTRGALPPGNWEDF 67

Query: 157 --------------------PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSL 196
                                 S LRGH+ GH+LS  A   AST   +++ K   +V  L
Sbjct: 68  GHPDERPWSAEEYPGAGVAPTASLLRGHYAGHFLSMVALAHASTGEESLRAKAWEIVAGL 127

Query: 197 SECQNKIGT-------GYLSAFPTELFDSFEALKP---VWAPYYTIHKILAGLLDQYVLA 246
           +E ++ +         G+L+A+    F   E L P   +WAPYYT HKI+AGLLD +   
Sbjct: 128 AEVRDALAATGRYSHPGFLAAYGEWQFSRLEDLAPYGEIWAPYYTCHKIMAGLLDAHEHT 187

Query: 247 DNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKH 305
            + QAL++A  M  +   RV ++   + ++R W   +  E GGMN+ L  L+ IT +   
Sbjct: 188 GSEQALELAVGMGHWVAGRVLRLERAH-LQRMWSLYIAGEFGGMNESLAALHRITGEEVF 246

Query: 306 LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDI 365
           L  A  F+    L   A   D L   HAN H+P+++G   +Y+ TG+  Y    T   D 
Sbjct: 247 LRAAAAFELDHLLEGAAQGRDLLDGMHANQHLPMLVGHLDQYDATGETRYLDAVTALWDQ 306

Query: 366 VNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
           V    ++A GGT   E W     +A  +G  N E+C TYN+LK++R LF  T +  Y +Y
Sbjct: 307 VVPGRTFAHGGTGEGELWGPADTVAGFIGRRNAESCATYNLLKIARSLFARTGDARYPEY 366

Query: 426 YERALTNGVLSIQRGTEPGV---MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
            ERA  N ++  +   +  V   ++YM P+  G  +           N   CC GTG+E+
Sbjct: 367 AERAWLNHMVGSRADLDSDVSPEVVYMYPVDAGAVREYD--------NVGTCCGGTGLET 418

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
             K  D ++F   G    L + +++ S      G  V  +   P        R+ + F +
Sbjct: 419 HVKHQDWVWFHAPGK---LVVARHVPSRVTLPGGGSVALRTGYPRDG-----RVVVEFDA 470

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
                    L+LR+P W     A   ++G+ +PL   G F   +  +   D++ + LPL 
Sbjct: 471 DFS----GELHLRVPSWAT---AGYLVDGERVPL-TDGGFAVLSRDFRRGDEVELVLPLP 522

Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPS----FN 658
           LR  +  DD P   S++    GP +L              AR  +A + P+ P+     +
Sbjct: 523 LRLVSTVDD-PTLVSVE---LGPTVL-------------LARDDAATVLPVSPAAFRGLD 565

Query: 659 AQLVTFTQESGNSTFVMSNSNQSITMEEFPV-SGTDAALHATFRL 702
             LV + ++    +F        +T E  P  SG DA  HA  RL
Sbjct: 566 GSLVGYERDGDLVSF------GGLTFE--PAWSGGDARYHAYLRL 602


>gi|357472913|ref|XP_003606741.1| hypothetical protein MTR_4g065110 [Medicago truncatula]
 gi|355507796|gb|AES88938.1| hypothetical protein MTR_4g065110 [Medicago truncatula]
          Length = 203

 Score =  199 bits (505), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 100/165 (60%), Positives = 120/165 (72%), Gaps = 3/165 (1%)

Query: 3   FGFVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAW 61
           F F+       G    K+CTN  P  SH FRYEL  S N+TWK+EV+SH+H+TPTD+SAW
Sbjct: 6   FMFMFMALMLRGCVTIKECTN-IPTQSHTFRYELFASKNETWKKEVMSHYHVTPTDESAW 64

Query: 62  SSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQT 121
           ++L+P KIL ++ ++  WAL+YRKIKN G F  P  FLKEV L DV L + S+   AQQT
Sbjct: 65  ATLLPRKILSEE-NQHDWALMYRKIKNLGVFKPPVGFLKEVPLGDVRLLEGSIHAVAQQT 123

Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
           NLEYLLMLDVD L+WSFRKTA LPTPG  YGGWE P +ELRGHFV
Sbjct: 124 NLEYLLMLDVDRLIWSFRKTAGLPTPGNPYGGWEEPNTELRGHFV 168


>gi|29348320|ref|NP_811823.1| hypothetical protein BT_2911 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383124515|ref|ZP_09945178.1| hypothetical protein BSIG_1739 [Bacteroides sp. 1_1_6]
 gi|29340224|gb|AAO78017.1| putative Acetyl-CoA carboxylase, biotin carboxylase [Bacteroides
           thetaiotaomicron VPI-5482]
 gi|251841333|gb|EES69414.1| hypothetical protein BSIG_1739 [Bacteroides sp. 1_1_6]
          Length = 655

 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 166/558 (29%), Positives = 257/558 (46%), Gaps = 49/558 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP- 157
           L+EV L D      S     Q+   EYLL L+ DSL+  +R  A LP+    Y GWE+  
Sbjct: 48  LREVRLLD------SPFLDLQRKGKEYLLWLNPDSLLHFYRIEAGLPSKAAPYAGWESQD 101

Query: 158 ---ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP-- 212
                 LRG F+G YLS+ + M+ ST +  + +++  V+  L  CQ     G+L      
Sbjct: 102 VWGAGPLRGGFLGFYLSSVSMMYQSTDDKRLLKRLKYVLKELELCQKAGKDGFLLGLKDG 161

Query: 213 TELFDSFEALK---------PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFY 263
            +LF    + K           WAP Y I+K+L GL   Y      +AL +   + ++F 
Sbjct: 162 RKLFAEVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCQMEEALPILIRLADWFG 221

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
            +V   +T   ++R    L  E G +N+     Y +T + + L  A   +     G L+ 
Sbjct: 222 YQVLDKLTDDQIQR---LLICEHGSINESYVEAYELTGEKRFLDWARRLNDHAMWGPLSE 278

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
             D L  +HANT IP   G    Y+ TGD  +    T F +IV  +H++  GG S  E +
Sbjct: 279 GKDILFGWHANTQIPKFTGFHKYYQFTGDERFLTAATNFWNIVTQNHTWVIGGNSTGEHF 338

Query: 384 WDPKRLAD-TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
           +  +  AD  L     ETC + NML+++  LF    + A A YYER L N +LS     E
Sbjct: 339 FPKEEFADRVLLVGGPETCNSVNMLRLTESLFCQYPDAAKASYYERVLFNHILS-AYDPE 397

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV---P 499
            G+  Y   +  G  +      + ++ +SFWCC  TG+ES +KL   IY   +  +   P
Sbjct: 398 KGMCCYFTSMRPGHYRI-----YASRDSSFWCCGHTGLESPAKLSKFIYSHSKRIIDGDP 452

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            + +  +I S   WK   + L Q+    +     +   L    KQE+     L +R P W
Sbjct: 453 DIRVNLFIPSILFWKEKGIELIQQNR--LPESEQVSFMLNLKKKQEL----ILRIRKPDW 506

Query: 560 TYSNGAQASLNGQ-NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ-DDRPEYAS 617
             ++     +NG+   P+     +      W+  +K+ +QLP+ +  E++   DR  YA 
Sbjct: 507 --ADKVTFIINGKVEYPILDKDGYWVVNRTWARKNKIILQLPMHVYVESLMGSDR--YA- 561

Query: 618 IQAILFGPYLLAGHTSGE 635
             A+L+GPY+LAG    E
Sbjct: 562 --ALLYGPYVLAGRMGTE 577


>gi|408500683|ref|YP_006864602.1| acetyl-CoA carboxylase [Bifidobacterium asteroides PRL2011]
 gi|408465507|gb|AFU71036.1| acetyl-CoA carboxylase [Bifidobacterium asteroides PRL2011]
          Length = 807

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 151/492 (30%), Positives = 225/492 (45%), Gaps = 33/492 (6%)

Query: 107 VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
           V L   S+   AQQ   +YLL LD D L+  +R+ A L      Y  WE+    L GH  
Sbjct: 26  VRLTPGSIYADAQQAGADYLLSLDPDRLLAPYRREAGLTATADPYPNWES--MGLDGHIG 83

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF-------- 216
           GHYLS  A  W S       E+ + ++  L ECQ   G G+L   P   ELF        
Sbjct: 84  GHYLSGLAAYWQSLQTWPFLERATRMLTGLLECQEASGDGFLGGMPHSAELFRNLREGHV 143

Query: 217 --DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYS 274
              SF+ L   W P Y +HK+ AGLLD +       A +MA  MV    +    +     
Sbjct: 144 QAQSFDLLG-SWVPLYNLHKLFAGLLDCWQSFQTKGASEMARVMVLRLADWWCDLADNID 202

Query: 275 VERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAH-LFDKPCFLGFLALQADYLSHFHA 333
            +     L  E GG+N+   RLY +T   ++L  A  L D+P F   LA+  D L+  HA
Sbjct: 203 EQDFQTMLTCEYGGLNEAFARLYQLTGKDRYLRQARRLTDRP-FFEPLAVGKDQLTGLHA 261

Query: 334 NTHIPIVIGSQMRYEVTGDPLYKL-IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
           NT IP V+G +   E+TGD  ++  + TF+  +V+   + + G  S  E +  P   +  
Sbjct: 262 NTQIPKVLGYERLAEITGDQAFRTAVDTFWHGVVD-KRTVSIGAHSISEHFNPPDDFSAM 320

Query: 393 LGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
           + S E  ETC +YNM K++  L+  T +  Y D+YER L N ++S     E G  +Y  P
Sbjct: 321 VTSREGLETCNSYNMAKLALRLYDRTGQARYLDFYERVLVNHLVSTVGIREHG-FVYFTP 379

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG-----LYIIQY 506
           +     + R    + +   SFWCC GTG+E+ ++ G  I+    G  PG     L +  +
Sbjct: 380 M-----RPRHYRVYSSAQRSFWCCVGTGLENHARYGAMIFERRPGKDPGQESESLAVNLF 434

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
           I +S DW    + ++    P        R+ L    + +  Q   L++R P W      +
Sbjct: 435 IPASLDWSQRGLRVSLAYAPGPGTTNLGRIDLEADDQSQ--QTLDLDIRHPWWVEDADYR 492

Query: 567 ASLNGQNLPLPP 578
            +    N+ + P
Sbjct: 493 IAQGQANMTVEP 504


>gi|261415299|ref|YP_003248982.1| hypothetical protein Fisuc_0892 [Fibrobacter succinogenes subsp.
           succinogenes S85]
 gi|385790233|ref|YP_005821356.1| hypothetical protein FSU_1340 [Fibrobacter succinogenes subsp.
           succinogenes S85]
 gi|261371755|gb|ACX74500.1| protein of unknown function DUF1680 [Fibrobacter succinogenes
           subsp. succinogenes S85]
 gi|302327243|gb|ADL26444.1| conserved hypothetical protein [Fibrobacter succinogenes subsp.
           succinogenes S85]
          Length = 897

 Score =  196 bits (499), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 173/624 (27%), Positives = 277/624 (44%), Gaps = 60/624 (9%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
           +L DV L    VL   Q  N+E LL  DVD L+  F + A +      +  W    + L 
Sbjct: 36  ALSDVQL-LDGVLKERQDLNVETLLSYDVDRLLAPFYEEAGMKPKASKFPNW----AGLD 90

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPT--EL 215
           GH +GHYLSA A  +A   +  +KE++  ++  L   Q++        GY+S  P   ++
Sbjct: 91  GHVLGHYLSALAMHYADNDDVQVKERLEYILKELKTIQDQNSKDNNFKGYISGVPNGKQM 150

Query: 216 FDSFE-----ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
           +   +     A    W P+Y IHK+ AGL D YV A   QA  M   + ++       + 
Sbjct: 151 WLKMKNGDAGAQNGYWVPWYNIHKLYAGLRDAYVYAGYEQAKTMFLALCDWGIT----IT 206

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
              +  +    L  E GGM +V    Y +T D K+L  A  +     L  ++   D L++
Sbjct: 207 NGLNDSKMQQMLGTEHGGMPEVYADAYKLTKDEKYLNAAKKWSHQWLLNPMSQGNDNLTN 266

Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW---WDPK 387
            HANT +P V+G     E++GD  YK    FF   V    S A GG S  E +    + K
Sbjct: 267 VHANTQVPKVVGFARIAELSGDEKYKKGSDFFWQTVVNKRSIAIGGNSISEHFPALNNHK 326

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
           +  +    E  E+C TYNMLK++  LF    +  Y D+YERAL N +LS    T  G  +
Sbjct: 327 KFIEE--REGPESCNTYNMLKLTERLFNIKHDAHYTDFYERALFNHILSTIHPTHGG-YV 383

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  P     ++ R    +       WCC G+G+E+ +K    IY +++     LY+  + 
Sbjct: 384 YFTP-----ARPRHYRVYSKVNAGMWCCVGSGMENPAKYNQFIYTKDKD---ALYVNLFA 435

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
           +S  +WK   V + Q+           + T+T S + +      + +R P W      + 
Sbjct: 436 ASILNWKDKSVKIKQET--AFPKGESSKFTITGSGEFD------MQIRHPYWVKEGAFKV 487

Query: 568 SLNGQN-LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
            +NG   +    P +++SA + W   D + +  P+    E    D P      A+L GP 
Sbjct: 488 IVNGDTVVKKSTPSSYVSAGKSWKSGDVVEVLYPMYTHVE----DLPGVTDYVALLHGPI 543

Query: 627 LLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEE 686
           +L+         KTGTA +L+ L++       + + +   ES +   ++++  + I  + 
Sbjct: 544 VLSA--------KTGTA-NLNGLVA--DDGRWSHIASGALESLDQAPMLASKKEDIPSKV 592

Query: 687 FPVSGTDAALHATFRLI-LKDASL 709
            PV G      A +     KDA+L
Sbjct: 593 EPVKGEPLHFKAPYLFAKQKDANL 616


>gi|265753026|ref|ZP_06088595.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|263236212|gb|EEZ21707.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
          Length = 808

 Score =  196 bits (497), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 162/574 (28%), Positives = 255/574 (44%), Gaps = 54/574 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE--- 155
           LKEV L D      S        N  Y+L L+ D L+  FR+ A L    + Y  WE   
Sbjct: 39  LKEVRLLD------SDFKHIMDLNHAYMLSLEPDRLLSWFRREAGLTPKAQPYPFWESEY 92

Query: 156 -NPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL------ 208
            N    L GH +G YLS  + M+ ST +  I  ++S ++  LS CQ   G GYL      
Sbjct: 93  MNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQAGGDGYLLPTICG 152

Query: 209 -SAFPTELFDSFEALKP--------VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
            + F   L  +F+   P         W P Y ++KI+ GL   Y+  D  QA ++   M 
Sbjct: 153 RAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMRCDLLQAKEILVKMA 212

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           ++F   V   ++   +++    L  E G +N+    +Y IT + K+L  A   +      
Sbjct: 213 DWFGYSVIDKLSHDDLQK---LLVCEHGSINESFIDVYQITGEEKYLKWAQRLNDEDMWV 269

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            ++   D L  +HANT IP   G +  Y    +  +     FF D V   H++  GG S 
Sbjct: 270 PMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDTVVRKHTWVMGGNST 329

Query: 380 REFWWDPKRLADTLG-SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
            E ++ P+     +  +   E+C + NML+++  L+    E+   DYYE+ L N +L+  
Sbjct: 330 GEHFFAPEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEKVDYYEKVLFNHILA-N 388

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
              + G+ +Y   +  G  K      +GTK++SFWCC GTG E  +K G  IY   +   
Sbjct: 389 YDPDQGMCVYYTSMKPGHYKI-----YGTKYDSFWCCTGTGFEQTAKFGQMIYAHTDD-- 441

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             LY+  +I S   W  G  +  +   P          +LT S +     + +L +R P 
Sbjct: 442 -ALYVNMFIPSVVTWNKGVSIHQETAFPDEG-----VTSLTVSGE----AVFNLKIRCPY 491

Query: 559 WTYSNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
           W  S+     +NG+   +    + ++S   +W   DK+ I+LP+ L    +     E A 
Sbjct: 492 WVGSSSLNVIVNGKREKIKAGMDGYVSINRQWKDGDKVRIELPMKLEIVPLN----EAAH 547

Query: 618 IQAILFGPYLLAGHTSGEWDIKTG--TARSLSAL 649
             A+ +GP +LA   S E   K    +ARS  A+
Sbjct: 548 YLALKYGPIVLAARISDEHLSKDDFRSARSTVAM 581


>gi|423219866|ref|ZP_17206362.1| hypothetical protein HMPREF1061_03135 [Bacteroides caccae
           CL03T12C61]
 gi|392625071|gb|EIY19149.1| hypothetical protein HMPREF1061_03135 [Bacteroides caccae
           CL03T12C61]
          Length = 655

 Score =  195 bits (496), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 169/559 (30%), Positives = 256/559 (45%), Gaps = 51/559 (9%)

Query: 99  LKEVSLHD-VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
           LKE+ L D  +LD        QQ   EYLL L+ DSL+  +R  A L +    Y GWE+ 
Sbjct: 48  LKEIRLSDGPFLD-------LQQKGKEYLLWLNPDSLLHFYRIEAGLSSKAGPYAGWESQ 100

Query: 158 ----ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP- 212
                  LRG F+G YLS+ + M+ ST +  +  ++  V+  L  CQ     G+L     
Sbjct: 101 DVWGAGPLRGGFLGFYLSSVSMMYQSTGDRELLRRLKYVLKELKLCQEAGKDGFLLGVKG 160

Query: 213 -TELFDSFEALK---------PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYF 262
             ELF    + K           WAP Y I+K+L GL   Y   D  +AL +   + ++F
Sbjct: 161 GRELFREVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCDLKEALPILVRLADWF 220

Query: 263 YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLA 322
            ++V   +T   +++    L  E G +N+    +Y +T   + L  A   +       L+
Sbjct: 221 GSQVLDKLTDEQIQQ---LLICEHGSINESYVEVYELTGQKRFLDWARRLNDRAMWVPLS 277

Query: 323 LQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREF 382
              D L  +HANT IP   G    Y  TGD  + L  T F +IV  +H++  GG S  E 
Sbjct: 278 EGKDVLFGWHANTQIPKFTGFHKYYMFTGDRAFLLAATNFWNIVKQNHTWVIGGNSTGEH 337

Query: 383 WWDPKRLAD-TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
           ++  K   D  L     ETC + NML+++  LF    +   A YYER L N +LS     
Sbjct: 338 FFSKKEFIDRMLHISGPETCNSVNMLRLTEALFMQQPDATKAAYYERTLFNHILSAYDPV 397

Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV--- 498
           + G+  Y   +  G  +      + ++ +SFWCC  TG+ES +KLG  IY  +  N    
Sbjct: 398 K-GMCCYFTSMRPGHYRI-----YASRDSSFWCCGHTGLESPAKLGKFIYSHKVTNRHQE 451

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             + +  +I S   WK   V L Q+    +     + +TL    KQ++     L +R P 
Sbjct: 452 KDIRVNLFIPSILSWKEEGVELIQQSR--IPESEQVDLTLNLKKKQKL----ILRIRKPD 505

Query: 559 WTYSNGAQASLNG-QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ-DDRPEYA 616
           WT  + A   +NG +  PL     +      W   + +T++LP+ + TE +   DR    
Sbjct: 506 WT--DKATFIINGEEEQPLLGSDGYWIIDRVWERKNVITLRLPMHIYTENLTGTDR---- 559

Query: 617 SIQAILFGPYLLAGHTSGE 635
              A+L+GPY+LAG    E
Sbjct: 560 -YVALLYGPYVLAGRMGKE 577


>gi|237711616|ref|ZP_04542097.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|229454311|gb|EEO60032.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
          Length = 780

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 161/574 (28%), Positives = 254/574 (44%), Gaps = 54/574 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE--- 155
           LKEV L D      S        N  Y+L L+ D L+  FR+ A L    + Y  WE   
Sbjct: 11  LKEVRLLD------SDFKHIMDLNHAYMLSLEPDRLLSWFRREAGLTPKAQPYPFWESEY 64

Query: 156 -NPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL------ 208
            N    L GH +G YLS  + M+ ST +  I  ++S ++  LS CQ   G GYL      
Sbjct: 65  MNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQAGGDGYLLPTICG 124

Query: 209 -SAFPTELFDSFEALKP--------VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
            + F   L  +F+   P         W P Y ++KI+ GL   Y+  D  QA ++   M 
Sbjct: 125 RAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMRCDLLQAKEILVKMA 184

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           ++F   V   ++   +++    L  E G +N+    +Y IT + K+L  A   +      
Sbjct: 185 DWFGYSVIDKLSHDDLQK---LLVCEHGSINESFIDVYQITGEEKYLKWAQRLNDEDMWV 241

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            ++   D L  +HANT IP   G +  Y    +  +     FF D V   H++  GG S 
Sbjct: 242 PMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDTVVRKHTWVMGGNST 301

Query: 380 REFWWDPKRLADTLG-SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
            E ++ P+     +  +   E+C + NML+++  L+    E+   DYYE+ L N +L+  
Sbjct: 302 GEHFFAPEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEKVDYYEKVLFNHILA-N 360

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
              + G+ +Y   +  G  K      +GTK++SFWCC GTG E  +K G  IY   +   
Sbjct: 361 YDPDQGMCVYYTSMKPGHYKI-----YGTKYDSFWCCTGTGFEQTAKFGQMIYAHTDD-- 413

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             LY+  +I S   W  G  +  +   P          +LT S +     + +L +R P 
Sbjct: 414 -ALYVNMFIPSVVTWDKGISIHQETAFPDEG-----VTSLTVSGE----AVFNLKIRCPY 463

Query: 559 WTYSNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
           W  S+     +NG+   +    + ++S   +W   DK+ I+LP+ L    +     E   
Sbjct: 464 WVGSSSLNVIVNGKREKIKAGVDGYVSINRQWKDGDKVRIELPMKLEIVPLN----EATH 519

Query: 618 IQAILFGPYLLAGHTSGEWDIKTG--TARSLSAL 649
             A+ +GP +LA   S E   K    +ARS  A+
Sbjct: 520 YLALKYGPIVLAARISDEHLSKDDFRSARSTVAM 553


>gi|153805786|ref|ZP_01958454.1| hypothetical protein BACCAC_00022 [Bacteroides caccae ATCC 43185]
 gi|149130463|gb|EDM21669.1| hypothetical protein BACCAC_00022 [Bacteroides caccae ATCC 43185]
          Length = 659

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 169/559 (30%), Positives = 255/559 (45%), Gaps = 51/559 (9%)

Query: 99  LKEVSLHD-VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
           LKE+ L D  +LD        QQ   EYLL L+ DSL+  +R  A L +    Y GWE+ 
Sbjct: 52  LKEIRLSDGPFLD-------LQQKGKEYLLWLNPDSLLHFYRIEAGLSSKAGPYAGWESQ 104

Query: 158 ----ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP- 212
                  LRG F+G YLS+ + M+ ST +  +  ++  V+  L  CQ     G+L     
Sbjct: 105 DVWGAGPLRGGFLGFYLSSVSMMYQSTGDRELLRRLKYVLKELKLCQEAGKDGFLLGVKG 164

Query: 213 -TELFDSFEALK---------PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYF 262
             ELF    + K           WAP Y I+K+L GL   Y   D  +AL +   + ++F
Sbjct: 165 GRELFREVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCDLKEALPILVRLADWF 224

Query: 263 YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLA 322
            ++V   +T   +++    L  E G +N+    +Y +T   + L  A   +       L+
Sbjct: 225 GSQVLDKLTDEQIQQ---LLICEHGSINESYVEVYELTGQKRFLDWARRLNDRAMWVPLS 281

Query: 323 LQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREF 382
              D L   HANT IP   G    Y  TGD  + L  T F +IV  +H++  GG S  E 
Sbjct: 282 EGKDVLFGGHANTQIPKFTGFHKYYMFTGDRAFLLAATNFWNIVKQNHTWVIGGNSTGEH 341

Query: 383 WWDPKRLAD-TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
           ++  K   D  L     ETC + NML+++  LF    +   A YYER L N +LS     
Sbjct: 342 FFSKKEFIDRMLHISGPETCNSVNMLRLTEALFMQQPDATKAAYYERTLFNHILSAYDPV 401

Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV--- 498
           + G+  Y   +  G  +      + ++ +SFWCC  TG+ES +KLG  IY  +  N    
Sbjct: 402 K-GMCCYFTSMRPGHYRI-----YASRDSSFWCCGHTGLESPAKLGKFIYSHKVTNRHQE 455

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             + +  +I S   WK   V L Q+    +     + +TL    KQ++     L +R P 
Sbjct: 456 KDIRVNLFIPSILSWKEEGVELIQQSR--IPESEQVDLTLNLKKKQKL----ILRIRKPD 509

Query: 559 WTYSNGAQASLNG-QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD-DRPEYA 616
           WT  + A   +NG +  PL     +      W   + +T++LP+ + TE +   DR    
Sbjct: 510 WT--DKATFIINGEEEQPLLGSDGYWIIDRVWERKNVITLRLPMHIYTENLTGTDR---- 563

Query: 617 SIQAILFGPYLLAGHTSGE 635
              A+L+GPY+LAG    E
Sbjct: 564 -YVALLYGPYVLAGRMGKE 581


>gi|212695364|ref|ZP_03303492.1| hypothetical protein BACDOR_04911 [Bacteroides dorei DSM 17855]
 gi|345513936|ref|ZP_08793451.1| hypothetical protein BSEG_01968 [Bacteroides dorei 5_1_36/D4]
 gi|423230909|ref|ZP_17217313.1| hypothetical protein HMPREF1063_03133 [Bacteroides dorei
           CL02T00C15]
 gi|423241462|ref|ZP_17222575.1| hypothetical protein HMPREF1065_03198 [Bacteroides dorei
           CL03T12C01]
 gi|423244620|ref|ZP_17225695.1| hypothetical protein HMPREF1064_01901 [Bacteroides dorei
           CL02T12C06]
 gi|212662093|gb|EEB22667.1| hypothetical protein BACDOR_04911 [Bacteroides dorei DSM 17855]
 gi|229435750|gb|EEO45827.1| hypothetical protein BSEG_01968 [Bacteroides dorei 5_1_36/D4]
 gi|392630029|gb|EIY24031.1| hypothetical protein HMPREF1063_03133 [Bacteroides dorei
           CL02T00C15]
 gi|392641355|gb|EIY35132.1| hypothetical protein HMPREF1065_03198 [Bacteroides dorei
           CL03T12C01]
 gi|392641469|gb|EIY35245.1| hypothetical protein HMPREF1064_01901 [Bacteroides dorei
           CL02T12C06]
          Length = 808

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 161/574 (28%), Positives = 254/574 (44%), Gaps = 54/574 (9%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE--- 155
           LKEV L D      S        N  Y+L L+ D L+  FR+ A L    + Y  WE   
Sbjct: 39  LKEVRLLD------SDFKHIMDLNHAYMLSLEPDRLLSWFRREAGLTPKAQPYPFWESEY 92

Query: 156 -NPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL------ 208
            N    L GH +G YLS  + M+ ST +  I  ++S ++  LS CQ   G GYL      
Sbjct: 93  MNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQAGGDGYLLPTICG 152

Query: 209 -SAFPTELFDSFEALKP--------VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
            + F   L  +F+   P         W P Y ++KI+ GL   Y+  D  QA ++   M 
Sbjct: 153 RAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMRCDLLQAKEILVKMA 212

Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
           ++F   V   ++   +++    L  E G +N+    +Y IT + K+L  A   +      
Sbjct: 213 DWFGYSVIDKLSHDDLQK---LLVCEHGSINESFIDVYQITGEEKYLKWAQRLNDEDMWV 269

Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
            ++   D L  +HANT IP   G +  Y    +  +     FF D V   H++  GG S 
Sbjct: 270 PMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDTVVRKHTWVMGGNST 329

Query: 380 REFWWDPKRLADTLG-SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
            E ++ P+     +  +   E+C + NML+++  L+    E+   DYYE+ L N +L+  
Sbjct: 330 GEHFFAPEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEKVDYYEKVLFNHILA-N 388

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
              + G+ +Y   +  G  K      +GTK++SFWCC GTG E  +K G  IY   +   
Sbjct: 389 YDPDQGMCVYYTSMKPGHYKI-----YGTKYDSFWCCTGTGFEQTAKFGQMIYAHTDD-- 441

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
             LY+  +I S   W  G  +  +   P          +LT S +     + +L +R P 
Sbjct: 442 -ALYVNMFIPSVVTWDKGISIHQETAFPDEG-----VTSLTVSGE----AVFNLKIRCPY 491

Query: 559 WTYSNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
           W  S+     +NG+   +    + ++S   +W   DK+ I+LP+ L    +     E   
Sbjct: 492 WVGSSSLNVIVNGKREKIKAGVDGYVSINRQWKDGDKVRIELPMKLEIVPLN----EATH 547

Query: 618 IQAILFGPYLLAGHTSGEWDIKTG--TARSLSAL 649
             A+ +GP +LA   S E   K    +ARS  A+
Sbjct: 548 YLALKYGPIVLAARISDEHLSKDDFRSARSTVAM 581


>gi|444305788|ref|ZP_21141565.1| hypothetical protein G205_09453 [Arthrobacter sp. SJCon]
 gi|443481842|gb|ELT44760.1| hypothetical protein G205_09453 [Arthrobacter sp. SJCon]
          Length = 444

 Score =  191 bits (485), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 137/410 (33%), Positives = 194/410 (47%), Gaps = 29/410 (7%)

Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
           +AQ T++ Y+L LD D L   +   A L    +AYG WE+    L GH  GHYLS  A++
Sbjct: 23  QAQDTSVRYILSLDADRLFAPYLHEAGLVRAAEAYGNWES--DGLGGHIGGHYLSGCARL 80

Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEALKPV 225
           +A+T NA +  K+   V  L  CQ   G GY+   P            E+      L   
Sbjct: 81  YAATGNAELLAKVRAAVVILGNCQAAHGDGYVGGVPRGGDLGQELARGEVDADLFTLNGR 140

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
           W P Y +HK LAGLLD  V A + +AL +A  +  ++  RV   +   + E     L+ E
Sbjct: 141 WVPLYNLHKTLAGLLDARVFAGSGEALDIAVGLAGWWL-RVSAHLADDAFEE---VLHAE 196

Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
            GGMN+    L+ +T   ++L  A  F     L  LA   D L   HANT IP V+G   
Sbjct: 197 FGGMNEAFALLWELTGREEYLREARRFSHRALLDPLAAGQDLLDGLHANTQIPKVVGYAR 256

Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
               T D         F + V +  S + GG S RE +      +  +   +  ETC TY
Sbjct: 257 LAGPTHDADLAHACDIFWESVVSRRSVSIGGNSVREHFHPASDFSPMVQDPQGPETCNTY 316

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARSTH 463
           NMLK+++  F    + A  D++ERA  N +LS Q  GT  G ++Y  P+  G  +  S  
Sbjct: 317 NMLKLAKLRFEAHGDAAAVDFFERATYNHILSSQHPGT--GGLVYFTPMRPGHYRVYS-- 372

Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
                  S WCC G+G+E+ ++ G+ IY    GN   L +  YI S+ DW
Sbjct: 373 ---RAQESMWCCVGSGLENHARYGELIY-SRAGN--DLLVNLYIPSTLDW 416


>gi|393782707|ref|ZP_10370890.1| hypothetical protein HMPREF1071_01758 [Bacteroides salyersiae
           CL02T12C01]
 gi|392672934|gb|EIY66400.1| hypothetical protein HMPREF1071_01758 [Bacteroides salyersiae
           CL02T12C01]
          Length = 1293

 Score =  189 bits (480), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 153/569 (26%), Positives = 253/569 (44%), Gaps = 61/569 (10%)

Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
           L +A   N+ YL   DV+ L+    K        K YGG  +           HYLSA +
Sbjct: 457 LKQAMDKNITYLKSFDVNRLLAQTFKYNLGIDDYKLYGGANDAT-------FAHYLSAIS 509

Query: 175 QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA--FPTELFDSFEALKPV------- 225
             +A+T +  + ++++ +V  + + Q+ +G G  S    PT  F      K +       
Sbjct: 510 MGYAATGDEDLLQRVNHMVDVMIQAQDVMGDGLYSNNDAPTWGFYKMAKEKVITPYGWDE 569

Query: 226 ----WA------PYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVIT 271
               W       P+Y  HK  A   D Y+ A N  A    +K   W+V +  N       
Sbjct: 570 NGHPWGNNNIGFPFYAHHKAFAAFRDAYIYAGNENARVAFVKFCEWLVMWMQN------- 622

Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
            ++ +     L  E GGM +VL   Y+++   K L  A  F +  F   ++   D LS  
Sbjct: 623 -FTDDNLQKMLESEHGGMVEVLSDAYALSGKIKFLDAARRFTRDNFAAAMSGNRDDLSGR 681

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           H+N H+P+ +G+ + Y  +GD         F  IV+  H+   GG    E +  P  L  
Sbjct: 682 HSNFHVPMAVGAAIHYLYSGDERSGKTAHNFFHIVHDHHTLCNGGNGNNERFGTPDLLTY 741

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
            LG    ETC++YNMLK+++ LF    +  Y DYYE  + N +L+I        + Y + 
Sbjct: 742 RLGQRGPETCSSYNMLKLAKDLFCQEGDTEYLDYYENTMWNHILAILSPRSDAGVCYHVN 801

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           L  G  K  S       +++ WCC GTG+ES +K  D+IYF  +G++ G+ +  +  S+ 
Sbjct: 802 LKPGTFKMYS-----DLYSNLWCCVGTGMESHAKYVDAIYF--KGDI-GILVNLFTPSTL 853

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           +W+   + L  + D  V+ +  L +  + S  +++       +R P W    G   ++NG
Sbjct: 854 NWEETGLKLTMETDFPVTNNVKLIINESGSFNKDIC------IRYPSWVEEGGIAITING 907

Query: 572 QNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
               +   PG  +  +  W+  D++ I +P  LR   + DD     ++ AI +GP LLA 
Sbjct: 908 AKQKISAKPGEIIKLSSSWAAGDEILITIPCKLRLVDLPDD----INVSAIFYGPVLLAA 963

Query: 631 HTS--GEWDIKTGTARSLSALISPIPPSF 657
           +    G+ DI  G +     +  P P ++
Sbjct: 964 NMGEVGQSDI--GFSWPQEEIKDPAPDAY 990


>gi|265751351|ref|ZP_06087414.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|263238247|gb|EEZ23697.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
          Length = 791

 Score =  189 bits (480), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 145/548 (26%), Positives = 252/548 (45%), Gaps = 45/548 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L EV + D +          Q  + +YLL L+ D L+  FR+ A L    + Y  WE+  
Sbjct: 18  LSEVRITDKYFKH------IQDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPFWESED 71

Query: 159 ----SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA---- 210
                 L GH +G Y+S+ + M+ +T++  I ++++ +V  L  CQ   G GYL A    
Sbjct: 72  VWGGGPLAGHILGFYMSSMSMMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLLATVNG 131

Query: 211 ---FPTELFDSFEALKPV----WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFY 263
              F   +   F    P+    W P Y ++KI+ GL   Y       A ++   M ++F 
Sbjct: 132 KQVFEDMIDGDFTTSNPLINQTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGMADWFG 191

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
             V   +   ++++    L  E G +N+    +Y IT D K+L  A   +       L+ 
Sbjct: 192 YEVLDKLNHENIQK---MLVCEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMWVPLSK 248

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
             D L+ +HANT IP   G    Y  T +  Y    T F DIV   H++  GG S  E +
Sbjct: 249 GEDILNGWHANTQIPKFTGFNAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNSTGEHF 308

Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
           ++       +      E+C + NM++++  L++    +   DYYER L N +L+     E
Sbjct: 309 FEESMFEKKIPQYGGPESCNSVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA-NYDPE 367

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
            G+ +Y  P+  G  K      +GT+++SFWCC GTG E+ +K    IY  ++ +   LY
Sbjct: 368 EGMCVYYTPMRPGHYKI-----YGTRYHSFWCCTGTGFEAPAKFAKMIYAHKDNS---LY 419

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
           +  +I+S+ DW   ++++ Q  +     D  L +T+  SS Q++     L +R+P W  +
Sbjct: 420 VNMFIASTLDWNEKNIMITQSTN-FPDEDQTL-LTIKSSSTQQI----DLKIRIPFWIKN 473

Query: 563 NGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
                 +N + +  +     +++ +  WS  D++ +     L    +++         A+
Sbjct: 474 KSMVVRVNNKIVKGIKSEKGYVTISREWSDGDEIKVTFTPLLEIVPLKNSE----RYLAM 529

Query: 622 LFGPYLLA 629
            +GP +LA
Sbjct: 530 TYGPIVLA 537


>gi|212693864|ref|ZP_03301992.1| hypothetical protein BACDOR_03386 [Bacteroides dorei DSM 17855]
 gi|212663396|gb|EEB23970.1| hypothetical protein BACDOR_03386 [Bacteroides dorei DSM 17855]
          Length = 811

 Score =  189 bits (480), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 145/548 (26%), Positives = 252/548 (45%), Gaps = 45/548 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
           L EV + D +          Q  + +YLL L+ D L+  FR+ A L    + Y  WE+  
Sbjct: 38  LSEVRITDKYFKH------IQDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPFWESED 91

Query: 159 ----SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA---- 210
                 L GH +G Y+S+ + M+ +T++  I ++++ +V  L  CQ   G GYL A    
Sbjct: 92  VWGGGPLAGHILGFYMSSMSMMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLLATVNG 151

Query: 211 ---FPTELFDSFEALKPV----WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFY 263
              F   +   F    P+    W P Y ++KI+ GL   Y       A ++   M ++F 
Sbjct: 152 KQVFEDMIDGDFTTSNPLINQTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGMADWFG 211

Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
             V   +   ++++    L  E G +N+    +Y IT D K+L  A   +       L+ 
Sbjct: 212 YEVLDKLNHENIQK---MLVCEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMWVPLSK 268

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
             D L+ +HANT IP   G    Y  T +  Y    T F DIV   H++  GG S  E +
Sbjct: 269 GEDILNGWHANTQIPKFTGFNAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNSTGEHF 328

Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
           ++       +      E+C + NM++++  L++    +   DYYER L N +L+     E
Sbjct: 329 FEESMFEKKIPQYGGPESCNSVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA-NYDPE 387

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
            G+ +Y  P+  G  K      +GT+++SFWCC GTG E+ +K    IY  ++ +   LY
Sbjct: 388 EGMCVYYTPMRPGHYKI-----YGTRYHSFWCCTGTGFEAPAKFAKMIYAHKDNS---LY 439

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
           +  +I+S+ DW   ++++ Q  +     D  L +T+  SS Q++     L +R+P W  +
Sbjct: 440 VNMFIASTLDWNEKNIMITQSTN-FPDEDQTL-LTIKSSSTQQI----DLKIRIPFWIKN 493

Query: 563 NGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
                 +N + +  +     +++ +  WS  D++ +     L    +++         A+
Sbjct: 494 KSMVVRVNNKIVKGIKSEKGYVTISREWSDGDEIKVTFTPLLEIVPLKNSE----RYLAM 549

Query: 622 LFGPYLLA 629
            +GP +LA
Sbjct: 550 TYGPIVLA 557


>gi|423223251|ref|ZP_17209720.1| hypothetical protein HMPREF1062_01906 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392639352|gb|EIY33177.1| hypothetical protein HMPREF1062_01906 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 643

 Score =  189 bits (479), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 163/554 (29%), Positives = 252/554 (45%), Gaps = 50/554 (9%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP----I 158
           SL DV L +S  L   QQ   EYLL L+ DSL+  +R  A L    +AY GWE+      
Sbjct: 41  SLEDVRLLESPFL-DLQQKGKEYLLWLNPDSLLHFYRIEAGLQPKARAYAGWESQDVWGA 99

Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
             LRG F+G YLS+ + M+ +T +  + +++  V+  L  CQ     G+L       +LF
Sbjct: 100 GPLRGGFLGFYLSSVSMMYQATGDKELLKRLQYVLNELELCQKAGKDGFLLGIKDGRKLF 159

Query: 217 DSFEALK---------PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
               + K           WAP Y I+K+L GL   Y      +AL M   + ++F  +V 
Sbjct: 160 SEVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYAQCGQEKALPMMIRLADWFGYQVL 219

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
             +T   V+R    L  E G +N+    +Y +T + + L  A   +       L+   D 
Sbjct: 220 DKLTDEQVQR---LLVCEHGSINESFVEIYKLTGEIRFLEWAGRLNDRAMWVPLSEGKDI 276

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L  +HANT IP   G +  YE TGD         F DIVN +H++  GG S  E ++  K
Sbjct: 277 LFGWHANTQIPKFTGFEKYYEATGDKRLLNAAMNFWDIVNQNHTWVIGGNSTGEHFFPKK 336

Query: 388 RLAD-TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
              +  L     ETC + NML+++  LF +  +   A YYER L N +LS     + G+ 
Sbjct: 337 EFEERVLLKGGPETCNSVNMLRLTETLFSYQPDAKKAAYYERVLFNHILSAYDPVK-GMC 395

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
            Y   +  G  +      + ++ +SFWCC  TG+ES +KLG  IY  ++G   G+ +  +
Sbjct: 396 CYFTSMRPGHYRI-----YASRDSSFWCCGHTGLESPAKLGKFIYSRDKG---GIRVNLF 447

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS---- 562
           I S    K   + L Q      S     R+ L     Q+   L +L +R P W  +    
Sbjct: 448 IPSVLTSKELGMELAQYSHMPESDKVEFRLNL-----QDERTL-TLRIRRPDWAKNPILV 501

Query: 563 -NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
            NG + +++           +     +W   +++ ++LP+   TE +           A+
Sbjct: 502 INGKEEAIDTDT------SGYWVLDRKWKKKNRIILKLPMEPYTENLVGS----DKYVAL 551

Query: 622 LFGPYLLAGHTSGE 635
           L+GPY+LAG    E
Sbjct: 552 LYGPYVLAGRLGME 565


>gi|423228769|ref|ZP_17215175.1| hypothetical protein HMPREF1063_00995 [Bacteroides dorei
           CL02T00C15]
 gi|423247580|ref|ZP_17228629.1| hypothetical protein HMPREF1064_04835 [Bacteroides dorei
           CL02T12C06]
 gi|392631910|gb|EIY25877.1| hypothetical protein HMPREF1064_04835 [Bacteroides dorei
           CL02T12C06]
 gi|392635508|gb|EIY29407.1| hypothetical protein HMPREF1063_00995 [Bacteroides dorei
           CL02T00C15]
          Length = 811

 Score =  189 bits (479), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 141/528 (26%), Positives = 246/528 (46%), Gaps = 39/528 (7%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI----SELRGHFVGHYLSASA 174
           Q  + +YLL L+ D L+  FR+ A L    + Y  WE+        L GH +G Y+S+ +
Sbjct: 52  QDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPFWESEDVWGGGPLAGHILGFYMSSMS 111

Query: 175 QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA-------FPTELFDSFEALKPV-- 225
            M+ +T++  I ++++ +V  L  CQ   G GYL A       F   +   F    P+  
Sbjct: 112 MMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLLATVNGKQVFEDMIDGDFTTSNPLIN 171

Query: 226 --WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
             W P Y ++KI+ GL   Y       A ++   M ++F   V   +   ++++    L 
Sbjct: 172 QTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGMADWFGYEVLDKLNHENIQK---MLV 228

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGS 343
            E G +N+    +Y IT D K+L  A   +       L+   D L+ +HANT IP   G 
Sbjct: 229 CEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMWVPLSKGEDILNGWHANTQIPKFTGF 288

Query: 344 QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCT 402
              Y  T +  Y    T F DIV   H++  GG S  E +++       +      E+C 
Sbjct: 289 NAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNSTGEHFFEESMFEKKIPQYGGPESCN 348

Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARST 462
           + NM++++  L++    +   DYYER L N +L+     E G+ +Y  P+  G  K    
Sbjct: 349 SVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA-NYDPEEGMCVYYTPMRPGHYKI--- 404

Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
             +GT+++SFWCC GTG E+ +K    IY  ++ +   LY+  +I+S+ DW   ++++ Q
Sbjct: 405 --YGTRYHSFWCCTGTGFEAPAKFAKMIYAHKDNS---LYVNMFIASTLDWNEKNIMITQ 459

Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGN 581
             +     D  L +T+  SS Q++     L +R+P W  +      +N + +  +     
Sbjct: 460 STN-FPDEDQTL-LTIKSSSTQQI----DLKIRIPFWIKNKSMVVRVNNKIVKGIKSEKG 513

Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           +++ +  WS  D++ +     L    +++         A+ +GP +LA
Sbjct: 514 YVTISREWSDGDEIKVTFTPLLEIVPLKNSE----RYLAMTYGPIVLA 557


>gi|336404182|ref|ZP_08584880.1| hypothetical protein HMPREF0127_02193 [Bacteroides sp. 1_1_30]
 gi|335943510|gb|EGN05349.1| hypothetical protein HMPREF0127_02193 [Bacteroides sp. 1_1_30]
          Length = 650

 Score =  188 bits (478), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 172/568 (30%), Positives = 264/568 (46%), Gaps = 48/568 (8%)

Query: 93  DLPGNFLKEVS----LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG 148
           +LP   +K  S    L++V L  S  L   QQ   EYLL L+ DSL+  +R  A LP   
Sbjct: 24  NLPSTMVKPESVYFPLNEVRLLDSPFL-TLQQKGKEYLLWLNPDSLLHFYRVEAGLPPKA 82

Query: 149 KAYGGWENP----ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG 204
            AY GWE+        LRG F+G YLS+ + M  ST +  + +++  V+  L  CQ+   
Sbjct: 83  DAYAGWESQNVWGAGPLRGGFLGFYLSSVSMMHQSTGDKELLKRLKYVLKELKLCQDAGK 142

Query: 205 TGYLSAFP--TELFDSFEALK---------PVWAPYYTIHKILAGLLDQYVLADNAQALK 253
            G+L        LF    + K           WAP Y I+K+L GL   Y      +AL 
Sbjct: 143 DGFLLGIKDGRMLFKEVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCGLEEALP 202

Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
           M   + ++F  +V   ++   +++    L  E G +N+     Y +T   + L  A    
Sbjct: 203 MMIRLADWFGYQVLDKLSDEQIQK---LLVCEHGSINESYVEAYELTGQKRFLDWARRLH 259

Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYA 373
                  L+   D L  +HANT IP   G    Y  TGD  +    T F +IVN +H++ 
Sbjct: 260 DRAMWVPLSEGKDILYGWHANTQIPKFTGFHKYYMFTGDKRFLTAATNFWNIVNRNHTWV 319

Query: 374 TGGTSAREFWWDPKRLADTLGSE-NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
            GG S  E ++  +  AD L  +   ETC + NML+++  LF    +   A YYER L N
Sbjct: 320 IGGNSTGEHFFPKEEFADRLLLKGGPETCNSVNMLRLTESLFSQYPDAVKASYYERVLFN 379

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            +LS     + G+  Y   +  G  +      + ++ +SFWCC  TG+ES +KLG  IY 
Sbjct: 380 HILSAY-DPKKGMCCYFTSMRPGHYRI-----YASRDSSFWCCGHTGLESPAKLGKFIYS 433

Query: 493 EEEGNVPGLYIIQ---YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL 549
            +  N      I+   +I S   W  G V L Q+ + +   D   R+ LT + K++  Q 
Sbjct: 434 HKATNRKEEKEIRVNLFIPSVLTWHEGGVELVQR-NRLPDSD---RVELTMNLKKK--QR 487

Query: 550 SSLNLRMPVWTYSNGAQASLNG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
             L +R P W  ++ A   +NG  + L L   G ++   + W+  +++++QLP+   TE 
Sbjct: 488 LILWIRKPDW--ADKATLIINGKAEQLLLGNDGYWM-IDKVWNRKNRISLQLPMHTYTEN 544

Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGE 635
           +           A+L+GPY+LAG    E
Sbjct: 545 LIGT----GRYVALLYGPYVLAGRMGKE 568


>gi|427384823|ref|ZP_18881328.1| hypothetical protein HMPREF9447_02361 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728084|gb|EKU90943.1| hypothetical protein HMPREF9447_02361 [Bacteroides oleiciplenus YIT
           12058]
          Length = 813

 Score =  187 bits (474), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 162/578 (28%), Positives = 252/578 (43%), Gaps = 54/578 (9%)

Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
            L +V L   S  + A Q + +YLL  D++ ++   RK   +P   KAY G   P    R
Sbjct: 42  CLSEVRLLPGSPFYHAMQVSQQYLLDADIERMLNGRRKEVGIPEK-KAYPGSNQPAG-TR 99

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN-----------KIGTGYLSAF 211
                HY+S ++ M+A T +    ++++ ++  L+   N           K+   Y    
Sbjct: 100 ATDWHHYISGTSLMYAQTGDRRFLDRVNYLIDELAMLDNRKDSLYRVQGKKLELPYAKLM 159

Query: 212 PTELF--DSFEALKP----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
             EL      EA  P     W P+Y  HK  A   D Y+  DN +AL +     E     
Sbjct: 160 KGELLLNSPDEAGYPWGGLCWIPFYWQHKEFAAYRDAYLYCDNLKALNLWIKQAE----P 215

Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
           V + I   + +     L+ E GG+N V   LY++T D ++L ++   +    +  +A   
Sbjct: 216 VTEFILKVNPDLFEGFLDIENGGINAVFADLYALTGDERYLAVSMKLNHQKVILNIANGK 275

Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
           D L   HAN  +P   G+  +Y++TGD + +     F  I    H    GG S  E +  
Sbjct: 276 DVLYGRHANFQLPAFEGTARQYQLTGDEVCRKATQNFAGIYYRDHMNCIGGNSCYERFGR 335

Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
              +   LGS + ETC TYNM+K++ + F  T ++ + DY+ERAL N +L+ Q     GV
Sbjct: 336 SGEITKRLGSTSSETCNTYNMMKIALNTFESTGDLHHMDYFERALYNHILASQDPETGGV 395

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFN--SFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
             Y + L  G         +  +FN    WCC GTG+E+ SK G+ IYF    N   LY+
Sbjct: 396 TYYTMLLPGGFK------SYSDRFNIEGIWCCVGTGMENHSKYGECIYF---NNHQSLYV 446

Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS-LNLRMPVWTYS 562
             +I S  +WK  ++ L Q+ D       + +   T  +  E G  +  + +R P W   
Sbjct: 447 NLFIPSELNWKEKNLHLKQETD-------FPQGDCTTLTILESGAYNHPIYIRYPHWA-G 498

Query: 563 NGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
                 +N +  PL    G ++     W   D++ I++  + R EA  DD      +  I
Sbjct: 499 REVSVRINDEEYPLHAQAGEYIRLQHPWKTGDRIRIEMKQTFRLEAAPDD----PFMNVI 554

Query: 622 LFGPY-----LLAGHTSGEWDIKTGTARSLSALISPIP 654
             GP      L A H   E+ IKT    S    +  IP
Sbjct: 555 FRGPIAYAAQLGADHLPNEY-IKTSRQNSSFLPLDDIP 591


>gi|396489945|ref|XP_003843216.1| hypothetical protein LEMA_P073260.1 [Leptosphaeria maculans JN3]
 gi|312219795|emb|CBX99737.1| hypothetical protein LEMA_P073260.1 [Leptosphaeria maculans JN3]
          Length = 748

 Score =  187 bits (474), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 160/527 (30%), Positives = 232/527 (44%), Gaps = 90/527 (17%)

Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-------IG 204
           GGWE+    L GH+ GHY+SA +Q +     +  KEK+  +V  L+ CQ           
Sbjct: 100 GGWEDG-GLLSGHWTGHYMSALSQAYIDKGESIFKEKLDWMVAELAACQEAYTEYKQPTH 158

Query: 205 TGYLSAFPTELFDSFEALKP-------------VWAPYYTIHKILAGLLDQYVLADNAQA 251
            GYL A P    D+   L P              WA +YT HKI+ GLLD Y  A+N QA
Sbjct: 159 LGYLGALPE---DTVLRLGPPRFAVYGSNISTDTWAGWYTQHKIMRGLLDAYYNANNTQA 215

Query: 252 LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHL 311
           L +   M ++ +  +               +  E GG N+V   +Y++T + KHL  A  
Sbjct: 216 LDIVIKMADWAHLALTDTY-----------IAGEFGGANEVFPEIYALTGEEKHLQTAKA 264

Query: 312 FDKPCFLGFLALQAD---------------YLSHFHANTHIPIVIGSQMRYEVTGDPLYK 356
           FD    L F A  +D                    HANTH+P  IG    YE TG   Y 
Sbjct: 265 FDNRESL-FSAAVSDQDILVMTPERKPGRRRRERLHANTHVPQFIGYLRIYEHTGSNEYL 323

Query: 357 LIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEETCTTYNMLK 408
           L    F   V     +A+G T           E + +   +A+++  E  ETC TYN L 
Sbjct: 324 LAAKNFFGWVVPHREFASGSTGGNVPGFSANPELFQNRDNIANSIADEGAETCITYNTLN 383

Query: 409 VSRHLFRWTKEIAYADYYERALTNGV----LSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
           ++R+LF       Y D+ ER L N +    +     ++P  + Y  PL  G  +      
Sbjct: 384 LARNLFLDEHNATYMDHCERGLFNMIAGSRVDTSNNSDP-QLTYFQPLSPGFGREYG--- 439

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
                N+  CC GTG+ES +K  +++Y     + P L+I  +I S+  W      + Q+ 
Sbjct: 440 -----NTGTCCGGTGMESHTKYQETVYL-RSAHSPVLWINLFIPSTLHWMERGFAIKQET 493

Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG--QNLPLPPPGNF 582
           +       + R   T  +    G L  + LR+P W   NG   ++NG  Q      P  +
Sbjct: 494 N-------FPREGSTKLTIAGEGAL-VIKLRVPGWV-RNGFAVTINGEAQATKNVQPSTY 544

Query: 583 LSATERWSYNDKLTIQLPLSLRTE-AIQDDRPEYASIQAILFGPYLL 628
           LS    W  ND + +Q+PLS+RTE AI  DRP+    QA+++GP LL
Sbjct: 545 LSLKRIWKTNDVIEVQMPLSIRTERAI--DRPD---TQAVMWGPVLL 586


>gi|227509161|ref|ZP_03939210.1| possible acetyl-CoA carboxylase, biotin carboxylase [Lactobacillus
           brevis subsp. gravesensis ATCC 27305]
 gi|227191368|gb|EEI71435.1| possible acetyl-CoA carboxylase, biotin carboxylase [Lactobacillus
           brevis subsp. gravesensis ATCC 27305]
          Length = 606

 Score =  185 bits (470), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 127/369 (34%), Positives = 185/369 (50%), Gaps = 38/369 (10%)

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
           L  E GGMND LY L+SIT D +HL  A  FD+      LA   D L   HANT IP ++
Sbjct: 2   LKVEYGGMNDALYHLFSITKDERHLTAATYFDEVELFKDLAAAKDVLPGKHANTTIPKLL 61

Query: 342 GSQMRYEVTGD----------------PLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
           G+  RYE+  D                P+Y      F  IV   H+YATGG S  E + D
Sbjct: 62  GAIRRYEIFDDPQMAGQYLYEKDQKQLPIYLKAAENFWRIVINHHTYATGGNSQSEHFHD 121

Query: 386 PKRL-ADTL---GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
           P +L  D +   G+   ETC T+NMLK+SR LFR T +  Y DYY+R  +N +L  Q   
Sbjct: 122 PNQLYHDAVIEDGATTCETCNTHNMLKLSRELFRVTGDKKYLDYYDRTYSNAILGSQ-NP 180

Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
           + G+M Y  P+  G  K      +   ++ FWCC GTGIESF+KLGDS YF+E      L
Sbjct: 181 KTGMMTYFQPMAAGYRKV-----FNRPYDEFWCCTGTGIESFTKLGDSYYFKEGQT---L 232

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
           Y   Y S+       ++ L+ +VD  V     +++T++     +  +  ++  R P W++
Sbjct: 233 YATGYFSNQLSLPKENLKLDMQVDRKVGA---VKLTVSKLIDNKTSEPLNVKFRHPDWSH 289

Query: 562 SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
                   N +  P      F+   ++    D + I L ++L   +  D++ +Y S++  
Sbjct: 290 GR-LSVKKNQKTQPNNETFGFVEV-KKLVPGDVIEINLSMTLTVGSTPDNQ-QYISLK-- 344

Query: 622 LFGPYLLAG 630
            +GPY+LAG
Sbjct: 345 -YGPYVLAG 352


>gi|340347550|ref|ZP_08670658.1| hypothetical protein HMPREF9136_1656 [Prevotella dentalis DSM 3688]
 gi|339609246|gb|EGQ14121.1| hypothetical protein HMPREF9136_1656 [Prevotella dentalis DSM 3688]
          Length = 1007

 Score =  184 bits (467), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 174/674 (25%), Positives = 275/674 (40%), Gaps = 120/674 (17%)

Query: 69  ILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEV-SLHDVWLDQSSVLWRAQQTNLEYLL 127
           I+GD   +  + +  +          PG  +    SL DV LD  + L   +   L  + 
Sbjct: 136 IIGDATTDKGYPIKAQVRVVATAVAAPGQEMAHAFSLADVTLDGDNRLTHNRDEALREIC 195

Query: 128 MLDVDSLVWSFRKTASLPTPGKAYG-GWENPISELRGHFVGHYLSASAQMWASTHN---- 182
             DV   ++++R T  L T G     GW++P ++L+GH  GHY+SA AQ +A T +    
Sbjct: 196 SWDVSQQLYNYRDTYGLSTDGYTRSDGWDSPDTKLKGHGSGHYMSAIAQAYAVTKDPRQK 255

Query: 183 ATIKEKMSTVVFSLSECQNKI--------------------------------------- 203
           A +++ ++ +V  L  CQ K                                        
Sbjct: 256 AILRKNITRMVNELRACQEKTFVFDKALNRYWEARDFAPEEELRGLKGTWEAFDEYKKHP 315

Query: 204 ---GTGYLSAFPT------ELFDSFEALKPVWAPYYTIHKILAGLLD------QYVLADN 248
              G GY++A P       E++ ++     VWAPYY++HK LAGL+D         + D 
Sbjct: 316 EKYGYGYINAIPAQHCALIEMYRAYNNSDWVWAPYYSVHKQLAGLIDIATYFDDKAICDK 375

Query: 249 A--QALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE----------ETGGMNDVLYRL 296
           A   A  M  W+    + R          ER     N           E GGM++ L RL
Sbjct: 376 ALLTAKDMGLWVWNRMHYRTYVKEDGTEAERRSKPGNRYEMWDMYIAGEVGGMSESLARL 435

Query: 297 YSITHDP----KHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
             +  DP    K +  A  FD P F   L+   D +   HAN HIP+++G+   Y+   +
Sbjct: 436 SEMVSDPGEKAKLIEAAGCFDAPKFYNPLSKNVDDIRTRHANQHIPMIVGALRSYKTNKN 495

Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK----RLADTLGSENE--------ET 400
           P Y  +   F  +V   + YATGG    E +  P      +A     E E        ET
Sbjct: 496 PFYYHLSQNFWHLVQGRYMYATGGVGNGEMFRQPYTQILSMATNGMQEGERQANPDINET 555

Query: 401 CTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
           C TYN+LK++  L  +  + A Y DYYER L N ++      +     Y   +G   +K 
Sbjct: 556 CCTYNLLKLTSDLNCYNPDDARYMDYYERGLYNQIVG-SLNPDKYETCYQYAVGLNATKP 614

Query: 460 RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
                +G +     CC GTG E+ +K   + YF    N   L++  Y+ ++  WK+  + 
Sbjct: 615 -----FGNETPQSTCCGGTGSENHTKYQAAAYF---ANTHTLWVGLYMPTTLHWKAKGLT 666

Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPP 578
           + Q+     +W            K E     +L LR+P W  + G +  +NG+ +  L  
Sbjct: 667 IRQE----CAWPAQHTAIQIAEGKGEF----TLKLRVPYWA-TGGFEVKVNGKKVKQLFR 717

Query: 579 PGNFLSATE-RWSYNDKLTIQLPLSLRTE----------AIQDDRP-EYASIQAILFGPY 626
           P ++++  + RW   D + I +P +   E          A  D  P   A +  +++GP 
Sbjct: 718 PSSYVALEKTRWKAGDVVEIDMPFTKHIEYGADKLTSEVASMDGTPLRTAWVGTLMYGPL 777

Query: 627 LLAGHTSGEWDIKT 640
            + G  S  W   T
Sbjct: 778 AMTGTGSAIWKEAT 791


>gi|433653573|ref|YP_007297427.1| hypothetical protein Prede_2696 [Prevotella dentalis DSM 3688]
 gi|433304106|gb|AGB29921.1| hypothetical protein Prede_2696 [Prevotella dentalis DSM 3688]
          Length = 986

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 174/674 (25%), Positives = 275/674 (40%), Gaps = 120/674 (17%)

Query: 69  ILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEV-SLHDVWLDQSSVLWRAQQTNLEYLL 127
           I+GD   +  + +  +          PG  +    SL DV LD  + L   +   L  + 
Sbjct: 115 IIGDATTDKGYPIKAQVRVVATAVAAPGQEMAHAFSLADVTLDGDNRLTHNRDEALREIC 174

Query: 128 MLDVDSLVWSFRKTASLPTPGKAYG-GWENPISELRGHFVGHYLSASAQMWASTHN---- 182
             DV   ++++R T  L T G     GW++P ++L+GH  GHY+SA AQ +A T +    
Sbjct: 175 SWDVSQQLYNYRDTYGLSTDGYTRSDGWDSPDTKLKGHGSGHYMSAIAQAYAVTKDPRQK 234

Query: 183 ATIKEKMSTVVFSLSECQNKI--------------------------------------- 203
           A +++ ++ +V  L  CQ K                                        
Sbjct: 235 AILRKNITRMVNELRACQEKTFVFDKALNRYWEARDFAPEEELRGLKGTWEAFDEYKKHP 294

Query: 204 ---GTGYLSAFPT------ELFDSFEALKPVWAPYYTIHKILAGLLD------QYVLADN 248
              G GY++A P       E++ ++     VWAPYY++HK LAGL+D         + D 
Sbjct: 295 EKYGYGYINAIPAQHCALIEMYRAYNNSDWVWAPYYSVHKQLAGLIDIATYFDDKAICDK 354

Query: 249 A--QALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE----------ETGGMNDVLYRL 296
           A   A  M  W+    + R          ER     N           E GGM++ L RL
Sbjct: 355 ALLTAKDMGLWVWNRMHYRTYVKEDGTEAERRSKPGNRYEMWDMYIAGEVGGMSESLARL 414

Query: 297 YSITHDP----KHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
             +  DP    K +  A  FD P F   L+   D +   HAN HIP+++G+   Y+   +
Sbjct: 415 SEMVSDPGEKAKLIEAAGCFDAPKFYNPLSKNVDDIRTRHANQHIPMIVGALRSYKTNKN 474

Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK----RLADTLGSENE--------ET 400
           P Y  +   F  +V   + YATGG    E +  P      +A     E E        ET
Sbjct: 475 PFYYHLSQNFWHLVQGRYMYATGGVGNGEMFRQPYTQILSMATNGMQEGERQANPDINET 534

Query: 401 CTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
           C TYN+LK++  L  +  + A Y DYYER L N ++      +     Y   +G   +K 
Sbjct: 535 CCTYNLLKLTSDLNCYNPDDARYMDYYERGLYNQIVG-SLNPDKYETCYQYAVGLNATKP 593

Query: 460 RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
                +G +     CC GTG E+ +K   + YF    N   L++  Y+ ++  WK+  + 
Sbjct: 594 -----FGNETPQSTCCGGTGSENHTKYQAAAYF---ANTHTLWVGLYMPTTLHWKAKGLT 645

Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPP 578
           + Q+     +W            K E     +L LR+P W  + G +  +NG+ +  L  
Sbjct: 646 IRQE----CAWPAQHTAIQIAEGKGEF----TLKLRVPYWA-TGGFEVKVNGKKVKQLFR 696

Query: 579 PGNFLSATE-RWSYNDKLTIQLPLSLRTE----------AIQDDRP-EYASIQAILFGPY 626
           P ++++  + RW   D + I +P +   E          A  D  P   A +  +++GP 
Sbjct: 697 PSSYVALEKTRWKAGDVVEIDMPFTKHIEYGADKLTSEVASMDGTPLRTAWVGTLMYGPL 756

Query: 627 LLAGHTSGEWDIKT 640
            + G  S  W   T
Sbjct: 757 AMTGTGSAIWKEAT 770


>gi|336397986|ref|ZP_08578786.1| protein of unknown function DUF1680 [Prevotella multisaccharivorax
           DSM 17128]
 gi|336067722|gb|EGN56356.1| protein of unknown function DUF1680 [Prevotella multisaccharivorax
           DSM 17128]
          Length = 943

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 181/680 (26%), Positives = 290/680 (42%), Gaps = 132/680 (19%)

Query: 69  ILGDQKDEVSWALLYRKIKNPGGFDLPGNF-LKEVS----LHDVWLDQSSVLWRAQQTNL 123
           I+GD+  +  + +   KIK      +P N   KE++    L DV ++  + L   +   +
Sbjct: 93  IIGDETTDNGYPIT-AKIK---VVSMPANEEKKEIAQTFPLSDVTINGDNRLTHNRDEAI 148

Query: 124 EYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
             +   DV   ++++R T ++ T G K   GW++P ++L+GH  GHY+SA AQ +A T +
Sbjct: 149 AAICSWDVTQQLYNYRDTYNMSTEGYKVADGWDSPDTKLKGHGSGHYMSAIAQAYAVTKD 208

Query: 183 ----ATIKEKMSTVVFSLSECQNKI----------------------------------- 203
               A +K+ ++ +V  L  CQ K                                    
Sbjct: 209 PQQKAILKKNITRMVNELRACQEKTFVWNDSLGRYWEARDFAPESELKNMKGTWAAFDEY 268

Query: 204 -------GTGYLSAFPTELFDSFEALKP------VWAPYYTIHKILAGLLDQYVLADN-- 248
                  G GY++A P++     E  +P      VWAPYYTIHK LAGL+D   L D+  
Sbjct: 269 KKHPEKYGYGYINAIPSQHCALIEMYRPYNNSDWVWAPYYTIHKELAGLIDIATLFDDKE 328

Query: 249 --AQALKMATWMVEYFYNRVQKVITMYS----VERHWYSLNE----------ETGGMNDV 292
             A+AL +A  M  + +NR+     + +     ER     N           E GGM + 
Sbjct: 329 VAAKALLIAKDMGLWVWNRMHYRTYVKADGTQEERRAKPGNRYEMWDMYIAGEVGGMQES 388

Query: 293 LYRLYSI----THDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYE 348
           L RL  +    T   + L  A  FD P F   LA   D +   HAN HIP+++G+   Y+
Sbjct: 389 LSRLSEMVSNSTDKARLLEAAQCFDAPKFYEPLAKNIDDIRTRHANQHIPMIVGALRSYK 448

Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP-----------KRLADTLGSEN 397
              D  Y  +   F  +V   + YATGG    E +  P            +  + + + N
Sbjct: 449 SNHDIHYYNVADNFWHLVQGRYMYATGGVGNGEMFRQPYTQVLSMATNGMQEGEAMANPN 508

Query: 398 -EETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEPG--VMIYMLPLG 453
             ETC TYN+LK+++ L  +  + A   DYYER L N ++      +P    + Y   +G
Sbjct: 509 LNETCCTYNLLKLTKDLNVYNPDDAELMDYYERGLYNQIVG---SLDPDHYAVTYQYAVG 565

Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
              +K      +G +     CC GTG E+ +K   + YF  +     L++  Y+ ++  W
Sbjct: 566 LNATKP-----FGNETPQSTCCGGTGSENHTKYQQAAYFHNDST---LWVCLYMPTTLQW 617

Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
           +   + L Q      +W P  R  +  +  +  G   +L LR+P W  + G +  LNG+ 
Sbjct: 618 RDKGITLEQD----CTW-PAQRSVIRLTKGE--GNF-TLKLRVPYWA-TRGFEILLNGKP 668

Query: 574 LP--LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP-EYASIQAI--------- 621
           +     P      +   W+ +D+L I +P S   E   D  P + AS   I         
Sbjct: 669 VQHHYQPSSYVTISGHHWTVSDRLEIIMPFSTHIEYGADKLPAKVASADGIPLKSAWTGV 728

Query: 622 -LFGPYLLAGHTSGEWDIKT 640
            ++GP  + G  +  W   T
Sbjct: 729 VMYGPLCMTGTNATTWKQAT 748


>gi|389638620|ref|XP_003716943.1| acetyl-CoA carboxylase [Magnaporthe oryzae 70-15]
 gi|351642762|gb|EHA50624.1| acetyl-CoA carboxylase [Magnaporthe oryzae 70-15]
          Length = 1018

 Score =  181 bits (460), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 142/478 (29%), Positives = 214/478 (44%), Gaps = 76/478 (15%)

Query: 206 GYLSAFPTELFDSFEALKP-------------VWAPYYTIHKILAGLLDQYVLADNAQAL 252
           GYL A P    D+   L P              WAP+YT HKI+ GLLD Y   +N+QAL
Sbjct: 390 GYLGALPE---DTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQAL 446

Query: 253 KMATWMVEYFY----------NRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITH 301
           ++ T M ++ +             +  +T   +   W   +  E GG N+V   +Y +T 
Sbjct: 447 QVVTRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTG 506

Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYL--------------SHFHANTHIPIVIGSQMRY 347
           DPKHL  A  FD    L   A+  D +                 HANTH+P  IG    +
Sbjct: 507 DPKHLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIF 566

Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEE 399
           E  G   Y      F   V     +A+GGT           E + +   +A+ +G    E
Sbjct: 567 EQGGGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAE 626

Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV----MIYMLPLGRG 455
           TCT YNMLK++R+LF       Y D YER L N +   +  T        + Y  PL  G
Sbjct: 627 TCTAYNMLKLARNLFLHNHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPG 686

Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
                S   +G   N+  CC GTG+ES +K  +++Y     +   L++  Y+ S+  W+ 
Sbjct: 687 -----SNRDYG---NTGTCCGGTGLESHTKYQETVYL-RSADGSALWVNLYVPSTLTWEE 737

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--NGAQASLNGQN 573
             + + Q+       D  ++ T+T SS+QE      + LR+P W      G   S+NG+ 
Sbjct: 738 KGITVRQET--AFPRDDTVKFTVTTSSRQEP---LDMKLRVPAWIQKTPGGFNVSINGEQ 792

Query: 574 L---PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
                 P PG++++ +  W+  D + I++P ++R E    DRP+    QAI++GP LL
Sbjct: 793 FRPGETPTPGSYMTVSRTWATGDVVEIKMPFAVRIERAP-DRPD---TQAIMWGPLLL 846



 Score = 41.6 bits (96), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 5/62 (8%)

Query: 142 ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN 201
           A LP PG    GWE+    L GH+ GH+++A +Q +A       K K+  +V  L+ CQ+
Sbjct: 79  AGLPVPG----GWEDG-GLLSGHWAGHFMTALSQAFADQGEELYKTKLDWMVKELAACQD 133

Query: 202 KI 203
            I
Sbjct: 134 AI 135


>gi|440483441|gb|ELQ63839.1| acetyl-CoA carboxylase [Magnaporthe oryzae P131]
          Length = 1055

 Score =  181 bits (460), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 142/478 (29%), Positives = 214/478 (44%), Gaps = 76/478 (15%)

Query: 206 GYLSAFPTELFDSFEALKP-------------VWAPYYTIHKILAGLLDQYVLADNAQAL 252
           GYL A P    D+   L P              WAP+YT HKI+ GLLD Y   +N+QAL
Sbjct: 427 GYLGALPE---DTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQAL 483

Query: 253 KMATWMVEYFY----------NRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITH 301
           ++ T M ++ +             +  +T   +   W   +  E GG N+V   +Y +T 
Sbjct: 484 QVVTRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTG 543

Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYL--------------SHFHANTHIPIVIGSQMRY 347
           DPKHL  A  FD    L   A+  D +                 HANTH+P  IG    +
Sbjct: 544 DPKHLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIF 603

Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEE 399
           E  G   Y      F   V     +A+GGT           E + +   +A+ +G    E
Sbjct: 604 EQGGGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAE 663

Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV----MIYMLPLGRG 455
           TCT YNMLK++R+LF       Y D YER L N +   +  T        + Y  PL  G
Sbjct: 664 TCTAYNMLKLARNLFLHDHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPG 723

Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
                S   +G   N+  CC GTG+ES +K  +++Y     +   L++  Y+ S+  W+ 
Sbjct: 724 -----SNRDYG---NTGTCCGGTGLESHTKYQETVYL-RSADGSALWVNLYVPSTLTWEE 774

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--NGAQASLNGQN 573
             + + Q+       D  ++ T+T SS+QE      + LR+P W      G   S+NG+ 
Sbjct: 775 KGITVRQET--AFPRDDTVKFTVTTSSRQEP---LDMKLRVPAWIQKTPGGFNVSINGEQ 829

Query: 574 L---PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
                 P PG++++ +  W+  D + I++P ++R E    DRP+    QAI++GP LL
Sbjct: 830 FRPGETPTPGSYMTVSRTWATGDVVEIKMPFAVRIERAP-DRPD---TQAIMWGPLLL 883



 Score = 41.6 bits (96), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 5/62 (8%)

Query: 142 ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN 201
           A LP PG    GWE+    L GH+ GH+++A +Q +A       K K+  +V  L+ CQ+
Sbjct: 116 AGLPVPG----GWEDG-GLLSGHWAGHFMTALSQAFADQGEELYKTKLDWMVKELAACQD 170

Query: 202 KI 203
            I
Sbjct: 171 AI 172


>gi|300726603|ref|ZP_07060044.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Prevotella
           bryantii B14]
 gi|299776135|gb|EFI72704.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Prevotella
           bryantii B14]
          Length = 832

 Score =  181 bits (460), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 162/563 (28%), Positives = 247/563 (43%), Gaps = 84/563 (14%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--------YGGWENPISELRGHFVGHY 169
           A + N   LL  DVD L+  F + A L     A        +  W     +L GH  GHY
Sbjct: 38  AMEINFNTLLAYDVDRLLTPFIRQAGLHEGRYADWQKKHPNFKNWGGDGFDLSGHIGGHY 97

Query: 170 LSASAQMWASTHNATIKEKMST----VVFSLSECQNKIGT------GYLSAFPTELFDSF 219
           LSA A  +A+  +A  KE++ +    ++  L +CQN          G++   P  + + +
Sbjct: 98  LSALAMAYAACQDAATKERLQSRLLYMIDVLKDCQNSFDQNTTGLYGFIGGQP--INEDW 155

Query: 220 EAL----------KPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFYN- 264
           E L             W P+Y  HK++AGL D Y+ A N  A     KMA W  +     
Sbjct: 156 EKLYQGDISGIWQHRGWVPFYCEHKVMAGLRDAYLYAHNQDAKLMLKKMADWCTQLIAKV 215

Query: 265 ---RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL-GF 320
               +QK++T+            E GG+N+ +   Y+I  D ++L  A  + +   L G 
Sbjct: 216 SDADMQKMLTI------------EHGGINESMADCYAIFKDTRYLEAAKKYSQREMLEGL 263

Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL-YKLIGTFFMDIVNASHSYATGGTSA 379
            +L A +L + HANT +P  IG +   E     L Y    + F   V    +   GG S 
Sbjct: 264 QSLNATFLDNRHANTQVPKYIGFERIVEEDPAALQYATAASNFWQDVAHHRTVCIGGNSI 323

Query: 380 REFWW---DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
            E +    +  R  D L  E  E+C T NMLK+S  L   T +  YAD+YE A+ N +LS
Sbjct: 324 SEHFLSKTNSNRYIDNL--EGPESCNTNNMLKLSEMLSDRTHDAGYADFYEYAMWNHILS 381

Query: 437 IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
            Q   + G  +Y   L     + +    +       WCC GTG+E+ SK G  +Y  +  
Sbjct: 382 TQ-DPQTGGYVYFTTL-----RPQGYRIYSVPNQGMWCCVGTGMENHSKYGHFVYTHDGD 435

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
               LY+  + +S  D K     L Q+ +    ++P   +T+  S +  +       +R 
Sbjct: 436 RT--LYVNLFTASKLDGKK--FKLTQQTN--YPYEPKTTITIEKSGRYAIA------IRR 483

Query: 557 PVWTYSNGAQASLNG--QNLPLPPPGNFLSAT--ERWSYNDKLTIQLPLSLRTEAIQDDR 612
           P WT S+  +  +NG  Q L +P  G    AT   +W   D +T+ +P++LR EA     
Sbjct: 484 PWWTTSD-YRIQVNGQTQQLNIPSAGTSAYATLERKWKKGDVITVDIPMTLRQEAC---- 538

Query: 613 PEYASIQAILFGPYLLAGHTSGE 635
           P Y    A  +GP LL   T+ +
Sbjct: 539 PNYEDYIAFEYGPILLGAQTTSQ 561


>gi|440466410|gb|ELQ35678.1| acetyl-CoA carboxylase [Magnaporthe oryzae Y34]
          Length = 1055

 Score =  181 bits (460), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 142/478 (29%), Positives = 214/478 (44%), Gaps = 76/478 (15%)

Query: 206 GYLSAFPTELFDSFEALKP-------------VWAPYYTIHKILAGLLDQYVLADNAQAL 252
           GYL A P    D+   L P              WAP+YT HKI+ GLLD Y   +N+QAL
Sbjct: 427 GYLGALPE---DTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQAL 483

Query: 253 KMATWMVEYFY----------NRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITH 301
           ++ T M ++ +             +  +T   +   W   +  E GG N+V   +Y +T 
Sbjct: 484 QVVTRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTG 543

Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYL--------------SHFHANTHIPIVIGSQMRY 347
           DPKHL  A  FD    L   A+  D +                 HANTH+P  IG    +
Sbjct: 544 DPKHLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIF 603

Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEE 399
           E  G   Y      F   V     +A+GGT           E + +   +A+ +G    E
Sbjct: 604 EQGGGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAE 663

Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV----MIYMLPLGRG 455
           TCT YNMLK++R+LF       Y D YER L N +   +  T        + Y  PL  G
Sbjct: 664 TCTAYNMLKLARNLFLHDHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPG 723

Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
                S   +G   N+  CC GTG+ES +K  +++Y     +   L++  Y+ S+  W+ 
Sbjct: 724 -----SNRDYG---NTGTCCGGTGLESHTKYQETVYL-RSADGSALWVNLYVPSTLTWEE 774

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--NGAQASLNGQN 573
             + + Q+       D  ++ T+T SS+QE      + LR+P W      G   S+NG+ 
Sbjct: 775 KGITVRQET--AFPRDDTVKFTVTTSSRQEP---LDMKLRVPAWIQKTPGGFNVSINGEQ 829

Query: 574 L---PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
                 P PG++++ +  W+  D + I++P ++R E    DRP+    QAI++GP LL
Sbjct: 830 FRPGETPTPGSYMTVSRTWATGDVVEIKMPFAVRIERAP-DRPD---TQAIMWGPLLL 883



 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 5/62 (8%)

Query: 142 ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN 201
           A LP PG    GWE+    L GH+ GH+++A +Q +A       K K+  +V  L+ CQ+
Sbjct: 116 AGLPVPG----GWEDG-GLLSGHWAGHFMTALSQAFADQGEELYKTKLDWMVKELAACQD 170

Query: 202 KI 203
            I
Sbjct: 171 AI 172


>gi|330467692|ref|YP_004405435.1| glycosylase [Verrucosispora maris AB-18-032]
 gi|328810663|gb|AEB44835.1| glycosylase [Verrucosispora maris AB-18-032]
          Length = 1126

 Score =  177 bits (450), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 146/472 (30%), Positives = 214/472 (45%), Gaps = 73/472 (15%)

Query: 206 GYLSAFPTELFDSF----------EALKPVWAPYYTIHKILAGLLDQYVLADNAQAL--- 252
           GYL A P +                A    WAP+YT HKI+ GLLD Y   DNA AL   
Sbjct: 416 GYLGAIPEDAVLRLGPPRWAVYGSNATTNTWAPWYTQHKIMRGLLDAYYHTDNATALDVV 475

Query: 253 -KMATW------MVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPK 304
            KMA W      + +  +      IT  ++   W   +  ETGG N+V   +Y++T D K
Sbjct: 476 VKMAGWAHLALTIGDKNHPAYTGPITRDNLNYMWDLYIAGETGGANEVFPEIYALTGDQK 535

Query: 305 HLLLAHLFD-KPCFLGFLALQADYL-------------SHFHANTHIPIVIGSQMRYEVT 350
           HL  A LFD +           D L                HAN+H+P  +G    YE +
Sbjct: 536 HLETAKLFDNRESLFDACVENRDILVVTPQNNPGRRRPDRLHANSHVPQFVGYLRVYEHS 595

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEETCT 402
           GD  Y      F  +V     YA GGT           E + +   +A+++     ETCT
Sbjct: 596 GDTEYFQAAKNFYGMVVPHRMYANGGTGGNYPGSNNNIELFQNRGNIANSIAQGGAETCT 655

Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT----EPGVMIYMLPLGRGVSK 458
           TYN+LK++R+LF    + AY DYYER L N +   +  T     P V  Y  PL  G ++
Sbjct: 656 TYNLLKLARNLFFHEHDAAYLDYYERGLINQIAGSRADTTTVSNPQVT-YFQPLTPGANR 714

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGNVPGLYIIQYISSSFDWKSGH 517
                G+G   N+  CC GTG+E+ +K  ++IYF+  +G+   L++  Y++S+  W    
Sbjct: 715 -----GYG---NTGTCCGGTGVENHTKYQETIYFKSADGDT--LWVNLYVASTLTWAERD 764

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
             + Q+ D       Y R   T  +    G L  + LR+P W    G   ++NG    + 
Sbjct: 765 FTITQQTD-------YPRADRTRLTVDGSGPL-DIKLRVPGWV-RKGFFVTINGLAQQVT 815

Query: 578 PPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
              N +L+ +  W   D + I++P S+R E    DRP+    Q++ +GP LL
Sbjct: 816 ATANSYLTLSRTWQRGDVIEIRMPFSIRIERAL-DRPD---TQSVFWGPVLL 863



 Score = 47.8 bits (112), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 49/107 (45%), Gaps = 9/107 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG--KAYGGWEN 156
           L++V+L D    +     R +  N  YL  LD    +  F   A  P P    A GGWE+
Sbjct: 67  LRDVTLGDGLFQEK----RDRMKN--YLRQLDERRFLVLFNNQAGRPNPAGVTAPGGWED 120

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
               L GH+ GH ++A AQ +A       K K+  +V  L+ CQ  I
Sbjct: 121 G-GLLSGHWAGHVMTALAQGYADHGEPIFKSKLDWIVDELAACQTAI 166


>gi|261879318|ref|ZP_06005745.1| acetyl-CoA carboxylase [Prevotella bergensis DSM 17361]
 gi|270334148|gb|EFA44934.1| acetyl-CoA carboxylase [Prevotella bergensis DSM 17361]
          Length = 839

 Score =  177 bits (449), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 177/587 (30%), Positives = 257/587 (43%), Gaps = 94/587 (16%)

Query: 95  PGNF-LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--- 150
           P +F L EV+L D      S L  A   N++ L+  DVD L+  F + A L T   A   
Sbjct: 29  PHHFNLDEVTLLD------SPLKTAMDLNIKMLMQYDVDRLLTPFIRQAGLHTGRYADWQ 82

Query: 151 -----YGGWENPISELRGHFVGHYLSASAQMWASTHN----ATIKEKMSTVVFSLSECQN 201
                +  W     +L GH  GHY+SA A  +A+ H+    A IKE++  ++  L +CQ+
Sbjct: 83  SRHPNFMNWGGNNFDLSGHVGGHYVSALAMAYAACHDTATKARIKERLDYMIDVLKDCQD 142

Query: 202 KIGT------GYLSAFPTELF---------DSFEALKPVWAPYYTIHKILAGLLDQYVLA 246
              T      G++   P              SF   +  W P+Y  HK+LAGL D Y+  
Sbjct: 143 AYDTNTEGLYGFIGGQPINDMWKKMYAGDISSFRQHRG-WVPFYCQHKVLAGLRDAYLYT 201

Query: 247 DNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
            N  A     K+A W V    N      TM +V      L+ E GGMN+ L   Y++  D
Sbjct: 202 GNTTARDLFRKLADWSVNLVSNLSDA--TMQTV------LDTEHGGMNETLADAYTLFGD 253

Query: 303 PKHLLLAHLFDKPCFL-GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTF 361
            K+L  A  +     L G       +L + HANT +P  IG +   E   DP      T 
Sbjct: 254 SKYLAAARKYSHQTMLNGMQTPNPTFLDNRHANTQVPKYIGFERVAE--EDPTATTYATA 311

Query: 362 ---FMDIVNASHSYATGGTSAREFWW---DPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
              F D V  + +   GG S  E +    +  R  D L  +  E+C T NM+K+S  +  
Sbjct: 312 ASNFWDDVAQNRTVCIGGNSVGEHFLSVGNSNRYIDHL--DGPESCNTNNMMKLSEMMAD 369

Query: 416 WTKEIAYADYYERALTNGVLSIQRGTEPGVMIY--MLPLG-RGVSKARSTHGWGTKFNSF 472
            T +  YAD+YE A+ N +LS Q  T  G + +  + P G R  SK              
Sbjct: 370 RTHDARYADFYEYAMYNHILSTQDPTTGGYVYFTTLRPQGYRIYSKVNE---------GM 420

Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
           WCC GTG+E+ SK G  +Y  +      +YI  + +S  D K  H +L Q+     +  P
Sbjct: 421 WCCVGTGMENHSKYGHFVYTHDADT--AVYINLFTASKLDNK--HFMLTQE-----TAYP 471

Query: 533 YLRMTLTFSSKQEVGQLS--SLNLRMPVWTYSNGAQASLNGQNLPLP---PPGNFLSATE 587
           Y + T     K  VG+    ++ +R P WT ++ +  S+NG   PL       ++     
Sbjct: 472 YEQRT-----KITVGKSGTYTIAVRHPWWTTADYS-ISVNGTKQPLDVLQGQASYCRLKR 525

Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
            W   D +T+ LP+SLR        P Y+   A  +GP LL   T+ 
Sbjct: 526 AWKAGDVITVDLPMSLRVAEC----PNYSDYIAFEYGPVLLGAQTTA 568


>gi|332669733|ref|YP_004452741.1| hypothetical protein Celf_1219 [Cellulomonas fimi ATCC 484]
 gi|332338771|gb|AEE45354.1| protein of unknown function DUF1680 [Cellulomonas fimi ATCC 484]
          Length = 752

 Score =  176 bits (445), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 168/594 (28%), Positives = 247/594 (41%), Gaps = 49/594 (8%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ+T+L YLL LD   L+  FR+ A LP   + YG WE+    L GH  GH LSA++ +W
Sbjct: 19  AQRTDLAYLLRLDPQRLLAPFRREAGLPPLAEPYGNWES--MGLDGHTGGHALSAASLLW 76

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVW 226
           A+T +    E  + +V  L  CQ  +GTGY+   P    LF+   A         L   W
Sbjct: 77  AATGDPRTAELAAALVDGLDACQEALGTGYVGGVPHGVALFERIAAGEVSADSFGLNGAW 136

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            P+Y +HK +AGL+D    A    A + A  +V  F      V       +    L  E 
Sbjct: 137 VPWYNLHKTVAGLVDAVRYAPAGTA-ERARRVVLRFAEWWLGVAAGLDDAQFAAMLRTEF 195

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGM +    L ++T       +A  F     L  L    D L   HANT I  V+G    
Sbjct: 196 GGMCEAFADLAALTGRDDLRAMAVRFADRTLLDPLLDGRDALDGLHANTQIAKVVGWAAL 255

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
            E  GD  ++     F D V    S   GG S  E +      +  L S E  E+C T N
Sbjct: 256 AEQDGDGGWERAARTFWDAVTTHRSLVFGGDSVGEHFHPVDDFSGALTSPEGPESCNTAN 315

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH-- 463
           ML+++R L     +    D+ ERAL N VLS Q     G  +Y  P       AR  H  
Sbjct: 316 MLELTRRLLLRRPDPTLLDFAERALVNHVLSAQH--PDGGFVYFTP-------ARPDHYR 366

Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
            +    + FWCC GTG+E++++LG+ +    +G+   L +   +     W    V L   
Sbjct: 367 VYSQPEDGFWCCVGTGLETYARLGE-LALATQGD--DLIVHLPVPVRATWGDAVVTLRSP 423

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
             P +S      +TL     +      ++ +R P W   + A  ++ G        G +L
Sbjct: 424 Y-PDLSAAAPTTLTLDLPGPRRF----AVRVRRPAWVGGDLAL-TVGGAPADATDDGTYL 477

Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA--GHTSGEWDIKTG 641
           S T  W   D LT + P  +  E +    P+ +   A   GP +LA  G T     ++  
Sbjct: 478 SVTRTWHDGDVLTWEHPARVVAERL----PDGSDWVAFRRGPVVLAARGGTDDLPGLRAD 533

Query: 642 TAR-------SLSALI-SPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEF 687
            +R        L AL  +P+  + +A        +     V+      + +E F
Sbjct: 534 ASRMGHVAAGPLHALAGTPVVEAVDATAAASRVRTAGREVVLDTDAGPVALEPF 587


>gi|345514178|ref|ZP_08793691.1| hypothetical protein BSEG_03388 [Bacteroides dorei 5_1_36/D4]
 gi|229437170|gb|EEO47247.1| hypothetical protein BSEG_03388 [Bacteroides dorei 5_1_36/D4]
          Length = 1118

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 172/692 (24%), Positives = 292/692 (42%), Gaps = 124/692 (17%)

Query: 69  ILGDQKDEVSWALLYR-KIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLL 127
           I+GD   E  + +  + ++ +      P      + L++V +D ++ L   +   ++ ++
Sbjct: 117 IIGDDTTENGYPITAKIEVVDTKNTIFPKLIAHTIPLNNVKIDGNNRLTSNRDLAIKEII 176

Query: 128 MLDVDSLVWSFRKTASLPTPGKAYG-GWENPISELRGHFVGHYLSASAQMWAS----THN 182
             DV   ++++R T  L T G     GW++P ++L+GH  GHY+SA A  +A+    +H 
Sbjct: 177 SWDVSQQLYNYRDTYGLSTEGYTRSDGWDSPETKLKGHGSGHYMSALALAYAAATNPSHK 236

Query: 183 ATIKEKMSTVVFSLSECQNKI--------------------------------------- 203
             ++  ++ +V  L ECQ +                                        
Sbjct: 237 EILRRNITRMVNELRECQERTFVWSEELGRYLEARDFAPEEELKKMKGTWEAFDEHKTKW 296

Query: 204 ---GTGYLSAFP------TELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNA----Q 250
              G GYL+A P       E++ ++     VWAPYY+IHK LAGL+D     D+     +
Sbjct: 297 ATYGYGYLNAIPPHHPALIEMYRAYNNSDWVWAPYYSIHKQLAGLIDIATYMDDKSIADK 356

Query: 251 ALKMATWMVEYFYNR------VQKVITMYSVERHWYSLNE----------ETGGMNDVLY 294
           AL +A  M  + +NR      V+K  T    ER     N           E GGM + L 
Sbjct: 357 ALLIAKDMGLWVWNRMHYRTYVKKDGT--QEERRTRPGNRYEMWNMYIAGEVGGMGESLA 414

Query: 295 RLYSITHDPKH----LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           RL  +   P+     +  ++ FD P F   L+   D + + HAN HIP++IG+   Y   
Sbjct: 415 RLSEMVSAPEEKARLIEASNCFDSPAFYEPLSKNIDDIRNRHANQHIPMIIGALRSYLSN 474

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG----SENE-------- 398
            D  Y  +   F +++   + Y+TGG    E +  P     ++     SE E        
Sbjct: 475 NDTFYYHVSHNFWNLIQGRYRYSTGGVGNGEMFRQPYTQIVSMAMNGVSEGESHSNPHIN 534

Query: 399 ETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
           ETC TYN+LK+++ L  +  + A Y DYYER L N ++      E     Y   +G   S
Sbjct: 535 ETCCTYNLLKLTKDLNCFNPDDARYMDYYERTLYNQIIG-SLHPEHYQTTYQYAVGLNAS 593

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
           K      WG +     CC GTG E+  K  ++ YF  +     L++  Y+ ++  W+  +
Sbjct: 594 KP-----WGNETPQSTCCGGTGSENHVKYQEATYFVSDNT---LWVALYMPTTLHWEEKN 645

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL- 576
           + L Q+      W P    T+  ++ +      ++ LR+P W  ++G    LNG ++   
Sbjct: 646 ITLQQE----CLW-PAKSSTIKVTAGE---ARFAMKLRVPYWA-TDGFDVKLNGISIATH 696

Query: 577 -PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP-----------EYASIQAILFG 624
             P    +    +W  ND + I +P +   +   D  P           E A +  +++G
Sbjct: 697 YQPCSYAVIPARQWKENDIVEITMPFTKHIDYGPDKLPAKIASKDGHQLETAWVGTLMYG 756

Query: 625 PYLLAGHTSGEWDIKTGTARSLSALISPIPPS 656
           P+ +       W   T    S  A I+ + P+
Sbjct: 757 PFAMTATDITNWTEATLNIDSRLASIAVVEPN 788


>gi|296129045|ref|YP_003636295.1| hypothetical protein Cfla_1194 [Cellulomonas flavigena DSM 20109]
 gi|296020860|gb|ADG74096.1| protein of unknown function DUF1680 [Cellulomonas flavigena DSM
           20109]
          Length = 749

 Score =  172 bits (437), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 169/619 (27%), Positives = 264/619 (42%), Gaps = 79/619 (12%)

Query: 94  LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGG 153
           LPG  L+ V L D    Q      AQ+T LEYLL LD D L+  FR+ A LP   + YG 
Sbjct: 10  LPG--LRAVRLTDGLFAQ------AQRTALEYLLGLDPDRLLAPFRREAGLPPVAEPYGS 61

Query: 154 WENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP- 212
           WE+    L GH  GH LSA++  WA+T +         +V  L  CQ+ +GTGY+   P 
Sbjct: 62  WES--LGLDGHIGGHALSAASLQWAATGDDRAAGMAHALVDGLVLCQDALGTGYVGGLPG 119

Query: 213 -TELFDSFEA---------LKPVWAPYYTIHKILAGLLD--QYVLADNA-----QALKMA 255
              L++S  +         L   W P+Y +HK  AGL+D  +Y  AD A      A+++ 
Sbjct: 120 GVALWESVASGGAEAGTFDLGGAWVPWYNVHKTYAGLIDAARYAPADVAVRAMRAAVRLG 179

Query: 256 TWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKP 315
            W V    +R+               L  E GGM +    L ++T D ++  LA  F   
Sbjct: 180 DWGVA-LSDRLDDAAFA-------RMLRTEFGGMCEAYGDLAALTGDARYAALARRFADE 231

Query: 316 CFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATG 375
             LG L    D L   HANT +  V+G    +   G+    L    F+  V    +   G
Sbjct: 232 SLLGPLRESRDELDGLHANTQVAKVVG----WPAIGEADAALA---FVRTVLDHRTLVLG 284

Query: 376 GTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVL 435
           G S  E  + P+        E  E+C T N+L+V R L+  T ++A  D  ER L N VL
Sbjct: 285 GHSVAEH-FTPRPERHVTHREGPESCNTANLLEVERRLYERTGDVALLDAAERQLVNHVL 343

Query: 436 SIQRGTEPGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
           S Q     G  +Y  P       AR  H   + T+    WCC GT +E++++LG+  Y  
Sbjct: 344 SAQH--PDGGFVYFTP-------ARPGHYRVYSTRDACMWCCVGTALETYARLGELAYAL 394

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
              +   L +   + S+ +     V L+    P      +  +T+   +  ++    +++
Sbjct: 395 CGHD---LLVNLPVPSTLEEPGLRVRLDSTY-PRALATTHATLTVDVDAPTDL----AVH 446

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
           LR P W   + A  +++G  +P     + +++    W   + L  +L      E +  D 
Sbjct: 447 LRRPSWARGDLAP-TVDGVGVPATAERDGYVTVRRTWRAGEVLAWRLVAGPAAERLPGDD 505

Query: 613 PEYASIQAILFGPYLLA--GHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQE--- 667
                  A+ +GP  LA  G T     ++ G AR       P+ P  +  ++  + +   
Sbjct: 506 ----GWVALRWGPVALAVRGDTDDLVGLRAGDARMGHVAHGPLRPLADTPVLVGSDDDIS 561

Query: 668 -----SGNSTFVMSNSNQS 681
                  + TFV+    ++
Sbjct: 562 AALRPGPDGTFVLDRGAEA 580


>gi|150003704|ref|YP_001298448.1| hypothetical protein BVU_1135 [Bacteroides vulgatus ATCC 8482]
 gi|149932128|gb|ABR38826.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
          Length = 1116

 Score =  171 bits (434), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 170/690 (24%), Positives = 291/690 (42%), Gaps = 120/690 (17%)

Query: 69  ILGDQKDEVSWALLYR-KIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLL 127
           I+GD   E  + +  + ++ +      P      + L++V ++ ++ L   +   ++ ++
Sbjct: 115 IIGDDTTENGYPITAKIEVVDTKNTISPKLIAHTIPLNNVKINGNNRLTSNRDLAIKEII 174

Query: 128 MLDVDSLVWSFRKTASLPTPGKAYG-GWENPISELRGHFVGHYLSASAQMWAS----THN 182
             DV   ++++R T  L T G     GW++P ++L+GH  GHY+SA A  +A+    +H 
Sbjct: 175 SWDVSQQLYNYRDTYGLSTEGYTRSDGWDSPETKLKGHGSGHYMSALALAYAAATNPSHK 234

Query: 183 ATIKEKMSTVVFSLSECQNKI--------------------------------------- 203
             ++  ++ +V  L ECQ +                                        
Sbjct: 235 EILRRNITRMVNELRECQERTFVWSEELGRYLEARDFAPEEELKKMKGTWEAFDEHKTKW 294

Query: 204 ---GTGYLSAFP------TELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNA----Q 250
              G GYL+A P       E++ ++     VWAPYY+IHK LAGL+D     D+     +
Sbjct: 295 ATYGYGYLNAIPPHHPALIEMYRAYNNSDWVWAPYYSIHKQLAGLIDIATYMDDKSIADK 354

Query: 251 ALKMATWMVEYFYNR------VQKVITMYSVERHWYSLNE--------ETGGMNDVLYRL 296
           AL +A  M  + +NR      V+K  T      H  +  E        E GGM + L RL
Sbjct: 355 ALLIAKDMGLWVWNRMHYRTYVKKDGTQEERRTHPGNRYEMWNMYIAGEVGGMGESLARL 414

Query: 297 YSITHDPKH----LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
             +   P+     +  ++ FD P F   L+   D + + HAN HIP++IG+   Y    D
Sbjct: 415 SEMVSAPEEKARLIEASNCFDSPAFYEPLSKNIDDIRNRHANQHIPMIIGALRSYLSNND 474

Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG----SENE--------ET 400
             Y  +   F +++   + Y+TGG    E +  P     ++     SE E        ET
Sbjct: 475 TFYYHVSHNFWNLIQGRYRYSTGGVGNGEMFRQPYTQIVSMAMNGVSEGESHSNPHINET 534

Query: 401 CTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
           C  YN+LK+++ L  +  + A Y DYYER L N ++      E     Y   +G   SK 
Sbjct: 535 CCAYNLLKLTKDLNCFNPDDARYMDYYERTLYNQIIG-SLHPEHYQTTYQYAVGLNASKP 593

Query: 460 RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
                WG +     CC GTG E+  K  ++ YF  +     L++  Y+ ++  W+  ++ 
Sbjct: 594 -----WGNETPQSTCCGGTGSENHVKYQEATYFVSDNT---LWVALYMPTTLHWEEKNIT 645

Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL--P 577
           L Q+      W P    T+  ++ +      ++ LR+P W  ++G    LNG ++     
Sbjct: 646 LQQE----CLW-PAKSSTIKVTAGE---ARFAMKLRVPYWA-TDGFDVKLNGISIATHYQ 696

Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP-----------EYASIQAILFGPY 626
           P    +  T +W  ND + I +P +   +   D  P           E A +  ++ GP+
Sbjct: 697 PCSYAVIPTRQWKENDIVEITMPFTKHIDYGPDKLPAEIASKDGHQLETAWVGTLMHGPF 756

Query: 627 LLAGHTSGEWDIKTGTARSLSALISPIPPS 656
            +       W   T    S  A I+ + P+
Sbjct: 757 AMTATDITNWTEATLNIDSRLASITVVEPN 786


>gi|297606173|ref|NP_001058068.2| Os06g0613000 [Oryza sativa Japonica Group]
 gi|255677225|dbj|BAF19982.2| Os06g0613000, partial [Oryza sativa Japonica Group]
          Length = 279

 Score =  171 bits (433), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 115/281 (40%), Positives = 149/281 (53%), Gaps = 43/281 (15%)

Query: 610 DDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPI---------------- 653
           DDRPEY+SIQA+LFGP+LLAG T G   +KT +  S S L   +                
Sbjct: 4   DDRPEYSSIQAVLFGPHLLAGLTHGNQTVKT-SNDSNSGLTPGVWEVNATHAAAAVAVWV 62

Query: 654 ---PPSFNAQLVTFTQESGNS----TFVMSNS--NQSITMEEFPVSGTDAALHATFRLIL 704
                S N+QLVT TQ  G++     FV+S S  + ++TM+E PV+G+DA +HATFR   
Sbjct: 63  TPVSQSLNSQLVTLTQRDGDAQAAAAFVLSVSIADGALTMQESPVAGSDACVHATFRAYH 122

Query: 705 KDASLSNFSSLNNVI-GKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGL 763
             +  S   +    + G+ V LEPFD PGM V     D L V    +   ++ F  VAGL
Sbjct: 123 SPSGASAIDAATGRLQGRDVALEPFDRPGMAV----TDALSVG---RPGPATRFNAVAGL 175

Query: 764 DKRNETVSLEAENRKGCFVSSGVN-FEPGASLKLLCSTESL--------DAGFNRAASFM 814
           D    TVSLE   R GCFV++    +  GA  ++ C   +         D  F RAASF 
Sbjct: 176 DGLPGTVSLELATRPGCFVAAPTTAYLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASFT 235

Query: 815 MEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
               +  YHP+SF A G  RNFLL PL S +DE YTVYFN+
Sbjct: 236 QAAPLRLYHPLSFSATGTDRNFLLEPLQSLQDEFYTVYFNV 276


>gi|402081502|gb|EJT76647.1| acetyl-CoA carboxylase [Gaeumannomyces graminis var. tritici
           R3-111a-1]
          Length = 1032

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 138/491 (28%), Positives = 206/491 (41%), Gaps = 76/491 (15%)

Query: 206 GYLSAFPTELFDSF----------EALKPVWAPYYTIHKILAGLLDQYVLADNAQAL--- 252
           GYL A P +               +A    WAP+YT HKI+ GLLD Y   +N QAL   
Sbjct: 404 GYLGALPEDTVLRLGPPRWAIYGGDAATNTWAPWYTQHKIMRGLLDAYYNTNNTQALDVV 463

Query: 253 -KMATW------MVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPK 304
            KMA W      + +  Y      +T   + R W   +  E+GG N+V   LY +T D +
Sbjct: 464 VKMADWAHLALTIGDKNYPGYTGNLTRDDLNRMWDLYIAGESGGANEVFPELYELTGDSR 523

Query: 305 HLLLAHLFDKPCFL--------GFLALQAD------YLSHFHANTHIPIVIGSQMRYEVT 350
           HL  A  FD    L          L L  D           HAN H+P  IG    +E +
Sbjct: 524 HLETAKAFDNRASLFDAAVEDRDILVLTRDKNPGPRRTDRLHANMHVPQFIGYLRIFEQS 583

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEETCT 402
            +  Y      F   V     +A+GGT           E + +   +A+ +     ETCT
Sbjct: 584 REQDYLDAARNFYSWVFPHRQFASGGTGGNYPGSNNNAEMFQNRGNIANAIAENGAETCT 643

Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV---MIYMLPLGRGVSKA 459
           TYNMLK++R+LF       Y D YER L N +   +  T       + Y  PL  G S+ 
Sbjct: 644 TYNMLKLARNLFMHEHNATYMDGYERGLFNMIAGSRADTATTADPQLTYFQPLTPGASRD 703

Query: 460 RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
                     N+  CC G+G+ES +K  +++Y     +   L++  ++ S+  W  G   
Sbjct: 704 YG--------NTGTCCGGSGLESHTKYQETVYLRSA-DGSALWVNLFVPSTLTW--GEKA 752

Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP---L 576
            + + D         ++T+T +     G    + LR+P W        ++NG+  P    
Sbjct: 753 FSLRQDTAFPRADSTKLTVTAAGG---GGPLDIKLRVPAWAQRGTVTVTVNGEADPAAQT 809

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL-------- 628
           P PG +L+    W   D + +++P  +R E    DRP+    QA++ GP LL        
Sbjct: 810 PLPGTYLTLARAWRAGDTIEMRMPFRVRVERAP-DRPD---TQALMRGPVLLQIVGRPPA 865

Query: 629 -AGHTSGEWDI 638
             G  SG W++
Sbjct: 866 TGGANSGYWEL 876



 Score = 47.0 bits (110), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 48/107 (44%), Gaps = 9/107 (8%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY--GGWEN 156
           L +V L D       +L   +    ++L   D    +  F K A  P+ G     GGWE+
Sbjct: 50  LDQVRLGD------GLLQEKRDRTKDFLREFDERRFLVLFNKQAGRPSAGGVAVPGGWED 103

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
               L GH+ GHY++A +Q +A       K K+  +V  L+ CQ  I
Sbjct: 104 G-GLLSGHWAGHYMTALSQAYADQGEEVFKAKLDWMVQELAACQKAI 149


>gi|357472937|ref|XP_003606753.1| hypothetical protein MTR_4g065240 [Medicago truncatula]
 gi|355507808|gb|AES88950.1| hypothetical protein MTR_4g065240 [Medicago truncatula]
          Length = 184

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 91/176 (51%), Positives = 112/176 (63%), Gaps = 6/176 (3%)

Query: 3   FGFVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVL---SHFHLTPTDD 58
           F +V       G A  K+C N  P  SH  R EL  S N+TWK+EV+   SH H+TP+D+
Sbjct: 4   FVYVFLALILCGCANSKECINNLP-QSHTLRTELMASKNETWKKEVMMYQSHVHVTPSDE 62

Query: 59  SAWSSLIPSKI-LGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWR 117
           SAW  +IP ++ L  +K  V   L  R++KN      P  FLKEV L DV L + S+  +
Sbjct: 63  SAWQEMIPKEMFLTQEKPNVIGLLSNREMKNADVSKPPVGFLKEVPLGDVRLLEGSIHAQ 122

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSAS 173
           AQ+TNLEYLLMLDVD L+WSFRK A LPTPG  YGGWE P  ELRGHFVG  +SA+
Sbjct: 123 AQKTNLEYLLMLDVDRLIWSFRKMAGLPTPGAPYGGWEKPDQELRGHFVGCNVSAT 178


>gi|433651701|ref|YP_007278080.1| hypothetical protein Prede_0695 [Prevotella dentalis DSM 3688]
 gi|433302234|gb|AGB28050.1| hypothetical protein Prede_0695 [Prevotella dentalis DSM 3688]
          Length = 1032

 Score =  167 bits (422), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 166/579 (28%), Positives = 261/579 (45%), Gaps = 78/579 (13%)

Query: 95  PGNF-LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--- 150
           P +F L EV+L D      S    A + N + LL  D D L+  F + A L T   A   
Sbjct: 22  PHHFDLSEVTLFD------SPFKTAMELNFKVLLDYDADRLLAPFVRQAGLNTGDYAGWQ 75

Query: 151 -----YGGWENPISELRGHFVGHYLSASAQMWASTHNA----TIKEKMSTVVFSLSECQN 201
                +  W     +L GH  GHYLSA A  +A+  +A     +K+++  ++  L +CQ+
Sbjct: 76  TLHPNFANWGGNGFDLSGHVGGHYLSALALAYAACRDAGMKARLKQRLEYMLKVLKDCQD 135

Query: 202 KIG------TGYLSAFP-----TELF----DSFEALKPVWAPYYTIHKILAGLLDQYVLA 246
                     G++   P      +L+      F +++  W P+Y  HK+LAGL D YV A
Sbjct: 136 AYDGNTEGLRGFIGGQPINEAWKKLYAGDVSGFRSVRG-WVPFYCQHKVLAGLRDAYVYA 194

Query: 247 DNAQALKMATWMVEYFYNRVQKV--ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPK 304
            N +A +M   + ++  N V ++    M SV      L+ E GGMN+ L   Y++  D K
Sbjct: 195 GNKEAREMFRKLADWSVNVVARLDNAAMQSV------LDTEHGGMNESLADAYTLFGDQK 248

Query: 305 HLLLAHLFDKPCFLGFLALQ-ADYLSHFHANTHIPIVIGSQMRYEVTGDPL---YKLIGT 360
           ++  A  +     L  + +Q A +L + HANT +P  IG +   E  G  L   Y+L   
Sbjct: 249 YMDAAQKYSHQTMLNGMQMQNATFLDNRHANTQVPKYIGFERIGEQGGSELQKKYELAAG 308

Query: 361 FFMDIVNASHSYATGGTSAREFWW---DPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
            F + V  + +   GG S  E +    +  R  D L  +  E+C + NMLK+S  L   T
Sbjct: 309 NFWNDVALNRTVCIGGNSVAEHFLSAANSHRYIDHL--DGPESCNSNNMLKLSEMLSDNT 366

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            +  YAD+YE    N +LS Q   + G  +Y   L     + +    +       WCC G
Sbjct: 367 HDARYADFYEYTTWNHILSTQD-PKTGGYVYFTTL-----RPQGYRIYSQVNQGMWCCVG 420

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           TG+E+ SK G  +Y  +  +V  +Y+  + +S     +    L Q+      ++P  R+T
Sbjct: 421 TGMENHSKYGHFVYTHDGDSV--IYVNLFTASKL--ANAKFALTQQT--AYPYEPQTRIT 474

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL---PPPGNFLSATERWSYNDK 594
           +      + G   +L +R P WT + G    +NG+   +   P    +   T +W   D 
Sbjct: 475 I------DKGGSYTLAVRHPWWT-TEGYAILVNGEKQQVAVTPGKAGYARLTRKWKRGDV 527

Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           +T+ LP+ LRT       P Y    A  +GP LLA  T+
Sbjct: 528 VTVALPMQLRTVEC----PNYTDYVAFEYGPLLLAAQTT 562


>gi|340345934|ref|ZP_08669064.1| acetyl-CoA carboxylase [Prevotella dentalis DSM 3688]
 gi|339612921|gb|EGQ17717.1| acetyl-CoA carboxylase [Prevotella dentalis DSM 3688]
          Length = 1039

 Score =  167 bits (422), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 166/579 (28%), Positives = 261/579 (45%), Gaps = 78/579 (13%)

Query: 95  PGNF-LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--- 150
           P +F L EV+L D      S    A + N + LL  D D L+  F + A L T   A   
Sbjct: 29  PHHFDLSEVTLFD------SPFKTAMELNFKVLLDYDADRLLAPFVRQAGLNTGDYAGWQ 82

Query: 151 -----YGGWENPISELRGHFVGHYLSASAQMWASTHNA----TIKEKMSTVVFSLSECQN 201
                +  W     +L GH  GHYLSA A  +A+  +A     +K+++  ++  L +CQ+
Sbjct: 83  TLHPNFANWGGNGFDLSGHVGGHYLSALALAYAACRDAGMKARLKQRLEYMLKVLKDCQD 142

Query: 202 KIG------TGYLSAFP-----TELF----DSFEALKPVWAPYYTIHKILAGLLDQYVLA 246
                     G++   P      +L+      F +++  W P+Y  HK+LAGL D YV A
Sbjct: 143 AYDGNTEGLRGFIGGQPINEAWKKLYAGDVSGFRSVRG-WVPFYCQHKVLAGLRDAYVYA 201

Query: 247 DNAQALKMATWMVEYFYNRVQKV--ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPK 304
            N +A +M   + ++  N V ++    M SV      L+ E GGMN+ L   Y++  D K
Sbjct: 202 GNKEAREMFRKLADWSVNVVARLDNAAMQSV------LDTEHGGMNESLADAYTLFGDQK 255

Query: 305 HLLLAHLFDKPCFLGFLALQ-ADYLSHFHANTHIPIVIGSQMRYEVTGDPL---YKLIGT 360
           ++  A  +     L  + +Q A +L + HANT +P  IG +   E  G  L   Y+L   
Sbjct: 256 YMDAAQKYSHQTMLNGMQMQNATFLDNRHANTQVPKYIGFERIGEQGGSELQKKYELAAG 315

Query: 361 FFMDIVNASHSYATGGTSAREFWW---DPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
            F + V  + +   GG S  E +    +  R  D L  +  E+C + NMLK+S  L   T
Sbjct: 316 NFWNDVALNRTVCIGGNSVAEHFLSAANSHRYIDHL--DGPESCNSNNMLKLSEMLSDNT 373

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
            +  YAD+YE    N +LS Q   + G  +Y   L     + +    +       WCC G
Sbjct: 374 HDARYADFYEYTTWNHILSTQD-PKTGGYVYFTTL-----RPQGYRIYSQVNQGMWCCVG 427

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
           TG+E+ SK G  +Y  +  +V  +Y+  + +S     +    L Q+      ++P  R+T
Sbjct: 428 TGMENHSKYGHFVYTHDGDSV--IYVNLFTASKL--ANAKFALTQQT--AYPYEPQTRIT 481

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL---PPPGNFLSATERWSYNDK 594
           +      + G   +L +R P WT + G    +NG+   +   P    +   T +W   D 
Sbjct: 482 I------DKGGSYTLAVRHPWWT-TEGYAILVNGEKQQVAVTPGKAGYARLTRKWKRGDV 534

Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           +T+ LP+ LRT       P Y    A  +GP LLA  T+
Sbjct: 535 VTVALPMQLRTVEC----PNYTDYVAFEYGPLLLAAQTT 569


>gi|256831608|ref|YP_003160335.1| hypothetical protein Jden_0363 [Jonesia denitrificans DSM 20603]
 gi|256685139|gb|ACV08032.1| protein of unknown function DUF1680 [Jonesia denitrificans DSM
           20603]
          Length = 744

 Score =  166 bits (420), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 152/540 (28%), Positives = 234/540 (43%), Gaps = 62/540 (11%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLS--ASAQM 176
           + T L+Y L LD   LV  +R+ + LP    +YG WEN  S L GH +GH LS  A A +
Sbjct: 20  RNTALDYTLALDPQRLVAPYRRESGLPLLAPSYGNWEN--SGLDGHTLGHVLSALAYASV 77

Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL------------FDSFEALKP 224
             +  +A  +E++  +V  + ECQ  +GTGY+   P                DSF  L  
Sbjct: 78  THTPRSAEARERLEWLVAQVQECQAAVGTGYVGGIPQGRALWERIGNGDVDADSF-GLHG 136

Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
            W P+Y +HK+ AGL+D   +A  A A  +   +  ++     +V      E+    L  
Sbjct: 137 AWVPWYNLHKVFAGLVDAGWVAGVAVARDVVVGLANWWL----RVAARLRDEQFQAMLVT 192

Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
           E G +N     L   T D ++L +A  F        L    D L   HANT I   +G  
Sbjct: 193 EFGAINGAFADLAVHTGDARYLEMAKRFTDRALFDALVAGEDPLVGLHANTQIAKALGWA 252

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW-WDPKRLADTLGSENEETCTT 403
                 G   Y +      D+V   H+ + GG S RE    DP   A  +  +  E+C T
Sbjct: 253 RVALAGGGREYLVAARRVWDVVVRDHTLSFGGNSVREHCAGDP--WAPFVSEQGPESCNT 310

Query: 404 YNMLKVSRHLFRWTKEI-AYADYYERALTNGVLSIQRGTEP-GVMIYMLPLGRGVSKARS 461
           +NML+++  L    +      D+ E AL N V+S      P G  +Y  P       AR 
Sbjct: 311 HNMLRLTGALLELGESPRPLVDFVEVALMNHVVS---SVHPEGGFVYFTP-------ARP 360

Query: 462 TH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
            H   +      FWCC GTG+E   K G+ +Y  +     GL++   ++S  +W S  V 
Sbjct: 361 QHYRVYSQVHECFWCCVGTGMEHLMKNGELVYSPD---ATGLFVHLGVASVGEWASRGVR 417

Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS------NGAQASLNGQN 573
           + Q   P    D  + + +    + E G+  ++++R+P W         N A  S   ++
Sbjct: 418 VRQ---PWTLDDAGITVGIDAVGQGE-GEF-AIHVRVPGWVDGPVTVRVNDAVISTRVEH 472

Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
                   +++ T  WS  D+L + LP +LR      + P + S Q    GP++LA   +
Sbjct: 473 ------SGYVTVTRVWSAGDRLDVSLPATLRLRPAPRNAP-FVSFQK---GPWVLAARAT 522


>gi|257068350|ref|YP_003154605.1| hypothetical protein Bfae_11690 [Brachybacterium faecium DSM 4810]
 gi|256559168|gb|ACU85015.1| uncharacterized conserved protein [Brachybacterium faecium DSM
           4810]
          Length = 752

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 149/538 (27%), Positives = 225/538 (41%), Gaps = 52/538 (9%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
           AQ+T+LEYLL L+ + L+  FR+ A + T    YG WE+    L GH  GH L+A++ MW
Sbjct: 25  AQRTDLEYLLGLEAERLLAPFRREAGIATTAAPYGNWES--MGLDGHIGGHALAAASLMW 82

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVW 226
           A+T +    E    +V  L ECQ ++GTGY+   P   EL+              L   W
Sbjct: 83  AATGDERAAELARQLVEGLRECQARLGTGYVGGIPGGAELWAQIRTIASQAQTWDLGGAW 142

Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
            P+Y +HK  AGL++    A    A   A  ++    +   ++      E     L  E 
Sbjct: 143 VPWYNLHKTFAGLIEAVRHAPAGTA-SCALEVLRGLGDWGARLGEQLDDEAFARMLRTEF 201

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
           GGM      L  IT + +H  +A  F     L  L    D L   HANT I  VIG    
Sbjct: 202 GGMCAAYADLAEITGEERHARMARRFADESLLAPLRAGRDELDGMHANTQIAKVIGWPAL 261

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTS-AREFWWDPKRLADTLGSENEETCTTYN 405
            E             F+  V    + A GG S A  F  +P  LA     E  E+C T N
Sbjct: 262 GETAA-------AETFVRTVLERRTLAFGGNSVAEHFTAEP--LAHVTDREGPESCNTVN 312

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
           ML+  + L+         D  ER L   VLS Q     G  +Y  P   G  +  S    
Sbjct: 313 MLEAEQRLYEHGGGPWLFDAIERQLVGHVLSAQH--PEGGFVYFTPARPGHYRVYS---- 366

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
            T+ N  WCC GTG+E +++ G   +  + G+   L +   + +S  W+   +  +    
Sbjct: 367 -TRENGMWCCVGTGLEVYARTGRFTFAAQGGD---LLVNLPLPASLRWEEQGIAAHLD-- 420

Query: 526 PIVSWDPYLR----MTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP-G 580
                 PY R      +T   + +     ++++R+P W  +     S++GQ++       
Sbjct: 421 -----SPYPRPAPETPVTLRIEADAPSDVAVHVRVPAWA-TTPPTVSVDGQDVTAHAELD 474

Query: 581 NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDI 638
            +++   RW   + L   L      E +    P   S  ++ +GP +LA    GE D+
Sbjct: 475 GYVTVRRRWQGGEVLRWTLHAGPSWEPL----PGEDSWGSLRWGPVVLAAR-DGEEDL 527


>gi|82523843|emb|CAI78585.1| hypothetical protein [uncultured candidate division OP8 bacterium]
          Length = 766

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 113/385 (29%), Positives = 169/385 (43%), Gaps = 72/385 (18%)

Query: 98  FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY--GGWE 155
            L  V L+     + ++  + +   L  L  ++ D+ +++FR    LP P  A   GGW+
Sbjct: 378 LLGRVVLNRDAAGRETLFMKNRDKFLSTLAEVNPDNFLYNFRDAFGLPQPEGAVQLGGWD 437

Query: 156 NPISELRGHFVGHYLSASAQMWA-----STHNATIKEKMSTVVFSLSECQNKIG------ 204
           +  + LRGH  GHYLSA AQ +A     S   A   +KM+ ++ +L +   K G      
Sbjct: 438 DQTTRLRGHASGHYLSALAQAYAGSVYDSALQANFLQKMNYMIDTLYDLAQKSGRPVESG 497

Query: 205 ------------------------------------TGYLSAFPTELFDSFE-------A 221
                                                G++SA+P + F   E        
Sbjct: 498 GLCNPDPTTVPSGPGKSGYDSDLSQKGLRHDYWNWGVGFISAYPPDQFIMLEQGATYGGT 557

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
              +WAPYYT+HKILAGLLD Y +  N +AL++A  M  +   R+Q V     +      
Sbjct: 558 NAQIWAPYYTLHKILAGLLDCYEVGGNPKALQIAEGMGGWALKRLQAVPEATRIAMWSRY 617

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL-------GFLALQADYLSHFHAN 334
           +  E GGMN+V+ RL+ +T     L  A LFD   F          LA   D +   HAN
Sbjct: 618 IAGEYGGMNEVMARLFRLTGKRDFLACAKLFDNTNFFFGNAGREHGLAKNVDTVRGRHAN 677

Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGT-------SAREFWWDPK 387
            HIP +IG+   Y  +G+P+Y  I   F +I    + Y  GG        +A  F  +P 
Sbjct: 678 QHIPQIIGTLETYRGSGEPVYHEIAENFWEIARNHYMYNIGGVGGAKNPRNAECFTAEPD 737

Query: 388 -RLADTLGSENE-ETCTTYNMLKVS 410
            + A+    + + ETC TYN+LK +
Sbjct: 738 TQFANGFSMDGQNETCATYNLLKCA 762


>gi|225351247|ref|ZP_03742270.1| hypothetical protein BIFPSEUDO_02839 [Bifidobacterium
           pseudocatenulatum DSM 20438 = JCM 1200]
 gi|225158703|gb|EEG71945.1| hypothetical protein BIFPSEUDO_02839 [Bifidobacterium
           pseudocatenulatum DSM 20438 = JCM 1200]
          Length = 853

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 154/571 (26%), Positives = 230/571 (40%), Gaps = 90/571 (15%)

Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISE--------LRGHFVGHY 169
           AQQ    YLL LDVD L++ FR+ A LP P  A G   NP++         L GH  GHY
Sbjct: 24  AQQAGARYLLDLDVDRLLYPFRREAGLPQPTDADG---NPVTSYPNWEETGLDGHIAGHY 80

Query: 170 LSASAQMWASTHNAT-IKEKMSTVVFSLSECQ-----NKIGTGYLSAFPTE--LFDSFEA 221
           LSA         +     ++ +TVV S  ECQ     + +  GY+   P    +F    A
Sbjct: 81  LSACVGFAQVADDPQPFIDRAATVVRSWHECQQSFAGDAVMRGYVGGVPDSRTVFGRLAA 140

Query: 222 ---------LKPVWAPYYTIHKILAGLLDQY-----VLADNAQALKMATWMVEYFYNRVQ 267
                    +   W P Y +HK  AGLLD +     +    +Q  +     +  ++ R+ 
Sbjct: 141 GDVESQNFSMNDAWVPMYNVHKTFAGLLDTWADFASIDEQTSQLARTVVLDLADWWCRIA 200

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
           + +   + +R    L  E GGM +    LY+ T + ++ ++A  F        LA   D 
Sbjct: 201 EPLDDETFDR---ILVSEFGGMCESFAELYARTGEERYHVMADRFKDHAIFDPLAQGEDV 257

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
           L+  HANT IP V+G +    +  D         F D V    S + G  S  E +    
Sbjct: 258 LTGMHANTQIPKVLGWERLGAICNDEQADAATNTFWDSVVHHRSVSIGAHSVSEHFHPTD 317

Query: 388 RLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
             +  + S E  ETC +YNM K++  L+  +    Y ++YER L N +LS     +PG  
Sbjct: 318 DFSSMIESREGPETCNSYNMSKLAERLWLRSGSADYINFYERVLENHLLSTINPKQPG-F 376

Query: 447 IYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIY------------- 491
           +Y  P+       RS H   + T    FWCC G+G+E+ ++ G  IY             
Sbjct: 377 VYFTPM-------RSQHYRAYSTPQECFWCCVGSGLENHARYGRLIYALQRPAAQDSADS 429

Query: 492 --------FEEEGNVPG---------LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
                     E GN            L +  YI S+FD     + + Q+   I     Y 
Sbjct: 430 AAAGFASSAAETGNTVSNNAEAEATRLLVNLYIDSTFDCPEQGLRITQRAARIEDGVDYT 489

Query: 535 RMTLTFSSKQE-----VGQL--SSLNLRMPVWTYSNGAQASLNGQNLPLPP-----PGNF 582
            +T T  S  E      G L  ++L LR P W    G   +        P      P  +
Sbjct: 490 -VTFTLESTAEHVPDTPGGLRETTLFLRRPWWAEHYGVMEATCAVCTLDPARTNDIPEGY 548

Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
           L    RW+   ++ ++L   +  E + D  P
Sbjct: 549 LPLRLRWNGVAEVVMRLRPRITVERMPDGSP 579


>gi|297725075|ref|NP_001174901.1| Os06g0612950 [Oryza sativa Japonica Group]
 gi|255677224|dbj|BAH93629.1| Os06g0612950 [Oryza sativa Japonica Group]
          Length = 198

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 82/168 (48%), Positives = 98/168 (58%), Gaps = 24/168 (14%)

Query: 19  KQCTN-QSPYDSHAFRYELTSTNKT---WKEEVLSHFHLTPTDDSAWSSLIPSKILGDQK 74
           K+CTN  +   SH  R  L S++     W+EE     HL PTD++AW  L+P  +     
Sbjct: 23  KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMP--LAAASA 80

Query: 75  DEVSWALLYRKIKNPGGFDLPGN-----------FLKEVSLHDVWLDQSS----VLWRAQ 119
            E  WA+LYR +K   G  + G+           FL+EVSLHDV LD       V  RAQ
Sbjct: 81  SEFDWAMLYRSLK---GAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQ 137

Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG 167
           QTNLEYLL+L+VD LVWSFR  A LP PGK YGGWE P  ELRGHFVG
Sbjct: 138 QTNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVG 185


>gi|302547294|ref|ZP_07299636.1| putative secreted protein [Streptomyces hygroscopicus ATCC 53653]
 gi|302464912|gb|EFL28005.1| putative secreted protein [Streptomyces himastatinicus ATCC 53653]
          Length = 740

 Score =  135 bits (340), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 97/294 (32%), Positives = 142/294 (48%), Gaps = 28/294 (9%)

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
           G+  Y      F  +V     Y+ GGT   E +     +A TL  +N ETC TYNMLK+S
Sbjct: 337 GETAYAAAARNFWGMVAGPRMYSLGGTGQGEMFRARNAIAATLDGKNAETCATYNMLKLS 396

Query: 411 RHLFRWTKEIAYADYYERALTNGVLSIQRG----TEPGVMIYMLPLGRGVSKARSTHGWG 466
           R LF    + AY DYYER LTN +L+ +R     T P V  Y + +G GV +        
Sbjct: 397 RQLFFREPDAAYMDYYERGLTNHILASRRDAPSTTSPEV-TYFVGMGPGVRREYD----- 450

Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDP 526
              N+  CC GTG+E+ +K  DS+YF        LY+   ++S+  W     V+ Q  D 
Sbjct: 451 ---NTGTCCGGTGMENHTKYQDSVYFRSADGT-ALYVNLALASTLRWPERGFVIEQTGD- 505

Query: 527 IVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG-QNLPLPPPGNFLSA 585
              +      TLTF   +E G    + LR+P W  + G   ++NG +      PG++L+ 
Sbjct: 506 ---YPAEGVRTLTF---REGGGRLEVKLRVPAWA-TGGFTVTVNGVRQRGKAVPGSYLTL 558

Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIK 639
           +  W   D++ I  P  LR E   DD     ++Q++ +GP LL    SGE + +
Sbjct: 559 SRDWRRGDRIRISAPYRLRIERALDD----PAVQSVFYGPVLLVAR-SGETEFR 607


>gi|427409221|ref|ZP_18899423.1| hypothetical protein HMPREF9718_01897 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425711354|gb|EKU74369.1| hypothetical protein HMPREF9718_01897 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 616

 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 146/572 (25%), Positives = 236/572 (41%), Gaps = 78/572 (13%)

Query: 125 YLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNAT 184
           + L LD D ++  FR+ A LP PG   GGW +    + G   G Y+S  A++ A+T +  
Sbjct: 82  HYLALDNDRVLKVFRQQAGLPAPGPDMGGWYDRDGFVPGLAFGQYMSGLARIGATTGDKA 141

Query: 185 IKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYV 244
           +  K++ +V    E   K    Y  A P          +  WA  YT+ K + GL+D Y 
Sbjct: 142 VHAKVAALVQGFGEFITKTRNPY--AGPKA--------QDQWAA-YTMDKYVVGLIDAYR 190

Query: 245 LADNAQALKMATWMVEYF--------YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRL 296
           L+   QA  +    +E           +R+ KV   Y          +ET  +++ L+ +
Sbjct: 191 LSGVEQAKTLLPITIEKCRPYISPVSRDRIGKVDPPY----------DETYVLSENLFHV 240

Query: 297 YSITHDPKHLLLA--HLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL 354
             IT   K+  +A  +L +K  F    A Q D L   HA +H   +      Y   GD  
Sbjct: 241 ADITGQDKYRQMAIHYLLNKEWFDPLAAGQ-DVLPTKHAYSHTIALSSGAQAYLHLGDEK 299

Query: 355 YKLIGTFFMDIVNA-----SHSYATGGTSAREFWWD--PKRLADTLGSEN---EETCTTY 404
           Y+        +VNA        +A+GG    E + +    +LA +L S     E  C ++
Sbjct: 300 YRKA------LVNAWTYMEPQRFASGGWGPEEQFVELHQGKLAASLKSSKAHFETPCGSF 353

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
             +K++R+L R+T E  Y D  ER L N +L+ +     G   Y    G    K      
Sbjct: 354 ADMKLARYLVRFTGEPVYGDGLERTLYNTMLATRLPDSDGGYPYYSNYGAAAEKLYYHQK 413

Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK--SGHVVLNQ 522
           W        CC GT ++  +    ++YF ++     L +  +  S+  W    G V + Q
Sbjct: 414 WP-------CCSGTLVQGVADYVLNLYFHDDN---ALVVNMFAPSTVKWDRPGGAVQVEQ 463

Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNF 582
           + +     +   R+T+T           ++ LR+P W  + GAQ  +NG    +  PG  
Sbjct: 464 QTN--YPAEDTTRLTVTAPGNGRF----AMKLRIPAW--AKGAQLRVNGAAQGV-QPGTL 514

Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGT 642
                 W   D + + LP +LRT +I D  P+   I A++ G  +  G     W      
Sbjct: 515 AVIDRTWKAGDMVELTLPQALRTLSIDDKNPD---IAAVMRGAVMYVGLNP--WTGVEDQ 569

Query: 643 ARSLSALISPIPPSFNAQLVTFTQESGNSTFV 674
             +L A + P+P S     + +  E+G    V
Sbjct: 570 PLALPASLKPVPGSS----LNYAMETGGRNLV 597


>gi|336429869|ref|ZP_08609826.1| hypothetical protein HMPREF0994_05832 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336001322|gb|EGN31460.1| hypothetical protein HMPREF0994_05832 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 606

 Score =  126 bits (317), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 161/619 (26%), Positives = 248/619 (40%), Gaps = 111/619 (17%)

Query: 99  LKEVSLHDVWLDQSSVLW-RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW-EN 156
           LK+    +V L  S  LW R ++   E  L +  DSL++ FR  A L  PG+   GW  N
Sbjct: 4   LKDFRYRNVELKNS--LWERQRRETAETYLAIPNDSLLYYFRTLAGLEAPGEGLTGWYGN 61

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
             S       G  L A A+++A T +  +KEK       L+E     G G  +A   ++F
Sbjct: 62  GASTF-----GQKLGAFAKLYAVTGDYRLKEK----AVYLAE-----GWGKCAAANKKVF 107

Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
           D  +         Y   K+L G LD Y      + L   + + +    R ++ I    ++
Sbjct: 108 DCNDT--------YVYEKLLGGFLDMYENLGYEKGLAYCSGLTDSAAARFKRDIPRDGLQ 159

Query: 277 R---------HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
                      WY+L E        LYR Y +T + K+L  A  +D       L  +   
Sbjct: 160 GPELCENNMIEWYTLPEN-------LYRAYQLTGEQKYLDFAQEWDYTYLWDKLNNKDSA 212

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFW--- 383
           +   HA + +  +  + M YEVTG   Y   I   + +I    H+YATGG    E     
Sbjct: 213 IGPRHAYSQVNSLSSAAMAYEVTGKKYYLDAIENGYTEITE-RHTYATGGYGPAECLFAE 271

Query: 384 ------------WDPKRLA--------------DTLGSENEETCTTYNMLKVSRHLFRWT 417
                       WDP R +              D  GS  E +C  + + K+  +L R T
Sbjct: 272 EEGFLGEMLKDSWDPTRKSPVYRNFGGGLVGRNDNWGS-CEVSCCAWAVFKICNYLLRIT 330

Query: 418 KEIAYADYYERALTNGVLSIQRGTEPG-VMIYMLPLGRGVSKA---RSTHGWGTKFNSFW 473
            +  Y  + E+ L NGV         G VM Y      G  K+   R   G G  F  + 
Sbjct: 331 GKAKYGAWAEQMLINGVAGQPPIDSQGHVMYYADYFVDGAVKSVQDRRLQGNGANF-EWQ 389

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS--SFDWKSGHVVLNQKVDPIVSWD 531
           CC GT  +  ++  + +Y+ +E    G+Y+ QY+ S   F  +    VL    +  VS  
Sbjct: 390 CCTGTFPQDVAEYANMLYYTDE---EGIYVSQYMKSRAEFTIRGEKAVLENCSEEDVS-- 444

Query: 532 PYLRMTLTFSSKQEVGQLS-SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATER-W 589
           P  R  +     Q  G+L   ++ R+P W      +  +NG++  L P  +  +  ER W
Sbjct: 445 PIRRFRI-----QTRGELPFRISFRIPHWAKGEN-RILVNGEDSGLEPLPDSWAVLERVW 498

Query: 590 SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG----------HTSGEW--- 636
             +D +T+  P SL  + + +   +   I A++FGP +LA               EW   
Sbjct: 499 QEDDVITVTCPFSLAFKPVDEKNKD---IAALMFGPVVLAADKMTLFDGDMEKPEEWITC 555

Query: 637 -DIKTGTARSLSALISPIP 654
            D K    R+L   + P P
Sbjct: 556 VDEKEMLFRTLPGHVCPYP 574


>gi|310794204|gb|EFQ29665.1| hypothetical protein GLRG_04809 [Glomerella graminicola M1.001]
          Length = 436

 Score =  125 bits (314), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 149/348 (42%), Gaps = 49/348 (14%)

Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTP-GKAYGGWENPISELRGHFVGHYLSASAQMW 177
           Q   L YL  +DVD L++ FRK   L T   +   GW+ P    R H  GH+L+A A  +
Sbjct: 59  QARTLVYLKWIDVDRLLYVFRKNHGLYTNNAQPNAGWDAPDFPFRSHVQGHFLNAWAFCY 118

Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
           A   ++  K + +     L +CQ+                          PYY IHK +A
Sbjct: 119 AQLQDSECKRRATYFAAELKKCQH------------------NNTNSRNVPYYAIHKTMA 160

Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
           GLLD + L  +  A  +   M  +   R  K+    + ++    +    GGMN+VL  L 
Sbjct: 161 GLLDVWRLIGDTNARDVLLAMAAWVDLRTGKL----TYQQMQDMMGTVFGGMNEVLADLC 216

Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
             T D + + +A  FD       LA   D LS  HANT                    + 
Sbjct: 217 RQTGDQRWVTVAQRFDHAAIFNPLASNQDSLSGLHANT--------------------QD 256

Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
           I     +I  ++HSYA GG S  E +  P  +A  L S+  E C TYNMLK++  L+   
Sbjct: 257 IARNAWNITVSAHSYAIGGNSQAEHFRLPNAIAGFLTSDTCEACNTYNMLKLTGELWLTN 316

Query: 418 KE-IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKA 459
            +   Y D+YERAL N +L  Q  +   G + Y  PL     RGV  A
Sbjct: 317 PDTTTYFDFYERALLNHLLGQQDPSNSHGHVTYFTPLNPGGRRGVGPA 364


>gi|94967195|ref|YP_589243.1| hypothetical protein Acid345_0164 [Candidatus Koribacter versatilis
           Ellin345]
 gi|94549245|gb|ABF39169.1| conserved hypothetical protein [Candidatus Koribacter versatilis
           Ellin345]
          Length = 602

 Score =  121 bits (304), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 140/550 (25%), Positives = 234/550 (42%), Gaps = 58/550 (10%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE--N 156
           L E    DV L +S +  R  Q   + L+ L+ D+L+  FR     P PG+  GGW   +
Sbjct: 37  LDEFGYGDVSL-ESELHNRQFQNTHDVLMGLEDDALLKPFRAMVGQPPPGRDLGGWYCFD 95

Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
           P        VG   +A+   W S  + +   +    V       N++       +   + 
Sbjct: 96  PNYNPNDVGVGFAPTATFGQWISALSRSYALRPDPAVRDKVIRLNRL-------YAQTIS 148

Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
             F  LK  + P Y   K++ GL+D +    +  ALK+    +E   +    ++  ++VE
Sbjct: 149 PEFYGLKNRF-PAYCYDKLVCGLIDAHQYVGDPDALKI----LERTTDTATPLLPGHAVE 203

Query: 277 RH--WYSLNE------ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
               W S+ +      E+  +++ L+  Y      ++  L   +    +   LA     L
Sbjct: 204 HGTVWRSVKDDGYTWDESYTISENLFLAYRRGAGDRYRALGKQYLDDTYYNPLAEGRSDL 263

Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK- 387
              HA +H+  +  +   Y   GD  Y        D V A  SYATGG  A E    P  
Sbjct: 264 EGRHAYSHVNSLCSAMQAYLTLGDEKYFRAAKNGFDFVLA-QSYATGGWGADETLRAPNS 322

Query: 388 -RLADTLGSEN---EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
             +A +L   +   E  C +Y   K++R+L R T++  Y D  ER + N +L    G  P
Sbjct: 323 PEVAKSLTGTHHSFETPCGSYAHFKLTRYLLRVTRDSRYGDSMERVMYNTIL----GALP 378

Query: 444 GVMIYMLPLGRGVSKARSTHGWGTKF--NSFW-CCYGTGIESFSKLGDSIYFEEEGNVPG 500
                ++P GR    +      G+KF  ++ W CC GT  +  +  G S Y  +     G
Sbjct: 379 -----LMPDGRTFYYSDYNFK-GSKFYHDARWPCCSGTMPQIATDYGISTYLRDPQ---G 429

Query: 501 LYIIQYISSSFDWKS--GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
           +Y+  YI S+  W+     V L QK      +DP + + L+ + ++E      ++LR+P 
Sbjct: 430 IYVNLYIPSTVRWQQDGAQVSLTQKT--AYPFDPVVEIELSTTKQREF----EVHLRIPA 483

Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
           W  +  A   +NG+   +P    F +    W   D++ ++LPL  R E +  +R   A +
Sbjct: 484 W--AEQASIEVNGKREGVPVAERFATIRRTWKNGDRIQLELPLKNRLEPLNRER---AKL 538

Query: 619 QAILFGPYLL 628
            A+L GP +L
Sbjct: 539 VALLNGPLVL 548


>gi|237718517|ref|ZP_04548998.1| LOW QUALITY PROTEIN: acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
 gi|229452224|gb|EEO58015.1| LOW QUALITY PROTEIN: acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
          Length = 502

 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/274 (32%), Positives = 132/274 (48%), Gaps = 25/274 (9%)

Query: 366 VNASHSYATGGTSARE-FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYAD 424
           V A+ S A GG S RE F  D   L+     E  E+C TYNML+++  LFR      YAD
Sbjct: 2   VTANRSLAFGGNSRREHFPDDTDYLSYVDDREGPESCNTYNMLRLTEGLFRMNPTADYAD 61

Query: 425 YYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIES 482
           +YERAL N +LS Q   E G  +Y  P       AR  H   +     + WCC GTG+E+
Sbjct: 62  FYERALFNHILSTQH-PEHGGYVYFTP-------ARPAHYRVYSAPNEAMWCCVGTGMEN 113

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
             K G+ IY     +   LY+  +ISS  +WK   + L Q      S+    +  LT ++
Sbjct: 114 HGKYGEFIYAHTGDS---LYVNLFISSRLEWKKRRISLTQ----TTSFPNEGKTCLTITA 166

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPL 601
           K+       L +R P W        ++NG+++      N + +   +W   D + +Q+P+
Sbjct: 167 KKSTK--FPLFVRKPGWVGDGKVIITVNGKSIETTTAANSYYTINRKWKNGDVVEVQMPM 224

Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           ++R E ++   PEY    AI+ GP LL  +   E
Sbjct: 225 NIRIEELK-HHPEYI---AIMRGPILLGANVGKE 254


>gi|256375993|ref|YP_003099653.1| hypothetical protein Amir_1859 [Actinosynnema mirum DSM 43827]
 gi|255920296|gb|ACU35807.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
          Length = 736

 Score =  119 bits (299), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 90/292 (30%), Positives = 127/292 (43%), Gaps = 40/292 (13%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTG 351
            L  L + T  P+HL  A +FD    +   A   D L+  HAN HIPI  G     E TG
Sbjct: 278 ALRDLRARTGKPEHLAPARMFDLDALIDACAENRDVLAGLHANQHIPIFTGLVRLREATG 337

Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSR 411
           +  Y      F D+V     Y  GGTS  EFW  P  +A+TL  +N ETC  +NMLK+ R
Sbjct: 338 EQRYLDAARNFWDMVVPRRLYRIGGTSTGEFWRAPGVIAETLADDNAETCCAHNMLKLGR 397

Query: 412 HLFRWTKEIAYADYYERALTNGVLSIQRGTEPG---VMIYMLPLGRGVSKARSTHGWGTK 468
            LF                 N +L  ++        +M Y + L  G  +  +     T 
Sbjct: 398 ALF-----------------NQILGSKQDAPSADVPLMTYFIGLAPGSVRDFTPEQGAT- 439

Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
                CC GTG+ES +K  DS+YF +E     LY+  +  ++  W    +          
Sbjct: 440 -----CCEGTGLESAAKYQDSVYFHDEKT---LYVNLFAPTTAHWNETTITRGAHF---- 487

Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPG 580
              P+ R T      +  G   ++ +R+P W  + GA ASLNG+ L +P  G
Sbjct: 488 ---PHERGTSPGIGGK--GGRVTIKVRVPSW--ARGASASLNGRPLAVPAAG 532


>gi|225874351|ref|YP_002755810.1| hypothetical protein ACP_2792 [Acidobacterium capsulatum ATCC
           51196]
 gi|225791337|gb|ACO31427.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
           51196]
          Length = 611

 Score =  119 bits (297), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 130/516 (25%), Positives = 213/516 (41%), Gaps = 71/516 (13%)

Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISE----------LRGHFVGHY 169
           Q N  + L LD D+L+  FR+ A LP PG   GGW N   E          + GH  G Y
Sbjct: 62  QANHAFFLALDEDALLKPFRERAGLPAPGPQMGGWYNFSKEFDPPNNMTGYIPGHSFGQY 121

Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPY 229
           LS  A+ +A+T +   K K+  +V            G+  A   + +D +    P+  P 
Sbjct: 122 LSGLARAYAATGDQPTKAKVHRLV-----------RGFAEAVSPKFYDDY----PL--PC 164

Query: 230 YTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN------ 283
           YT  K   GL+D +  A +  AL   +  ++     V   +  +++ R   +        
Sbjct: 165 YTFDKSNCGLIDAHQFAGDPNALHALSRALD----AVMPYLPSHALTRPEMAARPHPNIA 220

Query: 284 ---EETGGMNDVLYRLYSITHDPKHLLLAHLF--DKPCFLGFLALQADYLSHFHANTHIP 338
              +E+  + +  +  Y  + D K+L++A  F  DK  +   LA   + L H HA +H+ 
Sbjct: 221 FTWDESYTLPENFFLAYKRSGDEKYLVMAQRFLQDK-SYFDPLAEGDNVLPHQHAYSHVN 279

Query: 339 IVIGSQMRYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDP------KRLAD 391
            +  +   Y V G   + +     F  +++   S+ATGG    E + +P      K L +
Sbjct: 280 ALNSASQAYLVLGSEKHLRAARNGFQFVLD--QSFATGGWGPNETFVEPGSGGLYKSLTE 337

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
           T  S  E  C  Y   KV+R+L R T +  Y D  E+ L N +L      + G   Y   
Sbjct: 338 THAS-FETPCGAYGHFKVTRYLMRITGDSRYGDSMEQVLYNTILGAMPLEQGGFSFYYSD 396

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
                +K      W        CC GT  +  +  G S YF    +  GLY+  ++ S  
Sbjct: 397 YNNYAAKNYYPEQWP-------CCSGTFPQVTADYGISSYFH---SPEGLYVNLFVPSRA 446

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
            ++ G    + +      ++  + M +   + Q      S+ LR+P W    G   ++NG
Sbjct: 447 KFQIGGARFSLEQRTHYPYENDIAMQVRGDNPQTF----SIALRVPAWA-GKGTSITVNG 501

Query: 572 QNLPLP-PPGNFLSATERWSYNDKL--TIQLPLSLR 604
           +       PG F+     W   D++  +I  PLSL+
Sbjct: 502 RKAEAEVKPGTFVRLHREWKDGDRIEYSIDRPLSLQ 537


>gi|336425065|ref|ZP_08605095.1| hypothetical protein HMPREF0994_01101 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336012974|gb|EGN42863.1| hypothetical protein HMPREF0994_01101 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 575

 Score =  117 bits (292), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 141/582 (24%), Positives = 240/582 (41%), Gaps = 82/582 (14%)

Query: 99  LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
            KEV+L++       ++ +     L + L +  D+++   R++A  P PG  Y GW    
Sbjct: 6   FKEVTLNE------GMMKKVLDETLAFYLKIPNDNILKYMRESAGKPAPGIFYTGW---Y 56

Query: 159 SELRG-HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
              RG   +G +LSA ++M+A + +   ++K   +     +C       Y SA  T  F 
Sbjct: 57  PNSRGIALIGQWLSAYSRMYAISGDEAFRQKAVYLADEFWDC-------YESAQHTAPFL 109

Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
           +  +       +Y + K+L    D ++      A + A +++++  + +           
Sbjct: 110 TSRS-------HYDVEKLLRAHCDLFLYCKYPCAKERAGYLIDFAADNLTAENIFGDNST 162

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD----------Y 327
            WY+L E         +  + I   P+   +A  F+   F       AD          Y
Sbjct: 163 EWYTLAES-------FWDAFEILEIPRAQQMAERFEYREFWDLFYKDADPFSKRPQAGLY 215

Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
               HA +H+         YE+T  P +      F   +      ATGG         PK
Sbjct: 216 SEFCHAYSHVNSFNSCAKAYEMTKSPYFLKSLRSFYRFMQTEEVMATGGYGPNYEHLMPK 275

Query: 388 -RLADTL--GSENEET-CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
            R+ D L  G ++ ET C TY   ++ ++L R+T E  Y ++ E  L N   +    TE 
Sbjct: 276 NRIIDALRTGHDSFETQCDTYAAFRLCKYLTRFTDEPEYGNWVESLLYNAAAATIPMTEE 335

Query: 444 GVMIYM--LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
           G +IY     +  G  K R   GW        CC GT     +++   IYFE +G    L
Sbjct: 336 GNIIYYSDYNMYAGYKKNRQD-GWT-------CCTGTRPLLVAEIQRLIYFEGDGE---L 384

Query: 502 YIIQYISSSFDW-KSGH-VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           YI QYI S+  W ++G+ + + Q+       +  L ++L+ S+         ++ R+P W
Sbjct: 385 YISQYIPSTLHWNRNGNDISIRQETGFPEGKETTLILSLSCSAA------FPIHFRLPGW 438

Query: 560 TYSNGAQASLNGQNLPLPP---PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
                 +  ++  N+PLP       +L+    W   D+LTI LP  +   ++    P   
Sbjct: 439 L---SGEMKVSCNNVPLPATVDKNGWLTIHSEWKEGDRLTISLPAEVWMHSLD---PVKN 492

Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGT----ARSLSALISPIP 654
              A L+GP +LA   SG   I+T       +SL+  + P+P
Sbjct: 493 GPNAFLYGPVVLAADYSG---IQTPNDWMDVQSLTEKMKPVP 531


>gi|413954826|gb|AFW87475.1| hypothetical protein ZEAMMB73_309562 [Zea mays]
          Length = 161

 Score =  108 bits (269), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 69/170 (40%), Positives = 97/170 (57%), Gaps = 25/170 (14%)

Query: 691 GTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPK 750
           GT+AA+HATFRL+ +  + +         G + MLEP D PGM+V     D L V+ + K
Sbjct: 10  GTEAAVHATFRLVPQGGAGA---------GAAAMLEPLDMPGMVVT----DRLTVA-AEK 55

Query: 751 EMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTESLD-----A 805
             G++ F +V GL     +VSLE  +R GCF+  G     G  +++ C+  +       A
Sbjct: 56  SSGAA-FNVVPGLAGAPGSVSLELASRPGCFLVGG-----GEKVQVGCAGGAQQKRGDGA 109

Query: 806 GFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
            F R+ASF     +  YHP+SF A+G RR+FLL PL + RDE YTVYFN+
Sbjct: 110 WFRRSASFARGEPLRRYHPMSFAARGVRRSFLLEPLFTLRDEFYTVYFNL 159


>gi|94967351|ref|YP_589399.1| hypothetical protein Acid345_0320 [Candidatus Koribacter versatilis
           Ellin345]
 gi|94549401|gb|ABF39325.1| Protein of unknown function DUF1680 [Candidatus Koribacter
           versatilis Ellin345]
          Length = 607

 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 124/533 (23%), Positives = 215/533 (40%), Gaps = 64/533 (12%)

Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI----------SELRGHFVGHYLS 171
           N  + L LD D L+  FR+ A LP PG+  GGW +              + GH +G Y+S
Sbjct: 58  NHAFFLKLDEDRLLKVFRQKAGLPAPGEDMGGWYDLTGFDLAKGDFHGFVPGHTLGQYVS 117

Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYT 231
           A A+ +A+T +   K K+  +V                 +   L D          P YT
Sbjct: 118 ALARCYAATGSEETKAKVHRLV---------------KGYGATLDDKASFFAGYRLPAYT 162

Query: 232 IHKILAGLLDQYVLADNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYSLN-EET 286
             K+  GL+D +  A +  A+    K+   M++Y   +        +      S   +E+
Sbjct: 163 YDKLSCGLIDAHEFAHDPDAMAIHEKLTRGMLQYLPEKALSRAEQRARPHKDESFTWDES 222

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
             + + L+  Y  T +  +  L   F +   +   L+   + L+  HA +H+     +  
Sbjct: 223 YTLPENLFLAYRRTGNKFYRELGTRFLEDDTYFNPLSEGINVLAGEHAYSHMNAFCSAMQ 282

Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW--WDPKRLADTLGSEN---EET 400
            Y       ++        +V A  S+ATGG    E +  ++  +L D+L   +   E  
Sbjct: 283 AYLTLDSERHRKAARNGFRMV-AEQSFATGGWGPSEAFVEFNKGQLGDSLEKSHSSFETP 341

Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
           C  Y   K++R+L +   +  Y D  ER + N VL  +     G   Y            
Sbjct: 342 CGAYAHFKLTRYLLQTDGDSTYGDSMERVMYNTVLGAKPIQPDGTSFYY--------SDY 393

Query: 461 STHGWGTKFNSFW-CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS--GH 517
           +T G     N  W CC GT  +  +    SIY +      G+ +  ++ S+  WK+  G 
Sbjct: 394 ATVGKKVYHNDKWPCCSGTLPQVAADYHISIYLKA---TDGVCVNLFVPSTLIWKASDGS 450

Query: 518 VVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
             L Q+   P  +      + + F++ Q V Q  +L +R+P W  S  A   +NGQ   +
Sbjct: 451 CKLTQETKYPFET-----SVAMRFATTQPVEQ--TLYIRIPAWVTSEPA-LRVNGQRTDV 502

Query: 577 PP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
              PG F +    W   D++ + LP+    + +     ++  + A++ GP +L
Sbjct: 503 AAKPGAFAAIRRTWKDGDRIDLDLPMGFELQPVDG---QHEKLVALVHGPLVL 552


>gi|224072775|ref|XP_002303875.1| predicted protein [Populus trichocarpa]
 gi|222841307|gb|EEE78854.1| predicted protein [Populus trichocarpa]
          Length = 103

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 58/131 (44%), Positives = 75/131 (57%), Gaps = 30/131 (22%)

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
           +R+P WT+  GA+  +N                         T Q+P S       DDRP
Sbjct: 1   MRIPTWTHLEGAETVINDS-----------------------TWQIPAS-------DDRP 30

Query: 614 EYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTF 673
           EYASIQAIL+GPYL AGHT+ +WDIK  +A SLS   +PIP ++N  LVTF+Q+S N TF
Sbjct: 31  EYASIQAILYGPYLFAGHTTADWDIKNVSADSLSEWSTPIPAAYNDHLVTFSQKSRNPTF 90

Query: 674 VMSNSNQSITM 684
            + NSN  IT+
Sbjct: 91  FLINSNHIITV 101


>gi|224072771|ref|XP_002303873.1| predicted protein [Populus trichocarpa]
 gi|222841305|gb|EEE78852.1| predicted protein [Populus trichocarpa]
          Length = 103

 Score = 98.2 bits (243), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 55/131 (41%), Positives = 72/131 (54%), Gaps = 30/131 (22%)

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
           +R+P WT+  GA+  +N                         T Q+P S       DDRP
Sbjct: 1   MRIPTWTHLEGAETVINDS-----------------------TWQIPAS-------DDRP 30

Query: 614 EYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTF 673
           EYASIQAIL+GP L AGHT+ +WDIK  +A SL    +PIP ++N  LVTF+Q+S N  F
Sbjct: 31  EYASIQAILYGPSLFAGHTTADWDIKNVSADSLPEWSTPIPAAYNDHLVTFSQKSRNPNF 90

Query: 674 VMSNSNQSITM 684
            + NSN  IT+
Sbjct: 91  FLINSNHIITV 101


>gi|255624614|ref|XP_002540501.1| hypothetical protein RCOM_2107350 [Ricinus communis]
 gi|223495313|gb|EEF21882.1| hypothetical protein RCOM_2107350 [Ricinus communis]
          Length = 208

 Score = 95.9 bits (237), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 65/207 (31%), Positives = 96/207 (46%), Gaps = 15/207 (7%)

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------EL 215
           GHYLSA A M A+T +  ++E++  VV  L  CQ   G GY+   P            +L
Sbjct: 3   GHYLSALAMMVAATGDEQVRERLDYVVAELKRCQAANGNGYIGGVPGGAAAWRDIAQGKL 62

Query: 216 FDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSV 275
                ++   W P+Y +HK  AGL D Y  A N  A  M   + ++      ++ +  S 
Sbjct: 63  HADNFSVNGKWVPWYNLHKTFAGLRDAYTYAGNQDAHAMLIALCDW----TLELTSHLSD 118

Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
           E+    +  E GGMN+VL  +  +T   K++ LA  F     L  L    D L+  HANT
Sbjct: 119 EQMQSMMRAEHGGMNEVLADVAQMTGQQKYMDLAIRFSHQALLRPLEEGKDQLTGLHANT 178

Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFF 362
            IP VIG +   ++T    ++    FF
Sbjct: 179 QIPKVIGFKRIGDITSRDDWQRAAAFF 205


>gi|284043399|ref|YP_003393739.1| hypothetical protein Cwoe_1938 [Conexibacter woesei DSM 14684]
 gi|283947620|gb|ADB50364.1| protein of unknown function DUF1680 [Conexibacter woesei DSM 14684]
          Length = 711

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 126/527 (23%), Positives = 222/527 (42%), Gaps = 84/527 (15%)

Query: 132 DSLVWSFRKTASLPTPGKAYGGWENPISELRGHF--VGHYLSASAQMWASTHNATIKEKM 189
           D+L++ FR       PG    GW        G F  +G + +  A+++A+T      EK 
Sbjct: 47  DALLYPFRIRKGSWAPGIPLRGWYG-----EGLFNNLGQFFTLYARLYAATGEHRFAEKA 101

Query: 190 STVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNA 249
             ++    E   + G G+LS   +    + E         Y+  K++ GLLD +    + 
Sbjct: 102 LALLDGWEETIEEDG-GFLS---SHFAGTVE---------YSYDKLVCGLLDLHEYVGSE 148

Query: 250 QAL----KMATWMVEYFYNRVQKVIT-MYSVERHWYSLNEETGGMNDVLYRLYSITHDPK 304
           +AL    +++ WM  +  +      + M  +E  WY+L E        L R Y++T DP 
Sbjct: 149 RALPVLERVSRWMQRHGGSSKPYAWSGMGPLE--WYTLPE-------YLLRAYAVTSDPL 199

Query: 305 HLLLAHLFDKPCF--------LGFLALQAD-----YLSHFHANTHIPIVIGSQMRYEVTG 351
           +  LA+ +    F        +G L  +AD     Y +H HANT    +  +   YE TG
Sbjct: 200 YRELANAYRYDEFYDALLERDVGALMRRADEARNFYQAHSHANT----LNSAAAVYETTG 255

Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN---EETCTTYNMLK 408
           DP Y  + T   +++  S ++ATG     E +  P++  + L SE    E  C ++ M++
Sbjct: 256 DPRYLDVLTAGYELLRESQTFATGMFGPLEAFMKPRQRVEVLHSEEGHAEVACPSWAMMR 315

Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG-VMIYMLPLGRGVSKARSTHGWGT 467
           + RHL   T E  + D+ E  + NG+ S       G    Y    G      R+T  WG 
Sbjct: 316 LVRHLIELTGEAQFGDWMELNVYNGIGSAPPTRADGRATQYFADYG----LDRATKTWGV 371

Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF--DWKSGHVVLNQK-- 523
           +++   CC  T   + ++  + IY+        L++  Y+ SS   +     + L Q+  
Sbjct: 372 EWS---CCSTTSGINMAEYVNQIYY---AGPDALHVCLYLPSSVTCEIDGATLWLTQRTA 425

Query: 524 --VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
             VD  V++D  +   L            ++  R+P WT +   + +L+G+ +       
Sbjct: 426 YPVDERVAFDVRVERPLR----------GTIAFRVPAWT-AGEPRLTLDGEPVEHVVRDG 474

Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
           + +    W   D + + LP+ L    ++      A   A+ +GP +L
Sbjct: 475 WATVERTWEDGDAIELTLPMELAVLPVEPATD--AGPVALRYGPVVL 519


>gi|557474|gb|AAA50392.1| ORF1, partial [Bacteroides ovatus]
          Length = 436

 Score = 93.2 bits (230), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 64/217 (29%), Positives = 100/217 (46%), Gaps = 22/217 (10%)

Query: 422 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
           Y +YYERAL N +L+ Q   + G  +Y  P+  G  +      +     S WCC G+G+E
Sbjct: 4   YVNYYERALYNHILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLE 57

Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFS 541
           + +K G+ IY   +     LY+  +I S   WK   ++L Q+          LR+     
Sbjct: 58  NHTKYGEFIYAYRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRINEAPK 114

Query: 542 SKQEVGQLSSLNLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQ 598
            K+      +L +R+P W   S G   S+NG+     +P    +L  + +W   D +T  
Sbjct: 115 KKR------TLMIRIPEWANQSKGYSVSINGKRKMFVMPKGNQYLPLSRKWEKGDVITFH 168

Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           LP+ +  E I D +  Y    A L+GP +LA  T  E
Sbjct: 169 LPMKVSVEQIPDKKDYY----AFLYGPIVLAASTGTE 201


>gi|229818564|ref|YP_002880090.1| hypothetical protein Bcav_0062 [Beutenbergia cavernae DSM 12333]
 gi|229564477|gb|ACQ78328.1| protein of unknown function DUF1680 [Beutenbergia cavernae DSM
           12333]
          Length = 596

 Score = 89.7 bits (221), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 122/542 (22%), Positives = 205/542 (37%), Gaps = 94/542 (17%)

Query: 124 EYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNA 183
           E  L +  D +V  FR  A LP PG    GW +  S+      G ++S  A++  +   A
Sbjct: 42  ETYLGMSPDDVVHGFRLQAGLPAPGNPMTGWSSRTSQ---PTFGQWVSGLARLGVTAGVA 98

Query: 184 TIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQY 243
              ++   +V                AF   + D  +A   +    Y   K++ GL D  
Sbjct: 99  EASQRAVDLV---------------DAFAATVGDDGDARMGL----YGYEKLVCGLADTA 139

Query: 244 VLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDP 303
           + A +  AL +     E+     ++        R   S N+  GG      R+   +H  
Sbjct: 140 LYAGHEDALALLGRTAEWASRTFERA-------RPAASPNDFAGG------RIGPASH-- 184

Query: 304 KHLLLAHLFDKPCFLGFLALQADYLSHF-----------------------------HAN 334
              +  + F +  + G+LA   D +  F                             HA 
Sbjct: 185 ARTMEWYTFAENLYRGWLAGADDAVREFASEWHYDAYWDRFLTPPPPGQPWDVPTWLHAY 244

Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK----RLA 390
           +H+     +   YEVTG+  Y  I       +  + +YATGG    E          R  
Sbjct: 245 SHVNTFASAAAAYEVTGEVRYLDILRNAHTYLTTTQTYATGGYGPSELTLPEDGSLGRSI 304

Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
           +      E  C ++   K+S  L + T E  YAD+ E+ + +G+ ++      G   Y  
Sbjct: 305 EWRTDTAEIVCGSWAAFKLSSALLKHTGEARYADWVEQLVYSGIGAVTPVRPGGRTPYYQ 364

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
            L  G++  +  H     ++ + CC GT +++ S L D +YF ++    GL +  Y+ S+
Sbjct: 365 DLRLGIAT-KLPH-----WDDWPCCSGTYLQAVSHLPDLVYFGDDDG--GLAVALYVPST 416

Query: 511 FDWKSGH--VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
             W+S    V L Q+              +  +S   VG      LR+ V  +S G + S
Sbjct: 417 VSWESAGSTVTLTQRT----------AFPVEDTSTITVGGSGRFRLRLRVPPWSEGFRVS 466

Query: 569 LNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           +NG  +  +  PG++      W+  D +T+ L   LR   +    P      A   GP +
Sbjct: 467 VNGVAVDGVATPGDWFVLERDWADGDVVTVTLGAGLRVLPVDRWHPNRV---AFAHGPVV 523

Query: 628 LA 629
           LA
Sbjct: 524 LA 525


>gi|361069271|gb|AEW08947.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
          Length = 75

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 38/69 (55%), Positives = 53/69 (76%)

Query: 787 NFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRD 846
           +++ G +++L C     D  FNRA+SF    G ++YHPISF+A+GARR +LLAPLL++RD
Sbjct: 5   SYQVGQAVELRCKPLVTDLAFNRASSFTWNTGFAKYHPISFIARGARRAYLLAPLLTYRD 64

Query: 847 EAYTVYFNI 855
           E+YTVYFNI
Sbjct: 65  ESYTVYFNI 73


>gi|383146477|gb|AFG54937.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146481|gb|AFG54941.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
          Length = 75

 Score = 87.4 bits (215), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 38/69 (55%), Positives = 53/69 (76%)

Query: 787 NFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRD 846
           +++ G +++L C     D  FNRA+SF    G ++YHPISF+A+GARR +LLAPLL++RD
Sbjct: 5   SYQVGQAVELRCKPLVTDLAFNRASSFTWNTGFAKYHPISFIARGARRAYLLAPLLAYRD 64

Query: 847 EAYTVYFNI 855
           E+YTVYFNI
Sbjct: 65  ESYTVYFNI 73


>gi|383146472|gb|AFG54932.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146473|gb|AFG54933.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146474|gb|AFG54934.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146475|gb|AFG54935.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146476|gb|AFG54936.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146478|gb|AFG54938.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146479|gb|AFG54939.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146480|gb|AFG54940.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146482|gb|AFG54942.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146483|gb|AFG54943.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146484|gb|AFG54944.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146485|gb|AFG54945.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146486|gb|AFG54946.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146487|gb|AFG54947.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146488|gb|AFG54948.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
 gi|383146489|gb|AFG54949.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
          Length = 75

 Score = 86.7 bits (213), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 37/69 (53%), Positives = 53/69 (76%)

Query: 787 NFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRD 846
           +++ G +++L C     D  FNRA+SF    G ++YHPISF+A+GARR +LLAPLL+++D
Sbjct: 5   SYQVGQAVELRCKPLVTDLAFNRASSFTWNTGFAKYHPISFIARGARRAYLLAPLLAYKD 64

Query: 847 EAYTVYFNI 855
           E+YTVYFNI
Sbjct: 65  ESYTVYFNI 73


>gi|380482670|emb|CCF41095.1| secreted protein [Colletotrichum higginsianum]
          Length = 246

 Score = 85.9 bits (211), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 82/289 (28%), Positives = 122/289 (42%), Gaps = 70/289 (24%)

Query: 406 MLKVSRHLFRWT--KEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSK 458
           MLK++R L+  +     AY D+YERAL N +L  Q  ++  G + Y  PL     RGV  
Sbjct: 1   MLKLTRELWLTSPGTTTAYFDFYERALLNHLLGQQDPSDDHGHVTYFTPLNPGGRRGVGP 60

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
           A     W T ++SFWCC GTG+E+ +KL DSIYF +      LY+  +I S  +W    V
Sbjct: 61  AWGGGTWSTDYDSFWCCQGTGLETNTKLTDSIYFYD---ASALYVNLFIPSVLEWTQRGV 117

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
            + Q  +       + R   T       G   S+ +R+P W  S GA             
Sbjct: 118 TVTQTTE-------FPRGDTTTLKVAGAGTW-SMRVRIPSWA-SGGA------------- 155

Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDI 638
                              QLP+ L      DD     ++ A+ FGP +L+G+   E   
Sbjct: 156 -------------------QLPMKLHVIPANDD----PNVAALAFGPVILSGNYGSE--- 189

Query: 639 KTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEF 687
                 +LS       P+ N   V  T +SG + F  +   +++ +  F
Sbjct: 190 ------TLSTT-----PALNLTTVRRTGDSGLA-FTATAGGKTVNLGPF 226


>gi|357472929|ref|XP_003606749.1| hypothetical protein MTR_4g065190 [Medicago truncatula]
 gi|355507804|gb|AES88946.1| hypothetical protein MTR_4g065190 [Medicago truncatula]
          Length = 111

 Score = 82.8 bits (203), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 56/139 (40%), Positives = 67/139 (48%), Gaps = 33/139 (23%)

Query: 724 MLEPFDFPGMLV-QQGKEDELVVSES----PKEMGSSGFRLVAGLDKRNETVSLEAENRK 778
           MLEPFD PGM V  QG E  L++ +S    P  + S G R+  G  K N    +     K
Sbjct: 1   MLEPFDLPGMTVSHQGPEKPLIIVDSSHGGPSSVFSCGTRI--GWTKSNNIFRITKLLLK 58

Query: 779 GCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLL 838
                  V                          F+   G+ +YHPISFVAKGA +NFLL
Sbjct: 59  LVLTKQLV--------------------------FVSGKGLRQYHPISFVAKGANQNFLL 92

Query: 839 APLLSFRDEAYTVYFNIQD 857
            PL +FRDE YTVYFNIQD
Sbjct: 93  DPLFNFRDEHYTVYFNIQD 111


>gi|423223914|ref|ZP_17210383.1| hypothetical protein HMPREF1062_02569 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392637516|gb|EIY31383.1| hypothetical protein HMPREF1062_02569 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 664

 Score = 82.4 bits (202), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 84/291 (28%), Positives = 122/291 (41%), Gaps = 42/291 (14%)

Query: 318 LGFLALQADYLSH-FHANTHIPIVIGSQMRYEVTGDP--LYKLIGTFFMDIVNASHSYAT 374
           LG   LQ    SH FH N      +G    Y +TGD   L K+ G +  D ++    Y T
Sbjct: 270 LGVDKLQPYVHSHTFHMN-----FMGFLRLYRITGDKTLLRKVSGAW--DDIHERQMYIT 322

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG S  E +         L     ETC T + +++++ L   T E  YAD  ER + N V
Sbjct: 323 GGVSVAEHY--EHDYVKPLSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHV 380

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSI 490
            + Q   E GV  Y             T   G+K + ++    CC  +G    S L   I
Sbjct: 381 FAAQ-DCESGVCRY------------HTAPNGSKPDGYFHGPDCCTASGHRIISMLPTFI 427

Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
           Y E E      YI QY+ S +  K     +        ++     M LT  S  E  +  
Sbjct: 428 YAEREKE---FYINQYMPSQYTGKDFAFEITG------NYPESENMQLTIVS--EKARNK 476

Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
           +LNLR+P W      +  +NG+N+    PG +L    +W+  DK++I  P+
Sbjct: 477 TLNLRIPSW--CEHPEIKVNGENIADVKPGTYLKLPRKWTKGDKVSITFPM 525


>gi|224537087|ref|ZP_03677626.1| hypothetical protein BACCELL_01964 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521314|gb|EEF90419.1| hypothetical protein BACCELL_01964 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 664

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 84/291 (28%), Positives = 123/291 (42%), Gaps = 42/291 (14%)

Query: 318 LGFLALQADYLSH-FHANTHIPIVIGSQMRYEVTGDP--LYKLIGTFFMDIVNASHSYAT 374
           LG   LQ    SH FH N      +G    Y +TGD   L K+ G +  D ++    Y T
Sbjct: 270 LGVDKLQPYVHSHTFHMN-----FMGFLRLYRITGDKTLLRKVSGAW--DDIHERQMYIT 322

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG S  E +         L     ETC T + +++++ L   T E  YAD  ER + N V
Sbjct: 323 GGVSVAEHY--EHDYVKPLSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHV 380

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSI 490
            + Q   E GV  Y             T   G+K + ++    CC  +G    S L   I
Sbjct: 381 FAAQ-DCESGVCRY------------HTAPNGSKPDGYFHGPDCCTASGHRIISMLPTFI 427

Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
           Y E+       YI QYI S +  K     +        ++     M LT  S  E  +  
Sbjct: 428 YAEKGKE---FYINQYIPSQYTGKDFAFEITG------NYPESENMQLTIVS--EKAKNK 476

Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
           +LNLR+P W      +  +NG+N+    PG +L  + +W+  DK++I  P+
Sbjct: 477 TLNLRIPSWC--EHPEIKVNGENIADVKPGAYLKLSRKWTKGDKVSITFPM 525


>gi|237719720|ref|ZP_04550201.1| predicted protein [Bacteroides sp. 2_2_4]
 gi|229450989|gb|EEO56780.1| predicted protein [Bacteroides sp. 2_2_4]
          Length = 663

 Score = 79.3 bits (194), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 74/285 (25%), Positives = 122/285 (42%), Gaps = 30/285 (10%)

Query: 318 LGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP-LYKLIGTFFMDIVNASHSYATGG 376
           LG   LQ     + H++T     +G    Y +TGD  L++ +   + DI +    Y TGG
Sbjct: 272 LGVDKLQP----YVHSHTFQMNFMGFLRLYRITGDKSLFRKVAGAWDDI-HKRQMYITGG 326

Query: 377 TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
            S  E +         +     ETC T + +++++ L   T E  YAD  ER + N V +
Sbjct: 327 VSVAEHY--EHDYVKPISGHVVETCATMSWMQLTQMLLELTGESKYADAMERLMINHVFA 384

Query: 437 IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
            Q         +  P G         HG+   F+   CC  +G    S L   +Y E+  
Sbjct: 385 AQDCETGSCRYHTAPNG------SKPHGY---FHGPDCCTASGHRIISMLPTFMYAEKGK 435

Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
                Y+ QY+ S +  K+    ++     + +      M LT +S++   ++  LNLR+
Sbjct: 436 E---FYVNQYVPSQYAGKAFSFEISGNYPEVEN------MELTVTSERVADRV--LNLRI 484

Query: 557 PVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
           P W      Q S+NG+ +    PG +L  + +W   DK+ I  P+
Sbjct: 485 PSW--CEKPQVSVNGEKMAGVQPGTYLKISRKWVKGDKVCIVFPM 527


>gi|332881627|ref|ZP_08449275.1| hypothetical protein HMPREF9074_05065 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|357045708|ref|ZP_09107342.1| hypothetical protein HMPREF9441_01351 [Paraprevotella clara YIT
           11840]
 gi|332680266|gb|EGJ53215.1| hypothetical protein HMPREF9074_05065 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|355531373|gb|EHH00772.1| hypothetical protein HMPREF9441_01351 [Paraprevotella clara YIT
           11840]
          Length = 586

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 116/493 (23%), Positives = 195/493 (39%), Gaps = 87/493 (17%)

Query: 153 GWENPISELRGHFV-GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
            W+   +E  G ++   YLSA  +     ++  + +K  T++  + + Q +  +GY+ A 
Sbjct: 2   AWDWTKAEQHGKWIESAYLSAIQR-----NDKALLDKARTMLKRIVDSQEE--SGYVGAT 54

Query: 212 ---------PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYF 262
                    P    D++E        Y+  H  +    +    A  A A K+A + ++YF
Sbjct: 55  SKNYRSDERPVRGMDAYEL-------YFVFHAFITVYEETGDKASLAAAEKLADYYLKYF 107

Query: 263 ---------------YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHL- 306
                           NR + +  +     H    + E   + D + RLY +T   K+L 
Sbjct: 108 GPGKLEFWPSDLRDPENRHKSIDALSQFAGHGVHYSWEGTLLCDPIARLYEVTGKKKYLD 167

Query: 307 ----LLAHL--------FDKPCFLGFLALQADYLS-HFHANTHIPIVIGSQMRYEVTGDP 353
               ++ ++        F +   +    L  D L  + H++T     +G    Y +TGD 
Sbjct: 168 WSLWVVGNIDKWSGWDAFSRLDSVADGTLGVDKLQPYVHSHTFQMNFMGFLRLYRITGDK 227

Query: 354 -LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
            L++ +   + DI N    Y TGG S  E +         +     ETC T + +++++ 
Sbjct: 228 SLFRKVAGAWDDICN-RQMYITGGVSVAEHY--EHGYVKPVSGNVVETCATMSWMQLTQM 284

Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
           L   T E  YAD  ER + N V + Q   E G   Y             T   GTK + +
Sbjct: 285 LLELTGESKYADAMERLMMNHVFAAQ-DCESGTCRY------------HTAPNGTKPHDY 331

Query: 473 W----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
           +    CC  +G    S L    Y E   N    YI QY+ S +D K     ++       
Sbjct: 332 FHGPDCCTASGHRIISLLPTFFYAE---NGKDFYINQYLPSRYDGKDFAFEISGNYPESE 388

Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATER 588
           S      M LT  S +   ++  LNLR+P W      + S+NG+ +     G +L+ T +
Sbjct: 389 S------MVLTVLSSKNKNKI--LNLRIPSWC--KAPEVSVNGERVSGIEAGKYLAITRK 438

Query: 589 WSYNDKLTIQLPL 601
           W   DK+ I  P+
Sbjct: 439 WEKGDKIGITFPM 451


>gi|427384256|ref|ZP_18880761.1| hypothetical protein HMPREF9447_01794 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727517|gb|EKU90376.1| hypothetical protein HMPREF9447_01794 [Bacteroides oleiciplenus YIT
           12058]
          Length = 662

 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 80/291 (27%), Positives = 121/291 (41%), Gaps = 42/291 (14%)

Query: 318 LGFLALQADYLSH-FHANTHIPIVIGSQMRYEVTGDP--LYKLIGTFFMDIVNASHSYAT 374
           LG   LQ    SH FH N      +G    Y +TGD   L K+ G +  D ++    Y T
Sbjct: 270 LGVDKLQPYVHSHTFHMN-----FMGFLRLYRITGDKSLLRKVAGAW--DDIHERQMYIT 322

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           GG S  E +         L     ETC T + +++++ L   T E  YAD  ER + N V
Sbjct: 323 GGVSVAEHY--EHDYVKPLSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHV 380

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSI 490
            + Q   E GV  Y             T   G+K + ++    CC  +G    S L   I
Sbjct: 381 FAAQ-DCENGVCRY------------HTAPNGSKPDGYFHGPDCCTASGHRIISMLPTFI 427

Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
           Y E+       Y+ QY+ S ++ K     +        ++     M L   S  E  +  
Sbjct: 428 YAEKGKE---FYVNQYMPSQYNGKDFAFSITG------NYPESENMELVIES--EKAKNK 476

Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
           ++NLR+P W      + S+NG+ +    PG +L  + +W   DK+ I  P+
Sbjct: 477 TINLRIPSWC--ENPKVSVNGEAVADIKPGTYLKLSRKWGKGDKINIIFPM 525


>gi|330998039|ref|ZP_08321870.1| hypothetical protein HMPREF9442_02974 [Paraprevotella xylaniphila
           YIT 11841]
 gi|329569340|gb|EGG51120.1| hypothetical protein HMPREF9442_02974 [Paraprevotella xylaniphila
           YIT 11841]
          Length = 661

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 72/276 (26%), Positives = 118/276 (42%), Gaps = 26/276 (9%)

Query: 330 HFHANTHIPIVIGSQMRYEVTGDP-LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
           + H++T     +G    Y +TGD  L++ +   + DI +    Y TGG S  E +     
Sbjct: 278 YVHSHTFQMNFMGFLRLYRITGDKSLFRKVEGAWEDI-HKRQMYITGGVSVAEHY--EHG 334

Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
               +     ETC T + +++++ L   T E  YAD  ER + N V + Q         +
Sbjct: 335 YVKPVSGNVVETCATMSWMQLTQMLLELTGESKYADAMERLMMNHVFAAQDCETGTCRYH 394

Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
             P   G   A   HG         CC  +G    S L   +Y E        ++ QY+ 
Sbjct: 395 TAP--NGTKPASYFHGPD-------CCTASGHRIISMLPTFMYAERGKE---FFVNQYLP 442

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
           S +  K     ++       ++     M LT  S++ V ++  LNLR+P W      + S
Sbjct: 443 SHYIGKDFAFQISG------NYPEAENMELTVLSEKAVDRV--LNLRIPSWC--KAPRVS 492

Query: 569 LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
           +NG+N+    PG +L  + +WS  DK++I  P+  R
Sbjct: 493 VNGKNVIGVEPGTYLKISRKWSKGDKVSIVFPMEER 528


>gi|189467307|ref|ZP_03016092.1| hypothetical protein BACINT_03695 [Bacteroides intestinalis DSM
           17393]
 gi|189435571|gb|EDV04556.1| hypothetical protein BACINT_03695 [Bacteroides intestinalis DSM
           17393]
          Length = 611

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 119/525 (22%), Positives = 204/525 (38%), Gaps = 73/525 (13%)

Query: 128 MLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKE 187
           + DVD L   FR               +N  +  +  F G ++  +   +   H+  +  
Sbjct: 46  LQDVDHLTAPFRT--------------KNDTASWQTEFWGKWVQGAIASYRYNHSVALYA 91

Query: 188 KMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLAD 247
           K+   V  +   Q     GY+  +     D+      +W   YT      GLL  Y ++ 
Sbjct: 92  KIKKSVDDIISTQQP--DGYIGNYR---LDAQLKSWDIWGRKYTT----LGLLSWYEISG 142

Query: 248 NAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLL 307
             QAL  A  ++++   +V +  T      ++Y +   +  +  V+Y LY  T D K+L 
Sbjct: 143 EKQALNAACRVIDHLMTQVGEGGTNIVTTGNYYGM-ASSSILEPVMY-LYKYTGDYKYLQ 200

Query: 308 LAHLF-------DKPCFL----GFLALQADYLSHF---------HANTHIPIVIGSQMRY 347
            A          + P  +      + + A +   F          A   +   IG    Y
Sbjct: 201 FAKYIVAQWETPEGPQLITKAINGVPVAARFPHPFDWFSPENGQKAYEMMSCYIGLLELY 260

Query: 348 EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           +VT +  Y   +     DI N   + A  G SA E W+  ++   +      ETC T+  
Sbjct: 261 KVTHNAAYLDAVQKTVNDIANTEINVAGSG-SAFESWYSGRKYQTSPTYHTMETCVTFTW 319

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
           +++   L   T    YAD  E++L N +++  +     +  Y    G    +       G
Sbjct: 320 IQLCDKLLALTGNPFYADQIEKSLYNALMAALKDDASQIAKYSPMEGH---RCEGEEQCG 376

Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--VVLNQKV 524
              N   CC   G  +F+ + D    ++ GN   +Y+  Y   S   ++GH  V++ Q  
Sbjct: 377 MHIN---CCNANGPRAFALIPD-FAVKKMGN--EVYVNYYGDMSASLENGHNKVLVKQHT 430

Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLS 584
              VS    + +T+  + +   G    L+LR+PVW  S     +LNG+ L    PG + +
Sbjct: 431 TYPVS--NVIDITIDVTKENVFG----LHLRVPVW--SAQTVITLNGEELKDICPGTYHA 482

Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
            T +W   D + I L +  R         E   +QAI+ GP +LA
Sbjct: 483 ITRKWKKGDHIQIILDMPARL-------LEQNQMQAIVRGPIVLA 520


>gi|374374779|ref|ZP_09632437.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
 gi|373231619|gb|EHP51414.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
          Length = 614

 Score = 72.8 bits (177), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 115/522 (22%), Positives = 203/522 (38%), Gaps = 60/522 (11%)

Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEAL 222
           G  VG YL A+A  W  T NA +K +M  +   L + Q  +  GYL  +   L DS+   
Sbjct: 89  GEHVGKYLEAAANTWIITKNAALKTQMDRIFNELIKTQ--LPDGYLGTY---LPDSYWTS 143

Query: 223 KPVWAPYYTIHKI-LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
             VW     +HK  L GLL  Y +  + +AL  A  + +     +  +     + +    
Sbjct: 144 WDVW-----VHKYDLVGLLAYYRVTGDRRALTAAVKVGDLLLKNIGDLPGQKDIIKTGSH 198

Query: 282 LNEETGGMNDVLYRLYSITHDPKHL----LLAHLFDKPCFLGFLAL-----QADYLSHFH 332
           +      + D +  LY  T D ++L     +   +D P     +       Q D +++  
Sbjct: 199 VGMAATSVIDPMTDLYQWTGDRRYLDFCKYIIKAYDHPAGPSIVTTLLKEKQVDKVANGK 258

Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
           A   +  ++G    Y +TGD  Y        D + A   + TG TS  E +     L   
Sbjct: 259 AYEMLSNLVGIIKLYRLTGDEKYLQACRNAFDDIAAKRLFVTGTTSDHERFMPDNILQAD 318

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
             +   E C T   ++ +  LF  T ++ Y +  E+++ N +L  +   E G + Y  PL
Sbjct: 319 TAAHMGEGCVTTTWIQFNVQLFAITGDLKYYNEIEKSVYNHLLGAE-NPETGCVSYYTPL 377

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
             G+   R          +  CC  +     + L   + + +  N P + + +    + D
Sbjct: 378 -IGIKPYRC---------NITCCLSSVPRGIA-LIPYLNYGKLNNRPTVLLYE----AAD 422

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV--------GQLSSLNLRMPVWTYSNG 564
            K   V    +  P+      L++  TF  + +             +L LR+P W  +NG
Sbjct: 423 IKDRVVTAGGRETPVA-----LQINTTFPKEGKATIKVALPSAARFALQLRVPAW--ANG 475

Query: 565 AQASLNGQNLPLPPPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
            +A + G+        N L   +R W+  + + I   + +         P Y +I+    
Sbjct: 476 FKAVIAGKT--YTAQANELVVIDRNWARENIIAISFEIPVTVLQGGASYPNYIAIKR--- 530

Query: 624 GPYLLAGHTS--GEWDI-KTGTARSLSALISPIPPSFNAQLV 662
           GP +L+   S    +DI KT     ++  ++  P    AQ +
Sbjct: 531 GPQVLSADQSLNPSFDITKTAFRTPVAVQLTSTPAKLPAQWI 572


>gi|340619901|ref|YP_004738354.1| hypothetical protein zobellia_3937 [Zobellia galactanivorans]
 gi|339734698|emb|CAZ98075.1| Conserved hypothetical periplasmic protein [Zobellia
           galactanivorans]
          Length = 629

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 100/463 (21%), Positives = 182/463 (39%), Gaps = 57/463 (12%)

Query: 160 ELRGHFVGH--YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
           E+ G F+G    + AS ++ A +H+  + E  + +V  + + Q K   GY   +  E   
Sbjct: 78  EVVGAFIGMGMLIDASVRLAAYSHDPKMMEIKNEIVDKVIDEQLK--NGYSGFYKPE--- 132

Query: 218 SFEALKPVW-----APYYTIHK---ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
                + +W        + IH+   I+ GL   Y L  N ++LK A    ++      ++
Sbjct: 133 -----RRLWNSQGGGDNWDIHEMAFIIDGLTSDYELFGNKRSLKAAIKTADFIMEHWHEM 187

Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLA------HLFDKPCFLGFLAL 323
              Y+ E   + L+    G++  ++RLY  T + + L  +      + +D    +G    
Sbjct: 188 PDDYAAEVDMHVLDT---GIDWAIFRLYKTTGEKRFLNFSEKTKSLYQWDTKIEIGRRPG 244

Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA-REF 382
            + ++  + A     I +     Y  TG+          M    A       G++  RE 
Sbjct: 245 VSGHMFAYFAMCMAQIEL-----YRYTGNKELLQQTENAMRFFLAEDGLTISGSAGQREI 299

Query: 383 WWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
           W D +   + LG    ETC T    +V   L R T +  Y D  ER + NG+   Q   +
Sbjct: 300 WTDDQDGENELG----ETCATAYQTRVYESLLRLTGKAEYGDLIERTVYNGLFGAQ-SPD 354

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
            G + Y  P           H +  +   + CC G      S+L   +Y+  + +   + 
Sbjct: 355 GGKLRYYTPF------EGERHYYDVE---YMCCPGNFRRIISELPGMVYYRSKEDGVAVN 405

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
           +     +  +   G  V    V    S+    R+ L+ S  +       L+LR+P W  +
Sbjct: 406 LYAQSEARVELNDGITV---DVQQKTSYPTSGRVELSVSPNK--ASTFPLSLRIPSW--A 458

Query: 563 NGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLR 604
             A   +NG+       PG F+  T +W+  D++ +  P+ +R
Sbjct: 459 KEATIMVNGEKWQGEIKPGTFVDITRKWTSKDRVLLDFPMDIR 501


>gi|189467199|ref|ZP_03015984.1| hypothetical protein BACINT_03583 [Bacteroides intestinalis DSM
           17393]
 gi|189435463|gb|EDV04448.1| hypothetical protein BACINT_03583 [Bacteroides intestinalis DSM
           17393]
          Length = 175

 Score = 70.1 bits (170), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 40/93 (43%), Positives = 52/93 (55%), Gaps = 7/93 (7%)

Query: 132 DSLVWSFRKTASL-------PTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNAT 184
           + L+ SFR  A +           K  GGWE+   ELRGH  GH LSA A M+AST +  
Sbjct: 75  NRLLHSFRDNAGVFAGREGGDMTVKKLGGWESLDCELRGHTTGHLLSAYALMYASTGSEI 134

Query: 185 IKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
            K K  ++V  L+E Q  +G GYLSA+P EL +
Sbjct: 135 FKLKGDSLVTGLAEVQAALGNGYLSAYPEELIN 167


>gi|365847237|ref|ZP_09387726.1| hypothetical protein HMPREF0880_01230 [Yokenella regensburgei ATCC
           43003]
 gi|364572491|gb|EHM50031.1| hypothetical protein HMPREF0880_01230 [Yokenella regensburgei ATCC
           43003]
          Length = 659

 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 77/292 (26%), Positives = 112/292 (38%), Gaps = 41/292 (14%)

Query: 364 DIVNASHSYATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
           D + +   Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +
Sbjct: 306 DNMASRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEAD 362

Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
             YAD  ERAL N VL      +     Y+ PL          H    KFN  +      
Sbjct: 363 SRYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHIKPV 413

Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
                   CC        + LG  +Y   +     LYI  YI +S +       L   + 
Sbjct: 414 RQRWFGCACCPPNIARVLTSLGHYLYTSRD---EALYINLYIGNSVEIPVAGHALRLHIS 470

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
               W    ++++T  S   V    +L LR+P W  +  AQ  LNG+ +PL P   +L  
Sbjct: 471 GDYPWQE--QVSITVESPDTVNH--TLALRIPDWCVN--AQVMLNGEEIPLLPHKGYLHI 524

Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           T  W   DKL + LP+ +R           A   AI  GP  Y L    +GE
Sbjct: 525 TRDWQEGDKLLLTLPMPVRRVYANPLMRHAAGKIAIQRGPLVYCLEQADNGE 576


>gi|262381468|ref|ZP_06074606.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262296645|gb|EEY84575.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 623

 Score = 66.6 bits (161), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 74/315 (23%), Positives = 121/315 (38%), Gaps = 45/315 (14%)

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           Y+VT +PLY  +    M+ +        G  SA E W+  K L         ETC T+  
Sbjct: 269 YKVTKNPLYLSVVEKTMNHIINEEINVAGSGSAFECWYGGKALQTYPTYHTMETCVTFTW 328

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW- 465
           +++   +   T    YAD  E+A+ N +L+  +                ++K     GW 
Sbjct: 329 MQICDRMLGLTGNSLYADQIEKAMYNALLASLKAD-----------ASQIAKYSPLEGWR 377

Query: 466 -------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP-GLYIIQYISSSFDWKSGH 517
                  G   N   CC   G  +F+ +    Y      +   LY    +    D K   
Sbjct: 378 HEGEEQCGMHIN---CCNANGPRAFAMIPQFAYQVNGRRIDVNLYAASSVEVELD-KKTR 433

Query: 518 VVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
           V + Q+ D PI   D  +R+ +      +     ++ LR+P W  S     S+NG+ L  
Sbjct: 434 VSMTQETDYPI---DGQVRIVVEPEKTSDF----TIALRIPAW--SERTVVSVNGEPLTD 484

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
              G +L     W   D++T++L +  R   + +        QAI+ GP +LA  +    
Sbjct: 485 LLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARDSR--- 534

Query: 637 DIKTGTARSLSALIS 651
             K G     S ++S
Sbjct: 535 -FKDGDVDEASVIVS 548


>gi|436834929|ref|YP_007320145.1| hypothetical protein FAES_1542 [Fibrella aestuarina BUZ 2]
 gi|384066342|emb|CCG99552.1| hypothetical protein FAES_1542 [Fibrella aestuarina BUZ 2]
          Length = 636

 Score = 66.6 bits (161), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 75/310 (24%), Positives = 131/310 (42%), Gaps = 31/310 (10%)

Query: 347 YEVTGDPLYKL-IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
           Y +TG P YK  +   + +I +   + A  G+S  E W+  K L     +  +ETC T  
Sbjct: 282 YRLTGKPAYKAAVEKTWQNIRDTEINLAGSGSSV-ECWFGGKALQTLSINHYQETCVTAT 340

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
            +K+S+ L R T +  YAD  E+   N +L   +        Y  PL     +     G 
Sbjct: 341 WIKLSQQLLRLTGDARYADAIEQTYYNALLGSMKADGSDWTKYT-PLSGQRLEGGEQCGM 399

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ--YISSSFDWKSGHVVLNQK 523
           G       CC  +G      L  ++       V   +  +  Y++++   +S  V L Q+
Sbjct: 400 GLN-----CCVASGPRGLFTLPQTVVMSRADGVQVNFYAEGTYLANTPGGQS--VSLRQQ 452

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
            D  VS    L ++L  +         ++ +R+P W+    +  ++NGQ +P    G ++
Sbjct: 453 TDYPVSGQSTLHLSLPKTES------FTVRVRIPAWSVQ--STVTVNGQAVPTVVAGEYV 504

Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTA 643
           +    W   D+L++ L +  R   +  D P++    AI+ GP +L        D + G  
Sbjct: 505 AIKRTWQTGDQLSLTLDMRGRVVRL-GDMPQHL---AIVRGPVVLTR------DARLG-G 553

Query: 644 RSLSALISPI 653
            S+   ISP+
Sbjct: 554 PSVDETISPV 563


>gi|301309993|ref|ZP_07215932.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|423340426|ref|ZP_17318165.1| hypothetical protein HMPREF1059_04090 [Parabacteroides distasonis
           CL09T03C24]
 gi|300831567|gb|EFK62198.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|409227861|gb|EKN20757.1| hypothetical protein HMPREF1059_04090 [Parabacteroides distasonis
           CL09T03C24]
          Length = 623

 Score = 66.2 bits (160), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 74/315 (23%), Positives = 121/315 (38%), Gaps = 45/315 (14%)

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           Y+VT +PLY  +    M+ +        G  SA E W+  K L         ETC T+  
Sbjct: 269 YKVTKNPLYLSVVEKTMNHIINEEINVAGSGSAFECWYGGKALQTYPTYHTMETCVTFTW 328

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW- 465
           +++   +   T    YAD  E+A+ N +L+  +                ++K     GW 
Sbjct: 329 MQICDRMLGLTGNSLYADQIEKAMYNALLASLKAD-----------ASQIAKYSPLEGWR 377

Query: 466 -------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP-GLYIIQYISSSFDWKSGH 517
                  G   N   CC   G  +F+ +    Y      +   LY    +    D K   
Sbjct: 378 HEGEEQCGMHIN---CCNANGPRAFAMIPRFAYQVNGRRIDVNLYAASSVEVELD-KKTR 433

Query: 518 VVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
           V + Q+ D PI   D  +R+ +      +     ++ LR+P W  S     S+NG+ L  
Sbjct: 434 VSMTQETDYPI---DGQVRIVVEPEKTSDF----TIALRIPAW--SERTVVSVNGEPLTD 484

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
              G +L     W   D++T++L +  R   + +        QAI+ GP +LA  +    
Sbjct: 485 LLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARDSR--- 534

Query: 637 DIKTGTARSLSALIS 651
             K G     S ++S
Sbjct: 535 -FKDGDVDEASVIVS 548


>gi|423345501|ref|ZP_17323190.1| hypothetical protein HMPREF1060_00862 [Parabacteroides merdae
           CL03T12C32]
 gi|409223287|gb|EKN16224.1| hypothetical protein HMPREF1060_00862 [Parabacteroides merdae
           CL03T12C32]
          Length = 625

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 73/296 (24%), Positives = 115/296 (38%), Gaps = 47/296 (15%)

Query: 347 YEVTGDPLY-----KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
           Y+VTG+PLY     K +G    + +N +     G  SA E W+  K           ETC
Sbjct: 269 YKVTGNPLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETC 323

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
            T+  +++   L + T    YADY E A+ N +++  +     +  Y  PL         
Sbjct: 324 VTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKYS-PL--------- 373

Query: 462 THGW--------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
             GW        G   N   CC   G  +F+ +    Y  ++  V   +     +     
Sbjct: 374 -EGWRHEGEEQCGMHIN---CCNANGPRAFAMIPQFAYQVQDDCVRVNFYAPSEAELVLP 429

Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
               V L Q  D    +    ++ +     +E     ++ LR+P W  S  A  S+NGQ 
Sbjct: 430 DKKPVRLKQTTD----YPRTDQIEIEVDPAKETA--FTIALRIPAW--SKIAVVSVNGQP 481

Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
                 G +L    +W   D++T++L L  R         E    QAI+ GP +LA
Sbjct: 482 QDGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQAIVRGPIVLA 530


>gi|154495303|ref|ZP_02034308.1| hypothetical protein PARMER_04360 [Parabacteroides merdae ATCC
           43184]
 gi|423722505|ref|ZP_17696681.1| hypothetical protein HMPREF1078_00744 [Parabacteroides merdae
           CL09T00C40]
 gi|154085227|gb|EDN84272.1| hypothetical protein PARMER_04360 [Parabacteroides merdae ATCC
           43184]
 gi|409242350|gb|EKN35113.1| hypothetical protein HMPREF1078_00744 [Parabacteroides merdae
           CL09T00C40]
          Length = 625

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 73/296 (24%), Positives = 115/296 (38%), Gaps = 47/296 (15%)

Query: 347 YEVTGDPLY-----KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
           Y+VTG+PLY     K +G    + +N +     G  SA E W+  K           ETC
Sbjct: 269 YKVTGNPLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETC 323

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
            T+  +++   L + T    YADY E A+ N +++  +     +  Y  PL         
Sbjct: 324 VTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKYS-PL--------- 373

Query: 462 THGW--------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
             GW        G   N   CC   G  +F+ +    Y  ++  V   +     +     
Sbjct: 374 -EGWRHEGEEQCGMHIN---CCNANGPRAFAMIPQFAYQVQDDCVRVNFYAPSEAELVLP 429

Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
               V L Q  D    +    ++ +     +E     ++ LR+P W  S  A  S+NGQ 
Sbjct: 430 GKKPVRLKQTTD----YPRTDQIEIEVDPAKETA--FTIALRIPAW--SKIAVVSVNGQP 481

Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
                 G +L    +W   D++T++L L  R         E    QAI+ GP +LA
Sbjct: 482 QDGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQAIVRGPIVLA 530


>gi|423343638|ref|ZP_17321351.1| hypothetical protein HMPREF1077_02781 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409214660|gb|EKN07669.1| hypothetical protein HMPREF1077_02781 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 625

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 76/302 (25%), Positives = 115/302 (38%), Gaps = 59/302 (19%)

Query: 347 YEVTGDPLY-----KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
           Y+VTG+PLY     K +G    + +N +     G  SA E W+  K           ETC
Sbjct: 269 YKVTGNPLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETC 323

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
            T+  +++   L + T    YADY E A+ N +++  +     +  Y  PL         
Sbjct: 324 VTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKYS-PL--------- 373

Query: 462 THGW--------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
             GW        G   N   CC   G  +F+ +    Y  ++  V   +     +     
Sbjct: 374 -EGWRHEGEEQCGMHIN---CCNANGPRAFAMIPGFAYQVQDDCVRVNFYAPSEAELVLP 429

Query: 514 KSGHVVLNQ-----KVDPI-VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
               V L Q     + D I +  DP    T T +            LR+P W  S  A  
Sbjct: 430 GKKSVWLRQTTEYPRTDQIEIEVDPTKETTFTIA------------LRIPAW--SKIATV 475

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           S+NG+       G +L    +W   D++T++L L  R         E    QAI+ GP +
Sbjct: 476 SVNGRPEAGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQAIVRGPLV 528

Query: 628 LA 629
           LA
Sbjct: 529 LA 530


>gi|218261883|ref|ZP_03476568.1| hypothetical protein PRABACTJOHN_02239 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223731|gb|EEC96381.1| hypothetical protein PRABACTJOHN_02239 [Parabacteroides johnsonii
           DSM 18315]
          Length = 625

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 76/302 (25%), Positives = 115/302 (38%), Gaps = 59/302 (19%)

Query: 347 YEVTGDPLY-----KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
           Y+VTG+PLY     K +G    + +N +     G  SA E W+  K           ETC
Sbjct: 269 YKVTGNPLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETC 323

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
            T+  +++   L + T    YADY E A+ N +++  +     +  Y  PL         
Sbjct: 324 VTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKYS-PL--------- 373

Query: 462 THGW--------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
             GW        G   N   CC   G  +F+ +    Y  ++  V   +     +     
Sbjct: 374 -EGWRHEGEEQCGMHIN---CCNANGPRAFAMIPGFAYQVQDDCVRVNFYAPSEAELVLP 429

Query: 514 KSGHVVLNQ-----KVDPI-VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
               V L Q     + D I +  DP    T T +            LR+P W  S  A  
Sbjct: 430 GKKSVWLRQTTEYPRTDQIEIEVDPTKETTFTIA------------LRIPAW--SKIATV 475

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           S+NG+       G +L    +W   D++T++L L  R         E    QAI+ GP +
Sbjct: 476 SVNGRPEAGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQAIVRGPLV 528

Query: 628 LA 629
           LA
Sbjct: 529 LA 530


>gi|256840863|ref|ZP_05546371.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|256738135|gb|EEU51461.1| conserved hypothetical protein [Parabacteroides sp. D13]
          Length = 625

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 73/315 (23%), Positives = 121/315 (38%), Gaps = 45/315 (14%)

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           Y+VT +PLY  +    M+ +        G  SA E W+  K L         ETC T+  
Sbjct: 271 YKVTKNPLYLSVVEKTMNHIINEEINVAGSGSAFECWYGGKALQTYPTYHTMETCVTFTW 330

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW- 465
           +++   +   T    YAD  E+A+ N +L+  +                ++K     GW 
Sbjct: 331 MQICDRMLGLTGNSLYADQIEKAMYNALLASLKAD-----------ASQIAKYSPLEGWR 379

Query: 466 -------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP-GLYIIQYISSSFDWKSGH 517
                  G   N   CC   G  +F+ +    Y      +   LY    +    D K   
Sbjct: 380 HEGEEQCGMHIN---CCNANGPRAFAMIPQFAYQINGRRIDVNLYAASSVEVELD-KKTR 435

Query: 518 VVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
           V + Q+ + PI   D  +R+ +      +     ++ LR+P W  S     S+NG+ L  
Sbjct: 436 VSMTQETNYPI---DGQVRIVVEPEKTSDF----TIALRIPAW--SERTVVSVNGEPLTD 486

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
              G +L     W   D++T++L +  R   + +        QAI+ GP +LA  +    
Sbjct: 487 LLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARDSR--- 536

Query: 637 DIKTGTARSLSALIS 651
             K G     S ++S
Sbjct: 537 -FKDGDVDEASVIVS 550


>gi|150007964|ref|YP_001302707.1| hypothetical protein BDI_1325 [Parabacteroides distasonis ATCC
           8503]
 gi|149936388|gb|ABR43085.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
          Length = 623

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 73/315 (23%), Positives = 121/315 (38%), Gaps = 45/315 (14%)

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           Y+VT +PLY  +    M+ +        G  SA E W+  K L         ETC T+  
Sbjct: 269 YKVTKNPLYLSVVEKTMNHIINEEINVAGSGSAFECWYGGKALQTYPTYHTMETCVTFTW 328

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW- 465
           +++   +   T    YAD  E+A+ N +L+  +                ++K     GW 
Sbjct: 329 MQICDRMLGLTGNSLYADQIEKAMYNALLASLKAD-----------ASQIAKYSPLEGWR 377

Query: 466 -------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP-GLYIIQYISSSFDWKSGH 517
                  G   N   CC   G  +F+ +    Y      +   LY    +    D K   
Sbjct: 378 HEGEEQCGMHIN---CCNANGPRAFAMIPQFAYQINGRRIDVNLYAASSVEVELD-KKTR 433

Query: 518 VVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
           V + Q+ + PI   D  +R+ +      +     ++ LR+P W  S     S+NG+ L  
Sbjct: 434 VSMTQETNYPI---DGQVRIVVEPEKTSDF----TIALRIPAW--SERTVVSVNGEPLTD 484

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
              G +L     W   D++T++L +  R   + +        QAI+ GP +LA  +    
Sbjct: 485 LLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARDSR--- 534

Query: 637 DIKTGTARSLSALIS 651
             K G     S ++S
Sbjct: 535 -FKDGDVDEASVIVS 548


>gi|429738051|ref|ZP_19271876.1| hypothetical protein HMPREF9151_00303 [Prevotella saccharolytica
           F0055]
 gi|429161156|gb|EKY03584.1| hypothetical protein HMPREF9151_00303 [Prevotella saccharolytica
           F0055]
          Length = 603

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 68/283 (24%), Positives = 115/283 (40%), Gaps = 20/283 (7%)

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           Y +TG+  YK         +  +    TG  SA E W+  K++        +ETC T   
Sbjct: 247 YRLTGNESYKAAVEKTWQSIMDTEINITGSGSAMESWFGGKQVQYMPIKHYQETCVTATW 306

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
           +K+SR L   T    YAD  E++L N +L   R        Y    G+ +  +      G
Sbjct: 307 IKLSRQLLMLTGNSKYADAIEQSLYNALLGAMRPDGSDWAKYTPLSGQRLPGSEQC---G 363

Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFE-EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
              N   CC  +G      +  +   +  EG V  LYI    +          ++ Q   
Sbjct: 364 MGLN---CCTASGPRGLFVIPQTAVMQSSEGAVVNLYIPGTYTLQSPKNKTVTLVQQGEY 420

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
           P         M + F ++Q   +  +L+LR+P W  S   + ++NGQ +     G++L  
Sbjct: 421 PKTG-----NMRIVFQAQQP--EEMTLSLRIPAW--SKTTRVAVNGQEVSAVRSGSYLQI 471

Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
             +WS  D++ + + +  +   +  + P+Y    AI  GP +L
Sbjct: 472 NRQWSAGDRVELTMDMQAQLHFMGTN-PQYL---AITRGPVVL 510


>gi|255034442|ref|YP_003085063.1| hypothetical protein Dfer_0635 [Dyadobacter fermentans DSM 18053]
 gi|254947198|gb|ACT91898.1| protein of unknown function DUF1680 [Dyadobacter fermentans DSM
           18053]
          Length = 656

 Score = 62.8 bits (151), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 84/370 (22%), Positives = 151/370 (40%), Gaps = 55/370 (14%)

Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF--LALQADYL 328
           T     R W S ++E   +   L +LY +TH+ ++L LA  F +    G+    +  ++ 
Sbjct: 191 TFRVANRPWVSGHQE---IELALMKLYHLTHEDRYLKLADWFLEQRGRGYGKGKIWDEWK 247

Query: 329 SHFHANTHIPI-----VIGSQMR-----------YEVTGDPLYKLIGTFFMDIVNASHSY 372
              +    +P+     + G  +R             VTGDP Y    T   + V   + Y
Sbjct: 248 DPKYCQDDVPVKQQKEITGHAVRAMYQYTGAADVASVTGDPGYMNAMTAVWEDVVYRNMY 307

Query: 373 ATGGTSA---REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERA 429
            TGG  +    E + D   L +  G+   ETC +  M+  ++ +   T +  Y D  ER+
Sbjct: 308 LTGGIGSSGHNEGFTDDYDLPN--GAAYSETCASVGMVFWNQRMNALTGDAKYIDVLERS 365

Query: 430 LTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDS 489
           L NG L     T      Y  PL    + ARS   +GT      CC        + +GD 
Sbjct: 366 LYNGALDGLSLT-GDRFFYGNPLSSIGNNARSAW-FGTA-----CCPSNIARLVASVGDY 418

Query: 490 IYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL 549
           IY + +G +   ++  ++ S+  ++ G   +  ++     W+  +R+ +T   K +    
Sbjct: 419 IYGKADGKI---WVNLFVGSNTTFQVGKTAVPLQMSTDYPWNGSIRIKVTPPQKVKY--- 472

Query: 550 SSLNLRMPVW--------------TYSNG-AQASLNGQNLPLPPPGNFLSATERWSYNDK 594
            +LN+R+P W                 NG  +  LNG+++       +      W   D+
Sbjct: 473 -ALNVRIPGWAAGTPVPGGLYNFAAAGNGRVEVLLNGKSVNYQSDKGYAVIDRTWQNGDE 531

Query: 595 LTIQLPLSLR 604
           + ++LP+ +R
Sbjct: 532 IEVRLPMDVR 541


>gi|423122678|ref|ZP_17110362.1| hypothetical protein HMPREF9690_04684 [Klebsiella oxytoca 10-5246]
 gi|376391959|gb|EHT04626.1| hypothetical protein HMPREF9690_04684 [Klebsiella oxytoca 10-5246]
          Length = 653

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 81/365 (22%), Positives = 132/365 (36%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYL 328
            L RLY IT +P++L L + F      +P F                    ++ +   Y 
Sbjct: 192 ALMRLYDITQEPRYLALVNYFVEERGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYS 251

Query: 329 SHFHANTHIPIVIGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
                 +  P+ IG  +R  Y +TG         D   +       + +     Y TGG 
Sbjct: 252 QAHQPISEQPVAIGHAVRFVYLMTGVAHLARLSQDEGKRQDCLRLWNNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPTSLKFNHIYDHVKPVRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY   +     LYI  Y+ +S +   G   L  ++     W   +++ + 
Sbjct: 420 ARVLTSLGHYIYTPHQD---ALYINLYVGNSAEIPVGDETLRLRISGNYPWQEQVKIAVD 476

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             +        +L LR+P W   +  Q +LNG+ +       +L  + RW   D L + L
Sbjct: 477 SPTPIN----HTLALRLPDWC--DNPQVTLNGKPVAQDVRKGYLHISHRWQEGDTLLLTL 530

Query: 600 PLSLR 604
           P+ +R
Sbjct: 531 PMPVR 535


>gi|409730702|ref|ZP_11272263.1| hypothetical protein Hham1_15864 [Halococcus hamelinensis 100A6]
 gi|448723717|ref|ZP_21706233.1| hypothetical protein C447_11225 [Halococcus hamelinensis 100A6]
 gi|445787256|gb|EMA38004.1| hypothetical protein C447_11225 [Halococcus hamelinensis 100A6]
          Length = 639

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/227 (24%), Positives = 95/227 (41%), Gaps = 17/227 (7%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC     +  ++ +   T +  YAD  ER L NG L+   G E     Y  PL      
Sbjct: 335 ETCAAIGSVFWNQRMLERTGDAKYADLIERTLYNGFLA-GVGLEGKEFFYENPLESSGDH 393

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
            R   GW T      CC       F+ LG  +Y ++  +   L++ QY+ S    + G  
Sbjct: 394 HRK--GWFTCA----CCPPNAARLFASLGGYLYGDDGDD---LFVHQYVGSRVSTEVGGT 444

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
            ++  V+  + W   + + +T S     G+  +L LR+P W  S G    +NG+++    
Sbjct: 445 AVDLDVETDLPWSGDVSLDVTASE----GESFALRLRVPAW--SEGTTVEVNGESVDAAV 498

Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
              +L+    W+ +D + +    +++T          A + A+  GP
Sbjct: 499 EDGYLALDREWT-DDTVELTFEQTVQTVRAHPAVEADAGLVAVERGP 544


>gi|432865910|ref|ZP_20088760.1| hypothetical protein A311_04524 [Escherichia coli KTE146]
 gi|431401839|gb|ELG85171.1| hypothetical protein A311_04524 [Escherichia coli KTE146]
          Length = 654

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 94/404 (23%), Positives = 145/404 (35%), Gaps = 87/404 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q +LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQVTLNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
            L + LP+ +R           A   AI  GP  Y L    +GE
Sbjct: 525 TLNLTLPMPVRRVYGNPLMRHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|386626404|ref|YP_006146132.1| hypothetical protein CE10_4140 [Escherichia coli O7:K1 str. CE10]
 gi|349740140|gb|AEQ14846.1| hypothetical protein CE10_4140 [Escherichia coli O7:K1 str. CE10]
          Length = 573

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 85/370 (22%), Positives = 135/370 (36%), Gaps = 85/370 (22%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
           L RLY +T +P++L L + F      +P +      +    SH+H             + 
Sbjct: 193 LMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYSQ 252

Query: 335 THIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
            H+PI      IG  +R+      +Y + G   +  ++   S                 Y
Sbjct: 253 AHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQLY 306

Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMER 363

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
           AL N VL      +     Y+ PL          H    KFN  +              C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCAC 414

Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
           C        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W    
Sbjct: 415 CPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE-- 469

Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
           ++T+   S Q V    +L LR+P W      Q +LNG+ +       +L  T  W   D 
Sbjct: 470 QVTIAVESPQPVRH--TLALRLPDWC--TQPQITLNGEEVEQDIRKGYLHITREWQEGDT 525

Query: 595 LTIQLPLSLR 604
           L + LP+ +R
Sbjct: 526 LNLTLPMPVR 535


>gi|323344406|ref|ZP_08084631.1| hypothetical protein HMPREF0663_11167 [Prevotella oralis ATCC
           33269]
 gi|323094533|gb|EFZ37109.1| hypothetical protein HMPREF0663_11167 [Prevotella oralis ATCC
           33269]
          Length = 627

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 65/259 (25%), Positives = 110/259 (42%), Gaps = 26/259 (10%)

Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
           TG  SA E W+  K++        +ETC T   +K+SR L   T    YAD  E++L N 
Sbjct: 300 TGSGSAMESWFGGKQVQYMPIKHYQETCVTATWIKLSRQLLMLTGNSKYADAIEQSLYNA 359

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
           +L   +        Y  PL     +     G G       CC  +G      +  +   +
Sbjct: 360 LLGAMKSDGSDWAKYT-PLSGQRLQGSEQCGMGLN-----CCTASGPRGLFIIPQTAVMQ 413

Query: 494 EEGNVPGLYIIQYISSSFDWKS---GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
              ++ G  I  YI  ++  +S     +++ Q+ D    +     + + F  KQ   +  
Sbjct: 414 ---SIKGAVINLYIPGTYTLQSPKGQEIIITQQGD----YPQTGTVRIAFKVKQT--EEF 464

Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA-IQ 609
           +L+LR+P W  S   + +LNG ++     G++L    +WS  D   ++L L +R +    
Sbjct: 465 TLSLRIPEW--SKDTKVTLNGNDVVPAHNGSYLQINRKWSDGDH--VELVLDMRAQLHFM 520

Query: 610 DDRPEYASIQAILFGPYLL 628
            + P+Y    AI  GP +L
Sbjct: 521 GENPQYL---AITRGPVVL 536


>gi|284122982|ref|ZP_06386886.1| protein of unknown function DUF1680 [Candidatus Poribacteria sp.
           WGA-A3]
 gi|283829311|gb|EFC33713.1| protein of unknown function DUF1680 [Candidatus Poribacteria sp.
           WGA-A3]
          Length = 577

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 115/485 (23%), Positives = 192/485 (39%), Gaps = 92/485 (18%)

Query: 171 SASAQMWASTH-NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV--WA 227
           +AS  +W  TH N T + ++  V+  ++ CQ     GYL+++       F  ++P   W 
Sbjct: 21  AASYTLW--THPNPTWEPELDEVIAKIAACQQP--DGYLNSY-------FTLVEPTKRWQ 69

Query: 228 PYYTIHKI-LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
               +H++  AG L +  +A + QA    T +++        +   +  ++       E 
Sbjct: 70  NLGMMHELYCAGHLFEAAVA-HYQATGKQT-LLDVACRFADLIDNTFGFDKRDGLPGHE- 126

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLF------------------DKPCFLG----FLALQ 324
            G+   L +L  +T +P+++ LA  F                  D P  LG         
Sbjct: 127 -GIELALVKLARVTGEPRYMALAEYFVTRRGHSPSIFEKELENPDLPGGLGAYQHHFTRD 185

Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG---TSARE 381
             Y  H+ A  H+PI    Q + E  G      +   ++    A  +Y TG    T+A E
Sbjct: 186 GKYEGHY-AQAHLPI----QEQTECVG----HAVRAMYLYSGAADIAYETGDSAITNALE 236

Query: 382 FWWD--PKRLADTLG----SENE---------------ETCTTYNMLKVSRHLFRWTKEI 420
             W    KRL  T G      NE               ETC +  ++  +  +F    E 
Sbjct: 237 ALWQNVGKRLYITGGVGPSGHNEGFTTDYELPNFSAYAETCASIGLIFWAHRMFLLRAES 296

Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
            + D  E AL NG LS       G   Y  PL     + R  H W   F    CC     
Sbjct: 297 RFVDVLETALYNGALSGISLDGTG-FFYQNPLASHGDRHR--HEW---FGCA-CCPPNIA 349

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD-WKSGHVVLNQKVDPIVSWDPYLRMTLT 539
              + +G  IY E E    G+Y+  Y+S + D   +G+V +    +    W   + +T+T
Sbjct: 350 RLLASVGQYIYAESE---EGIYVNLYVSITADAIAAGNVPVRLTQETDYPWAGDVTLTIT 406

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ-NLPLPPPGNFLSATERWSYNDKLTIQ 598
            ++        +LNLR+P W   +  +  +NG+ +   P    +L+ T  W   D++ +Q
Sbjct: 407 PTTPVPF----TLNLRIPGW--CDQCEVRVNGEADNSQPNATGYLTITREWRAGDRVQLQ 460

Query: 599 LPLSL 603
           LP+ +
Sbjct: 461 LPMPV 465


>gi|387609318|ref|YP_006098174.1| hypothetical protein EC042_3892 [Escherichia coli 042]
 gi|419917404|ref|ZP_14435664.1| hypothetical protein ECKD2_05705 [Escherichia coli KD2]
 gi|284923618|emb|CBG36715.1| conserved hypothetical protein [Escherichia coli 042]
 gi|388394341|gb|EIL55642.1| hypothetical protein ECKD2_05705 [Escherichia coli KD2]
          Length = 656

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 95/404 (23%), Positives = 145/404 (35%), Gaps = 87/404 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L LA+ F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALANYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVGQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
            L + LP+ +R           A   AI  GP  Y L    +GE
Sbjct: 525 TLNLTLPMPVRRVYGNPQVRHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|301020201|ref|ZP_07184325.1| conserved hypothetical protein [Escherichia coli MS 69-1]
 gi|300398864|gb|EFJ82402.1| conserved hypothetical protein [Escherichia coli MS 69-1]
          Length = 664

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 95/404 (23%), Positives = 145/404 (35%), Gaps = 87/404 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L LA+ F      +P +      +    SH+H             +
Sbjct: 200 ALMRLYEVTEEPRYLALANYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 260 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 313

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 370

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPIRQRWFGCA 421

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 422 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 477

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 478 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVGQDIRKGYLHITREWQEGD 532

Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
            L + LP+ +R           A   AI  GP  Y L    +GE
Sbjct: 533 TLNLTLPMPVRRVYGNPQVRHVAGKVAIQRGPLVYCLEQADNGE 576


>gi|298248099|ref|ZP_06971904.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
           44963]
 gi|297550758|gb|EFH84624.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
           44963]
          Length = 638

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 81/320 (25%), Positives = 132/320 (41%), Gaps = 34/320 (10%)

Query: 332 HANTHIPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
           HA   + +  G+   Y  TG+  L   I   + D+      Y TGG  +R   +D + + 
Sbjct: 260 HAVRALYLYAGATDAYTETGEQALLHAINALWADL-QQHKVYVTGGVGSR---YDGEAVG 315

Query: 391 DTLGSENE----ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGV 445
           ++    N+    ETC     +  +  L   T    YAD  E  L NG+L+ I    E   
Sbjct: 316 ESYELPNDQAYTETCAAIAHIMWAWRLLLLTGNALYADAMELTLYNGMLAGISLDGE--S 373

Query: 446 MIYMLPLG-RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
             Y  PL  RG  + R    +GT      CC        + L   IY   + +   L++ 
Sbjct: 374 YFYQNPLADRG--RHRRQPWFGTA-----CCPPNVARLLASLPGYIYTTSDAD---LWVH 423

Query: 505 QYISSSFDWKSGH-VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
            Y SS  + +     VL  K      W+  +++++     ++   +  LNLR+P W  ++
Sbjct: 424 LYTSSEANVRLPQGSVLKCKQTSNYPWEGKIKLSI---EPKQANAIFGLNLRIPAW--AH 478

Query: 564 GAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
           GA  S+NG+ LP P  PG++      W   D++ + LPL +R               A+L
Sbjct: 479 GATVSVNGETLPPPIQPGSYYRIERTWQPGDQVELVLPLLMRAVTSHPYISNNNGRVALL 538

Query: 623 FGPYLL----AGHTSGEWDI 638
            GP +     + H +  WD+
Sbjct: 539 RGPLVYCVEQSDHEADVWDL 558


>gi|337749269|ref|YP_004643431.1| hypothetical protein KNP414_05037 [Paenibacillus mucilaginosus
           KNP414]
 gi|336300458|gb|AEI43561.1| protein of unknown function DUF1680 [Paenibacillus mucilaginosus
           KNP414]
          Length = 660

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 94/386 (24%), Positives = 140/386 (36%), Gaps = 69/386 (17%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
            L +LY  T + ++L LA  F      +P FL     Q D  SH+ A   +PI    QM 
Sbjct: 195 ALVKLYGATGEERYLKLAQFFIDERGTEPNFLVEECRQRDGYSHW-AKKKLPIPTAEQMA 253

Query: 347 YEVTGDPLYKL-------IGTFFMDIVNASHSYATGGT----SAREFWWDPKR----LAD 391
           Y     P+ +        +   +M    A  +  TG      + R  W +  +    +  
Sbjct: 254 YNQAHKPVRQQDTAVGHSVRAVYMYTAMADLARLTGDAELLEACRRLWDNTTKKQMYITG 313

Query: 392 TLGSENE-----------------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
            +GS +                  ETC +  ++  +R + +   +  YAD  ERAL N V
Sbjct: 314 GIGSTHHGEAFSFDYDLPNDTVYAETCASIGLIFFARRMLQLEAKSEYADVLERALYNNV 373

Query: 435 LS--IQRGTEPGVMIYMLPL-----------GRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
           +    Q G       Y+ PL           GR   KA     +G       CC      
Sbjct: 374 IGSMSQDGKH---YFYVNPLEVWPKASEQNPGRHHVKAVRQPWFGCS-----CCPPNVAR 425

Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
             S L D IY    G+   +Y   +I S  SF   +G V L Q+    + W+   R  LT
Sbjct: 426 LLSSLNDYIYSASPGDNT-VYTHLFIGSEASFTLAAGQVALKQESR--LPWEGCARFELT 482

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
              +  V    +L LR+P W+    A+  +NG          +   T RW+  D +    
Sbjct: 483 AVPEAPV----TLALRIPSWS-GGRAELRINGAAEAYEVENGYAVVTRRWTAGDVVEWAP 537

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP 625
            L  +  A   +    A   AI  GP
Sbjct: 538 ALQAQLTAAHPEIRANAGRAAIERGP 563


>gi|417116562|ref|ZP_11967423.1| putative glycosyhydrolase [Escherichia coli 1.2741]
 gi|422801520|ref|ZP_16850016.1| hypothetical protein ERJG_02686 [Escherichia coli M863]
 gi|323965978|gb|EGB61421.1| hypothetical protein ERJG_02686 [Escherichia coli M863]
 gi|386139106|gb|EIG80261.1| putative glycosyhydrolase [Escherichia coli 1.2741]
          Length = 656

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q +LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQITLNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|417664178|ref|ZP_12313758.1| secreted protein [Escherichia coli AA86]
 gi|330909651|gb|EGH38165.1| secreted protein [Escherichia coli AA86]
          Length = 657

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 108/499 (21%), Positives = 181/499 (36%), Gaps = 97/499 (19%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
           V  +L A A       +A +++    V+  ++  Q K   GYL+ + T      +A +  
Sbjct: 74  VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCK--DGYLNTYFT-----VKAPEER 126

Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
           W+     H++     L++  V    A   +    +V    + +  V      + H Y  +
Sbjct: 127 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLTDHIDSVFGPDESKLHGYPGH 186

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
            E   +   L RLY +T +P++L L + F      +P +      +    SH+H      
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243

Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
                  +  H+PI      IG  +R+      +Y + G   +  ++   S         
Sbjct: 244 MVKDKAYSQAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLW 297

Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
                   Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354

Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
             YAD  ERAL N VL      +     Y+ PL          H    KFN  +      
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPI 405

Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
                   CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V 
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
               W    ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWCIQ--PQIILNGEEVEQDIRKGYLHI 516

Query: 586 TERWSYNDKLTIQLPLSLR 604
           T  W   D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535


>gi|416899982|ref|ZP_11929388.1| hypothetical protein ECSTEC7V_4230 [Escherichia coli STEC_7v]
 gi|327251242|gb|EGE62935.1| hypothetical protein ECSTEC7V_4230 [Escherichia coli STEC_7v]
          Length = 656

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q +LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQITLNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|293413020|ref|ZP_06655688.1| conserved hypothetical protein [Escherichia coli B354]
 gi|291468667|gb|EFF11160.1| conserved hypothetical protein [Escherichia coli B354]
          Length = 656

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q +LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWCAQ--PQVTLNGEEVGQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|170681898|ref|YP_001745874.1| hypothetical protein EcSMS35_3909 [Escherichia coli SMS-3-5]
 gi|170519616|gb|ACB17794.1| conserved hypothetical protein [Escherichia coli SMS-3-5]
          Length = 656

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q +LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQITLNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|336239737|ref|XP_003342727.1| hypothetical protein SMAC_10375 [Sordaria macrospora k-hell]
          Length = 159

 Score = 60.1 bits (144), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 35/98 (35%), Positives = 51/98 (52%), Gaps = 2/98 (2%)

Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYL 170
           Q S    A   N  YLL LD + L+ +F  +A LP P   YGGWE     + GH +GH+L
Sbjct: 60  QPSPFADAFAANRRYLLDLDPERLLHNFYISAGLPAPKPVYGGWE--AQGIAGHSLGHWL 117

Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL 208
           SA A   A++ +A I  ++   +  ++  Q   G GY+
Sbjct: 118 SACALTVANSGDAAIAARLDHALKEMARIQAAHGDGYV 155


>gi|422783824|ref|ZP_16836607.1| hypothetical protein ERFG_04064 [Escherichia coli TW10509]
 gi|323975001|gb|EGB70110.1| hypothetical protein ERFG_04064 [Escherichia coli TW10509]
          Length = 656

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSHYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q +LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQITLNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|423105419|ref|ZP_17093121.1| hypothetical protein HMPREF9686_04025 [Klebsiella oxytoca 10-5242]
 gi|376380736|gb|EHS93479.1| hypothetical protein HMPREF9686_04025 [Klebsiella oxytoca 10-5242]
          Length = 653

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 79/370 (21%), Positives = 131/370 (35%), Gaps = 85/370 (22%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
           L RLY +T +P+++ L   F      +P F                    ++ +   Y  
Sbjct: 193 LMRLYDVTQEPRYMALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
                +  P+ IG  +R+      +Y + G   +  ++                     Y
Sbjct: 253 AHQPISEQPVAIGHAVRF------VYLMTGVAHLARLSQDEGKRRDCLRLWKNMARRQLY 306

Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
           AL N VL      +     Y+ PL          H    KFN  +              C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPVRQRWFGCAC 414

Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
           C        + LG  IY   +     LYI  YI +S +   G+  L  ++     W   +
Sbjct: 415 CPPNIARVLTSLGHYIYTPHDD---ALYINLYIGNSVEIPVGNEALRLRISGNYPWQEQV 471

Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
           ++ +  SS        +L LR+P W   +  Q +LNG  +       +L  +  W   D 
Sbjct: 472 KIVIDSSSPVN----HTLALRLPDWC--DKPQVTLNGAPVTQDVRKGYLHISHLWQEGDT 525

Query: 595 LTIQLPLSLR 604
           L + LP+ +R
Sbjct: 526 LQLTLPMPVR 535


>gi|152968091|ref|YP_001363875.1| hypothetical protein Krad_4148 [Kineococcus radiotolerans SRS30216]
 gi|151362608|gb|ABS05611.1| protein of unknown function DUF1680 [Kineococcus radiotolerans
           SRS30216]
          Length = 652

 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 66/253 (26%), Positives = 109/253 (43%), Gaps = 39/253 (15%)

Query: 368 ASHSYATGGTSAREFWWDPKRLAD--TLGSENE--ETCTTYNMLKVSRHLFRWTKEIAYA 423
           AS +Y TGG  AR   WD ++  D   LG E    ETC     ++ +  +   T E  YA
Sbjct: 301 ASKTYVTGGIGAR---WDWEQFGDHYELGPERAYAETCAAIGSVQWTWRMLLATGEARYA 357

Query: 424 DYYERALTNGVLSIQRGTEPGVMI----------YMLPLGRGVSKARST-HGWGTKFNSF 472
           D  ER L N  L       PGV +            L  G    + RS  HG    F+  
Sbjct: 358 DLVERTLYNAFL-------PGVSLAGTEYFYVNALQLRHGAFAEEERSVAHGRRPWFDCA 410

Query: 473 WCCYGTGIESFSKLGDSIYFEEEGN-VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWD 531
            CC    + + S L   +      + V G+ + Q+ + + +  +    L+   D    WD
Sbjct: 411 -CCPPNIMRTLSSLDAYVATSSATDGVAGVQVHQFTTGTIE--AAGAALSVTTD--YPWD 465

Query: 532 PYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSY 591
             +R+ +T +  +       L LR+P W  + GA A+++G+ + +  PG +L     ++ 
Sbjct: 466 GTVRVEVTATPGE-----FELALRVPAW--AQGATATVDGEAVAV-TPGEYLRVRRDFAV 517

Query: 592 NDKLTIQLPLSLR 604
            D + + LP+++R
Sbjct: 518 GDVVELVLPMTVR 530


>gi|317048885|ref|YP_004116533.1| hypothetical protein Pat9b_2677 [Pantoea sp. At-9b]
 gi|316950502|gb|ADU69977.1| protein of unknown function DUF1680 [Pantoea sp. At-9b]
          Length = 651

 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 68/271 (25%), Positives = 100/271 (36%), Gaps = 39/271 (14%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H     FN  +              CC        + LG  IY   E     LYI 
Sbjct: 387 --EVHPKTLSFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTPRE---EALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +   G   L  +++    W     +T+T  S Q V    +L LR+P W   + 
Sbjct: 442 LYVGNSLEVPVGEQTLRLRINGNFPWQE--TVTITIDSPQPVQH--TLALRLPDWC--DA 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
            Q +LN   +       +L     WS  D LT+ LP+ +R           A   A+  G
Sbjct: 496 PQVTLNDAAVASDIRKGYLHINRSWSEGDTLTLTLPMPVRRVYGNPLVRHVAGKVALQRG 555

Query: 625 P--YLLAGHTSGE-----WDIKTGTARSLSA 648
           P  Y L    +GE     W  +T T R+   
Sbjct: 556 PLVYCLEQADNGEELHNLWLPQTATFRTFEG 586


>gi|402843427|ref|ZP_10891823.1| putative glycosyhydrolase [Klebsiella sp. OBRC7]
 gi|402277059|gb|EJU26151.1| putative glycosyhydrolase [Klebsiella sp. OBRC7]
          Length = 653

 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 79/370 (21%), Positives = 131/370 (35%), Gaps = 85/370 (22%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
           L RLY +T +P+++ L   F      +P F                    ++ +   Y  
Sbjct: 193 LMRLYDVTQEPRYMALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
                +  P+ IG  +R+      +Y + G   +  ++                     Y
Sbjct: 253 AHQPISEQPVAIGHAVRF------VYLMTGVAHLARLSQDEGKRRDCLRLWKNMARRQLY 306

Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
           AL N VL      +     Y+ PL          H    KFN  +              C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPVRQRWFGCAC 414

Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
           C        + LG  IY   +     LYI  YI +S +   G+  L  ++     W   +
Sbjct: 415 CPPNIARVLTSLGHYIYTPHDD---ALYINLYIGNSVEIPVGNEALRLRISGNYPWQEQV 471

Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
           ++ +  SS        +L LR+P W   +  Q +LNG  +       +L  +  W   D 
Sbjct: 472 KIVIDSSSPVN----HTLALRLPDWC--DKPQVTLNGAPVTQDVRKGYLHISHLWREGDT 525

Query: 595 LTIQLPLSLR 604
           L + LP+ +R
Sbjct: 526 LQLTLPMPVR 535


>gi|331685249|ref|ZP_08385835.1| putative cytoplasmic protein [Escherichia coli H299]
 gi|450194438|ref|ZP_21892361.1| hypothetical protein A364_18755 [Escherichia coli SEPT362]
 gi|331077620|gb|EGI48832.1| putative cytoplasmic protein [Escherichia coli H299]
 gi|449316669|gb|EMD06777.1| hypothetical protein A364_18755 [Escherichia coli SEPT362]
          Length = 656

 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 108/499 (21%), Positives = 182/499 (36%), Gaps = 97/499 (19%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
           V  +L A A       +A +++    V+  ++  Q +   GYL+A+ T      +A +  
Sbjct: 74  VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNAYFT-----VKAPEER 126

Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
           W+     H++     L++  V    A   +    +V    + +  V      + H Y  +
Sbjct: 127 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLADHIDSVFGPGESKLHGYPGH 186

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
            E   +   L RLY +T +P++L L + F      +P +      +    SH+H      
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243

Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
                  +  H+PI      IG  +R+      +Y + G   +  ++   S         
Sbjct: 244 MVKDKAYSQAHLPIAQQQTAIGHTVRF------VYLMTGVAHLARLSHDESKRQDCLRLW 297

Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
                   Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354

Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
             YAD  ERAL N VL      +     Y+ PL          H    KFN  +      
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPI 405

Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
                   CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V 
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
               W    ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 516

Query: 586 TERWSYNDKLTIQLPLSLR 604
           T  W   D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535


>gi|432618844|ref|ZP_19854944.1| hypothetical protein A1UM_04296 [Escherichia coli KTE75]
 gi|431151056|gb|ELE52093.1| hypothetical protein A1UM_04296 [Escherichia coli KTE75]
          Length = 659

 Score = 59.7 bits (143), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 108/499 (21%), Positives = 182/499 (36%), Gaps = 97/499 (19%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
           V  +L A A       +A +++    V+  ++  Q +   GYL+A+ T      +A +  
Sbjct: 74  VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNAYFT-----VKAPEER 126

Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
           W+     H++     L++  V    A   +    +V    + +  V      + H Y  +
Sbjct: 127 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLADHIDSVFGPGESKLHGYPGH 186

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
            E   +   L RLY +T +P++L L + F      +P +      +    SH+H      
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243

Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
                  +  H+PI      IG  +R+      +Y + G   +  ++   S         
Sbjct: 244 MVKDKAYSQAHLPIAQQQTAIGHTVRF------VYLMTGVAHLARLSHDESKRQDCLRLW 297

Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
                   Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354

Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
             YAD  ERAL N VL      +     Y+ PL          H    KFN  +      
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPI 405

Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
                   CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V 
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
               W    ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 516

Query: 586 TERWSYNDKLTIQLPLSLR 604
           T  W   D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535


>gi|387831475|ref|YP_003351412.1| hypothetical protein ECSF_3422 [Escherichia coli SE15]
 gi|432399540|ref|ZP_19642313.1| hypothetical protein WEI_04484 [Escherichia coli KTE25]
 gi|432408662|ref|ZP_19651364.1| hypothetical protein WEO_03868 [Escherichia coli KTE28]
 gi|432502151|ref|ZP_19743901.1| hypothetical protein A177_04267 [Escherichia coli KTE216]
 gi|432696461|ref|ZP_19931652.1| hypothetical protein A31I_03948 [Escherichia coli KTE162]
 gi|432725058|ref|ZP_19959971.1| hypothetical protein WE1_04112 [Escherichia coli KTE17]
 gi|432729639|ref|ZP_19964512.1| hypothetical protein WE3_04113 [Escherichia coli KTE18]
 gi|432743329|ref|ZP_19978043.1| hypothetical protein WEE_04046 [Escherichia coli KTE23]
 gi|432922799|ref|ZP_20125572.1| hypothetical protein A133_04523 [Escherichia coli KTE173]
 gi|432929459|ref|ZP_20130509.1| hypothetical protein A135_04587 [Escherichia coli KTE175]
 gi|432983040|ref|ZP_20171809.1| hypothetical protein A15W_04187 [Escherichia coli KTE211]
 gi|432992699|ref|ZP_20181347.1| hypothetical protein A179_04489 [Escherichia coli KTE217]
 gi|433098416|ref|ZP_20284583.1| hypothetical protein WK3_03618 [Escherichia coli KTE139]
 gi|433107854|ref|ZP_20293813.1| hypothetical protein WK7_03720 [Escherichia coli KTE148]
 gi|433112834|ref|ZP_20298684.1| hypothetical protein WK9_03711 [Escherichia coli KTE150]
 gi|281180632|dbj|BAI56962.1| conserved hypothetical protein [Escherichia coli SE15]
 gi|430912702|gb|ELC33874.1| hypothetical protein WEI_04484 [Escherichia coli KTE25]
 gi|430926036|gb|ELC46624.1| hypothetical protein WEO_03868 [Escherichia coli KTE28]
 gi|431025819|gb|ELD38905.1| hypothetical protein A177_04267 [Escherichia coli KTE216]
 gi|431231105|gb|ELF26873.1| hypothetical protein A31I_03948 [Escherichia coli KTE162]
 gi|431262277|gb|ELF54267.1| hypothetical protein WE1_04112 [Escherichia coli KTE17]
 gi|431270780|gb|ELF61923.1| hypothetical protein WE3_04113 [Escherichia coli KTE18]
 gi|431281486|gb|ELF72389.1| hypothetical protein WEE_04046 [Escherichia coli KTE23]
 gi|431435293|gb|ELH16905.1| hypothetical protein A133_04523 [Escherichia coli KTE173]
 gi|431440867|gb|ELH22195.1| hypothetical protein A135_04587 [Escherichia coli KTE175]
 gi|431488798|gb|ELH68428.1| hypothetical protein A15W_04187 [Escherichia coli KTE211]
 gi|431490717|gb|ELH70325.1| hypothetical protein A179_04489 [Escherichia coli KTE217]
 gi|431612416|gb|ELI81663.1| hypothetical protein WK3_03618 [Escherichia coli KTE139]
 gi|431623752|gb|ELI92378.1| hypothetical protein WK7_03720 [Escherichia coli KTE148]
 gi|431625172|gb|ELI93765.1| hypothetical protein WK9_03711 [Escherichia coli KTE150]
          Length = 657

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWCIQ--PQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|354603632|ref|ZP_09021629.1| hypothetical protein HMPREF9450_00544 [Alistipes indistinctus YIT
           12060]
 gi|353348727|gb|EHB92995.1| hypothetical protein HMPREF9450_00544 [Alistipes indistinctus YIT
           12060]
          Length = 630

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 68/272 (25%), Positives = 111/272 (40%), Gaps = 50/272 (18%)

Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           G  SA E ++  +R+  T      ETC T   +++  HL   T +  YAD  ER + N +
Sbjct: 304 GSGSADECFYHGRRMQTTPAYSMMETCVTMTWMQLCGHLLELTHDPLYADQIERTVYNAL 363

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHG--WGTKFNSFWCCYGTGIESFSKLGD---- 488
           L+  +G    +  Y  PL  GV   RS  G   G   N   CC   G  +F+ + +    
Sbjct: 364 LAALKGDGSQIAKYS-PL-EGV---RSPGGPQCGMHVN---CCNMNGPRAFAMIPELMAT 415

Query: 489 --------SIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
                   ++Y E    VP                G V+L Q+ +    +     + LT 
Sbjct: 416 CAADTLFVNLYGESVSKVP-------------LAGGEVILRQQTN----YPEQGSVELTV 458

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
           + ++   +  ++ +R+P W  S     ++NGQ +    PG++L+ +  W   DK+ +   
Sbjct: 459 NPRKS--REFAVAVRIPAW--SKITMVTVNGQAVADVRPGSYLTVSRTWKEGDKIALNFD 514

Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
           +  R         E    QAI  GP +LA  T
Sbjct: 515 MRGRLT-------ELNGYQAIERGPVVLARDT 539


>gi|386621273|ref|YP_006140853.1| hypothetical protein ECNA114_3739 [Escherichia coli NA114]
 gi|432423998|ref|ZP_19666535.1| hypothetical protein A137_04431 [Escherichia coli KTE178]
 gi|432560859|ref|ZP_19797513.1| hypothetical protein A1S7_04513 [Escherichia coli KTE49]
 gi|432707936|ref|ZP_19943011.1| hypothetical protein WCG_01214 [Escherichia coli KTE6]
 gi|432891143|ref|ZP_20103901.1| hypothetical protein A31K_00995 [Escherichia coli KTE165]
 gi|333971774|gb|AEG38579.1| Hypothetical protein ECNA114_3739 [Escherichia coli NA114]
 gi|430941626|gb|ELC61768.1| hypothetical protein A137_04431 [Escherichia coli KTE178]
 gi|431088585|gb|ELD94458.1| hypothetical protein A1S7_04513 [Escherichia coli KTE49]
 gi|431254890|gb|ELF48151.1| hypothetical protein WCG_01214 [Escherichia coli KTE6]
 gi|431430258|gb|ELH12090.1| hypothetical protein A31K_00995 [Escherichia coli KTE165]
          Length = 657

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWCIQ--PQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|432491369|ref|ZP_19733231.1| hypothetical protein A171_03302 [Escherichia coli KTE213]
 gi|432841396|ref|ZP_20074855.1| hypothetical protein A1YQ_04362 [Escherichia coli KTE140]
 gi|433205327|ref|ZP_20389073.1| hypothetical protein WGY_03902 [Escherichia coli KTE95]
 gi|431018040|gb|ELD31485.1| hypothetical protein A171_03302 [Escherichia coli KTE213]
 gi|431386628|gb|ELG70584.1| hypothetical protein A1YQ_04362 [Escherichia coli KTE140]
 gi|431716416|gb|ELJ80548.1| hypothetical protein WGY_03902 [Escherichia coli KTE95]
          Length = 654

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|432682342|ref|ZP_19917698.1| hypothetical protein A1YW_04093 [Escherichia coli KTE143]
 gi|431217316|gb|ELF14895.1| hypothetical protein A1YW_04093 [Escherichia coli KTE143]
          Length = 659

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|432604420|ref|ZP_19840650.1| hypothetical protein A1U5_04274 [Escherichia coli KTE66]
 gi|431137800|gb|ELE39645.1| hypothetical protein A1U5_04274 [Escherichia coli KTE66]
          Length = 654

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|300937197|ref|ZP_07152048.1| conserved hypothetical protein [Escherichia coli MS 21-1]
 gi|300457729|gb|EFK21222.1| conserved hypothetical protein [Escherichia coli MS 21-1]
          Length = 667

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 260 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 313

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 370

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 421

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 422 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 477

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 478 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 532

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 533 TLNLTLPMPVR 543


>gi|417141197|ref|ZP_11984110.1| putative glycosyhydrolase [Escherichia coli 97.0259]
 gi|417310126|ref|ZP_12096949.1| hypothetical protein PPECC33_35210 [Escherichia coli PCN033]
 gi|338768332|gb|EGP23129.1| hypothetical protein PPECC33_35210 [Escherichia coli PCN033]
 gi|386155687|gb|EIH12037.1| putative glycosyhydrolase [Escherichia coli 97.0259]
          Length = 654

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|422334703|ref|ZP_16415708.1| hypothetical protein HMPREF0986_04202 [Escherichia coli 4_1_47FAA]
 gi|432871119|ref|ZP_20091498.1| hypothetical protein A313_02338 [Escherichia coli KTE147]
 gi|373244312|gb|EHP63799.1| hypothetical protein HMPREF0986_04202 [Escherichia coli 4_1_47FAA]
 gi|431408324|gb|ELG91511.1| hypothetical protein A313_02338 [Escherichia coli KTE147]
          Length = 654

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 MLNLTLPMPVR 535


>gi|422829813|ref|ZP_16877977.1| hypothetical protein ESNG_02482 [Escherichia coli B093]
 gi|371607765|gb|EHN96330.1| hypothetical protein ESNG_02482 [Escherichia coli B093]
          Length = 659

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|432817355|ref|ZP_20051112.1| hypothetical protein A1Y1_03761 [Escherichia coli KTE115]
 gi|431361237|gb|ELG47834.1| hypothetical protein A1Y1_03761 [Escherichia coli KTE115]
          Length = 656

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|293417024|ref|ZP_06659661.1| hypothetical protein ECDG_04192 [Escherichia coli B185]
 gi|291431600|gb|EFF04585.1| hypothetical protein ECDG_04192 [Escherichia coli B185]
          Length = 656

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      + +  SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKREQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|423126346|ref|ZP_17114025.1| hypothetical protein HMPREF9694_03037 [Klebsiella oxytoca 10-5250]
 gi|376397918|gb|EHT10548.1| hypothetical protein HMPREF9694_03037 [Klebsiella oxytoca 10-5250]
          Length = 653

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 84/395 (21%), Positives = 142/395 (35%), Gaps = 71/395 (17%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
           L RLY +T +P+++ L   F      +P F                    ++ +   Y  
Sbjct: 193 LMRLYDVTQEPRYIALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
              + +  P+ IG  +R+      +Y + G   +  ++                     Y
Sbjct: 253 AHQSISEQPVAIGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWKNMAQRQLY 306

Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIES 482
           AL N VL      +     Y+ PL       +  H +         W    CC       
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPLEVNPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARV 422

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
            + LG  IY   +     LYI  Y+ +S +   G+  L  ++     W   +++ +  SS
Sbjct: 423 LTSLGHYIYTPHDD---ALYINLYVGNSVEIPVGNEALRLRISGNYPWQEQVKIVVDSSS 479

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
                   +L LR+P W   +  Q +LNG  +       +L  +  W   D L + LP+ 
Sbjct: 480 PVH----HTLALRLPDWC--DKPQVTLNGVPVTQDVRKGYLHISHLWQEGDTLQLTLPMP 533

Query: 603 LRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           +R           A + A+  GP  Y L    +GE
Sbjct: 534 VRRIYGNPLVRHQAGLVAVQRGPLVYCLEQADNGE 568


>gi|417588723|ref|ZP_12239485.1| hypothetical protein ECSTECC16502_4394 [Escherichia coli
           STEC_C165-02]
 gi|345331722|gb|EGW64181.1| hypothetical protein ECSTECC16502_4394 [Escherichia coli
           STEC_C165-02]
          Length = 654

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVRGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 MLNLTLPMPVR 535


>gi|432545326|ref|ZP_19782157.1| hypothetical protein A197_03919 [Escherichia coli KTE236]
 gi|432550808|ref|ZP_19787564.1| hypothetical protein A199_04285 [Escherichia coli KTE237]
 gi|432623948|ref|ZP_19859963.1| hypothetical protein A1UO_03835 [Escherichia coli KTE76]
 gi|431071355|gb|ELD79491.1| hypothetical protein A197_03919 [Escherichia coli KTE236]
 gi|431077175|gb|ELD84442.1| hypothetical protein A199_04285 [Escherichia coli KTE237]
 gi|431156242|gb|ELE56979.1| hypothetical protein A1UO_03835 [Escherichia coli KTE76]
          Length = 654

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWCIQ--PQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|419924680|ref|ZP_14442556.1| hypothetical protein EC54115_16625 [Escherichia coli 541-15]
 gi|388389076|gb|EIL50615.1| hypothetical protein EC54115_16625 [Escherichia coli 541-15]
          Length = 659

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 106/499 (21%), Positives = 182/499 (36%), Gaps = 97/499 (19%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
           V  +L A A       +A +++    V+  ++  Q +   GYL+ + T      +A +  
Sbjct: 74  VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNTYFT-----VKAPEER 126

Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
           W+     H++     L++  V    A   +    +V    + + +V      + H Y  +
Sbjct: 127 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLADHIDRVFGPDESKLHGYPGH 186

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
            E   +   L RLY +T +P++L L + F      +P +      +    SH+H      
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243

Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
                  +  H+P+      IG  +R+      +Y + G   +  ++   S         
Sbjct: 244 MVKDKAYSQAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLW 297

Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
                   Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354

Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
             YAD  ERAL N VL      +     Y+ PL          H    KFN  +      
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPI 405

Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
                   CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V 
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
               W    ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 516

Query: 586 TERWSYNDKLTIQLPLSLR 604
           T  W   D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535


>gi|379722221|ref|YP_005314352.1| hypothetical protein PM3016_4439 [Paenibacillus mucilaginosus 3016]
 gi|386724962|ref|YP_006191288.1| hypothetical protein B2K_22975 [Paenibacillus mucilaginosus K02]
 gi|378570893|gb|AFC31203.1| hypothetical protein PM3016_4439 [Paenibacillus mucilaginosus 3016]
 gi|384092087|gb|AFH63523.1| hypothetical protein B2K_22975 [Paenibacillus mucilaginosus K02]
          Length = 660

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 87/354 (24%), Positives = 129/354 (36%), Gaps = 69/354 (19%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
            L +LY  T + ++L LA  F      +P FL     Q D  SH+ A   +PI    QM 
Sbjct: 195 ALVKLYGATGEERYLKLAQFFIDERGTEPNFLVEECRQRDGYSHW-AKKKLPIPTAEQMA 253

Query: 347 YEVTGDPLYKL-------IGTFFMDIVNASHSYATGGT----SAREFWWDPKR----LAD 391
           Y     P+ +        +   +M    A  +  TG      + R  W +  +    +  
Sbjct: 254 YNQAHKPVRQQDTAVGHSVRAVYMYTAMADLARLTGDAELLEACRRLWDNTTKKQMYITG 313

Query: 392 TLGSENE-----------------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
            +GS +                  ETC +  ++  +R + +   +  YAD  ERAL N V
Sbjct: 314 GIGSTHHGEAFSFDYDLPNDTVYAETCASIGLIFFARRMLQLEAKSEYADVLERALYNNV 373

Query: 435 LS--IQRGTEPGVMIYMLPL-----------GRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
           +    Q G       Y+ PL           GR   KA     +G       CC      
Sbjct: 374 IGSMSQDGKH---YFYVNPLEVWPKASEQNPGRHHVKAVRQPWFGCS-----CCPPNVAR 425

Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
             S L D IY    G    +Y   +I S  SF   +G V L Q+    + W+   R  LT
Sbjct: 426 LLSSLNDYIYSASAGENT-VYTHLFIGSEASFKLAAGQVALKQESR--LPWEGCARFELT 482

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
              +  V    +L LR+P W+    A+  +NG          +   T RW+  D
Sbjct: 483 AVPEAPV----TLALRIPSWS-GGRAELRINGAAEAYEVENGYAVVTRRWTAGD 531


>gi|423230660|ref|ZP_17217064.1| hypothetical protein HMPREF1063_02884 [Bacteroides dorei
           CL02T00C15]
 gi|423244371|ref|ZP_17225446.1| hypothetical protein HMPREF1064_01652 [Bacteroides dorei
           CL02T12C06]
 gi|392630310|gb|EIY24303.1| hypothetical protein HMPREF1063_02884 [Bacteroides dorei
           CL02T00C15]
 gi|392641945|gb|EIY35717.1| hypothetical protein HMPREF1064_01652 [Bacteroides dorei
           CL02T12C06]
          Length = 811

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 83/386 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L  A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    T   + +     + TGG  +R     P+   +  G 
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC +   +  +  +F  T +  YAD  ERAL NGV+S       GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  I  F        +  +GN   +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCLGN-ITRFMASVPYYMYATQGN--DV 432

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
           Y+  +I S  D ++    +N +      WD  + + +T   +QE     +L +R+P W  
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488

Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
                    ++++ AQA   S+NG  +       + +    W   D + I LP+ +R   
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548

Query: 605 -TEAIQDDRPEYASIQAILFGPYLLA 629
             + ++DDR +     AI  GP +  
Sbjct: 549 ANDQVEDDRGKL----AIERGPIMFC 570


>gi|300898699|ref|ZP_07117012.1| conserved hypothetical protein [Escherichia coli MS 198-1]
 gi|300357662|gb|EFJ73532.1| conserved hypothetical protein [Escherichia coli MS 198-1]
          Length = 662

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 260 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 313

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 370

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPLRQRWFGCA 421

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 422 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLCLRVSGNYPWQE- 477

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 478 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 532

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 533 TLNLTLPMPVR 543


>gi|421448505|ref|ZP_15897898.1| hypothetical protein SEEE6482_08181 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
 gi|396073159|gb|EJI81465.1| hypothetical protein SEEE6482_08181 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
          Length = 651

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 91/398 (22%), Positives = 143/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGVQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +   G+  L  ++     W   +++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI- 475

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V  L +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 476 -DSVQPV--LHTLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|422975185|ref|ZP_16976637.1| hypothetical protein ESRG_03271 [Escherichia coli TA124]
 gi|371595315|gb|EHN84166.1| hypothetical protein ESRG_03271 [Escherichia coli TA124]
          Length = 654

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLCLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|218707221|ref|YP_002414740.1| hypothetical protein ECUMN_4099 [Escherichia coli UMN026]
 gi|293407210|ref|ZP_06651134.1| conserved hypothetical protein [Escherichia coli FVEC1412]
 gi|298382958|ref|ZP_06992553.1| hypothetical protein ECFG_04115 [Escherichia coli FVEC1302]
 gi|419934131|ref|ZP_14451275.1| hypothetical protein EC5761_10490 [Escherichia coli 576-1]
 gi|432355611|ref|ZP_19598877.1| hypothetical protein WCA_04610 [Escherichia coli KTE2]
 gi|432403987|ref|ZP_19646731.1| hypothetical protein WEK_04194 [Escherichia coli KTE26]
 gi|432428252|ref|ZP_19670733.1| hypothetical protein A139_03650 [Escherichia coli KTE181]
 gi|432462951|ref|ZP_19705084.1| hypothetical protein A15I_03831 [Escherichia coli KTE204]
 gi|432477946|ref|ZP_19719933.1| hypothetical protein A15Q_04149 [Escherichia coli KTE208]
 gi|432519807|ref|ZP_19756986.1| hypothetical protein A17U_02789 [Escherichia coli KTE228]
 gi|432539967|ref|ZP_19776859.1| hypothetical protein A195_03603 [Escherichia coli KTE235]
 gi|432633483|ref|ZP_19869403.1| hypothetical protein A1UW_03877 [Escherichia coli KTE80]
 gi|432643180|ref|ZP_19879004.1| hypothetical protein A1W1_04060 [Escherichia coli KTE83]
 gi|432668175|ref|ZP_19903747.1| hypothetical protein A1Y3_04798 [Escherichia coli KTE116]
 gi|432772362|ref|ZP_20006675.1| hypothetical protein A1SG_00431 [Escherichia coli KTE54]
 gi|432889014|ref|ZP_20102658.1| hypothetical protein A31C_04403 [Escherichia coli KTE158]
 gi|432915187|ref|ZP_20120514.1| hypothetical protein A13Q_04153 [Escherichia coli KTE190]
 gi|433020828|ref|ZP_20208923.1| hypothetical protein WI7_03757 [Escherichia coli KTE105]
 gi|433055258|ref|ZP_20242416.1| hypothetical protein WIK_04060 [Escherichia coli KTE122]
 gi|433069946|ref|ZP_20256714.1| hypothetical protein WIQ_03826 [Escherichia coli KTE128]
 gi|433160742|ref|ZP_20345560.1| hypothetical protein WKU_03818 [Escherichia coli KTE177]
 gi|433180460|ref|ZP_20364837.1| hypothetical protein WGM_04099 [Escherichia coli KTE82]
 gi|218434318|emb|CAR15240.1| conserved hypothetical protein [Escherichia coli UMN026]
 gi|291426021|gb|EFE99055.1| conserved hypothetical protein [Escherichia coli FVEC1412]
 gi|298276794|gb|EFI18312.1| hypothetical protein ECFG_04115 [Escherichia coli FVEC1302]
 gi|388409694|gb|EIL69966.1| hypothetical protein EC5761_10490 [Escherichia coli 576-1]
 gi|430872588|gb|ELB96188.1| hypothetical protein WCA_04610 [Escherichia coli KTE2]
 gi|430923400|gb|ELC44137.1| hypothetical protein WEK_04194 [Escherichia coli KTE26]
 gi|430951024|gb|ELC70250.1| hypothetical protein A139_03650 [Escherichia coli KTE181]
 gi|430986214|gb|ELD02797.1| hypothetical protein A15I_03831 [Escherichia coli KTE204]
 gi|431002149|gb|ELD17675.1| hypothetical protein A15Q_04149 [Escherichia coli KTE208]
 gi|431048059|gb|ELD58044.1| hypothetical protein A17U_02789 [Escherichia coli KTE228]
 gi|431067015|gb|ELD75632.1| hypothetical protein A195_03603 [Escherichia coli KTE235]
 gi|431167666|gb|ELE67931.1| hypothetical protein A1UW_03877 [Escherichia coli KTE80]
 gi|431177575|gb|ELE77497.1| hypothetical protein A1W1_04060 [Escherichia coli KTE83]
 gi|431198006|gb|ELE96833.1| hypothetical protein A1Y3_04798 [Escherichia coli KTE116]
 gi|431323599|gb|ELG11078.1| hypothetical protein A1SG_00431 [Escherichia coli KTE54]
 gi|431413832|gb|ELG96595.1| hypothetical protein A31C_04403 [Escherichia coli KTE158]
 gi|431436255|gb|ELH17862.1| hypothetical protein A13Q_04153 [Escherichia coli KTE190]
 gi|431526942|gb|ELI03673.1| hypothetical protein WI7_03757 [Escherichia coli KTE105]
 gi|431566044|gb|ELI39087.1| hypothetical protein WIK_04060 [Escherichia coli KTE122]
 gi|431578915|gb|ELI51501.1| hypothetical protein WIQ_03826 [Escherichia coli KTE128]
 gi|431673865|gb|ELJ40054.1| hypothetical protein WKU_03818 [Escherichia coli KTE177]
 gi|431697952|gb|ELJ63031.1| hypothetical protein WGM_04099 [Escherichia coli KTE82]
          Length = 654

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPLRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLCLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|291086404|ref|ZP_06355701.2| putative cytoplasmic protein [Citrobacter youngae ATCC 29220]
 gi|291068139|gb|EFE06248.1| putative cytoplasmic protein [Citrobacter youngae ATCC 29220]
          Length = 659

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/220 (24%), Positives = 86/220 (39%), Gaps = 32/220 (14%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 342 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 394

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + +G  IY   +     LYI 
Sbjct: 395 --EVHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARILTSIGHYIYTPRQD---ALYIN 449

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +      VL  ++     W  + ++T+   S Q V    +L LR+P W   + 
Sbjct: 450 LYVGNSMEVPVADGVLKLRISGNYPW--HEQVTIAIESPQPVKH--TLALRLPDWC--SA 503

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            Q  LNGQ +       +L  +  W   D L++ LP+ +R
Sbjct: 504 PQVLLNGQPVAQDIRKGYLHISRTWQEGDTLSLTLPMPVR 543


>gi|345514174|ref|ZP_08793688.1| six-hairpin glycosidase, partial [Bacteroides dorei 5_1_36/D4]
 gi|345456089|gb|EEO48255.2| six-hairpin glycosidase [Bacteroides dorei 5_1_36/D4]
          Length = 810

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 100/439 (22%), Positives = 168/439 (38%), Gaps = 92/439 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L  A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    T   + +     + TGG  +R     P+   +  G 
Sbjct: 276 AGYLYSGVADVATLTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC +   +  +  +F  T +  YAD  ERAL NGV+S       GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  I  F        +  +GN   +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
           Y+  +I S  D ++    +N +      WD  + + +T   +QE     +L +R+P W  
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488

Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
                    ++++ AQA   S+NG  +       + +    W   D + I LP+ +R   
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548

Query: 605 -TEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVT 663
             + ++DDR +     AI  GP +      G+    +          +P+  S++A L+ 
Sbjct: 549 ANDQVEDDRGKL----AIERGPIMFC--LEGQDQADSTVFNKFIPDGTPMEASYDADLL- 601

Query: 664 FTQESGNSTFVMSNSNQSI 682
                 N   V+S + + I
Sbjct: 602 ------NGVMVLSGTAKEI 614


>gi|265752762|ref|ZP_06088331.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|263235948|gb|EEZ21443.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
          Length = 811

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 96/415 (23%), Positives = 156/415 (37%), Gaps = 77/415 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L  A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDKIVGHAVR 275

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    T   + +     + TGG  +R     P+   +  G 
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC +   +  +  +F  T +  YAD  ERAL NGV+S       GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  I  F        +  +GN   +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT- 560
           Y+  +I S  D ++    +N +      WD  + + +T   +QE     +L +R+P WT 
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWTQ 488

Query: 561 ----------YSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
                     +++ AQA   S+NG  +       + +    W   D + I LP+ +R   
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548

Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
             D   +     AI  GP +      G+    +          +P+  SF+A L+
Sbjct: 549 ANDQVEDDHGKLAIERGPIMFC--LEGQDQADSTVFNKFIPDGTPMEASFHADLL 601


>gi|222530205|ref|YP_002574087.1| hypothetical protein Athe_2242 [Caldicellulosiruptor bescii DSM
           6725]
 gi|222457052|gb|ACM61314.1| protein of unknown function DUF1680 [Caldicellulosiruptor bescii
           DSM 6725]
          Length = 652

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 116/515 (22%), Positives = 199/515 (38%), Gaps = 74/515 (14%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFEALK 223
           V  +L A++ +     N  +++K+  V+  + + Q +   GYL+ + T  E    +  L+
Sbjct: 81  VAKWLEAASYILEKYPNPDLEKKVDEVIDIIEKAQWE--DGYLNTYFTIKEKGKRWTNLE 138

Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
                Y   H I AG+   + LA    +L      +E        V +++  E       
Sbjct: 139 ECHELYTAGHMIEAGV--AHFLATGKTSL------LEIIKKLADHVYSIFGKEEGKIPGY 190

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFL--------------GFLALQ 324
           +    +   L +LY +T D K+L LA  F      +P +               GF  L 
Sbjct: 191 DGHPEIELALVKLYEVTGDRKYLELAKFFIDERGQEPYYFDIEWEKRGRKEHWQGFKRLG 250

Query: 325 ADYLSHFHANTHIPIVIGSQMR----YEVTGD--------PLYKLIGTFFMDIVNASHSY 372
            +YL  +         +G  +R    Y    D         L+ +  T F DIV     Y
Sbjct: 251 REYLQVYRPVRQQKEAVGHAVRAVYLYSGMADVAAYTQDKELFDVCKTLFDDIVKRK-MY 309

Query: 373 ATG--GTSA--REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TG  G+SA    F ++     DT  +E   TC +  ++  +  L +      Y D  ER
Sbjct: 310 ITGAIGSSAHGEAFTFEYDLPNDTAYAE---TCASVGLIFFAHRLNKIEPHAKYYDVVER 366

Query: 429 ALTNGVLSI--QRGTEPGVMIYMLPLG---RGVSKARSTHGWGTKFNSFW---CCYGTGI 480
           AL N V+    Q G +     Y+ PL    + V K    H    +   ++   CC     
Sbjct: 367 ALYNTVIGSMSQDGKK---YFYVNPLEVYPKEVEKRFDRHHVKPERQPWFGCACCPPNVA 423

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV-VLNQKVDPIVSWDPYLRMTLT 539
              + LG  +Y     N  G+Y+  YI SS   + G + VL Q+V     ++  +++ L 
Sbjct: 424 RLLASLGRYVY---SYNHDGIYVNLYIGSSVQVEVGGIKVLLQQVSSY-PFEDMVKIDLK 479

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERWSYNDKLTIQ 598
            S +        L LR+P W  S   +  +NG+   P  PP  ++     W  ND++ ++
Sbjct: 480 PSKEARF----KLYLRIPGWCES--YEVYVNGKKEEPEEPPSGYVCIERLWKENDQVVLK 533

Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           +P  ++  +            A++ GP +     +
Sbjct: 534 IPTEVKMVSSHPQVRSNVGKVAVVKGPVVFCAEEA 568


>gi|86359423|ref|YP_471315.1| hypothetical protein RHE_CH03841 [Rhizobium etli CFN 42]
 gi|86283525|gb|ABC92588.1| hypothetical conserved protein [Rhizobium etli CFN 42]
          Length = 640

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 90/396 (22%), Positives = 159/396 (40%), Gaps = 61/396 (15%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFHANT------HIPI 339
            L +L  +T + K+L L+  F      +P F    A++    LS +H  T      H+P+
Sbjct: 198 ALVKLARVTGEKKYLDLSKFFIDERGTEPHFFTAEAVRDGRSLSDYHQKTYEYGQAHLPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 258 REQKKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +     +  YAD  E+AL NG L    
Sbjct: 317 NEGFTDYYDLPN--ATAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
            T+     Y  PL      A   H W  K++   CC        + +G  +Y   +  + 
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426

Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            +++    ++     +G  V L Q  +    WD      + F++K       +L+LR+P 
Sbjct: 427 AVHLYGESTARLKLANGAEVELEQATN--YPWDG----AVAFTAKLAKSAKFALSLRIPD 480

Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
           W  + GA  S+NG  + L       ++     W++ D++ + LP++LR +       + A
Sbjct: 481 W--AEGASLSVNGTGVELGAHLRDGYIRIEREWAHGDRVALDLPMALRPQYANPKVRQDA 538

Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
              A++ GP +    T       T   + L+A+I P
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGQDLNAIILP 567


>gi|294777480|ref|ZP_06742931.1| putative lipoprotein [Bacteroides vulgatus PC510]
 gi|294448548|gb|EFG17097.1| putative lipoprotein [Bacteroides vulgatus PC510]
          Length = 811

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 100/439 (22%), Positives = 168/439 (38%), Gaps = 92/439 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L  A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    T   + +     + TGG  +R     P+   +  G 
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC +   +  +  +F  T +  YAD  ERAL NGV+S       GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  I  F        +  +GN   +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
           Y+  +I S  D ++    +N +      WD  + + +T   +QE     +L +R+P W  
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488

Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
                    ++++ AQA   S+NG  +       + +    W   D + I LP+ +R   
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548

Query: 605 -TEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVT 663
             + ++DDR +     AI  GP +      G+    +          +P+  S++A L+ 
Sbjct: 549 ANDQVEDDRGKL----AIERGPIIFC--LEGQDQADSTVFNKFIPDGTPMEASYDAGLL- 601

Query: 664 FTQESGNSTFVMSNSNQSI 682
                 N   V+S + + I
Sbjct: 602 ------NGVMVLSGTAKEI 614


>gi|397660575|ref|YP_006501277.1| hypothetical protein A225_5616 [Klebsiella oxytoca E718]
 gi|394348582|gb|AFN34703.1| putative secreted protein [Klebsiella oxytoca E718]
          Length = 653

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 78/370 (21%), Positives = 131/370 (35%), Gaps = 85/370 (22%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
           L RLY +T +P+++ L   F      +P F                    ++ +   Y  
Sbjct: 193 LMRLYDVTQEPRYIALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
                +  P+ IG  +R+      +Y + G   +  ++                     Y
Sbjct: 253 AHQPISEQPVAIGHAVRF------VYLMTGVAHLARLSQDEGKRRDCLRLWKNMARRQLY 306

Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
           AL N VL      +     Y+ PL          H    KFN  +              C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPVRQRWFGCAC 414

Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
           C        + LG  IY   +     LYI  Y+ +S +   G+  L  ++     W   +
Sbjct: 415 CPPNIARVLTSLGHYIYTPHDDV---LYINLYVGNSVEIPVGNEALRLRISGNYPWQEQV 471

Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
           ++ +  SS        +L LR+P W   +  Q +LNG  +       +L  +  W   D 
Sbjct: 472 KIVIDSSSPVN----HTLALRLPDWC--DKPQVTLNGAPVTQDVRKGYLHISHLWQEGDT 525

Query: 595 LTIQLPLSLR 604
           L + LP+ +R
Sbjct: 526 LQLTLPMPVR 535


>gi|375257948|ref|YP_005017118.1| hypothetical protein KOX_05730 [Klebsiella oxytoca KCTC 1686]
 gi|365907426|gb|AEX02879.1| hypothetical protein KOX_05730 [Klebsiella oxytoca KCTC 1686]
          Length = 653

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 78/370 (21%), Positives = 131/370 (35%), Gaps = 85/370 (22%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
           L RLY +T +P+++ L   F      +P F                    ++ +   Y  
Sbjct: 193 LMRLYDVTQEPRYIALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
                +  P+ IG  +R+      +Y + G   +  ++                     Y
Sbjct: 253 AHQPISEQPVAIGHAVRF------VYLMTGVAHLARLSQDEGKRRDCLRLWKNMARRQLY 306

Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
           AL N VL      +     Y+ PL          H    KFN  +              C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPVRQRWFGCAC 414

Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
           C        + LG  IY   +     LYI  Y+ +S +   G+  L  ++     W   +
Sbjct: 415 CPPNIARVLTSLGHYIYTPHDDV---LYINLYVGNSVEIPVGNEALRLRISGNYPWQEQV 471

Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
           ++ +  SS        +L LR+P W   +  Q +LNG  +       +L  +  W   D 
Sbjct: 472 KIVIDSSSPVN----HTLALRLPDWC--DKPQVTLNGAPVTQDVRKGYLHISHLWQEGDT 525

Query: 595 LTIQLPLSLR 604
           L + LP+ +R
Sbjct: 526 LQLTLPMPVR 535


>gi|432394191|ref|ZP_19637011.1| hypothetical protein WE9_04517 [Escherichia coli KTE21]
 gi|430914340|gb|ELC35436.1| hypothetical protein WE9_04517 [Escherichia coli KTE21]
          Length = 656

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  ++     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRISGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVGQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|421728042|ref|ZP_16167199.1| hypothetical protein KOXM_22161 [Klebsiella oxytoca M5al]
 gi|410371224|gb|EKP25948.1| hypothetical protein KOXM_22161 [Klebsiella oxytoca M5al]
          Length = 653

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 79/370 (21%), Positives = 131/370 (35%), Gaps = 85/370 (22%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
           L RLY +T +P+++ L   F      +P F                    ++ +   Y  
Sbjct: 193 LMRLYDVTQEPRYIALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252

Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
                +  P+ IG  +R+      +Y + G   +  ++                     Y
Sbjct: 253 AHQPISEQPVAIGHAVRF------VYLMAGVAHLARLSQDEGKRQDCLRLWKNMAQRQLY 306

Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
           AL N VL      +     Y+ PL          H    KFN  +              C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPVRQRWFGCAC 414

Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
           C        + LG  IY   +     LYI  YI +S +   G+  L  ++     W   +
Sbjct: 415 CPPNIARVLTSLGHYIYTPHDD---ALYINLYIGNSAEIPVGNEALRLRISGNYPWQEQV 471

Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
           ++ +  SS        +L LR+P W   +  Q +LNG  +       +L  +  W   D 
Sbjct: 472 QIVIDSSSPVH----HTLALRLPDWC--DKPQVTLNGAPVTQDVRKGYLYISHLWQEGDT 525

Query: 595 LTIQLPLSLR 604
           L + LP+ +R
Sbjct: 526 LLLTLPMPVR 535


>gi|423313151|ref|ZP_17291087.1| hypothetical protein HMPREF1058_01699 [Bacteroides vulgatus
           CL09T03C04]
 gi|392686365|gb|EIY79671.1| hypothetical protein HMPREF1058_01699 [Bacteroides vulgatus
           CL09T03C04]
          Length = 811

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 83/386 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L  A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    T   + +     + TGG  +R     P+   +  G 
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC +   +  +  +F  T +  YAD  ERAL NGV+S       GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  I  F        +  +GN   +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
           Y+  +I S  D ++    +N +      WD  + + +T   +QE     +L +R+P W  
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488

Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
                    ++++ AQA   S+NG  +       + +    W   D + I LP+ +R   
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548

Query: 605 -TEAIQDDRPEYASIQAILFGPYLLA 629
             + ++DDR +     AI  GP +  
Sbjct: 549 ANDQVEDDRGKL----AIERGPIMFC 570


>gi|417243728|ref|ZP_12038126.1| putative glycosyhydrolase [Escherichia coli 9.0111]
 gi|386211280|gb|EII21745.1| putative glycosyhydrolase [Escherichia coli 9.0111]
          Length = 654

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|319640078|ref|ZP_07994805.1| hypothetical protein HMPREF9011_00402 [Bacteroides sp. 3_1_40A]
 gi|345517097|ref|ZP_08796575.1| six-hairpin glycosidase [Bacteroides sp. 4_3_47FAA]
 gi|254833866|gb|EET14175.1| six-hairpin glycosidase [Bacteroides sp. 4_3_47FAA]
 gi|317388356|gb|EFV69208.1| hypothetical protein HMPREF9011_00402 [Bacteroides sp. 3_1_40A]
          Length = 811

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 83/386 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L  A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    T   + +     + TGG  +R     P+   +  G 
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC +   +  +  +F  T +  YAD  ERAL NGV+S       GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  I  F        +  +GN   +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
           Y+  +I S  D ++    +N +      WD  + + +T   +QE     +L +R+P W  
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488

Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
                    ++++ AQA   S+NG  +       + +    W   D + I LP+ +R   
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548

Query: 605 -TEAIQDDRPEYASIQAILFGPYLLA 629
             + ++DDR +     AI  GP +  
Sbjct: 549 ANDQVEDDRGKL----AIERGPIMFC 570


>gi|312621510|ref|YP_004023123.1| hypothetical protein Calkro_0404 [Caldicellulosiruptor
           kronotskyensis 2002]
 gi|312201977|gb|ADQ45304.1| protein of unknown function DUF1680 [Caldicellulosiruptor
           kronotskyensis 2002]
          Length = 652

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 115/515 (22%), Positives = 197/515 (38%), Gaps = 74/515 (14%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFEALK 223
           V  +L A++ +     N  +++K+  V+  + + Q +   GYL+ + T  E    +  L+
Sbjct: 81  VAKWLEAASYVLEKYPNPDLEKKVDEVIQLIGKAQWE--DGYLNTYFTIKEKGKRWTNLE 138

Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
                Y   H I AG    ++       L++   + ++ YN       ++  E       
Sbjct: 139 ECHELYTAGHMIEAGCA-HFLATGKTTLLEIVKKIADHIYN-------VFGKEEGKIPGY 190

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-------------------DKPCFLGFLALQ 324
           +    +   L +LY +T D K+L LA  F                    K  + GF +L 
Sbjct: 191 DGHPEIELALVKLYEVTGDRKYLELAKFFVDERGQEPYYFDIEYEKRGKKSHWAGFKSLG 250

Query: 325 ADYLSHFHANTHIPIVIGSQMR----YEVTGD--------PLYKLIGTFFMDIVNASHSY 372
            +YL  +         +G  +R    Y    D         L+ +  T F DIV     Y
Sbjct: 251 REYLQAYRPLRQQKEAVGHAVRAVYLYSGAADVAAYTQDKELFDVCKTLFDDIVKRK-MY 309

Query: 373 ATG--GTSA--REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TG  G+SA    F ++     DT  +E   TC +  ++  +  L +      Y D  ER
Sbjct: 310 ITGAIGSSAHGEAFTFEYDLPNDTAYAE---TCASVGLIFFAHRLNKIEPHAKYYDVVER 366

Query: 429 ALTNGVLSI--QRGTEPGVMIYMLPLG---RGVSKA---RSTHGWGTKFNSFWCCYGTGI 480
           AL N V+    Q G +     Y+ PL    + V K    R        +    CC     
Sbjct: 367 ALYNTVIGSMSQDGKK---YFYVNPLEVYPKEVEKRFDRRHVKPERQPWFGCACCPPNVA 423

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV-VLNQKVDPIVSWDPYLRMTLT 539
              + LG  IY     N  G+Y+  YI SS   + G V VL Q++     ++  +++ L 
Sbjct: 424 RLLASLGRYIY---SYNHEGIYVNLYIGSSVQVEVGGVKVLLQQMSSY-PFEDIVKIDLK 479

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERWSYNDKLTIQ 598
            S +        L LR+P W  S   +  +NG+   P  PP  ++     W  ND++ ++
Sbjct: 480 PSKEARF----KLYLRIPSWCES--YEVYVNGKKEEPEEPPSGYVCIERLWKENDQVILK 533

Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           +P  ++  +            A++ GP +     +
Sbjct: 534 IPTEVKMVSSHPQVRSNVGKVAVVKGPVVFCAEEA 568


>gi|150003698|ref|YP_001298442.1| hypothetical protein BVU_1129 [Bacteroides vulgatus ATCC 8482]
 gi|149932122|gb|ABR38820.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
          Length = 811

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 83/386 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L  A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    T   + +     + TGG  +R     P+   +  G 
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC +   +  +  +F  T +  YAD  ERAL NGV+S       GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  I  F        +  +GN   +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
           Y+  +I S  D ++    +N +      WD  + + +T   +QE     +L +R+P W  
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488

Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
                    ++++ AQA   S+NG  +       + +    W   D + I LP+ +R   
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548

Query: 605 -TEAIQDDRPEYASIQAILFGPYLLA 629
             + ++DDR +     AI  GP +  
Sbjct: 549 ANDQVEDDRGKL----AIERGPIMFC 570


>gi|237711356|ref|ZP_04541837.1| six-hairpin glycosidase [Bacteroides sp. 9_1_42FAA]
 gi|229454051|gb|EEO59772.1| six-hairpin glycosidase [Bacteroides sp. 9_1_42FAA]
          Length = 806

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 83/386 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L  A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 215 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 270

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    T   + +     + TGG  +R     P+   +  G 
Sbjct: 271 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 323

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC +   +  +  +F  T +  YAD  ERAL NGV+S       GV +
Sbjct: 324 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 376

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  I  F        +  +GN   +
Sbjct: 377 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 427

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
           Y+  +I S  D ++    +N +      WD  + + +T   +QE     +L +R+P W  
Sbjct: 428 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 483

Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
                    ++++ AQA   S+NG  +       + +    W   D + I LP+ +R   
Sbjct: 484 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 543

Query: 605 -TEAIQDDRPEYASIQAILFGPYLLA 629
             + ++DDR +     AI  GP +  
Sbjct: 544 ANDQVEDDRGKL----AIERGPIMFC 565


>gi|261420102|ref|YP_003253784.1| hypothetical protein GYMC61_2720 [Geobacillus sp. Y412MC61]
 gi|319766914|ref|YP_004132415.1| hypothetical protein [Geobacillus sp. Y412MC52]
 gi|261376559|gb|ACX79302.1| protein of unknown function DUF1680 [Geobacillus sp. Y412MC61]
 gi|317111780|gb|ADU94272.1| protein of unknown function DUF1680 [Geobacillus sp. Y412MC52]
          Length = 640

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 62/265 (23%), Positives = 106/265 (40%), Gaps = 24/265 (9%)

Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSA----REFWWDPKRLADTLGSENEETCTTYN 405
           TGD   K       + V     Y TGG  +      F +D     DT+ +E   TC +  
Sbjct: 275 TGDESLKQACQTLWENVTKRQMYITGGVGSSAFGESFTFDFDLPNDTVYTE---TCASIA 331

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG---RGVSK--AR 460
           ++  +R +     +  YAD  ERAL NG +S     +     Y+ PL    +   +   R
Sbjct: 332 LVFWARRMLELEMDGKYADVMERALYNGTIS-GMDLDGKRFFYVNPLEVWPKACERHDKR 390

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
                  K+ S  CC        + +   IY +       L++  Y+ S    + G   +
Sbjct: 391 HVKPVRQKWFSCACCPPNLARLIASISHYIYSQ---TSDALFVHLYVGSDIQTEMGGRSV 447

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-- 578
               +    WD  +R+T++  S QE     +L LR+P W    GA+ ++NG+N+ + P  
Sbjct: 448 EIVQETNYPWDGKVRLTISPESAQEF----TLGLRIPGW--GRGAEVTINGENVDIAPLT 501

Query: 579 PGNFLSATERWSYNDKLTIQLPLSL 603
              +      W   D++ +  P+ +
Sbjct: 502 KKGYAYIRRVWRQGDEMVLHFPMPV 526


>gi|397166966|ref|ZP_10490409.1| hypothetical protein Y71_0972 [Enterobacter radicincitans DSM
           16656]
 gi|396091112|gb|EJI88679.1| hypothetical protein Y71_0972 [Enterobacter radicincitans DSM
           16656]
          Length = 651

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 82/212 (38%), Gaps = 16/212 (7%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLEVHPKT 392

Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
            R  H +         W    CC        + LG  IY   +     LYI  Y+ +S +
Sbjct: 393 LRFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTPHQD---ALYINLYVGNSIE 449

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
              G  VL  +V     W    ++ +   S   V    +L LRMP W   +  Q +LNG 
Sbjct: 450 VPVGDKVLRLRVSGNFPWQE--KVMIAVESPLPVQH--TLALRMPDW--CDAPQVTLNGV 503

Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            +       +L     W   D LT+ LP+ +R
Sbjct: 504 AVEKAVHKGYLHIHRLWQEGDTLTLTLPMPVR 535


>gi|331665212|ref|ZP_08366113.1| putative cytoplasmic protein [Escherichia coli TA143]
 gi|432767960|ref|ZP_20002352.1| hypothetical protein A1S9_00751 [Escherichia coli KTE50]
 gi|432964211|ref|ZP_20153463.1| hypothetical protein A15E_04412 [Escherichia coli KTE202]
 gi|433065055|ref|ZP_20251959.1| hypothetical protein WIO_03878 [Escherichia coli KTE125]
 gi|331057722|gb|EGI29708.1| putative cytoplasmic protein [Escherichia coli TA143]
 gi|431321992|gb|ELG09585.1| hypothetical protein A1S9_00751 [Escherichia coli KTE50]
 gi|431469844|gb|ELH49772.1| hypothetical protein A15E_04412 [Escherichia coli KTE202]
 gi|431578217|gb|ELI50831.1| hypothetical protein WIO_03878 [Escherichia coli KTE125]
          Length = 654

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +        YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGNSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|422768624|ref|ZP_16822348.1| hypothetical protein ERCG_03884 [Escherichia coli E1520]
 gi|323934869|gb|EGB31251.1| hypothetical protein ERCG_03884 [Escherichia coli E1520]
          Length = 659

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|331655213|ref|ZP_08356212.1| putative cytoplasmic protein [Escherichia coli M718]
 gi|331047228|gb|EGI19306.1| putative cytoplasmic protein [Escherichia coli M718]
          Length = 664

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 260 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 313

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 314 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 370

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 421

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 422 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 477

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 478 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 532

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 533 TLNLTLPMPVR 543


>gi|432720730|ref|ZP_19955692.1| hypothetical protein WCK_04370 [Escherichia coli KTE9]
 gi|432794804|ref|ZP_20028883.1| hypothetical protein A1US_04040 [Escherichia coli KTE78]
 gi|432796321|ref|ZP_20030359.1| hypothetical protein A1UU_01028 [Escherichia coli KTE79]
 gi|431259905|gb|ELF52266.1| hypothetical protein WCK_04370 [Escherichia coli KTE9]
 gi|431336741|gb|ELG23843.1| hypothetical protein A1US_04040 [Escherichia coli KTE78]
 gi|431348554|gb|ELG35405.1| hypothetical protein A1UU_01028 [Escherichia coli KTE79]
          Length = 654

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 83/365 (22%), Positives = 133/365 (36%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +       + +     Y TGG 
Sbjct: 252 QAHLPIAQQQTAIGHAVRFVYLMTAVAHLARLSHDESKRQDCLRLWNNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + +G  +Y   E     LYI  Y  +S +    + +L  +V     W    ++T+ 
Sbjct: 420 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGMLRLRVSGNYPWQE--QVTIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D L + L
Sbjct: 475 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVGQDIRKGYLHITREWQEGDTLNLTL 530

Query: 600 PLSLR 604
           P+ +R
Sbjct: 531 PMPVR 535


>gi|432451832|ref|ZP_19694088.1| hypothetical protein A13W_02801 [Escherichia coli KTE193]
 gi|433035497|ref|ZP_20223187.1| hypothetical protein WIC_04061 [Escherichia coli KTE112]
 gi|430977578|gb|ELC94414.1| hypothetical protein A13W_02801 [Escherichia coli KTE193]
 gi|431546634|gb|ELI21027.1| hypothetical protein WIC_04061 [Escherichia coli KTE112]
          Length = 656

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|300920475|ref|ZP_07136906.1| conserved hypothetical protein [Escherichia coli MS 115-1]
 gi|300412519|gb|EFJ95829.1| conserved hypothetical protein [Escherichia coli MS 115-1]
          Length = 664

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 85/365 (23%), Positives = 133/365 (36%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259

Query: 334 NTHIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
             H+P+      IG  +R  Y +TG         D   +       + +     Y TGG 
Sbjct: 260 QAHLPLAQQQTAIGHAVRFAYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQLYITGGI 319

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 320 GSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 376

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 377 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 427

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + +G  +Y   E     LYI  Y  +S +    +  L  +V     W    ++T+ 
Sbjct: 428 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIA 482

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D L + L
Sbjct: 483 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTL 538

Query: 600 PLSLR 604
           P+ +R
Sbjct: 539 PMPVR 543


>gi|417631018|ref|ZP_12281252.1| hypothetical protein ECSTECMHI813_3969 [Escherichia coli
           STEC_MHI813]
 gi|345370297|gb|EGX02275.1| hypothetical protein ECSTECMHI813_3969 [Escherichia coli
           STEC_MHI813]
          Length = 656

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|432752040|ref|ZP_19986617.1| hypothetical protein WEQ_03462 [Escherichia coli KTE29]
 gi|431293661|gb|ELF83953.1| hypothetical protein WEQ_03462 [Escherichia coli KTE29]
          Length = 659

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 85/365 (23%), Positives = 133/365 (36%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
             H+P+      IG  +R  Y +TG         D   +       + +     Y TGG 
Sbjct: 252 QAHLPLAQQQTAIGHAVRFAYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + +G  +Y   E     LYI  Y  +S +    +  L  +V     W    ++T+ 
Sbjct: 420 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D L + L
Sbjct: 475 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTL 530

Query: 600 PLSLR 604
           P+ +R
Sbjct: 531 PMPVR 535


>gi|425263519|ref|ZP_18655509.1| hypothetical protein ECEC96038_4736 [Escherichia coli EC96038]
 gi|408177761|gb|EKI04521.1| hypothetical protein ECEC96038_4736 [Escherichia coli EC96038]
          Length = 656

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|422836105|ref|ZP_16884154.1| hypothetical protein ESOG_03755 [Escherichia coli E101]
 gi|371609666|gb|EHN98200.1| hypothetical protein ESOG_03755 [Escherichia coli E101]
          Length = 656

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|432855232|ref|ZP_20083284.1| hypothetical protein A1YY_03450 [Escherichia coli KTE144]
 gi|431397569|gb|ELG81016.1| hypothetical protein A1YY_03450 [Escherichia coli KTE144]
          Length = 654

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+PI      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGKLCLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|432949979|ref|ZP_20144543.1| hypothetical protein A153_04333 [Escherichia coli KTE196]
 gi|433045129|ref|ZP_20232605.1| hypothetical protein WIG_03662 [Escherichia coli KTE117]
 gi|431453768|gb|ELH34151.1| hypothetical protein A153_04333 [Escherichia coli KTE196]
 gi|431552786|gb|ELI26734.1| hypothetical protein WIG_03662 [Escherichia coli KTE117]
          Length = 659

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|416342142|ref|ZP_11676508.1| hypothetical protein ECoL_01429 [Escherichia coli EC4100B]
 gi|419280237|ref|ZP_13822479.1| hypothetical protein ECDEC10E_4232 [Escherichia coli DEC10E]
 gi|419347353|ref|ZP_13888721.1| hypothetical protein ECDEC13A_3946 [Escherichia coli DEC13A]
 gi|419351812|ref|ZP_13893141.1| hypothetical protein ECDEC13B_3790 [Escherichia coli DEC13B]
 gi|419357284|ref|ZP_13898530.1| hypothetical protein ECDEC13C_4352 [Escherichia coli DEC13C]
 gi|419362259|ref|ZP_13903466.1| hypothetical protein ECDEC13D_4067 [Escherichia coli DEC13D]
 gi|419367374|ref|ZP_13908523.1| hypothetical protein ECDEC13E_4123 [Escherichia coli DEC13E]
 gi|419377671|ref|ZP_13918688.1| hypothetical protein ECDEC14B_4282 [Escherichia coli DEC14B]
 gi|419383008|ref|ZP_13923950.1| hypothetical protein ECDEC14C_4191 [Escherichia coli DEC14C]
 gi|419388306|ref|ZP_13929174.1| hypothetical protein ECDEC14D_4146 [Escherichia coli DEC14D]
 gi|425424537|ref|ZP_18805687.1| hypothetical protein EC01288_3892 [Escherichia coli 0.1288]
 gi|432535989|ref|ZP_19772946.1| hypothetical protein A193_04433 [Escherichia coli KTE234]
 gi|432811308|ref|ZP_20045165.1| hypothetical protein A1WM_02459 [Escherichia coli KTE101]
 gi|320201393|gb|EFW75974.1| hypothetical protein ECoL_01429 [Escherichia coli EC4100B]
 gi|378125150|gb|EHW86553.1| hypothetical protein ECDEC10E_4232 [Escherichia coli DEC10E]
 gi|378182886|gb|EHX43534.1| hypothetical protein ECDEC13A_3946 [Escherichia coli DEC13A]
 gi|378195992|gb|EHX56482.1| hypothetical protein ECDEC13C_4352 [Escherichia coli DEC13C]
 gi|378196853|gb|EHX57338.1| hypothetical protein ECDEC13B_3790 [Escherichia coli DEC13B]
 gi|378199461|gb|EHX59926.1| hypothetical protein ECDEC13D_4067 [Escherichia coli DEC13D]
 gi|378210031|gb|EHX70398.1| hypothetical protein ECDEC13E_4123 [Escherichia coli DEC13E]
 gi|378215636|gb|EHX75932.1| hypothetical protein ECDEC14B_4282 [Escherichia coli DEC14B]
 gi|378224949|gb|EHX85150.1| hypothetical protein ECDEC14C_4191 [Escherichia coli DEC14C]
 gi|378228861|gb|EHX89012.1| hypothetical protein ECDEC14D_4146 [Escherichia coli DEC14D]
 gi|408341050|gb|EKJ55523.1| hypothetical protein EC01288_3892 [Escherichia coli 0.1288]
 gi|431057624|gb|ELD67052.1| hypothetical protein A193_04433 [Escherichia coli KTE234]
 gi|431360470|gb|ELG47081.1| hypothetical protein A1WM_02459 [Escherichia coli KTE101]
          Length = 656

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 85/365 (23%), Positives = 133/365 (36%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
             H+P+      IG  +R  Y +TG         D   +       + +     Y TGG 
Sbjct: 252 QAHLPLAQQQTAIGHAVRFAYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + +G  +Y   E     LYI  Y  +S +    +  L  +V     W    ++T+ 
Sbjct: 420 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D L + L
Sbjct: 475 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTL 530

Query: 600 PLSLR 604
           P+ +R
Sbjct: 531 PMPVR 535


>gi|15804123|ref|NP_290162.1| hypothetical protein Z5002 [Escherichia coli O157:H7 str. EDL933]
 gi|15833713|ref|NP_312486.1| hypothetical protein ECs4459 [Escherichia coli O157:H7 str. Sakai]
 gi|168746875|ref|ZP_02771897.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4113]
 gi|168753398|ref|ZP_02778405.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4401]
 gi|168759671|ref|ZP_02784678.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4501]
 gi|168765993|ref|ZP_02791000.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4486]
 gi|168772459|ref|ZP_02797466.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4196]
 gi|168779729|ref|ZP_02804736.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4076]
 gi|168797417|ref|ZP_02822424.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC508]
 gi|195935108|ref|ZP_03080490.1| hypothetical protein EscherichcoliO157_01410 [Escherichia coli
           O157:H7 str. EC4024]
 gi|208809591|ref|ZP_03251928.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4206]
 gi|208813747|ref|ZP_03255076.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4045]
 gi|208821480|ref|ZP_03261800.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4042]
 gi|209399472|ref|YP_002273062.1| hypothetical protein ECH74115_4952 [Escherichia coli O157:H7 str.
           EC4115]
 gi|217324274|ref|ZP_03440358.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           TW14588]
 gi|254795534|ref|YP_003080371.1| hypothetical protein ECSP_4573 [Escherichia coli O157:H7 str.
           TW14359]
 gi|291284953|ref|YP_003501771.1| hypothetical protein G2583_4318 [Escherichia coli O55:H7 str.
           CB9615]
 gi|387508986|ref|YP_006161242.1| hypothetical protein ECO55CA74_20675 [Escherichia coli O55:H7 str.
           RM12579]
 gi|387884760|ref|YP_006315062.1| hypothetical protein CDCO157_4196 [Escherichia coli Xuzhou21]
 gi|416315758|ref|ZP_11659571.1| hypothetical protein ECoA_05479 [Escherichia coli O157:H7 str.
           1044]
 gi|416320011|ref|ZP_11662563.1| hypothetical protein ECoD_02794 [Escherichia coli O157:H7 str.
           EC1212]
 gi|416330228|ref|ZP_11669265.1| hypothetical protein ECF_04238 [Escherichia coli O157:H7 str. 1125]
 gi|416778240|ref|ZP_11875812.1| hypothetical protein ECO5101_02775 [Escherichia coli O157:H7 str.
           G5101]
 gi|416789533|ref|ZP_11880657.1| hypothetical protein ECO9389_12801 [Escherichia coli O157:H- str.
           493-89]
 gi|416801447|ref|ZP_11885596.1| hypothetical protein ECO2687_16571 [Escherichia coli O157:H- str. H
           2687]
 gi|416812344|ref|ZP_11890513.1| hypothetical protein ECO7815_03795 [Escherichia coli O55:H7 str.
           3256-97]
 gi|416832964|ref|ZP_11900127.1| hypothetical protein ECOSU61_16870 [Escherichia coli O157:H7 str.
           LSU-61]
 gi|419047735|ref|ZP_13594666.1| hypothetical protein ECDEC3A_4594 [Escherichia coli DEC3A]
 gi|419053393|ref|ZP_13600259.1| hypothetical protein ECDEC3B_4713 [Escherichia coli DEC3B]
 gi|419059343|ref|ZP_13606144.1| hypothetical protein ECDEC3C_4952 [Escherichia coli DEC3C]
 gi|419064888|ref|ZP_13611608.1| hypothetical protein ECDEC3D_4717 [Escherichia coli DEC3D]
 gi|419071821|ref|ZP_13617428.1| hypothetical protein ECDEC3E_4925 [Escherichia coli DEC3E]
 gi|419077685|ref|ZP_13623186.1| hypothetical protein ECDEC3F_4810 [Escherichia coli DEC3F]
 gi|419082821|ref|ZP_13628266.1| hypothetical protein ECDEC4A_4458 [Escherichia coli DEC4A]
 gi|419088700|ref|ZP_13634051.1| hypothetical protein ECDEC4B_4655 [Escherichia coli DEC4B]
 gi|419094624|ref|ZP_13639902.1| hypothetical protein ECDEC4C_4545 [Escherichia coli DEC4C]
 gi|419106234|ref|ZP_13651356.1| hypothetical protein ECDEC4E_4577 [Escherichia coli DEC4E]
 gi|419111620|ref|ZP_13656671.1| hypothetical protein ECDEC4F_4462 [Escherichia coli DEC4F]
 gi|419117157|ref|ZP_13662166.1| hypothetical protein ECDEC5A_4358 [Escherichia coli DEC5A]
 gi|419122875|ref|ZP_13667817.1| hypothetical protein ECDEC5B_4720 [Escherichia coli DEC5B]
 gi|419128272|ref|ZP_13673144.1| hypothetical protein ECDEC5C_4407 [Escherichia coli DEC5C]
 gi|419133720|ref|ZP_13678547.1| hypothetical protein ECDEC5D_4501 [Escherichia coli DEC5D]
 gi|419138882|ref|ZP_13683672.1| hypothetical protein ECDEC5E_4420 [Escherichia coli DEC5E]
 gi|420271748|ref|ZP_14774099.1| hypothetical protein ECPA22_4925 [Escherichia coli PA22]
 gi|420283060|ref|ZP_14785292.1| hypothetical protein ECTW06591_4436 [Escherichia coli TW06591]
 gi|420288947|ref|ZP_14791129.1| hypothetical protein ECTW10246_5014 [Escherichia coli TW10246]
 gi|420294768|ref|ZP_14796878.1| hypothetical protein ECTW11039_4946 [Escherichia coli TW11039]
 gi|420300624|ref|ZP_14802667.1| hypothetical protein ECTW09109_5140 [Escherichia coli TW09109]
 gi|420306468|ref|ZP_14808456.1| hypothetical protein ECTW10119_5262 [Escherichia coli TW10119]
 gi|420311766|ref|ZP_14813694.1| hypothetical protein ECEC1738_4805 [Escherichia coli EC1738]
 gi|420317423|ref|ZP_14819294.1| hypothetical protein ECEC1734_4754 [Escherichia coli EC1734]
 gi|421814567|ref|ZP_16250269.1| hypothetical protein EC80416_4344 [Escherichia coli 8.0416]
 gi|421821215|ref|ZP_16256686.1| hypothetical protein EC100821_5204 [Escherichia coli 10.0821]
 gi|421833209|ref|ZP_16268489.1| hypothetical protein ECPA7_5406 [Escherichia coli PA7]
 gi|423727615|ref|ZP_17701493.1| hypothetical protein ECPA31_4706 [Escherichia coli PA31]
 gi|424079832|ref|ZP_17816792.1| hypothetical protein ECFDA505_4757 [Escherichia coli FDA505]
 gi|424086239|ref|ZP_17822721.1| hypothetical protein ECFDA517_5074 [Escherichia coli FDA517]
 gi|424099319|ref|ZP_17834587.1| hypothetical protein ECFRIK1985_5028 [Escherichia coli FRIK1985]
 gi|424112173|ref|ZP_17846397.1| hypothetical protein EC93001_4874 [Escherichia coli 93-001]
 gi|424118115|ref|ZP_17851944.1| hypothetical protein ECPA3_4889 [Escherichia coli PA3]
 gi|424124302|ref|ZP_17857602.1| hypothetical protein ECPA5_4745 [Escherichia coli PA5]
 gi|424130447|ref|ZP_17863346.1| hypothetical protein ECPA9_4920 [Escherichia coli PA9]
 gi|424136776|ref|ZP_17869217.1| hypothetical protein ECPA10_5070 [Escherichia coli PA10]
 gi|424143329|ref|ZP_17875187.1| hypothetical protein ECPA14_4913 [Escherichia coli PA14]
 gi|424149721|ref|ZP_17881088.1| hypothetical protein ECPA15_5031 [Escherichia coli PA15]
 gi|424155573|ref|ZP_17886500.1| hypothetical protein ECPA24_4636 [Escherichia coli PA24]
 gi|424255558|ref|ZP_17892047.1| hypothetical protein ECPA25_4614 [Escherichia coli PA25]
 gi|424334046|ref|ZP_17897955.1| hypothetical protein ECPA28_4952 [Escherichia coli PA28]
 gi|424452012|ref|ZP_17903674.1| hypothetical protein ECPA32_4774 [Escherichia coli PA32]
 gi|424458199|ref|ZP_17909303.1| hypothetical protein ECPA33_4772 [Escherichia coli PA33]
 gi|424464678|ref|ZP_17915033.1| hypothetical protein ECPA39_4850 [Escherichia coli PA39]
 gi|424477467|ref|ZP_17926776.1| hypothetical protein ECPA42_4929 [Escherichia coli PA42]
 gi|424483230|ref|ZP_17932202.1| hypothetical protein ECTW07945_4773 [Escherichia coli TW07945]
 gi|424489411|ref|ZP_17937952.1| hypothetical protein ECTW09098_4852 [Escherichia coli TW09098]
 gi|424502761|ref|ZP_17949642.1| hypothetical protein ECEC4203_4848 [Escherichia coli EC4203]
 gi|424509021|ref|ZP_17955394.1| hypothetical protein ECEC4196_4900 [Escherichia coli EC4196]
 gi|424516380|ref|ZP_17960994.1| hypothetical protein ECTW14313_4703 [Escherichia coli TW14313]
 gi|424522562|ref|ZP_17966668.1| hypothetical protein ECTW14301_4626 [Escherichia coli TW14301]
 gi|424528439|ref|ZP_17972147.1| hypothetical protein ECEC4421_4687 [Escherichia coli EC4421]
 gi|424534588|ref|ZP_17977927.1| hypothetical protein ECEC4422_4816 [Escherichia coli EC4422]
 gi|424540646|ref|ZP_17983581.1| hypothetical protein ECEC4013_4958 [Escherichia coli EC4013]
 gi|424546791|ref|ZP_17989143.1| hypothetical protein ECEC4402_4833 [Escherichia coli EC4402]
 gi|424552999|ref|ZP_17994833.1| hypothetical protein ECEC4439_4786 [Escherichia coli EC4439]
 gi|424559188|ref|ZP_18000588.1| hypothetical protein ECEC4436_4739 [Escherichia coli EC4436]
 gi|424565524|ref|ZP_18006519.1| hypothetical protein ECEC4437_4898 [Escherichia coli EC4437]
 gi|424571655|ref|ZP_18012193.1| hypothetical protein ECEC4448_4800 [Escherichia coli EC4448]
 gi|424577810|ref|ZP_18017853.1| hypothetical protein ECEC1845_4760 [Escherichia coli EC1845]
 gi|424583627|ref|ZP_18023264.1| hypothetical protein ECEC1863_4496 [Escherichia coli EC1863]
 gi|425100295|ref|ZP_18503019.1| hypothetical protein EC34870_4837 [Escherichia coli 3.4870]
 gi|425106397|ref|ZP_18508705.1| hypothetical protein EC52239_4795 [Escherichia coli 5.2239]
 gi|425112407|ref|ZP_18514320.1| hypothetical protein EC60172_4950 [Escherichia coli 6.0172]
 gi|425128335|ref|ZP_18529494.1| hypothetical protein EC80586_5102 [Escherichia coli 8.0586]
 gi|425134077|ref|ZP_18534919.1| hypothetical protein EC82524_4714 [Escherichia coli 8.2524]
 gi|425140695|ref|ZP_18541067.1| hypothetical protein EC100833_5124 [Escherichia coli 10.0833]
 gi|425146362|ref|ZP_18546346.1| hypothetical protein EC100869_4617 [Escherichia coli 10.0869]
 gi|425152482|ref|ZP_18552087.1| hypothetical protein EC880221_4757 [Escherichia coli 88.0221]
 gi|425158354|ref|ZP_18557610.1| hypothetical protein ECPA34_4912 [Escherichia coli PA34]
 gi|425164699|ref|ZP_18563578.1| hypothetical protein ECFDA506_5110 [Escherichia coli FDA506]
 gi|425170445|ref|ZP_18568910.1| hypothetical protein ECFDA507_4850 [Escherichia coli FDA507]
 gi|425176495|ref|ZP_18574606.1| hypothetical protein ECFDA504_4775 [Escherichia coli FDA504]
 gi|425188821|ref|ZP_18586085.1| hypothetical protein ECFRIK1997_5034 [Escherichia coli FRIK1997]
 gi|425202058|ref|ZP_18598257.1| hypothetical protein ECNE037_5169 [Escherichia coli NE037]
 gi|425214195|ref|ZP_18609587.1| hypothetical protein ECPA4_4926 [Escherichia coli PA4]
 gi|425220319|ref|ZP_18615273.1| hypothetical protein ECPA23_4795 [Escherichia coli PA23]
 gi|425226960|ref|ZP_18621418.1| hypothetical protein ECPA49_5019 [Escherichia coli PA49]
 gi|425233121|ref|ZP_18627153.1| hypothetical protein ECPA45_4973 [Escherichia coli PA45]
 gi|425239047|ref|ZP_18632758.1| hypothetical protein ECTT12B_4670 [Escherichia coli TT12B]
 gi|425257257|ref|ZP_18649759.1| hypothetical protein ECCB7326_4836 [Escherichia coli CB7326]
 gi|425269512|ref|ZP_18661133.1| hypothetical protein EC5412_4766 [Escherichia coli 5412]
 gi|425296972|ref|ZP_18687122.1| hypothetical protein ECPA38_4622 [Escherichia coli PA38]
 gi|425313655|ref|ZP_18702824.1| hypothetical protein ECEC1735_4763 [Escherichia coli EC1735]
 gi|425319635|ref|ZP_18708414.1| hypothetical protein ECEC1736_4708 [Escherichia coli EC1736]
 gi|425325746|ref|ZP_18714090.1| hypothetical protein ECEC1737_4710 [Escherichia coli EC1737]
 gi|425332099|ref|ZP_18719925.1| hypothetical protein ECEC1846_4816 [Escherichia coli EC1846]
 gi|425338276|ref|ZP_18725622.1| hypothetical protein ECEC1847_4840 [Escherichia coli EC1847]
 gi|425344593|ref|ZP_18731474.1| hypothetical protein ECEC1848_4958 [Escherichia coli EC1848]
 gi|425350429|ref|ZP_18736886.1| hypothetical protein ECEC1849_4723 [Escherichia coli EC1849]
 gi|425356701|ref|ZP_18742759.1| hypothetical protein ECEC1850_4950 [Escherichia coli EC1850]
 gi|425362661|ref|ZP_18748298.1| hypothetical protein ECEC1856_4773 [Escherichia coli EC1856]
 gi|425368889|ref|ZP_18753993.1| hypothetical protein ECEC1862_4786 [Escherichia coli EC1862]
 gi|425375193|ref|ZP_18759826.1| hypothetical protein ECEC1864_4920 [Escherichia coli EC1864]
 gi|425388083|ref|ZP_18771633.1| hypothetical protein ECEC1866_4680 [Escherichia coli EC1866]
 gi|425394775|ref|ZP_18777875.1| hypothetical protein ECEC1868_4993 [Escherichia coli EC1868]
 gi|425400871|ref|ZP_18783568.1| hypothetical protein ECEC1869_4939 [Escherichia coli EC1869]
 gi|425406963|ref|ZP_18789176.1| hypothetical protein ECEC1870_4738 [Escherichia coli EC1870]
 gi|425413349|ref|ZP_18795102.1| hypothetical protein ECNE098_4927 [Escherichia coli NE098]
 gi|425419660|ref|ZP_18800921.1| hypothetical protein ECFRIK523_4776 [Escherichia coli FRIK523]
 gi|425430935|ref|ZP_18811535.1| hypothetical protein EC01304_4902 [Escherichia coli 0.1304]
 gi|428955440|ref|ZP_19027224.1| hypothetical protein EC881042_4795 [Escherichia coli 88.1042]
 gi|428961439|ref|ZP_19032721.1| hypothetical protein EC890511_4747 [Escherichia coli 89.0511]
 gi|428968048|ref|ZP_19038750.1| hypothetical protein EC900091_5143 [Escherichia coli 90.0091]
 gi|428980186|ref|ZP_19049993.1| hypothetical protein EC902281_4755 [Escherichia coli 90.2281]
 gi|428985972|ref|ZP_19055354.1| hypothetical protein EC930055_4673 [Escherichia coli 93.0055]
 gi|428992156|ref|ZP_19061135.1| hypothetical protein EC930056_4727 [Escherichia coli 93.0056]
 gi|428998047|ref|ZP_19066631.1| hypothetical protein EC940618_4635 [Escherichia coli 94.0618]
 gi|429010405|ref|ZP_19077843.1| hypothetical protein EC951288_4497 [Escherichia coli 95.1288]
 gi|429016933|ref|ZP_19083806.1| hypothetical protein EC950943_4910 [Escherichia coli 95.0943]
 gi|429022675|ref|ZP_19089186.1| hypothetical protein EC960428_4625 [Escherichia coli 96.0428]
 gi|429028846|ref|ZP_19094826.1| hypothetical protein EC960427_4802 [Escherichia coli 96.0427]
 gi|429041099|ref|ZP_19106187.1| hypothetical protein EC960932_4872 [Escherichia coli 96.0932]
 gi|429046954|ref|ZP_19111657.1| hypothetical protein EC960107_4684 [Escherichia coli 96.0107]
 gi|429052309|ref|ZP_19116869.1| hypothetical protein EC970003_4427 [Escherichia coli 97.0003]
 gi|429057821|ref|ZP_19122084.1| hypothetical protein EC971742_4298 [Escherichia coli 97.1742]
 gi|429063366|ref|ZP_19127341.1| hypothetical protein EC970007_4185 [Escherichia coli 97.0007]
 gi|429070723|ref|ZP_19134102.1| hypothetical protein EC990678_5252 [Escherichia coli 99.0678]
 gi|429081416|ref|ZP_19144532.1| hypothetical protein EC990713_5250 [Escherichia coli 99.0713]
 gi|429828751|ref|ZP_19359758.1| hypothetical protein EC960109_4876 [Escherichia coli 96.0109]
 gi|429835191|ref|ZP_19365469.1| hypothetical protein EC970010_4835 [Escherichia coli 97.0010]
 gi|444927256|ref|ZP_21246521.1| hypothetical protein EC09BKT78844_4882 [Escherichia coli
           09BKT078844]
 gi|444932846|ref|ZP_21251863.1| hypothetical protein EC990814_4226 [Escherichia coli 99.0814]
 gi|444938322|ref|ZP_21257070.1| hypothetical protein EC990815_4259 [Escherichia coli 99.0815]
 gi|444943914|ref|ZP_21262410.1| hypothetical protein EC990816_4314 [Escherichia coli 99.0816]
 gi|444949405|ref|ZP_21267701.1| hypothetical protein EC990839_4245 [Escherichia coli 99.0839]
 gi|444955079|ref|ZP_21273151.1| hypothetical protein EC990848_4357 [Escherichia coli 99.0848]
 gi|444960466|ref|ZP_21278295.1| hypothetical protein EC991753_4289 [Escherichia coli 99.1753]
 gi|444965679|ref|ZP_21283249.1| hypothetical protein EC991775_4151 [Escherichia coli 99.1775]
 gi|444971675|ref|ZP_21289020.1| hypothetical protein EC991793_4596 [Escherichia coli 99.1793]
 gi|444976975|ref|ZP_21294065.1| hypothetical protein EC991805_4182 [Escherichia coli 99.1805]
 gi|444982346|ref|ZP_21299247.1| hypothetical protein ECATCC700728_4176 [Escherichia coli ATCC
           700728]
 gi|444988560|ref|ZP_21305317.1| hypothetical protein ECPA11_5184 [Escherichia coli PA11]
 gi|444993068|ref|ZP_21309704.1| hypothetical protein ECPA19_4335 [Escherichia coli PA19]
 gi|444998301|ref|ZP_21314794.1| hypothetical protein ECPA13_4095 [Escherichia coli PA13]
 gi|445004788|ref|ZP_21321157.1| hypothetical protein ECPA2_5358 [Escherichia coli PA2]
 gi|445004922|ref|ZP_21321282.1| hypothetical protein ECPA47_5269 [Escherichia coli PA47]
 gi|445015398|ref|ZP_21331479.1| hypothetical protein ECPA48_5173 [Escherichia coli PA48]
 gi|445015754|ref|ZP_21331819.1| hypothetical protein ECPA8_5306 [Escherichia coli PA8]
 gi|445021071|ref|ZP_21337012.1| hypothetical protein EC71982_5249 [Escherichia coli 7.1982]
 gi|445028321|ref|ZP_21344063.1| hypothetical protein EC991781_1757 [Escherichia coli 99.1781]
 gi|445031935|ref|ZP_21347574.1| hypothetical protein EC991762_5373 [Escherichia coli 99.1762]
 gi|445042200|ref|ZP_21357565.1| hypothetical protein ECPA35_4500 [Escherichia coli PA35]
 gi|445043905|ref|ZP_21359240.1| hypothetical protein EC34880_0888 [Escherichia coli 3.4880]
 gi|445052978|ref|ZP_21367995.1| hypothetical protein EC950083_4258 [Escherichia coli 95.0083]
 gi|445061011|ref|ZP_21373522.1| hypothetical protein EC990670_4482 [Escherichia coli 99.0670]
 gi|452968310|ref|ZP_21966537.1| hypothetical protein EC4009_RS06445 [Escherichia coli O157:H7 str.
           EC4009]
 gi|12518318|gb|AAG58726.1|AE005584_8 orf; hypothetical protein [Escherichia coli O157:H7 str. EDL933]
 gi|13363934|dbj|BAB37882.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
 gi|187771563|gb|EDU35407.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4196]
 gi|188018366|gb|EDU56488.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4113]
 gi|189002301|gb|EDU71287.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4076]
 gi|189358833|gb|EDU77252.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4401]
 gi|189364486|gb|EDU82905.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4486]
 gi|189369459|gb|EDU87875.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4501]
 gi|189380134|gb|EDU98550.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC508]
 gi|208729392|gb|EDZ78993.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4206]
 gi|208735024|gb|EDZ83711.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4045]
 gi|208741603|gb|EDZ89285.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4042]
 gi|209160872|gb|ACI38305.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4115]
 gi|217320495|gb|EEC28919.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           TW14588]
 gi|254594934|gb|ACT74295.1| hypothetical protein ECSP_4573 [Escherichia coli O157:H7 str.
           TW14359]
 gi|290764826|gb|ADD58787.1| hypothetical protein G2583_4318 [Escherichia coli O55:H7 str.
           CB9615]
 gi|320191367|gb|EFW66017.1| hypothetical protein ECoD_02794 [Escherichia coli O157:H7 str.
           EC1212]
 gi|320639897|gb|EFX09491.1| hypothetical protein ECO5101_02775 [Escherichia coli O157:H7 str.
           G5101]
 gi|320645061|gb|EFX14085.1| hypothetical protein ECO9389_12801 [Escherichia coli O157:H- str.
           493-89]
 gi|320650327|gb|EFX18810.1| hypothetical protein ECO2687_16571 [Escherichia coli O157:H- str. H
           2687]
 gi|320655901|gb|EFX23824.1| hypothetical protein ECO7815_03795 [Escherichia coli O55:H7 str.
           3256-97 TW 07815]
 gi|320666706|gb|EFX33689.1| hypothetical protein ECOSU61_16870 [Escherichia coli O157:H7 str.
           LSU-61]
 gi|326337419|gb|EGD61254.1| hypothetical protein ECoA_05479 [Escherichia coli O157:H7 str.
           1044]
 gi|326339944|gb|EGD63751.1| hypothetical protein ECF_04238 [Escherichia coli O157:H7 str. 1125]
 gi|374360980|gb|AEZ42687.1| hypothetical protein ECO55CA74_20675 [Escherichia coli O55:H7 str.
           RM12579]
 gi|377889685|gb|EHU54145.1| hypothetical protein ECDEC3A_4594 [Escherichia coli DEC3A]
 gi|377889783|gb|EHU54242.1| hypothetical protein ECDEC3B_4713 [Escherichia coli DEC3B]
 gi|377903272|gb|EHU67570.1| hypothetical protein ECDEC3C_4952 [Escherichia coli DEC3C]
 gi|377907386|gb|EHU71622.1| hypothetical protein ECDEC3D_4717 [Escherichia coli DEC3D]
 gi|377908341|gb|EHU72558.1| hypothetical protein ECDEC3E_4925 [Escherichia coli DEC3E]
 gi|377918108|gb|EHU82161.1| hypothetical protein ECDEC3F_4810 [Escherichia coli DEC3F]
 gi|377924259|gb|EHU88215.1| hypothetical protein ECDEC4A_4458 [Escherichia coli DEC4A]
 gi|377927762|gb|EHU91677.1| hypothetical protein ECDEC4B_4655 [Escherichia coli DEC4B]
 gi|377939056|gb|EHV02814.1| hypothetical protein ECDEC4C_4545 [Escherichia coli DEC4C]
 gi|377944467|gb|EHV08170.1| hypothetical protein ECDEC4E_4577 [Escherichia coli DEC4E]
 gi|377954643|gb|EHV18202.1| hypothetical protein ECDEC4F_4462 [Escherichia coli DEC4F]
 gi|377957760|gb|EHV21288.1| hypothetical protein ECDEC5A_4358 [Escherichia coli DEC5A]
 gi|377962943|gb|EHV26395.1| hypothetical protein ECDEC5B_4720 [Escherichia coli DEC5B]
 gi|377970279|gb|EHV33643.1| hypothetical protein ECDEC5C_4407 [Escherichia coli DEC5C]
 gi|377972443|gb|EHV35793.1| hypothetical protein ECDEC5D_4501 [Escherichia coli DEC5D]
 gi|377981006|gb|EHV44266.1| hypothetical protein ECDEC5E_4420 [Escherichia coli DEC5E]
 gi|386798218|gb|AFJ31252.1| hypothetical protein CDCO157_4196 [Escherichia coli Xuzhou21]
 gi|390639210|gb|EIN18690.1| hypothetical protein ECFDA505_4757 [Escherichia coli FDA505]
 gi|390639622|gb|EIN19093.1| hypothetical protein ECFDA517_5074 [Escherichia coli FDA517]
 gi|390657072|gb|EIN34899.1| hypothetical protein ECFRIK1985_5028 [Escherichia coli FRIK1985]
 gi|390657374|gb|EIN35192.1| hypothetical protein EC93001_4874 [Escherichia coli 93-001]
 gi|390674723|gb|EIN50894.1| hypothetical protein ECPA3_4889 [Escherichia coli PA3]
 gi|390678199|gb|EIN54182.1| hypothetical protein ECPA5_4745 [Escherichia coli PA5]
 gi|390682075|gb|EIN57859.1| hypothetical protein ECPA9_4920 [Escherichia coli PA9]
 gi|390693074|gb|EIN67718.1| hypothetical protein ECPA10_5070 [Escherichia coli PA10]
 gi|390697368|gb|EIN71789.1| hypothetical protein ECPA14_4913 [Escherichia coli PA14]
 gi|390698263|gb|EIN72649.1| hypothetical protein ECPA15_5031 [Escherichia coli PA15]
 gi|390712206|gb|EIN85163.1| hypothetical protein ECPA22_4925 [Escherichia coli PA22]
 gi|390719137|gb|EIN91871.1| hypothetical protein ECPA25_4614 [Escherichia coli PA25]
 gi|390720026|gb|EIN92739.1| hypothetical protein ECPA24_4636 [Escherichia coli PA24]
 gi|390725222|gb|EIN97742.1| hypothetical protein ECPA28_4952 [Escherichia coli PA28]
 gi|390738126|gb|EIO09345.1| hypothetical protein ECPA31_4706 [Escherichia coli PA31]
 gi|390738929|gb|EIO10125.1| hypothetical protein ECPA32_4774 [Escherichia coli PA32]
 gi|390742351|gb|EIO13360.1| hypothetical protein ECPA33_4772 [Escherichia coli PA33]
 gi|390761275|gb|EIO30571.1| hypothetical protein ECPA39_4850 [Escherichia coli PA39]
 gi|390765920|gb|EIO35069.1| hypothetical protein ECPA42_4929 [Escherichia coli PA42]
 gi|390779851|gb|EIO47565.1| hypothetical protein ECTW06591_4436 [Escherichia coli TW06591]
 gi|390786558|gb|EIO54065.1| hypothetical protein ECTW07945_4773 [Escherichia coli TW07945]
 gi|390787899|gb|EIO55372.1| hypothetical protein ECTW10246_5014 [Escherichia coli TW10246]
 gi|390793629|gb|EIO60962.1| hypothetical protein ECTW11039_4946 [Escherichia coli TW11039]
 gi|390801428|gb|EIO68486.1| hypothetical protein ECTW09098_4852 [Escherichia coli TW09098]
 gi|390804995|gb|EIO71943.1| hypothetical protein ECTW09109_5140 [Escherichia coli TW09109]
 gi|390814183|gb|EIO80763.1| hypothetical protein ECTW10119_5262 [Escherichia coli TW10119]
 gi|390823323|gb|EIO89388.1| hypothetical protein ECEC4203_4848 [Escherichia coli EC4203]
 gi|390828114|gb|EIO93799.1| hypothetical protein ECEC4196_4900 [Escherichia coli EC4196]
 gi|390841966|gb|EIP05848.1| hypothetical protein ECTW14313_4703 [Escherichia coli TW14313]
 gi|390843557|gb|EIP07344.1| hypothetical protein ECTW14301_4626 [Escherichia coli TW14301]
 gi|390848287|gb|EIP11762.1| hypothetical protein ECEC4421_4687 [Escherichia coli EC4421]
 gi|390858717|gb|EIP21090.1| hypothetical protein ECEC4422_4816 [Escherichia coli EC4422]
 gi|390863135|gb|EIP25287.1| hypothetical protein ECEC4013_4958 [Escherichia coli EC4013]
 gi|390867335|gb|EIP29163.1| hypothetical protein ECEC4402_4833 [Escherichia coli EC4402]
 gi|390875728|gb|EIP36731.1| hypothetical protein ECEC4439_4786 [Escherichia coli EC4439]
 gi|390881173|gb|EIP41787.1| hypothetical protein ECEC4436_4739 [Escherichia coli EC4436]
 gi|390890973|gb|EIP50619.1| hypothetical protein ECEC4437_4898 [Escherichia coli EC4437]
 gi|390892686|gb|EIP52258.1| hypothetical protein ECEC4448_4800 [Escherichia coli EC4448]
 gi|390898319|gb|EIP57592.1| hypothetical protein ECEC1738_4805 [Escherichia coli EC1738]
 gi|390906250|gb|EIP65153.1| hypothetical protein ECEC1734_4754 [Escherichia coli EC1734]
 gi|390916344|gb|EIP74812.1| hypothetical protein ECEC1863_4496 [Escherichia coli EC1863]
 gi|390916988|gb|EIP75422.1| hypothetical protein ECEC1845_4760 [Escherichia coli EC1845]
 gi|408062465|gb|EKG96971.1| hypothetical protein ECPA7_5406 [Escherichia coli PA7]
 gi|408066781|gb|EKH01227.1| hypothetical protein ECPA34_4912 [Escherichia coli PA34]
 gi|408077084|gb|EKH11298.1| hypothetical protein ECFDA506_5110 [Escherichia coli FDA506]
 gi|408080700|gb|EKH14758.1| hypothetical protein ECFDA507_4850 [Escherichia coli FDA507]
 gi|408088919|gb|EKH22258.1| hypothetical protein ECFDA504_4775 [Escherichia coli FDA504]
 gi|408101414|gb|EKH33866.1| hypothetical protein ECFRIK1997_5034 [Escherichia coli FRIK1997]
 gi|408112898|gb|EKH44512.1| hypothetical protein ECNE037_5169 [Escherichia coli NE037]
 gi|408125331|gb|EKH55940.1| hypothetical protein ECPA4_4926 [Escherichia coli PA4]
 gi|408135214|gb|EKH65012.1| hypothetical protein ECPA23_4795 [Escherichia coli PA23]
 gi|408137363|gb|EKH67065.1| hypothetical protein ECPA49_5019 [Escherichia coli PA49]
 gi|408144386|gb|EKH73624.1| hypothetical protein ECPA45_4973 [Escherichia coli PA45]
 gi|408152571|gb|EKH81000.1| hypothetical protein ECTT12B_4670 [Escherichia coli TT12B]
 gi|408171077|gb|EKH98219.1| hypothetical protein ECCB7326_4836 [Escherichia coli CB7326]
 gi|408180941|gb|EKI07530.1| hypothetical protein EC5412_4766 [Escherichia coli 5412]
 gi|408214152|gb|EKI38607.1| hypothetical protein ECPA38_4622 [Escherichia coli PA38]
 gi|408224415|gb|EKI48128.1| hypothetical protein ECEC1735_4763 [Escherichia coli EC1735]
 gi|408235748|gb|EKI58682.1| hypothetical protein ECEC1736_4708 [Escherichia coli EC1736]
 gi|408239233|gb|EKI61987.1| hypothetical protein ECEC1737_4710 [Escherichia coli EC1737]
 gi|408244183|gb|EKI66641.1| hypothetical protein ECEC1846_4816 [Escherichia coli EC1846]
 gi|408252867|gb|EKI74491.1| hypothetical protein ECEC1847_4840 [Escherichia coli EC1847]
 gi|408256804|gb|EKI78168.1| hypothetical protein ECEC1848_4958 [Escherichia coli EC1848]
 gi|408263244|gb|EKI84109.1| hypothetical protein ECEC1849_4723 [Escherichia coli EC1849]
 gi|408271922|gb|EKI92038.1| hypothetical protein ECEC1850_4950 [Escherichia coli EC1850]
 gi|408274623|gb|EKI94619.1| hypothetical protein ECEC1856_4773 [Escherichia coli EC1856]
 gi|408283205|gb|EKJ02419.1| hypothetical protein ECEC1862_4786 [Escherichia coli EC1862]
 gi|408289130|gb|EKJ07907.1| hypothetical protein ECEC1864_4920 [Escherichia coli EC1864]
 gi|408304578|gb|EKJ22002.1| hypothetical protein ECEC1868_4993 [Escherichia coli EC1868]
 gi|408305359|gb|EKJ22756.1| hypothetical protein ECEC1866_4680 [Escherichia coli EC1866]
 gi|408316515|gb|EKJ32784.1| hypothetical protein ECEC1869_4939 [Escherichia coli EC1869]
 gi|408321867|gb|EKJ37871.1| hypothetical protein ECEC1870_4738 [Escherichia coli EC1870]
 gi|408324176|gb|EKJ40122.1| hypothetical protein ECNE098_4927 [Escherichia coli NE098]
 gi|408334438|gb|EKJ49326.1| hypothetical protein ECFRIK523_4776 [Escherichia coli FRIK523]
 gi|408343399|gb|EKJ57802.1| hypothetical protein EC01304_4902 [Escherichia coli 0.1304]
 gi|408545930|gb|EKK23352.1| hypothetical protein EC52239_4795 [Escherichia coli 5.2239]
 gi|408546745|gb|EKK24159.1| hypothetical protein EC34870_4837 [Escherichia coli 3.4870]
 gi|408547047|gb|EKK24447.1| hypothetical protein EC60172_4950 [Escherichia coli 6.0172]
 gi|408564499|gb|EKK40604.1| hypothetical protein EC80586_5102 [Escherichia coli 8.0586]
 gi|408576191|gb|EKK51804.1| hypothetical protein EC100833_5124 [Escherichia coli 10.0833]
 gi|408579122|gb|EKK54601.1| hypothetical protein EC82524_4714 [Escherichia coli 8.2524]
 gi|408588994|gb|EKK63538.1| hypothetical protein EC100869_4617 [Escherichia coli 10.0869]
 gi|408594205|gb|EKK68496.1| hypothetical protein EC880221_4757 [Escherichia coli 88.0221]
 gi|408599378|gb|EKK73290.1| hypothetical protein EC80416_4344 [Escherichia coli 8.0416]
 gi|408606541|gb|EKK79968.1| hypothetical protein EC100821_5204 [Escherichia coli 10.0821]
 gi|427201963|gb|EKV72321.1| hypothetical protein EC881042_4795 [Escherichia coli 88.1042]
 gi|427202497|gb|EKV72822.1| hypothetical protein EC890511_4747 [Escherichia coli 89.0511]
 gi|427218432|gb|EKV87442.1| hypothetical protein EC900091_5143 [Escherichia coli 90.0091]
 gi|427221712|gb|EKV90524.1| hypothetical protein EC902281_4755 [Escherichia coli 90.2281]
 gi|427238946|gb|EKW06445.1| hypothetical protein EC930056_4727 [Escherichia coli 93.0056]
 gi|427239084|gb|EKW06577.1| hypothetical protein EC930055_4673 [Escherichia coli 93.0055]
 gi|427243369|gb|EKW10745.1| hypothetical protein EC940618_4635 [Escherichia coli 94.0618]
 gi|427258569|gb|EKW24654.1| hypothetical protein EC950943_4910 [Escherichia coli 95.0943]
 gi|427260727|gb|EKW26692.1| hypothetical protein EC951288_4497 [Escherichia coli 95.1288]
 gi|427273802|gb|EKW38469.1| hypothetical protein EC960428_4625 [Escherichia coli 96.0428]
 gi|427276260|gb|EKW40835.1| hypothetical protein EC960427_4802 [Escherichia coli 96.0427]
 gi|427289537|gb|EKW53075.1| hypothetical protein EC960932_4872 [Escherichia coli 96.0932]
 gi|427296261|gb|EKW59321.1| hypothetical protein EC960107_4684 [Escherichia coli 96.0107]
 gi|427298383|gb|EKW61393.1| hypothetical protein EC970003_4427 [Escherichia coli 97.0003]
 gi|427308631|gb|EKW70996.1| hypothetical protein EC971742_4298 [Escherichia coli 97.1742]
 gi|427311712|gb|EKW73893.1| hypothetical protein EC970007_4185 [Escherichia coli 97.0007]
 gi|427324889|gb|EKW86347.1| hypothetical protein EC990713_5250 [Escherichia coli 99.0713]
 gi|427336056|gb|EKW97058.1| hypothetical protein EC990678_5252 [Escherichia coli 99.0678]
 gi|429251455|gb|EKY36050.1| hypothetical protein EC960109_4876 [Escherichia coli 96.0109]
 gi|429252515|gb|EKY37047.1| hypothetical protein EC970010_4835 [Escherichia coli 97.0010]
 gi|444535665|gb|ELV15735.1| hypothetical protein EC990814_4226 [Escherichia coli 99.0814]
 gi|444536994|gb|ELV16959.1| hypothetical protein EC09BKT78844_4882 [Escherichia coli
           09BKT078844]
 gi|444545831|gb|ELV24637.1| hypothetical protein EC990815_4259 [Escherichia coli 99.0815]
 gi|444555151|gb|ELV32633.1| hypothetical protein EC990839_4245 [Escherichia coli 99.0839]
 gi|444555319|gb|ELV32789.1| hypothetical protein EC990816_4314 [Escherichia coli 99.0816]
 gi|444560365|gb|ELV37532.1| hypothetical protein EC990848_4357 [Escherichia coli 99.0848]
 gi|444569733|gb|ELV46300.1| hypothetical protein EC991753_4289 [Escherichia coli 99.1753]
 gi|444573453|gb|ELV49819.1| hypothetical protein EC991775_4151 [Escherichia coli 99.1775]
 gi|444577174|gb|ELV53320.1| hypothetical protein EC991793_4596 [Escherichia coli 99.1793]
 gi|444588184|gb|ELV63570.1| hypothetical protein ECPA11_5184 [Escherichia coli PA11]
 gi|444589994|gb|ELV65310.1| hypothetical protein EC991805_4182 [Escherichia coli 99.1805]
 gi|444590079|gb|ELV65394.1| hypothetical protein ECATCC700728_4176 [Escherichia coli ATCC
           700728]
 gi|444604008|gb|ELV78694.1| hypothetical protein ECPA13_4095 [Escherichia coli PA13]
 gi|444604410|gb|ELV79084.1| hypothetical protein ECPA19_4335 [Escherichia coli PA19]
 gi|444611225|gb|ELV85574.1| hypothetical protein ECPA2_5358 [Escherichia coli PA2]
 gi|444618641|gb|ELV92715.1| hypothetical protein ECPA48_5173 [Escherichia coli PA48]
 gi|444634620|gb|ELW08085.1| hypothetical protein ECPA47_5269 [Escherichia coli PA47]
 gi|444639829|gb|ELW13128.1| hypothetical protein ECPA8_5306 [Escherichia coli PA8]
 gi|444646552|gb|ELW19556.1| hypothetical protein EC991781_1757 [Escherichia coli 99.1781]
 gi|444649874|gb|ELW22742.1| hypothetical protein EC71982_5249 [Escherichia coli 7.1982]
 gi|444652152|gb|ELW24923.1| hypothetical protein ECPA35_4500 [Escherichia coli PA35]
 gi|444655466|gb|ELW28079.1| hypothetical protein EC991762_5373 [Escherichia coli 99.1762]
 gi|444660513|gb|ELW32876.1| hypothetical protein EC950083_4258 [Escherichia coli 95.0083]
 gi|444666637|gb|ELW38700.1| hypothetical protein EC34880_0888 [Escherichia coli 3.4880]
 gi|444667586|gb|ELW39621.1| hypothetical protein EC990670_4482 [Escherichia coli 99.0670]
          Length = 656

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|332666559|ref|YP_004449347.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332335373|gb|AEE52474.1| protein of unknown function DUF1680 [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 656

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 91/410 (22%), Positives = 159/410 (38%), Gaps = 76/410 (18%)

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLA--LQADYLSHFHANT 335
           HW + ++E   +   L ++Y +T+D + L  +H   +    G+       D+    +A  
Sbjct: 198 HWVTGHQE---LELALVKVYQVTNDKRFLDFSHWLLEERGHGYAHGYTWTDWKDTAYAQD 254

Query: 336 HIPI-----VIGSQMRY-----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTS 378
             P+     + G  +R              TGD  Y K + T + D+V   + Y TGG  
Sbjct: 255 IKPVSLTTEITGHAVRAMYLYTGAADVAAYTGDESYLKAMNTVWDDVVE-RNMYITGGIG 313

Query: 379 AREFWWDPKRLADTLGSENE----ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           +       +  +      NE    ETC +  M+  ++ + R T +  + D  E++L NG 
Sbjct: 314 SSG---SNEGFSKDYDLPNERAYCETCASVGMVFWNQRMNRLTGQTKFIDVLEKSLYNGA 370

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSI 490
           L        G+ +       G   A S    GT F   W    CC        + LGD I
Sbjct: 371 LD-------GLSLAGDRFFYGNPLASS----GTHFRREWFGTACCPSNIARLIASLGDYI 419

Query: 491 YFEEEGNVPGLYIIQYISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQ 548
           Y  +  ++   Y+  ++ S  + D   G V + Q+ +    W   +++T+      E  Q
Sbjct: 420 YASDPQSI---YVNLFVGSNTTIDLAKGKVEIRQETE--YPWKGLIKLTVN----PEKAQ 470

Query: 549 LSSLNLRMPVWTYSN-GAQA---------------SLNGQNLPLPPPGNFLSATERWSYN 592
             +L +R+P W   N GA A                +NGQ   L     +L     W+  
Sbjct: 471 SFALKIRLPGWAKGNPGAGALYKFLDEGPTNFATLKVNGQAQNLKLDNGYLIVERNWNKG 530

Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAG--HTSGEWDI 638
           D + + L + +R    +D+  +  +  A+  GP  Y + G  H    W++
Sbjct: 531 DVVELNLAMPIRRVVARDEVKDNENRMALQRGPLVYCVEGVDHNGSAWNL 580


>gi|432672680|ref|ZP_19908201.1| hypothetical protein A1Y7_04240 [Escherichia coli KTE119]
 gi|431207880|gb|ELF06125.1| hypothetical protein A1Y7_04240 [Escherichia coli KTE119]
          Length = 656

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 85/365 (23%), Positives = 133/365 (36%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
             H+P+      IG  +R  Y +TG         D   +       + +     Y TGG 
Sbjct: 252 QAHLPLAQQQTAIGHAVRFAYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + +G  +Y   E     LYI  Y  +S +    +  L  +V     W    ++T+ 
Sbjct: 420 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D L + L
Sbjct: 475 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTL 530

Query: 600 PLSLR 604
           P+ +R
Sbjct: 531 PMPVR 535


>gi|397691075|ref|YP_006528329.1| six-hairpin glycosidase [Melioribacter roseus P3M]
 gi|395812567|gb|AFN75316.1| six-hairpin glycosidase [Melioribacter roseus P3M]
          Length = 643

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 90/385 (23%), Positives = 142/385 (36%), Gaps = 67/385 (17%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF----HANTHIPIV-----IGS 343
           L +LY IT   +++ LA  F        L ++ D  +H     +A  HIP+V     +G 
Sbjct: 219 LIKLYQITGKKEYMELAKFF--------LDIRGDSTTHKLYGEYAQDHIPLVEQKEAVGH 270

Query: 344 QMR----YEVTGD--------PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
            +R    Y    D           K + T + ++VN   +Y TGG  AR    D +   D
Sbjct: 271 AVRALYMYAAMTDIAVLHDDEDYRKAVFTLWDNVVN-KKTYITGGLGARH---DGEAFGD 326

Query: 392 TLGSEN----EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
                N     ETC     +  +  LF  T +  YAD  ER L NG++S     +     
Sbjct: 327 DYELPNLTAYGETCAAIGSVYWNYRLFEMTGDSKYADVIERTLYNGLIS-GISLDGKNFF 385

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
           Y  PL    S        G      W    CC    I     L   IY  +  +V   Y+
Sbjct: 386 YPNPLE---SDGEYKFNMGACTRQPWFDCSCCPTNLIRFIPSLPGLIYSVDRDSV---YV 439

Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT--- 560
             ++ S  D + G    N+ V  I      L   +T + + +     +L +R+P W+   
Sbjct: 440 NLFVGSKADIELG----NKNVRIIQKTSYPLDYKVTLNIEPQAATQFTLKIRIPGWSRNI 495

Query: 561 --------YSNGAQASL----NGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
                   Y+N     +    NG+   L     +   T+ W   DK+ + LP  ++    
Sbjct: 496 PLPGDLYRYANKQNGKIRLLVNGEEQSLNISSGYAVITKLWEKGDKVDLILPKEVKKVLA 555

Query: 609 QDDRPEYASIQAILFGPYLLAGHTS 633
            +   E  +  AI  GP++     +
Sbjct: 556 NEKVKENRNKVAIELGPFVYCAEEA 580


>gi|168785451|ref|ZP_02810458.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC869]
 gi|261224895|ref|ZP_05939176.1| hypothetical protein EscherichiacoliO157_09907 [Escherichia coli
           O157:H7 str. FRIK2000]
 gi|261254205|ref|ZP_05946738.1| hypothetical protein EscherichiacoliO157EcO_00065 [Escherichia coli
           O157:H7 str. FRIK966]
 gi|419100283|ref|ZP_13645472.1| hypothetical protein ECDEC4D_4507 [Escherichia coli DEC4D]
 gi|420277651|ref|ZP_14779931.1| hypothetical protein ECPA40_4914 [Escherichia coli PA40]
 gi|421826457|ref|ZP_16261810.1| hypothetical protein ECFRIK920_4874 [Escherichia coli FRIK920]
 gi|424092641|ref|ZP_17828567.1| hypothetical protein ECFRIK1996_4807 [Escherichia coli FRIK1996]
 gi|424105524|ref|ZP_17840261.1| hypothetical protein ECFRIK1990_4921 [Escherichia coli FRIK1990]
 gi|424470965|ref|ZP_17920770.1| hypothetical protein ECPA41_4862 [Escherichia coli PA41]
 gi|424496110|ref|ZP_17943684.1| hypothetical protein ECTW09195_4922 [Escherichia coli TW09195]
 gi|425182551|ref|ZP_18580237.1| hypothetical protein ECFRIK1999_4970 [Escherichia coli FRIK1999]
 gi|425195581|ref|ZP_18592342.1| hypothetical protein ECNE1487_5176 [Escherichia coli NE1487]
 gi|425208438|ref|ZP_18604226.1| hypothetical protein ECFRIK2001_5178 [Escherichia coli FRIK2001]
 gi|425245279|ref|ZP_18638577.1| hypothetical protein ECMA6_4977 [Escherichia coli MA6]
 gi|428949368|ref|ZP_19021633.1| hypothetical protein EC881467_4850 [Escherichia coli 88.1467]
 gi|428973751|ref|ZP_19044065.1| hypothetical protein EC900039_4634 [Escherichia coli 90.0039]
 gi|429004396|ref|ZP_19072475.1| hypothetical protein EC950183_4871 [Escherichia coli 95.0183]
 gi|429035002|ref|ZP_19100516.1| hypothetical protein EC960939_4827 [Escherichia coli 96.0939]
 gi|429069551|ref|ZP_19132995.1| hypothetical protein EC990672_4784 [Escherichia coli 99.0672]
 gi|189374407|gb|EDU92823.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC869]
 gi|377938510|gb|EHV02277.1| hypothetical protein ECDEC4D_4507 [Escherichia coli DEC4D]
 gi|390638393|gb|EIN17905.1| hypothetical protein ECFRIK1996_4807 [Escherichia coli FRIK1996]
 gi|390660758|gb|EIN38450.1| hypothetical protein ECFRIK1990_4921 [Escherichia coli FRIK1990]
 gi|390756526|gb|EIO26037.1| hypothetical protein ECPA40_4914 [Escherichia coli PA40]
 gi|390764034|gb|EIO33252.1| hypothetical protein ECPA41_4862 [Escherichia coli PA41]
 gi|390824028|gb|EIO90037.1| hypothetical protein ECTW09195_4922 [Escherichia coli TW09195]
 gi|408064841|gb|EKG99322.1| hypothetical protein ECFRIK920_4874 [Escherichia coli FRIK920]
 gi|408095070|gb|EKH28064.1| hypothetical protein ECFRIK1999_4970 [Escherichia coli FRIK1999]
 gi|408106180|gb|EKH38296.1| hypothetical protein ECNE1487_5176 [Escherichia coli NE1487]
 gi|408119214|gb|EKH50301.1| hypothetical protein ECFRIK2001_5178 [Escherichia coli FRIK2001]
 gi|408157817|gb|EKH85958.1| hypothetical protein ECMA6_4977 [Escherichia coli MA6]
 gi|427205698|gb|EKV75938.1| hypothetical protein EC881467_4850 [Escherichia coli 88.1467]
 gi|427225134|gb|EKV93792.1| hypothetical protein EC900039_4634 [Escherichia coli 90.0039]
 gi|427256997|gb|EKW23140.1| hypothetical protein EC950183_4871 [Escherichia coli 95.0183]
 gi|427281172|gb|EKW45506.1| hypothetical protein EC960939_4827 [Escherichia coli 96.0939]
 gi|427316599|gb|EKW78533.1| hypothetical protein EC990672_4784 [Escherichia coli 99.0672]
          Length = 656

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|331675072|ref|ZP_08375829.1| putative cytoplasmic protein [Escherichia coli TA280]
 gi|331067981|gb|EGI39379.1| putative cytoplasmic protein [Escherichia coli TA280]
          Length = 662

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 83/365 (22%), Positives = 131/365 (35%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259

Query: 334 NTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +       + +     Y TGG 
Sbjct: 260 QAHLPIAQQQTAIGHAVRFVYLMTAVAHLARLSHDESKRQDCLRLWNNMAQRQLYITGGI 319

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +        YAD  ERAL N 
Sbjct: 320 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGNSQYADVMERALYNT 376

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 377 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 427

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + +G  +Y   E     LYI  Y  +S +    +  L  +V     W    ++T+ 
Sbjct: 428 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIA 482

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D L + L
Sbjct: 483 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDMLNLTL 538

Query: 600 PLSLR 604
           P+ +R
Sbjct: 539 PMPVR 543


>gi|423240714|ref|ZP_17221828.1| hypothetical protein HMPREF1065_02451 [Bacteroides dorei
           CL03T12C01]
 gi|392643676|gb|EIY37425.1| hypothetical protein HMPREF1065_02451 [Bacteroides dorei
           CL03T12C01]
          Length = 811

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 99/435 (22%), Positives = 163/435 (37%), Gaps = 84/435 (19%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L  A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    T   + +     + TGG  +R     P+   +  G 
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC +   +  +  +F  T +  YAD  ERAL NGV+S       GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  I  F        +  +GN   +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT- 560
           Y+  +I S  D ++    +N +      WD  + + +T   +QE     +L +R+P WT 
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWTQ 488

Query: 561 ----------YSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
                     +++ AQA   S+NG  +       + +    W   D + I LP+ +R   
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGFKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548

Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQE 667
             D   +     AI  GP +      G+    +          +P+  S++A L+     
Sbjct: 549 ANDQVEDDHGKLAIERGPIMFC--LEGQDQADSTVFNKFIPDGTPMEASYDADLL----- 601

Query: 668 SGNSTFVMSNSNQSI 682
             N   V+S + + I
Sbjct: 602 --NGVMVLSGTAKEI 614


>gi|150009917|ref|YP_001304660.1| hypothetical protein BDI_3334 [Parabacteroides distasonis ATCC
           8503]
 gi|423333684|ref|ZP_17311465.1| hypothetical protein HMPREF1075_03116 [Parabacteroides distasonis
           CL03T12C09]
 gi|149938341|gb|ABR45038.1| putative exported protein [Parabacteroides distasonis ATCC 8503]
 gi|409226994|gb|EKN19896.1| hypothetical protein HMPREF1075_03116 [Parabacteroides distasonis
           CL03T12C09]
          Length = 683

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 89/380 (23%), Positives = 143/380 (37%), Gaps = 37/380 (9%)

Query: 279 WYSLNEETGGMN-DVLYRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTH 336
           W    E+ GG N  V+Y LY+IT D   L L  L  K  F    + L  D+LS   +   
Sbjct: 207 WTFWGEQRGGDNLMVVYWLYNITGDKFLLDLGELIHKQTFNWTDIFLNQDHLSRQLSLHC 266

Query: 337 IPIVIGSQ---MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD-T 392
           + +  G +   + Y+   DP         +  ++ +    TG     E      R  + T
Sbjct: 267 VNLAQGFKEPVVYYQQNQDPKQICAVKKAVKDIHNTIGLPTGLWGGDEL----LRFGEPT 322

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
            GSE    CT   M+     +   T ++ +ADY ER   N  L  Q   +     Y    
Sbjct: 323 TGSE---LCTAVEMMFSLEEMLEITGDVQWADYLERVAYNA-LPTQVTDDYSARQYYQQT 378

Query: 453 GRGVSKARSTHGWGT----------KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
            + V+  R    + T          +   + CC     + + KL  ++++    N  G+ 
Sbjct: 379 NQ-VAVTREWRNFSTPHDDTDILFGELTGYPCCTSNLHQGWPKLVQNLWYATADN--GIA 435

Query: 503 IIQYISSSFDWKSGHVVLNQ-KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
            + Y  SS   K  + V  Q + +    +D  L     F  K+        ++R+P W  
Sbjct: 436 ALVYAPSSVKAKVANGVTVQIEEETAYPFDETLHFKFAFEDKKIKRAFFPFHIRIPAW-- 493

Query: 562 SNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
            N     LNG+N+ +   PG        W   D LT++LP+ +           Y     
Sbjct: 494 CNQPVIKLNGENVVVDAYPGEIARINREWKQGDVLTVELPMQVAASRW------YGGSAV 547

Query: 621 ILFGPYLLAGHTSGEWDIKT 640
           I  GP + A   + +W+ KT
Sbjct: 548 IERGPLVYALKMNEKWEKKT 567


>gi|432487351|ref|ZP_19729258.1| hypothetical protein A15Y_03854 [Escherichia coli KTE212]
 gi|433175488|ref|ZP_20359993.1| hypothetical protein WGQ_03753 [Escherichia coli KTE232]
 gi|431013718|gb|ELD27447.1| hypothetical protein A15Y_03854 [Escherichia coli KTE212]
 gi|431688314|gb|ELJ53849.1| hypothetical protein WGQ_03753 [Escherichia coli KTE232]
          Length = 656

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 85/365 (23%), Positives = 133/365 (36%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
             H+P+      IG  +R  Y +TG         D   +       + +     Y TGG 
Sbjct: 252 QAHLPLAQQQTAIGHAVRFAYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + +G  +Y   E     LYI  Y  +S +    +  L  +V     W    ++T+ 
Sbjct: 420 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPLENGTLRLRVSGNYPWQE--QVTIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D L + L
Sbjct: 475 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTL 530

Query: 600 PLSLR 604
           P+ +R
Sbjct: 531 PMPVR 535


>gi|432836527|ref|ZP_20070058.1| hypothetical protein A1YO_03904 [Escherichia coli KTE136]
 gi|431382143|gb|ELG66487.1| hypothetical protein A1YO_03904 [Escherichia coli KTE136]
          Length = 659

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGVQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|301307791|ref|ZP_07213747.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|423337090|ref|ZP_17314834.1| hypothetical protein HMPREF1059_00759 [Parabacteroides distasonis
           CL09T03C24]
 gi|300834134|gb|EFK64748.1| conserved hypothetical protein [Bacteroides sp. 20_3]
 gi|409238278|gb|EKN31071.1| hypothetical protein HMPREF1059_00759 [Parabacteroides distasonis
           CL09T03C24]
          Length = 680

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 92/424 (21%), Positives = 170/424 (40%), Gaps = 41/424 (9%)

Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
           ++  +L QY  A N Q  ++ +++  YF  ++ ++    S    W    E+ GG N  V+
Sbjct: 164 VMLKVLQQYYSATNDQ--RVISFLTNYFKYQLSELPK--SPLGKWTFWAEQRGGDNLMVV 219

Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
           Y LY+IT DP  L L  L  K  F    + L  D+L+  ++   + +  G +   + Y+ 
Sbjct: 220 YWLYNITGDPFLLELGELIHKQTFNWTDVFLNQDHLARQNSLHCVNLAQGFKEPVIYYQQ 279

Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
           + +P         +  +  +  + TG       W   + L     ++  E CT   M+  
Sbjct: 280 SHNPQNLEAVKEAVRKMRHTIGFPTG------LWAGDELLRFGNPTQGSELCTAVEMMFS 333

Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL--------GRG-VSKAR 460
              +   T ++ +AD+ E+   N VL  Q   +     Y   +        GR  VS   
Sbjct: 334 LEKMLEITGDVQWADHLEKVAYN-VLPTQIKDDFSARQYYQQVNQIAITCEGRNFVSPHE 392

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--- 517
            T     + + + CC     + + K    ++F    N  G+  + Y  S    + G+   
Sbjct: 393 DTDIIFGELSGYPCCTSNLHQGWPKFTRHLWFATADN--GIASLIYAPSEVTAQVGNDIT 450

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
           V + +K D    ++  +   L+F SK++       +LR+P W   N    ++NG+ + + 
Sbjct: 451 VKIAEKTD--YPFEEKIDFNLSFPSKKDKKAFFPFHLRIPAW--CNNPVITINGEAVSIA 506

Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
              G  +     W   D + ++LP+ + T    DD         I  GP L +     +W
Sbjct: 507 AHSGEIVRINREWKDGDHIQLELPMRISTSNWYDD------AVVIERGPLLYSLKMDEKW 560

Query: 637 DIKT 640
           + K 
Sbjct: 561 ERKV 564


>gi|238910286|ref|ZP_04654123.1| hypothetical protein SentesTe_04004 [Salmonella enterica subsp.
           enterica serovar Tennessee str. CDC07-0191]
          Length = 651

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 90/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +   G+  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEIPVGNGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|419864579|ref|ZP_14387018.1| hypothetical protein ECO9340_21841 [Escherichia coli O103:H25 str.
           CVM9340]
 gi|388339862|gb|EIL06180.1| hypothetical protein ECO9340_21841 [Escherichia coli O103:H25 str.
           CVM9340]
          Length = 659

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGVQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|255012841|ref|ZP_05284967.1| hypothetical protein B2_02974 [Bacteroides sp. 2_1_7]
 gi|410102231|ref|ZP_11297158.1| hypothetical protein HMPREF0999_00930 [Parabacteroides sp. D25]
 gi|409238953|gb|EKN31741.1| hypothetical protein HMPREF0999_00930 [Parabacteroides sp. D25]
          Length = 680

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 92/424 (21%), Positives = 170/424 (40%), Gaps = 41/424 (9%)

Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
           ++  +L QY  A N Q  ++ +++  YF  ++ ++    S    W    E+ GG N  V+
Sbjct: 164 VMLKVLQQYYSATNDQ--RVISFLTNYFKYQLSELPK--SPLGKWTFWAEQRGGDNLMVV 219

Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
           Y LY+IT DP  L L  L  K  F    + L  D+L+  ++   + +  G +   + Y+ 
Sbjct: 220 YWLYNITGDPFLLELGELIHKQTFNWTDVFLNQDHLARQNSLHCVNLAQGFKEPVIYYQQ 279

Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
           + +P         +  +  +  + TG       W   + L     ++  E CT   M+  
Sbjct: 280 SHNPQNLEAVKEAVRKMRHTIGFPTG------LWAGDELLRFGNPTQGSELCTAVEMMFS 333

Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL--------GRG-VSKAR 460
              +   T ++ +AD+ E+   N VL  Q   +     Y   +        GR  VS   
Sbjct: 334 LEKMLEITGDVQWADHLEKVAYN-VLPTQIKDDFSARQYYQQVNQIAITCEGRNFVSPHE 392

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--- 517
            T     + + + CC     + + K    ++F    N  G+  + Y  S    + G+   
Sbjct: 393 DTDIVFGELSGYPCCTSNLHQGWPKFTRHLWFATADN--GIASLIYAPSEVTAQVGNDIT 450

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
           V + +K D    ++  +   L+F SK++       +LR+P W   N    ++NG+ + + 
Sbjct: 451 VKIAEKTD--YPFEEKIDFNLSFPSKKDKKAFFPFHLRIPAW--CNNPVITINGEAVSIA 506

Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
              G  +     W   D + ++LP+ + T    DD         I  GP L +     +W
Sbjct: 507 AHSGEIVRINREWKDGDHIQLELPMRISTSNWYDD------AVVIERGPLLYSLKMDEKW 560

Query: 637 DIKT 640
           + K 
Sbjct: 561 ERKV 564


>gi|300822009|ref|ZP_07102152.1| conserved hypothetical protein [Escherichia coli MS 119-7]
 gi|331679667|ref|ZP_08380337.1| putative cytoplasmic protein [Escherichia coli H591]
 gi|300525372|gb|EFK46441.1| conserved hypothetical protein [Escherichia coli MS 119-7]
 gi|331072839|gb|EGI44164.1| putative cytoplasmic protein [Escherichia coli H591]
          Length = 667

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGVQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 260 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 313

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 370

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 421

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 422 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 477

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 478 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 532

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 533 TLNLTLPMPVR 543


>gi|193068520|ref|ZP_03049482.1| conserved hypothetical protein [Escherichia coli E110019]
 gi|331670421|ref|ZP_08371260.1| putative cytoplasmic protein [Escherichia coli TA271]
 gi|332282156|ref|ZP_08394569.1| conserved hypothetical protein [Shigella sp. D9]
 gi|417222825|ref|ZP_12026265.1| putative glycosyhydrolase [Escherichia coli 96.154]
 gi|417267012|ref|ZP_12054373.1| putative glycosyhydrolase [Escherichia coli 3.3884]
 gi|417604475|ref|ZP_12255039.1| hypothetical protein ECSTEC94C_4306 [Escherichia coli STEC_94C]
 gi|418040528|ref|ZP_12678768.1| hypothetical protein ECW26_09970 [Escherichia coli W26]
 gi|419926997|ref|ZP_14444741.1| hypothetical protein EC5411_02255 [Escherichia coli 541-1]
 gi|423707870|ref|ZP_17682250.1| hypothetical protein ESTG_02341 [Escherichia coli B799]
 gi|432378754|ref|ZP_19621737.1| hypothetical protein WCQ_03650 [Escherichia coli KTE12]
 gi|432482897|ref|ZP_19724846.1| hypothetical protein A15U_04037 [Escherichia coli KTE210]
 gi|432676705|ref|ZP_19912149.1| hypothetical protein A1YU_03256 [Escherichia coli KTE142]
 gi|433200343|ref|ZP_20384227.1| hypothetical protein WGW_03892 [Escherichia coli KTE94]
 gi|192958171|gb|EDV88612.1| conserved hypothetical protein [Escherichia coli E110019]
 gi|331062483|gb|EGI34403.1| putative cytoplasmic protein [Escherichia coli TA271]
 gi|332104508|gb|EGJ07854.1| conserved hypothetical protein [Shigella sp. D9]
 gi|345347843|gb|EGW80147.1| hypothetical protein ECSTEC94C_4306 [Escherichia coli STEC_94C]
 gi|383476508|gb|EID68447.1| hypothetical protein ECW26_09970 [Escherichia coli W26]
 gi|385709502|gb|EIG46500.1| hypothetical protein ESTG_02341 [Escherichia coli B799]
 gi|386202627|gb|EII01618.1| putative glycosyhydrolase [Escherichia coli 96.154]
 gi|386229370|gb|EII56725.1| putative glycosyhydrolase [Escherichia coli 3.3884]
 gi|388408480|gb|EIL68825.1| hypothetical protein EC5411_02255 [Escherichia coli 541-1]
 gi|430896388|gb|ELC18632.1| hypothetical protein WCQ_03650 [Escherichia coli KTE12]
 gi|431003915|gb|ELD19148.1| hypothetical protein A15U_04037 [Escherichia coli KTE210]
 gi|431210613|gb|ELF08667.1| hypothetical protein A1YU_03256 [Escherichia coli KTE142]
 gi|431717675|gb|ELJ81769.1| hypothetical protein WGW_03892 [Escherichia coli KTE94]
          Length = 659

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGVQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|415831195|ref|ZP_11516965.1| hypothetical protein ECOK1357_3952 [Escherichia coli OK1357]
 gi|323182744|gb|EFZ68146.1| hypothetical protein ECOK1357_3952 [Escherichia coli OK1357]
          Length = 659

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGVQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|256838375|ref|ZP_05543885.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|256739294|gb|EEU52618.1| conserved hypothetical protein [Parabacteroides sp. D13]
          Length = 680

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 92/424 (21%), Positives = 170/424 (40%), Gaps = 41/424 (9%)

Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
           ++  +L QY  A N Q  ++ +++  YF  ++ ++    S    W    E+ GG N  V+
Sbjct: 164 VMLKVLQQYYSATNDQ--RVISFLTNYFKYQLSELPK--SPLGKWTFWAEQRGGDNLMVV 219

Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
           Y LY+IT DP  L L  L  K  F    + L  D+L+  ++   + +  G +   + Y+ 
Sbjct: 220 YWLYNITGDPFLLELGELIHKQTFNWTDVFLNQDHLARQNSLHCVNLAQGFKEPVIYYQQ 279

Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
           + +P         +  +  +  + TG       W   + L     ++  E CT   M+  
Sbjct: 280 SHNPQNLEAVKEAVRKMRHTIGFPTG------LWAGDELLRFGNPTQGSELCTAVEMMFS 333

Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL--------GRG-VSKAR 460
              +   T ++ +AD+ E+   N VL  Q   +     Y   +        GR  VS   
Sbjct: 334 LEKMLEITGDVQWADHLEKVAYN-VLPTQIKDDFSARQYYQQVNQIAITCEGRNFVSPHE 392

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--- 517
            T     + + + CC     + + K    ++F    N  G+  + Y  S    + G+   
Sbjct: 393 DTDIVFGELSGYPCCTSNLHQGWPKFTRHLWFATADN--GIASLIYAPSEVTAQVGNDIT 450

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
           V + +K D    ++  +   L+F SK++       +LR+P W   N    ++NG+ + + 
Sbjct: 451 VKIAEKTD--YPFEEKIDFNLSFPSKKDKKAFFPFHLRIPAW--CNNPVITINGEAVSIA 506

Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
              G  +     W   D + ++LP+ + T    DD         I  GP L +     +W
Sbjct: 507 AHSGEIVRINREWKDGDHIQLELPMRISTSNWYDD------AVVIERGPLLYSLKMDEKW 560

Query: 637 DIKT 640
           + K 
Sbjct: 561 ERKV 564


>gi|448238160|ref|YP_007402218.1| hypothetical protein GHH_c19490 [Geobacillus sp. GHH01]
 gi|445207002|gb|AGE22467.1| hypothetical protein GHH_c19490 [Geobacillus sp. GHH01]
          Length = 640

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 64/271 (23%), Positives = 107/271 (39%), Gaps = 36/271 (13%)

Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSA----REFWWDPKRLADTLGSENEETCTTYN 405
           TGD   K       + V     Y TGG  +      F +D     DT+ +E   TC +  
Sbjct: 275 TGDESLKQACQTLWENVTKRQMYITGGVGSSAFGESFTFDFDLPNDTVYAE---TCASIA 331

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-----------GR 454
           ++  +R +     +  YAD  ERAL NG +S     +     Y+ PL            R
Sbjct: 332 LVFWARRMLELEMDGKYADVMERALYNGTIS-GMDLDGKRFFYVNPLEVWPKACERHDKR 390

Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
            V   R       K+ S  CC        + +G  IY +       L++  Y+ S+   +
Sbjct: 391 HVKPVRQ------KWFSCACCPPNLARLIASIGHYIYSQ---TSDALFVHLYVGSNIQTE 441

Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
            G   +    +    WD  +R+T++  S QE     +L LR+P W    GA+ ++NG+N+
Sbjct: 442 IGGRSVEIVQETNYPWDGTVRLTISPESAQEF----TLGLRIPGWC--RGAEVTINGENV 495

Query: 575 PLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
            + P     +      W   D++ +   + +
Sbjct: 496 DIAPLTKKGYAYIRRVWRQGDEMVLHFSMPV 526


>gi|298374270|ref|ZP_06984228.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
 gi|298268638|gb|EFI10293.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
          Length = 680

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 92/424 (21%), Positives = 170/424 (40%), Gaps = 41/424 (9%)

Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
           ++  +L QY  A N Q  ++ +++  YF  ++ ++    S    W    E+ GG N  V+
Sbjct: 164 VMLKVLQQYYSATNDQ--RVISFLTNYFKYQLSELPK--SPLGKWTFWAEQRGGDNLMVV 219

Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
           Y LY+IT DP  L L  L  K  F    + L  D+L+  ++   + +  G +   + Y+ 
Sbjct: 220 YWLYNITGDPFLLELGELIHKQTFNWTDVFLNQDHLARQNSLHCVNLAQGFKEPVIYYQQ 279

Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
           + +P         +  +  +  + TG       W   + L     ++  E CT   M+  
Sbjct: 280 SHNPQNLEAVKEAVRKMRHTIGFPTG------LWAGDELLRFGNPTQGSELCTAVEMMFS 333

Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL--------GRG-VSKAR 460
              +   T ++ +AD+ E+   N VL  Q   +     Y   +        GR  VS   
Sbjct: 334 LEKMLEITGDVQWADHLEKVAYN-VLPTQIKDDFSARQYYQQVNQIAITCEGRNFVSPHE 392

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--- 517
            T     + + + CC     + + K    ++F    N  G+  + Y  S    + G+   
Sbjct: 393 DTDIVFGELSGYPCCTSNLHQGWPKFTRHLWFATADN--GIASLIYAPSEVTAQVGNDIT 450

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
           V + +K D    ++  +   L+F SK++       +LR+P W   N    ++NG+ + + 
Sbjct: 451 VKIAEKTD--YPFEEKIDFNLSFPSKKDKKAFFPFHLRIPAW--CNNPVITINGEAVSIA 506

Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
              G  +     W   D + ++LP+ + T    DD         I  GP L +     +W
Sbjct: 507 AHSGEIVRINREWKDGDHIQLELPMRISTSNWYDD------AVVIERGPLLYSLKMDEKW 560

Query: 637 DIKT 640
           + K 
Sbjct: 561 ERKV 564


>gi|212692449|ref|ZP_03300577.1| hypothetical protein BACDOR_01945 [Bacteroides dorei DSM 17855]
 gi|212665028|gb|EEB25600.1| hypothetical protein BACDOR_01945 [Bacteroides dorei DSM 17855]
          Length = 811

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 95/415 (22%), Positives = 157/415 (37%), Gaps = 77/415 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L  A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---SDGHKLSEY-SQDHKPILQQDEIVGHAVR 275

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    T   + +     + TGG  +R     P+   +  G 
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC +   +  +  +F  T +  YAD  ERAL NGV+S       GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  I  F        +  +GN   +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFVASVPYYMYATQGN--DV 432

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
           Y+  YI S  D ++    +N +      W+  + +++T   +QE     +L +R+P W  
Sbjct: 433 YVNLYIQSKADIETESNKINVEQTTDYPWNGKISISVTPEKEQEF----ALRVRIPGWAQ 488

Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
                    ++++ AQA   S+NG  +       + +    W   D + I LP+ +R   
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548

Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
             D   +     AI  GP +      G+    +          +P+  SF+A L+
Sbjct: 549 ANDQVEDDHGKLAIERGPIMFC--LEGQDQADSTVFNKFIPDGTPMEASFHADLL 601


>gi|373958292|ref|ZP_09618252.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
           18603]
 gi|373894892|gb|EHQ30789.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
           18603]
          Length = 679

 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 95/423 (22%), Positives = 168/423 (39%), Gaps = 52/423 (12%)

Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
           +  +Y  T +PK+L L+ +L D    +          +  +    +  HA     +  G+
Sbjct: 228 VVEMYRTTREPKYLELSKNLIDIRGLMKDGTDDNQDRIPFREQTQALGHAVRANYLYAGA 287

Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATGGTSA----------REFWWDPKRLADT 392
              Y  TGD  L   +   + D+VN    Y TGG  A               D +++   
Sbjct: 288 ADVYAETGDTTLMHTLNLVWNDVVN-RKMYITGGCGAIYDGASPDGTSYLLKDVQQIHQA 346

Query: 393 LGSE--------NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRG--- 440
            G +        + ETC +   +  +  + + T +  YAD  E  L NG+LS I      
Sbjct: 347 YGRDYQLPNFTAHNETCASVGNVLWNWRMLQLTGKAQYADVMELTLYNGMLSGISLNGKK 406

Query: 441 ---TEPGVMIYMLPLGRGVSKARSTH-GWGTKFNSFWCCYGTGIESFSKLGDSIY-FEEE 495
              T P  +   +P  +  SK R  + G+        CC    I + +++G+  Y   ++
Sbjct: 407 FLYTNPLSVSDDMPFQQRWSKDRVDYIGYSD------CCPPNVIRTIAEIGNYAYSISDK 460

Query: 496 GNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
           G    LY    +S+        + L+Q+ D    WD  + + L     +   +  SL LR
Sbjct: 461 GVWVNLYGGNNLSTQLLKDGSKIKLSQQTD--YPWDGKISIALN----EVPAKAFSLFLR 514

Query: 556 MPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
           +P W  S GA  ++NG+ +  +  PG +     +W   DK+ + LP+ ++         E
Sbjct: 515 IPGWCGS-GASVTVNGKAVNTILTPGQYAEINGKWHAGDKIELLLPMPVKMIEANPLVEE 573

Query: 615 YASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFV 674
             +  A+  GP +    ++G    K   + SLS+ I+ +P             +GN+T  
Sbjct: 574 VRNQIAVKRGPVVYCVESAGMPKDKKVFSLSLSSKINLVPQKIVIDNSDIVALNGNATLE 633

Query: 675 MSN 677
            +N
Sbjct: 634 NAN 636


>gi|438041968|ref|ZP_20855782.1| hypothetical protein SEEE5646_18016 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-5646]
 gi|435321796|gb|ELO94162.1| hypothetical protein SEEE5646_18016 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-5646]
          Length = 646

 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 90/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGVQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +   G+  L  ++     W   +++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI- 475

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 476 -DSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|205354717|ref|YP_002228518.1| hypothetical protein SG3751 [Salmonella enterica subsp. enterica
           serovar Gallinarum str. 287/91]
 gi|375125607|ref|ZP_09770771.1| hypothetical protein SG9_3828 [Salmonella enterica subsp. enterica
           serovar Gallinarum str. SG9]
 gi|445130406|ref|ZP_21381321.1| hypothetical protein SEEG9184_014572 [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
 gi|205274498|emb|CAR39532.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Gallinarum str. 287/91]
 gi|326629857|gb|EGE36200.1| hypothetical protein SG9_3828 [Salmonella enterica subsp. enterica
           serovar Gallinarum str. SG9]
 gi|444852215|gb|ELX77297.1| hypothetical protein SEEG9184_014572 [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
          Length = 651

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 90/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGVQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +   G+  L  ++     W   +++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI- 475

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 476 -DSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|207858916|ref|YP_002245567.1| hypothetical protein SEN3501 [Salmonella enterica subsp. enterica
           serovar Enteritidis str. P125109]
 gi|421357264|ref|ZP_15807576.1| hypothetical protein SEEE3139_04408 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|421362069|ref|ZP_15812325.1| hypothetical protein SEEE0166_05558 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|421368596|ref|ZP_15818785.1| hypothetical protein SEEE0631_15493 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|421370704|ref|ZP_15820867.1| hypothetical protein SEEE0424_03357 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|421376619|ref|ZP_15826719.1| hypothetical protein SEEE3076_10306 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|421379882|ref|ZP_15829946.1| hypothetical protein SEEE4917_03813 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|421387196|ref|ZP_15837201.1| hypothetical protein SEEE6622_17936 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|421388833|ref|ZP_15838818.1| hypothetical protein SEEE6670_03439 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|421393233|ref|ZP_15843178.1| hypothetical protein SEEE6426_02862 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|421400876|ref|ZP_15850758.1| hypothetical protein SEEE6437_19177 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|421404698|ref|ZP_15854538.1| hypothetical protein SEEE7246_15620 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|421408356|ref|ZP_15858156.1| hypothetical protein SEEE7250_11351 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|421414364|ref|ZP_15864109.1| hypothetical protein SEEE1427_18818 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|421418252|ref|ZP_15867957.1| hypothetical protein SEEE2659_15744 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|421423488|ref|ZP_15873147.1| hypothetical protein SEEE1757_19367 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|421427667|ref|ZP_15877286.1| hypothetical protein SEEE5101_17714 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|421429796|ref|ZP_15879391.1| hypothetical protein SEEE8B1_05655 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|421437646|ref|ZP_15887162.1| hypothetical protein SEEE5518_21932 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|421438534|ref|ZP_15888029.1| hypothetical protein SEEE1618_03571 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|421443523|ref|ZP_15892964.1| hypothetical protein SEEE3079_05727 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|436605457|ref|ZP_20513395.1| hypothetical protein SEE22704_08488 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|436694238|ref|ZP_20518150.1| hypothetical protein SEE30663_08486 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|436803411|ref|ZP_20525841.1| hypothetical protein SEECHS44_21392 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|436810025|ref|ZP_20529267.1| hypothetical protein SEEE1882_15710 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|436816420|ref|ZP_20533798.1| hypothetical protein SEEE1884_15817 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|436832038|ref|ZP_20536533.1| hypothetical protein SEEE1594_06730 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|436849358|ref|ZP_20540514.1| hypothetical protein SEEE1566_04035 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|436858888|ref|ZP_20547165.1| hypothetical protein SEEE1580_15161 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|436862962|ref|ZP_20549538.1| hypothetical protein SEEE1543_04463 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|436874233|ref|ZP_20556894.1| hypothetical protein SEEE1441_19202 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|436876728|ref|ZP_20558061.1| hypothetical protein SEEE1810_02403 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|436886249|ref|ZP_20562678.1| hypothetical protein SEEE1558_02954 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|436893215|ref|ZP_20567194.1| hypothetical protein SEEE1018_02862 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|436900848|ref|ZP_20571778.1| hypothetical protein SEEE1010_03453 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|436913977|ref|ZP_20579179.1| hypothetical protein SEEE1729_18338 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|436919198|ref|ZP_20582051.1| hypothetical protein SEEE0895_09976 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|436928295|ref|ZP_20587740.1| hypothetical protein SEEE0899_15810 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|436937155|ref|ZP_20592450.1| hypothetical protein SEEE1457_16920 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|436944088|ref|ZP_20596699.1| hypothetical protein SEEE1747_15796 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|436953454|ref|ZP_20601804.1| hypothetical protein SEEE0968_18795 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|436962937|ref|ZP_20605560.1| hypothetical protein SEEE1444_14849 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|436967670|ref|ZP_20607424.1| hypothetical protein SEEE1445_01381 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|436978926|ref|ZP_20612901.1| hypothetical protein SEEE1559_06556 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|436995892|ref|ZP_20619592.1| hypothetical protein SEEE1565_17627 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|437011806|ref|ZP_20624610.1| hypothetical protein SEEE1808_20431 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|437019323|ref|ZP_20627061.1| hypothetical protein SEEE1811_09865 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|437026609|ref|ZP_20629868.1| hypothetical protein SEEE0956_01180 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|437041181|ref|ZP_20635197.1| hypothetical protein SEEE1455_05273 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|437051574|ref|ZP_20641455.1| hypothetical protein SEEE1575_14346 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|437056616|ref|ZP_20644024.1| hypothetical protein SEEE1725_04709 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|437067549|ref|ZP_20650399.1| hypothetical protein SEEE1745_14184 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|437073604|ref|ZP_20653177.1| hypothetical protein SEEE1791_05329 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|437082599|ref|ZP_20658441.1| hypothetical protein SEEE1795_09343 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|437089107|ref|ZP_20661970.1| hypothetical protein SEEE6709_04610 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|437103922|ref|ZP_20666960.1| hypothetical protein SEEE9058_06907 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|437126597|ref|ZP_20674605.1| hypothetical protein SEEE0816_23102 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|437131843|ref|ZP_20677676.1| hypothetical protein SEEE0819_15610 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|437136794|ref|ZP_20680031.1| hypothetical protein SEEE3072_04644 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|437143889|ref|ZP_20684687.1| hypothetical protein SEEE3089_05314 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|437154248|ref|ZP_20690986.1| hypothetical protein SEEE9163_14382 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|437162604|ref|ZP_20696211.1| hypothetical protein SEEE151_18100 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|437166884|ref|ZP_20698338.1| hypothetical protein SEEEN202_06125 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|437178010|ref|ZP_20704356.1| hypothetical protein SEEE3991_14003 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|437183055|ref|ZP_20707414.1| hypothetical protein SEEE3618_06829 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|437198906|ref|ZP_20711454.1| hypothetical protein SEEE1831_04503 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|437262882|ref|ZP_20719212.1| hypothetical protein SEEE2490_17657 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|437271416|ref|ZP_20723680.1| hypothetical protein SEEEL909_17675 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|437275478|ref|ZP_20725823.1| hypothetical protein SEEEL913_05569 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|437291505|ref|ZP_20731569.1| hypothetical protein SEEE4941_12100 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|437304204|ref|ZP_20733917.1| hypothetical protein SEEE7015_01180 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|437324305|ref|ZP_20739563.1| hypothetical protein SEEE7927_06991 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|437339496|ref|ZP_20744149.1| hypothetical protein SEEECHS4_07410 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|437430625|ref|ZP_20755828.1| hypothetical protein SEEE2217_21365 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|437447211|ref|ZP_20758929.1| hypothetical protein SEEE4018_14080 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|437464509|ref|ZP_20763586.1| hypothetical protein SEEE6211_14683 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|437474444|ref|ZP_20766236.1| hypothetical protein SEEE4441_05197 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|437490700|ref|ZP_20771023.1| hypothetical protein SEEE4647_06743 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|437518116|ref|ZP_20778521.1| hypothetical protein SEEE9845_22471 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|437563498|ref|ZP_20786805.1| hypothetical protein SEEE0116_18550 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|437572857|ref|ZP_20789281.1| hypothetical protein SEEE1117_07915 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|437593902|ref|ZP_20795526.1| hypothetical protein SEEE1392_17150 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|437607245|ref|ZP_20800160.1| hypothetical protein SEEE0268_17650 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|437617397|ref|ZP_20802955.1| hypothetical protein SEEE0316_08756 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|437653610|ref|ZP_20810238.1| hypothetical protein SEEE0436_23184 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|437661278|ref|ZP_20812888.1| hypothetical protein SEEE1319_12828 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|437677654|ref|ZP_20817320.1| hypothetical protein SEEE4481_12502 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|437691966|ref|ZP_20820894.1| hypothetical protein SEEE6297_07188 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|437707522|ref|ZP_20825711.1| hypothetical protein SEEE4220_08863 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|437725054|ref|ZP_20829741.1| hypothetical protein SEEE1616_05992 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 16-16]
 gi|437789741|ref|ZP_20837126.1| hypothetical protein SEEE2651_21208 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 76-2651]
 gi|437814063|ref|ZP_20842185.1| hypothetical protein SEEE3944_22067 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|437862553|ref|ZP_20847967.1| hypothetical protein SEEE5621_05022 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 6.0562-1]
 gi|438086893|ref|ZP_20859191.1| hypothetical protein SEEE2625_08240 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|438102729|ref|ZP_20865150.1| hypothetical protein SEEE1976_15564 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|438113496|ref|ZP_20869671.1| hypothetical protein SEEE3407_15758 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|445168673|ref|ZP_21394919.1| hypothetical protein SEE8A_021969 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|445186279|ref|ZP_21399191.1| hypothetical protein SE20037_06257 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|445231881|ref|ZP_21405859.1| hypothetical protein SEE10_020946 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|445237706|ref|ZP_21407161.1| hypothetical protein SEE436_026260 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 436]
 gi|445333559|ref|ZP_21414841.1| hypothetical protein SEE18569_008126 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|445345844|ref|ZP_21418446.1| hypothetical protein SEE13_018274 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|445356148|ref|ZP_21421740.1| hypothetical protein SEE23_007297 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
 gi|206710719|emb|CAR35080.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Enteritidis str. P125109]
 gi|395984836|gb|EJH94014.1| hypothetical protein SEEE0631_15493 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|395991902|gb|EJI01024.1| hypothetical protein SEEE0166_05558 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|395992120|gb|EJI01241.1| hypothetical protein SEEE3139_04408 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|396001983|gb|EJI10994.1| hypothetical protein SEEE3076_10306 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|396004947|gb|EJI13927.1| hypothetical protein SEEE4917_03813 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|396005988|gb|EJI14959.1| hypothetical protein SEEE0424_03357 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|396010336|gb|EJI19249.1| hypothetical protein SEEE6622_17936 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|396017969|gb|EJI26832.1| hypothetical protein SEEE6426_02862 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|396018877|gb|EJI27737.1| hypothetical protein SEEE6670_03439 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|396022763|gb|EJI31575.1| hypothetical protein SEEE6437_19177 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|396025631|gb|EJI34407.1| hypothetical protein SEEE7246_15620 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|396028864|gb|EJI37623.1| hypothetical protein SEEE7250_11351 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|396036970|gb|EJI45625.1| hypothetical protein SEEE1427_18818 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|396037577|gb|EJI46226.1| hypothetical protein SEEE1757_19367 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|396038879|gb|EJI47511.1| hypothetical protein SEEE2659_15744 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|396049784|gb|EJI58322.1| hypothetical protein SEEE5518_21932 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|396050924|gb|EJI59443.1| hypothetical protein SEEE5101_17714 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|396058175|gb|EJI66643.1| hypothetical protein SEEE8B1_05655 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|396070205|gb|EJI78534.1| hypothetical protein SEEE3079_05727 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|396072341|gb|EJI80651.1| hypothetical protein SEEE1618_03571 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|434956555|gb|ELL50284.1| hypothetical protein SEECHS44_21392 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|434966085|gb|ELL58983.1| hypothetical protein SEEE1882_15710 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|434972090|gb|ELL64574.1| hypothetical protein SEE22704_08488 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|434972217|gb|ELL64683.1| hypothetical protein SEEE1884_15817 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|434981889|gb|ELL73751.1| hypothetical protein SEEE1594_06730 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|434987983|gb|ELL79584.1| hypothetical protein SEEE1580_15161 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|434988731|gb|ELL80315.1| hypothetical protein SEEE1566_04035 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|434997520|gb|ELL88761.1| hypothetical protein SEEE1441_19202 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|434998217|gb|ELL89439.1| hypothetical protein SEEE1543_04463 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|435000158|gb|ELL91309.1| hypothetical protein SEE30663_08486 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|435010814|gb|ELM01577.1| hypothetical protein SEEE1810_02403 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|435012005|gb|ELM02695.1| hypothetical protein SEEE1558_02954 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|435018866|gb|ELM09311.1| hypothetical protein SEEE1018_02862 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|435022069|gb|ELM12420.1| hypothetical protein SEEE1010_03453 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|435023777|gb|ELM14017.1| hypothetical protein SEEE1729_18338 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|435030256|gb|ELM20297.1| hypothetical protein SEEE0895_09976 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|435034856|gb|ELM24713.1| hypothetical protein SEEE0899_15810 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|435036430|gb|ELM26251.1| hypothetical protein SEEE1457_16920 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|435040717|gb|ELM30470.1| hypothetical protein SEEE1747_15796 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|435048135|gb|ELM37702.1| hypothetical protein SEEE0968_18795 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|435049092|gb|ELM38627.1| hypothetical protein SEEE1444_14849 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|435060990|gb|ELM50227.1| hypothetical protein SEEE1445_01381 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|435062727|gb|ELM51908.1| hypothetical protein SEEE1565_17627 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|435064420|gb|ELM53549.1| hypothetical protein SEEE1808_20431 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|435069121|gb|ELM58130.1| hypothetical protein SEEE1559_06556 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|435080300|gb|ELM68982.1| hypothetical protein SEEE1811_09865 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|435086361|gb|ELM74900.1| hypothetical protein SEEE0956_01180 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|435086388|gb|ELM74926.1| hypothetical protein SEEE1455_05273 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|435092283|gb|ELM80650.1| hypothetical protein SEEE1575_14346 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|435095779|gb|ELM84062.1| hypothetical protein SEEE1745_14184 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|435097290|gb|ELM85551.1| hypothetical protein SEEE1725_04709 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|435108390|gb|ELM96357.1| hypothetical protein SEEE1791_05329 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|435109351|gb|ELM97304.1| hypothetical protein SEEE1795_09343 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|435115756|gb|ELN03511.1| hypothetical protein SEEE0816_23102 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|435115924|gb|ELN03677.1| hypothetical protein SEEE6709_04610 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|435121957|gb|ELN09480.1| hypothetical protein SEEE9058_06907 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|435123743|gb|ELN11235.1| hypothetical protein SEEE0819_15610 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|435136035|gb|ELN23136.1| hypothetical protein SEEE3072_04644 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|435139610|gb|ELN26601.1| hypothetical protein SEEE3089_05314 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|435139761|gb|ELN26742.1| hypothetical protein SEEE9163_14382 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|435143085|gb|ELN29964.1| hypothetical protein SEEE151_18100 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|435152694|gb|ELN39323.1| hypothetical protein SEEEN202_06125 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|435153800|gb|ELN40397.1| hypothetical protein SEEE3991_14003 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|435161457|gb|ELN47685.1| hypothetical protein SEEE2490_17657 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|435162986|gb|ELN49124.1| hypothetical protein SEEE3618_06829 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|435169890|gb|ELN55648.1| hypothetical protein SEEEL909_17675 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|435174737|gb|ELN60178.1| hypothetical protein SEEEL913_05569 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|435181699|gb|ELN66752.1| hypothetical protein SEEE4941_12100 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|435188330|gb|ELN73047.1| hypothetical protein SEEE7015_01180 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|435194134|gb|ELN78592.1| hypothetical protein SEEE7927_06991 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|435195768|gb|ELN80158.1| hypothetical protein SEEECHS4_07410 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|435199033|gb|ELN83153.1| hypothetical protein SEEE2217_21365 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|435209540|gb|ELN92853.1| hypothetical protein SEEE4018_14080 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|435217080|gb|ELN99522.1| hypothetical protein SEEE6211_14683 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|435220781|gb|ELO03061.1| hypothetical protein SEEE1831_04503 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|435224213|gb|ELO06185.1| hypothetical protein SEEE4441_05197 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|435228101|gb|ELO09552.1| hypothetical protein SEEE9845_22471 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|435229852|gb|ELO11187.1| hypothetical protein SEEE4647_06743 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|435237063|gb|ELO17777.1| hypothetical protein SEEE0116_18550 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|435247221|gb|ELO27192.1| hypothetical protein SEEE1117_07915 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|435251581|gb|ELO31186.1| hypothetical protein SEEE1392_17150 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|435253937|gb|ELO33352.1| hypothetical protein SEEE0268_17650 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|435260557|gb|ELO39749.1| hypothetical protein SEEE0316_08756 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|435264830|gb|ELO43722.1| hypothetical protein SEEE0436_23184 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|435268721|gb|ELO47301.1| hypothetical protein SEEE1319_12828 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|435274894|gb|ELO52988.1| hypothetical protein SEEE4481_12502 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|435280067|gb|ELO57793.1| hypothetical protein SEEE6297_07188 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|435290984|gb|ELO67872.1| hypothetical protein SEEE4220_08863 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|435293025|gb|ELO69762.1| hypothetical protein SEEE1616_05992 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 16-16]
 gi|435295196|gb|ELO71717.1| hypothetical protein SEEE2651_21208 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 76-2651]
 gi|435295991|gb|ELO72414.1| hypothetical protein SEEE3944_22067 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|435318636|gb|ELO91560.1| hypothetical protein SEEE2625_08240 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|435323736|gb|ELO95733.1| hypothetical protein SEEE1976_15564 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|435329624|gb|ELP01026.1| hypothetical protein SEEE3407_15758 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|435336306|gb|ELP06273.1| hypothetical protein SEEE5621_05022 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 6.0562-1]
 gi|444862919|gb|ELX87757.1| hypothetical protein SEE10_020946 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|444864401|gb|ELX89201.1| hypothetical protein SEE8A_021969 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|444869705|gb|ELX94276.1| hypothetical protein SE20037_06257 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|444875839|gb|ELY00033.1| hypothetical protein SEE18569_008126 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|444878778|gb|ELY02892.1| hypothetical protein SEE13_018274 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|444887218|gb|ELY10942.1| hypothetical protein SEE23_007297 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
 gi|444891559|gb|ELY14803.1| hypothetical protein SEE436_026260 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 436]
          Length = 651

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 90/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGVQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +   G+  L  ++     W   +++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI- 475

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 476 -DSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|198242542|ref|YP_002217640.1| hypothetical protein SeD_A4064 [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|375121158|ref|ZP_09766325.1| Six-hairpin glycosidase-like domain protein [Salmonella enterica
           subsp. enterica serovar Dublin str. SD3246]
 gi|445143487|ref|ZP_21386535.1| hypothetical protein SEEDSL_010877 [Salmonella enterica subsp.
           enterica serovar Dublin str. SL1438]
 gi|445149123|ref|ZP_21388948.1| hypothetical protein SEEDHWS_023670 [Salmonella enterica subsp.
           enterica serovar Dublin str. HWS51]
 gi|197937058|gb|ACH74391.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|326625425|gb|EGE31770.1| Six-hairpin glycosidase-like domain protein [Salmonella enterica
           subsp. enterica serovar Dublin str. SD3246]
 gi|444848141|gb|ELX73271.1| hypothetical protein SEEDSL_010877 [Salmonella enterica subsp.
           enterica serovar Dublin str. SL1438]
 gi|444858418|gb|ELX83404.1| hypothetical protein SEEDHWS_023670 [Salmonella enterica subsp.
           enterica serovar Dublin str. HWS51]
          Length = 651

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 90/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGVQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +   G+  L  ++     W   +++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI- 475

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 476 -DSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|378580796|ref|ZP_09829449.1| secreted protein [Pantoea stewartii subsp. stewartii DC283]
 gi|377816535|gb|EHT99637.1| secreted protein [Pantoea stewartii subsp. stewartii DC283]
          Length = 651

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 63/251 (25%), Positives = 95/251 (37%), Gaps = 39/251 (15%)

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H     FN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLPFNHIYDHVKPVRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + LG  IY   E     L+I  YI +  +   G+  L  ++   + W   
Sbjct: 414 CCPPNIARLLTSLGHYIYTPRED---ALFINLYIGNRVEIPVGNQTLGLRISGNLPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
             +T+T  S Q V    +L LR+P W  S   Q + NG  +       +L     W   D
Sbjct: 470 -TVTITIDSTQPVNH--ALALRLPDWCAS--PQITCNGTEVNEAARKGYLYLNRHWQEGD 524

Query: 594 KLTIQLPLSLR 604
            +T+ LP+ +R
Sbjct: 525 TVTLTLPMPVR 535


>gi|209551193|ref|YP_002283110.1| hypothetical protein Rleg2_3619 [Rhizobium leguminosarum bv.
           trifolii WSM2304]
 gi|209536949|gb|ACI56884.1| protein of unknown function DUF1680 [Rhizobium leguminosarum bv.
           trifolii WSM2304]
          Length = 640

 Score = 56.2 bits (134), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 90/396 (22%), Positives = 155/396 (39%), Gaps = 61/396 (15%)

Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
            L +L  +T + K+L L+  F      +   F    A      + FH  T      H P+
Sbjct: 198 ALVKLARVTGEKKYLELSKFFIDERGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 258 REQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +     +  YAD  E+AL NG L    
Sbjct: 317 NEGFTDYYDLPN--ATAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
            T+     Y  PL      A   H W  K++   CC        + +G  +Y   +  + 
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426

Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            +++    ++     +G  V L Q  +    WD      +TF+++ +     +L+LR+P 
Sbjct: 427 AVHLYGESTARLKLANGAEVELQQTTN--YPWDG----AVTFATRLKAPAKFALSLRIPD 480

Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
           W  + GA  S+NG+ L L       +     +W+  D++ + LPLSLR +       + A
Sbjct: 481 W--AEGATLSVNGEMLDLAANIRDGYARIDRQWTDGDRVALSLPLSLRPQYANPKVRQDA 538

Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
              A++ GP +    T       T     L+A++ P
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGEDLNAIVLP 567


>gi|340346785|ref|ZP_08669904.1| hypothetical protein HMPREF9136_0902 [Prevotella dentalis DSM 3688]
 gi|433652020|ref|YP_007278399.1| hypothetical protein Prede_1032 [Prevotella dentalis DSM 3688]
 gi|339611002|gb|EGQ15842.1| hypothetical protein HMPREF9136_0902 [Prevotella dentalis DSM 3688]
 gi|433302553|gb|AGB28369.1| hypothetical protein Prede_1032 [Prevotella dentalis DSM 3688]
          Length = 663

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 74/327 (22%), Positives = 131/327 (40%), Gaps = 42/327 (12%)

Query: 291 DVLYRLYSITHDPKHLLLAH-------------LFDKPCFLGFLALQADYLS-HFHANTH 336
           D + RLY+IT   ++L  A               F +   +    L  D L  + HA+T 
Sbjct: 229 DPIARLYTITGKKRYLDWAKWVVGNIDKWSGWDAFSRLDSIADGKLGVDQLQPYVHAHTF 288

Query: 337 IPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
               +G    Y++TGD  L + +   + DI      Y TGG S  E +   K     L  
Sbjct: 289 QMNFMGFLRLYQITGDRSLLRKVEGAWNDIYR-RQMYITGGVSVAEHY--EKGYVKPLSG 345

Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
              ETC T + +++++ L   T +  YAD  E+ + N V + Q         +  P   G
Sbjct: 346 NIIETCATMSWMQLTQMLLELTGDTKYADAIEKIMLNHVFAAQDALSGTCRYHTAP--NG 403

Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
                  HG         CC  +G    S L  + ++ E+G     YI Q + +++  K+
Sbjct: 404 FKPDGYFHGPD-------CCTASGHRIISLL-PTFFYAEKGK--SFYINQLLPANYRGKA 453

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
                   +D  +S +  +  ++     +  G  + L +R+P W   +    ++NG+   
Sbjct: 454 --------IDFNISGNYPVSDSVVIDVNRMQG--NKLFIRVPAWC--DNPSITVNGKPQG 501

Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLS 602
               G +    ++WS  D++ + LP+ 
Sbjct: 502 NVAAGKYYVVNKKWSKGDRIVMHLPMK 528


>gi|283787780|ref|YP_003367645.1| hypothetical protein ROD_42311 [Citrobacter rodentium ICC168]
 gi|282951234|emb|CBG90928.1| conserved hypothetical protein [Citrobacter rodentium ICC168]
          Length = 651

 Score = 55.8 bits (133), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 82/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P++L LA+ F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYLALANYFVEQRGTQPHFYDQEYEKRGKTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H P+      IG  +R+      +Y + G   +  +N   S                 
Sbjct: 252 QAHQPLAEQQTAIGHAVRF------VYLMTGVAHLARLNNDESKRQDCLRLWRNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASVGLMMFARRMLEMEADSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H     FN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLSFNHIYDHVKPVRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  IY         LYI  Y+ +S +       L  ++     W  +
Sbjct: 414 CCPPNIARVLTSIGHYIYTPRP---EALYINLYVGNSMELPLAGGTLRLRISGDYPW--H 468

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q +    +L LR+P W     A+ +LNG+ +       ++  T  W   D
Sbjct: 469 EQVTIAVDSPQSIHH--TLALRLPDWCPQ--AKVALNGEEVAQDIRKGYIHITRSWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLRLTLPMPVR 535


>gi|161616753|ref|YP_001590718.1| hypothetical protein SPAB_04572 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|161366117|gb|ABX69885.1| hypothetical protein SPAB_04572 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
          Length = 651

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 143/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P++++LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMMLASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG ++       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLDVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|373456252|ref|ZP_09548019.1| protein of unknown function DUF1680 [Caldithrix abyssi DSM 13497]
 gi|371717916|gb|EHO39687.1| protein of unknown function DUF1680 [Caldithrix abyssi DSM 13497]
          Length = 676

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 121/584 (20%), Positives = 211/584 (36%), Gaps = 92/584 (15%)

Query: 185 IKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSF---------EALKPVWAPYYTIH 233
           IK+    + + L+  Q     GY    P  T +FD+          E +K  W P    H
Sbjct: 119 IKKAKKWIEYILTHQQE---DGYFGPLPDSTRVFDNTKWGRRQAWQEKVKQDWWP----H 171

Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DV 292
            I+  ++  Y  A   Q  ++  +M  YF  +++ +        +W    +  GG N   
Sbjct: 172 MIVLKVMQTYYEA--TQDERVLDFMRRYFQYQMKNIKE--KPLDYWTHWAKSRGGENLAS 227

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA--DYLSHFHA------NTH-IPIVIGS 343
           +Y LY+ T D      A L D    LG +  +   D+   F +      N H +   +G 
Sbjct: 228 IYWLYNHTGD------AFLLD----LGKIIFEQTLDWTQRFESANPQDWNWHGVNTAMGI 277

Query: 344 Q---MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEET 400
           +   + Y+ + D  Y       ++ +   H    G  +A E       LA        E+
Sbjct: 278 KQPGVWYQYSKDERYLKAVKTGIEKLMKHHGQVYGLWAADEL------LAGKDPVRGTES 331

Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
           CT    +     + + + +  Y D  ER   N + +  +        Y L     V   R
Sbjct: 332 CTVVEYMFSLETMLQISGDAEYGDILERVALNALPAFLKPGHTARQYYQL--ANQVICDR 389

Query: 461 STHGWGTKFNS----------FWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
             H + TK             + CC     + + K   ++++  + N  GL  + Y  S 
Sbjct: 390 GWHNFSTKHGETELLFGLETGYGCCTANYHQGWPKYVMNLWYATQDN--GLAALVYAPSE 447

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
               +  V  N +V  +   D   +  + F  K+  G     +LR+P W   + A   +N
Sbjct: 448 V---TARVADNVEVTFVEETDYPFKERIKFICKKSNGVAFPFHLRIPEW--CDNAVVFVN 502

Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
           G+    P  G+    T RW   D L + LP+ +R          +    A+  GP + A 
Sbjct: 503 GKVYGKPQAGSITKVTRRWKKGDVLELYLPMKIRISYW------FQRSAAVERGPLVFAL 556

Query: 631 HTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN---SNQSITMEEF 687
             + EW    G        + P  P +N  L+    +  ++TF++      NQ  T++  
Sbjct: 557 GLNEEWKKIGGKEPYADYEVLPKDP-WNYGLLRNYVDHPDTTFIVKEFTVKNQPWTLKNA 615

Query: 688 PVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFP 731
           PV           ++I K   +  +     + G  +   PF +P
Sbjct: 616 PV-----------KIIAKAKKIPEWKLYGGITG-PIPYSPFWYP 647


>gi|168260569|ref|ZP_02682542.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Hadar str. RI_05P066]
 gi|205350487|gb|EDZ37118.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Hadar str. RI_05P066]
          Length = 651

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY         LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +   G+  L  ++     W  + ++ +   S Q V    +L LR+P W     
Sbjct: 442 MYVGNSMEIPVGNGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 556 PLVYCLEQADNGE 568


>gi|168818493|ref|ZP_02830493.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Weltevreden str. HI_N05-537]
 gi|409247363|ref|YP_006888062.1| hypothetical protein SENTW_3778 [Salmonella enterica subsp.
           enterica serovar Weltevreden str. 2007-60-3289-1]
 gi|205344524|gb|EDZ31288.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Weltevreden str. HI_N05-537]
 gi|320088097|emb|CBY97859.1| hypothetical protein SENTW_3778 [Salmonella enterica subsp.
           enterica serovar Weltevreden str. 2007-60-3289-1]
          Length = 651

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P++++LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMMLASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWC--PAAKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|378957466|ref|YP_005214953.1| hypothetical protein SPUL_3886 [Salmonella enterica subsp. enterica
           serovar Gallinarum/pullorum str. RKS5078]
 gi|438120755|ref|ZP_20872004.1| hypothetical protein SEEP9120_00215 [Salmonella enterica subsp.
           enterica serovar Pullorum str. ATCC 9120]
 gi|357208077|gb|AET56123.1| hypothetical protein SPUL_3886 [Salmonella enterica subsp. enterica
           serovar Gallinarum/pullorum str. RKS5078]
 gi|434943466|gb|ELL49584.1| hypothetical protein SEEP9120_00215 [Salmonella enterica subsp.
           enterica serovar Pullorum str. ATCC 9120]
          Length = 651

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY         LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +   G+  L  ++     W   +++ +   S Q V    +L LR+P W     
Sbjct: 442 MYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI--DSVQPVRH--TLALRLPDWCPE-- 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 556 PLVYCLEQADNGE 568


>gi|417369073|ref|ZP_12140391.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Hvittingfoss str. A4-620]
 gi|353585087|gb|EHC45022.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Hvittingfoss str. A4-620]
          Length = 651

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY         LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +   G+  L  ++     W   +++ +   S Q V    +L LR+P W     
Sbjct: 442 MYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI--DSVQPVRH--TLALRLPDWCPE-- 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 556 PLVYCLEQADNGE 568


>gi|197261863|ref|ZP_03161937.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
 gi|197240118|gb|EDY22738.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
          Length = 651

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY         LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +   G+  L  ++     W   +++ +   S Q V    +L LR+P W     
Sbjct: 442 MYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI--DSVQPVRH--TLALRLPDWCPE-- 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 556 PLVYCLEQADNGE 568


>gi|437530472|ref|ZP_20780573.1| hypothetical protein SEEE9317_09781, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 648899 3-17]
 gi|435244046|gb|ELO24278.1| hypothetical protein SEEE9317_09781, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 648899 3-17]
          Length = 349

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 32  ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 84

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY         LYI 
Sbjct: 85  --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTPR---ADALYIN 139

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +   G+  L  ++     W   +++ +   S Q V    +L LR+P W     
Sbjct: 140 MYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI--DSVQPVRH--TLALRLPDWCPE-- 193

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 194 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 253

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 254 PLVYCLEQADNGE 266


>gi|440285639|ref|YP_007338404.1| hypothetical protein D782_0140 [Enterobacteriaceae bacterium strain
           FGI 57]
 gi|440045161|gb|AGB76219.1| hypothetical protein D782_0140 [Enterobacteriaceae bacterium strain
           FGI 57]
          Length = 652

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 81/364 (22%), Positives = 131/364 (35%), Gaps = 73/364 (20%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
           L RLY +T +P++  L   F      +P F      +    S++H             + 
Sbjct: 193 LMRLYDVTQEPRYQQLVRYFVEERGKQPHFYDIEYEKRGKTSYWHTYGPAWMVKDKAYSQ 252

Query: 335 THIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG-- 376
            H PI      IG  +R  Y +TG         D   +       + +     Y TGG  
Sbjct: 253 AHQPIAEQPKAIGHAVRFVYLMTGVAHLARLSHDEGKRQDCLRLWNNMAQRQLYITGGIG 312

Query: 377 --TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
             +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N V
Sbjct: 313 SQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNTV 369

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGI 480
           L      +     Y+ PL          H     FN  +              CC     
Sbjct: 370 LG-GMALDGKHFFYVNPL--------EVHPKSLNFNHIYDHVKPVRQRWFGCACCPPNIA 420

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
              + +G  IY   +     LY+  Y+ +S +   G+  L   +     W   +++T+  
Sbjct: 421 RVLTSIGHYIYTPRD---EALYVNLYVGNSVEIPVGNETLRLTISGNYPWQEQIKITIDS 477

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
            S  +     +L LR+P W  +   +  LNG          +L  + RW   D LT+ LP
Sbjct: 478 PSPVQ----HTLALRLPDWCVN--PRVILNGDAAEGTVEKGYLHLSRRWQEGDTLTLTLP 531

Query: 601 LSLR 604
           + +R
Sbjct: 532 MPIR 535


>gi|395228933|ref|ZP_10407251.1| cytoplasmic protein [Citrobacter sp. A1]
 gi|424732388|ref|ZP_18160966.1| sugar (glycoside-pentoside-hexuronide) transporter [Citrobacter sp.
           L17]
 gi|394717639|gb|EJF23323.1| cytoplasmic protein [Citrobacter sp. A1]
 gi|422893047|gb|EKU32896.1| sugar (glycoside-pentoside-hexuronide) transporter [Citrobacter sp.
           L17]
          Length = 651

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 60/243 (24%), Positives = 97/243 (39%), Gaps = 23/243 (9%)

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIE 481
           RAL N VL      +     Y+ PL       +  H +         W    CC      
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPLEVHPKSLKLNHIYDHVKPVRQRWFGCACCPPNIAR 421

Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFS 541
             + +G  IY   +     LYI  Y+ +S +    +  L  ++     W  + ++ +T  
Sbjct: 422 VLTSIGHYIYTPRQD---ALYINMYVGNSMEVPVVNGSLKLRISGDYPW--HEQVKITIE 476

Query: 542 SKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
           S Q V    +L LR+P W   +  Q  LNGQ +       +L  +  W   D L++ LP+
Sbjct: 477 SPQSV--YHTLALRLPDWC--SAPQVLLNGQPIEQDIRKGYLHISRTWQEGDTLSLTLPM 532

Query: 602 SLR 604
            +R
Sbjct: 533 PVR 535


>gi|254163510|ref|YP_003046618.1| hypothetical protein ECB_03438 [Escherichia coli B str. REL606]
 gi|253975411|gb|ACT41082.1| conserved hypothetical protein [Escherichia coli B str. REL606]
          Length = 659

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 105/499 (21%), Positives = 181/499 (36%), Gaps = 97/499 (19%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
           V  +L A A       +A +++    V+  ++  Q +   GYL+ + T      +A +  
Sbjct: 74  VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNTYFT-----VKAPEER 126

Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
           W+     H++     L++  V    A   +    +V    + + +V      + H Y  +
Sbjct: 127 WSNLAECHELYCAGHLIEAEVAFFQATGKRRLLEVVCRLADHIDRVFGPDESKLHGYPGH 186

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
            E   +   L RLY +T +P++L L + F      +P +      +    SH+H      
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243

Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
                  +  H+ +      IG  +R+      +Y + G   +  ++   S         
Sbjct: 244 MVKDKAYSQAHLSLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLW 297

Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
                   Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354

Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
             YAD  ERAL N VL      +     Y+ PL          H    KFN  +      
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPI 405

Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
                   CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V 
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
               W    ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 516

Query: 586 TERWSYNDKLTIQLPLSLR 604
           T  W   D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535


>gi|424897290|ref|ZP_18320864.1| hypothetical protein Rleg4DRAFT_3241 [Rhizobium leguminosarum bv.
           trifolii WSM2297]
 gi|393181517|gb|EJC81556.1| hypothetical protein Rleg4DRAFT_3241 [Rhizobium leguminosarum bv.
           trifolii WSM2297]
          Length = 640

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 90/395 (22%), Positives = 153/395 (38%), Gaps = 59/395 (14%)

Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
            L +L  +T + K+L L+  F      +   F    A      + FH  T      H P+
Sbjct: 198 ALVKLARVTGEKKYLDLSKFFIDERGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 258 REQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +     +  YAD  E+AL NG L    
Sbjct: 317 NEGFTDYYDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
            T+     Y  PL      A   H W  K++   CC        + +G  +Y   +  + 
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            +++    ++     +G  V  Q+V     WD      + F+++ E     +L+LR+P W
Sbjct: 427 AVHLYGESTTRLKLANGAEVELQQVTNY-PWD----GAVAFTTRLEKPARFALSLRIPDW 481

Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
             + GA  S+NG+ L L       +     +W+  D + + LPLSLR +       + A 
Sbjct: 482 --AEGATLSVNGEKLDLAATMRDGYARIDRQWADGDSVALHLPLSLRPQYANPKVRQDAG 539

Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
             A++ GP +    T       T     L+A++ P
Sbjct: 540 RVALMRGPLVYCVET-------TDNGADLNAIVLP 567


>gi|417514299|ref|ZP_12178139.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Senftenberg str. A4-543]
 gi|353634280|gb|EHC80885.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Senftenberg str. A4-543]
          Length = 651

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P++++LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMMLASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|167537610|ref|XP_001750473.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771013|gb|EDQ84687.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2823

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/172 (30%), Positives = 72/172 (41%), Gaps = 21/172 (12%)

Query: 98  FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
           F  EV   +V L   SVL RA   N+ YLL    D L++ FR     P P     GW+  
Sbjct: 93  FQVEVPTSNVTLTPGSVLRRAFDANIIYLLGHPTDDLLYFFRLRNGNPNPPGQCWGWD-- 150

Query: 158 ISELRGHFVGHYLSASAQM--WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL 215
            + LRG   G +L  S  +  W    NAT++ +M  VV  +   Q +   GY   F    
Sbjct: 151 -ANLRGSLAGEFLMGSGGISRWPMA-NATLRARMDEVVAGI--LQEQEADGYAMGF---- 202

Query: 216 FDSFEALKPVWA---PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYN 264
                A    W    P Y    +  GLL +  +A N QAL +    + +F N
Sbjct: 203 -----ARNETWTHENPDYVTSWVTHGLL-EAAIAGNEQALPLIRRHLNWFNN 248


>gi|417521365|ref|ZP_12183078.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Uganda str. R8-3404]
 gi|353641628|gb|EHC86306.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Uganda str. R8-3404]
          Length = 651

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P++++LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMMLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|424886647|ref|ZP_18310255.1| hypothetical protein Rleg10DRAFT_4682 [Rhizobium leguminosarum bv.
           trifolii WSM2012]
 gi|393175998|gb|EJC76040.1| hypothetical protein Rleg10DRAFT_4682 [Rhizobium leguminosarum bv.
           trifolii WSM2012]
          Length = 640

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 90/395 (22%), Positives = 155/395 (39%), Gaps = 59/395 (14%)

Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
            L +L  +T + K+L L+  F      +   F    A      + FH  T      H P+
Sbjct: 198 ALVKLARVTAEKKYLDLSKFFIDERGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 258 RQQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +     +  YAD  E+AL NG L    
Sbjct: 317 NEGFTDYYDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
            T+     Y  PL      A   H W  K++   CC        + +G  +Y   +  + 
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            +++    ++     +G  V  Q+V     WD      + F++K +     +L+LR+P W
Sbjct: 427 AVHLYGESTARLKLANGAEVELQQVTNY-PWD----GAVAFATKLKTPARFALSLRIPDW 481

Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
             + GA  S+NG+ L L       +     +W+  D++ + LPLSLR +       + A 
Sbjct: 482 --AEGATLSVNGERLDLGATMRDGYARLDRQWADGDRVDLFLPLSLRPQYANPKVRQDAG 539

Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
             A++ GP +    T       T   + L+A++ P
Sbjct: 540 RVALMRGPLVYCVET-------TDNGQDLNAIVLP 567


>gi|416529897|ref|ZP_11744588.1| hypothetical protein SEEM010_12041 [Salmonella enterica subsp.
           enterica serovar Montevideo str. LQC 10]
 gi|416538915|ref|ZP_11749679.1| hypothetical protein SEEM030_12964 [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB30]
 gi|416553241|ref|ZP_11757602.1| hypothetical protein SEEM29N_15108 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 29N]
 gi|417470705|ref|ZP_12166835.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. S5-403]
 gi|353624652|gb|EHC73633.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. S5-403]
 gi|363551713|gb|EHL36026.1| hypothetical protein SEEM010_12041 [Salmonella enterica subsp.
           enterica serovar Montevideo str. LQC 10]
 gi|363561277|gb|EHL45405.1| hypothetical protein SEEM030_12964 [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB30]
 gi|363563119|gb|EHL47199.1| hypothetical protein SEEM29N_15108 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 29N]
          Length = 651

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIEQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSCDYDLPNDSIYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|194435948|ref|ZP_03068051.1| conserved hypothetical protein [Escherichia coli 101-1]
 gi|253771579|ref|YP_003034410.1| hypothetical protein ECBD_0148 [Escherichia coli
           'BL21-Gold(DE3)pLysS AG']
 gi|254290260|ref|YP_003056008.1| hypothetical protein ECD_03438 [Escherichia coli BL21(DE3)]
 gi|422788952|ref|ZP_16841686.1| hypothetical protein ERGG_04098 [Escherichia coli H489]
 gi|442600526|ref|ZP_21018201.1| Putative glycosyl hydrolase of unknown function (DUF1680)
           [Escherichia coli O5:K4(L):H4 str. ATCC 23502]
 gi|194425491|gb|EDX41475.1| conserved hypothetical protein [Escherichia coli 101-1]
 gi|253322623|gb|ACT27225.1| protein of unknown function DUF1680 [Escherichia coli
           'BL21-Gold(DE3)pLysS AG']
 gi|253979567|gb|ACT45237.1| conserved hypothetical protein [Escherichia coli BL21(DE3)]
 gi|323959403|gb|EGB55063.1| hypothetical protein ERGG_04098 [Escherichia coli H489]
 gi|441650536|emb|CCQ03630.1| Putative glycosyl hydrolase of unknown function (DUF1680)
           [Escherichia coli O5:K4(L):H4 str. ATCC 23502]
          Length = 659

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 105/499 (21%), Positives = 181/499 (36%), Gaps = 97/499 (19%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
           V  +L A A       +A +++    V+  ++  Q +   GYL+ + T      +A +  
Sbjct: 74  VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNTYFT-----VKAPEER 126

Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
           W+     H++     L++  V    A   +    +V    + + +V      + H Y  +
Sbjct: 127 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLADHIDRVFGPDESKLHGYPGH 186

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
            E   +   L RLY +T +P++L L + F      +P +      +    SH+H      
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243

Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
                  +  H+ +      IG  +R+      +Y + G   +  ++   S         
Sbjct: 244 MVKDKAYSQAHLSLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLW 297

Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
                   Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354

Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
             YAD  ERAL N VL      +     Y+ PL          H    KFN  +      
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPI 405

Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
                   CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V 
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
               W    ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 516

Query: 586 TERWSYNDKLTIQLPLSLR 604
           T  W   D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535


>gi|262382783|ref|ZP_06075920.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262295661|gb|EEY83592.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 680

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 91/424 (21%), Positives = 170/424 (40%), Gaps = 41/424 (9%)

Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
           ++  +L QY  A N Q  ++ +++  YF  ++ ++    S    W    E+ GG N  V+
Sbjct: 164 VMLKVLQQYYSATNDQ--RVISFLTNYFKYQLSELPK--SPLGKWTFWAEQRGGDNLMVV 219

Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
           Y LY+IT DP  L L  L  K  F    + L  D+L+  ++   + +  G +   + Y+ 
Sbjct: 220 YWLYNITGDPFLLELGELIHKQTFNWTDVFLNQDHLARQNSLHCVNLAQGFKEPVIYYQQ 279

Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
           + +P         +  +  +  + TG       W   + L     ++  E CT   M+  
Sbjct: 280 SHNPQNLEAVKEAVRKMRHTIGFPTG------LWAGDELLRFGNPTQGSELCTAVEMMFS 333

Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL--------GRG-VSKAR 460
              +   T ++ +AD+ E+   N VL  Q   +     Y   +        GR  VS   
Sbjct: 334 LEKMLEITGDVQWADHLEKVAYN-VLPTQIKDDFSARQYYQQVNQVAITCEGRNFVSPHE 392

Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--- 517
            T     + + + CC     + + K    ++F    N  G+  + Y  S    + G+   
Sbjct: 393 DTDIVFGELSGYPCCTSNLHQGWPKFTRHLWFATADN--GIASLIYAPSEVTVQVGNDIT 450

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
           V + +K +    ++  +   L+F SK++       +LR+P W   N    ++NG+ + + 
Sbjct: 451 VKIAEKTN--YPFEEKIDFNLSFPSKKDKKAFFPFHLRIPAW--CNNPVITINGEAVSIA 506

Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
              G  +     W   D + ++LP+ + T    DD         I  GP L +     +W
Sbjct: 507 AHSGEIVRINREWKDGDHIQLELPMRISTSNWYDD------AVVIERGPLLYSLKMDEKW 560

Query: 637 DIKT 640
           + K 
Sbjct: 561 ERKV 564


>gi|251786831|ref|YP_003001135.1| ybl149 [Escherichia coli BL21(DE3)]
 gi|242379104|emb|CAQ33906.1| ybl149 [Escherichia coli BL21(DE3)]
          Length = 667

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 105/499 (21%), Positives = 181/499 (36%), Gaps = 97/499 (19%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
           V  +L A A       +A +++    V+  ++  Q +   GYL+ + T      +A +  
Sbjct: 82  VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNTYFT-----VKAPEER 134

Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
           W+     H++     L++  V    A   +    +V    + + +V      + H Y  +
Sbjct: 135 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLADHIDRVFGPDESKLHGYPGH 194

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
            E   +   L RLY +T +P++L L + F      +P +      +    SH+H      
Sbjct: 195 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 251

Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
                  +  H+ +      IG  +R+      +Y + G   +  ++   S         
Sbjct: 252 MVKDKAYSQAHLSLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLW 305

Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
                   Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +
Sbjct: 306 NNMAQRQLYITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 362

Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
             YAD  ERAL N VL      +     Y+ PL          H    KFN  +      
Sbjct: 363 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPI 413

Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
                   CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V 
Sbjct: 414 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 470

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
               W    ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  
Sbjct: 471 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 524

Query: 586 TERWSYNDKLTIQLPLSLR 604
           T  W   D L + LP+ +R
Sbjct: 525 TREWQEGDTLNLTLPMPVR 543


>gi|386016685|ref|YP_005934975.1| hypothetical protein PAJ_2099 [Pantoea ananatis AJ13355]
 gi|327394757|dbj|BAK12179.1| hypothetical protein PAJ_2099 [Pantoea ananatis AJ13355]
          Length = 659

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 82/362 (22%), Positives = 131/362 (36%), Gaps = 69/362 (19%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
           L RLY +T  P++L L + F      +P F      +    S++H             + 
Sbjct: 201 LMRLYEVTQQPRYLALVNTFVSQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 260

Query: 335 THIPIV-----IGSQMRYEVTGDPLYKLIGTFFM-----------DIVNASHS------Y 372
            H P+      +G  +R+      +Y + G   +           D +   H+      Y
Sbjct: 261 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 314

Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ER
Sbjct: 315 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 371

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIES 482
           AL N VL      +     Y+ PL          H +         W    CC       
Sbjct: 372 ALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIARL 430

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
            + LG  IY   E     L+I  Y+ +  D   G   L  ++     W+    +T++   
Sbjct: 431 LTSLGHYIYTPHEN---ALFINLYVGNRVDVPVGDRTLGIRISGNFPWEE--TVTISVDV 485

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
            Q V    +L LR+P W      Q S NG+ +       +L     W   D LT+ LP+ 
Sbjct: 486 TQPVKH--TLALRLPDW--CEAPQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPMP 541

Query: 603 LR 604
           +R
Sbjct: 542 VR 543


>gi|417386570|ref|ZP_12151238.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Johannesburg str. S5-703]
 gi|353602920|gb|EHC58138.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Johannesburg str. S5-703]
          Length = 651

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSCDYDLPNDSIYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|291618364|ref|YP_003521106.1| hypothetical Protein PANA_2811 [Pantoea ananatis LMG 20103]
 gi|291153394|gb|ADD77978.1| Hypothetical Protein PANA_2811 [Pantoea ananatis LMG 20103]
          Length = 659

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 82/362 (22%), Positives = 131/362 (36%), Gaps = 69/362 (19%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
           L RLY +T  P++L L + F      +P F      +    S++H             + 
Sbjct: 201 LMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 260

Query: 335 THIPIV-----IGSQMRYEVTGDPLYKLIGTFFM-----------DIVNASHS------Y 372
            H P+      +G  +R+      +Y + G   +           D +   H+      Y
Sbjct: 261 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 314

Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ER
Sbjct: 315 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 371

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIES 482
           AL N VL      +     Y+ PL          H +         W    CC       
Sbjct: 372 ALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIARL 430

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
            + LG  IY   E     L+I  Y+ +  D   G   L  ++     W+    +T++   
Sbjct: 431 LTSLGHYIYTPHEN---ALFINLYVGNRVDVPVGDRTLGIRISGNFPWEE--TVTISVDV 485

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
            Q V    +L LR+P W      Q S NG+ +       +L     W   D LT+ LP+ 
Sbjct: 486 TQPVKH--TLALRLPDW--CEAPQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPMP 541

Query: 603 LR 604
           +R
Sbjct: 542 VR 543


>gi|416425586|ref|ZP_11692369.1| hypothetical protein SEEM315_05068 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315996572]
 gi|416430384|ref|ZP_11695001.1| hypothetical protein SEEM971_12506 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-1]
 gi|416437565|ref|ZP_11698915.1| hypothetical protein SEEM973_00650 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-3]
 gi|416443382|ref|ZP_11702995.1| hypothetical protein SEEM974_19090 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-4]
 gi|416450281|ref|ZP_11707410.1| hypothetical protein SEEM201_09767 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-1]
 gi|416460310|ref|ZP_11714693.1| hypothetical protein SEEM202_18350 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-2]
 gi|416463475|ref|ZP_11715992.1| hypothetical protein SEEM954_09563 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 531954]
 gi|416480379|ref|ZP_11722779.1| hypothetical protein SEEM054_17193 [Salmonella enterica subsp.
           enterica serovar Montevideo str. NC_MB110209-0054]
 gi|416487797|ref|ZP_11725654.1| hypothetical protein SEEM675_20283 [Salmonella enterica subsp.
           enterica serovar Montevideo str. OH_2009072675]
 gi|416501897|ref|ZP_11732445.1| hypothetical protein SEEM965_19843 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CASC_09SCPH15965]
 gi|416504577|ref|ZP_11733224.1| hypothetical protein SEEM031_12180 [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB31]
 gi|416517070|ref|ZP_11739340.1| hypothetical protein SEEM710_04917 [Salmonella enterica subsp.
           enterica serovar Montevideo str. ATCC BAA710]
 gi|416543079|ref|ZP_11752034.1| hypothetical protein SEEM19N_14012 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 19N]
 gi|416562276|ref|ZP_11762033.1| hypothetical protein SEEM42N_02114 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 42N]
 gi|416573654|ref|ZP_11767961.1| hypothetical protein SEEM41H_07852 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 4441 H]
 gi|416578850|ref|ZP_11770886.1| hypothetical protein SEEM801_22092 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 81038-01]
 gi|416584544|ref|ZP_11774245.1| hypothetical protein SEEM507_12719 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MD_MDA09249507]
 gi|416589552|ref|ZP_11777137.1| hypothetical protein SEEM877_17791 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 414877]
 gi|416607005|ref|ZP_11788219.1| hypothetical protein SEEM180_02276 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 413180]
 gi|416611569|ref|ZP_11790943.1| hypothetical protein SEEM600_22079 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 446600]
 gi|416624752|ref|ZP_11798278.1| hypothetical protein SEEM581_05149 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609458-1]
 gi|416626628|ref|ZP_11798711.1| hypothetical protein SEEM501_13368 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556150-1]
 gi|416644435|ref|ZP_11806741.1| hypothetical protein SEEM460_09301 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609460]
 gi|416648059|ref|ZP_11808823.1| hypothetical protein SEEM020_002973 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 507440-20]
 gi|416658271|ref|ZP_11814206.1| hypothetical protein SEEM6152_18571 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556152]
 gi|416668027|ref|ZP_11818653.1| hypothetical protein SEEM0077_13550 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB101509-0077]
 gi|416681176|ref|ZP_11823586.1| hypothetical protein SEEM0047_06329 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB102109-0047]
 gi|416694001|ref|ZP_11826910.1| hypothetical protein SEEM0055_00612 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB110209-0055]
 gi|416708995|ref|ZP_11833799.1| hypothetical protein SEEM0052_07867 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB111609-0052]
 gi|416712890|ref|ZP_11836552.1| hypothetical protein SEEM3312_21482 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009083312]
 gi|416721065|ref|ZP_11842596.1| hypothetical protein SEEM5258_14872 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009085258]
 gi|416722793|ref|ZP_11843619.1| hypothetical protein SEEM1156_02237 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315731156]
 gi|416729527|ref|ZP_11848104.1| hypothetical protein SEEM9199_08226 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2009159199]
 gi|416741866|ref|ZP_11855415.1| hypothetical protein SEEM8282_16962 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008282]
 gi|416745954|ref|ZP_11857573.1| hypothetical protein SEEM8283_19676 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008283]
 gi|416755322|ref|ZP_11861983.1| hypothetical protein SEEM8284_15210 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008284]
 gi|416763125|ref|ZP_11866955.1| hypothetical protein SEEM8285_00470 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008285]
 gi|416771775|ref|ZP_11872954.1| hypothetical protein SEEM8287_16934 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008287]
 gi|418485126|ref|ZP_13054112.1| hypothetical protein SEEM906_06936 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 80959-06]
 gi|418491104|ref|ZP_13057631.1| hypothetical protein SEEM5278_11244 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035278]
 gi|418494659|ref|ZP_13061110.1| hypothetical protein SEEM5318_03413 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035318]
 gi|418499800|ref|ZP_13066201.1| hypothetical protein SEEM5320_09457 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035320]
 gi|418503417|ref|ZP_13069781.1| hypothetical protein SEEM5321_21740 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035321]
 gi|418508996|ref|ZP_13075294.1| hypothetical protein SEEM5327_04049 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035327]
 gi|418525130|ref|ZP_13091112.1| hypothetical protein SEEM8286_03017 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008286]
 gi|322613936|gb|EFY10872.1| hypothetical protein SEEM315_05068 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315996572]
 gi|322620305|gb|EFY17173.1| hypothetical protein SEEM971_12506 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-1]
 gi|322625311|gb|EFY22138.1| hypothetical protein SEEM973_00650 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-3]
 gi|322630022|gb|EFY26795.1| hypothetical protein SEEM974_19090 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-4]
 gi|322634213|gb|EFY30948.1| hypothetical protein SEEM201_09767 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-1]
 gi|322635886|gb|EFY32595.1| hypothetical protein SEEM202_18350 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-2]
 gi|322643086|gb|EFY39661.1| hypothetical protein SEEM954_09563 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 531954]
 gi|322644583|gb|EFY41119.1| hypothetical protein SEEM054_17193 [Salmonella enterica subsp.
           enterica serovar Montevideo str. NC_MB110209-0054]
 gi|322650825|gb|EFY47217.1| hypothetical protein SEEM675_20283 [Salmonella enterica subsp.
           enterica serovar Montevideo str. OH_2009072675]
 gi|322653011|gb|EFY49346.1| hypothetical protein SEEM965_19843 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CASC_09SCPH15965]
 gi|322659974|gb|EFY56214.1| hypothetical protein SEEM19N_14012 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 19N]
 gi|322663307|gb|EFY59511.1| hypothetical protein SEEM801_22092 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 81038-01]
 gi|322668793|gb|EFY64946.1| hypothetical protein SEEM507_12719 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MD_MDA09249507]
 gi|322674404|gb|EFY70497.1| hypothetical protein SEEM877_17791 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 414877]
 gi|322680894|gb|EFY76928.1| hypothetical protein SEEM180_02276 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 413180]
 gi|322687170|gb|EFY83143.1| hypothetical protein SEEM600_22079 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 446600]
 gi|323192129|gb|EFZ77362.1| hypothetical protein SEEM581_05149 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609458-1]
 gi|323200633|gb|EFZ85707.1| hypothetical protein SEEM501_13368 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556150-1]
 gi|323201343|gb|EFZ86409.1| hypothetical protein SEEM460_09301 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609460]
 gi|323211827|gb|EFZ96659.1| hypothetical protein SEEM6152_18571 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556152]
 gi|323216186|gb|EGA00914.1| hypothetical protein SEEM0077_13550 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB101509-0077]
 gi|323220409|gb|EGA04863.1| hypothetical protein SEEM0047_06329 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB102109-0047]
 gi|323226266|gb|EGA10481.1| hypothetical protein SEEM0055_00612 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB110209-0055]
 gi|323228386|gb|EGA12517.1| hypothetical protein SEEM0052_07867 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB111609-0052]
 gi|323234207|gb|EGA18295.1| hypothetical protein SEEM3312_21482 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009083312]
 gi|323237192|gb|EGA21259.1| hypothetical protein SEEM5258_14872 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009085258]
 gi|323244711|gb|EGA28715.1| hypothetical protein SEEM1156_02237 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315731156]
 gi|323249192|gb|EGA33110.1| hypothetical protein SEEM9199_08226 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2009159199]
 gi|323250689|gb|EGA34569.1| hypothetical protein SEEM8282_16962 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008282]
 gi|323257564|gb|EGA41251.1| hypothetical protein SEEM8283_19676 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008283]
 gi|323262273|gb|EGA45834.1| hypothetical protein SEEM8284_15210 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008284]
 gi|323266172|gb|EGA49663.1| hypothetical protein SEEM8285_00470 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008285]
 gi|323268806|gb|EGA52264.1| hypothetical protein SEEM8287_16934 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008287]
 gi|363557827|gb|EHL42031.1| hypothetical protein SEEM031_12180 [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB31]
 gi|363561441|gb|EHL45559.1| hypothetical protein SEEM710_04917 [Salmonella enterica subsp.
           enterica serovar Montevideo str. ATCC BAA710]
 gi|363571665|gb|EHL55571.1| hypothetical protein SEEM41H_07852 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 4441 H]
 gi|363573358|gb|EHL57244.1| hypothetical protein SEEM42N_02114 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 42N]
 gi|366056585|gb|EHN20901.1| hypothetical protein SEEM906_06936 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 80959-06]
 gi|366061420|gb|EHN25666.1| hypothetical protein SEEM5318_03413 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035318]
 gi|366063348|gb|EHN27567.1| hypothetical protein SEEM5278_11244 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035278]
 gi|366069988|gb|EHN34105.1| hypothetical protein SEEM5320_09457 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035320]
 gi|366073016|gb|EHN37095.1| hypothetical protein SEEM5321_21740 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035321]
 gi|366078850|gb|EHN42847.1| hypothetical protein SEEM5327_04049 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035327]
 gi|366830119|gb|EHN56993.1| hypothetical protein SEEM020_002973 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 507440-20]
 gi|372206701|gb|EHP20203.1| hypothetical protein SEEM8286_03017 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008286]
          Length = 651

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIEQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVHH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|386078433|ref|YP_005991958.1| hypothetical protein [Pantoea ananatis PA13]
 gi|354987614|gb|AER31738.1| hypothetical protein PAGR_g1212 [Pantoea ananatis PA13]
          Length = 651

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 82/362 (22%), Positives = 131/362 (36%), Gaps = 69/362 (19%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
           L RLY +T  P++L L + F      +P F      +    S++H             + 
Sbjct: 193 LMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 252

Query: 335 THIPIV-----IGSQMRYEVTGDPLYKLIGTFFM-----------DIVNASHS------Y 372
            H P+      +G  +R+      +Y + G   +           D +   H+      Y
Sbjct: 253 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 306

Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIES 482
           AL N VL      +     Y+ PL          H +         W    CC       
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIARL 422

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
            + LG  IY   E     L+I  Y+ +  D   G   L  ++     W+    +T++   
Sbjct: 423 LTSLGHYIYTPHEN---ALFINLYVGNRVDVPVGDRTLGIRISGNFPWEE--TVTISVDV 477

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
            Q V    +L LR+P W      Q S NG+ +       +L     W   D LT+ LP+ 
Sbjct: 478 TQPVKH--TLALRLPDW--CEAPQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPMP 533

Query: 603 LR 604
           +R
Sbjct: 534 VR 535


>gi|298247843|ref|ZP_06971648.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
           44963]
 gi|297550502|gb|EFH84368.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
           44963]
          Length = 643

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 101/464 (21%), Positives = 180/464 (38%), Gaps = 60/464 (12%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
           V  ++ A++   A T +A +++++  V+  ++  Q+    GYL+ +      SFE     
Sbjct: 92  VYKWVEAASWTLAQTPDARLEQQLDEVIALIASAQDD--DGYLNTY-----YSFERQAER 144

Query: 226 WAPYYTIHKI-LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
           W+    +H++  AG L Q  +A +    K +  +++        + +++  +        
Sbjct: 145 WSNLTDMHELYCAGHLLQAAVAHHRATGKAS--LLDVATRVANNIASVFGPQGR-----P 197

Query: 285 ETGGMNDV---LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF----- 331
            T G  ++   L  L   T +P++L  A  F      KP  L       D+L        
Sbjct: 198 GTCGHPEIELALVELARETGEPRYLQQAQFFIGQRGQKPPVLNGSPYCQDHLPVREQQEV 257

Query: 332 --HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
             HA   + +  G    Y  TG+             +    +Y TGG  +R   W+ +  
Sbjct: 258 VGHAVRALYLYAGVTDAYLETGEAALDHAQEALWQNLTERKTYVTGGVGSR---WEGEAF 314

Query: 390 ADTLGSENE----ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
            +     NE    ETC     +  +  L +   E  + D  E+ L NGV++     +  +
Sbjct: 315 GENYELPNERAYTETCAAIASVMWNWRLLQARPEARFTDVIEQTLYNGVIA-GSSLDGKL 373

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
             Y  PL       R  H     F++  CC        + L    Y   E    G+++  
Sbjct: 374 YFYQNPLA-----DRGKHRRQPWFDTA-CCPPNIARLLASLPGYFYSTSE---EGIWLHL 424

Query: 506 YISSS--FDWKSGHVV-LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
           Y S++      SG  + + Q+ +    WD  + + L     Q+     +L +R+P W  +
Sbjct: 425 YASNTAQIPLASGEAITIEQQTN--YPWDEEIGVRLQMREAQDF----TLFVRIPAW--A 476

Query: 563 NGAQASLNGQNLP--LPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            GAQ  +N Q +      PG +      W   DK+TI LPL +R
Sbjct: 477 TGAQIQVNKQPVEGLAIKPGTYAQLNRTWQPGDKVTIVLPLEVR 520


>gi|416597563|ref|ZP_11782144.1| hypothetical protein SEEM867_21594 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 366867]
 gi|322678388|gb|EFY74449.1| hypothetical protein SEEM867_21594 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 366867]
          Length = 651

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIEQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVHH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|168465016|ref|ZP_02698908.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Newport str. SL317]
 gi|418762014|ref|ZP_13318148.1| hypothetical protein SEEN185_06317 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35185]
 gi|418768178|ref|ZP_13324234.1| hypothetical protein SEEN199_22735 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35199]
 gi|418769292|ref|ZP_13325327.1| hypothetical protein SEEN539_08791 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21539]
 gi|418774344|ref|ZP_13330315.1| hypothetical protein SEEN953_04837 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 33953]
 gi|418782301|ref|ZP_13338167.1| hypothetical protein SEEN188_15334 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35188]
 gi|418784431|ref|ZP_13340269.1| hypothetical protein SEEN559_22215 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21559]
 gi|418804570|ref|ZP_13360175.1| hypothetical protein SEEN202_19624 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35202]
 gi|419790711|ref|ZP_14316381.1| hypothetical protein SEENLE01_19942 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|419795154|ref|ZP_14320760.1| hypothetical protein SEENLE15_09022 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|195632371|gb|EDX50855.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Newport str. SL317]
 gi|392613400|gb|EIW95860.1| hypothetical protein SEENLE01_19942 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|392613862|gb|EIW96317.1| hypothetical protein SEENLE15_09022 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|392732968|gb|EIZ90175.1| hypothetical protein SEEN199_22735 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35199]
 gi|392738037|gb|EIZ95186.1| hypothetical protein SEEN185_06317 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35185]
 gi|392740729|gb|EIZ97848.1| hypothetical protein SEEN539_08791 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21539]
 gi|392744606|gb|EJA01653.1| hypothetical protein SEEN188_15334 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35188]
 gi|392751846|gb|EJA08794.1| hypothetical protein SEEN953_04837 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 33953]
 gi|392754775|gb|EJA11691.1| hypothetical protein SEEN559_22215 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21559]
 gi|392770727|gb|EJA27452.1| hypothetical protein SEEN202_19624 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35202]
          Length = 651

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY         LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +    +  L  ++     W   +++T+   S Q V    +L LR+P W     
Sbjct: 442 MYVGNSMEIPVENGALKLRISGNYPWQEQVKITI--DSVQPVRH--TLALRLPDWCPE-- 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 556 PLVYCLEQADNGE 568


>gi|167549076|ref|ZP_02342835.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA29]
 gi|205325554|gb|EDZ13393.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA29]
          Length = 651

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 93/253 (36%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY         LYI 
Sbjct: 387 --EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +    +  L  ++     W  + +M +   S Q V    +L LR+P W     
Sbjct: 442 MYVGNSLEVPVENGALKLRIGGNYPW--HEQMKIAIDSVQPVRH--TLALRLPDWCPE-- 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 556 PLVYCLEQADNGE 568


>gi|417329582|ref|ZP_12114395.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Adelaide str. A4-669]
 gi|353564565|gb|EHC30601.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Adelaide str. A4-669]
          Length = 651

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY         LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +    +  L  ++     W   +++T+   S Q V    +L LR+P W     
Sbjct: 442 MYVGNSMEIPVENGALKLRISGNYPWQEQVKITI--DSVQPVRH--TLALRLPDWCPE-- 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 556 PLVYCLEQADNGE 568


>gi|315607261|ref|ZP_07882261.1| conserved hypothetical protein [Prevotella buccae ATCC 33574]
 gi|315250964|gb|EFU30953.1| conserved hypothetical protein [Prevotella buccae ATCC 33574]
          Length = 813

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 103/451 (22%), Positives = 173/451 (38%), Gaps = 95/451 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T   ++L +A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 224 ALCKLYKVTGSRRYLDMARYFVEETGRG---TDGHRLSEY-SQDHKPILRQQEIVGHAVR 279

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +TGD  Y        + +     + TGG  +R          +  G 
Sbjct: 280 AGYLYSGVADVAALTGDTAYFHALERLWNNMAGKKLFITGGMGSRA-------QGEGFGP 332

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
           + E        ETC +   +  +  +F  T E  Y D YERAL NGVLS       GV +
Sbjct: 333 DYELNNMTAYQETCASIANVFWNYRMFLATGEGKYVDVYERALYNGVLS-------GVSL 385

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G      + +    Y     ++   
Sbjct: 386 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGNVTRFVASVPQYQYAVRGSDI--- 436

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT- 560
           Y+  YI  + D  +G  +  Q   P   WD    +T+T   K+   +  +L  R+P W  
Sbjct: 437 YVNLYIQGTAD-VNGVRLAQQTRYP---WDG--DITVTVDPKRS--RRFALRFRIPGWAG 488

Query: 561 ----------YSNGAQ---ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
                     +++ ++     +NG+ +   P   ++    RW   D++ I LP+ +R  A
Sbjct: 489 ACPVGTNLYHFADSSRPFTVKVNGRKIAGEPVDGYMVIDRRWMRGDRVEISLPMEVRRVA 548

Query: 608 ----IQDDRPEYASIQAILFGP--YLLAGHTSGEWDIKTGTARSLSALISPIPPSFNA-Q 660
               ++DDR +Y    A+  GP  Y L G       +   + R    L +PI   + A +
Sbjct: 549 ANDNVEDDRGKY----ALERGPIVYCLEGRDQAHSTVFDKSVR----LDAPIRADYRADK 600

Query: 661 LVTFTQESGNSTFVMSN-SNQSITMEEFPVS 690
           L    + SG +  V ++ S + +  +  P S
Sbjct: 601 LNGIVELSGEAEEVEADGSVRPVAFKAIPYS 631


>gi|16766964|ref|NP_462579.1| hypothetical protein STM3679 [Salmonella enterica subsp. enterica
           serovar Typhimurium str. LT2]
 gi|167990915|ref|ZP_02572014.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar 4,[5],12:i:- str. CVM23701]
 gi|374978319|ref|ZP_09719662.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Typhimurium str. TN061786]
 gi|378447048|ref|YP_005234680.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
           Typhimurium str. D23580]
 gi|378452556|ref|YP_005239916.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. 14028S]
 gi|378701566|ref|YP_005183524.1| hypothetical protein SL1344_3644 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. SL1344]
 gi|378986276|ref|YP_005249432.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. T000240]
 gi|378990981|ref|YP_005254145.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. UK-1]
 gi|379702940|ref|YP_005244668.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. ST4/74]
 gi|383498313|ref|YP_005399002.1| hypothetical protein UMN798_3997 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 798]
 gi|422027921|ref|ZP_16374245.1| hypothetical protein B571_18397 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|422032964|ref|ZP_16379054.1| hypothetical protein B572_18529 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|427555556|ref|ZP_18929550.1| hypothetical protein B576_18491 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|427573106|ref|ZP_18934155.1| hypothetical protein B577_17365 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|427594481|ref|ZP_18939063.1| hypothetical protein B573_17940 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|427618885|ref|ZP_18943976.1| hypothetical protein B574_18451 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|427642409|ref|ZP_18948833.1| hypothetical protein B575_18572 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|427657950|ref|ZP_18953577.1| hypothetical protein B578_18134 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|427663174|ref|ZP_18958453.1| hypothetical protein B579_18597 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|427679110|ref|ZP_18963359.1| hypothetical protein B580_18826 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
 gi|427801169|ref|ZP_18968792.1| hypothetical protein B581_21834 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm5]
 gi|16422244|gb|AAL22538.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. LT2]
 gi|205330807|gb|EDZ17571.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar 4,[5],12:i:- str. CVM23701]
 gi|261248827|emb|CBG26680.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. D23580]
 gi|267995935|gb|ACY90820.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. 14028S]
 gi|301160215|emb|CBW19737.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. SL1344]
 gi|312914705|dbj|BAJ38679.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. T000240]
 gi|321226733|gb|EFX51783.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Typhimurium str. TN061786]
 gi|323132039|gb|ADX19469.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. ST4/74]
 gi|332990528|gb|AEF09511.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. UK-1]
 gi|380465134|gb|AFD60537.1| hypothetical protein UMN798_3997 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 798]
 gi|414013156|gb|EKS97053.1| hypothetical protein B571_18397 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|414014140|gb|EKS97993.1| hypothetical protein B576_18491 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|414014578|gb|EKS98419.1| hypothetical protein B572_18529 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|414027997|gb|EKT11199.1| hypothetical protein B577_17365 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|414029273|gb|EKT12434.1| hypothetical protein B573_17940 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|414031641|gb|EKT14688.1| hypothetical protein B574_18451 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|414042773|gb|EKT25304.1| hypothetical protein B575_18572 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|414043221|gb|EKT25734.1| hypothetical protein B578_18134 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|414047893|gb|EKT30155.1| hypothetical protein B579_18597 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|414056107|gb|EKT37949.1| hypothetical protein B580_18826 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
 gi|414062669|gb|EKT43947.1| hypothetical protein B581_21834 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm5]
          Length = 651

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRRRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG ++       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLDVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|237728888|ref|ZP_04559369.1| conserved hypothetical protein [Citrobacter sp. 30_2]
 gi|226909510|gb|EEH95428.1| conserved hypothetical protein [Citrobacter sp. 30_2]
          Length = 651

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 51/220 (23%), Positives = 83/220 (37%), Gaps = 32/220 (14%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + +G  IY   +     LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTPRQD---ALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +       L  ++     W  + ++ +   S Q +    +L LR+P W     
Sbjct: 442 MYVGNSMEVPVADGSLKLRISGDYPW--HEQVKIAIESPQSI--YHTLALRLPDWC--TA 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            Q  LNGQ +       +L  +  W   D L++ LP+ +R
Sbjct: 496 PQVLLNGQPVEQDIRKGYLHISRTWQEGDTLSLTLPMPVR 535


>gi|418511390|ref|ZP_13077652.1| hypothetical protein SEEPO729_11665 [Salmonella enterica subsp.
           enterica serovar Pomona str. ATCC 10729]
 gi|366084797|gb|EHN48695.1| hypothetical protein SEEPO729_11665 [Salmonella enterica subsp.
           enterica serovar Pomona str. ATCC 10729]
          Length = 651

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|421844899|ref|ZP_16278055.1| hypothetical protein D186_07681 [Citrobacter freundii ATCC 8090 =
           MTCC 1658]
 gi|411773762|gb|EKS57290.1| hypothetical protein D186_07681 [Citrobacter freundii ATCC 8090 =
           MTCC 1658]
 gi|455645502|gb|EMF24562.1| hypothetical protein H262_06439 [Citrobacter freundii GTC 09479]
          Length = 651

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 52/220 (23%), Positives = 86/220 (39%), Gaps = 32/220 (14%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + +G  IY   +     LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTPRQD---ALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +    +  L  ++     W  + ++ +T  S + V    +L LR+P W   + 
Sbjct: 442 MYVGNSMEVPVVNGSLKLRISGDYPW--HEQVKITIESPRSV--YHTLALRLPDWC--SA 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            Q  LNGQ +       +L  +  W   D L++ LP+ +R
Sbjct: 496 PQVLLNGQPVEQDIRKGYLHISRTWQEGDTLSLTLPMPVR 535


>gi|302809111|ref|XP_002986249.1| hypothetical protein SELMODRAFT_425170 [Selaginella moellendorffii]
 gi|300146108|gb|EFJ12780.1| hypothetical protein SELMODRAFT_425170 [Selaginella moellendorffii]
          Length = 192

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 33/52 (63%), Gaps = 3/52 (5%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
            GHYLSA+A++WASTHNA +K++M  +V  L+ECQ        S  P  LF 
Sbjct: 7   AGHYLSATAKLWASTHNAEVKKRMDALVNILAECQ---AASRKSELPVNLFQ 55


>gi|417361434|ref|ZP_12135327.1| secreted protein [Salmonella enterica subsp. enterica serovar Give
           str. S5-487]
 gi|353584072|gb|EHC44282.1| secreted protein [Salmonella enterica subsp. enterica serovar Give
           str. S5-487]
          Length = 651

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSCDYDLPNDSIYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSVEIPVENGALKLRIGGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|194430977|ref|ZP_03063270.1| conserved hypothetical protein [Shigella dysenteriae 1012]
 gi|417675158|ref|ZP_12324583.1| hypothetical protein SD15574_4764 [Shigella dysenteriae 155-74]
 gi|194420432|gb|EDX36508.1| conserved hypothetical protein [Shigella dysenteriae 1012]
 gi|332084488|gb|EGI89683.1| hypothetical protein SD15574_4764 [Shigella dysenteriae 155-74]
          Length = 656

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + L + +R
Sbjct: 525 TLNLTLSMPVR 535


>gi|429083191|ref|ZP_19146237.1| COG3533 secreted protein [Cronobacter condimenti 1330]
 gi|426548006|emb|CCJ72278.1| COG3533 secreted protein [Cronobacter condimenti 1330]
          Length = 651

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 80/212 (37%), Gaps = 16/212 (7%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLEVHPKT 392

Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               H +         W    CC        + LG  IY         LYI  Y+ +S +
Sbjct: 393 LCLNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTPRPD---ALYINLYVGNSIE 449

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
              G  VL  +V     W    ++ +   S   V    +L LRMP W   +  Q +LNG 
Sbjct: 450 VPVGENVLRLRVSGNFPWQE--KVVIAIDSPLPVQH--TLALRMPDWC--DAPQVTLNGI 503

Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            +       +L     W   D LT+ LP+ +R
Sbjct: 504 EVEKSVRKGYLHIPRVWREGDTLTLTLPMPVR 535


>gi|365102501|ref|ZP_09332802.1| hypothetical protein HMPREF9428_03810 [Citrobacter freundii
           4_7_47CFAA]
 gi|363646229|gb|EHL85477.1| hypothetical protein HMPREF9428_03810 [Citrobacter freundii
           4_7_47CFAA]
          Length = 651

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 51/220 (23%), Positives = 83/220 (37%), Gaps = 32/220 (14%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + +G  IY   +     LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTPRQD---ALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +       L  ++     W  + ++ +   S Q +    +L LR+P W     
Sbjct: 442 MYVGNSMEVPVADGSLKLRISGDYPW--HEQVKIAIESPQSI--YHTLALRLPDWC--TA 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            Q  LNGQ +       +L  +  W   D L++ LP+ +R
Sbjct: 496 PQVLLNGQPVEQDIRKGYLHISRTWQEGDTLSLTLPMPVR 535


>gi|420349607|ref|ZP_14850981.1| hypothetical protein SB96558_4565 [Shigella boydii 965-58]
 gi|391265984|gb|EIQ24949.1| hypothetical protein SB96558_4565 [Shigella boydii 965-58]
          Length = 656

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + L + +R
Sbjct: 525 TLNLTLSMPVR 535


>gi|340619112|ref|YP_004737565.1| hypothetical protein zobellia_3147 [Zobellia galactanivorans]
 gi|339733909|emb|CAZ97286.1| Conserved hypothetical periplasmic protein [Zobellia
           galactanivorans]
          Length = 681

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 90/402 (22%), Positives = 148/402 (36%), Gaps = 45/402 (11%)

Query: 293 LYRLYSITHDPKHLLLAHLF---------DKPCFLGF------LALQADYLSHFHANTHI 337
           L  +Y  T D K+L L   F         D+    G        A++ +  +  HA    
Sbjct: 235 LIEMYRTTGDKKYLELTETFVDMLGTAPKDRLDHRGMDHSQRGTAIREESKAVGHAGHAN 294

Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL-ADTLGSE 396
            +  G    Y  TGD   K         V+    Y TG T    F      + A+  G +
Sbjct: 295 YLYAGVADLYAETGDQALKDALERIWTNVSTQKMYITGATGPHHFGISNHAIVAEAYGQD 354

Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMI 447
            E        ETC        +  +F    E  +AD  E    N  +S I    E     
Sbjct: 355 YELPNIKAYNETCANIGNAMWNWRMFLMNGEGRFADIMELIFYNSAISGISLDGEHFFYT 414

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
             L    G  +     G   +F S +CC    I + +K+    Y   E    G+++  Y 
Sbjct: 415 NPLRFIEGHPQNTKDEGKRGEFMSVFCCPPNIIRTIAKMHTYAYSTSE---KGIWVNLYG 471

Query: 508 SSSFD---WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
           S+  D       ++ L Q+ +    WD  +++T+    K+E     +L LR+P W  + G
Sbjct: 472 SNVLDTDLADGSNIKLTQESN--YPWDGNIKITIDSKKKKEY----ALMLRIPAW--AEG 523

Query: 565 AQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
           A   +NG+     P  G++     +W   D + ++LP++ R      +  E  +  A+  
Sbjct: 524 ANIKVNGEKQDQSPKAGSYAEVNRKWKKGDVVELELPMAPRLITADPNVEETRNQVAVKR 583

Query: 624 GPYLLAGHTSGEWDIKTGTARSLSALISPIP--PSFNAQLVT 663
           GP +    +    D+  G+      L S I   P + A L++
Sbjct: 584 GPIVYCLESK---DLAAGSNIKDIVLPSDIKLQPKYEADLLS 622


>gi|416288023|ref|ZP_11649060.1| hypothetical protein SGB_04738 [Shigella boydii ATCC 9905]
 gi|320178140|gb|EFW53118.1| hypothetical protein SGB_04738 [Shigella boydii ATCC 9905]
          Length = 656

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + L + +R
Sbjct: 525 TLNLTLSMPVR 535


>gi|419730921|ref|ZP_14257856.1| hypothetical protein SEEH1579_14647 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|419735086|ref|ZP_14261970.1| hypothetical protein SEEH1563_04074 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|419740253|ref|ZP_14266986.1| hypothetical protein SEEH1573_01156 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|419743535|ref|ZP_14270200.1| hypothetical protein SEEH1566_22821 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|419746688|ref|ZP_14273264.1| hypothetical protein SEEH1565_23798 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
 gi|381293311|gb|EIC34483.1| hypothetical protein SEEH1579_14647 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|381295529|gb|EIC36640.1| hypothetical protein SEEH1573_01156 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|381295907|gb|EIC37016.1| hypothetical protein SEEH1563_04074 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|381312020|gb|EIC52830.1| hypothetical protein SEEH1566_22821 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|381320971|gb|EIC61499.1| hypothetical protein SEEH1565_23798 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
          Length = 651

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRSLKFNHIYEHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|168241855|ref|ZP_02666787.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Heidelberg str. SL486]
 gi|194451278|ref|YP_002047708.1| hypothetical protein SeHA_C4002 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. SL476]
 gi|386593352|ref|YP_006089752.1| hypothetical protein SU5_04156 [Salmonella enterica subsp. enterica
           serovar Heidelberg str. B182]
 gi|421571246|ref|ZP_16016925.1| hypothetical protein CFSAN00322_13684 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00322]
 gi|421575202|ref|ZP_16020815.1| hypothetical protein CFSAN00325_10357 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00325]
 gi|421579160|ref|ZP_16024730.1| hypothetical protein CFSAN00326_07107 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00326]
 gi|421586317|ref|ZP_16031800.1| hypothetical protein CFSAN00328_20048 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00328]
 gi|194409582|gb|ACF69801.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Heidelberg str. SL476]
 gi|205339076|gb|EDZ25840.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Heidelberg str. SL486]
 gi|383800393|gb|AFH47475.1| DUF1680 Glycosyl hydrolase [Salmonella enterica subsp. enterica
           serovar Heidelberg str. B182]
 gi|402521555|gb|EJW28891.1| hypothetical protein CFSAN00322_13684 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00322]
 gi|402522242|gb|EJW29566.1| hypothetical protein CFSAN00325_10357 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00325]
 gi|402523131|gb|EJW30450.1| hypothetical protein CFSAN00326_07107 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00326]
 gi|402529042|gb|EJW36291.1| hypothetical protein CFSAN00328_20048 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00328]
          Length = 651

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|417337268|ref|ZP_12119473.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Alachua str. R6-377]
 gi|353565179|gb|EHC31033.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Alachua str. R6-377]
          Length = 651

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|238023985|ref|YP_002908217.1| hypothetical protein [Burkholderia glumae BGR1]
 gi|237878650|gb|ACR30982.1| Hypothetical protein bglu_2g05390 [Burkholderia glumae BGR1]
          Length = 655

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 98/485 (20%), Positives = 184/485 (37%), Gaps = 74/485 (15%)

Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAP 228
           +L A A + A   +A +++     +  L+  Q+    GYL+ + T      +A    W  
Sbjct: 78  WLEAVAYLLAEQRDAELEQIADETIDLLARAQHD--DGYLNTYFT-----IKAPGQRWTN 130

Query: 229 YYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
               H++     L++  V    A   +    + E F   +  V    + + + Y  + E 
Sbjct: 131 LAECHELYCAGHLIEAAVAYWQATGKRKLLEVAERFVAHIDTVFGTEAGKLNGYPGHPE- 189

Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF---------- 331
             +   L RL+ ++ +P+HL LA  F      +P +      +   +SH+          
Sbjct: 190 --IELALMRLHEVSGNPRHLALARYFVEQRGARPHYYDIEYEKRGRVSHWDVHGRAWITT 247

Query: 332 ---HANTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSY 372
              ++  H PI      +G  +R             V+GD     +       +     Y
Sbjct: 248 HKAYSQAHKPIAEQDAAVGHAVRLVYLYAGVAHLARVSGDAAKLNVCKAVWRNMVTRQMY 307

Query: 373 ATGGTSAR----EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG  A+     F  D +   DT  +E   TC +  ++  +R +   ++E  YAD  ER
Sbjct: 308 VTGGIGAQVWGESFTCDYELPNDTAYTE---TCASVGLVFFARRMLEASRESGYADVLER 364

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW------GTKFNSFWCCYGTGIES 482
           AL N VL+   G +     Y+ PL    +  R  H +        ++    CC       
Sbjct: 365 ALYNTVLA-GIGLDGRSFFYVNPLETHPAGIRGNHKYEHVKPVRQRWFGCACCPPNVARL 423

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG--HVVLNQKVDPIVSWDPYLRMTLTF 540
            + L   +Y  ++  +   Y+  Y++      +G   V L Q+ +    W   LR+ +  
Sbjct: 424 IASLDQYVYLVDDSII---YVNLYVAGEARLNAGTSRVTLRQQGN--YPWRGDLRIVV-- 476

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP-GNFLSATERWSYNDKLTIQL 599
             +Q  G   ++ +R+P W  +   +  +NG  +        +L     W   D + + L
Sbjct: 477 --EQADGFDGTIAVRLPDWCAA--PEVRVNGDTVACSAAVDGYLHLPRVWHDGDTIELVL 532

Query: 600 PLSLR 604
           P+++R
Sbjct: 533 PMTVR 537


>gi|200389015|ref|ZP_03215627.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Virchow str. SL491]
 gi|199606113|gb|EDZ04658.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Virchow str. SL491]
          Length = 651

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|197247483|ref|YP_002148608.1| hypothetical protein SeAg_B3893 [Salmonella enterica subsp.
           enterica serovar Agona str. SL483]
 gi|440762586|ref|ZP_20941641.1| hypothetical protein F434_06500 [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
 gi|440769697|ref|ZP_20948654.1| hypothetical protein F514_18649 [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|440774815|ref|ZP_20953701.1| hypothetical protein F515_20523 [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
 gi|197211186|gb|ACH48583.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Agona str. SL483]
 gi|436412179|gb|ELP10122.1| hypothetical protein F515_20523 [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
 gi|436414203|gb|ELP12135.1| hypothetical protein F514_18649 [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|436422862|gb|ELP20686.1| hypothetical protein F434_06500 [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
          Length = 651

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|402306205|ref|ZP_10825256.1| putative glycosyhydrolase [Prevotella sp. MSX73]
 gi|400379972|gb|EJP32801.1| putative glycosyhydrolase [Prevotella sp. MSX73]
          Length = 816

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 131/321 (40%), Gaps = 60/321 (18%)

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLP 451
           +ETC +   +  +  +F  T E  Y D YERAL NGVLS       GV +      Y  P
Sbjct: 346 QETCASIANVFWNYRMFLATGEGKYVDVYERALYNGVLS-------GVSLSGDKFFYDNP 398

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           L   + +    H +G       CC G      + +    Y     ++   Y+  YI  + 
Sbjct: 399 L-ESMGQHERQHWFGCA-----CCPGNVTRFVASVPQYQYAVRGSDI---YVNLYIQGTA 449

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT----------- 560
           D  +G  +  Q   P   WD    +T+T   K+   +  +L  R+P W            
Sbjct: 450 D-VNGVRLAQQTRYP---WDG--DITVTVDPKRS--RRFALRFRIPGWAGACPVGTNLYH 501

Query: 561 YSNGAQ---ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA----IQDDRP 613
           +++ ++     +NG+ +   P   ++    RW   D++ I LP+ +R  A    ++DDR 
Sbjct: 502 FADSSRPFTVKVNGREIAGEPVDGYMVIDRRWMRGDRVEISLPMEVRRVAANDNVEDDRG 561

Query: 614 EYASIQAILFGP--YLLAGHTSGEWDIKTGTARSLSALISPIPPSFNA-QLVTFTQESGN 670
           +Y    A+  GP  Y L G       +   + R    L +PI   + A +L    + SG 
Sbjct: 562 KY----ALERGPIVYCLEGRDQAHSTVFDKSVR----LDAPIRADYRADKLNGIVELSGE 613

Query: 671 STFVMSN-SNQSITMEEFPVS 690
           +  V ++ S + +  +  P S
Sbjct: 614 AEEVEADGSVRPVAFKAIPYS 634


>gi|417376625|ref|ZP_12145767.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Inverness str. R8-3668]
 gi|353592514|gb|EHC50495.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Inverness str. R8-3668]
          Length = 651

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|156935976|ref|YP_001439892.1| hypothetical protein ESA_03870 [Cronobacter sakazakii ATCC BAA-894]
 gi|156534230|gb|ABU79056.1| hypothetical protein ESA_03870 [Cronobacter sakazakii ATCC BAA-894]
          Length = 655

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 80/365 (21%), Positives = 130/365 (35%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF-------------HA 333
            L RLY  T +P++ +LA  F      +P F      +    S++             ++
Sbjct: 195 ALMRLYEATQEPRYQVLARYFVEQRGTQPHFYDIEYEKRGRTSYWNTYGPAWMVQDKAYS 254

Query: 334 NTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H P+      +G  +R+            ++GD   +       + +     Y TGG 
Sbjct: 255 QAHQPLAEQTRAVGHAVRFVYLMTGVAHLARLSGDEEKRRACLRLWENMARRQLYITGGI 314

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 315 GSQSSGEAFSTDYDLPNDTVYAE---SCASIGLIMFARRMLEMEGDSQYADVMERALYNT 371

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 372 VLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPVRQRWFGCACCPPNI 422

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY   E     L+I  YI ++     G   L  ++     W   +R+ + 
Sbjct: 423 ARLLTSLGHYIYTARED---ALFINLYIGNNVQLPVGDSTLRLRISGDFPWHEEVRIHID 479

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
                E     +L LR+P W   +  +  LNG+         +L  T  W   D LT+ L
Sbjct: 480 SPRPVE----HTLALRLPDW--CDAPRVMLNGRPCEGDIRKGYLWLTRTWHEGDTLTLTL 533

Query: 600 PLSLR 604
           P+ +R
Sbjct: 534 PMPVR 538


>gi|168235286|ref|ZP_02660344.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Schwarzengrund str. SL480]
 gi|194737873|ref|YP_002116613.1| hypothetical protein SeSA_A3877 [Salmonella enterica subsp.
           enterica serovar Schwarzengrund str. CVM19633]
 gi|194713375|gb|ACF92596.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Schwarzengrund str. CVM19633]
 gi|197291306|gb|EDY30658.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Schwarzengrund str. SL480]
          Length = 651

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSVEIPVENGALKLRIGGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|417394187|ref|ZP_12156450.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Minnesota str. A4-603]
 gi|353606439|gb|EHC60665.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Minnesota str. A4-603]
          Length = 651

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSVEIPVENGALKLRIGGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|437834770|ref|ZP_20845077.1| hypothetical protein SEEERB17_013756 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SARB17]
 gi|435300940|gb|ELO76997.1| hypothetical protein SEEERB17_013756 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SARB17]
          Length = 651

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVHH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|345297339|ref|YP_004826697.1| hypothetical protein Entas_0157 [Enterobacter asburiae LF7a]
 gi|345091276|gb|AEN62912.1| protein of unknown function DUF1680 [Enterobacter asburiae LF7a]
          Length = 649

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 84/366 (22%), Positives = 130/366 (35%), Gaps = 77/366 (21%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
           L RLY +T  P++L L   F      +P F      +    SH+  NT+ P  +     Y
Sbjct: 193 LMRLYDVTQKPRYLALVKYFIEERGAQPHFYDIEYEKRGKTSHW--NTYGPAWMVKDKAY 250

Query: 348 EVTGDPLYK---LIG-----TFFM----DIVNASHS-------------------YATGG 376
                PL +    IG      + M     +   SH                    Y TGG
Sbjct: 251 SQAHQPLAEQQTAIGHAVRFVYLMAGMAHLARLSHDEGKRQDCLRLWNNMAQRQLYITGG 310

Query: 377 ----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
               +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N
Sbjct: 311 IGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYN 367

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGT 478
            VL      +     Y+ PL          H     FN  +              CC   
Sbjct: 368 TVLG-GMALDGKHFFYVNPL--------EVHPKTLSFNHIYDHVKPVRQRWFGCACCPPN 418

Query: 479 GIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL 538
                + LG  IY   E     L+I  Y+ +      G   L  ++     W   +++ +
Sbjct: 419 IARVLTSLGHYIYTVRED---ALFINLYVGNDVAIPVGDRKLQLRISGNYPWHEQVKIDI 475

Query: 539 TFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
           T      V    +L LR+P W  +   + +LNG+ +       +L  T RW   D +T+ 
Sbjct: 476 T----SPVPVTHTLALRLPDWCAN--PEIALNGEVITGEVTRGYLYLTRRWQEGDAITLT 529

Query: 599 LPLSLR 604
           LP+ +R
Sbjct: 530 LPMPVR 535


>gi|297520697|ref|ZP_06939083.1| hypothetical protein EcolOP_23892 [Escherichia coli OP50]
          Length = 563

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 63/251 (25%), Positives = 94/251 (37%), Gaps = 39/251 (15%)

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 210 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 266

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 267 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 317

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 318 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 373

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 374 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 428

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 429 TLNLTLPMPVR 439


>gi|168232522|ref|ZP_02657580.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Kentucky str. CDC 191]
 gi|194471797|ref|ZP_03077781.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Kentucky str. CVM29188]
 gi|194458161|gb|EDX47000.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Kentucky str. CVM29188]
 gi|205333286|gb|EDZ20050.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Kentucky str. CDC 191]
          Length = 651

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|417353052|ref|ZP_12130092.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Gaminara str. A4-567]
 gi|353564767|gb|EHC30749.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Gaminara str. A4-567]
          Length = 651

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|298247044|ref|ZP_06970849.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
           44963]
 gi|297549703|gb|EFH83569.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
           44963]
          Length = 639

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/377 (23%), Positives = 147/377 (38%), Gaps = 59/377 (15%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLA-LQADYLSHFHANT------HIPI 339
            L +LY +T + ++L L+  F      +P +    A L+ D    F A T      H+PI
Sbjct: 199 ALVKLYRVTGEKRYLNLSQYFVDERGKQPHYFDEEAHLRGDDPRDFWAQTYEYNQSHVPI 258

Query: 340 VIGSQMRYEVTGDPLYKL-IGTFFMDIVNASHSYATGGTSAREFWWD--PKRLADTLG-- 394
               + + EV G  +  + + +   D+V   +  +   T  R  W     KRL  T G  
Sbjct: 259 ----REQREVVGHAVRAMYLYSAVADLVKERYDESLFQTGER-LWHHLVSKRLYITGGIG 313

Query: 395 --SENE---------------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
             ++NE               E+C +  ++  +  L +   +  YAD  ERAL NG+LS 
Sbjct: 314 STAKNEGFTEDYDLPNLTAYAESCASIGLVMWNHRLLQLDADSRYADLLERALYNGMLS- 372

Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
               +     Y+ PL       R   GW   F    CC      +   LG  +Y   + +
Sbjct: 373 GISLDGSKYFYVNPLESKGDHHRV--GW---FKCA-CCPPNIARTLMSLGQYVYTVSDTD 426

Query: 498 VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
           +   +   YI  + +   G   +  + +    WD  + + +      + G    LNLR+P
Sbjct: 427 I---FTHLYIQGTGELSVGGHNVKVEQETKYPWDGAISLKMELDEPADFG----LNLRIP 479

Query: 558 VWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
            W     AQ SLNG+ + L       ++    RW   D++ + L + +       D  E 
Sbjct: 480 GW--CQAAQLSLNGEAIALDDHLQKGYVRIERRWQSGDQIVLNLAMPVMRVYAHPDIREN 537

Query: 616 ASIQAILFGP--YLLAG 630
           +   A+  GP  Y L G
Sbjct: 538 SDRVALQRGPLVYCLEG 554


>gi|253575972|ref|ZP_04853305.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
 gi|251844547|gb|EES72562.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
          Length = 637

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 99/494 (20%), Positives = 191/494 (38%), Gaps = 62/494 (12%)

Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAP 228
           +L A A +++ T +A + +KM   +  +++ Q+    GY+S    +L       + ++  
Sbjct: 78  FLEACAHVYSITKDAALDQKMDKYIGFIAKAQDP--DGYIST-NIQLSHKKRWGQRIYHE 134

Query: 229 YYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGG 288
            Y    +L      +     +  L +A     Y  N +      + +   W   N    G
Sbjct: 135 DYNFGHLLTAACVHHTATGKSNFLDVAVKAANYL-NEIFNPCPKHLIHYGWNPSN--IMG 191

Query: 289 MNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF---------LALQADYLSHFHANTHIPI 339
           + D    LY IT +  +L LA +F      G+           L+ +  +  HA T + +
Sbjct: 192 LVD----LYRITGNETYLKLADIFMTMRGAGYGGEDQNQDRTPLREETEATGHAVTAVYL 247

Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK--RLADTLGSEN 397
             G+   Y  TG+           + +     Y TGG  +      P   ++ +  G++ 
Sbjct: 248 YAGAADVYSHTGEEAVMRALEKIWNNMYTKKMYLTGGIGSIYNGLSPNGDKIWEAFGTDY 307

Query: 398 E--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
                    ETC        +  +F  T+E  Y D +E+ + N +L      +     Y 
Sbjct: 308 HLPNRSAYTETCANIGNAMWAMRMFNLTQEPKYMDAFEKVVYNSLLG-SMTLDGHHFCYT 366

Query: 450 LPLGRGVSKARSTHGWGTKF--------NSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
            PL     K  + H   T+         ++ +CC    + + ++L    Y +      GL
Sbjct: 367 NPLETRGGKLFNHHSPQTQHFRTARWFTHTCYCCPPQVLRTIARLHQWAYGQSN---DGL 423

Query: 502 YIIQYISSSFD--WKSGHVV-LNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
           YI  Y  +  +    SG  + L  K D P          T++ +    +   +S++LR+P
Sbjct: 424 YIHLYSGNELNTTLSSGETLSLTMKSDFPA-------EETISITINNSLNTETSIHLRIP 476

Query: 558 VWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA----IQDDRP 613
            W  ++GA   +NG        G +     +W  ND++ + LP+ ++  A    +++DR 
Sbjct: 477 QW--ADGATVKVNGVQQGDVEAGTYHELKRKWQANDQIELLLPMRVKRIAANPMVEEDRG 534

Query: 614 EYASIQAILFGPYL 627
           +     A ++GP++
Sbjct: 535 QV----AFMYGPFV 544


>gi|375003535|ref|ZP_09727874.1| hypothetical protein SEENIN0B_03911 [Salmonella enterica subsp.
           enterica serovar Infantis str. SARB27]
 gi|353074450|gb|EHB40211.1| hypothetical protein SEENIN0B_03911 [Salmonella enterica subsp.
           enterica serovar Infantis str. SARB27]
          Length = 651

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRMWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSVEIPVENGALKLRIGGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLAL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|204928680|ref|ZP_03219879.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Javiana str. GA_MM04042433]
 gi|452122524|ref|YP_007472772.1| hypothetical protein CFSAN001992_15250 [Salmonella enterica subsp.
           enterica serovar Javiana str. CFSAN001992]
 gi|204322113|gb|EDZ07311.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Javiana str. GA_MM04042433]
 gi|451911528|gb|AGF83334.1| hypothetical protein CFSAN001992_15250 [Salmonella enterica subsp.
           enterica serovar Javiana str. CFSAN001992]
          Length = 651

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 80/365 (21%), Positives = 131/365 (35%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLR 604
           P+ +R
Sbjct: 531 PMPVR 535


>gi|417691895|ref|ZP_12341101.1| hypothetical protein SB521682_4174 [Shigella boydii 5216-82]
 gi|332085042|gb|EGI90222.1| hypothetical protein SB521682_4174 [Shigella boydii 5216-82]
          Length = 656

 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 82/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +   + Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHLFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +   T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYFHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + L + +R
Sbjct: 525 TLNLTLSMPVR 535


>gi|416822592|ref|ZP_11895028.1| hypothetical protein ECO5905_02787 [Escherichia coli O55:H7 str.
           USDA 5905]
 gi|425251470|ref|ZP_18644405.1| hypothetical protein EC5905_5091 [Escherichia coli 5905]
 gi|320661682|gb|EFX29097.1| hypothetical protein ECO5905_02787 [Escherichia coli O55:H7 str.
           USDA 5905]
 gi|408161718|gb|EKH89653.1| hypothetical protein EC5905_5091 [Escherichia coli 5905]
          Length = 656

 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 54/220 (24%), Positives = 82/220 (37%), Gaps = 32/220 (14%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + +G  +Y   E     LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y  +S +    +  L  +V     W    ++T+   S Q V    +L LR+P W     
Sbjct: 442 IYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQ 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            Q  LNG+ +       +L  T  W   D L + LP+ +R
Sbjct: 496 PQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535


>gi|418846200|ref|ZP_13400973.1| hypothetical protein SEEN443_06909 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19443]
 gi|418858162|ref|ZP_13412783.1| hypothetical protein SEEN470_08679 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19470]
 gi|418865229|ref|ZP_13419709.1| hypothetical protein SEEN536_04836 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19536]
 gi|418867555|ref|ZP_13422012.1| hypothetical protein SEEN176_24132 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 4176]
 gi|392811425|gb|EJA67435.1| hypothetical protein SEEN443_06909 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19443]
 gi|392828511|gb|EJA84203.1| hypothetical protein SEEN536_04836 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19536]
 gi|392834500|gb|EJA90106.1| hypothetical protein SEEN470_08679 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19470]
 gi|392839395|gb|EJA94937.1| hypothetical protein SEEN176_24132 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 4176]
          Length = 651

 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 61/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY         LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +    +  L  ++     W  + ++ +   S Q V    +L LR+P W     
Sbjct: 442 MYVGNSMEIPVENGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG ++       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 496 AKVTLNGLDVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 556 PLVYCLEQADNGE 568


>gi|378766201|ref|YP_005194662.1| hypothetical protein PANA5342_1232 [Pantoea ananatis LMG 5342]
 gi|365185675|emb|CCF08625.1| hypothetical protein PANA5342_1232 [Pantoea ananatis LMG 5342]
          Length = 651

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 81/362 (22%), Positives = 131/362 (36%), Gaps = 69/362 (19%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
           L RLY +T  P++L L + F      +P F      +    S++H             + 
Sbjct: 193 LMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 252

Query: 335 THIPIV-----IGSQMRYEVTGDPLYKLIGTFFM-----------DIVNASHS------Y 372
            H P+      +G  +R+      +Y + G   +           D +   H+      Y
Sbjct: 253 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 306

Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
            TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363

Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIES 482
           AL N VL      +     Y+ PL          H +         W    CC       
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIARL 422

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
            + LG  IY   +     L+I  Y+ +  D   G   L   +     W+    +T++  +
Sbjct: 423 LTSLGHYIYTPHQN---ALFINLYVGNRVDVPVGDRTLGIHISGNFPWEE--TVTISVDA 477

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
            Q V    +L LR+P W      Q S NG+ +       +L     W   D LT+ LP+ 
Sbjct: 478 TQPVKH--TLALRLPDW--CEAPQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPMP 533

Query: 603 LR 604
           +R
Sbjct: 534 VR 535


>gi|288925304|ref|ZP_06419239.1| cytoplasmic protein [Prevotella buccae D17]
 gi|288338069|gb|EFC76420.1| cytoplasmic protein [Prevotella buccae D17]
          Length = 813

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 95/403 (23%), Positives = 158/403 (39%), Gaps = 78/403 (19%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T   ++L +A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 224 ALCKLYKVTGSRRYLDMARYFVEETGRG---TDGHRLSEY-SQDHKPILRQQEIVGHAVR 279

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW--WDPKRLADTL 393
                        +TGD  Y        + +     + TGG  +R     + P    + +
Sbjct: 280 AGYLYSGVADVAALTGDTAYFHALERLWNNMAGKKLFITGGMGSRAQGEGFGPDYELNNM 339

Query: 394 GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------ 447
            +  +ETC +   +  +  +F  T E  Y D YERAL NGVLS       GV +      
Sbjct: 340 -TAYQETCASIANVFWNYRMFLATGEGKYVDVYERALYNGVLS-------GVSLSGDKFF 391

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  PL   + +    H +G       CC G      + +    Y     ++   Y+  YI
Sbjct: 392 YDNPL-ESMGQHERQHWFGCA-----CCPGNVTRFVASVPQYQYAVRGSDI---YVNLYI 442

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT------- 560
             + D  +G  +  Q   P   WD    +T+T   K+   +  +L  R+P W        
Sbjct: 443 QGTAD-VNGVRLAQQTRYP---WDG--DITVTVDPKRS--RRFALRFRIPGWAGACPVGT 494

Query: 561 ----YSNGAQ---ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA----IQ 609
               +++ ++     +NG+ +   P   ++    RW   D++ I LP+ +R  A    ++
Sbjct: 495 NLYHFADSSRPFTVKVNGREIAGEPVDGYMVIDRRWMRGDRVEISLPMEVRRVAANDNVE 554

Query: 610 DDRPEYASIQAILFGP--YLLAGHTSGEWDIKTGTARSLSALI 650
           DDR +Y    A+  GP  Y L G       +   + R L ALI
Sbjct: 555 DDRGKY----ALERGPIVYCLEGRDQAHSTVFDKSVR-LDALI 592


>gi|429121562|ref|ZP_19182182.1| COG3533 secreted protein [Cronobacter sakazakii 680]
 gi|426323943|emb|CCK12919.1| COG3533 secreted protein [Cronobacter sakazakii 680]
          Length = 655

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 67/291 (23%), Positives = 106/291 (36%), Gaps = 39/291 (13%)

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG----TSAREFWWDPK 387
           HA   + ++ G      ++GD   +       + +     Y TGG    +S   F  D  
Sbjct: 269 HAVRFVYLMTGVAHLARLSGDEEKRRACLRLWENMARRQLYITGGIGSQSSGEAFSTDYD 328

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
              DT+ +E   +C +  ++  +R +     +  YAD  ERAL N VL      +     
Sbjct: 329 LPNDTVYAE---SCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFF 384

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFE 493
           Y+ PL          H    KFN  +              CC        + LG  IY  
Sbjct: 385 YVNPL--------EVHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTA 436

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
            E     L+I  YI ++     G   L  ++     W   +R+ +      E     +L 
Sbjct: 437 RED---ALFINLYIGNNVQLPVGDSTLRLRISGDFPWHEEVRIHIDSPRPVE----HTLA 489

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
           LR+P W   +  +  LNG+         +L  T  W   D LT+ LP+ +R
Sbjct: 490 LRLPDW--CDAPRVMLNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 538


>gi|449310077|ref|YP_007442433.1| hypothetical protein CSSP291_17885 [Cronobacter sakazakii SP291]
 gi|449100110|gb|AGE88144.1| hypothetical protein CSSP291_17885 [Cronobacter sakazakii SP291]
          Length = 655

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 67/291 (23%), Positives = 106/291 (36%), Gaps = 39/291 (13%)

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG----TSAREFWWDPK 387
           HA   + ++ G      ++GD   +       + +     Y TGG    +S   F  D  
Sbjct: 269 HAVRFVYLMTGVAHLARLSGDEEKRRACLRLWENMARRQLYITGGIGSQSSGEAFSTDYD 328

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
              DT+ +E   +C +  ++  +R +     +  YAD  ERAL N VL      +     
Sbjct: 329 LPNDTVYAE---SCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFF 384

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFE 493
           Y+ PL          H    KFN  +              CC        + LG  IY  
Sbjct: 385 YVNPL--------EVHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTA 436

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
            E     L+I  YI ++     G   L  ++     W   +R+ +      E     +L 
Sbjct: 437 RED---ALFINLYIGNNVQLPVGDSTLRLRISGDFPWHEEVRIHIDSPRPVE----HTLA 489

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
           LR+P W   +  +  LNG+         +L  T  W   D LT+ LP+ +R
Sbjct: 490 LRLPDW--CDAPRVMLNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 538


>gi|389842783|ref|YP_006344867.1| hypothetical protein ES15_3783 [Cronobacter sakazakii ES15]
 gi|387853259|gb|AFK01357.1| hypothetical protein ES15_3783 [Cronobacter sakazakii ES15]
          Length = 655

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 67/291 (23%), Positives = 105/291 (36%), Gaps = 39/291 (13%)

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG----TSAREFWWDPK 387
           HA   + ++ G      ++GD   +       + +     Y TGG    +S   F  D  
Sbjct: 269 HAVRFVYLMTGVAHLARLSGDEEKRRACLRLWENMARRQLYITGGIGSQSSGEAFSTDYD 328

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
              DT+ +E   +C +  ++  +R +     +  YAD  ERAL N VL      +     
Sbjct: 329 LPNDTVYAE---SCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFF 384

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFE 493
           Y+ PL          H    KFN  +              CC        + LG  IY  
Sbjct: 385 YVNPL--------EVHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTA 436

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
            E     L+I  YI +      G   L  ++     W   +R+ +      E     +L 
Sbjct: 437 RED---ALFINLYIGNDVQLPVGDSTLRLRISGDFPWHEEVRIHIDSPRPVE----HTLA 489

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
           LR+P W   +  +  LNG+         +L  T  W   D LT+ LP+ +R
Sbjct: 490 LRLPDW--CDAPRVMLNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 538


>gi|110807746|ref|YP_691266.1| hypothetical protein SFV_3953 [Shigella flexneri 5 str. 8401]
 gi|418259896|ref|ZP_12882543.1| hypothetical protein SF660363_4453 [Shigella flexneri 6603-63]
 gi|424840119|ref|ZP_18264756.1| hypothetical protein SF5M90T_3865 [Shigella flexneri 5a str. M90T]
 gi|110617294|gb|ABF05961.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
 gi|383469171|gb|EID64192.1| hypothetical protein SF5M90T_3865 [Shigella flexneri 5a str. M90T]
 gi|397894067|gb|EJL10519.1| hypothetical protein SF660363_4453 [Shigella flexneri 6603-63]
          Length = 659

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +  +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SYASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|284034063|ref|YP_003383994.1| hypothetical protein Kfla_6192 [Kribbella flavida DSM 17836]
 gi|283813356|gb|ADB35195.1| protein of unknown function DUF1680 [Kribbella flavida DSM 17836]
          Length = 637

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 98/428 (22%), Positives = 166/428 (38%), Gaps = 57/428 (13%)

Query: 229 YYTIHKILAGLLDQYVLADNA---QALKMATWMVEYFYN----RVQKVITMYSVERHWYS 281
           Y   H I A +       D A    A+K+A  +V  F +    +++ V     +E     
Sbjct: 150 YCAGHLIQAAVAQIRCTGDRALLDVAIKLADHLVATFGDSGQGKIRDVDGHPVIEMALVE 209

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
           L  ETG    +    + +      ++  H      F   + ++       HA   + +  
Sbjct: 210 LYRETGTTAYLELARWFVEARGHGIIEGHGHHPAYFSDRVPVREATTVEGHAVRAVYLAA 269

Query: 342 GS-QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE-- 398
           G+  +  E   D L +++   F  + + + +Y TGG  +R   WD     +  G E E  
Sbjct: 270 GAADVALETGDDDLLRVLEGQFAHMWS-TKTYLTGGLGSR---WD----GEAFGDEYELP 321

Query: 399 ------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLP 451
                 ETC     ++ +  +   T    YAD  ER L NG L+ +  G +     Y+ P
Sbjct: 322 PDRAYAETCAAIGGVQWAWRMLLATGNAFYADAIERMLYNGFLAGVSLGGDE--YFYVNP 379

Query: 452 LG-RGV-------SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
           L  RG        S A    GW   F+   CC    + + S L   +    +G +    +
Sbjct: 380 LQLRGAAEPDGNRSPAHGRRGW---FDCA-CCPPNIMRTLSSLDGYLASTTDGAI---QL 432

Query: 504 IQYISSSF--DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
            QY   +   D  +G V L  +VD    W+  +++T+    +Q      +L LR+P W  
Sbjct: 433 HQYAEGAVAADLPAGTVEL--QVDTEYPWNGSIKVTV----QQTPDTPWALELRIPGWAE 486

Query: 562 SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
                A+LNG+ +     G +    + W+  D + +QLP++ RT A            A+
Sbjct: 487 G----ATLNGKPVDA---GRYARVEQTWATGDTVELQLPMATRTVAADPRIDAVRGCVAL 539

Query: 622 LFGPYLLA 629
             GP + A
Sbjct: 540 ERGPLVYA 547


>gi|420368547|ref|ZP_14869294.1| hypothetical protein SF123566_9752 [Shigella flexneri 1235-66]
 gi|391322141|gb|EIQ78842.1| hypothetical protein SF123566_9752 [Shigella flexneri 1235-66]
          Length = 659

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L + F      +P +      +    SH+H             +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
             H+P+      IG  +R+      +Y + G   +  ++   S                 
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +  +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SYASIGLMMFARRMLEMEGDSQYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H    KFN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  +Y   E     LYI  Y  +S +    +  L  +V     W   
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
            ++T+   S Q V    +L LR+P W      Q  LNG+ +       +L  T  W   D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524

Query: 594 KLTIQLPLSLR 604
            L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535


>gi|386820698|ref|ZP_10107914.1| hypothetical protein JoomaDRAFT_2662 [Joostella marina DSM 19592]
 gi|386425804|gb|EIJ39634.1| hypothetical protein JoomaDRAFT_2662 [Joostella marina DSM 19592]
          Length = 660

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 103/431 (23%), Positives = 174/431 (40%), Gaps = 94/431 (21%)

Query: 251 ALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAH 310
           ALK A  MVE F     K+   ++V  H      ETG     L RLY IT++ K+L LA 
Sbjct: 208 ALKNADLMVETFGPEDGKI---HTVPGHQII---ETG-----LIRLYRITNEKKYLELAK 256

Query: 311 LFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMRY-----------EVTGDPL 354
            F      GF   + D+  +  A  H+P+     V+G  +R             +  D  
Sbjct: 257 YFLDG--RGFHEGRMDFGPY--AQDHVPVIKQDEVVGHAVRAVYMYAAMTDIAAIENDTA 312

Query: 355 Y-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADTLGSENEETCTTYNMLKVS 410
           Y K +   + ++VN    Y TGG  AR   E + +   L + L + NE TC     +  +
Sbjct: 313 YHKAVDNLWENMVN-KKMYLTGGIGARHEGEAFGENYELPN-LTAYNE-TCAAIGDVYWN 369

Query: 411 RHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG---RGVSKARSTHGWGT 467
             L   T  + Y D  ER L NG++S   G       +  P      GV K     G  T
Sbjct: 370 HRLHNMTGNVKYFDVIERTLYNGLIS---GLSLNGTQFFYPNALESDGVYKF--NQGACT 424

Query: 468 KFNSFWC-CYGTGIESF---------SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
           + + F C C  T +  F         SK  D+++         LY      ++   +   
Sbjct: 425 RKDWFDCSCCPTNVIRFIPSLPGLIYSKTSDTVFV-------NLYAAN--QATIGLEETA 475

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQ 566
           + + Q+      W+  +++T+T     E     ++ LR+P W           +Y    +
Sbjct: 476 IAITQETS--YPWNGSVKLTVT----PETASDFTIKLRIPGWARNEVLPGTLYSYKEKIK 529

Query: 567 A----SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR----TEAIQDDRPEYASI 618
           A     +NG+ +       +++ T  W   + +++++P+ +R     E +++DR +    
Sbjct: 530 AVPEVKVNGELVEATIDNGYITLTRNWKKGETISLEIPMKVREVLANEKVEEDRGKI--- 586

Query: 619 QAILFGPYLLA 629
            A+ +GP + A
Sbjct: 587 -ALEYGPIVYA 596


>gi|427384245|ref|ZP_18880750.1| hypothetical protein HMPREF9447_01783 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727506|gb|EKU90365.1| hypothetical protein HMPREF9447_01783 [Bacteroides oleiciplenus YIT
           12058]
          Length = 811

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 91/388 (23%), Positives = 150/388 (38%), Gaps = 87/388 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L +A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 220 ALAKLYKVTGDEKYLKMAKYFVEETGRG---TDGHRLSEY-SQDHKPILQQDEIVGHAVR 275

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    +   + + +   + TGG  +R     P+   +  G 
Sbjct: 276 AGYLYSGVADVAALTQDTAYFNALSRIWENMASKKLFITGGIGSR-----PQ--GEGFGP 328

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     +  +  +F  T    YAD  ERAL NGV+S       GV +
Sbjct: 329 NYELNNHTAYCETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVIS-------GVSL 381

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  +  F        +  +GN   +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-VTRFMASVPYYMYATQGN--DI 432

Query: 502 YIIQYISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           Y+  YI S  D    S +V L Q  +    W+  + + +T   +QE     +L  R+P W
Sbjct: 433 YVNLYIQSKADLNTDSNNVALEQTTE--YPWEGKVSILVTPEKEQEF----ALRFRIPGW 486

Query: 560 -----------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR- 604
                      ++++ A A   S+NG+ +       + + +  W   D + I LP+ +R 
Sbjct: 487 AQDAPVPTDLYSFTDKAGAYSISVNGKKVNAKQYDGYATISRTWKAGDVVEISLPMDVRR 546

Query: 605 ---TEAIQDDRPEYASIQAILFGPYLLA 629
               + ++DDR +     AI  GP +  
Sbjct: 547 IKANDNVEDDRGKL----AIERGPIMFC 570


>gi|435854425|ref|YP_007315744.1| hypothetical protein Halha_1714 [Halobacteroides halobius DSM 5150]
 gi|433670836|gb|AGB41651.1| hypothetical protein Halha_1714 [Halobacteroides halobius DSM 5150]
          Length = 647

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 91/215 (42%), Gaps = 20/215 (9%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVS 457
           ETC    ++  +  +     +  YAD  ERAL NGVLS + +  E    +  L +     
Sbjct: 332 ETCAAIGLMFWAHRMLHLDLDSQYADVMERALYNGVLSGMSQDGEKFFYVNPLEVWPEAC 391

Query: 458 KARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS--SF 511
           + R            W    CC        + +G+ IY  +E      YI  Y +S   F
Sbjct: 392 EERKDKEHVKPTRQKWFGCACCPPNIARLLASIGEYIYSTDE---QAAYIHLYTASVTEF 448

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           +     V L+Q+ D    WD    +T+T + ++EV    +L LR+P W  S  A+  +NG
Sbjct: 449 EIDGTSVELDQETD--YPWDE--NITITVNPREEVE--FTLALRIPDWCES--AELKVNG 500

Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLR 604
           + L L       ++     WS  D++ + L + ++
Sbjct: 501 RTLELDSIIDNGYVEVNRSWSKGDQIELVLAMPVK 535


>gi|408673627|ref|YP_006873375.1| protein of unknown function DUF1680 [Emticicia oligotrophica DSM
           17448]
 gi|387855251|gb|AFK03348.1| protein of unknown function DUF1680 [Emticicia oligotrophica DSM
           17448]
          Length = 652

 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 107/501 (21%), Positives = 194/501 (38%), Gaps = 77/501 (15%)

Query: 182 NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAG-LL 240
           +A +++K    +  ++  Q  +  GYL+ + T        L+  W          AG L+
Sbjct: 106 DAELEKKTDEWIDKIAAAQ--LPDGYLNTYYT-----LNGLQNRWTDMEKHEDYCAGHLI 158

Query: 241 DQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSIT 300
           +  V   N    +    +   F N + +   +    R W S ++E   +   L +LY  T
Sbjct: 159 EAAVAYYNTTGKRKLLDVAIRFANHIDETFRL--ANRPWVSGHQE---IELALVKLYRTT 213

Query: 301 HDPKHLLLAHLF--DKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMRYEV---- 349
            D ++L L+  F   +    G   +  D+    +    IP+     + G  +R       
Sbjct: 214 KDERYLKLSEWFLNQRGRGNGKGVIWDDWKDPAYCQDAIPVKDQKEITGHAVRAMYLYTG 273

Query: 350 -------TGDPLY-KLIGTFFMDIVNASHSYATGGT----SAREFWWDPKRLADTLGSEN 397
                  TGD  Y   + T + D+V+  + Y TGG     S   F  D       L +EN
Sbjct: 274 AADVAVNTGDTGYMNAMKTVWEDVVH-RNMYITGGIGSSGSNEGFSQDFD-----LPNEN 327

Query: 398 E--ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
              ETC +  M+  ++ +   T E  Y D  ER+L NG L            Y  PL   
Sbjct: 328 AYCETCASVGMVFWNQRMNALTGESKYIDVLERSLYNGALD-GLSLSGDRFFYGNPLASI 386

Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
              AR    +GT      CC        + LGD IY + E    G+++  ++ S+ + K 
Sbjct: 387 GRHARR-EWFGTA-----CCPSNIARLVASLGDYIYGKSEN---GIWVNLFVGSNTNIKL 437

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT--------------- 560
           G+  +   ++     +  +++++  S+K +     +L++R+P WT               
Sbjct: 438 GNTEILTSIETNYPLNGKVKISMNPSTKTKY----TLHVRIPSWTTNEPVAGNLYHYLGN 493

Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
           Y+      +NG+ +       +      WS  D ++ +LP+ +R    +++  +     A
Sbjct: 494 YAANIAMMVNGRKIDYKIENGYAIIDREWSAGDIVSFELPMDVRKIVARNELKQDNDRMA 553

Query: 621 ILFGP--YLLAG--HTSGEWD 637
           +  GP  Y + G  +    WD
Sbjct: 554 LQRGPLVYCVEGIDNEGKAWD 574


>gi|374374966|ref|ZP_09632624.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
 gi|373231806|gb|EHP51601.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
          Length = 629

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 66/286 (23%), Positives = 108/286 (37%), Gaps = 47/286 (16%)

Query: 361 FFMDIVNASHS------YATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLF 414
           +   IVN + S      + TG  S+ E W +  ++  T    + ETC T   +K+   L 
Sbjct: 285 YLEAIVNTAESIRKDEIFVTGSGSSMESWINGAKIQATPLRHSNETCVTATWMKLCLQLL 344

Query: 415 RWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG---------RGVSKARSTHGW 465
           R T +  +A+  ER   N +L             M+P G         RGV K    +  
Sbjct: 345 RTTGDAKWANEIERTFYNALLGA-----------MMPDGHTWNKYTDLRGV-KYLGENQC 392

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--VVLNQK 523
           G   N   CC   G      L    +     N  G+ +  Y ++S     G   V LN  
Sbjct: 393 GMDIN---CCIANGPRGLMVLPKEAFMI---NAAGIAVNFYGTASATLSVGQNKVTLNT- 445

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
              +  +     +T+  +  + +    +L LR+P W  S     S+NG  +    PG + 
Sbjct: 446 ---VTEYPKNGAVTIIVNPGKPLD--FNLQLRIPEW--SAHTNISINGVAVDNAVPGKYT 498

Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           +    W   D + +Q  + +R   +  D   Y     + +GP +LA
Sbjct: 499 AIKRTWKQGDIVKLQFQMDVRQYFVPGDSTRY----CLQYGPLVLA 540


>gi|284172576|ref|YP_003405958.1| protein of unknown function DUF1680 [Haloterrigena turkmenica DSM
           5511]
 gi|284017336|gb|ADB63285.1| protein of unknown function DUF1680 [Haloterrigena turkmenica DSM
           5511]
          Length = 636

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 118/544 (21%), Positives = 198/544 (36%), Gaps = 116/544 (21%)

Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKP--VW 226
           ++ A++ + A   +  ++ K+  V+  +++ Q     GYL+ +       F  ++P   W
Sbjct: 75  WIEAASYVLAQRDDPELEAKVDGVISLIADAQQP--DGYLNTY-------FSLVEPENRW 125

Query: 227 APYYTIHKI-LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
              + +H++  AG L +  +A + +A +  T ++E   +    V  ++  E      +EE
Sbjct: 126 TNLHMMHELYCAGHLIEAAVA-HYRATEKET-LLEVAVDFADLVDDVFGDEVEGVPGHEE 183

Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLF--------------DKPCFLG--------FLAL 323
              +   L +LY +T + ++L LA  F              D P  LG         +  
Sbjct: 184 ---IELALLKLYRVTDETRYLELAKYFIDLRGKDDRLAWEIDNPETLGGGEYEDGSIIPA 240

Query: 324 QADYLSH-------FHANTHIPI-----VIGSQMR------------YEVTGDPLYKLIG 359
             D  +H        +A  H P+     V G  +R             E   D L + + 
Sbjct: 241 ARDVFTHEDGTYDGRYAQAHEPLRDQETVEGHSVRAMYLFAAATDLAIETGEDELIESLE 300

Query: 360 TFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
             + ++      Y TGG    E         D       ETC     +  ++ LF  + E
Sbjct: 301 RLWTNMTT-KRMYVTGGLGPEEAHEGFTTDYDLRNDAYAETCAAIGSVYWNQRLFELSGE 359

Query: 420 IAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
             YAD  ER L NG L+     GTE     Y  PL       R   GW T      CC  
Sbjct: 360 AKYADLIERTLYNGFLAGVSLDGTE---FFYENPLESDGDHHRK--GWFTCA----CCPP 410

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
                 + LG+ +Y + +     +Y+ QY+ SS         +    D  + W   + + 
Sbjct: 411 NAARLLASLGEYVYSQRDS---AIYVNQYLGSSVTTAVDGATVELSQDSSLPWSGEVTVD 467

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTI 597
           +        G    L LR+P W  S  +  ++NG+++  P  G +L     W  +D++ +
Sbjct: 468 VDAD-----GASVPLRLRIPEWAES--STVTVNGESVETPSEG-YLEIERVWD-DDRIEL 518

Query: 598 QL-------------------------PLSLRTEAIQDDRP----EYASIQAILFGPYLL 628
                                      PL    EAI +DRP    E  S  +    P LL
Sbjct: 519 TFEQTVTRLEAHPDVAADAGRVALKRGPLVYCLEAIDNDRPLHQYEDPSPTSTTHRPDLL 578

Query: 629 AGHT 632
            G T
Sbjct: 579 EGVT 582


>gi|146295756|ref|YP_001179527.1| hypothetical protein [Caldicellulosiruptor saccharolyticus DSM
           8903]
 gi|145409332|gb|ABP66336.1| protein of unknown function DUF1680 [Caldicellulosiruptor
           saccharolyticus DSM 8903]
          Length = 653

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 88/363 (24%), Positives = 135/363 (37%), Gaps = 74/363 (20%)

Query: 293 LYRLYSITHDPKHLLLAHLF-------------------DKPCFLGFLALQADYLSHFHA 333
           L +LY +T++ K+L LA  F                    K  + GF  L  +YL     
Sbjct: 200 LVKLYEVTNNSKYLELAKFFIDERGQEPYYFDIEWEKRGKKEHWKGFKGLGKEYLQ---- 255

Query: 334 NTHIPI-----VIGSQMR------------YEVTGDPLYKLIGTFFMDIVNASHSYATG- 375
             H P+      +G  +R            Y      LY++    F DI N    Y TG 
Sbjct: 256 -AHKPVREQREAVGHAVRAVYLYSGMADVAYYTKDKELYEVCEALFNDIRNRK-MYITGA 313

Query: 376 -GTSAR----EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERAL 430
            G+SA      F +D    A        ETC +  ++  +  + R      Y D  ERAL
Sbjct: 314 IGSSAHGEAFTFEYDLPNAAAYA-----ETCASVGLVFFAHRMNRIKPHRKYYDVVERAL 368

Query: 431 TNGVLSI--QRGTEPGVMIYMLPLG---RGVSKARSTHGWGTKFNSFW---CCYGTGIES 482
            N ++    Q G +     Y+ PL    + V K    H    +   ++   CC       
Sbjct: 369 YNTIIGAMSQDGKK---YFYVNPLEVFPKEVEKRFDRHHVKPERQPWFGCACCPPNVARL 425

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
            + +G  IY     N   +Y+  YI S    +S  ++ NQKV  I          + F  
Sbjct: 426 LASIGKYIYLY---NNNEIYVNLYIGS----ESEFLINNQKVKIIQDSGYPFNDEVNFKI 478

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPL 601
                   +LNLR+P W   +  +  +NG+ L        ++S T  W  +D++ I LP 
Sbjct: 479 ITNGEMYFTLNLRIPSWC--DKFEIKINGELLTGFSLKDGYVSITRGWKSDDRIEIILPT 536

Query: 602 SLR 604
            L+
Sbjct: 537 QLK 539


>gi|417344582|ref|ZP_12124897.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Baildon str. R6-199]
 gi|417542477|ref|ZP_12193911.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Wandsworth str. A4-580]
 gi|353658599|gb|EHC98734.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Wandsworth str. A4-580]
 gi|357953998|gb|EHJ80341.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Baildon str. R6-199]
          Length = 651

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 61/253 (24%), Positives = 93/253 (36%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY         LYI 
Sbjct: 387 --EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RAHALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +    +  L  ++     W  + ++ +   S Q V    +L LR+P W     
Sbjct: 442 MYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 556 PLVYCLEQADNGE 568


>gi|417109929|ref|ZP_11963472.1| hypothetical protein RHECNPAF_800032 [Rhizobium etli CNPAF512]
 gi|327188729|gb|EGE55928.1| hypothetical protein RHECNPAF_800032 [Rhizobium etli CNPAF512]
          Length = 640

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 93/429 (21%), Positives = 167/429 (38%), Gaps = 73/429 (17%)

Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
            L +L  +T + K+L L+  F      +   F    A      + FH  T      H P+
Sbjct: 198 ALVKLARVTGEKKYLDLSKYFIDERGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 258 REQTKVVGHAVRAMYLYSGMADVATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +     +  YAD  E+AL NG L    
Sbjct: 317 NEGFTDYYDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
            T+     Y  PL      A   H W  K++   CC        + +G  +Y   +  + 
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAIADDEI- 426

Query: 500 GLYIIQYISSSFDWKSGHVV-LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            +++    ++     +G  V L Q  +    W+      + F+++ E     +L+LR+P 
Sbjct: 427 AVHLYGESTTRLKLANGAAVELQQATN--YPWE----GAVAFTTRLEKPAKFALSLRIPD 480

Query: 559 WTYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
           W  ++GA  S+NG+ L L       +     +W   D++ + LPLSLR +       + A
Sbjct: 481 W--ADGATLSVNGEKLDLGAATRDGYARIDRQWVDGDRVDLFLPLSLRPQYANPKVRQDA 538

Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMS 676
              A++ GP +    T       T   + L+A++ P             + S   T V++
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGQDLNAIVLP------------RELSAAETVVLN 579

Query: 677 NSNQSITME 685
           + N ++ ++
Sbjct: 580 DLNDAVALD 588


>gi|224585478|ref|YP_002639277.1| hypothetical protein SPC_3759 [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|224470006|gb|ACN47836.1| hypothetical protein SPC_3759 [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
          Length = 651

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 61/253 (24%), Positives = 93/253 (36%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY         LYI 
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +    +  L  ++     W  + ++ +   S Q V    +L LR+P W     
Sbjct: 442 MYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 556 PLVYCLEQADNGE 568


>gi|261409833|ref|YP_003246074.1| hypothetical protein GYMC10_6062 [Paenibacillus sp. Y412MC10]
 gi|261286296|gb|ACX68267.1| protein of unknown function DUF1680 [Paenibacillus sp. Y412MC10]
          Length = 658

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 108/494 (21%), Positives = 194/494 (39%), Gaps = 79/494 (15%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFEALK 223
           V  +L A+A   A+  +  ++E++  ++  +++ Q     GYL+ + T  E    +  L 
Sbjct: 79  VAKWLEAAAYSLATHRDPKLEEQVDELIDLVADAQQP--DGYLNTYFTVKEPEKRWTNLT 136

Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
                Y   H I AG+   Y      + L +   + ++    +  V      + H +  +
Sbjct: 137 DCHELYCAGHMIEAGVA-HYRATGKRKLLDVVCRLADH----IDTVFGPEDGKIHGFDGH 191

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFL------ALQADYLSHFH 332
           +E   +   L +LY +T +P++L L+  F      +P F  FL        ++ Y S  H
Sbjct: 192 QE---IELALVKLYEVTQEPRYLSLSQYFIDERGTEPHF--FLQEWEQRGKKSFYRSVLH 246

Query: 333 A------NTHIPI-----VIGSQMRY-----------EVTGDP-LYKLIGTFFMDIVNAS 369
           A       +H+P+      +G  +R              T DP L +   T + ++V+  
Sbjct: 247 APHLAYHQSHLPVREQKEAVGHSVRAVYMYTAMADLAARTKDPALLEACDTLWRNMVH-K 305

Query: 370 HSYATGGTSA----REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
             Y TGG  +      F  D     DT+ SE   TC +  ++  ++ + + + +  YAD 
Sbjct: 306 QMYITGGIGSTHHGEAFTTDYDLPNDTVYSE---TCASIGLIFFAQRMLQLSPKSEYADV 362

Query: 426 YERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYG 477
            ERAL N V+    Q G       Y+ PL    +  R   G          W    CC  
Sbjct: 363 MERALFNTVIGSMAQDGRH---FFYVNPLEVWPAACRHNPGKAHVKPVRPGWFACACCPP 419

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
                 S LG+ +Y   +     LY   YI    + + G V +    +  + WD      
Sbjct: 420 NVARLLSSLGEYVYTMNDDT---LYAHLYIGGEAEVRFGDVPVKVMQNSTLPWD----GD 472

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKL 595
           +TF+ + E     ++ LR+P W+    A   +NGQ + +       +      W+  D  
Sbjct: 473 VTFTLQPEQAVEWTVALRIPDWSRGK-AGLRVNGQEMNVEDITQDGYACVKRVWAPGD-- 529

Query: 596 TIQLPLSLRTEAIQ 609
           T++L  S+    ++
Sbjct: 530 TVELAFSMEIHQVR 543


>gi|424916536|ref|ZP_18339900.1| hypothetical protein Rleg9DRAFT_4102 [Rhizobium leguminosarum bv.
           trifolii WSM597]
 gi|392852712|gb|EJB05233.1| hypothetical protein Rleg9DRAFT_4102 [Rhizobium leguminosarum bv.
           trifolii WSM597]
          Length = 640

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 90/396 (22%), Positives = 155/396 (39%), Gaps = 61/396 (15%)

Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
            L +L  +T + K+L L+  F      +   F    A      + FH  T      H P+
Sbjct: 198 ALVKLARVTGEKKYLELSKFFIDARGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 258 REQKKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +     +  YAD  E+AL NG L    
Sbjct: 317 NEGFTDYYDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
            T+     Y  PL   V K    H W  K++   CC        + +G  +Y   +  + 
Sbjct: 374 STDGKTFFYDNPL-ESVGK---HHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426

Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            +++    ++     +G  V L Q  +    WD      + F+++ +     +L+LR+P 
Sbjct: 427 AVHLYGESTARLKLANGADVELEQTTN--YPWD----GAVAFTTRLKTPAKFALSLRIPD 480

Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
           W  + GA  S+NG+ L L       +     +W+  D++ + LPLSLR +       + A
Sbjct: 481 W--AEGATLSVNGEMLDLAANIRDGYARIDRQWADGDRVALSLPLSLRPQYANPKVRQDA 538

Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
              A++ GP +    T       T     L+A++ P
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGEDLNAIVLP 567


>gi|383111125|ref|ZP_09931943.1| hypothetical protein BSGG_2229 [Bacteroides sp. D2]
 gi|313694694|gb|EFS31529.1| hypothetical protein BSGG_2229 [Bacteroides sp. D2]
          Length = 621

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 88/434 (20%), Positives = 160/434 (36%), Gaps = 65/434 (14%)

Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
           +W   YT       LL  Y L  + +AL     ++ +   ++Q  I   ++    Y L  
Sbjct: 126 IWGRKYT----SLSLLSYYRLTGDKKALNAVERLINHLMEQLQ--IHNINIAATGYYLGM 179

Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT--HIPIVIG 342
            +  + + +  LY IT +P++L  A            +++ +  S     T  +IP+   
Sbjct: 180 ASCSILEPVVYLYDITRNPRYLSFAKSI-------VSSIEREGSSQLITKTLKNIPVSER 232

Query: 343 S------------QMRYE-------------VTGDPLYKLIGTFFMDIVNASHSYATGGT 377
           S            Q  YE             +  DP Y  I    ++ +        G  
Sbjct: 233 SAFPKSWWSFENGQKAYEMMSCYEGLIELGTIVNDPFYIRIAEKAVNNIQEDEINIAGSG 292

Query: 378 SAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
           +A E W+  K           ETC T+  +++   L   T    YA+ +E  + N +++ 
Sbjct: 293 AAFECWYKGKEKQTLPTYHTMETCVTFTYMQLCHRLLCKTGNSFYAEEFEHTMYNALMAT 352

Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
            +     +  Y    GR   +       G   N   CC   G   F+ +  +    ++ +
Sbjct: 353 MKNDGSQISKYSPLEGR---RQPGEEQCGMHIN---CCNANGPRGFALIPKTACTIKDNH 406

Query: 498 V-PGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
           +   LY+    + S + K   V LN + D PI      + + +    K++     +L LR
Sbjct: 407 IYLNLYLPLQATISLN-KKNKVHLNVESDYPI---HGKVNVNIGVQKKEKF----TLALR 458

Query: 556 MPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
           +P  T     +A +NG+   +   G +L     W   DK+T+   +  +   + +     
Sbjct: 459 IP--TQIEKMKAYINGEEQEITHKGGYLYIERIWENADKVTLDFKIETKVVKLNNS---- 512

Query: 616 ASIQAILFGPYLLA 629
              QAI+ GP L A
Sbjct: 513 ---QAIVRGPLLFA 523


>gi|325298731|ref|YP_004258648.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
 gi|324318284|gb|ADY36175.1| protein of unknown function DUF1680 [Bacteroides salanitronis DSM
           18170]
          Length = 666

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 101/455 (22%), Positives = 171/455 (37%), Gaps = 91/455 (20%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L +LY +T D K+L  A  F DK    G+ + +  Y     +  H P+V     +G  +R
Sbjct: 219 LAKLYLVTGDKKYLDEAKFFLDK---RGYTSRKDAY-----SQAHKPVVQQDEAVGHAVR 270

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +TGD  Y        D +     Y TGG  A           +  G+
Sbjct: 271 ATYMYSGMADVAALTGDTAYVHAIDRIWDNIVGKKLYLTGGIGATAH-------GEAFGA 323

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     + V+  LF +  +  Y D  ER+L NGVLS     + G   
Sbjct: 324 NYELPNATAYCETCAAIGNVYVNHRLFLFHGDAKYYDVLERSLYNGVLS-GISLDGGRFF 382

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  PL        S  G+  K      C  + +  F        +   G+   LY+  ++
Sbjct: 383 YPNPL-------ESAGGYERKAWFGCACCPSNLCRFLPSVPGYMYATRGD--SLYVNLFM 433

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT------- 560
             + + + G   ++ +      +D  +R+TL   S + V       +R+P WT       
Sbjct: 434 EGTSEIQVGKRKISIRQQTAYPFDGNIRLTLQKGSGEFV-----WKVRVPGWTRGEVVPG 488

Query: 561 ----YSNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR----TEAI 608
               +++G Q S    +NG+ +       + S + RW   D + +   ++ R     E +
Sbjct: 489 GLYRFADGKQTSYSVKVNGEKVEGSIEKGYFSISRRWKKGDVVEVSFDMTPRLVLADEKV 548

Query: 609 QDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQES 668
           + DR     + AI  GP +       EW    G    L +++ P  P    +L    +++
Sbjct: 549 EADR----GMLAIERGPLVYC----AEWCDNQGI--DLFSVLLPRKP----KLEVMDEKA 594

Query: 669 GNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLI 703
                ++S   Q+++   + V G   A  A  +LI
Sbjct: 595 PGGAQMISAGVQTLS---YDVEGKLHASDAVLKLI 626


>gi|429117671|ref|ZP_19178589.1| COG3533 secreted protein [Cronobacter sakazakii 701]
 gi|426320800|emb|CCK04702.1| COG3533 secreted protein [Cronobacter sakazakii 701]
          Length = 372

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 53/220 (24%), Positives = 81/220 (36%), Gaps = 32/220 (14%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 54  ESCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 106

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + LG  IY   E     L+I 
Sbjct: 107 --EVHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTARED---ALFIN 161

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            YI ++     G   L  ++     W   +R+ +      E     +L LR+P W   + 
Sbjct: 162 LYIGNNVQLPVGDSTLRLRISGDFPWHEEVRIHIDSPRPVE----HTLALRLPDW--CDA 215

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            +  LNG+         +L  T  W   D LT+ LP+ +R
Sbjct: 216 PRVMLNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 255


>gi|336417454|ref|ZP_08597777.1| hypothetical protein HMPREF1017_04885 [Bacteroides ovatus
           3_8_47FAA]
 gi|335935949|gb|EGM97896.1| hypothetical protein HMPREF1017_04885 [Bacteroides ovatus
           3_8_47FAA]
          Length = 621

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 88/434 (20%), Positives = 160/434 (36%), Gaps = 65/434 (14%)

Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
           +W   YT       LL  Y L  + +AL     ++ +   ++Q  I   ++    Y L  
Sbjct: 126 IWGRKYT----SLSLLSYYRLTGDKKALNAVERLINHLMEQLQ--IHNINIAATGYYLGM 179

Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT--HIPIVIG 342
            +  + + +  LY IT +P++L  A            +++ +  S     T  +IP+   
Sbjct: 180 ASCSILEPVVYLYDITRNPRYLSFAKSI-------VSSIEREGSSQLITKTLRNIPVSER 232

Query: 343 S------------QMRYE-------------VTGDPLYKLIGTFFMDIVNASHSYATGGT 377
           S            Q  YE             +  DP Y  I    ++ +        G  
Sbjct: 233 SAFPKSWWSFENGQKAYEMMSCYEGLIELGTIVNDPFYIRIAEKAVNNIQEDEINIAGSG 292

Query: 378 SAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
           +A E W+  K           ETC T+  +++   L   T    YA+ +E  + N +++ 
Sbjct: 293 AAFECWYKGKEKQTLPTYHTMETCVTFTYMQLCHRLLCKTGNSFYAEEFEHTMYNALMAT 352

Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
            +     +  Y    GR   +       G   N   CC   G   F+ +  +    ++ +
Sbjct: 353 MKNDGSQISKYSPLEGR---RQPGEEQCGMHIN---CCNANGPRGFALIPKTACTIKDNH 406

Query: 498 V-PGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
           +   LY+    + S + K   V LN + D PI      + + +    K++     +L LR
Sbjct: 407 IYLNLYLPLQATISLN-KKNKVHLNVESDYPI---HGKVNVNIGVQKKEKF----TLALR 458

Query: 556 MPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
           +P  T     +A +NG+   +   G +L     W   DK+T+   +  +   + +     
Sbjct: 459 IP--TQIEKMKAYINGEEQEITHKGGYLYIERIWENADKVTLDFKIETKVVKLNNS---- 512

Query: 616 ASIQAILFGPYLLA 629
              QAI+ GP L A
Sbjct: 513 ---QAIVRGPLLFA 523


>gi|296100552|ref|YP_003610698.1| hypothetical protein ECL_00181 [Enterobacter cloacae subsp. cloacae
           ATCC 13047]
 gi|295055011|gb|ADF59749.1| hypothetical protein ECL_00181 [Enterobacter cloacae subsp. cloacae
           ATCC 13047]
          Length = 651

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 79/365 (21%), Positives = 126/365 (34%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L   F      +P F      +    S++H             +
Sbjct: 192 ALMRLYDVTEEPRYLNLVKYFIEARGTQPHFYDIEYEKRGRTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H P+      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHQPLAEQQTAIGHAVRFVYLMAGMAHLARLSKDDAKRQDCLRLWSNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSRYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H     FN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRTLAFNHIYDHVKPVRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         L+I  Y+ +      G   L  ++     W   + + + 
Sbjct: 420 ARVLTSLGHYIYTVRPD---ALFINLYVGNEVTIPVGDETLKLRISGNYPWQEEVNIEIA 476

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
                 V    +L LR+P W  +     SLNG+ +       +L  T RW   D LT+ L
Sbjct: 477 ----SPVPVTHTLALRLPDWCAN--PHVSLNGEGMTGEVSRGYLHLTRRWQEGDTLTLTL 530

Query: 600 PLSLR 604
           P+ +R
Sbjct: 531 PMPVR 535


>gi|372209243|ref|ZP_09497045.1| hypothetical protein FbacS_03931 [Flavobacteriaceae bacterium S85]
          Length = 671

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 53/214 (24%), Positives = 92/214 (42%), Gaps = 20/214 (9%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC        S  +     E  YAD  E  L N  LS     E     Y  PL   VS 
Sbjct: 354 ETCANVCNSMFSYRMLGLHGEAKYADVMELVLFNSALS-GISIEGKDYFYANPLR--VSH 410

Query: 459 ARSTHGWGTKFN------SFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSF 511
                G  T+F+        +CC    + + +KL    Y     G    LY    ++++ 
Sbjct: 411 KGHDPGNDTEFDMRRPYIPCFCCPPNLVRTIAKLSGWAYSLTTNGVAVNLYGGNKLTTTL 470

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
              S   ++ Q   P   W+  + + +  + K+       + +R+P W  + G+Q  +NG
Sbjct: 471 LDGSKLELVQQSGYP---WNGKVTLIIKKAKKEAF----DIKIRVPEW--AKGSQIQING 521

Query: 572 QNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLR 604
           + + LP   G++++  ++WS NDK+T+Q+P+ ++
Sbjct: 522 KAVSLPVKAGSYVTLHQKWSKNDKITLQMPMEIK 555


>gi|189464183|ref|ZP_03012968.1| hypothetical protein BACINT_00520 [Bacteroides intestinalis DSM
           17393]
 gi|189437973|gb|EDV06958.1| hypothetical protein BACINT_00520 [Bacteroides intestinalis DSM
           17393]
          Length = 812

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 88/384 (22%), Positives = 145/384 (37%), Gaps = 79/384 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L +A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 221 ALAKLYKVTGDGKYLKMAKYFVEETGRG---TDGHRLSEY-SQDHKPILQQDEIVGHAVR 276

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    +   + + +   Y  GG  +R     P+   +  G 
Sbjct: 277 AGYLYSGVADVAALTQDTAYFNALSRIWENMVSKKLYIIGGIGSR-----PQ--GEGFGP 329

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     +  +  +F  T    YAD  ERAL NGV+S       GV +
Sbjct: 330 NYELNNHTNYCETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVIS-------GVSL 382

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +    H +G       CC G  +  F        +  +GN   +
Sbjct: 383 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-VTRFMASVPYYMYATQGN--DI 433

Query: 502 YIIQYISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           Y+  YI S  D    S ++ L Q  +    W+  + + +T   +QE     +L  R+P W
Sbjct: 434 YVNLYIQSKADLNTDSNNIALEQTTE--YPWEGKVSILVTPEKEQEF----ALRFRIPGW 487

Query: 560 -----------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
                      ++++ A A   S+NG+ +       + + +  W   D + I LP+ +R 
Sbjct: 488 AQDAPVPTDLYSFTDKAGAYSISVNGKKVNAKQYDGYATISRTWKVGDVVEINLPMDVRR 547

Query: 606 EAIQDDRPEYASIQAILFGPYLLA 629
               D+  +     AI  GP +  
Sbjct: 548 IKANDNVEDDCGKLAIERGPIMFC 571


>gi|194444786|ref|YP_002042927.1| hypothetical protein SNSL254_A3957 [Salmonella enterica subsp.
           enterica serovar Newport str. SL254]
 gi|418790980|ref|ZP_13346748.1| hypothetical protein SEEN447_08030 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19447]
 gi|418795399|ref|ZP_13351104.1| hypothetical protein SEEN449_11603 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19449]
 gi|418798645|ref|ZP_13354319.1| hypothetical protein SEEN567_19023 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19567]
 gi|418806870|ref|ZP_13362440.1| hypothetical protein SEEN550_01554 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21550]
 gi|418811033|ref|ZP_13366570.1| hypothetical protein SEEN513_20561 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22513]
 gi|418819963|ref|ZP_13375400.1| hypothetical protein SEEN425_16720 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22425]
 gi|418824033|ref|ZP_13379418.1| hypothetical protein SEEN462_22404 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22462]
 gi|418832501|ref|ZP_13387442.1| hypothetical protein SEEN486_20353 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM N18486]
 gi|418834359|ref|ZP_13389267.1| hypothetical protein SEEN543_19458 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM N1543]
 gi|418839823|ref|ZP_13394654.1| hypothetical protein SEEN554_21236 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21554]
 gi|418851856|ref|ZP_13406562.1| hypothetical protein SEEN978_01684 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 37978]
 gi|418853203|ref|ZP_13407898.1| hypothetical protein SEEN593_02318 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19593]
 gi|194403449|gb|ACF63671.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Newport str. SL254]
 gi|392756265|gb|EJA13162.1| hypothetical protein SEEN447_08030 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19447]
 gi|392758783|gb|EJA15648.1| hypothetical protein SEEN449_11603 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19449]
 gi|392766123|gb|EJA22905.1| hypothetical protein SEEN567_19023 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19567]
 gi|392780719|gb|EJA37371.1| hypothetical protein SEEN513_20561 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22513]
 gi|392782028|gb|EJA38666.1| hypothetical protein SEEN550_01554 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21550]
 gi|392793888|gb|EJA50323.1| hypothetical protein SEEN425_16720 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22425]
 gi|392797650|gb|EJA53956.1| hypothetical protein SEEN486_20353 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM N18486]
 gi|392805302|gb|EJA61433.1| hypothetical protein SEEN543_19458 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM N1543]
 gi|392811613|gb|EJA67613.1| hypothetical protein SEEN554_21236 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21554]
 gi|392816063|gb|EJA71993.1| hypothetical protein SEEN978_01684 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 37978]
 gi|392825252|gb|EJA81005.1| hypothetical protein SEEN462_22404 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22462]
 gi|392827750|gb|EJA83452.1| hypothetical protein SEEN593_02318 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19593]
          Length = 651

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 88/398 (22%), Positives = 140/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H     FN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLNFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|423214778|ref|ZP_17201306.1| hypothetical protein HMPREF1074_02838 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|423294029|ref|ZP_17272156.1| hypothetical protein HMPREF1070_00821 [Bacteroides ovatus
           CL03T12C18]
 gi|392676837|gb|EIY70260.1| hypothetical protein HMPREF1070_00821 [Bacteroides ovatus
           CL03T12C18]
 gi|392692684|gb|EIY85921.1| hypothetical protein HMPREF1074_02838 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 621

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 88/434 (20%), Positives = 160/434 (36%), Gaps = 65/434 (14%)

Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
           +W   YT       LL  Y L  + +AL     ++ +   ++Q  I   ++    Y L  
Sbjct: 126 IWGRKYT----SLSLLSYYRLTGDKKALNAVERLINHLMEQLQ--IHNINIAATGYYLGM 179

Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT--HIPIVIG 342
            +  + + +  LY IT +P++L  A            +++ +  S     T  +IP+   
Sbjct: 180 ASCSILEPVVYLYDITRNPRYLSFAKSI-------VSSIEREGSSQLITKTLKNIPVSER 232

Query: 343 S------------QMRYE-------------VTGDPLYKLIGTFFMDIVNASHSYATGGT 377
           S            Q  YE             +  DP Y  I    ++ +        G  
Sbjct: 233 SAFPKSWWSFENGQKAYEMMSCYEGLIELGTIVNDPFYIKIAEKAVNNIQEDEINIAGSG 292

Query: 378 SAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
           +A E W+  K           ETC T+  +++   L   T    YA+ +E  + N +++ 
Sbjct: 293 AAFECWYKGKEKQTLPTYHTMETCVTFTYMQLCHRLLCKTGNSFYAEEFEHTMYNALMAT 352

Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
            +     +  Y    GR   +       G   N   CC   G   F+ +  +    ++ +
Sbjct: 353 MKNDGSQISKYSPLEGR---RQPGEEQCGMHIN---CCNANGPRGFALIPKTACTIKDNH 406

Query: 498 V-PGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
           +   LY+    + S + K   V LN + D PI      + + +    K++     +L LR
Sbjct: 407 IYLNLYLPLQATISLN-KKNKVHLNVESDYPI---HGKVNVNIGVQKKEKF----TLALR 458

Query: 556 MPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
           +P  T     +A +NG+   +   G +L     W   DK+T+   +  +   + +     
Sbjct: 459 IP--TQIEKMKAYINGEEQEITHKGGYLYIERIWENADKVTLDFKIETKVVKLNNS---- 512

Query: 616 ASIQAILFGPYLLA 629
              QAI+ GP L A
Sbjct: 513 ---QAIVRGPLLFA 523


>gi|307719149|ref|YP_003874681.1| hypothetical protein STHERM_c14680 [Spirochaeta thermophila DSM
           6192]
 gi|306532874|gb|ADN02408.1| putative cytoplasmic protein [Spirochaeta thermophila DSM 6192]
          Length = 643

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 79/350 (22%), Positives = 137/350 (39%), Gaps = 53/350 (15%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFL-------GFLALQADY--LSHFHANTHI 337
            L +LY +T + +HL LA  F      +P +        G  +    +  L H ++ +HI
Sbjct: 194 ALLKLYELTGEKRHLDLASFFIEERGRQPHYFEWEWEKRGRTSFWPRFRELGHEYSQSHI 253

Query: 338 PI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
           P+      +G  +R             +TGD L           V     Y TGG  A  
Sbjct: 254 PVREQREAVGHAVRAMYMYTALADLARITGDTLLWETAQALWKDVTRRKMYLTGGIGASA 313

Query: 382 FWWDPKRLADTLGSEN--EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
           F  +   +A  L ++    ETC +  +   +  + R   +  Y+D  E AL NG+LS   
Sbjct: 314 FG-ESFSIAYDLPNDRAYNETCASIGLFFWASRMLRKEIDAEYSDVMELALYNGILS-GM 371

Query: 440 GTEPGVMIYMLPL------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
             +     Y+ PL       R     R       K+    CC        + +G   Y+ 
Sbjct: 372 SLDGSRFFYVNPLEVWPEACRHREDLRHVMTTRQKWFGCACCPPNLARLLASIG-GYYYS 430

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
             G+   L++  Y SS+   +   V + Q+ +    WD  +++++     +E     +L+
Sbjct: 431 RSGS--SLFVHFYGSSNLTIEDWGVTVEQETE--YPWDGEVKLSVIAREPREF----TLS 482

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
           LR+P W   N     +NG+     P   +++    W  N + T++L LS+
Sbjct: 483 LRIPGWC--NDFSLEMNGEAYTSTPERGYVAIRRTW--NGRDTVRLRLSM 528


>gi|423142165|ref|ZP_17129803.1| hypothetical protein SEHO0A_03740 [Salmonella enterica subsp.
           houtenae str. ATCC BAA-1581]
 gi|379050094|gb|EHY67987.1| hypothetical protein SEHO0A_03740 [Salmonella enterica subsp.
           houtenae str. ATCC BAA-1581]
          Length = 651

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 78/365 (21%), Positives = 124/365 (33%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF----------------------------------DKPCF 317
            L RLY IT  P+++ LA  F                                  DK   
Sbjct: 192 ALMRLYEITQQPRYMALADYFVEQRGTQPHYYDEEYAKRGKTAYWHTYGPAWMVKDKAYS 251

Query: 318 LGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
              L L A   +  HA   + ++ G      ++ D   +       + +     Y TGG 
Sbjct: 252 QAHLPLSAQQTATGHAVRFVYLMAGVAHLARLSQDEDKRQTCLRLWNNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSRYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H     FN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKTLTFNHIYDHVKPVRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  +Y         LYI  Y+ +S +    +  L  ++     W    ++T+T
Sbjct: 420 ARVLTSLGHYLYTPRN---EALYINMYVGNSVEIPLENGALKLRISGNYPWQE--QITIT 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q +    +L LR+P W      Q  +NGQ +       +L     W   D + + L
Sbjct: 475 VESSQPLRH--TLALRLPEWCPQ--PQVEVNGQPVEQDIRKGYLHIQRDWQEGDTIALTL 530

Query: 600 PLSLR 604
           P+ +R
Sbjct: 531 PMPVR 535


>gi|312126770|ref|YP_003991644.1| hypothetical protein Calhy_0533 [Caldicellulosiruptor
           hydrothermalis 108]
 gi|311776789|gb|ADQ06275.1| protein of unknown function DUF1680 [Caldicellulosiruptor
           hydrothermalis 108]
          Length = 654

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 92/390 (23%), Positives = 149/390 (38%), Gaps = 70/390 (17%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL--------------GFLALQADYLSHFHA 333
           L +LY +T D K+L LA  F      +P +               GF +L  +YL     
Sbjct: 200 LVKLYEVTGDRKYLELAKFFVDERGQEPYYFDIEYEKREKKSHWPGFKSLGREYLQAHKP 259

Query: 334 NTHIPIVIGSQMR----YEVTGD--------PLYKLIGTFFMDIVNASHSYATG--GTSA 379
                  +G  +R    Y    D         L+ +  T F DIV     Y TG  G+SA
Sbjct: 260 LRQQKEAVGHAVRAVYLYSGAADVAAYTQDKELFDVCKTLFDDIVKRK-MYITGAIGSSA 318

Query: 380 --REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS- 436
               F ++    +D   +E   TC +  ++  +  L +      Y D  ERAL N V+  
Sbjct: 319 HGEAFTFEYDLPSDAAYAE---TCASVGLIFFAHRLNKIEPHAKYYDVVERALYNTVIGS 375

Query: 437 -IQRGTEPGVMIYMLPLG---RGVSKARSTHGWGTKFNSFW---CCYGTGIESFSKLGDS 489
             Q G +     Y+ PL    + V K    H    +   ++   CC        + LG  
Sbjct: 376 MSQDGKK---YFYVNPLEVYPKEVEKRFDRHHVKPERQPWFGCACCPPNVARLLASLGRY 432

Query: 490 IYFEEEGNVPGLYIIQYISSSFDWKSGHV-VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQ 548
           +Y     N  G+Y+  YI SS   + G V VL Q+V     ++  +++ L  S +     
Sbjct: 433 VY---SYNHDGIYVNLYIGSSVQVEVGGVKVLLQQVSSY-PFEDMVKIDLKPSKEARF-- 486

Query: 549 LSSLNLRMPVW-----TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
              L LR+P W      Y NG +  +  Q L    P  ++     W  ND++ +++P  +
Sbjct: 487 --KLYLRIPGWCENYEVYVNGKKEEM--QKL----PSGYVCIERLWKENDQVVLKIPTEV 538

Query: 604 RTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           +  +            A++ GP +     +
Sbjct: 539 KMVSSHPQVRSNVGKVAVVKGPVVFCAEEA 568


>gi|251798052|ref|YP_003012783.1| hypothetical protein Pjdr2_4067 [Paenibacillus sp. JDR-2]
 gi|247545678|gb|ACT02697.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
          Length = 622

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 92/406 (22%), Positives = 152/406 (37%), Gaps = 50/406 (12%)

Query: 253 KMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDV-LYRLYSITHDPKHLLLAHL 311
           ++  +M  YF  +++++      ER      +  GG N + +Y LY+ T DP  + LA L
Sbjct: 135 RVIPFMTNYFRYQLKQL-----PERPLADWAKARGGDNLISVYWLYNRTGDPFLMELAQL 189

Query: 312 F--DKPCFLGFLALQADYL---SHFHANTHIPIVIGS----QMRYEVTGDPLYKLIGTFF 362
                  + G L  Q  Y    + F    H+  V  S     ++Y +TGD   K +    
Sbjct: 190 LIVQTEDWKG-LYEQYPYWYRQTSFDHRVHVVNVAMSFKQPALQYLLTGDETDKAVVYKA 248

Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
           ++ V A H    G  S  E+      LA T  S+  E C+    +    +L R T +  +
Sbjct: 249 INSVMACHGQVNGMFSGDEW------LAGTHPSQGTELCSVVEYMYSLENLIRITGDGFF 302

Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG---WGTKFNS-------- 471
            D  E+   N   ++     P   ++     +  ++   TH    W    N         
Sbjct: 303 GDILEKIAYN---ALPAAISPDWKVHQY--DQQANQIMCTHAKRNWTENNNEANLFGVEP 357

Query: 472 -FWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSW 530
            F CC     + + KL   ++   EG   G+  I Y         G     +    + + 
Sbjct: 358 HFGCCTANMHQGWPKLAARLWMASEGG--GIAAISYAPCLVTAALGSDKKTKAEIQVETS 415

Query: 531 DPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWS 590
            P+ R T+      E     ++ LR+P W      Q  +NG+  PL P   F+S    W 
Sbjct: 416 YPF-RDTVNIKVGLESSAAFAMKLRIPAWCEEPVLQ--INGEPYPLQPVNGFVSIERIWM 472

Query: 591 YNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
             D+L + LP   R   +    P       + +GP +LA     +W
Sbjct: 473 PEDELLLTLP---RHATL---IPRANGAAGVQYGPLMLAIPVKEQW 512


>gi|402489910|ref|ZP_10836703.1| hypothetical protein RCCGE510_19298 [Rhizobium sp. CCGE 510]
 gi|401811249|gb|EJT03618.1| hypothetical protein RCCGE510_19298 [Rhizobium sp. CCGE 510]
          Length = 640

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 88/396 (22%), Positives = 154/396 (38%), Gaps = 61/396 (15%)

Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
            L +L  +T + K+L L+  F      +   F    A      + FH  T      H P+
Sbjct: 198 ALVKLARVTGEKKYLELSKFFIDERGSEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 258 RDQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +     +  YAD  E+AL NG L    
Sbjct: 317 NEGFTDYYDLPN--ATAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
            T+     Y  PL      A   H W  K++   CC        + +G  +Y   +  + 
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426

Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            +++    ++     +G    L Q  +    WD      + F+++ +     +L+LR+P 
Sbjct: 427 AVHLYGESTARLKLANGAEGELQQTTN--YPWD----GAVAFTTRLKTPATFALSLRIPD 480

Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
           W  ++GA  S+NG+ L L       +     +W+  D++ + LPL+LR +       + A
Sbjct: 481 W--ADGATLSVNGEMLDLNANIRDGYARIDRQWADGDRVALHLPLALRPQYANPKVRQDA 538

Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
              A++ GP +    T       T     L+A+I P
Sbjct: 539 GRVALMRGPLVYCIET-------TDNGEDLNAIILP 567


>gi|56962984|ref|YP_174711.1| hypothetical protein ABC1212 [Bacillus clausii KSM-K16]
 gi|56909223|dbj|BAD63750.1| conserved hypothetical protein [Bacillus clausii KSM-K16]
          Length = 641

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 110/495 (22%), Positives = 190/495 (38%), Gaps = 93/495 (18%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG---TGYLSAFPTELFDSFEAL 222
           V  ++ A+A   A   +  ++++   ++  +S  Q   G   T Y    PT+ + +    
Sbjct: 72  VAKWIEAAAYTLAERPDPELEQRCDELIALISRAQQPDGYLNTHYTIKAPTKRWTNLRDN 131

Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
             ++   + I   +A     Y      QAL     +V  F + + +V      +   Y  
Sbjct: 132 HELYVAGHLIEAAVA-----YYETTGKQALLD---VVCKFADLIDQVFGPEPGKLRGYDG 183

Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF------ 331
           ++E   +   L +LY +  D ++L LA  F      +P F    A +      F      
Sbjct: 184 HQE---IELALLKLYRVKGDRRYLRLAQFFIEERGKEPHFFDDEAKKRGEDGTFWYSGRY 240

Query: 332 -HANTHIPI-----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYA 373
            ++ +H+P+       G  +R             E   + L K+  T + ++ N    Y 
Sbjct: 241 EYSQSHLPVRQQQEATGHAVRAVYMYTAMADLANETDDEQLAKVCRTLWDNVTN-QQMYI 299

Query: 374 TGGTSAREFW------WD-PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYY 426
           TGG  + EF       +D P  LA T      ETC +  ++  ++++     +  Y D  
Sbjct: 300 TGGIGSAEFGEAFTFAYDLPNDLAYT------ETCASIGLVFWAKNMLELEADSRYGDVM 353

Query: 427 ERALTNGVLS-IQ-RGTEPGVMIYMLPLGRGVSKARSTHGW---GTKFNSFW---CCYGT 478
           ERAL NG +S IQ  GT+     Y+ PL      A+  H      T+   ++   CC   
Sbjct: 354 ERALYNGTISGIQLDGTK---FFYVNPLEVWPQAAKHRHDLKHVKTERQPWFGCACCPPN 410

Query: 479 GIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL 538
                + +G  IY  +  N  G +I  YI             N+    I S +  L+M  
Sbjct: 411 IARLLASIGQYIYTTK--NQTG-FIHLYIG------------NESTLTIGSGEVGLKMKS 455

Query: 539 TFSSKQEVG--------QLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWS 590
           +F  K EVG        +  +L  R+P W  +N  Q ++NG  + +     +      W 
Sbjct: 456 SFPWKGEVGLEVNPDTSRPFTLAFRIPSW--ANDYQLTVNGHFVDVEVRDGYAYVERTWQ 513

Query: 591 YNDKLTIQLPLSLRT 605
             D ++IQ PL  + 
Sbjct: 514 KGDHISIQFPLETKV 528


>gi|16762630|ref|NP_458247.1| hypothetical protein STY4117 [Salmonella enterica subsp. enterica
           serovar Typhi str. CT18]
 gi|29144119|ref|NP_807461.1| hypothetical protein t3840 [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|213052815|ref|ZP_03345693.1| hypothetical protein Salmoneentericaenterica_07808 [Salmonella
           enterica subsp. enterica serovar Typhi str. E00-7866]
 gi|213428126|ref|ZP_03360876.1| hypothetical protein SentesTyphi_22630 [Salmonella enterica subsp.
           enterica serovar Typhi str. E02-1180]
 gi|213650623|ref|ZP_03380676.1| hypothetical protein SentesTy_27330 [Salmonella enterica subsp.
           enterica serovar Typhi str. J185]
 gi|213854603|ref|ZP_03382843.1| hypothetical protein SentesT_11074 [Salmonella enterica subsp.
           enterica serovar Typhi str. M223]
 gi|289826027|ref|ZP_06545185.1| hypothetical protein Salmonellentericaenterica_11725 [Salmonella
           enterica subsp. enterica serovar Typhi str. E98-3139]
 gi|378962007|ref|YP_005219493.1| hypothetical protein STBHUCCB_40530 [Salmonella enterica subsp.
           enterica serovar Typhi str. P-stx-12]
 gi|25333173|pir||AG0977 conserved hypothetical protein STY4117 [imported] - Salmonella
           enterica subsp. enterica serovar Typhi (strain CT18)
 gi|16504936|emb|CAD07947.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhi]
 gi|29139756|gb|AAO71321.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|374355879|gb|AEZ47640.1| hypothetical protein STBHUCCB_40530 [Salmonella enterica subsp.
           enterica serovar Typhi str. P-stx-12]
          Length = 651

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 87/398 (21%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + +G  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSIGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +++ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTISLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|365968450|ref|YP_004950011.1| hypothetical protein EcWSU1_00150 [Enterobacter cloacae EcWSU1]
 gi|365747363|gb|AEW71590.1| hypothetical protein EcWSU1_00150 [Enterobacter cloacae EcWSU1]
          Length = 667

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 82/359 (22%), Positives = 127/359 (35%), Gaps = 61/359 (16%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
            L RLY +T +P++L L   F      +P F      +    SH+  NT+ P  +     
Sbjct: 208 ALMRLYDVTQEPRYLALVKYFIDTRGTQPHFYDIEYEKRGRTSHW--NTYGPAWMVKDKA 265

Query: 347 YEVTGDPL---YKLIG-----TFFM----DIVNASHS-------------------YATG 375
           Y     PL   +  IG      + M     +   SH                    Y TG
Sbjct: 266 YSQAHQPLAEQHTAIGHAVRFVYLMAGMAHLARLSHDEDKRQDCLRLWNNMAQRQLYITG 325

Query: 376 G----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
           G    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL 
Sbjct: 326 GIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALY 382

Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIESFSK 485
           N VL      +     Y+ PL          H +         W    CC        + 
Sbjct: 383 NTVLG-GMALDGKHFFYVNPLEVHPKTLAFNHVYDHVKPVRQRWFGCACCPPNIARVLTS 441

Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
           LG  +Y   +     L+I  Y+ +          L  ++     W   + + +T  +   
Sbjct: 442 LGHYLYTVRQD---ALFINLYVGNDVAIPVDEGTLQLRISGNYPWQEEVNIEVTSPAPV- 497

Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
                +L LR+P W  S     SLNG+ +       +L  T RW   D LT+ LP+ +R
Sbjct: 498 ---THTLALRLPDWCASPAM--SLNGERVTGDVSRGYLYLTRRWQEGDTLTLTLPMPVR 551


>gi|329927011|ref|ZP_08281398.1| hypothetical protein HMPREF9412_4716 [Paenibacillus sp. HGF5]
 gi|328938722|gb|EGG35099.1| hypothetical protein HMPREF9412_4716 [Paenibacillus sp. HGF5]
          Length = 658

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 110/494 (22%), Positives = 195/494 (39%), Gaps = 79/494 (15%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFEALK 223
           V  +L A+A   A+  +  ++E++  ++  +++ Q     GYL+ + T  E    +  L 
Sbjct: 79  VAKWLEAAAYSLATHPDPKLEEQVDGLIDLVADAQQP--DGYLNTYFTVKEPEKRWTNLT 136

Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
                Y   H I AG+   Y      + L +   + ++    +  V      + H +  +
Sbjct: 137 DCHELYCAGHMIEAGVA-HYRATGKRKLLDVVCRLADH----IDTVFGPEDGKIHGFDGH 191

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFL------ALQADYLSHFH 332
           +E   +   L +LY +T +P++L L+  F      +P F  FL        ++ Y S  H
Sbjct: 192 QE---IELALVKLYEVTQEPRYLSLSQYFIDERGTEPHF--FLQEWEQRGKKSFYRSVLH 246

Query: 333 A------NTHIPI-----VIGSQMRY-----------EVTGDP-LYKLIGTFFMDIVNAS 369
           A       +H+P+      +G  +R              T DP L +   T + ++V+  
Sbjct: 247 APHLAYHQSHLPVREQKEAVGHSVRAVYMYTAMADLAARTKDPALLEACDTLWRNMVH-K 305

Query: 370 HSYATGGTSA----REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
             Y TGG  +      F  D     DT+ SE   TC +  ++  ++ + + + +  YAD 
Sbjct: 306 QMYITGGIGSTHHGEAFTTDYDLPNDTVYSE---TCASIGLIFFAQRMLQLSPKSEYADV 362

Query: 426 YERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYG 477
            ERAL N V+    Q G       Y+ PL    +  R   G          W    CC  
Sbjct: 363 MERALFNTVIGSMAQDGRH---FFYVNPLEVWPAACRYNPGKAHVKPVRPGWFACACCPP 419

Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
                 S LG+ +Y   +     LY   YI    + + G V +    +  + WD    +T
Sbjct: 420 NVARLLSSLGEYVYTMNDDT---LYAHLYIGGEAEVRFGDVPVKVMQNSALPWDG--DVT 474

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKL 595
           LT   +Q V    ++ LR+P W+    A   +NGQ + +       +      W+  D  
Sbjct: 475 LTLQPEQAVEW--TVALRIPDWSRGK-AGLRVNGQEMNVEDITQDGYACVKRVWAPGD-- 529

Query: 596 TIQLPLSLRTEAIQ 609
           T++L  S+    ++
Sbjct: 530 TVELAFSMEIHQVR 543


>gi|159041539|ref|YP_001540791.1| hypothetical protein Cmaq_0969 [Caldivirga maquilingensis IC-167]
 gi|157920374|gb|ABW01801.1| protein of unknown function DUF1680 [Caldivirga maquilingensis
           IC-167]
          Length = 634

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 64/262 (24%), Positives = 109/262 (41%), Gaps = 30/262 (11%)

Query: 350 TGD-PLYKLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADTLGSENEETCTTYN 405
           TGD  L++ +   ++D+   +  Y TGG  +R   E   +P  L +       ETC    
Sbjct: 275 TGDKALWEALSNLWVDL-TGTRMYVTGGVGSRHEGEAIGEPYELPND--RAYSETCAAVA 331

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
            +  +  +   T +  YAD  E AL N  L+     +     Y+ PL        +  GW
Sbjct: 332 NVMWNYRMLLATGDAKYADIMELALYNAALA-GISLDGKSYFYVNPL--------ANRGW 382

Query: 466 GTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
             +   F   CC        + L   IY        G++I  YI+S         ++  K
Sbjct: 383 HRRQPWFDVACCPPNIARLIASLPGYIYSTSSD---GVWIHLYIASEAKVNLNGGIVELK 439

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG--QNLPLPPPGN 581
           V+    WD  +++T+  S + E     ++ LR+P W  S G +  +NG  Q + L  P  
Sbjct: 440 VNTDYPWDGEVKVTVNPSKEDEF----TIYLRIPGW--SRGGKLLINGVEQGVEL-KPST 492

Query: 582 FLSATERWSYNDKLTIQLPLSL 603
           +L     W   D++ +++P+S+
Sbjct: 493 YLGVKRTWRSGDEVILRIPMSI 514


>gi|448408500|ref|ZP_21574295.1| hypothetical protein C475_08191 [Halosimplex carlsbadense 2-9-1]
 gi|445674355|gb|ELZ26899.1| hypothetical protein C475_08191 [Halosimplex carlsbadense 2-9-1]
          Length = 637

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/192 (25%), Positives = 75/192 (39%), Gaps = 19/192 (9%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC     +  ++ LF    + AYAD  ER L NG L+   G +     Y+ PL      
Sbjct: 338 ETCAAVGSVFWNQRLFELEPDPAYADLIERTLYNGFLA-GVGMDGEEFFYVNPLASDGDH 396

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
            RS  GW T      CC       F+ LG  +Y    G    LY+ QY+ S         
Sbjct: 397 HRS--GWFTCA----CCPPNAARLFASLGQYVYSTTGGE---LYVTQYVGSDLSTTVEGT 447

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
            +    +  + WD  + + +       V      NLR+P W  ++ A  +++G  +    
Sbjct: 448 AVELDQESALPWDGEVAIEVDADGAVPV------NLRIPEW--ADEATVTVDGDEVSHDG 499

Query: 579 PGNFLSATERWS 590
            G F+     W+
Sbjct: 500 SG-FVRVEREWN 510


>gi|56415571|ref|YP_152646.1| hypothetical protein SPA3530 [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. ATCC 9150]
 gi|197364498|ref|YP_002144135.1| hypothetical protein SSPA3296 [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. AKU_12601]
 gi|56129828|gb|AAV79334.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. ATCC 9150]
 gi|197095975|emb|CAR61560.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. AKU_12601]
          Length = 651

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 87/398 (21%), Positives = 141/398 (35%), Gaps = 75/398 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHTVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H    KFN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + +G  IY         LYI  Y+ +S +    +  L  ++     W  + ++ + 
Sbjct: 420 ARVLTSIGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIA 474

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
             S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +++ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTISLTL 530

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           P+ +R           A   AI  GP  Y L    +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568


>gi|116254107|ref|YP_769945.1| hypothetical protein RL4374 [Rhizobium leguminosarum bv. viciae
           3841]
 gi|115258755|emb|CAK09861.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
           3841]
          Length = 640

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 89/396 (22%), Positives = 157/396 (39%), Gaps = 61/396 (15%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFH------ANTHIPI 339
            L +L  +T + K+L L+  F      +P F    A +    +S +H      A  H P+
Sbjct: 198 ALVKLARVTDEKKYLDLSKYFIDERGTEPHFFTAEASRDGRDVSEYHQKTYEYAQAHQPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 258 RAQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +     +  YAD  E+AL NG L    
Sbjct: 317 NEGFTDYFDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
            T+     Y  PL      A   H W  K++   CC        + +G  +Y   +  + 
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVSDNEI- 426

Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            +++    ++     +G  V L Q  +    W+      + F+++ E     +L+LR+P 
Sbjct: 427 AVHLYGESTARLKLANGAEVELEQTTN--YPWEG----AVAFTTRLEKPAKFALSLRIPD 480

Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
           W  + GA  S+NG+ L L       ++     W+  D++ + LPL+LR +       + A
Sbjct: 481 W--AEGATLSVNGEMLDLNANMRDGYIRIDREWAAGDRVALYLPLALRPQYANPKVRQDA 538

Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
              A++ GP +    T       T     L+A++ P
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGEDLNAIVLP 567


>gi|332980748|ref|YP_004462189.1| hypothetical protein Mahau_0144 [Mahella australiensis 50-1 BON]
 gi|332698426|gb|AEE95367.1| protein of unknown function DUF1680 [Mahella australiensis 50-1
           BON]
          Length = 647

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 116/539 (21%), Positives = 200/539 (37%), Gaps = 75/539 (13%)

Query: 140 KTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSEC 199
           + A+    GK YG    P+   +   +  ++ A +   A   +  +K  +   +  +S+ 
Sbjct: 56  RIAAGEVSGKHYG----PV--FQDSDLAKWMEAVSCSLALRSDDDLKLHLEEAIALVSKA 109

Query: 200 QNKIGTGYLSAFPT--ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATW 257
           Q     GYL  + T  E    +  L+     Y   H I A + + Y +  N   L +A  
Sbjct: 110 QE--ADGYLDTYFTIEEPSARWTNLRDKHELYCAGHMIEAAVAN-YEVTGNKTLLNVACR 166

Query: 258 MVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK--- 314
           + ++    + ++    S +RH Y  +EE   +   L +LY  T++ K+L LAH F +   
Sbjct: 167 LADH----ICEMFGPESTKRHGYPGHEE---IELALVKLYHATNERKYLDLAHYFIRERG 219

Query: 315 --PCFLGFLAL-----------QADYLSHFHANTHIPI----VIGSQMRYEV-------- 349
             P +    A+               L +F A  H+P+     IG  +R           
Sbjct: 220 KAPYYFKIEAMARGEAKLDELWDPSKLEYFQA--HMPVTEQEAIGHAVRAMYLYSGMTDV 277

Query: 350 ---TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--ETCTTY 404
              TGD           D V     Y TGG  +  F  +    A  L ++    ETC + 
Sbjct: 278 ALETGDETIAQACRRLWDDVVKRKMYITGGVGSSSFG-EAFTFAYDLPNDTAYTETCASI 336

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR--GVSKARST 462
            ++  +  +F+  ++  Y D  ERAL N V +     +     Y+ PL     V   R  
Sbjct: 337 GLIFWAHRMFKMDQDAKYIDVMERALYNTVFA-SMSLDGKRYFYVNPLEVWPEVCHKRED 395

Query: 463 HGWGTKFNSFW----CCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISS--SFDWKS 515
           H         W    CC        + +G  +Y  +E+ N+  L++  Y+     F+   
Sbjct: 396 HRHVKTERQKWYDCACCPPNIARLLTSIGKYVYALDEDKNM--LFVNLYMDGQVKFNLND 453

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
             ++L Q  D +  WD  +  T+T ++        SL  R+P W         +NGQ + 
Sbjct: 454 KEIMLEQ--DTVYPWDGSISFTVTSNTPVTF----SLAFRIPDWC--KKWSIKINGQEIQ 505

Query: 576 LPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
                  +   T  W   DK+ + L + +       +    A   AI  GP +     +
Sbjct: 506 EHEKNKGYAVITRAWVAGDKVELMLDMPVMMMRANPEVRADAGKVAIQRGPVVYCAEEA 564


>gi|255531160|ref|YP_003091532.1| hypothetical protein Phep_1254 [Pedobacter heparinus DSM 2366]
 gi|255344144|gb|ACU03470.1| protein of unknown function DUF1680 [Pedobacter heparinus DSM 2366]
          Length = 684

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 62/308 (20%), Positives = 114/308 (37%), Gaps = 42/308 (13%)

Query: 345 MRYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTT 403
           + ++ TGD  Y K + T F D++   H    G  SA E       L     ++  E C T
Sbjct: 287 INFQRTGDSTYLKSLKTVFNDLMTL-HGLPNGIFSADE------DLHGNQPTQGTELCAT 339

Query: 404 YNMLKVSRHLFRWTKEIAYADYYERALTNGV---------------LSIQRGTEPGVMIY 448
              +     +   T +  Y D  ER   N +               ++ Q     GV  +
Sbjct: 340 VEAMYSLEEIINITGDTHYIDALERMTFNAMPSQTTDDYHEKQYFQMANQIEISRGVFAF 399

Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
            LP  R ++            + + CCY    + ++K   +++ + E    GL  + Y  
Sbjct: 400 TLPFDRKMNCVLGAK------SGYTCCYVNMHQGWTKFSQNLWHKTEN---GLAALIYGP 450

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
           ++   K G    +  ++ + ++    ++    S K+ V       LR+P W     A   
Sbjct: 451 NTLSTKVGAQQTDVTIEEVTNYPFEDQINFNLSLKKAVA--FPFQLRIPTWCKE--AVIL 506

Query: 569 LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
           +NG+       G  ++    W   D+LT+QLP+ +      D+       +A+  GP + 
Sbjct: 507 INGKIYSKEKGGKIITVNRTWQNKDRLTLQLPMEIAVSEWADNS------RAVERGPLVY 560

Query: 629 AGHTSGEW 636
                 +W
Sbjct: 561 GLKVQEKW 568


>gi|389805630|ref|ZP_10202778.1| hypothetical protein UUA_00270 [Rhodanobacter thiooxydans LCS2]
 gi|388447325|gb|EIM03335.1| hypothetical protein UUA_00270 [Rhodanobacter thiooxydans LCS2]
          Length = 607

 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 48/209 (22%), Positives = 85/209 (40%), Gaps = 23/209 (11%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC++   ++++R L   T E  YA+  ER   N +L  Q         Y+ P GR V  
Sbjct: 303 ETCSSLAWIQLNRELLAITGEARYAEEIERTGYNDLLGAQAPNGEDWCYYVFPNGRRV-- 360

Query: 459 ARSTHGWGTKFNSFW-CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK-SG 516
                       ++W CC  +G  +  +L    Y  ++     + +    S+SF    +G
Sbjct: 361 ----------HTTYWRCCKSSGAMALEELPALAYARDDDGAIAVNLYGAGSASFALDGAG 410

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
            + + Q        D  LR+ +    +       +L LR+P W  +  A   +NG++  +
Sbjct: 411 ELRIEQHTAYPYPDDVRLRIAVGRPMR------FTLKLRIPSW--AKDATLVINGEDAGV 462

Query: 577 P-PPGNFLSATERWSYNDKLTIQLPLSLR 604
              PG++      W   D+L  + P+  R
Sbjct: 463 ALSPGHYAVLEREWHDGDELVARFPMQPR 491


>gi|392977054|ref|YP_006475642.1| hypothetical protein A3UG_00935 [Enterobacter cloacae subsp.
           dissolvens SDM]
 gi|392322987|gb|AFM57940.1| hypothetical protein A3UG_00935 [Enterobacter cloacae subsp.
           dissolvens SDM]
          Length = 651

 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 78/365 (21%), Positives = 127/365 (34%), Gaps = 73/365 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T +P++L L   F      +P F      +    S++H             +
Sbjct: 192 ALMRLYDVTQEPRYLNLVKYFIEARGTQPHFYDTEYEKRGRTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H P+      IG  +R+            ++ D   +       + +     Y TGG 
Sbjct: 252 QAHQPLAEQQTAIGHAVRFVYLMAGMAHLARLSKDDAKRQDCLRLWNNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N 
Sbjct: 312 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSRYADVMERALYNT 368

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
           VL      +     Y+ PL          H     FN  +              CC    
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRTLAFNHIYDHVKPVRQRWFGCACCPPNI 419

Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
               + LG  IY         L+I  ++ +      G   L  ++     W   + + + 
Sbjct: 420 ARVLTSLGHYIYTVRPD---ALFINLFVGNEVTIPVGDETLKLRISGNYPWQKEVNIEIA 476

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
                 V    +L LR+P W  +     SLNG+ +       +L  T RW   D LT+ L
Sbjct: 477 ----SPVPVTHTLALRLPDWCAN--PHVSLNGEGMTGEVSRGYLHLTRRWQEGDTLTLTL 530

Query: 600 PLSLR 604
           P+ +R
Sbjct: 531 PMPVR 535


>gi|190333374|gb|ACE73687.1| hypothetical protein [Geobacillus stearothermophilus]
          Length = 642

 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 63/265 (23%), Positives = 107/265 (40%), Gaps = 26/265 (9%)

Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSA----REFWWDPKRLADTLGSENEETCTTYNM 406
           GD   K       + V     Y TGG  +      F +D     DT  +E   TC +  +
Sbjct: 278 GDESLKQACQTLWENVTKRQMYITGGVGSSAFGESFTFDFDLPNDTAYAE---TCASIAL 334

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG---RGVSK--ARS 461
           +  +R +     +  YAD  ERAL NG +S     +     Y+ PL    +   +   R 
Sbjct: 335 VFWTRRMLELEMDGKYADVMERALYNGTIS-GMDLDGKKFFYVNPLEVWPKACERHDKRH 393

Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFE-EEGNVPGLYIIQYISSSFDWKSGHVVL 520
                 K+ S  CC        + +G  IY +  +     LY+   I +  D +S  V +
Sbjct: 394 VKPVRQKWFSCACCPPNLARLIASIGHYIYLQTSDALFVHLYVGSDIQTEIDGRS--VKI 451

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-- 578
            Q+ +    WD  +R+T++  S  E     +L LR+P W    GA+ ++NG+ + + P  
Sbjct: 452 MQETN--YPWDGTVRLTVSPESAGEF----TLGLRIPGW--CRGAEVTINGEKVDIVPLI 503

Query: 579 PGNFLSATERWSYNDKLTIQLPLSL 603
              +      W   D++ +  P+ +
Sbjct: 504 KKGYAYIRRVWQQGDEVKLYFPMPV 528


>gi|421075310|ref|ZP_15536325.1| protein of unknown function DUF1680 [Pelosinus fermentans JBW45]
 gi|392526752|gb|EIW49863.1| protein of unknown function DUF1680 [Pelosinus fermentans JBW45]
          Length = 650

 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 113/500 (22%), Positives = 183/500 (36%), Gaps = 89/500 (17%)

Query: 181 HNATIKEKMSTVVFSLSECQNKIGTGYLSAFP---------TELFDSFEALKPVWAPYYT 231
           H  +  EK++     +  C  +   GYL+ +          T L D+ E         Y 
Sbjct: 92  HKDSALEKVADAAIDIV-CAAQQADGYLNTYYILNGLDKRWTNLQDNHEL--------YC 142

Query: 232 IHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMND 291
           +  ++ G +  Y      + LK A   V+Y    V  ++     ++H Y  +E    +  
Sbjct: 143 LGHMIEGAISYYQATGKDKLLKAAIRYVDY----VDTILGPEQGKKHGYPGHEV---IEL 195

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----------------------KPCFLGFLALQADY- 327
            L +LY IT D KHL LA  F                        K  +  +   QAD  
Sbjct: 196 ALVKLYQITKDEKHLKLAKYFIDERGQQPLYFQEETKRYGNDFPWKDSYFQYKYYQADQP 255

Query: 328 -----LSHFHANTHIPIVIGSQMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG--GTSA 379
                ++  HA     +  G      +T D  LY      + ++      Y TG  G SA
Sbjct: 256 VRSQQVAEGHAVRATYLYSGMADVARLTKDEELYAACKRIWNNMTQ-RQMYITGSIGASA 314

Query: 380 --REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
               F +D     DT+  E   TC +   +  +R +   + E  YAD  E+ L NG+LS 
Sbjct: 315 YGESFTYDYDLPNDTVYGE---TCASIGAVFFARRMLEISPEGEYADVIEKELFNGILS- 370

Query: 438 QRGTEPGVMIYMLPLGR--GVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIY 491
               +     Y+ PL      SK    H         W    CC       F+ LG  IY
Sbjct: 371 GMSMDGKSFFYVNPLEVVPEASKKDQLHHHVEVERQKWFGCACCPPNIARLFASLGSYIY 430

Query: 492 -FEEEGNV--PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQ 548
            +  + N     LYI   ++ +FD +     +N  V     WD  + +T++ +  +E   
Sbjct: 431 SYSAKSNTLWLHLYIGGELTHTFDSQE----VNFTVATNYPWDEDVEITVSLAESKEF-- 484

Query: 549 LSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
             +  LR+P W      + ++NG+    P    +      W   D   I L  ++  E +
Sbjct: 485 --TYALRIPGWC--KAYEVNVNGEKTNAPIVNGYAYLQREWKNGD--VIHLHFAMPIEVM 538

Query: 609 QDD---RPEYASIQAILFGP 625
           Q +   R +   + A++ GP
Sbjct: 539 QANPRVREDLGKV-AMMRGP 557


>gi|213418442|ref|ZP_03351508.1| hypothetical protein Salmonentericaenterica_11358 [Salmonella
           enterica subsp. enterica serovar Typhi str. E01-6750]
          Length = 385

 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 59/253 (23%), Positives = 93/253 (36%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 68  ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 120

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + +G  IY         LYI 
Sbjct: 121 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYIYTPR---ADALYIN 175

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +    +  L  ++     W  + ++ +   S Q V    +L LR+P W     
Sbjct: 176 MYVGNSMEIPVENGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 229

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +++ LP+ +R           A   AI  G
Sbjct: 230 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTISLTLPMPVRRVYGNPLARHVAGKVAIQRG 289

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 290 PLVYCLEQADNGE 302


>gi|418817745|ref|ZP_13373230.1| hypothetical protein SEEN538_23719 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21538]
 gi|392787738|gb|EJA44277.1| hypothetical protein SEEN538_23719 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21538]
          Length = 651

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 60/253 (23%), Positives = 92/253 (36%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H     FN  +              CC        + LG  IY         LYI 
Sbjct: 387 --EVHPKSLNFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +    +  L  ++     W  + ++ +   S Q V    +L LR+P W     
Sbjct: 442 MYVGNSMEIPVENGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 495

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +T+ LP+ +R           A   AI  G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 556 PLVYCLEQADNGE 568


>gi|424792517|ref|ZP_18218744.1| hypothetical protein XTG29_01554 [Xanthomonas translucens pv.
           graminis ART-Xtg29]
 gi|422797058|gb|EKU25452.1| hypothetical protein XTG29_01554 [Xanthomonas translucens pv.
           graminis ART-Xtg29]
          Length = 664

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 47/211 (22%), Positives = 79/211 (37%), Gaps = 18/211 (8%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +  + +   +  YAD  ERAL N VL+     +     Y+ PL      
Sbjct: 339 ESCASIGLMMFANRMLQLAPDSRYADVMERALYNTVLA-GMALDGRHFFYVNPLEVHPPT 397

Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               HG+         W    CC        + LG  +Y   +     LY+  Y+ S   
Sbjct: 398 VHGNHGFDHVKPVRQRWFGCACCPPNIARVVTSLGHYLYTRRDDT---LYVNLYVGSDAA 454

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
           +  G   L  +      W   + +++   +  E G    L LR+P W      Q  LNG+
Sbjct: 455 FDVGGQTLTLRQRGEYPWQEQVELSMDCDAPIEAG----LALRLPDWC--RAPQLQLNGE 508

Query: 573 NLPLPP--PGNFLSATERWSYNDKLTIQLPL 601
            + +       +    +RW   D L + LP+
Sbjct: 509 AVAIAAHLQHGYCVLRQRWQRGDTLHLHLPM 539


>gi|213582277|ref|ZP_03364103.1| hypothetical protein SentesTyph_14169 [Salmonella enterica subsp.
           enterica serovar Typhi str. E98-0664]
          Length = 380

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 59/253 (23%), Positives = 93/253 (36%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 63  ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 115

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + +G  IY         LYI 
Sbjct: 116 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYIYTPR---ADALYIN 170

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +    +  L  ++     W  + ++ +   S Q V    +L LR+P W     
Sbjct: 171 MYVGNSMEIPVENGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 224

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +++ LP+ +R           A   AI  G
Sbjct: 225 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTISLTLPMPVRRVYGNPLARHVAGKVAIQRG 284

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 285 PLVYCLEQADNGE 297


>gi|241206592|ref|YP_002977688.1| hypothetical protein Rleg_3907 [Rhizobium leguminosarum bv.
           trifolii WSM1325]
 gi|240860482|gb|ACS58149.1| protein of unknown function DUF1680 [Rhizobium leguminosarum bv.
           trifolii WSM1325]
          Length = 648

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 91/402 (22%), Positives = 159/402 (39%), Gaps = 73/402 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFH------ANTHIPI 339
            L +L  +T + K+L L+  F      +P F    A +    +S +H      A  H P+
Sbjct: 206 ALVKLARVTDEKKYLELSKYFIDERGTEPHFFTAEASRDGRDVSEYHQKTYEYAQAHQPV 265

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 266 RAQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 324

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +     +  YAD  E+AL NG L    
Sbjct: 325 NEGFTDYFDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL---- 378

Query: 440 GTEPGVMI------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
              PG+ I      Y  PL      A   H W  K++   CC        + +G  +Y  
Sbjct: 379 ---PGLSIDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAV 429

Query: 494 EEGNVPGLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
            +  +  +++    ++     +G  V L Q  +    W+      + F+++ E     +L
Sbjct: 430 SDNEI-AVHLYGESTARLKLANGAEVELEQTTN--YPWEG----AVAFTTRLEKPAKFAL 482

Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
           +LR+P W  ++GA  S+NG+ L L       +      W+  D++ + LPL+LR +    
Sbjct: 483 SLRVPDW--ADGATLSVNGEMLDLNANMRDGYARIDREWAAGDRVALYLPLALRPQYANP 540

Query: 611 DRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
              + A   A++ GP +    T       T     L+A++ P
Sbjct: 541 KVRQDAGRVALMRGPLVYCVET-------TDNGEDLNAIVLP 575


>gi|89067251|ref|ZP_01154764.1| hypothetical protein OG2516_10441 [Oceanicola granulosus HTCC2516]
 gi|89046820|gb|EAR52874.1| hypothetical protein OG2516_10441 [Oceanicola granulosus HTCC2516]
          Length = 633

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 55/228 (24%), Positives = 92/228 (40%), Gaps = 18/228 (7%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVS 457
           ETC +  M+  +  +     +  YAD  E AL N  L+ + R  E       L       
Sbjct: 332 ETCASVAMVFWAARMLNLDLDGQYADILELALYNNALAGLSRDGEHYFYDNKL------E 385

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
              S H W   ++   CC        + +    Y   E  +  +++    +++     G 
Sbjct: 386 SDGSHHRWA--WHECPCCTMNVSRLVASVAGYFYGVAETEI-AVHLYGGATATLPVAGGR 442

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
           V L +  D    WD  +R+ L    + E  +  +L+LR+P W +  GA AS+NG+ L + 
Sbjct: 443 VTLTETSD--YPWDGAVRIAL----EPEGTRTFTLSLRVPGWCH--GATASVNGEALEVA 494

Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
           P   +L  T  W+  D + + LP+         D  + A   A+  GP
Sbjct: 495 PERGYLKITRDWAPGDVVELNLPMQAERLYAHPDVRQDAGRVALRRGP 542


>gi|383189042|ref|YP_005199170.1| hypothetical protein Rahaq2_1140 [Rahnella aquatilis CIP 78.65 =
           ATCC 33071]
 gi|371587300|gb|AEX51030.1| hypothetical protein Rahaq2_1140 [Rahnella aquatilis CIP 78.65 =
           ATCC 33071]
          Length = 657

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 55/221 (24%), Positives = 84/221 (38%), Gaps = 34/221 (15%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC +  ++  +  + +   +  YAD  ERAL N VL+     +     Y+ PL      
Sbjct: 339 ETCASIGLMMFANRMLQMDADSRYADVMERALYNTVLA-GMALDGKHFFYVNPL------ 391

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H     FN  +              CC        + LG  IY +      G+ I 
Sbjct: 392 --EVHPKSIPFNHIYDHVKPVRQRWFGCACCPPNIARLLASLGHYIYTQRPD---GVDIN 446

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            YI S  D   G   L  K      W    R+ +   + Q +   ++L LR+P W  S  
Sbjct: 447 LYIGSDVDATIGGKALRLKQSGGYPWAE--RVLIEIDTDQPLE--ATLALRLPDWCGS-- 500

Query: 565 AQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
            Q +LNG  L L       +L  T+ W   D++ + LP+ +
Sbjct: 501 PQVTLNGHPLELASLTQRGYLRLTQEWQKGDRIEMTLPMPV 541


>gi|190893687|ref|YP_001980229.1| hypothetical protein RHECIAT_CH0004122 [Rhizobium etli CIAT 652]
 gi|190698966|gb|ACE93051.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
          Length = 640

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 92/429 (21%), Positives = 165/429 (38%), Gaps = 73/429 (17%)

Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
            L +L  +T + K+L L+  F      +   F    A      + FH  T      H P+
Sbjct: 198 ALVKLARVTGEKKYLDLSKYFIDERGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 258 REQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +     +  YAD  E+AL NG L    
Sbjct: 317 NEGFTDYYDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
            T+     Y  PL      A   H W  K++   CC        + +G  +Y   +  + 
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426

Query: 500 GLYIIQYISSSFDWKSGHVV-LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            +++    ++     +G  V L Q  +    W+      + F+++ E     +L+LR+P 
Sbjct: 427 AVHLYGESTTRLKLANGAAVELQQATN--YPWE----GAVAFTTRLEKPAKFALSLRIPD 480

Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
           W  ++GA  S+NG+ L L       +     +W   D++ + LPLSLR +       + A
Sbjct: 481 W--ADGATLSVNGEKLDLGAVTRDGYARIDRQWVDGDRVDLFLPLSLRPQYANPKVRQDA 538

Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMS 676
              A++ GP +    T       T   + L+ ++ P             + S   T V+ 
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGQDLNTIVLP------------RELSAAETVVLK 579

Query: 677 NSNQSITME 685
           + N ++ ++
Sbjct: 580 DLNDAVALD 588


>gi|384538328|ref|YP_005722412.1| hypothetical protein SM11_pD0078 [Sinorhizobium meliloti SM11]
 gi|336036981|gb|AEH82911.1| hypothetical protein SM11_pD0078 [Sinorhizobium meliloti SM11]
          Length = 640

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 105/248 (42%), Gaps = 33/248 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
           ETC +  ++  +  +     +  YAD  E+AL NG L       PG+ I      Y  PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFFYDNPL 386

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               S  R  H W  K++   CC        + +G  +Y   E  +  +++    ++   
Sbjct: 387 E---STGRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESTARLK 439

Query: 513 WKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
             SG  V L Q+ +    W+      + F++K +     +L+LR+P W  + GA  S+NG
Sbjct: 440 LASGAEVELRQETN--YPWE----GAIAFTTKLDRPAKFALSLRIPEW--AAGATLSVNG 491

Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
             L L     G +      WS  D++ + LPL+LR +       +     A++ GP  Y 
Sbjct: 492 TMLDLSAHLTGGYARIEREWSDGDRVALYLPLALRPQYANPKVRQDVGRVALMRGPLVYC 551

Query: 628 LAGHTSGE 635
           +    +GE
Sbjct: 552 VEATDNGE 559


>gi|384534128|ref|YP_005716792.1| hypothetical protein [Sinorhizobium meliloti BL225C]
 gi|433610342|ref|YP_007193803.1| hypothetical protein C770_GR4pD0078 [Sinorhizobium meliloti GR4]
 gi|333816304|gb|AEG08971.1| protein of unknown function DUF1680 [Sinorhizobium meliloti BL225C]
 gi|429555284|gb|AGA10204.1| hypothetical protein C770_GR4pD0078 [Sinorhizobium meliloti GR4]
          Length = 640

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 105/248 (42%), Gaps = 33/248 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
           ETC +  ++  +  +     +  YAD  E+AL NG L       PG+ I      Y  PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFFYDNPL 386

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               S  R  H W  K++   CC        + +G  +Y   E  +  +++    ++   
Sbjct: 387 E---STGRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESTARLK 439

Query: 513 WKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
             SG  V L Q+ +    W+      + F++K +     +L+LR+P W  + GA  S+NG
Sbjct: 440 LASGAEVELRQETN--YPWE----GAIAFTTKLDRPAKFALSLRIPEW--AAGATLSVNG 491

Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
             L L     G +      WS  D++ + LPL+LR +       +     A++ GP  Y 
Sbjct: 492 TMLDLSAHLTGGYARIEREWSDGDRVALYLPLALRPQYANPKVRQDVGRVALMRGPLVYC 551

Query: 628 LAGHTSGE 635
           +    +GE
Sbjct: 552 VEATDNGE 559


>gi|302883148|ref|XP_003040476.1| hypothetical protein NECHADRAFT_44741 [Nectria haematococca mpVI
           77-13-4]
 gi|256721360|gb|EEU34763.1| hypothetical protein NECHADRAFT_44741 [Nectria haematococca mpVI
           77-13-4]
          Length = 645

 Score = 50.1 bits (118), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 53/212 (25%), Positives = 89/212 (41%), Gaps = 23/212 (10%)

Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFW--WDPKRLADTLGSEN--EETCTTYNMLKV 409
           L   +G  + D+V+    Y TG   +   W  + P  +   L  E    ETC T+ ++  
Sbjct: 291 LKAALGRLWRDMVD-KRMYVTGSLGSVRQWEGFGPAYILPDLEHEGCYAETCATFALINW 349

Query: 410 SRHLFRWTKEIAYADYYERALTNGVL-SIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
              + R   +  YAD  E AL NG L ++ +  +      +L   +G  K RS      K
Sbjct: 350 CARMLRLDLDAEYADVMEVALYNGFLGAVNQDGDAFYYENVLRTRKGEFKERS------K 403

Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
           +    CC     +    LG S+ + ++ +   + I QYI S        V++ QK D  +
Sbjct: 404 WFGVACCPPNVAKLLGNLG-SLIYSQDASTNLVAIHQYIDSELKIPESGVIIRQKTD--M 460

Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
            WD  + +++  S        ++L LR+P W 
Sbjct: 461 PWDGQVVLSIQGS--------ANLALRIPSWA 484


>gi|424872619|ref|ZP_18296281.1| hypothetical protein Rleg5DRAFT_4131 [Rhizobium leguminosarum bv.
           viciae WSM1455]
 gi|393168320|gb|EJC68367.1| hypothetical protein Rleg5DRAFT_4131 [Rhizobium leguminosarum bv.
           viciae WSM1455]
          Length = 648

 Score = 50.1 bits (118), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 89/396 (22%), Positives = 156/396 (39%), Gaps = 61/396 (15%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFH------ANTHIPI 339
            L +L  +T + K+L L+  F      +P F    A +    +S +H      A  H P+
Sbjct: 206 ALVKLARVTDEKKYLELSKYFIDERGTEPHFFTAEASRDGRDVSEYHQKTYEYAQAHQPV 265

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 266 RAQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 324

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +     +  YAD  E+AL NG L    
Sbjct: 325 NEGFTDYFDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 381

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
            T+     Y  PL      A   H W  K++   CC        + +G  +Y   +  + 
Sbjct: 382 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVSDNEI- 434

Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            +++    ++     +G  V L Q  +    W+      + F+++ E     +L+LR+P 
Sbjct: 435 AVHLYGESTARLKLANGAEVELEQTTN--YPWEG----AVAFTTRLEKPARFALSLRIPD 488

Query: 559 WTYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
           W  + GA  S+NG+ L L       +      W+  D++ + LPL+LR +       + A
Sbjct: 489 W--AEGATLSVNGEMLDLNANMYDGYARIDREWAAGDRVALYLPLALRPQYANPKVRQDA 546

Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
              A++ GP +    T       T     L+A++ P
Sbjct: 547 GRVALMRGPLVYCVET-------TDNGEDLNAIVLP 575


>gi|359791407|ref|ZP_09294266.1| hypothetical protein MAXJ12_18203 [Mesorhizobium alhagi CCNWXJ12-2]
 gi|359252565|gb|EHK55793.1| hypothetical protein MAXJ12_18203 [Mesorhizobium alhagi CCNWXJ12-2]
          Length = 634

 Score = 50.1 bits (118), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 99/478 (20%), Positives = 189/478 (39%), Gaps = 77/478 (16%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG---TGYLSAFP----TELFDS 218
           VG ++ A++   +   +A I+ K+  +V  L + Q   G     YL   P    T L D+
Sbjct: 75  VGKWIEAASYALSHRRDADIEAKIEKIVDDLEKAQAPDGYLNCWYLQREPDKRWTNLRDN 134

Query: 219 FEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
            E         Y +  +L G +  ++     + L +    +E +   V++       ++ 
Sbjct: 135 HE--------LYNLGHLLEGGIAYFLATGRRRLLDI----LERYVEHVRETFGPNPGQKR 182

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLA-----------HLFDKPCFLGFLALQADY 327
            Y  ++E   +   L +LY +T + KHL LA           H FD+       + +  +
Sbjct: 183 GYCGHQE---IELALIKLYRLTGERKHLDLAAYFINERGRQPHYFDQEAVARGESPRDFW 239

Query: 328 LSHFHAN-THIPI-----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNAS 369
              +  N +H P+     V+G  +R             E+    L +     + D++N S
Sbjct: 240 AKSYEYNQSHRPVREQTKVVGHAVRAMYMFSAMADLAAELNDASLKQACEVLWADVMN-S 298

Query: 370 HSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYY 426
             Y T G    +A E + +   L +   +   ETC +  ++  ++ +     +  YAD  
Sbjct: 299 KIYITSGLGPAAANEGFTEDYDLPND--TAYAETCASVALIFWAQRMLHLDLDGRYADVM 356

Query: 427 ERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSK 485
           E+AL NG L+ + R  E     Y  PL      +R    W   +++  CC        + 
Sbjct: 357 EQALFNGALTGLSRDGEH--YFYSNPLDSDGRHSR----WA--WHTCPCCTMNSSRLIAS 408

Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
           +G   +     +    ++   IS++    +G+V L +       W   +R+ ++     E
Sbjct: 409 VG-GYFVSASDDAIAFHLYGGISTNIRLATGNVSLRET--SAYPWSGSVRIAVSPDEPAE 465

Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPL 601
                ++ L +P W  S  A AS+NG+ + +       +LS    W   D + ++LP+
Sbjct: 466 F----TVKLHIPGWAQS--ATASVNGEPVDVKRGIEAGYLSIKRMWREGDTIALELPM 517


>gi|329930292|ref|ZP_08283894.1| hypothetical protein HMPREF9412_2909 [Paenibacillus sp. HGF5]
 gi|328935161|gb|EGG31645.1| hypothetical protein HMPREF9412_2909 [Paenibacillus sp. HGF5]
          Length = 626

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 65/300 (21%), Positives = 120/300 (40%), Gaps = 28/300 (9%)

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           YE+ G+P+ +      +D +   H  A G  S  E+      L+ T  S+  E C     
Sbjct: 237 YELNGNPVERESVHRGIDSLMTYHGQAHGMFSGDEW------LSGTHPSQGVELCAVVEY 290

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGV--------LSIQRGTEPGVMIYMLPLGRGVSK 458
           +     L R   E  + D  E+   N +         S Q   +   MI  +   R  S 
Sbjct: 291 MFSMEQLTRIFGEGRFGDILEKVAFNALPAAISADWTSHQYDQQVNQMICNV-APRAWSN 349

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
           +   + +G + N F CC     + + KL   ++ +++ +  GL  + Y   +     G  
Sbjct: 350 SPDANVFGLEPN-FGCCTANMHQGWPKLASHLWMKDQED--GLVAVSYAPCTVRTTVGRQ 406

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
            ++ +V+    +    R+ +  S   E  +   ++LR+P W   +    +LNG+ LP+  
Sbjct: 407 GVSAEVEVTGEYPFKDRVQIHLSL--ERAESFPISLRIPAWC--DHPVITLNGRELPIQA 462

Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDI 638
              +    + W   D L + LP+ ++TE+    R  YA+  +I  GP +        W +
Sbjct: 463 ESGYAKIVQTWQSGDLLELYLPMEVKTES----RSMYAT--SITRGPLVYVLPVKENWQM 516


>gi|418401306|ref|ZP_12974836.1| hypothetical protein SM0020_14414 [Sinorhizobium meliloti
           CCNWSX0020]
 gi|359504683|gb|EHK77215.1| hypothetical protein SM0020_14414 [Sinorhizobium meliloti
           CCNWSX0020]
          Length = 640

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 104/248 (41%), Gaps = 33/248 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
           ETC +  ++  +  +     +  YAD  E+AL NG L       PG+ I      Y  PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFFYDNPL 386

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               S  R  H W  K++   CC        + +G  +Y   E  +  +++    ++   
Sbjct: 387 E---STGRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESTARLK 439

Query: 513 WKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
             SG  V L Q+ +    W+      + F++K +      L+LR+P W  + GA  S+NG
Sbjct: 440 LASGAEVELRQETN--YPWE----GAIAFTTKLDRPAKFELSLRIPEW--AAGATLSVNG 491

Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
             L L     G +      WS  D++ + LPL+LR +       +     A++ GP  Y 
Sbjct: 492 TMLDLSAHLTGGYARIEREWSDGDRVALYLPLALRPQYANPKVRQDVGRVALMRGPLVYC 551

Query: 628 LAGHTSGE 635
           +    +GE
Sbjct: 552 VEATDNGE 559


>gi|417432692|ref|ZP_12161408.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Mississippi str. A4-633]
 gi|353614176|gb|EHC66091.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Mississippi str. A4-633]
          Length = 352

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 58/253 (22%), Positives = 93/253 (36%), Gaps = 34/253 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERAL N VL      +     Y+ P+      
Sbjct: 35  ESCASIGLMMFARQMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPM------ 87

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H    KFN  +              CC        + +G  IY         LYI 
Sbjct: 88  --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYIYTPR---ADALYIN 142

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            Y+ +S +    +  L  ++     W  + ++ +   S Q V    +L LR+P W     
Sbjct: 143 MYVGNSLEVPVENGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 196

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
           A+ +LNG  +       +L     W   D +++ LP+ +R           A   AI  G
Sbjct: 197 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTISLTLPMPVRRVYGNPLARHVAGKVAIQRG 256

Query: 625 P--YLLAGHTSGE 635
           P  Y L    +GE
Sbjct: 257 PLVYCLEQADNGE 269


>gi|224537081|ref|ZP_03677620.1| hypothetical protein BACCELL_01958 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521308|gb|EEF90413.1| hypothetical protein BACCELL_01958 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 801

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 134/348 (38%), Gaps = 59/348 (16%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L +LY +T D K+L  A  F D+    G+ +   +Y     +  H P+V     +G  +R
Sbjct: 222 LAKLYLVTGDQKYLDQAKFFLDQ---RGYTSRTDEY-----SQAHKPVVQQDEAVGHAVR 273

Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADT 392
                        +TGD  Y        D +     Y TGG   T+A E +     L + 
Sbjct: 274 AAYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATAAGEAFGKNYELPNM 333

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
             S   ETC     + V+  LF    E  Y D  ER L NG++S     + G   Y  PL
Sbjct: 334 --SAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL 390

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
                ++   H     F    CC          L   IY  ++ +V   Y+  ++S++ D
Sbjct: 391 -----ESMGQHQRQPWFGCA-CCPSNICRFIPSLPGYIYAVKDKDV---YVNLFMSNTSD 441

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TY 561
            K G   ++ +      W+  + + +   +K   GQ  +L +R+P W           TY
Sbjct: 442 LKVGGKAVSIEQTTKYPWNGDITIGI---NKNNAGQF-NLKVRIPGWVRGQVVPSDLYTY 497

Query: 562 SNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           S+G +      +NG+ +       +     RW   DK+ +   +  RT
Sbjct: 498 SDGKRLKYTVKVNGEAVQNELKDGYFCIDRRWKKGDKVEVHFDMEPRT 545


>gi|423109493|ref|ZP_17097188.1| hypothetical protein HMPREF9687_02739 [Klebsiella oxytoca 10-5243]
 gi|376382227|gb|EHS94961.1| hypothetical protein HMPREF9687_02739 [Klebsiella oxytoca 10-5243]
          Length = 655

 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 50/212 (23%), Positives = 80/212 (37%), Gaps = 16/212 (7%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERA  N VL      +     Y+ PL      
Sbjct: 339 ESCASIGLMMFARRMLEMEGDAHYADVMERAFYNTVLG-GMALDGKHFFYVNPLETYPKS 397

Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               H +         W    CC      +   +G  ++         L+I  Y  S   
Sbjct: 398 IPHNHIYDHIKPVRQRWFGCACCPPNIARTLVAIGHYLFTPRRD---ALFINFYAGSEAQ 454

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
           +      L  K+     WD    + +TFS  Q V    +L LR+P W      Q  +NG+
Sbjct: 455 FTINDQPLALKISGNYPWDE--EVNITFSHPQAVQH--TLALRLPEW--CEAPQVLINGE 508

Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
                    +L  T +W   D +T++LP++LR
Sbjct: 509 AAQGEQLKGYLHITRQWQQGDIITLRLPMTLR 540


>gi|294673046|ref|YP_003573662.1| hypothetical protein PRU_0271 [Prevotella ruminicola 23]
 gi|294472095|gb|ADE81484.1| conserved hypothetical protein [Prevotella ruminicola 23]
          Length = 774

 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 95/441 (21%), Positives = 161/441 (36%), Gaps = 96/441 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLF---DKPCFLGFLALQADYLSHFHANTHIPI-----VIGS 343
            L +LY +T + K+L  A  F      C  G    +       ++  H+PI     ++G 
Sbjct: 187 ALCKLYKVTGNKKYLEGAKYFVDETGRCTDGHRPSE-------YSQDHMPILQQQEIVGH 239

Query: 344 QMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
            +R             +TGD  Y+       + +++   + TGG  +R     P+   + 
Sbjct: 240 AVRAGYLYSGVADVAALTGDKAYQEALERIWENMSSKKLFITGGIGSR-----PQ--GEG 292

Query: 393 LGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
            G + E        ETC     +  +  +F  T E  Y D  ERAL N VLS       G
Sbjct: 293 FGPDYELNNHTAYCETCAAIANVYWNYRMFLATGESKYIDVCERALYNNVLS-------G 345

Query: 445 VMI------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
           V +      Y  PL       R       K+    CC G      + +   IY  +    
Sbjct: 346 VSLSGDKFFYDNPLESDGEHERQ------KWFGCACCPGNITRFVASVPGYIYARQ---- 395

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            G  I   + +    K G++ L Q  D    WD  +R+ +T  S    G+  ++ LR+P 
Sbjct: 396 -GKDIFVNLYAQGKAKIGNIELEQTTD--YPWDGKIRIKVTKGS----GKF-AIKLRVPS 447

Query: 559 W-----------TYSNGAQ---ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
           W            Y + A+    S+NG+ L  P   +++  +  W   D + +  P+ +R
Sbjct: 448 WLKTSPTNNDLYQYQDKAKTYSVSVNGKAL-YPENRDYIEISRSWKKGDTIELDFPMDVR 506

Query: 605 TEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTF 664
                D+  +     A   GP +     + + D K      L +  +P+   F   L+  
Sbjct: 507 RIVANDNAEDDRGKVAFERGPIVFCLEGADQTDHKVFNKYILDS--APVSAHFEQDLL-- 562

Query: 665 TQESGNSTFVMSNSNQSITME 685
                N   V+  S + +  +
Sbjct: 563 -----NGVMVLEGSAKELQQD 578


>gi|269839244|ref|YP_003323936.1| hypothetical protein Tter_2215 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269790974|gb|ACZ43114.1| protein of unknown function DUF1680 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 638

 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 81/367 (22%), Positives = 131/367 (35%), Gaps = 60/367 (16%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR- 346
           L  LY  T + ++L  A  F      G L     +    +   H+P      ++G  +R 
Sbjct: 204 LVELYRATGNERYLEQAKYFLDVRGQGLLGRAWGHFGPEYHQDHVPFREMREIVGHAVRA 263

Query: 347 ----------YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
                     Y  TGD           + +     Y TGG  +R          +  G E
Sbjct: 264 VYLNAGAADIYAETGDEAIMRALERLWENMTTKKMYVTGGIGSR-------YEGEAFGKE 316

Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI- 447
            E        ETC     +  +  +   T +  YAD  E  L N VL       PG+ + 
Sbjct: 317 YELPNARAYAETCAAIGSVMWNWRMLLLTADARYADLIEHTLYNAVL-------PGISLD 369

Query: 448 -----YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
                Y  PL     +   TH     F    CC      + + LG   Y      +  ++
Sbjct: 370 GALYFYQNPL-----EDEGTHRRQEWFGCA-CCPPNVARTLASLGGYFYSTSRDGI-WVH 422

Query: 503 IIQYISSSFDWKSGH-VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
           +     +    + G  V+L+Q      S +  +R+        E G+L  + LR+P W  
Sbjct: 423 LYSEGRAKLGLQDGREVLLSQHTSYPWSGEVAIRL----EQVPEEGELG-IYLRIPSWC- 476

Query: 562 SNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
               + ++NG++   P  PG +L     W   D++ ++LP+++R         E A   A
Sbjct: 477 -ERGEVAINGEDAATPITPGTYLELRRTWRAGDEVRLRLPMTVRRLEAHPYLSEDAGRVA 535

Query: 621 ILFGPYL 627
           I+ GP L
Sbjct: 536 IMRGPIL 542


>gi|334320143|ref|YP_004556772.1| hypothetical protein [Sinorhizobium meliloti AK83]
 gi|407722785|ref|YP_006842446.1| hypothetical protein BN406_05164 [Sinorhizobium meliloti Rm41]
 gi|334097882|gb|AEG55892.1| protein of unknown function DUF1680 [Sinorhizobium meliloti AK83]
 gi|407322845|emb|CCM71446.1| hypothetical protein BN406_05164 [Sinorhizobium meliloti Rm41]
          Length = 640

 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 62/248 (25%), Positives = 105/248 (42%), Gaps = 33/248 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
           ETC +  ++  +  +     +  YAD  E+AL NG L       PG+ I      Y  PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFFYDNPL 386

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               S  R  H W  K++   CC        + +G  +Y   E  +  +++    ++   
Sbjct: 387 E---STGRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESTARLK 439

Query: 513 WKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
             SG  V L Q+ +    W+      + F++K +     +L+LR+P W  + GA  S+NG
Sbjct: 440 LASGAEVELRQETN--YPWE----GAIAFATKLDRPAKFALSLRIPEW--AAGATLSVNG 491

Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
             L L     G +      WS  D++ + LPL++R +       +     A++ GP  Y 
Sbjct: 492 TMLDLSAHLTGGYARIEREWSDGDRVALYLPLAIRPQYANPKVRQDVGRVALMRGPLVYC 551

Query: 628 LAGHTSGE 635
           +    +GE
Sbjct: 552 VEATDNGE 559


>gi|402306264|ref|ZP_10825315.1| putative glycosyhydrolase [Prevotella sp. MSX73]
 gi|400380031|gb|EJP32860.1| putative glycosyhydrolase [Prevotella sp. MSX73]
          Length = 825

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 83/360 (23%), Positives = 141/360 (39%), Gaps = 77/360 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
            L +LY +T + K+L  A  F    + G  A++ +Y     + +H+P++     +G  +R
Sbjct: 226 ALCKLYLVTGNRKYLNEAKFFLD--YRGKTAVRQEY-----SQSHLPVLEQSEAVGHAVR 278

Query: 347 YE-----------VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
                        +TGD  Y   I   + +IV     Y TGG  A           +  G
Sbjct: 279 AAYMYAGMADVAALTGDTAYIHAIDRIWNNIV-GRKLYITGGIGA-------TNNGEAFG 330

Query: 395 SENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
           ++ E        ETC     + V+  LF    E  Y D  ER L NG++S     + G  
Sbjct: 331 ADYELPNMSAYAETCAAIGNVYVNYRLFLLHGESKYFDVLERTLYNGLIS-GVSMDGGGF 389

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
            Y  PL     ++R  H     F    CC          L   +Y  ++ NV   Y+  +
Sbjct: 390 FYPNPL-----ESRGQHQRQAWFGCA-CCPSNICRFLPSLPGYVYAVKDRNV---YVNLF 440

Query: 507 ISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT---- 560
           +SS  S +     V L+Q+      W+  + +T+    +   G   +L +R+P W     
Sbjct: 441 LSSSASLEVAGKRVALSQQTQ--YPWNGDIALTV---DENRAGAF-ALKIRIPGWVKGQP 494

Query: 561 -------YSNGAQA----SLNGQNLPLP----PPGNFLSATERWSYNDKLTIQLPLSLRT 605
                  YS+G +     ++NG+ L        P  + +   +W   D+++I   + +RT
Sbjct: 495 VPSDLYEYSDGKRTGYTIAVNGRRLTATDINFSPDGYCTIARKWKKGDRVSIHFDMEVRT 554


>gi|16265291|ref|NP_438083.1| hypothetical protein SM_b20631 [Sinorhizobium meliloti 1021]
 gi|15141431|emb|CAC49943.1| conserved hypothetical protein [Sinorhizobium meliloti 1021]
          Length = 640

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 62/248 (25%), Positives = 105/248 (42%), Gaps = 33/248 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
           ETC +  ++  +  +     +  YAD  E+AL NG L       PG+ I      Y  PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFFYDNPL 386

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               S  R  H W  K++   CC        + +G  +Y   E  +  +++    ++   
Sbjct: 387 E---STGRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESTARLK 439

Query: 513 WKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
             SG  V L Q+ +    W+      + F++K +     +L+LR+P W  + GA  S+NG
Sbjct: 440 LASGAEVELRQETN--YPWE----GAIAFATKLDRPAKFALSLRIPEW--AAGATLSVNG 491

Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
             L L     G +      WS  D++ + LPL++R +       +     A++ GP  Y 
Sbjct: 492 TMLDLSAHLTGGYARIEREWSDGDRVALYLPLAIRPQYANPKVRQDVGRVALMRGPLVYC 551

Query: 628 LAGHTSGE 635
           +    +GE
Sbjct: 552 VEATDNGE 559


>gi|288927800|ref|ZP_06421647.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
           F0108]
 gi|288330634|gb|EFC69218.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
           F0108]
          Length = 623

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 60/284 (21%), Positives = 107/284 (37%), Gaps = 20/284 (7%)

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           Y +TG+  Y          +N +    TG  ++ E W+  K L        +ETC T   
Sbjct: 266 YRLTGNTEYLSAVEQVWQNINDTEINITGSGASMESWFGGKHLQYMPIRHFQETCVTATW 325

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
           +K+SR L   T    YAD  E +  N +L   R T+        PL           G G
Sbjct: 326 IKLSRQLLLLTGNTKYADAVEISFYNALLGAMR-TDASDWAKYTPLSGQRLPGSEQCGMG 384

Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS-GHVVLNQKVD 525
                  CC  +G      +  +       +  G+ +  YI+  +   +  H  +  K++
Sbjct: 385 LN-----CCNASGPRGLFVIPQTAVLT---SAKGVDVNLYIAGDYKLTTPRHQQMVLKLE 436

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
                +  +   L+    + +    ++ LR+P W  S   +  +N   +     G ++  
Sbjct: 437 GEYPKNNKMSFLLSLKKAENI----TIRLRIPEW--STATKVIVNDVAVEHVQAGKYMEL 490

Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           +  W + D+++I+  +      +    PEY    AI  GP +LA
Sbjct: 491 SRTWHHGDRISIEFDMPGIVHRL-GQHPEYV---AITRGPIVLA 530


>gi|295098715|emb|CBK87805.1| Uncharacterized protein conserved in bacteria [Enterobacter cloacae
           subsp. cloacae NCTC 9394]
          Length = 657

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 60/251 (23%), Positives = 91/251 (36%), Gaps = 39/251 (15%)

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADGHYADVME 370

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H     FN  +              
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIYDHVKPVRQRWFGCA 421

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + LG  IY         L I  Y+ +      G  +L  ++     W   
Sbjct: 422 CCPPNIARVLTSLGHYIYTVRPD---ALLINLYVGNDVAIPVGDNILQLRISGNYPWHEQ 478

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
           +++ +T      V  + +L LR+P W        SLNGQ +       +L     W   D
Sbjct: 479 VKIEIT----SPVPVIHTLALRLPDWCAEPA--VSLNGQAITGEVSRGYLYLNRSWQEGD 532

Query: 594 KLTIQLPLSLR 604
            LT+ LP+ +R
Sbjct: 533 TLTLTLPMPVR 543


>gi|270295052|ref|ZP_06201253.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270274299|gb|EFA20160.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 688

 Score = 49.3 bits (116), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 114/545 (20%), Positives = 202/545 (37%), Gaps = 70/545 (12%)

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
           W P   + KIL     QY  A N +  ++  +M +YF  R Q          HW S  E 
Sbjct: 171 WWPRMVVLKILQ----QYYSATNDK--RVVAFMTKYF--RYQLNTLPQKPLGHWSSWAEF 222

Query: 286 TGGMN-DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
               N   +Y LY++T +   L L HL  +  F  F+ +          +   P  I   
Sbjct: 223 RACDNLQAVYWLYNLTGEDFLLELGHLLHRQSF-SFIDMVD------RGDLRRPCTIHCV 275

Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK-------RLADTLGSEN 397
              +   +P+   +       ++A      G    R F   P+        L     ++ 
Sbjct: 276 NLAQGIKEPIIYYLQDTDRKYIDAVKE---GFRDIRRFHGQPQGMYGGDEALHGNNPTQG 332

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYER--------ALTNGVLSIQRGTEPG-VMIY 448
            E C+   ++     +   T +I +AD+ ER         +++  ++ Q   +P  VM+ 
Sbjct: 333 SELCSAVELMYSLEKMVEITGDIDFADHLERIAFNALPAQISDDFMTKQYFQQPNQVMVT 392

Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
                       +   +GT    + CC+    + + K    +++    N  G+  I Y  
Sbjct: 393 RHRRNFDQDHEGTDLAFGT-LTGYPCCFSNMHQGWPKFTQHLWYATPDN--GIAAIVYSP 449

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRM--TLTFSSKQ---EVGQLS-SLNLRMPVWTYS 562
           S      G       V  ++S D Y  M   +TF+ K+   +V Q+    +LR+P W   
Sbjct: 450 SEVTANVG-----DNVPVVISEDTYYPMDHQITFTIKEVRNKVKQVKFPFHLRVPKWC-- 502

Query: 563 NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
             A+  +NG+       G        W  NDK+ + LP+ + T         Y +  +I 
Sbjct: 503 KQAEIRVNGKMEQTVKGGKIAIVDRIWKRNDKIELYLPMEVFTSTW------YENAVSIE 556

Query: 623 FGPYLLAGHTSGEWDIKTGTARSLSALISPIPPS--FNAQLVTFTQESGNSTFVMSNSNQ 680
            GP + A      W+ K        +    +  S  +N  LV F +   N    +S ++Q
Sbjct: 557 RGPLVYALKMEENWEKKEFKDSWYGSYYYQVTSSDPWNYGLVDFDRNRMNEVAQVSINSQ 616

Query: 681 SITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKE 740
              + +FP +  +A +    +  L    +  ++  N + G     +PF F G    +G E
Sbjct: 617 KQQL-DFPWNQENAPVEIKMKARL----IPTWTVYNEMAGP----QPFSFCGSA--EGGE 665

Query: 741 DELVV 745
            E+ +
Sbjct: 666 QEITL 670


>gi|255012840|ref|ZP_05284966.1| hypothetical protein B2_02969 [Bacteroides sp. 2_1_7]
 gi|410102232|ref|ZP_11297159.1| hypothetical protein HMPREF0999_00931 [Parabacteroides sp. D25]
 gi|409238954|gb|EKN31742.1| hypothetical protein HMPREF0999_00931 [Parabacteroides sp. D25]
          Length = 618

 Score = 49.3 bits (116), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 53/231 (22%), Positives = 99/231 (42%), Gaps = 25/231 (10%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPL-GRGV 456
           ETC +  M+  ++ + + T +  Y D  ER+L NG L+ I  G +     Y+ PL  +G 
Sbjct: 336 ETCASVGMVLWNQRMNQLTGDSKYIDVLERSLYNGALAGISLGGDR--FFYVNPLESKGD 393

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
              +  +G         CC          +G+ IY   +     L++  YI ++   + G
Sbjct: 394 HHRQEWYGCA-------CCPSQLSRFLPSIGNYIYASSDD---ALWVNLYIGNTGQIRIG 443

Query: 517 H--VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
              ++L Q+ D    WD  +++T++ S   E      + LR+P W  +     S+NG+ +
Sbjct: 444 ETDILLTQETD--YPWDGSVKLTISTSQPLE----KEIRLRIPDWCKT--YDLSINGKRI 495

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
            +P    + +  + W   D + + + + +   A      E    +AI  GP
Sbjct: 496 NVPKEKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFDKRAIQRGP 545


>gi|160887789|ref|ZP_02068792.1| hypothetical protein BACUNI_00192 [Bacteroides uniformis ATCC 8492]
 gi|423304369|ref|ZP_17282368.1| hypothetical protein HMPREF1072_01308 [Bacteroides uniformis
           CL03T00C23]
 gi|423310517|ref|ZP_17288501.1| hypothetical protein HMPREF1073_03251 [Bacteroides uniformis
           CL03T12C37]
 gi|156862731|gb|EDO56162.1| hypothetical protein BACUNI_00192 [Bacteroides uniformis ATCC 8492]
 gi|392681688|gb|EIY75045.1| hypothetical protein HMPREF1073_03251 [Bacteroides uniformis
           CL03T12C37]
 gi|392684698|gb|EIY78021.1| hypothetical protein HMPREF1072_01308 [Bacteroides uniformis
           CL03T00C23]
          Length = 688

 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 116/542 (21%), Positives = 201/542 (37%), Gaps = 64/542 (11%)

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
           W P   + KIL     QY  A N +  ++  +M +YF  R Q          HW S  E 
Sbjct: 171 WWPRMVVLKILQ----QYYSATNDK--RVVAFMTKYF--RYQLNTLPQKPLGHWSSWAEF 222

Query: 286 TGGMN-DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
               N   +Y LY++T +   L L HL  +  F     +    L        + +  G +
Sbjct: 223 RACDNLQAVYWLYNLTGEDFLLELGHLLHRQSFSFIDMVDRGDLRRPCTIHCVNLAQGIK 282

Query: 345 ---MRYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEET 400
              + Y+   D  Y   +   F DI    H    G     E       L     ++  E 
Sbjct: 283 EPIIYYQQDTDRKYIDAVKEGFRDI-RRFHGQPQGMYGGDE------ALHGNNPTQGSEL 335

Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYER--------ALTNGVLSIQRGTEPG-VMIYMLP 451
           C+   ++     +   T +I +AD+ ER         +++  ++ Q   +P  VM+    
Sbjct: 336 CSAVELMYSLEKMVEITGDIDFADHLERIAFNALPAQISDDFMTKQYFQQPNQVMVTRHR 395

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
                    +   +GT    + CC+    + + K    +++    N  G+  I Y  S  
Sbjct: 396 RNFDQDHEGTDLAFGT-LTGYPCCFSNMHQGWPKFTQHLWYATPDN--GIAAIVYSPSEV 452

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRM--TLTFSSKQ---EVGQLS-SLNLRMPVWTYSNGA 565
               G       V  ++S D Y  M   +TF+ K+   +V Q+    +LR+P W     A
Sbjct: 453 TANVG-----DNVPVVISEDTYYPMDHQITFTIKEVRNKVKQVKFPFHLRVPKWC--KQA 505

Query: 566 QASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
           +  +NG+       G        W  NDK+ + LP+ + T         Y +  +I  GP
Sbjct: 506 EIRVNGKMEQTVKGGKIAIVDRIWKRNDKIELYLPMEVFTSTW------YENAVSIERGP 559

Query: 626 YLLAGHTSGEWDIKTGTARSLSALISPIPPS--FNAQLVTFTQESGNSTFVMSNSNQSIT 683
            + A      W+ K        +    +  S  +N  LV F +   N    +S ++Q   
Sbjct: 560 LVYALKMEENWEKKEFKDSWYGSYYYQVTSSDPWNYGLVDFDRNRMNEVAQVSINSQKQQ 619

Query: 684 MEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDEL 743
           + +FP +  +A +    +  L    +  ++  N + G     +PF F G    +G E E+
Sbjct: 620 L-DFPWNQENAPVEIKMKARL----IPTWTVYNEMAGP----QPFSFCGSA--EGGEQEI 668

Query: 744 VV 745
            +
Sbjct: 669 TL 670


>gi|227509160|ref|ZP_03939209.1| conserved hypothetical protein, partial [Lactobacillus brevis
           subsp. gravesensis ATCC 27305]
 gi|227191367|gb|EEI71434.1| conserved hypothetical protein [Lactobacillus brevis subsp.
           gravesensis ATCC 27305]
          Length = 106

 Score = 49.3 bits (116), Expect = 0.010,   Method: Composition-based stats.
 Identities = 34/100 (34%), Positives = 45/100 (45%), Gaps = 17/100 (17%)

Query: 161 LRGHFVGHYLSASAQMWASTHN----ATIKEKMSTVVFSLSECQNKIG------TGYLSA 210
            RGHF GHYLSA +Q   S  +    + +  K+   +  L   Q           GY+SA
Sbjct: 1   FRGHFFGHYLSALSQAIDSVSDDDTRSQLLSKLRIGIEGLFRAQQAYAKSHPQSAGYVSA 60

Query: 211 FPTELFDSFEALK-------PVWAPYYTIHKILAGLLDQY 243
           F     D  E  +        V  P+Y +HKILAGL+D Y
Sbjct: 61  FREVALDEVEGKRVPESEKENVIVPWYNLHKILAGLIDGY 100


>gi|423115429|ref|ZP_17103120.1| hypothetical protein HMPREF9689_03177 [Klebsiella oxytoca 10-5245]
 gi|376381515|gb|EHS94252.1| hypothetical protein HMPREF9689_03177 [Klebsiella oxytoca 10-5245]
          Length = 655

 Score = 48.9 bits (115), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 49/212 (23%), Positives = 80/212 (37%), Gaps = 16/212 (7%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +R +     +  YAD  ERA  N VL      +     Y+ PL      
Sbjct: 339 ESCASIGLMMFARRMLEMEGDAHYADVMERAFYNTVLG-GMALDGKHFFYVNPLETYPKS 397

Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               H +         W    CC      +   +G  ++         L+I  Y  S   
Sbjct: 398 IPHNHIYDHIKPVRQRWFGCACCPPNIARTLVAIGHYLFTPRRD---ALFINFYAGSEAQ 454

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
           +      L  K+     WD    + +TFS  Q +    +L LR+P W      Q  +NG+
Sbjct: 455 FTINDQPLALKISGNYPWDE--EVNITFSHPQAIQH--TLALRLPEW--CEAPQVLINGE 508

Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
                    +L  T +W   D +T++LP++LR
Sbjct: 509 AAQGEQLKGYLHITRQWQQGDIITLRLPMTLR 540


>gi|298386781|ref|ZP_06996336.1| hypothetical protein HMPREF9007_03534 [Bacteroides sp. 1_1_14]
 gi|298260455|gb|EFI03324.1| hypothetical protein HMPREF9007_03534 [Bacteroides sp. 1_1_14]
          Length = 668

 Score = 48.9 bits (115), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 83/356 (23%), Positives = 136/356 (38%), Gaps = 78/356 (21%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
           L +LY +T D K+L  A  F      G+ + +  Y     +  H P+V     +G  +R 
Sbjct: 219 LVKLYMVTGDKKYLDQAKFFLDT--RGYTSRKDAY-----SQAHKPVVEQDEAVGHAVRA 271

Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                       +TGD  Y K I   + +IV +   Y TGG  AR          +  G+
Sbjct: 272 VYMYSGMADVAAITGDSSYIKAIDKIWDNIV-SKKIYITGGIGARH-------AGEAFGN 323

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     + ++  LF    +  Y D  ER L NG++S     + G   
Sbjct: 324 NYELPNQSAYCETCAAIGNVYMNYRLFLLHGDAKYFDVLERTLYNGLIS-GVSLDGGSFF 382

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQ 505
           Y  PL        S++G  ++   F C C  + +  F   L   +Y  +   V   Y+  
Sbjct: 383 YPNPL--------SSNGKYSRKPWFGCACCPSNVSRFIPSLPGYVYAVKNDQV---YVNL 431

Query: 506 YISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
           Y+S+  + K     ++L Q+      W+  +R+ +T     +  Q  ++ LR+P W   N
Sbjct: 432 YLSNKAELKVDKKKILLEQETG--YPWNGDIRLKIT-----QGNQDFTMKLRIPGWVRGN 484

Query: 564 ---------------GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
                            Q S+NGQ +       +LS   +W   D + +   +  R
Sbjct: 485 VLPSDLYSYADNQKPAYQVSVNGQTVESDVNDGYLSIARKWKKGDVVEVHFDMIPR 540


>gi|433678396|ref|ZP_20510262.1| hypothetical protein BN444_02464 [Xanthomonas translucens pv.
           translucens DSM 18974]
 gi|430816487|emb|CCP40741.1| hypothetical protein BN444_02464 [Xanthomonas translucens pv.
           translucens DSM 18974]
          Length = 664

 Score = 48.9 bits (115), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 46/213 (21%), Positives = 80/213 (37%), Gaps = 18/213 (8%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +  + +   +  YAD  ERAL N VL+     +     Y+ PL      
Sbjct: 339 ESCASIGLMMFANRMLQLAPDSRYADVMERALYNTVLA-GMALDGRHFFYVNPLEVHPPT 397

Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               HG+         W    CC        + LG  +Y   +     LY+  Y+ S   
Sbjct: 398 VHGNHGFDHVKPVRQRWFGCACCPPNIARVLTSLGHYLYTRRDDT---LYVNLYVGSDAA 454

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
           +  G   L  +      W   + +++   +  E    ++L LR+P W      Q  LNG+
Sbjct: 455 FDVGGQTLTLRQRGEYPWQEQVELSVDCDAPVE----AALALRLPDWC--RAPQLRLNGE 508

Query: 573 NLPLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
            + +       +     RW   D L + LP+ +
Sbjct: 509 AVAIAAHLQHGYCVLRRRWQRGDTLHLHLPMPV 541


>gi|423288216|ref|ZP_17267067.1| hypothetical protein HMPREF1069_02110 [Bacteroides ovatus
           CL02T12C04]
 gi|392671105|gb|EIY64581.1| hypothetical protein HMPREF1069_02110 [Bacteroides ovatus
           CL02T12C04]
          Length = 666

 Score = 48.9 bits (115), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 60/236 (25%), Positives = 97/236 (41%), Gaps = 26/236 (11%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC T+     S  LF  T    Y D  E+A  N + S+  G +     Y   L R   K
Sbjct: 353 ETCATFYGAYYSWRLFMLTGNPMYLDVMEKAFYNNLSSM--GLDGKSYFYTNVL-RWYGK 409

Query: 459 AR-----STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
                    H   T+  +  CC  + +   ++  D  Y ++E +   L++  Y S+  D 
Sbjct: 410 QHPLLSLDFHQRWTEECTCVCCPTSLVRFLAETKDYAYAKDENS---LFVTLYGSNEIDT 466

Query: 514 K-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
           K +G  V  ++V     WD  + M        E     SL LR+P W    GA   +NG 
Sbjct: 467 KINGKNVRFEQVTNY-PWDDKIEMNYKGDKNAEF----SLKLRIPAWAI--GATLKVNGI 519

Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ---AILFGP 625
           ++P+   G F     +W   DK+ + LP+      + +  P+   ++   A+ +GP
Sbjct: 520 DMPI-NTGVFAVVNRKWKSGDKVELVLPMK---PILNEGNPKVEEVRNQLAVSYGP 571


>gi|440731554|ref|ZP_20911563.1| hypothetical protein A989_09226 [Xanthomonas translucens DAR61454]
 gi|440372448|gb|ELQ09250.1| hypothetical protein A989_09226 [Xanthomonas translucens DAR61454]
          Length = 664

 Score = 48.9 bits (115), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 46/211 (21%), Positives = 79/211 (37%), Gaps = 18/211 (8%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +  + +   +  YAD  ERAL N VL+     +     Y+ PL      
Sbjct: 339 ESCASIGLMMFANRMLQLAPDSRYADVMERALYNTVLA-GMALDGRHFFYVNPLEVHPPT 397

Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               HG+         W    CC        + LG  +Y   +     LY+  Y+ S   
Sbjct: 398 VHGNHGFDHVKPVRQRWFGCACCPPNIARVLTSLGHYLYTRRDDT---LYVNLYVGSDAA 454

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
           +  G   L  +      W   + +++   +  E    ++L LR+P W      Q  LNG+
Sbjct: 455 FDVGGQTLTLRQRGEYPWQEQVELSVDCDAPVE----AALALRLPDWC--RAPQLRLNGE 508

Query: 573 NLPLPP--PGNFLSATERWSYNDKLTIQLPL 601
            + +       +     RW   D L + LP+
Sbjct: 509 AVAIAAHLQHGYCVLRRRWQRGDTLHLHLPM 539


>gi|354604714|ref|ZP_09022703.1| hypothetical protein HMPREF9450_01618 [Alistipes indistinctus YIT
           12060]
 gi|353347293|gb|EHB91569.1| hypothetical protein HMPREF9450_01618 [Alistipes indistinctus YIT
           12060]
          Length = 623

 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 94/408 (23%), Positives = 164/408 (40%), Gaps = 57/408 (13%)

Query: 229 YYTIHKILAGLLDQYVLADNAQAL-KMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
           Y   H I AG+   Y+LA   + L +++T MV +  N           +RHW   +EE  
Sbjct: 160 YCAGHMIEAGI--AYLLATGDRTLLEVSTRMVGHMMNEFG------PGKRHWVPGHEE-- 209

Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIG 342
            +   L +LYS+T +PK+L  A    +    G+   +    +  +    IP+     + G
Sbjct: 210 -IELALAKLYSVTGEPKYLEFARWLLEERGHGYGRNEEGTWNAAYYQDSIPVSRMTDITG 268

Query: 343 SQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSA---REFWWDPKR 388
             +R             ++GD +Y+       D V   + Y TGG  +    E + +   
Sbjct: 269 HAVRCMYLFCGMADMSMLSGDTVYRAALDRVWDDVVQRNMYITGGIGSSHQNEGFTEDYD 328

Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
           L +       ETC +  M+  +  + R   +  YAD  ERAL NG L+     +     Y
Sbjct: 329 LPNL--EAYCETCASVGMVLWNARMNRLKGDAKYADVMERALYNGALA-GISLDGKRFFY 385

Query: 449 MLPL-GRGVSKARSTHGWG---TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
           + PL  +G    ++ +G     ++ + F    G+ I S S   D+++         LY+ 
Sbjct: 386 VNPLESKGDHHRKAWYGCACCPSQLSRFLPSIGSYIYSHSLDSDTVWVN-------LYLG 438

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL-SSLNLRMPVWTYSN 563
              +      S  V+      P   W+   R+T++    +  G++   L LR+P W  ++
Sbjct: 439 SNAAIPTQDGSRFVLTQTTRYP---WEGNARITVS----EAPGKIRKELRLRIPGWCKNH 491

Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDD 611
                +NG+    P    +      W   D+  I L L++ TE +  D
Sbjct: 492 --TLWVNGELFDHPTDKGYAVVNRSWKKGDR--IDLSLAMPTEVVAAD 535


>gi|315607259|ref|ZP_07882259.1| conserved hypothetical protein [Prevotella buccae ATCC 33574]
 gi|315250962|gb|EFU30951.1| conserved hypothetical protein [Prevotella buccae ATCC 33574]
          Length = 825

 Score = 48.5 bits (114), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 82/360 (22%), Positives = 141/360 (39%), Gaps = 77/360 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
            L +LY +T + K+L  A  F    + G  A++ +Y     + +H+P++     +G  +R
Sbjct: 226 ALCKLYLVTGNRKYLDEAKFFLD--YRGKTAIRQEY-----SQSHLPVLEQSEAVGHAVR 278

Query: 347 YE-----------VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
                        +TGD  Y   I   + +IV     Y TGG  A           +  G
Sbjct: 279 AAYMYAGMADVAALTGDTAYIHAIDRIWNNIV-GRKLYITGGIGATNN-------GEAFG 330

Query: 395 SENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
           ++ E        ETC     + V+  LF    E  Y D  ER L NG++S     + G  
Sbjct: 331 ADYELPNMSAYAETCAAIGNVYVNYRLFLLHGESKYFDVLERTLYNGLIS-GVSMDGGGF 389

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
            Y  PL     ++R  H     F    CC          L   +Y  ++ NV   Y+  +
Sbjct: 390 FYPNPL-----ESRGQHQRQAWFGCA-CCPSNICRFLPSLPGYVYAVKDRNV---YVNLF 440

Query: 507 I--SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT---- 560
           +  S+S +     V L+Q+      W+  + +T+    +   G   +L +R+P W     
Sbjct: 441 LSNSASLEVAGKRVALSQQTQ--YPWNGDIALTV---DENRAGAF-ALKIRIPGWVKGQP 494

Query: 561 -------YSNGAQA----SLNGQNLPLP----PPGNFLSATERWSYNDKLTIQLPLSLRT 605
                  YS+G +     ++NG+ L        P  + +   +W   D+++I   + +RT
Sbjct: 495 VPSDLYEYSDGKRTGYTIAVNGRRLTATDINFSPDGYCTIVRKWKKGDRVSIHFDMEVRT 554


>gi|378763347|ref|YP_005191963.1| hypothetical protein SFHH103_05359 [Sinorhizobium fredii HH103]
 gi|365182975|emb|CCE99824.1| hypothetical protein SFHH103_05359 [Sinorhizobium fredii HH103]
          Length = 879

 Score = 48.5 bits (114), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 79/370 (21%), Positives = 150/370 (40%), Gaps = 53/370 (14%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL--SHFHANTHIPI 339
            L +L  +T + K+L L+  F      +P F    A++      DY+  +H ++ +H P+
Sbjct: 435 ALVKLARVTGETKYLDLSKFFIDERGREPHFFTEEAIRDGRSPKDYIQKTHEYSQSHEPV 494

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG   ++ 
Sbjct: 495 RRQKKVVGHAVRAMYMYSGMADLATEYKDDTLTDALETLWDDLTT-KQMYVTGGIGPSAK 553

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +        +AD  E+AL NG LS   
Sbjct: 554 NEGFTDCYDLPND--TAYAETCASVALVFWASRMLGRGPNRRFADIMEQALYNGALS-GL 610

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
             +     Y  PL          H W  K+++  CC        + +G  +Y      + 
Sbjct: 611 SLDGKTFFYDNPL----ESTGKHHRW--KWHNCPCCPPNIARLVASVGAYMYGVAAEEI- 663

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            +++    +   +     V L Q  +    WD  + + L     ++     +L+LR+P W
Sbjct: 664 AVHLYGESTVRLEVGGSDVTLQQVTN--YPWDGAVSIKLDLKEPRQF----ALSLRIPEW 717

Query: 560 TYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
             ++GA+ ++NG ++ L       +     +W+  D ++++LPL LR +       + A 
Sbjct: 718 --ADGARIAINGSSVDLDAVMTDGYARIERQWANGDAVSLELPLQLRPQYANPKVRQDAG 775

Query: 618 IQAILFGPYL 627
             A++ GP +
Sbjct: 776 RVALMRGPLV 785


>gi|288925306|ref|ZP_06419241.1| cytoplasmic protein [Prevotella buccae D17]
 gi|288338071|gb|EFC76422.1| cytoplasmic protein [Prevotella buccae D17]
          Length = 825

 Score = 48.5 bits (114), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 82/360 (22%), Positives = 141/360 (39%), Gaps = 77/360 (21%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
            L +LY +T + K+L  A  F    + G  A++ +Y     + +H+P++     +G  +R
Sbjct: 226 ALCKLYLVTGNRKYLDEAKFFLD--YRGKTAVRQEY-----SQSHLPVLKQSEAVGHAVR 278

Query: 347 YE-----------VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
                        +TGD  Y   I   + +IV     Y TGG  A           +  G
Sbjct: 279 AAYMYAGMADVAALTGDTAYIHAIDRIWNNIV-GRKLYITGGIGATNN-------GEAFG 330

Query: 395 SENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
           ++ E        ETC     + V+  LF    E  Y D  ER L NG++S     + G  
Sbjct: 331 ADYELPNMSAYAETCAAIGNVYVNYRLFLLHGESKYFDVLERTLYNGLIS-GVSMDGGGF 389

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
            Y  PL     ++R  H     F    CC          L   +Y  ++ NV   Y+  +
Sbjct: 390 FYPNPL-----ESRGQHQRQAWFGCA-CCPSNICRFLPSLPGYVYAVKDRNV---YVNLF 440

Query: 507 I--SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT---- 560
           +  S+S +     V L+Q+      W+  + +T+    +   G   +L +R+P W     
Sbjct: 441 LSNSASLEVAGKRVALSQQTQ--YPWNGDIALTV---DENRAGAF-ALKIRIPGWVKGQP 494

Query: 561 -------YSNGAQA----SLNGQNLPLP----PPGNFLSATERWSYNDKLTIQLPLSLRT 605
                  YS+G +     ++NG+ L        P  + +   +W   D+++I   + +RT
Sbjct: 495 VPSDLYEYSDGKRTGYTIAVNGRRLTATDINFSPDGYCTIARKWKKGDRVSIHFDMEVRT 554


>gi|256394126|ref|YP_003115690.1| hypothetical protein Caci_4989 [Catenulispora acidiphila DSM 44928]
 gi|256360352|gb|ACU73849.1| protein of unknown function DUF1680 [Catenulispora acidiphila DSM
           44928]
          Length = 647

 Score = 48.1 bits (113), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 83/369 (22%), Positives = 134/369 (36%), Gaps = 59/369 (15%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFL---ALQADYLSHFHANTHIPI-----VIGS 343
            L  LY  T + ++L LA  F      G L   A +       +   H+P+     V G 
Sbjct: 202 ALVELYRETGEQRYLDLAAYFVDRRGHGLLNPEATRGTAAGPAYCQDHLPVREANAVAGH 261

Query: 344 QMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSA---REFWWDPKRL 389
            +R              TGD   +         + A  ++ TGG  A    E + DP  L
Sbjct: 262 AVRQLYFLAGVTDLAVETGDASLRAAAERLWTEMAARKTHITGGLGAHHAEEDFGDPYEL 321

Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI-- 447
            +       ETC     ++ +  +   T E  Y+D  ER L N VL       PGV +  
Sbjct: 322 PNE--RAYCETCAAIASVQWNWRMALLTGEAKYSDLAERTLYNAVL-------PGVSLDG 372

Query: 448 ----YMLPLGRGVSKARSTH-----GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
               Y  PL     + R  H       G    +++ C          L    ++   G+ 
Sbjct: 373 TRWFYANPL-----QVRDEHLDRHGDHGVSRKAWFRCACCPPNVMRLLASLPHYFVSGDA 427

Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            G+ + QY + S++  +G V    +V+    W   + +T+      E G   +L+LR+P 
Sbjct: 428 DGIQLHQYATGSYEAVAGTV----RVETGYPWSGGIAVTI------ERGGEWTLSLRVPG 477

Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
           W      +A +NG  +    P  +L     W   D +++ L + +R  A           
Sbjct: 478 WCAD--VEAGVNGVAVDTVVPDGWLRIRRAWQPGDVVSLNLAMPIRLTAADPRVDAVRGC 535

Query: 619 QAILFGPYL 627
            AI  GP +
Sbjct: 536 AAIERGPLV 544


>gi|336427168|ref|ZP_08607172.1| hypothetical protein HMPREF0994_03178 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336010021|gb|EGN40008.1| hypothetical protein HMPREF0994_03178 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 687

 Score = 48.1 bits (113), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 80/363 (22%), Positives = 129/363 (35%), Gaps = 66/363 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQA------DYLSHFHANTHIPI- 339
            L RLY +T + K+L L+  F      KP +      +A      D   + +   H+P+ 
Sbjct: 225 ALVRLYEVTGEDKYLNLSRFFVDQRGTKPYYYDTEHPEAVKKGHEDEQRYSYNQAHLPVR 284

Query: 340 ----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE--- 381
                +G  +R             +TGD           D +     Y TGG  A     
Sbjct: 285 EQDEAVGHAVRAVYLYSGMADVARLTGDEALLEACEKLWDNITQKKMYITGGIGATHMGE 344

Query: 382 ---FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
              F +D         S   ETC +  ++  +R +        YAD  E+AL NG+LS  
Sbjct: 345 AFSFNYDLPN-----DSAYAETCASIGLVFFARRMLEIKASSKYADVMEKALYNGILS-G 398

Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFN-----SFW----CCYGTGIESFSKLGDS 489
              +     Y+ PL    S   + H    KF+       W    CC        S +   
Sbjct: 399 MALDGKSFFYVNPLE---SLPEACHKDERKFHVKPVRQKWFGCACCPPNIARLLSSIASY 455

Query: 490 IYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL 549
            Y E E     LY+  Y+ S  +   G   L+ ++     WD   ++    ++++ V   
Sbjct: 456 AYTEAED---ALYVHLYMGSVLEKDCGGKKLDIRISSDFPWDG--KVMAEINAEEPVA-- 508

Query: 550 SSLNLRMPVWTYS---NGAQASLNGQNLPLPP-----PGNFLSATERWSYNDKLTIQLPL 601
             L  R+P W  S   NG +    G+ +            +L     W+  +KL +  P+
Sbjct: 509 CRLAFRIPGWCSSYTLNGQKGLEEGETVTADGETRQVKDGYLIIDRVWNGGEKLELDFPM 568

Query: 602 SLR 604
            +R
Sbjct: 569 EVR 571


>gi|315644006|ref|ZP_07897176.1| hypothetical protein PVOR_00550 [Paenibacillus vortex V453]
 gi|315280381|gb|EFU43670.1| hypothetical protein PVOR_00550 [Paenibacillus vortex V453]
          Length = 653

 Score = 48.1 bits (113), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 109/511 (21%), Positives = 196/511 (38%), Gaps = 79/511 (15%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFEALK 223
           V  +L A+A   A   +  ++E++  ++  ++  Q     GYL+ + T  E    +  L 
Sbjct: 79  VAKWLEAAAYSLAIHPDPKLEEQVDQLIDLVAAAQQP--DGYLNTYFTVKEPEKRWTNLT 136

Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
                Y   H + AG+   Y+     + L +   + +Y    +  V      + H +  +
Sbjct: 137 DCHELYCAGHMMEAGVA-HYLATGKRKLLDVVCRLADY----IDSVFGPEDGKIHGFDGH 191

Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFL----------GFLALQADYL 328
           +E   +   L +LY +T +P++L L+  F      +P F            F +  A+  
Sbjct: 192 QE---IELALVKLYEVTREPRYLSLSQYFIDVRGTEPHFFLQEWEQRGRKSFYSSVANPP 248

Query: 329 SHFHANTHIPI-----VIGSQMRY-----------EVTGDP-LYKLIGTFFMDIVNASHS 371
              +  +H+P+      +G  +R              T DP L +     + ++V+    
Sbjct: 249 HLPYHQSHLPVREQREAVGHSVRAVYMYTAMADLAARTKDPALLEACENLWFNMVH-KQM 307

Query: 372 YATGGTSA----REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG  +      F  D     DT+ +E   TC +  ++  +R +     +  YAD  E
Sbjct: 308 YITGGIGSTHHGEAFTTDYDLPNDTVYAE---TCASIGLIFFARRMLELAPKSEYADVME 364

Query: 428 RALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFN---------SFWCCY 476
           RAL N V+    Q G       Y+ PL    +  R   G   KF+         +  CC 
Sbjct: 365 RALFNTVIGSMAQDGRH---FFYVNPLEVWPAACRHNPG---KFHVKPVRPGWFACACCP 418

Query: 477 GTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM 536
                  S LG+ +Y   E     LY   Y+      + G V +    +  + W+    +
Sbjct: 419 PNVARLLSSLGEYVYTMNEDT---LYTHLYMGGEASVQFGDVPVKVIQNSALPWNG--DV 473

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDK 594
           TLT   ++ V    ++ LRMP W+    A   LNG+++ +       ++     W+  D 
Sbjct: 474 TLTIQPEKAVEW--TVALRMPDWSRGK-ADLRLNGEDVSIEDVMKDGYVYIKRVWAPGDT 530

Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
           L ++L + +       +    A   AI  GP
Sbjct: 531 LELELSMEIHQVRANPNIRANAGKAAIQRGP 561


>gi|423223926|ref|ZP_17210395.1| hypothetical protein HMPREF1062_02581 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392637372|gb|EIY31243.1| hypothetical protein HMPREF1062_02581 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 820

 Score = 48.1 bits (113), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 95/447 (21%), Positives = 168/447 (37%), Gaps = 92/447 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L +A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 229 ALAKLYKVTGDEKYLKMAKYFVEETGRG---TDGHRLSEY-SQDHKPILQQDEIVGHAVR 284

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    +   + + +   + TGG  +R     P+   +  G 
Sbjct: 285 AGYLYSGVADVASLTQDTAYFNALSRIWENMASKKLFITGGIGSR-----PQ--GEGFGP 337

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     +  +  +F  T    YAD  ERAL NGV+S       GV +
Sbjct: 338 NYELNNHTAYCETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVIS-------GVSL 390

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +      +G       CC G  +  F        +  +GN   +
Sbjct: 391 SGDKFFYDNPL-ESMGQHERQQWFGCA-----CCPGN-VTRFMASVPFYMYATQGN--DI 441

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVS--WDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           Y+  YI S  +  +     N K++ I +  WD  + +++    +QE     +L +R+P W
Sbjct: 442 YVNLYIQSKAELNTE--TNNVKLEQITTYPWDGKVSISVNPEKEQEF----ALRVRIPGW 495

Query: 560 -----------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
                      ++++ A+A   S+NG+ +       + +    W   D + I  P+ +R 
Sbjct: 496 AQDAPVPTDLYSFTDKAKAYTISINGKKVNATQLDGYATILHDWKTGDIVEINFPMDVRR 555

Query: 606 EAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFT 665
               D+  +     AI  GP +       + D         S + +   P   +   T+ 
Sbjct: 556 VKANDNVEDDRGKLAIERGPIMFCLEGKDQVD---------SIVFNKFIPDGTSMEATYD 606

Query: 666 QESGNSTFVMSNSNQSI----TMEEFP 688
            +  N   V++ + + I    +M+E P
Sbjct: 607 ADLLNGVMVLTGTAKEIEKDGSMKEVP 633


>gi|405380414|ref|ZP_11034253.1| hypothetical protein PMI11_04239 [Rhizobium sp. CF142]
 gi|397323106|gb|EJJ27505.1| hypothetical protein PMI11_04239 [Rhizobium sp. CF142]
          Length = 642

 Score = 48.1 bits (113), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 81/349 (23%), Positives = 132/349 (37%), Gaps = 57/349 (16%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFHANT------HIPI 339
            L +L  +T + K+L L+  F      +P F    A +     + FH  T      H+P+
Sbjct: 198 ALVKLARVTGEKKYLDLSKFFVDERGTEPHFFTDEATRDGRSAADFHQKTYEYGQAHLPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 258 REQKKVVGHAVRAMYLYAGMADIATEYNDDTLTAALETLWDDLTT-KQMYVTGGIGPAAS 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--I 437
            E + D   L +   S   ETC +  ++  +  +        YAD  E+AL NG ++   
Sbjct: 317 NEGFTDYYDLPNE--SAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMAGLS 374

Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
             GT      Y  PL      A   H W   ++   CC        + +G  +Y   E  
Sbjct: 375 LDGTR---FFYENPL----ESAGKHHRW--IWHHCPCCPPNIARLLASVGSYMYAIAEDE 425

Query: 498 VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
           +  +++     + FD     V L+Q+      WD  +   LT           +L+LR+P
Sbjct: 426 I-AVHLYGESKARFDLAGAKVELSQQTR--YPWDGAIHFDLTLDRPAHF----ALSLRIP 478

Query: 558 VWTYSNGAQASLNGQNLPLPPPG--NFLSATERWSYNDKLTIQLPLSLR 604
            W  + G   S+NG+ L L       +      W   DK+ + +PL+ R
Sbjct: 479 EW--AEGVALSVNGEKLDLQSTTVEGYARIERDWKSGDKVDLSIPLAAR 525


>gi|398351289|ref|YP_006396753.1| cytoplasmic protein [Sinorhizobium fredii USDA 257]
 gi|390126615|gb|AFL49996.1| putative cytoplasmic protein [Sinorhizobium fredii USDA 257]
          Length = 937

 Score = 48.1 bits (113), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 84/370 (22%), Positives = 149/370 (40%), Gaps = 53/370 (14%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL--SHFHANTHIPI 339
            L +L  +T + K+L L+  F      +P F    A++      DY+  +H ++ +H P+
Sbjct: 493 ALVKLARVTGETKYLDLSKFFIDERGQEPHFFTEEAIRDGRSPKDYVHKTHEYSQSHEPV 552

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGGT--SAR 380
                V+G  +R             E   D L   + T + D+      Y TGG   SAR
Sbjct: 553 RQQKKVVGHAVRAMYMYSGMADLATEYKDDTLTDALETLWDDLTT-KQMYVTGGIGPSAR 611

Query: 381 -EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +        +AD  E+AL NG LS   
Sbjct: 612 NEGFTDYYDLPND--TAYAETCASVALVFWASRMLGRGPNRRFADIMEQALYNGALS-GL 668

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
             +     Y  PL          H W  ++++  CC        + +G  +Y      + 
Sbjct: 669 SLDGKTFFYDNPL----ESTGKHHRW--RWHNCPCCPPNIARLVASVGAYMYGVATDEI- 721

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            +++    ++  +    +V L Q  +    W+  + + L     ++     +L+LR+P W
Sbjct: 722 AVHLYGESTARLELDGSNVTLRQVTN--YPWEGAVSIRLELEEPRQF----ALSLRIPEW 775

Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
             ++GA  S+NG  + L       +      WS  D ++I LPL LR +       + A 
Sbjct: 776 --ADGASISVNGSGIDLEHVTLDGYARIEREWSDGDAVSIDLPLKLRPQFANPKVRQDAG 833

Query: 618 IQAILFGPYL 627
             A+L GP +
Sbjct: 834 RIALLRGPLV 843


>gi|251797630|ref|YP_003012361.1| hypothetical protein Pjdr2_3643 [Paenibacillus sp. JDR-2]
 gi|247545256|gb|ACT02275.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
          Length = 645

 Score = 48.1 bits (113), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 73/322 (22%), Positives = 122/322 (37%), Gaps = 32/322 (9%)

Query: 327 YLSHFHANTHIPIVIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATG 375
           Y SH       P+ +G  +R             +TGD   +               Y TG
Sbjct: 241 YQSHLPVREQ-PVAVGHAVRAVYLYTAMADLARLTGDVKLREACERLWANTTGKQMYITG 299

Query: 376 GTSA----REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
           G  A      F +D     D + +E   TC +  ++  +R + +   +  YAD  ERAL 
Sbjct: 300 GIGATHLGEAFTFDHDLPNDIVYAE---TCASIGLIFWARRMLQLEAKSEYADVMERALY 356

Query: 432 NGVLSIQRGTEPGVMIYMLPLGR-GVSKARSTHGWGTK-FNSFW----CCYGTGIESFSK 485
           N VL      +     Y+ PL     + A+S   +  K     W    CC          
Sbjct: 357 NNVLG-SMAKDGKHFFYVNPLEVWPEASAKSPDKFHVKPVRQKWFGCSCCPPNVARLLGS 415

Query: 486 LGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQ 544
           L + IY   E+G+   +++      +F+ +   +VLNQK +  + W+   ++    S ++
Sbjct: 416 LDEYIYDVSEDGSTVRVHLFIGSEVAFETEGKKIVLNQKSE--LPWNG--QVEFKVSLQE 471

Query: 545 EVGQLS-SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
           + G +   L LR+P W  S  A   +NG+ +       + +    W   D++   LP+  
Sbjct: 472 DKGDVPFMLALRIPNWFSSKEALLKINGETVRYHVDKGYATVYRVWQDGDRVEWLLPIET 531

Query: 604 RTEAIQDDRPEYASIQAILFGP 625
           +  A        A   AI  GP
Sbjct: 532 QLIAANPLIRADAGKAAIQRGP 553


>gi|392965453|ref|ZP_10330872.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
 gi|387844517|emb|CCH52918.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
          Length = 650

 Score = 48.1 bits (113), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 86/400 (21%), Positives = 144/400 (36%), Gaps = 87/400 (21%)

Query: 293 LYRLYSITHDPKHLLLAH-------------LFDKPCFLGFLALQADYLSHFHANTHIPI 339
           L +LY +T+D ++L  A              LF  P   G     + YL      T    
Sbjct: 217 LVKLYRVTNDKRYLDFARFLLDMRGRSDKRELFPDPSRTGN---GSQYLQDHQPVTQQRE 273

Query: 340 VIGSQMR----YEVTGDPLYKLIGTFFMDIVNA-------SHSYATGGTSAREFWWDPKR 388
            +G  +R    Y    D         ++D + A          Y TGG  ARE       
Sbjct: 274 AVGHAVRAGYMYAAMTDIAAIQQDKAYLDALMAIWNDVVERKQYLTGGLGAREH------ 327

Query: 389 LADTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRG 440
             +  G+  E        ETC     L  +  +F  T +  Y D +ER L NG L+    
Sbjct: 328 -GEAFGNAYELPNDVAYAETCAAVANLLWNHRMFLLTGQSKYMDVFERVLYNGFLA-GVS 385

Query: 441 TEPGVMIYMLPLGR--------GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
            E     Y+ PL          GV+  R+   +GT      CC    +     L   +Y 
Sbjct: 386 LEGDKFFYVNPLASDGKRKFNVGVAAERAPW-FGTS-----CCPTNVVRFLPSLPGYVYA 439

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
            +  +V   ++  ++++S +   G   +  +      WD  + MT++  + Q       L
Sbjct: 440 VKNNDV---FVNLFLTNSSELTVGKTPVQVQQQTNYPWDGAVTMTVSPRNAQAF----DL 492

Query: 553 NLRMPVWTYSN-------------GAQASL--NGQNLPLPPPGNFLSATERWSYNDKLTI 597
            +R+P WT                GA  SL  NG+ +P+     +   +  W   D++ +
Sbjct: 493 LVRIPGWTLGKPMPGNLYSYRRNIGATPSLKVNGKAVPVKMDNGYARISRTWKPGDRVEL 552

Query: 598 QLPLSLR----TEAIQDDRPEYASIQAILFGPYLLAGHTS 633
           ++ + +R     + ++DD    A   AI  GP +     +
Sbjct: 553 RMEMPVREVIANQQVKDD----AGRVAIERGPIVYCAEAA 588


>gi|281424179|ref|ZP_06255092.1| conserved hypothetical protein [Prevotella oris F0302]
 gi|281401448|gb|EFB32279.1| conserved hypothetical protein [Prevotella oris F0302]
          Length = 638

 Score = 48.1 bits (113), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 56/257 (21%), Positives = 98/257 (38%), Gaps = 20/257 (7%)

Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
           TG  ++ E W+  K L        +ETC T   +K+SR L   T    YAD  E +  N 
Sbjct: 308 TGSGASMESWFGGKHLQYMPIRHFQETCVTATWIKLSRQLLLLTGNTKYADAVEISFYNA 367

Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
           +L   R T+        PL           G G       CC  +G      +  +    
Sbjct: 368 LLGAMR-TDASDWAKYTPLSGQRLPGSEQCGMGLN-----CCNASGPRGLFVIPQTAVLT 421

Query: 494 EEGNVPGLYIIQYISSSFDWKS-GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
              +  G+ +  YI+  +   +  H  +  K++     +  +   L+    + +    ++
Sbjct: 422 ---SAKGVDVNLYIAGDYKLTTPRHQQMVLKLEGEYPKNNKMSFLLSLKKAENI----TI 474

Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
            LR+P W  S   +  +N   +     G +L  +  W + D+++I+  +      +    
Sbjct: 475 RLRIPEW--STATKVIVNDVAVEHVQAGKYLELSRTWHHGDRISIEFDMPGIVHRL-GQH 531

Query: 613 PEYASIQAILFGPYLLA 629
           PEY    AI  GP +LA
Sbjct: 532 PEYV---AITRGPIVLA 545


>gi|150376304|ref|YP_001312900.1| hypothetical protein Smed_4162 [Sinorhizobium medicae WSM419]
 gi|150030851|gb|ABR62967.1| protein of unknown function DUF1680 [Sinorhizobium medicae WSM419]
          Length = 640

 Score = 47.8 bits (112), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 71/298 (23%), Positives = 121/298 (40%), Gaps = 37/298 (12%)

Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTY 404
           E   D L   + T + D+V     Y TGG    ++ E + D   L +   +   ETC + 
Sbjct: 283 EYKDDSLTAALETLWDDLVT-KQMYVTGGIGPAASNEGFTDYYDLPND--TAYAETCASV 339

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPLGRGVSK 458
            ++  +  +     +  YAD  E+AL NG L       PG+ I      Y  PL    S 
Sbjct: 340 GLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGKTFFYDNPL---EST 389

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG-H 517
            R  H W  K++   CC        + +G  +Y   E  +  +++    ++     +G  
Sbjct: 390 GRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESAARLKLANGAE 445

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
           V L Q  +    WD      + F+++ +     +L+LR+P W  + GA  S+NG  L L 
Sbjct: 446 VELRQATN--YPWD----GAIAFTARLDRPARFALSLRIPEW--AAGATLSVNGSMLDLS 497

Query: 578 P--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
                 +      WS  D++ + LPL+LR +       +     A++ GP +     +
Sbjct: 498 AHLADGYARIEREWSDGDRVALYLPLTLRPQYANPKVRQDVGRVALMRGPLVYCAEAA 555


>gi|320161641|ref|YP_004174866.1| hypothetical protein ANT_22400 [Anaerolinea thermophila UNI-1]
 gi|319995495|dbj|BAJ64266.1| hypothetical protein ANT_22400 [Anaerolinea thermophila UNI-1]
          Length = 664

 Score = 47.8 bits (112), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 61/237 (25%), Positives = 101/237 (42%), Gaps = 41/237 (17%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-GRGVS 457
           ETC     +  +  L + T +  Y++ +E  L N   S+  G +    +Y  PL  RG  
Sbjct: 353 ETCAALASMFWNWELAQITGKARYSELFEWQLYNAA-SVGMGLDGTTYLYNNPLTCRGGV 411

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS-- 515
           + R        + +  CC      +F+ LGD +Y  + G    LY+ QY+SS    +   
Sbjct: 412 ERRP-------WYAVPCCPSNLSRTFAWLGDYLYSAKPGR---LYVHQYLSSDLPAQEIP 461

Query: 516 ----GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN--LRMPVWTYSNGAQASL 569
                 V L+ ++D  + W  ++ + L      +  Q + L   LR+P W  +   + +L
Sbjct: 462 CANGNRVRLSLQMDSQLPWHGHVVLRLRRWEVLDPDQPAPLEILLRLPSW--AENPRLTL 519

Query: 570 NGQNL----PLP-----PPGN--------FLSATERWSYNDKLTIQ--LPLSLRTEA 607
           NGQ L    P P     PP +        FL  ++ W+  D L ++  LP+ LR  A
Sbjct: 520 NGQPLFLQIPQPQQDGEPPADGYDPRQAVFLPLSQPWAEGDTLELRFDLPIRLRHAA 576


>gi|284039567|ref|YP_003389497.1| hypothetical protein Slin_4720 [Spirosoma linguale DSM 74]
 gi|283818860|gb|ADB40698.1| protein of unknown function DUF1680 [Spirosoma linguale DSM 74]
          Length = 655

 Score = 47.8 bits (112), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 87/378 (23%), Positives = 138/378 (36%), Gaps = 73/378 (19%)

Query: 293 LYRLYSITHDPKHLLLAH-------------LFDKPCFLGFLALQADYLSHFHANTHIPI 339
           L +LY +T+D ++L  A              LF  P   G     A YL      T    
Sbjct: 216 LVKLYRVTNDKRYLDFARFLLDMRGRADKRPLFPDPAKTG---QGASYLQDHLPVTQQKT 272

Query: 340 VIGSQMR----YEVTGDPLYKLIGTFFMDIVNA-------SHSYATGGTSAR---EFWWD 385
            +G  +R    Y    D         +MD + A          Y TGG  AR   E + +
Sbjct: 273 AVGHSVRAGYMYAAMSDIAAIQKDKAYMDALLAIWNDVVERKQYLTGGLGARGHGEAFGE 332

Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
              L + +     ETC     +  +  +F  T E  Y D +ER L NG L+     E   
Sbjct: 333 AYELPNDVAYA--ETCAAVANMLWNHRMFLLTGESKYMDVFERVLYNGFLA-GVSLEGDS 389

Query: 446 MIYMLPLGR------GVSKARSTHGW-GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
             Y+ PL         V +A +   W GT      CC    +     L   +Y  +  N+
Sbjct: 390 FFYVNPLASDGKRKFNVGQAATRAPWFGTS-----CCPTNVVRFLPSLPGYVYATKGDNL 444

Query: 499 -PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
              L++      S + KS  V + Q+ +    WD  + +T+    + ++ Q  ++ LR+P
Sbjct: 445 FINLFLTNQSKLSVNGKS--VQIRQETN--YPWDGNVAITV----QPKLAQTFTIQLRLP 496

Query: 558 VWT-----------YSNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
            W            Y N    +    +NG+ +P      +   +  W   D+L   L + 
Sbjct: 497 GWASGTPMPGYLYEYVNTTAKTPVLLVNGKPVPYKIENGYARISRTWKPGDRLEWTLDMP 556

Query: 603 LR----TEAIQDDRPEYA 616
           +R     E + DDR + A
Sbjct: 557 VREVKANEQVTDDRKKVA 574


>gi|322831792|ref|YP_004211819.1| hypothetical protein Rahaq_1069 [Rahnella sp. Y9602]
 gi|321166993|gb|ADW72692.1| protein of unknown function DUF1680 [Rahnella sp. Y9602]
          Length = 657

 Score = 47.8 bits (112), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 53/221 (23%), Positives = 82/221 (37%), Gaps = 34/221 (15%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC +  ++  +  + +   +  YAD  ERAL N VL+     +     Y+ PL      
Sbjct: 339 ETCASIGLMMFANRMLQMDSDSRYADVMERALYNTVLA-GMALDGKHFFYVNPL------ 391

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H     FN  +              CC        + LG  IY +      G+ I 
Sbjct: 392 --EVHPKSIPFNHIYDHVKPVRQRWFGCACCPPNIARLLASLGHYIYTQRPD---GVDIN 446

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            YI S  +   G   L  K      W   + + +      E    ++L LR+P W  S  
Sbjct: 447 LYIGSDVEATIGGKALRLKQSGGYPWAEGVLIEIDTDQPLE----ATLALRLPDWCVS-- 500

Query: 565 AQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
            Q +LNG  L L       +L  T+ W   D++ + LP+ +
Sbjct: 501 PQVTLNGNPLELTSLTQRGYLRLTQEWQKGDRIEMMLPMPV 541


>gi|448238166|ref|YP_007402224.1| AraN-like protein [Geobacillus sp. GHH01]
 gi|445207008|gb|AGE22473.1| AraN-like protein [Geobacillus sp. GHH01]
          Length = 643

 Score = 47.8 bits (112), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 78/359 (21%), Positives = 129/359 (35%), Gaps = 70/359 (19%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGF----------LALQADYLSHFHANTHI 337
           L +LY +T +  +L L+  F      +P +  +               DY  H     HI
Sbjct: 193 LLKLYEVTGNESYLKLSQYFIDQRGQQPHYFDWEKKARGETKPFWFHDDYRYH---QAHI 249

Query: 338 PI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSA-- 379
           P+      +G  +R              TGD   K       + V     Y TGG  +  
Sbjct: 250 PVREQKQAVGHAVRALYMYTAMAGLAAKTGDESLKQACQTLWENVTKRQMYITGGVGSSA 309

Query: 380 --REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
               F +D     DT  +E   TC +  ++  +R +     +  YAD  ERAL NG +S 
Sbjct: 310 FGESFTFDFDLPNDTAYAE---TCASIALVFWARRMLELETDGKYADVMERALYNGTIS- 365

Query: 438 QRGTEPGVMIYMLPL-----------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKL 486
               +     Y+ PL            R V   R       K+ S  CC        + +
Sbjct: 366 GMDLDGKKFFYVNPLEVWPKACERHDKRHVKPVRQ------KWFSCACCPPNLARLIASI 419

Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV 546
           G  IY +       L++  Y+ S    + G   +    +    WD  +R+T+   S  E 
Sbjct: 420 GHYIYSQ---TSDALFVHLYVGSDIRTELGGRSVEIVQETNYPWDGTVRLTVLPESAGEF 476

Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
               ++ LR+P W    GA  ++NG+ + + P     +      W   D++ +  P+ +
Sbjct: 477 ----TIGLRIPGW--CRGATLTINGEKVDMVPLIQKGYAYIKRIWKKGDQVELVFPMPV 529


>gi|384256908|ref|YP_005400842.1| hypothetical protein Q7S_05115 [Rahnella aquatilis HX2]
 gi|380752884|gb|AFE57275.1| hypothetical protein Q7S_05115 [Rahnella aquatilis HX2]
          Length = 657

 Score = 47.8 bits (112), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 53/221 (23%), Positives = 82/221 (37%), Gaps = 34/221 (15%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC +  ++  +  + +   +  YAD  ERAL N VL+     +     Y+ PL      
Sbjct: 339 ETCASIGLMMFANRMLQMDSDSRYADVMERALYNTVLA-GMALDGKHFFYVNPL------ 391

Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
               H     FN  +              CC        + LG  IY +      G+ I 
Sbjct: 392 --EVHPKSIPFNHIYDHVKPVRQRWFGCACCPPNIARLLASLGHYIYTQRPD---GVDIN 446

Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            YI S  +   G   L  K      W   + + +      E    ++L LR+P W  S  
Sbjct: 447 LYIGSDVEATIGGKALRLKQSGGYPWAEGVLIEIDTDQPLE----ATLALRLPDWCAS-- 500

Query: 565 AQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
            Q +LNG  L L       +L  T+ W   D++ + LP+ +
Sbjct: 501 PQVTLNGNPLELTSLTQRGYLRLTQEWQKGDRIEMMLPMPV 541


>gi|383777979|ref|YP_005462545.1| hypothetical protein AMIS_28090 [Actinoplanes missouriensis 431]
 gi|381371211|dbj|BAL88029.1| hypothetical protein AMIS_28090 [Actinoplanes missouriensis 431]
          Length = 640

 Score = 47.8 bits (112), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 69/309 (22%), Positives = 116/309 (37%), Gaps = 39/309 (12%)

Query: 369 SHSYATGGTSAR---EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
           S +Y TGG  +R   E + D   L         ETC      ++   L   T    YAD 
Sbjct: 288 SRTYLTGGQGSRHRDEAYGDAYELPPD--RAYAETCAAIASFQLGFRLLLATGSAKYADE 345

Query: 426 YERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK---ARSTHGWGTKFNSFWCCYGTGIES 482
            ER L N + +     +     Y  PL R         +  G    +    CC      +
Sbjct: 346 MERVLYNAI-AASTAVDGKAFFYSQPLQRRTGHDGGGENAPGHRLDWYECACC----PPN 400

Query: 483 FSKLGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFS 541
            ++L  S++ +   G+  GL +  Y S +F   +  V    +V+    WD  + +T+T S
Sbjct: 401 LARLMASLHTYAATGDAGGLELHLYGSGTFTSANRSV----EVETRYPWDEQITVTVTSS 456

Query: 542 SKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQL 599
                    +L+LR+P W   +  + ++NG   P  P     +L     W   D++ + L
Sbjct: 457 PDDPW----TLSLRIPAW--CDDVRLTVNGTAAPAGPQIHDGYLRLNRIWHEGDRVVLTL 510

Query: 600 PLSLRTEAIQDDRPEYASIQAILFGPYL-------------LAGHTSGEWDIKTGTARSL 646
            +  R  A            A++ GP +              AGH   + ++ TG+  S+
Sbjct: 511 AMPARLVAAHPRVDATRGTAALVRGPIVHCLEHADIPATGPFAGHCFEDLELDTGSPVSV 570

Query: 647 SALISPIPP 655
           +   S + P
Sbjct: 571 AYHSSGLAP 579


>gi|423299822|ref|ZP_17277847.1| hypothetical protein HMPREF1057_00988 [Bacteroides finegoldii
           CL09T03C10]
 gi|408473631|gb|EKJ92153.1| hypothetical protein HMPREF1057_00988 [Bacteroides finegoldii
           CL09T03C10]
          Length = 698

 Score = 47.8 bits (112), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 58/216 (26%), Positives = 93/216 (43%), Gaps = 16/216 (7%)

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
           + + ETC     +  +  +   T +  YAD  E  L N VLS     +     Y  PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434

Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
             +    T  W    T++ S +CC    + +  +  +  Y    EG    LY    ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
           +  K G V L Q+ D    WD  +R+TL   + ++ G   SL LR+P W     A  ++N
Sbjct: 495 WKGK-GEVALTQETD--YPWDGNVRVTLD-KAPRKAGTF-SLFLRIPEW--CEKATLTVN 547

Query: 571 GQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
           GQ L +    N  +   R W   D  +L + +P+ L
Sbjct: 548 GQPLQVNAKANSYAEVNRAWKKGDVVELVMNMPVRL 583


>gi|150009918|ref|YP_001304661.1| hypothetical protein BDI_3335 [Parabacteroides distasonis ATCC
           8503]
 gi|423333683|ref|ZP_17311464.1| hypothetical protein HMPREF1075_03115 [Parabacteroides distasonis
           CL03T12C09]
 gi|149938342|gb|ABR45039.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
 gi|409226993|gb|EKN19895.1| hypothetical protein HMPREF1075_03115 [Parabacteroides distasonis
           CL03T12C09]
          Length = 617

 Score = 47.4 bits (111), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 48/212 (22%), Positives = 93/212 (43%), Gaps = 24/212 (11%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-GRGVS 457
           ETC +  M+  ++ + ++T +  Y D  ER++ NG L+     E     Y+ PL  +G  
Sbjct: 334 ETCASVGMVLWNQRMNQFTGDSKYIDVLERSMYNGALA-GISLEGDRFFYVNPLESKGDH 392

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
             ++ +G         CC          +G+ IY         +++  YI +S +  + +
Sbjct: 393 HRQAWYGCA-------CCPSQISRFLPSIGNYIYGTSN---EAIWVNLYIGNSTEINTDN 442

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSS--KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
             +  + +    WD  +++T+T S+  K+E+       LR+P W        S+NGQ + 
Sbjct: 443 TNVTLRQETNYPWDGTVKLTVTPSNPLKKEI------RLRIPSWCEQ--YTLSVNGQLVK 494

Query: 576 LPPPGNFLSATERWSYND--KLTIQLPLSLRT 605
            P    +    + W   D   L++++P+ L T
Sbjct: 495 APTEKGYAVLNKEWKQGDVISLSMEMPVKLMT 526


>gi|227820086|ref|YP_002824057.1| hypothetical protein NGR_b18560 [Sinorhizobium fredii NGR234]
 gi|227339085|gb|ACP23304.1| putative cytoplasmic protein [Sinorhizobium fredii NGR234]
          Length = 640

 Score = 47.4 bits (111), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 77/370 (20%), Positives = 149/370 (40%), Gaps = 53/370 (14%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL--SHFHANTHIPI 339
            L +L  +T + K+L L+  F      +P F    A++      DY+  +H ++ +H P+
Sbjct: 196 ALVKLARVTGEKKYLALSKFFIDERGQEPHFFTEEAIRDGRSPKDYIQKTHEYSQSHEPV 255

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L + + T + D+      Y TGG   ++ 
Sbjct: 256 RRQKKVVGHAVRAMYMYSGMADLATEYKDDTLTEALETLWDDLTT-KQMYVTGGIGPSAK 314

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   +   ETC +  ++  +  +        +AD  E+AL NG +S   
Sbjct: 315 NEGFTDYYDLPND--TAYAETCASVALVFWASRMLGRGPNRRFADIMEQALYNGAIS-GL 371

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
             +     Y  PL          H W  K+++  CC        + +G  +Y      + 
Sbjct: 372 SLDGKTFFYDNPL----ESTGKHHRW--KWHNCPCCPPNIARLVASVGAYMYGVAADEI- 424

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            +++    +   +     V L Q  +    W+  + + +     +      +L+LR+P W
Sbjct: 425 AVHLYGESTVRLELGGSQVTLRQVTN--YPWEGAVSIRIELDEPRHF----ALSLRIPEW 478

Query: 560 TYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
             ++GA+ ++NG ++ L       +      WS  D++++ LPL LR +       + A 
Sbjct: 479 --ADGARVAVNGSSIDLDGVMTDGYALIEREWSDGDEISLDLPLRLRPQYANPKVRQDAG 536

Query: 618 IQAILFGPYL 627
             A++ GP +
Sbjct: 537 RVALMRGPLV 546


>gi|299145521|ref|ZP_07038589.1| putative cytoplasmic protein [Bacteroides sp. 3_1_23]
 gi|298516012|gb|EFI39893.1| putative cytoplasmic protein [Bacteroides sp. 3_1_23]
          Length = 698

 Score = 47.4 bits (111), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 58/216 (26%), Positives = 91/216 (42%), Gaps = 16/216 (7%)

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
           + + ETC     +  +  +   T +  YAD  E  L N VLS     +     Y  PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434

Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
             +    T  W    T++ S +CC    + +  +  +  Y    EG    LY    ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494

Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             WK  G V L Q+ D    WD  +R+TL     ++VG   SL LR+P W     A   +
Sbjct: 495 --WKEKGEVALTQETD--YPWDGNVRVTLD-KVPRKVGTF-SLFLRIPEW--CEKATLRV 546

Query: 570 NGQNLPLPPPGNFLSATER-WSYNDKLTIQLPLSLR 604
           NGQ L +    N  +   R W   D + + + + +R
Sbjct: 547 NGQPLQVNAKANSYAEVNRAWKKGDIVELMMDMPVR 582


>gi|417487787|ref|ZP_12172639.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Rubislaw str. A4-653]
 gi|353632529|gb|EHC79566.1| secreted protein [Salmonella enterica subsp. enterica serovar
           Rubislaw str. A4-653]
          Length = 663

 Score = 47.4 bits (111), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 87/401 (21%), Positives = 140/401 (34%), Gaps = 69/401 (17%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
            L RLY +T  P+++ LA  F      +P F      +    S++H             +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251

Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
             H+PI      IG  +R+            ++ D   +         +     Y TGG 
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311

Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
              +S   F  D     D++ +E   +C +  ++  +R +     +  YAD  ERA    
Sbjct: 312 GSQSSGEAFSCDYDLPNDSIYAE---SCASIGLMMFARRMLEMEADSQYADVMERAREYA 368

Query: 434 -VLSIQRGTEPGVMIYMLPLGRGV--SKARSTHGWGTKFNSFW--------------CCY 476
            V+   R     V+  M   G+          H    KFN  +              CC 
Sbjct: 369 DVMERARALYNTVLGGMALDGKHFFYVNPLEVHPKSLKFNHIYDHVKPIRQRWFGCACCP 428

Query: 477 GTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM 536
                  + LG  IY         LYI  Y+ +S +    +  L  ++     W  + ++
Sbjct: 429 PNIARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQV 483

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLT 596
            +   S Q V    +L LR+P W     A+ +LNG  +       +L     W   D +T
Sbjct: 484 KIAIDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTIT 539

Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
           + LP+ +R           A   AI  GP  Y L    +GE
Sbjct: 540 LTLPMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 580


>gi|251797570|ref|YP_003012301.1| hypothetical protein Pjdr2_3583 [Paenibacillus sp. JDR-2]
 gi|247545196|gb|ACT02215.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
          Length = 674

 Score = 47.4 bits (111), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 77/299 (25%), Positives = 114/299 (38%), Gaps = 22/299 (7%)

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
           HA     +  G    Y  TG+  Y        D ++   S+ TGG  A     D K  A+
Sbjct: 292 HAVRATLLYTGLTALYLCTGEVPYLETAKKLWDNISHQKSHVTGGVGAVHH--DEKFGAN 349

Query: 392 TLGSENE--ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
               +N   ETC    M   S +LF  T E  Y D  E  + N VL+  R  +     Y 
Sbjct: 350 YELPDNGYLETCAGVGMGFFSWNLFLATGESRYIDKLETIIYNIVLA-GRSMDGHKYFYE 408

Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
            PL   VSK      W  +++S  CC    ++   +L   IY  +     G +I  YI S
Sbjct: 409 NPL---VSKGGHNR-W--EWHSCPCCPPMIMKLMPELASYIYAYDG---KGAFINLYIGS 459

Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             +   G V +  K      W   + +T+T     E      L LR+P W      + + 
Sbjct: 460 ESELLIGDVPVTVKQQTNYPWSGAVGITVTPERDAEF----DLRLRIPEWCGQYAIRVND 515

Query: 570 NGQNLPLPPPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
              N  L    N  +   R WS  D++ ++L + +    +  +   +A   AI  GP L
Sbjct: 516 QAANYELE---NGYAVLHRVWSPGDRIQLELDMPVHLVEVHPNVTTHADKAAIRRGPVL 571


>gi|336416221|ref|ZP_08596557.1| hypothetical protein HMPREF1017_03665 [Bacteroides ovatus
           3_8_47FAA]
 gi|335938952|gb|EGN00831.1| hypothetical protein HMPREF1017_03665 [Bacteroides ovatus
           3_8_47FAA]
          Length = 698

 Score = 47.4 bits (111), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 58/216 (26%), Positives = 91/216 (42%), Gaps = 16/216 (7%)

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
           + + ETC     +  +  +   T +  YAD  E  L N VLS     +     Y  PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434

Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
             +    T  W    T++ S +CC    + +  +  +  Y    EG    LY    ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494

Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             WK  G V L Q+ D    WD  +R+TL     ++VG   SL LR+P W     A   +
Sbjct: 495 --WKEKGEVALTQETD--YPWDGNVRVTLD-KVPRKVGTF-SLFLRIPEW--CEKATLRV 546

Query: 570 NGQNLPLPPPGNFLSATER-WSYNDKLTIQLPLSLR 604
           NGQ L +    N  +   R W   D + + + + +R
Sbjct: 547 NGQPLQVNAKANSYAEVNRAWKKGDIVELMMDMPVR 582


>gi|333994236|ref|YP_004526849.1| hypothetical protein TREAZ_1028 [Treponema azotonutricium ZAS-9]
 gi|333736667|gb|AEF82616.1| conserved hypothetical protein [Treponema azotonutricium ZAS-9]
          Length = 675

 Score = 47.4 bits (111), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 78/373 (20%), Positives = 126/373 (33%), Gaps = 51/373 (13%)

Query: 292 VLYRLYSITHDPKHLLLAHLFD-----------------------KPCFLGFLALQA--- 325
            L RLY +T D KHL LA  F                        K  ++ +   QA   
Sbjct: 220 ALVRLYDVTKDEKHLKLARYFIDQRGQSPLYFEEETKRNGNEFYWKDSYVKYQYYQAGKP 279

Query: 326 ---DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS---- 378
               +++  HA   + +  G      +TGD       +   + +     Y TGG      
Sbjct: 280 VRDQHIAEGHAVRAVYLYSGMADIARLTGDDTLIKSCSDLWENITQKQMYITGGIGQSAY 339

Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
              F +D     DT+ +E   TC +  +   +R +     + ++AD  E AL NG++S  
Sbjct: 340 GEAFSYDYDLPNDTVYAE---TCASIGLAFFARRMLSIAPKGSFADVLETALYNGIIS-G 395

Query: 439 RGTEPGVMIYMLPL------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
              +     Y+ PL             R   G   K+ +  CC        S LG  IY 
Sbjct: 396 MSLDGKSFFYVNPLEVIPEANEKDRIRRHVKGVRQKWFACACCPPNLARIISSLGSYIYS 455

Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
            ++     LY   +I S+   +     +  K++    W+  +R+      +   G     
Sbjct: 456 VKDN---ALYTHLFIGSTAKAQLSGKEVTVKLETSYPWEEKVRVDFQVPGE---GAKFDY 509

Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
             R+P W  S      LNG          +   +  W   D L+I   + +         
Sbjct: 510 AFRLPGWCRS--CSVELNGAKADYKKADGYAIISREWKSGDSLSIVFDMPVNFVEANPKV 567

Query: 613 PEYASIQAILFGP 625
            E +   AI  GP
Sbjct: 568 RENSGKLAITRGP 580


>gi|189464189|ref|ZP_03012974.1| hypothetical protein BACINT_00526 [Bacteroides intestinalis DSM
           17393]
 gi|189437979|gb|EDV06964.1| hypothetical protein BACINT_00526 [Bacteroides intestinalis DSM
           17393]
          Length = 801

 Score = 47.4 bits (111), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 81/347 (23%), Positives = 133/347 (38%), Gaps = 59/347 (17%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L +LY +T D K+L  A  F D+    G+ +   +Y     +  H P+V     +G  +R
Sbjct: 222 LAKLYLVTGDKKYLDQAKFFLDQ---RGYTSRTDEY-----SQAHKPVVQQDEAVGHAVR 273

Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADT 392
                        +TGD  Y        D +     Y TGG   T+A E +     L + 
Sbjct: 274 AAYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATAAGEAFGKNYELPNM 333

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
             S   ETC     + V+  LF    E  Y D  ER L NG++S     + G   Y  P+
Sbjct: 334 --SAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPM 390

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
                ++   H     F    CC          L   IY  ++ +V   Y+  ++S++ D
Sbjct: 391 -----ESMGQHQRQPWFGCA-CCPSNICRFIPSLPGYIYAVKDKDV---YVNLFMSNTSD 441

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TY 561
            K G   ++ +      W+  + + +   +K   GQ  +L +R+P W           TY
Sbjct: 442 LKVGGKAVSIEQTTQYPWNGDITIGI---NKNSAGQF-NLKVRIPGWVRGQVVPSDLYTY 497

Query: 562 SNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
           S+G +      +NG+ +       +     RW   DK+ +   +  R
Sbjct: 498 SDGKRLKYTVKVNGEAVQNELKDGYFCIDRRWKKGDKVEVHFDMEPR 544


>gi|435854457|ref|YP_007315776.1| hypothetical protein Halha_1747 [Halobacteroides halobius DSM 5150]
 gi|433670868|gb|AGB41683.1| hypothetical protein Halha_1747 [Halobacteroides halobius DSM 5150]
          Length = 655

 Score = 47.0 bits (110), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 67/298 (22%), Positives = 111/298 (37%), Gaps = 48/298 (16%)

Query: 332 HANTHIPI-----VIGSQMR------------YEVTGDPLYKLIGTFFMDIVNASHSYAT 374
           +A  H+P+     V+G  +R             E     L + +G  + ++      Y T
Sbjct: 265 YAQDHLPVREQDKVVGHAVRAMYLYCGMADVAMETKDHELIQALGNLWANMT-KKRMYVT 323

Query: 375 GGTSARE----FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERAL 430
           GG  +      F  D     DT  +E   TC     +  ++ + + T E  +AD  ER L
Sbjct: 324 GGIGSAHHNEGFTADYDLPNDTAYAE---TCAAVGSMMWNQRMLKLTGEACFADIIERTL 380

Query: 431 TNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSI 490
            NG LS    T      Y+ PL    +  R   GW        CC        + L   I
Sbjct: 381 YNGFLSGVSLT-GDKFFYVNPLESDGTHHRK--GWF----KVSCCPPNIARFLASLEKYI 433

Query: 491 YFEEEGNVPGLYIIQYIS--SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQ 548
           Y + E  +   +I QYIS           V++ Q  D    WD  + + +   +  E   
Sbjct: 434 YLKNEDCI---FINQYISGKGKVSIAEEEVIIRQ--DTAYPWDDKVNIKINLKNPSEF-- 486

Query: 549 LSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN---FLSATERWSYNDKLTIQLPLSL 603
             +L+LR+P W     A   +N Q+L +    N   +     +W   D++ ++  + +
Sbjct: 487 --TLSLRIPDWCQE--ASLQINNQSLEIESIINDNGYAQIRRKWRNGDQIRLEFAMPI 540


>gi|427384250|ref|ZP_18880755.1| hypothetical protein HMPREF9447_01788 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727511|gb|EKU90370.1| hypothetical protein HMPREF9447_01788 [Bacteroides oleiciplenus YIT
           12058]
          Length = 801

 Score = 47.0 bits (110), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 84/350 (24%), Positives = 136/350 (38%), Gaps = 61/350 (17%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQM 345
            L +LY +T   K+L  A  F D+    G+ +   +Y     +  H P+V     +G  +
Sbjct: 221 ALAKLYLVTGQQKYLDQAKFFLDQ---RGYTSRTDEY-----SQAHKPVVQQDEAVGHAV 272

Query: 346 RYE-----------VTGDPLY-KLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLA 390
           R             +TGD  Y   I   + +IV   + Y TGG   T+A E +     L 
Sbjct: 273 RAAYMYAGMADVAALTGDTAYIHAIDRIWNNIVGKKY-YITGGIGATAAGEAFGKNYELP 331

Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
           +   S   ETC     + V+  LF    E  Y D  ER L NG++S     + G   Y  
Sbjct: 332 NM--SAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPN 388

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           PL     ++   H     F    CC          L   IY  ++ +V   Y+  ++S++
Sbjct: 389 PL-----ESMGQHQRQPWFGCA-CCPSNICRFIPSLPGYIYAVKDKDV---YVNLFMSNT 439

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW----------- 559
            D K G   ++ +      W+  + + +    K   GQ  ++ +R+P W           
Sbjct: 440 SDLKVGGKAVSIEQTTKYPWNGDIAIGI---KKNNAGQF-TMKVRIPGWVRGQVVPSDLY 495

Query: 560 TYSNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           TYS+G +     ++NG+         +     RW   DK+ I   +  RT
Sbjct: 496 TYSDGKRLKYTVAVNGEPAQSELKDGYFCIDRRWKKGDKIEIHFDMEPRT 545


>gi|218260014|ref|ZP_03475493.1| hypothetical protein PRABACTJOHN_01154 [Parabacteroides johnsonii
           DSM 18315]
 gi|218224797|gb|EEC97447.1| hypothetical protein PRABACTJOHN_01154 [Parabacteroides johnsonii
           DSM 18315]
          Length = 816

 Score = 47.0 bits (110), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 85/386 (22%), Positives = 136/386 (35%), Gaps = 79/386 (20%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMRY 347
           L +LY +T D K+L +A  F +    G    + +  S      H+PI     ++G  +R 
Sbjct: 219 LAKLYKVTGDRKYLDMAKYFVEETGRGTDGHRLNAYSQ----DHMPILQQEEIVGHAVRA 274

Query: 348 -----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
                       +T D  Y        D +     Y TGG  +R          +  G E
Sbjct: 275 GYLYSGVADVAALTKDTAYFHAICRIWDNMATKKLYITGGIGSRA-------QGEGFGPE 327

Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI- 447
            E        ETC +   +  ++ +F  T +  Y D  ERAL NGV+S       GV + 
Sbjct: 328 YELHNHSAYCETCASIANVYWNQRMFLATGDAKYIDVLERALYNGVIS-------GVSLS 380

Query: 448 -----YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
                Y  PL     ++   H     F    CC G  +  F        +  +GN   LY
Sbjct: 381 GDKFFYDNPL-----ESMGQHERAPWFGCA-CCPGN-VTRFMASVPKYMYATQGN--SLY 431

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
           +  Y+ S       +  +    D    WD  +++T++           SL LR+P WT +
Sbjct: 432 VNLYVGSESRVALANDTVTLVQDTEYPWDGLVKLTVSPRKASSF----SLKLRIPSWTGN 487

Query: 563 NGAQAS----------------LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
                S                +NG  L       ++     W   D + +++P+ +R  
Sbjct: 488 EPVPGSDLYTYIKRDREPCAVFVNGTPLKEKAHHGYVVIEREWEPGDVIELRMPMDVRRV 547

Query: 607 AIQDDRPEYASIQAILFGP--YLLAG 630
              +       + A+  GP  Y L G
Sbjct: 548 KAHEKVRADQGLLAVERGPVVYCLEG 573


>gi|336251952|ref|YP_004585920.1| hypothetical protein Halxa_0515 [Halopiger xanaduensis SH-6]
 gi|335339876|gb|AEH39114.1| protein of unknown function DUF1680 [Halopiger xanaduensis SH-6]
          Length = 636

 Score = 47.0 bits (110), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 55/209 (26%), Positives = 86/209 (41%), Gaps = 26/209 (12%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGV 456
           ETC     +  +R +F  T +  YAD  ER L NG L+     GTE     Y   L    
Sbjct: 335 ETCAAIGSVFWNRRMFELTGDAKYADLIERTLYNGFLAGVSLDGTE---FFYDNRLESDG 391

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG--LYIIQYISSSFDWK 514
           S  R   GW   F+   CC       F+ L   +Y      V G  LY+ QY+ S+    
Sbjct: 392 SHGR--QGW---FDCA-CCPPNVARLFASLERYLY-----TVDGRELYVNQYVESTATPT 440

Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
                L         WD  + + +      +    ++++LR+P W   + A   +NG+  
Sbjct: 441 VDDAELEVAQTTDYPWDSEVTIDVEAPEPTQ----ATISLRVPEW--CDEASIEVNGE-- 492

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSL 603
           P+P  G+   + ER   +D++T    +S+
Sbjct: 493 PIPVDGDGYVSLERTWDDDRITATFEMSV 521


>gi|326802069|ref|YP_004319888.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326552833|gb|ADZ81218.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
          Length = 659

 Score = 47.0 bits (110), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 58/244 (23%), Positives = 93/244 (38%), Gaps = 33/244 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC +  M+  ++ +   T E  Y D  ER+L NG L            Y  PL      
Sbjct: 335 ETCASVGMVFWNQRMNLLTGEAKYFDILERSLYNGALD-GLSYSGNRFFYGNPLASHGGY 393

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS--SFDWKSG 516
            RS   +GT      CC          LGD IY   +  V   ++  ++ S  +     G
Sbjct: 394 GRS-EWFGTA-----CCPSNIARLVESLGDYIYAHSDKAV---WVNLFVGSKAAIPLSQG 444

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW---------------TY 561
            V + Q+      W   + + +T   K++      L++R+P W               T 
Sbjct: 445 TVEIAQQTG--YPWQGDVNIRVTPDRKRKF----PLHIRIPGWLLGQPAPGDTYRFLDTT 498

Query: 562 SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
            N     +NG+N+P      ++     W  ND ++IQ+PL ++  A  D      +  A+
Sbjct: 499 ENKYTLQVNGKNVPYHIEKGYVVIDRIWDKNDAVSIQMPLEVKKIAANDQVVANKNRIAL 558

Query: 622 LFGP 625
             GP
Sbjct: 559 QRGP 562


>gi|224537077|ref|ZP_03677616.1| hypothetical protein BACCELL_01954 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521304|gb|EEF90409.1| hypothetical protein BACCELL_01954 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 811

 Score = 46.6 bits (109), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 85/384 (22%), Positives = 146/384 (38%), Gaps = 79/384 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L +A  F +    G        LS + +  H PI     ++G  +R
Sbjct: 220 ALAKLYKVTGDEKYLKMAKYFVEETGRG---TDGHRLSEY-SQDHKPILQQDEIVGHAVR 275

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    +   + + +   + TGG  +R     P+   +  G 
Sbjct: 276 AGYLYSGVADVASLTQDTAYFNALSRIWENMASKKLFITGGIGSR-----PQ--GEGFGP 328

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     +  +  +F  T    YAD  ERAL NGV+S       GV +
Sbjct: 329 NYELNNHTAYCETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVIS-------GVSL 381

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL   + +      +G       CC G  +  F        +  +GN   +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQQWFGCA-----CCPGN-VTRFMASVPFYMYATQGN--DI 432

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVS--WDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           Y+  YI S  +  +     N K++ I +  WD  + +++    +QE     +L +R+P W
Sbjct: 433 YVNLYIQSKAELNTE--TNNVKLEQITTYPWDGKVSISVNPEKEQEF----ALRVRIPGW 486

Query: 560 -----------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
                      ++++ A+A   S+NG+ +       + +    W   D + I  P+ +R 
Sbjct: 487 AQDAPVPTDLYSFTDKAKAYTISINGKKVNATQLDGYATILHDWKTGDVVEINFPMDVRR 546

Query: 606 EAIQDDRPEYASIQAILFGPYLLA 629
               D+  +     AI  GP +  
Sbjct: 547 VKANDNVEDDRGKLAIERGPIMFC 570


>gi|423223921|ref|ZP_17210390.1| hypothetical protein HMPREF1062_02576 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392637419|gb|EIY31288.1| hypothetical protein HMPREF1062_02576 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 801

 Score = 46.6 bits (109), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 82/348 (23%), Positives = 133/348 (38%), Gaps = 59/348 (16%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L +LY +T D K+L  A  F D+    G+ +   +Y     +  H P+V     +G  +R
Sbjct: 222 LAKLYLVTGDQKYLDQAKFFLDQ---RGYTSRTDEY-----SQAHKPVVQQDEAVGHAVR 273

Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADT 392
                        +TGD  Y        D +     Y TGG   T+A E +     L + 
Sbjct: 274 AAYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATAAGEAFGANYELPNM 333

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
             S   ETC     + V+  LF    E  Y D  ER L NG++S     + G   Y  PL
Sbjct: 334 --SAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL 390

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
                ++   H     F    CC          L   IY  ++ +V   Y+  ++S++ D
Sbjct: 391 -----ESMGQHQRQPWFGCA-CCPSNICRFIPSLPGYIYAVKDKDV---YVNLFMSNTSD 441

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TY 561
            K G   ++ +      W+  + + +   +K   G   +L +R+P W           TY
Sbjct: 442 LKVGGKAVSIEQTTKYPWNGDITIGI---NKNSAGPF-NLKVRIPGWVRGQVVPSDLYTY 497

Query: 562 SNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           S+G +      +NG+ +       +     RW   DK+ +   +  RT
Sbjct: 498 SDGKRLKYTVKVNGEAVQNELKDGYFCIDRRWKKGDKVEVHFDMEPRT 545


>gi|409439808|ref|ZP_11266847.1| conserved hypothetical protein [Rhizobium mesoamericanum STM3625]
 gi|408748645|emb|CCM78028.1| conserved hypothetical protein [Rhizobium mesoamericanum STM3625]
          Length = 637

 Score = 46.6 bits (109), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 79/348 (22%), Positives = 133/348 (38%), Gaps = 53/348 (15%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFHANT------HIPI 339
            L +L  +T + K+L LA  F      +P F    A++     + FH  T      H P+
Sbjct: 198 ALVKLARVTGEKKYLDLAKFFIDERGTEPHFFTEEAIRDGRSAADFHQKTYEYGQAHQPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    +A
Sbjct: 258 REQKKVVGHAVRAMYLYSGMADIATEYDDDSLTGALETLWDDLTT-KQMYVTGGIGPAAA 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   S   ETC +  ++  +  +        YAD  E+AL NG ++   
Sbjct: 317 NEGFTDYYDLPNE--SAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GL 373

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
             +     Y  PL      A   H W   ++   CC        + +G  +Y   E  + 
Sbjct: 374 SLDGKKFFYENPL----ESAGKHHRW--IWHHCPCCPPNIARLLASIGSYMYGVAEDEI- 426

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            +++     + F      V L QK      W   +R+ +  ++      L +++LR+P W
Sbjct: 427 AVHLYGEGRARFKIGGTDVELTQKTR--YPWHGAVRLDIKLNAP----VLFAISLRIPEW 480

Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRT 605
             +NGA  ++NG+ + L       +      W   DK+ + +PL  R 
Sbjct: 481 --ANGATLAVNGEAIDLGSADVDGYARIEREWRDGDKIDLNIPLETRA 526


>gi|409098498|ref|ZP_11218522.1| hypothetical protein PagrP_08844 [Pedobacter agri PB92]
          Length = 673

 Score = 46.2 bits (108), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 108/480 (22%), Positives = 185/480 (38%), Gaps = 79/480 (16%)

Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF----PTELFDSFEALKPV 225
           L A A ++AST N  +   M   +  + + Q + G  Y  A      T   + F+  +  
Sbjct: 107 LEAVASLYASTKNPKLNAMMDKAIVVIGKSQREDGYIYTKAMIEQRKTGSNNQFQD-RLS 165

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
           +  Y   H + AG +  Y        L +A    +Y YN  +      ++ R+    +  
Sbjct: 166 FESYNIGHLMTAGCI-HYRATGKTTLLNIAKKATDYLYNFYKSASP--TLARNAICPSHY 222

Query: 286 TGGMNDVLYRLYSITHDPKHLLLA-HLF-----------DKPCFLGFLALQADYLSHFHA 333
            G     +  +Y  T+DP++L LA HL            D    + FL  Q   + H   
Sbjct: 223 MG-----VVEMYRTTNDPRYLELAQHLIAIKGKIDDGTDDNQDRIPFLQ-QTKAMGHAVR 276

Query: 334 NTHIPIVIGSQMRYEVTG-DPLYKLIGTFFMDIVNASHSYATGG-------TSAREFWWD 385
            +++    G    Y  TG D L   +   + D+ N    Y TGG       TS     ++
Sbjct: 277 ASYL--YAGVADLYAETGKDSLLNTLNLMWNDVQN-HKMYITGGLGSLYDGTSPDGTSYN 333

Query: 386 P---KRLADTLGSE--------NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
           P   +++    G +        + ETC     +  +  + + T +  YAD  E AL N V
Sbjct: 334 PVDVQKIHQAFGRDYQLPNFTAHNETCANIGNMLWNWRMLQITGDAKYADVMELALHNSV 393

Query: 435 LS-IQRG------TEPGVMIYMLPLGRGVSKARSTH-GWGTKFNSFWCCYGTGIESFSKL 486
           LS I         T P      LP  +  SK R  + G         CC    + + +++
Sbjct: 394 LSGISLDGKNFLYTNPLAQSNDLPFKQRWSKDRVPYIGLSN------CCPPNVVRTIAEV 447

Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWK---SGHVVLNQKVDPIVSWDPYLRMTLTFSSK 543
            D  Y        GL+   Y  ++   K      + L+++ +    WD  +++++     
Sbjct: 448 SDYAYSVSN---KGLWFNLYGGNNLTTKLADGSKISLSEETN--YPWDGNIKISV----- 497

Query: 544 QEVGQLS-SLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPL 601
           +E+G  + S+ LR+P WT    AQ S+NG+   +    G +      W   D + + LP+
Sbjct: 498 KEIGNKAYSVFLRIPAWT--QNAQISINGKPENIKAISGTYAEINRVWKKGDIIELNLPM 555


>gi|393782812|ref|ZP_10370994.1| hypothetical protein HMPREF1071_01862 [Bacteroides salyersiae
           CL02T12C01]
 gi|392672197|gb|EIY65667.1| hypothetical protein HMPREF1071_01862 [Bacteroides salyersiae
           CL02T12C01]
          Length = 675

 Score = 46.2 bits (108), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 88/397 (22%), Positives = 150/397 (37%), Gaps = 42/397 (10%)

Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
           W P   + KI+     QY  A   +  ++ T+M  YF  ++++ +    ++R W    + 
Sbjct: 155 WWPKMVVLKIM----QQYYSATGDE--RVITFMTNYFKYQLEQ-LPQNPLDR-WTHWGKF 206

Query: 286 TGGMN-DVLYRLYSITHDPKHLLLAHLFDKP------CFLGFLALQADYLSHF--HANTH 336
            GG N  V+Y LY+IT D   L L  L  +        FL    L   +  H    A   
Sbjct: 207 RGGDNLMVIYWLYNITGDKFLLELGDLVHQQTLDWTNVFLEGTQLMTQHSLHTVNLAQGF 266

Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
              VI  Q  Y+       K       +++  +  + TG  +  E      R  D   ++
Sbjct: 267 KEPVIYYQRDYDRKRIDAVKKAS----EVIRNTIGFPTGIWAGDEL----IRFGDP--TQ 316

Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR-- 454
             E C    M+     +   T +  +AD  ER   N  L  Q      V  Y   + +  
Sbjct: 317 GSELCAAVEMMFSLEKMLEITGDTQWADQLERIAYNA-LPTQVDDNCSVRQYYQQVNQIK 375

Query: 455 -------GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
                   V+    T         F CC     + + KL  +++F    N  G+  + Y 
Sbjct: 376 VSYEPRTFVTPHSHTGNLFGVLAGFPCCTSNLHQGWPKLVQNLWFATYDN--GIAALVYA 433

Query: 508 SSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
            S    K +G+V ++ + +    +D  +R  + F  K+        +LR+P W       
Sbjct: 434 PSKVTAKVAGNVTVDIEENTGYPFDEIIRFKMNFPDKKARTARFPFHLRIPEW--CEKPV 491

Query: 567 ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
             +NG+ +   P  N       W  ND++T++LP+S+
Sbjct: 492 IRVNGEVVSCVPVANIAVLERTWKSNDEVTLELPMSV 528


>gi|312135914|ref|YP_004003252.1| hypothetical protein Calow_1923 [Caldicellulosiruptor owensensis
           OL]
 gi|311775965|gb|ADQ05452.1| protein of unknown function DUF1680 [Caldicellulosiruptor
           owensensis OL]
          Length = 652

 Score = 46.2 bits (108), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 86/387 (22%), Positives = 141/387 (36%), Gaps = 64/387 (16%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL--------------GFLALQADYLSHFHA 333
           L +LY +T D K+L L+  F      +P +               GF  L  +YL     
Sbjct: 200 LVKLYEVTGDRKYLELSKFFVDERGQEPYYFDIEYEERGKKSHWNGFKGLGREYLQAHKP 259

Query: 334 NTHIPIVIGSQMR----YEVTGD--------PLYKLIGTFFMDIVNASHSYATG--GTSA 379
                  +G  +R    Y    D         L+ +  T F DIVN    Y TG  G+SA
Sbjct: 260 LRQQREAVGHAVRAVYLYSGAADVAAYTHDKELFDVCKTLFNDIVNRK-MYITGAIGSSA 318

Query: 380 R----EFWWD-PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
                 F +D P   A        ETC +  ++  +  L R      Y D  ERAL N V
Sbjct: 319 HGEAFTFEYDLPNDAAYA------ETCASVGLIFFAHRLNRIEPHAKYYDAVERALYNTV 372

Query: 435 LS--IQRGTEPGVMIYMLPLG---RGVSK---ARSTHGWGTKFNSFWCCYGTGIESFSKL 486
           +    Q G +     Y+ PL    + V K    R        +    CC        + L
Sbjct: 373 IGSMSQDGKK---YFYVNPLEVYPKEVEKRFDRRHVKPERQPWFGCACCPPNVARLLASL 429

Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV 546
           G  IY     N   +Y+  YI SS   + G   +  + +    ++  +++ L  S +   
Sbjct: 430 GRYIY---SYNQEEIYVNLYIGSSVQVEVGSAKVLLQQESGYPFEDMVKIDLKTSKEARF 486

Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
                L LR+P W        +   + +   P G ++     W+ N+++ +++P  ++  
Sbjct: 487 ----KLYLRIPSWCEKYEVYVNEKKEEMQKLPSG-YVCIERLWTENNQVVLKIPTEVKMV 541

Query: 607 AIQDDRPEYASIQAILFGPYLLAGHTS 633
           +         S  A++ GP +     +
Sbjct: 542 SSHPQVRSNVSKVAVVKGPVVFCAEEA 568


>gi|365837320|ref|ZP_09378689.1| hypothetical protein HMPREF0454_03564 [Hafnia alvei ATCC 51873]
 gi|364562052|gb|EHM39922.1| hypothetical protein HMPREF0454_03564 [Hafnia alvei ATCC 51873]
          Length = 665

 Score = 46.2 bits (108), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 69/302 (22%), Positives = 111/302 (36%), Gaps = 44/302 (14%)

Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG---- 376
           LALQ   + H  A   + ++ G      +  D   + I     + +     Y TGG    
Sbjct: 275 LALQQSAIGH--AVRFVYLLAGVAHLARLNNDEEKRQICLRLWNNMVQRQLYITGGIGSQ 332

Query: 377 TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
           +S   F  D     DT+ +E   +C +  ++  +  + +   +  YAD  ERAL N VL 
Sbjct: 333 SSGEAFSSDYDLPNDTVYAE---SCASIGLMMFANRMLQMEGDSQYADVMERALYNTVLG 389

Query: 437 IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGIES 482
                +     Y+ PL          H     FN  +              CC       
Sbjct: 390 -GMALDGRHFFYVNPL--------EVHPKSIPFNHIYDHVKPIRQRWFGCACCPPNIARI 440

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
            + +G  IY +       LYI  Y+ +     +G   L   +     WD    +++   +
Sbjct: 441 LTSIGHYIYTQRSD---ALYINLYVGNETHLDNG---LKIAISGNYPWDE--NVSVHIRT 492

Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
           ++ + Q  +L LRMP W      Q  LNG+         +L  T  W   D+L I LP+ 
Sbjct: 493 EKPLHQ--TLALRMPEWCEKPSVQ--LNGKTCEGLLKRGYLHITREWHDGDRLEIVLPMP 548

Query: 603 LR 604
           +R
Sbjct: 549 VR 550


>gi|330996652|ref|ZP_08320530.1| hypothetical protein HMPREF9442_01617 [Paraprevotella xylaniphila
           YIT 11841]
 gi|329572724|gb|EGG54357.1| hypothetical protein HMPREF9442_01617 [Paraprevotella xylaniphila
           YIT 11841]
          Length = 816

 Score = 46.2 bits (108), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 92/444 (20%), Positives = 158/444 (35%), Gaps = 102/444 (22%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKP---CFLGFLALQADYLSHFHANTHIPI-----VIGS 343
            L +LY +T   ++L  A  F +    C  G       +  + ++  H PI     ++G 
Sbjct: 221 ALAKLYKVTGKEEYLRTARYFVEETGRCTDG-------HAPNAYSQDHKPILEQDEIVGH 273

Query: 344 QMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
            +R              TGD  Y    T   + +     Y TGG  +R          + 
Sbjct: 274 AVRAGYLYSGVADVAAQTGDTAYFHALTRIWENMAGRKLYITGGIGSRA-------QGEG 326

Query: 393 LGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
            G + E        ETC +   +  +  +F  T +  Y D  ERAL NGV+S       G
Sbjct: 327 FGPDYELNNHTAYCETCASIANVYWNHRMFLATGDSRYEDILERALYNGVIS-------G 379

Query: 445 VMI------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
           V +      Y  PL     ++   HG    F    CC G      + + + +Y  +  +V
Sbjct: 380 VSLSGDRFFYDNPL-----ESMGQHGRQAWFGCA-CCPGNVTRFMASVPNYMYATQGKDV 433

Query: 499 PGLYIIQYISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
              ++  YI S  S       + + Q  D    WD  +R+ +    KQ      +L  R+
Sbjct: 434 ---FVNLYIQSTASLSTSQNKIEIRQTTD--YPWDGNIRLAVHPEKKQTF----ALRCRI 484

Query: 557 PVWTY--------------SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
           P W                  G    +NG+++       +     +W   D + +  P+ 
Sbjct: 485 PGWAQGRPVPTDLYHYTGKGKGYTIQVNGKDVDFHVENGYAVILRKWKKGDTVQLDFPMD 544

Query: 603 L-RTEA---IQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFN 658
           + R EA   ++DDR +     AI  GP +       + D         S + + + P+  
Sbjct: 545 VRRVEARVEVEDDRGK----AAIERGPIVYCIEDKDQPD---------SLIFNKVIPAGT 591

Query: 659 AQLVTFTQESGNSTFVMSNSNQSI 682
           A   T+  +  N    +  + Q++
Sbjct: 592 AISATYAPDMLNGIVTLEGTAQAV 615


>gi|255691741|ref|ZP_05415416.1| putative cytoplasmic protein [Bacteroides finegoldii DSM 17565]
 gi|260622626|gb|EEX45497.1| hypothetical protein BACFIN_06788 [Bacteroides finegoldii DSM
           17565]
          Length = 700

 Score = 46.2 bits (108), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 86/347 (24%), Positives = 140/347 (40%), Gaps = 47/347 (13%)

Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
           +  +Y  T +P++L L+ +L D    +          +  +  Y +  HA     +  G 
Sbjct: 250 VVEMYRATGNPRYLELSKNLIDIRGMVENGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 309

Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
              Y  TG+  L K + + + DIV     Y TG       GTS     ++P   +++  +
Sbjct: 310 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 368

Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
            G        + + ETC     +  +  +   T +  YAD  E  L N VLS     +  
Sbjct: 369 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 427

Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
              Y  PL R  +    T  W    T++ S +CC    + +  +  +  Y    EG    
Sbjct: 428 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 486

Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           LY    ++++  WK  G V L Q+ D    WD  +R+TL     ++ G   SL LR+P W
Sbjct: 487 LYGANTLTTT--WKEKGEVALTQETD--YPWDGNIRVTLD-KVPRKAGTF-SLFLRIPEW 540

Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
                A   +NGQ L +    N  +   R W   D  +L + +P+ L
Sbjct: 541 --CEKATLRVNGQPLQVNAKANSYAEVNRAWKKGDVVELVMDMPVRL 585


>gi|332882007|ref|ZP_08449642.1| hypothetical protein HMPREF9074_05436 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|357048165|ref|ZP_09109719.1| hypothetical protein HMPREF9441_03768 [Paraprevotella clara YIT
           11840]
 gi|332679931|gb|EGJ52893.1| hypothetical protein HMPREF9074_05436 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|355528748|gb|EHG98226.1| hypothetical protein HMPREF9441_03768 [Paraprevotella clara YIT
           11840]
          Length = 800

 Score = 45.8 bits (107), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 82/355 (23%), Positives = 132/355 (37%), Gaps = 73/355 (20%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L +LY +T D K+L  A  F D+    G  + +  Y     +  H P+V     +G  +R
Sbjct: 221 LAKLYIVTGDQKYLDEAKFFLDQ---RGHTSRRDAY-----SQAHKPVVEQDEAVGHAVR 272

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +TGD  Y        D +     Y TGG  A           +  G+
Sbjct: 273 ATYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATAN-------GEAFGA 325

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     + V+  LF    E  Y D  ER L NG++S     + G   
Sbjct: 326 NYELPNMSAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFF 384

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  PL     ++R  H     F    CC          L   +Y  ++ +V   Y+  ++
Sbjct: 385 YPNPL-----ESRGQHQRQPWFGCA-CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFM 435

Query: 508 SSSFDWKSGH--VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT----- 560
           S+  + + G   VVL Q+      WD  + +++    K +VG   ++ +R+P W      
Sbjct: 436 SNEANLEVGKKSVVLEQQTR--YPWDGDVAVSV---KKNKVGAF-AMKIRIPGWVRGQVV 489

Query: 561 ------YSNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
                 YS+G +      +NGQ +       + +   RW   DK+ +   +  R 
Sbjct: 490 PSDLYRYSDGKRLGYSVKVNGQPVESELQDGYFTIERRWKKGDKVEVHFDMEPRV 544


>gi|405383237|ref|ZP_11037007.1| hypothetical protein PMI11_07039 [Rhizobium sp. CF142]
 gi|397320335|gb|EJJ24773.1| hypothetical protein PMI11_07039 [Rhizobium sp. CF142]
          Length = 643

 Score = 45.8 bits (107), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 100/481 (20%), Positives = 185/481 (38%), Gaps = 83/481 (17%)

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-----PTELFDSFEA 221
           G ++ A++    +  N  I+ K+  +V  L   Q  +  GYL+++     P + + +   
Sbjct: 88  GKWIEAASYTLKNNPNPDIEAKIDAIVEKLEHGQ--MADGYLNSWFIRREPEKRWTNLRD 145

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER---H 278
           L  +    Y++  +L G +  +      + L +    V++       +I  +  E     
Sbjct: 146 LHEM----YSMGHLLEGAVAYFEATGKRRFLNVMIRAVDH-------IIDTFGREPGKLR 194

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL 328
            Y  +EE   +   L +LY +T DP+HL LA  F       P +    A +     A Y+
Sbjct: 195 GYDAHEE---IELALVKLYRVTKDPRHLDLAIYFVDERGQMPSYYDEEARKRGEDPASYV 251

Query: 329 --SHFHANTHIPI-----VIGSQMR------------YEVTGDPLYKLIGTFFMDIVNAS 369
             ++ ++  H+P+     V+G  +R            +E   + L    G  F ++V   
Sbjct: 252 FQTYAYSQAHMPVREQTQVVGHAVRAMYLFSAMADLAFENDDESLKSACGRLFDNLV-GR 310

Query: 370 HSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYY 426
             Y TGG   +++ E +     L +   +   ETC    +   S  + +   +  + D  
Sbjct: 311 QLYVTGGLGPSASNEGFTREYDLPNE--TAYAETCAAVALGFFSHRMAQIELDSKFTDKL 368

Query: 427 ERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF- 483
           E  L NG LS I R  +      +L           +HG   ++   +C C  T I  F 
Sbjct: 369 ETVLYNGALSGISRDGQHYFYENVL----------ESHGQNRRWKWHYCPCCPTNIARFI 418

Query: 484 SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK 543
           + LG   Y      V  + I  Y  ++ +   G+  L  K      W+  + ++L     
Sbjct: 419 TSLGQYFY---STKVDEVAIHLYGENAAELTVGNSFLRLKQKTEYPWNGDVGISLGLDQP 475

Query: 544 QEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND--KLTIQLPL 601
           +      +L LR+P W     A+A +NG+ + L     +      W   D  +L   +P+
Sbjct: 476 KRF----TLRLRIPGWC--RDAKALVNGEAIKLNVSKGYAPIEREWKDGDEVRLAFDMPV 529

Query: 602 S 602
            
Sbjct: 530 D 530


>gi|423344367|ref|ZP_17322079.1| hypothetical protein HMPREF1077_03509 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409212765|gb|EKN05799.1| hypothetical protein HMPREF1077_03509 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 816

 Score = 45.8 bits (107), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 87/388 (22%), Positives = 138/388 (35%), Gaps = 83/388 (21%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMRY 347
           L +LY +T D K+L +A  F +    G    + +  S      H+PI     ++G  +R 
Sbjct: 219 LAKLYKVTRDRKYLDMAKYFVEETGRGTDGHRLNAYSQ----DHMPILQQEEIVGHAVRA 274

Query: 348 -----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
                       +T D  Y        D +     Y TGG  +R          +  G E
Sbjct: 275 GYLYSGVADVAALTKDTAYFHAICRIWDNMATKKLYITGGIGSRA-------QGEGFGPE 327

Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI- 447
            E        ETC +   +  ++ +F  T +  Y D  ERAL NGV+S       GV + 
Sbjct: 328 YELHNHSAYCETCASIANVYWNQRMFLATGDAKYIDVLERALYNGVIS-------GVSLS 380

Query: 448 -----YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
                Y  PL     ++   H     F    CC G  +  F        +  +GN   LY
Sbjct: 381 GDKFFYDNPL-----ESMGQHERAPWFGCA-CCPGN-VTRFMASVPKYMYATQGN--SLY 431

Query: 503 IIQYISSS--FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
           +  Y+ S       +  V L Q  +    WD  +++T++           SL LR+P WT
Sbjct: 432 VNLYVGSESRVALANDTVTLVQNTE--YPWDGLVKLTVSPRKASSF----SLKLRIPSWT 485

Query: 561 YSNGAQAS----------------LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            +     S                +NG  L       ++     W   D + +++P+ +R
Sbjct: 486 GNEPVPGSDLYTYIKRDREPCAVFVNGTPLKEKAHHGYVVIEREWEPGDVIELRMPMDVR 545

Query: 605 TEAIQDDRPEYASIQAILFGP--YLLAG 630
                +       + A+  GP  Y L G
Sbjct: 546 RVKAHEKVRADQGLLAVERGPVVYCLEG 573


>gi|374985914|ref|YP_004961409.1| hypothetical protein SBI_03157 [Streptomyces bingchenggensis BCW-1]
 gi|297156566|gb|ADI06278.1| hypothetical protein SBI_03157 [Streptomyces bingchenggensis BCW-1]
          Length = 644

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 81/366 (22%), Positives = 134/366 (36%), Gaps = 57/366 (15%)

Query: 275 VERHWYSLNEETGGMNDV---LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
           VER+     +   G  +V   L  LY  T D ++L  A LF      G +  +    ++F
Sbjct: 173 VERYGPQGEDAVCGHPEVEMALVELYRETGDERYLTQARLFVDRRGRGTVPSRGMGSAYF 232

Query: 332 HAN---THIPIVIGSQMR-----------YEVTGD-PLYKLIGTFFMDIVNASHSYATGG 376
             +     +P V G  +R           +  TGD  L   +   + D+V A+  Y TGG
Sbjct: 233 QDHLPLRELPSVTGHAVRMAYLAAGATDVFLETGDRTLLDALRRLWDDMV-ATKLYVTGG 291

Query: 377 TSAREFWWDPKRLADT--LGSENE--ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
             +R      + + D   L SE    ETC     ++ +  +F  T +  Y D  ER L N
Sbjct: 292 LGSRH---SDEAVGDRYELPSERSYSETCAAIGTMQWAWRMFLATGDARYPDVLERVLYN 348

Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG--WGTKFNSFW----CCYGTGIESFSKL 486
              ++    +     Y  PL R     + +     G      W    CC    +   ++L
Sbjct: 349 -AFAVGLSADGRAFFYDNPLQRRPDHEQRSGAEEGGEPLRQAWFSCPCCPPNVVRWMAQL 407

Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV 546
            D +  E  G    L +  Y  +  D       L+        WD  +R+T+  +  +  
Sbjct: 408 ADFLVAERPGE---LLVAGYAQAGVD--GAEAALDMATG--YPWDGEVRLTVRRAPDEPY 460

Query: 547 GQLSSLNLRMPVW--------TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
                ++LR+P W        T     + +  G          +L+   RW   D+L + 
Sbjct: 461 ----RISLRVPGWADPGQVRLTVGTAGEETAAGDV-----SDGWLTVERRWRPGDELRLS 511

Query: 599 LPLSLR 604
           LP+ +R
Sbjct: 512 LPMPVR 517


>gi|393780984|ref|ZP_10369185.1| hypothetical protein HMPREF1071_00053 [Bacteroides salyersiae
           CL02T12C01]
 gi|392677319|gb|EIY70736.1| hypothetical protein HMPREF1071_00053 [Bacteroides salyersiae
           CL02T12C01]
          Length = 672

 Score = 45.4 bits (106), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 141/368 (38%), Gaps = 74/368 (20%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L +LY +T D K+L  A  F D   + G            ++  H P++     +G  +R
Sbjct: 222 LVKLYLVTGDRKYLDQAKFFLDARGYTG--------RKDAYSQAHKPVIEQDEAVGHAVR 273

Query: 347 Y-----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLAD 391
                        +TGD  Y K I   + +IV +   Y TGG  AR   E + D   L +
Sbjct: 274 AVYMYSGMADVAAITGDSSYIKAIDRIWDNIV-SKKMYITGGIGARHQGEAFGDNYELPN 332

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
              S   ETC     + ++  LF    +  Y D  ER L NG++S     + G   Y  P
Sbjct: 333 L--SAYCETCAAIGSVYMNYRLFLLHGDAKYFDVLERTLYNGLIS-GVSLDGGSFFYPNP 389

Query: 452 LGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISS 509
           L      +R    W      F C C  + I  F   L   +Y  ++  V   Y+  ++S+
Sbjct: 390 LASDGGYSRKP--W------FGCACCPSNISRFIPSLPGYVYAVKDRQV---YVNLFLSN 438

Query: 510 SFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT------- 560
             + K     VVL Q+      W   +R+ +     Q  G    +N+R+P W        
Sbjct: 439 RAELKVNDKKVVLEQETS--YPWKGDIRLKV-LQGNQPFG----MNVRIPGWVRGSVLPS 491

Query: 561 ----YSNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR----TEAI 608
               Y++  Q +    +NGQ +       +L+   +W  ND + I   +  R     E +
Sbjct: 492 DLYAYADHQQPAYRVMVNGQEVEGELHNGYLTIDRKWKKNDVVEIHFDMLPRLVKANEKV 551

Query: 609 QDDRPEYA 616
             DR   A
Sbjct: 552 AADRGRVA 559


>gi|380693440|ref|ZP_09858299.1| hypothetical protein BfaeM_05587 [Bacteroides faecis MAJ27]
 gi|380693449|ref|ZP_09858308.1| hypothetical protein BfaeM_05644 [Bacteroides faecis MAJ27]
          Length = 668

 Score = 45.4 bits (106), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 81/356 (22%), Positives = 135/356 (37%), Gaps = 78/356 (21%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
           L +LY +T D K+L  A  F      G+ + +  Y     +  H P+V     +G  +R 
Sbjct: 219 LVKLYMVTGDKKYLDQAKFFLDT--RGYTSRKDAY-----SQAHKPVVEQDEAVGHAVRA 271

Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                       +TGD  Y K I   + +IV +   Y TGG  AR          +  G+
Sbjct: 272 VYMYSGMADVAAITGDSSYIKAIDKIWDNIV-SKKIYITGGIGARH-------AGEAFGN 323

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     + ++  LF    +  Y D  ER L NG++S     + G   
Sbjct: 324 NYELPNLSAYCETCAAIGNVYMNYRLFLLHGDAKYFDVLERTLYNGLIS-GVSLDGGSFF 382

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQ 505
           Y  PL        S+ G  ++   F C C  + +  F   L   +Y  ++  V   Y+  
Sbjct: 383 YPNPL--------SSSGKYSRKPWFGCACCPSNVSRFIPSLPGYVYAVKDDQV---YVNL 431

Query: 506 YISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
           ++S+  + K     ++L Q+ D    W   +R+ +      +  Q  ++ LR+P W   N
Sbjct: 432 FLSNKAELKVDKKKIILEQETD--YPWKGDIRLKIA-----QGNQNFTMKLRIPGWVRGN 484

Query: 564 GA---------------QASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
                            + S+NGQ +       +LS   +W   D + +   +  R
Sbjct: 485 VLPGDLYAYADNQKPVYRVSVNGQPVESDVNNGYLSIARKWKKGDVVEVHFDMLPR 540


>gi|347530932|ref|YP_004837695.1| hypothetical protein RHOM_03205 [Roseburia hominis A2-183]
 gi|345501080|gb|AEN95763.1| hypothetical protein RHOM_03205 [Roseburia hominis A2-183]
          Length = 646

 Score = 45.4 bits (106), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 51/245 (20%), Positives = 99/245 (40%), Gaps = 22/245 (8%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVS 457
           ETC +  ++  +R++ +  K   YAD  ERAL NG++S +Q   +    +  L +  GVS
Sbjct: 336 ETCASIGLVFFARNMLKTEKNGRYADVMERALYNGIISGMQLDGKRFFYVNPLEVNPGVS 395

Query: 458 KARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
                +         W    CC    +   + LG   + E+E  V       Y       
Sbjct: 396 GEIFGYKHVIPERPGWYACACCPPNLVRMVTSLGKYAWDEDETAV-------YSHLFLGQ 448

Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
           ++     + +V+    W+     ++T+    ++ +L +L + +P   Y    + ++NG+ 
Sbjct: 449 EAALGKADIRVESAYPWEG----SVTYHVSAKIDELFTLAIHIP--AYVKDLRVTVNGEA 502

Query: 574 LPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLA 629
                     +L  + +W  +D++ +  PL +R         E     A++ GP  Y   
Sbjct: 503 FDTAGEIRDGYLYISRKWGSDDQVELHFPLPVRKIYASTHVREDVGCVALMRGPVVYCFE 562

Query: 630 GHTSG 634
           G  +G
Sbjct: 563 GADNG 567


>gi|261407601|ref|YP_003243842.1| hypothetical protein GYMC10_3802 [Paenibacillus sp. Y412MC10]
 gi|261284064|gb|ACX66035.1| protein of unknown function DUF1680 [Paenibacillus sp. Y412MC10]
          Length = 626

 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 62/300 (20%), Positives = 119/300 (39%), Gaps = 28/300 (9%)

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           YE+ G+P+ +      +D +   H  A G  S  E+      L+ T  S+  E C     
Sbjct: 237 YELHGNPVERESVHRGIDSLMTYHGQAHGMFSGDEW------LSGTHPSQGVELCAVVEY 290

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGV--------LSIQRGTEPGVMIYMLPLGRGVSK 458
           +     L R   E  + D  E+   N +         S Q   +   MI  +   R  S 
Sbjct: 291 MFSMEQLTRIFGEGRFGDILEKVAFNALPAAISADWTSHQYDQQVNQMICNV-APRAWSN 349

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
           +   + +G + N F CC     + + KL   ++ +++ +  G+  + Y   +     G  
Sbjct: 350 SPDANVFGLEPN-FGCCTANMHQGWPKLASHLWMKDQED--GVVAVSYAPCTVRTTVGRQ 406

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
            ++ ++     +    R+ +  S   E  +   ++LR+P W   +    +LNG+ +P+  
Sbjct: 407 GVSAEIAVTGEYPFKDRIQIHLSL--ERAESFRISLRIPAWC--DHPVITLNGREMPIQA 462

Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDI 638
              +    + W   D L + LP+ ++TE+    R  YA+  +I  GP +        W +
Sbjct: 463 ESGYAEIMQTWQSGDLLELYLPMEVKTES----RSMYAT--SITRGPLVYVLPVKENWQM 516


>gi|423344366|ref|ZP_17322078.1| hypothetical protein HMPREF1077_03508 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409212764|gb|EKN05798.1| hypothetical protein HMPREF1077_03508 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 657

 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 82/353 (23%), Positives = 122/353 (34%), Gaps = 69/353 (19%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L +LY +T   K+L LA  F DK    G+   +  Y     +  H P++     +G  +R
Sbjct: 219 LCKLYLVTGQKKYLDLAKFFLDK---RGYTERKDAY-----SQAHKPVLEQDEAVGHAVR 270

Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +TGD  Y        + V     Y TGG  A           +  G 
Sbjct: 271 AAYMYSGMADVAALTGDTGYVHAIDRIWENVVTKKLYITGGIGATNN-------GEAFGK 323

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     +  +  LF    E  Y D  ER L NG++S     E     
Sbjct: 324 NYELPNLSAYCETCAAIGNVYWNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLEGNGFF 382

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  PL       R        +    CC          L   IY   + NV   Y+  ++
Sbjct: 383 YPNPLASTGQHQRKP------WFGCACCPSNICRFIPSLPGYIYAVHDKNV---YVNLFM 433

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT------- 560
           S+S D K G   L         WD  +R+ +    KQ+     +L +R+P W        
Sbjct: 434 SNSSDLKVGGKSLKLTQSTGYPWDGDVRLDMAPKGKQDF----TLKIRVPGWVRGEVVPS 489

Query: 561 ----YSNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
               +S+G Q      +NG+ +       + S T +W   D + +   +  RT
Sbjct: 490 DLYMFSDGKQLGYSVKVNGEPVESNLDKGYFSITRQWKKGDVVEVHFDMEPRT 542


>gi|389844758|ref|YP_006346838.1| hypothetical protein Theba_1950 [Mesotoga prima MesG1.Ag.4.2]
 gi|387859504|gb|AFK07595.1| hypothetical protein Theba_1950 [Mesotoga prima MesG1.Ag.4.2]
          Length = 621

 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 95/496 (19%), Positives = 194/496 (39%), Gaps = 52/496 (10%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF--PTELFDSFEALK 223
           V  ++ A++   +   +  I+ ++ +++  + + Q   G GY++ +    +  + ++ LK
Sbjct: 78  VYKWIEAASYSLSYNEDPEIRARIESLITLIEKAQEISGDGYINTYFVGQKAGERWKDLK 137

Query: 224 PVWAPYYTIHKILAGLLDQYVLADNA---QALKMATWMVEYFYNRVQKVITMY-SVERHW 279
            +   Y   H I AG+ ++    D       +  A  +++ F +   KV T +  +E   
Sbjct: 138 NMHELYCAGHLIQAGIANKRASGDETLFKVCVSAADNILDSFRDDDCKVTTGHPELEMAM 197

Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF-------- 331
             L+ ETG  +              +L  A +       G++     ++ H         
Sbjct: 198 IELHRETGNRD--------------YLKFAQMLIDNRGRGYVGGDEYHIDHVPFRELKEL 243

Query: 332 --HANTHIPIVIGSQMRYEVTGDP-LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
             HA   + ++ G+   +  TGD  L  ++   ++D+  A   Y TGG  +R +  +   
Sbjct: 244 TGHAVRMLYLLAGAADIFLETGDETLLAVLERLWIDLT-ARKMYVTGGAGSR-YEGESFG 301

Query: 389 LADTLGSENE--ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
               L S     ETC     +  +  ++  + +  Y D +E++  NGVLS     +    
Sbjct: 302 EEFELPSRRAYAETCAAVGNVFWNWRMYMISGDAKYLDLFEQSFYNGVLS-GISLDGKRY 360

Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
            Y+ PL     + R       ++    CC        +  G  IY      +  +   + 
Sbjct: 361 FYVNPLEDAGKRERE------EWFECACCPPNIARLLTSFGGYIYGTTLNEIR-VNFYEE 413

Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
             ++  ++ G V + QK     S+     + LT ++  +  +LS L LR+P WT     +
Sbjct: 414 SKATIPFRDGEVSIIQK----TSYPHSEEVQLTVATDLDT-ELSIL-LRIPEWTEGE-FE 466

Query: 567 ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP- 625
             ++G    L P   F+     W    ++ + LP+ +R         E     ++  GP 
Sbjct: 467 VQVDGIKQKLRPEKGFVRLEGNWKGKTEVYLALPMRIRLMTANPLLRENTDKVSVQRGPL 526

Query: 626 -YLLAGHTSGEWDIKT 640
            Y   G  + ++D++T
Sbjct: 527 VYCAEGVDNPDFDVRT 542


>gi|334121751|ref|ZP_08495800.1| protein of hypothetical function DUF1680 [Enterobacter hormaechei
           ATCC 49162]
 gi|333392772|gb|EGK63868.1| protein of hypothetical function DUF1680 [Enterobacter hormaechei
           ATCC 49162]
          Length = 657

 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 58/251 (23%), Positives = 90/251 (35%), Gaps = 39/251 (15%)

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADGHYADVME 370

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H     FN  +              
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIYDHVKPVRQRWFGCA 421

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + LG  IY         L I  Y+ +      G  +L  ++     W   
Sbjct: 422 CCPPNIARVLTSLGHYIYTVRPD---ALLINLYVGNDVAIPVGDNILQLRISGNYPWHEQ 478

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
           +++ +T      V    +L LR+P W        SLNG+ +       +L     W   D
Sbjct: 479 VKIEIT----SPVPVTHTLALRLPDWCAEPA--VSLNGEAITGEVSRGYLYLNRSWQEGD 532

Query: 594 KLTIQLPLSLR 604
            L++ LP+ +R
Sbjct: 533 TLSLTLPMPVR 543


>gi|298374271|ref|ZP_06984229.1| cytoplasmic protein [Bacteroides sp. 3_1_19]
 gi|301307792|ref|ZP_07213748.1| putative cytoplasmic protein [Bacteroides sp. 20_3]
 gi|423337089|ref|ZP_17314833.1| hypothetical protein HMPREF1059_00758 [Parabacteroides distasonis
           CL09T03C24]
 gi|298268639|gb|EFI10294.1| cytoplasmic protein [Bacteroides sp. 3_1_19]
 gi|300834135|gb|EFK64749.1| putative cytoplasmic protein [Bacteroides sp. 20_3]
 gi|409238277|gb|EKN31070.1| hypothetical protein HMPREF1059_00758 [Parabacteroides distasonis
           CL09T03C24]
          Length = 618

 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 52/231 (22%), Positives = 98/231 (42%), Gaps = 25/231 (10%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPL-GRGV 456
           ETC +  M+  ++ + + T +  Y D  ER+L NG L+ I  G +     Y+ PL  +G 
Sbjct: 336 ETCASVGMVLWNQRMNQLTGDSKYIDILERSLYNGALAGISLGGDR--FFYVNPLESKGD 393

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
              +  +G         CC          +G+ IY   +     L++  YI ++   + G
Sbjct: 394 HHRQEWYGCA-------CCPSQLSRFLPSIGNYIYASSD---DALWVNLYIGNTGQIRIG 443

Query: 517 H--VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
              ++L Q+ D    WD  +++T++ S   E      + LR+P W  +     S+NG+ +
Sbjct: 444 ETDILLTQETD--YPWDGSVKLTISTSQPLE----KEIRLRIPNWCKT--YDLSINGKRI 495

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
            +     + +  + W   D + + + + +   A      E    +AI  GP
Sbjct: 496 NVSEKKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFGKRAIQRGP 545


>gi|333378296|ref|ZP_08470027.1| hypothetical protein HMPREF9456_01622 [Dysgonomonas mossii DSM
           22836]
 gi|332883272|gb|EGK03555.1| hypothetical protein HMPREF9456_01622 [Dysgonomonas mossii DSM
           22836]
          Length = 826

 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 84/390 (21%), Positives = 148/390 (37%), Gaps = 83/390 (21%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMRY 347
           L +LY +T DP +L +A  F     + ++      +S  +A  H P+      +G  +R 
Sbjct: 226 LVKLYRVTGDPLYLNMAKKFIDIRGVTYVPDGKGTMSPEYAQQHAPVREQDKAVGHAVRA 285

Query: 348 -----------EVTGDP-LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                       +TGD  L   +   + +IV+ +  + TGG  A           +  G 
Sbjct: 286 VYLYSGMSDVGTLTGDTTLSPALDKIWGNIVD-TRMHITGGLGAIHG-------IEGFGP 337

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
           E E        ETC     +  +  +F   K+  Y D  E +L N VL+     E     
Sbjct: 338 EYELPNKEAYNETCAAVGNVFFNHRMFLLEKDGKYMDVAEVSLLNNVLA-GVNLEGNKFF 396

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
           Y+ PL             GT   S+W    CC         ++   +Y   +  +   + 
Sbjct: 397 YVNPLASD----------GTVDRSYWFGTACCPTNLARLIPQISGLMYAHTDNEI---FC 443

Query: 504 IQYISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
             Y  S  D+   SG V L QK +      P+    +   + ++  Q  S+ +R+P W  
Sbjct: 444 SFYTGSKVDFALTSGKVALEQKTNY-----PFDESIVLTVNPEKNDQTFSIKMRIPTWVG 498

Query: 562 S------------NGAQA-----------SLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
           S            N ++A           +L+ +   +     F+S + +W   DK+ ++
Sbjct: 499 SQFVPGKLYSYVDNNSKAWELYINDKKVGNLSFKKGEVSLDKGFVSISRKWKKGDKVELK 558

Query: 599 LPLSLR-TEAIQDDRPEYASIQAILFGPYL 627
           LP+ +R + AI + + +   + AI  GP +
Sbjct: 559 LPMPVRYSHAINEVKADNDRV-AITRGPLV 587


>gi|29349082|ref|NP_812585.1| hypothetical protein BT_3674 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|383124304|ref|ZP_09944969.1| hypothetical protein BSIG_3668 [Bacteroides sp. 1_1_6]
 gi|29340989|gb|AAO78779.1| Six-hairpin glycosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|251839199|gb|EES67283.1| hypothetical protein BSIG_3668 [Bacteroides sp. 1_1_6]
          Length = 668

 Score = 45.1 bits (105), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 82/356 (23%), Positives = 134/356 (37%), Gaps = 78/356 (21%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
           L +LY  T D K+L  A  F      G+ + +  Y     +  H P+V     +G  +R 
Sbjct: 219 LVKLYMATGDKKYLDQAKFFLDT--RGYTSRKDTY-----SQAHKPVVEQDEAVGHAVRA 271

Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                       +TGD  Y K I   + +IV +   Y TGG  A           +  G+
Sbjct: 272 VYMYSGMADVAAITGDSSYIKAIDKIWDNIV-SKKIYITGGIGAH-------HAGEAFGN 323

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     + ++  LF    +  Y D  ER L NG++S     + G   
Sbjct: 324 NYELPNLSAYCETCAAIGNVYMNYRLFLLHGDAKYFDVLERTLYNGLIS-GVSLDGGSFF 382

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQ 505
           Y  PL        S++G  ++   F C C  + +  F   L   +Y  +   V   Y+  
Sbjct: 383 YPNPL--------SSNGKYSRKPWFGCACCPSNVSRFIPSLPGYVYAVKNDQV---YVNL 431

Query: 506 YISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
           Y+S+  + K     ++L Q+      W+  +R+ +T     +  Q  ++ LR+P W   N
Sbjct: 432 YLSNKAELKVDKKKILLEQETG--YPWNGDIRLKIT-----QGNQDFTMKLRIPGWVRGN 484

Query: 564 ---------------GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
                            Q S+NGQ +       +LS   +W   D + +   +  R
Sbjct: 485 VLPGDLYSYADNQKPAYQVSVNGQTVESDVNDGYLSIARKWKKGDVVEVHFDMIPR 540


>gi|430748744|ref|YP_007211652.1| hypothetical protein Theco_0434 [Thermobacillus composti KWC4]
 gi|430732709|gb|AGA56654.1| hypothetical protein Theco_0434 [Thermobacillus composti KWC4]
          Length = 806

 Score = 45.1 bits (105), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 51/212 (24%), Positives = 81/212 (38%), Gaps = 14/212 (6%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL------ 452
           ETC +  ++  +R + R      YAD  ERAL N VL+     +     Y+ PL      
Sbjct: 323 ETCASIVLIFWARRMLRLEARSEYADVMERALYNTVLA-GMARDGKHFFYVNPLEVWPEA 381

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS-- 510
                  R       K+    CC        + L D IY  +E     +++  YI S   
Sbjct: 382 SLKNPDRRHVKPIRQKWFGCSCCPPNVARLLASLDDYIYDIDEA-AGRVHVHLYIGSEAR 440

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
           F      V L+Q+    + WD  +   L+ S    V    +L LR+P W  +     ++N
Sbjct: 441 FAAAGREVTLHQRSG--LPWDGTVTFGLSVSGGGAV--RLALALRVPDWFQTAEPVLAVN 496

Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
           G+  P      +      W+  D+   +LP+ 
Sbjct: 497 GEACPYRMEKGYAVVEREWADGDRAEWRLPME 528


>gi|299141574|ref|ZP_07034710.1| hypothetical protein HMPREF0665_01155 [Prevotella oris C735]
 gi|298576910|gb|EFI48780.1| hypothetical protein HMPREF0665_01155 [Prevotella oris C735]
          Length = 673

 Score = 45.1 bits (105), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 72/321 (22%), Positives = 121/321 (37%), Gaps = 48/321 (14%)

Query: 349 VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--------E 399
           +TGD  Y K I   + +I++  + Y TGG  AR +        +  G++ E        E
Sbjct: 290 LTGDSAYIKAIDCIWDNILSKKY-YLTGGVGARHY-------GEAFGADYELPNLTAYNE 341

Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
           TC       ++  LF    +  Y D  ER L NGV+S     + G   Y  PL       
Sbjct: 342 TCAAIAQCYLNMRLFMLHGDSKYIDCLERTLYNGVIS-GMSIDGGRFFYPNPLSADGIYK 400

Query: 460 RSTHGWGTKFNSFWC-CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
            +  G  T+   F C C  + +  F        +   GN   +Y+  ++ S  + K G  
Sbjct: 401 FNADGTTTRQPWFGCACCPSNLSRFIPSVPGYVYAVRGN--DVYVNLFMGSKANVKVGGK 458

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT---------YS------N 563
            +  + +    WD  + + +    K    + +SL +R+P W          YS      +
Sbjct: 459 EMKIETETNYPWDGKVSICI----KGNANKHASLLVRIPGWARGEVTPGGLYSFTDKQKD 514

Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAIQDDRPEYASIQ 619
           G   ++NG+N             +     D +T+ L +  RT    + + DDR       
Sbjct: 515 GWSIAVNGKNRNAEKLEKGYIRIDNVKKGDVITLNLDMEPRTVVADKRVMDDR----GCV 570

Query: 620 AILFGPYLLAGHTSGEWDIKT 640
           A+  GP +    +     +KT
Sbjct: 571 AVERGPLVYCAESVDNNGMKT 591


>gi|116254709|ref|YP_770545.1| hypothetical protein pRL100266 [Rhizobium leguminosarum bv. viciae
           3841]
 gi|115259357|emb|CAK10492.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
           3841]
          Length = 647

 Score = 45.1 bits (105), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 101/482 (20%), Positives = 175/482 (36%), Gaps = 95/482 (19%)

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-----PTELFDSFEA 221
           G ++ A++       NA ++ K+  +V  L + Q  +  GYL+++     P   + +   
Sbjct: 90  GKWIEAASYTLKVHPNAALEAKIDAIVEKLEKGQ--MADGYLNSWFIRREPDRRWTNLRD 147

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER---H 278
           L  +    Y++  +L G +  Y      + L +    V++       +I  +  E     
Sbjct: 148 LHEM----YSMGHLLEGAVAYYEATGKRRFLDVMIRAVDH-------IIATFGAEPGKLR 196

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQAD------- 326
            Y  +EE   +   L +LY +T DP+HL LA  F       P +    A +         
Sbjct: 197 GYDAHEE---IELALVKLYRVTRDPRHLKLATYFVDERGRMPSYYDEEARKRGESPDDYV 253

Query: 327 YLSHFHANTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASH 370
           Y ++ ++  H+P+     V+G  +R                DP  K       D +    
Sbjct: 254 YKTYAYSQAHMPVRDQHQVVGHAVRAMYLFSAMADLSHENDDPTLKEACDRLFDNLVGRQ 313

Query: 371 SYATGGTS--------AREFWWDPKRLADTLGSEN--EETCTTYNMLKVSRHLFRWTKEI 420
            Y TGG           REF          L +E    ETC    +   S  + +   + 
Sbjct: 314 LYVTGGLGPSASNEGFTREF---------DLPNETAYAETCAAVALGFWSHRMAQVDLDS 364

Query: 421 AYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGT 478
            + D  E  L NG LS I R  E      +L           +HG   ++   +C C  T
Sbjct: 365 KFTDRLETVLYNGALSGISRDGEHYFYENVL----------ESHGQHRRWKWHYCPCCPT 414

Query: 479 GIESF-SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
            I  F + LG   Y  ++     L +  Y ++S +   G+  +    + +  WD  + + 
Sbjct: 415 NIARFITSLGQYFYSTDDHQ---LAVHLYGTNSAELTVGNSFVRLIQETLYPWDGDISLR 471

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKL 595
                         L LR+P W     AQ S+NG  + L       + + +  W   D++
Sbjct: 472 FAVERPSRF----QLRLRIPGWC--RQAQISVNGVAVDLDQCVTKGYAAISREWRNGDEV 525

Query: 596 TI 597
            I
Sbjct: 526 RI 527


>gi|373954097|ref|ZP_09614057.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
           18603]
 gi|373890697|gb|EHQ26594.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
           18603]
          Length = 800

 Score = 45.1 bits (105), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 61/246 (24%), Positives = 97/246 (39%), Gaps = 51/246 (20%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
           ETC     +  +  +F    +  Y D  ER L NG+LS       GV +      Y  PL
Sbjct: 335 ETCAAIGNVYWNNRMFLLHGDAKYIDVLERTLYNGLLS-------GVSLSGDRFFYPNPL 387

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
                  RS       + S  CC          L   +Y + + +   LY+  ++S+S +
Sbjct: 388 ASMFQHQRSA------WISCACCISNMTRFLPSLPGYVYAKNKND---LYVNLFMSNSSN 438

Query: 513 WK--SGHVVLNQKVD------------PIVSWDPYLRMTLTFSSKQE--VGQLSSLNLR- 555
            K  SG+V + Q+ D            P+ + D  LR+ +   +KQ+   G L S   + 
Sbjct: 439 IKLASGNVNIVQQTDYPWKGQVDMTINPVKTTDFTLRVRIPGWAKQQPVPGNLYSFMDKT 498

Query: 556 -MPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS----LRTEAIQD 610
            +PV  Y NG   S   +         +      W   DK+++ LPL     L  + ++D
Sbjct: 499 PLPVVIYINGKATSFVTEK-------GYAVLKRNWKKGDKVSLALPLETEKVLANDKVKD 551

Query: 611 DRPEYA 616
           DR  +A
Sbjct: 552 DRGRFA 557


>gi|332882008|ref|ZP_08449643.1| hypothetical protein HMPREF9074_05437 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|357048166|ref|ZP_09109720.1| hypothetical protein HMPREF9441_03769 [Paraprevotella clara YIT
           11840]
 gi|332679932|gb|EGJ52894.1| hypothetical protein HMPREF9074_05437 [Capnocytophaga sp. oral
           taxon 329 str. F0087]
 gi|355528749|gb|EHG98227.1| hypothetical protein HMPREF9441_03769 [Paraprevotella clara YIT
           11840]
          Length = 818

 Score = 45.1 bits (105), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 90/412 (21%), Positives = 147/412 (35%), Gaps = 84/412 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKP---CFLGFL--ALQADYLSHFHANTHIPIVIGSQMR 346
            L +LY +T   ++L  A  F +    C  G    A   DY      +  +   + +   
Sbjct: 221 ALAKLYKVTGKEEYLRTARYFVEETGRCTDGHAPSAYSQDYKPILEQDEIVGHAVRAGYL 280

Query: 347 YE-------VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE- 398
           Y        +TGD  Y    T   + +     Y TGG  +R          +  G + E 
Sbjct: 281 YSGVADVAALTGDTAYFHALTRIWENMAGRKLYLTGGIGSRA-------QGEGFGPDYEL 333

Query: 399 -------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI---- 447
                  ETC +   +  +  +F  T +  Y D  ERAL NGV+S       GV +    
Sbjct: 334 NNHTAYCETCASIANVYWNHRMFLATGDSRYEDVLERALYNGVIS-------GVSLSGDR 386

Query: 448 --YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
             Y  PL     ++   H     F    CC G      + + + +Y  +  +V   ++  
Sbjct: 387 FFYDNPL-----ESMGQHERQAWFGCA-CCPGNVTRFMASVPNYMYATQGKDV---FVNL 437

Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--- 562
           YI S+    +    +  +      WD  +RMT+    KQ      +L  R+P W      
Sbjct: 438 YIQSTAHLSTSQNKIEIRQTTDYPWDGKIRMTVHPEKKQTF----ALRCRIPGWAQDRPV 493

Query: 563 -----------NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL-RTEA--- 607
                       G    +NG++        +     +W   D + +  P+ + R EA   
Sbjct: 494 PTDLYHYTGKGKGYTIQVNGKDAEFRVENGYAVILRKWKKGDTVQLDFPMDVRRVEARGE 553

Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEWD-------IKTGTARSLSALISP 652
           ++DDR +     AI  GP +       + D       I TGT  ++SA  +P
Sbjct: 554 VEDDRGK----AAIERGPIVYCIEDKDQPDSLIFNKFIPTGT--TISATYAP 599


>gi|154495096|ref|ZP_02034101.1| hypothetical protein PARMER_04143 [Parabacteroides merdae ATCC
           43184]
 gi|423725062|ref|ZP_17699202.1| hypothetical protein HMPREF1078_03096 [Parabacteroides merdae
           CL09T00C40]
 gi|154085646|gb|EDN84691.1| hypothetical protein PARMER_04143 [Parabacteroides merdae ATCC
           43184]
 gi|409235418|gb|EKN28236.1| hypothetical protein HMPREF1078_03096 [Parabacteroides merdae
           CL09T00C40]
          Length = 679

 Score = 45.1 bits (105), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 87/424 (20%), Positives = 163/424 (38%), Gaps = 43/424 (10%)

Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
           ++  ++ QY  A   Q  ++  +M  YF  ++ ++    +    W    E+ GG N  V+
Sbjct: 163 VMLKVMQQYYTA--TQDRRVIDFMTRYFRYQLDELPK--NPLGKWTFWGEQRGGDNLMVV 218

Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
           Y LY+IT D   L L  L  K  F    + L  ++L   H+   + +  G +   + Y+ 
Sbjct: 219 YWLYNITGDKFLLDLGELIHKQTFNWTDIFLNQNHLRRQHSLHCVNLAQGFKEPIVYYQQ 278

Query: 350 TGDPLYKLIGTFFMDIVNASHSYA--TGGTSAREFWWDPKRLADTLGSENEETCTTYNML 407
             D   K I      + +  H+    TG       W   + L     +   E CT   M+
Sbjct: 279 GKDS--KQIQATRQAVNDIRHTIGLPTG------LWGGDELLRFGKPTTGSELCTAVEMM 330

Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
                +   T ++ +ADY ER   N  L  Q   +     Y     + ++  R    + T
Sbjct: 331 YSLETILEVTGDMQWADYLERVAYNA-LPTQVTDDYSARQYYQQTNQ-IAVTREWREFST 388

Query: 468 ----------KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK-SG 516
                     +   + CC     + + K   ++++    N  GL  + +  S    + +G
Sbjct: 389 PHDDTDLLFGELTGYPCCTSNLHQGWPKFVQNLWYATADN--GLASLLFAPSQVTARVAG 446

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
            + +N K +    ++  +R  ++F+ K+        +LR+P W         LNG+ L +
Sbjct: 447 GIEVNLKEETAYPFEETVRYHVSFTDKKVKKVFFPFHLRIPGW--CKQPVVKLNGKPLTV 504

Query: 577 PP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
              PG        W   D L+++LP+ +           Y +   +  GP + A   + +
Sbjct: 505 DAYPGTVTRINREWKEGDILSLELPMEVTVSRW------YENSAVVERGPLVYALKMNEK 558

Query: 636 WDIK 639
           W+ K
Sbjct: 559 WEKK 562


>gi|354725692|ref|ZP_09039907.1| hypothetical protein EmorL2_22781 [Enterobacter mori LMG 25706]
          Length = 649

 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 57/251 (22%), Positives = 93/251 (37%), Gaps = 39/251 (15%)

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADGHYADVME 362

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H     FN  +              
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIFDHVKPVRQRWFGCA 413

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + LG  IY   +     L+I  Y+ +      G   L  ++     W   
Sbjct: 414 CCPPNIARVLTSLGHYIYTVRQD---ALFINLYVGNDVAIPVGDETLALRISGNYPWHEQ 470

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
           +++ +T ++        +L LR+P W  +      LNG+ +       +L  T  W   D
Sbjct: 471 VKIDITSTAP----VTHTLALRLPDWGAT--PDVLLNGEAVTGEISRGYLYLTRSWQEGD 524

Query: 594 KLTIQLPLSLR 604
            +T+ LP+ +R
Sbjct: 525 VITLTLPMPVR 535


>gi|241554299|ref|YP_002979512.1| hypothetical protein Rleg_6525 [Rhizobium leguminosarum bv.
           trifolii WSM1325]
 gi|240863605|gb|ACS61267.1| protein of unknown function DUF1680 [Rhizobium leguminosarum bv.
           trifolii WSM1325]
          Length = 647

 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 105/482 (21%), Positives = 180/482 (37%), Gaps = 95/482 (19%)

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-----PTELFDSFEA 221
           G ++ A++    +  NA ++ K+  +V  L + Q  +  GYL+++     P   + +   
Sbjct: 90  GKWIEAASYTLKAHPNAALETKIDAIVEKLEKGQ--MADGYLNSWFIRREPDRRWTNLRD 147

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER---H 278
           L  +    Y++  +L G +  Y      + L +    V++       +I  +  E     
Sbjct: 148 LHEM----YSMGHLLEGAVAYYEATGKRRFLDVMIRAVDH-------IIETFGAEPGKLR 196

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL 328
            Y  +EE   +   L +LY +T DP+HL LA  F       P +    A +      DY+
Sbjct: 197 GYDAHEE---IELALVKLYRVTGDPRHLKLATYFVDERGRMPSYYDEEARKRGESPEDYV 253

Query: 329 --SHFHANTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASH 370
             ++ ++  H+P+     V+G  +R                DP  K       D + +  
Sbjct: 254 YKTYAYSQAHLPVRDQHQVVGHAVRAMYLFSAMADLSRENDDPTLKEACDRLFDNLVSRQ 313

Query: 371 SYATGGTS--------AREFWWDPKRLADTLGSEN--EETCTTYNMLKVSRHLFRWTKEI 420
            Y TGG           REF          L +E    ETC    +   S  + +   + 
Sbjct: 314 LYVTGGLGPSASNEGFTREF---------DLPNETAYAETCAAVALGFWSHRMAQVDLDS 364

Query: 421 AYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGT 478
            + D  E  L NG LS I R  E      +L           +HG   ++   +C C  T
Sbjct: 365 KFTDRLETVLYNGALSGISRDGERYFYENVL----------ESHGQHRRWKWHYCPCCPT 414

Query: 479 GIESF-SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
            I  F + LG   Y  ++  +  +++    S+        V L QK       D  LR  
Sbjct: 415 NIARFITSLGQYFYSTDDHQL-AVHLYGTNSAELTVGDSFVRLIQKTQYPWDGDISLRFA 473

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKL 595
           +   S+ +      L LR+P W     AQ S+NG  + L       + + +  W   D++
Sbjct: 474 VERPSRFQ------LRLRIPGWC--RQAQISVNGVAVDLDQCVTKGYAAISREWRNGDEV 525

Query: 596 TI 597
            I
Sbjct: 526 RI 527


>gi|270339568|ref|ZP_06005245.2| conserved hypothetical protein [Prevotella bergensis DSM 17361]
 gi|270334558|gb|EFA45344.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
          Length = 813

 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 66/294 (22%), Positives = 114/294 (38%), Gaps = 47/294 (15%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGV 456
           ETC +   +  +  +F  T +  Y D YERAL NGVLS     G E     Y  PL    
Sbjct: 344 ETCASIANVYWNYRMFLATGDAKYVDVYERALYNGVLSGVSLSGKE---FFYDNPLESMG 400

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
             AR    W   F    CC G  +  F        +   GN   +++  YI    D    
Sbjct: 401 QHAR--QAW---FGCA-CCPGN-VTRFVASVPQYQYATRGN--DIFVNLYIQGKADING- 450

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS-------- 568
            V L Q  +    WD  + + ++   +       ++  R+P W ++     +        
Sbjct: 451 -VQLTQTTN--YPWDGNISIQVSPKRRSTF----AIRFRIPGWAHNKPVSTNLYHFIDKA 503

Query: 569 ------LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR----TEAIQDDRPEYASI 618
                 LNG  +       ++  + +W   D++ I+LP+ +R     + ++DDR +    
Sbjct: 504 KPYAVKLNGDVVDATLEDGYVVISRKWKKGDRVEIELPMDVRRVQANDNVEDDRGKI--- 560

Query: 619 QAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNA-QLVTFTQESGNS 671
            A+  GP +       + D        +  L +PI  S+++ +L    + +GN+
Sbjct: 561 -ALERGPVMFCLEGKDQSD--NTVFNKIITLTTPITASYHSDKLNGIVELTGNA 611


>gi|255532639|ref|YP_003093011.1| hypothetical protein Phep_2748 [Pedobacter heparinus DSM 2366]
 gi|255345623|gb|ACU04949.1| protein of unknown function DUF1680 [Pedobacter heparinus DSM 2366]
          Length = 684

 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 109/484 (22%), Positives = 179/484 (36%), Gaps = 87/484 (17%)

Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG------------TGYLSAFPTELFD 217
           L A A M+AST++  +   M   +  ++  Q   G            TG  + F   L  
Sbjct: 118 LEAMASMYASTNDPKLDAMMDKAIAVIARSQRDDGYIYTKAMIEQRKTGSKNQFQDRL-- 175

Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
           SFEA        Y I  ++      Y        L +A    EY YN  QK     ++ R
Sbjct: 176 SFEA--------YNIGHLMTAACVHYRATGKTTLLNVAKKATEYLYNFYQKASP--ALAR 225

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLA-HLF-----------DKPCFLGFLALQA 325
           +    +   G     +  +Y    DP++L LA HL            D    + FL  Q 
Sbjct: 226 NAICPSHYMG-----VIEMYRTIKDPRYLELAKHLIAIKGKIEDGTDDNQDRIPFLQ-QT 279

Query: 326 DYLSH-FHANTHIPIVIGSQMRYEVTG-DPLYKLIGTFFMDIVNASHSYATGG------- 376
             + H   AN    +  G    Y  TG D L K +   + D VN    Y TGG       
Sbjct: 280 KAMGHAVRANY---LYAGVADLYAETGNDSLMKTLNLMW-DDVNQHKMYITGGCGSLYDG 335

Query: 377 TSAREFWWDP---KRLADTLGSE--------NEETCTTYNMLKVSRHLFRWTKEIAYADY 425
           TS     ++P   +++    G +        + ETC     +  +  + + + +  YAD 
Sbjct: 336 TSPDGTSYNPTEVQKIHQAFGRDFQLPNFTAHNETCANIGNVLWNWRMLQISGDAKYADV 395

Query: 426 YERALTNGVLS-IQRG------TEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGT 478
            E AL N VLS I         T P      LP  +  SK R  +          CC   
Sbjct: 396 MELALHNSVLSGISLDGKKFLYTNPLSYSDELPFKQRWSKDRVPY-----IGLSNCCPPN 450

Query: 479 GIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
            + + +++ D  Y   ++G    LY    ++++       + L+Q+ +    WD  +++ 
Sbjct: 451 VVRTIAEVSDYAYSISDKGLWFNLYGGNTVNTTLT-DGTKLKLSQETN--YPWDGNIKIK 507

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTI 597
           +  +  +      SL  R+P W      + +   +N+ L  PG +     +W   D + +
Sbjct: 508 ILSTGSKPY----SLFFRIPGWAARADLKVNGKVENMDL-RPGTYAELNRKWKAGDLVEL 562

Query: 598 QLPL 601
            LP+
Sbjct: 563 VLPM 566


>gi|237719717|ref|ZP_04550198.1| six-hairpin glycosidase [Bacteroides sp. 2_2_4]
 gi|229450986|gb|EEO56777.1| six-hairpin glycosidase [Bacteroides sp. 2_2_4]
          Length = 668

 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 87/363 (23%), Positives = 142/363 (39%), Gaps = 72/363 (19%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
           L +LY +T D K+L  A  F      G+ + +  Y     +  H P+V     +G  +R 
Sbjct: 219 LVKLYLVTGDKKYLDQAKFFLDA--RGYTSRKDAY-----SQAHKPVVEQDEAVGHAVRA 271

Query: 348 E-----------VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADT 392
                       +TGD  Y K I   + +IV +   Y TGG  AR   E + +   L ++
Sbjct: 272 AYMYSGMADVAAITGDSSYIKAIDKIWDNIV-SKKIYVTGGIGARHAGEAFGNNYELPNS 330

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
             S   ETC     + ++  LF    +  Y D  ER L NG++S     + G   Y  PL
Sbjct: 331 --SAYCETCAAIGNVYMNYRLFLLHGDAKYFDVLERTLYNGLIS-GVSLDGGSFFYPNPL 387

Query: 453 GRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISSS 510
                 +R    W      F C C  + +  F   L   +Y  ++  V   Y+  Y+S+ 
Sbjct: 388 ASNGKYSRKP--W------FGCACCPSNVSRFIPSLPGYVYAVKDNQV---YVNLYLSNK 436

Query: 511 FDW--KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW--------- 559
            +       VVL Q+      W+  +R+ +      +  Q  +L LR+P W         
Sbjct: 437 AELIVNKKKVVLEQETG--YPWNGDIRVKVA-----QGNQEFALKLRIPGWVRNEVLPSG 489

Query: 560 --TYSNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR----TEAIQ 609
             +Y++  + +    +NGQ         +LS   +W   D + I   +  R     E + 
Sbjct: 490 LYSYADNQKPTYRIIVNGQETANTLNNGYLSIERKWKKGDVVKIHFDMLPRIVKANEKVV 549

Query: 610 DDR 612
           DD+
Sbjct: 550 DDK 552


>gi|218260015|ref|ZP_03475494.1| hypothetical protein PRABACTJOHN_01155 [Parabacteroides johnsonii
           DSM 18315]
 gi|218224798|gb|EEC97448.1| hypothetical protein PRABACTJOHN_01155 [Parabacteroides johnsonii
           DSM 18315]
          Length = 665

 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 82/353 (23%), Positives = 122/353 (34%), Gaps = 69/353 (19%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L +LY +T   K+L LA  F DK    G+   +  Y     +  H P++     +G  +R
Sbjct: 227 LCKLYLVTGQKKYLDLAKFFLDK---RGYTERKDAY-----SQAHKPVLEQDEAVGHAVR 278

Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +TGD  Y        + V     Y TGG  A           +  G 
Sbjct: 279 AAYMYSGMADVAALTGDTGYVHAIDRIWENVVTKKLYITGGIGATNN-------GEAFGK 331

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     +  +  LF    E  Y D  ER L NG++S     E     
Sbjct: 332 NYELPNLSAYCETCAAIGNVYWNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLEGNGFF 390

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  PL       R        +    CC          L   IY   + NV   Y+  ++
Sbjct: 391 YPNPLASTGQHQRKP------WFGCACCPSNICRFIPSLPGYIYAVHDKNV---YVNLFM 441

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT------- 560
           S+S D K G   L         WD  +R+ +    KQ+     +L +R+P W        
Sbjct: 442 SNSSDLKVGGKSLKLTQSTGYPWDGDVRLDVAPKGKQDF----TLKIRVPGWVRGEVVPS 497

Query: 561 ----YSNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
               +S+G Q      +NG+ +       + S T +W   D + +   +  RT
Sbjct: 498 DLYMFSDGKQLGYSVKVNGEPVESNLDKGYFSITRQWKKGDVVEVHFDMEPRT 550


>gi|261878820|ref|ZP_06005247.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
 gi|270334561|gb|EFA45347.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
          Length = 819

 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 86/352 (24%), Positives = 137/352 (38%), Gaps = 64/352 (18%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
            L +LY  T + K+L  A  F    + G   ++ +Y     + +H P+V     +G  +R
Sbjct: 223 ALCKLYLATGNRKYLDQAKFFLD--YRGKTTIRQEY-----SQSHKPVVEQDEAVGHAVR 275

Query: 347 YE-----------VTGDPLY-KLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLAD 391
                        +TGD  Y K I   + +IV     Y TGG   TS  E +     L +
Sbjct: 276 AAYMYAGMADVAALTGDADYIKAIDRIWDNIV-GKKLYITGGIGATSNGEAFGKNYELPN 334

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
              S   ETC     + V+  LF    E  Y D  ER+L NG++S     + G   Y  P
Sbjct: 335 M--SAYAETCAAIGNVYVNYRLFLLHGESKYFDVLERSLYNGLIS-GVSMDGGGFFYPNP 391

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           L     ++   H     F    CC          L   +Y  ++ N   LY+  ++S+S 
Sbjct: 392 L-----ESMGQHQRQAWFGCA-CCPSNICRFLPSLPGYVYAVKDNN---LYVNLFLSNSA 442

Query: 512 DWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT--------- 560
             K    +V L Q  +    WD  + + +  +     G    L +R+P W          
Sbjct: 443 TMKVNGKNVSLTQSTN--YPWDGDIAIRVDRNKAGSFG----LKIRIPGWIKGQPVPSDL 496

Query: 561 --YSNGAQAS----LNGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
             YS+G + +    +NG+ + P      + +   RW   D +TI   + +RT
Sbjct: 497 YYYSDGKRPNYTILVNGKAIEPTITDDGYCTINRRWKKGDVVTIHFDMEVRT 548


>gi|189462782|ref|ZP_03011567.1| hypothetical protein BACCOP_03480 [Bacteroides coprocola DSM 17136]
 gi|189430398|gb|EDU99382.1| hypothetical protein BACCOP_03480 [Bacteroides coprocola DSM 17136]
          Length = 578

 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 51/227 (22%), Positives = 87/227 (38%), Gaps = 33/227 (14%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC     +  +  +F   K+  Y D  E AL N VL+     +     Y+ PL    + 
Sbjct: 109 ETCAAVGNVMFNYRMFLTKKDARYVDVAEVALYNNVLA-GVNLDGNKFFYVNPLE---AD 164

Query: 459 ARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS--FD 512
           AR+    G K  S W    CC         ++   +Y   + ++   Y   Y  +S    
Sbjct: 165 ARNAFNQGLKGRSPWFGTACCPSNIARLIPQIPGMMYAHTDNDI---YCTFYAGTSTVVP 221

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-----------Y 561
              G V + Q  +    +D  +R  +     ++  Q  +++ R+P W            Y
Sbjct: 222 LSDGKVTIKQTTN--YPFDESVRFEI---KPEQSKQKFAMHFRIPTWAGKQFVPGKLYHY 276

Query: 562 SNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            N   A     LNG+ + + P   F++    W   D + +QLP+ +R
Sbjct: 277 LNDKPAEWKVLLNGKEVSVKPHKGFVTIERAWKSGDLVELQLPMLVR 323


>gi|266624999|ref|ZP_06117934.1| putative cytoplasmic protein, partial [Clostridium hathewayi DSM
           13479]
 gi|288863113|gb|EFC95411.1| putative cytoplasmic protein [Clostridium hathewayi DSM 13479]
          Length = 323

 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 50/234 (21%), Positives = 96/234 (41%), Gaps = 20/234 (8%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC +  ++  +R + +   +  YAD  ER L NGVLS     +     Y+ PL   V +
Sbjct: 8   ETCASVGLVFFARRMLQIRPDAQYADVMERVLYNGVLS-GMALDGKSFFYVNPL-EVVPE 65

Query: 459 A-----RSTHGWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           A     R +H    +   F   CC        S +G   Y E+E  +   +I  YI +  
Sbjct: 66  ACHRDERKSHVKPVRQKWFGCACCPPNVARLLSSVGSYAYTEKEDTI---FIHLYIGAIL 122

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
             +     +  K+     W+  + + +     + V ++ ++   +P W  +    + +NG
Sbjct: 123 KKQINGKEMEVKIQSEFPWNGKVNVYV-----KGVREVCTIAFHIPEWGEAYQL-SKING 176

Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
             + +     +L  T++W   +++ +Q P+ +R         E     A++ GP
Sbjct: 177 ATIKVKE--RYLYVTKKWEEEEEIHLQFPMEVRLIEANPFVRENIGKNAVMRGP 228


>gi|325282251|ref|YP_004254793.1| hypothetical protein Odosp_3669 [Odoribacter splanchnicus DSM
           20712]
 gi|324314060|gb|ADY34613.1| protein of unknown function DUF1680 [Odoribacter splanchnicus DSM
           20712]
          Length = 796

 Score = 44.7 bits (104), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 118/320 (36%), Gaps = 59/320 (18%)

Query: 372 YATGGTSAREFWWDPKRLADTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYA 423
           Y TGG  AR +        +  G   E        ETC + + +  +  LF  T E  Y 
Sbjct: 309 YITGGIGARAW-------GEGFGENYELPNMTSYCETCASISNVYWNYRLFLLTGESKYY 361

Query: 424 DYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTGIES 482
           D  ERAL NGV+S     +     Y  PL    S  RS   W      F C C  + I  
Sbjct: 362 DVLERALYNGVIS-GVSLDGKRYFYDNPLMSDGSHDRSE--W------FGCSCCPSNITR 412

Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
           F        +   GN   L++  Y+ +          +  K +    W+  +++TL  S 
Sbjct: 413 FMPSIPGYVYAVRGNT--LFVNLYMGNEGQITLEGQPVRIKQETRYPWEGRIKLTLDHSP 470

Query: 543 KQEVGQLSSLNLRMPVW-----------TY----SNGAQASLNGQNLPLPPPGNFLSATE 587
                   +L LR+P W           TY    +     SLNG+ +       +     
Sbjct: 471 ASSF----TLALRIPGWVQQQPLPGTLYTYLDKDTPSYTISLNGKTVKPEVRNGYALLRG 526

Query: 588 RWSYNDKLTIQLPLSLRT----EAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTA 643
            W  ND++ + LP+ +R       + DDR +Y    A+++GP +     S       G A
Sbjct: 527 DWKGNDQIVLNLPMQVRKVIADPQVIDDRNKY----ALIYGPIVYCVEASDH----DGYA 578

Query: 644 RSL-SALISPIPPSFNAQLV 662
             L +   +P  P F   L+
Sbjct: 579 LDLFTEEDTPFSPEFKPDLL 598


>gi|261341800|ref|ZP_05969658.1| hypothetical protein ENTCAN_08284 [Enterobacter cancerogenus ATCC
           35316]
 gi|288316173|gb|EFC55111.1| putative cytoplasmic protein [Enterobacter cancerogenus ATCC 35316]
          Length = 651

 Score = 44.7 bits (104), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 78/364 (21%), Positives = 127/364 (34%), Gaps = 73/364 (20%)

Query: 293 LYRLYSITHDPKHLLLAHLFDK-----PCFLGFLALQADYLSHFH-------------AN 334
           L RLY +T +P+++ L + F +     P F      +    SH+H             + 
Sbjct: 193 LMRLYDVTQEPRYMALVNYFIEARGTTPHFYDIEYEKRGRTSHWHNYGPAWMVKDKAYSQ 252

Query: 335 THIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG-- 376
            H P+      IG  +R+            ++ D   +         +     Y TGG  
Sbjct: 253 AHQPLSEQQTAIGHAVRFVYLMAGMAHLARLSNDDGKRQDCLRLWRNMAQRQLYITGGIG 312

Query: 377 --TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
             +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N V
Sbjct: 313 SQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMETDSQYADVMERALYNTV 369

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGI 480
           L      +     Y+ PL          H     FN  +              CC     
Sbjct: 370 LG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIYDHVKPVRQRWFGCACCPPNIA 420

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
              + LG  IY         L+I  Y+ +      G   L  ++     W  + ++ +  
Sbjct: 421 RVLTSLGHYIYTLHPET---LFINLYVGNDIAVPVGDQQLQLRISGNYPW--HEQVNIEI 475

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
           +S   V    +L LR+P W      + SLNG  +       +L     W   D LT+ LP
Sbjct: 476 ASPVPVTH--TLALRLPDWC--ENPEVSLNGAAVTGEVSRGYLYLRRSWQEGDVLTLTLP 531

Query: 601 LSLR 604
           + +R
Sbjct: 532 MPVR 535


>gi|256420772|ref|YP_003121425.1| hypothetical protein Cpin_1728 [Chitinophaga pinensis DSM 2588]
 gi|256035680|gb|ACU59224.1| protein of unknown function DUF1680 [Chitinophaga pinensis DSM
           2588]
          Length = 675

 Score = 44.7 bits (104), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 41/169 (24%), Positives = 71/169 (42%), Gaps = 26/169 (15%)

Query: 444 GVMIYMLPLGR---GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
           GV  + LP  R    V  ARS          + CC     + ++K    ++++  G   G
Sbjct: 384 GVFNFSLPFDREMCNVLGARS---------GYTCCLANMHQGWTKYTSHLWYQTSGK--G 432

Query: 501 LYIIQY----ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
           +  ++Y    +++    K   V + +  D    ++  +R  +    + E      L LR+
Sbjct: 433 VAALEYGPCVMTAEVGKKHRDVTITEVTD--YPFNEEIRFQIAIKKETEF----PLQLRI 486

Query: 557 PVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           P W   N A   LNGQ L     G  ++    W   D+LT+QLP+++ T
Sbjct: 487 PAW--CNEAVILLNGQPLRKDKGGQIITIEREWQDKDELTLQLPMTITT 533


>gi|399041428|ref|ZP_10736483.1| hypothetical protein PMI09_04045 [Rhizobium sp. CF122]
 gi|398060198|gb|EJL52027.1| hypothetical protein PMI09_04045 [Rhizobium sp. CF122]
          Length = 640

 Score = 44.7 bits (104), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 77/348 (22%), Positives = 131/348 (37%), Gaps = 53/348 (15%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFHANT------HIPI 339
            L +L  +T + K+L LA  F      +P F    A++     + FH  T      H P+
Sbjct: 198 ALVKLARVTGEKKYLDLAKFFIDERGTEPNFFTEEAIRDGRDAADFHQKTYEYGQAHEPV 257

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    +A
Sbjct: 258 REQKKVVGHAVRAMYLYSGMADIATEYNDDSLTGALETLWDDLTT-KQMYVTGGIGPAAA 316

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   S   ETC +  ++  +  +        YAD  E+AL NG ++   
Sbjct: 317 NEGFTDYYDLPNE--SAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GL 373

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
             +     Y  PL      A   H W   ++   CC        + +G  +Y   E  + 
Sbjct: 374 SLDGKTFFYENPL----ESAGKHHRW--IWHHCPCCPPNIARLLASIGSYMYGVAEDEI- 426

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            +++     + F      V L QK      W   +   +  S   +     +++LR+P W
Sbjct: 427 AVHLYGEGRARFKMAGADVALTQKTR--YPWHGAVHFDIKTSKPAQF----AVSLRIPGW 480

Query: 560 TYSNGAQASLNGQNLPLP--PPGNFLSATERWSYNDKLTIQLPLSLRT 605
             +NGA  ++NG+ + +       +      W   DK+ + +PL  R+
Sbjct: 481 --ANGATLAVNGEAIDIGSVDVDGYARIEREWRDGDKIDLDIPLEARS 526


>gi|448391565|ref|ZP_21566711.1| hypothetical protein C477_10858 [Haloterrigena salina JCM 13891]
 gi|445665886|gb|ELZ18561.1| hypothetical protein C477_10858 [Haloterrigena salina JCM 13891]
          Length = 637

 Score = 44.7 bits (104), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 53/269 (19%), Positives = 102/269 (37%), Gaps = 32/269 (11%)

Query: 350 TGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE----ETCTTY 404
           TGD  LY  +   + ++     +Y TGG  +       +R  D     N     ETC   
Sbjct: 269 TGDRELYDQLQALWRNMTE-RRTYVTGGIGSTHH---GERFTDDYDLPNRTSYAETCAAV 324

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK------ 458
             +  +  +F+ + ++ Y +  ER L NG L+     +     Y  PL  G         
Sbjct: 325 GSVFWNHRMFQLSGDVQYPELVERTLYNGFLA-GLSLDATEFFYANPLEVGPDGHALADE 383

Query: 459 -----ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
                +    GW   F+   CC        + LG  IY     + P +Y+ Q++ S    
Sbjct: 384 NPDRFSNQRQGW---FDCA-CCPPNAARLIASLGRYIY-ARATDEPAVYVNQFVGSEAAL 438

Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
                 +  + +  + W   + +T+  +   +     +L +R+P W   +   A++ G++
Sbjct: 439 TIDDTDVRLRQESALPWAGDVTLTVDPAEPTDF----ALRVRVPEW--CSDVTATVAGES 492

Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLS 602
             + P   ++     W   D+LT+   ++
Sbjct: 493 RSVEPDDGYIEVAREWEDGDELTVTFGMA 521


>gi|262382782|ref|ZP_06075919.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
 gi|262295660|gb|EEY83591.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
          Length = 618

 Score = 44.7 bits (104), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 49/229 (21%), Positives = 94/229 (41%), Gaps = 21/229 (9%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPL-GRGV 456
           ETC +  M+  ++ + + T +  Y D  ER+L NG L+ I  G +     Y+ PL  +G 
Sbjct: 336 ETCASVGMVLWNQRMNQLTGDSKYIDVLERSLYNGALAGISLGGDR--FFYVNPLESKGD 393

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
              +  +G         CC          +G+ IY   +     L++  YI ++   + G
Sbjct: 394 HHRQEWYGCA-------CCPSQLSRFLPSIGNYIYASSD---DALWVNLYIGNTGQIRIG 443

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
              +    +    WD  +++T++ S   E      + LR+P W  +     S+NG+ + +
Sbjct: 444 ETDIQLTQETDYPWDGSVKLTISTSQPLE----KEIRLRIPNWCKT--YDLSINGKRINV 497

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
                + +  + W   D + + + + +   A      E    +AI  GP
Sbjct: 498 SEEKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFGKRAIQRGP 545


>gi|255035900|ref|YP_003086521.1| hypothetical protein Dfer_2133 [Dyadobacter fermentans DSM 18053]
 gi|254948656|gb|ACT93356.1| protein of unknown function DUF1680 [Dyadobacter fermentans DSM
           18053]
          Length = 673

 Score = 44.7 bits (104), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 109/486 (22%), Positives = 173/486 (35%), Gaps = 87/486 (17%)

Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD----------SF 219
             A A ++A+T +  + E M   +  +++ Q K G  Y  A   +  +          SF
Sbjct: 107 FEAVASLYAATKDPKLDELMDKTIAVIAKAQRKDGYIYTKAIIEQKQNGEGKMFADRLSF 166

Query: 220 EALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
           EA        Y    ++      Y        L +A    ++       +IT Y      
Sbjct: 167 EA--------YNFGHLMTAACVHYRATGKTSLLDVAKKAADF-------LITFYGAATPE 211

Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHL-LLAHLF----------DKPCFLGFLALQADYL 328
            S N         L  LY  THD K+L L+ HL           D    + FL  Q   +
Sbjct: 212 QSRNAICPAHYMGLSELYRTTHDEKYLTLVKHLIAIKGATEGTDDNQDRIPFLK-QTKVM 270

Query: 329 SH-FHANTHIPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
            H   AN    +  G    Y  TGD  L   + T + D+      Y TGG  A      P
Sbjct: 271 GHAVRANY---LYAGVADVYAETGDEALLAQLHTMWDDVTQ-HKMYVTGGCGALYDGTSP 326

Query: 387 ----------KRLADTLGSE--------NEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
                     +++    G +        + ETC     +  +  + + T E  YAD  E 
Sbjct: 327 DGTSYKPDEVQKIHQAYGRDYQLPNFTAHNETCANIGNVLWNWRMLQITGEAKYADIVEL 386

Query: 429 ALTNGVLS--IQRG-----TEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
           AL N VLS    +G     T P      LP  +   K R    + +K N   CC    + 
Sbjct: 387 ALYNSVLSGISLKGDKFLYTNPLAYSDALPFKQRWEKDR--QAYISKSN---CCPPNTVR 441

Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KSGHVVLNQKVDPIVSWDPYLRMTLT 539
           + +++    Y   +    G++   Y  + F    K G + L Q  D    W+  + +TL 
Sbjct: 442 TVAEVSQYAYSLSDA---GVFFNLYGGNKFQTAVKGGQLQLTQVTD--YPWNGKISITLD 496

Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQ 598
            + K  +    SL  R+P W   + A   +NG+        G++      W   DK+ + 
Sbjct: 497 QAPKDAL----SLFFRIPGW--CSNASMVINGKKETAKLASGSYAELRRTWKSGDKIELM 550

Query: 599 LPLSLR 604
           L + ++
Sbjct: 551 LEMPVK 556


>gi|338730906|ref|YP_004660298.1| hypothetical protein Theth_1126 [Thermotoga thermarum DSM 5069]
 gi|335365257|gb|AEH51202.1| protein of unknown function DUF1680 [Thermotoga thermarum DSM 5069]
          Length = 621

 Score = 44.3 bits (103), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 83/361 (22%), Positives = 132/361 (36%), Gaps = 47/361 (13%)

Query: 332 HANTHIPIVIGSQMRY-EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW---WDPK 387
           HA   + +  G+   Y E  G  ++K +   + D+      Y TGG  +R  W    +P 
Sbjct: 249 HAVRMLYLCCGATDLYLETEGKAIWKTLENLWKDMT-TRKMYITGGVGSRHDWESIGEPY 307

Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
            L +       ETC        +  +F  + E  + D  E+ + NG+LS     +     
Sbjct: 308 ELPNRRAYA--ETCAAIANFMWNYRMFLASGEARFVDVMEQVVYNGLLS-GISLDGDKYF 364

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
           Y  PL             GTK    W    CC      + + L   IY + +     L++
Sbjct: 365 YDNPL----------EDMGTKRRQRWFDCACCPPNIARTIASLPHYIYAQSKDK---LWV 411

Query: 504 IQYISSSFDWKSGHVVLN--QKVDPIVSWDPYLRM----TLTFSSKQEVGQLSSLNLRMP 557
             Y SS+F      V +   Q+ D   S D ++R+    TL+F          +L LR+P
Sbjct: 412 NLYESSTFKIIHNDVPIEIVQQTDYPWSGDVHIRIAARETLSF----------TLLLRIP 461

Query: 558 VWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR--PEY 615
            W  S      LNG+++       +      W   +   +QL L LR E +Q      E 
Sbjct: 462 EW--SADFDLKLNGKSVKFHLNNGYAELQNSWKGTN--NVQLTLKLRPECLQSHPYVSEN 517

Query: 616 ASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
               A+  GP L         D    T +  S     +P     + + F   +G +T + 
Sbjct: 518 HGKVAVRSGPVLYCIEQVDNPDFDIWTLKIDSDSFEMVPGEILGKRMFFLLGNGKATNIR 577

Query: 676 S 676
           S
Sbjct: 578 S 578


>gi|338212418|ref|YP_004656473.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336306239|gb|AEI49341.1| protein of unknown function DUF1680 [Runella slithyformis DSM
           19594]
          Length = 618

 Score = 44.3 bits (103), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 58/282 (20%), Positives = 109/282 (38%), Gaps = 22/282 (7%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR-GVS 457
           ETC +  M+  ++ +  ++ E  Y D  ER+L NG L+  + T   +  Y+ PL   G+ 
Sbjct: 331 ETCASVGMVFWNQRMNLYSGEAKYVDVLERSLYNGALAGVQLT-GNLFFYVNPLASFGLH 389

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
             R  +G         CC          +G  IY   E     L++  Y+ S  +   G+
Sbjct: 390 HRRPWYGTA-------CCPSNVSRLMPSVGGYIYNTSENT---LWVNLYVGSETEVMLGN 439

Query: 518 VVLNQKVDPIVSWDPYLRM-TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL-P 575
             +         W   + +  +  SSK +     +L LR+P W   +     +NG+ +  
Sbjct: 440 HKVKFAKKTNYPWAGEVEIKAIPDSSKADF----ALKLRIPAW--CDKYTVEINGKPVEK 493

Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTS 633
           L     +++    W+ ND L +++ + ++  A           +AI  GP  Y +    +
Sbjct: 494 LTVDKGYVTVARTWAKNDVLKLRMDMPVKVVAADPRVKANEGKRAIQRGPLVYCVEEQDN 553

Query: 634 GEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
              D         +   +   P+    + T   ++GN  F +
Sbjct: 554 RHLDYDQILLSKKTQFSTTFEPTLLGGVTTIKAQNGNENFTL 595


>gi|424870152|ref|ZP_18293818.1| hypothetical protein Rleg5DRAFT_7481 [Rhizobium leguminosarum bv.
           viciae WSM1455]
 gi|393171573|gb|EJC71619.1| hypothetical protein Rleg5DRAFT_7481 [Rhizobium leguminosarum bv.
           viciae WSM1455]
          Length = 647

 Score = 44.3 bits (103), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 105/483 (21%), Positives = 179/483 (37%), Gaps = 97/483 (20%)

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKP-- 224
           G ++ A++    +  +A ++ K+  +V  L + Q  +  GYL+++       F   +P  
Sbjct: 90  GKWIEAASYTLKAHPDAALEAKIDAIVEKLEKGQ--MADGYLNSW-------FIRREPDR 140

Query: 225 VWAPYYTIHKI--LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER---HW 279
            W     +H++  +  LL+  V    A   +     ++     V  +I  +  E      
Sbjct: 141 RWTNLRDLHEMYSMGHLLEGAVAYREATGKRR---FLDVMIRAVDHIIATFGAEPGKLRG 197

Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQAD-------Y 327
           Y  +EE   +   L +LY +T DP+HL LA  F       P +    A +         Y
Sbjct: 198 YDAHEE---IELALVKLYRVTRDPRHLKLATYFVDERGRMPSYYDEEARKRGESPDDYVY 254

Query: 328 LSHFHANTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHS 371
            ++ ++  H+P+     V+G  +R                DP  K       D +     
Sbjct: 255 KTYAYSQAHMPVRDQHQVVGHAVRAMYLFSAMADLSHENDDPTLKEACNRLFDNLVGRQL 314

Query: 372 YATGGTS--------AREFWWDPKRLADTLGSEN--EETCTTYNMLKVSRHLFRWTKEIA 421
           Y TGG           REF          L +E    ETC    +   S  + +   +  
Sbjct: 315 YVTGGLGPSASNEGFTREF---------DLPNETAYAETCAAVALGFWSHRMAQVDLDSK 365

Query: 422 YADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTG 479
           + D  E  L NG LS I R  E      +L           +HG   ++   +C C  T 
Sbjct: 366 FTDRLETVLYNGALSGISRDGEHYFYENVL----------ESHGQHRRWKWHYCPCCPTN 415

Query: 480 IESF-SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY--LRM 536
           I  F + LG   Y  ++     L +  Y ++S +   G+  +    + +  WD    LR 
Sbjct: 416 IARFITSLGQYFYSTDDHQ---LAVHLYGTNSAELTVGNSFVRLIQETLYPWDGDIGLRF 472

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDK 594
            L   S+ +      L LR+P W     AQ S+NG  + L       + + +  W   D+
Sbjct: 473 ALERPSRFQ------LRLRIPGWCRQ--AQISVNGVAVDLDQCVTKGYAAISREWRNGDE 524

Query: 595 LTI 597
           + I
Sbjct: 525 VRI 527


>gi|380510716|ref|ZP_09854123.1| hypothetical protein XsacN4_05853 [Xanthomonas sacchari NCPPB 4393]
          Length = 660

 Score = 44.3 bits (103), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 45/211 (21%), Positives = 79/211 (37%), Gaps = 18/211 (8%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E+C +  ++  +  + +   +  YAD  ERAL N VL      +     Y+ PL      
Sbjct: 339 ESCASIGLMMFANRMLQLAPDGRYADVMERALYNTVLG-GMALDGRHFFYVNPLEVHPPT 397

Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
               H +         W    CC        + LG  +Y   +     LY+  Y+ S   
Sbjct: 398 LHGNHTFDHVKPVRQRWFGCACCPPNIARVLTSLGHYLYTRHDDT---LYVNLYVGSDAR 454

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
           ++ G  +L  +      W   +   +  S+  +    ++L LR+P W      Q  LNG+
Sbjct: 455 FEVGGQILTLRQRGEYPWQDTIDFDVACSAPMD----AALALRLPDWC--QAPQLLLNGE 508

Query: 573 NLPLPP--PGNFLSATERWSYNDKLTIQLPL 601
            + +       +     RW   D L ++LP+
Sbjct: 509 PVAIEAHRQHGYCVLRRRWQSGDTLQLRLPM 539


>gi|148269779|ref|YP_001244239.1| hypothetical protein Tpet_0643 [Thermotoga petrophila RKU-1]
 gi|147735323|gb|ABQ46663.1| protein of unknown function DUF1680 [Thermotoga petrophila RKU-1]
          Length = 620

 Score = 44.3 bits (103), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 72/341 (21%), Positives = 138/341 (40%), Gaps = 50/341 (14%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD-----YLSHF----------HANTHI 337
           L  LY  T D K+L LA  F      G  ++  +     ++ H           HA   +
Sbjct: 196 LVELYRETGDRKYLDLARYFIYARGKGLASVPRNPGPEYFIDHKPFVELEEITGHAVRAL 255

Query: 338 PIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
            +  G+   Y  TGD  +++ +   + + V     Y TGG  +R  W       ++ G E
Sbjct: 256 YLCSGATDLYLETGDEKIWQALNRLWENFV-TKKMYITGGAGSRHDW-------ESFGEE 307

Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
            E        E+C +      +  +   T E  +AD  E+ L NG+LS     +     Y
Sbjct: 308 YELSNRRSYAESCASIANFMWNFRMLLATGEGKFADVMEQVLYNGLLS-GISLDGKHYFY 366

Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
             PL   + + R       K+    CC        +     +Y   +  V  +++ +  +
Sbjct: 367 FNPL-EDLGRTRR-----QKWFDCACCPPNLARFIASFPGYMYTTSDDGVQ-VHLYEKST 419

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
           S  ++K+  V + Q+ D    W       +TF+ + ++ +  S++LR+P W  ++     
Sbjct: 420 SKLNFKNSVVEIEQETD--YPWSG----EVTFTVETDIEEPFSISLRIPSW--ADDFVLR 471

Query: 569 LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
           ++G+ +   P   ++  ++ W    K T++L L ++ E I+
Sbjct: 472 VDGKTVTANPQNGYVKLSQSW--KGKHTVELSLPMKVEFIE 510


>gi|374984436|ref|YP_004959931.1| hypothetical protein SBI_01679 [Streptomyces bingchenggensis BCW-1]
 gi|297155088|gb|ADI04800.1| hypothetical protein SBI_01679 [Streptomyces bingchenggensis BCW-1]
          Length = 666

 Score = 44.3 bits (103), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 60/258 (23%), Positives = 97/258 (37%), Gaps = 15/258 (5%)

Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADTLGSENEETCTTYNM 406
           TGDP  +       + + A+ +Y TGG  +R   E + D   L         ETC     
Sbjct: 289 TGDPGLREALVRLWEDMAATKTYLTGGVGSRHDLEAFGDAYELPPD--RAYAETCAAIAS 346

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
           ++    +   T E  Y+D  ER L NG LS     +    +Y+ PL      A      G
Sbjct: 347 IQFGWRMALLTGEARYSDLVERTLYNGFLS-GVSLDGNRWLYVNPLQVREDYAGPHGDQG 405

Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDP 526
            +   ++ C          L    ++   G+  GL + QY S S+    G V    +V  
Sbjct: 406 ARRTEWFRCACCPPNVMRLLASLPHYVASGDADGLQLHQYASGSYAAGGGAV----RVGT 461

Query: 527 IVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSAT 586
              W+   R+ +        G   +L+LR+P W    G   ++ G+ +       +L   
Sbjct: 462 GYPWE--GRIAVVVDEVPGDGDW-TLSLRIPHWADEYG--VTVGGEPVAARAESGWLRLR 516

Query: 587 ERWSYNDKLTIQLPLSLR 604
             W   + + + LPL  R
Sbjct: 517 RHWRPGETVVLALPLRPR 534


>gi|343085566|ref|YP_004774861.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342354100|gb|AEL26630.1| protein of unknown function DUF1680 [Cyclobacterium marinum DSM
           745]
          Length = 690

 Score = 43.9 bits (102), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 59/250 (23%), Positives = 106/250 (42%), Gaps = 23/250 (9%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG--VMIYMLPLGRGV 456
           ETC     +  +  +   T +  +AD  E +L N VLS   GT+ G     Y  PL R  
Sbjct: 373 ETCANIGNVLWNHRMLLVTGDSRFADILELSLFNSVLS---GTDLGGTNFNYTNPL-RVD 428

Query: 457 SKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSF 511
                T  W      +     CC    + + ++  +  Y   + G V  LY    + +S 
Sbjct: 429 KDLPFTFRWNKVREPYISKSNCCPPNVVRTVAETHNYAYALSDNGLVVNLYGSNELKTSL 488

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVWTYSNGAQASLN 570
              S  + L Q+ D    WD  +++++     Q+ GQ   +++LR+P W  ++ A+ ++N
Sbjct: 489 PNGSS-LELKQETD--YPWDGKIKLSI-----QKTGQDPLAIDLRVPAW--ASQAEITVN 538

Query: 571 GQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           G+     P  G++ S   +W   D + + LP++ R         E  +  A++ GP +  
Sbjct: 539 GEKSKEKPIAGSYFSLVRQWEKGDVIELNLPMTARLMEANPLVEETRNQVAVVRGPIVYC 598

Query: 630 GHTSGEWDIK 639
             +S   D +
Sbjct: 599 IESSDLQDAR 608


>gi|421598168|ref|ZP_16041640.1| hypothetical protein BCCGELA001_11816 [Bradyrhizobium sp.
           CCGE-LA001]
 gi|404269708|gb|EJZ33916.1| hypothetical protein BCCGELA001_11816 [Bradyrhizobium sp.
           CCGE-LA001]
          Length = 276

 Score = 43.9 bits (102), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 37/154 (24%), Positives = 61/154 (39%), Gaps = 9/154 (5%)

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC       F+ +G  IY         LY+  YI +S     G   L  +++    W+  
Sbjct: 39  CCPPNIARLFTSVGHYIYTPRS---EALYVNLYIGNSVAIAVGGHTLRLRMNGNYPWEDL 95

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
           + + +   S+Q +    +L LR+P W   +  +  LNG+ +   P   +L     W   D
Sbjct: 96  VEIAV--ESEQPITH--TLALRLPEWC--SAPEVKLNGEPVNCEPRKGYLHIHRTWRKGD 149

Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
           +  +QLP+  R           A   AI  GP +
Sbjct: 150 RCKLQLPMKSRRVYGHPQLRHLAGKVAIQRGPLI 183


>gi|256838374|ref|ZP_05543884.1| conserved hypothetical protein [Parabacteroides sp. D13]
 gi|256739293|gb|EEU52617.1| conserved hypothetical protein [Parabacteroides sp. D13]
          Length = 618

 Score = 43.9 bits (102), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 51/231 (22%), Positives = 97/231 (41%), Gaps = 25/231 (10%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPL-GRGV 456
           ETC +  M+  ++ + + T +  Y D  ER+L NG L+ I  G +     Y+ PL  +G 
Sbjct: 336 ETCASVGMVLWNQRMNQLTGDSKYIDVLERSLYNGALAGISLGGDR--FFYVNPLESKGD 393

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
              +  +G         CC          +G+ IY   +     L++  YI ++   + G
Sbjct: 394 HHRQEWYGCA-------CCPSQLSRFLPSIGNYIYASSD---DALWVNLYIGNTGQIRIG 443

Query: 517 H--VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
              ++L Q+ D    WD  +++T++ S   E      + LR+P W  +     S+NG+ +
Sbjct: 444 ETDILLTQETD--YPWDGSVKLTISTSQPLE----KEIRLRIPNWCKT--YDLSINGKRI 495

Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
            +     + +  + W   D + + + + +   A      E    + I  GP
Sbjct: 496 NVSEEKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFGKRVIQRGP 545


>gi|281412335|ref|YP_003346414.1| hypothetical protein Tnap_0910 [Thermotoga naphthophila RKU-10]
 gi|281373438|gb|ADA67000.1| protein of unknown function DUF1680 [Thermotoga naphthophila
           RKU-10]
          Length = 620

 Score = 43.9 bits (102), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 72/342 (21%), Positives = 138/342 (40%), Gaps = 50/342 (14%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD-----YLSHF----------HANTH 336
            L  LY  T D K+L LA  F      G  ++  +     ++ H           HA   
Sbjct: 195 ALVELYRETGDRKYLDLARYFIYTRGKGLASVPRNPGPEYFIDHKPFVELEEITGHAVRA 254

Query: 337 IPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
           + +  G+   Y  TGD  +++ +   + + V     Y TGG  +R  W       ++ G 
Sbjct: 255 LYLCSGATDLYLETGDEKIWQALNRLWENFV-TKKMYITGGAGSRHDW-------ESFGE 306

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
           E E        E+C +      +  +   T E  +AD  E+ L NG+LS     +     
Sbjct: 307 EYELSNRRSYAESCASIANFMWNFRMLLATGEGKFADVMEQVLYNGLLS-GISLDGKHYF 365

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  PL   + + R       K+    CC        +     +Y   +  V  +++ +  
Sbjct: 366 YFNPL-EDLGRTRR-----QKWFDCACCPPNLARFIASFPGYMYTTSDDGVQ-VHLYEKS 418

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
           +S  ++K+  V + Q+ D    W       +TF+ + ++ +  S++LR+P W  ++    
Sbjct: 419 TSKLNFKNSVVEIEQETD--YPWSG----EVTFTVETDIEEPFSISLRIPSW--ADDFVL 470

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            ++G+ +   P   ++  ++ W    K T++L L ++ E I+
Sbjct: 471 RVDGKTVTANPQNGYVKLSQSW--KGKHTVELSLPMKVEFIE 510


>gi|271965305|ref|YP_003339501.1| hypothetical protein [Streptosporangium roseum DSM 43021]
 gi|270508480|gb|ACZ86758.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021]
          Length = 654

 Score = 43.9 bits (102), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 47/213 (22%), Positives = 84/213 (39%), Gaps = 25/213 (11%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC     + ++  L   T ++ YAD  ER + N VL+     E     Y  PL   V  
Sbjct: 304 ETCAGIGSIMLAHRLLLATGDVRYADLAERTMFN-VLATSPALEGRSFFYANPLHVRVPA 362

Query: 459 A-------RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
           A        +  G  + + +  CC      +++ L     +    +  G+ I  +  +  
Sbjct: 363 APPEGMNPAAEGGLRSPWFTVSCCPNNIARTYASLA---AYVATSDASGVQIHHHTPAEI 419

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
                H  L  +V+    W   + + +        G    ++LR+P W  ++GA+ S  G
Sbjct: 420 H----HEGLVLRVETGYPWSGEVTVRVVR------GGSGRISLRVPPW--ASGARISHGG 467

Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
              P+  P  +  A  RW   D++ + LP++ R
Sbjct: 468 TTRPV--PAGYAVAEGRWRPGDEIRLHLPMTPR 498


>gi|448418968|ref|ZP_21580124.1| hypothetical protein C474_15274 [Halosarcina pallida JCM 14848]
 gi|445675954|gb|ELZ28481.1| hypothetical protein C474_15274 [Halosarcina pallida JCM 14848]
          Length = 642

 Score = 43.9 bits (102), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 60/243 (24%), Positives = 90/243 (37%), Gaps = 50/243 (20%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGV 456
           ETC     +  ++ L   T E  YAD  ER L NG L+     GT      Y  PL    
Sbjct: 342 ETCAAIGSIFWNQRLLELTGEAKYADLIERTLYNGFLAGVSLDGTR---FFYENPLESSG 398

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII-QYISSSFDWKS 515
              R   GW T      CC       F+ LG  +Y     NV G+  + QY+ S+     
Sbjct: 399 DHHRK--GWFTCA----CCPPNAARLFASLGRYVY----SNVDGVLTVNQYVGSTVTTTV 448

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
           G   +       + W     +TLT  + + V     + LR+P W  +  A  S++G+   
Sbjct: 449 GGTEVELTQSSSLPWSG--EVTLTVDADEAV----PIRLRVPAW--ATDASVSIDGEEAE 500

Query: 576 LPPPGNFLSATERWSYNDKLTIQL-------------------------PLSLRTEAIQD 610
               G ++     W+  D++T++                          PL    EA+ +
Sbjct: 501 RSDDGAYVELDGEWN-GDRITVRFGQETELVRAHPAVESDAGRVAVERGPLVYCAEAVDN 559

Query: 611 DRP 613
           DRP
Sbjct: 560 DRP 562


>gi|241895790|ref|ZP_04783086.1| protein of hypothetical function DUF1680 [Weissella
           paramesenteroides ATCC 33313]
 gi|241870833|gb|EER74584.1| protein of hypothetical function DUF1680 [Weissella
           paramesenteroides ATCC 33313]
          Length = 655

 Score = 43.9 bits (102), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 111/530 (20%), Positives = 198/530 (37%), Gaps = 85/530 (16%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-----PTELFDSFE 220
           V  +L A+A  ++   +  +K+    ++  +++ Q+    GYLS +     P      F+
Sbjct: 86  VYKWLEAAAYSFSYHQDDNLKKMTDELIDLIADAQDD--DGYLSTYFQIDAPER---KFK 140

Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWY 280
            L+     Y   H I AG+   Y    N +AL++A  M +     + K   +   + H Y
Sbjct: 141 RLQQSHELYTMGHYIEAGVA-YYQATGNQKALQIAERMADC----IDKNFGLKDGQIHGY 195

Query: 281 SLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLG-----------FLALQ 324
             + E   +   L RL+  T + ++L LAH F       P F              +A  
Sbjct: 196 DGHPE---IELALARLFEATQEQRYLDLAHYFLNQRGQNPEFFDEQIKADGVDRDLIAGM 252

Query: 325 ADYLSHF---------------HANTHIPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNA 368
            D+   +               HA   + +  G  M    TGD  L      F+ DIV  
Sbjct: 253 RDFPRRYYQAAEPIKDQQTADGHAVRVVYLCTGMAMVARHTGDQELLAACKRFWNDIVK- 311

Query: 369 SHSYATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYAD 424
              Y TG     T+   F +D     DT+  E   TC +  M   ++ + +   +  Y D
Sbjct: 312 RRMYITGNIGSTTTGEAFTYDYDLPNDTMYGE---TCASVGMSFFAKEMLKIEAKGEYGD 368

Query: 425 YYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR----STHGWGTKFNSFW--CCYGT 478
             E+ L NG LS     +     Y+ PL    + ++     +H    + + F   CC   
Sbjct: 369 ILEKELFNGSLS-GMSLDGKHFFYVNPLEADPTASKLNPGKSHILTHRADWFGCACCPAN 427

Query: 479 GIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL 538
                + +   IY   +  +      Q+I++   +  G  V      P   W   ++  L
Sbjct: 428 LARLITSVDQYIYTVHDNTILSH---QFIANEASFSDGVTVTQTNNFP---WQGDIKYHL 481

Query: 539 TFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
             ++ +         +R+P W+  +    ++NGQN+       F+  T      D + I+
Sbjct: 482 ENANHKTY----QFGIRVPQWS-QDEFSVAVNGQNVDATIEDGFIYLTID---QDNVDIE 533

Query: 599 LPLSLRTEAIQDDRPEYASIQ--AILFGPYLLAGHTSGE----WDIKTGT 642
           L L++ T+ ++ +    A+    A+  GP + A   +      WD    T
Sbjct: 534 LTLNMATKLMRSNNRVKANFGQVAVTRGPLVYAAEEADNEAPLWDYHVNT 583


>gi|325103091|ref|YP_004272745.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324971939|gb|ADY50923.1| protein of unknown function DUF1680 [Pedobacter saltans DSM 12145]
          Length = 673

 Score = 43.5 bits (101), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 52/212 (24%), Positives = 90/212 (42%), Gaps = 23/212 (10%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRG------TEPGVMIYMLP 451
           ETC     +  +  + +   +  YAD  E AL N VLS I         T P      LP
Sbjct: 357 ETCANIGNVLWNWRMLQLEGDAKYADVMELALYNSVLSGISLDGKRFLYTNPLSYSDNLP 416

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
             +  SK R  +    K ++  CC    + + +++ +  Y    +G    LY    +S+ 
Sbjct: 417 FKQRWSKERVEY---IKLSN--CCPPNTVRTIAEVSNYAYSISNKGVYVNLYGSNNLSTK 471

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
            D  S   +  Q   P   W+  + +T++ S K       S+ +R+P W  +N A+ S+N
Sbjct: 472 LDDGSTIKLTQQTEYP---WEGRVAITISESKKSPF----SIFMRIPGW--ANSAKVSIN 522

Query: 571 GQNLPLP-PPGNFLSATERWSYNDKLTIQLPL 601
           G+++      G +L     W   D++ + LP+
Sbjct: 523 GKSVDADIKSGQYLELNRNWKKGDQIVLNLPM 554


>gi|302875896|ref|YP_003844529.1| hypothetical protein Clocel_3075 [Clostridium cellulovorans 743B]
 gi|307689330|ref|ZP_07631776.1| hypothetical protein Ccel74_14336 [Clostridium cellulovorans 743B]
 gi|302578753|gb|ADL52765.1| protein of unknown function DUF1680 [Clostridium cellulovorans
           743B]
          Length = 648

 Score = 43.5 bits (101), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 98/446 (21%), Positives = 162/446 (36%), Gaps = 62/446 (13%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF----------HANTHI 337
           L +LY +T++ K+L L+  F      +P +      + D +SHF          +   H 
Sbjct: 197 LVKLYDVTNNSKYLALSKYFIDQRGQEPNYFKEEYEKRDGVSHFLKTKIPLDLPYNQAHK 256

Query: 338 PI-----VIGSQMR--YEVTG----------DPLYKLIGTFFMDIVNASHSYATGGTSA- 379
           P+      +G  +R  Y  +G          + L K   T F +I +    Y TGG  + 
Sbjct: 257 PVREQEVAVGHAVRAVYMYSGMADIAAKTNDETLKKACETIFNNIKD-KQMYITGGVGST 315

Query: 380 ---REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
                F +D     DT+ SE   TC    ++  ++ + +  ++  YAD  ERAL N V S
Sbjct: 316 AHGEAFTYDYDLPNDTVYSE---TCAAIGLIFFAQRMLKLDQDRKYADVLERALYNTVTS 372

Query: 437 IQRGTEPGVMIYMLPL------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSI 490
                +     Y+ PL             R       K+    CC        + LG  I
Sbjct: 373 -GMALDGRHFFYVNPLEVQPEASEKSPIKRHVKAERQKWYGCACCPPNVARLLTSLGQYI 431

Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
           Y E    +   +   YI S  D+     V N+KV    + +       TF          
Sbjct: 432 YTESNDTI---FTHLYIGSKADF----TVNNKKVTVKQTTNYPSEGKATFVFDMSENNEF 484

Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
           +  LR+P W   N      N +   L     +L  T  +  +D + I + +     A   
Sbjct: 485 TFALRIPEWC-KNYKIFINNEEYRELDLNKGYLYITREFLNSDVVEISMEIETVLVASNP 543

Query: 611 DRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGN 670
                A   AI  GP +   +   E D     +  L     P+   +N +++    E   
Sbjct: 544 LVRANAGKVAICRGPLV---YCLEEIDNCKNLSSILIDTSKPVKEQYNPEVLGGAIELKA 600

Query: 671 STFVMSNSNQ----SITMEEFPVSGT 692
           S +++S+ +Q    S  ++E P + T
Sbjct: 601 SGYIVSSESQDLYTSFNVKEMPFNIT 626


>gi|410616495|ref|ZP_11327487.1| hypothetical protein GPLA_0709 [Glaciecola polaris LMG 21857]
 gi|410164204|dbj|GAC31625.1| hypothetical protein GPLA_0709 [Glaciecola polaris LMG 21857]
          Length = 659

 Score = 43.5 bits (101), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 48/221 (21%), Positives = 92/221 (41%), Gaps = 32/221 (14%)

Query: 399 ETCTTYNMLKVSRHLFRW-----TKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG 453
           ETC       +S  +F W     T E  +AD  E  L N  + +   TE     Y  PL 
Sbjct: 340 ETCAN-----ISNAMFNWRLLGITGEAKHADVIELVLHNSAM-VGISTEGDKYFYANPLR 393

Query: 454 RGVSKAR-STHGWGTK------FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
               +   S H   T+      +   +CC    + + +++    Y   +    GL +  +
Sbjct: 394 MNFGQREYSDHCDCTESPDREAYIECFCCPPNLVRTIAQVSAWAYSLTD---VGLAVNLF 450

Query: 507 ISSSFDWK---SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
            S++ + K      + L+Q+ D    WD  + + +    ++    L  + +R+P W  + 
Sbjct: 451 GSNALNTKLLDGSTLRLSQQTD--FPWDGKVALKI----EECKSALFDIQIRIPSW--AK 502

Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
           GA  S+NG+ +P+   G +     +W   D +T+ +P+ ++
Sbjct: 503 GATLSVNGETIPVVEAGQYTKIERQWQAGDNITLNMPMDIQ 543


>gi|423348680|ref|ZP_17326362.1| hypothetical protein HMPREF1060_04034 [Parabacteroides merdae
           CL03T12C32]
 gi|409213201|gb|EKN06225.1| hypothetical protein HMPREF1060_04034 [Parabacteroides merdae
           CL03T12C32]
          Length = 679

 Score = 43.5 bits (101), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 86/424 (20%), Positives = 162/424 (38%), Gaps = 43/424 (10%)

Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
           ++  ++ QY  A   Q  ++  +M  YF  ++ ++    +    W    E+ GG N  V+
Sbjct: 163 VMLKVMQQYYTA--TQDRRVIDFMTRYFRYQLDELPK--NPLGKWTFWGEQRGGDNLMVV 218

Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
           Y LY+IT D   L L  L  K  F    + L  ++L   H+   + +  G +   + Y+ 
Sbjct: 219 YWLYNITGDKFLLDLGELIHKQTFNWTDIFLNQNHLRRQHSLHCVNLAQGFKEPIVYYQQ 278

Query: 350 TGDPLYKLIGTFFMDIVNASHSYA--TGGTSAREFWWDPKRLADTLGSENEETCTTYNML 407
             D   K I      + +  H+    TG       W   + L     +   E CT   M+
Sbjct: 279 GKDS--KQIQATRQAVNDIRHTIGLPTG------LWGGDELLRFGKPTTGSELCTAVEMM 330

Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
                +   T ++ +ADY ER   N  L  Q   +     Y     + ++  R    + T
Sbjct: 331 YSLETILEVTGDMQWADYLERVAYNA-LPTQVTDDYSARQYYQQTNQ-IAVTREWREFST 388

Query: 468 ----------KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK-SG 516
                     +   + CC     + + K   ++++    N  GL  + +  S    + +G
Sbjct: 389 PHDDTDLLFGELTGYPCCTSNLHQGWPKFVQNLWYATADN--GLASLLFAPSQVTARVAG 446

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
            + +N K +    ++  +R  ++F+ K+        +LR+P W          NG+ L +
Sbjct: 447 GIEVNLKEETAYPFEETVRYHVSFTDKKVKKVFFPFHLRIPGW--CKQPVVKFNGKPLTV 504

Query: 577 PP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
              PG        W   D L+++LP+ +           Y +   +  GP + A   + +
Sbjct: 505 DAYPGTVTRINREWKEGDILSLELPMEVTVSRW------YENSAVVERGPLVYALKMNEK 558

Query: 636 WDIK 639
           W+ K
Sbjct: 559 WEKK 562


>gi|340619115|ref|YP_004737568.1| hypothetical protein zobellia_3150 [Zobellia galactanivorans]
 gi|339733912|emb|CAZ97289.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
          Length = 694

 Score = 43.5 bits (101), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 8/85 (9%)

Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP 579
           L QK D    WD  +++T+     +    L    LR+P W  + G Q  +NG  +    P
Sbjct: 502 LTQKTD--YPWDGAVKITVDECKAEAFEVL----LRIPSW--AKGTQIKVNGTKVAKAQP 553

Query: 580 GNFLSATERWSYNDKLTIQLPLSLR 604
           G F     +W+  D++TI +P+  +
Sbjct: 554 GTFAKIERQWAEGDEITIDMPMETK 578


>gi|116625572|ref|YP_827728.1| hypothetical protein Acid_6519 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116228734|gb|ABJ87443.1| protein of unknown function DUF1680 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 631

 Score = 43.5 bits (101), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 57/261 (21%), Positives = 99/261 (37%), Gaps = 34/261 (13%)

Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSR 411
           D LYK+      D ++  H   +G  SA E        A    S+  E C     +    
Sbjct: 268 DSLYKMF-----DALDRYHGQPSGIFSADE------HFAGRDPSQGTELCAVVEAMFSLE 316

Query: 412 HLFRWTKEIAYADYYERALTNGVLSI--------QRGTEPGVMIYMLPLGRGVSKARSTH 463
                  + A+ D  E+   N + +         Q   +   +I  +   R  +    ++
Sbjct: 317 QDMAIMGDAAFGDRLEKIAYNALPATLSPDLWAHQYDQQANQVICSISNRRWATNGPESN 376

Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
            +G + N F CC     + + KL  S++     N  G   + Y        SG V + ++
Sbjct: 377 IFGLEPN-FGCCTANMHQGWPKLAASLWMAT--NDGGFAAVAYGPGEV--TSGGVTIEER 431

Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
            D      P+ R  ++   K +  +   L LR+P W  +NGA  ++NGQ      PG F 
Sbjct: 432 TD-----YPF-RENVSLLVKTD--KSFPLVLRIPAW--ANGATVAVNGQQQAGVKPGAFF 481

Query: 584 SATERWSYNDKLTIQLPLSLR 604
                W   D++ +  P+++R
Sbjct: 482 RVQRAWRAGDRVELHFPMAVR 502


>gi|330996651|ref|ZP_08320529.1| hypothetical protein HMPREF9442_01616 [Paraprevotella xylaniphila
           YIT 11841]
 gi|329572723|gb|EGG54356.1| hypothetical protein HMPREF9442_01616 [Paraprevotella xylaniphila
           YIT 11841]
          Length = 800

 Score = 43.5 bits (101), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 81/355 (22%), Positives = 128/355 (36%), Gaps = 73/355 (20%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L +LY +T D K+L  A  F D+    G  + +  Y     +  H P+V     +G  +R
Sbjct: 221 LAKLYIVTGDRKYLDEAKFFLDQ---RGHTSRRDAY-----SQAHKPVVEQDEAVGHAVR 272

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +TGD  Y        D +     Y TGG  A           +  G+
Sbjct: 273 ATYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATAN-------GEAFGA 325

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     + V+  LF    E  Y D  ER L NG++S     + G   
Sbjct: 326 NYELPNMSAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFF 384

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  PL     ++R  H     F    CC          L   +Y  ++ +V   Y+  ++
Sbjct: 385 YPNPL-----ESRGQHQRQPWFGCA-CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFM 435

Query: 508 S--SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT----- 560
           S  ++ +     VVL Q+      WD      +  S K+    + +L +R+P W      
Sbjct: 436 SNEANLEVDKKGVVLEQQTR--YPWD----GDVAVSVKKNKAGVFALKIRIPGWVRGQVV 489

Query: 561 ------YSNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
                 YS+G +      +NGQ +       + +   RW   DK+ +   +  R 
Sbjct: 490 PSDLYRYSDGKRLGYSVKVNGQPVESGLQDGYFTIERRWKKGDKVEVHFDMEPRV 544


>gi|421589478|ref|ZP_16034616.1| hypothetical protein RCCGEPOP_11663, partial [Rhizobium sp. Pop5]
 gi|403705566|gb|EJZ21118.1| hypothetical protein RCCGEPOP_11663, partial [Rhizobium sp. Pop5]
          Length = 299

 Score = 43.5 bits (101), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 56/234 (23%), Positives = 95/234 (40%), Gaps = 26/234 (11%)

Query: 422 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
           YAD  E+AL NG L     T+     Y  PL      A   H W  K++   CC      
Sbjct: 16  YADIMEQALYNGALP-GLSTDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIAR 68

Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTF 540
             + +G  +Y   +  +  +++    ++     +G  V L Q  +    WD      + F
Sbjct: 69  LVTSIGSYMYAVADDEI-AVHLYGESTARLKLANGAEVELEQATN--YPWD----GAVAF 121

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQ 598
           +++       +L+LR+P W  + GA  S+NG  L L       +      W+  D++ + 
Sbjct: 122 TTRLTKPARFALSLRIPDW--AEGATLSVNGAMLDLGAHVRDGYARINREWADGDRVALY 179

Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
           LPL+LR +       + A   A++ GP +    T       T     L+A++ P
Sbjct: 180 LPLALRPQYANPKVRQDAGRVALMRGPLVYCVET-------TDNGADLNAIVLP 226


>gi|160934492|ref|ZP_02081878.1| hypothetical protein CLOLEP_03364 [Clostridium leptum DSM 753]
 gi|156865945|gb|EDO59317.1| hypothetical protein CLOLEP_03364 [Clostridium leptum DSM 753]
          Length = 650

 Score = 43.5 bits (101), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 46/184 (25%), Positives = 76/184 (41%), Gaps = 15/184 (8%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL----GR 454
           E+C +  ++  ++ +   T E  Y D  ERAL N VL      E     Y+ PL      
Sbjct: 334 ESCASVGLMMFAQRMASLTGEAVYYDVVERALCNTVLG-GISKEGKRYFYVNPLEVWPQN 392

Query: 455 GVSKARSTHGWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
            ++     H    +   F   CC      + + LG  IY + E +   LY+ Q+ISSS  
Sbjct: 393 CLASTSMAHVKPVRQKWFGCACCPPNIARTLASLGQYIYAQSEDS---LYVNQFISSSSA 449

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
            + G   +   +D     D  +R+T     ++E     +L LR+ +  Y       +NG+
Sbjct: 450 VEIGGQEIEFSMDSTYMKDGAVRITAKCGKREE-----ALYLRVRIPEYFKKPTLKVNGK 504

Query: 573 NLPL 576
           +  L
Sbjct: 505 DATL 508


>gi|449137673|ref|ZP_21772993.1| protein containing DUF1680 [Rhodopirellula europaea 6C]
 gi|448883726|gb|EMB14239.1| protein containing DUF1680 [Rhodopirellula europaea 6C]
          Length = 688

 Score = 43.1 bits (100), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 71/273 (26%), Positives = 109/273 (39%), Gaps = 47/273 (17%)

Query: 347 YEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK---------RLADTLG-- 394
           Y  TGD  L+  + T + ++V+    Y TGG  A      P          R+    G  
Sbjct: 304 YAETGDKALWSSLETIWRNVVD-KKMYITGGCGALHDGASPDGSKNQREITRVHQAFGRN 362

Query: 395 ------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVM 446
                 + + ETC     +  +  +F  + E  + D  E AL N VLS     GT     
Sbjct: 363 YQLPNATAHNETCANIGNVLWNWRMFLASGEAKHIDTLELALYNSVLSGVDLNGTN---F 419

Query: 447 IYMLPLGRGVSKARSTHGW--GTK-FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
            Y+ PL R    A     W  G K F + +CC      + + +G   Y +    V   ++
Sbjct: 420 FYINPL-RQSDMAPVALRWAGGRKPFVTSFCCPPNLARTIAGVGQYAYGKSNDTV---WV 475

Query: 504 IQYISSSFDWK---SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
             Y S++ D K   SGHV + Q       WD  + +T+     Q +     L LR+P WT
Sbjct: 476 NLYGSNTLDTKLIDSGHVRIEQTTG--YPWDGRIEITIAECQNQPM----CLKLRIPGWT 529

Query: 561 YSNGAQASLNGQNLPLPP---PGNFLSATERWS 590
            +    A++N   +P      PG+++S    WS
Sbjct: 530 TT----ATVNIDGVPTDAKIEPGSYVSLKRVWS 558


>gi|423214410|ref|ZP_17200938.1| hypothetical protein HMPREF1074_02470 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692825|gb|EIY86061.1| hypothetical protein HMPREF1074_02470 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 679

 Score = 43.1 bits (100), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 52/254 (20%), Positives = 98/254 (38%), Gaps = 24/254 (9%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           E CT   M+    ++   T  + +AD  ER   N  L  Q   +     Y   + + ++ 
Sbjct: 320 ELCTAVEMMYSLENMLEITGNMQWADQLERIAYNA-LPTQISDDAQARQYYQQVNQ-IAV 377

Query: 459 ARSTHGWGT----------KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
               H + T              + CC     + + K    +++    N  G+  + Y S
Sbjct: 378 VNDYHNFSTPHEGTDNLFGTLTGYPCCSSNLHQGWPKFVQHLWYATVDN--GVAALVYAS 435

Query: 509 SSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
           S    + + ++++N K +    +D  +  ++T+  K+        +LR+P W        
Sbjct: 436 SEVKMQVANNILVNIKEETYYPFDETVSFSITYPDKKIKKATFPFHLRVPEW--CKKPIV 493

Query: 568 SLNGQNLPLPPPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
           +LNGQ +     G  +    R W  NDK+TI+ P ++      D          +  GP 
Sbjct: 494 NLNGQTIKTDVTGERMIILNREWQQNDKITIEFPATISISHWFDGG------AVVERGPL 547

Query: 627 LLAGHTSGEWDIKT 640
           + A   + +W+ KT
Sbjct: 548 VYALKLNEKWEKKT 561


>gi|375146847|ref|YP_005009288.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361060893|gb|AEV99884.1| protein of unknown function DUF1680 [Niastella koreensis GR20-10]
          Length = 674

 Score = 43.1 bits (100), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 106/490 (21%), Positives = 176/490 (35%), Gaps = 91/490 (18%)

Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-------------TGYLSAFPTELF 216
           L     ++A T +  ++  + T + +++ CQ   G             T    AF   L 
Sbjct: 107 LEGVTSLYAVTKDKNLEVMLDTAIATIAACQRADGYIHTPVLIEERKATNKEKAFADRL- 165

Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEY---FYNRVQKVITMY 273
            +FE        Y   H + AG +  Y +      L +A    +Y   FY R    +   
Sbjct: 166 -NFET-------YNLGHLMTAGCI-HYRVTGKRTLLDVAIKAADYLDNFYKRASPELARN 216

Query: 274 SV-ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLA-HLFDKPCFLGFLALQAD----- 326
           ++   H+  + E           LY  T DPK+L LA +L +     G +    D     
Sbjct: 217 AICPSHYMGVVE-----------LYRTTRDPKYLQLAINLIN---IRGLVEEGTDDNQDR 262

Query: 327 --YLSHFHANTHIP----IVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSA 379
             +     A  H      +  G    Y  TGD  L   + + + D+VN    Y TGG  A
Sbjct: 263 VPFRQQMEAMGHAVRANYLYAGVADVYAETGDDSLMTCLNSIWNDVVN-KKLYVTGGCGA 321

Query: 380 REFWWDP----------KRLADTLG--------SENEETCTTYNMLKVSRHLFRWTKEIA 421
                 P          ++     G        + + ETC     L  +  +   + +  
Sbjct: 322 LYDGVSPYGTSYKPPVIQKTHQAYGRAYQLPNITAHNETCANIGNLLWNWRMLLLSGDAK 381

Query: 422 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH----GWGTKFNSFWCCYG 477
           YAD  E  L NG+LS     +     Y  PL        +      G         CC  
Sbjct: 382 YADVMELELYNGILS-GISLDGNNFFYTNPLSHSADYPYTLRWQEAGRVPYIKLSNCCPP 440

Query: 478 TGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM 536
             + + +++GD  Y    +G    LY    IS+  +  S   +  Q   P   WD +++ 
Sbjct: 441 NTVRTMAEVGDYAYTTSNKGLWVHLYGANKISTKLEDGSALEMTQQSNYP---WDGHIKF 497

Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYND-- 593
           T+T +  +      SL LR+P W   + A  ++NG+ +  P  P  ++     W   D  
Sbjct: 498 TVTKAEAKAF----SLYLRIPGW--CDKAALTVNGKPVTGPNKPATYVELNRAWKAGDVV 551

Query: 594 KLTIQLPLSL 603
           +L + +P++L
Sbjct: 552 ELNLSMPVTL 561


>gi|300854538|ref|YP_003779522.1| hypothetical protein CLJU_c13520 [Clostridium ljungdahlii DSM
           13528]
 gi|300434653|gb|ADK14420.1| conserved hypothetical protein [Clostridium ljungdahlii DSM 13528]
          Length = 658

 Score = 43.1 bits (100), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 105/496 (21%), Positives = 186/496 (37%), Gaps = 80/496 (16%)

Query: 159 SELRGHFVG---------HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLS 209
           S+++GH  G          +L A A       N  +K+    ++  ++E Q     GYLS
Sbjct: 70  SKIKGHHSGFPFQDTDVYKWLEAVAYSLRYHPNDDLKQIADKLIDLIAEAQEY--DGYLS 127

Query: 210 A-FPTELFD-SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
             F  E  +  F+ LK     Y   H I A +   Y +  N +AL +A  M +   N   
Sbjct: 128 TYFQIEAPERKFKRLKQSHELYTMGHYIEAAVA-YYQVTGNEKALNIARKMADCIDNN-- 184

Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-----PCFLGFLA 322
                + +E+      +    +   L RLY +TH+ K+L LA+ F K     P F     
Sbjct: 185 -----FGLEKGKIPGYDGHPEIELALSRLYELTHEKKYLNLAYYFLKQRGQDPKFFDHQI 239

Query: 323 LQADY------------LSHF--------------HANTHIPIVIGSQMRYEVTGDPLYK 356
            Q  +            LS++              HA   + +  G      +TGD    
Sbjct: 240 EQDGFDHDLIEGMRNFPLSYYQAAEPIVDQETAEGHAVRVVYLCTGIAYVARLTGDQDLL 299

Query: 357 LIGTFFMDIVNASHSYATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
            +   F + +     Y TG     T+   F +D     DT+  E   TC +  M   ++ 
Sbjct: 300 TVCKRFWNNIVKKRMYVTGNIGSTTTGESFTYDYDLPNDTMYGE---TCASVGMTFFAKQ 356

Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR----STHGWGTK 468
           + +   E  Y D  E+ L NG LS     +     Y+ PL    + ++     +H    +
Sbjct: 357 MLQIEPEGEYGDILEKELFNGSLS-GISLDGKHFFYVNPLEADPTASKGNPGKSHILTRR 415

Query: 469 FNSFWC-CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI 527
            + F C C  + +       D   +   G+   +   Q+IS+  ++ +   ++     P 
Sbjct: 416 ADWFGCACCPSNVARLIASVDQYIYTVHGST--ILSHQFISNEANFDNNISIIQSNNFP- 472

Query: 528 VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATE 587
             WD      +++  K          +R+P W+  N  +  +N +++ LP    F+    
Sbjct: 473 --WDG----NISYKIKNPGENKFKFGIRIPSWSQCN-YKLQVNKKDVNLPVKSGFV---- 521

Query: 588 RWSYNDKLTIQLPLSL 603
            + + +   +Q+ LSL
Sbjct: 522 -YIFVESSQMQIDLSL 536


>gi|297545103|ref|YP_003677405.1| hypothetical protein Tmath_1689 [Thermoanaerobacter mathranii
           subsp. mathranii str. A3]
 gi|296842878|gb|ADH61394.1| protein of unknown function DUF1680 [Thermoanaerobacter mathranii
           subsp. mathranii str. A3]
          Length = 648

 Score = 43.1 bits (100), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 82/376 (21%), Positives = 141/376 (37%), Gaps = 58/376 (15%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLAL------QADYLSHFHANTHIPI-- 339
           L +LY +T + K+L L+  F     +KP +    A          + S+F    H+P+  
Sbjct: 199 LVKLYRVTGEEKYLRLSKYFIDERGEKPLYFEIEAKARGDEWDEQWASYFQ--VHLPVRE 256

Query: 340 ---VIGSQMRYEV-----------TGDPLYKLIGTFFMDIVNASHSYATGGTSA----RE 381
                G  +R              TGD           D +     Y TGG  +      
Sbjct: 257 QTSAEGHAVRAAYLYSGMVDVAVETGDESLIQACKKLWDNITTKRMYITGGIGSSSFGEA 316

Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
           F +D     DT+ +E   TC    ++  +  + +   +  YAD  ERAL N V+S     
Sbjct: 317 FTFDFDLPNDTVYAE---TCAAIGLVFFAHRMLQIDPDRRYADVMERALYNSVIS-GMSL 372

Query: 442 EPGVMIYMLPL-------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
           +     Y+ PL        +   KA   +     F    CC        + LG  IY   
Sbjct: 373 DGKKYFYVNPLEVWPEACEKNKVKAHVKYTRQPWFKCA-CCPPNLARLLASLGKYIYSIR 431

Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
           +     LY+  Y+ S    K     +  + +    WD   R+ +    ++E+    +L L
Sbjct: 432 DNE---LYVHLYVDSEVQTKISENEVKVRQETEYPWDG--RIVINILPERELD--FTLAL 484

Query: 555 RMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLS-LRTEAIQDD 611
           R+P W     A+ S+NG+ + +       +      W   D++ + L ++ +R +A  + 
Sbjct: 485 RIPGWC--KDAKVSVNGEEIDISGIMDKGYAKIKRLWKPGDRIELLLSMTVMRVKANPNV 542

Query: 612 RPEYASIQAILFGPYL 627
           R +   + AI  GP +
Sbjct: 543 REDEGRV-AIQRGPVI 557


>gi|424879315|ref|ZP_18302950.1| hypothetical protein Rleg8DRAFT_5297 [Rhizobium leguminosarum bv.
           trifolii WU95]
 gi|392519986|gb|EIW44717.1| hypothetical protein Rleg8DRAFT_5297 [Rhizobium leguminosarum bv.
           trifolii WU95]
          Length = 647

 Score = 43.1 bits (100), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 104/482 (21%), Positives = 178/482 (36%), Gaps = 95/482 (19%)

Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-----PTELFDSFEA 221
           G ++ A++    +  NA ++ K+  +V  L + Q  +  GYL+++     P   + +   
Sbjct: 90  GKWIEAASYTLKAHPNAALETKIDAIVEKLEKGQ--MADGYLNSWFIRREPDRRWTNLRD 147

Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER---H 278
           L  +    Y++  +L G +  Y      + L +    V++       +I  +  E     
Sbjct: 148 LHEM----YSMGHLLEGAVAYYEATGKRRFLDVMIRAVDH-------IIETFGAEPGKLR 196

Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL 328
            Y  +EE   +   L +LY +T DP+HL LA  F       P +    A +      DY+
Sbjct: 197 GYDAHEE---IELALVKLYRVTGDPRHLKLATYFVDERGRMPSYYDEEARKRGESPEDYV 253

Query: 329 --SHFHANTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASH 370
             ++ ++  H+P+     V+G  +R                DP  K       D + +  
Sbjct: 254 YKTYAYSQAHLPVRDQHQVVGHAVRAMYLFSAMADLSRENDDPTLKEACDRLFDNLVSRQ 313

Query: 371 SYATGGTS--------AREFWWDPKRLADTLGSEN--EETCTTYNMLKVSRHLFRWTKEI 420
            Y TGG           REF          L +E    ETC    +   S  + +   + 
Sbjct: 314 LYVTGGLGPSASNEGFTREF---------DLPNETAYAETCAAVALGFWSHRMAQVDLDS 364

Query: 421 AYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGT 478
            + D  E  L NG LS I R  E      +L           +HG   ++   +C C  T
Sbjct: 365 KFTDRLETVLYNGALSGISRDGERYFYENVL----------ESHGQHRRWKWHYCPCCPT 414

Query: 479 GIESF-SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
            I  F + LG   YF   G+   L +  Y ++S +   G   +    +    WD  + + 
Sbjct: 415 NIARFITSLGQ--YFYSTGD-HQLAVHLYGTNSAELTVGDSFVRLIQETQYPWDGDISLR 471

Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKL 595
                         L LR+P W     AQ S+NG  + L       + + +  W   D++
Sbjct: 472 FAVERPSRF----QLRLRIPGWC--RQAQISVNGVAVDLDQCVTKGYAAISREWRNGDEV 525

Query: 596 TI 597
            I
Sbjct: 526 RI 527


>gi|317492212|ref|ZP_07950641.1| hypothetical protein HMPREF0864_01405 [Enterobacteriaceae bacterium
           9_2_54FAA]
 gi|316919551|gb|EFV40881.1| hypothetical protein HMPREF0864_01405 [Enterobacteriaceae bacterium
           9_2_54FAA]
          Length = 661

 Score = 42.7 bits (99), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 59/251 (23%), Positives = 93/251 (37%), Gaps = 42/251 (16%)

Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
           Y TGG    +S   F  D     DT+ +E   +C +  ++  +  + +   +  YAD  E
Sbjct: 320 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFANRMLQMEGDSQYADVME 376

Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
           RAL N VL      +     Y+ PL          H     FN  +              
Sbjct: 377 RALYNTVLG-GMALDGRHFFYVNPL--------EVHPKSIPFNHIYDHVKPIRQRWFGCA 427

Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           CC        + +G  IY +       LYI  Y+ +     +G   L   +     WD  
Sbjct: 428 CCPPNIARILTSIGHYIYTQRSD---ALYINLYVGNETLLDNG---LKIAISGNYPWDE- 480

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
             +++   +++ + Q  +L LRMP W      Q  LNG+         +L     W   D
Sbjct: 481 -NVSVHIRTEKPLHQ--TLALRMPEWCEKPRVQ--LNGETCEDLLQRGYLHIAREWQDGD 535

Query: 594 KLTIQLPLSLR 604
           +L I LP+ +R
Sbjct: 536 RLEIVLPMPVR 546


>gi|448360425|ref|ZP_21549056.1| hypothetical protein C481_00200 [Natrialba asiatica DSM 12278]
 gi|445653038|gb|ELZ05910.1| hypothetical protein C481_00200 [Natrialba asiatica DSM 12278]
          Length = 674

 Score = 42.7 bits (99), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 45/178 (25%), Positives = 69/178 (38%), Gaps = 14/178 (7%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC     +  +R LF +T    YAD  ER L N VL + R  +     Y   L    + 
Sbjct: 348 ETCAAIGSVFWNRRLFEFTGRARYADLIERTLYNAVL-VGRSRDGTEFFYDNRLASDGNH 406

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE-EEGNVPGLYIIQYISSSFDWKSGH 517
            R       ++    CC        + LG  +Y    E +   LY+ QYI SS     G 
Sbjct: 407 HRQ------EWFECACCPPNIARVLAALGRYLYATGGESDERCLYVNQYIGSSATATIGD 460

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
            V+         W+  + + +  ++  E     +L LR+P W         +NG+ +P
Sbjct: 461 TVVELDQTSGFPWNGEVTLDVEPATPTEF----ALRLRVPSWC--EDVSIRVNGEAVP 512


>gi|408372126|ref|ZP_11169874.1| hypothetical protein I215_14481 [Galbibacter sp. ck-I2-15]
 gi|407742435|gb|EKF54034.1| hypothetical protein I215_14481 [Galbibacter sp. ck-I2-15]
          Length = 664

 Score = 42.7 bits (99), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 83/381 (21%), Positives = 138/381 (36%), Gaps = 67/381 (17%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
           L +LY IT +  +L LA  F D+           DY     A  H+P+     V+G  +R
Sbjct: 241 LVKLYRITKNEDYLELARFFLDQRGHHDNRPSLGDY-----AQDHLPVTEQKEVVGHAVR 295

Query: 347 ----YEVTGDPLYKLIGTFFMDIVNA-------SHSYATGGTSAREFWWDPKRLADTLGS 395
               Y    D       T +++ VN           Y TGG  A           +  G+
Sbjct: 296 AVYMYAGMTDIAAIDKDTAYLNAVNNLWDNMVNKKMYITGGIGA-------IHDGEAFGA 348

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGV 445
             E        ETC     +  +  L   T ++ Y D  ER+L NG+LS     GTE   
Sbjct: 349 NYELPNLTAYSETCAAIGDVYWNHRLHNLTGDVKYMDVLERSLYNGLLSGISLSGTE--- 405

Query: 446 MIYMLPL-GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV-PGLYI 503
             Y   L   G  K         ++    CC    I     L + +Y +++  +   LY+
Sbjct: 406 FFYPNALESDGTYKFNRGSCTRQEWFDCSCCPTNMIRFLPSLPELVYSKKDDTIFVNLYV 465

Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW---- 559
                +  D  S  +V++Q+ +    WD  +  T+T   +       +L LR+P W    
Sbjct: 466 AN--QAQIDLPSTSLVIDQQTN--YPWDGLVNFTVTPEKEANF----TLKLRIPGWLRNE 517

Query: 560 -----------TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
                        ++  +  +N Q +       +++    W   + L++ LP+  R    
Sbjct: 518 VLPGTLYQYKDDMTSEFELKINDQLVDATLKDGYITINRDWKKGETLSLNLPMQPREVIT 577

Query: 609 QDDRPEYASIQAILFGPYLLA 629
            D   +     A+ +GP + A
Sbjct: 578 NDKVEDNLGKLALEYGPIVYA 598


>gi|344201929|ref|YP_004787072.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
 gi|343953851|gb|AEM69650.1| protein of unknown function DUF1680 [Muricauda ruestringensis DSM
           13258]
          Length = 656

 Score = 42.7 bits (99), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 85/391 (21%), Positives = 139/391 (35%), Gaps = 87/391 (22%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
           L +LY  T   ++  LA  F D         L  DY     +  H+P+     V+G  +R
Sbjct: 228 LIKLYQTTGKKEYFDLAKYFLDHRGKSEHHQLFGDY-----SQDHVPVTEQDEVVGHAVR 282

Query: 347 Y-----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
                        +  D  Y K +   + ++VN    Y TGG  A       K   +  G
Sbjct: 283 AVYMYAGMTDIAAIEKDTAYLKAVNALWDNMVN-KKMYITGGIGA-------KHEGEAFG 334

Query: 395 SENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
              E        ETC     +  +  L   T ++ Y D  ER L NG++S   G      
Sbjct: 335 ENYELPNLTAYNETCAAIGDVYWNHRLHNLTGDVKYFDVIERTLYNGLIS---GLSLDGQ 391

Query: 447 IYMLPLG---RGVSKARSTHGWGTKFNSFWC-CYGTGIESF---------SKLGDSIYFE 493
            +  P      GV K     G  T+ + F C C  T +  F         SK  D+IY  
Sbjct: 392 KFFYPNALESDGVYKF--NQGACTRKDWFDCSCCPTNVIRFLPAMPGLIYSKTDDTIYV- 448

Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
                  LY      ++ + K   V L+Q+      WD  +++ +  + K +     ++ 
Sbjct: 449 ------NLYAAN--GATVNLKDRAVKLSQETK--YPWDGKVKLMVDPTEKGKF----TIK 494

Query: 554 LRMPVW---------------TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
            R+P W                 +   + SLNG+ L L     + +  + W   D + ++
Sbjct: 495 FRVPGWARNKVLPGNLYQYATVINKKNKISLNGEELDLQAGDGYFTIAKEWEKGDVVELE 554

Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
            P+ +R         E     ++ +GP + A
Sbjct: 555 FPMEVRKVEANQLVEENKDKMSLEYGPMVYA 585


>gi|399031138|ref|ZP_10731277.1| hypothetical protein PMI10_03155 [Flavobacterium sp. CF136]
 gi|398070607|gb|EJL61899.1| hypothetical protein PMI10_03155 [Flavobacterium sp. CF136]
          Length = 673

 Score = 42.7 bits (99), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 58/239 (24%), Positives = 100/239 (41%), Gaps = 29/239 (12%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTE-----PGVMIYMLP 451
           ETC     +  +  + + T +  YAD  E AL N VLS     G E     P  +   LP
Sbjct: 358 ETCANIGNVLWNWRMLQITGDAKYADIVELALYNSVLSGISLEGKEFFYNNPLNVSKDLP 417

Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
             +  SK R   G+    N   CC      + +++ +  Y F +E    GLY+  Y S++
Sbjct: 418 FKQRWSKER--EGYIALSN---CCAPNVTRTIAEVSNYAYNFSKE----GLYVNLYGSNN 468

Query: 511 FDWKS---GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            + K+     + + Q+ +    WD  + + +    K+    L    LR+P W  S G   
Sbjct: 469 LNSKTLAGEKIEIEQQTN--YPWDGKITLKIVKVPKEAYAFL----LRIPGW--SQGTTI 520

Query: 568 SLNGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
           S+NG+N+      G++    ++W   D + + +P+ +          E  +  A+  GP
Sbjct: 521 SVNGKNINDAIVSGSYQKIAQKWKKGDVIELNIPMPVELMQANPLVEEVKNQVAVKRGP 579


>gi|374385208|ref|ZP_09642716.1| hypothetical protein HMPREF9449_01102 [Odoribacter laneus YIT
           12061]
 gi|373226413|gb|EHP48739.1| hypothetical protein HMPREF9449_01102 [Odoribacter laneus YIT
           12061]
          Length = 614

 Score = 42.7 bits (99), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 72/349 (20%), Positives = 123/349 (35%), Gaps = 35/349 (10%)

Query: 293 LYRLYSITHDPKHLLLAH-LFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY---- 347
           L +LY  T +  +L LA  L D+           DY         +  + G  +R     
Sbjct: 213 LVKLYRTTQNSAYLKLAQWLLDQRGHHKGDWKAKDYYQDLKPVRELSKISGHAVRAMYMF 272

Query: 348 -------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE- 399
                   +T D  Y++      + V     Y TGG  +       +  ++     NEE 
Sbjct: 273 TGMADVAAITQDSGYRIALDRLWEDVVEKKMYLTGGIGSSRH---NEGFSEDYDLPNEEA 329

Query: 400 ---TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
              TC +  M+  ++ +     E  Y D  ERA+ NG L+           Y+ PL    
Sbjct: 330 YCETCASVGMVFWNQRMNMLKGESRYEDVLERAMYNGALA-GISLSGDRFFYVNPLASSG 388

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
              R    +GT      CC          +G+ IY   E  V   ++  YI S  + ++ 
Sbjct: 389 KHHRKAW-YGTA-----CCPSQISRFLPSVGNYIYALSENTV---WVNLYIGSETEVETS 439

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
            V +  K + +  WD      +TF       +   + LR+P W         +NGQ    
Sbjct: 440 GVTVALKQETLYPWDG----NVTFYVNPRESKDFKMKLRIPAWC--EKYVVKVNGQIEEG 493

Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
                ++     W+  D + + + ++++  A        A  +A+  GP
Sbjct: 494 KKEKGYVVIDRLWAAGDVMELNMNMTVKVVAADPRVKANAGKRALQRGP 542


>gi|418468281|ref|ZP_13039095.1| hypothetical protein SMCF_2011 [Streptomyces coelicoflavus ZG0656]
 gi|371551122|gb|EHN78456.1| hypothetical protein SMCF_2011 [Streptomyces coelicoflavus ZG0656]
          Length = 796

 Score = 42.7 bits (99), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 52/222 (23%), Positives = 87/222 (39%), Gaps = 36/222 (16%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLGRGVS 457
           ETC     +     L R T +  +AD  E    N   ++    +P G  ++ +     V 
Sbjct: 305 ETCGVVEFMAAHELLVRITGDPVWADRCEDLAFN---ALPAALDPEGRAVHYVTSANSVD 361

Query: 458 --KARSTHG----------WGTKFNSFWCC---YGTGIESFSK---LGDSIYFEEEGNVP 499
              AR T G          +    +++ CC   YG G   F++   LG      + G   
Sbjct: 362 LDNARKTQGQFQNGFAMQAYQPGVDNYRCCPHNYGMGWPYFTEELWLGTP----DRGLAA 417

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
            +Y    ++++       V + +  D P         +TLT S  + V     L+LR+P 
Sbjct: 418 AMYAPSRVTAAVGADGTRVTVTEDTDYPFDD-----TITLTVSGPRRVA--FPLSLRIPG 470

Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
           W    G Q  +NG+ +P      F+     WS  D++T++LP
Sbjct: 471 W--CEGPQVRVNGRPVPAADGPAFVRVERTWSDGDRVTLRLP 510


>gi|281425429|ref|ZP_06256342.1| conserved hypothetical protein [Prevotella oris F0302]
 gi|281400422|gb|EFB31253.1| conserved hypothetical protein [Prevotella oris F0302]
          Length = 673

 Score = 42.7 bits (99), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 73/324 (22%), Positives = 121/324 (37%), Gaps = 54/324 (16%)

Query: 349 VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--------E 399
           +TGD  Y K I   + +IV+  + Y TGG  AR +        +  G++ E        E
Sbjct: 290 LTGDSAYIKAIDHIWNNIVSKKY-YLTGGVGARHY-------GEAFGADYELPNLTAYNE 341

Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-GRGVSK 458
           TC       ++  LF    +  Y D  ER L NGV+S     + G   Y  PL   G+ K
Sbjct: 342 TCAAIAQCYLNMRLFMLHGDSKYIDCLERTLYNGVIS-GMSIDGGRFFYPNPLSADGIYK 400

Query: 459 ARSTHGWGTKFNSFW---CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
             +     T     W    C  + +  F        +   GN   +Y+  ++ S  + K 
Sbjct: 401 FNADR---TTTRQLWFGCACCPSNLSRFIPSVPGYVYAVRGN--DVYVNLFMGSKANVKV 455

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT---------YS---- 562
           G   +  + +    WD  + + +    K    + +SL +R+P W          YS    
Sbjct: 456 GGKEMKIETETNYPWDGKVAIRV----KGNANKHASLLIRIPGWARGEVTPGGLYSFTDK 511

Query: 563 --NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAIQDDRPEYA 616
             +G   ++NG+N                   D +T+ L +  RT    + + DDR    
Sbjct: 512 QKDGWSIAVNGKNRNAGKLEKGYIRINNVKKGDVITLNLDMEPRTVVADKRVMDDR---- 567

Query: 617 SIQAILFGPYLLAGHTSGEWDIKT 640
              A+  GP +    ++    +KT
Sbjct: 568 GCVAVERGPLVYCAESADNNGMKT 591


>gi|336402464|ref|ZP_08583200.1| hypothetical protein HMPREF0127_00513 [Bacteroides sp. 1_1_30]
 gi|335948631|gb|EGN10334.1| hypothetical protein HMPREF0127_00513 [Bacteroides sp. 1_1_30]
          Length = 698

 Score = 42.7 bits (99), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 84/347 (24%), Positives = 140/347 (40%), Gaps = 47/347 (13%)

Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
           +  +Y  T +P++L L+ +L D    +          +  +  Y +  HA     +  G 
Sbjct: 248 VVEMYRATGNPRYLELSKNLIDIRGMVENGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307

Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
              Y  TG+  L K + + + DIV     Y TG       GTS     ++P   +++  +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366

Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
            G        + + ETC     +  +  +   T +  YAD  E  L N VLS     +  
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 425

Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
              Y  PL R  +    T  W    T++ S +CC    + +  +  +  Y    EG    
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484

Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           LY    ++++  WK  G + L Q+ D    W+  +R+TL    ++  G   SL LR+P W
Sbjct: 485 LYGANTLTTT--WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW 538

Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
                A  ++NGQ L      N  +   R W   D  +L + +P+ L
Sbjct: 539 --CEKATLTVNGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583


>gi|423286830|ref|ZP_17265681.1| hypothetical protein HMPREF1069_00724 [Bacteroides ovatus
           CL02T12C04]
 gi|392674368|gb|EIY67816.1| hypothetical protein HMPREF1069_00724 [Bacteroides ovatus
           CL02T12C04]
          Length = 698

 Score = 42.7 bits (99), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 18/217 (8%)

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
           + + ETC     +  +  +   T +  YAD  E  L N VLS     +     Y  PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434

Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
             +    T  W    T++ S +CC    + +  +  +  Y    EG    LY    ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494

Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             WK  G + L Q+ D    W+  +R+TL    ++  G   SL LR+P W     A  ++
Sbjct: 495 --WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGTF-SLFLRIPEW--CEKATLTV 546

Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
           NGQ L      N  +   R W   D  +L + +P+ L
Sbjct: 547 NGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583


>gi|410100001|ref|ZP_11294966.1| hypothetical protein HMPREF1076_04144 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409216556|gb|EKN09540.1| hypothetical protein HMPREF1076_04144 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 618

 Score = 42.4 bits (98), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 65/288 (22%), Positives = 117/288 (40%), Gaps = 31/288 (10%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-GRGVS 457
           ETC +  M+  +  + + T +  Y D  ER++ NGVL+           Y+ PL  +G  
Sbjct: 336 ETCASVGMVFWNHRMNQITGDAKYIDILERSMYNGVLA-GISLSGDRFFYVNPLESKGDH 394

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI--SSSFDWKS 515
             +  +G         CC          +G+ IY   +     L++  YI  ++ F    
Sbjct: 395 HRQEWYGCA-------CCPSQLSRFLPTIGNYIYAISD---DALWVNLYIGNTTRFTLND 444

Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
            +V+L Q+ +    WD  ++  LT SS +++ +   + LR+P W        ++NG+ + 
Sbjct: 445 DNVILRQETN--YPWDGSVK--LTVSSTKDLDK--EIRLRIPGW--CKNYTITINGKEVG 496

Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
           L     + +    W   D +++ + + +  E+      E    +AI  GP +   + + E
Sbjct: 497 LSQEKGY-AIVYDWKPGDMISLDMDMPVEVESADPLVTENIGKRAIQRGPLV---YCAEE 552

Query: 636 WDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSIT 683
            D      R      +    SF A L+     +G  T    N  QSIT
Sbjct: 553 TDNSAYFDRLTLTSDTEYHTSFEAGLL-----NGVKTINAKNEQQSIT 595


>gi|293371493|ref|ZP_06617913.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
 gi|292633530|gb|EFF52093.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
          Length = 698

 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 18/217 (8%)

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
           + + ETC     +  +  +   T +  YAD  E  L N VLS     +     Y  PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434

Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
             +    T  W    T++ S +CC    + +  +  +  Y    EG    LY    ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494

Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             WK  G + L Q+ D    W+  +R+TL    ++  G   SL LR+P W     A  ++
Sbjct: 495 --WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW--CEKATLTV 546

Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
           NGQ L      N  +   R W   D  +L + +P+ L
Sbjct: 547 NGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583


>gi|295084107|emb|CBK65630.1| Uncharacterized protein conserved in bacteria [Bacteroides
           xylanisolvens XB1A]
          Length = 698

 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 18/217 (8%)

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
           + + ETC     +  +  +   T +  YAD  E  L N VLS     +     Y  PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434

Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
             +    T  W    T++ S +CC    + +  +  +  Y    EG    LY    ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494

Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             WK  G + L Q+ D    W+  +R+TL    ++  G   SL LR+P W     A  ++
Sbjct: 495 --WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW--CEKATLTV 546

Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
           NGQ L      N  +   R W   D  +L + +P+ L
Sbjct: 547 NGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583


>gi|326799752|ref|YP_004317571.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326550516|gb|ADZ78901.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
          Length = 679

 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 111/489 (22%), Positives = 186/489 (38%), Gaps = 97/489 (19%)

Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG------------TGYLSAFPTELFD 217
           L A A ++A T +  + +KM  V+ +++  Q + G            TG  + F   L  
Sbjct: 110 LEAVASLYAVTKDPALDKKMDEVIKTIALSQREDGYIYTLSMIQQRKTGVKNQFEDRL-- 167

Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
           SFEA        Y I  ++      Y        L +A    +Y Y R  K  +  ++ R
Sbjct: 168 SFEA--------YNIGHLMTAACVHYRATGKRNLLDVAIKATDYLY-RFYKSASP-TLAR 217

Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTH 336
           +    +   G     +  +Y    D ++L LA HL D     G +    D          
Sbjct: 218 NAICPSHYMG-----VVEMYRTLGDKRYLELAKHLID---IKGQIEDGTD-----DNQDR 264

Query: 337 IPI-----VIGSQMR-----------YEVTGD-PLYKLIGTFFMDIVNASHSYATGG--- 376
           IP      V+G  +R           Y  TGD  L+  +   + D V +   Y TGG   
Sbjct: 265 IPFREQQKVMGHAVRANYLYAGVADVYAETGDTSLFNQLHKMWTD-VTSHKMYITGGCGS 323

Query: 377 ----TSAREFWWDPK---RLADTLGSE--------NEETCTTYNMLKVSRHLFRWTKEIA 421
                S     +DPK   ++    G +        + ETC     +  +  +   T    
Sbjct: 324 LYDGVSPDGTSYDPKEVQKIHQAYGRDYQLPNFTAHNETCANIGNMLWNWRMLLLTGNAK 383

Query: 422 YADYYERALTNGVLS-----IQR--GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC 474
           +AD  E AL N VLS      +R   T P      LP  +  SK R  +   +      C
Sbjct: 384 FADVLELALYNSVLSGISLDGERFLYTNPLAYSDKLPFKQRWSKDRVPYIALSN-----C 438

Query: 475 CYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
           C    + + +++ +  Y   +EG    LY    + +S     G V L Q+      WD  
Sbjct: 439 CPPNVVRTLAEVHNYFYSISDEGIWINLYGGSELKTSLP-NGGTVKLKQET--AYPWDGA 495

Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERWSYN 592
           +++ +  + K +     SL LR+P W  ++ A   +NGQ++  +  PG++     +W   
Sbjct: 496 IKVVVEEAVKDDF----SLFLRIPGW--ADQAMIQVNGQDVDKVLKPGSYTMIRRKWKKG 549

Query: 593 DKLTIQLPL 601
           D + +++P+
Sbjct: 550 DVVFLKMPM 558


>gi|160882339|ref|ZP_02063342.1| hypothetical protein BACOVA_00287 [Bacteroides ovatus ATCC 8483]
 gi|156112253|gb|EDO13998.1| hypothetical protein BACOVA_00287 [Bacteroides ovatus ATCC 8483]
          Length = 698

 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 18/217 (8%)

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
           + + ETC     +  +  +   T +  YAD  E  L N VLS     +     Y  PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434

Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
             +    T  W    T++ S +CC    + +  +  +  Y    EG    LY    ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494

Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             WK  G + L Q+ D    W+  +R+TL    ++  G   SL LR+P W     A  ++
Sbjct: 495 --WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW--CEKATLAV 546

Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
           NGQ L      N  +   R W   D  +L + +P+ L
Sbjct: 547 NGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583


>gi|395803606|ref|ZP_10482850.1| hypothetical protein FF52_17068 [Flavobacterium sp. F52]
 gi|395434160|gb|EJG00110.1| hypothetical protein FF52_17068 [Flavobacterium sp. F52]
          Length = 682

 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 56/236 (23%), Positives = 97/236 (41%), Gaps = 23/236 (9%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC     +  +  + + T +  YAD  E AL N VLS     E    +Y  PL   VS 
Sbjct: 367 ETCANIGNVLWNWRMLQITGDAKYADIVELALYNSVLS-GMNLEGDKFLYNNPL--NVSN 423

Query: 459 ARSTHG-WGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
               H  WG     +     CC      + +++G+  Y   +    GLY+  Y S++ + 
Sbjct: 424 DLPFHQRWGNVREGYIALSNCCAPNVTRTVAEVGNYAYNLSKD---GLYVNLYGSNTLNT 480

Query: 514 KS--GHVV-LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
           K+  G  + + Q+ +    WD  + + +  + K     L +  LR+P W  S  A+ S+N
Sbjct: 481 KTLNGETLEIEQQTN--YPWDGKVTLKILKAPK----DLQNFFLRIPGW--SQNAEVSVN 532

Query: 571 GQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
              +      G +L   ++W   D + + +P+ +          E  +  A+  GP
Sbjct: 533 NSKISDKIVSGTYLKLNQKWKKGDVIELNMPMPVELMEANPLVEEVKNQVAVKRGP 588


>gi|253574873|ref|ZP_04852213.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
 gi|251845919|gb|EES73927.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
          Length = 665

 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 79/361 (21%), Positives = 133/361 (36%), Gaps = 68/361 (18%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL----------GFLALQADYLSHFHANTHI 337
           L +LY +T   ++L L+  F      KP F              A  AD++   +   H+
Sbjct: 208 LVKLYEVTGQERYLRLSQYFLEQRGQKPSFFEEELKRRGGQTHWAGHADHVDLTYHQAHL 267

Query: 338 PI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSA-- 379
           P+      +G  +R             +TGD           D +     Y TGG  +  
Sbjct: 268 PVREQETAVGHAVRLLYMLTGMADVAALTGDESMLAACRKLWDNIVGKQMYITGGVGSMP 327

Query: 380 --REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
               F +D     DT+ SE   TC +  ++  ++ + R + +  YA+  ERAL N V+  
Sbjct: 328 QGEAFSFDYDLPNDTVYSE---TCASIGLIFFAQRMLRISPDSRYANVMERALYNTVVG- 383

Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF------W----CCYGTGIESFSKLG 487
               +     Y+ PL       ++  G   KF+        W    CC        + LG
Sbjct: 384 GMARDGKHFFYVNPL---EVDPKACGGANHKFDHIKTVRQEWFGCACCPPNIARLLASLG 440

Query: 488 DSIYFEEEGNVPGLYIIQYISSSFDWKS--GHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
           + IY  +   V   Y   YI    + ++  G V L Q  +    W   +R    F  + E
Sbjct: 441 EYIYTVQGDTV---YAHLYIGGEAELQTSGGKVKLTQTTN--YPWGGNVR----FEVQPE 491

Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP---GNFLSATERWSYNDKLTIQLPLS 602
                +L LR+P W     A   +NG+ + L        ++    +W   D + ++L + 
Sbjct: 492 GEGRFTLALRLPDWCPE--ASLQVNGEVVELEGALLQDGYIRLARQWCAGDVVELKLAMP 549

Query: 603 L 603
           +
Sbjct: 550 V 550


>gi|401761699|ref|YP_006576706.1| hypothetical protein ECENHK_00925 [Enterobacter cloacae subsp.
           cloacae ENHKU01]
 gi|400173233|gb|AFP68082.1| hypothetical protein ECENHK_00925 [Enterobacter cloacae subsp.
           cloacae ENHKU01]
          Length = 649

 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 77/364 (21%), Positives = 125/364 (34%), Gaps = 73/364 (20%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF-------------HAN 334
           L RLY +T +P++L L   F      +P F      +    S++             ++ 
Sbjct: 193 LMRLYDVTQEPRYLNLVKYFIEERGTQPHFYDIEYEKRGKTSYWNTYGPAWMVKDKAYSQ 252

Query: 335 THIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG-- 376
            H P+      IG  +R+            ++GD   +       + +     Y TGG  
Sbjct: 253 AHQPLAEQQTAIGHAVRFVYLMAGMAHLARLSGDEGKRQDCLRLWNNMAQRQLYITGGIG 312

Query: 377 --TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
             +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N V
Sbjct: 313 SQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSHYADVMERALYNTV 369

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGI 480
           L      +     Y+ PL          H     FN  +              CC     
Sbjct: 370 LG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIYDHVKPVRQRWFGCACCPPNIA 420

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
              + LG  IY         L I  Y+ +    +     L  ++     W   + + +T 
Sbjct: 421 RVLTSLGHYIYTVRPD---ALLINLYVGNDVAIQIDENTLRLRISGNYPWQDQVTIEIT- 476

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
                V    +L LR+P W        SLNG+ +       +L     W   D LT+ LP
Sbjct: 477 ---SPVPVTHTLALRLPDWCAEPA--VSLNGERVTGEVVRGYLYLNRCWHEGDTLTLTLP 531

Query: 601 LSLR 604
           + +R
Sbjct: 532 MPVR 535


>gi|294643636|ref|ZP_06721438.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|294808056|ref|ZP_06766829.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
 gi|292641013|gb|EFF59229.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
 gi|294444697|gb|EFG13391.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 698

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 18/217 (8%)

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
           + + ETC     +  +  +   T +  YAD  E  L N VLS     +     Y  PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434

Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
             +    T  W    T++ S +CC    + +  +  +  Y    EG    LY    ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494

Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             WK  G + L Q+ D    W+  +R+TL    ++  G   SL LR+P W     A  ++
Sbjct: 495 --WKDKGKLALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW--CEKATLTV 546

Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
           NGQ L      N  +   R W   D  +L + +P+ L
Sbjct: 547 NGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583


>gi|333381631|ref|ZP_08473310.1| hypothetical protein HMPREF9455_01476 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332829560|gb|EGK02206.1| hypothetical protein HMPREF9455_01476 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 811

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 74/364 (20%), Positives = 138/364 (37%), Gaps = 65/364 (17%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
           L ++Y +T   ++L LA  F        L L+    S  ++ TH P++     +G  +R 
Sbjct: 232 LAKMYRVTGKKEYLDLAKYF--------LDLKGHGHSGEYSQTHKPVIEQDEAVGHAVRA 283

Query: 348 E-----------VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
                       +TG+  Y        D V     Y TGG  A           +  G  
Sbjct: 284 AYMYSGMADVAALTGNEAYLHAIDKIWDNVVTKKLYITGGIGATGH-------GEAFGKN 336

Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
            E        ETC     +  +  LF    +  Y D  ER L NG++S     +     Y
Sbjct: 337 YELPNMSAYCETCAAIANVYWNHRLFLLHGDSKYYDVLERTLYNGLIS-GINLDGNRFFY 395

Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
             PL     ++   HG    F    CC          +   +Y +++  +   Y+  ++ 
Sbjct: 396 PNPL-----ESVGQHGRSEWFGCA-CCPSNVCRFMPSIPGYVYAKKDDKI---YVSLFVE 446

Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-----SLNLRMP--VWTY 561
           S  + + G   +N        WD  + + +  +  ++   L      +LN  +P  ++TY
Sbjct: 447 SEGEIELGKNKINLSQKTGYPWDGNVTINVDPAKSEKFDVLVRIPGWALNKPVPSDLYTY 506

Query: 562 SNGAQASL----NGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLR----TEAIQDDR 612
            N  + ++    NG+++      N +++ +++W   DK+ +  P+ +      E ++DDR
Sbjct: 507 LNPKKETVKIKVNGKDVDYTIGSNGYVTLSQKWKKGDKIDVSFPMDVHKDVANEKVEDDR 566

Query: 613 PEYA 616
            + A
Sbjct: 567 GKVA 570


>gi|423303854|ref|ZP_17281853.1| hypothetical protein HMPREF1072_00793 [Bacteroides uniformis
           CL03T00C23]
 gi|423307425|ref|ZP_17285415.1| hypothetical protein HMPREF1073_00165 [Bacteroides uniformis
           CL03T12C37]
 gi|392686852|gb|EIY80152.1| hypothetical protein HMPREF1072_00793 [Bacteroides uniformis
           CL03T00C23]
 gi|392690034|gb|EIY83305.1| hypothetical protein HMPREF1073_00165 [Bacteroides uniformis
           CL03T12C37]
          Length = 663

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 89/368 (24%), Positives = 143/368 (38%), Gaps = 66/368 (17%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
           L RLY++T D K+L  A  F      G  A +  YL      +H P++     +G  +R 
Sbjct: 223 LVRLYTVTGDKKYLDQAKFFLDA--RGTTARKDIYLQ-----SHKPVLEQEEAVGHAVRA 275

Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADT 392
                       +TGD  Y K I   + +IV     Y TGG  AR   E + D   L + 
Sbjct: 276 GYMYSGMADVAAITGDSSYIKAIDKIWENIV-GKKIYITGGIGARHAGEAFGDNYELPNL 334

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
             +   ETC     + ++  LF    +  Y D  ER L NG++S     + G   Y  PL
Sbjct: 335 --TAYNETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLIS-GVSLDGGKFFYPNPL 391

Query: 453 GRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISSS 510
                   +     T+   F C C  + I  F   L   +Y  ++  V   Y+  ++S+ 
Sbjct: 392 SCDGKYHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQV---YVNLFLSNR 448

Query: 511 FDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVW-------- 559
            + K     VVL Q+      W+  +R+      K   G L  ++N+R+P W        
Sbjct: 449 AELKLNEKKVVLEQETG--YPWNGDIRV------KVAQGNLPFTMNIRIPGWVRGSVLPS 500

Query: 560 ---TYSN----GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAI 608
              +Y++    G +  +NG+ +       +L    +W   D + +   +  R     E +
Sbjct: 501 DLYSYADDLKLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMQPRVVKANEKV 560

Query: 609 QDDRPEYA 616
             DR   A
Sbjct: 561 VADRGRVA 568


>gi|380695298|ref|ZP_09860157.1| hypothetical protein BfaeM_15227 [Bacteroides faecis MAJ27]
          Length = 698

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 55/217 (25%), Positives = 91/217 (41%), Gaps = 18/217 (8%)

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
           + + ETC     +  +  +   T +  YA+  E  L N VLS     +     Y  PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYAELVETCLYNSVLS-GISLDGKRYFYTNPL-R 434

Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
             +    T  W    T++ S +CC    + +  +  +  Y   +EG    LY    +  +
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLNDEGIYCNLYGANTL--T 492

Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
             WK  G +VL Q+ D    WD  +R+ L     ++ G   SL  R+P W     A  ++
Sbjct: 493 IHWKDKGEIVLTQETD--YPWDGNVRVRLN-KLPRKAGAF-SLFFRIPEW--CEKATLTV 546

Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
           NG+ + +    N  +   R W   D  +LT+ +P+ L
Sbjct: 547 NGEPVQIAAKANTYAEVNRIWKKGDMAELTMDMPVRL 583


>gi|354583084|ref|ZP_09001984.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
 gi|353198501|gb|EHB63971.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
          Length = 626

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 54/266 (20%), Positives = 101/266 (37%), Gaps = 22/266 (8%)

Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
           +E+ G P+ +      +D +   H  A G  S  E+      L+ T  S+  E C     
Sbjct: 237 FELNGSPMERESVHRGIDSLMTYHGQAHGMFSGDEW------LSGTHPSQGVELCAVVEY 290

Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGV--------LSIQRGTEPGVMIYMLPLGRGVSK 458
           +     L R   E  + D  E+   N +         S Q   +   +I  +   R  S 
Sbjct: 291 MFSMEQLTRILGEGRFGDILEKVAFNALPAAISPDWTSHQYDQQVNQIICNV-APRAWSN 349

Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
               + +G + N F CC     + + KL   ++ +++    GL  + Y   +     G  
Sbjct: 350 GPDANVFGLEPN-FGCCTANMHQGWPKLAAHLWMKDQEE--GLVAVSYAPCTVMTTVGRH 406

Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
            +   ++  V+ +   +  +      E  +   L+LR+P W   +    +LNG+ LP   
Sbjct: 407 DVAAVIE--VTGEYPFKDRIRIHMSLERAESFPLSLRIPAWC--DDPVITLNGRELPFQV 462

Query: 579 PGNFLSATERWSYNDKLTIQLPLSLR 604
              +    + W   D+L + LP+ +R
Sbjct: 463 ESGYARIVQHWQNGDRLELHLPMEVR 488


>gi|218195658|gb|EEC78085.1| hypothetical protein OsI_17564 [Oryza sativa Indica Group]
          Length = 640

 Score = 42.0 bits (97), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 77/364 (21%), Positives = 125/364 (34%), Gaps = 73/364 (20%)

Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF-------------HAN 334
           L RLY +T +P++L L   F      +P F      +    S++             ++ 
Sbjct: 184 LMRLYDVTEEPRYLNLVKYFIEERGAQPHFYDIEYEKRGKTSYWNTYGPAWMVKDKAYSQ 243

Query: 335 THIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG-- 376
            H P+      IG  +R+            ++GD   +       + +     Y TGG  
Sbjct: 244 AHQPLAEQQTAIGHAVRFVYLMAGMAHLARLSGDEGKRQDCLRLWNNMAQRQLYITGGIG 303

Query: 377 --TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
             +S   F  D     DT+ +E   +C +  ++  +R +     +  YAD  ERAL N V
Sbjct: 304 SQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSHYADVMERALYNTV 360

Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGI 480
           L      +     Y+ PL          H     FN  +              CC     
Sbjct: 361 LG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIYDHVKPVRQRWFGCACCPPNIA 411

Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
              + LG  IY         L I  Y+ +    +     L  ++     W   + + +T 
Sbjct: 412 RVLTSLGHYIYTVRPD---ALLINLYVGNDVAIQIDENTLRLRISGNYPWQDQVTIEIT- 467

Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
                V    +L LR+P W        SLNG+ +       +L     W   D LT+ LP
Sbjct: 468 ---SPVPVTHTLALRLPDWCAEPA--VSLNGERVTGEVVRGYLYLNRCWHEGDTLTLTLP 522

Query: 601 LSLR 604
           + +R
Sbjct: 523 MPVR 526


>gi|270295877|ref|ZP_06202077.1| six-hairpin glycosidase [Bacteroides sp. D20]
 gi|270273281|gb|EFA19143.1| six-hairpin glycosidase [Bacteroides sp. D20]
          Length = 663

 Score = 42.0 bits (97), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 89/368 (24%), Positives = 143/368 (38%), Gaps = 66/368 (17%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
           L RLY++T D K+L  A  F      G  A +  YL      +H P++     +G  +R 
Sbjct: 223 LVRLYTVTGDKKYLDQAKFFLDA--RGTTARKDIYLQ-----SHKPVLEQEEAVGHAVRA 275

Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADT 392
                       +TGD  Y K I   + +IV     Y TGG  AR   E + D   L + 
Sbjct: 276 GYMYSGMADVAAITGDSSYIKAIDKIWENIV-GKKIYITGGIGARHAGEAFGDNYELPNL 334

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
             +   ETC     + ++  LF    +  Y D  ER L NG++S     + G   Y  PL
Sbjct: 335 --TAYNETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLIS-GVSLDGGKFFYPNPL 391

Query: 453 GRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISSS 510
                   +     T+   F C C  + I  F   L   +Y  ++  V   Y+  ++S+ 
Sbjct: 392 SCDGKYHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQV---YVNLFLSNR 448

Query: 511 FDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVW-------- 559
            + K     VVL Q+      W+  +R+      K   G L  ++N+R+P W        
Sbjct: 449 AELKLNEKKVVLEQETG--YPWNGDIRV------KVAQGNLPFTMNIRIPGWVRGSVLPS 500

Query: 560 ---TYSN----GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAI 608
              +Y++    G +  +NG+ +       +L    +W   D + +   +  R     E +
Sbjct: 501 DLYSYADDLKLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMHPRVVKANEKV 560

Query: 609 QDDRPEYA 616
             DR   A
Sbjct: 561 VADRGRVA 568


>gi|403743937|ref|ZP_10953416.1| hypothetical protein URH17368_0706 [Alicyclobacillus hesperidum
           URH17-3-68]
 gi|403122527|gb|EJY56741.1| hypothetical protein URH17368_0706 [Alicyclobacillus hesperidum
           URH17-3-68]
          Length = 712

 Score = 42.0 bits (97), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 55/217 (25%), Positives = 85/217 (39%), Gaps = 25/217 (11%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG----R 454
           ETC +  ++  +  + R +    YAD  ERAL N V+      +     Y+ PL      
Sbjct: 384 ETCASIGLIFFANRMIRISPRREYADVMERALYNVVIG-SMALDGKHYCYVNPLALWPPA 442

Query: 455 GVSKARSTHGWGTKFNSFW--CCYGTGIESFSKLGDSIYF--EEEGNVPGLYIIQYISS- 509
            +      H    +   F   CC          LGD IY   EE+G V   Y+  YI S 
Sbjct: 443 NIQNPDRKHVKPVRQAWFGCACCPPNVARLMMSLGDYIYTIDEEKGKV---YVHLYIGSE 499

Query: 510 -SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVWTYSNGAQA 567
            SF      +VL Q  D  + W   ++  +        G ++ SL LR+P W  ++    
Sbjct: 500 ASFSVGGRKIVLIQ--DSEMPWQGRVKFRVALGE----GPVNFSLALRIPSWC-ADTPSV 552

Query: 568 SLNGQNLPLPP---PGNFLSATERWSYNDKLTIQLPL 601
            +NG  L +        ++     W+  D L + LP+
Sbjct: 553 RVNGNLLSIASVTTKDGYIEIERTWTDGDVLELDLPM 589


>gi|251796469|ref|YP_003011200.1| hypothetical protein Pjdr2_2459 [Paenibacillus sp. JDR-2]
 gi|247544095|gb|ACT01114.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
          Length = 659

 Score = 42.0 bits (97), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 85/242 (35%), Gaps = 31/242 (12%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPL---- 452
           ETC +  ++  ++ + +   +  YAD  ERAL N V+    Q G       Y+ PL    
Sbjct: 338 ETCASIGLIFFAQRMLKLEAKSEYADVLERALYNNVVGSMSQDGKH---YFYVNPLEVWP 394

Query: 453 -------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
                  GR   KA     +G       CC        S L D IY     N   +Y   
Sbjct: 395 QASEKNPGRHHVKAERQKWFGCS-----CCPPNVARLLSSLNDYIYTVSAANNT-IYTHL 448

Query: 506 YISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
           +I S   F+  +G V L Q+    + W  Y R    F      G   +  LR+P W+   
Sbjct: 449 FIGSVARFELAAGSVSLKQQSQ--LPWKGYTR----FEFDDVPGAAFTFALRIPSWSRGK 502

Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
            A  ++NGQ         +      W   D    +  L  +  A        A   AI  
Sbjct: 503 -AVLNINGQAAEYTEENGYALVNRNWQQGDVAEWEPALEAQLTAAHPQIRANAGKVAIER 561

Query: 624 GP 625
           GP
Sbjct: 562 GP 563


>gi|410096807|ref|ZP_11291792.1| hypothetical protein HMPREF1076_00970 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225424|gb|EKN18343.1| hypothetical protein HMPREF1076_00970 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 675

 Score = 42.0 bits (97), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 84/416 (20%), Positives = 159/416 (38%), Gaps = 59/416 (14%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRY 347
            +Y LY+IT D   L L HL  K  +    + L  D L+ F+    + +  G +   + Y
Sbjct: 214 AVYWLYNITGDAFLLDLGHLLHKQSYDFVDMFLNRDDLTRFNTIHCVNLAQGIKEPVIYY 273

Query: 348 EVTGDPLY-KLIGTFFMDI--VNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
           +   D  Y   +   F DI   N       GG            L     ++  E C+  
Sbjct: 274 QQHPDKKYLDAVKKGFADIRQYNGQPQGMYGGDEG---------LHGNNPTQGSELCSAV 324

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLS-----------IQRGTEPGVMIYMLPLG 453
            ++     +   T ++A+ D+ ER   N + +            Q+  +  +  +     
Sbjct: 325 ELMYSLEKIMEITGDLAFTDHLERIAFNALPTQVTDDFMDKQYFQQANQVMITRHAHNFY 384

Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
              + A +   +GT+   + CC+    + + K   S+++    N  G+  + Y  S    
Sbjct: 385 EDANHAETDIIYGTR-TGYPCCFSNMHQGWPKFTQSLWYATPDN--GIAALAYSPSEVTA 441

Query: 514 KSGH-VVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           K G+   +    +     D  +++T+    K +E+     L+LR+P W     A  ++NG
Sbjct: 442 KVGNGCKIKITEETCYPMDDKIQLTIRLLDKTKEIA--FPLHLRIPGWCKE--ATVTVNG 497

Query: 572 QNLPLP---PPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
               +P     GN ++   R W   D++ + LP+ + T         Y +  A+  GP +
Sbjct: 498 ----VPESTAKGNSVAIIRRTWKSGDQVLLHLPMEVSTSKW------YENSVAVERGPLV 547

Query: 628 LAGHTSGEWDIK-------TGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMS 676
            A     +W+ K       T   +S   + SP    +N  +V F  ++    F ++
Sbjct: 548 YALKMDEKWEKKEFKGDEITQFGKSYYEVTSPT--KWNYGIVAFDPDNMQENFQVT 601


>gi|160890885|ref|ZP_02071888.1| hypothetical protein BACUNI_03330 [Bacteroides uniformis ATCC 8492]
 gi|156859884|gb|EDO53315.1| hypothetical protein BACUNI_03330 [Bacteroides uniformis ATCC 8492]
          Length = 663

 Score = 42.0 bits (97), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 89/368 (24%), Positives = 143/368 (38%), Gaps = 66/368 (17%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
           L RLY++T D K+L  A  F      G  A +  YL      +H P++     +G  +R 
Sbjct: 223 LVRLYTVTGDKKYLDQAKFFLDA--RGTTARKDIYLQ-----SHKPVLEQEEAVGHAVRA 275

Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADT 392
                       +TGD  Y K I   + +IV     Y TGG  AR   E + D   L + 
Sbjct: 276 GYMYSGMADVAAITGDSSYIKAIDKIWENIV-GKKIYITGGIGARHTGEAFGDNYELPNL 334

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
             +   ETC     + ++  LF    +  Y D  ER L NG++S     + G   Y  PL
Sbjct: 335 --TAYNETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLIS-GVSLDGGKFFYPNPL 391

Query: 453 GRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISSS 510
                   +     T+   F C C  + I  F   L   +Y  ++  V   Y+  ++S+ 
Sbjct: 392 SCDGKYHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQV---YVNLFLSNR 448

Query: 511 FDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVW-------- 559
            + K     VVL Q+      W+  +R+      K   G L  ++N+R+P W        
Sbjct: 449 AELKLNEKKVVLEQETG--YPWNGDIRV------KVAQGNLPFTMNIRIPGWVRGSVLPS 500

Query: 560 ---TYSN----GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAI 608
              +Y++    G +  +NG+ +       +L    +W   D + +   +  R     E +
Sbjct: 501 DLYSYADDLKLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMHPRVVKANEKV 560

Query: 609 QDDRPEYA 616
             DR   A
Sbjct: 561 VADRGRVA 568


>gi|198274386|ref|ZP_03206918.1| hypothetical protein BACPLE_00531 [Bacteroides plebeius DSM 17135]
 gi|198272752|gb|EDY97021.1| hypothetical protein BACPLE_00531 [Bacteroides plebeius DSM 17135]
          Length = 821

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 80/384 (20%), Positives = 140/384 (36%), Gaps = 79/384 (20%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
            L +LY +T D K+L +A  F +    G    + +  S      H PI     ++G  +R
Sbjct: 230 ALCKLYKVTGDKKYLDMARYFVEETGRGTDGHKLNEYSQ----DHKPILQQDEIVGHAVR 285

Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
                        +T D  Y    T   D + +   Y TGG  +R          +  G 
Sbjct: 286 AGYLYSGVADVAALTNDTAYFHALTRLWDNLVSKKLYITGGMGSRA-------QGEGFGP 338

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
             E        ETC     +  +  +F  T +  Y D  ERAL NGV+S       GV +
Sbjct: 339 NYELQNHTAYCETCAAIANVYWNYRMFLATGDSKYVDVLERALYNGVIS-------GVSL 391

Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
                 Y  PL       R       ++    CC G      + +    Y  ++ ++   
Sbjct: 392 SGDKFFYDNPLESMGEHERQ------RWFGCACCPGNVTRFMASVPSYAYATQQNDI--- 442

Query: 502 YIIQYISSSFDWKSG--HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           Y+  YI    + ++    V L Q  +    W+   ++T+  + ++E G+  ++ LR+P W
Sbjct: 443 YVNLYIQGKAEMQTADNKVTLEQTTE--YPWNG--KVTIKVTPEKE-GKF-AIRLRIPGW 496

Query: 560 T-----------YSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           T           Y++ A+     +NG          + +    W   D + +++P+ +R 
Sbjct: 497 TKAAPVASDLYAYTDAAKKYTLKVNGSATRGAEGDGYETIVRTWKAGDVIELEMPMDVRR 556

Query: 606 EAIQDDRPEYASIQAILFGPYLLA 629
               D       + A+  GP +  
Sbjct: 557 IKANDKVEVDRGMVALERGPIMFC 580


>gi|265752773|ref|ZP_06088342.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|263235959|gb|EEZ21454.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
          Length = 801

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 63/275 (22%), Positives = 107/275 (38%), Gaps = 34/275 (12%)

Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYN 405
           +TGD  Y        D +     Y TGG   TS  E +     L +   S   ETC    
Sbjct: 287 LTGDSAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM--SAYCETCAAIG 344

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
            + V+  LF    E  Y D  ER L NG++S     + G   Y  PL   + + +    +
Sbjct: 345 NVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL-ESIGQHQRQPWF 402

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
           G       CC          L   +Y  ++ +V   Y+  ++S++ + K     ++ +  
Sbjct: 403 GCA-----CCPSNVCRFIPSLPGYVYAVKDKDV---YVNLFMSNTSNLKVEGKAVSLEQA 454

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQAS----LN 570
               WD  + + +   +K   GQ  ++ +R+P W           TYS+G + S    +N
Sbjct: 455 THYPWDGDVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTYSDGKRLSYTVKVN 510

Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           G+++       +     RW   DK+ +   +  RT
Sbjct: 511 GESVQSELKDGYFCIDRRWKKGDKVEVHFDMEPRT 545


>gi|317479689|ref|ZP_07938812.1| hypothetical protein HMPREF1007_01928 [Bacteroides sp. 4_1_36]
 gi|316904142|gb|EFV25973.1| hypothetical protein HMPREF1007_01928 [Bacteroides sp. 4_1_36]
          Length = 647

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 89/368 (24%), Positives = 143/368 (38%), Gaps = 66/368 (17%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
           L RLY++T D K+L  A  F      G  A +  YL      +H P++     +G  +R 
Sbjct: 223 LVRLYTVTGDKKYLDQAKFFLDA--RGTTARKDIYLQ-----SHKPVLEQEEAVGHAVRA 275

Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADT 392
                       +TGD  Y K I   + +IV     Y TGG  AR   E + D   L + 
Sbjct: 276 GYMYSGMADVAAITGDSSYIKAIDKIWENIV-GKKIYITGGIGARHTGEAFGDNYELPNL 334

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
             +   ETC     + ++  LF    +  Y D  ER L NG++S     + G   Y  PL
Sbjct: 335 --TAYNETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLIS-GVSLDGGKFFYPNPL 391

Query: 453 GRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISSS 510
                   +     T+   F C C  + I  F   L   +Y  ++  V   Y+  ++S+ 
Sbjct: 392 SCDGKYHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQV---YVNLFLSNR 448

Query: 511 FDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVW-------- 559
            + K     VVL Q+      W+  +R+      K   G L  ++N+R+P W        
Sbjct: 449 AELKLNEKKVVLEQETG--YPWNGDIRV------KVAQGNLPFTMNIRIPGWVRGSVLPS 500

Query: 560 ---TYSN----GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAI 608
              +Y++    G +  +NG+ +       +L    +W   D + +   +  R     E +
Sbjct: 501 DLYSYADDLKLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMHPRVVKANEKV 560

Query: 609 QDDRPEYA 616
             DR   A
Sbjct: 561 VADRGRVA 568


>gi|237720781|ref|ZP_04551262.1| six-hairpin glycosidase [Bacteroides sp. 2_2_4]
 gi|229449616|gb|EEO55407.1| six-hairpin glycosidase [Bacteroides sp. 2_2_4]
          Length = 698

 Score = 41.6 bits (96), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 83/347 (23%), Positives = 139/347 (40%), Gaps = 47/347 (13%)

Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
           +  +Y  T +P++L L+ +L D    +          +  +  Y +  HA     +  G 
Sbjct: 248 VVEMYRATENPRYLELSKNLIDIRGMVENGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307

Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
              Y  TG+  L K + + + DIV     Y TG       GTS     ++P   +++  +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366

Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
            G        + + ETC     +  +  +   T +  YAD  E  L N VLS     +  
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 425

Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
              Y  PL R  +    T  W    T++ S +CC    + +  +  +  Y    EG    
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484

Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           LY    ++++  WK  G + L Q+ D    W+  +R+TL    ++  G   SL LR+P W
Sbjct: 485 LYGANTLTTT--WKDKGELTLTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW 538

Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
                   ++NGQ L      N  +   R W   D  +L + +P+ L
Sbjct: 539 --CEKTTLTVNGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583


>gi|304316161|ref|YP_003851306.1| hypothetical protein Tthe_0663 [Thermoanaerobacterium
           thermosaccharolyticum DSM 571]
 gi|302777663|gb|ADL68222.1| protein of unknown function DUF1680 [Thermoanaerobacterium
           thermosaccharolyticum DSM 571]
          Length = 673

 Score = 41.6 bits (96), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 55/234 (23%), Positives = 95/234 (40%), Gaps = 17/234 (7%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG---RG 455
           ETC +  ++  +  + +   +  Y+D  ERAL N V+S     +     Y+ PL      
Sbjct: 358 ETCASVGLVFFAHRMLQIDPDRQYSDVMERALYNTVIS-GMSLDGKKFFYVNPLEVWPEA 416

Query: 456 VSKAR-STHGWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
             K +  +H   T+   F   CC        + LG  IY ++   V   ++  Y+ S   
Sbjct: 417 CEKNKVKSHVKYTRQPWFGCACCPPNIARLLTSLGKYIYSKKAKEV---FVHLYVDSELK 473

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
            K     +N K      WD   ++ +   SK+E     +L++R+P W      + + N  
Sbjct: 474 EKISESEVNIKQSTQYPWDE--KIIIDIDSKKETE--FTLSIRIPGWCKEAKVKVNNNEI 529

Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLS-LRTEAIQDDRPEYASIQAILFGP 625
           +L       +     RW + D L I L +  +R +A  + R +   + AI  GP
Sbjct: 530 DLDSVMEKGYAKINRRWKH-DSLEIYLSMPVMRIKANPNVREDEGKV-AIQRGP 581


>gi|281421440|ref|ZP_06252439.1| putative cytoplasmic protein [Prevotella copri DSM 18205]
 gi|281404512|gb|EFB35192.1| putative cytoplasmic protein [Prevotella copri DSM 18205]
          Length = 690

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 90/213 (42%), Gaps = 34/213 (15%)

Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L RLY++T + K+L  A +L D   + G   ++  Y     + + +PI+     +G  +R
Sbjct: 238 LARLYTLTGEKKYLDEAKYLLD---YRGKTHIRNPY-----SQSQVPILEQKEAVGHAVR 289

Query: 347 Y-----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLAD 391
                        +T D  Y K+I   F +IV   + Y TGG  AR   E + +   L +
Sbjct: 290 AGYMYAGIADVAALTKDSAYMKVIDRIFENIVGKKY-YLTGGVGARHAGEAFGENYELPN 348

Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
              +   ETC   +M+ +   +F    E  Y D  ER L NGV+S     + G   Y  P
Sbjct: 349 M--TAYNETCAAISMVYLFERMFLLHGESKYIDCMERTLYNGVIS-GMSMDGGRFFYPNP 405

Query: 452 LGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF 483
           L      A +  G  T+   F C C  + +  F
Sbjct: 406 LSSDGKYAFNADGNTTRQPWFGCACCPSNLSRF 438


>gi|345514164|ref|ZP_08793678.1| six-hairpin glycosidase [Bacteroides dorei 5_1_36/D4]
 gi|229435978|gb|EEO46055.1| six-hairpin glycosidase [Bacteroides dorei 5_1_36/D4]
          Length = 801

 Score = 41.2 bits (95), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 78/348 (22%), Positives = 132/348 (37%), Gaps = 59/348 (16%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L +LY +T   K+L  A  F D+    G+     +Y     +  H P+V     +G  +R
Sbjct: 222 LAKLYLVTGQQKYLDQAKFFLDQ---RGYTTRTDEY-----SQAHKPVVEQDEAVGHAVR 273

Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADT 392
                        +TGD  Y        D +     Y TGG   TS  E +     L + 
Sbjct: 274 AAYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM 333

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
             S   ETC     + V+  LF    E  Y D  ER L NG++S     + G   Y  PL
Sbjct: 334 --SAYCETCAAIGNVYVNYRLFLLHGEAKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL 390

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
              + + +    +G       CC          L   +Y  ++ +V   Y+  ++S++ +
Sbjct: 391 -ESIGQHQRQPWFGCA-----CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFMSNTSN 441

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TY 561
            K     ++ +      W+  + + +   +K   GQ  ++ +R+P W           TY
Sbjct: 442 LKVEGKAVSLEQTTHYPWNGEVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTY 497

Query: 562 SNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           S+G + S    +NG+ +       +     RW   DK+ +   +  RT
Sbjct: 498 SDGKRLSYTVKVNGEPVQSELKDGYFCIDRRWKKGDKIAVHFDMEPRT 545


>gi|218678364|ref|ZP_03526261.1| hypothetical protein RetlC8_05602 [Rhizobium etli CIAT 894]
          Length = 345

 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 54/233 (23%), Positives = 93/233 (39%), Gaps = 24/233 (10%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC +  ++  +  +     +  YAD  E+AL NG L     T+     Y  PLG     
Sbjct: 131 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GLSTDGKTFFYDNPLGSAGKH 189

Query: 459 ARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
               +G      + N        G   ++   D I          +++    ++     +
Sbjct: 190 HPLENGIIAPAARPNIARLVTSIGSYMYAVADDEI---------AVHLYGESTTRLKLAN 240

Query: 516 GHVV-LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
           G  V L Q  +    WD      + F+++ E     +L+LR+P W  + GA  S+NG+ L
Sbjct: 241 GAAVELQQATN--YPWD----GAVAFTTRLEKPAKFALSLRIPDW--AEGATLSVNGEKL 292

Query: 575 PLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
            L       +     +W+  D++ + LPLSLR +       + A   A++ GP
Sbjct: 293 DLGAAVRDGYARIDRQWADGDRVDLFLPLSLRPQYANPKVRQDAGRVALMRGP 345


>gi|257067398|ref|YP_003153653.1| hypothetical protein Bfae_01840 [Brachybacterium faecium DSM 4810]
 gi|256558216|gb|ACU84063.1| uncharacterized conserved protein [Brachybacterium faecium DSM
           4810]
          Length = 643

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 48/214 (22%), Positives = 91/214 (42%), Gaps = 23/214 (10%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-----G 453
           ETC     +  S  L+  T  + YAD+ ER L N V+++    +     Y  PL     G
Sbjct: 332 ETCAGIAAIMFSWRLYLATGGVEYADFIERVLYN-VVAVSPSPDGRAFFYSNPLHQREPG 390

Query: 454 RGVSKARSTHGWGTKFNSFW---CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
              S + +    G+    ++   CC      + + + DS +   +G   GL ++QY S +
Sbjct: 391 DSASSSVNMRAEGSTRAPWFDVSCCPTNVARTLASV-DSFFAATDGE--GLTLLQYASGT 447

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
           +   +  V ++ +      +     + LT     E    ++L LR+P W  ++GA  ++ 
Sbjct: 448 YRTPALTVAVHTE------YPAQGAIALTVLDAAE--DPATLRLRVPSW--ADGAALTVG 497

Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            + +    PG +   T  W   +++ + LP+  R
Sbjct: 498 SEPVRTVTPG-WSEVTRTWRAGERVLLDLPVVPR 530


>gi|433774251|ref|YP_007304718.1| hypothetical protein Mesau_02961 [Mesorhizobium australicum
           WSM2073]
 gi|433666266|gb|AGB45342.1| hypothetical protein Mesau_02961 [Mesorhizobium australicum
           WSM2073]
          Length = 666

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 97/482 (20%), Positives = 187/482 (38%), Gaps = 81/482 (16%)

Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKP- 224
           +G  +  +A       N  +++K+  V+      Q +   GYLS++       ++ ++P 
Sbjct: 108 LGKTIETAAYSLYRRKNPELEKKIDAVIDMYGRLQQE--DGYLSSW-------YQRIQPG 158

Query: 225 -VWAPYYTIHKI-LAGLLDQYVLA--DNAQALKMATWMVEYFYNRVQKVITMYSVERHWY 280
             W      H++  AG L +  +A        K+   M  Y  + +  V+     ++  Y
Sbjct: 159 KRWTNLRDCHELYCAGHLIEGAVAYYQATGKRKLLDIMCRYA-DHIASVLGPEPGKKKGY 217

Query: 281 SLNEETGGMNDVLYRLYSITHDPKHLLLA-----------HLFDKPCFLGFLALQADYLS 329
             +EE   +   L +L  +T + K++ LA           H FD+         +A +  
Sbjct: 218 CGHEE---IELALVKLARVTGEQKYMELAKYFIDQRGQQPHYFDEEARARGADPKAYHFK 274

Query: 330 HF-HANTHIPI-----VIGSQMRYEVT-----------GDPLYKLIGTFFMDIVNASHSY 372
            + ++ +HIP+     V+G  +R               GD   ++      D +   + Y
Sbjct: 275 TYEYSQSHIPVREQDKVVGHAVRAMYLYSGMADIATEYGDDTLRVALDRLWDDLTTKNLY 334

Query: 373 ATGGTSAREFWWDPKRLADTLGSE----NE----ETCTTYNMLKVSRHLFRWTKEIAYAD 424
            TGG         P    +   S+    NE    ETC +  ++  +  +        YAD
Sbjct: 335 ITGGLG-------PSAHNEGFTSDYDLPNETAYAETCASVGLVFWATRMLGMGPNARYAD 387

Query: 425 YYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG-WGTKFNSFWCCYGTGIESF 483
             ERAL NG +S     +  +  Y  PL     ++R  H  W  K++   CC        
Sbjct: 388 MMERALYNGSIS-GLSLDGSLFFYENPL-----ESRGKHNRW--KWHRCPCCPPNIGRMV 439

Query: 484 SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK 543
           + +G S ++    +   +++    ++ FD     V L Q       WD  + +T+   + 
Sbjct: 440 ASIG-SYFYSLADDALAVHLYGDSTARFDIADTPVTLTQASR--YPWDGAVEITVEPQTS 496

Query: 544 QEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPL 601
            E     +L+LR+P W  S+ A+  +NG+ + L       + +   +W   D++ + L +
Sbjct: 497 VEF----TLHLRVPAW--SSKAKLEINGEAIDLAEVTSDGYAAIRRQWKKGDRVRLDLEM 550

Query: 602 SL 603
            +
Sbjct: 551 PI 552


>gi|170288466|ref|YP_001738704.1| hypothetical protein TRQ2_0668 [Thermotoga sp. RQ2]
 gi|170175969|gb|ACB09021.1| protein of unknown function DUF1680 [Thermotoga sp. RQ2]
          Length = 620

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 71/342 (20%), Positives = 133/342 (38%), Gaps = 50/342 (14%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD-----YLSHF----------HANTH 336
            L  LY  T D K+L LA  F      G  ++  +     ++ H           HA   
Sbjct: 195 ALVELYRETGDRKYLDLARYFIYTRGKGLASVPRNPGPEYFIDHKPFVELEEITGHAVRA 254

Query: 337 IPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
           + +  G+   Y  TGD  +++ +   + + V     Y TGG  +R  W       ++ G 
Sbjct: 255 LYLCSGATDLYLETGDEKIWQALNRLWENFVT-KKMYITGGAGSRHDW-------ESFGE 306

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
           E E        E+C +      +  +   T +  +AD  E+ L NG+LS     +     
Sbjct: 307 EYELPNRRSYAESCASIANFMWNFRMLLATGDGKFADVMEQVLYNGLLS-GISLDGKHYF 365

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  PL     + R       K+    CC        +     +Y   +  V  +++ +  
Sbjct: 366 YFNPL-EDYGRTRR-----QKWFDCACCPPNLARFIASFPGYMYTTSDDGVQ-VHLYEKS 418

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
           ++  D+K   V + Q+ D    W       + F+ K ++ +  S+ LR+P W  ++    
Sbjct: 419 TARLDFKGSVVEIEQETD--YPWSG----EIAFTIKTDIEEPFSIYLRLPSW--ADDFVL 470

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
            + G+ +   P   ++  ++ W    K T++L L ++ E I+
Sbjct: 471 RVGGKTVTAKPQNGYVKLSQNW--KGKHTVELSLPMKAEFIE 510


>gi|298481311|ref|ZP_06999504.1| hypothetical protein HMPREF0106_01752 [Bacteroides sp. D22]
 gi|298272515|gb|EFI14083.1| hypothetical protein HMPREF0106_01752 [Bacteroides sp. D22]
          Length = 698

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 84/347 (24%), Positives = 139/347 (40%), Gaps = 47/347 (13%)

Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
           +  +Y  T +P++L L+ +L D    +          +  +  Y +  HA     +  G 
Sbjct: 248 VVEMYRATGNPRYLELSKNLIDIRGMVENGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307

Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
              Y  TG+  L K + + + DIV     Y TG       GTS     ++P   +++  +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366

Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
            G        + + ETC     +  +  +   T +  YAD  E  L N VLS     +  
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 425

Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
              Y  PL R  +    T  W    T++ S +CC    + +  +  +  Y    EG    
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484

Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           LY    +++   WK  G + L Q+ D    W+  +R+TL    ++  G   SL LR+P W
Sbjct: 485 LYGANTLTTI--WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW 538

Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
                A  ++NGQ L      N  +   R W   D  +L + +P+ L
Sbjct: 539 --CEKATLTVNGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583


>gi|146301833|ref|YP_001196424.1| hypothetical protein Fjoh_4097 [Flavobacterium johnsoniae UW101]
 gi|146156251|gb|ABQ07105.1| protein of unknown function DUF1680 [Flavobacterium johnsoniae
           UW101]
          Length = 672

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 55/236 (23%), Positives = 93/236 (39%), Gaps = 23/236 (9%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC     +  +  + + T +  YAD  E AL N VLS     E    +Y  PL   VS 
Sbjct: 357 ETCANIGNVLWNWRMLQITGDAKYADIIELALYNSVLS-GMDLEGEKFLYNNPL--NVSN 413

Query: 459 ARSTHG-WGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
               H  WG +   +     CC      + +++G+  Y   +    GLY+  Y S+    
Sbjct: 414 DLPFHQRWGNEREGYIALSNCCAPNVTRTIAEVGNYAYNISK---EGLYVNLYGSNQLKT 470

Query: 514 KS---GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
           KS     + + Q+ +    WD  + + +  + K     L +  LR+P W  S  A+  +N
Sbjct: 471 KSLNGEEIEIEQQTN--YPWDGKITLKIVKAPK----DLQNFFLRIPGW--SQNAEILIN 522

Query: 571 GQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
              +      G +L   ++W   D + +  P+ +          E  +  A+  GP
Sbjct: 523 NSKINDKIVSGTYLKLNQKWKKGDVIELNFPMPVELMEANPLVEEVKNQVAVKRGP 578


>gi|298385749|ref|ZP_06995307.1| hypothetical protein HMPREF9007_02451 [Bacteroides sp. 1_1_14]
 gi|298261890|gb|EFI04756.1| hypothetical protein HMPREF9007_02451 [Bacteroides sp. 1_1_14]
          Length = 698

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 82/347 (23%), Positives = 140/347 (40%), Gaps = 47/347 (13%)

Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
           +  +Y  T +P++L L+ +L D    +          +  +  Y +  HA     +  G 
Sbjct: 248 VVEMYRATGNPRYLELSKNLIDIRGMVESGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307

Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
              Y  TG+  L K + + + DIV     Y TG       GTS     ++P   +++  +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366

Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
            G        + + ETC     +  +  +   T +  YAD  E  L N VLS     +  
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 425

Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
              Y  PL R  +    T  W    T++ S +CC    + +  +  +  Y    EG    
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484

Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           LY    +++  +WK  G + L Q+ D    W+  +R+TL     ++ G   SL  R+P W
Sbjct: 485 LYGANTLTT--NWKDKGELALVQETD--YPWEGNVRVTLN-KVPRKAGAF-SLFFRIPEW 538

Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
                A  ++NGQ + +    N  +   R W   D  +L + +P+ L
Sbjct: 539 --CGKAALTVNGQPVSMNAKANTYAEVNRTWKKGDVVELVMDMPVCL 583


>gi|403252915|ref|ZP_10919220.1| hypothetical protein EMP_04025 [Thermotoga sp. EMP]
 gi|402811677|gb|EJX26161.1| hypothetical protein EMP_04025 [Thermotoga sp. EMP]
          Length = 621

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 68/335 (20%), Positives = 130/335 (38%), Gaps = 48/335 (14%)

Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD-----YLSHF----------HANTH 336
            L  LY  T D K+L LA  F      GF ++  +     ++ H           HA   
Sbjct: 195 ALVELYRETGDRKYLDLARYFIYARGKGFASVPRNPGPEYFIDHKPFVELEEITGHAVRA 254

Query: 337 IPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
           + +  G+   Y  TGD  +++ +   + + V     Y TGG  +R  W       ++ G 
Sbjct: 255 LYLCSGATDLYLETGDEKIWQALNKLWENFVT-KKMYITGGAGSRHDW-------ESFGE 306

Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
           E E        E+C +      +  +   T +  +AD  E+ L NG+LS     +     
Sbjct: 307 EYELPNRRSYAESCASIANFMWNFRMLLATGDGKFADVMEQVLYNGLLS-GISLDGKHYF 365

Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
           Y  PL   + + R       K+    CC        +     +Y   +  V  +++ +  
Sbjct: 366 YFNPL-EDLGRTRR-----QKWFDCACCPPNLARFIASFPGYMYTTSDDGVQ-VHLYEKS 418

Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
           ++  D+K   V + Q+ D    W       +TF+ + ++ +  S+ LR+P W  ++    
Sbjct: 419 TARLDFKGSVVEIEQETD--YPWSG----EVTFTVETDIEEPFSIYLRIPSW--ADDFVL 470

Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
            ++G+ +   P   ++   + W     + + LP+ 
Sbjct: 471 RVDGKAVIAKPQNGYVKLNQSWKGKHTVELSLPMK 505


>gi|436837570|ref|YP_007322786.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
 gi|384068983|emb|CCH02193.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
          Length = 683

 Score = 40.8 bits (94), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 74/334 (22%), Positives = 126/334 (37%), Gaps = 38/334 (11%)

Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
           D LY  Y + +  K   L  L  K         QA+ L ++H N +I         Y + 
Sbjct: 217 DNLYSAYWLYNRTKAPFLLELAQKIHRNTANWRQANNLPNWH-NVNIAQCFREPATYYLQ 275

Query: 351 GDPLYKLIGTFF-MDIVNASHSYATGGT-----SAREFWWDPKRLADTLGSENEETCTTY 404
                 L+ T+   ++V   +    GG      ++R  + DP++          ETC   
Sbjct: 276 SGDQSDLMATYHNFELVRQRYGQVPGGMWGGDENSRPGYTDPRQAV--------ETCGMV 327

Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH- 463
             +     L R+T +  +AD  E    N  L      +   + Y+       S A + H 
Sbjct: 328 EQMASDELLLRFTGDPFWADNCEDVAFN-TLPAAFMPDYRSLRYLTAPNMVRSDAANHHP 386

Query: 464 -----GWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
                G     N F   CC       +    +++Y     N  GL ++ Y +S    K G
Sbjct: 387 GIDNQGPFLMMNPFSSRCCQHNHANGWVYYAENLYMATPDN--GLAVVLYNASEVTAKVG 444

Query: 517 H---VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
           +   V L Q+      ++  +R+T+  +          L LR+P W   +     +NG+ 
Sbjct: 445 NGSAVTLKQETS--YPFEEQVRLTVQAARPTAF----PLYLRVPAWC--SNPTVRVNGRA 496

Query: 574 LPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
           +P+    G ++  T+ W   DK+T+ LP+ LR  
Sbjct: 497 VPVTAKAGQYIVLTDTWQSGDKITLDLPMRLRVR 530


>gi|423296614|ref|ZP_17274699.1| hypothetical protein HMPREF1070_03364 [Bacteroides ovatus
           CL03T12C18]
 gi|392670337|gb|EIY63822.1| hypothetical protein HMPREF1070_03364 [Bacteroides ovatus
           CL03T12C18]
          Length = 698

 Score = 40.8 bits (94), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 83/347 (23%), Positives = 139/347 (40%), Gaps = 47/347 (13%)

Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
           +  +Y  T +P++L L+ +L D    +          +  +  Y +  HA     +  G 
Sbjct: 248 VVEMYRATGNPRYLELSKNLIDIRGMVENGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307

Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
              Y  TG+  L K + + + DIV     Y TG       GTS     ++P   +++  +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TQKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366

Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
            G        + + ETC     +  +  +   T +  YAD  E  L N VLS     +  
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 425

Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
              Y  PL R  +    T  W    T++ S +CC    + +  +  +  Y    EG    
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484

Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           LY    ++++  WK  G + L Q+ D    W+  +R+TL    ++  G   SL LR+P W
Sbjct: 485 LYGANTLTTT--WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGTF-SLFLRIPEW 538

Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
                   ++NGQ L      N  +   R W   D  +L + +P+ L
Sbjct: 539 --CEKTTLTVNGQPLQTNTKANSYAEVNRTWKKGDVVELVMDMPVRL 583


>gi|319951999|ref|YP_004163266.1| hypothetical protein [Cellulophaga algicola DSM 14237]
 gi|319420659|gb|ADV47768.1| protein of unknown function DUF1680 [Cellulophaga algicola DSM
           14237]
          Length = 699

 Score = 40.8 bits (94), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 50/213 (23%), Positives = 83/213 (38%), Gaps = 19/213 (8%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
           ETC        S  +     E  YAD  E  L N  LS     E     Y  PL R V  
Sbjct: 374 ETCANICNSMFSYRMLGLHGESKYADVMETVLYNSALS-GINIEGDRYYYANPL-RTVHG 431

Query: 459 ARSTHGWGTKFN------SFWCCYGTGIESFSKLGDSIYFEEEGNVP-GLYIIQYISSSF 511
           +R      T+F         +CC    + + +++    Y + E  +   LY    ++++ 
Sbjct: 432 SRDYDKMNTEFPVRQDYLECFCCPPNLVRTIAQVSGWAYSKSENGIAVNLYGGNKLATTL 491

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           +  S    L  K +    W+  + +T+          L    LR+P W  + G++  +NG
Sbjct: 492 NDGSS---LKLKQETKYPWEGDVEITIEACRSDAFDIL----LRIPEW--AEGSKIMING 542

Query: 572 QNLP-LPPPGNFLSATERWSYNDKLTIQLPLSL 603
           +    L  PG + +    W  ND + + LPL++
Sbjct: 543 KESEILATPGTYATLNRTWKANDTIRLDLPLAI 575


>gi|153852636|ref|ZP_01994073.1| hypothetical protein DORLON_00046 [Dorea longicatena DSM 13814]
 gi|149754278|gb|EDM64209.1| hypothetical protein DORLON_00046 [Dorea longicatena DSM 13814]
          Length = 649

 Score = 40.8 bits (94), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 44/174 (25%), Positives = 74/174 (42%), Gaps = 20/174 (11%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPL---- 452
           ETC +  +    R + + TK+ +Y D  ERAL N +LS   Q G       Y+ PL    
Sbjct: 333 ETCASIGLALFGRRMAQITKDASYMDMVERALYNTLLSGIAQDGKS---FFYVNPLEVWP 389

Query: 453 GRGVSKARSTHGWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
              + +    H    +   F   CC      + + +G  IYF ++      Y+  YIS+ 
Sbjct: 390 DNCIDRTSKEHVKPVRQKWFGVACCPPNIARTLASMGQYIYFTDKNTA---YVNLYISNE 446

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP--VWTYS 562
              +     L  +++  ++   ++RM +T   + E      L LR+P  V TY+
Sbjct: 447 AQIELEEGALKIQIESDLTNTGHIRMAITPDGEGE----HRLALRIPDYVKTYT 496


>gi|222099378|ref|YP_002533946.1| hypothetical protein CTN_0404 [Thermotoga neapolitana DSM 4359]
 gi|221571768|gb|ACM22580.1| Putative uncharacterized protein [Thermotoga neapolitana DSM 4359]
          Length = 623

 Score = 40.8 bits (94), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 58/287 (20%), Positives = 110/287 (38%), Gaps = 35/287 (12%)

Query: 332 HANTHIPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
           HA   + +  G+   Y  TGD  +++ +   + + V     Y TGG  +R  W       
Sbjct: 252 HAVRALYLCAGATDLYLETGDEKIWQALNRLWENFVT-KKMYITGGAGSRHDW------- 303

Query: 391 DTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
           ++ G E E        E+C +      +  +   T +  +AD  E+ L NG+LS     +
Sbjct: 304 ESFGEEYELPNRRSYAESCASIANFMWNFRMLLATGDGKFADVMEQVLYNGLLS-GISLD 362

Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
                Y  PL       R       K+    CC        +     +Y      V  ++
Sbjct: 363 GKHYFYFNPLEDSGRTRRQ------KWFDCACCPPNLARFIASFPGYMYTTSNDGVQ-VH 415

Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
           + +  ++   +K   V + Q+ D    W       +  S + E+ +  S+ LR+P W  +
Sbjct: 416 LYEKSTAKVSFKGSTVKIEQETD--YPWSG----EIVLSIETEIEEPFSIYLRIPTW--A 467

Query: 563 NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
           +     ++G+ L L P   ++     W    ++ + LP  +R E ++
Sbjct: 468 DDFSIRVDGETLDLEPQNGYVKLNRNWKGGHRIELSLP--MRVELVE 512


>gi|212692436|ref|ZP_03300564.1| hypothetical protein BACDOR_01932 [Bacteroides dorei DSM 17855]
 gi|212665015|gb|EEB25587.1| F5/8 type C domain protein [Bacteroides dorei DSM 17855]
          Length = 801

 Score = 40.8 bits (94), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 63/275 (22%), Positives = 106/275 (38%), Gaps = 34/275 (12%)

Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYN 405
           +TGD  Y        D +     Y TGG   TS  E +     L +   S   ETC    
Sbjct: 287 LTGDSAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM--SAYCETCAAIG 344

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
            + V+  LF    E  Y D  ER L NG++S     + G   Y  PL   + + +    +
Sbjct: 345 NVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL-ESIGQHQRQPWF 402

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
           G       CC          L   +Y  ++ +V   Y+  ++S++ + K     ++ +  
Sbjct: 403 GCA-----CCPSNVCRFIPSLPGYVYAVKDKDV---YVNLFMSNTSNLKVEGKAVSLEQA 454

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQAS----LN 570
               WD  + + +   +K   GQ  ++ +R+P W           TYS+G + S    +N
Sbjct: 455 THYPWDGDVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTYSDGKRLSYTVKVN 510

Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           G+ +       +     RW   DK+ +   +  RT
Sbjct: 511 GEPVQSELKDGYFCIDRRWKKGDKVEVHFDMEPRT 545


>gi|154495095|ref|ZP_02034100.1| hypothetical protein PARMER_04142 [Parabacteroides merdae ATCC
           43184]
 gi|423725063|ref|ZP_17699203.1| hypothetical protein HMPREF1078_03097 [Parabacteroides merdae
           CL09T00C40]
 gi|154085645|gb|EDN84690.1| hypothetical protein PARMER_04142 [Parabacteroides merdae ATCC
           43184]
 gi|409235419|gb|EKN28237.1| hypothetical protein HMPREF1078_03097 [Parabacteroides merdae
           CL09T00C40]
          Length = 617

 Score = 40.8 bits (94), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 51/234 (21%), Positives = 96/234 (41%), Gaps = 31/234 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
           ETC +  M+  ++ + ++T +  Y D  ER++ NG L+       GV +      Y+ PL
Sbjct: 335 ETCASVGMVYWNQRMNQFTGDSKYIDVLERSMYNGALA-------GVSLAGDRFFYVNPL 387

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV-PGLYIIQYISSSF 511
                  R        +    CC          +G+ IY   +  +   L+I      + 
Sbjct: 388 ESNGDHHRQA------WYGCACCPSQISRFLPSIGNYIYGTSDKALWVNLFIGNTTEVTI 441

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           D K   VV+ Q+ D    WD  +++T+T  S+Q +G+   L +R+P W  S     S+NG
Sbjct: 442 DGKK--VVMKQETD--YPWDGLVKLTVT--SEQPLGK--ELRIRIPGWCKS--YTLSVNG 491

Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
             +       + +  + W   D + + + + +   +      +    +A+  GP
Sbjct: 492 NKVDSTTDKGY-TVIKEWKTGDLIVLNMDMPVEKVSADPRVRQNTGKRALQRGP 544


>gi|423348679|ref|ZP_17326361.1| hypothetical protein HMPREF1060_04033 [Parabacteroides merdae
           CL03T12C32]
 gi|409213200|gb|EKN06224.1| hypothetical protein HMPREF1060_04033 [Parabacteroides merdae
           CL03T12C32]
          Length = 617

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 51/234 (21%), Positives = 96/234 (41%), Gaps = 31/234 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
           ETC +  M+  ++ + ++T +  Y D  ER++ NG L+       GV +      Y+ PL
Sbjct: 335 ETCASVGMVYWNQRMNQFTGDSKYIDVLERSMYNGALA-------GVSLAGDRFFYVNPL 387

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV-PGLYIIQYISSSF 511
                  R        +    CC          +G+ IY   +  +   L+I      + 
Sbjct: 388 ESNGDHHRQA------WYGCACCPSQISRFLPSIGNYIYGTSDKALWVNLFIGNTTEVTI 441

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
           D K   VV+ Q+ D    WD  +++T+T  S+Q +G+   L +R+P W  S     S+NG
Sbjct: 442 DGKK--VVMKQETD--YPWDGLVKLTVT--SEQPLGK--ELRIRIPGWCKS--YTLSVNG 491

Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
             +       + +  + W   D + + + + +   +      +    +A+  GP
Sbjct: 492 NKVDSTTDKGY-TVIKEWKTGDLIVLNMDMPVEKVSADPRVRQNTGKRALQRGP 544


>gi|67538270|ref|XP_662909.1| hypothetical protein AN5305.2 [Aspergillus nidulans FGSC A4]
 gi|40743275|gb|EAA62465.1| hypothetical protein AN5305.2 [Aspergillus nidulans FGSC A4]
 gi|259485256|tpe|CBF82133.1| TPA: DUF1680 domain protein (AFU_orthologue; AFUA_1G08910)
           [Aspergillus nidulans FGSC A4]
          Length = 629

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 63/254 (24%), Positives = 93/254 (36%), Gaps = 27/254 (10%)

Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKL-IGTFFMDIVNASHSYATGGTSAREFWWD---PK 387
           HA   +   I +     +TGD   K  +   +MD+      Y TGG  A   W       
Sbjct: 264 HAVRAMYYYIAATDLVRLTGDEEIKAALDRMWMDMTE-RKLYVTGGIGAMRQWEGFGAKY 322

Query: 388 RLADT--LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
            LADT   G    ETC  + ++   + + +   +  YAD  E  L NG L    G + G 
Sbjct: 323 VLADTDESGICYAETCACFALIIWCQRMLQLDLDAKYADVMEVGLYNGFLGAV-GLDGGS 381

Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
             Y  PL       +    W        CC     +    +   IY  ++  V    I  
Sbjct: 382 FYYQNPLRTYTGHPKERSEW----FEVACCPPNVAKLLGSMESLIYSFKDDLVA---IHL 434

Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT--YSN 563
           YI S F      VV++QK +          M  +   +  V   ++L LR+P W   YS+
Sbjct: 435 YIESDFTVPETGVVVSQKTN----------MPWSGDVEISVKGTTALALRIPTWAEGYSS 484

Query: 564 GAQASLNGQNLPLP 577
             Q  +    L +P
Sbjct: 485 SVQGEVKNGYLYIP 498


>gi|160932141|ref|ZP_02079532.1| hypothetical protein CLOLEP_00975 [Clostridium leptum DSM 753]
 gi|156868743|gb|EDO62115.1| hypothetical protein CLOLEP_00975 [Clostridium leptum DSM 753]
          Length = 705

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 58/238 (24%), Positives = 90/238 (37%), Gaps = 24/238 (10%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVL-SIQRGTEPGVMIYMLPL----- 452
           ETC +  ++  +  + +   +  Y D  ERAL N VL S  R  +     Y+ PL     
Sbjct: 389 ETCASIGLIFFAHRMLQMDMDSRYGDVMERALYNVVLGSASRDGK--RFFYVNPLEVWPK 446

Query: 453 -GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
              G    +       K+    CC        + L   +Y  +E  +   Y   YIS   
Sbjct: 447 ACGGNPDKQHVKPVRQKWFGCACCPPNVARLMASLNQYLYSTDEDTI---YTHLYISGEA 503

Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMT-LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
             K     +  K +    WD +++ T L+   + E+    SL LR+P W          N
Sbjct: 504 GIKIAGGEMRLKQESSYPWDGHIKFTVLSALPEDEL----SLGLRLPGWC--RNWSVLFN 557

Query: 571 GQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF--GP 625
           G+ +P P     +L     W   D  T++L L +  E +Q +    A    I F  GP
Sbjct: 558 GKPVPRPVVQKGYLKVAAHWHEGD--TVELRLEMPVECLQANPQVRADAGKIAFQRGP 613


>gi|428217725|ref|YP_007102190.1| alpha-2-macroglobulin domain-containing protein [Pseudanabaena sp.
            PCC 7367]
 gi|427989507|gb|AFY69762.1| alpha-2-macroglobulin domain protein [Pseudanabaena sp. PCC 7367]
          Length = 1968

 Score = 40.8 bits (94), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 54/121 (44%), Gaps = 12/121 (9%)

Query: 587  ERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSL 646
            +RW ++   T+    +L+    Q  +PE+     +L G  LLA    G W      A++L
Sbjct: 1667 KRWRWSSSTTVAQAEALKLMVEQGQKPEFTD--RLLQG--LLAQRRDGLWRNDYENAKAL 1722

Query: 647  SALIS-----PIPPSFNAQLVTFTQESGNSTFVMSNSNQS---ITMEEFPVSGTDAALHA 698
            +AL++     P PP+F A      Q+ G + FV    NQ+   I M E P    +  L  
Sbjct: 1723 AALVAYARNEPTPPNFVAIANLDEQQIGTAQFVGYRDNQTQFEIPMAELPQGEQNLVLSK 1782

Query: 699  T 699
            T
Sbjct: 1783 T 1783


>gi|86143571|ref|ZP_01061956.1| hypothetical protein MED217_13269 [Leeuwenhoekiella blandensis
           MED217]
 gi|85830018|gb|EAQ48479.1| hypothetical protein MED217_13269 [Leeuwenhoekiella blandensis
           MED217]
          Length = 723

 Score = 40.4 bits (93), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 57/224 (25%), Positives = 86/224 (38%), Gaps = 31/224 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADY--------YERALTNGVLSIQRGTEPGVMIYML 450
           ETC     +    HL R T +I +AD+        Y  AL     S++  T P +   +L
Sbjct: 350 ETCGMVEQMNSDEHLLRITGDIFWADHAEEVAFNTYPAALMPDFRSLRYITSPNM---VL 406

Query: 451 PLGRGVSKARSTHGWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
                 S   +  G     N F   CC     + ++   ++++     N  GL  + Y +
Sbjct: 407 NDDANHSPGIANAGPFLMMNPFSSRCCQHNHGQGWAYFTENLFMATPDN--GLAAVLYAA 464

Query: 509 SSFDWKSGH----VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
            S   K        V N    P        R+  T ++ + V     L  R+P W  +  
Sbjct: 465 GSVTAKVSDGKEVTVTNNSNYPFTE-----RLDFTVNTSEAVE--FPLYFRIPAW--AKQ 515

Query: 565 AQASLNGQNLPLPPPGNFLSATER-WSYNDKLTIQLP--LSLRT 605
           A  +LNG+ L   P  N     ER W   D++T+ LP  L LRT
Sbjct: 516 ASVALNGEALDANPSANTYIRIEREWKDGDQVTLTLPKELGLRT 559


>gi|395771959|ref|ZP_10452474.1| hypothetical protein Saci8_19398 [Streptomyces acidiscabies 84-104]
          Length = 654

 Score = 40.4 bits (93), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 104/479 (21%), Positives = 175/479 (36%), Gaps = 84/479 (17%)

Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAP 228
           +L A+    A T + T+  ++  +V  ++  Q +   GYL  +  +L       +P W  
Sbjct: 93  WLEAACWQLADTPDETLATEVEAIVELIAAAQRE--DGYLQTY-YQLGGGTPWTEPGWGH 149

Query: 229 --YYTIHKILAGLLDQYVLADN---AQALKMATWMVEYFY--NRVQKVITMYSVERHWYS 281
             Y   H I A +        +   A A ++A  +   F    +V+ V     VE     
Sbjct: 150 ELYCAGHLIQAAVAHHRATGSDRLLAVARRLADHIDSVFGPGKQVETVCGHPEVE----- 204

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA-----NTH 336
                      L  L+  T + ++L LA  F +    G L+  AD              H
Sbjct: 205 ---------TALVELHRTTDEKRYLDLARYFLERRGHGTLSSGADRGHDRDPGPEYWQDH 255

Query: 337 IPI-----VIGSQMRYEV-----------TGD-PLYKLIGTFFMDIVNASHSYATGGTSA 379
            PI     V G  +R              TGD  L   +   + D+V  + +Y TG   +
Sbjct: 256 TPIRAADEVTGHAVRQLYLLAGAADLAAETGDTELRTALERLWRDMVT-TKTYLTGAVGS 314

Query: 380 REFWWDPKRLADTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
           R  W       +  G  +E        ETC     +  S  +   T E  Y+D  ER L 
Sbjct: 315 RHDW-------EAFGDAHELPADRAYAETCAAIASVHFSWRMALLTGEARYSDLVERTLF 367

Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIESFSK 485
           NG L+   G +    +Y+ PL R   +ARS    G  T   + W    CC    +   + 
Sbjct: 368 NGFLA-GAGLDGRTWLYVNPLHR---RARSHERPGDQTAHRTPWFRCACCPPNVMRLLAG 423

Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
           L    ++    +  GL + QY +  +    G   L  +V     W+  + +T+    +  
Sbjct: 424 L---PHYLATADDSGLQLHQYATGVY----GGDGLTVRVTTEYPWEGTVTVTV---DEAP 473

Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
                +L+LR+P W   +    ++NG  +       +L  T  ++  D + + L +  R
Sbjct: 474 TALPRTLSLRLPAWCADH--TLTVNGTTVEDGADSGWLRITRAFTPGDTVRLDLAMPAR 530


>gi|421613335|ref|ZP_16054421.1| protein containing DUF1680 [Rhodopirellula baltica SH28]
 gi|408495929|gb|EKK00502.1| protein containing DUF1680 [Rhodopirellula baltica SH28]
          Length = 688

 Score = 40.4 bits (93), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 90/221 (40%), Gaps = 28/221 (12%)

Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPL 452
           + + ETC     +  +  +F    E  + D  E AL N VLS     GT      Y  PL
Sbjct: 369 TAHNETCANIGNVLWNWRMFLANGESKHIDVLELALYNSVLSGVDLDGTN---FFYTNPL 425

Query: 453 GRGVSKARSTHGWGTK--FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
            +  +   +    G +  F + +CC      + + +G   Y + +  V   ++  Y S++
Sbjct: 426 RQSDTAPVALRWSGGRKPFVTSFCCPPNLARTIAGVGQYAYGKSDDTV---WVNLYGSNT 482

Query: 511 FD---WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
            D      GHV + Q  D    WD ++++T+     Q V     L LR+P W  +     
Sbjct: 483 LDTHLTNGGHVRIEQTTD--YPWDGHIQITIAECQNQPV----CLKLRIPGWATTT---- 532

Query: 568 SLNGQNLPLP---PPGNFLSATERWSYND--KLTIQLPLSL 603
           +L    +P      PG+++S    WS     +L   +P SL
Sbjct: 533 TLKIDGVPTETTIKPGSYVSLRRAWSPGTVIELDFAMPASL 573


>gi|225018685|ref|ZP_03707877.1| hypothetical protein CLOSTMETH_02635, partial [Clostridium
           methylpentosum DSM 5476]
 gi|224948545|gb|EEG29754.1| hypothetical protein CLOSTMETH_02635 [Clostridium methylpentosum
           DSM 5476]
          Length = 1108

 Score = 40.4 bits (93), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 58/251 (23%), Positives = 98/251 (39%), Gaps = 41/251 (16%)

Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
           +ETC +   +K    +   T +  YAD  E+   N +L   +G  P            V 
Sbjct: 529 QETCISVTWMKFCEKMLSITGDPIYADQIEKTAYNALLGAMQG--PNAQ---------VD 577

Query: 458 KARSTHGW-------GTKFNSFW--------CCYGTGIESFSKLG-DSIYFEEEGNVPGL 501
              ST  W       GT+ + F         CC  +GI     +    I     G V  L
Sbjct: 578 DVCSTLYWDYFTLYNGTRHHEFGGHIEGVDSCCSASGISGLGVIPLAQIMNSAAGPVINL 637

Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
           Y    ++++    SG+ V    VD     +  ++M +    + +V +  ++ LR+P W  
Sbjct: 638 YSPGSMAANT--PSGNKV-RFDVDTNYPVEGEIKMVV----QPDVQEQFTVKLRIPAW-- 688

Query: 562 SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ-- 619
           S      +NG       PG FL     W   D  TI++ +  RT  ++  + + +  +  
Sbjct: 689 SEQTVVKVNGAEQKDVVPGTFLELNRTWKPGD--TIEISMDFRTWIVESPKGKGSDTEGN 746

Query: 620 -AILFGPYLLA 629
            A++ GP +LA
Sbjct: 747 IALVRGPVVLA 757


>gi|423269691|ref|ZP_17248663.1| hypothetical protein HMPREF1079_01745 [Bacteroides fragilis
           CL05T00C42]
 gi|423272751|ref|ZP_17251698.1| hypothetical protein HMPREF1080_00351 [Bacteroides fragilis
           CL05T12C13]
 gi|392700537|gb|EIY93699.1| hypothetical protein HMPREF1079_01745 [Bacteroides fragilis
           CL05T00C42]
 gi|392708315|gb|EIZ01422.1| hypothetical protein HMPREF1080_00351 [Bacteroides fragilis
           CL05T12C13]
          Length = 695

 Score = 40.4 bits (93), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 72/318 (22%), Positives = 116/318 (36%), Gaps = 44/318 (13%)

Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--ETCTTYNMLKVS 410
           P Y    +   D +     + TGG  A  F  D K   D     +   ETC        S
Sbjct: 347 PEYIAAVSRLWDNMIGKRMFITGGVGAVHF--DEKFGPDYFLPTDAYLETCAAVGAGFFS 404

Query: 411 RHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
           + +   T +  Y D  ER L N VL+     GT+     Y  PL    S   +  GW   
Sbjct: 405 QRMNELTGDAKYMDELERTLYNNVLTGISLSGTQ---YTYQNPLN---SAKHARWGW--- 455

Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW---KSGHVVLNQKVD 525
            +   CC    ++  S +   IY ++  N+   Y+  +I S  +        + L QK  
Sbjct: 456 -HDCPCCPPMFLKMMSAMPGFIYSQKGDNI---YVNLFIGSETELSLSDQSRIRLTQKTG 511

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-------------YSNGAQASLNGQ 572
               WD  + MT+    + E  +   L +R+P W                +     +NG+
Sbjct: 512 --YPWDGSVVMTV----EPEKEKTFLLKVRIPGWAQGVENPYDLYRSEVKSAVNLKVNGK 565

Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAG 630
           ++ +     +     +W   D++ + LP+  R     +   +  +  AI  GP  Y L G
Sbjct: 566 SIAMKIFKGYAEIQRKWKKGDRVELTLPVQPRLVTANEAVADLQNKVAIAAGPFVYCLEG 625

Query: 631 -HTSGEWDIKTGTARSLS 647
               G  D++  T   LS
Sbjct: 626 CDNEGVADLRLDTRAPLS 643


>gi|423248317|ref|ZP_17229333.1| hypothetical protein HMPREF1066_00343 [Bacteroides fragilis
           CL03T00C08]
 gi|423253266|ref|ZP_17234197.1| hypothetical protein HMPREF1067_00841 [Bacteroides fragilis
           CL03T12C07]
 gi|392657166|gb|EIY50803.1| hypothetical protein HMPREF1067_00841 [Bacteroides fragilis
           CL03T12C07]
 gi|392660424|gb|EIY54038.1| hypothetical protein HMPREF1066_00343 [Bacteroides fragilis
           CL03T00C08]
          Length = 695

 Score = 40.4 bits (93), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 72/318 (22%), Positives = 116/318 (36%), Gaps = 44/318 (13%)

Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--ETCTTYNMLKVS 410
           P Y    +   D +     + TGG  A  F  D K   D     +   ETC        S
Sbjct: 347 PKYIAAVSRLWDNMIGKRMFITGGVGAVHF--DEKFGPDYFLPTDAYLETCAAVGAGFFS 404

Query: 411 RHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
           + +   T +  Y D  ER L N VL+     GT+     Y  PL    S   +  GW   
Sbjct: 405 QRMNELTGDAKYMDELERTLYNNVLTGISLSGTQ---YTYQNPLN---SAKHARWGW--- 455

Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW---KSGHVVLNQKVD 525
            +   CC    ++  S +   IY ++  N+   Y+  +I S  +        + L QK  
Sbjct: 456 -HDCPCCPPMFLKMMSAMPGFIYSQKGDNI---YVNLFIGSETELSLSDQSRIRLTQKTG 511

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-------------YSNGAQASLNGQ 572
               WD  + MT+    + E  +   L +R+P W                +     +NG+
Sbjct: 512 --YPWDGSVVMTV----EPEKEKTFLLKVRIPGWAQRVENPYDLYRSEVKSAVNLKVNGK 565

Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAG 630
           ++ +     +     +W   D++ + LP+  R     +   +  +  AI  GP  Y L G
Sbjct: 566 SIAMKIFKGYAEIQRKWKKGDRVELTLPVQPRLVTANEAVADLQNKVAIAAGPFVYCLEG 625

Query: 631 -HTSGEWDIKTGTARSLS 647
               G  D++  T   LS
Sbjct: 626 CDNEGVADLRLDTRAPLS 643


>gi|383124478|ref|ZP_09945142.1| hypothetical protein BSIG_3498 [Bacteroides sp. 1_1_6]
 gi|251839029|gb|EES67113.1| hypothetical protein BSIG_3498 [Bacteroides sp. 1_1_6]
          Length = 687

 Score = 40.4 bits (93), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 7/77 (9%)

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
           LR+P WT   GA+  +NG+ + + P  G +L     W+  DK+ + LP+SL     Q ++
Sbjct: 483 LRIPSWT--EGAEVRVNGKKISVKPVSGKYLCIEREWADGDKVEMTLPMSLSMRTWQVNK 540

Query: 613 PEYASIQAILFGPYLLA 629
               +  ++ +GP  L+
Sbjct: 541 ----NSVSVDYGPLTLS 553


>gi|29348940|ref|NP_812443.1| hypothetical protein BT_3531 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340847|gb|AAO78637.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 687

 Score = 40.4 bits (93), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 7/77 (9%)

Query: 554 LRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
           LR+P WT   GA+  +NG+ + + P  G +L     W+  DK+ + LP+SL     Q ++
Sbjct: 483 LRIPSWT--EGAEVRVNGKKISVKPVSGKYLCIEREWADGDKVEMTLPMSLSMRTWQVNK 540

Query: 613 PEYASIQAILFGPYLLA 629
               +  ++ +GP  L+
Sbjct: 541 ----NSVSVDYGPLTLS 553


>gi|430751377|ref|YP_007214285.1| hypothetical protein Theco_3232 [Thermobacillus composti KWC4]
 gi|430735342|gb|AGA59287.1| hypothetical protein Theco_3232 [Thermobacillus composti KWC4]
          Length = 672

 Score = 40.4 bits (93), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 70/286 (24%), Positives = 116/286 (40%), Gaps = 39/286 (13%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGV 456
           E+C +  ++  S+ + +   +  Y D  ERAL N  L+   Q G       Y+ PL    
Sbjct: 341 ESCASIGLIMFSKRMLQIEAKGEYGDVMERALYNTELAGMSQDGKR---YFYVNPLEVWP 397

Query: 457 SKARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYF--EEEGNV-PGLYI---- 503
              RS  G          W    CC        + LG  +Y    E G V   LYI    
Sbjct: 398 EACRSNPGKHHVKPVRQRWFGCACCPPNIARLIASLGGYVYDVDAESGIVYTHLYIGGEA 457

Query: 504 -IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS--LNLRMPVWT 560
            +           G VV+ Q+ +    WD  + +T+T     E G L++  L LR+P W 
Sbjct: 458 RLNVGKEGGGHDGGTVVVRQETN--YPWDGAVMLTVT----PEAGGLTAFTLALRLPGW- 510

Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
            S  ++ ++NG+ +       +      W   D + ++L +++R  A + +    A   A
Sbjct: 511 -SRTSEIAVNGERIAPEVRDGYAYICRDWQPGDTVELKLDMTIRLLAARPEVRADAGRVA 569

Query: 621 ILFGPYLLAGHTSGEWDIKTGTARSLSALI----SPIPPSFNAQLV 662
           I  GP +    ++   D   G    LSAL     +P+  +++AQL+
Sbjct: 570 IQRGPLVYCLESA---DNPGG---PLSALAIDTQTPLTATYDAQLL 609


>gi|53711660|ref|YP_097652.1| hypothetical protein BF0369 [Bacteroides fragilis YCH46]
 gi|52214525|dbj|BAD47118.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
          Length = 689

 Score = 40.0 bits (92), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 72/318 (22%), Positives = 116/318 (36%), Gaps = 44/318 (13%)

Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--ETCTTYNMLKVS 410
           P Y    +   D +     + TGG  A  F  D K   D     +   ETC        S
Sbjct: 341 PEYIAAVSRLWDNMIGKRMFITGGVGAVHF--DEKFGPDYFLPTDAYLETCAAVGAGFFS 398

Query: 411 RHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
           + +   T +  Y D  ER L N VL+     GT+     Y  PL    S   +  GW   
Sbjct: 399 QRMNELTGDAKYMDELERTLYNNVLTGISLSGTQ---YTYQNPLN---SAKHARWGW--- 449

Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW---KSGHVVLNQKVD 525
            +   CC    ++  S +   IY ++  N+   Y+  +I S  +        + L QK  
Sbjct: 450 -HDCPCCPPMFLKMMSAMPGFIYSQKGDNI---YVNLFIGSETELSLSDQSRIRLTQKTG 505

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-------------YSNGAQASLNGQ 572
               WD  + MT+    + E  +   L +R+P W                +     +NG+
Sbjct: 506 --YPWDGSVVMTV----EPEKEKTFLLKVRIPGWAQGVENPYDLYRSEVKSAVNLKVNGK 559

Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAG 630
           ++ +     +     +W   D++ + LP+  R     +   +  +  AI  GP  Y L G
Sbjct: 560 SIAMKIFKGYAEIQRKWKKGDRVELTLPVQPRLVTANEAVADLQNKVAIAAGPFVYCLEG 619

Query: 631 -HTSGEWDIKTGTARSLS 647
               G  D++  T   LS
Sbjct: 620 CDNEGVADLRLDTRAPLS 637


>gi|336407845|ref|ZP_08588341.1| hypothetical protein HMPREF1018_00356 [Bacteroides sp. 2_1_56FAA]
 gi|335944924|gb|EGN06741.1| hypothetical protein HMPREF1018_00356 [Bacteroides sp. 2_1_56FAA]
          Length = 695

 Score = 40.0 bits (92), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 72/318 (22%), Positives = 116/318 (36%), Gaps = 44/318 (13%)

Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--ETCTTYNMLKVS 410
           P Y    +   D +     + TGG  A  F  D K   D     +   ETC        S
Sbjct: 347 PEYIAAVSRLWDNMIGKRMFITGGVGAVHF--DEKFGPDYFLPTDAYLETCAAVGAGFFS 404

Query: 411 RHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
           + +   T +  Y D  ER L N VL+     GT+     Y  PL    S   +  GW   
Sbjct: 405 QRMNELTGDAKYMDELERTLYNNVLTGISLSGTQ---YTYQNPLN---SAKHARWGW--- 455

Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW---KSGHVVLNQKVD 525
            +   CC    ++  S +   IY ++  N+   Y+  +I S  +        + L QK  
Sbjct: 456 -HDCPCCPPMFLKMMSAMPGFIYSQKGDNI---YVNLFIGSETELSLSDQSRIRLTQKTG 511

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-------------YSNGAQASLNGQ 572
               WD  + MT+    + E  +   L +R+P W                +     +NG+
Sbjct: 512 --YPWDGSVVMTV----EPEKEKTFLLKVRIPGWAQGVENPYDLYRSEVKSAVNLKVNGK 565

Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAG 630
           ++ +     +     +W   D++ + LP+  R     +   +  +  AI  GP  Y L G
Sbjct: 566 SIAMKIFKGYAEIQRKWKKGDRVELTLPVQPRLVTANEAVADLQNKVAIAAGPFVYCLEG 625

Query: 631 -HTSGEWDIKTGTARSLS 647
               G  D++  T   LS
Sbjct: 626 CDNEGVADLRLDTRAPLS 643


>gi|423230666|ref|ZP_17217070.1| hypothetical protein HMPREF1063_02890 [Bacteroides dorei
           CL02T00C15]
 gi|423244377|ref|ZP_17225452.1| hypothetical protein HMPREF1064_01658 [Bacteroides dorei
           CL02T12C06]
 gi|392630316|gb|EIY24309.1| hypothetical protein HMPREF1063_02890 [Bacteroides dorei
           CL02T00C15]
 gi|392641951|gb|EIY35723.1| hypothetical protein HMPREF1064_01658 [Bacteroides dorei
           CL02T12C06]
          Length = 801

 Score = 40.0 bits (92), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 62/275 (22%), Positives = 106/275 (38%), Gaps = 34/275 (12%)

Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYN 405
           +TGD  Y        D +     Y TGG   TS  E +     L +   S   ETC    
Sbjct: 287 LTGDSAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM--SAYCETCAAIG 344

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
            + V+  LF    E  Y D  ER L NG++S     + G   Y  PL   + + +    +
Sbjct: 345 NVYVNYRLFLLHGEAKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL-ESIGQHQRQPWF 402

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
           G       CC          L   +Y  ++ +V   Y+  ++S++ + K     ++ +  
Sbjct: 403 GCA-----CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFMSNTSNLKVEGKAVSLEQT 454

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQAS----LN 570
               W+  + + +   +K   GQ  ++ +R+P W           TYS+G + S    +N
Sbjct: 455 THYPWNGEVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTYSDGKRLSYTVKVN 510

Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           G+ +       +     RW   DK+ +   +  RT
Sbjct: 511 GEPVQSELKDGYFCIDRRWKKGDKVAVHFDMEPRT 545


>gi|373462448|ref|ZP_09554170.1| hypothetical protein HMPREF9944_02434 [Prevotella maculosa OT 289]
 gi|371948225|gb|EHO66109.1| hypothetical protein HMPREF9944_02434 [Prevotella maculosa OT 289]
          Length = 932

 Score = 40.0 bits (92), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 98/235 (41%), Gaps = 25/235 (10%)

Query: 399 ETCTTYNMLKVS-RHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
           ETC +   + ++ R L  W  +  YA   E++L N V + Q   E G + Y   +     
Sbjct: 648 ETCGSVFWIDLNHRFLQLWPTKERYASEIEKSLYNVVFAAQ--GENGCIRYFNQVNDAKY 705

Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
            A         +N+  CC       +  L   +Y        G+++  + +S  D+K   
Sbjct: 706 PAMC-------YNT--CCEIQATALYGMLPQYVYSVAPD---GVFVNLFSASDIDFK--- 750

Query: 518 VVLNQKVDPIVSWD-PYL-RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
            V +Q V   +    PY  ++ L  S+ + V     + +R+P W    G    +N + + 
Sbjct: 751 -VKDQPVKLTMKTQFPYSNQVALRVSADRPV--TMKVRVRIPEWA-KGGVVLRVNDRKVK 806

Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEA-IQDDRPEYASIQAILFGPYLLA 629
              PG+++     W  ND++T  LP++   E  I   R   A+  A  +GP L+A
Sbjct: 807 TGMPGSYVEIDRTWKDNDEITWSLPMTWSYEKYIGATRIAGATRYAFFYGPMLMA 861


>gi|372221612|ref|ZP_09500033.1| hypothetical protein MzeaS_04798 [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 664

 Score = 40.0 bits (92), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 95/425 (22%), Positives = 153/425 (36%), Gaps = 81/425 (19%)

Query: 251 ALKMATWMVEYF---YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLL 307
           ALK A  MV+ F    N++Q V     +E         TG     L +LY IT +  +  
Sbjct: 211 ALKNANLMVKTFGAEQNQIQTVPGHQIIE---------TG-----LLKLYQITGEVAYKD 256

Query: 308 LAHLFDKPCFLGFLALQADY-LSHFHANTHIPI-----VIGSQMRY-----------EVT 350
           LA  F     L    +  D  L   ++  H+P+     V+G  +R             +T
Sbjct: 257 LAKFF-----LDNRGVAKDRKLFGAYSQDHLPVTQQKEVVGHAVRAVYMYAAMTDIAAIT 311

Query: 351 GDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--------ETC 401
            D  Y + + T + ++V     Y TGG  A       K   +  G+  E        ETC
Sbjct: 312 KDSTYLRAVDTLWQNMVE-KKMYITGGIGA-------KHEGEAFGANYELPNITAYNETC 363

Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
                +  +  L     +  Y D  ER L NG++S     +     Y  PL       + 
Sbjct: 364 AAIGDVYWNHRLHNLKGKAHYFDIIERTLYNGLIS-GISLDGKQFFYPNPL-ESDGLYQF 421

Query: 462 THGWGTKFNSFWC-CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
             G  T+ + F C C  T +  F      + + +  N   L++  Y S+S         L
Sbjct: 422 NQGACTRKDWFDCSCCPTNLIRFIPSIPGLLYSKGAN--ELFVNLYASNSATINLKSTEL 479

Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-----------YSNG----- 564
           N   +    WD  +R T+  +          ++ R+P W            Y N      
Sbjct: 480 NVVQETNYPWDGTIRFTVNTAKPYTF----PIHFRVPGWAQNQVVPSGLYQYENPNPSFP 535

Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
            +  +NG+   +     +LS   RW+ ND + I+ P+ ++         E     A+  G
Sbjct: 536 IKIKVNGKATAIDSKEGYLSLDRRWANNDVIEIEFPMDVKLVKTNTRVVENRGKVALERG 595

Query: 625 PYLLA 629
           P + A
Sbjct: 596 PIVYA 600


>gi|423240707|ref|ZP_17221821.1| hypothetical protein HMPREF1065_02444 [Bacteroides dorei
           CL03T12C01]
 gi|392643669|gb|EIY37418.1| hypothetical protein HMPREF1065_02444 [Bacteroides dorei
           CL03T12C01]
          Length = 801

 Score = 40.0 bits (92), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 62/275 (22%), Positives = 106/275 (38%), Gaps = 34/275 (12%)

Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYN 405
           +TGD  Y        D +     Y TGG   TS  E +     L +   S   ETC    
Sbjct: 287 LTGDSAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM--SAYCETCAAIG 344

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
            + V+  LF    E  Y D  ER L NG++S     + G   Y  PL   + + +    +
Sbjct: 345 NVYVNYRLFLLHGEAKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL-ESIGQHQRQPWF 402

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
           G       CC          L   +Y  ++ +V   Y+  ++S++ + K     ++ +  
Sbjct: 403 GCA-----CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFMSNTSNLKVEGKAVSLEQT 454

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQAS----LN 570
               W+  + + +   +K   GQ  ++ +R+P W           TYS+G + S    +N
Sbjct: 455 THYPWNGEVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTYSDGKRLSYTVKVN 510

Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           G+ +       +     RW   DK+ +   +  RT
Sbjct: 511 GEPVQSELKDGYFCIDRRWKKGDKVAVHFDMEPRT 545


>gi|237711367|ref|ZP_04541848.1| six-hairpin glycosidase [Bacteroides sp. 9_1_42FAA]
 gi|229454062|gb|EEO59783.1| six-hairpin glycosidase [Bacteroides sp. 9_1_42FAA]
          Length = 781

 Score = 40.0 bits (92), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 62/275 (22%), Positives = 106/275 (38%), Gaps = 34/275 (12%)

Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYN 405
           +TGD  Y        D +     Y TGG   TS  E +     L +   S   ETC    
Sbjct: 267 LTGDSAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM--SAYCETCAAIG 324

Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
            + V+  LF    E  Y D  ER L NG++S     + G   Y  PL   + + +    +
Sbjct: 325 NVYVNYRLFLLHGEAKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL-ESIGQHQRQPWF 382

Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
           G       CC          L   +Y  ++ +V   Y+  ++S++ + K     ++ +  
Sbjct: 383 GCA-----CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFMSNTSNLKVEGKAVSLEQT 434

Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQAS----LN 570
               W+  + + +   +K   GQ  ++ +R+P W           TYS+G + S    +N
Sbjct: 435 THYPWNGEVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTYSDGKRLSYTVKVN 490

Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           G+ +       +     RW   DK+ +   +  RT
Sbjct: 491 GEPVQSELKDGYFCIDRRWKKGDKVAVHFDMEPRT 525


>gi|222082345|ref|YP_002541710.1| hypothetical protein Arad_8964 [Agrobacterium radiobacter K84]
 gi|221727024|gb|ACM30113.1| conserved hypothetical protein [Agrobacterium radiobacter K84]
          Length = 643

 Score = 40.0 bits (92), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 80/376 (21%), Positives = 143/376 (38%), Gaps = 52/376 (13%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL--SHFHANTHIPI 339
            L +L  +T + K+L LA  F      +P F    AL+     A ++  ++ +   H P+
Sbjct: 197 ALVKLARVTGEKKYLDLAKYFVDERGQEPHFFTDEALRDGRDPAKFVQKTYEYNQAHQPV 256

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 257 REQTKVVGHAVRAMYLYSGMADIATEYNDDSLTSALETLWDDLTT-KQMYVTGGIGPAAS 315

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   S   ETC +  ++  +  +        YAD  E+AL NG ++   
Sbjct: 316 NEGFTDYYDLPNE--SAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GL 372

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
             +     Y  PL  G    R T      ++   CC        + +G  +Y   +  + 
Sbjct: 373 SLDGKTFFYENPLESGGKHHRWT------WHHCPCCPPNIARLLASIGSYMYAAADNEI- 425

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            +++     +     SG V +    +    WD  +R    F    +     +L+LR+P W
Sbjct: 426 AVHLYGESKARVPLASG-VTVELAQETRYPWDGAIR----FEVNPDRNARFALSLRIPEW 480

Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
             ++GA  ++NG  + L       +      W   D++ + +PL  RT        + A 
Sbjct: 481 --ADGATLAVNGVPVDLSAVTIDGYARIERDWQAGDRVDLNIPLIPRTLFANPKVRQDAG 538

Query: 618 IQAILFGPYLLAGHTS 633
             A++ GP +    T+
Sbjct: 539 RAALMRGPLVYCVETT 554


>gi|256424326|ref|YP_003124979.1| hypothetical protein Cpin_5347 [Chitinophaga pinensis DSM 2588]
 gi|256039234|gb|ACU62778.1| protein of unknown function DUF1680 [Chitinophaga pinensis DSM
           2588]
          Length = 686

 Score = 40.0 bits (92), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 69/344 (20%), Positives = 124/344 (36%), Gaps = 46/344 (13%)

Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA-DTLGSENE--ETCTTYNMLK 408
           +P Y      + D +     + TGG  A     D ++   D    E+   ETC +     
Sbjct: 341 NPAYFTTAVRYWDNMTGKRMFVTGGEGAIA---DQEKFGPDYFLPESAYLETCASIGAAF 397

Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPLGRGVSKARST 462
            S+ + +   +  Y D +ER + N +LS       GV +      Y  PL   ++K    
Sbjct: 398 FSQRMNQLLADGKYMDEFERVMYNNLLS-------GVSLSGDHYFYENPL---IAKDHKR 447

Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
             W    +S  CC    ++  + +   IY  +     GLY+  +ISS +    G   ++ 
Sbjct: 448 WAW----HSCPCCPPMILKMVAAIPAYIYAADN---TGLYVNLFISSEYKGAVGDKKVSL 500

Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-------------YSNGAQASL 569
           K      W    ++ +  + + +     ++++R+P W               +      +
Sbjct: 501 KQSTQYPWKGTTQIAVNPAEEGDF----AVSVRIPGWAQGRENYFGLYTSQVTTPVSLRV 556

Query: 570 NGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
           NG  +P+ P   ++     W   DK+ + LP+  R     D          I  GP +  
Sbjct: 557 NGAAVPVQPENGYVRIKRHWKKGDKIILALPMQPRLIFPHDSIRTVQGKATIAAGPVIYG 616

Query: 630 GHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTF 673
                   + + T    +AL     P F   +   T + G STF
Sbjct: 617 LEGIDNSKLDSLTISRNTALQLAFKPGFLGGVNVVTGQLGGSTF 660


>gi|398379890|ref|ZP_10538009.1| hypothetical protein PMI03_03641 [Rhizobium sp. AP16]
 gi|397721906|gb|EJK82452.1| hypothetical protein PMI03_03641 [Rhizobium sp. AP16]
          Length = 643

 Score = 39.7 bits (91), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 80/376 (21%), Positives = 143/376 (38%), Gaps = 52/376 (13%)

Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL--SHFHANTHIPI 339
            L +L  +T + K+L LA  F      +P F    AL+     A ++  ++ +   H P+
Sbjct: 197 ALVKLARVTGEKKYLDLAKYFVDERGQEPHFFTDEALRDGRDPAKFVQKTYEYNQAHQPV 256

Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
                V+G  +R             E   D L   + T + D+      Y TGG    ++
Sbjct: 257 REQTKVVGHAVRAMYLYSGMADIATEYNDDSLTSALETLWDDLTT-KQMYVTGGIGPAAS 315

Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
            E + D   L +   S   ETC +  ++  +  +        YAD  E+AL NG ++   
Sbjct: 316 NEGFTDYYDLPNE--SAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GL 372

Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
             +     Y  PL  G    R T      ++   CC        + +G  +Y   +  + 
Sbjct: 373 SLDGKTFFYENPLESGGKHHRWT------WHHCPCCPPNIARLLASIGSYMYAAADNEI- 425

Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
            +++     +     SG V +    +    WD  +R    F    +     +L+LR+P W
Sbjct: 426 AVHLYGESKARVPLASG-VTVELAQETRYPWDGAIR----FEVNPDRNARFALSLRIPEW 480

Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
             ++GA  ++NG  + L       +      W   D++ + +PL  RT        + A 
Sbjct: 481 --ADGATLAVNGVPVDLSAVTIDGYARIERDWQAGDRVDLNIPLIPRTLFANPKVRQDAG 538

Query: 618 IQAILFGPYLLAGHTS 633
             A++ GP +    T+
Sbjct: 539 RAALMRGPLVYCVETT 554


>gi|403252790|ref|ZP_10919097.1| hypothetical protein EMP_03410 [Thermotoga sp. EMP]
 gi|402811900|gb|EJX26382.1| hypothetical protein EMP_03410 [Thermotoga sp. EMP]
          Length = 622

 Score = 39.7 bits (91), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 67/332 (20%), Positives = 126/332 (37%), Gaps = 46/332 (13%)

Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA--DYLSHFHANTHIPIVIGSQMR---- 346
           L  LY  T D K+L LA  F      G    +   +YL        +  + G  +R    
Sbjct: 196 LVELYRETGDRKYLDLAKYFIYTRGKGLTGFKKNPEYLIDHKPFVELEEITGHAVRALYL 255

Query: 347 -------YEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
                  Y  TGD  +++ +   + + V     Y TGG  +R  W       ++ G E E
Sbjct: 256 CSGATDLYLETGDEKIWQALNKLWENFVT-KKMYITGGAGSRHDW-------ESFGEEYE 307

Query: 399 --------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
                   E+C +      +  +   T +  +AD  E+ L NG+LS     +     Y  
Sbjct: 308 LPNRRSYAESCASIANFMWNFRMLLATGDGKFADVMEQVLYNGLLS-GISLDGKHYFYFN 366

Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           PL   + + R       K+    CC        +     +Y   +  V  +++ +  +  
Sbjct: 367 PL-EDLGRTRRQ-----KWFDCACCPPNLARFIASFPGYMYTTSDDGVQ-VHLYEKSTVR 419

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
            D+K   V + Q+ D    W       +TF+ + ++ +  S++LR+P W  ++     ++
Sbjct: 420 LDFKGSVVEIEQETD--YPWSG----EVTFTVEADIEEPFSISLRIPSW--ADDFVLRVD 471

Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
           G+ +   P   ++   + W     + + LP+ 
Sbjct: 472 GKTVIAKPQNGYVKLNQSWKGKHTVELSLPMK 503


>gi|383763276|ref|YP_005442258.1| hypothetical protein CLDAP_23210 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381383544|dbj|BAM00361.1| hypothetical protein CLDAP_23210 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 636

 Score = 39.7 bits (91), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 45/177 (25%), Positives = 71/177 (40%), Gaps = 23/177 (12%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGV 456
           ETC    ++  +  L ++  E  YAD  E+ L NG +S    RG       Y+ PL    
Sbjct: 329 ETCAAIALILWNHRLLQFAGEGKYADVMEQTLYNGFISGVSLRGDS---FFYVNPLASNG 385

Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
           S  R      T +    CC        + LG+ +Y   EG   GL++  Y  +S      
Sbjct: 386 SHHR------TPWFECPCCPPNVGRILASLGNYLYSTGEG---GLWVHFYAQNSARTTVD 436

Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-----YSNGAQAS 568
              +  +++    WD  +++ +T +  Q      +L LR+P W        NGA A 
Sbjct: 437 GTEVGLRLESRYPWDGAVKLMITPAQPQRF----TLYLRIPGWCDRWSLRVNGAAAD 489


>gi|440699526|ref|ZP_20881821.1| hypothetical protein STRTUCAR8_01370 [Streptomyces turgidiscabies
           Car8]
 gi|440277899|gb|ELP65960.1| hypothetical protein STRTUCAR8_01370 [Streptomyces turgidiscabies
           Car8]
          Length = 654

 Score = 39.7 bits (91), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 103/479 (21%), Positives = 174/479 (36%), Gaps = 84/479 (17%)

Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAP 228
           +L A+    A T + T+  ++  +V  ++  Q +   GYL  +  +L       +P W  
Sbjct: 93  WLEAACWQLADTPDETLATEVEAIVELIAAAQRE--DGYLQTY-YQLGGGIPWTEPGWGH 149

Query: 229 --YYTIHKILAGLLDQYVLADN---AQALKMATWMVEYFY--NRVQKVITMYSVERHWYS 281
             Y   H I A +        +   A A ++A  +   F    +V  V     VE     
Sbjct: 150 ELYCAGHLIQAAVAHHRATGSDRLLAVARRLADHIDSVFGPGKQVDTVCGHPEVE----- 204

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA-----NTH 336
                      L  L+  T + ++L LA  F +    G L+  AD              H
Sbjct: 205 ---------TALVELHRTTDEKRYLDLARYFLERRGHGTLSSGADRGHDRDPGPEYWQDH 255

Query: 337 IPI-----VIGSQMRYEV-----------TGD-PLYKLIGTFFMDIVNASHSYATGGTSA 379
            P+     V G  +R              TGD  L   +   + D+V  + +Y TG   +
Sbjct: 256 TPVRAADEVTGHAVRQLYLLAGAADLAAETGDTELRTALERLWRDMVT-TKTYLTGAVGS 314

Query: 380 REFWWDPKRLADTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
           R  W       +  G  +E        ETC     +  S  +   T E  Y+D  ER L 
Sbjct: 315 RHDW-------EAFGDAHELPADRAYAETCAAIASVHFSWRMALLTGEARYSDLVERTLF 367

Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIESFSK 485
           NG L+   G +    +Y+ PL R   +ARS    G  T   + W    CC    +   + 
Sbjct: 368 NGFLA-GAGLDGRTWLYVNPLHR---RARSHERPGDQTAHRTPWFRCACCPPNVMRLLAG 423

Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
           L    ++    +  GL + QY +  +    G   L  +V     W+  + +T+    +  
Sbjct: 424 L---PHYLATADDSGLQLHQYATGVY----GGDGLTVRVTTEYPWEGTVTVTV---DEAP 473

Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
                +L+LR+P W   +    ++NG  +       +L  T  ++  D + + L +  R
Sbjct: 474 TALPRTLSLRLPAWCADH--TLTVNGTTVEDGADSGWLRITRAFTPGDTVRLDLAMPAR 530


>gi|290962053|ref|YP_003493235.1| hypothetical protein SCAB_77341 [Streptomyces scabiei 87.22]
 gi|260651579|emb|CBG74703.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
          Length = 654

 Score = 39.7 bits (91), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 103/479 (21%), Positives = 174/479 (36%), Gaps = 84/479 (17%)

Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAP 228
           +L A+    A T + T+  ++  +V  ++  Q +   GYL  +  +L       +P W  
Sbjct: 93  WLEAACWQLADTPDETLATEVEAIVELIAAAQRE--DGYLQTY-YQLGGGIPWTEPGWGH 149

Query: 229 --YYTIHKILAGLLDQYVLADN---AQALKMATWMVEYFY--NRVQKVITMYSVERHWYS 281
             Y   H I A +        +   A A ++A  +   F    +V  V     VE     
Sbjct: 150 ELYCAGHLIQAAVAHHRATGSDRLLAVARRLADHIDSVFGPGKQVDTVCGHPEVE----- 204

Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA-----NTH 336
                      L  L+  T + ++L LA  F +    G L+  AD              H
Sbjct: 205 ---------TALVELHRTTDEKRYLDLARYFLERRGHGTLSSGADRGHDRDPGPEYWQDH 255

Query: 337 IPI-----VIGSQMRYEV-----------TGD-PLYKLIGTFFMDIVNASHSYATGGTSA 379
            P+     V G  +R              TGD  L   +   + D+V  + +Y TG   +
Sbjct: 256 TPVRAADEVTGHAVRQLYLLAGAADLAAETGDTELRTALERLWRDMVT-TKTYLTGAVGS 314

Query: 380 REFWWDPKRLADTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
           R  W       +  G  +E        ETC     +  S  +   T E  Y+D  ER L 
Sbjct: 315 RHDW-------EAFGDAHELPADRAYAETCAAIASVHFSWRMALLTGEARYSDLVERTLF 367

Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIESFSK 485
           NG L+   G +    +Y+ PL R   +ARS    G  T   + W    CC    +   + 
Sbjct: 368 NGFLA-GAGLDGRTWLYVNPLHR---RARSHERPGDQTAHRTPWFRCACCPPNVMRLLAG 423

Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
           L    ++    +  GL + QY +  +    G   L  +V     W+  + +T+    +  
Sbjct: 424 L---PHYLATADDSGLQLHQYATGVY----GGDGLTVRVTTEYPWEGTVTVTV---DEAP 473

Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
                +L+LR+P W   +    ++NG  +       +L  T  ++  D + + L +  R
Sbjct: 474 TALPRTLSLRLPAWCADH--TLTVNGTTVEDGADSGWLRITRAFTPGDTVRLDLAMPAR 530


>gi|218508305|ref|ZP_03506183.1| hypothetical protein RetlB5_12284 [Rhizobium etli Brasil 5]
          Length = 177

 Score = 39.7 bits (91), Expect = 7.6,   Method: Composition-based stats.
 Identities = 33/137 (24%), Positives = 62/137 (45%), Gaps = 23/137 (16%)

Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAI 608
           +L+LR+P W  ++GA  S+NG+ L L       +     +W   D++ + LPLSLR +  
Sbjct: 10  ALSLRIPDW--ADGATLSVNGEKLDLGAATRDGYARIDRQWVDGDRVDLFLPLSLRPQYA 67

Query: 609 QDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQES 668
                + A   A++ GP +    T       T   + L+A++ P             + S
Sbjct: 68  NPKVRQDAGRVALMRGPLVYCVET-------TDNGQDLNAIVLP------------RELS 108

Query: 669 GNSTFVMSNSNQSITME 685
              T V+++ N ++ ++
Sbjct: 109 AAETVVLNDLNDAVALD 125


>gi|326789389|ref|YP_004307210.1| hypothetical protein Clole_0260 [Clostridium lentocellum DSM 5427]
 gi|326540153|gb|ADZ82012.1| protein of unknown function DUF1680 [Clostridium lentocellum DSM
           5427]
          Length = 638

 Score = 39.3 bits (90), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 51/214 (23%), Positives = 87/214 (40%), Gaps = 20/214 (9%)

Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR--GTEPGVMIYMLPLGR-- 454
           ETC    ++  +R +    K   YAD  ERAL N VL+  +  GT+     Y+ PL    
Sbjct: 328 ETCAAIGLIFFARKMIDLEKNNEYADIMERALYNCVLAGMQLDGTK---FFYVNPLESIP 384

Query: 455 GVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
           G+S    TH         W    CC        S +G    + EEGN   +Y   +I  +
Sbjct: 385 GISGEAVTHRHALPQRPKWFTCACCPPNVARLLSSMG-RYAWSEEGNT--VYSHLFIGGT 441

Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
            D       L+ K+    S+    ++   F    E   L +L +R+P+W  S      L+
Sbjct: 442 LDLTD---TLHGKIKVETSYPYGNQVRYRFEPNDESMDL-TLAIRLPLW--SENTSIMLD 495

Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
            +         ++  T+ ++  D +T+   ++++
Sbjct: 496 EKKANYEIRNGYVYLTKAFTQEDMVTVTFDMNVK 529


>gi|218675303|ref|ZP_03524972.1| hypothetical protein RetlG_29862 [Rhizobium etli GR56]
          Length = 175

 Score = 39.3 bits (90), Expect = 8.4,   Method: Composition-based stats.
 Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 11/104 (10%)

Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
           +L+LR+P W  + GA  S+NG  L L       +     +W+  D++ + LPLSLR +  
Sbjct: 8   ALSLRIPDW--AEGATLSVNGTMLDLSTHIRDGYARIDRQWADGDRVALHLPLSLRPQYA 65

Query: 609 QDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
                + A   A++ GP +    T       T     L+A++ P
Sbjct: 66  NPKVRQDAGRVALMRGPLVYCVET-------TDNGEDLNAIVLP 102


>gi|410725713|ref|ZP_11364076.1| hypothetical protein A370_02153 [Clostridium sp. Maddingley
           MBC34-26]
 gi|410601724|gb|EKQ56224.1| hypothetical protein A370_02153 [Clostridium sp. Maddingley
           MBC34-26]
          Length = 648

 Score = 39.3 bits (90), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 61/286 (21%), Positives = 112/286 (39%), Gaps = 31/286 (10%)

Query: 364 DIVNASHSYATGGTSARE----FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
           D +     Y TGG  + +    F +D     DT+ +E   TC +  ++  +R +   + +
Sbjct: 297 DNMTKKRMYITGGIGSSQYGEAFTYDYDLPNDTIYAE---TCASIGLVFFARRMLEISPK 353

Query: 420 IAYADYYERALTNGVLSIQR--GTEPGVMIYMLPLGRGVSKARSTHGWG------TKFNS 471
             YAD  E+AL NGV+S     GT+     Y+ PL      +   H          K+  
Sbjct: 354 SKYADIMEKALYNGVISGMSLDGTK---FFYVNPLEVVPESSEKDHLRAHVKVERQKWFG 410

Query: 472 FWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSW 530
             CC        + +G   Y  +E      LY+   I+++    + +V    KV+    W
Sbjct: 411 CACCPPNLARLLASIGSYAYSIKENTMFMHLYMGGEITTNL--SNNNVAF--KVETNYPW 466

Query: 531 DPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWS 590
           D  +++TL    K+E+     + +R+P W         +NG+++       +      W 
Sbjct: 467 DENVKITLNI--KEEIN--FEVAIRIPEWC--GNYNIKVNGEDVEYKIIYGYAYIDRVWK 520

Query: 591 YNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSG 634
             D + +   + +   +   +  E     A++ GP  Y L    +G
Sbjct: 521 NADAIDVDFKMPVEVMSANVNVRENIGKVAVMRGPIVYCLEEEDNG 566


>gi|212715353|ref|ZP_03323481.1| hypothetical protein BIFCAT_00247 [Bifidobacterium catenulatum DSM
           16992 = JCM 1194]
 gi|212661728|gb|EEB22303.1| hypothetical protein BIFCAT_00247 [Bifidobacterium catenulatum DSM
           16992 = JCM 1194]
          Length = 727

 Score = 39.3 bits (90), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 73/308 (23%), Positives = 115/308 (37%), Gaps = 28/308 (9%)

Query: 349 VTGDP-LYKLIGTFFMDIVNASHSYATGGTSA----REFWWDPKRLADTLGSENEETCTT 403
           +TG+  L +   T + +IV+    Y TGG  A      F +D     DT  SE   +C  
Sbjct: 323 ITGEAALLESCETLWRNIVD-RKLYITGGIGATHMGEAFSFDYDLPNDTAYSE---SCAA 378

Query: 404 YNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA---- 459
             +   +R +     +  YAD  E AL N  L+     +     Y+ PL   V +A    
Sbjct: 379 IALAFFARRMLEIQPKSEYADVMESALYNTTLA-GMALDGKSFFYVNPL-EVVPEACHRD 436

Query: 460 -RSTHGWGTKFNSFWC-CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
            R  H    +   F C C    I    +      +    +   LY+  Y+      K G 
Sbjct: 437 ERKFHVKPVRQKWFGCACCPPNIARMVESVQQYAYTVADDASTLYVHLYMGGVVSAKLGG 496

Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS---SLNLRMPVWTYSNGAQASLNG--- 571
             ++ +V   + W+    +T+T  S  E GQ+    +L LR+P W     A  S++    
Sbjct: 497 SDVSLEVRAGMPWNGAGAITVTLPSSDE-GQVPESFALALRLPAWAGGESAADSIHATGE 555

Query: 572 --QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
               +       +L  T  W   D +    P+ +R  A      E A   A + GP  Y 
Sbjct: 556 KDSRITRTTRDGYLYLTGTWRDGDVIDFDFPMPVRMIAANPLVREDAGKVAFIRGPLAYC 615

Query: 628 LAGHTSGE 635
             G  +G+
Sbjct: 616 AEGTDNGD 623


>gi|150003691|ref|YP_001298435.1| hypothetical protein BVU_1122 [Bacteroides vulgatus ATCC 8482]
 gi|149932115|gb|ABR38813.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
          Length = 801

 Score = 39.3 bits (90), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 78/348 (22%), Positives = 131/348 (37%), Gaps = 59/348 (16%)

Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
           L +LY +T   K+L  A  F D+    G+     +Y     +  H P+V     +G  +R
Sbjct: 222 LAKLYLVTGQQKYLDQAKFFLDQ---RGYTTRTDEY-----SQAHKPVVEQDEAVGHAVR 273

Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADT 392
                        +TGD  Y        D +     Y TGG   TS  E +     L + 
Sbjct: 274 AAYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM 333

Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
             S   ETC     + V+  LF    E  Y D  ER L NG++S     + G   Y  PL
Sbjct: 334 --SAYCETCAAIGNVYVNYRLFLLHGEAKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL 390

Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
              + + +    +G       CC          L   +Y  +  +V   Y+  ++S++ +
Sbjct: 391 -ESIGQHQRQPWFGCA-----CCPSNICRFIPSLPGYVYAVKGKDV---YVNLFMSNTSN 441

Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TY 561
            K     ++ +      W+  + + +   +K   GQ  ++ +R+P W           TY
Sbjct: 442 LKVEGKAVSLEQATHYPWNGDVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPCDLYTY 497

Query: 562 SNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
           S+G + S    +NG+ +       +     RW   DK+ +   +  RT
Sbjct: 498 SDGKRLSYTVKVNGEPVQSELKDGYFCIDRRWKKGDKVAVHFDMEPRT 545


>gi|29346413|ref|NP_809916.1| hypothetical protein BT_1003 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29338309|gb|AAO76110.1| Six-hairpin glycosidase [Bacteroides thetaiotaomicron VPI-5482]
          Length = 698

 Score = 39.3 bits (90), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 81/347 (23%), Positives = 140/347 (40%), Gaps = 47/347 (13%)

Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
           +  +Y  T +P++L L+ +L D    +          +  +  Y +  HA     +  G 
Sbjct: 248 VVEMYRATGNPRYLELSKNLIDIRGMVESGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307

Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
              Y  TG+  L K + + + DIV     Y TG       GTS     ++P   +++  +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366

Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
            G        + + ETC     +  +  +   T +  YA+  E  L N VLS     +  
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYAELVETCLYNSVLS-GISLDGK 425

Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
              Y  PL R  +    T  W    T++ S +CC    + +  +  +  Y    EG    
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484

Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
           LY    +++  +WK  G + L Q+ D    W+  +R+TL     ++ G   SL  R+P W
Sbjct: 485 LYGANTLTT--NWKDKGELALVQETD--YPWEGNVRVTLN-KVPRKAGAF-SLFFRIPEW 538

Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
                A  ++NGQ + +    N  +   R W   D  +L + +P+ L
Sbjct: 539 --CGKAALTVNGQPVSMNAKANTYAEVNRTWKKGDVVELVMDMPVCL 583


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.134    0.413 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,037,255,670
Number of Sequences: 23463169
Number of extensions: 600505259
Number of successful extensions: 1204287
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 500
Number of HSP's successfully gapped in prelim test: 514
Number of HSP's that attempted gapping in prelim test: 1199978
Number of HSP's gapped (non-prelim): 1589
length of query: 857
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 705
effective length of database: 8,792,793,679
effective search space: 6198919543695
effective search space used: 6198919543695
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 82 (36.2 bits)