BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 035980
(857 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225435510|ref|XP_002285548.1| PREDICTED: uncharacterized protein LOC100246702 [Vitis vinifera]
Length = 864
Score = 1267 bits (3279), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 616/858 (71%), Positives = 709/858 (82%), Gaps = 9/858 (1%)
Query: 5 FVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
V+F F G LGK+CTN + SH+FRYEL S N++WK E+ H+HL TDDSAWS
Sbjct: 11 IVVFAFVLCGCVLGKECTNVPTQLSSHSFRYELLASNNESWKAEMFQHYHLIHTDDSAWS 70
Query: 63 SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
+L+P K+L ++ DE SWA++YR +KN G + NFLKE+SLHDV LD S+ RAQQTN
Sbjct: 71 NLLPRKLLREE-DEFSWAMMYRNMKNYDGSN--SNFLKEMSLHDVRLDSDSLHGRAQQTN 127
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
L+YLL+LDVD LVWSFRKTA L TPG YGGWE P ELRGHFVGHY+SASAQMWASTHN
Sbjct: 128 LDYLLILDVDRLVWSFRKTAGLSTPGLPYGGWEAPNVELRGHFVGHYMSASAQMWASTHN 187
Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
T+KEKMS VV +L+ CQ K+GTGYLSAFP+ELFD FEA+KPVWAPYYTIHKILAGLLDQ
Sbjct: 188 DTLKEKMSAVVSALATCQEKMGTGYLSAFPSELFDRFEAIKPVWAPYYTIHKILAGLLDQ 247
Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
Y A N+QALKM TWMVE+FY RVQ VITMYS+ERHW SLNEETGGMNDVLYRLYSIT D
Sbjct: 248 YTFAGNSQALKMMTWMVEHFYKRVQNVITMYSLERHWLSLNEETGGMNDVLYRLYSITGD 307
Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
KHL+LAHLFDKPCFLG LA+QAD +S FHANTHIP+VIGSQMRYEVTGDPLYK IGTFF
Sbjct: 308 QKHLVLAHLFDKPCFLGLLAVQADSISGFHANTHIPVVIGSQMRYEVTGDPLYKAIGTFF 367
Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
MDIVN+SHSYATGGTS EFW DPKRLA TL ENEE+CTTYNMLKVSRHLFRWTKE+ Y
Sbjct: 368 MDIVNSSHSYATGGTSVGEFWSDPKRLASTLQRENEESCTTYNMLKVSRHLFRWTKEVVY 427
Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
ADYYERALTNGVLSIQRGT+PGVMIYMLPLGRG SKARS HGWGTKF+SFWCCYGTGIES
Sbjct: 428 ADYYERALTNGVLSIQRGTDPGVMIYMLPLGRGDSKARSYHGWGTKFDSFWCCYGTGIES 487
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
FSKLGDSIYFEEEG P +YIIQYISSS DWKSG +VLNQKVDP+VSWDPYLR TLTF+
Sbjct: 488 FSKLGDSIYFEEEGKSPEVYIIQYISSSLDWKSGQIVLNQKVDPVVSWDPYLRTTLTFTP 547
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
K+ GQ S++NLR+PVW S+GA+AS+N Q+LP+P P +FLS T WS DKLT+QLP+
Sbjct: 548 KEGAGQSSTINLRIPVWASSSGAKASINAQDLPVPAPSSFLSLTRNWSPGDKLTLQLPIR 607
Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
LRTEAI+DDRP+YASIQAIL+GPYLLAG TS +WDIKTG+A SLS I+PIP S N++LV
Sbjct: 608 LRTEAIKDDRPKYASIQAILYGPYLLAGLTSDDWDIKTGSATSLSDWITPIPASDNSRLV 667
Query: 663 TFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKS 722
+ +QESGNS+FV SNSNQSITME+FP GTDA+LHATFRL+LKDA+ S + IGKS
Sbjct: 668 SLSQESGNSSFVFSNSNQSITMEKFPEEGTDASLHATFRLVLKDATSLKVLSPKDAIGKS 727
Query: 723 VMLEPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCF 781
VMLEP D PGM +VQQG L ++ S GS F LVAGLD ++ TVSLE+E++K C+
Sbjct: 728 VMLEPIDLPGMVVVQQGTNQNLGIANSAAGKGSL-FHLVAGLDGKDGTVSLESESQKDCY 786
Query: 782 VSSGVNFEPGASLKL--LCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLA 839
V SG+++ G S+KL L + S D FN+A SF+++ GIS+YHPISFVAKG +RNFLL
Sbjct: 787 VYSGIDYNSGTSIKLKSLSESGSSDEDFNKATSFILKEGISQYHPISFVAKGMKRNFLLT 846
Query: 840 PLLSFRDEAYTVYFNIQD 857
PLL RDE+YTVYFNIQD
Sbjct: 847 PLLGLRDESYTVYFNIQD 864
>gi|224053368|ref|XP_002297785.1| predicted protein [Populus trichocarpa]
gi|222845043|gb|EEE82590.1| predicted protein [Populus trichocarpa]
Length = 858
Score = 1264 bits (3271), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 604/855 (70%), Positives = 711/855 (83%), Gaps = 7/855 (0%)
Query: 5 FVLFFFFCFGLALGKQCTNQ-SPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
V+ C G K+CTN + SH FRY L +S N+TWKEE+ +H+HLTPTDDSAW+
Sbjct: 7 LVVLSMLC-GFGTSKECTNTPTQLSSHTFRYALLSSENETWKEEMFAHYHLTPTDDSAWA 65
Query: 63 SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
+L+P KIL ++DE SWA++YR +K+P GNFLKEVSLH+V LD SS+ W+AQQTN
Sbjct: 66 NLLPRKIL-REEDEYSWAMMYRNLKSP--LKSSGNFLKEVSLHNVRLDPSSIHWQAQQTN 122
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
LEYLLMLDVDSLVWSFRKTA L TPG AYGGWE P ELRGHFVGHYLSASAQMWASTHN
Sbjct: 123 LEYLLMLDVDSLVWSFRKTAGLSTPGTAYGGWEAPNCELRGHFVGHYLSASAQMWASTHN 182
Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
++++MS VV +LS CQ K+G+GYLSAFP+ELFD FEA+KPVWAPYYTIHKILAGLLDQ
Sbjct: 183 DILEKQMSAVVSALSSCQEKMGSGYLSAFPSELFDRFEAIKPVWAPYYTIHKILAGLLDQ 242
Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
Y ADNAQALKM WMV+YFYNRV+ VIT +SVERH+ SLNEETGGMNDVLY+L+SIT D
Sbjct: 243 YTFADNAQALKMVKWMVDYFYNRVRNVITNFSVERHYQSLNEETGGMNDVLYKLFSITGD 302
Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
PKHL+LAHLFDKPCFLG LA+QA+ +S FHANTHIPIVIG+QMRYE+TGDPLYK IGTFF
Sbjct: 303 PKHLVLAHLFDKPCFLGLLAVQAEDISGFHANTHIPIVIGAQMRYEITGDPLYKDIGTFF 362
Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
MDIVN+SHSYATGGTS EFW DPKRLA TL +ENEE+CTTYNMLKVSRHLFRWTKE+AY
Sbjct: 363 MDIVNSSHSYATGGTSVSEFWSDPKRLASTLQTENEESCTTYNMLKVSRHLFRWTKEMAY 422
Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
ADYYERALTNGVL IQRGTEPGVMIYMLP G SK +S HGWGT +++FWCCYGTGIES
Sbjct: 423 ADYYERALTNGVLGIQRGTEPGVMIYMLPQHPGSSKGKSYHGWGTLYDTFWCCYGTGIES 482
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
FSKLGDSIYFEEEG PGLYIIQYISSS DWKSG +++NQKVDP+VS DPYLR+T TFS
Sbjct: 483 FSKLGDSIYFEEEGEAPGLYIIQYISSSLDWKSGQIMINQKVDPVVSSDPYLRVTFTFSP 542
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
+ Q S+LNLR+PVWT+ +GA A++N Q+L +P PG+FLS +WS DKL++QLP+S
Sbjct: 543 NKGSSQASTLNLRIPVWTHLDGATATINSQSLAIPAPGSFLSVNRKWSSGDKLSLQLPIS 602
Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
LRTEAIQDDR +YASIQAIL+GPYLLAGHTSG+W++K G+A SLS I+PIP S+N QLV
Sbjct: 603 LRTEAIQDDRHQYASIQAILYGPYLLAGHTSGDWNLKAGSAGSLSDSITPIPASYNEQLV 662
Query: 663 TFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKS 722
+F+Q+SGNSTFV++NSNQSITMEE P SGTDA L ATFR++ D+S S +N+VI KS
Sbjct: 663 SFSQDSGNSTFVLTNSNQSITMEEHPKSGTDACLQATFRIVFNDSSSSEVLGINDVIDKS 722
Query: 723 VMLEPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCF 781
VMLEPFD PGM LVQQGK+ L V+ S + GSS F +V GLD ++ TVSLE+ +++GC+
Sbjct: 723 VMLEPFDLPGMLLVQQGKDSSLAVTNSAADDGSSIFHVVLGLDGKDGTVSLESGSQEGCY 782
Query: 782 VSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPL 841
+ SGVN++ G S+KL C S D GFN+ ASF+M G+SEYHPISFVA+G +RNFLLAPL
Sbjct: 783 IYSGVNYKSGQSMKLSCKLGSSDPGFNQGASFVMNKGLSEYHPISFVAEGDKRNFLLAPL 842
Query: 842 LSFRDEAYTVYFNIQ 856
S RDE YT+YFNIQ
Sbjct: 843 HSLRDEFYTIYFNIQ 857
>gi|224075776|ref|XP_002304762.1| predicted protein [Populus trichocarpa]
gi|222842194|gb|EEE79741.1| predicted protein [Populus trichocarpa]
Length = 858
Score = 1251 bits (3237), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 608/848 (71%), Positives = 714/848 (84%), Gaps = 10/848 (1%)
Query: 13 FGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWSSLIPSKIL 70
FG++ K+CTN + SH+FRYEL +S N+TWKEE+ H+HL PTDDSAWSSL+P KIL
Sbjct: 16 FGIS--KECTNIPTQLSSHSFRYELLSSQNETWKEEMFEHYHLIPTDDSAWSSLLPRKIL 73
Query: 71 GDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLD 130
++ DE SW ++YR +K+P GNFL E+SLH+V LD SS+ W+AQQTNLEYLLMLD
Sbjct: 74 REE-DEHSWEMMYRNLKSP--LKSSGNFLNEMSLHNVRLDPSSIHWKAQQTNLEYLLMLD 130
Query: 131 VDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMS 190
V++LVWSFRKTA TPGKAYGGWE P SELRGHFVGHYLSASAQMWASTHN T+K+KMS
Sbjct: 131 VNNLVWSFRKTAGSSTPGKAYGGWEKPDSELRGHFVGHYLSASAQMWASTHNETLKKKMS 190
Query: 191 TVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQ 250
VV +LS CQ K+GTGYLSAFP+ELFD FEA+KPVWAPYYTIHKILAGLLDQY LADNAQ
Sbjct: 191 AVVSALSACQVKMGTGYLSAFPSELFDRFEAIKPVWAPYYTIHKILAGLLDQYTLADNAQ 250
Query: 251 ALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAH 310
ALKM WMV+YFYNRV+ VIT YSVERH+ SLNEETGGMNDVLY+L+SIT DPKHL+LAH
Sbjct: 251 ALKMVKWMVDYFYNRVRNVITNYSVERHYLSLNEETGGMNDVLYKLFSITGDPKHLVLAH 310
Query: 311 LFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASH 370
LFDKPCFLG LA+QAD +S FHANTHIP+VIG+QMRYE+TGDPLYK IG FFMD+VN+SH
Sbjct: 311 LFDKPCFLGLLAVQADDISGFHANTHIPVVIGAQMRYEITGDPLYKDIGAFFMDVVNSSH 370
Query: 371 SYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERAL 430
SYATGGTS EFW DPKRLA TL +ENEE+CTTYNMLKVSRHLFRWTKE+AYADYYERAL
Sbjct: 371 SYATGGTSVSEFWSDPKRLASTLQTENEESCTTYNMLKVSRHLFRWTKEMAYADYYERAL 430
Query: 431 TNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSI 490
TNGVL IQRGTEPGVMIYMLP G SKA+S HGWGT ++SFWCCYGTGIESFSKLGDSI
Sbjct: 431 TNGVLGIQRGTEPGVMIYMLPQYPGSSKAKSYHGWGTSYDSFWCCYGTGIESFSKLGDSI 490
Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
YF EEG PGLYIIQYISSS DWKSG +VLNQKVDPIVS DPYLR+TLTFS K+ Q S
Sbjct: 491 YF-EEGEAPGLYIIQYISSSLDWKSGQIVLNQKVDPIVSSDPYLRVTLTFSPKKGTSQAS 549
Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
+L LR+P+WT S GA A++N Q+L LP PG+FLS +W +DKLT+Q+P+SLRTEAI+D
Sbjct: 550 TLYLRIPIWTNSEGATATINSQSLRLPAPGSFLSVNRKWRSSDKLTLQIPISLRTEAIKD 609
Query: 611 DRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGN 670
+R EYAS+QAIL+GPYLLAGHTSG+W++K+G+ SLS I+PIP S+N QLV+F+QESG
Sbjct: 610 ERHEYASVQAILYGPYLLAGHTSGDWNLKSGSGNSLSDSITPIPGSYNGQLVSFSQESGI 669
Query: 671 STFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDF 730
STFV++NSNQSI+ME+ P SGTDA+L ATFRL+ KD+S S SS+ +VIGKSVMLEPF
Sbjct: 670 STFVLTNSNQSISMEKLPESGTDASLQATFRLVFKDSSSSKLSSVKDVIGKSVMLEPFHL 729
Query: 731 PGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFE 789
PGM LVQQGK+ ++ S + GSS FR+V+GLD ++ TVSLE+ + GC+V SGV+++
Sbjct: 730 PGMLLVQQGKDRSFTLTNSADDDGSSIFRVVSGLDGKDGTVSLESGIQNGCYVYSGVDYK 789
Query: 790 PGASLKLLC-STESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEA 848
G S+KL C S S D GFN+ ASF+M G+S+YHPISFVAKG +RNFLLAPL S RDE+
Sbjct: 790 SGQSMKLSCKSGSSSDTGFNQGASFVMNKGLSQYHPISFVAKGDKRNFLLAPLHSLRDES 849
Query: 849 YTVYFNIQ 856
YT+YFNIQ
Sbjct: 850 YTIYFNIQ 857
>gi|359478753|ref|XP_002283032.2| PREDICTED: uncharacterized protein LOC100250068 [Vitis vinifera]
Length = 874
Score = 1211 bits (3133), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 587/849 (69%), Positives = 691/849 (81%), Gaps = 9/849 (1%)
Query: 14 GLALGKQCTNQ-SPYDSHAFRYELT-STNKTWKEEVLSHF-HLTPTDDSAWSSLIPSKIL 70
G LGK+CTN SP SH RYEL S N++ K E L+H+ +L TD S W + +P K L
Sbjct: 20 GCGLGKKCTNSGSPLSSHTLRYELLFSKNESRKAEALAHYSNLIRTDGSGWLTSLPRKAL 79
Query: 71 GDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLD 130
++DE S A+ Y+ +K+ G + FLKE SLHDV L S+ WRAQQTNLEYLLMLD
Sbjct: 80 -REEDEFSRAMKYQTMKSYDGSN--SKFLKEFSLHDVRLGSDSLHWRAQQTNLEYLLMLD 136
Query: 131 VDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMS 190
D LVWSFR+TA LPTP YGGWE+P ELRGHFVGHYLSASAQMWASTHN ++KEKMS
Sbjct: 137 ADRLVWSFRRTAGLPTPCSPYGGWESPDGELRGHFVGHYLSASAQMWASTHNESLKEKMS 196
Query: 191 TVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQ 250
VV +L ECQ K+GTGYLSAFP+ELFD FEAL+ VWAPYYTIHKILAGLLDQY L NAQ
Sbjct: 197 AVVCALGECQKKMGTGYLSAFPSELFDRFEALEEVWAPYYTIHKILAGLLDQYTLGGNAQ 256
Query: 251 ALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAH 310
ALKM TWMVEYFYNRVQ VI+ YS+ERHW SLNEETGGMND LY LY IT D KH +LAH
Sbjct: 257 ALKMVTWMVEYFYNRVQNVISSYSIERHWLSLNEETGGMNDFLYNLYRITGDQKHFVLAH 316
Query: 311 LFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASH 370
LFDKPCFLG LA+QAD +S FHANTHIPIV+G+QMRYE+TGDPLYK IG FF+D VN+SH
Sbjct: 317 LFDKPCFLGLLAMQADDISGFHANTHIPIVVGAQMRYEITGDPLYKTIGAFFIDTVNSSH 376
Query: 371 SYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERAL 430
SYATGGTS EFW DPKR+A TL +EN E+CTTYNMLKVSR+LFRWTKE+AYADYYERAL
Sbjct: 377 SYATGGTSVDEFWSDPKRMATTLQTENAESCTTYNMLKVSRNLFRWTKEVAYADYYERAL 436
Query: 431 TNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSI 490
TNG+LSIQRGT+PGVM+YMLPLG G SKARS HGWGTKF+SFWCCYGTGIESFSKLGDSI
Sbjct: 437 TNGILSIQRGTDPGVMLYMLPLGHGNSKARSYHGWGTKFHSFWCCYGTGIESFSKLGDSI 496
Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK--QEVGQ 548
YFEEEG VPGLYIIQYISSS DWKSG VVLNQKVD +VSWDPYLR+TLTFS K Q GQ
Sbjct: 497 YFEEEGEVPGLYIIQYISSSLDWKSGQVVLNQKVDTVVSWDPYLRITLTFSPKKMQGAGQ 556
Query: 549 LSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
S++NLR+PVW YS+GA+A++N Q LP+P P +FLS +WS +DKLT+QLP++LRTEAI
Sbjct: 557 SSAINLRIPVWAYSSGAKAAVNAQALPVPAPNSFLSFRRKWSPDDKLTLQLPIALRTEAI 616
Query: 609 QDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQES 668
+DDRP+YA +QAIL+GPYLL G T+ +WDI+T A SLS I+PIP S N+ L++ +QES
Sbjct: 617 KDDRPKYACLQAILYGPYLLVGLTNNDWDIQTDLAASLSDWITPIPASHNSHLISLSQES 676
Query: 669 GNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPF 728
GNS+F +NSNQS+TME +P SGTDA+L+ATFRLIL+D++ S SS + IGK VMLEP
Sbjct: 677 GNSSFAFTNSNQSLTMERYPESGTDASLNATFRLILEDSTSSKISSPKDAIGKFVMLEPI 736
Query: 729 DFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVN 787
+FPGM +VQ+G + L ++ S +GSS F LVAGLD ++ TVSLE++ +KGCFV S VN
Sbjct: 737 NFPGMAVVQRGTNESLGITNSASVVGSSLFHLVAGLDGKDGTVSLESKTQKGCFVYSDVN 796
Query: 788 FEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDE 847
++ G+++KL C S D FN+A SF ++ GISEYHPISFVAKG RR++LLAPLLS RDE
Sbjct: 797 YDSGSAIKLKCKLASSDVVFNQATSFTLKHGISEYHPISFVAKGLRRDYLLAPLLSLRDE 856
Query: 848 AYTVYFNIQ 856
+YTVYFNIQ
Sbjct: 857 SYTVYFNIQ 865
>gi|356541912|ref|XP_003539416.1| PREDICTED: uncharacterized protein LOC100783150 [Glycine max]
Length = 854
Score = 1181 bits (3055), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/857 (67%), Positives = 688/857 (80%), Gaps = 12/857 (1%)
Query: 5 FVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWSS 63
F L G K+CTN P SH FRYEL STN TWK EV+ H+HLTPTD++AW+
Sbjct: 6 FALVAILLCGCDAAKECTN-IPTQSHTFRYELLMSTNATWKAEVMDHYHLTPTDETAWAD 64
Query: 64 LIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNL 123
L+P K+L +Q ++ W ++YRKIKN G F FLKEV L DV L + S+ RAQQTNL
Sbjct: 65 LLPRKLLSEQ-NQHDWGVMYRKIKNMGVFKSGEGFLKEVPLQDVRLHKDSIHGRAQQTNL 123
Query: 124 EYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNA 183
EYLLMLDVDSL+WSFRKTA+L TPG YGGWE P ELRGHFVGHYLSASA MWAST N
Sbjct: 124 EYLLMLDVDSLIWSFRKTAALSTPGTPYGGWEGPEVELRGHFVGHYLSASALMWASTQND 183
Query: 184 TIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQY 243
T+K+KMS++V LS CQ KIGTGYLSAFP+E FD FEA++PVWAPYYTIHKILAGLLDQ+
Sbjct: 184 TLKQKMSSLVAGLSACQEKIGTGYLSAFPSEFFDRFEAVQPVWAPYYTIHKILAGLLDQH 243
Query: 244 VLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDP 303
A N QALKM TWMV+YFYNRVQ VIT Y+V RH+ S+NEETGGMNDVLYRLYSIT D
Sbjct: 244 TFAGNPQALKMVTWMVDYFYNRVQNVITKYTVNRHYQSMNEETGGMNDVLYRLYSITGDS 303
Query: 304 KHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFM 363
KHL+LAHLFDKPCFLG LA+QA+ ++ HANTHIPIV+GSQMRYE+TGDPLYK IGTFFM
Sbjct: 304 KHLVLAHLFDKPCFLGLLAVQANDIADLHANTHIPIVVGSQMRYEITGDPLYKQIGTFFM 363
Query: 364 DIVNASHSYATGGTSAREFWWDPKRLADTL-GSENEETCTTYNMLKVSRHLFRWTKEIAY 422
D+VN+SHSYATGGTS REFW DPKR+AD L +ENEE+CTTYNMLKVSRHLFRWTKE++Y
Sbjct: 364 DLVNSSHSYATGGTSVREFWSDPKRIADNLRTTENEESCTTYNMLKVSRHLFRWTKEVSY 423
Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
ADYYERALTNGVLSIQRGT+PGVMIYMLPLG VSKAR+ H WGT+F+SFWCCYGTGIES
Sbjct: 424 ADYYERALTNGVLSIQRGTDPGVMIYMLPLGFAVSKARTGHSWGTQFDSFWCCYGTGIES 483
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
FSKLGDSIYFEEEG P LYIIQYISSSF+WKSG ++LNQ V P S DPYLR+T TFS
Sbjct: 484 FSKLGDSIYFEEEGKDPTLYIIQYISSSFNWKSGKILLNQTVVPASSSDPYLRVTFTFSP 543
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
+ LS+LN R+P WT +GA+ LNGQ L LP PGN+LS T +WS +DKLT+QLPL+
Sbjct: 544 VEVTNTLSTLNFRLPSWTLLDGAKGILNGQTLSLPNPGNYLSITRQWSASDKLTLQLPLT 603
Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTS-GEWDIKTGTARSLSALISPIPPSFNAQL 661
+RTEAI+DDRPEYAS+QAIL+GPYLLAGHT+ G+W++K G + I+PIP S+N+QL
Sbjct: 604 VRTEAIKDDRPEYASVQAILYGPYLLAGHTTGGDWNLKAGANN--ADWITPIPASYNSQL 661
Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGK 721
V+F ++ STFV++NSNQS++M++ P GTD AL ATFR++L+++S S FS L + +
Sbjct: 662 VSFFRDFEGSTFVLANSNQSVSMQKLPEFGTDLALQATFRIVLEESS-SKFSKLADANDR 720
Query: 722 SVMLEPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGC 780
SVMLEPFD PGM ++ QG L+ +S + S+ F LV GLD RNETVSLE+++ KGC
Sbjct: 721 SVMLEPFDLPGMNVIHQGAGKPLLTVDSSQGGPSAVFLLVPGLDGRNETVSLESQSNKGC 780
Query: 781 FVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAP 840
+V SG++ P A +KL C ++S DA FN+AASF+ G+S+Y+PISFVAKGA RNFLL P
Sbjct: 781 YVYSGMS--PSAGVKLSCKSDS-DATFNQAASFVALQGLSQYNPISFVAKGANRNFLLQP 837
Query: 841 LLSFRDEAYTVYFNIQD 857
LLSFRDE YTVYFNIQD
Sbjct: 838 LLSFRDEHYTVYFNIQD 854
>gi|356541181|ref|XP_003539059.1| PREDICTED: uncharacterized protein LOC100781521 [Glycine max]
Length = 854
Score = 1176 bits (3042), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 575/859 (66%), Positives = 684/859 (79%), Gaps = 12/859 (1%)
Query: 3 FGFVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAW 61
F FV G K+CTN P SH FRYEL S N TWK EV+ H+HLTPTD++ W
Sbjct: 4 FVFVFVAILLCGCVAAKECTN-IPTQSHTFRYELLMSKNATWKAEVMDHYHLTPTDETVW 62
Query: 62 SSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQT 121
+ L+P K L +Q ++ W ++YRKIKN G F FLKEV L DV L + S+ RAQQT
Sbjct: 63 ADLLPRKFLSEQ-NQHDWGVMYRKIKNMGVFKSGEGFLKEVPLQDVRLHKDSIHARAQQT 121
Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH 181
NLEYLLMLDVDSL+WSFRKTA L TPG YGGWE P ELRGHFVGHYLSASA MWAST
Sbjct: 122 NLEYLLMLDVDSLIWSFRKTAGLSTPGTPYGGWEGPEVELRGHFVGHYLSASALMWASTQ 181
Query: 182 NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLD 241
N T+K+KMS++V LS CQ KIGTGYLSAFP+E FD FE ++PVWAPYYTIHKILAGLLD
Sbjct: 182 NDTLKQKMSSLVAGLSACQEKIGTGYLSAFPSEFFDRFETVQPVWAPYYTIHKILAGLLD 241
Query: 242 QYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITH 301
Q+ A N QALKM TWMV+YFYNRVQ VIT Y+V RH+ SLNEETGGMNDVLYRLYSIT
Sbjct: 242 QHTFAGNPQALKMVTWMVDYFYNRVQNVITKYTVNRHYESLNEETGGMNDVLYRLYSITG 301
Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTF 361
D KHL+LAHLFDKPCFLG LA+QA+ +++FHANTHIP+V+GSQMRYE+TGDPLYK IGTF
Sbjct: 302 DSKHLVLAHLFDKPCFLGLLAMQANDIANFHANTHIPVVVGSQMRYEITGDPLYKQIGTF 361
Query: 362 FMDIVNASHSYATGGTSAREFWWDPKRLADTL-GSENEETCTTYNMLKVSRHLFRWTKEI 420
FMD+VN+SHSYATGGTS EFW DPKR+AD L +ENEE+CTTYNMLKVSRHLFRWTKE+
Sbjct: 362 FMDLVNSSHSYATGGTSVSEFWSDPKRIADNLRTTENEESCTTYNMLKVSRHLFRWTKEV 421
Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
+YADYYERALTNGVLSIQRGT+PGVMIYMLPLG VSKAR+ H WGT+F+SFWCCYGTGI
Sbjct: 422 SYADYYERALTNGVLSIQRGTDPGVMIYMLPLGFAVSKARTGHSWGTQFDSFWCCYGTGI 481
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
ESFSKLGDSIYFEEEG P LYIIQYI SSF+WKSG ++LNQ V P+ S DPYLR+T TF
Sbjct: 482 ESFSKLGDSIYFEEEGKDPTLYIIQYIPSSFNWKSGKILLNQTVVPVASSDPYLRVTFTF 541
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
S + LS+LN R+P WT +GA+ LNGQ L LP PG +LS T +WS +DKLT+QLP
Sbjct: 542 SPVEVTNTLSTLNFRLPSWTLLDGAKGILNGQTLSLPNPGKYLSVTRQWSGSDKLTLQLP 601
Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS-GEWDIKTGTARSLSALISPIPPSFNA 659
L++RTEAI+DDRPEYAS+QAIL+GPYLLAGHT+ G+WD+K G + I+PIP S+N+
Sbjct: 602 LTVRTEAIKDDRPEYASVQAILYGPYLLAGHTTGGDWDLKAGANN--ADWITPIPASYNS 659
Query: 660 QLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVI 719
QLV+F ++ STFV++NSN+S++M++ P GTD L ATFR++LKD+S S FS+L +
Sbjct: 660 QLVSFFRDFEGSTFVLTNSNKSVSMQKLPEYGTDLTLQATFRIVLKDSS-SKFSTLADAN 718
Query: 720 GKSVMLEPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRK 778
+SVMLEPFDFPGM ++ QG L++++S SS F LV GLD RNETVSLE+++ K
Sbjct: 719 DRSVMLEPFDFPGMNVIHQGAGKPLLIADSSHGGPSSVFLLVPGLDGRNETVSLESQSNK 778
Query: 779 GCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLL 838
GC+V SG++ P + +KL C ++S DA FN+A SF+ G+S+Y+PISFVAKG RNFLL
Sbjct: 779 GCYVYSGMS--PSSGVKLSCKSDS-DATFNKATSFVALQGLSQYNPISFVAKGTNRNFLL 835
Query: 839 APLLSFRDEAYTVYFNIQD 857
PLLSFRDE YTVYFNIQD
Sbjct: 836 QPLLSFRDEHYTVYFNIQD 854
>gi|449448754|ref|XP_004142130.1| PREDICTED: uncharacterized protein LOC101207833 [Cucumis sativus]
Length = 868
Score = 1171 bits (3030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 571/840 (67%), Positives = 681/840 (81%), Gaps = 6/840 (0%)
Query: 19 KQCTNQ-SPYDSHAFRYELTST-NKTWKEEVLSHFHLTPTDDSAWSSLIPSKILGDQKDE 76
K+CTN + SH FRYEL S+ N TWK+E+ SH+HLTPTDD AWS+L+P K+L +++E
Sbjct: 28 KECTNTPTQLGSHTFRYELLSSGNVTWKKELFSHYHLTPTDDFAWSNLLPRKML-KEENE 86
Query: 77 VSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVW 136
+W ++YR++KN G +PG LKE+SLHDV LD +S+ AQ TNL+YLLMLDVD L+W
Sbjct: 87 YNWEMMYRQMKNKDGLRIPGGMLKEISLHDVRLDPNSLHGTAQTTNLKYLLMLDVDRLLW 146
Query: 137 SFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSL 196
SFRKTA LPTPG+ Y GWE ELRGHFVGHYLSASAQMWAST N+ +KEKMS +V L
Sbjct: 147 SFRKTAGLPTPGEPYVGWEKSDCELRGHFVGHYLSASAQMWASTGNSVLKEKMSALVSGL 206
Query: 197 SECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMAT 256
+ CQ+K+GTGYLSAFP+E FD FEA++PVWAPYYTIHKILAGLLDQY A N+QALKM T
Sbjct: 207 ATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAGLLDQYTFAGNSQALKMVT 266
Query: 257 WMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPC 316
WMVEYFYNRVQ VI Y+VERH+ SLNEETGGMNDVLYRLY IT + KHLLLAHLFDKPC
Sbjct: 267 WMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYRITGNTKHLLLAHLFDKPC 326
Query: 317 FLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG 376
FLG LA+QA+ +S FH NTHIPIV+GSQMRYEVTGDPLYK I T+FMDIVN+SHSYATGG
Sbjct: 327 FLGLLAVQAEDISGFHVNTHIPIVVGSQMRYEVTGDPLYKEISTYFMDIVNSSHSYATGG 386
Query: 377 TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
TS EFW DPKRLAD LG+E EE+CTTYNMLKVSR+LF+WTKEIAYADYYERALTNGVLS
Sbjct: 387 TSVHEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTKEIAYADYYERALTNGVLS 446
Query: 437 IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
IQRGT+PGVMIYMLPLG G SKA S HGWGT F SFWCCYGTGIESFSKLGDSIYFEEE
Sbjct: 447 IQRGTDPGVMIYMLPLGSGSSKAISYHGWGTPFESFWCCYGTGIESFSKLGDSIYFEEEL 506
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
P LY+IQYISSS DWKSG+V+LNQ VDPI S DP LRMTLTFS K S++NLR+
Sbjct: 507 QTPTLYVIQYISSSLDWKSGNVLLNQTVDPIHSEDPKLRMTLTFSPKVGSVHSSTINLRI 566
Query: 557 PVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
P WT ++GA+ LNGQ+L GNF S T WS +KL+++LP++LRTEAI DDR EYA
Sbjct: 567 PSWTSASGAKVVLNGQSLGNNINGNFKSVTNSWSSGNKLSLELPINLRTEAIDDDRSEYA 626
Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMS 676
S++AILFGPYLLA +++G+W+IKT A SLS I+ +P ++N LVTF+Q SG ++F ++
Sbjct: 627 SVKAILFGPYLLAAYSNGDWEIKTQQADSLSDWITHVPSAYNTFLVTFSQASGKTSFALT 686
Query: 677 NSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLV- 735
NSNQSITME++P GTD+A+HATFRLI+ D S + + L +VIGK VMLEPF FPGM++
Sbjct: 687 NSNQSITMEKYPGQGTDSAVHATFRLIIDDPS-AKVTELQDVIGKRVMLEPFSFPGMVLG 745
Query: 736 QQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLK 795
+GK++ L ++++ E SS F LV GLD +N TVSL + + +GCFV SGVN+E GA LK
Sbjct: 746 NKGKDERLEIADANSEGHSSDFYLVEGLDGKNGTVSLASIDNEGCFVYSGVNYESGAQLK 805
Query: 796 LLCSTE-SLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFN 854
L C ++ SLD GF+ A+SF++E G S+YHPISFV KG RNFLLAPLLSF DE+YTVYFN
Sbjct: 806 LSCKSKLSLDDGFDEASSFLLESGASQYHPISFVTKGMTRNFLLAPLLSFVDESYTVYFN 865
>gi|15239944|ref|NP_196799.1| uncharacterized protein [Arabidopsis thaliana]
gi|7630051|emb|CAB88259.1| putative protein [Arabidopsis thaliana]
gi|26451123|dbj|BAC42665.1| unknown protein [Arabidopsis thaliana]
gi|332004451|gb|AED91834.1| uncharacterized protein [Arabidopsis thaliana]
Length = 861
Score = 1130 bits (2923), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 546/846 (64%), Positives = 658/846 (77%), Gaps = 10/846 (1%)
Query: 15 LALGKQCTNQ-SPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWSSLIPSKILGD 72
+++ K+CTN + SH FR EL S N+T K E+ SH+HLTP DDSAWSSL+P K+L +
Sbjct: 21 VSVAKECTNTPTQLSSHTFRSELLQSKNETLKTELFSHYHLTPADDSAWSSLLPRKMLKE 80
Query: 73 QKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVD 132
+ DE +W +LYRK K+ GNFLK+VSLHDV LD S WRAQQTNLEYLLMLDVD
Sbjct: 81 EADEFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPDSFHWRAQQTNLEYLLMLDVD 137
Query: 133 SLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTV 192
L WSFRK A L PG YGGWE P SELRGHFVGHYLSA+A MWASTHN T+KEKMS +
Sbjct: 138 GLAWSFRKEAGLDAPGDYYGGWERPDSELRGHFVGHYLSATAYMWASTHNDTLKEKMSAL 197
Query: 193 VFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQAL 252
V +LSECQ K GTGYLSAFP+ FD FEA+ PVWAPYYTIHKILAGL+DQY LA N+QAL
Sbjct: 198 VSALSECQQKSGTGYLSAFPSSFFDRFEAITPVWAPYYTIHKILAGLVDQYKLAGNSQAL 257
Query: 253 KMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF 312
KMAT M +YFY RV+ VI YSVERHW SLNEETGGMNDVLY+LYSIT D K+LLLAHLF
Sbjct: 258 KMATGMADYFYGRVRNVIRKYSVERHWQSLNEETGGMNDVLYQLYSITGDSKYLLLAHLF 317
Query: 313 DKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSY 372
DKPCFLG LA+QAD +S FHANTHIPIV+GSQ RYE+TGD L+K I FFMDI NASHSY
Sbjct: 318 DKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEISMFFMDIFNASHSY 377
Query: 373 ATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
ATGGTS EFW DPKR+A L +ENEE+CTTYNMLKVSR+LFRWTKE++YADYYERALTN
Sbjct: 378 ATGGTSVSEFWQDPKRMATALQTENEESCTTYNMLKVSRNLFRWTKEVSYADYYERALTN 437
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
GVL IQRGT+PG+MIYMLPLG+GVSKA + HGWGT ++SFWCCYGTGIESFSKLGDSIYF
Sbjct: 438 GVLGIQRGTQPGLMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYGTGIESFSKLGDSIYF 497
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF-SSKQEVGQLSS 551
+E+G P LY+ QYISSS DWKS + ++QKV+P+VSWDPY+R+T T SSK V + S+
Sbjct: 498 QEDGATPALYVTQYISSSLDWKSAGLSISQKVNPVVSWDPYMRVTFTLSSSKVGVAKEST 557
Query: 552 LNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDD 611
LNLR+PVWT S GA+ SLNG+ L +P GNFLS ++W D++T++LP+S+RTEAI+DD
Sbjct: 558 LNLRIPVWTNSIGAKVSLNGRPLNVPTSGNFLSIKQKWKSGDQVTMELPMSIRTEAIKDD 617
Query: 612 RPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNS 671
RPEYAS+QAIL+GPYLLAGHTS +W I T I+PIP + N+ LVT +Q+SGN
Sbjct: 618 RPEYASLQAILYGPYLLAGHTSRDWSITTQAKP--GKWITPIPETQNSYLVTLSQQSGNV 675
Query: 672 TFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFP 731
++V SNSNQ+ITM P GT A+ ATFRL+ D S S +IG+ VMLEPFDFP
Sbjct: 676 SYVFSNSNQTITMRVSPEPGTQDAVAATFRLV-TDNSKPRISGPEGLIGRLVMLEPFDFP 734
Query: 732 GMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEP 790
GM+V+Q + L V + SP + G+S FRLV+GLD + +VSL E++KGCFV S +
Sbjct: 735 GMIVKQATDSSLTVQASSPSDKGASSFRLVSGLDGKLGSVSLRLESKKGCFVYSDQTLKQ 794
Query: 791 GASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYT 850
G L+L C +++ D F AASF ++ G+ +Y+P+SFV G +RNF+L+PL S RDE Y
Sbjct: 795 GTKLRLECGSDATDEKFKEAASFSLKTGMHQYNPMSFVMSGTQRNFVLSPLFSLRDETYN 854
Query: 851 VYFNIQ 856
VYF++Q
Sbjct: 855 VYFSVQ 860
>gi|356557388|ref|XP_003546998.1| PREDICTED: uncharacterized protein LOC100815634 [Glycine max]
Length = 841
Score = 1119 bits (2895), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 556/862 (64%), Positives = 670/862 (77%), Gaps = 28/862 (3%)
Query: 1 MNFGFVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDS 59
M F F +G A GK+CTN SH FRY+L TSTN+TW ++SH HLT DD
Sbjct: 1 MAFLFAFVAIVVWGCAAGKECTNNDA-QSHTFRYQLSTSTNETW--NIMSHNHLTTKDDH 57
Query: 60 AWSSLIPSKILGDQKDEVSWAL-LYRKIKNPGGF---DLPGNFLKEVSLHDVWLDQSSVL 115
+ L+P K+L K+E L + RKI+ G P FLK VSLHDV L+Q S+
Sbjct: 58 LLADLLPRKLL---KEENQRNLDMLRKIEKVGVLKPPQQPQGFLKPVSLHDVRLNQGSIH 114
Query: 116 WRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQ 175
+AQ+TNLEYLLML+VD L+WSFRKTA LPTPG YGGWE+P ELRGHFVGHYLSASA
Sbjct: 115 AQAQRTNLEYLLMLNVDRLLWSFRKTAGLPTPGTPYGGWEDPKMELRGHFVGHYLSASAL 174
Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
MWASTHN ++K+KMS +V +LS CQ KIGTGYLSAFP+E FD EA K VWAPYYT HKI
Sbjct: 175 MWASTHNDSLKKKMSALVANLSICQEKIGTGYLSAFPSEFFDRLEATKYVWAPYYTTHKI 234
Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYR 295
LAGLLDQ+ +A+N QALKM TWMV+YFYNRVQ VIT +S+ RH+ SLNEETGGMNDVLY+
Sbjct: 235 LAGLLDQHSIAENPQALKMVTWMVDYFYNRVQNVITKFSISRHYQSLNEETGGMNDVLYK 294
Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY 355
LYSIT DP+HLLLAHLFDKPCFLG LA++A+ ++HFHANTHIP+++GSQMRYEVTGDPLY
Sbjct: 295 LYSITGDPRHLLLAHLFDKPCFLGLLAVKANDIAHFHANTHIPVIVGSQMRYEVTGDPLY 354
Query: 356 KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLF 414
K IGT FMD+VN+SH+YATGGTS EFW DPKR+ADTL S +NEE+CTTYNMLKVSRHLF
Sbjct: 355 KEIGTLFMDLVNSSHTYATGGTSVNEFWSDPKRMADTLESTDNEESCTTYNMLKVSRHLF 414
Query: 415 RWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC 474
WTK+++YADYYERALTNGVLSIQRGTEPGVMIYMLP GRGVSKA++ GWGTKF+SFWC
Sbjct: 415 TWTKKVSYADYYERALTNGVLSIQRGTEPGVMIYMLPQGRGVSKAKTYFGWGTKFDSFWC 474
Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
CYGTGIESFSKLGDSIYFEE+G P LYIIQYISS F+WKSG ++LNQ V P SWDP+L
Sbjct: 475 CYGTGIESFSKLGDSIYFEEQGENPTLYIIQYISSLFNWKSGQIILNQTVVPPASWDPFL 534
Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
R++ TFS ++ G LS+LN R+P + NG + LN + L LP PGNFLS T +W+ DK
Sbjct: 535 RVSFTFSPAKKTGALSTLNFRLPTRMHKNGEKGILNNETLTLPGPGNFLSITRKWNAGDK 594
Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIP 654
L++QLPL+LR EAI+DDR +YASIQAIL+GPYLLAGHT+G+W+IKT S++ I+PIP
Sbjct: 595 LSLQLPLTLRAEAIKDDRTKYASIQAILYGPYLLAGHTTGDWNIKTAANASIADWITPIP 654
Query: 655 PSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSS 714
S+N L F+Q NSTFV++NSNQS+ +++ P GTD+AL ATFR+I + S + F++
Sbjct: 655 ASYNIHLFYFSQAFANSTFVLTNSNQSLAVKKVPEPGTDSALGATFRVI-QGKSSTKFTT 713
Query: 715 LNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEA 774
L + IGKSVMLEPFD PGM + + S P SS F +V GLD R ET+SLE+
Sbjct: 714 LTDAIGKSVMLEPFDHPGM--------QALPSGGP----SSVFVVVPGLDGRKETISLES 761
Query: 775 ENRKGCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARR 834
++ GCFV SG+ G +KL C T S DA FN+AASF+ + GIS+Y+PISFVAKG R
Sbjct: 762 KSHNGCFVHSGL--RSGRGVKLSCKTTS-DATFNQAASFIAKRGISKYNPISFVAKGENR 818
Query: 835 NFLLAPLLSFRDEAYTVYFNIQ 856
NFLL PLL+FRDE+YTVYFNI+
Sbjct: 819 NFLLEPLLAFRDESYTVYFNIK 840
>gi|297746368|emb|CBI16424.3| unnamed protein product [Vitis vinifera]
Length = 741
Score = 1117 bits (2889), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 528/732 (72%), Positives = 616/732 (84%), Gaps = 3/732 (0%)
Query: 128 MLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKE 187
MLD D LVWSFR+TA LPTP YGGWE+P ELRGHFVGHYLSASAQMWASTHN ++KE
Sbjct: 1 MLDADRLVWSFRRTAGLPTPCSPYGGWESPDGELRGHFVGHYLSASAQMWASTHNESLKE 60
Query: 188 KMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLAD 247
KMS VV +L ECQ K+GTGYLSAFP+ELFD FEAL+ VWAPYYTIHKILAGLLDQY L
Sbjct: 61 KMSAVVCALGECQKKMGTGYLSAFPSELFDRFEALEEVWAPYYTIHKILAGLLDQYTLGG 120
Query: 248 NAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLL 307
NAQALKM TWMVEYFYNRVQ VI+ YS+ERHW SLNEETGGMND LY LY IT D KH +
Sbjct: 121 NAQALKMVTWMVEYFYNRVQNVISSYSIERHWLSLNEETGGMNDFLYNLYRITGDQKHFV 180
Query: 308 LAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVN 367
LAHLFDKPCFLG LA+QAD +S FHANTHIPIV+G+QMRYE+TGDPLYK IG FF+D VN
Sbjct: 181 LAHLFDKPCFLGLLAMQADDISGFHANTHIPIVVGAQMRYEITGDPLYKTIGAFFIDTVN 240
Query: 368 ASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
+SHSYATGGTS EFW DPKR+A TL +EN E+CTTYNMLKVSR+LFRWTKE+AYADYYE
Sbjct: 241 SSHSYATGGTSVDEFWSDPKRMATTLQTENAESCTTYNMLKVSRNLFRWTKEVAYADYYE 300
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLG 487
RALTNG+LSIQRGT+PGVM+YMLPLG G SKARS HGWGTKF+SFWCCYGTGIESFSKLG
Sbjct: 301 RALTNGILSIQRGTDPGVMLYMLPLGHGNSKARSYHGWGTKFHSFWCCYGTGIESFSKLG 360
Query: 488 DSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK--QE 545
DSIYFEEEG VPGLYIIQYISSS DWKSG VVLNQKVD +VSWDPYLR+TLTFS K Q
Sbjct: 361 DSIYFEEEGEVPGLYIIQYISSSLDWKSGQVVLNQKVDTVVSWDPYLRITLTFSPKKMQG 420
Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
GQ S++NLR+PVW YS+GA+A++N Q LP+P P +FLS +WS +DKLT+QLP++LRT
Sbjct: 421 AGQSSAINLRIPVWAYSSGAKAAVNAQALPVPAPNSFLSFRRKWSPDDKLTLQLPIALRT 480
Query: 606 EAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFT 665
EAI+DDRP+YA +QAIL+GPYLL G T+ +WDI+T A SLS I+PIP S N+ L++ +
Sbjct: 481 EAIKDDRPKYACLQAILYGPYLLVGLTNNDWDIQTDLAASLSDWITPIPASHNSHLISLS 540
Query: 666 QESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVML 725
QESGNS+F +NSNQS+TME +P SGTDA+L+ATFRLIL+D++ S SS + IGK VML
Sbjct: 541 QESGNSSFAFTNSNQSLTMERYPESGTDASLNATFRLILEDSTSSKISSPKDAIGKFVML 600
Query: 726 EPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSS 784
EP +FPGM +VQ+G + L ++ S +GSS F LVAGLD ++ TVSLE++ +KGCFV S
Sbjct: 601 EPINFPGMAVVQRGTNESLGITNSASVVGSSLFHLVAGLDGKDGTVSLESKTQKGCFVYS 660
Query: 785 GVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSF 844
VN++ G+++KL C S D FN+A SF ++ GISEYHPISFVAKG RR++LLAPLLS
Sbjct: 661 DVNYDSGSAIKLKCKLASSDVVFNQATSFTLKHGISEYHPISFVAKGLRRDYLLAPLLSL 720
Query: 845 RDEAYTVYFNIQ 856
RDE+YTVYFNIQ
Sbjct: 721 RDESYTVYFNIQ 732
>gi|297807309|ref|XP_002871538.1| hypothetical protein ARALYDRAFT_350453 [Arabidopsis lyrata subsp.
lyrata]
gi|297317375|gb|EFH47797.1| hypothetical protein ARALYDRAFT_350453 [Arabidopsis lyrata subsp.
lyrata]
Length = 860
Score = 1116 bits (2887), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 542/860 (63%), Positives = 658/860 (76%), Gaps = 11/860 (1%)
Query: 1 MNFGFVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDD 58
+ +LF F + + K+CT+ + SH R EL S N+T K E+ SH+HLTPTDD
Sbjct: 7 ITIALLLFTSFVL-VCVAKECTDIPTKLSSHTLRSELLQSQNETLKTELSSHYHLTPTDD 65
Query: 59 SAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRA 118
+AWS+L+P K+L ++ D+ +W +LYRK K+ GNFLK+VSLHDV LD SS WRA
Sbjct: 66 AAWSTLLPRKMLKEETDDFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPSSFHWRA 122
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
QQTNLEYLLML+VD L +SFRK A L PG YGGWE P SELRGHFVGHYLSA+A MWA
Sbjct: 123 QQTNLEYLLMLNVDGLAYSFRKVAGLDAPGVPYGGWEKPDSELRGHFVGHYLSATAYMWA 182
Query: 179 STHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAG 238
STHN T+K KMS +V +L+ECQ K GTGYLSAFP+ FD FEA+ VWAPYYTIHKILAG
Sbjct: 183 STHNDTLKTKMSALVSALAECQQKSGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAG 242
Query: 239 LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYS 298
L+DQY LA N QALKMAT M +YFY RVQ VI YSVERHW SLNEETGGMNDVLY+LYS
Sbjct: 243 LVDQYKLAGNTQALKMATGMADYFYGRVQNVIRKYSVERHWLSLNEETGGMNDVLYQLYS 302
Query: 299 ITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLI 358
IT D K+L LAHLFDKPCFLG LA+QAD +S FHANTHIPIV+GSQ RYE+TGD L+K I
Sbjct: 303 ITRDSKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEI 362
Query: 359 GTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTK 418
FFMDIVNASHSYATGGTS +EFW DPKR+A TL +ENEE+CTTYNMLKVSR+LFRWTK
Sbjct: 363 SMFFMDIVNASHSYATGGTSVKEFWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWTK 422
Query: 419 EIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGT 478
E++YADYYERALTNGVL IQRGT+PG MIYMLPLG+GVSKA + HGWGT ++SFWCCYGT
Sbjct: 423 EVSYADYYERALTNGVLGIQRGTQPGRMIYMLPLGQGVSKAVTYHGWGTPYDSFWCCYGT 482
Query: 479 GIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL 538
GIESFSKLGDSIYF+E+G P LY+ QYISSS DWKS ++L+QKV+P+VSWDPY+R+T
Sbjct: 483 GIESFSKLGDSIYFQEDGASPALYVTQYISSSLDWKSAGLLLSQKVNPVVSWDPYMRVTF 542
Query: 539 TF-SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTI 597
T SSK V + S+LNLR+PVWT S GA+ SLNG+ L +P GNFLS + W D++T+
Sbjct: 543 TLSSSKVGVAKKSTLNLRIPVWTNSIGAKVSLNGKPLKVPTSGNFLSIKQNWKSGDQVTM 602
Query: 598 QLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSF 657
+LP+S+RTEAI+DDRPEYAS+QAIL+GPYLLAGHTS +W I T I+PIP ++
Sbjct: 603 ELPMSIRTEAIKDDRPEYASLQAILYGPYLLAGHTSRDWSIT--TQAKAGNWITPIPETY 660
Query: 658 NAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNN 717
N+ LVT +Q+SGN ++V+SN+NQ+ITM P GT A+ ATFRL+ D S S
Sbjct: 661 NSHLVTLSQQSGNISYVLSNTNQTITMRVSPELGTQDAVAATFRLV-TDNSKPRISGPEA 719
Query: 718 VIGKSVMLEPFDFPGMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAEN 776
+IG VMLEPFDFPGM+V+Q + L V + SP + G+S FRLV+G+D + +VSL E+
Sbjct: 720 LIGSLVMLEPFDFPGMIVKQATDSSLTVQASSPSDKGASSFRLVSGVDGKPGSVSLRLES 779
Query: 777 RKGCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNF 836
GCFV S + G LKL C + D F AASF + G+++Y+P+SFV G +RNF
Sbjct: 780 NNGCFVYSDQTLKQGTKLKLECGPVATDEKFKEAASFKLNTGMNQYNPMSFVMSGTQRNF 839
Query: 837 LLAPLLSFRDEAYTVYFNIQ 856
+L+PL S RDE Y VYF++Q
Sbjct: 840 VLSPLFSLRDETYNVYFSVQ 859
>gi|297811349|ref|XP_002873558.1| hypothetical protein ARALYDRAFT_488069 [Arabidopsis lyrata subsp.
lyrata]
gi|297319395|gb|EFH49817.1| hypothetical protein ARALYDRAFT_488069 [Arabidopsis lyrata subsp.
lyrata]
Length = 860
Score = 1116 bits (2887), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 542/856 (63%), Positives = 657/856 (76%), Gaps = 14/856 (1%)
Query: 5 FVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
+ F C + K+CT+ + SH EL S NKT K E+ SH+HLTPTDD+AWS
Sbjct: 14 YTSFLLVC----VAKECTDIPTKLSSHTLNSELLQSHNKTLKTELFSHYHLTPTDDAAWS 69
Query: 63 SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
+L+P K+L ++ DE +W +LYRK K+ GNFLK+VSLHDV LD +S WRAQQTN
Sbjct: 70 TLLPRKMLKEETDEFAWTMLYRKFKDSNSV---GNFLKDVSLHDVRLDPNSFHWRAQQTN 126
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
LEYLLMLDVD L +SFRK A L G YGGWE P SELRGHFVGHYLSA+A MWASTHN
Sbjct: 127 LEYLLMLDVDGLAYSFRKVAGLDASGVPYGGWEKPDSELRGHFVGHYLSATAHMWASTHN 186
Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
T+K KMS +V +L+ECQ K GTGYLSAFP+ FD FEA+ VWAPYYTIHKILAGL+DQ
Sbjct: 187 DTLKAKMSALVSALAECQQKSGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAGLVDQ 246
Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
Y LA N QALKMAT M +YFY RV+ VIT YSVERH+ SLNEETGGMNDVLY+LYSIT D
Sbjct: 247 YKLAGNIQALKMATGMADYFYGRVRNVITKYSVERHYQSLNEETGGMNDVLYQLYSITRD 306
Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
K+L LAHLFDKPCFLG LA+QAD +S FHANTHIPIV+GSQ RYE+TGD L+K I FF
Sbjct: 307 SKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEISMFF 366
Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
MDI+NASHSYATGGTS REFW DPKR+A TL +ENEE+CTTYNMLKVSR+LFRWTKE++Y
Sbjct: 367 MDIINASHSYATGGTSVREFWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWTKEVSY 426
Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
ADYYERALTNGVL IQRGT+PG MIYMLPLG+GVSKA + HGWGT ++SFWCCYGTGIES
Sbjct: 427 ADYYERALTNGVLGIQRGTQPGRMIYMLPLGQGVSKAVTYHGWGTPYDSFWCCYGTGIES 486
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF-S 541
FSKLGDSIYF+E+G P LY+ QYISSS DWKS ++L+QKV+P+VSWDPY+R+T T S
Sbjct: 487 FSKLGDSIYFQEDGASPALYVTQYISSSLDWKSAGLLLSQKVNPVVSWDPYMRVTFTLSS 546
Query: 542 SKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
SK V + S+LNLR+PVWT S GA+ SLNG+ L +P GNFLS + W D++T++LP+
Sbjct: 547 SKVGVAKKSTLNLRIPVWTNSIGAKVSLNGKPLKVPTSGNFLSIKQNWKSGDQVTMELPM 606
Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQL 661
S+RTEAI+DDRPEYAS+QAIL+GPYLLAGHTS +W I T I+PIP ++N+ L
Sbjct: 607 SIRTEAIKDDRPEYASLQAILYGPYLLAGHTSRDWSIT--TQAKAGNWITPIPETYNSHL 664
Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGK 721
VT +Q+SGN ++V+SN+NQ+ITM P GT A+ ATFRL+ D S S L +IG
Sbjct: 665 VTLSQQSGNISYVLSNTNQTITMRVSPELGTQDAVAATFRLV-TDNSKPQISGLEALIGS 723
Query: 722 SVMLEPFDFPGMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGC 780
VMLEPFDFPGM+V+Q + L V + SP + G+S FRLV+G+D + +VSL E+ GC
Sbjct: 724 LVMLEPFDFPGMIVKQTTDSSLTVQASSPSDKGASSFRLVSGVDGKPGSVSLRLESNNGC 783
Query: 781 FVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAP 840
FV S + G LKL C + D F +AASF + IG+++Y+P+SFV G +RNF+L+P
Sbjct: 784 FVYSDQTLKQGTKLKLECGPVATDEKFKQAASFKLNIGMNQYNPMSFVMSGTQRNFVLSP 843
Query: 841 LLSFRDEAYTVYFNIQ 856
L S RDE Y VYF++Q
Sbjct: 844 LFSLRDETYNVYFSVQ 859
>gi|297746357|emb|CBI16413.3| unnamed protein product [Vitis vinifera]
Length = 767
Score = 1113 bits (2878), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 536/720 (74%), Positives = 608/720 (84%), Gaps = 5/720 (0%)
Query: 5 FVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
V+F F G LGK+CTN + SH+FRYEL S N++WK E+ H+HL TDDSAWS
Sbjct: 11 IVVFAFVLCGCVLGKECTNVPTQLSSHSFRYELLASNNESWKAEMFQHYHLIHTDDSAWS 70
Query: 63 SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
+L+P K+L ++ DE SWA++YR +KN G + NFLKE+SLHDV LD S+ RAQQTN
Sbjct: 71 NLLPRKLLREE-DEFSWAMMYRNMKNYDGSN--SNFLKEMSLHDVRLDSDSLHGRAQQTN 127
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
L+YLL+LDVD LVWSFRKTA L TPG YGGWE P ELRGHFVGHY+SASAQMWASTHN
Sbjct: 128 LDYLLILDVDRLVWSFRKTAGLSTPGLPYGGWEAPNVELRGHFVGHYMSASAQMWASTHN 187
Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
T+KEKMS VV +L+ CQ K+GTGYLSAFP+ELFD FEA+KPVWAPYYTIHKILAGLLDQ
Sbjct: 188 DTLKEKMSAVVSALATCQEKMGTGYLSAFPSELFDRFEAIKPVWAPYYTIHKILAGLLDQ 247
Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
Y A N+QALKM TWMVE+FY RVQ VITMYS+ERHW SLNEETGGMNDVLYRLYSIT D
Sbjct: 248 YTFAGNSQALKMMTWMVEHFYKRVQNVITMYSLERHWLSLNEETGGMNDVLYRLYSITGD 307
Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
KHL+LAHLFDKPCFLG LA+QAD +S FHANTHIP+VIGSQMRYEVTGDPLYK IGTFF
Sbjct: 308 QKHLVLAHLFDKPCFLGLLAVQADSISGFHANTHIPVVIGSQMRYEVTGDPLYKAIGTFF 367
Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
MDIVN+SHSYATGGTS EFW DPKRLA TL ENEE+CTTYNMLKVSRHLFRWTKE+ Y
Sbjct: 368 MDIVNSSHSYATGGTSVGEFWSDPKRLASTLQRENEESCTTYNMLKVSRHLFRWTKEVVY 427
Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
ADYYERALTNGVLSIQRGT+PGVMIYMLPLGRG SKARS HGWGTKF+SFWCCYGTGIES
Sbjct: 428 ADYYERALTNGVLSIQRGTDPGVMIYMLPLGRGDSKARSYHGWGTKFDSFWCCYGTGIES 487
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
FSKLGDSIYFEEEG P +YIIQYISSS DWKSG +VLNQKVDP+VSWDPYLR TLTF+
Sbjct: 488 FSKLGDSIYFEEEGKSPEVYIIQYISSSLDWKSGQIVLNQKVDPVVSWDPYLRTTLTFTP 547
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
K+ GQ S++NLR+PVW S+GA+AS+N Q+LP+P P +FLS T WS DKLT+QLP+
Sbjct: 548 KEGAGQSSTINLRIPVWASSSGAKASINAQDLPVPAPSSFLSLTRNWSPGDKLTLQLPIR 607
Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
LRTEAI+DDRP+YASIQAIL+GPYLLAG TS +WDIKTG+A SLS I+PIP S N++LV
Sbjct: 608 LRTEAIKDDRPKYASIQAILYGPYLLAGLTSDDWDIKTGSATSLSDWITPIPASDNSRLV 667
Query: 663 TFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKS 722
+ +QESGNS+FV SNSNQSITME+FP GTDA+LHATFRL+LKDA+ S + IGKS
Sbjct: 668 SLSQESGNSSFVFSNSNQSITMEKFPEEGTDASLHATFRLVLKDATSLKVLSPKDAIGKS 727
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 62/112 (55%), Gaps = 19/112 (16%)
Query: 750 KEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNF----EPGASLKLLCSTESLDA 805
+E G+S F N+++++E +G S F + SLK+L +++
Sbjct: 671 QESGNSSFVF----SNSNQSITMEKFPEEGTDASLHATFRLVLKDATSLKVLSPKDAIGK 726
Query: 806 GFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNIQD 857
GIS+YHPISFVAKG +RNFLL PLL RDE+YTVYFNIQD
Sbjct: 727 S-----------GISQYHPISFVAKGMKRNFLLTPLLGLRDESYTVYFNIQD 767
>gi|297807305|ref|XP_002871536.1| hypothetical protein ARALYDRAFT_909245 [Arabidopsis lyrata subsp.
lyrata]
gi|297317373|gb|EFH47795.1| hypothetical protein ARALYDRAFT_909245 [Arabidopsis lyrata subsp.
lyrata]
Length = 862
Score = 1109 bits (2869), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 542/859 (63%), Positives = 660/859 (76%), Gaps = 13/859 (1%)
Query: 5 FVLFFFFCFGL-ALGKQCTNQ-SPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAW 61
VL + F L + K+CTN + SH FR EL S N+T K E+ SH+HLTPTDD+AW
Sbjct: 9 IVLLLYTSFVLVCVAKECTNTPTQLSSHTFRSELLQSKNETLKTELFSHYHLTPTDDAAW 68
Query: 62 SSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQT 121
S+L+P K+L ++ DE +W +LYR K+ GNFLKEVSLHDV LD +S RAQQT
Sbjct: 69 STLLPRKMLKEEADEFAWTMLYRTFKDSNS---SGNFLKEVSLHDVRLDPNSFHGRAQQT 125
Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH 181
NLEYLLMLDVD L WSFRK A L PG YGGWE P SELRGHFVGHYLSA+A MWASTH
Sbjct: 126 NLEYLLMLDVDGLAWSFRKEAGLDAPGDHYGGWEKPDSELRGHFVGHYLSATAYMWASTH 185
Query: 182 NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLD 241
N T+KEKMS +V +LSECQ K GTGYLSAFP+ FD FEA+ PVWAPYYTIHKI+AGL+D
Sbjct: 186 NDTLKEKMSALVSALSECQQKSGTGYLSAFPSSFFDRFEAITPVWAPYYTIHKIIAGLVD 245
Query: 242 QYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITH 301
QY LA N+QAL+MAT M +YFY RV+ VI YSVERHW SLNEETGGMND+LY+LYSIT
Sbjct: 246 QYKLAGNSQALQMATGMADYFYGRVRNVIRKYSVERHWQSLNEETGGMNDILYQLYSITG 305
Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTF 361
D K+LLLAHLFDKPCFLG LA+QAD +S FH+NTHIPIV+GSQ RYE+TGDPL+K I F
Sbjct: 306 DSKYLLLAHLFDKPCFLGVLAIQADDISGFHSNTHIPIVVGSQQRYEITGDPLHKEISIF 365
Query: 362 FMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIA 421
FMDIVNASHSYATGGTS EFW +PKR+A TL +ENEE+CTTYNMLKVSR+LFRWTKE++
Sbjct: 366 FMDIVNASHSYATGGTSVSEFWQNPKRMATTLQTENEESCTTYNMLKVSRNLFRWTKEVS 425
Query: 422 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
YADYYERALTNGVL IQRGT+PG+MIYMLPLG+GVSKA + HGWGT ++SFWCCYGTGIE
Sbjct: 426 YADYYERALTNGVLGIQRGTQPGLMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYGTGIE 485
Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF- 540
SFSKLGDSIYF+E+ P LY+ QYISSS DWKS + L+QKV+P+VSWDPY+R+T +F
Sbjct: 486 SFSKLGDSIYFQEDDVSPALYVTQYISSSLDWKSAGLSLSQKVNPVVSWDPYMRVTFSFS 545
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP--PPGNFLSATERWSYNDKLTIQ 598
SSK + + S+LNLR+PVWT S GA+ SLNGQ+L +P NFLS + W D+LT++
Sbjct: 546 SSKGGMAKESTLNLRIPVWTNSVGAKISLNGQSLKVPNFRTRNFLSIKQNWKSGDQLTME 605
Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFN 658
LPLS+RTEAI+DDR EY+S+QAIL+GPYLLAGHTS +W I T I+PIP + N
Sbjct: 606 LPLSIRTEAIKDDRQEYSSLQAILYGPYLLAGHTSRDWSIT--TQAKAGKWITPIPETQN 663
Query: 659 AQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNV 718
+ LVT +Q+SG+ ++V SNSNQ+ITM P GT A+ ATFRL+ D S S +
Sbjct: 664 SYLVTLSQQSGDISYVFSNSNQTITMRVSPEPGTQDAVAATFRLV-TDNSKPRISGPEAL 722
Query: 719 IGKSVMLEPFDFPGMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAENR 777
IG V LEPFDFPGM+V+Q + L V + SP + G+S FRLV+G+D + +VSL E++
Sbjct: 723 IGSLVKLEPFDFPGMIVKQATDSSLTVQASSPSDKGASSFRLVSGVDGKPGSVSLRLESK 782
Query: 778 KGCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFL 837
KGCFV S + G L+L C + + D F AASF ++ G+++Y+P+SFV G +RNF+
Sbjct: 783 KGCFVYSDQTLKQGTKLRLECGSAATDEKFKEAASFKLKTGMNQYNPMSFVMSGTQRNFV 842
Query: 838 LAPLLSFRDEAYTVYFNIQ 856
L+PL S RDE Y VYF++Q
Sbjct: 843 LSPLFSLRDETYNVYFSVQ 861
>gi|30684197|ref|NP_196800.2| uncharacterized protein [Arabidopsis thaliana]
gi|28393685|gb|AAO42255.1| unknown protein [Arabidopsis thaliana]
gi|332004452|gb|AED91835.1| uncharacterized protein [Arabidopsis thaliana]
Length = 865
Score = 1107 bits (2862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 543/856 (63%), Positives = 657/856 (76%), Gaps = 14/856 (1%)
Query: 5 FVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
+ F C L K+CT+ + SH R EL S N K E SH+HLTPTDDSAWS
Sbjct: 19 YTSFLLVC----LAKECTDIPTKLSSHTLRSELLQSQNANLKSEEFSHYHLTPTDDSAWS 74
Query: 63 SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
+L+P K+L ++ D+ +W +LYRK K+ GNFLK+VSLHDV LD SS WRAQQTN
Sbjct: 75 TLLPRKMLKEETDDFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPSSFHWRAQQTN 131
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
LEYLLMLDVD L ++FRK A L PG YGGWE P SELRGHFVGHYLSA+A MWASTHN
Sbjct: 132 LEYLLMLDVDGLAYNFRKEAGLNAPGVPYGGWEKPDSELRGHFVGHYLSATAYMWASTHN 191
Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
T+K KM+ +V +L+ECQ K GTGYLSAFP+ FD FEA+ VWAPYYTIHKILAGL+DQ
Sbjct: 192 ETLKAKMTALVSALAECQQKYGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAGLVDQ 251
Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
Y LA N QALKMAT M +YFY RVQ VI YSVERHW SLNEETGGMNDVLY+LYSIT D
Sbjct: 252 YKLAGNTQALKMATGMADYFYGRVQNVIKKYSVERHWLSLNEETGGMNDVLYQLYSITRD 311
Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
K+L LAHLFDKPCFLG LA+QAD +S FHANTHIPIV+GSQ RYE+TGD L+K I FF
Sbjct: 312 SKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEIPMFF 371
Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
MDIVNASHSYATGGTS +EFW DPKR+A TL +ENEE+CTTYNMLKVSR+LFRWTKE++Y
Sbjct: 372 MDIVNASHSYATGGTSVKEFWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWTKEVSY 431
Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
ADYYERALTNGVL IQRGT+PG MIYMLPLG+GVSKA + HGWGT ++SFWCCYGTGIES
Sbjct: 432 ADYYERALTNGVLGIQRGTDPGRMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYGTGIES 491
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF-S 541
FSKLGDSIYF+E+G P LY+ QYISSS DWKS + ++QKV+P+VSWDPY+R+T T S
Sbjct: 492 FSKLGDSIYFQEDGATPALYVTQYISSSLDWKSAGLSISQKVNPVVSWDPYMRVTFTLSS 551
Query: 542 SKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
SK V + S+LNLR+PVWT S GA+ SLNG+ L +P GNFLS ++W D++T++LP+
Sbjct: 552 SKVGVAKESTLNLRIPVWTNSIGAKVSLNGRPLNVPTSGNFLSIKQKWKSGDQVTMELPM 611
Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQL 661
S+RTEAI+DDRPEYAS+QAIL+GPYLLAGHTS +W I T I+PIP + N+ L
Sbjct: 612 SIRTEAIKDDRPEYASLQAILYGPYLLAGHTSMDWSIT--TQAKAGNWITPIPETLNSHL 669
Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGK 721
VT +Q+SGN ++V+SNSNQ+I M+ P GT A+ ATFRL+ D S SS +IG
Sbjct: 670 VTLSQQSGNISYVLSNSNQTIIMKVSPEPGTQDAVSATFRLVTDD-SKHPISSPEGLIGS 728
Query: 722 SVMLEPFDFPGMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGC 780
VMLEPFDFPGM+V+Q + L V + SP + GSS FRLV+GLD + +VSL E++KGC
Sbjct: 729 LVMLEPFDFPGMIVKQATDSSLTVQASSPSDKGSSSFRLVSGLDGKPGSVSLSLESKKGC 788
Query: 781 FVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAP 840
FV S + G L+L C + + D F +AASF ++ G+++Y+P+SFV G +RNF+L+P
Sbjct: 789 FVYSDQTLKQGTKLRLECGSAATDEKFKQAASFSLKTGMNQYNPMSFVMSGTQRNFVLSP 848
Query: 841 LLSFRDEAYTVYFNIQ 856
L S RDE Y VYF++Q
Sbjct: 849 LFSLRDETYNVYFSVQ 864
>gi|7630052|emb|CAB88260.1| putative protein [Arabidopsis thaliana]
Length = 860
Score = 1106 bits (2861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 543/856 (63%), Positives = 657/856 (76%), Gaps = 14/856 (1%)
Query: 5 FVLFFFFCFGLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAWS 62
+ F C L K+CT+ + SH R EL S N K E SH+HLTPTDDSAWS
Sbjct: 14 YTSFLLVC----LAKECTDIPTKLSSHTLRSELLQSQNANLKSEEFSHYHLTPTDDSAWS 69
Query: 63 SLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTN 122
+L+P K+L ++ D+ +W +LYRK K+ GNFLK+VSLHDV LD SS WRAQQTN
Sbjct: 70 TLLPRKMLKEETDDFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPSSFHWRAQQTN 126
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
LEYLLMLDVD L ++FRK A L PG YGGWE P SELRGHFVGHYLSA+A MWASTHN
Sbjct: 127 LEYLLMLDVDGLAYNFRKEAGLNAPGVPYGGWEKPDSELRGHFVGHYLSATAYMWASTHN 186
Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
T+K KM+ +V +L+ECQ K GTGYLSAFP+ FD FEA+ VWAPYYTIHKILAGL+DQ
Sbjct: 187 ETLKAKMTALVSALAECQQKYGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAGLVDQ 246
Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
Y LA N QALKMAT M +YFY RVQ VI YSVERHW SLNEETGGMNDVLY+LYSIT D
Sbjct: 247 YKLAGNTQALKMATGMADYFYGRVQNVIKKYSVERHWLSLNEETGGMNDVLYQLYSITRD 306
Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
K+L LAHLFDKPCFLG LA+QAD +S FHANTHIPIV+GSQ RYE+TGD L+K I FF
Sbjct: 307 SKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEIPMFF 366
Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
MDIVNASHSYATGGTS +EFW DPKR+A TL +ENEE+CTTYNMLKVSR+LFRWTKE++Y
Sbjct: 367 MDIVNASHSYATGGTSVKEFWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWTKEVSY 426
Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
ADYYERALTNGVL IQRGT+PG MIYMLPLG+GVSKA + HGWGT ++SFWCCYGTGIES
Sbjct: 427 ADYYERALTNGVLGIQRGTDPGRMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYGTGIES 486
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF-S 541
FSKLGDSIYF+E+G P LY+ QYISSS DWKS + ++QKV+P+VSWDPY+R+T T S
Sbjct: 487 FSKLGDSIYFQEDGATPALYVTQYISSSLDWKSAGLSISQKVNPVVSWDPYMRVTFTLSS 546
Query: 542 SKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
SK V + S+LNLR+PVWT S GA+ SLNG+ L +P GNFLS ++W D++T++LP+
Sbjct: 547 SKVGVAKESTLNLRIPVWTNSIGAKVSLNGRPLNVPTSGNFLSIKQKWKSGDQVTMELPM 606
Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQL 661
S+RTEAI+DDRPEYAS+QAIL+GPYLLAGHTS +W I T I+PIP + N+ L
Sbjct: 607 SIRTEAIKDDRPEYASLQAILYGPYLLAGHTSMDWSIT--TQAKAGNWITPIPETLNSHL 664
Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGK 721
VT +Q+SGN ++V+SNSNQ+I M+ P GT A+ ATFRL+ D S SS +IG
Sbjct: 665 VTLSQQSGNISYVLSNSNQTIIMKVSPEPGTQDAVSATFRLVTDD-SKHPISSPEGLIGS 723
Query: 722 SVMLEPFDFPGMLVQQGKEDELVV-SESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGC 780
VMLEPFDFPGM+V+Q + L V + SP + GSS FRLV+GLD + +VSL E++KGC
Sbjct: 724 LVMLEPFDFPGMIVKQATDSSLTVQASSPSDKGSSSFRLVSGLDGKPGSVSLSLESKKGC 783
Query: 781 FVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAP 840
FV S + G L+L C + + D F +AASF ++ G+++Y+P+SFV G +RNF+L+P
Sbjct: 784 FVYSDQTLKQGTKLRLECGSAATDEKFKQAASFSLKTGMNQYNPMSFVMSGTQRNFVLSP 843
Query: 841 LLSFRDEAYTVYFNIQ 856
L S RDE Y VYF++Q
Sbjct: 844 LFSLRDETYNVYFSVQ 859
>gi|357139358|ref|XP_003571249.1| PREDICTED: uncharacterized protein LOC100841742 [Brachypodium
distachyon]
Length = 883
Score = 992 bits (2564), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/822 (58%), Positives = 616/822 (74%), Gaps = 29/822 (3%)
Query: 52 HLTPTDDSAWSSLIPSKILGDQ--------KDEVSWALLYRKIKNPGGFDLPGN------ 97
HL PTD+SAW +L+P ++L ++ W +LYRK++ G + G
Sbjct: 71 HLIPTDESAWMALMPRRLLAGGAGGNGAPPREAFDWLMLYRKLRGGGDGAIDGPAAAAAG 130
Query: 98 -FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
FL E SLHDV L +V W+AQQTNLEYLL+LD D LVWSFR A LP G YGGWE
Sbjct: 131 PFLSEASLHDVRLQPGTVYWQAQQTNLEYLLLLDADRLVWSFRTQAGLPATGTPYGGWEG 190
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
P ELRGHFVGHYL+A+A+MWASTHN T++ KMS+V+ +L +CQ K+G GYLSAFPTE F
Sbjct: 191 PSVELRGHFVGHYLTAAAKMWASTHNDTLRTKMSSVIDTLYDCQKKMGMGYLSAFPTEFF 250
Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
D EAL VWAPYYTIHKI+ GLLDQY +A +++AL+M M +YF RV+ VI YS+E
Sbjct: 251 DRAEALTTVWAPYYTIHKIMQGLLDQYTVAGSSKALEMVVGMADYFSGRVKNVIQKYSIE 310
Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
RHW SLNEETGGMNDVLY+LY+IT+D KHL LAHLFDKPCFLG LA+QAD +S FH+NTH
Sbjct: 311 RHWASLNEETGGMNDVLYQLYAITNDLKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTH 370
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
IP+VIG+QMRYEVTGD LYK I + FMD++N+SHSYATGGTSA EFW+DPKRLA TL +E
Sbjct: 371 IPVVIGAQMRYEVTGDVLYKQIASSFMDMINSSHSYATGGTSAGEFWYDPKRLAATLSTE 430
Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
NEE+CTTYNMLKVSR+LFRWTKEI+YADYYERAL NGVLSIQRGT+PGVMIYMLP G
Sbjct: 431 NEESCTTYNMLKVSRNLFRWTKEISYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGR 490
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
SKA HGWGT ++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+F+WK+
Sbjct: 491 SKAVGYHGWGTLYDSFWCCYGTGIESFSKLGDSIYFEEKGHAPALNIIQYIPSTFNWKTA 550
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
+ + Q+++ + S DPYLR++L+ S+K GQ ++LN+R+P WT +NG +A+L G++L L
Sbjct: 551 GLTVTQQLESLSSSDPYLRVSLSVSAK---GQSATLNVRIPTWTSANGTKATLTGKDLGL 607
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
PG LS +++W+ ++ L++Q P+SLRTEAI+DDRP+YAS+QAILFGP++LAG +SG+W
Sbjct: 608 VTPGTLLSISKQWNSDEHLSLQFPISLRTEAIKDDRPQYASLQAILFGPFVLAGLSSGDW 667
Query: 637 DIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAA 695
D K +A +S I+ +P S+N+QL+TFTQES TFV+S+SN S+TM+E P + GTD A
Sbjct: 668 DAKASSA--VSDWITAVPSSYNSQLMTFTQESNGKTFVLSSSNGSLTMQERPSIDGTDTA 725
Query: 696 LHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSS 755
+HATFR+ +D++ + + G V +EPFD PG ++ ++ S ++ +S
Sbjct: 726 VHATFRVHSQDSTSQQGTYNAALKGTPVQIEPFDLPGTVITNN------LTFSAQKSSAS 779
Query: 756 GFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASF 813
F +V GLD + +VSLE + GCF+ SG ++ G +++ C S +S+ F +AASF
Sbjct: 780 FFDIVPGLDGKPNSVSLELGTKSGCFMVSGADYSAGTKIQVSCKSSLQSIGGIFEQAASF 839
Query: 814 MMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
+ + +YHPISFVAKG RRNFLL PL S RDE YTVYFN+
Sbjct: 840 VQATPLRQYHPISFVAKGVRRNFLLEPLYSLRDEFYTVYFNL 881
>gi|242060854|ref|XP_002451716.1| hypothetical protein SORBIDRAFT_04g006520 [Sorghum bicolor]
gi|241931547|gb|EES04692.1| hypothetical protein SORBIDRAFT_04g006520 [Sorghum bicolor]
Length = 888
Score = 988 bits (2554), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 486/814 (59%), Positives = 613/814 (75%), Gaps = 18/814 (2%)
Query: 52 HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLP-------GNFLKEVSL 104
HLTPTD+S W SL+P + L +++ W +LYRK++ P G FL + SL
Sbjct: 81 HLTPTDESTWMSLMPRRAL-RREEAFDWLMLYRKLRGATAGGAPRRPGVAAGTFLSDASL 139
Query: 105 HDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGH 164
HDV L+ S+ WRAQQTNLEYLL+LDVD LVWSFRK A L PG YGGWE P ELRGH
Sbjct: 140 HDVRLEPGSLYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPDVELRGH 199
Query: 165 FVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKP 224
FVGHYLSA+A+MWASTHN T+ KMS+V+ +LS+CQ K+GTGYLSAFPTE FD EA+KP
Sbjct: 200 FVGHYLSATAKMWASTHNDTLNAKMSSVIDALSDCQKKMGTGYLSAFPTEFFDRVEAIKP 259
Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
VWAPYYTIHKI+ GLLDQY +A N++AL M M YF +RV+ VI YS+ERHW SLNE
Sbjct: 260 VWAPYYTIHKIMQGLLDQYTVAGNSKALDMVVNMANYFSDRVKNVIQKYSIERHWESLNE 319
Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
ETGGMNDVLY+LY+IT+D KHL LAHLFDKPCFLG LA+QAD +S FH+NTHIP+VIG+Q
Sbjct: 320 ETGGMNDVLYQLYTITNDLKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQ 379
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
MRYEVTGDPLYK I +FFMD +N+SHSYATGGTSA EFW DPK LA TL +ENEE+CTTY
Sbjct: 380 MRYEVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKHLAGTLSTENEESCTTY 439
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
NMLK+SR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA S H
Sbjct: 440 NMLKISRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHS 499
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
WGTK++SFWCCYGTGIESFSKLGDSIYFEE+ ++P L IIQYI S++DWK+ +++ QKV
Sbjct: 500 WGTKYDSFWCCYGTGIESFSKLGDSIYFEEKEDLPALNIIQYIPSTYDWKAAGLIVTQKV 559
Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLS 584
+ + S D YL+++L+ S+K + GQ + LN+R+P WT+++GA A+LN ++L PG+FLS
Sbjct: 560 NTLSSSDQYLQISLSISAKTK-GQTAKLNVRIPSWTFADGAGATLNDKDLGSISPGSFLS 618
Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTAR 644
T++W+ +D L ++ P+ LRTEAI+DDRPEYAS+QA+LFGP++LAG ++G+WD K G
Sbjct: 619 ITKQWNSDDHLALRFPIRLRTEAIKDDRPEYASLQAVLFGPFVLAGLSTGDWDAKAGNGS 678
Query: 645 SLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATFRLI 703
++S I+ +PP+ N+QLVTF+Q S TFV+S++N ++TM+E P V GTD A+HATFR
Sbjct: 679 AISDWITAVPPAHNSQLVTFSQVSNGKTFVLSSANGTLTMQERPEVDGTDTAIHATFRAH 738
Query: 704 LKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGL 763
+D++ + G S+++EPFD PG ++ ++ S ++ F LV GL
Sbjct: 739 PQDSTELHDIYRTIAKGASILIEPFDLPGTVITNN------LTLSAQKSTDCLFNLVPGL 792
Query: 764 DKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIGISE 821
D +VSLE R GCF+ +G N+ G +++ C S ES+ +AASF + +
Sbjct: 793 DGNPNSVSLELGTRPGCFLVTGTNYSAGTKIQVSCKSSLESIGGILEQAASFSQTDPLRQ 852
Query: 822 YHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
YHPISFVAKG RNFLL PL S RDE YTVYFNI
Sbjct: 853 YHPISFVAKGMTRNFLLEPLYSLRDEFYTVYFNI 886
>gi|219885159|gb|ACL52954.1| unknown [Zea mays]
Length = 879
Score = 985 bits (2547), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 500/874 (57%), Positives = 631/874 (72%), Gaps = 30/874 (3%)
Query: 4 GFVLFFFFCFGL--ALGKQCTNQSP-YDSHAFRYELT---STNKTWKEEVLSHF------ 51
G V+ G A GK CTN P SH R T + ++ H
Sbjct: 14 GIVVVMLLAAGFRGAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQ 73
Query: 52 HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPG----NFLKEVSLHDV 107
HLTPTD+S W SL+P + L +++ W +LYR+++ GG PG FL E SLHDV
Sbjct: 74 HLTPTDESTWMSLMPRRAL-RREEAFDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDV 132
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG 167
L+ S+ WRAQQTNLEYLL+LDVD LVWSFRK A L PG YGGWE P +LRGHFVG
Sbjct: 133 RLEPGSMYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVG 192
Query: 168 HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWA 227
HYLSA+A+MWASTHN T+ KMS+VV +L +CQ K+GTGYLSAFP++ FD EA+K VWA
Sbjct: 193 HYLSATAKMWASTHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWA 252
Query: 228 PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
PYYTIHKI+ GLLDQY +A N+ AL M M YF +RV+ VI YS+ERHW SLNEETG
Sbjct: 253 PYYTIHKIMQGLLDQYTVAGNSMALDMVIKMANYFSDRVKNVIQNYSIERHWESLNEETG 312
Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
GMNDVLY+LY+ITHD KHL LAHLFDKPCFLG LA+QAD +S FH+NTHIP+VIG+QMRY
Sbjct: 313 GMNDVLYQLYTITHDMKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRY 372
Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNML 407
EVTGDPLYK I +FFMD +N+SHSYATGGTSA EFW DPKRLA TL +ENEE+CTTYNML
Sbjct: 373 EVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTYNML 432
Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
KVSR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA S HGWGT
Sbjct: 433 KVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHGWGT 492
Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI 527
K++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+ + + Q++ +
Sbjct: 493 KYDSFWCCYGTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQQIKTL 552
Query: 528 VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATE 587
S D YL+++ + S+ GQ +++N R+P WT+++GA A+LNG++L PG+FLS T+
Sbjct: 553 SSSDQYLQISFSISANTS-GQTANINFRIPSWTFADGAGATLNGKDLGSISPGSFLSITK 611
Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLS 647
+W+ +D L + P+ LRTEAI+DDR EYAS+QA+LFGP++LAG ++G+WD K G ++S
Sbjct: 612 QWNSDDHLALHFPIRLRTEAIKDDRLEYASLQAVLFGPFVLAGLSTGDWDAKAGNGSAIS 671
Query: 648 ALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATFRLILKD 706
I+ +PP+ N+QLVTFTQ S FV+S++N ++TM+E P V GTDAA+HATFR ++
Sbjct: 672 DWIAAVPPAHNSQLVTFTQVSNGKAFVLSSANGTLTMQERPEVDGTDAAVHATFRAHPQE 731
Query: 707 AS--LSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLD 764
S L + S + G S++LEPFD PG ++ ++ S ++ S F +V GLD
Sbjct: 732 DSTELHDIYS-TTLTGTSILLEPFDLPGTVITNN------LTLSAQKSSDSLFNIVPGLD 784
Query: 765 KRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIGISEY 822
+VSLE + GCF+ +G N+ G +++ C S ES+ +AASF + +Y
Sbjct: 785 GNPNSVSLELGTKPGCFLVTGTNYSAGTRIEVNCKSSLESIGGILEQAASFSQTDPLRQY 844
Query: 823 HPISFVAKGARRNFLLAPLLSFRDEAYTVYFNIQ 856
HPISFVAKG RNFLL PL S RDE YTVYFN++
Sbjct: 845 HPISFVAKGVARNFLLEPLYSLRDEFYTVYFNVR 878
>gi|326495110|dbj|BAJ85651.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 868
Score = 985 bits (2547), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 485/815 (59%), Positives = 612/815 (75%), Gaps = 21/815 (2%)
Query: 52 HLTPTDDSAWSSLIPSKILGD------QKDEVSWALLYRKIKN-PGGFDLP-GNFLKEVS 103
HLTPTD+SAW L+P + L ++ W +LYR+++ D P G FL E S
Sbjct: 62 HLTPTDESAWMELMPRRSLSGGGGSTPPREAFDWLMLYRRLRGGAAAVDGPAGPFLSEAS 121
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
LHDV L ++ W+AQQTNLEYLL+LD D LVWSFR A L G YGGWE P ELRG
Sbjct: 122 LHDVRLQPGTIYWQAQQTNLEYLLLLDTDRLVWSFRTQAGLTATGTPYGGWEGPNVELRG 181
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK 223
HFVGHYLSA+A+MWASTHN T++ KMS+VV L +CQ K+GTGYLSAFP+E FD EAL
Sbjct: 182 HFVGHYLSATAKMWASTHNDTLRAKMSSVVDVLYDCQKKMGTGYLSAFPSEFFDRAEALT 241
Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
VWAPYYTIHK++ GLLDQY +A N++AL+M M YF +RV+ +I YS+ERHW SLN
Sbjct: 242 TVWAPYYTIHKVMQGLLDQYTVAGNSKALEMVVGMANYFSDRVKNIIQKYSIERHWASLN 301
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGS 343
EETGGMNDVLY+LY+IT D KHL LAHLFDKPCFLG LALQAD +S FH+NTHIP+V+G+
Sbjct: 302 EETGGMNDVLYQLYTITDDLKHLTLAHLFDKPCFLGLLALQADSISGFHSNTHIPVVVGA 361
Query: 344 QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTT 403
QMRYEVTGD LYK I T FMD++N+SHSYATGGTSA EFW DPKRLA TL +EN E+CTT
Sbjct: 362 QMRYEVTGDVLYKQIATSFMDMINSSHSYATGGTSAGEFWSDPKRLAATLSTENAESCTT 421
Query: 404 YNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH 463
YNMLKVSR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA S H
Sbjct: 422 YNMLKVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVSYH 481
Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
GWGTK++SFWCCYGTGIESFSKLGDSIYFEE+G P L IIQYI S+F+WK+ V + Q+
Sbjct: 482 GWGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGETPALSIIQYIPSTFNWKTAGVTVTQQ 541
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
++P+ S D ++++L+FS K GQ ++LN+R+P WT ++GA+A+LN ++L PG+ L
Sbjct: 542 LEPLSSPDMNVQVSLSFSGKN--GQSATLNVRIPTWTSASGAKATLNDKDLGSVTPGSLL 599
Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTA 643
S T++W+ ND L++Q P++LRTEAI+DDRPEYAS+QAILFGP++LAG +S + D KTG+A
Sbjct: 600 SVTKQWNSNDHLSLQFPIALRTEAIKDDRPEYASLQAILFGPFVLAGLSSSDCDAKTGSA 659
Query: 644 RSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATFRL 702
+S I+ +P S N+QL+TFTQES TFV+S+SN S+TM+E P V GTD A+HATFR+
Sbjct: 660 --VSDWITAVPSSHNSQLMTFTQESSGKTFVLSSSNGSLTMQERPTVDGTDTAIHATFRV 717
Query: 703 ILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAG 762
+D + + + + SV++EPFD PG + ++L +S + K GS F +V+G
Sbjct: 718 HPQDTARLHGTYGATLQDTSVLIEPFDMPGTAI----ANDLTLS-TQKSTGSL-FNIVSG 771
Query: 763 LDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIGIS 820
LD + +VSLE + GCF+ SG ++ G +++ C S +S+ F +AASF +
Sbjct: 772 LDGKPNSVSLELGTKPGCFLVSGADYSAGTKIQVSCKSSIQSIGGIFEQAASFAQAAPLR 831
Query: 821 EYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
+YHPISFVAKG +RNFLL PL S RDE YT YFN+
Sbjct: 832 QYHPISFVAKGVQRNFLLEPLYSLRDEFYTAYFNL 866
>gi|226497412|ref|NP_001145969.1| uncharacterized protein LOC100279496 precursor [Zea mays]
gi|223945575|gb|ACN26871.1| unknown [Zea mays]
Length = 879
Score = 985 bits (2546), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 500/874 (57%), Positives = 631/874 (72%), Gaps = 30/874 (3%)
Query: 4 GFVLFFFFCFGL--ALGKQCTNQSP-YDSHAFRYELT---STNKTWKEEVLSHF------ 51
G V+ G A GK CTN P SH R T + ++ H
Sbjct: 14 GIVVVMLLAAGFRGAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQ 73
Query: 52 HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPG----NFLKEVSLHDV 107
HLTPTD+S W SL+P + L +++ W +LYR+++ GG PG FL E SLHDV
Sbjct: 74 HLTPTDESTWMSLMPRRAL-RREEAFDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDV 132
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG 167
L+ S+ WRAQQTNLEYLL+LDVD LVWSFRK A L PG YGGWE P +LRGHFVG
Sbjct: 133 RLEPGSMYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVG 192
Query: 168 HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWA 227
HYLSA+A+MWASTHN T+ KMS+VV +L +CQ K+GTGYLSAFP++ FD EA+K VWA
Sbjct: 193 HYLSATAKMWASTHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWA 252
Query: 228 PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
PYYTIHKI+ GLLDQY +A N+ AL M M YF +RV+ VI YS+ERHW SLNEETG
Sbjct: 253 PYYTIHKIMQGLLDQYTVAGNSMALDMVIKMANYFSDRVKNVIQNYSIERHWESLNEETG 312
Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
GMNDVLY+LY+ITHD KHL LAHLFDKPCFLG LA+QAD +S FH+NTHIP+VIG+QMRY
Sbjct: 313 GMNDVLYQLYTITHDMKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRY 372
Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNML 407
EVTGDPLYK I +FFMD +N+SHSYATGGTSA EFW DPKRLA TL +ENEE+CTTYNML
Sbjct: 373 EVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTYNML 432
Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
KVSR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA S HGWGT
Sbjct: 433 KVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHGWGT 492
Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI 527
K++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+ + + Q++ +
Sbjct: 493 KYDSFWCCYGTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQQIKTL 552
Query: 528 VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATE 587
S D YL+++ + S+ GQ +++N R+P WT+++GA A+LNG++L PG+FLS T+
Sbjct: 553 SSSDQYLQISFSISANTS-GQTANINFRIPSWTFADGAGATLNGKDLGSISPGSFLSITK 611
Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLS 647
+W+ +D L + P+ LRTEAI+DDR EYAS+QA+LFGP++LAG ++G+WD K G ++S
Sbjct: 612 QWNSDDHLALHFPIRLRTEAIKDDRLEYASLQAVLFGPFVLAGLSTGDWDAKAGNGSAIS 671
Query: 648 ALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATFRLILKD 706
I+ +PP+ N+QLVTFTQ S FV+S++N ++TM+E P V GTDAA+HATFR ++
Sbjct: 672 DWIAAVPPAHNSQLVTFTQVSNGKAFVLSSANGTLTMQERPEVDGTDAAIHATFRAHPQE 731
Query: 707 AS--LSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLD 764
S L + S + G S++LEPFD PG ++ ++ S ++ S F +V GLD
Sbjct: 732 DSTELHDIYS-TTLTGTSILLEPFDLPGTVITNN------LTLSAQKSSDSLFNIVPGLD 784
Query: 765 KRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIGISEY 822
+VSLE + GCF+ +G N+ G +++ C S ES+ +AASF + +Y
Sbjct: 785 GNPNSVSLELGTKPGCFLVTGTNYSAGTRIEVNCKSSLESIGGILEQAASFSQTDPLRQY 844
Query: 823 HPISFVAKGARRNFLLAPLLSFRDEAYTVYFNIQ 856
HPISFVAKG RNFLL PL S RDE YTVYFN++
Sbjct: 845 HPISFVAKGVARNFLLEPLYSLRDEFYTVYFNVR 878
>gi|115444811|ref|NP_001046185.1| Os02g0195500 [Oryza sativa Japonica Group]
gi|49388119|dbj|BAD25250.1| unknown protein [Oryza sativa Japonica Group]
gi|113535716|dbj|BAF08099.1| Os02g0195500 [Oryza sativa Japonica Group]
gi|125581152|gb|EAZ22083.1| hypothetical protein OsJ_05746 [Oryza sativa Japonica Group]
Length = 891
Score = 979 bits (2532), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 482/817 (58%), Positives = 619/817 (75%), Gaps = 20/817 (2%)
Query: 52 HLTPTDDSAWSSLIPSKILGD-----QKDEVSWALLYRKIKNPGGFDLPGN-----FLKE 101
HLTPTD+S W SL+P ++L ++D W +LYR ++ G L E
Sbjct: 80 HLTPTDESTWMSLMPRRLLASPVSSPRRDAFDWLMLYRNLRGSGSGAGAIAASGGALLAE 139
Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
SLHDV L +V W+AQQTNLEYLL+LDVD LVWSFR A LP G YGGWE P EL
Sbjct: 140 ASLHDVRLQPGTVYWQAQQTNLEYLLLLDVDRLVWSFRTQAGLPASGAPYGGWEGPGVEL 199
Query: 162 RGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA 221
RGHFVGHYLSA+A+MWASTHN T+ KMS+VV +L +CQ K+G+GYLSAFP+E FD E+
Sbjct: 200 RGHFVGHYLSATAKMWASTHNDTLLAKMSSVVDALHDCQKKMGSGYLSAFPSEFFDRVES 259
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
+K VWAPYYTIHKI+ GLLDQY +A N++AL + M YF +RV+ VI YS+ERHW S
Sbjct: 260 IKAVWAPYYTIHKIMQGLLDQYTVAGNSKALDLVVGMANYFSDRVKNVIQKYSIERHWAS 319
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
LNEE+GGMNDVLY+LY+IT+D KHL LAHLFDKPCFLG LA+QAD +S FH+NTHIP+VI
Sbjct: 320 LNEESGGMNDVLYQLYTITNDQKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVI 379
Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
G+QMRYEVTGD LYK I TFFMD +N+SHSYATGGTSA EFW +PKRLADTL +ENEE+C
Sbjct: 380 GAQMRYEVTGDLLYKQIATFFMDTINSSHSYATGGTSAGEFWTNPKRLADTLSTENEESC 439
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
TTYNMLKVSR+LFRWTKE++YADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA S
Sbjct: 440 TTYNMLKVSRNLFRWTKELSYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVS 499
Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
HGWGTK++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+ + +N
Sbjct: 500 YHGWGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDRPVLNIIQYIPSAYNWKAAGLTVN 559
Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
Q++ PI S D +L+++L+ S+K GQ ++LN+R+P WT +NGA+A+LN +L L PG+
Sbjct: 560 QQLKPISSLDMFLQVSLSTSAKTN-GQSATLNVRIPSWTSANGAKATLNDNDLGLMSPGS 618
Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG 641
FLS +++W+ +D L++Q P++LRTEAI+DDRPEYAS+QAILFGP++LAG ++G+W+ + G
Sbjct: 619 FLSISKQWNSDDHLSLQFPITLRTEAIKDDRPEYASLQAILFGPFVLAGLSTGDWNAEAG 678
Query: 642 TARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATF 700
++S ISP+P S+N+QLVTFTQES TFV+S++N S+TM+E P V GTD A+HATF
Sbjct: 679 NTSAISDWISPVPSSYNSQLVTFTQESSGKTFVLSSANGSLTMQERPTVDGTDTAIHATF 738
Query: 701 RLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLV 760
R+ +D++ + + G SV +EPFD PG ++ +++S ++ S F +V
Sbjct: 739 RVHPQDSAGQLDTQGATLKGTSVQIEPFDLPGTVITNN------LTQSAQKSSDSLFNIV 792
Query: 761 AGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIG 818
GLD +VSLE + GCF+ GV++ G +++ C S S++ F +AASF+
Sbjct: 793 PGLDGNPNSVSLELGTKPGCFLVIGVDYSVGTKIQVSCKSSLPSINGIFEQAASFVQAAP 852
Query: 819 ISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
+ +YHPISF+AKG +RNFLL PL S RDE YTVYFN+
Sbjct: 853 LRQYHPISFIAKGVKRNFLLEPLYSLRDEFYTVYFNL 889
>gi|125538467|gb|EAY84862.1| hypothetical protein OsI_06226 [Oryza sativa Indica Group]
Length = 891
Score = 979 bits (2531), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/817 (58%), Positives = 619/817 (75%), Gaps = 20/817 (2%)
Query: 52 HLTPTDDSAWSSLIPSKILGD-----QKDEVSWALLYRKIKNPGGFDLPGN-----FLKE 101
HLTPTD+S W SL+P ++L ++D W +LYR ++ G L E
Sbjct: 80 HLTPTDESTWMSLMPRRLLASPASSPRRDAFDWLMLYRNLRGSGSGAGAIAASGGALLAE 139
Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
SLHDV L +V W+AQQTNLEYLL+LDVD LVWSFR A LP G YGGWE P EL
Sbjct: 140 ASLHDVRLQPGTVYWQAQQTNLEYLLLLDVDRLVWSFRTQAGLPASGAPYGGWEGPGVEL 199
Query: 162 RGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA 221
RGHFVGHYLSA+A+MWASTHN T++ KMS+VV +L +CQ K+G+GYLSAFP+E FD E+
Sbjct: 200 RGHFVGHYLSATAKMWASTHNDTLQAKMSSVVDALHDCQKKMGSGYLSAFPSEFFDRVES 259
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
+K VWAPYYTIHKI+ GLLDQY +A N++AL + M YF +RV+ VI YS+ERHW S
Sbjct: 260 IKAVWAPYYTIHKIMQGLLDQYTVAGNSKALDLVVGMANYFSDRVKNVIQKYSIERHWAS 319
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
LNEE+GGMNDVLY+LY+IT+D KHL LAHLFDKPCFLG LA+QAD +S FH+NTHIP+VI
Sbjct: 320 LNEESGGMNDVLYQLYTITNDQKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVI 379
Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
G+QMRYEVTGD LYK I TFFMD +N+SHSYATGGTSA EFW +PKRLADTL +ENEE+C
Sbjct: 380 GAQMRYEVTGDLLYKQIATFFMDTINSSHSYATGGTSAGEFWTNPKRLADTLSTENEESC 439
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
TTYNMLKVSR+LFRWTKE++YADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA S
Sbjct: 440 TTYNMLKVSRNLFRWTKELSYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVS 499
Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
HGWGTK++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+ + +N
Sbjct: 500 YHGWGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDRPVLNIIQYIPSAYNWKAAGLTVN 559
Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
Q++ PI S D +L+++L+ S+K GQ ++LN+R+P WT +NGA+A+LN +L L PG+
Sbjct: 560 QQLKPISSLDMFLQVSLSTSAKTN-GQSATLNVRIPSWTSANGAKATLNDNDLGLMSPGS 618
Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG 641
FLS +++W+ +D L++Q P++LRTEAI+DDRPEYAS+QAILFGP++LAG ++G+W+ + G
Sbjct: 619 FLSISKQWNSDDHLSLQFPITLRTEAIKDDRPEYASLQAILFGPFVLAGLSTGDWNAEAG 678
Query: 642 TARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATF 700
++S ISP+P S+N+QLVTFTQES TFV+S++N S+ M+E P V GTD A+HATF
Sbjct: 679 NTSAISDWISPVPSSYNSQLVTFTQESSGKTFVLSSANGSLAMQERPTVDGTDTAIHATF 738
Query: 701 RLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLV 760
R+ +D++ + + G SV +EPFD PG ++ +++S ++ S F +V
Sbjct: 739 RVHPQDSAGQLDTQGATLKGTSVQIEPFDLPGTVITNN------LTQSAQKSSDSLFNIV 792
Query: 761 AGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIG 818
GLD +VSLE + GCF+ +GV++ G +++ C S S++ F +A SF+
Sbjct: 793 PGLDGNPNSVSLELGTKPGCFLVTGVDYSVGTKIQVSCKSSLPSINGIFEQATSFVQAAP 852
Query: 819 ISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
+ +YHPISF+AKG +RNFLL PL S RDE YTVYFN+
Sbjct: 853 LRQYHPISFIAKGVKRNFLLEPLYSLRDEFYTVYFNL 889
>gi|357123866|ref|XP_003563628.1| PREDICTED: uncharacterized protein LOC100829886 [Brachypodium
distachyon]
Length = 850
Score = 951 bits (2458), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 499/865 (57%), Positives = 616/865 (71%), Gaps = 41/865 (4%)
Query: 15 LALGKQCTN-QSPYDSHAFRYELTS--TNKTWKEEVL--SHFHLTPTDDSAWSSLIPSKI 69
+A+ K+CTN + SH R L + + W+ L H H++PTD++ W L
Sbjct: 1 MAVAKECTNVPTQLSSHTVRARLQGDPSAEEWRLRALFHDHAHVSPTDEATWMDLRAPLA 60
Query: 70 LGDQKDEVSWALLYRKIKNPGGFDLPGN---FLKEVSLHDVWLD--QSSVLWRAQQTNLE 124
+E WA+LYR +K FL+EV L DV LD + +V RAQQTNLE
Sbjct: 61 SSAATEESGWAMLYRALKGSASGGSASAAAGFLEEVPLQDVRLDMEEDAVYGRAQQTNLE 120
Query: 125 YLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNAT 184
YLL+LDVD L+WSFR A LP PGK YGGWE ELRGHFVGHYLSA+A+ WASTHN T
Sbjct: 121 YLLLLDVDRLLWSFRTQAGLPAPGKPYGGWEGADVELRGHFVGHYLSAAAKTWASTHNGT 180
Query: 185 IKEKMSTVVFSLSECQNKI----GTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLL 240
+ KMS VV +L ECQ G GYLSAFP E FD FEA++PVWAPYYT+HKI+ GLL
Sbjct: 181 LAAKMSAVVDALHECQQAAAANGGNGYLSAFPAEFFDRFEAIQPVWAPYYTVHKIMQGLL 240
Query: 241 DQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSIT 300
DQ+ +A N +AL MA M YF RV+ VI + +ERHW SLNEETGGMNDVLY+LY+IT
Sbjct: 241 DQHTVAGNGKALAMAVAMAGYFGGRVRSVIQRHGIERHWTSLNEETGGMNDVLYQLYTIT 300
Query: 301 HDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGT 360
+D +HL+LAHLFDKPCFLG LA+QAD L+ FHANTHIP+V+G QMRYEVTGDPLYK I T
Sbjct: 301 NDQRHLVLAHLFDKPCFLGLLAVQADSLTGFHANTHIPVVVGGQMRYEVTGDPLYKEIST 360
Query: 361 FFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEI 420
FFMDIVN SHSYATGGTS EFW DPKRLA TL +ENEE+CTTYNMLKVSRHLFRWTKEI
Sbjct: 361 FFMDIVNTSHSYATGGTSVSEFWSDPKRLASTLTTENEESCTTYNMLKVSRHLFRWTKEI 420
Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
AYADYYERAL NGVLSIQRG +PGVMIYMLP G G SKA S HGWGT+++SFWCCYGTGI
Sbjct: 421 AYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAVSYHGWGTQYDSFWCCYGTGI 480
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
ESFSKLGD+IYFEE+G+ P LY++QYI S F+WKS + + Q++ P+ S D YL+++L+
Sbjct: 481 ESFSKLGDTIYFEEKGSKPTLYVVQYIPSIFNWKSAGLTVTQRLKPLSSSDQYLQVSLSI 540
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
S+K GQ +++N+R+P W +NGA+A+LN + L L PG FL+ T++W+ D LT+QLP
Sbjct: 541 SAKTN-GQYATVNVRIPSWASANGAKATLNDKYLQLGSPGTFLTVTKQWNSGDHLTLQLP 599
Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG-TARSLSALISPIPPSFNA 659
++LRTEAI+DDR E+AS+QA+LFGP+LLAG ++G+WD KTG A ++S ISP+P S+++
Sbjct: 600 INLRTEAIKDDRAEFASLQAVLFGPFLLAGLSTGDWDAKTGAAAAAISDWISPVPSSYSS 659
Query: 660 QLVTFTQESGNSTFVMSNSN-QSITMEEFPV-SGTDAALHATFRLILKDASLSNFSSLNN 717
QLVT TQESG STFV+S N S+ M+ P GT+AA+H TFRL+ + S + N
Sbjct: 660 QLVTLTQESGGSTFVLSTVNGTSLAMQPRPEGGGTEAAVHGTFRLVPQ--GFSPPPTTNR 717
Query: 718 VIG-----KSVMLEPFDFPGMLVQQGKEDEL-VVSESPKEMGSSGFRLVAGLDKRNETVS 771
G S M+EPFD PGM + D L VV K GS F +V GLD + +VS
Sbjct: 718 RHGAPTNLASAMIEPFDLPGMAIT----DALTVVRSEEKSSGSLLFNVVPGLDGKPGSVS 773
Query: 772 LEAENRKGCFVSSGVNFEPGASLKLLCSTESLDAGFNR-AASFMMEIGISEYHPISFVAK 830
LE R GCFV + GA +++ C AGF++ AASF + YHPISFVA+
Sbjct: 774 LELGTRPGCFVVTA-----GAKVQVGCG-----AGFSQAAASFARAEPLRRYHPISFVAR 823
Query: 831 GARRNFLLAPLLSFRDEAYTVYFNI 855
GARR FLL PL + RDE YTVYFN+
Sbjct: 824 GARRGFLLEPLFTLRDEFYTVYFNL 848
>gi|242096362|ref|XP_002438671.1| hypothetical protein SORBIDRAFT_10g024070 [Sorghum bicolor]
gi|241916894|gb|EER90038.1| hypothetical protein SORBIDRAFT_10g024070 [Sorghum bicolor]
Length = 887
Score = 945 bits (2442), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 502/881 (56%), Positives = 618/881 (70%), Gaps = 64/881 (7%)
Query: 17 LGKQCTN-QSPYDSHAFRYELTST--NKTWKEEVLSHFHLTPTDDSAWSSLIPSKILGDQ 73
+ K+CTN + SH R L ++ W+ L H HL PTD++AW L+P G
Sbjct: 27 MAKECTNIPTELSSHTVRARLQASPGAAEWRWRELFHEHLNPTDEAAWMDLMPPPPRGGL 86
Query: 74 KDE---------------VSWALLYRKIKNP----------GGFDLPGNFLKEVSLHDVW 108
+ + W +LYR +K G G FL+EVSLHDV
Sbjct: 87 QTAAAADAGHHHHQEEEELDWVMLYRSLKGQQVVVGGAVPASGAAAAGPFLEEVSLHDVR 146
Query: 109 LD---QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHF 165
LD + RAQ+TNLEYLL+LDVD LVWSFR A+LP PG+ YGGWE P SELRGHF
Sbjct: 147 LDPDGDDAAYGRAQRTNLEYLLLLDVDRLVWSFRSQAALPAPGEPYGGWEKPDSELRGHF 206
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
VGHYLSA+A+MWASTHN T+ KMS VV +L ECQ GTGYLSAFP E FD FEA+KPV
Sbjct: 207 VGHYLSATAKMWASTHNGTLAGKMSAVVDALDECQRAAGTGYLSAFPAEFFDRFEAIKPV 266
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
WAPYYTIHKI+ GLLDQ+V+A N +AL M M +YF RV+ VI YS+ERHW SLNEE
Sbjct: 267 WAPYYTIHKIMQGLLDQHVVAGNGKALGMVVAMADYFAGRVRNVIRRYSIERHWTSLNEE 326
Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
TGGMNDVLY+LY+ITHD +HL+LAHLFDKPCFLG LA+QAD LS+FHANTHIP+VIG QM
Sbjct: 327 TGGMNDVLYQLYTITHDQRHLVLAHLFDKPCFLGLLAVQADSLSNFHANTHIPVVIGGQM 386
Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
RYEVTGDPLYK I TFFMD VN+SH+YATGGTS EFW DPKRLA+ L +E EE+CTTYN
Sbjct: 387 RYEVTGDPLYKEIATFFMDTVNSSHAYATGGTSVSEFWSDPKRLAEALTTETEESCTTYN 446
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
MLKVSRHLFRWTKE+AYADYYERAL NGVLSIQRG +PGVMIYMLP G G SKA+S HGW
Sbjct: 447 MLKVSRHLFRWTKEVAYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAKSYHGW 506
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
GT+ SFWCCYGTGIESFSKLGDSIYFEE+G P LYI+Q+I S+F+W++ + + QK+
Sbjct: 507 GTQNESFWCCYGTGIESFSKLGDSIYFEEKGQKPALYIVQFIPSTFNWRTTGLTVTQKLM 566
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
P+ SWD YL+++ + S+K + GQ ++LN+R+P WT NGA+A+LN ++L L PG FL+
Sbjct: 567 PLSSWDQYLQVSFSISAKTD-GQFATLNVRIPSWTSLNGAKATLNDKDLQLASPGTFLTV 625
Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG-TAR 644
+++W D+L +QLP+ LRTEAI+DDRPEYASIQA+LFGP+LLAG T+GEWD KTG A
Sbjct: 626 SKQWGSGDQLLLQLPIHLRTEAIKDDRPEYASIQAVLFGPFLLAGLTTGEWDAKTGAAAA 685
Query: 645 SLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP--VSGTDAALHATFRL 702
+ + I+P+PP N+QLVT QESG FV+S N S+TM+E P GTDAA+HATFRL
Sbjct: 686 AATDWITPVPPGSNSQLVTLAQESGGKAFVLSAVNGSLTMQERPKDSGGTDAAVHATFRL 745
Query: 703 ILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSG--FRLV 760
+ + + + + LEP D PGM+V D L VS SSG F +V
Sbjct: 746 VPQGTNST----------AAATLEPLDMPGMVVT----DTLTVSAEK----SSGALFNVV 787
Query: 761 AGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTESLDAG------FNRAASFM 814
GL +VSLE +R GCF+ +G + G +++ C+ G F +AASF
Sbjct: 788 PGLAGAPGSVSLELGSRPGCFLVAGGS---GEKVQVGCTGGVKKHGNGGGDWFRQAASFA 844
Query: 815 MEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
+ YHP+SF A+G RR+FLL PL + RDE YT+YFN+
Sbjct: 845 RAEPMRRYHPMSFAARGVRRSFLLEPLFTLRDEFYTIYFNL 885
>gi|51090917|dbj|BAD35522.1| hypothetical protein [Oryza sativa Japonica Group]
gi|51090951|dbj|BAD35554.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 883
Score = 926 bits (2393), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 491/874 (56%), Positives = 609/874 (69%), Gaps = 52/874 (5%)
Query: 19 KQCTN-QSPYDSHAFRYELTSTNKT---WKEEVLSHFHLTPTDDSAWSSLIPSKILGDQK 74
K+CTN + SH R L S++ W+EE HL PTD++AW L+P +
Sbjct: 23 KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMP--LAAASA 80
Query: 75 DEVSWALLYRKIKNPGGFDLPGN-----------FLKEVSLHDVWLDQSS----VLWRAQ 119
E WA+LYR +K G + G+ FL+EVSLHDV LD V RAQ
Sbjct: 81 SEFDWAMLYRSLK---GAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQ 137
Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAS 179
QTNLEYLL+L+VD LVWSFR A LP PGK YGGWE P ELRGHFVGHYLSA+A+MWAS
Sbjct: 138 QTNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVGHYLSAAAKMWAS 197
Query: 180 THNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGL 239
THN T+ KM+ VV +L +CQ GTGYLSAFP E FD FEA++PVWAPYYTIH I+ GL
Sbjct: 198 THNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIH-IMQGL 256
Query: 240 LDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSI 299
LDQ+ +A N +AL M M +YF RV+ VI Y++ERHW SLNEETGGMNDVLY+LY+I
Sbjct: 257 LDQHTVAGNGKALGMVVAMADYFAGRVRSVIQRYTIERHWTSLNEETGGMNDVLYQLYTI 316
Query: 300 THDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIG 359
T D +HL+LAHLFDKPCFLG LA+QAD LS FHANTHIP+VIG QMRYEVTGDPLYK I
Sbjct: 317 TKDQRHLVLAHLFDKPCFLGLLAVQADSLSGFHANTHIPVVIGGQMRYEVTGDPLYKEIA 376
Query: 360 TFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
TFFMDIVN+SHSYATGGTS EFW +PK LA+ L +E EE+CTTYNMLKVSRHLFRWTKE
Sbjct: 377 TFFMDIVNSSHSYATGGTSVSEFWSNPKHLAEALTTETEESCTTYNMLKVSRHLFRWTKE 436
Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTG 479
IAYADYYERAL NGVLSIQRG +PGVMIYMLP G G SKA S HGWGT++NSFWCCYGTG
Sbjct: 437 IAYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAVSYHGWGTQYNSFWCCYGTG 496
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
IESFSKLGDSIYFE++G+ PGLYIIQYI S+F+W++ + + Q+V P+ S D YL+++L+
Sbjct: 497 IESFSKLGDSIYFEQKGDKPGLYIIQYIPSTFNWRTAGLTVTQQVKPLSSSDQYLQVSLS 556
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERW-SYNDKLTIQ 598
S+ + GQ ++LN+R+P WT NGA+A+LN ++L L PG FL+ +++W S +D L +Q
Sbjct: 557 ISAAKTNGQYATLNVRIPSWTSMNGAKATLNDKDLQLASPGTFLTISKQWDSGDDHLLLQ 616
Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWD-IKTGTARSLSALISPIPPSF 657
P++LRTEAI+DDRP+ AS+ AILFGP+LLAG T+G+WD G A + S I+P+P S+
Sbjct: 617 FPINLRTEAIKDDRPQVASLNAILFGPFLLAGLTTGDWDAKTGGAATAASDWITPVPASY 676
Query: 658 NAQLVTFTQESGNSTFVMSNSNQ-SITMEEFP--VSGTDAALHATFRLI--------LKD 706
N+QLVT TQESG T ++S N S+ M E P GTDAA+ ATFR++ +
Sbjct: 677 NSQLVTLTQESGGKTMLLSTVNDTSLAMLERPEGAGGTDAAVRATFRVVPPGSRAELRQR 736
Query: 707 ASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKR 766
A + + +EPF PG V G L V + S+ F + GLD +
Sbjct: 737 AGAGAGEGAARLKVAAATIEPFGLPGTAVSNG----LAVVRAGNS-SSTLFNVAPGLDGK 791
Query: 767 NETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTE-----SLDAGFNRAASFMMEIGISE 821
+VSLE ++ GCF+ +G GA + + C T + AGF +AASF +
Sbjct: 792 PGSVSLELGSKPGCFLVAGA----GAKVHVGCRTRGGAAAAAAAGFEQAASFAQAEPLRR 847
Query: 822 YHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
YH ISF A G RR+FLL PL + RDE YT+YFN+
Sbjct: 848 YHAISFFASGVRRSFLLEPLFTLRDEFYTIYFNL 881
>gi|357472931|ref|XP_003606750.1| hypothetical protein MTR_4g065200 [Medicago truncatula]
gi|355507805|gb|AES88947.1| hypothetical protein MTR_4g065200 [Medicago truncatula]
Length = 646
Score = 896 bits (2316), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/675 (64%), Positives = 516/675 (76%), Gaps = 34/675 (5%)
Query: 3 FGFVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAW 61
F F+ FG GK+C N P SH FRYEL S N+TWK+EV+SH+HLTPTD+SAW
Sbjct: 4 FVFMFMAIMLFGCVAGKECMNNLP-QSHTFRYELWASKNETWKKEVMSHYHLTPTDESAW 62
Query: 62 SSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQT 121
+ L+P K+L ++ ++ WA YR++KN P FLKEV L DV L + S+ +AQ+T
Sbjct: 63 ADLLPRKLLSEE-NQRDWAAKYREMKNADLSKPPVGFLKEVPLGDVRLLEGSIHAQAQKT 121
Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH 181
NLEYLLMLDVDSL+WSFRKTA LPTPG YGGWE+P ELRGHFVGHYLSASA MWAST
Sbjct: 122 NLEYLLMLDVDSLIWSFRKTAGLPTPGTPYGGWEDPSIELRGHFVGHYLSASALMWASTK 181
Query: 182 NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLD 241
N + EKMS +V LS CQ KIGTGYLSAFPTELFD EAL+ WAPYYTIHKILAGLLD
Sbjct: 182 NDNLNEKMSALVSGLSACQEKIGTGYLSAFPTELFDRVEALQYAWAPYYTIHKILAGLLD 241
Query: 242 QYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITH 301
QY + N QALKM TWMV+YFYNRV VI +V H+ SLNEE GGMNDVLYRLYSIT
Sbjct: 242 QYTIGGNPQALKMVTWMVDYFYNRVMNVIQKLTVNGHYQSLNEEAGGMNDVLYRLYSITR 301
Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTF 361
D KHL+LAHLFDKPCFLG LA+QA+ +++FHANTHIPIV+GSQ+RYEVTGDPLYK IG F
Sbjct: 302 DSKHLVLAHLFDKPCFLGVLAVQANDIANFHANTHIPIVVGSQLRYEVTGDPLYKDIGAF 361
Query: 362 FMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI 420
FMDIVN+SH+YATGGTS REFW DPKR+AD L S ENEE+CTTYNMLKVSRHLFRWTKE+
Sbjct: 362 FMDIVNSSHTYATGGTSVREFWNDPKRIADNLKSTENEESCTTYNMLKVSRHLFRWTKEV 421
Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
+YADYYERALTNGVLSIQRGT+PGVMIYMLPLG GVSKA++ GWG FN+FWCCYGTGI
Sbjct: 422 SYADYYERALTNGVLSIQRGTDPGVMIYMLPLGLGVSKAKTDKGWGNPFNTFWCCYGTGI 481
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
ESFSKLGDSIYFEEEG+ P LYIIQYISSSF+WKSG ++L Q V P S DPYLR+T TF
Sbjct: 482 ESFSKLGDSIYFEEEGHNPSLYIIQYISSSFNWKSGKILLTQTVVPAASSDPYLRVTFTF 541
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
S + G S+LN R+P W++++GA+A LN + L LP P
Sbjct: 542 SPNETTGTSSTLNFRVPSWSHADGAKAILNSETLSLPAP--------------------- 580
Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQ 660
DDRPE+AS+QAIL+GPYLLAGHT+ WDIK T ++++ I+PIP ++++Q
Sbjct: 581 ---------DDRPEFASLQAILYGPYLLAGHTTSIWDIKGVTNKAVADWITPIPSNYSSQ 631
Query: 661 LVTFTQESGNSTFVM 675
LV F ++ + ++
Sbjct: 632 LVFFIHKTSTNQLLL 646
>gi|218198543|gb|EEC80970.1| hypothetical protein OsI_23693 [Oryza sativa Indica Group]
Length = 905
Score = 882 bits (2278), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/900 (53%), Positives = 598/900 (66%), Gaps = 82/900 (9%)
Query: 19 KQCTN-QSPYDSHAFRYELTSTNKT---WKEEVLSHFHLTPTDDSAWSSLIPSKILGDQK 74
K+CTN + SH R L S++ W+EE HL PTD++AW L+P +
Sbjct: 23 KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMP--LAAASA 80
Query: 75 DEVSWALLYRKIKNPGGFDLPGN-----------FLKEVSLHDVWLDQSS----VLWRAQ 119
E WA+LYR +K G + G+ FL+EVSLHDV LD V RAQ
Sbjct: 81 SEFDWAMLYRSLK---GAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQ 137
Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAS 179
QTNLEYLL+L+VD LVWSFR A LP PGK YGGWE P ELRGHFVGHYLSA+A+MWAS
Sbjct: 138 QTNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVGHYLSAAAKMWAS 197
Query: 180 THNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK----- 234
THN T+ KM+ VV +L +CQ GTGYLSAFP E FD FEA++PVWAPYYTIHK
Sbjct: 198 THNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKARNAT 257
Query: 235 ---------------------ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMY 273
I+ GLLDQ+ +A N +AL M M +YF RV+ VI Y
Sbjct: 258 QSICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGKALGMVVAMADYFAGRVRSVIQRY 317
Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
++ERHW SLNEETGGMNDVLY+L + + F + CFLG LA+QAD LS FHA
Sbjct: 318 TIERHWTSLNEETGGMNDVLYQLKT-----EAFGAGSSFRQACFLGLLAVQADSLSGFHA 372
Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
NTHIP+VIG QMRYEVTGDPLYK I TFFMDIVN+SHSYATGGTS EFW +PK LA+ L
Sbjct: 373 NTHIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVSEFWSNPKHLAEAL 432
Query: 394 GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG 453
+E EE+CTTYNMLKVSRHLFRWTKEIAYADYYERAL NGVLSIQRG +PGVMIYMLP G
Sbjct: 433 TTETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMIYMLPQG 492
Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
G SKA S HGWGT++NSFWCCYGTGIESFSKLGDSIYFE++G+ PGLYIIQYI S+F+W
Sbjct: 493 PGRSKAVSYHGWGTQYNSFWCCYGTGIESFSKLGDSIYFEQKGDKPGLYIIQYIPSTFNW 552
Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
++ + + Q+V P+ S D YL+++L+ S+ + GQ ++LN+R+P WT NGA+A+LN ++
Sbjct: 553 RTAGLTVTQQVKPLSSSDQYLQVSLSISAAKTNGQYATLNVRIPSWTSMNGAKATLNDKD 612
Query: 574 LPLPPPGNFLSATERW-SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
L L PG FL+ +++W S +D L +Q P++LRTEAI+DDRP+ AS+ AILFGP+LLAG T
Sbjct: 613 LQLASPGTFLTISKQWDSGDDHLLLQFPINLRTEAIKDDRPQVASLNAILFGPFLLAGLT 672
Query: 633 SGEWD-IKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQ-SITMEEFP-- 688
+G+WD G A + S I+P+P S+N+QLVT TQESG T ++S N S+ M E P
Sbjct: 673 TGDWDAKTGGAATAASDWITPVPASYNSQLVTLTQESGGKTMLLSTVNDTSLAMLERPEG 732
Query: 689 VSGTDAALHATFRLI--------LKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKE 740
GTDAA+ ATFR++ + A + + +EPF PG V G
Sbjct: 733 AGGTDAAVRATFRVVPPGSRAELRQRAGAGAGEGAARLKVAAATIEPFGLPGTAVSNG-- 790
Query: 741 DELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCST 800
L V + S+ F +V GLD + +VSLE ++ GCF+ +G GA + + C T
Sbjct: 791 --LAVVRAGNS-SSTLFNVVPGLDGKPGSVSLELGSKPGCFLVAGA----GAKVHVGCRT 843
Query: 801 E-----SLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
+ AGF +AASF + YH ISF A G RR+FLL PL + RDE YT+YFN+
Sbjct: 844 RGGAAAAAAAGFEQAASFAQAEPLRRYHAISFFASGVRRSFLLEPLFTLRDEFYTIYFNL 903
>gi|255544804|ref|XP_002513463.1| conserved hypothetical protein [Ricinus communis]
gi|223547371|gb|EEF48866.1| conserved hypothetical protein [Ricinus communis]
Length = 759
Score = 850 bits (2195), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/625 (65%), Positives = 491/625 (78%), Gaps = 36/625 (5%)
Query: 233 HKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDV 292
H +LAGLLDQY+ ADNAQALKM WMVEYFYNRVQ VIT YSVERH+ SLNEETGGMNDV
Sbjct: 169 HFVLAGLLDQYIFADNAQALKMVNWMVEYFYNRVQNVITKYSVERHFLSLNEETGGMNDV 228
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
LY+L+SIT +PKHL+LAHLFDKPCFLG LA+Q
Sbjct: 229 LYKLFSITGEPKHLVLAHLFDKPCFLGLLAVQE--------------------------- 261
Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
IGTFFMDIVN+SH+YATGGTS EFW DPKRLA TL + EE+CTTYNMLKVSRH
Sbjct: 262 -----IGTFFMDIVNSSHTYATGGTSDYEFWSDPKRLASTLNDQTEESCTTYNMLKVSRH 316
Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
LFRWTKE+AYADYYERALTNGVL IQRGTEPGVMIY+LP G SKAR+ H WGT +SF
Sbjct: 317 LFRWTKEMAYADYYERALTNGVLGIQRGTEPGVMIYLLPQNPGGSKARTIHKWGTPDDSF 376
Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
WCCYGTGIESFSKLGDSIYFEE +PGLY+IQYISSS DWK G +VLNQKVDPI SWDP
Sbjct: 377 WCCYGTGIESFSKLGDSIYFEEGSQIPGLYVIQYISSSLDWKLGQIVLNQKVDPIFSWDP 436
Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYN 592
+LR+T TF Q Q S+LNLR+P+WT+S+ +A++N Q+LP+PPPGNFLS T WS +
Sbjct: 437 FLRVTFTFD--QGASQSSTLNLRIPIWTHSDDVKATINAQSLPVPPPGNFLSVTGSWSSS 494
Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
DKL +QLP+ LRTEAI+DDRPEYASIQAILFGPYLLAGH+SG+WD+K+ +A+SLS I+
Sbjct: 495 DKLFLQLPIILRTEAIKDDRPEYASIQAILFGPYLLAGHSSGDWDLKSESAKSLSDWITA 554
Query: 653 IPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNF 712
IP ++N+ LV+F+Q+SG+S F ++NSNQS+TME FP GTD ++HATFRLIL D+S S
Sbjct: 555 IPATYNSHLVSFSQDSGDSVFALTNSNQSLTMEIFPQPGTDDSVHATFRLILNDSSSSEL 614
Query: 713 SSLNNVIGKSVMLEPFDFPGM-LVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVS 771
++ + +GK VMLEPF+ PGM LVQQGKE L V + GSS FRLV+GLD ++ +VS
Sbjct: 615 ANFEDAVGKLVMLEPFNLPGMLLVQQGKEVSLAVGYTDGSDGSSLFRLVSGLDGKDGSVS 674
Query: 772 LEAENRKGCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKG 831
LE+ + + CFV SGV+++ G +LKL C S + FN+ ASFM+ GIS YHPISFVAKG
Sbjct: 675 LESVSNENCFVFSGVDYKSGTALKLSCKKSS-ETKFNQGASFMVNKGISHYHPISFVAKG 733
Query: 832 ARRNFLLAPLLSFRDEAYTVYFNIQ 856
A+RNFLL+PL SFRDE+YT+YFNIQ
Sbjct: 734 AKRNFLLSPLFSFRDESYTIYFNIQ 758
Score = 196 bits (499), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 100/172 (58%), Positives = 123/172 (71%), Gaps = 12/172 (6%)
Query: 4 GFVLFFFFCF-------GLALGKQCTN-QSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLT 54
GFV+F G + K+CTN + SH FRY L +S N++ K+E+ +H+HLT
Sbjct: 3 GFVVFELLVLVAASVLCGFGMSKECTNIPTQLSSHTFRYALLSSNNESLKQEMFAHYHLT 62
Query: 55 PTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSV 114
PTDDS WSSL+P K+L ++DE WA++Y+K+K+P GNFLKEVSLH+V LD S
Sbjct: 63 PTDDSVWSSLLPRKML-KEEDEFDWAMMYKKLKSP--LQSSGNFLKEVSLHNVRLDLGSF 119
Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
WRAQQTNLEYLLML++D LVWSFRKTA LPTPG AYGGWE P ELRGHFV
Sbjct: 120 HWRAQQTNLEYLLMLNLDRLVWSFRKTAGLPTPGTAYGGWEAPNVELRGHFV 171
>gi|357472921|ref|XP_003606745.1| hypothetical protein MTR_4g065150 [Medicago truncatula]
gi|355507800|gb|AES88942.1| hypothetical protein MTR_4g065150 [Medicago truncatula]
Length = 617
Score = 834 bits (2154), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/606 (66%), Positives = 488/606 (80%), Gaps = 16/606 (2%)
Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
M TWMV+YFY+RV VI+ Y+V RH+ SLNEETGGMNDVLY+LYS+T D KHLLLAHLFD
Sbjct: 1 MVTWMVDYFYDRVVNVISKYTVNRHYQSLNEETGGMNDVLYKLYSVTGDSKHLLLAHLFD 60
Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYA 373
KPCFLG LA+QA+ ++ FHANTHIPIV+GSQMRYEVTGDPLY+ IG+FFMDIVN+SHSYA
Sbjct: 61 KPCFLGLLAVQANDIADFHANTHIPIVVGSQMRYEVTGDPLYREIGSFFMDIVNSSHSYA 120
Query: 374 TGGTSAREFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
TGGTS REFW +PKR+AD LG+ ENEE+CTTYNMLKVSRHLFRWTKE+ YADYYERALTN
Sbjct: 121 TGGTSVREFWSNPKRIADNLGTTENEESCTTYNMLKVSRHLFRWTKEVTYADYYERALTN 180
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
GVL IQRGT+PGVMIYMLPLG GVSKA++ H WG F++FWCCYGTGIESFSKLGDSIYF
Sbjct: 181 GVLGIQRGTDPGVMIYMLPLGIGVSKAKTGHSWGNPFDTFWCCYGTGIESFSKLGDSIYF 240
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
EEEGN P LYIIQYISSSF+WKSG +L Q V P S DPYLR+T TFSS ++ G S+L
Sbjct: 241 EEEGNSPSLYIIQYISSSFNWKSGKTLLTQTVVPAASSDPYLRVTFTFSSNEKTGTSSTL 300
Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
N R+P W++++GA+A LN + L LP PGNFLS T +WS DKLT+QLPL +RTEAI+DDR
Sbjct: 301 NFRVPSWSHADGAKAILNSEALSLPAPGNFLSITRQWSAGDKLTLQLPLIIRTEAIKDDR 360
Query: 613 PEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNST 672
PEYAS+QAIL+GPYLLAGHT+ WDIK T ++++ I+PIP S+N+QLV+F+Q+ ST
Sbjct: 361 PEYASVQAILYGPYLLAGHTTRNWDIKADTNKAVADWITPIPSSYNSQLVSFSQDFDQST 420
Query: 673 FVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPG 732
FV++NSNQS+TM++ P GTD AL ATFRLILK A + K+VMLEP D PG
Sbjct: 421 FVITNSNQSLTMQKSPEPGTDVALQATFRLILKGA-----------VSKTVMLEPIDLPG 469
Query: 733 MLVQQGKEDE-LVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPG 791
M+V + D+ L+V +S SS F +V GLD RN+T+SL++++ K C+V S + G
Sbjct: 470 MIVSHQEPDQPLIVVDSSLGGPSSVFLVVPGLDGRNQTISLQSQSNKDCYVYS--DMSSG 527
Query: 792 ASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTV 851
+ +KL C ++S +A FN+AASF+ G+ +YHPISFVAKG +NFLL PL +FRDE YTV
Sbjct: 528 SGVKLRCKSDS-EASFNQAASFVSGKGLRQYHPISFVAKGGNQNFLLEPLFNFRDEHYTV 586
Query: 852 YFNIQD 857
YFNIQ+
Sbjct: 587 YFNIQE 592
>gi|242096364|ref|XP_002438672.1| hypothetical protein SORBIDRAFT_10g024080 [Sorghum bicolor]
gi|241916895|gb|EER90039.1| hypothetical protein SORBIDRAFT_10g024080 [Sorghum bicolor]
Length = 933
Score = 833 bits (2151), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/879 (49%), Positives = 571/879 (64%), Gaps = 82/879 (9%)
Query: 52 HLTPTDDSAWSSLIPSKILGDQKD------EVSWALLYRKIKNPGGFD--------LPGN 97
HLTPT+++ W +L+P ++ G E W LYR + GG D PG
Sbjct: 55 HLTPTEEATWMALLPRRLRGGGGGGARARAEFDWLALYRSLTRGGGPDDDADAGKPGPGE 114
Query: 98 FLKEVSLHDVWL----------------DQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKT 141
L SLHDV L +++ W+AQQTNLEYLL LD D L W+FR+
Sbjct: 115 LLTPASLHDVRLHGDDDDDDRVLTGSSSSSAAMYWQAQQTNLEYLLYLDPDRLTWTFRRQ 174
Query: 142 ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN 201
A LPT G YGGWE P +LRGHF GHYLSASA MWA+THN+T++E+M+ VV L +CQ
Sbjct: 175 AGLPTVGDPYGGWEAPGGQLRGHFTGHYLSASAHMWAATHNSTLRERMTRVVDILYDCQK 234
Query: 202 KIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEY 261
K+GTGYL+A+P +FD +E L W+PYYTIHKI+ GLLDQY+LA N + L + WM +Y
Sbjct: 235 KMGTGYLAAYPETMFDLYEQLDEAWSPYYTIHKIMQGLLDQYMLASNKKGLDVVVWMTDY 294
Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
F NRV+ +I Y+++RHW ++NEETGG NDV+Y+LY+IT + KHL +AHLFDKPCFLG L
Sbjct: 295 FSNRVKNLIQKYTIQRHWEAMNEETGGFNDVMYQLYTITKNQKHLTMAHLFDKPCFLGPL 354
Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
L D +S H NTH+P++IG+Q RYEV GD LYK I T+ D+VN+SH++ATGGTS E
Sbjct: 355 GLHKDDISGLHVNTHLPVIIGTQKRYEVVGDHLYKDISTYLFDVVNSSHTFATGGTSTME 414
Query: 382 FWWDPKRLADTLG-SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRG 440
W DPKRL D + S NEETC TYN LKVSR+LFRWTKE YAD+YER L NG++ QRG
Sbjct: 415 HWHDPKRLVDEIKISSNEETCATYNFLKVSRNLFRWTKEAKYADHYERLLINGIMGNQRG 474
Query: 441 TEPGVMIYMLPLGRGVSKA-----------RSTHGWGTKFNSFWCCYGTGIESFSKLGDS 489
T+PGVM+Y LP+G G SK+ ++ GWG ++FWCCYGTGIESFSKLGDS
Sbjct: 475 TQPGVMLYFLPMGPGRSKSVSGLSPSGLPPKNPGGWGGPNDTFWCCYGTGIESFSKLGDS 534
Query: 490 IYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL 549
IYF EEG PGLYIIQYI S+FDWK+ + +NQ+ P++S DP+ +++LTFS+K + QL
Sbjct: 535 IYFLEEGEAPGLYIIQYIPSTFDWKATGLTVNQQAKPLLSTDPFFKVSLTFSAKGDA-QL 593
Query: 550 SSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN-----FLSATERWSYNDKLTIQLPLSLR 604
+ +++R+P WT ++G A+LNGQ L L GN FL+ T+ W+ D LT+Q P++LR
Sbjct: 594 AKVSVRIPSWTSTDGTTATLNGQKLNLTSTGNSTNGGFLTVTKLWA-EDTLTLQFPITLR 652
Query: 605 TEAIQDDRPEYASIQAILFGPYLLAGHTSGE-----------------WDIKTGTARSLS 647
TEAI+DDRPEYASIQA+LFGP+LLAG T G+ W++ +A +++
Sbjct: 653 TEAIKDDRPEYASIQAVLFGPHLLAGLTHGKLPVTDSNHSNDGLTPSIWEVNATSATAVT 712
Query: 648 ALISPIPP-SFNAQLVTFTQESGNSTFVMSNS--NQSITMEEFPVSGTDAALHATFRLIL 704
++P+P + N+QLVT TQ +G T V+S S + + M+E P GTDA +HATFR +
Sbjct: 713 DWVTPLPSETLNSQLVTLTQTAGGRTLVLSVSIADAKLEMQEQPAPGTDACVHATFR-VY 771
Query: 705 KDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLD 764
A S+ SL + G +V +EPFD PGM V G L+ P + F V GLD
Sbjct: 772 GQAGSSSSESLLPMQGPNVTIEPFDRPGMAVTNG----LLAVGRPAGGRDTLFNAVPGLD 827
Query: 765 KRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--------STESLDAGFNRAASFMME 816
+VSLE R GCFV++ A+ +++C S A RAASF+
Sbjct: 828 GAPGSVSLELATRPGCFVATAPAAGANAATQVVCRGNKNNGGSASGDGAALRRAASFVRA 887
Query: 817 IGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
+ Y+P+SF A+G RNFLL PL S +DE YTVYF++
Sbjct: 888 APLRRYNPLSFAARGTARNFLLEPLRSLQDEFYTVYFSL 926
>gi|125556053|gb|EAZ01659.1| hypothetical protein OsI_23694 [Oryza sativa Indica Group]
Length = 898
Score = 833 bits (2151), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/853 (50%), Positives = 560/853 (65%), Gaps = 62/853 (7%)
Query: 52 HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQ 111
HL +++ W L+P + +DE+ W LYR I GG + P FL SLHDV +D
Sbjct: 56 HLNQAEEATWMGLLPRR--AGPRDELDWLALYRSITRGGGGE-PAGFLSPASLHDVRVDP 112
Query: 112 --SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHY 169
+++ W+ QQTNLEYLL LD D L W+FR+ A LP G+ YGGWE P +LRGHF GHY
Sbjct: 113 YGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPIVGEPYGGWEAPDGQLRGHFTGHY 172
Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPY 229
LSA+A MWASTHN ++EKM+ VV L CQ K+ TGYLSA+P +FD+++ L W+PY
Sbjct: 173 LSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDAYDELAEAWSPY 232
Query: 230 YTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGM 289
YTIHKI+ GLLDQY LA N + L++ WM +YF RV+K+I YS++RHW ++NEETGG
Sbjct: 233 YTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSIQRHWEAINEETGGF 292
Query: 290 NDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEV 349
NDV+Y+LY+IT + KHL +AHLFDKPCFLG L L D +S H NTH+P+++G+Q RYEV
Sbjct: 293 NDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNTHVPVIVGAQKRYEV 352
Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG-SENEETCTTYNMLK 408
GD LYK I TFF D+VN+SH++ATGGTS E W DPKRL D + S NEETC TYN+LK
Sbjct: 353 VGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWHDPKRLVDEIKISSNEETCATYNLLK 412
Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA--------- 459
VSR+LFRWTKE Y D+YER L NG++ QRG EPGVMIY LP+G G SK+
Sbjct: 413 VSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGPGRSKSISGMPTSGL 472
Query: 460 --RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
++ GWG +FWCCYGTGIESFSKLGDSIYF EEG +PGLYIIQYI S+FDWK+
Sbjct: 473 PPKNPGGWGNANATFWCCYGTGIESFSKLGDSIYFLEEGEIPGLYIIQYIPSTFDWKAAG 532
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
+ + Q+ P+ S D + +++ SSK + + +++N+R+P WT +GA A+LNGQ L L
Sbjct: 533 LTVKQQAKPLSSTDSHFEVSIFISSKGDA-RPANVNVRIPSWTSVDGAIATLNGQKLNLT 591
Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWD 637
G+FLS T+ W +D L+++ P++LRTE I+DDRPEY+SIQA+LFGP+LLAG T G
Sbjct: 592 SAGDFLSVTKLWG-DDTLSLKFPITLRTEPIKDDRPEYSSIQAVLFGPHLLAGLTHGNQT 650
Query: 638 IKTGTARSLSAL-------------------ISPIPPSFNAQLVTFTQESGN----STFV 674
+KT + S S L ++P+ S N+QLVT TQ G+ + FV
Sbjct: 651 VKT-SNDSNSGLTPGVWEVNATHAAAAVAGWVTPVSQSLNSQLVTLTQRDGDAQAAAAFV 709
Query: 675 MSNS--NQSITMEEFPVSGTDAALHATFRLILKDASLSNF-SSLNNVIGKSVMLEPFDFP 731
+S S + ++TM+E PV+G+DA +HATFR + S ++ + G++V LEPFD P
Sbjct: 710 LSVSIADGALTMQESPVAGSDACVHATFRAYHSPSGASAIDAATGRLQGRNVALEPFDRP 769
Query: 732 GMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVN-FEP 790
GM V D L V + ++ F VAGLD TVSLE R GCFV++ +
Sbjct: 770 GMAVT----DALSVG---RPGPATRFNAVAGLDGLPGTVSLELATRPGCFVAAPTTAYLA 822
Query: 791 GASLKLLCSTESL--------DAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLL 842
GA ++ C + D F RAASF + YHP+SF A G RNFLL PL
Sbjct: 823 GAKAQVSCRKPTAAGGGEDDDDTAFRRAASFTQAAPLRLYHPLSFSATGTDRNFLLEPLQ 882
Query: 843 SFRDEAYTVYFNI 855
S +DE YTVYFN+
Sbjct: 883 SLQDEFYTVYFNV 895
>gi|326520888|dbj|BAJ92807.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 683
Score = 830 bits (2145), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/694 (59%), Positives = 507/694 (73%), Gaps = 27/694 (3%)
Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKI---GTGYLSAFPTELFDSFEALKPVWAPYYTI 232
MWASTHN T+ KMS VV +L CQ G GYLSAFP E FD FEA+KPVWAPYYTI
Sbjct: 1 MWASTHNGTLAGKMSAVVDALHACQQAPANGGAGYLSAFPAEFFDRFEAIKPVWAPYYTI 60
Query: 233 HKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDV 292
HKI+ GLLDQY +A N +AL M M YF RV+ VI +S+ERHW SLNEETGGMNDV
Sbjct: 61 HKIMQGLLDQYTVAGNGKALAMVVAMAGYFGERVRSVIQRHSIERHWTSLNEETGGMNDV 120
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
LY+LY+IT+D +HL+LAHLFDKPCFLG LA+QAD LS FHANTHIPIV+G QMRYEVTGD
Sbjct: 121 LYQLYAITNDQRHLVLAHLFDKPCFLGLLAVQADSLSDFHANTHIPIVVGGQMRYEVTGD 180
Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
PLYK I TFFM++VN+SHSYATGGTS EFW+DPKRLA+TL +ENEE+CTTYNMLKVSRH
Sbjct: 181 PLYKEIATFFMNVVNSSHSYATGGTSVSEFWFDPKRLAETLTTENEESCTTYNMLKVSRH 240
Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
LFRWTKEIAYADYYERAL NGV SIQRG +PGVMIYMLP G G SKA S HGWGT+++SF
Sbjct: 241 LFRWTKEIAYADYYERALINGVQSIQRGRDPGVMIYMLPQGPGRSKALSYHGWGTQYDSF 300
Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
WCCYGTGIESFSKLGDSIYFEE+G P LY++QYI S+F+W+S + + Q + P+ S D
Sbjct: 301 WCCYGTGIESFSKLGDSIYFEEKGGKPALYLVQYIPSTFNWRSVGLTVTQTLKPLSSSDQ 360
Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYN 592
L+++L+ S+K GQ +++N+R+P W SNGA+A+LNG++L + PG FLS T++W
Sbjct: 361 NLQVSLSISAKTN-GQYATVNVRIPSWASSNGAKATLNGKDLTMASPGTFLSVTKQWGGG 419
Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
D L +QLP+ LRTEAI+DDRPEYAS+QA+LFGP+LLAG T+G+WD KTG ++S I+
Sbjct: 420 DHLALQLPIRLRTEAIKDDRPEYASLQAVLFGPFLLAGLTTGDWDAKTGGG-AISEWITA 478
Query: 653 IPPSFNAQLVTFTQESGNSTFVMS----NSNQSITMEEFPV-SGTDAALHATFRLILKDA 707
IP ++N+QLVT TQESGNST V+S S+TM+ P GTDAA+HATFRL+ +
Sbjct: 479 IPATYNSQLVTLTQESGNSTLVLSLLSTAKATSLTMQPRPEGGGTDAAVHATFRLVTQGQ 538
Query: 708 SLSNFSSLNNVIG-----KSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAG 762
+ S ++EPFD PGM V ++ S ++ SS F +V G
Sbjct: 539 GTPPMGERRHATNATAALASAVIEPFDMPGMAVTNS------LTLSAEKGPSSLFNVVPG 592
Query: 763 LDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTESLDAGFNR-AASFMMEIGISE 821
LD + +VSLE R GCF+ + GA + GF+R AASF +
Sbjct: 593 LDGQPGSVSLELGARPGCFLVTA-----GAKANVQVGCGGGGTGFSRQAASFARAEPLRR 647
Query: 822 YHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
YHPISF AKGARR+FLL PL + RDE YTVYFN+
Sbjct: 648 YHPISFAAKGARRSFLLEPLFTLRDEFYTVYFNL 681
>gi|125597849|gb|EAZ37629.1| hypothetical protein OsJ_21963 [Oryza sativa Japonica Group]
Length = 902
Score = 827 bits (2135), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/856 (50%), Positives = 558/856 (65%), Gaps = 65/856 (7%)
Query: 52 HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDL---PGNFLKEVSLHDVW 108
HL +++ W L+P + +DE+ W LYR I GG D+ P FL SLHDV
Sbjct: 57 HLNQAEEATWMGLLPRR--AGPRDELDWLALYRSITRGGG-DVGGEPAGFLSPASLHDVR 113
Query: 109 LDQ--SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
+D +++ W+ QQTNLEYLL LD D L W+FR+ A LPT G+ YGGWE P +LRGHF
Sbjct: 114 VDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWEAPDGQLRGHFT 173
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVW 226
GHYLSA+A MWASTHN ++EKM+ VV L CQ K+ TGYLSA+P +FD+++ L W
Sbjct: 174 GHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDAYDELAEAW 233
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
+PYYTIHKI+ GLLDQY LA N + L++ WM +YF RV+K+I YS++RHW ++NEET
Sbjct: 234 SPYYTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSIQRHWEAINEET 293
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GG NDV+Y+LY+IT + KHL +AHLFDKPCFLG L L D +S H NTH+P+++G+Q R
Sbjct: 294 GGFNDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNTHVPVIVGAQKR 353
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG-SENEETCTTYN 405
YEV GD LYK I TFF D+VN+SH++ATGGTS E W DPKRL D + S NEETC TYN
Sbjct: 354 YEVVGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWHDPKRLVDEIKISSNEETCATYN 413
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA------ 459
+LKVSR+LFRWTKE Y D+YER L NG++ QRG EPGVMIY LP+G G SK+
Sbjct: 414 LLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGPGRSKSISGMPT 473
Query: 460 -----RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
++ GWG +FWCCYGTGIESFSKLGDSIYF EEG +PGLYIIQYI S+FDWK
Sbjct: 474 SGLPPKNPGGWGNANATFWCCYGTGIESFSKLGDSIYFLEEGEIPGLYIIQYIPSTFDWK 533
Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
+ + + Q+ P+ S D + +++ SSK + + +++N+R+P WT +GA A+LNGQ L
Sbjct: 534 AAGLTVKQQAKPLSSTDSHFEVSIFISSKGDA-RPANVNVRIPSWTSVDGAIATLNGQKL 592
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
L G+FLS T+ W +D L+++ P++LRTE I+DDRPEY+SIQA+LFGP+LLAG T G
Sbjct: 593 NLTSAGDFLSVTKLWG-DDTLSLKFPITLRTEPIKDDRPEYSSIQAVLFGPHLLAGLTHG 651
Query: 635 EWDIKTGTARSLSALISPI-------------------PPSFNAQLVTFTQESGN----S 671
+KT + S S L + S N+QLVT TQ G+ +
Sbjct: 652 NQTVKT-SNDSNSGLTPGVWEVNATHAAAAVAVWVTPVSQSLNSQLVTLTQRDGDAQAAA 710
Query: 672 TFVMSNS--NQSITMEEFPVSGTDAALHATFRLILKDASLSNF-SSLNNVIGKSVMLEPF 728
FV+S S + ++TM+E PV+G+DA +HATFR + S ++ + G+ V LEPF
Sbjct: 711 AFVLSVSIADGALTMQESPVAGSDACVHATFRAYQSPSGASAIDAATGRLQGRDVALEPF 770
Query: 729 DFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVN- 787
D PGM V D L V + ++ F VAGLD TVSLE R GCFV++
Sbjct: 771 DRPGMAVT----DALSVG---RPGPATRFNAVAGLDGLPGTVSLELATRPGCFVAAPTTA 823
Query: 788 FEPGASLKLLCSTESL--------DAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLA 839
+ GA ++ C + D F RAASF + YHP+SF A G RNFLL
Sbjct: 824 YLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASFTQAAPLRLYHPLSFSATGTDRNFLLE 883
Query: 840 PLLSFRDEAYTVYFNI 855
PL S +DE YTVYFN+
Sbjct: 884 PLQSLQDEFYTVYFNV 899
>gi|51090918|dbj|BAD35523.1| unknown protein [Oryza sativa Japonica Group]
gi|51090952|dbj|BAD35555.1| unknown protein [Oryza sativa Japonica Group]
Length = 902
Score = 827 bits (2135), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/856 (50%), Positives = 558/856 (65%), Gaps = 65/856 (7%)
Query: 52 HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDL---PGNFLKEVSLHDVW 108
HL +++ W L+P + +DE+ W LYR I GG D+ P FL SLHDV
Sbjct: 57 HLNQAEEATWMGLLPRR--AGPRDELDWLALYRSITRGGG-DVGGEPAGFLSPASLHDVR 113
Query: 109 LDQ--SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
+D +++ W+ QQTNLEYLL LD D L W+FR+ A LPT G+ YGGWE P +LRGHF
Sbjct: 114 VDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWEAPDGQLRGHFT 173
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVW 226
GHYLSA+A MWASTHN ++EKM+ VV L CQ K+ TGYLSA+P +FD+++ L W
Sbjct: 174 GHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDAYDELAEAW 233
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
+PYYTIHKI+ GLLDQY LA N + L++ WM +YF RV+K+I YS++RHW ++NEET
Sbjct: 234 SPYYTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSIQRHWEAINEET 293
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GG NDV+Y+LY+IT + KHL +AHLFDKPCFLG L L D +S H NTH+P+++G+Q R
Sbjct: 294 GGFNDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNTHVPVIVGAQKR 353
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG-SENEETCTTYN 405
YEV GD LYK I TFF D+VN+SH++ATGGTS E W DPKRL D + S NEETC TYN
Sbjct: 354 YEVVGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWHDPKRLVDEIKISSNEETCATYN 413
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA------ 459
+LKVSR+LFRWTKE Y D+YER L NG++ QRG EPGVMIY LP+G G SK+
Sbjct: 414 LLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGPGRSKSISGMPT 473
Query: 460 -----RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
++ GWG +FWCCYGTGIESFSKLGDSIYF EEG +PGLYIIQYI S+FDWK
Sbjct: 474 SGLPPKNPGGWGNANATFWCCYGTGIESFSKLGDSIYFLEEGEIPGLYIIQYIPSTFDWK 533
Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
+ + + Q+ P+ S D + +++ SSK + + +++N+R+P WT +GA A+LNGQ L
Sbjct: 534 AAGLTVKQQAKPLSSTDSHFEVSIFISSKGDA-RPANVNVRIPSWTSVDGAIATLNGQKL 592
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
L G+FLS T+ W +D L+++ P++LRTE I+DDRPEY+SIQA+LFGP+LLAG T G
Sbjct: 593 NLTSAGDFLSVTKLWG-DDTLSLKFPITLRTEPIKDDRPEYSSIQAVLFGPHLLAGLTHG 651
Query: 635 EWDIKTGTARSLSALISPI-------------------PPSFNAQLVTFTQESGN----S 671
+KT + S S L + S N+QLVT TQ G+ +
Sbjct: 652 NQTVKT-SNDSNSGLTPGVWEVNATHAAAAVAVWVTPVSQSLNSQLVTLTQRDGDAQAAA 710
Query: 672 TFVMSNS--NQSITMEEFPVSGTDAALHATFRLILKDASLSNF-SSLNNVIGKSVMLEPF 728
FV+S S + ++TM+E PV+G+DA +HATFR + S ++ + G+ V LEPF
Sbjct: 711 AFVLSVSIADGALTMQESPVAGSDACVHATFRAYHSPSGASAIDAATGRLQGRDVALEPF 770
Query: 729 DFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVN- 787
D PGM V D L V + ++ F VAGLD TVSLE R GCFV++
Sbjct: 771 DRPGMAVT----DALSVG---RPGPATRFNAVAGLDGLPGTVSLELATRPGCFVAAPTTA 823
Query: 788 FEPGASLKLLCSTESL--------DAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLA 839
+ GA ++ C + D F RAASF + YHP+SF A G RNFLL
Sbjct: 824 YLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASFTQAAPLRLYHPLSFSATGTDRNFLLE 883
Query: 840 PLLSFRDEAYTVYFNI 855
PL S +DE YTVYFN+
Sbjct: 884 PLQSLQDEFYTVYFNV 899
>gi|168021740|ref|XP_001763399.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685534|gb|EDQ71929.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 757
Score = 822 bits (2124), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/767 (53%), Positives = 539/767 (70%), Gaps = 20/767 (2%)
Query: 98 FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
LK+VSLH V L S + AQ TNL+YLL LDVD+++WSFRK ++L PG+ YGGWE+P
Sbjct: 1 LLKDVSLHKVRLGADSPQFMAQNTNLQYLLELDVDNMMWSFRKVSNLNAPGQPYGGWESP 60
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
SELRGHFVGHYLSASA MWASTHN + EKM+ ++ +L ECQ IGTGYLSAFP+E FD
Sbjct: 61 ASELRGHFVGHYLSASALMWASTHNEVLHEKMNALLGALKECQMSIGTGYLSAFPSEFFD 120
Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
FEA++ VWAPYYTIHKI+AGLLDQY+LA + AL M M YFY RV+ VI +++ER
Sbjct: 121 RFEAIEYVWAPYYTIHKIMAGLLDQYLLAGSKDALDMVVEMANYFYKRVKTVIEKFTIER 180
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
HW SLNEETGGMNDVLYRLY++T D KHL LAHLFDKPCFLG LALQAD+LS FH+NTHI
Sbjct: 181 HWRSLNEETGGMNDVLYRLYTVTGDNKHLELAHLFDKPCFLGPLALQADHLSGFHSNTHI 240
Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
PIV+G+QMRYEVT D +Y+ I +FM IVN+SHSYATGGTS EFW D R DTL +EN
Sbjct: 241 PIVVGAQMRYEVTSDLIYRSIAEYFMGIVNSSHSYATGGTSVSEFWTDSMRQGDTLHTEN 300
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
+ETCTTYNMLK++R LFRWTK+I Y DYY+RAL NG+L QRG +PGVMIYMLP+G GVS
Sbjct: 301 QETCTTYNMLKIARTLFRWTKDIKYMDYYDRALINGILGTQRGQQPGVMIYMLPMGPGVS 360
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
K RS HGWG KFNSFWCCYGT IESF+KLGDSIYFE++G +P +Y+ Q++SS F W S
Sbjct: 361 KGRSYHGWGNKFNSFWCCYGTAIESFAKLGDSIYFEDDGEIPSVYVAQFVSSDFVWDSAG 420
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEV--GQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
+VL+Q + P+ + L +T +FS V Q + +++R+P W G +A LNGQ +
Sbjct: 421 LVLHQSLKPLNAEQSILEVTFSFSHATIVRASQDAVIHVRLPSWV--RGCRAHLNGQEIE 478
Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
PG FLS WS +D+L + LP+SL E IQDDR +Y+++ AI++GP+++AG ++G+
Sbjct: 479 SLIPGKFLSIARAWSSDDELVLLLPMSLGLEKIQDDRAQYSALHAIMYGPFVMAGLSTGD 538
Query: 636 WDIKTGTARSLSALISPIPPSFNAQLVTFTQ-----ESGNSTFVMSNSNQSITMEEFPVS 690
W K G +L+ + P+P ++++QL TF+Q E S ++ N+ +I M P
Sbjct: 539 W--KLGHKENLTQWVYPVPAAYHSQLSTFSQFHVNGEYSGSLYLACNNGTAI-MRYAPED 595
Query: 691 GTDAALHATFRLILKDASLSNFSSLNNVIGKS-VMLEPFDFPGMLVQQGKEDELVVSESP 749
GTD +TFR+ N+S L+ K V LE F PG+ +Q ED+ + + P
Sbjct: 596 GTDECGLSTFRV---SDPFGNYSQLSAGDDKRLVSLELFSQPGIFLQHNGEDKPISTGPP 652
Query: 750 KEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLL-CSTESLDAGFN 808
S F + GL ++ TVS EA ++ GCF+SS + L C T D N
Sbjct: 653 SW---SVFFYLPGLTGKSGTVSFEAVDKPGCFLSSSFSGSSVLGGVFLRCKTSRNDNTLN 709
Query: 809 RAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
++F +++G++ YHP+SF+A+G RNFLLAPL S RDE+YT+YF++
Sbjct: 710 AFSTFDVQMGVAAYHPVSFIAEGQHRNFLLAPLNSLRDESYTIYFDM 756
>gi|293331149|ref|NP_001170532.1| uncharacterized protein LOC100384546 precursor [Zea mays]
gi|238005884|gb|ACR33977.1| unknown [Zea mays]
gi|413954824|gb|AFW87473.1| hypothetical protein ZEAMMB73_711416 [Zea mays]
Length = 902
Score = 820 bits (2117), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/853 (50%), Positives = 555/853 (65%), Gaps = 62/853 (7%)
Query: 52 HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDL-------PGNFLKEVSL 104
HLTPT+++ W SL+P ++ G + E W LYR + G D P L SL
Sbjct: 57 HLTPTEEATWMSLLPRRLRGGGRAEFDWLALYRSLTRGDGPDGGAGKAAGPEGLLSPASL 116
Query: 105 HDVWLDQ----SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISE 160
HDV L SS+ WRAQQTNLEYLL LD D L W+FR+ A LPT G YGGWE P +
Sbjct: 117 HDVRLHGDGSLSSMYWRAQQTNLEYLLYLDPDRLTWTFRQQAGLPTVGDPYGGWEAPDGQ 176
Query: 161 LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFE 220
LRGHFVGHYLSASA WA+THN T++E+M+ VV L CQ K+GTGYLSA+P +FD +E
Sbjct: 177 LRGHFVGHYLSASAHAWAATHNGTLRERMARVVDILHACQKKMGTGYLSAYPETMFDLYE 236
Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWY 280
L W+PYYT HKI+ GLLDQY LA N + L + M +YF NRV+ ++ +++++RHW
Sbjct: 237 QLDEAWSPYYTTHKIMQGLLDQYTLASNEKGLDVVLRMADYFSNRVKNLVQIHTIQRHWE 296
Query: 281 SLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV 340
++NEETGG NDV+Y+LY+IT D KHL +AHLFDKPCFLG L L D +S H NTH+P++
Sbjct: 297 AMNEETGGFNDVMYQLYTITRDQKHLTMAHLFDKPCFLGPLGLHKDDISGLHVNTHLPVL 356
Query: 341 IGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG-SENEE 399
+G+Q RYEV GD LYK I T+ D+VN+SH++ATGGTS E W DPKRL D + S NEE
Sbjct: 357 VGAQKRYEVVGDRLYKDISTYLFDVVNSSHTFATGGTSTMEHWHDPKRLVDEIKISSNEE 416
Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
TC TYN LKVSR+LFRWTKE YAD+YER L NG++ QRGT+PGVM+Y LP+G G SK+
Sbjct: 417 TCATYNFLKVSRNLFRWTKEAKYADHYERLLINGIMGNQRGTQPGVMLYFLPMGPGRSKS 476
Query: 460 RSTH-----------GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
S GWG ++FWCCYGTGIESFSKLGDSIYF EEG+ PGLYIIQYI
Sbjct: 477 VSGQSPSGLPPKNPGGWGGPNDTFWCCYGTGIESFSKLGDSIYFLEEGDTPGLYIIQYIP 536
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
S+FDWK+ + +NQ+ P++S DP+ +++LT S+K+ Q + +++R+P WT ++GA A
Sbjct: 537 STFDWKATGLTVNQRAKPLLSTDPFFKVSLTISAKRGARQ-AKVSVRIPSWTTTDGATAI 595
Query: 569 LNGQNLPLPPPGN-----FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
LNGQ L L P GN FL+ T+ W+ ND LT+ P++LRTEAI+DDRPEYASIQA+LF
Sbjct: 596 LNGQKLNLTPTGNSTNGGFLTITKLWA-NDTLTLHFPITLRTEAIKDDRPEYASIQAVLF 654
Query: 624 GPYLLAGHTSGE-----------------WDIKTGTARSLSALISPI-PPSFNAQLVTFT 665
GP+LLAG T G+ W++ A S++ ++P+ + N+QLVT
Sbjct: 655 GPHLLAGLTHGKLPVTDSSHSNDGLTAGIWEVDATGAASVAGWVTPLHSETLNSQLVTLK 714
Query: 666 QESGNSTFVMSNS--NQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSV 723
Q G T V+S S + + M+E P GTDA +HATFR + S + G +V
Sbjct: 715 QSIGGRTLVLSVSIADAKLEMQEQPAPGTDACVHATFRAYGQAGGSSQL-----LRGPNV 769
Query: 724 MLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVS 783
+EPFD PGM V G ++ + + F V GLD +VSLE R G FV+
Sbjct: 770 TIEPFDRPGMAVTNG------LAVGCRGGRDTLFNAVPGLDGAPGSVSLELATRPGWFVA 823
Query: 784 SG-VNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLL 842
+ A+ +++C A F RAASF + YHP+SF A+G RNFLL PL
Sbjct: 824 TAPTAMHANATTQVVCRANKGGAAFRRAASFARAPPLRRYHPLSFAARGTARNFLLEPLR 883
Query: 843 SFRDEAYTVYFNI 855
S +DE YTVYF++
Sbjct: 884 SLQDEFYTVYFSL 896
>gi|297606169|ref|NP_001058067.2| Os06g0612900 [Oryza sativa Japonica Group]
gi|255677223|dbj|BAF19981.2| Os06g0612900 [Oryza sativa Japonica Group]
Length = 717
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/724 (55%), Positives = 504/724 (69%), Gaps = 53/724 (7%)
Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK- 234
MWASTHN T+ KM+ VV +L +CQ GTGYLSAFP E FD FEA++PVWAPYYTIHK
Sbjct: 1 MWASTHNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKA 60
Query: 235 -------------------------ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
I+ GLLDQ+ +A N +AL M M +YF RV+ V
Sbjct: 61 RNATQSICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGKALGMVVAMADYFAGRVRSV 120
Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
I Y++ERHW SLNEETGGMNDVLY+LY+IT D +HL+LAHLFDKPCFLG LA+QAD LS
Sbjct: 121 IQRYTIERHWTSLNEETGGMNDVLYQLYTITKDQRHLVLAHLFDKPCFLGLLAVQADSLS 180
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
FHANTHIP+VIG QMRYEVTGDPLYK I TFFMDIVN+SHSYATGGTS EFW +PK L
Sbjct: 181 GFHANTHIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVSEFWSNPKHL 240
Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
A+ L +E EE+CTTYNMLKVSRHLFRWTKEIAYADYYERAL NGVLSIQRG +PGVMIYM
Sbjct: 241 AEALTTETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMIYM 300
Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
LP G G SKA S HGWGT++NSFWCCYGTGIESFSKLGDSIYFE++G+ PGLYIIQYI S
Sbjct: 301 LPQGPGRSKAVSYHGWGTQYNSFWCCYGTGIESFSKLGDSIYFEQKGDKPGLYIIQYIPS 360
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
+F+W++ + + Q+V P+ S D YL+++L+ S+ + GQ ++LN+R+P WT NGA+A+L
Sbjct: 361 TFNWRTAGLTVTQQVKPLSSSDQYLQVSLSISAAKTNGQYATLNVRIPSWTSMNGAKATL 420
Query: 570 NGQNLPLPPPGNFLSATERW-SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
N ++L L PG FL+ +++W S +D L +Q P++LRTEAI+DDRP+ AS+ AILFGP+LL
Sbjct: 421 NDKDLQLASPGTFLTISKQWDSGDDHLLLQFPINLRTEAIKDDRPQVASLNAILFGPFLL 480
Query: 629 AGHTSGEWD-IKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQ-SITMEE 686
AG T+G+WD G A + S I+P+P S+N+QLVT TQESG T ++S N S+ M E
Sbjct: 481 AGLTTGDWDAKTGGAATAASDWITPVPASYNSQLVTLTQESGGKTMLLSTVNDTSLAMLE 540
Query: 687 FP--VSGTDAALHATFRLI--------LKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQ 736
P GTDAA+ ATFR++ + A + + +EPF PG V
Sbjct: 541 RPEGAGGTDAAVRATFRVVPPGSRAELRQRAGAGAGEGAARLKVAAATIEPFGLPGTAVS 600
Query: 737 QGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKL 796
G L V + S+ F + GLD + +VSLE ++ GCF+ +G GA + +
Sbjct: 601 NG----LAVVRAGNS-SSTLFNVAPGLDGKPGSVSLELGSKPGCFLVAGA----GAKVHV 651
Query: 797 LCSTE-----SLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTV 851
C T + AGF +AASF + YH ISF A G RR+FLL PL + RDE YT+
Sbjct: 652 GCRTRGGAAAAAAAGFEQAASFAQAEPLRRYHAISFFASGVRRSFLLEPLFTLRDEFYTI 711
Query: 852 YFNI 855
YFN+
Sbjct: 712 YFNL 715
>gi|302818405|ref|XP_002990876.1| hypothetical protein SELMODRAFT_20509 [Selaginella moellendorffii]
gi|300141437|gb|EFJ08149.1| hypothetical protein SELMODRAFT_20509 [Selaginella moellendorffii]
Length = 755
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/775 (53%), Positives = 523/775 (67%), Gaps = 37/775 (4%)
Query: 98 FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
FL+ VSLHDV L S AQQTNL+YLLMLDVD+LV+SFR TA L G AYGGWE P
Sbjct: 1 FLEAVSLHDVRLLPDSWQAIAQQTNLDYLLMLDVDNLVYSFRTTAGLNASGSAYGGWELP 60
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
SELRGHFVGHYLSASA WASTHN TI E M+ VV +L+ECQ KIGTGYLSAFPT LFD
Sbjct: 61 TSELRGHFVGHYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSAFPTSLFD 120
Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
FEAL+ VWAPYYTIHKI+AGLLDQY A N+ A +M M +YF +RV++VI YS+ER
Sbjct: 121 RFEALESVWAPYYTIHKIMAGLLDQYTYAANSLAFEMLLGMTDYFGSRVERVIEKYSIER 180
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
HW SLNEETGGMNDVLYR+Y IT D KHL LAHLFDKPCFLG LA++AD +S FHANTHI
Sbjct: 181 HWQSLNEETGGMNDVLYRVYQITGDAKHLKLAHLFDKPCFLGLLAVRADSISGFHANTHI 240
Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
PIVIG+Q+RYEV GD LYK + +FM IV++SH+YATGGTSA EFW DP RL DTLG+EN
Sbjct: 241 PIVIGAQLRYEVVGDKLYKDLSEYFMKIVSSSHTYATGGTSAGEFWSDPSRLGDTLGTEN 300
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
EE+CTTYNMLKV+R+LFRWTK++ YAD+YERAL NGVL+IQRG EPGVMIYMLPL G S
Sbjct: 301 EESCTTYNMLKVARNLFRWTKQMHYADFYERALINGVLTIQRGKEPGVMIYMLPLAPGSS 360
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEE-GNVPGLYIIQYISSSFDWKSG 516
KA S HGWGT F+SFWCCYGT IESFSKLGDSIYF +E + P LY+IQY+SS W +
Sbjct: 361 KATSYHGWGTPFSSFWCCYGTAIESFSKLGDSIYFTDEVQDTPQLYVIQYLSSKVLWTAA 420
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEV-GQLS--SLNLRMPVWTYSNGAQASLNGQN 573
+ ++Q+V + S DP MT+TF+ Q V G+ S L++R+P W S ++ LNG
Sbjct: 421 GLSVDQRVYHMTSTDPV--MTVTFNFTQLVLGKTSEAKLSVRVPYWAQS--SRCLLNGLE 476
Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
L PG F + W DKL+ LR E IQD+R +Y+S+ AI +GPYLLAG +
Sbjct: 477 LQNLTPGTFFDVSREWKTGDKLSFTFSAMLRLEKIQDERSKYSSLYAIYYGPYLLAGMSD 536
Query: 634 GEWDIKTGTARSLSALISPIPPSFNAQLVTFTQ-ESGNSTFVMSNSNQSITMEEFPVSGT 692
G + + + + S I P+ ++ L +FTQ + G ++ ++S+ +++M P G+
Sbjct: 537 GNYKLGSVNVSTPSRWIKPVR---DSNLFSFTQLQQGKLQYLAASSDGALSMISKPQHGS 593
Query: 693 DAALHATFRLIL-------KDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQ-GKEDELV 744
+ A ATFRL L + + + +SL ++ + V LE + PG V G ED +
Sbjct: 594 EEAPLATFRLKLLPSLKTIEKFQVKDVTSL--LLDREVSLELLNRPGRFVTHFGIEDGVR 651
Query: 745 VSESPKEMGSSG---FRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTE 801
++ S F+L + L +S EA +GCF+ + G + L C
Sbjct: 652 LTNGKSSGFPSSSSVFKLRSALSGHPGEISFEASGIQGCFLVA-----QGRDITLECER- 705
Query: 802 SLDAGFNR-AASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
FN+ AASF + G + YHP+SF A G +L+ PL S+ DE Y VYF +
Sbjct: 706 -----FNKMAASFGVTAGRASYHPMSFEAYGDNDTYLMFPLSSYSDEKYAVYFEV 755
>gi|302785087|ref|XP_002974315.1| hypothetical protein SELMODRAFT_30650 [Selaginella moellendorffii]
gi|300157913|gb|EFJ24537.1| hypothetical protein SELMODRAFT_30650 [Selaginella moellendorffii]
Length = 755
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/775 (53%), Positives = 519/775 (66%), Gaps = 37/775 (4%)
Query: 98 FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
FL VSLHDV L S AQQTNL+YLLMLDVD+LV+SFR TA L G AYGGWE P
Sbjct: 1 FLGAVSLHDVRLLPDSWQAIAQQTNLDYLLMLDVDNLVYSFRTTAGLNASGSAYGGWELP 60
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
SELRGHFVGHYLSASA WASTHN TI E M+ VV +L+ECQ KIGTGYLSAFPT LFD
Sbjct: 61 TSELRGHFVGHYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSAFPTSLFD 120
Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
FEAL+ VWAPYYTIHKI+AGLLDQY A N+ A +M M +YF +RV+ VI YS+ER
Sbjct: 121 RFEALESVWAPYYTIHKIMAGLLDQYTYAANSLAFEMLLGMTDYFGSRVEMVIEKYSIER 180
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
HW SLNEETGGMNDVLYR+Y IT D KHL LAHLFDKPCFLG LA++AD +S FHANTHI
Sbjct: 181 HWQSLNEETGGMNDVLYRIYQITGDAKHLKLAHLFDKPCFLGLLAVRADSISGFHANTHI 240
Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
PIVIG+Q+RYEV GD LYK + +FM IV++SH+YATGGTS+ EFW +P RL DTLG+EN
Sbjct: 241 PIVIGAQLRYEVVGDKLYKDLSEYFMKIVSSSHTYATGGTSSGEFWSNPNRLGDTLGTEN 300
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
EE+CTTYNMLKV+R+LFRWTK++ YAD+YERAL NGVL+IQRG EPGVMIYMLPL G S
Sbjct: 301 EESCTTYNMLKVARNLFRWTKQMHYADFYERALINGVLTIQRGKEPGVMIYMLPLAPGSS 360
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEE-GNVPGLYIIQYISSSFDWKSG 516
KA+S HGWGT F SFWCCYGT IESFSKLGDSIYF E + P LY+IQY+SS W +
Sbjct: 361 KAKSYHGWGTPFTSFWCCYGTAIESFSKLGDSIYFTNEVQDTPQLYVIQYLSSKVLWTAA 420
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEV-GQLS--SLNLRMPVWTYSNGAQASLNGQN 573
+ L+Q+V + S DP MT+TF+ Q V G+ S L++R+P W S ++ LNG
Sbjct: 421 GLSLDQRVYHMTSTDPV--MTVTFNFTQLVLGKTSEAKLSVRVPYWAQS--SRCLLNGLE 476
Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
L PG F + W DKL+ LR E IQD+R +Y+S+ AI +GPYLLAG +
Sbjct: 477 LQNLTPGTFFDVSREWKTGDKLSFTFSAMLRLEKIQDERSKYSSLYAIYYGPYLLAGMSD 536
Query: 634 GEWDIKTGTARSLSALISPIPPSFNAQLVTFTQ-ESGNSTFVMSNSNQSITMEEFPVSGT 692
G + + + + S I P+ S L +FTQ + G ++ ++S+ +++M P G+
Sbjct: 537 GNYKLGSVNVSTPSRWIKPVRDS---NLFSFTQLQQGKLQYLAASSDGALSMISKPQHGS 593
Query: 693 DAALHATFRLIL-------KDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQ-GKEDELV 744
+ A ATFRL L + + + +SL ++ + V LE + PG V G ED +
Sbjct: 594 EEASLATFRLKLLPSLKTIEKIQVKDVTSL--LLDREVSLELLNRPGRFVTYFGIEDGVR 651
Query: 745 VSESPKEMGSSG---FRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTE 801
++ S F+L + L +S EA +GCF+ + G + L C
Sbjct: 652 LTNGKSSGFPSSSSVFKLRSALSGHPGEISFEASGIQGCFLVA-----QGRDITLECER- 705
Query: 802 SLDAGFNR-AASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
FN+ AASF + G + YHP+SF A G +L+ PL S+ DE Y VYF +
Sbjct: 706 -----FNKMAASFGVTTGRASYHPMSFEAYGGNDTYLMFPLSSYSDEKYAVYFEV 755
>gi|302788790|ref|XP_002976164.1| hypothetical protein SELMODRAFT_104205 [Selaginella moellendorffii]
gi|300156440|gb|EFJ23069.1| hypothetical protein SELMODRAFT_104205 [Selaginella moellendorffii]
Length = 797
Score = 757 bits (1954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/788 (49%), Positives = 518/788 (65%), Gaps = 41/788 (5%)
Query: 97 NFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
+ L+ SLH V +D S+ + QQTNLEYLLMLDVDSL +SFR + LPT G YGGWE
Sbjct: 21 HLLEGSSLHKVRIDADSLQGKGQQTNLEYLLMLDVDSLAYSFRNNSGLPTKGVPYGGWEA 80
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
P ELRGHFVGHYLSA+A+MWASTHN +K +M +V L ECQ KIGTGYLSAFP LF
Sbjct: 81 PDQELRGHFVGHYLSATAKMWASTHNEELKRRMDHLVDILDECQQKIGTGYLSAFPLNLF 140
Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
FE +PVWAPYYTIHKI+AGLLDQY A N +AL+M WM +YF RV+ I YS++
Sbjct: 141 TRFETYRPVWAPYYTIHKIMAGLLDQYTEAGNMKALRMVIWMAQYFSKRVENYIEKYSIQ 200
Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
H+ +LNEETGGMNDVLY LY IT DP+HL LAHLFDKPCFLG LALQ D LS FHANTH
Sbjct: 201 AHFQALNEETGGMNDVLYDLYKITGDPQHLKLAHLFDKPCFLGPLALQQDTLSGFHANTH 260
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
IPI+IG+Q RYE+TGD + K + TFFMD VN+SH + TGGTS EFW DP R+A +LG +
Sbjct: 261 IPILIGAQKRYELTGDQVSKELVTFFMDAVNSSHRFVTGGTSDNEFWKDPNRMASSLGKD 320
Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
EE+C++YNMLK++R+LFRWTKE +Y DYYER + NGVL+IQRG EPGVMIYMLP+G G+
Sbjct: 321 VEESCSSYNMLKIARNLFRWTKEASYMDYYERLILNGVLTIQRG-EPGVMIYMLPMGPGM 379
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG----------NVPGLYIIQY 506
+K ST GWG F+SFWCCYGTGIESFSK GDSIYFE+ G +P LY+ Q+
Sbjct: 380 AKTSSTMGWGDPFDSFWCCYGTGIESFSKFGDSIYFEDYGVRDENPGAQRPIPALYVAQF 439
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV---------GQLSSLNLRMP 557
+ S+ +W S ++L Q V P+ S+DP + +T+ + +++L +R+P
Sbjct: 440 VPSTLEWDSAGLILKQTVKPLTSFDPVMEVTIHLHENPKATIEETSPYHKLINTLYVRIP 499
Query: 558 VWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W S G +A N + + PG+FL+ W D+LT + P +R E IQDDR E+ S
Sbjct: 500 SWVAS-GYEAYFNDEPQDI-TPGSFLAIQREWKAGDRLTFKFPAEVRLEHIQDDREEHQS 557
Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN 677
+ I+FGP++LAG + GE+D+ S S I+P+ PS N L TF + + +
Sbjct: 558 LNGIMFGPFVLAGLSHGEFDLGPVDTSSPSDWITPVNPSDNDLLYTFRM----GDYQLGH 613
Query: 678 SNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLV-Q 736
++++T++ +GTD ATF++I + S + ++G+ V LE D PG ++
Sbjct: 614 KHRTVTIDSASTNGTDWDFQATFKVISSSSPSLAASKHSGLVGRVVSLELMDQPGRIIAH 673
Query: 737 QGKEDELVVSESPKEMGSS--------GFRLVAGLDKRNETVSLEAENRKGCFVSSGVNF 788
G LVV ++ + S+ GF++V GL + VS E+++ GC++ ++
Sbjct: 674 SGINKNLVVVDTSQFADSTNYLSQANLGFKVVPGL-ASDRLVSFESQDLPGCYIYVD-DW 731
Query: 789 EPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKG-ARRNFLLAPLLSFRDE 847
A LK C ++ D GF+ ASF + G+ YHP+SFVA RNFLL P L++RDE
Sbjct: 732 RVPAQLK--CRSKEND-GFDAKASFKVSQGLRSYHPLSFVATSQGLRNFLLFPQLAYRDE 788
Query: 848 AYTVYFNI 855
Y +YF++
Sbjct: 789 HYAIYFDM 796
>gi|302769588|ref|XP_002968213.1| hypothetical protein SELMODRAFT_89765 [Selaginella moellendorffii]
gi|300163857|gb|EFJ30467.1| hypothetical protein SELMODRAFT_89765 [Selaginella moellendorffii]
Length = 797
Score = 756 bits (1951), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/788 (49%), Positives = 517/788 (65%), Gaps = 41/788 (5%)
Query: 97 NFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
+ L+ SLH V +D S+ + QQTNLEYLLMLDVDSL +SFR + LPT G YGGWE
Sbjct: 21 HLLEGSSLHKVRIDADSLQGKGQQTNLEYLLMLDVDSLAYSFRNNSGLPTKGVPYGGWEA 80
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
P ELRGHFVGHYLSA+A+MWASTHN +K +M +V L ECQ KIGTGYLSAFP LF
Sbjct: 81 PDQELRGHFVGHYLSATAKMWASTHNEELKRRMDHLVDILDECQQKIGTGYLSAFPLNLF 140
Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
FE +PVWAPYYTIHKI+AGLLDQY A N +AL+M WM +YF RV+ I YS++
Sbjct: 141 TRFETYRPVWAPYYTIHKIMAGLLDQYTEAGNMKALRMVIWMAQYFSKRVENYIEKYSIQ 200
Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
H+ +LNEETGGMNDVLY LY IT DP+HL LAHLFDKPCFLG LALQ D LS FHANTH
Sbjct: 201 AHFQALNEETGGMNDVLYDLYKITGDPQHLKLAHLFDKPCFLGPLALQQDTLSGFHANTH 260
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
IPI+IG+Q RYE+TGD + K + TFFMD VN+SH + TGGTS EFW DP R+A +LG +
Sbjct: 261 IPILIGAQKRYELTGDQVSKELVTFFMDAVNSSHRFVTGGTSDNEFWKDPNRMASSLGKD 320
Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
EE+C++YNMLK++R+LFRWTK+ +Y DYYER + NGVL+IQRG EPGVMIYMLP+G G+
Sbjct: 321 VEESCSSYNMLKIARNLFRWTKDASYMDYYERLILNGVLTIQRG-EPGVMIYMLPMGPGM 379
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG----------NVPGLYIIQY 506
+K ST GWG F+SFWCCYGTGIESFSK GDSIYFE+ G +P LY+ Q+
Sbjct: 380 AKTSSTMGWGDPFDSFWCCYGTGIESFSKFGDSIYFEDYGVRDENPGAQRPIPALYVAQF 439
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV---------GQLSSLNLRMP 557
+ S+ +W S ++L Q V P+ S+DP + +T+ + +++L +R+P
Sbjct: 440 VPSTLEWDSAGLILKQTVKPLTSFDPVMEVTIHLHENPKATIEETSPYHKLINTLYVRIP 499
Query: 558 VWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W S G +A N + + PG+FL+ W DKLT + P +R E IQDDR E+ S
Sbjct: 500 SWVAS-GYEAYFNDEPQDI-TPGSFLAIQREWKAGDKLTFKFPAEVRLEHIQDDREEHQS 557
Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN 677
+ I+FGP++LAG + GE+D+ S S I+P+ PS N L TF + + +
Sbjct: 558 LNGIMFGPFVLAGLSHGEFDLGPVDTSSPSDWITPVNPSDNDLLYTFRM----GDYQLGH 613
Query: 678 SNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLV-Q 736
++++T++ +GTD ATF++I + S + ++G+ V LE D PG ++
Sbjct: 614 KHRTVTLDSASTNGTDWDFEATFKVISSSSPSLAASKHSGLVGRVVSLELLDQPGRIIAH 673
Query: 737 QGKEDELVVSESPKEMGSS--------GFRLVAGLDKRNETVSLEAENRKGCFVSSGVNF 788
G LVV ++ + S+ GF++V GL + VS E+++ GC++ ++
Sbjct: 674 SGINKNLVVVDTSQFADSTNYLSQANLGFKVVPGL-ASDRLVSFESQDLPGCYIYVD-DW 731
Query: 789 EPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKG-ARRNFLLAPLLSFRDE 847
A LK C ++ D GF+ ASF G+ YHP+SFVA RNFLL P L++RDE
Sbjct: 732 RVPAQLK--CRSKEND-GFDAKASFKASQGLRSYHPLSFVATSQGLRNFLLFPQLAYRDE 788
Query: 848 AYTVYFNI 855
Y +YF++
Sbjct: 789 HYAIYFDM 796
>gi|357472933|ref|XP_003606751.1| hypothetical protein MTR_4g065220 [Medicago truncatula]
gi|355507806|gb|AES88948.1| hypothetical protein MTR_4g065220 [Medicago truncatula]
Length = 593
Score = 715 bits (1845), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/677 (55%), Positives = 456/677 (67%), Gaps = 94/677 (13%)
Query: 189 MSTVVFSLSECQNKIGTGYLSAFPTELF-DSFEALKPVWAPYYTIHKIL------AGLLD 241
MS +V LS CQ K G +F + L+ WAPYYTIHK+ LD
Sbjct: 1 MSALVSGLSACQEKNWNGISVCISNRVFLIELKNLEYAWAPYYTIHKLFDFDRSWLAFLD 60
Query: 242 QYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITH 301
QY +A N Q LKM TWMV+YFYNRV VI ++V RH+ SLNEE GGMND+LYRLYS+T
Sbjct: 61 QYTIAGNPQGLKMVTWMVDYFYNRVMNVIQKFTVNRHYQSLNEEAGGMNDLLYRLYSLTR 120
Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTF 361
DPKHL LAHLFDKPCFLG LA+Q + ++ FHANTHIPIV+G+Q+RYE+TGD YK IG +
Sbjct: 121 DPKHLELAHLFDKPCFLGVLAVQGNDIADFHANTHIPIVVGAQLRYELTGDLHYKDIGQY 180
Query: 362 FMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI 420
FMDIVN+SH+YATGGTS EFW +PKR+AD L S E EE+C+TYNMLKVSRHLFRWTKE+
Sbjct: 181 FMDIVNSSHAYATGGTSVGEFWRNPKRIADNLKSAETEESCSTYNMLKVSRHLFRWTKEV 240
Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
YADYYERALTNGVLSIQRGT+PGVMIYMLPLG GVSKA++ WGT F+SFWCCYGTGI
Sbjct: 241 TYADYYERALTNGVLSIQRGTDPGVMIYMLPLGLGVSKAQTYWKWGTPFDSFWCCYGTGI 300
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
ESFSKLGDSIYFEEEG LYIIQYISSSF+W SG
Sbjct: 301 ESFSKLGDSIYFEEEGKHRSLYIIQYISSSFNWNSG------------------------ 336
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
+G S+LN R+P WT +NGA+A LN + LPLP P
Sbjct: 337 ---TAIGTSSTLNFRIPSWTLANGAKALLNSETLPLPAP--------------------- 372
Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQ 660
DDRPE+AS+QAIL+GPYLLAGHT+ W I+PIP ++++Q
Sbjct: 373 ---------DDRPEFASLQAILYGPYLLAGHTT-NW-------------ITPIPSNYSSQ 409
Query: 661 LVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIG 720
LV+++Q+ ST V++NS QS+TME P GT+ A HATFRLI KDA G
Sbjct: 410 LVSYSQDINKSTLVITNSKQSLTMEILPGPGTENAPHATFRLIPKDAD-----------G 458
Query: 721 KSVMLEPFDFPGMLV-QQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKG 779
K+VMLEPFD PGM V QG E L++ +S SS F +V GLD RN+T+SLE+++ K
Sbjct: 459 KTVMLEPFDLPGMTVSHQGPEKPLIIVDSSHGGPSSVFLVVPGLDGRNQTISLESQSNKD 518
Query: 780 CFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLA 839
C+V S + G+ +KL+C + S + FN+A SF+ G+ +Y+PISFVAKGA +NFLL
Sbjct: 519 CYVHS--DMSAGSGVKLVCKSAS-ETSFNQANSFVSGKGLRQYNPISFVAKGANQNFLLE 575
Query: 840 PLLSFRDEAYTVYFNIQ 856
PL +FRDE YTVYFN+Q
Sbjct: 576 PLFNFRDEHYTVYFNLQ 592
>gi|449522353|ref|XP_004168191.1| PREDICTED: uncharacterized protein LOC101224273 [Cucumis sativus]
Length = 495
Score = 666 bits (1719), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/494 (66%), Positives = 397/494 (80%), Gaps = 4/494 (0%)
Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
MDIVN+SHSYATGGTS EFW DPKRLAD LG+E EE+CTTYNMLKVSR+LF+WTKEIAY
Sbjct: 1 MDIVNSSHSYATGGTSVHEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTKEIAY 60
Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
ADYYERALTNGVLSIQRGT+PGVMIYMLPLG G SKA S HGWGT F SFWCCYGTGIES
Sbjct: 61 ADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKAISYHGWGTPFESFWCCYGTGIES 120
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
FSKLGDSIYFEEE P LY+IQYISSS DWKSG+V+LNQ VDPI S DP LRMTLTFS
Sbjct: 121 FSKLGDSIYFEEELQTPTLYVIQYISSSLDWKSGNVLLNQTVDPIHSEDPKLRMTLTFSP 180
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
K V S++NLR+P WT ++GA+ LNGQ+L GNF S T WS +KL+++LP++
Sbjct: 181 KGSV-HSSTINLRIPSWTSASGAKVVLNGQSLGNNINGNFKSVTNSWSSGNKLSLELPIN 239
Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
LRTEAI DDR EYAS++AILFGPYLLA +++G+W+IKT A SLS I+ +P ++N LV
Sbjct: 240 LRTEAIDDDRSEYASVKAILFGPYLLAAYSNGDWEIKTQQADSLSDWITHVPSAYNTFLV 299
Query: 663 TFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKS 722
TF+Q SG ++F ++NSNQSITME++P GTD+A+HATFRLI+ D S + + L +VIGK
Sbjct: 300 TFSQASGKTSFALTNSNQSITMEKYPGQGTDSAVHATFRLIIDDPS-AKVTELQDVIGKR 358
Query: 723 VMLEPFDFPGMLV-QQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKGCF 781
VMLEPF FPGM++ +GK++ L ++++ E SS F LV GLD +N TVSL + + +GCF
Sbjct: 359 VMLEPFSFPGMVLGNKGKDERLEIADANSEGHSSDFYLVEGLDGKNGTVSLASIDNEGCF 418
Query: 782 VSSGVNFEPGASLKLLCSTE-SLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAP 840
V SGVN+E GA LKL C ++ SLD GF+ A+SF++E G S+YHPISFV KG RNFLLAP
Sbjct: 419 VYSGVNYESGAQLKLSCKSKLSLDDGFDEASSFLLESGASQYHPISFVTKGMTRNFLLAP 478
Query: 841 LLSFRDEAYTVYFN 854
LLSF DE+YTVYFN
Sbjct: 479 LLSFVDESYTVYFN 492
>gi|125556048|gb|EAZ01654.1| hypothetical protein OsI_23690 [Oryza sativa Indica Group]
Length = 466
Score = 612 bits (1578), Expect = e-172, Method: Compositional matrix adjust.
Identities = 295/461 (63%), Positives = 356/461 (77%), Gaps = 26/461 (5%)
Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK- 234
MWASTHN T+ KM+ VV +L +CQ GTGYLSAFP E FD FEA++PVWAPYYTIHK
Sbjct: 1 MWASTHNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKA 60
Query: 235 -------------------------ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
I+ GLLDQ+ +A N +AL M M +YF RV+ V
Sbjct: 61 RNATQSICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGRALGMVVAMADYFAGRVRSV 120
Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
I Y++ERHW SLNEETGGMNDVLY+LY+IT D +HL+LAHLFDKPCFLG LA+QAD LS
Sbjct: 121 IQRYTIERHWTSLNEETGGMNDVLYQLYTITKDQRHLVLAHLFDKPCFLGLLAVQADSLS 180
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
FHANTHIP+VIG QMRYEVTGDPLYK I TFFMDIVN+SHSYATGGTS EFW +PK L
Sbjct: 181 GFHANTHIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVSEFWSNPKHL 240
Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
A+ L +E EE+CTTYNMLKVSRHLFRWTKEIAYADYYERAL NGVLSIQRG +PGVMIYM
Sbjct: 241 AEALTTETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMIYM 300
Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
LP G G SKA S HGWGT++NSFWCCYGTGIESFSKLGDSIYFE++G+ PGLYIIQYI S
Sbjct: 301 LPQGPGRSKAVSYHGWGTQYNSFWCCYGTGIESFSKLGDSIYFEQKGDKPGLYIIQYIPS 360
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
+F+W++ + + Q+V P+ S D YL+++L+ S+ + GQ ++LN+R+P WT NGA+A+L
Sbjct: 361 TFNWRTAGLTVTQQVKPLSSSDQYLQVSLSISAAKTNGQYATLNVRIPSWTSMNGAKATL 420
Query: 570 NGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
N ++L L PG FL+ +++W D L +Q P++LRTEAI+D
Sbjct: 421 NDKDLQLASPGTFLTISKQWDSGDHLLLQFPINLRTEAIKD 461
>gi|413926260|gb|AFW66192.1| hypothetical protein ZEAMMB73_605676 [Zea mays]
gi|413952504|gb|AFW85153.1| hypothetical protein ZEAMMB73_422486 [Zea mays]
Length = 510
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 291/517 (56%), Positives = 381/517 (73%), Gaps = 13/517 (2%)
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
MRYEVTGDPLYK I +FFMD +N+SHSYATGGTSA EFW DPKRLA TL +ENEE+CTTY
Sbjct: 1 MRYEVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTY 60
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
NMLKVSR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA S HG
Sbjct: 61 NMLKVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHG 120
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
WGTK++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+ + + Q++
Sbjct: 121 WGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQQI 180
Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLS 584
+ S D YL+++ + S+ GQ +++N R+P WT+++GA A+LNG++L PG+FLS
Sbjct: 181 KTLSSSDQYLQISFSISANTS-GQTANINFRIPSWTFADGAGATLNGKDLGSISPGSFLS 239
Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTAR 644
T++W+ +D L + P+ LRTEAI+DDR EYAS+QA+LFGP++LAG ++G+WD K G
Sbjct: 240 ITKQWNSDDHLALHFPIRLRTEAIKDDRLEYASLQAVLFGPFVLAGLSTGDWDAKAGNGS 299
Query: 645 SLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP-VSGTDAALHATFRLI 703
++S I+ +PP+ N+QLVTFTQ S FV+S++N ++TM+E P V GTDAA+HATFR
Sbjct: 300 AISDWIAAVPPAHNSQLVTFTQVSNGKAFVLSSANGTLTMQERPEVDGTDAAIHATFRAH 359
Query: 704 LKDAS--LSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVA 761
++ S L + S + G S++LEPFD PG ++ ++ S ++ S F +V
Sbjct: 360 PQEDSTELHDIYS-TTLTGTSILLEPFDLPGTVITNN------LTLSAQKSSDSLFNIVP 412
Query: 762 GLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLC--STESLDAGFNRAASFMMEIGI 819
GLD +VSLE + GCF+ +G N+ G +++ C S ES+ +AASF +
Sbjct: 413 GLDGNPNSVSLELGTKPGCFLVTGTNYSAGTRIEVNCKSSLESIGGILEQAASFSQTDPL 472
Query: 820 SEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNIQ 856
+YHPISFVAKG RNFLL PL S RDE YTVYFN++
Sbjct: 473 RQYHPISFVAKGVARNFLLEPLYSLRDEFYTVYFNVR 509
>gi|413954825|gb|AFW87474.1| hypothetical protein ZEAMMB73_309562 [Zea mays]
Length = 483
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 273/501 (54%), Positives = 355/501 (70%), Gaps = 28/501 (5%)
Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
MD VN+SH+YATGGTS EFW +PKRLA+ L +E EE+CTTYNMLKVSRHLFRWTKEIAY
Sbjct: 1 MDTVNSSHAYATGGTSVSEFWSNPKRLAEALTTETEESCTTYNMLKVSRHLFRWTKEIAY 60
Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
ADYYERAL NGVLSIQRG +PGVMIYMLP G G SKA+S HGWGT++ SFWCCYGTGIES
Sbjct: 61 ADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAKSYHGWGTQYESFWCCYGTGIES 120
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
FSKLGDSIYFEE G P LY++Q+I S+F W++ + + Q++ P+ S D YL+++ + S+
Sbjct: 121 FSKLGDSIYFEERGERPALYVVQFIPSTFSWRTAGLTVAQQLMPLSSSDQYLQVSFSVSA 180
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
K GQ ++LN+R+P WT NGA+A+LNG++L L PG FL+ +++W D+L++QLP+
Sbjct: 181 KTTNGQFATLNVRIPSWTSLNGAKATLNGKHLELASPGTFLTISKQWGSGDQLSLQLPIH 240
Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG-TARSLSALISPIPPSFNAQL 661
LRTEAI+DDRPEYASIQA+LFGP+LLAG T+G+WD KTG + S I+P+P N+QL
Sbjct: 241 LRTEAIKDDRPEYASIQAVLFGPFLLAGLTTGDWDAKTGAADAAASDWITPVPVESNSQL 300
Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPV--SGTDAALHATFRLILKDASLSNFSSLNNVI 719
VT QESG FV+S N S+TM + P GT+AA+HATFRL+ + + +
Sbjct: 301 VTLAQESGGEAFVLSALNGSLTMLQRPKDGGGTEAAVHATFRLVPQGGAGAG-------- 352
Query: 720 GKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKRNETVSLEAENRKG 779
+ MLEP D PGM+V D L V+ + K G++ F +V GL +VSLE +R G
Sbjct: 353 -AAAMLEPLDMPGMVV----TDRLTVA-AEKSSGAA-FNVVPGLAGAPGSVSLELASRPG 405
Query: 780 CFVSSGVNFEPGASLKLLCSTESLD-----AGFNRAASFMMEIGISEYHPISFVAKGARR 834
CF+ G G +++ C+ + A F R+ASF + YHP+SF A+G RR
Sbjct: 406 CFLVGG-----GEKVQVGCAGGAQQKRGDGAWFRRSASFARGEPLRRYHPMSFAARGVRR 460
Query: 835 NFLLAPLLSFRDEAYTVYFNI 855
+FLL PL + RDE YTVYFN+
Sbjct: 461 SFLLEPLFTLRDEFYTVYFNL 481
>gi|449531121|ref|XP_004172536.1| PREDICTED: uncharacterized LOC101224273, partial [Cucumis sativus]
Length = 366
Score = 500 bits (1287), Expect = e-138, Method: Compositional matrix adjust.
Identities = 240/340 (70%), Positives = 281/340 (82%), Gaps = 3/340 (0%)
Query: 19 KQCTNQ-SPYDSHAFRYELTST-NKTWKEEVLSHFHLTPTDDSAWSSLIPSKILGDQKDE 76
K+CTN + SH FRYEL S+ N TWK+E+ SH+HLTPTDD AWS+L+P K+L ++ +E
Sbjct: 28 KECTNTPTQLGSHTFRYELLSSGNVTWKKELFSHYHLTPTDDFAWSNLLPRKMLKEE-NE 86
Query: 77 VSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVW 136
+W ++YR++KN G +PG LKE+SLHDV LD +S+ AQ TNL+YLLMLDVD L+W
Sbjct: 87 YNWEMMYRQMKNKDGLRIPGGMLKEISLHDVRLDPNSLHGTAQTTNLKYLLMLDVDRLLW 146
Query: 137 SFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSL 196
SFRKTA LPTPG+ Y GWE ELRGHFVGHYLSASAQMWAST N+ +KEKMS +V L
Sbjct: 147 SFRKTAGLPTPGEPYVGWEKSDCELRGHFVGHYLSASAQMWASTGNSVLKEKMSALVSGL 206
Query: 197 SECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMAT 256
+ CQ+K+GTGYLSAFP+E FD FEA++PVWAPYYTIHKILAGLLDQY A N+QALKM T
Sbjct: 207 ATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAGLLDQYTFAGNSQALKMVT 266
Query: 257 WMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPC 316
WMVEYFYNRVQ VI Y+VERH+ SLNEETGGMNDVLYRLY IT + KHLLLAHLFDKPC
Sbjct: 267 WMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYRITGNTKHLLLAHLFDKPC 326
Query: 317 FLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYK 356
FLG LA+QA+ +S FH NTHIPIV+GSQMRYEVTGDPLYK
Sbjct: 327 FLGLLAVQAEDISGFHVNTHIPIVVGSQMRYEVTGDPLYK 366
>gi|159491176|ref|XP_001703549.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280473|gb|EDP06231.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1485
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 296/875 (33%), Positives = 416/875 (47%), Gaps = 180/875 (20%)
Query: 117 RAQQTNLEYLL-MLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFVGHYLSASA 174
R ++ N +YLL MLD D L+W FRK A LPTPG+ Y G WE+P ELRGHFVGHYLSA +
Sbjct: 557 RYERINSKYLLDMLDADRLLWVFRKNAGLPTPGEPYVGSWEDPNCELRGHFVGHYLSALS 616
Query: 175 QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK 234
WA T N+ K ++ +V L + Q K+GTGYLSAFPT FD E+L+ VWAPYYTIHK
Sbjct: 617 LAWAGTGNSAFKTRLDLMVSELGKVQEKLGTGYLSAFPTSWFDRVESLQAVWAPYYTIHK 676
Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVL 293
I+AGL+D + LA + AL MAT MV+Y +NR Q VI+ +HW + E E GGMN++L
Sbjct: 677 IIAGLVDAHELAGHPSALTMATRMVDYHWNRTQAVISKKGA-KHWQKVLEFEYGGMNEIL 735
Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
YRLY IT H A LFDK FLG +A D L HANTH+ ++G YE TG+P
Sbjct: 736 YRLYLITGKDDHRDFASLFDKTVFLGHMAAHDDVLYDLHANTHLAQIVGFAAGYEATGNP 795
Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
+ F +IV H YATGGTS E WW + + ETCT YNMLK++R L
Sbjct: 796 KLRTAVNNFFEIVVQHHGYATGGTSVFERWWGRRGRGPRNALKTHETCTQYNMLKIARQL 855
Query: 414 FRWTKEIAYADYYERALTNGVLSIQR-------------------GTEP----------- 443
F WT ++ YAD+YERA+ NG+ + R G +P
Sbjct: 856 FMWTGDVYYADHYERAMVNGMWGVARLPADELPENGAAGAGGVDKGGQPVSPYTRFHDDE 915
Query: 444 ----------------------GVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
GV +Y+LP+G G SK+ + H WG F+SFWCCYGT IE
Sbjct: 916 WMDYISFSKPKPEWNASDAAGPGVYLYLLPMGHGNSKSDNLHHWGFPFHSFWCCYGTIIE 975
Query: 482 SFSKLGDSIYF-------------EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
S++KL DSI+F E+ G ++ + D + K+ P +
Sbjct: 976 SYAKLADSIFFKWVRVRDMSPESDEDAGAKTAKKRTRHDVNPSDGSASGAKGAVKLPPRL 1035
Query: 529 SWDPYLRMTLTFSSKQEVGQLS----SLNLRMPVWTYSNGAQASLNGQ---NLPLPP-PG 580
+ ++ L+ +S + +L LR+P W G LNGQ P P P
Sbjct: 1036 YLNQFVSSRLSKASSTTASGPTDGVFTLMLRIPAWARDGGVLLELNGQAFNGCPGAPLPD 1095
Query: 581 NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKT 640
++ T +W D L++++ L QD R EY S++A++ GPY++AG W+
Sbjct: 1096 SYCRITRKWQARDVLSVRVALRWWFSPAQDAREEYRSLKAVMMGPYMMAG-----WN--- 1147
Query: 641 GTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATF 700
S + +AQ++ G+S +S+ S+ +G ++L +
Sbjct: 1148 ----------SSLHLRHDAQILYIEDADGSS----GHSHGSL-------AGAFSSLRSMM 1186
Query: 701 RLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSES-PKEMGSSGFR- 758
RL D+ G ++ LE +P + D +V+ P+E S F
Sbjct: 1187 RLGAADS------------GSALSLEAMSYPNHYLAHDHTDVIVLQPGPPREDASHPFAP 1234
Query: 759 -------LVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGAS------------------ 793
+ GLD +TVS EA R G FV++ PG S
Sbjct: 1235 CSRAMWMMRPGLDGAADTVSFEAVARPGWFVTAAR--PPGESAAAAKDSPVTCVDANEVD 1292
Query: 794 ---------------LKLLC--------STESL---------DAGFNRAASFMMEIGISE 821
++LC TE A + ASF + +
Sbjct: 1293 CTAAVPDGCGTNAFLARVLCRKSCRSCLGTEQALRLRQQVPGSAVYAATASFRLAPPVRR 1352
Query: 822 YHPI-SFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
+P + V G+ R++L+APL + DE Y+ YFN+
Sbjct: 1353 AYPAGAHVLAGSNRHYLIAPLGNLVDERYSAYFNV 1387
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 67/213 (31%), Positives = 109/213 (51%), Gaps = 36/213 (16%)
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV---- 498
PGV IY+LPLG G SK+ + H WG F+SFWCCYGT IES++KL DSIYF+E
Sbjct: 195 PGVFIYLLPLGTGQSKSDNIHHWGFPFHSFWCCYGTVIESYAKLADSIYFKEMSPANPES 254
Query: 499 -----------PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVG 547
P LY+ Q +SS W ++ + + D + + P LT S + G
Sbjct: 255 RAHDKAGVRLPPRLYVNQLVSSKATWAEMNLRVTMQAD-MFTPGPAAVAQLTLDSTKAPG 313
Query: 548 QLS------SLNLRMPVWTYSN----------GAQASLNGQ---NLPLP-PPGNFLSATE 587
+ +L +R+P W + GA +NGQ + P P G++ +
Sbjct: 314 PGTHDLGTFTLMVRVPEWLAPDRHGGVAQGGSGASIEVNGQLWTSCPGPVKAGSYCALMR 373
Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
RW+ D ++++LP+ R +++ ++R ++ +++
Sbjct: 374 RWASGDGVSLRLPMRWRLQSLAENRAQHQGLKS 406
Score = 106 bits (265), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 58/140 (41%), Positives = 77/140 (55%), Gaps = 22/140 (15%)
Query: 305 HLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMD 364
H+ A LF+KP F + D L + HANTH+ V G Y D
Sbjct: 2 HMEFAQLFNKPFFRKPMEAGNDMLMNLHANTHLAQVAGFAEEY----------------D 45
Query: 365 IVNASHSYATGGTSAREFWWDPKRLADTL-----GSENEETCTTYNMLKVSRHLFRWTKE 419
V+ +ATGG++ EFW P LAD++ G E +ETCT YN+LK++R LFRWT +
Sbjct: 46 TVD-KRVFATGGSTDHEFWQAPDELADSVLTQKHGVETQETCTQYNILKIARSLFRWTGD 104
Query: 420 IAYADYYERALTNGVLSIQR 439
+ YAD+YERAL NG+L R
Sbjct: 105 VRYADFYERALVNGILGTAR 124
>gi|218198541|gb|EEC80968.1| hypothetical protein OsI_23691 [Oryza sativa Indica Group]
Length = 759
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 242/518 (46%), Positives = 317/518 (61%), Gaps = 57/518 (11%)
Query: 385 DPKRLADTLG-SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
DPKRL D + S NEETC TYN+LKVSR+LFRWTKE Y D+YER L NG++ QRG EP
Sbjct: 249 DPKRLVDEIKISSNEETCATYNLLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEP 308
Query: 444 GVMIYMLPLGRGVSKA-----------RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
GVMIY LP+G G SK+ ++ GWG +FWCCYGTGIESFSKLGDSIYF
Sbjct: 309 GVMIYFLPMGPGRSKSISGMPTSGLPPKNPGGWGNANATFWCCYGTGIESFSKLGDSIYF 368
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
EEG +PGLYIIQYI S+FDWK+ + + Q+ P+ S D + +++ SSK + + +++
Sbjct: 369 LEEGEIPGLYIIQYIPSTFDWKAAGLTVKQQAKPLSSTDSHFEVSIFISSKGDA-RPANV 427
Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
N+R+P WT +GA A+LNGQ L L G+FLS T+ W +D L+++ P++LRTE I+DDR
Sbjct: 428 NVRIPSWTSVDGAIATLNGQKLNLTSAGDFLSVTKLWG-DDTLSLKFPITLRTEPIKDDR 486
Query: 613 PEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPI------------------- 653
PEY+SIQA+LFGP+LLAG T G +KT + S S L +
Sbjct: 487 PEYSSIQAVLFGPHLLAGLTHGNQTVKT-SNDSNSGLTPGVWEVNATHAAAAVAVWVTPV 545
Query: 654 PPSFNAQLVTFTQESGN----STFVMSNS--NQSITMEEFPVSGTDAALHATFRLILKDA 707
S N+QLVT TQ G+ + FV+S S + ++TM+E PV+G+DA +HATFR +
Sbjct: 546 SQSLNSQLVTLTQRDGDAQAAAAFVLSVSIADGALTMQESPVAGSDACVHATFRAYHSPS 605
Query: 708 SLSNF-SSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGLDKR 766
S ++ + G+ V LEPFD PGM V D L V + ++ F VAGLD
Sbjct: 606 GASAIDAATGRLQGRDVALEPFDRPGMAVT----DALSVG---RPGPATRFNAVAGLDGL 658
Query: 767 NETVSLEAENRKGCFVSSGVN-FEPGASLKLLCSTESL--------DAGFNRAASFMMEI 817
TVSLE R GCFV++ + GA ++ C + D F RAASF
Sbjct: 659 PGTVSLELATRPGCFVAAPTTAYLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASFTQAA 718
Query: 818 GISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
+ YHP+SF A G RNFLL PL S +DE YTVYFN+
Sbjct: 719 PLRLYHPLSFSATGTDRNFLLEPLQSLQDEFYTVYFNV 756
Score = 204 bits (520), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 98/190 (51%), Positives = 127/190 (66%), Gaps = 8/190 (4%)
Query: 52 HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDL---PGNFLKEVSLHDVW 108
HL +++ W L+P + +DE+ W LYR I GG D+ P FL SLHDV
Sbjct: 57 HLNQAEEATWMGLLPRR--AGPRDELDWLALYRSITR-GGGDVGGEPAGFLSPASLHDVR 113
Query: 109 LDQ--SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
+D +++ W+ QQTNLEYLL LD D L W+FR+ A LPT G+ YGGWE P +LRGHF
Sbjct: 114 VDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWEAPDGQLRGHFT 173
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVW 226
GHYLSA+A MWASTHN ++EKM+ VV L CQ K+ TGYLSA+P +FD+++ L W
Sbjct: 174 GHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDAYDELAEAW 233
Query: 227 APYYTIHKIL 236
+PYYTIHK +
Sbjct: 234 SPYYTIHKFI 243
>gi|384252025|gb|EIE25502.1| DUF1680-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 648
Score = 417 bits (1071), Expect = e-113, Method: Compositional matrix adjust.
Identities = 247/645 (38%), Positives = 359/645 (55%), Gaps = 52/645 (8%)
Query: 97 NFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWE 155
+ ++ L + L++ S+ +A N +Y+L L+ D L+ +FR A LP+ + + G WE
Sbjct: 20 DIIQPFPLDQITLERDSLFDKALALNTDYMLQLNADQLLHTFRLNAGLPSSAQPFTGSWE 79
Query: 156 NPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL 215
+P E+RG F+GHYLSA + + T N I+ +++ ++ L + Q + GYLSAFP E
Sbjct: 80 DPSCEVRGQFMGHYLSACSMLVNHTGNGKIESRLTYIIDELRKVQIALSGGYLSAFPEEH 139
Query: 216 FDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSV 275
F ++L+ VWAP+Y IHKI+AGLLD + AL+M E+F V+
Sbjct: 140 FVRLQSLQTVWAPFYVIHKIMAGLLDAHNFLGYDVALEMVKDEAEHFTRYYNDVVATNGT 199
Query: 276 ERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHAN 334
E HW + E E GGMN+VL+ LY +T DP+H+ LA F KP F L D L HAN
Sbjct: 200 E-HWLRMLEVEFGGMNEVLFNLYDVTGDPEHIRLAEAFTKPKFFEPLLQNTDPLPGLHAN 258
Query: 335 THIPIVIGSQMRYE-VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
TH+ V G R+E + D Y + FF IV HS+ATGG + E+W P++LAD++
Sbjct: 259 THLAQVNGFAARFEKASHDGSYAAVTNFF-SIVTRGHSFATGGNNDHEYWGPPRQLADSI 317
Query: 394 ---GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR--------GTE 442
+E EETCT YNMLK++R+LFRWT +ADYYERA+ NG+L QR +
Sbjct: 318 LLHATETEETCTQYNMLKIARYLFRWTGAPVFADYYERAILNGLLGTQRMPADYSPHTSR 377
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
PGV+IY+LP+G G +K ST GWG +SFWCCYG+ +ESFSKL DSI+F + + L
Sbjct: 378 PGVVIYLLPMGSGQTKGGSTRGWGDPLHSFWCCYGSSVESFSKLADSIFFYRQAHSSCLT 437
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL----TFSSKQEVGQLS-------- 550
+ Y + + S P+V L+ + T S+ V LS
Sbjct: 438 LHAYPAHFYTSAS-------LASPLVGLSVQLQASFFQGTTASANITVAPLSAAAHDSTA 490
Query: 551 --SLNLRMPVWTYSNGAQASLNGQN------LPLPPPGNFLSATERWSYNDKLTIQLPLS 602
+L LR+P W S+G + +NGQ+ P G+F + R++ DK+T+ LP+S
Sbjct: 491 EVTLKLRIPSWAVSSGVRVEVNGQSWADCAPAAGPQAGSFCTVRRRFAAGDKVTLALPMS 550
Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
+R E +QDDRPEY+S AI+ GP L+AG T+G I+ R ++ L++ I A L+
Sbjct: 551 IRAERVQDDRPEYSSQHAIMMGPLLMAGITNGSRSIQ-ADPRKVADLLTDISSQGLASLI 609
Query: 663 TFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLI-LKD 706
G+ + + + E P+ G AL +TFRL+ LKD
Sbjct: 610 I----PGDLPLHIRHEGAMLRAE--PMKGP-YALDSTFRLLGLKD 647
>gi|390957656|ref|YP_006421413.1| hypothetical protein Terro_1782 [Terriglobus roseus DSM 18391]
gi|390412574|gb|AFL88078.1| hypothetical protein Terro_1782 [Terriglobus roseus DSM 18391]
Length = 635
Score = 353 bits (907), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 221/552 (40%), Positives = 292/552 (52%), Gaps = 43/552 (7%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG-HYLSASAQ 175
R+ N +YL L VD L+ SFR TA + + K YGGWE P ELRGHF G HYLSA A
Sbjct: 60 RSADVNEKYLDSLQVDRLLHSFRLTAGITSSAKPYGGWEIPNGELRGHFAGGHYLSAVAF 119
Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
A N T++EK + +V L+ CQ G GYLSA+P ELF K VWAP+YT HKI
Sbjct: 120 ASAGAGNTTLREKGNALVAGLAACQKANGNGYLSAYPPELFQRLALGKQVWAPFYTYHKI 179
Query: 236 LAGLLDQYVLADNAQALK----MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMND 291
+AGL+D Y N ALK MA W YF + M +R L E GGMN+
Sbjct: 180 MAGLVDMYTQTGNEDALKVAEGMAGWSSAYFAD-------MSDAQRQGI-LRIEYGGMNE 231
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTG 351
VL LYS+T ++L A F++P FL LA D L HANT IP +IG+ YE TG
Sbjct: 232 VLVNLYSLTGKERYLSQARKFEQPTFLDPLAAHRDELQGLHANTSIPKIIGAARMYEATG 291
Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK-RLADTLGSENEETCTTYNMLKVS 410
D Y+ I ++F+D V ++H+YA G TS E W P LA +L +N E C YN++K+
Sbjct: 292 DRRYQEIASYFLDDVLSAHTYAIGNTSDDEHWRTPAGSLAGSLSLKNAECCVAYNLMKLE 351
Query: 411 RHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFN 470
RHL WT + + D YER L N L Q G+ Y PL G + +G+
Sbjct: 352 RHLSAWTGDARWMDAYERTLFNARLGTQDAA--GLKQYFFPLAAGYWRV-----YGSPEE 404
Query: 471 SFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSW 530
SFWCC GTG E F+K GDSIYF V Y+ Q+I+S WK L Q+
Sbjct: 405 SFWCCTGTGAEDFAKFGDSIYFHANDTV---YVNQFIASVLTWKEKGFTLRQETS--FPS 459
Query: 531 DPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERW 589
+ R+T+ + QE S+ +R+P W ++G ++N + L PG++L W
Sbjct: 460 ESQTRLTIQTAQPQE----RSIAIRIPSWI-ADGGFVAVNDKRLEAFAEPGSYLVIRRTW 514
Query: 590 SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH-----TSGEWDIKT--GT 642
D +T+ LP++LR E + P + A L+GP +LAG TSG I T GT
Sbjct: 515 HAGDTVTVHLPMALREEPL----PGSPNTAAALYGPLVLAGTLGDGPTSGPTKILTGRGT 570
Query: 643 ARSLSALISPIP 654
A +P+P
Sbjct: 571 APEGVPAAAPLP 582
>gi|225872906|ref|YP_002754363.1| Tat pathway signal sequence domain-containing protein [Acidobacterium capsulatum ATCC 51196]
gi|225794208|gb|ACO34298.1| Tat pathway signal sequence domain protein [Acidobacterium capsulatum ATCC 51196]
Length = 644
Score = 347 bits (891), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 209/527 (39%), Positives = 287/527 (54%), Gaps = 39/527 (7%)
Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG-HY 169
+ VL A + N +YL ++ D L+ +FR TA LPT + GGWE P ELRGHF G HY
Sbjct: 66 RDGVLKNALEINRQYLYLVPNDRLLHTFRLTAGLPTSAEPLGGWEAPDCELRGHFAGGHY 125
Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPY 229
LSA A M+AST + IK K +V L++CQ GYLSAFP FD + VWAP+
Sbjct: 126 LSACALMYASTGDEKIKAKGDALVAELAKCQQP--DGYLSAFPASFFDRLRHYQKVWAPF 183
Query: 230 YTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
YT HKI+AG LD YV N QAL +MA W +EY K I +R L E
Sbjct: 184 YTYHKIMAGHLDMYVHTGNQQALETCKRMADWAIEY-----TKPIPADQWQR---MLLVE 235
Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
GGMN+V + LY++T + K+ L F+ LA + D+L+ HANT+IP VIG+
Sbjct: 236 QGGMNEVSFNLYAVTGEKKYRDLGFRFEHKLIFDPLAKREDHLAGNHANTNIPKVIGAAR 295
Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
YEV D Y I FF V + H+YATGGTS EFW P LA+ LG EE C +YN
Sbjct: 296 GYEVADDKRYHTIAEFFWGAVTSQHAYATGGTSDGEFWHKPGTLAEHLGPAAEECCCSYN 355
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
M+K+SRHL+ WT + DYYER + N + Q G+++Y + L G K +
Sbjct: 356 MMKLSRHLYGWTGDPRIFDYYERLMYNVRIGTQ--DPKGMLMYYVSLKPGYWKT-----F 408
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
GT F++FWCC GTG+E +SK+ DSIYF + N+ Y+ + S W +V L Q+ +
Sbjct: 409 GTPFDAFWCCTGTGVEEYSKVNDSIYFHDAKNI---YVNLFAGSEVQWPEKNVSLVQETN 465
Query: 526 -PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFL 583
P L T + + + L +R+P W +NG +NGQ + P ++
Sbjct: 466 FP-------LEEATTLTVRAQKPSAFGLKIRVPYWA-TNGFTIHINGQPQSVEAKPESYA 517
Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ W D + + +P+SL I P+ +QA+L+GP +LAG
Sbjct: 518 TLHRTWHDGDTIKVSMPMSLHISPI----PDSPDVQAVLYGPLVLAG 560
>gi|413926259|gb|AFW66191.1| hypothetical protein ZEAMMB73_605676 [Zea mays]
gi|413952505|gb|AFW85154.1| hypothetical protein ZEAMMB73_422486 [Zea mays]
Length = 250
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 160/239 (66%), Positives = 196/239 (82%), Gaps = 1/239 (0%)
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
MRYEVTGDPLYK I +FFMD +N+SHSYATGGTSA EFW DPKRLA TL +ENEE+CTTY
Sbjct: 1 MRYEVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTY 60
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
NMLKVSR+LFRWTKEIAYADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA S HG
Sbjct: 61 NMLKVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHG 120
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
WGTK++SFWCCYGTGIESFSKLGDSIYFEE+G+ P L IIQYI S+++WK+ + + Q++
Sbjct: 121 WGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQQI 180
Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
+ S D YL+++ + S+ GQ +++N R+P WT+++GA A+LNG++L PG +
Sbjct: 181 KTLSSSDQYLQISFSISANTS-GQTANINFRIPSWTFADGAGATLNGKDLGSISPGKIV 238
>gi|116620365|ref|YP_822521.1| hypothetical protein Acid_1242 [Candidatus Solibacter usitatus
Ellin6076]
gi|116223527|gb|ABJ82236.1| protein of unknown function DUF1680 [Candidatus Solibacter usitatus
Ellin6076]
Length = 664
Score = 336 bits (862), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 215/540 (39%), Positives = 290/540 (53%), Gaps = 53/540 (9%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE-----------NPISELRGHFV 166
A + N Y+ L D L+ +FR A LP+ + GGWE N ELRGHFV
Sbjct: 82 AAEWNRGYMNRLPADRLLHAFRLNAGLPSSAQPLGGWEIYVEPTPGKRINSEGELRGHFV 141
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSAFPTELFDSFEALKPV 225
GH+LSASAQ++AS + K K +V L++CQ K+G +GYLSAFP E FD +A KPV
Sbjct: 142 GHFLSASAQLYASMGDKDAKAKADYIVAELAKCQQKLGPSGYLSAFPIEWFDRLDARKPV 201
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALK----MATWMVEYFYNRVQKVITMYSVERHWYS 281
WAP+YTIHKI+AG+ D Y LA N QAL+ M+ W E+ T E H
Sbjct: 202 WAPFYTIHKIMAGMFDMYTLAGNQQALQVLEGMSNWADEW---------TASKSEAHMQD 252
Query: 282 -LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV 340
L E GGMN+VLY L ++T + + F K F LAL+ D L+ H NTHIP V
Sbjct: 253 ILRTEYGGMNEVLYNLAAVTGNDRWAKAGDRFTKKEFFNPLALRNDALTGLHVNTHIPQV 312
Query: 341 IGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW-DPKRLADTLGSE--N 397
IG+ RYE++ D + + +F V + SY T GTS E W P+ LA L
Sbjct: 313 IGAAARYEISSDMRFHDVADYFWYEVVTARSYVTEGTSNGEGWLTQPRMLAAELKRSVAT 372
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVL-SIQRGTEPGVMIYMLPLGRGV 456
E C +YNMLK++RHL+ W + AY DYYERAL N L +IQ T G Y L L G
Sbjct: 373 AECCCSYNMLKLTRHLYGWKPDPAYFDYYERALFNHRLGTIQPKT--GYTQYYLSLTPGA 430
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
K + T+ SFWCC G+G+E +SKL DSIY+ + GL + +I S +W+
Sbjct: 431 WKT-----FNTEDKSFWCCTGSGVEEYSKLNDSIYWHD---AEGLTVNLFIPSELNWEEK 482
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
L Q+ + TLT ++ + ++ LR+P WT S A +NG+ + +
Sbjct: 483 GFRLRQE----TKFPEQQSTTLTVTAAKSAPM--AMRLRIPAWTKS--AAVKINGRAVDV 534
Query: 577 -PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
P PG++L+ T W DK+ + LP+ L E + DD QA L+GP +LAG E
Sbjct: 535 TPTPGSYLTLTRPWKAGDKIEMTLPMHLSVEYMPDD----PKTQAFLYGPIVLAGDLGAE 590
>gi|302844990|ref|XP_002954034.1| hypothetical protein VOLCADRAFT_106211 [Volvox carteri f.
nagariensis]
gi|300260533|gb|EFJ44751.1| hypothetical protein VOLCADRAFT_106211 [Volvox carteri f.
nagariensis]
Length = 1160
Score = 328 bits (842), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 177/361 (49%), Positives = 225/361 (62%), Gaps = 21/361 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLL-MLDVDSLVWSFRKTASLPTPGKAY-GGWEN 156
++ +L DV L +S R ++ N +YLL MLD D L+WSFRKTA LPTPG+ Y WE+
Sbjct: 30 IEPFALSDVRLLDTSHQIRYERLNAKYLLEMLDPDRLLWSFRKTAGLPTPGQPYIASWED 89
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSAFPTEL 215
P ELRGHFVGHYLSA + +AST N +++ +V L + Q +G GYLSAFP+E
Sbjct: 90 PGCELRGHFVGHYLSALSLAYASTGNIAFHTRLALMVSELGKVQQALGLGGYLSAFPSEF 149
Query: 216 FDSFEALKPVWAPYYTI-----------HKILAGLLDQYVLADNAQALKMATWMVEYFYN 264
FD EALKPVWAPYYTI HKI+AGL+D Y L +AL MA+ MV Y +N
Sbjct: 150 FDRVEALKPVWAPYYTIPIAPFPDTTQIHKIIAGLVDAYELGGQKEALAMASRMVAYHWN 209
Query: 265 RVQKVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
R Q +I E HW LN E GGMN++LYR++ IT DP HL A LF+KP F+ +
Sbjct: 210 RTQALIASKGRE-HWNGVLNCEFGGMNEILYRMHRITKDPTHLEFARLFEKPFFMKPMVN 268
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
D L HANTH+ V G Y+ GD + F DIV HS+ATGG++ EFW
Sbjct: 269 NFDILESLHANTHLAQVAGFAEAYDTVGDEAARNATRNFFDIVTTHHSFATGGSNDHEFW 328
Query: 384 WDPKRLADTL-----GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
P R+AD++ E +ETCT YN+LK++R LFRWT +AYAD+YERAL NG+L
Sbjct: 329 QAPDRMADSVIKQKDAVETQETCTQYNILKIARSLFRWTGNVAYADFYERALLNGILGTA 388
Query: 439 R 439
R
Sbjct: 389 R 389
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/223 (32%), Positives = 118/223 (52%), Gaps = 34/223 (15%)
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE----EEGN- 497
PGV +Y+ PLG G SK+ + H WG ++SFWCCYGT +ES +KL DSIYF+ ++G
Sbjct: 486 PGVFLYLTPLGTGQSKSDNIHHWGFPYHSFWCCYGTVVESHAKLADSIYFKDMNPQQGGP 545
Query: 498 --------VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF---SSKQEV 546
P LYI Q + S W + + + D + + P + F S+
Sbjct: 546 SDPSAPKLPPRLYINQLVPSKVTWHELGLRITTEAD-MFAPGPAATAQIRFDPLSAAAAG 604
Query: 547 GQLS---SLNLRMPVWTYSNGAQAS----------LNGQ---NLP-LPPPGNFLSATERW 589
QLS +L +R+P W A + +NGQ + P P PG++ T +W
Sbjct: 605 SQLSAMFTLMVRVPEWAAREAASGTAGRGRGISIGVNGQSWTSCPGAPVPGSYCQVTRQW 664
Query: 590 SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
S D ++++LP+ + + ++RP+Y+ +QA++ GP+++AG T
Sbjct: 665 STGDVVSLRLPMRWWLKPLPENRPQYSGLQAVMMGPFVMAGIT 707
>gi|383316642|ref|YP_005377484.1| hypothetical protein [Frateuria aurantia DSM 6220]
gi|379043746|gb|AFC85802.1| hypothetical protein Fraau_1370 [Frateuria aurantia DSM 6220]
Length = 651
Score = 328 bits (840), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 202/544 (37%), Positives = 284/544 (52%), Gaps = 42/544 (7%)
Query: 103 SLHDVWLDQSSV----LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
SL LDQ ++ A N YL L VD L +F + A LP+ + GGWE+P
Sbjct: 58 SLQAFALDQVTLSPGPFAEAAAINARYLHQLPVDRLAHNFLRQAGLPSTAQPLGGWESPE 117
Query: 159 SELRGHFVG-HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
ELRGHF G H+LSA+A +WA+T + T+K++ +V L+ CQ GYLSAFP F+
Sbjct: 118 CELRGHFCGGHWLSAAALVWATTADRTLKQRADELVAILARCQRS--DGYLSAFPDSFFE 175
Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMAT----WMVEYFYNRVQKVITMY 273
+ VWAP+YT+HKIL G LD Y+ A N QAL +AT W V + R +
Sbjct: 176 RLSHGQKVWAPFYTLHKILCGHLDMYMHAGNQQALDIATGLGDWTVHWLNGRSDAQMNEI 235
Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
L E GGMND L LY+IT + ++L AH FD+ L LA D L H+
Sbjct: 236 --------LRTEYGGMNDALCELYAITGNGRYLDAAHRFDQASLLDPLAAHRDELKGLHS 287
Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD-PKRLADT 392
NT +P +IG+ RYE+TG+ Y+ + F + ++ + YA GG+S EFW + P L D
Sbjct: 288 NTQLPKIIGAARRYELTGEQRYRRMAEFGWETISGTRCYANGGSSNDEFWNNGPDDLHDQ 347
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
LG E C YN+LK++RH++ WT + DYYER L N L Q G+ +Y PL
Sbjct: 348 LGVAAAECCVAYNLLKLTRHVYGWTGDPRAFDYYERNLYNARLGTQ--DPAGMKLYYYPL 405
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
G K + + +SFWCC GTG E F++ DSIYF G LY+ YI+S
Sbjct: 406 APGSYKY-----FNSPLHSFWCCTGTGAEEFARFNDSIYFHTPGE---LYVNLYIASRLK 457
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
W + L+Q ++ LT ++ + NLR+P WT + Q +N Q
Sbjct: 458 WAEQGLTLSQLTRFPEQDVSDFKLQLTAPARLRI------NLRIPSWT-AGAPQLWINDQ 510
Query: 573 NLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
+ PG++LS W D L +QLP+ L+ + + D ++ A+L+GP LA
Sbjct: 511 LQNVSALPGSYLSIERMWHDKDHLRLQLPMQLKMQPLPGDDAQF----ALLYGPITLAAE 566
Query: 632 TSGE 635
G+
Sbjct: 567 LPGD 570
>gi|427385118|ref|ZP_18881623.1| hypothetical protein HMPREF9447_02656 [Bacteroides oleiciplenus YIT
12058]
gi|425727286|gb|EKU90146.1| hypothetical protein HMPREF9447_02656 [Bacteroides oleiciplenus YIT
12058]
Length = 629
Score = 325 bits (833), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 200/547 (36%), Positives = 287/547 (52%), Gaps = 30/547 (5%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
RA + + +L DV+ + +FR TA L T + GGWE+ ELRGH GH LSA + M
Sbjct: 60 RAMEVDQRWLKEADVNRFLHAFRVTAGLATGAQNLGGWESLDCELRGHTTGHLLSALSLM 119
Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
+AST + + K + +V L+ECQ +G GYLSAFP D + VWAP+YT+HK+
Sbjct: 120 YASTGDEQYRTKGAELVKGLAECQQTLGKNGYLSAFPEYFIDRAIKEEIVWAPFYTLHKV 179
Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYR 295
AGLLDQY L N QAL + T M ++ YN++ K +T ++ LN E GGM + Y
Sbjct: 180 YAGLLDQYTLCGNQQALDVLTGMCDWAYNKL-KPLTPTQLQG---MLNSEFGGMPETFYN 235
Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY 355
LY++T + +H LA +F L LA + D L+ H NT IP V+G YE+TG+P
Sbjct: 236 LYALTGNARHKELAEMFYHNSILDPLAARRDSLAGIHVNTQIPKVLGEARGYEMTGNPQS 295
Query: 356 KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
I FF + V H+Y TGG S +E + P L+D L ETC TYNMLK++RHLF
Sbjct: 296 ATIANFFWEAVVGDHTYVTGGNSDKEIFSKPGILSDQLSENTTETCNTYNMLKLTRHLFT 355
Query: 416 WTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCC 475
W A ADYYERAL N +LS Q E G + Y L G K + F CC
Sbjct: 356 WDASPARADYYERALYNHILSSQN-PETGGVTYYHTLHPGSCKK-----FHYPFRDNTCC 409
Query: 476 YGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR 535
GTG E+ +K G++IY+ + + GLY+ +I+S +WK + + Q+ + +
Sbjct: 410 VGTGYENHAKYGEAIYY-KTADQSGLYVNLFIASVLNWKEKDLTVRQETN----YPDEAS 464
Query: 536 MTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDK 594
+T ++ E G LR P W +G +NG+ + PG+++ W D
Sbjct: 465 TRITIAAAPEAGIQMPFMLRYPSWAV-DGVTIKVNGKKQHVKKAPGSYIHIDRTWRQGDV 523
Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWD--------IKTGTARSL 646
+T+++P+SL E + D + + AIL+GP +LA D G R +
Sbjct: 524 ITMEMPMSLHIEYMPDTKEK----GAILYGPIVLAAELGKTEDPAQNPAVPTLAGDFRKI 579
Query: 647 SALISPI 653
I P+
Sbjct: 580 EQCIKPV 586
>gi|262407449|ref|ZP_06083997.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|262354257|gb|EEZ03349.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
Length = 642
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV L+ SFR A + K GGWE+ ELRGH GH LSA A M+
Sbjct: 69 WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSAFP EL + K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QALK T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 189 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 244
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + K
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 304
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+ + L ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 364
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + L Q+ + P
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFP-------KEE 468
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S A+ +NG+ + + PG++++ T W ND++
Sbjct: 469 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 526
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ + EA P+ + A+L+GP +LAG E
Sbjct: 527 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 562
>gi|336404833|ref|ZP_08585521.1| hypothetical protein HMPREF0127_02834 [Bacteroides sp. 1_1_30]
gi|335940654|gb|EGN02520.1| hypothetical protein HMPREF0127_02834 [Bacteroides sp. 1_1_30]
Length = 640
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV L+ SFR A + K GGWE+ ELRGH GH LSA A M+
Sbjct: 67 WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 126
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSAFP EL + K VWAP+YT+HK+ +
Sbjct: 127 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 186
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QALK T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 187 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 242
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + K
Sbjct: 243 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 302
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+ + L ETC TYNMLK+SRHLF WT
Sbjct: 303 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 362
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 363 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 416
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + L Q+ + P
Sbjct: 417 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFPK-------EE 466
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S A+ +NG+ + + PG++++ T W ND++
Sbjct: 467 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 524
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ + EA P+ + A+L+GP +LAG E
Sbjct: 525 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 560
>gi|294810816|ref|ZP_06769462.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|294442004|gb|EFG10825.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 642
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV L+ SFR A + K GGWE+ ELRGH GH LSA A M+
Sbjct: 69 WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSAFP EL + K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QALK T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 189 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 244
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + K
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 304
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+ + L ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 364
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + L Q+ + P
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFP-------KEE 468
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S A+ +NG+ + + PG++++ T W ND++
Sbjct: 469 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 526
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ + EA P+ + A+L+GP +LAG E
Sbjct: 527 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 562
>gi|345512074|ref|ZP_08791613.1| hypothetical protein BSAG_00984 [Bacteroides sp. D1]
gi|229443482|gb|EEO49273.1| hypothetical protein BSAG_00984 [Bacteroides sp. D1]
Length = 640
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV L+ SFR A + K GGWE+ ELRGH GH LSA A M+
Sbjct: 67 WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 126
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSAFP EL + K VWAP+YT+HK+ +
Sbjct: 127 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 186
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QALK T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 187 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 242
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + K
Sbjct: 243 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 302
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+ + L ETC TYNMLK+SRHLF WT
Sbjct: 303 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 362
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 363 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 416
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + L Q+ + P
Sbjct: 417 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFP-------KEE 466
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S A+ +NG+ + + PG++++ T W ND++
Sbjct: 467 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 524
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ + EA P+ + A+L+GP +LAG E
Sbjct: 525 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 560
>gi|294646892|ref|ZP_06724513.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|292637837|gb|EFF56234.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
Length = 640
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV L+ SFR A + K GGWE+ ELRGH GH LSA A M+
Sbjct: 67 WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 126
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSAFP EL + K VWAP+YT+HK+ +
Sbjct: 127 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 186
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QALK T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 187 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 242
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + K
Sbjct: 243 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 302
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+ + L ETC TYNMLK+SRHLF WT
Sbjct: 303 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 362
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 363 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 416
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + L Q+ + P
Sbjct: 417 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFP-------KEE 466
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S A+ +NG+ + + PG++++ T W ND++
Sbjct: 467 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 524
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ + EA P+ + A+L+GP +LAG E
Sbjct: 525 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 560
>gi|423212948|ref|ZP_17199477.1| hypothetical protein HMPREF1074_01009 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694204|gb|EIY87432.1| hypothetical protein HMPREF1074_01009 [Bacteroides xylanisolvens
CL03T12C04]
Length = 642
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 198/519 (38%), Positives = 284/519 (54%), Gaps = 33/519 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV L+ SFR A + K GGWE+ ELRGH GH LSA A M+
Sbjct: 69 WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSAFP EL + K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QALK T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 189 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 244
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + K
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 304
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK + L ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKNFSKHLTGYTGETCCTYNMLKLSRHLFCWT 364
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
+G E+ +K G++IY+ N G+Y+ +I S WK V L Q+ + P T
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGVTLLQETE-----FPKEETT 470
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
L + + E +++ LR P W S A+ +NG+ + + PG++++ T W ND+++
Sbjct: 471 L-LTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRIS 527
Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
P+ + EA P+ + A+L+GP +LAG E
Sbjct: 528 ATYPMQIELEAT----PDNPNKVALLYGPLVLAGERGTE 562
>gi|298483785|ref|ZP_07001958.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
gi|298270079|gb|EFI11667.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
Length = 642
Score = 315 bits (807), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 197/520 (37%), Positives = 285/520 (54%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV+ L+ SFR A + K GGWE+ ELRGH GH LSA A M+
Sbjct: 69 WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSAFP EL + K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QALK T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 189 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 244
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + K
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 304
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+ + L ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 364
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + L Q+ P
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETGFP-------KEE 468
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S A+ +NG+ + + PG++++ T W ND++
Sbjct: 469 TTRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRI 526
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ + EA P+ + A+L+GP +LAG E
Sbjct: 527 SATYPMQIALEAT----PDNPNKVALLYGPLVLAGERGTE 562
>gi|330995449|ref|ZP_08319354.1| hypothetical protein HMPREF9442_00414 [Paraprevotella xylaniphila
YIT 11841]
gi|329575517|gb|EGG57055.1| hypothetical protein HMPREF9442_00414 [Paraprevotella xylaniphila
YIT 11841]
Length = 618
Score = 315 bits (807), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 204/588 (34%), Positives = 295/588 (50%), Gaps = 57/588 (9%)
Query: 85 KIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL 144
K+++P +L K+V L W+ Q L ++ YL ++ D L+ +FR TA L
Sbjct: 24 KVESPSVVELRPFSGKDVELEASWIKQREDL------DVAYLQSVEADRLLHNFRVTAGL 77
Query: 145 PTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG 204
P+ K GWE+P LRGHF GHYLSA + + + +++ +V L +CQ G
Sbjct: 78 PSLAKPLEGWESPGVGLRGHFTGHYLSALSVLAERYGDGWASQRLEYMVDELYKCQQAHG 137
Query: 205 TGYLSAFPTELFDSFEA-LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFY 263
GYLSAFP + F++ E VWAPYYT+HKIL GLLD Y N +A M + Y
Sbjct: 138 NGYLSAFPEKDFETLETRFTGVWAPYYTLHKILQGLLDAYTKTGNRKAYGMVEALAGYVE 197
Query: 264 NRVQKVITMYSVERHWYSL----NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
R+ K ++ +ER Y++ E G MN+ LY LY I+ +P+HL LA FD FL
Sbjct: 198 GRMAK-LSPERIERMMYTVEANPQNEAGAMNEALYELYGISGNPRHLALAACFDPAWFLE 256
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
L D L+ HANTHI +V G RYEVTG+ YK F DI+ H+Y G +S
Sbjct: 257 PLVRNEDILAGLHANTHIVLVNGFARRYEVTGEEKYKKAAMQFWDILQRGHAYVNGTSSG 316
Query: 380 ------------REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
E W +P L +TL E E+C T+N K+S +LF WT + YAD Y
Sbjct: 317 PRPVVTTRTSLTAEHWGEPGHLCNTLTREIAESCVTHNTQKLSAYLFGWTGDPCYADAYM 376
Query: 428 RALTNGVLSIQ-RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKL 486
NG L +Q R T G +Y LPLG +K K N F+CC G+ E+F+KL
Sbjct: 377 NTFYNGALPVQSRST--GAYVYHLPLGSPRNKKY------LKDNDFFCCSGSCAEAFAKL 428
Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK----VDPIVSWDPYLRMTLTFSS 542
IY+ ++ V ++ Y+ S W S V L Q + PI + +R ++F
Sbjct: 429 NSGIYYHDDSAV---FVNLYVPSELHWTSKKVELEQTGGFPLQPIADFTVSVRRPVSF-- 483
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPL 601
+LNL +P W + G +NG+ +P P +FL + RW+ D++ +
Sbjct: 484 --------TLNLFVPAW--AEGTVVYVNGEKQDMPVRPSSFLRISRRWADGDRVRMDFRY 533
Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSAL 649
+ R +++ P+ ++ A+ +GP LLA T E +K L L
Sbjct: 534 AFRLQSM----PDKENMFAVFYGPMLLAFETRSEVILKGSKDEVLQGL 577
>gi|160883345|ref|ZP_02064348.1| hypothetical protein BACOVA_01314 [Bacteroides ovatus ATCC 8483]
gi|156111329|gb|EDO13074.1| hypothetical protein BACOVA_01314 [Bacteroides ovatus ATCC 8483]
Length = 643
Score = 315 bits (806), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 194/520 (37%), Positives = 288/520 (55%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV+ L+ SFR A + K GGWE+ ELRGH GH LSA A ++
Sbjct: 69 WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALIY 128
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSAFP EL + K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QALK+ T M ++ YN+++ + + E + E GG+N+ Y LY
Sbjct: 189 GLIDQYLYADNLQALKVVTKMGDWAYNKLKPL----TEETRKLMIRNEFGGINESFYNLY 244
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + +
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 304
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+L+ L ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 364
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGAHKLYS-----TKENSFWCCVG 418
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + + Q+ + P
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 468
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S + +NG+ + + PG+++ T W D++
Sbjct: 469 TTRFTLRTENPVRTTIYLRYPSW--SKDVKVLVNGKKISVKQKPGSYIVITREWKDGDQI 526
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ ++ EA D+ P+ A A+L+GP +LAG E
Sbjct: 527 SATYPMQIKLEATPDN-PDKA---ALLYGPLVLAGERGTE 562
>gi|116625830|ref|YP_827986.1| hypothetical protein Acid_6783 [Candidatus Solibacter usitatus
Ellin6076]
gi|116228992|gb|ABJ87701.1| protein of unknown function DUF1680 [Candidatus Solibacter usitatus
Ellin6076]
Length = 675
Score = 314 bits (805), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 205/570 (35%), Positives = 290/570 (50%), Gaps = 59/570 (10%)
Query: 84 RKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTAS 143
RKI P P + V L S +Q+ N Y+ L D L+ +FR A
Sbjct: 55 RKIVTPRAEPFP--------MPQVRLLPGSAYHDSQEWNRGYMERLAADRLLHTFRANAG 106
Query: 144 LPT-PGKAYGGWENP-----ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLS 197
LP K GGWE P SELRGHF GH+LSASAQ+ ++ + + K +V ++
Sbjct: 107 LPVGSAKPLGGWEQPENGQRSSELRGHFAGHFLSASAQL-SANGDKNAQSKGDFMVAEMA 165
Query: 198 ECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALK---- 253
CQ K+G YLSAFPT +D + VWAP+YTIHKI+AG+ D Y LA N QAL+
Sbjct: 166 RCQQKLGGKYLSAFPTTWWDRLGKGERVWAPFYTIHKIMAGMFDMYSLAGNQQALEVLEG 225
Query: 254 MATWMVEYFYNR----VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLA 309
MA W E+ + +Q+++T+ E GG+ + LYRL + T + +
Sbjct: 226 MAAWADEWTAPKAAEHMQQILTI------------EFGGIAETLYRLAAATDQDRWGRVG 273
Query: 310 HLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNAS 369
F K FL LA + D L H NTHIP V+ + RY+++GD + + +F V +
Sbjct: 274 DRFQKKSFLNPLAARRDELRGLHVNTHIPQVMAAARRYDLSGDMRFHDVADYFFSEVAGA 333
Query: 370 HSYATGGTSAREFWWDPKRLADT---LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYY 426
+Y TGGTS E W P R T L E C YNMLK++RHL+ W + +Y DYY
Sbjct: 334 RTYVTGGTSNAEAWLAPPRRLATELKLSVNTAECCCAYNMLKLARHLYSWDPKPSYFDYY 393
Query: 427 ERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKL 486
E L N + R + G+ Y L L G K + T+ +FWCC G+G+E +SKL
Sbjct: 394 EHLLLNHRIGTIR-PKVGLTQYYLSLTPGAWKT-----FNTEDQTFWCCTGSGVEEYSKL 447
Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV 546
DSIY+ + GLY+ +ISS DW L Q S P +T+T + ++
Sbjct: 448 NDSIYWRDG---EGLYVNLFISSELDWAERGFKLRQATQYPAS--PSTALTVTAARAGDL 502
Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRT 605
++ LR+P W S LNG+ L PG++L W D++ ++LP+ L
Sbjct: 503 ----AIRLRIPGWLQS-APSVKLNGKALDASAAPGSYLVLKRNWKVGDRIDMELPMRLHV 557
Query: 606 EAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+A+ DD ++QA L+GP +LAG GE
Sbjct: 558 QAMPDD----PAMQAFLYGPLVLAGDLGGE 583
>gi|383123868|ref|ZP_09944538.1| hypothetical protein BSIG_4114 [Bacteroides sp. 1_1_6]
gi|251838901|gb|EES66986.1| hypothetical protein BSIG_4114 [Bacteroides sp. 1_1_6]
Length = 641
Score = 314 bits (805), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 195/519 (37%), Positives = 285/519 (54%), Gaps = 33/519 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ LDV+ L+ SFR A + K GGWE+ ELRGH GH LSA A M+
Sbjct: 69 WMTSLDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSA+P EL + K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYS 188
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QAL + T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 189 GLIDQYLYADNQQALSVVTKMGDWAYNKLKPL----SEETRRLMIRNEFGGINESFYNLY 244
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + K
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKK 304
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+ + L ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWT 364
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
+G E+ +K G++IY+ N G+Y+ +I S WK + L Q+ D + R+T
Sbjct: 419 SGFENHAKYGEAIYYH---NDKGIYVNLFIPSQVTWKEKGLTLLQETD--FPKEETTRLT 473
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
L + E + +++ LR P W S + +NG+ + + PG++++ T W D++
Sbjct: 474 L----RAEKPRHTTIYLRYPSW--SKNVKVLVNGKKVSVKQKPGSYIAITREWKDGDRIA 527
Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
P+ + EA P+ + A+L+GP +LAG E
Sbjct: 528 ATYPMQIELEAT----PDNPNKVALLYGPLVLAGERGTE 562
>gi|29345547|ref|NP_809050.1| hypothetical protein BT_0137 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337439|gb|AAO75244.1| Acetyl-CoA carboxylase-like protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 641
Score = 314 bits (805), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 195/519 (37%), Positives = 285/519 (54%), Gaps = 33/519 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ LDV+ L+ SFR A + K GGWE+ ELRGH GH LSA A M+
Sbjct: 69 WMTSLDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSA+P EL + K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYS 188
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QAL + T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 189 GLIDQYLYADNQQALSVVTKMGDWAYNKLKPL----SEETRRLMIRNEFGGINESFYNLY 244
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + K
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKK 304
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+ + L ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWT 364
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
+G E+ +K G++IY+ N G+Y+ +I S WK + L Q+ D + R+T
Sbjct: 419 SGFENHAKYGEAIYYH---NDKGIYVNLFIPSQVTWKEKGLTLLQETD--FPKEETTRLT 473
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
L + E + +++ LR P W S + +NG+ + + PG++++ T W D++
Sbjct: 474 L----RAEKPRHTTIYLRYPSW--SKNVKVLVNGKKVSVKQKPGSYIAITREWKDGDRIA 527
Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
P+ + EA P+ + A+L+GP +LAG E
Sbjct: 528 ATYPMQIELEAT----PDNPNKVALLYGPLVLAGERGTE 562
>gi|298384470|ref|ZP_06994030.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
gi|298262749|gb|EFI05613.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
Length = 641
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 195/519 (37%), Positives = 285/519 (54%), Gaps = 33/519 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ LDV+ L+ SFR A + K GGWE+ ELRGH GH LSA A M+
Sbjct: 69 WMTSLDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSA+P EL + K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYS 188
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QAL + T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 189 GLIDQYLYADNQQALSVVTKMGDWAYNKLKPL----SEETRRLMIRNEFGGINESFYNLY 244
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + K
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKK 304
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+ + L ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWT 364
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
+G E+ +K G++IY+ N G+Y+ +I S WK + L Q+ D + R+T
Sbjct: 419 SGFENHAKYGEAIYYH---NDKGIYVNLFIPSQVTWKEKGLTLLQETD--FPKEETTRLT 473
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
L + E + +++ LR P W S + +NG+ + + PG++++ T W D++
Sbjct: 474 L----RAEKPRHTTIYLRYPSW--SKNVKVLVNGKKVSVKQKPGSYIAITREWKDGDRIA 527
Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
P+ + EA P+ + A+L+GP +LAG E
Sbjct: 528 ATYPMQIELEAT----PDNPNKVALLYGPLVLAGERGTE 562
>gi|423287825|ref|ZP_17266676.1| hypothetical protein HMPREF1069_01719 [Bacteroides ovatus
CL02T12C04]
gi|392671840|gb|EIY65311.1| hypothetical protein HMPREF1069_01719 [Bacteroides ovatus
CL02T12C04]
Length = 643
Score = 313 bits (802), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 194/520 (37%), Positives = 287/520 (55%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV+ L+ SFR A + K GGWE+ ELRGH GH LSA A ++
Sbjct: 69 WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALIY 128
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSAFP EL + K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QALK+ T M ++ YN+++ + + E + E GG+N+ Y LY
Sbjct: 189 GLIDQYLYADNLQALKVVTKMGDWAYNKLKSL----TEETRKLMIRNEFGGINESFYNLY 244
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + +
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARSYELTRNETSRK 304
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+L+ L ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 364
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 418
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + + Q+ + P
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 468
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S + +NG+ + + PG+++ T W D++
Sbjct: 469 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVLVNGKKISVKQKPGSYIVITREWKDGDQI 526
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ ++ EA D+ P A A+L+GP +LAG E
Sbjct: 527 SATYPMQIKLEATPDN-PNKA---ALLYGPLVLAGERGTE 562
>gi|329957171|ref|ZP_08297738.1| hypothetical protein HMPREF9445_02614 [Bacteroides clarus YIT
12056]
gi|328523439|gb|EGF50538.1| hypothetical protein HMPREF9445_02614 [Bacteroides clarus YIT
12056]
Length = 694
Score = 313 bits (801), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 190/519 (36%), Positives = 281/519 (54%), Gaps = 33/519 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASL-------PTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV+ L+ SFR A + K YGGWE+ ELRGH GH LSA M+
Sbjct: 121 WMTSIDVNRLIHSFRTNAGIWAGREGGYVTVKKYGGWESLDCELRGHTTGHLLSAYGLMY 180
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L + Q+ +G GYLSAFP EL + + VWAP+YT+HK+ +
Sbjct: 181 AATGSEIFKLKGDSIVTELGKVQDALGNGYLSAFPEELINRNIKGQSVWAPWYTLHKLFS 240
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADNAQAL + T M ++ Y++++ + S E + E GG+N+ Y LY
Sbjct: 241 GLIDQYLYADNAQALAVVTKMGDWAYDKLKPL----SEETRRRMIRNEFGGINESFYNLY 296
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
++T D ++ LAH F + L Q D L H NT IP V+ YE+TGD K
Sbjct: 297 AVTGDERYRWLAHFFYHNDVIDPLKEQNDDLGTKHTNTFIPKVLAEARNYELTGDKDSKA 356
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++D KR + L ETC TYNMLK+SRHLF W
Sbjct: 357 LSDFFWHTMIDHHTFAPGCSSQKEHYFDTKRFSHFLNGYTGETCCTYNMLKLSRHLFCWQ 416
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ ADYYERAL N +L Q+ + G++ Y LPL G K S TK NSFWCC G
Sbjct: 417 PDARIADYYERALYNHILG-QQDPQTGMVCYFLPLLSGAHKVYS-----TKENSFWCCVG 470
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
+G E+ +K G+ IY+ + G+YI +I S WK + L Q+ ++
Sbjct: 471 SGFENHAKYGEGIYYR---SAAGIYINLFIPSVVRWKEKGITLKQE----TAFPAGEATV 523
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
LT + + V +++ LR P W S +NG+ + + PG++++ W D++
Sbjct: 524 LTVEADRPV--RTTVYLRYPSW--SEKVTVRVNGKKVQVKRKPGSYIALNRLWQNGDRIE 579
Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
P+ + E D+ P+ A+L+GP +LAG E
Sbjct: 580 AAYPMRVHLETTPDN-PQKG---ALLYGPLVLAGERGTE 614
>gi|270296104|ref|ZP_06202304.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|423303646|ref|ZP_17281645.1| hypothetical protein HMPREF1072_00585 [Bacteroides uniformis
CL03T00C23]
gi|423307631|ref|ZP_17285621.1| hypothetical protein HMPREF1073_00371 [Bacteroides uniformis
CL03T12C37]
gi|270273508|gb|EFA19370.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|392688010|gb|EIY81301.1| hypothetical protein HMPREF1072_00585 [Bacteroides uniformis
CL03T00C23]
gi|392689500|gb|EIY82777.1| hypothetical protein HMPREF1073_00371 [Bacteroides uniformis
CL03T12C37]
Length = 641
Score = 313 bits (801), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 188/488 (38%), Positives = 277/488 (56%), Gaps = 26/488 (5%)
Query: 149 KAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL 208
K GGWE+ ELRGH GH LSA A M+AST + K K ++V L+E Q +G GYL
Sbjct: 99 KKLGGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYL 158
Query: 209 SAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
SA+P EL + VWAP+YT+HK+ +GL+DQY+ ADN AL++ T M ++ YN++ K
Sbjct: 159 SAYPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYADNKPALEVVTRMGDWAYNKL-K 217
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + +R + E GG+N+ Y LY+IT D ++ LA F + L Q D L
Sbjct: 218 PLDEATRKR---MIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDL 274
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
H NT IP V+ YE+T D + + FF + H++A G +S +E ++DP++
Sbjct: 275 GTKHTNTFIPKVLAEARNYELTQDNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQ 334
Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
L+ L ETC TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y
Sbjct: 335 LSKHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSY 393
Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
LPL G K S T+ NSFWCC G+G ES +K G++IY E G+Y+ +I
Sbjct: 394 FLPLLSGSHKVYS-----TRENSFWCCVGSGFESHAKYGEAIYCHNE---KGIYVNLFIP 445
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
S +WK+ + L Q+ + TLT + + V +++ LR P W S G + +
Sbjct: 446 SEVNWKAKGITLRQE----TGFPAEENTTLTIQTDKPV--TTTIYLRYPSW--SEGVKVN 497
Query: 569 LNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
+NG+ + + PG++++ T +W D++ P+SL+ E D+ P+ A+L+GP +
Sbjct: 498 VNGKKVSVKQKPGSYIAVTRQWKDGDRIEANYPMSLQLETTSDN-PQKG---ALLYGPLV 553
Query: 628 LAGHTSGE 635
LAG E
Sbjct: 554 LAGELGTE 561
>gi|423313782|ref|ZP_17291717.1| hypothetical protein HMPREF1058_02329 [Bacteroides vulgatus
CL09T03C04]
gi|392684317|gb|EIY77645.1| hypothetical protein HMPREF1058_02329 [Bacteroides vulgatus
CL09T03C04]
Length = 640
Score = 312 bits (800), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 187/519 (36%), Positives = 284/519 (54%), Gaps = 33/519 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ ++VD L+ SFR A + K GGWE+ ELRGH GH LSA M+
Sbjct: 67 WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYGLMY 126
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K+K ++V L+E Q +G GYLSA+P EL + VWAP+YT+HK+ +
Sbjct: 127 AATGSEQFKQKGDSLVNGLAEVQTALGNGYLSAYPEELINRNICGTSVWAPWYTLHKLFS 186
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ +DN +AL++ M ++ Y++++ + + + E GG+N+ Y LY
Sbjct: 187 GLIDQYLYSDNQKALEVVVRMADWAYHKLKPLDETTRQK----MIRNEFGGVNESFYNLY 242
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D +H LA F + L D L H NT IP VI YE+T D +
Sbjct: 243 AITGDERHRWLAQFFYHNEVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTEDENSRK 302
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DP R + + ETC TYNMLK+SRHLF WT
Sbjct: 303 LSDFFWHTMIDHHTFAPGCSSDKEHYFDPARFSKHVSGYTGETCCTYNMLKLSRHLFCWT 362
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ A ADYYERAL N +L Q+ + G++ Y LPL G K S TK NSFWCC G
Sbjct: 363 ADAAIADYYERALYNHILG-QQDPQTGMVTYFLPLLSGSHKVYS-----TKENSFWCCVG 416
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
+G E+ +K G++IY+ N G+Y+ +I S +W+ + L Q+ D +
Sbjct: 417 SGFENHAKYGEAIYYH---NDKGIYVNLFIPSVVNWRKKGLTLRQETD----FPAEETTV 469
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
LT ++ V +++ LR P W S G + +NG+ + + PG++++ T W D++T
Sbjct: 470 LTIRAQNPVE--TTVYLRYPSW--SKGVKVFVNGKKIAVKQKPGSYIAITRLWKDGDRIT 525
Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
P+ LR E D+ P+ A+++GP +LAG E
Sbjct: 526 ADYPMCLRVETTPDN-PQKG---ALVYGPVVLAGKRGTE 560
>gi|299146414|ref|ZP_07039482.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
gi|298516905|gb|EFI40786.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
Length = 642
Score = 311 bits (798), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 196/520 (37%), Positives = 283/520 (54%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV L+ SFR A + K GGWE+ ELRGH GH LSA A M+
Sbjct: 69 WMTSIDVSRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN + GYLSAFP EL + K VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN QALK T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 189 GLIDQYLYADNQQALKTVTKMGDWAYNKLKPL----SEETRKLMIRNEFGGVNESFYNLY 244
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D ++ LA F + L D L H NT IP VI YE+T + K
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 304
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+ + L ETC TYNMLK+SRHLF WT
Sbjct: 305 LSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWT 364
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 365 GDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVG 418
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + L Q+ + P
Sbjct: 419 SGFENHAKYGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTLLQETEFP-------KEE 468
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F + E +++ LR P W S A+ +NG+ + + G++++ T W ND++
Sbjct: 469 TTRFIIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKSGSYIAITRDWKDNDRI 526
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ + EA P+ + A+L+GP +LAG E
Sbjct: 527 SATYPMQIELEAT----PDNPNKVALLYGPLVLAGERGTE 562
>gi|255692201|ref|ZP_05415876.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides
finegoldii DSM 17565]
gi|260622065|gb|EEX44936.1| hypothetical protein BACFIN_07304 [Bacteroides finegoldii DSM
17565]
Length = 644
Score = 311 bits (798), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 193/520 (37%), Positives = 286/520 (55%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV+ L+ SFR A + K GGWE+ ELRGH GH LSA M+
Sbjct: 70 WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L E QN + GYLSA+P EL + K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN +AL + T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 190 GLIDQYLYADNKKALIIVTRMGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
SIT D ++ LA F + L D L H NT IP VI YE+T + +
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+L+ L ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + + Q+ + P
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S + +NG+ + + PG++++ T W +D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVLVNGKKISVKQKPGSYIAITREWKDDDQI 527
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ ++ EA D+ P A A+L+GP +LAG E
Sbjct: 528 SATYPMQIKLEATPDN-PNKA---ALLYGPLVLAGERGTE 563
>gi|319643216|ref|ZP_07997844.1| hypothetical protein HMPREF9011_03445 [Bacteroides sp. 3_1_40A]
gi|345520493|ref|ZP_08799881.1| hypothetical protein BSFG_01473 [Bacteroides sp. 4_3_47FAA]
gi|254835017|gb|EET15326.1| hypothetical protein BSFG_01473 [Bacteroides sp. 4_3_47FAA]
gi|317385120|gb|EFV66071.1| hypothetical protein HMPREF9011_03445 [Bacteroides sp. 3_1_40A]
Length = 640
Score = 311 bits (797), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 187/519 (36%), Positives = 284/519 (54%), Gaps = 33/519 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ ++VD L+ SFR A + K GGWE+ ELRGH GH LSA M+
Sbjct: 67 WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYGLMY 126
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K+K ++V L+E Q +G GYLSA+P EL + VWAP+YT+HK+ +
Sbjct: 127 AATGSEQFKQKGDSLVNGLAEVQTALGNGYLSAYPEELINRNIRGTSVWAPWYTLHKLFS 186
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ +DN +AL++ M ++ Y++++ + + + E GG+N+ Y LY
Sbjct: 187 GLIDQYLYSDNQKALEVVVRMADWAYHKLKPLDETTRQK----MIRNEFGGVNESFYNLY 242
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+IT D +H LA F + L D L H NT IP VI YE+T D +
Sbjct: 243 AITGDERHRWLAQFFYHNEVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTEDENSRK 302
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DP R + + ETC TYNMLK+SRHLF WT
Sbjct: 303 LSDFFWHTMIDHHTFAPGCSSDKEHYFDPARFSKHVSGYTGETCCTYNMLKLSRHLFCWT 362
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ A ADYYERAL N +L Q+ + G++ Y LPL G K S TK NSFWCC G
Sbjct: 363 ADAAIADYYERALYNHILG-QQDPQTGMVTYFLPLLSGSHKVYS-----TKENSFWCCVG 416
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
+G E+ +K G++IY+ N G+Y+ +I S +W+ + L Q+ D +
Sbjct: 417 SGFENHAKYGEAIYYH---NDKGIYVNLFIPSVVNWREKGLTLRQETD----FPAEETTV 469
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLT 596
LT ++ V +++ LR P W S G + +NG+ + + PG++++ T W D++T
Sbjct: 470 LTIRAQNPVE--TTVYLRYPSW--SKGVKVFVNGKKIAVKQKPGSYIAITRLWKDGDRIT 525
Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
P+ LR E D+ P+ A+++GP +LAG E
Sbjct: 526 ADYPMCLRVETTPDN-PQKG---ALVYGPVVLAGKRGTE 560
>gi|424790951|ref|ZP_18217449.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
pv. graminis ART-Xtg29]
gi|422797791|gb|EKU25992.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
pv. graminis ART-Xtg29]
Length = 651
Score = 311 bits (797), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 183/525 (34%), Positives = 278/525 (52%), Gaps = 45/525 (8%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG-HYLSASAQ 175
+A+ + YL+ + D L+ +FR A L + + GGWE+P E+RGHF G HYLSA A
Sbjct: 74 QARDRDRRYLMSIPNDRLLHTFRLVAGLDSQAEPLGGWESPHCEIRGHFAGGHYLSACAL 133
Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
++A+T +A +K+K +V L+ CQ GY+ A+P+ +D + VW P YT HKI
Sbjct: 134 LYAATGDAALKDKADALVAELARCQR--ADGYIGAYPSSFYDRLGRHEEVWVPIYTAHKI 191
Query: 236 LAGLLDQYVLADNAQALKMA--------TWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
LAG LD A NAQAL+ A WM + + Q+++ + E G
Sbjct: 192 LAGHLDMARHAGNAQALRTAQRFADWLGAWMDGFDDAQWQRILGV------------EFG 239
Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
G++ L LY ++ D K+ A +++ L LA Q D L+ HANT IP ++ + Y
Sbjct: 240 GVHASLLELYLLSGDAKYQRWATRYEQASLLEPLAQQRDALAGLHANTQIPKIVAAARAY 299
Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNML 407
E+ G P + I FF V+ H+Y TGG S E + P A L + E C +YNML
Sbjct: 300 EIDGAPRQRQIAEFFWRTVSGHHAYCTGGVSDYEMFGKPDHFAGHLSGHSHECCCSYNML 359
Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
K++RHL+ W + A DYYER L N L Q E G+M+Y +P+ G K + T
Sbjct: 360 KLTRHLYTWQPDAALMDYYERVLFNARLGTQ--DEAGMMMYFVPMDAGYWKL-----YNT 412
Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI 527
F SFWCC GTG+E F+K DSIYF ++ GL + +I+S DW + + Q+
Sbjct: 413 PFASFWCCTGTGVEEFAKSNDSIYFRDDA---GLTVNLFIASQLDWAERGLRVVQR---- 465
Query: 528 VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSAT 586
+ L F K+ Q +L LR+P W + G + +NG+ + PG++L+
Sbjct: 466 TRFPQQEGTALEFQCKRP--QQMTLRLRIPYWA-TQGVRLRINGKAQAVKATPGSYLALE 522
Query: 587 ERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
R++ D++ + LP++L + P+ S+QA+++GP +LA
Sbjct: 523 RRFADGDRIELDLPMALHAAPL----PDEPSLQAMMYGPLVLAAQ 563
>gi|423222645|ref|ZP_17209115.1| hypothetical protein HMPREF1062_01301 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392641932|gb|EIY35705.1| hypothetical protein HMPREF1062_01301 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 641
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 185/488 (37%), Positives = 278/488 (56%), Gaps = 26/488 (5%)
Query: 149 KAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL 208
K GGWE+ ELRGH GH LSA A M+AST + K K ++V L+E Q +G GYL
Sbjct: 99 KKLGGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYL 158
Query: 209 SAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
SA+P EL + VWAP+YT+HK+ +GL+DQY+ DN QAL++ T M ++ YN++ K
Sbjct: 159 SAYPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALEVVTRMGDWAYNKL-K 217
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + +R + E GG+N+ Y LY+IT D ++ LA F + L Q D L
Sbjct: 218 PLDEPTRKR---MIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDL 274
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
H NT IP V+ YE+T D + + FF + H++A G +S +E ++DP++
Sbjct: 275 GTKHTNTFIPKVLAEARNYELTQDNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQ 334
Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
L+ L ETC TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y
Sbjct: 335 LSKHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSY 393
Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
LPL G K S T+ NSFWCC G+G E+ +K G++IY+ N G+Y+ +I
Sbjct: 394 FLPLLSGSHKVYS-----TRENSFWCCVGSGFENHAKYGEAIYYH---NDQGIYVNLFIP 445
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
S +WK+ + L Q+ ++ LT + + V +++ LR P W S + +
Sbjct: 446 SEVNWKAKRITLRQE----TAFPAAENTALTIQTDKPV--TTTIYLRYPSW--SKNVKVN 497
Query: 569 LNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
+NG+ + + PG++++ T +W D++ P+SL+ E D+ P+ A+L+GP +
Sbjct: 498 VNGKKVSVKQKPGSYIAVTRQWKDGDRIEANYPMSLQLETTPDN-PQKG---ALLYGPLV 553
Query: 628 LAGHTSGE 635
LAG + E
Sbjct: 554 LAGESGTE 561
>gi|224539132|ref|ZP_03679671.1| hypothetical protein BACCELL_04034 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519254|gb|EEF88359.1| hypothetical protein BACCELL_04034 [Bacteroides cellulosilyticus
DSM 14838]
Length = 641
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 186/488 (38%), Positives = 279/488 (57%), Gaps = 26/488 (5%)
Query: 149 KAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL 208
K GGWE+ ELRGH GH LSA A M+AST + K K ++V L+E Q +G GYL
Sbjct: 99 KKLGGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYL 158
Query: 209 SAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
SA+P EL + VWAP+YT+HK+ +GL+DQY+ DN QAL++ T M ++ YN++ K
Sbjct: 159 SAYPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALEVVTRMGDWAYNKL-K 217
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + +R + E GG+N+ Y LY+IT D ++ LA F + L Q D L
Sbjct: 218 PLDEPTRKR---MIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDL 274
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
H NT IP V+ YE+T D + + FF + H++A G +S +E ++DP++
Sbjct: 275 GTKHTNTFIPKVLAEARNYELTQDNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQ 334
Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
L+ L ETC TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y
Sbjct: 335 LSKHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSY 393
Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
LPL G K S T+ NSFWCC G+G E+ +K G++IY+ N G+Y+ +I
Sbjct: 394 FLPLLSGSHKVYS-----TRENSFWCCVGSGFENHAKYGEAIYYH---NDQGIYVNLFIP 445
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
S +WK+ + L+Q+ V + LT + + V +++ LR P W S + +
Sbjct: 446 SEVNWKAKGITLHQETAFPVEEN----TALTIQTDKPV--TTTIYLRYPSW--SKNVKVN 497
Query: 569 LNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
+NG+ + + PG++++ T +W D++ P+SL+ E D+ P+ A+L+GP +
Sbjct: 498 VNGKKVSVKQKPGSYIAVTRQWKDGDRIEANYPMSLQLETTPDN-PQKG---ALLYGPLV 553
Query: 628 LAGHTSGE 635
LAG + E
Sbjct: 554 LAGESGTE 561
>gi|345512540|ref|ZP_08792066.1| hypothetical protein BSEG_01611 [Bacteroides dorei 5_1_36/D4]
gi|423229086|ref|ZP_17215491.1| hypothetical protein HMPREF1063_01311 [Bacteroides dorei
CL02T00C15]
gi|423244926|ref|ZP_17226000.1| hypothetical protein HMPREF1064_02206 [Bacteroides dorei
CL02T12C06]
gi|345456387|gb|EEO45470.2| hypothetical protein BSEG_01611 [Bacteroides dorei 5_1_36/D4]
gi|392634839|gb|EIY28751.1| hypothetical protein HMPREF1063_01311 [Bacteroides dorei
CL02T00C15]
gi|392640967|gb|EIY34758.1| hypothetical protein HMPREF1064_02206 [Bacteroides dorei
CL02T12C06]
Length = 646
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 192/545 (35%), Positives = 294/545 (53%), Gaps = 34/545 (6%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
+K L DV L S + ++ ++ ++VD L+ SFR A + K
Sbjct: 48 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 106
Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
GGWE+ ELRGH GH LSA M+A+T + + K ++V L+E QN +G GYLSA+
Sbjct: 107 GGWESLDCELRGHTTGHLLSAYGLMYAATGSKLFRHKGDSLVSGLAEVQNALGNGYLSAY 166
Query: 212 PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
P EL + VWAP+YT+HK+ +GL+DQY+ +DN +AL++ M ++ Y++++ +
Sbjct: 167 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVIRMADWAYHKLKPLDE 226
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
+ + E GG+N+ Y LY+IT D +H LA F + L D L
Sbjct: 227 TTRQK----MIRNEFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDLGTK 282
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
H NT IP VI YE+T D + + FF + H++A G +S +E ++DP R +
Sbjct: 283 HTNTFIPKVIAEARNYELTEDENSRKLSDFFWHTMIDHHTFAPGCSSDKEHYFDPARFSK 342
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
+ ETC TYNMLK+SRHLF WT + A ADYYERAL N +L Q+ + G++ Y LP
Sbjct: 343 HVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTYFLP 401
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
L G K S TK NSFWCC G+G E+ +K G++IY+ N G+Y+ +I S
Sbjct: 402 LLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYH---NDKGIYVNLFIPSVV 453
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
+W+ + L Q+ D + LT ++ V +++ LR P W S + ++NG
Sbjct: 454 NWQEKGLTLRQETD----FPAEETTVLTIGTQSPVE--TTVYLRYPSW--SKEVKVAVNG 505
Query: 572 QNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ + + PG++++ T W D++T P+ LR E D+ P+ A+++GP +LAG
Sbjct: 506 KKVAVKQKPGSYIAITRLWKDGDRITADYPMRLRVETTPDN-PQKG---ALVYGPVVLAG 561
Query: 631 HTSGE 635
E
Sbjct: 562 ERGTE 566
>gi|265752243|ref|ZP_06088036.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263237035|gb|EEZ22505.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 640
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 192/545 (35%), Positives = 294/545 (53%), Gaps = 34/545 (6%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
+K L DV L S + ++ ++ ++VD L+ SFR A + K
Sbjct: 42 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 100
Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
GGWE+ ELRGH GH LSA M+A+T + + K ++V L+E QN +G GYLSA+
Sbjct: 101 GGWESLDCELRGHTTGHLLSAYGLMYAATGSKLFRHKGDSLVSGLAEVQNALGNGYLSAY 160
Query: 212 PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
P EL + VWAP+YT+HK+ +GL+DQY+ +DN +AL++ M ++ Y++++ +
Sbjct: 161 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVIRMADWAYHKLKPLDE 220
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
+ + E GG+N+ Y LY+IT D +H LA F + L D L
Sbjct: 221 TTRQK----MIRNEFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDLGTK 276
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
H NT IP VI YE+T D + + FF + H++A G +S +E ++DP R +
Sbjct: 277 HTNTFIPKVIAEARNYELTEDENSRKLSDFFWHTMIDHHTFAPGCSSDKEHYFDPARFSK 336
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
+ ETC TYNMLK+SRHLF WT + A ADYYERAL N +L Q+ + G++ Y LP
Sbjct: 337 HVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTYFLP 395
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
L G K S TK NSFWCC G+G E+ +K G++IY+ N G+Y+ +I S
Sbjct: 396 LLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYH---NDKGIYVNLFIPSVV 447
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
+W+ + L Q+ D + LT ++ V +++ LR P W S + ++NG
Sbjct: 448 NWQEKGLTLRQETD----FPAEETTVLTIGTQSPVE--TTVYLRYPSW--SKEVKVAVNG 499
Query: 572 QNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ + + PG++++ T W D++T P+ LR E D+ P+ A+++GP +LAG
Sbjct: 500 KKVAVKQKPGSYIAITRLWKDGDRITADYPMRLRVETTPDN-PQKG---ALVYGPVVLAG 555
Query: 631 HTSGE 635
E
Sbjct: 556 ERGTE 560
>gi|383115004|ref|ZP_09935763.1| hypothetical protein BSGG_0819 [Bacteroides sp. D2]
gi|313693284|gb|EFS30119.1| hypothetical protein BSGG_0819 [Bacteroides sp. D2]
Length = 643
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 192/520 (36%), Positives = 285/520 (54%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV+ L+ SFR A + K GGWE+ ELRGH GH LSA M+
Sbjct: 70 WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L E QN + GYLSA+P EL + K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN +AL + T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 190 GLIDQYLYADNKKALTIVTRMGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
SIT D ++ LA F + L D L H NT IP VI YE+T + +
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+L+ L ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + + Q+ + P
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S + S+NG+ + + G++++ T W D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVSVNGKKISVKQKSGSYIAITREWKDGDQI 527
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ ++ E D+ P+ A A+L+GP +LAG E
Sbjct: 528 SATYPMQIKLETTPDN-PDKA---ALLYGPLVLAGERGTE 563
>gi|237722400|ref|ZP_04552881.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448210|gb|EEO54001.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
Length = 644
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 192/520 (36%), Positives = 285/520 (54%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV+ L+ SFR A + K GGWE+ ELRGH GH LSA M+
Sbjct: 70 WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L E QN + GYLSA+P EL + K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN +AL + T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 190 GLIDQYLYADNKKALTIVTRMGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
SIT D ++ LA F + L D L H NT IP VI YE+T + +
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+L+ L ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + + Q+ + P
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S + S+NG+ + + G++++ T W D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVSVNGKKISVKQKSGSYIAITREWKDGDQI 527
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ ++ E D+ P+ A A+L+GP +LAG E
Sbjct: 528 SATYPMQIKLETTPDN-PDKA---ALLYGPLVLAGERGTE 563
>gi|150002728|ref|YP_001297472.1| hypothetical protein BVU_0120 [Bacteroides vulgatus ATCC 8482]
gi|294776982|ref|ZP_06742443.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|149931152|gb|ABR37850.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
gi|294449230|gb|EFG17769.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 640
Score = 310 bits (793), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 192/545 (35%), Positives = 294/545 (53%), Gaps = 34/545 (6%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
+K L DV L S + ++ ++ ++V+ L+ SFR A + K
Sbjct: 42 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVNRLLHSFRTNAGVFAGREGGYMTVKKL 100
Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
GGWE+ ELRGH GH LSA M+A+T + K+K ++V L+E Q +G GYLSA+
Sbjct: 101 GGWESLDCELRGHTTGHLLSAYGLMYAATGSEQFKQKGDSLVNGLAEVQTALGNGYLSAY 160
Query: 212 PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
P EL + VWAP+YT+HK+ +GL+DQY+ +DN +AL++ M ++ Y++++ +
Sbjct: 161 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVIRMADWAYHKLKPLDE 220
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
+ + E GG+N+ Y LY+IT D +H LA F + L D L
Sbjct: 221 TTRQK----MIRNEFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDLGTK 276
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
H NT IP VI YE+T D + + FF + H++A G +S +E ++DP R +
Sbjct: 277 HTNTFIPKVIAEARNYELTEDENSRKLSDFFWHTMIDHHTFAPGCSSDKEHYFDPARFSK 336
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
+ ETC TYNMLK+SRHLF WT + A ADYYERAL N +L Q+ + G++ Y LP
Sbjct: 337 HVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTYFLP 395
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
L G K S TK NSFWCC G+G E+ +K G++IY+ N G+Y+ +I S
Sbjct: 396 LLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYH---NDKGIYVNLFIPSVV 447
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
+W+ + L Q+ D + LT ++ V +++ LR P W S G + +NG
Sbjct: 448 NWREKGLTLRQETD----FPAEETTVLTIRAQNPVE--TTVYLRYPSW--SKGVKVFVNG 499
Query: 572 QNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ + + PG++++ T W D++T P+ LR E D+ P+ A+++GP +LAG
Sbjct: 500 KKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQKG---ALVYGPVVLAG 555
Query: 631 HTSGE 635
E
Sbjct: 556 KRGTE 560
>gi|293369447|ref|ZP_06616030.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292635445|gb|EFF53954.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 644
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 191/520 (36%), Positives = 285/520 (54%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV+ L+ SFR A + K GGWE+ ELRGH GH LSA M+
Sbjct: 70 WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L E QN + GYLSA+P EL + K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN +AL + T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 190 GLIDQYLYADNKKALTIVTRMGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
SIT D ++ LA F + L D L H NT IP VI YE+T + +
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DP++L+ L ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPRKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + + Q+ + P
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S + S+NG+ + + G++++ T W D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVSVNGKKISVKQKSGSYIAITREWKDGDQI 527
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ ++ E D+ P+ A A+L+GP +LAG E
Sbjct: 528 SATYPMQIKLETTPDN-PDKA---ALLYGPLVLAGERGTE 563
>gi|212690961|ref|ZP_03299089.1| hypothetical protein BACDOR_00451 [Bacteroides dorei DSM 17855]
gi|212666193|gb|EEB26765.1| hypothetical protein BACDOR_00451 [Bacteroides dorei DSM 17855]
Length = 646
Score = 309 bits (791), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 187/516 (36%), Positives = 285/516 (55%), Gaps = 37/516 (7%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ ++VD L+ SFR A + K GGWE+ ELRGH GH LSA M+
Sbjct: 73 WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYGLMY 132
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E QN +G GYLSA+P EL + VWAP+YT+HK+ +
Sbjct: 133 AATGSELFKHKGDSLVSGLAEVQNALGNGYLSAYPEELINRNIRGTSVWAPWYTLHKLFS 192
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKV--ITMYSVERHWYSLNEETGGMNDVLYR 295
GL+DQY+ +DN +AL++ T M ++ Y++++ + +T + R+ E GG+N+ Y
Sbjct: 193 GLIDQYLYSDNQKALEIVTRMADWAYHKLKPLDEVTRRKMIRN------EFGGINESFYN 246
Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY 355
LY+IT D ++ LA F + L D L H NT IP V+ YE+T D
Sbjct: 247 LYAITGDERYRWLARFFYHNEVIDPLKELRDDLGTKHTNTFIPKVLAEARNYELTEDEDS 306
Query: 356 KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
+ + FF + H++A G +S +E ++DP + + ETC TYNMLK+SRHLF
Sbjct: 307 RKLSGFFWHTMIDRHTFAPGCSSDKEHYFDPDHFSKHISGYTGETCCTYNMLKLSRHLFC 366
Query: 416 WTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCC 475
WT + A ADYYERAL N +L Q+ G++ Y LPL G K S TK NSFWCC
Sbjct: 367 WTADAAVADYYERALYNHILG-QQDPHTGMVTYFLPLLSGSHKVYS-----TKENSFWCC 420
Query: 476 YGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR 535
G+G E+ +K G++IY+ N G+Y+ +I S +W+ + L Q+ D +
Sbjct: 421 VGSGFENHAKYGEAIYYH---NDKGIYVNLFIPSVVNWREKGLTLRQETD----FPAEET 473
Query: 536 MTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDK 594
LT ++ V +++ LR P W S G + +NG+ + + PG++++ T W D+
Sbjct: 474 TVLTIGAQNPVE--TTVYLRYPSW--SKGVKVFVNGKKIAVKQKPGSYIAITRLWKDGDR 529
Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+T P+ LR E D+ P+ A+++GP +LAG
Sbjct: 530 ITADYPMCLRVETTPDN-PQKG---ALIYGPLVLAG 561
>gi|423295661|ref|ZP_17273788.1| hypothetical protein HMPREF1070_02453 [Bacteroides ovatus
CL03T12C18]
gi|392672370|gb|EIY65839.1| hypothetical protein HMPREF1070_02453 [Bacteroides ovatus
CL03T12C18]
Length = 644
Score = 308 bits (789), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 191/520 (36%), Positives = 285/520 (54%), Gaps = 35/520 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV+ L+ SFR A + K GGWE+ ELRGH GH LSA M+
Sbjct: 70 WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L E QN + GYLSA+P EL + K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN +AL + T + ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 190 GLIDQYLYADNKKALTIVTRVGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
SIT D ++ LA F + L D L H NT IP VI YE+T + +
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+L+ L ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + + Q+ + P
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S + S+NG+ + + G++++ T W D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVSVNGKKISVKQKSGSYIAITREWKDGDQI 527
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ P+ ++ E D+ P+ A A+L+GP +LAG E
Sbjct: 528 SATYPMQIKLETTPDN-PDKA---ALLYGPLVLAGERGTE 563
>gi|433678837|ref|ZP_20510648.1| hypothetical protein BN444_02893 [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430816044|emb|CCP41169.1| hypothetical protein BN444_02893 [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 648
Score = 308 bits (789), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 185/518 (35%), Positives = 277/518 (53%), Gaps = 31/518 (5%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG-HYLSASAQ 175
+A++ N YL+ + L+ +FR A L + + GGWE+P ELRGHF G HYLSA A
Sbjct: 71 QARERNRRYLMSIPNARLLHNFRLVAGLSSDAEPLGGWESPKCELRGHFAGGHYLSACAL 130
Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
++A+T +A +K+K +V L+ CQ + GYL A+P + + VW P YT HKI
Sbjct: 131 LYAATSDAALKDKADALVAELARCQRQ--DGYLGAYPAAFYARLRRGEDVWVPLYTAHKI 188
Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLY 294
LAG LD A NAQAL+ A ++ + + W + L E GG+ + L
Sbjct: 189 LAGHLDMARHAGNAQALRSAQRFADWLGAWMDGCD-----DAQWQHILGVEFGGVQESLL 243
Query: 295 RLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL 354
LY ++ DPK+ A + +P L LA Q D L+ HANT IP ++ + YE+ G+P
Sbjct: 244 ELYLLSGDPKYQRWAARYAQPALLEPLAQQRDALAGLHANTQIPKIVAAARAYEIGGEPR 303
Query: 355 YKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLF 414
+ I FF V+ H+Y TGGTS E + P A L + E C +YNMLK++RHL+
Sbjct: 304 QRDIAAFFWRTVSGHHAYCTGGTSDYELFGKPDHFAGRLSGHSHECCCSYNMLKLTRHLY 363
Query: 415 RWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC 474
W + A DYYER L N L Q E G+++Y +P+ G K + T F SFWC
Sbjct: 364 TWQPDAALMDYYERVLFNARLGTQ--DEAGMLMYFVPMDAGYWKL-----YNTPFASFWC 416
Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
C GTG+E F+K DSIYF + GL + +I+S DW + + Q+ +
Sbjct: 417 CTGTGVEEFAKSNDSIYFR---DAAGLTVNLFIASQLDWPERGLRVVQR----TRFPQQE 469
Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYND 593
L F K+ Q +L LR+P W + G + +NG+ + PG++L+ R++ D
Sbjct: 470 GTALEFQCKRP--QQMTLRLRIPYWA-TQGVRLRINGKAQAIKATPGSYLALQRRFADGD 526
Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
++ + LP++L + P+ S+QA+++GP +LA
Sbjct: 527 RIELDLPMALHAAPL----PDEPSLQAMMYGPLVLAAQ 560
>gi|336415976|ref|ZP_08596314.1| hypothetical protein HMPREF1017_03422 [Bacteroides ovatus
3_8_47FAA]
gi|335939879|gb|EGN01751.1| hypothetical protein HMPREF1017_03422 [Bacteroides ovatus
3_8_47FAA]
Length = 644
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 191/515 (37%), Positives = 284/515 (55%), Gaps = 35/515 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
++ +DV+ L+ SFR A + K GGWE+ ELRGH GH LSA M+
Sbjct: 70 WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHMLSALGLMY 129
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L E QN + GYLSA+P EL + K VWAP+YT+HK+ +
Sbjct: 130 AATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEELINRNIQGKGVWAPWYTLHKLFS 189
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ ADN +AL + T M ++ YN+++ + S E + E GG+N+ Y LY
Sbjct: 190 GLIDQYLYADNKKALIIVTRMGDWAYNKLKPL----SEETRKLMIRNEFGGINESFYNLY 245
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
SIT D ++ LA F + L D L H NT IP VI YE+T + +
Sbjct: 246 SITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRK 305
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DPK+L+ L ETC TYNMLK+SRHLF WT
Sbjct: 306 LSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWT 365
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ + ADYYERAL N +L Q+ E G++ Y LPL G K S TK NSFWCC G
Sbjct: 366 GDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVG 419
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRM 536
+G E+ +K G++IY+ N G+Y+ +I S WK + + Q+ + P
Sbjct: 420 SGFENHAKFGEAIYYH---NNQGIYVNLFIPSQVTWKEKGLTIRQETEFP-------QEE 469
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKL 595
T F+ + E +++ LR P W S + S+NG+ + + G++++ T W D++
Sbjct: 470 TTRFTLQAENPVRTTIYLRYPSW--SKDVKVSVNGKKIFVKQKSGSYIAITREWKDGDQI 527
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ P+ ++ E D+ P+ A A+L+GP +LAG
Sbjct: 528 SATYPMQIKLETTPDN-PDKA---ALLYGPLVLAG 558
>gi|237712552|ref|ZP_04543033.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|229453873|gb|EEO59594.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
Length = 640
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 192/542 (35%), Positives = 294/542 (54%), Gaps = 38/542 (7%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
+K L DV L S + ++ ++ ++VD L+ SFR A + K
Sbjct: 42 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 100
Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
GGWE+ ELRGH GH LSA M+A+T + K K ++V L+E QN +G GYLSA+
Sbjct: 101 GGWESLDCELRGHTTGHLLSAYGLMYAATGSELFKHKGDSLVSGLAEVQNALGNGYLSAY 160
Query: 212 PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV-- 269
P EL + VWAP+YT+HK+ +GL+DQY+ +DN +AL++ T M ++ Y++++ +
Sbjct: 161 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEIVTRMADWAYHKLKPLDE 220
Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
+T + R+ E GG+N+ Y LY+IT D ++ LA F + L D L
Sbjct: 221 VTRRKMIRN------EFGGINESFYNLYAITGDERYRWLARFFYHNEVIDPLKELRDDLG 274
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
H NT IP V+ YE+T D + + FF + H++A G +S +E ++DP
Sbjct: 275 TKHTNTFIPKVLAEARNYELTEDEDSRKLSGFFWHTMIDRHTFAPGCSSDKEHYFDPDHF 334
Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
+ + ETC TYNMLK+S HLF WT + A ADYYERAL N +L Q+ G++ Y
Sbjct: 335 SKHISGYTGETCCTYNMLKLSSHLFCWTADAAVADYYERALYNHILG-QQDPHTGMVTYF 393
Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
LPL G K S TK NSFWCC G+G E+ +K G++IY+ N G+Y+ +I S
Sbjct: 394 LPLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYH---NDKGIYVNLFIPS 445
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
+W+ + L Q+ D + LT ++ V +++ LR P W S G + +
Sbjct: 446 VVNWREKGLTLRQETD----FPAEETTVLTIGAQNPVE--TTVYLRYPSW--SKGVKVFV 497
Query: 570 NGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
NG+ + + PG++++ T W D++T P+ LR E D+ P+ A+++GP +L
Sbjct: 498 NGKKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQKG---ALIYGPLVL 553
Query: 629 AG 630
AG
Sbjct: 554 AG 555
>gi|427386207|ref|ZP_18882404.1| hypothetical protein HMPREF9447_03437 [Bacteroides oleiciplenus YIT
12058]
gi|425726247|gb|EKU89112.1| hypothetical protein HMPREF9447_03437 [Bacteroides oleiciplenus YIT
12058]
Length = 641
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 179/488 (36%), Positives = 276/488 (56%), Gaps = 26/488 (5%)
Query: 149 KAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL 208
K GGWE+ E+RGH GH LSA A M+A++ + K K ++V L+E Q+ +G GYL
Sbjct: 99 KKLGGWESLDCEIRGHTTGHLLSAYALMYAASGSEIFKLKGDSLVSGLAEVQDALGNGYL 158
Query: 209 SAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
SA+P EL + VWAP+YT+HK+ +GL+DQY+ DN QALK+ T M ++ YN+++
Sbjct: 159 SAYPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALKVVTRMGDWAYNKLKP 218
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ E + E GG+N+ Y LY+IT D ++ LA+ F + L Q D L
Sbjct: 219 L----DEETRKRMIRNEFGGVNESFYNLYAITGDERYHWLANFFYHNDVIDPLKEQRDDL 274
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
H NT IP V+ YE+T + + + FF + A H++A G +S +E ++DP++
Sbjct: 275 GTKHTNTFIPKVLAEARNYELTQNAESRTLTDFFWHTMIAHHTFAPGCSSDKEHYFDPQQ 334
Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
+ L ETC TYNMLK+SRHLF WT + + ADYYERAL N +L Q+ E G+ Y
Sbjct: 335 FSKHLTGYTGETCCTYNMLKLSRHLFCWTGDASIADYYERALYNHILG-QQDPETGMFSY 393
Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
LPL G K S T+ NSFWCC G+G E+ +K G++IY++ E G+Y+ +I
Sbjct: 394 FLPLLSGSHKVYS-----TQENSFWCCVGSGFENHAKYGEAIYYQNE---KGIYVNLFIP 445
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
S +WK + + Q+ + + L+ +K+ V +++ LR P W S S
Sbjct: 446 SEVNWKEKGMTIRQETN----FPAEETTILSIHAKEPVK--TTVYLRYPSW--SKKVTVS 497
Query: 569 LNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
+NG+ + + PG++++ T +W DK+ P+ ++ E D+ P+ A+++GP +
Sbjct: 498 VNGKKVSVKQKPGSYIAVTRQWKDGDKIEANYPMEIQLETTPDN-PQKG---ALVYGPLV 553
Query: 628 LAGHTSGE 635
LAG E
Sbjct: 554 LAGELGTE 561
>gi|440732599|ref|ZP_20912422.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
DAR61454]
gi|440368630|gb|ELQ05659.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
DAR61454]
Length = 652
Score = 306 bits (783), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 184/518 (35%), Positives = 276/518 (53%), Gaps = 31/518 (5%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG-HYLSASAQ 175
+A++ N YL+ + L+ +FR A L + + GGWE+P ELRGHF G HYLSA A
Sbjct: 75 QARERNRRYLMSIPNARLLHNFRLVAGLSSDAEPLGGWESPKCELRGHFAGGHYLSACAL 134
Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
++A+T +A +K+K +V L+ CQ + GYL A+P + + VW P YT HKI
Sbjct: 135 LYAATGDAALKDKADALVAELARCQRQ--DGYLGAYPAAFYARLRRGEDVWVPLYTAHKI 192
Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLY 294
LAG LD A NAQAL+ A ++ + + W + L E GG+ + L
Sbjct: 193 LAGHLDMARHAGNAQALRSAQRFADWLGAWMDGCD-----DAQWQHILGVEFGGVQESLL 247
Query: 295 RLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL 354
LY ++ DPK+ A + +P L LA Q D L+ HANT IP ++ + YE+ DP
Sbjct: 248 ELYLLSGDPKYQRWAARYAQPALLEPLAQQRDALAGLHANTQIPKIVAAARAYEIGRDPR 307
Query: 355 YKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLF 414
+ + FF V+ H+Y TGGTS E + P A L + E C +YNMLK++RHL+
Sbjct: 308 QRDVAAFFWRTVSGHHAYCTGGTSDYELFGKPDHFAGRLSGHSHECCCSYNMLKLTRHLY 367
Query: 415 RWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC 474
W + A DYYER L N L Q E G+++Y +P+ G K + T F SFWC
Sbjct: 368 TWQPDAALMDYYERVLFNARLGTQ--DEAGMLMYFVPMDAGYWKL-----YNTPFASFWC 420
Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
C GTG+E F+K DSIYF + GL + +I+S DW + + Q+ +
Sbjct: 421 CTGTGVEEFAKSNDSIYFR---DAAGLTVNLFIASQLDWPERGLRVVQR----TRFPQQE 473
Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYND 593
L F K+ Q +L LR+P W + G + +NG+ + PG++L+ R++ D
Sbjct: 474 GTALVFQCKRP--QQMTLRLRIPYWA-TQGVRLRINGKAQAIKATPGSYLALQRRFADGD 530
Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
++ + LP++L + P+ S+QA+++GP +LA
Sbjct: 531 RIELDLPMALHAAPL----PDEPSLQAMMYGPLVLAAQ 564
>gi|395774802|ref|ZP_10455317.1| protein [Streptomyces acidiscabies 84-104]
Length = 818
Score = 306 bits (783), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 198/525 (37%), Positives = 273/525 (52%), Gaps = 33/525 (6%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
Q+ N YL +D+D L+ +FR LP+ + GWE P ELRGH GH LS A A
Sbjct: 43 QRRNTAYLRFVDLDRLLHTFRLNVGLPSTAQPCSGWEGPNVELRGHSTGHLLSGLALTHA 102
Query: 179 STHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYTIH 233
+T + +++K +V +L+ECQ GYLSAFP FD EA VWAPYYT+H
Sbjct: 103 NTGDTELRDKGRRLVAALAECQAASPAAGFNAGYLSAFPESFFDRLEAGTGVWAPYYTLH 162
Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
KI+AGL+DQY L+ N QAL + ++ R + S ER L+ E GGMNDVL
Sbjct: 163 KIMAGLVDQYRLSGNEQALDVVLRKGDWVDRRTAGL----SYERMQRVLDTEFGGMNDVL 218
Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
L+ IT D + L +A F LA D L+ HANT IP ++G+ +E D
Sbjct: 219 ADLHEITGDARWLAVAERFTHARVFDPLARGEDRLAGLHANTQIPKMVGALRMWEEGLDV 278
Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
Y+ IG F IV H+Y GG S E + +P +A L E C +YNMLK++R L
Sbjct: 279 RYRTIGENFWRIVTGHHTYVIGGNSNGEAFHEPDVIAGQLSDSTCENCNSYNMLKLTRLL 338
Query: 414 -FRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARST-----HGWG 466
F DYYERAL N +L Q G+E G IY L G +K + + +
Sbjct: 339 HFHAPGRTDLLDYYERALFNQMLGEQDPGSEHGYNIYYTGLAPGSAKRQPSFMSPEDAYS 398
Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDP 526
T + +F C +GTG+E+ +K D+IY +E L + +I S DWK+ + Q
Sbjct: 399 TDYTNFSCDHGTGMETHAKFADTIYTHDEQR---LLVNLFIPSEVDWKAKGITWRQTTR- 454
Query: 527 IVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLSA 585
+ D TLT ++ Q +L +R+P W + GA+ LNG+ LP P PG + +
Sbjct: 455 LPDQDT---ATLTVTAGQ---ARHALVVRVPGW--ARGARVRLNGRTLPDRPAPGTWFTL 506
Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
W D++ + LPL EA DD PE +QA+L GP +LAG
Sbjct: 507 DRAWRRGDRVDVTLPLRTTVEATPDD-PE---VQAVLHGPVVLAG 547
>gi|423239921|ref|ZP_17221036.1| hypothetical protein HMPREF1065_01659 [Bacteroides dorei
CL03T12C01]
gi|392644910|gb|EIY38644.1| hypothetical protein HMPREF1065_01659 [Bacteroides dorei
CL03T12C01]
Length = 646
Score = 305 bits (782), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 192/542 (35%), Positives = 293/542 (54%), Gaps = 38/542 (7%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
+K L DV L S + ++ ++ ++VD L+ SFR A + K
Sbjct: 48 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 106
Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
GGWE+ ELRGH GH LSA M+A+T + K K ++V L E QN +G GYLSA+
Sbjct: 107 GGWESLDCELRGHTTGHLLSAYGLMYAATGSELFKHKGDSLVSGLVEVQNALGNGYLSAY 166
Query: 212 PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV-- 269
P EL + VWAP+YT+HK+ +GL+DQY+ +DN +AL++ T M ++ Y++++ +
Sbjct: 167 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEIVTRMADWAYHKLKPLDE 226
Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
+T + R+ E GG+N+ Y LY+IT D ++ LA F + L D L
Sbjct: 227 VTRRKMIRN------EFGGINESFYNLYAITGDERYRWLARFFYHNEVIDPLKELRDDLG 280
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
H NT IP V+ YE+T D + + FF + H++A G +S +E ++DP
Sbjct: 281 TKHTNTFIPKVLAEARNYELTEDEDSRKLSGFFWHTMIDRHTFAPGCSSDKEHYFDPDHF 340
Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
+ + ETC TYNMLK+S HLF WT + A ADYYERAL N +L Q+ G++ Y
Sbjct: 341 SKHISGYTGETCCTYNMLKLSSHLFCWTADAAVADYYERALYNHILG-QQDPHTGMVTYF 399
Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
LPL G K S TK NSFWCC G+G E+ +K G++IY+ N G+Y+ +I S
Sbjct: 400 LPLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYH---NDKGIYVNLFIPS 451
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
+W+ + L Q+ D + LT ++ V +++ LR P W S G + +
Sbjct: 452 VVNWREKGLTLRQETD----FPAEETTVLTIGAQNPVE--TTVYLRYPSW--SKGVKVFV 503
Query: 570 NGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
NG+ + + PG++++ T W D++T P+ LR E D+ P+ A+++GP +L
Sbjct: 504 NGKKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQKG---ALIYGPLVL 559
Query: 629 AG 630
AG
Sbjct: 560 AG 561
>gi|332880745|ref|ZP_08448418.1| hypothetical protein HMPREF9074_04196 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357045883|ref|ZP_09107513.1| hypothetical protein HMPREF9441_01526 [Paraprevotella clara YIT
11840]
gi|332681379|gb|EGJ54303.1| hypothetical protein HMPREF9074_04196 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355530889|gb|EHH00292.1| hypothetical protein HMPREF9441_01526 [Paraprevotella clara YIT
11840]
Length = 618
Score = 304 bits (778), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 213/621 (34%), Positives = 301/621 (48%), Gaps = 67/621 (10%)
Query: 105 HDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGH 164
HDV L S V R + N +L L+ D L+ +FR A LP+ K GWE+P LRGH
Sbjct: 39 HDVELASSWVKQR-EDLNTAFLRSLEPDRLLHNFRVNAGLPSVAKPLEGWESPGVGLRGH 97
Query: 165 FVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-LK 223
FVGHYLSA + + +A + + VV + CQ G GYLSAFP + E
Sbjct: 98 FVGHYLSAVSALVERYEDAGLARNLEKVVEGMYACQQAHGNGYLSAFPETDIEVLETRFT 157
Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
VWAPYYT+HKI+ GLLD Y+ N +A M + Y R+ K + +V R Y+ +
Sbjct: 158 GVWAPYYTLHKIMQGLLDVYLRTGNEKAYAMVEGLAGYVDRRMSK-LDPATVARMMYTAD 216
Query: 284 ----EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
E GGMN+VLY+LY ++ P++L LA LFD FL L D LS HANTHI +
Sbjct: 217 ANPQNEMGGMNEVLYQLYCVSGKPRYLELASLFDPSWFLEPLVRNEDILSGLHANTHIAL 276
Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA------------REFWWDPK 387
V G RYE TG+ Y F +++ H+Y G +S E W +P
Sbjct: 277 VNGFARRYESTGEECYGKSVANFWNMLMHFHAYVNGTSSGPRPNVTTETSLTAEHWGEPC 336
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ-RGTEPGVM 446
L +TL E+C T+N +++ LF WT YAD Y N VL +Q R T G
Sbjct: 337 HLCNTLTKGIAESCVTHNTQRLNASLFSWTGNPCYADVYMNMFYNAVLPVQSRST--GAY 394
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
+Y LPLG KA N F CC G+ E+F+KL + IY+ ++ V Y+ Y
Sbjct: 395 VYHLPLGSPRHKAYMAD------NDFKCCSGSCAEAFAKLNNGIYYHDDSAV---YVNLY 445
Query: 507 ISSSFDWKSGHVVLNQK----VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
+ S W V L Q V+PIV + +R + F LNL +P WT
Sbjct: 446 VPSKVHWADKKVGLEQAGGFPVEPIVDFTVSVRRPVDF----------VLNLFIPAWT-- 493
Query: 563 NGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
+GA +NG+ +P P +FL + RW+ D++ I+ + R +++ P+ ++ A+
Sbjct: 494 DGAVVYVNGEKQEMPVRPSSFLKLSRRWADGDRVRIEFRYAFRLQSM----PDKENMLAV 549
Query: 622 LFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQS 681
+GP LLA T E +K L+ L SF +S + FV+ N +
Sbjct: 550 FYGPMLLAFETRDEVILKGNKDEILAGL------SF--------ADSESGRFVLKNGERE 595
Query: 682 ITMEE-FPVSGTDAALHATFR 701
+ F V ++AT R
Sbjct: 596 FRLRPLFDVDKESYGVYATIR 616
>gi|326204047|ref|ZP_08193908.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
gi|325985814|gb|EGD46649.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
Length = 743
Score = 302 bits (774), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 189/518 (36%), Positives = 271/518 (52%), Gaps = 31/518 (5%)
Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
L A + +EYL D D L+ F T L + Y GWEN +E+RGH +GHYL+A A
Sbjct: 11 LVNAFKKEIEYLEAFDCDKLLSCFYITKGLTPKAENYRGWEN--TEIRGHTMGHYLTALA 68
Query: 175 QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK 234
Q +++T+++ I E++ ++ LS CQ +GYLSAFP E FD E KP+W P+YT+HK
Sbjct: 69 QAYSATNDSKIYERLQYLMKELSLCQ--FESGYLSAFPEEFFDRVENRKPIWVPWYTMHK 126
Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLY 294
I+ GL+ Y LA ALK+ + + E+ ++R K ++ E H L E GGMND +Y
Sbjct: 127 IITGLISVYKLAKIETALKIVSRLGEWVFSRTDK----WTPEIHANVLAVEYGGMNDCMY 182
Query: 295 RLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP- 353
LY I+ + KH AH+FD+ + D L++ HANT IP +G+ RY G+
Sbjct: 183 ELYKISGNEKHCTAAHMFDEIELFKEIHDGKDILNNRHANTTIPKFLGALNRYLAIGEEE 242
Query: 354 -LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
Y F IV +HSY TGG S E + +P L S N ETC TYNMLK++R
Sbjct: 243 QFYLDTCKEFWSIVTNNHSYVTGGNSEWEHFGEPGILDAERTSTNCETCNTYNMLKMTRE 302
Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
LF+ T YAD+YE TN +LS Q + G+ +Y P+ G K +G F F
Sbjct: 303 LFKITGNKKYADFYENTFTNAILSSQ-NPDTGMTMYFQPMETGYFKV-----YGKPFEHF 356
Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
WCC GTG+E+F+KL +SIYF EE LY+ Y S+ +W+ V L Q D I D
Sbjct: 357 WCCTGTGMENFTKLNNSIYFYEEDR---LYVNMYYSTELNWEEKGVKLTQNSD-IPGTD- 411
Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYN 592
F+ K E G +L +R+P W + G + ++N + W N
Sbjct: 412 ----RAGFTIKAETGAEFTLCMRIPTW--AKGVKINVNNNLSIFTEERGYALIHRTWKDN 465
Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
D + I + + + P+ + A +GP +L+
Sbjct: 466 DTVEIIFKIEPQLSTL----PDNPNAVAFTYGPVVLSA 499
>gi|325106457|ref|YP_004276111.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324975305|gb|ADY54289.1| protein of unknown function DUF1680 [Pedobacter saltans DSM 12145]
Length = 648
Score = 302 bits (774), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 194/551 (35%), Positives = 293/551 (53%), Gaps = 41/551 (7%)
Query: 101 EVSLHDVWLDQSSVLWRAQQTNLE----YLLMLDVDSLVWSFRKTASLPTPG-------K 149
+V ++ L +L A + N+E +L+ LDV+ L+ SFR TA + + K
Sbjct: 39 DVKVYSFDLKDVRLLPSAFRDNMERDSKWLMSLDVNRLLHSFRNTAGVFSSKEGGYMTIK 98
Query: 150 AYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQ---NKIG-T 205
GGWE+ +LRGH GH +SA + ++AST + K K ++V L+E Q K+G
Sbjct: 99 KLGGWESLDCDLRGHTTGHIMSALSYLYASTGDERYKIKSDSIVNGLAEVQYALTKVGQN 158
Query: 206 GYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
G++SAFP + A + +WAP+YT+HKI AGL+DQY+ N +AL + T + Y
Sbjct: 159 GFISAFPENFINRNIAGQSIWAPWYTLHKIYAGLIDQYLYCGNEKALDIMTKAASWAY-- 216
Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
QK++ + + E+ L E GG N+ Y LY+IT +P+HL LA F L LA +
Sbjct: 217 -QKLMPL-TEEQRATMLRNEFGGTNEAFYNLYAITGNPEHLKLAEFFYHNAVLDPLAERK 274
Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
L HANT IP +IG YE+ D K + TFF D V +Y TGG S +E +
Sbjct: 275 SDLYFKHANTFIPKLIGEARNYELNADKRSKDVATFFWDEVVNHQTYCTGGNSHKEKFIH 334
Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
++++ L +ETC + NMLK++RHLF W YAD+YERAL N +L Q+ + G+
Sbjct: 335 TDKVSENLTGYTQETCNSNNMLKLTRHLFSWDANPKYADFYERALYNHILG-QQDPQTGM 393
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
+ Y LPL G K ST NSFWCC GTG E+ +K G++IY+ N LY+
Sbjct: 394 VAYFLPLLPGSYKVYSTAE-----NSFWCCVGTGFENHAKYGEAIYYHNNTN---LYVNL 445
Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
+I S W V L Q+ + +++T+ + Q+ +LNLR P W ++G
Sbjct: 446 FIPSELTWNEKGVKLKQET--VFPESDLVKLTVQTAKSQKF----ALNLRYPYW--ASGV 497
Query: 566 QASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
Q +NG+ + + P +++ W D++ I+ P+SL D+ A+++G
Sbjct: 498 QVKINGKAVKVKQVPSSYIVIDRTWKNGDQIIIKYPMSLHLAEANDN----VDKAAVMYG 553
Query: 625 PYLLAGHTSGE 635
P +LAG E
Sbjct: 554 PLVLAGMMGTE 564
>gi|326203856|ref|ZP_08193718.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
gi|325985954|gb|EGD46788.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
Length = 854
Score = 301 bits (771), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 187/547 (34%), Positives = 281/547 (51%), Gaps = 41/547 (7%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
+++V++ D +L A + YL +D + L+ +R+TA L T YGGWEN
Sbjct: 43 MEQVNITDTYLAN------AFNKEISYLQSIDPNRLLVGYRQTAGLSTSYSKYGGWEN-- 94
Query: 159 SELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVVFSLSECQNKIGTGYLSAFPT 213
+ L+GH +GHY+SA AQ + +T NA +K+++ ++ L +CQNK G GY+ A
Sbjct: 95 TPLKGHTLGHYMSALAQAYKNTKSNATVNADMKKRIDLIISELQQCQNKRGDGYIYAETP 154
Query: 214 ELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
E F+ E A +WAP+YT+HKI++GL+ Y L N AL +A+ + ++ YNRV
Sbjct: 155 EQFNVVEGKATGTLWAPWYTMHKIMSGLISIYELEGNPTALTVASKLGDWIYNRVN---- 210
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
+ L E GGMND L LY +T HL A F++P L +A + L+
Sbjct: 211 AWDSATQAKVLGVEYGGMNDCLIELYKLTGKSNHLAAAKKFEEPSLLNTIASGNNVLAGK 270
Query: 332 HANTHIPIVIGSQMRYEVTG--DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
HANT IP IG+ RY G + Y F ++V H+Y TGG S E + +L
Sbjct: 271 HANTTIPKFIGAINRYRTLGTSEASYLTAAQQFWNMVIRDHTYVTGGNSQWEAFRAAGKL 330
Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
N ETC +YNMLK++R LF+ T ++ YAD+YER+ N +L+ Q E G+ Y
Sbjct: 331 DQYRDEVNNETCNSYNMLKLTRELFQVTGDVKYADFYERSFINEILASQN-PETGMTTYF 389
Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
P+G G K S F++FWCC GTG+E+F+KL DSIYF N LY+ YISS
Sbjct: 390 KPMGTGYFKVFS-----KPFDNFWCCTGTGMENFTKLNDSIYFN---NGSDLYVNMYISS 441
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN-GAQAS 568
+ +W + L QK D +S T+TF+ + R P W ++
Sbjct: 442 TLNWSEKGLSLTQKADVPLS------DTVTFTIDSAPSSEVKIKFRSPYWVAADKKVTVK 495
Query: 569 LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
+NG ++ +L + W DKL + +P ++ D++ ++ A +GP +L
Sbjct: 496 VNGSSVNASVVNGYLDVSRVWKVGDKLELTIPAEVQISRCTDNQ----NVAAFTYGPVVL 551
Query: 629 AGHTSGE 635
E
Sbjct: 552 CAGLGNE 558
>gi|332663228|ref|YP_004446016.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332332042|gb|AEE49143.1| protein of unknown function DUF1680 [Haliscomenobacter hydrossis
DSM 1100]
Length = 791
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 197/561 (35%), Positives = 300/561 (53%), Gaps = 43/561 (7%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L D+ L S + A + + YLL ++ D L+ F A LPT YGGWE+ L G
Sbjct: 50 LEDLRLLPGSAFYNAMEKDAAYLLKIESDRLLHRFYANAGLPTKAPVYGGWES--EGLSG 107
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-- 221
H +GHYLSA A M+A + + E+++ +V L+ CQ TGY+ A P E DS A
Sbjct: 108 HTLGHYLSACALMYAGSKDEKYLERVNYLVQELARCQVARKTGYVGAIPKE--DSIFAQV 165
Query: 222 -----------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
L W+P+YTIHK++AGL D Y+ +N QAL++ M ++ + V K+
Sbjct: 166 ARGDIRSSGFDLNGGWSPWYTIHKVMAGLADAYLYTNNDQALQVLRGMSDWTASVVDKL- 224
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
+ + L E GGMN++L +Y+ T + K+L L++ F + L+ + D L
Sbjct: 225 ---NDPQRQKMLKCEYGGMNEILANVYAFTGEKKYLDLSYKFYDDFVMEPLSKKIDPLPG 281
Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
H+NT++P IGS +YE+TG+ + I +FF + + +H+Y GG S E+ D +L
Sbjct: 282 KHSNTNVPKAIGSARQYELTGNTRDQTIASFFWETMVHNHTYVIGGNSNYEYCGDAGKLN 341
Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
D L ETC TYNMLK++RHLF W ADYYERAL N +L+ Q E G+M Y +
Sbjct: 342 DRLSDNTCETCNTYNMLKLTRHLFCWQPSAELADYYERALYNHILASQH-PETGMMTYFV 400
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE-EEGNVPGLYIIQYISS 509
PL G K S +F++F CC G+G+E+ K +SIY+ ++GN LY+ +I S
Sbjct: 401 PLRMGSKKEFS-----NEFHTFTCCVGSGMENHVKYTESIYYRGQDGN--SLYLNLFIPS 453
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
+WK + L Q+ D + ++ T + Q++ +LNLR P W ++ Q +
Sbjct: 454 ELNWKERGLTLRQETK--FPQDGKVTLSFTCAKSQKL----ALNLRRPWWMKADW-QIKV 506
Query: 570 NGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
NG+ + P+ + RW DKL +++P+ L TE++ P+ + A L+GP +L
Sbjct: 507 NGKAVQPVAGTNGYYVLNRRWKNGDKLELEMPMQLYTESM----PDNPNRIAFLYGPLVL 562
Query: 629 AGHTSGEW-DIKTGTARSLSA 648
AG + D GT LSA
Sbjct: 563 AGQLGDKMPDPVYGTPVLLSA 583
>gi|325281981|ref|YP_004254523.1| hypothetical protein Odosp_3391 [Odoribacter splanchnicus DSM
20712]
gi|324313790|gb|ADY34343.1| protein of unknown function DUF1680 [Odoribacter splanchnicus DSM
20712]
Length = 782
Score = 297 bits (761), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 196/566 (34%), Positives = 292/566 (51%), Gaps = 45/566 (7%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
LK+V L D S A N ++L +D+D L+ +F K A L G++YG WE+
Sbjct: 45 LKDVRLLD------SPFKNAMDRNAAWMLEMDMDRLLSNFLKNAGLEPKGESYGSWES-- 96
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
+ GH +GHYLSA AQ +AST + K+++ +V L CQ G++ P +F
Sbjct: 97 MGIAGHTLGHYLSAVAQQYASTGDERFKQRVDYIVHELDSCQQYFVNGFIGGMPGGDRVF 156
Query: 217 DSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
+ L +W P+Y HK + GL D Y+LA N A K+ + +Y +
Sbjct: 157 KQVKKGIIRSAGFDLNGLWVPWYNEHKTMMGLNDAYLLAGNKTAKKVLVNLADYLVD--- 213
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
V+ + E+ LN E GGMN+ L ++Y++T D K+L ++ F + LA D
Sbjct: 214 -VLAGLTDEQVQTMLNCEFGGMNEALAQVYALTGDKKYLDASYRFYHRRLMEPLAEGKDI 272
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L H+NT IP +IGS +YE+TG+P + I FF + HSYA GG S+ E+ P
Sbjct: 273 LPGLHSNTQIPKIIGSARQYELTGNPKDERIAEFFWTTMVNHHSYANGGNSSGEYLSTPD 332
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+L D L ETC TYNMLK+SRHL+ WT + Y D+YE+AL N +L+ Q E G+
Sbjct: 333 KLNDRLTHSTCETCNTYNMLKLSRHLYEWTGDPKYLDFYEKALYNHILASQH-PETGMTC 391
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y +PL G K + K+NSF CC G+G E+ SK G +IY + L++ YI
Sbjct: 392 YFVPLAMGTRKD-----FCDKYNSFTCCMGSGFENHSKYGGAIY-SHGSDDRSLFVNLYI 445
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S WK + KV + R+TL + Q +LNLR PVW G
Sbjct: 446 PSVLTWKEKGL----KVRLETVYPENGRVTLKVVEGER--QPLALNLRYPVWA-GEGIVV 498
Query: 568 SLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
+NG + PG+F++ +W D++ + +P++L T+ + P+ A +A+ +GP
Sbjct: 499 KVNGTKQKITSKPGSFVTLERKWKAGDRIELNIPMNLYTKEM----PDNADRRAVFYGPT 554
Query: 627 LLAGHTSGEWDIKTGTARSLSALISP 652
LLAG GE +I+ R + +SP
Sbjct: 555 LLAG-ALGEKEIE--PIRGVPVFVSP 577
>gi|427386203|ref|ZP_18882400.1| hypothetical protein HMPREF9447_03433 [Bacteroides oleiciplenus YIT
12058]
gi|425726590|gb|EKU89454.1| hypothetical protein HMPREF9447_03433 [Bacteroides oleiciplenus YIT
12058]
Length = 616
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 188/560 (33%), Positives = 287/560 (51%), Gaps = 51/560 (9%)
Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
+EV+L W+ Q ++ N+ +L LD D L+ +FR TA LP+ + GWE+P
Sbjct: 37 EEVTLKSSWIKQR------EELNITFLKSLDPDRLLHNFRVTAGLPSNAEPLEGWESPKI 90
Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSF 219
LRGHFVGHYLSA + + + + E++ ++ L +CQ G YLSAFP + FD+
Sbjct: 91 GLRGHFVGHYLSAVSSLVEKYKDLELVERLRYMIDELCKCQQSFGNSYLSAFPDKDFDAL 150
Query: 220 EA-LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
EA VWAPYYT +K++ GLLD Y N +A M M Y NR+ K ++ ++E+
Sbjct: 151 EAKFTGVWAPYYTYNKVMQGLLDAYTHTGNQKAYDMLLDMAAYVDNRMSK-LSGETIEKM 209
Query: 279 WYSLN----EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHAN 334
Y+++ E G MN+VLY+LY I+ +PKHL LA +FD+ F+ LA D LS H+N
Sbjct: 210 LYTVDANPQNEPGAMNEVLYKLYKISRNPKHLALAEIFDRNWFITPLAENKDILSGLHSN 269
Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA------------REF 382
TH+ +V G RY +TG+ Y T F D++ + H YA G +S E
Sbjct: 270 THLVLVNGFAQRYSITGESKYYAASTNFWDMLISQHVYANGTSSGPRPNATTRTSVTAEH 329
Query: 383 WWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
W P L +TL E E+C ++N K++ +F WT YAD Y N VL+ Q
Sbjct: 330 WGVPGHLCNTLTKEIAESCVSHNTQKLTSSIFTWTAAPKYADAYMNTFYNAVLASQ-SAH 388
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
G +Y LPLG +K K N F CC G+ E++S+L IY+ ++ L+
Sbjct: 389 TGAYMYHLPLGSPRNKKY------LKDNDFACCSGSSAEAYSRLNSGIYYHDDS---ALW 439
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
+ ++ S +WK +V L Q + + + T S+K++VG +L L +P W +
Sbjct: 440 VNLFVPSEVNWKEKNVRLEQNGN----FPKDTNICFTISTKKKVG--FALKLFIPSW--A 491
Query: 563 NGAQASLNGQNLPLPP-PGNFLSATERWSYND--KLTIQLPLSLRTEAIQDDRPEYASIQ 619
A+ +NG+ + P +++ W D KL L+T P+ +
Sbjct: 492 KNAEVYINGEKQEIETFPSSYIDLNRNWRDKDEVKLIFHYDFHLKT------MPDNKDVL 545
Query: 620 AILFGPYLLAGHTSGEWDIK 639
++ +GP LLA + E +K
Sbjct: 546 SLFYGPMLLAFESDEEVILK 565
>gi|337746495|ref|YP_004640657.1| hypothetical protein KNP414_02226 [Paenibacillus mucilaginosus
KNP414]
gi|336297684|gb|AEI40787.1| hypothetical protein KNP414_02226 [Paenibacillus mucilaginosus
KNP414]
Length = 749
Score = 296 bits (759), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 203/571 (35%), Positives = 294/571 (51%), Gaps = 64/571 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
LH V + +S L A + N YLL L+ D L+ FR+ A L Y GWE+ + G
Sbjct: 8 LHKVRI-ESGPLKHAMELNASYLLNLEADRLLSRFREYAGLAPKAPHYEGWES--RGISG 64
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H +GHYLS A M+AST + +++ VV L +CQ G+G++S P ELF +A
Sbjct: 65 HTLGHYLSGCALMYASTGREELLSRVNYVVEELQQCQRADGSGFISGIPRGKELFQEVKA 124
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYF----YN 264
L W P YT+HK+ AGL D Y+LA + +AL K+ W+ + F +
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLAGSRKALEIEIKLGLWLDDVFSGLSHE 184
Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
+VQ+V L+ E GGMN+VL L + D + L LA F LG +A +
Sbjct: 185 QVQRV------------LHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAER 232
Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
D L HANT IP +IG+ +YEVTG+ Y I FF D V HSY GG S E +
Sbjct: 233 KDTLGGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHFG 292
Query: 385 DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
+P +L D LG ETC TYNMLK++RHLF+W AYADYYERA+ N +L Q+ + G
Sbjct: 293 EPDKLNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILGSQQPVD-G 351
Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
+ Y + L G K+ + +++ F CC G+G+ES S G +IYF N L++
Sbjct: 352 RVCYFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSLYGSAIYFH---NGSALFVN 403
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL----TFSSKQEVGQLSSLNLRMPVWT 560
Q++ S+ +W+ V L Q+ + LR+ TF+ K +R P W
Sbjct: 404 QFVPSTVEWEEQGVRLTQETAFPENGRGVLRIRTAKPGTFAVK----------VRYPSWA 453
Query: 561 YSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
G +NGQ + PG +++ W D L P++LR E++ D+ P+
Sbjct: 454 -EPGISVKVNGQAVSADARPGGYVTVEREWQDGDTLEYDFPMTLRIESMPDN-PDRI--- 508
Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSLSALI 650
A+L+GP +LAG G D R+L++++
Sbjct: 509 ALLYGPLVLAGDL-GAIDAPQDGERALASVL 538
>gi|302548275|ref|ZP_07300617.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Streptomyces
hygroscopicus ATCC 53653]
gi|302465893|gb|EFL28986.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Streptomyces
himastatinicus ATCC 53653]
Length = 849
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 189/526 (35%), Positives = 279/526 (53%), Gaps = 34/526 (6%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
Q N YL +D++ L+ +FR + + + GGWE+P +ELRGH GH LS A +A
Sbjct: 72 QSRNTAYLRFVDINRLLHTFRLNVGIASSAQPCGGWESPTTELRGHSTGHLLSGLALTYA 131
Query: 179 STHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYTIH 233
+T + + +K +V +L+ CQ K TGYLSAFP FD EA VWAPYYTIH
Sbjct: 132 NTGDTALLDKSRKLVSALAACQAKSPAAGYRTGYLSAFPENFFDRLEAGSGVWAPYYTIH 191
Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
KI+AGL+DQY LA NA+AL+ + R ++ S ++ L E GGMNDVL
Sbjct: 192 KIMAGLVDQYRLAGNAEALETVLRQAAWVDTRTARL----SYDQMQRVLETEYGGMNDVL 247
Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
L++IT D + L +A F L+ D L+ HANT IP ++G+ +E D
Sbjct: 248 ADLHAITGDSRWLRVAERFTHARVFDPLSRNEDRLAGLHANTQIPKMVGALRLWEEGLDS 307
Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
Y+ IG F IV H+Y GG S E + +P +A L E C +YNMLK++R +
Sbjct: 308 RYRTIGENFWKIVTDHHTYVIGGNSNGEAFHEPDAIAAQLSGSCCENCNSYNMLKLARLI 367
Query: 414 -FRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARST------HGW 465
F + DYYER L N +L Q + G IY L G K + + + +
Sbjct: 368 HFHAPERTDLLDYYERTLFNQMLGEQDPDSAHGFNIYYTGLAPGSFKQQPSFMGPDPNQY 427
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
T +++F C +G+G+E+ +K D+IY + + L + +I S W+ + Q
Sbjct: 428 STDYDNFSCDHGSGMETHAKFADTIYTRGDRS---LLVNLFIPSELRWQEKGITWRQ--- 481
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLS 584
+ TLT SS +SL LR+ + ++++GA+A+LNG LP P PG++L
Sbjct: 482 -TTGFPDQQTTTLTVSSGG-----ASLELRVRIPSWASGARAALNGATLPDQPKPGSWLI 535
Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+W D++ + LP+ LR + DD P+ IQA+L+GP +LAG
Sbjct: 536 IDRQWKTGDRVEVTLPMKLRLDPTPDD-PD---IQAVLYGPVVLAG 577
>gi|386723005|ref|YP_006189331.1| hypothetical protein B2K_12670 [Paenibacillus mucilaginosus K02]
gi|384090130|gb|AFH61566.1| hypothetical protein B2K_12670 [Paenibacillus mucilaginosus K02]
Length = 749
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 202/571 (35%), Positives = 294/571 (51%), Gaps = 64/571 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
LH V + +S L A + N YLL L+ D L+ FR+ A L Y GWE+ + G
Sbjct: 8 LHKVRI-ESGPLKHAMELNASYLLNLEADRLLSRFREYAGLAPKAPHYEGWES--RGISG 64
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H +GHYLS A M+AST + +++ VV L +CQ G+G++S P ELF+ +A
Sbjct: 65 HTLGHYLSGCALMYASTGREELLSRVNYVVEELEQCQRADGSGFISGIPRGKELFEEVKA 124
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYF----YN 264
L W P YT+HK+ AGL D Y+L + +AL K+ W+ + F +
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLTGSRKALEIEIKLGLWLDDVFSGLSHE 184
Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
+VQ+V L+ E GGMN+VL L + D + L LA F LG +A +
Sbjct: 185 QVQRV------------LHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAER 232
Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
D L HANT IP +IG+ +YEVTG+ Y I FF D V HSY GG S E +
Sbjct: 233 KDTLGGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHFG 292
Query: 385 DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
+P +L D LG ETC TYNMLK++RHLF+W AYADYYERA+ N +L+ Q+ + G
Sbjct: 293 EPDKLNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILASQQPVD-G 351
Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
+ Y + L G K+ + +++ F CC G+G+ES S G +IYF L++
Sbjct: 352 RVCYFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSLYGSAIYFHSGST---LFVN 403
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL----TFSSKQEVGQLSSLNLRMPVWT 560
Q++ S+ DW+ V L Q+ + LR+ TF+ K +R P W
Sbjct: 404 QFVPSTVDWEEQGVRLTQETSFPENGRGVLRIRTAKPGTFAVK----------VRYPSWA 453
Query: 561 YSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
G +NGQ + PG +++ W D L P++LR E++ D+ P+
Sbjct: 454 -EPGISVKVNGQAVSADARPGGYVTVEREWQDGDTLEYDFPMTLRIESMPDN-PDRI--- 508
Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSLSALI 650
A+L+GP +LAG G D R+L++++
Sbjct: 509 ALLYGPLVLAGDL-GAIDAPQDGERALASVL 538
>gi|375148455|ref|YP_005010896.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361062501|gb|AEW01493.1| protein of unknown function DUF1680 [Niastella koreensis GR20-10]
Length = 786
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 193/529 (36%), Positives = 283/529 (53%), Gaps = 43/529 (8%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
+A + ++ YL +++ D L+ FR+ A L G+ YGGWE+ S L GH +GHYLSA A
Sbjct: 59 KAMEADVRYLQVIEPDRLLADFREHAGLKPKGEHYGGWEH--SGLAGHTLGHYLSACAMH 116
Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-------------LK 223
+A++H+ K++ +V L+ECQ K GY+ A P E DS A L
Sbjct: 117 YAASHDKQFLGKVNYIVDELAECQPK-RNGYVGAIPKE--DSMWAEVEKGNIHSRGFDLN 173
Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
W+P+YT+HKI+AGLLD Y+ DN +AL + T M ++ + ++ + S++R +
Sbjct: 174 GAWSPWYTVHKIMAGLLDAYLYCDNKKALAVETGMADWTAHLLRNLPDS-SLQRMLFC-- 230
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGS 343
E GGMNDVL Y++T + K+L L++ F L LALQ D L H+NT IP VIG
Sbjct: 231 -EYGGMNDVLNNTYALTGEKKYLDLSYKFHDKRILDSLALQKDILPGKHSNTQIPKVIGC 289
Query: 344 QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTT 403
RYE+T K IG FF V H+YA GG S E+ +L +TL ETC T
Sbjct: 290 IRRYELTAGEKDKTIGDFFWQTVVNDHTYAPGGNSNYEYLGPAGQLNETLTDNTMETCNT 349
Query: 404 YNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH 463
YNMLK++RHLF + DYYERAL N +LS Q + G+M Y +PL G K S
Sbjct: 350 YNMLKLTRHLFALQPTASLMDYYERALYNHILSSQDHST-GMMCYFVPLRMGTQKEFS-- 406
Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
FN+F CC G+G+E+ K G++IY+ +G LY+ +I+S WK VV+ Q+
Sbjct: 407 ---DSFNTFTCCVGSGMENHVKYGETIYY--QGADGSLYVNLFIASRLTWKEKGVVVEQQ 461
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPG--N 581
+ Y+R+ + + +L +R P W G ++NG+ PG
Sbjct: 462 TQ--LPESNYIRLAIKAARPVAF----TLRIRNPYWA-KQGVWIAVNGKEQTNLQPGADG 514
Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ + T W D + ++ L L T ++ P+ + AI +GP +LAG
Sbjct: 515 YFTITRTWKTGDAVIVKPSLQLYTRSM----PDNPNRLAIFYGPLVLAG 559
>gi|379720404|ref|YP_005312535.1| hypothetical protein PM3016_2500 [Paenibacillus mucilaginosus 3016]
gi|378569076|gb|AFC29386.1| hypothetical protein PM3016_2500 [Paenibacillus mucilaginosus 3016]
Length = 749
Score = 295 bits (755), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 202/571 (35%), Positives = 295/571 (51%), Gaps = 64/571 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
LH V + +S L A + N YLL L+ D L+ FR+ A L Y GWE+ + G
Sbjct: 8 LHKVRI-ESGPLKHAMELNASYLLNLEADRLLSRFREYAGLEPKAPHYEGWES--RGISG 64
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H +GHYLS A M+AST + +++ VV L +CQ G+G++S P ELF +A
Sbjct: 65 HTLGHYLSGCALMYASTGREELLSRVNYVVEELQQCQRADGSGFISGIPRGKELFQEVKA 124
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYF----YN 264
L W P YT+HK+ AGL D Y+LA + +AL K+ W+ + F +
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLAGSRKALEIEIKLGLWLDDVFSGLSHE 184
Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
+VQ+V L+ E GGMN+VL L + D + L LA F LG +A +
Sbjct: 185 QVQRV------------LHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAER 232
Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
D L HANT IP +IG+ +YEVTG+ Y I FF D V HSY GG S E +
Sbjct: 233 KDTLGGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHFG 292
Query: 385 DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
+P +L D LG ETC TYNMLK++RHLF+W AYADYYERA+ N +L+ Q+ + G
Sbjct: 293 EPDKLNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILASQQPVD-G 351
Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
+ Y + L G K+ + +++ F CC G+G+ES S G +IYF + L++
Sbjct: 352 RVCYFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSLYGSAIYFH---SGSALFVN 403
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL----TFSSKQEVGQLSSLNLRMPVWT 560
Q++ S+ +W+ V L Q+ + LR+ TF+ K +R P W
Sbjct: 404 QFVPSTVEWEEQGVRLTQETAFPENGRGVLRIRTAKPGTFAVK----------VRYPSWA 453
Query: 561 YSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
G +NGQ + PG +++ W D L P++LR E++ D+ P+
Sbjct: 454 -EPGISVKVNGQAVSADARPGGYVTVEREWQDGDTLEYDFPMTLRIESMPDN-PDRI--- 508
Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSLSALI 650
A+L+GP +LAG G D R+L++++
Sbjct: 509 ALLYGPLVLAGDL-GAIDAPQDGERALASVL 538
>gi|393783247|ref|ZP_10371422.1| hypothetical protein HMPREF1071_02290 [Bacteroides salyersiae
CL02T12C01]
gi|392669526|gb|EIY63014.1| hypothetical protein HMPREF1071_02290 [Bacteroides salyersiae
CL02T12C01]
Length = 1022
Score = 295 bits (754), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 191/549 (34%), Positives = 279/549 (50%), Gaps = 49/549 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLL-MLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
LK++ L D S A + ++L+ L D + F A LPT G YGGWEN
Sbjct: 54 LKQIRLLD------SPFKTAMNADRKWLMETLKPDRFLHRFHANAGLPTKGTIYGGWEN- 106
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE--L 215
++ G GHY+SA + ++A+T IK ++ + L CQ+K GTGY+ A P E L
Sbjct: 107 -TDQSGFSFGHYISALSMLYATTGEEDIKIRLDYCISELKRCQDKRGTGYVGAIPNEDKL 165
Query: 216 FDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRV 266
+D L VW P+Y +HK+ +GL+D Y+ +N A + + ++ ++
Sbjct: 166 WDDVSKGIIDGRNFNLNNVWVPWYNLHKLWSGLIDAYIFGENETAKTIVIALTDWACDKF 225
Query: 267 QKVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
+ + E W + L E GGMND LY +Y+IT D +HL +A+ F L L+ +
Sbjct: 226 KDL-----TEEQWQNILTCEHGGMNDALYNVYAITGDTRHLEIANKFYHKKVLDPLSKRK 280
Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
+ L+ HANT IP VIG YE+TG+ + I ++F V HSY GG S E + +
Sbjct: 281 NELAGLHANTQIPKVIGISRSYELTGNQDHHTISSYFWHTVTHEHSYCIGGNSNYEHFVE 340
Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
P +L+ L ++ ETC TYNMLK++RHLF W D+YERAL N +L+ Q E G+
Sbjct: 341 PGKLSGELSNKTTETCNTYNMLKLTRHLFAWNPSAELMDFYERALYNHILASQ-NPETGM 399
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
+ Y +PL A S + N+FWCC GTG E+ K + IY E LYI
Sbjct: 400 VCYCVPLA-----ANSQKNYCNAENNFWCCVGTGFENHVKYAEQIYSHNENE---LYINL 451
Query: 506 YISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
YI S DW ++ L Q + P T + + V Q + ++R P W S G
Sbjct: 452 YIPSELDWSEKNMKLKQTNNFP-------DTDNTTITITETVPQTLTFHVRFPNWVQS-G 503
Query: 565 AQASLNG-QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
+NG + + PG+++S T W NDK+ I LP +L E + D+ + A L
Sbjct: 504 YSIKINGTEQVFNSTPGSYVSITREWKTNDKIEINLPKTLTKEQLLGDKYK----TAFLN 559
Query: 624 GPYLLAGHT 632
GP +LAG T
Sbjct: 560 GPIVLAGKT 568
>gi|29827685|ref|NP_822319.1| protein [Streptomyces avermitilis MA-4680]
gi|29604785|dbj|BAC68854.1| putative secreted protein [Streptomyces avermitilis MA-4680]
Length = 854
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 189/526 (35%), Positives = 272/526 (51%), Gaps = 34/526 (6%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
Q+ N YL +D+D L+ +FR LP+ + GGWE P ELRGH GH LS A A
Sbjct: 77 QRRNSAYLRFVDIDRLLHTFRTNVGLPSDAEPCGGWEGPGVELRGHSTGHLLSGLALAHA 136
Query: 179 STHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYTIH 233
ST +++K +V +L+ECQ+ GTGYLSAFP FD EA VWAPYYTIH
Sbjct: 137 STGEEALRDKGRRLVAALAECQSAAPAAGFGTGYLSAFPESFFDRLEAGSGVWAPYYTIH 196
Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
KI+AGL++QY L QAL++ + R K+ S E+ L E GGMNDVL
Sbjct: 197 KIMAGLVEQYRLVGVGQALEVVLRQARWVDERTAKL----SYEQMQRVLETEFGGMNDVL 252
Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
L+++T DP+ L +A F LA D L+ HANT IP ++G+ +E
Sbjct: 253 ADLHALTGDPRWLDVAERFTHARVFDPLAGNQDKLAGLHANTQIPKMVGALRLWEEGRAD 312
Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
Y+ + F IV H+Y GG S E + +P +A L E C +YNMLK++R L
Sbjct: 313 RYRTVAENFWQIVTDHHTYVIGGNSNGEAFHEPDVIAGQLSDNTCENCNSYNMLKLTRLL 372
Query: 414 -FRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARST------HGW 465
F DYYER L N +L Q +E G IY L G K + + +
Sbjct: 373 HFHAPDRTDLLDYYERTLLNQMLGEQDPDSEHGFAIYYTGLAPGSFKRQPSFMGPDPDVY 432
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
T +++F C +GTG+E+ +K D++Y + + L + ++ S W++ + Q
Sbjct: 433 STDYDNFSCDHGTGMETPAKFADTVYSHDGRS---LRVNLFVPSEVVWRAKGISWRQ--- 486
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLS 584
+ TLT SS + +L +R+P W + GA+A+LNG+ LP P PG++L+
Sbjct: 487 -TTRFPDRSSTTLTVSSGRAAHRLL---IRVPSW--AAGARATLNGRALPDRPQPGSWLA 540
Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
W D++ + LP+ EA DD +QA++ GP +LAG
Sbjct: 541 LERVWRTGDRVEVSLPMRTAVEATPDD----PDVQAVVHGPVVLAG 582
>gi|440694505|ref|ZP_20877120.1| hypothetical protein STRTUCAR8_01091 [Streptomyces turgidiscabies
Car8]
gi|440283503|gb|ELP70762.1| hypothetical protein STRTUCAR8_01091 [Streptomyces turgidiscabies
Car8]
Length = 747
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 191/553 (34%), Positives = 280/553 (50%), Gaps = 48/553 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENP 157
L +V+L D V R + LE+ D ++ FR A L T G + GGWE
Sbjct: 90 LDQVALGD------GVFRRKRDLMLEFARSYPADRILAVFRANAGLDTRGAQPPGGWETA 143
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT---------GYL 208
LRGHF GH+L+ AQ +A T A +K K+ +V +L ECQ + G+L
Sbjct: 144 DGNLRGHFGGHFLTLVAQAYADTREAALKTKLDYLVTALGECQQALADHGSPRPSHPGFL 203
Query: 209 SAFPTE---LFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
+A+P L +S+ +WAPYYT HKI+ G LD + L N QAL +A+ M ++ ++R
Sbjct: 204 AAYPETQFILLESYTTYPTIWAPYYTCHKIMRGFLDAHTLTGNQQALTIASKMGDWVHSR 263
Query: 266 VQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
+ + + ++R W + E GGMN+VL LY++T +HL A FD L A
Sbjct: 264 LSR-LPQAQLDRMWSIYIAGEYGGMNEVLADLYALTGRAEHLAAARCFDNTALLDACADN 322
Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
D L HAN HIP G ++ TG+ Y F +V +Y+ GGT E +
Sbjct: 323 RDILDGRHANQHIPQFTGYIRLFDHTGEAEYATAARNFWGMVAGPRTYSLGGTGQGEMFR 382
Query: 385 DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
+A TLG N ETC TYNMLK+SR LF T + AY DYYE+ LTN +L+ +R
Sbjct: 383 ARNAIAATLGDNNAETCATYNMLKLSRQLFFHTPDPAYMDYYEKGLTNHILASRRDARST 442
Query: 445 V---MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGNVPG 500
V + Y + +G GV + N+ CC GTG+E+ +K DS+YF +GN
Sbjct: 443 VSPEVTYFVGMGPGVVREYD--------NTGTCCGGTGMENHTKYQDSVYFRSADGNA-- 492
Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
LY+ Y++S+ W +V++Q D + TLTF +E G L LR+P W
Sbjct: 493 LYVNLYLASTLRWPERGLVIDQTSD----FPGEGVRTLTF---REGGGSLDLKLRVPSWA 545
Query: 561 YSNGAQASLNG-QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
+ G ++NG PG++L+ + W D++T+ P LR E DD ++Q
Sbjct: 546 -TGGFTVTVNGVPQQTAAVPGSYLTLSRNWQRGDRITVSAPYRLRIERALDD----PTVQ 600
Query: 620 AILFGPYLLAGHT 632
++ +GP LL +
Sbjct: 601 SLFYGPVLLVARS 613
>gi|220928663|ref|YP_002505572.1| hypothetical protein Ccel_1236 [Clostridium cellulolyticum H10]
gi|110588920|gb|ABG76968.1| CBM22- and dockerin-containing enzyme [Clostridium cellulolyticum
H10]
gi|219998991|gb|ACL75592.1| protein of unknown function DUF1680 [Clostridium cellulolyticum
H10]
Length = 955
Score = 293 bits (749), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 186/542 (34%), Positives = 271/542 (50%), Gaps = 35/542 (6%)
Query: 98 FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
LK+ + V + + + A + YL +D + L+ F+KTA L T YGGWEN
Sbjct: 34 LLKQFDMEQVKITDTYYV-NALNKEVAYLQAIDPNRLLVGFKKTAGLSTTYSYYGGWENN 92
Query: 158 ISELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVVFSLSECQNKIGTGYLSAFP 212
+ ++GH +GHY+SA AQ + +T NA +K ++ ++ L CQNK G GYL A P
Sbjct: 93 -TLIQGHTMGHYMSALAQAYKNTKSDPTVNADLKSRIDLIISELQACQNKNGNGYLFATP 151
Query: 213 TELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
FD E A W P+YT+HKI++GLLD Y N AL +AT + + Y RV
Sbjct: 152 ATQFDVVEGKASGSSWVPWYTMHKIMSGLLDIYKFGGNQTALTIATNLGNWIYKRVN--- 208
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
+ L E GGMND LY LY +T + HL AH FD+ +A + L
Sbjct: 209 -AWDSATQSRVLGVEYGGMNDCLYELYKLTGNGNHLTAAHKFDENSLFNTIAAGTNVLPG 267
Query: 331 FHANTHIPIVIGSQMRYEVTG--DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
HANT IP IG+ RY G + Y F IV H+Y TGG S E + D +
Sbjct: 268 KHANTTIPKFIGALNRYSTLGTSESSYLKAAQQFWAIVLKDHTYVTGGNSEDERFRDAGK 327
Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
L + N ETC NMLK+++ LF+ T ++ YADYYE AL N +++ Q E G+ Y
Sbjct: 328 LDAYRDNVNNETCNVNNMLKLTKELFKATGDVKYADYYENALINEIMASQN-PETGMATY 386
Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
+G G K S ++FN FWCC GTG+E+F+KL DS+Y+ N LY+ Y+S
Sbjct: 387 FKAMGTGYFKVFS-----SQFNHFWCCTGTGMENFTKLNDSLYYN---NGSDLYVNMYLS 438
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS-NGAQA 567
S+ +W + L Q+ + +S + T+ +S EV + R P W +
Sbjct: 439 STLNWSEKGLSLTQQANLPLS--DKVTFTINSASSSEV----KIKFRSPAWIAAGQNITV 492
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
+NG + + +L + W D + + LP +R + D + A +GP +
Sbjct: 493 KVNGTPINVDKANGYLDVSRVWQTGDTVELTLPTEVRVSRLTDS----PNTVAFTYGPVV 548
Query: 628 LA 629
L+
Sbjct: 549 LS 550
>gi|345011855|ref|YP_004814209.1| hypothetical protein [Streptomyces violaceusniger Tu 4113]
gi|344038204|gb|AEM83929.1| protein of unknown function DUF1680 [Streptomyces violaceusniger Tu
4113]
Length = 849
Score = 292 bits (748), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 188/531 (35%), Positives = 274/531 (51%), Gaps = 34/531 (6%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
Q N YL +D+D L+ +FR L + + GGWE+P +ELRGH GH LS A +A
Sbjct: 72 QSRNTAYLRFVDIDRLLHTFRLNVGLSSAAQPCGGWESPTTELRGHSTGHLLSGLALTYA 131
Query: 179 STHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYTIH 233
+T + ++K +V +L+ CQ + G GYLSAFP FD EA VWAPYYTIH
Sbjct: 132 ATGDTAPRDKGRALVSALAACQARSPAAGYGQGYLSAFPESFFDRLEAGTGVWAPYYTIH 191
Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
KI+AGL+DQY LA NA+AL+ + R K+ S ++ L E GGMNDVL
Sbjct: 192 KIMAGLVDQYRLAGNAEALQTVLRQAAWVDTRTGKL----SYDQMQRVLQTEFGGMNDVL 247
Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
L+ IT D + L +A F LA D L+ HANT IP ++G+ +E D
Sbjct: 248 ADLHEITGDSRWLKVAERFTHARVFDPLARNEDRLAGLHANTQIPKMVGAMRLWEEGLDS 307
Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
Y+ IG F IV H+Y GG S E + +P +A L E C +YNMLK++R +
Sbjct: 308 RYRTIGENFWKIVTDHHTYVIGGNSNGEAFHEPDAIAAQLSDNACENCNSYNMLKLTRLI 367
Query: 414 -FRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARST------HGW 465
F + DYYER L N +L Q + G IY L G K + + + +
Sbjct: 368 HFHAPERTDLLDYYERTLLNQMLGEQDPDSAHGFNIYYTGLAPGSFKQQPSFMGTDPNQY 427
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
T +++F C +G+G+E+ +K D+IY + + L + +I S W+ + Q
Sbjct: 428 STDYDNFSCDHGSGMETQAKFADTIYTYADRS---LLVNLFIPSELRWQDKGITWRQ--- 481
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLS 584
+ TLT +S +SL LR+ + +++ GA+A+LNG L P PG++L
Sbjct: 482 -TTGFPDQQTTTLTVASGG-----ASLELRVRIPSWAAGARATLNGTTLADRPEPGSWLI 535
Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+W D++ + LP+ L + DD +QA+L+GP +LAG G
Sbjct: 536 IDRQWRTGDRVEVTLPMKLTFDPTPDD----PDVQAVLYGPVVLAGAYGGR 582
>gi|300785310|ref|YP_003765601.1| hypothetical protein AMED_3413 [Amycolatopsis mediterranei U32]
gi|384148599|ref|YP_005531415.1| hypothetical protein RAM_17360 [Amycolatopsis mediterranei S699]
gi|399537193|ref|YP_006549855.1| hypothetical protein AMES_3374 [Amycolatopsis mediterranei S699]
gi|299794824|gb|ADJ45199.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340526753|gb|AEK41958.1| hypothetical protein RAM_17360 [Amycolatopsis mediterranei S699]
gi|398317963|gb|AFO76910.1| hypothetical protein AMES_3374 [Amycolatopsis mediterranei S699]
Length = 740
Score = 292 bits (747), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 196/533 (36%), Positives = 265/533 (49%), Gaps = 37/533 (6%)
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
L Y +D D L+ +FR A L + + GGWE+P +ELRGH GH LS AQ +A+T +
Sbjct: 68 LAYFRFVDADRLLHTFRLNAGLASSAQPCGGWESPGTELRGHSTGHLLSGLAQAYANTGD 127
Query: 183 ATIKEKMSTVVFSLSECQ-----NKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
K K +V +L+ CQ GYLSAFP FD E+ + VWAPYYT+HKI+A
Sbjct: 128 TAHKTKGDYLVNALAACQAAAPGRGFHAGYLSAFPENFFDRLESGQSVWAPYYTLHKIMA 187
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GLLDQY+LA N QAL + + R + SV + +L E GGM +VL LY
Sbjct: 188 GLLDQYLLAGNQQALDVLLRKAAWTKTRTDPL----SVTQMQAALRTEFGGMPEVLTNLY 243
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+T D HL A FD L LA D LS FHANT IP ++G+ Y TG Y+
Sbjct: 244 QVTGDANHLATAQRFDHAQILDPLAANQDRLSGFHANTQIPKILGAIREYHATGTTRYRD 303
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
I F IV H+Y GG S E++ P +A L E C TYNMLK++R LF
Sbjct: 304 IAVNFWRIVLDHHTYVIGGNSDGEYFQAPDAIASQLSDTTCEVCNTYNMLKLTRQLFFTN 363
Query: 418 KEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCY 476
Y DYYE AL N +L Q + G + Y PL G K + ++ F C +
Sbjct: 364 PAPEYMDYYELALFNQILGEQDPDSSHGFVTYYTPLRAGGIKT-----YANDYDDFTCDH 418
Query: 477 GTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM 536
GTG+ES +K DS+YF LY+ +I+S W + + Q ++
Sbjct: 419 GTGMESQTKFADSVYFFTGET---LYVNLFIASVLTWPGRGITVRQD----TTFPASSGT 471
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLT 596
LT + +L LR+P WT +GA +NG P PG+F + W+ D +
Sbjct: 472 KLTIGGSGHI----ALKLRIPKWT--SGAVVKVNGVAQGSPSPGSFCTIDRTWAAGDVVD 525
Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG-----HTSGEWDIKTGTAR 644
+ +P SL DD AS+ A +G +LAG + S ++TGT R
Sbjct: 526 VSVPASLTFPRANDD----ASVGAAKYGAIVLAGQYGSTNLSALPTLQTGTVR 574
>gi|427385120|ref|ZP_18881625.1| hypothetical protein HMPREF9447_02658 [Bacteroides oleiciplenus YIT
12058]
gi|425727288|gb|EKU90148.1| hypothetical protein HMPREF9447_02658 [Bacteroides oleiciplenus YIT
12058]
Length = 778
Score = 291 bits (746), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 199/599 (33%), Positives = 300/599 (50%), Gaps = 58/599 (9%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A N E+LL L D L+ FR A L G+ YGGWE+ + GH +GHYLSA A M+
Sbjct: 55 AMDKNGEWLLDLSPDRLLHRFRLNAGLTPKGEIYGGWES--RGVSGHTLGHYLSACAMMY 112
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-------------LKP 224
A++ + KE++ +V L+ECQ+ TGY+ P E D A L
Sbjct: 113 AASGDKRFKERVDYIVKELAECQDARKTGYVGGIPDE--DKIWAEVSSGDIRSQGFDLNG 170
Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWY 280
W P+YT+HK+ AGL+D Y A + QA K++ W V F + S E
Sbjct: 171 GWVPWYTLHKLWAGLIDAYRYAGSEQAKEVGTKLSDWAVRSFGD--------LSEEDFQK 222
Query: 281 SLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV 340
L E GGMN+ +Y+IT + +L LA F L L Q D L H+NT +P +
Sbjct: 223 MLACEFGGMNESFADMYAITGNESYLKLARQFYHKAILDPLKEQRDELEGKHSNTQVPKI 282
Query: 341 IGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEET 400
IG YE+TGD I TF+ D + H+Y GG S E P L D L ET
Sbjct: 283 IGEARLYELTGDKDMHTIATFYWDRIVNHHTYVNGGNSNYEHLGKPDCLNDRLSPFTSET 342
Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
C TYNMLK+++HLF W + AY DYYE+AL N +L+ Q + G++ Y +PL G K
Sbjct: 343 CNTYNMLKLTKHLFSWDPQAAYMDYYEQALYNHILASQN-PDDGMVCYSVPLESGTKKEF 401
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
S T+F+SFWCC +GIE+ K +S++F+ + GL++ +I +S +WK + +
Sbjct: 402 S-----TRFDSFWCCVASGIENHVKYAESVFFQSVKD-GGLFVNLFIPTSLNWKEKGMEV 455
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PP 579
K++ + D ++++ SK+ L++R P W + G + +LNG+ + P
Sbjct: 456 --KLETQLPADNKVQISFKGKSKE-----FPLHIRYPRWA-TQGIKVTLNGKEEKVTGTP 507
Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH-TSGE--- 635
G++ + W + +L I++P+ L T ++ P+ A I +GP LLA +GE
Sbjct: 508 GSYFTLQGEWDTDTQLVIEIPMELYTVSM----PDNADRMGIFYGPVLLAAPLGTGELQA 563
Query: 636 WDIKT--GTARSLSALISPIPP---SFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPV 689
+DI S+ I+P+P +F A Q + + ++ + FPV
Sbjct: 564 YDIPCFISDTESIVQSIAPVPDKPLTFTANTTANAQLLLVPFYTIHGQKHAVYFDRFPV 622
>gi|345851934|ref|ZP_08804893.1| secreted protein [Streptomyces zinciresistens K42]
gi|345636594|gb|EGX58142.1| secreted protein [Streptomyces zinciresistens K42]
Length = 867
Score = 291 bits (745), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 205/570 (35%), Positives = 280/570 (49%), Gaps = 54/570 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L L +V L +S L ++T+ YLL +D D L+ +FR TA LP+ + GGWE P
Sbjct: 63 LDAFGLSEVRLLESPFLANMRRTS-AYLLFVDADRLLHTFRLTAGLPSSAQPCGGWEAPD 121
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPT 213
+LRGH GH LSA AQ A T EK +V +L+ECQ GYLSAFP
Sbjct: 122 VQLRGHTTGHLLSALAQAHAHTGERAYAEKGRALVAALAECQRAAPAAGFTRGYLSAFPE 181
Query: 214 ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWM----VEYFYNR 265
+F EA WAPYYT+HKI+AGLLDQY+LA + QAL +MA W Y +
Sbjct: 182 SVFARLEAGGKPWAPYYTLHKIMAGLLDQYLLAGDRQALDVLREMAAWAEARTAPLPYPQ 241
Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
+Q V+ + E GGMNDVL RLY T DP HL A FD LA
Sbjct: 242 MQNVLRV------------EFGGMNDVLMRLYLETGDPAHLRTARRFDHEDLYAPLAAGR 289
Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
D L+ HANT I ++G+ YE TGD Y I F V HSYA GG S +E +
Sbjct: 290 DELAGRHANTEIAKIVGTVPSYEATGDTRYLDIADTFWTTVVRHHSYAIGGNSNQELFGP 349
Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQR-GTEP 443
P + L E C +YNMLK+ R LF + A Y D+YE L N +L Q +
Sbjct: 350 PDEIVSRLSDVTCENCNSYNMLKLGRGLFLHRPDRAGYMDHYEWTLYNQMLGEQDPASAH 409
Query: 444 GVMIYMLPLGRGVSKARSTHGWG-------TKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
G + Y L G S+ G G + +++F C +GTG+E+ +K DS+YF G
Sbjct: 410 GFVTYYTGLWAG-SRREPKAGLGSAPGSYSSDYDNFSCDHGTGLETHTKFADSVYFRSRG 468
Query: 497 ---NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
VP LY+ +I S W+ V + QK S+ R LT + + +L
Sbjct: 469 TRDGVPSLYVNLFIPSEVRWRQTGVTVRQK----TSYPSEGRTRLTVVAGR---ARFALR 521
Query: 554 LRMPVWTYSNGAQASL--NGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
+R+P W G +A L NG+ + PG + + W D + + LP +
Sbjct: 522 IRIPSWVAGTGREAVLEVNGRGVAARLRPGTYATVERTWHTGDTVDLTLP----RRPVWT 577
Query: 611 DRPEYASIQAILFGPYLLAGHTSGEWDIKT 640
P+ ++++ +GP +LAG G+ D+ T
Sbjct: 578 AAPDNPQVRSVSYGPLVLAGEY-GDDDLAT 606
>gi|251798261|ref|YP_003012992.1| hypothetical protein Pjdr2_4282 [Paenibacillus sp. JDR-2]
gi|247545887|gb|ACT02906.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 758
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 182/515 (35%), Positives = 267/515 (51%), Gaps = 30/515 (5%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A Q L+YL DVD L+ FR+T+ L Y GWEN +E+RGH +GHYL+A +Q +
Sbjct: 28 AFQKELDYLRSYDVDRLLAGFRETSGLQPKADKYPGWEN--TEIRGHTLGHYLTAVSQAY 85
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A T ++ + EK+ +V L+E Q + GYLSAFP LFD+ E KP W P+YT+HKI+A
Sbjct: 86 AQTQDSGLLEKLKYLVAELAEAQQE--NGYLSAFPETLFDNVENRKPAWVPWYTMHKIIA 143
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+ Y QA ++ + + ++ +R +S E L E GGMND +Y LY
Sbjct: 144 GLIAVYQATKLQQAYEVVSRLGDWVADRA----CSWSEELQATVLAVEYGGMNDCMYDLY 199
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL--Y 355
+T + HL AH FD+ L D L HANT IP IG+ RY G+ Y
Sbjct: 200 KLTGNNLHLEAAHKFDEISLFEALREGKDVLKGKHANTMIPKFIGALNRYLTLGESERGY 259
Query: 356 KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
F D V HSY TGG S E + +P L ETC +YNMLK+++ LF+
Sbjct: 260 LEAAVNFWDTVVYHHSYLTGGNSECEHFGEPDILDGKRSDVTCETCNSYNMLKLTKELFK 319
Query: 416 WTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCC 475
T+ YAD+YER N +LS Q E G+ +Y P+ G K S + F FWCC
Sbjct: 320 LTQNSKYADFYERTYINAILSSQ-NPETGMTMYFQPMATGYFKIYS-----SPFEHFWCC 373
Query: 476 YGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR 535
GTG+ESF+KL DSIYF + N LY+ Q+ SS DW V+ Q P+
Sbjct: 374 TGTGMESFTKLNDSIYFHLDHN---LYVNQFYSSRLDWTEQQTVVTQTTSL-----PHSD 425
Query: 536 MTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKL 595
+ + F+ + + ++++R+P W + LNG+ +P ++ W D +
Sbjct: 426 L-VHFTVGTDSPKRLAIHIRVPSWA-AGEVDILLNGETVPASVQQQYVVLDRIWKDGDTI 483
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
++P+ + ++ P+ + + +GP +L+
Sbjct: 484 EARIPMKVSFSSL----PDAPHVIGLQYGPIVLSA 514
>gi|376260753|ref|YP_005147473.1| hypothetical protein [Clostridium sp. BNL1100]
gi|373944747|gb|AEY65668.1| hypothetical protein Clo1100_1435 [Clostridium sp. BNL1100]
Length = 743
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 185/518 (35%), Positives = 268/518 (51%), Gaps = 31/518 (5%)
Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
L A + +EYL D D L+ F KT L K Y GWE+ +E+RGH +GHYL+A A
Sbjct: 11 LVNAFKKEIEYLESFDCDKLLSCFYKTKGLAPKAKNYHGWED--TEIRGHTMGHYLTALA 68
Query: 175 QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHK 234
Q +++T+++ I E++ ++ LS CQ +GYLSAFP E FD E KPVW P+YT+HK
Sbjct: 69 QAYSATNDSKIYERLQYLLKELSLCQ--FESGYLSAFPEEFFDRVENRKPVWVPWYTMHK 126
Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLY 294
I+ GL+ Y L AL + + + ++ ++R K ++ E H L E GGMND LY
Sbjct: 127 IITGLISVYKLTKIETALNIVSGLGDWVFSRTDK----WTPEIHANVLAVEYGGMNDCLY 182
Query: 295 RLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP- 353
LY IT + KH AH+FD+ + D L++ HANT IP +G+ R+ G+
Sbjct: 183 ELYKITGNEKHSAAAHMFDEIELFKEIHDGKDILNNRHANTTIPKFLGALNRFLAIGEEE 242
Query: 354 -LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
Y F IV +HSY TGG S E + +P L S N ETC TYNMLK++R
Sbjct: 243 QFYLDTCKEFWSIVTNNHSYVTGGNSEWEHFGEPNILDAERTSTNCETCNTYNMLKMTRV 302
Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
LF+ T + YAD+YE N +LS Q + G+ +Y P+ G K + F F
Sbjct: 303 LFKITGDKKYADFYENTFINAILSSQ-NPDTGMTMYFQPMATGYFKV-----YSKPFEHF 356
Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
WCC GTG+E+F+KL +SIYF EE LY+ Y S+ +W+ V + Q D I D
Sbjct: 357 WCCTGTGMENFTKLNNSIYFHEEDR---LYVNMYYSTLLNWEEKCVRITQNSD-IPGTD- 411
Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYN 592
+F + E +L LR+P W + ++N + W N
Sbjct: 412 ----RASFIIEAETETEFTLCLRIPTW--AKDVNINVNKNPSLFTEERGYALINRTWKDN 465
Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
D T+++ + E + P+ + A +GP +L+
Sbjct: 466 D--TVEINFKIEPELVS--LPDNPNAVAFTYGPVVLSA 499
>gi|374984433|ref|YP_004959928.1| secreted protein [Streptomyces bingchenggensis BCW-1]
gi|297155085|gb|ADI04797.1| secreted protein [Streptomyces bingchenggensis BCW-1]
Length = 875
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 185/526 (35%), Positives = 274/526 (52%), Gaps = 34/526 (6%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
Q N YL +D+D L+ +FR L + + GGWE+P +ELRGH GH LS A +A
Sbjct: 99 QSRNTAYLRYVDIDRLLHTFRLNVGLASSAQPCGGWESPTTELRGHSTGHLLSGLALSYA 158
Query: 179 STHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYTIH 233
+T + + +K +V +L+ CQ K G GYLSAFP FD E+ VWAPYYTIH
Sbjct: 159 NTGDTALLDKGRKLVSALAACQAKSPAAGYGQGYLSAFPENFFDRLESGSGVWAPYYTIH 218
Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
KI+AGL+DQ+ LA NA+AL + + R K+ ++ L E GGMN+VL
Sbjct: 219 KIMAGLVDQHRLAGNAEALDVVERQAAWVDTRTGKL----GYDQMQRVLQTEFGGMNEVL 274
Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
L++IT D + L +A F LA D L+ HANT IP ++G+ +E +
Sbjct: 275 ADLHAITGDTRWLRVAERFTHARVFDPLARNEDQLAGLHANTQIPKMVGALRLWEQGLNS 334
Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
Y+ IG F IV H+Y GG S E + +P +A L + E C +YNMLK++R +
Sbjct: 335 RYRTIGENFWKIVTDHHTYVIGGNSNGEAFHEPDAIAAQLSNNCCENCNSYNMLKLTRLI 394
Query: 414 -FRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARST------HGW 465
F DYYER L N +L Q + G IY L G K + + + +
Sbjct: 395 HFHAPDRTDLLDYYERTLFNQMLGEQDPDSAHGFNIYYTGLAPGAFKQQPSFMGTDPNQY 454
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
T +N+F C +G+G+E+ +K D+IY + + L + +I S W+ + Q
Sbjct: 455 STDYNNFSCDHGSGMETQAKFADTIYTYADRS---LLVNLFIPSELRWQEKAITWRQN-- 509
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLS 584
+ TLT +S +SL LR+ + ++ GA+A+LNG LP P PG++L
Sbjct: 510 --TGFPDQQTTTLTVASGA-----ASLELRVRIPAWATGARAALNGTTLPDQPKPGSWLV 562
Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
W D++ + LP++L+ + DD +QA+L+GP +LAG
Sbjct: 563 IDRSWKAGDRVDVTLPMALKLDPTPDD----PDVQAVLYGPVVLAG 604
>gi|383779461|ref|YP_005464027.1| hypothetical protein AMIS_42910 [Actinoplanes missouriensis 431]
gi|381372693|dbj|BAL89511.1| hypothetical protein AMIS_42910 [Actinoplanes missouriensis 431]
Length = 777
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 199/554 (35%), Positives = 283/554 (51%), Gaps = 43/554 (7%)
Query: 96 GNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGW 154
GN E V L S +L Q + YL +DV+ +++ FR L T G A GGW
Sbjct: 48 GNAASEFMPGQVRLTASRLL-DNQNRTMNYLRFVDVNRMLYVFRANHRLSTAGAAANGGW 106
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQ--NKIG---TGYLS 209
+ P R H GH+L+A AQ +A T + T ++K +V L++CQ N + GYLS
Sbjct: 107 DAPNFPFRSHMQGHFLTAWAQAYAYTGDTTCRDKADYMVAELAKCQANNAVAGFNAGYLS 166
Query: 210 AFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNR 265
FP D+ E+ KP+ YY IHK LAGLLD + L N QA LK+A W V++ R
Sbjct: 167 GFPESDLDAVESGKPIAVSYYCIHKTLAGLLDVWRLIGNTQAKDVLLKLAGW-VDWRTGR 225
Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
+ S + +L E GGMN+VL LY T D + L +A FD LA
Sbjct: 226 L-------SYSQMQTTLQTEFGGMNEVLANLYQQTGDARWLRVAQRFDHAAIFDPLAANR 278
Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
D L+ HANT+IP +G+ ++ TG Y+ I +I +H+YA GG S E +
Sbjct: 279 DELNGKHANTNIPKWVGAIREFKATGTTRYRDIAGNAWNITVGAHTYAIGGNSQAEHFKA 338
Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEP- 443
P +A L ++ E C TYNMLK++R L++ A Y D+YE AL N ++ Q +
Sbjct: 339 PNAIAGYLTNDTCEQCNTYNMLKLTRELWQLDPNRAGYFDFYENALYNHLIGAQNPADSH 398
Query: 444 GVMIYMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
G + Y PL RGV A W T +NSFWCC GTGIE+ +KL DSIYF
Sbjct: 399 GHITYFTPLKAGGRRGVGPAWGGGTWSTDYNSFWCCQGTGIETNTKLMDSIYFRGG---T 455
Query: 500 GLYIIQYISSSFDW-KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
L + Y+ S+ +W + G V P+ T TF+ V + R+P
Sbjct: 456 TLTVNLYVPSTLNWSERGLTVTQTTAYPVGD-------TSTFTLSGSVSGSWGIRFRIPA 508
Query: 559 WTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W + GA ++NG N + PG++ + T W+ D +T++LP+ + +A D+ A
Sbjct: 509 W--AAGATIAVNGANQNITVTPGSYATVTRTWADGDTITVRLPMRVIIKAANDN----AD 562
Query: 618 IQAILFGPYLLAGH 631
IQAI +GP +LAG+
Sbjct: 563 IQAITYGPSVLAGN 576
>gi|256394133|ref|YP_003115697.1| hypothetical protein Caci_4996 [Catenulispora acidiphila DSM 44928]
gi|256360359|gb|ACU73856.1| protein of unknown function DUF1680 [Catenulispora acidiphila DSM
44928]
Length = 846
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 199/519 (38%), Positives = 260/519 (50%), Gaps = 38/519 (7%)
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
L YL +D D L++ FR T + T GGWE+P ELRGH GH +SA AQ +AST +
Sbjct: 84 LAYLRFVDPDRLLYMFRTTVGIATSASPCGGWEDPTEELRGHSTGHIMSALAQAYASTGD 143
Query: 183 ATIKEKMSTVVFSLSECQNKIG-----TGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
+T+K K V SL+ CQ TGYLSAFP FD E+ + VWAPYYTIHKI+A
Sbjct: 144 STLKSKGDYFVSSLAACQAASPAAGFHTGYLSAFPESFFDRLESGQSVWAPYYTIHKIMA 203
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GLLDQY++A N QAL + M + R + S + L E GGM +VL LY
Sbjct: 204 GLLDQYLVAGNTQALTVLKGMAAWVKTRTDPL----SHSQMQAVLQTEFGGMPEVLAHLY 259
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
+T D L A FD LA D L+ FHANT +P +IG+ Y TG Y
Sbjct: 260 QVTGDANTLTAAQRFDHAQIEDPLAAGTDQLAGFHANTQVPKIIGALREYLATGTARYLT 319
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL-FRW 416
I F I H Y GG S E++ P +A L + E C TYN LK+SR L F
Sbjct: 320 IAQNFWAITTGHHMYEIGGFSNGEYFQTPNAIASQLSNTTCEVCVTYNELKLSRGLFFTD 379
Query: 417 TKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCC 475
AY DYYER L N VL Q + G + Y PL G K S +N F C
Sbjct: 380 PTRAAYLDYYERGLFNTVLGQQDPASSHGFVCYYTPLQPGGYKTYS-----NDYNDFTCD 434
Query: 476 YGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYL 534
+GTG+ES +K DSIYF N LY+ +I+S W + + Q P S
Sbjct: 435 HGTGMESNTKYADSIYFY---NGETLYVNLFIASQLAWPGRAITVRQDTTFPAASSS--- 488
Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG--QNLPLPPPGNFLSATERWSYN 592
R+T+T G + +L +R+P W +G +NG QNL PG +L+ W+
Sbjct: 489 RLTIT-----GAGHI-ALKIRVPSW--CSGMTVKVNGTLQNL-TATPGTYLTIDRTWASG 539
Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
D + + LP L DD +++Q + +G +LAG
Sbjct: 540 DVVDLALPAKLTFVPAPDD----STVQVVKYGGIVLAGQ 574
>gi|334364979|ref|ZP_08513951.1| conserved hypothetical protein [Alistipes sp. HGB5]
gi|313158812|gb|EFR58195.1| conserved hypothetical protein [Alistipes sp. HGB5]
Length = 778
Score = 290 bits (741), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 191/540 (35%), Positives = 285/540 (52%), Gaps = 38/540 (7%)
Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
V L+DV + L AQ+ + +L +D D + FR A L YGGWE+ +
Sbjct: 45 VPLNDVRITGGPFL-HAQEMDRRWLDSMDPDRYLSGFRSEAGLEPKAPRYGGWES--AGC 101
Query: 162 RGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE--LFDSF 219
GH GH+LSA+A M+A+T + + +K++ + L+ECQ K GTG L+ F LF
Sbjct: 102 SGHGFGHFLSAAAMMYAATGDRALLDKINYSIDGLAECQQKEGTGLLAGFERSRALFAEL 161
Query: 220 EA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
E L W P+YT+HK+ AGL+D NA+AL + ++ V K+
Sbjct: 162 ERGDIRSQGFDLNGGWVPFYTLHKMYAGLVDVCRYTPNAKALTVLVRFADWLDGLVAKL- 220
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
S E+ L E GG+ + L +Y +T + K+L LA FD L LA D L
Sbjct: 221 ---SDEQMDKILICEHGGITESLADIYVLTGERKYLELARRFDHREILRPLAAGVDSLPG 277
Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
HANT IP ++G+ YE +GD Y+ I +F V HSYA GG S E + P LA
Sbjct: 278 KHANTQIPKIVGAVREYECSGDERYRRIADYFWHRVVGFHSYAIGGNSEYEHFGAPGMLA 337
Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
+ L ETC TYNMLK+++HL++ + ADYYERAL N +L+ Q + G++ YM
Sbjct: 338 NRLSDGTCETCNTYNMLKLTKHLYQLDPTVRRADYYERALYNQILASQ-NPDDGMVCYMS 396
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
P+G G K G+ F+SFWCC G+G+E+ ++ G+ IYF + LY+ YI S+
Sbjct: 397 PMGSGHRK-----GFCLPFDSFWCCVGSGMENHARYGEFIYFTDARE--NLYVNLYIPST 449
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
DWKS V + Q D S + LR+ ++ + Q LNLR P W + G + ++N
Sbjct: 450 LDWKSRGVKVEQLTDFPCSDEVRLRVEMSGA------QRFVLNLRYPEWA-AEGYELTVN 502
Query: 571 GQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
G+ + PG+++S +W D++ L SL +E I D ++++A +GP +L+
Sbjct: 503 GRPVKQKAKPGSYISVNRKWRSGDEVRFVLRQSLHSEPIPGD----STLRAYFYGPVVLS 558
>gi|456393067|gb|EMF58410.1| putative glycosylase [Streptomyces bottropensis ATCC 25435]
Length = 714
Score = 290 bits (741), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 188/530 (35%), Positives = 270/530 (50%), Gaps = 44/530 (8%)
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMWASTH 181
L Y D ++ FR A L T G + GGWE LRGH+ GH+L+ AQ +A T
Sbjct: 75 LNYARSYPADRILAVFRANAGLDTRGARPPGGWETSDGNLRGHYGGHFLTLVAQAYADTR 134
Query: 182 NATIKEKMSTVVFSLSECQNKIGT---------GYLSAFPTE---LFDSFEALKPVWAPY 229
A +K K+ +V +L ECQ + G+L+A+P L +S+ +WAPY
Sbjct: 135 EAALKSKLDQLVGALGECQAALAERGSPRPSHPGFLAAYPETQFILLESYATYPTIWAPY 194
Query: 230 YTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETGG 288
YT HKI+ GLLD + LA NAQAL + + M ++ ++R+ + +ER W + E GG
Sbjct: 195 YTCHKIMRGLLDAHTLAGNAQALTIVSRMGDWVHSRL-GALPRAQLERMWSLYIAGEYGG 253
Query: 289 MNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYE 348
MN+VL LY++T +HL A FD L A D L HAN HIP G ++
Sbjct: 254 MNEVLADLYALTGKAEHLAAARCFDNTALLDACAQDRDILDGRHANQHIPQFTGYLRLFD 313
Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLK 408
TG+ Y F +V +Y+ GGT E + +A TL +N ETC TYNMLK
Sbjct: 314 ETGEERYAEAARNFWGMVAGPRTYSLGGTGQGEMFKARGAIAATLDDKNAETCATYNMLK 373
Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGT----EPGVMIYMLPLGRGVSKARSTHG 464
+SRHLF + A DYYER LTN +L+ +R T P V Y + +G GV +
Sbjct: 374 LSRHLFFREPDAARMDYYERGLTNHILASRRDTASTSSPEV-TYFVGMGPGVVREYG--- 429
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
N+ CC GTG+E+ +K DS+YF +GN LY+ Y++S+ W +V+ Q
Sbjct: 430 -----NTGTCCGGTGMENHTKYQDSVYFRSADGNA--LYVNLYLASTLRWPERGLVVEQ- 481
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNF 582
++ TLTF +EV L LR+P W + G ++NG + PG++
Sbjct: 482 ---TSAYPAEGVRTLTF---REVRGTLDLRLRVPSWA-TGGFTVTVNGVRQQVEATPGSY 534
Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
L+ + W D++ I P LR E DD ++Q++ FGP LL +
Sbjct: 535 LTLSRNWRRGDRVGISAPYRLRVERALDD----PTVQSVFFGPLLLVAQS 580
>gi|374991816|ref|YP_004967311.1| secreted protein [Streptomyces bingchenggensis BCW-1]
gi|297162468|gb|ADI12180.1| secreted protein [Streptomyces bingchenggensis BCW-1]
Length = 858
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 199/552 (36%), Positives = 277/552 (50%), Gaps = 41/552 (7%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L L V L +S L ++T L YL +D + L+ +FR LP+ + GGWE P
Sbjct: 55 LAPFPLSAVRLLESPFLANMRRT-LAYLRFVDPERLLHTFRLNVQLPSTAQPCGGWEAPN 113
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPT 213
LRGH GH LSA A A T T +K +V +L+ECQ TGYLSAFP
Sbjct: 114 VLLRGHSTGHLLSALAFAHAHTGEQTYADKARGIVAALAECQAASPGAGYRTGYLSAFPE 173
Query: 214 ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV--IT 271
+FD EA WAPYYTIHKI+AGLLDQ+ L+ N QAL++ M + +R + T
Sbjct: 174 RIFDELEAGGKPWAPYYTIHKIMAGLLDQHRLSGNDQALEVLRGMAAWVDSRTAPLDEAT 233
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
M + L E GGMN+VL LY +T DP HL A FD G L D L
Sbjct: 234 MQRL------LGVEFGGMNEVLAGLYLVTGDPVHLRTARRFDHQSLYGPLDEGRDELDGR 287
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
HANT I ++G+ Y TGDP Y I F DIV HSY GG S +EF+ P ++
Sbjct: 288 HANTEIAKIVGAAEEYRATGDPRYLRIARNFWDIVVRDHSYVIGGNSNQEFFGPPGQIVS 347
Query: 392 TLGSENEETCTTYNMLKVSRHLF-RWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYM 449
L + E C +YNMLK+ R LF AY D+YE L N +L Q ++ G + Y
Sbjct: 348 RLSEDTCENCNSYNMLKIGRQLFLHEPGRAAYMDHYEWTLYNQMLGEQDPDSDHGFVTYY 407
Query: 450 LPLGRGVSKARSTHGWGT-------KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
L G S+ + G G+ +++F C +GTG+E+ +K D+IYF +E + LY
Sbjct: 408 TGLWAG-SRRQPKGGLGSAPGSYSGDYDNFSCDHGTGMETHTKFADTIYFRDE-HAGALY 465
Query: 503 IIQYISSSFDW-KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
+ +I S W + G ++ + P +R+T+ E G +L +R+P W
Sbjct: 466 VNLFIPSEVTWAERGFRLVQRSGYPDTD---TVRLTVA-----EGGGRLALKVRVPGWLA 517
Query: 562 SNGAQASLNGQNLPL---PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
G +A + P+ P PG +L+ RW D + + P L + P+ I
Sbjct: 518 DAGPRARVLVAGRPVDATPVPGRYLTLDRRWRTGDTVELTFPREL----VWRPAPDNPHI 573
Query: 619 QAILFGPYLLAG 630
+A+ +GP +LAG
Sbjct: 574 KAVSYGPLVLAG 585
>gi|333382563|ref|ZP_08474231.1| hypothetical protein HMPREF9455_02397 [Dysgonomonas gadei ATCC
BAA-286]
gi|332828505|gb|EGK01205.1| hypothetical protein HMPREF9455_02397 [Dysgonomonas gadei ATCC
BAA-286]
Length = 644
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 192/545 (35%), Positives = 284/545 (52%), Gaps = 45/545 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
LK+V L D Q+ + +++L L VD L+ SFR TA + K
Sbjct: 46 LKDVRLLDSPFRQN------MERESKWILSLGVDRLLHSFRNTAGVYAGREGGYMTIKKL 99
Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI----GTGY 207
GGWE+ ELRGH +GH +S A ++AST + K K ++V L+E Q+ + GY
Sbjct: 100 GGWESLDCELRGHSIGHIMSGLAYLYASTGDERYKIKADSLVAGLAEVQDILIENGQKGY 159
Query: 208 LSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
+SA+P L + A K VWAP+YT+HK+ AGL+DQY+ DN +AL + + Y Q
Sbjct: 160 ISAYPENLINRNIAGKSVWAPWYTLHKVYAGLIDQYLYCDNKEALDIMKEAASWAY---Q 216
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
K++ + S E+ L E GG+N+ Y LY+IT +P+H A F + LA
Sbjct: 217 KLMPL-SEEQRALMLRNEFGGVNEAFYNLYAITGNPEHKKSAEFFYHADVIDPLAEHKAD 275
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L HANT IP VIG YE+ K I FF + V +Y TGG S +E +
Sbjct: 276 LYFKHANTFIPKVIGEARNYELHNSERSKDIANFFWNTVIDHQTYCTGGNSHKEKFIHSD 335
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
++ L +ETC T NMLK++RHLF W YADYYERAL N +L Q+ + G++
Sbjct: 336 SISKNLTGYTQETCNTNNMLKLTRHLFCWDANAKYADYYERALYNHILG-QQDPQSGMVA 394
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y LP+ G K S T NSFWCC GTG E+ +K G++IY+ + GLY+ +I
Sbjct: 395 YFLPMLPGAHKVYS-----TPENSFWCCVGTGFENHAKYGEAIYYHDNN---GLYVNLFI 446
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S WK + + Q+ ++ + LT ++ +++ + LR P WT + +
Sbjct: 447 PSELTWKEKGIKIKQE----TAFPEEGNICLTVTTDKDIKM--PVYLRYPSWT--SNVEV 498
Query: 568 SLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLR-TEAIQDDRPEYASIQAILFGP 625
+NG+ + P +++ W DK+ + P+ L TE +D P+ A AI++GP
Sbjct: 499 KVNGKKTKIKQSPSGYITIDRTWKNGDKIEVHYPMHLYLTET--NDNPDKA---AIMYGP 553
Query: 626 YLLAG 630
+LAG
Sbjct: 554 LVLAG 558
>gi|376260258|ref|YP_005146978.1| hypothetical protein [Clostridium sp. BNL1100]
gi|373944252|gb|AEY65173.1| hypothetical protein Clo1100_0916 [Clostridium sp. BNL1100]
Length = 952
Score = 289 bits (739), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 180/517 (34%), Positives = 262/517 (50%), Gaps = 34/517 (6%)
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH- 181
+ YL +D + L+ F+K A L T YGGWEN + ++GH +GHY+SA AQ + +T
Sbjct: 58 VAYLRAIDPNRLLVGFKKAAGLSTTYSYYGGWENN-TLIQGHTMGHYMSALAQAYKNTKS 116
Query: 182 ----NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFE--ALKPVWAPYYTIHKI 235
NA +K ++ ++ L CQNK G GYL A P FD E A W P+YT+HKI
Sbjct: 117 DATVNADLKSRIDLIISELQACQNKNGNGYLFATPVTQFDVVEGKASGSSWVPWYTMHKI 176
Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYR 295
++GLLD Y N AL +AT + + Y RV + L E GGMND LY
Sbjct: 177 MSGLLDVYKFEGNQTALTIATNLGNWIYKRVNA----WDSATQSKVLGVEYGGMNDCLYE 232
Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTG--DP 353
LY +T + HL AH FD+ +A + L HANT IP IG+ RY G +
Sbjct: 233 LYKLTGNSNHLTAAHKFDETSLFNTIAAGTNVLPGKHANTTIPKFIGALNRYRTLGTTES 292
Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHL 413
Y F +IV H+Y TGG S E + +L + N ETC NMLK++R L
Sbjct: 293 SYLTAAQQFWNIVLKDHTYVTGGNSEDEHFRAAGKLDAYRDNVNNETCNVNNMLKLTREL 352
Query: 414 FRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW 473
F+ T ++ YADYYE AL N +++ Q E G+ Y +G G K S ++F+ FW
Sbjct: 353 FKVTGDVKYADYYENALINEIMASQN-PETGMATYFKAMGTGYFKVFS-----SQFDHFW 406
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC GTG+E+F+KL DS+Y+ N LY+ Y+SS +W + L Q+ + +S
Sbjct: 407 CCTGTGMENFTKLNDSLYYN---NGSDLYVNMYLSSILNWSEKGLSLTQQANLPLS--DK 461
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYS-NGAQASLNGQNLPLPPPGNFLSATERWSYN 592
+ T+ + EV + R P W + A +NG ++ + +L + W
Sbjct: 462 VTFTINSAPSSEV----KIKFRSPSWIAAGQTATVKVNGTSINIAKVNGYLDVSRVWQAG 517
Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
D + + LP +R + D+ + A +GP +L+
Sbjct: 518 DTVELTLPTEVRVSRLTDN----PNAVAFTYGPVVLS 550
>gi|315506549|ref|YP_004085436.1| hypothetical protein ML5_5828 [Micromonospora sp. L5]
gi|315413168|gb|ADU11285.1| protein of unknown function DUF1680 [Micromonospora sp. L5]
Length = 917
Score = 289 bits (739), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 190/532 (35%), Positives = 270/532 (50%), Gaps = 42/532 (7%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFVGHYLSASAQMW 177
Q + YL +DV+ L+++FR L T G A GGW+ P R H GH+L+A AQ W
Sbjct: 71 QNRTVAYLRFVDVNRLLYNFRANHRLSTGGAAANGGWDAPNFPFRTHMQGHFLTAWAQAW 130
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEA--LKPVWAPYY 230
A + T ++K T+V L+ CQ G GYLS FP F + EA L PYY
Sbjct: 131 AVLGDTTCRDKALTMVAELARCQANNGAAGFSAGYLSGFPESDFTALEARTLSNGNVPYY 190
Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
IHK LAGLLD + L + QA L +A W+ Q+ + S + L E
Sbjct: 191 CIHKTLAGLLDVWRLIGSTQARDVLLALAGWVD-------QRTGRLTSAQMQ-AMLGTEF 242
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGMN VL LY T D + L +A FD LA +D L+ HANT +P IG+
Sbjct: 243 GGMNAVLTDLYQQTGDGRWLTVAQRFDHAAVFNPLAANSDQLNGLHANTQVPKWIGAARE 302
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
Y+ TG Y+ I I +H+YA GG S E + P +A L ++ E C TYNM
Sbjct: 303 YKATGVTRYRDIAANAWAITVGAHTYAIGGNSQAEHFRAPNAIAGYLRNDTCEACNTYNM 362
Query: 407 LKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKAR 460
LK++R L++ + +AYAD+YERAL N ++ Q + G + Y PL RGV A
Sbjct: 363 LKLTRELWQLDPDRVAYADFYERALLNHMIGQQNPADAHGHVTYFTPLNPGGRRGVGPAW 422
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
W T +NSFWCC GTG+E+ + L D+IYF N L + ++ S W + +
Sbjct: 423 GGGTWSTDYNSFWCCQGTGLETNTTLADAIYFH---NGTTLTVNLFVPSVLTWSQRGITV 479
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-P 579
Q V T T + V ++ +R+P WT +GA S+NG + P
Sbjct: 480 TQATSYPVG------DTTTLTVTGSVAGSWTMRIRIPAWT--SGASVSVNGVAAGIAATP 531
Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
G++ T W+ D +T++LP+ + T A DD A++QA+ +GP +L+G+
Sbjct: 532 GSYAVLTRAWTSGDTVTVRLPMRVTTVAANDD----AAVQAVTYGPVVLSGN 579
>gi|427384240|ref|ZP_18880745.1| hypothetical protein HMPREF9447_01778 [Bacteroides oleiciplenus YIT
12058]
gi|425727501|gb|EKU90360.1| hypothetical protein HMPREF9447_01778 [Bacteroides oleiciplenus YIT
12058]
Length = 777
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 192/546 (35%), Positives = 283/546 (51%), Gaps = 43/546 (7%)
Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
K + DV L +S L A N +++ LD+D L+ +FRK A+L + Y WE+
Sbjct: 37 KYFGIQDVRLLESPFL-HAMNQNEQWMKELDLDRLLSNFRKNANLRPKAEPYDSWES--M 93
Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFD 217
+ GH +GH L+A +Q +A+T + T K K+ VV L CQ G++ P ++F
Sbjct: 94 GIAGHTLGHLLTAMSQHYAATGDETFKTKIDYVVNELDSCQMNFVNGFIGGMPGGDKVFK 153
Query: 218 SFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
+ L +W P+Y HK + GL D Y+LA N A K+ + +Y +
Sbjct: 154 EVKKGIIRSMGFDLNGIWVPWYNEHKTMMGLNDAYLLAGNETAKKVLINLSDYLAD---- 209
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
VI + E+ LN E GGMN+ ++Y++T D K+L ++ F LA D L
Sbjct: 210 VIAPLNEEQMQTMLNCEYGGMNEAFAQVYALTGDEKYLDASYAFYHKRLQDKLAEGIDAL 269
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
H+NT IP +IGS +YE+TG+ + I F + + HSYA GG S E+ P +
Sbjct: 270 QGLHSNTQIPKLIGSARQYELTGNQRDEKIARFSWETIVLHHSYANGGNSMGEYLSVPDK 329
Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
L+D LGS ETC TYNMLK++ HL+ WT ++ Y DYYERAL N +L+ Q E G + Y
Sbjct: 330 LSDRLGSNTCETCNTYNMLKLTGHLYEWTNDVQYLDYYERALYNHILASQH-PETGNVCY 388
Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ--- 505
L LG G K G+G++ N+F CC G+G E+ SK G +IY VPG +I
Sbjct: 389 FLSLGMGTHK-----GFGSRHNNFSCCMGSGFENHSKYGGTIY----SYVPGKEMININL 439
Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
YI S WK + L D + + L +SKQ + ++NLR P W +
Sbjct: 440 YIPSVLTWKEKSLKLRMTTD--YPEHGKIVIKLEETSKQSL----TINLRRPAWA-TGDV 492
Query: 566 QASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
+NG + PG+F+S RW ND + + LP+ L T ++ P+ A +A+ +G
Sbjct: 493 VVRINGSKQKVGNTPGSFISLHHRWKKNDVIELILPMPLYTVSM----PDNADRRAVFYG 548
Query: 625 PYLLAG 630
P +LAG
Sbjct: 549 PTILAG 554
>gi|302867043|ref|YP_003835680.1| hypothetical protein Micau_2566 [Micromonospora aurantiaca ATCC
27029]
gi|302569902|gb|ADL46104.1| protein of unknown function DUF1680 [Micromonospora aurantiaca ATCC
27029]
Length = 917
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 190/532 (35%), Positives = 270/532 (50%), Gaps = 42/532 (7%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFVGHYLSASAQMW 177
Q + YL +DV+ L+++FR L T G A GGW+ P R H GH+L+A AQ W
Sbjct: 71 QNRTVAYLRFVDVNRLLYNFRANHRLSTGGAAANGGWDAPNFPFRTHMQGHFLTAWAQAW 130
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEA--LKPVWAPYY 230
A + T ++K T+V L+ CQ G GYLS FP F + EA L PYY
Sbjct: 131 AVLGDTTCRDKALTMVAELARCQANNGAAGFSAGYLSGFPESDFTALEARTLSNGNVPYY 190
Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
IHK LAGLLD + L + QA L +A W+ Q+ + S + L E
Sbjct: 191 CIHKTLAGLLDVWRLIGSTQARDVLLALAGWVD-------QRTGRLTSAQMQ-AMLGTEF 242
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGMN VL LY T D + L +A FD LA +D L+ HANT +P IG+
Sbjct: 243 GGMNAVLTDLYQQTGDGRWLTVAQRFDHAAVFNPLAANSDQLNGLHANTQVPKWIGAARE 302
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
Y+ TG Y+ I I +H+YA GG S E + P +A L ++ E C TYNM
Sbjct: 303 YKATGVTRYRDIAANAWAITVGAHTYAIGGNSQAEHFRAPNAIAGYLRNDTCEACNTYNM 362
Query: 407 LKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKAR 460
LK++R L++ + +AYAD+YERAL N ++ Q + G + Y PL RGV A
Sbjct: 363 LKLTRELWQLDPDRVAYADFYERALLNHMIGQQNPADAHGHVTYFTPLNPGGRRGVGPAW 422
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
W T +NSFWCC GTG+E+ + L D+IYF N L + ++ S W + +
Sbjct: 423 GGGTWSTDYNSFWCCQGTGLETNTTLADAIYFH---NGTTLTVNLFVPSVLTWSQRGITV 479
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-P 579
Q V T T + V ++ +R+P WT +GA S+NG + P
Sbjct: 480 TQATSYPVG------DTTTLTVTGSVAGSWTMRIRIPAWT--SGASVSVNGVAAGIAATP 531
Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
G++ T W+ D +T++LP+ + T A DD A++QA+ +GP +L+G+
Sbjct: 532 GSYAVLTRAWTSGDTVTVRLPMRVTTVAANDD----AAVQAVTYGPVVLSGN 579
>gi|189464178|ref|ZP_03012963.1| hypothetical protein BACINT_00515 [Bacteroides intestinalis DSM
17393]
gi|189437968|gb|EDV06953.1| hypothetical protein BACINT_00515 [Bacteroides intestinalis DSM
17393]
Length = 777
Score = 288 bits (738), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 197/571 (34%), Positives = 293/571 (51%), Gaps = 47/571 (8%)
Query: 75 DEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSL 134
D+V +A Y+ P D+ + S+ DV L S L A N +++ LD+D L
Sbjct: 16 DQVGFAQNYK----PAVKDVISPKTRYFSIQDVRLLDSPFL-HAMNQNEQWMKELDLDRL 70
Query: 135 VWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVF 194
+ +FRK A+L + YG WE+ + GH +GH L+A +Q +A+T + T K K+ VV
Sbjct: 71 LSNFRKNANLKPKAEPYGSWES--MGIAGHTLGHLLTAMSQHYAATGDETFKAKIDYVVN 128
Query: 195 SLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVWAPYYTIHKILAGLLDQY 243
L CQ G++ P ++F + L +W P+Y HK + GL D Y
Sbjct: 129 ELDSCQMNFVNGFIGGMPGGDKVFKEVKKGIIRSMGFDLNGIWVPWYNEHKTMMGLNDAY 188
Query: 244 VLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDP 303
+LA N A K+ + +Y + VI S E+ LN E GGMN+ ++Y++T D
Sbjct: 189 LLAGNETAKKVLINLSDYLAD----VIAPLSEEQMQTMLNCEYGGMNEAFAQMYALTGDK 244
Query: 304 KHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFM 363
K L ++ F LA D L H+NT IP +IGS +YE+TG+ + I F
Sbjct: 245 KFLDASYAFYHKRLQDKLAEGVDVLQGLHSNTQIPKLIGSARQYELTGNHRDEEIARFSW 304
Query: 364 DIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYA 423
+ + HSYA GG S E+ P +L + LG+ ETC TYNMLK++ HL+ WT ++ Y
Sbjct: 305 ETIVHHHSYANGGNSMGEYLSVPDKLNNRLGTNTCETCNTYNMLKLTAHLYEWTNDVQYL 364
Query: 424 DYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESF 483
DYYERAL N +L+ Q E G + Y L LG G K G+G++ N+F CC G+G E+
Sbjct: 365 DYYERALYNHILASQH-PETGNVCYFLSLGMGTHK-----GFGSRHNNFSCCMGSGFENH 418
Query: 484 SKLGDSIYFEEEGNVPG---LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
SK G +IY VPG + I YI S WK + L D +++ T
Sbjct: 419 SKYGGAIY----SYVPGKEMMNINLYIPSVLTWKEKSLKLRMTTDYPEHGKVVIKLEET- 473
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQL 599
SK+ + ++NLR PVW + A +NG + PG+F+S +W ND + + L
Sbjct: 474 -SKEPL----TINLRRPVWAAGDVA-IRINGSKQKVESVPGSFISLHRKWKKNDVIELIL 527
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
P+ L T ++ P+ +A+ +GP +LAG
Sbjct: 528 PMPLYTVSM----PDNVDRRAVFYGPTILAG 554
>gi|33113961|gb|AAP94583.1| putative protein [Zea mays]
Length = 786
Score = 288 bits (738), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 134/205 (65%), Positives = 160/205 (78%)
Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSF 219
+L GHFVGHYL A+A+MWASTHN T+ KMS +V +L +CQ K+G GYLSAFP+E F
Sbjct: 475 QLWGHFVGHYLGATAKMWASTHNDTLNAKMSYIVNALYDCQKKMGIGYLSAFPSEFFVWV 534
Query: 220 EALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
EA+ VWAPYYTIHKI+ GLLDQY +A N+ AL M MV YF +RV+ VI YS+E HW
Sbjct: 535 EAITSVWAPYYTIHKIMQGLLDQYTVAGNSVALVMVVKMVNYFSDRVKNVIQNYSIETHW 594
Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
SLNE+TGGMNDV Y+LY+I +D KHL LA LFDKPCFLG LA Q D +S FH+NT IP+
Sbjct: 595 ESLNEKTGGMNDVFYQLYTIMNDTKHLTLAPLFDKPCFLGLLAGQDDSISGFHSNTRIPV 654
Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMD 364
IG+QMRY+VTGDPLYK I +FFMD
Sbjct: 655 AIGAQMRYKVTGDPLYKQIASFFMD 679
>gi|332880466|ref|ZP_08448140.1| hypothetical protein HMPREF9074_03915 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357046164|ref|ZP_09107794.1| hypothetical protein HMPREF9441_01810 [Paraprevotella clara YIT
11840]
gi|332681454|gb|EGJ54377.1| hypothetical protein HMPREF9074_03915 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355531170|gb|EHH00573.1| hypothetical protein HMPREF9441_01810 [Paraprevotella clara YIT
11840]
Length = 641
Score = 288 bits (737), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 182/514 (35%), Positives = 269/514 (52%), Gaps = 33/514 (6%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPG-------KAYGGWENPISELRGHFVGHYLSASAQMW 177
+++ + D L+ FR TA + K GGWE+ ELRGH GH LSA A M+
Sbjct: 67 WMVSIGADRLLHGFRTTAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHVLSALALMY 126
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A+T + K K ++V L+E Q GYLSA+P EL + + VWAP+YT+HK+ +
Sbjct: 127 AATGSDVFKMKGDSLVAGLAEVQAAGTGGYLSAYPEELINRNIRGESVWAPWYTLHKLFS 186
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GL+DQY+ A NAQAL + M ++ Y +++ + E + E GG+N+ Y LY
Sbjct: 187 GLIDQYLYARNAQALDVVRKMGDWAYGKLRPL----PEEMRRKMIRNEFGGINESFYNLY 242
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
++T D ++ LA F + L Q D L H NT IP V+ YE+TGD K
Sbjct: 243 ALTGDERYRWLAGFFYHNDVIDPLKEQRDDLGTKHTNTFIPKVLAEARNYELTGDGDSKA 302
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
+ FF + H++A G +S +E ++DP + + ETC TYNMLK+SRHLF W
Sbjct: 303 LSEFFWHTMIGRHTFAPGCSSDKEHYFDPDEFSKHISGYTGETCCTYNMLKLSRHLFCWE 362
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
ADYYERAL N +L Q+ G++ Y LPL G K S T NSFWCC G
Sbjct: 363 ASPEVADYYERALYNHILG-QQDPATGMVSYFLPLQSGTHKVYS-----TPENSFWCCVG 416
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
+G ES +K +SIY+ E LY+ +I S WK + L Q+ + R+T
Sbjct: 417 SGFESHAKYAESIYYRGEDC---LYVNLFIPSELAWKEKGLNLRQETR--FPEEETTRLT 471
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLT 596
L + + + ++ LR P W S +NG+++ + PG++++ RW D++
Sbjct: 472 LALETPRRL----AVKLRYPSW--SGRPTVRVNGKSVRVKQHPGSYITLDRRWEDGDRIE 525
Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ P+ L E + P+ A+L+GP +LAG
Sbjct: 526 VTYPMRLAMERM----PDNPHKGALLYGPIVLAG 555
>gi|429199615|ref|ZP_19191363.1| hypothetical protein STRIP9103_08616 [Streptomyces ipomoeae 91-03]
gi|428664699|gb|EKX63974.1| hypothetical protein STRIP9103_08616 [Streptomyces ipomoeae 91-03]
Length = 655
Score = 287 bits (735), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 184/539 (34%), Positives = 273/539 (50%), Gaps = 40/539 (7%)
Query: 113 SVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLS 171
V R + LEY D ++ FR A L T G + GGWE LRGH+ GH+L+
Sbjct: 6 GVFRRKRDLMLEYARSYPADRILAVFRANAGLDTRGARPPGGWETSDGNLRGHYGGHFLT 65
Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT---------GYLSAFPTE---LFDSF 219
AQ +A T A +K K+ +V +L+ECQ + G+L+A+P L +S+
Sbjct: 66 LVAQAYADTREAALKAKLDYLVGALAECQRTLAERGNPRPSHPGFLAAYPETQFILLESY 125
Query: 220 EALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
+WAPYYT HKI+ GLLD + LA NA+AL +A+ M ++ ++R+ + + ++R W
Sbjct: 126 TTYPTIWAPYYTCHKIMRGLLDAHTLAGNAEALTVASKMGDWVHSRLGR-LPKAQLDRMW 184
Query: 280 -YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
+ E GGMN+V+ LY++T +HL A FD L A D L HAN HIP
Sbjct: 185 SIYIAGEYGGMNEVMADLYALTGRAEHLAAARCFDNTALLDACAEDRDILDGRHANQHIP 244
Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
G ++ TG+ Y F +V +Y+ GGT E + +A TL +N
Sbjct: 245 QFTGYLRMFDHTGEERYADAARNFWGMVAGHRTYSLGGTGQGEMFRARDAVAATLDDKNA 304
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ---RGTEPGVMIYMLPLGRG 455
ETC TYNMLK+SR LF + AY D+YER LTN +L+ + R T+ + Y + +G G
Sbjct: 305 ETCATYNMLKLSRQLFFRDPDPAYMDHYERGLTNHILASRRDARSTDGPEVTYFVGMGPG 364
Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
V + G CC GTG+E+ +K DS+YF + LY+ Y++S+ W
Sbjct: 365 VVREYGNIG--------TCCGGTGMENHTKYQDSVYF-RSADGGALYVNLYLASTLRWPE 415
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
+V+ Q D + TLTF +E G L LR+P W + G ++NG
Sbjct: 416 RGIVVEQTSD----FPAEGVRTLTF---REGGGTLDLKLRIPSWA-TEGVTVTVNGVRQR 467
Query: 576 LPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+ PG +L+ + W D++ I P LR E DD ++Q++ GP LL ++
Sbjct: 468 VEAVPGTYLTLSRSWQRGDRVAISTPYRLRIERALDD----PAVQSVFHGPVLLVARSA 522
>gi|399029634|ref|ZP_10730435.1| hypothetical protein PMI10_02273 [Flavobacterium sp. CF136]
gi|398072450|gb|EJL63666.1| hypothetical protein PMI10_02273 [Flavobacterium sp. CF136]
Length = 642
Score = 286 bits (733), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 171/486 (35%), Positives = 263/486 (54%), Gaps = 28/486 (5%)
Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSA 210
GGWE+ +LRGH GH LS A ++A+T K K ++V L E Q + GYLSA
Sbjct: 102 GGWESLDCDLRGHSTGHILSGLALLYAATGEKMYKIKADSLVTGLDEVQKVLNQNGYLSA 161
Query: 211 FPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
FP L D A K VWAP+YT HK+ +GL+DQY+ D+ AL++ M ++ Y +++ +
Sbjct: 162 FPQNLIDRAIAGKSVWAPWYTQHKLFSGLMDQYLYCDSEPALEIVKGMADWAYEKLKSLT 221
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
E L E GGMND Y LY IT + K+ LA F L L + D L+
Sbjct: 222 N----EERKRMLRNEFGGMNDSFYALYEITAESKYKFLAEFFYHEDALDPLLNKTDNLNK 277
Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
HANT+IP +IG YE+ G + I FF + V H++ TG S +E +++P L+
Sbjct: 278 KHANTYIPKLIGISRDYELEGGSKNREIPEFFWNTVVNHHTFVTGSNSDKEKFFEPDHLS 337
Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
+ L E+C YNMLK++RHL+ +I Y DYYE+AL N +L Q+ + G++ Y L
Sbjct: 338 EHLSGFTGESCNVYNMLKLTRHLYGVNPQIKYVDYYEKALYNHILG-QQDPKTGMVAYFL 396
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
P+ G K S T NSFWCC G+G E+ +K G+ IY+ ++ GLY+ +I S
Sbjct: 397 PMMPGAHKVYS-----TPENSFWCCVGSGFENQAKYGEFIYYHDK----GLYVNLFIPSE 447
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
+WK +++ Q+ S+ TLT S+K V +++R P W + GA+ +N
Sbjct: 448 LNWKEKGIIVKQE----TSFPNVGSTTLTLSTKNPVSM--PISIRYPSW--AAGAEVKVN 499
Query: 571 GQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
G+ + PG++++ +WS D++ + + ++ P+ ++ A+ +GP +LA
Sbjct: 500 GKKQIINVKPGSYITLERKWSDGDRIEVSFGIQIKLAPT----PDNPNVVAVTYGPIVLA 555
Query: 630 GHTSGE 635
G E
Sbjct: 556 GEMGTE 561
>gi|374983575|ref|YP_004959070.1| putative glycosylase [Streptomyces bingchenggensis BCW-1]
gi|297154227|gb|ADI03939.1| putative glycosylase [Streptomyces bingchenggensis BCW-1]
Length = 713
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 189/538 (35%), Positives = 273/538 (50%), Gaps = 42/538 (7%)
Query: 114 VLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSA 172
V R + L Y D ++ FR A L T G + GGWE LRGH+ GH+L+
Sbjct: 65 VFRRKRDLMLGYARSYPADRILAVFRANAGLDTRGARPPGGWETSDGNLRGHYGGHFLTL 124
Query: 173 SAQMWASTHNATIKEKMSTVVFSLSECQNKIGT---------GYLSAFPTE---LFDSFE 220
AQ +A T A +K K+ +V +L ECQ + GYL+A+P L +S+
Sbjct: 125 IAQAYADTREAALKTKLDYLVGALGECQKALADHGSPIPSHPGYLAAYPETQFILLESYT 184
Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW- 279
+WAPYYT HKI+ GLLD + L N QAL++A+ M ++ ++R+ + +ER W
Sbjct: 185 TYPTIWAPYYTCHKIMRGLLDAHTLGGNQQALQIASGMGDWVHSRLGH-LPAAQLERMWS 243
Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
+ E GGMN+VL LY++T +HL A FD L A D L HAN HIP
Sbjct: 244 IYIAGEYGGMNEVLADLYALTGRAEHLAAARCFDNTALLKACAENRDILEGRHANQHIPQ 303
Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE 399
G ++ T Y F +V S Y+ GGT E + +A TL +N E
Sbjct: 304 FTGYLRLFDHTAKQEYSSAARNFWGMVTGSRMYSLGGTGQGEMFRARGAIAATLDDKNAE 363
Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR---GTEPGVMIYMLPLGRGV 456
TC TYNMLK++R LF + AY DYYER LTN +L+ +R T+ + Y + +G GV
Sbjct: 364 TCATYNMLKLTRQLFFHQPDPAYMDYYERGLTNHILASRRDAAATDSPEVTYFVGMGPGV 423
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGNVPGLYIIQYISSSFDWKS 515
+ N+ CC GTG+E+ +K DS+YF +GN LY+ Y++S+ W
Sbjct: 424 RREFD--------NTGTCCGGTGMENHTKYQDSVYFRSADGNA--LYVNLYLASTLRWPE 473
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG-QNL 574
V+ Q D + TLTF ++ G+L L LR+P W + G ++NG +
Sbjct: 474 RGFVIEQSSD----FPAEGVRTLTF--REGSGRL-DLRLRVPAWA-TAGFTVTVNGVRQR 525
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
PG++LS + W D++ I P SLR E DD ++Q++ +GP LL +
Sbjct: 526 AEAEPGSYLSLSRDWRPGDRVRISAPNSLRIERALDD----PTVQSVFYGPVLLTAQS 579
>gi|256376951|ref|YP_003100611.1| hypothetical protein Amir_2836 [Actinosynnema mirum DSM 43827]
gi|255921254|gb|ACU36765.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 614
Score = 286 bits (731), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 190/521 (36%), Positives = 265/521 (50%), Gaps = 34/521 (6%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
R + YL LD D L+ +FR+ L + GGWE+P +ELRGH GH LSA AQ
Sbjct: 66 RNESRTHAYLKFLDPDRLLHTFRRNVGLASGATPCGGWESPTTELRGHSTGHVLSALAQA 125
Query: 177 WASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEALKPVWAPYYT 231
ST + K K +V L+ CQ++ TGYLSAFP D EA + VWAPYYT
Sbjct: 126 HTSTGDTAFKTKSDYLVAGLAACQDRAAAAGFNTGYLSAFPESFIDRVEARQQVWAPYYT 185
Query: 232 IHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMND 291
+HKILAGLLD + L +AQAL + T + R ++ + + L E GGMN+
Sbjct: 186 LHKILAGLLDAHQLTGSAQALTVLTRKAAWVAWRNGRL----TQAQRQAMLGTEFGGMNE 241
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTG 351
VL LY +T DP HL A FD LA D LS FHANT IP +G+ Y TG
Sbjct: 242 VLANLYQLTGDPLHLTAARYFDHAQVFDPLAAGRDALSGFHANTQIPKALGAIREYHATG 301
Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSR 411
+ Y+ I F + V +H+YA GG S E++ +P R+A L E C T+NMLK++R
Sbjct: 302 ETRYRDIARNFWNFVVGAHTYAIGGNSNGEYFKNPGRIASELSDSTCECCNTHNMLKLTR 361
Query: 412 HLFRWTK-EIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARSTHGWGTKF 469
LFR D++E+AL N +L Q + G Y +PL G + S +
Sbjct: 362 QLFRTEPGRPELFDFHEKALYNHLLGAQNPDSAHGHHSYYVPLRAGGQRTFS-----NDY 416
Query: 470 NSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVS 529
F CC+GTG+E+ +K DSIYF L++ +I S+ W + + Q D
Sbjct: 417 QDFTCCHGTGMETNTKHRDSIYFHGGET---LWVNLFIPSTLTWPGRGITVRQ--DTGFP 471
Query: 530 WDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERW 589
++T+T S + + L LR+P W + GA+ LNG + PG + W
Sbjct: 472 DTASTKLTITGSGRVD------LRLRVPAW--ATGARLRLNGAPV-AATPGGYARIDRTW 522
Query: 590 SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ D + + LP++L E+ DD + Q + GP +LAG
Sbjct: 523 ASGDTVELTLPMALTRESAPDD----PAAQVVKHGPIVLAG 559
>gi|289773961|ref|ZP_06533339.1| secreted protein [Streptomyces lividans TK24]
gi|289704160|gb|EFD71589.1| secreted protein [Streptomyces lividans TK24]
Length = 854
Score = 286 bits (731), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 202/562 (35%), Positives = 274/562 (48%), Gaps = 43/562 (7%)
Query: 91 GFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA 150
G PG L+ L V L S L ++T YL +D D L+ +FR LP+ +
Sbjct: 43 GAHRPGPLLEPFPLSAVRLLDSPFLANMRRT-CAYLRFVDPDRLLHTFRLNVGLPSAAEP 101
Query: 151 YGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT----- 205
GGWE P +LRGH GH LSA AQ A T +K +V +L+ECQ
Sbjct: 102 CGGWEAPDVQLRGHTTGHLLSALAQAHAGTGETAYADKARLLVSALAECQRAAPAAGFHR 161
Query: 206 GYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEY 261
GYLSAFP +FD EA WAPYYT+HKI+AGLLDQY L+ N +A L+MA W
Sbjct: 162 GYLSAFPESVFDQLEAGGKPWAPYYTLHKIMAGLLDQYRLSGNREAFDVLLEMAAW---- 217
Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
+ S ER L E GGMNDVL RL+ T DP HL A FD L
Sbjct: 218 ----TEARTAPLSRERMQSVLKVEFGGMNDVLARLHLETGDPVHLRTARRFDHDELYAPL 273
Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
A D L+ HANT I V+G+ YE TGD Y I F V HSYA GG S +E
Sbjct: 274 AAGRDELAGRHANTEIAKVVGAVPAYEATGDRRYLDIADTFWTTVVRHHSYAIGGNSNQE 333
Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQR- 439
+ P +A L E C +YNMLK+ R LFR E Y D+YE L N +L+ Q
Sbjct: 334 LFGPPDEIASRLSEVTCENCNSYNMLKLGRDLFRHDPERTEYLDHYEWTLYNQMLAEQDP 393
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGT-------KFNSFWCCYGTGIESFSKLGDSIYF 492
+ G + Y L G S+ G G+ +++F C +GTG+E+ +K D++YF
Sbjct: 394 DSAHGFVTYYTGLWAG-SRREPKGGLGSAPGSYSGDYDNFSCDHGTGLETHTKFADTVYF 452
Query: 493 EEEG-NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
G P L++ ++ S W V L Q D + + D R+T+T + +
Sbjct: 453 RTPGTRRPALHVNLFVPSEVCWDDLGVTLRQDTD-MPTGD-RTRLTVTGGEAR-----FA 505
Query: 552 LNLRMPVWTYSNGAQASL--NGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
L +R+P W + +A L NG+ PG + + T W D++ + LP +
Sbjct: 506 LRIRVPGWLAAGDGRAGLTVNGRRTGGRLEPGTYTTVTRHWRTGDRVELVLPRV----PV 561
Query: 609 QDDRPEYASIQAILFGPYLLAG 630
P+ ++A+ +GP +LAG
Sbjct: 562 WRPAPDNPQVKAVSYGPLVLAG 583
>gi|399071242|ref|ZP_10749941.1| hypothetical protein PMI01_00976 [Caulobacter sp. AP07]
gi|398043612|gb|EJL36503.1| hypothetical protein PMI01_00976 [Caulobacter sp. AP07]
Length = 789
Score = 285 bits (729), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 187/532 (35%), Positives = 269/532 (50%), Gaps = 44/532 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A + N YLL L D + +F A LP G+ YGGWE+ + GH +GHY+SA M+
Sbjct: 53 AVEVNRAYLLRLSPDRFLHNFMTFAGLPAKGEIYGGWES--DTIAGHTLGHYVSALVVMY 110
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE-----LFDSFEALKPV------- 225
T + + + +V L+ Q K G GY+ A + + D E V
Sbjct: 111 EQTGDVECRRRADYIVGELARAQAKRGDGYIGALQRKRKDGTVVDGEEIFAEVMKGDIRS 170
Query: 226 --------WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
W+P YT+HK AGLLD + N QAL +A + YF ++V + E+
Sbjct: 171 GGFDLNGSWSPLYTVHKTFAGLLDVHRAWGNQQALDVAVGLGGYF----ERVFAALNDEQ 226
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
L E GG+N+ LY+ T D + L++A L L Q D L++FHANT +
Sbjct: 227 MQTLLGCEYGGLNESYAELYARTGDRRWLVVAERIYDRKVLDPLVAQQDKLANFHANTQV 286
Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
P +IG YE+TG P FF + V HSY GG + RE++ +P +A + +
Sbjct: 287 PKLIGLGRLYELTGKPQDAAAARFFWNTVTQHHSYVIGGNADREYFAEPDTIAAHISEQT 346
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
E C TYNMLK++R L+ W E A DYYERA N V++ Q + G YM PL G
Sbjct: 347 CEHCNTYNMLKLTRQLYSWRPEGALFDYYERAHLNHVMAAQ-NPKTGGFTYMTPLLTGAD 405
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
+ ST+ + ++FWCC GTG+ES +K G+SI++E EG L + YI + WK+
Sbjct: 406 RGYSTN----EDDAFWCCVGTGMESHAKHGESIFWEGEG---ALLVNLYIPAEAQWKARG 458
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
L ++D ++P R+TL +K G+ ++ LR+P W S A+ S+NGQ +
Sbjct: 459 AAL--RLDTRYPFEPESRLTLAKLAKP--GRF-TIALRVPAWAGSE-AKVSVNGQVVTPE 512
Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
G + RW D + I LPL LR EA D AS A++ GP +LA
Sbjct: 513 MAGGYALVDRRWREGDVVAITLPLGLRLEATPGD----ASTVAVVRGPMVLA 560
>gi|21231831|ref|NP_637748.1| hypothetical protein XCC2394 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66768042|ref|YP_242804.1| hypothetical protein XC_1718 [Xanthomonas campestris pv. campestris
str. 8004]
gi|21113547|gb|AAM41672.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66573374|gb|AAY48784.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 791
Score = 285 bits (729), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 203/612 (33%), Positives = 297/612 (48%), Gaps = 63/612 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
++ V L V L S+ A TN YL+ L D L+ +F A L AYGGWE
Sbjct: 49 IRAVPLAQVRL-MPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWE--A 105
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
+ GH +GHYLSA A M A T +A + + S +V L+ CQ G GY++ F
Sbjct: 106 DTIAGHTLGHYLSALALMHAQTDDAQCRTRASYLVAELARCQAHAGDGYVAGFTRKNAAG 165
Query: 214 ------ELFDSFE--ALKPV-------WAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
E+FD + ++P+ WAP YT HK+ AGLLD +V DNAQAL++A +
Sbjct: 166 QIESGREVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVGL 225
Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
Y +Q V ++ + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQAVFSVLDDAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHAVL 281
Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
L Q D L H H+NT+IP +IG YEVTGD FF + V HSY GG
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGNG 341
Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
RE++ P +A L + E C++YNMLK++RHL++W + AY DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSIARFLTEQTCEHCSSYNMLKLTRHLYQWGPQAAYFDYYERTLLNHVMA-Q 400
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
+ G+ YM P+ G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+E+
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWEDG--- 452
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
G+ I Y+ S +G + P LR+ ++++ +L+LR+P
Sbjct: 453 QGVAINLYVPSRVRNAAGLDMTLHSALP-AQGSVSLRIDAAPAAQR------TLSLRVPG 505
Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
W + Q LNG + +L T W D L + L + LR EA DD P + S
Sbjct: 506 WAAAPVLQ--LNGAVVDAAAVDGYLRVTRIWHPGDTLNLSLQMPLRLEATPDD-PAWVS- 561
Query: 619 QAILFGPYLLA---GHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
+L GP +LA G + W KT ++ + P+ +G ++V
Sbjct: 562 --VLRGPLVLAADLGDAATPWSGKTLALIGGDEVLQQLQPA-----------AGQGSYVY 608
Query: 676 SNSNQSITMEEF 687
S+ Q F
Sbjct: 609 SDGAQQWRFSPF 620
>gi|374324035|ref|YP_005077164.1| hypothetical protein HPL003_21035 [Paenibacillus terrae HPL-003]
gi|357203044|gb|AET60941.1| hypothetical protein HPL003_21035 [Paenibacillus terrae HPL-003]
Length = 767
Score = 285 bits (728), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 190/569 (33%), Positives = 288/569 (50%), Gaps = 39/569 (6%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENP 157
+KE HDV L++ S A L+Y+ +D D ++++FR TA++ T G + GW+ P
Sbjct: 191 VKEFKGHDVRLEKESEFGAAMDRFLQYVRSVDDDQMLYNFRATAAVDTKGAQPMTGWDAP 250
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI------GTGYLSAF 211
L+GH GHYLSA A + +T ++ + K+ +V L +CQ + G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYNATEDSALLGKIQYMVAELGKCQTALSEQAGYGRGFLSAY 310
Query: 212 PTELFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
E F+ E +WAPYYT+HKI+AGLLD Y LA +AL++ + + +NR+ +
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALEICDKLGHWLHNRLSR 370
Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ + + W + E GGMN+VL +LY+IT +L+ A FD + D
Sbjct: 371 -LPREQLHKMWSLYIAGEFGGMNEVLAKLYAITSHEHYLITAKYFDNEKLFLPMKENVDT 429
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L + HAN HIP VIG+ +EV G+ Y I F +V H Y+ GG E + +P
Sbjct: 430 LGNMHANQHIPQVIGALKLFEVAGEKAYFKIAENFWTMVTQRHIYSIGGAGETEMFREPD 489
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP-GVM 446
+A L + ETC +YNMLK+++ LF++ Y DYYE+AL N +L+ + + G
Sbjct: 490 AIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEGGS 549
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
Y +PL G K TH CC+GTG+E+ K ++IYF +E LY+ Y
Sbjct: 550 TYFMPLAPGSIKKFDTH-------ENTCCHGTGLENHFKYQEAIYFYDEDR---LYVNLY 599
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
I S DW + L QK D + + E G ++L R+P W S Q
Sbjct: 600 IPSQLDWSEQGLSLIQKRDQSSLEKAHFYI--------EGGTETTLMFRIPDWV-SEPVQ 650
Query: 567 ASLNGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+NG+ L +L + W D++ + LP SLR + +D + ++ +GP
Sbjct: 651 VKINGEPCRDLEYEHGYLKLRKVWK-EDEIELTLPRSLRLASAPNDH----TFMSLTYGP 705
Query: 626 YLLAGHTSGEWDIKTGTARSLSALISPIP 654
Y+LA SGE D + T L IP
Sbjct: 706 YVLAA-ISGEQDYISWTYSEQEFLEQIIP 733
>gi|374322441|ref|YP_005075570.1| hypothetical protein HPL003_12970 [Paenibacillus terrae HPL-003]
gi|357201450|gb|AET59347.1| hypothetical protein HPL003_12970 [Paenibacillus terrae HPL-003]
Length = 774
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 194/530 (36%), Positives = 263/530 (49%), Gaps = 46/530 (8%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
+A + N YLL L D L+ FR+ A L T Y GWE + GH +GHYLSA + M
Sbjct: 28 QAMELNRSYLLELQPDRLLARFREYAGLSTKAPQYEGWE--AMSISGHTLGHYLSACSMM 85
Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPV 225
+AST + KE + L CQ G GY+S P ELF+ A L
Sbjct: 86 YASTGDNRFKEIAHYITDELDVCQEAHGDGYVSGIPGGKELFEEVSAGNIRSKGFDLNGA 145
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYS 281
WAP YT+HK+ AGL D Y L +AL K+A W+ ++T S E+
Sbjct: 146 WAPLYTLHKLFAGLRDAYHLTGCNKALLVERKLADWL--------GGILTPMSDEQMQQM 197
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
+ E GGMN+VL LY+ T + +L LA F L L+ Q D L HANT IP +I
Sbjct: 198 MFCEYGGMNEVLADLYADTGEESYLRLAECFWHKLVLDPLSSQEDCLQGIHANTQIPKLI 257
Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
G YE+T D + FF D V HSY GG S E++ P L D +G ETC
Sbjct: 258 GLAKEYELTNDTKRRATVEFFWDRVVDHHSYVIGGNSFGEYFGAPGGLNDRIGPHTTETC 317
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
TYNMLK++ HLF+W AD+YER L N +L+ Q GV Y L L G K
Sbjct: 318 NTYNMLKLTSHLFQWNVSAKEADFYERGLFNHILASQDPVHGGV-TYFLSLAMGGHKH-- 374
Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
+ +KF+ F CC GTG+E+ + G IYF + LY+ Q+I+S+ +WK V L
Sbjct: 375 ---FESKFDDFTCCVGTGMENHASYGSGIYFHDHDK---LYVNQFIASTLEWKDTGVTLK 428
Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPG 580
Q S+ TL Q + L +R P W G +NG+ + PG
Sbjct: 429 QS----TSYPDTDHTTLEIQCDQPAKFM--LLVRYPYWA-EKGITIRVNGKEQSVVSEPG 481
Query: 581 NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+F+S W D + + +P+SLR E + D+ P+ A A+++GP +LAG
Sbjct: 482 SFVSIARTWIDGDVVEVTIPMSLRLEQMPDN-PDRA---AVMYGPLVLAG 527
>gi|436837799|ref|YP_007323015.1| hypothetical protein FAES_4423 [Fibrella aestuarina BUZ 2]
gi|384069212|emb|CCH02422.1| hypothetical protein FAES_4423 [Fibrella aestuarina BUZ 2]
Length = 781
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 187/534 (35%), Positives = 272/534 (50%), Gaps = 56/534 (10%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A + + +LL L D L+ FR A L YGGWE+ S L GH +GHYLSA A +
Sbjct: 58 AMEADTRFLLNLQPDRLLAQFRAHAGLAPKAAKYGGWES--SGLAGHSLGHYLSALALQY 115
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-------------LKP 224
A+T++ ++++ +V L++CQ TGY+ A P E D+ A L
Sbjct: 116 AATNDPEYLKRVNYIVDELADCQRARKTGYVGAIPRE--DTVFAEVAQGNIRSRGFDLNG 173
Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALK----MATWMVEYFYN----RVQKVITMYSVE 276
W+P+YT+HK++AGLLD Y+ A N +AL MA W E N +VQK++
Sbjct: 174 AWSPWYTVHKVMAGLLDAYLYAHNDKALAVTVGMADWTGETLKNLTDEQVQKMLLC---- 229
Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
E GGMNDVL +Y++T + K+L L++ F L LA Q D L HANT
Sbjct: 230 --------EYGGMNDVLANIYALTGNKKYLDLSYKFHDRVVLDSLAHQKDILPGRHANTQ 281
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
+P +IG+ RYE+TG + FF V H+YA GG S E+ P +L D L
Sbjct: 282 VPKLIGTIRRYELTGSQPDLAMSDFFWKTVVNHHTYAPGGNSNYEYLSTPDQLTDKLTDN 341
Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
ETC T+NMLK++RHLF AY DYYERAL N +L+ Q + G++ Y +PL G
Sbjct: 342 TMETCNTHNMLKLTRHLFALQPNAAYMDYYERALYNHILASQH-HKTGMVCYFVPLRMGT 400
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
K + + F CC GTG+E+ K G+SI+F +G L++ +I S +W
Sbjct: 401 RKH-----FSDEEEDFTCCVGTGMENHVKYGESIFF--KGADQSLFVNLFIPSELNWAEK 453
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
+ L + + DP +R+T+ ++ + LR P W + Q +NG+
Sbjct: 454 GLRLTLNAN--LPADPTVRLTVQADKPTKL----PIRLRKPYW-LAGPMQVRVNGKAATS 506
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
++ +RW D + + LP SLR + P+ + QA +GP LLAG
Sbjct: 507 TVQDGYVVIDQRWKTGDVVELTLPASLRAMPM----PDNIARQAFFYGPVLLAG 556
>gi|298246853|ref|ZP_06970658.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
gi|297549512|gb|EFH83378.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
Length = 600
Score = 282 bits (722), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 182/537 (33%), Positives = 279/537 (51%), Gaps = 36/537 (6%)
Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL----PTPGKAYGGWENPISELRGHFV 166
Q L + + N Y+L L +L+ + A L P + GWE+P +LRGHF+
Sbjct: 16 QPGPLKKRAELNRAYMLSLKSTNLLQNHYGEAGLWNPPQQPTDCHRGWESPTCQLRGHFL 75
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVW 226
GH+LSA+A++ AST + IK K +V L+ CQ ++ ++ + P + D K VW
Sbjct: 76 GHWLSAAARLVASTGDTEIKGKADFIVAELARCQQEMEGEWIGSIPEKYLDWIARGKRVW 135
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
AP+YT+HK L GL D Y + N QAL + ++F+ + +S E+ L+ ET
Sbjct: 136 APHYTLHKTLMGLYDMYEIGQNEQALDILIHWADWFH----RWTGQFSREQMDDILDVET 191
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGM +V LY +T+ +HL L +D+ L D L++ HANT IP V G+
Sbjct: 192 GGMLEVWANLYGVTNRQEHLDLIRRYDRSRLFDRLLAGEDVLTYMHANTTIPEVHGAARA 251
Query: 347 YEVTGDPLYK-LIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
+EVTG+ ++ ++ ++ V + TGG ++ E W P +L LG EN+E CT YN
Sbjct: 252 WEVTGEQRWRDIVEAYWRLAVTDRGYFCTGGQTSDEVWCPPHQLGGQLGPENQEHCTVYN 311
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
+++++ +LFRWT ++ YADYYER NG+L+ Q+ + G++ Y LPL G +K W
Sbjct: 312 LMRLANYLFRWTGDVVYADYYERNFYNGILA-QQNAQTGMVAYYLPLETGGTKV-----W 365
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH----VVLN 521
GT N FWCC+GT +++ + IYF N GL + QYI S W V L
Sbjct: 366 GTPTNDFWCCHGTLVQAQASHTRDIYFT---NDEGLVVSQYIPSRLQWHHDGSEVIVTLE 422
Query: 522 QKVDPIVSWDP---YLRMT----LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
K + + R T T S E +L LR+P W ++ ++NG+
Sbjct: 423 SKAHNVYALKAPREQPRQTSHPEYTLSVNCEQPTEYTLTLRLPWWL-ADEPMITINGERQ 481
Query: 575 PLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+P P ++ W +NDKLTI LP +L+ + P + + A + GP +LAG
Sbjct: 482 RVPHTPSSYYHIRRTW-HNDKLTILLPKALQIVPL----PGASDMMAFMDGPIVLAG 533
>gi|21218915|ref|NP_624694.1| hypothetical protein SCO0371 [Streptomyces coelicolor A3(2)]
gi|5881940|emb|CAB55733.1| putative secreted protein [Streptomyces coelicolor A3(2)]
Length = 869
Score = 282 bits (721), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 201/562 (35%), Positives = 273/562 (48%), Gaps = 43/562 (7%)
Query: 91 GFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA 150
G PG L+ L V L S L ++T YL +D D L+ +FR LP+ +
Sbjct: 58 GAHRPGPLLEPFPLSAVRLLDSPFLANMRRT-CAYLRFVDPDRLLHTFRLNVGLPSAAEP 116
Query: 151 YGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT----- 205
GGWE P +LRGH GH LSA AQ A T +K +V +L+ECQ
Sbjct: 117 CGGWEAPDVQLRGHTTGHLLSALAQAHAGTGETAYADKARLLVSALAECQRAAPAAGFHR 176
Query: 206 GYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEY 261
GYLSAFP +FD EA WAPYYT+HKI+AGLLDQY L+ N +A L+MA W
Sbjct: 177 GYLSAFPESVFDQLEAGGKPWAPYYTLHKIMAGLLDQYRLSGNREAFDVLLEMAAW---- 232
Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
+ S ER L E GGMNDVL RL+ T DP HL A FD L
Sbjct: 233 ----TEARTAPLSRERMQSVLKVEFGGMNDVLARLHLETGDPVHLRTARRFDHDELYAPL 288
Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
A D L+ HANT I V+G+ YE TGD Y I F V HSYA GG S +E
Sbjct: 289 AAGRDELAGRHANTEIAKVVGAVPAYEATGDRRYLDIADTFWTTVVRHHSYAIGGNSNQE 348
Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQR- 439
+ P +A L E C +YNMLK+ R LFR E Y D+YE L N +L+ Q
Sbjct: 349 LFGPPDEIASRLSEVTCENCNSYNMLKLGRDLFRHDPERTEYLDHYEWTLYNQMLAEQDP 408
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGT-------KFNSFWCCYGTGIESFSKLGDSIYF 492
+ G + Y L G S+ G G+ +++F C +GTG+E+ +K D++YF
Sbjct: 409 DSAHGFVTYYTGLWAG-SRREPKGGLGSAPGSYSGDYDNFSCDHGTGLETHTKFADTVYF 467
Query: 493 EEEG-NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
G P L++ ++ S W V L Q D + + D R+T+T + +
Sbjct: 468 RTPGTRRPALHVNLFVPSEVCWDDLGVTLRQDTD-MPTGD-RTRLTVTGGEAR-----FA 520
Query: 552 LNLRMPVWTYSNGAQASL--NGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
L +R+ W + +A L NG+ PG + + T W D++ + LP +
Sbjct: 521 LRIRVAGWLAAGDGRAGLTVNGRRTGGRLEPGTYTTVTRHWRTGDRVELVLPRV----PV 576
Query: 609 QDDRPEYASIQAILFGPYLLAG 630
P+ ++A+ +GP +LAG
Sbjct: 577 WRPAPDNPQVKAVSYGPLVLAG 598
>gi|188991168|ref|YP_001903178.1| hypothetical protein xccb100_1772 [Xanthomonas campestris pv.
campestris str. B100]
gi|167732928|emb|CAP51124.1| Putative secreted protein [Xanthomonas campestris pv. campestris]
Length = 791
Score = 282 bits (721), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 201/612 (32%), Positives = 296/612 (48%), Gaps = 63/612 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
++ V L V L S L A TN YL+ L D L+ +F A L AYGGWE
Sbjct: 49 IRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWE--A 105
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE---- 214
+ GH +GHYLSA A M A T +A + + S +V L+ CQ +G GY++ F +
Sbjct: 106 DTIAGHTLGHYLSALALMHAQTDDAHCRTRASYLVAELARCQAHVGDGYVAGFTRKNAAG 165
Query: 215 -------LFDSFE--ALKPV-------WAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
+FD + ++P+ WAP YT HK+ AGLLD +V DNAQAL++A +
Sbjct: 166 QIESGRAVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVAL 225
Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
Y +Q + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFAALDDTQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHTVL 281
Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
L Q D L H H+NT+IP +IG YEVTGD FF + V HSY GG
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGNG 341
Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
RE++ P ++ L + E C++YNMLK++RHL++W + AY DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCSSYNMLKLTRHLYQWGPQAAYFDYYERTLLNHVMA-Q 400
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
+ G+ YM P+ G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+E+
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWEDG--- 452
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
G+ I Y+ S +G + P LR+ ++++ +L+LR+P
Sbjct: 453 QGVAINLYVPSRVRNAAGLDMTLHSALP-AQGSVSLRIDAAPAAQR------TLSLRVPG 505
Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
W + Q LNG + +L T W D L + L + LR EA DD P + S
Sbjct: 506 WAAAPVLQ--LNGAVVDAAAVDGYLRVTRTWHPGDTLNLSLQMPLRLEATPDD-PAWVS- 561
Query: 619 QAILFGPYLLA---GHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
+L GP +LA G + W KT ++ + P+ +G ++V
Sbjct: 562 --VLRGPLVLAADLGDAATPWSGKTPALIGGDEVLQQLQPA-----------AGQGSYVY 608
Query: 676 SNSNQSITMEEF 687
S+ Q F
Sbjct: 609 SDGAQQWRFSPF 620
>gi|333381736|ref|ZP_08473415.1| hypothetical protein HMPREF9455_01581 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829665|gb|EGK02311.1| hypothetical protein HMPREF9455_01581 [Dysgonomonas gadei ATCC
BAA-286]
Length = 775
Score = 282 bits (721), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 194/555 (34%), Positives = 285/555 (51%), Gaps = 49/555 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
LK SL DV L SS A + ++LL + D + FR + L YGGWE+
Sbjct: 35 LKPFSLSDVRL-TSSPFMSAMSLDEKWLLSFEPDRFLSGFRSESGLQPKAPKYGGWES-- 91
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSAFP----- 212
+ G GHYLSA + M+AST N + +++ + L CQ G G ++AFP
Sbjct: 92 QGVAGQTFGHYLSALSMMYASTGNEQLNDRIKYSINELDSCQQAFGMNGIVAAFPRAKGL 151
Query: 213 ----------TELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYF 262
TE FD L W P Y++HK+ AGL+D Y N QA K+ + +
Sbjct: 152 FTEISTGDIRTEGFD----LNGGWVPLYSMHKLFAGLIDVYEYTGNKQAYKIYINLAD-- 205
Query: 263 YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLA 322
V K+++ S E+ L E GG+N+ L +Y++T + K+L LA + L L+
Sbjct: 206 --GVDKMLSGLSDEQIQKILICEHGGINESLAEVYALTGNKKYLNLATRLNHKAVLDPLS 263
Query: 323 LQADYLSHFHANTHIPIVIGSQMRYEVTG-DPLYKLIGTFFMDIVNASHSYATGGTSARE 381
D L+ HANT IP VIG YE+TG D L+K FF + V SHSY GG S E
Sbjct: 264 KGVDELAGKHANTQIPKVIGVIREYELTGNDDLFK-TAEFFWNTVVHSHSYVIGGNSEAE 322
Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
+ R D + + E C TYNMLK+++HLF +I ADYYERAL N +L+ Q
Sbjct: 323 HFGVAGRTYDRITDKTCENCNTYNMLKLTKHLFSLQPDIQKADYYERALYNQILASQ-NP 381
Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
+ G++ YM PL G S G+ T F+SFWCC GTG+E+ ++ G+ IYF ++ L
Sbjct: 382 QDGMVCYMSPLAAG-----SRRGFSTPFDSFWCCVGTGLENHARYGEFIYFSDKDK--NL 434
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
+I +I S DWK ++V+ Q + S T+ + K + Q ++N+R P+W
Sbjct: 435 FINLFIPSKLDWKDRNMVIEQITNFPES------DTVRYKIKAKKTQEFTVNIRYPLWA- 487
Query: 562 SNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
+G +NG+ + + PGN++ T +W ND + LP L +EA D +++A
Sbjct: 488 QDGFSLFVNGKRVEINSSPGNYIQLTRKWKNNDDICYVLPKRLLSEAALGD----TNLRA 543
Query: 621 ILFGPYLLAGHTSGE 635
L+GP +L+ E
Sbjct: 544 YLYGPIVLSAVLDNE 558
>gi|330997549|ref|ZP_08321396.1| hypothetical protein HMPREF9442_02496 [Paraprevotella xylaniphila
YIT 11841]
gi|329570407|gb|EGG52138.1| hypothetical protein HMPREF9442_02496 [Paraprevotella xylaniphila
YIT 11841]
Length = 622
Score = 282 bits (721), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 182/549 (33%), Positives = 273/549 (49%), Gaps = 48/549 (8%)
Query: 94 LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG----- 148
LPG F + VW+ + + VD L+ FR TA +
Sbjct: 37 LPGRFRDNMMRDSVWM-----------------VSIGVDRLLHGFRTTAGIFAGREGGYM 79
Query: 149 --KAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTG 206
K GGWE+ ELRGH GH+LSA + M+A+T + K K ++V L+E Q +G G
Sbjct: 80 TVKKLGGWESLDCELRGHTTGHFLSALSLMYAATGSEVFKLKGDSLVAGLAEVQVALGNG 139
Query: 207 YLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRV 266
YLSAFP EL + VWAP+YT+HKI +GL+DQY+ A N QAL++ M ++ Y ++
Sbjct: 140 YLSAFPEELINRNIRATSVWAPWYTLHKIFSGLIDQYLYAGNTQALEVVRKMGDWAYAKL 199
Query: 267 QKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
+ + S E + E GG+N+ Y LY++T D ++ LA F + L Q D
Sbjct: 200 KPL----SEETRRKMIRNEFGGVNESFYNLYALTGDERYKWLAGFFYHNEVIDPLKAQKD 255
Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
L H NT IP V+ YE+TGD K + FF + H++A G +S +E ++
Sbjct: 256 DLGTKHTNTFIPKVLAEARNYELTGDADSKALSEFFWHTMIDRHTFAPGCSSDKEHYFPT 315
Query: 387 KRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
+ + ETC TYNMLK+SRHLF W ADYYERAL N +L Q+ G++
Sbjct: 316 DKFTAHISGYTGETCCTYNMLKLSRHLFCWDASPEVADYYERALYNHILG-QQDPASGMV 374
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
Y LPL G + S T NSFWCC G+G E+ +K ++IY+ + G+++ +
Sbjct: 375 AYFLPLQTGTHRVYS-----TPENSFWCCVGSGFENHAKYAEAIYYHDRD---GIFVNLF 426
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
I S W+ +VL Q D + + T+ +++ ++ LR P W+ S +
Sbjct: 427 IPSEVKWREKGLVLRQ--DTRFPEEGKVTFTVGLDEPKQL----TVRLRYPSWS-SEVSV 479
Query: 567 ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
+ PG+++ + RW D++ + LR E P+ A+L+GP
Sbjct: 480 KVNGKKVKVRQKPGSYILLSRRWKDGDRIEADYAMGLRLERT----PDGTERGALLYGPV 535
Query: 627 LLAGHTSGE 635
+LAG E
Sbjct: 536 VLAGELGTE 544
>gi|256377207|ref|YP_003100867.1| hypothetical protein Amir_3107 [Actinosynnema mirum DSM 43827]
gi|255921510|gb|ACU37021.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 771
Score = 281 bits (720), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 202/586 (34%), Positives = 287/586 (48%), Gaps = 60/586 (10%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPISELRGHFV 166
W+D Q L YL +D D L+++FR L T G A GWE P R H
Sbjct: 62 WMDN-------QNRALSYLRFVDPDRLLYNFRANHRLSTAGAAPLAGWEAPDFPFRTHSQ 114
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
GH+L+A AQ WA + T +++ + +V L++CQ GYLS FP D+ EA
Sbjct: 115 GHFLTAWAQAWAVLGDTTSRDRANHLVAELAKCQANNAAAGFTAGYLSGFPESDLDALEA 174
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVER 277
P YY +HK LAGLLD + + QA L+ A W V++ R+ + TM V
Sbjct: 175 GTPKAVSYYALHKTLAGLLDVWRHLGSTQARDVLLRFAGW-VDWRTARLSQA-TMQRV-- 230
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
L E GGMN VL LY T D + L A FD LA D L+ HANT +
Sbjct: 231 ----LATEFGGMNAVLADLYQQTGDARWLATAQRFDHAAAFDPLAANQDRLNGLHANTQV 286
Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
P IG+ Y+ TG Y+ I T +I A+H+Y GG S E + P +A L ++
Sbjct: 287 PKWIGAAREYKATGTTRYRDIATNAWNITVAAHTYVIGGNSQAEHFRAPNAIAAHLATDT 346
Query: 398 EETCTTYNMLKVSRHLFRWTKE---IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG 453
E C TYNMLK++R L W E AY D+YERAL N ++ Q + G + Y L
Sbjct: 347 AEACNTYNMLKLTREL--WLLEPTKAAYFDFYERALLNHLIGQQNPADAHGHICYFTGLN 404
Query: 454 RGVSKARSTHGWG-----TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
G + R+ WG T +++FWCC GTGIE+ +KL DSIYF + L + Y
Sbjct: 405 PGHRRGRTGPAWGGGTWSTDYSTFWCCQGTGIETNTKLADSIYFRDGTT---LTVNLYTP 461
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
S+ W + + Q S L +T + S ++ LR+P WT +GA +
Sbjct: 462 STLTWSERGITVTQSTTYPASDTTTLTVTGSASGSW------TMRLRIPAWT--SGATVA 513
Query: 569 LNG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
+NG QN+ PG++ S T W+ +D +T++LP+ + T P+ ++ A+ +GP
Sbjct: 514 VNGTPQNV-AAAPGSYASLTRSWTSDDTVTLRLPMRVTTAPA----PDNPNVVAVTYGPV 568
Query: 627 LLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNST 672
+LAG + + T +L AL + +TFT SG ST
Sbjct: 569 VLAG------NFGSTTLSALPALDVASITRTSTTALTFTARSGGST 608
>gi|329849035|ref|ZP_08264063.1| hypothetical protein ABI_21080 [Asticcacaulis biprosthecum C19]
gi|328844098|gb|EGF93667.1| hypothetical protein ABI_21080 [Asticcacaulis biprosthecum C19]
Length = 773
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 183/536 (34%), Positives = 264/536 (49%), Gaps = 44/536 (8%)
Query: 115 LWR-AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSAS 173
+WR A N YLL L+ D L+ +F K+A L G YGGWEN + GH +GHYL+A
Sbjct: 44 VWRDAVDANGHYLLSLEPDRLLHNFHKSAGLAPKGDIYGGWEN--MGIAGHSLGHYLTAL 101
Query: 174 AQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV-------- 225
+A T + K K+ V ++ Q G GY+ E + K V
Sbjct: 102 GLAYAQTRDPAYKAKLDYTVSEMAIIQKAHGDGYIGGTTVERDGKLQDGKIVYEEVRKHV 161
Query: 226 -----------WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYS 274
W P YT HK+ AGLLD + A+N QALK+A M +Y V+ S
Sbjct: 162 ITSHGFDLNGGWVPLYTWHKVHAGLLDAHRYANNGQALKIAIGMSDYLIG----VLGDLS 217
Query: 275 VERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHAN 334
E L E GG+N+ +Y T D ++L A L LA + D L HAN
Sbjct: 218 DEEMQKVLAAEHGGLNETYAEMYVRTGDKRYLDTARRIYHKAVLTPLAQRRDELEGKHAN 277
Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
T IP +IG YEVTGD Y ++F D V HSY GG SA E + P +L+ L
Sbjct: 278 TQIPKLIGLARLYEVTGDKAYGDTASYFWDRVIHHHSYVIGGNSAGEHFGAPDKLSGRLD 337
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ E+C TYNMLK++RHL++W + A+ DYYERA N +L+ Q + G +Y +PL
Sbjct: 338 DKTCESCNTYNMLKLTRHLYQWQPDAAWFDYYERAHLNHILAHQ-DPQTGAFVYFVPLAS 396
Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
G + S T SFWCC G+G+ES +K GDSI++ + G +Y +I S W
Sbjct: 397 GSQRLYS-----TPDTSFWCCVGSGMESHAKHGDSIWWRQAGGGDTVYANLFIPSELSWT 451
Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
+ D I+ +P +TF+ + +L +R+P W ++G + S+NG+N
Sbjct: 452 DKATKIALSGD-ILKGEP-----VTFTVTPQGTADFTLAIRVPKW--ADGPRLSVNGKNT 503
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
PL ++ W D + + LP +L+ E + P+ + A + GP ++AG
Sbjct: 504 PLLVKNGYVRVRRAWKAGDTVVLTLPHALKVETM----PDNPRLAAFIKGPMVMAG 555
>gi|294667526|ref|ZP_06732741.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292602646|gb|EFF46082.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 791
Score = 281 bits (718), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 205/627 (32%), Positives = 294/627 (46%), Gaps = 86/627 (13%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A QTN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRLTPSLFL-DALQTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + +NAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCENAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q V + +L+ E GG+N+ L+ T D + L LA
Sbjct: 222 AVALAGY----LQGVFAALDDAQLQKALSCEFGGLNESFVELHVQTGDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L+H H+NT+IP +IG YEVTGDP FF V H+Y
Sbjct: 278 HAVLDPLIAQRDALAHQHSNTNIPKLIGLAREYEVTGDPASGAAARFFWHTVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSG-----HVVLNQKVDPIVSWD---PYLRMTLTFSSKQEV 546
G+YI Y+ S+ +G H L ++ + D P RM
Sbjct: 452 G---QGVYINLYVPSTVRDAAGLNMTLHSALPEQGSASLRIDGAPPAQRM---------- 498
Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
L LR+P W + + LNGQ + +L T W D L + + LR E
Sbjct: 499 -----LALRVPGW--AQQPRLRLNGQPVDGSASDGYLRLTRVWQPGDTLQLSFDMPLRLE 551
Query: 607 AIQDDRPEYASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQ 660
A DD P + S +L GP +LA G + W KT T + + + P+P
Sbjct: 552 ATPDD-PAWVS---VLHGPLVLAVDLGDAAKPWSGKTPTLIGGQDILQRLQPVP------ 601
Query: 661 LVTFTQESGNSTFVMSNSNQSITMEEF 687
G + F S+ Q + F
Sbjct: 602 --------GKTAFTYSDGAQQWQLSPF 620
>gi|371778346|ref|ZP_09484668.1| hypothetical protein AnHS1_13085 [Anaerophaga sp. HS1]
Length = 796
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 188/546 (34%), Positives = 275/546 (50%), Gaps = 46/546 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L+EV L D A + N + LL + D L+ FR+ A L + YGGWE
Sbjct: 50 LEEVELLD------GPFLEASKLNEKILLNYEPDRLLAHFREQAHLKPKAQHYGGWEG-- 101
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELF 216
L GH +GHYLSA + M+ +T N ++++ +V L Q G GYL AF ++F
Sbjct: 102 ESLTGHSLGHYLSACSMMYKTTGNEEFLKRVNYIVNELDTVQKAHGDGYLGAFDNGKKIF 161
Query: 217 DSFEA----------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRV 266
+ A L +WAP YT HKI+AGL+D Y L N +AL++ + F + +
Sbjct: 162 EEEIANGNIRSAGFDLNGIWAPIYTQHKIMAGLMDAYKLCGNKKALEVE----QKFADWL 217
Query: 267 QKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
++ S E L+ E GG+N+ L+++T + ++L +A LF L LA D
Sbjct: 218 GSIVENLSHEEIQKMLHCEHGGINEAYAELFAVTGNERYLKIARLFHHEAVLDPLAKGID 277
Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
L HANT IP +IG YE+TGD + FF + V HSY TGG E++ P
Sbjct: 278 ILPGHHANTQIPKIIGLSRLYELTGDTTDRKTAQFFWERVVYHHSYVTGGNGDHEYFGPP 337
Query: 387 KRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
L++ L S ETC YNMLK+S HLF+W E ADYYERAL N +LS Q + G +
Sbjct: 338 DTLSNRLSSNTTETCNVYNMLKLSNHLFKWEAEAEVADYYERALFNHILSSQH-PQSGHV 396
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
IY L L G K + F F CC GTG+E+ +K +IYF N L++ Q+
Sbjct: 397 IYNLSLEMGGHKH-----YQNPF-GFTCCVGTGMENHAKYPKNIYFH---NDRELFVSQF 447
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
I+S +WK + L Q + + + F ++ V + L +R P W G
Sbjct: 448 IASRLNWKEKGLKLTQN----TRYPDEQKTSFIFECEKPVDLI--LQIRYPYWA-EKGMI 500
Query: 567 ASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
++NG+ + P +F++ W DK+ + P SLR EA+ D++ A+++GP
Sbjct: 501 VTVNGKKVSYSQKPQSFVAIHREWKTGDKVEVSFPFSLRLEAMPDNKDRV----ALMYGP 556
Query: 626 YLLAGH 631
+LAG
Sbjct: 557 LVLAGQ 562
>gi|381203003|ref|ZP_09910112.1| hypothetical protein SyanX_20925 [Sphingobium yanoikuyae XLDN2-5]
Length = 790
Score = 280 bits (716), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 179/533 (33%), Positives = 266/533 (49%), Gaps = 44/533 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A + N YLL L+ D L+ +FRK A L G YGGWEN + GH +GHYL+A A M
Sbjct: 51 AVEGNRRYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWEN--DTIAGHTLGHYLTALALMH 108
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------------- 221
A T +A + + ++ L+ECQ G GY++ F D E
Sbjct: 109 AQTGDAECARRAAYIIAELAECQAAAGDGYVAGFTRRRDDVIEDGRLIFPEIMRGDIRSA 168
Query: 222 ---LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
L W P+Y HK+ AGL D N+QA +A + Y + V +
Sbjct: 169 GFDLNGCWVPFYNWHKLFAGLFDAESHLGNSQARGVALALAAY----IDGVFAKLDDAQV 224
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
L+ E GG+N+ L++ T DP+ L LA L LA + + L HANT IP
Sbjct: 225 QQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDPLAQRQNSLPWIHANTQIP 284
Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
+IG +E+TG+ + FF + V +SY GG + RE++ DP ++ + +
Sbjct: 285 KLIGLARLHEITGNAADAIAANFFWETVVGQYSYVIGGNADREYFPDPGTISKHITEQTC 344
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C +YNMLK++RHL+ W E DYYERA N +L+ Q G+ YM+PL G +
Sbjct: 345 ESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQ-NPATGMFAYMVPLMSGSHR 403
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ-YISSSFDWKSGH 517
W F+ FWCC G+G+ES +K G+SI++E+ + I YI S DW +
Sbjct: 404 V-----WSEPFDDFWCCVGSGMESHAKHGESIWWEDADRPADMLIANLYIPSEADWAARG 458
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
L +++ +D ++ L+ G+ +L LR+P W GA+ ++NG LP P
Sbjct: 459 AKL--RIESGYPFDGHI--ALSIPKLARAGRF-TLALRIPGWC--QGARVAVNGTPLPAP 511
Query: 578 PPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
+ + +R W D++T+ LP++LR EA DD A A+L GP +LA
Sbjct: 512 RIADGYALIDRKWKAGDQVTLDLPMALRIEATPDD----ARTIALLHGPVVLA 560
>gi|374321589|ref|YP_005074718.1| hypothetical protein HPL003_08660 [Paenibacillus terrae HPL-003]
gi|357200598|gb|AET58495.1| hypothetical protein HPL003_08660 [Paenibacillus terrae HPL-003]
Length = 755
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 205/598 (34%), Positives = 305/598 (51%), Gaps = 58/598 (9%)
Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
K LH V +D S L+ A + N YLL L+ D L+ FR+ A L Y GWE
Sbjct: 6 KAFDLHKVSID-SGPLYHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--AR 62
Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFD 217
+ GH +GHYLS A M+AST + + E+++ VV L CQN G GY+S P ELF+
Sbjct: 63 GISGHTLGHYLSGCALMFASTGDERLLERVNYVVNELEICQNNHGNGYISGIPRGKELFE 122
Query: 218 SFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFY- 263
+A L W P YT+HK+ AGL D ++LA + +AL+M W+ + F
Sbjct: 123 EVKAGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLARHPKALQMEIKLGDWLEDVFKG 182
Query: 264 ---NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF 320
++VQ+V L+ E GGMN+VL L + + + L LA F L
Sbjct: 183 LNDDQVQQV------------LHCEFGGMNEVLTDLAEHSGEERFLRLAERFYHGEVLND 230
Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR 380
LA D L+ HANT IP +IG+ +YE+TG P Y + FF + V HSY GG S
Sbjct: 231 LADSRDTLAGRHANTQIPKIIGAARQYEMTGKPQYADLSRFFWERVVHKHSYVIGGNSYN 290
Query: 381 EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRG 440
E + +P +L D LG ETC TYNMLK++RH+F W AYADYYERA+ N +L+ Q+
Sbjct: 291 EHFGEPGKLNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQP 350
Query: 441 TEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
+ G + Y + L G K+ + ++++ F CC G+G+ES S G +IYF +
Sbjct: 351 VD-GRVCYFVSLEMGGHKS-----FNSQYDDFTCCVGSGMESHSMYGTAIYFHTPETI-- 402
Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
Y+ QY+ S+ W+ V L Q+ + R TL SK+ +L ++ LR P W
Sbjct: 403 -YVNQYVPSTVTWEEMDVQLKQE----TLFPQNGRGTLRVISKEP--KLFTIKLRCPHWA 455
Query: 561 YSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
G +NG+ P +++ W+ D + +P+++R E + P+
Sbjct: 456 -EQGMMIKINGEEYATEACPTSYVVIEREWNDADTIEYDIPMTVRIEEM----PDNPRRI 510
Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN 677
A ++GP +LAG G K+ R L++++ S +L+ E +TF M++
Sbjct: 511 AFMYGPLVLAGDL-GPVTPKSNEERLLASVLIGAADSLTTKLIADGNEP--NTFRMND 565
>gi|390452646|ref|ZP_10238174.1| hypothetical protein PpeoK3_01345 [Paenibacillus peoriae KCTC 3763]
Length = 767
Score = 279 bits (714), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 185/552 (33%), Positives = 283/552 (51%), Gaps = 39/552 (7%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENP 157
+KE V L++ S A L+++ ++ D ++++FR+ A++ T G + GW+ P
Sbjct: 191 VKEFKGQKVSLERESEFEAAMNRFLQFVRSVNDDQMLYNFREAAAIDTKGAQPMTGWDAP 250
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI------GTGYLSAF 211
L+GH GHYLSA A + +T ++ + K+ +V L +CQ + G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYNATEDSALLGKIQYMVVELGKCQTALSEQAGYGRGFLSAY 310
Query: 212 PTELFDSFE---ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
E F+ E +WAPYYT+HKI+AGLLD Y LA +AL + + + +NR+ +
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALDICDKLGHWLHNRLGR 370
Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ + + W + E GGMN+VL +LY+IT + +L+ A FD + D
Sbjct: 371 -LPREQLHKMWSLYIAGEFGGMNEVLAKLYAITGNKNYLMTAKYFDNEKLFLPMKENVDT 429
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L + HAN HIP VIG+ +EV GD Y I F +V SH Y GGT E + +P
Sbjct: 430 LGNTHANQHIPQVIGALKLFEVAGDEAYFNIAENFWTMVTQSHIYPIGGTGETEMFREPD 489
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP-GVM 446
+A L + ETC +YNMLK+++ LF++ Y DYYE+AL N +L+ + + G
Sbjct: 490 AIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEGGS 549
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
Y +PL G K TH CC+GTG+E+ K ++IYF +E LY+ Y
Sbjct: 550 TYFMPLAPGSIKKFDTH-------ENTCCHGTGLENHFKYQEAIYFHDEDR---LYVNLY 599
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
I S DW + L QK D D + E ++L R+P W S Q
Sbjct: 600 IPSRLDWSDQGLSLVQKRDS----DGLETVRFYIEGVPE----TTLMFRIPDWI-SEPVQ 650
Query: 567 ASLNGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+NG+ L +L + W D++ + LP SLR D P+ +++++ +GP
Sbjct: 651 VKINGEPCRDLEYEDGYLKLRKVWK-KDEIELTLPCSLRLA----DAPDDHTLKSLAYGP 705
Query: 626 YLLAGHTSGEWD 637
Y+LA SGE D
Sbjct: 706 YVLAA-ISGEQD 716
>gi|325919533|ref|ZP_08181551.1| hypothetical protein XGA_0490 [Xanthomonas gardneri ATCC 19865]
gi|325549987|gb|EGD20823.1| hypothetical protein XGA_0490 [Xanthomonas gardneri ATCC 19865]
Length = 791
Score = 279 bits (714), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 196/575 (34%), Positives = 276/575 (48%), Gaps = 65/575 (11%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A QTN YL+ L+ D L+ +F A L AYGGW
Sbjct: 46 PGS-IRAVPLAQVRLTPSLFL-DALQTNRRYLMRLEPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAHYLVAELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DNAQAL++
Sbjct: 162 NAAGKIESGRAVFDELKKGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q V + + L+ E GG+N+ L+ T D + L LA
Sbjct: 222 AVGLAGY----LQAVFSALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P + L + E C +YNMLK++RHL++W + + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSTSKFLTEQTCEHCASYNMLKLTRHLYQWGPQAEFFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM P+ G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL------TFSSKQEVGQ 548
G+Y+ Y+ SS V D LR T+ + +
Sbjct: 452 G---QGVYVNLYVPSS-------------VRDAAGLDMTLRSTMPEQGSASLRVDAAPAE 495
Query: 549 LSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
+L LR+P W S Q LNGQ + +L T W D L + + LR EA
Sbjct: 496 QRTLALRVPGWAQSPVLQ--LNGQPVGAAVSDGYLRITRVWRAGDTLDLSFEMPLRLEAA 553
Query: 609 QDDRPEYASIQAILFGPYLLA---GHTSGEWDIKT 640
DD P + S +L GP +LA G + W KT
Sbjct: 554 ADD-PAWVS---VLRGPLVLAADLGDAAKPWSGKT 584
>gi|374372949|ref|ZP_09630610.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
gi|373235025|gb|EHP54817.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
Length = 653
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 184/541 (34%), Positives = 278/541 (51%), Gaps = 39/541 (7%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-------KAY 151
L EV L D ++ + R Q +LL + + SL+ SF A + K Y
Sbjct: 57 LSEVKLLDSRFKEN--MLREQH----WLLAISLKSLLHSFYTNAGMYDANEGGYDEIKKY 110
Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-TGYLSA 210
GWE+ ELRGH GH LS A M+AST K K T++ +L+ Q + GY+SA
Sbjct: 111 AGWESMDCELRGHSTGHILSGLALMYASTGEQIYKSKGDTIIKALAAIQKTLNQNGYISA 170
Query: 211 FPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
FP E + + VWAP+YT+HKILAG+LDQY+ +N QAL +A + Y ++ +
Sbjct: 171 FPQEFINRNIRGEKVWAPWYTLHKILAGVLDQYLYCNNDQALDIAKNFSAWAYKKLHPL- 229
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
+ + L E GGMN+V + LY+IT D K L + F L L D L
Sbjct: 230 ---TAGQRTLMLRNEFGGMNEVFFNLYAITGDEKDKWLGNFFYDNRMLDPLKAGIDNLKG 286
Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
HANT+IP ++G YE+ G+ + FF V HS+ATG S RE ++ P ++
Sbjct: 287 AHANTYIPKLLGVTRDYEIEGNAGGDAVVRFFWQRVTTHHSFATGSNSDREHFFQPDAIS 346
Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
L E+C YNMLK++RHL+ + + YADYYE+AL N +L Q+ G++ Y L
Sbjct: 347 THLTGYTGESCNVYNMLKLTRHLYIHSGNVKYADYYEKALFNHILG-QQDPATGMIAYFL 405
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
P+ G K ST +SFWCC GTG E+ +K G+ IY+ + + LYI +I S
Sbjct: 406 PMLPGAHKVYSTPD-----SSFWCCVGTGFENQAKYGEGIYYHTQND---LYINLFIPSD 457
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
+WK L Q+ D ++ T+ + + + ++N+R P W + ++N
Sbjct: 458 LNWKEKSFRLMQQTK--FPEDGNMKFTIDEAPEFPL----TINIRYPDWV-AGRPTITIN 510
Query: 571 GQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
G+++ + + ++S W ND++ + + LRT D+ S+ AI +GP +LA
Sbjct: 511 GRSIKIEQAADSYISIKRIWKKNDRIEVNYRMQLRTIPANDN----PSVAAIAYGPVVLA 566
Query: 630 G 630
G
Sbjct: 567 G 567
>gi|443291943|ref|ZP_21031037.1| Conserved secreted hypothetical protein [Micromonospora lupini str.
Lupac 08]
gi|385885131|emb|CCH19144.1| Conserved secreted hypothetical protein [Micromonospora lupini str.
Lupac 08]
Length = 778
Score = 279 bits (713), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 194/576 (33%), Positives = 277/576 (48%), Gaps = 55/576 (9%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
WLD Q L YL +DVD ++++FR L T G A GGW+ P R H
Sbjct: 65 WLDN-------QNRTLNYLRFVDVDRMLYNFRANHRLSTNGAATNGGWDAPNFPFRTHMQ 117
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
GH+L+A AQ +A + T ++K + +V L++CQ G GYLS FP F + EA
Sbjct: 118 GHFLTAWAQAYAVLGDTTCRDKANYMVAELAKCQANNGAAGFGAGYLSGFPESDFSALEA 177
Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
L PYY IHK LAGLLD + N QA + + + R ++ S +
Sbjct: 178 RTLSNGNVPYYCIHKTLAGLLDVWRYTGNTQARTVLLALAGWVDTRTSRL----SSSQMQ 233
Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
L E GGMNDVL +Y +T D + L A FD LA D L+ HANT +P
Sbjct: 234 SMLGTEFGGMNDVLTEIYQMTGDSRWLTTAQRFDHASVFNPLANNQDQLNGLHANTQVPK 293
Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE 399
+G+ ++ TG Y+ I + +I +H+Y GG S E + P +A L ++ E
Sbjct: 294 WVGAAREFKATGTTRYRDIASNAWNITVRAHTYVIGGNSQAEHFRAPNAIAGYLSNDTCE 353
Query: 400 TCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQRGTE-PGVMIYMLPLG---- 453
C TYNMLK++R L+ Y DYYERA N ++ Q + G + Y PL
Sbjct: 354 QCNTYNMLKLTRELWLLDPSRTDYFDYYERATINHLIGAQNPADSKGHITYFTPLKPGGR 413
Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
RGV A W T +NSFWCC GTG+E +KL DSIYF L + ++ S +W
Sbjct: 414 RGVGPAWGGGTWSTDYNSFWCCQGTGVEINTKLMDSIYFYSG---TTLTVNLFVPSELNW 470
Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
+ + Q VS L + T S S+ +R+P WT NGA S+NG
Sbjct: 471 SQRGITVTQSTTYPVSDTTTLTLGGTMSGSW------SVRVRIPAWT--NGATVSVNGVE 522
Query: 574 LPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
+ PG++ + T W+ D +T++LP+ + + D+ +SI A+ +GP +LAG+
Sbjct: 523 QSVATTPGSYATVTRTWAAGDTITVRLPMRVVVQPTNDN----SSIAAVTYGPSVLAGNY 578
Query: 633 SGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQES 668
+LSA PP+ N + T S
Sbjct: 579 GNS---------TLSA-----PPALNVSSIARTSTS 600
>gi|307110572|gb|EFN58808.1| hypothetical protein CHLNCDRAFT_56904 [Chlorella variabilis]
Length = 937
Score = 278 bits (712), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 149/322 (46%), Positives = 197/322 (61%), Gaps = 7/322 (2%)
Query: 124 EYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNA 183
+YLL L+ D L+++FRK A LPTPG +YGGWE SE+RG F+GHY+SA A T
Sbjct: 51 QYLLALEPDRLLFNFRKNAGLPTPGASYGGWEWSESEVRGQFIGHYMSAVAFAALHTGRT 110
Query: 184 TIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQY 243
++ +V L + Q+ G GYLSAFP FD EAL+PVWAPYY IHKI+AGLLDQ+
Sbjct: 111 EFYDRSKLMVHELKKVQDAFGNGYLSAFPESHFDRLEALQPVWAPYYVIHKIMAGLLDQH 170
Query: 244 VLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWY-SLNEETGGMNDVLYRLYSITHD 302
LA +ALKMA M YF R Q+V + E +WY L E GGMN+VLY L+++T D
Sbjct: 171 QLAGTDEALKMAEQMASYFCGRAQRV-RENNGEDYWYRCLENEFGGMNEVLYNLFAVTAD 229
Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
H AH FDKP F L D L HANTH+ V G RYE GD F
Sbjct: 230 DHHAECAHWFDKPVFYRPLVEGTDPLPGLHANTHLAQVQGFAARYEHLGDEEAMAAVRNF 289
Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN-----EETCTTYNMLKVSRHLFRWT 417
++ H+++TGG++ E W + LA+ + + + EE+CT YN+LK++R+LFR T
Sbjct: 290 FALILQHHTFSTGGSNWYERWGNEDSLAEAINNTDASRITEESCTQYNILKLARYLFRHT 349
Query: 418 KEIAYADYYERALTNGVLSIQR 439
+ A AD+YERA+ N V+ IQ+
Sbjct: 350 GDPALADFYERAILNDVIGIQK 371
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 102/242 (42%), Gaps = 63/242 (26%)
Query: 424 DYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESF 483
D Y A N V + PGV IY LPLG G K WGT +++FWCCYGT +ESF
Sbjct: 441 DPYAAAHANSV----QPAGPGVYIYYLPLGVGHDK-----NWGTPWDTFWCCYGTAVESF 491
Query: 484 SKLGDSIYFEE---------------EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
S L SIYF+ ++P L++ Q +SSS W+
Sbjct: 492 SSLAGSIYFKHMPGTAPSASSSGPTAAEDLPQLFVNQMVSSSVHWRE------------- 538
Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN--------------- 573
L + + + + Q LN R+P W + +NG+
Sbjct: 539 -----LGVEGSANGDKPQAQF-VLNWRVPGWAKGDEVMLRVNGKEYLECAQGAAAAAHDA 592
Query: 574 LPLPPP-----GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
L PP F S WS D + +P+ + TE + D R S++AI+ GP+++
Sbjct: 593 LGFQPPQFGAGARFCSLGSTWSDGDVVEADMPMWVVTEDLNDSRKAMQSLKAIMMGPFVM 652
Query: 629 AG 630
AG
Sbjct: 653 AG 654
>gi|310639749|ref|YP_003944507.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus polymyxa
SC2]
gi|386038950|ref|YP_005957904.1| hypothetical protein PPM_0260 [Paenibacillus polymyxa M1]
gi|309244699|gb|ADO54266.1| Acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus polymyxa
SC2]
gi|343094988|emb|CCC83197.1| DUF1680 domain containing protein [Paenibacillus polymyxa M1]
Length = 751
Score = 278 bits (711), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 190/540 (35%), Positives = 281/540 (52%), Gaps = 39/540 (7%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
LH V +D S L+ A + N YLL L+ D L+ FR+ A L Y GWE +
Sbjct: 7 DLHKVSID-SGPLYHAMELNTTYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--ARGIS 63
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
GH +GHYLS A M+AST + + E+++ V+ L CQN G GY+S P E+F+ +
Sbjct: 64 GHTLGHYLSGCALMFASTGDKRLLERVNYVIDELEICQNSHGNGYISGIPRGKEIFEEVK 123
Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
A L W P YT+HK+ AGL D ++LA + +AL M + ++ ++ V
Sbjct: 124 AGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLAHHPKALAMEIQLGDW----LEDVFQ 179
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
S E+ L+ E GGMN+VL L + + + L LA F L LA D L+
Sbjct: 180 GLSDEQVQQVLHCEFGGMNEVLTDLAEHSGEKRFLNLAERFYHGEVLNDLADSRDTLAGR 239
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
HANT IP +IG+ ++EVTG PLY + FF D V HSY GG S E + +P +L D
Sbjct: 240 HANTQIPKIIGAARQFEVTGKPLYADLSRFFWDRVVHKHSYVIGGNSYNEHFGEPGKLND 299
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
LG ETC TYNMLK++RH+F W AYADYYERA+ N +L+ Q+ + G + Y +
Sbjct: 300 RLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-GRVCYFVS 358
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
L G K+ + +++ F CC G+G+ES S G +IYF + Y+ QY+ S+
Sbjct: 359 LEMGGHKS-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTANTI---YVNQYVPSTV 410
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
W ++ L Q+ + R TL SK+ + ++ LR P W G + +NG
Sbjct: 411 TWDEMNIQLKQE----TLFPQNGRGTLHLISKEP--KFFTIKLRCPHWA-EQGMKIKING 463
Query: 572 QNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ P +++ W D + +P+++R E + P+ A ++GP +LAG
Sbjct: 464 EEYAAEACPTSYIVIEREWKDGDTVEYDIPMTVRVEEM----PDNPRRIAFMYGPLVLAG 519
>gi|384428325|ref|YP_005637684.1| hypothetical protein XCR_2693 [Xanthomonas campestris pv. raphani
756C]
gi|341937427|gb|AEL07566.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
756C]
Length = 791
Score = 278 bits (711), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 194/565 (34%), Positives = 279/565 (49%), Gaps = 52/565 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
++ V L V L S L A TN YL+ L D L+ +F A L AYGGWE
Sbjct: 49 IRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWE--A 105
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE---- 214
+ GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 106 DTIAGHTLGHYLSALALMHAQTDDAQCRTRARYLVAELARCQAHAGDGYVAGFTRKNAAG 165
Query: 215 -------LFDSFE--ALKPV-------WAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
+FD + ++P+ WAP YT HK+ AGLLD +V DNAQAL++A +
Sbjct: 166 QIESGRAVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVAL 225
Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
Y +Q + + L+ E GG+N+ L+ T + L LA
Sbjct: 226 AGY----LQGIFAALDDTQLQKVLSCEFGGLNESFVELHVRTGHAQWLALAQRLHHHAVF 281
Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
L Q D L H H+NT+IP +IG YEVTGD FF + V HSY GG
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGNG 341
Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
RE++ P ++ L + E C++YNMLK++RHL+RW + AY DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCSSYNMLKLTRHLYRWGPQAAYFDYYERTLLNHVMA-Q 400
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
+ G+ YM P+ G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+E+
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWEDG--- 452
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
G+ I Y+ S +G + P LR+ ++++ +L+LR+P
Sbjct: 453 QGVAINLYVPSRVRNAAGLDMTLHSALP-AQGSVSLRIDAAPAAQR------TLSLRVPG 505
Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
W + Q LNG + P +L T W D L + L + LR EA DD P + S
Sbjct: 506 WAATPVLQ--LNGAVVDAAPVDGYLRVTRIWHPGDTLDLSLHMPLRLEATPDD-PAWVS- 561
Query: 619 QAILFGPYLLA---GHTSGEWDIKT 640
+L GP +LA G + W KT
Sbjct: 562 --LLRGPLVLAADLGDAATPWSGKT 584
>gi|374313035|ref|YP_005059465.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358755045|gb|AEU38435.1| protein of unknown function DUF1680 [Granulicella mallensis
MP5ACTX8]
Length = 798
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 195/594 (32%), Positives = 285/594 (47%), Gaps = 61/594 (10%)
Query: 100 KEVSLHDVWLDQSSV------LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGG 153
++V L V L SSV L RAQ + +YLL L + ++ R+ A+L + YGG
Sbjct: 28 QKVQLKAVPLPFSSVRLTGGPLKRAQDLDAQYLLDLQPERMLARLRQRANLAPKAEGYGG 87
Query: 154 WENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-- 211
W+ +L GH GHYLSA + M+A+T + K + V L QN G GY+ A
Sbjct: 88 WDGDGRQLTGHIAGHYLSAISMMYATTGDVRFKNRADDFVTELQNIQNAQGDGYIGALLD 147
Query: 212 --------------PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQAL----K 253
E+ L +W+P+Y HK+ AGL D Y L N +AL K
Sbjct: 148 AKGVDGKVRFQDLSKGEIHSGGFDLNGLWSPWYVEHKLFAGLRDAYHLTGNRKALDVEIK 207
Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
A W + ++ S E+ L E GGMN+VL LY+ T+DP+ L L+ F+
Sbjct: 208 FAGW--------AETIVGHLSDEQLQRMLATEFGGMNEVLADLYADTNDPRWLKLSDKFE 259
Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYA 373
+ L+ D L+ HANT IP +IG RY TGD FF D V+ HS+A
Sbjct: 260 HHAIVDPLSRGQDILAGKHANTQIPKMIGELARYVYTGDETDGKAAMFFFDEVSEHHSFA 319
Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
TGG E++ P ++ D + E+C YNM+K++R LF + YAD+ ERA N
Sbjct: 320 TGGDGKNEYFGQPDKMNDMIDGRTAESCAAYNMIKMARDLFSLDPQARYADFIERADLNA 379
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
+L Q E G + YM+P+GRGV H + KF SF CC G+ +E+ + IY
Sbjct: 380 ILGGQ-DPEDGRVSYMVPVGRGVQ-----HEYQDKFESFTCCVGSQMETHAFHAYGIY-S 432
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
E GN L++ QY ++ DW S + L + + L++T S K +V ++
Sbjct: 433 ESGNK--LWVSQYDPTTVDWASQGMKLEMVTNLPMGDSAALKIT---SGKTKV---FTIA 484
Query: 554 LRMPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
LR P W + G +NG+ L P ++ +W D + I LP +LR EA+
Sbjct: 485 LRRPYWVGA-GFSVKVNGETLQNTSTPDTYIEINRKWKVGDTVEIVLPKTLRKEAL---- 539
Query: 613 PEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQ 666
P+ + AI++GP +LAG D+ +R S + P L+T Q
Sbjct: 540 PDNPNRMAIMWGPLVLAG------DLGPEVSRRHSGGQGGVAPEPAPALITAEQ 587
>gi|325927064|ref|ZP_08188334.1| hypothetical protein XPE_2335 [Xanthomonas perforans 91-118]
gi|325542563|gb|EGD14035.1| hypothetical protein XPE_2335 [Xanthomonas perforans 91-118]
Length = 791
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 197/619 (31%), Positives = 293/619 (47%), Gaps = 70/619 (11%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + + + L+ E GG+N+ L+ T+D + L LA
Sbjct: 222 AVGLAGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L+H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
G+Y+ Y+ S +G L+ + + + + + ++ +L L
Sbjct: 452 G---QGVYVNLYVPSMVHDAAG---LDMTLHSALPEQGSASLRIDAAPAEQ----RTLAL 501
Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
R+P W Q LNGQ + +L T W D L++ + LR EA DD P
Sbjct: 502 RVPGWAQQPRLQ--LNGQPVDTAASDGYLRITRVWQRGDTLSLAFDMPLRLEATSDD-PA 558
Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQLVTFTQES 668
+ S +L GP +LA G + W KT + + + P+P
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKTPALIGGQDILQRLQPVP-------------- 601
Query: 669 GNSTFVMSNSNQSITMEEF 687
G + FV ++ Q + F
Sbjct: 602 GKTAFVYNDGVQQWQLSPF 620
>gi|427384529|ref|ZP_18881034.1| hypothetical protein HMPREF9447_02067 [Bacteroides oleiciplenus YIT
12058]
gi|425727790|gb|EKU90649.1| hypothetical protein HMPREF9447_02067 [Bacteroides oleiciplenus YIT
12058]
Length = 777
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 188/539 (34%), Positives = 267/539 (49%), Gaps = 55/539 (10%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A++ YLL L+ D + FR A L Y GWE+ + G +GHY+SA A +
Sbjct: 51 AEEKEATYLLELEPDRFLSGFRSEAGLVPKAPKYEGWES--LGVAGQTLGHYMSACAMYY 108
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVW 226
A++ + +K+ ++ L CQ G GYL+A P ++F A L W
Sbjct: 109 ATSGDERFLQKLEYIINELDSCQQANGNGYLAATPGGKKIFAEVSAGNIYSQGFDLNGGW 168
Query: 227 APYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFY----NRVQKVITMYSVERH 278
P Y +HK+LAGL+D Y A + QAL K+A WM FY +++QKV+
Sbjct: 169 VPLYVMHKVLAGLIDAYQYARSEQALRIAEKLADWMYGTFYHLTEDQMQKVLAC------ 222
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLGFLALQADYLSHFHANTHI 337
E GGMN+ L LY+ T + K LLLA FD + LA+ D L HANT +
Sbjct: 223 ------EFGGMNEALANLYAYTKNDKFLLLAQRFDNHKAIMDSLAIGVDDLEGKHANTQV 276
Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
P +IG+ YE+TG I +FF V +HSY GG S E + P++L + L + N
Sbjct: 277 PKMIGAARLYELTGSKRDSSIASFFWHTVVDNHSYVNGGNSDGEHFGTPRKLNERLSTSN 336
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
ETC TYNMLK++RHLF W Y+ YYERA+ N +L+ Q + G+ Y PL G
Sbjct: 337 TETCNTYNMLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGK 395
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
K G+ + F SF CC G+G+E+ K GD IY EG+ L++ +I S W +
Sbjct: 396 K-----GYLSPFQSFCCCSGSGMENHVKYGDFIY--SEGSDSSLFVNLFIPSRLTWTARD 448
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
+++ Q D P T+ + K E+ Q LR P W S +NG+++ L
Sbjct: 449 LIVTQDTDI-----PSSNKTV-LTVKTEMPQSVVFRLRYPEWAES--MSLKVNGKSVSLK 500
Query: 578 PPG-NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
G N++S W NDKL I + T A+ D+ + +GP LLAG E
Sbjct: 501 ASGNNYVSIEREWKDNDKLEITFGIKFYTVAMPDNEKRV----GLFYGPVLLAGELGQE 555
>gi|294624781|ref|ZP_06703443.1| secreted protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB
11122]
gi|292600913|gb|EFF44988.1| secreted protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB
11122]
Length = 791
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 194/569 (34%), Positives = 277/569 (48%), Gaps = 53/569 (9%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+F + V L V L S L A TN YL+ L+ D L+ +F A L AYGGW
Sbjct: 46 PGSF-RAVPLAQVRLTPSLFL-DALHTNRRYLMRLEPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVAELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD L WAP YT HK+ AGLLD + DNAQAL++
Sbjct: 162 NAAGKIESGRAVFDELRRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + + L+ E GG+N+ L+ T D + L LA
Sbjct: 222 AVSLAGY----LQGIFAALDDAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ + + E C +YNMLK++RHL++W + + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFVTEQTCEHCASYNMLKLTRHLYQWGPQAEFFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
L+ Q+ G+ YM P+ G ++A W + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 LA-QQHPRTGMFTYMTPMLAGEARA-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
G+Y+ Y+ SS +G + + P LR+ + + ++ L L
Sbjct: 452 G---QGVYVNLYVPSSVRDAAGLDMTLRSTMPEQG-SASLRIDVAPAEQR------MLAL 501
Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
R+P W S Q LNGQ + +L W D LT+ + LR EA DD P
Sbjct: 502 RLPGWAQSPRLQ--LNGQPVDTTVNEGYLRIARFWRAGDTLTLSFEMPLRLEATTDD-PA 558
Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
+ S +L GP +LA G + W KT
Sbjct: 559 WVS---VLRGPLVLAADLGAAAKPWSGKT 584
>gi|398384929|ref|ZP_10542957.1| hypothetical protein PMI04_02662 [Sphingobium sp. AP49]
gi|397722209|gb|EJK82754.1| hypothetical protein PMI04_02662 [Sphingobium sp. AP49]
Length = 802
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 178/533 (33%), Positives = 266/533 (49%), Gaps = 44/533 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A + N YLL L+ D L+ +FRK A L G YGGWEN + GH +GHYL+A A M
Sbjct: 63 AVEGNRLYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWEN--DTIAGHTLGHYLTALALMH 120
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------------- 221
A T +A + + ++ L+ CQ G GY++ F D E
Sbjct: 121 AQTGDAECARRAAYIIDELAACQAAAGDGYVAGFTRRRDDVIEDGRLIFPEIMRGDIRSA 180
Query: 222 ---LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
L W P+Y HK+ AGL D N+QA +A + Y + V +
Sbjct: 181 GFDLNGCWVPFYNWHKLFAGLFDAEAHLGNSQARGVALALAAY----IDGVFAKLDDAQV 236
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
L+ E GG+N+ L++ T DP+ L LA L LA + + L HANT IP
Sbjct: 237 QQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDPLAQRQNSLPWIHANTQIP 296
Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
+IG +E+TG+ + FF + V +SY GG + RE++ DP ++ + +
Sbjct: 297 KLIGLARLHEITGNAADAIAANFFWETVVGQYSYVIGGNADREYFPDPGTISKHITEQTC 356
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C +YNMLK++RHL+ W E DYYERA N +L+ Q G+ YM+PL G +
Sbjct: 357 ESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQNPAT-GMFAYMVPLMSGSHR 415
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ-YISSSFDWKSGH 517
W F+ FWCC G+G+ES +K G+SI++E+ + I YI S DW +
Sbjct: 416 V-----WSEPFDDFWCCVGSGMESHAKHGESIWWEDTDRPADMLIANLYIPSEADWAARG 470
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
L +++ +D ++ L+ + G+ +L LR+P W GA+ ++NG LP P
Sbjct: 471 AKL--RIETGYPFDGHI--ALSIPTLARAGRF-TLALRIPGW--CQGARVAVNGTPLPTP 523
Query: 578 PPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
+ + +R W D++T+ LP++LR EA DD A A+L GP +LA
Sbjct: 524 RIVDGYALIDRKWKAGDQVTLDLPMALRVEATPDD----ARTIALLHGPVVLA 572
>gi|78048280|ref|YP_364455.1| hypothetical protein XCV2724 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78036710|emb|CAJ24403.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 791
Score = 276 bits (706), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 197/619 (31%), Positives = 293/619 (47%), Gaps = 70/619 (11%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + + + L+ E GG+N+ L+ T+D + L LA
Sbjct: 222 AVSLAGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L+H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
G+Y+ Y+ S +G L+ + + + + + ++ +L L
Sbjct: 452 G---QGVYVNLYVPSMVHDAAG---LDMTLHSALPEQGSASLRIDAAPAEQ----RTLAL 501
Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
R+P W Q LNGQ + +L T W D L++ + LR EA DD P
Sbjct: 502 RVPGWAQQPRLQ--LNGQPVDTAASDGYLRITRVWQRGDTLSLAFDMPLRLEATSDD-PA 558
Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQLVTFTQES 668
+ S +L GP +LA G + W KT + + + P+P
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKTPALIGGQDILQRLQPVP-------------- 601
Query: 669 GNSTFVMSNSNQSITMEEF 687
G + FV ++ Q + F
Sbjct: 602 GKTAFVYNDGVQQWQLSPF 620
>gi|300785876|ref|YP_003766167.1| hypothetical protein AMED_3987 [Amycolatopsis mediterranei U32]
gi|384149186|ref|YP_005532002.1| hypothetical protein RAM_20325 [Amycolatopsis mediterranei S699]
gi|399537759|ref|YP_006550421.1| hypothetical protein AMES_3940 [Amycolatopsis mediterranei S699]
gi|299795390|gb|ADJ45765.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340527340|gb|AEK42545.1| hypothetical protein RAM_20325 [Amycolatopsis mediterranei S699]
gi|398318529|gb|AFO77476.1| hypothetical protein AMES_3940 [Amycolatopsis mediterranei S699]
Length = 775
Score = 276 bits (706), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 194/548 (35%), Positives = 273/548 (49%), Gaps = 51/548 (9%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
WLD Q YL +DV+ L++ FR L T G A GGW+ P R H
Sbjct: 67 WLDN-------QNRTQNYLRFVDVNRLLYVFRANHRLSTGGAATNGGWDAPSFPFRSHVQ 119
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEA 221
GH+L+A AQ+WA T + T ++K +T+V L++CQ G GYLS FP FD+ EA
Sbjct: 120 GHFLTAWAQLWAVTGDTTSRDKATTMVAELAKCQANNGAAGFSAGYLSGFPEADFDNLEA 179
Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSV 275
L PYY IHK +AGLLD + + QA L +A W V + S
Sbjct: 180 GRLSNGNVPYYCIHKTMAGLLDVWRYIGSTQARDVLLNLAGW--------VDRRTARLST 231
Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
+ LN E GGMNDVL LY T D + L A FD LA D L+ HANT
Sbjct: 232 SQLQSVLNTEFGGMNDVLADLYQYTGDARWLTAAQRFDHAAVFDPLAANRDQLNGLHANT 291
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+P IG+ Y+ TG Y+ I T +I +H+YA GG S E + P +A L
Sbjct: 292 QVPKWIGAAREYKATGTTRYRDIATNAWNITVGAHTYAIGGNSQAEHFRAPNAIAAYLNQ 351
Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEP-GVMIYMLPLG 453
+ E+C TYNMLK++R L + A ADYYERAL N ++ Q + G + Y L
Sbjct: 352 DTCESCNTYNMLKLTRELIALYPDRADLADYYERALLNQMIGQQNPADSHGHITYFSSLN 411
Query: 454 ----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
RG+ A W T ++SFWCC GTG+E+ +KL DSIYF + L + ++ S
Sbjct: 412 PGGRRGLGPAWGGGTWSTDYDSFWCCQGTGLETQTKLADSIYFYNDTT---LTVNLFLPS 468
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
W + + Q S T T + V ++ +R+P WT GA S+
Sbjct: 469 VLTWTQRGITVTQTTSFPAS------DTSTLTVTGSVSGTWAMRIRIPGWT--TGATISV 520
Query: 570 NG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
NG QN+ PG++ + + W+ D +T++LP+ + +A + A++ A+ +GP +
Sbjct: 521 NGVAQNVAT-TPGSYATLSRSWASGDAVTVRLPMKVALKAAN----DNANVAAVTYGPVV 575
Query: 628 LAGHTSGE 635
LAG+ SG
Sbjct: 576 LAGNYSGS 583
>gi|289661682|ref|ZP_06483263.1| putative secreted protein, partial [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 756
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 195/619 (31%), Positives = 294/619 (47%), Gaps = 70/619 (11%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S+ A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRL-MPSLFLDALNTNRRYLMRLQPDRLLHNFVLYAGLDPQAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + + + L+ E GG+N+ L+ T+D + L LA
Sbjct: 222 AVGLAGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L+H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLVTQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRSGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
G+++ Y+ S+ +G L+ + + + + + ++ +L L
Sbjct: 452 G---QGVFVNLYVPSTVRDAAG---LDMTLHSALPEQGSASLRIDAAPAEQ----RTLAL 501
Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
R+P W Q LNGQ + +L T W D L++ + LR EA DD P
Sbjct: 502 RVPGWAQQPRLQ--LNGQPVDSAASDGYLRITRVWQRGDTLSLAFDMPLRLEATPDD-PA 558
Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQLVTFTQES 668
+ S +L GP +LA G + W KT + + + P+P
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSSKTPALIGGQDILQRLQPVP-------------- 601
Query: 669 GNSTFVMSNSNQSITMEEF 687
G + FV ++ Q + F
Sbjct: 602 GKTAFVYNDGAQQWQLSPF 620
>gi|408393860|gb|EKJ73118.1| hypothetical protein FPSE_06731 [Fusarium pseudograminearum CS3096]
Length = 623
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 200/560 (35%), Positives = 273/560 (48%), Gaps = 44/560 (7%)
Query: 89 PGGFDLPGNF-LKEVSLHDV-WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT 146
P DL F L +VSL D W+D Q + YLL +D D L++ FRK L T
Sbjct: 25 PKVSDLADAFELSDVSLTDSRWMDN-------QGRTVNYLLSIDPDRLLYVFRKNHGLDT 77
Query: 147 PGKAY-GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN---K 202
G A GGW+ P R H GH+LSA + +A+ N + S V L++CQ K
Sbjct: 78 KGAAKNGGWDAPDFPFRSHVQGHFLSAWSNCYATLGNKECGSRASYFVKELAKCQANNAK 137
Query: 203 IG--TGYLSAFPTELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
+G +GYLS FP E L PYY IHK LAGLLD Y + A + +
Sbjct: 138 VGFTSGYLSGFPESEITKVEDRTLSSGNVPYYAIHKTLAGLLDVYRRVGDNDAKTVMLSL 197
Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
+ R K+ S + + E GGMN+VL + T D K L +A FD
Sbjct: 198 ASWVDARTGKL----SYAKMQQMMQTEFGGMNEVLADIAYYTQDNKWLKVAQRFDHAAIF 253
Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
L D LS HANT +P IG+ Y+V+GD Y IG D+ H+YA GG S
Sbjct: 254 DPLQNNVDKLSGLHANTQVPKWIGALREYKVSGDKKYLDIGRNAWDLTVHKHTYAIGGNS 313
Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSI 437
E + +P +A L + E C TYNMLK++R L+ + +Y DYYE AL N +L
Sbjct: 314 QAEHFREPNAIAKYLTKDTCEACNTYNMLKLTRELWALNPTDASYFDYYENALMNHLLGQ 373
Query: 438 QRGTEP-GVMIYMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
Q + G + Y PL RGV A W T +NSFWCC G+GIE+ +KL DSIYF
Sbjct: 374 QNPKDSHGHVTYFTPLTPGGRRGVGPAWGGGTWSTDYNSFWCCQGSGIETNTKLMDSIYF 433
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LY+ + S +W V + Q + + TL K L+
Sbjct: 434 HTKDT---LYVNLFTPSKLNWSQQGVSIIQTTE----YPQKDSSTLQIGGKAGTWTLA-- 484
Query: 553 NLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDD 611
+R+P WT + A +NGQ++ + PG + T W+ DK+TI LP+SLRT A D+
Sbjct: 485 -VRIPSWT--SKASIQVNGQSVNVNTTPGKYALVTRNWNSGDKVTITLPMSLRTIAANDN 541
Query: 612 RPEYASIQAILFGPYLLAGH 631
+ + A+ FGP +LA +
Sbjct: 542 ----SQVAAVAFGPVILAAN 557
>gi|312621677|ref|YP_004023290.1| hypothetical protein Calkro_0576 [Caldicellulosiruptor
kronotskyensis 2002]
gi|312202144|gb|ADQ45471.1| protein of unknown function DUF1680 [Caldicellulosiruptor
kronotskyensis 2002]
Length = 588
Score = 275 bits (703), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 178/580 (30%), Positives = 297/580 (51%), Gaps = 43/580 (7%)
Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT----PGKAYGGWENPISELRGHFVG 167
S +R + N Y+L L ++L+ +F + L + P +GGWE+P +LRGHF+G
Sbjct: 18 ESEFYRRFEINRNYMLSLKTENLLQNFYLESGLVSWSFLPQDIHGGWESPTCQLRGHFLG 77
Query: 168 HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWA 227
H+LSA+A+++A+ + IK K ++ L +CQ + G ++ + P + F+ K VWA
Sbjct: 78 HWLSAAAKIYANFGDEEIKGKADYIINELEKCQRENGGEWVGSIPEKYFEWMARGKYVWA 137
Query: 228 PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
P+YT+HK GL+D Y A N +AL++A +FY + +S E+ L+ ETG
Sbjct: 138 PHYTVHKTFMGLVDMYKYASNQKALEIADKWANWFY----RWSGQFSREKMDDILDYETG 193
Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
GM ++ LY IT D K+ L + + L + D L+ HANT IP + G+ +
Sbjct: 194 GMLEIWAELYDITKDSKYKDLMERYYRGRLFDRLLMGEDVLTGKHANTTIPEIHGAARVW 253
Query: 348 EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
E+TG+ + K++ +++ + V+ + TGG + E W +++ + LG+ N+E C YNM
Sbjct: 254 EITGEEKFRKIVESYWKEAVDERGYFCTGGQTLGEVWTPKQKIKNYLGTTNQEHCVVYNM 313
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
++++ LFRWT + Y+DY ER + NG+ + QR + G++ Y LPL G K WG
Sbjct: 314 IRLAEFLFRWTGDKRYSDYIERNIYNGLFAQQR-LKDGMVTYYLPLMPGSQKR-----WG 367
Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDP 526
T N FWCC+GT +++ + D IY++ + G+ I Q+I SS WK + K +
Sbjct: 368 TPTNDFWCCHGTLVQAHTIYNDLIYYKSQN---GIVISQFIPSSVTWK------DDKGND 418
Query: 527 IVSWDPYLRMTLTFSSKQEVGQLS-----------SLNLRMPVWTYSNGAQASLNGQNLP 575
I + R +F+ E ++ L +R P W + + +NG +
Sbjct: 419 ITITQYFERKHGSFAYTAEKDEIYIEIQCKSPVEFELAIRKPWW--AKKVEIEINGNSYY 476
Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
++ T+RW+ N+K+ I ++ T ++ DD P+ A + GP +LAG
Sbjct: 477 AADDSPYIQLTQRWN-NEKIKITFYKAVETCSMPDD-PQQV---AFMIGPVVLAGLCERR 531
Query: 636 WDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
I G R + +I PI L+ TQ F +
Sbjct: 532 RKIYIG-ERKIEEIIVPIDKRGYGPLLYTTQGQIEDIFFL 570
>gi|390456441|ref|ZP_10241969.1| hypothetical protein PpeoK3_20683 [Paenibacillus peoriae KCTC 3763]
Length = 759
Score = 275 bits (703), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 176/544 (32%), Positives = 284/544 (52%), Gaps = 38/544 (6%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT-PGKAYGGWENP 157
L ++S V L+ S+L AQ L++LL ++ D ++++FRK A L T A GW++
Sbjct: 185 LHDISTQKVHLEGPSLLKTAQNRRLQFLLTVNDDQMLYNFRKAAGLDTLNAPAMIGWDSD 244
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQ------NKIGTGYLSAF 211
S L+GH GHYLSA A +AST N I++K++ ++ L++ Q ++ G+LSA+
Sbjct: 245 DSLLKGHTTGHYLSALALCYASTGNERIRQKLAYLIDELNKVQLAFEADDRYHYGFLSAY 304
Query: 212 PTELFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
E FD E +WAPYYT+HKI AGLLD Y +A AL +A + ++ YNR+
Sbjct: 305 SEEQFDLLEVYTRYPEIWAPYYTLHKIFAGLLDSYHIAGIELALVIADKVGDWIYNRLS- 363
Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
V+ +++ W + E GG+N+ L LY+ T H+ A LFD + D
Sbjct: 364 VLPQEQLKKMWGLYIAGEYGGINESLAELYTYTQKEHHIAAAKLFDNDRLFFPMEQHVDA 423
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L HAN HIP ++G+ +E TG+ Y I FF + V +H Y+ GGT E + P
Sbjct: 424 LGGMHANQHIPQIVGAFKIFEATGEQKYYDIAKFFWESVVNAHIYSIGGTGEGEMFKQPY 483
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
++ L ETC +YNMLK+++ L+ + ++ Y DYYER + N +LS G
Sbjct: 484 QIGAHLTEHTAETCASYNMLKLTKQLYVYENDVKYMDYYERTMINHILSSTDHECLGAST 543
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y +P G K NS CC+GTG+E+ K ++I+FE + LY+ ++
Sbjct: 544 YFMPTSSGGQKGYDEE------NS--CCHGTGLENHFKYAEAIFFE---DADSLYVNLFV 592
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRM-TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
S+ + ++ + + Q V I + + + + TLT ++L +R+P W +
Sbjct: 593 PSALNDEAKGLQVVQSVPEIFNGEVEIHIETLT---------RTNLRVRIPYW-HQGEVT 642
Query: 567 ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
A +N + +L +++W+ D++T++ LR E P+ A I ++ FGPY
Sbjct: 643 AFVNHTKVNTVEENGYLVLSQKWNKGDQVTMKFTPRLRLERT----PDKADIASLAFGPY 698
Query: 627 LLAG 630
+LA
Sbjct: 699 ILAA 702
>gi|322433089|ref|YP_004210338.1| hypothetical protein AciX9_4244 [Granulicella tundricola MP5ACTX9]
gi|321165316|gb|ADW71020.1| protein of unknown function DUF1680 [Granulicella tundricola
MP5ACTX9]
Length = 800
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 185/564 (32%), Positives = 275/564 (48%), Gaps = 42/564 (7%)
Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
L +AQ + +YLL L + ++ R+ A L + YGGW+ P +L GH GHYLSA +
Sbjct: 49 LKKAQDLDAQYLLELQPERMLAFLRQRAGLEAKAQGYGGWDGPGRQLTGHIAGHYLSAIS 108
Query: 175 QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF----------------PTELFDS 218
M+A+T + KE+ V L QN G GY+ A E+
Sbjct: 109 MMYATTGDVRFKERADEFVAELQTIQNAQGDGYIGALLDAKGVDGKVKFQDLSKGEIKSG 168
Query: 219 FEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
L +W+P+Y HK+ AGL D Y L + AL++ F V+ ++ + ++
Sbjct: 169 GFDLDGLWSPWYVEHKLFAGLRDAYHLTGDRTALEVEI----EFAGWVEGILKNLNEDQI 224
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
L E GGMN+VL LY+ T+D + + L+ F+ + L+ D L+ HANT+IP
Sbjct: 225 QRMLATEFGGMNEVLADLYADTNDTRWMKLSDKFEHHAIVDPLSQGQDILAGKHANTNIP 284
Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
+IG RYE TGD FF D V+ HS+ATGG E++ P ++ D +
Sbjct: 285 KMIGELARYEYTGDEKDGKAANFFFDEVSLHHSFATGGDGKNEYFGQPDKMNDMIDGRTA 344
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C YNM+K++R LF + YAD+ ERA N +L Q + G + YM+P+GRGV
Sbjct: 345 ESCAAYNMIKMARTLFSLDPQARYADFVERADLNAILGGQ-DPDDGRVSYMVPVGRGVQ- 402
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
H + KF SF CC G+ +E+ + IY E GN L++ QY ++ DW S V
Sbjct: 403 ----HEYQNKFESFTCCVGSQMETHAFHAYGIY-NESGN--KLWVSQYDPTTVDWASQGV 455
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LP 577
L D + L+MT S ++ +L LR P W S G +NG L +
Sbjct: 456 KLEMVTDLPMGDTATLKMTSGQS------KVFTLALRRPYWATS-GFAVKVNGVLLKNVS 508
Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWD 637
P ++ RW D + + LP +LR E + P+ + AI++GP +LAG E
Sbjct: 509 GPDTYIEINRRWKVGDAVEVVLPKTLRKEPL----PDNPNRMAIMWGPLVLAGDLGPEVS 564
Query: 638 -IKTGTARSLSALISPIPPSFNAQ 660
+ G S SA+ P A+
Sbjct: 565 RRRNGGEGSASAVPEAAPALITAE 588
>gi|427411824|ref|ZP_18902026.1| hypothetical protein HMPREF9718_04500 [Sphingobium yanoikuyae ATCC
51230]
gi|425710114|gb|EKU73137.1| hypothetical protein HMPREF9718_04500 [Sphingobium yanoikuyae ATCC
51230]
Length = 802
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 178/533 (33%), Positives = 264/533 (49%), Gaps = 44/533 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A + N YLL L+ D L+ +FRK A L G YGGWEN + GH +GHYL+A A M
Sbjct: 63 AVEGNRLYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWEN--DTIAGHTLGHYLTALALMH 120
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------------- 221
A T +A + + ++ L+ CQ G GY++ F D E
Sbjct: 121 AQTGDAECARRAAYIIDELAACQAAAGDGYVAGFTRRRDDVIEDGRLIFPEIMRGDIRSA 180
Query: 222 ---LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
L W P+Y HK+ AGL D N+QA +A + Y + V +
Sbjct: 181 GFDLNGCWVPFYNWHKLFAGLFDAETHLGNSQARGVALALAAY----IDGVFAKLDDAQV 236
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
L+ E GG+N+ L++ T DP+ L LA L LA + + L HANT IP
Sbjct: 237 QQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDPLAQRQNSLPWIHANTQIP 296
Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
+IG +E+TG+ + FF + V +SY GG + RE++ DP ++ + +
Sbjct: 297 KLIGLARLHEITGNAADAIAANFFWETVVGQYSYVIGGNADREYFPDPGTISKHITEQTC 356
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C +YNMLK++RHL+ W E DYYERA N +L+ Q G+ YM+PL G +
Sbjct: 357 ESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQNPAT-GMFAYMVPLMSGSHR 415
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ-YISSSFDWKSGH 517
W F+ FWCC G+G+ES +K G+SI++E+ + I YI S DW +
Sbjct: 416 V-----WSEPFDDFWCCVGSGMESHAKHGESIWWEDADRPADMLIANLYIPSEADWAARG 470
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
L +++ +D ++ L+ G+ +L LR+P W GA+ ++NG LP P
Sbjct: 471 AKL--RIETGYPFDGHI--ALSIPKLARAGRF-TLALRIPGW--CQGARIAVNGTPLPAP 523
Query: 578 PPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
+ + R W D++T+ LP++LR EA DD A A+L GP +LA
Sbjct: 524 RIADGYALIGRKWKAGDQVTLDLPMALRVEATPDD----ARTIALLHGPVVLA 572
>gi|375308750|ref|ZP_09774033.1| hypothetical protein WG8_2558 [Paenibacillus sp. Aloe-11]
gi|375079377|gb|EHS57602.1| hypothetical protein WG8_2558 [Paenibacillus sp. Aloe-11]
Length = 770
Score = 275 bits (702), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 183/548 (33%), Positives = 282/548 (51%), Gaps = 44/548 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENP 157
+KE + V L++ S A L+++ ++ D ++++FR+ A++ T G + GW+ P
Sbjct: 191 VKEFTGPKVSLERESEFAAAMNRFLQFVRSVNDDQMLYNFREAAAIDTKGAQPMTGWDAP 250
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI------GTGYLSAF 211
L+GH GHYLSA A + +T ++ + K+ +V L +CQ + G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYHATEDSALLGKIQYMVAELGKCQTALSEQAGYGRGFLSAY 310
Query: 212 PTELFDSFE---ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
E F+ E +WAPYYT+HKI+AGLLD Y LA +AL + + + ++R+ +
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALDICDKLGHWLHSRLSR 370
Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ + + W + E GGMN+ L +LY+IT + +L+ A FD + D
Sbjct: 371 -LPREQLHKMWSLYIAGEFGGMNEALAKLYAITGNENYLMTAKYFDNAKLFLPMKENVDT 429
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L + HAN HIP VIG+ +EV GD Y I F +V SH Y GGT E + +P
Sbjct: 430 LGNMHANQHIPQVIGALKLFEVAGDKAYFNIAENFWTMVTQSHIYPIGGTGETEMFREPD 489
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP-GVM 446
+A L + ETC +YNMLK+++ LF++ Y DYYE+AL N +L+ + + G
Sbjct: 490 AIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEGGS 549
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
Y +PL G K TH CC+GTG+E+ K ++IYF +E LY+ Y
Sbjct: 550 TYFMPLAPGSIKKFDTH-------ENTCCHGTGLENHFKYQEAIYFHDEDR---LYVNLY 599
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL-TFSSKQEVGQLSSLNLRMPVWTYSNGA 565
I S DW + L QK D R L T E G ++L R+P W S
Sbjct: 600 IPSRLDWSEQGISLMQKRD---------RDGLETVRFYIEGGPETTLMFRIPDWV-SEPV 649
Query: 566 QASLNGQNLP---LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
Q +NG +P L +L + W D++ + LP SLR D P+ +++++
Sbjct: 650 QVKING--VPCRDLEYEHGYLKLRKVWK-KDEIELTLPCSLRLA----DAPDDHTLKSLT 702
Query: 623 FGPYLLAG 630
+GPY+LA
Sbjct: 703 YGPYVLAA 710
>gi|325915124|ref|ZP_08177450.1| hypothetical protein XVE_1336 [Xanthomonas vesicatoria ATCC 35937]
gi|325538646|gb|EGD10316.1| hypothetical protein XVE_1336 [Xanthomonas vesicatoria ATCC 35937]
Length = 791
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 188/569 (33%), Positives = 275/569 (48%), Gaps = 53/569 (9%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGR-MRAVPLAQVRLTPSLFL-DALNTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCATRAAYLVSELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + NAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKKGKIDSAPFYLNGSWAPLYTWHKLFAGLLDVHAHCGNAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + + + L+ E GG+N+ L+ T D + L LA
Sbjct: 222 AVGLAGY----LQGIFAALNDAQLQQVLSCEFGGLNESFVELHVQTDDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
+ L Q D L H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVIDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWQTVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RHL++W + + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAVHFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+E+
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWED 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
G+++ Y+ S+ +G + + P R +T +L L
Sbjct: 452 G---QGVFVNLYVPSTVRDAAGFALSLRSTLPE-------RGEVTLQIDAAPAAARTLAL 501
Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
R+P W + Q +NGQ L P +L W+ D +++QL + LR E DD P
Sbjct: 502 RVPGWAGAFTLQ--VNGQLQTLQPVDGYLRIERVWAAGDTVSLQLGMPLRLEPTSDD-PA 558
Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
+ ++ GP +LA G + WD T
Sbjct: 559 WV---VVMRGPLVLAADLGDAATPWDNTT 584
>gi|451851952|gb|EMD65250.1| hypothetical protein COCSADRAFT_141970 [Cochliobolus sativus
ND90Pr]
Length = 620
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 189/538 (35%), Positives = 280/538 (52%), Gaps = 41/538 (7%)
Query: 112 SSVLWRAQQT-NLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHY 169
S+ W+ + L YL ++VD L+++FR T L T G + GGW+ P R H GHY
Sbjct: 45 SNSRWKDNENRTLNYLKFVNVDRLLYNFRATHKLSTNGAQPNGGWDAPNFPFRSHVQGHY 104
Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEALKP 224
L+A +A+ ++T K++ + V L++CQ G GYLS FP F + EA K
Sbjct: 105 LTAWVNCYATLRDSTCKDRAAYFVQELAKCQANNGVAGFSPGYLSGFPESEFAALEAGKL 164
Query: 225 VWA--PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
PYY +HK +AGLLD + + + +A + + + R +K+ S + L
Sbjct: 165 TGGNVPYYAVHKTMAGLLDAWRIIGDQKARDVLLALAGWVDGRTKKL----STAQMQTML 220
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
E GGMNDVL +Y +T + + L +A FD LA + D LS HANT +P IG
Sbjct: 221 GTEFGGMNDVLAEIYQLTGNKQWLTVAQRFDHAKVFDPLANKQDQLSGNHANTQVPKWIG 280
Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCT 402
+ Y+ TG Y I D +H+YA GG S E + P ++++ L ++ E C
Sbjct: 281 AAREYKSTGTKRYLDIARNAWDFTINAHTYAIGGNSQAEHFRPPNQISNFLTNDTAEQCN 340
Query: 403 TYNMLKVSRHLFRWTKE---IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPL----GR 454
TYNMLK++R L WT + Y DYYERAL N +L Q + G + Y PL R
Sbjct: 341 TYNMLKLTRDL--WTTDPTSTKYFDYYERALINHLLGAQNAADNHGHITYFTPLRSGGRR 398
Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
GV A W T +NSFWCC GT +E+ +KL DSIYF + LY+ + S+ DWK
Sbjct: 399 GVGPAWGGGTWSTDYNSFWCCQGTALETNTKLMDSIYFYDNS---ALYVNLFTPSTLDWK 455
Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
+V + Q + L++T T + ++ +R+P WT +GA SLNGQ
Sbjct: 456 QRNVKITQVTTFPIGDTTTLKVTGTGN--------WAMKIRIPSWT--SGATISLNGQAS 505
Query: 575 PLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
+ PG++ + + W D +T++LP+ LRT A + A+I AI +GP +L+G+
Sbjct: 506 GVAANPGSYATLSRNWVSGDTVTVKLPMKLRTVAAN----DNANIAAIAYGPTILSGN 559
>gi|373958137|ref|ZP_09618097.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373894737|gb|EHQ30634.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 789
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 194/541 (35%), Positives = 281/541 (51%), Gaps = 43/541 (7%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L DV L +S +A + + YLL ++ D L+ FR + L GK YGGWE+ S L G
Sbjct: 52 LQDVRLLESP-FKQAMEKDAAYLLSVEPDRLLSGFRSHSGLTPKGKMYGGWES--SGLAG 108
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA-- 221
H +GHYLSA + +AS+ N E+++ +V L ECQ TGY+ A P E D+ A
Sbjct: 109 HTLGHYLSAISMQYASSRNPQFLERVNYIVKELKECQVARKTGYIGAIPKE--DTIWAEI 166
Query: 222 -----------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
L W+P+YT+HK++AGLLD Y+ +NA+AL + M ++ +Q +
Sbjct: 167 KKGDIRSRGFDLNGGWSPWYTVHKVMAGLLDAYLYCNNAEALNICKGMGDWTGELLQNL- 225
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
+ E+ L E GGM + L LY+IT + +L ++ F L L+ D L
Sbjct: 226 ---NDEQIQSMLLCEYGGMAETLVNLYAITGNKAYLATSYKFYDKRILNPLSENKDILPG 282
Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
H+NT IP VI S RYE+TG+ + I F +I+ HSYATGG S E+ +P +L
Sbjct: 283 KHSNTQIPKVIASARRYELTGEKKDEDISVNFWNIITKDHSYATGGNSNYEYLSEPDKLN 342
Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
D L ETC TYNMLK++RHLF A DYYE+AL N +L+ Q + G+M Y +
Sbjct: 343 DKLTENTTETCNTYNMLKLTRHLFSVNPSAALMDYYEKALYNHILASQNHDD-GMMCYFV 401
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
PL G K S + F++F CC G+G+E+ K +SIY+ GN LY+ +I S
Sbjct: 402 PLRMGGKKEYS-----SPFDTFTCCVGSGMENHVKYNESIYY--RGNDGSLYVNLFIPSV 454
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
WK + L Q+ + S T +S + V +L +R P W + +N
Sbjct: 455 LTWKEKGITLTQQNNFPAS----DVTTFVINSTKPVN--FALKIRKPKW--AGNCLIKVN 506
Query: 571 GQ-NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
G+ + +L W NDK+ P S+ TEAI P+ + +A+ +GP LLA
Sbjct: 507 GKAGITTTNEQGYLVINRLWKNNDKIEFVTPESIYTEAI----PDNINRKALFYGPVLLA 562
Query: 630 G 630
G
Sbjct: 563 G 563
>gi|345302361|ref|YP_004824263.1| hypothetical protein Rhom172_0482 [Rhodothermus marinus
SG0.5JP17-172]
gi|345111594|gb|AEN72426.1| protein of unknown function DUF1680 [Rhodothermus marinus
SG0.5JP17-172]
Length = 641
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 183/528 (34%), Positives = 267/528 (50%), Gaps = 42/528 (7%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A Q ++ YL LD D L+ FR+ A L YGGWE+ + GH +GHYLSA + +
Sbjct: 56 AMQRDVAYLFELDPDRLLSRFRRFAGLEPKAPEYGGWES--QGISGHTLGHYLSALSMYY 113
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFE-ALKPV 225
A+T + + ++ +V L+E Q G GY+ A P E++ + +L
Sbjct: 114 AATGDEKARARIDYIVSELAEVQRAHGNGYVGAIPEGDRLWAEIARGEIWQAEPFSLNGA 173
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS-LNE 284
W P+YT+HKI GL+D Y N QAL++ T + ++ Y + + W L
Sbjct: 174 WVPWYTMHKIFQGLIDAYWYGGNEQALEVVTRLADWAYETTKNLTPA-----QWQQMLRT 228
Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
E GGMN+ L LYSIT +PKH L+ F L LA L+ HANT IP VIG
Sbjct: 229 EHGGMNEALANLYSITGNPKHRELSQKFYHAAVLSPLARGIPNLTGLHANTQIPKVIGVV 288
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
+YE+ G + + FF + V H+Y GG S E + LA+ LG ETC TY
Sbjct: 289 RQYELIGSDSLRAVAEFFWEEVVQHHTYVIGGNSQNEHFGPRDSLANRLGEGTAETCNTY 348
Query: 405 NMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH 463
NML+++RHLF E + Y D+YERAL N +L+ Q + G+ Y + L G K
Sbjct: 349 NMLRLTRHLFALHPEKVRYVDFYERALYNHILASQ-DPKHGMFTYYMSLRPGHFKT---- 403
Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
+ T NSFWCC GTG+E+ K + IYF N LY+ +I S +W+ + L +
Sbjct: 404 -YATPENSFWCCVGTGMENHVKYNEFIYFY---NGDTLYVNLFIPSELNWERRALRLRLE 459
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNF 582
++ R+ L F EV Q + +R P W + + +NG+ + PG++
Sbjct: 460 ----TAFPESNRVRLDFDP--EVPQRLVVKVRHPSWA-QDALEVRINGEVQSVTSRPGSY 512
Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
L+ W D++ I LP+ LR E + D+ + AIL+GP +LAG
Sbjct: 513 LTLARLWQPGDEVEITLPMRLRVETMPDNPDRF----AILYGPIVLAG 556
>gi|386847956|ref|YP_006265969.1| Glucan endo-1,3-beta-glucosidase [Actinoplanes sp. SE50/110]
gi|359835460|gb|AEV83901.1| Glucan endo-1,3-beta-glucosidase [Actinoplanes sp. SE50/110]
Length = 765
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 192/580 (33%), Positives = 283/580 (48%), Gaps = 46/580 (7%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFVGHYLSASAQMW 177
Q L YL +D D L+++FR T G A GGW+ P R H GH+L+A AQ W
Sbjct: 65 QTRTLNYLRFVDADRLLYNFRANHGRSTGGAAANGGWDAPDFPFRTHVQGHFLTAWAQAW 124
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA--LKPVWAPYYTIHKI 235
A+ + T +++ + +V L++CQ GYLS FP F + EA L PYY +HK
Sbjct: 125 AALGDTTCRDRANYMVAELAKCQ--AANGYLSGFPESDFTALEAGTLSNGNVPYYCVHKT 182
Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYR 295
LAGLLD + L QA + + + R ++ T + L E GGMN+VL
Sbjct: 183 LAGLLDVWRLIGGTQARDVLLRLAGWVDTRTARLTT----SQMQAMLGTEFGGMNEVLAD 238
Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY 355
+Y T D + L A FD LA AD L+ HANT +P +G+ Y+ TG Y
Sbjct: 239 IYQQTGDGRWLATAQRFDHAAVFTPLAAGADQLNGLHANTQVPKWVGAVREYKATGTTRY 298
Query: 356 KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
+ IG +I +H+YA GG S E + P +A L ++ E C +YNMLK++R L+
Sbjct: 299 RDIGLNAWNITTGAHTYAIGGNSQAEHFRAPNAIAGYLTNDTCEHCNSYNMLKLTRELWL 358
Query: 416 WTKE-IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPL----GRGVSKARSTHGWGTKF 469
+ AY D+YERAL N ++ Q + G + Y PL RGV A W T +
Sbjct: 359 TDPDRAAYFDFYERALLNHLIGAQNPADSHGHITYFTPLRPGGRRGVGPAWGGGTWSTDY 418
Query: 470 NSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVS 529
SFWCC GTG+E+ +KL +SIYF L + + S W + + Q VS
Sbjct: 419 ASFWCCQGTGVETNTKLMESIYFFSGTT---LTVNLFTPSVLSWAERGITVTQATAYPVS 475
Query: 530 WDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATER 588
L ++ T S S+ +R+P WT GA ++NG + PG + + T
Sbjct: 476 DTTTLTVSGTPSGTW------SIRVRIPGWT--TGATLAVNGVAQGVGATPGGYATVTRA 527
Query: 589 WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSA 648
W+ D LT++LP+ + + D+ ++QAI +GP +L G+ G +LSA
Sbjct: 528 WAAGDVLTVRLPMRVIMQPAADN----PAVQAITYGPVVLCGNYGGT---------TLSA 574
Query: 649 LISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP 688
PS N + T SG+ F + + ++++ FP
Sbjct: 575 -----HPSLNVSSIART-GSGSLAFTATANGATVSLGPFP 608
>gi|255075873|ref|XP_002501611.1| predicted protein [Micromonas sp. RCC299]
gi|226516875|gb|ACO62869.1| predicted protein [Micromonas sp. RCC299]
Length = 1214
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 206/694 (29%), Positives = 298/694 (42%), Gaps = 164/694 (23%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYL-LMLDVDSLVWSFRKTASLPT------- 146
P N L +H LD AQ+ N YL ++D L+ +FR A LP
Sbjct: 180 PANVLHGAGVH---LD-------AQRLNARYLTAVVDPRRLLANFRVVAGLPPETIPDRH 229
Query: 147 --------------------PGKAYGGWENPISELRGHFVGHYLSA-------------- 172
PG WE P ELRGHF GHYLSA
Sbjct: 230 PTETVAPYCDVGSGLSYAEHPGAC---WEAPDCELRGHFAGHYLSALAFVAAGAGDRPNT 286
Query: 173 -----------SAQMWASTHNATI------KEKMSTVVFSLSECQNKIGT--GYLSAFPT 213
S + + H + + +E + V L+ Q GT GY+SAFP
Sbjct: 287 SPDRTSSSDHLSDPEYVTGHQSDVATARHAREMLDRFVDGLATAQASSGTSAGYVSAFPE 346
Query: 214 ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMY 273
E+ D A+ WAPYYT+HKI GL+D +V+A NA+AL + + RV +I
Sbjct: 347 EVLDRQGAVGGAWAPYYTLHKIGQGLMDAHVVAGNAKALDVLKGLANAVLTRVMGLIQQR 406
Query: 274 SVERHWY---------SLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
HW+ + E+GG N++ +RLY +T + ++ LA LFD P FLG +
Sbjct: 407 GAS-HWFGGALEYSKAAFGAESGGFNELAWRLYQLTGNGDYVTLASLFDHPTFLGRMRAG 465
Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
D L+ HAN H PI +G+ RYE+TGD + F++++ + SYATGGT E W
Sbjct: 466 GDGLTREHANFHEPIAMGAYSRYEITGDTESRRAFRNFIELLRDTRSYATGGTCDGERWQ 525
Query: 385 DPKRLADTL-GSENEETCTTYNMLKVSRHL---FRWTKEIAYADYYERALTNGVLSIQRG 440
P RL + +E +ETCT N +++ F + +ADY ERA +G + +QR
Sbjct: 526 APGRLERIIVSTETQETCTQVNFERLANAAVASFGEAEARDWADYSERASLHGPVGLQR- 584
Query: 441 TEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY--FEEEGNV 498
+PG ++Y PLG GVSK RS HGWG +FWCCYGTG+E+ ++L D ++ E V
Sbjct: 585 -KPGELLYTTPLGVGVSKGRSGHGWGRPDAAFWCCYGTGVEALARLQDGVFWRLEAGATV 643
Query: 499 PG-----------LYIIQYISSSF-DWKSGHVVLNQKVDPIVSWDPY------------- 533
PG +YI + +S+ W V VDP P
Sbjct: 644 PGDDTSSTTATDVVYIARVTTSAVATWDEKGVTTRVSVDPFNVGGPVQREGGRDGRRRRG 703
Query: 534 --------LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPG----- 580
+ +T+ + E +S+ +++P W G++ +LNG+ + G
Sbjct: 704 TAGFFASAVAITVHAEGRNEP---TSIRVKLPRWA-GGGSRITLNGERVRCENGGDSSSS 759
Query: 581 -----------------NFLSATERWSYNDKLTIQLPLSLRTEAI--QDDRPEY------ 615
+ T W D L P+ +R E + D P +
Sbjct: 760 EDSDSDSDSDSDSDSDSGWCDVTRVWRKTDLLRASFPIVVRAEPLLGSDLTPGFGTGSNQ 819
Query: 616 -----ASIQAILFGPYLLAGHTSGEWDIKTGTAR 644
+ AI+ GPY+LA G W G R
Sbjct: 820 RLDGKGARHAIVAGPYVLAALGPGAWIADLGVKR 853
>gi|346725400|ref|YP_004852069.1| hypothetical protein XACM_2511 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346650147|gb|AEO42771.1| hypothetical protein XACM_2511 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 791
Score = 273 bits (697), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 197/619 (31%), Positives = 292/619 (47%), Gaps = 70/619 (11%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DNAQAL++
Sbjct: 162 DAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + + + L+ E GG+N+ L+ T D + L LA
Sbjct: 222 AMGLAGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTDDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L+H H+NT+IP +IG YEVTG+ FF V H+Y
Sbjct: 278 HAVLDPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGNAASGAAARFFWHTVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRSGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
G+Y+ Y+ S +G L+ + + + + + ++ +L L
Sbjct: 452 G---QGVYVNLYVPSMVHDAAG---LDMTLHSALPEQGSASLRIDAAPAEQ----RTLAL 501
Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
R+P W Q LNGQ + +L T W D L++ + LR EA DD P
Sbjct: 502 RVPGWAKQPRLQ--LNGQPVDSTVSDGYLRITRTWQRGDTLSLAFDMPLRLEATPDD-PA 558
Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQLVTFTQES 668
+ S +L GP +LA G S W KT + + + P+P
Sbjct: 559 WVS---VLRGPLVLAVDLGDASKPWSGKTPALIGGQDILQRLQPVP-------------- 601
Query: 669 GNSTFVMSNSNQSITMEEF 687
G + FV ++ Q + F
Sbjct: 602 GKTAFVYNDGVQQWQLSPF 620
>gi|116182754|ref|XP_001221226.1| hypothetical protein CHGG_02005 [Chaetomium globosum CBS 148.51]
gi|88186302|gb|EAQ93770.1| hypothetical protein CHGG_02005 [Chaetomium globosum CBS 148.51]
Length = 797
Score = 273 bits (697), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 198/607 (32%), Positives = 301/607 (49%), Gaps = 59/607 (9%)
Query: 99 LKEVSL-HDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWEN 156
L +SL + W+D Q + YL +DV+ L+++FR L T G A GGW+
Sbjct: 34 LSTISLTNSRWMDN-------QNRTVSYLKWVDVNRLLYNFRANHRLSTQGASANGGWDA 86
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAF 211
P R H GHYL+A A +AS + +++ + V L++CQ G GYLS F
Sbjct: 87 PNFPFRTHAQGHYLTAWAFCYASLRDTECRDRAAYFVAELAKCQKNNGAAGFSAGYLSGF 146
Query: 212 PTELFDSFEA--LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
P F + EA L PYY IHK +AGLLD + + A + + + +R K+
Sbjct: 147 PESEFAALEARTLNNGNVPYYAIHKTMAGLLDVWRHLGDTNARDVLLALAGWVDSRTGKL 206
Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
S ++ L E GGMNDVL L+ T D + L +A FD LA D L+
Sbjct: 207 ----SYQQMQSMLGTEFGGMNDVLADLHKQTKDERWLKVAQRFDHAAVFDPLAAGRDQLN 262
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
HANT +P IG+ + Y+ TG Y+ I ++ +H+YA GG S E + P +
Sbjct: 263 GLHANTQVPKWIGAALEYKATGSTRYRDIAKNAWELTVGAHTYAIGGNSQAEHFRPPNAI 322
Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRW-TKEIAYADYYERALTNGVLSIQR-GTEPGVMI 447
A L + E C TYNML+++R L+ AY D+YERAL N +L Q + G +
Sbjct: 323 AGYLQKDTAEACNTYNMLRLTRELWPLDAASTAYFDFYERALLNHLLGQQDPASHHGHVT 382
Query: 448 YMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
Y PL RGV A W T ++SFWCC GT +E+ +KL DSIYF +E L++
Sbjct: 383 YFTPLNPGGRRGVGPAWGGGTWSTDYDSFWCCQGTALETNTKLMDSIYFHDEA---ALFV 439
Query: 504 IQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
+ S W + +V + Q D P T T + + G+ L +R+P WT +
Sbjct: 440 NLFTPSVLKWAAQNVTVTQATDFPAGD-------TTTLTIGGQPGESWDLFVRIPSWT-T 491
Query: 563 NGAQASLNGQNLPLP-PPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
+ A+ S+NG+ + PG + +R W DK+T++LP++LRT D+ ++ A
Sbjct: 492 DQAEISVNGEKANIDTKPGTYAVIQDRAWKAGDKVTVRLPMTLRTVPANDN----PNVAA 547
Query: 621 ILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQ 680
+ +GP +L+G G+A SLS++ P+ + V + SG TF + +
Sbjct: 548 VAYGPVVLSG--------DYGSA-SLSSM-----PTLSLDSVR-REGSGGLTFTATAGGK 592
Query: 681 SITMEEF 687
++ ++ F
Sbjct: 593 TVKLKPF 599
>gi|390456178|ref|ZP_10241706.1| hypothetical protein PpeoK3_19346 [Paenibacillus peoriae KCTC 3763]
Length = 753
Score = 272 bits (695), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 190/540 (35%), Positives = 275/540 (50%), Gaps = 39/540 (7%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
LH V +D S L A + N YLL L+ D L+ FR+ A L Y GWE +
Sbjct: 9 DLHKVSID-SGPLCHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--ARGIS 65
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
GH +GHYLS + M+AST + + E+++ V+ L CQN G GY+S P E+F+ +
Sbjct: 66 GHTLGHYLSGCSLMYASTGDERLLERVNYVIDELEICQNSHGNGYISGIPRGKEIFEEVK 125
Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
A L W P YT+HK+ AGL D Y+L + +AL M + ++ ++ V
Sbjct: 126 AGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLVHHPKALPMEIKLGDW----LEDVFR 181
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
E+ L+ E GGMN+VL L + + + L LA F L LA D L+
Sbjct: 182 GLDDEQMQRVLHCEFGGMNEVLTDLAEHSGEERFLKLAERFYHGEVLNDLADSRDTLAGR 241
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
HANT IP +IG+ +YEVTG P Y + FF D V HSY GG S E + +P +L D
Sbjct: 242 HANTQIPKIIGAARQYEVTGKPHYADLSRFFWDRVVHKHSYVIGGNSYNEHFGEPGKLND 301
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
LG ETC TYNMLK++RH+F W AYADYYERA+ N +L+ Q+ + G + Y +
Sbjct: 302 RLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-GRVCYFVS 360
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
L G K+ + +++ F CC G+G+ES S G +IYF + Y+ QY+ S+
Sbjct: 361 LEMGGHKS-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTPQTI---YVNQYVPSTV 412
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
W V L Q+ + R TL SK+ Q ++ LR P W G +NG
Sbjct: 413 TWDEMDVQLKQE----TLFPQTGRGTLCVISKKP--QSFTIKLRCPYWA-EQGMIIKING 465
Query: 572 QNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ P +++ W D + +P+++R E + P+ A ++GP +LAG
Sbjct: 466 EAFAAEACPTSYVVIEREWKDGDTVEYDIPMTVRIEEM----PDNPRRIAFMYGPLVLAG 521
>gi|418517157|ref|ZP_13083324.1| hypothetical protein MOU_10158 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410706214|gb|EKQ64677.1| hypothetical protein MOU_10158 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 791
Score = 272 bits (695), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 190/569 (33%), Positives = 275/569 (48%), Gaps = 53/569 (9%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T + + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + + L+ E GG+N+ L+ T D + L LA
Sbjct: 222 AVGLAGY----LQGIFAALDAAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAKLFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
G+Y+ Y+ S+ +G LN + + + + + + +L L
Sbjct: 452 G---QGVYVNLYVPSTVRDAAG---LNMTLHSALPEQGSASLRIDGAPPAQ----RTLAL 501
Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
R+P WT Q LNGQ + +L T W D L++ + LR E+ DD P
Sbjct: 502 RVPGWTQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PA 558
Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
+ S +L GP +LA G + W KT
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKT 584
>gi|384418897|ref|YP_005628257.1| hypothetical protein XOC_1936 [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353461810|gb|AEQ96089.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 791
Score = 271 bits (694), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 191/574 (33%), Positives = 279/574 (48%), Gaps = 63/574 (10%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRIRAGYLVSELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DN QAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + + + L+ E GG+N+ L+ T+D + L LA
Sbjct: 222 AVGLAGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDTASGAAARFFWHTVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RH+++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHVYQWGPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM P+ G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSG-----HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL 549
G+YI Y+ S+ +G H L ++ + LR+ +++
Sbjct: 452 G---QGVYINLYVPSTVRDAAGLDMTLHSALPEQGSAL------LRIDAAPPAQR----- 497
Query: 550 SSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+L LR+P W Q LNGQ + +L T W D L++ + LR EA
Sbjct: 498 -TLALRVPGWAQQPRLQ--LNGQPVDTAASDGYLRITRVWQRGDTLSLSFDMPLRLEATP 554
Query: 610 DDRPEYASIQAILFGPYLLA---GHTSGEWDIKT 640
DD P + S +L GP +LA G + W KT
Sbjct: 555 DD-PAWVS---VLRGPLVLAVDLGDAAKPWSGKT 584
>gi|46113732|ref|XP_383116.1| hypothetical protein FG02940.1 [Gibberella zeae PH-1]
Length = 1393
Score = 271 bits (694), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 196/563 (34%), Positives = 272/563 (48%), Gaps = 50/563 (8%)
Query: 89 PGGFDLPGNF-LKEVSLHDV-WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT 146
P DL F L +VSL D W+D Q + YLL +D D L++ FRK L T
Sbjct: 25 PKVNDLADAFELSDVSLTDSRWMDN-------QGRTVNYLLSIDPDRLLYVFRKNHGLDT 77
Query: 147 PGKAY-GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK--- 202
G GGW+ P R H GH+L+A + +A+ N + S V L++CQ K
Sbjct: 78 KGATKNGGWDAPDFPFRSHVQGHFLTAWSNCYATLGNKECGSRASYFVKELAKCQAKNAK 137
Query: 203 --IGTGYLSAFPTELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
+GYLS FP E L PYY IHK LAGLLD Y + A + +
Sbjct: 138 AGFTSGYLSGFPESEIAKVENRTLNNGNVPYYAIHKTLAGLLDVYRRVGDNDAKAVMLSL 197
Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
+ R K+ S + + E GGMN+VL + T D K L +A FD
Sbjct: 198 AGWVDTRTGKL----SYAQMQQMMQTEFGGMNEVLADIAYYTQDNKWLKVAQRFDHAAIF 253
Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
L D LS HANT +P IG+ Y+V+GD Y IG D+ H+YA GG S
Sbjct: 254 DPLQNNVDKLSGLHANTQVPKWIGALREYKVSGDKKYLDIGRNAWDLTVHKHTYAIGGNS 313
Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSI 437
E + DP +A L S+ E C TYNMLK++R L+ + +Y D+YE AL N +L
Sbjct: 314 QAEHFRDPDAIAKYLTSDTCEACNTYNMLKLTRELWALDPSDASYFDFYENALMNHLLGQ 373
Query: 438 QRGTEP-GVMIYMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
Q + G + Y PL RGV A W T +NSFWCC G+GIE+ +KL DSIYF
Sbjct: 374 QNPKDNHGHVTYFTPLNPGGRRGVGPAWGGGTWSTDYNSFWCCQGSGIETNTKLMDSIYF 433
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-- 550
+ LY+ + S +W V + Q + SS ++G +
Sbjct: 434 HTKDT---LYVNLFTPSKLNWSQQQVSIIQTTE----------YPQKDSSTLQIGGKAGT 480
Query: 551 -SLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
+L +R+P WT + A +NGQ++ + PG + W+ DK+T+ LP+SLRT A
Sbjct: 481 WTLAVRIPSWT--SKASIQVNGQSVNVNATPGKYALVKRNWNSGDKVTVTLPMSLRTIAA 538
Query: 609 QDDRPEYASIQAILFGPYLLAGH 631
D+ + + A+ FGP +LA +
Sbjct: 539 NDN----SQVAAVAFGPVILAAN 557
>gi|393718114|ref|ZP_10338041.1| hypothetical protein SechA1_00115 [Sphingomonas echinoides ATCC
14820]
Length = 789
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 186/544 (34%), Positives = 269/544 (49%), Gaps = 51/544 (9%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A + N YLL L D L+ +FR A L G+ YGGWE+ + GH +GHY+SA +
Sbjct: 54 AVEVNRAYLLRLSADRLLHNFRAYAGLKPKGEVYGGWES--DTIAGHTLGHYMSALVLLH 111
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE-----LFDSFEA----------- 221
T +A K + +V L++ Q G GY+ A + + D+ E
Sbjct: 112 EQTGDAQAKRRADYIVDELADAQAARGNGYIGAMQRKRKDGTVVDAIEIFPEIIKGDIRS 171
Query: 222 ----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
L W+P+YT+HK+ AGLLD + NA+AL +A YF + V +
Sbjct: 172 GGFDLNGAWSPFYTVHKLFAGLLDIHASWGNAKALSVAIAFAGYF----EPVFAALDDAQ 227
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTH 336
L E GG+N+ L++ T D K L +A L+D+ A Q D L++FHANT
Sbjct: 228 MQTMLGTEYGGLNESFAELFARTKDRKWLAIAERLYDRKVLDPLTAGQ-DKLANFHANTQ 286
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
+P +IG +E+TG+P FF V HSY GG + RE++ +P ++ + +
Sbjct: 287 VPKLIGLARIHELTGEPAKAAAPRFFWQAVTKHHSYVIGGNADREYFSEPDSISRHITEQ 346
Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
E C TYNMLK++R L+ W + A DYYERA N V++ Q G YM PL G
Sbjct: 347 TCEHCNTYNMLKLTRQLYSWQPDGALFDYYERAHLNHVMAAQDPKTAG-FTYMTPLLTGA 405
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
+ ST + ++FWCC GTG+ES +K G+SI++E EG L + YI + W++
Sbjct: 406 VRGYST----SADDAFWCCVGTGMESHAKHGESIFWEGEG---ALLVNLYIPADATWRAR 458
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
L +D ++P TLT + G+ ++ LR+P W + A +NGQ +
Sbjct: 459 GATLT--LDTRYPFEPT--STLTLTQLARPGRF-AIALRVPGWA-AGKAVVRVNGQPVTP 512
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ-DDRPEYASIQAILFGPYLLA---GHT 632
+ RW D + I LPL LR EA DDR AIL GP +LA G T
Sbjct: 513 SFASGYAIVERRWKAGDSVAITLPLELRIEATPGDDR-----TVAILRGPMVLAADLGTT 567
Query: 633 SGEW 636
G+W
Sbjct: 568 EGDW 571
>gi|375306379|ref|ZP_09771677.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus sp.
Aloe-11]
gi|375081632|gb|EHS59842.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus sp.
Aloe-11]
Length = 753
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 192/548 (35%), Positives = 278/548 (50%), Gaps = 55/548 (10%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
LH V +D S L+ A + N YLL L+ D L+ FR+ A L Y GWE +
Sbjct: 9 DLHKVSID-SGPLYHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--ARGIS 65
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
GH +GHYLS + M+A+T + + E++S V+ L CQN G GY+S P E+F+ +
Sbjct: 66 GHTLGHYLSGCSLMYAATGDERLLERVSYVIDELEICQNNHGNGYISGIPRGKEIFEEVK 125
Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFY---- 263
A L W P YT+HK+ AGL D ++LA + +AL K+ W+ + F
Sbjct: 126 AGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLAHHPKALPIEIKLGAWLEDVFRGLDD 185
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
++Q+V L+ E GGMN+VL L + + + L LA F L LA
Sbjct: 186 EQMQRV------------LHCEFGGMNEVLTDLAEHSGEERFLKLAERFYHGEVLNDLAD 233
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
D L+ HANT IP +IG+ +YEVTG P Y + FF D V HSY GG S E +
Sbjct: 234 SRDTLAGRHANTQIPKIIGAARQYEVTGKPHYADLSRFFWDRVVHKHSYVIGGNSYNEHF 293
Query: 384 WDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
+P +L D LG ETC TYNMLK++RH+F W AYADYYERA+ N +L+ Q+ +
Sbjct: 294 GEPGKLNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD- 352
Query: 444 GVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
G + Y + L G K + +++ F CC G+G+ES S G +IYF + Y+
Sbjct: 353 GRVCYFVSLEMGGHKT-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTPQTI---YV 404
Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
QY+ S+ W V L Q+ + R TL SK+ Q ++ LR P W
Sbjct: 405 NQYVPSTVTWDDMDVQLKQE----TLFPQTGRGTLRVISKKP--QSFTIKLRCPHWA-EQ 457
Query: 564 GAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
G +NG+ P +++ W D + +P+++R E + P+ A +
Sbjct: 458 GMIIKINGEAFTAEACPTSYVVIEREWKDGDTVEYDIPMTVRIEEM----PDNPRRIAFM 513
Query: 623 FGPYLLAG 630
+GP +LAG
Sbjct: 514 YGPLVLAG 521
>gi|367031082|ref|XP_003664824.1| hypothetical protein MYCTH_55581 [Myceliophthora thermophila ATCC
42464]
gi|347012095|gb|AEO59579.1| hypothetical protein MYCTH_55581 [Myceliophthora thermophila ATCC
42464]
Length = 608
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 194/593 (32%), Positives = 287/593 (48%), Gaps = 54/593 (9%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
Q + YL +DVD L+++FR L T G + GGW+ P R H GH+L+A + +
Sbjct: 26 QNRTVTYLKWVDVDRLLYNFRANHGLSTQGARQNGGWDAPDFPFRTHVQGHFLTAWSHCY 85
Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA--LKPVWAPYY 230
AS + +++ + V L++CQ G GYLS FP FD+ EA L PYY
Sbjct: 86 ASLRDDACRDRATYFVAELAKCQANNDAVGFGAGYLSGFPESEFDALEARTLSNGNVPYY 145
Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
IHK +AGLLD + + A + + + +R ++ S E+ L E GGMN
Sbjct: 146 AIHKTMAGLLDVWRHVGDTTARDVLLALAGWVDSRTGRL----SYEQMQAVLGTEFGGMN 201
Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
DVL L T DP+ L +A FD LA + D L HANT +P IG+ + Y+ T
Sbjct: 202 DVLTELSLQTGDPRWLEVAQRFDHAAVFDPLASRQDRLDGLHANTQVPKWIGAVLEYKAT 261
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
G Y+ I + +HSYA GG S E + +P +A L + E C TYNML+++
Sbjct: 262 GTARYRDIAANAWNFTVGAHSYAIGGNSQAEHFHEPDAIAKYLLEDTAEACNTYNMLRLT 321
Query: 411 RHLFRWT-KEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKARSTHG 464
R L+ AY D+YERAL N +L Q +P G + Y PL RGV A
Sbjct: 322 RELWMLDPASTAYFDFYERALLNHLLGQQNPADPHGHVTYFTPLNPGGRRGVGPAWGGGT 381
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFE------EEGNVPGLYIIQYISSSFDWKSGHV 518
W T ++SFWCC GT +E+ +KL DSIY+ ++ L++ + S W V
Sbjct: 382 WSTDYDSFWCCQGTALETNTKLMDSIYWHDDDDDADDDGAANLWVNLFTPSVLRWTERGV 441
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
L Q+ D T+T + E +++R+P WT S GA+ +NG+ +
Sbjct: 442 TLTQETAFPAGSD-----TITLTVGGEPTGGWDMHVRIPSWTTS-GAEVLVNGEKAGVAA 495
Query: 579 --PGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
PG ++S R W D +T++LP++LRT A D+ + A+ +GP +L+G
Sbjct: 496 AVPGTYVSIRGRDWKAGDVVTVRLPMTLRTVAANDN----PGVAALAYGPVVLSG----- 546
Query: 636 WDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNS-TFVMSNSNQSITMEEF 687
D + + SL L L + + GN F + QS+T+ F
Sbjct: 547 -DYGSASLASLPTL----------DLDSVRRAKGNGLVFTATADGQSVTLGPF 588
>gi|84624616|ref|YP_451988.1| hypothetical protein XOO_2959 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84368556|dbj|BAE69714.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 791
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 200/627 (31%), Positives = 291/627 (46%), Gaps = 86/627 (13%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DN QAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + + + L+ E GG+N+ L+ T+D + L LA
Sbjct: 222 AVGLAGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++ H+++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKCLTEQTCEHCASYNMLKLTCHVYQWCPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM P+ G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSG-----HVVLNQKVDPIVSWD---PYLRMTLTFSSKQEV 546
G+YI Y+ S+ +G H L ++ + D P RM
Sbjct: 452 G---QGVYINLYVPSTVRDAAGLDMTLHSALPEQGSASLRIDAAPPEQRM---------- 498
Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
L LR+P W Q LNGQ + +L T W D L++ + LR E
Sbjct: 499 -----LALRVPGWAQQPRLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLE 551
Query: 607 AIQDDRPEYASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQ 660
A DD P + S +L GP +LA G + W KT + + + P+P
Sbjct: 552 ATPDD-PAWVS---VLRGPLVLAVDLGDAAKPWSGKTPALIGGQDILQRLQPVP------ 601
Query: 661 LVTFTQESGNSTFVMSNSNQSITMEEF 687
GN+ FV ++ Q + F
Sbjct: 602 --------GNTAFVYNDGLQQWQLSPF 620
>gi|330467876|ref|YP_004405619.1| hypothetical protein VAB18032_19585 [Verrucosispora maris
AB-18-032]
gi|328810847|gb|AEB45019.1| hypothetical protein VAB18032_19585 [Verrucosispora maris
AB-18-032]
Length = 913
Score = 271 bits (692), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 206/628 (32%), Positives = 296/628 (47%), Gaps = 78/628 (12%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPISELRGHFV 166
WLD Q L YL +DV+ L+++FR L T G A GGWE P R H
Sbjct: 63 WLDN-------QNRTLNYLRFVDVNRLLYNFRANHRLSTAGAAALGGWEAPTFPFRTHSQ 115
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
GH+L+A + MWA + T ++K + +V L++CQ GYL +P F + EA
Sbjct: 116 GHFLTAWSHMWAVLGDTTCRDKANYMVAELAKCQANNAAAGFNPGYLCGYPESDFTAVEA 175
Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSV 275
L PYYTIHK L GLLD + N QA L +A W V++ R+ M ++
Sbjct: 176 RTLNNGNVPYYTIHKTLVGLLDVWRHIGNNQARDVLLALAGW-VDWRTGRLSSA-QMQAM 233
Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
L E GGMN VL LY T D + L +A FD LA D L+ HANT
Sbjct: 234 ------LGTEFGGMNAVLTDLYQQTGDARWLTVAQRFDHAAVFNPLAANQDQLNGLHANT 287
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
IP IG+ ++ TG Y+ I + ++ + +YA GG S E + P ++ L +
Sbjct: 288 QIPKWIGAAREFKATGTTRYRDIASNAWNLTVNTRTYAIGGNSQAEHFRAPNAISGYLRN 347
Query: 396 ENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG 453
+ E C TYNMLK++R L+ +AY D+YERAL N ++ Q + G + Y PL
Sbjct: 348 DTCEHCNTYNMLKLTRELWLLDPNRVAYFDFYERALLNHLIGAQNPADNHGHITYFTPLQ 407
Query: 454 ----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
RGV A W T +NSFWCC GTG+E+ + L DSIYF N L + ++ S
Sbjct: 408 PGGRRGVGPAWGGGTWSTDYNSFWCCQGTGLENNTTLMDSIYFH---NGSTLTVNLFMPS 464
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
+W + + Q S T T + VG ++ +R+P WT A S+
Sbjct: 465 VLNWSQRGITVTQSTSYPAS------DTSTLTVTGTVGGSWTMRIRIPAWTQD--ATVSV 516
Query: 570 NG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
NG QN+ PG + S T W+ D +T++LP+ + E D+ S+ A+ +GP +
Sbjct: 517 NGTVQNIAT-TPGTYASLTRTWTSGDTVTVRLPMRVVVEPTNDN----PSVVALTYGPAV 571
Query: 628 LAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEF 687
L+G+ +LSAL P+ VT T + TF + +N + + F
Sbjct: 572 LSGNYGNT---------ALSAL-----PALATASVTRTSSTA-LTFTATANNTQVNLLPF 616
Query: 688 ------------PVSGTDAALHATFRLI 703
G+ ATFRL+
Sbjct: 617 YDAHGHNYTVYWSSGGSSGPAQATFRLV 644
>gi|375308065|ref|ZP_09773352.1| hypothetical protein WG8_1877 [Paenibacillus sp. Aloe-11]
gi|375080396|gb|EHS58617.1| hypothetical protein WG8_1877 [Paenibacillus sp. Aloe-11]
Length = 759
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 177/544 (32%), Positives = 282/544 (51%), Gaps = 38/544 (6%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT-PGKAYGGWENP 157
L +S V L+ S+L AQ L++LL ++ D ++++FRK ASL T A GW++
Sbjct: 185 LHGISTQKVHLEGPSLLKSAQNRRLQFLLTVNDDQMLYNFRKAASLDTLNAPAMIGWDSD 244
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQ------NKIGTGYLSAF 211
S L+GH GHYLSA A +AST N I +K++ +V L++ Q ++ G+LSA+
Sbjct: 245 ESLLKGHTTGHYLSALALCYASTGNERIHQKLAYLVDELNKVQLAFEADDRYHYGFLSAY 304
Query: 212 PTELFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
E FD E +WAPYYT+HKILAGLLD Y +A AL +A + ++ YNR+
Sbjct: 305 SEEQFDLLEVYTRYPEIWAPYYTLHKILAGLLDSYHIAGIELALAIADKVGDWIYNRLS- 363
Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
V+ +++ W + E GG+N+ L L++ T H+ A LFD + Q D
Sbjct: 364 VLPHEQLKKMWGLYIAGEFGGINESLAELFTYTQKEHHIAAAKLFDNDRLFFPMEQQVDA 423
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L HAN HIP ++G+ +E TG+ Y I FF + V +H Y+ GGT E + P
Sbjct: 424 LGAMHANQHIPQIVGAFKIFEATGEQKYYDIAKFFWESVVNAHIYSIGGTGEGEMFKQPH 483
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
++ L ETC +YN+LK+++ L+ + + Y DYYER + N +LS G
Sbjct: 484 KIGTHLTEHTAETCASYNLLKLTKQLYVYENDAKYMDYYERTMLNHILSSTDHECLGAST 543
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y +P G K NS CC+GTG+E+ K ++I+FE +V LY+ ++
Sbjct: 544 YFMPTSPGGQKGYDEE------NS--CCHGTGLENHFKYAEAIFFE---DVDSLYVNLFV 592
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRM-TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
++ + + + + Q V I + + + + TLT ++L +R+P W +
Sbjct: 593 PAALNDEGKGLQVVQSVPEIFNGEVEIHIETLT---------RTNLRVRIPYW-HQGEIT 642
Query: 567 ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
+N + +L ++ W+ D++T++ LR E P+ A I ++ FGPY
Sbjct: 643 TFVNHTKVNTIEENGYLVLSQEWNKGDQVTMKFTPRLRLEHT----PDKADIASLAFGPY 698
Query: 627 LLAG 630
+LA
Sbjct: 699 ILAA 702
>gi|58582735|ref|YP_201751.1| hypothetical protein XOO3112 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|188577523|ref|YP_001914452.1| hypothetical protein PXO_01470 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|58427329|gb|AAW76366.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|188521975|gb|ACD59920.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
PXO99A]
Length = 783
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 200/627 (31%), Positives = 291/627 (46%), Gaps = 86/627 (13%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 38 PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 95
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 96 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRK 153
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DN QAL++
Sbjct: 154 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQV 213
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + + + L+ E GG+N+ L+ T+D + L LA
Sbjct: 214 AVGLAGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHH 269
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 270 HAVLDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 329
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++ H+++W + DYYER L N V
Sbjct: 330 GGNGDREYFQQPDSISKCLTEQTCEHCASYNMLKLTCHVYQWGPQAELFDYYERTLLNHV 389
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM P+ G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 390 MA-QQHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 443
Query: 495 EGNVPGLYIIQYISSSFDWKSG-----HVVLNQKVDPIVSWD---PYLRMTLTFSSKQEV 546
G+YI Y+ S+ +G H L ++ + D P RM
Sbjct: 444 G---QGVYINLYVPSTVRDAAGLDMTLHSALPEQGSASLRIDAAPPEQRM---------- 490
Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
L LR+P W Q LNGQ + +L T W D L++ + LR E
Sbjct: 491 -----LALRVPGWAQQPRLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLE 543
Query: 607 AIQDDRPEYASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSFNAQ 660
A DD P + S +L GP +LA G + W KT + + + P+P
Sbjct: 544 ATPDD-PAWVS---VLRGPLVLAVDLGDAAKPWSGKTPALIGGQDILQRLQPVP------ 593
Query: 661 LVTFTQESGNSTFVMSNSNQSITMEEF 687
GN+ FV ++ Q + F
Sbjct: 594 --------GNTAFVYNDGLQQWQLSPF 612
>gi|342872240|gb|EGU74628.1| hypothetical protein FOXB_14856 [Fusarium oxysporum Fo5176]
Length = 616
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 197/557 (35%), Positives = 269/557 (48%), Gaps = 59/557 (10%)
Query: 99 LKEVSLHDV-WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWEN 156
L +VSL D W+D Q L YLL +D D L++ FRK + T G + GGW+
Sbjct: 34 LTQVSLTDSRWMDN-------QNRTLNYLLSVDPDRLLYVFRKNHGVDTKGAQTNGGWDA 86
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAF 211
P R H GH+LSA Q +AS + + V L++CQ GYLS F
Sbjct: 87 PDFPFRSHVQGHFLSAWTQCYASAGVKECGSRATYFVQELAKCQANNAKAGFNKGYLSGF 146
Query: 212 PTELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWM----VEY 261
P E L PYY IHK LAGLLD Y + A L +A+W+ +
Sbjct: 147 PESDITKVEDRTLNNGNVPYYAIHKTLAGLLDVYRRLGDQTAKDTMLSLASWVDTRTSKL 206
Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
YN++Q + L E GGMN+VL + T D K L +A FD L
Sbjct: 207 SYNQMQSM------------LQTEFGGMNEVLADIAFYTKDAKWLKVAQRFDHAVIFDPL 254
Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
D LS HANT +P IG+ Y+V GD Y IG ++V H+YA GG S E
Sbjct: 255 QQNVDKLSGLHANTQLPKWIGALREYKVGGDKKYLDIGRNAWNMVVNKHTYAIGGNSQAE 314
Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQR- 439
+ P +A L + E C +YNMLK++R L+ + +Y D+YE+AL N +L Q
Sbjct: 315 HFRAPDAIAGFLTDDTCEACNSYNMLKLTRELWALNPTDASYFDFYEKALLNHLLGQQDP 374
Query: 440 GTEPGVMIYMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEE 495
++ G + Y PL RGV A W T +NSFWCC GTG+E+ +KL DSIYF
Sbjct: 375 SSDHGHVTYFTPLKAGGRRGVGPAWGGGTWSTDYNSFWCCQGTGVETNTKLMDSIYFHTS 434
Query: 496 GNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
LY+ + S +W V + Q D S T TF + + +L +R
Sbjct: 435 DT---LYVNLFTPSKLNWSQKKVSVTQTTDFPES------DTSTFKISGDTSEW-TLAVR 484
Query: 556 MPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
+P WT + A +NGQ + PG + +W D +T+QLP+SL T A DD+
Sbjct: 485 IPSWT--SKASIKVNGQAANVAVQPGKYALIKRQWKSGDTVTVQLPMSLHTVAANDDQ-- 540
Query: 615 YASIQAILFGPYLLAGH 631
++ AI FGP +LAG+
Sbjct: 541 --TLGAIAFGPVILAGN 555
>gi|386837867|ref|YP_006242925.1| hypothetical protein SHJG_1777 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374098168|gb|AEY87052.1| hypothetical protein SHJG_1777 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451791159|gb|AGF61208.1| hypothetical protein SHJGH_1542 [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 769
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 185/556 (33%), Positives = 273/556 (49%), Gaps = 47/556 (8%)
Query: 94 LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYG 152
LP +F + WLD Q YL +DVD L+++FR L T G A G
Sbjct: 48 LPFDFGQVRLTASRWLDN-------QDRAAAYLRFVDVDRLLYNFRANHRLSTGGASATG 100
Query: 153 GWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGY 207
GW+ P R H GH+L+A AQ++A T +A ++K +V L++CQ G GY
Sbjct: 101 GWDAPTFPFRSHVQGHFLTAWAQLYAVTGDAVARDKALYMVAELAKCQANNGAAGFGAGY 160
Query: 208 LSAFPTELFDSFEA--LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
LS +P F + EA L+ PYYT+HK ++GLLD + + QA + + + R
Sbjct: 161 LSGYPESDFTALEAGTLRNGNVPYYTVHKTMSGLLDVWRHLGSTQARDVLLALAGWVDAR 220
Query: 266 VQKVIT--MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
++ T M +V L E GGMN VL LY T D + L +A FD LA
Sbjct: 221 TGRLTTAQMQAV------LGTEFGGMNAVLADLYQQTGDARWLTVAQRFDHAAVFDPLAA 274
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
D L+ HANT +P IG+ Y+ TG Y+ I T + SH+YA GG S E +
Sbjct: 275 NQDALAGLHANTQVPKWIGAVRAYKATGITRYRDIATNAWNHCVGSHTYAIGGNSQAEHF 334
Query: 384 WDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTE 442
P +A L + E+C + NML ++R LF T + +A DYYE+A N ++ Q +
Sbjct: 335 RAPNAIAAYLADDTCESCNSVNMLTLTRELFTLTPDRVALFDYYEQAWLNHIIGNQNPAD 394
Query: 443 P-GVMIYMLPL----GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
P G + Y PL RGV A W T + +FWCC GTG+E ++L DS+YF
Sbjct: 395 PHGHITYFTPLRPGGRRGVGPAWGGGTWSTDYTTFWCCQGTGVEIHTRLMDSVYFHSGTT 454
Query: 498 VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
L + ++ S W + + Q S LR+T +VG ++ +R+P
Sbjct: 455 ---LTVNMFVPSVLTWTQRGITVTQTTSYPASDTTTLRVT------GDVGGTWAMRVRIP 505
Query: 558 VWTYSNGAQASLNG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
WT GA S+NG QN+P G++ + W+ D +T++LP+ D+
Sbjct: 506 GWT--TGASVSVNGVVQNIP-AATGSYATLDRAWASGDTVTVRLPMRTALRPANDN---- 558
Query: 616 ASIQAILFGPYLLAGH 631
++ A+ +GP +LAG+
Sbjct: 559 PNVSAVTYGPVVLAGN 574
>gi|381170950|ref|ZP_09880102.1| Tat (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas citri pv. mangiferaeindicae LMG
941]
gi|380688673|emb|CCG36589.1| Tat (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas citri pv. mangiferaeindicae LMG
941]
Length = 791
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 189/569 (33%), Positives = 277/569 (48%), Gaps = 53/569 (9%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T + + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + ++ + L+ E GG+N+ L+ T D + L LA
Sbjct: 222 AVDLAGY----LQGIFSVLDDTQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L+H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLIAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHAVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
G+Y+ Y+ S+ +G LN + + + + + + +L L
Sbjct: 452 G---QGVYVNLYVPSTVRDAAG---LNMTLHSALPEQGSASLRIDGAPPAQ----RTLAL 501
Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
R+P W Q LNGQ + +L T W D L++ + LR E+ DD P
Sbjct: 502 RVPGWAQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PA 558
Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
+ S +L GP +LA G + W KT
Sbjct: 559 WVS---VLRGPLVLAADLGDAAKPWSGKT 584
>gi|390993493|ref|ZP_10263643.1| TAT (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|372551771|emb|CCF70618.1| TAT (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas axonopodis pv. punicae str. LMG
859]
Length = 791
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 190/569 (33%), Positives = 274/569 (48%), Gaps = 53/569 (9%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T + + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDTQCRTRTRYLVSELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q V + L+ E GG+N+ L+ T D + L LA
Sbjct: 222 AVALAGY----LQGVFAALEDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
G+Y+ Y+ S+ +G LN + + + + + + +L L
Sbjct: 452 G---QGVYVNLYVPSTVRDAAG---LNMTLHSALPEQGSASLRIDGAPPAQ----RTLAL 501
Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
R+P W Q LNGQ + +L T W D L++ + LR E+ DD P
Sbjct: 502 RVPGWAQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PA 558
Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
+ S +L GP +LA G + W KT
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKT 584
>gi|418520534|ref|ZP_13086583.1| hypothetical protein WS7_05828 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410703915|gb|EKQ62403.1| hypothetical protein WS7_05828 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 791
Score = 269 bits (687), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 190/569 (33%), Positives = 274/569 (48%), Gaps = 53/569 (9%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T + + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDTQCRTRTRYLVSELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DNAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPSPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q V + L+ E GG+N+ L+ T D + L LA
Sbjct: 222 AVALAGY----LQGVFAALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
G+Y+ Y+ S+ +G LN + + + + + + +L L
Sbjct: 452 G---QGVYVNLYVPSTVRDAAG---LNMTLHSALPKQGSASLRIDGAPPAQ----RTLAL 501
Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
R+P W Q LNGQ + +L T W D L++ + LR E+ DD P
Sbjct: 502 RVPGWAQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PA 558
Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
+ S +L GP +LA G + W KT
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKT 584
>gi|115399582|ref|XP_001215378.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114192261|gb|EAU33961.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 614
Score = 269 bits (687), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 196/552 (35%), Positives = 266/552 (48%), Gaps = 49/552 (8%)
Query: 99 LKEVSLHD-VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWEN 156
L E+SL D +LD Q+ L YL +D + L+ +FR L T G A GGW+
Sbjct: 31 LSELSLGDGRFLDN-------QERTLSYLKFVDTERLLLNFRANHKLDTKGAVANGGWDA 83
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAF 211
P R H GH+L+A AQ +A + +E+ + V L++CQ TGYLS F
Sbjct: 84 PTFPFRTHVQGHFLTAWAQCYAVLGDTDCQERATYFVSELAKCQANNEAAGFKTGYLSGF 143
Query: 212 PTELFDSFEA--LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
P FD+ EA L PYY IHK LAGLLD + L + A + + + R +
Sbjct: 144 PESDFDALEAGTLNNGNVPYYNIHKTLAGLLDVWRLVGDTTARDVLLALAGWVDTRTSAL 203
Query: 270 --ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
M SV L E GGMNDVL LY T D K L A FD LA D
Sbjct: 204 SEAQMQSV------LGTEFGGMNDVLADLYHQTSDEKWLKTAQRFDHAAVFDPLAANEDQ 257
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L+ HANT +P IG+ Y+ TGD Y I I +H+YA G S E + P
Sbjct: 258 LNGLHANTQVPKWIGAVREYKATGDTRYLDIARNAWTITVNAHTYAIGANSQAEHFHAPN 317
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEP-GV 445
+A L S+ E C +YNMLK++R L+ E Y D+YE AL N +L Q + G
Sbjct: 318 AIAQYLDSDTAEACNSYNMLKLTRELWTLDPENTTYFDFYENALLNHLLGQQNPADSHGH 377
Query: 446 MIYMLPL----GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
+ Y L RGV A W T ++SFWCC GT +E+ +KL DSI+F + L
Sbjct: 378 ITYFTSLNPGGNRGVGPAWGGGTWSTDYDSFWCCQGTALETNTKLMDSIFFHSDS---AL 434
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
Y+ Q+I S W V + Q VS +TL + L +R+P WT
Sbjct: 435 YVNQFIPSVLTWSEKGVKVTQSTTFPVS----DTITLDIDGNGDW----ELYVRIPSWT- 485
Query: 562 SNGAQASLNGQNLP--LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
+ A ++NG+ + PG++ W+ DK+ IQLP+ LRT DD S+
Sbjct: 486 -SNAAITINGEQVTDVDVSPGSYAKIARTWASGDKVQIQLPMHLRTVPANDD----PSLM 540
Query: 620 AILFGPYLLAGH 631
AI +GP +L+G+
Sbjct: 541 AIAYGPVILSGN 552
>gi|308067040|ref|YP_003868645.1| hypothetical protein PPE_00225 [Paenibacillus polymyxa E681]
gi|305856319|gb|ADM68107.1| DUF1680 domain containing protein [Paenibacillus polymyxa E681]
Length = 752
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 201/598 (33%), Positives = 301/598 (50%), Gaps = 58/598 (9%)
Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
K LH V +D S L A + N YLL L+ D L+ FR+ A L Y GWE
Sbjct: 4 KAFDLHKVRID-SGPLLHAMELNTAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--AR 60
Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFD 217
+ GH +GHYLS A M+AST + + E+++ VV L CQN G GY+S P E+F+
Sbjct: 61 GISGHTLGHYLSGCALMFASTGDERLLERVNYVVDELEICQNSHGNGYISGIPRGKEIFE 120
Query: 218 SFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFY- 263
+A L W P YT+HK+ AGL D ++ A + +AL K+ W+ +
Sbjct: 121 EVKAGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLPAHHPKALSIEIKLGNWLEDVLQG 180
Query: 264 ---NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF 320
++VQ+V L+ E GGMN+VL L + + + L LA F L
Sbjct: 181 LDDDQVQQV------------LHCEFGGMNEVLTDLAEHSGEERFLSLAERFYHGEVLND 228
Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR 380
LA D L+ HANT IP +IG+ ++E+TG P Y + FF D V HSY GG S
Sbjct: 229 LADSQDTLAGRHANTQIPKIIGAARQFEMTGKPQYADLSRFFWDRVVHKHSYVIGGNSYN 288
Query: 381 EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRG 440
E + +P +L D LG ETC TYNMLK++RH+F W AYADYYERA+ N +L+ Q+
Sbjct: 289 EHFGEPGKLNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQP 348
Query: 441 TEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
+ G + Y + L G K+ + +++ F CC G+G+ES S G +IYF +
Sbjct: 349 VD-GRVCYFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTPETI-- 400
Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
Y+ QY+ S+ W V L Q D + + R TL SK+ + ++ LR P W
Sbjct: 401 -YVNQYVPSTVTWDEMGVQLKQ--DTLFPQNG--RGTLRVISKEP--KSFAIKLRCPHWA 453
Query: 561 YSNGAQASLNGQN-LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
G +NG+ + P +++ WS D + +P+++R E + P+
Sbjct: 454 -EQGMMIKINGEKYVTEACPTSYVVMEREWSNGDTIEYDIPMTVRVEEM----PDNPRRV 508
Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN 677
A ++GP +LAG G + ++ L++++ S +L+ E +TF M++
Sbjct: 509 AFMYGPLVLAGDL-GPVEQESNEEHLLASVLIGSADSLTTKLIADGNEP--NTFHMTD 563
>gi|373954098|ref|ZP_09614058.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373890698|gb|EHQ26595.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 787
Score = 268 bits (685), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 190/550 (34%), Positives = 282/550 (51%), Gaps = 59/550 (10%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
+L DV L +S +A + + YLL ++ D L+ FR + L GK Y GWE+ S L
Sbjct: 49 NLKDVKL-LNSPFKQAMEVDAAYLLSIEPDRLLSGFRAHSGLKPKGKMYEGWES--SGLA 105
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA- 221
GH +GHYLSA + +A+T + ++++ +V L ECQ TGY+ A P E D+ A
Sbjct: 106 GHTLGHYLSAISMHYAATRDPEFLKRVNYIVKELGECQVARKTGYVGAIPKE--DTVWAE 163
Query: 222 ------------LKPVWAPYYTIHKILAGLLDQYVLADNAQALK----MATWMVEYFYN- 264
L W+P+YT+HK++AGLLD ++ ++ QAL MA W E N
Sbjct: 164 VAKGDIRSRGFDLNGGWSPWYTVHKVMAGLLDAFLYCNSTQALHVCKGMADWTGETLKNL 223
Query: 265 ---RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
++QK++ E GGM + L LY+I + K+L L++ F L L
Sbjct: 224 DDEKLQKMLLC------------EYGGMAETLVNLYAINGNKKYLDLSYKFYDKRILDPL 271
Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
A Q D L H+NT IP +I S RYE+ GD K I FF + + +HSYATGG S E
Sbjct: 272 ANQQDILPGKHSNTQIPKIIASARRYELNGDKKDKAIAEFFWETIVNNHSYATGGNSNYE 331
Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
+ +P +L D L ETC TYNMLK++RHLF DYYE+AL N +L+ Q
Sbjct: 332 YLSEPNKLNDKLTENTTETCNTYNMLKLTRHLFALEPSAKLMDYYEKALYNHILASQ-NH 390
Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
E G+M Y +PL G K S + F++F CC G+G+E+ K +SIYF G L
Sbjct: 391 ETGMMCYFVPLRMGGKKEYS-----SPFDTFTCCVGSGMENHVKYNESIYF--RGADGSL 443
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
Y+ +I S +WK + + Q+ + S T + ++ +R P W
Sbjct: 444 YVNLFIPSVLNWKEKGLSITQESNLPQS------DKTTLTVTTLKPVAMAIRVRKPKW-- 495
Query: 562 SNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
++ +NG+ + +L +W NDK+ +P ++ TEA+ P+ A+ +A
Sbjct: 496 ADNTTVGVNGKKQQVTADAQGYLVINRKWKNNDKIEFIMPENIHTEAM----PDNANRRA 551
Query: 621 ILFGPYLLAG 630
+ +GP LLAG
Sbjct: 552 VFYGPVLLAG 561
>gi|336319285|ref|YP_004599253.1| hypothetical protein Celgi_0157 [[Cellvibrio] gilvus ATCC 13127]
gi|336102866|gb|AEI10685.1| protein of unknown function DUF1680 [[Cellvibrio] gilvus ATCC
13127]
Length = 1577
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 194/579 (33%), Positives = 284/579 (49%), Gaps = 66/579 (11%)
Query: 93 DLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWS-FRKTASLPTPGKAY 151
DL + L++ L D++L + L A EYLL L + ++ +R PT Y
Sbjct: 362 DLTEHALQDSGLEDLYL-TDAYLTNAAAKEHEYLLSLSSEKFLYEWYRNVGLTPTTTSGY 420
Query: 152 GGWE-NPISELRGHFVGHYLSASAQMWASTHNAT----IKEKMSTVVFSLSECQNKIGT- 205
GGWE + ++ RGH GHY+SA +Q +++T +AT + E++ V L+ Q+
Sbjct: 421 GGWERSDVTNFRGHAFGHYMSALSQSYSATADATTKAALLEQVEDAVAGLTLVQDTYAAA 480
Query: 206 -----GYLSAFPTELFDSFEAL----KPVWAPYYTIHKILAGLLD--QYVL-ADNAQALK 253
GY+SAFP D+ + V P+Y +HK+LAGLLD YV A AQAL
Sbjct: 481 HPASAGYVSAFPESALDAVDGTGTTTDKVLVPWYNLHKVLAGLLDIHDYVGGATGAQALD 540
Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
+A+ EY Y R+ ++ + R Y GGMND LYRLY +T DP A FD
Sbjct: 541 IASQFGEYTYQRISRLTDRTRMLRTEY------GGMNDALYRLYDLTDDPHVKTAAEAFD 594
Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEV-TGD---------------PLYKL 357
+ LA D L+ HANT IP +IG+ RY V T D P Y
Sbjct: 595 ETALFTQLAAGQDVLNGKHANTTIPKLIGALKRYTVFTSDADRLASLTEAERAQLPTYLA 654
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRL-------ADTLGSENEETCTTYNMLKVS 410
F I H+YATG S E + DP L +T ++ ETC YNMLK+S
Sbjct: 655 AAEEFWQITVDHHTYATGSNSQSEHFHDPDSLHEFATQQGETGNAQTSETCNEYNMLKLS 714
Query: 411 RHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFN 470
R LF+ TK++ YA YYE N VL+ Q + G+ Y P+ G + + +
Sbjct: 715 RELFKLTKDVKYAHYYENTFINTVLASQN-PDTGMTTYFQPMAAGYDRI-----YSMPYT 768
Query: 471 SFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSW 530
FWCC GTG+ESFSKLGDS+YF + +V Y+ + SS FD+ ++ L Q+ D +
Sbjct: 769 EFWCCTGTGMESFSKLGDSMYFTDRRSV---YVTMFFSSRFDYAEQNLRLTQEAD--LPS 823
Query: 531 DPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWS 590
D + + +V ++L LR+P W A ++NG+ + P E +
Sbjct: 824 DDTVTFRVAAIDGDQVADGTTLRLRVPQW-IDGAATLTVNGEAV-TPQVVRGFVVLEGVA 881
Query: 591 YNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
D +T ++P+ ++ A D+ P +A A +GP +L+
Sbjct: 882 AGDVITYRMPMKVQAHAAPDN-PTWA---AFSYGPVVLS 916
>gi|429858822|gb|ELA33628.1| secreted protein [Colletotrichum gloeosporioides Nara gc5]
Length = 623
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 187/542 (34%), Positives = 273/542 (50%), Gaps = 52/542 (9%)
Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTP-GKAYGGWENPISELRGHFVGHYL 170
S L+ Q L YL +DV+ L+++FRK L T +A GGW+ P R HF GH+L
Sbjct: 51 SGRLFDNQARTLTYLKWVDVERLLYNFRKNHGLSTNNAQANGGWDAPDFPFRTHFQGHFL 110
Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE--ALK 223
+A A +A H+ K++ + L +CQ TGYLS FP + E +L
Sbjct: 111 NAWAFCYAQLHDTECKDRATYFAAELKKCQANNANVGFNTGYLSGFPESEITAVEDRSLS 170
Query: 224 PVWAPYYTIHKILAGLLD--QYVLADNAQA--LKMATWMV----EYFYNRVQKVITMYSV 275
PYY IHK +AGLLD +++ NA+ L+MA W+ + Y ++Q +++
Sbjct: 171 NGNVPYYAIHKTMAGLLDVWRHIGDTNARDVLLEMAAWVDLRTGKLTYAQMQNMMST--- 227
Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
E GGMN+V+ ++ T D + L +A FD LA D L+ HANT
Sbjct: 228 ---------EFGGMNEVMADIFHQTGDQRWLTVAQRFDHAAIFDPLASNQDSLNGLHANT 278
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+P IG+ Y+ TG Y+ I +I ++HSYA GG S E + P +A L S
Sbjct: 279 QVPKWIGASREYKATGTSRYQDIARNAWNITVSAHSYAIGGNSQAEHFRLPNAIAGFLNS 338
Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEP-GVMIYMLPLG 453
+ E C TYNMLK++R L+ Y D+YERAL N +L Q ++ G + Y PL
Sbjct: 339 DTCEACNTYNMLKLTRELWLTNPSATHYFDFYERALLNHLLGQQDPSDSHGHITYFTPLN 398
Query: 454 ----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
RGV A W T ++SFWCC GTG+E+ +KL DSIYF + LY+ ++ S
Sbjct: 399 PGGRRGVGPAWGGGTWSTDYDSFWCCQGTGLETNTKLMDSIYFYDNS---ALYVNLFVPS 455
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
W V + Q D + R T GQ +L +R+P WT +GAQ ++
Sbjct: 456 VLRWTQRGVTVTQTTD-------FPRGDTTTLKVSGSGQW-TLRVRIPSWT--SGAQVTV 505
Query: 570 NGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
NGQ + G + + W+ D + + LP+ L+T A D+ SI A+ FGP +L+
Sbjct: 506 NGQAV-TATSGAYAAIDRTWADGDTVVVTLPMKLQTIAANDN----PSIAALAFGPVILS 560
Query: 630 GH 631
G+
Sbjct: 561 GN 562
>gi|290955577|ref|YP_003486759.1| hypothetical protein SCAB_10131 [Streptomyces scabiei 87.22]
gi|260645103|emb|CBG68189.1| putative secreted protein [Streptomyces scabiei 87.22]
Length = 786
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 196/585 (33%), Positives = 285/585 (48%), Gaps = 59/585 (10%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPISELRGHFV 166
WLD Q YL +DVD L+++FR T L T G GGW+ P R H
Sbjct: 81 WLDN-------QNRTQNYLRFIDVDRLLYNFRATHKLSTNGATPNGGWDAPNFGFRTHIQ 133
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
GH+L+A AQ++A T + T ++K + +V L++CQ TGYLS +P F + E
Sbjct: 134 GHFLTAWAQLYAVTGDTTCRDKATRMVAELAKCQANNSAAGFNTGYLSGYPESNFTALEQ 193
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRV--QKVITMYSV 275
YYTIHK L GLLD + L + QA L +A W V++ R+ Q++ TM +
Sbjct: 194 GTSGEVLYYTIHKTLTGLLDVWRLIGSTQARDVLLALAGW-VDWRTGRLTGQQMQTMLRI 252
Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
E GGMN VL LY T D + L +A FD LA D L+ HANT
Sbjct: 253 E---------FGGMNTVLTDLYQQTGDARWLTVAQRFDHAAVFDPLAANQDKLNGLHANT 303
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+P IG+ Y+ TG Y+ I T +I A+H+YA GG S E + P +A L +
Sbjct: 304 QVPKWIGAAREYKATGTTRYRDIATNAWNITVAAHTYAIGGNSQAEHFRAPNAIAGFLNN 363
Query: 396 ENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLG 453
+ E+C T NML ++R L+ + + DYYERA N ++ Q + G + Y PL
Sbjct: 364 DTCESCNTVNMLTLTRELYTLDPDRVELFDYYERAWLNQMIGQQNPADDHGHVTYFTPLK 423
Query: 454 ----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
RGV A W T + SFWCC GTG+E ++L DSIYF N L + ++ S
Sbjct: 424 PGGRRGVGPALGGGTWSTDYGSFWCCQGTGLEMHTRLMDSIYFH---NDTTLTVNMFVPS 480
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
W + + Q S L++T + S ++ +R+P WT GA S+
Sbjct: 481 VLTWTERGITVTQTTTYPTSDTTTLQVTGSVSGTW------AMRIRIPGWT--TGAAVSV 532
Query: 570 NG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
NG QN+ PG++ + W+ D +T++LP+ + D+ A++ AI +GP +
Sbjct: 533 NGVAQNIT-TTPGSYATLNRSWTSGDTVTVRLPMRIGIRPANDN----ANVAAITYGPVV 587
Query: 628 LAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNST 672
L+G + T SL AL + ++ + FT S ST
Sbjct: 588 LSG------NYGDSTLSSLPALTTSSIKRTSSSSLAFTATSSGST 626
>gi|268316049|ref|YP_003289768.1| hypothetical protein Rmar_0478 [Rhodothermus marinus DSM 4252]
gi|262333583|gb|ACY47380.1| protein of unknown function DUF1680 [Rhodothermus marinus DSM 4252]
Length = 641
Score = 267 bits (683), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 180/528 (34%), Positives = 266/528 (50%), Gaps = 42/528 (7%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A Q ++ YL LD D L+ FR+ A L YGGWE+ + GH +GHYLSA + +
Sbjct: 56 AMQRDVAYLFELDPDRLLSRFRRFAGLEPKAPEYGGWES--QGISGHTLGHYLSALSMYY 113
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFE-ALKPV 225
A+T + + ++ +V L+E Q G GY+ A P E++ + +L
Sbjct: 114 AATGDEKARARIDYIVSELAEVQRAHGNGYVGAIPEGDRLWAEIARGEIWQAEPFSLNGA 173
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS-LNE 284
W P+YT+HKI GL+D Y + QAL++ T + ++ Y + + W L
Sbjct: 174 WVPWYTMHKIFQGLIDAYWYGGSEQALEVVTRLADWAYETTKNLTPA-----QWQQMLRT 228
Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
E GGMN+ L LYSIT +PKH L+ F L L+ L+ HANT IP VIG
Sbjct: 229 EHGGMNEALANLYSITGNPKHRELSEKFYHAAVLSPLSRGIPNLTGLHANTQIPKVIGVV 288
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
+YE+ G + + FF + V H+Y GG S E + LA+ LG ETC TY
Sbjct: 289 RQYELIGSDSLRAVAEFFWEEVVQHHTYVIGGNSQNEHFGPRDSLANRLGEGTAETCNTY 348
Query: 405 NMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH 463
NML+++RHLF E + Y D+YERAL N +L+ Q + G+ Y + L G K
Sbjct: 349 NMLRLTRHLFALHPEKVRYVDFYERALYNHILASQ-DPKRGMFTYYMSLRPGHFKT---- 403
Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
+ T +SFWCC GTG+E+ K + IYF N LY+ +I S +W+ + L +
Sbjct: 404 -YATPEHSFWCCVGTGMENHVKYNEFIYFY---NGDTLYVNLFIPSELNWERRALRLRLE 459
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNF 582
++ R+ L F EV Q + +R P W + +NG+ + PG++
Sbjct: 460 ----TAFPESNRVRLDFDP--EVPQRLVVKVRHPSWA-QDALDVRINGEVQSVTSRPGSY 512
Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
L+ W D++ I LP+ LR E + D+ + AIL+GP +LAG
Sbjct: 513 LTLARVWQPGDEVEITLPMRLRVETMPDNPDRF----AILYGPIVLAG 556
>gi|402080566|gb|EJT75711.1| hypothetical protein GGTG_05643 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 640
Score = 267 bits (683), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 186/536 (34%), Positives = 263/536 (49%), Gaps = 47/536 (8%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
Q L Y+ ++VD L+++FR + T G ++ GW+ P R HF GH+L+A AQ +
Sbjct: 67 QDRALTYIKSVNVDRLLYNFRANHRVSTNGAQSNKGWDAPDFPFRTHFQGHFLTAWAQCY 126
Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE--ALKPVWAPYY 230
A+ +AT ++ + V L++CQN GYLS FP D E L PYY
Sbjct: 127 ATLGDATCRDHANYFVAELAKCQNNNAAAGFKAGYLSGFPESEIDKVEQRTLSNGNVPYY 186
Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
IHK +AGLLD + + + QA L+MA W V S ++ L E
Sbjct: 187 AIHKTMAGLLDVWRVMGSTQARDVLLRMAGW--------VDTRTAALSYQQMQNMLGTEF 238
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGMN+VL ++ T D + + A FD LA D LS HANT +P IG+
Sbjct: 239 GGMNEVLADVFHQTGDARWIKTARRFDHAAVFDPLAQGQDRLSGLHANTQVPKWIGAARE 298
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
Y+ T + Y+ + + A+H+YA GG S E + P +A L + E C +YNM
Sbjct: 299 YKATKEERYRTVARAAWNFTVAAHTYAIGGNSQSEHFRSPNAIAGYLAKDTAEACNSYNM 358
Query: 407 LKVSRHLFRWTKE---IAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLG----RGVSK 458
LK++R L W + AY D+YERAL N +L Q + G + Y PL RGV
Sbjct: 359 LKLTREL--WLADPSAAAYFDFYERALLNHMLGQQDPRSAHGHVTYFTPLNPGGRRGVGP 416
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KSG 516
A + T ++SFWCC GTGIE+ +KL DSIYF + LY+ +ISSS W K G
Sbjct: 417 AWGGGTYSTDYDSFWCCQGTGIETNTKLMDSIYFRGRDDAT-LYVNLFISSSVKWTQKGG 475
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP- 575
VV P T T G +L +R+P W + A ++NGQ +
Sbjct: 476 VVVTQTTTFPKSD-------TTTLDVSGAGGGRWTLAVRVPSWV-AGQAVITVNGQAVQG 527
Query: 576 -LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
PG + S T W DK+ ++LP+ L T A DD + A+ +GP +L+G
Sbjct: 528 VSTAPGTYASITRDWQAGDKVVVRLPMRLYTIAANDD----MGLVAVAYGPAVLSG 579
>gi|21243263|ref|NP_642845.1| hypothetical protein XAC2530 [Xanthomonas axonopodis pv. citri str.
306]
gi|21108798|gb|AAM37381.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 791
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 189/569 (33%), Positives = 274/569 (48%), Gaps = 53/569 (9%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S L A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRLTPSLFL-DALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T + + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + +NAQAL++
Sbjct: 162 NAAGQIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCENAQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q V + L+ E GG+N+ L+ T D + L LA
Sbjct: 222 AVALAGY----LQGVFAALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
G+Y+ Y+ S+ +G LN + + + + + + +L L
Sbjct: 452 G---QGVYVNLYVPSTVRDAAG---LNMTLHSALPEQGSASLRIDGAPPAQ----RTLAL 501
Query: 555 RMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
R+P W Q LNGQ + +L T W D L++ + LR E+ DD P
Sbjct: 502 RVPGWAQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PA 558
Query: 615 YASIQAILFGPYLLA---GHTSGEWDIKT 640
+ S +L GP +LA G + W KT
Sbjct: 559 WVS---VLRGPLVLAVDLGDAAKPWSGKT 584
>gi|290954983|ref|YP_003486165.1| hypothetical protein SCAB_3871 [Streptomyces scabiei 87.22]
gi|260644509|emb|CBG67594.1| putative secreted protein [Streptomyces scabiei 87.22]
Length = 768
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 202/601 (33%), Positives = 289/601 (48%), Gaps = 68/601 (11%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
WLD Q YL +DVD L+++FR L T G A GGW+ P R H
Sbjct: 62 WLDN-------QDRTRNYLRFVDVDRLLYNFRANHRLSTAGAAATGGWDAPTFPFRTHVQ 114
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-----TGYLSAFPTELFDSFE- 220
GH+L+A AQ++A T + T ++K + +V L++CQ G TGYLS +P F + E
Sbjct: 115 GHFLTAWAQLYAVTGDTTCRDKATRMVAELAKCQANNGAAGFNTGYLSGYPESDFTALEQ 174
Query: 221 -ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRV--QKVITMY 273
L PYYTIHK LAGLLD + + QA L +A W V++ R+ Q++ M
Sbjct: 175 RTLSNGNVPYYTIHKTLAGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRLTGQQMQAM- 232
Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
L E GGMN VL LY T D + L A FD LA D LS HA
Sbjct: 233 --------LQTEFGGMNAVLTDLYQQTGDARWLTAARRFDHAAVFDPLASNQDRLSGLHA 284
Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
NT +P IG+ Y+ TG Y+ I T I A+H+YA GG S E + P +A L
Sbjct: 285 NTQVPKWIGAAREYKATGTTRYRDIATNAWSITVAAHTYAIGGNSQAEHFRAPNAIAGFL 344
Query: 394 GSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLP 451
+ E+C T+NML ++R LF A DYYERA N ++ Q + G + Y P
Sbjct: 345 NQDTCESCNTFNMLVLTRELFALDPNRAALFDYYERAWLNQMIGQQNPADDHGHVTYFTP 404
Query: 452 L----GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
L RGV A W T + +FWCC GTG+E ++L DS+Y+ + L + ++
Sbjct: 405 LRPGGRRGVGPAWGGGTWSTDYGTFWCCQGTGLEMHTRLMDSVYYRSDTT---LIVNMFV 461
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S W + + Q D LR+T VG ++ LR+P WT +GA
Sbjct: 462 PSVLTWSERGITVTQTTDYPAGDTTTLRVT------GSVGGTWAMRLRIPGWT--SGATI 513
Query: 568 SLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
S+NG + PG++ + T W+ D +T++LP+ + + + A+I AI +GP
Sbjct: 514 SVNGTAQDIATTPGSYATLTRSWTSGDTVTVRLPMRI----VMRAANDNANIAAITYGPV 569
Query: 627 LLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEE 686
+L+G SAL S PPS +T T +G+ F + + ++ +
Sbjct: 570 VLSGDYGD------------SALGS--PPSLKTSSITRT-STGSLAFTATANGSTVGLGP 614
Query: 687 F 687
F
Sbjct: 615 F 615
>gi|407923357|gb|EKG16430.1| Six-hairpin glycosidase-like protein [Macrophomina phaseolina MS6]
Length = 612
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 178/531 (33%), Positives = 269/531 (50%), Gaps = 39/531 (7%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFVGHYLSASAQMW 177
Q+ YL +D+D L++++R T L T G A GGW+ P R H GH+L+A Q W
Sbjct: 43 QERTRTYLKFVDLDRLLYNYRATHGLSTNGAASNGGWDAPDFPFRSHAQGHFLTAWVQCW 102
Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA--LKPVWAPYY 230
++T + +++ L +CQ GYLS FP FD+ E L PYY
Sbjct: 103 STTGDTECRDRAVQFTAELLKCQENNEAAGFTAGYLSGFPESEFDALEGRTLSNGNVPYY 162
Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
+HK++AGLLD + + A + + + R + I+ ++R L E GGM+
Sbjct: 163 VVHKLMAGLLDVWRGIGDLTARDVLLALAGWVDARTEN-ISYGDMQR---ILQTEFGGMS 218
Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
+VL +Y + D + L +A F+ L LA D L+ HANT +P IG+ Y+ T
Sbjct: 219 EVLADIYYQSGDSRWLTVAQRFEHAAVLTPLANNRDQLNGLHANTQVPKWIGAAREYKAT 278
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
G+ Y I DI +H+YA GG S E + P +A L ++ E+C +YNMLK++
Sbjct: 279 GNTTYYDIARNAWDITVRAHTYAIGGNSQAEHFRPPNAIAGYLTADTAESCNSYNMLKLT 338
Query: 411 RHLFRWTKE---IAYADYYERALTNGVLSIQRGTEP-GVMIY---MLPLG-RGVSKARST 462
R L WT E AY DYYER L N ++ Q +P G + Y + P G RGV A
Sbjct: 339 REL--WTTEPSSSAYFDYYERTLMNHLVGQQDPEDPHGHVTYFNSLQPGGVRGVGPAWGG 396
Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
W T ++SFWCC GTG+E+ +KL DSIYF +G+ LY+ + S DW+ V + Q
Sbjct: 397 GTWSTDYDSFWCCQGTGVETNTKLMDSIYF-RDGDSSALYVNLFAPSVLDWRQRAVTVTQ 455
Query: 523 KVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PG 580
P+ TL + ++ +R+P WT +GA+ +NG++ + PG
Sbjct: 456 TTSFPVTD-----NTTLQVAGAAGAWDMA---IRIPDWT--SGAEILVNGESANVAAEPG 505
Query: 581 NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
+ + + W+ D +T+ LP+ R DD SI A+ +GP +L G+
Sbjct: 506 TYATISRDWASGDTVTVTLPMGFRLVPANDD----TSIAALAYGPVILCGN 552
>gi|289668636|ref|ZP_06489711.1| putative secreted protein [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 793
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 200/630 (31%), Positives = 289/630 (45%), Gaps = 92/630 (14%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG+ ++ V L V L S+ A TN YL+ L D L+ +F A L AYGGW
Sbjct: 46 PGS-VRAVPLAQVRL-MPSLFLDALNTNRRYLMRLQPDRLLHNFVLYAGLDPQAPAYGGW 103
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A M A T +A + + +V L+ CQ G GY++ F +
Sbjct: 104 E--ADTIAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRK 161
Query: 215 -----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM 254
+FD + L WAP YT HK+ AGLLD + DN QAL++
Sbjct: 162 NAAGKIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNVQALQV 221
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A + Y +Q + + + L+ E GG+N+ L+ T D + L LA
Sbjct: 222 AVSLAGY----LQGIFSALDDAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHH 277
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYAT 374
L L Q D L H H+NT+IP +IG YEVTGD FF V H+Y
Sbjct: 278 HAVLDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVI 337
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG RE++ P ++ L + E C +YNMLK++RH+++W + DYYER L N V
Sbjct: 338 GGNGDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHVYQWGPQAELFDYYERTLLNHV 397
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
++ Q+ G+ YM PL G ++ GW + F+ FWCC G+G+E+ ++ GDSIY+++
Sbjct: 398 MA-QQHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGMEAHAQFGDSIYWQD 451
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS---- 550
G+YI Y+ S+ +G L MTL S+ E G S
Sbjct: 452 G---QGVYINLYVPSTVRDAAG-----------------LDMTL-HSALPEQGSASLRID 490
Query: 551 -------SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
+L LR+P W Q LNGQ + +L T W D L++ + L
Sbjct: 491 AAPPAQRTLALRVPGWVQQPHLQ--LNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPL 548
Query: 604 RTEAIQDDRPEYASIQAILFGPYLLA---GHTSGEWDIKTGT---ARSLSALISPIPPSF 657
R E DD P + S +L GP +LA G + W K+ + + + P+P
Sbjct: 549 RLETTPDD-PAWVS---VLRGPLVLAVDLGDAAKPWSGKSPALIGGQDILQRLQPVP--- 601
Query: 658 NAQLVTFTQESGNSTFVMSNSNQSITMEEF 687
G + F S+ Q + F
Sbjct: 602 -----------GKNAFTYSDGAQQWQLSPF 620
>gi|423223548|ref|ZP_17210017.1| hypothetical protein HMPREF1062_02203 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638305|gb|EIY32149.1| hypothetical protein HMPREF1062_02203 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 777
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 184/531 (34%), Positives = 259/531 (48%), Gaps = 39/531 (7%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A++ YLL L+ D + FR A L Y GWE+ + G +GHYLSA A +
Sbjct: 51 AEEKETAYLLELEPDRFLSGFRSEAGLVPKAPKYEGWES--LGVAGQTLGHYLSACAMYY 108
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVW 226
A++ + +++ + L CQ G GYL+A P +F A L W
Sbjct: 109 ATSGDERFLQRLEYTINELDSCQQANGDGYLAATPDGKRIFKEVSAGKIYSQGFDLNGGW 168
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
P Y +HK+LAGL+D Y A N +AL +A + + Y Q + + E+ L E
Sbjct: 169 VPLYVMHKVLAGLIDTYQYAHNERALAVAEKLANWMYGTFQHL----TEEQMQKVLACEF 224
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
GGMN+ L LY+ T + K L LA FD + LA+ D L HANT +P +IG+
Sbjct: 225 GGMNEALANLYACTKNEKFLALAQRFDNHKAIMDSLAVGVDDLEGKHANTQVPKIIGAAR 284
Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
YE+TG I +FF V +HSY GG S E + P +L + L + N ETC TYN
Sbjct: 285 LYELTGSKRDSAIASFFWHTVVQNHSYVNGGNSDGEHFGTPGQLNERLSTSNTETCNTYN 344
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
MLK++RHLF W Y+ YYERA+ N +L+ Q + G+ Y PL G K G+
Sbjct: 345 MLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGKK-----GY 398
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
+ F SF CC G+G+E+ K GD IY EG+ L++ +I S +W +++ Q D
Sbjct: 399 LSPFQSFCCCSGSGMENHVKYGDFIY--SEGSDSSLWVNLFIPSQLNWTDRKMIVTQDTD 456
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
I S D + LT K E Q LR P W S + +NG ++ N +
Sbjct: 457 -IPSSD---KTVLTV--KTEKPQSVIFRLRYPEWAES--MRIRVNGSSVSFEASNNSYVS 508
Query: 586 TER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
ER W NDK+ I + T ++ D+ I +GP LLAG E
Sbjct: 509 IEREWKDNDKIEITFKIKFYTVSMPDNEKRV----GIFYGPVLLAGELGTE 555
>gi|354583886|ref|ZP_09002783.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
gi|353197148|gb|EHB62641.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
Length = 778
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 185/547 (33%), Positives = 271/547 (49%), Gaps = 47/547 (8%)
Query: 102 VSLHDVWLDQ--------SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGG 153
V L D+W D + +Q+T YLL LDVD L+ + ASL YGG
Sbjct: 3 VKLVDLWGDDIMPKTELLEGIFKESQETGKGYLLHLDVDRLMAPCYEAASLEPKKPRYGG 62
Query: 154 WEN-PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP 212
WE PI+ GH +GH+LSA+A M +T + + +K+ V L+ Q+ GY+S FP
Sbjct: 63 WEETPIA---GHSIGHWLSAAAAMIDATSDEELLKKLVYAVNELAYVQSHDKDGYVSGFP 119
Query: 213 TELFD-----SFE----ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFY 263
+ FD FE +L W P+Y++HKI AGL+D Y L QAL++ + ++
Sbjct: 120 RDCFDIVFTGDFEVHNFSLAGSWVPWYSLHKIFAGLIDAYRLTGIEQALEVVIRLADW-- 177
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
+K + E+ L E GGMND + LY +T++ +L LA F L LA
Sbjct: 178 --AKKGTDRLTDEQFQRMLICEHGGMNDTMADLYRLTNNHAYLELAIRFCHRAILEPLAR 235
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
D L HANT IP VIG+ YE+TGD Y+ FF V + SY GG S E +
Sbjct: 236 GVDELEGKHANTQIPKVIGAAKLYEITGDDFYRKAAEFFWKEVTRNRSYIIGGNSIFEHF 295
Query: 384 WDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
+ + LG E ETC TYNMLK++ HLF W+++ Y D+YERAL N +L+ Q +
Sbjct: 296 RAANQ--EKLGVETAETCNTYNMLKLTDHLFGWSQDAEYMDFYERALYNHILASQ-DPDT 352
Query: 444 GVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
G+ +Y + G K +GT +SFWCC GTG+E+ ++ IY +Y+
Sbjct: 353 GMKMYFVSTEPGHFKV-----YGTAEHSFWCCTGTGMENPARYTHEIY---HATSNAIYV 404
Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
+I+S + VV+ Q+ + P T + + L +R+P WT +
Sbjct: 405 NLFIASKATFDDHQVVIRQETEF-----PKQSRTRLIIEEAKAAHF-KLRIRIPQWT-AG 457
Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
A +NG + +L+ W+ D + + LP+ LR +DD A IL+
Sbjct: 458 AVTAVVNGSEIYADAEPGYLNIERDWNAGDTIEVTLPMELRLYHAKDD----AKKVGILY 513
Query: 624 GPYLLAG 630
GP +LAG
Sbjct: 514 GPIVLAG 520
>gi|383644433|ref|ZP_09956839.1| hypothetical protein SeloA3_13744 [Sphingomonas elodea ATCC 31461]
Length = 746
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 188/571 (32%), Positives = 274/571 (47%), Gaps = 67/571 (11%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A + N LL L+ D L+ +FRK A L GK YGGWE+ + GH +GHYL+A MW
Sbjct: 14 AVEVNHRALLQLEPDRLLHNFRKYAGLEPKGKLYGGWES--DTIAGHTLGHYLTALVLMW 71
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA----------------FPTELFDSFEA 221
T + ++ + +V L+E Q K GTGY+ A FP + ++
Sbjct: 72 QQTGDPEMRRRADYIVAELAEAQAKRGTGYVGALGRKRKDGTIVDGEEIFPEIMRGEIKS 131
Query: 222 ----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
L W+P YT+HK+ AGLLD + NAQAL++ + YF +KV + +
Sbjct: 132 GGFDLNGSWSPLYTVHKVFAGLLDVHAGWGNAQALQVTLGLAGYF----EKVFAALNDAQ 187
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
L E GG+N+ LY+ T D + +++A LG L D L++FHANT +
Sbjct: 188 MQQMLGCEYGGLNESYAELYARTRDARWMVVAKRLYDDRVLGPLKAGEDKLANFHANTQV 247
Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
P +IG +E+TGD FF + V HSY GG + RE++ P +A + +
Sbjct: 248 PKLIGLARIHELTGDAGDATAARFFWERVTGHHSYVIGGNADREYFSAPDSIAQHITDQT 307
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
E C TYNMLK++ HLF W DYYERA N V++ Q + G YM PL G
Sbjct: 308 CEHCNTYNMLKLTSHLFAWQPNGVLFDYYERAHLNHVMAAQ-NPKTGGFTYMTPLMSGAE 366
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
+ S ++FWCC G+G+ES +K G++ +++ EG L + YI + DWK+
Sbjct: 367 RQYSQ----PNEDAFWCCIGSGLESHAKHGEAAFWQGEG---ALLVNLYIPAEIDWKA-- 417
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVWTYSNGAQASLNGQNLPL 576
QK ++ T T +Q ++ LR+P W A ++NG+
Sbjct: 418 ----QKAKLVLDTAYPFEGTATLKVEQLARAARFAIALRVPGWAEGK-AVVTVNGK---- 468
Query: 577 PPPGN------FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
PG+ + W +D + I LP++LR EA D S A+L GP +LAG
Sbjct: 469 --PGDAVFDRGYAIVARSWKRDDTIAISLPMALRLEAAPGDD----STVAVLRGPMVLAG 522
Query: 631 H---TSGEWDIK----TGTARSLSALISPIP 654
TS W+ GT L A +P P
Sbjct: 523 DLGPTSTPWNAGDPALVGT--DLLAAFTPAP 551
>gi|383779543|ref|YP_005464109.1| hypothetical protein AMIS_43730 [Actinoplanes missouriensis 431]
gi|381372775|dbj|BAL89593.1| hypothetical protein AMIS_43730 [Actinoplanes missouriensis 431]
Length = 799
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 180/537 (33%), Positives = 261/537 (48%), Gaps = 41/537 (7%)
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
+ YL +D+D ++ FR TA LP+ + GGWE P +LRGH GH LS AQ +
Sbjct: 61 VAYLRFVDLDRMLHMFRVTAGLPSAAEPLGGWEAPTVQLRGHTTGHLLSGLAQAAYHLDD 120
Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQ 242
+K + + +V L CQ GYLSAFP +FD EA K WAPYYTIHKI AGLLDQ
Sbjct: 121 RDLKARSAALVDGLKACQAP--NGYLSAFPETIFDQLEAGKNPWAPYYTIHKIFAGLLDQ 178
Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
+ L N AL +A M ++ +RV K+ + E+ L+ E GGMN+ LY +T +
Sbjct: 179 HRLLGNTTALDVARRMADWVGSRVSKL----TREQMQKVLHVEFGGMNESFVNLYRVTGE 234
Query: 303 PKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
HL LA FD L+ + D L+ HANT IP V+G+ Y+ TG ++ I T+F
Sbjct: 235 AAHLELARAFDHDEIFVPLSEKRDTLAGRHANTDIPKVVGAAAMYQATGSDYHRTIATYF 294
Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRW-TKEIA 421
D V HSY GG S EF+ P ++ LG E C TYNMLK++ L+
Sbjct: 295 WDQVVRHHSYVIGGNSNAEFFGPPGQVVSQLGENTCENCNTYNMLKLTERLYAIDPSRTD 354
Query: 422 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG----------WGTKFNS 471
Y DY+E AL N +L Q +P + G+S S G + + + +
Sbjct: 355 YLDYHEWALINQMLGEQ---DPDSAHGNVTYYTGLSSTASRKGKEGLVSDPGSYSSDYGN 411
Query: 472 FWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWD 531
F C +G+G+E+ +K + IY L + +I S ++ + +N
Sbjct: 412 FSCDHGSGLETHTKFAEPIYDTSRDT---LSVKLFIPSETTFRGAKIQINTMF------- 461
Query: 532 PYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSY 591
PY R T+ G +L +R+P W + +NG+ +P PG F + W
Sbjct: 462 PY-RETVRLRV-DGTGAPFTLRVRIPSWVRDPALR--VNGKPVPA-HPGRFATIRRVWRR 516
Query: 592 NDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH--TSGEWDIKTGTARSL 646
D +T+ LP R P+ ++ A+ +GP +LAG G + T R+L
Sbjct: 517 GDVVTLHLPFRTRWLPA----PDNPAVHALTYGPLVLAGRYGAQGPATLPTADPRTL 569
>gi|374992736|ref|YP_004968231.1| hypothetical protein SBI_09982 [Streptomyces bingchenggensis BCW-1]
gi|297163388|gb|ADI13100.1| hypothetical protein SBI_09982 [Streptomyces bingchenggensis BCW-1]
Length = 733
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 189/544 (34%), Positives = 274/544 (50%), Gaps = 51/544 (9%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
WLD + YL +D D L+++FR LPT G A GGW+ P R H
Sbjct: 18 WLDN-------ENRTRNYLRFVDADRLLYNFRANHRLPTNGAASNGGWDGPTFPFRTHVQ 70
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEA 221
GH+L+A AQ++A T + T ++K + +V L++CQ G GYLS FP F + EA
Sbjct: 71 GHFLTAWAQVYAVTGDTTCRDKAAYMVAELAKCQANNGAAGFNGGYLSGFPESDFSALEA 130
Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSV 275
L PYY IHKILAGLLD + + QA L +A W V++ R+ S
Sbjct: 131 GTLSNGNVPYYVIHKILAGLLDVWRHMGSTQARDMLLSLAGW-VDWRTGRL-------SG 182
Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
++ +L E GGMN VL LY T D + L A FD LA D L+ HANT
Sbjct: 183 QQMQSTLGTEFGGMNAVLSDLYLQTSDSRWLTTAQRFDHGAVFDPLASNQDRLNGLHANT 242
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+P IG+ Y+ TG Y+ I T +I +H+Y GG S E + P +A L
Sbjct: 243 QVPKWIGAAREYKATGTTRYRDIATNAWNICVNAHTYVIGGNSQAEHFRPPNAIAAYLNQ 302
Query: 396 ENEETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG 453
+ E+C TYNML ++R LF + +A DYYERA N ++ Q + G + Y PL
Sbjct: 303 DACESCNTYNMLTLTRELFTLDPDRVALFDYYERAWLNQMIGQQNPADNHGHVTYFTPLN 362
Query: 454 ----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
RGV A W T ++SFWCC GTG+E +KL DS+YF + L + ++ S
Sbjct: 363 PGGRRGVGPAWGGGTWSTDYDSFWCCQGTGLEMHTKLMDSVYFSSD---TTLIVNLFVPS 419
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
+W + + Q VS L++T S ++ +R+P WT GA S+
Sbjct: 420 VLNWSQRGITVTQTTSYPVSDTTTLQVTGNLSGTW------AMRIRIPSWTA--GATISV 471
Query: 570 NG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
NG QN+ PG++ + T W+ D +T++LP+ + I + A++ A+ +GP +
Sbjct: 472 NGTTQNIT-TTPGSYATLTRSWTSGDTVTVRLPMRI----IMRAANDNANVAAVTYGPVV 526
Query: 628 LAGH 631
L+G+
Sbjct: 527 LSGN 530
>gi|339021543|ref|ZP_08645591.1| hypothetical protein ATPR_1899 [Acetobacter tropicalis NBRC 101654]
gi|338751393|dbj|GAA08895.1| hypothetical protein ATPR_1899 [Acetobacter tropicalis NBRC 101654]
Length = 799
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 192/572 (33%), Positives = 278/572 (48%), Gaps = 67/572 (11%)
Query: 88 NPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTP 147
PGG G + V L DV L S L A ++N YLL L D L+ +FR+ A LP
Sbjct: 34 GPGGVG-AGESVTPVPLQDVRLLPSHWL-DAVESNRAYLLSLSADRLLHNFRRQAGLPPK 91
Query: 148 GKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGY 207
G+ YGGWEN + GH +GHYLSA A M+A T + + +++ +V L+ Q+K G GY
Sbjct: 92 GEVYGGWEN--DTIAGHTLGHYLSALALMYAQTGDTECRRRVAYIVQELAIVQDKWGDGY 149
Query: 208 LSAFPTE-----------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLAD 247
++ F + +F E L W+P Y IHK AGL D
Sbjct: 150 VAGFTRKEKDGTITDGKVIFAEMEKGDIRSGGFDLNGAWSPLYNIHKTFAGLFDAQTYCQ 209
Query: 248 NAQALKMATWM---VEYFYNRV-----QKVITMYSVERHWYSLNEETGGMNDVLYRLYSI 299
+ AL +A + E FY+++ QKV+T E GG+N+ L +
Sbjct: 210 DPNALAVAVKLGGFFEAFYSKLTDAQLQKVLTC------------EYGGLNESFAELAAR 257
Query: 300 THDPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLI 358
T D K L LA +D+P +A D L++ HANT IP +IG EV+ D +++
Sbjct: 258 TGDAKWLRLAKRTYDRPVLDPLMARHDD-LANRHANTQIPKLIGLGRIAEVSRDAHWQVG 316
Query: 359 GTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTK 418
FF V HSY GG + RE++ +P ++ + + E C TYNMLK++R L+ W
Sbjct: 317 PRFFWQAVTQHHSYVIGGNADREYFSEPDTISQHITEQTCEHCNTYNMLKLTRQLYTWQP 376
Query: 419 EIAYADYYERALTNGVLSIQRGTEPGVMIYMLP-LGRGVSKARSTHGWGTKFNSFWCCYG 477
+ A DYYERA N VL+ + G+ YM P + GV + W T +SFWCC G
Sbjct: 377 DSALFDYYERAHLNHVLAAH-DPQTGMFTYMTPTITAGVRE------WSTPTDSFWCCVG 429
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
TG+ES +K G+SI++E L++ YI S W +V K PY
Sbjct: 430 TGMESHAKHGESIWWE---GAETLFVNLYIPSRVQWARKNVSWRMKTR-----YPYDGQV 481
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTI 597
+ + +L LR+P W + ++NGQ++ P G +L W D + +
Sbjct: 482 TLKVEDVKAPEPFALALRVPGWVKGD-LSLTVNGQSVSATPSGGYLMLNRTWHAGDTVAL 540
Query: 598 QLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
LPL+LRTEA E + ++L GP +LA
Sbjct: 541 TLPLALRTEAPV----EAPHLVSLLHGPMVLA 568
>gi|285018715|ref|YP_003376426.1| hypothetical protein XALc_1948 [Xanthomonas albilineans GPE PC73]
gi|283473933|emb|CBA16434.1| conserved hypothetical protein [Xanthomonas albilineans GPE PC73]
Length = 810
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 204/647 (31%), Positives = 308/647 (47%), Gaps = 61/647 (9%)
Query: 57 DDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLW 116
D +A ++L PS+ Q ++ A + PG ++ + L V L + S+
Sbjct: 21 DHAAGAALDPSRRRFLQWSALAMAAGLLRFPQDAAASTPGR-VQALPLRQVTL-KPSLFL 78
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
+ QTN YLL L+ D L+ +F + A LP G YGGWE + GH +GHYLSA ++M
Sbjct: 79 DSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGAVYGGWEG--DTIAGHTLGHYLSALSKM 136
Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDS--FEALKPV--------- 225
A T +++++ ++ +V L+ Q + GY+ F T D+ E K V
Sbjct: 137 HAQTRDSSLRTRIDYIVAELARAQAQDPDGYVGGF-TRKNDNGKIEGGKAVLEDLRRGII 195
Query: 226 ----------WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSV 275
W+P YT HK+ AGLLD + L NAQAL + + YF V
Sbjct: 196 KGGKFNLNGSWSPLYTQHKLFAGLLDAHALGGNAQALTVLVKVAGYFAG----VFDALDH 251
Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
+ L+ E GG+N+ L + T + + + + LA D L H HANT
Sbjct: 252 AQMQTLLDTEFGGLNESFIELGARTGQERWIAIGKRLRHEKIIDPLAAGHDVLPHIHANT 311
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+P IG ++EV GD FF + V A +SY GG S RE++ +P +A L
Sbjct: 312 QVPKFIGEARQFEVAGDADAAAAARFFWETVTAHYSYVIGGNSDREYFQEPDSIAGFLTE 371
Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
+ E C +YNMLK++RHL++WT + Y DYYER L N ++ Q G+ YM P+ G
Sbjct: 372 QTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAAQHPAT-GMFTYMTPMISG 430
Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
+ G+ KF+SFWCC G+G+E+ ++ GD+IY+++E LY+ YI S DW
Sbjct: 431 GER-----GFSEKFDSFWCCVGSGMEAHAQFGDAIYWQDEA---ALYVNLYIPSRLDWSE 482
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
+ L ++D V + +R+ + + + +L LR+P W + LNG+ L
Sbjct: 483 RDLAL--ELDSGVPENGKVRLQVLRAGARAPRRLL---LRVPAWCQGS-YTLRLNGKPLR 536
Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA---GHT 632
P +L+ W D + ++L LR E D PE ++ GP LA G
Sbjct: 537 RTPIDGYLALERDWRSGDVIELELATPLRLEHAAGD-PESV---VVMRGPLALAADLGPV 592
Query: 633 SGEWDIK----TGTARSLSALIS-PIPPSFNAQLVTFTQESGNSTFV 674
S +D TA L+ + P P F L + TQ G TFV
Sbjct: 593 STPYDAPDPALVATADPLAGFVELPQPGHF---LASDTQPPG-LTFV 635
>gi|224536588|ref|ZP_03677127.1| hypothetical protein BACCELL_01463 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521844|gb|EEF90949.1| hypothetical protein BACCELL_01463 [Bacteroides cellulosilyticus
DSM 14838]
Length = 777
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 183/526 (34%), Positives = 258/526 (49%), Gaps = 39/526 (7%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A++ YLL L+ D + FR A L Y GWE+ + G +GHYLSA A +
Sbjct: 51 AEEKETAYLLELEPDRFLSGFRSEAGLVPKAPKYEGWES--LGVAGQTLGHYLSACAMYY 108
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVW 226
A++ + +++ + L CQ G GYL+A P +F A L W
Sbjct: 109 ATSGDERFLQRLEYTINELDSCQQANGDGYLAATPDGKRIFKEVSAGKIYSQGFDLNGGW 168
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
P Y +HK+LAGL+D Y A N +AL +A + + Y Q + + E+ L E
Sbjct: 169 VPLYVMHKVLAGLIDTYQYAHNERALVVAEKLANWMYGTFQHL----TEEQMQKVLACEF 224
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
GGMN+ L LY+ T + K L LA FD + LA+ D L HANT +P +IG+
Sbjct: 225 GGMNEALANLYACTKNEKFLALAQRFDNHKAIMDSLAVGVDDLEGKHANTQVPKIIGAAR 284
Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
YE+TG I +FF V +HSY GG S E + P +L + L + N ETC TYN
Sbjct: 285 LYELTGSKRDSAIASFFWHTVVQNHSYVNGGNSDGEHFGTPGQLNERLSTSNTETCNTYN 344
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
MLK++RHLF W Y+ YYERA+ N +L+ Q + G+ Y PL G K G+
Sbjct: 345 MLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGKK-----GY 398
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
+ F SF CC G+G+E+ K GD IY EG+ L++ +I S +W +++ Q D
Sbjct: 399 LSPFQSFCCCSGSGMENHVKYGDFIY--SEGSDSSLWVNLFIPSQLNWTDRKMIVTQDTD 456
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
I S D + LT K E Q LR P W S + +NG ++ N +
Sbjct: 457 -IPSSD---KTVLTV--KTEKSQSVIFRLRYPEWAES--MRIKVNGSSVSFEASNNSYVS 508
Query: 586 TER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
ER W NDK+ I + T ++ D+ I +GP LLAG
Sbjct: 509 IEREWKDNDKIEITFKIKFYTVSMPDNEKRV----GIFYGPVLLAG 550
>gi|86196151|gb|EAQ70789.1| hypothetical protein MGCH7_ch7g196 [Magnaporthe oryzae 70-15]
gi|440463815|gb|ELQ33359.1| hypothetical protein OOU_Y34scaffold00969g44 [Magnaporthe oryzae
Y34]
gi|440485206|gb|ELQ65183.1| hypothetical protein OOW_P131scaffold00516g8 [Magnaporthe oryzae
P131]
Length = 633
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 183/535 (34%), Positives = 262/535 (48%), Gaps = 40/535 (7%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
Q L Y+ +D++ L+++FR + T G +A GGW+ P R H GH+L+A A +
Sbjct: 53 QDRTLTYIKFVDLNRLLYNFRANHGVSTNGAQANGGWDAPDFPFRSHIQGHFLTAWANCY 112
Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE--ALKPVWAPYY 230
A + + + V L++CQ+ GYLS FP + E L PYY
Sbjct: 113 AVLKDQECRSRAEQFVEELAKCQDNNAAAGFQAGYLSGFPESDITAVEQRTLTNGNVPYY 172
Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
IHK +AGLLD + + +A + M + R ++ S + + E GGM+
Sbjct: 173 AIHKTMAGLLDVWRNVGSTKAKDVLVKMAGWVDTRTARL----SYAQMQSMMGTEFGGMS 228
Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
+VL ++ T D + L +A FD L LA D L HANT +P IG+ Y+ T
Sbjct: 229 EVLADMFHQTGDERWLTVARRFDHAAVLDPLARSQDSLDGLHANTQVPKWIGAAREYKAT 288
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
D Y I D +H+YA GG S E + P +A L + E C TYNMLK++
Sbjct: 289 KDQRYLDIARNAWDFTVEAHTYAIGGNSQSEHFRPPNAIAGYLLHDTAEACNTYNMLKLT 348
Query: 411 RHLFR-----WTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLG----RGVSKAR 460
R LF + A D+YERAL N +L Q G G + Y PL RGV A
Sbjct: 349 RELFMHDAAPGMNDTAKFDFYERALLNHLLGQQDPGDGHGHVTYFTPLNPGGRRGVGPAW 408
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KSGHV 518
W T + SFWCC GTGIE+ +KL DSIYF N LY+ +I SS W + G V
Sbjct: 409 GGGTWSTDYESFWCCQGTGIETNTKLMDSIYFRSRDN-NALYVNLFIPSSVQWSDRDGVV 467
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP--- 575
V + P+ TLT S G +L++R+P W + GA+ S+NGQ +
Sbjct: 468 VTQETEFPLGD-----ATTLTVSGAG--GGRWTLSVRIPSWV-AGGAEVSVNGQKVGGDV 519
Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
PG + + T W+ DK+T++LP+ L T A DD ++ A+ +GP +L+G
Sbjct: 520 RTTPGGYAAITREWAVGDKVTVRLPMKLHTVAANDD----PTLVALAYGPAILSG 570
>gi|169596765|ref|XP_001791806.1| hypothetical protein SNOG_01152 [Phaeosphaeria nodorum SN15]
gi|111069681|gb|EAT90801.1| hypothetical protein SNOG_01152 [Phaeosphaeria nodorum SN15]
Length = 620
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 192/557 (34%), Positives = 278/557 (49%), Gaps = 60/557 (10%)
Query: 99 LKEVSLHDV-WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWEN 156
L +VSL + W D + L YL ++VD L+++FR T L T G + GGW+
Sbjct: 39 LSQVSLSNSRWKDN-------ENRTLNYLKAVNVDRLLYNFRATHKLSTNGAQPNGGWDA 91
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-----TGYLSAF 211
P R H GHYL+A +A+ + K + S V L++CQ G TGYLS F
Sbjct: 92 PNFPFRSHAQGHYLTAWVHCYATLRDNECKNRASYFVQELAKCQANNGAAQFSTGYLSGF 151
Query: 212 PTELFDSFEA--LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
P F + EA LK PYY +HK +AGLLD + + + +A + + + R +K+
Sbjct: 152 PESEFVALEAGQLKGGNVPYYAVHKTMAGLLDAWRIIGDTKARDVLLALAGWVDGRTKKL 211
Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
S + L E GGMNDVL +Y +T + + L +A FD LA D LS
Sbjct: 212 ----SSSQMQTMLGTEFGGMNDVLAAIYQLTGNQQWLTVAQRFDHASQFDPLANNQDRLS 267
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
HANT +P IG+ Y+ TG Y I D +H+YA GG S E + P ++
Sbjct: 268 GNHANTQVPKWIGAAREYKSTGTKRYLDIAKNAWDFTINAHTYAIGGNSQAEHFRPPNQI 327
Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKE---IAYADYYERALTNGVLSIQRGTEP-GV 445
++ L ++ E C TYNMLK++R L WT + Y DYYERAL N +L Q T+ G
Sbjct: 328 SNFLTNDTAEQCNTYNMLKLTRDL--WTTDPSSTKYFDYYERALINHLLGAQNPTDNHGH 385
Query: 446 MIYMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
+ Y PL RG+ A W T +NSFWCC GT +E+ +KL DSIYF + L
Sbjct: 386 ITYFTPLKSGGRRGIGPAWGGGTWSTDYNSFWCCQGTALETNTKLMDSIYFYDSS---AL 442
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS------SLNLR 555
Y+ + S+ DWK V ++Q TF + ++ +R
Sbjct: 443 YVNLFTPSTLDWKQRSVKISQ--------------VTTFPASDTTTLTVTGTGNWAMKIR 488
Query: 556 MPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
+P WT +GA S+N Q + PG++ + + W D +T++LP+ LRT A +
Sbjct: 489 IPSWT--SGATISINRQASGVAANPGSYATLSRDWKSGDIVTVKLPMKLRTVAAN----D 542
Query: 615 YASIQAILFGPYLLAGH 631
A+I A+ FGP +L+G+
Sbjct: 543 NANIAAVAFGPVILSGN 559
>gi|389647349|ref|XP_003721306.1| hypothetical protein MGG_09030 [Magnaporthe oryzae 70-15]
gi|351638698|gb|EHA46563.1| hypothetical protein MGG_09030 [Magnaporthe oryzae 70-15]
Length = 680
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 183/535 (34%), Positives = 262/535 (48%), Gaps = 40/535 (7%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
Q L Y+ +D++ L+++FR + T G +A GGW+ P R H GH+L+A A +
Sbjct: 100 QDRTLTYIKFVDLNRLLYNFRANHGVSTNGAQANGGWDAPDFPFRSHIQGHFLTAWANCY 159
Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE--ALKPVWAPYY 230
A + + + V L++CQ+ GYLS FP + E L PYY
Sbjct: 160 AVLKDQECRSRAEQFVEELAKCQDNNAAAGFQAGYLSGFPESDITAVEQRTLTNGNVPYY 219
Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
IHK +AGLLD + + +A + M + R ++ S + + E GGM+
Sbjct: 220 AIHKTMAGLLDVWRNVGSTKAKDVLVKMAGWVDTRTARL----SYAQMQSMMGTEFGGMS 275
Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
+VL ++ T D + L +A FD L LA D L HANT +P IG+ Y+ T
Sbjct: 276 EVLADMFHQTGDERWLTVARRFDHAAVLDPLARSQDSLDGLHANTQVPKWIGAAREYKAT 335
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
D Y I D +H+YA GG S E + P +A L + E C TYNMLK++
Sbjct: 336 KDQRYLDIARNAWDFTVEAHTYAIGGNSQSEHFRPPNAIAGYLLHDTAEACNTYNMLKLT 395
Query: 411 RHLFR-----WTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLG----RGVSKAR 460
R LF + A D+YERAL N +L Q G G + Y PL RGV A
Sbjct: 396 RELFMHDAAPGMNDTAKFDFYERALLNHLLGQQDPGDGHGHVTYFTPLNPGGRRGVGPAW 455
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KSGHV 518
W T + SFWCC GTGIE+ +KL DSIYF N LY+ +I SS W + G V
Sbjct: 456 GGGTWSTDYESFWCCQGTGIETNTKLMDSIYFRSRDN-NALYVNLFIPSSVQWSDRDGVV 514
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP--- 575
V + P+ TLT S G +L++R+P W + GA+ S+NGQ +
Sbjct: 515 VTQETEFPLGD-----ATTLTVSGAG--GGRWTLSVRIPSWV-AGGAEVSVNGQKVGGDV 566
Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
PG + + T W+ DK+T++LP+ L T A DD ++ A+ +GP +L+G
Sbjct: 567 RTTPGGYAAITREWAVGDKVTVRLPMKLHTVAANDD----PTLVALAYGPAILSG 617
>gi|114047478|ref|YP_738028.1| hypothetical protein Shewmr7_1982 [Shewanella sp. MR-7]
gi|113888920|gb|ABI42971.1| protein of unknown function DUF1680 [Shewanella sp. MR-7]
Length = 795
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 178/554 (32%), Positives = 276/554 (49%), Gaps = 54/554 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L + L+DV L L AQQT+L Y++ +D + L+ +RK A + T Y WEN
Sbjct: 28 LTPIPLNDVRLTAGPFL-HAQQTDLAYIMSMDPERLLAPYRKEAGIATTADNYPNWEN-- 84
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
+ L GH GHYLSA A M+A+T + + E+++ +V L +CQ G GY+ P +L+
Sbjct: 85 TGLDGHIGGHYLSALALMYAATGDQAVLERLNYMVAELEKCQQAHGNGYVGGVPHGDKLW 144
Query: 217 DSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFY 263
A L W P+Y +HK+ AGL D Y+ N A KM A WM++
Sbjct: 145 QQVAAGHIEADLFTLNQSWVPWYNLHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDLSR 204
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
N + + + L E GG+N+ L +YSIT K+L LA+ + L L
Sbjct: 205 NLTDEQLQLM--------LRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQ 256
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
D L+ HANT IP ++G E++ + + +F V + + GG S RE +
Sbjct: 257 HQDKLTRLHANTQIPKIVGVARIAELSHNKAWLESADYFWQQVVHQRTVSIGGNSVREHF 316
Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
+ + L S E ETC TYNMLK+S+ L+ +++ Y DYYERAL N +LS Q +
Sbjct: 317 HPSEDFSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-PQ 375
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
G ++Y P+ + + + S WCC G+GIE+ +K G+ IY EE+ N L+
Sbjct: 376 TGGLVYFTPM-----RPDHYRVYSSAQESMWCCVGSGIENHAKYGELIYAEEDNN---LF 427
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS--SLNLRMPVWT 560
+ ++ S +WK+ + L+QK + +S+ + Q + +LNLR P W
Sbjct: 428 VNLFVDSEVNWKAKGISLSQKT----------QFPDDNTSQMIIHQEADFTLNLRYPTWA 477
Query: 561 YSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
+ S+NG+ P G ++ T W D +TI LP+ + E + D Y
Sbjct: 478 KGD-VTVSINGEPQRFTPTQGQYIPLTRHWRKGDSVTITLPMDISLEQLPDKTAYY---- 532
Query: 620 AILFGPYLLAGHTS 633
++L+GP +LA T+
Sbjct: 533 SVLYGPIVLAAKTA 546
>gi|325106128|ref|YP_004275782.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324974976|gb|ADY53960.1| protein of unknown function DUF1680 [Pedobacter saltans DSM 12145]
Length = 782
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 194/632 (30%), Positives = 285/632 (45%), Gaps = 67/632 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L+EV L D + A+Q +L+Y+L +D+D L+ + + A L K+YG WEN
Sbjct: 32 LQEVKLLD------GIFKNAEQVDLKYILSMDMDKLLAPYLREAGLSEKAKSYGNWEN-- 83
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
S L GH GHYLSA + M+AST N I +++ + L CQ+ G GYL P
Sbjct: 84 SGLDGHIGGHYLSALSLMYASTKNPDINKRIDYYLSELKRCQDANGDGYLGGVPDGKAMW 143
Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
++ + +L W P Y IHK+ AGL D +V N A +K+ W F
Sbjct: 144 RDISDGKIDAATFSLNKKWVPLYNIHKVFAGLYDAWVYTGNNTAKDMFIKLCDWATTTFG 203
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
N ++ I L E GG+N+ Y +T K++ LA F L L
Sbjct: 204 NLNEQQIQQM--------LKSEHGGINESFADAYKLTGQQKYMDLALKFSHKAILDPLRN 255
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
Q D L+ HANT IP VIG + E+ + TFF D V + A GG S RE +
Sbjct: 256 QEDKLTGIHANTQIPKVIGFEKISEIEHKDDWHKAATFFWDNVVYKRTVAIGGNSVREHF 315
Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
+ E ETC TYNM+K+S+ L+ + E Y DY E+AL N +LS Q E
Sbjct: 316 HPINNFMPMIEDIEGPETCNTYNMIKLSKALYNQSGETKYIDYIEKALYNHILSSQH-PE 374
Query: 443 PGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
G +Y P+ R H + S WCC G+G+E+ +K G+ IY N
Sbjct: 375 KGGFVYFTPM-------RPNHYRVYSQPETSMWCCVGSGLENHAKYGEFIYAH---NDKD 424
Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
L++ +I S DWK + + Q + + +++T + + ++N+R+P W
Sbjct: 425 LFVNLFIPSELDWKEKKIKITQTTNFPEEGNTSIKLT------EIKNENFNINIRIPNWA 478
Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
N +NG+ + G +++ ++W D++ I LPLS R E + D P YAS
Sbjct: 479 SENDISVKINGKQIQPIVEGKYITLNKKWKKGDEINIDLPLSNRIEQMPDGLP-YAS--- 534
Query: 621 ILFGPYLLAGHT------------SGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQES 668
I +GP LLA T S I G LS I + L T++S
Sbjct: 535 IFYGPILLAAKTDTIDLKGLFADDSRGGHIAKGKQLPLSTAPQFIVEKKDDILKNLTKQS 594
Query: 669 GNSTFVMSNSNQSITMEEFPVSGTDAALHATF 700
N F +N S +E P +A +
Sbjct: 595 NNLIFKSANIKYSKNLELVPFYKVHDTRYAVY 626
>gi|329847073|ref|ZP_08262101.1| hypothetical protein ABI_01350 [Asticcacaulis biprosthecum C19]
gi|328842136|gb|EGF91705.1| hypothetical protein ABI_01350 [Asticcacaulis biprosthecum C19]
Length = 800
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 183/566 (32%), Positives = 277/566 (48%), Gaps = 51/566 (9%)
Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
V L DV L S L A + N +YL+ L D ++ ++ K A LP G+ YGGWE+ +
Sbjct: 46 VPLSDVRLLPSPFL-TAVEANTKYLMFLSPDRMLHNYHKFAGLPVKGEIYGGWES--DTI 102
Query: 162 RGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF---------- 211
G +GHYLSA + ++A T +A + ++ ++ L++ Q G GY + F
Sbjct: 103 AGEALGHYLSALSLLYAQTGHAEARTRIEYIIAELAKVQAAHGDGYAAGFMRKRKDASIV 162
Query: 212 -PTELFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEY 261
E+F A L W P+Y HK+ AGL+D A + +A + Y
Sbjct: 163 DGKEIFAEIMAGDIRSAGFDLNGCWVPFYNWHKLFAGLMDAQTYAGIDAGIPVAVALGGY 222
Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
++KV + E+ L+ E GG+N+ LY+ T DP+ L LA L L
Sbjct: 223 ----IEKVFAALNDEQVQKVLDCEHGGINESFAELYTRTKDPRWLALAERIYHHRILDPL 278
Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
D L++ HANT +P ++G YE+TG P Y+ +FF D V HS+A GG + RE
Sbjct: 279 TAGEDKLANNHANTQVPKLVGLARLYEITGKPGYRKASSFFWDRVVNHHSFAIGGNADRE 338
Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
++++P +A + + E+C TYNMLK++RHL+ WT A+ DYYERA N +++ Q
Sbjct: 339 YFFEPDTIAKHITEQTCESCNTYNMLKLTRHLYAWTPNAAWFDYYERAHLNHIMAHQN-P 397
Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
E G+ YM+PL G + S T +SFWCC +GIES SK GDSIY++ + L
Sbjct: 398 ETGMFAYMVPLMSGTGREYS-----TPEDSFWCCVLSGIESHSKHGDSIYWQSDDT---L 449
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVWT 560
++ +I S W N+ + + PY + F Q G + ++ +R+P W
Sbjct: 450 FVNLFIPSKLTW-------NKAAFELTTQYPY-DSRVAFKVTQSSGAKAFTVAVRIPGWA 501
Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
S+ +NG+ + W D +T+ LPL LR E D + A
Sbjct: 502 KSH--TLLVNGKPALAAIDKGYALIRRTWKAGDVVTLDLPLELRFEGTAGDD----KVVA 555
Query: 621 ILFGPYLLAGHTSGEWDIKTGTARSL 646
+L GP +LA D G A +L
Sbjct: 556 LLRGPMVLAADLGAIEDSWQGDAPAL 581
>gi|313204495|ref|YP_004043152.1| hypothetical protein Palpr_2030 [Paludibacter propionicigenes WB4]
gi|312443811|gb|ADQ80167.1| protein of unknown function DUF1680 [Paludibacter propionicigenes
WB4]
Length = 788
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 177/539 (32%), Positives = 268/539 (49%), Gaps = 38/539 (7%)
Query: 106 DVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHF 165
DV L +S A+ ++ YLL LD D L+ + K L + Y WEN + L GH
Sbjct: 38 DVRLTESP-FKHAEDMDINYLLGLDADRLMAPYLKGGGLTPKAENYPNWEN--TGLDGHI 94
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE--- 220
GHYLSA + M+A+T N IKE++ + L Q+ G GYL P +++D +
Sbjct: 95 GGHYLSALSYMYAATGNTRIKERLDYSLNELKRAQDAAGDGYLGGTPNGRKIWDEIKKGT 154
Query: 221 ------ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYS 274
L W P Y IHK AGL D Y+ + A M + ++ YN V +
Sbjct: 155 INASSFGLNGGWVPLYNIHKTYAGLRDAYLQGGSLLAKDMLIKLTDWMYNTVSGLTDAQV 214
Query: 275 VERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHAN 334
E L E GG+N+V + SIT + K+L LAH F L L D L+ HAN
Sbjct: 215 QEM----LKSEHGGLNEVFADVASITGNKKYLELAHKFSHQTLLQLLLQHQDKLTGMHAN 270
Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
T IP VIG + ++ G+ + +FF V + S + GG S RE +
Sbjct: 271 TQIPKVIGFKRIADLEGNKDWSDAASFFWKTVVDNRSVSIGGNSVREHFHPSDNFTSMFE 330
Query: 395 SEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG 453
SE ETC TYNML++++ LF+ + E ++ DYYERAL N +LS Q + G +Y P+
Sbjct: 331 SEQGPETCNTYNMLRLTKLLFQTSGEASFMDYYERALYNHILSTQDPIQGG-FVYFTPMR 389
Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
G + + SFWCC G+G+E+ ++ G+ IY ++ + LY+ +I S W
Sbjct: 390 AGHYRV-----YSQPQTSFWCCVGSGLENHARYGEMIYGFKDND---LYVNLFIPSVLTW 441
Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
K+ ++ + Q+ + + + +K+ L +L++R P W N + S+NGQ+
Sbjct: 442 KAKNIRIEQQNN----FAKQEAADIIVDAKKTA--LFTLHIRKPEWVKDNDLKVSVNGQS 495
Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
P+ +LS T WS DK+ ++LP+ LR D+ EY + L+GPY+LA T
Sbjct: 496 TPVTIKDGYLSITRNWSKGDKVHLELPMQLRAVTTPDNAQEY----SFLYGPYVLAAKT 550
>gi|325679069|ref|ZP_08158663.1| hypothetical protein CUS_6624 [Ruminococcus albus 8]
gi|324109193|gb|EGC03415.1| hypothetical protein CUS_6624 [Ruminococcus albus 8]
Length = 791
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 193/538 (35%), Positives = 270/538 (50%), Gaps = 62/538 (11%)
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPISELRGHFVGHYLSASAQMWASTH 181
+ YLL D D L+ FR+TA L G Y GWE+ + + GH VGHY++A AQ +AS
Sbjct: 29 IAYLLSFDTDRLLAGFRETAGLDMRGAVRYSGWEDDL--IGGHCVGHYMTAVAQAYASLQ 86
Query: 182 NATIKE----KMS-TVVFSLSECQNKIGTGYLSAFPTEL---------FDSFEA-----L 222
+ K++ T L ECQ +GTG++ F ++ FD+ E +
Sbjct: 87 EGDSRRDALYKLAVTTTDGLKECQQALGTGFI--FGAKIIDKNNVEAQFDNVEKNLSNIM 144
Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
W PYYT+HKILAG +D Y L A +A+ + ++ Y RV + +S E L
Sbjct: 145 TQAWVPYYTLHKILAGAIDIYRLTGYENAKTVASRLGDWVYRRVSR----WSEETQRTVL 200
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLGFLALQADYLSHFHANTHIPIVI 341
E GGMND LY LY++T +H + AH FD+ P F A + L++ HANT IP +
Sbjct: 201 GIEYGGMNDCLYELYAVTGKEEHAIAAHCFDEVPLFENVYAGTENALNNKHANTTIPKFL 260
Query: 342 GSQMRYE------VTGDPL----YKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
G+ RY V G+ + Y F D+V HSY TGG S E + L
Sbjct: 261 GALKRYAILDGRTVNGETVDAGRYLGYAERFWDMVVQKHSYITGGNSEWEHFGCDYVLDA 320
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
+ N ETC TYNMLK+SR LF T E YADYYE N +LS Q E G+ Y P
Sbjct: 321 ERTNANCETCNTYNMLKLSRLLFEITGEKKYADYYENTFINAILSSQN-PETGMSTYFQP 379
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
+ G K S T + FWCC G+G+E+F+KLGDSIYF EGN L + QYISSS
Sbjct: 380 MASGYFKVYS-----TPYTKFWCCTGSGMENFTKLGDSIYF-TEGNA--LIVNQYISSSA 431
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
+W V + Q D I + D T F + G SL LR+P W + A +++G
Sbjct: 432 EWSEKGVKVEQMTD-IPNSD-----TAKFMIHGKGG--ISLKLRLPDWLAGD-AVITVDG 482
Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
+ G + + + + I+LP+ +R ++ D++ Y +GP +L+
Sbjct: 483 KAYDADINGGYAEVS-GIADGSVVEIKLPMEVRAHSLPDNKNTY----GFRYGPIVLS 535
>gi|379719928|ref|YP_005312059.1| hypothetical protein PM3016_2010 [Paenibacillus mucilaginosus 3016]
gi|378568600|gb|AFC28910.1| hypothetical protein PM3016_2010 [Paenibacillus mucilaginosus 3016]
Length = 641
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 181/531 (34%), Positives = 265/531 (49%), Gaps = 55/531 (10%)
Query: 153 GWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP 212
GWE+P ELRGH +GH+LSA+A ++ T + +K K +V L+ CQ G +L+AFP
Sbjct: 76 GWESPTCELRGHIMGHWLSAAATIYGQTQDGLVKAKADYIVAELARCQEANGGEWLAAFP 135
Query: 213 TELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITM 272
K VWAP+YTIHK+L GL D Y LA +A AL++ T M +FY +
Sbjct: 136 ESYMHRIARGKYVWAPHYTIHKLLMGLYDMYRLAGSAAALELMTNMAAWFY----RWTDG 191
Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
++ E L+ ETGGM + LY +T HL L +D+ F L D L++ H
Sbjct: 192 FTREEMDDLLDLETGGMLETWADLYGVTGSGAHLELVRRYDRRRFFDALLEGRDVLTNKH 251
Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSY-ATGGTSAREFWWDPKRLAD 391
ANT IP ++G+ +EVTG+ Y+ I F + Y ATG E W +A
Sbjct: 252 ANTQIPEILGAARAWEVTGEERYRRIVEAFWRCAVSERGYTATGAGDNGELWMPQGEMAA 311
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
LG+ +E C YNM+++++ L RWT + AYADY+ER NGVL+ Q G E G++ Y +
Sbjct: 312 RLGA-GQEHCCNYNMMRLAQVLLRWTGDPAYADYWERRFVNGVLAHQHG-ETGMISYFIG 369
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
LG G K WGT FWCC+GT +++ + I+ EEE GL + Q++ S
Sbjct: 370 LGAGSRKT-----WGTPTGHFWCCHGTLMQANASYEGQIFMEEE---DGLAVCQWLPSKL 421
Query: 512 DWKSGHVVLNQKV--------DPIVSWD---------------PYLR-----MTLTFSSK 543
+++ G + ++ +P+ SW P R LTF ++
Sbjct: 422 EYEIGGTAIRLRIEQDGQHGLEPLSSWSVTGMTAITRADLPPIPVHRPDRFMYRLTFEAE 481
Query: 544 QEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP---PPGNFLSATERWSYNDKLTIQLP 600
+ V L +R+P W S ++NG+ PL P F+ W D +T++LP
Sbjct: 482 RAV--TFKLRMRLP-WWLSGEPVITVNGEA-PLQGELKPSTFVELEREWKSGDTITVELP 537
Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALIS 651
L+ EA+ P A L GP +LAG T+ E I TG L++
Sbjct: 538 KGLKAEAL----PGEPGTVAFLDGPIVLAGLTAEE-RILTGNLEQPETLLA 583
>gi|427384528|ref|ZP_18881033.1| hypothetical protein HMPREF9447_02066 [Bacteroides oleiciplenus YIT
12058]
gi|425727789|gb|EKU90648.1| hypothetical protein HMPREF9447_02066 [Bacteroides oleiciplenus YIT
12058]
Length = 1145
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 180/561 (32%), Positives = 278/561 (49%), Gaps = 39/561 (6%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQQ + ++LL LD D L+ F K A LP G+ YGGWE RG Y+SA A MW
Sbjct: 421 AQQLDAKWLLSLDPDRLLHRFHKNAGLPPKGENYGGWEEHRGGGRGLGH--YMSACAMMW 478
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------LKPVWAP 228
AST K++ V+ L CQ GTGY+ + ++ L P
Sbjct: 479 ASTGEPEFKQRTDYVINELERCQKARGTGYIGSVEDSIWTQVGRGDIRSTGFDLNGGIVP 538
Query: 229 YYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGG 288
++ +HK+ AGL D Y+ N +A + + ++ Y + + + E+ L E GG
Sbjct: 539 WFILHKLFAGLYDIYIYTGNEKAKTVLVNLCDWAYRQFGNL----NDEQWQKMLACEHGG 594
Query: 289 MNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYE 348
M +VL +YSI D K+L ++H FD F L+ Q D L+ HANT IP V+G + R++
Sbjct: 595 MLEVLANVYSIVGDKKYLDMSHWFDHKQFFSPLSHQVDSLAGLHANTQIPKVVGLERRHQ 654
Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLK 408
+T K+ FF + V +H+Y GG E + L++ L ETC TYNMLK
Sbjct: 655 LTHSEEDKVKSHFFWETVVKNHTYCIGGNGDGEHFGPKGILSNRLSDRTAETCNTYNMLK 714
Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
+++ L T + Y DYYE+AL N +L+ Q E G+ Y +PL G K G+ +
Sbjct: 715 LTKMLLAETGDTKYGDYYEKALYNHILASQ-NPETGMTTYYVPLVAGGKK-----GYSSA 768
Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
F +F CC GTG E+ ++ G++IYF+ N L + YI S+ W+ + + Q+
Sbjct: 769 FETFTCCVGTGFENHARYGEAIYFKGRKN--NLLVNLYIPSALTWEETGITIRQE----G 822
Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATE 587
+++ ++ T +S + + +SL RMP WT + + +NG+ + P PG +L T
Sbjct: 823 AYEKNGKVKFTINSSKP--KKASLFFRMPYWTTAK-TEVKVNGRKIDNPVIPGMYLEITG 879
Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLS 647
W ND + I + + TE P+ + AI +GP +LAG + K + +
Sbjct: 880 EWKKNDIIEIHFDMPVYTEPT----PDNPNRLAIKYGPLVLAGKLGNK---KIDPVKDIP 932
Query: 648 ALISPIPPSFNAQLVTFTQES 668
LI P N + +Q+S
Sbjct: 933 VLIVDDKP-VNEWVSRISQDS 952
>gi|337745980|ref|YP_004640142.1| hypothetical protein KNP414_01710 [Paenibacillus mucilaginosus
KNP414]
gi|336297169|gb|AEI40272.1| protein of unknown function DUF1680 [Paenibacillus mucilaginosus
KNP414]
Length = 636
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 181/531 (34%), Positives = 265/531 (49%), Gaps = 55/531 (10%)
Query: 153 GWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP 212
GWE+P ELRGH +GH+LSA+A ++ T + +K K +V L+ CQ G +L+AFP
Sbjct: 71 GWESPTCELRGHIMGHWLSAAATIYGQTQDGLVKAKADYIVAELARCQEANGGEWLAAFP 130
Query: 213 TELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITM 272
K VWAP+YTIHK+L GL D Y LA +A AL++ T M +FY +
Sbjct: 131 ESYMHRIARGKYVWAPHYTIHKLLMGLYDMYRLAGSAAALELMTNMAAWFY----RWTDG 186
Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
++ E L+ ETGGM + LY +T HL L +D+ F L D L++ H
Sbjct: 187 FTREEMDDLLDLETGGMLETWADLYGVTGSGAHLELVRRYDRRRFFDALLEGRDVLTNKH 246
Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSY-ATGGTSAREFWWDPKRLAD 391
ANT IP ++G+ +EVTG+ Y+ I F + Y ATG E W +A
Sbjct: 247 ANTQIPEILGAARAWEVTGEERYRRIVEAFWRCAVSERGYTATGAGDNGELWMPQGEMAA 306
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
LG+ +E C YNM+++++ L RWT + AYADY+ER NGVL+ Q G E G++ Y +
Sbjct: 307 RLGA-GQEHCCNYNMMRLAQVLLRWTGDPAYADYWERRFVNGVLAHQHG-ETGMISYFIG 364
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
LG G K WGT FWCC+GT +++ + I+ EEE GL + Q++ S
Sbjct: 365 LGAGSRKT-----WGTPTGHFWCCHGTLMQANASYEGQIFMEEE---DGLAVCQWLPSKL 416
Query: 512 DWKSGHVVLNQKV--------DPIVSWD---------------PYLR-----MTLTFSSK 543
+++ G + ++ +P+ SW P R LTF ++
Sbjct: 417 EYEIGGTAIRLRIEQDGQHGLEPLSSWSVTGMTAITRADLPPIPVHRPDRFMYRLTFEAE 476
Query: 544 QEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP---PPGNFLSATERWSYNDKLTIQLP 600
+ V L +R+P W S ++NG+ PL P F+ W D +T++LP
Sbjct: 477 RAV--TFKLRMRLP-WWLSGEPVITVNGEA-PLQGELKPSTFVELEREWKSGDTITVELP 532
Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALIS 651
L+ EA+ P A L GP +LAG T+ E I TG L++
Sbjct: 533 KGLKAEAL----PGEPGTVAFLDGPIVLAGLTAEE-RILTGNLEQPETLLA 578
>gi|117920524|ref|YP_869716.1| acetyl-CoA carboxylase, biotin carboxylase [Shewanella sp. ANA-3]
gi|117612856|gb|ABK48310.1| acetyl-CoA carboxylase, biotin carboxylase [Shewanella sp. ANA-3]
Length = 795
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 178/554 (32%), Positives = 276/554 (49%), Gaps = 54/554 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L + L+DV L L AQQT+L Y++ +D + L+ +RK A + T Y WEN
Sbjct: 28 LTPIPLNDVRLTAGPFL-HAQQTDLAYIMSMDPERLLAPYRKAAGIATTADNYPNWEN-- 84
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
+ L GH GHYLSA A M+A+T + + +++ +V L +CQ G GY+ P +L+
Sbjct: 85 TGLDGHIGGHYLSALALMYAATGDQAVLSRLNYMVAELEKCQQAHGNGYVGGVPHGDKLW 144
Query: 217 DSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFY 263
A L W P+Y +HK+ AGL D Y+ N A KM A WM++
Sbjct: 145 QQVAAGHIEADLFTLNQSWVPWYNVHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDLSR 204
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
N S E+ L E GG+N+ L +YSIT K+L LA+ + L L
Sbjct: 205 N--------LSDEQLQLMLRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQ 256
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
D L+ HANT IP ++G E++ + + +F V + + GG S RE++
Sbjct: 257 HQDKLTGLHANTQIPKIVGVARIAELSNNKEWLESADYFWQQVVHQRTVSIGGNSVREYF 316
Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
+ + L S E ETC TYNMLK+S+ L+ +++ Y DYYERAL N +LS Q +
Sbjct: 317 HPSEDFSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-PQ 375
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
G ++Y P+ + + + S WCC G+GIE+ +K G+ IY EE+ N L+
Sbjct: 376 TGGLVYFTPM-----RPDHYRVYSSAQESMWCCVGSGIENHAKYGELIYAEEDNN---LF 427
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS--SLNLRMPVWT 560
+ ++ S WK+ + L+QK + +S+ + Q + +LNLR P W
Sbjct: 428 VNLFVDSEVHWKAKGISLSQKT----------QFPDDNTSQMIIHQEADFTLNLRYPTWA 477
Query: 561 YSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
S+NG+ P G ++ T W D +TI LP+ + E + P+ ++
Sbjct: 478 -KGEVTVSINGEPQRFTPTQGQYIPLTRHWRKGDSVTITLPMDISLEQL----PDKSAYY 532
Query: 620 AILFGPYLLAGHTS 633
++L+GP +LA T+
Sbjct: 533 SVLYGPIVLAAKTA 546
>gi|383641951|ref|ZP_09954357.1| hypothetical protein SchaN1_14318 [Streptomyces chartreusis NRRL
12338]
Length = 768
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 187/545 (34%), Positives = 269/545 (49%), Gaps = 53/545 (9%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
WLD Q YL +DVD L+++FR L T G A GGW+ P R H
Sbjct: 62 WLDN-------QDRTRNYLRFVDVDRLLYNFRANHRLSTNGAAANGGWDAPDFPFRTHVQ 114
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFE- 220
GH+L+A AQ++A T + T ++K +T+V L++CQ T GYLS +P F + E
Sbjct: 115 GHFLTAWAQLYAVTGDTTCRDKATTMVAELAKCQANNSTAGFNAGYLSGYPESDFTALEQ 174
Query: 221 -ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRV--QKVITMY 273
L PYYTIHK L GLLD + + QA L +A W V++ R+ Q++ M
Sbjct: 175 RTLSNGNVPYYTIHKTLVGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRLSGQQMQAM- 232
Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
L E GGMN VL LY T D + L +A FD LA D LS HA
Sbjct: 233 --------LQTEFGGMNTVLTDLYQQTGDARWLTVARRFDHAAVFDPLAAGQDQLSGLHA 284
Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
NT +P IG+ Y+ TG Y+ I T +I SH+YA GG S E + P +A L
Sbjct: 285 NTQVPKWIGAAREYKATGTTRYRDIATNAWNICVNSHTYAIGGNSQAEHFRAPNAIAGFL 344
Query: 394 GSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLP 451
+ E+C T+NML ++R LF +A DYYERA N ++ Q + G + Y P
Sbjct: 345 NKDTCESCNTFNMLTLTRELFALDPNRVALFDYYERAWLNQMIGQQNPADDHGHVTYFTP 404
Query: 452 LG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
L RGV A W T + +FWCC GTG+E ++L DSIYF + L + ++
Sbjct: 405 LNPGGRRGVGPAWGGGTWSTDYGTFWCCQGTGLEMHTRLMDSIYFRSDNT---LIVNMFV 461
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S +W + + Q S L +T S ++ +R+P WT GA
Sbjct: 462 PSVLNWSERGITVTQTTSYPNSDTTTLHVTGNASGTW------AMRIRIPSWT--TGATV 513
Query: 568 SLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
S+NG + PG++ + + W+ D +T++LP+ + I + A++ AI +GP
Sbjct: 514 SVNGVAQTITTTPGSYATLSRSWASGDTVTVRLPMRV----IMRAANDNANVAAITYGPV 569
Query: 627 LLAGH 631
+L+G+
Sbjct: 570 VLSGN 574
>gi|418466296|ref|ZP_13037222.1| secreted protein [Streptomyces coelicoflavus ZG0656]
gi|371553101|gb|EHN80323.1| secreted protein [Streptomyces coelicoflavus ZG0656]
Length = 773
Score = 262 bits (670), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 179/538 (33%), Positives = 259/538 (48%), Gaps = 40/538 (7%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
WLD Q L YL +DVD L+ +FR L T G A GGWE P R H
Sbjct: 64 WLDN-------QSRTLSYLRFVDVDRLLHNFRANHRLSTNGAAATGGWEAPDFPFRSHVQ 116
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
GH+L+A AQ +A T + ++K +V L++CQ GTGYLS +P F + E+
Sbjct: 117 GHFLTAWAQAYAVTGDTACRDKALYMVAELAKCQANNGAAGFGTGYLSGYPESDFAALES 176
Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
L PYYTIHK LAGLL+ + L + +A + + + R ++ S R
Sbjct: 177 GTLNNGNVPYYTIHKTLAGLLEVWRLLGSTRARDVLLALAGWVDRRTGRL----STTRMQ 232
Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
L E GGMN VL L T D + L +A FD LA D L+ HANT +P
Sbjct: 233 AVLGTEFGGMNAVLTDLCQQTGDTRWLAVAQRFDHAAVFDPLAANQDRLAGLHANTQVPK 292
Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE 399
IG+ Y+ TG Y+ I T ++ +H+YA GG S E + P +A L ++ E
Sbjct: 293 WIGAVREYKATGSTRYRDIATNAWNMCVTTHTYAVGGNSQAEHFRPPNAIAAHLANDTCE 352
Query: 400 TCTTYNMLKVSRHLFRWTKEIAYA-DYYERALTNGVLSIQRGTEP-GVMIYMLPLG---- 453
+C T NML ++R LF + + A DYYE+A N ++ Q +P G + Y PL
Sbjct: 353 SCNTVNMLGLTRELFALSPDRAELFDYYEQAWLNHMIGQQNPADPHGHVTYFTPLKPGGR 412
Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
RGV A W T + +FWCC GTG+E ++L DS+YF + G L + ++ S W
Sbjct: 413 RGVGPAWGGGTWSTDYTTFWCCQGTGLEMHTRLMDSVYFHDGGTT--LTVNLFVPSVLTW 470
Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG-Q 572
+ + Q S LR+T + ++ +R+P WT GA S+NG +
Sbjct: 471 AERGITVTQSTSYPASDTTTLRIT------GDAAGTWAMRVRIPGWT--TGAVVSVNGVR 522
Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
PG + + W D +T++LP+ DD ++ A+ GP +L+G
Sbjct: 523 QHVTAAPGTYATLDRAWDSGDTVTVRLPMRTVVRPANDD----PAVGAVTHGPVVLSG 576
>gi|251795999|ref|YP_003010730.1| hypothetical protein Pjdr2_1987 [Paenibacillus sp. JDR-2]
gi|247543625|gb|ACT00644.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 626
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 178/523 (34%), Positives = 251/523 (47%), Gaps = 58/523 (11%)
Query: 147 PGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTG 206
P + GWE+ ELRGH +GH+LSA+AQ++A T +A +K K +V L CQ G
Sbjct: 65 PEHWHWGWESVTCELRGHIMGHWLSAAAQIYAQTSDALVKAKADYIVEELVRCQEANGGE 124
Query: 207 YLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRV 266
+L+AFP VWAP+YTIHK+L GL D Y +A N QAL++ + ++FY
Sbjct: 125 WLAAFPESYMHRIAKGSFVWAPHYTIHKLLMGLYDMYAIAGNEQALRVMRGIADWFY--- 181
Query: 267 QKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
K +S E L+ ETGGM +V LY IT + KHL L +D+ F L D
Sbjct: 182 -KWTGNFSQEEMDELLDLETGGMLEVWADLYGITKEDKHLNLVKRYDRRRFFDALLEGQD 240
Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSY-ATGGTSAREFWWD 385
L++ HANT IP ++G+ +EVTG+ Y+ I F + Y ATG E W
Sbjct: 241 VLTNKHANTQIPEILGAARAWEVTGEDRYRRIVEAFWRLAVTDRGYVATGAGDNGELWMP 300
Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
+ LG +E C YNM++++ L RWT + AYADY+ER NGVL+ Q G + G+
Sbjct: 301 RGEMGSRLGV-GQEHCCNYNMMRLAHVLLRWTGDPAYADYWERRFYNGVLAHQHG-DTGM 358
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
+ Y L +G G K+ WGT FWCC+GT +++ + I+ E+E G+ I Q
Sbjct: 359 ISYFLGMGAGSKKS-----WGTPTQHFWCCHGTLMQANAAYESQIFMEDEN---GIAICQ 410
Query: 506 YISSSF-------------------------DWKSGHVVLNQKVD--PIVSWDPYLRMTL 538
+I S +W + KVD PI P R
Sbjct: 411 WIPSELQLSRADGNLRIRIEQDGQYGVYPLNNWSVKGMTAITKVDMPPIPEHRPD-RFVY 469
Query: 539 TFSSKQEVGQLSSLNLRMPVWTYS------NGAQASLNGQNLPLPPPGNFLSATERWSYN 592
T + E L LR+P W NG+Q N P ++ + WS
Sbjct: 470 TVTIGLEHASTFELKLRLPWWLSGPPVIRVNGSQVEQNEAK-----PSSYTAIAREWSNG 524
Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
D +T++LP +L E + D Y A GP ++AG T E
Sbjct: 525 DVVTVELPKTLTMEPLPGDTGTY----AFFDGPIVMAGLTEEE 563
>gi|113970330|ref|YP_734123.1| hypothetical protein Shewmr4_1993 [Shewanella sp. MR-4]
gi|113885014|gb|ABI39066.1| protein of unknown function DUF1680 [Shewanella sp. MR-4]
Length = 795
Score = 262 bits (669), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 177/554 (31%), Positives = 276/554 (49%), Gaps = 54/554 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L + L+DV L L AQQT+L Y++ +D + L+ +RK A + T Y WEN
Sbjct: 28 LTPIPLNDVRLTAGPFL-HAQQTDLAYIMSMDPERLLAPYRKEAGIATTADNYPNWEN-- 84
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
+ L GH GHYLSA A M+A+T + + E+++ +V L +CQ G GY+ P +L+
Sbjct: 85 TGLDGHIGGHYLSALALMYAATGDQAVLERLNYMVAELEKCQQAHGNGYVGGVPHGDKLW 144
Query: 217 DSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFY 263
A L W P+Y +HK+ AGL D Y+ N A KM A WM++
Sbjct: 145 QQVAAGHIEADLFTLNQSWVPWYNLHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDLSR 204
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
N + + + L E GG+N+ L +YSIT K+L LA+ + L L
Sbjct: 205 NLTDEQLQLM--------LRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQ 256
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
+ L+ HANT IP ++G E++ + + +F V + + GG S RE +
Sbjct: 257 HQEKLTGLHANTQIPKIVGVARIAELSHNKAWLESADYFWQQVVHQRTVSIGGNSVREHF 316
Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
+ + L S E ETC TYNMLK+S+ L+ +++ Y DYYERAL N +LS Q +
Sbjct: 317 HPSEDFSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-PQ 375
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
G ++Y P+ + + + S WCC G+GIE+ +K G+ IY EE+ N L+
Sbjct: 376 TGGLVYFTPM-----RPDHYRVYSSAQESMWCCVGSGIENHAKYGELIYAEEDNN---LF 427
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS--SLNLRMPVWT 560
+ ++ S +WK+ + L+QK + +S+ + Q + +LNLR P W
Sbjct: 428 VNLFVDSEVNWKAKGISLSQKT----------QFPDDNTSQMIIHQEADFTLNLRYPTWA 477
Query: 561 YSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
+ S+NG+ P G ++ T W D +TI LP+ + E + D Y
Sbjct: 478 KGD-VTVSINGEPQRFTPTQGQYIPLTRHWRKGDSVTITLPMDISLEQLPDKTAYY---- 532
Query: 620 AILFGPYLLAGHTS 633
++L+GP +LA T+
Sbjct: 533 SVLYGPIVLAAKTA 546
>gi|433676676|ref|ZP_20508761.1| hypothetical protein BN444_00795 [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430818203|emb|CCP39076.1| hypothetical protein BN444_00795 [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 807
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 176/551 (31%), Positives = 268/551 (48%), Gaps = 51/551 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
LK+V+L + S+ + QTN YLL L+ D L+ +F + A LP G+ YGGWE
Sbjct: 65 LKQVTL------KPSLFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGEVYGGWEG-- 116
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE---- 214
+ GH +GHYLSA A+M A T +A +++++ +V L+ Q K GY+ +
Sbjct: 117 DTIAGHTLGHYLSALAKMHAQTRDAALRQRIDYIVAELARAQAKDPDGYVGGLTRKNDKG 176
Query: 215 -------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
+F+ L W+P YT+HK+ AGLLD + LA NAQAL++ +
Sbjct: 177 AIDNGKLVFEEVRRGIIKGSKFNLNGSWSPLYTVHKLFAGLLDAHELAGNAQALQVLLPL 236
Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
Y V + L+ E GG+N+ L + T DP+ + L +
Sbjct: 237 AGYLGG----VFDALDHAQMQALLDTEFGGLNESYIELGARTGDPRWIALGKRLRHEKVI 292
Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
A D L H HANT +P IG ++EV GD FF + V +SY GG +
Sbjct: 293 DPAAAGRDELPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTGHYSYVIGGNA 352
Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
RE++ +P +A L + E C +YNMLK++RHL++WT + Y DYYER L N ++ Q
Sbjct: 353 DREYFQEPDTIAAFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAAQ 412
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
G+ YM P+ G + G+ KF+SFWCC G+G+E+ ++ GDSIY+++ +
Sbjct: 413 H-PATGMFTYMTPMIGGGER-----GFSDKFDSFWCCVGSGMEAHAQFGDSIYWQDAAS- 465
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
LY+ YI S+ DW + L ++D V + +R+ L + + +L
Sbjct: 466 --LYVNLYIPSTLDWPERDLAL--ELDSGVPDNGKVRLQLRCAGARTPRRLLLRLP---A 518
Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
W G LNG+ +L+ RW D + + L + LR E D A
Sbjct: 519 WC-QGGYTLRLNGKAQRGTAADGYLALERRWRSGDMIELDLAMPLRLEHAAGD----ADT 573
Query: 619 QAILFGPYLLA 629
++ GP LA
Sbjct: 574 VVVMRGPLALA 584
>gi|238059692|ref|ZP_04604401.1| secreted protein [Micromonospora sp. ATCC 39149]
gi|237881503|gb|EEP70331.1| secreted protein [Micromonospora sp. ATCC 39149]
Length = 740
Score = 261 bits (666), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 181/539 (33%), Positives = 268/539 (49%), Gaps = 41/539 (7%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
WLD Q L YL +DVD L+++FR L T G A GGW+ P R H
Sbjct: 28 WLDN-------QNRTLSYLRFVDVDRLLYNFRANHRLSTNGAASNGGWDAPSFPFRTHVQ 80
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEA 221
GH+L+A AQ +A + T ++K + +V L++CQ G GYLS FP F + EA
Sbjct: 81 GHFLTAWAQAYAVLGDTTCRDKANYMVAELAKCQANNGAAGFTAGYLSGFPESDFTALEA 140
Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
L PYY IHK L GLLD + N QA + + + R ++ S +
Sbjct: 141 RTLSNGNVPYYCIHKTLLGLLDVWRYIGNTQARSVLLALAGWVDTRTARL----SSSQMQ 196
Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
L E GGMN+ L LY T D + L +A FD LA +D L+ HANT +P
Sbjct: 197 AMLGTEFGGMNEALADLYQQTGDGRWLTVAQRFDHAAVFNPLAANSDQLNGLHANTQVPK 256
Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE 399
IG+ Y+ TG Y+ I + ++ +H+YA GG S E + P +A L ++ E
Sbjct: 257 WIGAAREYKATGTTRYRDIASNAWNMTVNAHTYAIGGNSQAEHFRAPNAIAGYLTNDTCE 316
Query: 400 TCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG---- 453
C T NMLK++R L+ + AY DY+ERAL N V+ Q + G + Y PL
Sbjct: 317 HCNTVNMLKLTRELWLIDPNQAAYFDYFERALANHVIGAQNPADGHGHVTYFTPLKPGGR 376
Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
RGV A W T ++SFWCC GTGIE ++L DSIYF N L + + S+ +W
Sbjct: 377 RGVGPAWGGGTWSTDYDSFWCCQGTGIEINTRLMDSIYFH---NGTTLTVNLFAPSTLNW 433
Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
+ + Q + V L ++ T S S+ +R+P W ++GA ++NG
Sbjct: 434 SQRGITVTQSTNYPVGDTTTLTLSGTMSGSW------SIRVRIPAW--ASGATIAVNGAT 485
Query: 574 LPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
+ PG++ + T W+ D +T++LP+ + + + A++ A+ +GP +L G+
Sbjct: 486 QSVATTPGSYATVTRTWASGDTITVRLPMRV----VLSPANDNAAVAAVTYGPMVLCGN 540
>gi|399074049|ref|ZP_10750795.1| hypothetical protein PMI01_01867 [Caulobacter sp. AP07]
gi|398040822|gb|EJL33912.1| hypothetical protein PMI01_01867 [Caulobacter sp. AP07]
Length = 775
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 177/573 (30%), Positives = 273/573 (47%), Gaps = 48/573 (8%)
Query: 77 VSWALLYRKIKNPGGFDLPGNFLKE-VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLV 135
S A+ + +PG G + E V V L + S+ +AQ N YL+ L D L+
Sbjct: 14 ASSAMAFVGAASPGLAAPAGRVVAEPVPARHVAL-KPSIFQQAQAANRAYLVSLSADRLL 72
Query: 136 WSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFS 195
+F + A L YGGWE + GH +GHYL+A A A T + + ++++ +V
Sbjct: 73 HNFHQGAGLSVKAPVYGGWE--AQSIAGHTLGHYLTACALQVAGTGDPVLSDRLTYIVAE 130
Query: 196 LSECQNKIGTGYL----------SAFPTELFDSFE---------ALKPVWAPYYTIHKIL 236
L+ Q G GY+ +A ++F+ +L W P YT HK+
Sbjct: 131 LARVQAAHGDGYVGGTTRWGQSDAAGGKQVFEELRRGDIRASRFSLNDGWVPIYTWHKVH 190
Query: 237 AGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRL 296
AGLLD + LA +AL +A + YF ++ S + L E GG+N+
Sbjct: 191 AGLLDAHRLAGTPRALAVAVGLAGYFAT----IVEGLSDAQVQQILITEHGGINEAYAET 246
Query: 297 YSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYK 356
Y++T D + L +A L +A D L+ HANT IP VIG YEV GDP
Sbjct: 247 YALTGDERWLKVARRLRHKAVLDPIAEGRDELAGLHANTQIPKVIGLARLYEVGGDPAEA 306
Query: 357 LIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRW 416
FF +V +HSY GG S RE + P +A + E C TYNMLK++R L+ W
Sbjct: 307 RAARFFHQVVTENHSYVIGGNSDREHFGKPNEIARHMAETTCEACNTYNMLKLTRRLWSW 366
Query: 417 TKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCY 476
A DYYERA N +++ QR ++ G+ +Y +P+ G ++ S T +SFWCC
Sbjct: 367 APNGALFDYYERAQLNHIMAHQRPSD-GMFVYFMPMAAGGRRSYS-----TPEDSFWCCV 420
Query: 477 GTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM 536
G+G+ES +K DSI++ G+ LY+ ++ S D G ++ +D + +R+
Sbjct: 421 GSGMESHAKHADSIWW-RGGDT--LYLNLFLPSRLDLPDGDFAID--LDTRYPAEGLVRL 475
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLT 596
++ + E + LR+P W + +NG + P + RW D++
Sbjct: 476 SVVRAPSAE----REIALRLPAWCAA--PLVKVNGAAIGRPGRDGYARLKRRWKAGDRIE 529
Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
+ LP+ LR E DD ++ A + GP +LA
Sbjct: 530 LVLPMHLRAEPTPDD----PNLVAFVSGPLVLA 558
>gi|440730056|ref|ZP_20910155.1| hypothetical protein A989_02030 [Xanthomonas translucens DAR61454]
gi|440379682|gb|ELQ16270.1| hypothetical protein A989_02030 [Xanthomonas translucens DAR61454]
Length = 807
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 179/576 (31%), Positives = 277/576 (48%), Gaps = 53/576 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
LK+V+L + S+ + QTN YLL L+ D L+ +F + A LP G+ YGGWE
Sbjct: 65 LKQVTL------KPSLFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGEVYGGWEG-- 116
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE---- 214
+ GH +GHYLSA A+M A T +A +++++ +V L+ Q K GY+ +
Sbjct: 117 DTIAGHTLGHYLSALAKMHAQTRDAALRQRIDYIVAELARAQAKDPDGYVGGLTRKNDKG 176
Query: 215 -------LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
+F+ L W+P YT+HK+ AGLLD + LA NAQAL++ +
Sbjct: 177 AIDNGKLVFEEVRRGIIKGSKFNLNGSWSPLYTVHKLFAGLLDAHALAGNAQALQVLLPL 236
Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
Y V + L+ E GG+N+ L + T DP+ + L +
Sbjct: 237 AGYLGG----VFDALDHAQMQTLLDTEFGGLNESYIELGARTGDPRWIALGKRLRHEKVI 292
Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
A D L H HANT +P IG ++EV GD FF + V +SY GG +
Sbjct: 293 DPAAAGRDELPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTGHYSYVIGGNA 352
Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
RE++ +P +A L + E C +YNMLK++RHL++WT + Y DYYER L N ++ Q
Sbjct: 353 DREYFQEPDTIAAFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAAQ 412
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
G+ YM P+ G + G+ KF+SFWCC G+G+E+ ++ GDSIY++ +
Sbjct: 413 H-PATGMFTYMTPMISGGER-----GFSDKFDSFWCCVGSGMEAHAQFGDSIYWQ---DA 463
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
LY+ YI S+ DW + L ++D V + +R+ L + G + L + +
Sbjct: 464 VSLYVNLYIPSTLDWPERDLTL--ELDSGVPDNGKVRLQL-----RRAGARTPRRLLLRL 516
Query: 559 WTYSNGAQA-SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
+ GA +NG++ +L+ +W D + + L + LR E D A
Sbjct: 517 PAWCQGAYTLRVNGKSQRGTAADGYLALERQWRSGDVIELDLAMPLRLEHAAGD----AD 572
Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPI 653
++ GP LA D +L A P+
Sbjct: 573 TVVVMRGPLALAADLGPVADPYDAPDPALVAAADPL 608
>gi|332185145|ref|ZP_08386894.1| hypothetical protein SUS17_217 [Sphingomonas sp. S17]
gi|332014869|gb|EGI56925.1| hypothetical protein SUS17_217 [Sphingomonas sp. S17]
Length = 782
Score = 259 bits (662), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 188/554 (33%), Positives = 268/554 (48%), Gaps = 55/554 (9%)
Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYL 170
+ S+ A +TN YL LD D L+ +FR A L YGGWE+ + GH +GHY+
Sbjct: 39 RPSIYATAVETNRRYLYRLDPDRLLHNFRLYAGLKPKAPIYGGWES--DTIAGHTLGHYM 96
Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSF 219
SA W T + ++ + +V L+E Q K GTGY+ A E+F
Sbjct: 97 SALVLTWQQTGDTEMRRRADYIVSELAEAQAKRGTGYVGALGRKRADGTIVDGEEIFHEI 156
Query: 220 EA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
A L W+P YT+HK+ AGLLD + NAQAL +A + YF +V
Sbjct: 157 MAGKIKSGGFDLNGSWSPLYTVHKLFAGLLDIHGGWGNAQALDVAVKLGGYF----ARVF 212
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
R L E GG+N+ LY T D + L LA L L D L++
Sbjct: 213 AALDDARLQDVLGCEYGGLNESFAELYQRTGDRQWLALAERIYDNKVLDPLVAGKDQLAN 272
Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
HANT +P +IG +E+T P FF + V HSY GG + RE++ +P +A
Sbjct: 273 LHANTQVPKLIGLARIHEITAAPAPAAGARFFWENVTGHHSYVIGGNADREYFSEPDTIA 332
Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
+ + E C +YNMLK++RHL+ W + DYYERA N V++ Q G YM
Sbjct: 333 RHITEQTCEHCNSYNMLKLTRHLYGWQPDGRLFDYYERAHLNHVMAAQHPVHAG-FTYMT 391
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
PL G+++ ST K ++FWCC G+G+ES +K G+SI++ + G+ L++ YI +
Sbjct: 392 PLMTGMAREFST----DKDDAFWCCVGSGMESHAKHGESIFW-QGGDT--LFVNLYIPAE 444
Query: 511 FDW-KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG-AQAS 568
W K G VV P+ L FS G+ + LR+P W +NG A
Sbjct: 445 ARWDKRGAVVTLDTAYPMDG-----AAKLAFSRLDRAGRF-PVALRVPGW--ANGQAAVE 496
Query: 569 LNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
+NGQ P+ P + RW D + I+LPL LR E D S+ A++ GP
Sbjct: 497 VNGQ--PVTPVFERGYAVVDRRWKTGDTVAIRLPLDLRVEPTPGDD----SVVAVVRGPM 550
Query: 627 LLA---GHTSGEWD 637
++A G T+ WD
Sbjct: 551 VMAADLGPTTTPWD 564
>gi|407790778|ref|ZP_11137869.1| Acetyl-CoA carboxylase, biotin carboxylase [Gallaecimonas
xiamenensis 3-C-1]
gi|407202325|gb|EKE72317.1| Acetyl-CoA carboxylase, biotin carboxylase [Gallaecimonas
xiamenensis 3-C-1]
Length = 780
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 185/556 (33%), Positives = 269/556 (48%), Gaps = 57/556 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L+ + L +V L S +AQ TN YL LD D L+ FR A LP P YG WE
Sbjct: 20 LETLPLQEVRL-LPSPFKQAQDTNRHYLDSLDPDRLLAPFRAEAGLPQPKPGYGNWE--A 76
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
L GH GHYLSA + M+AST + + ++ ++ L +CQ+K+GTGY+ P
Sbjct: 77 DGLGGHMGGHYLSALSLMYASTGDPALLARLQYMLDELKKCQDKLGTGYIGGVPGGSALW 136
Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKM-------ATWMVE 260
++ L W P+Y +HK+ AGL D Y +AQAL M W+VE
Sbjct: 137 QQIHQGDIQADLFTLNQKWVPWYNLHKLYAGLRDAYRYTGSAQALAMWIKLSDWTDWLVE 196
Query: 261 YFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF 320
+ + + L E GGMN+V LY IT K+L LA F + L
Sbjct: 197 GLSDEQMQAM-----------LVTEYGGMNEVFADLYEITGQDKYLQLAKRFSQQQLLQP 245
Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR 380
LA D L+ HANT IP VIG + +V+GD +F V + A GG S R
Sbjct: 246 LAHGQDQLNGLHANTQIPKVIGFERIAQVSGDRAMGAAADYFWHQVVEQRTVAIGGNSVR 305
Query: 381 EFWWDPKRLADTLGSENE--ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
E + PK ++ E E ETC +YNMLK++R L++ + Y YYERAL N +L+ Q
Sbjct: 306 EH-FHPKDDFSSMVEEVEGPETCNSYNMLKLARLLYQRQGGLDYLAYYERALYNHILASQ 364
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
+ G ++Y P+ + + + WCC G+GIES SK G IY ++
Sbjct: 365 H-PDDGGLVYFTPM-----RPNHYRVYSQADKAMWCCVGSGIESHSKYGAMIYATDQS-- 416
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
LYI +I S DW V L+ +D D + +T +S L +R P
Sbjct: 417 -ALYINLFIPSRLDWTEKGVKLS--LDTRFPDDDSVFITFEQASS------LPLKIRYPS 467
Query: 559 WTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W + + +NG + PG +LS +W D+++++LP++L E + P+ ++
Sbjct: 468 WVKAGQLELRVNGTPRAVTAKPGQYLSLAGQWQKGDQISLKLPMALSLEQM----PDQSN 523
Query: 618 IQAILFGPYLLAGHTS 633
A+LFGP +LA T+
Sbjct: 524 YYAVLFGPIVLAAKTN 539
>gi|347528202|ref|YP_004834949.1| hypothetical protein SLG_18170 [Sphingobium sp. SYK-6]
gi|345136883|dbj|BAK66492.1| hypothetical protein SLG_18170 [Sphingobium sp. SYK-6]
Length = 805
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 176/536 (32%), Positives = 257/536 (47%), Gaps = 47/536 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A N YLL L+ D L+ +F A L G+AYGGWE + GH +GHY++A A M
Sbjct: 61 AVDANRRYLLQLEPDRLLHNFLVHAGLEPKGEAYGGWEG--DTIAGHTLGHYMTALALMH 118
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV------------ 225
A T +A + +V L Q G GY++ F D E K +
Sbjct: 119 AQTGDAECARRALYIVDELERAQKASGDGYVAGFTRRNGDVVEDGKAIFPEIMAGDIRSA 178
Query: 226 -------WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
W P+Y HK+ AGL D + +A+ +A + Y ++KV +
Sbjct: 179 GFDLNGCWVPFYNWHKLYAGLFDIQTWIGSDKAIPIAVSLSGY----IEKVFASLDDTQL 234
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
L+ E GG+N+ L+ T DP+ L LA L L+ + L HANT IP
Sbjct: 235 QTVLDCEHGGINESFAELHVRTGDPRWLALAERIRHRKVLDPLSRGENSLPWIHANTQIP 294
Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
VIG +E+TG + + +F D V +SY GG + RE++ DP ++ + +
Sbjct: 295 KVIGLARLHEITGRADHAIAARYFWDTVVHRYSYVIGGNADREYFPDPDTVSRHITEQTC 354
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C TYNMLK++RHL+ W E + DYYERA N +L+ QR T+ G+ YM+PL G +
Sbjct: 355 ESCNTYNMLKLTRHLYAWRPEASLFDYYERAHINHILAQQR-TDNGMFAYMVPLMSGTHR 413
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV---PGLYIIQYISSSFDWKS 515
A W F+SFWCC G+GIES SK G+SI++EE+ L YI S W +
Sbjct: 414 A-----WSDPFDSFWCCVGSGIESHSKHGESIWWEEDDQRRAGEALVANLYIPSRTQWSA 468
Query: 516 -GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
G ++ + P +D + + LT +K +L LR+P W + +NG+
Sbjct: 469 RGATLVMETAYP---FDGEIDIALTELAKPGT---FTLALRIPAWC--DEPAVLINGKAW 520
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
P +++ W D + + LP+ LR E DD S A L GP +LA
Sbjct: 521 KATPADGYIAIKRPWKRGDSIRLSLPMKLRMEPTPDD----PSTVAFLRGPVVLAA 572
>gi|347738800|ref|ZP_08870212.1| hypothetical protein AZA_89687 [Azospirillum amazonense Y2]
gi|346918071|gb|EGY00199.1| hypothetical protein AZA_89687 [Azospirillum amazonense Y2]
Length = 804
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 174/531 (32%), Positives = 253/531 (47%), Gaps = 44/531 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A NL YL L+ D L+ +FR A L G AYGGWE + GH +GHYLSA + M
Sbjct: 53 AVDANLAYLHSLEADRLLHNFRSGAGLQPKGAAYGGWEG--DTIAGHTLGHYLSALSLMH 110
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV------------ 225
A T +A K ++ +V L+ECQ G GY++ F + D E K V
Sbjct: 111 AQTGDAECKRRVDYIVAELAECQKAQGDGYVAGFTRKRGDIVEDGKVVFDELRRGEIRSA 170
Query: 226 -------WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
W P Y HK+ GL D L N QAL + + Y + +V + + E+
Sbjct: 171 GFDLNGCWVPLYNWHKLYTGLFDAQTLCGNTQALDVGVKLGGY----IDEVFSHLNDEQV 226
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIP 338
L+ E GG+N+ LY+ T D + LLLA L L+ D L++ HANT IP
Sbjct: 227 QKVLDCEHGGINESFAELYARTGDRRWLLLAERLYHAKVLVPLSEGRDELANIHANTQIP 286
Query: 339 IVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
+IG E+TG + FF V +HSY GG + RE++ +P+ ++ + +
Sbjct: 287 KLIGLARLAELTGSERHAKASAFFWQTVTTNHSYVIGGNADREYFQEPRSISRHITEQTC 346
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E C +YNMLK++R L+ + Y D+YERA N VL+ Q+ G+ YM PL G
Sbjct: 347 EGCNSYNMLKLTRLLYARQADAHYFDFYERAHLNHVLA-QQNPATGMFTYMTPLMSG--- 402
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
S + T FWCC GTG+ES +K G+S+Y+ L + YI S+ W
Sbjct: 403 --SAREFSTPTEDFWCCVGTGMESHAKHGESVYWRR--GAEDLAVNLYIPSTLTWGERGA 458
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
V VD + + LT + + +++ R+P W GA ++NG+ L
Sbjct: 459 V----VDLDTRYPEAETVLLTLKALKRPATF-AVSFRIPAW--CTGATLAVNGKPQDLVV 511
Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
+ W D + ++LP++LR E+ DD A A L GP +LA
Sbjct: 512 QNGYAVVRREWKAGDAVALRLPMALRLESTNDD----ADTVAFLHGPLVLA 558
>gi|408527846|emb|CCK26020.1| secreted protein [Streptomyces davawensis JCM 4913]
Length = 731
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 182/540 (33%), Positives = 263/540 (48%), Gaps = 47/540 (8%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY-GGWENPISELRGHFV 166
WLD Q YL +DVD L+++FR L T G A GGW+ P R H
Sbjct: 27 WLDN-------QNRTGNYLRFVDVDRLLYNFRANHKLSTNGAAANGGWDAPDFPFRTHIQ 79
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFEA 221
GH+L+A AQ++A T + T ++K + +V L++CQ GYLS +P F + E
Sbjct: 80 GHFLTAWAQLYAVTGDTTCRDKATYMVAELAKCQANNSAAGFSPGYLSGYPEANFTALEQ 139
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVER 277
YYTIHK LAGLLD + + QA L +A W V++ R+ + E+
Sbjct: 140 GTKGDVLYYTIHKTLAGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRL-------TSEQ 191
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
L E GGMN VL L+ T D + L +A FD LA D L+ HANT +
Sbjct: 192 MQNMLRIEFGGMNAVLTDLHVRTGDARWLAVAQRFDHAAVFDPLAANQDKLNGLHANTQV 251
Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
P IG+ Y+ TG Y+ I T +I SH+YA GG S E + P +A L +
Sbjct: 252 PKWIGAAREYKATGTTRYRDIATNAWNITLDSHTYAIGGNSQAEHFRAPHAIAGFLNKDT 311
Query: 398 EETCTTYNMLKVSRHLFRWTKE-IAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLG-- 453
E+C T+NML ++R LF + A DYYERA N ++ Q + G + Y PL
Sbjct: 312 CESCNTFNMLVLTRELFELDPDRAALFDYYERAWLNQMIGQQNPADDHGHVTYFTPLNPG 371
Query: 454 --RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
RGV A W T + +FWCC GTG+E ++L DSIY+ + L + ++ S
Sbjct: 372 GRRGVGPAWGGGTWSTDYGTFWCCQGTGLEMNTRLMDSIYYRRDDT---LIVNLFVPSVL 428
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
W + + Q S T T G ++ +R+P WT GA S+NG
Sbjct: 429 TWPERGITVTQTTSYPNS------DTTTLKVTGNAGGTWAMRIRIPSWT--TGASISVNG 480
Query: 572 QNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ PG++ + + WS D +T++LP+ + A DD P ++ A+ +GP +L+G
Sbjct: 481 VAQTVATTPGSYATLSRAWSSGDTVTVRLPMRIILRA-ADDNP---NVTAVTYGPVVLSG 536
>gi|297203356|ref|ZP_06920753.1| secreted protein [Streptomyces sviceus ATCC 29083]
gi|297148382|gb|EDY55480.2| secreted protein [Streptomyces sviceus ATCC 29083]
Length = 723
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 182/545 (33%), Positives = 263/545 (48%), Gaps = 53/545 (9%)
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENPISELRGHFV 166
WLD Q YL +DVD L+++FR L T G A GGW+ P R H
Sbjct: 17 WLDN-------QNRTQNYLRFVDVDRLLYNFRANHRLSTNGAVATGGWDAPDFPFRTHVQ 69
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE- 220
GH+L+A AQ++A + + ++K + +V L++CQ GYLS +P F + E
Sbjct: 70 GHFLTAWAQLYAVSGDTVCRDKATYMVAELAKCQANNSAAGFSAGYLSGYPESDFTALEQ 129
Query: 221 -ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRV--QKVITMY 273
L PYYTIHK LAGLLD + + QA L +A W V++ R+ Q++ TM
Sbjct: 130 RTLSNGNVPYYTIHKTLAGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRLSGQQMQTM- 187
Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
L E GGMN VL LY T D + L A FD LA D LS HA
Sbjct: 188 --------LQTEFGGMNTVLTDLYQQTGDARWLTAARRFDHAAVFDPLASGQDQLSGLHA 239
Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
NT +P IG+ Y+ TG Y+ I T + +H+YA GG S E + P +A L
Sbjct: 240 NTQVPKWIGAAREYKATGTTRYRDIATNAWNFTVNAHTYAIGGNSQAEHFRAPNAIAGYL 299
Query: 394 GSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLP 451
+ E+C T NML ++R LF A DYYE+A N ++ Q + G + Y P
Sbjct: 300 NKDTCESCNTVNMLTLTRELFALDPNRAALFDYYEQAWLNQMIGQQNPADGHGHVTYFTP 359
Query: 452 LG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
L RGV A W T + +FWCC GTG+E ++L DS+YF + L + ++
Sbjct: 360 LNPGGRRGVGPAWGGGTWSTDYGTFWCCQGTGLEMHTRLMDSLYFRSDDT---LIVNLFV 416
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S +W + + Q S T T V ++ +R+P WT GA
Sbjct: 417 PSVLNWSERGITVTQTTSYPNS------DTTTLQVTGNVSGTWAMRIRIPGWTA--GATI 468
Query: 568 SLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
S+NG + PG++ + T W+ D +T++LP+ + A D+ ++ AI +GP
Sbjct: 469 SVNGTRQDITTTPGSYATLTRSWTSGDTVTVRLPMRVVMRAANDN----PNVAAITYGPV 524
Query: 627 LLAGH 631
+L+G+
Sbjct: 525 VLSGN 529
>gi|408357216|ref|YP_006845747.1| hypothetical protein AXY_18530 [Amphibacillus xylanus NBRC 15112]
gi|407727987|dbj|BAM47985.1| hypothetical protein AXY_18530 [Amphibacillus xylanus NBRC 15112]
Length = 755
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 184/545 (33%), Positives = 268/545 (49%), Gaps = 46/545 (8%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
+ V L+Q + +QQ EYLL LD+D L+ + YGGWE+ E+ G
Sbjct: 1 MDQVQLNQG-MFKESQQKGKEYLLYLDIDRLIAPCYEAVGQEPRAPRYGGWES--MEIAG 57
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFE--- 220
H +GH+LSA++ M+ T + +K K+ + L+ Q GY+S FP + FD
Sbjct: 58 HSIGHWLSAASLMYNVTGDLLLKHKIDYAIDELAHVQAFDPEGYVSGFPRDCFDEVFTGE 117
Query: 221 ------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVI 270
L W P+Y+IHKI AGL+D Y LA N +A +K++ W + +
Sbjct: 118 FRVDNFGLGGSWVPWYSIHKIYAGLVDAYRLASNEKAKTVLVKLSNW--------ADQGL 169
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
+ + E+ L E GGMN+ + +Y IT D + L LA F+ L L D L+
Sbjct: 170 SKLNDEQFQRMLICEFGGMNETMADVYEITGDKRFLKLAERFNHKAVLDPLIEGIDDLAG 229
Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
HANT IP VIG+ Y++TG Y+ + FF D V SYA GG S E +
Sbjct: 230 KHANTQIPKVIGAAKLYDMTGKEEYQKLSRFFWDQVVYHRSYAFGGNSNAEHFGPVD--T 287
Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
+ LG + ETC TYNMLK++ HLF W + Y DYYE AL N +L Q E G+ Y +
Sbjct: 288 EPLGIISTETCNTYNMLKLTEHLFDWQPDSRYMDYYENALYNHILGSQ-DPESGMKSYFI 346
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
P G K + + NSFWCC G+G+E+ ++ +IY + LY+ +I S+
Sbjct: 347 PTEPGHFKV-----YCSPDNSFWCCTGSGMENPARYTKNIYTRK---ADSLYVNLFIPST 398
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
+ Q+ D PY T+ F+ K+ G+ ++ LR P W A +N
Sbjct: 399 LTIAEKDLQFIQETDF-----PY-DETVHFTVKEGNGERLTVYLRKPNWLAGEMA-LQIN 451
Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
G+ + L + +W ND +T QLP+ LRT + D+PE +A +GP LLAG
Sbjct: 452 GEPVALELVNGYYEIDRKWYKNDTVTFQLPMGLRTYTAK-DQPEK---KAFFYGPILLAG 507
Query: 631 HTSGE 635
E
Sbjct: 508 RLGRE 512
>gi|385677991|ref|ZP_10051919.1| hypothetical protein AATC3_18830 [Amycolatopsis sp. ATCC 39116]
Length = 886
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 184/582 (31%), Positives = 288/582 (49%), Gaps = 55/582 (9%)
Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAS 179
+ + YL +D D L+ FR TA LP+ + GGWE P +LRGH GH LS A A+
Sbjct: 54 ERTVAYLRFVDADRLLHMFRVTAGLPSTAEPCGGWEAPDIQLRGHTTGHLLSGLALAAAN 113
Query: 180 THNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPTELFDSFEALKPVWAPYYTIHK 234
T + + K +++V +L+ECQ GYLSAFP F EA K VWAPYYTIHK
Sbjct: 114 TGDTELAAKGASIVAALAECQAAAPAAGFTEGYLSAFPERAFADLEAGKVVWAPYYTIHK 173
Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLY 294
I+AGLLDQY L N QAL + M + R+ + + E L+ E GGMN+ L
Sbjct: 174 IMAGLLDQYRLLGNRQALDVLLGMARWARARMANL----TREAQQKVLHTEFGGMNETLA 229
Query: 295 RLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL 354
L +T D +HL A LFD L+ + D L+ HANT I ++G+ + ++ TG+
Sbjct: 230 SLALVTGDRQHLETAKLFDHDEIFVPLSQRRDTLAGRHANTDIAKIVGAAVEWDATGEEY 289
Query: 355 YKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLF 414
Y+ I T+F D V H+Y GG + EF+ P ++ LG E C +YNMLK+SR LF
Sbjct: 290 YRTIATYFWDQVVHHHTYVIGGNANAEFFGPPDQIVSQLGENTCENCNSYNMLKLSRLLF 349
Query: 415 -RWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARSTHG-------W 465
R Y DY E L N +L Q + G + Y L G + + G +
Sbjct: 350 LRDPSRTDYLDYSEWTLLNQMLGEQDPDSAHGFVTYYTGLVPGAQR-KGKEGVVSDPGTY 408
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
+ + +F C +GTG+E+ K ++IY+ + GL++ Q+I S D+ + L +
Sbjct: 409 SSDYGNFTCDHGTGLETHVKYAENIYYAADD---GLWVNQFIPSEVDYGGVRIRLETE-- 463
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
+D +R+ ++ + +L +R+P W + A+ +NG+ + PG F
Sbjct: 464 --YPYDETVRLHVSGAGA------FALRVRIPSW--ATHARLFVNGEAM-RAEPGRFAVV 512
Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARS 645
RW D + ++LP++++ P+ ++ A+ +GP +LA S
Sbjct: 513 GRRWRDGDVVELRLPMTVQWRPA----PDNPAVHALTYGPLVLAARHGD----------S 558
Query: 646 LSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEF 687
+ A+I + P + +E G + F + ++ + + F
Sbjct: 559 VPAVIPTVDPR------SLRREPGRAEFSVQAGDRRLRLSPF 594
>gi|255936447|ref|XP_002559250.1| Pc13g08250 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583870|emb|CAP91894.1| Pc13g08250 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 627
Score = 256 bits (653), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 185/535 (34%), Positives = 264/535 (49%), Gaps = 49/535 (9%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTP-GKAYGGWENPISELRGHFVGHYLSASAQMW 177
Q L+YL +DVD L++ FR T L T GGW+ P R H GH+LSA AQ +
Sbjct: 58 QDRTLKYLKEIDVDRLLYVFRATHGLSTQQATPNGGWDAPDFPFRSHVQGHFLSAWAQCY 117
Query: 178 ASTHNATIKEKMSTVVFSLSECQ--NK-IG--TGYLSAFPTELFDSFE--ALKPVWAPYY 230
A + T ++ L++CQ NK +G GY+S FP F E L PYY
Sbjct: 118 AVLRDQTCYDRAIYFAAELAKCQANNKAVGFTDGYVSGFPESEFAKLENDTLTNGNVPYY 177
Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
+HK LAGLLD + L ++ + L +A+W V K +S L E
Sbjct: 178 AVHKTLAGLLDIWRLTNDTTSRDILLSLASW--------VDKRTEPFSYAAMQKLLQTEF 229
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGMN+V+ +Y T D + L +A FD LA D L HANT +P IG+ +
Sbjct: 230 GGMNEVMADIYHQTGDERWLTVAQRFDHAVIFDPLAANKDELDGLHANTQVPKWIGAARQ 289
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
Y+ TG+ Y I +I SH+YA GG S E + P +A L ++ E C +YNM
Sbjct: 290 YKATGESRYLDIARNAWEINVKSHTYAIGGNSQAEHFRAPNAIAAYLTNDTCEACNSYNM 349
Query: 407 LKVSRHLFRWTKE-IAYADYYERALTNGVLSIQRGTE-PGVMIYMLPLG----RGVSKAR 460
LK++R L+ + AY D+YE +L N +L Q + G + Y PL RGV A
Sbjct: 350 LKLTRELWLLDSDNSAYFDFYENSLLNHLLGQQDPHDHHGHITYFTPLNAGGRRGVGPAW 409
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
W T ++SFWCC GT +E+ +KL DSIYF + L+I ++SS W + L
Sbjct: 410 GGGTWSTDYDSFWCCQGTALETNTKLMDSIYFYNDST---LFINLFMSSVLKWPEMGITL 466
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLS--SLNLRMPVWTYSNGAQASLNGQNLP--L 576
Q V +SK EV ++N+R+P W S A+ +LNG+ L
Sbjct: 467 KQSTTYPVG----------DTSKLEVSGSGAWTMNIRIPAWASS--AELTLNGEALSDVK 514
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
PG + + W+ D + I+ P++LRT A D+ +S+ AI +GP +L G+
Sbjct: 515 AAPGKYAQISRTWADGDVIEIRFPMTLRTVAANDN----SSMVAIAYGPTVLCGN 565
>gi|393782435|ref|ZP_10370619.1| hypothetical protein HMPREF1071_01487 [Bacteroides salyersiae
CL02T12C01]
gi|392673263|gb|EIY66726.1| hypothetical protein HMPREF1071_01487 [Bacteroides salyersiae
CL02T12C01]
Length = 781
Score = 256 bits (653), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 181/565 (32%), Positives = 278/565 (49%), Gaps = 43/565 (7%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A + + ++L+ L D + F + A Y GWE+ S G GHYLSA + ++
Sbjct: 62 AMEADRKWLMSLQPDRFLHRFHENAGFTPKAPMYDGWED--SSQSGFSFGHYLSAMSMLY 119
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELF-DSFEA----LKPVW 226
A+T + + ++ + + +CQ IGTGY++A P EL D E + W
Sbjct: 120 AATGDNELLGRIEYSINEIRKCQLAIGTGYVAAIPDGDRLWNELVADKIEPGGSWINGFW 179
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL-NEE 285
AP+Y +HK+ +G +D Y+ A +A + ++ ++ + + + W + + E
Sbjct: 180 APWYNLHKLWSGFIDVYLYTGVETAKTVAIELTDWACDKFRDM-----TDDQWQRMISCE 234
Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
TGGMND LY +Y+IT + ++L LA F + L+ Q D L+ HANT IP V G
Sbjct: 235 TGGMNDALYNMYAITGNLRYLQLADKFYHYSVMEPLSQQRDELNGLHANTQIPKVTGIAR 294
Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
YE+ G K I TFF + V H+Y GG S E + P L L + ETC TYN
Sbjct: 295 SYELRGREKDKTIATFFWNTVLKKHTYCIGGNSNYEHFGKPGELF--LSDKTTETCNTYN 352
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
MLK++ HLF W + Y DYYERAL N +L+ Q E G+++Y LPL K ST
Sbjct: 353 MLKLTGHLFAWEPKAEYMDYYERALYNHILASQ-NHETGMVVYSLPLAYASFKEFSTPE- 410
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
+SFWCC GTG E+ K + IY E E + LYI +++S +W+ +++ Q+ +
Sbjct: 411 ----HSFWCCVGTGFENHVKYAEGIYSESEND---LYINLFVASRLNWRRKGMIIEQQTE 463
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLS 584
S L + S Q +L++R P W + G +N + + PG+++S
Sbjct: 464 FPESDKSSLILRCAKS------QTLTLHIRYPQWA-TTGYTIKVNDKIQEIEKKPGSYIS 516
Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTAR 644
W DK+ I++P SL E + D ++ A L GP +LAG + +
Sbjct: 517 LNRLWKDGDKIEIEMPKSLHKEVLPGDEHKF----AFLNGPIVLAGEMDLDERKIVFLEK 572
Query: 645 SLSALISPIPPSFNAQLVTFTQESG 669
S L I PS N +F ++G
Sbjct: 573 KDSELRDWIQPS-NRTKTSFITKTG 596
>gi|150003078|ref|YP_001297822.1| hypothetical protein BVU_0490 [Bacteroides vulgatus ATCC 8482]
gi|149931502|gb|ABR38200.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 783
Score = 256 bits (653), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 170/543 (31%), Positives = 269/543 (49%), Gaps = 47/543 (8%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
+ DV L +S A+ ++ YLL +D D L+ + K A L + Y WEN + L G
Sbjct: 33 VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
H GHYLSA + M+A+T N IK ++ ++ L CQ+ G GYL P +++ E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
L W P Y IHKI AGL D + D+ +A +K+ WM+ +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMI--------R 201
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+++ S E+ L E GG+N+ + +IT D ++L LAH F L L Q D L
Sbjct: 202 LVSKLSDEQIQEMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
+ HANT IP VIG + ++ G+ + +F + V S GG S RE +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+ L SE ETC TYNML++++ L+ + ++ + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG-FV 380
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y P+ G + + SFWCC G+G+E+ ++ G+ IY ++ N LY+ +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S+ W + +++ ++ TL S ++ + + L R+P WT +
Sbjct: 433 PSTLRWG------DTQIEQQTAFPDEEGSTLVISPEKGKKEFTLL-FRIPEWTKPEALRL 485
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
S+NG+ + ++S WS DK+ ++LP+ LR A+ D Y +IL+GP +
Sbjct: 486 SVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541
Query: 628 LAG 630
LA
Sbjct: 542 LAA 544
>gi|357032903|ref|ZP_09094838.1| tat twin-arginine translocation pathway signal sequence domain
protein [Gluconobacter morbifer G707]
gi|356413894|gb|EHH67546.1| tat twin-arginine translocation pathway signal sequence domain
protein [Gluconobacter morbifer G707]
Length = 790
Score = 255 bits (652), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 176/543 (32%), Positives = 275/543 (50%), Gaps = 65/543 (11%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A + N YLL L+ D L+ +FRK A LP G YGGWE+ + GH +GHYLSA A M+
Sbjct: 57 AVERNRIYLLSLEADRLLHNFRKQAGLPPKGALYGGWES--DTIAGHTLGHYLSALALMY 114
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE-----------LFDSFEA----- 221
A T +A +E+++ +V L Q + G GY++ F + +F EA
Sbjct: 115 AQTDDAACRERVAYIVQELVVVQKQWGDGYVAGFTRKEKNGALVDGKRIFAEIEAGDIRS 174
Query: 222 ----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEY---FYNRV-----QKV 269
L W+P Y IHK AGLLD ++ QAL +A + ++ F+ ++ QKV
Sbjct: 175 SGFDLNGAWSPLYNIHKTFAGLLDAHIYCHCDQALNVAVGLGQFLKAFFGKLTDAQMQKV 234
Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAH-LFDKPCFLGFLALQADYL 328
+T E GG+N+ L + T D + L LA+ ++D+P L L + D L
Sbjct: 235 LTC------------EYGGLNESFAELAARTGDEEWLRLAYRIYDRP-VLDPLMEERDDL 281
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
++ HANT IP ++G EV+ + + FF V HSY GG + RE++ +P
Sbjct: 282 ANRHANTQIPKLVGLARIAEVSQNRHWMTGPQFFWKAVTRHHSYVIGGNADREYFSEPDT 341
Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
++ + + E C TYNMLK++R + + A DYYERA N +L+ + G+ Y
Sbjct: 342 ISQHITEQTCEHCNTYNMLKLTRQCYASNPQAALFDYYERAHLNHILAAH-DPQTGMFTY 400
Query: 449 MLP-LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
M P + GV + W T SFWCC GTG+ES +K GDSI+++ E L++ YI
Sbjct: 401 MTPTITAGVRE------WSTPTESFWCCVGTGMESHAKHGDSIWWQREET---LFVNLYI 451
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S W V + K++ D + + L + +L+ LR+P W Q
Sbjct: 452 PSRMVWDRKDV--SWKMETGYPHDGRVSLLLEDLNSPVAFRLA---LRVPGWV-REPIQV 505
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
++NG+++P P ++ +WS D + + LP+++RTE+ DD + + +L GP +
Sbjct: 506 AVNGRDVPATPSDGYIVLDRKWSAGDHVVLDLPMTVRTESPVDD----SKLVTVLRGPMV 561
Query: 628 LAG 630
+A
Sbjct: 562 MAA 564
>gi|325836901|ref|ZP_08166283.1| hypothetical protein HMPREF9402_1694 [Turicibacter sp. HGF1]
gi|325491107|gb|EGC93399.1| hypothetical protein HMPREF9402_1694 [Turicibacter sp. HGF1]
Length = 763
Score = 255 bits (652), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 181/536 (33%), Positives = 278/536 (51%), Gaps = 46/536 (8%)
Query: 107 VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
V L++ S+ +Q +YLL LDV+ L+ + AS P +YGGWE+ E++GH +
Sbjct: 6 VRLEKDSLFEISQAVGKQYLLDLDVERLLAPIYEGASQIPPKPSYGGWES--LEIKGHSI 63
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF---------- 216
GHYLSA A M+ +T + +KE+M ++ + S Q GYL F + F
Sbjct: 64 GHYLSALACMYEATKDLELKERMDYIIETFSLLQR--ADGYLGGFLSTPFEQVFTGEFHV 121
Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
D F +L W P+Y+IHKI AGL+D Y + N +AL + + ++ Y + + S E
Sbjct: 122 DHF-SLSHYWVPWYSIHKIYAGLMDAYQIGKNVEALNILKKLADWAYEGSR----LMSDE 176
Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
+ L E GGMN+V+ LY IT D ++L LA F + + LA D L HANT
Sbjct: 177 QFQRMLICEYGGMNEVMAELYEITQDERYLYLAKRFTQHLIMDPLAAGVDDLQGRHANTQ 236
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
IP V+G+ YEVTGD Y + FF + V SY GG S+ E + + L E
Sbjct: 237 IPKVLGAAKLYEVTGDDYYYRVAKFFFETVVLHRSYVIGGNSSGEHFGPSD--TEPLSRE 294
Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
ETC TYNM+K++++LF+WTK+ Y D+ ERA N +L+ Q G IY G
Sbjct: 295 AAETCNTYNMIKLAKYLFKWTKDSKYIDFIERATYNHILASQ-DPHTGCKIYFTSNYPGH 353
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
K +GTK +SFWCC GTG+E+ + I+F+E+ + Y+ +++SSF +
Sbjct: 354 FKV-----YGTKEDSFWCCTGTGMENPGRYTHHIFFKEDED---FYVNLFMASSFVKEDE 405
Query: 517 HVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQL-SSLNLRMPVWTYSNGAQASLNGQNL 574
+ + + D PI + + L F +E QL ++ +R+P W + + GQ+
Sbjct: 406 QLKVVLQTDFPISN-----VVKLVF---EEANQLFLNVKIRVPYWL-NAPIEVRFKGQSY 456
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
G +L ++ + +D++ I LP+ L E + D P A ++GP +LA
Sbjct: 457 EANGQG-YLMISDTFHADDEIEIVLPMGLH-EYVSMDDPHKV---AFMYGPVVLAA 507
>gi|319640591|ref|ZP_07995310.1| hypothetical protein HMPREF9011_00907 [Bacteroides sp. 3_1_40A]
gi|345517952|ref|ZP_08797412.1| acetyl-CoA carboxylase [Bacteroides sp. 4_3_47FAA]
gi|254835150|gb|EET15459.1| acetyl-CoA carboxylase [Bacteroides sp. 4_3_47FAA]
gi|317387761|gb|EFV68621.1| hypothetical protein HMPREF9011_00907 [Bacteroides sp. 3_1_40A]
Length = 783
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 170/543 (31%), Positives = 269/543 (49%), Gaps = 47/543 (8%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
+ DV L +S A+ ++ YLL +D D L+ + K A L + Y WEN + L G
Sbjct: 33 VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
H GHYLSA + M+A+T N IK ++ ++ L CQ+ G GYL P +++ E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
L W P Y IHKI AGL D + D+ +A +K+ WM+ +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMI--------R 201
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+++ S E+ L E GG+N+ + +IT D ++L LAH F L L Q D L
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
+ HANT IP VIG + ++ G+ + +F + V S GG S RE +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+ L SE ETC TYNML++++ L+ + ++ + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG-FV 380
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y P+ G + + SFWCC G+G+E+ ++ G+ IY ++ N LY+ +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S+ W + +++ ++ TL S ++ + + L R+P WT +
Sbjct: 433 PSTLRWG------DTQIEQQTAFPDEEGSTLVISPEKGKKEFTLL-FRIPEWTKPEALRL 485
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
S+NG+ + ++S WS DK+ ++LP+ LR A+ D Y +IL+GP +
Sbjct: 486 SVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541
Query: 628 LAG 630
LA
Sbjct: 542 LAA 544
>gi|15614440|ref|NP_242743.1| hypothetical protein BH1877 [Bacillus halodurans C-125]
gi|10174495|dbj|BAB05596.1| BH1877 [Bacillus halodurans C-125]
Length = 758
Score = 255 bits (651), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 168/526 (31%), Positives = 270/526 (51%), Gaps = 35/526 (6%)
Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLS 171
+ + +Q+ + +L LD+D L+ + + A+LP ++YGGWE E+RGH +GH+LS
Sbjct: 11 KGLFYNSQKKGNDVILALDIDRLLAPYYEAANLPPKKRSYGGWEE--REIRGHSLGHWLS 68
Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQNKIG--TGYLSAFPTELFD-SFEA----LKP 224
A+A M+ +T + + E++ V L+ Q+ +G G A E+F F+ +
Sbjct: 69 AAAAMYETTGDKALLERIDRAVQELATIQDDVGYVGGVKRAHFDEMFSGEFQVGHFNIAG 128
Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
W P+Y +HK+ AGL+D + L ++ AL + T + ++ +K + ++ L
Sbjct: 129 TWVPWYNLHKLFAGLIDVHQLTGHSLALTVVTKLADW----AKKGTDQLTDDQFQRMLIC 184
Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
E GGMN+ + LY++T +L LA F L LA D L HANT IP VIG+
Sbjct: 185 EHGGMNEAMADLYTLTGHKDYLQLAIRFCHWAVLEPLANGIDELEGKHANTQIPKVIGAA 244
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
+E+TGD Y+ I FF V SY GG S E + + +TLG E ETC TY
Sbjct: 245 KLFEITGDDTYRAIAEFFWRQVTNDRSYIIGGNSNSEHFGPANK--ETLGVETAETCNTY 302
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
NMLK++ HLFRW + DYYE+AL N +L+ Q + G+ Y + L G K S+
Sbjct: 303 NMLKLTEHLFRWNRSSQLMDYYEKALYNHILASQ-DPDSGMKTYFVSLQPGHFKVYSSLE 361
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
SFWCC+GTG+E+ ++ +IY ++ ++ Y+ +++S K V + Q+
Sbjct: 362 -----ESFWCCFGTGLENPARYTRTIYDRDDRHI---YVNLFMASEIHLKDLQVQIRQET 413
Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLS 584
+ + R LTF V L++R+P W + A +NG+ ++L+
Sbjct: 414 N----FPETDRTKLTFVKADGVS--IKLHIRVPEWV-AGPVTARINGKETFSESGADYLT 466
Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
W D++ + LP+ LR +DD + I++GP +LAG
Sbjct: 467 IEREWQKGDEIEVHLPMELRIYEAKDDSHKV----GIMYGPIVLAG 508
>gi|423313734|ref|ZP_17291670.1| hypothetical protein HMPREF1058_02282 [Bacteroides vulgatus
CL09T03C04]
gi|392684669|gb|EIY77993.1| hypothetical protein HMPREF1058_02282 [Bacteroides vulgatus
CL09T03C04]
Length = 783
Score = 255 bits (651), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 170/543 (31%), Positives = 269/543 (49%), Gaps = 47/543 (8%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
+ DV L +S A+ ++ YLL +D D L+ + K A L + Y WEN + L G
Sbjct: 33 VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
H GHYLSA + M+A+T N IK ++ ++ L CQ+ G GYL P +++ E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
L W P Y IHKI AGL D + D+ +A +K+ WM+ +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMI--------R 201
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+++ S E+ L E GG+N+ + +IT D ++L LAH F L L Q D L
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
+ HANT IP VIG + ++ G+ + +F + V S GG S RE +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+ L SE ETC TYNML++++ L+ + ++ + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG-FV 380
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y P+ G + + SFWCC G+G+E+ ++ G+ IY ++ N LY+ +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S+ W + +++ ++ TL S ++ + + L R+P WT +
Sbjct: 433 PSTLRWG------DTQIEQQTAFPDEEGSTLVISPEKGKKEFTLL-FRIPEWTKPEALRL 485
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
S+NG+ + ++S WS DK+ ++LP+ LR A+ D Y +IL+GP +
Sbjct: 486 SVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541
Query: 628 LAG 630
LA
Sbjct: 542 LAA 544
>gi|333380462|ref|ZP_08472153.1| hypothetical protein HMPREF9455_00319 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826457|gb|EGJ99286.1| hypothetical protein HMPREF9455_00319 [Dysgonomonas gadei ATCC
BAA-286]
Length = 790
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 177/547 (32%), Positives = 271/547 (49%), Gaps = 41/547 (7%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
SL DV L S A+ + +YLL L D L+ F + + L ++Y WEN + L
Sbjct: 29 SLKDVRL-LDSPFKHAEDLDKQYLLELKADRLLSPFLRESGLTPKAESYTNWEN--TGLD 85
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELF 216
GH GHYLSA + M+AST + IKE++ +V L CQ+ GY+ P E+
Sbjct: 86 GHIGGHYLSALSLMYASTGDKQIKERLDYMVSELKRCQDANDNGYIGGVPGGKAIWEEVA 145
Query: 217 DS------FEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
+ F+ L W P Y IHK AGL D Y+ A++ A +M M ++ N V K+
Sbjct: 146 NGNIRAGGFD-LNGKWVPLYNIHKTYAGLRDAYLYANSDMAKEMLIKMTDWAINLVSKL- 203
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
S E+ L E GG+N+ + +IT D K+L LAH F L L D L+
Sbjct: 204 ---SEEQIQDMLRSEHGGLNETFADVAAITGDKKYLKLAHQFSHQLVLNPLLNHEDKLTG 260
Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
HANT IP V+G + +V G+ + FF + V S + GG S E + +
Sbjct: 261 MHANTQIPKVLGFKRIADVEGNESWSEASRFFWETVVEHRSVSIGGNSVGEHFNPTNDFS 320
Query: 391 DTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
+ S E ETC TYNML++S+ L++ +++ Y DYYERAL N +LS Q E G +Y
Sbjct: 321 RVIKSIEGPETCNTYNMLRLSKMLYQTSQDEKYMDYYERALYNHILSTQ-NPEQGGFVYF 379
Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
+ G + + SFWCC G+GIE+ +K G+ IY + LY+ +I S
Sbjct: 380 TQMRPGHYRV-----YSQPQTSFWCCVGSGIENHAKYGEMIYAHTDNE---LYVNLFIPS 431
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
+WK + Q+ S+ + L + ++ +L LR PVW G + S+
Sbjct: 432 RLNWKEKKTEIIQE----NSFPDEAKTQLIINPEKTAA--FTLKLRYPVWVKKWGLKVSV 485
Query: 570 NGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
NG++ P+ P +++S +W DK+ +++P+ + E + P+ ++ +I +GP L
Sbjct: 486 NGKDYPVSQDPASYISIDRKWKKGDKVVVEMPMRITVEQL----PDKSNYYSIFYGPVTL 541
Query: 629 AGHTSGE 635
A T E
Sbjct: 542 AAKTGTE 548
>gi|445497812|ref|ZP_21464667.1| hypothetical protein Jab_2c14210 [Janthinobacterium sp. HH01]
gi|444787807|gb|ELX09355.1| hypothetical protein Jab_2c14210 [Janthinobacterium sp. HH01]
Length = 789
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 178/548 (32%), Positives = 266/548 (48%), Gaps = 41/548 (7%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L+ L DV L S L AQ+T+L YLL ++ D L+ F + A LP +YG WE+
Sbjct: 29 LQLFPLADVRLGDSPFL-EAQRTDLHYLLEMEPDRLLAPFLREAGLPPKQPSYGNWES-- 85
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
+ L GH GHYLSA A M+AST + + +++ V L CQ + G GY+ P
Sbjct: 86 TGLDGHLGGHYLSALALMYASTGDEEVLRRLNYFVAELKRCQERNGNGYIGGIPDGSAAW 145
Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
EL ++ W P+Y +HK+ AGL D Y A NA A M M ++
Sbjct: 146 QAIARGELHVDNFSVNGKWVPWYNLHKVYAGLRDAYAYAGNADARAMLVSMSDW----AL 201
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
++ + S E+ L E GGMN+VL + +T K++ LA F L L D
Sbjct: 202 ELTSHLSEEQMQAMLRSEHGGMNEVLADVAQMTGQKKYMDLAVRFSHQAILRPLEEGKDQ 261
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L+ HANT IP VIG + ++TG ++ FF V + A GG S +E + D +
Sbjct: 262 LTGLHANTQIPKVIGFKHIGDMTGRRDWQQAAQFFWQTVRDHRTVAIGGNSVKEHFHDDR 321
Query: 388 RLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
+ E ETC TYNMLK++ LF + +Y DYYERAL N +LS QR + G
Sbjct: 322 DFLPMVDEVEGPETCNTYNMLKLTELLFLGDAKGSYTDYYERALYNHILSSQR-PDSGGF 380
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
+Y P+ + + + WCC G+GIES +K G+ IY LY+ +
Sbjct: 381 VYFTPM-----RPNHYRVYSQVDKAMWCCVGSGIESHAKYGEFIYAHRGDQ---LYVNLF 432
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
I S+ +W+S V + Q + D R T+T + ++ +R P W +
Sbjct: 433 IPSTLNWRSQGVTITQ-ANRFPDED---RSTITVQGSKAF----TMKIRYPEWVARGALR 484
Query: 567 ASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
++NG+ +P + ++S W DK+ IQLP+ E + P+ ++ A+L GP
Sbjct: 485 ITVNGKPVPADAGADRYVSLRRIWRDGDKVDIQLPMKTHLEQM----PDKSNYYAVLHGP 540
Query: 626 YLLAGHTS 633
+LA T+
Sbjct: 541 IVLAAKTN 548
>gi|402300545|ref|ZP_10820034.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus alcalophilus
ATCC 27647]
gi|401724312|gb|EJS97686.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus alcalophilus
ATCC 27647]
Length = 761
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 169/519 (32%), Positives = 272/519 (52%), Gaps = 41/519 (7%)
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
++YLL LD+D LV F + ASL + YGGWE + + GH +GH+LSA+A M+ +T N
Sbjct: 19 MDYLLFLDIDRLVAPFYEAASLAPKKQRYGGWEE--TGISGHSLGHWLSAAAYMYRNTMN 76
Query: 183 ATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD-----SFEA----LKPVWAPYYTIH 233
+K+K++ + L Q+ ++ FP+ F+ +FE L W P+Y++H
Sbjct: 77 RALKDKINKAIDELEYIQSVHDRNFIGGFPSTCFEKVFTGNFEVDHFTLAGHWVPWYSMH 136
Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVL 293
K+ AGL+D Y L N +AL + T + ++ V+ + + L E GGMNDV+
Sbjct: 137 KLFAGLIDVYKLVKNEKALSVVTKLADW----VESGTVRLTEAQFQKMLICEHGGMNDVM 192
Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
LY +T + +L LA F + L L+ + D L HANT IP VIG+ Y++T +
Sbjct: 193 AELYLLTQNQTYLQLAIRFCEQQILEPLSNRRDLLEGKHANTQIPKVIGAAKLYDITKEE 252
Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD-TLGSENEETCTTYNMLKVSRH 412
YK TFF V SY GG S E + R++D TLG + ETC TYNMLK++ H
Sbjct: 253 KYKTAATFFWQEVTRVRSYIIGGNSINEHF---GRVSDETLGVQTTETCNTYNMLKLTAH 309
Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
LF W ++ Y D+YERAL N +L+ Q + G+ Y + G K + + +SF
Sbjct: 310 LFLWEQKSEYYDFYERALYNHILASQ-DPDSGMKAYFVSTEPGHFKV-----YHSPEDSF 363
Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
WCC GTG+E+ ++ + IY++ + L++ +I+S + + L + D S
Sbjct: 364 WCCTGTGMENPTRYSEHIYYQRDDE---LFVNLFIASQLQLEEKELRLKLETDFPHSGRV 420
Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS-LNGQNLPLPPPGNFLSATERWSY 591
L++ ++ G+ S++LR+P W NG + +N + L +++ + RW
Sbjct: 421 QLKV------EEGDGRFLSIHLRIPYWI--NGKVSIFVNKKQTFLTDKKGYVTLSRRWKA 472
Query: 592 NDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
D++ + PL L + +DD + ++GP +LAG
Sbjct: 473 GDRVEVDFPLGLHSYIAKDD----PNKVGFMYGPIVLAG 507
>gi|373955475|ref|ZP_09615435.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373892075|gb|EHQ27972.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 782
Score = 254 bits (649), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 171/545 (31%), Positives = 272/545 (49%), Gaps = 44/545 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
LK L +V L + A+ +L+Y++ L D L+ + + A L ++Y WEN
Sbjct: 24 LKTFRLQEVKL-LPGIFNDAENADLKYMMQLSPDKLLAPYLREAGLKPKAESYTNWEN-- 80
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
S L GH GHYLSA A M+AST + ++++ ++ L CQ+K G GY+ P EL+
Sbjct: 81 SGLDGHIGGHYLSALAMMYASTGDKQALDRLNYMIAELKICQDKNGNGYVGGVPGSKELW 140
Query: 217 DSF-----EALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
+ A+ W P+Y IHK AGL D Y A N A +K A W V
Sbjct: 141 AAVMQGDVGAINKKWVPFYNIHKTFAGLRDAYTYAGNETAKVMLIKFADWFV-------- 192
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ T + ++ L E GG+N+VL +Y++T D K+L A+ F L L D
Sbjct: 193 MIATSITPQKMQEMLKTEHGGVNEVLADVYALTGDKKYLTAAYSFSHQAILEPLEQGQDK 252
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L++ HANT IP VIG + +VT D Y FF V + A GG S RE +
Sbjct: 253 LNNLHANTQIPKVIGFKRISDVTADSNYNKAAQFFWQTVVQHRTVAIGGNSVREHFNPSN 312
Query: 388 RLADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
+ + +E ETC TYNMLK++ L+ ++Y DYYERAL N +LS +R G
Sbjct: 313 DFSSMITTEQGPETCNTYNMLKLTEDLYLSDPRVSYIDYYERALYNHILSTER--PGGGF 370
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
+Y P+ G + + S WCC G+G+E+ +K G+ IY ++ NV ++ +
Sbjct: 371 VYFTPMRPGHYRV-----YSQPQTSMWCCVGSGMENHAKYGEMIYAHDQNNV---FVNLF 422
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
I S+ +WK +VL Q + + + ++T ++ + G ++N+R P W ++ +
Sbjct: 423 IPSTLNWKQKGLVLTQHTN----FPEEEKTSITINAVRP-GAF-AINIRYPSWVHTGALK 476
Query: 567 ASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
++NG + + + ++S W D + + LP+ TE + P+ + +A+L GP
Sbjct: 477 VTVNGTPIKVSAKSSAYVSINRVWKKGDVIGVTLPMQTTTEQL----PDGLNYEAVLHGP 532
Query: 626 YLLAG 630
+LA
Sbjct: 533 IVLAA 537
>gi|294775898|ref|ZP_06741397.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|294450267|gb|EFG18768.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 783
Score = 254 bits (649), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 172/543 (31%), Positives = 267/543 (49%), Gaps = 47/543 (8%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
+ DV L +S A+ ++ YLL +D D L+ + K A L + Y WEN + L G
Sbjct: 33 VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
H GHYLSA + M+A+T N IK ++ ++ L CQ+ G GYL P +++ E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIED 149
Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
L W P Y IHKI AGL D + N +A +K+ WM+ +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTGNKEAKEMLVKLTDWMI--------R 201
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+++ S E+ L E GG+N+ + +IT D ++L LAH F L L Q D L
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
+ HANT IP VIG + ++ G+ + +F + V S GG S RE +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+ L SE ETC TYNML++++ L+ + + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHFMDYYERALYNHILSTQDPVQGG-FV 380
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y P+ G + + SFWCC G+G+E+ ++ G+ IY ++ N LY+ +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S+ W G + + Q+ ++ TL S ++ + + L R+P WT
Sbjct: 433 PSTLRW--GDIQIEQQ----TAFPDEEETTLVISPEKGKKEFTLL-FRIPEWTKPEALCL 485
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
S+NG+ + ++S WS DK+ ++LP+ LR A+ D Y +IL+GP +
Sbjct: 486 SVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541
Query: 628 LAG 630
LA
Sbjct: 542 LAA 544
>gi|452750721|ref|ZP_21950468.1| Putative glycosyl hydrolase of unknown function (DUF1680) [alpha
proteobacterium JLT2015]
gi|451961915|gb|EMD84324.1| Putative glycosyl hydrolase of unknown function (DUF1680) [alpha
proteobacterium JLT2015]
Length = 744
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 181/577 (31%), Positives = 270/577 (46%), Gaps = 51/577 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A + N EYL+ LD D L+ ++R +A L G YGGWE+ + GH +GHYLSA A
Sbjct: 9 AVERNREYLMSLDPDRLLHNYRTSAGLAPKGDVYGGWES--DTIAGHTLGHYLSALALTH 66
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----ELFDSFEA----------- 221
A T + + + +V L+ Q G GY++ F E+ D E
Sbjct: 67 AQTGDEESCRRANYIVGELATVQAAHGDGYVAGFTRKRPDGEIVDGKEIFPEIMAGDIRS 126
Query: 222 ----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
L W P Y HK+ GL D L N AL +A + +Y + ++ E+
Sbjct: 127 AGFDLNGCWVPLYNWHKLYTGLYDVADLCGNRTALPIAVALGDY----IDRMFAALDDEQ 182
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
L E GG+N+ LY+ T + + L L L L D L++FHANT +
Sbjct: 183 VQTVLACEYGGLNESFAELYARTGERRWLRLGERIYDNKVLDPLTRGEDRLANFHANTQV 242
Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
P +IG YE+T P FF D V HSY GG + RE++ +P ++ + +
Sbjct: 243 PKLIGLARLYELTSKPAQGAAAEFFWDTVTKRHSYVIGGNADREYFSEPNSISKHITEQT 302
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
E C +YNMLK++RHL+ W A D+YERA N +LS Q+ E G YM PL G +
Sbjct: 303 CEHCNSYNMLKLTRHLYSWRPRSALFDFYERAHLNHILS-QQHPETGGFSYMTPLMSGTA 361
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KS 515
+ S G ++FWCC GTG+ES +K GDSI+++ + L + YI ++ +W +
Sbjct: 362 REYSEPG----KDAFWCCVGTGMESHAKHGDSIFWQGDD---ALIVNLYIPAAANWRPRG 414
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
V L + S LTF+ + G+ + LR+P W S +NG+ +
Sbjct: 415 ASVRLETRYPEEGS------ANLTFTELAKPGRF-PVALRVPAWAES--VDVRVNGKAVA 465
Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+++ + RW D+L I +P+ LR E DD + A+L GP +LA
Sbjct: 466 AKVEDGYVTVSRRWQAGDRLAIAMPMRLRIEPTADD----PDMIALLRGPMVLAADLGPA 521
Query: 636 WDIKTGTARSL--SALISPIPPSFNAQLVTFTQESGN 670
+ G A +L S L++ P + TQ G
Sbjct: 522 EEEFDGAAPALVGSDLLAKFVPEAGSATAFATQGIGR 558
>gi|404254065|ref|ZP_10958033.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas sp. PAMC
26621]
Length = 646
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 192/614 (31%), Positives = 287/614 (46%), Gaps = 59/614 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE-NP 157
L+ L DV L + L AQ+ YLL LD D ++ +FR A L YGGWE +P
Sbjct: 46 LQPFDLADVDLGEGPFL-HAQRKTEAYLLSLDPDRMLHAFRVNAGLKPKAAVYGGWESDP 104
Query: 158 I---SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
I +GH +GHYLSA A + ST ++++ + L+ CQ+ +G + AFP
Sbjct: 105 IWADINCQGHTLGHYLSACALAYRSTRKPAFRQRIDHIARELAACQDAAKSGLVCAFPKG 164
Query: 215 ---LFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
+ P+YT+HK+ AGL D +LAD+A++ L++A W V
Sbjct: 165 PALVAAHLRGDAITGVPWYTLHKVFAGLRDATLLADSAESRAVLLRLADWAV-------- 216
Query: 268 KVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
V T + + ++ E E GGMN+V LY +T +P + +A F L LA D
Sbjct: 217 -VATRPLSDAQFETMLETEHGGMNEVFADLYLMTGNPDYRTMAERFSHKALLTPLAAGRD 275
Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
L HANT +P ++G Q +E TG P Y FF V + S+ATGG E ++
Sbjct: 276 QLDGLHANTQLPKIVGFQRVFEATGTPHYHEAAAFFWRTVALTRSFATGGHGDNEHFFPM 335
Query: 387 KRL-ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
++ ETC +NMLK++R LF + YADYYER L NG+L+ Q + G+
Sbjct: 336 AEFDKHVFSAKGSETCGQHNMLKLTRALFLQDPQAEYADYYERTLYNGILASQ-DPDTGM 394
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
+ Y G K + T +SFWCC GTG+E+ K DSIYF ++ LY+
Sbjct: 395 VTYFQGARPGYMKL-----YHTPEHSFWCCTGTGMENHVKYRDSIYFHDD---KALYVNL 446
Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--- 562
++ S+ W+ V L Q+ P + T +V +L LR P W+ S
Sbjct: 447 FVPSAVRWREKGVALRQETR--FPDAPTTTLHWTVERPTDV----TLQLRHPRWSRSAIV 500
Query: 563 --NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
NG +A+ + PG+++ W D T++L L++ E + D P I A
Sbjct: 501 LVNGVEAARSDT------PGSYVKLARTWHSGD--TVELRLAM--EVVPDQAPAAPDIVA 550
Query: 621 ILFGPYLLAGHTSGEWDIKTGTARSLSALISPIP-PSFNAQLVTFTQESGNSTFVMSNSN 679
+GP +LAG E G A +++ +NA LVT GN + +
Sbjct: 551 FSYGPMVLAGVLGRE-----GLAPGADVIVNERKYGEYNAGLVTVPTLVGNPATLAAQVR 605
Query: 680 QSITMEEFPVSGTD 693
++ EF + D
Sbjct: 606 KADGPLEFTIPAAD 619
>gi|336425130|ref|ZP_08605160.1| hypothetical protein HMPREF0994_01166 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013039|gb|EGN42928.1| hypothetical protein HMPREF0994_01166 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 628
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 175/549 (31%), Positives = 272/549 (49%), Gaps = 62/549 (11%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPGKA----YGGWENPISELRGHFVGHYLSASAQMWAST 180
Y++ L+ L+ +F + T +A +GGWE P +LRGHF+GH+LSA+A + +T
Sbjct: 32 YMMHLENRFLLLNFNLESGRDTSAEAIEGMHGGWEFPTCQLRGHFLGHWLSAAAMHYHAT 91
Query: 181 HNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLL 240
+ +K K T+V L+ECQ + G + + P + K VWAP+YTIHK+ GLL
Sbjct: 92 GDRELKAKADTLVEELAECQKENGGKWAAPIPEKYLYRIAEGKQVWAPHYTIHKVFMGLL 151
Query: 241 DQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSIT 300
D Y A NA AL++A ++FY+ + +S + L+ ETGGM ++ +LY+IT
Sbjct: 152 DMYEYAGNAIALEIAENFADWFYDWTKD----FSRDEMDDILDFETGGMLEIWVQLYAIT 207
Query: 301 HDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGT 360
K+ L + + L D L++ HANT IP +IG Y+VTGD ++ I
Sbjct: 208 GKDKYAALMERYYRGRLFDPLLKGEDVLTNMHANTTIPEIIGCARAYDVTGDEKWRKIAE 267
Query: 361 FFMDI-VNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
+ D+ V YATGG + E W K+L LG + +E CT YNM++++ LFRW+ +
Sbjct: 268 NYWDLAVTQRGQYATGGQTCGEIWSPKKKLGARLGLKGQEHCTVYNMIRLAGFLFRWSLD 327
Query: 420 IAYADYYERALTNGVLS-------IQRG-TEP----GVMIYMLPLGRGVSKARSTHGWGT 467
AY DY E+ L NG+++ + G T P G++ Y LP+ G K GW +
Sbjct: 328 PAYLDYQEKLLYNGLMAQAYWQSNLSHGFTSPYPSKGLLTYFLPMQAGGRK-----GWSS 382
Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS--SFDWKSGHVVLNQKVD 525
K F+CC+GT +++ + IY++ E + LYI QY+ S SF V + QK D
Sbjct: 383 KTGDFFCCHGTLVQANAAFNRGIYYQSEDS---LYICQYLDSQVSFSVNDSRVTILQKAD 439
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLS------------------------SLNLRMPVWTY 561
P+ + T S++Q V + + +L LR+P W
Sbjct: 440 PLTGSS---HLASTSSARQSVLEDTRKYPSQPDCLVPCLKMELEKETEMTLQLRIPGWLA 496
Query: 562 SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
+ + F+ W D + I LP +++T + PE + A
Sbjct: 497 GEAVILINDTEVYRSNDSCLFVPLKRVWKDGDIIRILLPKAVKTFPL----PEDENTVAF 552
Query: 622 LFGPYLLAG 630
L+GP +LAG
Sbjct: 553 LYGPVVLAG 561
>gi|302872476|ref|YP_003841112.1| hypothetical protein COB47_1852 [Caldicellulosiruptor obsidiansis
OB47]
gi|302575335|gb|ADL43126.1| protein of unknown function DUF1680 [Caldicellulosiruptor
obsidiansis OB47]
Length = 587
Score = 253 bits (647), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 172/544 (31%), Positives = 271/544 (49%), Gaps = 46/544 (8%)
Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH 181
NL L+ + WSF P +GGWE+P +LRGHF+GH+LSA+A+++AS
Sbjct: 39 NLLQNFYLESGIMSWSF-------LPQDIHGGWESPTCQLRGHFLGHWLSAAARIYASFG 91
Query: 182 NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLD 241
+ IK K +V L CQ + G ++ + P + F+ K VWAP+YT+HK GL+D
Sbjct: 92 DEEIKGKADYIVDELERCQKENGGEWVGSIPEKYFEWMARGKWVWAPHYTVHKTFMGLVD 151
Query: 242 QYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITH 301
Y N +AL++A +FY + +S E+ L+ ETGGM ++ LY+IT
Sbjct: 152 MYKYTSNQKALEIADRWANWFY----RWSGQFSREKMDDILDYETGGMLEIWAELYNITK 207
Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY-KLIGT 360
D K+ L + + L D L+ HANT IP + G+ +EVTG+ + K++ +
Sbjct: 208 DSKYKELMERYYRGRLFDRLLNGEDVLTGRHANTTIPEIHGAARVWEVTGEEKFRKIVES 267
Query: 361 FFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEI 420
++ + V + TGG + E W R+ + LG N+E C YNM++++ LFRWT +
Sbjct: 268 YWREAVEERGYFCTGGQTLGEVWTPKHRIRNYLGPTNQEHCVVYNMIRLAEFLFRWTGDK 327
Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
Y+DY ER + NG+ + QR + G++ Y LPL G K WGT N FWCC+GT +
Sbjct: 328 KYSDYIERNIYNGLFAQQR-LKDGMVTYFLPLMPGSQKR-----WGTPTNDFWCCHGTLV 381
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
++ + D IY++ G+ I Q+I S WK + K + I Y R +F
Sbjct: 382 QAHTIYNDIIYYKTPN---GVVISQFIPSFVTWK------DDKGNGITIKQYYGRRQESF 432
Query: 541 SSKQEVGQLS-----------SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERW 589
+ E ++ L +R P W + + ++N +++ T RW
Sbjct: 433 AYTAEKDEICIEVQCKDPIEFELAIRKPWW--AKKIEVAVNEDLNYGVDDSSYIKLTRRW 490
Query: 590 SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSAL 649
+ +DK+ I ++ T + DD P+ A + GP +LAG I R + +
Sbjct: 491 N-SDKIKITFYKTVETCPMPDD-PQQV---AFMVGPVVLAGLCERRRKIYI-NGRKIEEV 544
Query: 650 ISPI 653
I PI
Sbjct: 545 IVPI 548
>gi|302422424|ref|XP_003009042.1| secreted protein [Verticillium albo-atrum VaMs.102]
gi|261352188|gb|EEY14616.1| secreted protein [Verticillium albo-atrum VaMs.102]
Length = 635
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 181/532 (34%), Positives = 263/532 (49%), Gaps = 43/532 (8%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
Q L Y+ +DVD L++ FR+T LP G + GGW+ P R HF GH+L+A + W
Sbjct: 65 QDRTLNYIKFVDVDRLLYVFRQTHGLPLQGAQPNGGWDAPDFPFRSHFQGHFLNAWSYCW 124
Query: 178 ASTHNATIKEKMSTVVFSLSECQ---NKIG--TGYLSAFPTELFDSFE--ALKPVWAPYY 230
A + +++ S L++CQ +K G GYLS FP ++ E L PYY
Sbjct: 125 AVLRDEACRDRASYFATELAKCQGNNDKAGFNPGYLSGFPESEIEAVEKRTLSNGNVPYY 184
Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
+IHK +AGLLD + + A + M + R K+ S + ++ E GGMN
Sbjct: 185 SIHKTMAGLLDVWRHIGDETARDVLLGMAGWVDLRTGKL----SYSQMQTMMSTEFGGMN 240
Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
+V+ ++ T D + L +A FD LA D L+ HANT +P IG+ Y+ T
Sbjct: 241 EVMADIFHQTGDERWLTVAQRFDHASVFDPLAGNRDSLNGLHANTQVPKWIGAAREYKAT 300
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
G Y I +I +H+YA G S E + P +A L + E C TYNMLK++
Sbjct: 301 GTTRYSDIAHNAWNITVQAHTYAIGANSQSEHFRPPNAIASYLDEDTAEACNTYNMLKLT 360
Query: 411 RHLFRWTKEIA---YADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKARST 462
R L W + + Y D+YE+AL N + Q + G + Y L RGV A
Sbjct: 361 REL--WVMDPSNSKYFDFYEQALINHAIGQQDPSSAHGHVTYFTSLNPGGHRGVGPAWGG 418
Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
W T + + WCC GT +E+ +KL DSIYF +E + LY+ Y S +W V + Q
Sbjct: 419 GTWSTDYGTAWCCQGTALETNTKLMDSIYFYDESS---LYVNLYAPSRLNWTQRKVTVLQ 475
Query: 523 KVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP--LPPP 579
+ D P L+ T T + K G L LR+P+W S GA ++NGQ L P
Sbjct: 476 ETDFP-------LQETSTLTVKG--GGDWDLRLRIPIW--SKGATIAINGQALDGVETVP 524
Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
G + + W D +TI LP++L T + DD P S+ A+ +GP +LA +
Sbjct: 525 GTYATIKRSWGEEDIVTITLPMALHTISA-DDEP---SVAALAYGPVVLAAN 572
>gi|16126789|ref|NP_421353.1| hypothetical protein CC_2550 [Caulobacter crescentus CB15]
gi|221235569|ref|YP_002518006.1| membrane-bound glycosyl hydrolase [Caulobacter crescentus NA1000]
gi|13424115|gb|AAK24521.1| conserved hypothetical protein [Caulobacter crescentus CB15]
gi|220964742|gb|ACL96098.1| membrane-bound glycosyl hydrolase [Caulobacter crescentus NA1000]
Length = 786
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 172/539 (31%), Positives = 260/539 (48%), Gaps = 47/539 (8%)
Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYL 170
+ S+ +AQ N YL+ L D L+ +F A LP YGGWE + GH +GHYL
Sbjct: 57 KPSIFAQAQGANRAYLVSLQPDRLLHNFHLGAGLPVKAPVYGGWE--AQSIAGHTLGHYL 114
Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLS-------AFPTELFDSFEALK 223
SA A A+ + + ++++ V L+ Q G GY+ A P FE L+
Sbjct: 115 SACALQVANDGDPVLSQRLAYTVAQLARVQAAHGDGYVGGTTRWGQADPVGGKAVFEELR 174
Query: 224 ------------PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
W P YT HKI AGLLD + LA AL +A + Y ++
Sbjct: 175 RGDIRANRFSLNDGWVPIYTWHKIHAGLLDAHRLAATPGALDVALGLAGYLAT----ILE 230
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
+ ++ L E GG+ + Y++T DP+ L +A + LA D L+
Sbjct: 231 GLNDDQVQAILVAEHGGLCEAYAETYALTGDPRWLNIARRLRHRELVDPLAQGRDELAGL 290
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
HANT IP +IG YEV GDP FF V HSYA GG S RE + P +A
Sbjct: 291 HANTQIPKIIGLARLYEVAGDPAEARTARFFHQTVTRRHSYAIGGNSDREHFGPPDAIAT 350
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
L E C +YNMLK++R L+ W + A D YERA N +++ QR ++ G+ +Y +P
Sbjct: 351 RLSETTCEACNSYNMLKLTRRLWSWAPDGALFDDYERAQLNHIMAHQRPSD-GMFVYFMP 409
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
+ G ++ S T +SFWCC G+G+ES +K DSI++ LY+ +I+S
Sbjct: 410 MAAGGRRSYS-----TPEDSFWCCVGSGMESHAKHADSIWWRGGQT---LYLNLFIASRL 461
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
D ++ +D + +T+T + + L + LR+P W + + S+NG
Sbjct: 462 DLPGDDFAID--LDTAFPQSGQVDLTVTRAPR----GLREIALRLPAWCAA--PRLSVNG 513
Query: 572 QNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
P+ G+ + + RW D++T+ LP+++R E DD ++ A L GP +LA
Sbjct: 514 APTPIQTRGDGYARLSRRWKAGDRVTLMLPMAVRAEPTPDD----PNLVAFLSGPLVLA 568
>gi|312135764|ref|YP_004003102.1| hypothetical protein Calow_1766 [Caldicellulosiruptor owensensis
OL]
gi|311775815|gb|ADQ05302.1| protein of unknown function DUF1680 [Caldicellulosiruptor
owensensis OL]
Length = 587
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 168/550 (30%), Positives = 279/550 (50%), Gaps = 43/550 (7%)
Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPT----PGKAYGGWENPISELRGHFVGHYLSASAQ 175
+ N Y+L L ++L+ +F + + + P +GGWE+P +LRGHF+GH+LSA+A+
Sbjct: 26 KLNRSYMLSLKTENLLQNFYLESGIMSWSFLPQDIHGGWESPTCQLRGHFLGHWLSAAAR 85
Query: 176 MWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKI 235
++A+ + IK K +V L CQ + G ++ + P + F+ K VWAP+YT+HK
Sbjct: 86 IYANFGDEEIKGKADYIVDELERCQKENGGEWVGSIPEKYFEWMARGKWVWAPHYTVHKT 145
Query: 236 LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYR 295
GL+D Y N +AL++ +FY + +S E+ L+ ETGGM ++
Sbjct: 146 FMGLVDMYKYTSNQKALEIVDRWANWFY----RWSGQFSREKMDDILDYETGGMLEIWAE 201
Query: 296 LYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLY 355
LY+IT D K+ L + + L D L+ HANT IP + G+ +EVTG+ +
Sbjct: 202 LYNITKDIKYRDLMERYYRGRLFDRLLNGEDVLTGRHANTTIPEIHGAARVWEVTGEEKF 261
Query: 356 -KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLF 414
K++ +++ + V + TGG + E W +++ + LG N+E C YNM++++ LF
Sbjct: 262 RKIVESYWREAVEERGYFCTGGQTLGEVWTPKQKIKNYLGPTNQEHCVVYNMIRLAEFLF 321
Query: 415 RWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC 474
RWT + Y+DY ER + NG+ + QR + G++ Y LPL G K WGT N FWC
Sbjct: 322 RWTGDKRYSDYIERNIYNGLFAQQR-LKDGMVTYFLPLMPGSQKR-----WGTPTNDFWC 375
Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
C+GT +++ + D IY++ + G+ I Q+I S WK + K + I Y
Sbjct: 376 CHGTLVQAHTIYNDIIYYKGQN---GIVISQFIPSFVTWK------DDKGNDITIKQYYG 426
Query: 535 RMTLTFSSKQEVGQLS-----------SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
R +F+ + ++ L +R P W + ++N +++
Sbjct: 427 RRQESFAYTAKKDEICIEIQCKNPIEFELAIRKPWWAMK--IEVAVNEDLYYSIDDSSYI 484
Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTA 643
+RW+ NDK+ I ++ T + DD P+ A + GP +LAG I T
Sbjct: 485 QLMQRWN-NDKVKITFYKTVETCPMPDD-PQQV---AFMIGPVVLAGLCENRKKI-TING 538
Query: 644 RSLSALISPI 653
+ + +I PI
Sbjct: 539 KEIKDVIIPI 548
>gi|302670053|ref|YP_003830013.1| glycoside hydrolase [Butyrivibrio proteoclasticus B316]
gi|302394526|gb|ADL33431.1| glycoside hydrolase [Butyrivibrio proteoclasticus B316]
Length = 780
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 204/625 (32%), Positives = 299/625 (47%), Gaps = 58/625 (9%)
Query: 98 FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
LKE L V ++ A ++ YL LD + L+ F + A L Y GWEN
Sbjct: 1 MLKEFDLTQVCVNDEYCA-NALNKDVAYLKSLDPERLLAGFYENAGLTPKKIRYSGWENM 59
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEK-----MSTVVFSLSECQNK--------IG 204
+ + GH +GHYL+A+AQ +A+ +K + T+V L ECQ G
Sbjct: 60 L--IGGHTLGHYLTAAAQGYANPGTRKEDKKALFDIIKTLVDGLLECQEHSQGKKGFVFG 117
Query: 205 TGYLSAFPTEL-FDSFE-----ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
+ + EL FD E + W P+YT+HKIL GL+ +V ALK+A +
Sbjct: 118 AIIMDSNNVELQFDHVEHGRTNIITESWVPWYTMHKILDGLVSTFVFTGYEPALKVAEGI 177
Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
++ YNR + +S E H L+ E GGMND LY+LY +T +HL AH FD+
Sbjct: 178 GDWTYNRA----SGWSEETHKTVLSIEYGGMNDALYKLYRLTGKKEHLEAAHAFDEEELF 233
Query: 319 GFLAL-QADYLSHFHANTHIPIVIGSQMRYEVTGDPL--YKLIGTFFMDIVNASHSYATG 375
+A A+ L++ HANT IP +G+ RY GD Y F D+V H+YATG
Sbjct: 234 KKVATGDANVLNNRHANTTIPKFLGALQRYMTLGDVAGEYLTYVQKFWDMVVERHTYATG 293
Query: 376 GTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVL 435
G S E + + L + N ETC TYNMLK+SR LFR T + YADYYE N +L
Sbjct: 294 GNSEWEHFGEDFVLDAERTNCNNETCNTYNMLKMSRDLFRITGDKKYADYYENTFINAIL 353
Query: 436 SIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEE 495
S Q E G+ +Y P+ G K +GT F+ FWCC GTG+E+F+KL DSIYF ++
Sbjct: 354 SSQN-PESGMTMYFQPMATGYYKV-----YGTPFDKFWCCTGTGMENFTKLNDSIYFLDD 407
Query: 496 GNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
+V + YISS + L QK S P L F+ E + L R
Sbjct: 408 ESV---IVNMYISSVVCDSKKKLTLTQK-----SLIPKGNTAL-FTINLEEPVKTKLRFR 458
Query: 556 MPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
+P W + +A +G+ G F T ++ND I++ + T + P+
Sbjct: 459 VPDWAVNATCKALSSGKTYQAEADGYF---TVEETFNDGDQIEISFEMHT--VVKRLPDC 513
Query: 616 ASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
++ A +GP LL+ E I T ++ IP + A T + G+ + +
Sbjct: 514 ENVFAFKYGPVLLSADLGCENMIDGTTGVDVT-----IPTNKIAGKEYLTVQDGSVSDYI 568
Query: 676 SNSNQSITMEE----FPVSGTDAAL 696
++ ++ + + F ++GTD L
Sbjct: 569 ADIDKHLIRKGDELCFTLTGTDREL 593
>gi|293375008|ref|ZP_06621302.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
gi|292646370|gb|EFF64386.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
Length = 763
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 180/536 (33%), Positives = 277/536 (51%), Gaps = 46/536 (8%)
Query: 107 VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
V L++ S+ +Q +YLL LDV+ L+ + AS P +YGGWE+ E++GH +
Sbjct: 6 VRLEKDSLFEISQAVGKQYLLDLDVERLLAPIYEGASQIPPKPSYGGWES--LEIKGHSI 63
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF---------- 216
GHYLSA M+ +T + +KE+M ++ + S Q GYL F + F
Sbjct: 64 GHYLSALTCMYEATKDLELKERMDYIIETFSLLQR--ADGYLGGFLSTPFEQVFTGEFHV 121
Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
D F +L W P+Y+IHKI AGL+D Y + N +AL + + ++ Y + + S E
Sbjct: 122 DHF-SLSHYWVPWYSIHKIYAGLMDAYQIGKNVEALNILKKLADWAYEGSR----LMSDE 176
Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
+ L E GGMN+V+ LY IT D ++L LA F + + LA D L HANT
Sbjct: 177 QFQRMLICEYGGMNEVMAELYEITQDERYLYLAKRFTQHLIMDPLAAGVDDLQGRHANTQ 236
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
IP V+G+ YEVTGD Y + FF + V SY GG S+ E + + L E
Sbjct: 237 IPKVLGAAKLYEVTGDDYYYRVAKFFFETVVLHRSYVIGGNSSGEHFGPSD--TEALSRE 294
Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
ETC TYNM+K++++LF+WTK+ Y D+ ERA N +L+ Q G IY G
Sbjct: 295 AAETCNTYNMIKLAKYLFKWTKDSKYIDFIERATYNHILASQ-DPHTGCKIYFTSNYPGH 353
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
K +GTK +SFWCC GTG+E+ + I+F+E+ + Y+ +++SSF +
Sbjct: 354 FKV-----YGTKEDSFWCCTGTGMENPGRYTHHIFFKEDED---FYVNLFMASSFVKEDE 405
Query: 517 HVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQL-SSLNLRMPVWTYSNGAQASLNGQNL 574
+ + + D PI + + L F +E QL ++ +R+P W + + GQ+
Sbjct: 406 QLKVVLQTDFPISN-----VVKLVF---EEANQLFLNVKIRVPYWL-NAPIEVRFKGQSY 456
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
G +L ++ + +D++ I LP+ L E + D P A ++GP +LA
Sbjct: 457 EGNGQG-YLMISDTFHADDEIEIVLPMGLH-EYVSMDDPHKV---AFMYGPVVLAA 507
>gi|336321977|ref|YP_004601945.1| hypothetical protein Celgi_2884 [[Cellvibrio] gilvus ATCC 13127]
gi|336105558|gb|AEI13377.1| protein of unknown function DUF1680 [[Cellvibrio] gilvus ATCC
13127]
Length = 781
Score = 253 bits (646), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 194/581 (33%), Positives = 270/581 (46%), Gaps = 73/581 (12%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWE-- 155
L EVSL + SV RAQQ ++ VD ++ FR+ A+L G A GGWE
Sbjct: 91 LTEVSLGE------SVFTRAQQQMVDLARAYPVDRVLVVFRRNANLDVRGASAPGGWEEL 144
Query: 156 NPISE---------------------LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVF 194
P + LRGH+ GH+LS A +A+T + I +K+ V
Sbjct: 145 GPAPDEQRWGPAEYVRGQNTRGAGGLLRGHYGGHFLSMLAMAYATTGDQAILDKVDDFVD 204
Query: 195 SLSECQNKIGT-------GYLSAFPTELFDSFEALKP---VWAPYYTIHKILAGLLDQYV 244
L EC+ + G+L+A+ F + EA P +WAP+YT HKILAGL+D Y
Sbjct: 205 GLEECRAALAATGKYSHPGFLAAYGEWQFSALEAYAPYGEIWAPWYTCHKILAGLIDAYR 264
Query: 245 LADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDP 303
+A AL++A + + + R+ T +ER W + E GGMND L LY+++
Sbjct: 265 YTGSALALQLAEGLGRWTHARL-SACTPEQLERMWGIYIGGEAGGMNDALVDLYTLSAAA 323
Query: 304 KH---LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGT 360
L A LFD + A D L+ HAN HIP +G TGD Y
Sbjct: 324 DRDDFLAAAALFDLRSLVTACAQDRDTLNGKHANMHIPTFVGYAKLGAWTGDATYTAATR 383
Query: 361 FFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEI 420
F ++ YA GGT E W +A +G N E+C YNMLKV+R LF ++
Sbjct: 384 NFFGMIVPGRMYAHGGTGEGEMWGPANTVAGDIGPRNAESCAAYNMLKVARTLFFEQQDP 443
Query: 421 AYADYYERALTNGVLSIQR---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
AY DYYER + N +L +R T +YM P+G G K GT CC G
Sbjct: 444 AYMDYYERTVLNHILGGKRDQASTTSPQNLYMFPVGPGARKEYGNGNIGT------CCGG 497
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
TG+ES K DSI+F + L++ Y+ S W S + + Q+ D T
Sbjct: 498 TGLESPVKYQDSIWFRSADD-SALWVNLYVPSELRWTSRGLRIVQEGDYPND------ET 550
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYS-----NGAQASLNGQNLPLPPPGNFLSATERWSYN 592
+T + G+L L LR+P W S NGA + PG +LS W+
Sbjct: 551 VTLRIAEGAGEL-DLRLRVPAWATSFVVAVNGATVASTAAGTAT--PGTYLSVDRTWAAG 607
Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
D++TI L L LR E DRP+ IQ++ GP +L+ +S
Sbjct: 608 DQVTITLALPLRAEPTI-DRPD---IQSLQRGPVVLSALSS 644
>gi|390943351|ref|YP_006407112.1| hypothetical protein Belba_1756 [Belliella baltica DSM 15883]
gi|390416779|gb|AFL84357.1| hypothetical protein Belba_1756 [Belliella baltica DSM 15883]
Length = 785
Score = 253 bits (645), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 183/611 (29%), Positives = 298/611 (48%), Gaps = 58/611 (9%)
Query: 109 LDQSSVL----WRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGH 164
LDQ +L AQ+ + +Y+L +DVD L+ + K A + + YG WE+ + L GH
Sbjct: 32 LDQVRLLDSPFKNAQEVDKKYILEMDVDRLLAPYMKDAGIEWIAENYGNWED--TGLDGH 89
Query: 165 FVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE-- 220
GHYLSA + M+AST + IK ++ ++ L Q+K GY+ P ++++
Sbjct: 90 IGGHYLSALSMMYASTGDIEIKSRLDYMIEQLKLAQDKNANGYIGGVPNGQKIWEEIRVG 149
Query: 221 -------ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMY 273
+L W P Y IHKI AGL D Y++A A A M + ++FY+ + +
Sbjct: 150 NIKAGSFSLNDRWVPLYNIHKIYAGLKDAYLIAGIADAKPMLIALSDWFYDLTEG----F 205
Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
S + L E GG+N+V + ++T +PK+L LA L L+ + D L+ HA
Sbjct: 206 SEAQFQEILISEHGGLNEVFADVSAMTGNPKYLELAKKMSHNLILDPLSKRQDNLTGMHA 265
Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
NT IP VIG Q +++ + + T+F + V S + GG S RE + + L
Sbjct: 266 NTQIPKVIGFQRIAQLSDEAKWNNSATYFWENVTNQRSVSIGGNSVREHFHPKDDFSPML 325
Query: 394 GSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
S+ ETC TYNM+++S LF + + Y DYYERAL N +LS Q T+ G +Y P+
Sbjct: 326 SSDQGPETCNTYNMMRLSEKLFESSPDRKYIDYYERALYNHILSSQHPTKGG-FVYFTPM 384
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
+ + + +FWCC G+G+E+ +K G IY +E L++ +I+S
Sbjct: 385 -----RPQHYRVYSQPHENFWCCVGSGLENHAKYGQVIYAHKEDE---LFVNLFIASELS 436
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
W+ + L QK D S TL F K + + L +R P W + +NG+
Sbjct: 437 WEEKGIKLTQKTDFPFS----ESTTLQFDHKGK--KEFKLKIRYPDWVKGGAMEVKVNGK 490
Query: 573 NLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
+ P+ ++ +W D++++ LP+S + E + D P +AS + GP +LA
Sbjct: 491 SFPISLSKDGYVVIDRKWKSKDQVSVTLPMSTKVEYLADGSP-WAS---FVHGPIVLAAE 546
Query: 632 TSGEWDIK---TGTARSLSALISPIPPSFNAQLVTFTQE-------SGNSTFVMS----- 676
T G+ D+K +R + P F ++ T+E + N F ++
Sbjct: 547 T-GKEDLKGVFADDSRMGHVASGKMIPIFQTAILKQTEEKISPAKSNDNFNFYLAENQFH 605
Query: 677 NSNQSITMEEF 687
N N+S+++ F
Sbjct: 606 NQNESVSLVPF 616
>gi|399025507|ref|ZP_10727503.1| hypothetical protein PMI13_03476 [Chryseobacterium sp. CF314]
gi|398077884|gb|EJL68831.1| hypothetical protein PMI13_03476 [Chryseobacterium sp. CF314]
Length = 791
Score = 253 bits (645), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 169/536 (31%), Positives = 260/536 (48%), Gaps = 38/536 (7%)
Query: 113 SVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSA 172
SV +A QT+ +Y+L +D D L+ + K A L Y WEN + L GH GHY+SA
Sbjct: 37 SVFSKAMQTDEKYILSMDADRLLAPYLKEAGLKPKKANYPNWEN--TGLDGHIGGHYISA 94
Query: 173 SAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEA 221
A M+AST +A +K+++ ++ L CQN GYLS P + +
Sbjct: 95 LALMYASTGDAKVKQRLDYMIDELERCQNLSENGYLSGVPNGKKIWKEIAGGNIRAATFG 154
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
L W P Y IHKI +GL D Y AD+ +A KM + ++ V V++ ++
Sbjct: 155 LNDRWVPLYNIHKIYSGLRDAYWYADSGKAKKMLIRLTDWMVGEVS-VLSDAQIQN---M 210
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
L E GG+N+V +Y IT +PK+L LAH F L L D + HANT IP VI
Sbjct: 211 LRSEHGGLNEVFADVYDITKNPKYLRLAHRFSHLAILNPLLNGEDKFTGIHANTQIPKVI 270
Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEET 400
G + ++ + + FF V S GG S E + + + S E ET
Sbjct: 271 GFKRIADLENNKEWSNAADFFWINVTQKRSAVIGGNSVSEHFNPINDFSGMIKSIEGPET 330
Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
C TYNMLK+S+ L+ + +Y DYYERAL N +LS Q E G +Y P+ G +
Sbjct: 331 CNTYNMLKLSKELYATNPKSSYIDYYERALYNHILSTQ-NPEKGGFVYFTPMRPGHYRV- 388
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
+ SFWCC G+G+E+ +K G+ IY + + LY+ +I S W +VL
Sbjct: 389 ----YSQPETSFWCCVGSGMENHAKYGEMIYAHSDED---LYVNLFIPSILKWSEKKMVL 441
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPG 580
Q+ + ++ SK ++ ++ LR P W+ ++ S+N +N+ +P
Sbjct: 442 RQENN--FPESASTKLIFDVVSKSDI----NMKLRAPEWSDASQITISVNHKNINVPIDA 495
Query: 581 N-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ S +W D + +++P+ L E + P+++ A +GP +LA E
Sbjct: 496 EGYFSVKRKWKKGDVIEMKMPMHLSAEQL----PDHSDYFAFKYGPIVLAAKYGKE 547
>gi|388259955|ref|ZP_10137121.1| hypothetical protein O59_001162 [Cellvibrio sp. BR]
gi|387936316|gb|EIK42881.1| hypothetical protein O59_001162 [Cellvibrio sp. BR]
Length = 803
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 174/553 (31%), Positives = 268/553 (48%), Gaps = 56/553 (10%)
Query: 106 DVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHF 165
DV L S L +AQ TN +YL+ LD + L+ FR+ A LP + YG WE+ + L GH
Sbjct: 31 DVQLLDSPFL-QAQNTNKDYLMALDTEKLLAPFRREAGLPFK-ETYGNWES--TGLDGHM 86
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELF--- 216
GHY++A A ++A+T + + ++++ V+ L +CQ+K+G+GY+ P +E+
Sbjct: 87 GGHYVTALALLYAATKDDVVLQRLNYVIAELKKCQDKLGSGYIGGIPDSNTMWSEIARGD 146
Query: 217 ---DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMY 273
D+F + W P+Y +HKI AGL D Y+ A N A KM + ++ +K+
Sbjct: 147 IRADNF-STNERWVPWYNLHKIYAGLRDAYLYAGNEDAKKMLVRLSDWTIELTKKL---- 201
Query: 274 SVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA 333
S E+ L E GGMN+V + IT D K+L LA F L L Q D L+ HA
Sbjct: 202 SPEQMQTMLRTEHGGMNEVFVDVAEITGDKKYLKLAEAFSHQAILQPLEKQQDQLTGLHA 261
Query: 334 NTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
NT IP +IG + + T + + FF V + A GG S +E + D +
Sbjct: 262 NTQIPKIIGFKKVADATHNESWNKAAEFFWQTVVDKRTVAIGGNSVKEHFHDSHDFTAMI 321
Query: 394 GS-ENEETCTTYNMLKVSRHLFRWTKE--------------IAYADYYERALTNGVLSIQ 438
E ETC TYNMLK+++ LF +++ + Y DYYERAL N +LS Q
Sbjct: 322 EDVEGPETCNTYNMLKLTQLLFLSSRDNSAADMKKSKNNPAMKYVDYYERALYNHILSSQ 381
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGN 497
G++ + K H + WCC G+GIES SK + IY + +
Sbjct: 382 HPQTGGLVYFTSMRPNHYRKYSQVH------DGMWCCVGSGIESHSKYAEFIYARDLDKK 435
Query: 498 VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
+P +++ +I S W + Q + L M E + L LR P
Sbjct: 436 IPEVFLNLFIPSRMTWAEQGISFTQNTQFPDAETTELVM--------ETSKRFRLQLRYP 487
Query: 558 VWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W + Q +NG+ + + PG++++ RW DK+ + LP+ R E + P+ +
Sbjct: 488 RWVEAGQLQLRVNGKTVSVKQQPGDYIALERRWKKGDKVQLALPMKPRLEKL----PDGS 543
Query: 617 SIQAILFGPYLLA 629
+ A+L GP +LA
Sbjct: 544 NYYAVLHGPIVLA 556
>gi|315498357|ref|YP_004087161.1| hypothetical protein Astex_1338 [Asticcacaulis excentricus CB 48]
gi|315416369|gb|ADU13010.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
48]
Length = 797
Score = 252 bits (643), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 180/593 (30%), Positives = 284/593 (47%), Gaps = 49/593 (8%)
Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
+ + L V L S L A + N YLL L D ++++ K A +P G+ YGGWE+
Sbjct: 39 RPIPLTQVRLLPSPFL-EAVEANRRYLLFLSPDRFLYNYHKFAGMPVKGEIYGGWES--D 95
Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-------- 211
+ G +GHYLSA + M A T + ++ ++ L + Q G GY++ F
Sbjct: 96 TIAGEGLGHYLSALSLMHAQTGDNECVARIHYIISELEKVQAAHGDGYVAGFMRKRKDGS 155
Query: 212 ---PTELFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
E+F A L W P+Y HK+ AGLLD + + +A +
Sbjct: 156 IVDGKEIFPEIMAGDIRSAGFDLNGCWVPFYNWHKLFAGLLDAQAYCGVDRGIPVAEKLG 215
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
Y ++ V + L+ E GG+N+ LYS T++P+ L L+ L
Sbjct: 216 GY----IEMVFAALDDAQTQKVLDCEHGGINESFAELYSRTNNPRWLKLSERLYHHRMLD 271
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
LA + D L++ HANT +P +IG YE+T P Y+ +FF + V HS+ GG +
Sbjct: 272 PLAAREDKLANNHANTQVPKLIGLARLYELTQKPQYQTASSFFWERVVNHHSFVIGGNAD 331
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
RE++++P ++ + + E+C TYNMLK++RHL+ W+ + A+ DYYERA N +L+ Q
Sbjct: 332 REYFFEPDTISAHITEQTCESCNTYNMLKLTRHLYSWSPKAAWFDYYERAHLNHMLAHQN 391
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
+ G+ YM+PL G ++ G+ + NSFWCC +GIE+ SK GDSIY+ +E
Sbjct: 392 -PKTGMFTYMMPLMSGAAR-----GFSDEENSFWCCVLSGIETHSKHGDSIYWHQEKT-- 443
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
L++ +I S +W + + PY S+ + ++ +R+P W
Sbjct: 444 -LFVNLFIPSKVNWAEQKAAFE-----LTTKYPYEGQVALKLSQLSGAKTFTVAVRIPGW 497
Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
++ Q +NG+ + T +W D +T+ LPL LR E D +
Sbjct: 498 AEASTLQ--VNGKPALAKMNDGYALITRKWRAGDVVTLDLPLKLRFETAAGDN----KVV 551
Query: 620 AILFGPYLLAGHTSGEWDIKTGTARSL--SALISPIPPSFNAQLVTFTQESGN 670
A+L GP +LA G A +L S LI P A+ V ++ SG
Sbjct: 552 ALLRGPMVLAADLGPADQPWGGDAPALVGSDLIGSFYPVSAAEAVYVSKGSGR 604
>gi|90020425|ref|YP_526252.1| Acetyl-CoA carboxylase, biotin carboxylase [Saccharophagus
degradans 2-40]
gi|89950025|gb|ABD80040.1| protein of unknown function DUF1680 [Saccharophagus degradans 2-40]
Length = 803
Score = 252 bits (643), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 171/534 (32%), Positives = 262/534 (49%), Gaps = 41/534 (7%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ N+EY+L L D L+ F K A LP + YG WE+ L GH GHYL+A + +
Sbjct: 49 AQDKNVEYVLALQPDKLLAPFLKEAGLPVKAENYGNWES--QGLDGHIGGHYLTALSLAY 106
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE---------ALKPVW 226
A+T + + ++++ ++ L QNK GY+ L+D+ AL W
Sbjct: 107 AATGDKRLLDRLNYMLNELERAQNKNSNGYIGGVRNGKALWDNIAKGDIRADLFALNDYW 166
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
P+Y +HKI AGL D Y+ + QA M + E+ + + E+ L E
Sbjct: 167 VPWYNLHKIYAGLRDAYIYTGSEQAKAMLIGLGEW----TIALTADLNDEQIEKMLTTEY 222
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGMN+V + +IT D ++L LA F L L + D L+ HANT IP V+G Q
Sbjct: 223 GGMNEVFADMAAITGDKRYLSLAKQFSHKKILNPLLQKRDALNGLHANTQIPKVVGYQRV 282
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
E+TGD + +F V + + A GG S RE + D + A + E ETC TYN
Sbjct: 283 AELTGDEEWHKAADYFWHHVVNNRTVAIGGNSVREHFHDSEDFAPMINDVEGPETCNTYN 342
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
MLK+SR LF + Y DY+ERAL N +LS Q E G ++Y P+ + + +
Sbjct: 343 MLKLSRMLFSVNPSVDYVDYFERALYNHILSSQH-PETGGLVYFTPM-----RPQHYRMY 396
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
+ WCC G+GIE+ K G+ IY ++ N LY+ +I+S+ W+ V L Q+
Sbjct: 397 SQVDTAMWCCVGSGIENHVKYGEFIYAKQNNN---LYVNLFIASTLVWQEKGVHLTQE-- 451
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLS-----SLNLRMPVWTYSNGAQASLNGQNLPL-PPP 579
++ R TLT + +V ++++R P W + +NG+ + +
Sbjct: 452 --NTFPDSNRTTLTVALDSKVKSSKKHAKFTMHIRYPRWAQAGKVVVKVNGKPINVKAKA 509
Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
G ++ RW D + + LP+++ EA+ D Y A+L+GP +LA T
Sbjct: 510 GEYIEINRRWHNGDNVELSLPMNIALEALPDQSDYY----AVLYGPIVLAAKTQ 559
>gi|315499577|ref|YP_004088380.1| hypothetical protein Astex_2584 [Asticcacaulis excentricus CB 48]
gi|315417589|gb|ADU14229.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
48]
Length = 791
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 170/540 (31%), Positives = 265/540 (49%), Gaps = 52/540 (9%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A N YLL ++ D L+ ++RK A L + YGGWE + GH +GHYLSA + M
Sbjct: 56 AVDVNEAYLLSVNPDRLLHNYRKFAGLTPKAELYGGWER--DTIAGHSLGHYLSAISLMH 113
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP-----------TELFDSFEA----- 221
A T NA +K + + ++ L+ Q G GY++ F E+F A
Sbjct: 114 AQTGNAALKLRAAYIIDELALVQGAHGDGYVAGFTRKRKDGRVVDGKEIFPELMAGDIRS 173
Query: 222 ----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
L W P Y HK+ +GL D +AL +A + Y + KV + ++
Sbjct: 174 AGFDLNGCWVPLYNWHKLYSGLFDAQTFCGYDKALTVAVGLGVY----IDKVFRALTDDQ 229
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
LN E GG+ND LY T +P+ L LA + L D L++ HANT +
Sbjct: 230 VQTVLNCEFGGLNDSFAELYRRTENPRWLALAQRLHHKRIIDPLTAGEDKLANNHANTQV 289
Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN 397
P ++G +EVTG+ + +FF + V HSY GG + RE++++P ++ +
Sbjct: 290 PKLLGEATLFEVTGNENNRKAASFFWERVVNHHSYVIGGNADREYFFEPDTISKHITEAT 349
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
E C TYNMLK++RHL+ W + Y DY+ERA N VL+ Q+ + G+ YM PL G +
Sbjct: 350 CEHCNTYNMLKLTRHLYGWEPDARYFDYFERAHFNHVLA-QQNPKTGMFSYMTPLFTGAA 408
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KS 515
+ G+ +++ CC+G+G+ES +K G+SI+++ L++ YI ++ W K
Sbjct: 409 R-----GFSDPVDNWTCCHGSGMESHAKHGESIFWQSSDT---LFVNLYIPATARWATKG 460
Query: 516 GHVVLNQKVDPIVSWDPYL-RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
H+ L+ PY + + SS + + L LR+P W + A +LN + +
Sbjct: 461 AHLRLDTGY-------PYDGNIVFSLSSLRRPTKF-KLALRVPAW--AKRADLTLNNKPV 510
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
G +L W+ D + + LPL LR EA +DD + A+L GP +LA G
Sbjct: 511 KATRDGGYLVIDRAWAVGDTVRLSLPLDLRFEATRDD----GKVVAVLRGPLVLAADLGG 566
>gi|392964292|ref|ZP_10329713.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
gi|387847187|emb|CCH51757.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
Length = 739
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 176/554 (31%), Positives = 272/554 (49%), Gaps = 53/554 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
++ +L +V L S +AQ +L+Y+L L+ D L+ + A LP + YG WE+
Sbjct: 1 MQPFTLQEVRL-TSGPFKQAQDVDLKYILALNPDKLLAPYLIDAGLPLKAQRYGNWES-- 57
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
L GH GHYLSA A M+AST +K+++ ++ L+ CQ K G GY+ P
Sbjct: 58 VGLDGHIGGHYLSALAMMYASTGEPELKKRLDYMIGELARCQAKNGNGYVGGIPQGKVFW 117
Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
++ S L W P Y IHK+ AGL D Y A N QA ++ + ++F
Sbjct: 118 DRIHKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDAYAYAGNGQAKQVLIGLGDWFV---- 173
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
++I S E+ L E GG+N+ LY +T+D K+L A L L Q D
Sbjct: 174 ELIKPLSDEQIQQVLRTEHGGINETFADLYILTNDKKYLETAQRLSHRALLYPLLEQQDK 233
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L+ HANT IP VIG + +TG + +F V+ + S A GG S RE +
Sbjct: 234 LTGLHANTQIPKVIGFEKIATLTGKTDWSEAAMYFWRNVSQTRSVAFGGNSVREHFNPTT 293
Query: 388 RLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
+ L S + ETC ++NML++S+ LF +++Y D+YER L N +LS Q E G
Sbjct: 294 DFSQVLRSNQGPETCNSFNMLRLSKALFLDKNDVSYLDFYERTLYNHILSSQH-PEKGGF 352
Query: 447 IYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
+Y P+ R H + S WCC G+G+E+ +K G+ IY + L++
Sbjct: 353 VYFTPI-------RPNHYRVYSQSETSMWCCVGSGLENHTKYGELIYSHSTND---LFVN 402
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS-- 562
+I S+ +WK V LNQ+ + PY T +Q Q+ S+ +R P W +
Sbjct: 403 LFIPSTLNWKEKGVRLNQRTNF-----PYENGT-ELVVQQAKPQVFSVQIRYPKWAENLE 456
Query: 563 ---NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
NG Q ++NG+ P +++ + +W D +T++ S R E + P+ ++
Sbjct: 457 VLVNGKQQAVNGK------PSEYVAISRKWKAGDIITVRFKTSTRLEQL----PDGSNWA 506
Query: 620 AILFGPYLLAGHTS 633
A + GP +LA TS
Sbjct: 507 AFVHGPIVLAAKTS 520
>gi|395493738|ref|ZP_10425317.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas sp. PAMC
26617]
Length = 646
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 191/614 (31%), Positives = 286/614 (46%), Gaps = 59/614 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE-NP 157
L+ L DV L + L AQ+ YLL LD D ++ +FR A L YGGWE +P
Sbjct: 46 LQPFDLADVDLGEGPFL-HAQRKTEAYLLSLDPDRMLHAFRVNAGLKPKAAVYGGWESDP 104
Query: 158 I---SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
I +GH +GHYLSA A + ST ++++ + L+ CQ+ +G + AFP
Sbjct: 105 IWADINCQGHTLGHYLSACALAYRSTRKPAFRQRIDHIARELAACQDAARSGLVCAFPKG 164
Query: 215 ---LFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
+ P+YT+HK+ AGL D ++AD+A++ L++A W V
Sbjct: 165 PALVAAHLRGDAITGVPWYTLHKVFAGLRDATLMADSAESRAVLLRLADWAV-------- 216
Query: 268 KVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
V T + + ++ E E GGMN+V LY +T +P + +A F L LA D
Sbjct: 217 -VATRPLSDAQFETMLETEHGGMNEVFADLYLMTGNPDYRTMAERFSHKALLTPLAAGRD 275
Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
L HANT +P ++G Q +E TG P Y FF V + S+ATGG E ++
Sbjct: 276 QLDGLHANTQLPKIVGFQRVFEATGTPHYHEAAAFFWRTVALTRSFATGGHGDNEHFFPM 335
Query: 387 KRL-ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
++ ETC +NMLK++R LF + YADYYER L NG+L+ Q + G+
Sbjct: 336 AEFDKHVFSAKGSETCGQHNMLKLTRALFLQDPQAEYADYYERTLYNGILASQ-DPDTGM 394
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
+ Y G K + T +SFWCC GTG+E+ K DSIYF ++ LY+
Sbjct: 395 VTYFQGARPGYMKL-----YHTPEHSFWCCTGTGMENHVKYRDSIYFHDDK---ALYVNL 446
Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--- 562
++ S+ W+ V L Q+ P + T +V +L LR P W+ S
Sbjct: 447 FVPSAVRWREKGVALRQETR--FPDAPTTTLHWTVERPTDV----TLQLRHPRWSRSAIV 500
Query: 563 --NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
NG +A+ + PG+++ W D T++L L++ E + D P I A
Sbjct: 501 LVNGVEAARSDT------PGSYVKLARTWHSGD--TVELRLAM--EVVPDQAPAAPDIVA 550
Query: 621 ILFGPYLLAGHTSGEWDIKTGTARSLSALISPIP-PSFNAQLVTFTQESGNSTFVMSNSN 679
+GP +LAG E G A +I+ +NA VT GN + +
Sbjct: 551 FSYGPMVLAGVLGRE-----GLAPGADVIINERKYGEYNAGPVTVPTLVGNPATLAAQVR 605
Query: 680 QSITMEEFPVSGTD 693
++ EF + D
Sbjct: 606 KADGPLEFTIPAAD 619
>gi|380512705|ref|ZP_09856112.1| hypothetical protein XsacN4_15862 [Xanthomonas sacchari NCPPB 4393]
Length = 799
Score = 251 bits (640), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 175/556 (31%), Positives = 266/556 (47%), Gaps = 49/556 (8%)
Query: 95 PGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW 154
PG ++ + L V L + S+ + QTN YLL L+ D L+ +F + A LP G YGGW
Sbjct: 51 PGR-VQALPLQQVTL-KPSLFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGAVYGGW 108
Query: 155 ENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
E + GH +GHYLSA A+M A T + ++E++ +V L+ Q + GY+ F T
Sbjct: 109 EG--DTIAGHTLGHYLSALAKMHAQTRDPVLRERIDYIVAELARAQAQDPDGYVGGF-TR 165
Query: 215 LFDSFEA---------------------LKPVWAPYYTIHKILAGLLDQYVLADNAQALK 253
D E L W+P YT HK+ AGLLD + LA + QAL+
Sbjct: 166 KNDKGEIEGGKAVLEDVRRGIIKGSKFNLNGSWSPLYTQHKLFAGLLDAHALAGSKQALE 225
Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
+ + Y V + L+ E GG+N+ L + T D + + +
Sbjct: 226 VLLPLAAY----TAGVFDALDHAQMQTLLDTEFGGLNESYIELGARTGDARWVAIGKRLR 281
Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYA 373
+ A D L H HANT +P IG ++EV GD FF + V A +SY
Sbjct: 282 HEKVIDPAAAGRDELPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTAHYSYV 341
Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
GG + RE++ +P +A L + E C +YNMLK++RHL++WT + Y DYYER L N
Sbjct: 342 IGGNADREYFQEPDTIAAFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNH 401
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
++ Q G+ YM P+ G + G+ KF+SFWCC G+G+E+ ++ GD+IY++
Sbjct: 402 TMAAQHPAT-GMFTYMTPMISGGER-----GFSDKFDSFWCCVGSGMEAHAQFGDAIYWQ 455
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
+ LY+ YI S DW + L ++D V + +R+ + + ++ +L
Sbjct: 456 ---DATSLYVNLYIPSRLDWTERDLAL--ELDSGVPDNGKVRLQVLRAGQRAPRRLL--- 507
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
LR+P W A +NG +L+ W D + + L LR E D
Sbjct: 508 LRVPAWCQGRYA-LRVNGSPARAALVDGYLTLERDWRAGDVIDLDLATPLRLEHAAGD-- 564
Query: 614 EYASIQAILFGPYLLA 629
A ++ GP LA
Sbjct: 565 --ADTVVVMRGPLALA 578
>gi|192360871|ref|YP_001981311.1| hypothetical protein CJA_0803 [Cellvibrio japonicus Ueda107]
gi|190687036|gb|ACE84714.1| conserved hypothetical protein [Cellvibrio japonicus Ueda107]
Length = 802
Score = 250 bits (639), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 178/539 (33%), Positives = 257/539 (47%), Gaps = 50/539 (9%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ TN +YL+ LDV+ L+ FR+ A LP + YG WE+ + L GH GHY+SA A +
Sbjct: 49 AQNTNKQYLMALDVEKLLAPFRREAGLPYK-ETYGNWES--TGLDGHIGGHYISALALTY 105
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL------------FDSFEALKPV 225
AST + + ++ V+ L +CQ+K G GYL+ P D+F +
Sbjct: 106 ASTGDPAVLARLEYVITELKKCQDKNGNGYLAGLPEGAGIWQEIARGDIRADNF-STNER 164
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
W P+Y +HK AGL D Y N A M E+ + + + S E+ L+ E
Sbjct: 165 WVPWYNLHKTFAGLRDAYRYTGNETAKAMLVAFSEWTWALTKDL----SDEQMQTLLHTE 220
Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
GGMNDV + IT D ++L LA F L L + D L+ HANT IP VIG +
Sbjct: 221 HGGMNDVFVDVADITGDKRYLHLAERFSHRAILQPLLEKRDALTGLHANTQIPKVIGFKR 280
Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
+ ++ FF + V S A GG S RE + + E ETC TY
Sbjct: 281 VGDAEQLAEWQSAAEFFWETVVNKRSVAIGGNSVREHFHPQDNFHSMIEDVEGPETCNTY 340
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
NMLK++ LF Y DYYERAL N +L Q + G +Y P+ +
Sbjct: 341 NMLKLTEQLFLDNPLGKYGDYYERALYNHILGSQH-PQTGGFVYFTPM-----RPNHYRV 394
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEE--------EGNVPGLYIIQYISSSFDWKSG 516
+ + WCC G+G+ES SK + IY N+P +Y+ +I S +WK
Sbjct: 395 YSQVHDGMWCCVGSGLESHSKYAEFIYARGMKKSAGWFARNIPQVYVNLFIPSQLNWKET 454
Query: 517 HVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
+ L Q+ P V P + L S + +L+LR P W ++ Q +NG+
Sbjct: 455 GIRLRQENQFPDV---PETSIVLESSGR------FTLHLRYPQWVEADTLQLRINGKVEK 505
Query: 576 L-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+ PGN+L+ RW DKL I+LP+ E++ P+ +S A+L+GP +LA T
Sbjct: 506 ISSQPGNYLAIERRWKKGDKLDIRLPMKPHLESL----PDGSSYYAVLYGPIVLAAKTQ 560
>gi|296331240|ref|ZP_06873712.1| hypothetical protein BSU6633_09061 [Bacillus subtilis subsp.
spizizenii ATCC 6633]
gi|296151355|gb|EFG92232.1| hypothetical protein BSU6633_09061 [Bacillus subtilis subsp.
spizizenii ATCC 6633]
Length = 761
Score = 250 bits (638), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 171/527 (32%), Positives = 264/527 (50%), Gaps = 39/527 (7%)
Query: 114 VLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSAS 173
+ + +Q EYLL LDVD L+ + S YGGWE E+ GH +GH+LSA+
Sbjct: 10 MFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAGHSIGHWLSAA 67
Query: 174 AQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------LKP 224
+ M+ ++ + +K K V LS Q GY+S F FD + L
Sbjct: 68 SAMYQASGDEKLKRKAEYAVNELSHIQQFDEEGYISGFSRACFDEVFSGDFRVDHFSLGG 127
Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
W P+Y++HK+ AGL+D Y L N AL++ + ++ +K + + E+ L
Sbjct: 128 SWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRMLIC 183
Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
E GGMN+ + LY +T + +L LA F L LA D L HANT IP VIG+
Sbjct: 184 EHGGMNEAMADLYILTKNKSYLDLAERFCHRAILQPLAEGKDELEGKHANTQIPKVIGAA 243
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
Y++TG+ Y+ FF + V SYA GG S E + ++ LG ETC TY
Sbjct: 244 KLYDITGNEAYRNPALFFWEQVVYQRSYAIGGNSIGEHF--GAEGSEELGVTTAETCNTY 301
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
NMLK++ HLFRW E + DYYE AL N +LS Q E G+ Y + G K
Sbjct: 302 NMLKLTGHLFRWFHEARFTDYYENALYNHILSSQ-DPESGMKTYFVSTQPGHFKV----- 355
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
+ + +SFWCC GTG+E+ ++ +IY ++ + LY+ +I S + + +++ Q+
Sbjct: 356 YCSPEDSFWCCTGTGMENPARYTQNIYHLDQDD---LYVNLFIPSQINVREKQMIITQE- 411
Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA-QASLNGQNLPLPPPGNFL 583
S+ + L K+ G +L +R+P WT NG+ +A +NG+ + +L
Sbjct: 412 ---TSFPAANKTKLVV--KKADGVPMTLQIRIPYWT--NGSLKAVVNGKRVQSVEKNGYL 464
Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ + W+ D + I LP+ L +DD + +++GP +LAG
Sbjct: 465 AIHKHWNTGDCIEIDLPMKLHIYQAKDDPKK----SVLMYGPVVLAG 507
>gi|398305096|ref|ZP_10508682.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus vallismortis
DV1-F-3]
Length = 762
Score = 249 bits (637), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 168/528 (31%), Positives = 261/528 (49%), Gaps = 37/528 (7%)
Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLS 171
+ + +Q EYLL LDVD L+ + S YGGWE E+ GH VGH+LS
Sbjct: 8 KGMFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAGHSVGHWLS 65
Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------L 222
A++ M+ ++ + +K K + V LS Q GY+S F FD + L
Sbjct: 66 AASAMYRASGDEELKRKTAYAVNELSHIQQFDQEGYVSGFSRACFDEVFSGDFRVDHFSL 125
Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
W P+Y++HK+ AGL+D Y L N AL++ + ++ +K + + E+ L
Sbjct: 126 GGSWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLNDEQFQRML 181
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
E GGMN+ + LY +T + +L LA F L LA D L HANT IP VIG
Sbjct: 182 ICEHGGMNEAMADLYMLTKNKAYLELAERFCHRAILQPLAEGKDELEGKHANTQIPKVIG 241
Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCT 402
+ Y++TG+ Y+ FF + V SYA GG S E + ++ LG ETC
Sbjct: 242 AAKLYDITGNEAYRNAALFFWEQVVYQRSYAIGGNSIGEHF--GAEGSEELGVTTAETCN 299
Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARST 462
TYNMLK++ HLFRW +E + DYYE AL N +L+ Q + G+ Y + G K
Sbjct: 300 TYNMLKLTAHLFRWFQESKFMDYYENALYNHILASQ-DPDSGMKTYFVSTQPGHFKV--- 355
Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
+ + +SFWCC GTG+E+ ++ IY + + LY+ +I S + H+++ Q
Sbjct: 356 --YCSPEDSFWCCTGTGMENPARYTKHIYHIDRDD---LYVNLFIPSQIHVREKHMLIAQ 410
Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNF 582
+ P T K + G +L++R+P W + G +A++NG+ + +
Sbjct: 411 ETSF-----PAAEQTRLMVKKAD-GVPMALHIRIPYWAHG-GLKAAVNGKRIQPVEKNGY 463
Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
L + W+ D + + LP+ L +DD + +++GP +LAG
Sbjct: 464 LVIHKHWNTGDCIEVDLPMKLHLYQAKDDPKK----NVLMYGPVVLAG 507
>gi|350267868|ref|YP_004879175.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus subtilis
subsp. spizizenii TU-B-10]
gi|349600755|gb|AEP88543.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus subtilis
subsp. spizizenii TU-B-10]
Length = 761
Score = 249 bits (636), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 172/529 (32%), Positives = 267/529 (50%), Gaps = 39/529 (7%)
Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENPISELRGHFVGHYL 170
+ + +Q EYLL LDVD L+ + A L TP K YGGWE E+ GH +GH+L
Sbjct: 8 KGMFYDSQMKGKEYLLFLDVDRLLAPCYE-AVLQTPKKPRYGGWE--AKEIAGHSIGHWL 64
Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA--------- 221
SA++ M+ ++ + +K K V LS Q GY+S F FD +
Sbjct: 65 SAASAMYQASGDEELKRKAEYAVNELSHIQQFDEEGYVSGFSRACFDEVFSGDFRVDHFS 124
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
L W P+Y+IHK+ AGL+D Y L N AL++ + ++ +K + + E+
Sbjct: 125 LGGSWVPWYSIHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRM 180
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
L E GGMN+ + L+ +T + +L LA F L LA D L HANT IP VI
Sbjct: 181 LICEHGGMNEAMADLFMLTKNKAYLELAERFCHRAILQPLAEGKDELEGKHANTQIPKVI 240
Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
G+ Y++TG+ Y+ FF + V SYA GG S E + ++ LG ETC
Sbjct: 241 GAAKLYDITGNEAYRNAALFFWEQVVYQRSYAIGGNSIGEHF--GAEGSEELGVTTAETC 298
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
TYNMLK++ HLFRW E + DYYE AL N +L+ Q + G+ Y + G K
Sbjct: 299 NTYNMLKLTGHLFRWFHEARFMDYYENALYNHILASQ-DPDSGMKTYFVSTQPGHFKV-- 355
Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
+ + +SFWCC GTG+E+ ++ IY ++ + LY+ +I S + + +++
Sbjct: 356 ---YCSPEDSFWCCTGTGMENPARYTQHIYDIDQDD---LYVNLFIPSQINMQEKQLIIT 409
Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
Q+ S+ + L K+ G +L++R+P WT + G +A++NG+ +
Sbjct: 410 QE----TSFPAAEKTRLVV--KKADGVPMTLHIRIPYWT-NGGLKAAVNGKRIQSVEKNG 462
Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+L + W+ D + I LP+ L +DD + +++GP +LAG
Sbjct: 463 YLVIHKHWNTGDCIEIDLPMKLHIYQAKDDPKK----SVLMYGPVVLAG 507
>gi|237708621|ref|ZP_04539102.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
gi|229457321|gb|EEO63042.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
Length = 783
Score = 249 bits (635), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 170/544 (31%), Positives = 265/544 (48%), Gaps = 47/544 (8%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
+ DV L +S A+ ++ YLL +D D L+ + K A L + Y WEN + L G
Sbjct: 33 VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
H GHYLSA + M+A+T N IK ++ ++ L CQ+ G GYL P +++ E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
L W P Y IHK+ AGL D + + +A +K+ WM+ +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+I+ S E+ L E GG+N+ + +IT D ++L LAH F L L Q D L
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
+ HANT IP VIG + ++ G+ + +F + V S GG S RE +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+ L SE ETC TYNML++++ L+ + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDSVQGG-FV 380
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y P+ G + + SFWCC G+G+E+ ++ G+ IY ++ N LY+ +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S+ W H ++ ++ TL S ++ + + L R+P WT +
Sbjct: 433 PSTLRWGDIH------IEQQTAFPDEEGTTLAVSPEKGEKEFTLL-FRVPEWTNPEALRL 485
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
S+NG+ + ++S WS DK+ ++LP+ LR A+ D Y +IL+GP +
Sbjct: 486 SVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541
Query: 628 LAGH 631
LA
Sbjct: 542 LAAQ 545
>gi|329847096|ref|ZP_08262124.1| tat twin-arginine translocation pathway signal sequence domain
protein [Asticcacaulis biprosthecum C19]
gi|328842159|gb|EGF91728.1| tat twin-arginine translocation pathway signal sequence domain
protein [Asticcacaulis biprosthecum C19]
Length = 795
Score = 248 bits (634), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 174/556 (31%), Positives = 270/556 (48%), Gaps = 47/556 (8%)
Query: 94 LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGG 153
LP ++L DV L S A N YLL L+ D + ++RK A L + YGG
Sbjct: 36 LPQKRTTSLALGDVRL-LPSPFKTALDVNHTYLLTLEPDRFLHNYRKGAGLTPKAEKYGG 94
Query: 154 WENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP- 212
WEN + GH +GHYLSA + M+A T +AT+K + + V+ L+ Q G GY++ F
Sbjct: 95 WEN--DTIAGHSLGHYLSAISLMYAQTGDATLKARAAYVIDELALIQGMQGDGYVAGFTR 152
Query: 213 ----------TELFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALK 253
ELF +A L W P Y HK+ GL D + +
Sbjct: 153 KRPDGTIVDGKELFAEIKAGDIRSAGFDLNGCWVPLYNWHKLYTGLFDAQTFCGLNKGVV 212
Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
+AT + Y + V + ++ LN E GG+N+ L++ T D + L LA
Sbjct: 213 VATGLGHY----IDSVFAALNDDQVQQVLNCEFGGLNESFAELHARTGDARWLTLAERMH 268
Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYA 373
L + + D L++ H+NT IP V+G YE+TG Y FF + V HSY
Sbjct: 269 HNRVLDPMIKREDKLANIHSNTTIPKVLGLARLYEITGKADYHTASDFFWERVTGHHSYV 328
Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
GG RE++++P ++ + E C TYNML+++R L+ W + + DY+ERA N
Sbjct: 329 IGGNGDREYFFEPDTISRHITEATCEHCATYNMLRLTRFLYSWQPDASRFDYFERAHLNH 388
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
VLS Q+ + G+ YM PL G + G+ +++ CC+GTG+ES ++ +SI+++
Sbjct: 389 VLS-QQNPKTGMFSYMTPLFTGAER-----GFSDPVDNWTCCHGTGMESHARHAESIWWQ 442
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
L++ YI S+ W + L ++D +D +++ +T + +L+
Sbjct: 443 SADT---LFVNLYIPSTAQWTTKGASL--RMDTGYPYDGGVKLAVTALRRPTRFKLA--- 494
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
LR+P W + A +LNG+ G +L W DK+ + LPL LR EA D+
Sbjct: 495 LRVPGWAKT--AAVTLNGKPAQAVRDGGYLVIDRVWQAGDKIALDLPLDLRLEATSDN-- 550
Query: 614 EYASIQAILFGPYLLA 629
I A+L GP +LA
Sbjct: 551 --TGIVAVLRGPMVLA 564
>gi|345513549|ref|ZP_08793069.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
gi|229437570|gb|EEO47647.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
Length = 783
Score = 248 bits (634), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 170/544 (31%), Positives = 265/544 (48%), Gaps = 47/544 (8%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
+ DV L +S A+ ++ YLL +D D L+ + K A L + Y WEN + L G
Sbjct: 33 VRDVRL-TASPFKHAEDMDIRYLLGMDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
H GHYLSA + M+A+T N IK ++ ++ L CQ+ G GYL P +++ E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
L W P Y IHK+ AGL D + + +A +K+ WM+ +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+I+ S E+ L E GG+N+ + +IT D ++L LAH F L L Q D L
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
+ HANT IP VIG + ++ G+ + +F + V S GG S RE +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+ L SE ETC TYNML++++ L+ + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y P+ G + + SFWCC G+G+E+ ++ G+ IY ++ N LY+ +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S+ W H ++ ++ TL S ++ + + L R+P WT +
Sbjct: 433 PSTLRWGDIH------IEQQTAFPDEEGTTLAVSPEKGEKEFTLL-FRVPEWTNPEALRL 485
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
S+NG+ + ++S WS DK+ ++LP+ LR A+ D Y +IL+GP +
Sbjct: 486 SVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541
Query: 628 LAGH 631
LA
Sbjct: 542 LAAQ 545
>gi|346970201|gb|EGY13653.1| secreted protein [Verticillium dahliae VdLs.17]
Length = 634
Score = 248 bits (633), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 175/532 (32%), Positives = 259/532 (48%), Gaps = 43/532 (8%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMW 177
Q L Y+ +DVD L++ FR+T LP G + GGW+ P R HF GH+L+A + W
Sbjct: 65 QDRTLSYIKFVDVDRLLYVFRQTHGLPLQGAQPNGGWDAPDFPFRSHFQGHFLNAWSYCW 124
Query: 178 ASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAFPTELFDSFE--ALKPVWAPYY 230
A + +++ S L++CQ GYLS FP ++ E L PYY
Sbjct: 125 AVLRDEECRDRASYFATELAKCQANNEQAGFNPGYLSGFPESEIEALEKRTLSNGNVPYY 184
Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
+IHK +AGLLD + + A + M + R K+ S + ++ E GGMN
Sbjct: 185 SIHKTMAGLLDVWRHIGDETARDVLLGMAGWVDLRTGKL----SYSQMQTMMSTEFGGMN 240
Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
+V+ ++ T D + L +A FD LA D L+ HANT +P IG+ Y+ T
Sbjct: 241 EVMADIFHQTGDERWLTVAQRFDHASVFDPLAGNRDSLNGLHANTQVPKWIGAAREYKAT 300
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
G Y I +I +H+YA G S E + P +A L + E C TYNMLK++
Sbjct: 301 GTTRYSDIARNAWNITVQAHTYAIGANSQSEHFRPPNAIASYLDEDTAEACNTYNMLKLT 360
Query: 411 RHLFRWTKEIA---YADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKARST 462
R L W + + Y D+YE+AL N + Q + G + Y L RGV A
Sbjct: 361 REL--WVMDPSNSKYFDFYEQALINHAIGQQDPSSAHGHVTYFTSLNPGGHRGVGPAWGG 418
Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
W T + + WCC GT +E+ +KL DSIYF +E + LY+ Y S +W V + Q
Sbjct: 419 GTWSTDYGTAWCCQGTALETNTKLMDSIYFYDESS---LYVNLYAPSKLNWTQRKVTVLQ 475
Query: 523 KVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP--LPPP 579
+ + P L+ T T + K G L +R+P+W S GA ++NGQ L P
Sbjct: 476 ETEFP-------LQDTSTLTVKG--GGDWDLRVRIPMW--SKGATIAINGQALDGVEAAP 524
Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
G + + W D +TI LP++L T + D+ S+ A+ +GP +LA +
Sbjct: 525 GTYATIKRSWGEEDIVTITLPMALHTISANDE----PSVAALAYGPVVLAAN 572
>gi|423242461|ref|ZP_17223569.1| hypothetical protein HMPREF1065_04192 [Bacteroides dorei
CL03T12C01]
gi|392639254|gb|EIY33080.1| hypothetical protein HMPREF1065_04192 [Bacteroides dorei
CL03T12C01]
Length = 783
Score = 248 bits (633), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 170/544 (31%), Positives = 265/544 (48%), Gaps = 47/544 (8%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
+ DV L +S A+ ++ YLL +D D L+ + K A L + Y WEN + L G
Sbjct: 33 VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
H GHYLSA + M+A+T N IK ++ ++ L CQ+ G GYL P +++ E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
L W P Y IHK+ AGL D + + +A +K+ WM+ +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+I+ S E+ L E GG+N+ + +IT D ++L LAH F L L Q D L
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
+ HANT IP VIG + ++ G+ + +F + V S GG S RE +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+ L SE ETC TYNML++++ L+ + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y P+ G + + SFWCC G+G+E+ ++ G+ IY ++ N LY+ +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S+ W H ++ ++ TL S ++ + + L R+P WT +
Sbjct: 433 PSTLRWGDIH------IEQQTAFPDEEGTTLAVSPEKGEKEFTLL-FRVPEWTNPEALRL 485
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
S+NG+ + ++S WS DK+ ++LP+ LR A+ D Y +IL+GP +
Sbjct: 486 SVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541
Query: 628 LAGH 631
LA
Sbjct: 542 LAAQ 545
>gi|291544094|emb|CBL17203.1| Uncharacterized protein conserved in bacteria [Ruminococcus
champanellensis 18P13]
Length = 1075
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 192/619 (31%), Positives = 297/619 (47%), Gaps = 63/619 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENP 157
+++ SL D+ + + + A +EYLL D D L+ FR+ A L T G K Y GWEN
Sbjct: 36 IEDFSLADLTMTDAYTV-NAFSKEVEYLLSFDTDRLLCGFRENAKLDTKGAKRYAGWENT 94
Query: 158 ISELRGHFVGHYLSASAQMW-----ASTHNATIKEKMSTVVFSLSECQ--NKIGTGYL-- 208
+ + GH VGHYL+A AQ + + + ++ K+ ++ + CQ +K G+L
Sbjct: 95 L--IAGHSVGHYLTAVAQAYQNPTLTAAQRSALEGKIKALLDGMRVCQQNSKGKPGFLWA 152
Query: 209 ----SAFPTEL-FDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWM 258
+A E+ FD E + W P+YT+HKI+ GL+D Y N A +A+ +
Sbjct: 153 GQIKNANNVEVQFDLVEQGKTNIINESWVPWYTMHKIVQGLVDVYNATGNETAKTIASDL 212
Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCF- 317
++ YNR K +S + H L+ E GGMND LY LY IT H + AH FD+
Sbjct: 213 GDWTYNRASK----WSAQTHNTVLSIEYGGMNDCLYELYEITGKDTHAVAAHYFDETNLH 268
Query: 318 LGFLALQADYLSHFHANTHIPIVIGSQMRY------EVTGDPL----YKLIGTFFMDIVN 367
L + L++ HANT IP IG+ RY V G+ + Y F D+V
Sbjct: 269 EAVLKGGRNVLTNKHANTTIPKFIGALKRYIVLDGKTVNGEKIDASRYLEYAEAFWDMVT 328
Query: 368 ASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
H+Y TGG S E + + L + N ETC +YNMLK+SR LF+ T + Y D+YE
Sbjct: 329 THHTYITGGNSEWEHFGEDDILDKERTNCNCETCNSYNMLKLSRELFKITGDRKYMDFYE 388
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLG 487
N +LS Q E G+ Y P+ G K S + ++SFWCC G+G+ESF+KLG
Sbjct: 389 GTYYNSILSSQN-PESGMTTYFQPMATGYFKVYS-----SPYDSFWCCTGSGMESFTKLG 442
Query: 488 DSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVG 547
D++Y GN LY+ Y SS +W+ V + Q D + + T+ S G
Sbjct: 443 DTMYM-HSGNT--LYVNMYQSSVLNWEDQKVKITQ--DSNIPESDTAKFTIDGS-----G 492
Query: 548 QLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
L R+P W + ++NG ++ T + D +++ +P E
Sbjct: 493 SL-DFRFRIPSWK-AGKMTIAVNGTKYTYKTVNDYAQVTGDFKTGDVISVTIP----AEV 546
Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQE 667
+ + P+ ++ +GP +L+ E K+ T ++ PI S N +T ++E
Sbjct: 547 VAYNLPDNKAVYGFKYGPVVLSAELGTENMEKSSTGMWVTIPKDPIGSSQN---ITISKE 603
Query: 668 SGNSTFVMSNSNQSITMEE 686
+ T M+ N + ++
Sbjct: 604 GQSVTSFMAEINDHLVKDK 622
>gi|212691787|ref|ZP_03299915.1| hypothetical protein BACDOR_01282 [Bacteroides dorei DSM 17855]
gi|212665688|gb|EEB26260.1| hypothetical protein BACDOR_01282 [Bacteroides dorei DSM 17855]
Length = 783
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 170/544 (31%), Positives = 265/544 (48%), Gaps = 47/544 (8%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
+ DV L +S A+ ++ YLL +D D L+ + K A L + Y WEN + L G
Sbjct: 33 VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
H GHYLSA + M+A+T N IK ++ ++ L CQ+ G GYL P +++ E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
L W P Y IHK+ AGL D + + +A +K+ WM+ +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+I+ S E+ L E GG+N+ + +IT D ++L LAH F L L Q D L
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
+ HANT IP VIG + ++ G+ + +F + V S GG S RE +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+ L SE ETC TYNML++++ L+ + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y P+ G + + SFWCC G+G+E+ ++ G+ IY ++ N LY+ +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S+ W H ++ ++ TL S ++ + + L R+P WT +
Sbjct: 433 PSTLRWGDIH------IEQQTAFPDEEGTTLAVSPEKGEKEFALL-FRVPEWTNPEALRL 485
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
S+NG+ + ++S WS DK+ ++LP+ LR A+ D Y +IL+GP +
Sbjct: 486 SVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541
Query: 628 LAGH 631
LA
Sbjct: 542 LAAQ 545
>gi|265755220|ref|ZP_06089990.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|423231114|ref|ZP_17217517.1| hypothetical protein HMPREF1063_03337 [Bacteroides dorei
CL02T00C15]
gi|423246788|ref|ZP_17227840.1| hypothetical protein HMPREF1064_04046 [Bacteroides dorei
CL02T12C06]
gi|263234362|gb|EEZ19952.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|392629229|gb|EIY23239.1| hypothetical protein HMPREF1063_03337 [Bacteroides dorei
CL02T00C15]
gi|392634665|gb|EIY28581.1| hypothetical protein HMPREF1064_04046 [Bacteroides dorei
CL02T12C06]
Length = 783
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 170/544 (31%), Positives = 265/544 (48%), Gaps = 47/544 (8%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
+ DV L +S A+ ++ YLL +D D L+ + K A L + Y WEN + L G
Sbjct: 33 VRDVRL-TASPFKHAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
H GHYLSA + M+A+T N IK ++ ++ L CQ+ G GYL P +++ E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQK 268
L W P Y IHK+ AGL D + + +A +K+ WM+ +
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+I+ S E+ L E GG+N+ + +IT D ++L LAH F L L Q D L
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
+ HANT IP VIG + ++ G+ + +F + V S GG S RE +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 389 LADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+ L SE ETC TYNML++++ L+ + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y P+ G + + SFWCC G+G+E+ ++ G+ IY ++ N LY+ +I
Sbjct: 381 YFTPMRAGHYRV-----YSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVNLFI 432
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S+ W H ++ ++ TL S ++ + + L R+P WT +
Sbjct: 433 PSTLRWGDIH------IEQQTAFPDEEGTTLAVSPEKGEKEFTLL-FRVPEWTNPEALRL 485
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
S+NG+ + ++S WS DK+ ++LP+ LR A+ D Y +IL+GP +
Sbjct: 486 SVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGPIV 541
Query: 628 LAGH 631
LA
Sbjct: 542 LAAQ 545
>gi|317476834|ref|ZP_07936077.1| hypothetical protein HMPREF1016_03061 [Bacteroides eggerthii
1_2_48FAA]
gi|316907009|gb|EFV28720.1| hypothetical protein HMPREF1016_03061 [Bacteroides eggerthii
1_2_48FAA]
Length = 781
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 170/546 (31%), Positives = 270/546 (49%), Gaps = 43/546 (7%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L D+ L +S L +AQQT+L Y++ ++ D L+ F + A L +Y WEN + L G
Sbjct: 30 LQDIKLLESPFL-QAQQTDLHYIMAMNPDRLLAPFLREAGLAPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF------- 216
H GHY+SA + M+A+T + T+ +++ ++ L Q +G G++ P L
Sbjct: 87 HIGGHYISALSMMYAATGDTTVYNRLNYMLNELHRAQQAVGNGFIGGTPGSLQLWKEIKE 146
Query: 217 -----DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
+SF +L W P Y IHK AGL D Y+ A + A +M + ++ + + +
Sbjct: 147 GNIRPESF-SLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIALTDW----MAGITS 201
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
+ ++ L E GG+N++ + IT D K+L LA F L L D+L+
Sbjct: 202 GLTEQQMQDMLRSEHGGLNEIFADVADITGDKKYLELARRFSHKTLLEPLIGGEDHLTGM 261
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
HANT IP VIG + ++T + + FF + V S GG S RE +
Sbjct: 262 HANTQIPKVIGYKRIADLTQNDAWDQAARFFWNTVVNHRSVCIGGNSVREHFHPADNFTS 321
Query: 392 TLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
L + ETC TYNML++++ LF+ + +I +ADYYERAL N +L+ Q+ + G +Y
Sbjct: 322 MLNDVQGPETCNTYNMLRLTKMLFQTSPDIRFADYYERALYNHILASQQPAKGG-FVYFT 380
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
P+ G + + S WCC G+G+E+ +K G+ IY E LY+ +I S
Sbjct: 381 PMRSGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAHAEDT---LYVNLFIPSR 432
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
WK + L Q+ + +R + S+K+ SL R P W + GA S+N
Sbjct: 433 LTWKEQKLTLVQESR--FPDEAQIRFRIEKSNKKTF----SLKFRYPSW--AKGASVSVN 484
Query: 571 GQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
G+ + PG +L+ +W D++T+ LP+ + E I D Y A ++GP +LA
Sbjct: 485 GKVQDINAQPGEYLTVRRKWKAGDEITLNLPMQVTLEQIPDQEHFY----AFMYGPIVLA 540
Query: 630 GHTSGE 635
T E
Sbjct: 541 SPTGTE 546
>gi|217973327|ref|YP_002358078.1| hypothetical protein Sbal223_2153 [Shewanella baltica OS223]
gi|217498462|gb|ACK46655.1| protein of unknown function DUF1680 [Shewanella baltica OS223]
Length = 792
Score = 247 bits (631), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 174/555 (31%), Positives = 270/555 (48%), Gaps = 57/555 (10%)
Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
+ L+DV + L AQQT+L Y++ +D + L+ +RK A + T + Y WE+ + L
Sbjct: 23 IPLNDVRITAGPFL-HAQQTDLHYIMSMDPERLLAPYRKDAGIATTAENYPNWED--TGL 79
Query: 162 RGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSF 219
GH GHYLSA A M+A+T + + +++ +V L +CQ G GYL P +L+
Sbjct: 80 DGHIGGHYLSALALMYAATSDKAVLARLNYMVAELEKCQQAHGNGYLGGVPNSRKLWQQI 139
Query: 220 E---------ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
E L W P+Y +HK+ +GL D ++ +N A KM ++ + K+
Sbjct: 140 EQGKIEADLFTLNQAWVPWYNVHKVFSGLRDAHLYTNNPTAKKMLVHFADWMLHLSNKL- 198
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
S E+ L E GG+N+ L +Y IT K+L LA + L L D L+
Sbjct: 199 ---SDEQLQLMLRTEYGGLNETLADVYVITGQDKYLALAKRYTDQSLLQPLLHHEDKLTG 255
Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
HANT IP ++G E++ + ++ FF V + + GG S RE + +
Sbjct: 256 LHANTQIPKIVGVARIAELSNNKVWLDSADFFWQQVVHKRTVSIGGNSVREHFHPSDDFS 315
Query: 391 DTLGS-ENEETCTTYNMLKVSRHLF------RWTKEIAYADYYERALTNGVLSIQRGTEP 443
L S E ETC TYNMLK+S+ L+ ++AY +YYERAL N +LS Q E
Sbjct: 316 SMLESAEGPETCNTYNMLKLSKLLYENKLLDENKADLAYIEYYERALYNHILSSQH-PEN 374
Query: 444 GVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
G ++Y P+ R H + + S WCC G+GIE+ +K G+ IY E +
Sbjct: 375 GGLVYFTPM-------RPDHYRVYSSAQQSMWCCVGSGIENHAKYGELIYASEGDD---F 424
Query: 502 YIIQYISSSFDWKSGHVVLNQKV---DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
Y+ ++ S W+ + L QK D S +TL ++ +LN+R P
Sbjct: 425 YVNLFVDSEVHWQEKGITLTQKTLFPDANTS-----EITLDKDAQ------FALNVRYPQ 473
Query: 559 WTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W N S+NGQ G ++ +W DK++I LP+++ E I P+ +S
Sbjct: 474 WVQHNDLTLSINGQAQKFNAVAGQYIKIKRQWHKGDKISITLPMTVTLEQI----PDRSS 529
Query: 618 IQAILFGPYLLAGHT 632
++L+GP +LA T
Sbjct: 530 YYSVLYGPIVLAAKT 544
>gi|436835729|ref|YP_007320945.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
gi|384067142|emb|CCH00352.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
Length = 760
Score = 247 bits (631), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 174/549 (31%), Positives = 266/549 (48%), Gaps = 43/549 (7%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
++ +L DV L AQ + Y+L L+ D L+ + A LP YG WE+
Sbjct: 22 MQPFALQDVKL-TGGPFKNAQDVDQRYILALNPDKLLAPYLIDAGLPVKAPRYGNWES-- 78
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
S L GH GHYLSA A ++AST +A +K+++ +V L++CQ K G GY+ P
Sbjct: 79 SGLDGHIGGHYLSALAMLYASTGDAELKKRLDYMVDQLAQCQAKNGNGYVGGIPQGKVFW 138
Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
++ S L W P Y IHK+ AGL D Y A N QA ++ + ++F
Sbjct: 139 ERIHKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDAYEYAGNQQAKQVLIGLGDWFV---- 194
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
++I S E+ L E GG+N+ LY +T+D K+L A L L + D
Sbjct: 195 ELIKPLSDEQIQQVLRTEHGGINETFADLYILTNDKKYLETAQRISHRAILEPLLAKQDK 254
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L+ HANT IP VIG + + G P + T+F V+ S A GG S RE +
Sbjct: 255 LTGLHANTQIPKVIGFEKIAMLAGKPDWSDAATYFWQNVSQHRSVAFGGNSVREHFNPTT 314
Query: 388 RLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
+ L S + ETC ++NML++S+ LF ++ Y D+YERAL N +LS Q E G
Sbjct: 315 DFSQVLRSNQGPETCNSFNMLRLSKALFLDKSDVTYLDFYERALYNHILSSQH-PEKGGF 373
Query: 447 IYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
+Y P+ R H + S WCC G+GIE+ +K G+ IY + L++
Sbjct: 374 VYFTPI-------RPNHYRVYSQPETSMWCCVGSGIENHTKYGELIYSHSAND---LFVN 423
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
+I S+ +W +V L Q+ + + + + + QE SLN+R P W +
Sbjct: 424 LFIPSTVNWADKNVKLTQRTE--FPYKNESDLVIETTKPQEF----SLNIRYPKWAENLV 477
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
+ Q + P G +++ +W DK+T++ S R E + P+ ++ A + G
Sbjct: 478 VLVNGKAQAVADAPAG-YVAVARKWRAGDKVTVRFNTSTRLEQL----PDGSNWSAFVHG 532
Query: 625 PYLLAGHTS 633
P +LA TS
Sbjct: 533 PIVLAAKTS 541
>gi|218129947|ref|ZP_03458751.1| hypothetical protein BACEGG_01530 [Bacteroides eggerthii DSM 20697]
gi|217988057|gb|EEC54382.1| hypothetical protein BACEGG_01530 [Bacteroides eggerthii DSM 20697]
Length = 781
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 170/546 (31%), Positives = 270/546 (49%), Gaps = 43/546 (7%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L D+ L +S L +AQQT+L Y++ ++ D L+ F + A L +Y WEN + L G
Sbjct: 30 LQDIKLLESPFL-QAQQTDLYYIMAMNPDRLLAPFLREAGLAPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF------- 216
H GHY+SA + M+A+T + T+ +++ ++ L Q +G G++ P L
Sbjct: 87 HIGGHYISALSMMYAATGDTTVYNRLNYMLNELHRAQQAVGNGFIGGTPGSLQLWKEIKE 146
Query: 217 -----DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
+SF +L W P Y IHK AGL D Y+ A + A +M + ++ + + +
Sbjct: 147 GSIRPESF-SLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIALTDW----MAGITS 201
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
+ ++ L E GG+N++ + IT D K+L LA F L L D+L+
Sbjct: 202 GLTEQQMQDMLRSEHGGLNEIFADVADITGDKKYLELARRFSHKTLLEPLIGGEDHLTGM 261
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
HANT IP VIG + ++T + + FF + V S GG S RE +
Sbjct: 262 HANTQIPKVIGYKRIADLTQNDAWDQAARFFWNTVVNHRSVCIGGNSVREHFHPADNFTS 321
Query: 392 TLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
L + ETC TYNML++++ LF+ + +I +ADYYERAL N +L+ Q+ + G +Y
Sbjct: 322 MLNDVQGPETCNTYNMLRLTKMLFQTSPDIRFADYYERALYNHILASQQPAKGG-FVYFT 380
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
P+ G + + S WCC G+G+E+ +K G+ IY E LY+ +I S
Sbjct: 381 PMRSGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAHAEDT---LYVNLFIPSR 432
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
WK + L Q+ + +R + S+K+ SL R P W + GA S+N
Sbjct: 433 LTWKEQKLTLVQESR--FPDEAQIRFRIEKSNKKTF----SLKFRYPSW--AKGASVSVN 484
Query: 571 GQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
G+ + PG +L+ +W D++T+ LP+ + E I D Y A ++GP +LA
Sbjct: 485 GKVQDINAQPGEYLTVRRKWKAGDEITLNLPMQVTLEQIPDQEHFY----AFMYGPIVLA 540
Query: 630 GHTSGE 635
T E
Sbjct: 541 SPTGTE 546
>gi|224540696|ref|ZP_03681235.1| hypothetical protein BACCELL_05610 [Bacteroides cellulosilyticus
DSM 14838]
gi|224517692|gb|EEF86797.1| hypothetical protein BACCELL_05610 [Bacteroides cellulosilyticus
DSM 14838]
Length = 782
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 172/552 (31%), Positives = 271/552 (49%), Gaps = 44/552 (7%)
Query: 100 KEVS---LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
+EVS L DV L +S L +AQQT+L Y++ ++ D L+ F + A L +Y WEN
Sbjct: 24 QEVSYFPLQDVKLLESPFL-QAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWEN 82
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TE 214
+ L GH GHY+SA + M+A+T + I +++ ++ L Q +GTG++ P +
Sbjct: 83 --TGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQ 140
Query: 215 LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
L+ +A L W P Y IHK AGL D Y+ A + A +M + ++ +
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLVALTDWMID- 199
Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
+ + ++ L E GG+N+ + IT D K+L LA F L L
Sbjct: 200 ---ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPLVKDE 256
Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
D L+ HANT IP VIG + ++ D + FF + V S GG S RE +
Sbjct: 257 DRLTGMHANTQIPKVIGYKRIADLAQDQNWDHAARFFWNTVVNHRSVCIGGNSVREHFHP 316
Query: 386 PKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
L + ETC TYNML++++ L++ + +I +ADYYERAL N +L+ Q+ T+ G
Sbjct: 317 ADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNHILASQQPTKGG 376
Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
+Y P+ G + + S WCC G+G+E+ +K G+ IY + LY+
Sbjct: 377 -FVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAHAKDT---LYVN 427
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
+I S WK + L Q+ + +R + S K+ SL LR P W + G
Sbjct: 428 LFIPSRLTWKDKKITLVQETR--FPDEEQIRFRVEKSKKKAF----SLKLRYPSW--AKG 479
Query: 565 AQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
A S+NG+ PG +L+ +W D++T+ +P+ + E I P+ + A ++
Sbjct: 480 ASVSVNGKVQETNAQPGEYLTIHRKWKAGDEITLNMPMQVALEQI----PDRENFYAFMY 535
Query: 624 GPYLLAGHTSGE 635
GP +LA T E
Sbjct: 536 GPIVLASPTGTE 547
>gi|251798256|ref|YP_003012987.1| hypothetical protein Pjdr2_4277 [Paenibacillus sp. JDR-2]
gi|247545882|gb|ACT02901.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 605
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 172/529 (32%), Positives = 254/529 (48%), Gaps = 57/529 (10%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNAT 184
Y+ D++ L+ +F+ A + + + GGWE P LRGHFVGHYLSA A+ H+ T
Sbjct: 27 YIREFDLERLMHTFKINAGISSTAEPLGGWEAPDCGLRGHFVGHYLSACAKFAYGDHDGT 86
Query: 185 IKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD--SFEALKPVWAPYYTIHKILAGLLDQ 242
+K +V + C +GYLSAF E D E + VWAPYYT+HKI+ GL+D
Sbjct: 87 LKTMADEIVDVMQACAQP--SGYLSAFEEEKLDVLELEENRDVWAPYYTLHKIMQGLIDC 144
Query: 243 YVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWY--------SLN--EETGGMNDV 292
YV N QAL++A + Y R + + HW LN E GG+ D
Sbjct: 145 YVYLQNTQALELAVNLAHYIRRRFEYL-------SHWKIDGILRCTKLNPVNEFGGLGDS 197
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
LY LY +T D L LAHLFD+ +L LA D L HANTH+P+++ RY++ +
Sbjct: 198 LYTLYELTGDAALLGLAHLFDRDYWLWPLAEGRDVLEDLHANTHLPMILACMHRYKIREE 257
Query: 353 PLYK---------LIGTFFMDIVNASHSYA--TGGTSAR-EFWWDPKRLADTLGSENEET 400
YK L+G F + N+S + A GG S + E W LAD L E+
Sbjct: 258 DSYKKSALHFYDFLMGRTFANGNNSSKATAFIQGGVSEKAEHWGGYGELADALTGGESES 317
Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
C +N K+ L W+ EI Y D+ E N +L+ + G+ Y PLG K
Sbjct: 318 CCAHNTEKIVERLLEWSPEIGYLDHLESLKYNAILN-SASAKTGLSQYHQPLGTNAVKKF 376
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
S ++SFWCC G+GIE+ S+L +I+F N + + ++SS WK +V+
Sbjct: 377 S-----EPYHSFWCCTGSGIEAMSELQKNIWFR---NGNAILLNAFVSSKAAWKERGIVI 428
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPG 580
+Q+ S+ L L F + + V LRM ++ N + + L
Sbjct: 429 HQR----TSFPDSLISALHFETDEPV------ELRM-MFKEKAIKNIRFNDEGIHLQKEE 477
Query: 581 NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
++ + D++ I++ SLR + P + A+L+G LLA
Sbjct: 478 GYIVVERLFRNGDRMDIEIEASLRLIPL----PGSEAESALLYGNVLLA 522
>gi|423224675|ref|ZP_17211143.1| hypothetical protein HMPREF1062_03329 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392635115|gb|EIY29021.1| hypothetical protein HMPREF1062_03329 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 782
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 172/552 (31%), Positives = 271/552 (49%), Gaps = 44/552 (7%)
Query: 100 KEVS---LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
+EVS L DV L +S L +AQQT+L Y++ ++ D L+ F + A L +Y WEN
Sbjct: 24 QEVSYFPLQDVKLLESPFL-QAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWEN 82
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TE 214
+ L GH GHY+SA + M+A+T + I +++ ++ L Q +GTG++ P +
Sbjct: 83 --TGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQ 140
Query: 215 LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
L+ +A L W P Y IHK AGL D Y+ A + A +M + ++ +
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLVALTDWMID- 199
Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
+ + ++ L E GG+N+ + IT D K+L LA F L L
Sbjct: 200 ---ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPLVKDE 256
Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
D L+ HANT IP VIG + ++ D + FF + V S GG S RE +
Sbjct: 257 DCLTGMHANTQIPKVIGYKRIADLAQDQNWDHAARFFWNTVVNHRSVCIGGNSVREHFHP 316
Query: 386 PKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
L + ETC TYNML++++ L++ + +I +ADYYERAL N +L+ Q+ T+ G
Sbjct: 317 ADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNHILASQQPTKGG 376
Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
+Y P+ G + + S WCC G+G+E+ +K G+ IY + LY+
Sbjct: 377 -FVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAHAKDT---LYVN 427
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
+I S WK + L Q+ + +R + S K+ SL LR P W + G
Sbjct: 428 LFIPSRLTWKEKKITLVQETR--FPDEEQIRFRVEKSKKKAF----SLKLRYPSW--AKG 479
Query: 565 AQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
A S+NG+ PG +L+ +W D++T+ +P+ + E I P+ + A ++
Sbjct: 480 ASVSVNGKVQETNAQPGEYLTIHRKWKAGDEITLNMPMQVALEQI----PDRENFYAFMY 535
Query: 624 GPYLLAGHTSGE 635
GP +LA T E
Sbjct: 536 GPIVLASPTGTE 547
>gi|284036341|ref|YP_003386271.1| hypothetical protein Slin_1422 [Spirosoma linguale DSM 74]
gi|283815634|gb|ADB37472.1| protein of unknown function DUF1680 [Spirosoma linguale DSM 74]
Length = 760
Score = 246 bits (628), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 169/531 (31%), Positives = 262/531 (49%), Gaps = 44/531 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ +L+Y+L L+ + L+ + A LP YG WE+ S L GH GHYLSA A M+
Sbjct: 40 AQDVDLKYILALNPNKLLAPYLIDAGLPEKAPRYGNWES--SGLDGHIGGHYLSALAMMY 97
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEALKPVW 226
AST NA K+++ +V L++CQ K G GY+ P ++ S L W
Sbjct: 98 ASTGNAETKKRLDYMVDQLAQCQAKNGNGYVGGIPQGKVFWERIHKGDIDGSSFGLNNTW 157
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
P Y IHK+ AGL D Y A N QA ++ + ++F ++I S E+ L E
Sbjct: 158 VPLYNIHKLFAGLRDAYEYAGNQQAKQVLIGLGDWFV----ELIKPLSDEQIQQVLRTEH 213
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GG+N+ LY +T D K+L A L L + D L+ HANT IP VIG +
Sbjct: 214 GGINETFADLYILTKDQKYLETAQRISHRAILDPLIDKQDKLTGLHANTQIPKVIGFEKI 273
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
+TG + +F V+ + S A GG S RE + + L S + ETC ++N
Sbjct: 274 ATLTGKSDWSDAAQYFWQNVSQTRSVAFGGNSVREHFNPTTDFSQLLRSNQGPETCNSFN 333
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH-- 463
ML++S+ LF +++Y D+YER + N +LS Q E G +Y P+ R H
Sbjct: 334 MLRLSKALFLDKNDVSYLDFYERTMYNHILSSQH-PEKGGFVYFTPI-------RPNHYR 385
Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
+ S WCC G+GIE+ +K G+ IY + L++ +I S+ +W + L Q+
Sbjct: 386 VYSQPETSMWCCVGSGIENHTKYGELIYSHSAND---LFVNLFIPSTVNWADKKLKLTQQ 442
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNF 582
+ + + S QE+ SLN+R P W + + +NG+ P+ P ++
Sbjct: 443 TQ--FPYQNQSELIIETSRPQEL----SLNIRYPKW--AENLEVLVNGKAQPVTGKPASY 494
Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
++ +W DK+T++ + R E + P+ ++ A + GP +LA TS
Sbjct: 495 VAVNRKWKSGDKVTVRFKTTTRLEQL----PDGSNWAAFVNGPIVLAAKTS 541
>gi|300777572|ref|ZP_07087430.1| acetyl-CoA carboxylase [Chryseobacterium gleum ATCC 35910]
gi|300503082|gb|EFK34222.1| acetyl-CoA carboxylase [Chryseobacterium gleum ATCC 35910]
Length = 791
Score = 246 bits (628), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 172/538 (31%), Positives = 259/538 (48%), Gaps = 42/538 (7%)
Query: 113 SVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSA 172
SV +A + + +YL+ L+ D L+ + K A L Y WEN + L GH GHY+SA
Sbjct: 37 SVFSKAMKADHKYLMALEPDRLLAPYLKEAGLKPKANNYPNWEN--TGLDGHIGGHYISA 94
Query: 173 SAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEA 221
+ M+AST + I+E+++ ++ L CQ GY+S P + S
Sbjct: 95 LSLMYASTGDKAIQERINYMISELERCQKASPDGYISGIPNGKKIWKEIKQGNIRASGFG 154
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
L W P Y IHK+ +GL D Y A N +A M + ++ N V + S E+
Sbjct: 155 LNDRWVPLYNIHKLYSGLRDAYWYAKNEKAKAMLIKLTDWMANEVSNL----SDEQIQDM 210
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
L E GG+N+V +Y ITHD K+L LAH F L L D L+ HANT IP VI
Sbjct: 211 LRSEHGGLNEVFADVYEITHDQKYLKLAHRFSHQAILSPLLTGEDKLTGLHANTQIPKVI 270
Query: 342 GSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEET 400
G + ++ + + FF V S GG S E + + + S E ET
Sbjct: 271 GYKRIADLENNTSWSNAADFFWHNVTEKRSSVIGGNSVSEHFNPVNDFSSMIKSIEGPET 330
Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
C TYNMLK+++ L+ E Y DYYE+AL N +LS + + G +Y P+ G +
Sbjct: 331 CNTYNMLKLTKELYATLPESYYIDYYEKALYNHILSTE-NHDHGGFVYFTPMRPGHYRVY 389
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
S SFWCC G+GIE+ +K G+ IY + + LY+ +I S+ WK +VVL
Sbjct: 390 SQPQ-----TSFWCCVGSGIENHAKYGEMIYARSDKD---LYVNLFIPSTLTWKQQNVVL 441
Query: 521 NQKVDPIVSWDPYLRMTLTFSS--KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
Q + ++ TL F + K E L LR P WT + + +NG+ +
Sbjct: 442 RQ----VNNFPEAPETTLIFDAAGKSEF----DLKLRCPEWTTPSEVKILVNGKQERVQR 493
Query: 579 PGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ + + T++W D + + LP+ L E + P++++ A +GP +LA E
Sbjct: 494 GSDGYFTLTKKWKKGDVVKMTLPMQLSAEQL----PDHSNYYAFKYGPVVLAAKYGTE 547
>gi|357046482|ref|ZP_09108109.1| hypothetical protein HMPREF9441_02134 [Paraprevotella clara YIT
11840]
gi|355530721|gb|EHH00127.1| hypothetical protein HMPREF9441_02134 [Paraprevotella clara YIT
11840]
Length = 762
Score = 245 bits (625), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 171/538 (31%), Positives = 264/538 (49%), Gaps = 40/538 (7%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L+DV L QS A+ ++ YLL LD D L+ + K A L Y WEN + L G
Sbjct: 8 LNDVRLTQSP-FKHAEDLDVRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWEN--TGLDG 64
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT---------- 213
H GHY+SA + M+A+T + IK+++ ++ L Q+ G GYL P
Sbjct: 65 HIGGHYVSALSYMYAATGDEEIKQRLDYMLSELKRAQDAAGDGYLCGAPNGRKIWEAVSK 124
Query: 214 -ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITM 272
++ S L W P Y IHK AGL D Y+LA + +A M + ++ N + +
Sbjct: 125 GDIQASSFGLNGGWVPLYNIHKTYAGLRDAYLLAGSKEARDMLVKLTDWMMNLTKDL--- 181
Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
S E+ L E GG+N+V + +T +L LA F L L D L+ H
Sbjct: 182 -SDEQIQDMLRSEHGGLNEVFADVADLTGKDNYLQLARRFSHREILDPLLEHEDRLTGKH 240
Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
ANT IP VIG + ++ GD + FF + V S + GG S RE + + +
Sbjct: 241 ANTQIPKVIGYKRIADLQGDEGWDDAARFFWETVVERRSISIGGNSVREHFHPSEDFSSM 300
Query: 393 LGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
L SE ETC TYNML++++ L++ + ++ Y DYYERAL N +LS + G +Y P
Sbjct: 301 LTSEQGPETCNTYNMLRLTKMLYQTSADVHYMDYYERALYNHILSTIDPVQGG-FVYFTP 359
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
+ G + S SFWCC G+G+E+ +K G+ IY E LY+ +I S
Sbjct: 360 MRSGHYRVYS-----QPQTSFWCCVGSGMENHAKYGEMIYGHSEDE---LYVNLFIPSVL 411
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
W G V + Q ++ PY T S + + ++ R+P WT + + ++NG
Sbjct: 412 QW--GKVRVEQ-----LTGFPYEEATTLHLSCGKAKEF-TVKFRVPEWTDVSQMELTVNG 463
Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
P+ G +++ + +W+ D++ + LP+SLR A+ D Y + ++GP +LA
Sbjct: 464 TAQPVSVSGGYVTVSRKWADGDEVRLTLPMSLRVAALPDGSDNY----SFMYGPIVLA 517
>gi|94494954|ref|ZP_01301535.1| hypothetical protein SKA58_00635 [Sphingomonas sp. SKA58]
gi|94425220|gb|EAT10240.1| hypothetical protein SKA58_00635 [Sphingomonas sp. SKA58]
Length = 665
Score = 245 bits (625), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 181/549 (32%), Positives = 262/549 (47%), Gaps = 51/549 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
LK + DV LD L AQ+ YLL L D ++ +FR A L YGGWE+
Sbjct: 64 LKPFDMADVTLDDGPFL-HAQRMTETYLLRLQPDRMLHNFRINAGLKPKAPVYGGWESEP 122
Query: 159 S----ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP-- 212
+ GH +GHYLSA A + ST + K+++ + L+ CQ +G + AFP
Sbjct: 123 TWAEINCHGHTLGHYLSACALAYRSTRDRRFKQRLDYIASELAACQKAAHSGLICAFPDG 182
Query: 213 TELFDSFEALKPVWA-PYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
L + +P+ P+YT+HKI AGL D +LAD+ +A L++A W V
Sbjct: 183 PALVAAHINGEPITGVPWYTLHKIYAGLRDAALLADSREAREVLLRLADWGV-------- 234
Query: 268 KVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
V T + + + L E GGMN++ LY++T ++ LA F + L D
Sbjct: 235 -VATRPLSDAQFEAMLATEHGGMNEIYADLYAMTGKEEYRTLARRFSHKAVMEPLVAGKD 293
Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
L HANT +P ++G Q YE TGD Y FF V + S+ATGG E ++
Sbjct: 294 LLDGMHANTQVPKIVGFQRVYEETGDDRYAKAADFFFRTVAHTRSFATGGHGDNEHFF-- 351
Query: 387 KRLAD----TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
+AD ++ ETC +NMLK++R LF + YADYYER L NG+L+ Q +
Sbjct: 352 -AMADFESHVFSAKGSETCCQHNMLKLARLLFMQDPQADYADYYERTLYNGILASQ-DPD 409
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
G+ Y G K + T +SFWCC GTG+E+ K DSIYF ++ + LY
Sbjct: 410 SGMATYFQGARPGYMKL-----YHTPEDSFWCCTGTGMENHVKYRDSIYFHDDRS---LY 461
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
+ ++ S+ W L Q P + T + E+ +L+LR P W S
Sbjct: 462 VSLFLPSAVQWADKGARLEQATS--FPDTPSTSLKWTLRTPVEI----ALHLRHPRW--S 513
Query: 563 NGAQASLNGQN-LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
A +NG+ L PG FL T W D++ + L + E+ P +I A
Sbjct: 514 PTATVRVNGREVLRSTAPGRFLEVTRLWRDGDRVELTLDMMPGVESA----PAAPNIVAF 569
Query: 622 LFGPYLLAG 630
+GP +LAG
Sbjct: 570 TYGPLVLAG 578
>gi|332882274|ref|ZP_08449902.1| hypothetical protein HMPREF9074_05700 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|332679658|gb|EGJ52627.1| hypothetical protein HMPREF9074_05700 [Capnocytophaga sp. oral
taxon 329 str. F0087]
Length = 786
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 170/538 (31%), Positives = 264/538 (49%), Gaps = 40/538 (7%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L+DV L QS A+ ++ YLL LD D L+ + K A L Y WEN + L G
Sbjct: 32 LNDVRLTQSP-FKHAEDLDVRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWEN--TGLDG 88
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT---------- 213
H GHY+SA + M+A+T + IK+++ ++ L Q+ G GYL P
Sbjct: 89 HIGGHYVSALSYMYAATGDEEIKQRLDYMLSELKRAQDAAGDGYLCGAPNGRKIWEAVSK 148
Query: 214 -ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITM 272
++ S L W P Y IHK AGL D Y+LA + +A M + ++ N + +
Sbjct: 149 GDIQASSFGLNGGWVPLYNIHKTYAGLRDAYLLAGSKEARDMLVKLTDWMMNLTKDL--- 205
Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
S E+ L E GG+N+V + +T +L LA F L L D L+ H
Sbjct: 206 -SDEQIQDMLRSEHGGLNEVFADVADLTGKDNYLQLARRFSHREILDPLLEHEDRLTGKH 264
Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
ANT IP VIG + ++ GD + FF + V S + GG S RE + + +
Sbjct: 265 ANTQIPKVIGYKRIADLQGDEGWDDAARFFWETVVERRSISIGGNSVREHFHPSEDFSSM 324
Query: 393 LGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
L SE ETC TYNML++++ L++ + ++ Y DYYERAL N +LS + G +Y P
Sbjct: 325 LTSEQGPETCNTYNMLRLTKMLYQTSADVHYMDYYERALYNHILSTIDPVQGG-FVYFTP 383
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
+ G + + SFWCC G+G+E+ +K G+ IY E LY+ +I S
Sbjct: 384 MRSGHYRV-----YSQPQTSFWCCVGSGMENHAKYGEMIYGHSEDE---LYVNLFIPSVL 435
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
W G V + Q ++ PY T S + + ++ R+P WT + + ++NG
Sbjct: 436 QW--GKVRVEQ-----LTGFPYEEATTLHLSCGKAKEF-TVKFRVPEWTDVSQMELTVNG 487
Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
P+ G +++ + +W+ D++ + LP+SLR A+ D Y + ++GP +LA
Sbjct: 488 TAQPVSVSGGYVTVSRKWADGDEVRLTLPMSLRVAALPDGSDNY----SFMYGPIVLA 541
>gi|392554933|ref|ZP_10302070.1| Acetyl-CoA carboxylase, biotin carboxylase [Pseudoalteromonas
undina NCIMB 2128]
Length = 816
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 179/545 (32%), Positives = 267/545 (48%), Gaps = 46/545 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L++VSL +S AQQTN+ YLL L D L+ + + A + +YG WE+
Sbjct: 51 LEQVSL------SASPFLHAQQTNVRYLLALHPDQLLAPYLREAGIEPKASSYGNWED-- 102
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
S L GH GHYLSA + WA+T + +K ++ ++ L Q ++ GYL P
Sbjct: 103 SGLDGHIGGHYLSALSLAWAATGDEELKRRLDYMLNELQRAQ-QVNDGYLGGIPNGQAMW 161
Query: 214 -ELFD-----SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
++ D +L W P Y I KI GL D Y++A + QA M + E+F N
Sbjct: 162 QQIHDGNIKADLFSLNDRWVPLYNIDKIFHGLRDAYLIAGSEQAKTMLFGLGEWFLNLTS 221
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
K+ S E+ L E GG+N V + +I +D ++L LA F + L + D
Sbjct: 222 KL----SDEQIQQMLYSEYGGLNAVFADMATIGNDKRYLKLARQFTHHSIVDPLLKKQDK 277
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L+ HANT IP +IG E + D ++ +F V S A GG S RE + D K
Sbjct: 278 LTGLHANTQIPKIIGMLKVAETSDDEAWQQGADYFWQTVTKERSVAIGGNSVREHFHDKK 337
Query: 388 RLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
+ E ETC TYNM+K+S+ LF T + Y +YYERA N +LS Q E G +
Sbjct: 338 DFTAMVEDVEGPETCNTYNMMKLSKLLFLKTADTRYLEYYERATYNHILSSQH-PEHGGL 396
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
+Y P+ G + S+ +S WCC G+GIE+ SK G+ IY + + N L++ +
Sbjct: 397 VYFTPMRPGHYRMYSSVQ-----DSMWCCVGSGIENHSKYGELIYSKNDDN---LWVNLF 448
Query: 507 ISSSFDW-KSGHVVLNQKVDPIVSWDPYLRMTLTFSS-KQEVGQLSSLNLRMPVWTYSNG 564
ISS+ DW + G V Q P + +TL F++ ++ + L++R P W +
Sbjct: 449 ISSTLDWQQQGLKVTQQSHFPDAN-----NVTLVFNTLDKKDNSPAQLHIRKPSWI-TGD 502
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
Q LNG+ + + + W DKLT L L TE + D + Y A+L+G
Sbjct: 503 LQFKLNGKPINATAEQGYYAIKHDWHDGDKLTFTLAPKLYTEQLPDGQDYY----AVLYG 558
Query: 625 PYLLA 629
P ++A
Sbjct: 559 PVVMA 563
>gi|146301615|ref|YP_001196206.1| hypothetical protein Fjoh_3876 [Flavobacterium johnsoniae UW101]
gi|146156033|gb|ABQ06887.1| protein of unknown function DUF1680 [Flavobacterium johnsoniae
UW101]
Length = 765
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 195/645 (30%), Positives = 292/645 (45%), Gaps = 89/645 (13%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L+EV L D +AQ +L+Y+L L+ D L+ + A LP YG WE+
Sbjct: 32 LQEVRLED------GPFKKAQDVDLKYILALNPDKLLAPYLIDAGLPVKSTRYGNWES-- 83
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
L GH GHYLSA + M+AST N +K ++ ++ L+ CQ+K G GY+ P
Sbjct: 84 LGLDGHIAGHYLSALSMMYASTGNPELKNRLDYMISELARCQDKNGNGYVGGIPQGKVFW 143
Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
++ S L W P Y IHK+ AGL D Y N QA +K+ W +E
Sbjct: 144 DRIHKGDIDGSSFGLNNTWVPIYNIHKLFAGLNDAYQYTGNQQAKEVLIKLGDWFIEMIK 203
Query: 264 ----NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+++QK+ L E GG+N+ LY IT D K+L A + FL
Sbjct: 204 PLSDDQIQKI------------LKTEHGGINESFADLYLITKDKKYLETAQKISQKSFLE 251
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
L + D L+ HANT IP VIG + ++ D + TFF D V S A GG S
Sbjct: 252 SLIKKEDKLTGLHANTQIPKVIGFEKIASISADKEWSEAVTFFWDNVTQKRSVAFGGNSV 311
Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
E + + L S E ETC +YNM ++S+ LF +E+ Y D+YER L N +LS Q
Sbjct: 312 SEHFNPVNDFSGMLKSNEGPETCNSYNMERLSKALFLEKQEMNYLDFYERTLYNHILSSQ 371
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIY--FEE 494
E G +Y P+ R H + S WCC G+G+E+ +K G+ IY F+E
Sbjct: 372 H-PEKGGFVYFTPI-------RPNHYRVYSQPETSMWCCVGSGLENHTKYGELIYSHFDE 423
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT-LTFSSKQEVGQLSSLN 553
+++ +I+S+ +W +V+ Q+ PY T + + K+ + LN
Sbjct: 424 -----AVFVNLFIASTLNWNEKGIVIEQRTKF-----PYENSTEIVLNLKK--AKTFDLN 471
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
+R P W + + Q L P G ++S +W D + I+ E + P
Sbjct: 472 IRRPKWAENFRVFINDKEQKTELKPSG-YISLKRKWKSKDHVRIEFETKTHLEQL----P 526
Query: 614 EYASIQAILFGPYLLAGHTSGEW-------DIKTGTARSLSALISPIPPSF-----NAQL 661
+ ++ A + GP +LA TS E D + G S + P+ ++ A
Sbjct: 527 DGSNWSAFVNGPIVLAAKTSKEALDGLFADDSRMGHVASGKYM--PMDKAYALVGEKASY 584
Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKD 706
V+ +E GN F + S+ +E F DA F+ KD
Sbjct: 585 VSRLKELGNMRFALD----SLELEPF-FELHDARYQMYFQTFTKD 624
>gi|291544618|emb|CBL17727.1| Uncharacterized protein conserved in bacteria [Ruminococcus
champanellensis 18P13]
Length = 597
Score = 244 bits (622), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 173/552 (31%), Positives = 278/552 (50%), Gaps = 45/552 (8%)
Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTP---GKAYGGWENPISELRGHFVGHYLSASAQMWA 178
N YL+ L ++L+ +F A + T + + GWE+P +LRGHF+GH+LSA+A + A
Sbjct: 24 NRAYLMELKSENLLQNFLLEAGVRTDRDVTEMHLGWESPTCQLRGHFLGHWLSAAALLIA 83
Query: 179 STHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAG 238
+ +K K+ T++ +L+ CQ G ++ + P + F+ + + +W+P YT+HK L G
Sbjct: 84 QNQDRELKAKLDTIIDALARCQELNGGRWIGSIPEKYFEKLKKNEYIWSPQYTLHKTLLG 143
Query: 239 LLDQYVLADNAQALKM----ATWMVEYFYNRVQK-VITMYSVERHWYSLNEETGGMNDVL 293
L + A N AL++ A W +E+ +QK +YS E GGM +V
Sbjct: 144 LYHSALYAKNQVALEILGRAADWYLEWTEKMMQKNPHAVYSGEE---------GGMLEVW 194
Query: 294 YRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP 353
LY +T D ++L LA + P G LA D LS+ HAN IP G+ YE+TGD
Sbjct: 195 AGLYQLTEDERYLTLAQRYAHPSIFGRLADGEDPLSNCHANASIPWAHGAAKMYEITGDA 254
Query: 354 LY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
+ +L+ F+ V+ ++ TGG ++ EFW P++L LG +E CT YNM++++ +
Sbjct: 255 AWLELVKRFWQCAVSDRDAFCTGGQNSGEFWIPPRKLGMFLGERTQEFCTVYNMVRLADY 314
Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
LF +T Y DY E L NG L+ Q+ G+ Y LP+ KA S WG+K F
Sbjct: 315 LFCFTGAHEYLDYIENNLYNGFLA-QQNKYTGMPAYFLPM-----KAGSVKKWGSKTKDF 368
Query: 473 WCCYGTGIESFSKLGDSIYF-EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI---- 527
WCC+GT +++ + ++ ++E N L + QYI+S + + HV + Q VD
Sbjct: 369 WCCHGTTVQAHTIYPQLCWYADKEQN--RLILAQYINSVCKF-NAHVTITQSVDMKYYND 425
Query: 528 -VSWDP-----YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
S+D R + K E + +L+LR+P W + +NGQ+ +
Sbjct: 426 GASFDERDDSRMFRWYIKLHVKAEQPERFTLSLRIPAWV-AGELVILVNGQHAEVESVNG 484
Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTG 641
F W +D + + P +L T ++ P+ + A GP +LAG + I
Sbjct: 485 FAELDRVWE-DDTVNLYFPAALTTCSL----PDMPQLLAFREGPIVLAGLCESDRGIYLA 539
Query: 642 TARSLSALISPI 653
SAL +P+
Sbjct: 540 QNDPTSAL-TPV 550
>gi|408357351|ref|YP_006845882.1| hypothetical protein AXY_19880 [Amphibacillus xylanus NBRC 15112]
gi|407728122|dbj|BAM48120.1| hypothetical protein AXY_19880 [Amphibacillus xylanus NBRC 15112]
Length = 622
Score = 243 bits (621), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 176/581 (30%), Positives = 282/581 (48%), Gaps = 72/581 (12%)
Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL----PTPGKAYGGWE 155
K V++HD L R + N YL+ L D+L++++R A P A+GGWE
Sbjct: 7 KNVTVHD------GDLKRREAANKSYLMSLTNDNLLFNYRVEAGRFHGREIPKDAHGGWE 60
Query: 156 NPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL 215
P+ ++RGHF+GH+LSA+A + + + +K K +V L+ECQ G ++ P +
Sbjct: 61 TPVCQIRGHFLGHWLSAAALHYHQSGDLELKVKADLIVSELAECQKDNGGQWVGPIPEKY 120
Query: 216 FDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSV 275
K +WAP Y +HK+ GL+D Y N QAL +A ++F K ++
Sbjct: 121 LHWIAEGKNIWAPQYNLHKLFMGLIDMYSYTGNQQALDIADNFADWFVKWSGK----FTR 176
Query: 276 ERHWYSLNEETGGMNDVLYRLYSIT-HDPKHLLLAHLFDKPCFLGFLALQADYLSHFHAN 334
E+ L+ ETGGM +V L IT HD LL + + F L + D L++ HAN
Sbjct: 177 EQFDDILDVETGGMLEVWADLLEITGHDKYKFLLDRYYRQRLFQPLLEGK-DPLTNMHAN 235
Query: 335 THIPIVIGSQMRYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL 393
T IP V+G YEVTGD + ++ ++ V + ATGG ++ E W ++ L
Sbjct: 236 TTIPEVLGCARAYEVTGDNRWLDIVKAYWNCAVTERGTLATGGNTSGEVWMPKMKIKARL 295
Query: 394 GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ-------RGTEP--- 443
G +N+E CT YNM++++ LF+ TK+ AY Y E L NG+++ GT
Sbjct: 296 GDKNQEHCTVYNMIRLADFLFQQTKDPAYGQYIEYNLYNGIMAQAYYQSYHVAGTGKNHP 355
Query: 444 --GVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
G++ Y LP+ G+ K W ++ NSF+CC+GT +++ + L IY++++ +
Sbjct: 356 WTGLLTYFLPMKAGLYKE-----WSSETNSFFCCHGTMVQANATLNRGIYYQDQDQI--- 407
Query: 502 YIIQYISSSFDWKSG--HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS-------- 551
Y+ QY +S + G V + Q D + L + + + +Q + +++S
Sbjct: 408 YVSQYFNSELETTIGSDRVRIKQSQDIMSG---SLLDSSSIAGQQRLSEITSIHENTPDF 464
Query: 552 ----------------LNLRMPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERWSYNDK 594
L LR+P W + A LNG+ + F T WS DK
Sbjct: 465 KKYDFTIQLDQKKTFTLGLRIPEWIMKD-ASIYLNGELIGKTNDSSAFYKLTREWSDGDK 523
Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
++I P+ +R + DD + A +GP +LAG T E
Sbjct: 524 VSITFPIGIRFIQLPDD----LNTGAFRYGPDVLAGITEHE 560
>gi|305676227|ref|YP_003867899.1| hypothetical protein BSUW23_17775, partial [Bacillus subtilis
subsp. spizizenii str. W23]
gi|305414471|gb|ADM39590.1| hypothetical protein BSUW23_17775 [Bacillus subtilis subsp.
spizizenii str. W23]
Length = 497
Score = 243 bits (621), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 166/508 (32%), Positives = 254/508 (50%), Gaps = 35/508 (6%)
Query: 114 VLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSAS 173
+ + +Q EYLL LDVD L+ + S YGGWE E+ GH +GH+LSA+
Sbjct: 10 MFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAGHSIGHWLSAA 67
Query: 174 AQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEA---------LKP 224
+ M+ ++ + +K K V LS Q GY+S F FD + L
Sbjct: 68 SAMYQASGDEKLKRKAEYAVNELSHIQQFDEEGYISGFSRACFDEVFSGDFRVDHFSLGG 127
Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
W P+Y++HK+ AGL+D Y L N AL++ + ++ +K + + E+ L
Sbjct: 128 SWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRMLIC 183
Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
E GGMN+ + LY +T + +L LA F L LA D L HANT IP VIG+
Sbjct: 184 EHGGMNEAMADLYILTKNKSYLDLAERFCHRAILQPLAEGKDELEGKHANTQIPKVIGAA 243
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
Y++TG+ Y+ FF + V SYA GG S E + ++ LG ETC TY
Sbjct: 244 KLYDITGNEAYRNPALFFWEQVVYQRSYAIGGNSIGEHF--GAEGSEELGVTTAETCNTY 301
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
NMLK++ HLFRW E + DYYE AL N +LS Q E G+ Y + G K
Sbjct: 302 NMLKLTGHLFRWFHEARFTDYYENALYNHILSSQ-DPESGMKTYFVSTQPGHFKV----- 355
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
+ + +SFWCC GTG+E+ ++ +IY ++ + LY+ +I S + + +++ Q+
Sbjct: 356 YCSPEDSFWCCTGTGMENPARYTQNIYHLDQDD---LYVNLFIPSQINVREKQMIITQE- 411
Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA-QASLNGQNLPLPPPGNFL 583
S+ + L K+ G +L +R+P WT NG+ +A +NG+ + +L
Sbjct: 412 ---TSFPAANKTKLVV--KKADGVPMTLQIRIPYWT--NGSLKAVVNGKRVQSVEKNGYL 464
Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDD 611
+ + W+ D + I LP+ L +DD
Sbjct: 465 AIHKHWNTGDCIEIDLPMKLHIYQAKDD 492
>gi|317057297|ref|YP_004105764.1| hypothetical protein Rumal_2655 [Ruminococcus albus 7]
gi|315449566|gb|ADU23130.1| protein of unknown function DUF1680 [Ruminococcus albus 7]
Length = 602
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 179/572 (31%), Positives = 289/572 (50%), Gaps = 56/572 (9%)
Query: 139 RKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSE 198
R+ S P + + GWE+P +LRGHF+GH++SA+A + AS +A ++ K+ +V L
Sbjct: 51 RQVISEPEKAELHWGWESPACQLRGHFLGHWMSAAAMLSASDGDAELRAKLVKIVDELER 110
Query: 199 CQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KM 254
CQ + G ++ + P + F E+ + +W+P YT+HK L GL+D Y A +AL ++
Sbjct: 111 CQQRNGGKWVGSIPEKYFKLMESEEYIWSPQYTMHKTLMGLVDAYRFAGIQKALDIADRL 170
Query: 255 ATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK 314
A W +E+ + V+K ++V E GGM + LY +T+DPK+ L ++ +
Sbjct: 171 ADWYIEWAAS-VEKTAP-FTV------FKGEQGGMLEEWCILYELTNDPKYRKLMDIYRE 222
Query: 315 PCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLI-GTFFMDIVNASHSYA 373
L + L+ HAN IP+ G+ Y++TG+ +K+I F+ V +A
Sbjct: 223 NGLYHKLEQHREALTDDHANASIPLSHGAARMYDITGEERWKIITDEFWRQAVTERGMFA 282
Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
T G ++ EFW P + LG ++E CT YNM++++ L+R T + YADY ERAL NG
Sbjct: 283 TTGANSGEFWVPPHSMGSYLGDTDQEFCTVYNMVRLADFLYRRTGDTVYADYIERALYNG 342
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
L+ Q+ G+ Y LPL G K WG+K + FWCC+GT +++ + I++
Sbjct: 343 FLA-QQNMHSGMPAYFLPLSSGSRKK-----WGSKRHDFWCCHGTMVQAQTLYPQLIWYT 396
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLN-------QKVDPIVSWD-----PYLRMTLTFS 541
E+ L + QYI S + G + + ++ V +D R ++ F
Sbjct: 397 EDST---LTVAQYIPSEAELDIGGKKIKVSQCTELKNLNNQVFFDEDEGGEKSRWSIRFD 453
Query: 542 SKQEVGQLSSLNLRMPVWTYSNG-AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
K + +L LRMP W NG Q ++G ++ N+L+ + W +ND + + L
Sbjct: 454 IKCDEPTFFTLWLRMPKWL--NGRPQLIIDGGSVQADIADNYLTISRTW-HNDTIQLLLI 510
Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQ 660
+L TE + D PE A A+L GP +LAG T D G SA P SF +
Sbjct: 511 PTLYTEPLA-DMPETA---ALLDGPIVLAGMT----DKDAGITGDFSA-----PESFLHR 557
Query: 661 LVTFTQES---GNSTFVMSNSNQSITMEEFPV 689
T ++ +T+V NQ + +E P+
Sbjct: 558 RTTHEYKTYVWKQNTYV--TRNQPVNIEFKPL 587
>gi|330996333|ref|ZP_08320217.1| hypothetical protein HMPREF9442_01300 [Paraprevotella xylaniphila
YIT 11841]
gi|329573383|gb|EGG54994.1| hypothetical protein HMPREF9442_01300 [Paraprevotella xylaniphila
YIT 11841]
Length = 811
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 169/538 (31%), Positives = 263/538 (48%), Gaps = 40/538 (7%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L+DV L Q A+ ++ YLL LD D L+ + K A L Y WEN + L G
Sbjct: 57 LNDVRLTQGP-FKHAEDLDIRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWEN--TGLDG 113
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFE- 220
H GHY+SA A M+A+T N IK+++ ++ Q+ G GYL P +++D+
Sbjct: 114 HIGGHYVSALAYMYAATGNEEIKQRLDYMLSEWKRAQDAAGDGYLCGAPNGRKIWDAVSK 173
Query: 221 --------ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITM 272
L W P Y IHK AGL D YV+A AQA M + ++ N + +
Sbjct: 174 GDIQASSFGLNGGWVPLYNIHKTYAGLRDAYVVAGCAQAKDMLVKLTDWMMNLTKDL--- 230
Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
S E+ L E GG+N+V + +T ++ LA F L L Q D L+ H
Sbjct: 231 -SDEQIQDMLRSEHGGLNEVFADVADLTGKDGYMQLARRFSHREILDPLLKQEDQLTGKH 289
Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
ANT IP VIG + ++ GD + FF V S + GG S RE + + +
Sbjct: 290 ANTQIPKVIGYKRIADLEGDESWDDAARFFWKTVVDQRSISIGGNSVREHFHPSEDFSSM 349
Query: 393 LGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
L SE ETC TYNML++++ L++ + + Y DYYERAL N +LS + G +Y P
Sbjct: 350 LTSEQGPETCNTYNMLRLTKMLYQTSADAHYMDYYERALYNHILSTIDPVQGG-FVYFTP 408
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
+ G + S SFWCC G+G+E+ +K G+ IY + LY+ +I S
Sbjct: 409 MRSGHYRVYS-----QPQTSFWCCVGSGMENHAKYGEMIYAHGGDD---LYVNLFIPSVL 460
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
W G V + Q+ LR++ + + + ++ R+P WT ++ + ++NG
Sbjct: 461 QW--GKVRVEQRTSFPYEEATTLRLSCSKA------KTFTVKFRVPEWTDASRMELTVNG 512
Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
P+ G +++ + +W+ D++ + LP+SLR + D Y + ++GP +LA
Sbjct: 513 TAQPVSVSGGYVAVSRKWTDGDEVRLTLPMSLRAVVLPDGSDNY----SFMYGPVVLA 566
>gi|406027774|ref|YP_006726606.1| hypothetical protein LBUCD034_2040 [Lactobacillus buchneri CD034]
gi|405126263|gb|AFS01024.1| hypothetical protein LBUCD034_2040 [Lactobacillus buchneri CD034]
Length = 803
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 185/564 (32%), Positives = 263/564 (46%), Gaps = 77/564 (13%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPIS-ELRGHFVGHYLSASA 174
RAQQ ++YLL LD + +F + A + + G Y GWE RGHF GHYLSA +
Sbjct: 19 RAQQMTVKYLLALDPKRFLVTFDQVAGIDSGGVTGYQGWERTDGLNFRGHFFGHYLSALS 78
Query: 175 QMWASTHNATIKE----KMSTVVFSLSECQNKIG------TGYLSAFPTELFDSFEALK- 223
Q +T + I++ K+ V L Q GY+SAF D E +
Sbjct: 79 QAILATEDNAIRQQLLDKLRLGVNGLQSAQAAYAKKHPESAGYVSAFREVALDEVEGREV 138
Query: 224 ------PVWAPYYTIHKILAGLLDQYVLADN------AQALKMATWMVEYFYNRVQKVIT 271
V P+Y +HK+LAGLL V N +ALK A Y + R+ ++
Sbjct: 139 PKDEKENVLVPWYNLHKVLAGLLAVNVNLQNIDPLLSEKALKSAHQFGLYVFKRINQLAD 198
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
+ L E GGMND LY L+ +T D + L A FD+ LA D L+
Sbjct: 199 PTQM------LKIEYGGMNDALYELFDLTDDKRMLTAATYFDETTLFKQLAKGDDVLAGK 252
Query: 332 HANTHIPIVIGSQMRYEVTGD----------------PLYKLIGTFFMDIVNASHSYATG 375
HANT IP +IG+ RYE D +Y F IV H+Y TG
Sbjct: 253 HANTTIPKLIGALHRYESLHDVKRADQYLSPEEKGSLNMYLKAAVNFWQIVIDDHTYVTG 312
Query: 376 GTSAREFWWDPKRL-ADTL---GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
G S E + +P +L D + G+ ETC TYNMLK+SR LFR T + Y DYYE+ T
Sbjct: 313 GNSQSEHFHEPGQLFHDAVLEDGATTCETCNTYNMLKLSRELFRVTGDKKYLDYYEQTYT 372
Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY 491
N +L Q G+M Y P+ G +K + F+ FWCC GTGIESF+KLGDS Y
Sbjct: 373 NAILGSQ-NPNTGMMTYFQPMAAGYTKV-----YNRPFDEFWCCTGTGIESFTKLGDSYY 426
Query: 492 FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
F LY+ Y S+ S ++ + ++VD + +T+ Q+ +
Sbjct: 427 FRSGDQ---LYLSLYFSNVLRLDSRNLQMTEQVDRKAG---KVHLTVVKIRSQDSAGTIN 480
Query: 552 LNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK---LTIQLPLSLRTEAI 608
L LR P W + A+ +++G + + +F W ++ T+ L + + E +
Sbjct: 481 LKLRNPAWLVQS-AKLAVDGISQQMDQNADF------WEIDNAGPGTTVDLEMPMSLEMV 533
Query: 609 Q-DDRPEYASIQAILFGPYLLAGH 631
Q D P Y + + +GPY+LAG
Sbjct: 534 QTKDNPHYLAFK---YGPYVLAGQ 554
>gi|302897238|ref|XP_003047498.1| hypothetical protein NECHADRAFT_97856 [Nectria haematococca mpVI
77-13-4]
gi|256728428|gb|EEU41785.1| hypothetical protein NECHADRAFT_97856 [Nectria haematococca mpVI
77-13-4]
Length = 626
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 181/550 (32%), Positives = 261/550 (47%), Gaps = 42/550 (7%)
Query: 99 LKEVSLHDV-WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWEN 156
L EV+L D W+D Q L YLL +D D L++ FR L T G + GGW+
Sbjct: 42 LSEVTLTDSRWMDN-------QNRTLTYLLSVDPDRLLYVFRANHGLDTKGAQKNGGWDA 94
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-----IGTGYLSAF 211
P R H GH+L+A +Q +A+ N + + L +CQ GYLS F
Sbjct: 95 PDFPFRSHIQGHFLTAWSQCYATLRNEECGSRATYFAKELGKCQANNEKANFTEGYLSGF 154
Query: 212 PTELFDSFE--ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
P + E L PYY IHK LAGLLD + L + A + + + R +K+
Sbjct: 155 PESEITAVEKRTLNNGNVPYYAIHKTLAGLLDVHRLVGDEDAKDVMLALAGWVDTRTKKL 214
Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
+ ++ + E GGMN+VL + D K L +A FD L D LS
Sbjct: 215 ----TYDQMQAMMQTEFGGMNEVLADIAYYIGDKKWLEVAQRFDHATIFDPLEKGQDKLS 270
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
HANT +P IG+ Y+V+G Y IG D+ H+YA GG S E + P +
Sbjct: 271 GLHANTQVPKWIGAIREYKVSGLQKYLDIGRNAWDLTVHKHTYAIGGNSQAEHFRAPDAI 330
Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWT-KEIAYADYYERALTNGVLSIQRGTE-PGVMI 447
A+ L ++ E C TYNMLK++R L+ + ++ D+YE AL N +L Q + G +
Sbjct: 331 AEYLDNDTCEACNTYNMLKLTRELWVMDPSDASFFDFYENALMNHLLGQQNPEDHHGHIT 390
Query: 448 YMLPLG----RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
Y PL RGV A W T ++SFWCC G+GIE+ +KL DSIYF ++ LY+
Sbjct: 391 YFTPLNPGGRRGVGPAWGGGTWSTDYDSFWCCQGSGIETNTKLMDSIYFHDD---ETLYV 447
Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
+ S DW + + Q D P T Q ++ +R+P WT +
Sbjct: 448 NLFTPSQLDWSDRKISITQSTDF-----PERDTTTLKVGNQGENNEWTMAIRVPSWT--S 500
Query: 564 GAQASLNGQNLPLP--PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
A +NG+ + G + +WS D +T+ LP+SLRT A + A+ AI
Sbjct: 501 KASIKINGEAVEGVDIESGKYAIIKRKWSSGDAVTVTLPMSLRTIAAN----DDAATAAI 556
Query: 622 LFGPYLLAGH 631
FGP +L+ +
Sbjct: 557 AFGPVILSAN 566
>gi|319786479|ref|YP_004145954.1| hypothetical protein Psesu_0871 [Pseudoxanthomonas suwonensis 11-1]
gi|317464991|gb|ADV26723.1| protein of unknown function DUF1680 [Pseudoxanthomonas suwonensis
11-1]
Length = 806
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 184/562 (32%), Positives = 275/562 (48%), Gaps = 63/562 (11%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L++V L D +SS L NL YL LD D L+ FR A LP+P Y WE+
Sbjct: 40 LEDVRLGDGAFARSSAL------NLRYLAALDPDRLLAPFRIEAGLPSPAPKYPNWES-- 91
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF-- 216
L GH GHYLSA AQ A+ +A ++ ++ +V +LS+ Q G GY+ P
Sbjct: 92 MGLDGHTAGHYLSALAQQ-AAQGSAGMRRRLDYMVAALSQVQAANGDGYVGGVPNGRVLW 150
Query: 217 -----DSFEA----LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
F+A L+ W P+Y +HK AGL D ++LA NAQA ++ A W
Sbjct: 151 NRIASGDFQAESFSLEGAWVPFYNLHKTYAGLRDAWLLAGNAQARDVLVRFADWAGALVA 210
Query: 264 N----RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
N ++Q+V L+ E GGMN+VL +Y+IT D ++L LA F L
Sbjct: 211 NLDDTQLQRV------------LDTEHGGMNEVLADVYAITGDRRYLALARRFSHRAILD 258
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
L + D L HANT IP VIG E+ GD + FF + V S A GG S
Sbjct: 259 PLLRREDRLDGLHANTQIPKVIGFARIGELDGDVEWIEAAQFFWERVALHRSIAFGGNST 318
Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
RE + + + S E ETC +YNML+++ L R + +AD+YERAL N +LS Q
Sbjct: 319 REHFNPADDFSGMIASREGPETCNSYNMLRLTLLLERLRPDPRHADFYERALFNHILSTQ 378
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
+ G ++Y P+ + R + FWCC G+G+E+ + G Y +E +
Sbjct: 379 H-PDHGGLVYFTPI-----RPRHYRVYSQPQECFWCCVGSGMENHGRHGAFAYTHDESS- 431
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
L + Y+ S W+ +VL Q+ + R L ++ + Q+ +L LR P
Sbjct: 432 --LRVNLYLDSELHWRERGLVLRQR----TRFPEEPRSVLEVATPRP--QVFALELRHPH 483
Query: 559 WTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W + + LNG+ P+ P ++ +W D++ ++LP+S R E++ P+ +
Sbjct: 484 W-LAGPLRVKLNGRRWPVESSPSSYARIERQWQDGDRIEVELPMSTRIESL----PDGSD 538
Query: 618 IQAILFGPYLLAGHTSGEWDIK 639
A++ GP +LA SGE DI+
Sbjct: 539 WVAVMHGPLMLAAR-SGEEDIE 559
>gi|254444174|ref|ZP_05057650.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
gi|198258482|gb|EDY82790.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
Length = 788
Score = 243 bits (619), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 171/533 (32%), Positives = 255/533 (47%), Gaps = 49/533 (9%)
Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAS 179
+ ++ Y+L D D L+ F A L + YG WE+ S L GH GH+LSA A +
Sbjct: 47 EADVTYVLAHDPDRLLAPFLTAAGLEPKAEKYGNWES--SGLDGHSAGHFLSAYATLSLQ 104
Query: 180 THNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELF------DSFEALKPVWA 227
+ N ++E++ ++ L+ CQ+ IGTGYL P T LF D F +L W
Sbjct: 105 SDNPLLRERLDYMLDELTRCQDAIGTGYLGGVPNSQEFTTRLFAGEIKADRF-SLNGAWV 163
Query: 228 PYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
P+Y +HK AGL D +++AD+ +A + +A W V + E+ L
Sbjct: 164 PWYNLHKTYAGLKDAWLVADSEKAKNILIALADWTV--------AATAKLTDEQMQEMLY 215
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGS 343
E GGMN++ LY T D ++L LA+ F L L D L+ FHANT IP VIG
Sbjct: 216 TEHGGMNEIFADLYLHTQDQRYLELAYRFTHHELLDPLLENQDKLTGFHANTQIPKVIGY 275
Query: 344 QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCT 402
Q D FF D V S + GG S RE + L S E ETC
Sbjct: 276 QRTALAAQDEKLHQASQFFWDTVVNHRSVSIGGNSVREHFHPADDFRSMLESREGPETCN 335
Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARST 462
T+NML+++ LF A DYYERAL N +LS Q E G ++Y P + R
Sbjct: 336 THNMLRLTTLLFEAEPTAALTDYYERALYNHILSAQH-PETGGLVYFTP-----QRPRHY 389
Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
+ N+FWCC G+GIE+ + + IY + L++ +++SS +W+ + L Q
Sbjct: 390 RVYSVPENAFWCCVGSGIENPGRYSEFIYAHTDD---ALFVNLFLASSLNWQEKGLRLTQ 446
Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN- 581
+ +T+ + K+++ +L +R P WT ++ Q +LN + + N
Sbjct: 447 STN--FPQTASTELTIDQAPKKKL----TLKIRRPAWT-TDAFQITLNDKPVKTKTNANG 499
Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
+ S T +W D L++ LP+ + E I D P Y + L+GP +LA T
Sbjct: 500 YASLTRKWKTGDTLSVALPMQVHVEQIPDHSPFY----SFLYGPIVLAAKTDA 548
>gi|332185536|ref|ZP_08387284.1| tat (twin-arginine translocation) pathway signal sequence domain
protein [Sphingomonas sp. S17]
gi|332014514|gb|EGI56571.1| tat (twin-arginine translocation) pathway signal sequence domain
protein [Sphingomonas sp. S17]
Length = 639
Score = 242 bits (618), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 178/561 (31%), Positives = 262/561 (46%), Gaps = 57/561 (10%)
Query: 90 GGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK 149
G LP ++ + DV LD L AQ+ YL+ L D L+ +FR A L
Sbjct: 33 GATRLPATVVQPFDMADVTLDGGPFL-HAQRMTEAYLMRLQPDRLLANFRANAGLKPKAP 91
Query: 150 AYGGWENPIS----ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT 205
AYGGWE+ GH +GHYLSA A + +T + ++++ + L+ CQ G+
Sbjct: 92 AYGGWESEPEWADINCHGHTLGHYLSACALAYRATKDKRYRQRIDYIANELAACQKASGS 151
Query: 206 GYLSAFPT--ELFDSFEALKPVWA-PYYTIHKILAGLLDQYVLADNAQA----LKMATWM 258
G + AFP L + +P+ P+YT+HK+ AGL D LAD+ + ++A W
Sbjct: 152 GLVCAFPKGPALVAAHLRGEPITGVPWYTLHKVYAGLRDSVQLADSEPSRGVLFRLADWG 211
Query: 259 VEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL 318
V S E+ L E GGMN++ LY +T + + +A F + +
Sbjct: 212 V--------VATKPLSDEQFEKMLETEYGGMNEIYADLYFMTGNEDYRRVAERFSQKAIM 263
Query: 319 GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS 378
LA DYL HANT IP +IG Q +E TGD Y FF V + ++ATGG
Sbjct: 264 NPLAQGRDYLDGMHANTQIPKIIGFQRVFEATGDDKYHNAAAFFWRTVAHTRAFATGGHG 323
Query: 379 AREFWWDPKRLAD----TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
E ++ +AD ++ ETC +NMLK++R LF YADYYER L NG+
Sbjct: 324 DAEHFF---AMADFDKHVFSAKGSETCCQHNMLKLTRALFLRDPRAEYADYYERTLYNGI 380
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
L+ Q + G+ Y G K + T +SFWCC GTG+E+ K DSIYF +
Sbjct: 381 LASQ-DPDSGMATYFQGARPGYMKL-----YHTPEDSFWCCTGTGMENHVKYRDSIYFHD 434
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
+ LY+ +I S+ W VL Q + + R L ++ +L L
Sbjct: 435 DR---ALYVNLFIPSTVTWADKGAVLTQATTFPDAANTQFRWKLRQPTEL------TLKL 485
Query: 555 RMPVWTYS-----NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
R P W+ + NGA+ S + + PG++ T W D + ++L + E+
Sbjct: 486 RHPKWSPTATLLVNGAEVSHSDK------PGSYAELTRTWKTGDTVEMRLVMEPAVESA- 538
Query: 610 DDRPEYASIQAILFGPYLLAG 630
P I A +GP +LAG
Sbjct: 539 ---PAAPEIVAFTYGPLVLAG 556
>gi|295133987|ref|YP_003584663.1| glycosyl hydrolase [Zunongwangia profunda SM-A87]
gi|294982002|gb|ADF52467.1| putative glycosyl hydrolase [Zunongwangia profunda SM-A87]
Length = 794
Score = 242 bits (618), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 168/563 (29%), Positives = 273/563 (48%), Gaps = 65/563 (11%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
LK+V LH + + A T+L+Y+L ++ D L+ F + A L ++Y WEN
Sbjct: 36 LKDVKLH------TGLFEEAMYTDLDYILQMEPDRLLAPFLREAGLQPKAESYPNWEN-- 87
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDS 218
+ L GH GHYL+A AQM+AS + ++++ ++ L + Q+ G GY+ P DS
Sbjct: 88 TGLDGHIGGHYLTALAQMYASAGSDEALQRLNYMIGELKKAQDANGNGYVGGIP----DS 143
Query: 219 FEALKPV---------------WAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMV 259
K + W P Y IHK AGL D Y++A N +A +M WM+
Sbjct: 144 ERIWKEISEGKINAGGFSLNGGWVPLYNIHKTYAGLRDAYLIAGNEEAKQMLIDLTDWMI 203
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ N + I L E GG+N+ +Y +T D K+L LA+ F + L
Sbjct: 204 DITANLSEAQIQEM--------LKSEHGGLNETFADVYKMTGDKKYLDLAYAFTQKQVLD 255
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
L + D L+ HANT IP VIG + + + Y T+F + V + + + GG S
Sbjct: 256 PLEHEKDILNGMHANTQIPKVIGYETIAALDQNKDYHNAATYFWENVVNNRTVSIGGNSV 315
Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
RE + + + S + ETC TYNMLK+S LF E Y D+YE+ L N +LS Q
Sbjct: 316 REHFHPADDFSSMINSVQGPETCNTYNMLKLSEKLFLANPEEKYIDFYEQGLYNHILSSQ 375
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
G +Y P+ G + + S WCC G+G+E+ K + IY +
Sbjct: 376 HPE--GGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHGKYNEMIYAHSDD-- 426
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
LY+ +I S +W+ + L Q+ D P T +F + + Q ++N R P
Sbjct: 427 -ALYVNLFIPSEVNWEDKNFKLIQETDFPNAE-------TASFKIETQKPQKLTINFRYP 478
Query: 558 VWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W G +N + + PG+++S T +W +D+++++LP+++ +E + P+ +
Sbjct: 479 SWA-GEGFDVQVNDKKVKFDKKPGSYISITRKWEDDDQISMRLPMNITSERL----PDGS 533
Query: 617 SIQAILFGPYLLAGHTSGEWDIK 639
+++ +GP +LA T G+ D+K
Sbjct: 534 DYESLKYGPLVLAAKT-GKEDLK 555
>gi|198275797|ref|ZP_03208328.1| hypothetical protein BACPLE_01972 [Bacteroides plebeius DSM 17135]
gi|198271426|gb|EDY95696.1| hypothetical protein BACPLE_01972 [Bacteroides plebeius DSM 17135]
Length = 796
Score = 242 bits (618), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 168/532 (31%), Positives = 256/532 (48%), Gaps = 43/532 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ+ NL+ L+ DVD L+ F K A LP + + W + L GH GHYLSA A +
Sbjct: 48 AQELNLKVLMEYDVDRLLAPFLKEAGLPLKAEPFPNW----AGLDGHVGGHYLSAMAMNY 103
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSF-----EALKPVWAPYY 230
A+T N +++M ++ L CQ G GY+ P EL+ E++ WAP+Y
Sbjct: 104 AATGNEECRKRMEYMLGELKRCQESNGDGYIGGVPNGKELWADIKNGKVESIWKYWAPWY 163
Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
+HKI AGL D ++ N +AL M + ++ + V + ++ +E+ L E GGM+
Sbjct: 164 NVHKIFAGLRDAWMYTGNKEALDMFLRLCDWGVS-VTEGLSDNQMEQ---MLANEFGGMD 219
Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
++ Y IT K+L A F + D L + HANT IP VIG Q EV
Sbjct: 220 EIFADAYQITGKKKYLTTAKRFSHRWLFDSMVAHKDNLDNIHANTQIPKVIGYQRIAEVC 279
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFW--WDPKRLADTLGSENEETCTTYNMLK 408
GD Y FF +IV S A GG S RE++ D R + E E+C TYNMLK
Sbjct: 280 GDNQYMDAADFFWNIVACKRSLALGGNSRREYFSSMDDFR-SHVEDREGPESCNTYNMLK 338
Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH--GWG 466
++ LFR T + Y D+YE+AL N +LS Q G + + + AR H +
Sbjct: 339 LTEGLFRMTGKAVYVDFYEKALYNHILSTQHPKHGGYVYF--------TSARPAHYRVYS 390
Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDP 526
++ WCC GTG+E+ K G+ IY + L++ +ISS +W+ V + Q+ +
Sbjct: 391 KPNSAMWCCVGTGMENHGKYGEFIYTHSSDS---LFVNLFISSRLNWEQEKVTITQETN- 446
Query: 527 IVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP---GNFL 583
+ R+T+ S + L LR P W + G + NG+ + + +++
Sbjct: 447 -FPDEETSRLTVKLKSGESCH--FKLLLRRPAWV-TEGYEVKCNGKVVDVSEKVAGSSYI 502
Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+W DK+ + LP+ +R E +Q + AI+ GP L+ E
Sbjct: 503 CIDRKWKDGDKVEVSLPMKMRLETLQGE----DDFVAIMRGPILMGASVGTE 550
>gi|220928430|ref|YP_002505339.1| hypothetical protein Ccel_0997 [Clostridium cellulolyticum H10]
gi|219998758|gb|ACL75359.1| protein of unknown function DUF1680 [Clostridium cellulolyticum
H10]
Length = 597
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 168/538 (31%), Positives = 272/538 (50%), Gaps = 63/538 (11%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWA 178
++T +Y+ D++ L+ +FRK A + + + GGWE+ LRGHFVGH+LSA ++
Sbjct: 21 RETAKKYVNDFDINRLMHTFRKNAGIESLAEPLGGWESEECNLRGHFVGHFLSACSKFAF 80
Query: 179 STHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEAL--KPVWAPYYTIHKIL 236
S ++ +K K +V ++EC ++ GYLSAF E+ D E + VWAPYYT+HKIL
Sbjct: 81 SDNDDCLKTKADNIVKIMAECASE--NGYLSAFGEEMLDILETEEDRGVWAPYYTLHKIL 138
Query: 237 AGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS--------LN--EET 286
GL+D Y+ +N AL +A + Y R +++ +W + +N E
Sbjct: 139 QGLVDCYLFLNNKTALSLAVNLAHYIRRRFERL-------SYWKTDGILRCTRVNPVNEF 191
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GG+ DVLY LY IT D K LA +F++ F+G LA D L HANTH+P+VI + R
Sbjct: 192 GGIGDVLYSLYEITGDRKIFDLADIFNRDYFIGNLAADRDVLEDLHANTHLPMVISAIHR 251
Query: 347 YEVTGDPLYK---------LIGTFFMDIVNASH--SYATGGTSAR-EFWWDPKRLADTLG 394
+ +TG+ YK L+G F++ ++S S+ G S + E W L ++L
Sbjct: 252 FNLTGEYKYKHAAQNFYKYLLGRTFVNGNSSSKATSFKKGEVSEKSEHWGAHNHLENSLT 311
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
E+C +N K+ + LF WT++ + ++ E N VL+ T G+ Y P+G
Sbjct: 312 GGESESCCAHNTEKIVQQLFAWTEDERFLEHLEILKYNAVLN-STSTVTGLSQYQQPMGT 370
Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
GV K + F++FWCC GTGIE+ S++ +I+F+++ L + +I+S+ W
Sbjct: 371 GVKK-----NFSGLFDTFWCCTGTGIEAMSEIQKNIWFKDKDT---LLLNMFIASTVQWD 422
Query: 515 SGHVVLNQKV---DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
+V + Q D VS LT S+ V +L LR S +NG
Sbjct: 423 EKNVKIVQNTAYPDNTVS-------VLTVSTSNPVS--FTLMLRK-----SQVKSVKING 468
Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
++ ++ ++ ND + I++ SL ++ + A+++ LLA
Sbjct: 469 KSFNFIADNGYIYIKRIFNNNDTIEIEIDSSLHLIQLKGSENK----AAVMYDRILLA 522
>gi|334144880|ref|YP_004538089.1| acetyl-CoA carboxylase, biotin carboxylase [Novosphingobium sp.
PP1Y]
gi|333936763|emb|CCA90122.1| acetyl-CoA carboxylase, biotin carboxylase [Novosphingobium sp.
PP1Y]
Length = 651
Score = 241 bits (616), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 177/549 (32%), Positives = 260/549 (47%), Gaps = 51/549 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN-- 156
L+ L DV L++ L AQ+ YLL L D L+ +FR A L YGGWE+
Sbjct: 50 LEPFDLSDVTLEEGPFL-HAQRLTEAYLLRLQPDRLLHNFRVNAGLAPRAAVYGGWESDE 108
Query: 157 --PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
GH +GHYLSA A + ST++ K+++ + L+ CQ G+G + AFP
Sbjct: 109 IWADINCHGHTLGHYLSACALAFRSTNDRRFKQRVDYIANELAACQKATGSGLVCAFPDG 168
Query: 215 ---LFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
L K P+YT+HK+ AGL D +LAD+ + +++A W V
Sbjct: 169 PALLTAHLRGDKITGVPWYTLHKVYAGLRDGALLADSTVSREVLIRLADWGV-------- 220
Query: 268 KVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD 326
V T + + + L E GGMN+V LY++T + + L+ F + L D
Sbjct: 221 -VATRPLTDGQFETMLATEHGGMNEVYADLYAMTGNEDYRELSQRFSHKAVMDPLVQGRD 279
Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
L HANT +P ++G Q YE+TGD Y FF V + S+ATGG E ++
Sbjct: 280 LLDGMHANTQVPKIVGFQRVYEITGDDRYAQAANFFFRTVAHTRSFATGGHGDNEHFF-- 337
Query: 387 KRLAD----TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
+AD ++ ETC +NMLK++R LF YADYYER L NG+L+ Q +
Sbjct: 338 -AMADFDRHVFSAKGSETCCQHNMLKLARLLFMQDPNADYADYYERTLYNGILASQ-DPD 395
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
G++ Y G K + T +SFWCC GTG+E+ K DSIYF +E + LY
Sbjct: 396 SGMVTYFQGARPGYMKL-----YHTPEHSFWCCTGTGMENHVKYRDSIYFHDERS---LY 447
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
+ ++ SS WK L Q+ P + + ++ +L LR P W S
Sbjct: 448 VNLFVPSSVAWKEKGAELIQRT--AFPEKPTTGLQWKLRAPAKI----ALQLRHPRW--S 499
Query: 563 NGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
A +NGQ + G+++ W D++ +QL + E+ P I A
Sbjct: 500 RTAVVRVNGQEVARSATAGSYVEVARTWKDGDRVELQLEMEPTVESA----PAAPDIVAF 555
Query: 622 LFGPYLLAG 630
+GP +LAG
Sbjct: 556 TYGPIVLAG 564
>gi|302340651|ref|YP_003805857.1| hypothetical protein Spirs_4187 [Spirochaeta smaragdinae DSM 11293]
gi|301637836|gb|ADK83263.1| protein of unknown function DUF1680 [Spirochaeta smaragdinae DSM
11293]
Length = 764
Score = 241 bits (615), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 172/559 (30%), Positives = 266/559 (47%), Gaps = 39/559 (6%)
Query: 107 VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENPISELRGHF 165
V L + SV Q +++L+ D D ++++FR A + T G GW+ P LRGH
Sbjct: 196 VMLKEGSVFCDEQDKMIQHLIDTDDDQMLYNFRVAAGVDTRGALPMTGWDAPSCNLRGHT 255
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----GTGYLSAFPTELFDSFE 220
GHYLS+ A W+ T + +K+ ++ SLSECQN + G+LSA+ FD E
Sbjct: 256 TGHYLSSLALGWSVTKKTELMDKIVYLIESLSECQNALEERGCSKGFLSAYSERQFDLLE 315
Query: 221 ALKP---VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
P +WAPYYT+ KI++GL D Y LAD++ AL + M ++ Y R+ + ++ +++
Sbjct: 316 TYTPYPTIWAPYYTLDKIMSGLYDCYSLADSSLALNILCKMGDWVYERLSR-LSRNQLDK 374
Query: 278 HW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
W + E GGM V+ +LY++T +L A+ FD + D L HAN H
Sbjct: 375 MWSMYIAGEFGGMISVMVKLYTLTKKKTYLQTAYYFDNEKLFYPMQENIDTLKDMHANQH 434
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
IP ++G+ YE G Y I F +IV ASH Y+ GG E + +P + + +
Sbjct: 435 IPQIMGAVELYEADGSGRYYDIAKNFWNIVTASHVYSIGGIGETEMFHEPNEIMTYITDK 494
Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
E+C +YN+L+++ LF E D+YE L N +LS G Y +PL G
Sbjct: 495 TAESCASYNILRLTGQLFALEPERRKMDFYETVLYNHILSSFSHKSDGGTTYFMPLRPGG 554
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
K + TK N+ CC+G+G+E+ + IY N LYI YI S+ +W+
Sbjct: 555 HKE-----FNTKENT--CCHGSGLETRFRYVQDIY---ACNHDTLYINLYIPSAVEWE-- 602
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
N +++ + D TF +L R+P W N +++
Sbjct: 603 ----NFRIEQTTASDA----AGTFIFLIHSSGWRNLAFRIPHWAEDEYKVTINNQESVEE 654
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
+ W D++ I P R + D +P YA + +GPY+LA + E
Sbjct: 655 MAQDGYFYLHRDWREGDRIEILTPYHFRKLPVPDGKP-YA---CMAYGPYILAALSDQEE 710
Query: 637 DIK----TGTARSLSALIS 651
+ TG R L+A IS
Sbjct: 711 YLPFPELTGDDRVLTASIS 729
>gi|431795908|ref|YP_007222812.1| hypothetical protein Echvi_0518 [Echinicola vietnamensis DSM 17526]
gi|430786673|gb|AGA76802.1| hypothetical protein Echvi_0518 [Echinicola vietnamensis DSM 17526]
Length = 784
Score = 241 bits (614), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 166/527 (31%), Positives = 254/527 (48%), Gaps = 40/527 (7%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQQ ++ Y+ ++VD L+ + A + Y WEN + L GH GHYLSA A M+
Sbjct: 46 AQQVDMTYMKAMEVDRLLAPYMLEAGVDWAADRYPNWEN--TGLDGHIGGHYLSALAMMY 103
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP-----------TELFDSFEALKPVW 226
AST +A +K +M +V L+ Q K G GY+ P E+ +L W
Sbjct: 104 ASTGDAEMKRRMDYMVEQLAMAQAKNGNGYVGGIPGGMAMWEEIGQGEIDAGGFSLNQKW 163
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
P Y IHKI AGL D Y++ NAQA ++ + ++FY + + E+ L E
Sbjct: 164 VPLYNIHKIYAGLRDAYLIGGNAQAKEVLLDLTDWFYELTKGLTD----EQFQQMLVSEH 219
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GG+N+V + +IT + K+L LA L L Q D L+ HANT IP VIG Q R
Sbjct: 220 GGLNEVFADVAAITGEAKYLELAKKMSHEWLLEPLEEQEDKLTGMHANTQIPKVIGFQ-R 278
Query: 347 YEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
GD ++ FF V + + A GG S RE + + + S + ETC TY
Sbjct: 279 VAQEGDLAEWQEAADFFWHTVVENRTVAIGGNSVREHFHPEDDFSPMVSSNQGPETCNTY 338
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
NML++S LF + Y D++ER L N +LS Q E G +Y P+ +
Sbjct: 339 NMLRLSEQLFMSNPQAEYVDFFERGLYNHILSSQH-PEKGGFVYFTPM-----RPEHYRV 392
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
+ FWCC G+G+E+ +K G+ IY E LYI +I S +W+ +VL Q
Sbjct: 393 YSQPQQGFWCCVGSGLENHAKYGEFIYAHSEEE---LYINLFIPSELNWEEKGMVLTQTN 449
Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFL 583
+ +P T +++ + LR P W Q S+NG+ + P +++
Sbjct: 450 N--FPEEPQSVFTFEMDKARKM----PVKLRYPSWVAEGALQVSVNGRPFEVNASPSSYI 503
Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ +W D+L ++LP+ ++ E + P+ + A ++GP +LA
Sbjct: 504 TINRKWKDGDRLEVKLPMEMQWEQL----PDGSDWGAFVYGPIVLAA 546
>gi|268609237|ref|ZP_06142964.1| hypothetical protein RflaF_07037 [Ruminococcus flavefaciens FD-1]
Length = 1082
Score = 241 bits (614), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 180/576 (31%), Positives = 274/576 (47%), Gaps = 61/576 (10%)
Query: 96 GNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGW 154
G+ + + S+ DV + A + ++YLL D + L+ FR+ A L T G K YGGW
Sbjct: 37 GSRISDFSISDVKM-TDDYCTNAFEKEMKYLLSFDTERLLAGFRENAGLSTNGAKRYGGW 95
Query: 155 ENPISELRGHFVGHYLSASAQMW-----ASTHNATIKEKMSTVVFSLSECQN--KIGTGY 207
EN + + GH VGHYL+A AQ + S + ++M T++ + CQ + G+
Sbjct: 96 EN--TNIAGHCVGHYLTALAQAYQNPNVTSDQKDALYKRMKTLIDGMQACQQHPRGKKGF 153
Query: 208 LSAFPT-------ELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMA 255
L A P FD E K W P+YT+HK++AG++D Y A A +
Sbjct: 154 LWAAPVPSDGNVERQFDRVEIGKANIFDDAWVPWYTMHKLIAGIVDVYNATQYAPAKDVG 213
Query: 256 TWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKP 315
+ + ++ YNR + +S + L+ E GGMND +Y LY IT H AH+FD+
Sbjct: 214 SALGDWVYNRC----SGWSQQTRNTVLSIEYGGMNDCMYDLYRITGKDSHAAAAHVFDED 269
Query: 316 CFLGFLALQA-DYLSHFHANTHIPIVIGSQMRY------EVTGDPL----YKLIGTFFMD 364
++ D L+ HANT IP IG+ RY V G + Y F D
Sbjct: 270 ALFQKVSNGGRDVLNGRHANTTIPKFIGALKRYMVLDGKTVNGQKVDASAYLKYAENFWD 329
Query: 365 IVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYAD 424
+V H+Y TGG S E + L + N ETC +YNMLK+SR LF+ T + Y D
Sbjct: 330 MVTTHHTYITGGNSEWEHFGKDDILDAERTNCNCETCNSYNMLKLSRELFKITHDSKYMD 389
Query: 425 YYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFS 484
+YE N +LS Q E G+ Y P+ G K S T+++ FWCC G+G+ESF+
Sbjct: 390 FYENTYYNSILSSQN-PETGMTTYFQPMATGYFKVYS-----TQWDKFWCCTGSGMESFT 443
Query: 485 KLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQ 544
KLGD+IY + + LY+ Y SS +W +V + Q + + ++ T+ SS
Sbjct: 444 KLGDTIYMHDNDS---LYVNFYQSSVINWAEKNVSITQ--ESTIPDGASVKFTIKGSSDL 498
Query: 545 EVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+ L R+P W S+NG + + +S D + + +P +R
Sbjct: 499 D------LRFRIPDWI-DGTMGVSVNGTKYSYKTVNGYADVSGSFSNGDVIELTVPSKVR 551
Query: 605 TEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKT 640
+ P+ + +GP +L+ G+ D+KT
Sbjct: 552 AYPL----PDSPDVYGFKYGPLVLSAEL-GKDDMKT 582
>gi|383640258|ref|ZP_09952664.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas elodea
ATCC 31461]
Length = 652
Score = 239 bits (611), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 179/549 (32%), Positives = 262/549 (47%), Gaps = 51/549 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE-NP 157
L+ + DV L + L AQ+ YLL L+ D L+ FR A L AYGGWE +P
Sbjct: 51 LQPFDMADVTLGEGPFL-HAQRATEAYLLRLEPDRLLHQFRVNAGLEPKAPAYGGWESDP 109
Query: 158 I---SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE 214
+ +GH +GHYLSA A + +T A ++++ + L CQ+ +G ++AFP
Sbjct: 110 LWSDIHCQGHTLGHYLSACALAYRATGEARYRQRVDYIATELGACQDAAKSGLVTAFPKG 169
Query: 215 ---LFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
+ K P+YT+HK+ AGL D +LAD+ A L++A W V
Sbjct: 170 AALVSAHLRGEKITGVPWYTLHKVYAGLRDGALLADSEPARATLLRLADWGVV-----AS 224
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ ++ E L E GGMN++ LY +T ++ +A F L LA D+
Sbjct: 225 RPLSDAEFEA---MLETEHGGMNEIYADLYFMTGKEEYRAIARRFSHKALLAPLARAQDH 281
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L HANT +P V+G Q YE TGD Y+ FF V + S+ATGG E ++
Sbjct: 282 LDGLHANTQVPKVVGFQRVYEATGDAAYRDAAAFFWKTVAQTRSFATGGHGDNEHFFA-- 339
Query: 388 RLAD----TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
+AD ++ ETC +NMLK++R LF + AYADYYER L NG+L+ Q +
Sbjct: 340 -MADFETHVFSAKGSETCCQHNMLKLTRALFLHDPDPAYADYYERTLYNGILASQ-DPDS 397
Query: 444 GVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
G+ Y G K + T +SFWCC GTG+E+ K DSIYF + LY+
Sbjct: 398 GMATYFQGARPGYMKL-----YHTPEHSFWCCTGTGMENHVKYRDSIYFHDAST---LYV 449
Query: 504 IQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
++ S+ W+ VL Q+ P V T T + + +L+LR P W S
Sbjct: 450 NLFLPSTLRWRDKGAVLVQETRFPEVP-------TTTLRWRLDKPVDVTLSLRHPGW--S 500
Query: 563 NGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
A +NG+ PG+ ++ W D + +QL + E P + A
Sbjct: 501 RTATVRVNGKVAARSVAPGSRIALPRNWRDGDVVELQLVMEPGVERA----PAAPDVVAF 556
Query: 622 LFGPYLLAG 630
+GP +LAG
Sbjct: 557 TYGPLVLAG 565
>gi|379726800|ref|YP_005318985.1| hypothetical protein MPD5_0184 [Melissococcus plutonius DAT561]
gi|376317703|dbj|BAL61490.1| hypothetical protein MPD5_0184 [Melissococcus plutonius DAT561]
Length = 883
Score = 239 bits (611), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 178/565 (31%), Positives = 264/565 (46%), Gaps = 70/565 (12%)
Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL-PTPGKAYGGWENPIS-ELRGHFVGH 168
Q + +AQ+ + YLL LDV ++ F K A + P Y GWE RGHF GH
Sbjct: 12 QDPYIHKAQENVIHYLLSLDVQKFLFEFYKVAGMKPLTESGYQGWERSDQVNFRGHFFGH 71
Query: 169 YLSASAQMWASTHNATIKEKM----STVVFSLSECQNKIG------TGYLSAFPTELFDS 218
+LSA A + + +K+K+ T + L Q GY+SAF D
Sbjct: 72 FLSALALSYQAEKQPILKKKIHQQIKTAITGLKAVQKNYAKQHPEHAGYISAFKEVALDE 131
Query: 219 FEALKPV--------WAPYYTIHKILAGLLDQYVLADNA------QALKMATWMVEYFYN 264
E KPV +Y +HKILAGLL+ + +AL +A+W +Y Y
Sbjct: 132 VEG-KPVDPKEKENVLVSWYNLHKILAGLLEVNISLKEVDSQLSKEALFIASWFGDYIYK 190
Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
R+ + + L E GGMND LY L+ +T +H + A FD+ LA
Sbjct: 191 RMMNLTDKNQM------LTIEYGGMNDALYCLFELTQKKEHAIAATYFDEDNLFNQLAND 244
Query: 325 ADYLSHFHANTHIPIVIGSQMRYEV----------TGDPLYKLIGTF-----FMDIVNAS 369
+ L HANT IP +IG+ RY V + + L+ F F IV +
Sbjct: 245 ENVLPGKHANTTIPKLIGALKRYMVFQSEDLSAWLSNEEKEHLMSYFKAAEKFWQIVVDN 304
Query: 370 HSYATGGTSAREFWWDPKRL----ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
H+Y TGG S E + +P L G ETC T+NMLK++R L+ TK Y DY
Sbjct: 305 HTYCTGGNSQSEHFHEPNELFYDSEIRQGDCTCETCNTHNMLKLTRKLYECTKNPKYLDY 364
Query: 426 YERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSK 485
YE N +L+ Q ++ G+M+Y P+G G +K + ++ FWCC GTGIESFSK
Sbjct: 365 YETTYINAILASQ-NSKTGMMMYFQPMGAGYNKV-----YNRPYDEFWCCSGTGIESFSK 418
Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
L D+ YF+E L++ Y S++ K ++ + QK D + + + L + +
Sbjct: 419 LADTYYFKENNR---LFVNLYFSNTLKLKENNLKIIQKTD---RKNGNVTIDLKTLTDKN 472
Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
+ Q L LR+P W + + L P F +E + ND++ +++ L+
Sbjct: 473 IIQPLQLALRLPNW--AKQVTIKKGKKLLNYEPHLGFAYLSELVTANDQIILEMEQELQL 530
Query: 606 EAIQDDRPEYASIQAILFGPYLLAG 630
D P+ A+ A +GPY+LAG
Sbjct: 531 L----DTPDNANYIAFKYGPYILAG 551
>gi|430751026|ref|YP_007213934.1| hypothetical protein Theco_2852 [Thermobacillus composti KWC4]
gi|430734991|gb|AGA58936.1| hypothetical protein Theco_2852 [Thermobacillus composti KWC4]
Length = 621
Score = 239 bits (611), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 164/563 (29%), Positives = 271/563 (48%), Gaps = 60/563 (10%)
Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFR----KTASLPTPGKAYGGWENPISELRGHFVGHYL 170
L R +Q N YL+ L+ DSL++++R + + P A+GGWE+P+ +LRGHF+GH+L
Sbjct: 16 LRRREQANRAYLMKLNSDSLLFNYRLEAGRYSGREIPPWAHGGWESPVCQLRGHFLGHWL 75
Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYY 230
SA+A + +T +A +K K ++ L+ECQ G + P + A K +WAP Y
Sbjct: 76 SAAAIHYHATGDAELKAKADGIIDELAECQKDNGGQWAGPIPEKYLHWIAAGKAIWAPQY 135
Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
+HK+ GL+D + A N +AL +A ++F + ++ ++ L+ ETGGM
Sbjct: 136 NLHKLFMGLVDSFQYAGNQKALDIADRFADWFVEWSGR----FTRDQFDDILDVETGGML 191
Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
+V L IT + K+ L + + L D L++ HANT IP V+G YEVT
Sbjct: 192 EVWADLLHITGNGKYKTLLERYYRGRLFQPLLEGKDPLTNMHANTTIPEVLGCARAYEVT 251
Query: 351 GDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
GD + ++ ++ V ATGG ++ E W ++ LG +N+E CT YNM+++
Sbjct: 252 GDSRWMDVVKAYWNCAVTERGFLATGGQTSGEVWMPKMKMKARLGDKNQEHCTVYNMMRL 311
Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTE------------PGVMIYMLPLGRGVS 457
+ LFR T + YA Y E L NGV++ E G++ Y LP+ G+
Sbjct: 312 AEFLFRHTGDPGYAQYREYNLYNGVMAQTYYREYALNGNPHNHPGTGLLTYFLPMKAGLR 371
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF--DWKS 515
K W T+ +SF+CC+GT +++ + IY+++ ++ YI QY +S +
Sbjct: 372 K-----DWSTETSSFFCCHGTMVQANAAWNRGIYYQDRDDI---YICQYFNSEMTTEING 423
Query: 516 GHVVLNQKVDPI-----------------------VSWDPYLRMTLTFSSKQEVGQLSSL 552
G + + Q DP+ + PY + F + V Q ++
Sbjct: 424 GELRIIQTQDPMNGNSMTSSNTAGYQSINEVAAIHENLPPYRK--YDFVIRTSVQQPFAI 481
Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
+ R+P W S+ + + F W DK+++ LP+ +R + DD
Sbjct: 482 HFRIPEWIMSDAVLYVNDEFHGKTSDSTRFYPIRRVWRDGDKISVLLPIGIRFVPLPDDE 541
Query: 613 PEYASIQAILFGPYLLAGHTSGE 635
+ A +GP +LAG E
Sbjct: 542 ----NTGAFRYGPEVLAGICDAE 560
>gi|410638732|ref|ZP_11349285.1| acetyl-CoA carboxylase, biotin carboxylase [Glaciecola lipolytica
E3]
gi|410141260|dbj|GAC16490.1| acetyl-CoA carboxylase, biotin carboxylase [Glaciecola lipolytica
E3]
Length = 818
Score = 239 bits (609), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 168/528 (31%), Positives = 263/528 (49%), Gaps = 44/528 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQQTN+ YLL + D L+ + + A L +YG WEN + L GH GHYLSA + W
Sbjct: 67 AQQTNVGYLLAIQPDKLLAPYLREAGLEPKVDSYGNWEN--TGLDGHIGGHYLSALSLAW 124
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTE--LFDSFE---------ALKPVW 226
A+T + +K ++ ++ L + QN G GYL P ++D + +L W
Sbjct: 125 AATQDTELKRRLDYMLNELQKAQNANG-GYLGGIPNGKVMWDEIKQGNIKADLFSLNDRW 183
Query: 227 APYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
P Y I KI GL D Y++A++ QA L + WM++ N ++ +++ YS
Sbjct: 184 VPLYNIDKIFHGLRDAYLIANSEQAKTMLLSLGQWMLDVTNN-----LSDEQIQQMLYS- 237
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
E GG+N+V + +I+ D +L LA F + L D L+ HANT IP +IG
Sbjct: 238 --EHGGLNEVFADMSTISGDKAYLELARKFSHKRIIDPLVAHKDELNGLHANTQIPKIIG 295
Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETC 401
+ ++ D +K FF + V S A GG S RE + D + + E ETC
Sbjct: 296 ALKVAQLNNDESWKEAARFFWETVTKQRSVAIGGNSVREHFHDAADFSPMVEDPEGPETC 355
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
TYNM+K+S+ LF T + Y DYYERA N +LS Q E G ++Y + G + S
Sbjct: 356 NTYNMIKLSKLLFLQTADTRYLDYYERATYNHILSSQH-PEHGGLVYFTSMRPGHYRMYS 414
Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
+ +S WCC G+GIE+ SK G+ IY +V L + +ISS+ W + L
Sbjct: 415 SVQ-----DSMWCCVGSGIENHSKYGELIY---SHSVDNLSVNLFISSTLRWPEKGLKLT 466
Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
+ S + +++ +++++G+ LN+R P W +S+ NG+ +
Sbjct: 467 LETQFPDSQNVVIKLHQL--AEKQMGEF-VLNIRKPAW-FSHDISMFKNGEKINYVENEG 522
Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
++ + W D+L+ +L L TE + D + Y A+L+GP +LA
Sbjct: 523 YIQIQQNWQDGDELSFELAAGLSTEQLPDGQNYY----AVLYGPVVLA 566
>gi|312133546|ref|YP_004000885.1| protein [Bifidobacterium longum subsp. longum BBMN68]
gi|322690281|ref|YP_004219851.1| hypothetical protein BLLJ_0089 [Bifidobacterium longum subsp.
longum JCM 1217]
gi|311772796|gb|ADQ02284.1| Hypothetical protein BBMN68_1283 [Bifidobacterium longum subsp.
longum BBMN68]
gi|320455137|dbj|BAJ65759.1| conserved hypothetical protein [Bifidobacterium longum subsp.
longum JCM 1217]
Length = 800
Score = 238 bits (608), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 181/597 (30%), Positives = 268/597 (44%), Gaps = 82/597 (13%)
Query: 94 LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL---PTPGKA 150
LPG + L +V + +SV RA++ L+Y VD + FR A+L +
Sbjct: 81 LPGWKVAPFPLRNVAITSNSVFDRAKEGMLDYARNYPVDRWLVCFRAQANLLPKDNTTQP 140
Query: 151 YGGWENPISE---------------------------LRGHFVGHYLSASAQMWASTHNA 183
GGWEN S LRGHF GH L +Q +A T
Sbjct: 141 SGGWENFPSGSLDKAVEQQWGDAEYTRGQNKNGADGLLRGHFAGHALHMLSQAYAETGEE 200
Query: 184 TIKEKMSTVVFSLSECQNKIGT------------GYLSAFPTELFDSFEALKP---VWAP 228
I K++ V L EC++ + G+L+A+ F + E P +WAP
Sbjct: 201 AILNKINEFVSGLKECRDSLREMKYNGKARYSHPGFLAAYGEWQFKALEEYAPYGEIWAP 260
Query: 229 YYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETG 287
+YT HKILAGL+ Y A NA AL +A + + Y R+ K T +++ W + E G
Sbjct: 261 WYTEHKILAGLIAAYEFAGNADALDLAEGIGHWTYARLSKC-TKTQLQKMWDIYIGGEYG 319
Query: 288 GMNDVLYRLYSITHDPKH---LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
GMND L LY+++ D L + FD + D L++ HAN HIP +G
Sbjct: 320 GMNDSLVDLYNVSKDKDRSEFLKASAFFDTDKLIDNCGAGVDILNNLHANQHIPQFVGYA 379
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNA-------SHSYATGGTSAREFWWDPKRLADTLGSEN 397
+ + ++ V YA GGT E W +A +G N
Sbjct: 380 KDAAMGDADIDADARARYLKAVEGYWGMIVPGRMYAHGGTGEGEMWGPAHTVAGDIGKRN 439
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ-RGTEPGVMI-----YMLP 451
E+C YNMLKV+R+LF ++ AY DYYER + N +L + R + G + YM P
Sbjct: 440 AESCAAYNMLKVARYLFFIEQKPAYMDYYERTILNHILGGKSRDLDSGTALTPGNCYMYP 499
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
+ K GT CC GT +ES SK DSIYF N LY+ + +S+
Sbjct: 500 VNPATQKEYGDGNIGT------CCGGTALESHSKYQDSIYFHSTDNKE-LYVNLFTASTL 552
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
DW + L Q+ + + +++T + K V + +R+P W S GA+ +NG
Sbjct: 553 DWTDTGLKLAQETN--YPEEETSTISITAAPKSAV----TFRIRIPAW--SKGAKIEVNG 604
Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
+ + G + + W DK+ + +PL LRTE+ DDR + IQ + +GP +L
Sbjct: 605 KAIDGVTAGEYATVAGSWKVGDKIVVTIPLQLRTEST-DDRKD---IQTLFYGPTVL 657
>gi|338209455|ref|YP_004646426.1| hypothetical protein Runsl_5734 [Runella slithyformis DSM 19594]
gi|336308918|gb|AEI52019.1| protein of unknown function DUF1680 [Runella slithyformis DSM
19594]
Length = 760
Score = 238 bits (608), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 166/531 (31%), Positives = 257/531 (48%), Gaps = 44/531 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ +L Y+L L+ D L+ + A LP + YG WE+ S L GH GHYLSA A M+
Sbjct: 40 AQDVDLRYILSLNPDKLLAPYLIDAGLPLKAERYGNWES--SGLDGHIGGHYLSALAMMY 97
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEALKPVW 226
AST NA +K+++ ++ L++CQ K G GY+ P ++ S L W
Sbjct: 98 ASTGNAELKKRLDYMIDQLAQCQAKNGNGYVGGIPQGKVFWERIYKGDIDGSSFGLNNTW 157
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
P Y IHK+ AGL D Y N QA ++ + ++F ++I S ++ L E
Sbjct: 158 VPLYNIHKLFAGLRDSYEFGGNQQAKQVLIGLGDWF----AELIRPLSDDQIQQILRTEH 213
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGMN+ LY +T + K+L A L L + D L+ HANT IP VIG +
Sbjct: 214 GGMNEAFADLYILTKNQKYLETAQRISHRAILNPLVQKQDKLTGLHANTQIPKVIGFEKI 273
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
+T + + +F V+ + + A GG S RE + + L S + ETC ++N
Sbjct: 274 AMLTENAKWSEAARYFWQNVSQTRTVAFGGNSVREHFNPTNDFSSMLKSNQGPETCNSFN 333
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH-- 463
ML++S+ LF + +Y D+YER L N +LS Q + G +Y P+ R H
Sbjct: 334 MLRLSKALFLDKNDPSYLDFYERTLYNHILSSQH-PQKGGFVYFTPI-------RPNHYR 385
Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
+ S WCC G+G+E+ +K + IY + L++ +I S+ WK + L Q
Sbjct: 386 VYSQPETSMWCCVGSGLENHTKYSELIYSHSAND---LFVNLFIPSTLHWKEKSIQLTQA 442
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNF 582
+ PY + F K Q +LN+R P W ++ + +NG+ P P N+
Sbjct: 443 TEF-----PYKNQS-EFVLKLAKSQAFTLNIRYPKW--ADDVEVMVNGKLYPTSAQPSNY 494
Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+ +W DKL+++ S E + P+ ++ A + GP +LA TS
Sbjct: 495 IGIRRKWKTGDKLSVRFTTSTHLEYL----PDGSNWAAFVHGPIVLAAKTS 541
>gi|182415028|ref|YP_001820094.1| hypothetical protein Oter_3214 [Opitutus terrae PB90-1]
gi|177842242|gb|ACB76494.1| protein of unknown function DUF1680 [Opitutus terrae PB90-1]
Length = 844
Score = 238 bits (607), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 179/554 (32%), Positives = 265/554 (47%), Gaps = 46/554 (8%)
Query: 102 VSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISEL 161
+ L V L + + A + N YLL LD D L+ FR+ A LP + YG WE+ L
Sbjct: 76 LPLASVRLLEGGPFFTAVKANRTYLLALDADRLLAPFRREAGLPALAQPYGNWES--GGL 133
Query: 162 RGHFVGHYLSASAQMWASTHN---ATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELF 216
GH GHYLSA A M A+ H+ ++ ++ +V L CQ+ G GY+ P EL+
Sbjct: 134 DGHTAGHYLSALAHMIAAGHDTPEGELRRRLDHMVAELKACQDANGNGYVGGVPGSHELW 193
Query: 217 D-----SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQ 267
A+ W P+Y +HK AGL D ++ N A +++ W V
Sbjct: 194 QRVAAGDVTAVNRKWVPWYNLHKTFAGLRDAWLQTGNTTARDVLVRLGDWCV-------- 245
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ + + E+ L +E GGMN+VL +Y+IT D K+L A F+ L L D
Sbjct: 246 ALTSPLTDEQMQRMLAQEHGGMNEVLADIYAITGDKKYLTAAERFNHHAVLDPLEQHRDE 305
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L+ HANT IP V+G + +TGD FF + V S A GG S E + DP
Sbjct: 306 LTGKHANTQIPKVVGLERIATLTGDKAADSGARFFWETVTQHRSVAFGGNSVSEHFNDPH 365
Query: 388 RL-ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
A + E ETC TYNML+++ LF E AYADYYERAL N +L+ PG
Sbjct: 366 NFHALLVHREGPETCNTYNMLRLTEGLFASAPEAAYADYYERALFNHILASINPDHPG-Y 424
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
+Y P+ + S G FWCC GTG+E+ K G+ IY G+++ +
Sbjct: 425 VYFTPIRPNHYRVYSQPDQG-----FWCCVGTGMENPGKYGEFIYARAHD---GVFVNLF 476
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
I+S + L Q+ D ++TL + Q +L++R P W +
Sbjct: 477 IASELTVAPLGLTLRQQT--AFPDDERSQLTLKLAQPQTF----TLHVRQPGWVAAGTFT 530
Query: 567 ASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
++NG+ + + P ++++ W D++ I+ P+ E + D P Y AIL GP
Sbjct: 531 LTVNGEPVAVTSAPSSYVTIHREWRDGDRVEIRFPMHTSIEGLPDGSPWY----AILRGP 586
Query: 626 YLLAGHTSGEWDIK 639
+LA H +G W++K
Sbjct: 587 IVLA-HPAGTWELK 599
>gi|189466409|ref|ZP_03015194.1| hypothetical protein BACINT_02784 [Bacteroides intestinalis DSM
17393]
gi|189434673|gb|EDV03658.1| hypothetical protein BACINT_02784 [Bacteroides intestinalis DSM
17393]
Length = 789
Score = 238 bits (607), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 174/563 (30%), Positives = 269/563 (47%), Gaps = 59/563 (10%)
Query: 100 KEVS---LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWEN 156
+EVS L DV L +S L +AQQT+L Y++ ++ D L+ F + A L +Y WEN
Sbjct: 24 QEVSYFPLQDVKLLESPFL-QAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWEN 82
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TE 214
+ L GH GHY+SA + M+A+T + I +++ ++ L Q +GTG++ P +
Sbjct: 83 --TGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLAELHRAQQAVGTGFIGGTPGSLQ 140
Query: 215 LFDSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEY 261
L+ +A L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSNLAREMLIALTDWMID- 199
Query: 262 FYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFL 321
+ + ++ L E GG+N+ + IT D K+L LA F L L
Sbjct: 200 -------ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPL 252
Query: 322 ALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYAT 374
D L+ HANT IP VIG + ++ D + FF + V S
Sbjct: 253 VKDEDRLTGMHANTQIPKVIGYKRIADLAQDDKDWNHASEWDHAARFFWNTVVNHRSVCI 312
Query: 375 GGTSAREFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
GG S RE + L + ETC TYNML++++ L++ + +I +ADYYERAL N
Sbjct: 313 GGNSVREHFHPADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNH 372
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
+L+ Q+ E G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 373 ILASQQ-PEKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAH 426
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
LY+ +I S W+ V L Q+ + +R + S K+ SL
Sbjct: 427 TNDT---LYVNLFIPSRLTWQEKKVTLVQETR--FPDEEQIRFRVEKSRKKAF----SLK 477
Query: 554 LRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
LR P W + GA S+NG+ PG +L+ +W D++T+ +P+ + E I
Sbjct: 478 LRYPSW--AKGASVSVNGKVQETNAQPGEYLTIHRKWKAGDEITLNMPMQVALEQI---- 531
Query: 613 PEYASIQAILFGPYLLAGHTSGE 635
P+ + A ++GP +LA T E
Sbjct: 532 PDRENFYAFMYGPIVLASPTGTE 554
>gi|419849455|ref|ZP_14372501.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
gi|419852148|ref|ZP_14375044.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
gi|386411767|gb|EIJ26479.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
gi|386411993|gb|EIJ26692.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
Length = 800
Score = 238 bits (606), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 180/597 (30%), Positives = 269/597 (45%), Gaps = 82/597 (13%)
Query: 94 LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL---PTPGKA 150
LPG + L +V + +SV RA++ L+Y VD + FR A+L +
Sbjct: 81 LPGWKVAPFPLRNVAITSNSVFDRAKEGMLDYARNYPVDRWLVCFRAQANLLPKDNTTQP 140
Query: 151 YGGWEN--------PISE-------------------LRGHFVGHYLSASAQMWASTHNA 183
GGWEN + + LRGHF GH L +Q +A T
Sbjct: 141 SGGWENFPNGSLDKAVEQQWGDAEYTRGQNKNGADGLLRGHFAGHALHMLSQAYAETGEE 200
Query: 184 TIKEKMSTVVFSLSECQNKIGT------------GYLSAFPTELFDSFEALKP---VWAP 228
I K++ V L EC++ + G+L+A+ F + E P +WAP
Sbjct: 201 AILNKINEFVSGLKECRDSLREMKYNGKARYSHPGFLAAYGEWQFKALEEYAPYGEIWAP 260
Query: 229 YYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETG 287
+YT HKILAGL+ Y A NA AL +A + + Y R+ K T +++ W + E G
Sbjct: 261 WYTEHKILAGLIAAYEFAGNADALDLAEGIGHWTYARLSKC-TKTQLQKMWDIYIGGEYG 319
Query: 288 GMNDVLYRLYSITHDPKH---LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
GMND L LY+++ D L + FD + D L++ HAN HIP +G
Sbjct: 320 GMNDSLVDLYNVSKDKDRSEFLKASAFFDTDKLIDNCGAGVDILNNLHANQHIPQFVGYA 379
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNA-------SHSYATGGTSAREFWWDPKRLADTLGSEN 397
+ + ++ V YA GGT E W +A +G N
Sbjct: 380 KDAAMGDADIDADARARYLKAVEGYWGMIVPGRMYAHGGTGEGEMWGPAHTVAGDIGKRN 439
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ-RGTEPGVMI-----YMLP 451
E+C YNMLKV+R+LF ++ AY DYYER + N +L + R + G + YM P
Sbjct: 440 AESCAAYNMLKVARYLFFIEQKPAYMDYYERTILNHILGGKSRDLDSGTALTPGNCYMYP 499
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
+ K GT CC GT +ES SK DSIYF N LY+ + +S+
Sbjct: 500 VNPATQKEYGDGNIGT------CCGGTALESHSKYQDSIYFHSTDNKE-LYVNLFTASTL 552
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
DW + L Q+ + + +++T + K V + +R+P W S GA+ +NG
Sbjct: 553 DWTDTGLKLAQETN--YPEEETSTISITAAPKSAV----TFRIRIPAW--SKGAKIEVNG 604
Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
+ + G + + W DK+ + +PL LRTE+ DDR + IQ + +GP +L
Sbjct: 605 KAIDGVTAGEYATVAGSWKVGDKIVVTIPLQLRTEST-DDRKD---IQTLFYGPTVL 657
>gi|399030291|ref|ZP_10730797.1| hypothetical protein PMI10_02670 [Flavobacterium sp. CF136]
gi|398071797|gb|EJL63044.1| hypothetical protein PMI10_02670 [Flavobacterium sp. CF136]
Length = 771
Score = 238 bits (606), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 173/556 (31%), Positives = 262/556 (47%), Gaps = 53/556 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
++E L ++ L S AQ +L+YLL L+ D L+ + +A +PT YG WEN
Sbjct: 34 MQEFKLQEIKL-TSGPFKNAQNVDLKYLLDLNPDRLLAPYLISAGIPTKADRYGNWENI- 91
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
L GH GHYL+A + M+AST N IK ++ ++ L+ CQ K GTGY+ P
Sbjct: 92 -GLDGHIGGHYLAALSMMYASTGNKEIKSRLDYMISELALCQEKDGTGYVGGIPEGKVFW 150
Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
++ S L W P Y IHK+ AGL+D Y N +A +K+ W +E
Sbjct: 151 DRIHKGDIDGSGFGLNNTWVPIYNIHKLFAGLIDAYNYTGNEKAKEIVIKLGDWFIE--- 207
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
+I S E+ L E GG+N+ LYSIT + K+L A + L L
Sbjct: 208 -----LIRPLSDEQIQKILKTEHGGINESFADLYSITKNKKYLETAEKLSQKAILDPLIK 262
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
+ D L+ HANT IP VIG + +++ + + FF V + A GG S E +
Sbjct: 263 KEDKLTGLHANTQIPKVIGFEKIGKLSDNKQWSDAAQFFWMNVTEKRTVAFGGNSVAEHF 322
Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
+ L S + ETC +YNM ++S+ LF ++Y D+YER L N +LS Q
Sbjct: 323 NPINDFSGMLKSNQGPETCNSYNMERLSKALFLDKNNVSYLDFYERTLYNHILSSQEPNR 382
Query: 443 PGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
G +Y P+ R H + S WCC GTG+E+ SK G+ IY E ++
Sbjct: 383 GG-FVYFTPI-------RPNHYRVYSQPETSMWCCVGTGLENHSKYGELIYSHSERDI-- 432
Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
++ +I S+ +WK + L Q ++ + L + + LN+R P W
Sbjct: 433 -FVNLFIPSTLNWKEKGIELEQTTK--FPYENNTEIVLKLKNPKSF----VLNIRYPKW- 484
Query: 561 YSNGAQASLNGQ-NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ 619
+ + +NG+ P N++S +W DK+TI S E + P+ ++
Sbjct: 485 -ATNFEILVNGKLQKAEAKPTNYVSMARKWKSGDKITIAFKTSTHLEKL----PDGSNWA 539
Query: 620 AILFGPYLLAGHTSGE 635
A + GP +LA TS E
Sbjct: 540 AFVNGPIVLAAKTSTE 555
>gi|333378944|ref|ZP_08470671.1| hypothetical protein HMPREF9456_02266 [Dysgonomonas mossii DSM
22836]
gi|332885756|gb|EGK06002.1| hypothetical protein HMPREF9456_02266 [Dysgonomonas mossii DSM
22836]
Length = 787
Score = 237 bits (605), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 171/553 (30%), Positives = 265/553 (47%), Gaps = 51/553 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
LK+++L D S RAQ + +YLL LD D L+ F + A L ++Y WEN
Sbjct: 31 LKDITLLD------SPFKRAQDLDKKYLLDLDADRLLAPFIREAGLQKKAESYTNWEN-- 82
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
+ L GH GHY+SA A M+AST + IK+++ ++ L CQ++ G GY+ P ++
Sbjct: 83 TGLDGHIGGHYVSALALMYASTGDQQIKDRLDYMISELKRCQDENGNGYIGGVPGGKAIW 142
Query: 217 DSFE---------ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
D L W P Y IHK AGL D Y++A N A +KM W V
Sbjct: 143 DEIAKGDIQASGFGLNNRWVPLYNIHKTYAGLRDAYLIAGNETAKDMLIKMTDWAV---- 198
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
K+++ S E+ L E GG+N+ + IT + K+L LAH F L L
Sbjct: 199 ----KLVSNLSEEQIQDMLRSEHGGLNETFADVAVITQNEKYLKLAHQFSHQLILNPLLA 254
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
D L+ HANT IP V+G + ++ G+ + FF + V S GG S RE +
Sbjct: 255 HEDKLTGLHANTQIPKVLGFKRIADIEGNESWSEASRFFWETVVEHRSVCIGGNSVREHF 314
Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
+ + S E ETC TYNML++S+ ++ + + Y DYYE+AL N +LS Q +
Sbjct: 315 HPTNDFSSMITSNEGPETCNTYNMLRLSKMFYQTSLDKKYIDYYEKALYNHILSSQ-NPQ 373
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
G ++Y + G + + S WCC G+GIES +K G+ IY LY
Sbjct: 374 TGGLVYFTQMRPGHYRV-----YSQPQTSMWCCVGSGIESHAKYGEMIYAHTSD---ALY 425
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
+ +I S +WK +V + Q D + +T+ K E ++ +R P W
Sbjct: 426 VNLFIPSLLNWKDRNVEIVQ--DNKFPDESKTEITVNPKKKSEF----TVYVRYPSWVEK 479
Query: 563 NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
+ LNG+ P ++ W D+++++LP+++ E + P+ ++ +
Sbjct: 480 GTMKIKLNGKTYPGVEKDGYIGIKRTWQKGDRISVELPMTIVAEQL----PDKSNYYSFR 535
Query: 623 FGPYLLAGHTSGE 635
+GP +LA T E
Sbjct: 536 YGPIVLAAKTGVE 548
>gi|226325822|ref|ZP_03801340.1| hypothetical protein COPCOM_03635 [Coprococcus comes ATCC 27758]
gi|225205946|gb|EEG88300.1| hypothetical protein COPCOM_03635 [Coprococcus comes ATCC 27758]
Length = 761
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 173/592 (29%), Positives = 282/592 (47%), Gaps = 49/592 (8%)
Query: 61 WSSLIPSK----------ILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLD 110
W++ P+K ++ + E S + Y+K P P + L V L
Sbjct: 147 WNTYEPAKEEKKVVAVAGVIDGTEKEASAEIHYKKEIVP--VKGPKKKVGYFPLGQVRLK 204
Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPISELRGHFVGHY 169
+ ++ ++ Q+ EYLL +D D ++++FRK L T G GW+ +L+GH GHY
Sbjct: 205 EGTLYYKYQKLMEEYLLGIDDDQMLYNFRKATGLDTKGAPPMTGWDEESCKLKGHTTGHY 264
Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQN------KIGTGYLSAFPTELFDSFEALK 223
LS A +A+T N +K++ +V L +CQ+ K G+LSA+ E FD E
Sbjct: 265 LSGIALAFAATGNLKFLDKVNYMVAELKKCQDAFAATGKYHRGFLSAYSEEQFDLLEVYT 324
Query: 224 P---VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW- 279
+WAPYYT+ KI++GL D +VLA N A ++ M ++ Y+R+ + + ++++ W
Sbjct: 325 KYPEIWAPYYTLDKIMSGLYDCHVLAGNETAKEILDLMGDWVYDRLSR-LPKETLDKMWA 383
Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
+ E GGM + ++Y +T HL A LF+ + + D L HAN HIP
Sbjct: 384 MYIAGEFGGMLGTMVKVYELTGKENHLKAAKLFENEKLFYPMEEECDTLEDMHANQHIPQ 443
Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE 399
+IG+ Y TGD +Y IG F +IV H+Y GG E + L + E
Sbjct: 444 IIGAMDLYRATGDEIYWEIGKNFWNIVTGGHTYCIGGVGETEMFHRANTTCSYLTDKAAE 503
Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
+C +YNML+++ LF +T+ DYY+ L N +L+ G Y LPLG G K
Sbjct: 504 SCASYNMLRLTSQLFEYTRSGNLMDYYDNTLRNHILTSSSHKCDGGTTYFLPLGPGGRKE 563
Query: 460 RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
+ NS CC+GTG+ES + ++IY ++E LYI + S ++G +
Sbjct: 564 -----FFLSENS--CCHGTGMESRFRYMENIYAQDE---DALYINLLVDSVLTDENGKTM 613
Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPP 578
+ + S D M + Q+ L + +P W + S+NG+ L
Sbjct: 614 IE-----LQSVDEEGVMEIRCQKDQK----KVLKIHIPAWGQKD-FNVSVNGKVLANTAL 663
Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+L D + ++LP+ R + D++ + A + + +GPY+LA
Sbjct: 664 HDGYLVIDADPKAGDVIRLELPMEFR---VLDNKSDAAFVN-LAYGPYILAA 711
>gi|404450474|ref|ZP_11015456.1| hypothetical protein A33Q_14151 [Indibacter alkaliphilus LW1]
gi|403763872|gb|EJZ24792.1| hypothetical protein A33Q_14151 [Indibacter alkaliphilus LW1]
Length = 782
Score = 236 bits (603), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 171/558 (30%), Positives = 273/558 (48%), Gaps = 53/558 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L++V L D S RAQ+ + +Y+L +DVD L+ + K A L YG WEN
Sbjct: 33 LRQVKLKD------SPFKRAQEVDKKYILEMDVDRLLAPYMKEAGLTWSADNYGNWEN-- 84
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
+ L GH GHYLSA + M+AST + I +++ ++ L Q++ G GYLS P +++
Sbjct: 85 TGLDGHIGGHYLSALSLMFASTGDPEINKRLDYMLEQLKHAQDQSGDGYLSGVPYGRKIW 144
Query: 217 DSFEA---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
+ ++ L W P Y IHKI AGL D Y + A M + ++F +
Sbjct: 145 NELKSGKINAGNFSLNDRWVPLYNIHKIFAGLRDAYWIGGKEIAKPMLVSLSDWFLD--- 201
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ ++ ++ L E GG+N+V + +T D K+L LA L L + D
Sbjct: 202 -LTDGFTEDQFQEMLISEHGGLNEVFADVAVMTGDSKYLSLAKKMSHNAILQPLKEEKDE 260
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L+ HANT IP VIG Q +V+ D FF V S + GG S RE +
Sbjct: 261 LNGLHANTQIPKVIGFQRIAQVSKDQNLHQASDFFWKNVVYQRSVSIGGNSVREHFHPTS 320
Query: 388 RLADTLGSEN-EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
+ L SE ETC TYNM+++S LF+ + Y DYYERA+ N +LS Q + G +
Sbjct: 321 DFSSMLSSEQGPETCNTYNMMRLSEMLFQLAPDRKYIDYYERAVFNHILSTQHPKKGGFV 380
Query: 447 IYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
+ + R H + +FWCC G+G+E+ +K G +IY + + LY+
Sbjct: 381 YF--------TSMRPQHYRVYSQPHENFWCCVGSGLENHAKYGQAIYAYRKDD---LYLN 429
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT-LTFSSKQEVGQLS-SLNLRMPVWTYS 562
+I+S DW+ + L Q D PY + +TFS K G+ S +L +R P W
Sbjct: 430 LFIASELDWEEKGIKLIQNTDF-----PYKDESEITFSHK---GKKSFNLKIRYPNWVKE 481
Query: 563 NGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
+ ++NG+ + + + +++ W+ DK+ ++LP+ + E + P+ ++ +
Sbjct: 482 GMLEVTINGEQVEVSVDRHGYITLNREWTSKDKINLKLPMETKAERL----PDGSNWVSF 537
Query: 622 LFGPYLLAGHTSGEWDIK 639
GP +L T + D+K
Sbjct: 538 SHGPIVLGAKTGAD-DLK 554
>gi|224537183|ref|ZP_03677722.1| hypothetical protein BACCELL_02060 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521238|gb|EEF90343.1| hypothetical protein BACCELL_02060 [Bacteroides cellulosilyticus
DSM 14838]
Length = 790
Score = 236 bits (603), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 184/606 (30%), Positives = 280/606 (46%), Gaps = 65/606 (10%)
Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
L A+ N+E LL D D L+ +RK A L K Y W+ L GH GHYL+A A
Sbjct: 40 LKHARDLNIETLLKYDCDRLMAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMA 95
Query: 175 QMWASTHNATIKEKMSTVVFSLSECQN-------KIGTGYLSAFPTELF-------DSFE 220
+ A+T N +++M ++ ++EC + G GY+ P F
Sbjct: 96 -INAATGNEECRKRMEYIISEIAECAEANSKNHPQWGIGYMGGMPNSQNIWNGFKDGDFR 154
Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVE 276
WAP+Y +HK+ AGL D ++ N QA L+ W + + + S E
Sbjct: 155 VYSGSWAPFYNLHKMYAGLRDAWLYCGNEQAKSLFLQFCNWAIH--------ITSGLSDE 206
Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
+ L E GGMN+VL Y+ITH+ K+L A F ++ + D L + HANT
Sbjct: 207 QMERMLGNEHGGMNEVLADAYAITHEQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQ 266
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS- 395
+P VIG + E++G+ Y + +FF DIV S A GG S RE + D +
Sbjct: 267 VPKVIGFERISELSGNEDYHVASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDI 326
Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
+ E+C T NMLK++ L R E YADYYE A N +LS Q E G +Y P
Sbjct: 327 DGPESCNTNNMLKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTP---- 381
Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
++ R + + WCC GTG+E+ K G IY G+ L++ Y +S DWK
Sbjct: 382 -ARPRHYRNYSAPNEAMWCCVGTGMENHGKYGQFIY-THAGDA--LFVNLYAASQLDWKE 437
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
+ L Q+ + PY + T + + G +L +R P W + + S+NG+ +
Sbjct: 438 RGITLRQE-----TAFPYSENS-TITIAEGKGTF-NLMVRYPGWVHPGEFKVSVNGKPVD 490
Query: 576 -LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
+ P +++S +W D + I P+ + ++ P+Y A++ GP LL
Sbjct: 491 IITGPSSYVSINRKWKKGDVVNINFPMHSSLRYLPNE-PQYV---ALMHGPILLG----- 541
Query: 635 EWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDA 694
+KTGT S+++LI+ F Q + +++N SI + PVSG
Sbjct: 542 ---MKTGT-ESMASLIAD-DSRFGQYAGGPKQPIDKAPILINNDITSIPSQLTPVSG--K 594
Query: 695 ALHATF 700
LH T
Sbjct: 595 PLHFTL 600
>gi|302873208|ref|YP_003841841.1| hypothetical protein Clocel_0296 [Clostridium cellulovorans 743B]
gi|307688627|ref|ZP_07631073.1| hypothetical protein Ccel74_10733 [Clostridium cellulovorans 743B]
gi|302576065|gb|ADL50077.1| protein of unknown function DUF1680 [Clostridium cellulovorans
743B]
Length = 607
Score = 236 bits (602), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 165/558 (29%), Positives = 265/558 (47%), Gaps = 50/558 (8%)
Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPG----------KAYGGWENPISELRGHFVGHYLS 171
N YL+ + L+ +F A + PG + + GW+ P +LRGHF+GH+LS
Sbjct: 24 NRNYLINVKNQGLLQNFYLEAGIILPGLQVLHNPDTDEIHWGWDAPTCQLRGHFLGHWLS 83
Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYT 231
A+A ++ S + +K K+ ++ L +CQ G ++ P + F E VW+P Y
Sbjct: 84 AAASIFVSEQDHELKAKLDKIIDELIKCQELNGGEWIGPIPEKYFQKLENSHHVWSPQYV 143
Query: 232 IHKILAGLLDQYVLADNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
+HK+L GL++ Y+ ++ +AL K++ W +++ + + K R Y E
Sbjct: 144 MHKVLMGLMNSYIDTNSDKALAILDKLSNWYIKWTDDMLIK------NPRAIYG--GEEA 195
Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
GM +V +Y IT + K+L LA + P L D L++ HAN IP G+ Y
Sbjct: 196 GMLEVWITMYEITAEEKYLELAKKYSNPRIFRDLEAGRDTLTNCHANASIPWSHGAAKLY 255
Query: 348 EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
EVTGD + K+ F+ + V Y +GG A E+W P +L L N+E CT YNM
Sbjct: 256 EVTGDEKWRKITEAFWKNAVTDRGYYCSGGQGAGEYWTPPFKLGLFLSDSNQEFCTVYNM 315
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
++ + +L++WT + ++ADY E L NG L+ Q+ G+ Y LPLG G K WG
Sbjct: 316 IRTASYLYKWTGDTSFADYIELNLYNGFLA-QQNKYTGMPTYFLPLGAGSKKK-----WG 369
Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--VVLNQKV 524
T+ FWCC+GT +++ + IYFE++ L + QYI S W + + + Q+V
Sbjct: 370 TETRDFWCCHGTMVQAQTLYNSLIYFEDKER---LVVSQYIPSELKWNYNNTDITIQQRV 426
Query: 525 DPIVSWDPYL----------RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
+ D R +L F E + +L+ R+P W + N +
Sbjct: 427 NMKYYNDLAFFDERDESQMSRWSLKFQVAAEKNESFTLSFRVPKWVKELPSVTINNEKID 486
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
L +++ WS D++ I P L + P+ A + GP +LAG
Sbjct: 487 DLTVDEGYINIKREWS-QDEVLIYFPCRLEISPL----PDMPDTFAFMEGPIVLAGICDE 541
Query: 635 EWDIKTGTARSLSALISP 652
E + G A S ++ P
Sbjct: 542 ERRL-YGDADKPSEILMP 558
>gi|86142285|ref|ZP_01060795.1| hypothetical protein MED217_11584 [Leeuwenhoekiella blandensis
MED217]
gi|85831037|gb|EAQ49494.1| hypothetical protein MED217_11584 [Leeuwenhoekiella blandensis
MED217]
Length = 793
Score = 236 bits (601), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 168/536 (31%), Positives = 258/536 (48%), Gaps = 41/536 (7%)
Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYL 170
+S V A T+ Y+ LD D L+ F + A L +Y WEN + L GH GHY+
Sbjct: 37 ESGVFKEAALTDFNYIQALDADRLLAPFLREAGLEPKADSYTNWEN--TGLDGHTAGHYI 94
Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA------- 221
SA + +AST + KE + + L Q G GY+ P L+ +A
Sbjct: 95 SALSMYYASTGDPKAKEMLEYALAELDRVQKSNGNGYIGGVPGSDALWAEIKAGKINAGS 154
Query: 222 --LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
L W P Y IHK GL D ++ A+ QA +M + ++F + + S +
Sbjct: 155 FSLNDKWVPLYNIHKTFNGLKDAWIHAELPQAKRMLIELTDWFLD----ITADLSEAQIQ 210
Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI 339
L E GG+N+V +Y+IT D K+L LA F + L LA D L+ HANT IP
Sbjct: 211 DMLRSEHGGLNEVFAEVYAITSDKKYLKLAEDFSQHALLKPLAANEDILTGMHANTQIPK 270
Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN-E 398
IG + ++ Y + F D V S + GG S RE + + + SE
Sbjct: 271 FIGFERISQLEEAKDYHDAASNFFDNVTTRRSISIGGNSVREHFNPVDDFSSVVSSEQGP 330
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C TYNMLK+S+ LF T E Y D+YER L N +LS Q G +Y P+ G +
Sbjct: 331 ESCNTYNMLKLSKLLFEDTSEEHYIDFYERGLYNHILSSQNPD--GGFVYFTPIRPGHYR 388
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
+ SFWCC G+G+E+ +K + IY ++E LY+ +I S +W+ +
Sbjct: 389 V-----YSQPETSFWCCVGSGMENHTKYNELIYAKKEDK---LYVNLFIPSEVNWEEKNA 440
Query: 519 VLNQKVDPIVSWDPYLRMT-LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
L QK + P +T L ++S+++ ++L LR P W + + +N + +
Sbjct: 441 TLTQKTNF-----PEEALTELIWNSRKKTK--ATLMLRYPQWVNAGELKVYVNDKLEKID 493
Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
PG+++S +W D++ ++LP+ L E + DD Y S++ +GP +LA T
Sbjct: 494 ATPGSYVSLERKWKNGDRIKMELPMHLSLEELPDDSG-YVSVK---YGPIVLAAVT 545
>gi|332685731|ref|YP_004455505.1| hypothetical protein MPTP_0197 [Melissococcus plutonius ATCC 35311]
gi|332369740|dbj|BAK20696.1| hypothetical protein MPTP_0197 [Melissococcus plutonius ATCC 35311]
Length = 883
Score = 236 bits (601), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 178/569 (31%), Positives = 264/569 (46%), Gaps = 78/569 (13%)
Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL-PTPGKAYGGWENPIS-ELRGHFVGH 168
Q + +AQ+ + YLL LDV ++ F K A + P Y GWE RGHF GH
Sbjct: 12 QDPYIHKAQENVIHYLLSLDVQKFLFEFYKVAGMKPLTESGYQGWERSDQVNFRGHFFGH 71
Query: 169 YLSASAQMWASTHNATIKEKM----STVVFSLSECQNKIG------TGYLSAFPTELFDS 218
+LSA A + + +K+K+ T + L Q GY+SAF D
Sbjct: 72 FLSALALSYQAEKQPILKKKIHQQIKTAITGLKAIQKNYAKQHPEHAGYISAFKEVALDE 131
Query: 219 FEALKPV--------WAPYYTIHKILAGLLDQYVLADNA------QALKMATWMVEYFYN 264
E KPV P+Y +HKILAGLL+ + +AL +A+W +Y Y
Sbjct: 132 VEG-KPVDPKEKENVLVPWYNLHKILAGLLEVNISLKEVDSQLSKEALFIASWFGDYIYK 190
Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
R+ + + L E GGMND LY L+ +T +H + A FD+ LA
Sbjct: 191 RMMNLTDKNQM------LTIEYGGMNDALYYLFELTQKKEHAIAATYFDEDNLFNQLAND 244
Query: 325 ADYLSHFHANTHIPIVIGSQMRYEV----------TGDPLYKLIGTF-----FMDIVNAS 369
+ L HANT IP +IG+ RY V + + L+ F F IV +
Sbjct: 245 ENVLPGKHANTTIPKLIGALKRYMVFQSEDLSAWLSNEEKEHLMSYFKAAENFWQIVVDN 304
Query: 370 HSYATGGTSAREFWWDPKRL----ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
H+Y TGG S E + P L G ETC T+NMLK++R L+ TK+ Y DY
Sbjct: 305 HTYCTGGNSQSEHFHGPNELFYDSEIRQGDCTCETCNTHNMLKLTRKLYECTKDPKYLDY 364
Query: 426 YERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSK 485
YE N +L+ Q ++ G+M+Y P+G G +K + ++ FWCC GTGIESFSK
Sbjct: 365 YETTYINAILASQ-NSKTGMMMYFQPMGAGYNKV-----YNRPYDEFWCCSGTGIESFSK 418
Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
L D+ YF+E L++ Y S++ K ++ + QK D + + + L + +
Sbjct: 419 LADTYYFKENNR---LFVNLYFSNTLKLKENNLKIIQKTD---RKNGNVTIDLKTLTDKN 472
Query: 546 VGQLSSLNLRMPVW----TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
+ Q L LR+P W T G + +L ++A ND++ +++
Sbjct: 473 IIQPLQLALRLPNWAKQVTIKKGKKLLNYKSHLGFAYLSGLVTA------NDQIILEMEQ 526
Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAG 630
L+ D P+ + A +GPY+LAG
Sbjct: 527 ELQLL----DTPDNTNYIAFKYGPYILAG 551
>gi|328956144|ref|YP_004373477.1| hypothetical protein Corgl_1563 [Coriobacterium glomerans PW2]
gi|328456468|gb|AEB07662.1| protein of unknown function DUF1680 [Coriobacterium glomerans PW2]
Length = 751
Score = 235 bits (600), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 184/545 (33%), Positives = 268/545 (49%), Gaps = 44/545 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
++ ++L V L + AQQ L +L +D D ++ +FR+ A + T G GW+ P
Sbjct: 182 MRPINLTCVRLAPGTPAAAAQQRRLSFLKQVDDDQMLINFRRAAHMDTKGAPEMIGWDTP 241
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK------IGTGYLSAF 211
S LRGH GHYLSA A WA+T + T+ K+S +V SL E Q I G+LSA+
Sbjct: 242 DSNLRGHTTGHYLSALALAWAATGDETVHSKLSYMVHSLGEVQAAFRGQPGIHEGFLSAY 301
Query: 212 PTELFDSFEALKP---VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQK 268
FD E P +WAPYYT+HKILAGLLD Y A N QAL++A + + YNR+ +
Sbjct: 302 DESQFDLLERYTPYPEIWAPYYTLHKILAGLLDSYRYAGNRQALEIAIGVGHWVYNRLSQ 361
Query: 269 VITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ-AD 326
+ + +++ W + E GGMN+ L L +IT + + A FD + F ALQ D
Sbjct: 362 LDPI-QLKKMWAMYIAGEFGGMNESLAMLGAITGEESFVKAARFFDNDKLI-FPALQKVD 419
Query: 327 YLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
L HAN HIP VIG+ Y VT + Y + FF V A H YA GGT E + P
Sbjct: 420 ALGTLHANQHIPQVIGALSLYGVTHEESYYQVAEFFWHSVVAHHIYAFGGTGDGEMFQQP 479
Query: 387 KRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
+A + + E+C +YNM+K++R L+ + Y E L N +LS G
Sbjct: 480 CEIAAKIDEFSAESCASYNMIKLTRDLYEYEPTADKMAYCENVLINHILSSTDHEGTGGS 539
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
Y + G K T NS CC+GTG+ES G SIY++ EG L + Y
Sbjct: 540 TYFMETQPGARKGFDTE------NS--CCHGTGLESQFMYGQSIYYQGEGQ---LIVALY 588
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVWTYSNGA 565
++S V +D + +R+ +G+L L LR P W S+
Sbjct: 589 LASHLKTDDTDVT----IDCDFNHPETVRIA--------IGRLEGKLVLRHPDW--SDRM 634
Query: 566 QASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
S+NG + +++ + + D++T++L LR DD + AI +GP
Sbjct: 635 TVSINGAAARIAEKDGYVTVEDSLAPGDEITVRLNPELRLIPTPDD----PNRVAIGYGP 690
Query: 626 YLLAG 630
++LA
Sbjct: 691 FVLAA 695
>gi|120435050|ref|YP_860736.1| hypothetical protein GFO_0692 [Gramella forsetii KT0803]
gi|117577200|emb|CAL65669.1| conserved hypothetical protein, membrane or secreted [Gramella
forsetii KT0803]
Length = 796
Score = 235 bits (600), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 184/587 (31%), Positives = 278/587 (47%), Gaps = 64/587 (10%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A +LEY+L LD D L+ F K A L T ++Y WEN + L GH GHYL+A + M+
Sbjct: 52 AMLVDLEYILKLDPDRLLAPFLKEAGLETKVESYPNWEN--TGLDGHIGGHYLTALSLMY 109
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE---------ALKPVW 226
A+T N + E+++ ++ L + Q + GY+ P EL+ +L W
Sbjct: 110 AATGNQEVLERLNYMLDELQKVQ-QANVGYIGGVPDSKELWQQISEGNINAGSFSLNDRW 168
Query: 227 APYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
P Y IHK AGL D Y +A +A + ++ WM+E V + S E+ L
Sbjct: 169 VPLYNIHKTYAGLRDAYQIAGIERAKTMLIDLSDWMLE--------VTSDLSEEQIQELL 220
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
E GG+N+ +Y IT + K+L LA+ F + L L D L+ HANT IP VIG
Sbjct: 221 ISEYGGLNETFADVYEITGEKKYLDLAYAFSQKELLKPLEDDQDVLTGMHANTQIPKVIG 280
Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS--ENEET 400
Q + + Y+ +FF D V S A GG S RE + PK T+ S + ET
Sbjct: 281 FQTIAALNDNREYRDAASFFWDNVVNERSVAIGGNSVREH-FHPKDDFSTMMSSVQGPET 339
Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
C TYNMLK+S LF Y DYYE+AL N +LS Q E G +Y P+ G +
Sbjct: 340 CNTYNMLKLSEKLFLTEANEKYVDYYEQALYNHILSSQH-PEKGGFVYFTPMRPGHYRVY 398
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
S SFWCC G+G+E+ K + IY E LY+ +I S +W+ + L
Sbjct: 399 SQPE-----TSFWCCVGSGLENHGKYNEFIYAHTENE---LYVNLFIPSILNWEEKGLKL 450
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPP 579
QK + + ++++ +E +L LR P W + G +N + + L P
Sbjct: 451 TQKTE--FPNEETSKISINLKEVEEF----TLMLRYPTW--AKGFNILVNQEKVELNNEP 502
Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW--- 636
G+++S W+ D++ +Q+P+++ + + D + A+ +GP +L T E+
Sbjct: 503 GSYVSIKREWTDGDEIELQIPMNISSVGLPDGSNNF----ALKYGPLVLGAKTGNEYMEG 558
Query: 637 ---------DIKTGTARSLSALISPIPPSFNAQLVTF-TQESGNSTF 673
I G LS + + NA LV + ++E G F
Sbjct: 559 LFADASRGGHIAAGKKIPLSETPIFLADTKNADLVNYISKEEGELKF 605
>gi|380694971|ref|ZP_09859830.1| hypothetical protein BfaeM_13572 [Bacteroides faecis MAJ27]
Length = 802
Score = 235 bits (599), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 179/568 (31%), Positives = 266/568 (46%), Gaps = 65/568 (11%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
SL DV L SS +AQQT+L Y+L LD D L F + A L +Y WEN + L
Sbjct: 29 SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
GH GHYLSA + M+A+T + I +++ ++ L Q +GTG++ P +L+ +
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQ 267
A L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ + S + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDSQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPLIKDEDR 257
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSAR 380
L+ HANT IP VIG + EV+ D + FF + V S GG S R
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKDDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 381 EFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI--------AYADYYERALT 431
E + L + ETC TYNML++++ L++ + ++ Y DYYERAL
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY 491
N +LS Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIY 431
Query: 492 FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
+ LY+ +I S +WK V L Q+ + D + + + +SK+++ +
Sbjct: 432 AHRQDT---LYVNLFIPSQLNWKEQGVTLTQET--LFPDDGKVTLRIDKASKKKL----T 482
Query: 552 LNLRMPVWTYSNGAQA-SLNGQNLPL---PPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
L +R+P W S+ A ++NGQ P +L +W D +T LP+ + E
Sbjct: 483 LMIRIPGWAGSSKDYAITINGQKKKYAIRPGVSTYLPIHRKWKKGDVITFNLPMEVSLEQ 542
Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGE 635
I D + Y A L+GP +LA T E
Sbjct: 543 IPDKKDYY----AFLYGPIVLAASTGTE 566
>gi|331702303|ref|YP_004399262.1| hypothetical protein Lbuc_1953 [Lactobacillus buchneri NRRL
B-30929]
gi|329129646|gb|AEB74199.1| protein of unknown function DUF1680 [Lactobacillus buchneri NRRL
B-30929]
Length = 803
Score = 235 bits (599), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 196/637 (30%), Positives = 287/637 (45%), Gaps = 96/637 (15%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPIS-ELRGHFVGHYLSASAQ 175
AQQ ++YLL LD + +F + A + + G Y GWE RGHF GHYLSA +Q
Sbjct: 20 AQQMTVKYLLALDPKRFLVTFDEVAGIDSGGVTGYQGWERTDGLNFRGHFFGHYLSALSQ 79
Query: 176 MWASTHNATIKE----KMSTVVFSLSECQNKIG------TGYLSAFPTELFDSFEALK-- 223
+T I++ K+ V L Q GY+SAF D E +
Sbjct: 80 AILATEENDIRQQLLDKLRLGVNGLQSAQAAYAKSHPDSAGYVSAFREVALDEVEGREVP 139
Query: 224 -----PVWAPYYTIHKILAGLLDQYVLAD------NAQALKMATWMVEYFYNRVQKVITM 272
V P+Y +HK+LAGLL V + +ALK+A Y + R+ ++
Sbjct: 140 KDEKENVLVPWYNLHKVLAGLLAVKVNLQGIDPLLSEKALKIAHQFGIYVFKRLNQLADP 199
Query: 273 YSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFH 332
+ L E GGMND LY L+ +T D + L A FD+ LA D L+ H
Sbjct: 200 TQM------LKIEYGGMNDALYELFDLTDDKRMLTAATYFDETALFKQLAEGDDVLAGKH 253
Query: 333 ANTHIPIVIGSQMRYEVTGD----------------PLYKLIGTFFMDIVNASHSYATGG 376
ANT IP +IG+ RYE D +Y F IV H+Y TGG
Sbjct: 254 ANTTIPKLIGALHRYESLHDVKRADQYLSPEEKGSLNMYLKAAVNFWQIVVDDHTYVTGG 313
Query: 377 TSAREFWWDPKRL-ADTL---GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
S E + +P +L D + G+ ETC TYNMLK+SR LFR T + Y DYYE+ TN
Sbjct: 314 NSQSEHFHEPGQLFHDAVLEDGATTCETCNTYNMLKLSRELFRVTGDKKYLDYYEQTYTN 373
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L Q G+M Y P+ G +K + F+ FWCC GTGIE+F+KLGDS F
Sbjct: 374 AILGSQ-NPNTGMMTYFQPMAAGYTKV-----YNRPFDEFWCCTGTGIENFTKLGDSYDF 427
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
LY+ Y S+ S ++ + ++VD + +T+ Q+ +L
Sbjct: 428 MSGDQ---LYLSLYFSNVLRLDSNNLQMTEQVDRKTG---KVHLTVAKLRSQDSAGAINL 481
Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK-----LTIQLPLSLRTEA 607
LR P W + A+ +++G + + +F W ++ + +++P+SL+
Sbjct: 482 KLRNPAWLVQS-AKLAVDGISQQVDQNADF------WEIDNAGPGTTVDLEIPMSLKMVQ 534
Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEW---DIKTGTARSLSALISPIP---------- 654
+D+ P Y + + +GPY+LAG D G +S +P
Sbjct: 535 TKDN-PHYVAFK---YGPYVLAGQLGKHHINDDRPNGVLVRISTHDQAVPSTLTTGMDWH 590
Query: 655 ---PSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFP 688
S N+Q V T E+ N+ F + N S T+ P
Sbjct: 591 DWQQSLNSQAVVDT-ETTNTLFELKLPNTSETITFVP 626
>gi|326798346|ref|YP_004316165.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326549110|gb|ADZ77495.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
Length = 1022
Score = 234 bits (597), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 191/645 (29%), Positives = 293/645 (45%), Gaps = 104/645 (16%)
Query: 93 DLPGNFLKEVSLHDVWLD-----QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKT--ASLP 145
D+P + L +L V L+ + + + L D +S ++ FR P
Sbjct: 368 DIPSSKLAPFNLDQVSLEADAHGHKTKFIENRDKFINTLAATDPNSFLYMFRHAFGQKQP 427
Query: 146 TPGKAYGGWENPISELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVV---FSLS 197
+ G W++ ++LRGH GHYL+A AQ +A T A EKM +V + LS
Sbjct: 428 EGARPLGVWDSQETKLRGHATGHYLTAIAQAYAGTGYDKALQAKFAEKMEYMVNTLYELS 487
Query: 198 ECQNKI---------------------------------------GTGYLSAFPTELFDS 218
+ K G G++SA+P + F
Sbjct: 488 QLSGKPKEAGGIHVSDPTAVPYGPGKTEYDSDFSDEGIRTDYWNWGEGFISAYPPDQFIM 547
Query: 219 FE-------ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
E VWAPYYT+HKILAGL+D Y ++ N +AL++AT M ++ Y R+ K+ T
Sbjct: 548 LERGAKYGGQKNQVWAPYYTLHKILAGLMDVYEVSGNKKALEIATGMGDWVYARLSKLPT 607
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQ 324
++ + E GGMN+V+ RLY IT+ P +L A LFD F G LA
Sbjct: 608 ETLIKMWNTYIAGEFGGMNEVMARLYRITNKPNYLKTAQLFDNIKMFYGDASHSHGLAKN 667
Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS------ 378
D HAN HIP ++GS Y V+ +P+Y I F V + Y+ GG +
Sbjct: 668 VDTFRGLHANQHIPQIVGSIEMYRVSNNPVYYSIADNFWYKVVNDYMYSIGGVAGARNPA 727
Query: 379 -AREFWWDPKRLAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
A F P L + + G +N ETC TYNMLK++ LF + + DYYER L N +
Sbjct: 728 NAECFISQPATLYENGFSAGGQN-ETCATYNMLKLTSDLFLFDQRPELMDYYERGLYNHI 786
Query: 435 LSIQRGTEPGVMIYMLPLGRG-VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
L+ P Y +PL G + + + H G F CC GT IES +KL +SIYF+
Sbjct: 787 LASVAEDSP-ANTYHVPLRPGSIKQFGNPHMTG-----FTCCNGTAIESSTKLQNSIYFK 840
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
+ N LY+ +I S+ +W + + Q D + + R+T+ K + ++
Sbjct: 841 SKDN-DALYVNLFIPSTLEWAERKITVQQTTD--FPNEDHTRLTIKGGGKFD------MH 891
Query: 554 LRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
+R+P W + G +NG++ L PG++L + W D + +Q+P + + D +
Sbjct: 892 VRVPGWA-TKGFFVRVNGKDQKLEAKPGSYLKISRNWKDGDVVDLQMPFQFHLDPVMDQQ 950
Query: 613 PEYASIQAILFGPYLLAGH---TSGEWDIKTGTARSLSALISPIP 654
+I ++ +GP LLA +W + A +S I P
Sbjct: 951 ----NIASLFYGPILLAAQEPEARKDWRTVSLDAEDISKSIKGDP 991
>gi|423223047|ref|ZP_17209516.1| hypothetical protein HMPREF1062_01702 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640316|gb|EIY34118.1| hypothetical protein HMPREF1062_01702 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 790
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 183/606 (30%), Positives = 278/606 (45%), Gaps = 65/606 (10%)
Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
L A+ N+E LL D D L+ +RK A L K Y W+ L GH GHYL+A A
Sbjct: 40 LKHARDLNIETLLKYDCDRLMAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMA 95
Query: 175 QMWASTHNATIKEKMSTVVFSLSECQN-------KIGTGYLSAFPTELF-------DSFE 220
+ A+T N +++M ++ ++EC + G GY+ P F
Sbjct: 96 -INAATGNEECRKRMEYIISEIAECAEANCKNHPQWGVGYMGGMPNSQNIWNGFKDGDFR 154
Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVE 276
WAP+Y +HK+ AGL D ++ N QA L+ W + + + S E
Sbjct: 155 VYSGSWAPFYNLHKMYAGLRDAWLYCGNEQAKSLFLQFCNWAIH--------ITSGLSDE 206
Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
+ L E GGMN+VL Y+ITH+ K+L A F ++ + D L + HANT
Sbjct: 207 QMERMLGNEHGGMNEVLADAYAITHEQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQ 266
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS- 395
+P VIG + E++G+ Y + +FF DIV S A GG S RE + D +
Sbjct: 267 VPKVIGFERISELSGNEDYHVASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDI 326
Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
+ E+C T NMLK++ L R E YADYYE A N +LS Q E G +Y P
Sbjct: 327 DGPESCNTNNMLKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTP---- 381
Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
++ R + + WCC GTG+E+ K G IY G+ L++ Y +S DWK
Sbjct: 382 -ARPRHYRNYSAPNEAMWCCVGTGMENHGKYGQFIY-THAGDA--LFVNLYAASQLDWKE 437
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
+ L Q+ + PY + T + + G +L +R P W + + S+NG+
Sbjct: 438 RGITLRQE-----TAFPYSENS-TITIAEGKGTF-NLMVRYPGWVHPGEFKVSVNGKPAD 490
Query: 576 -LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
+ P +++S +W D + I P+ + ++ P+Y A++ GP LL
Sbjct: 491 IITGPSSYVSINRKWKKGDVVNINFPMHSSLRYLPNE-PQYV---ALMHGPILLG----- 541
Query: 635 EWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDA 694
+KTGT S+++LI+ F Q + +++N SI + PV G
Sbjct: 542 ---MKTGT-ESMASLIAD-DSRFGQYAGGPKQPIDKAPILINNDIASIPSQLTPVPGK-- 594
Query: 695 ALHATF 700
LH T
Sbjct: 595 PLHFTL 600
>gi|195643412|gb|ACG41174.1| hypothetical protein [Zea mays]
gi|413926261|gb|AFW66193.1| hypothetical protein ZEAMMB73_983510 [Zea mays]
Length = 262
Score = 233 bits (594), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 132/248 (53%), Positives = 161/248 (64%), Gaps = 17/248 (6%)
Query: 4 GFVLFFFFCFGL--ALGKQCTNQSP-YDSHAFRYELT---STNKTWKEEVLSHF------ 51
G V+ G A GK CTN P SH R T + ++ H
Sbjct: 14 GIVVVMLLAAGFRGAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQ 73
Query: 52 HLTPTDDSAWSSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPG----NFLKEVSLHDV 107
HLTPTD+S W SL+P + L +++ W +LYR+++ GG PG FL E SLHDV
Sbjct: 74 HLTPTDESTWMSLMPRRAL-RREEAFDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDV 132
Query: 108 WLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG 167
L+ S+ WRAQQTNLEYLL+LDVD LVWSFRK A L PG YGGWE P +LRGHFVG
Sbjct: 133 RLEPGSMYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVG 192
Query: 168 HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWA 227
HYLSA+A+MWASTHN T+ KMS+VV +L +CQ K+GTGYLSAFP++ FD EA+K VWA
Sbjct: 193 HYLSATAKMWASTHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWA 252
Query: 228 PYYTIHKI 235
PYYTIHK+
Sbjct: 253 PYYTIHKV 260
>gi|440700043|ref|ZP_20882328.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
gi|440277439|gb|ELP65547.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
Length = 934
Score = 233 bits (594), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 158/462 (34%), Positives = 233/462 (50%), Gaps = 31/462 (6%)
Query: 206 GYLSAFPTELFDSFEALK-----PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F + E++ VWAPYYT HKIL GLLD Y+ D+++AL +A+ M +
Sbjct: 383 GFLAAYPETQFIALESMTSGDYTKVWAPYYTAHKILKGLLDAYLATDDSRALDLASGMCD 442
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ Y+R+ K+ +++R W + E GG+ + + LY+IT+ +HL LA LFD +
Sbjct: 443 WMYSRLSKLPDA-TLQRMWGIFSSGEFGGIVETIVDLYTITNKAEHLALAKLFDLDTLID 501
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L+ HAN HIPI G Y+ TG+ Y F +V Y GGTS
Sbjct: 502 ACAANTDTLNGLHANQHIPIFTGYVRLYDATGEARYLTAAKNFWGMVIPQRMYGIGGTST 561
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW +A T+ N ETC YN+LK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 562 GEFWKARGVIAGTVSDTNAETCCAYNLLKLSRTLFFHEQDPKYMDYYERALYNQVLGSKQ 621
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E ++ Y + L G + T GT CC GTG+ES +K DS+YF+
Sbjct: 622 DKADAEKPLVTYFIGLNPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYFKSA- 674
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
+ LY+ Y S+ W V + Q + + TLT +L LR+
Sbjct: 675 DGGSLYVNLYSPSTLTWAEKGVTVTQTTE----YPKEQGTTLTIGGGSAA---FALRLRV 727
Query: 557 PVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
P+W + G Q ++NGQ + P G++ + + W D + I +P LR E DD
Sbjct: 728 PLWA-TAGFQVTVNGQAVSGTPVAGSYFAVSRTWQSGDVVRISVPFRLRVEKALDD---- 782
Query: 616 ASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSF 657
S+Q + +GP L ++ + G R+ SAL + PS
Sbjct: 783 PSLQTLFYGPVNLVARSASTSYLSVGLYRN-SALSGDLLPSL 823
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 61/112 (54%), Gaps = 6/112 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
L+ L DV L Q V +Q L++ DV+ L+ FR A L T G A GGWE
Sbjct: 44 LRPFELKDVALGQG-VFASKRQLMLDHGRGYDVNRLLQVFRANAGLSTGGAVAPGGWEGL 102
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT 205
E LRGH+ GH+LS +Q +AST + ++++T+V +L++ + + T
Sbjct: 103 DGEANGNLRGHYTGHFLSMLSQAYASTRDQAYADRIATMVGALTDVRAALRT 154
>gi|189467200|ref|ZP_03015985.1| hypothetical protein BACINT_03584 [Bacteroides intestinalis DSM
17393]
gi|189435464|gb|EDV04449.1| beta-lactamase [Bacteroides intestinalis DSM 17393]
Length = 720
Score = 233 bits (593), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 143/405 (35%), Positives = 225/405 (55%), Gaps = 26/405 (6%)
Query: 232 IHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMND 291
+HK+ +GL+ QY+ ADN QAL++ T M + YN++ K + + +R + E GG+N+
Sbjct: 1 MHKLFSGLIYQYLYADNKQALEVVTRMGNWTYNKL-KPLDESTRKR---MIRNEFGGVNE 56
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTG 351
Y LY+IT D ++ LA F + L Q D L H NT IP V+ YE+T
Sbjct: 57 SFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDLGTKHTNTFIPKVLTEARNYELTQ 116
Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSR 411
D + + FF + H++A G +S +E ++DP++L+ L ETC TYNMLK+SR
Sbjct: 117 DNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQLSKHLTGYTGETCCTYNMLKLSR 176
Query: 412 HLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNS 471
HLF WT + ADYYERAL N +L Q+ E G++ Y LPL G K + T+ NS
Sbjct: 177 HLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSYFLPLLSGSHKV-----YSTRENS 230
Query: 472 FWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWD 531
FWCC G+G E+ +K G++IY+ N G+Y+ +I S +WK+ + L Q+ ++
Sbjct: 231 FWCCVGSGFENHAKYGEAIYYH---NDQGIYVNLFIPSEVNWKAKGITLRQE----TAFP 283
Query: 532 PYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWS 590
LT + + V +++ LR P W S + ++NG+ + + PG+++ T +W
Sbjct: 284 AEENTALTIQTDKPV--TTTIYLRYPSW--SKNVKVNVNGKKVSVKQKPGSYIPVTRQWK 339
Query: 591 YNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
D++ P+SL+ E D+ P+ A+L+GP +LAG + E
Sbjct: 340 DGDRIEANYPMSLQLETTPDN-PQKG---ALLYGPLVLAGESGTE 380
>gi|383123086|ref|ZP_09943771.1| hypothetical protein BSIG_0174 [Bacteroides sp. 1_1_6]
gi|251841821|gb|EES69901.1| hypothetical protein BSIG_0174 [Bacteroides sp. 1_1_6]
Length = 802
Score = 232 bits (591), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 178/568 (31%), Positives = 268/568 (47%), Gaps = 65/568 (11%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
SL DV L SS +AQQT+L Y+L LD D L F + A L +Y WEN + L
Sbjct: 29 SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
GH GHYLSA + M+A+T + I +++ ++ L Q +GTG++ P +L+ +
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQ 267
A L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ + S + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFFHKVILDPLIKNEDR 257
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSAR 380
L+ HANT IP VIG + EV+ D + FF + V S GG S R
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKDDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 381 EFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI--------AYADYYERALT 431
E + L + ETC TYNML++++ L++ + ++ Y DYYERAL
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY 491
N +LS Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIY 431
Query: 492 FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
++ LY+ +I S +WK V L Q+ + D + + + ++K+ + +
Sbjct: 432 AHQQDT---LYVNLFIPSQLNWKEQGVTLTQET--LFPDDEKVTLRIDKAAKKNL----T 482
Query: 552 LNLRMPVWT-YSNGAQASLNG-QNLPLPPPG--NFLSATERWSYNDKLTIQLPLSLRTEA 607
L +R+P W S G + ++NG ++L G +L +W D +T LP+ + E
Sbjct: 483 LMIRIPEWAGNSKGYEITINGKKHLSDIQTGASTYLPIRRKWKKGDMITFHLPMKVSLEQ 542
Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGE 635
I D + Y A L+GP +LA T E
Sbjct: 543 IPDKKDYY----AFLYGPIVLATSTGTE 566
>gi|189464752|ref|ZP_03013537.1| hypothetical protein BACINT_01096 [Bacteroides intestinalis DSM
17393]
gi|189437026|gb|EDV06011.1| hypothetical protein BACINT_01096 [Bacteroides intestinalis DSM
17393]
Length = 790
Score = 232 bits (591), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 182/606 (30%), Positives = 278/606 (45%), Gaps = 65/606 (10%)
Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
L A+ N+E LL D D L+ +RK A L K Y W+ L GH GHYL+A A
Sbjct: 40 LKHARDLNIETLLKYDCDRLIAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMA 95
Query: 175 QMWASTHNATIKEKMSTVVFSLSECQN-------KIGTGYLSAFPTELF-------DSFE 220
+ A+T N +++M ++ ++EC K G GY+ P F
Sbjct: 96 -INAATGNEECRKRMEYIINEIAECAEANYKNHPKWGVGYMGGMPNSQNIWSGFKNGDFR 154
Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVE 276
WAP+Y +HK+ AGL D ++ N QA L+ W ++ + + S E
Sbjct: 155 VYSGSWAPFYNLHKMYAGLRDAWLYCGNEQAKTLFLQFCNWAID--------ITSGLSDE 206
Query: 277 RHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTH 336
+ L E GGMN+VL Y+IT + K+L A F ++ + D L + HANT
Sbjct: 207 QMERMLGNEHGGMNEVLADAYAITREQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQ 266
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS- 395
+P VIG + E++G+ Y + +FF DIV S A GG S RE + D +
Sbjct: 267 VPKVIGFERISELSGNEDYHMASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDI 326
Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
+ E+C T N+LK++ L R E YADYYE A N +LS Q E G +Y P
Sbjct: 327 DGPESCNTNNILKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTP---- 381
Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
++ R + + WCC GTG+E+ K G IY G+ L++ Y +S DWK
Sbjct: 382 -ARPRHYRNYSAPNEAMWCCVGTGMENHGKYGQFIY-THVGDA--LFVNLYAASQLDWKE 437
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
+ L Q+ + PY + T + + G +L +R P W + + S+NG+ +
Sbjct: 438 RGITLRQE-----TAFPYSENS-TITIAEGKGTF-NLMVRYPGWVHPGEFKVSVNGKPVD 490
Query: 576 -LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
+ P +++S +W D + I P+ + ++ P+Y A + GP LL
Sbjct: 491 IITGPSSYVSINRKWKKGDVVNINFPMHSSLRYLPNE-PQYI---AFMHGPILLG----- 541
Query: 635 EWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEFPVSGTDA 694
+KTGT S+++LI+ F Q + +++N SI + PV G
Sbjct: 542 ---MKTGT-ESMASLIAD-DSRFGQYAGGPKQPIDKAPILINNDIASIPSQLTPVPGK-- 594
Query: 695 ALHATF 700
LH T
Sbjct: 595 PLHFTL 600
>gi|298384655|ref|ZP_06994215.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
gi|298262934|gb|EFI05798.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
Length = 802
Score = 231 bits (590), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 177/568 (31%), Positives = 269/568 (47%), Gaps = 65/568 (11%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
SL DV L SS +AQQT+L Y+L LD D L F + A L +Y WEN + L
Sbjct: 29 SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
GH GHYLSA + M+A+T + I +++ ++ L Q +GTG++ P +L+ +
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQ 267
A L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ + S + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKVILDRLIKNEDR 257
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSAR 380
L+ HANT IP VIG + EV+ + + FF + V S GG S R
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKNDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 381 EFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI--------AYADYYERALT 431
E + L + ETC TYNML++++ L++ + ++ Y DYYERAL
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY 491
N +LS Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIY 431
Query: 492 FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
++ LY+ +I S +WK V L Q+ + D + + + ++K+++ +
Sbjct: 432 AHQQDT---LYVNLFIPSQLNWKEQGVTLTQET--LFPDDEKVTLRIDKAAKKKL----T 482
Query: 552 LNLRMPVWT-YSNGAQASLNG-QNLPLPPPG--NFLSATERWSYNDKLTIQLPLSLRTEA 607
L +R+P W S G + ++NG ++L G +L +W D +T LP+ + E
Sbjct: 483 LMIRIPEWAGNSKGYEITINGKKHLSDIQAGTSTYLPLRRKWKKGDVITFHLPMKVSLEQ 542
Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGE 635
I D + Y A L+GP +LA T E
Sbjct: 543 IPDKKDYY----AFLYGPIVLATSTGTE 566
>gi|404451488|ref|ZP_11016452.1| hypothetical protein A33Q_19258 [Indibacter alkaliphilus LW1]
gi|403762834|gb|EJZ23856.1| hypothetical protein A33Q_19258 [Indibacter alkaliphilus LW1]
Length = 1019
Score = 231 bits (590), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 194/626 (30%), Positives = 294/626 (46%), Gaps = 108/626 (17%)
Query: 95 PGNFLKEVSLHDVWL--DQSSVLWRAQQTNLEYLLML---DVDSLVWSFRKTASLPTPGK 149
P L+ LH + L DQ+ + + ++LL L D +S ++ FR P P
Sbjct: 369 PQQKLELFKLHQINLEEDQTGQKTKFIENRDKFLLTLAETDPNSFLYMFRHAFDQPQPEN 428
Query: 150 AY--GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEK--------MSTVVFSLSEC 199
A G W++ ++LRGH GHYL+A AQ +AST + ++ M V++ LS+
Sbjct: 429 AVPLGVWDSQETKLRGHATGHYLTAIAQAYASTGYDEVLQQNFLDKMDYMVNVLYDLSKL 488
Query: 200 Q-NKI------------------------------------GTGYLSAFPTELFDSFEA- 221
NK+ G GY+SA+P + F E
Sbjct: 489 SGNKVNGKGNEDPVLVPKGPGKSDFDSDLSDEGIRSDYWNWGKGYISAYPPDQFIMLEKG 548
Query: 222 ------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSV 275
+WAPYYT+HKILAGL+D Y ++ N +AL++A M E+ Y R+ + ++
Sbjct: 549 ATYGGQKNQIWAPYYTLHKILAGLIDIYKVSGNEKALEIAKGMGEWVYTRLD-ALPQETL 607
Query: 276 ERHWYS-LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADY 327
+ W + + E GGMN+ + LY IT DP+ L A LFD F G LA D
Sbjct: 608 IKMWNTYIAGEFGGMNETMATLYEITQDPRFLKGAQLFDNIQMFFGDAEYSHGLAKNVDT 667
Query: 328 LSHFHANTHIPIVIGSQMRYEVTG-DPLYKLIGTFFMDIVNASHSYATGGTS-------A 379
HAN HIP V+GS Y V+ D +++ ++ VN + Y+ GG + A
Sbjct: 668 FRGLHANQHIPQVVGSLEMYRVSAKDEYFRVADNYWFKAVN-DYMYSIGGVAGARNPANA 726
Query: 380 REFWWDPKRLAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
F +P L + + G +N ETC TYNMLK++ +LF + + DY+ER L N +L+
Sbjct: 727 ECFIAEPATLYENGFSSGGQN-ETCATYNMLKLTGNLFLFEQRGELMDYFERGLYNHILA 785
Query: 437 IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE--E 494
P Y +PL G K H K F CC GT IES +KL SIY++ E
Sbjct: 786 SVAEDSPA-NTYHVPLRPGSIK----HFGNAKMTGFTCCNGTSIESNTKLQQSIYYKSIE 840
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
E V Y+ +I S+ DW+ ++ + Q S+ + L + E L+L
Sbjct: 841 ENAV---YVNLFIPSTLDWEERNIKIKQA----TSFPKEDKTQLLVEGEGEF----VLHL 889
Query: 555 RMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
R+P W G S+NG+ + L PG++++ + W DK+ +++P + + D+P
Sbjct: 890 RVPSWA-RKGYHVSINGKEIQLDVKPGSYIAISRFWEDGDKVDLRMPFDFYLDPVM-DQP 947
Query: 614 EYASIQAILFGPYLLAGHTSG---EW 636
AS + +GP LLA S EW
Sbjct: 948 NIAS---LFYGPILLAAQESDARKEW 970
>gi|317476510|ref|ZP_07935758.1| hypothetical protein HMPREF1016_02741 [Bacteroides eggerthii
1_2_48FAA]
gi|316907322|gb|EFV29028.1| hypothetical protein HMPREF1016_02741 [Bacteroides eggerthii
1_2_48FAA]
Length = 793
Score = 231 bits (589), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 169/558 (30%), Positives = 258/558 (46%), Gaps = 62/558 (11%)
Query: 114 VLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSAS 173
L A+ N+ LL + D L+ +RK A L + Y W+ L GH GHYL+A
Sbjct: 38 TLKSARDLNINTLLKYNCDRLLAPYRKEAGLTPKAECYPNWDG----LDGHVGGHYLTAM 93
Query: 174 AQMWASTHNATIKEKMSTVVFSLSECQN-------KIGTGYLSAFPTEL-------FDSF 219
A + A+T N +++M ++ ++EC + G GY+ P F
Sbjct: 94 A-INAATGNEECRKRMEYIIKEIAECAEANRKNHPEWGVGYMGGMPNSQNIWSNFKKGDF 152
Query: 220 EALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSV 275
WAP+Y +HK+ AGL D ++ N QA L+ W ++ N K +
Sbjct: 153 RVYSGSWAPFYNLHKMYAGLRDAWLYCGNEQAKDLFLQFCDWAIDVTSNLSDKQMEQM-- 210
Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
L E GGMN+VL Y+ITH+ K+L A F L + D L + HANT
Sbjct: 211 ------LGNEHGGMNEVLADAYAITHEQKYLDCAKRFSHKQLFTPLLQRQDCLDNLHANT 264
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+P IG + E++G+ Y + +FF DIV S A GG S RE + D +
Sbjct: 265 QVPKAIGFERISELSGNEDYHMASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFIND 324
Query: 396 -ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ E+C T NMLK++ +L R E YADYYE A N +LS Q G +Y P
Sbjct: 325 IDGPESCNTNNMLKLTENLHRRNPEARYADYYELATFNHILSTQHPKHGGY-VYFTP--- 380
Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
++ R + + WCC GTG+E+ K G IY G+ L++ Y +S DWK
Sbjct: 381 --ARPRHYRNYSAPNEAMWCCVGTGMENHGKYGQFIY-THVGDA--LFVNLYAASQLDWK 435
Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
+ L Q+ S + L +T E +L +R P W + + S+NGQ++
Sbjct: 436 KRGITLRQETTFPYSENSTLTIT-------EGKGAFNLMVRYPEWVHPGEFKVSVNGQSV 488
Query: 575 P-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+ P +++S +W D + I P+ + ++ P+Y A ++GP LL
Sbjct: 489 DVITGPSSYVSINRKWKKGDVVNISFPMHASLRYLPNE-PQYV---AFMYGPILLG---- 540
Query: 634 GEWDIKTGTARSLSALIS 651
+KTGT S+++LI+
Sbjct: 541 ----MKTGT-ESMTSLIA 553
>gi|262405235|ref|ZP_06081785.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|345508054|ref|ZP_08787694.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
gi|229444700|gb|EEO50491.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
gi|262356110|gb|EEZ05200.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
Length = 801
Score = 231 bits (589), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 191/692 (27%), Positives = 309/692 (44%), Gaps = 74/692 (10%)
Query: 98 FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
+ E + DV L V A++ N+E LL DVD L+ +RK A L K Y W+
Sbjct: 27 YKNEFPIADVKL-LDGVFKHARELNIEVLLKYDVDRLLAPYRKEAGLTERKKTYPNWDG- 84
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSEC--QNKIGT-----GYLSA 210
L GH GHYLSA + +A+T N +M ++ L C N I GY+
Sbjct: 85 ---LDGHVAGHYLSAMSMNYAATGNKECGRRMEYMISELQLCLEANAINNTEWAIGYIGG 141
Query: 211 FPT--ELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMV 259
FP L+ +F+ WAP+Y +HK+ AGL D ++ +N QA LK W +
Sbjct: 142 FPNSKNLWSTFKKGDLRIYNSAWAPFYNLHKMYAGLRDAWLYCNNKQAKTLFLKFCDWAI 201
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ + E+ L E GGMN++L Y IT + K+L+ A + + L
Sbjct: 202 S--------ITDDLNEEQMQTVLKMEYGGMNEILADAYQITGNKKYLVAAKRYSQNILLD 253
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
L+ D L + HANT IP IG E++GD Y F + + + S A GG S
Sbjct: 254 PLSQGIDNLDNKHANTQIPKFIGFARIAELSGDTKYTNASRFSWETITGNRSLAFGGNSR 313
Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
RE + +D + + E+C +YNMLK++ LFR YADYYER + N +LS Q
Sbjct: 314 REHFPSVTSCSDYINDVDGPESCNSYNMLKLTEDLFRMQPSAHYADYYERTVFNHILSTQ 373
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
E G +Y ++ R + + WCC GTG+E+ SK IY + +
Sbjct: 374 H-PEHGGYVYFTS-----ARPRHYRVYSAPNEAMWCCVGTGMENHSKYNQFIYTHSDDS- 426
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
L++ +I+S +WK+ + L Q+ + ++ ++T+T +S L +R P
Sbjct: 427 --LFVNLFIASELNWKNKKISLRQETN--FPYEERTKLTVTKASSP-----FKLMIRYPG 477
Query: 559 WTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W + S+NG+++ P +++ +W+ D + ++LP+ E + P +
Sbjct: 478 WVDKGALKVSVNGKSMNYSALPSSYICIDRKWNKGDVVEVELPMRSTIEHL----PNVPN 533
Query: 618 IQAILFGPYLLAGHTSGEWDIK---TGTAR-------------SLSALISPIPPSFNAQL 661
A + GP LL T E D++ G R LI + ++L
Sbjct: 534 YIAFMHGPILLGAKTGTE-DLRGLIAGDGRWGQYPSGKLLPVDQAPILIVDDMENITSKL 592
Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNF-SSLNNVIG 720
V E + + +N SI ++ P + A + + L L + + SL+ +
Sbjct: 593 VPIKNEPLHFKANIKAAN-SIDIKLEPFANIHDARYMMYWLTLTNKGYQTYIDSLSTIEK 651
Query: 721 KSVMLEPF--DFPGMLVQQGKEDELVVSESPK 750
+ ++LE DF QQ + D ++ E +
Sbjct: 652 EKIILEKLTVDFVAPGEQQPETDHKILQEKSR 683
>gi|346226219|ref|ZP_08847361.1| Acetyl-CoA carboxylase, biotin carboxylase [Anaerophaga
thermohalophila DSM 12881]
Length = 795
Score = 231 bits (589), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 179/608 (29%), Positives = 282/608 (46%), Gaps = 53/608 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A+ N +Y++ D D L+ F A L YG WE+ S L GHF GHYL++ + M
Sbjct: 49 AEALNEQYVMAHDPDRLLAPFLIDAGLEPKAPGYGNWES--SGLNGHFGGHYLTSLSLMI 106
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE---------ALKPVW 226
AST N +E+++ ++ L+ CQ G GY+ P +++ +L W
Sbjct: 107 ASTGNEEARERLNYMIDELARCQEANGNGYVGGVPGGQDMWAEIAKGNIDAGNFSLNGKW 166
Query: 227 APYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
P Y IHK+ AGL D ++ A N +A +K+ W ++ + S ++ L
Sbjct: 167 VPLYNIHKLYAGLRDAWLYAGNEKAREILIKLTDWCID--------LTAALSDDQIQEML 218
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
E GG+N+V +Y IT D K+L LA F L L D L+ HANT IP VIG
Sbjct: 219 VSEHGGLNEVFADVYDITGDEKYLELARRFSHREILEPLLQHEDRLTGLHANTQIPKVIG 278
Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETC 401
E+T D + FF + V + + GG S E + + + S + ETC
Sbjct: 279 YMRIAELTHDSAWIDASDFFWNTVVNNRTITIGGNSTHEHFHPVDDFSSMIESRQGPETC 338
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
TYNMLK+S+HLF + ++ Y DYYE+AL N +LS Q G ++Y P+ + R
Sbjct: 339 NTYNMLKLSKHLFLYKNDLKYIDYYEQALYNHILSSQHPGHGG-LVYFTPM-----RPRH 392
Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLN 521
+ +FWCC G+GIE+ K G+ IY ++ +V ++ +I S +WK + L
Sbjct: 393 YRVYSNPEETFWCCVGSGIENHEKYGELIYAHDDEDV---FVNLFIPSELNWKEKGLKLV 449
Query: 522 QKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PG 580
QK + LR+ L S + VG +R P W + ++NG ++ G
Sbjct: 450 QKNNFPDIEKSTLRVELDESDEFIVG------IRCPAWANPGEMEVTVNGNSVNGEAVSG 503
Query: 581 NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT-SGEWD-I 638
+ + +W D + + LP+ + + D P Y S ++ GP++L T S + D +
Sbjct: 504 QYFLVSRKWDDGDVIEVHLPMHTFGKYLPDKSP-YLS---LMHGPFVLGAATDSTDLDGL 559
Query: 639 KTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEF----PVSGTDA 694
+R P+ P A ++ E+ V+ +Q +T + P S D
Sbjct: 560 IADDSRMGHIAHGPLYPLDEAPMLLIDGENWEKK-VIPVDDQPMTFKALGLIVPDSEDDL 618
Query: 695 ALHATFRL 702
L FR+
Sbjct: 619 VLEPFFRI 626
>gi|290958971|ref|YP_003490153.1| glycosylase [Streptomyces scabiei 87.22]
gi|260648497|emb|CBG71608.1| putative secreted glycosylase [Streptomyces scabiei 87.22]
Length = 936
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 159/465 (34%), Positives = 232/465 (49%), Gaps = 32/465 (6%)
Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F E++ VWAPYYT HKIL GLLD Y+ D+ +AL +A+ + +
Sbjct: 384 GFLAAYPETQFIELESMTSGDYTRVWAPYYTAHKILRGLLDAYLHVDDERALDLASGLCD 443
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ Y+R+ K+ +++R W + E GG+ + + LY+IT HL LA LFD +
Sbjct: 444 WMYSRLSKLPDA-TLQRMWGIFSSGEYGGLVEAIVDLYAITGKADHLALARLFDLDKLID 502
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L HAN HIPI G Y+VTG+ Y F +V Y GGTS
Sbjct: 503 ACAANTDTLDGLHANQHIPIFTGLVRLYDVTGEARYLSAAKNFWGMVIPQRMYGIGGTST 562
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW +A T+ N ETC YN+LK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 563 AEFWKARGAVAGTISDTNAETCCAYNLLKLSRSLFFHEQDPKYMDYYERALLNQVLGSKQ 622
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E ++ Y + L G + T GT CC GTG+ES +K DS+YF
Sbjct: 623 DKADAEKPLVTYFIGLEPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYF-ARA 675
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
+ LY+ Y +++ DW + V + Q D Y R T + G ++ LR+
Sbjct: 676 DGSALYVNLYSAATLDWSAKGVTIAQSTD-------YPREQGTTITVGGGGAAFAMRLRV 728
Query: 557 PVWTYSNGAQASLNGQNLP-LPPPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPE 614
P W + G + ++NG + P PG++ + R W D + + +P LRTE DD+
Sbjct: 729 PSWA-TAGFRVTVNGGVVDGTPDPGSYFTIPSRTWDDGDVVRVSIPFRLRTEKALDDQ-- 785
Query: 615 YASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNA 659
S+Q + +GP L G + G R+ + L + PS A
Sbjct: 786 --SLQTLFYGPVNLVGRNRATSYLPVGLYRN-AGLSGDLLPSLTA 827
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 58/113 (51%), Gaps = 6/113 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
++ L DV L Q + ++ L++ DVD L+ FR A L T G A GGWE
Sbjct: 45 VRPFELKDVTLGQG-LFAEKRRLMLDHGRGYDVDRLLQVFRANAGLSTKGAVAPGGWEGL 103
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTG 206
E LRGH+ GH+L+ AQ A T + +++ ++ +L+E + + TG
Sbjct: 104 DGEANGNLRGHYTGHFLTMLAQAHAGTRDTVYSDRIRYMIGALAEVREALRTG 156
>gi|294646986|ref|ZP_06724603.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294806386|ref|ZP_06765229.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|292637657|gb|EFF56058.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294446401|gb|EFG15025.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 813
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 191/692 (27%), Positives = 309/692 (44%), Gaps = 74/692 (10%)
Query: 98 FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
+ E + DV L V A++ N+E LL DVD L+ +RK A L K Y W+
Sbjct: 39 YKNEFPIADVKL-LDGVFKHARELNIEVLLKYDVDRLLAPYRKEAGLTERKKTYPNWDG- 96
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSEC--QNKIGT-----GYLSA 210
L GH GHYLSA + +A+T N +M ++ L C N I GY+
Sbjct: 97 ---LDGHVAGHYLSAMSMNYAATGNKECGRRMEYMISELQLCLEANAINNTEWAIGYIGG 153
Query: 211 FPT--ELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMV 259
FP L+ +F+ WAP+Y +HK+ AGL D ++ +N QA LK W +
Sbjct: 154 FPNSKNLWSTFKKGDLRIYNSAWAPFYNLHKMYAGLRDAWLYCNNKQAKTLFLKFCDWAI 213
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ + E+ L E GGMN++L Y IT + K+L+ A + + L
Sbjct: 214 S--------ITDDLNEEQMQTVLKMEYGGMNEILADAYQITGNKKYLVAAKRYSQNILLD 265
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
L+ D L + HANT IP IG E++GD Y F + + + S A GG S
Sbjct: 266 PLSQGIDNLDNKHANTQIPKFIGFARIAELSGDTKYTNASRFSWETITGNRSLAFGGNSR 325
Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
RE + +D + + E+C +YNMLK++ LFR YADYYER + N +LS Q
Sbjct: 326 REHFPSVTSCSDYINDVDGPESCNSYNMLKLTEDLFRMQPSAHYADYYERTVFNHILSTQ 385
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
E G +Y ++ R + + WCC GTG+E+ SK IY + +
Sbjct: 386 H-PEHGGYVYFTS-----ARPRHYRVYSAPNEAMWCCVGTGMENHSKYNQFIYTHSDDS- 438
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
L++ +I+S +WK+ + L Q+ + ++ ++T+T +S L +R P
Sbjct: 439 --LFVNLFIASELNWKNKKISLRQETN--FPYEERTKLTVTKASSP-----FKLMIRYPG 489
Query: 559 WTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W + S+NG+++ P +++ +W+ D + ++LP+ E + P +
Sbjct: 490 WVDKGALKVSVNGKSMNYSALPSSYICIDRKWNKGDVVEVELPMRSTIEHL----PNVPN 545
Query: 618 IQAILFGPYLLAGHTSGEWDIK---TGTAR-------------SLSALISPIPPSFNAQL 661
A + GP LL T E D++ G R LI + ++L
Sbjct: 546 YIAFMHGPILLGAKTGTE-DLRGLIAGDGRWGQYPSGKLLPVDQAPILIVDDMENITSKL 604
Query: 662 VTFTQESGNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLILKDASLSNF-SSLNNVIG 720
V E + + +N SI ++ P + A + + L L + + SL+ +
Sbjct: 605 VPIKNEPLHFKANIKAAN-SIDIKLEPFANIHDARYMMYWLTLTNKGYQTYIDSLSTIEK 663
Query: 721 KSVMLEPF--DFPGMLVQQGKEDELVVSESPK 750
+ ++LE DF QQ + D ++ E +
Sbjct: 664 EKIILEKLTVDFVAPGEQQPETDHKILQEKSR 695
>gi|29345759|ref|NP_809262.1| hypothetical protein BT_0349 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337652|gb|AAO75456.1| Acetyl-CoA carboxylase, biotin carboxylase [Bacteroides
thetaiotaomicron VPI-5482]
Length = 802
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 177/568 (31%), Positives = 268/568 (47%), Gaps = 65/568 (11%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
SL DV L SS +AQQT+L Y+L LD D L F + A L +Y WEN + L
Sbjct: 29 SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE 220
GH GHYLSA + M+A+T + I +++ ++ L Q +GTG++ P +L+ +
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 221 A---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQ 267
A L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ + S + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKVILDPLIKNEDR 257
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSAR 380
L+ HANT IP VIG + EV+ + + FF + V S GG S R
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKNDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 381 EFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEI--------AYADYYERALT 431
E + L + ETC TYNML++++ L++ + ++ Y DYYERAL
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY 491
N +LS Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIY 431
Query: 492 FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS 551
++ LY+ +I S +WK V L Q+ + D + + + ++K+ + +
Sbjct: 432 AHQQDT---LYVNLFIPSQLNWKEQGVTLTQET--LFPDDEKVTLRIDKAAKKNL----T 482
Query: 552 LNLRMPVWT-YSNGAQASLNG-QNLPLPPPG--NFLSATERWSYNDKLTIQLPLSLRTEA 607
L +R+P W S G + ++NG ++L G +L +W D +T LP+ + E
Sbjct: 483 LMIRIPEWAGNSKGYEITINGKKHLSDIQTGASTYLPIRRKWKKGDMITFHLPMKVSLEQ 542
Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGE 635
I D + Y A L+GP +LA T E
Sbjct: 543 IPDKKDYY----AFLYGPIVLATSTGTE 566
>gi|399033094|ref|ZP_10732120.1| hypothetical protein PMI10_04007 [Flavobacterium sp. CF136]
gi|398068528|gb|EJL59944.1| hypothetical protein PMI10_04007 [Flavobacterium sp. CF136]
Length = 1019
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 189/635 (29%), Positives = 290/635 (45%), Gaps = 101/635 (15%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--YGGWEN 156
L +V L+D + + L L D DS ++ FR P +A G W+
Sbjct: 376 LDQVVLNDNLDGHHTKFMENRDKFLTTLATTDPDSFLYMFRNAFGQEQPKEAEPLGVWDT 435
Query: 157 PISELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVVFSL----------SECQN 201
++LRGH GHYL+A AQ +AST A K+KM +V +L E
Sbjct: 436 QETKLRGHATGHYLTAIAQAYASTGYDKTLQANFKDKMEYMVNTLYDLEQLSGKPKEAGG 495
Query: 202 KI--------------------------------GTGYLSAFPTELFDSFE-------AL 222
K G G++SA+P + F E
Sbjct: 496 KFVSDPTAIPFGPGKTNYDSDLSAEGIRTDYWNWGKGFISAYPPDQFIMLENGATYGGQK 555
Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
+WAPYYT+HKILAGL+D Y ++ N +AL+ A M ++ Y R++K+ T + +
Sbjct: 556 TQIWAPYYTLHKILAGLMDVYEVSGNEKALETAKGMGDWVYARMKKLPTETLISMWNRYI 615
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
E GGMN+ + RLY IT DP +L +A LFD F G LA D HAN
Sbjct: 616 AGEFGGMNEAMARLYRITKDPHYLEVAQLFDNIKVFYGDANHSHGLAKNVDTFRGLHANQ 675
Query: 336 HIPIVIGSQMRYEVTGDP-LYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPK 387
HIP ++G+ Y + P Y++ F+ VN + Y+ GG + A F P
Sbjct: 676 HIPQIMGALEMYRDSNTPDYYRVADNFWYKTVN-DYMYSIGGVAGARNPANAECFISQPA 734
Query: 388 RLAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
+ + + G +N ETC TYNMLK++ LF + + DYYER L N +LS P
Sbjct: 735 TIYENGFSSGGQN-ETCATYNMLKLTGDLFLYEQRGELMDYYERGLYNHILSSVAENSP- 792
Query: 445 VMIYMLPLGRG-VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
Y +PL G V + + H G F CC GT IES +K +SIYF+ N LY+
Sbjct: 793 ANTYHVPLRPGSVKQFGNPHMTG-----FTCCNGTAIESNTKFQNSIYFKSADN-NSLYV 846
Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
Y+ S+ W ++ + Q D + + ++T+ + K + L +R+P W +
Sbjct: 847 NLYVPSTLKWTEKNITVKQTTD--FPNEDFTKLTIKGNGKFD------LKVRVPHWA-TK 897
Query: 564 GAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
G +NG++ + PG++L+ ++W D + +++P E + D + +I ++
Sbjct: 898 GFFVKINGKSEKVKAQPGSYLTLNKKWKDGDVIELRMPFQFHLEPVMDQQ----NIASLF 953
Query: 623 FGPYLLAGHTS---GEWDIKTGTARSLSALISPIP 654
+GP LLA S +W T + +S I+ P
Sbjct: 954 YGPILLAAQESEPGKDWRKVTLDVKDISKSIAGDP 988
>gi|395803808|ref|ZP_10483051.1| hypothetical protein FF52_18073 [Flavobacterium sp. F52]
gi|395434079|gb|EJG00030.1| hypothetical protein FF52_18073 [Flavobacterium sp. F52]
Length = 760
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 171/560 (30%), Positives = 255/560 (45%), Gaps = 66/560 (11%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L EV L D AQ +L+Y+L LD D L+ + + LP YG WEN
Sbjct: 27 LSEVKLKD------GPFKNAQDVDLKYILALDPDKLLAPYLLESRLPPKADRYGNWEN-- 78
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT----- 213
L GH GHYLSA A M+ ST N +K+++ ++ L+ CQ K G GY+ P
Sbjct: 79 IGLDGHIGGHYLSALALMYKSTGNKELKDRLDYMLSELARCQAKNGNGYVGGIPQGKVFW 138
Query: 214 ------ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFY 263
++ S L W P Y IHK+ AGL D Y + QA +K+ W +E
Sbjct: 139 DRIHKGDIDGSSFGLNNTWVPIYNIHKLFAGLTDAYQYTGSEQAKDIVIKLGDWFIE--- 195
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
+I S E+ L E GG+N+ LY IT D K+L A L L
Sbjct: 196 -----LIRPLSDEQIQKVLATEHGGINESFADLYIITKDKKYLETAEKLSHKALLNPLLQ 250
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
+ D L+ HANT IP V+G + ++ + + FF + V + A GG S E +
Sbjct: 251 KEDKLTGLHANTQIPKVVGFEKIAALSDNKEWSDGVQFFWNNVTQKRTVAFGGNSVAEHF 310
Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
+ + S E ETC +YNM ++++ LF ++ Y D+YER L N +LS Q E
Sbjct: 311 NPVNDFSGMVKSNEGPETCNSYNMERLAKALFLDKNDVHYLDFYERTLYNHILSSQH-PE 369
Query: 443 PGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
G +Y P+ R H + S WCC GTG+E+ +K G+ IY + +
Sbjct: 370 KGGFVYFTPI-------RPNHYRVYSQPQTSMWCCVGTGLENHTKYGELIYSHTQSD--- 419
Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
L++ +I S WK V L Q + PY T ++ +LN+R P W
Sbjct: 420 LFVNLFIPSVLKWKENGVELEQNTNF-----PYENQTELVLKLKKTKNF-ALNIRYPKWA 473
Query: 561 -----YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
+ NG + + Q P ++S +++W DK+ ++ S+ E + P+
Sbjct: 474 ENFEIFVNGKEQKIASQ------PSEYVSISKKWKTGDKIIVRFKTSIHLENL----PDG 523
Query: 616 ASIQAILFGPYLLAGHTSGE 635
++ A + GP +LA TS E
Sbjct: 524 SNWSAFVKGPIVLAAKTSTE 543
>gi|431799831|ref|YP_007226735.1| hypothetical protein Echvi_4552 [Echinicola vietnamensis DSM 17526]
gi|430790596|gb|AGA80725.1| putative glycosyl hydrolase of unknown function (DUF1680) [Echinicola
vietnamensis DSM 17526]
Length = 1042
Score = 230 bits (586), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 196/665 (29%), Positives = 295/665 (44%), Gaps = 97/665 (14%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY--GGWEN 156
L VSL SS + + L + D ++ FR P A G W++
Sbjct: 398 LDAVSLETDIHGHSSKFIENRDKFISTLAGTNPDDFLYMFRNAFGQEQPAGAVPLGVWDS 457
Query: 157 PISELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVV---FSLSECQNKI----- 203
++LRGH GHYL+A AQ +AST A +KM+ +V ++LS+ K
Sbjct: 458 QETKLRGHATGHYLTAIAQAYASTGYDTALQANFADKMAYMVNTLYNLSQMAGKPSAEAD 517
Query: 204 ----------------------------------GTGYLSAFPTELFDSFE-------AL 222
G GY+SA+P + F E
Sbjct: 518 GHNADPTAVPMGPGKDFYDSDLSEEGIRTDYWNWGEGYISAYPPDQFIMLEHGAKYGGQK 577
Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
VWAPYYT+HKILAGL+D Y ++ N +AL +A M + R+ K+ T + +
Sbjct: 578 DQVWAPYYTLHKILAGLMDIYEVSGNEKALSVAKGMGTWVAARLDKLPTSTLISMWNTYI 637
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
E GGMN+ + RLY IT ++L A LFD F G LA D HAN
Sbjct: 638 AGEFGGMNEAMARLYRITGSSRYLAAAKLFDNITVFYGNADHDHGLAKNVDTFRGLHANQ 697
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPKR 388
HIP ++G+ Y T Y I F I + Y+ GG + A F +P
Sbjct: 698 HIPQIMGALEMYRDTESAPYFHIADNFWHIATNDYMYSIGGVAGARTPANAECFTTEPAT 757
Query: 389 LAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
L + + G +N ETC TYNMLK+SR+LF + ++ AY DYYER L N +L+ P
Sbjct: 758 LYEFGFSAGGQN-ETCATYNMLKLSRNLFLFQQDPAYMDYYERGLYNHILASVAKDSP-A 815
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
Y +PL G K K F CC GT IES +KL +SIYF+ + LY+
Sbjct: 816 NTYHVPLRPGSIKQFGN----PKMKGFTCCNGTAIESSTKLQNSIYFKSVDD-QSLYVNL 870
Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
++ S+ WK ++ + Q + + R+T+ Q G+ L +R+P W + G
Sbjct: 871 FVPSTLHWKERNLTIVQST--AFPKEDHTRLTV-----QGKGKF-VLKIRVPQWA-TEGI 921
Query: 566 QASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
+ S+NG+ + PG + + +W D + I +P E + D + +I ++ +G
Sbjct: 922 KVSINGKPAQVDAVPGTYATIQRKWKNGDTIDINIPFQFHLEPVMDQQ----NIASLFYG 977
Query: 625 PYLLAGHTS---GEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQS 681
P LLA EW T A+++ A I+ P + + T + T+ +
Sbjct: 978 PVLLAAQEEEPRKEWRKVTLNAKNIGATINGNPEALEFTIDGVTYKPFYETYGRHSVYLD 1037
Query: 682 ITMEE 686
+T+E+
Sbjct: 1038 VTLED 1042
>gi|261407096|ref|YP_003243337.1| hypothetical protein GYMC10_3284 [Paenibacillus sp. Y412MC10]
gi|261283559|gb|ACX65530.1| protein of unknown function DUF1680 [Paenibacillus sp. Y412MC10]
Length = 622
Score = 229 bits (585), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 171/572 (29%), Positives = 265/572 (46%), Gaps = 78/572 (13%)
Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASL----PTPGKAYGGWENPISELRGHFVGHYL 170
L R ++ N YL+ LD L++++ A P A+GGWE P+ +LRGHF+GH+L
Sbjct: 16 LIRRERANRSYLMKLDSGHLLFNYHLEAGRFHGRTIPEGAHGGWETPVCQLRGHFLGHWL 75
Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYY 230
S +A + + + +K K+ +V L ECQ G ++ P + + K +WAP Y
Sbjct: 76 SGAALHYEESGDIELKAKLDAIVHELHECQRDNGGQWVGPIPEKYLHWIASGKSIWAPQY 135
Query: 231 TIHKILAGLLDQYVLADNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
HKIL GL+D + A N QAL + A W VE+ ++ E+ L+ ET
Sbjct: 136 NCHKILMGLVDAWQYAGNRQALDIVDRFADWFVEW--------SGTFTREQFDDILDVET 187
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGM +V L IT K+ +L + + L D L++ HANT IP V+G
Sbjct: 188 GGMLEVWADLLHITGADKYRVLLDRYYRSRLFQPLLEGKDPLTNMHANTTIPEVLGCARA 247
Query: 347 YEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
YEVTGD + ++ ++ V S ATGG +A E W ++ LG +N+E CT YN
Sbjct: 248 YEVTGDDRWLSIVQAYWNCAVTERGSLATGGQTAGEVWMPKMKMKARLGDKNQEHCTVYN 307
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE------------PGVMIYMLPLG 453
M++++ LFR + + YA Y E L NG+++ E G++ Y LP+
Sbjct: 308 MIRLADFLFRQSGDPTYAQYIEYNLYNGIMAQAYYQEYGLTGSQHNYPRTGLLTYFLPMK 367
Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
G+ K W T+ +SF+CC+GT +++ + IY+ ++G++ +YI QY S D
Sbjct: 368 AGLRKE-----WSTETDSFFCCHGTMVQANAAWNMGIYY-QDGDI--VYISQYFDSELDA 419
Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSK----QEVGQLSSLN---------------- 553
++ IV + +L SS Q + +S+N
Sbjct: 420 SIAGTLIR-----IVQTQDKMSGSLLSSSNTAGYQAINDTASINENIPTFRKYDFIVSAA 474
Query: 554 --------LRMPVWTYSNGAQASLNG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
R+P W + GA +N Q L NF W D ++I LP+ +
Sbjct: 475 APTTFTLRFRIPEWIMA-GASVYVNDVLQGTTLDSE-NFYDIHRAWKEGDTVSIMLPIGI 532
Query: 604 RTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
R + DD A +GP +LAG E
Sbjct: 533 RFVPLPDDE----RTGAFRYGPEVLAGLCESE 560
>gi|293370109|ref|ZP_06616674.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292634837|gb|EFF53361.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 800
Score = 229 bits (585), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 172/567 (30%), Positives = 266/567 (46%), Gaps = 64/567 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L LD D L+ F + A L +Y WEN + L G
Sbjct: 30 LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L+ Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + EV+ D + FF + V S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAEVSQDDKTWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LY+ +I S WK ++L Q+ D + + + + K++ +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETR--FPDDDKVTLRIDEAPKKK----RTL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGEW 636
D + Y A L+GP +LA T E+
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTEY 566
>gi|295085157|emb|CBK66680.1| Uncharacterized protein conserved in bacteria [Bacteroides
xylanisolvens XB1A]
Length = 800
Score = 229 bits (584), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 171/567 (30%), Positives = 266/567 (46%), Gaps = 64/567 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L LD D L+ F + A L +Y WEN + L G
Sbjct: 30 LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L+ Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LY+ +I S WK ++L Q+ D + + + + K++ +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETR--FPDDDKVTLRIDEAPKKK----RTL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGEW 636
D + Y A L+GP +LA T E+
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTEY 566
>gi|419850639|ref|ZP_14373619.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
gi|419851584|ref|ZP_14374510.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
gi|386408481|gb|EIJ23391.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
gi|386413301|gb|EIJ27914.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
Length = 1834
Score = 229 bits (584), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 186/610 (30%), Positives = 276/610 (45%), Gaps = 87/610 (14%)
Query: 85 KIKNPGGFDLP--GNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTA 142
+ +N GG D+ N+L E + +V + L A + +EYLL + D L+ FR A
Sbjct: 208 QTENGGGHDVQYLKNYLSEQGMENVTV-ADEYLQNAGKKEVEYLLSFEPDRLLVEFRAQA 266
Query: 143 SLPTPG-KAYGGWENPISELR------------GHFVGHYLSASAQMWAST-----HNAT 184
L T G K YGGWEN E R GHFVGH++SA++Q ST A
Sbjct: 267 GLDTKGAKNYGGWENGPDESRNPDGSSKPGRFTGHFVGHWISAASQAQRSTFATADQKAQ 326
Query: 185 IKEKMSTVVFSLSECQ------NKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAG 238
+ ++ VV + E Q + G+ AF + + + P+Y +HK+ AG
Sbjct: 327 LSANLTAVVKGIREAQEAYAKKDTANAGFFPAFSASVVPNGGG--GLIVPFYNLHKVEAG 384
Query: 239 LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYS 298
++ Y + +A+ + A F + V+ S L E GGMND LY++
Sbjct: 385 MVQAYDYSTDAETRETAKAAAVDF---AKWVVNWKSAHASTDMLRTEYGGMNDALYQVAE 441
Query: 299 ITH--DPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY-------- 347
I D + +L A HLFD+ LA D L+ HANT IP + G+ RY
Sbjct: 442 IADASDKQTVLTAAHLFDETALFQKLANGQDPLNGLHANTTIPKLTGAMQRYVAYTEDED 501
Query: 348 ---EVTGD------PLYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPKRLAD 391
++ D LY F DIV H+Y GG S A E W D + D
Sbjct: 502 LYNSLSADERGELTSLYLKAAQNFFDIVVKDHTYVNGGNSQSEHFHVAGELWKDATQNGD 561
Query: 392 TLGS----ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
G ETC YNMLK++R LF+ TK+ Y++YYE N +++ Q E G+
Sbjct: 562 QNGGYRNFSTVETCNEYNMLKLARILFQVTKDSKYSEYYEHTFINAIVASQN-PETGMTT 620
Query: 448 YMLPLGRGVSKARSTHG-------WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
Y P+ G K G +G +WCC GTGIE+F+KL DS YF +E NV
Sbjct: 621 YFQPMKAGYPKVFGITGTDYDADWFGGAIGEYWCCQGTGIENFAKLNDSFYFTDENNV-- 678
Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
Y+ + SS++ ++ + Q + + D ++ T S ++L LR+P W
Sbjct: 679 -YVNMFWSSTYTDTRHNLTITQTANVPKTEDVTFEVSGTGS--------ANLKLRVPDWA 729
Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
+NG + ++G L N T K+T LP L+T D++ ++ + Q
Sbjct: 730 ITNGVKLVVDGTEQALTKDENGW-VTVAIKDGAKITYTLPAKLQTIDAADNK-DWVAFQ- 786
Query: 621 ILFGPYLLAG 630
+GP +LAG
Sbjct: 787 --YGPVVLAG 794
>gi|427403045|ref|ZP_18894042.1| hypothetical protein HMPREF9710_03638 [Massilia timonae CCUG 45783]
gi|425718056|gb|EKU81008.1| hypothetical protein HMPREF9710_03638 [Massilia timonae CCUG 45783]
Length = 781
Score = 229 bits (584), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 168/537 (31%), Positives = 249/537 (46%), Gaps = 45/537 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ TNL YL+ ++ D L+ F + A L +YG WE+ + L GH GHYLSA A M
Sbjct: 38 AQTTNLNYLMAMEPDRLLAPFLREAGLQPRQPSYGNWES--TGLDGHMGGHYLSALALMH 95
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL------------FDSFEALKPV 225
AST + +++ V L Q G GYL P D+F ++
Sbjct: 96 ASTGDQEALRRLNYFVAELKRAQQANGDGYLGGIPGGRQAWRDIAAGKLEADNF-SVNGK 154
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
W P+Y +HK+ AGL D Y A N A M + ++ K+ S E+ L E
Sbjct: 155 WVPWYNLHKVYAGLRDAYRYAGNEDAKAMLVQLSDWALALSAKL----SPEQMQTMLRSE 210
Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
GGMN++ + +T + K+L LA F L LA + D L+ HANT IP VIG +
Sbjct: 211 HGGMNEIFVDVAEMTGERKYLDLALAFSHQAVLQPLARKQDQLTGLHANTQIPKVIGFKR 270
Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
++TG FF V + A GG S +E + + E ETC TY
Sbjct: 271 IADMTGRQDMGEAARFFWQTVVDKRTVAIGGNSVKEHFHSTDDFDPMVHEVEGPETCNTY 330
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
NMLK++ LFR ++ Y+DYYERAL N +LS QR G +Y P+ + S
Sbjct: 331 NMLKLTGMLFRSEQKGMYSDYYERALYNHILSSQR--PEGGFVYFTPMRPNHYRVYSQVD 388
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
G WCC G+GIES +K G+ IY ++ L++ +++S+ DWK V + Q
Sbjct: 389 KG-----MWCCVGSGIESHAKYGEFIYARDKDT---LFVNLFVASTLDWKDKGVRVTQAT 440
Query: 525 D-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNF 582
P R+T+ + ++ +R P W +NG + + PG +
Sbjct: 441 TFPDAD---TTRLTVDGEGR------FTMKIRYPAWVAPGRMAVRVNGAEVKIDARPGGY 491
Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIK 639
+ W D++ ++LP++ E + P ++ A+L GP +LA T D K
Sbjct: 492 ATIARAWRKGDRVDVRLPMTTHLEQM----PGRSNYYAVLHGPVVLAARTRMVGDDK 544
>gi|427386394|ref|ZP_18882591.1| hypothetical protein HMPREF9447_03624 [Bacteroides oleiciplenus YIT
12058]
gi|425726434|gb|EKU89299.1| hypothetical protein HMPREF9447_03624 [Bacteroides oleiciplenus YIT
12058]
Length = 792
Score = 229 bits (583), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 184/625 (29%), Positives = 284/625 (45%), Gaps = 68/625 (10%)
Query: 113 SVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSA 172
S +AQQT+L Y+L ++ D L+ F + A L +Y WEN + L GH GHY+SA
Sbjct: 38 SPFLQAQQTDLHYILAMEPDRLLAPFLREAGLAPKAPSYTNWEN--TGLDGHIGGHYISA 95
Query: 173 SAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL---------------FD 217
+ M+A+T + + +++ ++ L Q +GTG++ P L FD
Sbjct: 96 LSMMYAATGDTAVYNRLNYMLDELHRAQQAVGTGFIGGTPGSLQLWKEIKEGNIRAGGFD 155
Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
L W P Y IHK AGL D Y+ A + A +M + ++ + + ++
Sbjct: 156 ----LNSKWVPLYNIHKTYAGLRDAYLYAGSDLAREMLIALTDWMIG----ITAGLTDQQ 207
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHI 337
L E GG+N+ + +IT D K+L LA F L L D L+ HANT I
Sbjct: 208 MQDMLRSEHGGLNETFADVAAITGDKKYLELARRFSHKVILDPLIKDEDRLTGMHANTQI 267
Query: 338 PIVIGSQMRYEVTGDP-------LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
P VIG + E++ D + FF + V S GG S RE + +
Sbjct: 268 PKVIGYKRIAELSQDDNVWNHATEWDHAARFFWNTVVNHRSVCIGGNSVREHFHPANDFS 327
Query: 391 DTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
L E ETC TYNML++++ L++ + + +ADYYERAL N +L+ Q + G +Y
Sbjct: 328 PMLNDIEGPETCNTYNMLRLTKMLYQDSPDSRFADYYERALYNHILASQE-PDKGGFVYF 386
Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
P+ G + + S WCC G+G+E+ +K G+ IY ++ LY+ +I S
Sbjct: 387 TPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYAHQKDT---LYVNLFIPS 438
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-YSNGAQAS 568
WK V L Q+ + LR + +SK+ ++++R P W S G
Sbjct: 439 QLTWKEKGVSLVQETRFPDNGQVTLR--IDKASKKAF----TISIRQPEWADSSKGYNLK 492
Query: 569 LNGQNLPLPPPGN--FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
+NG+ N +LS +W D +T LP+ ++ E I D Y A L+GP
Sbjct: 493 VNGKEQSSATATNSGYLSVNRKWKKGDVVTFTLPMQIKMEQIPDKENYY----AFLYGPI 548
Query: 627 LLAGHTSGEW-------DIKTG-TARSLSALISPIPPSF-NAQLVTFTQESGNSTFVMSN 677
+LA T E D + G A +S IP N + ++ + NST + N
Sbjct: 549 VLAASTGTEHLDGLYADDSRGGHIAHGKQIPVSEIPMLIGNPEAISQSLHKENSTQLAFN 608
Query: 678 SNQSITMEEFPVSGTDAALHATFRL 702
+ + +P SG L FRL
Sbjct: 609 YDGKV----YPASGKAMKLIPFFRL 629
>gi|322692034|ref|YP_004221604.1| cell surface protein [Bifidobacterium longum subsp. longum JCM
1217]
gi|320456890|dbj|BAJ67512.1| putative cell surface protein [Bifidobacterium longum subsp. longum
JCM 1217]
Length = 1984
Score = 228 bits (582), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 185/610 (30%), Positives = 276/610 (45%), Gaps = 87/610 (14%)
Query: 85 KIKNPGGFDLP--GNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTA 142
+ +N GG D+ N+L E + +V + L A + +EYLL + D L+ FR A
Sbjct: 358 QTENGGGHDVQYLKNYLSEQGMENVTV-ADEYLQNAGKKEVEYLLSFEPDRLLVEFRAQA 416
Query: 143 SLPTPG-KAYGGWENPISELR------------GHFVGHYLSASAQMWAST-----HNAT 184
L T G K YGGWEN E R GHFVGH++SA++Q ST A
Sbjct: 417 GLDTKGAKNYGGWENGPDESRNPDGSSKPGRFTGHFVGHWISAASQAQRSTFATADQKAQ 476
Query: 185 IKEKMSTVVFSLSECQ------NKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAG 238
+ ++ VV + E Q + G+ AF + + + P+Y +HK+ AG
Sbjct: 477 LSANLTAVVKGIREAQEAYAKKDTANAGFFPAFSASVVPNGGG--GLIVPFYNLHKVEAG 534
Query: 239 LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYS 298
++ Y + +A+ + A F + V+ S L E GGMND LY++
Sbjct: 535 MVQAYDYSTDAETRETAKAAAVDF---AKWVVNWKSAHASTDMLRTEYGGMNDALYQVAE 591
Query: 299 ITH--DPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY-------- 347
I D + +L A HLFD+ LA D L+ HANT IP + G+ RY
Sbjct: 592 IADASDKQTVLTAAHLFDETALFQKLANGQDPLNGLHANTTIPKLTGAMQRYVAYTEDED 651
Query: 348 ---EVTGDPLYKLIGTF------FMDIVNASHSYATGGTS-------AREFWWDPKRLAD 391
++ D KL + F DIV H+Y GG S A E W D + D
Sbjct: 652 LYNSLSADERGKLTSLYLKAAQNFFDIVVKDHTYVNGGNSQSEHFHVAGELWKDATQNGD 711
Query: 392 TLGS----ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
G ETC YNMLK++R LF+ TK+ Y++YYE N +++ Q E G+
Sbjct: 712 QNGGYRNFSTVETCNEYNMLKLARILFQVTKDSKYSEYYEHTFINAIVASQN-PETGMTT 770
Query: 448 YMLPLGRGVSKARSTHG-------WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
Y P+ G K G +G +WCC GTGIE+F+KL DS YF +E NV
Sbjct: 771 YFQPMKAGYPKVFGITGTDYDADWFGGAIGEYWCCQGTGIENFAKLNDSFYFTDENNV-- 828
Query: 501 LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
Y+ + SS++ ++ + Q + + D ++ T S ++L LR+P W
Sbjct: 829 -YVNMFWSSTYTDTRHNLTITQTANVPKTEDVTFEVSGTGS--------ANLKLRVPDWA 879
Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
+NG + ++G L N T K+T LP L+ D++ ++ + Q
Sbjct: 880 ITNGVKLVVDGTEQALTKDENGW-VTVAIKDGAKITYTLPAKLQAIDAADNK-DWVAFQ- 936
Query: 621 ILFGPYLLAG 630
+GP +LAG
Sbjct: 937 --YGPVVLAG 944
>gi|255691978|ref|ZP_05415653.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides
finegoldii DSM 17565]
gi|260622387|gb|EEX45258.1| hypothetical protein BACFIN_07051 [Bacteroides finegoldii DSM
17565]
Length = 800
Score = 228 bits (582), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 173/566 (30%), Positives = 269/566 (47%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L DV L S L +AQQT+L Y+L L+ D L+ F + A L +Y WEN + L G
Sbjct: 30 LQDVKLLDSPFL-QAQQTDLHYILALNPDRLLAPFLREAGLTPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + I +++ ++ L Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAIYNRLNYMLDELYRAQQAVGTGFIGGTPGSLQLWKEIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ + QA +M WM++
Sbjct: 147 GNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYTGSDQARRMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S ++ L E G+N+ + +IT D K+L LA F L L D L
Sbjct: 199 ITSGLSDQQIQDMLRSEHSGLNETFADVAAITGDKKYLELARRFSHKIILDPLIKDKDRL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V + S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAEEWDHAARFFWNTVVNNRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ + + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPADNFTSMINDVQGPETCNTYNMLRLTKMLYQNSHNPCNINEPDPNYINYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
++ LY+ +I S +WK V+L Q+ D + + + +SK++ +L
Sbjct: 433 HQKDT---LYVNLFIPSQLNWKEQGVILTQETR--FPDDNKVTLRIDKASKKQ----RTL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQNLPLP-PPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S+ S+NG+ P GN +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSSNYSISINGKKETFPTKKGNQYLPLSRKWKKGDVITFNLPMKVTIEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565
>gi|325299889|ref|YP_004259806.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
gi|324319442|gb|ADY37333.1| protein of unknown function DUF1680 [Bacteroides salanitronis DSM
18170]
Length = 797
Score = 228 bits (582), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 172/537 (32%), Positives = 247/537 (45%), Gaps = 47/537 (8%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
+A N++ L D D L+ + K A LP+ + + WE L GH GHYLSA A
Sbjct: 43 QACDLNVKTLKQYDTDRLLAPYLKEAGLPSKAEGFSNWEG----LDGHVGGHYLSALAIH 98
Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA-----LKPVWAPY 229
+A+T +A +++M +V L CQ G GY+ P L+ + + W P+
Sbjct: 99 YAATGDAECRQRMDYMVSELKRCQEAHGNGYIGGVPDGERLWKEIQQGNVGLIWKYWVPW 158
Query: 230 YTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGM 289
Y +HK AGL D + N +A +M + ++ VI S E+ L E GGM
Sbjct: 159 YNLHKTYAGLRDAWAYGGNEEARQMFLDLCDWGLT----VIAPLSDEQMEQMLENEFGGM 214
Query: 290 NDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEV 349
++V Y +T D K+L A F L +A D L + HANT +P V+G Q E+
Sbjct: 215 DEVYADAYEMTGDVKYLDAAKRFSHHWLLDSMAAGIDNLDNKHANTQVPKVVGYQRIAEL 274
Query: 350 TGD-------PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR-LADTLGSENEETC 401
+ LY+ FF V + S A GG S RE + + L+ E E+C
Sbjct: 275 SARSGHTEDAALYRKASEFFWQTVVETRSLALGGNSRREHFAPAEDCLSYVYDREGPESC 334
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
T NMLK++ LFR E YADYYERA+ N +LS Q E G +Y P AR
Sbjct: 335 NTNNMLKLTEGLFRLNPEARYADYYERAVLNHILSTQH-PEHGGYVYFTP-------ARP 386
Query: 462 TH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
H + ++ WCC GTG+E+ K G+ IY E LY+ +I+S DW V
Sbjct: 387 AHYRVYSAPNSAMWCCVGTGMENHGKYGELIYTHTENE---LYVNLFIASELDWAERGVR 443
Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP 579
+ Q+ + +R+T+ + E L +R P W + QA LNGQ+
Sbjct: 444 IIQETK--FPDEESVRLTI----RTEKPMKFKLLIRHPHWCRTGAMQAVLNGQDYAAASV 497
Query: 580 GNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ ER W DK+ ++LP+S+ E + P AIL GP LL E
Sbjct: 498 SSSYIEIERIWKDGDKVQLELPMSVSVEEL----PNVPQYIAILRGPVLLGARMGTE 550
>gi|371776971|ref|ZP_09483293.1| Acetyl-CoA carboxylase, biotin carboxylase [Anaerophaga sp. HS1]
Length = 794
Score = 228 bits (582), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 163/534 (30%), Positives = 255/534 (47%), Gaps = 48/534 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A++ N +Y++ D D ++ F A L + YG WE S L GHF GHYL++ + M
Sbjct: 49 AEELNEKYVMAHDPDRILAPFLIDAGLKPKAQGYGNWEG--SGLNGHFGGHYLTSLSLMI 106
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEALKPVW 226
AST + ++++ +V L+ CQ G GY+ P + +L W
Sbjct: 107 ASTGSEEARKRLDYMVDQLARCQKANGNGYVGGIPGGQAMWAEIAKGNINAGNFSLNGKW 166
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
P Y IHK+ AGL D ++LA N +A ++ + ++F N + K +T +++ L E
Sbjct: 167 VPLYNIHKLFAGLRDAWLLAQNKKAKEVLINLTDWFLN-LTKNLTDDQIQK---MLVSEH 222
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GG+N+V +Y IT + +L LA F L L Q D L+ HANT IP VIG
Sbjct: 223 GGLNEVFADVYDITGNENYLKLARRFSHQAILRPLLQQKDQLTGLHANTQIPKVIGFMRI 282
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
E+ D + FF + V + + + GG S E + + + S + ETC TYN
Sbjct: 283 GELAHDTAWINAADFFWNTVVQNRTVSIGGNSTHEHFHAVDDFSSMIESRQGPETCNTYN 342
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
MLK+S+ LF + ++ Y DYYE+AL N +LS Q G ++Y + + R +
Sbjct: 343 MLKLSKQLFLFKNDLKYIDYYEQALYNHILSSQHPLHGG-LVYFTSM-----RPRHYRVY 396
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK-- 523
+FWCC G+GIE+ K G+ IY ++ NV Y+ +I S WK + L Q+
Sbjct: 397 SRPEQTFWCCVGSGIENHEKYGELIYAHDDENV---YVNLFIPSILHWKEKQLKLVQENH 453
Query: 524 ---VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-P 579
+D I +R+ ++ VG +R P WT +NG+ P
Sbjct: 454 FPDIDKIT-----IRVEPQRKTEFVVG------IRCPAWTRPEDMNVLVNGKAFKGKAIP 502
Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
G++ W ND + + LP+ + + D P Y S ++ GP++LA T
Sbjct: 503 GHYFLIRRYWEKNDVIEVHLPMHTYGKFLPDGSP-YLS---LMHGPFVLAATTD 552
>gi|408369881|ref|ZP_11167661.1| hypothetical protein I215_03228 [Galbibacter sp. ck-I2-15]
gi|407744935|gb|EKF56502.1| hypothetical protein I215_03228 [Galbibacter sp. ck-I2-15]
Length = 1011
Score = 228 bits (581), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 189/618 (30%), Positives = 281/618 (45%), Gaps = 99/618 (16%)
Query: 126 LLMLDVDSLVWSFRKT--ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAST-HN 182
L D DS ++ FR S P K G W++ ++LRGH GHYL+A AQ +AS+ ++
Sbjct: 395 LAKTDPDSFLYMFRNAFGVSQPQDAKPLGVWDSQETKLRGHATGHYLTAIAQAYASSSYD 454
Query: 183 ATIKE----KMSTVV---FSLSECQNKI-------------------------------- 203
+KE KM+ +V + LS+ +
Sbjct: 455 EQLKELFAQKMNYMVETLYDLSKLSGQPINSGGEHVSDPTKVPFGPGKTDYNSDLSEQGI 514
Query: 204 -------GTGYLSAFPTELFDSFEA-------LKPVWAPYYTIHKILAGLLDQYVLADNA 249
GTGY+SA+P + F E+ +WAPYYT+HKILAGLLD Y ++ N
Sbjct: 515 RNDYWNWGTGYISAYPPDQFIMLESGATYGGQNDQIWAPYYTLHKILAGLLDVYEISGNK 574
Query: 250 QALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLA 309
+AL +A M ++ R+ ++ T + + E GGMN+V+ RLY +T +L +A
Sbjct: 575 KALSVAQGMGDWVSARMVELPTSTLISMWNRYIAGEYGGMNEVMARLYRLTGTESYLKVA 634
Query: 310 HLFDK-PCFLG------FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFF 362
LFD F G LA D H+N HIP ++G+ Y T + Y I F
Sbjct: 635 GLFDNIKMFYGDAQHTHGLAKNVDTFRGLHSNQHIPQIVGALEMYRDTDEVEYFKIADNF 694
Query: 363 MDIVNASHSYATGGTS-------AREFWWDPKRLAD---TLGSENEETCTTYNMLKVSRH 412
+ Y+ GG + A F P L + + G +N ETC TYNMLK++R
Sbjct: 695 WFKATHDYMYSIGGVAGARNPANAECFPVQPATLYENGFSSGGQN-ETCATYNMLKLTRD 753
Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
LF + + DYYER L N +L+ P Y +PL G K H F
Sbjct: 754 LFFFEPKAQLMDYYERGLYNHILASVAKDSPA-NTYHVPLLPGSVK----HFGNPDMTGF 808
Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
CC GT IES +KL +SIYF+ + N LY+ +I S+ W ++ + Q + S+
Sbjct: 809 TCCNGTAIESSTKLQNSIYFKGKDN-KSLYVNLFIPSTLHWTERNIEIQQ----VTSFPK 863
Query: 533 YLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSY 591
TL + K L LR+P W +NG S+NG+ + + PG++LS +W
Sbjct: 864 EDNTTLKVTGKGRF----DLKLRVPNWA-TNGYHVSINGKEMDIQVTPGSYLSIDRKWKN 918
Query: 592 NDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG---EWDIKTGTARSLSA 648
D + + +P R E + D + +I ++ +GP LLA W T A +
Sbjct: 919 GDIIELSMPFDFRLEPVMDQQ----NIASLFYGPVLLAAQEESPLTHWRKVTFDAEQIGK 974
Query: 649 LISPIPPS--FNAQLVTF 664
I P + FN + + F
Sbjct: 975 FIKGDPSTLEFNYKGIEF 992
>gi|409196987|ref|ZP_11225650.1| Acetyl-CoA carboxylase, biotin carboxylase [Marinilabilia
salmonicolor JCM 21150]
Length = 788
Score = 228 bits (581), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 160/528 (30%), Positives = 252/528 (47%), Gaps = 38/528 (7%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A+Q N +Y+ D D L+ F A L YG WE S L GH GHYL++ A M
Sbjct: 43 AEQLNEKYVFAHDPDRLLAPFLIDAGLEPKAPGYGNWEG--SGLNGHIGGHYLTSLALMV 100
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELFD-SFEA----LKPVW 226
AST N +E++ ++ L+ CQ G GY+ P E+ + +A L W
Sbjct: 101 ASTGNEEAQERLDYMIEELARCQEANGNGYVGGIPGGQPMWAEIAKGNIDAGGFSLNGKW 160
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
P Y IHK+ AGL D + A +AL++ + ++F + V + S E+ L E
Sbjct: 161 VPLYNIHKLFAGLHDAWKYAGKEKALEILIQLTDWFID----VNSGLSDEQIQEILVSEH 216
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GG+N+V +Y IT + K+L LA + L L D L+ HANT IP V+G
Sbjct: 217 GGLNEVFADVYDITGEDKYLTLARQYSHRSILEPLLNHEDKLTGLHANTQIPKVVGFMRV 276
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
E+ GD + FF + V ++ + GG S E + + + S + ETC TYN
Sbjct: 277 GELAGDSAWIDASDFFWNTVVSNRTITIGGNSTHEHFHPVDDFSSMVESRQGPETCNTYN 336
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
MLK+S+ L+ + ++ Y DYYE+AL N +LS Q E G ++Y P+ + + +
Sbjct: 337 MLKLSKQLYLYKNDLRYVDYYEQALYNHILSSQH-PEHGGLVYFTPM-----RPQHYRVY 390
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
+FWCC G+GIE+ K G+ IY + +V ++ +I S +W+ + L QK +
Sbjct: 391 SNPEETFWCCVGSGIENHEKYGELIYAHSDDDV---FVNLFIPSELNWEEKGLKLTQKTN 447
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLS 584
+ L++ L + +G +R P W + ++NG+ PG +
Sbjct: 448 FPDNEQTTLKVELPEARSFTIG------IRYPQWMKEGEMKVTVNGKRARGGGAPGAYYQ 501
Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
W D++T+ L + E + D+ P +I GP++LA T
Sbjct: 502 VKREWQDGDEITVNLKMHTSGEYLPDNSP----FLSIKHGPFVLAAVT 545
>gi|359453850|ref|ZP_09243152.1| hypothetical protein P20495_1902 [Pseudoalteromonas sp. BSi20495]
gi|358049097|dbj|GAA79401.1| hypothetical protein P20495_1902 [Pseudoalteromonas sp. BSi20495]
Length = 816
Score = 228 bits (581), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 165/526 (31%), Positives = 254/526 (48%), Gaps = 38/526 (7%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
AQQTN+ YLL L D L+ + + A + +YG WE+ + L GH GHYLS+ +
Sbjct: 63 HAQQTNVRYLLALYPDQLLAPYLREAGIEQKAPSYGNWED--TGLDGHIGGHYLSSLSLA 120
Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP------TELFD-----SFEALKPV 225
WA+T + +K ++ ++ L Q ++ GYL P ++ D +L
Sbjct: 121 WAATGDEELKRRLDYMLNELQRAQ-QVNDGYLGGIPDGQAMWQQIHDGNIKADLFSLNDR 179
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
W P Y I KI GL D Y++A + QA M + E+F N K+ S E+ L E
Sbjct: 180 WVPLYNIDKIFHGLRDAYLIAGSEQAKTMLFDLGEWFLNLTAKL----SDEQIQQMLYSE 235
Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
GG+N V + +I +D ++L LA F + L + D L+ HANT IP +IG
Sbjct: 236 YGGLNAVFADMATIGNDKRYLKLARQFTHNNIIDPLLEKQDKLTGLHANTQIPKIIGMLK 295
Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
E + D ++ +F V S A GG S E + D + E ETC TY
Sbjct: 296 VAEASDDKAWQQGADYFWQTVTKQRSVAIGGNSVSEHFHDKNDFTPMVEDVEGPETCNTY 355
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
NM+K+S+ LF T + Y +YYERA N +LS Q E G ++Y + G + S+
Sbjct: 356 NMMKLSKLLFLKTADTRYLEYYERATYNHILSSQH-PEHGGLVYFTSMRPGHYRMYSSVQ 414
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW-KSGHVVLNQK 523
+S WCC G+GIE+ SK G+ IY + + N L++ +I S+ DW + G V Q
Sbjct: 415 -----DSMWCCVGSGIENHSKYGEQIYSKNDDN---LWVNLFIPSTLDWQQQGLKVTQQS 466
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
+ P + + + + K+ + + L++R P W ++ Q LNG+ + +
Sbjct: 467 LFPDAN---NITLVINTLDKKHISS-AQLHIRKPSWV-TDELQFELNGKAINATAEQGYY 521
Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
+ W D LT L L TE + D + Y A+L+GP ++A
Sbjct: 522 AIKHDWHDGDNLTFTLAPKLYTEQLPDGQDYY----AVLYGPVVMA 563
>gi|299146241|ref|ZP_07039309.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
gi|298516732|gb|EFI40613.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
Length = 800
Score = 228 bits (581), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 171/566 (30%), Positives = 265/566 (46%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L LD D L+ F + A L +Y WEN + L G
Sbjct: 30 LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L+ Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LY+ +I S WK ++L Q+ D + + + + K++ +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETR--FPDDDKVTLRIDEAPKKK----RTL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKIFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565
>gi|354580825|ref|ZP_08999729.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
gi|353201153|gb|EHB66606.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
Length = 623
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 162/557 (29%), Positives = 260/557 (46%), Gaps = 58/557 (10%)
Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASL----PTPGKAYGGWENPISELRGHFVGHYL 170
L R ++ N YL+ LD L+++++ A P A+GGWE P+ +LRGHF+GH+L
Sbjct: 16 LIRRERANRSYLMKLDSGHLLFNYQLEAGRFHGRTIPEGAHGGWETPVCQLRGHFLGHWL 75
Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYY 230
S +A + + + +K K+ +V L ECQ G ++ P + K +WAP Y
Sbjct: 76 SGAAMHYEKSGDMELKAKLDAIVQELHECQRDNGGQWVGPIPEKYLHWIARGKSIWAPQY 135
Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
+HKIL GL+D + A N QAL + ++F N ++ E+ L+ ETGGM
Sbjct: 136 NLHKILMGLVDAWQYAGNRQALDIVDRFADWFVNWS----GTFTREQFDDILDVETGGML 191
Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
+V L IT K+ +L + + L D L++ HANT IP V+G YEVT
Sbjct: 192 EVWADLLHITGADKYRVLLERYYRSRLFQPLLEGKDPLTNMHANTTIPEVLGCARAYEVT 251
Query: 351 GDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
GD + ++ ++ V S ATGG +A E W ++ LG +N+E CT YNM+++
Sbjct: 252 GDDRWLSIVQAYWKCAVTERGSLATGGQTAGEVWMPKMKMKARLGDKNQEHCTVYNMIRL 311
Query: 410 SRHLFRWTKEIAYADYYERALTNGVL------------SIQRGTEPGVMIYMLPLGRGVS 457
+ LFR T + +YA Y E L NG++ S + G++ Y LP+ G+
Sbjct: 312 AEFLFRQTGDPSYAQYIEYNLYNGIMAQAYYQEYGLTGSQHKHPHTGLLTYFLPMKAGLR 371
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF------ 511
K W T+ +SF+CC+GT +++ + IY+ ++G + +YI QY S
Sbjct: 372 KE-----WSTETDSFFCCHGTMVQANAAWNKGIYY-QDGEI--IYISQYFDSELRTSIDG 423
Query: 512 ---------DWKSGHVVLN------QKVDPIVSWD---PYLRMTLTFSSKQEVGQLSSLN 553
D SG ++ + Q ++ + + P R F +L
Sbjct: 424 TDIQIVQTQDKMSGSLLSSSNTAGYQAINDTAATNENMPAFR-KYDFIVSTAAPTTFTLR 482
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
R+P W + + + +F W D ++I LP+ +R + DD
Sbjct: 483 FRIPEWIMAEVSVYVNDRLQGTTRDSSSFYDIHRAWKEGDTVSIMLPIGIRFVPLPDDE- 541
Query: 614 EYASIQAILFGPYLLAG 630
A +GP +LAG
Sbjct: 542 ---RTGAFRYGPEVLAG 555
>gi|423287556|ref|ZP_17266407.1| hypothetical protein HMPREF1069_01450 [Bacteroides ovatus
CL02T12C04]
gi|392672671|gb|EIY66138.1| hypothetical protein HMPREF1069_01450 [Bacteroides ovatus
CL02T12C04]
Length = 800
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 172/566 (30%), Positives = 262/566 (46%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L LD D L+ F + A L +Y WEN + L G
Sbjct: 30 LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L+ Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHNLILDPLIKEEDKL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LY+ +I S WK ++L Q+ LR+ K+ +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRIDEAPKKKR------TL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWEKGDVITFHLPMKVSVEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565
>gi|395772531|ref|ZP_10453046.1| glycosylase [Streptomyces acidiscabies 84-104]
Length = 828
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 156/464 (33%), Positives = 232/464 (50%), Gaps = 32/464 (6%)
Query: 206 GYLSAFPTELFDSFEAL-----KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F E++ VWAPYYT HKIL GLLD Y +A+AL +A M +
Sbjct: 340 GFLAAYPETQFIQLESMTASDYSKVWAPYYTAHKILRGLLDAYAATGDARALDLAGGMAD 399
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ ++R+ K + +++R W + E GG+ + L LY +T +HL LA LFD +
Sbjct: 400 WMHSRLSK-LPGATLQRMWGLFSSGEFGGIVEALCDLYDLTGKGEHLALARLFDLDRLID 458
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L HAN HIPI G Y+ TG+ Y F D+V Y+ GGTS
Sbjct: 459 ACAANTDVLDGLHANQHIPIFTGYLRLYDATGEERYLAAARNFWDMVVPHRMYSIGGTSD 518
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW +A + + E+C YNMLK+SR LF ++ Y DYYERAL N VL +R
Sbjct: 519 AEFWRARDVVAGAISGASAESCCAYNMLKLSRALFLHAQDAKYMDYYERALFNQVLGSKR 578
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E ++ Y L L G + T GT CC GTG+ES +K D++YF
Sbjct: 579 DVADAEKPLVTYFLGLNPGHVR-DYTPKQGTT-----CCEGTGLESATKYQDTVYFVAA- 631
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
+ LY+ + S+ +W + V + Q D ++ +T+ G L + LR+
Sbjct: 632 DGSSLYVNLFSPSTLEWAAKGVRVVQ--DTAFPFEQGTTLTV------RGGGLFEMRLRV 683
Query: 557 PVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
PVW +G + +NGQ + P PG++ + W D + +++P +R E DD
Sbjct: 684 PVWAV-DGFRVFVNGQAVSGSPMPGSYFGVSREWRDGDVVRVEVPFRMRVERTPDD---- 738
Query: 616 ASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNA 659
+S+QA+ +GP L ++ + R+ SAL + SF A
Sbjct: 739 SSVQAVFYGPVNLVARSASTSYLSVALYRN-SALSGDLVSSFTA 781
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 51/90 (56%), Gaps = 5/90 (5%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENPISE----LRGHFVGHYLSAS 173
+Q L++ DV+ L+ FR A L T G A GGWE E LRGH+ GH+L+
Sbjct: 26 RQLMLDHARGYDVNRLLQVFRANAGLATLGAVAPGGWEGLDGEANGNLRGHYTGHFLTML 85
Query: 174 AQMWASTHNATIKEKMSTVVFSLSECQNKI 203
+Q +AST + EK+ T+V +L+E + +
Sbjct: 86 SQAYASTGDEVYAEKIRTIVGALTESREAL 115
>gi|336417295|ref|ZP_08597620.1| hypothetical protein HMPREF1017_04728 [Bacteroides ovatus
3_8_47FAA]
gi|335936275|gb|EGM98208.1| hypothetical protein HMPREF1017_04728 [Bacteroides ovatus
3_8_47FAA]
Length = 800
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 171/566 (30%), Positives = 265/566 (46%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L LD D L+ F + A L +Y WEN + L G
Sbjct: 30 LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L+ Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 147 GKIHAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LY+ +I S WK ++L Q+ D + + + + K++ +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILRQETR--FPDDDKVTLRIDEAPKKK----RTL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWKKGDVITFNLPMRVSMEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565
>gi|336405535|ref|ZP_08586212.1| hypothetical protein HMPREF0127_03525 [Bacteroides sp. 1_1_30]
gi|335937406|gb|EGM99306.1| hypothetical protein HMPREF0127_03525 [Bacteroides sp. 1_1_30]
Length = 800
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 172/566 (30%), Positives = 262/566 (46%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L LD D L+ F + A L +Y WEN + L G
Sbjct: 30 LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L+ Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLAHQMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LY+ +I S WK ++L Q+ LR+ K+ +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRIDEAPKKKR------TL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSKGYSISINGKRKMFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565
>gi|423299329|ref|ZP_17277354.1| hypothetical protein HMPREF1057_00495 [Bacteroides finegoldii
CL09T03C10]
gi|408473138|gb|EKJ91660.1| hypothetical protein HMPREF1057_00495 [Bacteroides finegoldii
CL09T03C10]
Length = 800
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 174/566 (30%), Positives = 272/566 (48%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L DV L S L +AQQT+L Y+L L+ D L+ F + A L +Y WEN + L G
Sbjct: 30 LQDVKLLDSPFL-QAQQTDLHYILALNPDRLLAPFLREAGLTPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + I +++ ++ L Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAIYNRLNYMLDELYRAQQAVGTGFIGGTPGSLQLWKEIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVL--ADNAQALKMA--TWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ +D A+ + +A WM++
Sbjct: 147 GNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYTGSDRARLMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S ++ L E GG+N+ + +IT D K+L LA F L L D L
Sbjct: 199 ITSGLSDQQIQDMLRSEHGGLNETFADVAAITGDKKYLELARRFSHKIILDPLIKDEDRL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V + S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAEEWDHAARFFWNTVVNNRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ + + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPADNFTSMINDVQGPETCNTYNMLRLTKMLYQNSHNPCNINEPDPNYINYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
++ LY+ +I S +WK V+L Q+ D + + + +SK++ +L
Sbjct: 433 HQKDT---LYVNLFIPSQLNWKEQGVILTQETR--FPDDNKVTLRIDKASKKQ----RTL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQNLPLP-PPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S+ S+NG+ P GN +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSSNYSISINGKKETFPTKKGNQYLPLSRKWKKGDVITFNLPMKVTIEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565
>gi|160883737|ref|ZP_02064740.1| hypothetical protein BACOVA_01709 [Bacteroides ovatus ATCC 8483]
gi|423297720|ref|ZP_17275780.1| hypothetical protein HMPREF1070_04445 [Bacteroides ovatus
CL03T12C18]
gi|156110822|gb|EDO12567.1| hypothetical protein BACOVA_01709 [Bacteroides ovatus ATCC 8483]
gi|392665078|gb|EIY58610.1| hypothetical protein HMPREF1070_04445 [Bacteroides ovatus
CL03T12C18]
Length = 800
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 172/566 (30%), Positives = 262/566 (46%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L LD D L+ F + A L +Y WEN + L G
Sbjct: 30 LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L+ Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHNLILDPLIKEEDKL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LY+ +I S WK ++L Q+ LR+ K+ +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRINEAPKKKR------TL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWEKGDVITFHLPMKVSVEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565
>gi|295133234|ref|YP_003583910.1| hypothetical protein ZPR_1378 [Zunongwangia profunda SM-A87]
gi|294981249|gb|ADF51714.1| putative secreted protein [Zunongwangia profunda SM-A87]
Length = 1016
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 178/607 (29%), Positives = 271/607 (44%), Gaps = 94/607 (15%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKT--ASLPTPGKAYGGWEN 156
L +VSL Q++ + + L + DS ++ FR P K G W+
Sbjct: 373 LDQVSLESNTNGQNTKFIENRDKFINTLAQTNPDSFLYMFRNAFGQEQPVGAKPLGVWDT 432
Query: 157 PISELRGHFVGHYLSASAQMWAST--------HNATIKEKMSTVVFSLSECQNKI----- 203
++LRGH GHYL+A AQ +AST + A E M ++ LS+ K
Sbjct: 433 QETKLRGHATGHYLTAIAQAYASTGYDKALQQNFADKMEYMVNTLYQLSQMSGKPAEEGG 492
Query: 204 ----------------------------------GTGYLSAFPTELFDSFE-------AL 222
G G++SA+P + F E
Sbjct: 493 DFNANPTAVPMGPGKEIYSSDLSEEGIRTDYWNWGEGFISAYPPDQFIMLENGAVYGTEE 552
Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
+WAPYYT+HKILAGL+D Y ++ N +AL +A M ++ Y R+ ++ T + +
Sbjct: 553 TKIWAPYYTLHKILAGLMDIYEVSGNEKALAVAEGMGDWVYARLSELPTDTLISMWNRYI 612
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
E GGMN+ + RLY IT +L A LFD F G LA D HAN
Sbjct: 613 AGEFGGMNEAMARLYRITGKDTYLETARLFDNIKVFFGDANHSHGLAKNVDTFRGLHANQ 672
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPKR 388
HIP ++G+ Y + P Y + F + Y+ GG + A F P
Sbjct: 673 HIPQIVGALEMYRDSDKPEYFNVADNFWVKATNDYMYSIGGVAGARNPANAECFIAQPGT 732
Query: 389 LAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
L + + G +N ETC TYNMLK++R+LF + + DYYER L N +L+ P
Sbjct: 733 LYENGLSAGGQN-ETCATYNMLKLTRNLFLYEQRPELMDYYERGLYNHILASVAEDSP-A 790
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
Y +PL G K+ F CC GT +ES +KL +SIYF+ N LY+
Sbjct: 791 NTYHVPLRPGSKKSFGN----PNMTGFTCCNGTALESSTKLQNSIYFKGADN-KALYVNL 845
Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
Y+ S+ W ++ L Q+ + + + ++T+ K + L LR+P W +NG
Sbjct: 846 YVPSTLHWHEKNIELTQETN--FPKEDHTKLTINGKGKFD------LKLRVPGWA-TNGF 896
Query: 566 QASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
+NG++ + PG +LS + +W D + +Q+P + I D + +I ++ +G
Sbjct: 897 TVKINGKDQKVKATPGTYLSLSRKWKDGDTVELQMPFGFYLDPIMDQQ----NIASLFYG 952
Query: 625 PYLLAGH 631
P LLA
Sbjct: 953 PVLLAAQ 959
>gi|239627978|ref|ZP_04671009.1| secreted protein [Clostridiales bacterium 1_7_47_FAA]
gi|239518124|gb|EEQ57990.1| secreted protein [Clostridiales bacterium 1_7_47FAA]
Length = 822
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 161/566 (28%), Positives = 268/566 (47%), Gaps = 52/566 (9%)
Query: 101 EVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENPIS 159
EV V L + + W AQ+ + +LL +D D ++++FR A L G GW+ P
Sbjct: 225 EVPAGSVRLSEGTRFWDAQERMIRWLLSVDDDQMLYNFRSAAGLDVRGAGPMTGWDAPEC 284
Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----GTGYLSAFPTE 214
L+GH GHYLS A + +K+K++ +V +L+ECQ + G+LSA+ +
Sbjct: 285 NLKGHTTGHYLSGLALACSVHGQPELKDKINYMVNALAECQKALEAKGCAKGFLSAYSEQ 344
Query: 215 LFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVIT 271
FD E +WAPYYT+ KI++GL D Y LA + +A + T + ++ Y R+ + ++
Sbjct: 345 QFDLLEVYTRYPEIWAPYYTLDKIMSGLYDCYCLAGSKEAFHLLTGLGDWIYGRLSR-LS 403
Query: 272 MYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
+++ W + E GGM V+ RLY T D ++ A F + D L
Sbjct: 404 RAQLDKMWSMYIAGEFGGMISVMVRLYRETGDGRYRRAALFFRNEKLFYPMEENVDTLKD 463
Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
HAN HIP IG+ Y+ G Y I F +V SH Y+ GG E + +P +A
Sbjct: 464 MHANQHIPQAIGALELYKAGGGKRYLAIARNFWQMVVRSHEYSIGGVGETEMFHEPGDIA 523
Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
+ ++ E+C +YN+++++ LF + + DYYE L N +LS G Y +
Sbjct: 524 HYMTDKSAESCASYNLMRLTFGLFGLSPDSRKMDYYENVLYNHILSSASHKADGGTTYFM 583
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
P+ G K + T N+ CC+GTG+ES + +IY E + +Y+ YI S
Sbjct: 584 PVRPGGRKE-----FNTSENT--CCHGTGLESRFRYIRNIYAAGE-DKKEVYVNLYIPSE 635
Query: 511 FDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT--------- 560
D + G + L + + +TF+ ++ G+ ++ LR+P W
Sbjct: 636 LDMEDGWKLKLEEDARTQGGY------RITFNGPKDGGE-RTVALRIPCWAGEDWDIRIH 688
Query: 561 --YSNGAQA---------SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+ GA+A + Q + G ++ +W +D++ I+LP R
Sbjct: 689 TVHPEGAEADGLAKTDAVTEASQGFTVDSDG-YVRIRRQWMPDDRMEIRLPFRFRKLPA- 746
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
P+ ++ ++ +GPY+LA GE
Sbjct: 747 ---PDGSAYSSVAYGPYILAALNDGE 769
>gi|237722208|ref|ZP_04552689.1| acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
gi|229448018|gb|EEO53809.1| acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
Length = 800
Score = 226 bits (577), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 172/566 (30%), Positives = 261/566 (46%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L LD D L+ F + A L +Y WEN + L G
Sbjct: 30 LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L+ Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYSRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKDEDKL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNNTQEPDPNYVNYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
++ LY+ +I S WK + L Q+ LR+ K+ +L
Sbjct: 433 HQKDT---LYVNLFIPSQLTWKEQGITLTQETRFPDDGKVTLRIDEAHKKKR------TL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKIFVMGKGNQYLPLSRKWKKGDVVTFNLPMKVTMEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565
>gi|359776490|ref|ZP_09279799.1| hypothetical protein ARGLB_045_00070 [Arthrobacter globiformis NBRC
12137]
gi|359306199|dbj|GAB13628.1| hypothetical protein ARGLB_045_00070 [Arthrobacter globiformis NBRC
12137]
Length = 1025
Score = 226 bits (577), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 151/433 (34%), Positives = 217/433 (50%), Gaps = 31/433 (7%)
Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F E+ VWAPYYT HKIL GLLD Y +AL +AT + +
Sbjct: 391 GFLAAYPETQFIELESRTTPDYFRVWAPYYTAHKILKGLLDAYTATAEPKALDLATGLCD 450
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ ++R+ K +T +R W + E GG+ + + Y + P+HL LA FD +
Sbjct: 451 WMHSRLSK-LTPAVRQRMWGIFSSGEYGGVVEAILETYGHSGKPEHLELAKYFDLDSLID 509
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L+ HAN HIPI G + Y TG+ Y F +V + ++ GGTS
Sbjct: 510 ACAQDKDILAGLHANQHIPIFTGLVLMYNATGEERYLAAARNFWTMVVPTRMFSIGGTSQ 569
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW + R+A TL + + E+C YNMLK+SR LF + AY DYYERAL N VL ++
Sbjct: 570 GEFWKERDRIAATLNATDAESCCAYNMLKLSRELFFREQNPAYMDYYERALFNQVLGSKQ 629
Query: 440 GTEPG---VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E + Y + L G + T GT CC GTG+ES +K DS+YF G
Sbjct: 630 DKESAELPLATYFIGLQPGAVR-DFTPKQGTT-----CCEGTGLESATKYQDSVYF-TAG 682
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
+ LY+ Y+ S+ W + +V + Q+ S+ R TL + GQ L LR+
Sbjct: 683 DGSALYVNLYMPSTLRWAAKNVTVTQQ----TSYPFEQRTTLQVAGS---GQF-ELRLRV 734
Query: 557 PVWTYSNGAQASLNGQ-NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
P W + G +NG PG +LS W D + +++P +LR E DD
Sbjct: 735 PAWA-TAGFTVRVNGAVTEAAATPGTYLSIARAWKNGDTVDVEMPFTLRAERALDD---- 789
Query: 616 ASIQAILFGPYLL 628
S+Q +++GP L
Sbjct: 790 PSVQTLMYGPVHL 802
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 50/116 (43%), Gaps = 15/116 (12%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL-------PTPGKAY 151
++ L DV L V R ++ L + D V FR A L P P
Sbjct: 49 VRPFKLSDVSLG-PGVFARKRELILNFARGYDERRYVNVFRANAGLRPLDGVVPLPA--- 104
Query: 152 GGWENPISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
GGWE E LRGHF GH++S AQ +A T K+ +V SL EC+ +
Sbjct: 105 GGWEGLDGEANGNLRGHFTGHHMSMLAQAYAGTGEEVFGTKLRNLVASLHECRQAL 160
>gi|332662487|ref|YP_004445275.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332331301|gb|AEE48402.1| protein of unknown function DUF1680 [Haliscomenobacter hydrossis
DSM 1100]
Length = 793
Score = 226 bits (577), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 169/557 (30%), Positives = 263/557 (47%), Gaps = 59/557 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L EVSL D A+ N++ LL D+D L+ +RK A LP +Y W+
Sbjct: 32 LAEVSLLD------GPFKHARDLNIQTLLQYDIDRLLNPYRKEAGLPEKAASYPNWDG-- 83
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-------GTGYLSAF 211
L GH GHYLSA A M A+T NA +++++ ++ L CQ G GYL
Sbjct: 84 --LDGHVGGHYLSAMA-MNAATGNAECRKRLAYMLSELKACQEAHALKHPAWGIGYLGGV 140
Query: 212 P--TELFDSFE-----ALKPVWAPYYTIHKILAGLLDQYVLADNAQA----LKMATWMVE 260
P E++ +F+ AL+ W P+Y +HK+ +GL D ++ + A L W +
Sbjct: 141 PKSAEIWSTFKNGDFKALRAAWVPWYNVHKLYSGLRDAWLYTGDETAKTLFLDFCDWGIA 200
Query: 261 YFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF 320
N + M S+ L+ E GGMN++ Y +T D K+L A F L
Sbjct: 201 ITANLSEA--QMQSM------LDIEHGGMNEIFADAYQMTGDEKYLKAAKGFSHQALLDP 252
Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR 380
+++ D L + HANT +P +G Q E++ + Y G FF + V + S A GG S R
Sbjct: 253 MSMGKDNLDNKHANTQVPKAVGFQRIAELSKEDKYAKAGRFFWETVTSKRSLALGGNSRR 312
Query: 381 EFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EF+ D + E E+C +YNMLK++ LFR Y DYYER L N +LS Q
Sbjct: 313 EFFPSIAAGRDFVHDVEGPESCNSYNMLKLTEELFRANPSGHYIDYYERTLYNHILSTQH 372
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
E G +Y P ++ R + WCC G+G+E+ K IY +++ +
Sbjct: 373 -PEHGGYVYFTP-----ARPRHYRVYSAPNQGMWCCVGSGMENHGKYNQLIYTQQKDS-- 424
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
L++ +I+S+ +W++ +VL Q+ + + ++T+T E +L +R P W
Sbjct: 425 -LFLNLFIASALNWRAKGIVLKQQTN--FPEEEQTKLTIT-----EGRARFTLMIRYPSW 476
Query: 560 TYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
+ Q +N + + P +++ W D + I LP+ E + + PEY
Sbjct: 477 VQAGALQIRVNNKRVTYTTSPSAYVAIKRLWKKGDVVQIVLPMRNTLEHLT-NAPEYV-- 533
Query: 619 QAILFGPYLLAGHTSGE 635
A+L GP LL T E
Sbjct: 534 -ALLHGPILLGAKTGTE 549
>gi|383112514|ref|ZP_09933306.1| hypothetical protein BSGG_0614 [Bacteroides sp. D2]
gi|313693079|gb|EFS29914.1| hypothetical protein BSGG_0614 [Bacteroides sp. D2]
Length = 800
Score = 226 bits (577), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 176/566 (31%), Positives = 262/566 (46%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L L+ D L+ F + A L +Y WEN + L G
Sbjct: 30 LQNVKLLDSPFL-QAQQTDLHYILALNPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKDIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A KM WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARKMLIDLTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKLILDPLIKDEDKL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKSWSHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNNTQEPDPNYVNYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LYI +I S WK V L Q+ LR+ K+ +L
Sbjct: 433 HQRDT---LYINLFIPSQLTWKEQGVTLTQETRFPDDGKVTLRIDEAPKKKR------TL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQ-NLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + + GN +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSKGYSISINGKRKIFIMAKGNQYLPLSRKWKKGDVITFNLPMRVSMEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565
>gi|374992692|ref|YP_004968187.1| hypothetical protein SBI_09938 [Streptomyces bingchenggensis BCW-1]
gi|297163344|gb|ADI13056.1| hypothetical protein SBI_09938 [Streptomyces bingchenggensis BCW-1]
Length = 769
Score = 226 bits (577), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 170/541 (31%), Positives = 253/541 (46%), Gaps = 55/541 (10%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ T L+YLL LD D L+ R+ A LP ++YG WE+ S L GH VGH LS +A M
Sbjct: 19 AQATALDYLLSLDTDRLLAPLRREAGLPPVAESYGNWES--SGLDGHTVGHALSGAALMS 76
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF------------DSFEALKPV 225
A T + + + +V + ECQ+ +GTGY+ P + DSFE L
Sbjct: 77 AVTDDPRPRAMVDRLVQGVVECQDALGTGYVGGVPDGVRLWQRVAAGQVERDSFE-LGGA 135
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
W P+Y +HK+ AGLLD Y + AL + +++ +V + H L E
Sbjct: 136 WVPWYNLHKLFAGLLDAYRHTGSEPALTAVRRLADWW----GRVAAGMDDDTHEAMLRTE 191
Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
GGM +VL L +T ++ LA F L L D L HANT I V+G Q
Sbjct: 192 FGGMCEVLADLAEVTGTDRYAALARRFLDQSLLRPLCEHRDVLDGMHANTQIAKVVGYQR 251
Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
EV DP + FF + + + GG S RE + L S E ETC TY
Sbjct: 252 LGEVVDDPGLRDAARFFWQAMTRHRTVSFGGNSVREHLHPRDDFSSALQSPEGPETCNTY 311
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLGRGVSKARSTH 463
NMLK+SR LF + D+YERA N +LS +P G ++Y P+ G + S
Sbjct: 312 NMLKLSRALFLERPDTEVLDHYERATVNHILS---SLQPKGGLVYFTPVRPGHYRVVS-- 366
Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
T N FWCC GTG+E+ +K G+ +Y E + L++ +I+S ++VL Q
Sbjct: 367 ---TPQNCFWCCVGTGLENHAKYGELVYTTEGDD---LFVNLFIASRLSRPEQNLVLEQT 420
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG---QNLPLP--- 577
+D +R+ + + + +++R+P W + Q +NG ++ P P
Sbjct: 421 G--TAPYDEEVRLVVRGAPATPL----PIHIRVPGW-HEGTPQIRINGAPPEDGPGPLTT 473
Query: 578 ------PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGH 631
P ++ +W D +T++L + E + D P + S + FGP +LA
Sbjct: 474 RRAAGGQPLTYVRLERQWCEGDTVTMRLRPRISAELLPDGSP-WVSYR---FGPSVLAAE 529
Query: 632 T 632
+
Sbjct: 530 S 530
>gi|386820708|ref|ZP_10107924.1| putative glycosyl hydrolase of unknown function (DUF1680)
[Joostella marina DSM 19592]
gi|386425814|gb|EIJ39644.1| putative glycosyl hydrolase of unknown function (DUF1680)
[Joostella marina DSM 19592]
Length = 1018
Score = 226 bits (576), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 186/633 (29%), Positives = 279/633 (44%), Gaps = 97/633 (15%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--YGGWEN 156
L EVSL S + + L + D+ ++ FR T P P A G W++
Sbjct: 375 LDEVSLDVDTHGHESKFIENRDKFISTLAQTNPDAFLYMFRNTFGQPQPDAAEPLGVWDS 434
Query: 157 PISELRGHFVGHYLSASAQMWAST--------HNATIKEKMSTVVFSLSECQNKI----- 203
++LRGH GHYL+A AQ +AST + A E M ++ L++
Sbjct: 435 QETKLRGHATGHYLTAIAQAYASTGYDKSLQNNFADKMEYMVNTLYKLAQMSGNPKTKDG 494
Query: 204 ----------------------------------GTGYLSAFPTELFDSFE-------AL 222
G G++SA+P + F E
Sbjct: 495 SYVANPTEVPPGPGKSNYDSDLSEDGIRTDYWNWGEGFISAYPPDQFIMLENGATYGGQQ 554
Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
VWAPYYT+HKILAGLLD Y ++ N +AL++A M + Y R+ ++ T + +
Sbjct: 555 TQVWAPYYTLHKILAGLLDIYEVSGNKKALEVAEGMGSWVYARLNELPTETLISMWNRYI 614
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
E GGMN+V+ RLY +T + K+L +A LFD F G LA D HAN
Sbjct: 615 AGEFGGMNEVMARLYRLTDEEKYLQVAQLFDNIKVFYGDANHSNGLAKNVDTFRGLHANQ 674
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPKR 388
HIP ++G+ Y + Y I F + Y+ GG + A F P
Sbjct: 675 HIPQIVGAIEMYRDSNTAEYYRIADNFWFKSKNDYMYSIGGVAGARNPANAECFISQPAT 734
Query: 389 LAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
+ + + G +N ETC TYNMLK++R+LF + + Y DYYER L N +L+ P
Sbjct: 735 IYENGLSAGGQN-ETCATYNMLKLTRNLFLFDQRAEYMDYYERGLYNHILASVAEKTPA- 792
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
Y +PL G K H F CC GT IES +KL +SIYF+ N LY+
Sbjct: 793 NTYHVPLRPGSVK----HFGNPDMKGFTCCNGTAIESSTKLQNSIYFKSVEN-DALYVNL 847
Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
Y+ S+ W + + QK + + ++T+ + K + L +R+P W + G
Sbjct: 848 YVPSTLHWAEKKLTITQKT--AFPKEDFTQLTINGNGKFD------LKVRVPNWA-TKGF 898
Query: 566 QASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
+NG+ + PG++L+ W D + +++P E+I D + +I ++ +G
Sbjct: 899 IVKINGKEEKVEAIPGSYLTLNRTWKDGDTVELKMPFQFHLESIMDQQ----NIASLFYG 954
Query: 625 PYLLAGHTS---GEWDIKTGTARSLSALISPIP 654
P LL S EW T + IS P
Sbjct: 955 PILLVAQESEPRTEWRKVTFDKNEIGKDISGDP 987
>gi|302561993|ref|ZP_07314335.1| secreted protein [Streptomyces griseoflavus Tu4000]
gi|302479611|gb|EFL42704.1| secreted protein [Streptomyces griseoflavus Tu4000]
Length = 950
Score = 226 bits (576), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 153/466 (32%), Positives = 229/466 (49%), Gaps = 38/466 (8%)
Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F + E++ VWAPYYT HKIL GLLD Y+ D+ +AL +A+ M +
Sbjct: 399 GFLAAYPETQFIALESMTGSDYTRVWAPYYTAHKILRGLLDAYLATDDERALDLASGMCD 458
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ + R+ V+ +++R W + E GG+ + + L+++T P+HL LA LFD +
Sbjct: 459 WMHARLS-VLPAATLQRMWGLFSSGEFGGIVEAVCDLHALTGRPEHLALARLFDLDRLID 517
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L HAN HIP+ G ++ TG+ Y F +V +YA GGTS+
Sbjct: 518 ACAADTDVLEGLHANQHIPVFTGLVRLHDETGEQRYLTAAKNFWGMVVPHRTYAIGGTSS 577
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW +A T+G E+C YNMLK+SR LF ++ AY DYYER L N VL ++
Sbjct: 578 GEFWKARGVIAGTIGDTTAESCCAYNMLKLSRALFFHEQDPAYMDYYERTLYNQVLGSKQ 637
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E ++ Y + L G + T CC GTG+ES +K DS+YF +
Sbjct: 638 DRPDAEKPLVTYFVGLTPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYFAKA- 690
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLR 555
+ LY+ Y S W V + Q + TLT G+ S +L LR
Sbjct: 691 DGSALYVNLYSDSRLAWAEKGVTVTQS----TRYPEEQGSTLTIGG----GRASFTLLLR 742
Query: 556 MPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
+P W + G + ++NG+ +P P PG + + W D + I +P LR E DD
Sbjct: 743 VPSWA-TAGFRVTVNGRAVPGAPVPGRYFGVSRSWRDGDTVRISVPFRLRVEKAPDD--- 798
Query: 615 YASIQAILFGPYLLAGHTSGEWDIK------TGTARSLSALISPIP 654
+QA+ GP L G ++ G + L ++P+P
Sbjct: 799 -PGLQALFLGPVCLVARRPGPEPVRFGLYGNAGLSGDLLPSLTPVP 843
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 54/110 (49%), Gaps = 6/110 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
++ L DV L V ++ L++ DV+ L+ FR A L T G A GGWE
Sbjct: 60 VRPFGLEDVTLG-PGVFAAKRRLMLDHARGYDVNRLLQVFRANAGLSTRGAVAPGGWEGL 118
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
E LRGH+ GH+L+ AQ ST +++ TVV +L E + +
Sbjct: 119 DGEANGNLRGHYTGHFLTMLAQAHRSTGEQVFADRIDTVVGALVEVREAL 168
>gi|423213125|ref|ZP_17199654.1| hypothetical protein HMPREF1074_01186 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694381|gb|EIY87609.1| hypothetical protein HMPREF1074_01186 [Bacteroides xylanisolvens
CL03T12C04]
Length = 800
Score = 226 bits (575), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 173/566 (30%), Positives = 267/566 (47%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L LD D L+ F + A L +Y WEN + L G
Sbjct: 30 LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L+ Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LY+ +I S WK + L Q+ D + + + + K++ +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGITLTQET--CFPDDGKVTLRIDEAPKKK----HTL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQ-NLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + + GN +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFIMAKGNQYLPLSRKWKKGDVVTFHLPMKVSVEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565
>gi|443629445|ref|ZP_21113773.1| putative Secreted protein [Streptomyces viridochromogenes Tue57]
gi|443337063|gb|ELS51377.1| putative Secreted protein [Streptomyces viridochromogenes Tue57]
Length = 941
Score = 226 bits (575), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 157/463 (33%), Positives = 228/463 (49%), Gaps = 31/463 (6%)
Query: 206 GYLSAFPTELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F E+ VWAPYYT HKIL G+LD Y+ D+A+AL +A+ M +
Sbjct: 390 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGVLDAYLATDDARALDLASGMCD 449
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ Y+R+ K + +++R W + E GG+ + + L++IT +HL LA LFD +
Sbjct: 450 WMYSRLSK-LPEATLQRMWGLFSSGEFGGIVEAVCDLHTITGKAEHLALAQLFDLDRLID 508
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L HAN HIPI G Y+ TG+ Y F +V Y GGTS
Sbjct: 509 NCAANTDILDGLHANQHIPIFTGYLRLYDATGEQRYLDAARNFWGMVVPHRMYGIGGTST 568
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW +A T+ + N ETC YNMLK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 569 GEFWKARDVIAGTISATNAETCCAYNMLKLSRTLFFHEQQPKYMDYYERALFNQVLGSKQ 628
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E ++ Y + L G + T GT CC GTG+ES +K DS+YF+
Sbjct: 629 DKADAEKPLVTYFIGLTPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYFKAA- 681
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
+ LY+ Y S W V + Q ++ TLT +L LR+
Sbjct: 682 DGSALYVNLYSPSRLAWAEKGVTVTQ----TTAFPREQGTTLTIGGGSAA---FALRLRV 734
Query: 557 PVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
P W + G + ++NG + P PG++ + + W D + I +P LR E DD
Sbjct: 735 PSWA-TAGFRVTVNGSAVSGTPKPGSYFTVSRTWRSGDTVRISMPFRLRVEKAIDD---- 789
Query: 616 ASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFN 658
S+Q + +GP L G S ++ G R+ + L + PS
Sbjct: 790 PSLQTLFYGPVNLVGRNSATSYLQLGLYRN-AGLSGDLLPSLT 831
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 58/110 (52%), Gaps = 6/110 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
++ +L DV L + + +Q L++ DV+ L+ FR A L T G A GGWE
Sbjct: 51 VQPFALDDVAL-RPGLFADKRQLMLDHARGYDVNRLLQVFRANAGLSTGGAVAPGGWEGL 109
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
E LRGH+ GH+L+ +Q +A T +++ T+V +L+E + +
Sbjct: 110 DGEANGNLRGHYTGHFLTMLSQAYAGTGEQVFVDRIRTMVGALTEVREAL 159
>gi|344201935|ref|YP_004787078.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
gi|343953857|gb|AEM69656.1| protein of unknown function DUF1680 [Muricauda ruestringensis DSM
13258]
Length = 1022
Score = 226 bits (575), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 182/629 (28%), Positives = 285/629 (45%), Gaps = 97/629 (15%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKT--ASLPTPGKAYGGWEN 156
L +VSL+ Q + + + L+ + DS ++ FR P K G W++
Sbjct: 379 LDQVSLNADAHGQQTKFIENRDKFINTLVQTNPDSFLYMFRNAFGQEQPEGAKPLGVWDS 438
Query: 157 PISELRGHFVGHYLSASAQMWASTH-----NATIKEKMS---TVVFSLSE---------- 198
++LRGH GHYL+A AQ +AST A +KM+ V++ LS+
Sbjct: 439 QETKLRGHATGHYLTAIAQAYASTGYDKALQANFADKMNYMVDVLYQLSQMSGQSAKAGG 498
Query: 199 ----------------------CQNKI-------GTGYLSAFPTELFDSFE-----ALKP 224
+N I G G++SA+P + F E +P
Sbjct: 499 EHVADPTAVPPGPGKSTYDSDLSENGIRTDYWNWGEGFISAYPPDQFIMLENGATYGTQP 558
Query: 225 --VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
VWAPYYT+HKILAGL+D Y ++ N +AL++A M ++ Y R+ ++ T + +
Sbjct: 559 TQVWAPYYTLHKILAGLMDIYEVSGNEKALEIAKGMGDWVYARLSQLPTDTLISMWNTYI 618
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
E GGMN+ + RL IT +P++L +A LFD F G LA D HAN
Sbjct: 619 AGEFGGMNEAMARLDRITDEPRYLKVAQLFDNIKMFFGDAEHSHGLARNVDSFRGLHANQ 678
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG-------TSAREFWWDPKR 388
HIP ++G+ Y + P Y + F + Y+ GG T+A F P
Sbjct: 679 HIPQIVGALEIYRDSESPEYYQVADNFWYKAKNDYMYSIGGVAGARNPTNAECFIAQPAT 738
Query: 389 LAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
L + + G +N ETC TYNMLK++++LF + + DYYER L N +L+ P
Sbjct: 739 LYENGFSSGGQN-ETCATYNMLKLTKNLFLFDQRTELMDYYERGLYNHILASVAEDSP-A 796
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
Y +PL G K + F CC GT +ES +KL +SIYF+ + N LY+
Sbjct: 797 NTYHVPLRPGSVKRFGN----SDMTGFTCCNGTALESSTKLQNSIYFKSQDNST-LYVNL 851
Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
++ S+ W + + QK ++ LT K + LN+R+P W + G
Sbjct: 852 FVPSTLKWAEKDITVEQK----TAFPKEDNTQLTIKGKGKF----DLNIRVPQWA-TKGF 902
Query: 566 QASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
+NG+ + PG +L+ + +W D + +++P + + D + +I ++ +G
Sbjct: 903 FVKINGKEEKVEAKPGTYLTLSRKWKDGDVIDLKMPFQFHLDPVMDQQ----NIASLFYG 958
Query: 625 PYLLAGHT---SGEWDIKTGTARSLSALI 650
P LL EW T A + I
Sbjct: 959 PVLLVAQEPEPRNEWRKITLDAEDIGKTI 987
>gi|86140890|ref|ZP_01059449.1| putative secreted protein [Leeuwenhoekiella blandensis MED217]
gi|85832832|gb|EAQ51281.1| putative secreted protein [Leeuwenhoekiella blandensis MED217]
Length = 1004
Score = 225 bits (574), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 181/607 (29%), Positives = 276/607 (45%), Gaps = 94/607 (15%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--YGGWEN 156
L EV+L++ L S + ++ L + DS ++ FR P A G W+
Sbjct: 360 LNEVNLNNTSLGDHSKFIENRNKFIDTLAQTNPDSFLYMFRNAFGQEQPEGATPLGVWDT 419
Query: 157 PISELRGHFVGHYLSASAQMWASTH-----NATIKEKMSTVV---FSLSECQNKI----- 203
++LRGH GHYL+A AQ +AST ++KM+ +V + LS+ K
Sbjct: 420 QETKLRGHATGHYLTAIAQAYASTGYDKALQKNFEDKMNYMVNTLYDLSQLSGKPKTEGG 479
Query: 204 ----------------------------------GTGYLSAFPTELFDSFE-------AL 222
G G++SA+P + F E
Sbjct: 480 AYVEDPSSVPPGPGSTAYTSDLSEDGIRTDYWNWGKGFISAYPPDQFIMLEHGAKYGGQE 539
Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
VWAPYYT+HKILAGL+D Y ++ N +AL++A M + + R+ K+ T + +
Sbjct: 540 TQVWAPYYTLHKILAGLIDVYEVSGNPKALQVAEGMAAWVHTRLSKLPTETLITMWNTYI 599
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-PCFLG------FLALQADYLSHFHANT 335
E GG+N+ L L+ IT ++L A LFD F G LA D HAN
Sbjct: 600 AGELGGINESLAHLHRITGKSEYLETAKLFDNIKVFYGDAEHTHGLAKNVDTYRGLHANQ 659
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS-------AREFWWDPKR 388
HIP ++G+ Y + P Y I F + Y+ GG + A F P
Sbjct: 660 HIPQIMGALELYRNSNSPEYYHIADNFWYKTKNDYMYSIGGVAGARNPANAECFVAQPAT 719
Query: 389 LAD---TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
L + + G +N ETC TYNMLK++R LF + ++ DYYE+AL N +L+ P
Sbjct: 720 LYENGLSAGGQN-ETCGTYNMLKLTRGLFFYNQQPELMDYYEQALYNQILASVAENSPA- 777
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
Y +PL G K S + F CC GT IES +KL +SIYF+ N LY+
Sbjct: 778 NTYHIPLRPGSRKQFSN----ADMSGFTCCNGTAIESSTKLQNSIYFKSVDN-KALYVNL 832
Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGA 565
++ S+ WK VV+ Q+ + + ++T+ K E LNLR+P W + G
Sbjct: 833 FVPSTLTWKEQDVVITQETS--FPREDHTKLTVNGKGKFE------LNLRIPGWATA-GV 883
Query: 566 QASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
+ +NG+ + G++LS +W D + +++P + + I D +I ++ +G
Sbjct: 884 ELKINGKTQKIAIEAGSYLSLDRKWKNGDTIELKMPFTFHLDPIMDQE----NIASLFYG 939
Query: 625 PYLLAGH 631
P LLA
Sbjct: 940 PVLLAAQ 946
>gi|298484121|ref|ZP_07002288.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
gi|298269711|gb|EFI11305.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
Length = 776
Score = 225 bits (574), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 173/566 (30%), Positives = 267/566 (47%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L LD D L+ F + A L +Y WEN + L G
Sbjct: 6 LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 62
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L+ Q +GTG++ P +L+ +A
Sbjct: 63 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 122
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 123 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 174
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L + D L
Sbjct: 175 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 234
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V S GG S RE
Sbjct: 235 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 294
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYNML++++ L++ + + Y +YYERAL N
Sbjct: 295 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 354
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 355 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 408
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LY+ +I S WK + L Q+ D + + + + K++ +L
Sbjct: 409 YRKDT---LYVNLFIPSQLTWKEQGITLTQET--CFPDDGKVTLRIDEAPKKK----RTL 459
Query: 553 NLRMPVW-TYSNGAQASLNGQ-NLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + + GN +L + +W D +T LP+ + E I
Sbjct: 460 MIRIPEWANQSKGYSVSINGKRKMFIMAKGNQYLPLSRKWKKGDVVTFHLPMKVSVEQIP 519
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 520 DKKDYY----AFLYGPIVLAASTGTE 541
>gi|126348374|emb|CAJ90096.1| conserved hypothetical protein [Streptomyces ambofaciens ATCC
23877]
Length = 942
Score = 225 bits (574), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 152/438 (34%), Positives = 221/438 (50%), Gaps = 40/438 (9%)
Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F E++ VWAPYYT HKIL GLLD ++ + +AL +A+ + +
Sbjct: 391 GFLAAYPETQFVELESMTGSDYTRVWAPYYTAHKILRGLLDAHLATGDGRALDLASGLCD 450
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ Y+R+ K + +++R W + E GG+ + + L+++T + HL LA LFD +
Sbjct: 451 WMYSRLSK-LPAATLQRMWGLFSSGEFGGIVEAICDLHAVTGEAHHLALARLFDLDRLID 509
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L HAN HIPI G ++ TG+ Y F +V YA GGTS
Sbjct: 510 ACAADDDVLDGLHANQHIPIFTGLVRLHDATGEERYLTAAKNFWGMVVPHRMYAIGGTST 569
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW +A TLG+ E+C YNMLK+SR LF ++ AY DYYERAL N VL ++
Sbjct: 570 GEFWQARDVIAGTLGATTAESCCAYNMLKLSRTLFFHEQDPAYMDYYERALYNQVLGSKQ 629
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF-EEE 495
E ++ Y + L G + T CC GTG+ES +K DS+YF +
Sbjct: 630 DAADAEKPLVTYFVGLTPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYFAAAD 683
Query: 496 GNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR---MTLTFSSKQEVGQLS-S 551
GN LY+ Y S+ W V + Q D Y R TLT G S +
Sbjct: 684 GNA--LYVNLYSRSTLTWAERGVTVTQDTD-------YPREQGSTLTLGG----GSASFA 730
Query: 552 LNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
L LR+P W + G + ++NG +P PG++ + + W D + +++P LR E D
Sbjct: 731 LRLRVPAWA-TAGFRVTVNGHAVPGTATPGSYFTVSRTWRRGDTVRVRVPFRLRVEKALD 789
Query: 611 DRPEYASIQAILFGPYLL 628
D S+QA+ GP L
Sbjct: 790 D----PSLQALFLGPVHL 803
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 56/110 (50%), Gaps = 6/110 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
++ L DV L + V ++ L++ DVD L+ FR A L T G A GGWE
Sbjct: 52 VRPFGLEDVTLGRG-VFADKRRLMLDHARGYDVDRLLQVFRANAGLSTLGAVAPGGWEGL 110
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
E LRGH+ GH+L+ AQ T E+++++V +L+E + +
Sbjct: 111 DGEANGNLRGHYTGHFLTMLAQAHRGTGEEVFAERITSMVTALTEVRESL 160
>gi|262407626|ref|ZP_06084174.1| acetyl-CoA carboxylase [Bacteroides sp. 2_1_22]
gi|294644495|ref|ZP_06722254.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294808396|ref|ZP_06767149.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|345511903|ref|ZP_08791442.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
gi|262354434|gb|EEZ03526.1| acetyl-CoA carboxylase [Bacteroides sp. 2_1_22]
gi|292640162|gb|EFF58421.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294444324|gb|EFG13038.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|345453983|gb|EEO49450.2| acetyl-CoA carboxylase [Bacteroides sp. D1]
Length = 800
Score = 225 bits (573), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 172/566 (30%), Positives = 267/566 (47%), Gaps = 64/566 (11%)
Query: 104 LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRG 163
L +V L S L +AQQT+L Y+L LD D L+ F + A L +Y WEN + L G
Sbjct: 30 LQNVKLLDSPFL-QAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 164 HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA 221
H GHYLSA + M+A+T + + +++ ++ L+ Q +GTG++ P +L+ +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQK 268
L W P Y IHK AGL D Y+ A + A +M WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 269 VITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
+ + S E+ L E GG+N+ + IT D K+L LA F L L + D L
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPL-------YKLIGTFFMDIVNASHSYATGGTSARE 381
+ HANT IP VIG + E++ D + FF + V S GG S RE
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 382 FWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTK--------EIAYADYYERALTN 432
+ L + ETC TYN+L++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNILRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+L+ Q + G +Y P+ G + + S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ LY+ +I S WK + L Q+ D + + + + K++ +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGITLTQET--CFPDDGKVTLRIDEAPKKK----RTL 483
Query: 553 NLRMPVW-TYSNGAQASLNGQ-NLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+R+P W S G S+NG+ + + GN +L + +W D +T LP+ + E I
Sbjct: 484 MIRIPEWANQSKGYSVSINGKRKMFIMAKGNQYLPLSRKWKKGDVVTFHLPMKVSVEQIP 543
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGE 635
D + Y A L+GP +LA T E
Sbjct: 544 DKKDYY----AFLYGPIVLAASTGTE 565
>gi|374712027|gb|AEZ64557.1| putative secreted protein [Streptomyces chromofuscus]
Length = 933
Score = 225 bits (573), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 152/468 (32%), Positives = 236/468 (50%), Gaps = 41/468 (8%)
Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F + E++ VWAPYYT HKIL GLLD ++ D+ +AL +A+ + +
Sbjct: 382 GFLAAYPETQFITLESMTSSDYGVVWAPYYTAHKILRGLLDAHLYTDDPRALDLASGLCD 441
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ Y+R+ + + +++R W + E GG+ + + L+++T P+HL LA LFD +
Sbjct: 442 WMYSRLSR-LPASTLQRMWGIFSSGEFGGLVEAVCDLHALTGKPEHLALARLFDLDSLID 500
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L HAN HIPI G ++ TG+ Y F D+V + Y GGTS
Sbjct: 501 ACAANRDVLDGLHANQHIPIFTGLLRLHDATGEARYLAAAKNFWDMVVPTRMYGIGGTST 560
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW +A T+ + E+C YNMLK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 561 GEFWRGRGSVAGTISATTAESCCAYNMLKLSRLLFFHEQDPKYMDYYERALYNQVLGSKQ 620
Query: 440 GT---EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
T E ++ Y + L G + + T CC GTG+ES +K DS+YF +
Sbjct: 621 DTADAEKPLVTYFIGLTPGHVRDYTPKAGTT------CCEGTGMESATKYQDSVYFRKAD 674
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR---MTLTFSSKQEVGQLSSLN 553
+ LY+ Y +S+ W + + Q D Y R TLT + L
Sbjct: 675 DSV-LYVNLYSASTLTWAERGITVTQTTD-------YPREQGSTLTIGGGSAAFE---LR 723
Query: 554 LRMPVWTYSNGAQASLNG---QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
LR+P W G Q ++NG Q PL PG++ + + W D + +++P LR E D
Sbjct: 724 LRVPSWA-DAGFQVTVNGTAVQGKPL--PGSYFAVSRTWRGGDIVRVRVPFRLRVEPTPD 780
Query: 611 DRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFN 658
D ++Q++ GP L ++ ++ G R+ +AL + P+
Sbjct: 781 D----PALQSLFHGPVNLVARSASTSPLRFGLYRN-AALSGDLLPTLT 823
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 59/112 (52%), Gaps = 6/112 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
L+ L DV L + ++ L++ DVD L+ FR A L T G A GGWE
Sbjct: 44 LRPFDLKDVTLG-PGIFATKRRFMLDHGRGYDVDRLLQVFRANAGLSTRGAVAPGGWEGL 102
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT 205
E LRGH+ GH+L+ AQ + ST + +++ ++V +L+E ++ + T
Sbjct: 103 DGEANGNLRGHYTGHFLTMLAQSYGSTGDQVYADRIRSMVDALTEVRSALRT 154
>gi|256423606|ref|YP_003124259.1| hypothetical protein Cpin_4617 [Chitinophaga pinensis DSM 2588]
gi|256038514|gb|ACU62058.1| protein of unknown function DUF1680 [Chitinophaga pinensis DSM
2588]
Length = 1025
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 187/611 (30%), Positives = 282/611 (46%), Gaps = 101/611 (16%)
Query: 123 LEYLLMLDVDSLVWSFRKT--ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAST 180
+ L D +S ++ FR P K W++ ++LRGH GHYL+A AQ +AST
Sbjct: 406 IRTLATTDPNSFLYMFRHAFGRQQPEGAKPLDVWDSQDTKLRGHATGHYLTAIAQAYAST 465
Query: 181 -HNATIKE----KMSTVVFSLSE------------------------------------- 198
++ T+++ KM+ +V +L E
Sbjct: 466 GYDKTLQQNFEQKMAYMVNTLYELSLLSGNPKETGGVAVSDPTAVPYGPGKSGYDSDLSN 525
Query: 199 --CQNKI---GTGYLSAFPTELFDSFEA-------LKPVWAPYYTIHKILAGLLDQYVLA 246
+N G G++SA+P + F E +WAPYYT+HKILAGL+D Y ++
Sbjct: 526 EGIRNDYWNWGKGFISAYPPDQFIMLEKGAKYGGQKNQIWAPYYTLHKILAGLMDVYEVS 585
Query: 247 DNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKH 305
N +AL +AT M ++ Y R+ V ++ + W + + E GGMN+ + RLY IT ++
Sbjct: 586 GNQKALTVATGMGDWVYARLSHV-PQDTLIKMWNTYIAGEFGGMNEAMARLYLITGKQQY 644
Query: 306 LLLAHLFDK-PCFLG------FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP-LYKL 357
L A LFD F G LA D HAN HIP ++GS Y + +P YK+
Sbjct: 645 LQTAQLFDNIRVFFGDTAHSHGLAKNVDIFRGLHANQHIPQIVGSIEMYRASNNPEYYKI 704
Query: 358 IGTFFMDIVNASHSYATGGTS-------AREFWWDPKRLAD---TLGSENEETCTTYNML 407
F+ VN + Y+ GG + A F P L + + G +N ETC TYNML
Sbjct: 705 ADNFWYKAVN-DYMYSIGGVAGARNPANAECFISQPATLYENGFSSGGQN-ETCATYNML 762
Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
K++ LF + + + DYYERAL N +L+ P Y +PL G K
Sbjct: 763 KLTSDLFLFDQRAEFMDYYERALYNHILASVAKDNP-ANTYHVPLRPGAIKQFGN----P 817
Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI 527
F CC GT IES +KL ++IYF+ N LY+ YI S+ W +V + Q D
Sbjct: 818 DMTGFTCCNGTAIESNTKLQNTIYFKSRDN-QALYVNLYIPSTLQWTERNVTIEQTTDFP 876
Query: 528 VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNFLSAT 586
D R+T+ + GQ +N+R+P W + G +NG+ L PG +L+
Sbjct: 877 KEDD--TRLTIKGN-----GQF-DINVRVPGWA-TKGFFVKINGKEQALTAKPGTYLTIR 927
Query: 587 ERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA---GHTSGEWDIKTGTA 643
+W D + +++P + + D + +I ++ +GP LLA G +W T A
Sbjct: 928 RQWKDGDIIDLKMPFRFHLDPVMDQQ----NIASLFYGPILLAAQEGEARKDWRKITLNA 983
Query: 644 RSLSALISPIP 654
+S I P
Sbjct: 984 DDISKSIKGDP 994
>gi|383641062|ref|ZP_09953468.1| glycosylase [Streptomyces chartreusis NRRL 12338]
Length = 900
Score = 224 bits (571), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 158/466 (33%), Positives = 228/466 (48%), Gaps = 38/466 (8%)
Query: 206 GYLSAFPTELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F E+ VWAPYYT HKIL GLLD Y D+ +AL +A+ M +
Sbjct: 349 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGLLDAYGATDDDRALDLASGMCD 408
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ ++R+ K + +++R W + E GG+ + + L++IT +HL LA LFD +
Sbjct: 409 WMHSRLSK-LPESTLQRMWGIFSSGEFGGIVEAVCDLHTITGKAEHLALAQLFDLDRLID 467
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L HAN HIPI G Y+ TG+ Y F D+V Y GGTS
Sbjct: 468 ACAANTDILDGLHANQHIPIFTGYVRLYDETGEERYLTSAKNFWDMVVPHRMYGIGGTST 527
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
+EFW +A T+ + ETC YNMLK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 528 QEFWKARDVIAGTISATTAETCCAYNMLKLSRTLFFHEQDPKYMDYYERALYNQVLGSKQ 587
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E ++ Y + L G + T GT CC GTG+ES +K DS+YF +
Sbjct: 588 DKPDAEKPLVTYFIGLTPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYF-AKA 640
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLR 555
+ LY+ Y S+ W V + Q + TL F G+ S +L LR
Sbjct: 641 DGSALYVNLYSPSTLTWAEKGVTVTQ----TTGFPEEQGSTLAFGG----GRASFTLRLR 692
Query: 556 MPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
+P W + G + ++NG+ + P PGN+ + W D + I +P R E DD
Sbjct: 693 VPSWA-TAGFRVTVNGRAVSGTPKPGNYFEVSRTWRAGDTVRIAMPFRTRVEKALDD--- 748
Query: 615 YASIQAILFGPYLLAGHTSGEWDIKTGTAR------SLSALISPIP 654
S+Q + GP L + +K G R LS ++P+P
Sbjct: 749 -PSLQTLFHGPVNLVARDAATEYLKVGLYRDAGLSGDLSHSLTPVP 793
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 58/110 (52%), Gaps = 6/110 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
++ +L DV L + + ++ L++ DV+ L+ FR A LPT G A GGWE
Sbjct: 10 VQPFALEDVAL-RPGLFAEKRRLMLDHARGYDVNRLLQVFRANAGLPTGGAVAPGGWEGL 68
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
E LRGH+ GH+L+ AQ + T +++ T+V +L+E + +
Sbjct: 69 DGEANGNLRGHYTGHFLTMLAQAYRGTKERVFADRIGTMVGALTEVRAAL 118
>gi|294675240|ref|YP_003575856.1| hypothetical protein PRU_2607 [Prevotella ruminicola 23]
gi|294471633|gb|ADE81022.1| conserved hypothetical protein [Prevotella ruminicola 23]
Length = 788
Score = 224 bits (571), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 184/629 (29%), Positives = 296/629 (47%), Gaps = 63/629 (10%)
Query: 98 FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
+ E L DV L + L A+ N+E LL D D L+ + K A L GK+Y W+
Sbjct: 17 YANEFPLGDVTL-LNGPLKHARDLNIETLLKYDNDRLLAPYLKEAGLTPKGKSYPNWDG- 74
Query: 158 ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-------GTGYLSA 210
L GH GHYL+A A + A+T + +++M + L C + G GY+
Sbjct: 75 ---LDGHVGGHYLTAMA-INAATGSQECRKRMEYWISELQACADANAKNHPDWGRGYVGG 130
Query: 211 FP--TELFDSFEA--LKP---VWAPYYTIHKILAGLLDQYVLADNAQALKM----ATWMV 259
P ++ +F+ P W P+Y IHK+ AGL D +V N QA K+ W +
Sbjct: 131 VPGSDRIWSNFKKGNFGPYFGAWVPFYNIHKMYAGLRDAWVYCGNEQAKKLFLGFCDWAI 190
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ N +T +ER +L+ E GGMN+VL Y+IT + K+L +A F L
Sbjct: 191 DLTAN-----LTDAQMER---ALDTEHGGMNEVLADAYAITGEQKYLDVARRFSHRRLLN 242
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
L + D L + HANT +P VIG + E++GD Y G +F DIV + A GG S
Sbjct: 243 PLMQRRDVLDNMHANTQVPKVIGFERIAELSGDEAYHTAGAYFWDIVTGERTLAFGGNSR 302
Query: 380 REFWWDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
RE + + D + + E+C T NMLK++ L R E YAD++E A N +LS Q
Sbjct: 303 REHFPSREACQDFVQDIDGPESCNTNNMLKLTEDLHRRNPEARYADFFELATFNHILSTQ 362
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
E G +Y ++ R + + WCC GTG+E+ K IY G+
Sbjct: 363 H-PEHGGYVYFTS-----ARPRHYRNYSAPNEAMWCCVGTGMENHGKYNQFIY-THSGDA 415
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
L++ +++S +WK+ + L Q+ + R+T+T SS + Q + + +R P
Sbjct: 416 --LFVNLFVASELNWKAKGITLRQETS--FPYSENSRITITQSSNTK--QPTPIMVRYPG 469
Query: 559 WTYSNGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W +NG+ + + P ++++ +W D + IQ P+ + + P
Sbjct: 470 WVKPGQFSVKVNGKPVSIVTGPSSYVAINRQWKKGDVIDIQFPMYNSVKYL----PNLPQ 525
Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN 677
A++ GP +LA +KTGT L+ LI+ S QL T + + ++ N
Sbjct: 526 YIALMHGPIMLA--------MKTGT-EDLAHLIA--DDSRFGQLATGKKLPIDQAPILVN 574
Query: 678 SN-QSITMEEFPVSGTDAALHATFRLILK 705
+ +SI + P++G + + +++ K
Sbjct: 575 KDVESIANQLQPIAGKPLHFNLSTKMVNK 603
>gi|297191370|ref|ZP_06908768.1| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
gi|197720620|gb|EDY64528.1| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
Length = 942
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 228/444 (51%), Gaps = 36/444 (8%)
Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F + E++ VWAPYYT HKIL GLLD ++ + +AL +A+ M +
Sbjct: 393 GFLAAYPETQFITLESMTSPDYTVVWAPYYTAHKILKGLLDAHLSTGDVRALDLASGMCD 452
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ ++R+ ++ + R W + E GGM + + ++S+T +HL LA +FD +
Sbjct: 453 WMHSRL-ALLPSATRRRMWGLFSSGEYGGMVEAVVDVHSLTGRAEHLELARMFDLDPLID 511
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D LS HAN HIPI G ++ TG+ Y F D+V + Y GGTS
Sbjct: 512 ACAENRDVLSGLHANQHIPIFTGLIRLHDATGEERYLTAARNFWDMVVPTRMYGIGGTST 571
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW D +A TLG ETC +NMLK+SR LF ++ YAD+YER L N +L ++
Sbjct: 572 GEFWRDAGVIAGTLGDTTAETCCAHNMLKLSRLLFLHEQDPKYADHYERTLFNQILGSKQ 631
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E +M Y + L G + T GT CC GTGIES +K DS+YF
Sbjct: 632 DLADAELPLMTYFIGLAPGAVR-DFTPKQGTT-----CCEGTGIESATKYQDSVYFRTR- 684
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM--TLTFSSKQEVGQLSSLNL 554
+ GLY+ Y++S+ DW V + Q LR+ + TF L+L
Sbjct: 685 DGSGLYVNLYMASTLDWTDRGVRVTQTTRFPYEQGSTLRIAGSGTF----------DLHL 734
Query: 555 RMPVWTYSNGAQASLNGQ-NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
R+P W G +NG+ + PG++L+ + W D + I +P +LRTE DD
Sbjct: 735 RVPHWA-DAGFFVRVNGRAHHGGAAPGSYLTVSRAWRDGDTVEISMPFTLRTEPALDDH- 792
Query: 614 EYASIQAILFGP-YLLAGHTSGEW 636
+Q +++GP +L+A H E+
Sbjct: 793 ---DVQCLMYGPVHLVARHEQREF 813
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 123 LEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENPISE----LRGHFVGHYLSASAQMW 177
L++ DV L+ FR A L T G A GGWE E LRGHF GH+LS +Q +
Sbjct: 77 LDFGRSYDVHRLLQVFRANAGLSTRGAVAPGGWEGLDGEARGNLRGHFTGHFLSMLSQAY 136
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKI 203
ST +K+ T+V L+EC+ +
Sbjct: 137 VSTREQVFADKIGTMVDGLAECREAL 162
>gi|295132897|ref|YP_003583573.1| glycosyl hydrolase [Zunongwangia profunda SM-A87]
gi|294980912|gb|ADF51377.1| putative glycosyl hydrolase [Zunongwangia profunda SM-A87]
Length = 797
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 170/538 (31%), Positives = 252/538 (46%), Gaps = 45/538 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A+ N+ LL DVD L+ +RK A L +Y WE L GH GHYLSA A +
Sbjct: 49 ARDLNMSVLLQYDVDRLLAPYRKEAGLEPRKPSYPNWEG----LDGHIGGHYLSALAMNY 104
Query: 178 ASTHNATIKEKMSTVVFSLSECQ-------NKIGTGYLSAFPTE--LFDSF-----EALK 223
A+T N +M+ ++ L ECQ + G GY+ FP L+ SF E
Sbjct: 105 AATDNQEFLARMNYMLKELRECQLANTKKHPEWGVGYVGGFPNSEALWSSFKKGNFEKYN 164
Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
WAP+Y +HK+ AGL D ++ AD+ +A +M ++ + + S E+ LN
Sbjct: 165 SAWAPFYNLHKMYAGLRDAWLYADSEKAKEMFLDFCDWGITLTKDL----SHEQMQSVLN 220
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGS 343
E GGM +V Y IT + K+L A + L L+ D L + HANT IP +G
Sbjct: 221 MEHGGMPEVYADAYQITGEKKYLEAAKRYSHEQVLHPLSKGIDNLDNKHANTQIPKFVGF 280
Query: 344 QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN-EETCT 402
+ EV GD + G++F + V + S A GG S +E + D + ++ E+C
Sbjct: 281 ERIAEVDGDEKFAKAGSYFWETVTKNRSLAFGGNSRKEHFPSTSASIDYINEDDGPESCN 340
Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARST 462
+YNMLK++ LFR E YADYYER L N +LS Q + G +Y P ++ R
Sbjct: 341 SYNMLKLTEDLFRVNPEAKYADYYERTLYNHILSTQH-PQHGGYVYFTP-----ARPRHY 394
Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
+ + WCC GTG+E+ K IY +G+ LYI +I S +W+ V + Q
Sbjct: 395 RIYSAPEEAMWCCVGTGMENHGKYNQFIY-THQGD--SLYINLFIPSELNWEKQGVKIRQ 451
Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGN 581
+ + L++T E L LR P W + +N + + L P +
Sbjct: 452 ETNFPSEEGTSLKIT-------EGTAEFPLFLRYPGWIKEGEMKIKINSEEIELIGKPSS 504
Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIK 639
++ W D + + LP+ E + + P+Y A GP LL G SG D+K
Sbjct: 505 YVKIDRNWQKGDIVDVSLPMHNHMERLP-NVPQYV---AFFHGPILL-GAPSGSEDLK 557
>gi|429195121|ref|ZP_19187172.1| hypothetical protein STRIP9103_04852 [Streptomyces ipomoeae 91-03]
gi|428669175|gb|EKX68147.1| hypothetical protein STRIP9103_04852 [Streptomyces ipomoeae 91-03]
Length = 936
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 152/442 (34%), Positives = 224/442 (50%), Gaps = 37/442 (8%)
Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F E++ VWAPYYT HKIL GLLD Y+ D+A+AL +A+ + +
Sbjct: 384 GFLAAYPETQFIELESMTSGDYTRVWAPYYTAHKILRGLLDAYLNVDDARALDLASGLCD 443
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ Y+R+ K+ +++R W + E GG+ + + LY+IT +HL LA LFD +
Sbjct: 444 WMYSRLSKLPDA-TLQRMWGIFSSGEFGGLVEAIVDLYTITGKAEHLALARLFDLDKLID 502
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L HAN HIPI G Y+ TG+ Y F +V Y GGTS
Sbjct: 503 ACAANTDTLDGLHANQHIPIFTGLARLYDATGEVRYLTAAKNFWGMVVPPRMYGIGGTST 562
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW +A T+ N ETC YN+LK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 563 GEFWKARGVIAGTISDTNAETCCAYNLLKLSRTLFFHEQDPKYMDYYERALLNQVLGSKQ 622
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E ++ Y + L G + T GT CC GTG+ES +K DS+YF +
Sbjct: 623 DKTDAEKPLVTYFIGLKPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYFTKA- 675
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS---LN 553
+ LY+ Y +++ +W + V + Q D Y R S +G S+ L
Sbjct: 676 DGSALYVNLYSATTLNWSAKGVTVTQTTD-------YPREQ---GSTITIGGGSAAFELR 725
Query: 554 LRMPVWTYSNGAQASLNGQNLP-LPPPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDD 611
LR+P W + G + ++NG + P G++ + + R W D + + +P LR E DD
Sbjct: 726 LRVPSWA-TAGFRVTVNGGAVSGTPTAGSYFTISSRTWRGGDVVRVTMPFRLRVEKALDD 784
Query: 612 RPEYASIQAILFGPYLLAGHTS 633
S+Q + +GP L G +
Sbjct: 785 ----PSLQTLFYGPVNLVGRNT 802
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 59/112 (52%), Gaps = 6/112 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
++ L DV L Q + +Q L++ DVD L+ FR A L T G A GGWE
Sbjct: 45 VRPFELKDVTLGQG-LFAGKRQLMLDHGRGYDVDRLLQVFRANAGLSTKGAVAPGGWEGL 103
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT 205
E LRGH+ GH+L+ AQ +AST + +K+ +V +L+E + + T
Sbjct: 104 DGEANGNLRGHYTGHFLTTLAQAYASTADTVYADKIRYMVGALTEVRAALRT 155
>gi|423303007|ref|ZP_17281028.1| hypothetical protein HMPREF1057_04169 [Bacteroides finegoldii
CL09T03C10]
gi|408470336|gb|EKJ88871.1| hypothetical protein HMPREF1057_04169 [Bacteroides finegoldii
CL09T03C10]
Length = 801
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 172/540 (31%), Positives = 245/540 (45%), Gaps = 55/540 (10%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ N LL DVD L+ F A L + + W L GH GHYLSA A +
Sbjct: 47 AQDLNRSVLLEYDVDRLLAPFLIEAGLEPKAEKFPNWPG----LDGHVAGHYLSAMAMNY 102
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
+ K +M ++ L CQ G GY+ P E K WAP+Y
Sbjct: 103 RAGGGEEFKRRMEYILSELYRCQQANGDGYIGGIPNGKAGWKEIKKGNVGIIWKYWAPWY 162
Query: 231 TIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
+HK+ AGL D ++ AD+ A KM W + VI+ + E+ LN E
Sbjct: 163 NLHKLYAGLRDAWLYADSELAKKMFLDYCDWGI--------GVISGLNDEQMEQMLNNEF 214
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGMN+V Y I+ D K+L A F + D L + HANT +P +G Q
Sbjct: 215 GGMNEVFADAYQISGDTKYLDAAKRFSHKWLFESMRDGKDNLDNKHANTQVPKAVGYQRV 274
Query: 347 YEVT------GDPL-YKLIGTFFMDIVNASHSYATGGTSARE-FWWDPKRLADTLGSENE 398
E++ GD + Y FF V A+ S A GG S RE F D L+ E
Sbjct: 275 AELSVQAKRSGDAVDYTRAAYFFWQTVTANRSLAFGGNSRREHFPDDADYLSYVDDREGP 334
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C TYNML+++ LFR + AYAD+YERAL N +LS Q G +Y P
Sbjct: 335 ESCNTYNMLRLTEGLFRKDPKAAYADFYERALFNHILSTQHPVHGGY-VYFTP------- 386
Query: 459 ARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
AR H + + WCC GTG+E+ K G+ IY + LY+ +ISS +WK
Sbjct: 387 ARPAHYRVYSAPNEAMWCCVGTGMENHGKYGEFIYAHTGDS---LYVNLFISSRLEWKKR 443
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
+ L Q S+ + LT ++K+ L +R P W ++NG+++
Sbjct: 444 RISLTQ----TTSFPDEGKTCLTITAKKSTK--FPLFVRKPGWVGDGKVIITVNGKSIET 497
Query: 577 PPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
N + + +W D + +Q+P+++R E ++ PEY AI+ GP LL + E
Sbjct: 498 TTAANSYYTINRKWKNGDVVEVQMPMNIRIEELK-HHPEYI---AIMRGPILLGANVGKE 553
>gi|160882548|ref|ZP_02063551.1| hypothetical protein BACOVA_00499 [Bacteroides ovatus ATCC 8483]
gi|156112129|gb|EDO13874.1| hypothetical protein BACOVA_00499 [Bacteroides ovatus ATCC 8483]
Length = 801
Score = 223 bits (568), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 172/540 (31%), Positives = 246/540 (45%), Gaps = 55/540 (10%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ N LL DVD L+ F A L + + W L GH GHYLSA A +
Sbjct: 47 AQDLNRSVLLEYDVDRLLAPFLIEAGLKPKAEKFPNWPG----LDGHVAGHYLSAMAMNY 102
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
+ K +M ++ L +CQ G GY+ P E K WAP+Y
Sbjct: 103 RAGDGEEFKRRMEYMLSELYKCQQANGDGYIGGIPNGKAGWKEIKKGNVGIIWKYWAPWY 162
Query: 231 TIHKILAGLLDQYVLADNAQALKM----ATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
+HK+ AGL D ++ AD+ A KM W + VI+ + E+ LN E
Sbjct: 163 NLHKLYAGLRDAWLYADSELAKKMFLDYCDWGI--------GVISGLNDEQMEQMLNNEF 214
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGMN+V Y I+ D K+L A F + D L + HANT +P +G Q
Sbjct: 215 GGMNEVFADAYQISGDTKYLDAAKRFSHKWLFESMRDGKDNLDNKHANTQVPKAVGYQRV 274
Query: 347 YEVT------GDPL-YKLIGTFFMDIVNASHSYATGGTSARE-FWWDPKRLADTLGSENE 398
E++ GD + Y FF V A+ S A GG S RE F D L+ E
Sbjct: 275 AELSVQAKRSGDAVDYTRAAYFFWQTVTANRSLAFGGNSRREHFPDDADYLSYVDDREGP 334
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C TYNML+++ LFR + AYAD+YERAL N +LS Q G +Y P
Sbjct: 335 ESCNTYNMLRLTEGLFRKDPKAAYADFYERALFNHILSTQHPVHGGY-VYFTP------- 386
Query: 459 ARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
AR H + + WCC GTG+E+ K G+ IY + LY+ +ISS +WK
Sbjct: 387 ARPAHYRVYSAPNEAMWCCVGTGMENHGKYGEFIYAHTGDS---LYVNLFISSRLEWKKR 443
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
+ L Q S+ + LT ++K+ L +R P W ++NG+++
Sbjct: 444 RISLTQ----TTSFPNEGKTCLTITAKKSTK--FPLFVRKPGWVGDGKVIITVNGKSIET 497
Query: 577 PPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
N + + +W D + +Q+P+++R E ++ PEY AI+ GP LL + E
Sbjct: 498 TTAANSYYTINRKWKNGDVVEVQMPMNIRIEELK-HHPEYI---AIMRGPILLGANVGKE 553
>gi|302549595|ref|ZP_07301937.1| secreted protein [Streptomyces viridochromogenes DSM 40736]
gi|302467213|gb|EFL30306.1| secreted protein [Streptomyces viridochromogenes DSM 40736]
Length = 943
Score = 223 bits (567), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 158/466 (33%), Positives = 228/466 (48%), Gaps = 38/466 (8%)
Query: 206 GYLSAFPTELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F E+ VWAPYYT HKIL GLLD Y D+ +AL +A+ M +
Sbjct: 392 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGLLDAYTATDDDRALDLASGMCD 451
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ ++R+ K + +++R W + E GG+ + + L+++T +HL LA LFD +
Sbjct: 452 WMHSRLSK-LPESTLQRMWGIFSSGEFGGIVEAICDLHTLTGKAEHLALAQLFDLDRLIE 510
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L HAN HIPI G Y+ TG+ Y F D+V Y GGTS
Sbjct: 511 ACAANTDILDGLHANQHIPIFTGYVRLYDETGEERYLRSAKNFWDMVVPHRMYGIGGTST 570
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
+EFW +A T+ + ETC YNMLK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 571 QEFWKARDVIAGTISATTAETCCAYNMLKLSRTLFFHEQDPKYMDYYERALYNQVLGSKQ 630
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E ++ Y + L G + T GT CC GTG+ES +K DS+YF +
Sbjct: 631 DKPDVEKPLVTYFIGLTPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYF-AQA 683
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLR 555
+ LY+ Y S+ W V + Q S+ TLT G+ S +L LR
Sbjct: 684 DGSALYVNLYSPSTLTWAEKGVTVTQS----TSFPREQGSTLTLGG----GRASFTLRLR 735
Query: 556 MPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
+P W + G ++NG+ + P PG++ + W D + I +P R E DD
Sbjct: 736 VPSWA-TAGFGVTVNGRAVSGTPRPGSYFDVSRTWRAGDTVRIAMPFRTRVEKALDD--- 791
Query: 615 YASIQAILFGPYLLAGHTSGEWDIKTGTAR------SLSALISPIP 654
S+Q + GP L S +K G R LS ++P+P
Sbjct: 792 -PSLQTLFHGPVNLVARDSATEYLKVGLYRDAGLSGDLSHSLTPVP 836
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 56/110 (50%), Gaps = 6/110 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
++ L DV L + V +Q L++ DV+ L+ FR A L T G A GGWE
Sbjct: 53 VRPFGLEDVSLGRG-VFADKRQLMLDHARGYDVNRLLQVFRANAGLATGGAVAPGGWEGL 111
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
E LRGH+ GH+L+ AQ + ST +++ VV +L+E + +
Sbjct: 112 DGEANGNLRGHYTGHFLTMLAQAYRSTKEQVFADRIGAVVGALTEVRAAL 161
>gi|336428272|ref|ZP_08608256.1| hypothetical protein HMPREF0994_04262 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336006508|gb|EGN36542.1| hypothetical protein HMPREF0994_04262 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 601
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 137/418 (32%), Positives = 212/418 (50%), Gaps = 18/418 (4%)
Query: 112 SSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASL-----PTPGKAYGGWENPISELRGHFV 166
S + R Q N + LL L+ S+ A L P + GWE P SE+RGHFV
Sbjct: 17 DSEIRRRFQVNEDLLLRYQSKDLLRSYYFEAGLWKDNSENPKIEHWGWEGPTSEIRGHFV 76
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVW 226
GH+LSA+A +AS N + + ++ L CQ G ++ A P + E +
Sbjct: 77 GHWLSAAAITYASDGNRELLGRAEYMLDELERCQKANGGEWIGAIPEKQLRWTEEGRNFG 136
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
P Y +HKI+ GL+D YV A N +AL++ ++FY V+ + T +R + ET
Sbjct: 137 VPLYNLHKIIMGLIDMYVYAGNCKALEIVGHFADWFYRWVKDIPT----DRMDIIMETET 192
Query: 287 GGMNDVLYRLYSITHDPKH-LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
GG+ + RLY IT + K+ +L+ +P F L D L++ HANT IP ++G
Sbjct: 193 GGILEEWCRLYEITGEEKYQVLMEKFLRRPLFHALLE-NKDVLTNMHANTTIPEILGIAR 251
Query: 346 RYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
YEVTG+P Y K + ++ V + TGG ++ E W P + + LG N+E C Y
Sbjct: 252 MYEVTGNPEYLKAVKNYWSIAVTKRGGFVTGGQTSGEVWIPPFHIRERLGKLNQEHCAVY 311
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
NM++++ L+++T +I + +Y E L NG+L+ Q+ G Y LP+ G K
Sbjct: 312 NMMRLAEFLYQYTGDIEFENYRELNLYNGILA-QQNPNTGAAAYYLPMQAGSRKI----- 365
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
W T+ SFWCC G+GI++ + G IY E + + I + +S W+ + Q
Sbjct: 366 WSTEKKSFWCCCGSGIQAGASHGMGIYAENKNQIAVNQFIPSVLTSDRWERKVKITQQ 423
>gi|408533805|emb|CCK31979.1| secreted protein [Streptomyces davawensis JCM 4913]
Length = 943
Score = 219 bits (559), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 151/439 (34%), Positives = 218/439 (49%), Gaps = 32/439 (7%)
Query: 206 GYLSAFPTELFDSFEA-----LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F E+ VWAPYYT HKIL G+LD Y+ D+A+AL +A+ M +
Sbjct: 392 GFLAAYPETQFIDLESRTSSDYTKVWAPYYTAHKILRGVLDAYLATDDARALDLASGMAD 451
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ ++R+ K + +++R W + E GG+ + + L++IT +HL LA LFD +
Sbjct: 452 WMHSRLSK-LPEATLQRMWGLFSSGEFGGIVEAICDLHAITGKAEHLALARLFDLDRLID 510
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L HAN HIPI G Y+ TG+ Y F +V Y GGTS
Sbjct: 511 SCAANTDILDGLHANQHIPIFTGYLRLYDATGEQRYLDAARNFWGMVVPHRMYGIGGTST 570
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW +A T+ + ETC YN+LK+SR LF Y DYYERAL N VL ++
Sbjct: 571 GEFWKARDVIAGTISATTAETCCAYNLLKLSRTLFFHEPSPKYMDYYERALYNQVLGSKQ 630
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E ++ Y + L G + T GT CC GTG+ES +K DS+YF +
Sbjct: 631 DKPDAEKPLVTYFIGLTPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDSVYFTTD- 683
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLR 555
+ LY+ Y S +W V + Q ++ TLT G S L LR
Sbjct: 684 DGSALYVNLYSPSRLNWADKGVTVTQA----TAFPQEQGTTLTIGG----GSASFELRLR 735
Query: 556 MPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
+P W + G + ++NG+ + P PG++ + + W D + I +P LR E DD
Sbjct: 736 VPSWA-TAGFRVTVNGRAVSGTPAPGSYFAVSRTWRSGDTVRISMPFRLRAEKALDD--- 791
Query: 615 YASIQAILFGPYLLAGHTS 633
S+Q + +GP L G S
Sbjct: 792 -PSLQTLCYGPVNLVGRNS 809
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 57/110 (51%), Gaps = 6/110 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPT-PGKAYGGWENP 157
+K +L V L Q + ++ L++ DVD L+ FR A LPT A GGWE
Sbjct: 53 VKPFALDQVTLGQG-LFADKRELMLDHARGYDVDRLLQVFRANAGLPTGDAVAPGGWEGL 111
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
E LRGH+ GH+++ AQ WA T +++ T++ +L+E + +
Sbjct: 112 DGEANGNLRGHYTGHFMTMLAQAWAGTGEQVFADRLRTMIGALTEVRAAL 161
>gi|384109447|ref|ZP_10010323.1| hypothetical protein MSI_18910 [Treponema sp. JC4]
gi|383868978|gb|EID84601.1| hypothetical protein MSI_18910 [Treponema sp. JC4]
Length = 727
Score = 219 bits (558), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 166/546 (30%), Positives = 254/546 (46%), Gaps = 68/546 (12%)
Query: 109 LDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGH 168
L++ S+ ++Q+ LEY+L + D ++ + YGGWEN +++GH +GH
Sbjct: 8 LEKDSLFEKSQRLGLEYVLEYEPDRMLAPCYRALGKNPCAINYGGWEN--RQIQGHMLGH 65
Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD-------SFE- 220
YLSA + + T KEK+ + + E Q K GY P++ FD +FE
Sbjct: 66 YLSALSGFYYQTGKQDAKEKLDYTIDLIKELQRK--DGYFGGIPSDSFDKVFYSGGNFEV 123
Query: 221 ---ALKPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFYN----RVQKV 269
+L W P+Y+IHKI AGL+D YV N AL KMA W + N +QK+
Sbjct: 124 ERFSLAGWWVPWYSIHKIYAGLIDAYVYGGNEDALQIVFKMADWAINGTKNLSDSSIQKM 183
Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLS 329
+T E GGM V LY IT + K+L A + + + + D L
Sbjct: 184 LTC------------EHGGMCKVFADLYGITGNKKYLSEAERWIHHEIIDPASKKEDKLQ 231
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
+HANT IP IG YE+TG Y+ FF + V + SYA GG S E + +
Sbjct: 232 GYHANTQIPKFIGIARLYELTGKSEYRTAAEFFFETVTKNRSYAIGGNSKGEHF--GREF 289
Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
+ L + ETC TYNML+++ H+F W K AD+YE AL N +L+ Q + G Y
Sbjct: 290 EEPLMRDTCETCNTYNMLELAEHIFAWNKTSDIADFYENALYNHILASQ-DPQTGAKTYF 348
Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
+ + +G K +H N+ WCC GTG+E+ S+ I + + LYI +I +
Sbjct: 349 VSMQQGFHKVYCSHD-----NAMWCCTGTGLENPSRYNRFIACDFD---DVLYINLFIPA 400
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
+ + + G V KV+ +D +++ + K+ G L +R P W +A
Sbjct: 401 TVETEDGWKV---KVETDFPYDAAVKIKVLERGKENKG----LKVRKPGWADKMAEKAGE 453
Query: 570 NGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
+G GN S +E + + LP+ L +D + A+ +GP +LA
Sbjct: 454 DG----YIDFGNLSSESE-------IELSLPMKLSIYKAKDHSGNF----AVKYGPLVLA 498
Query: 630 GHTSGE 635
E
Sbjct: 499 ADLGNE 504
>gi|238061684|ref|ZP_04606393.1| secreted protein [Micromonospora sp. ATCC 39149]
gi|237883495|gb|EEP72323.1| secreted protein [Micromonospora sp. ATCC 39149]
Length = 933
Score = 219 bits (557), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 151/464 (32%), Positives = 229/464 (49%), Gaps = 36/464 (7%)
Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F + E++ VWAPYYT HKIL G+LD Y+ + +AL +AT M +
Sbjct: 382 GFLAAYPETQFITLESMTASDYAKVWAPYYTAHKILQGILDAYLNTGDERALDLATGMCD 441
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ ++R+ K + +++R W + E GG+ + + ++ IT P HL LA LFD +
Sbjct: 442 WMHSRLSK-LPAATLQRMWGLFSSGEFGGIVETICDVHRITGSPNHLALARLFDLNSLID 500
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D ++ HAN HIPI G ++ TG+ Y F +V + Y+ GGTS
Sbjct: 501 AAAAGTDTITGLHANQHIPIFTGLLRLHDETGEQRYLNAARNFWPMVVPTRMYSIGGTST 560
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
EFW +P +A +L N ETC YN+LK+SR LF ++ Y DYYERAL N +L +R
Sbjct: 561 VEFWKEPGAIAGSLSDTNAETCCAYNLLKLSRTLFLHEQDPKYMDYYERALYNQILGSKR 620
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E ++ Y + L G + T GT CC GTG+ES +K D++Y +
Sbjct: 621 DLADAEKPLVTYFIGLVPGHVR-DYTPKQGTT-----CCEGTGMESATKYQDTVYL-DTA 673
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS--LNL 554
+ LY+ Y SS W + L Q R ++ +VG ++ L L
Sbjct: 674 DGRALYVNLYSSSKLTWARRGITLTQTT----------RYPFEQNTTIKVGGNATFELRL 723
Query: 555 RMPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
R+P W + + +NG+ P PG++ RW D + + +P LR E DD
Sbjct: 724 RVPGWVKGD-FKVYVNGRRAPGKATPGSYFPVARRWRAGDTVRVHIPFQLRVEKALDD-- 780
Query: 614 EYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSF 657
S Q + +GP L ++ +K G R+ AL + PS
Sbjct: 781 --PSTQTLFYGPVNLVARSASTNFLKIGLYRN-CALSGDLLPSL 821
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 54/110 (49%), Gaps = 11/110 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
L EV+L D V R + LE+ +VD L+ FR A L T G A GWE
Sbjct: 54 LGEVALRD------GVFARKRDLMLEHARGYNVDRLLQVFRANAGLDTLGAVAPSGWEGL 107
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
E LRGH+ GH+L+ AQ + ST + +K+ +V +L E + +
Sbjct: 108 DGEANGNLRGHYTGHFLTMLAQAYGSTGDKVFADKLKYMVGALVEARAAL 157
>gi|373463723|ref|ZP_09555310.1| hypothetical protein HMPREF9104_01016 [Lactobacillus kisonensis
F0435]
gi|371763942|gb|EHO52383.1| hypothetical protein HMPREF9104_01016 [Lactobacillus kisonensis
F0435]
Length = 747
Score = 219 bits (557), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 165/575 (28%), Positives = 266/575 (46%), Gaps = 61/575 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWENP 157
+K VS ++V +S L + N+ ++L L D L++++R A L T G WE+P
Sbjct: 22 MKPVSYYNVKYLPNSTLKEKFERNVNWMLSLTPDQLLYNYRINAGLDTKGATPLTVWESP 81
Query: 158 ISELRGHFVGHYLSASAQMWASTHNAT-------IKEKMSTVVFSLSECQNKIGT----- 205
RGHF GHYLS +++ + +N +K++++ +V L ECQ K T
Sbjct: 82 DWFFRGHFTGHYLSGASRSFVELNNMEDTKEANELKDRVNKIVDGLKECQEKFDTFEEFP 141
Query: 206 GYLSAFPTELFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYF 262
GYL+A P++ FD E L+ + PYY + K++ GL+D Y A N AL++ M YF
Sbjct: 142 GYLAAEPSKRFDDVEKLRFNGNHYVPYYAVQKLMDGLMDAYEFAGNQTALELTMNMTHYF 201
Query: 263 YNRVQKVITMY---SVERHWYS------LNEETGGMNDVLYRLYSITHDPKHLL--LAHL 311
R++++ ++ WY ++E G M+ L RLY IT + + LA
Sbjct: 202 EKRMERLTPEQINAMIDTRWYQGKGHYVYHQEFGAMHRTLLRLYEITDKKQKDIFDLAQK 261
Query: 312 FDKPCFLGFLALQADYLSHF--HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNAS 369
FD+ F L D L ++ HANT + G Y VTGD YK +M+ ++
Sbjct: 262 FDRKWFRDMLINNDDELGYYSCHANTELVCAEGMLEYYHVTGDENYKKGVVNYMNWMHDG 321
Query: 370 HSYATGGTSAR-----------EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTK 418
H T G S R E + P+ L N E+C ++++ +S LF TK
Sbjct: 322 HELPTKGISGRSAYPAPADYGSELYDYPEMFFKHLSMLNGESCCSHDLNFLSSELFADTK 381
Query: 419 EIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGT 478
+ D YE N +++ Q+ + + Y+ L + + G FWCC G+
Sbjct: 382 DATLLDDYEIRFINAIMA-QQNNDSAIAEYLYNLSVAPNSTKEYSHTG-----FWCCTGS 435
Query: 479 GIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL 538
G E S L D IY+ ++ ++ Y+ QY S D K V + Q D + +T+
Sbjct: 436 GTERHSTLVDGIYYTDKKDI---YVGQYFDSILDLKDQGVTVTQ--DSHYPEQHFAHITV 490
Query: 539 TFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
+ QE ++ LR+P W S S++G+N+ P F++ W ++T+
Sbjct: 491 EAAKSQEF----TVYLRVPKW--SRNTTISVDGENVDAEPKNGFVAIKRTWGKKAEITVN 544
Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
LR + + D + + AI +GP LLA T
Sbjct: 545 FDFELRYQTLAD---RFNRV-AIYYGPILLAAQTK 575
>gi|345513939|ref|ZP_08793454.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
gi|423241465|ref|ZP_17222578.1| hypothetical protein HMPREF1065_03201 [Bacteroides dorei
CL03T12C01]
gi|229435753|gb|EEO45830.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
gi|392641358|gb|EIY35135.1| hypothetical protein HMPREF1065_03201 [Bacteroides dorei
CL03T12C01]
Length = 797
Score = 219 bits (557), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 178/554 (32%), Positives = 258/554 (46%), Gaps = 58/554 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L EV L D + ++ L + YLL LDVD L+ R++ L G YGGWE
Sbjct: 44 LSEVELTDSYFKKAMDLHKG------YLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 94
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----------GTGY 207
+ G GHY+SA A M+AST + +K++ ++ L ECQ + GY
Sbjct: 95 -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 153
Query: 208 LSAFPTE--LFDSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
L L E +P W +Y IHKILAGL D YV A QA + +
Sbjct: 154 LQLLQGNVVLNQPDETGQP-WNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLA 212
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
++ + + + + +L+ E GGMN+V +YSIT D K L A F+ +
Sbjct: 213 DF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIY 268
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
+A D L HAN IP +G YE + + +Y F +IV H+ A GG S
Sbjct: 269 PIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSC 328
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + P + L + ETC TYNMLK+SR LF + Y +YYE AL N +L+ Q
Sbjct: 329 YERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQD 388
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
PG + Y L G K S T F+SFWCC GTG+E+ SK +SIYF++
Sbjct: 389 PDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTGMENHSKYAESIYFKDNQE-- 441
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLSSLNL-RMP 557
L + YI S WK + L + D Y + T + + E+G + + L R P
Sbjct: 442 -LLVNLYIPSRLHWKEKGLKL--------TLDTYFPESDTVTVRMDEIGSYTGMLLFRYP 492
Query: 558 VWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W S A +NG+ G+++ + D +T+ +L + +D+ P +
Sbjct: 493 DWV-SGDAVVRINGKPAQTEAHKGSYIRLLDSVKSGDVITLVFTRNLYIDYAKDE-PHFG 550
Query: 617 SIQAILFGPYLLAG 630
S +++GP LLAG
Sbjct: 551 S---VMYGPILLAG 561
>gi|212695367|ref|ZP_03303495.1| hypothetical protein BACDOR_04914 [Bacteroides dorei DSM 17855]
gi|212662096|gb|EEB22670.1| hypothetical protein BACDOR_04914 [Bacteroides dorei DSM 17855]
Length = 807
Score = 219 bits (557), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 178/554 (32%), Positives = 258/554 (46%), Gaps = 58/554 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L EV L D + ++ L + YLL LDVD L+ R++ L G YGGWE
Sbjct: 54 LSEVELTDSYFKKAMDLHKG------YLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 104
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----------GTGY 207
+ G GHY+SA A M+AST + +K++ ++ L ECQ + GY
Sbjct: 105 -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 163
Query: 208 LSAFPTE--LFDSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
L L E +P W +Y IHKILAGL D YV A QA + +
Sbjct: 164 LQLLQGNVVLNQPDETGQP-WNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLA 222
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
++ + + + + +L+ E GGMN+V +YSIT D K L A F+ +
Sbjct: 223 DF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIY 278
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
+A D L HAN IP +G YE + + +Y F +IV H+ A GG S
Sbjct: 279 PIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSC 338
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + P + L + ETC TYNMLK+SR LF + Y +YYE AL N +L+ Q
Sbjct: 339 YERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQD 398
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
PG + Y L G K S T F+SFWCC GTG+E+ SK +SIYF++
Sbjct: 399 PDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTGMENHSKYAESIYFKDNQE-- 451
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLSSLNL-RMP 557
L + YI S WK + L + D Y + T + + E+G + + L R P
Sbjct: 452 -LLVNLYIPSRLHWKEKGLKL--------TLDTYFPESDTVTVRMDEIGSYTGMLLFRYP 502
Query: 558 VWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W S A +NG+ G+++ + D +T+ +L + +D+ P +
Sbjct: 503 DWV-SGDAVVRINGKPAQTEAHKGSYIRLLDSVKSGDVITLVFTRNLYIDYAKDE-PHFG 560
Query: 617 SIQAILFGPYLLAG 630
S +++GP LLAG
Sbjct: 561 S---VMYGPILLAG 571
>gi|265753023|ref|ZP_06088592.1| acetyl-CoA carboxylase [Bacteroides sp. 3_1_33FAA]
gi|263236209|gb|EEZ21704.1| acetyl-CoA carboxylase [Bacteroides sp. 3_1_33FAA]
Length = 797
Score = 218 bits (556), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 178/554 (32%), Positives = 258/554 (46%), Gaps = 58/554 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L EV L D + ++ L + YLL LDVD L+ R++ L G YGGWE
Sbjct: 44 LSEVELTDSYFKKAMDLHKG------YLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 94
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----------GTGY 207
+ G GHY+SA A M+AST + +K++ ++ L ECQ + GY
Sbjct: 95 -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 153
Query: 208 LSAFPTE--LFDSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
L L E +P W +Y IHKILAGL D YV A QA + +
Sbjct: 154 LQLLQGNVVLNQPDETGQP-WNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLA 212
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
++ + + + + +L+ E GGMN+V +YSIT D K L A F+ +
Sbjct: 213 DF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIY 268
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
+A D L HAN IP +G YE + + +Y F +IV H+ A GG S
Sbjct: 269 PIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSC 328
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + P + L + ETC TYNMLK+SR LF + Y +YYE AL N +L+ Q
Sbjct: 329 YERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQD 388
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
PG + Y L G K S T F+SFWCC GTG+E+ SK +SIYF++
Sbjct: 389 PDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTGMENHSKYAESIYFKDNQE-- 441
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLS-SLNLRMP 557
L + YI S WK + L + D Y + T + + E+G + +L R P
Sbjct: 442 -LLVNLYIPSRLHWKEKGLKL--------TLDTYFPESDTVTVRMDEIGSYTGTLLFRYP 492
Query: 558 VWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W S A +NG+ G+++ + D +T+ +L + +D+ P +
Sbjct: 493 DWV-SGDAVVRINGEPAQTEAHKGSYIRLLDSVKSGDVITLVFTRNLYIDYAKDE-PHFG 550
Query: 617 SIQAILFGPYLLAG 630
S +++GP LLAG
Sbjct: 551 S---VMYGPILLAG 561
>gi|302539859|ref|ZP_07292201.1| putative secreted protein [Streptomyces hygroscopicus ATCC 53653]
gi|302457477|gb|EFL20570.1| putative secreted protein [Streptomyces himastatinicus ATCC 53653]
Length = 940
Score = 218 bits (556), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 146/433 (33%), Positives = 214/433 (49%), Gaps = 36/433 (8%)
Query: 206 GYLSAFPTELFDSFEALKP-----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVE 260
G+L+A+P F + E++ VWAPYYT HKIL GLLD ++ +A+AL +A M +
Sbjct: 389 GFLAAYPETQFITLESMTSGDYTVVWAPYYTAHKILRGLLDAHLATGDARALDLAMGMCD 448
Query: 261 YFYNRVQKVITMYSVERHWYSLNE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
+ Y+R+ K + +++R W + E GG+ + + LY+++ +HL LA LFD +
Sbjct: 449 WMYSRLSK-LPRSTLQRMWGIFSSGEFGGIVEAICDLYALSGKAQHLALARLFDLDKLID 507
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
A D L HAN HIPI G Y+ T + Y F D+V + Y GGTS
Sbjct: 508 ACAAGDDTLDGLHANQHIPIFTGLVRLYDETEEERYLTAAKNFWDMVVPTRMYGIGGTSN 567
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
REFW +A TL ETC YNMLK+SR LF ++ AY DYYERAL N VL ++
Sbjct: 568 REFWGARGAIAKTLSDTTAETCCAYNMLKLSRMLFFHEQDPAYMDYYERALYNQVLGSKQ 627
Query: 440 ---GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
E ++ Y + L G + + T CC GTG+ES +K DS+YF+
Sbjct: 628 DRADAEKPLVTYFIGLVPGHVRDYTPKAGTT------CCEGTGMESATKYQDSVYFKRAD 681
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR---MTLTFSSKQEVGQLSSLN 553
LY+ Y S+ W + + Q Y R TLT + L
Sbjct: 682 GT-ALYVNLYSPSTLTWAEKGITVTQSTG-------YPREQGSTLTVRGRTAA---FDLR 730
Query: 554 LRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
LR+P W ++G + ++NG+ + PG++ S + W D + + +P LR E DD
Sbjct: 731 LRVPAWA-TDGFRVTVNGRAVKGTWTPGSYASVSRTWRDGDTVRVDIPFRLRVEKALDD- 788
Query: 613 PEYASIQAILFGP 625
+Q + GP
Sbjct: 789 ---PRVQTLFHGP 798
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 56/110 (50%), Gaps = 6/110 (5%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWENP 157
L+ + DV L ++SV +Q L++ DVD L+ FR A L T G A GGWE
Sbjct: 50 LRPFNPEDVAL-RTSVFTAKRQLMLDFGRGYDVDRLLQVFRANAGLSTRGAVAPGGWEGL 108
Query: 158 ISE----LRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
E LRGHF GH+L+ +Q + T +K+ +V +L E + +
Sbjct: 109 DGEANGNLRGHFTGHFLTMLSQAYTGTGEKVYADKIRHMVGALDEVREAL 158
>gi|312131189|ref|YP_003998529.1| hypothetical protein Lbys_2513 [Leadbetterella byssophila DSM
17132]
gi|311907735|gb|ADQ18176.1| protein of unknown function DUF1680 [Leadbetterella byssophila DSM
17132]
Length = 737
Score = 218 bits (556), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 170/555 (30%), Positives = 259/555 (46%), Gaps = 64/555 (11%)
Query: 100 KEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPIS 159
+ + L+ V L + V AQ +L+Y+L LD D L+ +R A L + YG WE+ S
Sbjct: 18 QNIPLNQVKL-KEGVFKNAQDVDLKYILALDPDKLLAPYRIDAGLEKKAERYGNWES--S 74
Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT------ 213
L GH GHYLSA A ++AS+ +K+++ +V L+ CQ K G GY+ P
Sbjct: 75 GLDGHIGGHYLSALAMLYASSGEPELKKRLDYMVSELAACQKKNGNGYVGGIPQGKVFWE 134
Query: 214 -----ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMAT----WMVEYFYN 264
++ S L W P Y IHK+ AGL D Y N +AL + T WM+E F
Sbjct: 135 RIGKGDIDGSSFGLNNTWVPLYNIHKLFAGLYDAYHFTGNNEALTVLTGLSDWMIELF-- 192
Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
+T VE+ L E GG+N+ +YS T + K+L A F + FL +
Sbjct: 193 ---SALTDEQVEK---VLRTEHGGLNEAFLDVYSATGEQKYLRAAERFTQKAFLQPMIEG 246
Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
D L+ HANT IP ++G++ +VT + + ++F D V S A GG S RE +
Sbjct: 247 KDILTGLHANTQIPKMVGAEKISQVTKNQDWHKGASYFWDNVALHRSVAFGGNSYREHFH 306
Query: 385 DPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
+ R L + + ETC +YNMLK+S+ L+ T + Y D+YE+ L N +LS Q E
Sbjct: 307 ELDRFDKMLETNQGPETCNSYNMLKLSKALYESTGDNKYLDFYEKTLFNHILSSQH-PEK 365
Query: 444 GVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
G +Y P+ R H + S WCC GTG+E+ +K G+ I+ G L
Sbjct: 366 GGFVYFTPI-------RPNHYRVYSQPETSMWCCVGTGLENHTKYGEMIFSRRAGV---L 415
Query: 502 YIIQYISSSFDWKSGH-VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
+ I++ + GH V L+ K PY + V ++ R+P W
Sbjct: 416 QVNLLIAAKLE---GHSVTLDTKY-------PYENTAVL-----RVDGEKTVKWRIPAWM 460
Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
+ + ++NG+ + F + ++ LS + + Q+ P A
Sbjct: 461 --DEVKFTVNGKKVNPKMESGFA------VFTGLKKAEIHLSFQPKMGQEFLPNDQKWAA 512
Query: 621 ILFGPYLLAGHTSGE 635
+GP +LA TS E
Sbjct: 513 FTYGPLVLAAETSKE 527
>gi|315498334|ref|YP_004087138.1| hypothetical protein Astex_1314 [Asticcacaulis excentricus CB 48]
gi|315416346|gb|ADU12987.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
48]
Length = 774
Score = 218 bits (555), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 168/552 (30%), Positives = 255/552 (46%), Gaps = 79/552 (14%)
Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYL 170
+ S+ + + N YLL L D + +FRK A L G+ YGGWE + GH +GHYL
Sbjct: 44 KPSIFLTSIEANQRYLLSLSPDRFLHNFRKGAGLEPKGEVYGGWE--ARGIAGHSLGHYL 101
Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA-------------------- 210
S + M+A T +++ + V+ L Q K GY
Sbjct: 102 SGLSLMYAQTGKPEFRDRAAHVLSELKTIQAKHSDGYAGGTTVGRNGQEVDGKVVYEELR 161
Query: 211 ---FPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
T FD L W P YT HK+ AG LD + A A AL +AT + +Y ++
Sbjct: 162 KGDIRTSGFD----LNGGWVPLYTYHKVFAGALDAHQYAGLADALIVATGLGDYLGTILE 217
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ E L E GG+ + LY+ T + + L L+ + LA D
Sbjct: 218 SLSDAQIQE----ILRAEHGGLTESYAELYARTKNQRWLTLSQRLRHRAIVDPLAAGHDE 273
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L+ HANT IP ++GS +E+T + I FF V+ HSY GG S E + P+
Sbjct: 274 LAGKHANTQIPKIVGSARLFELTQNADDARIARFFWQTVSRDHSYVIGGNSDHEHFGAPR 333
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+LA L + E C +YNML+++RHL+ W+ + A D+YER N ++S Q+ + G+
Sbjct: 334 QLASRLDQQTCEACNSYNMLRLTRHLYGWSGDAALFDFYERTHLNHIMS-QQDPQTGMFT 392
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGNVPGLYIIQY 506
Y L G+ + S N FWCC G+G+ES SK G+SIY++ EG LY
Sbjct: 393 YFTGLASGLGRVHS-----DPTNDFWCCVGSGMESHSKHGESIYWKRGEGVAVNLYYAST 447
Query: 507 IS---SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS- 562
++ + + ++ + +Q V +T+ + K +L+LR+P W +
Sbjct: 448 LNAPETQLEMETAFPLSDQVV-----------ITVHKAPK-------ALDLRVPGWCDTP 489
Query: 563 ----NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
NG A + GQ G +L T D++ + L + +R EA+ DD A +
Sbjct: 490 VLRVNGKAAGV-GQ-------GGYLRLTG-LKNGDRIELCLAMHVRVEAMPDD----AKL 536
Query: 619 QAILFGPYLLAG 630
A L GP +LAG
Sbjct: 537 IAFLSGPLVLAG 548
>gi|224537186|ref|ZP_03677725.1| hypothetical protein BACCELL_02063 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521241|gb|EEF90346.1| hypothetical protein BACCELL_02063 [Bacteroides cellulosilyticus
DSM 14838]
Length = 805
Score = 218 bits (554), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 161/536 (30%), Positives = 245/536 (45%), Gaps = 45/536 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A N++ LL D D L+ F + A LP + YG WE L GH GHYL+A A +
Sbjct: 44 ACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEK--DGLDGHIGGHYLTALAIHY 101
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
A+T N K++M +V + Q G G + FP + E K W +Y
Sbjct: 102 AATGNLECKKRMDYMVSEFARVQQANGDGSICGFPNSKKFAEEIRKGNVGIVWNYWVAWY 161
Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
+HK AGL D ++ N +A LK W V+ N + +ER L+ E
Sbjct: 162 NMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISNLDDR-----QMER---MLDNEF 213
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGMN+V + +T +PK+L A F +A + D L + HANT +P +G Q
Sbjct: 214 GGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMARRIDNLDNKHANTQVPKAVGYQRV 273
Query: 347 YEVTGD--PLYKLIGT---FFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEET 400
E+ P Y T FF + V + S + GG S E + + + +D + + E+
Sbjct: 274 AELNSKIAPDYNDFMTAAEFFWETVVSHRSLSLGGNSRGEHFPEAGKCSDYMHERQGPES 333
Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
C T NMLK++ LFR ++ YAD+YERA+ N +LS Q E G +Y P +
Sbjct: 334 CNTNNMLKLTEGLFRMHPKVEYADFYERAMYNHILSTQH-PEHGGYVYFTPACPSHYRVY 392
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
S G + WCC GTG+E+ K G IY + + LY+ +I S +WK + +
Sbjct: 393 SAPG-----KAMWCCVGTGMENHGKYGQFIYTHDMAD-NALYVNLFIPSELNWKEKKIKI 446
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-P 579
Q+ D P T + + Q L +R P W Q NG + P
Sbjct: 447 VQETDF-----PNEEGTTLTVNPSKATQFKLL-IRYPSWVEQGKMQVVCNGVDYAKSAQP 500
Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
G++++ +WS D + ++ P++++ E + P + +I+ GP LL T E
Sbjct: 501 GSYIAIDRQWSKGDVVEVKTPMTVKIEEL----PNVPNAISIMRGPILLGARTGTE 552
>gi|423223044|ref|ZP_17209513.1| hypothetical protein HMPREF1062_01699 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640313|gb|EIY34115.1| hypothetical protein HMPREF1062_01699 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 805
Score = 217 bits (553), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 161/536 (30%), Positives = 244/536 (45%), Gaps = 45/536 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A N++ LL D D L+ F + A LP + YG WE L GH GHYL+A A +
Sbjct: 44 ACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEK--DGLDGHIGGHYLTALAIHY 101
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
A+T N K++M +V + Q G G + FP + E K W +Y
Sbjct: 102 AATGNLECKKRMDYMVSEFARVQQANGDGSICGFPNSKKFAEEIRKGNVGIVWNYWVAWY 161
Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
+HK AGL D ++ N +A LK W V+ N + +ER L+ E
Sbjct: 162 NMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISNLDDR-----QMER---MLDNEF 213
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGMN+V + +T +PK+L A F +A D L + HANT +P +G Q
Sbjct: 214 GGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMARHIDNLDNKHANTQVPKAVGYQRV 273
Query: 347 YEVTGD--PLYKLIGT---FFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEET 400
E+ P Y T FF + V + S + GG S E + + + +D + + E+
Sbjct: 274 AELNSKTAPDYNDFMTAAEFFWETVVSHRSLSLGGNSRGEHFPEAGKCSDYMHERQGPES 333
Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
C T NMLK++ LFR ++ YAD+YERA+ N +LS Q E G +Y P +
Sbjct: 334 CNTNNMLKLTEGLFRMHPKVEYADFYERAMYNHILSTQH-PEHGGYVYFTPACPSHYRVY 392
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
S G + WCC GTG+E+ K G IY + + LY+ +I S +WK + +
Sbjct: 393 SAPG-----KAMWCCVGTGMENHGKYGQFIYTHDMAD-NALYVNLFIPSELNWKEKKIKI 446
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-P 579
Q+ D P T + + Q L +R P W Q NG + P
Sbjct: 447 VQETDF-----PNEEGTTLTVNPSKATQFKLL-IRYPSWVEQGKMQVVCNGVDYAKSAQP 500
Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
G++++ +WS D + ++ P++++ E + P + +I+ GP LL T E
Sbjct: 501 GSYIAIDRQWSKGDVVEVKTPMTVKIEEL----PNVPNAISIMRGPILLGARTGTE 552
>gi|451820300|ref|YP_007456501.1| acetyl-CoA carboxylase, biotin carboxylase [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451786279|gb|AGF57247.1| acetyl-CoA carboxylase, biotin carboxylase [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 766
Score = 216 bits (550), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 159/522 (30%), Positives = 252/522 (48%), Gaps = 38/522 (7%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
+Q +Y+L LDVD + + L K Y GWE + GH +GH++SA A +
Sbjct: 24 SQDLGEKYILSLDVDRFLAPCYEAHGLEPKKKRYSGWE--ARAISGHSLGHFMSALAVTY 81
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP----TELFDSFEA----LKPVWAPY 229
+T N +K+ + V LS Q G GY+ E+ D + W P+
Sbjct: 82 QATGNEELKKILDYAVSELSHIQQVTGRGYIGGLVETPFVEIIDGTNIGKFDINGYWVPW 141
Query: 230 YTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGM 289
Y+IHKI GL+D Y LA+N++AL + V F + ++ S E+ L E GGM
Sbjct: 142 YSIHKIYKGLIDAYELAENSEALNV----VVNFADWAVSILNQMSDEQVQAMLECEHGGM 197
Query: 290 NDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG-SQMRYE 348
N + +LY T + +L A F + L D L HANT IP +IG +++ +
Sbjct: 198 NHIFAKLYGFTCNSIYLDTAVRFSHKAIVEPLEQCVDDLQGKHANTQIPKIIGIAEIYNQ 257
Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLK 408
YK FF + V SY GG S +E + ++LG + E+C T+NML
Sbjct: 258 EHAYEKYKTAAQFFWNTVVNRRSYVIGGNSLKEHF--EAIDMESLGIKTAESCNTHNMLL 315
Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
+++ LF W AY DYYE AL N ++ Q G Y L G + S TK
Sbjct: 316 LTKLLFSWNHYSAYMDYYENALFNHIIGTQ-DCHTGNKTYFTSLLPGHYRIYS-----TK 369
Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
++WCC GTG+E+ K ++IYF+E+ + LY+ +ISS FDW++ + + Q+ +
Sbjct: 370 DTAWWCCTGTGMENPGKYAEAIYFQEQDD---LYVNLFISSQFDWEAKGLTIRQESNLPY 426
Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATER 588
S L++ K E +++N+R+P W S A +NG++ + +L+ +
Sbjct: 427 SDTVILKI---IEGKAE----ANINIRVPSWITSELV-AVVNGKDRFVQREKGYLTVSGA 478
Query: 589 WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
W +++ I P+++ +D+ A A +GP +LAG
Sbjct: 479 WDKGNEIRITFPMAVSKYTSKDN----AGKIAFTYGPVVLAG 516
>gi|307109022|gb|EFN57261.1| hypothetical protein CHLNCDRAFT_143813 [Chlorella variabilis]
Length = 349
Score = 216 bits (549), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 124/267 (46%), Positives = 156/267 (58%), Gaps = 10/267 (3%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
SL DV L + S R + N EYLL L+ D L+++FRKTA LP PG +YGGWE E+R
Sbjct: 27 SLADVQLARGSEYARNFEQNSEYLLALEPDRLLYNFRKTAGLPAPGASYGGWEWSGVEIR 86
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEAL 222
GHFVGHYLSA A + ++E+ +V L + Q+ GTGYLSAFP FD EAL
Sbjct: 87 GHFVGHYLSALALATLHSGRPELRERCGVMVSELKKVQDAAGTGYLSAFPESHFDRLEAL 146
Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
+PV HKILAGLLDQ+ L A AL A M +F RV+ V+ + HW+ +
Sbjct: 147 QPV-------HKILAGLLDQHRLVGTAGALGAARRMASHFCARVRAVVAANGTD-HWHRV 198
Query: 283 NE-ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
E E GGMN+ LY LY+IT P+H AH FDKP F LA D L HANTH+ V
Sbjct: 199 LEVEFGGMNEALYNLYAITKSPEHAECAHFFDKPAFFRPLAEGRDPLPGLHANTHMAQVP 258
Query: 342 GSQMRYEVTGDPLYKL-IGTFFMDIVN 367
G RYE+ GD ++ TFF ++
Sbjct: 259 GFTARYELLGDGEAQVAAATFFGTLLQ 285
>gi|427383714|ref|ZP_18880434.1| hypothetical protein HMPREF9447_01467 [Bacteroides oleiciplenus YIT
12058]
gi|425728419|gb|EKU91277.1| hypothetical protein HMPREF9447_01467 [Bacteroides oleiciplenus YIT
12058]
Length = 791
Score = 216 bits (549), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 161/528 (30%), Positives = 243/528 (46%), Gaps = 48/528 (9%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A N++ LL DVD L+ F K A L G+++ WE L GH GHYLSA A +
Sbjct: 46 ACDLNVQILLQYDVDRLLAPFLKEAGLQPKGESFPNWEG----LDGHVGGHYLSALAIHY 101
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
A+T N K++M ++ L CQ K GY+ P + E K W P+Y
Sbjct: 102 AATGNVDCKKRMEYMISELKRCQQKHADGYVGGVPDGMKVWNEIKKGNVGIVWKYWVPWY 161
Query: 231 TIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN 290
+HKI AGL D ++ N +A M + ++ +I + E+ L E GGM+
Sbjct: 162 NLHKIYAGLRDAWIYGGNEEARMMFLELCDWG----MTIIAPLNDEQMEQMLANEFGGMD 217
Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
+V Y +T D K+L A F L +A Q D L + HANT +P V+G Q E+
Sbjct: 218 EVYADAYQMTGDMKYLNTAKRFSHKWLLDSMAAQVDNLDNKHANTQVPKVVGYQRIAELG 277
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-----ENEETCTTYN 405
D Y++ +F + V + S + GG S RE + AD S E E+C T N
Sbjct: 278 HDKKYEVATEYFWNTVVYNRSLSLGGNSRREHF----AAADDCKSYVEDREGPESCNTNN 333
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH-- 463
MLK++ LFR E YAD+YERA+ N +LS Q E G +Y + AR H
Sbjct: 334 MLKLTEGLFRMHPEARYADFYERAMYNHILSTQH-PEHGGYVYF-------TSARPAHYR 385
Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
+ ++ WCC GTG+E+ K G+ IY + L++ +++S +WK + L Q+
Sbjct: 386 VYSAPNSAMWCCVGTGMENHGKYGEFIYTHAHDS---LFVNLFVASELNWKEKGITLIQE 442
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPPGNF 582
+ R+T+ + L +R P W N + G++ P ++
Sbjct: 443 TR--FPDEESSRLTIRVKKPTKF----KLLVRHPWWADGNDMKVLCKGKDYASGSSPSSY 496
Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ W D + I P+ + EA+ P + +I+ GP LL
Sbjct: 497 IVIERTWKNGDVVDITTPMKVHIEAL----PNVSEYISIMRGPILLGA 540
>gi|312131938|ref|YP_003999278.1| hypothetical protein Lbys_3265 [Leadbetterella byssophila DSM
17132]
gi|311908484|gb|ADQ18925.1| protein of unknown function DUF1680 [Leadbetterella byssophila DSM
17132]
Length = 1004
Score = 215 bits (548), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 184/613 (30%), Positives = 268/613 (43%), Gaps = 99/613 (16%)
Query: 123 LEYLLMLDVDSLVWSFRKT--ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWAST 180
++ L D +S ++ FR P K G W++ ++LRGH GHYL+A AQ +AST
Sbjct: 385 IQGLAKTDPNSFLYMFRHAFGQKQPEGAKPLGVWDSQNTKLRGHATGHYLTAIAQAYAST 444
Query: 181 H-----NATIKEKMSTVVFSLSECQNKIGT------------------------------ 205
A KM +V +L E GT
Sbjct: 445 GYDKNLQANFAGKMDQLVNTLYELSRLSGTPKVQGGEAVADPTKVPMGPGKTEYDSDLTD 504
Query: 206 ------------GYLSAFPTELFDSFEA-------LKPVWAPYYTIHKILAGLLDQYVLA 246
GY+SA+P + F E VWAPYYT+HKILAGL+D Y ++
Sbjct: 505 EGIRTDYWNWGKGYISAYPPDQFIMLEQGAKYGGQKNQVWAPYYTLHKILAGLMDVYEVS 564
Query: 247 DNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS-LNEETGGMNDVLYRLYSITHDPKH 305
N +AL +A M E+ + R+ + ++ + W + + E GGMN+ + RL+ +T + K
Sbjct: 565 GNKKALDVAVGMSEWVHARL-AALPQDTLIKMWNTYIAGEYGGMNESMARLFFLTKNEKF 623
Query: 306 LLLAHLFDK-PCFLG------FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLI 358
L A LFD F G LA D HAN HIP ++GS Y V+ +P Y I
Sbjct: 624 LKTAQLFDNIKMFYGDASHSHGLARNVDTFRGLHANQHIPQIVGSIEMYAVSQNPDYYFI 683
Query: 359 GTFFMDIVNASHSYATGGTS-------AREFWWDPKRLAD---TLGSENEETCTTYNMLK 408
F + + Y+ GG + A F P + + + G +N ETC TYNMLK
Sbjct: 684 AENFWHRTVSDYMYSIGGVAGARNPANAECFIAQPATIYENGFSQGGQN-ETCATYNMLK 742
Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
++ LF + ++ Y DYYER L N +L+ P Y +PL G K
Sbjct: 743 LTSSLFMFDQKAEYMDYYERGLYNHILASVAKDSP-ANTYHVPLRPGSIKQFGN----PN 797
Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
F CC GT IES +KL +SIYF+ N LY+ +I S+ +W+ + + Q
Sbjct: 798 MTGFTCCNGTAIESNTKLQNSIYFKSLDNST-LYVNLFIPSTLNWEEKGIKVVQTTSFPK 856
Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATE 587
LR+ E L +R+P W G +NG+ + PG++ +
Sbjct: 857 EDQTKLRI--------EGNGKFDLQVRVPGWA-KKGFVVKINGKKQKIKATPGSYAKISR 907
Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS---GEWDIKTGTAR 644
W D L I +P + + D+P AS + +GP LLA + EW T A+
Sbjct: 908 TWKNGDVLEITMPFEFHLDYVM-DQPNIAS---LFYGPVLLAAQETEARKEWRQVTFDAK 963
Query: 645 SLSALISPIPPSF 657
LS I P +
Sbjct: 964 DLSKNIKGNPETL 976
>gi|326801658|ref|YP_004319477.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326552422|gb|ADZ80807.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
Length = 790
Score = 214 bits (546), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 158/542 (29%), Positives = 246/542 (45%), Gaps = 54/542 (9%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ+T+L Y+L L+ D L+ + + A L +YG WEN + L GH GHYLSA + M
Sbjct: 51 AQETDLRYILALNPDRLLAPYLREAGLEPKASSYGNWEN--TGLDGHIGGHYLSALSLMA 108
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFE---------ALKPVW 226
A+T N I+++++ ++ L CQ++ GY+ P ++++ + +L W
Sbjct: 109 AATGNHAIQDRLTYMLSELKRCQDQDSDGYVGGIPGGKQMWNDIKRGKIEAQSFSLNGKW 168
Query: 227 APYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
P Y IHK+ AGL+D Y N A LK+ W + F + I L
Sbjct: 169 VPIYNIHKLFAGLIDAYRYTGNEHARQMVLKLGKWWLSVFGGLTDEQIQTI--------L 220
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIG 342
E GG+N+V L I+ D K+L +A L L D L+ HANT IP VIG
Sbjct: 221 RSEHGGINEVFADLAQISGDQKYLTMAKRLSHRAILQPLIAGKDELTGLHANTQIPKVIG 280
Query: 343 SQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETC 401
+ + + FF + V + + GG S E + L S E ETC
Sbjct: 281 FEKIAALADSMSWANAARFFWETVVEHRTVSIGGNSESEHFHALNSFGKMLSSREGPETC 340
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG----RGVS 457
TYNM+K+S+ LF + + DYYERA N +LS Q E G +Y P+ R S
Sbjct: 341 NTYNMMKLSKDLFLQGPDRKFIDYYERATYNHILSSQHPKEGG-FVYFTPMRPNHYRVYS 399
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
+A++ FWCC G+G+E+ K G+ IY + LYI +I S+ W+
Sbjct: 400 QAQAC---------FWCCVGSGLENHGKYGELIYTHSGQD---LYINLFIPSTLKWQEQG 447
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
+ L Q+ ++ +T+ ++ + S+ +R P W +NG+ +
Sbjct: 448 ISLTQRTR--FPYEQKSSVTIEVANPKTF----SVFIRKPKWLGKQPINLLVNGKQISYQ 501
Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWD 637
+L +W +T LP+ + E + P + +GP +LA E D
Sbjct: 502 EDKGYLKINRKWVGQSIITFNLPMQINAELLPSGEPWV----SYTYGPIVLASKNGTE-D 556
Query: 638 IK 639
+K
Sbjct: 557 LK 558
>gi|423230906|ref|ZP_17217310.1| hypothetical protein HMPREF1063_03130 [Bacteroides dorei
CL02T00C15]
gi|423244617|ref|ZP_17225692.1| hypothetical protein HMPREF1064_01898 [Bacteroides dorei
CL02T12C06]
gi|392630026|gb|EIY24028.1| hypothetical protein HMPREF1063_03130 [Bacteroides dorei
CL02T00C15]
gi|392641466|gb|EIY35242.1| hypothetical protein HMPREF1064_01898 [Bacteroides dorei
CL02T12C06]
Length = 797
Score = 214 bits (546), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 177/554 (31%), Positives = 257/554 (46%), Gaps = 58/554 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L EV L D + ++ L + YLL LDVD L+ R++ L G YGGWE
Sbjct: 44 LSEVELTDSYFKKAMDLHKG------YLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 94
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----------GTGY 207
+ G GHY+SA A M+AST + +K++ ++ L ECQ + GY
Sbjct: 95 -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 153
Query: 208 LSAFPTE--LFDSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
L L E +P W +Y IHKILAGL D YV A QA + +
Sbjct: 154 LQLLQGNVVLNQPDETGQP-WNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLA 212
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
++ + + + + +L+ E GGMN+V +YSIT D K L A F+ +
Sbjct: 213 DF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIY 268
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
+A D L HAN IP +G YE + + +Y F +IV H+ A GG S
Sbjct: 269 PIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSC 328
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + + L + ETC TYNMLK+SR LF + Y +YYE AL N +L+ Q
Sbjct: 329 YERFGVLGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQD 388
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
PG + Y L G K S T F+SFWCC GTG+E+ SK +SIYF++
Sbjct: 389 PDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTGMENHSKYAESIYFKDNQE-- 441
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLS-SLNLRMP 557
L + YI S WK + L + D Y + T + + E+G + +L R P
Sbjct: 442 -LLVNLYIPSRLHWKEKGLKL--------TLDTYFPESDTVTVRMDEIGSYTGTLLFRYP 492
Query: 558 VWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W S A +NG+ G+++ + D +T+ +L + +D+ P +
Sbjct: 493 DWV-SGDAVVRINGEPAQTEAHKGSYIRLLDSVKSGDVITLVFTRNLYIDYAKDE-PHFG 550
Query: 617 SIQAILFGPYLLAG 630
S +++GP LLAG
Sbjct: 551 S---VMYGPILLAG 561
>gi|237711613|ref|ZP_04542094.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
gi|229454308|gb|EEO60029.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
Length = 770
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 177/554 (31%), Positives = 257/554 (46%), Gaps = 58/554 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L EV L D + ++ L + YLL LDVD L+ R++ L G YGGWE
Sbjct: 17 LSEVELTDSYFKKAMDLHKG------YLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 67
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI-----------GTGY 207
+ G GHY+SA A M+AST + +K++ ++ L ECQ + GY
Sbjct: 68 -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 126
Query: 208 LSAFPTE--LFDSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
L L E +P W +Y IHKILAGL D YV A QA + +
Sbjct: 127 LQLLQGNVVLNQPDETGQP-WNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLA 185
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
++ + + + + +L+ E GGMN+V +YSIT D K L A F+ +
Sbjct: 186 DF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIY 241
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
+A D L HAN IP +G YE + + +Y F +IV H+ A GG S
Sbjct: 242 PIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSC 301
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + + L + ETC TYNMLK+SR LF + Y +YYE AL N +L+ Q
Sbjct: 302 YERFGVLGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQD 361
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
PG + Y L G K S T F+SFWCC GTG+E+ SK +SIYF++
Sbjct: 362 PDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTGMENHSKYAESIYFKDNQE-- 414
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLS-SLNLRMP 557
L + YI S WK + L + D Y + T + + E+G + +L R P
Sbjct: 415 -LLVNLYIPSRLHWKEKGLKL--------TLDTYFPESDTVTVRMDEIGSYTGTLLFRYP 465
Query: 558 VWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W S A +NG+ G+++ + D +T+ +L + +D+ P +
Sbjct: 466 DWV-SGDAVVRINGEPAQTEAHKGSYIRLLDSVKSGDVITLVFTRNLYIDYAKDE-PHFG 523
Query: 617 SIQAILFGPYLLAG 630
S +++GP LLAG
Sbjct: 524 S---VMYGPILLAG 534
>gi|189464749|ref|ZP_03013534.1| hypothetical protein BACINT_01093 [Bacteroides intestinalis DSM
17393]
gi|189437023|gb|EDV06008.1| hypothetical protein BACINT_01093 [Bacteroides intestinalis DSM
17393]
Length = 805
Score = 212 bits (539), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 159/536 (29%), Positives = 240/536 (44%), Gaps = 45/536 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
A N++ LL D D L+ F + A LP + YG WE L GH GHYLSA A +
Sbjct: 44 ACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEK--DGLDGHIGGHYLSALAIHY 101
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALK-------PVWAPYY 230
A+T N K++M +V + Q G + FP + E K W +Y
Sbjct: 102 AATGNQECKKRMDYMVSEFARVQQANDDGSICGFPNSKKFAEEIRKGNVGIVWNYWVAWY 161
Query: 231 TIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
+HK AGL D ++ N +A LK W V+ N + +ER L+ E
Sbjct: 162 NMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISNLDDR-----QMER---MLDNEF 213
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGMN+V + +T +PK+L A F + + D L + HANT +P +G Q
Sbjct: 214 GGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMTRRIDNLDNKHANTQVPKAVGYQRV 273
Query: 347 YEVTGDPL-----YKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEET 400
E+ + FF + V S + GG S E + + + +D + + E+
Sbjct: 274 AELNSKTASDYNEFMTAAEFFWETVVFHRSLSLGGNSRGEHFPEAGKCSDYMHERQGPES 333
Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
C T NMLK++ LFR ++ YAD+YERAL N +LS Q E G +Y P +
Sbjct: 334 CNTNNMLKLTEGLFRIHPKVEYADFYERALYNHILSTQH-PEHGGYVYFTPACPSHYRVY 392
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
S G + WCC GTG+E+ K G IY + + LY+ +I S +WK + +
Sbjct: 393 SAPG-----EAMWCCVGTGMENHGKYGQFIYTHDTVD-NALYVNLFIPSELNWKEKKIKI 446
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL-PPP 579
Q+ D P T + + Q L +R P W Q +G + P
Sbjct: 447 VQETDF-----PNEEGTTLTVNPSKATQFKLL-IRYPSWVEQGKMQVVCDGVDYAKNAQP 500
Query: 580 GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
G++++ +WS D + I+ P+++R E + P + +I+ GP LL T E
Sbjct: 501 GSYIAIDRQWSKGDVVEIKTPMTVRIEEL----PNVPNAISIMRGPILLGARTGTE 552
>gi|393782713|ref|ZP_10370896.1| hypothetical protein HMPREF1071_01764 [Bacteroides salyersiae
CL02T12C01]
gi|392672940|gb|EIY66406.1| hypothetical protein HMPREF1071_01764 [Bacteroides salyersiae
CL02T12C01]
Length = 796
Score = 208 bits (530), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 172/547 (31%), Positives = 249/547 (45%), Gaps = 47/547 (8%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
SL DV L S + A + YLL LDVD L+ R+ L + YGGWE
Sbjct: 41 SLSDVKL-TSGIFKGAMDLHKGYLLSLDVDRLIPHVRRNVGLTGKNENYGGWETH----G 95
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN-----------KIGTGYLSAF 211
G GHY+SA A M+AST ++++ ++ L ECQ + GY
Sbjct: 96 GCTYGHYMSACAMMYASTGEKIFRDRLEYMMDELKECQQQTQDGWFISGERAKEGYRKLL 155
Query: 212 PTELF-DSFEALKPVWA------PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYN 264
E+F + + K W +Y IHK+LAGL D Y+ A +A ++ + ++
Sbjct: 156 HGEVFLNRPDETKQPWNYNQNGNSWYCIHKVLAGLRDVYLYAGIQKAKEILMPLADF--- 212
Query: 265 RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQ 324
+ + + + +L+ E GGMN+V +Y+ T D K+L A F+ + +A
Sbjct: 213 -IADIALNSNKDLFQSTLSVEQGGMNEVFTDIYAFTGDYKYLETACRFNHINVIYPVANG 271
Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWW 384
D L HAN IP IG Y +Y+ F D+V +H+ A GG S E +
Sbjct: 272 EDVLFGRHANDQIPKFIGVAKEYAYDTKEIYRKAAENFWDMVVNNHTLAIGGNSCYERFG 331
Query: 385 DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
P + L + ETC TYNMLK+SR LF + Y +YYE AL N +L+ Q G
Sbjct: 332 MPGEESKRLDYSSAETCNTYNMLKLSRLLFMMNGDYKYLNYYEHALYNHILASQDPDMAG 391
Query: 445 VMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
+ Y L G K S T ++SFWCC GTG+E+ +K +SIYF+ N L I
Sbjct: 392 CVTYYTSLLPGSFKQYS-----TPYDSFWCCVGTGMENHAKYAESIYFK---NGNSLLIN 443
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
YI S +WK L D S T++ + S+ LR P W N
Sbjct: 444 LYIPSELNWKEQGFRLRLDTDFPES------DTISVCVVDKGRFSGSVMLRYPEWVEGN- 496
Query: 565 AQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
+ LNG+ + L ++ + D + I LP L +D+ P + S I++
Sbjct: 497 PEMMLNGRPVKLEYGKKEYIRLPDSIKSGDTIKIVLPRKLSVRYAKDE-PHFGS---IMY 552
Query: 624 GPYLLAG 630
GP LLAG
Sbjct: 553 GPILLAG 559
>gi|383777661|ref|YP_005462227.1| hypothetical protein AMIS_24910 [Actinoplanes missouriensis 431]
gi|381370893|dbj|BAL87711.1| hypothetical protein AMIS_24910 [Actinoplanes missouriensis 431]
Length = 939
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 150/490 (30%), Positives = 236/490 (48%), Gaps = 48/490 (9%)
Query: 160 ELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSF 219
ELRG+ + + + +A+ ++ + V+ + G+L+A+P F
Sbjct: 355 ELRGNLAWYRFDETEGT--TVADASGRDWDAAVITGVGGAPGPSHAGFLAAYPETQFVLL 412
Query: 220 EALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
E L +WAPYYT HKI+ GLLD + L NA AL + M E+ ++R+ K + ++
Sbjct: 413 EQLTTYPAIWAPYYTCHKIMRGLLDAHTLGGNATALDVVRGMGEWAHSRLSK-LPREQLD 471
Query: 277 RHW-YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
R W + E GGMN+V+ L ++T + L A FD L D L HAN
Sbjct: 472 RMWALYIAGEYGGMNEVMVDLATLTGNKTFLETARFFDNTKLLADCVADIDSLDGKHANQ 531
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTL-G 394
HIP +G YE D Y+ F D+V +Y GGT E + +A ++
Sbjct: 532 HIPQFLGYLRLYENGADKTYRTAAANFFDMVVPHRTYMHGGTGQGEVFRKRDVIAGSIVN 591
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRG----TEPGVMIYML 450
+ N E+C YNMLKV+R+LF + + DYYE+AL N +L+ +R T+P ++ YM+
Sbjct: 592 TTNAESCAAYNMLKVARNLFSHAPDGRFMDYYEKALVNQILASRRDVDSTTDP-LVTYMV 650
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
P+G G + G+G N CC GTG+E+ +K D+I+F LY+ YI S+
Sbjct: 651 PVGPGARR-----GYG---NIGTCCGGTGLENHTKYQDTIWF-RSAKSDTLYVNLYIPST 701
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-------TYSN 563
+W + + + Q D S P +T+T S++ + L LR+P W T ++
Sbjct: 702 LNWAAKKLTVTQTGDYPRS--PETTLTITGSARLD------LRLRVPSWADDDFSVTVNS 753
Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
Q G++ ++S W D +T+ P L E DD S+QA+L+
Sbjct: 754 KIQRVRAGRD-------GYVSLDRHWRSGDTITVSSPYRLHVERALDD----PSLQALLY 802
Query: 624 GPYLLAGHTS 633
GP L ++
Sbjct: 803 GPLALVAKST 812
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 49/90 (54%), Gaps = 1/90 (1%)
Query: 113 SVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLS 171
S+ + L Y D D +V +FR A L G + GGW++ LRGH+ GH++S
Sbjct: 79 SIFTEKRDRILAYARAYDADRIVSNFRTAAGLDNRGAQPPGGWDDATGNLRGHYSGHFIS 138
Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQN 201
AQ WA T A KEK+ +V +L ECQ+
Sbjct: 139 MLAQAWADTGEAIFKEKLDYIVTALKECQD 168
>gi|393782709|ref|ZP_10370892.1| hypothetical protein HMPREF1071_01760 [Bacteroides salyersiae
CL02T12C01]
gi|392672936|gb|EIY66402.1| hypothetical protein HMPREF1071_01760 [Bacteroides salyersiae
CL02T12C01]
Length = 673
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 167/601 (27%), Positives = 259/601 (43%), Gaps = 100/601 (16%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-------- 150
+ L +V L R Q + +Y+ L+ D + FR+ A + K
Sbjct: 34 FRSFGLDEVRLKDREFKLR-QNHDFDYIRTLEPDRYLSPFRRNAGIEVDSKGIPVDNTKH 92
Query: 151 YGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-------I 203
Y GWE L GHYLSA + M+ T + T+ K++ ++ L+ Q +
Sbjct: 93 YDGWEF----LGSSTFGHYLSAISMMYKVTGDTTLLHKINYIIDELNFIQRNPSYENENL 148
Query: 204 GTGYLSAFPTE---------LFDSFEALKP-----VWAP--------------------- 228
G L AF + +++ L+ AP
Sbjct: 149 RHGALVAFDRDRHKHVREPNFLRTYDELRQGQVNLTSAPDNRGATVENVYFKTFYWLSGG 208
Query: 229 --YYTIHKILAGLLDQYVLADNAQALKM-------ATWMVEYFYNRVQKVITMYSVERHW 279
+YT HKI AG+ D Y+ N +A K+ A W+ E +T ++ R
Sbjct: 209 LSWYTNHKIYAGIRDAYLYTGNPKAKKVFLSFCDWACWVTE--------KLTDHAFARML 260
Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-----PCFLGFLALQADYLSHFHAN 334
YS E G MN++L Y+ + + K+L A F++ PC G + A+ +SH HAN
Sbjct: 261 YS---EHGAMNEMLTDAYAFSGERKYLDCAFRFNEQETMVPCIDGDIKKIAETISHTHAN 317
Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
IP G +E TGD L+K+ F V S+ TGG S E + P + +
Sbjct: 318 AQIPQFYGLIKEFEYTGDSLFKVAAENFFKYVTNYQSFVTGGNSEWEQFRAPGNIMAQVT 377
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ ETC TYNMLK+++ LF T + Y +Y ERAL N +L ++PG Y L L
Sbjct: 378 RRSGETCNTYNMLKIAKGLFELTGDTLYLNYMERALYNHILPSIHTSQPGAFTYFLSLEP 437
Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
G K S ++S WCC GTG+E+ +K G+ IYF E V Y+ +++S+ W+
Sbjct: 438 GYFKTFS-----RPYDSHWCCVGTGMENHAKYGEFIYFHHEKEV---YVNLFVASALCWE 489
Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
+ D D R+ Q G++++L +R+P W G + +NG+ +
Sbjct: 490 KEGFQMETITDFPYESDVRFRIL------QNKGRIATLKIRIPRWAKEVGVK--VNGKMI 541
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
+L + W D + + LP+ LR E + P + A +GP LLAG
Sbjct: 542 KYKNRDGYLKLEKLWKIGDLVELTLPMYLRKEYV----PNCSDKFAFFYGPVLLAGRLGN 597
Query: 635 E 635
E
Sbjct: 598 E 598
>gi|365852804|ref|ZP_09393150.1| hypothetical protein HMPREF9103_01934 [Lactobacillus parafarraginis
F0439]
gi|363714017|gb|EHL97570.1| hypothetical protein HMPREF9103_01934 [Lactobacillus parafarraginis
F0439]
Length = 728
Score = 203 bits (517), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 158/578 (27%), Positives = 262/578 (45%), Gaps = 62/578 (10%)
Query: 97 NFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA-YGGWE 155
N +K VS ++V +S L + N+ ++L L D L++++RK A L T G WE
Sbjct: 3 NIMKPVSYYNVEYLPNSTLKEKFERNINWMLSLTPDQLLYNYRKNAGLDTKGATPLTVWE 62
Query: 156 NPISELRGHFVGHYLSASAQMWASTHN--------ATIKEKMSTVVFSLSECQNKIGT-- 205
+P RGHF GHYLS +++ + N +K ++ +V L E Q+K+
Sbjct: 63 SPDFFFRGHFTGHYLSGASKTFVELTNTDEKDPQAVELKNRVDLIVTGLKEVQDKLSETS 122
Query: 206 ---GYLSAFPTELFDSFEALK---PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
GYL+A P + FD+ E L+ + PYY I K++ GL+D Y N AL++ +
Sbjct: 123 EFPGYLAAEPEKRFDNLEKLRFNGNHYVPYYAIQKLMDGLMDAYQYTGNQTALQLVKNLT 182
Query: 260 EYFYNRVQKVI---TMYSVERHWYS------LNEETGGMNDVLYRLYSITHDPKHLL--L 308
Y R+ K+ ++ WY ++E G M+ L RLY +T + + L
Sbjct: 183 SYVEKRMAKLTPERISAMLDTRWYQGSGQYIFHQEFGAMHRTLLRLYELTGKKEQDVFDL 242
Query: 309 AHLFDKPCFLGFLALQADYLSHF--HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIV 366
A FD+ F L D L ++ H+NT + G Y VTGD YK +MD +
Sbjct: 243 AEKFDRKWFRDMLINNEDKLGYYSMHSNTELVCAEGMLEYYHVTGDDQYKKGVENYMDWM 302
Query: 367 NASHSYATGGTSAR-----------EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
+ H T G S R E + P+ L N E+C ++++ +S LF
Sbjct: 303 HTGHELPTKGISGRSAYPAPADYGSELYDYPEMFFKHLSKLNGESCCSHDLNYLSSELFA 362
Query: 416 WTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCC 475
TK+ + YE N +++ Q+ + + Y+ L + + G FWCC
Sbjct: 363 DTKDPVLMNDYEIRFINAIMA-QQNNDSAIAEYLYNLSVAPNSVKHYDRGG-----FWCC 416
Query: 476 YGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLR 535
G+G E S L D IY+++ ++ Y+ QY S + K V + Q D +
Sbjct: 417 VGSGTERHSTLVDGIYYQDNDDI---YVAQYFDSILNLKDQGVKVTQ--DAHYPDQHFAH 471
Query: 536 MTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKL 595
+T+ ++ ++ +R+P W S +++G+ + + P F++ WS ++
Sbjct: 472 ITVETEQPKDF----TIYVRVPKW--SAETTITVDGKAVKVQPENGFVAIKRNWSKKSEI 525
Query: 596 TIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
TI LR + + D + I AI +GP LLA +
Sbjct: 526 TINFDFQLRYQVLAD---RFNRI-AIYYGPILLAAQKA 559
>gi|302818287|ref|XP_002990817.1| hypothetical protein SELMODRAFT_429245 [Selaginella moellendorffii]
gi|300141378|gb|EFJ08090.1| hypothetical protein SELMODRAFT_429245 [Selaginella moellendorffii]
Length = 226
Score = 201 bits (511), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 95/133 (71%), Positives = 108/133 (81%)
Query: 168 HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWA 227
HYLSASA WASTHN TI E M+ VV +L+ECQ KIGTGYLSAFPT LFD FEAL+ VWA
Sbjct: 25 HYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSAFPTSLFDRFEALESVWA 84
Query: 228 PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
PYYTIHKI+AGLLDQY A N+ A +M M +YF +RV++VI YS+ERHW SLNEETG
Sbjct: 85 PYYTIHKIMAGLLDQYTYAANSFAFEMLLGMTDYFGSRVERVIEKYSIERHWQSLNEETG 144
Query: 288 GMNDVLYRLYSIT 300
GMNDVLYR+Y IT
Sbjct: 145 GMNDVLYRVYQIT 157
>gi|159491178|ref|XP_001703550.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280474|gb|EDP06232.1| predicted protein [Chlamydomonas reinhardtii]
Length = 226
Score = 200 bits (509), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 108/197 (54%), Positives = 136/197 (69%), Gaps = 4/197 (2%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLL-MLDVDSLVWSFRKTASLPTPGKAY-GGWEN 156
++ + L DV L +++ R ++ N +YLL ML+ D L+WSFRKT+ LPTPG Y WE+
Sbjct: 28 IEPLPLSDVRLLDTALQARYEKLNAKYLLDMLEPDRLLWSFRKTSGLPTPGTPYIASWED 87
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
P ELRGHFVGHYLSA + A T N+ K ++ +V L + Q K+GTGYLSAFPTE F
Sbjct: 88 PGCELRGHFVGHYLSALSLALAGTGNSAFKTRLDLMVSELGKVQEKLGTGYLSAFPTEFF 147
Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
D EALKPVWAPYYTIHKI+AGL+D + LA + AL MAT MV+Y +NR Q VI E
Sbjct: 148 DRVEALKPVWAPYYTIHKIIAGLVDAHELAGHPSALAMATRMVDYHWNRTQAVIAAKGRE 207
Query: 277 RHWYS-LNEETGGMNDV 292
HW + LN E GGMN+V
Sbjct: 208 -HWNAVLNCEFGGMNEV 223
>gi|256378728|ref|YP_003102388.1| hypothetical protein Amir_4712 [Actinosynnema mirum DSM 43827]
gi|255923031|gb|ACU38542.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 881
Score = 199 bits (506), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 188/645 (29%), Positives = 285/645 (44%), Gaps = 92/645 (14%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGK-AYGGWEN- 156
L+ L DV L V RA L + VD ++ FR A L T G G WE+
Sbjct: 9 LEPFPLRDVEL-LDGVQSRAAGQMLHLARVFPVDRVLAVFRANAGLDTRGALPPGNWEDF 67
Query: 157 --------------------PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSL 196
S LRGH+ GH+LS A AST +++ K +V L
Sbjct: 68 GHPDERPWSAEEYPGAGVAPTASLLRGHYAGHFLSMVALAHASTGEESLRAKAWEIVAGL 127
Query: 197 SECQNKIGT-------GYLSAFPTELFDSFEALKP---VWAPYYTIHKILAGLLDQYVLA 246
+E ++ + G+L+A+ F E L P +WAPYYT HKI+AGLLD +
Sbjct: 128 AEVRDALAATGRYSHPGFLAAYGEWQFSRLEDLAPYGEIWAPYYTCHKIMAGLLDAHEHT 187
Query: 247 DNAQALKMATWMVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPKH 305
+ QAL++A M + RV ++ + ++R W + E GGMN+ L L+ IT +
Sbjct: 188 GSEQALELAVGMGHWVAGRVLRLERAH-LQRMWSLYIAGEFGGMNESLAALHRITGEEVF 246
Query: 306 LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDI 365
L A F+ L A D L HAN H+P+++G +Y+ TG+ Y T D
Sbjct: 247 LRAAAAFELDHLLEGAAQGRDLLDGMHANQHLPMLVGHLDQYDATGETRYLDAVTALWDQ 306
Query: 366 VNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
V ++A GGT E W +A +G N E+C TYN+LK++R LF T + Y +Y
Sbjct: 307 VVPGRTFAHGGTGEGELWGPADTVAGFIGRRNAESCATYNLLKIARSLFARTGDARYPEY 366
Query: 426 YERALTNGVLSIQRGTEPGV---MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIES 482
ERA N ++ + + V ++YM P+ G + N CC GTG+E+
Sbjct: 367 AERAWLNHMVGSRADLDSDVSPEVVYMYPVDAGAVREYD--------NVGTCCGGTGLET 418
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
K D ++F G L + +++ S G V + P R+ + F +
Sbjct: 419 HVKHQDWVWFHAPGK---LVVARHVPSRVTLPGGGSVALRTGYPRDG-----RVVVEFDA 470
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
L+LR+P W A ++G+ +PL G F + + D++ + LPL
Sbjct: 471 DFS----GELHLRVPSWAT---AGYLVDGERVPL-TDGGFAVLSRDFRRGDEVELVLPLP 522
Query: 603 LRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPS----FN 658
LR + DD P S++ GP +L AR +A + P+ P+ +
Sbjct: 523 LRLVSTVDD-PTLVSVE---LGPTVL-------------LARDDAATVLPVSPAAFRGLD 565
Query: 659 AQLVTFTQESGNSTFVMSNSNQSITMEEFPV-SGTDAALHATFRL 702
LV + ++ +F +T E P SG DA HA RL
Sbjct: 566 GSLVGYERDGDLVSF------GGLTFE--PAWSGGDARYHAYLRL 602
>gi|357472913|ref|XP_003606741.1| hypothetical protein MTR_4g065110 [Medicago truncatula]
gi|355507796|gb|AES88938.1| hypothetical protein MTR_4g065110 [Medicago truncatula]
Length = 203
Score = 199 bits (505), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 100/165 (60%), Positives = 120/165 (72%), Gaps = 3/165 (1%)
Query: 3 FGFVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVLSHFHLTPTDDSAW 61
F F+ G K+CTN P SH FRYEL S N+TWK+EV+SH+H+TPTD+SAW
Sbjct: 6 FMFMFMALMLRGCVTIKECTN-IPTQSHTFRYELFASKNETWKKEVMSHYHVTPTDESAW 64
Query: 62 SSLIPSKILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQT 121
++L+P KIL ++ ++ WAL+YRKIKN G F P FLKEV L DV L + S+ AQQT
Sbjct: 65 ATLLPRKILSEE-NQHDWALMYRKIKNLGVFKPPVGFLKEVPLGDVRLLEGSIHAVAQQT 123
Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
NLEYLLMLDVD L+WSFRKTA LPTPG YGGWE P +ELRGHFV
Sbjct: 124 NLEYLLMLDVDRLIWSFRKTAGLPTPGNPYGGWEEPNTELRGHFV 168
>gi|29348320|ref|NP_811823.1| hypothetical protein BT_2911 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383124515|ref|ZP_09945178.1| hypothetical protein BSIG_1739 [Bacteroides sp. 1_1_6]
gi|29340224|gb|AAO78017.1| putative Acetyl-CoA carboxylase, biotin carboxylase [Bacteroides
thetaiotaomicron VPI-5482]
gi|251841333|gb|EES69414.1| hypothetical protein BSIG_1739 [Bacteroides sp. 1_1_6]
Length = 655
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 166/558 (29%), Positives = 257/558 (46%), Gaps = 49/558 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP- 157
L+EV L D S Q+ EYLL L+ DSL+ +R A LP+ Y GWE+
Sbjct: 48 LREVRLLD------SPFLDLQRKGKEYLLWLNPDSLLHFYRIEAGLPSKAAPYAGWESQD 101
Query: 158 ---ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP-- 212
LRG F+G YLS+ + M+ ST + + +++ V+ L CQ G+L
Sbjct: 102 VWGAGPLRGGFLGFYLSSVSMMYQSTDDKRLLKRLKYVLKELELCQKAGKDGFLLGLKDG 161
Query: 213 TELFDSFEALK---------PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFY 263
+LF + K WAP Y I+K+L GL Y +AL + + ++F
Sbjct: 162 RKLFAEVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCQMEEALPILIRLADWFG 221
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
+V +T ++R L E G +N+ Y +T + + L A + G L+
Sbjct: 222 YQVLDKLTDDQIQR---LLICEHGSINESYVEAYELTGEKRFLDWARRLNDHAMWGPLSE 278
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
D L +HANT IP G Y+ TGD + T F +IV +H++ GG S E +
Sbjct: 279 GKDILFGWHANTQIPKFTGFHKYYQFTGDERFLTAATNFWNIVTQNHTWVIGGNSTGEHF 338
Query: 384 WDPKRLAD-TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
+ + AD L ETC + NML+++ LF + A A YYER L N +LS E
Sbjct: 339 FPKEEFADRVLLVGGPETCNSVNMLRLTESLFCQYPDAAKASYYERVLFNHILS-AYDPE 397
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV---P 499
G+ Y + G + + ++ +SFWCC TG+ES +KL IY + + P
Sbjct: 398 KGMCCYFTSMRPGHYRI-----YASRDSSFWCCGHTGLESPAKLSKFIYSHSKRIIDGDP 452
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
+ + +I S WK + L Q+ + + L KQE+ L +R P W
Sbjct: 453 DIRVNLFIPSILFWKEKGIELIQQNR--LPESEQVSFMLNLKKKQEL----ILRIRKPDW 506
Query: 560 TYSNGAQASLNGQ-NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ-DDRPEYAS 617
++ +NG+ P+ + W+ +K+ +QLP+ + E++ DR YA
Sbjct: 507 --ADKVTFIINGKVEYPILDKDGYWVVNRTWARKNKIILQLPMHVYVESLMGSDR--YA- 561
Query: 618 IQAILFGPYLLAGHTSGE 635
A+L+GPY+LAG E
Sbjct: 562 --ALLYGPYVLAGRMGTE 577
>gi|408500683|ref|YP_006864602.1| acetyl-CoA carboxylase [Bifidobacterium asteroides PRL2011]
gi|408465507|gb|AFU71036.1| acetyl-CoA carboxylase [Bifidobacterium asteroides PRL2011]
Length = 807
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 151/492 (30%), Positives = 225/492 (45%), Gaps = 33/492 (6%)
Query: 107 VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFV 166
V L S+ AQQ +YLL LD D L+ +R+ A L Y WE+ L GH
Sbjct: 26 VRLTPGSIYADAQQAGADYLLSLDPDRLLAPYRREAGLTATADPYPNWES--MGLDGHIG 83
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF-------- 216
GHYLS A W S E+ + ++ L ECQ G G+L P ELF
Sbjct: 84 GHYLSGLAAYWQSLQTWPFLERATRMLTGLLECQEASGDGFLGGMPHSAELFRNLREGHV 143
Query: 217 --DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYS 274
SF+ L W P Y +HK+ AGLLD + A +MA MV + +
Sbjct: 144 QAQSFDLLG-SWVPLYNLHKLFAGLLDCWQSFQTKGASEMARVMVLRLADWWCDLADNID 202
Query: 275 VERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAH-LFDKPCFLGFLALQADYLSHFHA 333
+ L E GG+N+ RLY +T ++L A L D+P F LA+ D L+ HA
Sbjct: 203 EQDFQTMLTCEYGGLNEAFARLYQLTGKDRYLRQARRLTDRP-FFEPLAVGKDQLTGLHA 261
Query: 334 NTHIPIVIGSQMRYEVTGDPLYKL-IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
NT IP V+G + E+TGD ++ + TF+ +V+ + + G S E + P +
Sbjct: 262 NTQIPKVLGYERLAEITGDQAFRTAVDTFWHGVVD-KRTVSIGAHSISEHFNPPDDFSAM 320
Query: 393 LGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
+ S E ETC +YNM K++ L+ T + Y D+YER L N ++S E G +Y P
Sbjct: 321 VTSREGLETCNSYNMAKLALRLYDRTGQARYLDFYERVLVNHLVSTVGIREHG-FVYFTP 379
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG-----LYIIQY 506
+ + R + + SFWCC GTG+E+ ++ G I+ G PG L + +
Sbjct: 380 M-----RPRHYRVYSSAQRSFWCCVGTGLENHARYGAMIFERRPGKDPGQESESLAVNLF 434
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
I +S DW + ++ P R+ L + + Q L++R P W +
Sbjct: 435 IPASLDWSQRGLRVSLAYAPGPGTTNLGRIDLEADDQSQ--QTLDLDIRHPWWVEDADYR 492
Query: 567 ASLNGQNLPLPP 578
+ N+ + P
Sbjct: 493 IAQGQANMTVEP 504
>gi|261415299|ref|YP_003248982.1| hypothetical protein Fisuc_0892 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|385790233|ref|YP_005821356.1| hypothetical protein FSU_1340 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|261371755|gb|ACX74500.1| protein of unknown function DUF1680 [Fibrobacter succinogenes
subsp. succinogenes S85]
gi|302327243|gb|ADL26444.1| conserved hypothetical protein [Fibrobacter succinogenes subsp.
succinogenes S85]
Length = 897
Score = 196 bits (499), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 173/624 (27%), Positives = 277/624 (44%), Gaps = 60/624 (9%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
+L DV L VL Q N+E LL DVD L+ F + A + + W + L
Sbjct: 36 ALSDVQL-LDGVLKERQDLNVETLLSYDVDRLLAPFYEEAGMKPKASKFPNW----AGLD 90
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGT-----GYLSAFPT--EL 215
GH +GHYLSA A +A + +KE++ ++ L Q++ GY+S P ++
Sbjct: 91 GHVLGHYLSALAMHYADNDDVQVKERLEYILKELKTIQDQNSKDNNFKGYISGVPNGKQM 150
Query: 216 FDSFE-----ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVI 270
+ + A W P+Y IHK+ AGL D YV A QA M + ++ +
Sbjct: 151 WLKMKNGDAGAQNGYWVPWYNIHKLYAGLRDAYVYAGYEQAKTMFLALCDWGIT----IT 206
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSH 330
+ + L E GGM +V Y +T D K+L A + L ++ D L++
Sbjct: 207 NGLNDSKMQQMLGTEHGGMPEVYADAYKLTKDEKYLNAAKKWSHQWLLNPMSQGNDNLTN 266
Query: 331 FHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW---WDPK 387
HANT +P V+G E++GD YK FF V S A GG S E + + K
Sbjct: 267 VHANTQVPKVVGFARIAELSGDEKYKKGSDFFWQTVVNKRSIAIGGNSISEHFPALNNHK 326
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+ + E E+C TYNMLK++ LF + Y D+YERAL N +LS T G +
Sbjct: 327 KFIEE--REGPESCNTYNMLKLTERLFNIKHDAHYTDFYERALFNHILSTIHPTHGG-YV 383
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y P ++ R + WCC G+G+E+ +K IY +++ LY+ +
Sbjct: 384 YFTP-----ARPRHYRVYSKVNAGMWCCVGSGMENPAKYNQFIYTKDKD---ALYVNLFA 435
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
+S +WK V + Q+ + T+T S + + + +R P W +
Sbjct: 436 ASILNWKDKSVKIKQET--AFPKGESSKFTITGSGEFD------MQIRHPYWVKEGAFKV 487
Query: 568 SLNGQN-LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
+NG + P +++SA + W D + + P+ E D P A+L GP
Sbjct: 488 IVNGDTVVKKSTPSSYVSAGKSWKSGDVVEVLYPMYTHVE----DLPGVTDYVALLHGPI 543
Query: 627 LLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEE 686
+L+ KTGTA +L+ L++ + + + ES + ++++ + I +
Sbjct: 544 VLSA--------KTGTA-NLNGLVA--DDGRWSHIASGALESLDQAPMLASKKEDIPSKV 592
Query: 687 FPVSGTDAALHATFRLI-LKDASL 709
PV G A + KDA+L
Sbjct: 593 EPVKGEPLHFKAPYLFAKQKDANL 616
>gi|265753026|ref|ZP_06088595.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263236212|gb|EEZ21707.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 808
Score = 196 bits (497), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 162/574 (28%), Positives = 255/574 (44%), Gaps = 54/574 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE--- 155
LKEV L D S N Y+L L+ D L+ FR+ A L + Y WE
Sbjct: 39 LKEVRLLD------SDFKHIMDLNHAYMLSLEPDRLLSWFRREAGLTPKAQPYPFWESEY 92
Query: 156 -NPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL------ 208
N L GH +G YLS + M+ ST + I ++S ++ LS CQ G GYL
Sbjct: 93 MNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQAGGDGYLLPTICG 152
Query: 209 -SAFPTELFDSFEALKP--------VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
+ F L +F+ P W P Y ++KI+ GL Y+ D QA ++ M
Sbjct: 153 RAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMRCDLLQAKEILVKMA 212
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
++F V ++ +++ L E G +N+ +Y IT + K+L A +
Sbjct: 213 DWFGYSVIDKLSHDDLQK---LLVCEHGSINESFIDVYQITGEEKYLKWAQRLNDEDMWV 269
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
++ D L +HANT IP G + Y + + FF D V H++ GG S
Sbjct: 270 PMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDTVVRKHTWVMGGNST 329
Query: 380 REFWWDPKRLADTLG-SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
E ++ P+ + + E+C + NML+++ L+ E+ DYYE+ L N +L+
Sbjct: 330 GEHFFAPEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEKVDYYEKVLFNHILA-N 388
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
+ G+ +Y + G K +GTK++SFWCC GTG E +K G IY +
Sbjct: 389 YDPDQGMCVYYTSMKPGHYKI-----YGTKYDSFWCCTGTGFEQTAKFGQMIYAHTDD-- 441
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
LY+ +I S W G + + P +LT S + + +L +R P
Sbjct: 442 -ALYVNMFIPSVVTWNKGVSIHQETAFPDEG-----VTSLTVSGE----AVFNLKIRCPY 491
Query: 559 WTYSNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W S+ +NG+ + + ++S +W DK+ I+LP+ L + E A
Sbjct: 492 WVGSSSLNVIVNGKREKIKAGMDGYVSINRQWKDGDKVRIELPMKLEIVPLN----EAAH 547
Query: 618 IQAILFGPYLLAGHTSGEWDIKTG--TARSLSAL 649
A+ +GP +LA S E K +ARS A+
Sbjct: 548 YLALKYGPIVLAARISDEHLSKDDFRSARSTVAM 581
>gi|423219866|ref|ZP_17206362.1| hypothetical protein HMPREF1061_03135 [Bacteroides caccae
CL03T12C61]
gi|392625071|gb|EIY19149.1| hypothetical protein HMPREF1061_03135 [Bacteroides caccae
CL03T12C61]
Length = 655
Score = 195 bits (496), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 169/559 (30%), Positives = 256/559 (45%), Gaps = 51/559 (9%)
Query: 99 LKEVSLHD-VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
LKE+ L D +LD QQ EYLL L+ DSL+ +R A L + Y GWE+
Sbjct: 48 LKEIRLSDGPFLD-------LQQKGKEYLLWLNPDSLLHFYRIEAGLSSKAGPYAGWESQ 100
Query: 158 ----ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP- 212
LRG F+G YLS+ + M+ ST + + ++ V+ L CQ G+L
Sbjct: 101 DVWGAGPLRGGFLGFYLSSVSMMYQSTGDRELLRRLKYVLKELKLCQEAGKDGFLLGVKG 160
Query: 213 -TELFDSFEALK---------PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYF 262
ELF + K WAP Y I+K+L GL Y D +AL + + ++F
Sbjct: 161 GRELFREVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCDLKEALPILVRLADWF 220
Query: 263 YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLA 322
++V +T +++ L E G +N+ +Y +T + L A + L+
Sbjct: 221 GSQVLDKLTDEQIQQ---LLICEHGSINESYVEVYELTGQKRFLDWARRLNDRAMWVPLS 277
Query: 323 LQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREF 382
D L +HANT IP G Y TGD + L T F +IV +H++ GG S E
Sbjct: 278 EGKDVLFGWHANTQIPKFTGFHKYYMFTGDRAFLLAATNFWNIVKQNHTWVIGGNSTGEH 337
Query: 383 WWDPKRLAD-TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
++ K D L ETC + NML+++ LF + A YYER L N +LS
Sbjct: 338 FFSKKEFIDRMLHISGPETCNSVNMLRLTEALFMQQPDATKAAYYERTLFNHILSAYDPV 397
Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV--- 498
+ G+ Y + G + + ++ +SFWCC TG+ES +KLG IY + N
Sbjct: 398 K-GMCCYFTSMRPGHYRI-----YASRDSSFWCCGHTGLESPAKLGKFIYSHKVTNRHQE 451
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+ + +I S WK V L Q+ + + +TL KQ++ L +R P
Sbjct: 452 KDIRVNLFIPSILSWKEEGVELIQQSR--IPESEQVDLTLNLKKKQKL----ILRIRKPD 505
Query: 559 WTYSNGAQASLNG-QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ-DDRPEYA 616
WT + A +NG + PL + W + +T++LP+ + TE + DR
Sbjct: 506 WT--DKATFIINGEEEQPLLGSDGYWIIDRVWERKNVITLRLPMHIYTENLTGTDR---- 559
Query: 617 SIQAILFGPYLLAGHTSGE 635
A+L+GPY+LAG E
Sbjct: 560 -YVALLYGPYVLAGRMGKE 577
>gi|237711616|ref|ZP_04542097.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|229454311|gb|EEO60032.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
Length = 780
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 161/574 (28%), Positives = 254/574 (44%), Gaps = 54/574 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE--- 155
LKEV L D S N Y+L L+ D L+ FR+ A L + Y WE
Sbjct: 11 LKEVRLLD------SDFKHIMDLNHAYMLSLEPDRLLSWFRREAGLTPKAQPYPFWESEY 64
Query: 156 -NPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL------ 208
N L GH +G YLS + M+ ST + I ++S ++ LS CQ G GYL
Sbjct: 65 MNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQAGGDGYLLPTICG 124
Query: 209 -SAFPTELFDSFEALKP--------VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
+ F L +F+ P W P Y ++KI+ GL Y+ D QA ++ M
Sbjct: 125 RAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMRCDLLQAKEILVKMA 184
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
++F V ++ +++ L E G +N+ +Y IT + K+L A +
Sbjct: 185 DWFGYSVIDKLSHDDLQK---LLVCEHGSINESFIDVYQITGEEKYLKWAQRLNDEDMWV 241
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
++ D L +HANT IP G + Y + + FF D V H++ GG S
Sbjct: 242 PMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDTVVRKHTWVMGGNST 301
Query: 380 REFWWDPKRLADTLG-SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
E ++ P+ + + E+C + NML+++ L+ E+ DYYE+ L N +L+
Sbjct: 302 GEHFFAPEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEKVDYYEKVLFNHILA-N 360
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
+ G+ +Y + G K +GTK++SFWCC GTG E +K G IY +
Sbjct: 361 YDPDQGMCVYYTSMKPGHYKI-----YGTKYDSFWCCTGTGFEQTAKFGQMIYAHTDD-- 413
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
LY+ +I S W G + + P +LT S + + +L +R P
Sbjct: 414 -ALYVNMFIPSVVTWDKGISIHQETAFPDEG-----VTSLTVSGE----AVFNLKIRCPY 463
Query: 559 WTYSNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W S+ +NG+ + + ++S +W DK+ I+LP+ L + E
Sbjct: 464 WVGSSSLNVIVNGKREKIKAGVDGYVSINRQWKDGDKVRIELPMKLEIVPLN----EATH 519
Query: 618 IQAILFGPYLLAGHTSGEWDIKTG--TARSLSAL 649
A+ +GP +LA S E K +ARS A+
Sbjct: 520 YLALKYGPIVLAARISDEHLSKDDFRSARSTVAM 553
>gi|153805786|ref|ZP_01958454.1| hypothetical protein BACCAC_00022 [Bacteroides caccae ATCC 43185]
gi|149130463|gb|EDM21669.1| hypothetical protein BACCAC_00022 [Bacteroides caccae ATCC 43185]
Length = 659
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 169/559 (30%), Positives = 255/559 (45%), Gaps = 51/559 (9%)
Query: 99 LKEVSLHD-VWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
LKE+ L D +LD QQ EYLL L+ DSL+ +R A L + Y GWE+
Sbjct: 52 LKEIRLSDGPFLD-------LQQKGKEYLLWLNPDSLLHFYRIEAGLSSKAGPYAGWESQ 104
Query: 158 ----ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP- 212
LRG F+G YLS+ + M+ ST + + ++ V+ L CQ G+L
Sbjct: 105 DVWGAGPLRGGFLGFYLSSVSMMYQSTGDRELLRRLKYVLKELKLCQEAGKDGFLLGVKG 164
Query: 213 -TELFDSFEALK---------PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYF 262
ELF + K WAP Y I+K+L GL Y D +AL + + ++F
Sbjct: 165 GRELFREVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCDLKEALPILVRLADWF 224
Query: 263 YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLA 322
++V +T +++ L E G +N+ +Y +T + L A + L+
Sbjct: 225 GSQVLDKLTDEQIQQ---LLICEHGSINESYVEVYELTGQKRFLDWARRLNDRAMWVPLS 281
Query: 323 LQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREF 382
D L HANT IP G Y TGD + L T F +IV +H++ GG S E
Sbjct: 282 EGKDVLFGGHANTQIPKFTGFHKYYMFTGDRAFLLAATNFWNIVKQNHTWVIGGNSTGEH 341
Query: 383 WWDPKRLAD-TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
++ K D L ETC + NML+++ LF + A YYER L N +LS
Sbjct: 342 FFSKKEFIDRMLHISGPETCNSVNMLRLTEALFMQQPDATKAAYYERTLFNHILSAYDPV 401
Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV--- 498
+ G+ Y + G + + ++ +SFWCC TG+ES +KLG IY + N
Sbjct: 402 K-GMCCYFTSMRPGHYRI-----YASRDSSFWCCGHTGLESPAKLGKFIYSHKVTNRHQE 455
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+ + +I S WK V L Q+ + + +TL KQ++ L +R P
Sbjct: 456 KDIRVNLFIPSILSWKEEGVELIQQSR--IPESEQVDLTLNLKKKQKL----ILRIRKPD 509
Query: 559 WTYSNGAQASLNG-QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD-DRPEYA 616
WT + A +NG + PL + W + +T++LP+ + TE + DR
Sbjct: 510 WT--DKATFIINGEEEQPLLGSDGYWIIDRVWERKNVITLRLPMHIYTENLTGTDR---- 563
Query: 617 SIQAILFGPYLLAGHTSGE 635
A+L+GPY+LAG E
Sbjct: 564 -YVALLYGPYVLAGRMGKE 581
>gi|212695364|ref|ZP_03303492.1| hypothetical protein BACDOR_04911 [Bacteroides dorei DSM 17855]
gi|345513936|ref|ZP_08793451.1| hypothetical protein BSEG_01968 [Bacteroides dorei 5_1_36/D4]
gi|423230909|ref|ZP_17217313.1| hypothetical protein HMPREF1063_03133 [Bacteroides dorei
CL02T00C15]
gi|423241462|ref|ZP_17222575.1| hypothetical protein HMPREF1065_03198 [Bacteroides dorei
CL03T12C01]
gi|423244620|ref|ZP_17225695.1| hypothetical protein HMPREF1064_01901 [Bacteroides dorei
CL02T12C06]
gi|212662093|gb|EEB22667.1| hypothetical protein BACDOR_04911 [Bacteroides dorei DSM 17855]
gi|229435750|gb|EEO45827.1| hypothetical protein BSEG_01968 [Bacteroides dorei 5_1_36/D4]
gi|392630029|gb|EIY24031.1| hypothetical protein HMPREF1063_03133 [Bacteroides dorei
CL02T00C15]
gi|392641355|gb|EIY35132.1| hypothetical protein HMPREF1065_03198 [Bacteroides dorei
CL03T12C01]
gi|392641469|gb|EIY35245.1| hypothetical protein HMPREF1064_01901 [Bacteroides dorei
CL02T12C06]
Length = 808
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 161/574 (28%), Positives = 254/574 (44%), Gaps = 54/574 (9%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE--- 155
LKEV L D S N Y+L L+ D L+ FR+ A L + Y WE
Sbjct: 39 LKEVRLLD------SDFKHIMDLNHAYMLSLEPDRLLSWFRREAGLTPKAQPYPFWESEY 92
Query: 156 -NPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL------ 208
N L GH +G YLS + M+ ST + I ++S ++ LS CQ G GYL
Sbjct: 93 MNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQAGGDGYLLPTICG 152
Query: 209 -SAFPTELFDSFEALKP--------VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMV 259
+ F L +F+ P W P Y ++KI+ GL Y+ D QA ++ M
Sbjct: 153 RAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMRCDLLQAKEILVKMA 212
Query: 260 EYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLG 319
++F V ++ +++ L E G +N+ +Y IT + K+L A +
Sbjct: 213 DWFGYSVIDKLSHDDLQK---LLVCEHGSINESFIDVYQITGEEKYLKWAQRLNDEDMWV 269
Query: 320 FLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA 379
++ D L +HANT IP G + Y + + FF D V H++ GG S
Sbjct: 270 PMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDTVVRKHTWVMGGNST 329
Query: 380 REFWWDPKRLADTLG-SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
E ++ P+ + + E+C + NML+++ L+ E+ DYYE+ L N +L+
Sbjct: 330 GEHFFAPEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEKVDYYEKVLFNHILA-N 388
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
+ G+ +Y + G K +GTK++SFWCC GTG E +K G IY +
Sbjct: 389 YDPDQGMCVYYTSMKPGHYKI-----YGTKYDSFWCCTGTGFEQTAKFGQMIYAHTDD-- 441
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
LY+ +I S W G + + P +LT S + + +L +R P
Sbjct: 442 -ALYVNMFIPSVVTWDKGISIHQETAFPDEG-----VTSLTVSGE----AVFNLKIRCPY 491
Query: 559 WTYSNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
W S+ +NG+ + + ++S +W DK+ I+LP+ L + E
Sbjct: 492 WVGSSSLNVIVNGKREKIKAGVDGYVSINRQWKDGDKVRIELPMKLEIVPLN----EATH 547
Query: 618 IQAILFGPYLLAGHTSGEWDIKTG--TARSLSAL 649
A+ +GP +LA S E K +ARS A+
Sbjct: 548 YLALKYGPIVLAARISDEHLSKDDFRSARSTVAM 581
>gi|444305788|ref|ZP_21141565.1| hypothetical protein G205_09453 [Arthrobacter sp. SJCon]
gi|443481842|gb|ELT44760.1| hypothetical protein G205_09453 [Arthrobacter sp. SJCon]
Length = 444
Score = 191 bits (485), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 137/410 (33%), Positives = 194/410 (47%), Gaps = 29/410 (7%)
Query: 117 RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQM 176
+AQ T++ Y+L LD D L + A L +AYG WE+ L GH GHYLS A++
Sbjct: 23 QAQDTSVRYILSLDADRLFAPYLHEAGLVRAAEAYGNWES--DGLGGHIGGHYLSGCARL 80
Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------ELFDSFEALKPV 225
+A+T NA + K+ V L CQ G GY+ P E+ L
Sbjct: 81 YAATGNAELLAKVRAAVVILGNCQAAHGDGYVGGVPRGGDLGQELARGEVDADLFTLNGR 140
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
W P Y +HK LAGLLD V A + +AL +A + ++ RV + + E L+ E
Sbjct: 141 WVPLYNLHKTLAGLLDARVFAGSGEALDIAVGLAGWWL-RVSAHLADDAFEE---VLHAE 196
Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
GGMN+ L+ +T ++L A F L LA D L HANT IP V+G
Sbjct: 197 FGGMNEAFALLWELTGREEYLREARRFSHRALLDPLAAGQDLLDGLHANTQIPKVVGYAR 256
Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTY 404
T D F + V + S + GG S RE + + + + ETC TY
Sbjct: 257 LAGPTHDADLAHACDIFWESVVSRRSVSIGGNSVREHFHPASDFSPMVQDPQGPETCNTY 316
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGVSKARSTH 463
NMLK+++ F + A D++ERA N +LS Q GT G ++Y P+ G + S
Sbjct: 317 NMLKLAKLRFEAHGDAAAVDFFERATYNHILSSQHPGT--GGLVYFTPMRPGHYRVYS-- 372
Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
S WCC G+G+E+ ++ G+ IY GN L + YI S+ DW
Sbjct: 373 ---RAQESMWCCVGSGLENHARYGELIY-SRAGN--DLLVNLYIPSTLDW 416
>gi|393782707|ref|ZP_10370890.1| hypothetical protein HMPREF1071_01758 [Bacteroides salyersiae
CL02T12C01]
gi|392672934|gb|EIY66400.1| hypothetical protein HMPREF1071_01758 [Bacteroides salyersiae
CL02T12C01]
Length = 1293
Score = 189 bits (480), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 153/569 (26%), Positives = 253/569 (44%), Gaps = 61/569 (10%)
Query: 115 LWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASA 174
L +A N+ YL DV+ L+ K K YGG + HYLSA +
Sbjct: 457 LKQAMDKNITYLKSFDVNRLLAQTFKYNLGIDDYKLYGGANDAT-------FAHYLSAIS 509
Query: 175 QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA--FPTELFDSFEALKPV------- 225
+A+T + + ++++ +V + + Q+ +G G S PT F K +
Sbjct: 510 MGYAATGDEDLLQRVNHMVDVMIQAQDVMGDGLYSNNDAPTWGFYKMAKEKVITPYGWDE 569
Query: 226 ----WA------PYYTIHKILAGLLDQYVLADNAQA----LKMATWMVEYFYNRVQKVIT 271
W P+Y HK A D Y+ A N A +K W+V + N
Sbjct: 570 NGHPWGNNNIGFPFYAHHKAFAAFRDAYIYAGNENARVAFVKFCEWLVMWMQN------- 622
Query: 272 MYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
++ + L E GGM +VL Y+++ K L A F + F ++ D LS
Sbjct: 623 -FTDDNLQKMLESEHGGMVEVLSDAYALSGKIKFLDAARRFTRDNFAAAMSGNRDDLSGR 681
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
H+N H+P+ +G+ + Y +GD F IV+ H+ GG E + P L
Sbjct: 682 HSNFHVPMAVGAAIHYLYSGDERSGKTAHNFFHIVHDHHTLCNGGNGNNERFGTPDLLTY 741
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
LG ETC++YNMLK+++ LF + Y DYYE + N +L+I + Y +
Sbjct: 742 RLGQRGPETCSSYNMLKLAKDLFCQEGDTEYLDYYENTMWNHILAILSPRSDAGVCYHVN 801
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
L G K S +++ WCC GTG+ES +K D+IYF +G++ G+ + + S+
Sbjct: 802 LKPGTFKMYS-----DLYSNLWCCVGTGMESHAKYVDAIYF--KGDI-GILVNLFTPSTL 853
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
+W+ + L + D V+ + L + + S +++ +R P W G ++NG
Sbjct: 854 NWEETGLKLTMETDFPVTNNVKLIINESGSFNKDIC------IRYPSWVEEGGIAITING 907
Query: 572 QNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
+ PG + + W+ D++ I +P LR + DD ++ AI +GP LLA
Sbjct: 908 AKQKISAKPGEIIKLSSSWAAGDEILITIPCKLRLVDLPDD----INVSAIFYGPVLLAA 963
Query: 631 HTS--GEWDIKTGTARSLSALISPIPPSF 657
+ G+ DI G + + P P ++
Sbjct: 964 NMGEVGQSDI--GFSWPQEEIKDPAPDAY 990
>gi|265751351|ref|ZP_06087414.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263238247|gb|EEZ23697.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 791
Score = 189 bits (480), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 145/548 (26%), Positives = 252/548 (45%), Gaps = 45/548 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L EV + D + Q + +YLL L+ D L+ FR+ A L + Y WE+
Sbjct: 18 LSEVRITDKYFKH------IQDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPFWESED 71
Query: 159 ----SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA---- 210
L GH +G Y+S+ + M+ +T++ I ++++ +V L CQ G GYL A
Sbjct: 72 VWGGGPLAGHILGFYMSSMSMMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLLATVNG 131
Query: 211 ---FPTELFDSFEALKPV----WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFY 263
F + F P+ W P Y ++KI+ GL Y A ++ M ++F
Sbjct: 132 KQVFEDMIDGDFTTSNPLINQTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGMADWFG 191
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
V + ++++ L E G +N+ +Y IT D K+L A + L+
Sbjct: 192 YEVLDKLNHENIQK---MLVCEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMWVPLSK 248
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
D L+ +HANT IP G Y T + Y T F DIV H++ GG S E +
Sbjct: 249 GEDILNGWHANTQIPKFTGFNAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNSTGEHF 308
Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
++ + E+C + NM++++ L++ + DYYER L N +L+ E
Sbjct: 309 FEESMFEKKIPQYGGPESCNSVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA-NYDPE 367
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
G+ +Y P+ G K +GT+++SFWCC GTG E+ +K IY ++ + LY
Sbjct: 368 EGMCVYYTPMRPGHYKI-----YGTRYHSFWCCTGTGFEAPAKFAKMIYAHKDNS---LY 419
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
+ +I+S+ DW ++++ Q + D L +T+ SS Q++ L +R+P W +
Sbjct: 420 VNMFIASTLDWNEKNIMITQSTN-FPDEDQTL-LTIKSSSTQQI----DLKIRIPFWIKN 473
Query: 563 NGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
+N + + + +++ + WS D++ + L +++ A+
Sbjct: 474 KSMVVRVNNKIVKGIKSEKGYVTISREWSDGDEIKVTFTPLLEIVPLKNSE----RYLAM 529
Query: 622 LFGPYLLA 629
+GP +LA
Sbjct: 530 TYGPIVLA 537
>gi|212693864|ref|ZP_03301992.1| hypothetical protein BACDOR_03386 [Bacteroides dorei DSM 17855]
gi|212663396|gb|EEB23970.1| hypothetical protein BACDOR_03386 [Bacteroides dorei DSM 17855]
Length = 811
Score = 189 bits (480), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 145/548 (26%), Positives = 252/548 (45%), Gaps = 45/548 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
L EV + D + Q + +YLL L+ D L+ FR+ A L + Y WE+
Sbjct: 38 LSEVRITDKYFKH------IQDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPFWESED 91
Query: 159 ----SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA---- 210
L GH +G Y+S+ + M+ +T++ I ++++ +V L CQ G GYL A
Sbjct: 92 VWGGGPLAGHILGFYMSSMSMMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLLATVNG 151
Query: 211 ---FPTELFDSFEALKPV----WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFY 263
F + F P+ W P Y ++KI+ GL Y A ++ M ++F
Sbjct: 152 KQVFEDMIDGDFTTSNPLINQTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGMADWFG 211
Query: 264 NRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLAL 323
V + ++++ L E G +N+ +Y IT D K+L A + L+
Sbjct: 212 YEVLDKLNHENIQK---MLVCEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMWVPLSK 268
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW 383
D L+ +HANT IP G Y T + Y T F DIV H++ GG S E +
Sbjct: 269 GEDILNGWHANTQIPKFTGFNAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNSTGEHF 328
Query: 384 WDPKRLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
++ + E+C + NM++++ L++ + DYYER L N +L+ E
Sbjct: 329 FEESMFEKKIPQYGGPESCNSVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA-NYDPE 387
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
G+ +Y P+ G K +GT+++SFWCC GTG E+ +K IY ++ + LY
Sbjct: 388 EGMCVYYTPMRPGHYKI-----YGTRYHSFWCCTGTGFEAPAKFAKMIYAHKDNS---LY 439
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
+ +I+S+ DW ++++ Q + D L +T+ SS Q++ L +R+P W +
Sbjct: 440 VNMFIASTLDWNEKNIMITQSTN-FPDEDQTL-LTIKSSSTQQI----DLKIRIPFWIKN 493
Query: 563 NGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
+N + + + +++ + WS D++ + L +++ A+
Sbjct: 494 KSMVVRVNNKIVKGIKSEKGYVTISREWSDGDEIKVTFTPLLEIVPLKNSE----RYLAM 549
Query: 622 LFGPYLLA 629
+GP +LA
Sbjct: 550 TYGPIVLA 557
>gi|423223251|ref|ZP_17209720.1| hypothetical protein HMPREF1062_01906 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392639352|gb|EIY33177.1| hypothetical protein HMPREF1062_01906 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 643
Score = 189 bits (479), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 163/554 (29%), Positives = 252/554 (45%), Gaps = 50/554 (9%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP----I 158
SL DV L +S L QQ EYLL L+ DSL+ +R A L +AY GWE+
Sbjct: 41 SLEDVRLLESPFL-DLQQKGKEYLLWLNPDSLLHFYRIEAGLQPKARAYAGWESQDVWGA 99
Query: 159 SELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELF 216
LRG F+G YLS+ + M+ +T + + +++ V+ L CQ G+L +LF
Sbjct: 100 GPLRGGFLGFYLSSVSMMYQATGDKELLKRLQYVLNELELCQKAGKDGFLLGIKDGRKLF 159
Query: 217 DSFEALK---------PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
+ K WAP Y I+K+L GL Y +AL M + ++F +V
Sbjct: 160 SEVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYAQCGQEKALPMMIRLADWFGYQVL 219
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+T V+R L E G +N+ +Y +T + + L A + L+ D
Sbjct: 220 DKLTDEQVQR---LLVCEHGSINESFVEIYKLTGEIRFLEWAGRLNDRAMWVPLSEGKDI 276
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L +HANT IP G + YE TGD F DIVN +H++ GG S E ++ K
Sbjct: 277 LFGWHANTQIPKFTGFEKYYEATGDKRLLNAAMNFWDIVNQNHTWVIGGNSTGEHFFPKK 336
Query: 388 RLAD-TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
+ L ETC + NML+++ LF + + A YYER L N +LS + G+
Sbjct: 337 EFEERVLLKGGPETCNSVNMLRLTETLFSYQPDAKKAAYYERVLFNHILSAYDPVK-GMC 395
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
Y + G + + ++ +SFWCC TG+ES +KLG IY ++G G+ + +
Sbjct: 396 CYFTSMRPGHYRI-----YASRDSSFWCCGHTGLESPAKLGKFIYSRDKG---GIRVNLF 447
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS---- 562
I S K + L Q S R+ L Q+ L +L +R P W +
Sbjct: 448 IPSVLTSKELGMELAQYSHMPESDKVEFRLNL-----QDERTL-TLRIRRPDWAKNPILV 501
Query: 563 -NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
NG + +++ + +W +++ ++LP+ TE + A+
Sbjct: 502 INGKEEAIDTDT------SGYWVLDRKWKKKNRIILKLPMEPYTENLVGS----DKYVAL 551
Query: 622 LFGPYLLAGHTSGE 635
L+GPY+LAG E
Sbjct: 552 LYGPYVLAGRLGME 565
>gi|423228769|ref|ZP_17215175.1| hypothetical protein HMPREF1063_00995 [Bacteroides dorei
CL02T00C15]
gi|423247580|ref|ZP_17228629.1| hypothetical protein HMPREF1064_04835 [Bacteroides dorei
CL02T12C06]
gi|392631910|gb|EIY25877.1| hypothetical protein HMPREF1064_04835 [Bacteroides dorei
CL02T12C06]
gi|392635508|gb|EIY29407.1| hypothetical protein HMPREF1063_00995 [Bacteroides dorei
CL02T00C15]
Length = 811
Score = 189 bits (479), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 141/528 (26%), Positives = 246/528 (46%), Gaps = 39/528 (7%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI----SELRGHFVGHYLSASA 174
Q + +YLL L+ D L+ FR+ A L + Y WE+ L GH +G Y+S+ +
Sbjct: 52 QDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPFWESEDVWGGGPLAGHILGFYMSSMS 111
Query: 175 QMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSA-------FPTELFDSFEALKPV-- 225
M+ +T++ I ++++ +V L CQ G GYL A F + F P+
Sbjct: 112 MMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLLATVNGKQVFEDMIDGDFTTSNPLIN 171
Query: 226 --WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
W P Y ++KI+ GL Y A ++ M ++F V + ++++ L
Sbjct: 172 QTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGMADWFGYEVLDKLNHENIQK---MLV 228
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGS 343
E G +N+ +Y IT D K+L A + L+ D L+ +HANT IP G
Sbjct: 229 CEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMWVPLSKGEDILNGWHANTQIPKFTGF 288
Query: 344 QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCT 402
Y T + Y T F DIV H++ GG S E +++ + E+C
Sbjct: 289 NAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNSTGEHFFEESMFEKKIPQYGGPESCN 348
Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARST 462
+ NM++++ L++ + DYYER L N +L+ E G+ +Y P+ G K
Sbjct: 349 SVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA-NYDPEEGMCVYYTPMRPGHYKI--- 404
Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
+GT+++SFWCC GTG E+ +K IY ++ + LY+ +I+S+ DW ++++ Q
Sbjct: 405 --YGTRYHSFWCCTGTGFEAPAKFAKMIYAHKDNS---LYVNMFIASTLDWNEKNIMITQ 459
Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGN 581
+ D L +T+ SS Q++ L +R+P W + +N + + +
Sbjct: 460 STN-FPDEDQTL-LTIKSSSTQQI----DLKIRIPFWIKNKSMVVRVNNKIVKGIKSEKG 513
Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
+++ + WS D++ + L +++ A+ +GP +LA
Sbjct: 514 YVTISREWSDGDEIKVTFTPLLEIVPLKNSE----RYLAMTYGPIVLA 557
>gi|336404182|ref|ZP_08584880.1| hypothetical protein HMPREF0127_02193 [Bacteroides sp. 1_1_30]
gi|335943510|gb|EGN05349.1| hypothetical protein HMPREF0127_02193 [Bacteroides sp. 1_1_30]
Length = 650
Score = 188 bits (478), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 172/568 (30%), Positives = 264/568 (46%), Gaps = 48/568 (8%)
Query: 93 DLPGNFLKEVS----LHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG 148
+LP +K S L++V L S L QQ EYLL L+ DSL+ +R A LP
Sbjct: 24 NLPSTMVKPESVYFPLNEVRLLDSPFL-TLQQKGKEYLLWLNPDSLLHFYRVEAGLPPKA 82
Query: 149 KAYGGWENP----ISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG 204
AY GWE+ LRG F+G YLS+ + M ST + + +++ V+ L CQ+
Sbjct: 83 DAYAGWESQNVWGAGPLRGGFLGFYLSSVSMMHQSTGDKELLKRLKYVLKELKLCQDAGK 142
Query: 205 TGYLSAFP--TELFDSFEALK---------PVWAPYYTIHKILAGLLDQYVLADNAQALK 253
G+L LF + K WAP Y I+K+L GL Y +AL
Sbjct: 143 DGFLLGIKDGRMLFKEVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCGLEEALP 202
Query: 254 MATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFD 313
M + ++F +V ++ +++ L E G +N+ Y +T + L A
Sbjct: 203 MMIRLADWFGYQVLDKLSDEQIQK---LLVCEHGSINESYVEAYELTGQKRFLDWARRLH 259
Query: 314 KPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYA 373
L+ D L +HANT IP G Y TGD + T F +IVN +H++
Sbjct: 260 DRAMWVPLSEGKDILYGWHANTQIPKFTGFHKYYMFTGDKRFLTAATNFWNIVNRNHTWV 319
Query: 374 TGGTSAREFWWDPKRLADTLGSE-NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
GG S E ++ + AD L + ETC + NML+++ LF + A YYER L N
Sbjct: 320 IGGNSTGEHFFPKEEFADRLLLKGGPETCNSVNMLRLTESLFSQYPDAVKASYYERVLFN 379
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+LS + G+ Y + G + + ++ +SFWCC TG+ES +KLG IY
Sbjct: 380 HILSAY-DPKKGMCCYFTSMRPGHYRI-----YASRDSSFWCCGHTGLESPAKLGKFIYS 433
Query: 493 EEEGNVPGLYIIQ---YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL 549
+ N I+ +I S W G V L Q+ + + D R+ LT + K++ Q
Sbjct: 434 HKATNRKEEKEIRVNLFIPSVLTWHEGGVELVQR-NRLPDSD---RVELTMNLKKK--QR 487
Query: 550 SSLNLRMPVWTYSNGAQASLNG--QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
L +R P W ++ A +NG + L L G ++ + W+ +++++QLP+ TE
Sbjct: 488 LILWIRKPDW--ADKATLIINGKAEQLLLGNDGYWM-IDKVWNRKNRISLQLPMHTYTEN 544
Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGE 635
+ A+L+GPY+LAG E
Sbjct: 545 LIGT----GRYVALLYGPYVLAGRMGKE 568
>gi|427384823|ref|ZP_18881328.1| hypothetical protein HMPREF9447_02361 [Bacteroides oleiciplenus YIT
12058]
gi|425728084|gb|EKU90943.1| hypothetical protein HMPREF9447_02361 [Bacteroides oleiciplenus YIT
12058]
Length = 813
Score = 187 bits (474), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 162/578 (28%), Positives = 252/578 (43%), Gaps = 54/578 (9%)
Query: 103 SLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELR 162
L +V L S + A Q + +YLL D++ ++ RK +P KAY G P R
Sbjct: 42 CLSEVRLLPGSPFYHAMQVSQQYLLDADIERMLNGRRKEVGIPEK-KAYPGSNQPAG-TR 99
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN-----------KIGTGYLSAF 211
HY+S ++ M+A T + ++++ ++ L+ N K+ Y
Sbjct: 100 ATDWHHYISGTSLMYAQTGDRRFLDRVNYLIDELAMLDNRKDSLYRVQGKKLELPYAKLM 159
Query: 212 PTELF--DSFEALKP----VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNR 265
EL EA P W P+Y HK A D Y+ DN +AL + E
Sbjct: 160 KGELLLNSPDEAGYPWGGLCWIPFYWQHKEFAAYRDAYLYCDNLKALNLWIKQAE----P 215
Query: 266 VQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA 325
V + I + + L+ E GG+N V LY++T D ++L ++ + + +A
Sbjct: 216 VTEFILKVNPDLFEGFLDIENGGINAVFADLYALTGDERYLAVSMKLNHQKVILNIANGK 275
Query: 326 DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
D L HAN +P G+ +Y++TGD + + F I H GG S E +
Sbjct: 276 DVLYGRHANFQLPAFEGTARQYQLTGDEVCRKATQNFAGIYYRDHMNCIGGNSCYERFGR 335
Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
+ LGS + ETC TYNM+K++ + F T ++ + DY+ERAL N +L+ Q GV
Sbjct: 336 SGEITKRLGSTSSETCNTYNMMKIALNTFESTGDLHHMDYFERALYNHILASQDPETGGV 395
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFN--SFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
Y + L G + +FN WCC GTG+E+ SK G+ IYF N LY+
Sbjct: 396 TYYTMLLPGGFK------SYSDRFNIEGIWCCVGTGMENHSKYGECIYF---NNHQSLYV 446
Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS-LNLRMPVWTYS 562
+I S +WK ++ L Q+ D + + T + E G + + +R P W
Sbjct: 447 NLFIPSELNWKEKNLHLKQETD-------FPQGDCTTLTILESGAYNHPIYIRYPHWA-G 498
Query: 563 NGAQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
+N + PL G ++ W D++ I++ + R EA DD + I
Sbjct: 499 REVSVRINDEEYPLHAQAGEYIRLQHPWKTGDRIRIEMKQTFRLEAAPDD----PFMNVI 554
Query: 622 LFGPY-----LLAGHTSGEWDIKTGTARSLSALISPIP 654
GP L A H E+ IKT S + IP
Sbjct: 555 FRGPIAYAAQLGADHLPNEY-IKTSRQNSSFLPLDDIP 591
>gi|396489945|ref|XP_003843216.1| hypothetical protein LEMA_P073260.1 [Leptosphaeria maculans JN3]
gi|312219795|emb|CBX99737.1| hypothetical protein LEMA_P073260.1 [Leptosphaeria maculans JN3]
Length = 748
Score = 187 bits (474), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 160/527 (30%), Positives = 232/527 (44%), Gaps = 90/527 (17%)
Query: 152 GGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNK-------IG 204
GGWE+ L GH+ GHY+SA +Q + + KEK+ +V L+ CQ
Sbjct: 100 GGWEDG-GLLSGHWTGHYMSALSQAYIDKGESIFKEKLDWMVAELAACQEAYTEYKQPTH 158
Query: 205 TGYLSAFPTELFDSFEALKP-------------VWAPYYTIHKILAGLLDQYVLADNAQA 251
GYL A P D+ L P WA +YT HKI+ GLLD Y A+N QA
Sbjct: 159 LGYLGALPE---DTVLRLGPPRFAVYGSNISTDTWAGWYTQHKIMRGLLDAYYNANNTQA 215
Query: 252 LKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHL 311
L + M ++ + + + E GG N+V +Y++T + KHL A
Sbjct: 216 LDIVIKMADWAHLALTDTY-----------IAGEFGGANEVFPEIYALTGEEKHLQTAKA 264
Query: 312 FDKPCFLGFLALQAD---------------YLSHFHANTHIPIVIGSQMRYEVTGDPLYK 356
FD L F A +D HANTH+P IG YE TG Y
Sbjct: 265 FDNRESL-FSAAVSDQDILVMTPERKPGRRRRERLHANTHVPQFIGYLRIYEHTGSNEYL 323
Query: 357 LIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEETCTTYNMLK 408
L F V +A+G T E + + +A+++ E ETC TYN L
Sbjct: 324 LAAKNFFGWVVPHREFASGSTGGNVPGFSANPELFQNRDNIANSIADEGAETCITYNTLN 383
Query: 409 VSRHLFRWTKEIAYADYYERALTNGV----LSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
++R+LF Y D+ ER L N + + ++P + Y PL G +
Sbjct: 384 LARNLFLDEHNATYMDHCERGLFNMIAGSRVDTSNNSDP-QLTYFQPLSPGFGREYG--- 439
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKV 524
N+ CC GTG+ES +K +++Y + P L+I +I S+ W + Q+
Sbjct: 440 -----NTGTCCGGTGMESHTKYQETVYL-RSAHSPVLWINLFIPSTLHWMERGFAIKQET 493
Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG--QNLPLPPPGNF 582
+ + R T + G L + LR+P W NG ++NG Q P +
Sbjct: 494 N-------FPREGSTKLTIAGEGAL-VIKLRVPGWV-RNGFAVTINGEAQATKNVQPSTY 544
Query: 583 LSATERWSYNDKLTIQLPLSLRTE-AIQDDRPEYASIQAILFGPYLL 628
LS W ND + +Q+PLS+RTE AI DRP+ QA+++GP LL
Sbjct: 545 LSLKRIWKTNDVIEVQMPLSIRTERAI--DRPD---TQAVMWGPVLL 586
>gi|227509161|ref|ZP_03939210.1| possible acetyl-CoA carboxylase, biotin carboxylase [Lactobacillus
brevis subsp. gravesensis ATCC 27305]
gi|227191368|gb|EEI71435.1| possible acetyl-CoA carboxylase, biotin carboxylase [Lactobacillus
brevis subsp. gravesensis ATCC 27305]
Length = 606
Score = 185 bits (470), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 127/369 (34%), Positives = 185/369 (50%), Gaps = 38/369 (10%)
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
L E GGMND LY L+SIT D +HL A FD+ LA D L HANT IP ++
Sbjct: 2 LKVEYGGMNDALYHLFSITKDERHLTAATYFDEVELFKDLAAAKDVLPGKHANTTIPKLL 61
Query: 342 GSQMRYEVTGD----------------PLYKLIGTFFMDIVNASHSYATGGTSAREFWWD 385
G+ RYE+ D P+Y F IV H+YATGG S E + D
Sbjct: 62 GAIRRYEIFDDPQMAGQYLYEKDQKQLPIYLKAAENFWRIVINHHTYATGGNSQSEHFHD 121
Query: 386 PKRL-ADTL---GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
P +L D + G+ ETC T+NMLK+SR LFR T + Y DYY+R +N +L Q
Sbjct: 122 PNQLYHDAVIEDGATTCETCNTHNMLKLSRELFRVTGDKKYLDYYDRTYSNAILGSQ-NP 180
Query: 442 EPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
+ G+M Y P+ G K + ++ FWCC GTGIESF+KLGDS YF+E L
Sbjct: 181 KTGMMTYFQPMAAGYRKV-----FNRPYDEFWCCTGTGIESFTKLGDSYYFKEGQT---L 232
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
Y Y S+ ++ L+ +VD V +++T++ + + ++ R P W++
Sbjct: 233 YATGYFSNQLSLPKENLKLDMQVDRKVGA---VKLTVSKLIDNKTSEPLNVKFRHPDWSH 289
Query: 562 SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
N + P F+ ++ D + I L ++L + D++ +Y S++
Sbjct: 290 GR-LSVKKNQKTQPNNETFGFVEV-KKLVPGDVIEINLSMTLTVGSTPDNQ-QYISLK-- 344
Query: 622 LFGPYLLAG 630
+GPY+LAG
Sbjct: 345 -YGPYVLAG 352
>gi|340347550|ref|ZP_08670658.1| hypothetical protein HMPREF9136_1656 [Prevotella dentalis DSM 3688]
gi|339609246|gb|EGQ14121.1| hypothetical protein HMPREF9136_1656 [Prevotella dentalis DSM 3688]
Length = 1007
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 174/674 (25%), Positives = 275/674 (40%), Gaps = 120/674 (17%)
Query: 69 ILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEV-SLHDVWLDQSSVLWRAQQTNLEYLL 127
I+GD + + + + PG + SL DV LD + L + L +
Sbjct: 136 IIGDATTDKGYPIKAQVRVVATAVAAPGQEMAHAFSLADVTLDGDNRLTHNRDEALREIC 195
Query: 128 MLDVDSLVWSFRKTASLPTPGKAYG-GWENPISELRGHFVGHYLSASAQMWASTHN---- 182
DV ++++R T L T G GW++P ++L+GH GHY+SA AQ +A T +
Sbjct: 196 SWDVSQQLYNYRDTYGLSTDGYTRSDGWDSPDTKLKGHGSGHYMSAIAQAYAVTKDPRQK 255
Query: 183 ATIKEKMSTVVFSLSECQNKI--------------------------------------- 203
A +++ ++ +V L CQ K
Sbjct: 256 AILRKNITRMVNELRACQEKTFVFDKALNRYWEARDFAPEEELRGLKGTWEAFDEYKKHP 315
Query: 204 ---GTGYLSAFPT------ELFDSFEALKPVWAPYYTIHKILAGLLD------QYVLADN 248
G GY++A P E++ ++ VWAPYY++HK LAGL+D + D
Sbjct: 316 EKYGYGYINAIPAQHCALIEMYRAYNNSDWVWAPYYSVHKQLAGLIDIATYFDDKAICDK 375
Query: 249 A--QALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE----------ETGGMNDVLYRL 296
A A M W+ + R ER N E GGM++ L RL
Sbjct: 376 ALLTAKDMGLWVWNRMHYRTYVKEDGTEAERRSKPGNRYEMWDMYIAGEVGGMSESLARL 435
Query: 297 YSITHDP----KHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
+ DP K + A FD P F L+ D + HAN HIP+++G+ Y+ +
Sbjct: 436 SEMVSDPGEKAKLIEAAGCFDAPKFYNPLSKNVDDIRTRHANQHIPMIVGALRSYKTNKN 495
Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK----RLADTLGSENE--------ET 400
P Y + F +V + YATGG E + P +A E E ET
Sbjct: 496 PFYYHLSQNFWHLVQGRYMYATGGVGNGEMFRQPYTQILSMATNGMQEGERQANPDINET 555
Query: 401 CTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
C TYN+LK++ L + + A Y DYYER L N ++ + Y +G +K
Sbjct: 556 CCTYNLLKLTSDLNCYNPDDARYMDYYERGLYNQIVG-SLNPDKYETCYQYAVGLNATKP 614
Query: 460 RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
+G + CC GTG E+ +K + YF N L++ Y+ ++ WK+ +
Sbjct: 615 -----FGNETPQSTCCGGTGSENHTKYQAAAYF---ANTHTLWVGLYMPTTLHWKAKGLT 666
Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPP 578
+ Q+ +W K E +L LR+P W + G + +NG+ + L
Sbjct: 667 IRQE----CAWPAQHTAIQIAEGKGEF----TLKLRVPYWA-TGGFEVKVNGKKVKQLFR 717
Query: 579 PGNFLSATE-RWSYNDKLTIQLPLSLRTE----------AIQDDRP-EYASIQAILFGPY 626
P ++++ + RW D + I +P + E A D P A + +++GP
Sbjct: 718 PSSYVALEKTRWKAGDVVEIDMPFTKHIEYGADKLTSEVASMDGTPLRTAWVGTLMYGPL 777
Query: 627 LLAGHTSGEWDIKT 640
+ G S W T
Sbjct: 778 AMTGTGSAIWKEAT 791
>gi|433653573|ref|YP_007297427.1| hypothetical protein Prede_2696 [Prevotella dentalis DSM 3688]
gi|433304106|gb|AGB29921.1| hypothetical protein Prede_2696 [Prevotella dentalis DSM 3688]
Length = 986
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 174/674 (25%), Positives = 275/674 (40%), Gaps = 120/674 (17%)
Query: 69 ILGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEV-SLHDVWLDQSSVLWRAQQTNLEYLL 127
I+GD + + + + PG + SL DV LD + L + L +
Sbjct: 115 IIGDATTDKGYPIKAQVRVVATAVAAPGQEMAHAFSLADVTLDGDNRLTHNRDEALREIC 174
Query: 128 MLDVDSLVWSFRKTASLPTPGKAYG-GWENPISELRGHFVGHYLSASAQMWASTHN---- 182
DV ++++R T L T G GW++P ++L+GH GHY+SA AQ +A T +
Sbjct: 175 SWDVSQQLYNYRDTYGLSTDGYTRSDGWDSPDTKLKGHGSGHYMSAIAQAYAVTKDPRQK 234
Query: 183 ATIKEKMSTVVFSLSECQNKI--------------------------------------- 203
A +++ ++ +V L CQ K
Sbjct: 235 AILRKNITRMVNELRACQEKTFVFDKALNRYWEARDFAPEEELRGLKGTWEAFDEYKKHP 294
Query: 204 ---GTGYLSAFPT------ELFDSFEALKPVWAPYYTIHKILAGLLD------QYVLADN 248
G GY++A P E++ ++ VWAPYY++HK LAGL+D + D
Sbjct: 295 EKYGYGYINAIPAQHCALIEMYRAYNNSDWVWAPYYSVHKQLAGLIDIATYFDDKAICDK 354
Query: 249 A--QALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE----------ETGGMNDVLYRL 296
A A M W+ + R ER N E GGM++ L RL
Sbjct: 355 ALLTAKDMGLWVWNRMHYRTYVKEDGTEAERRSKPGNRYEMWDMYIAGEVGGMSESLARL 414
Query: 297 YSITHDP----KHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
+ DP K + A FD P F L+ D + HAN HIP+++G+ Y+ +
Sbjct: 415 SEMVSDPGEKAKLIEAAGCFDAPKFYNPLSKNVDDIRTRHANQHIPMIVGALRSYKTNKN 474
Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK----RLADTLGSENE--------ET 400
P Y + F +V + YATGG E + P +A E E ET
Sbjct: 475 PFYYHLSQNFWHLVQGRYMYATGGVGNGEMFRQPYTQILSMATNGMQEGERQANPDINET 534
Query: 401 CTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
C TYN+LK++ L + + A Y DYYER L N ++ + Y +G +K
Sbjct: 535 CCTYNLLKLTSDLNCYNPDDARYMDYYERGLYNQIVG-SLNPDKYETCYQYAVGLNATKP 593
Query: 460 RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
+G + CC GTG E+ +K + YF N L++ Y+ ++ WK+ +
Sbjct: 594 -----FGNETPQSTCCGGTGSENHTKYQAAAYF---ANTHTLWVGLYMPTTLHWKAKGLT 645
Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPP 578
+ Q+ +W K E +L LR+P W + G + +NG+ + L
Sbjct: 646 IRQE----CAWPAQHTAIQIAEGKGEF----TLKLRVPYWA-TGGFEVKVNGKKVKQLFR 696
Query: 579 PGNFLSATE-RWSYNDKLTIQLPLSLRTE----------AIQDDRP-EYASIQAILFGPY 626
P ++++ + RW D + I +P + E A D P A + +++GP
Sbjct: 697 PSSYVALEKTRWKAGDVVEIDMPFTKHIEYGADKLTSEVASMDGTPLRTAWVGTLMYGPL 756
Query: 627 LLAGHTSGEWDIKT 640
+ G S W T
Sbjct: 757 AMTGTGSAIWKEAT 770
>gi|336397986|ref|ZP_08578786.1| protein of unknown function DUF1680 [Prevotella multisaccharivorax
DSM 17128]
gi|336067722|gb|EGN56356.1| protein of unknown function DUF1680 [Prevotella multisaccharivorax
DSM 17128]
Length = 943
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 181/680 (26%), Positives = 290/680 (42%), Gaps = 132/680 (19%)
Query: 69 ILGDQKDEVSWALLYRKIKNPGGFDLPGNF-LKEVS----LHDVWLDQSSVLWRAQQTNL 123
I+GD+ + + + KIK +P N KE++ L DV ++ + L + +
Sbjct: 93 IIGDETTDNGYPIT-AKIK---VVSMPANEEKKEIAQTFPLSDVTINGDNRLTHNRDEAI 148
Query: 124 EYLLMLDVDSLVWSFRKTASLPTPG-KAYGGWENPISELRGHFVGHYLSASAQMWASTHN 182
+ DV ++++R T ++ T G K GW++P ++L+GH GHY+SA AQ +A T +
Sbjct: 149 AAICSWDVTQQLYNYRDTYNMSTEGYKVADGWDSPDTKLKGHGSGHYMSAIAQAYAVTKD 208
Query: 183 ----ATIKEKMSTVVFSLSECQNKI----------------------------------- 203
A +K+ ++ +V L CQ K
Sbjct: 209 PQQKAILKKNITRMVNELRACQEKTFVWNDSLGRYWEARDFAPESELKNMKGTWAAFDEY 268
Query: 204 -------GTGYLSAFPTELFDSFEALKP------VWAPYYTIHKILAGLLDQYVLADN-- 248
G GY++A P++ E +P VWAPYYTIHK LAGL+D L D+
Sbjct: 269 KKHPEKYGYGYINAIPSQHCALIEMYRPYNNSDWVWAPYYTIHKELAGLIDIATLFDDKE 328
Query: 249 --AQALKMATWMVEYFYNRVQKVITMYS----VERHWYSLNE----------ETGGMNDV 292
A+AL +A M + +NR+ + + ER N E GGM +
Sbjct: 329 VAAKALLIAKDMGLWVWNRMHYRTYVKADGTQEERRAKPGNRYEMWDMYIAGEVGGMQES 388
Query: 293 LYRLYSI----THDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYE 348
L RL + T + L A FD P F LA D + HAN HIP+++G+ Y+
Sbjct: 389 LSRLSEMVSNSTDKARLLEAAQCFDAPKFYEPLAKNIDDIRTRHANQHIPMIVGALRSYK 448
Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP-----------KRLADTLGSEN 397
D Y + F +V + YATGG E + P + + + + N
Sbjct: 449 SNHDIHYYNVADNFWHLVQGRYMYATGGVGNGEMFRQPYTQVLSMATNGMQEGEAMANPN 508
Query: 398 -EETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEPG--VMIYMLPLG 453
ETC TYN+LK+++ L + + A DYYER L N ++ +P + Y +G
Sbjct: 509 LNETCCTYNLLKLTKDLNVYNPDDAELMDYYERGLYNQIVG---SLDPDHYAVTYQYAVG 565
Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
+K +G + CC GTG E+ +K + YF + L++ Y+ ++ W
Sbjct: 566 LNATKP-----FGNETPQSTCCGGTGSENHTKYQQAAYFHNDST---LWVCLYMPTTLQW 617
Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
+ + L Q +W P R + + + G +L LR+P W + G + LNG+
Sbjct: 618 RDKGITLEQD----CTW-PAQRSVIRLTKGE--GNF-TLKLRVPYWA-TRGFEILLNGKP 668
Query: 574 LP--LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP-EYASIQAI--------- 621
+ P + W+ +D+L I +P S E D P + AS I
Sbjct: 669 VQHHYQPSSYVTISGHHWTVSDRLEIIMPFSTHIEYGADKLPAKVASADGIPLKSAWTGV 728
Query: 622 -LFGPYLLAGHTSGEWDIKT 640
++GP + G + W T
Sbjct: 729 VMYGPLCMTGTNATTWKQAT 748
>gi|389638620|ref|XP_003716943.1| acetyl-CoA carboxylase [Magnaporthe oryzae 70-15]
gi|351642762|gb|EHA50624.1| acetyl-CoA carboxylase [Magnaporthe oryzae 70-15]
Length = 1018
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 142/478 (29%), Positives = 214/478 (44%), Gaps = 76/478 (15%)
Query: 206 GYLSAFPTELFDSFEALKP-------------VWAPYYTIHKILAGLLDQYVLADNAQAL 252
GYL A P D+ L P WAP+YT HKI+ GLLD Y +N+QAL
Sbjct: 390 GYLGALPE---DTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQAL 446
Query: 253 KMATWMVEYFY----------NRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITH 301
++ T M ++ + + +T + W + E GG N+V +Y +T
Sbjct: 447 QVVTRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTG 506
Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYL--------------SHFHANTHIPIVIGSQMRY 347
DPKHL A FD L A+ D + HANTH+P IG +
Sbjct: 507 DPKHLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIF 566
Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEE 399
E G Y F V +A+GGT E + + +A+ +G E
Sbjct: 567 EQGGGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAE 626
Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV----MIYMLPLGRG 455
TCT YNMLK++R+LF Y D YER L N + + T + Y PL G
Sbjct: 627 TCTAYNMLKLARNLFLHNHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPG 686
Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
S +G N+ CC GTG+ES +K +++Y + L++ Y+ S+ W+
Sbjct: 687 -----SNRDYG---NTGTCCGGTGLESHTKYQETVYL-RSADGSALWVNLYVPSTLTWEE 737
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--NGAQASLNGQN 573
+ + Q+ D ++ T+T SS+QE + LR+P W G S+NG+
Sbjct: 738 KGITVRQET--AFPRDDTVKFTVTTSSRQEP---LDMKLRVPAWIQKTPGGFNVSINGEQ 792
Query: 574 L---PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
P PG++++ + W+ D + I++P ++R E DRP+ QAI++GP LL
Sbjct: 793 FRPGETPTPGSYMTVSRTWATGDVVEIKMPFAVRIERAP-DRPD---TQAIMWGPLLL 846
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 5/62 (8%)
Query: 142 ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN 201
A LP PG GWE+ L GH+ GH+++A +Q +A K K+ +V L+ CQ+
Sbjct: 79 AGLPVPG----GWEDG-GLLSGHWAGHFMTALSQAFADQGEELYKTKLDWMVKELAACQD 133
Query: 202 KI 203
I
Sbjct: 134 AI 135
>gi|440483441|gb|ELQ63839.1| acetyl-CoA carboxylase [Magnaporthe oryzae P131]
Length = 1055
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 142/478 (29%), Positives = 214/478 (44%), Gaps = 76/478 (15%)
Query: 206 GYLSAFPTELFDSFEALKP-------------VWAPYYTIHKILAGLLDQYVLADNAQAL 252
GYL A P D+ L P WAP+YT HKI+ GLLD Y +N+QAL
Sbjct: 427 GYLGALPE---DTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQAL 483
Query: 253 KMATWMVEYFY----------NRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITH 301
++ T M ++ + + +T + W + E GG N+V +Y +T
Sbjct: 484 QVVTRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTG 543
Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYL--------------SHFHANTHIPIVIGSQMRY 347
DPKHL A FD L A+ D + HANTH+P IG +
Sbjct: 544 DPKHLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIF 603
Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEE 399
E G Y F V +A+GGT E + + +A+ +G E
Sbjct: 604 EQGGGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAE 663
Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV----MIYMLPLGRG 455
TCT YNMLK++R+LF Y D YER L N + + T + Y PL G
Sbjct: 664 TCTAYNMLKLARNLFLHDHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPG 723
Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
S +G N+ CC GTG+ES +K +++Y + L++ Y+ S+ W+
Sbjct: 724 -----SNRDYG---NTGTCCGGTGLESHTKYQETVYL-RSADGSALWVNLYVPSTLTWEE 774
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--NGAQASLNGQN 573
+ + Q+ D ++ T+T SS+QE + LR+P W G S+NG+
Sbjct: 775 KGITVRQET--AFPRDDTVKFTVTTSSRQEP---LDMKLRVPAWIQKTPGGFNVSINGEQ 829
Query: 574 L---PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
P PG++++ + W+ D + I++P ++R E DRP+ QAI++GP LL
Sbjct: 830 FRPGETPTPGSYMTVSRTWATGDVVEIKMPFAVRIERAP-DRPD---TQAIMWGPLLL 883
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 5/62 (8%)
Query: 142 ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN 201
A LP PG GWE+ L GH+ GH+++A +Q +A K K+ +V L+ CQ+
Sbjct: 116 AGLPVPG----GWEDG-GLLSGHWAGHFMTALSQAFADQGEELYKTKLDWMVKELAACQD 170
Query: 202 KI 203
I
Sbjct: 171 AI 172
>gi|300726603|ref|ZP_07060044.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Prevotella
bryantii B14]
gi|299776135|gb|EFI72704.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Prevotella
bryantii B14]
Length = 832
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 162/563 (28%), Positives = 247/563 (43%), Gaps = 84/563 (14%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--------YGGWENPISELRGHFVGHY 169
A + N LL DVD L+ F + A L A + W +L GH GHY
Sbjct: 38 AMEINFNTLLAYDVDRLLTPFIRQAGLHEGRYADWQKKHPNFKNWGGDGFDLSGHIGGHY 97
Query: 170 LSASAQMWASTHNATIKEKMST----VVFSLSECQNKIGT------GYLSAFPTELFDSF 219
LSA A +A+ +A KE++ + ++ L +CQN G++ P + + +
Sbjct: 98 LSALAMAYAACQDAATKERLQSRLLYMIDVLKDCQNSFDQNTTGLYGFIGGQP--INEDW 155
Query: 220 EAL----------KPVWAPYYTIHKILAGLLDQYVLADNAQAL----KMATWMVEYFYN- 264
E L W P+Y HK++AGL D Y+ A N A KMA W +
Sbjct: 156 EKLYQGDISGIWQHRGWVPFYCEHKVMAGLRDAYLYAHNQDAKLMLKKMADWCTQLIAKV 215
Query: 265 ---RVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL-GF 320
+QK++T+ E GG+N+ + Y+I D ++L A + + L G
Sbjct: 216 SDADMQKMLTI------------EHGGINESMADCYAIFKDTRYLEAAKKYSQREMLEGL 263
Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL-YKLIGTFFMDIVNASHSYATGGTSA 379
+L A +L + HANT +P IG + E L Y + F V + GG S
Sbjct: 264 QSLNATFLDNRHANTQVPKYIGFERIVEEDPAALQYATAASNFWQDVAHHRTVCIGGNSI 323
Query: 380 REFWW---DPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
E + + R D L E E+C T NMLK+S L T + YAD+YE A+ N +LS
Sbjct: 324 SEHFLSKTNSNRYIDNL--EGPESCNTNNMLKLSEMLSDRTHDAGYADFYEYAMWNHILS 381
Query: 437 IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
Q + G +Y L + + + WCC GTG+E+ SK G +Y +
Sbjct: 382 TQ-DPQTGGYVYFTTL-----RPQGYRIYSVPNQGMWCCVGTGMENHSKYGHFVYTHDGD 435
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
LY+ + +S D K L Q+ + ++P +T+ S + + +R
Sbjct: 436 RT--LYVNLFTASKLDGKK--FKLTQQTN--YPYEPKTTITIEKSGRYAIA------IRR 483
Query: 557 PVWTYSNGAQASLNG--QNLPLPPPGNFLSAT--ERWSYNDKLTIQLPLSLRTEAIQDDR 612
P WT S+ + +NG Q L +P G AT +W D +T+ +P++LR EA
Sbjct: 484 PWWTTSD-YRIQVNGQTQQLNIPSAGTSAYATLERKWKKGDVITVDIPMTLRQEAC---- 538
Query: 613 PEYASIQAILFGPYLLAGHTSGE 635
P Y A +GP LL T+ +
Sbjct: 539 PNYEDYIAFEYGPILLGAQTTSQ 561
>gi|440466410|gb|ELQ35678.1| acetyl-CoA carboxylase [Magnaporthe oryzae Y34]
Length = 1055
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 142/478 (29%), Positives = 214/478 (44%), Gaps = 76/478 (15%)
Query: 206 GYLSAFPTELFDSFEALKP-------------VWAPYYTIHKILAGLLDQYVLADNAQAL 252
GYL A P D+ L P WAP+YT HKI+ GLLD Y +N+QAL
Sbjct: 427 GYLGALPE---DTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQAL 483
Query: 253 KMATWMVEYFY----------NRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITH 301
++ T M ++ + + +T + W + E GG N+V +Y +T
Sbjct: 484 QVVTRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTG 543
Query: 302 DPKHLLLAHLFDKPCFLGFLALQADYL--------------SHFHANTHIPIVIGSQMRY 347
DPKHL A FD L A+ D + HANTH+P IG +
Sbjct: 544 DPKHLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIF 603
Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEE 399
E G Y F V +A+GGT E + + +A+ +G E
Sbjct: 604 EQGGGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAE 663
Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV----MIYMLPLGRG 455
TCT YNMLK++R+LF Y D YER L N + + T + Y PL G
Sbjct: 664 TCTAYNMLKLARNLFLHDHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPG 723
Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
S +G N+ CC GTG+ES +K +++Y + L++ Y+ S+ W+
Sbjct: 724 -----SNRDYG---NTGTCCGGTGLESHTKYQETVYL-RSADGSALWVNLYVPSTLTWEE 774
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--NGAQASLNGQN 573
+ + Q+ D ++ T+T SS+QE + LR+P W G S+NG+
Sbjct: 775 KGITVRQET--AFPRDDTVKFTVTTSSRQEP---LDMKLRVPAWIQKTPGGFNVSINGEQ 829
Query: 574 L---PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
P PG++++ + W+ D + I++P ++R E DRP+ QAI++GP LL
Sbjct: 830 FRPGETPTPGSYMTVSRTWATGDVVEIKMPFAVRIERAP-DRPD---TQAIMWGPLLL 883
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 5/62 (8%)
Query: 142 ASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQN 201
A LP PG GWE+ L GH+ GH+++A +Q +A K K+ +V L+ CQ+
Sbjct: 116 AGLPVPG----GWEDG-GLLSGHWAGHFMTALSQAFADQGEELYKTKLDWMVKELAACQD 170
Query: 202 KI 203
I
Sbjct: 171 AI 172
>gi|330467692|ref|YP_004405435.1| glycosylase [Verrucosispora maris AB-18-032]
gi|328810663|gb|AEB44835.1| glycosylase [Verrucosispora maris AB-18-032]
Length = 1126
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 146/472 (30%), Positives = 214/472 (45%), Gaps = 73/472 (15%)
Query: 206 GYLSAFPTELFDSF----------EALKPVWAPYYTIHKILAGLLDQYVLADNAQAL--- 252
GYL A P + A WAP+YT HKI+ GLLD Y DNA AL
Sbjct: 416 GYLGAIPEDAVLRLGPPRWAVYGSNATTNTWAPWYTQHKIMRGLLDAYYHTDNATALDVV 475
Query: 253 -KMATW------MVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPK 304
KMA W + + + IT ++ W + ETGG N+V +Y++T D K
Sbjct: 476 VKMAGWAHLALTIGDKNHPAYTGPITRDNLNYMWDLYIAGETGGANEVFPEIYALTGDQK 535
Query: 305 HLLLAHLFD-KPCFLGFLALQADYL-------------SHFHANTHIPIVIGSQMRYEVT 350
HL A LFD + D L HAN+H+P +G YE +
Sbjct: 536 HLETAKLFDNRESLFDACVENRDILVVTPQNNPGRRRPDRLHANSHVPQFVGYLRVYEHS 595
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEETCT 402
GD Y F +V YA GGT E + + +A+++ ETCT
Sbjct: 596 GDTEYFQAAKNFYGMVVPHRMYANGGTGGNYPGSNNNIELFQNRGNIANSIAQGGAETCT 655
Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT----EPGVMIYMLPLGRGVSK 458
TYN+LK++R+LF + AY DYYER L N + + T P V Y PL G ++
Sbjct: 656 TYNLLKLARNLFFHEHDAAYLDYYERGLINQIAGSRADTTTVSNPQVT-YFQPLTPGANR 714
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE-EGNVPGLYIIQYISSSFDWKSGH 517
G+G N+ CC GTG+E+ +K ++IYF+ +G+ L++ Y++S+ W
Sbjct: 715 -----GYG---NTGTCCGGTGVENHTKYQETIYFKSADGDT--LWVNLYVASTLTWAERD 764
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
+ Q+ D Y R T + G L + LR+P W G ++NG +
Sbjct: 765 FTITQQTD-------YPRADRTRLTVDGSGPL-DIKLRVPGWV-RKGFFVTINGLAQQVT 815
Query: 578 PPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
N +L+ + W D + I++P S+R E DRP+ Q++ +GP LL
Sbjct: 816 ATANSYLTLSRTWQRGDVIEIRMPFSIRIERAL-DRPD---TQSVFWGPVLL 863
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 49/107 (45%), Gaps = 9/107 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPG--KAYGGWEN 156
L++V+L D + R + N YL LD + F A P P A GGWE+
Sbjct: 67 LRDVTLGDGLFQEK----RDRMKN--YLRQLDERRFLVLFNNQAGRPNPAGVTAPGGWED 120
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
L GH+ GH ++A AQ +A K K+ +V L+ CQ I
Sbjct: 121 G-GLLSGHWAGHVMTALAQGYADHGEPIFKSKLDWIVDELAACQTAI 166
>gi|261879318|ref|ZP_06005745.1| acetyl-CoA carboxylase [Prevotella bergensis DSM 17361]
gi|270334148|gb|EFA44934.1| acetyl-CoA carboxylase [Prevotella bergensis DSM 17361]
Length = 839
Score = 177 bits (449), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 177/587 (30%), Positives = 257/587 (43%), Gaps = 94/587 (16%)
Query: 95 PGNF-LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--- 150
P +F L EV+L D S L A N++ L+ DVD L+ F + A L T A
Sbjct: 29 PHHFNLDEVTLLD------SPLKTAMDLNIKMLMQYDVDRLLTPFIRQAGLHTGRYADWQ 82
Query: 151 -----YGGWENPISELRGHFVGHYLSASAQMWASTHN----ATIKEKMSTVVFSLSECQN 201
+ W +L GH GHY+SA A +A+ H+ A IKE++ ++ L +CQ+
Sbjct: 83 SRHPNFMNWGGNNFDLSGHVGGHYVSALAMAYAACHDTATKARIKERLDYMIDVLKDCQD 142
Query: 202 KIGT------GYLSAFPTELF---------DSFEALKPVWAPYYTIHKILAGLLDQYVLA 246
T G++ P SF + W P+Y HK+LAGL D Y+
Sbjct: 143 AYDTNTEGLYGFIGGQPINDMWKKMYAGDISSFRQHRG-WVPFYCQHKVLAGLRDAYLYT 201
Query: 247 DNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHD 302
N A K+A W V N TM +V L+ E GGMN+ L Y++ D
Sbjct: 202 GNTTARDLFRKLADWSVNLVSNLSDA--TMQTV------LDTEHGGMNETLADAYTLFGD 253
Query: 303 PKHLLLAHLFDKPCFL-GFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTF 361
K+L A + L G +L + HANT +P IG + E DP T
Sbjct: 254 SKYLAAARKYSHQTMLNGMQTPNPTFLDNRHANTQVPKYIGFERVAE--EDPTATTYATA 311
Query: 362 ---FMDIVNASHSYATGGTSAREFWW---DPKRLADTLGSENEETCTTYNMLKVSRHLFR 415
F D V + + GG S E + + R D L + E+C T NM+K+S +
Sbjct: 312 ASNFWDDVAQNRTVCIGGNSVGEHFLSVGNSNRYIDHL--DGPESCNTNNMMKLSEMMAD 369
Query: 416 WTKEIAYADYYERALTNGVLSIQRGTEPGVMIY--MLPLG-RGVSKARSTHGWGTKFNSF 472
T + YAD+YE A+ N +LS Q T G + + + P G R SK
Sbjct: 370 RTHDARYADFYEYAMYNHILSTQDPTTGGYVYFTTLRPQGYRIYSKVNE---------GM 420
Query: 473 WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDP 532
WCC GTG+E+ SK G +Y + +YI + +S D K H +L Q+ + P
Sbjct: 421 WCCVGTGMENHSKYGHFVYTHDADT--AVYINLFTASKLDNK--HFMLTQE-----TAYP 471
Query: 533 YLRMTLTFSSKQEVGQLS--SLNLRMPVWTYSNGAQASLNGQNLPLP---PPGNFLSATE 587
Y + T K VG+ ++ +R P WT ++ + S+NG PL ++
Sbjct: 472 YEQRT-----KITVGKSGTYTIAVRHPWWTTADYS-ISVNGTKQPLDVLQGQASYCRLKR 525
Query: 588 RWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSG 634
W D +T+ LP+SLR P Y+ A +GP LL T+
Sbjct: 526 AWKAGDVITVDLPMSLRVAEC----PNYSDYIAFEYGPVLLGAQTTA 568
>gi|332669733|ref|YP_004452741.1| hypothetical protein Celf_1219 [Cellulomonas fimi ATCC 484]
gi|332338771|gb|AEE45354.1| protein of unknown function DUF1680 [Cellulomonas fimi ATCC 484]
Length = 752
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 168/594 (28%), Positives = 247/594 (41%), Gaps = 49/594 (8%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ+T+L YLL LD L+ FR+ A LP + YG WE+ L GH GH LSA++ +W
Sbjct: 19 AQRTDLAYLLRLDPQRLLAPFRREAGLPPLAEPYGNWES--MGLDGHTGGHALSAASLLW 76
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVW 226
A+T + E + +V L CQ +GTGY+ P LF+ A L W
Sbjct: 77 AATGDPRTAELAAALVDGLDACQEALGTGYVGGVPHGVALFERIAAGEVSADSFGLNGAW 136
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
P+Y +HK +AGL+D A A + A +V F V + L E
Sbjct: 137 VPWYNLHKTVAGLVDAVRYAPAGTA-ERARRVVLRFAEWWLGVAAGLDDAQFAAMLRTEF 195
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGM + L ++T +A F L L D L HANT I V+G
Sbjct: 196 GGMCEAFADLAALTGRDDLRAMAVRFADRTLLDPLLDGRDALDGLHANTQIAKVVGWAAL 255
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS-ENEETCTTYN 405
E GD ++ F D V S GG S E + + L S E E+C T N
Sbjct: 256 AEQDGDGGWERAARTFWDAVTTHRSLVFGGDSVGEHFHPVDDFSGALTSPEGPESCNTAN 315
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH-- 463
ML+++R L + D+ ERAL N VLS Q G +Y P AR H
Sbjct: 316 MLELTRRLLLRRPDPTLLDFAERALVNHVLSAQH--PDGGFVYFTP-------ARPDHYR 366
Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
+ + FWCC GTG+E++++LG+ + +G+ L + + W V L
Sbjct: 367 VYSQPEDGFWCCVGTGLETYARLGE-LALATQGD--DLIVHLPVPVRATWGDAVVTLRSP 423
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
P +S +TL + ++ +R P W + A ++ G G +L
Sbjct: 424 Y-PDLSAAAPTTLTLDLPGPRRF----AVRVRRPAWVGGDLAL-TVGGAPADATDDGTYL 477
Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA--GHTSGEWDIKTG 641
S T W D LT + P + E + P+ + A GP +LA G T ++
Sbjct: 478 SVTRTWHDGDVLTWEHPARVVAERL----PDGSDWVAFRRGPVVLAARGGTDDLPGLRAD 533
Query: 642 TAR-------SLSALI-SPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEF 687
+R L AL +P+ + +A + V+ + +E F
Sbjct: 534 ASRMGHVAAGPLHALAGTPVVEAVDATAAASRVRTAGREVVLDTDAGPVALEPF 587
>gi|345514178|ref|ZP_08793691.1| hypothetical protein BSEG_03388 [Bacteroides dorei 5_1_36/D4]
gi|229437170|gb|EEO47247.1| hypothetical protein BSEG_03388 [Bacteroides dorei 5_1_36/D4]
Length = 1118
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 172/692 (24%), Positives = 292/692 (42%), Gaps = 124/692 (17%)
Query: 69 ILGDQKDEVSWALLYR-KIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLL 127
I+GD E + + + ++ + P + L++V +D ++ L + ++ ++
Sbjct: 117 IIGDDTTENGYPITAKIEVVDTKNTIFPKLIAHTIPLNNVKIDGNNRLTSNRDLAIKEII 176
Query: 128 MLDVDSLVWSFRKTASLPTPGKAYG-GWENPISELRGHFVGHYLSASAQMWAS----THN 182
DV ++++R T L T G GW++P ++L+GH GHY+SA A +A+ +H
Sbjct: 177 SWDVSQQLYNYRDTYGLSTEGYTRSDGWDSPETKLKGHGSGHYMSALALAYAAATNPSHK 236
Query: 183 ATIKEKMSTVVFSLSECQNKI--------------------------------------- 203
++ ++ +V L ECQ +
Sbjct: 237 EILRRNITRMVNELRECQERTFVWSEELGRYLEARDFAPEEELKKMKGTWEAFDEHKTKW 296
Query: 204 ---GTGYLSAFP------TELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNA----Q 250
G GYL+A P E++ ++ VWAPYY+IHK LAGL+D D+ +
Sbjct: 297 ATYGYGYLNAIPPHHPALIEMYRAYNNSDWVWAPYYSIHKQLAGLIDIATYMDDKSIADK 356
Query: 251 ALKMATWMVEYFYNR------VQKVITMYSVERHWYSLNE----------ETGGMNDVLY 294
AL +A M + +NR V+K T ER N E GGM + L
Sbjct: 357 ALLIAKDMGLWVWNRMHYRTYVKKDGT--QEERRTRPGNRYEMWNMYIAGEVGGMGESLA 414
Query: 295 RLYSITHDPKH----LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
RL + P+ + ++ FD P F L+ D + + HAN HIP++IG+ Y
Sbjct: 415 RLSEMVSAPEEKARLIEASNCFDSPAFYEPLSKNIDDIRNRHANQHIPMIIGALRSYLSN 474
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG----SENE-------- 398
D Y + F +++ + Y+TGG E + P ++ SE E
Sbjct: 475 NDTFYYHVSHNFWNLIQGRYRYSTGGVGNGEMFRQPYTQIVSMAMNGVSEGESHSNPHIN 534
Query: 399 ETCTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
ETC TYN+LK+++ L + + A Y DYYER L N ++ E Y +G S
Sbjct: 535 ETCCTYNLLKLTKDLNCFNPDDARYMDYYERTLYNQIIG-SLHPEHYQTTYQYAVGLNAS 593
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
K WG + CC GTG E+ K ++ YF + L++ Y+ ++ W+ +
Sbjct: 594 KP-----WGNETPQSTCCGGTGSENHVKYQEATYFVSDNT---LWVALYMPTTLHWEEKN 645
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL- 576
+ L Q+ W P T+ ++ + ++ LR+P W ++G LNG ++
Sbjct: 646 ITLQQE----CLW-PAKSSTIKVTAGE---ARFAMKLRVPYWA-TDGFDVKLNGISIATH 696
Query: 577 -PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP-----------EYASIQAILFG 624
P + +W ND + I +P + + D P E A + +++G
Sbjct: 697 YQPCSYAVIPARQWKENDIVEITMPFTKHIDYGPDKLPAKIASKDGHQLETAWVGTLMYG 756
Query: 625 PYLLAGHTSGEWDIKTGTARSLSALISPIPPS 656
P+ + W T S A I+ + P+
Sbjct: 757 PFAMTATDITNWTEATLNIDSRLASIAVVEPN 788
>gi|296129045|ref|YP_003636295.1| hypothetical protein Cfla_1194 [Cellulomonas flavigena DSM 20109]
gi|296020860|gb|ADG74096.1| protein of unknown function DUF1680 [Cellulomonas flavigena DSM
20109]
Length = 749
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 169/619 (27%), Positives = 264/619 (42%), Gaps = 79/619 (12%)
Query: 94 LPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGG 153
LPG L+ V L D Q AQ+T LEYLL LD D L+ FR+ A LP + YG
Sbjct: 10 LPG--LRAVRLTDGLFAQ------AQRTALEYLLGLDPDRLLAPFRREAGLPPVAEPYGS 61
Query: 154 WENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP- 212
WE+ L GH GH LSA++ WA+T + +V L CQ+ +GTGY+ P
Sbjct: 62 WES--LGLDGHIGGHALSAASLQWAATGDDRAAGMAHALVDGLVLCQDALGTGYVGGLPG 119
Query: 213 -TELFDSFEA---------LKPVWAPYYTIHKILAGLLD--QYVLADNA-----QALKMA 255
L++S + L W P+Y +HK AGL+D +Y AD A A+++
Sbjct: 120 GVALWESVASGGAEAGTFDLGGAWVPWYNVHKTYAGLIDAARYAPADVAVRAMRAAVRLG 179
Query: 256 TWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKP 315
W V +R+ L E GGM + L ++T D ++ LA F
Sbjct: 180 DWGVA-LSDRLDDAAFA-------RMLRTEFGGMCEAYGDLAALTGDARYAALARRFADE 231
Query: 316 CFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATG 375
LG L D L HANT + V+G + G+ L F+ V + G
Sbjct: 232 SLLGPLRESRDELDGLHANTQVAKVVG----WPAIGEADAALA---FVRTVLDHRTLVLG 284
Query: 376 GTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVL 435
G S E + P+ E E+C T N+L+V R L+ T ++A D ER L N VL
Sbjct: 285 GHSVAEH-FTPRPERHVTHREGPESCNTANLLEVERRLYERTGDVALLDAAERQLVNHVL 343
Query: 436 SIQRGTEPGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
S Q G +Y P AR H + T+ WCC GT +E++++LG+ Y
Sbjct: 344 SAQH--PDGGFVYFTP-------ARPGHYRVYSTRDACMWCCVGTALETYARLGELAYAL 394
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
+ L + + S+ + V L+ P + +T+ + ++ +++
Sbjct: 395 CGHD---LLVNLPVPSTLEEPGLRVRLDSTY-PRALATTHATLTVDVDAPTDL----AVH 446
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
LR P W + A +++G +P + +++ W + L +L E + D
Sbjct: 447 LRRPSWARGDLAP-TVDGVGVPATAERDGYVTVRRTWRAGEVLAWRLVAGPAAERLPGDD 505
Query: 613 PEYASIQAILFGPYLLA--GHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQE--- 667
A+ +GP LA G T ++ G AR P+ P + ++ + +
Sbjct: 506 ----GWVALRWGPVALAVRGDTDDLVGLRAGDARMGHVAHGPLRPLADTPVLVGSDDDIS 561
Query: 668 -----SGNSTFVMSNSNQS 681
+ TFV+ ++
Sbjct: 562 AALRPGPDGTFVLDRGAEA 580
>gi|150003704|ref|YP_001298448.1| hypothetical protein BVU_1135 [Bacteroides vulgatus ATCC 8482]
gi|149932128|gb|ABR38826.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 1116
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 170/690 (24%), Positives = 291/690 (42%), Gaps = 120/690 (17%)
Query: 69 ILGDQKDEVSWALLYR-KIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWRAQQTNLEYLL 127
I+GD E + + + ++ + P + L++V ++ ++ L + ++ ++
Sbjct: 115 IIGDDTTENGYPITAKIEVVDTKNTISPKLIAHTIPLNNVKINGNNRLTSNRDLAIKEII 174
Query: 128 MLDVDSLVWSFRKTASLPTPGKAYG-GWENPISELRGHFVGHYLSASAQMWAS----THN 182
DV ++++R T L T G GW++P ++L+GH GHY+SA A +A+ +H
Sbjct: 175 SWDVSQQLYNYRDTYGLSTEGYTRSDGWDSPETKLKGHGSGHYMSALALAYAAATNPSHK 234
Query: 183 ATIKEKMSTVVFSLSECQNKI--------------------------------------- 203
++ ++ +V L ECQ +
Sbjct: 235 EILRRNITRMVNELRECQERTFVWSEELGRYLEARDFAPEEELKKMKGTWEAFDEHKTKW 294
Query: 204 ---GTGYLSAFP------TELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNA----Q 250
G GYL+A P E++ ++ VWAPYY+IHK LAGL+D D+ +
Sbjct: 295 ATYGYGYLNAIPPHHPALIEMYRAYNNSDWVWAPYYSIHKQLAGLIDIATYMDDKSIADK 354
Query: 251 ALKMATWMVEYFYNR------VQKVITMYSVERHWYSLNE--------ETGGMNDVLYRL 296
AL +A M + +NR V+K T H + E E GGM + L RL
Sbjct: 355 ALLIAKDMGLWVWNRMHYRTYVKKDGTQEERRTHPGNRYEMWNMYIAGEVGGMGESLARL 414
Query: 297 YSITHDPKH----LLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGD 352
+ P+ + ++ FD P F L+ D + + HAN HIP++IG+ Y D
Sbjct: 415 SEMVSAPEEKARLIEASNCFDSPAFYEPLSKNIDDIRNRHANQHIPMIIGALRSYLSNND 474
Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG----SENE--------ET 400
Y + F +++ + Y+TGG E + P ++ SE E ET
Sbjct: 475 TFYYHVSHNFWNLIQGRYRYSTGGVGNGEMFRQPYTQIVSMAMNGVSEGESHSNPHINET 534
Query: 401 CTTYNMLKVSRHLFRWTKEIA-YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
C YN+LK+++ L + + A Y DYYER L N ++ E Y +G SK
Sbjct: 535 CCAYNLLKLTKDLNCFNPDDARYMDYYERTLYNQIIG-SLHPEHYQTTYQYAVGLNASKP 593
Query: 460 RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
WG + CC GTG E+ K ++ YF + L++ Y+ ++ W+ ++
Sbjct: 594 -----WGNETPQSTCCGGTGSENHVKYQEATYFVSDNT---LWVALYMPTTLHWEEKNIT 645
Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL--P 577
L Q+ W P T+ ++ + ++ LR+P W ++G LNG ++
Sbjct: 646 LQQE----CLW-PAKSSTIKVTAGE---ARFAMKLRVPYWA-TDGFDVKLNGISIATHYQ 696
Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP-----------EYASIQAILFGPY 626
P + T +W ND + I +P + + D P E A + ++ GP+
Sbjct: 697 PCSYAVIPTRQWKENDIVEITMPFTKHIDYGPDKLPAEIASKDGHQLETAWVGTLMHGPF 756
Query: 627 LLAGHTSGEWDIKTGTARSLSALISPIPPS 656
+ W T S A I+ + P+
Sbjct: 757 AMTATDITNWTEATLNIDSRLASITVVEPN 786
>gi|297606173|ref|NP_001058068.2| Os06g0613000 [Oryza sativa Japonica Group]
gi|255677225|dbj|BAF19982.2| Os06g0613000, partial [Oryza sativa Japonica Group]
Length = 279
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 115/281 (40%), Positives = 149/281 (53%), Gaps = 43/281 (15%)
Query: 610 DDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPI---------------- 653
DDRPEY+SIQA+LFGP+LLAG T G +KT + S S L +
Sbjct: 4 DDRPEYSSIQAVLFGPHLLAGLTHGNQTVKT-SNDSNSGLTPGVWEVNATHAAAAVAVWV 62
Query: 654 ---PPSFNAQLVTFTQESGNS----TFVMSNS--NQSITMEEFPVSGTDAALHATFRLIL 704
S N+QLVT TQ G++ FV+S S + ++TM+E PV+G+DA +HATFR
Sbjct: 63 TPVSQSLNSQLVTLTQRDGDAQAAAAFVLSVSIADGALTMQESPVAGSDACVHATFRAYH 122
Query: 705 KDASLSNFSSLNNVI-GKSVMLEPFDFPGMLVQQGKEDELVVSESPKEMGSSGFRLVAGL 763
+ S + + G+ V LEPFD PGM V D L V + ++ F VAGL
Sbjct: 123 SPSGASAIDAATGRLQGRDVALEPFDRPGMAV----TDALSVG---RPGPATRFNAVAGL 175
Query: 764 DKRNETVSLEAENRKGCFVSSGVN-FEPGASLKLLCSTESL--------DAGFNRAASFM 814
D TVSLE R GCFV++ + GA ++ C + D F RAASF
Sbjct: 176 DGLPGTVSLELATRPGCFVAAPTTAYLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASFT 235
Query: 815 MEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
+ YHP+SF A G RNFLL PL S +DE YTVYFN+
Sbjct: 236 QAAPLRLYHPLSFSATGTDRNFLLEPLQSLQDEFYTVYFNV 276
>gi|402081502|gb|EJT76647.1| acetyl-CoA carboxylase [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1032
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 138/491 (28%), Positives = 206/491 (41%), Gaps = 76/491 (15%)
Query: 206 GYLSAFPTELFDSF----------EALKPVWAPYYTIHKILAGLLDQYVLADNAQAL--- 252
GYL A P + +A WAP+YT HKI+ GLLD Y +N QAL
Sbjct: 404 GYLGALPEDTVLRLGPPRWAIYGGDAATNTWAPWYTQHKIMRGLLDAYYNTNNTQALDVV 463
Query: 253 -KMATW------MVEYFYNRVQKVITMYSVERHW-YSLNEETGGMNDVLYRLYSITHDPK 304
KMA W + + Y +T + R W + E+GG N+V LY +T D +
Sbjct: 464 VKMADWAHLALTIGDKNYPGYTGNLTRDDLNRMWDLYIAGESGGANEVFPELYELTGDSR 523
Query: 305 HLLLAHLFDKPCFL--------GFLALQAD------YLSHFHANTHIPIVIGSQMRYEVT 350
HL A FD L L L D HAN H+P IG +E +
Sbjct: 524 HLETAKAFDNRASLFDAAVEDRDILVLTRDKNPGPRRTDRLHANMHVPQFIGYLRIFEQS 583
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAR--------EFWWDPKRLADTLGSENEETCT 402
+ Y F V +A+GGT E + + +A+ + ETCT
Sbjct: 584 REQDYLDAARNFYSWVFPHRQFASGGTGGNYPGSNNNAEMFQNRGNIANAIAENGAETCT 643
Query: 403 TYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV---MIYMLPLGRGVSKA 459
TYNMLK++R+LF Y D YER L N + + T + Y PL G S+
Sbjct: 644 TYNMLKLARNLFMHEHNATYMDGYERGLFNMIAGSRADTATTADPQLTYFQPLTPGASRD 703
Query: 460 RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
N+ CC G+G+ES +K +++Y + L++ ++ S+ W G
Sbjct: 704 YG--------NTGTCCGGSGLESHTKYQETVYLRSA-DGSALWVNLFVPSTLTW--GEKA 752
Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP---L 576
+ + D ++T+T + G + LR+P W ++NG+ P
Sbjct: 753 FSLRQDTAFPRADSTKLTVTAAGG---GGPLDIKLRVPAWAQRGTVTVTVNGEADPAAQT 809
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL-------- 628
P PG +L+ W D + +++P +R E DRP+ QA++ GP LL
Sbjct: 810 PLPGTYLTLARAWRAGDTIEMRMPFRVRVERAP-DRPD---TQALMRGPVLLQIVGRPPA 865
Query: 629 -AGHTSGEWDI 638
G SG W++
Sbjct: 866 TGGANSGYWEL 876
Score = 47.0 bits (110), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 48/107 (44%), Gaps = 9/107 (8%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY--GGWEN 156
L +V L D +L + ++L D + F K A P+ G GGWE+
Sbjct: 50 LDQVRLGD------GLLQEKRDRTKDFLREFDERRFLVLFNKQAGRPSAGGVAVPGGWED 103
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKI 203
L GH+ GHY++A +Q +A K K+ +V L+ CQ I
Sbjct: 104 G-GLLSGHWAGHYMTALSQAYADQGEEVFKAKLDWMVQELAACQKAI 149
>gi|357472937|ref|XP_003606753.1| hypothetical protein MTR_4g065240 [Medicago truncatula]
gi|355507808|gb|AES88950.1| hypothetical protein MTR_4g065240 [Medicago truncatula]
Length = 184
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 91/176 (51%), Positives = 112/176 (63%), Gaps = 6/176 (3%)
Query: 3 FGFVLFFFFCFGLALGKQCTNQSPYDSHAFRYEL-TSTNKTWKEEVL---SHFHLTPTDD 58
F +V G A K+C N P SH R EL S N+TWK+EV+ SH H+TP+D+
Sbjct: 4 FVYVFLALILCGCANSKECINNLP-QSHTLRTELMASKNETWKKEVMMYQSHVHVTPSDE 62
Query: 59 SAWSSLIPSKI-LGDQKDEVSWALLYRKIKNPGGFDLPGNFLKEVSLHDVWLDQSSVLWR 117
SAW +IP ++ L +K V L R++KN P FLKEV L DV L + S+ +
Sbjct: 63 SAWQEMIPKEMFLTQEKPNVIGLLSNREMKNADVSKPPVGFLKEVPLGDVRLLEGSIHAQ 122
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSAS 173
AQ+TNLEYLLMLDVD L+WSFRK A LPTPG YGGWE P ELRGHFVG +SA+
Sbjct: 123 AQKTNLEYLLMLDVDRLIWSFRKMAGLPTPGAPYGGWEKPDQELRGHFVGCNVSAT 178
>gi|433651701|ref|YP_007278080.1| hypothetical protein Prede_0695 [Prevotella dentalis DSM 3688]
gi|433302234|gb|AGB28050.1| hypothetical protein Prede_0695 [Prevotella dentalis DSM 3688]
Length = 1032
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 166/579 (28%), Positives = 261/579 (45%), Gaps = 78/579 (13%)
Query: 95 PGNF-LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--- 150
P +F L EV+L D S A + N + LL D D L+ F + A L T A
Sbjct: 22 PHHFDLSEVTLFD------SPFKTAMELNFKVLLDYDADRLLAPFVRQAGLNTGDYAGWQ 75
Query: 151 -----YGGWENPISELRGHFVGHYLSASAQMWASTHNA----TIKEKMSTVVFSLSECQN 201
+ W +L GH GHYLSA A +A+ +A +K+++ ++ L +CQ+
Sbjct: 76 TLHPNFANWGGNGFDLSGHVGGHYLSALALAYAACRDAGMKARLKQRLEYMLKVLKDCQD 135
Query: 202 KIG------TGYLSAFP-----TELF----DSFEALKPVWAPYYTIHKILAGLLDQYVLA 246
G++ P +L+ F +++ W P+Y HK+LAGL D YV A
Sbjct: 136 AYDGNTEGLRGFIGGQPINEAWKKLYAGDVSGFRSVRG-WVPFYCQHKVLAGLRDAYVYA 194
Query: 247 DNAQALKMATWMVEYFYNRVQKV--ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPK 304
N +A +M + ++ N V ++ M SV L+ E GGMN+ L Y++ D K
Sbjct: 195 GNKEAREMFRKLADWSVNVVARLDNAAMQSV------LDTEHGGMNESLADAYTLFGDQK 248
Query: 305 HLLLAHLFDKPCFLGFLALQ-ADYLSHFHANTHIPIVIGSQMRYEVTGDPL---YKLIGT 360
++ A + L + +Q A +L + HANT +P IG + E G L Y+L
Sbjct: 249 YMDAAQKYSHQTMLNGMQMQNATFLDNRHANTQVPKYIGFERIGEQGGSELQKKYELAAG 308
Query: 361 FFMDIVNASHSYATGGTSAREFWW---DPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
F + V + + GG S E + + R D L + E+C + NMLK+S L T
Sbjct: 309 NFWNDVALNRTVCIGGNSVAEHFLSAANSHRYIDHL--DGPESCNSNNMLKLSEMLSDNT 366
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ YAD+YE N +LS Q + G +Y L + + + WCC G
Sbjct: 367 HDARYADFYEYTTWNHILSTQD-PKTGGYVYFTTL-----RPQGYRIYSQVNQGMWCCVG 420
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
TG+E+ SK G +Y + +V +Y+ + +S + L Q+ ++P R+T
Sbjct: 421 TGMENHSKYGHFVYTHDGDSV--IYVNLFTASKL--ANAKFALTQQT--AYPYEPQTRIT 474
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL---PPPGNFLSATERWSYNDK 594
+ + G +L +R P WT + G +NG+ + P + T +W D
Sbjct: 475 I------DKGGSYTLAVRHPWWT-TEGYAILVNGEKQQVAVTPGKAGYARLTRKWKRGDV 527
Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+T+ LP+ LRT P Y A +GP LLA T+
Sbjct: 528 VTVALPMQLRTVEC----PNYTDYVAFEYGPLLLAAQTT 562
>gi|340345934|ref|ZP_08669064.1| acetyl-CoA carboxylase [Prevotella dentalis DSM 3688]
gi|339612921|gb|EGQ17717.1| acetyl-CoA carboxylase [Prevotella dentalis DSM 3688]
Length = 1039
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 166/579 (28%), Positives = 261/579 (45%), Gaps = 78/579 (13%)
Query: 95 PGNF-LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKA--- 150
P +F L EV+L D S A + N + LL D D L+ F + A L T A
Sbjct: 29 PHHFDLSEVTLFD------SPFKTAMELNFKVLLDYDADRLLAPFVRQAGLNTGDYAGWQ 82
Query: 151 -----YGGWENPISELRGHFVGHYLSASAQMWASTHNA----TIKEKMSTVVFSLSECQN 201
+ W +L GH GHYLSA A +A+ +A +K+++ ++ L +CQ+
Sbjct: 83 TLHPNFANWGGNGFDLSGHVGGHYLSALALAYAACRDAGMKARLKQRLEYMLKVLKDCQD 142
Query: 202 KIG------TGYLSAFP-----TELF----DSFEALKPVWAPYYTIHKILAGLLDQYVLA 246
G++ P +L+ F +++ W P+Y HK+LAGL D YV A
Sbjct: 143 AYDGNTEGLRGFIGGQPINEAWKKLYAGDVSGFRSVRG-WVPFYCQHKVLAGLRDAYVYA 201
Query: 247 DNAQALKMATWMVEYFYNRVQKV--ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPK 304
N +A +M + ++ N V ++ M SV L+ E GGMN+ L Y++ D K
Sbjct: 202 GNKEAREMFRKLADWSVNVVARLDNAAMQSV------LDTEHGGMNESLADAYTLFGDQK 255
Query: 305 HLLLAHLFDKPCFLGFLALQ-ADYLSHFHANTHIPIVIGSQMRYEVTGDPL---YKLIGT 360
++ A + L + +Q A +L + HANT +P IG + E G L Y+L
Sbjct: 256 YMDAAQKYSHQTMLNGMQMQNATFLDNRHANTQVPKYIGFERIGEQGGSELQKKYELAAG 315
Query: 361 FFMDIVNASHSYATGGTSAREFWW---DPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
F + V + + GG S E + + R D L + E+C + NMLK+S L T
Sbjct: 316 NFWNDVALNRTVCIGGNSVAEHFLSAANSHRYIDHL--DGPESCNSNNMLKLSEMLSDNT 373
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
+ YAD+YE N +LS Q + G +Y L + + + WCC G
Sbjct: 374 HDARYADFYEYTTWNHILSTQD-PKTGGYVYFTTL-----RPQGYRIYSQVNQGMWCCVG 427
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
TG+E+ SK G +Y + +V +Y+ + +S + L Q+ ++P R+T
Sbjct: 428 TGMENHSKYGHFVYTHDGDSV--IYVNLFTASKL--ANAKFALTQQT--AYPYEPQTRIT 481
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL---PPPGNFLSATERWSYNDK 594
+ + G +L +R P WT + G +NG+ + P + T +W D
Sbjct: 482 I------DKGGSYTLAVRHPWWT-TEGYAILVNGEKQQVAVTPGKAGYARLTRKWKRGDV 534
Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+T+ LP+ LRT P Y A +GP LLA T+
Sbjct: 535 VTVALPMQLRTVEC----PNYTDYVAFEYGPLLLAAQTT 569
>gi|256831608|ref|YP_003160335.1| hypothetical protein Jden_0363 [Jonesia denitrificans DSM 20603]
gi|256685139|gb|ACV08032.1| protein of unknown function DUF1680 [Jonesia denitrificans DSM
20603]
Length = 744
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 152/540 (28%), Positives = 234/540 (43%), Gaps = 62/540 (11%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLS--ASAQM 176
+ T L+Y L LD LV +R+ + LP +YG WEN S L GH +GH LS A A +
Sbjct: 20 RNTALDYTLALDPQRLVAPYRRESGLPLLAPSYGNWEN--SGLDGHTLGHVLSALAYASV 77
Query: 177 WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL------------FDSFEALKP 224
+ +A +E++ +V + ECQ +GTGY+ P DSF L
Sbjct: 78 THTPRSAEARERLEWLVAQVQECQAAVGTGYVGGIPQGRALWERIGNGDVDADSF-GLHG 136
Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
W P+Y +HK+ AGL+D +A A A + + ++ +V E+ L
Sbjct: 137 AWVPWYNLHKVFAGLVDAGWVAGVAVARDVVVGLANWWL----RVAARLRDEQFQAMLVT 192
Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
E G +N L T D ++L +A F L D L HANT I +G
Sbjct: 193 EFGAINGAFADLAVHTGDARYLEMAKRFTDRALFDALVAGEDPLVGLHANTQIAKALGWA 252
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW-WDPKRLADTLGSENEETCTT 403
G Y + D+V H+ + GG S RE DP A + + E+C T
Sbjct: 253 RVALAGGGREYLVAARRVWDVVVRDHTLSFGGNSVREHCAGDP--WAPFVSEQGPESCNT 310
Query: 404 YNMLKVSRHLFRWTKEI-AYADYYERALTNGVLSIQRGTEP-GVMIYMLPLGRGVSKARS 461
+NML+++ L + D+ E AL N V+S P G +Y P AR
Sbjct: 311 HNMLRLTGALLELGESPRPLVDFVEVALMNHVVS---SVHPEGGFVYFTP-------ARP 360
Query: 462 TH--GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVV 519
H + FWCC GTG+E K G+ +Y + GL++ ++S +W S V
Sbjct: 361 QHYRVYSQVHECFWCCVGTGMEHLMKNGELVYSPD---ATGLFVHLGVASVGEWASRGVR 417
Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS------NGAQASLNGQN 573
+ Q P D + + + + E G+ ++++R+P W N A S ++
Sbjct: 418 VRQ---PWTLDDAGITVGIDAVGQGE-GEF-AIHVRVPGWVDGPVTVRVNDAVISTRVEH 472
Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+++ T WS D+L + LP +LR + P + S Q GP++LA +
Sbjct: 473 ------SGYVTVTRVWSAGDRLDVSLPATLRLRPAPRNAP-FVSFQK---GPWVLAARAT 522
>gi|257068350|ref|YP_003154605.1| hypothetical protein Bfae_11690 [Brachybacterium faecium DSM 4810]
gi|256559168|gb|ACU85015.1| uncharacterized conserved protein [Brachybacterium faecium DSM
4810]
Length = 752
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 149/538 (27%), Positives = 225/538 (41%), Gaps = 52/538 (9%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMW 177
AQ+T+LEYLL L+ + L+ FR+ A + T YG WE+ L GH GH L+A++ MW
Sbjct: 25 AQRTDLEYLLGLEAERLLAPFRREAGIATTAAPYGNWES--MGLDGHIGGHALAAASLMW 82
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSFEA---------LKPVW 226
A+T + E +V L ECQ ++GTGY+ P EL+ L W
Sbjct: 83 AATGDERAAELARQLVEGLRECQARLGTGYVGGIPGGAELWAQIRTIASQAQTWDLGGAW 142
Query: 227 APYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
P+Y +HK AGL++ A A A ++ + ++ E L E
Sbjct: 143 VPWYNLHKTFAGLIEAVRHAPAGTA-SCALEVLRGLGDWGARLGEQLDDEAFARMLRTEF 201
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
GGM L IT + +H +A F L L D L HANT I VIG
Sbjct: 202 GGMCAAYADLAEITGEERHARMARRFADESLLAPLRAGRDELDGMHANTQIAKVIGWPAL 261
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTS-AREFWWDPKRLADTLGSENEETCTTYN 405
E F+ V + A GG S A F +P LA E E+C T N
Sbjct: 262 GETAA-------AETFVRTVLERRTLAFGGNSVAEHFTAEP--LAHVTDREGPESCNTVN 312
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
ML+ + L+ D ER L VLS Q G +Y P G + S
Sbjct: 313 MLEAEQRLYEHGGGPWLFDAIERQLVGHVLSAQH--PEGGFVYFTPARPGHYRVYS---- 366
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
T+ N WCC GTG+E +++ G + + G+ L + + +S W+ + +
Sbjct: 367 -TRENGMWCCVGTGLEVYARTGRFTFAAQGGD---LLVNLPLPASLRWEEQGIAAHLD-- 420
Query: 526 PIVSWDPYLR----MTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP-G 580
PY R +T + + ++++R+P W + S++GQ++
Sbjct: 421 -----SPYPRPAPETPVTLRIEADAPSDVAVHVRVPAWA-TTPPTVSVDGQDVTAHAELD 474
Query: 581 NFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDI 638
+++ RW + L L E + P S ++ +GP +LA GE D+
Sbjct: 475 GYVTVRRRWQGGEVLRWTLHAGPSWEPL----PGEDSWGSLRWGPVVLAAR-DGEEDL 527
>gi|82523843|emb|CAI78585.1| hypothetical protein [uncultured candidate division OP8 bacterium]
Length = 766
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 113/385 (29%), Positives = 169/385 (43%), Gaps = 72/385 (18%)
Query: 98 FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAY--GGWE 155
L V L+ + ++ + + L L ++ D+ +++FR LP P A GGW+
Sbjct: 378 LLGRVVLNRDAAGRETLFMKNRDKFLSTLAEVNPDNFLYNFRDAFGLPQPEGAVQLGGWD 437
Query: 156 NPISELRGHFVGHYLSASAQMWA-----STHNATIKEKMSTVVFSLSECQNKIG------ 204
+ + LRGH GHYLSA AQ +A S A +KM+ ++ +L + K G
Sbjct: 438 DQTTRLRGHASGHYLSALAQAYAGSVYDSALQANFLQKMNYMIDTLYDLAQKSGRPVESG 497
Query: 205 ------------------------------------TGYLSAFPTELFDSFE-------A 221
G++SA+P + F E
Sbjct: 498 GLCNPDPTTVPSGPGKSGYDSDLSQKGLRHDYWNWGVGFISAYPPDQFIMLEQGATYGGT 557
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
+WAPYYT+HKILAGLLD Y + N +AL++A M + R+Q V +
Sbjct: 558 NAQIWAPYYTLHKILAGLLDCYEVGGNPKALQIAEGMGGWALKRLQAVPEATRIAMWSRY 617
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFL-------GFLALQADYLSHFHAN 334
+ E GGMN+V+ RL+ +T L A LFD F LA D + HAN
Sbjct: 618 IAGEYGGMNEVMARLFRLTGKRDFLACAKLFDNTNFFFGNAGREHGLAKNVDTVRGRHAN 677
Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGT-------SAREFWWDPK 387
HIP +IG+ Y +G+P+Y I F +I + Y GG +A F +P
Sbjct: 678 QHIPQIIGTLETYRGSGEPVYHEIAENFWEIARNHYMYNIGGVGGAKNPRNAECFTAEPD 737
Query: 388 -RLADTLGSENE-ETCTTYNMLKVS 410
+ A+ + + ETC TYN+LK +
Sbjct: 738 TQFANGFSMDGQNETCATYNLLKCA 762
>gi|225351247|ref|ZP_03742270.1| hypothetical protein BIFPSEUDO_02839 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
gi|225158703|gb|EEG71945.1| hypothetical protein BIFPSEUDO_02839 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
Length = 853
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 154/571 (26%), Positives = 230/571 (40%), Gaps = 90/571 (15%)
Query: 118 AQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISE--------LRGHFVGHY 169
AQQ YLL LDVD L++ FR+ A LP P A G NP++ L GH GHY
Sbjct: 24 AQQAGARYLLDLDVDRLLYPFRREAGLPQPTDADG---NPVTSYPNWEETGLDGHIAGHY 80
Query: 170 LSASAQMWASTHNAT-IKEKMSTVVFSLSECQ-----NKIGTGYLSAFPTE--LFDSFEA 221
LSA + ++ +TVV S ECQ + + GY+ P +F A
Sbjct: 81 LSACVGFAQVADDPQPFIDRAATVVRSWHECQQSFAGDAVMRGYVGGVPDSRTVFGRLAA 140
Query: 222 ---------LKPVWAPYYTIHKILAGLLDQY-----VLADNAQALKMATWMVEYFYNRVQ 267
+ W P Y +HK AGLLD + + +Q + + ++ R+
Sbjct: 141 GDVESQNFSMNDAWVPMYNVHKTFAGLLDTWADFASIDEQTSQLARTVVLDLADWWCRIA 200
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
+ + + +R L E GGM + LY+ T + ++ ++A F LA D
Sbjct: 201 EPLDDETFDR---ILVSEFGGMCESFAELYARTGEERYHVMADRFKDHAIFDPLAQGEDV 257
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
L+ HANT IP V+G + + D F D V S + G S E +
Sbjct: 258 LTGMHANTQIPKVLGWERLGAICNDEQADAATNTFWDSVVHHRSVSIGAHSVSEHFHPTD 317
Query: 388 RLADTLGS-ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
+ + S E ETC +YNM K++ L+ + Y ++YER L N +LS +PG
Sbjct: 318 DFSSMIESREGPETCNSYNMSKLAERLWLRSGSADYINFYERVLENHLLSTINPKQPG-F 376
Query: 447 IYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIESFSKLGDSIY------------- 491
+Y P+ RS H + T FWCC G+G+E+ ++ G IY
Sbjct: 377 VYFTPM-------RSQHYRAYSTPQECFWCCVGSGLENHARYGRLIYALQRPAAQDSADS 429
Query: 492 --------FEEEGNVPG---------LYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
E GN L + YI S+FD + + Q+ I Y
Sbjct: 430 AAAGFASSAAETGNTVSNNAEAEATRLLVNLYIDSTFDCPEQGLRITQRAARIEDGVDYT 489
Query: 535 RMTLTFSSKQE-----VGQL--SSLNLRMPVWTYSNGAQASLNGQNLPLPP-----PGNF 582
+T T S E G L ++L LR P W G + P P +
Sbjct: 490 -VTFTLESTAEHVPDTPGGLRETTLFLRRPWWAEHYGVMEATCAVCTLDPARTNDIPEGY 548
Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
L RW+ ++ ++L + E + D P
Sbjct: 549 LPLRLRWNGVAEVVMRLRPRITVERMPDGSP 579
>gi|297725075|ref|NP_001174901.1| Os06g0612950 [Oryza sativa Japonica Group]
gi|255677224|dbj|BAH93629.1| Os06g0612950 [Oryza sativa Japonica Group]
Length = 198
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 82/168 (48%), Positives = 98/168 (58%), Gaps = 24/168 (14%)
Query: 19 KQCTN-QSPYDSHAFRYELTSTNKT---WKEEVLSHFHLTPTDDSAWSSLIPSKILGDQK 74
K+CTN + SH R L S++ W+EE HL PTD++AW L+P +
Sbjct: 23 KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMP--LAAASA 80
Query: 75 DEVSWALLYRKIKNPGGFDLPGN-----------FLKEVSLHDVWLDQSS----VLWRAQ 119
E WA+LYR +K G + G+ FL+EVSLHDV LD V RAQ
Sbjct: 81 SEFDWAMLYRSLK---GAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQ 137
Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVG 167
QTNLEYLL+L+VD LVWSFR A LP PGK YGGWE P ELRGHFVG
Sbjct: 138 QTNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVG 185
>gi|302547294|ref|ZP_07299636.1| putative secreted protein [Streptomyces hygroscopicus ATCC 53653]
gi|302464912|gb|EFL28005.1| putative secreted protein [Streptomyces himastatinicus ATCC 53653]
Length = 740
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 142/294 (48%), Gaps = 28/294 (9%)
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVS 410
G+ Y F +V Y+ GGT E + +A TL +N ETC TYNMLK+S
Sbjct: 337 GETAYAAAARNFWGMVAGPRMYSLGGTGQGEMFRARNAIAATLDGKNAETCATYNMLKLS 396
Query: 411 RHLFRWTKEIAYADYYERALTNGVLSIQRG----TEPGVMIYMLPLGRGVSKARSTHGWG 466
R LF + AY DYYER LTN +L+ +R T P V Y + +G GV +
Sbjct: 397 RQLFFREPDAAYMDYYERGLTNHILASRRDAPSTTSPEV-TYFVGMGPGVRREYD----- 450
Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDP 526
N+ CC GTG+E+ +K DS+YF LY+ ++S+ W V+ Q D
Sbjct: 451 ---NTGTCCGGTGMENHTKYQDSVYFRSADGT-ALYVNLALASTLRWPERGFVIEQTGD- 505
Query: 527 IVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG-QNLPLPPPGNFLSA 585
+ TLTF +E G + LR+P W + G ++NG + PG++L+
Sbjct: 506 ---YPAEGVRTLTF---REGGGRLEVKLRVPAWA-TGGFTVTVNGVRQRGKAVPGSYLTL 558
Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIK 639
+ W D++ I P LR E DD ++Q++ +GP LL SGE + +
Sbjct: 559 SRDWRRGDRIRISAPYRLRIERALDD----PAVQSVFYGPVLLVAR-SGETEFR 607
>gi|427409221|ref|ZP_18899423.1| hypothetical protein HMPREF9718_01897 [Sphingobium yanoikuyae ATCC
51230]
gi|425711354|gb|EKU74369.1| hypothetical protein HMPREF9718_01897 [Sphingobium yanoikuyae ATCC
51230]
Length = 616
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 146/572 (25%), Positives = 236/572 (41%), Gaps = 78/572 (13%)
Query: 125 YLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNAT 184
+ L LD D ++ FR+ A LP PG GGW + + G G Y+S A++ A+T +
Sbjct: 82 HYLALDNDRVLKVFRQQAGLPAPGPDMGGWYDRDGFVPGLAFGQYMSGLARIGATTGDKA 141
Query: 185 IKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYV 244
+ K++ +V E K Y A P + WA YT+ K + GL+D Y
Sbjct: 142 VHAKVAALVQGFGEFITKTRNPY--AGPKA--------QDQWAA-YTMDKYVVGLIDAYR 190
Query: 245 LADNAQALKMATWMVEYF--------YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRL 296
L+ QA + +E +R+ KV Y +ET +++ L+ +
Sbjct: 191 LSGVEQAKTLLPITIEKCRPYISPVSRDRIGKVDPPY----------DETYVLSENLFHV 240
Query: 297 YSITHDPKHLLLA--HLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPL 354
IT K+ +A +L +K F A Q D L HA +H + Y GD
Sbjct: 241 ADITGQDKYRQMAIHYLLNKEWFDPLAAGQ-DVLPTKHAYSHTIALSSGAQAYLHLGDEK 299
Query: 355 YKLIGTFFMDIVNA-----SHSYATGGTSAREFWWD--PKRLADTLGSEN---EETCTTY 404
Y+ +VNA +A+GG E + + +LA +L S E C ++
Sbjct: 300 YRKA------LVNAWTYMEPQRFASGGWGPEEQFVELHQGKLAASLKSSKAHFETPCGSF 353
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG 464
+K++R+L R+T E Y D ER L N +L+ + G Y G K
Sbjct: 354 ADMKLARYLVRFTGEPVYGDGLERTLYNTMLATRLPDSDGGYPYYSNYGAAAEKLYYHQK 413
Query: 465 WGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK--SGHVVLNQ 522
W CC GT ++ + ++YF ++ L + + S+ W G V + Q
Sbjct: 414 WP-------CCSGTLVQGVADYVLNLYFHDDN---ALVVNMFAPSTVKWDRPGGAVQVEQ 463
Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNF 582
+ + + R+T+T ++ LR+P W + GAQ +NG + PG
Sbjct: 464 QTN--YPAEDTTRLTVTAPGNGRF----AMKLRIPAW--AKGAQLRVNGAAQGV-QPGTL 514
Query: 583 LSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGT 642
W D + + LP +LRT +I D P+ I A++ G + G W
Sbjct: 515 AVIDRTWKAGDMVELTLPQALRTLSIDDKNPD---IAAVMRGAVMYVGLNP--WTGVEDQ 569
Query: 643 ARSLSALISPIPPSFNAQLVTFTQESGNSTFV 674
+L A + P+P S + + E+G V
Sbjct: 570 PLALPASLKPVPGSS----LNYAMETGGRNLV 597
>gi|336429869|ref|ZP_08609826.1| hypothetical protein HMPREF0994_05832 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336001322|gb|EGN31460.1| hypothetical protein HMPREF0994_05832 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 606
Score = 126 bits (317), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 161/619 (26%), Positives = 248/619 (40%), Gaps = 111/619 (17%)
Query: 99 LKEVSLHDVWLDQSSVLW-RAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGW-EN 156
LK+ +V L S LW R ++ E L + DSL++ FR A L PG+ GW N
Sbjct: 4 LKDFRYRNVELKNS--LWERQRRETAETYLAIPNDSLLYYFRTLAGLEAPGEGLTGWYGN 61
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
S G L A A+++A T + +KEK L+E G G +A ++F
Sbjct: 62 GASTF-----GQKLGAFAKLYAVTGDYRLKEK----AVYLAE-----GWGKCAAANKKVF 107
Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
D + Y K+L G LD Y + L + + + R ++ I ++
Sbjct: 108 DCNDT--------YVYEKLLGGFLDMYENLGYEKGLAYCSGLTDSAAARFKRDIPRDGLQ 159
Query: 277 R---------HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADY 327
WY+L E LYR Y +T + K+L A +D L +
Sbjct: 160 GPELCENNMIEWYTLPEN-------LYRAYQLTGEQKYLDFAQEWDYTYLWDKLNNKDSA 212
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFW--- 383
+ HA + + + + M YEVTG Y I + +I H+YATGG E
Sbjct: 213 IGPRHAYSQVNSLSSAAMAYEVTGKKYYLDAIENGYTEITE-RHTYATGGYGPAECLFAE 271
Query: 384 ------------WDPKRLA--------------DTLGSENEETCTTYNMLKVSRHLFRWT 417
WDP R + D GS E +C + + K+ +L R T
Sbjct: 272 EEGFLGEMLKDSWDPTRKSPVYRNFGGGLVGRNDNWGS-CEVSCCAWAVFKICNYLLRIT 330
Query: 418 KEIAYADYYERALTNGVLSIQRGTEPG-VMIYMLPLGRGVSKA---RSTHGWGTKFNSFW 473
+ Y + E+ L NGV G VM Y G K+ R G G F +
Sbjct: 331 GKAKYGAWAEQMLINGVAGQPPIDSQGHVMYYADYFVDGAVKSVQDRRLQGNGANF-EWQ 389
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS--SFDWKSGHVVLNQKVDPIVSWD 531
CC GT + ++ + +Y+ +E G+Y+ QY+ S F + VL + VS
Sbjct: 390 CCTGTFPQDVAEYANMLYYTDE---EGIYVSQYMKSRAEFTIRGEKAVLENCSEEDVS-- 444
Query: 532 PYLRMTLTFSSKQEVGQLS-SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATER-W 589
P R + Q G+L ++ R+P W + +NG++ L P + + ER W
Sbjct: 445 PIRRFRI-----QTRGELPFRISFRIPHWAKGEN-RILVNGEDSGLEPLPDSWAVLERVW 498
Query: 590 SYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG----------HTSGEW--- 636
+D +T+ P SL + + + + I A++FGP +LA EW
Sbjct: 499 QEDDVITVTCPFSLAFKPVDEKNKD---IAALMFGPVVLAADKMTLFDGDMEKPEEWITC 555
Query: 637 -DIKTGTARSLSALISPIP 654
D K R+L + P P
Sbjct: 556 VDEKEMLFRTLPGHVCPYP 574
>gi|310794204|gb|EFQ29665.1| hypothetical protein GLRG_04809 [Glomerella graminicola M1.001]
Length = 436
Score = 125 bits (314), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 149/348 (42%), Gaps = 49/348 (14%)
Query: 119 QQTNLEYLLMLDVDSLVWSFRKTASLPTP-GKAYGGWENPISELRGHFVGHYLSASAQMW 177
Q L YL +DVD L++ FRK L T + GW+ P R H GH+L+A A +
Sbjct: 59 QARTLVYLKWIDVDRLLYVFRKNHGLYTNNAQPNAGWDAPDFPFRSHVQGHFLNAWAFCY 118
Query: 178 ASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILA 237
A ++ K + + L +CQ+ PYY IHK +A
Sbjct: 119 AQLQDSECKRRATYFAAELKKCQH------------------NNTNSRNVPYYAIHKTMA 160
Query: 238 GLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLY 297
GLLD + L + A + M + R K+ + ++ + GGMN+VL L
Sbjct: 161 GLLDVWRLIGDTNARDVLLAMAAWVDLRTGKL----TYQQMQDMMGTVFGGMNEVLADLC 216
Query: 298 SITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKL 357
T D + + +A FD LA D LS HANT +
Sbjct: 217 RQTGDQRWVTVAQRFDHAAIFNPLASNQDSLSGLHANT--------------------QD 256
Query: 358 IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWT 417
I +I ++HSYA GG S E + P +A L S+ E C TYNMLK++ L+
Sbjct: 257 IARNAWNITVSAHSYAIGGNSQAEHFRLPNAIAGFLTSDTCEACNTYNMLKLTGELWLTN 316
Query: 418 KE-IAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSKA 459
+ Y D+YERAL N +L Q + G + Y PL RGV A
Sbjct: 317 PDTTTYFDFYERALLNHLLGQQDPSNSHGHVTYFTPLNPGGRRGVGPA 364
>gi|94967195|ref|YP_589243.1| hypothetical protein Acid345_0164 [Candidatus Koribacter versatilis
Ellin345]
gi|94549245|gb|ABF39169.1| conserved hypothetical protein [Candidatus Koribacter versatilis
Ellin345]
Length = 602
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 140/550 (25%), Positives = 234/550 (42%), Gaps = 58/550 (10%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWE--N 156
L E DV L +S + R Q + L+ L+ D+L+ FR P PG+ GGW +
Sbjct: 37 LDEFGYGDVSL-ESELHNRQFQNTHDVLMGLEDDALLKPFRAMVGQPPPGRDLGGWYCFD 95
Query: 157 PISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELF 216
P VG +A+ W S + + + V N++ + +
Sbjct: 96 PNYNPNDVGVGFAPTATFGQWISALSRSYALRPDPAVRDKVIRLNRL-------YAQTIS 148
Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVE 276
F LK + P Y K++ GL+D + + ALK+ +E + ++ ++VE
Sbjct: 149 PEFYGLKNRF-PAYCYDKLVCGLIDAHQYVGDPDALKI----LERTTDTATPLLPGHAVE 203
Query: 277 RH--WYSLNE------ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYL 328
W S+ + E+ +++ L+ Y ++ L + + LA L
Sbjct: 204 HGTVWRSVKDDGYTWDESYTISENLFLAYRRGAGDRYRALGKQYLDDTYYNPLAEGRSDL 263
Query: 329 SHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK- 387
HA +H+ + + Y GD Y D V A SYATGG A E P
Sbjct: 264 EGRHAYSHVNSLCSAMQAYLTLGDEKYFRAAKNGFDFVLA-QSYATGGWGADETLRAPNS 322
Query: 388 -RLADTLGSEN---EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
+A +L + E C +Y K++R+L R T++ Y D ER + N +L G P
Sbjct: 323 PEVAKSLTGTHHSFETPCGSYAHFKLTRYLLRVTRDSRYGDSMERVMYNTIL----GALP 378
Query: 444 GVMIYMLPLGRGVSKARSTHGWGTKF--NSFW-CCYGTGIESFSKLGDSIYFEEEGNVPG 500
++P GR + G+KF ++ W CC GT + + G S Y + G
Sbjct: 379 -----LMPDGRTFYYSDYNFK-GSKFYHDARWPCCSGTMPQIATDYGISTYLRDPQ---G 429
Query: 501 LYIIQYISSSFDWKS--GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+Y+ YI S+ W+ V L QK +DP + + L+ + ++E ++LR+P
Sbjct: 430 IYVNLYIPSTVRWQQDGAQVSLTQKT--AYPFDPVVEIELSTTKQREF----EVHLRIPA 483
Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
W + A +NG+ +P F + W D++ ++LPL R E + +R A +
Sbjct: 484 W--AEQASIEVNGKREGVPVAERFATIRRTWKNGDRIQLELPLKNRLEPLNRER---AKL 538
Query: 619 QAILFGPYLL 628
A+L GP +L
Sbjct: 539 VALLNGPLVL 548
>gi|237718517|ref|ZP_04548998.1| LOW QUALITY PROTEIN: acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
gi|229452224|gb|EEO58015.1| LOW QUALITY PROTEIN: acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
Length = 502
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/274 (32%), Positives = 132/274 (48%), Gaps = 25/274 (9%)
Query: 366 VNASHSYATGGTSARE-FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYAD 424
V A+ S A GG S RE F D L+ E E+C TYNML+++ LFR YAD
Sbjct: 2 VTANRSLAFGGNSRREHFPDDTDYLSYVDDREGPESCNTYNMLRLTEGLFRMNPTADYAD 61
Query: 425 YYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH--GWGTKFNSFWCCYGTGIES 482
+YERAL N +LS Q E G +Y P AR H + + WCC GTG+E+
Sbjct: 62 FYERALFNHILSTQH-PEHGGYVYFTP-------ARPAHYRVYSAPNEAMWCCVGTGMEN 113
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
K G+ IY + LY+ +ISS +WK + L Q S+ + LT ++
Sbjct: 114 HGKYGEFIYAHTGDS---LYVNLFISSRLEWKKRRISLTQ----TTSFPNEGKTCLTITA 166
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN-FLSATERWSYNDKLTIQLPL 601
K+ L +R P W ++NG+++ N + + +W D + +Q+P+
Sbjct: 167 KKSTK--FPLFVRKPGWVGDGKVIITVNGKSIETTTAANSYYTINRKWKNGDVVEVQMPM 224
Query: 602 SLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
++R E ++ PEY AI+ GP LL + E
Sbjct: 225 NIRIEELK-HHPEYI---AIMRGPILLGANVGKE 254
>gi|256375993|ref|YP_003099653.1| hypothetical protein Amir_1859 [Actinosynnema mirum DSM 43827]
gi|255920296|gb|ACU35807.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 736
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 127/292 (43%), Gaps = 40/292 (13%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVTG 351
L L + T P+HL A +FD + A D L+ HAN HIPI G E TG
Sbjct: 278 ALRDLRARTGKPEHLAPARMFDLDALIDACAENRDVLAGLHANQHIPIFTGLVRLREATG 337
Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSR 411
+ Y F D+V Y GGTS EFW P +A+TL +N ETC +NMLK+ R
Sbjct: 338 EQRYLDAARNFWDMVVPRRLYRIGGTSTGEFWRAPGVIAETLADDNAETCCAHNMLKLGR 397
Query: 412 HLFRWTKEIAYADYYERALTNGVLSIQRGTEPG---VMIYMLPLGRGVSKARSTHGWGTK 468
LF N +L ++ +M Y + L G + + T
Sbjct: 398 ALF-----------------NQILGSKQDAPSADVPLMTYFIGLAPGSVRDFTPEQGAT- 439
Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
CC GTG+ES +K DS+YF +E LY+ + ++ W +
Sbjct: 440 -----CCEGTGLESAAKYQDSVYFHDEKT---LYVNLFAPTTAHWNETTITRGAHF---- 487
Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPG 580
P+ R T + G ++ +R+P W + GA ASLNG+ L +P G
Sbjct: 488 ---PHERGTSPGIGGK--GGRVTIKVRVPSW--ARGASASLNGRPLAVPAAG 532
>gi|225874351|ref|YP_002755810.1| hypothetical protein ACP_2792 [Acidobacterium capsulatum ATCC
51196]
gi|225791337|gb|ACO31427.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
51196]
Length = 611
Score = 119 bits (297), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 130/516 (25%), Positives = 213/516 (41%), Gaps = 71/516 (13%)
Query: 120 QTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISE----------LRGHFVGHY 169
Q N + L LD D+L+ FR+ A LP PG GGW N E + GH G Y
Sbjct: 62 QANHAFFLALDEDALLKPFRERAGLPAPGPQMGGWYNFSKEFDPPNNMTGYIPGHSFGQY 121
Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPY 229
LS A+ +A+T + K K+ +V G+ A + +D + P+ P
Sbjct: 122 LSGLARAYAATGDQPTKAKVHRLV-----------RGFAEAVSPKFYDDY----PL--PC 164
Query: 230 YTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN------ 283
YT K GL+D + A + AL + ++ V + +++ R +
Sbjct: 165 YTFDKSNCGLIDAHQFAGDPNALHALSRALD----AVMPYLPSHALTRPEMAARPHPNIA 220
Query: 284 ---EETGGMNDVLYRLYSITHDPKHLLLAHLF--DKPCFLGFLALQADYLSHFHANTHIP 338
+E+ + + + Y + D K+L++A F DK + LA + L H HA +H+
Sbjct: 221 FTWDESYTLPENFFLAYKRSGDEKYLVMAQRFLQDK-SYFDPLAEGDNVLPHQHAYSHVN 279
Query: 339 IVIGSQMRYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDP------KRLAD 391
+ + Y V G + + F +++ S+ATGG E + +P K L +
Sbjct: 280 ALNSASQAYLVLGSEKHLRAARNGFQFVLD--QSFATGGWGPNETFVEPGSGGLYKSLTE 337
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
T S E C Y KV+R+L R T + Y D E+ L N +L + G Y
Sbjct: 338 THAS-FETPCGAYGHFKVTRYLMRITGDSRYGDSMEQVLYNTILGAMPLEQGGFSFYYSD 396
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
+K W CC GT + + G S YF + GLY+ ++ S
Sbjct: 397 YNNYAAKNYYPEQWP-------CCSGTFPQVTADYGISSYFH---SPEGLYVNLFVPSRA 446
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
++ G + + ++ + M + + Q S+ LR+P W G ++NG
Sbjct: 447 KFQIGGARFSLEQRTHYPYENDIAMQVRGDNPQTF----SIALRVPAWA-GKGTSITVNG 501
Query: 572 QNLPLP-PPGNFLSATERWSYNDKL--TIQLPLSLR 604
+ PG F+ W D++ +I PLSL+
Sbjct: 502 RKAEAEVKPGTFVRLHREWKDGDRIEYSIDRPLSLQ 537
>gi|336425065|ref|ZP_08605095.1| hypothetical protein HMPREF0994_01101 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336012974|gb|EGN42863.1| hypothetical protein HMPREF0994_01101 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 575
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 141/582 (24%), Positives = 240/582 (41%), Gaps = 82/582 (14%)
Query: 99 LKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI 158
KEV+L++ ++ + L + L + D+++ R++A P PG Y GW
Sbjct: 6 FKEVTLNE------GMMKKVLDETLAFYLKIPNDNILKYMRESAGKPAPGIFYTGW---Y 56
Query: 159 SELRG-HFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
RG +G +LSA ++M+A + + ++K + +C Y SA T F
Sbjct: 57 PNSRGIALIGQWLSAYSRMYAISGDEAFRQKAVYLADEFWDC-------YESAQHTAPFL 109
Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
+ + +Y + K+L D ++ A + A +++++ + +
Sbjct: 110 TSRS-------HYDVEKLLRAHCDLFLYCKYPCAKERAGYLIDFAADNLTAENIFGDNST 162
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD----------Y 327
WY+L E + + I P+ +A F+ F AD Y
Sbjct: 163 EWYTLAES-------FWDAFEILEIPRAQQMAERFEYREFWDLFYKDADPFSKRPQAGLY 215
Query: 328 LSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK 387
HA +H+ YE+T P + F + ATGG PK
Sbjct: 216 SEFCHAYSHVNSFNSCAKAYEMTKSPYFLKSLRSFYRFMQTEEVMATGGYGPNYEHLMPK 275
Query: 388 -RLADTL--GSENEET-CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP 443
R+ D L G ++ ET C TY ++ ++L R+T E Y ++ E L N + TE
Sbjct: 276 NRIIDALRTGHDSFETQCDTYAAFRLCKYLTRFTDEPEYGNWVESLLYNAAAATIPMTEE 335
Query: 444 GVMIYM--LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
G +IY + G K R GW CC GT +++ IYFE +G L
Sbjct: 336 GNIIYYSDYNMYAGYKKNRQD-GWT-------CCTGTRPLLVAEIQRLIYFEGDGE---L 384
Query: 502 YIIQYISSSFDW-KSGH-VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
YI QYI S+ W ++G+ + + Q+ + L ++L+ S+ ++ R+P W
Sbjct: 385 YISQYIPSTLHWNRNGNDISIRQETGFPEGKETTLILSLSCSAA------FPIHFRLPGW 438
Query: 560 TYSNGAQASLNGQNLPLPP---PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
+ ++ N+PLP +L+ W D+LTI LP + ++ P
Sbjct: 439 L---SGEMKVSCNNVPLPATVDKNGWLTIHSEWKEGDRLTISLPAEVWMHSLD---PVKN 492
Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGT----ARSLSALISPIP 654
A L+GP +LA SG I+T +SL+ + P+P
Sbjct: 493 GPNAFLYGPVVLAADYSG---IQTPNDWMDVQSLTEKMKPVP 531
>gi|413954826|gb|AFW87475.1| hypothetical protein ZEAMMB73_309562 [Zea mays]
Length = 161
Score = 108 bits (269), Expect = 1e-20, Method: Composition-based stats.
Identities = 69/170 (40%), Positives = 97/170 (57%), Gaps = 25/170 (14%)
Query: 691 GTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDELVVSESPK 750
GT+AA+HATFRL+ + + + G + MLEP D PGM+V D L V+ + K
Sbjct: 10 GTEAAVHATFRLVPQGGAGA---------GAAAMLEPLDMPGMVVT----DRLTVA-AEK 55
Query: 751 EMGSSGFRLVAGLDKRNETVSLEAENRKGCFVSSGVNFEPGASLKLLCSTESLD-----A 805
G++ F +V GL +VSLE +R GCF+ G G +++ C+ + A
Sbjct: 56 SSGAA-FNVVPGLAGAPGSVSLELASRPGCFLVGG-----GEKVQVGCAGGAQQKRGDGA 109
Query: 806 GFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRDEAYTVYFNI 855
F R+ASF + YHP+SF A+G RR+FLL PL + RDE YTVYFN+
Sbjct: 110 WFRRSASFARGEPLRRYHPMSFAARGVRRSFLLEPLFTLRDEFYTVYFNL 159
>gi|94967351|ref|YP_589399.1| hypothetical protein Acid345_0320 [Candidatus Koribacter versatilis
Ellin345]
gi|94549401|gb|ABF39325.1| Protein of unknown function DUF1680 [Candidatus Koribacter
versatilis Ellin345]
Length = 607
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 124/533 (23%), Positives = 215/533 (40%), Gaps = 64/533 (12%)
Query: 122 NLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPI----------SELRGHFVGHYLS 171
N + L LD D L+ FR+ A LP PG+ GGW + + GH +G Y+S
Sbjct: 58 NHAFFLKLDEDRLLKVFRQKAGLPAPGEDMGGWYDLTGFDLAKGDFHGFVPGHTLGQYVS 117
Query: 172 ASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYT 231
A A+ +A+T + K K+ +V + L D P YT
Sbjct: 118 ALARCYAATGSEETKAKVHRLV---------------KGYGATLDDKASFFAGYRLPAYT 162
Query: 232 IHKILAGLLDQYVLADNAQAL----KMATWMVEYFYNRVQKVITMYSVERHWYSLN-EET 286
K+ GL+D + A + A+ K+ M++Y + + S +E+
Sbjct: 163 YDKLSCGLIDAHEFAHDPDAMAIHEKLTRGMLQYLPEKALSRAEQRARPHKDESFTWDES 222
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIVIGSQM 345
+ + L+ Y T + + L F + + L+ + L+ HA +H+ +
Sbjct: 223 YTLPENLFLAYRRTGNKFYRELGTRFLEDDTYFNPLSEGINVLAGEHAYSHMNAFCSAMQ 282
Query: 346 RYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW--WDPKRLADTLGSEN---EET 400
Y ++ +V A S+ATGG E + ++ +L D+L + E
Sbjct: 283 AYLTLDSERHRKAARNGFRMV-AEQSFATGGWGPSEAFVEFNKGQLGDSLEKSHSSFETP 341
Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
C Y K++R+L + + Y D ER + N VL + G Y
Sbjct: 342 CGAYAHFKLTRYLLQTDGDSTYGDSMERVMYNTVLGAKPIQPDGTSFYY--------SDY 393
Query: 461 STHGWGTKFNSFW-CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS--GH 517
+T G N W CC GT + + SIY + G+ + ++ S+ WK+ G
Sbjct: 394 ATVGKKVYHNDKWPCCSGTLPQVAADYHISIYLKA---TDGVCVNLFVPSTLIWKASDGS 450
Query: 518 VVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
L Q+ P + + + F++ Q V Q +L +R+P W S A +NGQ +
Sbjct: 451 CKLTQETKYPFET-----SVAMRFATTQPVEQ--TLYIRIPAWVTSEPA-LRVNGQRTDV 502
Query: 577 PP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
PG F + W D++ + LP+ + + ++ + A++ GP +L
Sbjct: 503 AAKPGAFAAIRRTWKDGDRIDLDLPMGFELQPVDG---QHEKLVALVHGPLVL 552
>gi|224072775|ref|XP_002303875.1| predicted protein [Populus trichocarpa]
gi|222841307|gb|EEE78854.1| predicted protein [Populus trichocarpa]
Length = 103
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/131 (44%), Positives = 75/131 (57%), Gaps = 30/131 (22%)
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
+R+P WT+ GA+ +N T Q+P S DDRP
Sbjct: 1 MRIPTWTHLEGAETVINDS-----------------------TWQIPAS-------DDRP 30
Query: 614 EYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTF 673
EYASIQAIL+GPYL AGHT+ +WDIK +A SLS +PIP ++N LVTF+Q+S N TF
Sbjct: 31 EYASIQAILYGPYLFAGHTTADWDIKNVSADSLSEWSTPIPAAYNDHLVTFSQKSRNPTF 90
Query: 674 VMSNSNQSITM 684
+ NSN IT+
Sbjct: 91 FLINSNHIITV 101
>gi|224072771|ref|XP_002303873.1| predicted protein [Populus trichocarpa]
gi|222841305|gb|EEE78852.1| predicted protein [Populus trichocarpa]
Length = 103
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/131 (41%), Positives = 72/131 (54%), Gaps = 30/131 (22%)
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRP 613
+R+P WT+ GA+ +N T Q+P S DDRP
Sbjct: 1 MRIPTWTHLEGAETVINDS-----------------------TWQIPAS-------DDRP 30
Query: 614 EYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTF 673
EYASIQAIL+GP L AGHT+ +WDIK +A SL +PIP ++N LVTF+Q+S N F
Sbjct: 31 EYASIQAILYGPSLFAGHTTADWDIKNVSADSLPEWSTPIPAAYNDHLVTFSQKSRNPNF 90
Query: 674 VMSNSNQSITM 684
+ NSN IT+
Sbjct: 91 FLINSNHIITV 101
>gi|255624614|ref|XP_002540501.1| hypothetical protein RCOM_2107350 [Ricinus communis]
gi|223495313|gb|EEF21882.1| hypothetical protein RCOM_2107350 [Ricinus communis]
Length = 208
Score = 95.9 bits (237), Expect = 8e-17, Method: Composition-based stats.
Identities = 65/207 (31%), Positives = 96/207 (46%), Gaps = 15/207 (7%)
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT-----------EL 215
GHYLSA A M A+T + ++E++ VV L CQ G GY+ P +L
Sbjct: 3 GHYLSALAMMVAATGDEQVRERLDYVVAELKRCQAANGNGYIGGVPGGAAAWRDIAQGKL 62
Query: 216 FDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSV 275
++ W P+Y +HK AGL D Y A N A M + ++ ++ + S
Sbjct: 63 HADNFSVNGKWVPWYNLHKTFAGLRDAYTYAGNQDAHAMLIALCDW----TLELTSHLSD 118
Query: 276 ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT 335
E+ + E GGMN+VL + +T K++ LA F L L D L+ HANT
Sbjct: 119 EQMQSMMRAEHGGMNEVLADVAQMTGQQKYMDLAIRFSHQALLRPLEEGKDQLTGLHANT 178
Query: 336 HIPIVIGSQMRYEVTGDPLYKLIGTFF 362
IP VIG + ++T ++ FF
Sbjct: 179 QIPKVIGFKRIGDITSRDDWQRAAAFF 205
>gi|284043399|ref|YP_003393739.1| hypothetical protein Cwoe_1938 [Conexibacter woesei DSM 14684]
gi|283947620|gb|ADB50364.1| protein of unknown function DUF1680 [Conexibacter woesei DSM 14684]
Length = 711
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 126/527 (23%), Positives = 222/527 (42%), Gaps = 84/527 (15%)
Query: 132 DSLVWSFRKTASLPTPGKAYGGWENPISELRGHF--VGHYLSASAQMWASTHNATIKEKM 189
D+L++ FR PG GW G F +G + + A+++A+T EK
Sbjct: 47 DALLYPFRIRKGSWAPGIPLRGWYG-----EGLFNNLGQFFTLYARLYAATGEHRFAEKA 101
Query: 190 STVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNA 249
++ E + G G+LS + + E Y+ K++ GLLD + +
Sbjct: 102 LALLDGWEETIEEDG-GFLS---SHFAGTVE---------YSYDKLVCGLLDLHEYVGSE 148
Query: 250 QAL----KMATWMVEYFYNRVQKVIT-MYSVERHWYSLNEETGGMNDVLYRLYSITHDPK 304
+AL +++ WM + + + M +E WY+L E L R Y++T DP
Sbjct: 149 RALPVLERVSRWMQRHGGSSKPYAWSGMGPLE--WYTLPE-------YLLRAYAVTSDPL 199
Query: 305 HLLLAHLFDKPCF--------LGFLALQAD-----YLSHFHANTHIPIVIGSQMRYEVTG 351
+ LA+ + F +G L +AD Y +H HANT + + YE TG
Sbjct: 200 YRELANAYRYDEFYDALLERDVGALMRRADEARNFYQAHSHANT----LNSAAAVYETTG 255
Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSEN---EETCTTYNMLK 408
DP Y + T +++ S ++ATG E + P++ + L SE E C ++ M++
Sbjct: 256 DPRYLDVLTAGYELLRESQTFATGMFGPLEAFMKPRQRVEVLHSEEGHAEVACPSWAMMR 315
Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG-VMIYMLPLGRGVSKARSTHGWGT 467
+ RHL T E + D+ E + NG+ S G Y G R+T WG
Sbjct: 316 LVRHLIELTGEAQFGDWMELNVYNGIGSAPPTRADGRATQYFADYG----LDRATKTWGV 371
Query: 468 KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF--DWKSGHVVLNQK-- 523
+++ CC T + ++ + IY+ L++ Y+ SS + + L Q+
Sbjct: 372 EWS---CCSTTSGINMAEYVNQIYY---AGPDALHVCLYLPSSVTCEIDGATLWLTQRTA 425
Query: 524 --VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN 581
VD V++D + L ++ R+P WT + + +L+G+ +
Sbjct: 426 YPVDERVAFDVRVERPLR----------GTIAFRVPAWT-AGEPRLTLDGEPVEHVVRDG 474
Query: 582 FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
+ + W D + + LP+ L ++ A A+ +GP +L
Sbjct: 475 WATVERTWEDGDAIELTLPMELAVLPVEPATD--AGPVALRYGPVVL 519
>gi|557474|gb|AAA50392.1| ORF1, partial [Bacteroides ovatus]
Length = 436
Score = 93.2 bits (230), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 100/217 (46%), Gaps = 22/217 (10%)
Query: 422 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
Y +YYERAL N +L+ Q + G +Y P+ G + + S WCC G+G+E
Sbjct: 4 YVNYYERALYNHILASQE-PDKGGFVYFTPMRPGHYRV-----YSQPETSMWCCVGSGLE 57
Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFS 541
+ +K G+ IY + LY+ +I S WK ++L Q+ LR+
Sbjct: 58 NHTKYGEFIYAYRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRINEAPK 114
Query: 542 SKQEVGQLSSLNLRMPVW-TYSNGAQASLNGQN--LPLPPPGNFLSATERWSYNDKLTIQ 598
K+ +L +R+P W S G S+NG+ +P +L + +W D +T
Sbjct: 115 KKR------TLMIRIPEWANQSKGYSVSINGKRKMFVMPKGNQYLPLSRKWEKGDVITFH 168
Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
LP+ + E I D + Y A L+GP +LA T E
Sbjct: 169 LPMKVSVEQIPDKKDYY----AFLYGPIVLAASTGTE 201
>gi|229818564|ref|YP_002880090.1| hypothetical protein Bcav_0062 [Beutenbergia cavernae DSM 12333]
gi|229564477|gb|ACQ78328.1| protein of unknown function DUF1680 [Beutenbergia cavernae DSM
12333]
Length = 596
Score = 89.7 bits (221), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 122/542 (22%), Positives = 205/542 (37%), Gaps = 94/542 (17%)
Query: 124 EYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNA 183
E L + D +V FR A LP PG GW + S+ G ++S A++ + A
Sbjct: 42 ETYLGMSPDDVVHGFRLQAGLPAPGNPMTGWSSRTSQ---PTFGQWVSGLARLGVTAGVA 98
Query: 184 TIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQY 243
++ +V AF + D +A + Y K++ GL D
Sbjct: 99 EASQRAVDLV---------------DAFAATVGDDGDARMGL----YGYEKLVCGLADTA 139
Query: 244 VLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDP 303
+ A + AL + E+ ++ R S N+ GG R+ +H
Sbjct: 140 LYAGHEDALALLGRTAEWASRTFERA-------RPAASPNDFAGG------RIGPASH-- 184
Query: 304 KHLLLAHLFDKPCFLGFLALQADYLSHF-----------------------------HAN 334
+ + F + + G+LA D + F HA
Sbjct: 185 ARTMEWYTFAENLYRGWLAGADDAVREFASEWHYDAYWDRFLTPPPPGQPWDVPTWLHAY 244
Query: 335 THIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK----RLA 390
+H+ + YEVTG+ Y I + + +YATGG E R
Sbjct: 245 SHVNTFASAAAAYEVTGEVRYLDILRNAHTYLTTTQTYATGGYGPSELTLPEDGSLGRSI 304
Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
+ E C ++ K+S L + T E YAD+ E+ + +G+ ++ G Y
Sbjct: 305 EWRTDTAEIVCGSWAAFKLSSALLKHTGEARYADWVEQLVYSGIGAVTPVRPGGRTPYYQ 364
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
L G++ + H ++ + CC GT +++ S L D +YF ++ GL + Y+ S+
Sbjct: 365 DLRLGIAT-KLPH-----WDDWPCCSGTYLQAVSHLPDLVYFGDDDG--GLAVALYVPST 416
Query: 511 FDWKSGH--VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
W+S V L Q+ + +S VG LR+ V +S G + S
Sbjct: 417 VSWESAGSTVTLTQRT----------AFPVEDTSTITVGGSGRFRLRLRVPPWSEGFRVS 466
Query: 569 LNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
+NG + + PG++ W+ D +T+ L LR + P A GP +
Sbjct: 467 VNGVAVDGVATPGDWFVLERDWADGDVVTVTLGAGLRVLPVDRWHPNRV---AFAHGPVV 523
Query: 628 LA 629
LA
Sbjct: 524 LA 525
>gi|361069271|gb|AEW08947.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
Length = 75
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/69 (55%), Positives = 53/69 (76%)
Query: 787 NFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRD 846
+++ G +++L C D FNRA+SF G ++YHPISF+A+GARR +LLAPLL++RD
Sbjct: 5 SYQVGQAVELRCKPLVTDLAFNRASSFTWNTGFAKYHPISFIARGARRAYLLAPLLTYRD 64
Query: 847 EAYTVYFNI 855
E+YTVYFNI
Sbjct: 65 ESYTVYFNI 73
>gi|383146477|gb|AFG54937.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146481|gb|AFG54941.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
Length = 75
Score = 87.4 bits (215), Expect = 3e-14, Method: Composition-based stats.
Identities = 38/69 (55%), Positives = 53/69 (76%)
Query: 787 NFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRD 846
+++ G +++L C D FNRA+SF G ++YHPISF+A+GARR +LLAPLL++RD
Sbjct: 5 SYQVGQAVELRCKPLVTDLAFNRASSFTWNTGFAKYHPISFIARGARRAYLLAPLLAYRD 64
Query: 847 EAYTVYFNI 855
E+YTVYFNI
Sbjct: 65 ESYTVYFNI 73
>gi|383146472|gb|AFG54932.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146473|gb|AFG54933.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146474|gb|AFG54934.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146475|gb|AFG54935.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146476|gb|AFG54936.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146478|gb|AFG54938.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146479|gb|AFG54939.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146480|gb|AFG54940.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146482|gb|AFG54942.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146483|gb|AFG54943.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146484|gb|AFG54944.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146485|gb|AFG54945.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146486|gb|AFG54946.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146487|gb|AFG54947.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146488|gb|AFG54948.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146489|gb|AFG54949.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
Length = 75
Score = 86.7 bits (213), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 37/69 (53%), Positives = 53/69 (76%)
Query: 787 NFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLLAPLLSFRD 846
+++ G +++L C D FNRA+SF G ++YHPISF+A+GARR +LLAPLL+++D
Sbjct: 5 SYQVGQAVELRCKPLVTDLAFNRASSFTWNTGFAKYHPISFIARGARRAYLLAPLLAYKD 64
Query: 847 EAYTVYFNI 855
E+YTVYFNI
Sbjct: 65 ESYTVYFNI 73
>gi|380482670|emb|CCF41095.1| secreted protein [Colletotrichum higginsianum]
Length = 246
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/289 (28%), Positives = 122/289 (42%), Gaps = 70/289 (24%)
Query: 406 MLKVSRHLFRWT--KEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGVSK 458
MLK++R L+ + AY D+YERAL N +L Q ++ G + Y PL RGV
Sbjct: 1 MLKLTRELWLTSPGTTTAYFDFYERALLNHLLGQQDPSDDHGHVTYFTPLNPGGRRGVGP 60
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
A W T ++SFWCC GTG+E+ +KL DSIYF + LY+ +I S +W V
Sbjct: 61 AWGGGTWSTDYDSFWCCQGTGLETNTKLTDSIYFYD---ASALYVNLFIPSVLEWTQRGV 117
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
+ Q + + R T G S+ +R+P W S GA
Sbjct: 118 TVTQTTE-------FPRGDTTTLKVAGAGTW-SMRVRIPSWA-SGGA------------- 155
Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDI 638
QLP+ L DD ++ A+ FGP +L+G+ E
Sbjct: 156 -------------------QLPMKLHVIPANDD----PNVAALAFGPVILSGNYGSE--- 189
Query: 639 KTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSITMEEF 687
+LS P+ N V T +SG + F + +++ + F
Sbjct: 190 ------TLSTT-----PALNLTTVRRTGDSGLA-FTATAGGKTVNLGPF 226
>gi|357472929|ref|XP_003606749.1| hypothetical protein MTR_4g065190 [Medicago truncatula]
gi|355507804|gb|AES88946.1| hypothetical protein MTR_4g065190 [Medicago truncatula]
Length = 111
Score = 82.8 bits (203), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 56/139 (40%), Positives = 67/139 (48%), Gaps = 33/139 (23%)
Query: 724 MLEPFDFPGMLV-QQGKEDELVVSES----PKEMGSSGFRLVAGLDKRNETVSLEAENRK 778
MLEPFD PGM V QG E L++ +S P + S G R+ G K N + K
Sbjct: 1 MLEPFDLPGMTVSHQGPEKPLIIVDSSHGGPSSVFSCGTRI--GWTKSNNIFRITKLLLK 58
Query: 779 GCFVSSGVNFEPGASLKLLCSTESLDAGFNRAASFMMEIGISEYHPISFVAKGARRNFLL 838
V F+ G+ +YHPISFVAKGA +NFLL
Sbjct: 59 LVLTKQLV--------------------------FVSGKGLRQYHPISFVAKGANQNFLL 92
Query: 839 APLLSFRDEAYTVYFNIQD 857
PL +FRDE YTVYFNIQD
Sbjct: 93 DPLFNFRDEHYTVYFNIQD 111
>gi|423223914|ref|ZP_17210383.1| hypothetical protein HMPREF1062_02569 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392637516|gb|EIY31383.1| hypothetical protein HMPREF1062_02569 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 664
Score = 82.4 bits (202), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 122/291 (41%), Gaps = 42/291 (14%)
Query: 318 LGFLALQADYLSH-FHANTHIPIVIGSQMRYEVTGDP--LYKLIGTFFMDIVNASHSYAT 374
LG LQ SH FH N +G Y +TGD L K+ G + D ++ Y T
Sbjct: 270 LGVDKLQPYVHSHTFHMN-----FMGFLRLYRITGDKTLLRKVSGAW--DDIHERQMYIT 322
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG S E + L ETC T + +++++ L T E YAD ER + N V
Sbjct: 323 GGVSVAEHY--EHDYVKPLSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHV 380
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSI 490
+ Q E GV Y T G+K + ++ CC +G S L I
Sbjct: 381 FAAQ-DCESGVCRY------------HTAPNGSKPDGYFHGPDCCTASGHRIISMLPTFI 427
Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
Y E E YI QY+ S + K + ++ M LT S E +
Sbjct: 428 YAEREKE---FYINQYMPSQYTGKDFAFEITG------NYPESENMQLTIVS--EKARNK 476
Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
+LNLR+P W + +NG+N+ PG +L +W+ DK++I P+
Sbjct: 477 TLNLRIPSW--CEHPEIKVNGENIADVKPGTYLKLPRKWTKGDKVSITFPM 525
>gi|224537087|ref|ZP_03677626.1| hypothetical protein BACCELL_01964 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521314|gb|EEF90419.1| hypothetical protein BACCELL_01964 [Bacteroides cellulosilyticus
DSM 14838]
Length = 664
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 123/291 (42%), Gaps = 42/291 (14%)
Query: 318 LGFLALQADYLSH-FHANTHIPIVIGSQMRYEVTGDP--LYKLIGTFFMDIVNASHSYAT 374
LG LQ SH FH N +G Y +TGD L K+ G + D ++ Y T
Sbjct: 270 LGVDKLQPYVHSHTFHMN-----FMGFLRLYRITGDKTLLRKVSGAW--DDIHERQMYIT 322
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG S E + L ETC T + +++++ L T E YAD ER + N V
Sbjct: 323 GGVSVAEHY--EHDYVKPLSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHV 380
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSI 490
+ Q E GV Y T G+K + ++ CC +G S L I
Sbjct: 381 FAAQ-DCESGVCRY------------HTAPNGSKPDGYFHGPDCCTASGHRIISMLPTFI 427
Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
Y E+ YI QYI S + K + ++ M LT S E +
Sbjct: 428 YAEKGKE---FYINQYIPSQYTGKDFAFEITG------NYPESENMQLTIVS--EKAKNK 476
Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
+LNLR+P W + +NG+N+ PG +L + +W+ DK++I P+
Sbjct: 477 TLNLRIPSWC--EHPEIKVNGENIADVKPGAYLKLSRKWTKGDKVSITFPM 525
>gi|237719720|ref|ZP_04550201.1| predicted protein [Bacteroides sp. 2_2_4]
gi|229450989|gb|EEO56780.1| predicted protein [Bacteroides sp. 2_2_4]
Length = 663
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 74/285 (25%), Positives = 122/285 (42%), Gaps = 30/285 (10%)
Query: 318 LGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDP-LYKLIGTFFMDIVNASHSYATGG 376
LG LQ + H++T +G Y +TGD L++ + + DI + Y TGG
Sbjct: 272 LGVDKLQP----YVHSHTFQMNFMGFLRLYRITGDKSLFRKVAGAWDDI-HKRQMYITGG 326
Query: 377 TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
S E + + ETC T + +++++ L T E YAD ER + N V +
Sbjct: 327 VSVAEHY--EHDYVKPISGHVVETCATMSWMQLTQMLLELTGESKYADAMERLMINHVFA 384
Query: 437 IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEG 496
Q + P G HG+ F+ CC +G S L +Y E+
Sbjct: 385 AQDCETGSCRYHTAPNG------SKPHGY---FHGPDCCTASGHRIISMLPTFMYAEKGK 435
Query: 497 NVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
Y+ QY+ S + K+ ++ + + M LT +S++ ++ LNLR+
Sbjct: 436 E---FYVNQYVPSQYAGKAFSFEISGNYPEVEN------MELTVTSERVADRV--LNLRI 484
Query: 557 PVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
P W Q S+NG+ + PG +L + +W DK+ I P+
Sbjct: 485 PSW--CEKPQVSVNGEKMAGVQPGTYLKISRKWVKGDKVCIVFPM 527
>gi|332881627|ref|ZP_08449275.1| hypothetical protein HMPREF9074_05065 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357045708|ref|ZP_09107342.1| hypothetical protein HMPREF9441_01351 [Paraprevotella clara YIT
11840]
gi|332680266|gb|EGJ53215.1| hypothetical protein HMPREF9074_05065 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355531373|gb|EHH00772.1| hypothetical protein HMPREF9441_01351 [Paraprevotella clara YIT
11840]
Length = 586
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 116/493 (23%), Positives = 195/493 (39%), Gaps = 87/493 (17%)
Query: 153 GWENPISELRGHFV-GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF 211
W+ +E G ++ YLSA + ++ + +K T++ + + Q + +GY+ A
Sbjct: 2 AWDWTKAEQHGKWIESAYLSAIQR-----NDKALLDKARTMLKRIVDSQEE--SGYVGAT 54
Query: 212 ---------PTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYF 262
P D++E Y+ H + + A A A K+A + ++YF
Sbjct: 55 SKNYRSDERPVRGMDAYEL-------YFVFHAFITVYEETGDKASLAAAEKLADYYLKYF 107
Query: 263 ---------------YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHL- 306
NR + + + H + E + D + RLY +T K+L
Sbjct: 108 GPGKLEFWPSDLRDPENRHKSIDALSQFAGHGVHYSWEGTLLCDPIARLYEVTGKKKYLD 167
Query: 307 ----LLAHL--------FDKPCFLGFLALQADYLS-HFHANTHIPIVIGSQMRYEVTGDP 353
++ ++ F + + L D L + H++T +G Y +TGD
Sbjct: 168 WSLWVVGNIDKWSGWDAFSRLDSVADGTLGVDKLQPYVHSHTFQMNFMGFLRLYRITGDK 227
Query: 354 -LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
L++ + + DI N Y TGG S E + + ETC T + +++++
Sbjct: 228 SLFRKVAGAWDDICN-RQMYITGGVSVAEHY--EHGYVKPVSGNVVETCATMSWMQLTQM 284
Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF 472
L T E YAD ER + N V + Q E G Y T GTK + +
Sbjct: 285 LLELTGESKYADAMERLMMNHVFAAQ-DCESGTCRY------------HTAPNGTKPHDY 331
Query: 473 W----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
+ CC +G S L Y E N YI QY+ S +D K ++
Sbjct: 332 FHGPDCCTASGHRIISLLPTFFYAE---NGKDFYINQYLPSRYDGKDFAFEISGNYPESE 388
Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATER 588
S M LT S + ++ LNLR+P W + S+NG+ + G +L+ T +
Sbjct: 389 S------MVLTVLSSKNKNKI--LNLRIPSWC--KAPEVSVNGERVSGIEAGKYLAITRK 438
Query: 589 WSYNDKLTIQLPL 601
W DK+ I P+
Sbjct: 439 WEKGDKIGITFPM 451
>gi|427384256|ref|ZP_18880761.1| hypothetical protein HMPREF9447_01794 [Bacteroides oleiciplenus YIT
12058]
gi|425727517|gb|EKU90376.1| hypothetical protein HMPREF9447_01794 [Bacteroides oleiciplenus YIT
12058]
Length = 662
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 121/291 (41%), Gaps = 42/291 (14%)
Query: 318 LGFLALQADYLSH-FHANTHIPIVIGSQMRYEVTGDP--LYKLIGTFFMDIVNASHSYAT 374
LG LQ SH FH N +G Y +TGD L K+ G + D ++ Y T
Sbjct: 270 LGVDKLQPYVHSHTFHMN-----FMGFLRLYRITGDKSLLRKVAGAW--DDIHERQMYIT 322
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
GG S E + L ETC T + +++++ L T E YAD ER + N V
Sbjct: 323 GGVSVAEHY--EHDYVKPLSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHV 380
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSI 490
+ Q E GV Y T G+K + ++ CC +G S L I
Sbjct: 381 FAAQ-DCENGVCRY------------HTAPNGSKPDGYFHGPDCCTASGHRIISMLPTFI 427
Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
Y E+ Y+ QY+ S ++ K + ++ M L S E +
Sbjct: 428 YAEKGKE---FYVNQYMPSQYNGKDFAFSITG------NYPESENMELVIES--EKAKNK 476
Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
++NLR+P W + S+NG+ + PG +L + +W DK+ I P+
Sbjct: 477 TINLRIPSWC--ENPKVSVNGEAVADIKPGTYLKLSRKWGKGDKINIIFPM 525
>gi|330998039|ref|ZP_08321870.1| hypothetical protein HMPREF9442_02974 [Paraprevotella xylaniphila
YIT 11841]
gi|329569340|gb|EGG51120.1| hypothetical protein HMPREF9442_02974 [Paraprevotella xylaniphila
YIT 11841]
Length = 661
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 72/276 (26%), Positives = 118/276 (42%), Gaps = 26/276 (9%)
Query: 330 HFHANTHIPIVIGSQMRYEVTGDP-LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
+ H++T +G Y +TGD L++ + + DI + Y TGG S E +
Sbjct: 278 YVHSHTFQMNFMGFLRLYRITGDKSLFRKVEGAWEDI-HKRQMYITGGVSVAEHY--EHG 334
Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
+ ETC T + +++++ L T E YAD ER + N V + Q +
Sbjct: 335 YVKPVSGNVVETCATMSWMQLTQMLLELTGESKYADAMERLMMNHVFAAQDCETGTCRYH 394
Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
P G A HG CC +G S L +Y E ++ QY+
Sbjct: 395 TAP--NGTKPASYFHGPD-------CCTASGHRIISMLPTFMYAERGKE---FFVNQYLP 442
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
S + K ++ ++ M LT S++ V ++ LNLR+P W + S
Sbjct: 443 SHYIGKDFAFQISG------NYPEAENMELTVLSEKAVDRV--LNLRIPSWC--KAPRVS 492
Query: 569 LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+NG+N+ PG +L + +WS DK++I P+ R
Sbjct: 493 VNGKNVIGVEPGTYLKISRKWSKGDKVSIVFPMEER 528
>gi|189467307|ref|ZP_03016092.1| hypothetical protein BACINT_03695 [Bacteroides intestinalis DSM
17393]
gi|189435571|gb|EDV04556.1| hypothetical protein BACINT_03695 [Bacteroides intestinalis DSM
17393]
Length = 611
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 119/525 (22%), Positives = 204/525 (38%), Gaps = 73/525 (13%)
Query: 128 MLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKE 187
+ DVD L FR +N + + F G ++ + + H+ +
Sbjct: 46 LQDVDHLTAPFRT--------------KNDTASWQTEFWGKWVQGAIASYRYNHSVALYA 91
Query: 188 KMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAGLLDQYVLAD 247
K+ V + Q GY+ + D+ +W YT GLL Y ++
Sbjct: 92 KIKKSVDDIISTQQP--DGYIGNYR---LDAQLKSWDIWGRKYTT----LGLLSWYEISG 142
Query: 248 NAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLL 307
QAL A ++++ +V + T ++Y + + + V+Y LY T D K+L
Sbjct: 143 EKQALNAACRVIDHLMTQVGEGGTNIVTTGNYYGM-ASSSILEPVMY-LYKYTGDYKYLQ 200
Query: 308 LAHLF-------DKPCFL----GFLALQADYLSHF---------HANTHIPIVIGSQMRY 347
A + P + + + A + F A + IG Y
Sbjct: 201 FAKYIVAQWETPEGPQLITKAINGVPVAARFPHPFDWFSPENGQKAYEMMSCYIGLLELY 260
Query: 348 EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
+VT + Y + DI N + A G SA E W+ ++ + ETC T+
Sbjct: 261 KVTHNAAYLDAVQKTVNDIANTEINVAGSG-SAFESWYSGRKYQTSPTYHTMETCVTFTW 319
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
+++ L T YAD E++L N +++ + + Y G + G
Sbjct: 320 IQLCDKLLALTGNPFYADQIEKSLYNALMAALKDDASQIAKYSPMEGH---RCEGEEQCG 376
Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--VVLNQKV 524
N CC G +F+ + D ++ GN +Y+ Y S ++GH V++ Q
Sbjct: 377 MHIN---CCNANGPRAFALIPD-FAVKKMGN--EVYVNYYGDMSASLENGHNKVLVKQHT 430
Query: 525 DPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLS 584
VS + +T+ + + G L+LR+PVW S +LNG+ L PG + +
Sbjct: 431 TYPVS--NVIDITIDVTKENVFG----LHLRVPVW--SAQTVITLNGEELKDICPGTYHA 482
Query: 585 ATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
T +W D + I L + R E +QAI+ GP +LA
Sbjct: 483 ITRKWKKGDHIQIILDMPARL-------LEQNQMQAIVRGPIVLA 520
>gi|374374779|ref|ZP_09632437.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
gi|373231619|gb|EHP51414.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
Length = 614
Score = 72.8 bits (177), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 115/522 (22%), Positives = 203/522 (38%), Gaps = 60/522 (11%)
Query: 163 GHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEAL 222
G VG YL A+A W T NA +K +M + L + Q + GYL + L DS+
Sbjct: 89 GEHVGKYLEAAANTWIITKNAALKTQMDRIFNELIKTQ--LPDGYLGTY---LPDSYWTS 143
Query: 223 KPVWAPYYTIHKI-LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYS 281
VW +HK L GLL Y + + +AL A + + + + + +
Sbjct: 144 WDVW-----VHKYDLVGLLAYYRVTGDRRALTAAVKVGDLLLKNIGDLPGQKDIIKTGSH 198
Query: 282 LNEETGGMNDVLYRLYSITHDPKHL----LLAHLFDKPCFLGFLAL-----QADYLSHFH 332
+ + D + LY T D ++L + +D P + Q D +++
Sbjct: 199 VGMAATSVIDPMTDLYQWTGDRRYLDFCKYIIKAYDHPAGPSIVTTLLKEKQVDKVANGK 258
Query: 333 ANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
A + ++G Y +TGD Y D + A + TG TS E + L
Sbjct: 259 AYEMLSNLVGIIKLYRLTGDEKYLQACRNAFDDIAAKRLFVTGTTSDHERFMPDNILQAD 318
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
+ E C T ++ + LF T ++ Y + E+++ N +L + E G + Y PL
Sbjct: 319 TAAHMGEGCVTTTWIQFNVQLFAITGDLKYYNEIEKSVYNHLLGAE-NPETGCVSYYTPL 377
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
G+ R + CC + + L + + + N P + + + + D
Sbjct: 378 -IGIKPYRC---------NITCCLSSVPRGIA-LIPYLNYGKLNNRPTVLLYE----AAD 422
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV--------GQLSSLNLRMPVWTYSNG 564
K V + P+ L++ TF + + +L LR+P W +NG
Sbjct: 423 IKDRVVTAGGRETPVA-----LQINTTFPKEGKATIKVALPSAARFALQLRVPAW--ANG 475
Query: 565 AQASLNGQNLPLPPPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
+A + G+ N L +R W+ + + I + + P Y +I+
Sbjct: 476 FKAVIAGKT--YTAQANELVVIDRNWARENIIAISFEIPVTVLQGGASYPNYIAIKR--- 530
Query: 624 GPYLLAGHTS--GEWDI-KTGTARSLSALISPIPPSFNAQLV 662
GP +L+ S +DI KT ++ ++ P AQ +
Sbjct: 531 GPQVLSADQSLNPSFDITKTAFRTPVAVQLTSTPAKLPAQWI 572
>gi|340619901|ref|YP_004738354.1| hypothetical protein zobellia_3937 [Zobellia galactanivorans]
gi|339734698|emb|CAZ98075.1| Conserved hypothetical periplasmic protein [Zobellia
galactanivorans]
Length = 629
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 100/463 (21%), Positives = 182/463 (39%), Gaps = 57/463 (12%)
Query: 160 ELRGHFVGH--YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
E+ G F+G + AS ++ A +H+ + E + +V + + Q K GY + E
Sbjct: 78 EVVGAFIGMGMLIDASVRLAAYSHDPKMMEIKNEIVDKVIDEQLK--NGYSGFYKPE--- 132
Query: 218 SFEALKPVW-----APYYTIHK---ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKV 269
+ +W + IH+ I+ GL Y L N ++LK A ++ ++
Sbjct: 133 -----RRLWNSQGGGDNWDIHEMAFIIDGLTSDYELFGNKRSLKAAIKTADFIMEHWHEM 187
Query: 270 ITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLA------HLFDKPCFLGFLAL 323
Y+ E + L+ G++ ++RLY T + + L + + +D +G
Sbjct: 188 PDDYAAEVDMHVLDT---GIDWAIFRLYKTTGEKRFLNFSEKTKSLYQWDTKIEIGRRPG 244
Query: 324 QADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSA-REF 382
+ ++ + A I + Y TG+ M A G++ RE
Sbjct: 245 VSGHMFAYFAMCMAQIEL-----YRYTGNKELLQQTENAMRFFLAEDGLTISGSAGQREI 299
Query: 383 WWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
W D + + LG ETC T +V L R T + Y D ER + NG+ Q +
Sbjct: 300 WTDDQDGENELG----ETCATAYQTRVYESLLRLTGKAEYGDLIERTVYNGLFGAQ-SPD 354
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
G + Y P H + + + CC G S+L +Y+ + + +
Sbjct: 355 GGKLRYYTPF------EGERHYYDVE---YMCCPGNFRRIISELPGMVYYRSKEDGVAVN 405
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
+ + + G V V S+ R+ L+ S + L+LR+P W +
Sbjct: 406 LYAQSEARVELNDGITV---DVQQKTSYPTSGRVELSVSPNK--ASTFPLSLRIPSW--A 458
Query: 563 NGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLR 604
A +NG+ PG F+ T +W+ D++ + P+ +R
Sbjct: 459 KEATIMVNGEKWQGEIKPGTFVDITRKWTSKDRVLLDFPMDIR 501
>gi|189467199|ref|ZP_03015984.1| hypothetical protein BACINT_03583 [Bacteroides intestinalis DSM
17393]
gi|189435463|gb|EDV04448.1| hypothetical protein BACINT_03583 [Bacteroides intestinalis DSM
17393]
Length = 175
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/93 (43%), Positives = 52/93 (55%), Gaps = 7/93 (7%)
Query: 132 DSLVWSFRKTASL-------PTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNAT 184
+ L+ SFR A + K GGWE+ ELRGH GH LSA A M+AST +
Sbjct: 75 NRLLHSFRDNAGVFAGREGGDMTVKKLGGWESLDCELRGHTTGHLLSAYALMYASTGSEI 134
Query: 185 IKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
K K ++V L+E Q +G GYLSA+P EL +
Sbjct: 135 FKLKGDSLVTGLAEVQAALGNGYLSAYPEELIN 167
>gi|365847237|ref|ZP_09387726.1| hypothetical protein HMPREF0880_01230 [Yokenella regensburgei ATCC
43003]
gi|364572491|gb|EHM50031.1| hypothetical protein HMPREF0880_01230 [Yokenella regensburgei ATCC
43003]
Length = 659
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 112/292 (38%), Gaps = 41/292 (14%)
Query: 364 DIVNASHSYATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
D + + Y TGG +S F D DT+ +E +C + ++ +R + +
Sbjct: 306 DNMASRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEAD 362
Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
YAD ERAL N VL + Y+ PL H KFN +
Sbjct: 363 SRYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHIKPV 413
Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
CC + LG +Y + LYI YI +S + L +
Sbjct: 414 RQRWFGCACCPPNIARVLTSLGHYLYTSRD---EALYINLYIGNSVEIPVAGHALRLHIS 470
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
W ++++T S V +L LR+P W + AQ LNG+ +PL P +L
Sbjct: 471 GDYPWQE--QVSITVESPDTVNH--TLALRIPDWCVN--AQVMLNGEEIPLLPHKGYLHI 524
Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
T W DKL + LP+ +R A AI GP Y L +GE
Sbjct: 525 TRDWQEGDKLLLTLPMPVRRVYANPLMRHAAGKIAIQRGPLVYCLEQADNGE 576
>gi|262381468|ref|ZP_06074606.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262296645|gb|EEY84575.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 623
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 74/315 (23%), Positives = 121/315 (38%), Gaps = 45/315 (14%)
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
Y+VT +PLY + M+ + G SA E W+ K L ETC T+
Sbjct: 269 YKVTKNPLYLSVVEKTMNHIINEEINVAGSGSAFECWYGGKALQTYPTYHTMETCVTFTW 328
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW- 465
+++ + T YAD E+A+ N +L+ + ++K GW
Sbjct: 329 MQICDRMLGLTGNSLYADQIEKAMYNALLASLKAD-----------ASQIAKYSPLEGWR 377
Query: 466 -------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP-GLYIIQYISSSFDWKSGH 517
G N CC G +F+ + Y + LY + D K
Sbjct: 378 HEGEEQCGMHIN---CCNANGPRAFAMIPQFAYQVNGRRIDVNLYAASSVEVELD-KKTR 433
Query: 518 VVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
V + Q+ D PI D +R+ + + ++ LR+P W S S+NG+ L
Sbjct: 434 VSMTQETDYPI---DGQVRIVVEPEKTSDF----TIALRIPAW--SERTVVSVNGEPLTD 484
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
G +L W D++T++L + R + + QAI+ GP +LA +
Sbjct: 485 LLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARDSR--- 534
Query: 637 DIKTGTARSLSALIS 651
K G S ++S
Sbjct: 535 -FKDGDVDEASVIVS 548
>gi|436834929|ref|YP_007320145.1| hypothetical protein FAES_1542 [Fibrella aestuarina BUZ 2]
gi|384066342|emb|CCG99552.1| hypothetical protein FAES_1542 [Fibrella aestuarina BUZ 2]
Length = 636
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 131/310 (42%), Gaps = 31/310 (10%)
Query: 347 YEVTGDPLYKL-IGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYN 405
Y +TG P YK + + +I + + A G+S E W+ K L + +ETC T
Sbjct: 282 YRLTGKPAYKAAVEKTWQNIRDTEINLAGSGSSV-ECWFGGKALQTLSINHYQETCVTAT 340
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
+K+S+ L R T + YAD E+ N +L + Y PL + G
Sbjct: 341 WIKLSQQLLRLTGDARYADAIEQTYYNALLGSMKADGSDWTKYT-PLSGQRLEGGEQCGM 399
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ--YISSSFDWKSGHVVLNQK 523
G CC +G L ++ V + + Y++++ +S V L Q+
Sbjct: 400 GLN-----CCVASGPRGLFTLPQTVVMSRADGVQVNFYAEGTYLANTPGGQS--VSLRQQ 452
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
D VS L ++L + ++ +R+P W+ + ++NGQ +P G ++
Sbjct: 453 TDYPVSGQSTLHLSLPKTES------FTVRVRIPAWSVQ--STVTVNGQAVPTVVAGEYV 504
Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTA 643
+ W D+L++ L + R + D P++ AI+ GP +L D + G
Sbjct: 505 AIKRTWQTGDQLSLTLDMRGRVVRL-GDMPQHL---AIVRGPVVLTR------DARLG-G 553
Query: 644 RSLSALISPI 653
S+ ISP+
Sbjct: 554 PSVDETISPV 563
>gi|301309993|ref|ZP_07215932.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|423340426|ref|ZP_17318165.1| hypothetical protein HMPREF1059_04090 [Parabacteroides distasonis
CL09T03C24]
gi|300831567|gb|EFK62198.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|409227861|gb|EKN20757.1| hypothetical protein HMPREF1059_04090 [Parabacteroides distasonis
CL09T03C24]
Length = 623
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 74/315 (23%), Positives = 121/315 (38%), Gaps = 45/315 (14%)
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
Y+VT +PLY + M+ + G SA E W+ K L ETC T+
Sbjct: 269 YKVTKNPLYLSVVEKTMNHIINEEINVAGSGSAFECWYGGKALQTYPTYHTMETCVTFTW 328
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW- 465
+++ + T YAD E+A+ N +L+ + ++K GW
Sbjct: 329 MQICDRMLGLTGNSLYADQIEKAMYNALLASLKAD-----------ASQIAKYSPLEGWR 377
Query: 466 -------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP-GLYIIQYISSSFDWKSGH 517
G N CC G +F+ + Y + LY + D K
Sbjct: 378 HEGEEQCGMHIN---CCNANGPRAFAMIPRFAYQVNGRRIDVNLYAASSVEVELD-KKTR 433
Query: 518 VVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
V + Q+ D PI D +R+ + + ++ LR+P W S S+NG+ L
Sbjct: 434 VSMTQETDYPI---DGQVRIVVEPEKTSDF----TIALRIPAW--SERTVVSVNGEPLTD 484
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
G +L W D++T++L + R + + QAI+ GP +LA +
Sbjct: 485 LLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARDSR--- 534
Query: 637 DIKTGTARSLSALIS 651
K G S ++S
Sbjct: 535 -FKDGDVDEASVIVS 548
>gi|423345501|ref|ZP_17323190.1| hypothetical protein HMPREF1060_00862 [Parabacteroides merdae
CL03T12C32]
gi|409223287|gb|EKN16224.1| hypothetical protein HMPREF1060_00862 [Parabacteroides merdae
CL03T12C32]
Length = 625
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 115/296 (38%), Gaps = 47/296 (15%)
Query: 347 YEVTGDPLY-----KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
Y+VTG+PLY K +G + +N + G SA E W+ K ETC
Sbjct: 269 YKVTGNPLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETC 323
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
T+ +++ L + T YADY E A+ N +++ + + Y PL
Sbjct: 324 VTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKYS-PL--------- 373
Query: 462 THGW--------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
GW G N CC G +F+ + Y ++ V + +
Sbjct: 374 -EGWRHEGEEQCGMHIN---CCNANGPRAFAMIPQFAYQVQDDCVRVNFYAPSEAELVLP 429
Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
V L Q D + ++ + +E ++ LR+P W S A S+NGQ
Sbjct: 430 DKKPVRLKQTTD----YPRTDQIEIEVDPAKETA--FTIALRIPAW--SKIAVVSVNGQP 481
Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
G +L +W D++T++L L R E QAI+ GP +LA
Sbjct: 482 QDGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQAIVRGPIVLA 530
>gi|154495303|ref|ZP_02034308.1| hypothetical protein PARMER_04360 [Parabacteroides merdae ATCC
43184]
gi|423722505|ref|ZP_17696681.1| hypothetical protein HMPREF1078_00744 [Parabacteroides merdae
CL09T00C40]
gi|154085227|gb|EDN84272.1| hypothetical protein PARMER_04360 [Parabacteroides merdae ATCC
43184]
gi|409242350|gb|EKN35113.1| hypothetical protein HMPREF1078_00744 [Parabacteroides merdae
CL09T00C40]
Length = 625
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 115/296 (38%), Gaps = 47/296 (15%)
Query: 347 YEVTGDPLY-----KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
Y+VTG+PLY K +G + +N + G SA E W+ K ETC
Sbjct: 269 YKVTGNPLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETC 323
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
T+ +++ L + T YADY E A+ N +++ + + Y PL
Sbjct: 324 VTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKYS-PL--------- 373
Query: 462 THGW--------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
GW G N CC G +F+ + Y ++ V + +
Sbjct: 374 -EGWRHEGEEQCGMHIN---CCNANGPRAFAMIPQFAYQVQDDCVRVNFYAPSEAELVLP 429
Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
V L Q D + ++ + +E ++ LR+P W S A S+NGQ
Sbjct: 430 GKKPVRLKQTTD----YPRTDQIEIEVDPAKETA--FTIALRIPAW--SKIAVVSVNGQP 481
Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
G +L +W D++T++L L R E QAI+ GP +LA
Sbjct: 482 QDGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQAIVRGPIVLA 530
>gi|423343638|ref|ZP_17321351.1| hypothetical protein HMPREF1077_02781 [Parabacteroides johnsonii
CL02T12C29]
gi|409214660|gb|EKN07669.1| hypothetical protein HMPREF1077_02781 [Parabacteroides johnsonii
CL02T12C29]
Length = 625
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 115/302 (38%), Gaps = 59/302 (19%)
Query: 347 YEVTGDPLY-----KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
Y+VTG+PLY K +G + +N + G SA E W+ K ETC
Sbjct: 269 YKVTGNPLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETC 323
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
T+ +++ L + T YADY E A+ N +++ + + Y PL
Sbjct: 324 VTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKYS-PL--------- 373
Query: 462 THGW--------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
GW G N CC G +F+ + Y ++ V + +
Sbjct: 374 -EGWRHEGEEQCGMHIN---CCNANGPRAFAMIPGFAYQVQDDCVRVNFYAPSEAELVLP 429
Query: 514 KSGHVVLNQ-----KVDPI-VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
V L Q + D I + DP T T + LR+P W S A
Sbjct: 430 GKKSVWLRQTTEYPRTDQIEIEVDPTKETTFTIA------------LRIPAW--SKIATV 475
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
S+NG+ G +L +W D++T++L L R E QAI+ GP +
Sbjct: 476 SVNGRPEAGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQAIVRGPLV 528
Query: 628 LA 629
LA
Sbjct: 529 LA 530
>gi|218261883|ref|ZP_03476568.1| hypothetical protein PRABACTJOHN_02239 [Parabacteroides johnsonii
DSM 18315]
gi|218223731|gb|EEC96381.1| hypothetical protein PRABACTJOHN_02239 [Parabacteroides johnsonii
DSM 18315]
Length = 625
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 115/302 (38%), Gaps = 59/302 (19%)
Query: 347 YEVTGDPLY-----KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETC 401
Y+VTG+PLY K +G + +N + G SA E W+ K ETC
Sbjct: 269 YKVTGNPLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETC 323
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
T+ +++ L + T YADY E A+ N +++ + + Y PL
Sbjct: 324 VTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKYS-PL--------- 373
Query: 462 THGW--------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
GW G N CC G +F+ + Y ++ V + +
Sbjct: 374 -EGWRHEGEEQCGMHIN---CCNANGPRAFAMIPGFAYQVQDDCVRVNFYAPSEAELVLP 429
Query: 514 KSGHVVLNQ-----KVDPI-VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
V L Q + D I + DP T T + LR+P W S A
Sbjct: 430 GKKSVWLRQTTEYPRTDQIEIEVDPTKETTFTIA------------LRIPAW--SKIATV 475
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
S+NG+ G +L +W D++T++L L R E QAI+ GP +
Sbjct: 476 SVNGRPEAGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQAIVRGPLV 528
Query: 628 LA 629
LA
Sbjct: 529 LA 530
>gi|256840863|ref|ZP_05546371.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|256738135|gb|EEU51461.1| conserved hypothetical protein [Parabacteroides sp. D13]
Length = 625
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 73/315 (23%), Positives = 121/315 (38%), Gaps = 45/315 (14%)
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
Y+VT +PLY + M+ + G SA E W+ K L ETC T+
Sbjct: 271 YKVTKNPLYLSVVEKTMNHIINEEINVAGSGSAFECWYGGKALQTYPTYHTMETCVTFTW 330
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW- 465
+++ + T YAD E+A+ N +L+ + ++K GW
Sbjct: 331 MQICDRMLGLTGNSLYADQIEKAMYNALLASLKAD-----------ASQIAKYSPLEGWR 379
Query: 466 -------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP-GLYIIQYISSSFDWKSGH 517
G N CC G +F+ + Y + LY + D K
Sbjct: 380 HEGEEQCGMHIN---CCNANGPRAFAMIPQFAYQINGRRIDVNLYAASSVEVELD-KKTR 435
Query: 518 VVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
V + Q+ + PI D +R+ + + ++ LR+P W S S+NG+ L
Sbjct: 436 VSMTQETNYPI---DGQVRIVVEPEKTSDF----TIALRIPAW--SERTVVSVNGEPLTD 486
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
G +L W D++T++L + R + + QAI+ GP +LA +
Sbjct: 487 LLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARDSR--- 536
Query: 637 DIKTGTARSLSALIS 651
K G S ++S
Sbjct: 537 -FKDGDVDEASVIVS 550
>gi|150007964|ref|YP_001302707.1| hypothetical protein BDI_1325 [Parabacteroides distasonis ATCC
8503]
gi|149936388|gb|ABR43085.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 623
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 73/315 (23%), Positives = 121/315 (38%), Gaps = 45/315 (14%)
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
Y+VT +PLY + M+ + G SA E W+ K L ETC T+
Sbjct: 269 YKVTKNPLYLSVVEKTMNHIINEEINVAGSGSAFECWYGGKALQTYPTYHTMETCVTFTW 328
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW- 465
+++ + T YAD E+A+ N +L+ + ++K GW
Sbjct: 329 MQICDRMLGLTGNSLYADQIEKAMYNALLASLKAD-----------ASQIAKYSPLEGWR 377
Query: 466 -------GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP-GLYIIQYISSSFDWKSGH 517
G N CC G +F+ + Y + LY + D K
Sbjct: 378 HEGEEQCGMHIN---CCNANGPRAFAMIPQFAYQINGRRIDVNLYAASSVEVELD-KKTR 433
Query: 518 VVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
V + Q+ + PI D +R+ + + ++ LR+P W S S+NG+ L
Sbjct: 434 VSMTQETNYPI---DGQVRIVVEPEKTSDF----TIALRIPAW--SERTVVSVNGEPLTD 484
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
G +L W D++T++L + R + + QAI+ GP +LA +
Sbjct: 485 LLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARDSR--- 534
Query: 637 DIKTGTARSLSALIS 651
K G S ++S
Sbjct: 535 -FKDGDVDEASVIVS 548
>gi|429738051|ref|ZP_19271876.1| hypothetical protein HMPREF9151_00303 [Prevotella saccharolytica
F0055]
gi|429161156|gb|EKY03584.1| hypothetical protein HMPREF9151_00303 [Prevotella saccharolytica
F0055]
Length = 603
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 68/283 (24%), Positives = 115/283 (40%), Gaps = 20/283 (7%)
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
Y +TG+ YK + + TG SA E W+ K++ +ETC T
Sbjct: 247 YRLTGNESYKAAVEKTWQSIMDTEINITGSGSAMESWFGGKQVQYMPIKHYQETCVTATW 306
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
+K+SR L T YAD E++L N +L R Y G+ + + G
Sbjct: 307 IKLSRQLLMLTGNSKYADAIEQSLYNALLGAMRPDGSDWAKYTPLSGQRLPGSEQC---G 363
Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFE-EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
N CC +G + + + EG V LYI + ++ Q
Sbjct: 364 MGLN---CCTASGPRGLFVIPQTAVMQSSEGAVVNLYIPGTYTLQSPKNKTVTLVQQGEY 420
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
P M + F ++Q + +L+LR+P W S + ++NGQ + G++L
Sbjct: 421 PKTG-----NMRIVFQAQQP--EEMTLSLRIPAW--SKTTRVAVNGQEVSAVRSGSYLQI 471
Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
+WS D++ + + + + + + P+Y AI GP +L
Sbjct: 472 NRQWSAGDRVELTMDMQAQLHFMGTN-PQYL---AITRGPVVL 510
>gi|255034442|ref|YP_003085063.1| hypothetical protein Dfer_0635 [Dyadobacter fermentans DSM 18053]
gi|254947198|gb|ACT91898.1| protein of unknown function DUF1680 [Dyadobacter fermentans DSM
18053]
Length = 656
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 84/370 (22%), Positives = 151/370 (40%), Gaps = 55/370 (14%)
Query: 271 TMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF--LALQADYL 328
T R W S ++E + L +LY +TH+ ++L LA F + G+ + ++
Sbjct: 191 TFRVANRPWVSGHQE---IELALMKLYHLTHEDRYLKLADWFLEQRGRGYGKGKIWDEWK 247
Query: 329 SHFHANTHIPI-----VIGSQMR-----------YEVTGDPLYKLIGTFFMDIVNASHSY 372
+ +P+ + G +R VTGDP Y T + V + Y
Sbjct: 248 DPKYCQDDVPVKQQKEITGHAVRAMYQYTGAADVASVTGDPGYMNAMTAVWEDVVYRNMY 307
Query: 373 ATGGTSA---REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERA 429
TGG + E + D L + G+ ETC + M+ ++ + T + Y D ER+
Sbjct: 308 LTGGIGSSGHNEGFTDDYDLPN--GAAYSETCASVGMVFWNQRMNALTGDAKYIDVLERS 365
Query: 430 LTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDS 489
L NG L T Y PL + ARS +GT CC + +GD
Sbjct: 366 LYNGALDGLSLT-GDRFFYGNPLSSIGNNARSAW-FGTA-----CCPSNIARLVASVGDY 418
Query: 490 IYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL 549
IY + +G + ++ ++ S+ ++ G + ++ W+ +R+ +T K +
Sbjct: 419 IYGKADGKI---WVNLFVGSNTTFQVGKTAVPLQMSTDYPWNGSIRIKVTPPQKVKY--- 472
Query: 550 SSLNLRMPVW--------------TYSNG-AQASLNGQNLPLPPPGNFLSATERWSYNDK 594
+LN+R+P W NG + LNG+++ + W D+
Sbjct: 473 -ALNVRIPGWAAGTPVPGGLYNFAAAGNGRVEVLLNGKSVNYQSDKGYAVIDRTWQNGDE 531
Query: 595 LTIQLPLSLR 604
+ ++LP+ +R
Sbjct: 532 IEVRLPMDVR 541
>gi|423122678|ref|ZP_17110362.1| hypothetical protein HMPREF9690_04684 [Klebsiella oxytoca 10-5246]
gi|376391959|gb|EHT04626.1| hypothetical protein HMPREF9690_04684 [Klebsiella oxytoca 10-5246]
Length = 653
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 81/365 (22%), Positives = 132/365 (36%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYL 328
L RLY IT +P++L L + F +P F ++ + Y
Sbjct: 192 ALMRLYDITQEPRYLALVNYFVEERGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYS 251
Query: 329 SHFHANTHIPIVIGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
+ P+ IG +R Y +TG D + + + Y TGG
Sbjct: 252 QAHQPISEQPVAIGHAVRFVYLMTGVAHLARLSQDEGKRQDCLRLWNNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPTSLKFNHIYDHVKPVRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY + LYI Y+ +S + G L ++ W +++ +
Sbjct: 420 ARVLTSLGHYIYTPHQD---ALYINLYVGNSAEIPVGDETLRLRISGNYPWQEQVKIAVD 476
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
+ +L LR+P W + Q +LNG+ + +L + RW D L + L
Sbjct: 477 SPTPIN----HTLALRLPDWC--DNPQVTLNGKPVAQDVRKGYLHISHRWQEGDTLLLTL 530
Query: 600 PLSLR 604
P+ +R
Sbjct: 531 PMPVR 535
>gi|409730702|ref|ZP_11272263.1| hypothetical protein Hham1_15864 [Halococcus hamelinensis 100A6]
gi|448723717|ref|ZP_21706233.1| hypothetical protein C447_11225 [Halococcus hamelinensis 100A6]
gi|445787256|gb|EMA38004.1| hypothetical protein C447_11225 [Halococcus hamelinensis 100A6]
Length = 639
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/227 (24%), Positives = 95/227 (41%), Gaps = 17/227 (7%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + ++ + T + YAD ER L NG L+ G E Y PL
Sbjct: 335 ETCAAIGSVFWNQRMLERTGDAKYADLIERTLYNGFLA-GVGLEGKEFFYENPLESSGDH 393
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
R GW T CC F+ LG +Y ++ + L++ QY+ S + G
Sbjct: 394 HRK--GWFTCA----CCPPNAARLFASLGGYLYGDDGDD---LFVHQYVGSRVSTEVGGT 444
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
++ V+ + W + + +T S G+ +L LR+P W S G +NG+++
Sbjct: 445 AVDLDVETDLPWSGDVSLDVTASE----GESFALRLRVPAW--SEGTTVEVNGESVDAAV 498
Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+L+ W+ +D + + +++T A + A+ GP
Sbjct: 499 EDGYLALDREWT-DDTVELTFEQTVQTVRAHPAVEADAGLVAVERGP 544
>gi|432865910|ref|ZP_20088760.1| hypothetical protein A311_04524 [Escherichia coli KTE146]
gi|431401839|gb|ELG85171.1| hypothetical protein A311_04524 [Escherichia coli KTE146]
Length = 654
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 94/404 (23%), Positives = 145/404 (35%), Gaps = 87/404 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q +LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQVTLNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
L + LP+ +R A AI GP Y L +GE
Sbjct: 525 TLNLTLPMPVRRVYGNPLMRHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|386626404|ref|YP_006146132.1| hypothetical protein CE10_4140 [Escherichia coli O7:K1 str. CE10]
gi|349740140|gb|AEQ14846.1| hypothetical protein CE10_4140 [Escherichia coli O7:K1 str. CE10]
Length = 573
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 85/370 (22%), Positives = 135/370 (36%), Gaps = 85/370 (22%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 193 LMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYSQ 252
Query: 335 THIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
H+PI IG +R+ +Y + G + ++ S Y
Sbjct: 253 AHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQLY 306
Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG +S F D DT+ +E +C + ++ +R + + YAD ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMER 363
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
AL N VL + Y+ PL H KFN + C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCAC 414
Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
C + +G +Y E LYI Y +S + + L +V W
Sbjct: 415 CPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE-- 469
Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
++T+ S Q V +L LR+P W Q +LNG+ + +L T W D
Sbjct: 470 QVTIAVESPQPVRH--TLALRLPDWC--TQPQITLNGEEVEQDIRKGYLHITREWQEGDT 525
Query: 595 LTIQLPLSLR 604
L + LP+ +R
Sbjct: 526 LNLTLPMPVR 535
>gi|323344406|ref|ZP_08084631.1| hypothetical protein HMPREF0663_11167 [Prevotella oralis ATCC
33269]
gi|323094533|gb|EFZ37109.1| hypothetical protein HMPREF0663_11167 [Prevotella oralis ATCC
33269]
Length = 627
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 65/259 (25%), Positives = 110/259 (42%), Gaps = 26/259 (10%)
Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
TG SA E W+ K++ +ETC T +K+SR L T YAD E++L N
Sbjct: 300 TGSGSAMESWFGGKQVQYMPIKHYQETCVTATWIKLSRQLLMLTGNSKYADAIEQSLYNA 359
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
+L + Y PL + G G CC +G + + +
Sbjct: 360 LLGAMKSDGSDWAKYT-PLSGQRLQGSEQCGMGLN-----CCTASGPRGLFIIPQTAVMQ 413
Query: 494 EEGNVPGLYIIQYISSSFDWKS---GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
++ G I YI ++ +S +++ Q+ D + + + F KQ +
Sbjct: 414 ---SIKGAVINLYIPGTYTLQSPKGQEIIITQQGD----YPQTGTVRIAFKVKQT--EEF 464
Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA-IQ 609
+L+LR+P W S + +LNG ++ G++L +WS D ++L L +R +
Sbjct: 465 TLSLRIPEW--SKDTKVTLNGNDVVPAHNGSYLQINRKWSDGDH--VELVLDMRAQLHFM 520
Query: 610 DDRPEYASIQAILFGPYLL 628
+ P+Y AI GP +L
Sbjct: 521 GENPQYL---AITRGPVVL 536
>gi|284122982|ref|ZP_06386886.1| protein of unknown function DUF1680 [Candidatus Poribacteria sp.
WGA-A3]
gi|283829311|gb|EFC33713.1| protein of unknown function DUF1680 [Candidatus Poribacteria sp.
WGA-A3]
Length = 577
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 115/485 (23%), Positives = 192/485 (39%), Gaps = 92/485 (18%)
Query: 171 SASAQMWASTH-NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV--WA 227
+AS +W TH N T + ++ V+ ++ CQ GYL+++ F ++P W
Sbjct: 21 AASYTLW--THPNPTWEPELDEVIAKIAACQQP--DGYLNSY-------FTLVEPTKRWQ 69
Query: 228 PYYTIHKI-LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
+H++ AG L + +A + QA T +++ + + ++ E
Sbjct: 70 NLGMMHELYCAGHLFEAAVA-HYQATGKQT-LLDVACRFADLIDNTFGFDKRDGLPGHE- 126
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLF------------------DKPCFLG----FLALQ 324
G+ L +L +T +P+++ LA F D P LG
Sbjct: 127 -GIELALVKLARVTGEPRYMALAEYFVTRRGHSPSIFEKELENPDLPGGLGAYQHHFTRD 185
Query: 325 ADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG---TSARE 381
Y H+ A H+PI Q + E G + ++ A +Y TG T+A E
Sbjct: 186 GKYEGHY-AQAHLPI----QEQTECVG----HAVRAMYLYSGAADIAYETGDSAITNALE 236
Query: 382 FWWD--PKRLADTLG----SENE---------------ETCTTYNMLKVSRHLFRWTKEI 420
W KRL T G NE ETC + ++ + +F E
Sbjct: 237 ALWQNVGKRLYITGGVGPSGHNEGFTTDYELPNFSAYAETCASIGLIFWAHRMFLLRAES 296
Query: 421 AYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGI 480
+ D E AL NG LS G Y PL + R H W F CC
Sbjct: 297 RFVDVLETALYNGALSGISLDGTG-FFYQNPLASHGDRHR--HEW---FGCA-CCPPNIA 349
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD-WKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ +G IY E E G+Y+ Y+S + D +G+V + + W + +T+T
Sbjct: 350 RLLASVGQYIYAESE---EGIYVNLYVSITADAIAAGNVPVRLTQETDYPWAGDVTLTIT 406
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ-NLPLPPPGNFLSATERWSYNDKLTIQ 598
++ +LNLR+P W + + +NG+ + P +L+ T W D++ +Q
Sbjct: 407 PTTPVPF----TLNLRIPGW--CDQCEVRVNGEADNSQPNATGYLTITREWRAGDRVQLQ 460
Query: 599 LPLSL 603
LP+ +
Sbjct: 461 LPMPV 465
>gi|387609318|ref|YP_006098174.1| hypothetical protein EC042_3892 [Escherichia coli 042]
gi|419917404|ref|ZP_14435664.1| hypothetical protein ECKD2_05705 [Escherichia coli KD2]
gi|284923618|emb|CBG36715.1| conserved hypothetical protein [Escherichia coli 042]
gi|388394341|gb|EIL55642.1| hypothetical protein ECKD2_05705 [Escherichia coli KD2]
Length = 656
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 95/404 (23%), Positives = 145/404 (35%), Gaps = 87/404 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L LA+ F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALANYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVGQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
L + LP+ +R A AI GP Y L +GE
Sbjct: 525 TLNLTLPMPVRRVYGNPQVRHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|301020201|ref|ZP_07184325.1| conserved hypothetical protein [Escherichia coli MS 69-1]
gi|300398864|gb|EFJ82402.1| conserved hypothetical protein [Escherichia coli MS 69-1]
Length = 664
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 95/404 (23%), Positives = 145/404 (35%), Gaps = 87/404 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L LA+ F +P + + SH+H +
Sbjct: 200 ALMRLYEVTEEPRYLALANYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 260 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 313
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 370
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPIRQRWFGCA 421
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 422 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 477
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 478 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVGQDIRKGYLHITREWQEGD 532
Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
L + LP+ +R A AI GP Y L +GE
Sbjct: 533 TLNLTLPMPVRRVYGNPQVRHVAGKVAIQRGPLVYCLEQADNGE 576
>gi|298248099|ref|ZP_06971904.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
gi|297550758|gb|EFH84624.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
Length = 638
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 132/320 (41%), Gaps = 34/320 (10%)
Query: 332 HANTHIPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
HA + + G+ Y TG+ L I + D+ Y TGG +R +D + +
Sbjct: 260 HAVRALYLYAGATDAYTETGEQALLHAINALWADL-QQHKVYVTGGVGSR---YDGEAVG 315
Query: 391 DTLGSENE----ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGV 445
++ N+ ETC + + L T YAD E L NG+L+ I E
Sbjct: 316 ESYELPNDQAYTETCAAIAHIMWAWRLLLLTGNALYADAMELTLYNGMLAGISLDGE--S 373
Query: 446 MIYMLPLG-RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
Y PL RG + R +GT CC + L IY + + L++
Sbjct: 374 YFYQNPLADRG--RHRRQPWFGTA-----CCPPNVARLLASLPGYIYTTSDAD---LWVH 423
Query: 505 QYISSSFDWKSGH-VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
Y SS + + VL K W+ +++++ ++ + LNLR+P W ++
Sbjct: 424 LYTSSEANVRLPQGSVLKCKQTSNYPWEGKIKLSI---EPKQANAIFGLNLRIPAW--AH 478
Query: 564 GAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
GA S+NG+ LP P PG++ W D++ + LPL +R A+L
Sbjct: 479 GATVSVNGETLPPPIQPGSYYRIERTWQPGDQVELVLPLLMRAVTSHPYISNNNGRVALL 538
Query: 623 FGPYLL----AGHTSGEWDI 638
GP + + H + WD+
Sbjct: 539 RGPLVYCVEQSDHEADVWDL 558
>gi|337749269|ref|YP_004643431.1| hypothetical protein KNP414_05037 [Paenibacillus mucilaginosus
KNP414]
gi|336300458|gb|AEI43561.1| protein of unknown function DUF1680 [Paenibacillus mucilaginosus
KNP414]
Length = 660
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 140/386 (36%), Gaps = 69/386 (17%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
L +LY T + ++L LA F +P FL Q D SH+ A +PI QM
Sbjct: 195 ALVKLYGATGEERYLKLAQFFIDERGTEPNFLVEECRQRDGYSHW-AKKKLPIPTAEQMA 253
Query: 347 YEVTGDPLYKL-------IGTFFMDIVNASHSYATGGT----SAREFWWDPKR----LAD 391
Y P+ + + +M A + TG + R W + + +
Sbjct: 254 YNQAHKPVRQQDTAVGHSVRAVYMYTAMADLARLTGDAELLEACRRLWDNTTKKQMYITG 313
Query: 392 TLGSENE-----------------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
+GS + ETC + ++ +R + + + YAD ERAL N V
Sbjct: 314 GIGSTHHGEAFSFDYDLPNDTVYAETCASIGLIFFARRMLQLEAKSEYADVLERALYNNV 373
Query: 435 LS--IQRGTEPGVMIYMLPL-----------GRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
+ Q G Y+ PL GR KA +G CC
Sbjct: 374 IGSMSQDGKH---YFYVNPLEVWPKASEQNPGRHHVKAVRQPWFGCS-----CCPPNVAR 425
Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
S L D IY G+ +Y +I S SF +G V L Q+ + W+ R LT
Sbjct: 426 LLSSLNDYIYSASPGDNT-VYTHLFIGSEASFTLAAGQVALKQESR--LPWEGCARFELT 482
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
+ V +L LR+P W+ A+ +NG + T RW+ D +
Sbjct: 483 AVPEAPV----TLALRIPSWS-GGRAELRINGAAEAYEVENGYAVVTRRWTAGDVVEWAP 537
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP 625
L + A + A AI GP
Sbjct: 538 ALQAQLTAAHPEIRANAGRAAIERGP 563
>gi|417116562|ref|ZP_11967423.1| putative glycosyhydrolase [Escherichia coli 1.2741]
gi|422801520|ref|ZP_16850016.1| hypothetical protein ERJG_02686 [Escherichia coli M863]
gi|323965978|gb|EGB61421.1| hypothetical protein ERJG_02686 [Escherichia coli M863]
gi|386139106|gb|EIG80261.1| putative glycosyhydrolase [Escherichia coli 1.2741]
Length = 656
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q +LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQITLNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|417664178|ref|ZP_12313758.1| secreted protein [Escherichia coli AA86]
gi|330909651|gb|EGH38165.1| secreted protein [Escherichia coli AA86]
Length = 657
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 108/499 (21%), Positives = 181/499 (36%), Gaps = 97/499 (19%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
V +L A A +A +++ V+ ++ Q K GYL+ + T +A +
Sbjct: 74 VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCK--DGYLNTYFT-----VKAPEER 126
Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
W+ H++ L++ V A + +V + + V + H Y +
Sbjct: 127 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLTDHIDSVFGPDESKLHGYPGH 186
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
E + L RLY +T +P++L L + F +P + + SH+H
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243
Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
+ H+PI IG +R+ +Y + G + ++ S
Sbjct: 244 MVKDKAYSQAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLW 297
Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
Y TGG +S F D DT+ +E +C + ++ +R + +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354
Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
YAD ERAL N VL + Y+ PL H KFN +
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPI 405
Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
CC + +G +Y E LYI Y +S + + L +V
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
W ++T+ S Q V +L LR+P W Q LNG+ + +L
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWCIQ--PQIILNGEEVEQDIRKGYLHI 516
Query: 586 TERWSYNDKLTIQLPLSLR 604
T W D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535
>gi|416899982|ref|ZP_11929388.1| hypothetical protein ECSTEC7V_4230 [Escherichia coli STEC_7v]
gi|327251242|gb|EGE62935.1| hypothetical protein ECSTEC7V_4230 [Escherichia coli STEC_7v]
Length = 656
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q +LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQITLNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|293413020|ref|ZP_06655688.1| conserved hypothetical protein [Escherichia coli B354]
gi|291468667|gb|EFF11160.1| conserved hypothetical protein [Escherichia coli B354]
Length = 656
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q +LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWCAQ--PQVTLNGEEVGQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|170681898|ref|YP_001745874.1| hypothetical protein EcSMS35_3909 [Escherichia coli SMS-3-5]
gi|170519616|gb|ACB17794.1| conserved hypothetical protein [Escherichia coli SMS-3-5]
Length = 656
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q +LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQITLNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|336239737|ref|XP_003342727.1| hypothetical protein SMAC_10375 [Sordaria macrospora k-hell]
Length = 159
Score = 60.1 bits (144), Expect = 6e-06, Method: Composition-based stats.
Identities = 35/98 (35%), Positives = 51/98 (52%), Gaps = 2/98 (2%)
Query: 111 QSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENPISELRGHFVGHYL 170
Q S A N YLL LD + L+ +F +A LP P YGGWE + GH +GH+L
Sbjct: 60 QPSPFADAFAANRRYLLDLDPERLLHNFYISAGLPAPKPVYGGWE--AQGIAGHSLGHWL 117
Query: 171 SASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYL 208
SA A A++ +A I ++ + ++ Q G GY+
Sbjct: 118 SACALTVANSGDAAIAARLDHALKEMARIQAAHGDGYV 155
>gi|422783824|ref|ZP_16836607.1| hypothetical protein ERFG_04064 [Escherichia coli TW10509]
gi|323975001|gb|EGB70110.1| hypothetical protein ERFG_04064 [Escherichia coli TW10509]
Length = 656
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSHYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q +LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQITLNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|423105419|ref|ZP_17093121.1| hypothetical protein HMPREF9686_04025 [Klebsiella oxytoca 10-5242]
gi|376380736|gb|EHS93479.1| hypothetical protein HMPREF9686_04025 [Klebsiella oxytoca 10-5242]
Length = 653
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 79/370 (21%), Positives = 131/370 (35%), Gaps = 85/370 (22%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
L RLY +T +P+++ L F +P F ++ + Y
Sbjct: 193 LMRLYDVTQEPRYMALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
+ P+ IG +R+ +Y + G + ++ Y
Sbjct: 253 AHQPISEQPVAIGHAVRF------VYLMTGVAHLARLSQDEGKRRDCLRLWKNMARRQLY 306
Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG +S F D DT+ +E +C + ++ +R + + YAD ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
AL N VL + Y+ PL H KFN + C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPVRQRWFGCAC 414
Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
C + LG IY + LYI YI +S + G+ L ++ W +
Sbjct: 415 CPPNIARVLTSLGHYIYTPHDD---ALYINLYIGNSVEIPVGNEALRLRISGNYPWQEQV 471
Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
++ + SS +L LR+P W + Q +LNG + +L + W D
Sbjct: 472 KIVIDSSSPVN----HTLALRLPDWC--DKPQVTLNGAPVTQDVRKGYLHISHLWQEGDT 525
Query: 595 LTIQLPLSLR 604
L + LP+ +R
Sbjct: 526 LQLTLPMPVR 535
>gi|152968091|ref|YP_001363875.1| hypothetical protein Krad_4148 [Kineococcus radiotolerans SRS30216]
gi|151362608|gb|ABS05611.1| protein of unknown function DUF1680 [Kineococcus radiotolerans
SRS30216]
Length = 652
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 109/253 (43%), Gaps = 39/253 (15%)
Query: 368 ASHSYATGGTSAREFWWDPKRLAD--TLGSENE--ETCTTYNMLKVSRHLFRWTKEIAYA 423
AS +Y TGG AR WD ++ D LG E ETC ++ + + T E YA
Sbjct: 301 ASKTYVTGGIGAR---WDWEQFGDHYELGPERAYAETCAAIGSVQWTWRMLLATGEARYA 357
Query: 424 DYYERALTNGVLSIQRGTEPGVMI----------YMLPLGRGVSKARST-HGWGTKFNSF 472
D ER L N L PGV + L G + RS HG F+
Sbjct: 358 DLVERTLYNAFL-------PGVSLAGTEYFYVNALQLRHGAFAEEERSVAHGRRPWFDCA 410
Query: 473 WCCYGTGIESFSKLGDSIYFEEEGN-VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWD 531
CC + + S L + + V G+ + Q+ + + + + L+ D WD
Sbjct: 411 -CCPPNIMRTLSSLDAYVATSSATDGVAGVQVHQFTTGTIE--AAGAALSVTTD--YPWD 465
Query: 532 PYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSY 591
+R+ +T + + L LR+P W + GA A+++G+ + + PG +L ++
Sbjct: 466 GTVRVEVTATPGE-----FELALRVPAW--AQGATATVDGEAVAV-TPGEYLRVRRDFAV 517
Query: 592 NDKLTIQLPLSLR 604
D + + LP+++R
Sbjct: 518 GDVVELVLPMTVR 530
>gi|317048885|ref|YP_004116533.1| hypothetical protein Pat9b_2677 [Pantoea sp. At-9b]
gi|316950502|gb|ADU69977.1| protein of unknown function DUF1680 [Pantoea sp. At-9b]
Length = 651
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 68/271 (25%), Positives = 100/271 (36%), Gaps = 39/271 (14%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H FN + CC + LG IY E LYI
Sbjct: 387 --EVHPKTLSFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTPRE---EALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + G L +++ W +T+T S Q V +L LR+P W +
Sbjct: 442 LYVGNSLEVPVGEQTLRLRINGNFPWQE--TVTITIDSPQPVQH--TLALRLPDWC--DA 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
Q +LN + +L WS D LT+ LP+ +R A A+ G
Sbjct: 496 PQVTLNDAAVASDIRKGYLHINRSWSEGDTLTLTLPMPVRRVYGNPLVRHVAGKVALQRG 555
Query: 625 P--YLLAGHTSGE-----WDIKTGTARSLSA 648
P Y L +GE W +T T R+
Sbjct: 556 PLVYCLEQADNGEELHNLWLPQTATFRTFEG 586
>gi|402843427|ref|ZP_10891823.1| putative glycosyhydrolase [Klebsiella sp. OBRC7]
gi|402277059|gb|EJU26151.1| putative glycosyhydrolase [Klebsiella sp. OBRC7]
Length = 653
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 79/370 (21%), Positives = 131/370 (35%), Gaps = 85/370 (22%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
L RLY +T +P+++ L F +P F ++ + Y
Sbjct: 193 LMRLYDVTQEPRYMALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
+ P+ IG +R+ +Y + G + ++ Y
Sbjct: 253 AHQPISEQPVAIGHAVRF------VYLMTGVAHLARLSQDEGKRRDCLRLWKNMARRQLY 306
Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG +S F D DT+ +E +C + ++ +R + + YAD ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
AL N VL + Y+ PL H KFN + C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPVRQRWFGCAC 414
Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
C + LG IY + LYI YI +S + G+ L ++ W +
Sbjct: 415 CPPNIARVLTSLGHYIYTPHDD---ALYINLYIGNSVEIPVGNEALRLRISGNYPWQEQV 471
Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
++ + SS +L LR+P W + Q +LNG + +L + W D
Sbjct: 472 KIVIDSSSPVN----HTLALRLPDWC--DKPQVTLNGAPVTQDVRKGYLHISHLWREGDT 525
Query: 595 LTIQLPLSLR 604
L + LP+ +R
Sbjct: 526 LQLTLPMPVR 535
>gi|331685249|ref|ZP_08385835.1| putative cytoplasmic protein [Escherichia coli H299]
gi|450194438|ref|ZP_21892361.1| hypothetical protein A364_18755 [Escherichia coli SEPT362]
gi|331077620|gb|EGI48832.1| putative cytoplasmic protein [Escherichia coli H299]
gi|449316669|gb|EMD06777.1| hypothetical protein A364_18755 [Escherichia coli SEPT362]
Length = 656
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 108/499 (21%), Positives = 182/499 (36%), Gaps = 97/499 (19%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
V +L A A +A +++ V+ ++ Q + GYL+A+ T +A +
Sbjct: 74 VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNAYFT-----VKAPEER 126
Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
W+ H++ L++ V A + +V + + V + H Y +
Sbjct: 127 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLADHIDSVFGPGESKLHGYPGH 186
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
E + L RLY +T +P++L L + F +P + + SH+H
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243
Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
+ H+PI IG +R+ +Y + G + ++ S
Sbjct: 244 MVKDKAYSQAHLPIAQQQTAIGHTVRF------VYLMTGVAHLARLSHDESKRQDCLRLW 297
Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
Y TGG +S F D DT+ +E +C + ++ +R + +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354
Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
YAD ERAL N VL + Y+ PL H KFN +
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPI 405
Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
CC + +G +Y E LYI Y +S + + L +V
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
W ++T+ S Q V +L LR+P W Q LNG+ + +L
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 516
Query: 586 TERWSYNDKLTIQLPLSLR 604
T W D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535
>gi|432618844|ref|ZP_19854944.1| hypothetical protein A1UM_04296 [Escherichia coli KTE75]
gi|431151056|gb|ELE52093.1| hypothetical protein A1UM_04296 [Escherichia coli KTE75]
Length = 659
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 108/499 (21%), Positives = 182/499 (36%), Gaps = 97/499 (19%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
V +L A A +A +++ V+ ++ Q + GYL+A+ T +A +
Sbjct: 74 VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNAYFT-----VKAPEER 126
Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
W+ H++ L++ V A + +V + + V + H Y +
Sbjct: 127 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLADHIDSVFGPGESKLHGYPGH 186
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
E + L RLY +T +P++L L + F +P + + SH+H
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243
Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
+ H+PI IG +R+ +Y + G + ++ S
Sbjct: 244 MVKDKAYSQAHLPIAQQQTAIGHTVRF------VYLMTGVAHLARLSHDESKRQDCLRLW 297
Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
Y TGG +S F D DT+ +E +C + ++ +R + +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354
Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
YAD ERAL N VL + Y+ PL H KFN +
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPI 405
Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
CC + +G +Y E LYI Y +S + + L +V
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
W ++T+ S Q V +L LR+P W Q LNG+ + +L
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 516
Query: 586 TERWSYNDKLTIQLPLSLR 604
T W D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535
>gi|387831475|ref|YP_003351412.1| hypothetical protein ECSF_3422 [Escherichia coli SE15]
gi|432399540|ref|ZP_19642313.1| hypothetical protein WEI_04484 [Escherichia coli KTE25]
gi|432408662|ref|ZP_19651364.1| hypothetical protein WEO_03868 [Escherichia coli KTE28]
gi|432502151|ref|ZP_19743901.1| hypothetical protein A177_04267 [Escherichia coli KTE216]
gi|432696461|ref|ZP_19931652.1| hypothetical protein A31I_03948 [Escherichia coli KTE162]
gi|432725058|ref|ZP_19959971.1| hypothetical protein WE1_04112 [Escherichia coli KTE17]
gi|432729639|ref|ZP_19964512.1| hypothetical protein WE3_04113 [Escherichia coli KTE18]
gi|432743329|ref|ZP_19978043.1| hypothetical protein WEE_04046 [Escherichia coli KTE23]
gi|432922799|ref|ZP_20125572.1| hypothetical protein A133_04523 [Escherichia coli KTE173]
gi|432929459|ref|ZP_20130509.1| hypothetical protein A135_04587 [Escherichia coli KTE175]
gi|432983040|ref|ZP_20171809.1| hypothetical protein A15W_04187 [Escherichia coli KTE211]
gi|432992699|ref|ZP_20181347.1| hypothetical protein A179_04489 [Escherichia coli KTE217]
gi|433098416|ref|ZP_20284583.1| hypothetical protein WK3_03618 [Escherichia coli KTE139]
gi|433107854|ref|ZP_20293813.1| hypothetical protein WK7_03720 [Escherichia coli KTE148]
gi|433112834|ref|ZP_20298684.1| hypothetical protein WK9_03711 [Escherichia coli KTE150]
gi|281180632|dbj|BAI56962.1| conserved hypothetical protein [Escherichia coli SE15]
gi|430912702|gb|ELC33874.1| hypothetical protein WEI_04484 [Escherichia coli KTE25]
gi|430926036|gb|ELC46624.1| hypothetical protein WEO_03868 [Escherichia coli KTE28]
gi|431025819|gb|ELD38905.1| hypothetical protein A177_04267 [Escherichia coli KTE216]
gi|431231105|gb|ELF26873.1| hypothetical protein A31I_03948 [Escherichia coli KTE162]
gi|431262277|gb|ELF54267.1| hypothetical protein WE1_04112 [Escherichia coli KTE17]
gi|431270780|gb|ELF61923.1| hypothetical protein WE3_04113 [Escherichia coli KTE18]
gi|431281486|gb|ELF72389.1| hypothetical protein WEE_04046 [Escherichia coli KTE23]
gi|431435293|gb|ELH16905.1| hypothetical protein A133_04523 [Escherichia coli KTE173]
gi|431440867|gb|ELH22195.1| hypothetical protein A135_04587 [Escherichia coli KTE175]
gi|431488798|gb|ELH68428.1| hypothetical protein A15W_04187 [Escherichia coli KTE211]
gi|431490717|gb|ELH70325.1| hypothetical protein A179_04489 [Escherichia coli KTE217]
gi|431612416|gb|ELI81663.1| hypothetical protein WK3_03618 [Escherichia coli KTE139]
gi|431623752|gb|ELI92378.1| hypothetical protein WK7_03720 [Escherichia coli KTE148]
gi|431625172|gb|ELI93765.1| hypothetical protein WK9_03711 [Escherichia coli KTE150]
Length = 657
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWCIQ--PQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|354603632|ref|ZP_09021629.1| hypothetical protein HMPREF9450_00544 [Alistipes indistinctus YIT
12060]
gi|353348727|gb|EHB92995.1| hypothetical protein HMPREF9450_00544 [Alistipes indistinctus YIT
12060]
Length = 630
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 68/272 (25%), Positives = 111/272 (40%), Gaps = 50/272 (18%)
Query: 375 GGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
G SA E ++ +R+ T ETC T +++ HL T + YAD ER + N +
Sbjct: 304 GSGSADECFYHGRRMQTTPAYSMMETCVTMTWMQLCGHLLELTHDPLYADQIERTVYNAL 363
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHG--WGTKFNSFWCCYGTGIESFSKLGD---- 488
L+ +G + Y PL GV RS G G N CC G +F+ + +
Sbjct: 364 LAALKGDGSQIAKYS-PL-EGV---RSPGGPQCGMHVN---CCNMNGPRAFAMIPELMAT 415
Query: 489 --------SIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
++Y E VP G V+L Q+ + + + LT
Sbjct: 416 CAADTLFVNLYGESVSKVP-------------LAGGEVILRQQTN----YPEQGSVELTV 458
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
+ ++ + ++ +R+P W S ++NGQ + PG++L+ + W DK+ +
Sbjct: 459 NPRKS--REFAVAVRIPAW--SKITMVTVNGQAVADVRPGSYLTVSRTWKEGDKIALNFD 514
Query: 601 LSLRTEAIQDDRPEYASIQAILFGPYLLAGHT 632
+ R E QAI GP +LA T
Sbjct: 515 MRGRLT-------ELNGYQAIERGPVVLARDT 539
>gi|386621273|ref|YP_006140853.1| hypothetical protein ECNA114_3739 [Escherichia coli NA114]
gi|432423998|ref|ZP_19666535.1| hypothetical protein A137_04431 [Escherichia coli KTE178]
gi|432560859|ref|ZP_19797513.1| hypothetical protein A1S7_04513 [Escherichia coli KTE49]
gi|432707936|ref|ZP_19943011.1| hypothetical protein WCG_01214 [Escherichia coli KTE6]
gi|432891143|ref|ZP_20103901.1| hypothetical protein A31K_00995 [Escherichia coli KTE165]
gi|333971774|gb|AEG38579.1| Hypothetical protein ECNA114_3739 [Escherichia coli NA114]
gi|430941626|gb|ELC61768.1| hypothetical protein A137_04431 [Escherichia coli KTE178]
gi|431088585|gb|ELD94458.1| hypothetical protein A1S7_04513 [Escherichia coli KTE49]
gi|431254890|gb|ELF48151.1| hypothetical protein WCG_01214 [Escherichia coli KTE6]
gi|431430258|gb|ELH12090.1| hypothetical protein A31K_00995 [Escherichia coli KTE165]
Length = 657
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWCIQ--PQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|432491369|ref|ZP_19733231.1| hypothetical protein A171_03302 [Escherichia coli KTE213]
gi|432841396|ref|ZP_20074855.1| hypothetical protein A1YQ_04362 [Escherichia coli KTE140]
gi|433205327|ref|ZP_20389073.1| hypothetical protein WGY_03902 [Escherichia coli KTE95]
gi|431018040|gb|ELD31485.1| hypothetical protein A171_03302 [Escherichia coli KTE213]
gi|431386628|gb|ELG70584.1| hypothetical protein A1YQ_04362 [Escherichia coli KTE140]
gi|431716416|gb|ELJ80548.1| hypothetical protein WGY_03902 [Escherichia coli KTE95]
Length = 654
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|432682342|ref|ZP_19917698.1| hypothetical protein A1YW_04093 [Escherichia coli KTE143]
gi|431217316|gb|ELF14895.1| hypothetical protein A1YW_04093 [Escherichia coli KTE143]
Length = 659
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|432604420|ref|ZP_19840650.1| hypothetical protein A1U5_04274 [Escherichia coli KTE66]
gi|431137800|gb|ELE39645.1| hypothetical protein A1U5_04274 [Escherichia coli KTE66]
Length = 654
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|300937197|ref|ZP_07152048.1| conserved hypothetical protein [Escherichia coli MS 21-1]
gi|300457729|gb|EFK21222.1| conserved hypothetical protein [Escherichia coli MS 21-1]
Length = 667
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 260 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 313
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 370
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 421
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 422 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 477
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 478 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 532
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 533 TLNLTLPMPVR 543
>gi|417141197|ref|ZP_11984110.1| putative glycosyhydrolase [Escherichia coli 97.0259]
gi|417310126|ref|ZP_12096949.1| hypothetical protein PPECC33_35210 [Escherichia coli PCN033]
gi|338768332|gb|EGP23129.1| hypothetical protein PPECC33_35210 [Escherichia coli PCN033]
gi|386155687|gb|EIH12037.1| putative glycosyhydrolase [Escherichia coli 97.0259]
Length = 654
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|422334703|ref|ZP_16415708.1| hypothetical protein HMPREF0986_04202 [Escherichia coli 4_1_47FAA]
gi|432871119|ref|ZP_20091498.1| hypothetical protein A313_02338 [Escherichia coli KTE147]
gi|373244312|gb|EHP63799.1| hypothetical protein HMPREF0986_04202 [Escherichia coli 4_1_47FAA]
gi|431408324|gb|ELG91511.1| hypothetical protein A313_02338 [Escherichia coli KTE147]
Length = 654
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 MLNLTLPMPVR 535
>gi|422829813|ref|ZP_16877977.1| hypothetical protein ESNG_02482 [Escherichia coli B093]
gi|371607765|gb|EHN96330.1| hypothetical protein ESNG_02482 [Escherichia coli B093]
Length = 659
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|432817355|ref|ZP_20051112.1| hypothetical protein A1Y1_03761 [Escherichia coli KTE115]
gi|431361237|gb|ELG47834.1| hypothetical protein A1Y1_03761 [Escherichia coli KTE115]
Length = 656
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|293417024|ref|ZP_06659661.1| hypothetical protein ECDG_04192 [Escherichia coli B185]
gi|291431600|gb|EFF04585.1| hypothetical protein ECDG_04192 [Escherichia coli B185]
Length = 656
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 135/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKREQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|423126346|ref|ZP_17114025.1| hypothetical protein HMPREF9694_03037 [Klebsiella oxytoca 10-5250]
gi|376397918|gb|EHT10548.1| hypothetical protein HMPREF9694_03037 [Klebsiella oxytoca 10-5250]
Length = 653
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/395 (21%), Positives = 142/395 (35%), Gaps = 71/395 (17%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
L RLY +T +P+++ L F +P F ++ + Y
Sbjct: 193 LMRLYDVTQEPRYIALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
+ + P+ IG +R+ +Y + G + ++ Y
Sbjct: 253 AHQSISEQPVAIGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWKNMAQRQLY 306
Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG +S F D DT+ +E +C + ++ +R + + YAD ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIES 482
AL N VL + Y+ PL + H + W CC
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPLEVNPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARV 422
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
+ LG IY + LYI Y+ +S + G+ L ++ W +++ + SS
Sbjct: 423 LTSLGHYIYTPHDD---ALYINLYVGNSVEIPVGNEALRLRISGNYPWQEQVKIVVDSSS 479
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
+L LR+P W + Q +LNG + +L + W D L + LP+
Sbjct: 480 PVH----HTLALRLPDWC--DKPQVTLNGVPVTQDVRKGYLHISHLWQEGDTLQLTLPMP 533
Query: 603 LRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
+R A + A+ GP Y L +GE
Sbjct: 534 VRRIYGNPLVRHQAGLVAVQRGPLVYCLEQADNGE 568
>gi|417588723|ref|ZP_12239485.1| hypothetical protein ECSTECC16502_4394 [Escherichia coli
STEC_C165-02]
gi|345331722|gb|EGW64181.1| hypothetical protein ECSTECC16502_4394 [Escherichia coli
STEC_C165-02]
Length = 654
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVRGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 MLNLTLPMPVR 535
>gi|432545326|ref|ZP_19782157.1| hypothetical protein A197_03919 [Escherichia coli KTE236]
gi|432550808|ref|ZP_19787564.1| hypothetical protein A199_04285 [Escherichia coli KTE237]
gi|432623948|ref|ZP_19859963.1| hypothetical protein A1UO_03835 [Escherichia coli KTE76]
gi|431071355|gb|ELD79491.1| hypothetical protein A197_03919 [Escherichia coli KTE236]
gi|431077175|gb|ELD84442.1| hypothetical protein A199_04285 [Escherichia coli KTE237]
gi|431156242|gb|ELE56979.1| hypothetical protein A1UO_03835 [Escherichia coli KTE76]
Length = 654
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWCIQ--PQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|419924680|ref|ZP_14442556.1| hypothetical protein EC54115_16625 [Escherichia coli 541-15]
gi|388389076|gb|EIL50615.1| hypothetical protein EC54115_16625 [Escherichia coli 541-15]
Length = 659
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 106/499 (21%), Positives = 182/499 (36%), Gaps = 97/499 (19%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
V +L A A +A +++ V+ ++ Q + GYL+ + T +A +
Sbjct: 74 VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNTYFT-----VKAPEER 126
Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
W+ H++ L++ V A + +V + + +V + H Y +
Sbjct: 127 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLADHIDRVFGPDESKLHGYPGH 186
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
E + L RLY +T +P++L L + F +P + + SH+H
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243
Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
+ H+P+ IG +R+ +Y + G + ++ S
Sbjct: 244 MVKDKAYSQAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLW 297
Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
Y TGG +S F D DT+ +E +C + ++ +R + +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354
Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
YAD ERAL N VL + Y+ PL H KFN +
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPI 405
Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
CC + +G +Y E LYI Y +S + + L +V
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
W ++T+ S Q V +L LR+P W Q LNG+ + +L
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 516
Query: 586 TERWSYNDKLTIQLPLSLR 604
T W D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535
>gi|379722221|ref|YP_005314352.1| hypothetical protein PM3016_4439 [Paenibacillus mucilaginosus 3016]
gi|386724962|ref|YP_006191288.1| hypothetical protein B2K_22975 [Paenibacillus mucilaginosus K02]
gi|378570893|gb|AFC31203.1| hypothetical protein PM3016_4439 [Paenibacillus mucilaginosus 3016]
gi|384092087|gb|AFH63523.1| hypothetical protein B2K_22975 [Paenibacillus mucilaginosus K02]
Length = 660
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 129/354 (36%), Gaps = 69/354 (19%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
L +LY T + ++L LA F +P FL Q D SH+ A +PI QM
Sbjct: 195 ALVKLYGATGEERYLKLAQFFIDERGTEPNFLVEECRQRDGYSHW-AKKKLPIPTAEQMA 253
Query: 347 YEVTGDPLYKL-------IGTFFMDIVNASHSYATGGT----SAREFWWDPKR----LAD 391
Y P+ + + +M A + TG + R W + + +
Sbjct: 254 YNQAHKPVRQQDTAVGHSVRAVYMYTAMADLARLTGDAELLEACRRLWDNTTKKQMYITG 313
Query: 392 TLGSENE-----------------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
+GS + ETC + ++ +R + + + YAD ERAL N V
Sbjct: 314 GIGSTHHGEAFSFDYDLPNDTVYAETCASIGLIFFARRMLQLEAKSEYADVLERALYNNV 373
Query: 435 LS--IQRGTEPGVMIYMLPL-----------GRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
+ Q G Y+ PL GR KA +G CC
Sbjct: 374 IGSMSQDGKH---YFYVNPLEVWPKASEQNPGRHHVKAVRQPWFGCS-----CCPPNVAR 425
Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
S L D IY G +Y +I S SF +G V L Q+ + W+ R LT
Sbjct: 426 LLSSLNDYIYSASAGENT-VYTHLFIGSEASFKLAAGQVALKQESR--LPWEGCARFELT 482
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
+ V +L LR+P W+ A+ +NG + T RW+ D
Sbjct: 483 AVPEAPV----TLALRIPSWS-GGRAELRINGAAEAYEVENGYAVVTRRWTAGD 531
>gi|423230660|ref|ZP_17217064.1| hypothetical protein HMPREF1063_02884 [Bacteroides dorei
CL02T00C15]
gi|423244371|ref|ZP_17225446.1| hypothetical protein HMPREF1064_01652 [Bacteroides dorei
CL02T12C06]
gi|392630310|gb|EIY24303.1| hypothetical protein HMPREF1063_02884 [Bacteroides dorei
CL02T00C15]
gi|392641945|gb|EIY35717.1| hypothetical protein HMPREF1064_01652 [Bacteroides dorei
CL02T12C06]
Length = 811
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 83/386 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L A F + G LS + + H PI ++G +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y T + + + TGG +R P+ + G
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + + +F T + YAD ERAL NGV+S GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G I F + +GN +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCLGN-ITRFMASVPYYMYATQGN--DV 432
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
Y+ +I S D ++ +N + WD + + +T +QE +L +R+P W
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488
Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
++++ AQA S+NG + + + W D + I LP+ +R
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548
Query: 605 -TEAIQDDRPEYASIQAILFGPYLLA 629
+ ++DDR + AI GP +
Sbjct: 549 ANDQVEDDRGKL----AIERGPIMFC 570
>gi|300898699|ref|ZP_07117012.1| conserved hypothetical protein [Escherichia coli MS 198-1]
gi|300357662|gb|EFJ73532.1| conserved hypothetical protein [Escherichia coli MS 198-1]
Length = 662
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 260 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 313
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 370
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPLRQRWFGCA 421
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 422 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLCLRVSGNYPWQE- 477
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 478 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 532
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 533 TLNLTLPMPVR 543
>gi|421448505|ref|ZP_15897898.1| hypothetical protein SEEE6482_08181 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|396073159|gb|EJI81465.1| hypothetical protein SEEE6482_08181 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
Length = 651
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/398 (22%), Positives = 143/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGVQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + G+ L ++ W +++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI- 475
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V L +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 476 -DSVQPV--LHTLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|422975185|ref|ZP_16976637.1| hypothetical protein ESRG_03271 [Escherichia coli TA124]
gi|371595315|gb|EHN84166.1| hypothetical protein ESRG_03271 [Escherichia coli TA124]
Length = 654
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLCLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|218707221|ref|YP_002414740.1| hypothetical protein ECUMN_4099 [Escherichia coli UMN026]
gi|293407210|ref|ZP_06651134.1| conserved hypothetical protein [Escherichia coli FVEC1412]
gi|298382958|ref|ZP_06992553.1| hypothetical protein ECFG_04115 [Escherichia coli FVEC1302]
gi|419934131|ref|ZP_14451275.1| hypothetical protein EC5761_10490 [Escherichia coli 576-1]
gi|432355611|ref|ZP_19598877.1| hypothetical protein WCA_04610 [Escherichia coli KTE2]
gi|432403987|ref|ZP_19646731.1| hypothetical protein WEK_04194 [Escherichia coli KTE26]
gi|432428252|ref|ZP_19670733.1| hypothetical protein A139_03650 [Escherichia coli KTE181]
gi|432462951|ref|ZP_19705084.1| hypothetical protein A15I_03831 [Escherichia coli KTE204]
gi|432477946|ref|ZP_19719933.1| hypothetical protein A15Q_04149 [Escherichia coli KTE208]
gi|432519807|ref|ZP_19756986.1| hypothetical protein A17U_02789 [Escherichia coli KTE228]
gi|432539967|ref|ZP_19776859.1| hypothetical protein A195_03603 [Escherichia coli KTE235]
gi|432633483|ref|ZP_19869403.1| hypothetical protein A1UW_03877 [Escherichia coli KTE80]
gi|432643180|ref|ZP_19879004.1| hypothetical protein A1W1_04060 [Escherichia coli KTE83]
gi|432668175|ref|ZP_19903747.1| hypothetical protein A1Y3_04798 [Escherichia coli KTE116]
gi|432772362|ref|ZP_20006675.1| hypothetical protein A1SG_00431 [Escherichia coli KTE54]
gi|432889014|ref|ZP_20102658.1| hypothetical protein A31C_04403 [Escherichia coli KTE158]
gi|432915187|ref|ZP_20120514.1| hypothetical protein A13Q_04153 [Escherichia coli KTE190]
gi|433020828|ref|ZP_20208923.1| hypothetical protein WI7_03757 [Escherichia coli KTE105]
gi|433055258|ref|ZP_20242416.1| hypothetical protein WIK_04060 [Escherichia coli KTE122]
gi|433069946|ref|ZP_20256714.1| hypothetical protein WIQ_03826 [Escherichia coli KTE128]
gi|433160742|ref|ZP_20345560.1| hypothetical protein WKU_03818 [Escherichia coli KTE177]
gi|433180460|ref|ZP_20364837.1| hypothetical protein WGM_04099 [Escherichia coli KTE82]
gi|218434318|emb|CAR15240.1| conserved hypothetical protein [Escherichia coli UMN026]
gi|291426021|gb|EFE99055.1| conserved hypothetical protein [Escherichia coli FVEC1412]
gi|298276794|gb|EFI18312.1| hypothetical protein ECFG_04115 [Escherichia coli FVEC1302]
gi|388409694|gb|EIL69966.1| hypothetical protein EC5761_10490 [Escherichia coli 576-1]
gi|430872588|gb|ELB96188.1| hypothetical protein WCA_04610 [Escherichia coli KTE2]
gi|430923400|gb|ELC44137.1| hypothetical protein WEK_04194 [Escherichia coli KTE26]
gi|430951024|gb|ELC70250.1| hypothetical protein A139_03650 [Escherichia coli KTE181]
gi|430986214|gb|ELD02797.1| hypothetical protein A15I_03831 [Escherichia coli KTE204]
gi|431002149|gb|ELD17675.1| hypothetical protein A15Q_04149 [Escherichia coli KTE208]
gi|431048059|gb|ELD58044.1| hypothetical protein A17U_02789 [Escherichia coli KTE228]
gi|431067015|gb|ELD75632.1| hypothetical protein A195_03603 [Escherichia coli KTE235]
gi|431167666|gb|ELE67931.1| hypothetical protein A1UW_03877 [Escherichia coli KTE80]
gi|431177575|gb|ELE77497.1| hypothetical protein A1W1_04060 [Escherichia coli KTE83]
gi|431198006|gb|ELE96833.1| hypothetical protein A1Y3_04798 [Escherichia coli KTE116]
gi|431323599|gb|ELG11078.1| hypothetical protein A1SG_00431 [Escherichia coli KTE54]
gi|431413832|gb|ELG96595.1| hypothetical protein A31C_04403 [Escherichia coli KTE158]
gi|431436255|gb|ELH17862.1| hypothetical protein A13Q_04153 [Escherichia coli KTE190]
gi|431526942|gb|ELI03673.1| hypothetical protein WI7_03757 [Escherichia coli KTE105]
gi|431566044|gb|ELI39087.1| hypothetical protein WIK_04060 [Escherichia coli KTE122]
gi|431578915|gb|ELI51501.1| hypothetical protein WIQ_03826 [Escherichia coli KTE128]
gi|431673865|gb|ELJ40054.1| hypothetical protein WKU_03818 [Escherichia coli KTE177]
gi|431697952|gb|ELJ63031.1| hypothetical protein WGM_04099 [Escherichia coli KTE82]
Length = 654
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPLRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLCLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|291086404|ref|ZP_06355701.2| putative cytoplasmic protein [Citrobacter youngae ATCC 29220]
gi|291068139|gb|EFE06248.1| putative cytoplasmic protein [Citrobacter youngae ATCC 29220]
Length = 659
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/220 (24%), Positives = 86/220 (39%), Gaps = 32/220 (14%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 342 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 394
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + +G IY + LYI
Sbjct: 395 --EVHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARILTSIGHYIYTPRQD---ALYIN 449
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + VL ++ W + ++T+ S Q V +L LR+P W +
Sbjct: 450 LYVGNSMEVPVADGVLKLRISGNYPW--HEQVTIAIESPQPVKH--TLALRLPDWC--SA 503
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
Q LNGQ + +L + W D L++ LP+ +R
Sbjct: 504 PQVLLNGQPVAQDIRKGYLHISRTWQEGDTLSLTLPMPVR 543
>gi|345514174|ref|ZP_08793688.1| six-hairpin glycosidase, partial [Bacteroides dorei 5_1_36/D4]
gi|345456089|gb|EEO48255.2| six-hairpin glycosidase [Bacteroides dorei 5_1_36/D4]
Length = 810
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 100/439 (22%), Positives = 168/439 (38%), Gaps = 92/439 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L A F + G LS + + H PI ++G +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y T + + + TGG +R P+ + G
Sbjct: 276 AGYLYSGVADVATLTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + + +F T + YAD ERAL NGV+S GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G I F + +GN +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
Y+ +I S D ++ +N + WD + + +T +QE +L +R+P W
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488
Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
++++ AQA S+NG + + + W D + I LP+ +R
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548
Query: 605 -TEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVT 663
+ ++DDR + AI GP + G+ + +P+ S++A L+
Sbjct: 549 ANDQVEDDRGKL----AIERGPIMFC--LEGQDQADSTVFNKFIPDGTPMEASYDADLL- 601
Query: 664 FTQESGNSTFVMSNSNQSI 682
N V+S + + I
Sbjct: 602 ------NGVMVLSGTAKEI 614
>gi|265752762|ref|ZP_06088331.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263235948|gb|EEZ21443.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 811
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 96/415 (23%), Positives = 156/415 (37%), Gaps = 77/415 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L A F + G LS + + H PI ++G +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDKIVGHAVR 275
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y T + + + TGG +R P+ + G
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + + +F T + YAD ERAL NGV+S GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G I F + +GN +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT- 560
Y+ +I S D ++ +N + WD + + +T +QE +L +R+P WT
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWTQ 488
Query: 561 ----------YSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
+++ AQA S+NG + + + W D + I LP+ +R
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548
Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
D + AI GP + G+ + +P+ SF+A L+
Sbjct: 549 ANDQVEDDHGKLAIERGPIMFC--LEGQDQADSTVFNKFIPDGTPMEASFHADLL 601
>gi|222530205|ref|YP_002574087.1| hypothetical protein Athe_2242 [Caldicellulosiruptor bescii DSM
6725]
gi|222457052|gb|ACM61314.1| protein of unknown function DUF1680 [Caldicellulosiruptor bescii
DSM 6725]
Length = 652
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 116/515 (22%), Positives = 199/515 (38%), Gaps = 74/515 (14%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFEALK 223
V +L A++ + N +++K+ V+ + + Q + GYL+ + T E + L+
Sbjct: 81 VAKWLEAASYILEKYPNPDLEKKVDEVIDIIEKAQWE--DGYLNTYFTIKEKGKRWTNLE 138
Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
Y H I AG+ + LA +L +E V +++ E
Sbjct: 139 ECHELYTAGHMIEAGV--AHFLATGKTSL------LEIIKKLADHVYSIFGKEEGKIPGY 190
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFL--------------GFLALQ 324
+ + L +LY +T D K+L LA F +P + GF L
Sbjct: 191 DGHPEIELALVKLYEVTGDRKYLELAKFFIDERGQEPYYFDIEWEKRGRKEHWQGFKRLG 250
Query: 325 ADYLSHFHANTHIPIVIGSQMR----YEVTGD--------PLYKLIGTFFMDIVNASHSY 372
+YL + +G +R Y D L+ + T F DIV Y
Sbjct: 251 REYLQVYRPVRQQKEAVGHAVRAVYLYSGMADVAAYTQDKELFDVCKTLFDDIVKRK-MY 309
Query: 373 ATG--GTSA--REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TG G+SA F ++ DT +E TC + ++ + L + Y D ER
Sbjct: 310 ITGAIGSSAHGEAFTFEYDLPNDTAYAE---TCASVGLIFFAHRLNKIEPHAKYYDVVER 366
Query: 429 ALTNGVLSI--QRGTEPGVMIYMLPLG---RGVSKARSTHGWGTKFNSFW---CCYGTGI 480
AL N V+ Q G + Y+ PL + V K H + ++ CC
Sbjct: 367 ALYNTVIGSMSQDGKK---YFYVNPLEVYPKEVEKRFDRHHVKPERQPWFGCACCPPNVA 423
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV-VLNQKVDPIVSWDPYLRMTLT 539
+ LG +Y N G+Y+ YI SS + G + VL Q+V ++ +++ L
Sbjct: 424 RLLASLGRYVY---SYNHDGIYVNLYIGSSVQVEVGGIKVLLQQVSSY-PFEDMVKIDLK 479
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERWSYNDKLTIQ 598
S + L LR+P W S + +NG+ P PP ++ W ND++ ++
Sbjct: 480 PSKEARF----KLYLRIPGWCES--YEVYVNGKKEEPEEPPSGYVCIERLWKENDQVVLK 533
Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+P ++ + A++ GP + +
Sbjct: 534 IPTEVKMVSSHPQVRSNVGKVAVVKGPVVFCAEEA 568
>gi|86359423|ref|YP_471315.1| hypothetical protein RHE_CH03841 [Rhizobium etli CFN 42]
gi|86283525|gb|ABC92588.1| hypothetical conserved protein [Rhizobium etli CFN 42]
Length = 640
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 90/396 (22%), Positives = 159/396 (40%), Gaps = 61/396 (15%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFHANT------HIPI 339
L +L +T + K+L L+ F +P F A++ LS +H T H+P+
Sbjct: 198 ALVKLARVTGEKKYLDLSKFFIDERGTEPHFFTAEAVRDGRSLSDYHQKTYEYGQAHLPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 258 REQKKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + + YAD E+AL NG L
Sbjct: 317 NEGFTDYYDLPN--ATAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
T+ Y PL A H W K++ CC + +G +Y + +
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426
Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+++ ++ +G V L Q + WD + F++K +L+LR+P
Sbjct: 427 AVHLYGESTARLKLANGAEVELEQATN--YPWDG----AVAFTAKLAKSAKFALSLRIPD 480
Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W + GA S+NG + L ++ W++ D++ + LP++LR + + A
Sbjct: 481 W--AEGASLSVNGTGVELGAHLRDGYIRIEREWAHGDRVALDLPMALRPQYANPKVRQDA 538
Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
A++ GP + T T + L+A+I P
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGQDLNAIILP 567
>gi|294777480|ref|ZP_06742931.1| putative lipoprotein [Bacteroides vulgatus PC510]
gi|294448548|gb|EFG17097.1| putative lipoprotein [Bacteroides vulgatus PC510]
Length = 811
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 100/439 (22%), Positives = 168/439 (38%), Gaps = 92/439 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L A F + G LS + + H PI ++G +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y T + + + TGG +R P+ + G
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + + +F T + YAD ERAL NGV+S GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G I F + +GN +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
Y+ +I S D ++ +N + WD + + +T +QE +L +R+P W
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488
Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
++++ AQA S+NG + + + W D + I LP+ +R
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548
Query: 605 -TEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVT 663
+ ++DDR + AI GP + G+ + +P+ S++A L+
Sbjct: 549 ANDQVEDDRGKL----AIERGPIIFC--LEGQDQADSTVFNKFIPDGTPMEASYDAGLL- 601
Query: 664 FTQESGNSTFVMSNSNQSI 682
N V+S + + I
Sbjct: 602 ------NGVMVLSGTAKEI 614
>gi|397660575|ref|YP_006501277.1| hypothetical protein A225_5616 [Klebsiella oxytoca E718]
gi|394348582|gb|AFN34703.1| putative secreted protein [Klebsiella oxytoca E718]
Length = 653
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/370 (21%), Positives = 131/370 (35%), Gaps = 85/370 (22%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
L RLY +T +P+++ L F +P F ++ + Y
Sbjct: 193 LMRLYDVTQEPRYIALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
+ P+ IG +R+ +Y + G + ++ Y
Sbjct: 253 AHQPISEQPVAIGHAVRF------VYLMTGVAHLARLSQDEGKRRDCLRLWKNMARRQLY 306
Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG +S F D DT+ +E +C + ++ +R + + YAD ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
AL N VL + Y+ PL H KFN + C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPVRQRWFGCAC 414
Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
C + LG IY + LYI Y+ +S + G+ L ++ W +
Sbjct: 415 CPPNIARVLTSLGHYIYTPHDDV---LYINLYVGNSVEIPVGNEALRLRISGNYPWQEQV 471
Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
++ + SS +L LR+P W + Q +LNG + +L + W D
Sbjct: 472 KIVIDSSSPVN----HTLALRLPDWC--DKPQVTLNGAPVTQDVRKGYLHISHLWQEGDT 525
Query: 595 LTIQLPLSLR 604
L + LP+ +R
Sbjct: 526 LQLTLPMPVR 535
>gi|375257948|ref|YP_005017118.1| hypothetical protein KOX_05730 [Klebsiella oxytoca KCTC 1686]
gi|365907426|gb|AEX02879.1| hypothetical protein KOX_05730 [Klebsiella oxytoca KCTC 1686]
Length = 653
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/370 (21%), Positives = 131/370 (35%), Gaps = 85/370 (22%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
L RLY +T +P+++ L F +P F ++ + Y
Sbjct: 193 LMRLYDVTQEPRYIALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
+ P+ IG +R+ +Y + G + ++ Y
Sbjct: 253 AHQPISEQPVAIGHAVRF------VYLMTGVAHLARLSQDEGKRRDCLRLWKNMARRQLY 306
Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG +S F D DT+ +E +C + ++ +R + + YAD ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
AL N VL + Y+ PL H KFN + C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPVRQRWFGCAC 414
Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
C + LG IY + LYI Y+ +S + G+ L ++ W +
Sbjct: 415 CPPNIARVLTSLGHYIYTPHDDV---LYINLYVGNSVEIPVGNEALRLRISGNYPWQEQV 471
Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
++ + SS +L LR+P W + Q +LNG + +L + W D
Sbjct: 472 KIVIDSSSPVN----HTLALRLPDWC--DKPQVTLNGAPVTQDVRKGYLHISHLWQEGDT 525
Query: 595 LTIQLPLSLR 604
L + LP+ +R
Sbjct: 526 LQLTLPMPVR 535
>gi|432394191|ref|ZP_19637011.1| hypothetical protein WE9_04517 [Escherichia coli KTE21]
gi|430914340|gb|ELC35436.1| hypothetical protein WE9_04517 [Escherichia coli KTE21]
Length = 656
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L ++ W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRISGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVGQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|421728042|ref|ZP_16167199.1| hypothetical protein KOXM_22161 [Klebsiella oxytoca M5al]
gi|410371224|gb|EKP25948.1| hypothetical protein KOXM_22161 [Klebsiella oxytoca M5al]
Length = 653
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 79/370 (21%), Positives = 131/370 (35%), Gaps = 85/370 (22%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL------------------GFLALQADYLS 329
L RLY +T +P+++ L F +P F ++ + Y
Sbjct: 193 LMRLYDVTQEPRYIALVKYFVEARGTQPHFYDIEYEKRGKTSYWNTYGPAWMVMDKPYSQ 252
Query: 330 HFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHS-----------------Y 372
+ P+ IG +R+ +Y + G + ++ Y
Sbjct: 253 AHQPISEQPVAIGHAVRF------VYLMAGVAHLARLSQDEGKRQDCLRLWKNMAQRQLY 306
Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG +S F D DT+ +E +C + ++ +R + + YAD ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------C 474
AL N VL + Y+ PL H KFN + C
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPVRQRWFGCAC 414
Query: 475 CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYL 534
C + LG IY + LYI YI +S + G+ L ++ W +
Sbjct: 415 CPPNIARVLTSLGHYIYTPHDD---ALYINLYIGNSAEIPVGNEALRLRISGNYPWQEQV 471
Query: 535 RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDK 594
++ + SS +L LR+P W + Q +LNG + +L + W D
Sbjct: 472 QIVIDSSSPVH----HTLALRLPDWC--DKPQVTLNGAPVTQDVRKGYLYISHLWQEGDT 525
Query: 595 LTIQLPLSLR 604
L + LP+ +R
Sbjct: 526 LLLTLPMPVR 535
>gi|423313151|ref|ZP_17291087.1| hypothetical protein HMPREF1058_01699 [Bacteroides vulgatus
CL09T03C04]
gi|392686365|gb|EIY79671.1| hypothetical protein HMPREF1058_01699 [Bacteroides vulgatus
CL09T03C04]
Length = 811
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 83/386 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L A F + G LS + + H PI ++G +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y T + + + TGG +R P+ + G
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + + +F T + YAD ERAL NGV+S GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G I F + +GN +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
Y+ +I S D ++ +N + WD + + +T +QE +L +R+P W
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488
Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
++++ AQA S+NG + + + W D + I LP+ +R
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548
Query: 605 -TEAIQDDRPEYASIQAILFGPYLLA 629
+ ++DDR + AI GP +
Sbjct: 549 ANDQVEDDRGKL----AIERGPIMFC 570
>gi|417243728|ref|ZP_12038126.1| putative glycosyhydrolase [Escherichia coli 9.0111]
gi|386211280|gb|EII21745.1| putative glycosyhydrolase [Escherichia coli 9.0111]
Length = 654
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|319640078|ref|ZP_07994805.1| hypothetical protein HMPREF9011_00402 [Bacteroides sp. 3_1_40A]
gi|345517097|ref|ZP_08796575.1| six-hairpin glycosidase [Bacteroides sp. 4_3_47FAA]
gi|254833866|gb|EET14175.1| six-hairpin glycosidase [Bacteroides sp. 4_3_47FAA]
gi|317388356|gb|EFV69208.1| hypothetical protein HMPREF9011_00402 [Bacteroides sp. 3_1_40A]
Length = 811
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 83/386 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L A F + G LS + + H PI ++G +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y T + + + TGG +R P+ + G
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + + +F T + YAD ERAL NGV+S GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G I F + +GN +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
Y+ +I S D ++ +N + WD + + +T +QE +L +R+P W
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488
Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
++++ AQA S+NG + + + W D + I LP+ +R
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548
Query: 605 -TEAIQDDRPEYASIQAILFGPYLLA 629
+ ++DDR + AI GP +
Sbjct: 549 ANDQVEDDRGKL----AIERGPIMFC 570
>gi|312621510|ref|YP_004023123.1| hypothetical protein Calkro_0404 [Caldicellulosiruptor
kronotskyensis 2002]
gi|312201977|gb|ADQ45304.1| protein of unknown function DUF1680 [Caldicellulosiruptor
kronotskyensis 2002]
Length = 652
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 115/515 (22%), Positives = 197/515 (38%), Gaps = 74/515 (14%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFEALK 223
V +L A++ + N +++K+ V+ + + Q + GYL+ + T E + L+
Sbjct: 81 VAKWLEAASYVLEKYPNPDLEKKVDEVIQLIGKAQWE--DGYLNTYFTIKEKGKRWTNLE 138
Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
Y H I AG ++ L++ + ++ YN ++ E
Sbjct: 139 ECHELYTAGHMIEAGCA-HFLATGKTTLLEIVKKIADHIYN-------VFGKEEGKIPGY 190
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-------------------DKPCFLGFLALQ 324
+ + L +LY +T D K+L LA F K + GF +L
Sbjct: 191 DGHPEIELALVKLYEVTGDRKYLELAKFFVDERGQEPYYFDIEYEKRGKKSHWAGFKSLG 250
Query: 325 ADYLSHFHANTHIPIVIGSQMR----YEVTGD--------PLYKLIGTFFMDIVNASHSY 372
+YL + +G +R Y D L+ + T F DIV Y
Sbjct: 251 REYLQAYRPLRQQKEAVGHAVRAVYLYSGAADVAAYTQDKELFDVCKTLFDDIVKRK-MY 309
Query: 373 ATG--GTSA--REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TG G+SA F ++ DT +E TC + ++ + L + Y D ER
Sbjct: 310 ITGAIGSSAHGEAFTFEYDLPNDTAYAE---TCASVGLIFFAHRLNKIEPHAKYYDVVER 366
Query: 429 ALTNGVLSI--QRGTEPGVMIYMLPLG---RGVSKA---RSTHGWGTKFNSFWCCYGTGI 480
AL N V+ Q G + Y+ PL + V K R + CC
Sbjct: 367 ALYNTVIGSMSQDGKK---YFYVNPLEVYPKEVEKRFDRRHVKPERQPWFGCACCPPNVA 423
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV-VLNQKVDPIVSWDPYLRMTLT 539
+ LG IY N G+Y+ YI SS + G V VL Q++ ++ +++ L
Sbjct: 424 RLLASLGRYIY---SYNHEGIYVNLYIGSSVQVEVGGVKVLLQQMSSY-PFEDIVKIDLK 479
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERWSYNDKLTIQ 598
S + L LR+P W S + +NG+ P PP ++ W ND++ ++
Sbjct: 480 PSKEARF----KLYLRIPSWCES--YEVYVNGKKEEPEEPPSGYVCIERLWKENDQVILK 533
Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+P ++ + A++ GP + +
Sbjct: 534 IPTEVKMVSSHPQVRSNVGKVAVVKGPVVFCAEEA 568
>gi|150003698|ref|YP_001298442.1| hypothetical protein BVU_1129 [Bacteroides vulgatus ATCC 8482]
gi|149932122|gb|ABR38820.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 811
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 83/386 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L A F + G LS + + H PI ++G +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y T + + + TGG +R P+ + G
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + + +F T + YAD ERAL NGV+S GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G I F + +GN +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
Y+ +I S D ++ +N + WD + + +T +QE +L +R+P W
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 488
Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
++++ AQA S+NG + + + W D + I LP+ +R
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548
Query: 605 -TEAIQDDRPEYASIQAILFGPYLLA 629
+ ++DDR + AI GP +
Sbjct: 549 ANDQVEDDRGKL----AIERGPIMFC 570
>gi|237711356|ref|ZP_04541837.1| six-hairpin glycosidase [Bacteroides sp. 9_1_42FAA]
gi|229454051|gb|EEO59772.1| six-hairpin glycosidase [Bacteroides sp. 9_1_42FAA]
Length = 806
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 83/386 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L A F + G LS + + H PI ++G +R
Sbjct: 215 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 270
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y T + + + TGG +R P+ + G
Sbjct: 271 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 323
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + + +F T + YAD ERAL NGV+S GV +
Sbjct: 324 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 376
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G I F + +GN +
Sbjct: 377 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 427
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
Y+ +I S D ++ +N + WD + + +T +QE +L +R+P W
Sbjct: 428 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWAQ 483
Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR--- 604
++++ AQA S+NG + + + W D + I LP+ +R
Sbjct: 484 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 543
Query: 605 -TEAIQDDRPEYASIQAILFGPYLLA 629
+ ++DDR + AI GP +
Sbjct: 544 ANDQVEDDRGKL----AIERGPIMFC 565
>gi|261420102|ref|YP_003253784.1| hypothetical protein GYMC61_2720 [Geobacillus sp. Y412MC61]
gi|319766914|ref|YP_004132415.1| hypothetical protein [Geobacillus sp. Y412MC52]
gi|261376559|gb|ACX79302.1| protein of unknown function DUF1680 [Geobacillus sp. Y412MC61]
gi|317111780|gb|ADU94272.1| protein of unknown function DUF1680 [Geobacillus sp. Y412MC52]
Length = 640
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 106/265 (40%), Gaps = 24/265 (9%)
Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSA----REFWWDPKRLADTLGSENEETCTTYN 405
TGD K + V Y TGG + F +D DT+ +E TC +
Sbjct: 275 TGDESLKQACQTLWENVTKRQMYITGGVGSSAFGESFTFDFDLPNDTVYTE---TCASIA 331
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG---RGVSK--AR 460
++ +R + + YAD ERAL NG +S + Y+ PL + + R
Sbjct: 332 LVFWARRMLELEMDGKYADVMERALYNGTIS-GMDLDGKRFFYVNPLEVWPKACERHDKR 390
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
K+ S CC + + IY + L++ Y+ S + G +
Sbjct: 391 HVKPVRQKWFSCACCPPNLARLIASISHYIYSQ---TSDALFVHLYVGSDIQTEMGGRSV 447
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-- 578
+ WD +R+T++ S QE +L LR+P W GA+ ++NG+N+ + P
Sbjct: 448 EIVQETNYPWDGKVRLTISPESAQEF----TLGLRIPGW--GRGAEVTINGENVDIAPLT 501
Query: 579 PGNFLSATERWSYNDKLTIQLPLSL 603
+ W D++ + P+ +
Sbjct: 502 KKGYAYIRRVWRQGDEMVLHFPMPV 526
>gi|397166966|ref|ZP_10490409.1| hypothetical protein Y71_0972 [Enterobacter radicincitans DSM
16656]
gi|396091112|gb|EJI88679.1| hypothetical protein Y71_0972 [Enterobacter radicincitans DSM
16656]
Length = 651
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 82/212 (38%), Gaps = 16/212 (7%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLEVHPKT 392
Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
R H + W CC + LG IY + LYI Y+ +S +
Sbjct: 393 LRFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTPHQD---ALYINLYVGNSIE 449
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
G VL +V W ++ + S V +L LRMP W + Q +LNG
Sbjct: 450 VPVGDKVLRLRVSGNFPWQE--KVMIAVESPLPVQH--TLALRMPDW--CDAPQVTLNGV 503
Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+ +L W D LT+ LP+ +R
Sbjct: 504 AVEKAVHKGYLHIHRLWQEGDTLTLTLPMPVR 535
>gi|331665212|ref|ZP_08366113.1| putative cytoplasmic protein [Escherichia coli TA143]
gi|432767960|ref|ZP_20002352.1| hypothetical protein A1S9_00751 [Escherichia coli KTE50]
gi|432964211|ref|ZP_20153463.1| hypothetical protein A15E_04412 [Escherichia coli KTE202]
gi|433065055|ref|ZP_20251959.1| hypothetical protein WIO_03878 [Escherichia coli KTE125]
gi|331057722|gb|EGI29708.1| putative cytoplasmic protein [Escherichia coli TA143]
gi|431321992|gb|ELG09585.1| hypothetical protein A1S9_00751 [Escherichia coli KTE50]
gi|431469844|gb|ELH49772.1| hypothetical protein A15E_04412 [Escherichia coli KTE202]
gi|431578217|gb|ELI50831.1| hypothetical protein WIO_03878 [Escherichia coli KTE125]
Length = 654
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGNSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|422768624|ref|ZP_16822348.1| hypothetical protein ERCG_03884 [Escherichia coli E1520]
gi|323934869|gb|EGB31251.1| hypothetical protein ERCG_03884 [Escherichia coli E1520]
Length = 659
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|331655213|ref|ZP_08356212.1| putative cytoplasmic protein [Escherichia coli M718]
gi|331047228|gb|EGI19306.1| putative cytoplasmic protein [Escherichia coli M718]
Length = 664
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 260 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 313
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 314 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 370
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 421
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 422 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 477
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 478 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 532
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 533 TLNLTLPMPVR 543
>gi|432720730|ref|ZP_19955692.1| hypothetical protein WCK_04370 [Escherichia coli KTE9]
gi|432794804|ref|ZP_20028883.1| hypothetical protein A1US_04040 [Escherichia coli KTE78]
gi|432796321|ref|ZP_20030359.1| hypothetical protein A1UU_01028 [Escherichia coli KTE79]
gi|431259905|gb|ELF52266.1| hypothetical protein WCK_04370 [Escherichia coli KTE9]
gi|431336741|gb|ELG23843.1| hypothetical protein A1US_04040 [Escherichia coli KTE78]
gi|431348554|gb|ELG35405.1| hypothetical protein A1UU_01028 [Escherichia coli KTE79]
Length = 654
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 83/365 (22%), Positives = 133/365 (36%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + + Y TGG
Sbjct: 252 QAHLPIAQQQTAIGHAVRFVYLMTAVAHLARLSHDESKRQDCLRLWNNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ +G +Y E LYI Y +S + + +L +V W ++T+
Sbjct: 420 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGMLRLRVSGNYPWQE--QVTIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W Q LNG+ + +L T W D L + L
Sbjct: 475 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVGQDIRKGYLHITREWQEGDTLNLTL 530
Query: 600 PLSLR 604
P+ +R
Sbjct: 531 PMPVR 535
>gi|432451832|ref|ZP_19694088.1| hypothetical protein A13W_02801 [Escherichia coli KTE193]
gi|433035497|ref|ZP_20223187.1| hypothetical protein WIC_04061 [Escherichia coli KTE112]
gi|430977578|gb|ELC94414.1| hypothetical protein A13W_02801 [Escherichia coli KTE193]
gi|431546634|gb|ELI21027.1| hypothetical protein WIC_04061 [Escherichia coli KTE112]
Length = 656
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|300920475|ref|ZP_07136906.1| conserved hypothetical protein [Escherichia coli MS 115-1]
gi|300412519|gb|EFJ95829.1| conserved hypothetical protein [Escherichia coli MS 115-1]
Length = 664
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 85/365 (23%), Positives = 133/365 (36%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259
Query: 334 NTHIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
H+P+ IG +R Y +TG D + + + Y TGG
Sbjct: 260 QAHLPLAQQQTAIGHAVRFAYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQLYITGGI 319
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 320 GSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 376
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 377 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 427
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ +G +Y E LYI Y +S + + L +V W ++T+
Sbjct: 428 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIA 482
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W Q LNG+ + +L T W D L + L
Sbjct: 483 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTL 538
Query: 600 PLSLR 604
P+ +R
Sbjct: 539 PMPVR 543
>gi|417631018|ref|ZP_12281252.1| hypothetical protein ECSTECMHI813_3969 [Escherichia coli
STEC_MHI813]
gi|345370297|gb|EGX02275.1| hypothetical protein ECSTECMHI813_3969 [Escherichia coli
STEC_MHI813]
Length = 656
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|432752040|ref|ZP_19986617.1| hypothetical protein WEQ_03462 [Escherichia coli KTE29]
gi|431293661|gb|ELF83953.1| hypothetical protein WEQ_03462 [Escherichia coli KTE29]
Length = 659
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 85/365 (23%), Positives = 133/365 (36%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
H+P+ IG +R Y +TG D + + + Y TGG
Sbjct: 252 QAHLPLAQQQTAIGHAVRFAYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ +G +Y E LYI Y +S + + L +V W ++T+
Sbjct: 420 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W Q LNG+ + +L T W D L + L
Sbjct: 475 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTL 530
Query: 600 PLSLR 604
P+ +R
Sbjct: 531 PMPVR 535
>gi|425263519|ref|ZP_18655509.1| hypothetical protein ECEC96038_4736 [Escherichia coli EC96038]
gi|408177761|gb|EKI04521.1| hypothetical protein ECEC96038_4736 [Escherichia coli EC96038]
Length = 656
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|422836105|ref|ZP_16884154.1| hypothetical protein ESOG_03755 [Escherichia coli E101]
gi|371609666|gb|EHN98200.1| hypothetical protein ESOG_03755 [Escherichia coli E101]
Length = 656
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|432855232|ref|ZP_20083284.1| hypothetical protein A1YY_03450 [Escherichia coli KTE144]
gi|431397569|gb|ELG81016.1| hypothetical protein A1YY_03450 [Escherichia coli KTE144]
Length = 654
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+PI IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPIAQQQTAIGHAVRF------VYLMTGVAHLARLSHDESKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGKLCLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|432949979|ref|ZP_20144543.1| hypothetical protein A153_04333 [Escherichia coli KTE196]
gi|433045129|ref|ZP_20232605.1| hypothetical protein WIG_03662 [Escherichia coli KTE117]
gi|431453768|gb|ELH34151.1| hypothetical protein A153_04333 [Escherichia coli KTE196]
gi|431552786|gb|ELI26734.1| hypothetical protein WIG_03662 [Escherichia coli KTE117]
Length = 659
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|416342142|ref|ZP_11676508.1| hypothetical protein ECoL_01429 [Escherichia coli EC4100B]
gi|419280237|ref|ZP_13822479.1| hypothetical protein ECDEC10E_4232 [Escherichia coli DEC10E]
gi|419347353|ref|ZP_13888721.1| hypothetical protein ECDEC13A_3946 [Escherichia coli DEC13A]
gi|419351812|ref|ZP_13893141.1| hypothetical protein ECDEC13B_3790 [Escherichia coli DEC13B]
gi|419357284|ref|ZP_13898530.1| hypothetical protein ECDEC13C_4352 [Escherichia coli DEC13C]
gi|419362259|ref|ZP_13903466.1| hypothetical protein ECDEC13D_4067 [Escherichia coli DEC13D]
gi|419367374|ref|ZP_13908523.1| hypothetical protein ECDEC13E_4123 [Escherichia coli DEC13E]
gi|419377671|ref|ZP_13918688.1| hypothetical protein ECDEC14B_4282 [Escherichia coli DEC14B]
gi|419383008|ref|ZP_13923950.1| hypothetical protein ECDEC14C_4191 [Escherichia coli DEC14C]
gi|419388306|ref|ZP_13929174.1| hypothetical protein ECDEC14D_4146 [Escherichia coli DEC14D]
gi|425424537|ref|ZP_18805687.1| hypothetical protein EC01288_3892 [Escherichia coli 0.1288]
gi|432535989|ref|ZP_19772946.1| hypothetical protein A193_04433 [Escherichia coli KTE234]
gi|432811308|ref|ZP_20045165.1| hypothetical protein A1WM_02459 [Escherichia coli KTE101]
gi|320201393|gb|EFW75974.1| hypothetical protein ECoL_01429 [Escherichia coli EC4100B]
gi|378125150|gb|EHW86553.1| hypothetical protein ECDEC10E_4232 [Escherichia coli DEC10E]
gi|378182886|gb|EHX43534.1| hypothetical protein ECDEC13A_3946 [Escherichia coli DEC13A]
gi|378195992|gb|EHX56482.1| hypothetical protein ECDEC13C_4352 [Escherichia coli DEC13C]
gi|378196853|gb|EHX57338.1| hypothetical protein ECDEC13B_3790 [Escherichia coli DEC13B]
gi|378199461|gb|EHX59926.1| hypothetical protein ECDEC13D_4067 [Escherichia coli DEC13D]
gi|378210031|gb|EHX70398.1| hypothetical protein ECDEC13E_4123 [Escherichia coli DEC13E]
gi|378215636|gb|EHX75932.1| hypothetical protein ECDEC14B_4282 [Escherichia coli DEC14B]
gi|378224949|gb|EHX85150.1| hypothetical protein ECDEC14C_4191 [Escherichia coli DEC14C]
gi|378228861|gb|EHX89012.1| hypothetical protein ECDEC14D_4146 [Escherichia coli DEC14D]
gi|408341050|gb|EKJ55523.1| hypothetical protein EC01288_3892 [Escherichia coli 0.1288]
gi|431057624|gb|ELD67052.1| hypothetical protein A193_04433 [Escherichia coli KTE234]
gi|431360470|gb|ELG47081.1| hypothetical protein A1WM_02459 [Escherichia coli KTE101]
Length = 656
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 85/365 (23%), Positives = 133/365 (36%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
H+P+ IG +R Y +TG D + + + Y TGG
Sbjct: 252 QAHLPLAQQQTAIGHAVRFAYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ +G +Y E LYI Y +S + + L +V W ++T+
Sbjct: 420 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W Q LNG+ + +L T W D L + L
Sbjct: 475 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTL 530
Query: 600 PLSLR 604
P+ +R
Sbjct: 531 PMPVR 535
>gi|15804123|ref|NP_290162.1| hypothetical protein Z5002 [Escherichia coli O157:H7 str. EDL933]
gi|15833713|ref|NP_312486.1| hypothetical protein ECs4459 [Escherichia coli O157:H7 str. Sakai]
gi|168746875|ref|ZP_02771897.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4113]
gi|168753398|ref|ZP_02778405.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4401]
gi|168759671|ref|ZP_02784678.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4501]
gi|168765993|ref|ZP_02791000.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4486]
gi|168772459|ref|ZP_02797466.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4196]
gi|168779729|ref|ZP_02804736.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4076]
gi|168797417|ref|ZP_02822424.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC508]
gi|195935108|ref|ZP_03080490.1| hypothetical protein EscherichcoliO157_01410 [Escherichia coli
O157:H7 str. EC4024]
gi|208809591|ref|ZP_03251928.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208813747|ref|ZP_03255076.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208821480|ref|ZP_03261800.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209399472|ref|YP_002273062.1| hypothetical protein ECH74115_4952 [Escherichia coli O157:H7 str.
EC4115]
gi|217324274|ref|ZP_03440358.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254795534|ref|YP_003080371.1| hypothetical protein ECSP_4573 [Escherichia coli O157:H7 str.
TW14359]
gi|291284953|ref|YP_003501771.1| hypothetical protein G2583_4318 [Escherichia coli O55:H7 str.
CB9615]
gi|387508986|ref|YP_006161242.1| hypothetical protein ECO55CA74_20675 [Escherichia coli O55:H7 str.
RM12579]
gi|387884760|ref|YP_006315062.1| hypothetical protein CDCO157_4196 [Escherichia coli Xuzhou21]
gi|416315758|ref|ZP_11659571.1| hypothetical protein ECoA_05479 [Escherichia coli O157:H7 str.
1044]
gi|416320011|ref|ZP_11662563.1| hypothetical protein ECoD_02794 [Escherichia coli O157:H7 str.
EC1212]
gi|416330228|ref|ZP_11669265.1| hypothetical protein ECF_04238 [Escherichia coli O157:H7 str. 1125]
gi|416778240|ref|ZP_11875812.1| hypothetical protein ECO5101_02775 [Escherichia coli O157:H7 str.
G5101]
gi|416789533|ref|ZP_11880657.1| hypothetical protein ECO9389_12801 [Escherichia coli O157:H- str.
493-89]
gi|416801447|ref|ZP_11885596.1| hypothetical protein ECO2687_16571 [Escherichia coli O157:H- str. H
2687]
gi|416812344|ref|ZP_11890513.1| hypothetical protein ECO7815_03795 [Escherichia coli O55:H7 str.
3256-97]
gi|416832964|ref|ZP_11900127.1| hypothetical protein ECOSU61_16870 [Escherichia coli O157:H7 str.
LSU-61]
gi|419047735|ref|ZP_13594666.1| hypothetical protein ECDEC3A_4594 [Escherichia coli DEC3A]
gi|419053393|ref|ZP_13600259.1| hypothetical protein ECDEC3B_4713 [Escherichia coli DEC3B]
gi|419059343|ref|ZP_13606144.1| hypothetical protein ECDEC3C_4952 [Escherichia coli DEC3C]
gi|419064888|ref|ZP_13611608.1| hypothetical protein ECDEC3D_4717 [Escherichia coli DEC3D]
gi|419071821|ref|ZP_13617428.1| hypothetical protein ECDEC3E_4925 [Escherichia coli DEC3E]
gi|419077685|ref|ZP_13623186.1| hypothetical protein ECDEC3F_4810 [Escherichia coli DEC3F]
gi|419082821|ref|ZP_13628266.1| hypothetical protein ECDEC4A_4458 [Escherichia coli DEC4A]
gi|419088700|ref|ZP_13634051.1| hypothetical protein ECDEC4B_4655 [Escherichia coli DEC4B]
gi|419094624|ref|ZP_13639902.1| hypothetical protein ECDEC4C_4545 [Escherichia coli DEC4C]
gi|419106234|ref|ZP_13651356.1| hypothetical protein ECDEC4E_4577 [Escherichia coli DEC4E]
gi|419111620|ref|ZP_13656671.1| hypothetical protein ECDEC4F_4462 [Escherichia coli DEC4F]
gi|419117157|ref|ZP_13662166.1| hypothetical protein ECDEC5A_4358 [Escherichia coli DEC5A]
gi|419122875|ref|ZP_13667817.1| hypothetical protein ECDEC5B_4720 [Escherichia coli DEC5B]
gi|419128272|ref|ZP_13673144.1| hypothetical protein ECDEC5C_4407 [Escherichia coli DEC5C]
gi|419133720|ref|ZP_13678547.1| hypothetical protein ECDEC5D_4501 [Escherichia coli DEC5D]
gi|419138882|ref|ZP_13683672.1| hypothetical protein ECDEC5E_4420 [Escherichia coli DEC5E]
gi|420271748|ref|ZP_14774099.1| hypothetical protein ECPA22_4925 [Escherichia coli PA22]
gi|420283060|ref|ZP_14785292.1| hypothetical protein ECTW06591_4436 [Escherichia coli TW06591]
gi|420288947|ref|ZP_14791129.1| hypothetical protein ECTW10246_5014 [Escherichia coli TW10246]
gi|420294768|ref|ZP_14796878.1| hypothetical protein ECTW11039_4946 [Escherichia coli TW11039]
gi|420300624|ref|ZP_14802667.1| hypothetical protein ECTW09109_5140 [Escherichia coli TW09109]
gi|420306468|ref|ZP_14808456.1| hypothetical protein ECTW10119_5262 [Escherichia coli TW10119]
gi|420311766|ref|ZP_14813694.1| hypothetical protein ECEC1738_4805 [Escherichia coli EC1738]
gi|420317423|ref|ZP_14819294.1| hypothetical protein ECEC1734_4754 [Escherichia coli EC1734]
gi|421814567|ref|ZP_16250269.1| hypothetical protein EC80416_4344 [Escherichia coli 8.0416]
gi|421821215|ref|ZP_16256686.1| hypothetical protein EC100821_5204 [Escherichia coli 10.0821]
gi|421833209|ref|ZP_16268489.1| hypothetical protein ECPA7_5406 [Escherichia coli PA7]
gi|423727615|ref|ZP_17701493.1| hypothetical protein ECPA31_4706 [Escherichia coli PA31]
gi|424079832|ref|ZP_17816792.1| hypothetical protein ECFDA505_4757 [Escherichia coli FDA505]
gi|424086239|ref|ZP_17822721.1| hypothetical protein ECFDA517_5074 [Escherichia coli FDA517]
gi|424099319|ref|ZP_17834587.1| hypothetical protein ECFRIK1985_5028 [Escherichia coli FRIK1985]
gi|424112173|ref|ZP_17846397.1| hypothetical protein EC93001_4874 [Escherichia coli 93-001]
gi|424118115|ref|ZP_17851944.1| hypothetical protein ECPA3_4889 [Escherichia coli PA3]
gi|424124302|ref|ZP_17857602.1| hypothetical protein ECPA5_4745 [Escherichia coli PA5]
gi|424130447|ref|ZP_17863346.1| hypothetical protein ECPA9_4920 [Escherichia coli PA9]
gi|424136776|ref|ZP_17869217.1| hypothetical protein ECPA10_5070 [Escherichia coli PA10]
gi|424143329|ref|ZP_17875187.1| hypothetical protein ECPA14_4913 [Escherichia coli PA14]
gi|424149721|ref|ZP_17881088.1| hypothetical protein ECPA15_5031 [Escherichia coli PA15]
gi|424155573|ref|ZP_17886500.1| hypothetical protein ECPA24_4636 [Escherichia coli PA24]
gi|424255558|ref|ZP_17892047.1| hypothetical protein ECPA25_4614 [Escherichia coli PA25]
gi|424334046|ref|ZP_17897955.1| hypothetical protein ECPA28_4952 [Escherichia coli PA28]
gi|424452012|ref|ZP_17903674.1| hypothetical protein ECPA32_4774 [Escherichia coli PA32]
gi|424458199|ref|ZP_17909303.1| hypothetical protein ECPA33_4772 [Escherichia coli PA33]
gi|424464678|ref|ZP_17915033.1| hypothetical protein ECPA39_4850 [Escherichia coli PA39]
gi|424477467|ref|ZP_17926776.1| hypothetical protein ECPA42_4929 [Escherichia coli PA42]
gi|424483230|ref|ZP_17932202.1| hypothetical protein ECTW07945_4773 [Escherichia coli TW07945]
gi|424489411|ref|ZP_17937952.1| hypothetical protein ECTW09098_4852 [Escherichia coli TW09098]
gi|424502761|ref|ZP_17949642.1| hypothetical protein ECEC4203_4848 [Escherichia coli EC4203]
gi|424509021|ref|ZP_17955394.1| hypothetical protein ECEC4196_4900 [Escherichia coli EC4196]
gi|424516380|ref|ZP_17960994.1| hypothetical protein ECTW14313_4703 [Escherichia coli TW14313]
gi|424522562|ref|ZP_17966668.1| hypothetical protein ECTW14301_4626 [Escherichia coli TW14301]
gi|424528439|ref|ZP_17972147.1| hypothetical protein ECEC4421_4687 [Escherichia coli EC4421]
gi|424534588|ref|ZP_17977927.1| hypothetical protein ECEC4422_4816 [Escherichia coli EC4422]
gi|424540646|ref|ZP_17983581.1| hypothetical protein ECEC4013_4958 [Escherichia coli EC4013]
gi|424546791|ref|ZP_17989143.1| hypothetical protein ECEC4402_4833 [Escherichia coli EC4402]
gi|424552999|ref|ZP_17994833.1| hypothetical protein ECEC4439_4786 [Escherichia coli EC4439]
gi|424559188|ref|ZP_18000588.1| hypothetical protein ECEC4436_4739 [Escherichia coli EC4436]
gi|424565524|ref|ZP_18006519.1| hypothetical protein ECEC4437_4898 [Escherichia coli EC4437]
gi|424571655|ref|ZP_18012193.1| hypothetical protein ECEC4448_4800 [Escherichia coli EC4448]
gi|424577810|ref|ZP_18017853.1| hypothetical protein ECEC1845_4760 [Escherichia coli EC1845]
gi|424583627|ref|ZP_18023264.1| hypothetical protein ECEC1863_4496 [Escherichia coli EC1863]
gi|425100295|ref|ZP_18503019.1| hypothetical protein EC34870_4837 [Escherichia coli 3.4870]
gi|425106397|ref|ZP_18508705.1| hypothetical protein EC52239_4795 [Escherichia coli 5.2239]
gi|425112407|ref|ZP_18514320.1| hypothetical protein EC60172_4950 [Escherichia coli 6.0172]
gi|425128335|ref|ZP_18529494.1| hypothetical protein EC80586_5102 [Escherichia coli 8.0586]
gi|425134077|ref|ZP_18534919.1| hypothetical protein EC82524_4714 [Escherichia coli 8.2524]
gi|425140695|ref|ZP_18541067.1| hypothetical protein EC100833_5124 [Escherichia coli 10.0833]
gi|425146362|ref|ZP_18546346.1| hypothetical protein EC100869_4617 [Escherichia coli 10.0869]
gi|425152482|ref|ZP_18552087.1| hypothetical protein EC880221_4757 [Escherichia coli 88.0221]
gi|425158354|ref|ZP_18557610.1| hypothetical protein ECPA34_4912 [Escherichia coli PA34]
gi|425164699|ref|ZP_18563578.1| hypothetical protein ECFDA506_5110 [Escherichia coli FDA506]
gi|425170445|ref|ZP_18568910.1| hypothetical protein ECFDA507_4850 [Escherichia coli FDA507]
gi|425176495|ref|ZP_18574606.1| hypothetical protein ECFDA504_4775 [Escherichia coli FDA504]
gi|425188821|ref|ZP_18586085.1| hypothetical protein ECFRIK1997_5034 [Escherichia coli FRIK1997]
gi|425202058|ref|ZP_18598257.1| hypothetical protein ECNE037_5169 [Escherichia coli NE037]
gi|425214195|ref|ZP_18609587.1| hypothetical protein ECPA4_4926 [Escherichia coli PA4]
gi|425220319|ref|ZP_18615273.1| hypothetical protein ECPA23_4795 [Escherichia coli PA23]
gi|425226960|ref|ZP_18621418.1| hypothetical protein ECPA49_5019 [Escherichia coli PA49]
gi|425233121|ref|ZP_18627153.1| hypothetical protein ECPA45_4973 [Escherichia coli PA45]
gi|425239047|ref|ZP_18632758.1| hypothetical protein ECTT12B_4670 [Escherichia coli TT12B]
gi|425257257|ref|ZP_18649759.1| hypothetical protein ECCB7326_4836 [Escherichia coli CB7326]
gi|425269512|ref|ZP_18661133.1| hypothetical protein EC5412_4766 [Escherichia coli 5412]
gi|425296972|ref|ZP_18687122.1| hypothetical protein ECPA38_4622 [Escherichia coli PA38]
gi|425313655|ref|ZP_18702824.1| hypothetical protein ECEC1735_4763 [Escherichia coli EC1735]
gi|425319635|ref|ZP_18708414.1| hypothetical protein ECEC1736_4708 [Escherichia coli EC1736]
gi|425325746|ref|ZP_18714090.1| hypothetical protein ECEC1737_4710 [Escherichia coli EC1737]
gi|425332099|ref|ZP_18719925.1| hypothetical protein ECEC1846_4816 [Escherichia coli EC1846]
gi|425338276|ref|ZP_18725622.1| hypothetical protein ECEC1847_4840 [Escherichia coli EC1847]
gi|425344593|ref|ZP_18731474.1| hypothetical protein ECEC1848_4958 [Escherichia coli EC1848]
gi|425350429|ref|ZP_18736886.1| hypothetical protein ECEC1849_4723 [Escherichia coli EC1849]
gi|425356701|ref|ZP_18742759.1| hypothetical protein ECEC1850_4950 [Escherichia coli EC1850]
gi|425362661|ref|ZP_18748298.1| hypothetical protein ECEC1856_4773 [Escherichia coli EC1856]
gi|425368889|ref|ZP_18753993.1| hypothetical protein ECEC1862_4786 [Escherichia coli EC1862]
gi|425375193|ref|ZP_18759826.1| hypothetical protein ECEC1864_4920 [Escherichia coli EC1864]
gi|425388083|ref|ZP_18771633.1| hypothetical protein ECEC1866_4680 [Escherichia coli EC1866]
gi|425394775|ref|ZP_18777875.1| hypothetical protein ECEC1868_4993 [Escherichia coli EC1868]
gi|425400871|ref|ZP_18783568.1| hypothetical protein ECEC1869_4939 [Escherichia coli EC1869]
gi|425406963|ref|ZP_18789176.1| hypothetical protein ECEC1870_4738 [Escherichia coli EC1870]
gi|425413349|ref|ZP_18795102.1| hypothetical protein ECNE098_4927 [Escherichia coli NE098]
gi|425419660|ref|ZP_18800921.1| hypothetical protein ECFRIK523_4776 [Escherichia coli FRIK523]
gi|425430935|ref|ZP_18811535.1| hypothetical protein EC01304_4902 [Escherichia coli 0.1304]
gi|428955440|ref|ZP_19027224.1| hypothetical protein EC881042_4795 [Escherichia coli 88.1042]
gi|428961439|ref|ZP_19032721.1| hypothetical protein EC890511_4747 [Escherichia coli 89.0511]
gi|428968048|ref|ZP_19038750.1| hypothetical protein EC900091_5143 [Escherichia coli 90.0091]
gi|428980186|ref|ZP_19049993.1| hypothetical protein EC902281_4755 [Escherichia coli 90.2281]
gi|428985972|ref|ZP_19055354.1| hypothetical protein EC930055_4673 [Escherichia coli 93.0055]
gi|428992156|ref|ZP_19061135.1| hypothetical protein EC930056_4727 [Escherichia coli 93.0056]
gi|428998047|ref|ZP_19066631.1| hypothetical protein EC940618_4635 [Escherichia coli 94.0618]
gi|429010405|ref|ZP_19077843.1| hypothetical protein EC951288_4497 [Escherichia coli 95.1288]
gi|429016933|ref|ZP_19083806.1| hypothetical protein EC950943_4910 [Escherichia coli 95.0943]
gi|429022675|ref|ZP_19089186.1| hypothetical protein EC960428_4625 [Escherichia coli 96.0428]
gi|429028846|ref|ZP_19094826.1| hypothetical protein EC960427_4802 [Escherichia coli 96.0427]
gi|429041099|ref|ZP_19106187.1| hypothetical protein EC960932_4872 [Escherichia coli 96.0932]
gi|429046954|ref|ZP_19111657.1| hypothetical protein EC960107_4684 [Escherichia coli 96.0107]
gi|429052309|ref|ZP_19116869.1| hypothetical protein EC970003_4427 [Escherichia coli 97.0003]
gi|429057821|ref|ZP_19122084.1| hypothetical protein EC971742_4298 [Escherichia coli 97.1742]
gi|429063366|ref|ZP_19127341.1| hypothetical protein EC970007_4185 [Escherichia coli 97.0007]
gi|429070723|ref|ZP_19134102.1| hypothetical protein EC990678_5252 [Escherichia coli 99.0678]
gi|429081416|ref|ZP_19144532.1| hypothetical protein EC990713_5250 [Escherichia coli 99.0713]
gi|429828751|ref|ZP_19359758.1| hypothetical protein EC960109_4876 [Escherichia coli 96.0109]
gi|429835191|ref|ZP_19365469.1| hypothetical protein EC970010_4835 [Escherichia coli 97.0010]
gi|444927256|ref|ZP_21246521.1| hypothetical protein EC09BKT78844_4882 [Escherichia coli
09BKT078844]
gi|444932846|ref|ZP_21251863.1| hypothetical protein EC990814_4226 [Escherichia coli 99.0814]
gi|444938322|ref|ZP_21257070.1| hypothetical protein EC990815_4259 [Escherichia coli 99.0815]
gi|444943914|ref|ZP_21262410.1| hypothetical protein EC990816_4314 [Escherichia coli 99.0816]
gi|444949405|ref|ZP_21267701.1| hypothetical protein EC990839_4245 [Escherichia coli 99.0839]
gi|444955079|ref|ZP_21273151.1| hypothetical protein EC990848_4357 [Escherichia coli 99.0848]
gi|444960466|ref|ZP_21278295.1| hypothetical protein EC991753_4289 [Escherichia coli 99.1753]
gi|444965679|ref|ZP_21283249.1| hypothetical protein EC991775_4151 [Escherichia coli 99.1775]
gi|444971675|ref|ZP_21289020.1| hypothetical protein EC991793_4596 [Escherichia coli 99.1793]
gi|444976975|ref|ZP_21294065.1| hypothetical protein EC991805_4182 [Escherichia coli 99.1805]
gi|444982346|ref|ZP_21299247.1| hypothetical protein ECATCC700728_4176 [Escherichia coli ATCC
700728]
gi|444988560|ref|ZP_21305317.1| hypothetical protein ECPA11_5184 [Escherichia coli PA11]
gi|444993068|ref|ZP_21309704.1| hypothetical protein ECPA19_4335 [Escherichia coli PA19]
gi|444998301|ref|ZP_21314794.1| hypothetical protein ECPA13_4095 [Escherichia coli PA13]
gi|445004788|ref|ZP_21321157.1| hypothetical protein ECPA2_5358 [Escherichia coli PA2]
gi|445004922|ref|ZP_21321282.1| hypothetical protein ECPA47_5269 [Escherichia coli PA47]
gi|445015398|ref|ZP_21331479.1| hypothetical protein ECPA48_5173 [Escherichia coli PA48]
gi|445015754|ref|ZP_21331819.1| hypothetical protein ECPA8_5306 [Escherichia coli PA8]
gi|445021071|ref|ZP_21337012.1| hypothetical protein EC71982_5249 [Escherichia coli 7.1982]
gi|445028321|ref|ZP_21344063.1| hypothetical protein EC991781_1757 [Escherichia coli 99.1781]
gi|445031935|ref|ZP_21347574.1| hypothetical protein EC991762_5373 [Escherichia coli 99.1762]
gi|445042200|ref|ZP_21357565.1| hypothetical protein ECPA35_4500 [Escherichia coli PA35]
gi|445043905|ref|ZP_21359240.1| hypothetical protein EC34880_0888 [Escherichia coli 3.4880]
gi|445052978|ref|ZP_21367995.1| hypothetical protein EC950083_4258 [Escherichia coli 95.0083]
gi|445061011|ref|ZP_21373522.1| hypothetical protein EC990670_4482 [Escherichia coli 99.0670]
gi|452968310|ref|ZP_21966537.1| hypothetical protein EC4009_RS06445 [Escherichia coli O157:H7 str.
EC4009]
gi|12518318|gb|AAG58726.1|AE005584_8 orf; hypothetical protein [Escherichia coli O157:H7 str. EDL933]
gi|13363934|dbj|BAB37882.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|187771563|gb|EDU35407.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4196]
gi|188018366|gb|EDU56488.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4113]
gi|189002301|gb|EDU71287.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4076]
gi|189358833|gb|EDU77252.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4401]
gi|189364486|gb|EDU82905.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4486]
gi|189369459|gb|EDU87875.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4501]
gi|189380134|gb|EDU98550.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC508]
gi|208729392|gb|EDZ78993.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208735024|gb|EDZ83711.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208741603|gb|EDZ89285.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209160872|gb|ACI38305.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4115]
gi|217320495|gb|EEC28919.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254594934|gb|ACT74295.1| hypothetical protein ECSP_4573 [Escherichia coli O157:H7 str.
TW14359]
gi|290764826|gb|ADD58787.1| hypothetical protein G2583_4318 [Escherichia coli O55:H7 str.
CB9615]
gi|320191367|gb|EFW66017.1| hypothetical protein ECoD_02794 [Escherichia coli O157:H7 str.
EC1212]
gi|320639897|gb|EFX09491.1| hypothetical protein ECO5101_02775 [Escherichia coli O157:H7 str.
G5101]
gi|320645061|gb|EFX14085.1| hypothetical protein ECO9389_12801 [Escherichia coli O157:H- str.
493-89]
gi|320650327|gb|EFX18810.1| hypothetical protein ECO2687_16571 [Escherichia coli O157:H- str. H
2687]
gi|320655901|gb|EFX23824.1| hypothetical protein ECO7815_03795 [Escherichia coli O55:H7 str.
3256-97 TW 07815]
gi|320666706|gb|EFX33689.1| hypothetical protein ECOSU61_16870 [Escherichia coli O157:H7 str.
LSU-61]
gi|326337419|gb|EGD61254.1| hypothetical protein ECoA_05479 [Escherichia coli O157:H7 str.
1044]
gi|326339944|gb|EGD63751.1| hypothetical protein ECF_04238 [Escherichia coli O157:H7 str. 1125]
gi|374360980|gb|AEZ42687.1| hypothetical protein ECO55CA74_20675 [Escherichia coli O55:H7 str.
RM12579]
gi|377889685|gb|EHU54145.1| hypothetical protein ECDEC3A_4594 [Escherichia coli DEC3A]
gi|377889783|gb|EHU54242.1| hypothetical protein ECDEC3B_4713 [Escherichia coli DEC3B]
gi|377903272|gb|EHU67570.1| hypothetical protein ECDEC3C_4952 [Escherichia coli DEC3C]
gi|377907386|gb|EHU71622.1| hypothetical protein ECDEC3D_4717 [Escherichia coli DEC3D]
gi|377908341|gb|EHU72558.1| hypothetical protein ECDEC3E_4925 [Escherichia coli DEC3E]
gi|377918108|gb|EHU82161.1| hypothetical protein ECDEC3F_4810 [Escherichia coli DEC3F]
gi|377924259|gb|EHU88215.1| hypothetical protein ECDEC4A_4458 [Escherichia coli DEC4A]
gi|377927762|gb|EHU91677.1| hypothetical protein ECDEC4B_4655 [Escherichia coli DEC4B]
gi|377939056|gb|EHV02814.1| hypothetical protein ECDEC4C_4545 [Escherichia coli DEC4C]
gi|377944467|gb|EHV08170.1| hypothetical protein ECDEC4E_4577 [Escherichia coli DEC4E]
gi|377954643|gb|EHV18202.1| hypothetical protein ECDEC4F_4462 [Escherichia coli DEC4F]
gi|377957760|gb|EHV21288.1| hypothetical protein ECDEC5A_4358 [Escherichia coli DEC5A]
gi|377962943|gb|EHV26395.1| hypothetical protein ECDEC5B_4720 [Escherichia coli DEC5B]
gi|377970279|gb|EHV33643.1| hypothetical protein ECDEC5C_4407 [Escherichia coli DEC5C]
gi|377972443|gb|EHV35793.1| hypothetical protein ECDEC5D_4501 [Escherichia coli DEC5D]
gi|377981006|gb|EHV44266.1| hypothetical protein ECDEC5E_4420 [Escherichia coli DEC5E]
gi|386798218|gb|AFJ31252.1| hypothetical protein CDCO157_4196 [Escherichia coli Xuzhou21]
gi|390639210|gb|EIN18690.1| hypothetical protein ECFDA505_4757 [Escherichia coli FDA505]
gi|390639622|gb|EIN19093.1| hypothetical protein ECFDA517_5074 [Escherichia coli FDA517]
gi|390657072|gb|EIN34899.1| hypothetical protein ECFRIK1985_5028 [Escherichia coli FRIK1985]
gi|390657374|gb|EIN35192.1| hypothetical protein EC93001_4874 [Escherichia coli 93-001]
gi|390674723|gb|EIN50894.1| hypothetical protein ECPA3_4889 [Escherichia coli PA3]
gi|390678199|gb|EIN54182.1| hypothetical protein ECPA5_4745 [Escherichia coli PA5]
gi|390682075|gb|EIN57859.1| hypothetical protein ECPA9_4920 [Escherichia coli PA9]
gi|390693074|gb|EIN67718.1| hypothetical protein ECPA10_5070 [Escherichia coli PA10]
gi|390697368|gb|EIN71789.1| hypothetical protein ECPA14_4913 [Escherichia coli PA14]
gi|390698263|gb|EIN72649.1| hypothetical protein ECPA15_5031 [Escherichia coli PA15]
gi|390712206|gb|EIN85163.1| hypothetical protein ECPA22_4925 [Escherichia coli PA22]
gi|390719137|gb|EIN91871.1| hypothetical protein ECPA25_4614 [Escherichia coli PA25]
gi|390720026|gb|EIN92739.1| hypothetical protein ECPA24_4636 [Escherichia coli PA24]
gi|390725222|gb|EIN97742.1| hypothetical protein ECPA28_4952 [Escherichia coli PA28]
gi|390738126|gb|EIO09345.1| hypothetical protein ECPA31_4706 [Escherichia coli PA31]
gi|390738929|gb|EIO10125.1| hypothetical protein ECPA32_4774 [Escherichia coli PA32]
gi|390742351|gb|EIO13360.1| hypothetical protein ECPA33_4772 [Escherichia coli PA33]
gi|390761275|gb|EIO30571.1| hypothetical protein ECPA39_4850 [Escherichia coli PA39]
gi|390765920|gb|EIO35069.1| hypothetical protein ECPA42_4929 [Escherichia coli PA42]
gi|390779851|gb|EIO47565.1| hypothetical protein ECTW06591_4436 [Escherichia coli TW06591]
gi|390786558|gb|EIO54065.1| hypothetical protein ECTW07945_4773 [Escherichia coli TW07945]
gi|390787899|gb|EIO55372.1| hypothetical protein ECTW10246_5014 [Escherichia coli TW10246]
gi|390793629|gb|EIO60962.1| hypothetical protein ECTW11039_4946 [Escherichia coli TW11039]
gi|390801428|gb|EIO68486.1| hypothetical protein ECTW09098_4852 [Escherichia coli TW09098]
gi|390804995|gb|EIO71943.1| hypothetical protein ECTW09109_5140 [Escherichia coli TW09109]
gi|390814183|gb|EIO80763.1| hypothetical protein ECTW10119_5262 [Escherichia coli TW10119]
gi|390823323|gb|EIO89388.1| hypothetical protein ECEC4203_4848 [Escherichia coli EC4203]
gi|390828114|gb|EIO93799.1| hypothetical protein ECEC4196_4900 [Escherichia coli EC4196]
gi|390841966|gb|EIP05848.1| hypothetical protein ECTW14313_4703 [Escherichia coli TW14313]
gi|390843557|gb|EIP07344.1| hypothetical protein ECTW14301_4626 [Escherichia coli TW14301]
gi|390848287|gb|EIP11762.1| hypothetical protein ECEC4421_4687 [Escherichia coli EC4421]
gi|390858717|gb|EIP21090.1| hypothetical protein ECEC4422_4816 [Escherichia coli EC4422]
gi|390863135|gb|EIP25287.1| hypothetical protein ECEC4013_4958 [Escherichia coli EC4013]
gi|390867335|gb|EIP29163.1| hypothetical protein ECEC4402_4833 [Escherichia coli EC4402]
gi|390875728|gb|EIP36731.1| hypothetical protein ECEC4439_4786 [Escherichia coli EC4439]
gi|390881173|gb|EIP41787.1| hypothetical protein ECEC4436_4739 [Escherichia coli EC4436]
gi|390890973|gb|EIP50619.1| hypothetical protein ECEC4437_4898 [Escherichia coli EC4437]
gi|390892686|gb|EIP52258.1| hypothetical protein ECEC4448_4800 [Escherichia coli EC4448]
gi|390898319|gb|EIP57592.1| hypothetical protein ECEC1738_4805 [Escherichia coli EC1738]
gi|390906250|gb|EIP65153.1| hypothetical protein ECEC1734_4754 [Escherichia coli EC1734]
gi|390916344|gb|EIP74812.1| hypothetical protein ECEC1863_4496 [Escherichia coli EC1863]
gi|390916988|gb|EIP75422.1| hypothetical protein ECEC1845_4760 [Escherichia coli EC1845]
gi|408062465|gb|EKG96971.1| hypothetical protein ECPA7_5406 [Escherichia coli PA7]
gi|408066781|gb|EKH01227.1| hypothetical protein ECPA34_4912 [Escherichia coli PA34]
gi|408077084|gb|EKH11298.1| hypothetical protein ECFDA506_5110 [Escherichia coli FDA506]
gi|408080700|gb|EKH14758.1| hypothetical protein ECFDA507_4850 [Escherichia coli FDA507]
gi|408088919|gb|EKH22258.1| hypothetical protein ECFDA504_4775 [Escherichia coli FDA504]
gi|408101414|gb|EKH33866.1| hypothetical protein ECFRIK1997_5034 [Escherichia coli FRIK1997]
gi|408112898|gb|EKH44512.1| hypothetical protein ECNE037_5169 [Escherichia coli NE037]
gi|408125331|gb|EKH55940.1| hypothetical protein ECPA4_4926 [Escherichia coli PA4]
gi|408135214|gb|EKH65012.1| hypothetical protein ECPA23_4795 [Escherichia coli PA23]
gi|408137363|gb|EKH67065.1| hypothetical protein ECPA49_5019 [Escherichia coli PA49]
gi|408144386|gb|EKH73624.1| hypothetical protein ECPA45_4973 [Escherichia coli PA45]
gi|408152571|gb|EKH81000.1| hypothetical protein ECTT12B_4670 [Escherichia coli TT12B]
gi|408171077|gb|EKH98219.1| hypothetical protein ECCB7326_4836 [Escherichia coli CB7326]
gi|408180941|gb|EKI07530.1| hypothetical protein EC5412_4766 [Escherichia coli 5412]
gi|408214152|gb|EKI38607.1| hypothetical protein ECPA38_4622 [Escherichia coli PA38]
gi|408224415|gb|EKI48128.1| hypothetical protein ECEC1735_4763 [Escherichia coli EC1735]
gi|408235748|gb|EKI58682.1| hypothetical protein ECEC1736_4708 [Escherichia coli EC1736]
gi|408239233|gb|EKI61987.1| hypothetical protein ECEC1737_4710 [Escherichia coli EC1737]
gi|408244183|gb|EKI66641.1| hypothetical protein ECEC1846_4816 [Escherichia coli EC1846]
gi|408252867|gb|EKI74491.1| hypothetical protein ECEC1847_4840 [Escherichia coli EC1847]
gi|408256804|gb|EKI78168.1| hypothetical protein ECEC1848_4958 [Escherichia coli EC1848]
gi|408263244|gb|EKI84109.1| hypothetical protein ECEC1849_4723 [Escherichia coli EC1849]
gi|408271922|gb|EKI92038.1| hypothetical protein ECEC1850_4950 [Escherichia coli EC1850]
gi|408274623|gb|EKI94619.1| hypothetical protein ECEC1856_4773 [Escherichia coli EC1856]
gi|408283205|gb|EKJ02419.1| hypothetical protein ECEC1862_4786 [Escherichia coli EC1862]
gi|408289130|gb|EKJ07907.1| hypothetical protein ECEC1864_4920 [Escherichia coli EC1864]
gi|408304578|gb|EKJ22002.1| hypothetical protein ECEC1868_4993 [Escherichia coli EC1868]
gi|408305359|gb|EKJ22756.1| hypothetical protein ECEC1866_4680 [Escherichia coli EC1866]
gi|408316515|gb|EKJ32784.1| hypothetical protein ECEC1869_4939 [Escherichia coli EC1869]
gi|408321867|gb|EKJ37871.1| hypothetical protein ECEC1870_4738 [Escherichia coli EC1870]
gi|408324176|gb|EKJ40122.1| hypothetical protein ECNE098_4927 [Escherichia coli NE098]
gi|408334438|gb|EKJ49326.1| hypothetical protein ECFRIK523_4776 [Escherichia coli FRIK523]
gi|408343399|gb|EKJ57802.1| hypothetical protein EC01304_4902 [Escherichia coli 0.1304]
gi|408545930|gb|EKK23352.1| hypothetical protein EC52239_4795 [Escherichia coli 5.2239]
gi|408546745|gb|EKK24159.1| hypothetical protein EC34870_4837 [Escherichia coli 3.4870]
gi|408547047|gb|EKK24447.1| hypothetical protein EC60172_4950 [Escherichia coli 6.0172]
gi|408564499|gb|EKK40604.1| hypothetical protein EC80586_5102 [Escherichia coli 8.0586]
gi|408576191|gb|EKK51804.1| hypothetical protein EC100833_5124 [Escherichia coli 10.0833]
gi|408579122|gb|EKK54601.1| hypothetical protein EC82524_4714 [Escherichia coli 8.2524]
gi|408588994|gb|EKK63538.1| hypothetical protein EC100869_4617 [Escherichia coli 10.0869]
gi|408594205|gb|EKK68496.1| hypothetical protein EC880221_4757 [Escherichia coli 88.0221]
gi|408599378|gb|EKK73290.1| hypothetical protein EC80416_4344 [Escherichia coli 8.0416]
gi|408606541|gb|EKK79968.1| hypothetical protein EC100821_5204 [Escherichia coli 10.0821]
gi|427201963|gb|EKV72321.1| hypothetical protein EC881042_4795 [Escherichia coli 88.1042]
gi|427202497|gb|EKV72822.1| hypothetical protein EC890511_4747 [Escherichia coli 89.0511]
gi|427218432|gb|EKV87442.1| hypothetical protein EC900091_5143 [Escherichia coli 90.0091]
gi|427221712|gb|EKV90524.1| hypothetical protein EC902281_4755 [Escherichia coli 90.2281]
gi|427238946|gb|EKW06445.1| hypothetical protein EC930056_4727 [Escherichia coli 93.0056]
gi|427239084|gb|EKW06577.1| hypothetical protein EC930055_4673 [Escherichia coli 93.0055]
gi|427243369|gb|EKW10745.1| hypothetical protein EC940618_4635 [Escherichia coli 94.0618]
gi|427258569|gb|EKW24654.1| hypothetical protein EC950943_4910 [Escherichia coli 95.0943]
gi|427260727|gb|EKW26692.1| hypothetical protein EC951288_4497 [Escherichia coli 95.1288]
gi|427273802|gb|EKW38469.1| hypothetical protein EC960428_4625 [Escherichia coli 96.0428]
gi|427276260|gb|EKW40835.1| hypothetical protein EC960427_4802 [Escherichia coli 96.0427]
gi|427289537|gb|EKW53075.1| hypothetical protein EC960932_4872 [Escherichia coli 96.0932]
gi|427296261|gb|EKW59321.1| hypothetical protein EC960107_4684 [Escherichia coli 96.0107]
gi|427298383|gb|EKW61393.1| hypothetical protein EC970003_4427 [Escherichia coli 97.0003]
gi|427308631|gb|EKW70996.1| hypothetical protein EC971742_4298 [Escherichia coli 97.1742]
gi|427311712|gb|EKW73893.1| hypothetical protein EC970007_4185 [Escherichia coli 97.0007]
gi|427324889|gb|EKW86347.1| hypothetical protein EC990713_5250 [Escherichia coli 99.0713]
gi|427336056|gb|EKW97058.1| hypothetical protein EC990678_5252 [Escherichia coli 99.0678]
gi|429251455|gb|EKY36050.1| hypothetical protein EC960109_4876 [Escherichia coli 96.0109]
gi|429252515|gb|EKY37047.1| hypothetical protein EC970010_4835 [Escherichia coli 97.0010]
gi|444535665|gb|ELV15735.1| hypothetical protein EC990814_4226 [Escherichia coli 99.0814]
gi|444536994|gb|ELV16959.1| hypothetical protein EC09BKT78844_4882 [Escherichia coli
09BKT078844]
gi|444545831|gb|ELV24637.1| hypothetical protein EC990815_4259 [Escherichia coli 99.0815]
gi|444555151|gb|ELV32633.1| hypothetical protein EC990839_4245 [Escherichia coli 99.0839]
gi|444555319|gb|ELV32789.1| hypothetical protein EC990816_4314 [Escherichia coli 99.0816]
gi|444560365|gb|ELV37532.1| hypothetical protein EC990848_4357 [Escherichia coli 99.0848]
gi|444569733|gb|ELV46300.1| hypothetical protein EC991753_4289 [Escherichia coli 99.1753]
gi|444573453|gb|ELV49819.1| hypothetical protein EC991775_4151 [Escherichia coli 99.1775]
gi|444577174|gb|ELV53320.1| hypothetical protein EC991793_4596 [Escherichia coli 99.1793]
gi|444588184|gb|ELV63570.1| hypothetical protein ECPA11_5184 [Escherichia coli PA11]
gi|444589994|gb|ELV65310.1| hypothetical protein EC991805_4182 [Escherichia coli 99.1805]
gi|444590079|gb|ELV65394.1| hypothetical protein ECATCC700728_4176 [Escherichia coli ATCC
700728]
gi|444604008|gb|ELV78694.1| hypothetical protein ECPA13_4095 [Escherichia coli PA13]
gi|444604410|gb|ELV79084.1| hypothetical protein ECPA19_4335 [Escherichia coli PA19]
gi|444611225|gb|ELV85574.1| hypothetical protein ECPA2_5358 [Escherichia coli PA2]
gi|444618641|gb|ELV92715.1| hypothetical protein ECPA48_5173 [Escherichia coli PA48]
gi|444634620|gb|ELW08085.1| hypothetical protein ECPA47_5269 [Escherichia coli PA47]
gi|444639829|gb|ELW13128.1| hypothetical protein ECPA8_5306 [Escherichia coli PA8]
gi|444646552|gb|ELW19556.1| hypothetical protein EC991781_1757 [Escherichia coli 99.1781]
gi|444649874|gb|ELW22742.1| hypothetical protein EC71982_5249 [Escherichia coli 7.1982]
gi|444652152|gb|ELW24923.1| hypothetical protein ECPA35_4500 [Escherichia coli PA35]
gi|444655466|gb|ELW28079.1| hypothetical protein EC991762_5373 [Escherichia coli 99.1762]
gi|444660513|gb|ELW32876.1| hypothetical protein EC950083_4258 [Escherichia coli 95.0083]
gi|444666637|gb|ELW38700.1| hypothetical protein EC34880_0888 [Escherichia coli 3.4880]
gi|444667586|gb|ELW39621.1| hypothetical protein EC990670_4482 [Escherichia coli 99.0670]
Length = 656
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|332666559|ref|YP_004449347.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332335373|gb|AEE52474.1| protein of unknown function DUF1680 [Haliscomenobacter hydrossis
DSM 1100]
Length = 656
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/410 (22%), Positives = 159/410 (38%), Gaps = 76/410 (18%)
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLA--LQADYLSHFHANT 335
HW + ++E + L ++Y +T+D + L +H + G+ D+ +A
Sbjct: 198 HWVTGHQE---LELALVKVYQVTNDKRFLDFSHWLLEERGHGYAHGYTWTDWKDTAYAQD 254
Query: 336 HIPI-----VIGSQMRY-----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTS 378
P+ + G +R TGD Y K + T + D+V + Y TGG
Sbjct: 255 IKPVSLTTEITGHAVRAMYLYTGAADVAAYTGDESYLKAMNTVWDDVVE-RNMYITGGIG 313
Query: 379 AREFWWDPKRLADTLGSENE----ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
+ + + NE ETC + M+ ++ + R T + + D E++L NG
Sbjct: 314 SSG---SNEGFSKDYDLPNERAYCETCASVGMVFWNQRMNRLTGQTKFIDVLEKSLYNGA 370
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSI 490
L G+ + G A S GT F W CC + LGD I
Sbjct: 371 LD-------GLSLAGDRFFYGNPLASS----GTHFRREWFGTACCPSNIARLIASLGDYI 419
Query: 491 YFEEEGNVPGLYIIQYISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQ 548
Y + ++ Y+ ++ S + D G V + Q+ + W +++T+ E Q
Sbjct: 420 YASDPQSI---YVNLFVGSNTTIDLAKGKVEIRQETE--YPWKGLIKLTVN----PEKAQ 470
Query: 549 LSSLNLRMPVWTYSN-GAQA---------------SLNGQNLPLPPPGNFLSATERWSYN 592
+L +R+P W N GA A +NGQ L +L W+
Sbjct: 471 SFALKIRLPGWAKGNPGAGALYKFLDEGPTNFATLKVNGQAQNLKLDNGYLIVERNWNKG 530
Query: 593 DKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAG--HTSGEWDI 638
D + + L + +R +D+ + + A+ GP Y + G H W++
Sbjct: 531 DVVELNLAMPIRRVVARDEVKDNENRMALQRGPLVYCVEGVDHNGSAWNL 580
>gi|432672680|ref|ZP_19908201.1| hypothetical protein A1Y7_04240 [Escherichia coli KTE119]
gi|431207880|gb|ELF06125.1| hypothetical protein A1Y7_04240 [Escherichia coli KTE119]
Length = 656
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 85/365 (23%), Positives = 133/365 (36%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
H+P+ IG +R Y +TG D + + + Y TGG
Sbjct: 252 QAHLPLAQQQTAIGHAVRFAYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ +G +Y E LYI Y +S + + L +V W ++T+
Sbjct: 420 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W Q LNG+ + +L T W D L + L
Sbjct: 475 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTL 530
Query: 600 PLSLR 604
P+ +R
Sbjct: 531 PMPVR 535
>gi|397691075|ref|YP_006528329.1| six-hairpin glycosidase [Melioribacter roseus P3M]
gi|395812567|gb|AFN75316.1| six-hairpin glycosidase [Melioribacter roseus P3M]
Length = 643
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 90/385 (23%), Positives = 142/385 (36%), Gaps = 67/385 (17%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF----HANTHIPIV-----IGS 343
L +LY IT +++ LA F L ++ D +H +A HIP+V +G
Sbjct: 219 LIKLYQITGKKEYMELAKFF--------LDIRGDSTTHKLYGEYAQDHIPLVEQKEAVGH 270
Query: 344 QMR----YEVTGD--------PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
+R Y D K + T + ++VN +Y TGG AR D + D
Sbjct: 271 AVRALYMYAAMTDIAVLHDDEDYRKAVFTLWDNVVN-KKTYITGGLGARH---DGEAFGD 326
Query: 392 TLGSEN----EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
N ETC + + LF T + YAD ER L NG++S +
Sbjct: 327 DYELPNLTAYGETCAAIGSVYWNYRLFEMTGDSKYADVIERTLYNGLIS-GISLDGKNFF 385
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
Y PL S G W CC I L IY + +V Y+
Sbjct: 386 YPNPLE---SDGEYKFNMGACTRQPWFDCSCCPTNLIRFIPSLPGLIYSVDRDSV---YV 439
Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT--- 560
++ S D + G N+ V I L +T + + + +L +R+P W+
Sbjct: 440 NLFVGSKADIELG----NKNVRIIQKTSYPLDYKVTLNIEPQAATQFTLKIRIPGWSRNI 495
Query: 561 --------YSNGAQASL----NGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
Y+N + NG+ L + T+ W DK+ + LP ++
Sbjct: 496 PLPGDLYRYANKQNGKIRLLVNGEEQSLNISSGYAVITKLWEKGDKVDLILPKEVKKVLA 555
Query: 609 QDDRPEYASIQAILFGPYLLAGHTS 633
+ E + AI GP++ +
Sbjct: 556 NEKVKENRNKVAIELGPFVYCAEEA 580
>gi|168785451|ref|ZP_02810458.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC869]
gi|261224895|ref|ZP_05939176.1| hypothetical protein EscherichiacoliO157_09907 [Escherichia coli
O157:H7 str. FRIK2000]
gi|261254205|ref|ZP_05946738.1| hypothetical protein EscherichiacoliO157EcO_00065 [Escherichia coli
O157:H7 str. FRIK966]
gi|419100283|ref|ZP_13645472.1| hypothetical protein ECDEC4D_4507 [Escherichia coli DEC4D]
gi|420277651|ref|ZP_14779931.1| hypothetical protein ECPA40_4914 [Escherichia coli PA40]
gi|421826457|ref|ZP_16261810.1| hypothetical protein ECFRIK920_4874 [Escherichia coli FRIK920]
gi|424092641|ref|ZP_17828567.1| hypothetical protein ECFRIK1996_4807 [Escherichia coli FRIK1996]
gi|424105524|ref|ZP_17840261.1| hypothetical protein ECFRIK1990_4921 [Escherichia coli FRIK1990]
gi|424470965|ref|ZP_17920770.1| hypothetical protein ECPA41_4862 [Escherichia coli PA41]
gi|424496110|ref|ZP_17943684.1| hypothetical protein ECTW09195_4922 [Escherichia coli TW09195]
gi|425182551|ref|ZP_18580237.1| hypothetical protein ECFRIK1999_4970 [Escherichia coli FRIK1999]
gi|425195581|ref|ZP_18592342.1| hypothetical protein ECNE1487_5176 [Escherichia coli NE1487]
gi|425208438|ref|ZP_18604226.1| hypothetical protein ECFRIK2001_5178 [Escherichia coli FRIK2001]
gi|425245279|ref|ZP_18638577.1| hypothetical protein ECMA6_4977 [Escherichia coli MA6]
gi|428949368|ref|ZP_19021633.1| hypothetical protein EC881467_4850 [Escherichia coli 88.1467]
gi|428973751|ref|ZP_19044065.1| hypothetical protein EC900039_4634 [Escherichia coli 90.0039]
gi|429004396|ref|ZP_19072475.1| hypothetical protein EC950183_4871 [Escherichia coli 95.0183]
gi|429035002|ref|ZP_19100516.1| hypothetical protein EC960939_4827 [Escherichia coli 96.0939]
gi|429069551|ref|ZP_19132995.1| hypothetical protein EC990672_4784 [Escherichia coli 99.0672]
gi|189374407|gb|EDU92823.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC869]
gi|377938510|gb|EHV02277.1| hypothetical protein ECDEC4D_4507 [Escherichia coli DEC4D]
gi|390638393|gb|EIN17905.1| hypothetical protein ECFRIK1996_4807 [Escherichia coli FRIK1996]
gi|390660758|gb|EIN38450.1| hypothetical protein ECFRIK1990_4921 [Escherichia coli FRIK1990]
gi|390756526|gb|EIO26037.1| hypothetical protein ECPA40_4914 [Escherichia coli PA40]
gi|390764034|gb|EIO33252.1| hypothetical protein ECPA41_4862 [Escherichia coli PA41]
gi|390824028|gb|EIO90037.1| hypothetical protein ECTW09195_4922 [Escherichia coli TW09195]
gi|408064841|gb|EKG99322.1| hypothetical protein ECFRIK920_4874 [Escherichia coli FRIK920]
gi|408095070|gb|EKH28064.1| hypothetical protein ECFRIK1999_4970 [Escherichia coli FRIK1999]
gi|408106180|gb|EKH38296.1| hypothetical protein ECNE1487_5176 [Escherichia coli NE1487]
gi|408119214|gb|EKH50301.1| hypothetical protein ECFRIK2001_5178 [Escherichia coli FRIK2001]
gi|408157817|gb|EKH85958.1| hypothetical protein ECMA6_4977 [Escherichia coli MA6]
gi|427205698|gb|EKV75938.1| hypothetical protein EC881467_4850 [Escherichia coli 88.1467]
gi|427225134|gb|EKV93792.1| hypothetical protein EC900039_4634 [Escherichia coli 90.0039]
gi|427256997|gb|EKW23140.1| hypothetical protein EC950183_4871 [Escherichia coli 95.0183]
gi|427281172|gb|EKW45506.1| hypothetical protein EC960939_4827 [Escherichia coli 96.0939]
gi|427316599|gb|EKW78533.1| hypothetical protein EC990672_4784 [Escherichia coli 99.0672]
Length = 656
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|331675072|ref|ZP_08375829.1| putative cytoplasmic protein [Escherichia coli TA280]
gi|331067981|gb|EGI39379.1| putative cytoplasmic protein [Escherichia coli TA280]
Length = 662
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 83/365 (22%), Positives = 131/365 (35%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259
Query: 334 NTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + + Y TGG
Sbjct: 260 QAHLPIAQQQTAIGHAVRFVYLMTAVAHLARLSHDESKRQDCLRLWNNMAQRQLYITGGI 319
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + YAD ERAL N
Sbjct: 320 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGNSQYADVMERALYNT 376
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 377 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 427
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ +G +Y E LYI Y +S + + L +V W ++T+
Sbjct: 428 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIA 482
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W Q LNG+ + +L T W D L + L
Sbjct: 483 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDMLNLTL 538
Query: 600 PLSLR 604
P+ +R
Sbjct: 539 PMPVR 543
>gi|423240714|ref|ZP_17221828.1| hypothetical protein HMPREF1065_02451 [Bacteroides dorei
CL03T12C01]
gi|392643676|gb|EIY37425.1| hypothetical protein HMPREF1065_02451 [Bacteroides dorei
CL03T12C01]
Length = 811
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 99/435 (22%), Positives = 163/435 (37%), Gaps = 84/435 (19%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L A F + G LS + + H PI ++G +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---TDGHKLSEY-SQDHKPILQQDEIVGHAVR 275
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y T + + + TGG +R P+ + G
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + + +F T + YAD ERAL NGV+S GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G I F + +GN +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFMASVPYYMYATQGN--DV 432
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT- 560
Y+ +I S D ++ +N + WD + + +T +QE +L +R+P WT
Sbjct: 433 YVNLFIQSKADIETESNKINVEQTTGYPWDGKISIAVTPEKEQEF----ALRVRIPGWTQ 488
Query: 561 ----------YSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
+++ AQA S+NG + + + W D + I LP+ +R
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGFKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548
Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQE 667
D + AI GP + G+ + +P+ S++A L+
Sbjct: 549 ANDQVEDDHGKLAIERGPIMFC--LEGQDQADSTVFNKFIPDGTPMEASYDADLL----- 601
Query: 668 SGNSTFVMSNSNQSI 682
N V+S + + I
Sbjct: 602 --NGVMVLSGTAKEI 614
>gi|150009917|ref|YP_001304660.1| hypothetical protein BDI_3334 [Parabacteroides distasonis ATCC
8503]
gi|423333684|ref|ZP_17311465.1| hypothetical protein HMPREF1075_03116 [Parabacteroides distasonis
CL03T12C09]
gi|149938341|gb|ABR45038.1| putative exported protein [Parabacteroides distasonis ATCC 8503]
gi|409226994|gb|EKN19896.1| hypothetical protein HMPREF1075_03116 [Parabacteroides distasonis
CL03T12C09]
Length = 683
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 89/380 (23%), Positives = 143/380 (37%), Gaps = 37/380 (9%)
Query: 279 WYSLNEETGGMN-DVLYRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTH 336
W E+ GG N V+Y LY+IT D L L L K F + L D+LS +
Sbjct: 207 WTFWGEQRGGDNLMVVYWLYNITGDKFLLDLGELIHKQTFNWTDIFLNQDHLSRQLSLHC 266
Query: 337 IPIVIGSQ---MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD-T 392
+ + G + + Y+ DP + ++ + TG E R + T
Sbjct: 267 VNLAQGFKEPVVYYQQNQDPKQICAVKKAVKDIHNTIGLPTGLWGGDEL----LRFGEPT 322
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
GSE CT M+ + T ++ +ADY ER N L Q + Y
Sbjct: 323 TGSE---LCTAVEMMFSLEEMLEITGDVQWADYLERVAYNA-LPTQVTDDYSARQYYQQT 378
Query: 453 GRGVSKARSTHGWGT----------KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
+ V+ R + T + + CC + + KL ++++ N G+
Sbjct: 379 NQ-VAVTREWRNFSTPHDDTDILFGELTGYPCCTSNLHQGWPKLVQNLWYATADN--GIA 435
Query: 503 IIQYISSSFDWKSGHVVLNQ-KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
+ Y SS K + V Q + + +D L F K+ ++R+P W
Sbjct: 436 ALVYAPSSVKAKVANGVTVQIEEETAYPFDETLHFKFAFEDKKIKRAFFPFHIRIPAW-- 493
Query: 562 SNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
N LNG+N+ + PG W D LT++LP+ + Y
Sbjct: 494 CNQPVIKLNGENVVVDAYPGEIARINREWKQGDVLTVELPMQVAASRW------YGGSAV 547
Query: 621 ILFGPYLLAGHTSGEWDIKT 640
I GP + A + +W+ KT
Sbjct: 548 IERGPLVYALKMNEKWEKKT 567
>gi|432487351|ref|ZP_19729258.1| hypothetical protein A15Y_03854 [Escherichia coli KTE212]
gi|433175488|ref|ZP_20359993.1| hypothetical protein WGQ_03753 [Escherichia coli KTE232]
gi|431013718|gb|ELD27447.1| hypothetical protein A15Y_03854 [Escherichia coli KTE212]
gi|431688314|gb|ELJ53849.1| hypothetical protein WGQ_03753 [Escherichia coli KTE232]
Length = 656
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 85/365 (23%), Positives = 133/365 (36%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG- 376
H+P+ IG +R Y +TG D + + + Y TGG
Sbjct: 252 QAHLPLAQQQTAIGHAVRFAYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ +G +Y E LYI Y +S + + L +V W ++T+
Sbjct: 420 ARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPLENGTLRLRVSGNYPWQE--QVTIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W Q LNG+ + +L T W D L + L
Sbjct: 475 VESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTL 530
Query: 600 PLSLR 604
P+ +R
Sbjct: 531 PMPVR 535
>gi|432836527|ref|ZP_20070058.1| hypothetical protein A1YO_03904 [Escherichia coli KTE136]
gi|431382143|gb|ELG66487.1| hypothetical protein A1YO_03904 [Escherichia coli KTE136]
Length = 659
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGVQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|301307791|ref|ZP_07213747.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|423337090|ref|ZP_17314834.1| hypothetical protein HMPREF1059_00759 [Parabacteroides distasonis
CL09T03C24]
gi|300834134|gb|EFK64748.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|409238278|gb|EKN31071.1| hypothetical protein HMPREF1059_00759 [Parabacteroides distasonis
CL09T03C24]
Length = 680
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 92/424 (21%), Positives = 170/424 (40%), Gaps = 41/424 (9%)
Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
++ +L QY A N Q ++ +++ YF ++ ++ S W E+ GG N V+
Sbjct: 164 VMLKVLQQYYSATNDQ--RVISFLTNYFKYQLSELPK--SPLGKWTFWAEQRGGDNLMVV 219
Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
Y LY+IT DP L L L K F + L D+L+ ++ + + G + + Y+
Sbjct: 220 YWLYNITGDPFLLELGELIHKQTFNWTDVFLNQDHLARQNSLHCVNLAQGFKEPVIYYQQ 279
Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
+ +P + + + + TG W + L ++ E CT M+
Sbjct: 280 SHNPQNLEAVKEAVRKMRHTIGFPTG------LWAGDELLRFGNPTQGSELCTAVEMMFS 333
Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL--------GRG-VSKAR 460
+ T ++ +AD+ E+ N VL Q + Y + GR VS
Sbjct: 334 LEKMLEITGDVQWADHLEKVAYN-VLPTQIKDDFSARQYYQQVNQIAITCEGRNFVSPHE 392
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--- 517
T + + + CC + + K ++F N G+ + Y S + G+
Sbjct: 393 DTDIIFGELSGYPCCTSNLHQGWPKFTRHLWFATADN--GIASLIYAPSEVTAQVGNDIT 450
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
V + +K D ++ + L+F SK++ +LR+P W N ++NG+ + +
Sbjct: 451 VKIAEKTD--YPFEEKIDFNLSFPSKKDKKAFFPFHLRIPAW--CNNPVITINGEAVSIA 506
Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
G + W D + ++LP+ + T DD I GP L + +W
Sbjct: 507 AHSGEIVRINREWKDGDHIQLELPMRISTSNWYDD------AVVIERGPLLYSLKMDEKW 560
Query: 637 DIKT 640
+ K
Sbjct: 561 ERKV 564
>gi|238910286|ref|ZP_04654123.1| hypothetical protein SentesTe_04004 [Salmonella enterica subsp.
enterica serovar Tennessee str. CDC07-0191]
Length = 651
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 90/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + G+ L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEIPVGNGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|419864579|ref|ZP_14387018.1| hypothetical protein ECO9340_21841 [Escherichia coli O103:H25 str.
CVM9340]
gi|388339862|gb|EIL06180.1| hypothetical protein ECO9340_21841 [Escherichia coli O103:H25 str.
CVM9340]
Length = 659
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGVQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|255012841|ref|ZP_05284967.1| hypothetical protein B2_02974 [Bacteroides sp. 2_1_7]
gi|410102231|ref|ZP_11297158.1| hypothetical protein HMPREF0999_00930 [Parabacteroides sp. D25]
gi|409238953|gb|EKN31741.1| hypothetical protein HMPREF0999_00930 [Parabacteroides sp. D25]
Length = 680
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 92/424 (21%), Positives = 170/424 (40%), Gaps = 41/424 (9%)
Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
++ +L QY A N Q ++ +++ YF ++ ++ S W E+ GG N V+
Sbjct: 164 VMLKVLQQYYSATNDQ--RVISFLTNYFKYQLSELPK--SPLGKWTFWAEQRGGDNLMVV 219
Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
Y LY+IT DP L L L K F + L D+L+ ++ + + G + + Y+
Sbjct: 220 YWLYNITGDPFLLELGELIHKQTFNWTDVFLNQDHLARQNSLHCVNLAQGFKEPVIYYQQ 279
Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
+ +P + + + + TG W + L ++ E CT M+
Sbjct: 280 SHNPQNLEAVKEAVRKMRHTIGFPTG------LWAGDELLRFGNPTQGSELCTAVEMMFS 333
Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL--------GRG-VSKAR 460
+ T ++ +AD+ E+ N VL Q + Y + GR VS
Sbjct: 334 LEKMLEITGDVQWADHLEKVAYN-VLPTQIKDDFSARQYYQQVNQIAITCEGRNFVSPHE 392
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--- 517
T + + + CC + + K ++F N G+ + Y S + G+
Sbjct: 393 DTDIVFGELSGYPCCTSNLHQGWPKFTRHLWFATADN--GIASLIYAPSEVTAQVGNDIT 450
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
V + +K D ++ + L+F SK++ +LR+P W N ++NG+ + +
Sbjct: 451 VKIAEKTD--YPFEEKIDFNLSFPSKKDKKAFFPFHLRIPAW--CNNPVITINGEAVSIA 506
Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
G + W D + ++LP+ + T DD I GP L + +W
Sbjct: 507 AHSGEIVRINREWKDGDHIQLELPMRISTSNWYDD------AVVIERGPLLYSLKMDEKW 560
Query: 637 DIKT 640
+ K
Sbjct: 561 ERKV 564
>gi|300822009|ref|ZP_07102152.1| conserved hypothetical protein [Escherichia coli MS 119-7]
gi|331679667|ref|ZP_08380337.1| putative cytoplasmic protein [Escherichia coli H591]
gi|300525372|gb|EFK46441.1| conserved hypothetical protein [Escherichia coli MS 119-7]
gi|331072839|gb|EGI44164.1| putative cytoplasmic protein [Escherichia coli H591]
Length = 667
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 200 ALMRLYEVTEEPRYLALTNYFVEQRGVQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 259
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 260 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 313
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 370
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 421
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 422 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 477
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 478 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 532
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 533 TLNLTLPMPVR 543
>gi|193068520|ref|ZP_03049482.1| conserved hypothetical protein [Escherichia coli E110019]
gi|331670421|ref|ZP_08371260.1| putative cytoplasmic protein [Escherichia coli TA271]
gi|332282156|ref|ZP_08394569.1| conserved hypothetical protein [Shigella sp. D9]
gi|417222825|ref|ZP_12026265.1| putative glycosyhydrolase [Escherichia coli 96.154]
gi|417267012|ref|ZP_12054373.1| putative glycosyhydrolase [Escherichia coli 3.3884]
gi|417604475|ref|ZP_12255039.1| hypothetical protein ECSTEC94C_4306 [Escherichia coli STEC_94C]
gi|418040528|ref|ZP_12678768.1| hypothetical protein ECW26_09970 [Escherichia coli W26]
gi|419926997|ref|ZP_14444741.1| hypothetical protein EC5411_02255 [Escherichia coli 541-1]
gi|423707870|ref|ZP_17682250.1| hypothetical protein ESTG_02341 [Escherichia coli B799]
gi|432378754|ref|ZP_19621737.1| hypothetical protein WCQ_03650 [Escherichia coli KTE12]
gi|432482897|ref|ZP_19724846.1| hypothetical protein A15U_04037 [Escherichia coli KTE210]
gi|432676705|ref|ZP_19912149.1| hypothetical protein A1YU_03256 [Escherichia coli KTE142]
gi|433200343|ref|ZP_20384227.1| hypothetical protein WGW_03892 [Escherichia coli KTE94]
gi|192958171|gb|EDV88612.1| conserved hypothetical protein [Escherichia coli E110019]
gi|331062483|gb|EGI34403.1| putative cytoplasmic protein [Escherichia coli TA271]
gi|332104508|gb|EGJ07854.1| conserved hypothetical protein [Shigella sp. D9]
gi|345347843|gb|EGW80147.1| hypothetical protein ECSTEC94C_4306 [Escherichia coli STEC_94C]
gi|383476508|gb|EID68447.1| hypothetical protein ECW26_09970 [Escherichia coli W26]
gi|385709502|gb|EIG46500.1| hypothetical protein ESTG_02341 [Escherichia coli B799]
gi|386202627|gb|EII01618.1| putative glycosyhydrolase [Escherichia coli 96.154]
gi|386229370|gb|EII56725.1| putative glycosyhydrolase [Escherichia coli 3.3884]
gi|388408480|gb|EIL68825.1| hypothetical protein EC5411_02255 [Escherichia coli 541-1]
gi|430896388|gb|ELC18632.1| hypothetical protein WCQ_03650 [Escherichia coli KTE12]
gi|431003915|gb|ELD19148.1| hypothetical protein A15U_04037 [Escherichia coli KTE210]
gi|431210613|gb|ELF08667.1| hypothetical protein A1YU_03256 [Escherichia coli KTE142]
gi|431717675|gb|ELJ81769.1| hypothetical protein WGW_03892 [Escherichia coli KTE94]
Length = 659
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGVQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|415831195|ref|ZP_11516965.1| hypothetical protein ECOK1357_3952 [Escherichia coli OK1357]
gi|323182744|gb|EFZ68146.1| hypothetical protein ECOK1357_3952 [Escherichia coli OK1357]
Length = 659
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGVQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|256838375|ref|ZP_05543885.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|256739294|gb|EEU52618.1| conserved hypothetical protein [Parabacteroides sp. D13]
Length = 680
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 92/424 (21%), Positives = 170/424 (40%), Gaps = 41/424 (9%)
Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
++ +L QY A N Q ++ +++ YF ++ ++ S W E+ GG N V+
Sbjct: 164 VMLKVLQQYYSATNDQ--RVISFLTNYFKYQLSELPK--SPLGKWTFWAEQRGGDNLMVV 219
Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
Y LY+IT DP L L L K F + L D+L+ ++ + + G + + Y+
Sbjct: 220 YWLYNITGDPFLLELGELIHKQTFNWTDVFLNQDHLARQNSLHCVNLAQGFKEPVIYYQQ 279
Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
+ +P + + + + TG W + L ++ E CT M+
Sbjct: 280 SHNPQNLEAVKEAVRKMRHTIGFPTG------LWAGDELLRFGNPTQGSELCTAVEMMFS 333
Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL--------GRG-VSKAR 460
+ T ++ +AD+ E+ N VL Q + Y + GR VS
Sbjct: 334 LEKMLEITGDVQWADHLEKVAYN-VLPTQIKDDFSARQYYQQVNQIAITCEGRNFVSPHE 392
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--- 517
T + + + CC + + K ++F N G+ + Y S + G+
Sbjct: 393 DTDIVFGELSGYPCCTSNLHQGWPKFTRHLWFATADN--GIASLIYAPSEVTAQVGNDIT 450
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
V + +K D ++ + L+F SK++ +LR+P W N ++NG+ + +
Sbjct: 451 VKIAEKTD--YPFEEKIDFNLSFPSKKDKKAFFPFHLRIPAW--CNNPVITINGEAVSIA 506
Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
G + W D + ++LP+ + T DD I GP L + +W
Sbjct: 507 AHSGEIVRINREWKDGDHIQLELPMRISTSNWYDD------AVVIERGPLLYSLKMDEKW 560
Query: 637 DIKT 640
+ K
Sbjct: 561 ERKV 564
>gi|448238160|ref|YP_007402218.1| hypothetical protein GHH_c19490 [Geobacillus sp. GHH01]
gi|445207002|gb|AGE22467.1| hypothetical protein GHH_c19490 [Geobacillus sp. GHH01]
Length = 640
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 64/271 (23%), Positives = 107/271 (39%), Gaps = 36/271 (13%)
Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSA----REFWWDPKRLADTLGSENEETCTTYN 405
TGD K + V Y TGG + F +D DT+ +E TC +
Sbjct: 275 TGDESLKQACQTLWENVTKRQMYITGGVGSSAFGESFTFDFDLPNDTVYAE---TCASIA 331
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-----------GR 454
++ +R + + YAD ERAL NG +S + Y+ PL R
Sbjct: 332 LVFWARRMLELEMDGKYADVMERALYNGTIS-GMDLDGKRFFYVNPLEVWPKACERHDKR 390
Query: 455 GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK 514
V R K+ S CC + +G IY + L++ Y+ S+ +
Sbjct: 391 HVKPVRQ------KWFSCACCPPNLARLIASIGHYIYSQ---TSDALFVHLYVGSNIQTE 441
Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
G + + WD +R+T++ S QE +L LR+P W GA+ ++NG+N+
Sbjct: 442 IGGRSVEIVQETNYPWDGTVRLTISPESAQEF----TLGLRIPGWC--RGAEVTINGENV 495
Query: 575 PLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
+ P + W D++ + + +
Sbjct: 496 DIAPLTKKGYAYIRRVWRQGDEMVLHFSMPV 526
>gi|298374270|ref|ZP_06984228.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
gi|298268638|gb|EFI10293.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
Length = 680
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 92/424 (21%), Positives = 170/424 (40%), Gaps = 41/424 (9%)
Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
++ +L QY A N Q ++ +++ YF ++ ++ S W E+ GG N V+
Sbjct: 164 VMLKVLQQYYSATNDQ--RVISFLTNYFKYQLSELPK--SPLGKWTFWAEQRGGDNLMVV 219
Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
Y LY+IT DP L L L K F + L D+L+ ++ + + G + + Y+
Sbjct: 220 YWLYNITGDPFLLELGELIHKQTFNWTDVFLNQDHLARQNSLHCVNLAQGFKEPVIYYQQ 279
Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
+ +P + + + + TG W + L ++ E CT M+
Sbjct: 280 SHNPQNLEAVKEAVRKMRHTIGFPTG------LWAGDELLRFGNPTQGSELCTAVEMMFS 333
Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL--------GRG-VSKAR 460
+ T ++ +AD+ E+ N VL Q + Y + GR VS
Sbjct: 334 LEKMLEITGDVQWADHLEKVAYN-VLPTQIKDDFSARQYYQQVNQIAITCEGRNFVSPHE 392
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--- 517
T + + + CC + + K ++F N G+ + Y S + G+
Sbjct: 393 DTDIVFGELSGYPCCTSNLHQGWPKFTRHLWFATADN--GIASLIYAPSEVTAQVGNDIT 450
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
V + +K D ++ + L+F SK++ +LR+P W N ++NG+ + +
Sbjct: 451 VKIAEKTD--YPFEEKIDFNLSFPSKKDKKAFFPFHLRIPAW--CNNPVITINGEAVSIA 506
Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
G + W D + ++LP+ + T DD I GP L + +W
Sbjct: 507 AHSGEIVRINREWKDGDHIQLELPMRISTSNWYDD------AVVIERGPLLYSLKMDEKW 560
Query: 637 DIKT 640
+ K
Sbjct: 561 ERKV 564
>gi|212692449|ref|ZP_03300577.1| hypothetical protein BACDOR_01945 [Bacteroides dorei DSM 17855]
gi|212665028|gb|EEB25600.1| hypothetical protein BACDOR_01945 [Bacteroides dorei DSM 17855]
Length = 811
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 95/415 (22%), Positives = 157/415 (37%), Gaps = 77/415 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L A F + G LS + + H PI ++G +R
Sbjct: 220 ALVKLYKVTGDEKYLQTAKYFVEETGRG---SDGHKLSEY-SQDHKPILQQDEIVGHAVR 275
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y T + + + TGG +R P+ + G
Sbjct: 276 AGYLYSGVADVAALTHDTAYFNALTRIWENMAGKKLFITGGIGSR-----PQ--GEGFGP 328
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + + +F T + YAD ERAL NGV+S GV +
Sbjct: 329 NYELNNHTAYCETCASIANVYWNHRMFLATGDAKYADVLERALYNGVIS-------GVSL 381
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G I F + +GN +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-ITRFVASVPYYMYATQGN--DV 432
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-- 559
Y+ YI S D ++ +N + W+ + +++T +QE +L +R+P W
Sbjct: 433 YVNLYIQSKADIETESNKINVEQTTDYPWNGKISISVTPEKEQEF----ALRVRIPGWAQ 488
Query: 560 ---------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
++++ AQA S+NG + + + W D + I LP+ +R
Sbjct: 489 DAPVPTDLYSFTDKAQAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVK 548
Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLV 662
D + AI GP + G+ + +P+ SF+A L+
Sbjct: 549 ANDQVEDDHGKLAIERGPIMFC--LEGQDQADSTVFNKFIPDGTPMEASFHADLL 601
>gi|373958292|ref|ZP_09618252.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373894892|gb|EHQ30789.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 679
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 95/423 (22%), Positives = 168/423 (39%), Gaps = 52/423 (12%)
Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
+ +Y T +PK+L L+ +L D + + + + HA + G+
Sbjct: 228 VVEMYRTTREPKYLELSKNLIDIRGLMKDGTDDNQDRIPFREQTQALGHAVRANYLYAGA 287
Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATGGTSA----------REFWWDPKRLADT 392
Y TGD L + + D+VN Y TGG A D +++
Sbjct: 288 ADVYAETGDTTLMHTLNLVWNDVVN-RKMYITGGCGAIYDGASPDGTSYLLKDVQQIHQA 346
Query: 393 LGSE--------NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRG--- 440
G + + ETC + + + + + T + YAD E L NG+LS I
Sbjct: 347 YGRDYQLPNFTAHNETCASVGNVLWNWRMLQLTGKAQYADVMELTLYNGMLSGISLNGKK 406
Query: 441 ---TEPGVMIYMLPLGRGVSKARSTH-GWGTKFNSFWCCYGTGIESFSKLGDSIY-FEEE 495
T P + +P + SK R + G+ CC I + +++G+ Y ++
Sbjct: 407 FLYTNPLSVSDDMPFQQRWSKDRVDYIGYSD------CCPPNVIRTIAEIGNYAYSISDK 460
Query: 496 GNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
G LY +S+ + L+Q+ D WD + + L + + SL LR
Sbjct: 461 GVWVNLYGGNNLSTQLLKDGSKIKLSQQTD--YPWDGKISIALN----EVPAKAFSLFLR 514
Query: 556 MPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPE 614
+P W S GA ++NG+ + + PG + +W DK+ + LP+ ++ E
Sbjct: 515 IPGWCGS-GASVTVNGKAVNTILTPGQYAEINGKWHAGDKIELLLPMPVKMIEANPLVEE 573
Query: 615 YASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFV 674
+ A+ GP + ++G K + SLS+ I+ +P +GN+T
Sbjct: 574 VRNQIAVKRGPVVYCVESAGMPKDKKVFSLSLSSKINLVPQKIVIDNSDIVALNGNATLE 633
Query: 675 MSN 677
+N
Sbjct: 634 NAN 636
>gi|438041968|ref|ZP_20855782.1| hypothetical protein SEEE5646_18016 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-5646]
gi|435321796|gb|ELO94162.1| hypothetical protein SEEE5646_18016 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-5646]
Length = 646
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 90/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGVQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + G+ L ++ W +++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI- 475
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 476 -DSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|205354717|ref|YP_002228518.1| hypothetical protein SG3751 [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|375125607|ref|ZP_09770771.1| hypothetical protein SG9_3828 [Salmonella enterica subsp. enterica
serovar Gallinarum str. SG9]
gi|445130406|ref|ZP_21381321.1| hypothetical protein SEEG9184_014572 [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
gi|205274498|emb|CAR39532.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|326629857|gb|EGE36200.1| hypothetical protein SG9_3828 [Salmonella enterica subsp. enterica
serovar Gallinarum str. SG9]
gi|444852215|gb|ELX77297.1| hypothetical protein SEEG9184_014572 [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
Length = 651
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 90/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGVQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + G+ L ++ W +++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI- 475
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 476 -DSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|207858916|ref|YP_002245567.1| hypothetical protein SEN3501 [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|421357264|ref|ZP_15807576.1| hypothetical protein SEEE3139_04408 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|421362069|ref|ZP_15812325.1| hypothetical protein SEEE0166_05558 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|421368596|ref|ZP_15818785.1| hypothetical protein SEEE0631_15493 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|421370704|ref|ZP_15820867.1| hypothetical protein SEEE0424_03357 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|421376619|ref|ZP_15826719.1| hypothetical protein SEEE3076_10306 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|421379882|ref|ZP_15829946.1| hypothetical protein SEEE4917_03813 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|421387196|ref|ZP_15837201.1| hypothetical protein SEEE6622_17936 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|421388833|ref|ZP_15838818.1| hypothetical protein SEEE6670_03439 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|421393233|ref|ZP_15843178.1| hypothetical protein SEEE6426_02862 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|421400876|ref|ZP_15850758.1| hypothetical protein SEEE6437_19177 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|421404698|ref|ZP_15854538.1| hypothetical protein SEEE7246_15620 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|421408356|ref|ZP_15858156.1| hypothetical protein SEEE7250_11351 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|421414364|ref|ZP_15864109.1| hypothetical protein SEEE1427_18818 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|421418252|ref|ZP_15867957.1| hypothetical protein SEEE2659_15744 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|421423488|ref|ZP_15873147.1| hypothetical protein SEEE1757_19367 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|421427667|ref|ZP_15877286.1| hypothetical protein SEEE5101_17714 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|421429796|ref|ZP_15879391.1| hypothetical protein SEEE8B1_05655 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|421437646|ref|ZP_15887162.1| hypothetical protein SEEE5518_21932 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|421438534|ref|ZP_15888029.1| hypothetical protein SEEE1618_03571 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|421443523|ref|ZP_15892964.1| hypothetical protein SEEE3079_05727 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|436605457|ref|ZP_20513395.1| hypothetical protein SEE22704_08488 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|436694238|ref|ZP_20518150.1| hypothetical protein SEE30663_08486 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|436803411|ref|ZP_20525841.1| hypothetical protein SEECHS44_21392 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|436810025|ref|ZP_20529267.1| hypothetical protein SEEE1882_15710 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|436816420|ref|ZP_20533798.1| hypothetical protein SEEE1884_15817 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|436832038|ref|ZP_20536533.1| hypothetical protein SEEE1594_06730 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|436849358|ref|ZP_20540514.1| hypothetical protein SEEE1566_04035 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|436858888|ref|ZP_20547165.1| hypothetical protein SEEE1580_15161 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|436862962|ref|ZP_20549538.1| hypothetical protein SEEE1543_04463 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|436874233|ref|ZP_20556894.1| hypothetical protein SEEE1441_19202 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|436876728|ref|ZP_20558061.1| hypothetical protein SEEE1810_02403 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|436886249|ref|ZP_20562678.1| hypothetical protein SEEE1558_02954 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|436893215|ref|ZP_20567194.1| hypothetical protein SEEE1018_02862 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|436900848|ref|ZP_20571778.1| hypothetical protein SEEE1010_03453 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|436913977|ref|ZP_20579179.1| hypothetical protein SEEE1729_18338 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|436919198|ref|ZP_20582051.1| hypothetical protein SEEE0895_09976 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|436928295|ref|ZP_20587740.1| hypothetical protein SEEE0899_15810 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|436937155|ref|ZP_20592450.1| hypothetical protein SEEE1457_16920 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|436944088|ref|ZP_20596699.1| hypothetical protein SEEE1747_15796 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|436953454|ref|ZP_20601804.1| hypothetical protein SEEE0968_18795 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|436962937|ref|ZP_20605560.1| hypothetical protein SEEE1444_14849 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|436967670|ref|ZP_20607424.1| hypothetical protein SEEE1445_01381 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|436978926|ref|ZP_20612901.1| hypothetical protein SEEE1559_06556 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|436995892|ref|ZP_20619592.1| hypothetical protein SEEE1565_17627 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|437011806|ref|ZP_20624610.1| hypothetical protein SEEE1808_20431 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|437019323|ref|ZP_20627061.1| hypothetical protein SEEE1811_09865 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|437026609|ref|ZP_20629868.1| hypothetical protein SEEE0956_01180 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|437041181|ref|ZP_20635197.1| hypothetical protein SEEE1455_05273 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|437051574|ref|ZP_20641455.1| hypothetical protein SEEE1575_14346 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|437056616|ref|ZP_20644024.1| hypothetical protein SEEE1725_04709 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|437067549|ref|ZP_20650399.1| hypothetical protein SEEE1745_14184 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|437073604|ref|ZP_20653177.1| hypothetical protein SEEE1791_05329 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|437082599|ref|ZP_20658441.1| hypothetical protein SEEE1795_09343 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|437089107|ref|ZP_20661970.1| hypothetical protein SEEE6709_04610 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|437103922|ref|ZP_20666960.1| hypothetical protein SEEE9058_06907 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|437126597|ref|ZP_20674605.1| hypothetical protein SEEE0816_23102 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|437131843|ref|ZP_20677676.1| hypothetical protein SEEE0819_15610 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|437136794|ref|ZP_20680031.1| hypothetical protein SEEE3072_04644 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|437143889|ref|ZP_20684687.1| hypothetical protein SEEE3089_05314 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|437154248|ref|ZP_20690986.1| hypothetical protein SEEE9163_14382 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|437162604|ref|ZP_20696211.1| hypothetical protein SEEE151_18100 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|437166884|ref|ZP_20698338.1| hypothetical protein SEEEN202_06125 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|437178010|ref|ZP_20704356.1| hypothetical protein SEEE3991_14003 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|437183055|ref|ZP_20707414.1| hypothetical protein SEEE3618_06829 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|437198906|ref|ZP_20711454.1| hypothetical protein SEEE1831_04503 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|437262882|ref|ZP_20719212.1| hypothetical protein SEEE2490_17657 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|437271416|ref|ZP_20723680.1| hypothetical protein SEEEL909_17675 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|437275478|ref|ZP_20725823.1| hypothetical protein SEEEL913_05569 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|437291505|ref|ZP_20731569.1| hypothetical protein SEEE4941_12100 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|437304204|ref|ZP_20733917.1| hypothetical protein SEEE7015_01180 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|437324305|ref|ZP_20739563.1| hypothetical protein SEEE7927_06991 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|437339496|ref|ZP_20744149.1| hypothetical protein SEEECHS4_07410 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|437430625|ref|ZP_20755828.1| hypothetical protein SEEE2217_21365 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|437447211|ref|ZP_20758929.1| hypothetical protein SEEE4018_14080 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|437464509|ref|ZP_20763586.1| hypothetical protein SEEE6211_14683 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|437474444|ref|ZP_20766236.1| hypothetical protein SEEE4441_05197 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|437490700|ref|ZP_20771023.1| hypothetical protein SEEE4647_06743 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|437518116|ref|ZP_20778521.1| hypothetical protein SEEE9845_22471 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|437563498|ref|ZP_20786805.1| hypothetical protein SEEE0116_18550 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|437572857|ref|ZP_20789281.1| hypothetical protein SEEE1117_07915 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|437593902|ref|ZP_20795526.1| hypothetical protein SEEE1392_17150 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|437607245|ref|ZP_20800160.1| hypothetical protein SEEE0268_17650 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|437617397|ref|ZP_20802955.1| hypothetical protein SEEE0316_08756 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|437653610|ref|ZP_20810238.1| hypothetical protein SEEE0436_23184 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|437661278|ref|ZP_20812888.1| hypothetical protein SEEE1319_12828 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|437677654|ref|ZP_20817320.1| hypothetical protein SEEE4481_12502 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|437691966|ref|ZP_20820894.1| hypothetical protein SEEE6297_07188 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|437707522|ref|ZP_20825711.1| hypothetical protein SEEE4220_08863 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|437725054|ref|ZP_20829741.1| hypothetical protein SEEE1616_05992 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 16-16]
gi|437789741|ref|ZP_20837126.1| hypothetical protein SEEE2651_21208 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 76-2651]
gi|437814063|ref|ZP_20842185.1| hypothetical protein SEEE3944_22067 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|437862553|ref|ZP_20847967.1| hypothetical protein SEEE5621_05022 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
gi|438086893|ref|ZP_20859191.1| hypothetical protein SEEE2625_08240 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|438102729|ref|ZP_20865150.1| hypothetical protein SEEE1976_15564 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|438113496|ref|ZP_20869671.1| hypothetical protein SEEE3407_15758 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|445168673|ref|ZP_21394919.1| hypothetical protein SEE8A_021969 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|445186279|ref|ZP_21399191.1| hypothetical protein SE20037_06257 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|445231881|ref|ZP_21405859.1| hypothetical protein SEE10_020946 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|445237706|ref|ZP_21407161.1| hypothetical protein SEE436_026260 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 436]
gi|445333559|ref|ZP_21414841.1| hypothetical protein SEE18569_008126 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|445345844|ref|ZP_21418446.1| hypothetical protein SEE13_018274 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|445356148|ref|ZP_21421740.1| hypothetical protein SEE23_007297 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
gi|206710719|emb|CAR35080.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|395984836|gb|EJH94014.1| hypothetical protein SEEE0631_15493 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|395991902|gb|EJI01024.1| hypothetical protein SEEE0166_05558 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|395992120|gb|EJI01241.1| hypothetical protein SEEE3139_04408 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|396001983|gb|EJI10994.1| hypothetical protein SEEE3076_10306 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|396004947|gb|EJI13927.1| hypothetical protein SEEE4917_03813 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|396005988|gb|EJI14959.1| hypothetical protein SEEE0424_03357 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|396010336|gb|EJI19249.1| hypothetical protein SEEE6622_17936 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|396017969|gb|EJI26832.1| hypothetical protein SEEE6426_02862 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|396018877|gb|EJI27737.1| hypothetical protein SEEE6670_03439 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|396022763|gb|EJI31575.1| hypothetical protein SEEE6437_19177 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|396025631|gb|EJI34407.1| hypothetical protein SEEE7246_15620 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|396028864|gb|EJI37623.1| hypothetical protein SEEE7250_11351 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|396036970|gb|EJI45625.1| hypothetical protein SEEE1427_18818 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|396037577|gb|EJI46226.1| hypothetical protein SEEE1757_19367 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|396038879|gb|EJI47511.1| hypothetical protein SEEE2659_15744 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|396049784|gb|EJI58322.1| hypothetical protein SEEE5518_21932 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|396050924|gb|EJI59443.1| hypothetical protein SEEE5101_17714 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|396058175|gb|EJI66643.1| hypothetical protein SEEE8B1_05655 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|396070205|gb|EJI78534.1| hypothetical protein SEEE3079_05727 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|396072341|gb|EJI80651.1| hypothetical protein SEEE1618_03571 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|434956555|gb|ELL50284.1| hypothetical protein SEECHS44_21392 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|434966085|gb|ELL58983.1| hypothetical protein SEEE1882_15710 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|434972090|gb|ELL64574.1| hypothetical protein SEE22704_08488 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|434972217|gb|ELL64683.1| hypothetical protein SEEE1884_15817 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|434981889|gb|ELL73751.1| hypothetical protein SEEE1594_06730 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|434987983|gb|ELL79584.1| hypothetical protein SEEE1580_15161 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|434988731|gb|ELL80315.1| hypothetical protein SEEE1566_04035 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|434997520|gb|ELL88761.1| hypothetical protein SEEE1441_19202 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|434998217|gb|ELL89439.1| hypothetical protein SEEE1543_04463 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|435000158|gb|ELL91309.1| hypothetical protein SEE30663_08486 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|435010814|gb|ELM01577.1| hypothetical protein SEEE1810_02403 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|435012005|gb|ELM02695.1| hypothetical protein SEEE1558_02954 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|435018866|gb|ELM09311.1| hypothetical protein SEEE1018_02862 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|435022069|gb|ELM12420.1| hypothetical protein SEEE1010_03453 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|435023777|gb|ELM14017.1| hypothetical protein SEEE1729_18338 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|435030256|gb|ELM20297.1| hypothetical protein SEEE0895_09976 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|435034856|gb|ELM24713.1| hypothetical protein SEEE0899_15810 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|435036430|gb|ELM26251.1| hypothetical protein SEEE1457_16920 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|435040717|gb|ELM30470.1| hypothetical protein SEEE1747_15796 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|435048135|gb|ELM37702.1| hypothetical protein SEEE0968_18795 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|435049092|gb|ELM38627.1| hypothetical protein SEEE1444_14849 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|435060990|gb|ELM50227.1| hypothetical protein SEEE1445_01381 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|435062727|gb|ELM51908.1| hypothetical protein SEEE1565_17627 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|435064420|gb|ELM53549.1| hypothetical protein SEEE1808_20431 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|435069121|gb|ELM58130.1| hypothetical protein SEEE1559_06556 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|435080300|gb|ELM68982.1| hypothetical protein SEEE1811_09865 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|435086361|gb|ELM74900.1| hypothetical protein SEEE0956_01180 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|435086388|gb|ELM74926.1| hypothetical protein SEEE1455_05273 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|435092283|gb|ELM80650.1| hypothetical protein SEEE1575_14346 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|435095779|gb|ELM84062.1| hypothetical protein SEEE1745_14184 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|435097290|gb|ELM85551.1| hypothetical protein SEEE1725_04709 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|435108390|gb|ELM96357.1| hypothetical protein SEEE1791_05329 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|435109351|gb|ELM97304.1| hypothetical protein SEEE1795_09343 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|435115756|gb|ELN03511.1| hypothetical protein SEEE0816_23102 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|435115924|gb|ELN03677.1| hypothetical protein SEEE6709_04610 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|435121957|gb|ELN09480.1| hypothetical protein SEEE9058_06907 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|435123743|gb|ELN11235.1| hypothetical protein SEEE0819_15610 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|435136035|gb|ELN23136.1| hypothetical protein SEEE3072_04644 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|435139610|gb|ELN26601.1| hypothetical protein SEEE3089_05314 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|435139761|gb|ELN26742.1| hypothetical protein SEEE9163_14382 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|435143085|gb|ELN29964.1| hypothetical protein SEEE151_18100 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|435152694|gb|ELN39323.1| hypothetical protein SEEEN202_06125 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|435153800|gb|ELN40397.1| hypothetical protein SEEE3991_14003 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|435161457|gb|ELN47685.1| hypothetical protein SEEE2490_17657 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|435162986|gb|ELN49124.1| hypothetical protein SEEE3618_06829 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|435169890|gb|ELN55648.1| hypothetical protein SEEEL909_17675 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|435174737|gb|ELN60178.1| hypothetical protein SEEEL913_05569 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|435181699|gb|ELN66752.1| hypothetical protein SEEE4941_12100 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|435188330|gb|ELN73047.1| hypothetical protein SEEE7015_01180 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|435194134|gb|ELN78592.1| hypothetical protein SEEE7927_06991 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|435195768|gb|ELN80158.1| hypothetical protein SEEECHS4_07410 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|435199033|gb|ELN83153.1| hypothetical protein SEEE2217_21365 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|435209540|gb|ELN92853.1| hypothetical protein SEEE4018_14080 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|435217080|gb|ELN99522.1| hypothetical protein SEEE6211_14683 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|435220781|gb|ELO03061.1| hypothetical protein SEEE1831_04503 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|435224213|gb|ELO06185.1| hypothetical protein SEEE4441_05197 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|435228101|gb|ELO09552.1| hypothetical protein SEEE9845_22471 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|435229852|gb|ELO11187.1| hypothetical protein SEEE4647_06743 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|435237063|gb|ELO17777.1| hypothetical protein SEEE0116_18550 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|435247221|gb|ELO27192.1| hypothetical protein SEEE1117_07915 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|435251581|gb|ELO31186.1| hypothetical protein SEEE1392_17150 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|435253937|gb|ELO33352.1| hypothetical protein SEEE0268_17650 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|435260557|gb|ELO39749.1| hypothetical protein SEEE0316_08756 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|435264830|gb|ELO43722.1| hypothetical protein SEEE0436_23184 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|435268721|gb|ELO47301.1| hypothetical protein SEEE1319_12828 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|435274894|gb|ELO52988.1| hypothetical protein SEEE4481_12502 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|435280067|gb|ELO57793.1| hypothetical protein SEEE6297_07188 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|435290984|gb|ELO67872.1| hypothetical protein SEEE4220_08863 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|435293025|gb|ELO69762.1| hypothetical protein SEEE1616_05992 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 16-16]
gi|435295196|gb|ELO71717.1| hypothetical protein SEEE2651_21208 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 76-2651]
gi|435295991|gb|ELO72414.1| hypothetical protein SEEE3944_22067 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|435318636|gb|ELO91560.1| hypothetical protein SEEE2625_08240 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|435323736|gb|ELO95733.1| hypothetical protein SEEE1976_15564 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|435329624|gb|ELP01026.1| hypothetical protein SEEE3407_15758 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|435336306|gb|ELP06273.1| hypothetical protein SEEE5621_05022 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
gi|444862919|gb|ELX87757.1| hypothetical protein SEE10_020946 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|444864401|gb|ELX89201.1| hypothetical protein SEE8A_021969 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|444869705|gb|ELX94276.1| hypothetical protein SE20037_06257 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|444875839|gb|ELY00033.1| hypothetical protein SEE18569_008126 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|444878778|gb|ELY02892.1| hypothetical protein SEE13_018274 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|444887218|gb|ELY10942.1| hypothetical protein SEE23_007297 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
gi|444891559|gb|ELY14803.1| hypothetical protein SEE436_026260 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 436]
Length = 651
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 90/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGVQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + G+ L ++ W +++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI- 475
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 476 -DSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|198242542|ref|YP_002217640.1| hypothetical protein SeD_A4064 [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gi|375121158|ref|ZP_09766325.1| Six-hairpin glycosidase-like domain protein [Salmonella enterica
subsp. enterica serovar Dublin str. SD3246]
gi|445143487|ref|ZP_21386535.1| hypothetical protein SEEDSL_010877 [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
gi|445149123|ref|ZP_21388948.1| hypothetical protein SEEDHWS_023670 [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
gi|197937058|gb|ACH74391.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gi|326625425|gb|EGE31770.1| Six-hairpin glycosidase-like domain protein [Salmonella enterica
subsp. enterica serovar Dublin str. SD3246]
gi|444848141|gb|ELX73271.1| hypothetical protein SEEDSL_010877 [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
gi|444858418|gb|ELX83404.1| hypothetical protein SEEDHWS_023670 [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
Length = 651
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 90/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----KPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGVQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + G+ L ++ W +++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI- 475
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 476 -DSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|378580796|ref|ZP_09829449.1| secreted protein [Pantoea stewartii subsp. stewartii DC283]
gi|377816535|gb|EHT99637.1| secreted protein [Pantoea stewartii subsp. stewartii DC283]
Length = 651
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 95/251 (37%), Gaps = 39/251 (15%)
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H FN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLPFNHIYDHVKPVRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + LG IY E L+I YI + + G+ L ++ + W
Sbjct: 414 CCPPNIARLLTSLGHYIYTPRED---ALFINLYIGNRVEIPVGNQTLGLRISGNLPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
+T+T S Q V +L LR+P W S Q + NG + +L W D
Sbjct: 470 -TVTITIDSTQPVNH--ALALRLPDWCAS--PQITCNGTEVNEAARKGYLYLNRHWQEGD 524
Query: 594 KLTIQLPLSLR 604
+T+ LP+ +R
Sbjct: 525 TVTLTLPMPVR 535
>gi|209551193|ref|YP_002283110.1| hypothetical protein Rleg2_3619 [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|209536949|gb|ACI56884.1| protein of unknown function DUF1680 [Rhizobium leguminosarum bv.
trifolii WSM2304]
Length = 640
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 90/396 (22%), Positives = 155/396 (39%), Gaps = 61/396 (15%)
Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
L +L +T + K+L L+ F + F A + FH T H P+
Sbjct: 198 ALVKLARVTGEKKYLELSKFFIDERGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 258 REQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + + YAD E+AL NG L
Sbjct: 317 NEGFTDYYDLPN--ATAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
T+ Y PL A H W K++ CC + +G +Y + +
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426
Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+++ ++ +G V L Q + WD +TF+++ + +L+LR+P
Sbjct: 427 AVHLYGESTARLKLANGAEVELQQTTN--YPWDG----AVTFATRLKAPAKFALSLRIPD 480
Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W + GA S+NG+ L L + +W+ D++ + LPLSLR + + A
Sbjct: 481 W--AEGATLSVNGEMLDLAANIRDGYARIDRQWTDGDRVALSLPLSLRPQYANPKVRQDA 538
Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
A++ GP + T T L+A++ P
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGEDLNAIVLP 567
>gi|340346785|ref|ZP_08669904.1| hypothetical protein HMPREF9136_0902 [Prevotella dentalis DSM 3688]
gi|433652020|ref|YP_007278399.1| hypothetical protein Prede_1032 [Prevotella dentalis DSM 3688]
gi|339611002|gb|EGQ15842.1| hypothetical protein HMPREF9136_0902 [Prevotella dentalis DSM 3688]
gi|433302553|gb|AGB28369.1| hypothetical protein Prede_1032 [Prevotella dentalis DSM 3688]
Length = 663
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 74/327 (22%), Positives = 131/327 (40%), Gaps = 42/327 (12%)
Query: 291 DVLYRLYSITHDPKHLLLAH-------------LFDKPCFLGFLALQADYLS-HFHANTH 336
D + RLY+IT ++L A F + + L D L + HA+T
Sbjct: 229 DPIARLYTITGKKRYLDWAKWVVGNIDKWSGWDAFSRLDSIADGKLGVDQLQPYVHAHTF 288
Query: 337 IPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+G Y++TGD L + + + DI Y TGG S E + K L
Sbjct: 289 QMNFMGFLRLYQITGDRSLLRKVEGAWNDIYR-RQMYITGGVSVAEHY--EKGYVKPLSG 345
Query: 396 ENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
ETC T + +++++ L T + YAD E+ + N V + Q + P G
Sbjct: 346 NIIETCATMSWMQLTQMLLELTGDTKYADAIEKIMLNHVFAAQDALSGTCRYHTAP--NG 403
Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
HG CC +G S L + ++ E+G YI Q + +++ K+
Sbjct: 404 FKPDGYFHGPD-------CCTASGHRIISLL-PTFFYAEKGK--SFYINQLLPANYRGKA 453
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
+D +S + + ++ + G + L +R+P W + ++NG+
Sbjct: 454 --------IDFNISGNYPVSDSVVIDVNRMQG--NKLFIRVPAWC--DNPSITVNGKPQG 501
Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLS 602
G + ++WS D++ + LP+
Sbjct: 502 NVAAGKYYVVNKKWSKGDRIVMHLPMK 528
>gi|283787780|ref|YP_003367645.1| hypothetical protein ROD_42311 [Citrobacter rodentium ICC168]
gi|282951234|emb|CBG90928.1| conserved hypothetical protein [Citrobacter rodentium ICC168]
Length = 651
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 82/371 (22%), Positives = 134/371 (36%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P++L LA+ F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYLALANYFVEQRGTQPHFYDQEYEKRGKTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H P+ IG +R+ +Y + G + +N S
Sbjct: 252 QAHQPLAEQQTAIGHAVRF------VYLMTGVAHLARLNNDESKRQDCLRLWRNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASVGLMMFARRMLEMEADSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H FN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLSFNHIYDHVKPVRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G IY LYI Y+ +S + L ++ W +
Sbjct: 414 CCPPNIARVLTSIGHYIYTPRP---EALYINLYVGNSMELPLAGGTLRLRISGDYPW--H 468
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q + +L LR+P W A+ +LNG+ + ++ T W D
Sbjct: 469 EQVTIAVDSPQSIHH--TLALRLPDWCPQ--AKVALNGEEVAQDIRKGYIHITRSWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLRLTLPMPVR 535
>gi|161616753|ref|YP_001590718.1| hypothetical protein SPAB_04572 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|161366117|gb|ABX69885.1| hypothetical protein SPAB_04572 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
Length = 651
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 143/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P++++LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMMLASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG ++ +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLDVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|373456252|ref|ZP_09548019.1| protein of unknown function DUF1680 [Caldithrix abyssi DSM 13497]
gi|371717916|gb|EHO39687.1| protein of unknown function DUF1680 [Caldithrix abyssi DSM 13497]
Length = 676
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 121/584 (20%), Positives = 211/584 (36%), Gaps = 92/584 (15%)
Query: 185 IKEKMSTVVFSLSECQNKIGTGYLSAFP--TELFDSF---------EALKPVWAPYYTIH 233
IK+ + + L+ Q GY P T +FD+ E +K W P H
Sbjct: 119 IKKAKKWIEYILTHQQE---DGYFGPLPDSTRVFDNTKWGRRQAWQEKVKQDWWP----H 171
Query: 234 KILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DV 292
I+ ++ Y A Q ++ +M YF +++ + +W + GG N
Sbjct: 172 MIVLKVMQTYYEA--TQDERVLDFMRRYFQYQMKNIKE--KPLDYWTHWAKSRGGENLAS 227
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA--DYLSHFHA------NTH-IPIVIGS 343
+Y LY+ T D A L D LG + + D+ F + N H + +G
Sbjct: 228 IYWLYNHTGD------AFLLD----LGKIIFEQTLDWTQRFESANPQDWNWHGVNTAMGI 277
Query: 344 Q---MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEET 400
+ + Y+ + D Y ++ + H G +A E LA E+
Sbjct: 278 KQPGVWYQYSKDERYLKAVKTGIEKLMKHHGQVYGLWAADEL------LAGKDPVRGTES 331
Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR 460
CT + + + + + Y D ER N + + + Y L V R
Sbjct: 332 CTVVEYMFSLETMLQISGDAEYGDILERVALNALPAFLKPGHTARQYYQL--ANQVICDR 389
Query: 461 STHGWGTKFNS----------FWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
H + TK + CC + + K ++++ + N GL + Y S
Sbjct: 390 GWHNFSTKHGETELLFGLETGYGCCTANYHQGWPKYVMNLWYATQDN--GLAALVYAPSE 447
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
+ V N +V + D + + F K+ G +LR+P W + A +N
Sbjct: 448 V---TARVADNVEVTFVEETDYPFKERIKFICKKSNGVAFPFHLRIPEW--CDNAVVFVN 502
Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAG 630
G+ P G+ T RW D L + LP+ +R + A+ GP + A
Sbjct: 503 GKVYGKPQAGSITKVTRRWKKGDVLELYLPMKIRISYW------FQRSAAVERGPLVFAL 556
Query: 631 HTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSN---SNQSITMEEF 687
+ EW G + P P +N L+ + ++TF++ NQ T++
Sbjct: 557 GLNEEWKKIGGKEPYADYEVLPKDP-WNYGLLRNYVDHPDTTFIVKEFTVKNQPWTLKNA 615
Query: 688 PVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFP 731
PV ++I K + + + G + PF +P
Sbjct: 616 PV-----------KIIAKAKKIPEWKLYGGITG-PIPYSPFWYP 647
>gi|168260569|ref|ZP_02682542.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Hadar str. RI_05P066]
gi|205350487|gb|EDZ37118.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Hadar str. RI_05P066]
Length = 651
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + G+ L ++ W + ++ + S Q V +L LR+P W
Sbjct: 442 MYVGNSMEIPVGNGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +T+ LP+ +R A AI G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 556 PLVYCLEQADNGE 568
>gi|168818493|ref|ZP_02830493.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Weltevreden str. HI_N05-537]
gi|409247363|ref|YP_006888062.1| hypothetical protein SENTW_3778 [Salmonella enterica subsp.
enterica serovar Weltevreden str. 2007-60-3289-1]
gi|205344524|gb|EDZ31288.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Weltevreden str. HI_N05-537]
gi|320088097|emb|CBY97859.1| hypothetical protein SENTW_3778 [Salmonella enterica subsp.
enterica serovar Weltevreden str. 2007-60-3289-1]
Length = 651
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P++++LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMMLASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWC--PAAKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|378957466|ref|YP_005214953.1| hypothetical protein SPUL_3886 [Salmonella enterica subsp. enterica
serovar Gallinarum/pullorum str. RKS5078]
gi|438120755|ref|ZP_20872004.1| hypothetical protein SEEP9120_00215 [Salmonella enterica subsp.
enterica serovar Pullorum str. ATCC 9120]
gi|357208077|gb|AET56123.1| hypothetical protein SPUL_3886 [Salmonella enterica subsp. enterica
serovar Gallinarum/pullorum str. RKS5078]
gi|434943466|gb|ELL49584.1| hypothetical protein SEEP9120_00215 [Salmonella enterica subsp.
enterica serovar Pullorum str. ATCC 9120]
Length = 651
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + G+ L ++ W +++ + S Q V +L LR+P W
Sbjct: 442 MYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI--DSVQPVRH--TLALRLPDWCPE-- 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +T+ LP+ +R A AI G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 556 PLVYCLEQADNGE 568
>gi|417369073|ref|ZP_12140391.1| secreted protein [Salmonella enterica subsp. enterica serovar
Hvittingfoss str. A4-620]
gi|353585087|gb|EHC45022.1| secreted protein [Salmonella enterica subsp. enterica serovar
Hvittingfoss str. A4-620]
Length = 651
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + G+ L ++ W +++ + S Q V +L LR+P W
Sbjct: 442 MYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI--DSVQPVRH--TLALRLPDWCPE-- 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +T+ LP+ +R A AI G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 556 PLVYCLEQADNGE 568
>gi|197261863|ref|ZP_03161937.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gi|197240118|gb|EDY22738.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
Length = 651
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + G+ L ++ W +++ + S Q V +L LR+P W
Sbjct: 442 MYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI--DSVQPVRH--TLALRLPDWCPE-- 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +T+ LP+ +R A AI G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 556 PLVYCLEQADNGE 568
>gi|437530472|ref|ZP_20780573.1| hypothetical protein SEEE9317_09781, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 648899 3-17]
gi|435244046|gb|ELO24278.1| hypothetical protein SEEE9317_09781, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 648899 3-17]
Length = 349
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 32 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 84
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY LYI
Sbjct: 85 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTPR---ADALYIN 139
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + G+ L ++ W +++ + S Q V +L LR+P W
Sbjct: 140 MYVGNSMEIPVGNGALKLRIGGNYPWQEQVKIAI--DSVQPVRH--TLALRLPDWCPE-- 193
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +T+ LP+ +R A AI G
Sbjct: 194 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 253
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 254 PLVYCLEQADNGE 266
>gi|440285639|ref|YP_007338404.1| hypothetical protein D782_0140 [Enterobacteriaceae bacterium strain
FGI 57]
gi|440045161|gb|AGB76219.1| hypothetical protein D782_0140 [Enterobacteriaceae bacterium strain
FGI 57]
Length = 652
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 81/364 (22%), Positives = 131/364 (35%), Gaps = 73/364 (20%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
L RLY +T +P++ L F +P F + S++H +
Sbjct: 193 LMRLYDVTQEPRYQQLVRYFVEERGKQPHFYDIEYEKRGKTSYWHTYGPAWMVKDKAYSQ 252
Query: 335 THIPIV-----IGSQMR--YEVTG---------DPLYKLIGTFFMDIVNASHSYATGG-- 376
H PI IG +R Y +TG D + + + Y TGG
Sbjct: 253 AHQPIAEQPKAIGHAVRFVYLMTGVAHLARLSHDEGKRQDCLRLWNNMAQRQLYITGGIG 312
Query: 377 --TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
+S F D DT+ +E +C + ++ +R + + YAD ERAL N V
Sbjct: 313 SQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNTV 369
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGI 480
L + Y+ PL H FN + CC
Sbjct: 370 LG-GMALDGKHFFYVNPL--------EVHPKSLNFNHIYDHVKPVRQRWFGCACCPPNIA 420
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
+ +G IY + LY+ Y+ +S + G+ L + W +++T+
Sbjct: 421 RVLTSIGHYIYTPRD---EALYVNLYVGNSVEIPVGNETLRLTISGNYPWQEQIKITIDS 477
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
S + +L LR+P W + + LNG +L + RW D LT+ LP
Sbjct: 478 PSPVQ----HTLALRLPDWCVN--PRVILNGDAAEGTVEKGYLHLSRRWQEGDTLTLTLP 531
Query: 601 LSLR 604
+ +R
Sbjct: 532 MPIR 535
>gi|395228933|ref|ZP_10407251.1| cytoplasmic protein [Citrobacter sp. A1]
gi|424732388|ref|ZP_18160966.1| sugar (glycoside-pentoside-hexuronide) transporter [Citrobacter sp.
L17]
gi|394717639|gb|EJF23323.1| cytoplasmic protein [Citrobacter sp. A1]
gi|422893047|gb|EKU32896.1| sugar (glycoside-pentoside-hexuronide) transporter [Citrobacter sp.
L17]
Length = 651
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 60/243 (24%), Positives = 97/243 (39%), Gaps = 23/243 (9%)
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIE 481
RAL N VL + Y+ PL + H + W CC
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPLEVHPKSLKLNHIYDHVKPVRQRWFGCACCPPNIAR 421
Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFS 541
+ +G IY + LYI Y+ +S + + L ++ W + ++ +T
Sbjct: 422 VLTSIGHYIYTPRQD---ALYINMYVGNSMEVPVVNGSLKLRISGDYPW--HEQVKITIE 476
Query: 542 SKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPL 601
S Q V +L LR+P W + Q LNGQ + +L + W D L++ LP+
Sbjct: 477 SPQSV--YHTLALRLPDWC--SAPQVLLNGQPIEQDIRKGYLHISRTWQEGDTLSLTLPM 532
Query: 602 SLR 604
+R
Sbjct: 533 PVR 535
>gi|254163510|ref|YP_003046618.1| hypothetical protein ECB_03438 [Escherichia coli B str. REL606]
gi|253975411|gb|ACT41082.1| conserved hypothetical protein [Escherichia coli B str. REL606]
Length = 659
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 105/499 (21%), Positives = 181/499 (36%), Gaps = 97/499 (19%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
V +L A A +A +++ V+ ++ Q + GYL+ + T +A +
Sbjct: 74 VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNTYFT-----VKAPEER 126
Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
W+ H++ L++ V A + +V + + +V + H Y +
Sbjct: 127 WSNLAECHELYCAGHLIEAEVAFFQATGKRRLLEVVCRLADHIDRVFGPDESKLHGYPGH 186
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
E + L RLY +T +P++L L + F +P + + SH+H
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243
Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
+ H+ + IG +R+ +Y + G + ++ S
Sbjct: 244 MVKDKAYSQAHLSLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLW 297
Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
Y TGG +S F D DT+ +E +C + ++ +R + +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354
Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
YAD ERAL N VL + Y+ PL H KFN +
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPI 405
Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
CC + +G +Y E LYI Y +S + + L +V
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
W ++T+ S Q V +L LR+P W Q LNG+ + +L
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 516
Query: 586 TERWSYNDKLTIQLPLSLR 604
T W D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535
>gi|424897290|ref|ZP_18320864.1| hypothetical protein Rleg4DRAFT_3241 [Rhizobium leguminosarum bv.
trifolii WSM2297]
gi|393181517|gb|EJC81556.1| hypothetical protein Rleg4DRAFT_3241 [Rhizobium leguminosarum bv.
trifolii WSM2297]
Length = 640
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 90/395 (22%), Positives = 153/395 (38%), Gaps = 59/395 (14%)
Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
L +L +T + K+L L+ F + F A + FH T H P+
Sbjct: 198 ALVKLARVTGEKKYLDLSKFFIDERGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 258 REQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + + YAD E+AL NG L
Sbjct: 317 NEGFTDYYDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
T+ Y PL A H W K++ CC + +G +Y + +
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
+++ ++ +G V Q+V WD + F+++ E +L+LR+P W
Sbjct: 427 AVHLYGESTTRLKLANGAEVELQQVTNY-PWD----GAVAFTTRLEKPARFALSLRIPDW 481
Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
+ GA S+NG+ L L + +W+ D + + LPLSLR + + A
Sbjct: 482 --AEGATLSVNGEKLDLAATMRDGYARIDRQWADGDSVALHLPLSLRPQYANPKVRQDAG 539
Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
A++ GP + T T L+A++ P
Sbjct: 540 RVALMRGPLVYCVET-------TDNGADLNAIVLP 567
>gi|417514299|ref|ZP_12178139.1| secreted protein [Salmonella enterica subsp. enterica serovar
Senftenberg str. A4-543]
gi|353634280|gb|EHC80885.1| secreted protein [Salmonella enterica subsp. enterica serovar
Senftenberg str. A4-543]
Length = 651
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P++++LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMMLASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|167537610|ref|XP_001750473.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771013|gb|EDQ84687.1| predicted protein [Monosiga brevicollis MX1]
Length = 2823
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 72/172 (41%), Gaps = 21/172 (12%)
Query: 98 FLKEVSLHDVWLDQSSVLWRAQQTNLEYLLMLDVDSLVWSFRKTASLPTPGKAYGGWENP 157
F EV +V L SVL RA N+ YLL D L++ FR P P GW+
Sbjct: 93 FQVEVPTSNVTLTPGSVLRRAFDANIIYLLGHPTDDLLYFFRLRNGNPNPPGQCWGWD-- 150
Query: 158 ISELRGHFVGHYLSASAQM--WASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTEL 215
+ LRG G +L S + W NAT++ +M VV + Q + GY F
Sbjct: 151 -ANLRGSLAGEFLMGSGGISRWPMA-NATLRARMDEVVAGI--LQEQEADGYAMGF---- 202
Query: 216 FDSFEALKPVWA---PYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYN 264
A W P Y + GLL + +A N QAL + + +F N
Sbjct: 203 -----ARNETWTHENPDYVTSWVTHGLL-EAAIAGNEQALPLIRRHLNWFNN 248
>gi|417521365|ref|ZP_12183078.1| secreted protein [Salmonella enterica subsp. enterica serovar
Uganda str. R8-3404]
gi|353641628|gb|EHC86306.1| secreted protein [Salmonella enterica subsp. enterica serovar
Uganda str. R8-3404]
Length = 651
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P++++LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMMLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|424886647|ref|ZP_18310255.1| hypothetical protein Rleg10DRAFT_4682 [Rhizobium leguminosarum bv.
trifolii WSM2012]
gi|393175998|gb|EJC76040.1| hypothetical protein Rleg10DRAFT_4682 [Rhizobium leguminosarum bv.
trifolii WSM2012]
Length = 640
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 90/395 (22%), Positives = 155/395 (39%), Gaps = 59/395 (14%)
Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
L +L +T + K+L L+ F + F A + FH T H P+
Sbjct: 198 ALVKLARVTAEKKYLDLSKFFIDERGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 258 RQQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + + YAD E+AL NG L
Sbjct: 317 NEGFTDYYDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
T+ Y PL A H W K++ CC + +G +Y + +
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
+++ ++ +G V Q+V WD + F++K + +L+LR+P W
Sbjct: 427 AVHLYGESTARLKLANGAEVELQQVTNY-PWD----GAVAFATKLKTPARFALSLRIPDW 481
Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
+ GA S+NG+ L L + +W+ D++ + LPLSLR + + A
Sbjct: 482 --AEGATLSVNGERLDLGATMRDGYARLDRQWADGDRVDLFLPLSLRPQYANPKVRQDAG 539
Query: 618 IQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
A++ GP + T T + L+A++ P
Sbjct: 540 RVALMRGPLVYCVET-------TDNGQDLNAIVLP 567
>gi|416529897|ref|ZP_11744588.1| hypothetical protein SEEM010_12041 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
gi|416538915|ref|ZP_11749679.1| hypothetical protein SEEM030_12964 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB30]
gi|416553241|ref|ZP_11757602.1| hypothetical protein SEEM29N_15108 [Salmonella enterica subsp.
enterica serovar Montevideo str. 29N]
gi|417470705|ref|ZP_12166835.1| secreted protein [Salmonella enterica subsp. enterica serovar
Montevideo str. S5-403]
gi|353624652|gb|EHC73633.1| secreted protein [Salmonella enterica subsp. enterica serovar
Montevideo str. S5-403]
gi|363551713|gb|EHL36026.1| hypothetical protein SEEM010_12041 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
gi|363561277|gb|EHL45405.1| hypothetical protein SEEM030_12964 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB30]
gi|363563119|gb|EHL47199.1| hypothetical protein SEEM29N_15108 [Salmonella enterica subsp.
enterica serovar Montevideo str. 29N]
Length = 651
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIEQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSCDYDLPNDSIYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|194435948|ref|ZP_03068051.1| conserved hypothetical protein [Escherichia coli 101-1]
gi|253771579|ref|YP_003034410.1| hypothetical protein ECBD_0148 [Escherichia coli
'BL21-Gold(DE3)pLysS AG']
gi|254290260|ref|YP_003056008.1| hypothetical protein ECD_03438 [Escherichia coli BL21(DE3)]
gi|422788952|ref|ZP_16841686.1| hypothetical protein ERGG_04098 [Escherichia coli H489]
gi|442600526|ref|ZP_21018201.1| Putative glycosyl hydrolase of unknown function (DUF1680)
[Escherichia coli O5:K4(L):H4 str. ATCC 23502]
gi|194425491|gb|EDX41475.1| conserved hypothetical protein [Escherichia coli 101-1]
gi|253322623|gb|ACT27225.1| protein of unknown function DUF1680 [Escherichia coli
'BL21-Gold(DE3)pLysS AG']
gi|253979567|gb|ACT45237.1| conserved hypothetical protein [Escherichia coli BL21(DE3)]
gi|323959403|gb|EGB55063.1| hypothetical protein ERGG_04098 [Escherichia coli H489]
gi|441650536|emb|CCQ03630.1| Putative glycosyl hydrolase of unknown function (DUF1680)
[Escherichia coli O5:K4(L):H4 str. ATCC 23502]
Length = 659
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 105/499 (21%), Positives = 181/499 (36%), Gaps = 97/499 (19%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
V +L A A +A +++ V+ ++ Q + GYL+ + T +A +
Sbjct: 74 VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNTYFT-----VKAPEER 126
Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
W+ H++ L++ V A + +V + + +V + H Y +
Sbjct: 127 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLADHIDRVFGPDESKLHGYPGH 186
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
E + L RLY +T +P++L L + F +P + + SH+H
Sbjct: 187 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 243
Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
+ H+ + IG +R+ +Y + G + ++ S
Sbjct: 244 MVKDKAYSQAHLSLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLW 297
Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
Y TGG +S F D DT+ +E +C + ++ +R + +
Sbjct: 298 NNMAQRQLYITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 354
Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
YAD ERAL N VL + Y+ PL H KFN +
Sbjct: 355 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPI 405
Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
CC + +G +Y E LYI Y +S + + L +V
Sbjct: 406 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 462
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
W ++T+ S Q V +L LR+P W Q LNG+ + +L
Sbjct: 463 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 516
Query: 586 TERWSYNDKLTIQLPLSLR 604
T W D L + LP+ +R
Sbjct: 517 TREWQEGDTLNLTLPMPVR 535
>gi|262382783|ref|ZP_06075920.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262295661|gb|EEY83592.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 680
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 91/424 (21%), Positives = 170/424 (40%), Gaps = 41/424 (9%)
Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
++ +L QY A N Q ++ +++ YF ++ ++ S W E+ GG N V+
Sbjct: 164 VMLKVLQQYYSATNDQ--RVISFLTNYFKYQLSELPK--SPLGKWTFWAEQRGGDNLMVV 219
Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
Y LY+IT DP L L L K F + L D+L+ ++ + + G + + Y+
Sbjct: 220 YWLYNITGDPFLLELGELIHKQTFNWTDVFLNQDHLARQNSLHCVNLAQGFKEPVIYYQQ 279
Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKV 409
+ +P + + + + TG W + L ++ E CT M+
Sbjct: 280 SHNPQNLEAVKEAVRKMRHTIGFPTG------LWAGDELLRFGNPTQGSELCTAVEMMFS 333
Query: 410 SRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL--------GRG-VSKAR 460
+ T ++ +AD+ E+ N VL Q + Y + GR VS
Sbjct: 334 LEKMLEITGDVQWADHLEKVAYN-VLPTQIKDDFSARQYYQQVNQVAITCEGRNFVSPHE 392
Query: 461 STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--- 517
T + + + CC + + K ++F N G+ + Y S + G+
Sbjct: 393 DTDIVFGELSGYPCCTSNLHQGWPKFTRHLWFATADN--GIASLIYAPSEVTVQVGNDIT 450
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
V + +K + ++ + L+F SK++ +LR+P W N ++NG+ + +
Sbjct: 451 VKIAEKTN--YPFEEKIDFNLSFPSKKDKKAFFPFHLRIPAW--CNNPVITINGEAVSIA 506
Query: 578 P-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
G + W D + ++LP+ + T DD I GP L + +W
Sbjct: 507 AHSGEIVRINREWKDGDHIQLELPMRISTSNWYDD------AVVIERGPLLYSLKMDEKW 560
Query: 637 DIKT 640
+ K
Sbjct: 561 ERKV 564
>gi|251786831|ref|YP_003001135.1| ybl149 [Escherichia coli BL21(DE3)]
gi|242379104|emb|CAQ33906.1| ybl149 [Escherichia coli BL21(DE3)]
Length = 667
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 105/499 (21%), Positives = 181/499 (36%), Gaps = 97/499 (19%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
V +L A A +A +++ V+ ++ Q + GYL+ + T +A +
Sbjct: 82 VAKWLEAVAWSLCQKPDAELEKTADEVIELIASAQCE--DGYLNTYFT-----VKAPEER 134
Query: 226 WAPYYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
W+ H++ L++ V A + +V + + +V + H Y +
Sbjct: 135 WSNLAECHELYCAGHLIEAGVAFFQATGKRRLLEVVCRLADHIDRVFGPDESKLHGYPGH 194
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH------ 332
E + L RLY +T +P++L L + F +P + + SH+H
Sbjct: 195 PE---IELALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAW 251
Query: 333 -------ANTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS--------- 371
+ H+ + IG +R+ +Y + G + ++ S
Sbjct: 252 MVKDKAYSQAHLSLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLW 305
Query: 372 --------YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
Y TGG +S F D DT+ +E +C + ++ +R + +
Sbjct: 306 NNMAQRQLYITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGD 362
Query: 420 IAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW------ 473
YAD ERAL N VL + Y+ PL H KFN +
Sbjct: 363 SQYADVMERALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPI 413
Query: 474 --------CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
CC + +G +Y E LYI Y +S + + L +V
Sbjct: 414 RQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVS 470
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
W ++T+ S Q V +L LR+P W Q LNG+ + +L
Sbjct: 471 GNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHI 524
Query: 586 TERWSYNDKLTIQLPLSLR 604
T W D L + LP+ +R
Sbjct: 525 TREWQEGDTLNLTLPMPVR 543
>gi|386016685|ref|YP_005934975.1| hypothetical protein PAJ_2099 [Pantoea ananatis AJ13355]
gi|327394757|dbj|BAK12179.1| hypothetical protein PAJ_2099 [Pantoea ananatis AJ13355]
Length = 659
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 82/362 (22%), Positives = 131/362 (36%), Gaps = 69/362 (19%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
L RLY +T P++L L + F +P F + S++H +
Sbjct: 201 LMRLYEVTQQPRYLALVNTFVSQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 260
Query: 335 THIPIV-----IGSQMRYEVTGDPLYKLIGTFFM-----------DIVNASHS------Y 372
H P+ +G +R+ +Y + G + D + H+ Y
Sbjct: 261 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 314
Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG +S F D DT+ +E +C + ++ +R + + YAD ER
Sbjct: 315 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 371
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIES 482
AL N VL + Y+ PL H + W CC
Sbjct: 372 ALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIARL 430
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
+ LG IY E L+I Y+ + D G L ++ W+ +T++
Sbjct: 431 LTSLGHYIYTPHEN---ALFINLYVGNRVDVPVGDRTLGIRISGNFPWEE--TVTISVDV 485
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
Q V +L LR+P W Q S NG+ + +L W D LT+ LP+
Sbjct: 486 TQPVKH--TLALRLPDW--CEAPQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPMP 541
Query: 603 LR 604
+R
Sbjct: 542 VR 543
>gi|417386570|ref|ZP_12151238.1| secreted protein [Salmonella enterica subsp. enterica serovar
Johannesburg str. S5-703]
gi|353602920|gb|EHC58138.1| secreted protein [Salmonella enterica subsp. enterica serovar
Johannesburg str. S5-703]
Length = 651
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSCDYDLPNDSIYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|291618364|ref|YP_003521106.1| hypothetical Protein PANA_2811 [Pantoea ananatis LMG 20103]
gi|291153394|gb|ADD77978.1| Hypothetical Protein PANA_2811 [Pantoea ananatis LMG 20103]
Length = 659
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 82/362 (22%), Positives = 131/362 (36%), Gaps = 69/362 (19%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
L RLY +T P++L L + F +P F + S++H +
Sbjct: 201 LMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 260
Query: 335 THIPIV-----IGSQMRYEVTGDPLYKLIGTFFM-----------DIVNASHS------Y 372
H P+ +G +R+ +Y + G + D + H+ Y
Sbjct: 261 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 314
Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG +S F D DT+ +E +C + ++ +R + + YAD ER
Sbjct: 315 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 371
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIES 482
AL N VL + Y+ PL H + W CC
Sbjct: 372 ALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIARL 430
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
+ LG IY E L+I Y+ + D G L ++ W+ +T++
Sbjct: 431 LTSLGHYIYTPHEN---ALFINLYVGNRVDVPVGDRTLGIRISGNFPWEE--TVTISVDV 485
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
Q V +L LR+P W Q S NG+ + +L W D LT+ LP+
Sbjct: 486 TQPVKH--TLALRLPDW--CEAPQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPMP 541
Query: 603 LR 604
+R
Sbjct: 542 VR 543
>gi|416425586|ref|ZP_11692369.1| hypothetical protein SEEM315_05068 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|416430384|ref|ZP_11695001.1| hypothetical protein SEEM971_12506 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|416437565|ref|ZP_11698915.1| hypothetical protein SEEM973_00650 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|416443382|ref|ZP_11702995.1| hypothetical protein SEEM974_19090 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|416450281|ref|ZP_11707410.1| hypothetical protein SEEM201_09767 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|416460310|ref|ZP_11714693.1| hypothetical protein SEEM202_18350 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|416463475|ref|ZP_11715992.1| hypothetical protein SEEM954_09563 [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|416480379|ref|ZP_11722779.1| hypothetical protein SEEM054_17193 [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|416487797|ref|ZP_11725654.1| hypothetical protein SEEM675_20283 [Salmonella enterica subsp.
enterica serovar Montevideo str. OH_2009072675]
gi|416501897|ref|ZP_11732445.1| hypothetical protein SEEM965_19843 [Salmonella enterica subsp.
enterica serovar Montevideo str. CASC_09SCPH15965]
gi|416504577|ref|ZP_11733224.1| hypothetical protein SEEM031_12180 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|416517070|ref|ZP_11739340.1| hypothetical protein SEEM710_04917 [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|416543079|ref|ZP_11752034.1| hypothetical protein SEEM19N_14012 [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|416562276|ref|ZP_11762033.1| hypothetical protein SEEM42N_02114 [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
gi|416573654|ref|ZP_11767961.1| hypothetical protein SEEM41H_07852 [Salmonella enterica subsp.
enterica serovar Montevideo str. 4441 H]
gi|416578850|ref|ZP_11770886.1| hypothetical protein SEEM801_22092 [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|416584544|ref|ZP_11774245.1| hypothetical protein SEEM507_12719 [Salmonella enterica subsp.
enterica serovar Montevideo str. MD_MDA09249507]
gi|416589552|ref|ZP_11777137.1| hypothetical protein SEEM877_17791 [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|416607005|ref|ZP_11788219.1| hypothetical protein SEEM180_02276 [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|416611569|ref|ZP_11790943.1| hypothetical protein SEEM600_22079 [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|416624752|ref|ZP_11798278.1| hypothetical protein SEEM581_05149 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|416626628|ref|ZP_11798711.1| hypothetical protein SEEM501_13368 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|416644435|ref|ZP_11806741.1| hypothetical protein SEEM460_09301 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|416648059|ref|ZP_11808823.1| hypothetical protein SEEM020_002973 [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|416658271|ref|ZP_11814206.1| hypothetical protein SEEM6152_18571 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|416668027|ref|ZP_11818653.1| hypothetical protein SEEM0077_13550 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|416681176|ref|ZP_11823586.1| hypothetical protein SEEM0047_06329 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB102109-0047]
gi|416694001|ref|ZP_11826910.1| hypothetical protein SEEM0055_00612 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|416708995|ref|ZP_11833799.1| hypothetical protein SEEM0052_07867 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|416712890|ref|ZP_11836552.1| hypothetical protein SEEM3312_21482 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|416721065|ref|ZP_11842596.1| hypothetical protein SEEM5258_14872 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|416722793|ref|ZP_11843619.1| hypothetical protein SEEM1156_02237 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|416729527|ref|ZP_11848104.1| hypothetical protein SEEM9199_08226 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|416741866|ref|ZP_11855415.1| hypothetical protein SEEM8282_16962 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|416745954|ref|ZP_11857573.1| hypothetical protein SEEM8283_19676 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|416755322|ref|ZP_11861983.1| hypothetical protein SEEM8284_15210 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008284]
gi|416763125|ref|ZP_11866955.1| hypothetical protein SEEM8285_00470 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|416771775|ref|ZP_11872954.1| hypothetical protein SEEM8287_16934 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|418485126|ref|ZP_13054112.1| hypothetical protein SEEM906_06936 [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|418491104|ref|ZP_13057631.1| hypothetical protein SEEM5278_11244 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|418494659|ref|ZP_13061110.1| hypothetical protein SEEM5318_03413 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|418499800|ref|ZP_13066201.1| hypothetical protein SEEM5320_09457 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|418503417|ref|ZP_13069781.1| hypothetical protein SEEM5321_21740 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|418508996|ref|ZP_13075294.1| hypothetical protein SEEM5327_04049 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|418525130|ref|ZP_13091112.1| hypothetical protein SEEM8286_03017 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
gi|322613936|gb|EFY10872.1| hypothetical protein SEEM315_05068 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|322620305|gb|EFY17173.1| hypothetical protein SEEM971_12506 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|322625311|gb|EFY22138.1| hypothetical protein SEEM973_00650 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|322630022|gb|EFY26795.1| hypothetical protein SEEM974_19090 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|322634213|gb|EFY30948.1| hypothetical protein SEEM201_09767 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|322635886|gb|EFY32595.1| hypothetical protein SEEM202_18350 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|322643086|gb|EFY39661.1| hypothetical protein SEEM954_09563 [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|322644583|gb|EFY41119.1| hypothetical protein SEEM054_17193 [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|322650825|gb|EFY47217.1| hypothetical protein SEEM675_20283 [Salmonella enterica subsp.
enterica serovar Montevideo str. OH_2009072675]
gi|322653011|gb|EFY49346.1| hypothetical protein SEEM965_19843 [Salmonella enterica subsp.
enterica serovar Montevideo str. CASC_09SCPH15965]
gi|322659974|gb|EFY56214.1| hypothetical protein SEEM19N_14012 [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|322663307|gb|EFY59511.1| hypothetical protein SEEM801_22092 [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|322668793|gb|EFY64946.1| hypothetical protein SEEM507_12719 [Salmonella enterica subsp.
enterica serovar Montevideo str. MD_MDA09249507]
gi|322674404|gb|EFY70497.1| hypothetical protein SEEM877_17791 [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|322680894|gb|EFY76928.1| hypothetical protein SEEM180_02276 [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|322687170|gb|EFY83143.1| hypothetical protein SEEM600_22079 [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|323192129|gb|EFZ77362.1| hypothetical protein SEEM581_05149 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|323200633|gb|EFZ85707.1| hypothetical protein SEEM501_13368 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|323201343|gb|EFZ86409.1| hypothetical protein SEEM460_09301 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|323211827|gb|EFZ96659.1| hypothetical protein SEEM6152_18571 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|323216186|gb|EGA00914.1| hypothetical protein SEEM0077_13550 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|323220409|gb|EGA04863.1| hypothetical protein SEEM0047_06329 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB102109-0047]
gi|323226266|gb|EGA10481.1| hypothetical protein SEEM0055_00612 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|323228386|gb|EGA12517.1| hypothetical protein SEEM0052_07867 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|323234207|gb|EGA18295.1| hypothetical protein SEEM3312_21482 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|323237192|gb|EGA21259.1| hypothetical protein SEEM5258_14872 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|323244711|gb|EGA28715.1| hypothetical protein SEEM1156_02237 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|323249192|gb|EGA33110.1| hypothetical protein SEEM9199_08226 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|323250689|gb|EGA34569.1| hypothetical protein SEEM8282_16962 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|323257564|gb|EGA41251.1| hypothetical protein SEEM8283_19676 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|323262273|gb|EGA45834.1| hypothetical protein SEEM8284_15210 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008284]
gi|323266172|gb|EGA49663.1| hypothetical protein SEEM8285_00470 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|323268806|gb|EGA52264.1| hypothetical protein SEEM8287_16934 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|363557827|gb|EHL42031.1| hypothetical protein SEEM031_12180 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|363561441|gb|EHL45559.1| hypothetical protein SEEM710_04917 [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|363571665|gb|EHL55571.1| hypothetical protein SEEM41H_07852 [Salmonella enterica subsp.
enterica serovar Montevideo str. 4441 H]
gi|363573358|gb|EHL57244.1| hypothetical protein SEEM42N_02114 [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
gi|366056585|gb|EHN20901.1| hypothetical protein SEEM906_06936 [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|366061420|gb|EHN25666.1| hypothetical protein SEEM5318_03413 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|366063348|gb|EHN27567.1| hypothetical protein SEEM5278_11244 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|366069988|gb|EHN34105.1| hypothetical protein SEEM5320_09457 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|366073016|gb|EHN37095.1| hypothetical protein SEEM5321_21740 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|366078850|gb|EHN42847.1| hypothetical protein SEEM5327_04049 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|366830119|gb|EHN56993.1| hypothetical protein SEEM020_002973 [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|372206701|gb|EHP20203.1| hypothetical protein SEEM8286_03017 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
Length = 651
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIEQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVHH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|386078433|ref|YP_005991958.1| hypothetical protein [Pantoea ananatis PA13]
gi|354987614|gb|AER31738.1| hypothetical protein PAGR_g1212 [Pantoea ananatis PA13]
Length = 651
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 82/362 (22%), Positives = 131/362 (36%), Gaps = 69/362 (19%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
L RLY +T P++L L + F +P F + S++H +
Sbjct: 193 LMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 252
Query: 335 THIPIV-----IGSQMRYEVTGDPLYKLIGTFFM-----------DIVNASHS------Y 372
H P+ +G +R+ +Y + G + D + H+ Y
Sbjct: 253 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 306
Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG +S F D DT+ +E +C + ++ +R + + YAD ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIES 482
AL N VL + Y+ PL H + W CC
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIARL 422
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
+ LG IY E L+I Y+ + D G L ++ W+ +T++
Sbjct: 423 LTSLGHYIYTPHEN---ALFINLYVGNRVDVPVGDRTLGIRISGNFPWEE--TVTISVDV 477
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
Q V +L LR+P W Q S NG+ + +L W D LT+ LP+
Sbjct: 478 TQPVKH--TLALRLPDW--CEAPQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPMP 533
Query: 603 LR 604
+R
Sbjct: 534 VR 535
>gi|298247843|ref|ZP_06971648.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
gi|297550502|gb|EFH84368.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
Length = 643
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 101/464 (21%), Positives = 180/464 (38%), Gaps = 60/464 (12%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPV 225
V ++ A++ A T +A +++++ V+ ++ Q+ GYL+ + SFE
Sbjct: 92 VYKWVEAASWTLAQTPDARLEQQLDEVIALIASAQDD--DGYLNTY-----YSFERQAER 144
Query: 226 WAPYYTIHKI-LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
W+ +H++ AG L Q +A + K + +++ + +++ +
Sbjct: 145 WSNLTDMHELYCAGHLLQAAVAHHRATGKAS--LLDVATRVANNIASVFGPQGR-----P 197
Query: 285 ETGGMNDV---LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF----- 331
T G ++ L L T +P++L A F KP L D+L
Sbjct: 198 GTCGHPEIELALVELARETGEPRYLQQAQFFIGQRGQKPPVLNGSPYCQDHLPVREQQEV 257
Query: 332 --HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL 389
HA + + G Y TG+ + +Y TGG +R W+ +
Sbjct: 258 VGHAVRALYLYAGVTDAYLETGEAALDHAQEALWQNLTERKTYVTGGVGSR---WEGEAF 314
Query: 390 ADTLGSENE----ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
+ NE ETC + + L + E + D E+ L NGV++ + +
Sbjct: 315 GENYELPNERAYTETCAAIASVMWNWRLLQARPEARFTDVIEQTLYNGVIA-GSSLDGKL 373
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
Y PL R H F++ CC + L Y E G+++
Sbjct: 374 YFYQNPLA-----DRGKHRRQPWFDTA-CCPPNIARLLASLPGYFYSTSE---EGIWLHL 424
Query: 506 YISSS--FDWKSGHVV-LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
Y S++ SG + + Q+ + WD + + L Q+ +L +R+P W +
Sbjct: 425 YASNTAQIPLASGEAITIEQQTN--YPWDEEIGVRLQMREAQDF----TLFVRIPAW--A 476
Query: 563 NGAQASLNGQNLP--LPPPGNFLSATERWSYNDKLTIQLPLSLR 604
GAQ +N Q + PG + W DK+TI LPL +R
Sbjct: 477 TGAQIQVNKQPVEGLAIKPGTYAQLNRTWQPGDKVTIVLPLEVR 520
>gi|416597563|ref|ZP_11782144.1| hypothetical protein SEEM867_21594 [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
gi|322678388|gb|EFY74449.1| hypothetical protein SEEM867_21594 [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
Length = 651
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIEQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVHH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|168465016|ref|ZP_02698908.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
gi|418762014|ref|ZP_13318148.1| hypothetical protein SEEN185_06317 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35185]
gi|418768178|ref|ZP_13324234.1| hypothetical protein SEEN199_22735 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35199]
gi|418769292|ref|ZP_13325327.1| hypothetical protein SEEN539_08791 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21539]
gi|418774344|ref|ZP_13330315.1| hypothetical protein SEEN953_04837 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 33953]
gi|418782301|ref|ZP_13338167.1| hypothetical protein SEEN188_15334 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35188]
gi|418784431|ref|ZP_13340269.1| hypothetical protein SEEN559_22215 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21559]
gi|418804570|ref|ZP_13360175.1| hypothetical protein SEEN202_19624 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35202]
gi|419790711|ref|ZP_14316381.1| hypothetical protein SEENLE01_19942 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|419795154|ref|ZP_14320760.1| hypothetical protein SEENLE15_09022 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|195632371|gb|EDX50855.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
gi|392613400|gb|EIW95860.1| hypothetical protein SEENLE01_19942 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|392613862|gb|EIW96317.1| hypothetical protein SEENLE15_09022 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|392732968|gb|EIZ90175.1| hypothetical protein SEEN199_22735 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35199]
gi|392738037|gb|EIZ95186.1| hypothetical protein SEEN185_06317 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35185]
gi|392740729|gb|EIZ97848.1| hypothetical protein SEEN539_08791 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21539]
gi|392744606|gb|EJA01653.1| hypothetical protein SEEN188_15334 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35188]
gi|392751846|gb|EJA08794.1| hypothetical protein SEEN953_04837 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 33953]
gi|392754775|gb|EJA11691.1| hypothetical protein SEEN559_22215 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21559]
gi|392770727|gb|EJA27452.1| hypothetical protein SEEN202_19624 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35202]
Length = 651
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + + L ++ W +++T+ S Q V +L LR+P W
Sbjct: 442 MYVGNSMEIPVENGALKLRISGNYPWQEQVKITI--DSVQPVRH--TLALRLPDWCPE-- 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +T+ LP+ +R A AI G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 556 PLVYCLEQADNGE 568
>gi|167549076|ref|ZP_02342835.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
gi|205325554|gb|EDZ13393.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
Length = 651
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 93/253 (36%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY LYI
Sbjct: 387 --EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + + L ++ W + +M + S Q V +L LR+P W
Sbjct: 442 MYVGNSLEVPVENGALKLRIGGNYPW--HEQMKIAIDSVQPVRH--TLALRLPDWCPE-- 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +T+ LP+ +R A AI G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 556 PLVYCLEQADNGE 568
>gi|417329582|ref|ZP_12114395.1| secreted protein [Salmonella enterica subsp. enterica serovar
Adelaide str. A4-669]
gi|353564565|gb|EHC30601.1| secreted protein [Salmonella enterica subsp. enterica serovar
Adelaide str. A4-669]
Length = 651
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + + L ++ W +++T+ S Q V +L LR+P W
Sbjct: 442 MYVGNSMEIPVENGALKLRISGNYPWQEQVKITI--DSVQPVRH--TLALRLPDWCPE-- 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +T+ LP+ +R A AI G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 556 PLVYCLEQADNGE 568
>gi|315607261|ref|ZP_07882261.1| conserved hypothetical protein [Prevotella buccae ATCC 33574]
gi|315250964|gb|EFU30953.1| conserved hypothetical protein [Prevotella buccae ATCC 33574]
Length = 813
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 103/451 (22%), Positives = 173/451 (38%), Gaps = 95/451 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T ++L +A F + G LS + + H PI ++G +R
Sbjct: 224 ALCKLYKVTGSRRYLDMARYFVEETGRG---TDGHRLSEY-SQDHKPILRQQEIVGHAVR 279
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+TGD Y + + + TGG +R + G
Sbjct: 280 AGYLYSGVADVAALTGDTAYFHALERLWNNMAGKKLFITGGMGSRA-------QGEGFGP 332
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
+ E ETC + + + +F T E Y D YERAL NGVLS GV +
Sbjct: 333 DYELNNMTAYQETCASIANVFWNYRMFLATGEGKYVDVYERALYNGVLS-------GVSL 385
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G + + Y ++
Sbjct: 386 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGNVTRFVASVPQYQYAVRGSDI--- 436
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT- 560
Y+ YI + D +G + Q P WD +T+T K+ + +L R+P W
Sbjct: 437 YVNLYIQGTAD-VNGVRLAQQTRYP---WDG--DITVTVDPKRS--RRFALRFRIPGWAG 488
Query: 561 ----------YSNGAQ---ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA 607
+++ ++ +NG+ + P ++ RW D++ I LP+ +R A
Sbjct: 489 ACPVGTNLYHFADSSRPFTVKVNGRKIAGEPVDGYMVIDRRWMRGDRVEISLPMEVRRVA 548
Query: 608 ----IQDDRPEYASIQAILFGP--YLLAGHTSGEWDIKTGTARSLSALISPIPPSFNA-Q 660
++DDR +Y A+ GP Y L G + + R L +PI + A +
Sbjct: 549 ANDNVEDDRGKY----ALERGPIVYCLEGRDQAHSTVFDKSVR----LDAPIRADYRADK 600
Query: 661 LVTFTQESGNSTFVMSN-SNQSITMEEFPVS 690
L + SG + V ++ S + + + P S
Sbjct: 601 LNGIVELSGEAEEVEADGSVRPVAFKAIPYS 631
>gi|16766964|ref|NP_462579.1| hypothetical protein STM3679 [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gi|167990915|ref|ZP_02572014.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|374978319|ref|ZP_09719662.1| secreted protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. TN061786]
gi|378447048|ref|YP_005234680.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. D23580]
gi|378452556|ref|YP_005239916.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. 14028S]
gi|378701566|ref|YP_005183524.1| hypothetical protein SL1344_3644 [Salmonella enterica subsp.
enterica serovar Typhimurium str. SL1344]
gi|378986276|ref|YP_005249432.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|378990981|ref|YP_005254145.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. UK-1]
gi|379702940|ref|YP_005244668.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. ST4/74]
gi|383498313|ref|YP_005399002.1| hypothetical protein UMN798_3997 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
gi|422027921|ref|ZP_16374245.1| hypothetical protein B571_18397 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|422032964|ref|ZP_16379054.1| hypothetical protein B572_18529 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|427555556|ref|ZP_18929550.1| hypothetical protein B576_18491 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|427573106|ref|ZP_18934155.1| hypothetical protein B577_17365 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|427594481|ref|ZP_18939063.1| hypothetical protein B573_17940 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|427618885|ref|ZP_18943976.1| hypothetical protein B574_18451 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|427642409|ref|ZP_18948833.1| hypothetical protein B575_18572 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|427657950|ref|ZP_18953577.1| hypothetical protein B578_18134 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|427663174|ref|ZP_18958453.1| hypothetical protein B579_18597 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|427679110|ref|ZP_18963359.1| hypothetical protein B580_18826 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|427801169|ref|ZP_18968792.1| hypothetical protein B581_21834 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm5]
gi|16422244|gb|AAL22538.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gi|205330807|gb|EDZ17571.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|261248827|emb|CBG26680.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. D23580]
gi|267995935|gb|ACY90820.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. 14028S]
gi|301160215|emb|CBW19737.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. SL1344]
gi|312914705|dbj|BAJ38679.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|321226733|gb|EFX51783.1| secreted protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. TN061786]
gi|323132039|gb|ADX19469.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. ST4/74]
gi|332990528|gb|AEF09511.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. UK-1]
gi|380465134|gb|AFD60537.1| hypothetical protein UMN798_3997 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
gi|414013156|gb|EKS97053.1| hypothetical protein B571_18397 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|414014140|gb|EKS97993.1| hypothetical protein B576_18491 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|414014578|gb|EKS98419.1| hypothetical protein B572_18529 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|414027997|gb|EKT11199.1| hypothetical protein B577_17365 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|414029273|gb|EKT12434.1| hypothetical protein B573_17940 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|414031641|gb|EKT14688.1| hypothetical protein B574_18451 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|414042773|gb|EKT25304.1| hypothetical protein B575_18572 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|414043221|gb|EKT25734.1| hypothetical protein B578_18134 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|414047893|gb|EKT30155.1| hypothetical protein B579_18597 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|414056107|gb|EKT37949.1| hypothetical protein B580_18826 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|414062669|gb|EKT43947.1| hypothetical protein B581_21834 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm5]
Length = 651
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 142/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRRRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG ++ +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLDVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|237728888|ref|ZP_04559369.1| conserved hypothetical protein [Citrobacter sp. 30_2]
gi|226909510|gb|EEH95428.1| conserved hypothetical protein [Citrobacter sp. 30_2]
Length = 651
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 83/220 (37%), Gaps = 32/220 (14%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + +G IY + LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTPRQD---ALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + L ++ W + ++ + S Q + +L LR+P W
Sbjct: 442 MYVGNSMEVPVADGSLKLRISGDYPW--HEQVKIAIESPQSI--YHTLALRLPDWC--TA 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
Q LNGQ + +L + W D L++ LP+ +R
Sbjct: 496 PQVLLNGQPVEQDIRKGYLHISRTWQEGDTLSLTLPMPVR 535
>gi|418511390|ref|ZP_13077652.1| hypothetical protein SEEPO729_11665 [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
gi|366084797|gb|EHN48695.1| hypothetical protein SEEPO729_11665 [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
Length = 651
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|421844899|ref|ZP_16278055.1| hypothetical protein D186_07681 [Citrobacter freundii ATCC 8090 =
MTCC 1658]
gi|411773762|gb|EKS57290.1| hypothetical protein D186_07681 [Citrobacter freundii ATCC 8090 =
MTCC 1658]
gi|455645502|gb|EMF24562.1| hypothetical protein H262_06439 [Citrobacter freundii GTC 09479]
Length = 651
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/220 (23%), Positives = 86/220 (39%), Gaps = 32/220 (14%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + +G IY + LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTPRQD---ALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + + L ++ W + ++ +T S + V +L LR+P W +
Sbjct: 442 MYVGNSMEVPVVNGSLKLRISGDYPW--HEQVKITIESPRSV--YHTLALRLPDWC--SA 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
Q LNGQ + +L + W D L++ LP+ +R
Sbjct: 496 PQVLLNGQPVEQDIRKGYLHISRTWQEGDTLSLTLPMPVR 535
>gi|302809111|ref|XP_002986249.1| hypothetical protein SELMODRAFT_425170 [Selaginella moellendorffii]
gi|300146108|gb|EFJ12780.1| hypothetical protein SELMODRAFT_425170 [Selaginella moellendorffii]
Length = 192
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 33/52 (63%), Gaps = 3/52 (5%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD 217
GHYLSA+A++WASTHNA +K++M +V L+ECQ S P LF
Sbjct: 7 AGHYLSATAKLWASTHNAEVKKRMDALVNILAECQ---AASRKSELPVNLFQ 55
>gi|417361434|ref|ZP_12135327.1| secreted protein [Salmonella enterica subsp. enterica serovar Give
str. S5-487]
gi|353584072|gb|EHC44282.1| secreted protein [Salmonella enterica subsp. enterica serovar Give
str. S5-487]
Length = 651
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSCDYDLPNDSIYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSVEIPVENGALKLRIGGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|194430977|ref|ZP_03063270.1| conserved hypothetical protein [Shigella dysenteriae 1012]
gi|417675158|ref|ZP_12324583.1| hypothetical protein SD15574_4764 [Shigella dysenteriae 155-74]
gi|194420432|gb|EDX36508.1| conserved hypothetical protein [Shigella dysenteriae 1012]
gi|332084488|gb|EGI89683.1| hypothetical protein SD15574_4764 [Shigella dysenteriae 155-74]
Length = 656
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + L + +R
Sbjct: 525 TLNLTLSMPVR 535
>gi|429083191|ref|ZP_19146237.1| COG3533 secreted protein [Cronobacter condimenti 1330]
gi|426548006|emb|CCJ72278.1| COG3533 secreted protein [Cronobacter condimenti 1330]
Length = 651
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 80/212 (37%), Gaps = 16/212 (7%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLEVHPKT 392
Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
H + W CC + LG IY LYI Y+ +S +
Sbjct: 393 LCLNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTPRPD---ALYINLYVGNSIE 449
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
G VL +V W ++ + S V +L LRMP W + Q +LNG
Sbjct: 450 VPVGENVLRLRVSGNFPWQE--KVVIAIDSPLPVQH--TLALRMPDWC--DAPQVTLNGI 503
Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+ +L W D LT+ LP+ +R
Sbjct: 504 EVEKSVRKGYLHIPRVWREGDTLTLTLPMPVR 535
>gi|365102501|ref|ZP_09332802.1| hypothetical protein HMPREF9428_03810 [Citrobacter freundii
4_7_47CFAA]
gi|363646229|gb|EHL85477.1| hypothetical protein HMPREF9428_03810 [Citrobacter freundii
4_7_47CFAA]
Length = 651
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 83/220 (37%), Gaps = 32/220 (14%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + +G IY + LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTPRQD---ALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + L ++ W + ++ + S Q + +L LR+P W
Sbjct: 442 MYVGNSMEVPVADGSLKLRISGDYPW--HEQVKIAIESPQSI--YHTLALRLPDWC--TA 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
Q LNGQ + +L + W D L++ LP+ +R
Sbjct: 496 PQVLLNGQPVEQDIRKGYLHISRTWQEGDTLSLTLPMPVR 535
>gi|420349607|ref|ZP_14850981.1| hypothetical protein SB96558_4565 [Shigella boydii 965-58]
gi|391265984|gb|EIQ24949.1| hypothetical protein SB96558_4565 [Shigella boydii 965-58]
Length = 656
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + L + +R
Sbjct: 525 TLNLTLSMPVR 535
>gi|340619112|ref|YP_004737565.1| hypothetical protein zobellia_3147 [Zobellia galactanivorans]
gi|339733909|emb|CAZ97286.1| Conserved hypothetical periplasmic protein [Zobellia
galactanivorans]
Length = 681
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 90/402 (22%), Positives = 148/402 (36%), Gaps = 45/402 (11%)
Query: 293 LYRLYSITHDPKHLLLAHLF---------DKPCFLGF------LALQADYLSHFHANTHI 337
L +Y T D K+L L F D+ G A++ + + HA
Sbjct: 235 LIEMYRTTGDKKYLELTETFVDMLGTAPKDRLDHRGMDHSQRGTAIREESKAVGHAGHAN 294
Query: 338 PIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRL-ADTLGSE 396
+ G Y TGD K V+ Y TG T F + A+ G +
Sbjct: 295 YLYAGVADLYAETGDQALKDALERIWTNVSTQKMYITGATGPHHFGISNHAIVAEAYGQD 354
Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMI 447
E ETC + +F E +AD E N +S I E
Sbjct: 355 YELPNIKAYNETCANIGNAMWNWRMFLMNGEGRFADIMELIFYNSAISGISLDGEHFFYT 414
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
L G + G +F S +CC I + +K+ Y E G+++ Y
Sbjct: 415 NPLRFIEGHPQNTKDEGKRGEFMSVFCCPPNIIRTIAKMHTYAYSTSE---KGIWVNLYG 471
Query: 508 SSSFD---WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
S+ D ++ L Q+ + WD +++T+ K+E +L LR+P W + G
Sbjct: 472 SNVLDTDLADGSNIKLTQESN--YPWDGNIKITIDSKKKKEY----ALMLRIPAW--AEG 523
Query: 565 AQASLNGQNLPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
A +NG+ P G++ +W D + ++LP++ R + E + A+
Sbjct: 524 ANIKVNGEKQDQSPKAGSYAEVNRKWKKGDVVELELPMAPRLITADPNVEETRNQVAVKR 583
Query: 624 GPYLLAGHTSGEWDIKTGTARSLSALISPIP--PSFNAQLVT 663
GP + + D+ G+ L S I P + A L++
Sbjct: 584 GPIVYCLESK---DLAAGSNIKDIVLPSDIKLQPKYEADLLS 622
>gi|416288023|ref|ZP_11649060.1| hypothetical protein SGB_04738 [Shigella boydii ATCC 9905]
gi|320178140|gb|EFW53118.1| hypothetical protein SGB_04738 [Shigella boydii ATCC 9905]
Length = 656
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + L + +R
Sbjct: 525 TLNLTLSMPVR 535
>gi|419730921|ref|ZP_14257856.1| hypothetical protein SEEH1579_14647 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|419735086|ref|ZP_14261970.1| hypothetical protein SEEH1563_04074 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|419740253|ref|ZP_14266986.1| hypothetical protein SEEH1573_01156 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|419743535|ref|ZP_14270200.1| hypothetical protein SEEH1566_22821 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|419746688|ref|ZP_14273264.1| hypothetical protein SEEH1565_23798 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|381293311|gb|EIC34483.1| hypothetical protein SEEH1579_14647 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|381295529|gb|EIC36640.1| hypothetical protein SEEH1573_01156 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|381295907|gb|EIC37016.1| hypothetical protein SEEH1563_04074 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|381312020|gb|EIC52830.1| hypothetical protein SEEH1566_22821 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|381320971|gb|EIC61499.1| hypothetical protein SEEH1565_23798 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
Length = 651
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRSLKFNHIYEHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|168241855|ref|ZP_02666787.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL486]
gi|194451278|ref|YP_002047708.1| hypothetical protein SeHA_C4002 [Salmonella enterica subsp.
enterica serovar Heidelberg str. SL476]
gi|386593352|ref|YP_006089752.1| hypothetical protein SU5_04156 [Salmonella enterica subsp. enterica
serovar Heidelberg str. B182]
gi|421571246|ref|ZP_16016925.1| hypothetical protein CFSAN00322_13684 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00322]
gi|421575202|ref|ZP_16020815.1| hypothetical protein CFSAN00325_10357 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00325]
gi|421579160|ref|ZP_16024730.1| hypothetical protein CFSAN00326_07107 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00326]
gi|421586317|ref|ZP_16031800.1| hypothetical protein CFSAN00328_20048 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00328]
gi|194409582|gb|ACF69801.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL476]
gi|205339076|gb|EDZ25840.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL486]
gi|383800393|gb|AFH47475.1| DUF1680 Glycosyl hydrolase [Salmonella enterica subsp. enterica
serovar Heidelberg str. B182]
gi|402521555|gb|EJW28891.1| hypothetical protein CFSAN00322_13684 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00322]
gi|402522242|gb|EJW29566.1| hypothetical protein CFSAN00325_10357 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00325]
gi|402523131|gb|EJW30450.1| hypothetical protein CFSAN00326_07107 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00326]
gi|402529042|gb|EJW36291.1| hypothetical protein CFSAN00328_20048 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00328]
Length = 651
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|417337268|ref|ZP_12119473.1| secreted protein [Salmonella enterica subsp. enterica serovar
Alachua str. R6-377]
gi|353565179|gb|EHC31033.1| secreted protein [Salmonella enterica subsp. enterica serovar
Alachua str. R6-377]
Length = 651
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|238023985|ref|YP_002908217.1| hypothetical protein [Burkholderia glumae BGR1]
gi|237878650|gb|ACR30982.1| Hypothetical protein bglu_2g05390 [Burkholderia glumae BGR1]
Length = 655
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 98/485 (20%), Positives = 184/485 (37%), Gaps = 74/485 (15%)
Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAP 228
+L A A + A +A +++ + L+ Q+ GYL+ + T +A W
Sbjct: 78 WLEAVAYLLAEQRDAELEQIADETIDLLARAQHD--DGYLNTYFT-----IKAPGQRWTN 130
Query: 229 YYTIHKILAG--LLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEET 286
H++ L++ V A + + E F + V + + + Y + E
Sbjct: 131 LAECHELYCAGHLIEAAVAYWQATGKRKLLEVAERFVAHIDTVFGTEAGKLNGYPGHPE- 189
Query: 287 GGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF---------- 331
+ L RL+ ++ +P+HL LA F +P + + +SH+
Sbjct: 190 --IELALMRLHEVSGNPRHLALARYFVEQRGARPHYYDIEYEKRGRVSHWDVHGRAWITT 247
Query: 332 ---HANTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSY 372
++ H PI +G +R V+GD + + Y
Sbjct: 248 HKAYSQAHKPIAEQDAAVGHAVRLVYLYAGVAHLARVSGDAAKLNVCKAVWRNMVTRQMY 307
Query: 373 ATGGTSAR----EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG A+ F D + DT +E TC + ++ +R + ++E YAD ER
Sbjct: 308 VTGGIGAQVWGESFTCDYELPNDTAYTE---TCASVGLVFFARRMLEASRESGYADVLER 364
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW------GTKFNSFWCCYGTGIES 482
AL N VL+ G + Y+ PL + R H + ++ CC
Sbjct: 365 ALYNTVLA-GIGLDGRSFFYVNPLETHPAGIRGNHKYEHVKPVRQRWFGCACCPPNVARL 423
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG--HVVLNQKVDPIVSWDPYLRMTLTF 540
+ L +Y ++ + Y+ Y++ +G V L Q+ + W LR+ +
Sbjct: 424 IASLDQYVYLVDDSII---YVNLYVAGEARLNAGTSRVTLRQQGN--YPWRGDLRIVV-- 476
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP-GNFLSATERWSYNDKLTIQL 599
+Q G ++ +R+P W + + +NG + +L W D + + L
Sbjct: 477 --EQADGFDGTIAVRLPDWCAA--PEVRVNGDTVACSAAVDGYLHLPRVWHDGDTIELVL 532
Query: 600 PLSLR 604
P+++R
Sbjct: 533 PMTVR 537
>gi|200389015|ref|ZP_03215627.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Virchow str. SL491]
gi|199606113|gb|EDZ04658.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Virchow str. SL491]
Length = 651
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|197247483|ref|YP_002148608.1| hypothetical protein SeAg_B3893 [Salmonella enterica subsp.
enterica serovar Agona str. SL483]
gi|440762586|ref|ZP_20941641.1| hypothetical protein F434_06500 [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|440769697|ref|ZP_20948654.1| hypothetical protein F514_18649 [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|440774815|ref|ZP_20953701.1| hypothetical protein F515_20523 [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|197211186|gb|ACH48583.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Agona str. SL483]
gi|436412179|gb|ELP10122.1| hypothetical protein F515_20523 [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|436414203|gb|ELP12135.1| hypothetical protein F514_18649 [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|436422862|gb|ELP20686.1| hypothetical protein F434_06500 [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
Length = 651
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|402306205|ref|ZP_10825256.1| putative glycosyhydrolase [Prevotella sp. MSX73]
gi|400379972|gb|EJP32801.1| putative glycosyhydrolase [Prevotella sp. MSX73]
Length = 816
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 131/321 (40%), Gaps = 60/321 (18%)
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLP 451
+ETC + + + +F T E Y D YERAL NGVLS GV + Y P
Sbjct: 346 QETCASIANVFWNYRMFLATGEGKYVDVYERALYNGVLS-------GVSLSGDKFFYDNP 398
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
L + + H +G CC G + + Y ++ Y+ YI +
Sbjct: 399 L-ESMGQHERQHWFGCA-----CCPGNVTRFVASVPQYQYAVRGSDI---YVNLYIQGTA 449
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT----------- 560
D +G + Q P WD +T+T K+ + +L R+P W
Sbjct: 450 D-VNGVRLAQQTRYP---WDG--DITVTVDPKRS--RRFALRFRIPGWAGACPVGTNLYH 501
Query: 561 YSNGAQ---ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA----IQDDRP 613
+++ ++ +NG+ + P ++ RW D++ I LP+ +R A ++DDR
Sbjct: 502 FADSSRPFTVKVNGREIAGEPVDGYMVIDRRWMRGDRVEISLPMEVRRVAANDNVEDDRG 561
Query: 614 EYASIQAILFGP--YLLAGHTSGEWDIKTGTARSLSALISPIPPSFNA-QLVTFTQESGN 670
+Y A+ GP Y L G + + R L +PI + A +L + SG
Sbjct: 562 KY----ALERGPIVYCLEGRDQAHSTVFDKSVR----LDAPIRADYRADKLNGIVELSGE 613
Query: 671 STFVMSN-SNQSITMEEFPVS 690
+ V ++ S + + + P S
Sbjct: 614 AEEVEADGSVRPVAFKAIPYS 634
>gi|417376625|ref|ZP_12145767.1| secreted protein [Salmonella enterica subsp. enterica serovar
Inverness str. R8-3668]
gi|353592514|gb|EHC50495.1| secreted protein [Salmonella enterica subsp. enterica serovar
Inverness str. R8-3668]
Length = 651
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|156935976|ref|YP_001439892.1| hypothetical protein ESA_03870 [Cronobacter sakazakii ATCC BAA-894]
gi|156534230|gb|ABU79056.1| hypothetical protein ESA_03870 [Cronobacter sakazakii ATCC BAA-894]
Length = 655
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/365 (21%), Positives = 130/365 (35%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF-------------HA 333
L RLY T +P++ +LA F +P F + S++ ++
Sbjct: 195 ALMRLYEATQEPRYQVLARYFVEQRGTQPHFYDIEYEKRGRTSYWNTYGPAWMVQDKAYS 254
Query: 334 NTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H P+ +G +R+ ++GD + + + Y TGG
Sbjct: 255 QAHQPLAEQTRAVGHAVRFVYLMTGVAHLARLSGDEEKRRACLRLWENMARRQLYITGGI 314
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 315 GSQSSGEAFSTDYDLPNDTVYAE---SCASIGLIMFARRMLEMEGDSQYADVMERALYNT 371
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 372 VLG-GMALDGKHFFYVNPL--------EVHPKTLKFNHIYDHVKPVRQRWFGCACCPPNI 422
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY E L+I YI ++ G L ++ W +R+ +
Sbjct: 423 ARLLTSLGHYIYTARED---ALFINLYIGNNVQLPVGDSTLRLRISGDFPWHEEVRIHID 479
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
E +L LR+P W + + LNG+ +L T W D LT+ L
Sbjct: 480 SPRPVE----HTLALRLPDW--CDAPRVMLNGRPCEGDIRKGYLWLTRTWHEGDTLTLTL 533
Query: 600 PLSLR 604
P+ +R
Sbjct: 534 PMPVR 538
>gi|168235286|ref|ZP_02660344.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. SL480]
gi|194737873|ref|YP_002116613.1| hypothetical protein SeSA_A3877 [Salmonella enterica subsp.
enterica serovar Schwarzengrund str. CVM19633]
gi|194713375|gb|ACF92596.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. CVM19633]
gi|197291306|gb|EDY30658.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. SL480]
Length = 651
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSVEIPVENGALKLRIGGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|417394187|ref|ZP_12156450.1| secreted protein [Salmonella enterica subsp. enterica serovar
Minnesota str. A4-603]
gi|353606439|gb|EHC60665.1| secreted protein [Salmonella enterica subsp. enterica serovar
Minnesota str. A4-603]
Length = 651
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSVEIPVENGALKLRIGGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|437834770|ref|ZP_20845077.1| hypothetical protein SEEERB17_013756 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SARB17]
gi|435300940|gb|ELO76997.1| hypothetical protein SEEERB17_013756 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SARB17]
Length = 651
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVHH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|345297339|ref|YP_004826697.1| hypothetical protein Entas_0157 [Enterobacter asburiae LF7a]
gi|345091276|gb|AEN62912.1| protein of unknown function DUF1680 [Enterobacter asburiae LF7a]
Length = 649
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 84/366 (22%), Positives = 130/366 (35%), Gaps = 77/366 (21%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY 347
L RLY +T P++L L F +P F + SH+ NT+ P + Y
Sbjct: 193 LMRLYDVTQKPRYLALVKYFIEERGAQPHFYDIEYEKRGKTSHW--NTYGPAWMVKDKAY 250
Query: 348 EVTGDPLYK---LIG-----TFFM----DIVNASHS-------------------YATGG 376
PL + IG + M + SH Y TGG
Sbjct: 251 SQAHQPLAEQQTAIGHAVRFVYLMAGMAHLARLSHDEGKRQDCLRLWNNMAQRQLYITGG 310
Query: 377 ----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 311 IGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYN 367
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGT 478
VL + Y+ PL H FN + CC
Sbjct: 368 TVLG-GMALDGKHFFYVNPL--------EVHPKTLSFNHIYDHVKPVRQRWFGCACCPPN 418
Query: 479 GIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL 538
+ LG IY E L+I Y+ + G L ++ W +++ +
Sbjct: 419 IARVLTSLGHYIYTVRED---ALFINLYVGNDVAIPVGDRKLQLRISGNYPWHEQVKIDI 475
Query: 539 TFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
T V +L LR+P W + + +LNG+ + +L T RW D +T+
Sbjct: 476 T----SPVPVTHTLALRLPDWCAN--PEIALNGEVITGEVTRGYLYLTRRWQEGDAITLT 529
Query: 599 LPLSLR 604
LP+ +R
Sbjct: 530 LPMPVR 535
>gi|297520697|ref|ZP_06939083.1| hypothetical protein EcolOP_23892 [Escherichia coli OP50]
Length = 563
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 94/251 (37%), Gaps = 39/251 (15%)
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 210 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 266
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 267 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 317
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 318 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 373
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 374 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 428
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 429 TLNLTLPMPVR 439
>gi|168232522|ref|ZP_02657580.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CDC 191]
gi|194471797|ref|ZP_03077781.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
gi|194458161|gb|EDX47000.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
gi|205333286|gb|EDZ20050.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CDC 191]
Length = 651
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|417353052|ref|ZP_12130092.1| secreted protein [Salmonella enterica subsp. enterica serovar
Gaminara str. A4-567]
gi|353564767|gb|EHC30749.1| secreted protein [Salmonella enterica subsp. enterica serovar
Gaminara str. A4-567]
Length = 651
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|298247044|ref|ZP_06970849.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
gi|297549703|gb|EFH83569.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
Length = 639
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/377 (23%), Positives = 147/377 (38%), Gaps = 59/377 (15%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLA-LQADYLSHFHANT------HIPI 339
L +LY +T + ++L L+ F +P + A L+ D F A T H+PI
Sbjct: 199 ALVKLYRVTGEKRYLNLSQYFVDERGKQPHYFDEEAHLRGDDPRDFWAQTYEYNQSHVPI 258
Query: 340 VIGSQMRYEVTGDPLYKL-IGTFFMDIVNASHSYATGGTSAREFWWD--PKRLADTLG-- 394
+ + EV G + + + + D+V + + T R W KRL T G
Sbjct: 259 ----REQREVVGHAVRAMYLYSAVADLVKERYDESLFQTGER-LWHHLVSKRLYITGGIG 313
Query: 395 --SENE---------------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
++NE E+C + ++ + L + + YAD ERAL NG+LS
Sbjct: 314 STAKNEGFTEDYDLPNLTAYAESCASIGLVMWNHRLLQLDADSRYADLLERALYNGMLS- 372
Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
+ Y+ PL R GW F CC + LG +Y + +
Sbjct: 373 GISLDGSKYFYVNPLESKGDHHRV--GW---FKCA-CCPPNIARTLMSLGQYVYTVSDTD 426
Query: 498 VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
+ + YI + + G + + + WD + + + + G LNLR+P
Sbjct: 427 I---FTHLYIQGTGELSVGGHNVKVEQETKYPWDGAISLKMELDEPADFG----LNLRIP 479
Query: 558 VWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
W AQ SLNG+ + L ++ RW D++ + L + + D E
Sbjct: 480 GW--CQAAQLSLNGEAIALDDHLQKGYVRIERRWQSGDQIVLNLAMPVMRVYAHPDIREN 537
Query: 616 ASIQAILFGP--YLLAG 630
+ A+ GP Y L G
Sbjct: 538 SDRVALQRGPLVYCLEG 554
>gi|253575972|ref|ZP_04853305.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
gi|251844547|gb|EES72562.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
Length = 637
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 99/494 (20%), Positives = 191/494 (38%), Gaps = 62/494 (12%)
Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAP 228
+L A A +++ T +A + +KM + +++ Q+ GY+S +L + ++
Sbjct: 78 FLEACAHVYSITKDAALDQKMDKYIGFIAKAQDP--DGYIST-NIQLSHKKRWGQRIYHE 134
Query: 229 YYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGG 288
Y +L + + L +A Y N + + + W N G
Sbjct: 135 DYNFGHLLTAACVHHTATGKSNFLDVAVKAANYL-NEIFNPCPKHLIHYGWNPSN--IMG 191
Query: 289 MNDVLYRLYSITHDPKHLLLAHLFDKPCFLGF---------LALQADYLSHFHANTHIPI 339
+ D LY IT + +L LA +F G+ L+ + + HA T + +
Sbjct: 192 LVD----LYRITGNETYLKLADIFMTMRGAGYGGEDQNQDRTPLREETEATGHAVTAVYL 247
Query: 340 VIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK--RLADTLGSEN 397
G+ Y TG+ + + Y TGG + P ++ + G++
Sbjct: 248 YAGAADVYSHTGEEAVMRALEKIWNNMYTKKMYLTGGIGSIYNGLSPNGDKIWEAFGTDY 307
Query: 398 E--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
ETC + +F T+E Y D +E+ + N +L + Y
Sbjct: 308 HLPNRSAYTETCANIGNAMWAMRMFNLTQEPKYMDAFEKVVYNSLLG-SMTLDGHHFCYT 366
Query: 450 LPLGRGVSKARSTHGWGTKF--------NSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
PL K + H T+ ++ +CC + + ++L Y + GL
Sbjct: 367 NPLETRGGKLFNHHSPQTQHFRTARWFTHTCYCCPPQVLRTIARLHQWAYGQSN---DGL 423
Query: 502 YIIQYISSSFD--WKSGHVV-LNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
YI Y + + SG + L K D P T++ + + +S++LR+P
Sbjct: 424 YIHLYSGNELNTTLSSGETLSLTMKSDFPA-------EETISITINNSLNTETSIHLRIP 476
Query: 558 VWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA----IQDDRP 613
W ++GA +NG G + +W ND++ + LP+ ++ A +++DR
Sbjct: 477 QW--ADGATVKVNGVQQGDVEAGTYHELKRKWQANDQIELLLPMRVKRIAANPMVEEDRG 534
Query: 614 EYASIQAILFGPYL 627
+ A ++GP++
Sbjct: 535 QV----AFMYGPFV 544
>gi|375003535|ref|ZP_09727874.1| hypothetical protein SEENIN0B_03911 [Salmonella enterica subsp.
enterica serovar Infantis str. SARB27]
gi|353074450|gb|EHB40211.1| hypothetical protein SEENIN0B_03911 [Salmonella enterica subsp.
enterica serovar Infantis str. SARB27]
Length = 651
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRMWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSCDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSVEIPVENGALKLRIGGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLAL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|204928680|ref|ZP_03219879.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Javiana str. GA_MM04042433]
gi|452122524|ref|YP_007472772.1| hypothetical protein CFSAN001992_15250 [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
gi|204322113|gb|EDZ07311.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Javiana str. GA_MM04042433]
gi|451911528|gb|AGF83334.1| hypothetical protein CFSAN001992_15250 [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
Length = 651
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/365 (21%), Positives = 131/365 (35%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGTQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLR 604
P+ +R
Sbjct: 531 PMPVR 535
>gi|417691895|ref|ZP_12341101.1| hypothetical protein SB521682_4174 [Shigella boydii 5216-82]
gi|332085042|gb|EGI90222.1| hypothetical protein SB521682_4174 [Shigella boydii 5216-82]
Length = 656
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 82/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHLFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + + T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYFHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + L + +R
Sbjct: 525 TLNLTLSMPVR 535
>gi|416822592|ref|ZP_11895028.1| hypothetical protein ECO5905_02787 [Escherichia coli O55:H7 str.
USDA 5905]
gi|425251470|ref|ZP_18644405.1| hypothetical protein EC5905_5091 [Escherichia coli 5905]
gi|320661682|gb|EFX29097.1| hypothetical protein ECO5905_02787 [Escherichia coli O55:H7 str.
USDA 5905]
gi|408161718|gb|EKH89653.1| hypothetical protein EC5905_5091 [Escherichia coli 5905]
Length = 656
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 54/220 (24%), Positives = 82/220 (37%), Gaps = 32/220 (14%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + +G +Y E LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y +S + + L +V W ++T+ S Q V +L LR+P W
Sbjct: 442 IYAGNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESPQPVRH--TLALRLPDWC--TQ 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
Q LNG+ + +L T W D L + LP+ +R
Sbjct: 496 PQIILNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|418846200|ref|ZP_13400973.1| hypothetical protein SEEN443_06909 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19443]
gi|418858162|ref|ZP_13412783.1| hypothetical protein SEEN470_08679 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19470]
gi|418865229|ref|ZP_13419709.1| hypothetical protein SEEN536_04836 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19536]
gi|418867555|ref|ZP_13422012.1| hypothetical protein SEEN176_24132 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 4176]
gi|392811425|gb|EJA67435.1| hypothetical protein SEEN443_06909 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19443]
gi|392828511|gb|EJA84203.1| hypothetical protein SEEN536_04836 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19536]
gi|392834500|gb|EJA90106.1| hypothetical protein SEEN470_08679 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19470]
gi|392839395|gb|EJA94937.1| hypothetical protein SEEN176_24132 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 4176]
Length = 651
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 61/253 (24%), Positives = 94/253 (37%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + + L ++ W + ++ + S Q V +L LR+P W
Sbjct: 442 MYVGNSMEIPVENGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG ++ +L W D +T+ LP+ +R A AI G
Sbjct: 496 AKVTLNGLDVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 556 PLVYCLEQADNGE 568
>gi|378766201|ref|YP_005194662.1| hypothetical protein PANA5342_1232 [Pantoea ananatis LMG 5342]
gi|365185675|emb|CCF08625.1| hypothetical protein PANA5342_1232 [Pantoea ananatis LMG 5342]
Length = 651
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 81/362 (22%), Positives = 131/362 (36%), Gaps = 69/362 (19%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------AN 334
L RLY +T P++L L + F +P F + S++H +
Sbjct: 193 LMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 252
Query: 335 THIPIV-----IGSQMRYEVTGDPLYKLIGTFFM-----------DIVNASHS------Y 372
H P+ +G +R+ +Y + G + D + H+ Y
Sbjct: 253 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 306
Query: 373 ATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
TGG +S F D DT+ +E +C + ++ +R + + YAD ER
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMER 363
Query: 429 ALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIES 482
AL N VL + Y+ PL H + W CC
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIARL 422
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
+ LG IY + L+I Y+ + D G L + W+ +T++ +
Sbjct: 423 LTSLGHYIYTPHQN---ALFINLYVGNRVDVPVGDRTLGIHISGNFPWEE--TVTISVDA 477
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
Q V +L LR+P W Q S NG+ + +L W D LT+ LP+
Sbjct: 478 TQPVKH--TLALRLPDW--CEAPQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPMP 533
Query: 603 LR 604
+R
Sbjct: 534 VR 535
>gi|288925304|ref|ZP_06419239.1| cytoplasmic protein [Prevotella buccae D17]
gi|288338069|gb|EFC76420.1| cytoplasmic protein [Prevotella buccae D17]
Length = 813
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 95/403 (23%), Positives = 158/403 (39%), Gaps = 78/403 (19%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T ++L +A F + G LS + + H PI ++G +R
Sbjct: 224 ALCKLYKVTGSRRYLDMARYFVEETGRG---TDGHRLSEY-SQDHKPILRQQEIVGHAVR 279
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW--WDPKRLADTL 393
+TGD Y + + + TGG +R + P + +
Sbjct: 280 AGYLYSGVADVAALTGDTAYFHALERLWNNMAGKKLFITGGMGSRAQGEGFGPDYELNNM 339
Query: 394 GSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------ 447
+ +ETC + + + +F T E Y D YERAL NGVLS GV +
Sbjct: 340 -TAYQETCASIANVFWNYRMFLATGEGKYVDVYERALYNGVLS-------GVSLSGDKFF 391
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y PL + + H +G CC G + + Y ++ Y+ YI
Sbjct: 392 YDNPL-ESMGQHERQHWFGCA-----CCPGNVTRFVASVPQYQYAVRGSDI---YVNLYI 442
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT------- 560
+ D +G + Q P WD +T+T K+ + +L R+P W
Sbjct: 443 QGTAD-VNGVRLAQQTRYP---WDG--DITVTVDPKRS--RRFALRFRIPGWAGACPVGT 494
Query: 561 ----YSNGAQ---ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEA----IQ 609
+++ ++ +NG+ + P ++ RW D++ I LP+ +R A ++
Sbjct: 495 NLYHFADSSRPFTVKVNGREIAGEPVDGYMVIDRRWMRGDRVEISLPMEVRRVAANDNVE 554
Query: 610 DDRPEYASIQAILFGP--YLLAGHTSGEWDIKTGTARSLSALI 650
DDR +Y A+ GP Y L G + + R L ALI
Sbjct: 555 DDRGKY----ALERGPIVYCLEGRDQAHSTVFDKSVR-LDALI 592
>gi|429121562|ref|ZP_19182182.1| COG3533 secreted protein [Cronobacter sakazakii 680]
gi|426323943|emb|CCK12919.1| COG3533 secreted protein [Cronobacter sakazakii 680]
Length = 655
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 106/291 (36%), Gaps = 39/291 (13%)
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG----TSAREFWWDPK 387
HA + ++ G ++GD + + + Y TGG +S F D
Sbjct: 269 HAVRFVYLMTGVAHLARLSGDEEKRRACLRLWENMARRQLYITGGIGSQSSGEAFSTDYD 328
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
DT+ +E +C + ++ +R + + YAD ERAL N VL +
Sbjct: 329 LPNDTVYAE---SCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFF 384
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFE 493
Y+ PL H KFN + CC + LG IY
Sbjct: 385 YVNPL--------EVHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTA 436
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
E L+I YI ++ G L ++ W +R+ + E +L
Sbjct: 437 RED---ALFINLYIGNNVQLPVGDSTLRLRISGDFPWHEEVRIHIDSPRPVE----HTLA 489
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
LR+P W + + LNG+ +L T W D LT+ LP+ +R
Sbjct: 490 LRLPDW--CDAPRVMLNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 538
>gi|449310077|ref|YP_007442433.1| hypothetical protein CSSP291_17885 [Cronobacter sakazakii SP291]
gi|449100110|gb|AGE88144.1| hypothetical protein CSSP291_17885 [Cronobacter sakazakii SP291]
Length = 655
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 106/291 (36%), Gaps = 39/291 (13%)
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG----TSAREFWWDPK 387
HA + ++ G ++GD + + + Y TGG +S F D
Sbjct: 269 HAVRFVYLMTGVAHLARLSGDEEKRRACLRLWENMARRQLYITGGIGSQSSGEAFSTDYD 328
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
DT+ +E +C + ++ +R + + YAD ERAL N VL +
Sbjct: 329 LPNDTVYAE---SCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFF 384
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFE 493
Y+ PL H KFN + CC + LG IY
Sbjct: 385 YVNPL--------EVHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTA 436
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
E L+I YI ++ G L ++ W +R+ + E +L
Sbjct: 437 RED---ALFINLYIGNNVQLPVGDSTLRLRISGDFPWHEEVRIHIDSPRPVE----HTLA 489
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
LR+P W + + LNG+ +L T W D LT+ LP+ +R
Sbjct: 490 LRLPDW--CDAPRVMLNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 538
>gi|389842783|ref|YP_006344867.1| hypothetical protein ES15_3783 [Cronobacter sakazakii ES15]
gi|387853259|gb|AFK01357.1| hypothetical protein ES15_3783 [Cronobacter sakazakii ES15]
Length = 655
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 105/291 (36%), Gaps = 39/291 (13%)
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG----TSAREFWWDPK 387
HA + ++ G ++GD + + + Y TGG +S F D
Sbjct: 269 HAVRFVYLMTGVAHLARLSGDEEKRRACLRLWENMARRQLYITGGIGSQSSGEAFSTDYD 328
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
DT+ +E +C + ++ +R + + YAD ERAL N VL +
Sbjct: 329 LPNDTVYAE---SCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFF 384
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFE 493
Y+ PL H KFN + CC + LG IY
Sbjct: 385 YVNPL--------EVHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTA 436
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
E L+I YI + G L ++ W +R+ + E +L
Sbjct: 437 RED---ALFINLYIGNDVQLPVGDSTLRLRISGDFPWHEEVRIHIDSPRPVE----HTLA 489
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
LR+P W + + LNG+ +L T W D LT+ LP+ +R
Sbjct: 490 LRLPDW--CDAPRVMLNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 538
>gi|110807746|ref|YP_691266.1| hypothetical protein SFV_3953 [Shigella flexneri 5 str. 8401]
gi|418259896|ref|ZP_12882543.1| hypothetical protein SF660363_4453 [Shigella flexneri 6603-63]
gi|424840119|ref|ZP_18264756.1| hypothetical protein SF5M90T_3865 [Shigella flexneri 5a str. M90T]
gi|110617294|gb|ABF05961.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
gi|383469171|gb|EID64192.1| hypothetical protein SF5M90T_3865 [Shigella flexneri 5a str. M90T]
gi|397894067|gb|EJL10519.1| hypothetical protein SF660363_4453 [Shigella flexneri 6603-63]
Length = 659
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E + + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SYASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|284034063|ref|YP_003383994.1| hypothetical protein Kfla_6192 [Kribbella flavida DSM 17836]
gi|283813356|gb|ADB35195.1| protein of unknown function DUF1680 [Kribbella flavida DSM 17836]
Length = 637
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 98/428 (22%), Positives = 166/428 (38%), Gaps = 57/428 (13%)
Query: 229 YYTIHKILAGLLDQYVLADNA---QALKMATWMVEYFYN----RVQKVITMYSVERHWYS 281
Y H I A + D A A+K+A +V F + +++ V +E
Sbjct: 150 YCAGHLIQAAVAQIRCTGDRALLDVAIKLADHLVATFGDSGQGKIRDVDGHPVIEMALVE 209
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVI 341
L ETG + + + ++ H F + ++ HA + +
Sbjct: 210 LYRETGTTAYLELARWFVEARGHGIIEGHGHHPAYFSDRVPVREATTVEGHAVRAVYLAA 269
Query: 342 GS-QMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE-- 398
G+ + E D L +++ F + + + +Y TGG +R WD + G E E
Sbjct: 270 GAADVALETGDDDLLRVLEGQFAHMWS-TKTYLTGGLGSR---WD----GEAFGDEYELP 321
Query: 399 ------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLP 451
ETC ++ + + T YAD ER L NG L+ + G + Y+ P
Sbjct: 322 PDRAYAETCAAIGGVQWAWRMLLATGNAFYADAIERMLYNGFLAGVSLGGDE--YFYVNP 379
Query: 452 LG-RGV-------SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
L RG S A GW F+ CC + + S L + +G + +
Sbjct: 380 LQLRGAAEPDGNRSPAHGRRGW---FDCA-CCPPNIMRTLSSLDGYLASTTDGAI---QL 432
Query: 504 IQYISSSF--DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
QY + D +G V L +VD W+ +++T+ +Q +L LR+P W
Sbjct: 433 HQYAEGAVAADLPAGTVEL--QVDTEYPWNGSIKVTV----QQTPDTPWALELRIPGWAE 486
Query: 562 SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
A+LNG+ + G + + W+ D + +QLP++ RT A A+
Sbjct: 487 G----ATLNGKPVDA---GRYARVEQTWATGDTVELQLPMATRTVAADPRIDAVRGCVAL 539
Query: 622 LFGPYLLA 629
GP + A
Sbjct: 540 ERGPLVYA 547
>gi|420368547|ref|ZP_14869294.1| hypothetical protein SF123566_9752 [Shigella flexneri 1235-66]
gi|391322141|gb|EIQ78842.1| hypothetical protein SF123566_9752 [Shigella flexneri 1235-66]
Length = 659
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 133/371 (35%), Gaps = 85/371 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L + F +P + + SH+H +
Sbjct: 192 ALMRLYEVTEEPRYLALTNYFVEQRGAQPHYYDQEYEKRGQTSHWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRYEVTGDPLYKLIGTFFMDIVNASHS----------------- 371
H+P+ IG +R+ +Y + G + ++ S
Sbjct: 252 QAHLPLAQQQTAIGHAVRF------VYLMTGVAHLARLSHDDSKRQDCLRLWNNMAQRQL 305
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E + + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFTSDYDLPNDTVYAE---SYASIGLMMFARRMLEMEGDSQYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H KFN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G +Y E LYI Y +S + + L +V W
Sbjct: 414 CCPPNIARVLTSIGHYLYTPRED---ALYINIYAGNSMEVPVENGTLRLRVSGNYPWQE- 469
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
++T+ S Q V +L LR+P W Q LNG+ + +L T W D
Sbjct: 470 -QVTIAVESPQPVRH--TLALRLPDWC--TQPQIILNGEEVEQDIRKGYLHITREWQEGD 524
Query: 594 KLTIQLPLSLR 604
L + LP+ +R
Sbjct: 525 TLNLTLPMPVR 535
>gi|386820698|ref|ZP_10107914.1| hypothetical protein JoomaDRAFT_2662 [Joostella marina DSM 19592]
gi|386425804|gb|EIJ39634.1| hypothetical protein JoomaDRAFT_2662 [Joostella marina DSM 19592]
Length = 660
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 103/431 (23%), Positives = 174/431 (40%), Gaps = 94/431 (21%)
Query: 251 ALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAH 310
ALK A MVE F K+ ++V H ETG L RLY IT++ K+L LA
Sbjct: 208 ALKNADLMVETFGPEDGKI---HTVPGHQII---ETG-----LIRLYRITNEKKYLELAK 256
Query: 311 LFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMRY-----------EVTGDPL 354
F GF + D+ + A H+P+ V+G +R + D
Sbjct: 257 YFLDG--RGFHEGRMDFGPY--AQDHVPVIKQDEVVGHAVRAVYMYAAMTDIAAIENDTA 312
Query: 355 Y-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADTLGSENEETCTTYNMLKVS 410
Y K + + ++VN Y TGG AR E + + L + L + NE TC + +
Sbjct: 313 YHKAVDNLWENMVN-KKMYLTGGIGARHEGEAFGENYELPN-LTAYNE-TCAAIGDVYWN 369
Query: 411 RHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG---RGVSKARSTHGWGT 467
L T + Y D ER L NG++S G + P GV K G T
Sbjct: 370 HRLHNMTGNVKYFDVIERTLYNGLIS---GLSLNGTQFFYPNALESDGVYKF--NQGACT 424
Query: 468 KFNSFWC-CYGTGIESF---------SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
+ + F C C T + F SK D+++ LY ++ +
Sbjct: 425 RKDWFDCSCCPTNVIRFIPSLPGLIYSKTSDTVFV-------NLYAAN--QATIGLEETA 475
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQ 566
+ + Q+ W+ +++T+T E ++ LR+P W +Y +
Sbjct: 476 IAITQETS--YPWNGSVKLTVT----PETASDFTIKLRIPGWARNEVLPGTLYSYKEKIK 529
Query: 567 A----SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR----TEAIQDDRPEYASI 618
A +NG+ + +++ T W + +++++P+ +R E +++DR +
Sbjct: 530 AVPEVKVNGELVEATIDNGYITLTRNWKKGETISLEIPMKVREVLANEKVEEDRGKI--- 586
Query: 619 QAILFGPYLLA 629
A+ +GP + A
Sbjct: 587 -ALEYGPIVYA 596
>gi|427384245|ref|ZP_18880750.1| hypothetical protein HMPREF9447_01783 [Bacteroides oleiciplenus YIT
12058]
gi|425727506|gb|EKU90365.1| hypothetical protein HMPREF9447_01783 [Bacteroides oleiciplenus YIT
12058]
Length = 811
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 91/388 (23%), Positives = 150/388 (38%), Gaps = 87/388 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L +A F + G LS + + H PI ++G +R
Sbjct: 220 ALAKLYKVTGDEKYLKMAKYFVEETGRG---TDGHRLSEY-SQDHKPILQQDEIVGHAVR 275
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y + + + + + TGG +R P+ + G
Sbjct: 276 AGYLYSGVADVAALTQDTAYFNALSRIWENMASKKLFITGGIGSR-----PQ--GEGFGP 328
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + +F T YAD ERAL NGV+S GV +
Sbjct: 329 NYELNNHTAYCETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVIS-------GVSL 381
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G + F + +GN +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-VTRFMASVPYYMYATQGN--DI 432
Query: 502 YIIQYISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
Y+ YI S D S +V L Q + W+ + + +T +QE +L R+P W
Sbjct: 433 YVNLYIQSKADLNTDSNNVALEQTTE--YPWEGKVSILVTPEKEQEF----ALRFRIPGW 486
Query: 560 -----------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR- 604
++++ A A S+NG+ + + + + W D + I LP+ +R
Sbjct: 487 AQDAPVPTDLYSFTDKAGAYSISVNGKKVNAKQYDGYATISRTWKAGDVVEISLPMDVRR 546
Query: 605 ---TEAIQDDRPEYASIQAILFGPYLLA 629
+ ++DDR + AI GP +
Sbjct: 547 IKANDNVEDDRGKL----AIERGPIMFC 570
>gi|435854425|ref|YP_007315744.1| hypothetical protein Halha_1714 [Halobacteroides halobius DSM 5150]
gi|433670836|gb|AGB41651.1| hypothetical protein Halha_1714 [Halobacteroides halobius DSM 5150]
Length = 647
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 91/215 (42%), Gaps = 20/215 (9%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVS 457
ETC ++ + + + YAD ERAL NGVLS + + E + L +
Sbjct: 332 ETCAAIGLMFWAHRMLHLDLDSQYADVMERALYNGVLSGMSQDGEKFFYVNPLEVWPEAC 391
Query: 458 KARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS--SF 511
+ R W CC + +G+ IY +E YI Y +S F
Sbjct: 392 EERKDKEHVKPTRQKWFGCACCPPNIARLLASIGEYIYSTDE---QAAYIHLYTASVTEF 448
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
+ V L+Q+ D WD +T+T + ++EV +L LR+P W S A+ +NG
Sbjct: 449 EIDGTSVELDQETD--YPWDE--NITITVNPREEVE--FTLALRIPDWCES--AELKVNG 500
Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLR 604
+ L L ++ WS D++ + L + ++
Sbjct: 501 RTLELDSIIDNGYVEVNRSWSKGDQIELVLAMPVK 535
>gi|408673627|ref|YP_006873375.1| protein of unknown function DUF1680 [Emticicia oligotrophica DSM
17448]
gi|387855251|gb|AFK03348.1| protein of unknown function DUF1680 [Emticicia oligotrophica DSM
17448]
Length = 652
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 107/501 (21%), Positives = 194/501 (38%), Gaps = 77/501 (15%)
Query: 182 NATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAPYYTIHKILAG-LL 240
+A +++K + ++ Q + GYL+ + T L+ W AG L+
Sbjct: 106 DAELEKKTDEWIDKIAAAQ--LPDGYLNTYYT-----LNGLQNRWTDMEKHEDYCAGHLI 158
Query: 241 DQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSIT 300
+ V N + + F N + + + R W S ++E + L +LY T
Sbjct: 159 EAAVAYYNTTGKRKLLDVAIRFANHIDETFRL--ANRPWVSGHQE---IELALVKLYRTT 213
Query: 301 HDPKHLLLAHLF--DKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMRYEV---- 349
D ++L L+ F + G + D+ + IP+ + G +R
Sbjct: 214 KDERYLKLSEWFLNQRGRGNGKGVIWDDWKDPAYCQDAIPVKDQKEITGHAVRAMYLYTG 273
Query: 350 -------TGDPLY-KLIGTFFMDIVNASHSYATGGT----SAREFWWDPKRLADTLGSEN 397
TGD Y + T + D+V+ + Y TGG S F D L +EN
Sbjct: 274 AADVAVNTGDTGYMNAMKTVWEDVVH-RNMYITGGIGSSGSNEGFSQDFD-----LPNEN 327
Query: 398 E--ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 455
ETC + M+ ++ + T E Y D ER+L NG L Y PL
Sbjct: 328 AYCETCASVGMVFWNQRMNALTGESKYIDVLERSLYNGALD-GLSLSGDRFFYGNPLASI 386
Query: 456 VSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
AR +GT CC + LGD IY + E G+++ ++ S+ + K
Sbjct: 387 GRHARR-EWFGTA-----CCPSNIARLVASLGDYIYGKSEN---GIWVNLFVGSNTNIKL 437
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT--------------- 560
G+ + ++ + +++++ S+K + +L++R+P WT
Sbjct: 438 GNTEILTSIETNYPLNGKVKISMNPSTKTKY----TLHVRIPSWTTNEPVAGNLYHYLGN 493
Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
Y+ +NG+ + + WS D ++ +LP+ +R +++ + A
Sbjct: 494 YAANIAMMVNGRKIDYKIENGYAIIDREWSAGDIVSFELPMDVRKIVARNELKQDNDRMA 553
Query: 621 ILFGP--YLLAG--HTSGEWD 637
+ GP Y + G + WD
Sbjct: 554 LQRGPLVYCVEGIDNEGKAWD 574
>gi|374374966|ref|ZP_09632624.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
gi|373231806|gb|EHP51601.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
Length = 629
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 66/286 (23%), Positives = 108/286 (37%), Gaps = 47/286 (16%)
Query: 361 FFMDIVNASHS------YATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLF 414
+ IVN + S + TG S+ E W + ++ T + ETC T +K+ L
Sbjct: 285 YLEAIVNTAESIRKDEIFVTGSGSSMESWINGAKIQATPLRHSNETCVTATWMKLCLQLL 344
Query: 415 RWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG---------RGVSKARSTHGW 465
R T + +A+ ER N +L M+P G RGV K +
Sbjct: 345 RTTGDAKWANEIERTFYNALLGA-----------MMPDGHTWNKYTDLRGV-KYLGENQC 392
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH--VVLNQK 523
G N CC G L + N G+ + Y ++S G V LN
Sbjct: 393 GMDIN---CCIANGPRGLMVLPKEAFMI---NAAGIAVNFYGTASATLSVGQNKVTLNT- 445
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
+ + +T+ + + + +L LR+P W S S+NG + PG +
Sbjct: 446 ---VTEYPKNGAVTIIVNPGKPLD--FNLQLRIPEW--SAHTNISINGVAVDNAVPGKYT 498
Query: 584 SATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
+ W D + +Q + +R + D Y + +GP +LA
Sbjct: 499 AIKRTWKQGDIVKLQFQMDVRQYFVPGDSTRY----CLQYGPLVLA 540
>gi|284172576|ref|YP_003405958.1| protein of unknown function DUF1680 [Haloterrigena turkmenica DSM
5511]
gi|284017336|gb|ADB63285.1| protein of unknown function DUF1680 [Haloterrigena turkmenica DSM
5511]
Length = 636
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 118/544 (21%), Positives = 198/544 (36%), Gaps = 116/544 (21%)
Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKP--VW 226
++ A++ + A + ++ K+ V+ +++ Q GYL+ + F ++P W
Sbjct: 75 WIEAASYVLAQRDDPELEAKVDGVISLIADAQQP--DGYLNTY-------FSLVEPENRW 125
Query: 227 APYYTIHKI-LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
+ +H++ AG L + +A + +A + T ++E + V ++ E +EE
Sbjct: 126 TNLHMMHELYCAGHLIEAAVA-HYRATEKET-LLEVAVDFADLVDDVFGDEVEGVPGHEE 183
Query: 286 TGGMNDVLYRLYSITHDPKHLLLAHLF--------------DKPCFLG--------FLAL 323
+ L +LY +T + ++L LA F D P LG +
Sbjct: 184 ---IELALLKLYRVTDETRYLELAKYFIDLRGKDDRLAWEIDNPETLGGGEYEDGSIIPA 240
Query: 324 QADYLSH-------FHANTHIPI-----VIGSQMR------------YEVTGDPLYKLIG 359
D +H +A H P+ V G +R E D L + +
Sbjct: 241 ARDVFTHEDGTYDGRYAQAHEPLRDQETVEGHSVRAMYLFAAATDLAIETGEDELIESLE 300
Query: 360 TFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
+ ++ Y TGG E D ETC + ++ LF + E
Sbjct: 301 RLWTNMTT-KRMYVTGGLGPEEAHEGFTTDYDLRNDAYAETCAAIGSVYWNQRLFELSGE 359
Query: 420 IAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYG 477
YAD ER L NG L+ GTE Y PL R GW T CC
Sbjct: 360 AKYADLIERTLYNGFLAGVSLDGTE---FFYENPLESDGDHHRK--GWFTCA----CCPP 410
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
+ LG+ +Y + + +Y+ QY+ SS + D + W + +
Sbjct: 411 NAARLLASLGEYVYSQRDS---AIYVNQYLGSSVTTAVDGATVELSQDSSLPWSGEVTVD 467
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTI 597
+ G L LR+P W S + ++NG+++ P G +L W +D++ +
Sbjct: 468 VDAD-----GASVPLRLRIPEWAES--STVTVNGESVETPSEG-YLEIERVWD-DDRIEL 518
Query: 598 QL-------------------------PLSLRTEAIQDDRP----EYASIQAILFGPYLL 628
PL EAI +DRP E S + P LL
Sbjct: 519 TFEQTVTRLEAHPDVAADAGRVALKRGPLVYCLEAIDNDRPLHQYEDPSPTSTTHRPDLL 578
Query: 629 AGHT 632
G T
Sbjct: 579 EGVT 582
>gi|146295756|ref|YP_001179527.1| hypothetical protein [Caldicellulosiruptor saccharolyticus DSM
8903]
gi|145409332|gb|ABP66336.1| protein of unknown function DUF1680 [Caldicellulosiruptor
saccharolyticus DSM 8903]
Length = 653
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 88/363 (24%), Positives = 135/363 (37%), Gaps = 74/363 (20%)
Query: 293 LYRLYSITHDPKHLLLAHLF-------------------DKPCFLGFLALQADYLSHFHA 333
L +LY +T++ K+L LA F K + GF L +YL
Sbjct: 200 LVKLYEVTNNSKYLELAKFFIDERGQEPYYFDIEWEKRGKKEHWKGFKGLGKEYLQ---- 255
Query: 334 NTHIPI-----VIGSQMR------------YEVTGDPLYKLIGTFFMDIVNASHSYATG- 375
H P+ +G +R Y LY++ F DI N Y TG
Sbjct: 256 -AHKPVREQREAVGHAVRAVYLYSGMADVAYYTKDKELYEVCEALFNDIRNRK-MYITGA 313
Query: 376 -GTSAR----EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERAL 430
G+SA F +D A ETC + ++ + + R Y D ERAL
Sbjct: 314 IGSSAHGEAFTFEYDLPNAAAYA-----ETCASVGLVFFAHRMNRIKPHRKYYDVVERAL 368
Query: 431 TNGVLSI--QRGTEPGVMIYMLPLG---RGVSKARSTHGWGTKFNSFW---CCYGTGIES 482
N ++ Q G + Y+ PL + V K H + ++ CC
Sbjct: 369 YNTIIGAMSQDGKK---YFYVNPLEVFPKEVEKRFDRHHVKPERQPWFGCACCPPNVARL 425
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
+ +G IY N +Y+ YI S +S ++ NQKV I + F
Sbjct: 426 LASIGKYIYLY---NNNEIYVNLYIGS----ESEFLINNQKVKIIQDSGYPFNDEVNFKI 478
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP-LPPPGNFLSATERWSYNDKLTIQLPL 601
+LNLR+P W + + +NG+ L ++S T W +D++ I LP
Sbjct: 479 ITNGEMYFTLNLRIPSWC--DKFEIKINGELLTGFSLKDGYVSITRGWKSDDRIEIILPT 536
Query: 602 SLR 604
L+
Sbjct: 537 QLK 539
>gi|417344582|ref|ZP_12124897.1| secreted protein [Salmonella enterica subsp. enterica serovar
Baildon str. R6-199]
gi|417542477|ref|ZP_12193911.1| secreted protein [Salmonella enterica subsp. enterica serovar
Wandsworth str. A4-580]
gi|353658599|gb|EHC98734.1| secreted protein [Salmonella enterica subsp. enterica serovar
Wandsworth str. A4-580]
gi|357953998|gb|EHJ80341.1| secreted protein [Salmonella enterica subsp. enterica serovar
Baildon str. R6-199]
Length = 651
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/253 (24%), Positives = 93/253 (36%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY LYI
Sbjct: 387 --EVHPRSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RAHALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + + L ++ W + ++ + S Q V +L LR+P W
Sbjct: 442 MYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +T+ LP+ +R A AI G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 556 PLVYCLEQADNGE 568
>gi|417109929|ref|ZP_11963472.1| hypothetical protein RHECNPAF_800032 [Rhizobium etli CNPAF512]
gi|327188729|gb|EGE55928.1| hypothetical protein RHECNPAF_800032 [Rhizobium etli CNPAF512]
Length = 640
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 93/429 (21%), Positives = 167/429 (38%), Gaps = 73/429 (17%)
Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
L +L +T + K+L L+ F + F A + FH T H P+
Sbjct: 198 ALVKLARVTGEKKYLDLSKYFIDERGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 258 REQTKVVGHAVRAMYLYSGMADVATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + + YAD E+AL NG L
Sbjct: 317 NEGFTDYYDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
T+ Y PL A H W K++ CC + +G +Y + +
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAIADDEI- 426
Query: 500 GLYIIQYISSSFDWKSGHVV-LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+++ ++ +G V L Q + W+ + F+++ E +L+LR+P
Sbjct: 427 AVHLYGESTTRLKLANGAAVELQQATN--YPWE----GAVAFTTRLEKPAKFALSLRIPD 480
Query: 559 WTYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W ++GA S+NG+ L L + +W D++ + LPLSLR + + A
Sbjct: 481 W--ADGATLSVNGEKLDLGAATRDGYARIDRQWVDGDRVDLFLPLSLRPQYANPKVRQDA 538
Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMS 676
A++ GP + T T + L+A++ P + S T V++
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGQDLNAIVLP------------RELSAAETVVLN 579
Query: 677 NSNQSITME 685
+ N ++ ++
Sbjct: 580 DLNDAVALD 588
>gi|224585478|ref|YP_002639277.1| hypothetical protein SPC_3759 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gi|224470006|gb|ACN47836.1| hypothetical protein SPC_3759 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
Length = 651
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/253 (24%), Positives = 93/253 (36%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY LYI
Sbjct: 387 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + + L ++ W + ++ + S Q V +L LR+P W
Sbjct: 442 MYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +T+ LP+ +R A AI G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 556 PLVYCLEQADNGE 568
>gi|261409833|ref|YP_003246074.1| hypothetical protein GYMC10_6062 [Paenibacillus sp. Y412MC10]
gi|261286296|gb|ACX68267.1| protein of unknown function DUF1680 [Paenibacillus sp. Y412MC10]
Length = 658
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 108/494 (21%), Positives = 194/494 (39%), Gaps = 79/494 (15%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFEALK 223
V +L A+A A+ + ++E++ ++ +++ Q GYL+ + T E + L
Sbjct: 79 VAKWLEAAAYSLATHRDPKLEEQVDELIDLVADAQQP--DGYLNTYFTVKEPEKRWTNLT 136
Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
Y H I AG+ Y + L + + ++ + V + H + +
Sbjct: 137 DCHELYCAGHMIEAGVA-HYRATGKRKLLDVVCRLADH----IDTVFGPEDGKIHGFDGH 191
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFL------ALQADYLSHFH 332
+E + L +LY +T +P++L L+ F +P F FL ++ Y S H
Sbjct: 192 QE---IELALVKLYEVTQEPRYLSLSQYFIDERGTEPHF--FLQEWEQRGKKSFYRSVLH 246
Query: 333 A------NTHIPI-----VIGSQMRY-----------EVTGDP-LYKLIGTFFMDIVNAS 369
A +H+P+ +G +R T DP L + T + ++V+
Sbjct: 247 APHLAYHQSHLPVREQKEAVGHSVRAVYMYTAMADLAARTKDPALLEACDTLWRNMVH-K 305
Query: 370 HSYATGGTSA----REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
Y TGG + F D DT+ SE TC + ++ ++ + + + + YAD
Sbjct: 306 QMYITGGIGSTHHGEAFTTDYDLPNDTVYSE---TCASIGLIFFAQRMLQLSPKSEYADV 362
Query: 426 YERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYG 477
ERAL N V+ Q G Y+ PL + R G W CC
Sbjct: 363 MERALFNTVIGSMAQDGRH---FFYVNPLEVWPAACRHNPGKAHVKPVRPGWFACACCPP 419
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
S LG+ +Y + LY YI + + G V + + + WD
Sbjct: 420 NVARLLSSLGEYVYTMNDDT---LYAHLYIGGEAEVRFGDVPVKVMQNSTLPWD----GD 472
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKL 595
+TF+ + E ++ LR+P W+ A +NGQ + + + W+ D
Sbjct: 473 VTFTLQPEQAVEWTVALRIPDWSRGK-AGLRVNGQEMNVEDITQDGYACVKRVWAPGD-- 529
Query: 596 TIQLPLSLRTEAIQ 609
T++L S+ ++
Sbjct: 530 TVELAFSMEIHQVR 543
>gi|424916536|ref|ZP_18339900.1| hypothetical protein Rleg9DRAFT_4102 [Rhizobium leguminosarum bv.
trifolii WSM597]
gi|392852712|gb|EJB05233.1| hypothetical protein Rleg9DRAFT_4102 [Rhizobium leguminosarum bv.
trifolii WSM597]
Length = 640
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 90/396 (22%), Positives = 155/396 (39%), Gaps = 61/396 (15%)
Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
L +L +T + K+L L+ F + F A + FH T H P+
Sbjct: 198 ALVKLARVTGEKKYLELSKFFIDARGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 258 REQKKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + + YAD E+AL NG L
Sbjct: 317 NEGFTDYYDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
T+ Y PL V K H W K++ CC + +G +Y + +
Sbjct: 374 STDGKTFFYDNPL-ESVGK---HHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426
Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+++ ++ +G V L Q + WD + F+++ + +L+LR+P
Sbjct: 427 AVHLYGESTARLKLANGADVELEQTTN--YPWD----GAVAFTTRLKTPAKFALSLRIPD 480
Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W + GA S+NG+ L L + +W+ D++ + LPLSLR + + A
Sbjct: 481 W--AEGATLSVNGEMLDLAANIRDGYARIDRQWADGDRVALSLPLSLRPQYANPKVRQDA 538
Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
A++ GP + T T L+A++ P
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGEDLNAIVLP 567
>gi|383111125|ref|ZP_09931943.1| hypothetical protein BSGG_2229 [Bacteroides sp. D2]
gi|313694694|gb|EFS31529.1| hypothetical protein BSGG_2229 [Bacteroides sp. D2]
Length = 621
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 88/434 (20%), Positives = 160/434 (36%), Gaps = 65/434 (14%)
Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
+W YT LL Y L + +AL ++ + ++Q I ++ Y L
Sbjct: 126 IWGRKYT----SLSLLSYYRLTGDKKALNAVERLINHLMEQLQ--IHNINIAATGYYLGM 179
Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT--HIPIVIG 342
+ + + + LY IT +P++L A +++ + S T +IP+
Sbjct: 180 ASCSILEPVVYLYDITRNPRYLSFAKSI-------VSSIEREGSSQLITKTLKNIPVSER 232
Query: 343 S------------QMRYE-------------VTGDPLYKLIGTFFMDIVNASHSYATGGT 377
S Q YE + DP Y I ++ + G
Sbjct: 233 SAFPKSWWSFENGQKAYEMMSCYEGLIELGTIVNDPFYIRIAEKAVNNIQEDEINIAGSG 292
Query: 378 SAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
+A E W+ K ETC T+ +++ L T YA+ +E + N +++
Sbjct: 293 AAFECWYKGKEKQTLPTYHTMETCVTFTYMQLCHRLLCKTGNSFYAEEFEHTMYNALMAT 352
Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
+ + Y GR + G N CC G F+ + + ++ +
Sbjct: 353 MKNDGSQISKYSPLEGR---RQPGEEQCGMHIN---CCNANGPRGFALIPKTACTIKDNH 406
Query: 498 V-PGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
+ LY+ + S + K V LN + D PI + + + K++ +L LR
Sbjct: 407 IYLNLYLPLQATISLN-KKNKVHLNVESDYPI---HGKVNVNIGVQKKEKF----TLALR 458
Query: 556 MPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
+P T +A +NG+ + G +L W DK+T+ + + + +
Sbjct: 459 IP--TQIEKMKAYINGEEQEITHKGGYLYIERIWENADKVTLDFKIETKVVKLNNS---- 512
Query: 616 ASIQAILFGPYLLA 629
QAI+ GP L A
Sbjct: 513 ---QAIVRGPLLFA 523
>gi|325298731|ref|YP_004258648.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
gi|324318284|gb|ADY36175.1| protein of unknown function DUF1680 [Bacteroides salanitronis DSM
18170]
Length = 666
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 101/455 (22%), Positives = 171/455 (37%), Gaps = 91/455 (20%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T D K+L A F DK G+ + + Y + H P+V +G +R
Sbjct: 219 LAKLYLVTGDKKYLDEAKFFLDK---RGYTSRKDAY-----SQAHKPVVQQDEAVGHAVR 270
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+TGD Y D + Y TGG A + G+
Sbjct: 271 ATYMYSGMADVAALTGDTAYVHAIDRIWDNIVGKKLYLTGGIGATAH-------GEAFGA 323
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + V+ LF + + Y D ER+L NGVLS + G
Sbjct: 324 NYELPNATAYCETCAAIGNVYVNHRLFLFHGDAKYYDVLERSLYNGVLS-GISLDGGRFF 382
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y PL S G+ K C + + F + G+ LY+ ++
Sbjct: 383 YPNPL-------ESAGGYERKAWFGCACCPSNLCRFLPSVPGYMYATRGD--SLYVNLFM 433
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT------- 560
+ + + G ++ + +D +R+TL S + V +R+P WT
Sbjct: 434 EGTSEIQVGKRKISIRQQTAYPFDGNIRLTLQKGSGEFV-----WKVRVPGWTRGEVVPG 488
Query: 561 ----YSNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR----TEAI 608
+++G Q S +NG+ + + S + RW D + + ++ R E +
Sbjct: 489 GLYRFADGKQTSYSVKVNGEKVEGSIEKGYFSISRRWKKGDVVEVSFDMTPRLVLADEKV 548
Query: 609 QDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQES 668
+ DR + AI GP + EW G L +++ P P +L +++
Sbjct: 549 EADR----GMLAIERGPLVYC----AEWCDNQGI--DLFSVLLPRKP----KLEVMDEKA 594
Query: 669 GNSTFVMSNSNQSITMEEFPVSGTDAALHATFRLI 703
++S Q+++ + V G A A +LI
Sbjct: 595 PGGAQMISAGVQTLS---YDVEGKLHASDAVLKLI 626
>gi|429117671|ref|ZP_19178589.1| COG3533 secreted protein [Cronobacter sakazakii 701]
gi|426320800|emb|CCK04702.1| COG3533 secreted protein [Cronobacter sakazakii 701]
Length = 372
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/220 (24%), Positives = 81/220 (36%), Gaps = 32/220 (14%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 54 ESCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 106
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + LG IY E L+I
Sbjct: 107 --EVHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTARED---ALFIN 161
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
YI ++ G L ++ W +R+ + E +L LR+P W +
Sbjct: 162 LYIGNNVQLPVGDSTLRLRISGDFPWHEEVRIHIDSPRPVE----HTLALRLPDW--CDA 215
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+ LNG+ +L T W D LT+ LP+ +R
Sbjct: 216 PRVMLNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 255
>gi|336417454|ref|ZP_08597777.1| hypothetical protein HMPREF1017_04885 [Bacteroides ovatus
3_8_47FAA]
gi|335935949|gb|EGM97896.1| hypothetical protein HMPREF1017_04885 [Bacteroides ovatus
3_8_47FAA]
Length = 621
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 88/434 (20%), Positives = 160/434 (36%), Gaps = 65/434 (14%)
Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
+W YT LL Y L + +AL ++ + ++Q I ++ Y L
Sbjct: 126 IWGRKYT----SLSLLSYYRLTGDKKALNAVERLINHLMEQLQ--IHNINIAATGYYLGM 179
Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT--HIPIVIG 342
+ + + + LY IT +P++L A +++ + S T +IP+
Sbjct: 180 ASCSILEPVVYLYDITRNPRYLSFAKSI-------VSSIEREGSSQLITKTLRNIPVSER 232
Query: 343 S------------QMRYE-------------VTGDPLYKLIGTFFMDIVNASHSYATGGT 377
S Q YE + DP Y I ++ + G
Sbjct: 233 SAFPKSWWSFENGQKAYEMMSCYEGLIELGTIVNDPFYIRIAEKAVNNIQEDEINIAGSG 292
Query: 378 SAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
+A E W+ K ETC T+ +++ L T YA+ +E + N +++
Sbjct: 293 AAFECWYKGKEKQTLPTYHTMETCVTFTYMQLCHRLLCKTGNSFYAEEFEHTMYNALMAT 352
Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
+ + Y GR + G N CC G F+ + + ++ +
Sbjct: 353 MKNDGSQISKYSPLEGR---RQPGEEQCGMHIN---CCNANGPRGFALIPKTACTIKDNH 406
Query: 498 V-PGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
+ LY+ + S + K V LN + D PI + + + K++ +L LR
Sbjct: 407 IYLNLYLPLQATISLN-KKNKVHLNVESDYPI---HGKVNVNIGVQKKEKF----TLALR 458
Query: 556 MPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
+P T +A +NG+ + G +L W DK+T+ + + + +
Sbjct: 459 IP--TQIEKMKAYINGEEQEITHKGGYLYIERIWENADKVTLDFKIETKVVKLNNS---- 512
Query: 616 ASIQAILFGPYLLA 629
QAI+ GP L A
Sbjct: 513 ---QAIVRGPLLFA 523
>gi|296100552|ref|YP_003610698.1| hypothetical protein ECL_00181 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
gi|295055011|gb|ADF59749.1| hypothetical protein ECL_00181 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
Length = 651
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 79/365 (21%), Positives = 126/365 (34%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L F +P F + S++H +
Sbjct: 192 ALMRLYDVTEEPRYLNLVKYFIEARGTQPHFYDIEYEKRGRTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H P+ IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHQPLAEQQTAIGHAVRFVYLMAGMAHLARLSKDDAKRQDCLRLWSNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSRYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H FN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRTLAFNHIYDHVKPVRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY L+I Y+ + G L ++ W + + +
Sbjct: 420 ARVLTSLGHYIYTVRPD---ALFINLYVGNEVTIPVGDETLKLRISGNYPWQEEVNIEIA 476
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
V +L LR+P W + SLNG+ + +L T RW D LT+ L
Sbjct: 477 ----SPVPVTHTLALRLPDWCAN--PHVSLNGEGMTGEVSRGYLHLTRRWQEGDTLTLTL 530
Query: 600 PLSLR 604
P+ +R
Sbjct: 531 PMPVR 535
>gi|372209243|ref|ZP_09497045.1| hypothetical protein FbacS_03931 [Flavobacteriaceae bacterium S85]
Length = 671
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 92/214 (42%), Gaps = 20/214 (9%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC S + E YAD E L N LS E Y PL VS
Sbjct: 354 ETCANVCNSMFSYRMLGLHGEAKYADVMELVLFNSALS-GISIEGKDYFYANPLR--VSH 410
Query: 459 ARSTHGWGTKFN------SFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSF 511
G T+F+ +CC + + +KL Y G LY ++++
Sbjct: 411 KGHDPGNDTEFDMRRPYIPCFCCPPNLVRTIAKLSGWAYSLTTNGVAVNLYGGNKLTTTL 470
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
S ++ Q P W+ + + + + K+ + +R+P W + G+Q +NG
Sbjct: 471 LDGSKLELVQQSGYP---WNGKVTLIIKKAKKEAF----DIKIRVPEW--AKGSQIQING 521
Query: 572 QNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLR 604
+ + LP G++++ ++WS NDK+T+Q+P+ ++
Sbjct: 522 KAVSLPVKAGSYVTLHQKWSKNDKITLQMPMEIK 555
>gi|189464183|ref|ZP_03012968.1| hypothetical protein BACINT_00520 [Bacteroides intestinalis DSM
17393]
gi|189437973|gb|EDV06958.1| hypothetical protein BACINT_00520 [Bacteroides intestinalis DSM
17393]
Length = 812
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 88/384 (22%), Positives = 145/384 (37%), Gaps = 79/384 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L +A F + G LS + + H PI ++G +R
Sbjct: 221 ALAKLYKVTGDGKYLKMAKYFVEETGRG---TDGHRLSEY-SQDHKPILQQDEIVGHAVR 276
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y + + + + Y GG +R P+ + G
Sbjct: 277 AGYLYSGVADVAALTQDTAYFNALSRIWENMVSKKLYIIGGIGSR-----PQ--GEGFGP 329
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + +F T YAD ERAL NGV+S GV +
Sbjct: 330 NYELNNHTNYCETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVIS-------GVSL 382
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + H +G CC G + F + +GN +
Sbjct: 383 SGDKFFYDNPL-ESMGQHERQHWFGCA-----CCPGN-VTRFMASVPYYMYATQGN--DI 433
Query: 502 YIIQYISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
Y+ YI S D S ++ L Q + W+ + + +T +QE +L R+P W
Sbjct: 434 YVNLYIQSKADLNTDSNNIALEQTTE--YPWEGKVSILVTPEKEQEF----ALRFRIPGW 487
Query: 560 -----------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
++++ A A S+NG+ + + + + W D + I LP+ +R
Sbjct: 488 AQDAPVPTDLYSFTDKAGAYSISVNGKKVNAKQYDGYATISRTWKVGDVVEINLPMDVRR 547
Query: 606 EAIQDDRPEYASIQAILFGPYLLA 629
D+ + AI GP +
Sbjct: 548 IKANDNVEDDCGKLAIERGPIMFC 571
>gi|194444786|ref|YP_002042927.1| hypothetical protein SNSL254_A3957 [Salmonella enterica subsp.
enterica serovar Newport str. SL254]
gi|418790980|ref|ZP_13346748.1| hypothetical protein SEEN447_08030 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19447]
gi|418795399|ref|ZP_13351104.1| hypothetical protein SEEN449_11603 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19449]
gi|418798645|ref|ZP_13354319.1| hypothetical protein SEEN567_19023 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19567]
gi|418806870|ref|ZP_13362440.1| hypothetical protein SEEN550_01554 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21550]
gi|418811033|ref|ZP_13366570.1| hypothetical protein SEEN513_20561 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22513]
gi|418819963|ref|ZP_13375400.1| hypothetical protein SEEN425_16720 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|418824033|ref|ZP_13379418.1| hypothetical protein SEEN462_22404 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22462]
gi|418832501|ref|ZP_13387442.1| hypothetical protein SEEN486_20353 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N18486]
gi|418834359|ref|ZP_13389267.1| hypothetical protein SEEN543_19458 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N1543]
gi|418839823|ref|ZP_13394654.1| hypothetical protein SEEN554_21236 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21554]
gi|418851856|ref|ZP_13406562.1| hypothetical protein SEEN978_01684 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 37978]
gi|418853203|ref|ZP_13407898.1| hypothetical protein SEEN593_02318 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
gi|194403449|gb|ACF63671.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL254]
gi|392756265|gb|EJA13162.1| hypothetical protein SEEN447_08030 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19447]
gi|392758783|gb|EJA15648.1| hypothetical protein SEEN449_11603 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19449]
gi|392766123|gb|EJA22905.1| hypothetical protein SEEN567_19023 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19567]
gi|392780719|gb|EJA37371.1| hypothetical protein SEEN513_20561 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22513]
gi|392782028|gb|EJA38666.1| hypothetical protein SEEN550_01554 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21550]
gi|392793888|gb|EJA50323.1| hypothetical protein SEEN425_16720 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|392797650|gb|EJA53956.1| hypothetical protein SEEN486_20353 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N18486]
gi|392805302|gb|EJA61433.1| hypothetical protein SEEN543_19458 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N1543]
gi|392811613|gb|EJA67613.1| hypothetical protein SEEN554_21236 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21554]
gi|392816063|gb|EJA71993.1| hypothetical protein SEEN978_01684 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 37978]
gi|392825252|gb|EJA81005.1| hypothetical protein SEEN462_22404 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22462]
gi|392827750|gb|EJA83452.1| hypothetical protein SEEN593_02318 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
Length = 651
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 88/398 (22%), Positives = 140/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H FN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLNFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +T+ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|423214778|ref|ZP_17201306.1| hypothetical protein HMPREF1074_02838 [Bacteroides xylanisolvens
CL03T12C04]
gi|423294029|ref|ZP_17272156.1| hypothetical protein HMPREF1070_00821 [Bacteroides ovatus
CL03T12C18]
gi|392676837|gb|EIY70260.1| hypothetical protein HMPREF1070_00821 [Bacteroides ovatus
CL03T12C18]
gi|392692684|gb|EIY85921.1| hypothetical protein HMPREF1074_02838 [Bacteroides xylanisolvens
CL03T12C04]
Length = 621
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 88/434 (20%), Positives = 160/434 (36%), Gaps = 65/434 (14%)
Query: 225 VWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNE 284
+W YT LL Y L + +AL ++ + ++Q I ++ Y L
Sbjct: 126 IWGRKYT----SLSLLSYYRLTGDKKALNAVERLINHLMEQLQ--IHNINIAATGYYLGM 179
Query: 285 ETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANT--HIPIVIG 342
+ + + + LY IT +P++L A +++ + S T +IP+
Sbjct: 180 ASCSILEPVVYLYDITRNPRYLSFAKSI-------VSSIEREGSSQLITKTLKNIPVSER 232
Query: 343 S------------QMRYE-------------VTGDPLYKLIGTFFMDIVNASHSYATGGT 377
S Q YE + DP Y I ++ + G
Sbjct: 233 SAFPKSWWSFENGQKAYEMMSCYEGLIELGTIVNDPFYIKIAEKAVNNIQEDEINIAGSG 292
Query: 378 SAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
+A E W+ K ETC T+ +++ L T YA+ +E + N +++
Sbjct: 293 AAFECWYKGKEKQTLPTYHTMETCVTFTYMQLCHRLLCKTGNSFYAEEFEHTMYNALMAT 352
Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
+ + Y GR + G N CC G F+ + + ++ +
Sbjct: 353 MKNDGSQISKYSPLEGR---RQPGEEQCGMHIN---CCNANGPRGFALIPKTACTIKDNH 406
Query: 498 V-PGLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLR 555
+ LY+ + S + K V LN + D PI + + + K++ +L LR
Sbjct: 407 IYLNLYLPLQATISLN-KKNKVHLNVESDYPI---HGKVNVNIGVQKKEKF----TLALR 458
Query: 556 MPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEY 615
+P T +A +NG+ + G +L W DK+T+ + + + +
Sbjct: 459 IP--TQIEKMKAYINGEEQEITHKGGYLYIERIWENADKVTLDFKIETKVVKLNNS---- 512
Query: 616 ASIQAILFGPYLLA 629
QAI+ GP L A
Sbjct: 513 ---QAIVRGPLLFA 523
>gi|307719149|ref|YP_003874681.1| hypothetical protein STHERM_c14680 [Spirochaeta thermophila DSM
6192]
gi|306532874|gb|ADN02408.1| putative cytoplasmic protein [Spirochaeta thermophila DSM 6192]
Length = 643
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 79/350 (22%), Positives = 137/350 (39%), Gaps = 53/350 (15%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFL-------GFLALQADY--LSHFHANTHI 337
L +LY +T + +HL LA F +P + G + + L H ++ +HI
Sbjct: 194 ALLKLYELTGEKRHLDLASFFIEERGRQPHYFEWEWEKRGRTSFWPRFRELGHEYSQSHI 253
Query: 338 PI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE 381
P+ +G +R +TGD L V Y TGG A
Sbjct: 254 PVREQREAVGHAVRAMYMYTALADLARITGDTLLWETAQALWKDVTRRKMYLTGGIGASA 313
Query: 382 FWWDPKRLADTLGSEN--EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
F + +A L ++ ETC + + + + R + Y+D E AL NG+LS
Sbjct: 314 FG-ESFSIAYDLPNDRAYNETCASIGLFFWASRMLRKEIDAEYSDVMELALYNGILS-GM 371
Query: 440 GTEPGVMIYMLPL------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
+ Y+ PL R R K+ CC + +G Y+
Sbjct: 372 SLDGSRFFYVNPLEVWPEACRHREDLRHVMTTRQKWFGCACCPPNLARLLASIG-GYYYS 430
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
G+ L++ Y SS+ + V + Q+ + WD +++++ +E +L+
Sbjct: 431 RSGS--SLFVHFYGSSNLTIEDWGVTVEQETE--YPWDGEVKLSVIAREPREF----TLS 482
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
LR+P W N +NG+ P +++ W N + T++L LS+
Sbjct: 483 LRIPGWC--NDFSLEMNGEAYTSTPERGYVAIRRTW--NGRDTVRLRLSM 528
>gi|423142165|ref|ZP_17129803.1| hypothetical protein SEHO0A_03740 [Salmonella enterica subsp.
houtenae str. ATCC BAA-1581]
gi|379050094|gb|EHY67987.1| hypothetical protein SEHO0A_03740 [Salmonella enterica subsp.
houtenae str. ATCC BAA-1581]
Length = 651
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 78/365 (21%), Positives = 124/365 (33%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF----------------------------------DKPCF 317
L RLY IT P+++ LA F DK
Sbjct: 192 ALMRLYEITQQPRYMALADYFVEQRGTQPHYYDEEYAKRGKTAYWHTYGPAWMVKDKAYS 251
Query: 318 LGFLALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
L L A + HA + ++ G ++ D + + + Y TGG
Sbjct: 252 QAHLPLSAQQTATGHAVRFVYLMAGVAHLARLSQDEDKRQTCLRLWNNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSRYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H FN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKTLTFNHIYDHVKPVRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG +Y LYI Y+ +S + + L ++ W ++T+T
Sbjct: 420 ARVLTSLGHYLYTPRN---EALYINMYVGNSVEIPLENGALKLRISGNYPWQE--QITIT 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q + +L LR+P W Q +NGQ + +L W D + + L
Sbjct: 475 VESSQPLRH--TLALRLPEWCPQ--PQVEVNGQPVEQDIRKGYLHIQRDWQEGDTIALTL 530
Query: 600 PLSLR 604
P+ +R
Sbjct: 531 PMPVR 535
>gi|312126770|ref|YP_003991644.1| hypothetical protein Calhy_0533 [Caldicellulosiruptor
hydrothermalis 108]
gi|311776789|gb|ADQ06275.1| protein of unknown function DUF1680 [Caldicellulosiruptor
hydrothermalis 108]
Length = 654
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 92/390 (23%), Positives = 149/390 (38%), Gaps = 70/390 (17%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL--------------GFLALQADYLSHFHA 333
L +LY +T D K+L LA F +P + GF +L +YL
Sbjct: 200 LVKLYEVTGDRKYLELAKFFVDERGQEPYYFDIEYEKREKKSHWPGFKSLGREYLQAHKP 259
Query: 334 NTHIPIVIGSQMR----YEVTGD--------PLYKLIGTFFMDIVNASHSYATG--GTSA 379
+G +R Y D L+ + T F DIV Y TG G+SA
Sbjct: 260 LRQQKEAVGHAVRAVYLYSGAADVAAYTQDKELFDVCKTLFDDIVKRK-MYITGAIGSSA 318
Query: 380 --REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS- 436
F ++ +D +E TC + ++ + L + Y D ERAL N V+
Sbjct: 319 HGEAFTFEYDLPSDAAYAE---TCASVGLIFFAHRLNKIEPHAKYYDVVERALYNTVIGS 375
Query: 437 -IQRGTEPGVMIYMLPLG---RGVSKARSTHGWGTKFNSFW---CCYGTGIESFSKLGDS 489
Q G + Y+ PL + V K H + ++ CC + LG
Sbjct: 376 MSQDGKK---YFYVNPLEVYPKEVEKRFDRHHVKPERQPWFGCACCPPNVARLLASLGRY 432
Query: 490 IYFEEEGNVPGLYIIQYISSSFDWKSGHV-VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQ 548
+Y N G+Y+ YI SS + G V VL Q+V ++ +++ L S +
Sbjct: 433 VY---SYNHDGIYVNLYIGSSVQVEVGGVKVLLQQVSSY-PFEDMVKIDLKPSKEARF-- 486
Query: 549 LSSLNLRMPVW-----TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
L LR+P W Y NG + + Q L P ++ W ND++ +++P +
Sbjct: 487 --KLYLRIPGWCENYEVYVNGKKEEM--QKL----PSGYVCIERLWKENDQVVLKIPTEV 538
Query: 604 RTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+ + A++ GP + +
Sbjct: 539 KMVSSHPQVRSNVGKVAVVKGPVVFCAEEA 568
>gi|251798052|ref|YP_003012783.1| hypothetical protein Pjdr2_4067 [Paenibacillus sp. JDR-2]
gi|247545678|gb|ACT02697.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 622
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 92/406 (22%), Positives = 152/406 (37%), Gaps = 50/406 (12%)
Query: 253 KMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDV-LYRLYSITHDPKHLLLAHL 311
++ +M YF +++++ ER + GG N + +Y LY+ T DP + LA L
Sbjct: 135 RVIPFMTNYFRYQLKQL-----PERPLADWAKARGGDNLISVYWLYNRTGDPFLMELAQL 189
Query: 312 F--DKPCFLGFLALQADYL---SHFHANTHIPIVIGS----QMRYEVTGDPLYKLIGTFF 362
+ G L Q Y + F H+ V S ++Y +TGD K +
Sbjct: 190 LIVQTEDWKG-LYEQYPYWYRQTSFDHRVHVVNVAMSFKQPALQYLLTGDETDKAVVYKA 248
Query: 363 MDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAY 422
++ V A H G S E+ LA T S+ E C+ + +L R T + +
Sbjct: 249 INSVMACHGQVNGMFSGDEW------LAGTHPSQGTELCSVVEYMYSLENLIRITGDGFF 302
Query: 423 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG---WGTKFNS-------- 471
D E+ N ++ P ++ + ++ TH W N
Sbjct: 303 GDILEKIAYN---ALPAAISPDWKVHQY--DQQANQIMCTHAKRNWTENNNEANLFGVEP 357
Query: 472 -FWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSW 530
F CC + + KL ++ EG G+ I Y G + + +
Sbjct: 358 HFGCCTANMHQGWPKLAARLWMASEGG--GIAAISYAPCLVTAALGSDKKTKAEIQVETS 415
Query: 531 DPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWS 590
P+ R T+ E ++ LR+P W Q +NG+ PL P F+S W
Sbjct: 416 YPF-RDTVNIKVGLESSAAFAMKLRIPAWCEEPVLQ--INGEPYPLQPVNGFVSIERIWM 472
Query: 591 YNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEW 636
D+L + LP R + P + +GP +LA +W
Sbjct: 473 PEDELLLTLP---RHATL---IPRANGAAGVQYGPLMLAIPVKEQW 512
>gi|402489910|ref|ZP_10836703.1| hypothetical protein RCCGE510_19298 [Rhizobium sp. CCGE 510]
gi|401811249|gb|EJT03618.1| hypothetical protein RCCGE510_19298 [Rhizobium sp. CCGE 510]
Length = 640
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 88/396 (22%), Positives = 154/396 (38%), Gaps = 61/396 (15%)
Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
L +L +T + K+L L+ F + F A + FH T H P+
Sbjct: 198 ALVKLARVTGEKKYLELSKFFIDERGSEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 258 RDQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + + YAD E+AL NG L
Sbjct: 317 NEGFTDYYDLPN--ATAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
T+ Y PL A H W K++ CC + +G +Y + +
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426
Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+++ ++ +G L Q + WD + F+++ + +L+LR+P
Sbjct: 427 AVHLYGESTARLKLANGAEGELQQTTN--YPWD----GAVAFTTRLKTPATFALSLRIPD 480
Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W ++GA S+NG+ L L + +W+ D++ + LPL+LR + + A
Sbjct: 481 W--ADGATLSVNGEMLDLNANIRDGYARIDRQWADGDRVALHLPLALRPQYANPKVRQDA 538
Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
A++ GP + T T L+A+I P
Sbjct: 539 GRVALMRGPLVYCIET-------TDNGEDLNAIILP 567
>gi|56962984|ref|YP_174711.1| hypothetical protein ABC1212 [Bacillus clausii KSM-K16]
gi|56909223|dbj|BAD63750.1| conserved hypothetical protein [Bacillus clausii KSM-K16]
Length = 641
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 110/495 (22%), Positives = 190/495 (38%), Gaps = 93/495 (18%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG---TGYLSAFPTELFDSFEAL 222
V ++ A+A A + ++++ ++ +S Q G T Y PT+ + +
Sbjct: 72 VAKWIEAAAYTLAERPDPELEQRCDELIALISRAQQPDGYLNTHYTIKAPTKRWTNLRDN 131
Query: 223 KPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSL 282
++ + I +A Y QAL +V F + + +V + Y
Sbjct: 132 HELYVAGHLIEAAVA-----YYETTGKQALLD---VVCKFADLIDQVFGPEPGKLRGYDG 183
Query: 283 NEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF------ 331
++E + L +LY + D ++L LA F +P F A + F
Sbjct: 184 HQE---IELALLKLYRVKGDRRYLRLAQFFIEERGKEPHFFDDEAKKRGEDGTFWYSGRY 240
Query: 332 -HANTHIPI-----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYA 373
++ +H+P+ G +R E + L K+ T + ++ N Y
Sbjct: 241 EYSQSHLPVRQQQEATGHAVRAVYMYTAMADLANETDDEQLAKVCRTLWDNVTN-QQMYI 299
Query: 374 TGGTSAREFW------WD-PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYY 426
TGG + EF +D P LA T ETC + ++ ++++ + Y D
Sbjct: 300 TGGIGSAEFGEAFTFAYDLPNDLAYT------ETCASIGLVFWAKNMLELEADSRYGDVM 353
Query: 427 ERALTNGVLS-IQ-RGTEPGVMIYMLPLGRGVSKARSTHGW---GTKFNSFW---CCYGT 478
ERAL NG +S IQ GT+ Y+ PL A+ H T+ ++ CC
Sbjct: 354 ERALYNGTISGIQLDGTK---FFYVNPLEVWPQAAKHRHDLKHVKTERQPWFGCACCPPN 410
Query: 479 GIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL 538
+ +G IY + N G +I YI N+ I S + L+M
Sbjct: 411 IARLLASIGQYIYTTK--NQTG-FIHLYIG------------NESTLTIGSGEVGLKMKS 455
Query: 539 TFSSKQEVG--------QLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWS 590
+F K EVG + +L R+P W +N Q ++NG + + + W
Sbjct: 456 SFPWKGEVGLEVNPDTSRPFTLAFRIPSW--ANDYQLTVNGHFVDVEVRDGYAYVERTWQ 513
Query: 591 YNDKLTIQLPLSLRT 605
D ++IQ PL +
Sbjct: 514 KGDHISIQFPLETKV 528
>gi|16762630|ref|NP_458247.1| hypothetical protein STY4117 [Salmonella enterica subsp. enterica
serovar Typhi str. CT18]
gi|29144119|ref|NP_807461.1| hypothetical protein t3840 [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|213052815|ref|ZP_03345693.1| hypothetical protein Salmoneentericaenterica_07808 [Salmonella
enterica subsp. enterica serovar Typhi str. E00-7866]
gi|213428126|ref|ZP_03360876.1| hypothetical protein SentesTyphi_22630 [Salmonella enterica subsp.
enterica serovar Typhi str. E02-1180]
gi|213650623|ref|ZP_03380676.1| hypothetical protein SentesTy_27330 [Salmonella enterica subsp.
enterica serovar Typhi str. J185]
gi|213854603|ref|ZP_03382843.1| hypothetical protein SentesT_11074 [Salmonella enterica subsp.
enterica serovar Typhi str. M223]
gi|289826027|ref|ZP_06545185.1| hypothetical protein Salmonellentericaenterica_11725 [Salmonella
enterica subsp. enterica serovar Typhi str. E98-3139]
gi|378962007|ref|YP_005219493.1| hypothetical protein STBHUCCB_40530 [Salmonella enterica subsp.
enterica serovar Typhi str. P-stx-12]
gi|25333173|pir||AG0977 conserved hypothetical protein STY4117 [imported] - Salmonella
enterica subsp. enterica serovar Typhi (strain CT18)
gi|16504936|emb|CAD07947.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi]
gi|29139756|gb|AAO71321.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|374355879|gb|AEZ47640.1| hypothetical protein STBHUCCB_40530 [Salmonella enterica subsp.
enterica serovar Typhi str. P-stx-12]
Length = 651
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 87/398 (21%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ +G IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSIGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +++ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTISLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|365968450|ref|YP_004950011.1| hypothetical protein EcWSU1_00150 [Enterobacter cloacae EcWSU1]
gi|365747363|gb|AEW71590.1| hypothetical protein EcWSU1_00150 [Enterobacter cloacae EcWSU1]
Length = 667
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 82/359 (22%), Positives = 127/359 (35%), Gaps = 61/359 (16%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFHANTHIPIVIGSQMR 346
L RLY +T +P++L L F +P F + SH+ NT+ P +
Sbjct: 208 ALMRLYDVTQEPRYLALVKYFIDTRGTQPHFYDIEYEKRGRTSHW--NTYGPAWMVKDKA 265
Query: 347 YEVTGDPL---YKLIG-----TFFM----DIVNASHS-------------------YATG 375
Y PL + IG + M + SH Y TG
Sbjct: 266 YSQAHQPLAEQHTAIGHAVRFVYLMAGMAHLARLSHDEDKRQDCLRLWNNMAQRQLYITG 325
Query: 376 G----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
G +S F D DT+ +E +C + ++ +R + + YAD ERAL
Sbjct: 326 GIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALY 382
Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIESFSK 485
N VL + Y+ PL H + W CC +
Sbjct: 383 NTVLG-GMALDGKHFFYVNPLEVHPKTLAFNHVYDHVKPVRQRWFGCACCPPNIARVLTS 441
Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
LG +Y + L+I Y+ + L ++ W + + +T +
Sbjct: 442 LGHYLYTVRQD---ALFINLYVGNDVAIPVDEGTLQLRISGNYPWQEEVNIEVTSPAPV- 497
Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+L LR+P W S SLNG+ + +L T RW D LT+ LP+ +R
Sbjct: 498 ---THTLALRLPDWCASPAM--SLNGERVTGDVSRGYLYLTRRWQEGDTLTLTLPMPVR 551
>gi|329927011|ref|ZP_08281398.1| hypothetical protein HMPREF9412_4716 [Paenibacillus sp. HGF5]
gi|328938722|gb|EGG35099.1| hypothetical protein HMPREF9412_4716 [Paenibacillus sp. HGF5]
Length = 658
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 110/494 (22%), Positives = 195/494 (39%), Gaps = 79/494 (15%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFEALK 223
V +L A+A A+ + ++E++ ++ +++ Q GYL+ + T E + L
Sbjct: 79 VAKWLEAAAYSLATHPDPKLEEQVDGLIDLVADAQQP--DGYLNTYFTVKEPEKRWTNLT 136
Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
Y H I AG+ Y + L + + ++ + V + H + +
Sbjct: 137 DCHELYCAGHMIEAGVA-HYRATGKRKLLDVVCRLADH----IDTVFGPEDGKIHGFDGH 191
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFL------ALQADYLSHFH 332
+E + L +LY +T +P++L L+ F +P F FL ++ Y S H
Sbjct: 192 QE---IELALVKLYEVTQEPRYLSLSQYFIDERGTEPHF--FLQEWEQRGKKSFYRSVLH 246
Query: 333 A------NTHIPI-----VIGSQMRY-----------EVTGDP-LYKLIGTFFMDIVNAS 369
A +H+P+ +G +R T DP L + T + ++V+
Sbjct: 247 APHLAYHQSHLPVREQKEAVGHSVRAVYMYTAMADLAARTKDPALLEACDTLWRNMVH-K 305
Query: 370 HSYATGGTSA----REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
Y TGG + F D DT+ SE TC + ++ ++ + + + + YAD
Sbjct: 306 QMYITGGIGSTHHGEAFTTDYDLPNDTVYSE---TCASIGLIFFAQRMLQLSPKSEYADV 362
Query: 426 YERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYG 477
ERAL N V+ Q G Y+ PL + R G W CC
Sbjct: 363 MERALFNTVIGSMAQDGRH---FFYVNPLEVWPAACRYNPGKAHVKPVRPGWFACACCPP 419
Query: 478 TGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
S LG+ +Y + LY YI + + G V + + + WD +T
Sbjct: 420 NVARLLSSLGEYVYTMNDDT---LYAHLYIGGEAEVRFGDVPVKVMQNSALPWDG--DVT 474
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKL 595
LT +Q V ++ LR+P W+ A +NGQ + + + W+ D
Sbjct: 475 LTLQPEQAVEW--TVALRIPDWSRGK-AGLRVNGQEMNVEDITQDGYACVKRVWAPGD-- 529
Query: 596 TIQLPLSLRTEAIQ 609
T++L S+ ++
Sbjct: 530 TVELAFSMEIHQVR 543
>gi|159041539|ref|YP_001540791.1| hypothetical protein Cmaq_0969 [Caldivirga maquilingensis IC-167]
gi|157920374|gb|ABW01801.1| protein of unknown function DUF1680 [Caldivirga maquilingensis
IC-167]
Length = 634
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 109/262 (41%), Gaps = 30/262 (11%)
Query: 350 TGD-PLYKLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADTLGSENEETCTTYN 405
TGD L++ + ++D+ + Y TGG +R E +P L + ETC
Sbjct: 275 TGDKALWEALSNLWVDL-TGTRMYVTGGVGSRHEGEAIGEPYELPND--RAYSETCAAVA 331
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
+ + + T + YAD E AL N L+ + Y+ PL + GW
Sbjct: 332 NVMWNYRMLLATGDAKYADIMELALYNAALA-GISLDGKSYFYVNPL--------ANRGW 382
Query: 466 GTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
+ F CC + L IY G++I YI+S ++ K
Sbjct: 383 HRRQPWFDVACCPPNIARLIASLPGYIYSTSSD---GVWIHLYIASEAKVNLNGGIVELK 439
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG--QNLPLPPPGN 581
V+ WD +++T+ S + E ++ LR+P W S G + +NG Q + L P
Sbjct: 440 VNTDYPWDGEVKVTVNPSKEDEF----TIYLRIPGW--SRGGKLLINGVEQGVEL-KPST 492
Query: 582 FLSATERWSYNDKLTIQLPLSL 603
+L W D++ +++P+S+
Sbjct: 493 YLGVKRTWRSGDEVILRIPMSI 514
>gi|448408500|ref|ZP_21574295.1| hypothetical protein C475_08191 [Halosimplex carlsbadense 2-9-1]
gi|445674355|gb|ELZ26899.1| hypothetical protein C475_08191 [Halosimplex carlsbadense 2-9-1]
Length = 637
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 75/192 (39%), Gaps = 19/192 (9%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + ++ LF + AYAD ER L NG L+ G + Y+ PL
Sbjct: 338 ETCAAVGSVFWNQRLFELEPDPAYADLIERTLYNGFLA-GVGMDGEEFFYVNPLASDGDH 396
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
RS GW T CC F+ LG +Y G LY+ QY+ S
Sbjct: 397 HRS--GWFTCA----CCPPNAARLFASLGQYVYSTTGGE---LYVTQYVGSDLSTTVEGT 447
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
+ + + WD + + + V NLR+P W ++ A +++G +
Sbjct: 448 AVELDQESALPWDGEVAIEVDADGAVPV------NLRIPEW--ADEATVTVDGDEVSHDG 499
Query: 579 PGNFLSATERWS 590
G F+ W+
Sbjct: 500 SG-FVRVEREWN 510
>gi|56415571|ref|YP_152646.1| hypothetical protein SPA3530 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|197364498|ref|YP_002144135.1| hypothetical protein SSPA3296 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
gi|56129828|gb|AAV79334.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|197095975|emb|CAR61560.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
Length = 651
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 87/398 (21%), Positives = 141/398 (35%), Gaps = 75/398 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMTLASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHTVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDSVYAE---SCASIGLMMFARRMLEMEADSQYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H KFN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ +G IY LYI Y+ +S + + L ++ W + ++ +
Sbjct: 420 ARVLTSIGHYIYTP---RADALYINMYVGNSLEVPVENGALKLRIGGNYPW--HEQVKIA 474
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
S Q V +L LR+P W A+ +LNG + +L W D +++ L
Sbjct: 475 IDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTISLTL 530
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
P+ +R A AI GP Y L +GE
Sbjct: 531 PMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 568
>gi|116254107|ref|YP_769945.1| hypothetical protein RL4374 [Rhizobium leguminosarum bv. viciae
3841]
gi|115258755|emb|CAK09861.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 640
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 89/396 (22%), Positives = 157/396 (39%), Gaps = 61/396 (15%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFH------ANTHIPI 339
L +L +T + K+L L+ F +P F A + +S +H A H P+
Sbjct: 198 ALVKLARVTDEKKYLDLSKYFIDERGTEPHFFTAEASRDGRDVSEYHQKTYEYAQAHQPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 258 RAQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + + YAD E+AL NG L
Sbjct: 317 NEGFTDYFDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
T+ Y PL A H W K++ CC + +G +Y + +
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVSDNEI- 426
Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+++ ++ +G V L Q + W+ + F+++ E +L+LR+P
Sbjct: 427 AVHLYGESTARLKLANGAEVELEQTTN--YPWEG----AVAFTTRLEKPAKFALSLRIPD 480
Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W + GA S+NG+ L L ++ W+ D++ + LPL+LR + + A
Sbjct: 481 W--AEGATLSVNGEMLDLNANMRDGYIRIDREWAAGDRVALYLPLALRPQYANPKVRQDA 538
Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
A++ GP + T T L+A++ P
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGEDLNAIVLP 567
>gi|332980748|ref|YP_004462189.1| hypothetical protein Mahau_0144 [Mahella australiensis 50-1 BON]
gi|332698426|gb|AEE95367.1| protein of unknown function DUF1680 [Mahella australiensis 50-1
BON]
Length = 647
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 116/539 (21%), Positives = 200/539 (37%), Gaps = 75/539 (13%)
Query: 140 KTASLPTPGKAYGGWENPISELRGHFVGHYLSASAQMWASTHNATIKEKMSTVVFSLSEC 199
+ A+ GK YG P+ + + ++ A + A + +K + + +S+
Sbjct: 56 RIAAGEVSGKHYG----PV--FQDSDLAKWMEAVSCSLALRSDDDLKLHLEEAIALVSKA 109
Query: 200 QNKIGTGYLSAFPT--ELFDSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATW 257
Q GYL + T E + L+ Y H I A + + Y + N L +A
Sbjct: 110 QE--ADGYLDTYFTIEEPSARWTNLRDKHELYCAGHMIEAAVAN-YEVTGNKTLLNVACR 166
Query: 258 MVEYFYNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK--- 314
+ ++ + ++ S +RH Y +EE + L +LY T++ K+L LAH F +
Sbjct: 167 LADH----ICEMFGPESTKRHGYPGHEE---IELALVKLYHATNERKYLDLAHYFIRERG 219
Query: 315 --PCFLGFLAL-----------QADYLSHFHANTHIPI----VIGSQMRYEV-------- 349
P + A+ L +F A H+P+ IG +R
Sbjct: 220 KAPYYFKIEAMARGEAKLDELWDPSKLEYFQA--HMPVTEQEAIGHAVRAMYLYSGMTDV 277
Query: 350 ---TGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--ETCTTY 404
TGD D V Y TGG + F + A L ++ ETC +
Sbjct: 278 ALETGDETIAQACRRLWDDVVKRKMYITGGVGSSSFG-EAFTFAYDLPNDTAYTETCASI 336
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR--GVSKARST 462
++ + +F+ ++ Y D ERAL N V + + Y+ PL V R
Sbjct: 337 GLIFWAHRMFKMDQDAKYIDVMERALYNTVFA-SMSLDGKRYFYVNPLEVWPEVCHKRED 395
Query: 463 HGWGTKFNSFW----CCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISS--SFDWKS 515
H W CC + +G +Y +E+ N+ L++ Y+ F+
Sbjct: 396 HRHVKTERQKWYDCACCPPNIARLLTSIGKYVYALDEDKNM--LFVNLYMDGQVKFNLND 453
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
++L Q D + WD + T+T ++ SL R+P W +NGQ +
Sbjct: 454 KEIMLEQ--DTVYPWDGSISFTVTSNTPVTF----SLAFRIPDWC--KKWSIKINGQEIQ 505
Query: 576 LPPPGN-FLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+ T W DK+ + L + + + A AI GP + +
Sbjct: 506 EHEKNKGYAVITRAWVAGDKVELMLDMPVMMMRANPEVRADAGKVAIQRGPVVYCAEEA 564
>gi|255531160|ref|YP_003091532.1| hypothetical protein Phep_1254 [Pedobacter heparinus DSM 2366]
gi|255344144|gb|ACU03470.1| protein of unknown function DUF1680 [Pedobacter heparinus DSM 2366]
Length = 684
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 62/308 (20%), Positives = 114/308 (37%), Gaps = 42/308 (13%)
Query: 345 MRYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTT 403
+ ++ TGD Y K + T F D++ H G SA E L ++ E C T
Sbjct: 287 INFQRTGDSTYLKSLKTVFNDLMTL-HGLPNGIFSADE------DLHGNQPTQGTELCAT 339
Query: 404 YNMLKVSRHLFRWTKEIAYADYYERALTNGV---------------LSIQRGTEPGVMIY 448
+ + T + Y D ER N + ++ Q GV +
Sbjct: 340 VEAMYSLEEIINITGDTHYIDALERMTFNAMPSQTTDDYHEKQYFQMANQIEISRGVFAF 399
Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
LP R ++ + + CCY + ++K +++ + E GL + Y
Sbjct: 400 TLPFDRKMNCVLGAK------SGYTCCYVNMHQGWTKFSQNLWHKTEN---GLAALIYGP 450
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
++ K G + ++ + ++ ++ S K+ V LR+P W A
Sbjct: 451 NTLSTKVGAQQTDVTIEEVTNYPFEDQINFNLSLKKAVA--FPFQLRIPTWCKE--AVIL 506
Query: 569 LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLL 628
+NG+ G ++ W D+LT+QLP+ + D+ +A+ GP +
Sbjct: 507 INGKIYSKEKGGKIITVNRTWQNKDRLTLQLPMEIAVSEWADNS------RAVERGPLVY 560
Query: 629 AGHTSGEW 636
+W
Sbjct: 561 GLKVQEKW 568
>gi|389805630|ref|ZP_10202778.1| hypothetical protein UUA_00270 [Rhodanobacter thiooxydans LCS2]
gi|388447325|gb|EIM03335.1| hypothetical protein UUA_00270 [Rhodanobacter thiooxydans LCS2]
Length = 607
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/209 (22%), Positives = 85/209 (40%), Gaps = 23/209 (11%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC++ ++++R L T E YA+ ER N +L Q Y+ P GR V
Sbjct: 303 ETCSSLAWIQLNRELLAITGEARYAEEIERTGYNDLLGAQAPNGEDWCYYVFPNGRRV-- 360
Query: 459 ARSTHGWGTKFNSFW-CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK-SG 516
++W CC +G + +L Y ++ + + S+SF +G
Sbjct: 361 ----------HTTYWRCCKSSGAMALEELPALAYARDDDGAIAVNLYGAGSASFALDGAG 410
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
+ + Q D LR+ + + +L LR+P W + A +NG++ +
Sbjct: 411 ELRIEQHTAYPYPDDVRLRIAVGRPMR------FTLKLRIPSW--AKDATLVINGEDAGV 462
Query: 577 P-PPGNFLSATERWSYNDKLTIQLPLSLR 604
PG++ W D+L + P+ R
Sbjct: 463 ALSPGHYAVLEREWHDGDELVARFPMQPR 491
>gi|392977054|ref|YP_006475642.1| hypothetical protein A3UG_00935 [Enterobacter cloacae subsp.
dissolvens SDM]
gi|392322987|gb|AFM57940.1| hypothetical protein A3UG_00935 [Enterobacter cloacae subsp.
dissolvens SDM]
Length = 651
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 78/365 (21%), Positives = 127/365 (34%), Gaps = 73/365 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T +P++L L F +P F + S++H +
Sbjct: 192 ALMRLYDVTQEPRYLNLVKYFIEARGTQPHFYDTEYEKRGRTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H P+ IG +R+ ++ D + + + Y TGG
Sbjct: 252 QAHQPLAEQQTAIGHAVRFVYLMAGMAHLARLSKDDAKRQDCLRLWNNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D DT+ +E +C + ++ +R + + YAD ERAL N
Sbjct: 312 GSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSRYADVMERALYNT 368
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTG 479
VL + Y+ PL H FN + CC
Sbjct: 369 VLG-GMALDGKHFFYVNPL--------EVHPRTLAFNHIYDHVKPVRQRWFGCACCPPNI 419
Query: 480 IESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ LG IY L+I ++ + G L ++ W + + +
Sbjct: 420 ARVLTSLGHYIYTVRPD---ALFINLFVGNEVTIPVGDETLKLRISGNYPWQKEVNIEIA 476
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQL 599
V +L LR+P W + SLNG+ + +L T RW D LT+ L
Sbjct: 477 ----SPVPVTHTLALRLPDWCAN--PHVSLNGEGMTGEVSRGYLHLTRRWQEGDTLTLTL 530
Query: 600 PLSLR 604
P+ +R
Sbjct: 531 PMPVR 535
>gi|190333374|gb|ACE73687.1| hypothetical protein [Geobacillus stearothermophilus]
Length = 642
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 63/265 (23%), Positives = 107/265 (40%), Gaps = 26/265 (9%)
Query: 351 GDPLYKLIGTFFMDIVNASHSYATGGTSA----REFWWDPKRLADTLGSENEETCTTYNM 406
GD K + V Y TGG + F +D DT +E TC + +
Sbjct: 278 GDESLKQACQTLWENVTKRQMYITGGVGSSAFGESFTFDFDLPNDTAYAE---TCASIAL 334
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG---RGVSK--ARS 461
+ +R + + YAD ERAL NG +S + Y+ PL + + R
Sbjct: 335 VFWTRRMLELEMDGKYADVMERALYNGTIS-GMDLDGKKFFYVNPLEVWPKACERHDKRH 393
Query: 462 THGWGTKFNSFWCCYGTGIESFSKLGDSIYFE-EEGNVPGLYIIQYISSSFDWKSGHVVL 520
K+ S CC + +G IY + + LY+ I + D +S V +
Sbjct: 394 VKPVRQKWFSCACCPPNLARLIASIGHYIYLQTSDALFVHLYVGSDIQTEIDGRS--VKI 451
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP-- 578
Q+ + WD +R+T++ S E +L LR+P W GA+ ++NG+ + + P
Sbjct: 452 MQETN--YPWDGTVRLTVSPESAGEF----TLGLRIPGW--CRGAEVTINGEKVDIVPLI 503
Query: 579 PGNFLSATERWSYNDKLTIQLPLSL 603
+ W D++ + P+ +
Sbjct: 504 KKGYAYIRRVWQQGDEVKLYFPMPV 528
>gi|421075310|ref|ZP_15536325.1| protein of unknown function DUF1680 [Pelosinus fermentans JBW45]
gi|392526752|gb|EIW49863.1| protein of unknown function DUF1680 [Pelosinus fermentans JBW45]
Length = 650
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 113/500 (22%), Positives = 183/500 (36%), Gaps = 89/500 (17%)
Query: 181 HNATIKEKMSTVVFSLSECQNKIGTGYLSAFP---------TELFDSFEALKPVWAPYYT 231
H + EK++ + C + GYL+ + T L D+ E Y
Sbjct: 92 HKDSALEKVADAAIDIV-CAAQQADGYLNTYYILNGLDKRWTNLQDNHEL--------YC 142
Query: 232 IHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMND 291
+ ++ G + Y + LK A V+Y V ++ ++H Y +E +
Sbjct: 143 LGHMIEGAISYYQATGKDKLLKAAIRYVDY----VDTILGPEQGKKHGYPGHEV---IEL 195
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----------------------KPCFLGFLALQADY- 327
L +LY IT D KHL LA F K + + QAD
Sbjct: 196 ALVKLYQITKDEKHLKLAKYFIDERGQQPLYFQEETKRYGNDFPWKDSYFQYKYYQADQP 255
Query: 328 -----LSHFHANTHIPIVIGSQMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG--GTSA 379
++ HA + G +T D LY + ++ Y TG G SA
Sbjct: 256 VRSQQVAEGHAVRATYLYSGMADVARLTKDEELYAACKRIWNNMTQ-RQMYITGSIGASA 314
Query: 380 --REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
F +D DT+ E TC + + +R + + E YAD E+ L NG+LS
Sbjct: 315 YGESFTYDYDLPNDTVYGE---TCASIGAVFFARRMLEISPEGEYADVIEKELFNGILS- 370
Query: 438 QRGTEPGVMIYMLPLGR--GVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIY 491
+ Y+ PL SK H W CC F+ LG IY
Sbjct: 371 GMSMDGKSFFYVNPLEVVPEASKKDQLHHHVEVERQKWFGCACCPPNIARLFASLGSYIY 430
Query: 492 -FEEEGNV--PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQ 548
+ + N LYI ++ +FD + +N V WD + +T++ + +E
Sbjct: 431 SYSAKSNTLWLHLYIGGELTHTFDSQE----VNFTVATNYPWDEDVEITVSLAESKEF-- 484
Query: 549 LSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
+ LR+P W + ++NG+ P + W D I L ++ E +
Sbjct: 485 --TYALRIPGWC--KAYEVNVNGEKTNAPIVNGYAYLQREWKNGD--VIHLHFAMPIEVM 538
Query: 609 QDD---RPEYASIQAILFGP 625
Q + R + + A++ GP
Sbjct: 539 QANPRVREDLGKV-AMMRGP 557
>gi|213418442|ref|ZP_03351508.1| hypothetical protein Salmonentericaenterica_11358 [Salmonella
enterica subsp. enterica serovar Typhi str. E01-6750]
Length = 385
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 93/253 (36%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 68 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 120
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + +G IY LYI
Sbjct: 121 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYIYTPR---ADALYIN 175
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + + L ++ W + ++ + S Q V +L LR+P W
Sbjct: 176 MYVGNSMEIPVENGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 229
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +++ LP+ +R A AI G
Sbjct: 230 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTISLTLPMPVRRVYGNPLARHVAGKVAIQRG 289
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 290 PLVYCLEQADNGE 302
>gi|418817745|ref|ZP_13373230.1| hypothetical protein SEEN538_23719 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
gi|392787738|gb|EJA44277.1| hypothetical protein SEEN538_23719 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
Length = 651
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 60/253 (23%), Positives = 92/253 (36%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 334 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 386
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H FN + CC + LG IY LYI
Sbjct: 387 --EVHPKSLNFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYIN 441
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + + L ++ W + ++ + S Q V +L LR+P W
Sbjct: 442 MYVGNSMEIPVENGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 495
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +T+ LP+ +R A AI G
Sbjct: 496 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRG 555
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 556 PLVYCLEQADNGE 568
>gi|424792517|ref|ZP_18218744.1| hypothetical protein XTG29_01554 [Xanthomonas translucens pv.
graminis ART-Xtg29]
gi|422797058|gb|EKU25452.1| hypothetical protein XTG29_01554 [Xanthomonas translucens pv.
graminis ART-Xtg29]
Length = 664
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 47/211 (22%), Positives = 79/211 (37%), Gaps = 18/211 (8%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ + + + + YAD ERAL N VL+ + Y+ PL
Sbjct: 339 ESCASIGLMMFANRMLQLAPDSRYADVMERALYNTVLA-GMALDGRHFFYVNPLEVHPPT 397
Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
HG+ W CC + LG +Y + LY+ Y+ S
Sbjct: 398 VHGNHGFDHVKPVRQRWFGCACCPPNIARVVTSLGHYLYTRRDDT---LYVNLYVGSDAA 454
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
+ G L + W + +++ + E G L LR+P W Q LNG+
Sbjct: 455 FDVGGQTLTLRQRGEYPWQEQVELSMDCDAPIEAG----LALRLPDWC--RAPQLQLNGE 508
Query: 573 NLPLPP--PGNFLSATERWSYNDKLTIQLPL 601
+ + + +RW D L + LP+
Sbjct: 509 AVAIAAHLQHGYCVLRQRWQRGDTLHLHLPM 539
>gi|213582277|ref|ZP_03364103.1| hypothetical protein SentesTyph_14169 [Salmonella enterica subsp.
enterica serovar Typhi str. E98-0664]
Length = 380
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 93/253 (36%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ PL
Sbjct: 63 ESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPL------ 115
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + +G IY LYI
Sbjct: 116 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYIYTPR---ADALYIN 170
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + + L ++ W + ++ + S Q V +L LR+P W
Sbjct: 171 MYVGNSMEIPVENGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 224
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +++ LP+ +R A AI G
Sbjct: 225 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTISLTLPMPVRRVYGNPLARHVAGKVAIQRG 284
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 285 PLVYCLEQADNGE 297
>gi|241206592|ref|YP_002977688.1| hypothetical protein Rleg_3907 [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|240860482|gb|ACS58149.1| protein of unknown function DUF1680 [Rhizobium leguminosarum bv.
trifolii WSM1325]
Length = 648
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 91/402 (22%), Positives = 159/402 (39%), Gaps = 73/402 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFH------ANTHIPI 339
L +L +T + K+L L+ F +P F A + +S +H A H P+
Sbjct: 206 ALVKLARVTDEKKYLELSKYFIDERGTEPHFFTAEASRDGRDVSEYHQKTYEYAQAHQPV 265
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 266 RAQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 324
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + + YAD E+AL NG L
Sbjct: 325 NEGFTDYFDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL---- 378
Query: 440 GTEPGVMI------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
PG+ I Y PL A H W K++ CC + +G +Y
Sbjct: 379 ---PGLSIDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAV 429
Query: 494 EEGNVPGLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ + +++ ++ +G V L Q + W+ + F+++ E +L
Sbjct: 430 SDNEI-AVHLYGESTARLKLANGAEVELEQTTN--YPWEG----AVAFTTRLEKPAKFAL 482
Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
+LR+P W ++GA S+NG+ L L + W+ D++ + LPL+LR +
Sbjct: 483 SLRVPDW--ADGATLSVNGEMLDLNANMRDGYARIDREWAAGDRVALYLPLALRPQYANP 540
Query: 611 DRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
+ A A++ GP + T T L+A++ P
Sbjct: 541 KVRQDAGRVALMRGPLVYCVET-------TDNGEDLNAIVLP 575
>gi|89067251|ref|ZP_01154764.1| hypothetical protein OG2516_10441 [Oceanicola granulosus HTCC2516]
gi|89046820|gb|EAR52874.1| hypothetical protein OG2516_10441 [Oceanicola granulosus HTCC2516]
Length = 633
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 55/228 (24%), Positives = 92/228 (40%), Gaps = 18/228 (7%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVS 457
ETC + M+ + + + YAD E AL N L+ + R E L
Sbjct: 332 ETCASVAMVFWAARMLNLDLDGQYADILELALYNNALAGLSRDGEHYFYDNKL------E 385
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
S H W ++ CC + + Y E + +++ +++ G
Sbjct: 386 SDGSHHRWA--WHECPCCTMNVSRLVASVAGYFYGVAETEI-AVHLYGGATATLPVAGGR 442
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
V L + D WD +R+ L + E + +L+LR+P W + GA AS+NG+ L +
Sbjct: 443 VTLTETSD--YPWDGAVRIAL----EPEGTRTFTLSLRVPGWCH--GATASVNGEALEVA 494
Query: 578 PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
P +L T W+ D + + LP+ D + A A+ GP
Sbjct: 495 PERGYLKITRDWAPGDVVELNLPMQAERLYAHPDVRQDAGRVALRRGP 542
>gi|383189042|ref|YP_005199170.1| hypothetical protein Rahaq2_1140 [Rahnella aquatilis CIP 78.65 =
ATCC 33071]
gi|371587300|gb|AEX51030.1| hypothetical protein Rahaq2_1140 [Rahnella aquatilis CIP 78.65 =
ATCC 33071]
Length = 657
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 84/221 (38%), Gaps = 34/221 (15%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + ++ + + + + YAD ERAL N VL+ + Y+ PL
Sbjct: 339 ETCASIGLMMFANRMLQMDADSRYADVMERALYNTVLA-GMALDGKHFFYVNPL------ 391
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H FN + CC + LG IY + G+ I
Sbjct: 392 --EVHPKSIPFNHIYDHVKPVRQRWFGCACCPPNIARLLASLGHYIYTQRPD---GVDIN 446
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
YI S D G L K W R+ + + Q + ++L LR+P W S
Sbjct: 447 LYIGSDVDATIGGKALRLKQSGGYPWAE--RVLIEIDTDQPLE--ATLALRLPDWCGS-- 500
Query: 565 AQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
Q +LNG L L +L T+ W D++ + LP+ +
Sbjct: 501 PQVTLNGHPLELASLTQRGYLRLTQEWQKGDRIEMTLPMPV 541
>gi|190893687|ref|YP_001980229.1| hypothetical protein RHECIAT_CH0004122 [Rhizobium etli CIAT 652]
gi|190698966|gb|ACE93051.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
Length = 640
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 92/429 (21%), Positives = 165/429 (38%), Gaps = 73/429 (17%)
Query: 292 VLYRLYSITHDPKHLLLAHLF------DKPCFLGFLALQADYLSHFHANT------HIPI 339
L +L +T + K+L L+ F + F A + FH T H P+
Sbjct: 198 ALVKLARVTGEKKYLDLSKYFIDERGTEPHFFTAEAARDGRSAADFHQKTYEYGQAHQPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 258 REQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + + YAD E+AL NG L
Sbjct: 317 NEGFTDYYDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 373
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
T+ Y PL A H W K++ CC + +G +Y + +
Sbjct: 374 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVADDEI- 426
Query: 500 GLYIIQYISSSFDWKSGHVV-LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+++ ++ +G V L Q + W+ + F+++ E +L+LR+P
Sbjct: 427 AVHLYGESTTRLKLANGAAVELQQATN--YPWE----GAVAFTTRLEKPAKFALSLRIPD 480
Query: 559 WTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W ++GA S+NG+ L L + +W D++ + LPLSLR + + A
Sbjct: 481 W--ADGATLSVNGEKLDLGAVTRDGYARIDRQWVDGDRVDLFLPLSLRPQYANPKVRQDA 538
Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMS 676
A++ GP + T T + L+ ++ P + S T V+
Sbjct: 539 GRVALMRGPLVYCVET-------TDNGQDLNTIVLP------------RELSAAETVVLK 579
Query: 677 NSNQSITME 685
+ N ++ ++
Sbjct: 580 DLNDAVALD 588
>gi|384538328|ref|YP_005722412.1| hypothetical protein SM11_pD0078 [Sinorhizobium meliloti SM11]
gi|336036981|gb|AEH82911.1| hypothetical protein SM11_pD0078 [Sinorhizobium meliloti SM11]
Length = 640
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 105/248 (42%), Gaps = 33/248 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
ETC + ++ + + + YAD E+AL NG L PG+ I Y PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFFYDNPL 386
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
S R H W K++ CC + +G +Y E + +++ ++
Sbjct: 387 E---STGRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESTARLK 439
Query: 513 WKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
SG V L Q+ + W+ + F++K + +L+LR+P W + GA S+NG
Sbjct: 440 LASGAEVELRQETN--YPWE----GAIAFTTKLDRPAKFALSLRIPEW--AAGATLSVNG 491
Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
L L G + WS D++ + LPL+LR + + A++ GP Y
Sbjct: 492 TMLDLSAHLTGGYARIEREWSDGDRVALYLPLALRPQYANPKVRQDVGRVALMRGPLVYC 551
Query: 628 LAGHTSGE 635
+ +GE
Sbjct: 552 VEATDNGE 559
>gi|384534128|ref|YP_005716792.1| hypothetical protein [Sinorhizobium meliloti BL225C]
gi|433610342|ref|YP_007193803.1| hypothetical protein C770_GR4pD0078 [Sinorhizobium meliloti GR4]
gi|333816304|gb|AEG08971.1| protein of unknown function DUF1680 [Sinorhizobium meliloti BL225C]
gi|429555284|gb|AGA10204.1| hypothetical protein C770_GR4pD0078 [Sinorhizobium meliloti GR4]
Length = 640
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 105/248 (42%), Gaps = 33/248 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
ETC + ++ + + + YAD E+AL NG L PG+ I Y PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFFYDNPL 386
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
S R H W K++ CC + +G +Y E + +++ ++
Sbjct: 387 E---STGRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESTARLK 439
Query: 513 WKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
SG V L Q+ + W+ + F++K + +L+LR+P W + GA S+NG
Sbjct: 440 LASGAEVELRQETN--YPWE----GAIAFTTKLDRPAKFALSLRIPEW--AAGATLSVNG 491
Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
L L G + WS D++ + LPL+LR + + A++ GP Y
Sbjct: 492 TMLDLSAHLTGGYARIEREWSDGDRVALYLPLALRPQYANPKVRQDVGRVALMRGPLVYC 551
Query: 628 LAGHTSGE 635
+ +GE
Sbjct: 552 VEATDNGE 559
>gi|302883148|ref|XP_003040476.1| hypothetical protein NECHADRAFT_44741 [Nectria haematococca mpVI
77-13-4]
gi|256721360|gb|EEU34763.1| hypothetical protein NECHADRAFT_44741 [Nectria haematococca mpVI
77-13-4]
Length = 645
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 89/212 (41%), Gaps = 23/212 (10%)
Query: 354 LYKLIGTFFMDIVNASHSYATGGTSAREFW--WDPKRLADTLGSEN--EETCTTYNMLKV 409
L +G + D+V+ Y TG + W + P + L E ETC T+ ++
Sbjct: 291 LKAALGRLWRDMVD-KRMYVTGSLGSVRQWEGFGPAYILPDLEHEGCYAETCATFALINW 349
Query: 410 SRHLFRWTKEIAYADYYERALTNGVL-SIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
+ R + YAD E AL NG L ++ + + +L +G K RS K
Sbjct: 350 CARMLRLDLDAEYADVMEVALYNGFLGAVNQDGDAFYYENVLRTRKGEFKERS------K 403
Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIV 528
+ CC + LG S+ + ++ + + I QYI S V++ QK D +
Sbjct: 404 WFGVACCPPNVAKLLGNLG-SLIYSQDASTNLVAIHQYIDSELKIPESGVIIRQKTD--M 460
Query: 529 SWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
WD + +++ S ++L LR+P W
Sbjct: 461 PWDGQVVLSIQGS--------ANLALRIPSWA 484
>gi|424872619|ref|ZP_18296281.1| hypothetical protein Rleg5DRAFT_4131 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393168320|gb|EJC68367.1| hypothetical protein Rleg5DRAFT_4131 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 648
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 89/396 (22%), Positives = 156/396 (39%), Gaps = 61/396 (15%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFH------ANTHIPI 339
L +L +T + K+L L+ F +P F A + +S +H A H P+
Sbjct: 206 ALVKLARVTDEKKYLELSKYFIDERGTEPHFFTAEASRDGRDVSEYHQKTYEYAQAHQPV 265
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 266 RAQTKVVGHAVRAMYLYSGMADIATEYKDDSLTAALETLWDDLTT-KQMYITGGIGPAAS 324
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + + YAD E+AL NG L
Sbjct: 325 NEGFTDYFDLPND--TAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GL 381
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
T+ Y PL A H W K++ CC + +G +Y + +
Sbjct: 382 STDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIARLVTSIGSYMYAVSDNEI- 434
Query: 500 GLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+++ ++ +G V L Q + W+ + F+++ E +L+LR+P
Sbjct: 435 AVHLYGESTARLKLANGAEVELEQTTN--YPWEG----AVAFTTRLEKPARFALSLRIPD 488
Query: 559 WTYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYA 616
W + GA S+NG+ L L + W+ D++ + LPL+LR + + A
Sbjct: 489 W--AEGATLSVNGEMLDLNANMYDGYARIDREWAAGDRVALYLPLALRPQYANPKVRQDA 546
Query: 617 SIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
A++ GP + T T L+A++ P
Sbjct: 547 GRVALMRGPLVYCVET-------TDNGEDLNAIVLP 575
>gi|359791407|ref|ZP_09294266.1| hypothetical protein MAXJ12_18203 [Mesorhizobium alhagi CCNWXJ12-2]
gi|359252565|gb|EHK55793.1| hypothetical protein MAXJ12_18203 [Mesorhizobium alhagi CCNWXJ12-2]
Length = 634
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 99/478 (20%), Positives = 189/478 (39%), Gaps = 77/478 (16%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG---TGYLSAFP----TELFDS 218
VG ++ A++ + +A I+ K+ +V L + Q G YL P T L D+
Sbjct: 75 VGKWIEAASYALSHRRDADIEAKIEKIVDDLEKAQAPDGYLNCWYLQREPDKRWTNLRDN 134
Query: 219 FEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERH 278
E Y + +L G + ++ + L + +E + V++ ++
Sbjct: 135 HE--------LYNLGHLLEGGIAYFLATGRRRLLDI----LERYVEHVRETFGPNPGQKR 182
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLA-----------HLFDKPCFLGFLALQADY 327
Y ++E + L +LY +T + KHL LA H FD+ + + +
Sbjct: 183 GYCGHQE---IELALIKLYRLTGERKHLDLAAYFINERGRQPHYFDQEAVARGESPRDFW 239
Query: 328 LSHFHAN-THIPI-----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNAS 369
+ N +H P+ V+G +R E+ L + + D++N S
Sbjct: 240 AKSYEYNQSHRPVREQTKVVGHAVRAMYMFSAMADLAAELNDASLKQACEVLWADVMN-S 298
Query: 370 HSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYY 426
Y T G +A E + + L + + ETC + ++ ++ + + YAD
Sbjct: 299 KIYITSGLGPAAANEGFTEDYDLPND--TAYAETCASVALIFWAQRMLHLDLDGRYADVM 356
Query: 427 ERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSK 485
E+AL NG L+ + R E Y PL +R W +++ CC +
Sbjct: 357 EQALFNGALTGLSRDGEH--YFYSNPLDSDGRHSR----WA--WHTCPCCTMNSSRLIAS 408
Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
+G + + ++ IS++ +G+V L + W +R+ ++ E
Sbjct: 409 VG-GYFVSASDDAIAFHLYGGISTNIRLATGNVSLRET--SAYPWSGSVRIAVSPDEPAE 465
Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPL 601
++ L +P W S A AS+NG+ + + +LS W D + ++LP+
Sbjct: 466 F----TVKLHIPGWAQS--ATASVNGEPVDVKRGIEAGYLSIKRMWREGDTIALELPM 517
>gi|329930292|ref|ZP_08283894.1| hypothetical protein HMPREF9412_2909 [Paenibacillus sp. HGF5]
gi|328935161|gb|EGG31645.1| hypothetical protein HMPREF9412_2909 [Paenibacillus sp. HGF5]
Length = 626
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 65/300 (21%), Positives = 120/300 (40%), Gaps = 28/300 (9%)
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
YE+ G+P+ + +D + H A G S E+ L+ T S+ E C
Sbjct: 237 YELNGNPVERESVHRGIDSLMTYHGQAHGMFSGDEW------LSGTHPSQGVELCAVVEY 290
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGV--------LSIQRGTEPGVMIYMLPLGRGVSK 458
+ L R E + D E+ N + S Q + MI + R S
Sbjct: 291 MFSMEQLTRIFGEGRFGDILEKVAFNALPAAISADWTSHQYDQQVNQMICNV-APRAWSN 349
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
+ + +G + N F CC + + KL ++ +++ + GL + Y + G
Sbjct: 350 SPDANVFGLEPN-FGCCTANMHQGWPKLASHLWMKDQED--GLVAVSYAPCTVRTTVGRQ 406
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
++ +V+ + R+ + S E + ++LR+P W + +LNG+ LP+
Sbjct: 407 GVSAEVEVTGEYPFKDRVQIHLSL--ERAESFPISLRIPAWC--DHPVITLNGRELPIQA 462
Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDI 638
+ + W D L + LP+ ++TE+ R YA+ +I GP + W +
Sbjct: 463 ESGYAKIVQTWQSGDLLELYLPMEVKTES----RSMYAT--SITRGPLVYVLPVKENWQM 516
>gi|418401306|ref|ZP_12974836.1| hypothetical protein SM0020_14414 [Sinorhizobium meliloti
CCNWSX0020]
gi|359504683|gb|EHK77215.1| hypothetical protein SM0020_14414 [Sinorhizobium meliloti
CCNWSX0020]
Length = 640
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 104/248 (41%), Gaps = 33/248 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
ETC + ++ + + + YAD E+AL NG L PG+ I Y PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFFYDNPL 386
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
S R H W K++ CC + +G +Y E + +++ ++
Sbjct: 387 E---STGRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESTARLK 439
Query: 513 WKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
SG V L Q+ + W+ + F++K + L+LR+P W + GA S+NG
Sbjct: 440 LASGAEVELRQETN--YPWE----GAIAFTTKLDRPAKFELSLRIPEW--AAGATLSVNG 491
Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
L L G + WS D++ + LPL+LR + + A++ GP Y
Sbjct: 492 TMLDLSAHLTGGYARIEREWSDGDRVALYLPLALRPQYANPKVRQDVGRVALMRGPLVYC 551
Query: 628 LAGHTSGE 635
+ +GE
Sbjct: 552 VEATDNGE 559
>gi|417432692|ref|ZP_12161408.1| secreted protein [Salmonella enterica subsp. enterica serovar
Mississippi str. A4-633]
gi|353614176|gb|EHC66091.1| secreted protein [Salmonella enterica subsp. enterica serovar
Mississippi str. A4-633]
Length = 352
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 93/253 (36%), Gaps = 34/253 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERAL N VL + Y+ P+
Sbjct: 35 ESCASIGLMMFARQMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPM------ 87
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H KFN + CC + +G IY LYI
Sbjct: 88 --EVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYIYTPR---ADALYIN 142
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
Y+ +S + + L ++ W + ++ + S Q V +L LR+P W
Sbjct: 143 MYVGNSLEVPVENGALKLRISGNYPW--HEQVKIAIDSVQPVRH--TLALRLPDWCPE-- 196
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
A+ +LNG + +L W D +++ LP+ +R A AI G
Sbjct: 197 AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTISLTLPMPVRRVYGNPLARHVAGKVAIQRG 256
Query: 625 P--YLLAGHTSGE 635
P Y L +GE
Sbjct: 257 PLVYCLEQADNGE 269
>gi|224537081|ref|ZP_03677620.1| hypothetical protein BACCELL_01958 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521308|gb|EEF90413.1| hypothetical protein BACCELL_01958 [Bacteroides cellulosilyticus
DSM 14838]
Length = 801
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 134/348 (38%), Gaps = 59/348 (16%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T D K+L A F D+ G+ + +Y + H P+V +G +R
Sbjct: 222 LAKLYLVTGDQKYLDQAKFFLDQ---RGYTSRTDEY-----SQAHKPVVQQDEAVGHAVR 273
Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADT 392
+TGD Y D + Y TGG T+A E + L +
Sbjct: 274 AAYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATAAGEAFGKNYELPNM 333
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
S ETC + V+ LF E Y D ER L NG++S + G Y PL
Sbjct: 334 --SAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL 390
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
++ H F CC L IY ++ +V Y+ ++S++ D
Sbjct: 391 -----ESMGQHQRQPWFGCA-CCPSNICRFIPSLPGYIYAVKDKDV---YVNLFMSNTSD 441
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TY 561
K G ++ + W+ + + + +K GQ +L +R+P W TY
Sbjct: 442 LKVGGKAVSIEQTTKYPWNGDITIGI---NKNNAGQF-NLKVRIPGWVRGQVVPSDLYTY 497
Query: 562 SNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
S+G + +NG+ + + RW DK+ + + RT
Sbjct: 498 SDGKRLKYTVKVNGEAVQNELKDGYFCIDRRWKKGDKVEVHFDMEPRT 545
>gi|423109493|ref|ZP_17097188.1| hypothetical protein HMPREF9687_02739 [Klebsiella oxytoca 10-5243]
gi|376382227|gb|EHS94961.1| hypothetical protein HMPREF9687_02739 [Klebsiella oxytoca 10-5243]
Length = 655
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 50/212 (23%), Positives = 80/212 (37%), Gaps = 16/212 (7%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERA N VL + Y+ PL
Sbjct: 339 ESCASIGLMMFARRMLEMEGDAHYADVMERAFYNTVLG-GMALDGKHFFYVNPLETYPKS 397
Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
H + W CC + +G ++ L+I Y S
Sbjct: 398 IPHNHIYDHIKPVRQRWFGCACCPPNIARTLVAIGHYLFTPRRD---ALFINFYAGSEAQ 454
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
+ L K+ WD + +TFS Q V +L LR+P W Q +NG+
Sbjct: 455 FTINDQPLALKISGNYPWDE--EVNITFSHPQAVQH--TLALRLPEW--CEAPQVLINGE 508
Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+L T +W D +T++LP++LR
Sbjct: 509 AAQGEQLKGYLHITRQWQQGDIITLRLPMTLR 540
>gi|294673046|ref|YP_003573662.1| hypothetical protein PRU_0271 [Prevotella ruminicola 23]
gi|294472095|gb|ADE81484.1| conserved hypothetical protein [Prevotella ruminicola 23]
Length = 774
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 95/441 (21%), Positives = 161/441 (36%), Gaps = 96/441 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLF---DKPCFLGFLALQADYLSHFHANTHIPI-----VIGS 343
L +LY +T + K+L A F C G + ++ H+PI ++G
Sbjct: 187 ALCKLYKVTGNKKYLEGAKYFVDETGRCTDGHRPSE-------YSQDHMPILQQQEIVGH 239
Query: 344 QMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
+R +TGD Y+ + +++ + TGG +R P+ +
Sbjct: 240 AVRAGYLYSGVADVAALTGDKAYQEALERIWENMSSKKLFITGGIGSR-----PQ--GEG 292
Query: 393 LGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
G + E ETC + + +F T E Y D ERAL N VLS G
Sbjct: 293 FGPDYELNNHTAYCETCAAIANVYWNYRMFLATGESKYIDVCERALYNNVLS-------G 345
Query: 445 VMI------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
V + Y PL R K+ CC G + + IY +
Sbjct: 346 VSLSGDKFFYDNPLESDGEHERQ------KWFGCACCPGNITRFVASVPGYIYARQ---- 395
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
G I + + K G++ L Q D WD +R+ +T S G+ ++ LR+P
Sbjct: 396 -GKDIFVNLYAQGKAKIGNIELEQTTD--YPWDGKIRIKVTKGS----GKF-AIKLRVPS 447
Query: 559 W-----------TYSNGAQ---ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
W Y + A+ S+NG+ L P +++ + W D + + P+ +R
Sbjct: 448 WLKTSPTNNDLYQYQDKAKTYSVSVNGKAL-YPENRDYIEISRSWKKGDTIELDFPMDVR 506
Query: 605 TEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTF 664
D+ + A GP + + + D K L + +P+ F L+
Sbjct: 507 RIVANDNAEDDRGKVAFERGPIVFCLEGADQTDHKVFNKYILDS--APVSAHFEQDLL-- 562
Query: 665 TQESGNSTFVMSNSNQSITME 685
N V+ S + + +
Sbjct: 563 -----NGVMVLEGSAKELQQD 578
>gi|269839244|ref|YP_003323936.1| hypothetical protein Tter_2215 [Thermobaculum terrenum ATCC
BAA-798]
gi|269790974|gb|ACZ43114.1| protein of unknown function DUF1680 [Thermobaculum terrenum ATCC
BAA-798]
Length = 638
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 81/367 (22%), Positives = 131/367 (35%), Gaps = 60/367 (16%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR- 346
L LY T + ++L A F G L + + H+P ++G +R
Sbjct: 204 LVELYRATGNERYLEQAKYFLDVRGQGLLGRAWGHFGPEYHQDHVPFREMREIVGHAVRA 263
Query: 347 ----------YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
Y TGD + + Y TGG +R + G E
Sbjct: 264 VYLNAGAADIYAETGDEAIMRALERLWENMTTKKMYVTGGIGSR-------YEGEAFGKE 316
Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI- 447
E ETC + + + T + YAD E L N VL PG+ +
Sbjct: 317 YELPNARAYAETCAAIGSVMWNWRMLLLTADARYADLIEHTLYNAVL-------PGISLD 369
Query: 448 -----YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
Y PL + TH F CC + + LG Y + ++
Sbjct: 370 GALYFYQNPL-----EDEGTHRRQEWFGCA-CCPPNVARTLASLGGYFYSTSRDGI-WVH 422
Query: 503 IIQYISSSFDWKSGH-VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
+ + + G V+L+Q S + +R+ E G+L + LR+P W
Sbjct: 423 LYSEGRAKLGLQDGREVLLSQHTSYPWSGEVAIRL----EQVPEEGELG-IYLRIPSWC- 476
Query: 562 SNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
+ ++NG++ P PG +L W D++ ++LP+++R E A A
Sbjct: 477 -ERGEVAINGEDAATPITPGTYLELRRTWRAGDEVRLRLPMTVRRLEAHPYLSEDAGRVA 535
Query: 621 ILFGPYL 627
I+ GP L
Sbjct: 536 IMRGPIL 542
>gi|334320143|ref|YP_004556772.1| hypothetical protein [Sinorhizobium meliloti AK83]
gi|407722785|ref|YP_006842446.1| hypothetical protein BN406_05164 [Sinorhizobium meliloti Rm41]
gi|334097882|gb|AEG55892.1| protein of unknown function DUF1680 [Sinorhizobium meliloti AK83]
gi|407322845|emb|CCM71446.1| hypothetical protein BN406_05164 [Sinorhizobium meliloti Rm41]
Length = 640
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 105/248 (42%), Gaps = 33/248 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
ETC + ++ + + + YAD E+AL NG L PG+ I Y PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFFYDNPL 386
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
S R H W K++ CC + +G +Y E + +++ ++
Sbjct: 387 E---STGRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESTARLK 439
Query: 513 WKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
SG V L Q+ + W+ + F++K + +L+LR+P W + GA S+NG
Sbjct: 440 LASGAEVELRQETN--YPWE----GAIAFATKLDRPAKFALSLRIPEW--AAGATLSVNG 491
Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
L L G + WS D++ + LPL++R + + A++ GP Y
Sbjct: 492 TMLDLSAHLTGGYARIEREWSDGDRVALYLPLAIRPQYANPKVRQDVGRVALMRGPLVYC 551
Query: 628 LAGHTSGE 635
+ +GE
Sbjct: 552 VEATDNGE 559
>gi|402306264|ref|ZP_10825315.1| putative glycosyhydrolase [Prevotella sp. MSX73]
gi|400380031|gb|EJP32860.1| putative glycosyhydrolase [Prevotella sp. MSX73]
Length = 825
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 83/360 (23%), Positives = 141/360 (39%), Gaps = 77/360 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T + K+L A F + G A++ +Y + +H+P++ +G +R
Sbjct: 226 ALCKLYLVTGNRKYLNEAKFFLD--YRGKTAVRQEY-----SQSHLPVLEQSEAVGHAVR 278
Query: 347 YE-----------VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
+TGD Y I + +IV Y TGG A + G
Sbjct: 279 AAYMYAGMADVAALTGDTAYIHAIDRIWNNIV-GRKLYITGGIGA-------TNNGEAFG 330
Query: 395 SENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
++ E ETC + V+ LF E Y D ER L NG++S + G
Sbjct: 331 ADYELPNMSAYAETCAAIGNVYVNYRLFLLHGESKYFDVLERTLYNGLIS-GVSMDGGGF 389
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
Y PL ++R H F CC L +Y ++ NV Y+ +
Sbjct: 390 FYPNPL-----ESRGQHQRQAWFGCA-CCPSNICRFLPSLPGYVYAVKDRNV---YVNLF 440
Query: 507 ISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT---- 560
+SS S + V L+Q+ W+ + +T+ + G +L +R+P W
Sbjct: 441 LSSSASLEVAGKRVALSQQTQ--YPWNGDIALTV---DENRAGAF-ALKIRIPGWVKGQP 494
Query: 561 -------YSNGAQA----SLNGQNLPLP----PPGNFLSATERWSYNDKLTIQLPLSLRT 605
YS+G + ++NG+ L P + + +W D+++I + +RT
Sbjct: 495 VPSDLYEYSDGKRTGYTIAVNGRRLTATDINFSPDGYCTIARKWKKGDRVSIHFDMEVRT 554
>gi|16265291|ref|NP_438083.1| hypothetical protein SM_b20631 [Sinorhizobium meliloti 1021]
gi|15141431|emb|CAC49943.1| conserved hypothetical protein [Sinorhizobium meliloti 1021]
Length = 640
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 105/248 (42%), Gaps = 33/248 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
ETC + ++ + + + YAD E+AL NG L PG+ I Y PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFFYDNPL 386
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
S R H W K++ CC + +G +Y E + +++ ++
Sbjct: 387 E---STGRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESTARLK 439
Query: 513 WKSG-HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
SG V L Q+ + W+ + F++K + +L+LR+P W + GA S+NG
Sbjct: 440 LASGAEVELRQETN--YPWE----GAIAFATKLDRPAKFALSLRIPEW--AAGATLSVNG 491
Query: 572 QNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
L L G + WS D++ + LPL++R + + A++ GP Y
Sbjct: 492 TMLDLSAHLTGGYARIEREWSDGDRVALYLPLAIRPQYANPKVRQDVGRVALMRGPLVYC 551
Query: 628 LAGHTSGE 635
+ +GE
Sbjct: 552 VEATDNGE 559
>gi|288927800|ref|ZP_06421647.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
F0108]
gi|288330634|gb|EFC69218.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
F0108]
Length = 623
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 60/284 (21%), Positives = 107/284 (37%), Gaps = 20/284 (7%)
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
Y +TG+ Y +N + TG ++ E W+ K L +ETC T
Sbjct: 266 YRLTGNTEYLSAVEQVWQNINDTEINITGSGASMESWFGGKHLQYMPIRHFQETCVTATW 325
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
+K+SR L T YAD E + N +L R T+ PL G G
Sbjct: 326 IKLSRQLLLLTGNTKYADAVEISFYNALLGAMR-TDASDWAKYTPLSGQRLPGSEQCGMG 384
Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS-GHVVLNQKVD 525
CC +G + + + G+ + YI+ + + H + K++
Sbjct: 385 LN-----CCNASGPRGLFVIPQTAVLT---SAKGVDVNLYIAGDYKLTTPRHQQMVLKLE 436
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSA 585
+ + L+ + + ++ LR+P W S + +N + G ++
Sbjct: 437 GEYPKNNKMSFLLSLKKAENI----TIRLRIPEW--STATKVIVNDVAVEHVQAGKYMEL 490
Query: 586 TERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
+ W + D+++I+ + + PEY AI GP +LA
Sbjct: 491 SRTWHHGDRISIEFDMPGIVHRL-GQHPEYV---AITRGPIVLA 530
>gi|295098715|emb|CBK87805.1| Uncharacterized protein conserved in bacteria [Enterobacter cloacae
subsp. cloacae NCTC 9394]
Length = 657
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 91/251 (36%), Gaps = 39/251 (15%)
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADGHYADVME 370
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H FN +
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIYDHVKPVRQRWFGCA 421
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + LG IY L I Y+ + G +L ++ W
Sbjct: 422 CCPPNIARVLTSLGHYIYTVRPD---ALLINLYVGNDVAIPVGDNILQLRISGNYPWHEQ 478
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
+++ +T V + +L LR+P W SLNGQ + +L W D
Sbjct: 479 VKIEIT----SPVPVIHTLALRLPDWCAEPA--VSLNGQAITGEVSRGYLYLNRSWQEGD 532
Query: 594 KLTIQLPLSLR 604
LT+ LP+ +R
Sbjct: 533 TLTLTLPMPVR 543
>gi|270295052|ref|ZP_06201253.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270274299|gb|EFA20160.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 688
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 114/545 (20%), Positives = 202/545 (37%), Gaps = 70/545 (12%)
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
W P + KIL QY A N + ++ +M +YF R Q HW S E
Sbjct: 171 WWPRMVVLKILQ----QYYSATNDK--RVVAFMTKYF--RYQLNTLPQKPLGHWSSWAEF 222
Query: 286 TGGMN-DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
N +Y LY++T + L L HL + F F+ + + P I
Sbjct: 223 RACDNLQAVYWLYNLTGEDFLLELGHLLHRQSF-SFIDMVD------RGDLRRPCTIHCV 275
Query: 345 MRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK-------RLADTLGSEN 397
+ +P+ + ++A G R F P+ L ++
Sbjct: 276 NLAQGIKEPIIYYLQDTDRKYIDAVKE---GFRDIRRFHGQPQGMYGGDEALHGNNPTQG 332
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYER--------ALTNGVLSIQRGTEPG-VMIY 448
E C+ ++ + T +I +AD+ ER +++ ++ Q +P VM+
Sbjct: 333 SELCSAVELMYSLEKMVEITGDIDFADHLERIAFNALPAQISDDFMTKQYFQQPNQVMVT 392
Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
+ +GT + CC+ + + K +++ N G+ I Y
Sbjct: 393 RHRRNFDQDHEGTDLAFGT-LTGYPCCFSNMHQGWPKFTQHLWYATPDN--GIAAIVYSP 449
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRM--TLTFSSKQ---EVGQLS-SLNLRMPVWTYS 562
S G V ++S D Y M +TF+ K+ +V Q+ +LR+P W
Sbjct: 450 SEVTANVG-----DNVPVVISEDTYYPMDHQITFTIKEVRNKVKQVKFPFHLRVPKWC-- 502
Query: 563 NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAIL 622
A+ +NG+ G W NDK+ + LP+ + T Y + +I
Sbjct: 503 KQAEIRVNGKMEQTVKGGKIAIVDRIWKRNDKIELYLPMEVFTSTW------YENAVSIE 556
Query: 623 FGPYLLAGHTSGEWDIKTGTARSLSALISPIPPS--FNAQLVTFTQESGNSTFVMSNSNQ 680
GP + A W+ K + + S +N LV F + N +S ++Q
Sbjct: 557 RGPLVYALKMEENWEKKEFKDSWYGSYYYQVTSSDPWNYGLVDFDRNRMNEVAQVSINSQ 616
Query: 681 SITMEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKE 740
+ +FP + +A + + L + ++ N + G +PF F G +G E
Sbjct: 617 KQQL-DFPWNQENAPVEIKMKARL----IPTWTVYNEMAGP----QPFSFCGSA--EGGE 665
Query: 741 DELVV 745
E+ +
Sbjct: 666 QEITL 670
>gi|255012840|ref|ZP_05284966.1| hypothetical protein B2_02969 [Bacteroides sp. 2_1_7]
gi|410102232|ref|ZP_11297159.1| hypothetical protein HMPREF0999_00931 [Parabacteroides sp. D25]
gi|409238954|gb|EKN31742.1| hypothetical protein HMPREF0999_00931 [Parabacteroides sp. D25]
Length = 618
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 53/231 (22%), Positives = 99/231 (42%), Gaps = 25/231 (10%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPL-GRGV 456
ETC + M+ ++ + + T + Y D ER+L NG L+ I G + Y+ PL +G
Sbjct: 336 ETCASVGMVLWNQRMNQLTGDSKYIDVLERSLYNGALAGISLGGDR--FFYVNPLESKGD 393
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
+ +G CC +G+ IY + L++ YI ++ + G
Sbjct: 394 HHRQEWYGCA-------CCPSQLSRFLPSIGNYIYASSDD---ALWVNLYIGNTGQIRIG 443
Query: 517 H--VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
++L Q+ D WD +++T++ S E + LR+P W + S+NG+ +
Sbjct: 444 ETDILLTQETD--YPWDGSVKLTISTSQPLE----KEIRLRIPDWCKT--YDLSINGKRI 495
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+P + + + W D + + + + + A E +AI GP
Sbjct: 496 NVPKEKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFDKRAIQRGP 545
>gi|160887789|ref|ZP_02068792.1| hypothetical protein BACUNI_00192 [Bacteroides uniformis ATCC 8492]
gi|423304369|ref|ZP_17282368.1| hypothetical protein HMPREF1072_01308 [Bacteroides uniformis
CL03T00C23]
gi|423310517|ref|ZP_17288501.1| hypothetical protein HMPREF1073_03251 [Bacteroides uniformis
CL03T12C37]
gi|156862731|gb|EDO56162.1| hypothetical protein BACUNI_00192 [Bacteroides uniformis ATCC 8492]
gi|392681688|gb|EIY75045.1| hypothetical protein HMPREF1073_03251 [Bacteroides uniformis
CL03T12C37]
gi|392684698|gb|EIY78021.1| hypothetical protein HMPREF1072_01308 [Bacteroides uniformis
CL03T00C23]
Length = 688
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 116/542 (21%), Positives = 201/542 (37%), Gaps = 64/542 (11%)
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
W P + KIL QY A N + ++ +M +YF R Q HW S E
Sbjct: 171 WWPRMVVLKILQ----QYYSATNDK--RVVAFMTKYF--RYQLNTLPQKPLGHWSSWAEF 222
Query: 286 TGGMN-DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQ 344
N +Y LY++T + L L HL + F + L + + G +
Sbjct: 223 RACDNLQAVYWLYNLTGEDFLLELGHLLHRQSFSFIDMVDRGDLRRPCTIHCVNLAQGIK 282
Query: 345 ---MRYEVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEET 400
+ Y+ D Y + F DI H G E L ++ E
Sbjct: 283 EPIIYYQQDTDRKYIDAVKEGFRDI-RRFHGQPQGMYGGDE------ALHGNNPTQGSEL 335
Query: 401 CTTYNMLKVSRHLFRWTKEIAYADYYER--------ALTNGVLSIQRGTEPG-VMIYMLP 451
C+ ++ + T +I +AD+ ER +++ ++ Q +P VM+
Sbjct: 336 CSAVELMYSLEKMVEITGDIDFADHLERIAFNALPAQISDDFMTKQYFQQPNQVMVTRHR 395
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
+ +GT + CC+ + + K +++ N G+ I Y S
Sbjct: 396 RNFDQDHEGTDLAFGT-LTGYPCCFSNMHQGWPKFTQHLWYATPDN--GIAAIVYSPSEV 452
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRM--TLTFSSKQ---EVGQLS-SLNLRMPVWTYSNGA 565
G V ++S D Y M +TF+ K+ +V Q+ +LR+P W A
Sbjct: 453 TANVG-----DNVPVVISEDTYYPMDHQITFTIKEVRNKVKQVKFPFHLRVPKWC--KQA 505
Query: 566 QASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+ +NG+ G W NDK+ + LP+ + T Y + +I GP
Sbjct: 506 EIRVNGKMEQTVKGGKIAIVDRIWKRNDKIELYLPMEVFTSTW------YENAVSIERGP 559
Query: 626 YLLAGHTSGEWDIKTGTARSLSALISPIPPS--FNAQLVTFTQESGNSTFVMSNSNQSIT 683
+ A W+ K + + S +N LV F + N +S ++Q
Sbjct: 560 LVYALKMEENWEKKEFKDSWYGSYYYQVTSSDPWNYGLVDFDRNRMNEVAQVSINSQKQQ 619
Query: 684 MEEFPVSGTDAALHATFRLILKDASLSNFSSLNNVIGKSVMLEPFDFPGMLVQQGKEDEL 743
+ +FP + +A + + L + ++ N + G +PF F G +G E E+
Sbjct: 620 L-DFPWNQENAPVEIKMKARL----IPTWTVYNEMAGP----QPFSFCGSA--EGGEQEI 668
Query: 744 VV 745
+
Sbjct: 669 TL 670
>gi|227509160|ref|ZP_03939209.1| conserved hypothetical protein, partial [Lactobacillus brevis
subsp. gravesensis ATCC 27305]
gi|227191367|gb|EEI71434.1| conserved hypothetical protein [Lactobacillus brevis subsp.
gravesensis ATCC 27305]
Length = 106
Score = 49.3 bits (116), Expect = 0.010, Method: Composition-based stats.
Identities = 34/100 (34%), Positives = 45/100 (45%), Gaps = 17/100 (17%)
Query: 161 LRGHFVGHYLSASAQMWASTHN----ATIKEKMSTVVFSLSECQNKIG------TGYLSA 210
RGHF GHYLSA +Q S + + + K+ + L Q GY+SA
Sbjct: 1 FRGHFFGHYLSALSQAIDSVSDDDTRSQLLSKLRIGIEGLFRAQQAYAKSHPQSAGYVSA 60
Query: 211 FPTELFDSFEALK-------PVWAPYYTIHKILAGLLDQY 243
F D E + V P+Y +HKILAGL+D Y
Sbjct: 61 FREVALDEVEGKRVPESEKENVIVPWYNLHKILAGLIDGY 100
>gi|423115429|ref|ZP_17103120.1| hypothetical protein HMPREF9689_03177 [Klebsiella oxytoca 10-5245]
gi|376381515|gb|EHS94252.1| hypothetical protein HMPREF9689_03177 [Klebsiella oxytoca 10-5245]
Length = 655
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 49/212 (23%), Positives = 80/212 (37%), Gaps = 16/212 (7%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ +R + + YAD ERA N VL + Y+ PL
Sbjct: 339 ESCASIGLMMFARRMLEMEGDAHYADVMERAFYNTVLG-GMALDGKHFFYVNPLETYPKS 397
Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
H + W CC + +G ++ L+I Y S
Sbjct: 398 IPHNHIYDHIKPVRQRWFGCACCPPNIARTLVAIGHYLFTPRRD---ALFINFYAGSEAQ 454
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
+ L K+ WD + +TFS Q + +L LR+P W Q +NG+
Sbjct: 455 FTINDQPLALKISGNYPWDE--EVNITFSHPQAIQH--TLALRLPEW--CEAPQVLINGE 508
Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+L T +W D +T++LP++LR
Sbjct: 509 AAQGEQLKGYLHITRQWQQGDIITLRLPMTLR 540
>gi|298386781|ref|ZP_06996336.1| hypothetical protein HMPREF9007_03534 [Bacteroides sp. 1_1_14]
gi|298260455|gb|EFI03324.1| hypothetical protein HMPREF9007_03534 [Bacteroides sp. 1_1_14]
Length = 668
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 83/356 (23%), Positives = 136/356 (38%), Gaps = 78/356 (21%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
L +LY +T D K+L A F G+ + + Y + H P+V +G +R
Sbjct: 219 LVKLYMVTGDKKYLDQAKFFLDT--RGYTSRKDAY-----SQAHKPVVEQDEAVGHAVRA 271
Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+TGD Y K I + +IV + Y TGG AR + G+
Sbjct: 272 VYMYSGMADVAAITGDSSYIKAIDKIWDNIV-SKKIYITGGIGARH-------AGEAFGN 323
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + ++ LF + Y D ER L NG++S + G
Sbjct: 324 NYELPNQSAYCETCAAIGNVYMNYRLFLLHGDAKYFDVLERTLYNGLIS-GVSLDGGSFF 382
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQ 505
Y PL S++G ++ F C C + + F L +Y + V Y+
Sbjct: 383 YPNPL--------SSNGKYSRKPWFGCACCPSNVSRFIPSLPGYVYAVKNDQV---YVNL 431
Query: 506 YISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
Y+S+ + K ++L Q+ W+ +R+ +T + Q ++ LR+P W N
Sbjct: 432 YLSNKAELKVDKKKILLEQETG--YPWNGDIRLKIT-----QGNQDFTMKLRIPGWVRGN 484
Query: 564 ---------------GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
Q S+NGQ + +LS +W D + + + R
Sbjct: 485 VLPSDLYSYADNQKPAYQVSVNGQTVESDVNDGYLSIARKWKKGDVVEVHFDMIPR 540
>gi|433678396|ref|ZP_20510262.1| hypothetical protein BN444_02464 [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430816487|emb|CCP40741.1| hypothetical protein BN444_02464 [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 664
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 46/213 (21%), Positives = 80/213 (37%), Gaps = 18/213 (8%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ + + + + YAD ERAL N VL+ + Y+ PL
Sbjct: 339 ESCASIGLMMFANRMLQLAPDSRYADVMERALYNTVLA-GMALDGRHFFYVNPLEVHPPT 397
Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
HG+ W CC + LG +Y + LY+ Y+ S
Sbjct: 398 VHGNHGFDHVKPVRQRWFGCACCPPNIARVLTSLGHYLYTRRDDT---LYVNLYVGSDAA 454
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
+ G L + W + +++ + E ++L LR+P W Q LNG+
Sbjct: 455 FDVGGQTLTLRQRGEYPWQEQVELSVDCDAPVE----AALALRLPDWC--RAPQLRLNGE 508
Query: 573 NLPLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
+ + + RW D L + LP+ +
Sbjct: 509 AVAIAAHLQHGYCVLRRRWQRGDTLHLHLPMPV 541
>gi|423288216|ref|ZP_17267067.1| hypothetical protein HMPREF1069_02110 [Bacteroides ovatus
CL02T12C04]
gi|392671105|gb|EIY64581.1| hypothetical protein HMPREF1069_02110 [Bacteroides ovatus
CL02T12C04]
Length = 666
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 60/236 (25%), Positives = 97/236 (41%), Gaps = 26/236 (11%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC T+ S LF T Y D E+A N + S+ G + Y L R K
Sbjct: 353 ETCATFYGAYYSWRLFMLTGNPMYLDVMEKAFYNNLSSM--GLDGKSYFYTNVL-RWYGK 409
Query: 459 AR-----STHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
H T+ + CC + + ++ D Y ++E + L++ Y S+ D
Sbjct: 410 QHPLLSLDFHQRWTEECTCVCCPTSLVRFLAETKDYAYAKDENS---LFVTLYGSNEIDT 466
Query: 514 K-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
K +G V ++V WD + M E SL LR+P W GA +NG
Sbjct: 467 KINGKNVRFEQVTNY-PWDDKIEMNYKGDKNAEF----SLKLRIPAWAI--GATLKVNGI 519
Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ---AILFGP 625
++P+ G F +W DK+ + LP+ + + P+ ++ A+ +GP
Sbjct: 520 DMPI-NTGVFAVVNRKWKSGDKVELVLPMK---PILNEGNPKVEEVRNQLAVSYGP 571
>gi|440731554|ref|ZP_20911563.1| hypothetical protein A989_09226 [Xanthomonas translucens DAR61454]
gi|440372448|gb|ELQ09250.1| hypothetical protein A989_09226 [Xanthomonas translucens DAR61454]
Length = 664
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/211 (21%), Positives = 79/211 (37%), Gaps = 18/211 (8%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ + + + + YAD ERAL N VL+ + Y+ PL
Sbjct: 339 ESCASIGLMMFANRMLQLAPDSRYADVMERALYNTVLA-GMALDGRHFFYVNPLEVHPPT 397
Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
HG+ W CC + LG +Y + LY+ Y+ S
Sbjct: 398 VHGNHGFDHVKPVRQRWFGCACCPPNIARVLTSLGHYLYTRRDDT---LYVNLYVGSDAA 454
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
+ G L + W + +++ + E ++L LR+P W Q LNG+
Sbjct: 455 FDVGGQTLTLRQRGEYPWQEQVELSVDCDAPVE----AALALRLPDWC--RAPQLRLNGE 508
Query: 573 NLPLPP--PGNFLSATERWSYNDKLTIQLPL 601
+ + + RW D L + LP+
Sbjct: 509 AVAIAAHLQHGYCVLRRRWQRGDTLHLHLPM 539
>gi|354604714|ref|ZP_09022703.1| hypothetical protein HMPREF9450_01618 [Alistipes indistinctus YIT
12060]
gi|353347293|gb|EHB91569.1| hypothetical protein HMPREF9450_01618 [Alistipes indistinctus YIT
12060]
Length = 623
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 94/408 (23%), Positives = 164/408 (40%), Gaps = 57/408 (13%)
Query: 229 YYTIHKILAGLLDQYVLADNAQAL-KMATWMVEYFYNRVQKVITMYSVERHWYSLNEETG 287
Y H I AG+ Y+LA + L +++T MV + N +RHW +EE
Sbjct: 160 YCAGHMIEAGI--AYLLATGDRTLLEVSTRMVGHMMNEFG------PGKRHWVPGHEE-- 209
Query: 288 GMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIG 342
+ L +LYS+T +PK+L A + G+ + + + IP+ + G
Sbjct: 210 -IELALAKLYSVTGEPKYLEFARWLLEERGHGYGRNEEGTWNAAYYQDSIPVSRMTDITG 268
Query: 343 SQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSA---REFWWDPKR 388
+R ++GD +Y+ D V + Y TGG + E + +
Sbjct: 269 HAVRCMYLFCGMADMSMLSGDTVYRAALDRVWDDVVQRNMYITGGIGSSHQNEGFTEDYD 328
Query: 389 LADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
L + ETC + M+ + + R + YAD ERAL NG L+ + Y
Sbjct: 329 LPNL--EAYCETCASVGMVLWNARMNRLKGDAKYADVMERALYNGALA-GISLDGKRFFY 385
Query: 449 MLPL-GRGVSKARSTHGWG---TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
+ PL +G ++ +G ++ + F G+ I S S D+++ LY+
Sbjct: 386 VNPLESKGDHHRKAWYGCACCPSQLSRFLPSIGSYIYSHSLDSDTVWVN-------LYLG 438
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL-SSLNLRMPVWTYSN 563
+ S V+ P W+ R+T++ + G++ L LR+P W ++
Sbjct: 439 SNAAIPTQDGSRFVLTQTTRYP---WEGNARITVS----EAPGKIRKELRLRIPGWCKNH 491
Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDD 611
+NG+ P + W D+ I L L++ TE + D
Sbjct: 492 --TLWVNGELFDHPTDKGYAVVNRSWKKGDR--IDLSLAMPTEVVAAD 535
>gi|315607259|ref|ZP_07882259.1| conserved hypothetical protein [Prevotella buccae ATCC 33574]
gi|315250962|gb|EFU30951.1| conserved hypothetical protein [Prevotella buccae ATCC 33574]
Length = 825
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 82/360 (22%), Positives = 141/360 (39%), Gaps = 77/360 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T + K+L A F + G A++ +Y + +H+P++ +G +R
Sbjct: 226 ALCKLYLVTGNRKYLDEAKFFLD--YRGKTAIRQEY-----SQSHLPVLEQSEAVGHAVR 278
Query: 347 YE-----------VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
+TGD Y I + +IV Y TGG A + G
Sbjct: 279 AAYMYAGMADVAALTGDTAYIHAIDRIWNNIV-GRKLYITGGIGATNN-------GEAFG 330
Query: 395 SENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
++ E ETC + V+ LF E Y D ER L NG++S + G
Sbjct: 331 ADYELPNMSAYAETCAAIGNVYVNYRLFLLHGESKYFDVLERTLYNGLIS-GVSMDGGGF 389
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
Y PL ++R H F CC L +Y ++ NV Y+ +
Sbjct: 390 FYPNPL-----ESRGQHQRQAWFGCA-CCPSNICRFLPSLPGYVYAVKDRNV---YVNLF 440
Query: 507 I--SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT---- 560
+ S+S + V L+Q+ W+ + +T+ + G +L +R+P W
Sbjct: 441 LSNSASLEVAGKRVALSQQTQ--YPWNGDIALTV---DENRAGAF-ALKIRIPGWVKGQP 494
Query: 561 -------YSNGAQA----SLNGQNLPLP----PPGNFLSATERWSYNDKLTIQLPLSLRT 605
YS+G + ++NG+ L P + + +W D+++I + +RT
Sbjct: 495 VPSDLYEYSDGKRTGYTIAVNGRRLTATDINFSPDGYCTIVRKWKKGDRVSIHFDMEVRT 554
>gi|378763347|ref|YP_005191963.1| hypothetical protein SFHH103_05359 [Sinorhizobium fredii HH103]
gi|365182975|emb|CCE99824.1| hypothetical protein SFHH103_05359 [Sinorhizobium fredii HH103]
Length = 879
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 79/370 (21%), Positives = 150/370 (40%), Gaps = 53/370 (14%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL--SHFHANTHIPI 339
L +L +T + K+L L+ F +P F A++ DY+ +H ++ +H P+
Sbjct: 435 ALVKLARVTGETKYLDLSKFFIDERGREPHFFTEEAIRDGRSPKDYIQKTHEYSQSHEPV 494
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 495 RRQKKVVGHAVRAMYMYSGMADLATEYKDDTLTDALETLWDDLTT-KQMYVTGGIGPSAK 553
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + +AD E+AL NG LS
Sbjct: 554 NEGFTDCYDLPND--TAYAETCASVALVFWASRMLGRGPNRRFADIMEQALYNGALS-GL 610
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
+ Y PL H W K+++ CC + +G +Y +
Sbjct: 611 SLDGKTFFYDNPL----ESTGKHHRW--KWHNCPCCPPNIARLVASVGAYMYGVAAEEI- 663
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
+++ + + V L Q + WD + + L ++ +L+LR+P W
Sbjct: 664 AVHLYGESTVRLEVGGSDVTLQQVTN--YPWDGAVSIKLDLKEPRQF----ALSLRIPEW 717
Query: 560 TYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
++GA+ ++NG ++ L + +W+ D ++++LPL LR + + A
Sbjct: 718 --ADGARIAINGSSVDLDAVMTDGYARIERQWANGDAVSLELPLQLRPQYANPKVRQDAG 775
Query: 618 IQAILFGPYL 627
A++ GP +
Sbjct: 776 RVALMRGPLV 785
>gi|288925306|ref|ZP_06419241.1| cytoplasmic protein [Prevotella buccae D17]
gi|288338071|gb|EFC76422.1| cytoplasmic protein [Prevotella buccae D17]
Length = 825
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 82/360 (22%), Positives = 141/360 (39%), Gaps = 77/360 (21%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T + K+L A F + G A++ +Y + +H+P++ +G +R
Sbjct: 226 ALCKLYLVTGNRKYLDEAKFFLD--YRGKTAVRQEY-----SQSHLPVLKQSEAVGHAVR 278
Query: 347 YE-----------VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
+TGD Y I + +IV Y TGG A + G
Sbjct: 279 AAYMYAGMADVAALTGDTAYIHAIDRIWNNIV-GRKLYITGGIGATNN-------GEAFG 330
Query: 395 SENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
++ E ETC + V+ LF E Y D ER L NG++S + G
Sbjct: 331 ADYELPNMSAYAETCAAIGNVYVNYRLFLLHGESKYFDVLERTLYNGLIS-GVSMDGGGF 389
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
Y PL ++R H F CC L +Y ++ NV Y+ +
Sbjct: 390 FYPNPL-----ESRGQHQRQAWFGCA-CCPSNICRFLPSLPGYVYAVKDRNV---YVNLF 440
Query: 507 I--SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT---- 560
+ S+S + V L+Q+ W+ + +T+ + G +L +R+P W
Sbjct: 441 LSNSASLEVAGKRVALSQQTQ--YPWNGDIALTV---DENRAGAF-ALKIRIPGWVKGQP 494
Query: 561 -------YSNGAQA----SLNGQNLPLP----PPGNFLSATERWSYNDKLTIQLPLSLRT 605
YS+G + ++NG+ L P + + +W D+++I + +RT
Sbjct: 495 VPSDLYEYSDGKRTGYTIAVNGRRLTATDINFSPDGYCTIARKWKKGDRVSIHFDMEVRT 554
>gi|256394126|ref|YP_003115690.1| hypothetical protein Caci_4989 [Catenulispora acidiphila DSM 44928]
gi|256360352|gb|ACU73849.1| protein of unknown function DUF1680 [Catenulispora acidiphila DSM
44928]
Length = 647
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 83/369 (22%), Positives = 134/369 (36%), Gaps = 59/369 (15%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFL---ALQADYLSHFHANTHIPI-----VIGS 343
L LY T + ++L LA F G L A + + H+P+ V G
Sbjct: 202 ALVELYRETGEQRYLDLAAYFVDRRGHGLLNPEATRGTAAGPAYCQDHLPVREANAVAGH 261
Query: 344 QMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSA---REFWWDPKRL 389
+R TGD + + A ++ TGG A E + DP L
Sbjct: 262 AVRQLYFLAGVTDLAVETGDASLRAAAERLWTEMAARKTHITGGLGAHHAEEDFGDPYEL 321
Query: 390 ADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI-- 447
+ ETC ++ + + T E Y+D ER L N VL PGV +
Sbjct: 322 PNE--RAYCETCAAIASVQWNWRMALLTGEAKYSDLAERTLYNAVL-------PGVSLDG 372
Query: 448 ----YMLPLGRGVSKARSTH-----GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
Y PL + R H G +++ C L ++ G+
Sbjct: 373 TRWFYANPL-----QVRDEHLDRHGDHGVSRKAWFRCACCPPNVMRLLASLPHYFVSGDA 427
Query: 499 PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
G+ + QY + S++ +G V +V+ W + +T+ E G +L+LR+P
Sbjct: 428 DGIQLHQYATGSYEAVAGTV----RVETGYPWSGGIAVTI------ERGGEWTLSLRVPG 477
Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASI 618
W +A +NG + P +L W D +++ L + +R A
Sbjct: 478 WCAD--VEAGVNGVAVDTVVPDGWLRIRRAWQPGDVVSLNLAMPIRLTAADPRVDAVRGC 535
Query: 619 QAILFGPYL 627
AI GP +
Sbjct: 536 AAIERGPLV 544
>gi|336427168|ref|ZP_08607172.1| hypothetical protein HMPREF0994_03178 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336010021|gb|EGN40008.1| hypothetical protein HMPREF0994_03178 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 687
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 80/363 (22%), Positives = 129/363 (35%), Gaps = 66/363 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQA------DYLSHFHANTHIPI- 339
L RLY +T + K+L L+ F KP + +A D + + H+P+
Sbjct: 225 ALVRLYEVTGEDKYLNLSRFFVDQRGTKPYYYDTEHPEAVKKGHEDEQRYSYNQAHLPVR 284
Query: 340 ----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSARE--- 381
+G +R +TGD D + Y TGG A
Sbjct: 285 EQDEAVGHAVRAVYLYSGMADVARLTGDEALLEACEKLWDNITQKKMYITGGIGATHMGE 344
Query: 382 ---FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
F +D S ETC + ++ +R + YAD E+AL NG+LS
Sbjct: 345 AFSFNYDLPN-----DSAYAETCASIGLVFFARRMLEIKASSKYADVMEKALYNGILS-G 398
Query: 439 RGTEPGVMIYMLPLGRGVSKARSTHGWGTKFN-----SFW----CCYGTGIESFSKLGDS 489
+ Y+ PL S + H KF+ W CC S +
Sbjct: 399 MALDGKSFFYVNPLE---SLPEACHKDERKFHVKPVRQKWFGCACCPPNIARLLSSIASY 455
Query: 490 IYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQL 549
Y E E LY+ Y+ S + G L+ ++ WD ++ ++++ V
Sbjct: 456 AYTEAED---ALYVHLYMGSVLEKDCGGKKLDIRISSDFPWDG--KVMAEINAEEPVA-- 508
Query: 550 SSLNLRMPVWTYS---NGAQASLNGQNLPLPP-----PGNFLSATERWSYNDKLTIQLPL 601
L R+P W S NG + G+ + +L W+ +KL + P+
Sbjct: 509 CRLAFRIPGWCSSYTLNGQKGLEEGETVTADGETRQVKDGYLIIDRVWNGGEKLELDFPM 568
Query: 602 SLR 604
+R
Sbjct: 569 EVR 571
>gi|315644006|ref|ZP_07897176.1| hypothetical protein PVOR_00550 [Paenibacillus vortex V453]
gi|315280381|gb|EFU43670.1| hypothetical protein PVOR_00550 [Paenibacillus vortex V453]
Length = 653
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 109/511 (21%), Positives = 196/511 (38%), Gaps = 79/511 (15%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPT--ELFDSFEALK 223
V +L A+A A + ++E++ ++ ++ Q GYL+ + T E + L
Sbjct: 79 VAKWLEAAAYSLAIHPDPKLEEQVDQLIDLVAAAQQP--DGYLNTYFTVKEPEKRWTNLT 136
Query: 224 PVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLN 283
Y H + AG+ Y+ + L + + +Y + V + H + +
Sbjct: 137 DCHELYCAGHMMEAGVA-HYLATGKRKLLDVVCRLADY----IDSVFGPEDGKIHGFDGH 191
Query: 284 EETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFL----------GFLALQADYL 328
+E + L +LY +T +P++L L+ F +P F F + A+
Sbjct: 192 QE---IELALVKLYEVTREPRYLSLSQYFIDVRGTEPHFFLQEWEQRGRKSFYSSVANPP 248
Query: 329 SHFHANTHIPI-----VIGSQMRY-----------EVTGDP-LYKLIGTFFMDIVNASHS 371
+ +H+P+ +G +R T DP L + + ++V+
Sbjct: 249 HLPYHQSHLPVREQREAVGHSVRAVYMYTAMADLAARTKDPALLEACENLWFNMVH-KQM 307
Query: 372 YATGGTSA----REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG + F D DT+ +E TC + ++ +R + + YAD E
Sbjct: 308 YITGGIGSTHHGEAFTTDYDLPNDTVYAE---TCASIGLIFFARRMLELAPKSEYADVME 364
Query: 428 RALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFN---------SFWCCY 476
RAL N V+ Q G Y+ PL + R G KF+ + CC
Sbjct: 365 RALFNTVIGSMAQDGRH---FFYVNPLEVWPAACRHNPG---KFHVKPVRPGWFACACCP 418
Query: 477 GTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM 536
S LG+ +Y E LY Y+ + G V + + + W+ +
Sbjct: 419 PNVARLLSSLGEYVYTMNEDT---LYTHLYMGGEASVQFGDVPVKVIQNSALPWNG--DV 473
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDK 594
TLT ++ V ++ LRMP W+ A LNG+++ + ++ W+ D
Sbjct: 474 TLTIQPEKAVEW--TVALRMPDWSRGK-ADLRLNGEDVSIEDVMKDGYVYIKRVWAPGDT 530
Query: 595 LTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
L ++L + + + A AI GP
Sbjct: 531 LELELSMEIHQVRANPNIRANAGKAAIQRGP 561
>gi|423223926|ref|ZP_17210395.1| hypothetical protein HMPREF1062_02581 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392637372|gb|EIY31243.1| hypothetical protein HMPREF1062_02581 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 820
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 95/447 (21%), Positives = 168/447 (37%), Gaps = 92/447 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L +A F + G LS + + H PI ++G +R
Sbjct: 229 ALAKLYKVTGDEKYLKMAKYFVEETGRG---TDGHRLSEY-SQDHKPILQQDEIVGHAVR 284
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y + + + + + TGG +R P+ + G
Sbjct: 285 AGYLYSGVADVASLTQDTAYFNALSRIWENMASKKLFITGGIGSR-----PQ--GEGFGP 337
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + +F T YAD ERAL NGV+S GV +
Sbjct: 338 NYELNNHTAYCETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVIS-------GVSL 390
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + +G CC G + F + +GN +
Sbjct: 391 SGDKFFYDNPL-ESMGQHERQQWFGCA-----CCPGN-VTRFMASVPFYMYATQGN--DI 441
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVS--WDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
Y+ YI S + + N K++ I + WD + +++ +QE +L +R+P W
Sbjct: 442 YVNLYIQSKAELNTE--TNNVKLEQITTYPWDGKVSISVNPEKEQEF----ALRVRIPGW 495
Query: 560 -----------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
++++ A+A S+NG+ + + + W D + I P+ +R
Sbjct: 496 AQDAPVPTDLYSFTDKAKAYTISINGKKVNATQLDGYATILHDWKTGDIVEINFPMDVRR 555
Query: 606 EAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFT 665
D+ + AI GP + + D S + + P + T+
Sbjct: 556 VKANDNVEDDRGKLAIERGPIMFCLEGKDQVD---------SIVFNKFIPDGTSMEATYD 606
Query: 666 QESGNSTFVMSNSNQSI----TMEEFP 688
+ N V++ + + I +M+E P
Sbjct: 607 ADLLNGVMVLTGTAKEIEKDGSMKEVP 633
>gi|405380414|ref|ZP_11034253.1| hypothetical protein PMI11_04239 [Rhizobium sp. CF142]
gi|397323106|gb|EJJ27505.1| hypothetical protein PMI11_04239 [Rhizobium sp. CF142]
Length = 642
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 81/349 (23%), Positives = 132/349 (37%), Gaps = 57/349 (16%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFHANT------HIPI 339
L +L +T + K+L L+ F +P F A + + FH T H+P+
Sbjct: 198 ALVKLARVTGEKKYLDLSKFFVDERGTEPHFFTDEATRDGRSAADFHQKTYEYGQAHLPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 258 REQKKVVGHAVRAMYLYAGMADIATEYNDDTLTAALETLWDDLTT-KQMYVTGGIGPAAS 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--I 437
E + D L + S ETC + ++ + + YAD E+AL NG ++
Sbjct: 317 NEGFTDYYDLPNE--SAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMAGLS 374
Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGN 497
GT Y PL A H W ++ CC + +G +Y E
Sbjct: 375 LDGTR---FFYENPL----ESAGKHHRW--IWHHCPCCPPNIARLLASVGSYMYAIAEDE 425
Query: 498 VPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
+ +++ + FD V L+Q+ WD + LT +L+LR+P
Sbjct: 426 I-AVHLYGESKARFDLAGAKVELSQQTR--YPWDGAIHFDLTLDRPAHF----ALSLRIP 478
Query: 558 VWTYSNGAQASLNGQNLPLPPPG--NFLSATERWSYNDKLTIQLPLSLR 604
W + G S+NG+ L L + W DK+ + +PL+ R
Sbjct: 479 EW--AEGVALSVNGEKLDLQSTTVEGYARIERDWKSGDKVDLSIPLAAR 525
>gi|398351289|ref|YP_006396753.1| cytoplasmic protein [Sinorhizobium fredii USDA 257]
gi|390126615|gb|AFL49996.1| putative cytoplasmic protein [Sinorhizobium fredii USDA 257]
Length = 937
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 84/370 (22%), Positives = 149/370 (40%), Gaps = 53/370 (14%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL--SHFHANTHIPI 339
L +L +T + K+L L+ F +P F A++ DY+ +H ++ +H P+
Sbjct: 493 ALVKLARVTGETKYLDLSKFFIDERGQEPHFFTEEAIRDGRSPKDYVHKTHEYSQSHEPV 552
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGGT--SAR 380
V+G +R E D L + T + D+ Y TGG SAR
Sbjct: 553 RQQKKVVGHAVRAMYMYSGMADLATEYKDDTLTDALETLWDDLTT-KQMYVTGGIGPSAR 611
Query: 381 -EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + +AD E+AL NG LS
Sbjct: 612 NEGFTDYYDLPND--TAYAETCASVALVFWASRMLGRGPNRRFADIMEQALYNGALS-GL 668
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
+ Y PL H W ++++ CC + +G +Y +
Sbjct: 669 SLDGKTFFYDNPL----ESTGKHHRW--RWHNCPCCPPNIARLVASVGAYMYGVATDEI- 721
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
+++ ++ + +V L Q + W+ + + L ++ +L+LR+P W
Sbjct: 722 AVHLYGESTARLELDGSNVTLRQVTN--YPWEGAVSIRLELEEPRQF----ALSLRIPEW 775
Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
++GA S+NG + L + WS D ++I LPL LR + + A
Sbjct: 776 --ADGASISVNGSGIDLEHVTLDGYARIEREWSDGDAVSIDLPLKLRPQFANPKVRQDAG 833
Query: 618 IQAILFGPYL 627
A+L GP +
Sbjct: 834 RIALLRGPLV 843
>gi|251797630|ref|YP_003012361.1| hypothetical protein Pjdr2_3643 [Paenibacillus sp. JDR-2]
gi|247545256|gb|ACT02275.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 645
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 73/322 (22%), Positives = 122/322 (37%), Gaps = 32/322 (9%)
Query: 327 YLSHFHANTHIPIVIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATG 375
Y SH P+ +G +R +TGD + Y TG
Sbjct: 241 YQSHLPVREQ-PVAVGHAVRAVYLYTAMADLARLTGDVKLREACERLWANTTGKQMYITG 299
Query: 376 GTSA----REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
G A F +D D + +E TC + ++ +R + + + YAD ERAL
Sbjct: 300 GIGATHLGEAFTFDHDLPNDIVYAE---TCASIGLIFWARRMLQLEAKSEYADVMERALY 356
Query: 432 NGVLSIQRGTEPGVMIYMLPLGR-GVSKARSTHGWGTK-FNSFW----CCYGTGIESFSK 485
N VL + Y+ PL + A+S + K W CC
Sbjct: 357 NNVLG-SMAKDGKHFFYVNPLEVWPEASAKSPDKFHVKPVRQKWFGCSCCPPNVARLLGS 415
Query: 486 LGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQ 544
L + IY E+G+ +++ +F+ + +VLNQK + + W+ ++ S ++
Sbjct: 416 LDEYIYDVSEDGSTVRVHLFIGSEVAFETEGKKIVLNQKSE--LPWNG--QVEFKVSLQE 471
Query: 545 EVGQLS-SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
+ G + L LR+P W S A +NG+ + + + W D++ LP+
Sbjct: 472 DKGDVPFMLALRIPNWFSSKEALLKINGETVRYHVDKGYATVYRVWQDGDRVEWLLPIET 531
Query: 604 RTEAIQDDRPEYASIQAILFGP 625
+ A A AI GP
Sbjct: 532 QLIAANPLIRADAGKAAIQRGP 553
>gi|392965453|ref|ZP_10330872.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
gi|387844517|emb|CCH52918.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
Length = 650
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 86/400 (21%), Positives = 144/400 (36%), Gaps = 87/400 (21%)
Query: 293 LYRLYSITHDPKHLLLAH-------------LFDKPCFLGFLALQADYLSHFHANTHIPI 339
L +LY +T+D ++L A LF P G + YL T
Sbjct: 217 LVKLYRVTNDKRYLDFARFLLDMRGRSDKRELFPDPSRTGN---GSQYLQDHQPVTQQRE 273
Query: 340 VIGSQMR----YEVTGDPLYKLIGTFFMDIVNA-------SHSYATGGTSAREFWWDPKR 388
+G +R Y D ++D + A Y TGG ARE
Sbjct: 274 AVGHAVRAGYMYAAMTDIAAIQQDKAYLDALMAIWNDVVERKQYLTGGLGAREH------ 327
Query: 389 LADTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRG 440
+ G+ E ETC L + +F T + Y D +ER L NG L+
Sbjct: 328 -GEAFGNAYELPNDVAYAETCAAVANLLWNHRMFLLTGQSKYMDVFERVLYNGFLA-GVS 385
Query: 441 TEPGVMIYMLPLGR--------GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
E Y+ PL GV+ R+ +GT CC + L +Y
Sbjct: 386 LEGDKFFYVNPLASDGKRKFNVGVAAERAPW-FGTS-----CCPTNVVRFLPSLPGYVYA 439
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ +V ++ ++++S + G + + WD + MT++ + Q L
Sbjct: 440 VKNNDV---FVNLFLTNSSELTVGKTPVQVQQQTNYPWDGAVTMTVSPRNAQAF----DL 492
Query: 553 NLRMPVWTYSN-------------GAQASL--NGQNLPLPPPGNFLSATERWSYNDKLTI 597
+R+P WT GA SL NG+ +P+ + + W D++ +
Sbjct: 493 LVRIPGWTLGKPMPGNLYSYRRNIGATPSLKVNGKAVPVKMDNGYARISRTWKPGDRVEL 552
Query: 598 QLPLSLR----TEAIQDDRPEYASIQAILFGPYLLAGHTS 633
++ + +R + ++DD A AI GP + +
Sbjct: 553 RMEMPVREVIANQQVKDD----AGRVAIERGPIVYCAEAA 588
>gi|281424179|ref|ZP_06255092.1| conserved hypothetical protein [Prevotella oris F0302]
gi|281401448|gb|EFB32279.1| conserved hypothetical protein [Prevotella oris F0302]
Length = 638
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 56/257 (21%), Positives = 98/257 (38%), Gaps = 20/257 (7%)
Query: 374 TGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
TG ++ E W+ K L +ETC T +K+SR L T YAD E + N
Sbjct: 308 TGSGASMESWFGGKHLQYMPIRHFQETCVTATWIKLSRQLLLLTGNTKYADAVEISFYNA 367
Query: 434 VLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE 493
+L R T+ PL G G CC +G + +
Sbjct: 368 LLGAMR-TDASDWAKYTPLSGQRLPGSEQCGMGLN-----CCNASGPRGLFVIPQTAVLT 421
Query: 494 EEGNVPGLYIIQYISSSFDWKS-GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
+ G+ + YI+ + + H + K++ + + L+ + + ++
Sbjct: 422 ---SAKGVDVNLYIAGDYKLTTPRHQQMVLKLEGEYPKNNKMSFLLSLKKAENI----TI 474
Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
LR+P W S + +N + G +L + W + D+++I+ + +
Sbjct: 475 RLRIPEW--STATKVIVNDVAVEHVQAGKYLELSRTWHHGDRISIEFDMPGIVHRL-GQH 531
Query: 613 PEYASIQAILFGPYLLA 629
PEY AI GP +LA
Sbjct: 532 PEYV---AITRGPIVLA 545
>gi|150376304|ref|YP_001312900.1| hypothetical protein Smed_4162 [Sinorhizobium medicae WSM419]
gi|150030851|gb|ABR62967.1| protein of unknown function DUF1680 [Sinorhizobium medicae WSM419]
Length = 640
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 71/298 (23%), Positives = 121/298 (40%), Gaps = 37/298 (12%)
Query: 348 EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTY 404
E D L + T + D+V Y TGG ++ E + D L + + ETC +
Sbjct: 283 EYKDDSLTAALETLWDDLVT-KQMYVTGGIGPAASNEGFTDYYDLPND--TAYAETCASV 339
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPLGRGVSK 458
++ + + + YAD E+AL NG L PG+ I Y PL S
Sbjct: 340 GLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGKTFFYDNPL---EST 389
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG-H 517
R H W K++ CC + +G +Y E + +++ ++ +G
Sbjct: 390 GRH-HRW--KWHHCPCCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGESAARLKLANGAE 445
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP 577
V L Q + WD + F+++ + +L+LR+P W + GA S+NG L L
Sbjct: 446 VELRQATN--YPWD----GAIAFTARLDRPARFALSLRIPEW--AAGATLSVNGSMLDLS 497
Query: 578 P--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTS 633
+ WS D++ + LPL+LR + + A++ GP + +
Sbjct: 498 AHLADGYARIEREWSDGDRVALYLPLTLRPQYANPKVRQDVGRVALMRGPLVYCAEAA 555
>gi|320161641|ref|YP_004174866.1| hypothetical protein ANT_22400 [Anaerolinea thermophila UNI-1]
gi|319995495|dbj|BAJ64266.1| hypothetical protein ANT_22400 [Anaerolinea thermophila UNI-1]
Length = 664
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 101/237 (42%), Gaps = 41/237 (17%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-GRGVS 457
ETC + + L + T + Y++ +E L N S+ G + +Y PL RG
Sbjct: 353 ETCAALASMFWNWELAQITGKARYSELFEWQLYNAA-SVGMGLDGTTYLYNNPLTCRGGV 411
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS-- 515
+ R + + CC +F+ LGD +Y + G LY+ QY+SS +
Sbjct: 412 ERRP-------WYAVPCCPSNLSRTFAWLGDYLYSAKPGR---LYVHQYLSSDLPAQEIP 461
Query: 516 ----GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN--LRMPVWTYSNGAQASL 569
V L+ ++D + W ++ + L + Q + L LR+P W + + +L
Sbjct: 462 CANGNRVRLSLQMDSQLPWHGHVVLRLRRWEVLDPDQPAPLEILLRLPSW--AENPRLTL 519
Query: 570 NGQNL----PLP-----PPGN--------FLSATERWSYNDKLTIQ--LPLSLRTEA 607
NGQ L P P PP + FL ++ W+ D L ++ LP+ LR A
Sbjct: 520 NGQPLFLQIPQPQQDGEPPADGYDPRQAVFLPLSQPWAEGDTLELRFDLPIRLRHAA 576
>gi|284039567|ref|YP_003389497.1| hypothetical protein Slin_4720 [Spirosoma linguale DSM 74]
gi|283818860|gb|ADB40698.1| protein of unknown function DUF1680 [Spirosoma linguale DSM 74]
Length = 655
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 87/378 (23%), Positives = 138/378 (36%), Gaps = 73/378 (19%)
Query: 293 LYRLYSITHDPKHLLLAH-------------LFDKPCFLGFLALQADYLSHFHANTHIPI 339
L +LY +T+D ++L A LF P G A YL T
Sbjct: 216 LVKLYRVTNDKRYLDFARFLLDMRGRADKRPLFPDPAKTG---QGASYLQDHLPVTQQKT 272
Query: 340 VIGSQMR----YEVTGDPLYKLIGTFFMDIVNA-------SHSYATGGTSAR---EFWWD 385
+G +R Y D +MD + A Y TGG AR E + +
Sbjct: 273 AVGHSVRAGYMYAAMSDIAAIQKDKAYMDALLAIWNDVVERKQYLTGGLGARGHGEAFGE 332
Query: 386 PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
L + + ETC + + +F T E Y D +ER L NG L+ E
Sbjct: 333 AYELPNDVAYA--ETCAAVANMLWNHRMFLLTGESKYMDVFERVLYNGFLA-GVSLEGDS 389
Query: 446 MIYMLPLGR------GVSKARSTHGW-GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
Y+ PL V +A + W GT CC + L +Y + N+
Sbjct: 390 FFYVNPLASDGKRKFNVGQAATRAPWFGTS-----CCPTNVVRFLPSLPGYVYATKGDNL 444
Query: 499 -PGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP 557
L++ S + KS V + Q+ + WD + +T+ + ++ Q ++ LR+P
Sbjct: 445 FINLFLTNQSKLSVNGKS--VQIRQETN--YPWDGNVAITV----QPKLAQTFTIQLRLP 496
Query: 558 VWT-----------YSNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
W Y N + +NG+ +P + + W D+L L +
Sbjct: 497 GWASGTPMPGYLYEYVNTTAKTPVLLVNGKPVPYKIENGYARISRTWKPGDRLEWTLDMP 556
Query: 603 LR----TEAIQDDRPEYA 616
+R E + DDR + A
Sbjct: 557 VREVKANEQVTDDRKKVA 574
>gi|322831792|ref|YP_004211819.1| hypothetical protein Rahaq_1069 [Rahnella sp. Y9602]
gi|321166993|gb|ADW72692.1| protein of unknown function DUF1680 [Rahnella sp. Y9602]
Length = 657
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 53/221 (23%), Positives = 82/221 (37%), Gaps = 34/221 (15%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + ++ + + + + YAD ERAL N VL+ + Y+ PL
Sbjct: 339 ETCASIGLMMFANRMLQMDSDSRYADVMERALYNTVLA-GMALDGKHFFYVNPL------ 391
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H FN + CC + LG IY + G+ I
Sbjct: 392 --EVHPKSIPFNHIYDHVKPVRQRWFGCACCPPNIARLLASLGHYIYTQRPD---GVDIN 446
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
YI S + G L K W + + + E ++L LR+P W S
Sbjct: 447 LYIGSDVEATIGGKALRLKQSGGYPWAEGVLIEIDTDQPLE----ATLALRLPDWCVS-- 500
Query: 565 AQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
Q +LNG L L +L T+ W D++ + LP+ +
Sbjct: 501 PQVTLNGNPLELTSLTQRGYLRLTQEWQKGDRIEMMLPMPV 541
>gi|448238166|ref|YP_007402224.1| AraN-like protein [Geobacillus sp. GHH01]
gi|445207008|gb|AGE22473.1| AraN-like protein [Geobacillus sp. GHH01]
Length = 643
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 78/359 (21%), Positives = 129/359 (35%), Gaps = 70/359 (19%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGF----------LALQADYLSHFHANTHI 337
L +LY +T + +L L+ F +P + + DY H HI
Sbjct: 193 LLKLYEVTGNESYLKLSQYFIDQRGQQPHYFDWEKKARGETKPFWFHDDYRYH---QAHI 249
Query: 338 PI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSA-- 379
P+ +G +R TGD K + V Y TGG +
Sbjct: 250 PVREQKQAVGHAVRALYMYTAMAGLAAKTGDESLKQACQTLWENVTKRQMYITGGVGSSA 309
Query: 380 --REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
F +D DT +E TC + ++ +R + + YAD ERAL NG +S
Sbjct: 310 FGESFTFDFDLPNDTAYAE---TCASIALVFWARRMLELETDGKYADVMERALYNGTIS- 365
Query: 438 QRGTEPGVMIYMLPL-----------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKL 486
+ Y+ PL R V R K+ S CC + +
Sbjct: 366 GMDLDGKKFFYVNPLEVWPKACERHDKRHVKPVRQ------KWFSCACCPPNLARLIASI 419
Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV 546
G IY + L++ Y+ S + G + + WD +R+T+ S E
Sbjct: 420 GHYIYSQ---TSDALFVHLYVGSDIRTELGGRSVEIVQETNYPWDGTVRLTVLPESAGEF 476
Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
++ LR+P W GA ++NG+ + + P + W D++ + P+ +
Sbjct: 477 ----TIGLRIPGW--CRGATLTINGEKVDMVPLIQKGYAYIKRIWKKGDQVELVFPMPV 529
>gi|384256908|ref|YP_005400842.1| hypothetical protein Q7S_05115 [Rahnella aquatilis HX2]
gi|380752884|gb|AFE57275.1| hypothetical protein Q7S_05115 [Rahnella aquatilis HX2]
Length = 657
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 53/221 (23%), Positives = 82/221 (37%), Gaps = 34/221 (15%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + ++ + + + + YAD ERAL N VL+ + Y+ PL
Sbjct: 339 ETCASIGLMMFANRMLQMDSDSRYADVMERALYNTVLA-GMALDGKHFFYVNPL------ 391
Query: 459 ARSTHGWGTKFNSFW--------------CCYGTGIESFSKLGDSIYFEEEGNVPGLYII 504
H FN + CC + LG IY + G+ I
Sbjct: 392 --EVHPKSIPFNHIYDHVKPVRQRWFGCACCPPNIARLLASLGHYIYTQRPD---GVDIN 446
Query: 505 QYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
YI S + G L K W + + + E ++L LR+P W S
Sbjct: 447 LYIGSDVEATIGGKALRLKQSGGYPWAEGVLIEIDTDQPLE----ATLALRLPDWCAS-- 500
Query: 565 AQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSL 603
Q +LNG L L +L T+ W D++ + LP+ +
Sbjct: 501 PQVTLNGNPLELTSLTQRGYLRLTQEWQKGDRIEMMLPMPV 541
>gi|383777979|ref|YP_005462545.1| hypothetical protein AMIS_28090 [Actinoplanes missouriensis 431]
gi|381371211|dbj|BAL88029.1| hypothetical protein AMIS_28090 [Actinoplanes missouriensis 431]
Length = 640
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 69/309 (22%), Positives = 116/309 (37%), Gaps = 39/309 (12%)
Query: 369 SHSYATGGTSAR---EFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADY 425
S +Y TGG +R E + D L ETC ++ L T YAD
Sbjct: 288 SRTYLTGGQGSRHRDEAYGDAYELPPD--RAYAETCAAIASFQLGFRLLLATGSAKYADE 345
Query: 426 YERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK---ARSTHGWGTKFNSFWCCYGTGIES 482
ER L N + + + Y PL R + G + CC +
Sbjct: 346 MERVLYNAI-AASTAVDGKAFFYSQPLQRRTGHDGGGENAPGHRLDWYECACC----PPN 400
Query: 483 FSKLGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFS 541
++L S++ + G+ GL + Y S +F + V +V+ WD + +T+T S
Sbjct: 401 LARLMASLHTYAATGDAGGLELHLYGSGTFTSANRSV----EVETRYPWDEQITVTVTSS 456
Query: 542 SKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQL 599
+L+LR+P W + + ++NG P P +L W D++ + L
Sbjct: 457 PDDPW----TLSLRIPAW--CDDVRLTVNGTAAPAGPQIHDGYLRLNRIWHEGDRVVLTL 510
Query: 600 PLSLRTEAIQDDRPEYASIQAILFGPYL-------------LAGHTSGEWDIKTGTARSL 646
+ R A A++ GP + AGH + ++ TG+ S+
Sbjct: 511 AMPARLVAAHPRVDATRGTAALVRGPIVHCLEHADIPATGPFAGHCFEDLELDTGSPVSV 570
Query: 647 SALISPIPP 655
+ S + P
Sbjct: 571 AYHSSGLAP 579
>gi|423299822|ref|ZP_17277847.1| hypothetical protein HMPREF1057_00988 [Bacteroides finegoldii
CL09T03C10]
gi|408473631|gb|EKJ92153.1| hypothetical protein HMPREF1057_00988 [Bacteroides finegoldii
CL09T03C10]
Length = 698
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 58/216 (26%), Positives = 93/216 (43%), Gaps = 16/216 (7%)
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ + ETC + + + T + YAD E L N VLS + Y PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434
Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
+ T W T++ S +CC + + + + Y EG LY ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
+ K G V L Q+ D WD +R+TL + ++ G SL LR+P W A ++N
Sbjct: 495 WKGK-GEVALTQETD--YPWDGNVRVTLD-KAPRKAGTF-SLFLRIPEW--CEKATLTVN 547
Query: 571 GQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
GQ L + N + R W D +L + +P+ L
Sbjct: 548 GQPLQVNAKANSYAEVNRAWKKGDVVELVMNMPVRL 583
>gi|150009918|ref|YP_001304661.1| hypothetical protein BDI_3335 [Parabacteroides distasonis ATCC
8503]
gi|423333683|ref|ZP_17311464.1| hypothetical protein HMPREF1075_03115 [Parabacteroides distasonis
CL03T12C09]
gi|149938342|gb|ABR45039.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
gi|409226993|gb|EKN19895.1| hypothetical protein HMPREF1075_03115 [Parabacteroides distasonis
CL03T12C09]
Length = 617
Score = 47.4 bits (111), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 48/212 (22%), Positives = 93/212 (43%), Gaps = 24/212 (11%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-GRGVS 457
ETC + M+ ++ + ++T + Y D ER++ NG L+ E Y+ PL +G
Sbjct: 334 ETCASVGMVLWNQRMNQFTGDSKYIDVLERSMYNGALA-GISLEGDRFFYVNPLESKGDH 392
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
++ +G CC +G+ IY +++ YI +S + + +
Sbjct: 393 HRQAWYGCA-------CCPSQISRFLPSIGNYIYGTSN---EAIWVNLYIGNSTEINTDN 442
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSS--KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
+ + + WD +++T+T S+ K+E+ LR+P W S+NGQ +
Sbjct: 443 TNVTLRQETNYPWDGTVKLTVTPSNPLKKEI------RLRIPSWCEQ--YTLSVNGQLVK 494
Query: 576 LPPPGNFLSATERWSYND--KLTIQLPLSLRT 605
P + + W D L++++P+ L T
Sbjct: 495 APTEKGYAVLNKEWKQGDVISLSMEMPVKLMT 526
>gi|227820086|ref|YP_002824057.1| hypothetical protein NGR_b18560 [Sinorhizobium fredii NGR234]
gi|227339085|gb|ACP23304.1| putative cytoplasmic protein [Sinorhizobium fredii NGR234]
Length = 640
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 77/370 (20%), Positives = 149/370 (40%), Gaps = 53/370 (14%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL--SHFHANTHIPI 339
L +L +T + K+L L+ F +P F A++ DY+ +H ++ +H P+
Sbjct: 196 ALVKLARVTGEKKYLALSKFFIDERGQEPHFFTEEAIRDGRSPKDYIQKTHEYSQSHEPV 255
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + + T + D+ Y TGG ++
Sbjct: 256 RRQKKVVGHAVRAMYMYSGMADLATEYKDDTLTEALETLWDDLTT-KQMYVTGGIGPSAK 314
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + + ETC + ++ + + +AD E+AL NG +S
Sbjct: 315 NEGFTDYYDLPND--TAYAETCASVALVFWASRMLGRGPNRRFADIMEQALYNGAIS-GL 371
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
+ Y PL H W K+++ CC + +G +Y +
Sbjct: 372 SLDGKTFFYDNPL----ESTGKHHRW--KWHNCPCCPPNIARLVASVGAYMYGVAADEI- 424
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
+++ + + V L Q + W+ + + + + +L+LR+P W
Sbjct: 425 AVHLYGESTVRLELGGSQVTLRQVTN--YPWEGAVSIRIELDEPRHF----ALSLRIPEW 478
Query: 560 TYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
++GA+ ++NG ++ L + WS D++++ LPL LR + + A
Sbjct: 479 --ADGARVAVNGSSIDLDGVMTDGYALIEREWSDGDEISLDLPLRLRPQYANPKVRQDAG 536
Query: 618 IQAILFGPYL 627
A++ GP +
Sbjct: 537 RVALMRGPLV 546
>gi|299145521|ref|ZP_07038589.1| putative cytoplasmic protein [Bacteroides sp. 3_1_23]
gi|298516012|gb|EFI39893.1| putative cytoplasmic protein [Bacteroides sp. 3_1_23]
Length = 698
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 58/216 (26%), Positives = 91/216 (42%), Gaps = 16/216 (7%)
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ + ETC + + + T + YAD E L N VLS + Y PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434
Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
+ T W T++ S +CC + + + + Y EG LY ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494
Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
WK G V L Q+ D WD +R+TL ++VG SL LR+P W A +
Sbjct: 495 --WKEKGEVALTQETD--YPWDGNVRVTLD-KVPRKVGTF-SLFLRIPEW--CEKATLRV 546
Query: 570 NGQNLPLPPPGNFLSATER-WSYNDKLTIQLPLSLR 604
NGQ L + N + R W D + + + + +R
Sbjct: 547 NGQPLQVNAKANSYAEVNRAWKKGDIVELMMDMPVR 582
>gi|417487787|ref|ZP_12172639.1| secreted protein [Salmonella enterica subsp. enterica serovar
Rubislaw str. A4-653]
gi|353632529|gb|EHC79566.1| secreted protein [Salmonella enterica subsp. enterica serovar
Rubislaw str. A4-653]
Length = 663
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 87/401 (21%), Positives = 140/401 (34%), Gaps = 69/401 (17%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHFH-------------A 333
L RLY +T P+++ LA F +P F + S++H +
Sbjct: 192 ALMRLYEVTEQPRYMALASYFIGQRGAQPHFYDEEYEKRGQTSYWHTYGPAWMVKDKAYS 251
Query: 334 NTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG- 376
H+PI IG +R+ ++ D + + Y TGG
Sbjct: 252 QAHLPISQQQTAIGHAVRFVYLMTGVAHLARLSNDEGKRQDCLRLWKNMAQRQLYITGGI 311
Query: 377 ---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNG 433
+S F D D++ +E +C + ++ +R + + YAD ERA
Sbjct: 312 GSQSSGEAFSCDYDLPNDSIYAE---SCASIGLMMFARRMLEMEADSQYADVMERAREYA 368
Query: 434 -VLSIQRGTEPGVMIYMLPLGRGV--SKARSTHGWGTKFNSFW--------------CCY 476
V+ R V+ M G+ H KFN + CC
Sbjct: 369 DVMERARALYNTVLGGMALDGKHFFYVNPLEVHPKSLKFNHIYDHVKPIRQRWFGCACCP 428
Query: 477 GTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM 536
+ LG IY LYI Y+ +S + + L ++ W + ++
Sbjct: 429 PNIARVLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPW--HEQV 483
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLT 596
+ S Q V +L LR+P W A+ +LNG + +L W D +T
Sbjct: 484 KIAIDSVQPVRH--TLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTIT 539
Query: 597 IQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSGE 635
+ LP+ +R A AI GP Y L +GE
Sbjct: 540 LTLPMPVRRVYGNPLARHVAGKVAIQRGPLVYCLEQADNGE 580
>gi|251797570|ref|YP_003012301.1| hypothetical protein Pjdr2_3583 [Paenibacillus sp. JDR-2]
gi|247545196|gb|ACT02215.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 674
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 77/299 (25%), Positives = 114/299 (38%), Gaps = 22/299 (7%)
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLAD 391
HA + G Y TG+ Y D ++ S+ TGG A D K A+
Sbjct: 292 HAVRATLLYTGLTALYLCTGEVPYLETAKKLWDNISHQKSHVTGGVGAVHH--DEKFGAN 349
Query: 392 TLGSENE--ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYM 449
+N ETC M S +LF T E Y D E + N VL+ R + Y
Sbjct: 350 YELPDNGYLETCAGVGMGFFSWNLFLATGESRYIDKLETIIYNIVLA-GRSMDGHKYFYE 408
Query: 450 LPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS 509
PL VSK W +++S CC ++ +L IY + G +I YI S
Sbjct: 409 NPL---VSKGGHNR-W--EWHSCPCCPPMIMKLMPELASYIYAYDG---KGAFINLYIGS 459
Query: 510 SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
+ G V + K W + +T+T E L LR+P W + +
Sbjct: 460 ESELLIGDVPVTVKQQTNYPWSGAVGITVTPERDAEF----DLRLRIPEWCGQYAIRVND 515
Query: 570 NGQNLPLPPPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
N L N + R WS D++ ++L + + + + +A AI GP L
Sbjct: 516 QAANYELE---NGYAVLHRVWSPGDRIQLELDMPVHLVEVHPNVTTHADKAAIRRGPVL 571
>gi|336416221|ref|ZP_08596557.1| hypothetical protein HMPREF1017_03665 [Bacteroides ovatus
3_8_47FAA]
gi|335938952|gb|EGN00831.1| hypothetical protein HMPREF1017_03665 [Bacteroides ovatus
3_8_47FAA]
Length = 698
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 58/216 (26%), Positives = 91/216 (42%), Gaps = 16/216 (7%)
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ + ETC + + + T + YAD E L N VLS + Y PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434
Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
+ T W T++ S +CC + + + + Y EG LY ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494
Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
WK G V L Q+ D WD +R+TL ++VG SL LR+P W A +
Sbjct: 495 --WKEKGEVALTQETD--YPWDGNVRVTLD-KVPRKVGTF-SLFLRIPEW--CEKATLRV 546
Query: 570 NGQNLPLPPPGNFLSATER-WSYNDKLTIQLPLSLR 604
NGQ L + N + R W D + + + + +R
Sbjct: 547 NGQPLQVNAKANSYAEVNRAWKKGDIVELMMDMPVR 582
>gi|333994236|ref|YP_004526849.1| hypothetical protein TREAZ_1028 [Treponema azotonutricium ZAS-9]
gi|333736667|gb|AEF82616.1| conserved hypothetical protein [Treponema azotonutricium ZAS-9]
Length = 675
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 78/373 (20%), Positives = 126/373 (33%), Gaps = 51/373 (13%)
Query: 292 VLYRLYSITHDPKHLLLAHLFD-----------------------KPCFLGFLALQA--- 325
L RLY +T D KHL LA F K ++ + QA
Sbjct: 220 ALVRLYDVTKDEKHLKLARYFIDQRGQSPLYFEEETKRNGNEFYWKDSYVKYQYYQAGKP 279
Query: 326 ---DYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTS---- 378
+++ HA + + G +TGD + + + Y TGG
Sbjct: 280 VRDQHIAEGHAVRAVYLYSGMADIARLTGDDTLIKSCSDLWENITQKQMYITGGIGQSAY 339
Query: 379 AREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQ 438
F +D DT+ +E TC + + +R + + ++AD E AL NG++S
Sbjct: 340 GEAFSYDYDLPNDTVYAE---TCASIGLAFFARRMLSIAPKGSFADVLETALYNGIIS-G 395
Query: 439 RGTEPGVMIYMLPL------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYF 492
+ Y+ PL R G K+ + CC S LG IY
Sbjct: 396 MSLDGKSFFYVNPLEVIPEANEKDRIRRHVKGVRQKWFACACCPPNLARIISSLGSYIYS 455
Query: 493 EEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSL 552
++ LY +I S+ + + K++ W+ +R+ + G
Sbjct: 456 VKDN---ALYTHLFIGSTAKAQLSGKEVTVKLETSYPWEEKVRVDFQVPGE---GAKFDY 509
Query: 553 NLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
R+P W S LNG + + W D L+I + +
Sbjct: 510 AFRLPGWCRS--CSVELNGAKADYKKADGYAIISREWKSGDSLSIVFDMPVNFVEANPKV 567
Query: 613 PEYASIQAILFGP 625
E + AI GP
Sbjct: 568 RENSGKLAITRGP 580
>gi|189464189|ref|ZP_03012974.1| hypothetical protein BACINT_00526 [Bacteroides intestinalis DSM
17393]
gi|189437979|gb|EDV06964.1| hypothetical protein BACINT_00526 [Bacteroides intestinalis DSM
17393]
Length = 801
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 81/347 (23%), Positives = 133/347 (38%), Gaps = 59/347 (17%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T D K+L A F D+ G+ + +Y + H P+V +G +R
Sbjct: 222 LAKLYLVTGDKKYLDQAKFFLDQ---RGYTSRTDEY-----SQAHKPVVQQDEAVGHAVR 273
Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADT 392
+TGD Y D + Y TGG T+A E + L +
Sbjct: 274 AAYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATAAGEAFGKNYELPNM 333
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
S ETC + V+ LF E Y D ER L NG++S + G Y P+
Sbjct: 334 --SAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPM 390
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
++ H F CC L IY ++ +V Y+ ++S++ D
Sbjct: 391 -----ESMGQHQRQPWFGCA-CCPSNICRFIPSLPGYIYAVKDKDV---YVNLFMSNTSD 441
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TY 561
K G ++ + W+ + + + +K GQ +L +R+P W TY
Sbjct: 442 LKVGGKAVSIEQTTQYPWNGDITIGI---NKNSAGQF-NLKVRIPGWVRGQVVPSDLYTY 497
Query: 562 SNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
S+G + +NG+ + + RW DK+ + + R
Sbjct: 498 SDGKRLKYTVKVNGEAVQNELKDGYFCIDRRWKKGDKVEVHFDMEPR 544
>gi|435854457|ref|YP_007315776.1| hypothetical protein Halha_1747 [Halobacteroides halobius DSM 5150]
gi|433670868|gb|AGB41683.1| hypothetical protein Halha_1747 [Halobacteroides halobius DSM 5150]
Length = 655
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 111/298 (37%), Gaps = 48/298 (16%)
Query: 332 HANTHIPI-----VIGSQMR------------YEVTGDPLYKLIGTFFMDIVNASHSYAT 374
+A H+P+ V+G +R E L + +G + ++ Y T
Sbjct: 265 YAQDHLPVREQDKVVGHAVRAMYLYCGMADVAMETKDHELIQALGNLWANMT-KKRMYVT 323
Query: 375 GGTSARE----FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERAL 430
GG + F D DT +E TC + ++ + + T E +AD ER L
Sbjct: 324 GGIGSAHHNEGFTADYDLPNDTAYAE---TCAAVGSMMWNQRMLKLTGEACFADIIERTL 380
Query: 431 TNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSI 490
NG LS T Y+ PL + R GW CC + L I
Sbjct: 381 YNGFLSGVSLT-GDKFFYVNPLESDGTHHRK--GWF----KVSCCPPNIARFLASLEKYI 433
Query: 491 YFEEEGNVPGLYIIQYIS--SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQ 548
Y + E + +I QYIS V++ Q D WD + + + + E
Sbjct: 434 YLKNEDCI---FINQYISGKGKVSIAEEEVIIRQ--DTAYPWDDKVNIKINLKNPSEF-- 486
Query: 549 LSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGN---FLSATERWSYNDKLTIQLPLSL 603
+L+LR+P W A +N Q+L + N + +W D++ ++ + +
Sbjct: 487 --TLSLRIPDWCQE--ASLQINNQSLEIESIINDNGYAQIRRKWRNGDQIRLEFAMPI 540
>gi|427384250|ref|ZP_18880755.1| hypothetical protein HMPREF9447_01788 [Bacteroides oleiciplenus YIT
12058]
gi|425727511|gb|EKU90370.1| hypothetical protein HMPREF9447_01788 [Bacteroides oleiciplenus YIT
12058]
Length = 801
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 136/350 (38%), Gaps = 61/350 (17%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQM 345
L +LY +T K+L A F D+ G+ + +Y + H P+V +G +
Sbjct: 221 ALAKLYLVTGQQKYLDQAKFFLDQ---RGYTSRTDEY-----SQAHKPVVQQDEAVGHAV 272
Query: 346 RYE-----------VTGDPLY-KLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLA 390
R +TGD Y I + +IV + Y TGG T+A E + L
Sbjct: 273 RAAYMYAGMADVAALTGDTAYIHAIDRIWNNIVGKKY-YITGGIGATAAGEAFGKNYELP 331
Query: 391 DTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
+ S ETC + V+ LF E Y D ER L NG++S + G Y
Sbjct: 332 NM--SAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPN 388
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
PL ++ H F CC L IY ++ +V Y+ ++S++
Sbjct: 389 PL-----ESMGQHQRQPWFGCA-CCPSNICRFIPSLPGYIYAVKDKDV---YVNLFMSNT 439
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW----------- 559
D K G ++ + W+ + + + K GQ ++ +R+P W
Sbjct: 440 SDLKVGGKAVSIEQTTKYPWNGDIAIGI---KKNNAGQF-TMKVRIPGWVRGQVVPSDLY 495
Query: 560 TYSNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
TYS+G + ++NG+ + RW DK+ I + RT
Sbjct: 496 TYSDGKRLKYTVAVNGEPAQSELKDGYFCIDRRWKKGDKIEIHFDMEPRT 545
>gi|218260014|ref|ZP_03475493.1| hypothetical protein PRABACTJOHN_01154 [Parabacteroides johnsonii
DSM 18315]
gi|218224797|gb|EEC97447.1| hypothetical protein PRABACTJOHN_01154 [Parabacteroides johnsonii
DSM 18315]
Length = 816
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 85/386 (22%), Positives = 136/386 (35%), Gaps = 79/386 (20%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMRY 347
L +LY +T D K+L +A F + G + + S H+PI ++G +R
Sbjct: 219 LAKLYKVTGDRKYLDMAKYFVEETGRGTDGHRLNAYSQ----DHMPILQQEEIVGHAVRA 274
Query: 348 -----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
+T D Y D + Y TGG +R + G E
Sbjct: 275 GYLYSGVADVAALTKDTAYFHAICRIWDNMATKKLYITGGIGSRA-------QGEGFGPE 327
Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI- 447
E ETC + + ++ +F T + Y D ERAL NGV+S GV +
Sbjct: 328 YELHNHSAYCETCASIANVYWNQRMFLATGDAKYIDVLERALYNGVIS-------GVSLS 380
Query: 448 -----YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
Y PL ++ H F CC G + F + +GN LY
Sbjct: 381 GDKFFYDNPL-----ESMGQHERAPWFGCA-CCPGN-VTRFMASVPKYMYATQGN--SLY 431
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
+ Y+ S + + D WD +++T++ SL LR+P WT +
Sbjct: 432 VNLYVGSESRVALANDTVTLVQDTEYPWDGLVKLTVSPRKASSF----SLKLRIPSWTGN 487
Query: 563 NGAQAS----------------LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
S +NG L ++ W D + +++P+ +R
Sbjct: 488 EPVPGSDLYTYIKRDREPCAVFVNGTPLKEKAHHGYVVIEREWEPGDVIELRMPMDVRRV 547
Query: 607 AIQDDRPEYASIQAILFGP--YLLAG 630
+ + A+ GP Y L G
Sbjct: 548 KAHEKVRADQGLLAVERGPVVYCLEG 573
>gi|336251952|ref|YP_004585920.1| hypothetical protein Halxa_0515 [Halopiger xanaduensis SH-6]
gi|335339876|gb|AEH39114.1| protein of unknown function DUF1680 [Halopiger xanaduensis SH-6]
Length = 636
Score = 47.0 bits (110), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 55/209 (26%), Positives = 86/209 (41%), Gaps = 26/209 (12%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGV 456
ETC + +R +F T + YAD ER L NG L+ GTE Y L
Sbjct: 335 ETCAAIGSVFWNRRMFELTGDAKYADLIERTLYNGFLAGVSLDGTE---FFYDNRLESDG 391
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG--LYIIQYISSSFDWK 514
S R GW F+ CC F+ L +Y V G LY+ QY+ S+
Sbjct: 392 SHGR--QGW---FDCA-CCPPNVARLFASLERYLY-----TVDGRELYVNQYVESTATPT 440
Query: 515 SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
L WD + + + + ++++LR+P W + A +NG+
Sbjct: 441 VDDAELEVAQTTDYPWDSEVTIDVEAPEPTQ----ATISLRVPEW--CDEASIEVNGE-- 492
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSL 603
P+P G+ + ER +D++T +S+
Sbjct: 493 PIPVDGDGYVSLERTWDDDRITATFEMSV 521
>gi|326802069|ref|YP_004319888.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326552833|gb|ADZ81218.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
Length = 659
Score = 47.0 bits (110), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 58/244 (23%), Positives = 93/244 (38%), Gaps = 33/244 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + M+ ++ + T E Y D ER+L NG L Y PL
Sbjct: 335 ETCASVGMVFWNQRMNLLTGEAKYFDILERSLYNGALD-GLSYSGNRFFYGNPLASHGGY 393
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISS--SFDWKSG 516
RS +GT CC LGD IY + V ++ ++ S + G
Sbjct: 394 GRS-EWFGTA-----CCPSNIARLVESLGDYIYAHSDKAV---WVNLFVGSKAAIPLSQG 444
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW---------------TY 561
V + Q+ W + + +T K++ L++R+P W T
Sbjct: 445 TVEIAQQTG--YPWQGDVNIRVTPDRKRKF----PLHIRIPGWLLGQPAPGDTYRFLDTT 498
Query: 562 SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAI 621
N +NG+N+P ++ W ND ++IQ+PL ++ A D + A+
Sbjct: 499 ENKYTLQVNGKNVPYHIEKGYVVIDRIWDKNDAVSIQMPLEVKKIAANDQVVANKNRIAL 558
Query: 622 LFGP 625
GP
Sbjct: 559 QRGP 562
>gi|224537077|ref|ZP_03677616.1| hypothetical protein BACCELL_01954 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521304|gb|EEF90409.1| hypothetical protein BACCELL_01954 [Bacteroides cellulosilyticus
DSM 14838]
Length = 811
Score = 46.6 bits (109), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 85/384 (22%), Positives = 146/384 (38%), Gaps = 79/384 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L +A F + G LS + + H PI ++G +R
Sbjct: 220 ALAKLYKVTGDEKYLKMAKYFVEETGRG---TDGHRLSEY-SQDHKPILQQDEIVGHAVR 275
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y + + + + + TGG +R P+ + G
Sbjct: 276 AGYLYSGVADVASLTQDTAYFNALSRIWENMASKKLFITGGIGSR-----PQ--GEGFGP 328
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + +F T YAD ERAL NGV+S GV +
Sbjct: 329 NYELNNHTAYCETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVIS-------GVSL 381
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL + + +G CC G + F + +GN +
Sbjct: 382 SGDKFFYDNPL-ESMGQHERQQWFGCA-----CCPGN-VTRFMASVPFYMYATQGN--DI 432
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVS--WDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
Y+ YI S + + N K++ I + WD + +++ +QE +L +R+P W
Sbjct: 433 YVNLYIQSKAELNTE--TNNVKLEQITTYPWDGKVSISVNPEKEQEF----ALRVRIPGW 486
Query: 560 -----------TYSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
++++ A+A S+NG+ + + + W D + I P+ +R
Sbjct: 487 AQDAPVPTDLYSFTDKAKAYTISINGKKVNATQLDGYATILHDWKTGDVVEINFPMDVRR 546
Query: 606 EAIQDDRPEYASIQAILFGPYLLA 629
D+ + AI GP +
Sbjct: 547 VKANDNVEDDRGKLAIERGPIMFC 570
>gi|423223921|ref|ZP_17210390.1| hypothetical protein HMPREF1062_02576 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392637419|gb|EIY31288.1| hypothetical protein HMPREF1062_02576 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 801
Score = 46.6 bits (109), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 82/348 (23%), Positives = 133/348 (38%), Gaps = 59/348 (16%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T D K+L A F D+ G+ + +Y + H P+V +G +R
Sbjct: 222 LAKLYLVTGDQKYLDQAKFFLDQ---RGYTSRTDEY-----SQAHKPVVQQDEAVGHAVR 273
Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADT 392
+TGD Y D + Y TGG T+A E + L +
Sbjct: 274 AAYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATAAGEAFGANYELPNM 333
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
S ETC + V+ LF E Y D ER L NG++S + G Y PL
Sbjct: 334 --SAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL 390
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
++ H F CC L IY ++ +V Y+ ++S++ D
Sbjct: 391 -----ESMGQHQRQPWFGCA-CCPSNICRFIPSLPGYIYAVKDKDV---YVNLFMSNTSD 441
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TY 561
K G ++ + W+ + + + +K G +L +R+P W TY
Sbjct: 442 LKVGGKAVSIEQTTKYPWNGDITIGI---NKNSAGPF-NLKVRIPGWVRGQVVPSDLYTY 497
Query: 562 SNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
S+G + +NG+ + + RW DK+ + + RT
Sbjct: 498 SDGKRLKYTVKVNGEAVQNELKDGYFCIDRRWKKGDKVEVHFDMEPRT 545
>gi|409439808|ref|ZP_11266847.1| conserved hypothetical protein [Rhizobium mesoamericanum STM3625]
gi|408748645|emb|CCM78028.1| conserved hypothetical protein [Rhizobium mesoamericanum STM3625]
Length = 637
Score = 46.6 bits (109), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 79/348 (22%), Positives = 133/348 (38%), Gaps = 53/348 (15%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFHANT------HIPI 339
L +L +T + K+L LA F +P F A++ + FH T H P+
Sbjct: 198 ALVKLARVTGEKKYLDLAKFFIDERGTEPHFFTEEAIRDGRSAADFHQKTYEYGQAHQPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG +A
Sbjct: 258 REQKKVVGHAVRAMYLYSGMADIATEYDDDSLTGALETLWDDLTT-KQMYVTGGIGPAAA 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + S ETC + ++ + + YAD E+AL NG ++
Sbjct: 317 NEGFTDYYDLPNE--SAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GL 373
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
+ Y PL A H W ++ CC + +G +Y E +
Sbjct: 374 SLDGKKFFYENPL----ESAGKHHRW--IWHHCPCCPPNIARLLASIGSYMYGVAEDEI- 426
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
+++ + F V L QK W +R+ + ++ L +++LR+P W
Sbjct: 427 AVHLYGEGRARFKIGGTDVELTQKTR--YPWHGAVRLDIKLNAP----VLFAISLRIPEW 480
Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRT 605
+NGA ++NG+ + L + W DK+ + +PL R
Sbjct: 481 --ANGATLAVNGEAIDLGSADVDGYARIEREWRDGDKIDLNIPLETRA 526
>gi|409098498|ref|ZP_11218522.1| hypothetical protein PagrP_08844 [Pedobacter agri PB92]
Length = 673
Score = 46.2 bits (108), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 108/480 (22%), Positives = 185/480 (38%), Gaps = 79/480 (16%)
Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF----PTELFDSFEALKPV 225
L A A ++AST N + M + + + Q + G Y A T + F+ +
Sbjct: 107 LEAVASLYASTKNPKLNAMMDKAIVVIGKSQREDGYIYTKAMIEQRKTGSNNQFQD-RLS 165
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
+ Y H + AG + Y L +A +Y YN + ++ R+ +
Sbjct: 166 FESYNIGHLMTAGCI-HYRATGKTTLLNIAKKATDYLYNFYKSASP--TLARNAICPSHY 222
Query: 286 TGGMNDVLYRLYSITHDPKHLLLA-HLF-----------DKPCFLGFLALQADYLSHFHA 333
G + +Y T+DP++L LA HL D + FL Q + H
Sbjct: 223 MG-----VVEMYRTTNDPRYLELAQHLIAIKGKIDDGTDDNQDRIPFLQ-QTKAMGHAVR 276
Query: 334 NTHIPIVIGSQMRYEVTG-DPLYKLIGTFFMDIVNASHSYATGG-------TSAREFWWD 385
+++ G Y TG D L + + D+ N Y TGG TS ++
Sbjct: 277 ASYL--YAGVADLYAETGKDSLLNTLNLMWNDVQN-HKMYITGGLGSLYDGTSPDGTSYN 333
Query: 386 P---KRLADTLGSE--------NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
P +++ G + + ETC + + + + T + YAD E AL N V
Sbjct: 334 PVDVQKIHQAFGRDYQLPNFTAHNETCANIGNMLWNWRMLQITGDAKYADVMELALHNSV 393
Query: 435 LS-IQRG------TEPGVMIYMLPLGRGVSKARSTH-GWGTKFNSFWCCYGTGIESFSKL 486
LS I T P LP + SK R + G CC + + +++
Sbjct: 394 LSGISLDGKNFLYTNPLAQSNDLPFKQRWSKDRVPYIGLSN------CCPPNVVRTIAEV 447
Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWK---SGHVVLNQKVDPIVSWDPYLRMTLTFSSK 543
D Y GL+ Y ++ K + L+++ + WD +++++
Sbjct: 448 SDYAYSVSN---KGLWFNLYGGNNLTTKLADGSKISLSEETN--YPWDGNIKISV----- 497
Query: 544 QEVGQLS-SLNLRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPL 601
+E+G + S+ LR+P WT AQ S+NG+ + G + W D + + LP+
Sbjct: 498 KEIGNKAYSVFLRIPAWT--QNAQISINGKPENIKAISGTYAEINRVWKKGDIIELNLPM 555
>gi|393782812|ref|ZP_10370994.1| hypothetical protein HMPREF1071_01862 [Bacteroides salyersiae
CL02T12C01]
gi|392672197|gb|EIY65667.1| hypothetical protein HMPREF1071_01862 [Bacteroides salyersiae
CL02T12C01]
Length = 675
Score = 46.2 bits (108), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 88/397 (22%), Positives = 150/397 (37%), Gaps = 42/397 (10%)
Query: 226 WAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEE 285
W P + KI+ QY A + ++ T+M YF ++++ + ++R W +
Sbjct: 155 WWPKMVVLKIM----QQYYSATGDE--RVITFMTNYFKYQLEQ-LPQNPLDR-WTHWGKF 206
Query: 286 TGGMN-DVLYRLYSITHDPKHLLLAHLFDKP------CFLGFLALQADYLSHF--HANTH 336
GG N V+Y LY+IT D L L L + FL L + H A
Sbjct: 207 RGGDNLMVIYWLYNITGDKFLLELGDLVHQQTLDWTNVFLEGTQLMTQHSLHTVNLAQGF 266
Query: 337 IPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
VI Q Y+ K +++ + + TG + E R D ++
Sbjct: 267 KEPVIYYQRDYDRKRIDAVKKAS----EVIRNTIGFPTGIWAGDEL----IRFGDP--TQ 316
Query: 397 NEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR-- 454
E C M+ + T + +AD ER N L Q V Y + +
Sbjct: 317 GSELCAAVEMMFSLEKMLEITGDTQWADQLERIAYNA-LPTQVDDNCSVRQYYQQVNQIK 375
Query: 455 -------GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
V+ T F CC + + KL +++F N G+ + Y
Sbjct: 376 VSYEPRTFVTPHSHTGNLFGVLAGFPCCTSNLHQGWPKLVQNLWFATYDN--GIAALVYA 433
Query: 508 SSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
S K +G+V ++ + + +D +R + F K+ +LR+P W
Sbjct: 434 PSKVTAKVAGNVTVDIEENTGYPFDEIIRFKMNFPDKKARTARFPFHLRIPEW--CEKPV 491
Query: 567 ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL 603
+NG+ + P N W ND++T++LP+S+
Sbjct: 492 IRVNGEVVSCVPVANIAVLERTWKSNDEVTLELPMSV 528
>gi|312135914|ref|YP_004003252.1| hypothetical protein Calow_1923 [Caldicellulosiruptor owensensis
OL]
gi|311775965|gb|ADQ05452.1| protein of unknown function DUF1680 [Caldicellulosiruptor
owensensis OL]
Length = 652
Score = 46.2 bits (108), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 86/387 (22%), Positives = 141/387 (36%), Gaps = 64/387 (16%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL--------------GFLALQADYLSHFHA 333
L +LY +T D K+L L+ F +P + GF L +YL
Sbjct: 200 LVKLYEVTGDRKYLELSKFFVDERGQEPYYFDIEYEERGKKSHWNGFKGLGREYLQAHKP 259
Query: 334 NTHIPIVIGSQMR----YEVTGD--------PLYKLIGTFFMDIVNASHSYATG--GTSA 379
+G +R Y D L+ + T F DIVN Y TG G+SA
Sbjct: 260 LRQQREAVGHAVRAVYLYSGAADVAAYTHDKELFDVCKTLFNDIVNRK-MYITGAIGSSA 318
Query: 380 R----EFWWD-PKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
F +D P A ETC + ++ + L R Y D ERAL N V
Sbjct: 319 HGEAFTFEYDLPNDAAYA------ETCASVGLIFFAHRLNRIEPHAKYYDAVERALYNTV 372
Query: 435 LS--IQRGTEPGVMIYMLPLG---RGVSK---ARSTHGWGTKFNSFWCCYGTGIESFSKL 486
+ Q G + Y+ PL + V K R + CC + L
Sbjct: 373 IGSMSQDGKK---YFYVNPLEVYPKEVEKRFDRRHVKPERQPWFGCACCPPNVARLLASL 429
Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV 546
G IY N +Y+ YI SS + G + + + ++ +++ L S +
Sbjct: 430 GRYIY---SYNQEEIYVNLYIGSSVQVEVGSAKVLLQQESGYPFEDMVKIDLKTSKEARF 486
Query: 547 GQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
L LR+P W + + + P G ++ W+ N+++ +++P ++
Sbjct: 487 ----KLYLRIPSWCEKYEVYVNEKKEEMQKLPSG-YVCIERLWTENNQVVLKIPTEVKMV 541
Query: 607 AIQDDRPEYASIQAILFGPYLLAGHTS 633
+ S A++ GP + +
Sbjct: 542 SSHPQVRSNVSKVAVVKGPVVFCAEEA 568
>gi|365837320|ref|ZP_09378689.1| hypothetical protein HMPREF0454_03564 [Hafnia alvei ATCC 51873]
gi|364562052|gb|EHM39922.1| hypothetical protein HMPREF0454_03564 [Hafnia alvei ATCC 51873]
Length = 665
Score = 46.2 bits (108), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 69/302 (22%), Positives = 111/302 (36%), Gaps = 44/302 (14%)
Query: 321 LALQADYLSHFHANTHIPIVIGSQMRYEVTGDPLYKLIGTFFMDIVNASHSYATGG---- 376
LALQ + H A + ++ G + D + I + + Y TGG
Sbjct: 275 LALQQSAIGH--AVRFVYLLAGVAHLARLNNDEEKRQICLRLWNNMVQRQLYITGGIGSQ 332
Query: 377 TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
+S F D DT+ +E +C + ++ + + + + YAD ERAL N VL
Sbjct: 333 SSGEAFSSDYDLPNDTVYAE---SCASIGLMMFANRMLQMEGDSQYADVMERALYNTVLG 389
Query: 437 IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGIES 482
+ Y+ PL H FN + CC
Sbjct: 390 -GMALDGRHFFYVNPL--------EVHPKSIPFNHIYDHVKPIRQRWFGCACCPPNIARI 440
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
+ +G IY + LYI Y+ + +G L + WD +++ +
Sbjct: 441 LTSIGHYIYTQRSD---ALYINLYVGNETHLDNG---LKIAISGNYPWDE--NVSVHIRT 492
Query: 543 KQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
++ + Q +L LRMP W Q LNG+ +L T W D+L I LP+
Sbjct: 493 EKPLHQ--TLALRMPEWCEKPSVQ--LNGKTCEGLLKRGYLHITREWHDGDRLEIVLPMP 548
Query: 603 LR 604
+R
Sbjct: 549 VR 550
>gi|330996652|ref|ZP_08320530.1| hypothetical protein HMPREF9442_01617 [Paraprevotella xylaniphila
YIT 11841]
gi|329572724|gb|EGG54357.1| hypothetical protein HMPREF9442_01617 [Paraprevotella xylaniphila
YIT 11841]
Length = 816
Score = 46.2 bits (108), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 92/444 (20%), Positives = 158/444 (35%), Gaps = 102/444 (22%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKP---CFLGFLALQADYLSHFHANTHIPI-----VIGS 343
L +LY +T ++L A F + C G + + ++ H PI ++G
Sbjct: 221 ALAKLYKVTGKEEYLRTARYFVEETGRCTDG-------HAPNAYSQDHKPILEQDEIVGH 273
Query: 344 QMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADT 392
+R TGD Y T + + Y TGG +R +
Sbjct: 274 AVRAGYLYSGVADVAAQTGDTAYFHALTRIWENMAGRKLYITGGIGSRA-------QGEG 326
Query: 393 LGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
G + E ETC + + + +F T + Y D ERAL NGV+S G
Sbjct: 327 FGPDYELNNHTAYCETCASIANVYWNHRMFLATGDSRYEDILERALYNGVIS-------G 379
Query: 445 VMI------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV 498
V + Y PL ++ HG F CC G + + + +Y + +V
Sbjct: 380 VSLSGDRFFYDNPL-----ESMGQHGRQAWFGCA-CCPGNVTRFMASVPNYMYATQGKDV 433
Query: 499 PGLYIIQYISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
++ YI S S + + Q D WD +R+ + KQ +L R+
Sbjct: 434 ---FVNLYIQSTASLSTSQNKIEIRQTTD--YPWDGNIRLAVHPEKKQTF----ALRCRI 484
Query: 557 PVWTY--------------SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
P W G +NG+++ + +W D + + P+
Sbjct: 485 PGWAQGRPVPTDLYHYTGKGKGYTIQVNGKDVDFHVENGYAVILRKWKKGDTVQLDFPMD 544
Query: 603 L-RTEA---IQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFN 658
+ R EA ++DDR + AI GP + + D S + + + P+
Sbjct: 545 VRRVEARVEVEDDRGK----AAIERGPIVYCIEDKDQPD---------SLIFNKVIPAGT 591
Query: 659 AQLVTFTQESGNSTFVMSNSNQSI 682
A T+ + N + + Q++
Sbjct: 592 AISATYAPDMLNGIVTLEGTAQAV 615
>gi|255691741|ref|ZP_05415416.1| putative cytoplasmic protein [Bacteroides finegoldii DSM 17565]
gi|260622626|gb|EEX45497.1| hypothetical protein BACFIN_06788 [Bacteroides finegoldii DSM
17565]
Length = 700
Score = 46.2 bits (108), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 86/347 (24%), Positives = 140/347 (40%), Gaps = 47/347 (13%)
Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
+ +Y T +P++L L+ +L D + + + Y + HA + G
Sbjct: 250 VVEMYRATGNPRYLELSKNLIDIRGMVENGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 309
Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
Y TG+ L K + + + DIV Y TG GTS ++P +++ +
Sbjct: 310 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 368
Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
G + + ETC + + + T + YAD E L N VLS +
Sbjct: 369 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 427
Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
Y PL R + T W T++ S +CC + + + + Y EG
Sbjct: 428 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 486
Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
LY ++++ WK G V L Q+ D WD +R+TL ++ G SL LR+P W
Sbjct: 487 LYGANTLTTT--WKEKGEVALTQETD--YPWDGNIRVTLD-KVPRKAGTF-SLFLRIPEW 540
Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
A +NGQ L + N + R W D +L + +P+ L
Sbjct: 541 --CEKATLRVNGQPLQVNAKANSYAEVNRAWKKGDVVELVMDMPVRL 585
>gi|332882007|ref|ZP_08449642.1| hypothetical protein HMPREF9074_05436 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357048165|ref|ZP_09109719.1| hypothetical protein HMPREF9441_03768 [Paraprevotella clara YIT
11840]
gi|332679931|gb|EGJ52893.1| hypothetical protein HMPREF9074_05436 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355528748|gb|EHG98226.1| hypothetical protein HMPREF9441_03768 [Paraprevotella clara YIT
11840]
Length = 800
Score = 45.8 bits (107), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 82/355 (23%), Positives = 132/355 (37%), Gaps = 73/355 (20%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T D K+L A F D+ G + + Y + H P+V +G +R
Sbjct: 221 LAKLYIVTGDQKYLDEAKFFLDQ---RGHTSRRDAY-----SQAHKPVVEQDEAVGHAVR 272
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+TGD Y D + Y TGG A + G+
Sbjct: 273 ATYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATAN-------GEAFGA 325
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + V+ LF E Y D ER L NG++S + G
Sbjct: 326 NYELPNMSAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFF 384
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y PL ++R H F CC L +Y ++ +V Y+ ++
Sbjct: 385 YPNPL-----ESRGQHQRQPWFGCA-CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFM 435
Query: 508 SSSFDWKSGH--VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT----- 560
S+ + + G VVL Q+ WD + +++ K +VG ++ +R+P W
Sbjct: 436 SNEANLEVGKKSVVLEQQTR--YPWDGDVAVSV---KKNKVGAF-AMKIRIPGWVRGQVV 489
Query: 561 ------YSNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
YS+G + +NGQ + + + RW DK+ + + R
Sbjct: 490 PSDLYRYSDGKRLGYSVKVNGQPVESELQDGYFTIERRWKKGDKVEVHFDMEPRV 544
>gi|405383237|ref|ZP_11037007.1| hypothetical protein PMI11_07039 [Rhizobium sp. CF142]
gi|397320335|gb|EJJ24773.1| hypothetical protein PMI11_07039 [Rhizobium sp. CF142]
Length = 643
Score = 45.8 bits (107), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 100/481 (20%), Positives = 185/481 (38%), Gaps = 83/481 (17%)
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-----PTELFDSFEA 221
G ++ A++ + N I+ K+ +V L Q + GYL+++ P + + +
Sbjct: 88 GKWIEAASYTLKNNPNPDIEAKIDAIVEKLEHGQ--MADGYLNSWFIRREPEKRWTNLRD 145
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER---H 278
L + Y++ +L G + + + L + V++ +I + E
Sbjct: 146 LHEM----YSMGHLLEGAVAYFEATGKRRFLNVMIRAVDH-------IIDTFGREPGKLR 194
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL 328
Y +EE + L +LY +T DP+HL LA F P + A + A Y+
Sbjct: 195 GYDAHEE---IELALVKLYRVTKDPRHLDLAIYFVDERGQMPSYYDEEARKRGEDPASYV 251
Query: 329 --SHFHANTHIPI-----VIGSQMR------------YEVTGDPLYKLIGTFFMDIVNAS 369
++ ++ H+P+ V+G +R +E + L G F ++V
Sbjct: 252 FQTYAYSQAHMPVREQTQVVGHAVRAMYLFSAMADLAFENDDESLKSACGRLFDNLV-GR 310
Query: 370 HSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYY 426
Y TGG +++ E + L + + ETC + S + + + + D
Sbjct: 311 QLYVTGGLGPSASNEGFTREYDLPNE--TAYAETCAAVALGFFSHRMAQIELDSKFTDKL 368
Query: 427 ERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF- 483
E L NG LS I R + +L +HG ++ +C C T I F
Sbjct: 369 ETVLYNGALSGISRDGQHYFYENVL----------ESHGQNRRWKWHYCPCCPTNIARFI 418
Query: 484 SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK 543
+ LG Y V + I Y ++ + G+ L K W+ + ++L
Sbjct: 419 TSLGQYFY---STKVDEVAIHLYGENAAELTVGNSFLRLKQKTEYPWNGDVGISLGLDQP 475
Query: 544 QEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND--KLTIQLPL 601
+ +L LR+P W A+A +NG+ + L + W D +L +P+
Sbjct: 476 KRF----TLRLRIPGWC--RDAKALVNGEAIKLNVSKGYAPIEREWKDGDEVRLAFDMPV 529
Query: 602 S 602
Sbjct: 530 D 530
>gi|423344367|ref|ZP_17322079.1| hypothetical protein HMPREF1077_03509 [Parabacteroides johnsonii
CL02T12C29]
gi|409212765|gb|EKN05799.1| hypothetical protein HMPREF1077_03509 [Parabacteroides johnsonii
CL02T12C29]
Length = 816
Score = 45.8 bits (107), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 87/388 (22%), Positives = 138/388 (35%), Gaps = 83/388 (21%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMRY 347
L +LY +T D K+L +A F + G + + S H+PI ++G +R
Sbjct: 219 LAKLYKVTRDRKYLDMAKYFVEETGRGTDGHRLNAYSQ----DHMPILQQEEIVGHAVRA 274
Query: 348 -----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
+T D Y D + Y TGG +R + G E
Sbjct: 275 GYLYSGVADVAALTKDTAYFHAICRIWDNMATKKLYITGGIGSRA-------QGEGFGPE 327
Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI- 447
E ETC + + ++ +F T + Y D ERAL NGV+S GV +
Sbjct: 328 YELHNHSAYCETCASIANVYWNQRMFLATGDAKYIDVLERALYNGVIS-------GVSLS 380
Query: 448 -----YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
Y PL ++ H F CC G + F + +GN LY
Sbjct: 381 GDKFFYDNPL-----ESMGQHERAPWFGCA-CCPGN-VTRFMASVPKYMYATQGN--SLY 431
Query: 503 IIQYISSS--FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
+ Y+ S + V L Q + WD +++T++ SL LR+P WT
Sbjct: 432 VNLYVGSESRVALANDTVTLVQNTE--YPWDGLVKLTVSPRKASSF----SLKLRIPSWT 485
Query: 561 YSNGAQAS----------------LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+ S +NG L ++ W D + +++P+ +R
Sbjct: 486 GNEPVPGSDLYTYIKRDREPCAVFVNGTPLKEKAHHGYVVIEREWEPGDVIELRMPMDVR 545
Query: 605 TEAIQDDRPEYASIQAILFGP--YLLAG 630
+ + A+ GP Y L G
Sbjct: 546 RVKAHEKVRADQGLLAVERGPVVYCLEG 573
>gi|374985914|ref|YP_004961409.1| hypothetical protein SBI_03157 [Streptomyces bingchenggensis BCW-1]
gi|297156566|gb|ADI06278.1| hypothetical protein SBI_03157 [Streptomyces bingchenggensis BCW-1]
Length = 644
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 81/366 (22%), Positives = 134/366 (36%), Gaps = 57/366 (15%)
Query: 275 VERHWYSLNEETGGMNDV---LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF 331
VER+ + G +V L LY T D ++L A LF G + + ++F
Sbjct: 173 VERYGPQGEDAVCGHPEVEMALVELYRETGDERYLTQARLFVDRRGRGTVPSRGMGSAYF 232
Query: 332 HAN---THIPIVIGSQMR-----------YEVTGD-PLYKLIGTFFMDIVNASHSYATGG 376
+ +P V G +R + TGD L + + D+V A+ Y TGG
Sbjct: 233 QDHLPLRELPSVTGHAVRMAYLAAGATDVFLETGDRTLLDALRRLWDDMV-ATKLYVTGG 291
Query: 377 TSAREFWWDPKRLADT--LGSENE--ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTN 432
+R + + D L SE ETC ++ + +F T + Y D ER L N
Sbjct: 292 LGSRH---SDEAVGDRYELPSERSYSETCAAIGTMQWAWRMFLATGDARYPDVLERVLYN 348
Query: 433 GVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG--WGTKFNSFW----CCYGTGIESFSKL 486
++ + Y PL R + + G W CC + ++L
Sbjct: 349 -AFAVGLSADGRAFFYDNPLQRRPDHEQRSGAEEGGEPLRQAWFSCPCCPPNVVRWMAQL 407
Query: 487 GDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEV 546
D + E G L + Y + D L+ WD +R+T+ + +
Sbjct: 408 ADFLVAERPGE---LLVAGYAQAGVD--GAEAALDMATG--YPWDGEVRLTVRRAPDEPY 460
Query: 547 GQLSSLNLRMPVW--------TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
++LR+P W T + + G +L+ RW D+L +
Sbjct: 461 ----RISLRVPGWADPGQVRLTVGTAGEETAAGDV-----SDGWLTVERRWRPGDELRLS 511
Query: 599 LPLSLR 604
LP+ +R
Sbjct: 512 LPMPVR 517
>gi|393780984|ref|ZP_10369185.1| hypothetical protein HMPREF1071_00053 [Bacteroides salyersiae
CL02T12C01]
gi|392677319|gb|EIY70736.1| hypothetical protein HMPREF1071_00053 [Bacteroides salyersiae
CL02T12C01]
Length = 672
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 141/368 (38%), Gaps = 74/368 (20%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T D K+L A F D + G ++ H P++ +G +R
Sbjct: 222 LVKLYLVTGDRKYLDQAKFFLDARGYTG--------RKDAYSQAHKPVIEQDEAVGHAVR 273
Query: 347 Y-----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLAD 391
+TGD Y K I + +IV + Y TGG AR E + D L +
Sbjct: 274 AVYMYSGMADVAAITGDSSYIKAIDRIWDNIV-SKKMYITGGIGARHQGEAFGDNYELPN 332
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
S ETC + ++ LF + Y D ER L NG++S + G Y P
Sbjct: 333 L--SAYCETCAAIGSVYMNYRLFLLHGDAKYFDVLERTLYNGLIS-GVSLDGGSFFYPNP 389
Query: 452 LGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISS 509
L +R W F C C + I F L +Y ++ V Y+ ++S+
Sbjct: 390 LASDGGYSRKP--W------FGCACCPSNISRFIPSLPGYVYAVKDRQV---YVNLFLSN 438
Query: 510 SFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT------- 560
+ K VVL Q+ W +R+ + Q G +N+R+P W
Sbjct: 439 RAELKVNDKKVVLEQETS--YPWKGDIRLKV-LQGNQPFG----MNVRIPGWVRGSVLPS 491
Query: 561 ----YSNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR----TEAI 608
Y++ Q + +NGQ + +L+ +W ND + I + R E +
Sbjct: 492 DLYAYADHQQPAYRVMVNGQEVEGELHNGYLTIDRKWKKNDVVEIHFDMLPRLVKANEKV 551
Query: 609 QDDRPEYA 616
DR A
Sbjct: 552 AADRGRVA 559
>gi|380693440|ref|ZP_09858299.1| hypothetical protein BfaeM_05587 [Bacteroides faecis MAJ27]
gi|380693449|ref|ZP_09858308.1| hypothetical protein BfaeM_05644 [Bacteroides faecis MAJ27]
Length = 668
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 81/356 (22%), Positives = 135/356 (37%), Gaps = 78/356 (21%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
L +LY +T D K+L A F G+ + + Y + H P+V +G +R
Sbjct: 219 LVKLYMVTGDKKYLDQAKFFLDT--RGYTSRKDAY-----SQAHKPVVEQDEAVGHAVRA 271
Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+TGD Y K I + +IV + Y TGG AR + G+
Sbjct: 272 VYMYSGMADVAAITGDSSYIKAIDKIWDNIV-SKKIYITGGIGARH-------AGEAFGN 323
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + ++ LF + Y D ER L NG++S + G
Sbjct: 324 NYELPNLSAYCETCAAIGNVYMNYRLFLLHGDAKYFDVLERTLYNGLIS-GVSLDGGSFF 382
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQ 505
Y PL S+ G ++ F C C + + F L +Y ++ V Y+
Sbjct: 383 YPNPL--------SSSGKYSRKPWFGCACCPSNVSRFIPSLPGYVYAVKDDQV---YVNL 431
Query: 506 YISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
++S+ + K ++L Q+ D W +R+ + + Q ++ LR+P W N
Sbjct: 432 FLSNKAELKVDKKKIILEQETD--YPWKGDIRLKIA-----QGNQNFTMKLRIPGWVRGN 484
Query: 564 GA---------------QASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+ S+NGQ + +LS +W D + + + R
Sbjct: 485 VLPGDLYAYADNQKPVYRVSVNGQPVESDVNNGYLSIARKWKKGDVVEVHFDMLPR 540
>gi|347530932|ref|YP_004837695.1| hypothetical protein RHOM_03205 [Roseburia hominis A2-183]
gi|345501080|gb|AEN95763.1| hypothetical protein RHOM_03205 [Roseburia hominis A2-183]
Length = 646
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 51/245 (20%), Positives = 99/245 (40%), Gaps = 22/245 (8%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVS 457
ETC + ++ +R++ + K YAD ERAL NG++S +Q + + L + GVS
Sbjct: 336 ETCASIGLVFFARNMLKTEKNGRYADVMERALYNGIISGMQLDGKRFFYVNPLEVNPGVS 395
Query: 458 KARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
+ W CC + + LG + E+E V Y
Sbjct: 396 GEIFGYKHVIPERPGWYACACCPPNLVRMVTSLGKYAWDEDETAV-------YSHLFLGQ 448
Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
++ + +V+ W+ ++T+ ++ +L +L + +P Y + ++NG+
Sbjct: 449 EAALGKADIRVESAYPWEG----SVTYHVSAKIDELFTLAIHIP--AYVKDLRVTVNGEA 502
Query: 574 LPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLA 629
+L + +W +D++ + PL +R E A++ GP Y
Sbjct: 503 FDTAGEIRDGYLYISRKWGSDDQVELHFPLPVRKIYASTHVREDVGCVALMRGPVVYCFE 562
Query: 630 GHTSG 634
G +G
Sbjct: 563 GADNG 567
>gi|261407601|ref|YP_003243842.1| hypothetical protein GYMC10_3802 [Paenibacillus sp. Y412MC10]
gi|261284064|gb|ACX66035.1| protein of unknown function DUF1680 [Paenibacillus sp. Y412MC10]
Length = 626
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 62/300 (20%), Positives = 119/300 (39%), Gaps = 28/300 (9%)
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
YE+ G+P+ + +D + H A G S E+ L+ T S+ E C
Sbjct: 237 YELHGNPVERESVHRGIDSLMTYHGQAHGMFSGDEW------LSGTHPSQGVELCAVVEY 290
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGV--------LSIQRGTEPGVMIYMLPLGRGVSK 458
+ L R E + D E+ N + S Q + MI + R S
Sbjct: 291 MFSMEQLTRIFGEGRFGDILEKVAFNALPAAISADWTSHQYDQQVNQMICNV-APRAWSN 349
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
+ + +G + N F CC + + KL ++ +++ + G+ + Y + G
Sbjct: 350 SPDANVFGLEPN-FGCCTANMHQGWPKLASHLWMKDQED--GVVAVSYAPCTVRTTVGRQ 406
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
++ ++ + R+ + S E + ++LR+P W + +LNG+ +P+
Sbjct: 407 GVSAEIAVTGEYPFKDRIQIHLSL--ERAESFRISLRIPAWC--DHPVITLNGREMPIQA 462
Query: 579 PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDI 638
+ + W D L + LP+ ++TE+ R YA+ +I GP + W +
Sbjct: 463 ESGYAEIMQTWQSGDLLELYLPMEVKTES----RSMYAT--SITRGPLVYVLPVKENWQM 516
>gi|423344366|ref|ZP_17322078.1| hypothetical protein HMPREF1077_03508 [Parabacteroides johnsonii
CL02T12C29]
gi|409212764|gb|EKN05798.1| hypothetical protein HMPREF1077_03508 [Parabacteroides johnsonii
CL02T12C29]
Length = 657
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 82/353 (23%), Positives = 122/353 (34%), Gaps = 69/353 (19%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T K+L LA F DK G+ + Y + H P++ +G +R
Sbjct: 219 LCKLYLVTGQKKYLDLAKFFLDK---RGYTERKDAY-----SQAHKPVLEQDEAVGHAVR 270
Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+TGD Y + V Y TGG A + G
Sbjct: 271 AAYMYSGMADVAALTGDTGYVHAIDRIWENVVTKKLYITGGIGATNN-------GEAFGK 323
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + LF E Y D ER L NG++S E
Sbjct: 324 NYELPNLSAYCETCAAIGNVYWNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLEGNGFF 382
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y PL R + CC L IY + NV Y+ ++
Sbjct: 383 YPNPLASTGQHQRKP------WFGCACCPSNICRFIPSLPGYIYAVHDKNV---YVNLFM 433
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT------- 560
S+S D K G L WD +R+ + KQ+ +L +R+P W
Sbjct: 434 SNSSDLKVGGKSLKLTQSTGYPWDGDVRLDMAPKGKQDF----TLKIRVPGWVRGEVVPS 489
Query: 561 ----YSNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
+S+G Q +NG+ + + S T +W D + + + RT
Sbjct: 490 DLYMFSDGKQLGYSVKVNGEPVESNLDKGYFSITRQWKKGDVVEVHFDMEPRT 542
>gi|389844758|ref|YP_006346838.1| hypothetical protein Theba_1950 [Mesotoga prima MesG1.Ag.4.2]
gi|387859504|gb|AFK07595.1| hypothetical protein Theba_1950 [Mesotoga prima MesG1.Ag.4.2]
Length = 621
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 95/496 (19%), Positives = 194/496 (39%), Gaps = 52/496 (10%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF--PTELFDSFEALK 223
V ++ A++ + + I+ ++ +++ + + Q G GY++ + + + ++ LK
Sbjct: 78 VYKWIEAASYSLSYNEDPEIRARIESLITLIEKAQEISGDGYINTYFVGQKAGERWKDLK 137
Query: 224 PVWAPYYTIHKILAGLLDQYVLADNA---QALKMATWMVEYFYNRVQKVITMY-SVERHW 279
+ Y H I AG+ ++ D + A +++ F + KV T + +E
Sbjct: 138 NMHELYCAGHLIQAGIANKRASGDETLFKVCVSAADNILDSFRDDDCKVTTGHPELEMAM 197
Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHF-------- 331
L+ ETG + +L A + G++ ++ H
Sbjct: 198 IELHRETGNRD--------------YLKFAQMLIDNRGRGYVGGDEYHIDHVPFRELKEL 243
Query: 332 --HANTHIPIVIGSQMRYEVTGDP-LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKR 388
HA + ++ G+ + TGD L ++ ++D+ A Y TGG +R + +
Sbjct: 244 TGHAVRMLYLLAGAADIFLETGDETLLAVLERLWIDLT-ARKMYVTGGAGSR-YEGESFG 301
Query: 389 LADTLGSENE--ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
L S ETC + + ++ + + Y D +E++ NGVLS +
Sbjct: 302 EEFELPSRRAYAETCAAVGNVFWNWRMYMISGDAKYLDLFEQSFYNGVLS-GISLDGKRY 360
Query: 447 IYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
Y+ PL + R ++ CC + G IY + + +
Sbjct: 361 FYVNPLEDAGKRERE------EWFECACCPPNIARLLTSFGGYIYGTTLNEIR-VNFYEE 413
Query: 507 ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQ 566
++ ++ G V + QK S+ + LT ++ + +LS L LR+P WT +
Sbjct: 414 SKATIPFRDGEVSIIQK----TSYPHSEEVQLTVATDLDT-ELSIL-LRIPEWTEGE-FE 466
Query: 567 ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP- 625
++G L P F+ W ++ + LP+ +R E ++ GP
Sbjct: 467 VQVDGIKQKLRPEKGFVRLEGNWKGKTEVYLALPMRIRLMTANPLLRENTDKVSVQRGPL 526
Query: 626 -YLLAGHTSGEWDIKT 640
Y G + ++D++T
Sbjct: 527 VYCAEGVDNPDFDVRT 542
>gi|334121751|ref|ZP_08495800.1| protein of hypothetical function DUF1680 [Enterobacter hormaechei
ATCC 49162]
gi|333392772|gb|EGK63868.1| protein of hypothetical function DUF1680 [Enterobacter hormaechei
ATCC 49162]
Length = 657
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 58/251 (23%), Positives = 90/251 (35%), Gaps = 39/251 (15%)
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 314 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADGHYADVME 370
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H FN +
Sbjct: 371 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIYDHVKPVRQRWFGCA 421
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + LG IY L I Y+ + G +L ++ W
Sbjct: 422 CCPPNIARVLTSLGHYIYTVRPD---ALLINLYVGNDVAIPVGDNILQLRISGNYPWHEQ 478
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
+++ +T V +L LR+P W SLNG+ + +L W D
Sbjct: 479 VKIEIT----SPVPVTHTLALRLPDWCAEPA--VSLNGEAITGEVSRGYLYLNRSWQEGD 532
Query: 594 KLTIQLPLSLR 604
L++ LP+ +R
Sbjct: 533 TLSLTLPMPVR 543
>gi|298374271|ref|ZP_06984229.1| cytoplasmic protein [Bacteroides sp. 3_1_19]
gi|301307792|ref|ZP_07213748.1| putative cytoplasmic protein [Bacteroides sp. 20_3]
gi|423337089|ref|ZP_17314833.1| hypothetical protein HMPREF1059_00758 [Parabacteroides distasonis
CL09T03C24]
gi|298268639|gb|EFI10294.1| cytoplasmic protein [Bacteroides sp. 3_1_19]
gi|300834135|gb|EFK64749.1| putative cytoplasmic protein [Bacteroides sp. 20_3]
gi|409238277|gb|EKN31070.1| hypothetical protein HMPREF1059_00758 [Parabacteroides distasonis
CL09T03C24]
Length = 618
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 52/231 (22%), Positives = 98/231 (42%), Gaps = 25/231 (10%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPL-GRGV 456
ETC + M+ ++ + + T + Y D ER+L NG L+ I G + Y+ PL +G
Sbjct: 336 ETCASVGMVLWNQRMNQLTGDSKYIDILERSLYNGALAGISLGGDR--FFYVNPLESKGD 393
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
+ +G CC +G+ IY + L++ YI ++ + G
Sbjct: 394 HHRQEWYGCA-------CCPSQLSRFLPSIGNYIYASSD---DALWVNLYIGNTGQIRIG 443
Query: 517 H--VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
++L Q+ D WD +++T++ S E + LR+P W + S+NG+ +
Sbjct: 444 ETDILLTQETD--YPWDGSVKLTISTSQPLE----KEIRLRIPNWCKT--YDLSINGKRI 495
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+ + + + W D + + + + + A E +AI GP
Sbjct: 496 NVSEKKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFGKRAIQRGP 545
>gi|333378296|ref|ZP_08470027.1| hypothetical protein HMPREF9456_01622 [Dysgonomonas mossii DSM
22836]
gi|332883272|gb|EGK03555.1| hypothetical protein HMPREF9456_01622 [Dysgonomonas mossii DSM
22836]
Length = 826
Score = 45.4 bits (106), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 84/390 (21%), Positives = 148/390 (37%), Gaps = 83/390 (21%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMRY 347
L +LY +T DP +L +A F + ++ +S +A H P+ +G +R
Sbjct: 226 LVKLYRVTGDPLYLNMAKKFIDIRGVTYVPDGKGTMSPEYAQQHAPVREQDKAVGHAVRA 285
Query: 348 -----------EVTGDP-LYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+TGD L + + +IV+ + + TGG A + G
Sbjct: 286 VYLYSGMSDVGTLTGDTTLSPALDKIWGNIVD-TRMHITGGLGAIHG-------IEGFGP 337
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E E ETC + + +F K+ Y D E +L N VL+ E
Sbjct: 338 EYELPNKEAYNETCAAVGNVFFNHRMFLLEKDGKYMDVAEVSLLNNVLA-GVNLEGNKFF 396
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
Y+ PL GT S+W CC ++ +Y + + +
Sbjct: 397 YVNPLASD----------GTVDRSYWFGTACCPTNLARLIPQISGLMYAHTDNEI---FC 443
Query: 504 IQYISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
Y S D+ SG V L QK + P+ + + ++ Q S+ +R+P W
Sbjct: 444 SFYTGSKVDFALTSGKVALEQKTNY-----PFDESIVLTVNPEKNDQTFSIKMRIPTWVG 498
Query: 562 S------------NGAQA-----------SLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
S N ++A +L+ + + F+S + +W DK+ ++
Sbjct: 499 SQFVPGKLYSYVDNNSKAWELYINDKKVGNLSFKKGEVSLDKGFVSISRKWKKGDKVELK 558
Query: 599 LPLSLR-TEAIQDDRPEYASIQAILFGPYL 627
LP+ +R + AI + + + + AI GP +
Sbjct: 559 LPMPVRYSHAINEVKADNDRV-AITRGPLV 587
>gi|29349082|ref|NP_812585.1| hypothetical protein BT_3674 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383124304|ref|ZP_09944969.1| hypothetical protein BSIG_3668 [Bacteroides sp. 1_1_6]
gi|29340989|gb|AAO78779.1| Six-hairpin glycosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|251839199|gb|EES67283.1| hypothetical protein BSIG_3668 [Bacteroides sp. 1_1_6]
Length = 668
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 82/356 (23%), Positives = 134/356 (37%), Gaps = 78/356 (21%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
L +LY T D K+L A F G+ + + Y + H P+V +G +R
Sbjct: 219 LVKLYMATGDKKYLDQAKFFLDT--RGYTSRKDTY-----SQAHKPVVEQDEAVGHAVRA 271
Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+TGD Y K I + +IV + Y TGG A + G+
Sbjct: 272 VYMYSGMADVAAITGDSSYIKAIDKIWDNIV-SKKIYITGGIGAH-------HAGEAFGN 323
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + ++ LF + Y D ER L NG++S + G
Sbjct: 324 NYELPNLSAYCETCAAIGNVYMNYRLFLLHGDAKYFDVLERTLYNGLIS-GVSLDGGSFF 382
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQ 505
Y PL S++G ++ F C C + + F L +Y + V Y+
Sbjct: 383 YPNPL--------SSNGKYSRKPWFGCACCPSNVSRFIPSLPGYVYAVKNDQV---YVNL 431
Query: 506 YISSSFDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
Y+S+ + K ++L Q+ W+ +R+ +T + Q ++ LR+P W N
Sbjct: 432 YLSNKAELKVDKKKILLEQETG--YPWNGDIRLKIT-----QGNQDFTMKLRIPGWVRGN 484
Query: 564 ---------------GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
Q S+NGQ + +LS +W D + + + R
Sbjct: 485 VLPGDLYSYADNQKPAYQVSVNGQTVESDVNDGYLSIARKWKKGDVVEVHFDMIPR 540
>gi|430748744|ref|YP_007211652.1| hypothetical protein Theco_0434 [Thermobacillus composti KWC4]
gi|430732709|gb|AGA56654.1| hypothetical protein Theco_0434 [Thermobacillus composti KWC4]
Length = 806
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 81/212 (38%), Gaps = 14/212 (6%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL------ 452
ETC + ++ +R + R YAD ERAL N VL+ + Y+ PL
Sbjct: 323 ETCASIVLIFWARRMLRLEARSEYADVMERALYNTVLA-GMARDGKHFFYVNPLEVWPEA 381
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS-- 510
R K+ CC + L D IY +E +++ YI S
Sbjct: 382 SLKNPDRRHVKPIRQKWFGCSCCPPNVARLLASLDDYIYDIDEA-AGRVHVHLYIGSEAR 440
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
F V L+Q+ + WD + L+ S V +L LR+P W + ++N
Sbjct: 441 FAAAGREVTLHQRSG--LPWDGTVTFGLSVSGGGAV--RLALALRVPDWFQTAEPVLAVN 496
Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
G+ P + W+ D+ +LP+
Sbjct: 497 GEACPYRMEKGYAVVEREWADGDRAEWRLPME 528
>gi|299141574|ref|ZP_07034710.1| hypothetical protein HMPREF0665_01155 [Prevotella oris C735]
gi|298576910|gb|EFI48780.1| hypothetical protein HMPREF0665_01155 [Prevotella oris C735]
Length = 673
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 72/321 (22%), Positives = 121/321 (37%), Gaps = 48/321 (14%)
Query: 349 VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--------E 399
+TGD Y K I + +I++ + Y TGG AR + + G++ E E
Sbjct: 290 LTGDSAYIKAIDCIWDNILSKKY-YLTGGVGARHY-------GEAFGADYELPNLTAYNE 341
Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA 459
TC ++ LF + Y D ER L NGV+S + G Y PL
Sbjct: 342 TCAAIAQCYLNMRLFMLHGDSKYIDCLERTLYNGVIS-GMSIDGGRFFYPNPLSADGIYK 400
Query: 460 RSTHGWGTKFNSFWC-CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
+ G T+ F C C + + F + GN +Y+ ++ S + K G
Sbjct: 401 FNADGTTTRQPWFGCACCPSNLSRFIPSVPGYVYAVRGN--DVYVNLFMGSKANVKVGGK 458
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT---------YS------N 563
+ + + WD + + + K + +SL +R+P W YS +
Sbjct: 459 EMKIETETNYPWDGKVSICI----KGNANKHASLLVRIPGWARGEVTPGGLYSFTDKQKD 514
Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAIQDDRPEYASIQ 619
G ++NG+N + D +T+ L + RT + + DDR
Sbjct: 515 GWSIAVNGKNRNAEKLEKGYIRIDNVKKGDVITLNLDMEPRTVVADKRVMDDR----GCV 570
Query: 620 AILFGPYLLAGHTSGEWDIKT 640
A+ GP + + +KT
Sbjct: 571 AVERGPLVYCAESVDNNGMKT 591
>gi|116254709|ref|YP_770545.1| hypothetical protein pRL100266 [Rhizobium leguminosarum bv. viciae
3841]
gi|115259357|emb|CAK10492.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 647
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 101/482 (20%), Positives = 175/482 (36%), Gaps = 95/482 (19%)
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-----PTELFDSFEA 221
G ++ A++ NA ++ K+ +V L + Q + GYL+++ P + +
Sbjct: 90 GKWIEAASYTLKVHPNAALEAKIDAIVEKLEKGQ--MADGYLNSWFIRREPDRRWTNLRD 147
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER---H 278
L + Y++ +L G + Y + L + V++ +I + E
Sbjct: 148 LHEM----YSMGHLLEGAVAYYEATGKRRFLDVMIRAVDH-------IIATFGAEPGKLR 196
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQAD------- 326
Y +EE + L +LY +T DP+HL LA F P + A +
Sbjct: 197 GYDAHEE---IELALVKLYRVTRDPRHLKLATYFVDERGRMPSYYDEEARKRGESPDDYV 253
Query: 327 YLSHFHANTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASH 370
Y ++ ++ H+P+ V+G +R DP K D +
Sbjct: 254 YKTYAYSQAHMPVRDQHQVVGHAVRAMYLFSAMADLSHENDDPTLKEACDRLFDNLVGRQ 313
Query: 371 SYATGGTS--------AREFWWDPKRLADTLGSEN--EETCTTYNMLKVSRHLFRWTKEI 420
Y TGG REF L +E ETC + S + + +
Sbjct: 314 LYVTGGLGPSASNEGFTREF---------DLPNETAYAETCAAVALGFWSHRMAQVDLDS 364
Query: 421 AYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGT 478
+ D E L NG LS I R E +L +HG ++ +C C T
Sbjct: 365 KFTDRLETVLYNGALSGISRDGEHYFYENVL----------ESHGQHRRWKWHYCPCCPT 414
Query: 479 GIESF-SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
I F + LG Y ++ L + Y ++S + G+ + + + WD + +
Sbjct: 415 NIARFITSLGQYFYSTDDHQ---LAVHLYGTNSAELTVGNSFVRLIQETLYPWDGDISLR 471
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKL 595
L LR+P W AQ S+NG + L + + + W D++
Sbjct: 472 FAVERPSRF----QLRLRIPGWC--RQAQISVNGVAVDLDQCVTKGYAAISREWRNGDEV 525
Query: 596 TI 597
I
Sbjct: 526 RI 527
>gi|373954097|ref|ZP_09614057.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373890697|gb|EHQ26594.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 800
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 61/246 (24%), Positives = 97/246 (39%), Gaps = 51/246 (20%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
ETC + + +F + Y D ER L NG+LS GV + Y PL
Sbjct: 335 ETCAAIGNVYWNNRMFLLHGDAKYIDVLERTLYNGLLS-------GVSLSGDRFFYPNPL 387
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
RS + S CC L +Y + + + LY+ ++S+S +
Sbjct: 388 ASMFQHQRSA------WISCACCISNMTRFLPSLPGYVYAKNKND---LYVNLFMSNSSN 438
Query: 513 WK--SGHVVLNQKVD------------PIVSWDPYLRMTLTFSSKQE--VGQLSSLNLR- 555
K SG+V + Q+ D P+ + D LR+ + +KQ+ G L S +
Sbjct: 439 IKLASGNVNIVQQTDYPWKGQVDMTINPVKTTDFTLRVRIPGWAKQQPVPGNLYSFMDKT 498
Query: 556 -MPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS----LRTEAIQD 610
+PV Y NG S + + W DK+++ LPL L + ++D
Sbjct: 499 PLPVVIYINGKATSFVTEK-------GYAVLKRNWKKGDKVSLALPLETEKVLANDKVKD 551
Query: 611 DRPEYA 616
DR +A
Sbjct: 552 DRGRFA 557
>gi|332882008|ref|ZP_08449643.1| hypothetical protein HMPREF9074_05437 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357048166|ref|ZP_09109720.1| hypothetical protein HMPREF9441_03769 [Paraprevotella clara YIT
11840]
gi|332679932|gb|EGJ52894.1| hypothetical protein HMPREF9074_05437 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355528749|gb|EHG98227.1| hypothetical protein HMPREF9441_03769 [Paraprevotella clara YIT
11840]
Length = 818
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 90/412 (21%), Positives = 147/412 (35%), Gaps = 84/412 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKP---CFLGFL--ALQADYLSHFHANTHIPIVIGSQMR 346
L +LY +T ++L A F + C G A DY + + + +
Sbjct: 221 ALAKLYKVTGKEEYLRTARYFVEETGRCTDGHAPSAYSQDYKPILEQDEIVGHAVRAGYL 280
Query: 347 YE-------VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE- 398
Y +TGD Y T + + Y TGG +R + G + E
Sbjct: 281 YSGVADVAALTGDTAYFHALTRIWENMAGRKLYLTGGIGSRA-------QGEGFGPDYEL 333
Query: 399 -------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI---- 447
ETC + + + +F T + Y D ERAL NGV+S GV +
Sbjct: 334 NNHTAYCETCASIANVYWNHRMFLATGDSRYEDVLERALYNGVIS-------GVSLSGDR 386
Query: 448 --YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
Y PL ++ H F CC G + + + +Y + +V ++
Sbjct: 387 FFYDNPL-----ESMGQHERQAWFGCA-CCPGNVTRFMASVPNYMYATQGKDV---FVNL 437
Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS--- 562
YI S+ + + + WD +RMT+ KQ +L R+P W
Sbjct: 438 YIQSTAHLSTSQNKIEIRQTTDYPWDGKIRMTVHPEKKQTF----ALRCRIPGWAQDRPV 493
Query: 563 -----------NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSL-RTEA--- 607
G +NG++ + +W D + + P+ + R EA
Sbjct: 494 PTDLYHYTGKGKGYTIQVNGKDAEFRVENGYAVILRKWKKGDTVQLDFPMDVRRVEARGE 553
Query: 608 IQDDRPEYASIQAILFGPYLLAGHTSGEWD-------IKTGTARSLSALISP 652
++DDR + AI GP + + D I TGT ++SA +P
Sbjct: 554 VEDDRGK----AAIERGPIVYCIEDKDQPDSLIFNKFIPTGT--TISATYAP 599
>gi|154495096|ref|ZP_02034101.1| hypothetical protein PARMER_04143 [Parabacteroides merdae ATCC
43184]
gi|423725062|ref|ZP_17699202.1| hypothetical protein HMPREF1078_03096 [Parabacteroides merdae
CL09T00C40]
gi|154085646|gb|EDN84691.1| hypothetical protein PARMER_04143 [Parabacteroides merdae ATCC
43184]
gi|409235418|gb|EKN28236.1| hypothetical protein HMPREF1078_03096 [Parabacteroides merdae
CL09T00C40]
Length = 679
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 87/424 (20%), Positives = 163/424 (38%), Gaps = 43/424 (10%)
Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
++ ++ QY A Q ++ +M YF ++ ++ + W E+ GG N V+
Sbjct: 163 VMLKVMQQYYTA--TQDRRVIDFMTRYFRYQLDELPK--NPLGKWTFWGEQRGGDNLMVV 218
Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
Y LY+IT D L L L K F + L ++L H+ + + G + + Y+
Sbjct: 219 YWLYNITGDKFLLDLGELIHKQTFNWTDIFLNQNHLRRQHSLHCVNLAQGFKEPIVYYQQ 278
Query: 350 TGDPLYKLIGTFFMDIVNASHSYA--TGGTSAREFWWDPKRLADTLGSENEETCTTYNML 407
D K I + + H+ TG W + L + E CT M+
Sbjct: 279 GKDS--KQIQATRQAVNDIRHTIGLPTG------LWGGDELLRFGKPTTGSELCTAVEMM 330
Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
+ T ++ +ADY ER N L Q + Y + ++ R + T
Sbjct: 331 YSLETILEVTGDMQWADYLERVAYNA-LPTQVTDDYSARQYYQQTNQ-IAVTREWREFST 388
Query: 468 ----------KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK-SG 516
+ + CC + + K ++++ N GL + + S + +G
Sbjct: 389 PHDDTDLLFGELTGYPCCTSNLHQGWPKFVQNLWYATADN--GLASLLFAPSQVTARVAG 446
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
+ +N K + ++ +R ++F+ K+ +LR+P W LNG+ L +
Sbjct: 447 GIEVNLKEETAYPFEETVRYHVSFTDKKVKKVFFPFHLRIPGW--CKQPVVKLNGKPLTV 504
Query: 577 PP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
PG W D L+++LP+ + Y + + GP + A + +
Sbjct: 505 DAYPGTVTRINREWKEGDILSLELPMEVTVSRW------YENSAVVERGPLVYALKMNEK 558
Query: 636 WDIK 639
W+ K
Sbjct: 559 WEKK 562
>gi|354725692|ref|ZP_09039907.1| hypothetical protein EmorL2_22781 [Enterobacter mori LMG 25706]
Length = 649
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 57/251 (22%), Positives = 93/251 (37%), Gaps = 39/251 (15%)
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ +R + + YAD E
Sbjct: 306 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADGHYADVME 362
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H FN +
Sbjct: 363 RALYNTVLG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIFDHVKPVRQRWFGCA 413
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + LG IY + L+I Y+ + G L ++ W
Sbjct: 414 CCPPNIARVLTSLGHYIYTVRQD---ALFINLYVGNDVAIPVGDETLALRISGNYPWHEQ 470
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
+++ +T ++ +L LR+P W + LNG+ + +L T W D
Sbjct: 471 VKIDITSTAP----VTHTLALRLPDWGAT--PDVLLNGEAVTGEISRGYLYLTRSWQEGD 524
Query: 594 KLTIQLPLSLR 604
+T+ LP+ +R
Sbjct: 525 VITLTLPMPVR 535
>gi|241554299|ref|YP_002979512.1| hypothetical protein Rleg_6525 [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|240863605|gb|ACS61267.1| protein of unknown function DUF1680 [Rhizobium leguminosarum bv.
trifolii WSM1325]
Length = 647
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 105/482 (21%), Positives = 180/482 (37%), Gaps = 95/482 (19%)
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-----PTELFDSFEA 221
G ++ A++ + NA ++ K+ +V L + Q + GYL+++ P + +
Sbjct: 90 GKWIEAASYTLKAHPNAALETKIDAIVEKLEKGQ--MADGYLNSWFIRREPDRRWTNLRD 147
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER---H 278
L + Y++ +L G + Y + L + V++ +I + E
Sbjct: 148 LHEM----YSMGHLLEGAVAYYEATGKRRFLDVMIRAVDH-------IIETFGAEPGKLR 196
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL 328
Y +EE + L +LY +T DP+HL LA F P + A + DY+
Sbjct: 197 GYDAHEE---IELALVKLYRVTGDPRHLKLATYFVDERGRMPSYYDEEARKRGESPEDYV 253
Query: 329 --SHFHANTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASH 370
++ ++ H+P+ V+G +R DP K D + +
Sbjct: 254 YKTYAYSQAHLPVRDQHQVVGHAVRAMYLFSAMADLSRENDDPTLKEACDRLFDNLVSRQ 313
Query: 371 SYATGGTS--------AREFWWDPKRLADTLGSEN--EETCTTYNMLKVSRHLFRWTKEI 420
Y TGG REF L +E ETC + S + + +
Sbjct: 314 LYVTGGLGPSASNEGFTREF---------DLPNETAYAETCAAVALGFWSHRMAQVDLDS 364
Query: 421 AYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGT 478
+ D E L NG LS I R E +L +HG ++ +C C T
Sbjct: 365 KFTDRLETVLYNGALSGISRDGERYFYENVL----------ESHGQHRRWKWHYCPCCPT 414
Query: 479 GIESF-SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
I F + LG Y ++ + +++ S+ V L QK D LR
Sbjct: 415 NIARFITSLGQYFYSTDDHQL-AVHLYGTNSAELTVGDSFVRLIQKTQYPWDGDISLRFA 473
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKL 595
+ S+ + L LR+P W AQ S+NG + L + + + W D++
Sbjct: 474 VERPSRFQ------LRLRIPGWC--RQAQISVNGVAVDLDQCVTKGYAAISREWRNGDEV 525
Query: 596 TI 597
I
Sbjct: 526 RI 527
>gi|270339568|ref|ZP_06005245.2| conserved hypothetical protein [Prevotella bergensis DSM 17361]
gi|270334558|gb|EFA45344.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
Length = 813
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 66/294 (22%), Positives = 114/294 (38%), Gaps = 47/294 (15%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGV 456
ETC + + + +F T + Y D YERAL NGVLS G E Y PL
Sbjct: 344 ETCASIANVYWNYRMFLATGDAKYVDVYERALYNGVLSGVSLSGKE---FFYDNPLESMG 400
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
AR W F CC G + F + GN +++ YI D
Sbjct: 401 QHAR--QAW---FGCA-CCPGN-VTRFVASVPQYQYATRGN--DIFVNLYIQGKADING- 450
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS-------- 568
V L Q + WD + + ++ + ++ R+P W ++ +
Sbjct: 451 -VQLTQTTN--YPWDGNISIQVSPKRRSTF----AIRFRIPGWAHNKPVSTNLYHFIDKA 503
Query: 569 ------LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR----TEAIQDDRPEYASI 618
LNG + ++ + +W D++ I+LP+ +R + ++DDR +
Sbjct: 504 KPYAVKLNGDVVDATLEDGYVVISRKWKKGDRVEIELPMDVRRVQANDNVEDDRGKI--- 560
Query: 619 QAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNA-QLVTFTQESGNS 671
A+ GP + + D + L +PI S+++ +L + +GN+
Sbjct: 561 -ALERGPVMFCLEGKDQSD--NTVFNKIITLTTPITASYHSDKLNGIVELTGNA 611
>gi|255532639|ref|YP_003093011.1| hypothetical protein Phep_2748 [Pedobacter heparinus DSM 2366]
gi|255345623|gb|ACU04949.1| protein of unknown function DUF1680 [Pedobacter heparinus DSM 2366]
Length = 684
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 109/484 (22%), Positives = 179/484 (36%), Gaps = 87/484 (17%)
Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG------------TGYLSAFPTELFD 217
L A A M+AST++ + M + ++ Q G TG + F L
Sbjct: 118 LEAMASMYASTNDPKLDAMMDKAIAVIARSQRDDGYIYTKAMIEQRKTGSKNQFQDRL-- 175
Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
SFEA Y I ++ Y L +A EY YN QK ++ R
Sbjct: 176 SFEA--------YNIGHLMTAACVHYRATGKTTLLNVAKKATEYLYNFYQKASP--ALAR 225
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLA-HLF-----------DKPCFLGFLALQA 325
+ + G + +Y DP++L LA HL D + FL Q
Sbjct: 226 NAICPSHYMG-----VIEMYRTIKDPRYLELAKHLIAIKGKIEDGTDDNQDRIPFLQ-QT 279
Query: 326 DYLSH-FHANTHIPIVIGSQMRYEVTG-DPLYKLIGTFFMDIVNASHSYATGG------- 376
+ H AN + G Y TG D L K + + D VN Y TGG
Sbjct: 280 KAMGHAVRANY---LYAGVADLYAETGNDSLMKTLNLMW-DDVNQHKMYITGGCGSLYDG 335
Query: 377 TSAREFWWDP---KRLADTLGSE--------NEETCTTYNMLKVSRHLFRWTKEIAYADY 425
TS ++P +++ G + + ETC + + + + + + YAD
Sbjct: 336 TSPDGTSYNPTEVQKIHQAFGRDFQLPNFTAHNETCANIGNVLWNWRMLQISGDAKYADV 395
Query: 426 YERALTNGVLS-IQRG------TEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGT 478
E AL N VLS I T P LP + SK R + CC
Sbjct: 396 MELALHNSVLSGISLDGKKFLYTNPLSYSDELPFKQRWSKDRVPY-----IGLSNCCPPN 450
Query: 479 GIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
+ + +++ D Y ++G LY ++++ + L+Q+ + WD +++
Sbjct: 451 VVRTIAEVSDYAYSISDKGLWFNLYGGNTVNTTLT-DGTKLKLSQETN--YPWDGNIKIK 507
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTI 597
+ + + SL R+P W + + +N+ L PG + +W D + +
Sbjct: 508 ILSTGSKPY----SLFFRIPGWAARADLKVNGKVENMDL-RPGTYAELNRKWKAGDLVEL 562
Query: 598 QLPL 601
LP+
Sbjct: 563 VLPM 566
>gi|237719717|ref|ZP_04550198.1| six-hairpin glycosidase [Bacteroides sp. 2_2_4]
gi|229450986|gb|EEO56777.1| six-hairpin glycosidase [Bacteroides sp. 2_2_4]
Length = 668
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 87/363 (23%), Positives = 142/363 (39%), Gaps = 72/363 (19%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
L +LY +T D K+L A F G+ + + Y + H P+V +G +R
Sbjct: 219 LVKLYLVTGDKKYLDQAKFFLDA--RGYTSRKDAY-----SQAHKPVVEQDEAVGHAVRA 271
Query: 348 E-----------VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADT 392
+TGD Y K I + +IV + Y TGG AR E + + L ++
Sbjct: 272 AYMYSGMADVAAITGDSSYIKAIDKIWDNIV-SKKIYVTGGIGARHAGEAFGNNYELPNS 330
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
S ETC + ++ LF + Y D ER L NG++S + G Y PL
Sbjct: 331 --SAYCETCAAIGNVYMNYRLFLLHGDAKYFDVLERTLYNGLIS-GVSLDGGSFFYPNPL 387
Query: 453 GRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISSS 510
+R W F C C + + F L +Y ++ V Y+ Y+S+
Sbjct: 388 ASNGKYSRKP--W------FGCACCPSNVSRFIPSLPGYVYAVKDNQV---YVNLYLSNK 436
Query: 511 FDW--KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW--------- 559
+ VVL Q+ W+ +R+ + + Q +L LR+P W
Sbjct: 437 AELIVNKKKVVLEQETG--YPWNGDIRVKVA-----QGNQEFALKLRIPGWVRNEVLPSG 489
Query: 560 --TYSNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR----TEAIQ 609
+Y++ + + +NGQ +LS +W D + I + R E +
Sbjct: 490 LYSYADNQKPTYRIIVNGQETANTLNNGYLSIERKWKKGDVVKIHFDMLPRIVKANEKVV 549
Query: 610 DDR 612
DD+
Sbjct: 550 DDK 552
>gi|218260015|ref|ZP_03475494.1| hypothetical protein PRABACTJOHN_01155 [Parabacteroides johnsonii
DSM 18315]
gi|218224798|gb|EEC97448.1| hypothetical protein PRABACTJOHN_01155 [Parabacteroides johnsonii
DSM 18315]
Length = 665
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 82/353 (23%), Positives = 122/353 (34%), Gaps = 69/353 (19%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T K+L LA F DK G+ + Y + H P++ +G +R
Sbjct: 227 LCKLYLVTGQKKYLDLAKFFLDK---RGYTERKDAY-----SQAHKPVLEQDEAVGHAVR 278
Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+TGD Y + V Y TGG A + G
Sbjct: 279 AAYMYSGMADVAALTGDTGYVHAIDRIWENVVTKKLYITGGIGATNN-------GEAFGK 331
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + LF E Y D ER L NG++S E
Sbjct: 332 NYELPNLSAYCETCAAIGNVYWNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLEGNGFF 390
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y PL R + CC L IY + NV Y+ ++
Sbjct: 391 YPNPLASTGQHQRKP------WFGCACCPSNICRFIPSLPGYIYAVHDKNV---YVNLFM 441
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT------- 560
S+S D K G L WD +R+ + KQ+ +L +R+P W
Sbjct: 442 SNSSDLKVGGKSLKLTQSTGYPWDGDVRLDVAPKGKQDF----TLKIRVPGWVRGEVVPS 497
Query: 561 ----YSNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
+S+G Q +NG+ + + S T +W D + + + RT
Sbjct: 498 DLYMFSDGKQLGYSVKVNGEPVESNLDKGYFSITRQWKKGDVVEVHFDMEPRT 550
>gi|261878820|ref|ZP_06005247.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
gi|270334561|gb|EFA45347.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
Length = 819
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 137/352 (38%), Gaps = 64/352 (18%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY T + K+L A F + G ++ +Y + +H P+V +G +R
Sbjct: 223 ALCKLYLATGNRKYLDQAKFFLD--YRGKTTIRQEY-----SQSHKPVVEQDEAVGHAVR 275
Query: 347 YE-----------VTGDPLY-KLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLAD 391
+TGD Y K I + +IV Y TGG TS E + L +
Sbjct: 276 AAYMYAGMADVAALTGDADYIKAIDRIWDNIV-GKKLYITGGIGATSNGEAFGKNYELPN 334
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
S ETC + V+ LF E Y D ER+L NG++S + G Y P
Sbjct: 335 M--SAYAETCAAIGNVYVNYRLFLLHGESKYFDVLERSLYNGLIS-GVSMDGGGFFYPNP 391
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
L ++ H F CC L +Y ++ N LY+ ++S+S
Sbjct: 392 L-----ESMGQHQRQAWFGCA-CCPSNICRFLPSLPGYVYAVKDNN---LYVNLFLSNSA 442
Query: 512 DWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT--------- 560
K +V L Q + WD + + + + G L +R+P W
Sbjct: 443 TMKVNGKNVSLTQSTN--YPWDGDIAIRVDRNKAGSFG----LKIRIPGWIKGQPVPSDL 496
Query: 561 --YSNGAQAS----LNGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
YS+G + + +NG+ + P + + RW D +TI + +RT
Sbjct: 497 YYYSDGKRPNYTILVNGKAIEPTITDDGYCTINRRWKKGDVVTIHFDMEVRT 548
>gi|189462782|ref|ZP_03011567.1| hypothetical protein BACCOP_03480 [Bacteroides coprocola DSM 17136]
gi|189430398|gb|EDU99382.1| hypothetical protein BACCOP_03480 [Bacteroides coprocola DSM 17136]
Length = 578
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 51/227 (22%), Positives = 87/227 (38%), Gaps = 33/227 (14%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + + +F K+ Y D E AL N VL+ + Y+ PL +
Sbjct: 109 ETCAAVGNVMFNYRMFLTKKDARYVDVAEVALYNNVLA-GVNLDGNKFFYVNPLE---AD 164
Query: 459 ARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS--FD 512
AR+ G K S W CC ++ +Y + ++ Y Y +S
Sbjct: 165 ARNAFNQGLKGRSPWFGTACCPSNIARLIPQIPGMMYAHTDNDI---YCTFYAGTSTVVP 221
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-----------Y 561
G V + Q + +D +R + ++ Q +++ R+P W Y
Sbjct: 222 LSDGKVTIKQTTN--YPFDESVRFEI---KPEQSKQKFAMHFRIPTWAGKQFVPGKLYHY 276
Query: 562 SNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
N A LNG+ + + P F++ W D + +QLP+ +R
Sbjct: 277 LNDKPAEWKVLLNGKEVSVKPHKGFVTIERAWKSGDLVELQLPMLVR 323
>gi|266624999|ref|ZP_06117934.1| putative cytoplasmic protein, partial [Clostridium hathewayi DSM
13479]
gi|288863113|gb|EFC95411.1| putative cytoplasmic protein [Clostridium hathewayi DSM 13479]
Length = 323
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 50/234 (21%), Positives = 96/234 (41%), Gaps = 20/234 (8%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + ++ +R + + + YAD ER L NGVLS + Y+ PL V +
Sbjct: 8 ETCASVGLVFFARRMLQIRPDAQYADVMERVLYNGVLS-GMALDGKSFFYVNPL-EVVPE 65
Query: 459 A-----RSTHGWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
A R +H + F CC S +G Y E+E + +I YI +
Sbjct: 66 ACHRDERKSHVKPVRQKWFGCACCPPNVARLLSSVGSYAYTEKEDTI---FIHLYIGAIL 122
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
+ + K+ W+ + + + + V ++ ++ +P W + + +NG
Sbjct: 123 KKQINGKEMEVKIQSEFPWNGKVNVYV-----KGVREVCTIAFHIPEWGEAYQL-SKING 176
Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+ + +L T++W +++ +Q P+ +R E A++ GP
Sbjct: 177 ATIKVKE--RYLYVTKKWEEEEEIHLQFPMEVRLIEANPFVRENIGKNAVMRGP 228
>gi|325282251|ref|YP_004254793.1| hypothetical protein Odosp_3669 [Odoribacter splanchnicus DSM
20712]
gi|324314060|gb|ADY34613.1| protein of unknown function DUF1680 [Odoribacter splanchnicus DSM
20712]
Length = 796
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 78/320 (24%), Positives = 118/320 (36%), Gaps = 59/320 (18%)
Query: 372 YATGGTSAREFWWDPKRLADTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYA 423
Y TGG AR + + G E ETC + + + + LF T E Y
Sbjct: 309 YITGGIGARAW-------GEGFGENYELPNMTSYCETCASISNVYWNYRLFLLTGESKYY 361
Query: 424 DYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTGIES 482
D ERAL NGV+S + Y PL S RS W F C C + I
Sbjct: 362 DVLERALYNGVIS-GVSLDGKRYFYDNPLMSDGSHDRSE--W------FGCSCCPSNITR 412
Query: 483 FSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSS 542
F + GN L++ Y+ + + K + W+ +++TL S
Sbjct: 413 FMPSIPGYVYAVRGNT--LFVNLYMGNEGQITLEGQPVRIKQETRYPWEGRIKLTLDHSP 470
Query: 543 KQEVGQLSSLNLRMPVW-----------TY----SNGAQASLNGQNLPLPPPGNFLSATE 587
+L LR+P W TY + SLNG+ + +
Sbjct: 471 ASSF----TLALRIPGWVQQQPLPGTLYTYLDKDTPSYTISLNGKTVKPEVRNGYALLRG 526
Query: 588 RWSYNDKLTIQLPLSLRT----EAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTA 643
W ND++ + LP+ +R + DDR +Y A+++GP + S G A
Sbjct: 527 DWKGNDQIVLNLPMQVRKVIADPQVIDDRNKY----ALIYGPIVYCVEASDH----DGYA 578
Query: 644 RSL-SALISPIPPSFNAQLV 662
L + +P P F L+
Sbjct: 579 LDLFTEEDTPFSPEFKPDLL 598
>gi|261341800|ref|ZP_05969658.1| hypothetical protein ENTCAN_08284 [Enterobacter cancerogenus ATCC
35316]
gi|288316173|gb|EFC55111.1| putative cytoplasmic protein [Enterobacter cancerogenus ATCC 35316]
Length = 651
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 78/364 (21%), Positives = 127/364 (34%), Gaps = 73/364 (20%)
Query: 293 LYRLYSITHDPKHLLLAHLFDK-----PCFLGFLALQADYLSHFH-------------AN 334
L RLY +T +P+++ L + F + P F + SH+H +
Sbjct: 193 LMRLYDVTQEPRYMALVNYFIEARGTTPHFYDIEYEKRGRTSHWHNYGPAWMVKDKAYSQ 252
Query: 335 THIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG-- 376
H P+ IG +R+ ++ D + + Y TGG
Sbjct: 253 AHQPLSEQQTAIGHAVRFVYLMAGMAHLARLSNDDGKRQDCLRLWRNMAQRQLYITGGIG 312
Query: 377 --TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
+S F D DT+ +E +C + ++ +R + + YAD ERAL N V
Sbjct: 313 SQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMETDSQYADVMERALYNTV 369
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGI 480
L + Y+ PL H FN + CC
Sbjct: 370 LG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIYDHVKPVRQRWFGCACCPPNIA 420
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
+ LG IY L+I Y+ + G L ++ W + ++ +
Sbjct: 421 RVLTSLGHYIYTLHPET---LFINLYVGNDIAVPVGDQQLQLRISGNYPW--HEQVNIEI 475
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
+S V +L LR+P W + SLNG + +L W D LT+ LP
Sbjct: 476 ASPVPVTH--TLALRLPDWC--ENPEVSLNGAAVTGEVSRGYLYLRRSWQEGDVLTLTLP 531
Query: 601 LSLR 604
+ +R
Sbjct: 532 MPVR 535
>gi|256420772|ref|YP_003121425.1| hypothetical protein Cpin_1728 [Chitinophaga pinensis DSM 2588]
gi|256035680|gb|ACU59224.1| protein of unknown function DUF1680 [Chitinophaga pinensis DSM
2588]
Length = 675
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 41/169 (24%), Positives = 71/169 (42%), Gaps = 26/169 (15%)
Query: 444 GVMIYMLPLGR---GVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPG 500
GV + LP R V ARS + CC + ++K ++++ G G
Sbjct: 384 GVFNFSLPFDREMCNVLGARS---------GYTCCLANMHQGWTKYTSHLWYQTSGK--G 432
Query: 501 LYIIQY----ISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRM 556
+ ++Y +++ K V + + D ++ +R + + E L LR+
Sbjct: 433 VAALEYGPCVMTAEVGKKHRDVTITEVTD--YPFNEEIRFQIAIKKETEF----PLQLRI 486
Query: 557 PVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
P W N A LNGQ L G ++ W D+LT+QLP+++ T
Sbjct: 487 PAW--CNEAVILLNGQPLRKDKGGQIITIEREWQDKDELTLQLPMTITT 533
>gi|399041428|ref|ZP_10736483.1| hypothetical protein PMI09_04045 [Rhizobium sp. CF122]
gi|398060198|gb|EJL52027.1| hypothetical protein PMI09_04045 [Rhizobium sp. CF122]
Length = 640
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 77/348 (22%), Positives = 131/348 (37%), Gaps = 53/348 (15%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-ADYLSHFHANT------HIPI 339
L +L +T + K+L LA F +P F A++ + FH T H P+
Sbjct: 198 ALVKLARVTGEKKYLDLAKFFIDERGTEPNFFTEEAIRDGRDAADFHQKTYEYGQAHEPV 257
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG +A
Sbjct: 258 REQKKVVGHAVRAMYLYSGMADIATEYNDDSLTGALETLWDDLTT-KQMYVTGGIGPAAA 316
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + S ETC + ++ + + YAD E+AL NG ++
Sbjct: 317 NEGFTDYYDLPNE--SAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GL 373
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
+ Y PL A H W ++ CC + +G +Y E +
Sbjct: 374 SLDGKTFFYENPL----ESAGKHHRW--IWHHCPCCPPNIARLLASIGSYMYGVAEDEI- 426
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
+++ + F V L QK W + + S + +++LR+P W
Sbjct: 427 AVHLYGEGRARFKMAGADVALTQKTR--YPWHGAVHFDIKTSKPAQF----AVSLRIPGW 480
Query: 560 TYSNGAQASLNGQNLPLP--PPGNFLSATERWSYNDKLTIQLPLSLRT 605
+NGA ++NG+ + + + W DK+ + +PL R+
Sbjct: 481 --ANGATLAVNGEAIDIGSVDVDGYARIEREWRDGDKIDLDIPLEARS 526
>gi|448391565|ref|ZP_21566711.1| hypothetical protein C477_10858 [Haloterrigena salina JCM 13891]
gi|445665886|gb|ELZ18561.1| hypothetical protein C477_10858 [Haloterrigena salina JCM 13891]
Length = 637
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 53/269 (19%), Positives = 102/269 (37%), Gaps = 32/269 (11%)
Query: 350 TGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE----ETCTTY 404
TGD LY + + ++ +Y TGG + +R D N ETC
Sbjct: 269 TGDRELYDQLQALWRNMTE-RRTYVTGGIGSTHH---GERFTDDYDLPNRTSYAETCAAV 324
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK------ 458
+ + +F+ + ++ Y + ER L NG L+ + Y PL G
Sbjct: 325 GSVFWNHRMFQLSGDVQYPELVERTLYNGFLA-GLSLDATEFFYANPLEVGPDGHALADE 383
Query: 459 -----ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
+ GW F+ CC + LG IY + P +Y+ Q++ S
Sbjct: 384 NPDRFSNQRQGW---FDCA-CCPPNAARLIASLGRYIY-ARATDEPAVYVNQFVGSEAAL 438
Query: 514 KSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
+ + + + W + +T+ + + +L +R+P W + A++ G++
Sbjct: 439 TIDDTDVRLRQESALPWAGDVTLTVDPAEPTDF----ALRVRVPEW--CSDVTATVAGES 492
Query: 574 LPLPPPGNFLSATERWSYNDKLTIQLPLS 602
+ P ++ W D+LT+ ++
Sbjct: 493 RSVEPDDGYIEVAREWEDGDELTVTFGMA 521
>gi|262382782|ref|ZP_06075919.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262295660|gb|EEY83591.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 618
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 49/229 (21%), Positives = 94/229 (41%), Gaps = 21/229 (9%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPL-GRGV 456
ETC + M+ ++ + + T + Y D ER+L NG L+ I G + Y+ PL +G
Sbjct: 336 ETCASVGMVLWNQRMNQLTGDSKYIDVLERSLYNGALAGISLGGDR--FFYVNPLESKGD 393
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
+ +G CC +G+ IY + L++ YI ++ + G
Sbjct: 394 HHRQEWYGCA-------CCPSQLSRFLPSIGNYIYASSD---DALWVNLYIGNTGQIRIG 443
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
+ + WD +++T++ S E + LR+P W + S+NG+ + +
Sbjct: 444 ETDIQLTQETDYPWDGSVKLTISTSQPLE----KEIRLRIPNWCKT--YDLSINGKRINV 497
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+ + + W D + + + + + A E +AI GP
Sbjct: 498 SEEKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFGKRAIQRGP 545
>gi|255035900|ref|YP_003086521.1| hypothetical protein Dfer_2133 [Dyadobacter fermentans DSM 18053]
gi|254948656|gb|ACT93356.1| protein of unknown function DUF1680 [Dyadobacter fermentans DSM
18053]
Length = 673
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 109/486 (22%), Positives = 173/486 (35%), Gaps = 87/486 (17%)
Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFD----------SF 219
A A ++A+T + + E M + +++ Q K G Y A + + SF
Sbjct: 107 FEAVASLYAATKDPKLDELMDKTIAVIAKAQRKDGYIYTKAIIEQKQNGEGKMFADRLSF 166
Query: 220 EALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHW 279
EA Y ++ Y L +A ++ +IT Y
Sbjct: 167 EA--------YNFGHLMTAACVHYRATGKTSLLDVAKKAADF-------LITFYGAATPE 211
Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHL-LLAHLF----------DKPCFLGFLALQADYL 328
S N L LY THD K+L L+ HL D + FL Q +
Sbjct: 212 QSRNAICPAHYMGLSELYRTTHDEKYLTLVKHLIAIKGATEGTDDNQDRIPFLK-QTKVM 270
Query: 329 SH-FHANTHIPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDP 386
H AN + G Y TGD L + T + D+ Y TGG A P
Sbjct: 271 GHAVRANY---LYAGVADVYAETGDEALLAQLHTMWDDVTQ-HKMYVTGGCGALYDGTSP 326
Query: 387 ----------KRLADTLGSE--------NEETCTTYNMLKVSRHLFRWTKEIAYADYYER 428
+++ G + + ETC + + + + T E YAD E
Sbjct: 327 DGTSYKPDEVQKIHQAYGRDYQLPNFTAHNETCANIGNVLWNWRMLQITGEAKYADIVEL 386
Query: 429 ALTNGVLS--IQRG-----TEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
AL N VLS +G T P LP + K R + +K N CC +
Sbjct: 387 ALYNSVLSGISLKGDKFLYTNPLAYSDALPFKQRWEKDR--QAYISKSN---CCPPNTVR 441
Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW--KSGHVVLNQKVDPIVSWDPYLRMTLT 539
+ +++ Y + G++ Y + F K G + L Q D W+ + +TL
Sbjct: 442 TVAEVSQYAYSLSDA---GVFFNLYGGNKFQTAVKGGQLQLTQVTD--YPWNGKISITLD 496
Query: 540 FSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYNDKLTIQ 598
+ K + SL R+P W + A +NG+ G++ W DK+ +
Sbjct: 497 QAPKDAL----SLFFRIPGW--CSNASMVINGKKETAKLASGSYAELRRTWKSGDKIELM 550
Query: 599 LPLSLR 604
L + ++
Sbjct: 551 LEMPVK 556
>gi|338730906|ref|YP_004660298.1| hypothetical protein Theth_1126 [Thermotoga thermarum DSM 5069]
gi|335365257|gb|AEH51202.1| protein of unknown function DUF1680 [Thermotoga thermarum DSM 5069]
Length = 621
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 83/361 (22%), Positives = 132/361 (36%), Gaps = 47/361 (13%)
Query: 332 HANTHIPIVIGSQMRY-EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFW---WDPK 387
HA + + G+ Y E G ++K + + D+ Y TGG +R W +P
Sbjct: 249 HAVRMLYLCCGATDLYLETEGKAIWKTLENLWKDMT-TRKMYITGGVGSRHDWESIGEPY 307
Query: 388 RLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
L + ETC + +F + E + D E+ + NG+LS +
Sbjct: 308 ELPNRRAYA--ETCAAIANFMWNYRMFLASGEARFVDVMEQVVYNGLLS-GISLDGDKYF 364
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
Y PL GTK W CC + + L IY + + L++
Sbjct: 365 YDNPL----------EDMGTKRRQRWFDCACCPPNIARTIASLPHYIYAQSKDK---LWV 411
Query: 504 IQYISSSFDWKSGHVVLN--QKVDPIVSWDPYLRM----TLTFSSKQEVGQLSSLNLRMP 557
Y SS+F V + Q+ D S D ++R+ TL+F +L LR+P
Sbjct: 412 NLYESSTFKIIHNDVPIEIVQQTDYPWSGDVHIRIAARETLSF----------TLLLRIP 461
Query: 558 VWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR--PEY 615
W S LNG+++ + W + +QL L LR E +Q E
Sbjct: 462 EW--SADFDLKLNGKSVKFHLNNGYAELQNSWKGTN--NVQLTLKLRPECLQSHPYVSEN 517
Query: 616 ASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
A+ GP L D T + S +P + + F +G +T +
Sbjct: 518 HGKVAVRSGPVLYCIEQVDNPDFDIWTLKIDSDSFEMVPGEILGKRMFFLLGNGKATNIR 577
Query: 676 S 676
S
Sbjct: 578 S 578
>gi|338212418|ref|YP_004656473.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336306239|gb|AEI49341.1| protein of unknown function DUF1680 [Runella slithyformis DSM
19594]
Length = 618
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 58/282 (20%), Positives = 109/282 (38%), Gaps = 22/282 (7%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR-GVS 457
ETC + M+ ++ + ++ E Y D ER+L NG L+ + T + Y+ PL G+
Sbjct: 331 ETCASVGMVFWNQRMNLYSGEAKYVDVLERSLYNGALAGVQLT-GNLFFYVNPLASFGLH 389
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
R +G CC +G IY E L++ Y+ S + G+
Sbjct: 390 HRRPWYGTA-------CCPSNVSRLMPSVGGYIYNTSENT---LWVNLYVGSETEVMLGN 439
Query: 518 VVLNQKVDPIVSWDPYLRM-TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL-P 575
+ W + + + SSK + +L LR+P W + +NG+ +
Sbjct: 440 HKVKFAKKTNYPWAGEVEIKAIPDSSKADF----ALKLRIPAW--CDKYTVEINGKPVEK 493
Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTS 633
L +++ W+ ND L +++ + ++ A +AI GP Y + +
Sbjct: 494 LTVDKGYVTVARTWAKNDVLKLRMDMPVKVVAADPRVKANEGKRAIQRGPLVYCVEEQDN 553
Query: 634 GEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVM 675
D + + P+ + T ++GN F +
Sbjct: 554 RHLDYDQILLSKKTQFSTTFEPTLLGGVTTIKAQNGNENFTL 595
>gi|424870152|ref|ZP_18293818.1| hypothetical protein Rleg5DRAFT_7481 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393171573|gb|EJC71619.1| hypothetical protein Rleg5DRAFT_7481 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 647
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 105/483 (21%), Positives = 179/483 (37%), Gaps = 97/483 (20%)
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKP-- 224
G ++ A++ + +A ++ K+ +V L + Q + GYL+++ F +P
Sbjct: 90 GKWIEAASYTLKAHPDAALEAKIDAIVEKLEKGQ--MADGYLNSW-------FIRREPDR 140
Query: 225 VWAPYYTIHKI--LAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER---HW 279
W +H++ + LL+ V A + ++ V +I + E
Sbjct: 141 RWTNLRDLHEMYSMGHLLEGAVAYREATGKRR---FLDVMIRAVDHIIATFGAEPGKLRG 197
Query: 280 YSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQAD-------Y 327
Y +EE + L +LY +T DP+HL LA F P + A + Y
Sbjct: 198 YDAHEE---IELALVKLYRVTRDPRHLKLATYFVDERGRMPSYYDEEARKRGESPDDYVY 254
Query: 328 LSHFHANTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHS 371
++ ++ H+P+ V+G +R DP K D +
Sbjct: 255 KTYAYSQAHMPVRDQHQVVGHAVRAMYLFSAMADLSHENDDPTLKEACNRLFDNLVGRQL 314
Query: 372 YATGGTS--------AREFWWDPKRLADTLGSEN--EETCTTYNMLKVSRHLFRWTKEIA 421
Y TGG REF L +E ETC + S + + +
Sbjct: 315 YVTGGLGPSASNEGFTREF---------DLPNETAYAETCAAVALGFWSHRMAQVDLDSK 365
Query: 422 YADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGTG 479
+ D E L NG LS I R E +L +HG ++ +C C T
Sbjct: 366 FTDRLETVLYNGALSGISRDGEHYFYENVL----------ESHGQHRRWKWHYCPCCPTN 415
Query: 480 IESF-SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY--LRM 536
I F + LG Y ++ L + Y ++S + G+ + + + WD LR
Sbjct: 416 IARFITSLGQYFYSTDDHQ---LAVHLYGTNSAELTVGNSFVRLIQETLYPWDGDIGLRF 472
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDK 594
L S+ + L LR+P W AQ S+NG + L + + + W D+
Sbjct: 473 ALERPSRFQ------LRLRIPGWCRQ--AQISVNGVAVDLDQCVTKGYAAISREWRNGDE 524
Query: 595 LTI 597
+ I
Sbjct: 525 VRI 527
>gi|380510716|ref|ZP_09854123.1| hypothetical protein XsacN4_05853 [Xanthomonas sacchari NCPPB 4393]
Length = 660
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 45/211 (21%), Positives = 79/211 (37%), Gaps = 18/211 (8%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E+C + ++ + + + + YAD ERAL N VL + Y+ PL
Sbjct: 339 ESCASIGLMMFANRMLQLAPDGRYADVMERALYNTVLG-GMALDGRHFFYVNPLEVHPPT 397
Query: 459 ARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
H + W CC + LG +Y + LY+ Y+ S
Sbjct: 398 LHGNHTFDHVKPVRQRWFGCACCPPNIARVLTSLGHYLYTRHDDT---LYVNLYVGSDAR 454
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
++ G +L + W + + S+ + ++L LR+P W Q LNG+
Sbjct: 455 FEVGGQILTLRQRGEYPWQDTIDFDVACSAPMD----AALALRLPDWC--QAPQLLLNGE 508
Query: 573 NLPLPP--PGNFLSATERWSYNDKLTIQLPL 601
+ + + RW D L ++LP+
Sbjct: 509 PVAIEAHRQHGYCVLRRRWQSGDTLQLRLPM 539
>gi|148269779|ref|YP_001244239.1| hypothetical protein Tpet_0643 [Thermotoga petrophila RKU-1]
gi|147735323|gb|ABQ46663.1| protein of unknown function DUF1680 [Thermotoga petrophila RKU-1]
Length = 620
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 72/341 (21%), Positives = 138/341 (40%), Gaps = 50/341 (14%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD-----YLSHF----------HANTHI 337
L LY T D K+L LA F G ++ + ++ H HA +
Sbjct: 196 LVELYRETGDRKYLDLARYFIYARGKGLASVPRNPGPEYFIDHKPFVELEEITGHAVRAL 255
Query: 338 PIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
+ G+ Y TGD +++ + + + V Y TGG +R W ++ G E
Sbjct: 256 YLCSGATDLYLETGDEKIWQALNRLWENFV-TKKMYITGGAGSRHDW-------ESFGEE 307
Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
E E+C + + + T E +AD E+ L NG+LS + Y
Sbjct: 308 YELSNRRSYAESCASIANFMWNFRMLLATGEGKFADVMEQVLYNGLLS-GISLDGKHYFY 366
Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
PL + + R K+ CC + +Y + V +++ + +
Sbjct: 367 FNPL-EDLGRTRR-----QKWFDCACCPPNLARFIASFPGYMYTTSDDGVQ-VHLYEKST 419
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQAS 568
S ++K+ V + Q+ D W +TF+ + ++ + S++LR+P W ++
Sbjct: 420 SKLNFKNSVVEIEQETD--YPWSG----EVTFTVETDIEEPFSISLRIPSW--ADDFVLR 471
Query: 569 LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
++G+ + P ++ ++ W K T++L L ++ E I+
Sbjct: 472 VDGKTVTANPQNGYVKLSQSW--KGKHTVELSLPMKVEFIE 510
>gi|374984436|ref|YP_004959931.1| hypothetical protein SBI_01679 [Streptomyces bingchenggensis BCW-1]
gi|297155088|gb|ADI04800.1| hypothetical protein SBI_01679 [Streptomyces bingchenggensis BCW-1]
Length = 666
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 60/258 (23%), Positives = 97/258 (37%), Gaps = 15/258 (5%)
Query: 350 TGDPLYKLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADTLGSENEETCTTYNM 406
TGDP + + + A+ +Y TGG +R E + D L ETC
Sbjct: 289 TGDPGLREALVRLWEDMAATKTYLTGGVGSRHDLEAFGDAYELPPD--RAYAETCAAIAS 346
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG 466
++ + T E Y+D ER L NG LS + +Y+ PL A G
Sbjct: 347 IQFGWRMALLTGEARYSDLVERTLYNGFLS-GVSLDGNRWLYVNPLQVREDYAGPHGDQG 405
Query: 467 TKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDP 526
+ ++ C L ++ G+ GL + QY S S+ G V +V
Sbjct: 406 ARRTEWFRCACCPPNVMRLLASLPHYVASGDADGLQLHQYASGSYAAGGGAV----RVGT 461
Query: 527 IVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSAT 586
W+ R+ + G +L+LR+P W G ++ G+ + +L
Sbjct: 462 GYPWE--GRIAVVVDEVPGDGDW-TLSLRIPHWADEYG--VTVGGEPVAARAESGWLRLR 516
Query: 587 ERWSYNDKLTIQLPLSLR 604
W + + + LPL R
Sbjct: 517 RHWRPGETVVLALPLRPR 534
>gi|343085566|ref|YP_004774861.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342354100|gb|AEL26630.1| protein of unknown function DUF1680 [Cyclobacterium marinum DSM
745]
Length = 690
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 106/250 (42%), Gaps = 23/250 (9%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG--VMIYMLPLGRGV 456
ETC + + + T + +AD E +L N VLS GT+ G Y PL R
Sbjct: 373 ETCANIGNVLWNHRMLLVTGDSRFADILELSLFNSVLS---GTDLGGTNFNYTNPL-RVD 428
Query: 457 SKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSF 511
T W + CC + + ++ + Y + G V LY + +S
Sbjct: 429 KDLPFTFRWNKVREPYISKSNCCPPNVVRTVAETHNYAYALSDNGLVVNLYGSNELKTSL 488
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVWTYSNGAQASLN 570
S + L Q+ D WD +++++ Q+ GQ +++LR+P W ++ A+ ++N
Sbjct: 489 PNGSS-LELKQETD--YPWDGKIKLSI-----QKTGQDPLAIDLRVPAW--ASQAEITVN 538
Query: 571 GQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
G+ P G++ S +W D + + LP++ R E + A++ GP +
Sbjct: 539 GEKSKEKPIAGSYFSLVRQWEKGDVIELNLPMTARLMEANPLVEETRNQVAVVRGPIVYC 598
Query: 630 GHTSGEWDIK 639
+S D +
Sbjct: 599 IESSDLQDAR 608
>gi|421598168|ref|ZP_16041640.1| hypothetical protein BCCGELA001_11816 [Bradyrhizobium sp.
CCGE-LA001]
gi|404269708|gb|EJZ33916.1| hypothetical protein BCCGELA001_11816 [Bradyrhizobium sp.
CCGE-LA001]
Length = 276
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 37/154 (24%), Positives = 61/154 (39%), Gaps = 9/154 (5%)
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC F+ +G IY LY+ YI +S G L +++ W+
Sbjct: 39 CCPPNIARLFTSVGHYIYTPRS---EALYVNLYIGNSVAIAVGGHTLRLRMNGNYPWEDL 95
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
+ + + S+Q + +L LR+P W + + LNG+ + P +L W D
Sbjct: 96 VEIAV--ESEQPITH--TLALRLPEWC--SAPEVKLNGEPVNCEPRKGYLHIHRTWRKGD 149
Query: 594 KLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
+ +QLP+ R A AI GP +
Sbjct: 150 RCKLQLPMKSRRVYGHPQLRHLAGKVAIQRGPLI 183
>gi|256838374|ref|ZP_05543884.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|256739293|gb|EEU52617.1| conserved hypothetical protein [Parabacteroides sp. D13]
Length = 618
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 51/231 (22%), Positives = 97/231 (41%), Gaps = 25/231 (10%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRGTEPGVMIYMLPL-GRGV 456
ETC + M+ ++ + + T + Y D ER+L NG L+ I G + Y+ PL +G
Sbjct: 336 ETCASVGMVLWNQRMNQLTGDSKYIDVLERSLYNGALAGISLGGDR--FFYVNPLESKGD 393
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
+ +G CC +G+ IY + L++ YI ++ + G
Sbjct: 394 HHRQEWYGCA-------CCPSQLSRFLPSIGNYIYASSD---DALWVNLYIGNTGQIRIG 443
Query: 517 H--VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
++L Q+ D WD +++T++ S E + LR+P W + S+NG+ +
Sbjct: 444 ETDILLTQETD--YPWDGSVKLTISTSQPLE----KEIRLRIPNWCKT--YDLSINGKRI 495
Query: 575 PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+ + + + W D + + + + + A E + I GP
Sbjct: 496 NVSEEKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFGKRVIQRGP 545
>gi|281412335|ref|YP_003346414.1| hypothetical protein Tnap_0910 [Thermotoga naphthophila RKU-10]
gi|281373438|gb|ADA67000.1| protein of unknown function DUF1680 [Thermotoga naphthophila
RKU-10]
Length = 620
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 72/342 (21%), Positives = 138/342 (40%), Gaps = 50/342 (14%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD-----YLSHF----------HANTH 336
L LY T D K+L LA F G ++ + ++ H HA
Sbjct: 195 ALVELYRETGDRKYLDLARYFIYTRGKGLASVPRNPGPEYFIDHKPFVELEEITGHAVRA 254
Query: 337 IPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+ + G+ Y TGD +++ + + + V Y TGG +R W ++ G
Sbjct: 255 LYLCSGATDLYLETGDEKIWQALNRLWENFV-TKKMYITGGAGSRHDW-------ESFGE 306
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E E E+C + + + T E +AD E+ L NG+LS +
Sbjct: 307 EYELSNRRSYAESCASIANFMWNFRMLLATGEGKFADVMEQVLYNGLLS-GISLDGKHYF 365
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y PL + + R K+ CC + +Y + V +++ +
Sbjct: 366 YFNPL-EDLGRTRR-----QKWFDCACCPPNLARFIASFPGYMYTTSDDGVQ-VHLYEKS 418
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
+S ++K+ V + Q+ D W +TF+ + ++ + S++LR+P W ++
Sbjct: 419 TSKLNFKNSVVEIEQETD--YPWSG----EVTFTVETDIEEPFSISLRIPSW--ADDFVL 470
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
++G+ + P ++ ++ W K T++L L ++ E I+
Sbjct: 471 RVDGKTVTANPQNGYVKLSQSW--KGKHTVELSLPMKVEFIE 510
>gi|271965305|ref|YP_003339501.1| hypothetical protein [Streptosporangium roseum DSM 43021]
gi|270508480|gb|ACZ86758.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021]
Length = 654
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 47/213 (22%), Positives = 84/213 (39%), Gaps = 25/213 (11%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + ++ L T ++ YAD ER + N VL+ E Y PL V
Sbjct: 304 ETCAGIGSIMLAHRLLLATGDVRYADLAERTMFN-VLATSPALEGRSFFYANPLHVRVPA 362
Query: 459 A-------RSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
A + G + + + CC +++ L + + G+ I + +
Sbjct: 363 APPEGMNPAAEGGLRSPWFTVSCCPNNIARTYASLA---AYVATSDASGVQIHHHTPAEI 419
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
H L +V+ W + + + G ++LR+P W ++GA+ S G
Sbjct: 420 H----HEGLVLRVETGYPWSGEVTVRVVR------GGSGRISLRVPPW--ASGARISHGG 467
Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
P+ P + A RW D++ + LP++ R
Sbjct: 468 TTRPV--PAGYAVAEGRWRPGDEIRLHLPMTPR 498
>gi|448418968|ref|ZP_21580124.1| hypothetical protein C474_15274 [Halosarcina pallida JCM 14848]
gi|445675954|gb|ELZ28481.1| hypothetical protein C474_15274 [Halosarcina pallida JCM 14848]
Length = 642
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 60/243 (24%), Positives = 90/243 (37%), Gaps = 50/243 (20%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGV 456
ETC + ++ L T E YAD ER L NG L+ GT Y PL
Sbjct: 342 ETCAAIGSIFWNQRLLELTGEAKYADLIERTLYNGFLAGVSLDGTR---FFYENPLESSG 398
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYII-QYISSSFDWKS 515
R GW T CC F+ LG +Y NV G+ + QY+ S+
Sbjct: 399 DHHRK--GWFTCA----CCPPNAARLFASLGRYVY----SNVDGVLTVNQYVGSTVTTTV 448
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
G + + W +TLT + + V + LR+P W + A S++G+
Sbjct: 449 GGTEVELTQSSSLPWSG--EVTLTVDADEAV----PIRLRVPAW--ATDASVSIDGEEAE 500
Query: 576 LPPPGNFLSATERWSYNDKLTIQL-------------------------PLSLRTEAIQD 610
G ++ W+ D++T++ PL EA+ +
Sbjct: 501 RSDDGAYVELDGEWN-GDRITVRFGQETELVRAHPAVESDAGRVAVERGPLVYCAEAVDN 559
Query: 611 DRP 613
DRP
Sbjct: 560 DRP 562
>gi|241895790|ref|ZP_04783086.1| protein of hypothetical function DUF1680 [Weissella
paramesenteroides ATCC 33313]
gi|241870833|gb|EER74584.1| protein of hypothetical function DUF1680 [Weissella
paramesenteroides ATCC 33313]
Length = 655
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 111/530 (20%), Positives = 198/530 (37%), Gaps = 85/530 (16%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-----PTELFDSFE 220
V +L A+A ++ + +K+ ++ +++ Q+ GYLS + P F+
Sbjct: 86 VYKWLEAAAYSFSYHQDDNLKKMTDELIDLIADAQDD--DGYLSTYFQIDAPER---KFK 140
Query: 221 ALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWY 280
L+ Y H I AG+ Y N +AL++A M + + K + + H Y
Sbjct: 141 RLQQSHELYTMGHYIEAGVA-YYQATGNQKALQIAERMADC----IDKNFGLKDGQIHGY 195
Query: 281 SLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLG-----------FLALQ 324
+ E + L RL+ T + ++L LAH F P F +A
Sbjct: 196 DGHPE---IELALARLFEATQEQRYLDLAHYFLNQRGQNPEFFDEQIKADGVDRDLIAGM 252
Query: 325 ADYLSHF---------------HANTHIPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNA 368
D+ + HA + + G M TGD L F+ DIV
Sbjct: 253 RDFPRRYYQAAEPIKDQQTADGHAVRVVYLCTGMAMVARHTGDQELLAACKRFWNDIVK- 311
Query: 369 SHSYATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYAD 424
Y TG T+ F +D DT+ E TC + M ++ + + + Y D
Sbjct: 312 RRMYITGNIGSTTTGEAFTYDYDLPNDTMYGE---TCASVGMSFFAKEMLKIEAKGEYGD 368
Query: 425 YYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR----STHGWGTKFNSFW--CCYGT 478
E+ L NG LS + Y+ PL + ++ +H + + F CC
Sbjct: 369 ILEKELFNGSLS-GMSLDGKHFFYVNPLEADPTASKLNPGKSHILTHRADWFGCACCPAN 427
Query: 479 GIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTL 538
+ + IY + + Q+I++ + G V P W ++ L
Sbjct: 428 LARLITSVDQYIYTVHDNTILSH---QFIANEASFSDGVTVTQTNNFP---WQGDIKYHL 481
Query: 539 TFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
++ + +R+P W+ + ++NGQN+ F+ T D + I+
Sbjct: 482 ENANHKTY----QFGIRVPQWS-QDEFSVAVNGQNVDATIEDGFIYLTID---QDNVDIE 533
Query: 599 LPLSLRTEAIQDDRPEYASIQ--AILFGPYLLAGHTSGE----WDIKTGT 642
L L++ T+ ++ + A+ A+ GP + A + WD T
Sbjct: 534 LTLNMATKLMRSNNRVKANFGQVAVTRGPLVYAAEEADNEAPLWDYHVNT 583
>gi|325103091|ref|YP_004272745.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324971939|gb|ADY50923.1| protein of unknown function DUF1680 [Pedobacter saltans DSM 12145]
Length = 673
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 90/212 (42%), Gaps = 23/212 (10%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS-IQRG------TEPGVMIYMLP 451
ETC + + + + + YAD E AL N VLS I T P LP
Sbjct: 357 ETCANIGNVLWNWRMLQLEGDAKYADVMELALYNSVLSGISLDGKRFLYTNPLSYSDNLP 416
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
+ SK R + K ++ CC + + +++ + Y +G LY +S+
Sbjct: 417 FKQRWSKERVEY---IKLSN--CCPPNTVRTIAEVSNYAYSISNKGVYVNLYGSNNLSTK 471
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
D S + Q P W+ + +T++ S K S+ +R+P W +N A+ S+N
Sbjct: 472 LDDGSTIKLTQQTEYP---WEGRVAITISESKKSPF----SIFMRIPGW--ANSAKVSIN 522
Query: 571 GQNLPLP-PPGNFLSATERWSYNDKLTIQLPL 601
G+++ G +L W D++ + LP+
Sbjct: 523 GKSVDADIKSGQYLELNRNWKKGDQIVLNLPM 554
>gi|302875896|ref|YP_003844529.1| hypothetical protein Clocel_3075 [Clostridium cellulovorans 743B]
gi|307689330|ref|ZP_07631776.1| hypothetical protein Ccel74_14336 [Clostridium cellulovorans 743B]
gi|302578753|gb|ADL52765.1| protein of unknown function DUF1680 [Clostridium cellulovorans
743B]
Length = 648
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 98/446 (21%), Positives = 162/446 (36%), Gaps = 62/446 (13%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF----------HANTHI 337
L +LY +T++ K+L L+ F +P + + D +SHF + H
Sbjct: 197 LVKLYDVTNNSKYLALSKYFIDQRGQEPNYFKEEYEKRDGVSHFLKTKIPLDLPYNQAHK 256
Query: 338 PI-----VIGSQMR--YEVTG----------DPLYKLIGTFFMDIVNASHSYATGGTSA- 379
P+ +G +R Y +G + L K T F +I + Y TGG +
Sbjct: 257 PVREQEVAVGHAVRAVYMYSGMADIAAKTNDETLKKACETIFNNIKD-KQMYITGGVGST 315
Query: 380 ---REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS 436
F +D DT+ SE TC ++ ++ + + ++ YAD ERAL N V S
Sbjct: 316 AHGEAFTYDYDLPNDTVYSE---TCAAIGLIFFAQRMLKLDQDRKYADVLERALYNTVTS 372
Query: 437 IQRGTEPGVMIYMLPL------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSI 490
+ Y+ PL R K+ CC + LG I
Sbjct: 373 -GMALDGRHFFYVNPLEVQPEASEKSPIKRHVKAERQKWYGCACCPPNVARLLTSLGQYI 431
Query: 491 YFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS 550
Y E + + YI S D+ V N+KV + + TF
Sbjct: 432 YTESNDTI---FTHLYIGSKADF----TVNNKKVTVKQTTNYPSEGKATFVFDMSENNEF 484
Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQD 610
+ LR+P W N N + L +L T + +D + I + + A
Sbjct: 485 TFALRIPEWC-KNYKIFINNEEYRELDLNKGYLYITREFLNSDVVEISMEIETVLVASNP 543
Query: 611 DRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGN 670
A AI GP + + E D + L P+ +N +++ E
Sbjct: 544 LVRANAGKVAICRGPLV---YCLEEIDNCKNLSSILIDTSKPVKEQYNPEVLGGAIELKA 600
Query: 671 STFVMSNSNQ----SITMEEFPVSGT 692
S +++S+ +Q S ++E P + T
Sbjct: 601 SGYIVSSESQDLYTSFNVKEMPFNIT 626
>gi|410616495|ref|ZP_11327487.1| hypothetical protein GPLA_0709 [Glaciecola polaris LMG 21857]
gi|410164204|dbj|GAC31625.1| hypothetical protein GPLA_0709 [Glaciecola polaris LMG 21857]
Length = 659
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 48/221 (21%), Positives = 92/221 (41%), Gaps = 32/221 (14%)
Query: 399 ETCTTYNMLKVSRHLFRW-----TKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG 453
ETC +S +F W T E +AD E L N + + TE Y PL
Sbjct: 340 ETCAN-----ISNAMFNWRLLGITGEAKHADVIELVLHNSAM-VGISTEGDKYFYANPLR 393
Query: 454 RGVSKAR-STHGWGTK------FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQY 506
+ S H T+ + +CC + + +++ Y + GL + +
Sbjct: 394 MNFGQREYSDHCDCTESPDREAYIECFCCPPNLVRTIAQVSAWAYSLTD---VGLAVNLF 450
Query: 507 ISSSFDWK---SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
S++ + K + L+Q+ D WD + + + ++ L + +R+P W +
Sbjct: 451 GSNALNTKLLDGSTLRLSQQTD--FPWDGKVALKI----EECKSALFDIQIRIPSW--AK 502
Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
GA S+NG+ +P+ G + +W D +T+ +P+ ++
Sbjct: 503 GATLSVNGETIPVVEAGQYTKIERQWQAGDNITLNMPMDIQ 543
>gi|423348680|ref|ZP_17326362.1| hypothetical protein HMPREF1060_04034 [Parabacteroides merdae
CL03T12C32]
gi|409213201|gb|EKN06225.1| hypothetical protein HMPREF1060_04034 [Parabacteroides merdae
CL03T12C32]
Length = 679
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 86/424 (20%), Positives = 162/424 (38%), Gaps = 43/424 (10%)
Query: 235 ILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVERHWYSLNEETGGMN-DVL 293
++ ++ QY A Q ++ +M YF ++ ++ + W E+ GG N V+
Sbjct: 163 VMLKVMQQYYTA--TQDRRVIDFMTRYFRYQLDELPK--NPLGKWTFWGEQRGGDNLMVV 218
Query: 294 YRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRYEV 349
Y LY+IT D L L L K F + L ++L H+ + + G + + Y+
Sbjct: 219 YWLYNITGDKFLLDLGELIHKQTFNWTDIFLNQNHLRRQHSLHCVNLAQGFKEPIVYYQQ 278
Query: 350 TGDPLYKLIGTFFMDIVNASHSYA--TGGTSAREFWWDPKRLADTLGSENEETCTTYNML 407
D K I + + H+ TG W + L + E CT M+
Sbjct: 279 GKDS--KQIQATRQAVNDIRHTIGLPTG------LWGGDELLRFGKPTTGSELCTAVEMM 330
Query: 408 KVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGT 467
+ T ++ +ADY ER N L Q + Y + ++ R + T
Sbjct: 331 YSLETILEVTGDMQWADYLERVAYNA-LPTQVTDDYSARQYYQQTNQ-IAVTREWREFST 388
Query: 468 ----------KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWK-SG 516
+ + CC + + K ++++ N GL + + S + +G
Sbjct: 389 PHDDTDLLFGELTGYPCCTSNLHQGWPKFVQNLWYATADN--GLASLLFAPSQVTARVAG 446
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
+ +N K + ++ +R ++F+ K+ +LR+P W NG+ L +
Sbjct: 447 GIEVNLKEETAYPFEETVRYHVSFTDKKVKKVFFPFHLRIPGW--CKQPVVKFNGKPLTV 504
Query: 577 PP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
PG W D L+++LP+ + Y + + GP + A + +
Sbjct: 505 DAYPGTVTRINREWKEGDILSLELPMEVTVSRW------YENSAVVERGPLVYALKMNEK 558
Query: 636 WDIK 639
W+ K
Sbjct: 559 WEKK 562
>gi|340619115|ref|YP_004737568.1| hypothetical protein zobellia_3150 [Zobellia galactanivorans]
gi|339733912|emb|CAZ97289.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
Length = 694
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 8/85 (9%)
Query: 520 LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP 579
L QK D WD +++T+ + L LR+P W + G Q +NG + P
Sbjct: 502 LTQKTD--YPWDGAVKITVDECKAEAFEVL----LRIPSW--AKGTQIKVNGTKVAKAQP 553
Query: 580 GNFLSATERWSYNDKLTIQLPLSLR 604
G F +W+ D++TI +P+ +
Sbjct: 554 GTFAKIERQWAEGDEITIDMPMETK 578
>gi|116625572|ref|YP_827728.1| hypothetical protein Acid_6519 [Candidatus Solibacter usitatus
Ellin6076]
gi|116228734|gb|ABJ87443.1| protein of unknown function DUF1680 [Candidatus Solibacter usitatus
Ellin6076]
Length = 631
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 57/261 (21%), Positives = 99/261 (37%), Gaps = 34/261 (13%)
Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNMLKVSR 411
D LYK+ D ++ H +G SA E A S+ E C +
Sbjct: 268 DSLYKMF-----DALDRYHGQPSGIFSADE------HFAGRDPSQGTELCAVVEAMFSLE 316
Query: 412 HLFRWTKEIAYADYYERALTNGVLSI--------QRGTEPGVMIYMLPLGRGVSKARSTH 463
+ A+ D E+ N + + Q + +I + R + ++
Sbjct: 317 QDMAIMGDAAFGDRLEKIAYNALPATLSPDLWAHQYDQQANQVICSISNRRWATNGPESN 376
Query: 464 GWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQK 523
+G + N F CC + + KL S++ N G + Y SG V + ++
Sbjct: 377 IFGLEPN-FGCCTANMHQGWPKLAASLWMAT--NDGGFAAVAYGPGEV--TSGGVTIEER 431
Query: 524 VDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFL 583
D P+ R ++ K + + L LR+P W +NGA ++NGQ PG F
Sbjct: 432 TD-----YPF-RENVSLLVKTD--KSFPLVLRIPAW--ANGATVAVNGQQQAGVKPGAFF 481
Query: 584 SATERWSYNDKLTIQLPLSLR 604
W D++ + P+++R
Sbjct: 482 RVQRAWRAGDRVELHFPMAVR 502
>gi|330996651|ref|ZP_08320529.1| hypothetical protein HMPREF9442_01616 [Paraprevotella xylaniphila
YIT 11841]
gi|329572723|gb|EGG54356.1| hypothetical protein HMPREF9442_01616 [Paraprevotella xylaniphila
YIT 11841]
Length = 800
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 81/355 (22%), Positives = 128/355 (36%), Gaps = 73/355 (20%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T D K+L A F D+ G + + Y + H P+V +G +R
Sbjct: 221 LAKLYIVTGDRKYLDEAKFFLDQ---RGHTSRRDAY-----SQAHKPVVEQDEAVGHAVR 272
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+TGD Y D + Y TGG A + G+
Sbjct: 273 ATYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATAN-------GEAFGA 325
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + V+ LF E Y D ER L NG++S + G
Sbjct: 326 NYELPNMSAYCETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFF 384
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y PL ++R H F CC L +Y ++ +V Y+ ++
Sbjct: 385 YPNPL-----ESRGQHQRQPWFGCA-CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFM 435
Query: 508 S--SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT----- 560
S ++ + VVL Q+ WD + S K+ + +L +R+P W
Sbjct: 436 SNEANLEVDKKGVVLEQQTR--YPWD----GDVAVSVKKNKAGVFALKIRIPGWVRGQVV 489
Query: 561 ------YSNGAQ----ASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
YS+G + +NGQ + + + RW DK+ + + R
Sbjct: 490 PSDLYRYSDGKRLGYSVKVNGQPVESGLQDGYFTIERRWKKGDKVEVHFDMEPRV 544
>gi|421589478|ref|ZP_16034616.1| hypothetical protein RCCGEPOP_11663, partial [Rhizobium sp. Pop5]
gi|403705566|gb|EJZ21118.1| hypothetical protein RCCGEPOP_11663, partial [Rhizobium sp. Pop5]
Length = 299
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 95/234 (40%), Gaps = 26/234 (11%)
Query: 422 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIE 481
YAD E+AL NG L T+ Y PL A H W K++ CC
Sbjct: 16 YADIMEQALYNGALP-GLSTDGKTFFYDNPL----ESAGKHHRW--KWHHCPCCPPNIAR 68
Query: 482 SFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG-HVVLNQKVDPIVSWDPYLRMTLTF 540
+ +G +Y + + +++ ++ +G V L Q + WD + F
Sbjct: 69 LVTSIGSYMYAVADDEI-AVHLYGESTARLKLANGAEVELEQATN--YPWD----GAVAF 121
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQ 598
+++ +L+LR+P W + GA S+NG L L + W+ D++ +
Sbjct: 122 TTRLTKPARFALSLRIPDW--AEGATLSVNGAMLDLGAHVRDGYARINREWADGDRVALY 179
Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
LPL+LR + + A A++ GP + T T L+A++ P
Sbjct: 180 LPLALRPQYANPKVRQDAGRVALMRGPLVYCVET-------TDNGADLNAIVLP 226
>gi|160934492|ref|ZP_02081878.1| hypothetical protein CLOLEP_03364 [Clostridium leptum DSM 753]
gi|156865945|gb|EDO59317.1| hypothetical protein CLOLEP_03364 [Clostridium leptum DSM 753]
Length = 650
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 76/184 (41%), Gaps = 15/184 (8%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL----GR 454
E+C + ++ ++ + T E Y D ERAL N VL E Y+ PL
Sbjct: 334 ESCASVGLMMFAQRMASLTGEAVYYDVVERALCNTVLG-GISKEGKRYFYVNPLEVWPQN 392
Query: 455 GVSKARSTHGWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
++ H + F CC + + LG IY + E + LY+ Q+ISSS
Sbjct: 393 CLASTSMAHVKPVRQKWFGCACCPPNIARTLASLGQYIYAQSEDS---LYVNQFISSSSA 449
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
+ G + +D D +R+T ++E +L LR+ + Y +NG+
Sbjct: 450 VEIGGQEIEFSMDSTYMKDGAVRITAKCGKREE-----ALYLRVRIPEYFKKPTLKVNGK 504
Query: 573 NLPL 576
+ L
Sbjct: 505 DATL 508
>gi|449137673|ref|ZP_21772993.1| protein containing DUF1680 [Rhodopirellula europaea 6C]
gi|448883726|gb|EMB14239.1| protein containing DUF1680 [Rhodopirellula europaea 6C]
Length = 688
Score = 43.1 bits (100), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 71/273 (26%), Positives = 109/273 (39%), Gaps = 47/273 (17%)
Query: 347 YEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPK---------RLADTLG-- 394
Y TGD L+ + T + ++V+ Y TGG A P R+ G
Sbjct: 304 YAETGDKALWSSLETIWRNVVD-KKMYITGGCGALHDGASPDGSKNQREITRVHQAFGRN 362
Query: 395 ------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVM 446
+ + ETC + + +F + E + D E AL N VLS GT
Sbjct: 363 YQLPNATAHNETCANIGNVLWNWRMFLASGEAKHIDTLELALYNSVLSGVDLNGTN---F 419
Query: 447 IYMLPLGRGVSKARSTHGW--GTK-FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYI 503
Y+ PL R A W G K F + +CC + + +G Y + V ++
Sbjct: 420 FYINPL-RQSDMAPVALRWAGGRKPFVTSFCCPPNLARTIAGVGQYAYGKSNDTV---WV 475
Query: 504 IQYISSSFDWK---SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT 560
Y S++ D K SGHV + Q WD + +T+ Q + L LR+P WT
Sbjct: 476 NLYGSNTLDTKLIDSGHVRIEQTTG--YPWDGRIEITIAECQNQPM----CLKLRIPGWT 529
Query: 561 YSNGAQASLNGQNLPLPP---PGNFLSATERWS 590
+ A++N +P PG+++S WS
Sbjct: 530 TT----ATVNIDGVPTDAKIEPGSYVSLKRVWS 558
>gi|423214410|ref|ZP_17200938.1| hypothetical protein HMPREF1074_02470 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692825|gb|EIY86061.1| hypothetical protein HMPREF1074_02470 [Bacteroides xylanisolvens
CL03T12C04]
Length = 679
Score = 43.1 bits (100), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 52/254 (20%), Positives = 98/254 (38%), Gaps = 24/254 (9%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
E CT M+ ++ T + +AD ER N L Q + Y + + ++
Sbjct: 320 ELCTAVEMMYSLENMLEITGNMQWADQLERIAYNA-LPTQISDDAQARQYYQQVNQ-IAV 377
Query: 459 ARSTHGWGT----------KFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
H + T + CC + + K +++ N G+ + Y S
Sbjct: 378 VNDYHNFSTPHEGTDNLFGTLTGYPCCSSNLHQGWPKFVQHLWYATVDN--GVAALVYAS 435
Query: 509 SSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
S + + ++++N K + +D + ++T+ K+ +LR+P W
Sbjct: 436 SEVKMQVANNILVNIKEETYYPFDETVSFSITYPDKKIKKATFPFHLRVPEW--CKKPIV 493
Query: 568 SLNGQNLPLPPPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPY 626
+LNGQ + G + R W NDK+TI+ P ++ D + GP
Sbjct: 494 NLNGQTIKTDVTGERMIILNREWQQNDKITIEFPATISISHWFDGG------AVVERGPL 547
Query: 627 LLAGHTSGEWDIKT 640
+ A + +W+ KT
Sbjct: 548 VYALKLNEKWEKKT 561
>gi|375146847|ref|YP_005009288.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361060893|gb|AEV99884.1| protein of unknown function DUF1680 [Niastella koreensis GR20-10]
Length = 674
Score = 43.1 bits (100), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 106/490 (21%), Positives = 176/490 (35%), Gaps = 91/490 (18%)
Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG-------------TGYLSAFPTELF 216
L ++A T + ++ + T + +++ CQ G T AF L
Sbjct: 107 LEGVTSLYAVTKDKNLEVMLDTAIATIAACQRADGYIHTPVLIEERKATNKEKAFADRL- 165
Query: 217 DSFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEY---FYNRVQKVITMY 273
+FE Y H + AG + Y + L +A +Y FY R +
Sbjct: 166 -NFET-------YNLGHLMTAGCI-HYRVTGKRTLLDVAIKAADYLDNFYKRASPELARN 216
Query: 274 SV-ERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLA-HLFDKPCFLGFLALQAD----- 326
++ H+ + E LY T DPK+L LA +L + G + D
Sbjct: 217 AICPSHYMGVVE-----------LYRTTRDPKYLQLAINLIN---IRGLVEEGTDDNQDR 262
Query: 327 --YLSHFHANTHIP----IVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSA 379
+ A H + G Y TGD L + + + D+VN Y TGG A
Sbjct: 263 VPFRQQMEAMGHAVRANYLYAGVADVYAETGDDSLMTCLNSIWNDVVN-KKLYVTGGCGA 321
Query: 380 REFWWDP----------KRLADTLG--------SENEETCTTYNMLKVSRHLFRWTKEIA 421
P ++ G + + ETC L + + + +
Sbjct: 322 LYDGVSPYGTSYKPPVIQKTHQAYGRAYQLPNITAHNETCANIGNLLWNWRMLLLSGDAK 381
Query: 422 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH----GWGTKFNSFWCCYG 477
YAD E L NG+LS + Y PL + G CC
Sbjct: 382 YADVMELELYNGILS-GISLDGNNFFYTNPLSHSADYPYTLRWQEAGRVPYIKLSNCCPP 440
Query: 478 TGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRM 536
+ + +++GD Y +G LY IS+ + S + Q P WD +++
Sbjct: 441 NTVRTMAEVGDYAYTTSNKGLWVHLYGANKISTKLEDGSALEMTQQSNYP---WDGHIKF 497
Query: 537 TLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLP-PPGNFLSATERWSYND-- 593
T+T + + SL LR+P W + A ++NG+ + P P ++ W D
Sbjct: 498 TVTKAEAKAF----SLYLRIPGW--CDKAALTVNGKPVTGPNKPATYVELNRAWKAGDVV 551
Query: 594 KLTIQLPLSL 603
+L + +P++L
Sbjct: 552 ELNLSMPVTL 561
>gi|300854538|ref|YP_003779522.1| hypothetical protein CLJU_c13520 [Clostridium ljungdahlii DSM
13528]
gi|300434653|gb|ADK14420.1| conserved hypothetical protein [Clostridium ljungdahlii DSM 13528]
Length = 658
Score = 43.1 bits (100), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 105/496 (21%), Positives = 186/496 (37%), Gaps = 80/496 (16%)
Query: 159 SELRGHFVG---------HYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLS 209
S+++GH G +L A A N +K+ ++ ++E Q GYLS
Sbjct: 70 SKIKGHHSGFPFQDTDVYKWLEAVAYSLRYHPNDDLKQIADKLIDLIAEAQEY--DGYLS 127
Query: 210 A-FPTELFD-SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQ 267
F E + F+ LK Y H I A + Y + N +AL +A M + N
Sbjct: 128 TYFQIEAPERKFKRLKQSHELYTMGHYIEAAVA-YYQVTGNEKALNIARKMADCIDNN-- 184
Query: 268 KVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLFDK-----PCFLGFLA 322
+ +E+ + + L RLY +TH+ K+L LA+ F K P F
Sbjct: 185 -----FGLEKGKIPGYDGHPEIELALSRLYELTHEKKYLNLAYYFLKQRGQDPKFFDHQI 239
Query: 323 LQADY------------LSHF--------------HANTHIPIVIGSQMRYEVTGDPLYK 356
Q + LS++ HA + + G +TGD
Sbjct: 240 EQDGFDHDLIEGMRNFPLSYYQAAEPIVDQETAEGHAVRVVYLCTGIAYVARLTGDQDLL 299
Query: 357 LIGTFFMDIVNASHSYATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRH 412
+ F + + Y TG T+ F +D DT+ E TC + M ++
Sbjct: 300 TVCKRFWNNIVKKRMYVTGNIGSTTTGESFTYDYDLPNDTMYGE---TCASVGMTFFAKQ 356
Query: 413 LFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKAR----STHGWGTK 468
+ + E Y D E+ L NG LS + Y+ PL + ++ +H +
Sbjct: 357 MLQIEPEGEYGDILEKELFNGSLS-GISLDGKHFFYVNPLEADPTASKGNPGKSHILTRR 415
Query: 469 FNSFWC-CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPI 527
+ F C C + + D + G+ + Q+IS+ ++ + ++ P
Sbjct: 416 ADWFGCACCPSNVARLIASVDQYIYTVHGST--ILSHQFISNEANFDNNISIIQSNNFP- 472
Query: 528 VSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATE 587
WD +++ K +R+P W+ N + +N +++ LP F+
Sbjct: 473 --WDG----NISYKIKNPGENKFKFGIRIPSWSQCN-YKLQVNKKDVNLPVKSGFV---- 521
Query: 588 RWSYNDKLTIQLPLSL 603
+ + + +Q+ LSL
Sbjct: 522 -YIFVESSQMQIDLSL 536
>gi|297545103|ref|YP_003677405.1| hypothetical protein Tmath_1689 [Thermoanaerobacter mathranii
subsp. mathranii str. A3]
gi|296842878|gb|ADH61394.1| protein of unknown function DUF1680 [Thermoanaerobacter mathranii
subsp. mathranii str. A3]
Length = 648
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 82/376 (21%), Positives = 141/376 (37%), Gaps = 58/376 (15%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLAL------QADYLSHFHANTHIPI-- 339
L +LY +T + K+L L+ F +KP + A + S+F H+P+
Sbjct: 199 LVKLYRVTGEEKYLRLSKYFIDERGEKPLYFEIEAKARGDEWDEQWASYFQ--VHLPVRE 256
Query: 340 ---VIGSQMRYEV-----------TGDPLYKLIGTFFMDIVNASHSYATGGTSA----RE 381
G +R TGD D + Y TGG +
Sbjct: 257 QTSAEGHAVRAAYLYSGMVDVAVETGDESLIQACKKLWDNITTKRMYITGGIGSSSFGEA 316
Query: 382 FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGT 441
F +D DT+ +E TC ++ + + + + YAD ERAL N V+S
Sbjct: 317 FTFDFDLPNDTVYAE---TCAAIGLVFFAHRMLQIDPDRRYADVMERALYNSVIS-GMSL 372
Query: 442 EPGVMIYMLPL-------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEE 494
+ Y+ PL + KA + F CC + LG IY
Sbjct: 373 DGKKYFYVNPLEVWPEACEKNKVKAHVKYTRQPWFKCA-CCPPNLARLLASLGKYIYSIR 431
Query: 495 EGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNL 554
+ LY+ Y+ S K + + + WD R+ + ++E+ +L L
Sbjct: 432 DNE---LYVHLYVDSEVQTKISENEVKVRQETEYPWDG--RIVINILPERELD--FTLAL 484
Query: 555 RMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLS-LRTEAIQDD 611
R+P W A+ S+NG+ + + + W D++ + L ++ +R +A +
Sbjct: 485 RIPGWC--KDAKVSVNGEEIDISGIMDKGYAKIKRLWKPGDRIELLLSMTVMRVKANPNV 542
Query: 612 RPEYASIQAILFGPYL 627
R + + AI GP +
Sbjct: 543 REDEGRV-AIQRGPVI 557
>gi|424879315|ref|ZP_18302950.1| hypothetical protein Rleg8DRAFT_5297 [Rhizobium leguminosarum bv.
trifolii WU95]
gi|392519986|gb|EIW44717.1| hypothetical protein Rleg8DRAFT_5297 [Rhizobium leguminosarum bv.
trifolii WU95]
Length = 647
Score = 43.1 bits (100), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 104/482 (21%), Positives = 178/482 (36%), Gaps = 95/482 (19%)
Query: 167 GHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAF-----PTELFDSFEA 221
G ++ A++ + NA ++ K+ +V L + Q + GYL+++ P + +
Sbjct: 90 GKWIEAASYTLKAHPNAALETKIDAIVEKLEKGQ--MADGYLNSWFIRREPDRRWTNLRD 147
Query: 222 LKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER---H 278
L + Y++ +L G + Y + L + V++ +I + E
Sbjct: 148 LHEM----YSMGHLLEGAVAYYEATGKRRFLDVMIRAVDH-------IIETFGAEPGKLR 196
Query: 279 WYSLNEETGGMNDVLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL 328
Y +EE + L +LY +T DP+HL LA F P + A + DY+
Sbjct: 197 GYDAHEE---IELALVKLYRVTGDPRHLKLATYFVDERGRMPSYYDEEARKRGESPEDYV 253
Query: 329 --SHFHANTHIPI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASH 370
++ ++ H+P+ V+G +R DP K D + +
Sbjct: 254 YKTYAYSQAHLPVRDQHQVVGHAVRAMYLFSAMADLSRENDDPTLKEACDRLFDNLVSRQ 313
Query: 371 SYATGGTS--------AREFWWDPKRLADTLGSEN--EETCTTYNMLKVSRHLFRWTKEI 420
Y TGG REF L +E ETC + S + + +
Sbjct: 314 LYVTGGLGPSASNEGFTREF---------DLPNETAYAETCAAVALGFWSHRMAQVDLDS 364
Query: 421 AYADYYERALTNGVLS-IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC-CYGT 478
+ D E L NG LS I R E +L +HG ++ +C C T
Sbjct: 365 KFTDRLETVLYNGALSGISRDGERYFYENVL----------ESHGQHRRWKWHYCPCCPT 414
Query: 479 GIESF-SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMT 537
I F + LG YF G+ L + Y ++S + G + + WD + +
Sbjct: 415 NIARFITSLGQ--YFYSTGD-HQLAVHLYGTNSAELTVGDSFVRLIQETQYPWDGDISLR 471
Query: 538 LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKL 595
L LR+P W AQ S+NG + L + + + W D++
Sbjct: 472 FAVERPSRF----QLRLRIPGWC--RQAQISVNGVAVDLDQCVTKGYAAISREWRNGDEV 525
Query: 596 TI 597
I
Sbjct: 526 RI 527
>gi|317492212|ref|ZP_07950641.1| hypothetical protein HMPREF0864_01405 [Enterobacteriaceae bacterium
9_2_54FAA]
gi|316919551|gb|EFV40881.1| hypothetical protein HMPREF0864_01405 [Enterobacteriaceae bacterium
9_2_54FAA]
Length = 661
Score = 42.7 bits (99), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 59/251 (23%), Positives = 93/251 (37%), Gaps = 42/251 (16%)
Query: 372 YATGG----TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYE 427
Y TGG +S F D DT+ +E +C + ++ + + + + YAD E
Sbjct: 320 YITGGIGSQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFANRMLQMEGDSQYADVME 376
Query: 428 RALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW-------------- 473
RAL N VL + Y+ PL H FN +
Sbjct: 377 RALYNTVLG-GMALDGRHFFYVNPL--------EVHPKSIPFNHIYDHVKPIRQRWFGCA 427
Query: 474 CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
CC + +G IY + LYI Y+ + +G L + WD
Sbjct: 428 CCPPNIARILTSIGHYIYTQRSD---ALYINLYVGNETLLDNG---LKIAISGNYPWDE- 480
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYND 593
+++ +++ + Q +L LRMP W Q LNG+ +L W D
Sbjct: 481 -NVSVHIRTEKPLHQ--TLALRMPEWCEKPRVQ--LNGETCEDLLQRGYLHIAREWQDGD 535
Query: 594 KLTIQLPLSLR 604
+L I LP+ +R
Sbjct: 536 RLEIVLPMPVR 546
>gi|448360425|ref|ZP_21549056.1| hypothetical protein C481_00200 [Natrialba asiatica DSM 12278]
gi|445653038|gb|ELZ05910.1| hypothetical protein C481_00200 [Natrialba asiatica DSM 12278]
Length = 674
Score = 42.7 bits (99), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 69/178 (38%), Gaps = 14/178 (7%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + +R LF +T YAD ER L N VL + R + Y L +
Sbjct: 348 ETCAAIGSVFWNRRLFEFTGRARYADLIERTLYNAVL-VGRSRDGTEFFYDNRLASDGNH 406
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFE-EEGNVPGLYIIQYISSSFDWKSGH 517
R ++ CC + LG +Y E + LY+ QYI SS G
Sbjct: 407 HRQ------EWFECACCPPNIARVLAALGRYLYATGGESDERCLYVNQYIGSSATATIGD 460
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
V+ W+ + + + ++ E +L LR+P W +NG+ +P
Sbjct: 461 TVVELDQTSGFPWNGEVTLDVEPATPTEF----ALRLRVPSWC--EDVSIRVNGEAVP 512
>gi|408372126|ref|ZP_11169874.1| hypothetical protein I215_14481 [Galbibacter sp. ck-I2-15]
gi|407742435|gb|EKF54034.1| hypothetical protein I215_14481 [Galbibacter sp. ck-I2-15]
Length = 664
Score = 42.7 bits (99), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 83/381 (21%), Positives = 138/381 (36%), Gaps = 67/381 (17%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY IT + +L LA F D+ DY A H+P+ V+G +R
Sbjct: 241 LVKLYRITKNEDYLELARFFLDQRGHHDNRPSLGDY-----AQDHLPVTEQKEVVGHAVR 295
Query: 347 ----YEVTGDPLYKLIGTFFMDIVNA-------SHSYATGGTSAREFWWDPKRLADTLGS 395
Y D T +++ VN Y TGG A + G+
Sbjct: 296 AVYMYAGMTDIAAIDKDTAYLNAVNNLWDNMVNKKMYITGGIGA-------IHDGEAFGA 348
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGV 445
E ETC + + L T ++ Y D ER+L NG+LS GTE
Sbjct: 349 NYELPNLTAYSETCAAIGDVYWNHRLHNLTGDVKYMDVLERSLYNGLLSGISLSGTE--- 405
Query: 446 MIYMLPL-GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV-PGLYI 503
Y L G K ++ CC I L + +Y +++ + LY+
Sbjct: 406 FFYPNALESDGTYKFNRGSCTRQEWFDCSCCPTNMIRFLPSLPELVYSKKDDTIFVNLYV 465
Query: 504 IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW---- 559
+ D S +V++Q+ + WD + T+T + +L LR+P W
Sbjct: 466 AN--QAQIDLPSTSLVIDQQTN--YPWDGLVNFTVTPEKEANF----TLKLRIPGWLRNE 517
Query: 560 -----------TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
++ + +N Q + +++ W + L++ LP+ R
Sbjct: 518 VLPGTLYQYKDDMTSEFELKINDQLVDATLKDGYITINRDWKKGETLSLNLPMQPREVIT 577
Query: 609 QDDRPEYASIQAILFGPYLLA 629
D + A+ +GP + A
Sbjct: 578 NDKVEDNLGKLALEYGPIVYA 598
>gi|344201929|ref|YP_004787072.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
gi|343953851|gb|AEM69650.1| protein of unknown function DUF1680 [Muricauda ruestringensis DSM
13258]
Length = 656
Score = 42.7 bits (99), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 85/391 (21%), Positives = 139/391 (35%), Gaps = 87/391 (22%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY T ++ LA F D L DY + H+P+ V+G +R
Sbjct: 228 LIKLYQTTGKKEYFDLAKYFLDHRGKSEHHQLFGDY-----SQDHVPVTEQDEVVGHAVR 282
Query: 347 Y-----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLG 394
+ D Y K + + ++VN Y TGG A K + G
Sbjct: 283 AVYMYAGMTDIAAIEKDTAYLKAVNALWDNMVN-KKMYITGGIGA-------KHEGEAFG 334
Query: 395 SENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVM 446
E ETC + + L T ++ Y D ER L NG++S G
Sbjct: 335 ENYELPNLTAYNETCAAIGDVYWNHRLHNLTGDVKYFDVIERTLYNGLIS---GLSLDGQ 391
Query: 447 IYMLPLG---RGVSKARSTHGWGTKFNSFWC-CYGTGIESF---------SKLGDSIYFE 493
+ P GV K G T+ + F C C T + F SK D+IY
Sbjct: 392 KFFYPNALESDGVYKF--NQGACTRKDWFDCSCCPTNVIRFLPAMPGLIYSKTDDTIYV- 448
Query: 494 EEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLN 553
LY ++ + K V L+Q+ WD +++ + + K + ++
Sbjct: 449 ------NLYAAN--GATVNLKDRAVKLSQETK--YPWDGKVKLMVDPTEKGKF----TIK 494
Query: 554 LRMPVW---------------TYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQ 598
R+P W + + SLNG+ L L + + + W D + ++
Sbjct: 495 FRVPGWARNKVLPGNLYQYATVINKKNKISLNGEELDLQAGDGYFTIAKEWEKGDVVELE 554
Query: 599 LPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
P+ +R E ++ +GP + A
Sbjct: 555 FPMEVRKVEANQLVEENKDKMSLEYGPMVYA 585
>gi|399031138|ref|ZP_10731277.1| hypothetical protein PMI10_03155 [Flavobacterium sp. CF136]
gi|398070607|gb|EJL61899.1| hypothetical protein PMI10_03155 [Flavobacterium sp. CF136]
Length = 673
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 100/239 (41%), Gaps = 29/239 (12%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTE-----PGVMIYMLP 451
ETC + + + + T + YAD E AL N VLS G E P + LP
Sbjct: 358 ETCANIGNVLWNWRMLQITGDAKYADIVELALYNSVLSGISLEGKEFFYNNPLNVSKDLP 417
Query: 452 LGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
+ SK R G+ N CC + +++ + Y F +E GLY+ Y S++
Sbjct: 418 FKQRWSKER--EGYIALSN---CCAPNVTRTIAEVSNYAYNFSKE----GLYVNLYGSNN 468
Query: 511 FDWKS---GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
+ K+ + + Q+ + WD + + + K+ L LR+P W S G
Sbjct: 469 LNSKTLAGEKIEIEQQTN--YPWDGKITLKIVKVPKEAYAFL----LRIPGW--SQGTTI 520
Query: 568 SLNGQNL-PLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
S+NG+N+ G++ ++W D + + +P+ + E + A+ GP
Sbjct: 521 SVNGKNINDAIVSGSYQKIAQKWKKGDVIELNIPMPVELMQANPLVEEVKNQVAVKRGP 579
>gi|374385208|ref|ZP_09642716.1| hypothetical protein HMPREF9449_01102 [Odoribacter laneus YIT
12061]
gi|373226413|gb|EHP48739.1| hypothetical protein HMPREF9449_01102 [Odoribacter laneus YIT
12061]
Length = 614
Score = 42.7 bits (99), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 72/349 (20%), Positives = 123/349 (35%), Gaps = 35/349 (10%)
Query: 293 LYRLYSITHDPKHLLLAH-LFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRY---- 347
L +LY T + +L LA L D+ DY + + G +R
Sbjct: 213 LVKLYRTTQNSAYLKLAQWLLDQRGHHKGDWKAKDYYQDLKPVRELSKISGHAVRAMYMF 272
Query: 348 -------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEE- 399
+T D Y++ + V Y TGG + + ++ NEE
Sbjct: 273 TGMADVAAITQDSGYRIALDRLWEDVVEKKMYLTGGIGSSRH---NEGFSEDYDLPNEEA 329
Query: 400 ---TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGV 456
TC + M+ ++ + E Y D ERA+ NG L+ Y+ PL
Sbjct: 330 YCETCASVGMVFWNQRMNMLKGESRYEDVLERAMYNGALA-GISLSGDRFFYVNPLASSG 388
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
R +GT CC +G+ IY E V ++ YI S + ++
Sbjct: 389 KHHRKAW-YGTA-----CCPSQISRFLPSVGNYIYALSENTV---WVNLYIGSETEVETS 439
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPL 576
V + K + + WD +TF + + LR+P W +NGQ
Sbjct: 440 GVTVALKQETLYPWDG----NVTFYVNPRESKDFKMKLRIPAWC--EKYVVKVNGQIEEG 493
Query: 577 PPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
++ W+ D + + + ++++ A A +A+ GP
Sbjct: 494 KKEKGYVVIDRLWAAGDVMELNMNMTVKVVAADPRVKANAGKRALQRGP 542
>gi|418468281|ref|ZP_13039095.1| hypothetical protein SMCF_2011 [Streptomyces coelicoflavus ZG0656]
gi|371551122|gb|EHN78456.1| hypothetical protein SMCF_2011 [Streptomyces coelicoflavus ZG0656]
Length = 796
Score = 42.7 bits (99), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 52/222 (23%), Positives = 87/222 (39%), Gaps = 36/222 (16%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEP-GVMIYMLPLGRGVS 457
ETC + L R T + +AD E N ++ +P G ++ + V
Sbjct: 305 ETCGVVEFMAAHELLVRITGDPVWADRCEDLAFN---ALPAALDPEGRAVHYVTSANSVD 361
Query: 458 --KARSTHG----------WGTKFNSFWCC---YGTGIESFSK---LGDSIYFEEEGNVP 499
AR T G + +++ CC YG G F++ LG + G
Sbjct: 362 LDNARKTQGQFQNGFAMQAYQPGVDNYRCCPHNYGMGWPYFTEELWLGTP----DRGLAA 417
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVD-PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPV 558
+Y ++++ V + + D P +TLT S + V L+LR+P
Sbjct: 418 AMYAPSRVTAAVGADGTRVTVTEDTDYPFDD-----TITLTVSGPRRVA--FPLSLRIPG 470
Query: 559 WTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
W G Q +NG+ +P F+ WS D++T++LP
Sbjct: 471 W--CEGPQVRVNGRPVPAADGPAFVRVERTWSDGDRVTLRLP 510
>gi|281425429|ref|ZP_06256342.1| conserved hypothetical protein [Prevotella oris F0302]
gi|281400422|gb|EFB31253.1| conserved hypothetical protein [Prevotella oris F0302]
Length = 673
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 121/324 (37%), Gaps = 54/324 (16%)
Query: 349 VTGDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--------E 399
+TGD Y K I + +IV+ + Y TGG AR + + G++ E E
Sbjct: 290 LTGDSAYIKAIDHIWNNIVSKKY-YLTGGVGARHY-------GEAFGADYELPNLTAYNE 341
Query: 400 TCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-GRGVSK 458
TC ++ LF + Y D ER L NGV+S + G Y PL G+ K
Sbjct: 342 TCAAIAQCYLNMRLFMLHGDSKYIDCLERTLYNGVIS-GMSIDGGRFFYPNPLSADGIYK 400
Query: 459 ARSTHGWGTKFNSFW---CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
+ T W C + + F + GN +Y+ ++ S + K
Sbjct: 401 FNADR---TTTRQLWFGCACCPSNLSRFIPSVPGYVYAVRGN--DVYVNLFMGSKANVKV 455
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT---------YS---- 562
G + + + WD + + + K + +SL +R+P W YS
Sbjct: 456 GGKEMKIETETNYPWDGKVAIRV----KGNANKHASLLIRIPGWARGEVTPGGLYSFTDK 511
Query: 563 --NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAIQDDRPEYA 616
+G ++NG+N D +T+ L + RT + + DDR
Sbjct: 512 QKDGWSIAVNGKNRNAGKLEKGYIRINNVKKGDVITLNLDMEPRTVVADKRVMDDR---- 567
Query: 617 SIQAILFGPYLLAGHTSGEWDIKT 640
A+ GP + ++ +KT
Sbjct: 568 GCVAVERGPLVYCAESADNNGMKT 591
>gi|336402464|ref|ZP_08583200.1| hypothetical protein HMPREF0127_00513 [Bacteroides sp. 1_1_30]
gi|335948631|gb|EGN10334.1| hypothetical protein HMPREF0127_00513 [Bacteroides sp. 1_1_30]
Length = 698
Score = 42.7 bits (99), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 84/347 (24%), Positives = 140/347 (40%), Gaps = 47/347 (13%)
Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
+ +Y T +P++L L+ +L D + + + Y + HA + G
Sbjct: 248 VVEMYRATGNPRYLELSKNLIDIRGMVENGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307
Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
Y TG+ L K + + + DIV Y TG GTS ++P +++ +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366
Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
G + + ETC + + + T + YAD E L N VLS +
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 425
Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
Y PL R + T W T++ S +CC + + + + Y EG
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484
Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
LY ++++ WK G + L Q+ D W+ +R+TL ++ G SL LR+P W
Sbjct: 485 LYGANTLTTT--WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW 538
Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
A ++NGQ L N + R W D +L + +P+ L
Sbjct: 539 --CEKATLTVNGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|423286830|ref|ZP_17265681.1| hypothetical protein HMPREF1069_00724 [Bacteroides ovatus
CL02T12C04]
gi|392674368|gb|EIY67816.1| hypothetical protein HMPREF1069_00724 [Bacteroides ovatus
CL02T12C04]
Length = 698
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 18/217 (8%)
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ + ETC + + + T + YAD E L N VLS + Y PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434
Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
+ T W T++ S +CC + + + + Y EG LY ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494
Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
WK G + L Q+ D W+ +R+TL ++ G SL LR+P W A ++
Sbjct: 495 --WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGTF-SLFLRIPEW--CEKATLTV 546
Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
NGQ L N + R W D +L + +P+ L
Sbjct: 547 NGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|410100001|ref|ZP_11294966.1| hypothetical protein HMPREF1076_04144 [Parabacteroides goldsteinii
CL02T12C30]
gi|409216556|gb|EKN09540.1| hypothetical protein HMPREF1076_04144 [Parabacteroides goldsteinii
CL02T12C30]
Length = 618
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 65/288 (22%), Positives = 117/288 (40%), Gaps = 31/288 (10%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-GRGVS 457
ETC + M+ + + + T + Y D ER++ NGVL+ Y+ PL +G
Sbjct: 336 ETCASVGMVFWNHRMNQITGDAKYIDILERSMYNGVLA-GISLSGDRFFYVNPLESKGDH 394
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI--SSSFDWKS 515
+ +G CC +G+ IY + L++ YI ++ F
Sbjct: 395 HRQEWYGCA-------CCPSQLSRFLPTIGNYIYAISD---DALWVNLYIGNTTRFTLND 444
Query: 516 GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
+V+L Q+ + WD ++ LT SS +++ + + LR+P W ++NG+ +
Sbjct: 445 DNVILRQETN--YPWDGSVK--LTVSSTKDLDK--EIRLRIPGW--CKNYTITINGKEVG 496
Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGE 635
L + + W D +++ + + + E+ E +AI GP + + + E
Sbjct: 497 LSQEKGY-AIVYDWKPGDMISLDMDMPVEVESADPLVTENIGKRAIQRGPLV---YCAEE 552
Query: 636 WDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMSNSNQSIT 683
D R + SF A L+ +G T N QSIT
Sbjct: 553 TDNSAYFDRLTLTSDTEYHTSFEAGLL-----NGVKTINAKNEQQSIT 595
>gi|293371493|ref|ZP_06617913.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292633530|gb|EFF52093.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 698
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 18/217 (8%)
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ + ETC + + + T + YAD E L N VLS + Y PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434
Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
+ T W T++ S +CC + + + + Y EG LY ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494
Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
WK G + L Q+ D W+ +R+TL ++ G SL LR+P W A ++
Sbjct: 495 --WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW--CEKATLTV 546
Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
NGQ L N + R W D +L + +P+ L
Sbjct: 547 NGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|295084107|emb|CBK65630.1| Uncharacterized protein conserved in bacteria [Bacteroides
xylanisolvens XB1A]
Length = 698
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 18/217 (8%)
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ + ETC + + + T + YAD E L N VLS + Y PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434
Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
+ T W T++ S +CC + + + + Y EG LY ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494
Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
WK G + L Q+ D W+ +R+TL ++ G SL LR+P W A ++
Sbjct: 495 --WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW--CEKATLTV 546
Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
NGQ L N + R W D +L + +P+ L
Sbjct: 547 NGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|326799752|ref|YP_004317571.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326550516|gb|ADZ78901.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
Length = 679
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 111/489 (22%), Positives = 186/489 (38%), Gaps = 97/489 (19%)
Query: 170 LSASAQMWASTHNATIKEKMSTVVFSLSECQNKIG------------TGYLSAFPTELFD 217
L A A ++A T + + +KM V+ +++ Q + G TG + F L
Sbjct: 110 LEAVASLYAVTKDPALDKKMDEVIKTIALSQREDGYIYTLSMIQQRKTGVKNQFEDRL-- 167
Query: 218 SFEALKPVWAPYYTIHKILAGLLDQYVLADNAQALKMATWMVEYFYNRVQKVITMYSVER 277
SFEA Y I ++ Y L +A +Y Y R K + ++ R
Sbjct: 168 SFEA--------YNIGHLMTAACVHYRATGKRNLLDVAIKATDYLY-RFYKSASP-TLAR 217
Query: 278 HWYSLNEETGGMNDVLYRLYSITHDPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTH 336
+ + G + +Y D ++L LA HL D G + D
Sbjct: 218 NAICPSHYMG-----VVEMYRTLGDKRYLELAKHLID---IKGQIEDGTD-----DNQDR 264
Query: 337 IPI-----VIGSQMR-----------YEVTGD-PLYKLIGTFFMDIVNASHSYATGG--- 376
IP V+G +R Y TGD L+ + + D V + Y TGG
Sbjct: 265 IPFREQQKVMGHAVRANYLYAGVADVYAETGDTSLFNQLHKMWTD-VTSHKMYITGGCGS 323
Query: 377 ----TSAREFWWDPK---RLADTLGSE--------NEETCTTYNMLKVSRHLFRWTKEIA 421
S +DPK ++ G + + ETC + + + T
Sbjct: 324 LYDGVSPDGTSYDPKEVQKIHQAYGRDYQLPNFTAHNETCANIGNMLWNWRMLLLTGNAK 383
Query: 422 YADYYERALTNGVLS-----IQR--GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWC 474
+AD E AL N VLS +R T P LP + SK R + + C
Sbjct: 384 FADVLELALYNSVLSGISLDGERFLYTNPLAYSDKLPFKQRWSKDRVPYIALSN-----C 438
Query: 475 CYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPY 533
C + + +++ + Y +EG LY + +S G V L Q+ WD
Sbjct: 439 CPPNVVRTLAEVHNYFYSISDEGIWINLYGGSELKTSLP-NGGTVKLKQET--AYPWDGA 495
Query: 534 LRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL-PLPPPGNFLSATERWSYN 592
+++ + + K + SL LR+P W ++ A +NGQ++ + PG++ +W
Sbjct: 496 IKVVVEEAVKDDF----SLFLRIPGW--ADQAMIQVNGQDVDKVLKPGSYTMIRRKWKKG 549
Query: 593 DKLTIQLPL 601
D + +++P+
Sbjct: 550 DVVFLKMPM 558
>gi|160882339|ref|ZP_02063342.1| hypothetical protein BACOVA_00287 [Bacteroides ovatus ATCC 8483]
gi|156112253|gb|EDO13998.1| hypothetical protein BACOVA_00287 [Bacteroides ovatus ATCC 8483]
Length = 698
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 18/217 (8%)
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ + ETC + + + T + YAD E L N VLS + Y PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434
Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
+ T W T++ S +CC + + + + Y EG LY ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494
Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
WK G + L Q+ D W+ +R+TL ++ G SL LR+P W A ++
Sbjct: 495 --WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW--CEKATLAV 546
Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
NGQ L N + R W D +L + +P+ L
Sbjct: 547 NGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|395803606|ref|ZP_10482850.1| hypothetical protein FF52_17068 [Flavobacterium sp. F52]
gi|395434160|gb|EJG00110.1| hypothetical protein FF52_17068 [Flavobacterium sp. F52]
Length = 682
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 56/236 (23%), Positives = 97/236 (41%), Gaps = 23/236 (9%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + + + + T + YAD E AL N VLS E +Y PL VS
Sbjct: 367 ETCANIGNVLWNWRMLQITGDAKYADIVELALYNSVLS-GMNLEGDKFLYNNPL--NVSN 423
Query: 459 ARSTHG-WGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
H WG + CC + +++G+ Y + GLY+ Y S++ +
Sbjct: 424 DLPFHQRWGNVREGYIALSNCCAPNVTRTVAEVGNYAYNLSKD---GLYVNLYGSNTLNT 480
Query: 514 KS--GHVV-LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
K+ G + + Q+ + WD + + + + K L + LR+P W S A+ S+N
Sbjct: 481 KTLNGETLEIEQQTN--YPWDGKVTLKILKAPK----DLQNFFLRIPGW--SQNAEVSVN 532
Query: 571 GQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+ G +L ++W D + + +P+ + E + A+ GP
Sbjct: 533 NSKISDKIVSGTYLKLNQKWKKGDVIELNMPMPVELMEANPLVEEVKNQVAVKRGP 588
>gi|253574873|ref|ZP_04852213.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
gi|251845919|gb|EES73927.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
Length = 665
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 79/361 (21%), Positives = 133/361 (36%), Gaps = 68/361 (18%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFL----------GFLALQADYLSHFHANTHI 337
L +LY +T ++L L+ F KP F A AD++ + H+
Sbjct: 208 LVKLYEVTGQERYLRLSQYFLEQRGQKPSFFEEELKRRGGQTHWAGHADHVDLTYHQAHL 267
Query: 338 PI-----VIGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSA-- 379
P+ +G +R +TGD D + Y TGG +
Sbjct: 268 PVREQETAVGHAVRLLYMLTGMADVAALTGDESMLAACRKLWDNIVGKQMYITGGVGSMP 327
Query: 380 --REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSI 437
F +D DT+ SE TC + ++ ++ + R + + YA+ ERAL N V+
Sbjct: 328 QGEAFSFDYDLPNDTVYSE---TCASIGLIFFAQRMLRISPDSRYANVMERALYNTVVG- 383
Query: 438 QRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSF------W----CCYGTGIESFSKLG 487
+ Y+ PL ++ G KF+ W CC + LG
Sbjct: 384 GMARDGKHFFYVNPL---EVDPKACGGANHKFDHIKTVRQEWFGCACCPPNIARLLASLG 440
Query: 488 DSIYFEEEGNVPGLYIIQYISSSFDWKS--GHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
+ IY + V Y YI + ++ G V L Q + W +R F + E
Sbjct: 441 EYIYTVQGDTV---YAHLYIGGEAELQTSGGKVKLTQTTN--YPWGGNVR----FEVQPE 491
Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPP---GNFLSATERWSYNDKLTIQLPLS 602
+L LR+P W A +NG+ + L ++ +W D + ++L +
Sbjct: 492 GEGRFTLALRLPDWCPE--ASLQVNGEVVELEGALLQDGYIRLARQWCAGDVVELKLAMP 549
Query: 603 L 603
+
Sbjct: 550 V 550
>gi|401761699|ref|YP_006576706.1| hypothetical protein ECENHK_00925 [Enterobacter cloacae subsp.
cloacae ENHKU01]
gi|400173233|gb|AFP68082.1| hypothetical protein ECENHK_00925 [Enterobacter cloacae subsp.
cloacae ENHKU01]
Length = 649
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 77/364 (21%), Positives = 125/364 (34%), Gaps = 73/364 (20%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF-------------HAN 334
L RLY +T +P++L L F +P F + S++ ++
Sbjct: 193 LMRLYDVTQEPRYLNLVKYFIEERGTQPHFYDIEYEKRGKTSYWNTYGPAWMVKDKAYSQ 252
Query: 335 THIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG-- 376
H P+ IG +R+ ++GD + + + Y TGG
Sbjct: 253 AHQPLAEQQTAIGHAVRFVYLMAGMAHLARLSGDEGKRQDCLRLWNNMAQRQLYITGGIG 312
Query: 377 --TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
+S F D DT+ +E +C + ++ +R + + YAD ERAL N V
Sbjct: 313 SQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSHYADVMERALYNTV 369
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGI 480
L + Y+ PL H FN + CC
Sbjct: 370 LG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIYDHVKPVRQRWFGCACCPPNIA 420
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
+ LG IY L I Y+ + + L ++ W + + +T
Sbjct: 421 RVLTSLGHYIYTVRPD---ALLINLYVGNDVAIQIDENTLRLRISGNYPWQDQVTIEIT- 476
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
V +L LR+P W SLNG+ + +L W D LT+ LP
Sbjct: 477 ---SPVPVTHTLALRLPDWCAEPA--VSLNGERVTGEVVRGYLYLNRCWHEGDTLTLTLP 531
Query: 601 LSLR 604
+ +R
Sbjct: 532 MPVR 535
>gi|294643636|ref|ZP_06721438.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294808056|ref|ZP_06766829.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|292641013|gb|EFF59229.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294444697|gb|EFG13391.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 698
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 18/217 (8%)
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ + ETC + + + T + YAD E L N VLS + Y PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGKKYFYTNPL-R 434
Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
+ T W T++ S +CC + + + + Y EG LY ++++
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTT 494
Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
WK G + L Q+ D W+ +R+TL ++ G SL LR+P W A ++
Sbjct: 495 --WKDKGKLALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW--CEKATLTV 546
Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
NGQ L N + R W D +L + +P+ L
Sbjct: 547 NGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|333381631|ref|ZP_08473310.1| hypothetical protein HMPREF9455_01476 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829560|gb|EGK02206.1| hypothetical protein HMPREF9455_01476 [Dysgonomonas gadei ATCC
BAA-286]
Length = 811
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 74/364 (20%), Positives = 138/364 (37%), Gaps = 65/364 (17%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
L ++Y +T ++L LA F L L+ S ++ TH P++ +G +R
Sbjct: 232 LAKMYRVTGKKEYLDLAKYF--------LDLKGHGHSGEYSQTHKPVIEQDEAVGHAVRA 283
Query: 348 E-----------VTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSE 396
+TG+ Y D V Y TGG A + G
Sbjct: 284 AYMYSGMADVAALTGNEAYLHAIDKIWDNVVTKKLYITGGIGATGH-------GEAFGKN 336
Query: 397 NE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIY 448
E ETC + + LF + Y D ER L NG++S + Y
Sbjct: 337 YELPNMSAYCETCAAIANVYWNHRLFLLHGDSKYYDVLERTLYNGLIS-GINLDGNRFFY 395
Query: 449 MLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
PL ++ HG F CC + +Y +++ + Y+ ++
Sbjct: 396 PNPL-----ESVGQHGRSEWFGCA-CCPSNVCRFMPSIPGYVYAKKDDKI---YVSLFVE 446
Query: 509 SSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-----SLNLRMP--VWTY 561
S + + G +N WD + + + + ++ L +LN +P ++TY
Sbjct: 447 SEGEIELGKNKINLSQKTGYPWDGNVTINVDPAKSEKFDVLVRIPGWALNKPVPSDLYTY 506
Query: 562 SNGAQASL----NGQNLPLPPPGN-FLSATERWSYNDKLTIQLPLSLR----TEAIQDDR 612
N + ++ NG+++ N +++ +++W DK+ + P+ + E ++DDR
Sbjct: 507 LNPKKETVKIKVNGKDVDYTIGSNGYVTLSQKWKKGDKIDVSFPMDVHKDVANEKVEDDR 566
Query: 613 PEYA 616
+ A
Sbjct: 567 GKVA 570
>gi|423303854|ref|ZP_17281853.1| hypothetical protein HMPREF1072_00793 [Bacteroides uniformis
CL03T00C23]
gi|423307425|ref|ZP_17285415.1| hypothetical protein HMPREF1073_00165 [Bacteroides uniformis
CL03T12C37]
gi|392686852|gb|EIY80152.1| hypothetical protein HMPREF1072_00793 [Bacteroides uniformis
CL03T00C23]
gi|392690034|gb|EIY83305.1| hypothetical protein HMPREF1073_00165 [Bacteroides uniformis
CL03T12C37]
Length = 663
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 143/368 (38%), Gaps = 66/368 (17%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
L RLY++T D K+L A F G A + YL +H P++ +G +R
Sbjct: 223 LVRLYTVTGDKKYLDQAKFFLDA--RGTTARKDIYLQ-----SHKPVLEQEEAVGHAVRA 275
Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADT 392
+TGD Y K I + +IV Y TGG AR E + D L +
Sbjct: 276 GYMYSGMADVAAITGDSSYIKAIDKIWENIV-GKKIYITGGIGARHAGEAFGDNYELPNL 334
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
+ ETC + ++ LF + Y D ER L NG++S + G Y PL
Sbjct: 335 --TAYNETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLIS-GVSLDGGKFFYPNPL 391
Query: 453 GRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISSS 510
+ T+ F C C + I F L +Y ++ V Y+ ++S+
Sbjct: 392 SCDGKYHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQV---YVNLFLSNR 448
Query: 511 FDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVW-------- 559
+ K VVL Q+ W+ +R+ K G L ++N+R+P W
Sbjct: 449 AELKLNEKKVVLEQETG--YPWNGDIRV------KVAQGNLPFTMNIRIPGWVRGSVLPS 500
Query: 560 ---TYSN----GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAI 608
+Y++ G + +NG+ + +L +W D + + + R E +
Sbjct: 501 DLYSYADDLKLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMQPRVVKANEKV 560
Query: 609 QDDRPEYA 616
DR A
Sbjct: 561 VADRGRVA 568
>gi|380695298|ref|ZP_09860157.1| hypothetical protein BfaeM_15227 [Bacteroides faecis MAJ27]
Length = 698
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 91/217 (41%), Gaps = 18/217 (8%)
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 454
+ + ETC + + + T + YA+ E L N VLS + Y PL R
Sbjct: 377 TAHNETCANIGNMLFNWRMLEVTGDAKYAELVETCLYNSVLS-GISLDGKRYFYTNPL-R 434
Query: 455 GVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSS 510
+ T W T++ S +CC + + + + Y +EG LY + +
Sbjct: 435 ISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLNDEGIYCNLYGANTL--T 492
Query: 511 FDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASL 569
WK G +VL Q+ D WD +R+ L ++ G SL R+P W A ++
Sbjct: 493 IHWKDKGEIVLTQETD--YPWDGNVRVRLN-KLPRKAGAF-SLFFRIPEW--CEKATLTV 546
Query: 570 NGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
NG+ + + N + R W D +LT+ +P+ L
Sbjct: 547 NGEPVQIAAKANTYAEVNRIWKKGDMAELTMDMPVRL 583
>gi|354583084|ref|ZP_09001984.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
gi|353198501|gb|EHB63971.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
Length = 626
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 54/266 (20%), Positives = 101/266 (37%), Gaps = 22/266 (8%)
Query: 347 YEVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTYNM 406
+E+ G P+ + +D + H A G S E+ L+ T S+ E C
Sbjct: 237 FELNGSPMERESVHRGIDSLMTYHGQAHGMFSGDEW------LSGTHPSQGVELCAVVEY 290
Query: 407 LKVSRHLFRWTKEIAYADYYERALTNGV--------LSIQRGTEPGVMIYMLPLGRGVSK 458
+ L R E + D E+ N + S Q + +I + R S
Sbjct: 291 MFSMEQLTRILGEGRFGDILEKVAFNALPAAISPDWTSHQYDQQVNQIICNV-APRAWSN 349
Query: 459 ARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHV 518
+ +G + N F CC + + KL ++ +++ GL + Y + G
Sbjct: 350 GPDANVFGLEPN-FGCCTANMHQGWPKLAAHLWMKDQEE--GLVAVSYAPCTVMTTVGRH 406
Query: 519 VLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP 578
+ ++ V+ + + + E + L+LR+P W + +LNG+ LP
Sbjct: 407 DVAAVIE--VTGEYPFKDRIRIHMSLERAESFPLSLRIPAWC--DDPVITLNGRELPFQV 462
Query: 579 PGNFLSATERWSYNDKLTIQLPLSLR 604
+ + W D+L + LP+ +R
Sbjct: 463 ESGYARIVQHWQNGDRLELHLPMEVR 488
>gi|218195658|gb|EEC78085.1| hypothetical protein OsI_17564 [Oryza sativa Indica Group]
Length = 640
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 77/364 (21%), Positives = 125/364 (34%), Gaps = 73/364 (20%)
Query: 293 LYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQADYLSHF-------------HAN 334
L RLY +T +P++L L F +P F + S++ ++
Sbjct: 184 LMRLYDVTEEPRYLNLVKYFIEERGAQPHFYDIEYEKRGKTSYWNTYGPAWMVKDKAYSQ 243
Query: 335 THIPIV-----IGSQMRY-----------EVTGDPLYKLIGTFFMDIVNASHSYATGG-- 376
H P+ IG +R+ ++GD + + + Y TGG
Sbjct: 244 AHQPLAEQQTAIGHAVRFVYLMAGMAHLARLSGDEGKRQDCLRLWNNMAQRQLYITGGIG 303
Query: 377 --TSAREFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGV 434
+S F D DT+ +E +C + ++ +R + + YAD ERAL N V
Sbjct: 304 SQSSGEAFSSDYDLPNDTVYAE---SCASIGLMMFARRMLEMEADSHYADVMERALYNTV 360
Query: 435 LSIQRGTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFW--------------CCYGTGI 480
L + Y+ PL H FN + CC
Sbjct: 361 LG-GMALDGKHFFYVNPL--------EVHPKTLAFNHIYDHVKPVRQRWFGCACCPPNIA 411
Query: 481 ESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTF 540
+ LG IY L I Y+ + + L ++ W + + +T
Sbjct: 412 RVLTSLGHYIYTVRPD---ALLINLYVGNDVAIQIDENTLRLRISGNYPWQDQVTIEIT- 467
Query: 541 SSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLP 600
V +L LR+P W SLNG+ + +L W D LT+ LP
Sbjct: 468 ---SPVPVTHTLALRLPDWCAEPA--VSLNGERVTGEVVRGYLYLNRCWHEGDTLTLTLP 522
Query: 601 LSLR 604
+ +R
Sbjct: 523 MPVR 526
>gi|270295877|ref|ZP_06202077.1| six-hairpin glycosidase [Bacteroides sp. D20]
gi|270273281|gb|EFA19143.1| six-hairpin glycosidase [Bacteroides sp. D20]
Length = 663
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 143/368 (38%), Gaps = 66/368 (17%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
L RLY++T D K+L A F G A + YL +H P++ +G +R
Sbjct: 223 LVRLYTVTGDKKYLDQAKFFLDA--RGTTARKDIYLQ-----SHKPVLEQEEAVGHAVRA 275
Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADT 392
+TGD Y K I + +IV Y TGG AR E + D L +
Sbjct: 276 GYMYSGMADVAAITGDSSYIKAIDKIWENIV-GKKIYITGGIGARHAGEAFGDNYELPNL 334
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
+ ETC + ++ LF + Y D ER L NG++S + G Y PL
Sbjct: 335 --TAYNETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLIS-GVSLDGGKFFYPNPL 391
Query: 453 GRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISSS 510
+ T+ F C C + I F L +Y ++ V Y+ ++S+
Sbjct: 392 SCDGKYHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQV---YVNLFLSNR 448
Query: 511 FDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVW-------- 559
+ K VVL Q+ W+ +R+ K G L ++N+R+P W
Sbjct: 449 AELKLNEKKVVLEQETG--YPWNGDIRV------KVAQGNLPFTMNIRIPGWVRGSVLPS 500
Query: 560 ---TYSN----GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAI 608
+Y++ G + +NG+ + +L +W D + + + R E +
Sbjct: 501 DLYSYADDLKLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMHPRVVKANEKV 560
Query: 609 QDDRPEYA 616
DR A
Sbjct: 561 VADRGRVA 568
>gi|403743937|ref|ZP_10953416.1| hypothetical protein URH17368_0706 [Alicyclobacillus hesperidum
URH17-3-68]
gi|403122527|gb|EJY56741.1| hypothetical protein URH17368_0706 [Alicyclobacillus hesperidum
URH17-3-68]
Length = 712
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 85/217 (39%), Gaps = 25/217 (11%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG----R 454
ETC + ++ + + R + YAD ERAL N V+ + Y+ PL
Sbjct: 384 ETCASIGLIFFANRMIRISPRREYADVMERALYNVVIG-SMALDGKHYCYVNPLALWPPA 442
Query: 455 GVSKARSTHGWGTKFNSFW--CCYGTGIESFSKLGDSIYF--EEEGNVPGLYIIQYISS- 509
+ H + F CC LGD IY EE+G V Y+ YI S
Sbjct: 443 NIQNPDRKHVKPVRQAWFGCACCPPNVARLMMSLGDYIYTIDEEKGKV---YVHLYIGSE 499
Query: 510 -SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVWTYSNGAQA 567
SF +VL Q D + W ++ + G ++ SL LR+P W ++
Sbjct: 500 ASFSVGGRKIVLIQ--DSEMPWQGRVKFRVALGE----GPVNFSLALRIPSWC-ADTPSV 552
Query: 568 SLNGQNLPLPP---PGNFLSATERWSYNDKLTIQLPL 601
+NG L + ++ W+ D L + LP+
Sbjct: 553 RVNGNLLSIASVTTKDGYIEIERTWTDGDVLELDLPM 589
>gi|251796469|ref|YP_003011200.1| hypothetical protein Pjdr2_2459 [Paenibacillus sp. JDR-2]
gi|247544095|gb|ACT01114.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 659
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 85/242 (35%), Gaps = 31/242 (12%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPL---- 452
ETC + ++ ++ + + + YAD ERAL N V+ Q G Y+ PL
Sbjct: 338 ETCASIGLIFFAQRMLKLEAKSEYADVLERALYNNVVGSMSQDGKH---YFYVNPLEVWP 394
Query: 453 -------GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
GR KA +G CC S L D IY N +Y
Sbjct: 395 QASEKNPGRHHVKAERQKWFGCS-----CCPPNVARLLSSLNDYIYTVSAANNT-IYTHL 448
Query: 506 YISS--SFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSN 563
+I S F+ +G V L Q+ + W Y R F G + LR+P W+
Sbjct: 449 FIGSVARFELAAGSVSLKQQSQ--LPWKGYTR----FEFDDVPGAAFTFALRIPSWSRGK 502
Query: 564 GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF 623
A ++NGQ + W D + L + A A AI
Sbjct: 503 -AVLNINGQAAEYTEENGYALVNRNWQQGDVAEWEPALEAQLTAAHPQIRANAGKVAIER 561
Query: 624 GP 625
GP
Sbjct: 562 GP 563
>gi|410096807|ref|ZP_11291792.1| hypothetical protein HMPREF1076_00970 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225424|gb|EKN18343.1| hypothetical protein HMPREF1076_00970 [Parabacteroides goldsteinii
CL02T12C30]
Length = 675
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 84/416 (20%), Positives = 159/416 (38%), Gaps = 59/416 (14%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCF-LGFLALQADYLSHFHANTHIPIVIGSQ---MRY 347
+Y LY+IT D L L HL K + + L D L+ F+ + + G + + Y
Sbjct: 214 AVYWLYNITGDAFLLDLGHLLHKQSYDFVDMFLNRDDLTRFNTIHCVNLAQGIKEPVIYY 273
Query: 348 EVTGDPLY-KLIGTFFMDI--VNASHSYATGGTSAREFWWDPKRLADTLGSENEETCTTY 404
+ D Y + F DI N GG L ++ E C+
Sbjct: 274 QQHPDKKYLDAVKKGFADIRQYNGQPQGMYGGDEG---------LHGNNPTQGSELCSAV 324
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLS-----------IQRGTEPGVMIYMLPLG 453
++ + T ++A+ D+ ER N + + Q+ + + +
Sbjct: 325 ELMYSLEKIMEITGDLAFTDHLERIAFNALPTQVTDDFMDKQYFQQANQVMITRHAHNFY 384
Query: 454 RGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
+ A + +GT+ + CC+ + + K S+++ N G+ + Y S
Sbjct: 385 EDANHAETDIIYGTR-TGYPCCFSNMHQGWPKFTQSLWYATPDN--GIAALAYSPSEVTA 441
Query: 514 KSGH-VVLNQKVDPIVSWDPYLRMTLTFSSK-QEVGQLSSLNLRMPVWTYSNGAQASLNG 571
K G+ + + D +++T+ K +E+ L+LR+P W A ++NG
Sbjct: 442 KVGNGCKIKITEETCYPMDDKIQLTIRLLDKTKEIA--FPLHLRIPGWCKE--ATVTVNG 497
Query: 572 QNLPLP---PPGNFLSATER-WSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYL 627
+P GN ++ R W D++ + LP+ + T Y + A+ GP +
Sbjct: 498 ----VPESTAKGNSVAIIRRTWKSGDQVLLHLPMEVSTSKW------YENSVAVERGPLV 547
Query: 628 LAGHTSGEWDIK-------TGTARSLSALISPIPPSFNAQLVTFTQESGNSTFVMS 676
A +W+ K T +S + SP +N +V F ++ F ++
Sbjct: 548 YALKMDEKWEKKEFKGDEITQFGKSYYEVTSPT--KWNYGIVAFDPDNMQENFQVT 601
>gi|160890885|ref|ZP_02071888.1| hypothetical protein BACUNI_03330 [Bacteroides uniformis ATCC 8492]
gi|156859884|gb|EDO53315.1| hypothetical protein BACUNI_03330 [Bacteroides uniformis ATCC 8492]
Length = 663
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 143/368 (38%), Gaps = 66/368 (17%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
L RLY++T D K+L A F G A + YL +H P++ +G +R
Sbjct: 223 LVRLYTVTGDKKYLDQAKFFLDA--RGTTARKDIYLQ-----SHKPVLEQEEAVGHAVRA 275
Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADT 392
+TGD Y K I + +IV Y TGG AR E + D L +
Sbjct: 276 GYMYSGMADVAAITGDSSYIKAIDKIWENIV-GKKIYITGGIGARHTGEAFGDNYELPNL 334
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
+ ETC + ++ LF + Y D ER L NG++S + G Y PL
Sbjct: 335 --TAYNETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLIS-GVSLDGGKFFYPNPL 391
Query: 453 GRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISSS 510
+ T+ F C C + I F L +Y ++ V Y+ ++S+
Sbjct: 392 SCDGKYHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQV---YVNLFLSNR 448
Query: 511 FDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVW-------- 559
+ K VVL Q+ W+ +R+ K G L ++N+R+P W
Sbjct: 449 AELKLNEKKVVLEQETG--YPWNGDIRV------KVAQGNLPFTMNIRIPGWVRGSVLPS 500
Query: 560 ---TYSN----GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAI 608
+Y++ G + +NG+ + +L +W D + + + R E +
Sbjct: 501 DLYSYADDLKLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMHPRVVKANEKV 560
Query: 609 QDDRPEYA 616
DR A
Sbjct: 561 VADRGRVA 568
>gi|198274386|ref|ZP_03206918.1| hypothetical protein BACPLE_00531 [Bacteroides plebeius DSM 17135]
gi|198272752|gb|EDY97021.1| hypothetical protein BACPLE_00531 [Bacteroides plebeius DSM 17135]
Length = 821
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 80/384 (20%), Positives = 140/384 (36%), Gaps = 79/384 (20%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPI-----VIGSQMR 346
L +LY +T D K+L +A F + G + + S H PI ++G +R
Sbjct: 230 ALCKLYKVTGDKKYLDMARYFVEETGRGTDGHKLNEYSQ----DHKPILQQDEIVGHAVR 285
Query: 347 Y-----------EVTGDPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+T D Y T D + + Y TGG +R + G
Sbjct: 286 AGYLYSGVADVAALTNDTAYFHALTRLWDNLVSKKLYITGGMGSRA-------QGEGFGP 338
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E ETC + + +F T + Y D ERAL NGV+S GV +
Sbjct: 339 NYELQNHTAYCETCAAIANVYWNYRMFLATGDSKYVDVLERALYNGVIS-------GVSL 391
Query: 448 ------YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGL 501
Y PL R ++ CC G + + Y ++ ++
Sbjct: 392 SGDKFFYDNPLESMGEHERQ------RWFGCACCPGNVTRFMASVPSYAYATQQNDI--- 442
Query: 502 YIIQYISSSFDWKSG--HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
Y+ YI + ++ V L Q + W+ ++T+ + ++E G+ ++ LR+P W
Sbjct: 443 YVNLYIQGKAEMQTADNKVTLEQTTE--YPWNG--KVTIKVTPEKE-GKF-AIRLRIPGW 496
Query: 560 T-----------YSNGAQA---SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
T Y++ A+ +NG + + W D + +++P+ +R
Sbjct: 497 TKAAPVASDLYAYTDAAKKYTLKVNGSATRGAEGDGYETIVRTWKAGDVIELEMPMDVRR 556
Query: 606 EAIQDDRPEYASIQAILFGPYLLA 629
D + A+ GP +
Sbjct: 557 IKANDKVEVDRGMVALERGPIMFC 580
>gi|265752773|ref|ZP_06088342.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263235959|gb|EEZ21454.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 801
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 107/275 (38%), Gaps = 34/275 (12%)
Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYN 405
+TGD Y D + Y TGG TS E + L + S ETC
Sbjct: 287 LTGDSAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM--SAYCETCAAIG 344
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
+ V+ LF E Y D ER L NG++S + G Y PL + + + +
Sbjct: 345 NVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL-ESIGQHQRQPWF 402
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
G CC L +Y ++ +V Y+ ++S++ + K ++ +
Sbjct: 403 GCA-----CCPSNVCRFIPSLPGYVYAVKDKDV---YVNLFMSNTSNLKVEGKAVSLEQA 454
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQAS----LN 570
WD + + + +K GQ ++ +R+P W TYS+G + S +N
Sbjct: 455 THYPWDGDVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTYSDGKRLSYTVKVN 510
Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
G+++ + RW DK+ + + RT
Sbjct: 511 GESVQSELKDGYFCIDRRWKKGDKVEVHFDMEPRT 545
>gi|317479689|ref|ZP_07938812.1| hypothetical protein HMPREF1007_01928 [Bacteroides sp. 4_1_36]
gi|316904142|gb|EFV25973.1| hypothetical protein HMPREF1007_01928 [Bacteroides sp. 4_1_36]
Length = 647
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 143/368 (38%), Gaps = 66/368 (17%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMRY 347
L RLY++T D K+L A F G A + YL +H P++ +G +R
Sbjct: 223 LVRLYTVTGDKKYLDQAKFFLDA--RGTTARKDIYLQ-----SHKPVLEQEEAVGHAVRA 275
Query: 348 -----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLADT 392
+TGD Y K I + +IV Y TGG AR E + D L +
Sbjct: 276 GYMYSGMADVAAITGDSSYIKAIDKIWENIV-GKKIYITGGIGARHTGEAFGDNYELPNL 334
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
+ ETC + ++ LF + Y D ER L NG++S + G Y PL
Sbjct: 335 --TAYNETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLIS-GVSLDGGKFFYPNPL 391
Query: 453 GRGVSKARSTHGWGTKFNSFWC-CYGTGIESF-SKLGDSIYFEEEGNVPGLYIIQYISSS 510
+ T+ F C C + I F L +Y ++ V Y+ ++S+
Sbjct: 392 SCDGKYHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQV---YVNLFLSNR 448
Query: 511 FDWK--SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS-SLNLRMPVW-------- 559
+ K VVL Q+ W+ +R+ K G L ++N+R+P W
Sbjct: 449 AELKLNEKKVVLEQETG--YPWNGDIRV------KVAQGNLPFTMNIRIPGWVRGSVLPS 500
Query: 560 ---TYSN----GAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT----EAI 608
+Y++ G + +NG+ + +L +W D + + + R E +
Sbjct: 501 DLYSYADDLKLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMHPRVVKANEKV 560
Query: 609 QDDRPEYA 616
DR A
Sbjct: 561 VADRGRVA 568
>gi|237720781|ref|ZP_04551262.1| six-hairpin glycosidase [Bacteroides sp. 2_2_4]
gi|229449616|gb|EEO55407.1| six-hairpin glycosidase [Bacteroides sp. 2_2_4]
Length = 698
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 139/347 (40%), Gaps = 47/347 (13%)
Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
+ +Y T +P++L L+ +L D + + + Y + HA + G
Sbjct: 248 VVEMYRATENPRYLELSKNLIDIRGMVENGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307
Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
Y TG+ L K + + + DIV Y TG GTS ++P +++ +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366
Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
G + + ETC + + + T + YAD E L N VLS +
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 425
Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
Y PL R + T W T++ S +CC + + + + Y EG
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484
Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
LY ++++ WK G + L Q+ D W+ +R+TL ++ G SL LR+P W
Sbjct: 485 LYGANTLTTT--WKDKGELTLTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW 538
Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
++NGQ L N + R W D +L + +P+ L
Sbjct: 539 --CEKTTLTVNGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|304316161|ref|YP_003851306.1| hypothetical protein Tthe_0663 [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
gi|302777663|gb|ADL68222.1| protein of unknown function DUF1680 [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
Length = 673
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 95/234 (40%), Gaps = 17/234 (7%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLG---RG 455
ETC + ++ + + + + Y+D ERAL N V+S + Y+ PL
Sbjct: 358 ETCASVGLVFFAHRMLQIDPDRQYSDVMERALYNTVIS-GMSLDGKKFFYVNPLEVWPEA 416
Query: 456 VSKAR-STHGWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
K + +H T+ F CC + LG IY ++ V ++ Y+ S
Sbjct: 417 CEKNKVKSHVKYTRQPWFGCACCPPNIARLLTSLGKYIYSKKAKEV---FVHLYVDSELK 473
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQ 572
K +N K WD ++ + SK+E +L++R+P W + + N
Sbjct: 474 EKISESEVNIKQSTQYPWDE--KIIIDIDSKKETE--FTLSIRIPGWCKEAKVKVNNNEI 529
Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLS-LRTEAIQDDRPEYASIQAILFGP 625
+L + RW + D L I L + +R +A + R + + AI GP
Sbjct: 530 DLDSVMEKGYAKINRRWKH-DSLEIYLSMPVMRIKANPNVREDEGKV-AIQRGP 581
>gi|281421440|ref|ZP_06252439.1| putative cytoplasmic protein [Prevotella copri DSM 18205]
gi|281404512|gb|EFB35192.1| putative cytoplasmic protein [Prevotella copri DSM 18205]
Length = 690
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 90/213 (42%), Gaps = 34/213 (15%)
Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L RLY++T + K+L A +L D + G ++ Y + + +PI+ +G +R
Sbjct: 238 LARLYTLTGEKKYLDEAKYLLD---YRGKTHIRNPY-----SQSQVPILEQKEAVGHAVR 289
Query: 347 Y-----------EVTGDPLY-KLIGTFFMDIVNASHSYATGGTSAR---EFWWDPKRLAD 391
+T D Y K+I F +IV + Y TGG AR E + + L +
Sbjct: 290 AGYMYAGIADVAALTKDSAYMKVIDRIFENIVGKKY-YLTGGVGARHAGEAFGENYELPN 348
Query: 392 TLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLP 451
+ ETC +M+ + +F E Y D ER L NGV+S + G Y P
Sbjct: 349 M--TAYNETCAAISMVYLFERMFLLHGESKYIDCMERTLYNGVIS-GMSMDGGRFFYPNP 405
Query: 452 LGRGVSKARSTHGWGTKFNSFWC-CYGTGIESF 483
L A + G T+ F C C + + F
Sbjct: 406 LSSDGKYAFNADGNTTRQPWFGCACCPSNLSRF 438
>gi|345514164|ref|ZP_08793678.1| six-hairpin glycosidase [Bacteroides dorei 5_1_36/D4]
gi|229435978|gb|EEO46055.1| six-hairpin glycosidase [Bacteroides dorei 5_1_36/D4]
Length = 801
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 78/348 (22%), Positives = 132/348 (37%), Gaps = 59/348 (16%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T K+L A F D+ G+ +Y + H P+V +G +R
Sbjct: 222 LAKLYLVTGQQKYLDQAKFFLDQ---RGYTTRTDEY-----SQAHKPVVEQDEAVGHAVR 273
Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADT 392
+TGD Y D + Y TGG TS E + L +
Sbjct: 274 AAYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM 333
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
S ETC + V+ LF E Y D ER L NG++S + G Y PL
Sbjct: 334 --SAYCETCAAIGNVYVNYRLFLLHGEAKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL 390
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
+ + + +G CC L +Y ++ +V Y+ ++S++ +
Sbjct: 391 -ESIGQHQRQPWFGCA-----CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFMSNTSN 441
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TY 561
K ++ + W+ + + + +K GQ ++ +R+P W TY
Sbjct: 442 LKVEGKAVSLEQTTHYPWNGEVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTY 497
Query: 562 SNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
S+G + S +NG+ + + RW DK+ + + RT
Sbjct: 498 SDGKRLSYTVKVNGEPVQSELKDGYFCIDRRWKKGDKIAVHFDMEPRT 545
>gi|218678364|ref|ZP_03526261.1| hypothetical protein RetlC8_05602 [Rhizobium etli CIAT 894]
Length = 345
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 54/233 (23%), Positives = 93/233 (39%), Gaps = 24/233 (10%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + ++ + + + YAD E+AL NG L T+ Y PLG
Sbjct: 131 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALP-GLSTDGKTFFYDNPLGSAGKH 189
Query: 459 ARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKS 515
+G + N G ++ D I +++ ++ +
Sbjct: 190 HPLENGIIAPAARPNIARLVTSIGSYMYAVADDEI---------AVHLYGESTTRLKLAN 240
Query: 516 GHVV-LNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNL 574
G V L Q + WD + F+++ E +L+LR+P W + GA S+NG+ L
Sbjct: 241 GAAVELQQATN--YPWD----GAVAFTTRLEKPAKFALSLRIPDW--AEGATLSVNGEKL 292
Query: 575 PLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
L + +W+ D++ + LPLSLR + + A A++ GP
Sbjct: 293 DLGAAVRDGYARIDRQWADGDRVDLFLPLSLRPQYANPKVRQDAGRVALMRGP 345
>gi|257067398|ref|YP_003153653.1| hypothetical protein Bfae_01840 [Brachybacterium faecium DSM 4810]
gi|256558216|gb|ACU84063.1| uncharacterized conserved protein [Brachybacterium faecium DSM
4810]
Length = 643
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 48/214 (22%), Positives = 91/214 (42%), Gaps = 23/214 (10%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL-----G 453
ETC + S L+ T + YAD+ ER L N V+++ + Y PL G
Sbjct: 332 ETCAGIAAIMFSWRLYLATGGVEYADFIERVLYN-VVAVSPSPDGRAFFYSNPLHQREPG 390
Query: 454 RGVSKARSTHGWGTKFNSFW---CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
S + + G+ ++ CC + + + DS + +G GL ++QY S +
Sbjct: 391 DSASSSVNMRAEGSTRAPWFDVSCCPTNVARTLASV-DSFFAATDGE--GLTLLQYASGT 447
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
+ + V ++ + + + LT E ++L LR+P W ++GA ++
Sbjct: 448 YRTPALTVAVHTE------YPAQGAIALTVLDAAE--DPATLRLRVPSW--ADGAALTVG 497
Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+ + PG + T W +++ + LP+ R
Sbjct: 498 SEPVRTVTPG-WSEVTRTWRAGERVLLDLPVVPR 530
>gi|433774251|ref|YP_007304718.1| hypothetical protein Mesau_02961 [Mesorhizobium australicum
WSM2073]
gi|433666266|gb|AGB45342.1| hypothetical protein Mesau_02961 [Mesorhizobium australicum
WSM2073]
Length = 666
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 97/482 (20%), Positives = 187/482 (38%), Gaps = 81/482 (16%)
Query: 166 VGHYLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKP- 224
+G + +A N +++K+ V+ Q + GYLS++ ++ ++P
Sbjct: 108 LGKTIETAAYSLYRRKNPELEKKIDAVIDMYGRLQQE--DGYLSSW-------YQRIQPG 158
Query: 225 -VWAPYYTIHKI-LAGLLDQYVLA--DNAQALKMATWMVEYFYNRVQKVITMYSVERHWY 280
W H++ AG L + +A K+ M Y + + V+ ++ Y
Sbjct: 159 KRWTNLRDCHELYCAGHLIEGAVAYYQATGKRKLLDIMCRYA-DHIASVLGPEPGKKKGY 217
Query: 281 SLNEETGGMNDVLYRLYSITHDPKHLLLA-----------HLFDKPCFLGFLALQADYLS 329
+EE + L +L +T + K++ LA H FD+ +A +
Sbjct: 218 CGHEE---IELALVKLARVTGEQKYMELAKYFIDQRGQQPHYFDEEARARGADPKAYHFK 274
Query: 330 HF-HANTHIPI-----VIGSQMRYEVT-----------GDPLYKLIGTFFMDIVNASHSY 372
+ ++ +HIP+ V+G +R GD ++ D + + Y
Sbjct: 275 TYEYSQSHIPVREQDKVVGHAVRAMYLYSGMADIATEYGDDTLRVALDRLWDDLTTKNLY 334
Query: 373 ATGGTSAREFWWDPKRLADTLGSE----NE----ETCTTYNMLKVSRHLFRWTKEIAYAD 424
TGG P + S+ NE ETC + ++ + + YAD
Sbjct: 335 ITGGLG-------PSAHNEGFTSDYDLPNETAYAETCASVGLVFWATRMLGMGPNARYAD 387
Query: 425 YYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHG-WGTKFNSFWCCYGTGIESF 483
ERAL NG +S + + Y PL ++R H W K++ CC
Sbjct: 388 MMERALYNGSIS-GLSLDGSLFFYENPL-----ESRGKHNRW--KWHRCPCCPPNIGRMV 439
Query: 484 SKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSK 543
+ +G S ++ + +++ ++ FD V L Q WD + +T+ +
Sbjct: 440 ASIG-SYFYSLADDALAVHLYGDSTARFDIADTPVTLTQASR--YPWDGAVEITVEPQTS 496
Query: 544 QEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPL 601
E +L+LR+P W S+ A+ +NG+ + L + + +W D++ + L +
Sbjct: 497 VEF----TLHLRVPAW--SSKAKLEINGEAIDLAEVTSDGYAAIRRQWKKGDRVRLDLEM 550
Query: 602 SL 603
+
Sbjct: 551 PI 552
>gi|170288466|ref|YP_001738704.1| hypothetical protein TRQ2_0668 [Thermotoga sp. RQ2]
gi|170175969|gb|ACB09021.1| protein of unknown function DUF1680 [Thermotoga sp. RQ2]
Length = 620
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 71/342 (20%), Positives = 133/342 (38%), Gaps = 50/342 (14%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD-----YLSHF----------HANTH 336
L LY T D K+L LA F G ++ + ++ H HA
Sbjct: 195 ALVELYRETGDRKYLDLARYFIYTRGKGLASVPRNPGPEYFIDHKPFVELEEITGHAVRA 254
Query: 337 IPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+ + G+ Y TGD +++ + + + V Y TGG +R W ++ G
Sbjct: 255 LYLCSGATDLYLETGDEKIWQALNRLWENFVT-KKMYITGGAGSRHDW-------ESFGE 306
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E E E+C + + + T + +AD E+ L NG+LS +
Sbjct: 307 EYELPNRRSYAESCASIANFMWNFRMLLATGDGKFADVMEQVLYNGLLS-GISLDGKHYF 365
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y PL + R K+ CC + +Y + V +++ +
Sbjct: 366 YFNPL-EDYGRTRR-----QKWFDCACCPPNLARFIASFPGYMYTTSDDGVQ-VHLYEKS 418
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
++ D+K V + Q+ D W + F+ K ++ + S+ LR+P W ++
Sbjct: 419 TARLDFKGSVVEIEQETD--YPWSG----EIAFTIKTDIEEPFSIYLRLPSW--ADDFVL 470
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+ G+ + P ++ ++ W K T++L L ++ E I+
Sbjct: 471 RVGGKTVTAKPQNGYVKLSQNW--KGKHTVELSLPMKAEFIE 510
>gi|298481311|ref|ZP_06999504.1| hypothetical protein HMPREF0106_01752 [Bacteroides sp. D22]
gi|298272515|gb|EFI14083.1| hypothetical protein HMPREF0106_01752 [Bacteroides sp. D22]
Length = 698
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 84/347 (24%), Positives = 139/347 (40%), Gaps = 47/347 (13%)
Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
+ +Y T +P++L L+ +L D + + + Y + HA + G
Sbjct: 248 VVEMYRATGNPRYLELSKNLIDIRGMVENGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307
Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
Y TG+ L K + + + DIV Y TG GTS ++P +++ +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366
Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
G + + ETC + + + T + YAD E L N VLS +
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 425
Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
Y PL R + T W T++ S +CC + + + + Y EG
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484
Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
LY +++ WK G + L Q+ D W+ +R+TL ++ G SL LR+P W
Sbjct: 485 LYGANTLTTI--WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGAF-SLFLRIPEW 538
Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
A ++NGQ L N + R W D +L + +P+ L
Sbjct: 539 --CEKATLTVNGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|146301833|ref|YP_001196424.1| hypothetical protein Fjoh_4097 [Flavobacterium johnsoniae UW101]
gi|146156251|gb|ABQ07105.1| protein of unknown function DUF1680 [Flavobacterium johnsoniae
UW101]
Length = 672
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 55/236 (23%), Positives = 93/236 (39%), Gaps = 23/236 (9%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC + + + + T + YAD E AL N VLS E +Y PL VS
Sbjct: 357 ETCANIGNVLWNWRMLQITGDAKYADIIELALYNSVLS-GMDLEGEKFLYNNPL--NVSN 413
Query: 459 ARSTHG-WGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW 513
H WG + + CC + +++G+ Y + GLY+ Y S+
Sbjct: 414 DLPFHQRWGNEREGYIALSNCCAPNVTRTIAEVGNYAYNISK---EGLYVNLYGSNQLKT 470
Query: 514 KS---GHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
KS + + Q+ + WD + + + + K L + LR+P W S A+ +N
Sbjct: 471 KSLNGEEIEIEQQTN--YPWDGKITLKIVKAPK----DLQNFFLRIPGW--SQNAEILIN 522
Query: 571 GQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+ G +L ++W D + + P+ + E + A+ GP
Sbjct: 523 NSKINDKIVSGTYLKLNQKWKKGDVIELNFPMPVELMEANPLVEEVKNQVAVKRGP 578
>gi|298385749|ref|ZP_06995307.1| hypothetical protein HMPREF9007_02451 [Bacteroides sp. 1_1_14]
gi|298261890|gb|EFI04756.1| hypothetical protein HMPREF9007_02451 [Bacteroides sp. 1_1_14]
Length = 698
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 140/347 (40%), Gaps = 47/347 (13%)
Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
+ +Y T +P++L L+ +L D + + + Y + HA + G
Sbjct: 248 VVEMYRATGNPRYLELSKNLIDIRGMVESGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307
Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
Y TG+ L K + + + DIV Y TG GTS ++P +++ +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366
Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
G + + ETC + + + T + YAD E L N VLS +
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 425
Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
Y PL R + T W T++ S +CC + + + + Y EG
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484
Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
LY +++ +WK G + L Q+ D W+ +R+TL ++ G SL R+P W
Sbjct: 485 LYGANTLTT--NWKDKGELALVQETD--YPWEGNVRVTLN-KVPRKAGAF-SLFFRIPEW 538
Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
A ++NGQ + + N + R W D +L + +P+ L
Sbjct: 539 --CGKAALTVNGQPVSMNAKANTYAEVNRTWKKGDVVELVMDMPVCL 583
>gi|403252915|ref|ZP_10919220.1| hypothetical protein EMP_04025 [Thermotoga sp. EMP]
gi|402811677|gb|EJX26161.1| hypothetical protein EMP_04025 [Thermotoga sp. EMP]
Length = 621
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 68/335 (20%), Positives = 130/335 (38%), Gaps = 48/335 (14%)
Query: 292 VLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQAD-----YLSHF----------HANTH 336
L LY T D K+L LA F GF ++ + ++ H HA
Sbjct: 195 ALVELYRETGDRKYLDLARYFIYARGKGFASVPRNPGPEYFIDHKPFVELEEITGHAVRA 254
Query: 337 IPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGS 395
+ + G+ Y TGD +++ + + + V Y TGG +R W ++ G
Sbjct: 255 LYLCSGATDLYLETGDEKIWQALNKLWENFVT-KKMYITGGAGSRHDW-------ESFGE 306
Query: 396 ENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI 447
E E E+C + + + T + +AD E+ L NG+LS +
Sbjct: 307 EYELPNRRSYAESCASIANFMWNFRMLLATGDGKFADVMEQVLYNGLLS-GISLDGKHYF 365
Query: 448 YMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYI 507
Y PL + + R K+ CC + +Y + V +++ +
Sbjct: 366 YFNPL-EDLGRTRR-----QKWFDCACCPPNLARFIASFPGYMYTTSDDGVQ-VHLYEKS 418
Query: 508 SSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
++ D+K V + Q+ D W +TF+ + ++ + S+ LR+P W ++
Sbjct: 419 TARLDFKGSVVEIEQETD--YPWSG----EVTFTVETDIEEPFSIYLRIPSW--ADDFVL 470
Query: 568 SLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
++G+ + P ++ + W + + LP+
Sbjct: 471 RVDGKAVIAKPQNGYVKLNQSWKGKHTVELSLPMK 505
>gi|436837570|ref|YP_007322786.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
gi|384068983|emb|CCH02193.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
Length = 683
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 126/334 (37%), Gaps = 38/334 (11%)
Query: 291 DVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHANTHIPIVIGSQMRYEVT 350
D LY Y + + K L L K QA+ L ++H N +I Y +
Sbjct: 217 DNLYSAYWLYNRTKAPFLLELAQKIHRNTANWRQANNLPNWH-NVNIAQCFREPATYYLQ 275
Query: 351 GDPLYKLIGTFF-MDIVNASHSYATGGT-----SAREFWWDPKRLADTLGSENEETCTTY 404
L+ T+ ++V + GG ++R + DP++ ETC
Sbjct: 276 SGDQSDLMATYHNFELVRQRYGQVPGGMWGGDENSRPGYTDPRQAV--------ETCGMV 327
Query: 405 NMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTH- 463
+ L R+T + +AD E N L + + Y+ S A + H
Sbjct: 328 EQMASDELLLRFTGDPFWADNCEDVAFN-TLPAAFMPDYRSLRYLTAPNMVRSDAANHHP 386
Query: 464 -----GWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
G N F CC + +++Y N GL ++ Y +S K G
Sbjct: 387 GIDNQGPFLMMNPFSSRCCQHNHANGWVYYAENLYMATPDN--GLAVVLYNASEVTAKVG 444
Query: 517 H---VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQN 573
+ V L Q+ ++ +R+T+ + L LR+P W + +NG+
Sbjct: 445 NGSAVTLKQETS--YPFEEQVRLTVQAARPTAF----PLYLRVPAWC--SNPTVRVNGRA 496
Query: 574 LPL-PPPGNFLSATERWSYNDKLTIQLPLSLRTE 606
+P+ G ++ T+ W DK+T+ LP+ LR
Sbjct: 497 VPVTAKAGQYIVLTDTWQSGDKITLDLPMRLRVR 530
>gi|423296614|ref|ZP_17274699.1| hypothetical protein HMPREF1070_03364 [Bacteroides ovatus
CL03T12C18]
gi|392670337|gb|EIY63822.1| hypothetical protein HMPREF1070_03364 [Bacteroides ovatus
CL03T12C18]
Length = 698
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 139/347 (40%), Gaps = 47/347 (13%)
Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
+ +Y T +P++L L+ +L D + + + Y + HA + G
Sbjct: 248 VVEMYRATGNPRYLELSKNLIDIRGMVENGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307
Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
Y TG+ L K + + + DIV Y TG GTS ++P +++ +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TQKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366
Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
G + + ETC + + + T + YAD E L N VLS +
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLS-GISLDGK 425
Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
Y PL R + T W T++ S +CC + + + + Y EG
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484
Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
LY ++++ WK G + L Q+ D W+ +R+TL ++ G SL LR+P W
Sbjct: 485 LYGANTLTTT--WKDKGELALTQETD--YPWEGKVRVTLDRVPRK-AGTF-SLFLRIPEW 538
Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
++NGQ L N + R W D +L + +P+ L
Sbjct: 539 --CEKTTLTVNGQPLQTNTKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|319951999|ref|YP_004163266.1| hypothetical protein [Cellulophaga algicola DSM 14237]
gi|319420659|gb|ADV47768.1| protein of unknown function DUF1680 [Cellulophaga algicola DSM
14237]
Length = 699
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 83/213 (38%), Gaps = 19/213 (8%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSK 458
ETC S + E YAD E L N LS E Y PL R V
Sbjct: 374 ETCANICNSMFSYRMLGLHGESKYADVMETVLYNSALS-GINIEGDRYYYANPL-RTVHG 431
Query: 459 ARSTHGWGTKFN------SFWCCYGTGIESFSKLGDSIYFEEEGNVP-GLYIIQYISSSF 511
+R T+F +CC + + +++ Y + E + LY ++++
Sbjct: 432 SRDYDKMNTEFPVRQDYLECFCCPPNLVRTIAQVSGWAYSKSENGIAVNLYGGNKLATTL 491
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
+ S L K + W+ + +T+ L LR+P W + G++ +NG
Sbjct: 492 NDGSS---LKLKQETKYPWEGDVEITIEACRSDAFDIL----LRIPEW--AEGSKIMING 542
Query: 572 QNLP-LPPPGNFLSATERWSYNDKLTIQLPLSL 603
+ L PG + + W ND + + LPL++
Sbjct: 543 KESEILATPGTYATLNRTWKANDTIRLDLPLAI 575
>gi|153852636|ref|ZP_01994073.1| hypothetical protein DORLON_00046 [Dorea longicatena DSM 13814]
gi|149754278|gb|EDM64209.1| hypothetical protein DORLON_00046 [Dorea longicatena DSM 13814]
Length = 649
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 74/174 (42%), Gaps = 20/174 (11%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPL---- 452
ETC + + R + + TK+ +Y D ERAL N +LS Q G Y+ PL
Sbjct: 333 ETCASIGLALFGRRMAQITKDASYMDMVERALYNTLLSGIAQDGKS---FFYVNPLEVWP 389
Query: 453 GRGVSKARSTHGWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
+ + H + F CC + + +G IYF ++ Y+ YIS+
Sbjct: 390 DNCIDRTSKEHVKPVRQKWFGVACCPPNIARTLASMGQYIYFTDKNTA---YVNLYISNE 446
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMP--VWTYS 562
+ L +++ ++ ++RM +T + E L LR+P V TY+
Sbjct: 447 AQIELEEGALKIQIESDLTNTGHIRMAITPDGEGE----HRLALRIPDYVKTYT 496
>gi|222099378|ref|YP_002533946.1| hypothetical protein CTN_0404 [Thermotoga neapolitana DSM 4359]
gi|221571768|gb|ACM22580.1| Putative uncharacterized protein [Thermotoga neapolitana DSM 4359]
Length = 623
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 58/287 (20%), Positives = 110/287 (38%), Gaps = 35/287 (12%)
Query: 332 HANTHIPIVIGSQMRYEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA 390
HA + + G+ Y TGD +++ + + + V Y TGG +R W
Sbjct: 252 HAVRALYLCAGATDLYLETGDEKIWQALNRLWENFVT-KKMYITGGAGSRHDW------- 303
Query: 391 DTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTE 442
++ G E E E+C + + + T + +AD E+ L NG+LS +
Sbjct: 304 ESFGEEYELPNRRSYAESCASIANFMWNFRMLLATGDGKFADVMEQVLYNGLLS-GISLD 362
Query: 443 PGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLY 502
Y PL R K+ CC + +Y V ++
Sbjct: 363 GKHYFYFNPLEDSGRTRRQ------KWFDCACCPPNLARFIASFPGYMYTTSNDGVQ-VH 415
Query: 503 IIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYS 562
+ + ++ +K V + Q+ D W + S + E+ + S+ LR+P W +
Sbjct: 416 LYEKSTAKVSFKGSTVKIEQETD--YPWSG----EIVLSIETEIEEPFSIYLRIPTW--A 467
Query: 563 NGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQ 609
+ ++G+ L L P ++ W ++ + LP +R E ++
Sbjct: 468 DDFSIRVDGETLDLEPQNGYVKLNRNWKGGHRIELSLP--MRVELVE 512
>gi|212692436|ref|ZP_03300564.1| hypothetical protein BACDOR_01932 [Bacteroides dorei DSM 17855]
gi|212665015|gb|EEB25587.1| F5/8 type C domain protein [Bacteroides dorei DSM 17855]
Length = 801
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 106/275 (38%), Gaps = 34/275 (12%)
Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYN 405
+TGD Y D + Y TGG TS E + L + S ETC
Sbjct: 287 LTGDSAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM--SAYCETCAAIG 344
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
+ V+ LF E Y D ER L NG++S + G Y PL + + + +
Sbjct: 345 NVYVNYRLFLLHGESKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL-ESIGQHQRQPWF 402
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
G CC L +Y ++ +V Y+ ++S++ + K ++ +
Sbjct: 403 GCA-----CCPSNVCRFIPSLPGYVYAVKDKDV---YVNLFMSNTSNLKVEGKAVSLEQA 454
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQAS----LN 570
WD + + + +K GQ ++ +R+P W TYS+G + S +N
Sbjct: 455 THYPWDGDVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTYSDGKRLSYTVKVN 510
Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
G+ + + RW DK+ + + RT
Sbjct: 511 GEPVQSELKDGYFCIDRRWKKGDKVEVHFDMEPRT 545
>gi|154495095|ref|ZP_02034100.1| hypothetical protein PARMER_04142 [Parabacteroides merdae ATCC
43184]
gi|423725063|ref|ZP_17699203.1| hypothetical protein HMPREF1078_03097 [Parabacteroides merdae
CL09T00C40]
gi|154085645|gb|EDN84690.1| hypothetical protein PARMER_04142 [Parabacteroides merdae ATCC
43184]
gi|409235419|gb|EKN28237.1| hypothetical protein HMPREF1078_03097 [Parabacteroides merdae
CL09T00C40]
Length = 617
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 51/234 (21%), Positives = 96/234 (41%), Gaps = 31/234 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
ETC + M+ ++ + ++T + Y D ER++ NG L+ GV + Y+ PL
Sbjct: 335 ETCASVGMVYWNQRMNQFTGDSKYIDVLERSMYNGALA-------GVSLAGDRFFYVNPL 387
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV-PGLYIIQYISSSF 511
R + CC +G+ IY + + L+I +
Sbjct: 388 ESNGDHHRQA------WYGCACCPSQISRFLPSIGNYIYGTSDKALWVNLFIGNTTEVTI 441
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
D K VV+ Q+ D WD +++T+T S+Q +G+ L +R+P W S S+NG
Sbjct: 442 DGKK--VVMKQETD--YPWDGLVKLTVT--SEQPLGK--ELRIRIPGWCKS--YTLSVNG 491
Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+ + + + W D + + + + + + + +A+ GP
Sbjct: 492 NKVDSTTDKGY-TVIKEWKTGDLIVLNMDMPVEKVSADPRVRQNTGKRALQRGP 544
>gi|423348679|ref|ZP_17326361.1| hypothetical protein HMPREF1060_04033 [Parabacteroides merdae
CL03T12C32]
gi|409213200|gb|EKN06224.1| hypothetical protein HMPREF1060_04033 [Parabacteroides merdae
CL03T12C32]
Length = 617
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 51/234 (21%), Positives = 96/234 (41%), Gaps = 31/234 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPL 452
ETC + M+ ++ + ++T + Y D ER++ NG L+ GV + Y+ PL
Sbjct: 335 ETCASVGMVYWNQRMNQFTGDSKYIDVLERSMYNGALA-------GVSLAGDRFFYVNPL 387
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNV-PGLYIIQYISSSF 511
R + CC +G+ IY + + L+I +
Sbjct: 388 ESNGDHHRQA------WYGCACCPSQISRFLPSIGNYIYGTSDKALWVNLFIGNTTEVTI 441
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNG 571
D K VV+ Q+ D WD +++T+T S+Q +G+ L +R+P W S S+NG
Sbjct: 442 DGKK--VVMKQETD--YPWDGLVKLTVT--SEQPLGK--ELRIRIPGWCKS--YTLSVNG 491
Query: 572 QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP 625
+ + + + W D + + + + + + + +A+ GP
Sbjct: 492 NKVDSTTDKGY-TVIKEWKTGDLIVLNMDMPVEKVSADPRVRQNTGKRALQRGP 544
>gi|67538270|ref|XP_662909.1| hypothetical protein AN5305.2 [Aspergillus nidulans FGSC A4]
gi|40743275|gb|EAA62465.1| hypothetical protein AN5305.2 [Aspergillus nidulans FGSC A4]
gi|259485256|tpe|CBF82133.1| TPA: DUF1680 domain protein (AFU_orthologue; AFUA_1G08910)
[Aspergillus nidulans FGSC A4]
Length = 629
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 63/254 (24%), Positives = 93/254 (36%), Gaps = 27/254 (10%)
Query: 332 HANTHIPIVIGSQMRYEVTGDPLYKL-IGTFFMDIVNASHSYATGGTSAREFWWD---PK 387
HA + I + +TGD K + +MD+ Y TGG A W
Sbjct: 264 HAVRAMYYYIAATDLVRLTGDEEIKAALDRMWMDMTE-RKLYVTGGIGAMRQWEGFGAKY 322
Query: 388 RLADT--LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGV 445
LADT G ETC + ++ + + + + YAD E L NG L G + G
Sbjct: 323 VLADTDESGICYAETCACFALIIWCQRMLQLDLDAKYADVMEVGLYNGFLGAV-GLDGGS 381
Query: 446 MIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQ 505
Y PL + W CC + + IY ++ V I
Sbjct: 382 FYYQNPLRTYTGHPKERSEW----FEVACCPPNVAKLLGSMESLIYSFKDDLVA---IHL 434
Query: 506 YISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT--YSN 563
YI S F VV++QK + M + + V ++L LR+P W YS+
Sbjct: 435 YIESDFTVPETGVVVSQKTN----------MPWSGDVEISVKGTTALALRIPTWAEGYSS 484
Query: 564 GAQASLNGQNLPLP 577
Q + L +P
Sbjct: 485 SVQGEVKNGYLYIP 498
>gi|160932141|ref|ZP_02079532.1| hypothetical protein CLOLEP_00975 [Clostridium leptum DSM 753]
gi|156868743|gb|EDO62115.1| hypothetical protein CLOLEP_00975 [Clostridium leptum DSM 753]
Length = 705
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 58/238 (24%), Positives = 90/238 (37%), Gaps = 24/238 (10%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVL-SIQRGTEPGVMIYMLPL----- 452
ETC + ++ + + + + Y D ERAL N VL S R + Y+ PL
Sbjct: 389 ETCASIGLIFFAHRMLQMDMDSRYGDVMERALYNVVLGSASRDGK--RFFYVNPLEVWPK 446
Query: 453 -GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSF 511
G + K+ CC + L +Y +E + Y YIS
Sbjct: 447 ACGGNPDKQHVKPVRQKWFGCACCPPNVARLMASLNQYLYSTDEDTI---YTHLYISGEA 503
Query: 512 DWKSGHVVLNQKVDPIVSWDPYLRMT-LTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
K + K + WD +++ T L+ + E+ SL LR+P W N
Sbjct: 504 GIKIAGGEMRLKQESSYPWDGHIKFTVLSALPEDEL----SLGLRLPGWC--RNWSVLFN 557
Query: 571 GQNLPLP-PPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILF--GP 625
G+ +P P +L W D T++L L + E +Q + A I F GP
Sbjct: 558 GKPVPRPVVQKGYLKVAAHWHEGD--TVELRLEMPVECLQANPQVRADAGKIAFQRGP 613
>gi|428217725|ref|YP_007102190.1| alpha-2-macroglobulin domain-containing protein [Pseudanabaena sp.
PCC 7367]
gi|427989507|gb|AFY69762.1| alpha-2-macroglobulin domain protein [Pseudanabaena sp. PCC 7367]
Length = 1968
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 54/121 (44%), Gaps = 12/121 (9%)
Query: 587 ERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSL 646
+RW ++ T+ +L+ Q +PE+ +L G LLA G W A++L
Sbjct: 1667 KRWRWSSSTTVAQAEALKLMVEQGQKPEFTD--RLLQG--LLAQRRDGLWRNDYENAKAL 1722
Query: 647 SALIS-----PIPPSFNAQLVTFTQESGNSTFVMSNSNQS---ITMEEFPVSGTDAALHA 698
+AL++ P PP+F A Q+ G + FV NQ+ I M E P + L
Sbjct: 1723 AALVAYARNEPTPPNFVAIANLDEQQIGTAQFVGYRDNQTQFEIPMAELPQGEQNLVLSK 1782
Query: 699 T 699
T
Sbjct: 1783 T 1783
>gi|86143571|ref|ZP_01061956.1| hypothetical protein MED217_13269 [Leeuwenhoekiella blandensis
MED217]
gi|85830018|gb|EAQ48479.1| hypothetical protein MED217_13269 [Leeuwenhoekiella blandensis
MED217]
Length = 723
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 57/224 (25%), Positives = 86/224 (38%), Gaps = 31/224 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADY--------YERALTNGVLSIQRGTEPGVMIYML 450
ETC + HL R T +I +AD+ Y AL S++ T P + +L
Sbjct: 350 ETCGMVEQMNSDEHLLRITGDIFWADHAEEVAFNTYPAALMPDFRSLRYITSPNM---VL 406
Query: 451 PLGRGVSKARSTHGWGTKFNSF--WCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYIS 508
S + G N F CC + ++ ++++ N GL + Y +
Sbjct: 407 NDDANHSPGIANAGPFLMMNPFSSRCCQHNHGQGWAYFTENLFMATPDN--GLAAVLYAA 464
Query: 509 SSFDWKSGH----VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNG 564
S K V N P R+ T ++ + V L R+P W +
Sbjct: 465 GSVTAKVSDGKEVTVTNNSNYPFTE-----RLDFTVNTSEAVE--FPLYFRIPAW--AKQ 515
Query: 565 AQASLNGQNLPLPPPGNFLSATER-WSYNDKLTIQLP--LSLRT 605
A +LNG+ L P N ER W D++T+ LP L LRT
Sbjct: 516 ASVALNGEALDANPSANTYIRIEREWKDGDQVTLTLPKELGLRT 559
>gi|395771959|ref|ZP_10452474.1| hypothetical protein Saci8_19398 [Streptomyces acidiscabies 84-104]
Length = 654
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 104/479 (21%), Positives = 175/479 (36%), Gaps = 84/479 (17%)
Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAP 228
+L A+ A T + T+ ++ +V ++ Q + GYL + +L +P W
Sbjct: 93 WLEAACWQLADTPDETLATEVEAIVELIAAAQRE--DGYLQTY-YQLGGGTPWTEPGWGH 149
Query: 229 --YYTIHKILAGLLDQYVLADN---AQALKMATWMVEYFY--NRVQKVITMYSVERHWYS 281
Y H I A + + A A ++A + F +V+ V VE
Sbjct: 150 ELYCAGHLIQAAVAHHRATGSDRLLAVARRLADHIDSVFGPGKQVETVCGHPEVE----- 204
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA-----NTH 336
L L+ T + ++L LA F + G L+ AD H
Sbjct: 205 ---------TALVELHRTTDEKRYLDLARYFLERRGHGTLSSGADRGHDRDPGPEYWQDH 255
Query: 337 IPI-----VIGSQMRYEV-----------TGD-PLYKLIGTFFMDIVNASHSYATGGTSA 379
PI V G +R TGD L + + D+V + +Y TG +
Sbjct: 256 TPIRAADEVTGHAVRQLYLLAGAADLAAETGDTELRTALERLWRDMVT-TKTYLTGAVGS 314
Query: 380 REFWWDPKRLADTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
R W + G +E ETC + S + T E Y+D ER L
Sbjct: 315 RHDW-------EAFGDAHELPADRAYAETCAAIASVHFSWRMALLTGEARYSDLVERTLF 367
Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIESFSK 485
NG L+ G + +Y+ PL R +ARS G T + W CC + +
Sbjct: 368 NGFLA-GAGLDGRTWLYVNPLHR---RARSHERPGDQTAHRTPWFRCACCPPNVMRLLAG 423
Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
L ++ + GL + QY + + G L +V W+ + +T+ +
Sbjct: 424 L---PHYLATADDSGLQLHQYATGVY----GGDGLTVRVTTEYPWEGTVTVTV---DEAP 473
Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+L+LR+P W + ++NG + +L T ++ D + + L + R
Sbjct: 474 TALPRTLSLRLPAWCADH--TLTVNGTTVEDGADSGWLRITRAFTPGDTVRLDLAMPAR 530
>gi|421613335|ref|ZP_16054421.1| protein containing DUF1680 [Rhodopirellula baltica SH28]
gi|408495929|gb|EKK00502.1| protein containing DUF1680 [Rhodopirellula baltica SH28]
Length = 688
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 90/221 (40%), Gaps = 28/221 (12%)
Query: 395 SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPL 452
+ + ETC + + +F E + D E AL N VLS GT Y PL
Sbjct: 369 TAHNETCANIGNVLWNWRMFLANGESKHIDVLELALYNSVLSGVDLDGTN---FFYTNPL 425
Query: 453 GRGVSKARSTHGWGTK--FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
+ + + G + F + +CC + + +G Y + + V ++ Y S++
Sbjct: 426 RQSDTAPVALRWSGGRKPFVTSFCCPPNLARTIAGVGQYAYGKSDDTV---WVNLYGSNT 482
Query: 511 FD---WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQA 567
D GHV + Q D WD ++++T+ Q V L LR+P W +
Sbjct: 483 LDTHLTNGGHVRIEQTTD--YPWDGHIQITIAECQNQPV----CLKLRIPGWATTT---- 532
Query: 568 SLNGQNLPLP---PPGNFLSATERWSYND--KLTIQLPLSL 603
+L +P PG+++S WS +L +P SL
Sbjct: 533 TLKIDGVPTETTIKPGSYVSLRRAWSPGTVIELDFAMPASL 573
>gi|225018685|ref|ZP_03707877.1| hypothetical protein CLOSTMETH_02635, partial [Clostridium
methylpentosum DSM 5476]
gi|224948545|gb|EEG29754.1| hypothetical protein CLOSTMETH_02635 [Clostridium methylpentosum
DSM 5476]
Length = 1108
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 58/251 (23%), Positives = 98/251 (39%), Gaps = 41/251 (16%)
Query: 398 EETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
+ETC + +K + T + YAD E+ N +L +G P V
Sbjct: 529 QETCISVTWMKFCEKMLSITGDPIYADQIEKTAYNALLGAMQG--PNAQ---------VD 577
Query: 458 KARSTHGW-------GTKFNSFW--------CCYGTGIESFSKLG-DSIYFEEEGNVPGL 501
ST W GT+ + F CC +GI + I G V L
Sbjct: 578 DVCSTLYWDYFTLYNGTRHHEFGGHIEGVDSCCSASGISGLGVIPLAQIMNSAAGPVINL 637
Query: 502 YIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTY 561
Y ++++ SG+ V VD + ++M + + +V + ++ LR+P W
Sbjct: 638 YSPGSMAANT--PSGNKV-RFDVDTNYPVEGEIKMVV----QPDVQEQFTVKLRIPAW-- 688
Query: 562 SNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQ-- 619
S +NG PG FL W D TI++ + RT ++ + + + +
Sbjct: 689 SEQTVVKVNGAEQKDVVPGTFLELNRTWKPGD--TIEISMDFRTWIVESPKGKGSDTEGN 746
Query: 620 -AILFGPYLLA 629
A++ GP +LA
Sbjct: 747 IALVRGPVVLA 757
>gi|423269691|ref|ZP_17248663.1| hypothetical protein HMPREF1079_01745 [Bacteroides fragilis
CL05T00C42]
gi|423272751|ref|ZP_17251698.1| hypothetical protein HMPREF1080_00351 [Bacteroides fragilis
CL05T12C13]
gi|392700537|gb|EIY93699.1| hypothetical protein HMPREF1079_01745 [Bacteroides fragilis
CL05T00C42]
gi|392708315|gb|EIZ01422.1| hypothetical protein HMPREF1080_00351 [Bacteroides fragilis
CL05T12C13]
Length = 695
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 72/318 (22%), Positives = 116/318 (36%), Gaps = 44/318 (13%)
Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--ETCTTYNMLKVS 410
P Y + D + + TGG A F D K D + ETC S
Sbjct: 347 PEYIAAVSRLWDNMIGKRMFITGGVGAVHF--DEKFGPDYFLPTDAYLETCAAVGAGFFS 404
Query: 411 RHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
+ + T + Y D ER L N VL+ GT+ Y PL S + GW
Sbjct: 405 QRMNELTGDAKYMDELERTLYNNVLTGISLSGTQ---YTYQNPLN---SAKHARWGW--- 455
Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW---KSGHVVLNQKVD 525
+ CC ++ S + IY ++ N+ Y+ +I S + + L QK
Sbjct: 456 -HDCPCCPPMFLKMMSAMPGFIYSQKGDNI---YVNLFIGSETELSLSDQSRIRLTQKTG 511
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-------------YSNGAQASLNGQ 572
WD + MT+ + E + L +R+P W + +NG+
Sbjct: 512 --YPWDGSVVMTV----EPEKEKTFLLKVRIPGWAQGVENPYDLYRSEVKSAVNLKVNGK 565
Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAG 630
++ + + +W D++ + LP+ R + + + AI GP Y L G
Sbjct: 566 SIAMKIFKGYAEIQRKWKKGDRVELTLPVQPRLVTANEAVADLQNKVAIAAGPFVYCLEG 625
Query: 631 -HTSGEWDIKTGTARSLS 647
G D++ T LS
Sbjct: 626 CDNEGVADLRLDTRAPLS 643
>gi|423248317|ref|ZP_17229333.1| hypothetical protein HMPREF1066_00343 [Bacteroides fragilis
CL03T00C08]
gi|423253266|ref|ZP_17234197.1| hypothetical protein HMPREF1067_00841 [Bacteroides fragilis
CL03T12C07]
gi|392657166|gb|EIY50803.1| hypothetical protein HMPREF1067_00841 [Bacteroides fragilis
CL03T12C07]
gi|392660424|gb|EIY54038.1| hypothetical protein HMPREF1066_00343 [Bacteroides fragilis
CL03T00C08]
Length = 695
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 72/318 (22%), Positives = 116/318 (36%), Gaps = 44/318 (13%)
Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--ETCTTYNMLKVS 410
P Y + D + + TGG A F D K D + ETC S
Sbjct: 347 PKYIAAVSRLWDNMIGKRMFITGGVGAVHF--DEKFGPDYFLPTDAYLETCAAVGAGFFS 404
Query: 411 RHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
+ + T + Y D ER L N VL+ GT+ Y PL S + GW
Sbjct: 405 QRMNELTGDAKYMDELERTLYNNVLTGISLSGTQ---YTYQNPLN---SAKHARWGW--- 455
Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW---KSGHVVLNQKVD 525
+ CC ++ S + IY ++ N+ Y+ +I S + + L QK
Sbjct: 456 -HDCPCCPPMFLKMMSAMPGFIYSQKGDNI---YVNLFIGSETELSLSDQSRIRLTQKTG 511
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-------------YSNGAQASLNGQ 572
WD + MT+ + E + L +R+P W + +NG+
Sbjct: 512 --YPWDGSVVMTV----EPEKEKTFLLKVRIPGWAQRVENPYDLYRSEVKSAVNLKVNGK 565
Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAG 630
++ + + +W D++ + LP+ R + + + AI GP Y L G
Sbjct: 566 SIAMKIFKGYAEIQRKWKKGDRVELTLPVQPRLVTANEAVADLQNKVAIAAGPFVYCLEG 625
Query: 631 -HTSGEWDIKTGTARSLS 647
G D++ T LS
Sbjct: 626 CDNEGVADLRLDTRAPLS 643
>gi|383124478|ref|ZP_09945142.1| hypothetical protein BSIG_3498 [Bacteroides sp. 1_1_6]
gi|251839029|gb|EES67113.1| hypothetical protein BSIG_3498 [Bacteroides sp. 1_1_6]
Length = 687
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 7/77 (9%)
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
LR+P WT GA+ +NG+ + + P G +L W+ DK+ + LP+SL Q ++
Sbjct: 483 LRIPSWT--EGAEVRVNGKKISVKPVSGKYLCIEREWADGDKVEMTLPMSLSMRTWQVNK 540
Query: 613 PEYASIQAILFGPYLLA 629
+ ++ +GP L+
Sbjct: 541 ----NSVSVDYGPLTLS 553
>gi|29348940|ref|NP_812443.1| hypothetical protein BT_3531 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340847|gb|AAO78637.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 687
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 7/77 (9%)
Query: 554 LRMPVWTYSNGAQASLNGQNLPLPP-PGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDR 612
LR+P WT GA+ +NG+ + + P G +L W+ DK+ + LP+SL Q ++
Sbjct: 483 LRIPSWT--EGAEVRVNGKKISVKPVSGKYLCIEREWADGDKVEMTLPMSLSMRTWQVNK 540
Query: 613 PEYASIQAILFGPYLLA 629
+ ++ +GP L+
Sbjct: 541 ----NSVSVDYGPLTLS 553
>gi|430751377|ref|YP_007214285.1| hypothetical protein Theco_3232 [Thermobacillus composti KWC4]
gi|430735342|gb|AGA59287.1| hypothetical protein Theco_3232 [Thermobacillus composti KWC4]
Length = 672
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 70/286 (24%), Positives = 116/286 (40%), Gaps = 39/286 (13%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGV 456
E+C + ++ S+ + + + Y D ERAL N L+ Q G Y+ PL
Sbjct: 341 ESCASIGLIMFSKRMLQIEAKGEYGDVMERALYNTELAGMSQDGKR---YFYVNPLEVWP 397
Query: 457 SKARSTHGWG--TKFNSFW----CCYGTGIESFSKLGDSIYF--EEEGNV-PGLYI---- 503
RS G W CC + LG +Y E G V LYI
Sbjct: 398 EACRSNPGKHHVKPVRQRWFGCACCPPNIARLIASLGGYVYDVDAESGIVYTHLYIGGEA 457
Query: 504 -IQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSS--LNLRMPVWT 560
+ G VV+ Q+ + WD + +T+T E G L++ L LR+P W
Sbjct: 458 RLNVGKEGGGHDGGTVVVRQETN--YPWDGAVMLTVT----PEAGGLTAFTLALRLPGW- 510
Query: 561 YSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQA 620
S ++ ++NG+ + + W D + ++L +++R A + + A A
Sbjct: 511 -SRTSEIAVNGERIAPEVRDGYAYICRDWQPGDTVELKLDMTIRLLAARPEVRADAGRVA 569
Query: 621 ILFGPYLLAGHTSGEWDIKTGTARSLSALI----SPIPPSFNAQLV 662
I GP + ++ D G LSAL +P+ +++AQL+
Sbjct: 570 IQRGPLVYCLESA---DNPGG---PLSALAIDTQTPLTATYDAQLL 609
>gi|53711660|ref|YP_097652.1| hypothetical protein BF0369 [Bacteroides fragilis YCH46]
gi|52214525|dbj|BAD47118.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 689
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 72/318 (22%), Positives = 116/318 (36%), Gaps = 44/318 (13%)
Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--ETCTTYNMLKVS 410
P Y + D + + TGG A F D K D + ETC S
Sbjct: 341 PEYIAAVSRLWDNMIGKRMFITGGVGAVHF--DEKFGPDYFLPTDAYLETCAAVGAGFFS 398
Query: 411 RHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
+ + T + Y D ER L N VL+ GT+ Y PL S + GW
Sbjct: 399 QRMNELTGDAKYMDELERTLYNNVLTGISLSGTQ---YTYQNPLN---SAKHARWGW--- 449
Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW---KSGHVVLNQKVD 525
+ CC ++ S + IY ++ N+ Y+ +I S + + L QK
Sbjct: 450 -HDCPCCPPMFLKMMSAMPGFIYSQKGDNI---YVNLFIGSETELSLSDQSRIRLTQKTG 505
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-------------YSNGAQASLNGQ 572
WD + MT+ + E + L +R+P W + +NG+
Sbjct: 506 --YPWDGSVVMTV----EPEKEKTFLLKVRIPGWAQGVENPYDLYRSEVKSAVNLKVNGK 559
Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAG 630
++ + + +W D++ + LP+ R + + + AI GP Y L G
Sbjct: 560 SIAMKIFKGYAEIQRKWKKGDRVELTLPVQPRLVTANEAVADLQNKVAIAAGPFVYCLEG 619
Query: 631 -HTSGEWDIKTGTARSLS 647
G D++ T LS
Sbjct: 620 CDNEGVADLRLDTRAPLS 637
>gi|336407845|ref|ZP_08588341.1| hypothetical protein HMPREF1018_00356 [Bacteroides sp. 2_1_56FAA]
gi|335944924|gb|EGN06741.1| hypothetical protein HMPREF1018_00356 [Bacteroides sp. 2_1_56FAA]
Length = 695
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 72/318 (22%), Positives = 116/318 (36%), Gaps = 44/318 (13%)
Query: 353 PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--ETCTTYNMLKVS 410
P Y + D + + TGG A F D K D + ETC S
Sbjct: 347 PEYIAAVSRLWDNMIGKRMFITGGVGAVHF--DEKFGPDYFLPTDAYLETCAAVGAGFFS 404
Query: 411 RHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGVSKARSTHGWGTK 468
+ + T + Y D ER L N VL+ GT+ Y PL S + GW
Sbjct: 405 QRMNELTGDAKYMDELERTLYNNVLTGISLSGTQ---YTYQNPLN---SAKHARWGW--- 455
Query: 469 FNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDW---KSGHVVLNQKVD 525
+ CC ++ S + IY ++ N+ Y+ +I S + + L QK
Sbjct: 456 -HDCPCCPPMFLKMMSAMPGFIYSQKGDNI---YVNLFIGSETELSLSDQSRIRLTQKTG 511
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-------------YSNGAQASLNGQ 572
WD + MT+ + E + L +R+P W + +NG+
Sbjct: 512 --YPWDGSVVMTV----EPEKEKTFLLKVRIPGWAQGVENPYDLYRSEVKSAVNLKVNGK 565
Query: 573 NLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAG 630
++ + + +W D++ + LP+ R + + + AI GP Y L G
Sbjct: 566 SIAMKIFKGYAEIQRKWKKGDRVELTLPVQPRLVTANEAVADLQNKVAIAAGPFVYCLEG 625
Query: 631 -HTSGEWDIKTGTARSLS 647
G D++ T LS
Sbjct: 626 CDNEGVADLRLDTRAPLS 643
>gi|423230666|ref|ZP_17217070.1| hypothetical protein HMPREF1063_02890 [Bacteroides dorei
CL02T00C15]
gi|423244377|ref|ZP_17225452.1| hypothetical protein HMPREF1064_01658 [Bacteroides dorei
CL02T12C06]
gi|392630316|gb|EIY24309.1| hypothetical protein HMPREF1063_02890 [Bacteroides dorei
CL02T00C15]
gi|392641951|gb|EIY35723.1| hypothetical protein HMPREF1064_01658 [Bacteroides dorei
CL02T12C06]
Length = 801
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 62/275 (22%), Positives = 106/275 (38%), Gaps = 34/275 (12%)
Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYN 405
+TGD Y D + Y TGG TS E + L + S ETC
Sbjct: 287 LTGDSAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM--SAYCETCAAIG 344
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
+ V+ LF E Y D ER L NG++S + G Y PL + + + +
Sbjct: 345 NVYVNYRLFLLHGEAKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL-ESIGQHQRQPWF 402
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
G CC L +Y ++ +V Y+ ++S++ + K ++ +
Sbjct: 403 GCA-----CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFMSNTSNLKVEGKAVSLEQT 454
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQAS----LN 570
W+ + + + +K GQ ++ +R+P W TYS+G + S +N
Sbjct: 455 THYPWNGEVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTYSDGKRLSYTVKVN 510
Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
G+ + + RW DK+ + + RT
Sbjct: 511 GEPVQSELKDGYFCIDRRWKKGDKVAVHFDMEPRT 545
>gi|373462448|ref|ZP_09554170.1| hypothetical protein HMPREF9944_02434 [Prevotella maculosa OT 289]
gi|371948225|gb|EHO66109.1| hypothetical protein HMPREF9944_02434 [Prevotella maculosa OT 289]
Length = 932
Score = 40.0 bits (92), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 98/235 (41%), Gaps = 25/235 (10%)
Query: 399 ETCTTYNMLKVS-RHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVS 457
ETC + + ++ R L W + YA E++L N V + Q E G + Y +
Sbjct: 648 ETCGSVFWIDLNHRFLQLWPTKERYASEIEKSLYNVVFAAQ--GENGCIRYFNQVNDAKY 705
Query: 458 KARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
A +N+ CC + L +Y G+++ + +S D+K
Sbjct: 706 PAMC-------YNT--CCEIQATALYGMLPQYVYSVAPD---GVFVNLFSASDIDFK--- 750
Query: 518 VVLNQKVDPIVSWD-PYL-RMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLP 575
V +Q V + PY ++ L S+ + V + +R+P W G +N + +
Sbjct: 751 -VKDQPVKLTMKTQFPYSNQVALRVSADRPV--TMKVRVRIPEWA-KGGVVLRVNDRKVK 806
Query: 576 LPPPGNFLSATERWSYNDKLTIQLPLSLRTEA-IQDDRPEYASIQAILFGPYLLA 629
PG+++ W ND++T LP++ E I R A+ A +GP L+A
Sbjct: 807 TGMPGSYVEIDRTWKDNDEITWSLPMTWSYEKYIGATRIAGATRYAFFYGPMLMA 861
>gi|372221612|ref|ZP_09500033.1| hypothetical protein MzeaS_04798 [Mesoflavibacter
zeaxanthinifaciens S86]
Length = 664
Score = 40.0 bits (92), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 95/425 (22%), Positives = 153/425 (36%), Gaps = 81/425 (19%)
Query: 251 ALKMATWMVEYF---YNRVQKVITMYSVERHWYSLNEETGGMNDVLYRLYSITHDPKHLL 307
ALK A MV+ F N++Q V +E TG L +LY IT + +
Sbjct: 211 ALKNANLMVKTFGAEQNQIQTVPGHQIIE---------TG-----LLKLYQITGEVAYKD 256
Query: 308 LAHLFDKPCFLGFLALQADY-LSHFHANTHIPI-----VIGSQMRY-----------EVT 350
LA F L + D L ++ H+P+ V+G +R +T
Sbjct: 257 LAKFF-----LDNRGVAKDRKLFGAYSQDHLPVTQQKEVVGHAVRAVYMYAAMTDIAAIT 311
Query: 351 GDPLY-KLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE--------ETC 401
D Y + + T + ++V Y TGG A K + G+ E ETC
Sbjct: 312 KDSTYLRAVDTLWQNMVE-KKMYITGGIGA-------KHEGEAFGANYELPNITAYNETC 363
Query: 402 TTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARS 461
+ + L + Y D ER L NG++S + Y PL +
Sbjct: 364 AAIGDVYWNHRLHNLKGKAHYFDIIERTLYNGLIS-GISLDGKQFFYPNPL-ESDGLYQF 421
Query: 462 THGWGTKFNSFWC-CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVL 520
G T+ + F C C T + F + + + N L++ Y S+S L
Sbjct: 422 NQGACTRKDWFDCSCCPTNLIRFIPSIPGLLYSKGAN--ELFVNLYASNSATINLKSTEL 479
Query: 521 NQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-----------YSNG----- 564
N + WD +R T+ + ++ R+P W Y N
Sbjct: 480 NVVQETNYPWDGTIRFTVNTAKPYTF----PIHFRVPGWAQNQVVPSGLYQYENPNPSFP 535
Query: 565 AQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFG 624
+ +NG+ + +LS RW+ ND + I+ P+ ++ E A+ G
Sbjct: 536 IKIKVNGKATAIDSKEGYLSLDRRWANNDVIEIEFPMDVKLVKTNTRVVENRGKVALERG 595
Query: 625 PYLLA 629
P + A
Sbjct: 596 PIVYA 600
>gi|423240707|ref|ZP_17221821.1| hypothetical protein HMPREF1065_02444 [Bacteroides dorei
CL03T12C01]
gi|392643669|gb|EIY37418.1| hypothetical protein HMPREF1065_02444 [Bacteroides dorei
CL03T12C01]
Length = 801
Score = 40.0 bits (92), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 62/275 (22%), Positives = 106/275 (38%), Gaps = 34/275 (12%)
Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYN 405
+TGD Y D + Y TGG TS E + L + S ETC
Sbjct: 287 LTGDSAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM--SAYCETCAAIG 344
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
+ V+ LF E Y D ER L NG++S + G Y PL + + + +
Sbjct: 345 NVYVNYRLFLLHGEAKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL-ESIGQHQRQPWF 402
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
G CC L +Y ++ +V Y+ ++S++ + K ++ +
Sbjct: 403 GCA-----CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFMSNTSNLKVEGKAVSLEQT 454
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQAS----LN 570
W+ + + + +K GQ ++ +R+P W TYS+G + S +N
Sbjct: 455 THYPWNGEVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTYSDGKRLSYTVKVN 510
Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
G+ + + RW DK+ + + RT
Sbjct: 511 GEPVQSELKDGYFCIDRRWKKGDKVAVHFDMEPRT 545
>gi|237711367|ref|ZP_04541848.1| six-hairpin glycosidase [Bacteroides sp. 9_1_42FAA]
gi|229454062|gb|EEO59783.1| six-hairpin glycosidase [Bacteroides sp. 9_1_42FAA]
Length = 781
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 62/275 (22%), Positives = 106/275 (38%), Gaps = 34/275 (12%)
Query: 349 VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADTLGSENEETCTTYN 405
+TGD Y D + Y TGG TS E + L + S ETC
Sbjct: 267 LTGDSAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM--SAYCETCAAIG 324
Query: 406 MLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGW 465
+ V+ LF E Y D ER L NG++S + G Y PL + + + +
Sbjct: 325 NVYVNYRLFLLHGEAKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL-ESIGQHQRQPWF 382
Query: 466 GTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVD 525
G CC L +Y ++ +V Y+ ++S++ + K ++ +
Sbjct: 383 GCA-----CCPSNICRFIPSLPGYVYAVKDKDV---YVNLFMSNTSNLKVEGKAVSLEQT 434
Query: 526 PIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TYSNGAQAS----LN 570
W+ + + + +K GQ ++ +R+P W TYS+G + S +N
Sbjct: 435 THYPWNGEVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPSDLYTYSDGKRLSYTVKVN 490
Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
G+ + + RW DK+ + + RT
Sbjct: 491 GEPVQSELKDGYFCIDRRWKKGDKVAVHFDMEPRT 525
>gi|222082345|ref|YP_002541710.1| hypothetical protein Arad_8964 [Agrobacterium radiobacter K84]
gi|221727024|gb|ACM30113.1| conserved hypothetical protein [Agrobacterium radiobacter K84]
Length = 643
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 80/376 (21%), Positives = 143/376 (38%), Gaps = 52/376 (13%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL--SHFHANTHIPI 339
L +L +T + K+L LA F +P F AL+ A ++ ++ + H P+
Sbjct: 197 ALVKLARVTGEKKYLDLAKYFVDERGQEPHFFTDEALRDGRDPAKFVQKTYEYNQAHQPV 256
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 257 REQTKVVGHAVRAMYLYSGMADIATEYNDDSLTSALETLWDDLTT-KQMYVTGGIGPAAS 315
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + S ETC + ++ + + YAD E+AL NG ++
Sbjct: 316 NEGFTDYYDLPNE--SAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GL 372
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
+ Y PL G R T ++ CC + +G +Y + +
Sbjct: 373 SLDGKTFFYENPLESGGKHHRWT------WHHCPCCPPNIARLLASIGSYMYAAADNEI- 425
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
+++ + SG V + + WD +R F + +L+LR+P W
Sbjct: 426 AVHLYGESKARVPLASG-VTVELAQETRYPWDGAIR----FEVNPDRNARFALSLRIPEW 480
Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
++GA ++NG + L + W D++ + +PL RT + A
Sbjct: 481 --ADGATLAVNGVPVDLSAVTIDGYARIERDWQAGDRVDLNIPLIPRTLFANPKVRQDAG 538
Query: 618 IQAILFGPYLLAGHTS 633
A++ GP + T+
Sbjct: 539 RAALMRGPLVYCVETT 554
>gi|256424326|ref|YP_003124979.1| hypothetical protein Cpin_5347 [Chitinophaga pinensis DSM 2588]
gi|256039234|gb|ACU62778.1| protein of unknown function DUF1680 [Chitinophaga pinensis DSM
2588]
Length = 686
Score = 40.0 bits (92), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 69/344 (20%), Positives = 124/344 (36%), Gaps = 46/344 (13%)
Query: 352 DPLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLA-DTLGSENE--ETCTTYNMLK 408
+P Y + D + + TGG A D ++ D E+ ETC +
Sbjct: 341 NPAYFTTAVRYWDNMTGKRMFVTGGEGAIA---DQEKFGPDYFLPESAYLETCASIGAAF 397
Query: 409 VSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMI------YMLPLGRGVSKARST 462
S+ + + + Y D +ER + N +LS GV + Y PL ++K
Sbjct: 398 FSQRMNQLLADGKYMDEFERVMYNNLLS-------GVSLSGDHYFYENPL---IAKDHKR 447
Query: 463 HGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQ 522
W +S CC ++ + + IY + GLY+ +ISS + G ++
Sbjct: 448 WAW----HSCPCCPPMILKMVAAIPAYIYAADN---TGLYVNLFISSEYKGAVGDKKVSL 500
Query: 523 KVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-------------YSNGAQASL 569
K W ++ + + + + ++++R+P W + +
Sbjct: 501 KQSTQYPWKGTTQIAVNPAEEGDF----AVSVRIPGWAQGRENYFGLYTSQVTTPVSLRV 556
Query: 570 NGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGPYLLA 629
NG +P+ P ++ W DK+ + LP+ R D I GP +
Sbjct: 557 NGAAVPVQPENGYVRIKRHWKKGDKIILALPMQPRLIFPHDSIRTVQGKATIAAGPVIYG 616
Query: 630 GHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQESGNSTF 673
+ + T +AL P F + T + G STF
Sbjct: 617 LEGIDNSKLDSLTISRNTALQLAFKPGFLGGVNVVTGQLGGSTF 660
>gi|398379890|ref|ZP_10538009.1| hypothetical protein PMI03_03641 [Rhizobium sp. AP16]
gi|397721906|gb|EJK82452.1| hypothetical protein PMI03_03641 [Rhizobium sp. AP16]
Length = 643
Score = 39.7 bits (91), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 80/376 (21%), Positives = 143/376 (38%), Gaps = 52/376 (13%)
Query: 292 VLYRLYSITHDPKHLLLAHLF-----DKPCFLGFLALQ-----ADYL--SHFHANTHIPI 339
L +L +T + K+L LA F +P F AL+ A ++ ++ + H P+
Sbjct: 197 ALVKLARVTGEKKYLDLAKYFVDERGQEPHFFTDEALRDGRDPAKFVQKTYEYNQAHQPV 256
Query: 340 -----VIGSQMRY------------EVTGDPLYKLIGTFFMDIVNASHSYATGG---TSA 379
V+G +R E D L + T + D+ Y TGG ++
Sbjct: 257 REQTKVVGHAVRAMYLYSGMADIATEYNDDSLTSALETLWDDLTT-KQMYVTGGIGPAAS 315
Query: 380 REFWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR 439
E + D L + S ETC + ++ + + YAD E+AL NG ++
Sbjct: 316 NEGFTDYYDLPNE--SAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GL 372
Query: 440 GTEPGVMIYMLPLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVP 499
+ Y PL G R T ++ CC + +G +Y + +
Sbjct: 373 SLDGKTFFYENPLESGGKHHRWT------WHHCPCCPPNIARLLASIGSYMYAAADNEI- 425
Query: 500 GLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
+++ + SG V + + WD +R F + +L+LR+P W
Sbjct: 426 AVHLYGESKARVPLASG-VTVELAQETRYPWDGAIR----FEVNPDRNARFALSLRIPEW 480
Query: 560 TYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYAS 617
++GA ++NG + L + W D++ + +PL RT + A
Sbjct: 481 --ADGATLAVNGVPVDLSAVTIDGYARIERDWQAGDRVDLNIPLIPRTLFANPKVRQDAG 538
Query: 618 IQAILFGPYLLAGHTS 633
A++ GP + T+
Sbjct: 539 RAALMRGPLVYCVETT 554
>gi|403252790|ref|ZP_10919097.1| hypothetical protein EMP_03410 [Thermotoga sp. EMP]
gi|402811900|gb|EJX26382.1| hypothetical protein EMP_03410 [Thermotoga sp. EMP]
Length = 622
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 67/332 (20%), Positives = 126/332 (37%), Gaps = 46/332 (13%)
Query: 293 LYRLYSITHDPKHLLLAHLFDKPCFLGFLALQA--DYLSHFHANTHIPIVIGSQMR---- 346
L LY T D K+L LA F G + +YL + + G +R
Sbjct: 196 LVELYRETGDRKYLDLAKYFIYTRGKGLTGFKKNPEYLIDHKPFVELEEITGHAVRALYL 255
Query: 347 -------YEVTGD-PLYKLIGTFFMDIVNASHSYATGGTSAREFWWDPKRLADTLGSENE 398
Y TGD +++ + + + V Y TGG +R W ++ G E E
Sbjct: 256 CSGATDLYLETGDEKIWQALNKLWENFVT-KKMYITGGAGSRHDW-------ESFGEEYE 307
Query: 399 --------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYML 450
E+C + + + T + +AD E+ L NG+LS + Y
Sbjct: 308 LPNRRSYAESCASIANFMWNFRMLLATGDGKFADVMEQVLYNGLLS-GISLDGKHYFYFN 366
Query: 451 PLGRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
PL + + R K+ CC + +Y + V +++ + +
Sbjct: 367 PL-EDLGRTRRQ-----KWFDCACCPPNLARFIASFPGYMYTTSDDGVQ-VHLYEKSTVR 419
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
D+K V + Q+ D W +TF+ + ++ + S++LR+P W ++ ++
Sbjct: 420 LDFKGSVVEIEQETD--YPWSG----EVTFTVEADIEEPFSISLRIPSW--ADDFVLRVD 471
Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLS 602
G+ + P ++ + W + + LP+
Sbjct: 472 GKTVIAKPQNGYVKLNQSWKGKHTVELSLPMK 503
>gi|383763276|ref|YP_005442258.1| hypothetical protein CLDAP_23210 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381383544|dbj|BAM00361.1| hypothetical protein CLDAP_23210 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 636
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 45/177 (25%), Positives = 71/177 (40%), Gaps = 23/177 (12%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLS--IQRGTEPGVMIYMLPLGRGV 456
ETC ++ + L ++ E YAD E+ L NG +S RG Y+ PL
Sbjct: 329 ETCAAIALILWNHRLLQFAGEGKYADVMEQTLYNGFISGVSLRGDS---FFYVNPLASNG 385
Query: 457 SKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSG 516
S R T + CC + LG+ +Y EG GL++ Y +S
Sbjct: 386 SHHR------TPWFECPCCPPNVGRILASLGNYLYSTGEG---GLWVHFYAQNSARTTVD 436
Query: 517 HVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWT-----YSNGAQAS 568
+ +++ WD +++ +T + Q +L LR+P W NGA A
Sbjct: 437 GTEVGLRLESRYPWDGAVKLMITPAQPQRF----TLYLRIPGWCDRWSLRVNGAAAD 489
>gi|440699526|ref|ZP_20881821.1| hypothetical protein STRTUCAR8_01370 [Streptomyces turgidiscabies
Car8]
gi|440277899|gb|ELP65960.1| hypothetical protein STRTUCAR8_01370 [Streptomyces turgidiscabies
Car8]
Length = 654
Score = 39.7 bits (91), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 103/479 (21%), Positives = 174/479 (36%), Gaps = 84/479 (17%)
Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAP 228
+L A+ A T + T+ ++ +V ++ Q + GYL + +L +P W
Sbjct: 93 WLEAACWQLADTPDETLATEVEAIVELIAAAQRE--DGYLQTY-YQLGGGIPWTEPGWGH 149
Query: 229 --YYTIHKILAGLLDQYVLADN---AQALKMATWMVEYFY--NRVQKVITMYSVERHWYS 281
Y H I A + + A A ++A + F +V V VE
Sbjct: 150 ELYCAGHLIQAAVAHHRATGSDRLLAVARRLADHIDSVFGPGKQVDTVCGHPEVE----- 204
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA-----NTH 336
L L+ T + ++L LA F + G L+ AD H
Sbjct: 205 ---------TALVELHRTTDEKRYLDLARYFLERRGHGTLSSGADRGHDRDPGPEYWQDH 255
Query: 337 IPI-----VIGSQMRYEV-----------TGD-PLYKLIGTFFMDIVNASHSYATGGTSA 379
P+ V G +R TGD L + + D+V + +Y TG +
Sbjct: 256 TPVRAADEVTGHAVRQLYLLAGAADLAAETGDTELRTALERLWRDMVT-TKTYLTGAVGS 314
Query: 380 REFWWDPKRLADTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
R W + G +E ETC + S + T E Y+D ER L
Sbjct: 315 RHDW-------EAFGDAHELPADRAYAETCAAIASVHFSWRMALLTGEARYSDLVERTLF 367
Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIESFSK 485
NG L+ G + +Y+ PL R +ARS G T + W CC + +
Sbjct: 368 NGFLA-GAGLDGRTWLYVNPLHR---RARSHERPGDQTAHRTPWFRCACCPPNVMRLLAG 423
Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
L ++ + GL + QY + + G L +V W+ + +T+ +
Sbjct: 424 L---PHYLATADDSGLQLHQYATGVY----GGDGLTVRVTTEYPWEGTVTVTV---DEAP 473
Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+L+LR+P W + ++NG + +L T ++ D + + L + R
Sbjct: 474 TALPRTLSLRLPAWCADH--TLTVNGTTVEDGADSGWLRITRAFTPGDTVRLDLAMPAR 530
>gi|290962053|ref|YP_003493235.1| hypothetical protein SCAB_77341 [Streptomyces scabiei 87.22]
gi|260651579|emb|CBG74703.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length = 654
Score = 39.7 bits (91), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 103/479 (21%), Positives = 174/479 (36%), Gaps = 84/479 (17%)
Query: 169 YLSASAQMWASTHNATIKEKMSTVVFSLSECQNKIGTGYLSAFPTELFDSFEALKPVWAP 228
+L A+ A T + T+ ++ +V ++ Q + GYL + +L +P W
Sbjct: 93 WLEAACWQLADTPDETLATEVEAIVELIAAAQRE--DGYLQTY-YQLGGGIPWTEPGWGH 149
Query: 229 --YYTIHKILAGLLDQYVLADN---AQALKMATWMVEYFY--NRVQKVITMYSVERHWYS 281
Y H I A + + A A ++A + F +V V VE
Sbjct: 150 ELYCAGHLIQAAVAHHRATGSDRLLAVARRLADHIDSVFGPGKQVDTVCGHPEVE----- 204
Query: 282 LNEETGGMNDVLYRLYSITHDPKHLLLAHLFDKPCFLGFLALQADYLSHFHA-----NTH 336
L L+ T + ++L LA F + G L+ AD H
Sbjct: 205 ---------TALVELHRTTDEKRYLDLARYFLERRGHGTLSSGADRGHDRDPGPEYWQDH 255
Query: 337 IPI-----VIGSQMRYEV-----------TGD-PLYKLIGTFFMDIVNASHSYATGGTSA 379
P+ V G +R TGD L + + D+V + +Y TG +
Sbjct: 256 TPVRAADEVTGHAVRQLYLLAGAADLAAETGDTELRTALERLWRDMVT-TKTYLTGAVGS 314
Query: 380 REFWWDPKRLADTLGSENE--------ETCTTYNMLKVSRHLFRWTKEIAYADYYERALT 431
R W + G +E ETC + S + T E Y+D ER L
Sbjct: 315 RHDW-------EAFGDAHELPADRAYAETCAAIASVHFSWRMALLTGEARYSDLVERTLF 367
Query: 432 NGVLSIQRGTEPGVMIYMLPLGRGVSKARSTHGWG--TKFNSFW----CCYGTGIESFSK 485
NG L+ G + +Y+ PL R +ARS G T + W CC + +
Sbjct: 368 NGFLA-GAGLDGRTWLYVNPLHR---RARSHERPGDQTAHRTPWFRCACCPPNVMRLLAG 423
Query: 486 LGDSIYFEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQE 545
L ++ + GL + QY + + G L +V W+ + +T+ +
Sbjct: 424 L---PHYLATADDSGLQLHQYATGVY----GGDGLTVRVTTEYPWEGTVTVTV---DEAP 473
Query: 546 VGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+L+LR+P W + ++NG + +L T ++ D + + L + R
Sbjct: 474 TALPRTLSLRLPAWCADH--TLTVNGTTVEDGADSGWLRITRAFTPGDTVRLDLAMPAR 530
>gi|218508305|ref|ZP_03506183.1| hypothetical protein RetlB5_12284 [Rhizobium etli Brasil 5]
Length = 177
Score = 39.7 bits (91), Expect = 7.6, Method: Composition-based stats.
Identities = 33/137 (24%), Positives = 62/137 (45%), Gaps = 23/137 (16%)
Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPPP--GNFLSATERWSYNDKLTIQLPLSLRTEAI 608
+L+LR+P W ++GA S+NG+ L L + +W D++ + LPLSLR +
Sbjct: 10 ALSLRIPDW--ADGATLSVNGEKLDLGAATRDGYARIDRQWVDGDRVDLFLPLSLRPQYA 67
Query: 609 QDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISPIPPSFNAQLVTFTQES 668
+ A A++ GP + T T + L+A++ P + S
Sbjct: 68 NPKVRQDAGRVALMRGPLVYCVET-------TDNGQDLNAIVLP------------RELS 108
Query: 669 GNSTFVMSNSNQSITME 685
T V+++ N ++ ++
Sbjct: 109 AAETVVLNDLNDAVALD 125
>gi|326789389|ref|YP_004307210.1| hypothetical protein Clole_0260 [Clostridium lentocellum DSM 5427]
gi|326540153|gb|ADZ82012.1| protein of unknown function DUF1680 [Clostridium lentocellum DSM
5427]
Length = 638
Score = 39.3 bits (90), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 51/214 (23%), Positives = 87/214 (40%), Gaps = 20/214 (9%)
Query: 399 ETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQR--GTEPGVMIYMLPLGR-- 454
ETC ++ +R + K YAD ERAL N VL+ + GT+ Y+ PL
Sbjct: 328 ETCAAIGLIFFARKMIDLEKNNEYADIMERALYNCVLAGMQLDGTK---FFYVNPLESIP 384
Query: 455 GVSKARSTHGWGTKFNSFW----CCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSS 510
G+S TH W CC S +G + EEGN +Y +I +
Sbjct: 385 GISGEAVTHRHALPQRPKWFTCACCPPNVARLLSSMG-RYAWSEEGNT--VYSHLFIGGT 441
Query: 511 FDWKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLN 570
D L+ K+ S+ ++ F E L +L +R+P+W S L+
Sbjct: 442 LDLTD---TLHGKIKVETSYPYGNQVRYRFEPNDESMDL-TLAIRLPLW--SENTSIMLD 495
Query: 571 GQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLR 604
+ ++ T+ ++ D +T+ ++++
Sbjct: 496 EKKANYEIRNGYVYLTKAFTQEDMVTVTFDMNVK 529
>gi|218675303|ref|ZP_03524972.1| hypothetical protein RetlG_29862 [Rhizobium etli GR56]
Length = 175
Score = 39.3 bits (90), Expect = 8.4, Method: Composition-based stats.
Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 11/104 (10%)
Query: 551 SLNLRMPVWTYSNGAQASLNGQNLPLPP--PGNFLSATERWSYNDKLTIQLPLSLRTEAI 608
+L+LR+P W + GA S+NG L L + +W+ D++ + LPLSLR +
Sbjct: 8 ALSLRIPDW--AEGATLSVNGTMLDLSTHIRDGYARIDRQWADGDRVALHLPLSLRPQYA 65
Query: 609 QDDRPEYASIQAILFGPYLLAGHTSGEWDIKTGTARSLSALISP 652
+ A A++ GP + T T L+A++ P
Sbjct: 66 NPKVRQDAGRVALMRGPLVYCVET-------TDNGEDLNAIVLP 102
>gi|410725713|ref|ZP_11364076.1| hypothetical protein A370_02153 [Clostridium sp. Maddingley
MBC34-26]
gi|410601724|gb|EKQ56224.1| hypothetical protein A370_02153 [Clostridium sp. Maddingley
MBC34-26]
Length = 648
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 61/286 (21%), Positives = 112/286 (39%), Gaps = 31/286 (10%)
Query: 364 DIVNASHSYATGGTSARE----FWWDPKRLADTLGSENEETCTTYNMLKVSRHLFRWTKE 419
D + Y TGG + + F +D DT+ +E TC + ++ +R + + +
Sbjct: 297 DNMTKKRMYITGGIGSSQYGEAFTYDYDLPNDTIYAE---TCASIGLVFFARRMLEISPK 353
Query: 420 IAYADYYERALTNGVLSIQR--GTEPGVMIYMLPLGRGVSKARSTHGWG------TKFNS 471
YAD E+AL NGV+S GT+ Y+ PL + H K+
Sbjct: 354 SKYADIMEKALYNGVISGMSLDGTK---FFYVNPLEVVPESSEKDHLRAHVKVERQKWFG 410
Query: 472 FWCCYGTGIESFSKLGDSIY-FEEEGNVPGLYIIQYISSSFDWKSGHVVLNQKVDPIVSW 530
CC + +G Y +E LY+ I+++ + +V KV+ W
Sbjct: 411 CACCPPNLARLLASIGSYAYSIKENTMFMHLYMGGEITTNL--SNNNVAF--KVETNYPW 466
Query: 531 DPYLRMTLTFSSKQEVGQLSSLNLRMPVWTYSNGAQASLNGQNLPLPPPGNFLSATERWS 590
D +++TL K+E+ + +R+P W +NG+++ + W
Sbjct: 467 DENVKITLNI--KEEIN--FEVAIRIPEWC--GNYNIKVNGEDVEYKIIYGYAYIDRVWK 520
Query: 591 YNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YLLAGHTSG 634
D + + + + + + E A++ GP Y L +G
Sbjct: 521 NADAIDVDFKMPVEVMSANVNVRENIGKVAVMRGPIVYCLEEEDNG 566
>gi|212715353|ref|ZP_03323481.1| hypothetical protein BIFCAT_00247 [Bifidobacterium catenulatum DSM
16992 = JCM 1194]
gi|212661728|gb|EEB22303.1| hypothetical protein BIFCAT_00247 [Bifidobacterium catenulatum DSM
16992 = JCM 1194]
Length = 727
Score = 39.3 bits (90), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 115/308 (37%), Gaps = 28/308 (9%)
Query: 349 VTGDP-LYKLIGTFFMDIVNASHSYATGGTSA----REFWWDPKRLADTLGSENEETCTT 403
+TG+ L + T + +IV+ Y TGG A F +D DT SE +C
Sbjct: 323 ITGEAALLESCETLWRNIVD-RKLYITGGIGATHMGEAFSFDYDLPNDTAYSE---SCAA 378
Query: 404 YNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGVSKA---- 459
+ +R + + YAD E AL N L+ + Y+ PL V +A
Sbjct: 379 IALAFFARRMLEIQPKSEYADVMESALYNTTLA-GMALDGKSFFYVNPL-EVVPEACHRD 436
Query: 460 -RSTHGWGTKFNSFWC-CYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFDWKSGH 517
R H + F C C I + + + LY+ Y+ K G
Sbjct: 437 ERKFHVKPVRQKWFGCACCPPNIARMVESVQQYAYTVADDASTLYVHLYMGGVVSAKLGG 496
Query: 518 VVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLS---SLNLRMPVWTYSNGAQASLNG--- 571
++ +V + W+ +T+T S E GQ+ +L LR+P W A S++
Sbjct: 497 SDVSLEVRAGMPWNGAGAITVTLPSSDE-GQVPESFALALRLPAWAGGESAADSIHATGE 555
Query: 572 --QNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRTEAIQDDRPEYASIQAILFGP--YL 627
+ +L T W D + P+ +R A E A A + GP Y
Sbjct: 556 KDSRITRTTRDGYLYLTGTWRDGDVIDFDFPMPVRMIAANPLVREDAGKVAFIRGPLAYC 615
Query: 628 LAGHTSGE 635
G +G+
Sbjct: 616 AEGTDNGD 623
>gi|150003691|ref|YP_001298435.1| hypothetical protein BVU_1122 [Bacteroides vulgatus ATCC 8482]
gi|149932115|gb|ABR38813.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 801
Score = 39.3 bits (90), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 78/348 (22%), Positives = 131/348 (37%), Gaps = 59/348 (16%)
Query: 293 LYRLYSITHDPKHLLLAHLF-DKPCFLGFLALQADYLSHFHANTHIPIV-----IGSQMR 346
L +LY +T K+L A F D+ G+ +Y + H P+V +G +R
Sbjct: 222 LAKLYLVTGQQKYLDQAKFFLDQ---RGYTTRTDEY-----SQAHKPVVEQDEAVGHAVR 273
Query: 347 YE-----------VTGDPLYKLIGTFFMDIVNASHSYATGG---TSAREFWWDPKRLADT 392
+TGD Y D + Y TGG TS E + L +
Sbjct: 274 AAYMYAGMADVAALTGDTAYIHAIDRIWDNIVGKKYYITGGIGATSNGEAFGKNYELPNM 333
Query: 393 LGSENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPGVMIYMLPL 452
S ETC + V+ LF E Y D ER L NG++S + G Y PL
Sbjct: 334 --SAYCETCAAIGNVYVNYRLFLLHGEAKYYDVLERTLYNGLIS-GVSLDGGGFFYPNPL 390
Query: 453 GRGVSKARSTHGWGTKFNSFWCCYGTGIESFSKLGDSIYFEEEGNVPGLYIIQYISSSFD 512
+ + + +G CC L +Y + +V Y+ ++S++ +
Sbjct: 391 -ESIGQHQRQPWFGCA-----CCPSNICRFIPSLPGYVYAVKGKDV---YVNLFMSNTSN 441
Query: 513 WKSGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW-----------TY 561
K ++ + W+ + + + +K GQ ++ +R+P W TY
Sbjct: 442 LKVEGKAVSLEQATHYPWNGDVTIGV---NKNNAGQF-TMKIRIPGWVRNQVVPCDLYTY 497
Query: 562 SNGAQAS----LNGQNLPLPPPGNFLSATERWSYNDKLTIQLPLSLRT 605
S+G + S +NG+ + + RW DK+ + + RT
Sbjct: 498 SDGKRLSYTVKVNGEPVQSELKDGYFCIDRRWKKGDKVAVHFDMEPRT 545
>gi|29346413|ref|NP_809916.1| hypothetical protein BT_1003 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29338309|gb|AAO76110.1| Six-hairpin glycosidase [Bacteroides thetaiotaomicron VPI-5482]
Length = 698
Score = 39.3 bits (90), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 81/347 (23%), Positives = 140/347 (40%), Gaps = 47/347 (13%)
Query: 293 LYRLYSITHDPKHLLLA-HLFDKPCFL--------GFLALQADYLSHFHANTHIPIVIGS 343
+ +Y T +P++L L+ +L D + + + Y + HA + G
Sbjct: 248 VVEMYRATGNPRYLELSKNLIDIRGMVESGTDDNQDRIPFRDQYRAMGHAVRANYLYAGV 307
Query: 344 QMRYEVTGDP-LYKLIGTFFMDIVNASHSYATG-------GTSAREFWWDP---KRLADT 392
Y TG+ L K + + + DIV Y TG GTS ++P +++ +
Sbjct: 308 ADVYAETGEQQLMKNLTSIWNDIV-TRKMYVTGACGALYDGTSPDGTCYEPDSIQKVHQS 366
Query: 393 LG--------SENEETCTTYNMLKVSRHLFRWTKEIAYADYYERALTNGVLSIQRGTEPG 444
G + + ETC + + + T + YA+ E L N VLS +
Sbjct: 367 YGRPYQLPNSTAHNETCANIGNMLFNWRMLEVTGDAKYAELVETCLYNSVLS-GISLDGK 425
Query: 445 VMIYMLPLGRGVSKARSTHGW---GTKFNSFWCCYGTGIESFSKLGDSIY-FEEEGNVPG 500
Y PL R + T W T++ S +CC + + + + Y EG
Sbjct: 426 KYFYTNPL-RISADLPYTLRWPKERTEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCN 484
Query: 501 LYIIQYISSSFDWK-SGHVVLNQKVDPIVSWDPYLRMTLTFSSKQEVGQLSSLNLRMPVW 559
LY +++ +WK G + L Q+ D W+ +R+TL ++ G SL R+P W
Sbjct: 485 LYGANTLTT--NWKDKGELALVQETD--YPWEGNVRVTLN-KVPRKAGAF-SLFFRIPEW 538
Query: 560 TYSNGAQASLNGQNLPLPPPGNFLSATER-WSYND--KLTIQLPLSL 603
A ++NGQ + + N + R W D +L + +P+ L
Sbjct: 539 --CGKAALTVNGQPVSMNAKANTYAEVNRTWKKGDVVELVMDMPVCL 583
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.134 0.413
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,037,255,670
Number of Sequences: 23463169
Number of extensions: 600505259
Number of successful extensions: 1204287
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 500
Number of HSP's successfully gapped in prelim test: 514
Number of HSP's that attempted gapping in prelim test: 1199978
Number of HSP's gapped (non-prelim): 1589
length of query: 857
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 705
effective length of database: 8,792,793,679
effective search space: 6198919543695
effective search space used: 6198919543695
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 82 (36.2 bits)