BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 002940
(863 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224053368|ref|XP_002297785.1| predicted protein [Populus trichocarpa]
gi|222845043|gb|EEE82590.1| predicted protein [Populus trichocarpa]
Length = 858
Score = 1233 bits (3190), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 594/863 (68%), Positives = 708/863 (82%), Gaps = 24/863 (2%)
Query: 14 LLTFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWL 73
L+ ++ +KECTN +L+SHTFR LLSS+NE++ +++ +H HLTP+DDSAW
Sbjct: 7 LVVLSMLCGFGTSKECTNTPTQLSSHTFRYALLSSENETWKEEMFAHY-HLTPTDDSAWA 65
Query: 74 SLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWR 133
+L+PRKILREE++ +SWAM+YR +K+P + SG FLKEVSLH+VRL S+HW+
Sbjct: 66 NLLPRKILREEDE---YSWAMMYRNLKSP-----LKSSGNFLKEVSLHNVRLDPSSIHWQ 117
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
AQQTNLEYLLMLDVD LVW+FRKTA L PG YGGWE P+CELRGHFVGHYLSASA MW
Sbjct: 118 AQQTNLEYLLMLDVDSLVWSFRKTAGLSTPGTAYGGWEAPNCELRGHFVGHYLSASAQMW 177
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILA 253
ASTHN+ L+++MSAVVSALS+CQ+++GSGYLSAFP+E FDR EA+ PVWAPYYTIHKILA
Sbjct: 178 ASTHNDILEKQMSAVVSALSSCQEKMGSGYLSAFPSELFDRFEAIKPVWAPYYTIHKILA 237
Query: 254 GLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLF 313
GLLDQYT+ADNA+AL+M WMV+YFYNRV+NVI +S+ERH+Q+LNEE GGMNDVLYKLF
Sbjct: 238 GLLDQYTFADNAQALKMVKWMVDYFYNRVRNVITNFSVERHYQSLNEETGGMNDVLYKLF 297
Query: 314 CITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE 373
IT DPKHL+LAHLFDKPCFLGLLA+QA+DISGFH+NTHIPIVIG+QMRYE+TGD L+K+
Sbjct: 298 SITGDPKHLVLAHLFDKPCFLGLLAVQAEDISGFHANTHIPIVIGAQMRYEITGDPLYKD 357
Query: 374 -----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFR 422
H + GT++ F SDPKRLAS L + EESCTTYNMLKVSRHLFR
Sbjct: 358 IGTFFMDIVNSSHSYATGGTSVSE--FWSDPKRLASTLQTENEESCTTYNMLKVSRHLFR 415
Query: 423 WTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCC 482
WTKE+AYADYYER+LTNGVLGIQRGTEPGVMIY+LP PGSSK +SYH WGT D+FWCC
Sbjct: 416 WTKEMAYADYYERALTNGVLGIQRGTEPGVMIYMLPQHPGSSKGKSYHGWGTLYDTFWCC 475
Query: 483 YGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLR 542
YGTGIESFSKLGDSIYFEEEG+ PG+YIIQYISS LDWKSGQI++NQKVDPVVS DPYLR
Sbjct: 476 YGTGIESFSKLGDSIYFEEEGEAPGLYIIQYISSSLDWKSGQIMINQKVDPVVSSDPYLR 535
Query: 543 VTLTFS-SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKL 601
VT TFS +KGS ++LNLRIP WT +GA AT+N Q L +P+PG+FLSV + WSS DKL
Sbjct: 536 VTFTFSPNKGSSQASTLNLRIPVWTHLDGATATINSQSLAIPAPGSFLSVNRKWSSGDKL 595
Query: 602 TIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE-SATSLSDWITPIPA 660
++QLP++LRTEAIQDDR +YASIQAILYGPY+LAGH+ GDW++ SA SLSD ITPIPA
Sbjct: 596 SLQLPISLRTEAIQDDRHQYASIQAILYGPYLLAGHTSGDWNLKAGSAGSLSDSITPIPA 655
Query: 661 SYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSL 720
SYN QL++F+Q+ GN+ FVLTNSNQSITME+ PKSGTDA L ATFR++ NDSS SE +
Sbjct: 656 SYNEQLVSFSQDSGNSTFVLTNSNQSITMEEHPKSGTDACLQATFRIVFNDSSSSEVLGI 715
Query: 721 NDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLES 780
ND I KSVMLEPFD PGML++Q D L VT+S GSS+FH+V GLDG D TVSLES
Sbjct: 716 NDVIDKSVMLEPFDLPGMLLVQQGKDSSLAVTNSAADDGSSIFHVVLGLDGKDGTVSLES 775
Query: 781 ETYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANR 840
+ +GC++Y+ VN +S +S KL C S++ GFN ASFV+ KGLSEYHPISFVA+G R
Sbjct: 776 GSQEGCYIYSGVNYKSGQSMKLSCKLGSSDPGFNQGASFVMNKGLSEYHPISFVAEGDKR 835
Query: 841 NFLLAPLLSLRDESYTVYFDFQS 863
NFLLAPL SLRDE YT+YF+ Q+
Sbjct: 836 NFLLAPLHSLRDEFYTIYFNIQA 858
>gi|225435510|ref|XP_002285548.1| PREDICTED: uncharacterized protein LOC100246702 [Vitis vinifera]
Length = 864
Score = 1203 bits (3112), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 589/872 (67%), Positives = 701/872 (80%), Gaps = 34/872 (3%)
Query: 13 FLLTFLLIVSAA-------QAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLT 65
F+L+ +LIV A KECTN +L+SH+FR LL+S NES+ ++ H HL
Sbjct: 4 FVLSEVLIVVFAFVLCGCVLGKECTNVPTQLSSHSFRYELLASNNESWKAEMFQHY-HLI 62
Query: 66 PSDDSAWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRL 125
+DDSAW +L+PRK+LREE++ FSWAM+YR +KN + FLKE+SLHDVRL
Sbjct: 63 HTDDSAWSNLLPRKLLREEDE---FSWAMMYRNMKN-----YDGSNSNFLKEMSLHDVRL 114
Query: 126 GSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHY 185
SDS+H RAQQTNL+YLL+LDVD+LVW+FRKTA L PG PYGGWE P+ ELRGHFVGHY
Sbjct: 115 DSDSLHGRAQQTNLDYLLILDVDRLVWSFRKTAGLSTPGLPYGGWEAPNVELRGHFVGHY 174
Query: 186 LSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPY 245
+SASA MWASTHN++LKEKMSAVVSAL+ CQ+++G+GYLSAFP+E FDR EA+ PVWAPY
Sbjct: 175 MSASAQMWASTHNDTLKEKMSAVVSALATCQEKMGTGYLSAFPSELFDRFEAIKPVWAPY 234
Query: 246 YTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGM 305
YTIHKILAGLLDQYT+A N++AL+M TWMVE+FY RVQNVI YS+ERHW +LNEE GGM
Sbjct: 235 YTIHKILAGLLDQYTFAGNSQALKMMTWMVEHFYKRVQNVITMYSLERHWLSLNEETGGM 294
Query: 306 NDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEV 365
NDVLY+L+ IT D KHL+LAHLFDKPCFLGLLA+QAD ISGFH+NTHIP+VIGSQMRYEV
Sbjct: 295 NDVLYRLYSITGDQKHLVLAHLFDKPCFLGLLAVQADSISGFHANTHIPVVIGSQMRYEV 354
Query: 366 TGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNML 414
TGD L+K H + GT++G F SDPKRLAS L EESCTTYNML
Sbjct: 355 TGDPLYKAIGTFFMDIVNSSHSYATGGTSVGE--FWSDPKRLASTLQRENEESCTTYNML 412
Query: 415 KVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGT 474
KVSRHLFRWTKE+ YADYYER+LTNGVL IQRGT+PGVMIY+LPL G SK RSYH WGT
Sbjct: 413 KVSRHLFRWTKEVVYADYYERALTNGVLSIQRGTDPGVMIYMLPLGRGDSKARSYHGWGT 472
Query: 475 PSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPV 534
DSFWCCYGTGIESFSKLGDSIYFEEEGK P VYIIQYISS LDWKSGQIV+NQKVDPV
Sbjct: 473 KFDSFWCCYGTGIESFSKLGDSIYFEEEGKSPEVYIIQYISSSLDWKSGQIVLNQKVDPV 532
Query: 535 VSWDPYLRVTLTFSSK-GSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTK 593
VSWDPYLR TLTF+ K G+G ++++NLRIP W SS+GAKA++N QDLP+P+P +FLS+T+
Sbjct: 533 VSWDPYLRTTLTFTPKEGAGQSSTINLRIPVWASSSGAKASINAQDLPVPAPSSFLSLTR 592
Query: 594 TWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI-TESATSLS 652
WS DKLT+QLP+ LRTEAI+DDRP+YASIQAILYGPY+LAG + DWDI T SATSLS
Sbjct: 593 NWSPGDKLTLQLPIRLRTEAIKDDRPKYASIQAILYGPYLLAGLTSDDWDIKTGSATSLS 652
Query: 653 DWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDS 712
DWITPIPAS NS+L++ +QE GN+ FV +NSNQSITMEKFP+ GTDA+LHATFRL+L D+
Sbjct: 653 DWITPIPASDNSRLVSLSQESGNSSFVFSNSNQSITMEKFPEEGTDASLHATFRLVLKDA 712
Query: 713 SGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGG 772
+ + S D IGKSVMLEP D PGM+V+Q T+ L + +S +G S+FHLVAGLDG
Sbjct: 713 TSLKVLSPKDAIGKSVMLEPIDLPGMVVVQQGTNQNLGIANSAAGKG-SLFHLVAGLDGK 771
Query: 773 DRTVSLESETYKGCFVYTAVNLQSSESTKLGCISE--STEAGFNNAASFVIEKGLSEYHP 830
D TVSLESE+ K C+VY+ ++ S S KL +SE S++ FN A SF++++G+S+YHP
Sbjct: 772 DGTVSLESESQKDCYVYSGIDYNSGTSIKLKSLSESGSSDEDFNKATSFILKEGISQYHP 831
Query: 831 ISFVAKGANRNFLLAPLLSLRDESYTVYFDFQ 862
ISFVAKG RNFLL PLL LRDESYTVYF+ Q
Sbjct: 832 ISFVAKGMKRNFLLTPLLGLRDESYTVYFNIQ 863
>gi|224075776|ref|XP_002304762.1| predicted protein [Populus trichocarpa]
gi|222842194|gb|EEE79741.1| predicted protein [Populus trichocarpa]
Length = 858
Score = 1189 bits (3075), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 584/858 (68%), Positives = 694/858 (80%), Gaps = 26/858 (3%)
Query: 19 LIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWLSLMPR 78
++ S +KECTN +L+SH+FR LLSS+NE++ +++ H HL P+DDSAW SL+PR
Sbjct: 12 MLCSFGISKECTNIPTQLSSHSFRYELLSSQNETWKEEMFEHY-HLIPTDDSAWSSLLPR 70
Query: 79 KILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTN 138
KILREE++ SW M+YR +K+P + SG FL E+SLH+VRL S+HW+AQQTN
Sbjct: 71 KILREEDEH---SWEMMYRNLKSP-----LKSSGNFLNEMSLHNVRLDPSSIHWKAQQTN 122
Query: 139 LEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHN 198
LEYLLMLDV+ LVW+FRKTA PG+ YGGWE+P ELRGHFVGHYLSASA MWASTHN
Sbjct: 123 LEYLLMLDVNNLVWSFRKTAGSSTPGKAYGGWEKPDSELRGHFVGHYLSASAQMWASTHN 182
Query: 199 ESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQ 258
E+LK+KMSAVVSALSACQ ++G+GYLSAFP+E FDR EA+ PVWAPYYTIHKILAGLLDQ
Sbjct: 183 ETLKKKMSAVVSALSACQVKMGTGYLSAFPSELFDRFEAIKPVWAPYYTIHKILAGLLDQ 242
Query: 259 YTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQD 318
YT ADNA+AL+M WMV+YFYNRV+NVI YS+ERH+ +LNEE GGMNDVLYKLF IT D
Sbjct: 243 YTLADNAQALKMVKWMVDYFYNRVRNVITNYSVERHYLSLNEETGGMNDVLYKLFSITGD 302
Query: 319 PKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----- 373
PKHL+LAHLFDKPCFLGLLA+QADDISGFH+NTHIP+VIG+QMRYE+TGD L+K+
Sbjct: 303 PKHLVLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGAQMRYEITGDPLYKDIGAFF 362
Query: 374 ------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEI 427
H + GT++ F SDPKRLAS L + EESCTTYNMLKVSRHLFRWTKE+
Sbjct: 363 MDVVNSSHSYATGGTSVSE--FWSDPKRLASTLQTENEESCTTYNMLKVSRHLFRWTKEM 420
Query: 428 AYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGI 487
AYADYYER+LTNGVLGIQRGTEPGVMIY+LP PGSSK +SYH WGT DSFWCCYGTGI
Sbjct: 421 AYADYYERALTNGVLGIQRGTEPGVMIYMLPQYPGSSKAKSYHGWGTSYDSFWCCYGTGI 480
Query: 488 ESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTF 547
ESFSKLGDSIYF EEG+ PG+YIIQYISS LDWKSGQIV+NQKVDP+VS DPYLRVTLTF
Sbjct: 481 ESFSKLGDSIYF-EEGEAPGLYIIQYISSSLDWKSGQIVLNQKVDPIVSSDPYLRVTLTF 539
Query: 548 S-SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLP 606
S KG+ ++L LRIP WT+S GA AT+N Q L LP+PG+FLSV + W S DKLT+Q+P
Sbjct: 540 SPKKGTSQASTLYLRIPIWTNSEGATATINSQSLRLPAPGSFLSVNRKWRSSDKLTLQIP 599
Query: 607 LTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI-TESATSLSDWITPIPASYNSQ 665
++LRTEAI+D+R EYAS+QAILYGPY+LAGH+ GDW++ + S SLSD ITPIP SYN Q
Sbjct: 600 ISLRTEAIKDERHEYASVQAILYGPYLLAGHTSGDWNLKSGSGNSLSDSITPIPGSYNGQ 659
Query: 666 LITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFIG 725
L++F+QE G + FVLTNSNQSI+MEK P+SGTDA+L ATFRL+ DSS S+ SS+ D IG
Sbjct: 660 LVSFSQESGISTFVLTNSNQSISMEKLPESGTDASLQATFRLVFKDSSSSKLSSVKDVIG 719
Query: 726 KSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYKG 785
KSVMLEPF PGML++Q D +T+S GSS+F +V+GLDG D TVSLES G
Sbjct: 720 KSVMLEPFHLPGMLLVQQGKDRSFTLTNSADDDGSSIFRVVSGLDGKDGTVSLESGIQNG 779
Query: 786 CFVYTAVNLQSSESTKLGCIS-ESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLL 844
C+VY+ V+ +S +S KL C S S++ GFN ASFV+ KGLS+YHPISFVAKG RNFLL
Sbjct: 780 CYVYSGVDYKSGQSMKLSCKSGSSSDTGFNQGASFVMNKGLSQYHPISFVAKGDKRNFLL 839
Query: 845 APLLSLRDESYTVYFDFQ 862
APL SLRDESYT+YF+ Q
Sbjct: 840 APLHSLRDESYTIYFNIQ 857
>gi|359478753|ref|XP_002283032.2| PREDICTED: uncharacterized protein LOC100250068 [Vitis vinifera]
Length = 874
Score = 1176 bits (3043), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 571/859 (66%), Positives = 676/859 (78%), Gaps = 25/859 (2%)
Query: 20 IVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWLSLMPRK 79
+ K+CTN+ L+SHT R LL SKNES + +H +L +D S WL+ +PRK
Sbjct: 18 LCGCGLGKKCTNSGSPLSSHTLRYELLFSKNESRKAEALAHYSNLIRTDGSGWLTSLPRK 77
Query: 80 ILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNL 139
LREE++ FS AM Y+ +K+ + +FLKE SLHDVRLGSDS+HWRAQQTNL
Sbjct: 78 ALREEDE---FSRAMKYQTMKS-----YDGSNSKFLKEFSLHDVRLGSDSLHWRAQQTNL 129
Query: 140 EYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNE 199
EYLLMLD D+LVW+FR+TA LP P PYGGWE P ELRGHFVGHYLSASA MWASTHNE
Sbjct: 130 EYLLMLDADRLVWSFRRTAGLPTPCSPYGGWESPDGELRGHFVGHYLSASAQMWASTHNE 189
Query: 200 SLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQY 259
SLKEKMSAVV AL CQK++G+GYLSAFP+E FDR EAL VWAPYYTIHKILAGLLDQY
Sbjct: 190 SLKEKMSAVVCALGECQKKMGTGYLSAFPSELFDRFEALEEVWAPYYTIHKILAGLLDQY 249
Query: 260 TYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDP 319
T NA+AL+M TWMVEYFYNRVQNVI YSIERHW +LNEE GGMND LY L+ IT D
Sbjct: 250 TLGGNAQALKMVTWMVEYFYNRVQNVISSYSIERHWLSLNEETGGMNDFLYNLYRITGDQ 309
Query: 320 KHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK------- 372
KH +LAHLFDKPCFLGLLA+QADDISGFH+NTHIPIV+G+QMRYE+TGD L+K
Sbjct: 310 KHFVLAHLFDKPCFLGLLAMQADDISGFHANTHIPIVVGAQMRYEITGDPLYKTIGAFFI 369
Query: 373 ----EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIA 428
H + GT++ F SDPKR+A+ L + ESCTTYNMLKVSR+LFRWTKE+A
Sbjct: 370 DTVNSSHSYATGGTSVDE--FWSDPKRMATTLQTENAESCTTYNMLKVSRNLFRWTKEVA 427
Query: 429 YADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIE 488
YADYYER+LTNG+L IQRGT+PGVM+Y+LPL G+SK RSYH WGT SFWCCYGTGIE
Sbjct: 428 YADYYERALTNGILSIQRGTDPGVMLYMLPLGHGNSKARSYHGWGTKFHSFWCCYGTGIE 487
Query: 489 SFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFS 548
SFSKLGDSIYFEEEG+ PG+YIIQYISS LDWKSGQ+V+NQKVD VVSWDPYLR+TLTFS
Sbjct: 488 SFSKLGDSIYFEEEGEVPGLYIIQYISSSLDWKSGQVVLNQKVDTVVSWDPYLRITLTFS 547
Query: 549 SK---GSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQL 605
K G+G ++++NLRIP W S+GAKA +N Q LP+P+P +FLS + WS DDKLT+QL
Sbjct: 548 PKKMQGAGQSSAINLRIPVWAYSSGAKAAVNAQALPVPAPNSFLSFRRKWSPDDKLTLQL 607
Query: 606 PLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI-TESATSLSDWITPIPASYNS 664
P+ LRTEAI+DDRP+YA +QAILYGPY+L G + DWDI T+ A SLSDWITPIPAS+NS
Sbjct: 608 PIALRTEAIKDDRPKYACLQAILYGPYLLVGLTNNDWDIQTDLAASLSDWITPIPASHNS 667
Query: 665 QLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFI 724
LI+ +QE GN+ F TNSNQS+TME++P+SGTDA+L+ATFRLIL DS+ S+ SS D I
Sbjct: 668 HLISLSQESGNSSFAFTNSNQSLTMERYPESGTDASLNATFRLILEDSTSSKISSPKDAI 727
Query: 725 GKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYK 784
GK VMLEP + PGM V+Q T++ L +T+S GSS+FHLVAGLDG D TVSLES+T K
Sbjct: 728 GKFVMLEPINFPGMAVVQRGTNESLGITNSASVVGSSLFHLVAGLDGKDGTVSLESKTQK 787
Query: 785 GCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLL 844
GCFVY+ VN S + KL C S++ FN A SF ++ G+SEYHPISFVAKG R++LL
Sbjct: 788 GCFVYSDVNYDSGSAIKLKCKLASSDVVFNQATSFTLKHGISEYHPISFVAKGLRRDYLL 847
Query: 845 APLLSLRDESYTVYFDFQS 863
APLLSLRDESYTVYF+ Q+
Sbjct: 848 APLLSLRDESYTVYFNIQA 866
>gi|449448754|ref|XP_004142130.1| PREDICTED: uncharacterized protein LOC101207833 [Cucumis sativus]
Length = 868
Score = 1118 bits (2892), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 549/851 (64%), Positives = 671/851 (78%), Gaps = 24/851 (2%)
Query: 27 KECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWLSLMPRKILREEEQ 86
KECTN +L SHTFR LLSS N ++ K++ SH HLTP+DD AW +L+PRK+L+EE +
Sbjct: 28 KECTNTPTQLGSHTFRYELLSSGNVTWKKELFSHY-HLTPTDDFAWSNLLPRKMLKEENE 86
Query: 87 DELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLD 146
++W M+YR++KN ++P G LKE+SLHDVRL +S+H AQ TNL+YLLMLD
Sbjct: 87 ---YNWEMMYRQMKNKDGLRIP---GGMLKEISLHDVRLDPNSLHGTAQTTNLKYLLMLD 140
Query: 147 VDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMS 206
VD+L+W+FRKTA LP PGEPY GWE+ CELRGHFVGHYLSASA MWAST N LKEKMS
Sbjct: 141 VDRLLWSFRKTAGLPTPGEPYVGWEKSDCELRGHFVGHYLSASAQMWASTGNSVLKEKMS 200
Query: 207 AVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAE 266
A+VS L+ CQ ++G+GYLSAFP+E+FDR EA+ PVWAPYYTIHKILAGLLDQYT+A N++
Sbjct: 201 ALVSGLATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAGLLDQYTFAGNSQ 260
Query: 267 ALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAH 326
AL+M TWMVEYFYNRVQNVI KY++ERH+++LNEE GGMNDVLY+L+ IT + KHL+LAH
Sbjct: 261 ALKMVTWMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYRITGNTKHLLLAH 320
Query: 327 LFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GH 375
LFDKPCFLGLLA+QA+DISGFH NTHIPIV+GSQMRYEVTGD L+KE H
Sbjct: 321 LFDKPCFLGLLAVQAEDISGFHVNTHIPIVVGSQMRYEVTGDPLYKEISTYFMDIVNSSH 380
Query: 376 QLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
+ GT++ F DPKRLA L + TEESCTTYNMLKVSR+LF+WTKEIAYADYYER
Sbjct: 381 SYATGGTSV--HEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTKEIAYADYYER 438
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+LTNGVL IQRGT+PGVMIY+LPL GSSK SYH WGTP +SFWCCYGTGIESFSKLGD
Sbjct: 439 ALTNGVLSIQRGTDPGVMIYMLPLGSGSSKAISYHGWGTPFESFWCCYGTGIESFSKLGD 498
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSK-GSGL 554
SIYFEEE + P +Y+IQYISS LDWKSG +++NQ VDP+ S DP LR+TLTFS K GS
Sbjct: 499 SIYFEEELQTPTLYVIQYISSSLDWKSGNVLLNQTVDPIHSEDPKLRMTLTFSPKVGSVH 558
Query: 555 TTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
++++NLRIP+WTS++GAK LNGQ L GNF SVT +WSS +KL+++LP+ LRTEAI
Sbjct: 559 SSTINLRIPSWTSASGAKVVLNGQSLGNNINGNFKSVTNSWSSGNKLSLELPINLRTEAI 618
Query: 615 QDDRPEYASIQAILYGPYVLAGHSIGDWDI-TESATSLSDWITPIPASYNSQLITFTQEY 673
DDR EYAS++AIL+GPY+LA +S GDW+I T+ A SLSDWIT +P++YN+ L+TF+Q
Sbjct: 619 DDDRSEYASVKAILFGPYLLAAYSNGDWEIKTQQADSLSDWITHVPSAYNTFLVTFSQAS 678
Query: 674 GNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFIGKSVMLEPF 733
G T F LTNSNQSITMEK+P GTD+A+HATFRLI++D S ++ + L D IGK VMLEPF
Sbjct: 679 GKTSFALTNSNQSITMEKYPGQGTDSAVHATFRLIIDDPS-AKVTELQDVIGKRVMLEPF 737
Query: 734 DSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYKGCFVYTAVN 793
PGM++ D+ L + D+ SS F+LV GLDG + TVSL S +GCFVY+ VN
Sbjct: 738 SFPGMVLGNKGKDERLEIADANSEGHSSDFYLVEGLDGKNGTVSLASIDNEGCFVYSGVN 797
Query: 794 LQSSESTKLGCISE-STEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRD 852
+S KL C S+ S + GF+ A+SF++E G S+YHPISFV KG RNFLLAPLLS D
Sbjct: 798 YESGAQLKLSCKSKLSLDDGFDEASSFLLESGASQYHPISFVTKGMTRNFLLAPLLSFVD 857
Query: 853 ESYTVYFDFQS 863
ESYTVYF+F +
Sbjct: 858 ESYTVYFNFNA 868
>gi|356541181|ref|XP_003539059.1| PREDICTED: uncharacterized protein LOC100781521 [Glycine max]
Length = 854
Score = 1114 bits (2881), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 552/867 (63%), Positives = 669/867 (77%), Gaps = 32/867 (3%)
Query: 11 FKFLLTFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDS 70
F F+ +L+ AKECTN + SHTFR LL SKN ++ ++ H HLTP+D++
Sbjct: 4 FVFVFVAILLCGCVAAKECTNIPTQ--SHTFRYELLMSKNATWKAEVMDHY-HLTPTDET 60
Query: 71 AWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGE-FLKEVSLHDVRLGSDS 129
W L+PRK L E+ Q + W ++YRKIKN G FK SGE FLKEV L DVRL DS
Sbjct: 61 VWADLLPRKFLSEQNQHD---WGVMYRKIKNMGVFK----SGEGFLKEVPLQDVRLHKDS 113
Query: 130 MHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSAS 189
+H RAQQTNLEYLLMLDVD L+W+FRKTA L PG PYGGWE P ELRGHFVGHYLSAS
Sbjct: 114 IHARAQQTNLEYLLMLDVDSLIWSFRKTAGLSTPGTPYGGWEGPEVELRGHFVGHYLSAS 173
Query: 190 ALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIH 249
ALMWAST N++LK+KMS++V+ LSACQ++IG+GYLSAFP+E FDR E + PVWAPYYTIH
Sbjct: 174 ALMWASTQNDTLKQKMSSLVAGLSACQEKIGTGYLSAFPSEFFDRFETVQPVWAPYYTIH 233
Query: 250 KILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVL 309
KILAGLLDQ+T+A N +AL+M TWMV+YFYNRVQNVI KY++ RH+++LNEE GGMNDVL
Sbjct: 234 KILAGLLDQHTFAGNPQALKMVTWMVDYFYNRVQNVITKYTVNRHYESLNEETGGMNDVL 293
Query: 310 YKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQ 369
Y+L+ IT D KHL+LAHLFDKPCFLGLLA+QA+DI+ FH+NTHIP+V+GSQMRYE+TGD
Sbjct: 294 YRLYSITGDSKHLVLAHLFDKPCFLGLLAMQANDIANFHANTHIPVVVGSQMRYEITGDP 353
Query: 370 LHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSN-TEESCTTYNMLKVS 417
L+K+ H + GT++ F SDPKR+A NL + EESCTTYNMLKVS
Sbjct: 354 LYKQIGTFFMDLVNSSHSYATGGTSVSE--FWSDPKRIADNLRTTENEESCTTYNMLKVS 411
Query: 418 RHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSD 477
RHLFRWTKE++YADYYER+LTNGVL IQRGT+PGVMIY+LPL SK R+ H WGT D
Sbjct: 412 RHLFRWTKEVSYADYYERALTNGVLSIQRGTDPGVMIYMLPLGFAVSKARTGHSWGTQFD 471
Query: 478 SFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSW 537
SFWCCYGTGIESFSKLGDSIYFEEEGK P +YIIQYI S +WKSG+I++NQ V PV S
Sbjct: 472 SFWCCYGTGIESFSKLGDSIYFEEEGKDPTLYIIQYIPSSFNWKSGKILLNQTVVPVASS 531
Query: 538 DPYLRVTLTFSS-KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWS 596
DPYLRVT TFS + + ++LN R+P+WT +GAK LNGQ L LP+PG +LSVT+ WS
Sbjct: 532 DPYLRVTFTFSPVEVTNTLSTLNFRLPSWTLLDGAKGILNGQTLSLPNPGKYLSVTRQWS 591
Query: 597 SDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSI-GDWDITESATSLSDWI 655
DKLT+QLPLT+RTEAI+DDRPEYAS+QAILYGPY+LAGH+ GDWD+ A + +DWI
Sbjct: 592 GSDKLTLQLPLTVRTEAIKDDRPEYASVQAILYGPYLLAGHTTGGDWDLKAGANN-ADWI 650
Query: 656 TPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGS 715
TPIPASYNSQL++F +++ + FVLTNSN+S++M+K P+ GTD L ATFR++L DSS S
Sbjct: 651 TPIPASYNSQLVSFFRDFEGSTFVLTNSNKSVSMQKLPEYGTDLTLQATFRIVLKDSS-S 709
Query: 716 EFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRT 775
+FS+L D +SVMLEPFD PGM VI L++ DS SSVF LV GLDG + T
Sbjct: 710 KFSTLADANDRSVMLEPFDFPGMNVIHQGAGKPLLIADSSHGGPSSVFLLVPGLDGRNET 769
Query: 776 VSLESETYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVA 835
VSLES++ KGC+VY+ + S KL C S+S +A FN A SFV +GLS+Y+PISFVA
Sbjct: 770 VSLESQSNKGCYVYSG--MSPSSGVKLSCKSDS-DATFNKATSFVALQGLSQYNPISFVA 826
Query: 836 KGANRNFLLAPLLSLRDESYTVYFDFQ 862
KG NRNFLL PLLS RDE YTVYF+ Q
Sbjct: 827 KGTNRNFLLQPLLSFRDEHYTVYFNIQ 853
>gi|356541912|ref|XP_003539416.1| PREDICTED: uncharacterized protein LOC100783150 [Glycine max]
Length = 854
Score = 1112 bits (2875), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 553/865 (63%), Positives = 671/865 (77%), Gaps = 32/865 (3%)
Query: 13 FLLTFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAW 72
F L +L+ AKECTN + SHTFR LL S N ++ ++ H HLTP+D++AW
Sbjct: 6 FALVAILLCGCDAAKECTNIPTQ--SHTFRYELLMSTNATWKAEVMDHY-HLTPTDETAW 62
Query: 73 LSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGE-FLKEVSLHDVRLGSDSMH 131
L+PRK+L E+ Q + W ++YRKIKN G FK SGE FLKEV L DVRL DS+H
Sbjct: 63 ADLLPRKLLSEQNQHD---WGVMYRKIKNMGVFK----SGEGFLKEVPLQDVRLHKDSIH 115
Query: 132 WRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASAL 191
RAQQTNLEYLLMLDVD L+W+FRKTA L PG PYGGWE P ELRGHFVGHYLSASAL
Sbjct: 116 GRAQQTNLEYLLMLDVDSLIWSFRKTAALSTPGTPYGGWEGPEVELRGHFVGHYLSASAL 175
Query: 192 MWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKI 251
MWAST N++LK+KMS++V+ LSACQ++IG+GYLSAFP+E FDR EA+ PVWAPYYTIHKI
Sbjct: 176 MWASTQNDTLKQKMSSLVAGLSACQEKIGTGYLSAFPSEFFDRFEAVQPVWAPYYTIHKI 235
Query: 252 LAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYK 311
LAGLLDQ+T+A N +AL+M TWMV+YFYNRVQNVI KY++ RH+Q++NEE GGMNDVLY+
Sbjct: 236 LAGLLDQHTFAGNPQALKMVTWMVDYFYNRVQNVITKYTVNRHYQSMNEETGGMNDVLYR 295
Query: 312 LFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLH 371
L+ IT D KHL+LAHLFDKPCFLGLLA+QA+DI+ H+NTHIPIV+GSQMRYE+TGD L+
Sbjct: 296 LYSITGDSKHLVLAHLFDKPCFLGLLAVQANDIADLHANTHIPIVVGSQMRYEITGDPLY 355
Query: 372 KE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSN-TEESCTTYNMLKVSRH 419
K+ H + GT++ F SDPKR+A NL + EESCTTYNMLKVSRH
Sbjct: 356 KQIGTFFMDLVNSSHSYATGGTSVRE--FWSDPKRIADNLRTTENEESCTTYNMLKVSRH 413
Query: 420 LFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSF 479
LFRWTKE++YADYYER+LTNGVL IQRGT+PGVMIY+LPL SK R+ H WGT DSF
Sbjct: 414 LFRWTKEVSYADYYERALTNGVLSIQRGTDPGVMIYMLPLGFAVSKARTGHSWGTQFDSF 473
Query: 480 WCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDP 539
WCCYGTGIESFSKLGDSIYFEEEGK P +YIIQYISS +WKSG+I++NQ V P S DP
Sbjct: 474 WCCYGTGIESFSKLGDSIYFEEEGKDPTLYIIQYISSSFNWKSGKILLNQTVVPASSSDP 533
Query: 540 YLRVTLTFSS-KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSD 598
YLRVT TFS + + ++LN R+P+WT +GAK LNGQ L LP+PGN+LS+T+ WS+
Sbjct: 534 YLRVTFTFSPVEVTNTLSTLNFRLPSWTLLDGAKGILNGQTLSLPNPGNYLSITRQWSAS 593
Query: 599 DKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSI-GDWDITESATSLSDWITP 657
DKLT+QLPLT+RTEAI+DDRPEYAS+QAILYGPY+LAGH+ GDW++ A + +DWITP
Sbjct: 594 DKLTLQLPLTVRTEAIKDDRPEYASVQAILYGPYLLAGHTTGGDWNLKAGANN-ADWITP 652
Query: 658 IPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEF 717
IPASYNSQL++F +++ + FVL NSNQS++M+K P+ GTD AL ATFR++L +SS S+F
Sbjct: 653 IPASYNSQLVSFFRDFEGSTFVLANSNQSVSMQKLPEFGTDLALQATFRIVLEESS-SKF 711
Query: 718 SSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVS 777
S L D +SVMLEPFD PGM VI L+ DS S+VF LV GLDG + TVS
Sbjct: 712 SKLADANDRSVMLEPFDLPGMNVIHQGAGKPLLTVDSSQGGPSAVFLLVPGLDGRNETVS 771
Query: 778 LESETYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKG 837
LES++ KGC+VY+ + S KL C S+S +A FN AASFV +GLS+Y+PISFVAKG
Sbjct: 772 LESQSNKGCYVYSG--MSPSAGVKLSCKSDS-DATFNQAASFVALQGLSQYNPISFVAKG 828
Query: 838 ANRNFLLAPLLSLRDESYTVYFDFQ 862
ANRNFLL PLLS RDE YTVYF+ Q
Sbjct: 829 ANRNFLLQPLLSFRDEHYTVYFNIQ 853
>gi|15239944|ref|NP_196799.1| uncharacterized protein [Arabidopsis thaliana]
gi|7630051|emb|CAB88259.1| putative protein [Arabidopsis thaliana]
gi|26451123|dbj|BAC42665.1| unknown protein [Arabidopsis thaliana]
gi|332004451|gb|AED91834.1| uncharacterized protein [Arabidopsis thaliana]
Length = 861
Score = 1097 bits (2838), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 534/873 (61%), Positives = 661/873 (75%), Gaps = 31/873 (3%)
Query: 5 MCSIGFFKFLLTFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHL 64
+ +I + +F+L+ + AKECTN +L+SHTFRS LL SKNE+ ++ SH HL
Sbjct: 6 IITIALLLYTSSFVLV---SVAKECTNTPTQLSSHTFRSELLQSKNETLKTELFSHY-HL 61
Query: 65 TPSDDSAWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVR 124
TP+DDSAW SL+PRK+L+EE + F+W MLYRK FK SG FLK+VSLHDVR
Sbjct: 62 TPADDSAWSSLLPRKMLKEEADE--FAWTMLYRK------FKDSNSSGNFLKDVSLHDVR 113
Query: 125 LGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGH 184
L DS HWRAQQTNLEYLLMLDVD L W+FRK A L APG+ YGGWE P ELRGHFVGH
Sbjct: 114 LDPDSFHWRAQQTNLEYLLMLDVDGLAWSFRKEAGLDAPGDYYGGWERPDSELRGHFVGH 173
Query: 185 YLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAP 244
YLSA+A MWASTHN++LKEKMSA+VSALS CQ++ G+GYLSAFP+ FDR EA+ PVWAP
Sbjct: 174 YLSATAYMWASTHNDTLKEKMSALVSALSECQQKSGTGYLSAFPSSFFDRFEAITPVWAP 233
Query: 245 YYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGG 304
YYTIHKILAGL+DQY A N++AL+M T M +YFY RV+NVI+KYS+ERHWQ+LNEE GG
Sbjct: 234 YYTIHKILAGLVDQYKLAGNSQALKMATGMADYFYGRVRNVIRKYSVERHWQSLNEETGG 293
Query: 305 MNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYE 364
MNDVLY+L+ IT D K+L+LAHLFDKPCFLG+LA+QADDISGFH+NTHIPIV+GSQ RYE
Sbjct: 294 MNDVLYQLYSITGDSKYLLLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYE 353
Query: 365 VTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNM 413
+TGD LHKE H + GT++ F DPKR+A+ L + EESCTTYNM
Sbjct: 354 ITGDLLHKEISMFFMDIFNASHSYATGGTSVSE--FWQDPKRMATALQTENEESCTTYNM 411
Query: 414 LKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWG 473
LKVSR+LFRWTKE++YADYYER+LTNGVLGIQRGT+PG+MIY+LPL G SK +YH WG
Sbjct: 412 LKVSRNLFRWTKEVSYADYYERALTNGVLGIQRGTQPGLMIYMLPLGKGVSKAVTYHGWG 471
Query: 474 TPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDP 533
TP DSFWCCYGTGIESFSKLGDSIYF+E+G P +Y+ QYISS LDWKS + ++QKV+P
Sbjct: 472 TPYDSFWCCYGTGIESFSKLGDSIYFQEDGATPALYVTQYISSSLDWKSAGLSISQKVNP 531
Query: 534 VVSWDPYLRVTLTFSSKGSGLT--TSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSV 591
VVSWDPY+RVT T SS G+ ++LNLRIP WT+S GAK +LNG+ L +P+ GNFLS+
Sbjct: 532 VVSWDPYMRVTFTLSSSKVGVAKESTLNLRIPVWTNSIGAKVSLNGRPLNVPTSGNFLSI 591
Query: 592 TKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSL 651
+ W S D++T++LP+++RTEAI+DDRPEYAS+QAILYGPY+LAGH+ DW IT A
Sbjct: 592 KQKWKSGDQVTMELPMSIRTEAIKDDRPEYASLQAILYGPYLLAGHTSRDWSITTQAKP- 650
Query: 652 SDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILND 711
WITPIP + NS L+T +Q+ GN +V +NSNQ+ITM P+ GT A+ ATFRL+ D
Sbjct: 651 GKWITPIPETQNSYLVTLSQQSGNVSYVFSNSNQTITMRVSPEPGTQDAVAATFRLV-TD 709
Query: 712 SSGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIA-QGSSVFHLVAGLD 770
+S S IG+ VMLEPFD PGM+V Q TD L V S + +G+S F LV+GLD
Sbjct: 710 NSKPRISGPEGLIGRLVMLEPFDFPGMIVKQ-ATDSSLTVQASSPSDKGASSFRLVSGLD 768
Query: 771 GGDRTVSLESETYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHP 830
G +VSL E+ KGCFVY+ L+ +L C S++T+ F AASF ++ G+ +Y+P
Sbjct: 769 GKLGSVSLRLESKKGCFVYSDQTLKQGTKLRLECGSDATDEKFKEAASFSLKTGMHQYNP 828
Query: 831 ISFVAKGANRNFLLAPLLSLRDESYTVYFDFQS 863
+SFV G RNF+L+PL SLRDE+Y VYF Q+
Sbjct: 829 MSFVMSGTQRNFVLSPLFSLRDETYNVYFSVQT 861
>gi|297807309|ref|XP_002871538.1| hypothetical protein ARALYDRAFT_350453 [Arabidopsis lyrata subsp.
lyrata]
gi|297317375|gb|EFH47797.1| hypothetical protein ARALYDRAFT_350453 [Arabidopsis lyrata subsp.
lyrata]
Length = 860
Score = 1079 bits (2791), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 525/864 (60%), Positives = 650/864 (75%), Gaps = 28/864 (3%)
Query: 14 LLTFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWL 73
LL F V AKECT+ +L+SHT RS LL S+NE+ ++ SH HLTP+DD+AW
Sbjct: 11 LLLFTSFVLVCVAKECTDIPTKLSSHTLRSELLQSQNETLKTELSSHY-HLTPTDDAAWS 69
Query: 74 SLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWR 133
+L+PRK+L+EE D F+W MLYRK FK SG FLK+VSLHDVRL S HWR
Sbjct: 70 TLLPRKMLKEETDD--FAWTMLYRK------FKDSNSSGNFLKDVSLHDVRLDPSSFHWR 121
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
AQQTNLEYLLML+VD L ++FRK A L APG PYGGWE+P ELRGHFVGHYLSA+A MW
Sbjct: 122 AQQTNLEYLLMLNVDGLAYSFRKVAGLDAPGVPYGGWEKPDSELRGHFVGHYLSATAYMW 181
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILA 253
ASTHN++LK KMSA+VSAL+ CQ++ G+GYLSAFP+ FDR EA+ VWAPYYTIHKILA
Sbjct: 182 ASTHNDTLKTKMSALVSALAECQQKSGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILA 241
Query: 254 GLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLF 313
GL+DQY A N +AL+M T M +YFY RVQNVI+KYS+ERHW +LNEE GGMNDVLY+L+
Sbjct: 242 GLVDQYKLAGNTQALKMATGMADYFYGRVQNVIRKYSVERHWLSLNEETGGMNDVLYQLY 301
Query: 314 CITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE 373
IT+D K+L LAHLFDKPCFLG+LA+QADDISGFH+NTHIPIV+GSQ RYE+TGD LHKE
Sbjct: 302 SITRDSKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKE 361
Query: 374 -----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFR 422
H + GT++ F DPKR+A+ L + EESCTTYNMLKVSR+LFR
Sbjct: 362 ISMFFMDIVNASHSYATGGTSVKE--FWQDPKRMATTLQTENEESCTTYNMLKVSRNLFR 419
Query: 423 WTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCC 482
WTKE++YADYYER+LTNGVLGIQRGT+PG MIY+LPL G SK +YH WGTP DSFWCC
Sbjct: 420 WTKEVSYADYYERALTNGVLGIQRGTQPGRMIYMLPLGQGVSKAVTYHGWGTPYDSFWCC 479
Query: 483 YGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLR 542
YGTGIESFSKLGDSIYF+E+G P +Y+ QYISS LDWKS ++++QKV+PVVSWDPY+R
Sbjct: 480 YGTGIESFSKLGDSIYFQEDGASPALYVTQYISSSLDWKSAGLLLSQKVNPVVSWDPYMR 539
Query: 543 VTLTFSSKGSGLT--TSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDK 600
VT T SS G+ ++LNLRIP WT+S GAK +LNG+ L +P+ GNFLS+ + W S D+
Sbjct: 540 VTFTLSSSKVGVAKKSTLNLRIPVWTNSIGAKVSLNGKPLKVPTSGNFLSIKQNWKSGDQ 599
Query: 601 LTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPA 660
+T++LP+++RTEAI+DDRPEYAS+QAILYGPY+LAGH+ DW IT A + +WITPIP
Sbjct: 600 VTMELPMSIRTEAIKDDRPEYASLQAILYGPYLLAGHTSRDWSITTQAKA-GNWITPIPE 658
Query: 661 SYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSL 720
+YNS L+T +Q+ GN +VL+N+NQ+ITM P+ GT A+ ATFRL+ D+S S
Sbjct: 659 TYNSHLVTLSQQSGNISYVLSNTNQTITMRVSPELGTQDAVAATFRLV-TDNSKPRISGP 717
Query: 721 NDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIA-QGSSVFHLVAGLDGGDRTVSLE 779
IG VMLEPFD PGM+V Q TD L V S + +G+S F LV+G+DG +VSL
Sbjct: 718 EALIGSLVMLEPFDFPGMIVKQ-ATDSSLTVQASSPSDKGASSFRLVSGVDGKPGSVSLR 776
Query: 780 SETYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGAN 839
E+ GCFVY+ L+ KL C +T+ F AASF + G+++Y+P+SFV G
Sbjct: 777 LESNNGCFVYSDQTLKQGTKLKLECGPVATDEKFKEAASFKLNTGMNQYNPMSFVMSGTQ 836
Query: 840 RNFLLAPLLSLRDESYTVYFDFQS 863
RNF+L+PL SLRDE+Y VYF Q+
Sbjct: 837 RNFVLSPLFSLRDETYNVYFSVQT 860
>gi|356557388|ref|XP_003546998.1| PREDICTED: uncharacterized protein LOC100815634 [Glycine max]
Length = 841
Score = 1078 bits (2787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 537/866 (62%), Positives = 660/866 (76%), Gaps = 44/866 (5%)
Query: 13 FLLTFLLIV--SAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDS 70
FL F+ IV A KECTN + SHTFR L +S NE++ I SHN HLT DD
Sbjct: 3 FLFAFVAIVVWGCAAGKECTNN--DAQSHTFRYQLSTSTNETW--NIMSHN-HLTTKDDH 57
Query: 71 AWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSM 130
L+PRK+L+EE Q L + RKI+ G K P++ FLK VSLHDVRL S+
Sbjct: 58 LLADLLPRKLLKEENQRNL----DMLRKIEKVGVLKPPQQPQGFLKPVSLHDVRLNQGSI 113
Query: 131 HWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASA 190
H +AQ+TNLEYLLML+VD+L+W+FRKTA LP PG PYGGWE+P ELRGHFVGHYLSASA
Sbjct: 114 HAQAQRTNLEYLLMLNVDRLLWSFRKTAGLPTPGTPYGGWEDPKMELRGHFVGHYLSASA 173
Query: 191 LMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHK 250
LMWASTHN+SLK+KMSA+V+ LS CQ++IG+GYLSAFP+E FDRLEA VWAPYYT HK
Sbjct: 174 LMWASTHNDSLKKKMSALVANLSICQEKIGTGYLSAFPSEFFDRLEATKYVWAPYYTTHK 233
Query: 251 ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLY 310
ILAGLLDQ++ A+N +AL+M TWMV+YFYNRVQNVI K+SI RH+Q+LNEE GGMNDVLY
Sbjct: 234 ILAGLLDQHSIAENPQALKMVTWMVDYFYNRVQNVITKFSISRHYQSLNEETGGMNDVLY 293
Query: 311 KLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQL 370
KL+ IT DP+HL+LAHLFDKPCFLGLLA++A+DI+ FH+NTHIP+++GSQMRYEVTGD L
Sbjct: 294 KLYSITGDPRHLLLAHLFDKPCFLGLLAVKANDIAHFHANTHIPVIVGSQMRYEVTGDPL 353
Query: 371 HKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDS-NTEESCTTYNMLKVSR 418
+KE H + GT++ F SDPKR+A L+S + EESCTTYNMLKVSR
Sbjct: 354 YKEIGTLFMDLVNSSHTYATGGTSVNE--FWSDPKRMADTLESTDNEESCTTYNMLKVSR 411
Query: 419 HLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDS 478
HLF WTK+++YADYYER+LTNGVL IQRGTEPGVMIY+LP G SK ++Y WGT DS
Sbjct: 412 HLFTWTKKVSYADYYERALTNGVLSIQRGTEPGVMIYMLPQGRGVSKAKTYFGWGTKFDS 471
Query: 479 FWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWD 538
FWCCYGTGIESFSKLGDSIYFEE+G+ P +YIIQYISS +WKSGQI++NQ V P SWD
Sbjct: 472 FWCCYGTGIESFSKLGDSIYFEEQGENPTLYIIQYISSLFNWKSGQIILNQTVVPPASWD 531
Query: 539 PYLRVTLTFS-SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSS 597
P+LRV+ TFS +K +G ++LN R+PT NG K LN + L LP PGNFLS+T+ W++
Sbjct: 532 PFLRVSFTFSPAKKTGALSTLNFRLPTRMHKNGEKGILNNETLTLPGPGNFLSITRKWNA 591
Query: 598 DDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESA-TSLSDWIT 656
DKL++QLPLTLR EAI+DDR +YASIQAILYGPY+LAGH+ GDW+I +A S++DWIT
Sbjct: 592 GDKLSLQLPLTLRAEAIKDDRTKYASIQAILYGPYLLAGHTTGDWNIKTAANASIADWIT 651
Query: 657 PIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSE 716
PIPASYN L F+Q + N+ FVLTNSNQS+ ++K P+ GTD+AL ATFR+I SS ++
Sbjct: 652 PIPASYNIHLFYFSQAFANSTFVLTNSNQSLAVKKVPEPGTDSALGATFRVIQGKSS-TK 710
Query: 717 FSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTV 776
F++L D IGKSVMLEPFD PGM + SSVF +V GLDG T+
Sbjct: 711 FTTLTDAIGKSVMLEPFDHPGMQALPS-------------GGPSSVFVVVPGLDGRKETI 757
Query: 777 SLESETYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAK 836
SLES+++ GCFV++ L+S KL C + S +A FN AASF+ ++G+S+Y+PISFVAK
Sbjct: 758 SLESKSHNGCFVHSG--LRSGRGVKLSCKTTS-DATFNQAASFIAKRGISKYNPISFVAK 814
Query: 837 GANRNFLLAPLLSLRDESYTVYFDFQ 862
G NRNFLL PLL+ RDESYTVYF+ +
Sbjct: 815 GENRNFLLEPLLAFRDESYTVYFNIK 840
>gi|30684197|ref|NP_196800.2| uncharacterized protein [Arabidopsis thaliana]
gi|28393685|gb|AAO42255.1| unknown protein [Arabidopsis thaliana]
gi|332004452|gb|AED91835.1| uncharacterized protein [Arabidopsis thaliana]
Length = 865
Score = 1075 bits (2780), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 530/862 (61%), Positives = 650/862 (75%), Gaps = 31/862 (3%)
Query: 16 TFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWLSL 75
+FLL+ AKECT+ +L+SHT RS LL S+N + + SH HLTP+DDSAW +L
Sbjct: 21 SFLLV---CLAKECTDIPTKLSSHTLRSELLQSQNANLKSEEFSHY-HLTPTDDSAWSTL 76
Query: 76 MPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQ 135
+PRK+L+EE D F+W MLYRK FK SG FLK+VSLHDVRL S HWRAQ
Sbjct: 77 LPRKMLKEETDD--FAWTMLYRK------FKDSNSSGNFLKDVSLHDVRLDPSSFHWRAQ 128
Query: 136 QTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWAS 195
QTNLEYLLMLDVD L +NFRK A L APG PYGGWE+P ELRGHFVGHYLSA+A MWAS
Sbjct: 129 QTNLEYLLMLDVDGLAYNFRKEAGLNAPGVPYGGWEKPDSELRGHFVGHYLSATAYMWAS 188
Query: 196 THNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGL 255
THNE+LK KM+A+VSAL+ CQ++ G+GYLSAFP+ FDR EA+ VWAPYYTIHKILAGL
Sbjct: 189 THNETLKAKMTALVSALAECQQKYGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAGL 248
Query: 256 LDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCI 315
+DQY A N +AL+M T M +YFY RVQNVIKKYS+ERHW +LNEE GGMNDVLY+L+ I
Sbjct: 249 VDQYKLAGNTQALKMATGMADYFYGRVQNVIKKYSVERHWLSLNEETGGMNDVLYQLYSI 308
Query: 316 TQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-- 373
T+D K+L LAHLFDKPCFLG+LA+QADDISGFH+NTHIPIV+GSQ RYE+TGD LHKE
Sbjct: 309 TRDSKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEIP 368
Query: 374 ---------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWT 424
H + GT++ F DPKR+A+ L + EESCTTYNMLKVSR+LFRWT
Sbjct: 369 MFFMDIVNASHSYATGGTSVKE--FWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWT 426
Query: 425 KEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYG 484
KE++YADYYER+LTNGVLGIQRGT+PG MIY+LPL G SK +YH WGTP DSFWCCYG
Sbjct: 427 KEVSYADYYERALTNGVLGIQRGTDPGRMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYG 486
Query: 485 TGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVT 544
TGIESFSKLGDSIYF+E+G P +Y+ QYISS LDWKS + ++QKV+PVVSWDPY+RVT
Sbjct: 487 TGIESFSKLGDSIYFQEDGATPALYVTQYISSSLDWKSAGLSISQKVNPVVSWDPYMRVT 546
Query: 545 LTFSSKGSGLT--TSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLT 602
T SS G+ ++LNLRIP WT+S GAK +LNG+ L +P+ GNFLS+ + W S D++T
Sbjct: 547 FTLSSSKVGVAKESTLNLRIPVWTNSIGAKVSLNGRPLNVPTSGNFLSIKQKWKSGDQVT 606
Query: 603 IQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASY 662
++LP+++RTEAI+DDRPEYAS+QAILYGPY+LAGH+ DW IT A + +WITPIP +
Sbjct: 607 MELPMSIRTEAIKDDRPEYASLQAILYGPYLLAGHTSMDWSITTQAKA-GNWITPIPETL 665
Query: 663 NSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLND 722
NS L+T +Q+ GN +VL+NSNQ+I M+ P+ GT A+ ATFRL+ +DS SS
Sbjct: 666 NSHLVTLSQQSGNISYVLSNSNQTIIMKVSPEPGTQDAVSATFRLVTDDSK-HPISSPEG 724
Query: 723 FIGKSVMLEPFDSPGMLVIQHETDDELVV-TDSFIAQGSSVFHLVAGLDGGDRTVSLESE 781
IG VMLEPFD PGM+V Q TD L V S +GSS F LV+GLDG +VSL E
Sbjct: 725 LIGSLVMLEPFDFPGMIVKQ-ATDSSLTVQASSPSDKGSSSFRLVSGLDGKPGSVSLSLE 783
Query: 782 TYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRN 841
+ KGCFVY+ L+ +L C S +T+ F AASF ++ G+++Y+P+SFV G RN
Sbjct: 784 SKKGCFVYSDQTLKQGTKLRLECGSAATDEKFKQAASFSLKTGMNQYNPMSFVMSGTQRN 843
Query: 842 FLLAPLLSLRDESYTVYFDFQS 863
F+L+PL SLRDE+Y VYF Q+
Sbjct: 844 FVLSPLFSLRDETYNVYFSVQA 865
>gi|7630052|emb|CAB88260.1| putative protein [Arabidopsis thaliana]
Length = 860
Score = 1074 bits (2777), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 530/862 (61%), Positives = 650/862 (75%), Gaps = 31/862 (3%)
Query: 16 TFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWLSL 75
+FLL+ AKECT+ +L+SHT RS LL S+N + + SH HLTP+DDSAW +L
Sbjct: 16 SFLLV---CLAKECTDIPTKLSSHTLRSELLQSQNANLKSEEFSHY-HLTPTDDSAWSTL 71
Query: 76 MPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQ 135
+PRK+L+EE D F+W MLYRK FK SG FLK+VSLHDVRL S HWRAQ
Sbjct: 72 LPRKMLKEETDD--FAWTMLYRK------FKDSNSSGNFLKDVSLHDVRLDPSSFHWRAQ 123
Query: 136 QTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWAS 195
QTNLEYLLMLDVD L +NFRK A L APG PYGGWE+P ELRGHFVGHYLSA+A MWAS
Sbjct: 124 QTNLEYLLMLDVDGLAYNFRKEAGLNAPGVPYGGWEKPDSELRGHFVGHYLSATAYMWAS 183
Query: 196 THNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGL 255
THNE+LK KM+A+VSAL+ CQ++ G+GYLSAFP+ FDR EA+ VWAPYYTIHKILAGL
Sbjct: 184 THNETLKAKMTALVSALAECQQKYGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAGL 243
Query: 256 LDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCI 315
+DQY A N +AL+M T M +YFY RVQNVIKKYS+ERHW +LNEE GGMNDVLY+L+ I
Sbjct: 244 VDQYKLAGNTQALKMATGMADYFYGRVQNVIKKYSVERHWLSLNEETGGMNDVLYQLYSI 303
Query: 316 TQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-- 373
T+D K+L LAHLFDKPCFLG+LA+QADDISGFH+NTHIPIV+GSQ RYE+TGD LHKE
Sbjct: 304 TRDSKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEIP 363
Query: 374 ---------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWT 424
H + GT++ F DPKR+A+ L + EESCTTYNMLKVSR+LFRWT
Sbjct: 364 MFFMDIVNASHSYATGGTSVKE--FWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWT 421
Query: 425 KEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYG 484
KE++YADYYER+LTNGVLGIQRGT+PG MIY+LPL G SK +YH WGTP DSFWCCYG
Sbjct: 422 KEVSYADYYERALTNGVLGIQRGTDPGRMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYG 481
Query: 485 TGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVT 544
TGIESFSKLGDSIYF+E+G P +Y+ QYISS LDWKS + ++QKV+PVVSWDPY+RVT
Sbjct: 482 TGIESFSKLGDSIYFQEDGATPALYVTQYISSSLDWKSAGLSISQKVNPVVSWDPYMRVT 541
Query: 545 LTFSSKGSGLT--TSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLT 602
T SS G+ ++LNLRIP WT+S GAK +LNG+ L +P+ GNFLS+ + W S D++T
Sbjct: 542 FTLSSSKVGVAKESTLNLRIPVWTNSIGAKVSLNGRPLNVPTSGNFLSIKQKWKSGDQVT 601
Query: 603 IQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASY 662
++LP+++RTEAI+DDRPEYAS+QAILYGPY+LAGH+ DW IT A + +WITPIP +
Sbjct: 602 MELPMSIRTEAIKDDRPEYASLQAILYGPYLLAGHTSMDWSITTQAKA-GNWITPIPETL 660
Query: 663 NSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLND 722
NS L+T +Q+ GN +VL+NSNQ+I M+ P+ GT A+ ATFRL+ +DS SS
Sbjct: 661 NSHLVTLSQQSGNISYVLSNSNQTIIMKVSPEPGTQDAVSATFRLVTDDSK-HPISSPEG 719
Query: 723 FIGKSVMLEPFDSPGMLVIQHETDDELVV-TDSFIAQGSSVFHLVAGLDGGDRTVSLESE 781
IG VMLEPFD PGM+V Q TD L V S +GSS F LV+GLDG +VSL E
Sbjct: 720 LIGSLVMLEPFDFPGMIVKQ-ATDSSLTVQASSPSDKGSSSFRLVSGLDGKPGSVSLSLE 778
Query: 782 TYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRN 841
+ KGCFVY+ L+ +L C S +T+ F AASF ++ G+++Y+P+SFV G RN
Sbjct: 779 SKKGCFVYSDQTLKQGTKLRLECGSAATDEKFKQAASFSLKTGMNQYNPMSFVMSGTQRN 838
Query: 842 FLLAPLLSLRDESYTVYFDFQS 863
F+L+PL SLRDE+Y VYF Q+
Sbjct: 839 FVLSPLFSLRDETYNVYFSVQA 860
>gi|297807305|ref|XP_002871536.1| hypothetical protein ARALYDRAFT_909245 [Arabidopsis lyrata subsp.
lyrata]
gi|297317373|gb|EFH47795.1| hypothetical protein ARALYDRAFT_909245 [Arabidopsis lyrata subsp.
lyrata]
Length = 862
Score = 1069 bits (2765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 524/866 (60%), Positives = 651/866 (75%), Gaps = 30/866 (3%)
Query: 14 LLTFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWL 73
LL + V AKECTN +L+SHTFRS LL SKNE+ ++ SH HLTP+DD+AW
Sbjct: 11 LLLYTSFVLVCVAKECTNTPTQLSSHTFRSELLQSKNETLKTELFSHY-HLTPTDDAAWS 69
Query: 74 SLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWR 133
+L+PRK+L+EE + F+W MLYR FK SG FLKEVSLHDVRL +S H R
Sbjct: 70 TLLPRKMLKEEADE--FAWTMLYRT------FKDSNSSGNFLKEVSLHDVRLDPNSFHGR 121
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
AQQTNLEYLLMLDVD L W+FRK A L APG+ YGGWE+P ELRGHFVGHYLSA+A MW
Sbjct: 122 AQQTNLEYLLMLDVDGLAWSFRKEAGLDAPGDHYGGWEKPDSELRGHFVGHYLSATAYMW 181
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILA 253
ASTHN++LKEKMSA+VSALS CQ++ G+GYLSAFP+ FDR EA+ PVWAPYYTIHKI+A
Sbjct: 182 ASTHNDTLKEKMSALVSALSECQQKSGTGYLSAFPSSFFDRFEAITPVWAPYYTIHKIIA 241
Query: 254 GLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLF 313
GL+DQY A N++AL+M T M +YFY RV+NVI+KYS+ERHWQ+LNEE GGMND+LY+L+
Sbjct: 242 GLVDQYKLAGNSQALQMATGMADYFYGRVRNVIRKYSVERHWQSLNEETGGMNDILYQLY 301
Query: 314 CITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE 373
IT D K+L+LAHLFDKPCFLG+LA+QADDISGFHSNTHIPIV+GSQ RYE+TGD LHKE
Sbjct: 302 SITGDSKYLLLAHLFDKPCFLGVLAIQADDISGFHSNTHIPIVVGSQQRYEITGDPLHKE 361
Query: 374 -----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFR 422
H + GT++ F +PKR+A+ L + EESCTTYNMLKVSR+LFR
Sbjct: 362 ISIFFMDIVNASHSYATGGTSVSE--FWQNPKRMATTLQTENEESCTTYNMLKVSRNLFR 419
Query: 423 WTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCC 482
WTKE++YADYYER+LTNGVLGIQRGT+PG+MIY+LPL G SK +YH WGTP DSFWCC
Sbjct: 420 WTKEVSYADYYERALTNGVLGIQRGTQPGLMIYMLPLGKGVSKAVTYHGWGTPYDSFWCC 479
Query: 483 YGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLR 542
YGTGIESFSKLGDSIYF+E+ P +Y+ QYISS LDWKS + ++QKV+PVVSWDPY+R
Sbjct: 480 YGTGIESFSKLGDSIYFQEDDVSPALYVTQYISSSLDWKSAGLSLSQKVNPVVSWDPYMR 539
Query: 543 VTLTFSSKGSGLT--TSLNLRIPTWTSSNGAKATLNGQDLPLPS--PGNFLSVTKTWSSD 598
VT +FSS G+ ++LNLRIP WT+S GAK +LNGQ L +P+ NFLS+ + W S
Sbjct: 540 VTFSFSSSKGGMAKESTLNLRIPVWTNSVGAKISLNGQSLKVPNFRTRNFLSIKQNWKSG 599
Query: 599 DKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPI 658
D+LT++LPL++RTEAI+DDR EY+S+QAILYGPY+LAGH+ DW IT A + WITPI
Sbjct: 600 DQLTMELPLSIRTEAIKDDRQEYSSLQAILYGPYLLAGHTSRDWSITTQAKA-GKWITPI 658
Query: 659 PASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFS 718
P + NS L+T +Q+ G+ +V +NSNQ+ITM P+ GT A+ ATFRL+ D+S S
Sbjct: 659 PETQNSYLVTLSQQSGDISYVFSNSNQTITMRVSPEPGTQDAVAATFRLV-TDNSKPRIS 717
Query: 719 SLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIA-QGSSVFHLVAGLDGGDRTVS 777
IG V LEPFD PGM+V Q TD L V S + +G+S F LV+G+DG +VS
Sbjct: 718 GPEALIGSLVKLEPFDFPGMIVKQ-ATDSSLTVQASSPSDKGASSFRLVSGVDGKPGSVS 776
Query: 778 LESETYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKG 837
L E+ KGCFVY+ L+ +L C S +T+ F AASF ++ G+++Y+P+SFV G
Sbjct: 777 LRLESKKGCFVYSDQTLKQGTKLRLECGSAATDEKFKEAASFKLKTGMNQYNPMSFVMSG 836
Query: 838 ANRNFLLAPLLSLRDESYTVYFDFQS 863
RNF+L+PL SLRDE+Y VYF Q+
Sbjct: 837 TQRNFVLSPLFSLRDETYNVYFSVQT 862
>gi|297811349|ref|XP_002873558.1| hypothetical protein ARALYDRAFT_488069 [Arabidopsis lyrata subsp.
lyrata]
gi|297319395|gb|EFH49817.1| hypothetical protein ARALYDRAFT_488069 [Arabidopsis lyrata subsp.
lyrata]
Length = 860
Score = 1066 bits (2758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 520/862 (60%), Positives = 650/862 (75%), Gaps = 31/862 (3%)
Query: 16 TFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWLSL 75
+FLL+ A KECT+ +L+SHT S LL S N++ ++ SH HLTP+DD+AW +L
Sbjct: 16 SFLLVCVA---KECTDIPTKLSSHTLNSELLQSHNKTLKTELFSHY-HLTPTDDAAWSTL 71
Query: 76 MPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQ 135
+PRK+L+EE + F+W MLYRK FK G FLK+VSLHDVRL +S HWRAQ
Sbjct: 72 LPRKMLKEETDE--FAWTMLYRK------FKDSNSVGNFLKDVSLHDVRLDPNSFHWRAQ 123
Query: 136 QTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWAS 195
QTNLEYLLMLDVD L ++FRK A L A G PYGGWE+P ELRGHFVGHYLSA+A MWAS
Sbjct: 124 QTNLEYLLMLDVDGLAYSFRKVAGLDASGVPYGGWEKPDSELRGHFVGHYLSATAHMWAS 183
Query: 196 THNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGL 255
THN++LK KMSA+VSAL+ CQ++ G+GYLSAFP+ FDR EA+ VWAPYYTIHKILAGL
Sbjct: 184 THNDTLKAKMSALVSALAECQQKSGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAGL 243
Query: 256 LDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCI 315
+DQY A N +AL+M T M +YFY RV+NVI KYS+ERH+Q+LNEE GGMNDVLY+L+ I
Sbjct: 244 VDQYKLAGNIQALKMATGMADYFYGRVRNVITKYSVERHYQSLNEETGGMNDVLYQLYSI 303
Query: 316 TQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-- 373
T+D K+L LAHLFDKPCFLG+LA+QADDISGFH+NTHIPIV+GSQ RYE+TGD LHKE
Sbjct: 304 TRDSKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEIS 363
Query: 374 ---------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWT 424
H + GT++ F DPKR+A+ L + EESCTTYNMLKVSR+LFRWT
Sbjct: 364 MFFMDIINASHSYATGGTSVRE--FWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWT 421
Query: 425 KEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYG 484
KE++YADYYER+LTNGVLGIQRGT+PG MIY+LPL G SK +YH WGTP DSFWCCYG
Sbjct: 422 KEVSYADYYERALTNGVLGIQRGTQPGRMIYMLPLGQGVSKAVTYHGWGTPYDSFWCCYG 481
Query: 485 TGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVT 544
TGIESFSKLGDSIYF+E+G P +Y+ QYISS LDWKS ++++QKV+PVVSWDPY+RVT
Sbjct: 482 TGIESFSKLGDSIYFQEDGASPALYVTQYISSSLDWKSAGLLLSQKVNPVVSWDPYMRVT 541
Query: 545 LTFSSKGSGLT--TSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLT 602
T SS G+ ++LNLRIP WT+S GAK +LNG+ L +P+ GNFLS+ + W S D++T
Sbjct: 542 FTLSSSKVGVAKKSTLNLRIPVWTNSIGAKVSLNGKPLKVPTSGNFLSIKQNWKSGDQVT 601
Query: 603 IQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASY 662
++LP+++RTEAI+DDRPEYAS+QAILYGPY+LAGH+ DW IT A + +WITPIP +Y
Sbjct: 602 MELPMSIRTEAIKDDRPEYASLQAILYGPYLLAGHTSRDWSITTQAKA-GNWITPIPETY 660
Query: 663 NSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLND 722
NS L+T +Q+ GN +VL+N+NQ+ITM P+ GT A+ ATFRL+ D+S + S L
Sbjct: 661 NSHLVTLSQQSGNISYVLSNTNQTITMRVSPELGTQDAVAATFRLV-TDNSKPQISGLEA 719
Query: 723 FIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIA-QGSSVFHLVAGLDGGDRTVSLESE 781
IG VMLEPFD PGM+V Q TD L V S + +G+S F LV+G+DG +VSL E
Sbjct: 720 LIGSLVMLEPFDFPGMIVKQ-TTDSSLTVQASSPSDKGASSFRLVSGVDGKPGSVSLRLE 778
Query: 782 TYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRN 841
+ GCFVY+ L+ KL C +T+ F AASF + G+++Y+P+SFV G RN
Sbjct: 779 SNNGCFVYSDQTLKQGTKLKLECGPVATDEKFKQAASFKLNIGMNQYNPMSFVMSGTQRN 838
Query: 842 FLLAPLLSLRDESYTVYFDFQS 863
F+L+PL SLRDE+Y VYF Q+
Sbjct: 839 FVLSPLFSLRDETYNVYFSVQT 860
>gi|297746368|emb|CBI16424.3| unnamed protein product [Vitis vinifera]
Length = 741
Score = 1062 bits (2747), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 508/735 (69%), Positives = 595/735 (80%), Gaps = 17/735 (2%)
Query: 144 MLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKE 203
MLD D+LVW+FR+TA LP P PYGGWE P ELRGHFVGHYLSASA MWASTHNESLKE
Sbjct: 1 MLDADRLVWSFRRTAGLPTPCSPYGGWESPDGELRGHFVGHYLSASAQMWASTHNESLKE 60
Query: 204 KMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYAD 263
KMSAVV AL CQK++G+GYLSAFP+E FDR EAL VWAPYYTIHKILAGLLDQYT
Sbjct: 61 KMSAVVCALGECQKKMGTGYLSAFPSELFDRFEALEEVWAPYYTIHKILAGLLDQYTLGG 120
Query: 264 NAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLM 323
NA+AL+M TWMVEYFYNRVQNVI YSIERHW +LNEE GGMND LY L+ IT D KH +
Sbjct: 121 NAQALKMVTWMVEYFYNRVQNVISSYSIERHWLSLNEETGGMNDFLYNLYRITGDQKHFV 180
Query: 324 LAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK----------- 372
LAHLFDKPCFLGLLA+QADDISGFH+NTHIPIV+G+QMRYE+TGD L+K
Sbjct: 181 LAHLFDKPCFLGLLAMQADDISGFHANTHIPIVVGAQMRYEITGDPLYKTIGAFFIDTVN 240
Query: 373 EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADY 432
H + GT++ F SDPKR+A+ L + ESCTTYNMLKVSR+LFRWTKE+AYADY
Sbjct: 241 SSHSYATGGTSVDE--FWSDPKRMATTLQTENAESCTTYNMLKVSRNLFRWTKEVAYADY 298
Query: 433 YERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSK 492
YER+LTNG+L IQRGT+PGVM+Y+LPL G+SK RSYH WGT SFWCCYGTGIESFSK
Sbjct: 299 YERALTNGILSIQRGTDPGVMLYMLPLGHGNSKARSYHGWGTKFHSFWCCYGTGIESFSK 358
Query: 493 LGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSK-- 550
LGDSIYFEEEG+ PG+YIIQYISS LDWKSGQ+V+NQKVD VVSWDPYLR+TLTFS K
Sbjct: 359 LGDSIYFEEEGEVPGLYIIQYISSSLDWKSGQVVLNQKVDTVVSWDPYLRITLTFSPKKM 418
Query: 551 -GSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTL 609
G+G ++++NLRIP W S+GAKA +N Q LP+P+P +FLS + WS DDKLT+QLP+ L
Sbjct: 419 QGAGQSSAINLRIPVWAYSSGAKAAVNAQALPVPAPNSFLSFRRKWSPDDKLTLQLPIAL 478
Query: 610 RTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI-TESATSLSDWITPIPASYNSQLIT 668
RTEAI+DDRP+YA +QAILYGPY+L G + DWDI T+ A SLSDWITPIPAS+NS LI+
Sbjct: 479 RTEAIKDDRPKYACLQAILYGPYLLVGLTNNDWDIQTDLAASLSDWITPIPASHNSHLIS 538
Query: 669 FTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFIGKSV 728
+QE GN+ F TNSNQS+TME++P+SGTDA+L+ATFRLIL DS+ S+ SS D IGK V
Sbjct: 539 LSQESGNSSFAFTNSNQSLTMERYPESGTDASLNATFRLILEDSTSSKISSPKDAIGKFV 598
Query: 729 MLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYKGCFV 788
MLEP + PGM V+Q T++ L +T+S GSS+FHLVAGLDG D TVSLES+T KGCFV
Sbjct: 599 MLEPINFPGMAVVQRGTNESLGITNSASVVGSSLFHLVAGLDGKDGTVSLESKTQKGCFV 658
Query: 789 YTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLL 848
Y+ VN S + KL C S++ FN A SF ++ G+SEYHPISFVAKG R++LLAPLL
Sbjct: 659 YSDVNYDSGSAIKLKCKLASSDVVFNQATSFTLKHGISEYHPISFVAKGLRRDYLLAPLL 718
Query: 849 SLRDESYTVYFDFQS 863
SLRDESYTVYF+ Q+
Sbjct: 719 SLRDESYTVYFNIQA 733
>gi|297746357|emb|CBI16413.3| unnamed protein product [Vitis vinifera]
Length = 767
Score = 1045 bits (2701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 509/735 (69%), Positives = 599/735 (81%), Gaps = 31/735 (4%)
Query: 13 FLLTFLLIVSAA-------QAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLT 65
F+L+ +LIV A KECTN +L+SH+FR LL+S NES+ ++ H HL
Sbjct: 4 FVLSEVLIVVFAFVLCGCVLGKECTNVPTQLSSHSFRYELLASNNESWKAEMFQHY-HLI 62
Query: 66 PSDDSAWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRL 125
+DDSAW +L+PRK+LREE++ FSWAM+YR +KN + FLKE+SLHDVRL
Sbjct: 63 HTDDSAWSNLLPRKLLREEDE---FSWAMMYRNMKN-----YDGSNSNFLKEMSLHDVRL 114
Query: 126 GSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHY 185
SDS+H RAQQTNL+YLL+LDVD+LVW+FRKTA L PG PYGGWE P+ ELRGHFVGHY
Sbjct: 115 DSDSLHGRAQQTNLDYLLILDVDRLVWSFRKTAGLSTPGLPYGGWEAPNVELRGHFVGHY 174
Query: 186 LSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPY 245
+SASA MWASTHN++LKEKMSAVVSAL+ CQ+++G+GYLSAFP+E FDR EA+ PVWAPY
Sbjct: 175 MSASAQMWASTHNDTLKEKMSAVVSALATCQEKMGTGYLSAFPSELFDRFEAIKPVWAPY 234
Query: 246 YTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGM 305
YTIHKILAGLLDQYT+A N++AL+M TWMVE+FY RVQNVI YS+ERHW +LNEE GGM
Sbjct: 235 YTIHKILAGLLDQYTFAGNSQALKMMTWMVEHFYKRVQNVITMYSLERHWLSLNEETGGM 294
Query: 306 NDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEV 365
NDVLY+L+ IT D KHL+LAHLFDKPCFLGLLA+QAD ISGFH+NTHIP+VIGSQMRYEV
Sbjct: 295 NDVLYRLYSITGDQKHLVLAHLFDKPCFLGLLAVQADSISGFHANTHIPVVIGSQMRYEV 354
Query: 366 TGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNML 414
TGD L+K H + GT++G F SDPKRLAS L EESCTTYNML
Sbjct: 355 TGDPLYKAIGTFFMDIVNSSHSYATGGTSVGE--FWSDPKRLASTLQRENEESCTTYNML 412
Query: 415 KVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGT 474
KVSRHLFRWTKE+ YADYYER+LTNGVL IQRGT+PGVMIY+LPL G SK RSYH WGT
Sbjct: 413 KVSRHLFRWTKEVVYADYYERALTNGVLSIQRGTDPGVMIYMLPLGRGDSKARSYHGWGT 472
Query: 475 PSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPV 534
DSFWCCYGTGIESFSKLGDSIYFEEEGK P VYIIQYISS LDWKSGQIV+NQKVDPV
Sbjct: 473 KFDSFWCCYGTGIESFSKLGDSIYFEEEGKSPEVYIIQYISSSLDWKSGQIVLNQKVDPV 532
Query: 535 VSWDPYLRVTLTFSSK-GSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTK 593
VSWDPYLR TLTF+ K G+G ++++NLRIP W SS+GAKA++N QDLP+P+P +FLS+T+
Sbjct: 533 VSWDPYLRTTLTFTPKEGAGQSSTINLRIPVWASSSGAKASINAQDLPVPAPSSFLSLTR 592
Query: 594 TWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI-TESATSLS 652
WS DKLT+QLP+ LRTEAI+DDRP+YASIQAILYGPY+LAG + DWDI T SATSLS
Sbjct: 593 NWSPGDKLTLQLPIRLRTEAIKDDRPKYASIQAILYGPYLLAGLTSDDWDIKTGSATSLS 652
Query: 653 DWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDS 712
DWITPIPAS NS+L++ +QE GN+ FV +NSNQSITMEKFP+ GTDA+LHATFRL+L D+
Sbjct: 653 DWITPIPASDNSRLVSLSQESGNSSFVFSNSNQSITMEKFPEEGTDASLHATFRLVLKDA 712
Query: 713 SGSEFSSLNDFIGKS 727
+ + S D IGKS
Sbjct: 713 TSLKVLSPKDAIGKS 727
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 56/105 (53%), Gaps = 19/105 (18%)
Query: 774 RTVSLESETYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIE----------- 822
R VSL E+ FV++ N QS K E T+A + V++
Sbjct: 665 RLVSLSQESGNSSFVFSNSN-QSITMEKFP--EEGTDASLHATFRLVLKDATSLKVLSPK 721
Query: 823 -----KGLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFDFQ 862
G+S+YHPISFVAKG RNFLL PLL LRDESYTVYF+ Q
Sbjct: 722 DAIGKSGISQYHPISFVAKGMKRNFLLTPLLGLRDESYTVYFNIQ 766
>gi|357139358|ref|XP_003571249.1| PREDICTED: uncharacterized protein LOC100841742 [Brachypodium
distachyon]
Length = 883
Score = 1001 bits (2587), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 506/894 (56%), Positives = 631/894 (70%), Gaps = 52/894 (5%)
Query: 5 MCSIGFFKFLLTFLLIVSAAQAKECTNAYPEL--ASHTFRS--NLLSSKNE-------SY 53
+ + G LL ++ A+AK CTN +P ASHT R+ L ++++E
Sbjct: 3 LAAFGVVAVLLA-TAVLRGAEAKVCTNTFPASGSASHTERAAAQLRAAESEDAALRLPGL 61
Query: 54 IKQIHSHNDHLTPSDDSAWLSLMPRKILREEEQD------ELFSWAMLYRKIKNPGQFKV 107
+ H H HL P+D+SAW++LMPR++L E F W MLYRK++ G +
Sbjct: 62 VDHGHGHEQHLIPTDESAWMALMPRRLLAGGAGGNGAPPREAFDWLMLYRKLRGGGDGAI 121
Query: 108 ----PERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAP 163
+G FL E SLHDVRL +++W+AQQTNLEYLL+LD D+LVW+FR A LPA
Sbjct: 122 DGPAAAAAGPFLSEASLHDVRLQPGTVYWQAQQTNLEYLLLLDADRLVWSFRTQAGLPAT 181
Query: 164 GEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGY 223
G PYGGWE PS ELRGHFVGHYL+A+A MWASTHN++L+ KMS+V+ L CQK++G GY
Sbjct: 182 GTPYGGWEGPSVELRGHFVGHYLTAAAKMWASTHNDTLRTKMSSVIDTLYDCQKKMGMGY 241
Query: 224 LSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
LSAFPTE FDR EAL VWAPYYTIHKI+ GLLDQYT A +++AL M M +YF RV+
Sbjct: 242 LSAFPTEFFDRAEALTTVWAPYYTIHKIMQGLLDQYTVAGSSKALEMVVGMADYFSGRVK 301
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
NVI+KYSIERHW +LNEE GGMNDVLY+L+ IT D KHL LAHLFDKPCFLGLLA+QAD
Sbjct: 302 NVIQKYSIERHWASLNEETGGMNDVLYQLYAITNDLKHLTLAHLFDKPCFLGLLAVQADS 361
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSD 392
ISGFHSNTHIP+VIG+QMRYEVTGD L+K+ H + GT+ G F + D
Sbjct: 362 ISGFHSNTHIPVVIGAQMRYEVTGDVLYKQIASSFMDMINSSHSYATGGTSAGEFWY--D 419
Query: 393 PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV 452
PKRLA+ L + EESCTTYNMLKVSR+LFRWTKEI+YADYYER+L NGVL IQRGT+PGV
Sbjct: 420 PKRLAATLSTENEESCTTYNMLKVSRNLFRWTKEISYADYYERALINGVLSIQRGTDPGV 479
Query: 453 MIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
MIY+LP APG SK YH WGT DSFWCCYGTGIESFSKLGDSIYFEE+G P + IIQ
Sbjct: 480 MIYMLPQAPGRSKAVGYHGWGTLYDSFWCCYGTGIESFSKLGDSIYFEEKGHAPALNIIQ 539
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
YI S +WK+ + V Q+++ + S DPYLRV+L+ S+KG T LN+RIPTWTS+NG K
Sbjct: 540 YIPSTFNWKTAGLTVTQQLESLSSSDPYLRVSLSVSAKGQSAT--LNVRIPTWTSANGTK 597
Query: 573 ATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
ATL G+DL L +PG LS++K W+SD+ L++Q P++LRTEAI+DDRP+YAS+QAIL+GP+
Sbjct: 598 ATLTGKDLGLVTPGTLLSISKQWNSDEHLSLQFPISLRTEAIKDDRPQYASLQAILFGPF 657
Query: 633 VLAGHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKF 692
VLAG S GDWD ++++++SDWIT +P+SYNSQL+TFTQE FVL++SN S+TM++
Sbjct: 658 VLAGLSSGDWD-AKASSAVSDWITAVPSSYNSQLMTFTQESNGKTFVLSSSNGSLTMQER 716
Query: 693 PK-SGTDAALHATFRLILNDSSGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVV 751
P GTD A+HATFR+ DS+ + + G V +EPFD PG ++ + T
Sbjct: 717 PSIDGTDTAVHATFRVHSQDSTSQQGTYNAALKGTPVQIEPFDLPGTVITNNLT------ 770
Query: 752 TDSFIAQGSSV--FHLVAGLDGGDRTVSLESETYKGCFVYTAVNLQSSESTKLGCISEST 809
F AQ SS F +V GLDG +VSLE T GCF+ + + + ++ C S
Sbjct: 771 ---FSAQKSSASFFDIVPGLDGKPNSVSLELGTKSGCFMVSGADYSAGTKIQVSCKSSLQ 827
Query: 810 EAG--FNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFDF 861
G F AASFV L +YHPISFVAKG RNFLL PL SLRDE YTVYF+
Sbjct: 828 SIGGIFEQAASFVQATPLRQYHPISFVAKGVRRNFLLEPLYSLRDEFYTVYFNL 881
>gi|242060854|ref|XP_002451716.1| hypothetical protein SORBIDRAFT_04g006520 [Sorghum bicolor]
gi|241931547|gb|EES04692.1| hypothetical protein SORBIDRAFT_04g006520 [Sorghum bicolor]
Length = 888
Score = 977 bits (2525), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 505/872 (57%), Positives = 623/872 (71%), Gaps = 47/872 (5%)
Query: 23 AAQAKECTNAYPEL-ASHTFRS--NLLSSKNESYIKQI---------HSHNDHLTPSDDS 70
A+ K CTNA+P L +SHT R+ L + ++ + H H HLTP+D+S
Sbjct: 29 GAEGKSCTNAFPGLTSSHTERAAAQLQRGPPATALQPVVHRHGHDHDHGHEQHLTPTDES 88
Query: 71 AWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPER----SGEFLKEVSLHDVRLG 126
W+SLMPR+ LR EE F W MLYRK++ P R +G FL + SLHDVRL
Sbjct: 89 TWMSLMPRRALRREEA---FDWLMLYRKLRGATAGGAPRRPGVAAGTFLSDASLHDVRLE 145
Query: 127 SDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYL 186
S++WRAQQTNLEYLL+LDVD+LVW+FRK A L APG PYGGWE P ELRGHFVGHYL
Sbjct: 146 PGSLYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPDVELRGHFVGHYL 205
Query: 187 SASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYY 246
SA+A MWASTHN++L KMS+V+ ALS CQK++G+GYLSAFPTE FDR+EA+ PVWAPYY
Sbjct: 206 SATAKMWASTHNDTLNAKMSSVIDALSDCQKKMGTGYLSAFPTEFFDRVEAIKPVWAPYY 265
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN 306
TIHKI+ GLLDQYT A N++AL M M YF +RV+NVI+KYSIERHW++LNEE GGMN
Sbjct: 266 TIHKIMQGLLDQYTVAGNSKALDMVVNMANYFSDRVKNVIQKYSIERHWESLNEETGGMN 325
Query: 307 DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVT 366
DVLY+L+ IT D KHL LAHLFDKPCFLGLLA+QAD ISGFHSNTHIP+VIG+QMRYEVT
Sbjct: 326 DVLYQLYTITNDLKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRYEVT 385
Query: 367 GDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLK 415
GD L+K+ H + GT+ G F +DPK LA L + EESCTTYNMLK
Sbjct: 386 GDPLYKQIASFFMDTINSSHSYATGGTSAGE--FWTDPKHLAGTLSTENEESCTTYNMLK 443
Query: 416 VSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTP 475
+SR+LFRWTKEIAYADYYER+L NGVL IQRGT+PGVMIY+LP APG SK SYH WGT
Sbjct: 444 ISRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHSWGTK 503
Query: 476 SDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVV 535
DSFWCCYGTGIESFSKLGDSIYFEE+ P + IIQYI S DWK+ ++V QKV+ +
Sbjct: 504 YDSFWCCYGTGIESFSKLGDSIYFEEKEDLPALNIIQYIPSTYDWKAAGLIVTQKVNTLS 563
Query: 536 SWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTW 595
S D YL+++L+ S+K G T LN+RIP+WT ++GA ATLN +DL SPG+FLS+TK W
Sbjct: 564 SSDQYLQISLSISAKTKGQTAKLNVRIPSWTFADGAGATLNDKDLGSISPGSFLSITKQW 623
Query: 596 SSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE-SATSLSDW 654
+SDD L ++ P+ LRTEAI+DDRPEYAS+QA+L+GP+VLAG S GDWD + +++SDW
Sbjct: 624 NSDDHLALRFPIRLRTEAIKDDRPEYASLQAVLFGPFVLAGLSTGDWDAKAGNGSAISDW 683
Query: 655 ITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPK-SGTDAALHATFRLILNDSS 713
IT +P ++NSQL+TF+Q FVL+++N ++TM++ P+ GTD A+HATFR DS
Sbjct: 684 ITAVPPAHNSQLVTFSQVSNGKTFVLSSANGTLTMQERPEVDGTDTAIHATFRAHPQDS- 742
Query: 714 GSEFSSLNDFI--GKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDG 771
+E + I G S+++EPFD PG ++ + T TD +F+LV GLDG
Sbjct: 743 -TELHDIYRTIAKGASILIEPFDLPGTVITNNLTLSAQKSTD-------CLFNLVPGLDG 794
Query: 772 GDRTVSLESETYKGCFVYTAVNLQSSESTKLGCIS--ESTEAGFNNAASFVIEKGLSEYH 829
+VSLE T GCF+ T N + ++ C S ES AASF L +YH
Sbjct: 795 NPNSVSLELGTRPGCFLVTGTNYSAGTKIQVSCKSSLESIGGILEQAASFSQTDPLRQYH 854
Query: 830 PISFVAKGANRNFLLAPLLSLRDESYTVYFDF 861
PISFVAKG RNFLL PL SLRDE YTVYF+
Sbjct: 855 PISFVAKGMTRNFLLEPLYSLRDEFYTVYFNI 886
>gi|326495110|dbj|BAJ85651.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 868
Score = 974 bits (2517), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 491/859 (57%), Positives = 615/859 (71%), Gaps = 40/859 (4%)
Query: 27 KECTNAYPE---LASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWLSLMPRKILR- 82
K CTN +P +A+H R+ + H H HLTP+D+SAW+ LMPR+ L
Sbjct: 24 KVCTNTFPSSDSVATHAERAAAQLRLPAGH-GHGHDHEQHLTPTDESAWMELMPRRSLSG 82
Query: 83 ---EEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNL 139
E F W MLYR+++ G V +G FL E SLHDVRL +++W+AQQTNL
Sbjct: 83 GGGSTPPREAFDWLMLYRRLRG-GAAAVDGPAGPFLSEASLHDVRLQPGTIYWQAQQTNL 141
Query: 140 EYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNE 199
EYLL+LD D+LVW+FR A L A G PYGGWE P+ ELRGHFVGHYLSA+A MWASTHN+
Sbjct: 142 EYLLLLDTDRLVWSFRTQAGLTATGTPYGGWEGPNVELRGHFVGHYLSATAKMWASTHND 201
Query: 200 SLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQY 259
+L+ KMS+VV L CQK++G+GYLSAFP+E FDR EAL VWAPYYTIHK++ GLLDQY
Sbjct: 202 TLRAKMSSVVDVLYDCQKKMGTGYLSAFPSEFFDRAEALTTVWAPYYTIHKVMQGLLDQY 261
Query: 260 TYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDP 319
T A N++AL M M YF +RV+N+I+KYSIERHW +LNEE GGMNDVLY+L+ IT D
Sbjct: 262 TVAGNSKALEMVVGMANYFSDRVKNIIQKYSIERHWASLNEETGGMNDVLYQLYTITDDL 321
Query: 320 KHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE------ 373
KHL LAHLFDKPCFLGLLALQAD ISGFHSNTHIP+V+G+QMRYEVTGD L+K+
Sbjct: 322 KHLTLAHLFDKPCFLGLLALQADSISGFHSNTHIPVVVGAQMRYEVTGDVLYKQIATSFM 381
Query: 374 -----GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIA 428
H + GT+ G F SDPKRLA+ L + ESCTTYNMLKVSR+LFRWTKEIA
Sbjct: 382 DMINSSHSYATGGTSAGE--FWSDPKRLAATLSTENAESCTTYNMLKVSRNLFRWTKEIA 439
Query: 429 YADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIE 488
YADYYER+L NGVL IQRGT+PGVMIY+LP APG SK SYH WGT DSFWCCYGTGIE
Sbjct: 440 YADYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVSYHGWGTKYDSFWCCYGTGIE 499
Query: 489 SFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFS 548
SFSKLGDSIYFEE+G+ P + IIQYI S +WK+ + V Q+++P+ S D ++V+L+FS
Sbjct: 500 SFSKLGDSIYFEEKGETPALSIIQYIPSTFNWKTAGVTVTQQLEPLSSPDMNVQVSLSFS 559
Query: 549 SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLT 608
K +G + +LN+RIPTWTS++GAKATLN +DL +PG+ LSVTK W+S+D L++Q P+
Sbjct: 560 GK-NGQSATLNVRIPTWTSASGAKATLNDKDLGSVTPGSLLSVTKQWNSNDHLSLQFPIA 618
Query: 609 LRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLIT 668
LRTEAI+DDRPEYAS+QAIL+GP+VLAG S D D ++ +++SDWIT +P+S+NSQL+T
Sbjct: 619 LRTEAIKDDRPEYASLQAILFGPFVLAGLSSSDCD-AKTGSAVSDWITAVPSSHNSQLMT 677
Query: 669 FTQEYGNTKFVLTNSNQSITMEKFPK-SGTDAALHATFRLILNDSS---GSEFSSLNDFI 724
FTQE FVL++SN S+TM++ P GTD A+HATFR+ D++ G+ ++L D
Sbjct: 678 FTQESSGKTFVLSSSNGSLTMQERPTVDGTDTAIHATFRVHPQDTARLHGTYGATLQD-- 735
Query: 725 GKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYK 784
SV++EPFD PG + +T S S+F++V+GLDG +VSLE T
Sbjct: 736 -TSVLIEPFDMPGTAIAND-------LTLSTQKSTGSLFNIVSGLDGKPNSVSLELGTKP 787
Query: 785 GCFVYTAVNLQSSESTKLGCISESTEAG--FNNAASFVIEKGLSEYHPISFVAKGANRNF 842
GCF+ + + + ++ C S G F AASF L +YHPISFVAKG RNF
Sbjct: 788 GCFLVSGADYSAGTKIQVSCKSSIQSIGGIFEQAASFAQAAPLRQYHPISFVAKGVQRNF 847
Query: 843 LLAPLLSLRDESYTVYFDF 861
LL PL SLRDE YT YF+
Sbjct: 848 LLEPLYSLRDEFYTAYFNL 866
>gi|226497412|ref|NP_001145969.1| uncharacterized protein LOC100279496 precursor [Zea mays]
gi|223945575|gb|ACN26871.1| unknown [Zea mays]
Length = 879
Score = 968 bits (2502), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 498/865 (57%), Positives = 615/865 (71%), Gaps = 38/865 (4%)
Query: 23 AAQAKECTNAYPELASHTFRS--NLLSSKNESYIKQI-----HSHNDHLTPSDDSAWLSL 75
A+ K CTNA+P L SHT R+ L + ++ I H HLTP+D+S W+SL
Sbjct: 27 GAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQHLTPTDESTWMSL 86
Query: 76 MPRKILREEEQDELFSWAMLYRKIKNPGQFKVPE-RSGEFLKEVSLHDVRLGSDSMHWRA 134
MPR+ LR EE F W MLYR+++ G P +G FL E SLHDVRL SM+WRA
Sbjct: 87 MPRRALRREEA---FDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDVRLEPGSMYWRA 143
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWA 194
QQTNLEYLL+LDVD+LVW+FRK A L APG PYGGWE P +LRGHFVGHYLSA+A MWA
Sbjct: 144 QQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVGHYLSATAKMWA 203
Query: 195 STHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAG 254
STHN++L KMS+VV AL CQK++G+GYLSAFP++ FD LEA+ VWAPYYTIHKI+ G
Sbjct: 204 STHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWAPYYTIHKIMQG 263
Query: 255 LLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFC 314
LLDQYT A N+ AL M M YF +RV+NVI+ YSIERHW++LNEE GGMNDVLY+L+
Sbjct: 264 LLDQYTVAGNSMALDMVIKMANYFSDRVKNVIQNYSIERHWESLNEETGGMNDVLYQLYT 323
Query: 315 ITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE- 373
IT D KHL LAHLFDKPCFLGLLA+QAD ISGFHSNTHIP+VIG+QMRYEVTGD L+K+
Sbjct: 324 ITHDMKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRYEVTGDPLYKQI 383
Query: 374 ----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRW 423
H + GT+ G F +DPKRLA L + EESCTTYNMLKVSR+LFRW
Sbjct: 384 ASFFMDTINSSHSYATGGTSAGE--FWTDPKRLAGTLSTENEESCTTYNMLKVSRNLFRW 441
Query: 424 TKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCY 483
TKEIAYADYYER+L NGVL IQRGT+PGVMIY+LP APG SK SYH WGT DSFWCCY
Sbjct: 442 TKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHGWGTKYDSFWCCY 501
Query: 484 GTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRV 543
GTGIESFSKLGDSIYFEE+G P + IIQYI S +WK+ + V Q++ + S D YL++
Sbjct: 502 GTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQQIKTLSSSDQYLQI 561
Query: 544 TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTI 603
+ + S+ SG T ++N RIP+WT ++GA ATLNG+DL SPG+FLS+TK W+SDD L +
Sbjct: 562 SFSISANTSGQTANINFRIPSWTFADGAGATLNGKDLGSISPGSFLSITKQWNSDDHLAL 621
Query: 604 QLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE-SATSLSDWITPIPASY 662
P+ LRTEAI+DDR EYAS+QA+L+GP+VLAG S GDWD + +++SDWI +P ++
Sbjct: 622 HFPIRLRTEAIKDDRLEYASLQAVLFGPFVLAGLSTGDWDAKAGNGSAISDWIAAVPPAH 681
Query: 663 NSQLITFTQEYGNTKFVLTNSNQSITMEKFPK-SGTDAALHATFRLILNDSSGSEFSSL- 720
NSQL+TFTQ FVL+++N ++TM++ P+ GTDAA+HATFR + S +E +
Sbjct: 682 NSQLVTFTQVSNGKAFVLSSANGTLTMQERPEVDGTDAAIHATFRAHPQEDS-TELHDIY 740
Query: 721 -NDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLE 779
G S++LEPFD PG ++ + T +D S+F++V GLDG +VSLE
Sbjct: 741 STTLTGTSILLEPFDLPGTVITNNLTLSAQKSSD-------SLFNIVPGLDGNPNSVSLE 793
Query: 780 SETYKGCFVYTAVNLQSSESTKLGCIS--ESTEAGFNNAASFVIEKGLSEYHPISFVAKG 837
T GCF+ T N + ++ C S ES AASF L +YHPISFVAKG
Sbjct: 794 LGTKPGCFLVTGTNYSAGTRIEVNCKSSLESIGGILEQAASFSQTDPLRQYHPISFVAKG 853
Query: 838 ANRNFLLAPLLSLRDESYTVYFDFQ 862
RNFLL PL SLRDE YTVYF+ +
Sbjct: 854 VARNFLLEPLYSLRDEFYTVYFNVR 878
>gi|219885159|gb|ACL52954.1| unknown [Zea mays]
Length = 879
Score = 967 bits (2501), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 498/865 (57%), Positives = 615/865 (71%), Gaps = 38/865 (4%)
Query: 23 AAQAKECTNAYPELASHTFRS--NLLSSKNESYIKQI-----HSHNDHLTPSDDSAWLSL 75
A+ K CTNA+P L SHT R+ L + ++ I H HLTP+D+S W+SL
Sbjct: 27 GAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQHLTPTDESTWMSL 86
Query: 76 MPRKILREEEQDELFSWAMLYRKIKNPGQFKVPE-RSGEFLKEVSLHDVRLGSDSMHWRA 134
MPR+ LR EE F W MLYR+++ G P +G FL E SLHDVRL SM+WRA
Sbjct: 87 MPRRALRREEA---FDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDVRLEPGSMYWRA 143
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWA 194
QQTNLEYLL+LDVD+LVW+FRK A L APG PYGGWE P +LRGHFVGHYLSA+A MWA
Sbjct: 144 QQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVGHYLSATAKMWA 203
Query: 195 STHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAG 254
STHN++L KMS+VV AL CQK++G+GYLSAFP++ FD LEA+ VWAPYYTIHKI+ G
Sbjct: 204 STHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWAPYYTIHKIMQG 263
Query: 255 LLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFC 314
LLDQYT A N+ AL M M YF +RV+NVI+ YSIERHW++LNEE GGMNDVLY+L+
Sbjct: 264 LLDQYTVAGNSMALDMVIKMANYFSDRVKNVIQNYSIERHWESLNEETGGMNDVLYQLYT 323
Query: 315 ITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE- 373
IT D KHL LAHLFDKPCFLGLLA+QAD ISGFHSNTHIP+VIG+QMRYEVTGD L+K+
Sbjct: 324 ITHDMKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRYEVTGDPLYKQI 383
Query: 374 ----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRW 423
H + GT+ G F +DPKRLA L + EESCTTYNMLKVSR+LFRW
Sbjct: 384 ASFFMDTINSSHSYATGGTSAGE--FWTDPKRLAGTLSTENEESCTTYNMLKVSRNLFRW 441
Query: 424 TKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCY 483
TKEIAYADYYER+L NGVL IQRGT+PGVMIY+LP APG SK SYH WGT DSFWCCY
Sbjct: 442 TKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHGWGTKYDSFWCCY 501
Query: 484 GTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRV 543
GTGIESFSKLGDSIYFEE+G P + IIQYI S +WK+ + V Q++ + S D YL++
Sbjct: 502 GTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQQIKTLSSSDQYLQI 561
Query: 544 TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTI 603
+ + S+ SG T ++N RIP+WT ++GA ATLNG+DL SPG+FLS+TK W+SDD L +
Sbjct: 562 SFSISANTSGQTANINFRIPSWTFADGAGATLNGKDLGSISPGSFLSITKQWNSDDHLAL 621
Query: 604 QLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE-SATSLSDWITPIPASY 662
P+ LRTEAI+DDR EYAS+QA+L+GP+VLAG S GDWD + +++SDWI +P ++
Sbjct: 622 HFPIRLRTEAIKDDRLEYASLQAVLFGPFVLAGLSTGDWDAKAGNGSAISDWIAAVPPAH 681
Query: 663 NSQLITFTQEYGNTKFVLTNSNQSITMEKFPK-SGTDAALHATFRLILNDSSGSEFSSL- 720
NSQL+TFTQ FVL+++N ++TM++ P+ GTDAA+HATFR + S +E +
Sbjct: 682 NSQLVTFTQVSNGKAFVLSSANGTLTMQERPEVDGTDAAVHATFRAHPQEDS-TELHDIY 740
Query: 721 -NDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLE 779
G S++LEPFD PG ++ + T +D S+F++V GLDG +VSLE
Sbjct: 741 STTLTGTSILLEPFDLPGTVITNNLTLSAQKSSD-------SLFNIVPGLDGNPNSVSLE 793
Query: 780 SETYKGCFVYTAVNLQSSESTKLGCIS--ESTEAGFNNAASFVIEKGLSEYHPISFVAKG 837
T GCF+ T N + ++ C S ES AASF L +YHPISFVAKG
Sbjct: 794 LGTKPGCFLVTGTNYSAGTRIEVNCKSSLESIGGILEQAASFSQTDPLRQYHPISFVAKG 853
Query: 838 ANRNFLLAPLLSLRDESYTVYFDFQ 862
RNFLL PL SLRDE YTVYF+ +
Sbjct: 854 VARNFLLEPLYSLRDEFYTVYFNVR 878
>gi|125538467|gb|EAY84862.1| hypothetical protein OsI_06226 [Oryza sativa Indica Group]
Length = 891
Score = 967 bits (2499), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 499/874 (57%), Positives = 622/874 (71%), Gaps = 47/874 (5%)
Query: 26 AKECTNAYPEL-ASHTFRSNL---LSSKNESYIKQI--------------HSHNDHLTPS 67
K+CTN +P L ASHT R+ L E ++ H + HLTP+
Sbjct: 25 GKDCTNGFPGLTASHTERAAAAAELRPDGEVEAARVLDLLLPHGHGHGDDHDGDRHLTPT 84
Query: 68 DDSAWLSLMPRKILRE---EEQDELFSWAMLYRKIKNPGQFKVPERSGE--FLKEVSLHD 122
D+S W+SLMPR++L + + F W MLYR ++ G + L E SLHD
Sbjct: 85 DESTWMSLMPRRLLASPASSPRRDAFDWLMLYRNLRGSGSGAGAIAASGGALLAEASLHD 144
Query: 123 VRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFV 182
VRL +++W+AQQTNLEYLL+LDVD+LVW+FR A LPA G PYGGWE P ELRGHFV
Sbjct: 145 VRLQPGTVYWQAQQTNLEYLLLLDVDRLVWSFRTQAGLPASGAPYGGWEGPGVELRGHFV 204
Query: 183 GHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVW 242
GHYLSA+A MWASTHN++L+ KMS+VV AL CQK++GSGYLSAFP+E FDR+E++ VW
Sbjct: 205 GHYLSATAKMWASTHNDTLQAKMSSVVDALHDCQKKMGSGYLSAFPSEFFDRVESIKAVW 264
Query: 243 APYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
APYYTIHKI+ GLLDQYT A N++AL + M YF +RV+NVI+KYSIERHW +LNEE+
Sbjct: 265 APYYTIHKIMQGLLDQYTVAGNSKALDLVVGMANYFSDRVKNVIQKYSIERHWASLNEES 324
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMR 362
GGMNDVLY+L+ IT D KHL LAHLFDKPCFLGLLA+QAD ISGFHSNTHIP+VIG+QMR
Sbjct: 325 GGMNDVLYQLYTITNDQKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMR 384
Query: 363 YEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTY 411
YEVTGD L+K+ H + GT+ G F ++PKRLA L + EESCTTY
Sbjct: 385 YEVTGDLLYKQIATFFMDTINSSHSYATGGTSAGE--FWTNPKRLADTLSTENEESCTTY 442
Query: 412 NMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHH 471
NMLKVSR+LFRWTKE++YADYYER+L NGVL IQRGT+PGVMIY+LP APG SK SYH
Sbjct: 443 NMLKVSRNLFRWTKELSYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVSYHG 502
Query: 472 WGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKV 531
WGT DSFWCCYGTGIESFSKLGDSIYFEE+G P + IIQYI S +WK+ + VNQ++
Sbjct: 503 WGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDRPVLNIIQYIPSAYNWKAAGLTVNQQL 562
Query: 532 DPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSV 591
P+ S D +L+V+L+ S+K +G + +LN+RIP+WTS+NGAKATLN DL L SPG+FLS+
Sbjct: 563 KPISSLDMFLQVSLSTSAKTNGQSATLNVRIPSWTSANGAKATLNDNDLGLMSPGSFLSI 622
Query: 592 TKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATS- 650
+K W+SDD L++Q P+TLRTEAI+DDRPEYAS+QAIL+GP+VLAG S GDW+ TS
Sbjct: 623 SKQWNSDDHLSLQFPITLRTEAIKDDRPEYASLQAILFGPFVLAGLSTGDWNAEAGNTSA 682
Query: 651 LSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPK-SGTDAALHATFRLIL 709
+SDWI+P+P+SYNSQL+TFTQE FVL+++N S+ M++ P GTD A+HATFR+
Sbjct: 683 ISDWISPVPSSYNSQLVTFTQESSGKTFVLSSANGSLAMQERPTVDGTDTAIHATFRVHP 742
Query: 710 NDSSGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGL 769
DS+G + G SV +EPFD PG ++ + +T S S+F++V GL
Sbjct: 743 QDSAGQLDTQGATLKGTSVQIEPFDLPGTVITNN-------LTQSAQKSSDSLFNIVPGL 795
Query: 770 DGGDRTVSLESETYKGCFVYTAVNLQSSESTKLGCISE--STEAGFNNAASFVIEKGLSE 827
DG +VSLE T GCF+ T V+ ++ C S S F A SFV L +
Sbjct: 796 DGNPNSVSLELGTKPGCFLVTGVDYSVGTKIQVSCKSSLPSINGIFEQATSFVQAAPLRQ 855
Query: 828 YHPISFVAKGANRNFLLAPLLSLRDESYTVYFDF 861
YHPISF+AKG RNFLL PL SLRDE YTVYF+
Sbjct: 856 YHPISFIAKGVKRNFLLEPLYSLRDEFYTVYFNL 889
>gi|115444811|ref|NP_001046185.1| Os02g0195500 [Oryza sativa Japonica Group]
gi|49388119|dbj|BAD25250.1| unknown protein [Oryza sativa Japonica Group]
gi|113535716|dbj|BAF08099.1| Os02g0195500 [Oryza sativa Japonica Group]
gi|125581152|gb|EAZ22083.1| hypothetical protein OsJ_05746 [Oryza sativa Japonica Group]
Length = 891
Score = 966 bits (2496), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 498/874 (56%), Positives = 620/874 (70%), Gaps = 47/874 (5%)
Query: 26 AKECTNAYPEL-ASHTFRSNLLSSKN-----------------ESYIKQIHSHNDHLTPS 67
K+CTN +P L ASHT R+ + + H + HLTP+
Sbjct: 25 GKDCTNGFPGLTASHTERAAAAAEQRPDGEVEAARVLDLLLPHGHGHGDDHDGDRHLTPT 84
Query: 68 DDSAWLSLMPRKILRE---EEQDELFSWAMLYRKIKNPGQFKVPERSGE--FLKEVSLHD 122
D+S W+SLMPR++L + + F W MLYR ++ G + L E SLHD
Sbjct: 85 DESTWMSLMPRRLLASPVSSPRRDAFDWLMLYRNLRGSGSGAGAIAASGGALLAEASLHD 144
Query: 123 VRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFV 182
VRL +++W+AQQTNLEYLL+LDVD+LVW+FR A LPA G PYGGWE P ELRGHFV
Sbjct: 145 VRLQPGTVYWQAQQTNLEYLLLLDVDRLVWSFRTQAGLPASGAPYGGWEGPGVELRGHFV 204
Query: 183 GHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVW 242
GHYLSA+A MWASTHN++L KMS+VV AL CQK++GSGYLSAFP+E FDR+E++ VW
Sbjct: 205 GHYLSATAKMWASTHNDTLLAKMSSVVDALHDCQKKMGSGYLSAFPSEFFDRVESIKAVW 264
Query: 243 APYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
APYYTIHKI+ GLLDQYT A N++AL + M YF +RV+NVI+KYSIERHW +LNEE+
Sbjct: 265 APYYTIHKIMQGLLDQYTVAGNSKALDLVVGMANYFSDRVKNVIQKYSIERHWASLNEES 324
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMR 362
GGMNDVLY+L+ IT D KHL LAHLFDKPCFLGLLA+QAD ISGFHSNTHIP+VIG+QMR
Sbjct: 325 GGMNDVLYQLYTITNDQKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMR 384
Query: 363 YEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTY 411
YEVTGD L+K+ H + GT+ G F ++PKRLA L + EESCTTY
Sbjct: 385 YEVTGDLLYKQIATFFMDTINSSHSYATGGTSAGE--FWTNPKRLADTLSTENEESCTTY 442
Query: 412 NMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHH 471
NMLKVSR+LFRWTKE++YADYYER+L NGVL IQRGT+PGVMIY+LP APG SK SYH
Sbjct: 443 NMLKVSRNLFRWTKELSYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVSYHG 502
Query: 472 WGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKV 531
WGT DSFWCCYGTGIESFSKLGDSIYFEE+G P + IIQYI S +WK+ + VNQ++
Sbjct: 503 WGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDRPVLNIIQYIPSAYNWKAAGLTVNQQL 562
Query: 532 DPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSV 591
P+ S D +L+V+L+ S+K +G + +LN+RIP+WTS+NGAKATLN DL L SPG+FLS+
Sbjct: 563 KPISSLDMFLQVSLSTSAKTNGQSATLNVRIPSWTSANGAKATLNDNDLGLMSPGSFLSI 622
Query: 592 TKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATS- 650
+K W+SDD L++Q P+TLRTEAI+DDRPEYAS+QAIL+GP+VLAG S GDW+ TS
Sbjct: 623 SKQWNSDDHLSLQFPITLRTEAIKDDRPEYASLQAILFGPFVLAGLSTGDWNAEAGNTSA 682
Query: 651 LSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPK-SGTDAALHATFRLIL 709
+SDWI+P+P+SYNSQL+TFTQE FVL+++N S+TM++ P GTD A+HATFR+
Sbjct: 683 ISDWISPVPSSYNSQLVTFTQESSGKTFVLSSANGSLTMQERPTVDGTDTAIHATFRVHP 742
Query: 710 NDSSGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGL 769
DS+G + G SV +EPFD PG ++ + +T S S+F++V GL
Sbjct: 743 QDSAGQLDTQGATLKGTSVQIEPFDLPGTVITNN-------LTQSAQKSSDSLFNIVPGL 795
Query: 770 DGGDRTVSLESETYKGCFVYTAVNLQSSESTKLGCISE--STEAGFNNAASFVIEKGLSE 827
DG +VSLE T GCF+ V+ ++ C S S F AASFV L +
Sbjct: 796 DGNPNSVSLELGTKPGCFLVIGVDYSVGTKIQVSCKSSLPSINGIFEQAASFVQAAPLRQ 855
Query: 828 YHPISFVAKGANRNFLLAPLLSLRDESYTVYFDF 861
YHPISF+AKG RNFLL PL SLRDE YTVYF+
Sbjct: 856 YHPISFIAKGVKRNFLLEPLYSLRDEFYTVYFNL 889
>gi|357123866|ref|XP_003563628.1| PREDICTED: uncharacterized protein LOC100829886 [Brachypodium
distachyon]
Length = 850
Score = 922 bits (2382), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/865 (55%), Positives = 603/865 (69%), Gaps = 45/865 (5%)
Query: 24 AQAKECTNAYPELASHTFRSNLLS--SKNESYIKQIHSHNDHLTPSDDSAWLSLMPRKIL 81
A AKECTN +L+SHT R+ L S E ++ + + H++P+D++ W+ L R L
Sbjct: 2 AVAKECTNVPTQLSSHTVRARLQGDPSAEEWRLRALFHDHAHVSPTDEATWMDL--RAPL 59
Query: 82 REEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLG--SDSMHWRAQQTNL 139
E WAMLYR +K + FL+EV L DVRL D+++ RAQQTNL
Sbjct: 60 ASSAATEESGWAMLYRALKGSASGGSASAAAGFLEEVPLQDVRLDMEEDAVYGRAQQTNL 119
Query: 140 EYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNE 199
EYLL+LDVD+L+W+FR A LPAPG+PYGGWE ELRGHFVGHYLSA+A WASTHN
Sbjct: 120 EYLLLLDVDRLLWSFRTQAGLPAPGKPYGGWEGADVELRGHFVGHYLSAAAKTWASTHNG 179
Query: 200 SLKEKMSAVVSALSACQKEI----GSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGL 255
+L KMSAVV AL CQ+ G+GYLSAFP E FDR EA+ PVWAPYYT+HKI+ GL
Sbjct: 180 TLAAKMSAVVDALHECQQAAAANGGNGYLSAFPAEFFDRFEAIQPVWAPYYTVHKIMQGL 239
Query: 256 LDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCI 315
LDQ+T A N +AL M M YF RV++VI+++ IERHW +LNEE GGMNDVLY+L+ I
Sbjct: 240 LDQHTVAGNGKALAMAVAMAGYFGGRVRSVIQRHGIERHWTSLNEETGGMNDVLYQLYTI 299
Query: 316 TQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-- 373
T D +HL+LAHLFDKPCFLGLLA+QAD ++GFH+NTHIP+V+G QMRYEVTGD L+KE
Sbjct: 300 TNDQRHLVLAHLFDKPCFLGLLAVQADSLTGFHANTHIPVVVGGQMRYEVTGDPLYKEIS 359
Query: 374 ---------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWT 424
H + GT++ F SDPKRLAS L + EESCTTYNMLKVSRHLFRWT
Sbjct: 360 TFFMDIVNTSHSYATGGTSVS--EFWSDPKRLASTLTTENEESCTTYNMLKVSRHLFRWT 417
Query: 425 KEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYG 484
KEIAYADYYER+L NGVL IQRG +PGVMIY+LP PG SK SYH WGT DSFWCCYG
Sbjct: 418 KEIAYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAVSYHGWGTQYDSFWCCYG 477
Query: 485 TGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVT 544
TGIESFSKLGD+IYFEE+G P +Y++QYI S +WKS + V Q++ P+ S D YL+V+
Sbjct: 478 TGIESFSKLGDTIYFEEKGSKPTLYVVQYIPSIFNWKSAGLTVTQRLKPLSSSDQYLQVS 537
Query: 545 LTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQ 604
L+ S+K +G ++N+RIP+W S+NGAKATLN + L L SPG FL+VTK W+S D LT+Q
Sbjct: 538 LSISAKTNGQYATVNVRIPSWASANGAKATLNDKYLQLGSPGTFLTVTKQWNSGDHLTLQ 597
Query: 605 LPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITES--ATSLSDWITPIPASY 662
LP+ LRTEAI+DDR E+AS+QA+L+GP++LAG S GDWD A ++SDWI+P+P+SY
Sbjct: 598 LPINLRTEAIKDDRAEFASLQAVLFGPFLLAGLSTGDWDAKTGAAAAAISDWISPVPSSY 657
Query: 663 NSQLITFTQEYGNTKFVLTNSN-QSITMEKFPK-SGTDAALHATFRLILNDSSGSEFSSL 720
+SQL+T TQE G + FVL+ N S+ M+ P+ GT+AA+H TFRL+ S ++
Sbjct: 658 SSQLVTLTQESGGSTFVLSTVNGTSLAMQPRPEGGGTEAAVHGTFRLVPQGFSPPPTTNR 717
Query: 721 NDFIG---KSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVS 777
S M+EPFD PGM + TD VV + GS +F++V GLDG +VS
Sbjct: 718 RHGAPTNLASAMIEPFDLPGMAI----TDALTVVRSEEKSSGSLLFNVVPGLDGKPGSVS 773
Query: 778 LESETYKGCFVYTAVNLQSSESTKLGCISESTEAGFNN-AASFVIEKGLSEYHPISFVAK 836
LE T GCFV TA ++GC AGF+ AASF + L YHPISFVA+
Sbjct: 774 LELGTRPGCFVVTA-----GAKVQVGC-----GAGFSQAAASFARAEPLRRYHPISFVAR 823
Query: 837 GANRNFLLAPLLSLRDESYTVYFDF 861
GA R FLL PL +LRDE YTVYF+
Sbjct: 824 GARRGFLLEPLFTLRDEFYTVYFNL 848
>gi|242096362|ref|XP_002438671.1| hypothetical protein SORBIDRAFT_10g024070 [Sorghum bicolor]
gi|241916894|gb|EER90038.1| hypothetical protein SORBIDRAFT_10g024070 [Sorghum bicolor]
Length = 887
Score = 917 bits (2369), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/882 (55%), Positives = 611/882 (69%), Gaps = 70/882 (7%)
Query: 26 AKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWLSLMP---RKILR 82
AKECTN EL+SHT R+ L +S + + ++HL P+D++AW+ LMP R L+
Sbjct: 28 AKECTNIPTELSSHTVRARLQASPGAAEWRWRELFHEHLNPTDEAAWMDLMPPPPRGGLQ 87
Query: 83 ----------EEEQDELFSWAMLYRKIKNPGQFKV---------PERSGEFLKEVSLHDV 123
+++E W MLYR +K GQ V +G FL+EVSLHDV
Sbjct: 88 TAAAADAGHHHHQEEEELDWVMLYRSLK--GQQVVVGGAVPASGAAAAGPFLEEVSLHDV 145
Query: 124 RL---GSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGH 180
RL G D+ + RAQ+TNLEYLL+LDVD+LVW+FR A LPAPGEPYGGWE+P ELRGH
Sbjct: 146 RLDPDGDDAAYGRAQRTNLEYLLLLDVDRLVWSFRSQAALPAPGEPYGGWEKPDSELRGH 205
Query: 181 FVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIP 240
FVGHYLSA+A MWASTHN +L KMSAVV AL CQ+ G+GYLSAFP E FDR EA+ P
Sbjct: 206 FVGHYLSATAKMWASTHNGTLAGKMSAVVDALDECQRAAGTGYLSAFPAEFFDRFEAIKP 265
Query: 241 VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNE 300
VWAPYYTIHKI+ GLLDQ+ A N +AL M M +YF RV+NVI++YSIERHW +LNE
Sbjct: 266 VWAPYYTIHKIMQGLLDQHVVAGNGKALGMVVAMADYFAGRVRNVIRRYSIERHWTSLNE 325
Query: 301 EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
E GGMNDVLY+L+ IT D +HL+LAHLFDKPCFLGLLA+QAD +S FH+NTHIP+VIG Q
Sbjct: 326 ETGGMNDVLYQLYTITHDQRHLVLAHLFDKPCFLGLLAVQADSLSNFHANTHIPVVIGGQ 385
Query: 361 MRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCT 409
MRYEVTGD L+KE H + GT++ F SDPKRLA L + TEESCT
Sbjct: 386 MRYEVTGDPLYKEIATFFMDTVNSSHAYATGGTSVS--EFWSDPKRLAEALTTETEESCT 443
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSY 469
TYNMLKVSRHLFRWTKE+AYADYYER+L NGVL IQRG +PGVMIY+LP PG SK +SY
Sbjct: 444 TYNMLKVSRHLFRWTKEVAYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAKSY 503
Query: 470 HHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQ 529
H WGT ++SFWCCYGTGIESFSKLGDSIYFEE+G+ P +YI+Q+I S +W++ + V Q
Sbjct: 504 HGWGTQNESFWCCYGTGIESFSKLGDSIYFEEKGQKPALYIVQFIPSTFNWRTTGLTVTQ 563
Query: 530 KVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFL 589
K+ P+ SWD YL+V+ + S+K G +LN+RIP+WTS NGAKATLN +DL L SPG FL
Sbjct: 564 KLMPLSSWDQYLQVSFSISAKTDGQFATLNVRIPSWTSLNGAKATLNDKDLQLASPGTFL 623
Query: 590 SVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITES-- 647
+V+K W S D+L +QLP+ LRTEAI+DDRPEYASIQA+L+GP++LAG + G+WD
Sbjct: 624 TVSKQWGSGDQLLLQLPIHLRTEAIKDDRPEYASIQAVLFGPFLLAGLTTGEWDAKTGAA 683
Query: 648 ATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPK--SGTDAALHATF 705
A + +DWITP+P NSQL+T QE G FVL+ N S+TM++ PK GTDAA+HATF
Sbjct: 684 AAAATDWITPVPPGSNSQLVTLAQESGGKAFVLSAVNGSLTMQERPKDSGGTDAAVHATF 743
Query: 706 RLILNDSSGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHL 765
RL+ ++ + + LEP D PGM+V D L V+ ++F++
Sbjct: 744 RLVPQGTNST----------AAATLEPLDMPGMVVT-----DTLTVSAE--KSSGALFNV 786
Query: 766 VAGLDGGDRTVSLESETYKGCFVYTAVNLQSSESTKLGCISESTEAG------FNNAASF 819
V GL G +VSLE + GCF+ V S E ++GC + G F AASF
Sbjct: 787 VPGLAGAPGSVSLELGSRPGCFL---VAGGSGEKVQVGCTGGVKKHGNGGGDWFRQAASF 843
Query: 820 VIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFDF 861
+ + YHP+SF A+G R+FLL PL +LRDE YT+YF+
Sbjct: 844 ARAEPMRRYHPMSFAARGVRRSFLLEPLFTLRDEFYTIYFNL 885
>gi|51090917|dbj|BAD35522.1| hypothetical protein [Oryza sativa Japonica Group]
gi|51090951|dbj|BAD35554.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 883
Score = 893 bits (2308), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/878 (54%), Positives = 596/878 (67%), Gaps = 58/878 (6%)
Query: 27 KECTNAYPELASHTFRSNLLSSKNESYI-KQIHSHNDHLTPSDDSAWLSLMPRKILREEE 85
KECTN +L+SHT R+ L SS + ++ + H DHL P+D++AW+ LMP E
Sbjct: 23 KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMPLAAASASE 82
Query: 86 QDELFSWAMLYRKIKNPG-----QFKVPERSGEFLKEVSLHDVRL----GSDSMHWRAQQ 136
F WAMLYR +K FL+EVSLHDVRL G D ++ RAQQ
Sbjct: 83 ----FDWAMLYRSLKGAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQQ 138
Query: 137 TNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWAST 196
TNLEYLL+L+VD+LVW+FR A LPAPG+PYGGWE P ELRGHFVGHYLSA+A MWAST
Sbjct: 139 TNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVGHYLSAAAKMWAST 198
Query: 197 HNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLL 256
HN +L KM+AVV AL CQ G+GYLSAFP E FDR EA+ PVWAPYYTIH I+ GLL
Sbjct: 199 HNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIH-IMQGLL 257
Query: 257 DQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCIT 316
DQ+T A N +AL M M +YF RV++VI++Y+IERHW +LNEE GGMNDVLY+L+ IT
Sbjct: 258 DQHTVAGNGKALGMVVAMADYFAGRVRSVIQRYTIERHWTSLNEETGGMNDVLYQLYTIT 317
Query: 317 QDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE--- 373
+D +HL+LAHLFDKPCFLGLLA+QAD +SGFH+NTHIP+VIG QMRYEVTGD L+KE
Sbjct: 318 KDQRHLVLAHLFDKPCFLGLLAVQADSLSGFHANTHIPVVIGGQMRYEVTGDPLYKEIAT 377
Query: 374 --------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK 425
H + GT++ F S+PK LA L + TEESCTTYNMLKVSRHLFRWTK
Sbjct: 378 FFMDIVNSSHSYATGGTSVSEF--WSNPKHLAEALTTETEESCTTYNMLKVSRHLFRWTK 435
Query: 426 EIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGT 485
EIAYADYYER+L NGVL IQRG +PGVMIY+LP PG SK SYH WGT +SFWCCYGT
Sbjct: 436 EIAYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAVSYHGWGTQYNSFWCCYGT 495
Query: 486 GIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTL 545
GIESFSKLGDSIYFE++G PG+YIIQYI S +W++ + V Q+V P+ S D YL+V+L
Sbjct: 496 GIESFSKLGDSIYFEQKGDKPGLYIIQYIPSTFNWRTAGLTVTQQVKPLSSSDQYLQVSL 555
Query: 546 TFS-SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTW-SSDDKLTI 603
+ S +K +G +LN+RIP+WTS NGAKATLN +DL L SPG FL+++K W S DD L +
Sbjct: 556 SISAAKTNGQYATLNVRIPSWTSMNGAKATLNDKDLQLASPGTFLTISKQWDSGDDHLLL 615
Query: 604 QLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWD--ITESATSLSDWITPIPAS 661
Q P+ LRTEAI+DDRP+ AS+ AIL+GP++LAG + GDWD +AT+ SDWITP+PAS
Sbjct: 616 QFPINLRTEAIKDDRPQVASLNAILFGPFLLAGLTTGDWDAKTGGAATAASDWITPVPAS 675
Query: 662 YNSQLITFTQEYGNTKFVLTNSNQ-SITMEKFPK--SGTDAALHATFRLILNDSSGS--- 715
YNSQL+T TQE G +L+ N S+ M + P+ GTDAA+ ATFR++ S
Sbjct: 676 YNSQLVTLTQESGGKTMLLSTVNDTSLAMLERPEGAGGTDAAVRATFRVVPPGSRAELRQ 735
Query: 716 -----EFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLD 770
+ +EPF PG V + L V + + S++F++ GLD
Sbjct: 736 RAGAGAGEGAARLKVAAATIEPFGLPGTAV-----SNGLAVVRAGNSS-STLFNVAPGLD 789
Query: 771 GGDRTVSLESETYKGCFVYTAVNLQSSESTKLGCISE-----STEAGFNNAASFVIEKGL 825
G +VSLE + GCF+ + +GC + + AGF AASF + L
Sbjct: 790 GKPGSVSLELGSKPGCFLVAGAGAK----VHVGCRTRGGAAAAAAAGFEQAASFAQAEPL 845
Query: 826 SEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFDFQS 863
YH ISF A G R+FLL PL +LRDE YT+YF+ +
Sbjct: 846 RRYHAISFFASGVRRSFLLEPLFTLRDEFYTIYFNLAA 883
>gi|218198543|gb|EEC80970.1| hypothetical protein OsI_23693 [Oryza sativa Indica Group]
Length = 905
Score = 850 bits (2197), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/904 (51%), Positives = 584/904 (64%), Gaps = 88/904 (9%)
Query: 27 KECTNAYPELASHTFRSNLLSSKNESYI-KQIHSHNDHLTPSDDSAWLSLMPRKILREEE 85
KECTN +L+SHT R+ L SS + ++ + H DHL P+D++AW+ LMP E
Sbjct: 23 KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMPLAAASASE 82
Query: 86 QDELFSWAMLYRKIKNPG-----QFKVPERSGEFLKEVSLHDVRL----GSDSMHWRAQQ 136
F WAMLYR +K FL+EVSLHDVRL G D ++ RAQQ
Sbjct: 83 ----FDWAMLYRSLKGAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQQ 138
Query: 137 TNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWAST 196
TNLEYLL+L+VD+LVW+FR A LPAPG+PYGGWE P ELRGHFVGHYLSA+A MWAST
Sbjct: 139 TNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVGHYLSAAAKMWAST 198
Query: 197 HNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHK------ 250
HN +L KM+AVV AL CQ G+GYLSAFP E FDR EA+ PVWAPYYTIHK
Sbjct: 199 HNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKARNATQ 258
Query: 251 --------------------ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYS 290
I+ GLLDQ+T A N +AL M M +YF RV++VI++Y+
Sbjct: 259 SICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGKALGMVVAMADYFAGRVRSVIQRYT 318
Query: 291 IERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSN 350
IERHW +LNEE GGMNDVLY+L + F + CFLGLLA+QAD +SGFH+N
Sbjct: 319 IERHWTSLNEETGGMNDVLYQL-----KTEAFGAGSSFRQACFLGLLAVQADSLSGFHAN 373
Query: 351 THIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASN 399
THIP+VIG QMRYEVTGD L+KE H + GT++ F S+PK LA
Sbjct: 374 THIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVSEF--WSNPKHLAEA 431
Query: 400 LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL 459
L + TEESCTTYNMLKVSRHLFRWTKEIAYADYYER+L NGVL IQRG +PGVMIY+LP
Sbjct: 432 LTTETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMIYMLPQ 491
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
PG SK SYH WGT +SFWCCYGTGIESFSKLGDSIYFE++G PG+YIIQYI S +
Sbjct: 492 GPGRSKAVSYHGWGTQYNSFWCCYGTGIESFSKLGDSIYFEQKGDKPGLYIIQYIPSTFN 551
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFS-SKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
W++ + V Q+V P+ S D YL+V+L+ S +K +G +LN+RIP+WTS NGAKATLN +
Sbjct: 552 WRTAGLTVTQQVKPLSSSDQYLQVSLSISAAKTNGQYATLNVRIPSWTSMNGAKATLNDK 611
Query: 579 DLPLPSPGNFLSVTKTW-SSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
DL L SPG FL+++K W S DD L +Q P+ LRTEAI+DDRP+ AS+ AIL+GP++LAG
Sbjct: 612 DLQLASPGTFLTISKQWDSGDDHLLLQFPINLRTEAIKDDRPQVASLNAILFGPFLLAGL 671
Query: 638 SIGDWD--ITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQ-SITMEKFPK 694
+ GDWD +AT+ SDWITP+PASYNSQL+T TQE G +L+ N S+ M + P+
Sbjct: 672 TTGDWDAKTGGAATAASDWITPVPASYNSQLVTLTQESGGKTMLLSTVNDTSLAMLERPE 731
Query: 695 --SGTDAALHATFRLILNDSSG--------SEFSSLNDFIGKSVMLEPFDSPGMLVIQHE 744
GTDAA+ ATFR++ S + +EPF PG V
Sbjct: 732 GAGGTDAAVRATFRVVPPGSRAELRQRAGAGAGEGAARLKVAAATIEPFGLPGTAV---- 787
Query: 745 TDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYKGCFVYTAVNLQSSESTKLGC 804
+ L V + + S++F++V GLDG +VSLE + GCF+ + +GC
Sbjct: 788 -SNGLAVVRAGNSS-STLFNVVPGLDGKPGSVSLELGSKPGCFLVAGAGAK----VHVGC 841
Query: 805 ISE-----STEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYF 859
+ + AGF AASF + L YH ISF A G R+FLL PL +LRDE YT+YF
Sbjct: 842 RTRGGAAAAAAAGFEQAASFAQAEPLRRYHAISFFASGVRRSFLLEPLFTLRDEFYTIYF 901
Query: 860 DFQS 863
+ +
Sbjct: 902 NLAA 905
>gi|255544804|ref|XP_002513463.1| conserved hypothetical protein [Ricinus communis]
gi|223547371|gb|EEF48866.1| conserved hypothetical protein [Ricinus communis]
Length = 759
Score = 831 bits (2147), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/616 (65%), Positives = 483/616 (78%), Gaps = 26/616 (4%)
Query: 249 HKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDV 308
H +LAGLLDQY +ADNA+AL+M WMVEYFYNRVQNVI KYS+ERH+ +LNEE GGMNDV
Sbjct: 169 HFVLAGLLDQYIFADNAQALKMVNWMVEYFYNRVQNVITKYSVERHFLSLNEETGGMNDV 228
Query: 309 LYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGD 368
LYKLF IT +PKHL+LAHLFDKPCFLGLLA+Q +I F + ++ S Y G
Sbjct: 229 LYKLFSITGEPKHLVLAHLFDKPCFLGLLAVQ--EIGTFFMD-----IVNSSHTYATGG- 280
Query: 369 QLHKEGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIA 428
+ F SDPKRLAS L+ TEESCTTYNMLKVSRHLFRWTKE+A
Sbjct: 281 ---------------TSDYEFWSDPKRLASTLNDQTEESCTTYNMLKVSRHLFRWTKEMA 325
Query: 429 YADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIE 488
YADYYER+LTNGVLGIQRGTEPGVMIYLLP PG SK R+ H WGTP DSFWCCYGTGIE
Sbjct: 326 YADYYERALTNGVLGIQRGTEPGVMIYLLPQNPGGSKARTIHKWGTPDDSFWCCYGTGIE 385
Query: 489 SFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFS 548
SFSKLGDSIYFEE + PG+Y+IQYISS LDWK GQIV+NQKVDP+ SWDP+LRVT TF
Sbjct: 386 SFSKLGDSIYFEEGSQIPGLYVIQYISSSLDWKLGQIVLNQKVDPIFSWDPFLRVTFTF- 444
Query: 549 SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLT 608
+G+ +++LNLRIP WT S+ KAT+N Q LP+P PGNFLSVT +WSS DKL +QLP+
Sbjct: 445 DQGASQSSTLNLRIPIWTHSDDVKATINAQSLPVPPPGNFLSVTGSWSSSDKLFLQLPII 504
Query: 609 LRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI-TESATSLSDWITPIPASYNSQLI 667
LRTEAI+DDRPEYASIQAIL+GPY+LAGHS GDWD+ +ESA SLSDWIT IPA+YNS L+
Sbjct: 505 LRTEAIKDDRPEYASIQAILFGPYLLAGHSSGDWDLKSESAKSLSDWITAIPATYNSHLV 564
Query: 668 TFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFIGKS 727
+F+Q+ G++ F LTNSNQS+TME FP+ GTD ++HATFRLILNDSS SE ++ D +GK
Sbjct: 565 SFSQDSGDSVFALTNSNQSLTMEIFPQPGTDDSVHATFRLILNDSSSSELANFEDAVGKL 624
Query: 728 VMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYKGCF 787
VMLEPF+ PGML++Q + L V + + GSS+F LV+GLDG D +VSLES + + CF
Sbjct: 625 VMLEPFNLPGMLLVQQGKEVSLAVGYTDGSDGSSLFRLVSGLDGKDGSVSLESVSNENCF 684
Query: 788 VYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPL 847
V++ V+ +S + KL C +S+E FN ASF++ KG+S YHPISFVAKGA RNFLL+PL
Sbjct: 685 VFSGVDYKSGTALKLSC-KKSSETKFNQGASFMVNKGISHYHPISFVAKGAKRNFLLSPL 743
Query: 848 LSLRDESYTVYFDFQS 863
S RDESYT+YF+ Q+
Sbjct: 744 FSFRDESYTIYFNIQA 759
Score = 207 bits (526), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 105/178 (58%), Positives = 129/178 (72%), Gaps = 13/178 (7%)
Query: 9 GFFKFLLTFLLIVSA----AQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHL 64
GF F L L+ S +KECTN +L+SHTFR LLSS NES +++ +H HL
Sbjct: 3 GFVVFELLVLVAASVLCGFGMSKECTNIPTQLSSHTFRYALLSSNNESLKQEMFAHY-HL 61
Query: 65 TPSDDSAWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVR 124
TP+DDS W SL+PRK+L+EE++ F WAM+Y+K+K+P Q SG FLKEVSLH+VR
Sbjct: 62 TPTDDSVWSSLLPRKMLKEEDE---FDWAMMYKKLKSPLQ-----SSGNFLKEVSLHNVR 113
Query: 125 LGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFV 182
L S HWRAQQTNLEYLLML++D+LVW+FRKTA LP PG YGGWE P+ ELRGHFV
Sbjct: 114 LDLGSFHWRAQQTNLEYLLMLNLDRLVWSFRKTAGLPTPGTAYGGWEAPNVELRGHFV 171
>gi|357472931|ref|XP_003606750.1| hypothetical protein MTR_4g065200 [Medicago truncatula]
gi|355507805|gb|AES88947.1| hypothetical protein MTR_4g065200 [Medicago truncatula]
Length = 646
Score = 829 bits (2142), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/684 (59%), Positives = 508/684 (74%), Gaps = 55/684 (8%)
Query: 11 FKFLLTFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDS 70
F F+ +++ KEC N P+ SHTFR L +SKNE++ K++ SH HLTP+D+S
Sbjct: 4 FVFMFMAIMLFGCVAGKECMNNLPQ--SHTFRYELWASKNETWKKEVMSHY-HLTPTDES 60
Query: 71 AWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSM 130
AW L+PRK+L EE Q + WA YR++KN K P FLKEV L DVRL S+
Sbjct: 61 AWADLLPRKLLSEENQRD---WAAKYREMKNADLSKPPVG---FLKEVPLGDVRLLEGSI 114
Query: 131 HWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASA 190
H +AQ+TNLEYLLMLDVD L+W+FRKTA LP PG PYGGWE+PS ELRGHFVGHYLSASA
Sbjct: 115 HAQAQKTNLEYLLMLDVDSLIWSFRKTAGLPTPGTPYGGWEDPSIELRGHFVGHYLSASA 174
Query: 191 LMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHK 250
LMWAST N++L EKMSA+VS LSACQ++IG+GYLSAFPTE FDR+EAL WAPYYTIHK
Sbjct: 175 LMWASTKNDNLNEKMSALVSGLSACQEKIGTGYLSAFPTELFDRVEALQYAWAPYYTIHK 234
Query: 251 ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLY 310
ILAGLLDQYT N +AL+M TWMV+YFYNRV NVI+K ++ H+Q+LNEEAGGMNDVLY
Sbjct: 235 ILAGLLDQYTIGGNPQALKMVTWMVDYFYNRVMNVIQKLTVNGHYQSLNEEAGGMNDVLY 294
Query: 311 KLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQL 370
+L+ IT+D KHL+LAHLFDKPCFLG+LA+QA+DI+ FH+NTHIPIV+GSQ+RYEVTGD L
Sbjct: 295 RLYSITRDSKHLVLAHLFDKPCFLGVLAVQANDIANFHANTHIPIVVGSQLRYEVTGDPL 354
Query: 371 HKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSN-TEESCTTYNMLKVSR 418
+K+ H + GT++ F +DPKR+A NL S EESCTTYNMLKVSR
Sbjct: 355 YKDIGAFFMDIVNSSHTYATGGTSVRE--FWNDPKRIADNLKSTENEESCTTYNMLKVSR 412
Query: 419 HLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDS 478
HLFRWTKE++YADYYER+LTNGVL IQRGT+PGVMIY+LPL G SK ++ WG P ++
Sbjct: 413 HLFRWTKEVSYADYYERALTNGVLSIQRGTDPGVMIYMLPLGLGVSKAKTDKGWGNPFNT 472
Query: 479 FWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWD 538
FWCCYGTGIESFSKLGDSIYFEEEG P +YIIQYISS +WKSG+I++ Q V P S D
Sbjct: 473 FWCCYGTGIESFSKLGDSIYFEEEGHNPSLYIIQYISSSFNWKSGKILLTQTVVPAASSD 532
Query: 539 PYLRVTLTFS-SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSS 597
PYLRVT TFS ++ +G +++LN R+P+W+ ++GAKA LN + L LP+P
Sbjct: 533 PYLRVTFTFSPNETTGTSSTLNFRVPSWSHADGAKAILNSETLSLPAP------------ 580
Query: 598 DDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE-SATSLSDWIT 656
DDRPE+AS+QAILYGPY+LAGH+ WDI + +++DWIT
Sbjct: 581 ------------------DDRPEFASLQAILYGPYLLAGHTTSIWDIKGVTNKAVADWIT 622
Query: 657 PIPASYNSQLITFTQEYGNTKFVL 680
PIP++Y+SQL+ F + + +L
Sbjct: 623 PIPSNYSSQLVFFIHKTSTNQLLL 646
>gi|242096364|ref|XP_002438672.1| hypothetical protein SORBIDRAFT_10g024080 [Sorghum bicolor]
gi|241916895|gb|EER90039.1| hypothetical protein SORBIDRAFT_10g024080 [Sorghum bicolor]
Length = 933
Score = 818 bits (2114), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/891 (48%), Positives = 567/891 (63%), Gaps = 102/891 (11%)
Query: 60 HND---HLTPSDDSAWLSLMPRKILREEEQDEL----FSWAMLYRKIKNPGQFKVPERS- 111
HND HLTP++++ W++L+PR++ F W LYR + G P+
Sbjct: 49 HNDGLPHLTPTEEATWMALLPRRLRGGGGGGARARAEFDWLALYRSLTRGGG---PDDDA 105
Query: 112 -------GEFLKEVSLHDVRL----------------GSDSMHWRAQQTNLEYLLMLDVD 148
GE L SLHDVRL S +M+W+AQQTNLEYLL LD D
Sbjct: 106 DAGKPGPGELLTPASLHDVRLHGDDDDDDRVLTGSSSSSAAMYWQAQQTNLEYLLYLDPD 165
Query: 149 KLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAV 208
+L W FR+ A LP G+PYGGWE P +LRGHF GHYLSASA MWA+THN +L+E+M+ V
Sbjct: 166 RLTWTFRRQAGLPTVGDPYGGWEAPGGQLRGHFTGHYLSASAHMWAATHNSTLRERMTRV 225
Query: 209 VSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEAL 268
V L CQK++G+GYL+A+P FD E L W+PYYTIHKI+ GLLDQY A N + L
Sbjct: 226 VDILYDCQKKMGTGYLAAYPETMFDLYEQLDEAWSPYYTIHKIMQGLLDQYMLASNKKGL 285
Query: 269 RMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLF 328
+ WM +YF NRV+N+I+KY+I+RHW+ +NEE GG NDV+Y+L+ IT++ KHL +AHLF
Sbjct: 286 DVVVWMTDYFSNRVKNLIQKYTIQRHWEAMNEETGGFNDVMYQLYTITKNQKHLTMAHLF 345
Query: 329 DKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQL 377
DKPCFLG L L DDISG H NTH+P++IG+Q RYEV GD L+K+ H
Sbjct: 346 DKPCFLGPLGLHKDDISGLHVNTHLPVIIGTQKRYEVVGDHLYKDISTYLFDVVNSSHTF 405
Query: 378 ESSGTN-IGHFNFKSDPKRLASNLD-SNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
+ GT+ + H++ DPKRL + S+ EE+C TYN LKVSR+LFRWTKE YAD+YER
Sbjct: 406 ATGGTSTMEHWH---DPKRLVDEIKISSNEETCATYNFLKVSRNLFRWTKEAKYADHYER 462
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKE-----------RSYHHWGTPSDSFWCCYG 484
L NG++G QRGT+PGVM+Y LP+ PG SK ++ WG P+D+FWCCYG
Sbjct: 463 LLINGIMGNQRGTQPGVMLYFLPMGPGRSKSVSGLSPSGLPPKNPGGWGGPNDTFWCCYG 522
Query: 485 TGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVT 544
TGIESFSKLGDSIYF EEG+ PG+YIIQYI S DWK+ + VNQ+ P++S DP+ +V+
Sbjct: 523 TGIESFSKLGDSIYFLEEGEAPGLYIIQYIPSTFDWKATGLTVNQQAKPLLSTDPFFKVS 582
Query: 545 LTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGN-----FLSVTKTWSSDD 599
LTFS+KG +++RIP+WTS++G ATLNGQ L L S GN FL+VTK W ++D
Sbjct: 583 LTFSAKGDAQLAKVSVRIPSWTSTDGTTATLNGQKLNLTSTGNSTNGGFLTVTKLW-AED 641
Query: 600 KLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE------------- 646
LT+Q P+TLRTEAI+DDRPEYASIQA+L+GP++LAG + G +T+
Sbjct: 642 TLTLQFPITLRTEAIKDDRPEYASIQAVLFGPHLLAGLTHGKLPVTDSNHSNDGLTPSIW 701
Query: 647 -----SATSLSDWITPIPA-SYNSQLITFTQEYGNTKFVLTNS--NQSITMEKFPKSGTD 698
SAT+++DW+TP+P+ + NSQL+T TQ G VL+ S + + M++ P GTD
Sbjct: 702 EVNATSATAVTDWVTPLPSETLNSQLVTLTQTAGGRTLVLSVSIADAKLEMQEQPAPGTD 761
Query: 699 AALHATFRLILNDSSGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQ 758
A +HATFR + + S SL G +V +EPFD PGM V + L+
Sbjct: 762 ACVHATFR-VYGQAGSSSSESLLPMQGPNVTIEPFDRPGMAVT-----NGLLAVGRPAGG 815
Query: 759 GSSVFHLVAGLDGGDRTVSLESETYKGCFVYTAVNLQSSESTKLGCISESTEAG------ 812
++F+ V GLDG +VSLE T GCFV TA ++ +T++ C G
Sbjct: 816 RDTLFNAVPGLDGAPGSVSLELATRPGCFVATAPAAGANAATQVVCRGNKNNGGSASGDG 875
Query: 813 --FNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFDF 861
AASFV L Y+P+SF A+G RNFLL PL SL+DE YTVYF
Sbjct: 876 AALRRAASFVRAAPLRRYNPLSFAARGTARNFLLEPLRSLQDEFYTVYFSL 926
>gi|125556053|gb|EAZ01659.1| hypothetical protein OsI_23694 [Oryza sativa Indica Group]
Length = 898
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/865 (49%), Positives = 556/865 (64%), Gaps = 84/865 (9%)
Query: 60 HND---HLTPSDDSAWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLK 116
H+D HL ++++ W+ L+PR R +DEL W LYR I G E +G FL
Sbjct: 50 HSDGLPHLNQAEEATWMGLLPR---RAGPRDEL-DWLALYRSITRGGG---GEPAG-FLS 101
Query: 117 EVSLHDVRLG--SDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
SLHDVR+ +M+W+ QQTNLEYLL LD D+L W FR+ A+LP GEPYGGWE P
Sbjct: 102 PASLHDVRVDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPIVGEPYGGWEAPD 161
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
+LRGHF GHYLSA+A MWASTHN++L+EKM+ VV L +CQK++ +GYLSA+P FD
Sbjct: 162 GQLRGHFTGHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDA 221
Query: 235 LEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
+ L W+PYYTIHKI+ GLLDQYT A N + L + WM +YF RV+ +I++YSI+RH
Sbjct: 222 YDELAEAWSPYYTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSIQRH 281
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
W+ +NEE GG NDV+Y+L+ IT++ KHL +AHLFDKPCFLG L L DDISG H NTH+P
Sbjct: 282 WEAINEETGGFNDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNTHVP 341
Query: 355 IVIGSQMRYEVTGDQLHKE-----------GHQLESSGTN-IGHFNFKSDPKRLASNLD- 401
+++G+Q RYEV GDQL+KE H + GT+ + H++ DPKRL +
Sbjct: 342 VIVGAQKRYEVVGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWH---DPKRLVDEIKI 398
Query: 402 SNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAP 461
S+ EE+C TYN+LKVSR+LFRWTKE Y D+YER L NG++G QRG EPGVMIY LP+ P
Sbjct: 399 SSNEETCATYNLLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGP 458
Query: 462 GSSKE-----------RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G SK ++ WG + +FWCCYGTGIESFSKLGDSIYF EEG+ PG+YI
Sbjct: 459 GRSKSISGMPTSGLPPKNPGGWGNANATFWCCYGTGIESFSKLGDSIYFLEEGEIPGLYI 518
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
IQYI S DWK+ + V Q+ P+ S D + V++ SSKG ++N+RIP+WTS +G
Sbjct: 519 IQYIPSTFDWKAAGLTVKQQAKPLSSTDSHFEVSIFISSKGDARPANVNVRIPSWTSVDG 578
Query: 571 AKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
A ATLNGQ L L S G+FLSVTK W DD L+++ P+TLRTE I+DDRPEY+SIQA+L+G
Sbjct: 579 AIATLNGQKLNLTSAGDFLSVTKLW-GDDTLSLKFPITLRTEPIKDDRPEYSSIQAVLFG 637
Query: 631 PYVLAGHSIGDWDITESATSLS-------------------DWITPIPASYNSQLITFTQ 671
P++LAG + G+ + S S S W+TP+ S NSQL+T TQ
Sbjct: 638 PHLLAGLTHGNQTVKTSNDSNSGLTPGVWEVNATHAAAAVAGWVTPVSQSLNSQLVTLTQ 697
Query: 672 EYGNTK----FVLTNS--NQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFI- 724
G+ + FVL+ S + ++TM++ P +G+DA +HATFR + S S + +
Sbjct: 698 RDGDAQAAAAFVLSVSIADGALTMQESPVAGSDACVHATFRAYHSPSGASAIDAATGRLQ 757
Query: 725 GKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYK 784
G++V LEPFD PGM V D L V A + F+ VAGLDG TVSLE T
Sbjct: 758 GRNVALEPFDRPGMAVT-----DALSVGRPGPA---TRFNAVAGLDGLPGTVSLELATRP 809
Query: 785 GCFV------YTA---VNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVA 835
GCFV Y A + + T G + + F AASF L YHP+SF A
Sbjct: 810 GCFVAAPTTAYLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASFTQAAPLRLYHPLSFSA 869
Query: 836 KGANRNFLLAPLLSLRDESYTVYFD 860
G +RNFLL PL SL+DE YTVYF+
Sbjct: 870 TGTDRNFLLEPLQSLQDEFYTVYFN 894
>gi|125597849|gb|EAZ37629.1| hypothetical protein OsJ_21963 [Oryza sativa Japonica Group]
Length = 902
Score = 797 bits (2058), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/866 (49%), Positives = 554/866 (63%), Gaps = 83/866 (9%)
Query: 60 HND---HLTPSDDSAWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLK 116
H+D HL ++++ W+ L+PR R +DEL W LYR I G E +G FL
Sbjct: 51 HSDGLPHLNQAEEATWMGLLPR---RAGPRDEL-DWLALYRSITRGGGDVGGEPAG-FLS 105
Query: 117 EVSLHDVRLG--SDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
SLHDVR+ +M+W+ QQTNLEYLL LD D+L W FR+ A+LP GEPYGGWE P
Sbjct: 106 PASLHDVRVDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWEAPD 165
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
+LRGHF GHYLSA+A MWASTHN++L+EKM+ VV L +CQK++ +GYLSA+P FD
Sbjct: 166 GQLRGHFTGHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDA 225
Query: 235 LEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
+ L W+PYYTIHKI+ GLLDQYT A N + L + WM +YF RV+ +I++YSI+RH
Sbjct: 226 YDELAEAWSPYYTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSIQRH 285
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
W+ +NEE GG NDV+Y+L+ IT++ KHL +AHLFDKPCFLG L L DDISG H NTH+P
Sbjct: 286 WEAINEETGGFNDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNTHVP 345
Query: 355 IVIGSQMRYEVTGDQLHKE-----------GHQLESSGTN-IGHFNFKSDPKRLASNLD- 401
+++G+Q RYEV GDQL+KE H + GT+ + H++ DPKRL +
Sbjct: 346 VIVGAQKRYEVVGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWH---DPKRLVDEIKI 402
Query: 402 SNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAP 461
S+ EE+C TYN+LKVSR+LFRWTKE Y D+YER L NG++G QRG EPGVMIY LP+ P
Sbjct: 403 SSNEETCATYNLLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGP 462
Query: 462 GSSKE-----------RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G SK ++ WG + +FWCCYGTGIESFSKLGDSIYF EEG+ PG+YI
Sbjct: 463 GRSKSISGMPTSGLPPKNPGGWGNANATFWCCYGTGIESFSKLGDSIYFLEEGEIPGLYI 522
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
IQYI S DWK+ + V Q+ P+ S D + V++ SSKG ++N+RIP+WTS +G
Sbjct: 523 IQYIPSTFDWKAAGLTVKQQAKPLSSTDSHFEVSIFISSKGDARPANVNVRIPSWTSVDG 582
Query: 571 AKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
A ATLNGQ L L S G+FLSVTK W DD L+++ P+TLRTE I+DDRPEY+SIQA+L+G
Sbjct: 583 AIATLNGQKLNLTSAGDFLSVTKLW-GDDTLSLKFPITLRTEPIKDDRPEYSSIQAVLFG 641
Query: 631 PYVLAGHSIGDWDITESATSLSDWITP--------------------IPASYNSQLITFT 670
P++LAG + G+ + S S S +TP + S NSQL+T T
Sbjct: 642 PHLLAGLTHGNQTVKTSNDSNSG-LTPGVWEVNATHAAAAVAVWVTPVSQSLNSQLVTLT 700
Query: 671 QEYGNTK----FVLTNS--NQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFI 724
Q G+ + FVL+ S + ++TM++ P +G+DA +HATFR + S S + +
Sbjct: 701 QRDGDAQAAAAFVLSVSIADGALTMQESPVAGSDACVHATFRAYQSPSGASAIDAATGRL 760
Query: 725 -GKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETY 783
G+ V LEPFD PGM V D L V A + F+ VAGLDG TVSLE T
Sbjct: 761 QGRDVALEPFDRPGMAVT-----DALSVGRPGPA---TRFNAVAGLDGLPGTVSLELATR 812
Query: 784 KGCFV------YTA---VNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFV 834
GCFV Y A + + T G + + F AASF L YHP+SF
Sbjct: 813 PGCFVAAPTTAYLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASFTQAAPLRLYHPLSFS 872
Query: 835 AKGANRNFLLAPLLSLRDESYTVYFD 860
A G +RNFLL PL SL+DE YTVYF+
Sbjct: 873 ATGTDRNFLLEPLQSLQDEFYTVYFN 898
>gi|51090918|dbj|BAD35523.1| unknown protein [Oryza sativa Japonica Group]
gi|51090952|dbj|BAD35555.1| unknown protein [Oryza sativa Japonica Group]
Length = 902
Score = 796 bits (2056), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/866 (49%), Positives = 554/866 (63%), Gaps = 83/866 (9%)
Query: 60 HND---HLTPSDDSAWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLK 116
H+D HL ++++ W+ L+PR R +DEL W LYR I G E +G FL
Sbjct: 51 HSDGLPHLNQAEEATWMGLLPR---RAGPRDEL-DWLALYRSITRGGGDVGGEPAG-FLS 105
Query: 117 EVSLHDVRLG--SDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
SLHDVR+ +M+W+ QQTNLEYLL LD D+L W FR+ A+LP GEPYGGWE P
Sbjct: 106 PASLHDVRVDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWEAPD 165
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
+LRGHF GHYLSA+A MWASTHN++L+EKM+ VV L +CQK++ +GYLSA+P FD
Sbjct: 166 GQLRGHFTGHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDA 225
Query: 235 LEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
+ L W+PYYTIHKI+ GLLDQYT A N + L + WM +YF RV+ +I++YSI+RH
Sbjct: 226 YDELAEAWSPYYTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSIQRH 285
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
W+ +NEE GG NDV+Y+L+ IT++ KHL +AHLFDKPCFLG L L DDISG H NTH+P
Sbjct: 286 WEAINEETGGFNDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNTHVP 345
Query: 355 IVIGSQMRYEVTGDQLHKE-----------GHQLESSGTN-IGHFNFKSDPKRLASNLD- 401
+++G+Q RYEV GDQL+KE H + GT+ + H++ DPKRL +
Sbjct: 346 VIVGAQKRYEVVGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWH---DPKRLVDEIKI 402
Query: 402 SNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAP 461
S+ EE+C TYN+LKVSR+LFRWTKE Y D+YER L NG++G QRG EPGVMIY LP+ P
Sbjct: 403 SSNEETCATYNLLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGP 462
Query: 462 GSSKE-----------RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G SK ++ WG + +FWCCYGTGIESFSKLGDSIYF EEG+ PG+YI
Sbjct: 463 GRSKSISGMPTSGLPPKNPGGWGNANATFWCCYGTGIESFSKLGDSIYFLEEGEIPGLYI 522
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
IQYI S DWK+ + V Q+ P+ S D + V++ SSKG ++N+RIP+WTS +G
Sbjct: 523 IQYIPSTFDWKAAGLTVKQQAKPLSSTDSHFEVSIFISSKGDARPANVNVRIPSWTSVDG 582
Query: 571 AKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
A ATLNGQ L L S G+FLSVTK W DD L+++ P+TLRTE I+DDRPEY+SIQA+L+G
Sbjct: 583 AIATLNGQKLNLTSAGDFLSVTKLW-GDDTLSLKFPITLRTEPIKDDRPEYSSIQAVLFG 641
Query: 631 PYVLAGHSIGDWDITESATSLSDWITP--------------------IPASYNSQLITFT 670
P++LAG + G+ + S S S +TP + S NSQL+T T
Sbjct: 642 PHLLAGLTHGNQTVKTSNDSNSG-LTPGVWEVNATHAAAAVAVWVTPVSQSLNSQLVTLT 700
Query: 671 QEYGNTK----FVLTNS--NQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFI 724
Q G+ + FVL+ S + ++TM++ P +G+DA +HATFR + S S + +
Sbjct: 701 QRDGDAQAAAAFVLSVSIADGALTMQESPVAGSDACVHATFRAYHSPSGASAIDAATGRL 760
Query: 725 -GKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETY 783
G+ V LEPFD PGM V D L V A + F+ VAGLDG TVSLE T
Sbjct: 761 QGRDVALEPFDRPGMAVT-----DALSVGRPGPA---TRFNAVAGLDGLPGTVSLELATR 812
Query: 784 KGCFV------YTA---VNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFV 834
GCFV Y A + + T G + + F AASF L YHP+SF
Sbjct: 813 PGCFVAAPTTAYLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASFTQAAPLRLYHPLSFS 872
Query: 835 AKGANRNFLLAPLLSLRDESYTVYFD 860
A G +RNFLL PL SL+DE YTVYF+
Sbjct: 873 ATGTDRNFLLEPLQSLQDEFYTVYFN 898
>gi|293331149|ref|NP_001170532.1| uncharacterized protein LOC100384546 precursor [Zea mays]
gi|238005884|gb|ACR33977.1| unknown [Zea mays]
gi|413954824|gb|AFW87473.1| hypothetical protein ZEAMMB73_711416 [Zea mays]
Length = 902
Score = 795 bits (2052), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/869 (49%), Positives = 559/869 (64%), Gaps = 86/869 (9%)
Query: 60 HND---HLTPSDDSAWLSLMPRKILREEEQDELFSWAMLYRKIK-------NPGQFKVPE 109
H+D HLTP++++ W+SL+PR+ LR + E F W LYR + G+ PE
Sbjct: 51 HDDGLPHLTPTEEATWMSLLPRR-LRGGGRAE-FDWLALYRSLTRGDGPDGGAGKAAGPE 108
Query: 110 RSGEFLKEVSLHDVRLGSD----SMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE 165
L SLHDVRL D SM+WRAQQTNLEYLL LD D+L W FR+ A LP G+
Sbjct: 109 ---GLLSPASLHDVRLHGDGSLSSMYWRAQQTNLEYLLYLDPDRLTWTFRQQAGLPTVGD 165
Query: 166 PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLS 225
PYGGWE P +LRGHFVGHYLSASA WA+THN +L+E+M+ VV L ACQK++G+GYLS
Sbjct: 166 PYGGWEAPDGQLRGHFVGHYLSASAHAWAATHNGTLRERMARVVDILHACQKKMGTGYLS 225
Query: 226 AFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNV 285
A+P FD E L W+PYYT HKI+ GLLDQYT A N + L + M +YF NRV+N+
Sbjct: 226 AYPETMFDLYEQLDEAWSPYYTTHKIMQGLLDQYTLASNEKGLDVVLRMADYFSNRVKNL 285
Query: 286 IKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDIS 345
++ ++I+RHW+ +NEE GG NDV+Y+L+ IT+D KHL +AHLFDKPCFLG L L DDIS
Sbjct: 286 VQIHTIQRHWEAMNEETGGFNDVMYQLYTITRDQKHLTMAHLFDKPCFLGPLGLHKDDIS 345
Query: 346 GFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTN-IGHFNFKSDP 393
G H NTH+P+++G+Q RYEV GD+L+K+ H + GT+ + H++ DP
Sbjct: 346 GLHVNTHLPVLVGAQKRYEVVGDRLYKDISTYLFDVVNSSHTFATGGTSTMEHWH---DP 402
Query: 394 KRLASNLD-SNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV 452
KRL + S+ EE+C TYN LKVSR+LFRWTKE YAD+YER L NG++G QRGT+PGV
Sbjct: 403 KRLVDEIKISSNEETCATYNFLKVSRNLFRWTKEAKYADHYERLLINGIMGNQRGTQPGV 462
Query: 453 MIYLLPLAPGSSKERSYHH-----------WGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
M+Y LP+ PG SK S WG P+D+FWCCYGTGIESFSKLGDSIYF E
Sbjct: 463 MLYFLPMGPGRSKSVSGQSPSGLPPKNPGGWGGPNDTFWCCYGTGIESFSKLGDSIYFLE 522
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
EG PG+YIIQYI S DWK+ + VNQ+ P++S DP+ +V+LT S+K +++R
Sbjct: 523 EGDTPGLYIIQYIPSTFDWKATGLTVNQRAKPLLSTDPFFKVSLTISAKRGARQAKVSVR 582
Query: 562 IPTWTSSNGAKATLNGQDLPLPSPGN-----FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
IP+WT+++GA A LNGQ L L GN FL++TK W ++D LT+ P+TLRTEAI+D
Sbjct: 583 IPSWTTTDGATAILNGQKLNLTPTGNSTNGGFLTITKLW-ANDTLTLHFPITLRTEAIKD 641
Query: 617 DRPEYASIQAILYGPYVLAGHSIGDWDITES------------------ATSLSDWITPI 658
DRPEYASIQA+L+GP++LAG + G +T+S A S++ W+TP+
Sbjct: 642 DRPEYASIQAVLFGPHLLAGLTHGKLPVTDSSHSNDGLTAGIWEVDATGAASVAGWVTPL 701
Query: 659 PA-SYNSQLITFTQEYGNTKFVLTNS--NQSITMEKFPKSGTDAALHATFRLILNDSSGS 715
+ + NSQL+T Q G VL+ S + + M++ P GTDA +HATFR + G
Sbjct: 702 HSETLNSQLVTLKQSIGGRTLVLSVSIADAKLEMQEQPAPGTDACVHATFR-----AYGQ 756
Query: 716 EFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRT 775
S G +V +EPFD PGM V + L V ++F+ V GLDG +
Sbjct: 757 AGGSSQLLRGPNVTIEPFDRPGMAVT-----NGLAV--GCRGGRDTLFNAVPGLDGAPGS 809
Query: 776 VSLESETYKGCFVYTA-VNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFV 834
VSLE T G FV TA + ++ +T++ C + A F AASF L YHP+SF
Sbjct: 810 VSLELATRPGWFVATAPTAMHANATTQVVCRANKGGAAFRRAASFARAPPLRRYHPLSFA 869
Query: 835 AKGANRNFLLAPLLSLRDESYTVYFDFQS 863
A+G RNFLL PL SL+DE YTVYF S
Sbjct: 870 ARGTARNFLLEPLRSLQDEFYTVYFSLVS 898
>gi|168021740|ref|XP_001763399.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685534|gb|EDQ71929.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 757
Score = 790 bits (2041), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/767 (51%), Positives = 529/767 (68%), Gaps = 32/767 (4%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEP 173
LK+VSLH VRLG+DS + AQ TNL+YLL LDVD ++W+FRK + L APG+PYGGWE P
Sbjct: 1 LLKDVSLHKVRLGADSPQFMAQNTNLQYLLELDVDNMMWSFRKVSNLNAPGQPYGGWESP 60
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD 233
+ ELRGHFVGHYLSASALMWASTHNE L EKM+A++ AL CQ IG+GYLSAFP+E FD
Sbjct: 61 ASELRGHFVGHYLSASALMWASTHNEVLHEKMNALLGALKECQMSIGTGYLSAFPSEFFD 120
Query: 234 RLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIER 293
R EA+ VWAPYYTIHKI+AGLLDQY A + +AL M M YFY RV+ VI+K++IER
Sbjct: 121 RFEAIEYVWAPYYTIHKIMAGLLDQYLLAGSKDALDMVVEMANYFYKRVKTVIEKFTIER 180
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHI 353
HW++LNEE GGMNDVLY+L+ +T D KHL LAHLFDKPCFLG LALQAD +SGFHSNTHI
Sbjct: 181 HWRSLNEETGGMNDVLYRLYTVTGDNKHLELAHLFDKPCFLGPLALQADHLSGFHSNTHI 240
Query: 354 PIVIGSQMRYEVTGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRLASNLDS 402
PIV+G+QMRYEVT D +++ H + GT++ F +D R L +
Sbjct: 241 PIVVGAQMRYEVTSDLIYRSIAEYFMGIVNSSHSYATGGTSVS--EFWTDSMRQGDTLHT 298
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
+E+CTTYNMLK++R LFRWTK+I Y DYY+R+L NG+LG QRG +PGVMIY+LP+ PG
Sbjct: 299 ENQETCTTYNMLKIARTLFRWTKDIKYMDYYDRALINGILGTQRGQQPGVMIYMLPMGPG 358
Query: 463 SSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
SK RSYH WG +SFWCCYGT IESF+KLGDSIYFE++G+ P VY+ Q++SS W S
Sbjct: 359 VSKGRSYHGWGNKFNSFWCCYGTAIESFAKLGDSIYFEDDGEIPSVYVAQFVSSDFVWDS 418
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKG---SGLTTSLNLRIPTWTSSNGAKATLNGQD 579
+V++Q + P+ + L VT +FS + +++R+P+W G +A LNGQ+
Sbjct: 419 AGLVLHQSLKPLNAEQSILEVTFSFSHATIVRASQDAVIHVRLPSWV--RGCRAHLNGQE 476
Query: 580 LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSI 639
+ PG FLS+ + WSSDD+L + LP++L E IQDDR +Y+++ AI+YGP+V+AG S
Sbjct: 477 IESLIPGKFLSIARAWSSDDELVLLLPMSLGLEKIQDDRAQYSALHAIMYGPFVMAGLST 536
Query: 640 GDWDITESATSLSDWITPIPASYNSQLITFTQ-----EYGNTKFVLTNSNQSITMEKFPK 694
GDW + +L+ W+ P+PA+Y+SQL TF+Q EY + ++ N+ +I M P+
Sbjct: 537 GDWKLGHK-ENLTQWVYPVPAAYHSQLSTFSQFHVNGEYSGSLYLACNNGTAI-MRYAPE 594
Query: 695 SGTDAALHATFRLILNDSSGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDS 754
GTD +TFR+ + S+ S+ +D + V LE F PG+ +QH +D+ + T
Sbjct: 595 DGTDECGLSTFRVSDPFGNYSQLSAGDD--KRLVSLELFSQPGIF-LQHNGEDKPISTG- 650
Query: 755 FIAQGSSVFHLVAGLDGGDRTVSLESETYKGCFVYTAVNLQSSESTK-LGCISESTEAGF 813
SVF + GL G TVS E+ GCF+ ++ + S L C + +
Sbjct: 651 --PPSWSVFFYLPGLTGKSGTVSFEAVDKPGCFLSSSFSGSSVLGGVFLRCKTSRNDNTL 708
Query: 814 NNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFD 860
N ++F ++ G++ YHP+SF+A+G +RNFLLAPL SLRDESYT+YFD
Sbjct: 709 NAFSTFDVQMGVAAYHPVSFIAEGQHRNFLLAPLNSLRDESYTIYFD 755
>gi|326520888|dbj|BAJ92807.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 683
Score = 787 bits (2032), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/697 (57%), Positives = 483/697 (69%), Gaps = 43/697 (6%)
Query: 192 MWASTHNESLKEKMSAVVSALSACQKEI---GSGYLSAFPTEQFDRLEALIPVWAPYYTI 248
MWASTHN +L KMSAVV AL ACQ+ G+GYLSAFP E FDR EA+ PVWAPYYTI
Sbjct: 1 MWASTHNGTLAGKMSAVVDALHACQQAPANGGAGYLSAFPAEFFDRFEAIKPVWAPYYTI 60
Query: 249 HKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDV 308
HKI+ GLLDQYT A N +AL M M YF RV++VI+++SIERHW +LNEE GGMNDV
Sbjct: 61 HKIMQGLLDQYTVAGNGKALAMVVAMAGYFGERVRSVIQRHSIERHWTSLNEETGGMNDV 120
Query: 309 LYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGD 368
LY+L+ IT D +HL+LAHLFDKPCFLGLLA+QAD +S FH+NTHIPIV+G QMRYEVTGD
Sbjct: 121 LYQLYAITNDQRHLVLAHLFDKPCFLGLLAVQADSLSDFHANTHIPIVVGGQMRYEVTGD 180
Query: 369 QLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVS 417
L+KE H + GT++ F F DPKRLA L + EESCTTYNMLKVS
Sbjct: 181 PLYKEIATFFMNVVNSSHSYATGGTSVSEFWF--DPKRLAETLTTENEESCTTYNMLKVS 238
Query: 418 RHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSD 477
RHLFRWTKEIAYADYYER+L NGV IQRG +PGVMIY+LP PG SK SYH WGT D
Sbjct: 239 RHLFRWTKEIAYADYYERALINGVQSIQRGRDPGVMIYMLPQGPGRSKALSYHGWGTQYD 298
Query: 478 SFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSW 537
SFWCCYGTGIESFSKLGDSIYFEE+G P +Y++QYI S +W+S + V Q + P+ S
Sbjct: 299 SFWCCYGTGIESFSKLGDSIYFEEKGGKPALYLVQYIPSTFNWRSVGLTVTQTLKPLSSS 358
Query: 538 DPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSS 597
D L+V+L+ S+K +G ++N+RIP+W SSNGAKATLNG+DL + SPG FLSVTK W
Sbjct: 359 DQNLQVSLSISAKTNGQYATVNVRIPSWASSNGAKATLNGKDLTMASPGTFLSVTKQWGG 418
Query: 598 DDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITP 657
D L +QLP+ LRTEAI+DDRPEYAS+QA+L+GP++LAG + GDWD ++S+WIT
Sbjct: 419 GDHLALQLPIRLRTEAIKDDRPEYASLQAVLFGPFLLAGLTTGDWDAKTGGGAISEWITA 478
Query: 658 IPASYNSQLITFTQEYGNTKFVL----TNSNQSITMEKFPK-SGTDAALHATFRLILNDS 712
IPA+YNSQL+T TQE GN+ VL T S+TM+ P+ GTDAA+HATFRL+
Sbjct: 479 IPATYNSQLVTLTQESGNSTLVLSLLSTAKATSLTMQPRPEGGGTDAAVHATFRLVTQGQ 538
Query: 713 S----GSEFSSLNDFIG-KSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVA 767
G + N S ++EPFD PGM V +T S SS+F++V
Sbjct: 539 GTPPMGERRHATNATAALASAVIEPFDMPGMAVTNS-------LTLSAEKGPSSLFNVVP 591
Query: 768 GLDGGDRTVSLESETYKGCFVYTA---VNLQSSESTKLGCISESTEAGFNNAASFVIEKG 824
GLDG +VSLE GCF+ TA N+Q S AASF +
Sbjct: 592 GLDGQPGSVSLELGARPGCFLVTAGAKANVQVGCGGGGTGFSR-------QAASFARAEP 644
Query: 825 LSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFDF 861
L YHPISF AKGA R+FLL PL +LRDE YTVYF+
Sbjct: 645 LRRYHPISFAAKGARRSFLLEPLFTLRDEFYTVYFNL 681
>gi|357472921|ref|XP_003606745.1| hypothetical protein MTR_4g065150 [Medicago truncatula]
gi|355507800|gb|AES88942.1| hypothetical protein MTR_4g065150 [Medicago truncatula]
Length = 617
Score = 783 bits (2022), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/607 (62%), Positives = 471/607 (77%), Gaps = 30/607 (4%)
Query: 270 MTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFD 329
M TWMV+YFY+RV NVI KY++ RH+Q+LNEE GGMNDVLYKL+ +T D KHL+LAHLFD
Sbjct: 1 MVTWMVDYFYDRVVNVISKYTVNRHYQSLNEETGGMNDVLYKLYSVTGDSKHLLLAHLFD 60
Query: 330 KPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLE 378
KPCFLGLLA+QA+DI+ FH+NTHIPIV+GSQMRYEVTGD L++E H
Sbjct: 61 KPCFLGLLAVQANDIADFHANTHIPIVVGSQMRYEVTGDPLYREIGSFFMDIVNSSHSYA 120
Query: 379 SSGTNIGHFNFKSDPKRLASNLDSN-TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSL 437
+ GT++ F S+PKR+A NL + EESCTTYNMLKVSRHLFRWTKE+ YADYYER+L
Sbjct: 121 TGGTSVREF--WSNPKRIADNLGTTENEESCTTYNMLKVSRHLFRWTKEVTYADYYERAL 178
Query: 438 TNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSI 497
TNGVLGIQRGT+PGVMIY+LPL G SK ++ H WG P D+FWCCYGTGIESFSKLGDSI
Sbjct: 179 TNGVLGIQRGTDPGVMIYMLPLGIGVSKAKTGHSWGNPFDTFWCCYGTGIESFSKLGDSI 238
Query: 498 YFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSS-KGSGLTT 556
YFEEEG P +YIIQYISS +WKSG+ ++ Q V P S DPYLRVT TFSS + +G ++
Sbjct: 239 YFEEEGNSPSLYIIQYISSSFNWKSGKTLLTQTVVPAASSDPYLRVTFTFSSNEKTGTSS 298
Query: 557 SLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+LN R+P+W+ ++GAKA LN + L LP+PGNFLS+T+ WS+ DKLT+QLPL +RTEAI+D
Sbjct: 299 TLNFRVPSWSHADGAKAILNSEALSLPAPGNFLSITRQWSAGDKLTLQLPLIIRTEAIKD 358
Query: 617 DRPEYASIQAILYGPYVLAGHSIGDWDI-TESATSLSDWITPIPASYNSQLITFTQEYGN 675
DRPEYAS+QAILYGPY+LAGH+ +WDI ++ +++DWITPIP+SYNSQL++F+Q++
Sbjct: 359 DRPEYASVQAILYGPYLLAGHTTRNWDIKADTNKAVADWITPIPSSYNSQLVSFSQDFDQ 418
Query: 676 TKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFIGKSVMLEPFDS 735
+ FV+TNSNQS+TM+K P+ GTD AL ATFRLIL + + K+VMLEP D
Sbjct: 419 STFVITNSNQSLTMQKSPEPGTDVALQATFRLILKGA-----------VSKTVMLEPIDL 467
Query: 736 PGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYKGCFVYTAVNLQ 795
PGM+V E D L+V DS + SSVF +V GLDG ++T+SL+S++ K C+VY+ ++
Sbjct: 468 PGMIVSHQEPDQPLIVVDSSLGGPSSVFLVVPGLDGRNQTISLQSQSNKDCYVYS--DMS 525
Query: 796 SSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDESY 855
S KL C S+S EA FN AASFV KGL +YHPISFVAKG N+NFLL PL + RDE Y
Sbjct: 526 SGSGVKLRCKSDS-EASFNQAASFVSGKGLRQYHPISFVAKGGNQNFLLEPLFNFRDEHY 584
Query: 856 TVYFDFQ 862
TVYF+ Q
Sbjct: 585 TVYFNIQ 591
>gi|302818405|ref|XP_002990876.1| hypothetical protein SELMODRAFT_20509 [Selaginella moellendorffii]
gi|300141437|gb|EFJ08149.1| hypothetical protein SELMODRAFT_20509 [Selaginella moellendorffii]
Length = 755
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/772 (52%), Positives = 517/772 (66%), Gaps = 43/772 (5%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEP 173
FL+ VSLHDVRL DS AQQTNL+YLLMLDVD LV++FR TA L A G YGGWE P
Sbjct: 1 FLEAVSLHDVRLLPDSWQAIAQQTNLDYLLMLDVDNLVYSFRTTAGLNASGSAYGGWELP 60
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD 233
+ ELRGHFVGHYLSASA+ WASTHN ++ E M+AVV+AL+ CQ +IG+GYLSAFPT FD
Sbjct: 61 TSELRGHFVGHYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSAFPTSLFD 120
Query: 234 RLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIER 293
R EAL VWAPYYTIHKI+AGLLDQYTYA N+ A M M +YF +RV+ VI+KYSIER
Sbjct: 121 RFEALESVWAPYYTIHKIMAGLLDQYTYAANSLAFEMLLGMTDYFGSRVERVIEKYSIER 180
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHI 353
HWQ+LNEE GGMNDVLY+++ IT D KHL LAHLFDKPCFLGLLA++AD ISGFH+NTHI
Sbjct: 181 HWQSLNEETGGMNDVLYRVYQITGDAKHLKLAHLFDKPCFLGLLAVRADSISGFHANTHI 240
Query: 354 PIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDS 402
PIVIG+Q+RYEV GD+L+K+ H + GT+ G F SDP RL L +
Sbjct: 241 PIVIGAQLRYEVVGDKLYKDLSEYFMKIVSSSHTYATGGTSAG--EFWSDPSRLGDTLGT 298
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
EESCTTYNMLKV+R+LFRWTK++ YAD+YER+L NGVL IQRG EPGVMIY+LPLAPG
Sbjct: 299 ENEESCTTYNMLKVARNLFRWTKQMHYADFYERALINGVLTIQRGKEPGVMIYMLPLAPG 358
Query: 463 SSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK-YPGVYIIQYISSRLDWK 521
SSK SYH WGTP SFWCCYGT IESFSKLGDSIYF +E + P +Y+IQY+SS++ W
Sbjct: 359 SSKATSYHGWGTPFSSFWCCYGTAIESFSKLGDSIYFTDEVQDTPQLYVIQYLSSKVLWT 418
Query: 522 SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT--SLNLRIPTWTSSNGAKATLNGQD 579
+ + V+Q+V + S DP + VT F+ G T+ L++R+P W S ++ LNG +
Sbjct: 419 AAGLSVDQRVYHMTSTDPVMTVTFNFTQLVLGKTSEAKLSVRVPYWAQS--SRCLLNGLE 476
Query: 580 LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSI 639
L +PG F V++ W + DKL+ LR E IQD+R +Y+S+ AI YGPY+LAG S
Sbjct: 477 LQNLTPGTFFDVSREWKTGDKLSFTFSAMLRLEKIQDERSKYSSLYAIYYGPYLLAGMSD 536
Query: 640 GDWDI-TESATSLSDWITPIPASYNSQLITFTQ-EYGNTKFVLTNSNQSITMEKFPKSGT 697
G++ + + + ++ S WI P+ +S L +FTQ + G +++ +S+ +++M P+ G+
Sbjct: 537 GNYKLGSVNVSTPSRWIKPVR---DSNLFSFTQLQQGKLQYLAASSDGALSMISKPQHGS 593
Query: 698 DAALHATFRLILNDSSGS-EFSSLND----FIGKSVMLEPFDSPGMLVIQHETDDELVVT 752
+ A ATFRL L S + E + D + + V LE + PG V +D + +T
Sbjct: 594 EEAPLATFRLKLLPSLKTIEKFQVKDVTSLLLDREVSLELLNRPGRFVTHFGIEDGVRLT 653
Query: 753 DS---FIAQGSSVFHLVAGLDGGDRTVSLESETYKGCFVYTAVNLQSSESTKLGCISEST 809
+ SSVF L + L G +S E+ +GCF+ + L C
Sbjct: 654 NGKSSGFPSSSSVFKLRSALSGHPGEISFEASGIQGCFL-----VAQGRDITLEC----- 703
Query: 810 EAGFNN-AASFVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFD 860
FN AASF + G + YHP+SF A G N +L+ PL S DE Y VYF+
Sbjct: 704 -ERFNKMAASFGVTAGRASYHPMSFEAYGDNDTYLMFPLSSYSDEKYAVYFE 754
>gi|302785087|ref|XP_002974315.1| hypothetical protein SELMODRAFT_30650 [Selaginella moellendorffii]
gi|300157913|gb|EFJ24537.1| hypothetical protein SELMODRAFT_30650 [Selaginella moellendorffii]
Length = 755
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/772 (51%), Positives = 516/772 (66%), Gaps = 43/772 (5%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEP 173
FL VSLHDVRL DS AQQTNL+YLLMLDVD LV++FR TA L A G YGGWE P
Sbjct: 1 FLGAVSLHDVRLLPDSWQAIAQQTNLDYLLMLDVDNLVYSFRTTAGLNASGSAYGGWELP 60
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD 233
+ ELRGHFVGHYLSASA+ WASTHN ++ E M+AVV+AL+ CQ +IG+GYLSAFPT FD
Sbjct: 61 TSELRGHFVGHYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSAFPTSLFD 120
Query: 234 RLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIER 293
R EAL VWAPYYTIHKI+AGLLDQYTYA N+ A M M +YF +RV+ VI+KYSIER
Sbjct: 121 RFEALESVWAPYYTIHKIMAGLLDQYTYAANSLAFEMLLGMTDYFGSRVEMVIEKYSIER 180
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHI 353
HWQ+LNEE GGMNDVLY+++ IT D KHL LAHLFDKPCFLGLLA++AD ISGFH+NTHI
Sbjct: 181 HWQSLNEETGGMNDVLYRIYQITGDAKHLKLAHLFDKPCFLGLLAVRADSISGFHANTHI 240
Query: 354 PIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDS 402
PIVIG+Q+RYEV GD+L+K+ H + GT+ G F S+P RL L +
Sbjct: 241 PIVIGAQLRYEVVGDKLYKDLSEYFMKIVSSSHTYATGGTSSGE--FWSNPNRLGDTLGT 298
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
EESCTTYNMLKV+R+LFRWTK++ YAD+YER+L NGVL IQRG EPGVMIY+LPLAPG
Sbjct: 299 ENEESCTTYNMLKVARNLFRWTKQMHYADFYERALINGVLTIQRGKEPGVMIYMLPLAPG 358
Query: 463 SSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK-YPGVYIIQYISSRLDWK 521
SSK +SYH WGTP SFWCCYGT IESFSKLGDSIYF E + P +Y+IQY+SS++ W
Sbjct: 359 SSKAKSYHGWGTPFTSFWCCYGTAIESFSKLGDSIYFTNEVQDTPQLYVIQYLSSKVLWT 418
Query: 522 SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT--SLNLRIPTWTSSNGAKATLNGQD 579
+ + ++Q+V + S DP + VT F+ G T+ L++R+P W S ++ LNG +
Sbjct: 419 AAGLSLDQRVYHMTSTDPVMTVTFNFTQLVLGKTSEAKLSVRVPYWAQS--SRCLLNGLE 476
Query: 580 LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSI 639
L +PG F V++ W + DKL+ LR E IQD+R +Y+S+ AI YGPY+LAG S
Sbjct: 477 LQNLTPGTFFDVSREWKTGDKLSFTFSAMLRLEKIQDERSKYSSLYAIYYGPYLLAGMSD 536
Query: 640 GDWDI-TESATSLSDWITPIPASYNSQLITFTQ-EYGNTKFVLTNSNQSITMEKFPKSGT 697
G++ + + + ++ S WI P+ +S L +FTQ + G +++ +S+ +++M P+ G+
Sbjct: 537 GNYKLGSVNVSTPSRWIKPVR---DSNLFSFTQLQQGKLQYLAASSDGALSMISKPQHGS 593
Query: 698 DAALHATFRLILNDSSGS-EFSSLND----FIGKSVMLEPFDSPGMLVIQHETDDELVVT 752
+ A ATFRL L S + E + D + + V LE + PG V +D + +T
Sbjct: 594 EEASLATFRLKLLPSLKTIEKIQVKDVTSLLLDREVSLELLNRPGRFVTYFGIEDGVRLT 653
Query: 753 DS---FIAQGSSVFHLVAGLDGGDRTVSLESETYKGCFVYTAVNLQSSESTKLGCISEST 809
+ SSVF L + L G +S E+ +GCF+ + L C
Sbjct: 654 NGKSSGFPSSSSVFKLRSALSGHPGEISFEASGIQGCFL-----VAQGRDITLEC----- 703
Query: 810 EAGFNN-AASFVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFD 860
FN AASF + G + YHP+SF A G N +L+ PL S DE Y VYF+
Sbjct: 704 -ERFNKMAASFGVTTGRASYHPMSFEAYGGNDTYLMFPLSSYSDEKYAVYFE 754
>gi|302788790|ref|XP_002976164.1| hypothetical protein SELMODRAFT_104205 [Selaginella moellendorffii]
gi|300156440|gb|EFJ23069.1| hypothetical protein SELMODRAFT_104205 [Selaginella moellendorffii]
Length = 797
Score = 749 bits (1935), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/802 (49%), Positives = 512/802 (63%), Gaps = 59/802 (7%)
Query: 102 PGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLP 161
P F L+ SLH VR+ +DS+ + QQTNLEYLLMLDVD L ++FR + LP
Sbjct: 10 PASFAAAASKIHLLEGSSLHKVRIDADSLQGKGQQTNLEYLLMLDVDSLAYSFRNNSGLP 69
Query: 162 APGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS 221
G PYGGWE P ELRGHFVGHYLSA+A MWASTHNE LK +M +V L CQ++IG+
Sbjct: 70 TKGVPYGGWEAPDQELRGHFVGHYLSATAKMWASTHNEELKRRMDHLVDILDECQQKIGT 129
Query: 222 GYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNR 281
GYLSAFP F R E PVWAPYYTIHKI+AGLLDQYT A N +ALRM WM +YF R
Sbjct: 130 GYLSAFPLNLFTRFETYRPVWAPYYTIHKIMAGLLDQYTEAGNMKALRMVIWMAQYFSKR 189
Query: 282 VQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQA 341
V+N I+KYSI+ H+Q LNEE GGMNDVLY L+ IT DP+HL LAHLFDKPCFLG LALQ
Sbjct: 190 VENYIEKYSIQAHFQALNEETGGMNDVLYDLYKITGDPQHLKLAHLFDKPCFLGPLALQQ 249
Query: 342 DDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFK 390
D +SGFH+NTHIPI+IG+Q RYE+TGDQ+ KE H+ + GT+ F
Sbjct: 250 DTLSGFHANTHIPILIGAQKRYELTGDQVSKELVTFFMDAVNSSHRFVTGGTSDN--EFW 307
Query: 391 SDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
DP R+AS+L + EESC++YNMLK++R+LFRWTKE +Y DYYER + NGVL IQRG EP
Sbjct: 308 KDPNRMASSLGKDVEESCSSYNMLKIARNLFRWTKEASYMDYYERLILNGVLTIQRG-EP 366
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEG------- 503
GVMIY+LP+ PG +K S WG P DSFWCCYGTGIESFSK GDSIYFE+ G
Sbjct: 367 GVMIYMLPMGPGMAKTSSTMGWGDPFDSFWCCYGTGIESFSKFGDSIYFEDYGVRDENPG 426
Query: 504 ---KYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTF----------SSK 550
P +Y+ Q++ S L+W S +++ Q V P+ S+DP + VT+ +S
Sbjct: 427 AQRPIPALYVAQFVPSTLEWDSAGLILKQTVKPLTSFDPVMEVTIHLHENPKATIEETSP 486
Query: 551 GSGLTTSLNLRIPTWTSSNGAKATLNG--QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLT 608
L +L +RIP+W +S G +A N QD+ +PG+FL++ + W + D+LT + P
Sbjct: 487 YHKLINTLYVRIPSWVAS-GYEAYFNDEPQDI---TPGSFLAIQREWKAGDRLTFKFPAE 542
Query: 609 LRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESAT-SLSDWITPIPASYNSQLI 667
+R E IQDDR E+ S+ I++GP+VLAG S G++D+ T S SDWITP+ S N L
Sbjct: 543 VRLEHIQDDREEHQSLNGIMFGPFVLAGLSHGEFDLGPVDTSSPSDWITPVNPSDNDLLY 602
Query: 668 TFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFIGKS 727
TF + L + ++++T++ +GTD ATF++I + S S + +G+
Sbjct: 603 TFRM----GDYQLGHKHRTVTIDSASTNGTDWDFQATFKVISSSSPSLAASKHSGLVGRV 658
Query: 728 VMLEPFDSPGMLVIQHETDDELVVTDS--------FIAQGSSVFHLVAGLDGGDRTVSLE 779
V LE D PG ++ + LVV D+ +++Q + F +V GL DR VS E
Sbjct: 659 VSLELMDQPGRIIAHSGINKNLVVVDTSQFADSTNYLSQANLGFKVVPGL-ASDRLVSFE 717
Query: 780 SETYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGAN 839
S+ GC++Y +L C S+ + GF+ ASF + +GL YHP+SFVA
Sbjct: 718 SQDLPGCYIYVD---DWRVPAQLKCRSKEND-GFDAKASFKVSQGLRSYHPLSFVATSQG 773
Query: 840 -RNFLLAPLLSLRDESYTVYFD 860
RNFLL P L+ RDE Y +YFD
Sbjct: 774 LRNFLLFPQLAYRDEHYAIYFD 795
>gi|302769588|ref|XP_002968213.1| hypothetical protein SELMODRAFT_89765 [Selaginella moellendorffii]
gi|300163857|gb|EFJ30467.1| hypothetical protein SELMODRAFT_89765 [Selaginella moellendorffii]
Length = 797
Score = 748 bits (1930), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/802 (49%), Positives = 511/802 (63%), Gaps = 59/802 (7%)
Query: 102 PGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLP 161
P F L+ SLH VR+ +DS+ + QQTNLEYLLMLDVD L ++FR + LP
Sbjct: 10 PASFAAAASKIHLLEGSSLHKVRIDADSLQGKGQQTNLEYLLMLDVDSLAYSFRNNSGLP 69
Query: 162 APGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS 221
G PYGGWE P ELRGHFVGHYLSA+A MWASTHNE LK +M +V L CQ++IG+
Sbjct: 70 TKGVPYGGWEAPDQELRGHFVGHYLSATAKMWASTHNEELKRRMDHLVDILDECQQKIGT 129
Query: 222 GYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNR 281
GYLSAFP F R E PVWAPYYTIHKI+AGLLDQYT A N +ALRM WM +YF R
Sbjct: 130 GYLSAFPLNLFTRFETYRPVWAPYYTIHKIMAGLLDQYTEAGNMKALRMVIWMAQYFSKR 189
Query: 282 VQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQA 341
V+N I+KYSI+ H+Q LNEE GGMNDVLY L+ IT DP+HL LAHLFDKPCFLG LALQ
Sbjct: 190 VENYIEKYSIQAHFQALNEETGGMNDVLYDLYKITGDPQHLKLAHLFDKPCFLGPLALQQ 249
Query: 342 DDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFK 390
D +SGFH+NTHIPI+IG+Q RYE+TGDQ+ KE H+ + GT+ F
Sbjct: 250 DTLSGFHANTHIPILIGAQKRYELTGDQVSKELVTFFMDAVNSSHRFVTGGTSDN--EFW 307
Query: 391 SDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
DP R+AS+L + EESC++YNMLK++R+LFRWTK+ +Y DYYER + NGVL IQRG EP
Sbjct: 308 KDPNRMASSLGKDVEESCSSYNMLKIARNLFRWTKDASYMDYYERLILNGVLTIQRG-EP 366
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEG------- 503
GVMIY+LP+ PG +K S WG P DSFWCCYGTGIESFSK GDSIYFE+ G
Sbjct: 367 GVMIYMLPMGPGMAKTSSTMGWGDPFDSFWCCYGTGIESFSKFGDSIYFEDYGVRDENPG 426
Query: 504 ---KYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTF----------SSK 550
P +Y+ Q++ S L+W S +++ Q V P+ S+DP + VT+ +S
Sbjct: 427 AQRPIPALYVAQFVPSTLEWDSAGLILKQTVKPLTSFDPVMEVTIHLHENPKATIEETSP 486
Query: 551 GSGLTTSLNLRIPTWTSSNGAKATLNG--QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLT 608
L +L +RIP+W +S G +A N QD+ +PG+FL++ + W + DKLT + P
Sbjct: 487 YHKLINTLYVRIPSWVAS-GYEAYFNDEPQDI---TPGSFLAIQREWKAGDKLTFKFPAE 542
Query: 609 LRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESAT-SLSDWITPIPASYNSQLI 667
+R E IQDDR E+ S+ I++GP+VLAG S G++D+ T S SDWITP+ S N L
Sbjct: 543 VRLEHIQDDREEHQSLNGIMFGPFVLAGLSHGEFDLGPVDTSSPSDWITPVNPSDNDLLY 602
Query: 668 TFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFIGKS 727
TF + L + ++++T++ +GTD ATF++I + S S + +G+
Sbjct: 603 TFRM----GDYQLGHKHRTVTLDSASTNGTDWDFEATFKVISSSSPSLAASKHSGLVGRV 658
Query: 728 VMLEPFDSPGMLVIQHETDDELVVTDS--------FIAQGSSVFHLVAGLDGGDRTVSLE 779
V LE D PG ++ + LVV D+ +++Q + F +V GL DR VS E
Sbjct: 659 VSLELLDQPGRIIAHSGINKNLVVVDTSQFADSTNYLSQANLGFKVVPGL-ASDRLVSFE 717
Query: 780 SETYKGCFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGAN 839
S+ GC++Y +L C S+ + GF+ ASF +GL YHP+SFVA
Sbjct: 718 SQDLPGCYIYVD---DWRVPAQLKCRSKEND-GFDAKASFKASQGLRSYHPLSFVATSQG 773
Query: 840 -RNFLLAPLLSLRDESYTVYFD 860
RNFLL P L+ RDE Y +YFD
Sbjct: 774 LRNFLLFPQLAYRDEHYAIYFD 795
>gi|297606169|ref|NP_001058067.2| Os06g0612900 [Oryza sativa Japonica Group]
gi|255677223|dbj|BAF19981.2| Os06g0612900 [Oryza sativa Japonica Group]
Length = 717
Score = 717 bits (1850), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/729 (53%), Positives = 484/729 (66%), Gaps = 69/729 (9%)
Query: 192 MWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHK- 250
MWASTHN +L KM+AVV AL CQ G+GYLSAFP E FDR EA+ PVWAPYYTIHK
Sbjct: 1 MWASTHNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKA 60
Query: 251 -------------------------ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNV 285
I+ GLLDQ+T A N +AL M M +YF RV++V
Sbjct: 61 RNATQSICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGKALGMVVAMADYFAGRVRSV 120
Query: 286 IKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDIS 345
I++Y+IERHW +LNEE GGMNDVLY+L+ IT+D +HL+LAHLFDKPCFLGLLA+QAD +S
Sbjct: 121 IQRYTIERHWTSLNEETGGMNDVLYQLYTITKDQRHLVLAHLFDKPCFLGLLAVQADSLS 180
Query: 346 GFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPK 394
GFH+NTHIP+VIG QMRYEVTGD L+KE H + GT++ F S+PK
Sbjct: 181 GFHANTHIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVSEF--WSNPK 238
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI 454
LA L + TEESCTTYNMLKVSRHLFRWTKEIAYADYYER+L NGVL IQRG +PGVMI
Sbjct: 239 HLAEALTTETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMI 298
Query: 455 YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
Y+LP PG SK SYH WGT +SFWCCYGTGIESFSKLGDSIYFE++G PG+YIIQYI
Sbjct: 299 YMLPQGPGRSKAVSYHGWGTQYNSFWCCYGTGIESFSKLGDSIYFEQKGDKPGLYIIQYI 358
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFS-SKGSGLTTSLNLRIPTWTSSNGAKA 573
S +W++ + V Q+V P+ S D YL+V+L+ S +K +G +LN+RIP+WTS NGAKA
Sbjct: 359 PSTFNWRTAGLTVTQQVKPLSSSDQYLQVSLSISAAKTNGQYATLNVRIPSWTSMNGAKA 418
Query: 574 TLNGQDLPLPSPGNFLSVTKTW-SSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
TLN +DL L SPG FL+++K W S DD L +Q P+ LRTEAI+DDRP+ AS+ AIL+GP+
Sbjct: 419 TLNDKDLQLASPGTFLTISKQWDSGDDHLLLQFPINLRTEAIKDDRPQVASLNAILFGPF 478
Query: 633 VLAGHSIGDWD--ITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQ-SITM 689
+LAG + GDWD +AT+ SDWITP+PASYNSQL+T TQE G +L+ N S+ M
Sbjct: 479 LLAGLTTGDWDAKTGGAATAASDWITPVPASYNSQLVTLTQESGGKTMLLSTVNDTSLAM 538
Query: 690 EKFPK--SGTDAALHATFRLILNDSSG--------SEFSSLNDFIGKSVMLEPFDSPGML 739
+ P+ GTDAA+ ATFR++ S + +EPF PG
Sbjct: 539 LERPEGAGGTDAAVRATFRVVPPGSRAELRQRAGAGAGEGAARLKVAAATIEPFGLPGTA 598
Query: 740 VIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYKGCFVYTAVNLQSSES 799
V + L V + + S++F++ GLDG +VSLE + GCF+ +
Sbjct: 599 V-----SNGLAVVRAGNSS-STLFNVAPGLDGKPGSVSLELGSKPGCFLVAGAGAK---- 648
Query: 800 TKLGCISE-----STEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDES 854
+GC + + AGF AASF + L YH ISF A G R+FLL PL +LRDE
Sbjct: 649 VHVGCRTRGGAAAAAAAGFEQAASFAQAEPLRRYHAISFFASGVRRSFLLEPLFTLRDEF 708
Query: 855 YTVYFDFQS 863
YT+YF+ +
Sbjct: 709 YTIYFNLAA 717
>gi|357472933|ref|XP_003606751.1| hypothetical protein MTR_4g065220 [Medicago truncatula]
gi|355507806|gb|AES88948.1| hypothetical protein MTR_4g065220 [Medicago truncatula]
Length = 593
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/677 (52%), Positives = 450/677 (66%), Gaps = 104/677 (15%)
Query: 205 MSAVVSALSACQKEIGSGYLSAFPTEQF-DRLEALIPVWAPYYTIHKIL------AGLLD 257
MSA+VS LSACQ++ +G F L+ L WAPYYTIHK+ LD
Sbjct: 1 MSALVSGLSACQEKNWNGISVCISNRVFLIELKNLEYAWAPYYTIHKLFDFDRSWLAFLD 60
Query: 258 QYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQ 317
QYT A N + L+M TWMV+YFYNRV NVI+K+++ RH+Q+LNEEAGGMND+LY+L+ +T+
Sbjct: 61 QYTIAGNPQGLKMVTWMVDYFYNRVMNVIQKFTVNRHYQSLNEEAGGMNDLLYRLYSLTR 120
Query: 318 DPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE---- 373
DPKHL LAHLFDKPCFLG+LA+Q +DI+ FH+NTHIPIV+G+Q+RYE+TGD +K+
Sbjct: 121 DPKHLELAHLFDKPCFLGVLAVQGNDIADFHANTHIPIVVGAQLRYELTGDLHYKDIGQY 180
Query: 374 -------GHQLESSGTNIGHFNFKSDPKRLASNLDS-NTEESCTTYNMLKVSRHLFRWTK 425
H + GT++G F +PKR+A NL S TEESC+TYNMLKVSRHLFRWTK
Sbjct: 181 FMDIVNSSHAYATGGTSVGEF--WRNPKRIADNLKSAETEESCSTYNMLKVSRHLFRWTK 238
Query: 426 EIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGT 485
E+ YADYYER+LTNGVL IQRGT+PGVMIY+LPL G SK ++Y WGTP DSFWCCYGT
Sbjct: 239 EVTYADYYERALTNGVLSIQRGTDPGVMIYMLPLGLGVSKAQTYWKWGTPFDSFWCCYGT 298
Query: 486 GIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTL 545
GIESFSKLGDSIYFEEEGK+ +YIIQYISS +W SG +
Sbjct: 299 GIESFSKLGDSIYFEEEGKHRSLYIIQYISSSFNWNSGTAI------------------- 339
Query: 546 TFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQL 605
G +++LN RIP+WT +NGAKA LN + LPLP+P
Sbjct: 340 -------GTSSTLNFRIPSWTLANGAKALLNSETLPLPAP-------------------- 372
Query: 606 PLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ 665
DDRPE+AS+QAILYGPY+LAGH+ ++WITPIP++Y+SQ
Sbjct: 373 ----------DDRPEFASLQAILYGPYLLAGHT-------------TNWITPIPSNYSSQ 409
Query: 666 LITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFIG 725
L++++Q+ + V+TNS QS+TME P GT+ A HATFRLI D+ G
Sbjct: 410 LVSYSQDINKSTLVITNSKQSLTMEILPGPGTENAPHATFRLIPKDAD-----------G 458
Query: 726 KSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYKG 785
K+VMLEPFD PGM V + L++ DS SSVF +V GLDG ++T+SLES++ K
Sbjct: 459 KTVMLEPFDLPGMTVSHQGPEKPLIIVDSSHGGPSSVFLVVPGLDGRNQTISLESQSNKD 518
Query: 786 CFVYTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLA 845
C+V++ ++ + KL C S S E FN A SFV KGL +Y+PISFVAKGAN+NFLL
Sbjct: 519 CYVHS--DMSAGSGVKLVCKSAS-ETSFNQANSFVSGKGLRQYNPISFVAKGANQNFLLE 575
Query: 846 PLLSLRDESYTVYFDFQ 862
PL + RDE YTVYF+ Q
Sbjct: 576 PLFNFRDEHYTVYFNLQ 592
>gi|449522353|ref|XP_004168191.1| PREDICTED: uncharacterized protein LOC101224273 [Cucumis sativus]
Length = 495
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 309/491 (62%), Positives = 380/491 (77%), Gaps = 5/491 (1%)
Query: 375 HQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYE 434
H + GT++ F DPKRLA L + TEESCTTYNMLKVSR+LF+WTKEIAYADYYE
Sbjct: 8 HSYATGGTSV--HEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTKEIAYADYYE 65
Query: 435 RSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLG 494
R+LTNGVL IQRGT+PGVMIY+LPL GSSK SYH WGTP +SFWCCYGTGIESFSKLG
Sbjct: 66 RALTNGVLSIQRGTDPGVMIYMLPLGSGSSKAISYHGWGTPFESFWCCYGTGIESFSKLG 125
Query: 495 DSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGL 554
DSIYFEEE + P +Y+IQYISS LDWKSG +++NQ VDP+ S DP LR+TLTFS KGS
Sbjct: 126 DSIYFEEELQTPTLYVIQYISSSLDWKSGNVLLNQTVDPIHSEDPKLRMTLTFSPKGSVH 185
Query: 555 TTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
++++NLRIP+WTS++GAK LNGQ L GNF SVT +WSS +KL+++LP+ LRTEAI
Sbjct: 186 SSTINLRIPSWTSASGAKVVLNGQSLGNNINGNFKSVTNSWSSGNKLSLELPINLRTEAI 245
Query: 615 QDDRPEYASIQAILYGPYVLAGHSIGDWDI-TESATSLSDWITPIPASYNSQLITFTQEY 673
DDR EYAS++AIL+GPY+LA +S GDW+I T+ A SLSDWIT +P++YN+ L+TF+Q
Sbjct: 246 DDDRSEYASVKAILFGPYLLAAYSNGDWEIKTQQADSLSDWITHVPSAYNTFLVTFSQAS 305
Query: 674 GNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFIGKSVMLEPF 733
G T F LTNSNQSITMEK+P GTD+A+HATFRLI++D S ++ + L D IGK VMLEPF
Sbjct: 306 GKTSFALTNSNQSITMEKYPGQGTDSAVHATFRLIIDDPS-AKVTELQDVIGKRVMLEPF 364
Query: 734 DSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYKGCFVYTAVN 793
PGM++ D+ L + D+ SS F+LV GLDG + TVSL S +GCFVY+ VN
Sbjct: 365 SFPGMVLGNKGKDERLEIADANSEGHSSDFYLVEGLDGKNGTVSLASIDNEGCFVYSGVN 424
Query: 794 LQSSESTKLGCISE-STEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRD 852
+S KL C S+ S + GF+ A+SF++E G S+YHPISFV KG RNFLLAPLLS D
Sbjct: 425 YESGAQLKLSCKSKLSLDDGFDEASSFLLESGASQYHPISFVTKGMTRNFLLAPLLSFVD 484
Query: 853 ESYTVYFDFQS 863
ESYTVYF+F +
Sbjct: 485 ESYTVYFNFNA 495
>gi|125556048|gb|EAZ01654.1| hypothetical protein OsI_23690 [Oryza sativa Indica Group]
Length = 466
Score = 560 bits (1443), Expect = e-156, Method: Compositional matrix adjust.
Identities = 281/463 (60%), Positives = 337/463 (72%), Gaps = 40/463 (8%)
Query: 192 MWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHK- 250
MWASTHN +L KM+AVV AL CQ G+GYLSAFP E FDR EA+ PVWAPYYTIHK
Sbjct: 1 MWASTHNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKA 60
Query: 251 -------------------------ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNV 285
I+ GLLDQ+T A N AL M M +YF RV++V
Sbjct: 61 RNATQSICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGRALGMVVAMADYFAGRVRSV 120
Query: 286 IKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDIS 345
I++Y+IERHW +LNEE GGMNDVLY+L+ IT+D +HL+LAHLFDKPCFLGLLA+QAD +S
Sbjct: 121 IQRYTIERHWTSLNEETGGMNDVLYQLYTITKDQRHLVLAHLFDKPCFLGLLAVQADSLS 180
Query: 346 GFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPK 394
GFH+NTHIP+VIG QMRYEVTGD L+KE H + GT++ F S+PK
Sbjct: 181 GFHANTHIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVS--EFWSNPK 238
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI 454
LA L + TEESCTTYNMLKVSRHLFRWTKEIAYADYYER+L NGVL IQRG +PGVMI
Sbjct: 239 HLAEALTTETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMI 298
Query: 455 YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
Y+LP PG SK SYH WGT +SFWCCYGTGIESFSKLGDSIYFE++G PG+YIIQYI
Sbjct: 299 YMLPQGPGRSKAVSYHGWGTQYNSFWCCYGTGIESFSKLGDSIYFEQKGDKPGLYIIQYI 358
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFS-SKGSGLTTSLNLRIPTWTSSNGAKA 573
S +W++ + V Q+V P+ S D YL+V+L+ S +K +G +LN+RIP+WTS NGAKA
Sbjct: 359 PSTFNWRTAGLTVTQQVKPLSSSDQYLQVSLSISAAKTNGQYATLNVRIPSWTSMNGAKA 418
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
TLN +DL L SPG FL+++K W S D L +Q P+ LRTEAI+D
Sbjct: 419 TLNDKDLQLASPGTFLTISKQWDSGDHLLLQFPINLRTEAIKD 461
>gi|413926260|gb|AFW66192.1| hypothetical protein ZEAMMB73_605676 [Zea mays]
gi|413952504|gb|AFW85153.1| hypothetical protein ZEAMMB73_422486 [Zea mays]
Length = 510
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 278/519 (53%), Positives = 354/519 (68%), Gaps = 27/519 (5%)
Query: 361 MRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCT 409
MRYEVTGD L+K+ H + GT+ G F +DPKRLA L + EESCT
Sbjct: 1 MRYEVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEF--WTDPKRLAGTLSTENEESCT 58
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSY 469
TYNMLKVSR+LFRWTKEIAYADYYER+L NGVL IQRGT+PGVMIY+LP APG SK SY
Sbjct: 59 TYNMLKVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSY 118
Query: 470 HHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQ 529
H WGT DSFWCCYGTGIESFSKLGDSIYFEE+G P + IIQYI S +WK+ + V Q
Sbjct: 119 HGWGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQ 178
Query: 530 KVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFL 589
++ + S D YL+++ + S+ SG T ++N RIP+WT ++GA ATLNG+DL SPG+FL
Sbjct: 179 QIKTLSSSDQYLQISFSISANTSGQTANINFRIPSWTFADGAGATLNGKDLGSISPGSFL 238
Query: 590 SVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE-SA 648
S+TK W+SDD L + P+ LRTEAI+DDR EYAS+QA+L+GP+VLAG S GDWD +
Sbjct: 239 SITKQWNSDDHLALHFPIRLRTEAIKDDRLEYASLQAVLFGPFVLAGLSTGDWDAKAGNG 298
Query: 649 TSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPK-SGTDAALHATFRL 707
+++SDWI +P ++NSQL+TFTQ FVL+++N ++TM++ P+ GTDAA+HATFR
Sbjct: 299 SAISDWIAAVPPAHNSQLVTFTQVSNGKAFVLSSANGTLTMQERPEVDGTDAAIHATFRA 358
Query: 708 ILNDSSGSEFSSL--NDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHL 765
+ S +E + G S++LEPFD PG ++ + T +D S+F++
Sbjct: 359 HPQEDS-TELHDIYSTTLTGTSILLEPFDLPGTVITNNLTLSAQKSSD-------SLFNI 410
Query: 766 VAGLDGGDRTVSLESETYKGCFVYTAVNLQSSESTKLGCIS--ESTEAGFNNAASFVIEK 823
V GLDG +VSLE T GCF+ T N + ++ C S ES AASF
Sbjct: 411 VPGLDGNPNSVSLELGTKPGCFLVTGTNYSAGTRIEVNCKSSLESIGGILEQAASFSQTD 470
Query: 824 GLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFDFQ 862
L +YHPISFVAKG RNFLL PL SLRDE YTVYF+ +
Sbjct: 471 PLRQYHPISFVAKGVARNFLLEPLYSLRDEFYTVYFNVR 509
>gi|449531121|ref|XP_004172536.1| PREDICTED: uncharacterized LOC101224273, partial [Cucumis sativus]
Length = 366
Score = 507 bits (1306), Expect = e-141, Method: Compositional matrix adjust.
Identities = 239/346 (69%), Positives = 290/346 (83%), Gaps = 7/346 (2%)
Query: 27 KECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDSAWLSLMPRKILREEEQ 86
KECTN +L SHTFR LLSS N ++ K++ SH HLTP+DD AW +L+PRK+L+EE +
Sbjct: 28 KECTNTPTQLGSHTFRYELLSSGNVTWKKELFSHY-HLTPTDDFAWSNLLPRKMLKEENE 86
Query: 87 DELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLD 146
++W M+YR++KN ++P G LKE+SLHDVRL +S+H AQ TNL+YLLMLD
Sbjct: 87 ---YNWEMMYRQMKNKDGLRIP---GGMLKEISLHDVRLDPNSLHGTAQTTNLKYLLMLD 140
Query: 147 VDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMS 206
VD+L+W+FRKTA LP PGEPY GWE+ CELRGHFVGHYLSASA MWAST N LKEKMS
Sbjct: 141 VDRLLWSFRKTAGLPTPGEPYVGWEKSDCELRGHFVGHYLSASAQMWASTGNSVLKEKMS 200
Query: 207 AVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAE 266
A+VS L+ CQ ++G+GYLSAFP+E+FDR EA+ PVWAPYYTIHKILAGLLDQYT+A N++
Sbjct: 201 ALVSGLATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAGLLDQYTFAGNSQ 260
Query: 267 ALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAH 326
AL+M TWMVEYFYNRVQNVI KY++ERH+++LNEE GGMNDVLY+L+ IT + KHL+LAH
Sbjct: 261 ALKMVTWMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYRITGNTKHLLLAH 320
Query: 327 LFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK 372
LFDKPCFLGLLA+QA+DISGFH NTHIPIV+GSQMRYEVTGD L+K
Sbjct: 321 LFDKPCFLGLLAVQAEDISGFHVNTHIPIVVGSQMRYEVTGDPLYK 366
>gi|413954825|gb|AFW87474.1| hypothetical protein ZEAMMB73_309562 [Zea mays]
Length = 483
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 261/497 (52%), Positives = 335/497 (67%), Gaps = 33/497 (6%)
Query: 375 HQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYE 434
H + GT++ F S+PKRLA L + TEESCTTYNMLKVSRHLFRWTKEIAYADYYE
Sbjct: 8 HAYATGGTSVSEF--WSNPKRLAEALTTETEESCTTYNMLKVSRHLFRWTKEIAYADYYE 65
Query: 435 RSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLG 494
R+L NGVL IQRG +PGVMIY+LP PG SK +SYH WGT +SFWCCYGTGIESFSKLG
Sbjct: 66 RALINGVLSIQRGRDPGVMIYMLPQGPGRSKAKSYHGWGTQYESFWCCYGTGIESFSKLG 125
Query: 495 DSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGS-G 553
DSIYFEE G+ P +Y++Q+I S W++ + V Q++ P+ S D YL+V+ + S+K + G
Sbjct: 126 DSIYFEERGERPALYVVQFIPSTFSWRTAGLTVAQQLMPLSSSDQYLQVSFSVSAKTTNG 185
Query: 554 LTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEA 613
+LN+RIP+WTS NGAKATLNG+ L L SPG FL+++K W S D+L++QLP+ LRTEA
Sbjct: 186 QFATLNVRIPSWTSLNGAKATLNGKHLELASPGTFLTISKQWGSGDQLSLQLPIHLRTEA 245
Query: 614 IQDDRPEYASIQAILYGPYVLAGHSIGDWDITES--ATSLSDWITPIPASYNSQLITFTQ 671
I+DDRPEYASIQA+L+GP++LAG + GDWD + SDWITP+P NSQL+T Q
Sbjct: 246 IKDDRPEYASIQAVLFGPFLLAGLTTGDWDAKTGAADAAASDWITPVPVESNSQLVTLAQ 305
Query: 672 EYGNTKFVLTNSNQSITMEKFPK--SGTDAALHATFRLILNDSSGSEFSSLNDFIGKSVM 729
E G FVL+ N S+TM + PK GT+AA+HATFRL+ +G+ + M
Sbjct: 306 ESGGEAFVLSALNGSLTMLQRPKDGGGTEAAVHATFRLVPQGGAGAG---------AAAM 356
Query: 730 LEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYKGCFVY 789
LEP D PGM+V D L V + F++V GL G +VSLE + GCF+
Sbjct: 357 LEPLDMPGMVVT-----DRLTVAAE--KSSGAAFNVVPGLAGAPGSVSLELASRPGCFL- 408
Query: 790 TAVNLQSSESTKLGCISESTE-----AGFNNAASFVIEKGLSEYHPISFVAKGANRNFLL 844
+ E ++GC + + A F +ASF + L YHP+SF A+G R+FLL
Sbjct: 409 ----VGGGEKVQVGCAGGAQQKRGDGAWFRRSASFARGEPLRRYHPMSFAARGVRRSFLL 464
Query: 845 APLLSLRDESYTVYFDF 861
PL +LRDE YTVYF+
Sbjct: 465 EPLFTLRDEFYTVYFNL 481
>gi|218198541|gb|EEC80968.1| hypothetical protein OsI_23691 [Oryza sativa Indica Group]
Length = 759
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 246/522 (47%), Positives = 317/522 (60%), Gaps = 58/522 (11%)
Query: 387 FNFKSDPKRLASNLD-SNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ 445
+ + DPKRL + S+ EE+C TYN+LKVSR+LFRWTKE Y D+YER L NG++G Q
Sbjct: 244 LHVRHDPKRLVDEIKISSNEETCATYNLLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQ 303
Query: 446 RGTEPGVMIYLLPLAPGSSKE-----------RSYHHWGTPSDSFWCCYGTGIESFSKLG 494
RG EPGVMIY LP+ PG SK ++ WG + +FWCCYGTGIESFSKLG
Sbjct: 304 RGKEPGVMIYFLPMGPGRSKSISGMPTSGLPPKNPGGWGNANATFWCCYGTGIESFSKLG 363
Query: 495 DSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGL 554
DSIYF EEG+ PG+YIIQYI S DWK+ + V Q+ P+ S D + V++ SSKG
Sbjct: 364 DSIYFLEEGEIPGLYIIQYIPSTFDWKAAGLTVKQQAKPLSSTDSHFEVSIFISSKGDAR 423
Query: 555 TTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
++N+RIP+WTS +GA ATLNGQ L L S G+FLSVTK W DD L+++ P+TLRTE I
Sbjct: 424 PANVNVRIPSWTSVDGAIATLNGQKLNLTSAGDFLSVTKLW-GDDTLSLKFPITLRTEPI 482
Query: 615 QDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITP----------------- 657
+DDRPEY+SIQA+L+GP++LAG + G+ + S S S +TP
Sbjct: 483 KDDRPEYSSIQAVLFGPHLLAGLTHGNQTVKTSNDSNSG-LTPGVWEVNATHAAAAVAVW 541
Query: 658 ---IPASYNSQLITFTQEYGNTK----FVLTNS--NQSITMEKFPKSGTDAALHATFRLI 708
+ S NSQL+T TQ G+ + FVL+ S + ++TM++ P +G+DA +HATFR
Sbjct: 542 VTPVSQSLNSQLVTLTQRDGDAQAAAAFVLSVSIADGALTMQESPVAGSDACVHATFRAY 601
Query: 709 LNDSSGSEFSSLNDFI-GKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVA 767
+ S S + + G+ V LEPFD PGM V D L V A + F+ VA
Sbjct: 602 HSPSGASAIDAATGRLQGRDVALEPFDRPGMAVT-----DALSVGRPGPA---TRFNAVA 653
Query: 768 GLDGGDRTVSLESETYKGCFV------YTA---VNLQSSESTKLGCISESTEAGFNNAAS 818
GLDG TVSLE T GCFV Y A + + T G + + F AAS
Sbjct: 654 GLDGLPGTVSLELATRPGCFVAAPTTAYLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAAS 713
Query: 819 FVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFD 860
F L YHP+SF A G +RNFLL PL SL+DE YTVYF+
Sbjct: 714 FTQAAPLRLYHPLSFSATGTDRNFLLEPLQSLQDEFYTVYFN 755
Score = 210 bits (535), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 104/198 (52%), Positives = 134/198 (67%), Gaps = 10/198 (5%)
Query: 60 HND---HLTPSDDSAWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLK 116
H+D HL ++++ W+ L+PR R +DEL W LYR I G E +G FL
Sbjct: 51 HSDGLPHLNQAEEATWMGLLPR---RAGPRDEL-DWLALYRSITRGGGDVGGEPAG-FLS 105
Query: 117 EVSLHDVRLG--SDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
SLHDVR+ +M+W+ QQTNLEYLL LD D+L W FR+ A+LP GEPYGGWE P
Sbjct: 106 PASLHDVRVDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWEAPD 165
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
+LRGHF GHYLSA+A MWASTHN++L+EKM+ VV L +CQK++ +GYLSA+P FD
Sbjct: 166 GQLRGHFTGHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDA 225
Query: 235 LEALIPVWAPYYTIHKIL 252
+ L W+PYYTIHK +
Sbjct: 226 YDELAEAWSPYYTIHKFI 243
>gi|159491176|ref|XP_001703549.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280473|gb|EDP06231.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1485
Score = 410 bits (1055), Expect = e-111, Method: Compositional matrix adjust.
Identities = 279/873 (31%), Positives = 415/873 (47%), Gaps = 188/873 (21%)
Query: 133 RAQQTNLEYLL-MLDVDKLVWNFRKTARLPAPGEPY-GGWEEPSCELRGHFVGHYLSASA 190
R ++ N +YLL MLD D+L+W FRK A LP PGEPY G WE+P+CELRGHFVGHYLSA +
Sbjct: 557 RYERINSKYLLDMLDADRLLWVFRKNAGLPTPGEPYVGSWEDPNCELRGHFVGHYLSALS 616
Query: 191 LMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHK 250
L WA T N + K ++ +VS L Q+++G+GYLSAFPT FDR+E+L VWAPYYTIHK
Sbjct: 617 LAWAGTGNSAFKTRLDLMVSELGKVQEKLGTGYLSAFPTSWFDRVESLQAVWAPYYTIHK 676
Query: 251 ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVL 309
I+AGL+D + A + AL M T MV+Y +NR Q VI K +HWQ + E E GGMN++L
Sbjct: 677 IIAGLVDAHELAGHPSALTMATRMVDYHWNRTQAVISKKGA-KHWQKVLEFEYGGMNEIL 735
Query: 310 YKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGD- 368
Y+L+ IT H A LFDK FLG +A D + H+NTH+ ++G YE TG+
Sbjct: 736 YRLYLITGKDDHRDFASLFDKTVFLGHMAAHDDVLYDLHANTHLAQIVGFAAGYEATGNP 795
Query: 369 ----------QLHKEGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSR 418
++ + H + GT++ + + + L T E+CT YNMLK++R
Sbjct: 796 KLRTAVNNFFEIVVQHHGYATGGTSVFERWWGRRGRGPRNAL--KTHETCTQYNMLKIAR 853
Query: 419 HLFRWTKEIAYADYYERSLTNGVLGIQR-------------------------------- 446
LF WT ++ YAD+YER++ NG+ G+ R
Sbjct: 854 QLFMWTGDVYYADHYERAMVNGMWGVARLPADELPENGAAGAGGVDKGGQPVSPYTRFHD 913
Query: 447 --------------------GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTG 486
PGV +YLLP+ G+SK + HHWG P SFWCCYGT
Sbjct: 914 DEWMDYISFSKPKPEWNASDAAGPGVYLYLLPMGHGNSKSDNLHHWGFPFHSFWCCYGTI 973
Query: 487 IESFSKLGDSIYF-------------EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDP 533
IES++KL DSI+F E+ G ++ + D + K+ P
Sbjct: 974 IESYAKLADSIFFKWVRVRDMSPESDEDAGAKTAKKRTRHDVNPSDGSASGAKGAVKLPP 1033
Query: 534 VVSWDPYL--RVTLTFSSKGSGLTT---SLNLRIPTWTSSNGAKATLNGQDL----PLPS 584
+ + ++ R++ S+ SG T +L LRIP W G LNGQ P
Sbjct: 1034 RLYLNQFVSSRLSKASSTTASGPTDGVFTLMLRIPAWARDGGVLLELNGQAFNGCPGAPL 1093
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI 644
P ++ +T+ W + D L++++ L QD R EY S++A++ GPY++AG
Sbjct: 1094 PDSYCRITRKWQARDVLSVRVALRWWFSPAQDAREEYRSLKAVMMGPYMMAG-------- 1145
Query: 645 TESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHAT 704
W + + +++Q++ G++ +S+ S+ +G ++L +
Sbjct: 1146 ---------WNSSLHLRHDAQILYIEDADGSSG----HSHGSL-------AGAFSSLRSM 1185
Query: 705 FRLILNDSSGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELV--------VTDSFI 756
RL DS G ++ LE P + TD ++ + F
Sbjct: 1186 MRLGAADS------------GSALSLEAMSYPNHYLAHDHTDVIVLQPGPPREDASHPFA 1233
Query: 757 AQGSSVFHLVAGLDGGDRTVSLESETYKGCFVYTAVNLQSS------------ESTKLGC 804
+++ + GLDG TVS E+ G FV A S ++ ++ C
Sbjct: 1234 PCSRAMWMMRPGLDGAADTVSFEAVARPGWFVTAARPPGESAAAAKDSPVTCVDANEVDC 1293
Query: 805 ISESTEAGFNNA------------------------------------ASFVIEKGLSEY 828
+ + NA ASF + +
Sbjct: 1294 TAAVPDGCGTNAFLARVLCRKSCRSCLGTEQALRLRQQVPGSAVYAATASFRLAPPVRRA 1353
Query: 829 HPI-SFVAKGANRNFLLAPLLSLRDESYTVYFD 860
+P + V G+NR++L+APL +L DE Y+ YF+
Sbjct: 1354 YPAGAHVLAGSNRHYLIAPLGNLVDERYSAYFN 1386
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 70/213 (32%), Positives = 110/213 (51%), Gaps = 37/213 (17%)
Query: 450 PGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY---- 505
PGV IYLLPL G SK + HHWG P SFWCCYGT IES++KL DSIYF+E
Sbjct: 195 PGVFIYLLPLGTGQSKSDNIHHWGFPFHSFWCCYGTVIESYAKLADSIYFKEMSPANPES 254
Query: 506 -----------PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTF-SSKGSG 553
P +Y+ Q +SS+ W + V + D + + P LT S+K G
Sbjct: 255 RAHDKAGVRLPPRLYVNQLVSSKATWAEMNLRVTMQAD-MFTPGPAAVAQLTLDSTKAPG 313
Query: 554 LTT------SLNLRIPTWTSSN----------GAKATLNGQ---DLPLP-SPGNFLSVTK 593
T +L +R+P W + + GA +NGQ P P G++ ++ +
Sbjct: 314 PGTHDLGTFTLMVRVPEWLAPDRHGGVAQGGSGASIEVNGQLWTSCPGPVKAGSYCALMR 373
Query: 594 TWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
W+S D ++++LP+ R +++ ++R ++ +++
Sbjct: 374 RWASGDGVSLRLPMRWRLQSLAENRAQHQGLKS 406
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 72/131 (54%), Gaps = 13/131 (9%)
Query: 321 HLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESS 380
H+ A LF+KP F + D + H+NTH+ V G Y+ ++ ++
Sbjct: 2 HMEFAQLFNKPFFRKPMEAGNDMLMNLHANTHLAQVAGFAEEYDTVDKRVF-------AT 54
Query: 381 GTNIGHFNFKSDPKRLASNLDSN-----TEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
G + H F P LA ++ + T+E+CT YN+LK++R LFRWT ++ YAD+YER
Sbjct: 55 GGSTDH-EFWQAPDELADSVLTQKHGVETQETCTQYNILKIARSLFRWTGDVRYADFYER 113
Query: 436 SLTNGVLGIQR 446
+L NG+LG R
Sbjct: 114 ALVNGILGTAR 124
>gi|384252025|gb|EIE25502.1| DUF1680-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 648
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 229/634 (36%), Positives = 343/634 (54%), Gaps = 48/634 (7%)
Query: 113 EFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY-GGWE 171
+ ++ L + L DS+ +A N +Y+L L+ D+L+ FR A LP+ +P+ G WE
Sbjct: 20 DIIQPFPLDQITLERDSLFDKALALNTDYMLQLNADQLLHTFRLNAGLPSSAQPFTGSWE 79
Query: 172 EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ 231
+PSCE+RG F+GHYLSA +++ T N ++ +++ ++ L Q + GYLSAFP E
Sbjct: 80 DPSCEVRGQFMGHYLSACSMLVNHTGNGKIESRLTYIIDELRKVQIALSGGYLSAFPEEH 139
Query: 232 FDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSI 291
F RL++L VWAP+Y IHKI+AGLLD + + AL M E+F +V+
Sbjct: 140 FVRLQSLQTVWAPFYVIHKIMAGLLDAHNFLGYDVALEMVKDEAEHFTRYYNDVVATNGT 199
Query: 292 ERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNT 351
E + L E GGMN+VL+ L+ +T DP+H+ LA F KP F L D + G H+NT
Sbjct: 200 EHWLRMLEVEFGGMNEVLFNLYDVTGDPEHIRLAEAFTKPKFFEPLLQNTDPLPGLHANT 259
Query: 352 HIPIVIGSQMRYEVTGDQ-----------LHKEGHQLESSGTNIGHFNFKSDPKRLASNL 400
H+ V G R+E + GH + G N + P++LA ++
Sbjct: 260 HLAQVNGFAARFEKASHDGSYAAVTNFFSIVTRGHSFATGGNN--DHEYWGPPRQLADSI 317
Query: 401 ---DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR--------GTE 449
+ TEE+CT YNMLK++R+LFRWT +ADYYER++ NG+LG QR +
Sbjct: 318 LLHATETEETCTQYNMLKIARYLFRWTGAPVFADYYERAILNGLLGTQRMPADYSPHTSR 377
Query: 450 PGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEG------ 503
PGV+IYLLP+ G +K S WG P SFWCCYG+ +ESFSKL DSI+F +
Sbjct: 378 PGVVIYLLPMGSGQTKGGSTRGWGDPLHSFWCCYGSSVESFSKLADSIFFYRQAHSSCLT 437
Query: 504 --KYPG-VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNL 560
YP Y ++S L S Q+ + S + + L+ ++ S +L L
Sbjct: 438 LHAYPAHFYTSASLASPLVGLSVQLQASFFQGTTASANITV-APLSAAAHDSTAEVTLKL 496
Query: 561 RIPTWTSSNGAKATLNGQD------LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
RIP+W S+G + +NGQ P G+F +V + +++ DK+T+ LP+++R E +
Sbjct: 497 RIPSWAVSSGVRVEVNGQSWADCAPAAGPQAGSFCTVRRRFAAGDKVTLALPMSIRAERV 556
Query: 615 QDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYG 674
QDDRPEY+S AI+ GP ++AG + G I ++D +T I + + LI G
Sbjct: 557 QDDRPEYSSQHAIMMGPLLMAGITNGSRSIQADPRKVADLLTDISSQGLASLII----PG 612
Query: 675 NTKFVLTNSNQSITMEKFPKSGTDAALHATFRLI 708
+ + + + E P G AL +TFRL+
Sbjct: 613 DLPLHIRHEGAMLRAE--PMKGP-YALDSTFRLL 643
>gi|383316642|ref|YP_005377484.1| hypothetical protein [Frateuria aurantia DSM 6220]
gi|379043746|gb|AFC85802.1| hypothetical protein Fraau_1370 [Frateuria aurantia DSM 6220]
Length = 651
Score = 349 bits (896), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 204/543 (37%), Positives = 296/543 (54%), Gaps = 47/543 (8%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVG-HYLSASALM 192
A N YL L VD+L NF + A LP+ +P GGWE P CELRGHF G H+LSA+AL+
Sbjct: 77 AAAINARYLHQLPVDRLAHNFLRQAGLPSTAQPLGGWESPECELRGHFCGGHWLSAAALV 136
Query: 193 WASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKIL 252
WA+T + +LK++ +V+ L+ CQ+ GYLSAFP F+RL VWAP+YT+HKIL
Sbjct: 137 WATTADRTLKQRADELVAILARCQRS--DGYLSAFPDSFFERLSHGQKVWAPFYTLHKIL 194
Query: 253 AGLLDQYTYADNAEALRMTT----WMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDV 308
G LD Y +A N +AL + T W V + R + + L E GGMND
Sbjct: 195 CGHLDMYMHAGNQQALDIATGLGDWTVHWLNGRSDAQMN--------EILRTEYGGMNDA 246
Query: 309 LYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGD 368
L +L+ IT + ++L AH FD+ L LA D++ G HSNT +P +IG+ RYE+TG+
Sbjct: 247 LCELYAITGNGRYLDAAHRFDQASLLDPLAAHRDELKGLHSNTQLPKIIGAARRYELTGE 306
Query: 369 QLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSR 418
Q ++ G + ++G + + + P L L E C YN+LK++R
Sbjct: 307 QRYRRMAEFGWETISGTRCYANGGSSNDEFWNNGPDDLHDQLGVAAAECCVAYNLLKLTR 366
Query: 419 HLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDS 478
H++ WT + DYYER+L N LG Q G+ +Y PLAPG SY ++ +P S
Sbjct: 367 HVYGWTGDPRAFDYYERNLYNARLGTQ--DPAGMKLYYYPLAPG-----SYKYFNSPLHS 419
Query: 479 FWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWD 538
FWCC GTG E F++ DSIYF G+ +Y+ YI+SRL W + ++Q
Sbjct: 420 FWCCTGTGAEEFARFNDSIYFHTPGE---LYVNLYIASRLKWAEQGLTLSQLTRFPEQDV 476
Query: 539 PYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSS 597
++ LT ++ +NLRIP+WT + + +N Q + + PG++LS+ + W
Sbjct: 477 SDFKLQLTAPAR-----LRINLRIPSWT-AGAPQLWINDQLQNVSALPGSYLSIERMWHD 530
Query: 598 DDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITP 657
D L +QLP+ L+ + + D ++ A+LYGP LA GD +T + W P
Sbjct: 531 KDHLRLQLPMQLKMQPLPGDDAQF----ALLYGPITLAAELPGD-PVTPAMQHCDYWADP 585
Query: 658 IPA 660
PA
Sbjct: 586 KPA 588
>gi|390957656|ref|YP_006421413.1| hypothetical protein Terro_1782 [Terriglobus roseus DSM 18391]
gi|390412574|gb|AFL88078.1| hypothetical protein Terro_1782 [Terriglobus roseus DSM 18391]
Length = 635
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 209/542 (38%), Positives = 294/542 (54%), Gaps = 44/542 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
L + VRL D R+ N +YL L VD+L+ +FR TA + + +PYGGWE P+
Sbjct: 43 LSPFPMSAVRL-LDGEFKRSADVNEKYLDSLQVDRLLHSFRLTAGITSSAKPYGGWEIPN 101
Query: 175 CELRGHFVG-HYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD 233
ELRGHF G HYLSA A A N +L+EK +A+V+ L+ACQK G+GYLSA+P E F
Sbjct: 102 GELRGHFAGGHYLSAVAFASAGAGNTTLREKGNALVAGLAACQKANGNGYLSAYPPELFQ 161
Query: 234 RLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALR----MTTWMVEYFYNRVQNVIKKY 289
RL VWAP+YT HKI+AGL+D YT N +AL+ M W YF +
Sbjct: 162 RLALGKQVWAPFYTYHKIMAGLVDMYTQTGNEDALKVAEGMAGWSSAYFAD--------M 213
Query: 290 SIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHS 349
S + L E GGMN+VL L+ +T ++L A F++P FL LA D++ G H+
Sbjct: 214 SDAQRQGILRIEYGGMNEVLVNLYSLTGKERYLSQARKFEQPTFLDPLAAHRDELQGLHA 273
Query: 350 NTHIPIVIGSQMRYEVTGDQLHKE------GHQLESSGTNIGHF----NFKSDPKRLASN 399
NT IP +IG+ YE TGD+ ++E L + IG+ ++++ LA +
Sbjct: 274 NTSIPKIIGAARMYEATGDRRYQEIASYFLDDVLSAHTYAIGNTSDDEHWRTPAGSLAGS 333
Query: 400 LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL 459
L E C YN++K+ RHL WT + + D YER+L N LG Q G+ Y PL
Sbjct: 334 LSLKNAECCVAYNLMKLERHLSAWTGDARWMDAYERTLFNARLGTQDAA--GLKQYFFPL 391
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
A G + +G+P +SFWCC GTG E F+K GDSIYF VY+ Q+I+S L
Sbjct: 392 AAG-----YWRVYGSPEESFWCCTGTGAEDFAKFGDSIYFHANDT---VYVNQFIASVLT 443
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
WK + Q+ S+ + LT + S+ +RIP+W + G A + +
Sbjct: 444 WKEKGFTLRQE----TSFPSESQTRLTIQT-AQPQERSIAIRIPSWIADGGFVAVNDKRL 498
Query: 580 LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSI 639
PG++L + +TW + D +T+ LP+ LR E + P + A LYGP VLAG ++
Sbjct: 499 EAFAEPGSYLVIRRTWHAGDTVTVHLPMALREEPL----PGSPNTAAALYGPLVLAG-TL 553
Query: 640 GD 641
GD
Sbjct: 554 GD 555
>gi|116620365|ref|YP_822521.1| hypothetical protein Acid_1242 [Candidatus Solibacter usitatus
Ellin6076]
gi|116223527|gb|ABJ82236.1| protein of unknown function DUF1680 [Candidatus Solibacter usitatus
Ellin6076]
Length = 664
Score = 339 bits (870), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 213/535 (39%), Positives = 299/535 (55%), Gaps = 63/535 (11%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWE---EPS--------CELRGHFV 182
A + N Y+ L D+L+ FR A LP+ +P GGWE EP+ ELRGHFV
Sbjct: 82 AAEWNRGYMNRLPADRLLHAFRLNAGLPSSAQPLGGWEIYVEPTPGKRINSEGELRGHFV 141
Query: 183 GHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIG-SGYLSAFPTEQFDRLEALIPV 241
GH+LSASA ++AS ++ K K +V+ L+ CQ+++G SGYLSAFP E FDRL+A PV
Sbjct: 142 GHFLSASAQLYASMGDKDAKAKADYIVAELAKCQQKLGPSGYLSAFPIEWFDRLDARKPV 201
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALR----MTTWMVEYFYNRVQNVIKKYSIERHWQ- 296
WAP+YTIHKI+AG+ D YT A N +AL+ M+ W E+ ++ E H Q
Sbjct: 202 WAPFYTIHKIMAGMFDMYTLAGNQQALQVLEGMSNWADEWTASKS---------EAHMQD 252
Query: 297 TLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIV 356
L E GGMN+VLY L +T + + F K F LAL+ D ++G H NTHIP V
Sbjct: 253 ILRTEYGGMNEVLYNLAAVTGNDRWAKAGDRFTKKEFFNPLALRNDALTGLHVNTHIPQV 312
Query: 357 IGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSN-- 403
IG+ RYE++ D + + GT+ G + + P+ LA+ L +
Sbjct: 313 IGAAARYEISSDMRFHDVADYFWYEVVTARSYVTEGTSNGE-GWLTQPRMLAAELKRSVA 371
Query: 404 TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG-IQRGTEPGVMIYLLPLAPG 462
T E C +YNMLK++RHL+ W + AY DYYER+L N LG IQ T G Y L L PG
Sbjct: 372 TAECCCSYNMLKLTRHLYGWKPDPAYFDYYERALFNHRLGTIQPKT--GYTQYYLSLTPG 429
Query: 463 SSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
+ K + T SFWCC G+G+E +SKL DSIY+ + G+ + +I S L+W+
Sbjct: 430 AWKT-----FNTEDKSFWCCTGSGVEEYSKLNDSIYWHDAE---GLTVNLFIPSELNWEE 481
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL 582
+ Q+ + TLT ++ S ++ LRIP WT S K +NG+ + +
Sbjct: 482 KGFRLRQE----TKFPEQQSTTLTVTAAKSA-PMAMRLRIPAWTKSAAVK--INGRAVDV 534
Query: 583 -PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
P+PG++L++T+ W + DK+ + LP+ L E + DD QA LYGP VLAG
Sbjct: 535 TPTPGSYLTLTRPWKAGDKIEMTLPMHLSVEYMPDD----PKTQAFLYGPIVLAG 585
>gi|225872906|ref|YP_002754363.1| Tat pathway signal sequence domain-containing protein
[Acidobacterium capsulatum ATCC 51196]
gi|225794208|gb|ACO34298.1| Tat pathway signal sequence domain protein [Acidobacterium
capsulatum ATCC 51196]
Length = 644
Score = 337 bits (864), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 206/549 (37%), Positives = 299/549 (54%), Gaps = 53/549 (9%)
Query: 116 KEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC 175
K+ + VR+ D + A + N +YL ++ D+L+ FR TA LP EP GGWE P C
Sbjct: 56 KDFPMTQVRM-RDGVLKNALEINRQYLYLVPNDRLLHTFRLTAGLPTSAEPLGGWEAPDC 114
Query: 176 ELRGHFVG-HYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
ELRGHF G HYLSA ALM+AST +E +K K A+V+ L+ CQ+ GYLSAFP FDR
Sbjct: 115 ELRGHFAGGHYLSACALMYASTGDEKIKAKGDALVAELAKCQQP--DGYLSAFPASFFDR 172
Query: 235 LEALIPVWAPYYTIHKILAGLLDQYTYADNAEAL----RMTTWMVEYFYNRVQNVIKKYS 290
L VWAP+YT HKI+AG LD Y + N +AL RM W +EY K
Sbjct: 173 LRHYQKVWAPFYTYHKIMAGHLDMYVHTGNQQALETCKRMADWAIEY--------TKPIP 224
Query: 291 IERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSN 350
++ + L E GGMN+V + L+ +T + K+ L F+ LA + D ++G H+N
Sbjct: 225 ADQWQRMLLVEQGGMNEVSFNLYAVTGEKKYRDLGFRFEHKLIFDPLAKREDHLAGNHAN 284
Query: 351 THIPIVIGSQMRYEVTGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRLASN 399
T+IP VIG+ YEV D+ + H + GT+ G F K P LA +
Sbjct: 285 TNIPKVIGAARGYEVADDKRYHTIAEFFWGAVTSQHAYATGGTSDGEFWHK--PGTLAEH 342
Query: 400 LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL 459
L EE C +YNM+K+SRHL+ WT + DYYER + N +G Q G+++Y + L
Sbjct: 343 LGPAAEECCCSYNMMKLSRHLYGWTGDPRIFDYYERLMYNVRIGTQ--DPKGMLMYYVSL 400
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
PG K +GTP D+FWCC GTG+E +SK+ DSIYF + +Y+ + S +
Sbjct: 401 KPGYWKT-----FGTPFDAFWCCTGTGVEEYSKVNDSIYFHDAKN---IYVNLFAGSEVQ 452
Query: 520 WKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
W + + Q+ + P+ TLT ++ L +R+P W ++NG +NGQ
Sbjct: 453 WPEKNVSLVQETNFPLEE-----ATTLTVRAQKPS-AFGLKIRVPYW-ATNGFTIHINGQ 505
Query: 579 DLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+ + P ++ ++ +TW D + + +P++L I P+ +QA+LYGP VLAG
Sbjct: 506 PQSVEAKPESYATLHRTWHDGDTIKVSMPMSLHISPI----PDSPDVQAVLYGPLVLAG- 560
Query: 638 SIGDWDITE 646
+G +TE
Sbjct: 561 EMGRHGLTE 569
>gi|423313782|ref|ZP_17291717.1| hypothetical protein HMPREF1058_02329 [Bacteroides vulgatus
CL09T03C04]
gi|392684317|gb|EIY77645.1| hypothetical protein HMPREF1058_02329 [Bacteroides vulgatus
CL09T03C04]
Length = 640
Score = 322 bits (824), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 194/555 (34%), Positives = 300/555 (54%), Gaps = 44/555 (7%)
Query: 100 KNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTAR 159
++ G+ K + +K L DVRL + ++ ++ ++VD+L+ +FR A
Sbjct: 27 QHAGKLKRETVAPMKVKSFDLKDVRLLPSRFRENMMRDSV-WMASIEVDRLLHSFRTNAG 85
Query: 160 LPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSAL 212
+ A E GGWE CELRGH GH LSA LM+A+T +E K+K ++V+ L
Sbjct: 86 VFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYGLMYAATGSEQFKQKGDSLVNGL 145
Query: 213 SACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTT 272
+ Q +G+GYLSA+P E +R VWAP+YT+HK+ +GL+DQY Y+DN +AL +
Sbjct: 146 AEVQTALGNGYLSAYPEELINRNICGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVV 205
Query: 273 WMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPC 332
M ++ Y++ +K + + E GG+N+ Y L+ IT D +H LA F
Sbjct: 206 RMADWAYHK----LKPLDETTRQKMIRNEFGGVNESFYNLYAITGDERHRWLAQFFYHNE 261
Query: 333 FLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS- 391
+ L DD+ H+NT IP VI YE+T D+ ++ T I H F
Sbjct: 262 VIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTEDENSRKLSDFFWH-TMIDHHTFAPG 320
Query: 392 ---------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL 442
DP R + ++ T E+C TYNMLK+SRHLF WT + A ADYYER+L N +L
Sbjct: 321 CSSDKEHYFDPARFSKHVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHIL 380
Query: 443 GIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEE 502
G Q+ + G++ Y LPL GS K S T +SFWCC G+G E+ +K G++IY+ +
Sbjct: 381 G-QQDPQTGMVTYFLPLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYHND 434
Query: 503 GKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRI 562
G+Y+ +I S ++W+ + + Q+ D P T+ + + T++ LR
Sbjct: 435 ---KGIYVNLFIPSVVNWRKKGLTLRQETD-----FPAEETTVLTIRAQNPVETTVYLRY 486
Query: 563 PTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
P+W S G K +NG+ + + PG+++++T+ W D++T P+ LR E D+ P+
Sbjct: 487 PSW--SKGVKVFVNGKKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQK 543
Query: 622 ASIQAILYGPYVLAG 636
A++YGP VLAG
Sbjct: 544 G---ALVYGPVVLAG 555
>gi|319643216|ref|ZP_07997844.1| hypothetical protein HMPREF9011_03445 [Bacteroides sp. 3_1_40A]
gi|345520493|ref|ZP_08799881.1| hypothetical protein BSFG_01473 [Bacteroides sp. 4_3_47FAA]
gi|254835017|gb|EET15326.1| hypothetical protein BSFG_01473 [Bacteroides sp. 4_3_47FAA]
gi|317385120|gb|EFV66071.1| hypothetical protein HMPREF9011_03445 [Bacteroides sp. 3_1_40A]
Length = 640
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 194/555 (34%), Positives = 300/555 (54%), Gaps = 44/555 (7%)
Query: 100 KNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTAR 159
++ G+ K + +K L DVRL + ++ ++ ++VD+L+ +FR A
Sbjct: 27 QHAGKLKRETVAPMKVKSFDLKDVRLLPSRFRENMMRDSV-WMASIEVDRLLHSFRTNAG 85
Query: 160 LPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSAL 212
+ A E GGWE CELRGH GH LSA LM+A+T +E K+K ++V+ L
Sbjct: 86 VFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYGLMYAATGSEQFKQKGDSLVNGL 145
Query: 213 SACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTT 272
+ Q +G+GYLSA+P E +R VWAP+YT+HK+ +GL+DQY Y+DN +AL +
Sbjct: 146 AEVQTALGNGYLSAYPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVV 205
Query: 273 WMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPC 332
M ++ Y++ +K + + E GG+N+ Y L+ IT D +H LA F
Sbjct: 206 RMADWAYHK----LKPLDETTRQKMIRNEFGGVNESFYNLYAITGDERHRWLAQFFYHNE 261
Query: 333 FLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS- 391
+ L DD+ H+NT IP VI YE+T D+ ++ T I H F
Sbjct: 262 VIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTEDENSRKLSDFFWH-TMIDHHTFAPG 320
Query: 392 ---------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL 442
DP R + ++ T E+C TYNMLK+SRHLF WT + A ADYYER+L N +L
Sbjct: 321 CSSDKEHYFDPARFSKHVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHIL 380
Query: 443 GIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEE 502
G Q+ + G++ Y LPL GS K S T +SFWCC G+G E+ +K G++IY+ +
Sbjct: 381 G-QQDPQTGMVTYFLPLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYHND 434
Query: 503 GKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRI 562
G+Y+ +I S ++W+ + + Q+ D P T+ + + T++ LR
Sbjct: 435 ---KGIYVNLFIPSVVNWREKGLTLRQETD-----FPAEETTVLTIRAQNPVETTVYLRY 486
Query: 563 PTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
P+W S G K +NG+ + + PG+++++T+ W D++T P+ LR E D+ P+
Sbjct: 487 PSW--SKGVKVFVNGKKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQK 543
Query: 622 ASIQAILYGPYVLAG 636
A++YGP VLAG
Sbjct: 544 G---ALVYGPVVLAG 555
>gi|150002728|ref|YP_001297472.1| hypothetical protein BVU_0120 [Bacteroides vulgatus ATCC 8482]
gi|294776982|ref|ZP_06742443.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|149931152|gb|ABR37850.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
gi|294449230|gb|EFG17769.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 640
Score = 319 bits (817), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 193/555 (34%), Positives = 300/555 (54%), Gaps = 44/555 (7%)
Query: 100 KNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTAR 159
++ G+ K + +K L DVRL + ++ ++ ++V++L+ +FR A
Sbjct: 27 QHAGKLKRETVAPMKVKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVNRLLHSFRTNAG 85
Query: 160 LPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSAL 212
+ A E GGWE CELRGH GH LSA LM+A+T +E K+K ++V+ L
Sbjct: 86 VFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYGLMYAATGSEQFKQKGDSLVNGL 145
Query: 213 SACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTT 272
+ Q +G+GYLSA+P E +R VWAP+YT+HK+ +GL+DQY Y+DN +AL +
Sbjct: 146 AEVQTALGNGYLSAYPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVI 205
Query: 273 WMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPC 332
M ++ Y++ +K + + E GG+N+ Y L+ IT D +H LA F
Sbjct: 206 RMADWAYHK----LKPLDETTRQKMIRNEFGGVNESFYNLYAITGDERHRWLAQFFYHNE 261
Query: 333 FLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS- 391
+ L DD+ H+NT IP VI YE+T D+ ++ T I H F
Sbjct: 262 VIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTEDENSRKLSDFFWH-TMIDHHTFAPG 320
Query: 392 ---------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL 442
DP R + ++ T E+C TYNMLK+SRHLF WT + A ADYYER+L N +L
Sbjct: 321 CSSDKEHYFDPARFSKHVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHIL 380
Query: 443 GIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEE 502
G Q+ + G++ Y LPL GS K S T +SFWCC G+G E+ +K G++IY+ +
Sbjct: 381 G-QQDPQTGMVTYFLPLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYHND 434
Query: 503 GKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRI 562
G+Y+ +I S ++W+ + + Q+ D P T+ + + T++ LR
Sbjct: 435 ---KGIYVNLFIPSVVNWREKGLTLRQETD-----FPAEETTVLTIRAQNPVETTVYLRY 486
Query: 563 PTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
P+W S G K +NG+ + + PG+++++T+ W D++T P+ LR E D+ P+
Sbjct: 487 PSW--SKGVKVFVNGKKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQK 543
Query: 622 ASIQAILYGPYVLAG 636
A++YGP VLAG
Sbjct: 544 G---ALVYGPVVLAG 555
>gi|270296104|ref|ZP_06202304.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|423303646|ref|ZP_17281645.1| hypothetical protein HMPREF1072_00585 [Bacteroides uniformis
CL03T00C23]
gi|423307631|ref|ZP_17285621.1| hypothetical protein HMPREF1073_00371 [Bacteroides uniformis
CL03T12C37]
gi|270273508|gb|EFA19370.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|392688010|gb|EIY81301.1| hypothetical protein HMPREF1072_00585 [Bacteroides uniformis
CL03T00C23]
gi|392689500|gb|EIY82777.1| hypothetical protein HMPREF1073_00371 [Bacteroides uniformis
CL03T12C37]
Length = 641
Score = 317 bits (813), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 203/542 (37%), Positives = 293/542 (54%), Gaps = 53/542 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
LK+V L R + M A T++ ++L+ FR A + A E
Sbjct: 48 LKDVRLLPSRFRDNMMRDSAWMTSIA------TNRLLHGFRNNAGVFAGREGGYMTVKKL 101
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA ALM+AST +E K K ++V+ L+ Q +G+GYLSA+
Sbjct: 102 GGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYLSAY 161
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY YADN AL + T M ++ YN+ +K
Sbjct: 162 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYADNKPALEVVTRMGDWAYNK----LK 217
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
+ + E GG+N+ Y L+ IT D ++ LA F + L Q DD+
Sbjct: 218 PLDEATRKRMIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDLGTK 277
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESS--GTNIGHFNFKS----------DPKR 395
H+NT IP V+ YE+T D + +L T I H F DP++
Sbjct: 278 HTNTFIPKVLAEARNYELTQDN---DSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQ 334
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
L+ +L T E+C TYNMLK+SRHLF WT + ADYYER+L N +LG Q+ E G++ Y
Sbjct: 335 LSKHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSY 393
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
LPL GS K S T +SFWCC G+G ES +K G++IY E G+Y+ +I
Sbjct: 394 FLPLLSGSHKVYS-----TRENSFWCCVGSGFESHAKYGEAIYCHNE---KGIYVNLFIP 445
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S ++WK+ I + Q+ + TLT + +TT++ LR P+W S G K +
Sbjct: 446 SEVNWKAKGITLRQE----TGFPAEENTTLTIQTD-KPVTTTIYLRYPSW--SEGVKVNV 498
Query: 576 NGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
NG+ + + PG++++VT+ W D++ P++L+ E D+ P+ A+LYGP VL
Sbjct: 499 NGKKVSVKQKPGSYIAVTRQWKDGDRIEANYPMSLQLETTSDN-PQKG---ALLYGPLVL 554
Query: 635 AG 636
AG
Sbjct: 555 AG 556
>gi|265752243|ref|ZP_06088036.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263237035|gb|EEZ22505.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 640
Score = 316 bits (810), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 191/540 (35%), Positives = 292/540 (54%), Gaps = 44/540 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
+K L DVRL + ++ ++ ++VD+L+ +FR A + A E
Sbjct: 42 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 100
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA LM+A+T ++ + K ++VS L+ Q +G+GYLSA+
Sbjct: 101 GGWESLDCELRGHTTGHLLSAYGLMYAATGSKLFRHKGDSLVSGLAEVQNALGNGYLSAY 160
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY Y+DN +AL + M ++ Y++ +K
Sbjct: 161 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVIRMADWAYHK----LK 216
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
+ + E GG+N+ Y L+ IT D +H LA F + L DD+
Sbjct: 217 PLDETTRQKMIRNEFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDLGTK 276
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS----------DPKRLA 397
H+NT IP VI YE+T D+ ++ T I H F DP R +
Sbjct: 277 HTNTFIPKVIAEARNYELTEDENSRKLSDFFWH-TMIDHHTFAPGCSSDKEHYFDPARFS 335
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
++ T E+C TYNMLK+SRHLF WT + A ADYYER+L N +LG Q+ + G++ Y L
Sbjct: 336 KHVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTYFL 394
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL GS K S T +SFWCC G+G E+ +K G++IY+ + G+Y+ +I S
Sbjct: 395 PLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYHND---KGIYVNLFIPSV 446
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
++W+ + + Q+ D P T+ S + T++ LR P+W S K +NG
Sbjct: 447 VNWQEKGLTLRQETD-----FPAEETTVLTIGTQSPVETTVYLRYPSW--SKEVKVAVNG 499
Query: 578 QDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + + PG+++++T+ W D++T P+ LR E D+ P+ A++YGP VLAG
Sbjct: 500 KKVAVKQKPGSYIAITRLWKDGDRITADYPMRLRVETTPDN-PQKG---ALVYGPVVLAG 555
>gi|345512540|ref|ZP_08792066.1| hypothetical protein BSEG_01611 [Bacteroides dorei 5_1_36/D4]
gi|423229086|ref|ZP_17215491.1| hypothetical protein HMPREF1063_01311 [Bacteroides dorei
CL02T00C15]
gi|423244926|ref|ZP_17226000.1| hypothetical protein HMPREF1064_02206 [Bacteroides dorei
CL02T12C06]
gi|345456387|gb|EEO45470.2| hypothetical protein BSEG_01611 [Bacteroides dorei 5_1_36/D4]
gi|392634839|gb|EIY28751.1| hypothetical protein HMPREF1063_01311 [Bacteroides dorei
CL02T00C15]
gi|392640967|gb|EIY34758.1| hypothetical protein HMPREF1064_02206 [Bacteroides dorei
CL02T12C06]
Length = 646
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 191/540 (35%), Positives = 292/540 (54%), Gaps = 44/540 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
+K L DVRL + ++ ++ ++VD+L+ +FR A + A E
Sbjct: 48 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 106
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA LM+A+T ++ + K ++VS L+ Q +G+GYLSA+
Sbjct: 107 GGWESLDCELRGHTTGHLLSAYGLMYAATGSKLFRHKGDSLVSGLAEVQNALGNGYLSAY 166
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY Y+DN +AL + M ++ Y++ +K
Sbjct: 167 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVIRMADWAYHK----LK 222
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
+ + E GG+N+ Y L+ IT D +H LA F + L DD+
Sbjct: 223 PLDETTRQKMIRNEFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDLGTK 282
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS----------DPKRLA 397
H+NT IP VI YE+T D+ ++ T I H F DP R +
Sbjct: 283 HTNTFIPKVIAEARNYELTEDENSRKLSDFFWH-TMIDHHTFAPGCSSDKEHYFDPARFS 341
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
++ T E+C TYNMLK+SRHLF WT + A ADYYER+L N +LG Q+ + G++ Y L
Sbjct: 342 KHVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTYFL 400
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL GS K S T +SFWCC G+G E+ +K G++IY+ + G+Y+ +I S
Sbjct: 401 PLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYHND---KGIYVNLFIPSV 452
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
++W+ + + Q+ D P T+ S + T++ LR P+W S K +NG
Sbjct: 453 VNWQEKGLTLRQETD-----FPAEETTVLTIGTQSPVETTVYLRYPSW--SKEVKVAVNG 505
Query: 578 QDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + + PG+++++T+ W D++T P+ LR E D+ P+ A++YGP VLAG
Sbjct: 506 KKVAVKQKPGSYIAITRLWKDGDRITADYPMRLRVETTPDN-PQKG---ALVYGPVVLAG 561
>gi|427385118|ref|ZP_18881623.1| hypothetical protein HMPREF9447_02656 [Bacteroides oleiciplenus YIT
12058]
gi|425727286|gb|EKU90146.1| hypothetical protein HMPREF9447_02656 [Bacteroides oleiciplenus YIT
12058]
Length = 629
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 196/527 (37%), Positives = 288/527 (54%), Gaps = 33/527 (6%)
Query: 122 DVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHF 181
DVRL D RA + + +L DV++ + FR TA L + GGWE CELRGH
Sbjct: 50 DVRL-LDGPFKRAMEVDQRWLKEADVNRFLHAFRVTAGLATGAQNLGGWESLDCELRGHT 108
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIG-SGYLSAFPTEQFDRLEALIP 240
GH LSA +LM+AST +E + K + +V L+ CQ+ +G +GYLSAFP DR
Sbjct: 109 TGHLLSALSLMYASTGDEQYRTKGAELVKGLAECQQTLGKNGYLSAFPEYFIDRAIKEEI 168
Query: 241 VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNE 300
VWAP+YT+HK+ AGLLDQYT N +AL + T M ++ YN+ +K + + LN
Sbjct: 169 VWAPFYTLHKVYAGLLDQYTLCGNQQALDVLTGMCDWAYNK----LKPLTPTQLQGMLNS 224
Query: 301 EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
E GGM + Y L+ +T + +H LA +F L LA + D ++G H NT IP V+G
Sbjct: 225 EFGGMPETFYNLYALTGNARHKELAEMFYHNSILDPLAARRDSLAGIHVNTQIPKVLGEA 284
Query: 361 MRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTT 410
YE+TG+ G +G N F S P L+ L NT E+C T
Sbjct: 285 RGYEMTGNPQSATIANFFWEAVVGDHTYVTGGNSDKEIF-SKPGILSDQLSENTTETCNT 343
Query: 411 YNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYH 470
YNMLK++RHLF W A ADYYER+L N +L Q E G + Y L PGS K+ Y
Sbjct: 344 YNMLKLTRHLFTWDASPARADYYERALYNHILSSQN-PETGGVTYYHTLHPGSCKKFHY- 401
Query: 471 HWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK 530
P CC GTG E+ +K G++IY++ + G+Y+ +I+S L+WK + V Q+
Sbjct: 402 ----PFRDNTCCVGTGYENHAKYGEAIYYKTADQ-SGLYVNLFIASVLNWKEKDLTVRQE 456
Query: 531 VDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFL 589
+ + R+T+ + + +G+ LR P+W + +G +NG+ + +PG+++
Sbjct: 457 TN--YPDEASTRITIAAAPE-AGIQMPFMLRYPSW-AVDGVTIKVNGKKQHVKKAPGSYI 512
Query: 590 SVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ +TW D +T+++P++L E + D + + AILYGP VLA
Sbjct: 513 HIDRTWRQGDVITMEMPMSLHIEYMPDTKEK----GAILYGPIVLAA 555
>gi|116625830|ref|YP_827986.1| hypothetical protein Acid_6783 [Candidatus Solibacter usitatus
Ellin6076]
gi|116228992|gb|ABJ87701.1| protein of unknown function DUF1680 [Candidatus Solibacter usitatus
Ellin6076]
Length = 675
Score = 315 bits (807), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 207/585 (35%), Positives = 306/585 (52%), Gaps = 68/585 (11%)
Query: 102 PGQFKVP--------ERSGEFLKEV--------SLHDVRLGSDSMHWRAQQTNLEYLLML 145
PG F+ P E EF +++ + VRL S + +Q+ N Y+ L
Sbjct: 33 PGNFRRPLAPETPAFETPLEFTRKIVTPRAEPFPMPQVRLLPGSAYHDSQEWNRGYMERL 92
Query: 146 DVDKLVWNFRKTARLP-APGEPYGGWEEP-----SCELRGHFVGHYLSASALMWASTHNE 199
D+L+ FR A LP +P GGWE+P S ELRGHF GH+LSASA + ++ ++
Sbjct: 93 AADRLLHTFRANAGLPVGSAKPLGGWEQPENGQRSSELRGHFAGHFLSASAQL-SANGDK 151
Query: 200 SLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQY 259
+ + K +V+ ++ CQ+++G YLSAFPT +DRL VWAP+YTIHKI+AG+ D Y
Sbjct: 152 NAQSKGDFMVAEMARCQQKLGGKYLSAFPTTWWDRLGKGERVWAPFYTIHKIMAGMFDMY 211
Query: 260 TYADNAEALR----MTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCI 315
+ A N +AL M W E+ + E Q L E GG+ + LY+L
Sbjct: 212 SLAGNQQALEVLEGMAAWADEW--------TAPKAAEHMQQILTIEFGGIAETLYRLAAA 263
Query: 316 TQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGD-QLHK-- 372
T + + F K FL LA + D++ G H NTHIP V+ + RY+++GD + H
Sbjct: 264 TDQDRWGRVGDRFQKKSFLNPLAARRDELRGLHVNTHIPQVMAAARRYDLSGDMRFHDVA 323
Query: 373 -------EGHQLESSGTNIGHFNFKSDPKRLAS--NLDSNTEESCTTYNMLKVSRHLFRW 423
G + +G + + P+RLA+ L NT E C YNMLK++RHL+ W
Sbjct: 324 DYFFSEVAGARTYVTGGTSNAEAWLAPPRRLATELKLSVNTAECCCAYNMLKLARHLYSW 383
Query: 424 TKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCY 483
+ +Y DYYE L N +G R + G+ Y L L PG+ K + T +FWCC
Sbjct: 384 DPKPSYFDYYEHLLLNHRIGTIR-PKVGLTQYYLSLTPGAWKT-----FNTEDQTFWCCT 437
Query: 484 GTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRV 543
G+G+E +SKL DSIY+ + G+Y+ +ISS LDW + Q S P +
Sbjct: 438 GSGVEEYSKLNDSIYWRDG---EGLYVNLFISSELDWAERGFKLRQATQYPAS--PSTAL 492
Query: 544 TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLT 602
T+T + G ++ LRIP W S LNG+ L +PG++L + + W D++
Sbjct: 493 TVTAARAGD---LAIRLRIPGWLQS-APSVKLNGKALDASAAPGSYLVLKRNWKVGDRID 548
Query: 603 IQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITES 647
++LP+ L +A+ DD ++QA LYGP VLAG +G +TE+
Sbjct: 549 MELPMRLHVQAMPDD----PAMQAFLYGPLVLAG-DLGGEGLTEA 588
>gi|423222645|ref|ZP_17209115.1| hypothetical protein HMPREF1062_01301 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392641932|gb|EIY35705.1| hypothetical protein HMPREF1062_01301 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 641
Score = 314 bits (805), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 196/544 (36%), Positives = 297/544 (54%), Gaps = 48/544 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
++ L DVRL + ++ ++ + ++L+ +FR A + A E
Sbjct: 43 VESFDLKDVRLLPSRFRDNMMRDSV-WMTSIATNRLLHSFRNNAGVFAGREGGYMTIKKL 101
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA ALM+AST +E K K ++V+ L+ Q +G+GYLSA+
Sbjct: 102 GGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYLSAY 161
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY Y DN +AL + T M ++ YN+ +K
Sbjct: 162 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALEVVTRMGDWAYNK----LK 217
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
+ + E GG+N+ Y L+ IT D ++ LA F + L Q DD+
Sbjct: 218 PLDEPTRKRMIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDLGTK 277
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESS--GTNIGHFNFKS----------DPKR 395
H+NT IP V+ YE+T D + +L T I H F DP++
Sbjct: 278 HTNTFIPKVLAEARNYELTQDN---DSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQ 334
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
L+ +L T E+C TYNMLK+SRHLF WT + ADYYER+L N +LG Q+ E G++ Y
Sbjct: 335 LSKHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSY 393
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
LPL GS K S T +SFWCC G+G E+ +K G++IY+ + G+Y+ +I
Sbjct: 394 FLPLLSGSHKVYS-----TRENSFWCCVGSGFENHAKYGEAIYYHND---QGIYVNLFIP 445
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S ++WK+ +I + Q+ ++ LT + +TT++ LR P+W S K +
Sbjct: 446 SEVNWKAKRITLRQE----TAFPAAENTALTIQTD-KPVTTTIYLRYPSW--SKNVKVNV 498
Query: 576 NGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
NG+ + + PG++++VT+ W D++ P++L+ E D+ P+ A+LYGP VL
Sbjct: 499 NGKKVSVKQKPGSYIAVTRQWKDGDRIEANYPMSLQLETTPDN-PQKG---ALLYGPLVL 554
Query: 635 AGHS 638
AG S
Sbjct: 555 AGES 558
>gi|224539132|ref|ZP_03679671.1| hypothetical protein BACCELL_04034 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519254|gb|EEF88359.1| hypothetical protein BACCELL_04034 [Bacteroides cellulosilyticus
DSM 14838]
Length = 641
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 196/544 (36%), Positives = 296/544 (54%), Gaps = 48/544 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
++ L DVRL + ++ ++ + ++L+ +FR A + A E
Sbjct: 43 VESFDLKDVRLLPSRFRDNMMRDSV-WMTSIATNRLLHSFRNNAGVFAGREGGYMTVKKL 101
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA ALM+AST +E K K ++V+ L+ Q +G+GYLSA+
Sbjct: 102 GGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYLSAY 161
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY Y DN +AL + T M ++ YN+ +K
Sbjct: 162 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALEVVTRMGDWAYNK----LK 217
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
+ + E GG+N+ Y L+ IT D ++ LA F + L Q DD+
Sbjct: 218 PLDEPTRKRMIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDLGTK 277
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESS--GTNIGHFNFKS----------DPKR 395
H+NT IP V+ YE+T D + +L T I H F DP++
Sbjct: 278 HTNTFIPKVLAEARNYELTQDN---DSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQ 334
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
L+ +L T E+C TYNMLK+SRHLF WT + ADYYER+L N +LG Q+ E G++ Y
Sbjct: 335 LSKHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSY 393
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
LPL GS K S T +SFWCC G+G E+ +K G++IY+ + G+Y+ +I
Sbjct: 394 FLPLLSGSHKVYS-----TRENSFWCCVGSGFENHAKYGEAIYYHND---QGIYVNLFIP 445
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S ++WK+ I ++Q+ V + L + +TT++ LR P+W S K +
Sbjct: 446 SEVNWKAKGITLHQETAFPVEENTALTI-----QTDKPVTTTIYLRYPSW--SKNVKVNV 498
Query: 576 NGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
NG+ + + PG++++VT+ W D++ P++L+ E D+ P+ A+LYGP VL
Sbjct: 499 NGKKVSVKQKPGSYIAVTRQWKDGDRIEANYPMSLQLETTPDN-PQKG---ALLYGPLVL 554
Query: 635 AGHS 638
AG S
Sbjct: 555 AGES 558
>gi|212690961|ref|ZP_03299089.1| hypothetical protein BACDOR_00451 [Bacteroides dorei DSM 17855]
gi|212666193|gb|EEB26765.1| hypothetical protein BACDOR_00451 [Bacteroides dorei DSM 17855]
Length = 646
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 188/541 (34%), Positives = 296/541 (54%), Gaps = 46/541 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
++ L DVRL + ++ ++ ++VD+L+ +FR A + A E
Sbjct: 48 VRSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 106
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA LM+A+T +E K K ++VS L+ Q +G+GYLSA+
Sbjct: 107 GGWESLDCELRGHTTGHLLSAYGLMYAATGSELFKHKGDSLVSGLAEVQNALGNGYLSAY 166
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY Y+DN +AL + T M ++ Y++++ + +
Sbjct: 167 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEIVTRMADWAYHKLKPLDE 226
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
+ R + + E GG+N+ Y L+ IT D ++ LA F + L DD+
Sbjct: 227 ---VTRR-KMIRNEFGGINESFYNLYAITGDERYRWLARFFYHNEVIDPLKELRDDLGTK 282
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRL 396
H+NT IP V+ YE+T D+ + + H ++ F DP
Sbjct: 283 HTNTFIPKVLAEARNYELTEDEDSRKLSGFFWHTMIDRHTFAPGCSSDKEHYF--DPDHF 340
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
+ ++ T E+C TYNMLK+SRHLF WT + A ADYYER+L N +LG Q+ G++ Y
Sbjct: 341 SKHISGYTGETCCTYNMLKLSRHLFCWTADAAVADYYERALYNHILG-QQDPHTGMVTYF 399
Query: 457 LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
LPL GS K S T +SFWCC G+G E+ +K G++IY+ + G+Y+ +I S
Sbjct: 400 LPLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYHND---KGIYVNLFIPS 451
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
++W+ + + Q+ D P T+ + + T++ LR P+W S G K +N
Sbjct: 452 VVNWREKGLTLRQETD-----FPAEETTVLTIGAQNPVETTVYLRYPSW--SKGVKVFVN 504
Query: 577 GQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G+ + + PG+++++T+ W D++T P+ LR E D+ P+ A++YGP VLA
Sbjct: 505 GKKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQKG---ALIYGPLVLA 560
Query: 636 G 636
G
Sbjct: 561 G 561
>gi|329957171|ref|ZP_08297738.1| hypothetical protein HMPREF9445_02614 [Bacteroides clarus YIT
12056]
gi|328523439|gb|EGF50538.1| hypothetical protein HMPREF9445_02614 [Bacteroides clarus YIT
12056]
Length = 694
Score = 312 bits (800), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 200/559 (35%), Positives = 296/559 (52%), Gaps = 52/559 (9%)
Query: 102 PGQF----KVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKT 157
PGQF K+ + ++ L DVRL + ++ ++ +DV++L+ +FR
Sbjct: 79 PGQFAGKMKLNTVAPVKVESFDLQDVRLLPSRFRDNMLRDSV-WMTSIDVNRLIHSFRTN 137
Query: 158 ARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVS 210
A + A E YGGWE CELRGH GH LSA LM+A+T +E K K ++V+
Sbjct: 138 AGIWAGREGGYVTVKKYGGWESLDCELRGHTTGHLLSAYGLMYAATGSEIFKLKGDSIVT 197
Query: 211 ALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRM 270
L Q +G+GYLSAFP E +R VWAP+YT+HK+ +GL+DQY YADNA+AL +
Sbjct: 198 ELGKVQDALGNGYLSAFPEELINRNIKGQSVWAPWYTLHKLFSGLIDQYLYADNAQALAV 257
Query: 271 TTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK 330
T M ++ Y++ +K S E + + E GG+N+ Y L+ +T D ++ LAH F
Sbjct: 258 VTKMGDWAYDK----LKPLSEETRRRMIRNEFGGINESFYNLYAVTGDERYRWLAHFFYH 313
Query: 331 PCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESS--GTNIGHFN 388
+ L Q DD+ H+NT IP V+ YE+TGD K+ L T I H
Sbjct: 314 NDVIDPLKEQNDDLGTKHTNTFIPKVLAEARNYELTGD---KDSKALSDFFWHTMIDHHT 370
Query: 389 FKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLT 438
F D KR + L+ T E+C TYNMLK+SRHLF W + ADYYER+L
Sbjct: 371 FAPGCSSQKEHYFDTKRFSHFLNGYTGETCCTYNMLKLSRHLFCWQPDARIADYYERALY 430
Query: 439 NGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY 498
N +LG Q+ + G++ Y LPL G+ K S T +SFWCC G+G E+ +K G+ IY
Sbjct: 431 NHILG-QQDPQTGMVCYFLPLLSGAHKVYS-----TKENSFWCCVGSGFENHAKYGEGIY 484
Query: 499 FEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSL 558
+ G+YI +I S + WK I + Q+ P T+ + T++
Sbjct: 485 YRSAA---GIYINLFIPSVVRWKEKGITLKQETA-----FPAGEATVLTVEADRPVRTTV 536
Query: 559 NLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDD 617
LR P+W S +NG+ + + PG+++++ + W + D++ P+ + E D+
Sbjct: 537 YLRYPSW--SEKVTVRVNGKKVQVKRKPGSYIALNRLWQNGDRIEAAYPMRVHLETTPDN 594
Query: 618 RPEYASIQAILYGPYVLAG 636
P+ A+LYGP VLAG
Sbjct: 595 -PQKG---ALLYGPLVLAG 609
>gi|237712552|ref|ZP_04543033.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|229453873|gb|EEO59594.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
Length = 640
Score = 312 bits (799), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 188/541 (34%), Positives = 295/541 (54%), Gaps = 46/541 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
+K L DVRL + ++ ++ ++VD+L+ +FR A + A E
Sbjct: 42 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 100
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA LM+A+T +E K K ++VS L+ Q +G+GYLSA+
Sbjct: 101 GGWESLDCELRGHTTGHLLSAYGLMYAATGSELFKHKGDSLVSGLAEVQNALGNGYLSAY 160
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY Y+DN +AL + T M ++ Y++++ + +
Sbjct: 161 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEIVTRMADWAYHKLKPLDE 220
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
+ R + + E GG+N+ Y L+ IT D ++ LA F + L DD+
Sbjct: 221 ---VTRR-KMIRNEFGGINESFYNLYAITGDERYRWLARFFYHNEVIDPLKELRDDLGTK 276
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRL 396
H+NT IP V+ YE+T D+ + + H ++ F DP
Sbjct: 277 HTNTFIPKVLAEARNYELTEDEDSRKLSGFFWHTMIDRHTFAPGCSSDKEHYF--DPDHF 334
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
+ ++ T E+C TYNMLK+S HLF WT + A ADYYER+L N +LG Q+ G++ Y
Sbjct: 335 SKHISGYTGETCCTYNMLKLSSHLFCWTADAAVADYYERALYNHILG-QQDPHTGMVTYF 393
Query: 457 LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
LPL GS K S T +SFWCC G+G E+ +K G++IY+ + G+Y+ +I S
Sbjct: 394 LPLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYHND---KGIYVNLFIPS 445
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
++W+ + + Q+ D P T+ + + T++ LR P+W S G K +N
Sbjct: 446 VVNWREKGLTLRQETD-----FPAEETTVLTIGAQNPVETTVYLRYPSW--SKGVKVFVN 498
Query: 577 GQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G+ + + PG+++++T+ W D++T P+ LR E D+ P+ A++YGP VLA
Sbjct: 499 GKKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQKG---ALIYGPLVLA 554
Query: 636 G 636
G
Sbjct: 555 G 555
>gi|255692201|ref|ZP_05415876.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides
finegoldii DSM 17565]
gi|260622065|gb|EEX44936.1| hypothetical protein BACFIN_07304 [Bacteroides finegoldii DSM
17565]
Length = 644
Score = 312 bits (799), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 198/540 (36%), Positives = 296/540 (54%), Gaps = 49/540 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
LK+V L R + + A T++ DV++L+ +FR A + A E
Sbjct: 50 LKDVRLLPSRFRDNMLRDSAWMTSI------DVNRLLHSFRTNAGVFAGREGGYMTVKKL 103
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA LM+A+T +E K K ++V+ L Q + +GYLSA+
Sbjct: 104 GGWESLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAW 163
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY YADN +AL + T M ++ YN+ +K
Sbjct: 164 PEELINRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALIIVTRMGDWAYNK----LK 219
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
S E + E GG+N+ Y L+ IT D ++ LA F + L DD+
Sbjct: 220 PLSEETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTK 279
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS----------DPKRLA 397
H+NT IP VI YE+T ++ ++ + T I H F DPK+L+
Sbjct: 280 HTNTFIPKVIAEARNYELTRNETSRKLSEFFWH-TMIDHHTFAPGCSSDKEHYFDPKKLS 338
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
+L T E+C TYNMLK+SRHLF WT + + ADYYER+L N +LG Q+ E G++ Y L
Sbjct: 339 QHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFL 397
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL GS K S T +SFWCC G+G E+ +K G++IY+ G+Y+ +I S+
Sbjct: 398 PLLSGSHKLYS-----TKENSFWCCVGSGFENHAKFGEAIYYHNN---QGIYVNLFIPSQ 449
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ WK + + Q+ + + R TL + + T++ LR P+W S K +NG
Sbjct: 450 VTWKEKGLTIRQETE--FPQEETTRFTLQAENP---VRTTIYLRYPSW--SKDVKVLVNG 502
Query: 578 QDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + + PG+++++T+ W DD+++ P+ ++ EA D+ P A A+LYGP VLAG
Sbjct: 503 KKISVKQKPGSYIAITREWKDDDQISATYPMQIKLEATPDN-PNKA---ALLYGPLVLAG 558
>gi|423212948|ref|ZP_17199477.1| hypothetical protein HMPREF1074_01009 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694204|gb|EIY87432.1| hypothetical protein HMPREF1074_01009 [Bacteroides xylanisolvens
CL03T12C04]
Length = 642
Score = 311 bits (798), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 203/562 (36%), Positives = 298/562 (53%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK+V L R + + A T++ DV +L+
Sbjct: 27 PGQHQGKMKKETVAPVRVESFDLKDVCLLPSRFRDNMLRDSAWMTSI------DVSRLLH 80
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA ALM+A+T +E K K
Sbjct: 81 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKG 140
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSAFP E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 141 DSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQ 200
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL+ T M ++ YN+ +K S E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 201 QALKTVTKMGDWAYNK----LKPLSEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLA 256
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ K+ + T I
Sbjct: 257 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKKLSEFFWH-TMID 315
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK + +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 316 HHTFAPGCSSDKEHFFDPKNFSKHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 375
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC G+G E+ +K G+
Sbjct: 376 ALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVGSGFENHAKYGE 429
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ G+Y+ +I S++ WK + + Q+ + P TL +
Sbjct: 430 AIYYHNN---QGIYVNLFIPSQVTWKEKGVTLLQETE-----FPKEETTLLTIRAEKPVR 481
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S A+ +NG+ + + PG+++++T+ W +D+++ P+ + EA
Sbjct: 482 TTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRISATYPMQIELEAT 539
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
P+ + A+LYGP VLAG
Sbjct: 540 ----PDNPNKVALLYGPLVLAG 557
>gi|423287825|ref|ZP_17266676.1| hypothetical protein HMPREF1069_01719 [Bacteroides ovatus
CL02T12C04]
gi|392671840|gb|EIY65311.1| hypothetical protein HMPREF1069_01719 [Bacteroides ovatus
CL02T12C04]
Length = 643
Score = 311 bits (798), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 201/562 (35%), Positives = 305/562 (54%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK++ L R + + A T++ DV++L+
Sbjct: 27 PGQHQGKMKKETVAPVRVESFDLKDIRLLPSRFRDNMLRDSAWMTSI------DVNRLLH 80
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA AL++A+T +E K K
Sbjct: 81 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALIYAATGSEIFKLKG 140
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSAFP E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 141 DSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNL 200
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL++ T M ++ YN+++++ + E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 201 QALKVVTKMGDWAYNKLKSLTE----ETRKLMIRNEFGGINESFYNLYAITGDERYRWLA 256
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ ++ + T I
Sbjct: 257 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARSYELTRNETSRKLSEFFWH-TMID 315
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK+L+ +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 316 HHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 375
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC G+G E+ +K G+
Sbjct: 376 ALYNHILG-QQDPETGMVAYFLPLLSGSHKLYS-----TKENSFWCCVGSGFENHAKYGE 429
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ G+Y+ +I S++ WK + + Q+ + + R TL + +
Sbjct: 430 AIYYHNN---QGIYVNLFIPSQVTWKEKGLTIRQETE--FPQEETTRFTLQAENP---VR 481
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S K +NG+ + + PG+++ +T+ W D+++ P+ ++ EA
Sbjct: 482 TTIYLRYPSW--SKDVKVLVNGKKISVKQKPGSYIVITREWKDGDQISATYPMQIKLEAT 539
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
D+ P A A+LYGP VLAG
Sbjct: 540 PDN-PNKA---ALLYGPLVLAG 557
>gi|424790951|ref|ZP_18217449.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
pv. graminis ART-Xtg29]
gi|422797791|gb|EKU25992.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
pv. graminis ART-Xtg29]
Length = 651
Score = 311 bits (797), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 185/533 (34%), Positives = 290/533 (54%), Gaps = 40/533 (7%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVG-HYLSASAL 191
+A+ + YL+ + D+L+ FR A L + EP GGWE P CE+RGHF G HYLSA AL
Sbjct: 74 QARDRDRRYLMSIPNDRLLHTFRLVAGLDSQAEPLGGWESPHCEIRGHFAGGHYLSACAL 133
Query: 192 MWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKI 251
++A+T + +LK+K A+V+ L+ CQ+ GY+ A+P+ +DRL VW P YT HKI
Sbjct: 134 LYAATGDAALKDKADALVAELARCQR--ADGYIGAYPSSFYDRLGRHEEVWVPIYTAHKI 191
Query: 252 LAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYK 311
LAG LD +A NA+ALR + F + + + + + + L E GG++ L +
Sbjct: 192 LAGHLDMARHAGNAQALRTA----QRFADWLGAWMDGFDDAQWQRILGVEFGGVHASLLE 247
Query: 312 LFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLH 371
L+ ++ D K+ A +++ L LA Q D ++G H+NT IP ++ + YE+ G
Sbjct: 248 LYLLSGDAKYQRWATRYEQASLLEPLAQQRDALAGLHANTQIPKIVAAARAYEIDGAPRQ 307
Query: 372 KE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLF 421
++ GH +G + + P A +L ++ E C +YNMLK++RHL+
Sbjct: 308 RQIAEFFWRTVSGHHAYCTG-GVSDYEMFGKPDHFAGHLSGHSHECCCSYNMLKLTRHLY 366
Query: 422 RWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWC 481
W + A DYYER L N LG Q E G+M+Y +P+ G K + TP SFWC
Sbjct: 367 TWQPDAALMDYYERVLFNARLGTQ--DEAGMMMYFVPMDAGYWKL-----YNTPFASFWC 419
Query: 482 CYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYL 541
C GTG+E F+K DSIYF ++ G+ + +I+S+LDW + V Q+ +
Sbjct: 420 CTGTGVEEFAKSNDSIYFRDDA---GLTVNLFIASQLDWAERGLRVVQR----TRFPQQE 472
Query: 542 RVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDK 600
L F K T L LRIP W ++ G + +NG+ + +PG++L++ + ++ D+
Sbjct: 473 GTALEFQCKRPQQMT-LRLRIPYW-ATQGVRLRINGKAQAVKATPGSYLALERRFADGDR 530
Query: 601 LTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSD 653
+ + LP+ L + P+ S+QA++YGP VLA +G I + +SD
Sbjct: 531 IELDLPMALHAAPL----PDEPSLQAMMYGPLVLAAQ-LGSDGIDPAQLHVSD 578
>gi|423239921|ref|ZP_17221036.1| hypothetical protein HMPREF1065_01659 [Bacteroides dorei
CL03T12C01]
gi|392644910|gb|EIY38644.1| hypothetical protein HMPREF1065_01659 [Bacteroides dorei
CL03T12C01]
Length = 646
Score = 311 bits (797), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 188/541 (34%), Positives = 294/541 (54%), Gaps = 46/541 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
+K L DVRL + ++ ++ ++VD+L+ +FR A + A E
Sbjct: 48 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGVFAGREGGYMTVKKL 106
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA LM+A+T +E K K ++VS L Q +G+GYLSA+
Sbjct: 107 GGWESLDCELRGHTTGHLLSAYGLMYAATGSELFKHKGDSLVSGLVEVQNALGNGYLSAY 166
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY Y+DN +AL + T M ++ Y++++ + +
Sbjct: 167 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEIVTRMADWAYHKLKPLDE 226
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
+ R + + E GG+N+ Y L+ IT D ++ LA F + L DD+
Sbjct: 227 ---VTRR-KMIRNEFGGINESFYNLYAITGDERYRWLARFFYHNEVIDPLKELRDDLGTK 282
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRL 396
H+NT IP V+ YE+T D+ + + H ++ F DP
Sbjct: 283 HTNTFIPKVLAEARNYELTEDEDSRKLSGFFWHTMIDRHTFAPGCSSDKEHYF--DPDHF 340
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
+ ++ T E+C TYNMLK+S HLF WT + A ADYYER+L N +LG Q+ G++ Y
Sbjct: 341 SKHISGYTGETCCTYNMLKLSSHLFCWTADAAVADYYERALYNHILG-QQDPHTGMVTYF 399
Query: 457 LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
LPL GS K S T +SFWCC G+G E+ +K G++IY+ + G+Y+ +I S
Sbjct: 400 LPLLSGSHKVYS-----TKENSFWCCVGSGFENHAKYGEAIYYHND---KGIYVNLFIPS 451
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
++W+ + + Q+ D P T+ + + T++ LR P+W S G K +N
Sbjct: 452 VVNWREKGLTLRQETD-----FPAEETTVLTIGAQNPVETTVYLRYPSW--SKGVKVFVN 504
Query: 577 GQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G+ + + PG+++++T+ W D++T P+ LR E D+ P+ A++YGP VLA
Sbjct: 505 GKKIAVKQKPGSYIAITRLWKDGDRITADYPMCLRVETTPDN-PQKG---ALIYGPLVLA 560
Query: 636 G 636
G
Sbjct: 561 G 561
>gi|160883345|ref|ZP_02064348.1| hypothetical protein BACOVA_01314 [Bacteroides ovatus ATCC 8483]
gi|156111329|gb|EDO13074.1| hypothetical protein BACOVA_01314 [Bacteroides ovatus ATCC 8483]
Length = 643
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 201/562 (35%), Positives = 304/562 (54%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK++ L R + + A T++ DV++L+
Sbjct: 27 PGQHQGKMKKETVAPVRVESFDLKDIRLLPSRFRDNMLRDSAWMTSI------DVNRLLH 80
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA AL++A+T +E K K
Sbjct: 81 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALIYAATGSEIFKLKG 140
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSAFP E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 141 DSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNL 200
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL++ T M ++ YN+ +K + E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 201 QALKVVTKMGDWAYNK----LKPLTEETRKLMIRNEFGGINESFYNLYAITGDERYRWLA 256
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ ++ + T I
Sbjct: 257 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTRNETSRKLSEFFWH-TMID 315
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK+L+ +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 316 HHTFAPGCSSDKEHYFDPKKLSQHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 375
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL G+ K S T +SFWCC G+G E+ +K G+
Sbjct: 376 ALYNHILG-QQDPETGMVAYFLPLLSGAHKLYS-----TKENSFWCCVGSGFENHAKYGE 429
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ G+Y+ +I S++ WK + + Q+ + + R TL + +
Sbjct: 430 AIYYHNN---QGIYVNLFIPSQVTWKEKGLTIRQETE--FPQEETTRFTLRTENP---VR 481
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S K +NG+ + + PG+++ +T+ W D+++ P+ ++ EA
Sbjct: 482 TTIYLRYPSW--SKDVKVLVNGKKISVKQKPGSYIVITREWKDGDQISATYPMQIKLEAT 539
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
D+ P+ A A+LYGP VLAG
Sbjct: 540 PDN-PDKA---ALLYGPLVLAG 557
>gi|298384470|ref|ZP_06994030.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
gi|298262749|gb|EFI05613.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
Length = 641
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 204/562 (36%), Positives = 299/562 (53%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK+V L R + + A T+L DV++L+
Sbjct: 27 PGQHQGKMKKETVAPIRVQSFDLKDVRLLASRFRDNMLRDSAWMTSL------DVNRLLH 80
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA ALM+A+T +E K K
Sbjct: 81 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKG 140
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSA+P E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 141 DSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYSGLIDQYLYADNQ 200
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL + T M ++ YN+ +K S E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 201 QALSVVTKMGDWAYNK----LKPLSEETRRLMIRNEFGGINESFYNLYAITGDERYRWLA 256
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ K+ + T I
Sbjct: 257 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKKLSEFFWH-TMID 315
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK+ + +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 316 HHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 375
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC G+G E+ +K G+
Sbjct: 376 ALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVGSGFENHAKYGE 429
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ + G+Y+ +I S++ WK + + Q+ D + R+TL
Sbjct: 430 AIYYHND---KGIYVNLFIPSQVTWKEKGLTLLQETD--FPKEETTRLTLRAEKPRH--- 481
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S K +NG+ + + PG+++++T+ W D++ P+ + EA
Sbjct: 482 TTIYLRYPSW--SKNVKVLVNGKKVSVKQKPGSYIAITREWKDGDRIAATYPMQIELEAT 539
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
P+ + A+LYGP VLAG
Sbjct: 540 ----PDNPNKVALLYGPLVLAG 557
>gi|29345547|ref|NP_809050.1| hypothetical protein BT_0137 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337439|gb|AAO75244.1| Acetyl-CoA carboxylase-like protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 641
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 204/562 (36%), Positives = 299/562 (53%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK+V L R + + A T+L DV++L+
Sbjct: 27 PGQHQGKMKKETVAPIRVQSFDLKDVRLLASRFRDNMLRDSAWMTSL------DVNRLLH 80
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA ALM+A+T +E K K
Sbjct: 81 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKG 140
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSA+P E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 141 DSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYSGLIDQYLYADNQ 200
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL + T M ++ YN+ +K S E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 201 QALSVVTKMGDWAYNK----LKPLSEETRRLMIRNEFGGINESFYNLYAITGDERYRWLA 256
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ K+ + T I
Sbjct: 257 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKKLSEFFWH-TMID 315
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK+ + +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 316 HHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 375
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC G+G E+ +K G+
Sbjct: 376 ALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVGSGFENHAKYGE 429
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ + G+Y+ +I S++ WK + + Q+ D + R+TL
Sbjct: 430 AIYYHND---KGIYVNLFIPSQVTWKEKGLTLLQETD--FPKEETTRLTLRAEKPRH--- 481
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S K +NG+ + + PG+++++T+ W D++ P+ + EA
Sbjct: 482 TTIYLRYPSW--SKNVKVLVNGKKVSVKQKPGSYIAITREWKDGDRIAATYPMQIELEAT 539
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
P+ + A+LYGP VLAG
Sbjct: 540 ----PDNPNKVALLYGPLVLAG 557
>gi|383123868|ref|ZP_09944538.1| hypothetical protein BSIG_4114 [Bacteroides sp. 1_1_6]
gi|251838901|gb|EES66986.1| hypothetical protein BSIG_4114 [Bacteroides sp. 1_1_6]
Length = 641
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 204/562 (36%), Positives = 299/562 (53%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK+V L R + + A T+L DV++L+
Sbjct: 27 PGQHQGKMKKETVAPIRVQSFDLKDVRLLASRFRDNMLRDSAWMTSL------DVNRLLH 80
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA ALM+A+T +E K K
Sbjct: 81 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKG 140
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSA+P E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 141 DSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYSGLIDQYLYADNQ 200
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL + T M ++ YN+ +K S E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 201 QALSVVTKMGDWAYNK----LKPLSEETRRLMIRNEFGGINESFYNLYAITGDERYRWLA 256
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ K+ + T I
Sbjct: 257 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKKLSEFFWH-TMID 315
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK+ + +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 316 HHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 375
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC G+G E+ +K G+
Sbjct: 376 ALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVGSGFENHAKYGE 429
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ + G+Y+ +I S++ WK + + Q+ D + R+TL
Sbjct: 430 AIYYHND---KGIYVNLFIPSQVTWKEKGLTLLQETD--FPKEETTRLTLRAEKPRH--- 481
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S K +NG+ + + PG+++++T+ W D++ P+ + EA
Sbjct: 482 TTIYLRYPSW--SKNVKVLVNGKKVSVKQKPGSYIAITREWKDGDRIAATYPMQIELEAT 539
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
P+ + A+LYGP VLAG
Sbjct: 540 ----PDNPNKVALLYGPLVLAG 557
>gi|345011855|ref|YP_004814209.1| hypothetical protein [Streptomyces violaceusniger Tu 4113]
gi|344038204|gb|AEM83929.1| protein of unknown function DUF1680 [Streptomyces violaceusniger Tu
4113]
Length = 849
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 201/526 (38%), Positives = 279/526 (53%), Gaps = 44/526 (8%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWA 194
Q N YL +D+D+L+ FR L + +P GGWE P+ ELRGH GH LS AL +A
Sbjct: 72 QSRNTAYLRFVDIDRLLHTFRLNVGLSSAAQPCGGWESPTTELRGHSTGHLLSGLALTYA 131
Query: 195 STHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEALIPVWAPYYTIH 249
+T + + ++K A+VSAL+ACQ G GYLSAFP FDRLEA VWAPYYTIH
Sbjct: 132 ATGDTAPRDKGRALVSALAACQARSPAAGYGQGYLSAFPESFFDRLEAGTGVWAPYYTIH 191
Query: 250 KILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVL 309
KI+AGL+DQY A NAEAL+ + R K S ++ + L E GGMNDVL
Sbjct: 192 KIMAGLVDQYRLAGNAEALQTVLRQAAWVDTRT----GKLSYDQMQRVLQTEFGGMNDVL 247
Query: 310 YKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQM-------- 361
L IT D + L +A F LA D ++G H+NT IP ++G+
Sbjct: 248 ADLHEITGDSRWLKVAERFTHARVFDPLARNEDRLAGLHANTQIPKMVGAMRLWEEGLDS 307
Query: 362 RYEVTGDQLHK--EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRH 419
RY G+ K H G N F +P +A+ L N E+C +YNMLK++R
Sbjct: 308 RYRTIGENFWKIVTDHHTYVIGGNSNGEAFH-EPDAIAAQLSDNACENCNSYNMLKLTRL 366
Query: 420 L-FRWTKEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSKERSY------HH 471
+ F + DYYER+L N +LG Q + G IY LAPGS K++ +
Sbjct: 367 IHFHAPERTDLLDYYERTLLNQMLGEQDPDSAHGFNIYYTGLAPGSFKQQPSFMGTDPNQ 426
Query: 472 WGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKV 531
+ T D+F C +G+G+E+ +K D+IY + + + +I S L W+ I Q
Sbjct: 427 YSTDYDNFSCDHGSGMETQAKFADTIYTYADRS---LLVNLFIPSELRWQDKGITWRQ-- 481
Query: 532 DPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPSPGNFLS 590
+ TLT +S G+ L L +RIP+W + GA+ATLNG L P PG++L
Sbjct: 482 --TTGFPDQQTTTLTVASGGASL--ELRVRIPSWAA--GARATLNGTTLADRPEPGSWLI 535
Query: 591 VTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + W + D++ + LP+ L + DD +QA+LYGP VLAG
Sbjct: 536 IDRQWRTGDRVEVTLPMKLTFDPTPDD----PDVQAVLYGPVVLAG 577
>gi|336404833|ref|ZP_08585521.1| hypothetical protein HMPREF0127_02834 [Bacteroides sp. 1_1_30]
gi|335940654|gb|EGN02520.1| hypothetical protein HMPREF0127_02834 [Bacteroides sp. 1_1_30]
Length = 640
Score = 309 bits (792), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 202/562 (35%), Positives = 298/562 (53%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK+V L R + + A T++ DV +L+
Sbjct: 25 PGQHQGKMKKETVAPVRVESFDLKDVRLLPSRFRDNMLRDSAWMTSI------DVSRLLH 78
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA ALM+A+T +E K K
Sbjct: 79 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKG 138
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSAFP E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 139 DSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQ 198
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL+ T M ++ YN+ +K S E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 199 QALKTVTKMGDWAYNK----LKPLSEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLA 254
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ K+ + T I
Sbjct: 255 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKKLSEFFWH-TMID 313
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK+ + +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 314 HHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 373
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC G+G E+ +K G+
Sbjct: 374 ALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVGSGFENHAKYGE 427
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ G+Y+ +I S++ WK + + Q+ + P T +
Sbjct: 428 AIYYHNN---QGIYVNLFIPSQVTWKEKGLTLLQETE-----FPKEETTRFTIRAEKPVR 479
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S A+ +NG+ + + PG+++++T+ W +D+++ P+ + EA
Sbjct: 480 TTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRISATYPMQIALEAT 537
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
P+ + A+LYGP VLAG
Sbjct: 538 ----PDNPNKVALLYGPLVLAG 555
>gi|294646892|ref|ZP_06724513.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|292637837|gb|EFF56234.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
Length = 640
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 202/562 (35%), Positives = 298/562 (53%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK+V L R + + A T++ DV +L+
Sbjct: 25 PGQHQGKMKKETVAPVRVESFDLKDVRLLPSRFRDNMLRDSAWMTSI------DVSRLLH 78
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA ALM+A+T +E K K
Sbjct: 79 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKG 138
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSAFP E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 139 DSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQ 198
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL+ T M ++ YN+ +K S E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 199 QALKTVTKMGDWAYNK----LKPLSEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLA 254
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ K+ + T I
Sbjct: 255 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKKLSEFFWH-TMID 313
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK+ + +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 314 HHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 373
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC G+G E+ +K G+
Sbjct: 374 ALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVGSGFENHAKYGE 427
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ G+Y+ +I S++ WK + + Q+ + P T +
Sbjct: 428 AIYYHNN---QGIYVNLFIPSQVTWKEKGLTLLQETE-----FPKEETTRFTIRAEKPVR 479
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S A+ +NG+ + + PG+++++T+ W +D+++ P+ + EA
Sbjct: 480 TTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRISATYPMQIALEAT 537
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
P+ + A+LYGP VLAG
Sbjct: 538 ----PDNPNKVALLYGPLVLAG 555
>gi|345512074|ref|ZP_08791613.1| hypothetical protein BSAG_00984 [Bacteroides sp. D1]
gi|229443482|gb|EEO49273.1| hypothetical protein BSAG_00984 [Bacteroides sp. D1]
Length = 640
Score = 309 bits (791), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 202/562 (35%), Positives = 298/562 (53%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK+V L R + + A T++ DV +L+
Sbjct: 25 PGQHQGKMKKETVAPVRVESFDLKDVRLLPSRFRDNMLRDSAWMTSI------DVSRLLH 78
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA ALM+A+T +E K K
Sbjct: 79 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKG 138
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSAFP E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 139 DSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQ 198
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL+ T M ++ YN+ +K S E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 199 QALKTVTKMGDWAYNK----LKPLSEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLA 254
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ K+ + T I
Sbjct: 255 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKKLSEFFWH-TMID 313
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK+ + +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 314 HHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 373
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC G+G E+ +K G+
Sbjct: 374 ALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVGSGFENHAKYGE 427
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ G+Y+ +I S++ WK + + Q+ + P T +
Sbjct: 428 AIYYHNN---QGIYVNLFIPSQVTWKEKGLTLLQETE-----FPKEETTRFTIRAEKPVR 479
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S A+ +NG+ + + PG+++++T+ W +D+++ P+ + EA
Sbjct: 480 TTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRISATYPMQIALEAT 537
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
P+ + A+LYGP VLAG
Sbjct: 538 ----PDNPNKVALLYGPLVLAG 555
>gi|262407449|ref|ZP_06083997.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|262354257|gb|EEZ03349.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
Length = 642
Score = 309 bits (791), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 202/562 (35%), Positives = 298/562 (53%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK+V L R + + A T++ DV +L+
Sbjct: 27 PGQHQGKMKKETVAPVRVESFDLKDVRLLPSRFRDNMLRDSAWMTSI------DVSRLLH 80
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA ALM+A+T +E K K
Sbjct: 81 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKG 140
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSAFP E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 141 DSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQ 200
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL+ T M ++ YN+ +K S E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 201 QALKTVTKMGDWAYNK----LKPLSEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLA 256
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ K+ + T I
Sbjct: 257 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKKLSEFFWH-TMID 315
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK+ + +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 316 HHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 375
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC G+G E+ +K G+
Sbjct: 376 ALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVGSGFENHAKYGE 429
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ G+Y+ +I S++ WK + + Q+ + P T +
Sbjct: 430 AIYYHNN---QGIYVNLFIPSQVTWKEKGLTLLQETE-----FPKEETTRFTIRAEKPVR 481
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S A+ +NG+ + + PG+++++T+ W +D+++ P+ + EA
Sbjct: 482 TTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRISATYPMQIALEAT 539
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
P+ + A+LYGP VLAG
Sbjct: 540 ----PDNPNKVALLYGPLVLAG 557
>gi|427386207|ref|ZP_18882404.1| hypothetical protein HMPREF9447_03437 [Bacteroides oleiciplenus YIT
12058]
gi|425726247|gb|EKU89112.1| hypothetical protein HMPREF9447_03437 [Bacteroides oleiciplenus YIT
12058]
Length = 641
Score = 309 bits (791), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 190/542 (35%), Positives = 293/542 (54%), Gaps = 48/542 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
++ L D+RL + +L ++ + ++L+ +FR A + A E
Sbjct: 43 VQSFDLKDIRLLPSRFRDNMMRDSL-WMTSIATNRLLHSFRNNAGVFAGREGGYMTVKKL 101
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CE+RGH GH LSA ALM+A++ +E K K ++VS L+ Q +G+GYLSA+
Sbjct: 102 GGWESLDCEIRGHTTGHLLSAYALMYAASGSEIFKLKGDSLVSGLAEVQDALGNGYLSAY 161
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY Y DN +AL++ T M ++ YN+ +K
Sbjct: 162 PEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALKVVTRMGDWAYNK----LK 217
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
E + + E GG+N+ Y L+ IT D ++ LA+ F + L Q DD+
Sbjct: 218 PLDEETRKRMIRNEFGGVNESFYNLYAITGDERYHWLANFFYHNDVIDPLKEQRDDLGTK 277
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESS--GTNIGHFNFKS----------DPKR 395
H+NT IP V+ YE+T + E L T I H F DP++
Sbjct: 278 HTNTFIPKVLAEARNYELTQN---AESRTLTDFFWHTMIAHHTFAPGCSSDKEHYFDPQQ 334
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
+ +L T E+C TYNMLK+SRHLF WT + + ADYYER+L N +LG Q+ E G+ Y
Sbjct: 335 FSKHLTGYTGETCCTYNMLKLSRHLFCWTGDASIADYYERALYNHILG-QQDPETGMFSY 393
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
LPL GS K S T +SFWCC G+G E+ +K G++IY++ E G+Y+ +I
Sbjct: 394 FLPLLSGSHKVYS-----TQENSFWCCVGSGFENHAKYGEAIYYQNE---KGIYVNLFIP 445
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S ++WK + + Q+ + P T+ + T++ LR P+W S ++
Sbjct: 446 SEVNWKEKGMTIRQETN-----FPAEETTILSIHAKEPVKTTVYLRYPSW--SKKVTVSV 498
Query: 576 NGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
NG+ + + PG++++VT+ W DK+ P+ ++ E D+ P+ A++YGP VL
Sbjct: 499 NGKKVSVKQKPGSYIAVTRQWKDGDKIEANYPMEIQLETTPDN-PQKG---ALVYGPLVL 554
Query: 635 AG 636
AG
Sbjct: 555 AG 556
>gi|294810816|ref|ZP_06769462.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|294442004|gb|EFG10825.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 642
Score = 308 bits (790), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 202/562 (35%), Positives = 298/562 (53%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK+V L R + + A T++ DV +L+
Sbjct: 27 PGQHQGKMKKETVAPVRVESFDLKDVRLLPSRFRDNMLRDSAWMTSI------DVSRLLH 80
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA ALM+A+T +E K K
Sbjct: 81 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKG 140
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSAFP E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 141 DSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQ 200
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL+ T M ++ YN+ +K S E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 201 QALKTVTKMGDWAYNK----LKPLSEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLA 256
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ K+ + T I
Sbjct: 257 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKKLSEFFWH-TMID 315
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK+ + +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 316 HHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 375
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC G+G E+ +K G+
Sbjct: 376 ALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVGSGFENHAKYGE 429
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ G+Y+ +I S++ WK + + Q+ + P T +
Sbjct: 430 AIYYHNN---QGIYVNLFIPSQVTWKEKGLTLLQETE-----FPKEETTRFTIRAEKPVR 481
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S A+ +NG+ + + PG+++++T+ W +D+++ P+ + EA
Sbjct: 482 TTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRISATYPMQIALEAT 539
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
P+ + A+LYGP VLAG
Sbjct: 540 ----PDNPNKVALLYGPLVLAG 557
>gi|433678837|ref|ZP_20510648.1| hypothetical protein BN444_02893 [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430816044|emb|CCP41169.1| hypothetical protein BN444_02893 [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 648
Score = 308 bits (789), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 190/539 (35%), Positives = 288/539 (53%), Gaps = 42/539 (7%)
Query: 128 DSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFV-GHYL 186
D +A++ N YL+ + +L+ NFR A L + EP GGWE P CELRGHF GHYL
Sbjct: 66 DGPFLQARERNRRYLMSIPNARLLHNFRLVAGLSSDAEPLGGWESPKCELRGHFAGGHYL 125
Query: 187 SASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYY 246
SA AL++A+T + +LK+K A+V+ L+ CQ++ GYL A+P + RL VW P Y
Sbjct: 126 SACALLYAATSDAALKDKADALVAELARCQRQ--DGYLGAYPAAFYARLRRGEDVWVPLY 183
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQ-TLNEEAGGM 305
T HKILAG LD +A NA+ALR ++ + + WQ L E GG+
Sbjct: 184 TAHKILAGHLDMARHAGNAQALRSAQRFADWLGAWMDGC-----DDAQWQHILGVEFGGV 238
Query: 306 NDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEV 365
+ L +L+ ++ DPK+ A + +P L LA Q D ++G H+NT IP ++ + YE+
Sbjct: 239 QESLLELYLLSGDPKYQRWAARYAQPALLEPLAQQRDALAGLHANTQIPKIVAAARAYEI 298
Query: 366 TGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLK 415
G+ ++ GH +G + P A L ++ E C +YNMLK
Sbjct: 299 GGEPRQRDIAAFFWRTVSGHHAYCTG-GTSDYELFGKPDHFAGRLSGHSHECCCSYNMLK 357
Query: 416 VSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTP 475
++RHL+ W + A DYYER L N LG Q E G+++Y +P+ G K + TP
Sbjct: 358 LTRHLYTWQPDAALMDYYERVLFNARLGTQ--DEAGMLMYFVPMDAGYWKL-----YNTP 410
Query: 476 SDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVV 535
SFWCC GTG+E F+K DSIYF + G+ + +I+S+LDW + V Q+
Sbjct: 411 FASFWCCTGTGVEEFAKSNDSIYFRDAA---GLTVNLFIASQLDWPERGLRVVQR----T 463
Query: 536 SWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKT 594
+ L F K T L LRIP W ++ G + +NG+ + +PG++L++ +
Sbjct: 464 RFPQQEGTALEFQCKRPQQMT-LRLRIPYW-ATQGVRLRINGKAQAIKATPGSYLALQRR 521
Query: 595 WSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSD 653
++ D++ + LP+ L + P+ S+QA++YGP VLA +G I + +SD
Sbjct: 522 FADGDRIELDLPMALHAAPL----PDEPSLQAMMYGPLVLAAQ-LGSDGIDPAQLHVSD 575
>gi|302548275|ref|ZP_07300617.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Streptomyces
hygroscopicus ATCC 53653]
gi|302465893|gb|EFL28986.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Streptomyces
himastatinicus ATCC 53653]
Length = 849
Score = 308 bits (789), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 202/530 (38%), Positives = 282/530 (53%), Gaps = 52/530 (9%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWA 194
Q N YL +D+++L+ FR + + +P GGWE P+ ELRGH GH LS AL +A
Sbjct: 72 QSRNTAYLRFVDINRLLHTFRLNVGIASSAQPCGGWESPTTELRGHSTGHLLSGLALTYA 131
Query: 195 STHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEALIPVWAPYYTIH 249
+T + +L +K +VSAL+ACQ + +GYLSAFP FDRLEA VWAPYYTIH
Sbjct: 132 NTGDTALLDKSRKLVSALAACQAKSPAAGYRTGYLSAFPENFFDRLEAGSGVWAPYYTIH 191
Query: 250 KILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGM 305
KI+AGL+DQY A NAEA LR W V + S ++ + L E GGM
Sbjct: 192 KIMAGLVDQYRLAGNAEALETVLRQAAW--------VDTRTARLSYDQMQRVLETEYGGM 243
Query: 306 NDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGS------ 359
NDVL L IT D + L +A F L+ D ++G H+NT IP ++G+
Sbjct: 244 NDVLADLHAITGDSRWLRVAERFTHARVFDPLSRNEDRLAGLHANTQIPKMVGALRLWEE 303
Query: 360 --QMRYEVTGDQLHK--EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLK 415
RY G+ K H G N F +P +A+ L + E+C +YNMLK
Sbjct: 304 GLDSRYRTIGENFWKIVTDHHTYVIGGNSNGEAFH-EPDAIAAQLSGSCCENCNSYNMLK 362
Query: 416 VSRHL-FRWTKEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSKERSY---- 469
++R + F + DYYER+L N +LG Q + G IY LAPGS K++
Sbjct: 363 LARLIHFHAPERTDLLDYYERTLFNQMLGEQDPDSAHGFNIYYTGLAPGSFKQQPSFMGP 422
Query: 470 --HHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVV 527
+ + T D+F C +G+G+E+ +K D+IY + + + +I S L W+ I
Sbjct: 423 DPNQYSTDYDNFSCDHGSGMETHAKFADTIYTRGDRS---LLVNLFIPSELRWQEKGITW 479
Query: 528 NQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPSPG 586
Q + TLT SS G+ L L +RIP+W S GA+A LNG LP P PG
Sbjct: 480 RQ----TTGFPDQQTTTLTVSSGGASL--ELRVRIPSWAS--GARAALNGATLPDQPKPG 531
Query: 587 NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
++L + + W + D++ + LP+ LR + DD P+ IQA+LYGP VLAG
Sbjct: 532 SWLIIDRQWKTGDRVEVTLPMKLRLDPTPDD-PD---IQAVLYGPVVLAG 577
>gi|298483785|ref|ZP_07001958.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
gi|298270079|gb|EFI11667.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
Length = 642
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 189/514 (36%), Positives = 282/514 (54%), Gaps = 43/514 (8%)
Query: 141 YLLMLDVDKLVWNFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMW 193
++ +DV++L+ +FR A + A E GGWE CELRGH GH LSA ALM+
Sbjct: 69 WMTSIDVNRLLHSFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMY 128
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILA 253
A+T +E K K ++V+ L+ Q + GYLSAFP E +R VWAP+YT+HK+ +
Sbjct: 129 AATGSEIFKLKGDSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYS 188
Query: 254 GLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLF 313
GL+DQY YADN +AL+ T M ++ YN+ +K S E + E GG+N+ Y L+
Sbjct: 189 GLIDQYLYADNQQALKTVTKMGDWAYNK----LKPLSEETRKLMIRNEFGGVNESFYNLY 244
Query: 314 CITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE 373
IT D ++ LA F + L DD+ H+NT IP VI YE+T ++ K+
Sbjct: 245 AITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKK 304
Query: 374 GHQLESSGTNIGHFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRW 423
+ T I H F DPK+ + +L T E+C TYNMLK+SRHLF W
Sbjct: 305 LSEFFWH-TMIDHHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCW 363
Query: 424 TKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCY 483
T + + ADYYER+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC
Sbjct: 364 TGDSSIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCV 417
Query: 484 GTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRV 543
G+G E+ +K G++IY+ G+Y+ +I S++ WK + + Q+ P
Sbjct: 418 GSGFENHAKYGEAIYYHNN---QGIYVNLFIPSQVTWKEKGLTLLQETG-----FPKEET 469
Query: 544 TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLT 602
T + T++ LR P+W S A+ +NG+ + + PG+++++T+ W +D+++
Sbjct: 470 TRFTIRAEKPVRTTVYLRYPSW--SKKAEVLVNGKKVAVKQKPGSYIAITRDWKDNDRIS 527
Query: 603 IQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
P+ + EA P+ + A+LYGP VLAG
Sbjct: 528 ATYPMQIALEAT----PDNPNKVALLYGPLVLAG 557
>gi|440732599|ref|ZP_20912422.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
DAR61454]
gi|440368630|gb|ELQ05659.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
DAR61454]
Length = 652
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 190/539 (35%), Positives = 287/539 (53%), Gaps = 42/539 (7%)
Query: 128 DSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFV-GHYL 186
D +A++ N YL+ + +L+ NFR A L + EP GGWE P CELRGHF GHYL
Sbjct: 70 DGPFLQARERNRRYLMSIPNARLLHNFRLVAGLSSDAEPLGGWESPKCELRGHFAGGHYL 129
Query: 187 SASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYY 246
SA AL++A+T + +LK+K A+V+ L+ CQ++ GYL A+P + RL VW P Y
Sbjct: 130 SACALLYAATGDAALKDKADALVAELARCQRQ--DGYLGAYPAAFYARLRRGEDVWVPLY 187
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQ-TLNEEAGGM 305
T HKILAG LD +A NA+ALR ++ + + WQ L E GG+
Sbjct: 188 TAHKILAGHLDMARHAGNAQALRSAQRFADWLGAWMDGC-----DDAQWQHILGVEFGGV 242
Query: 306 NDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEV 365
+ L +L+ ++ DPK+ A + +P L LA Q D ++G H+NT IP ++ + YE+
Sbjct: 243 QESLLELYLLSGDPKYQRWAARYAQPALLEPLAQQRDALAGLHANTQIPKIVAAARAYEI 302
Query: 366 TGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLK 415
D ++ GH +G + P A L ++ E C +YNMLK
Sbjct: 303 GRDPRQRDVAAFFWRTVSGHHAYCTG-GTSDYELFGKPDHFAGRLSGHSHECCCSYNMLK 361
Query: 416 VSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTP 475
++RHL+ W + A DYYER L N LG Q E G+++Y +P+ G K + TP
Sbjct: 362 LTRHLYTWQPDAALMDYYERVLFNARLGTQ--DEAGMLMYFVPMDAGYWKL-----YNTP 414
Query: 476 SDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVV 535
SFWCC GTG+E F+K DSIYF + G+ + +I+S+LDW + V Q+
Sbjct: 415 FASFWCCTGTGVEEFAKSNDSIYFRDAA---GLTVNLFIASQLDWPERGLRVVQR----T 467
Query: 536 SWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKT 594
+ L F K T L LRIP W ++ G + +NG+ + +PG++L++ +
Sbjct: 468 RFPQQEGTALVFQCKRPQQMT-LRLRIPYW-ATQGVRLRINGKAQAIKATPGSYLALQRR 525
Query: 595 WSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSD 653
++ D++ + LP+ L + P+ S+QA++YGP VLA +G I + +SD
Sbjct: 526 FADGDRIELDLPMALHAAPL----PDEPSLQAMMYGPLVLAAQ-LGSDGIDPAQLHVSD 579
>gi|299146414|ref|ZP_07039482.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
gi|298516905|gb|EFI40786.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
Length = 642
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 201/562 (35%), Positives = 297/562 (52%), Gaps = 58/562 (10%)
Query: 102 PGQFK--------VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
PGQ + P R F LK+V L R + + A T++ DV +L+
Sbjct: 27 PGQHQGKMKKETVAPVRVESFDLKDVRLLPSRFRDNMLRDSAWMTSI------DVSRLLH 80
Query: 153 NFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKM 205
+FR A + A E GGWE CELRGH GH LSA ALM+A+T +E K K
Sbjct: 81 SFRTNAGVFAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKG 140
Query: 206 SAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNA 265
++V+ L+ Q + GYLSAFP E +R VWAP+YT+HK+ +GL+DQY YADN
Sbjct: 141 DSLVNGLTEVQNALKGGYLSAFPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQ 200
Query: 266 EALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
+AL+ T M ++ YN+ +K S E + E GG+N+ Y L+ IT D ++ LA
Sbjct: 201 QALKTVTKMGDWAYNK----LKPLSEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLA 256
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIG 385
F + L DD+ H+NT IP VI YE+T ++ K+ + T I
Sbjct: 257 EYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTENETSKKLSEFFWH-TMID 315
Query: 386 HFNFKS----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
H F DPK+ + +L T E+C TYNMLK+SRHLF WT + + ADYYER
Sbjct: 316 HHTFAPGCSSDKEHFFDPKKFSKHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYER 375
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ E G++ Y LPL GS K S T +SFWCC G+G E+ +K G+
Sbjct: 376 ALYNHILG-QQDPETGMVTYFLPLLSGSHKLYS-----TKENSFWCCVGSGFENHAKYGE 429
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ G+Y+ +I S++ WK + + Q+ + P T +
Sbjct: 430 AIYYHNN---QGIYVNLFIPSQVTWKEKGLTLLQETE-----FPKEETTRFIIRAEKPVR 481
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLPSP-GNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T++ LR P+W S A+ +NG+ + + G+++++T+ W +D+++ P+ + EA
Sbjct: 482 TTVYLRYPSW--SKKAEVLVNGKKVAVKQKSGSYIAITRDWKDNDRISATYPMQIELEAT 539
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
P+ + A+LYGP VLAG
Sbjct: 540 ----PDNPNKVALLYGPLVLAG 557
>gi|383115004|ref|ZP_09935763.1| hypothetical protein BSGG_0819 [Bacteroides sp. D2]
gi|313693284|gb|EFS30119.1| hypothetical protein BSGG_0819 [Bacteroides sp. D2]
Length = 643
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 195/540 (36%), Positives = 295/540 (54%), Gaps = 49/540 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
LK+V L R + + A T++ DV++L+ +FR A + A E
Sbjct: 50 LKDVRLLPSRFRDNMLRDSAWMTSI------DVNRLLHSFRTNAGVFAGREGGYMTVKKL 103
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA LM+A+T +E K K ++V+ L Q + +GYLSA+
Sbjct: 104 GGWESLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAW 163
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY YADN +AL + T M ++ YN+ +K
Sbjct: 164 PEELINRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALTIVTRMGDWAYNK----LK 219
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
S E + E GG+N+ Y L+ IT D ++ LA F + L DD+
Sbjct: 220 PLSEETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTK 279
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS----------DPKRLA 397
H+NT IP VI YE+T ++ ++ + T I H F DPK+L+
Sbjct: 280 HTNTFIPKVIAEARNYELTRNETSRKLSEFFWH-TMIDHHTFAPGCSSDKEHYFDPKKLS 338
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
+L T E+C TYNMLK+SRHLF WT + + ADYYER+L N +LG Q+ E G++ Y L
Sbjct: 339 QHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFL 397
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL GS K S T +SFWCC G+G E+ +K G++IY+ G+Y+ +I S+
Sbjct: 398 PLLSGSHKLYS-----TKENSFWCCVGSGFENHAKFGEAIYYHNN---QGIYVNLFIPSQ 449
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ WK + + Q+ + + R TL + + T++ LR P+W S K ++NG
Sbjct: 450 VTWKEKGLTIRQETE--FPQEETTRFTLQAENP---VRTTIYLRYPSW--SKDVKVSVNG 502
Query: 578 QDLPLPSP-GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + + G+++++T+ W D+++ P+ ++ E D+ P+ A A+LYGP VLAG
Sbjct: 503 KKISVKQKSGSYIAITREWKDGDQISATYPMQIKLETTPDN-PDKA---ALLYGPLVLAG 558
>gi|237722400|ref|ZP_04552881.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448210|gb|EEO54001.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
Length = 644
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 195/540 (36%), Positives = 295/540 (54%), Gaps = 49/540 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
LK+V L R + + A T++ DV++L+ +FR A + A E
Sbjct: 50 LKDVRLLPSRFRDNMLRDSAWMTSI------DVNRLLHSFRTNAGVFAGREGGYMTVKKL 103
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA LM+A+T +E K K ++V+ L Q + +GYLSA+
Sbjct: 104 GGWESLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAW 163
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY YADN +AL + T M ++ YN+ +K
Sbjct: 164 PEELINRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALTIVTRMGDWAYNK----LK 219
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
S E + E GG+N+ Y L+ IT D ++ LA F + L DD+
Sbjct: 220 PLSEETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTK 279
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS----------DPKRLA 397
H+NT IP VI YE+T ++ ++ + T I H F DPK+L+
Sbjct: 280 HTNTFIPKVIAEARNYELTRNETSRKLSEFFWH-TMIDHHTFAPGCSSDKEHYFDPKKLS 338
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
+L T E+C TYNMLK+SRHLF WT + + ADYYER+L N +LG Q+ E G++ Y L
Sbjct: 339 QHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFL 397
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL GS K S T +SFWCC G+G E+ +K G++IY+ G+Y+ +I S+
Sbjct: 398 PLLSGSHKLYS-----TKENSFWCCVGSGFENHAKFGEAIYYHNN---QGIYVNLFIPSQ 449
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ WK + + Q+ + + R TL + + T++ LR P+W S K ++NG
Sbjct: 450 VTWKEKGLTIRQETE--FPQEETTRFTLQAENP---VRTTIYLRYPSW--SKDVKVSVNG 502
Query: 578 QDLPLPSP-GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + + G+++++T+ W D+++ P+ ++ E D+ P+ A A+LYGP VLAG
Sbjct: 503 KKISVKQKSGSYIAITREWKDGDQISATYPMQIKLETTPDN-PDKA---ALLYGPLVLAG 558
>gi|395774802|ref|ZP_10455317.1| protein [Streptomyces acidiscabies 84-104]
Length = 818
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 198/531 (37%), Positives = 283/531 (53%), Gaps = 55/531 (10%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWA 194
Q+ N YL +D+D+L+ FR LP+ +P GWE P+ ELRGH GH LS AL A
Sbjct: 43 QRRNTAYLRFVDLDRLLHTFRLNVGLPSTAQPCSGWEGPNVELRGHSTGHLLSGLALTHA 102
Query: 195 STHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEALIPVWAPYYTIH 249
+T + L++K +V+AL+ CQ +GYLSAFP FDRLEA VWAPYYT+H
Sbjct: 103 NTGDTELRDKGRRLVAALAECQAASPAAGFNAGYLSAFPESFFDRLEAGTGVWAPYYTLH 162
Query: 250 KILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVL 309
KI+AGL+DQY + N +AL + ++ R + S ER + L+ E GGMNDVL
Sbjct: 163 KIMAGLVDQYRLSGNEQALDVVLRKGDWVDRRTAGL----SYERMQRVLDTEFGGMNDVL 218
Query: 310 YKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGS--------QM 361
L IT D + L +A F LA D ++G H+NT IP ++G+ +
Sbjct: 219 ADLHEITGDARWLAVAERFTHARVFDPLARGEDRLAGLHANTQIPKMVGALRMWEEGLDV 278
Query: 362 RYEVTGDQLHK--EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRH 419
RY G+ + GH G N F +P +A L +T E+C +YNMLK++R
Sbjct: 279 RYRTIGENFWRIVTGHHTYVIGGNSNGEAFH-EPDVIAGQLSDSTCENCNSYNMLKLTRL 337
Query: 420 L-FRWTKEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSKERSYHHWGTPSD 477
L F DYYER+L N +LG Q G+E G IY LAPGS+K + + +P D
Sbjct: 338 LHFHAPGRTDLLDYYERALFNQMLGEQDPGSEHGYNIYYTGLAPGSAKRQP--SFMSPED 395
Query: 478 S-------FWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK 530
+ F C +GTG+E+ +K D+IY +E + + + +I S +DWK+ I
Sbjct: 396 AYSTDYTNFSCDHGTGMETHAKFADTIYTHDEQR---LLVNLFIPSEVDWKAKGI----- 447
Query: 531 VDPVVSWDPYLRV----TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPSP 585
+W R+ T T + +L +R+P W + GA+ LNG+ LP P+P
Sbjct: 448 -----TWRQTTRLPDQDTATLTVTAGQARHALVVRVPGW--ARGARVRLNGRTLPDRPAP 500
Query: 586 GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
G + ++ + W D++ + LPL EA DD PE +QA+L+GP VLAG
Sbjct: 501 GTWFTLDRAWRRGDRVDVTLPLRTTVEATPDD-PE---VQAVLHGPVVLAG 547
>gi|293369447|ref|ZP_06616030.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292635445|gb|EFF53954.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 644
Score = 306 bits (783), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 194/540 (35%), Positives = 295/540 (54%), Gaps = 49/540 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
LK+V L R + + A T++ DV++L+ +FR A + A E
Sbjct: 50 LKDVRLLPSRFRDNMLRDSAWMTSI------DVNRLLHSFRTNAGVFAGREGGYMTVKKL 103
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA LM+A+T +E K K ++V+ L Q + +GYLSA+
Sbjct: 104 GGWESLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAW 163
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY YADN +AL + T M ++ YN+ +K
Sbjct: 164 PEELINRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALTIVTRMGDWAYNK----LK 219
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
S E + E GG+N+ Y L+ IT D ++ LA F + L DD+
Sbjct: 220 PLSEETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTK 279
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS----------DPKRLA 397
H+NT IP VI YE+T ++ ++ + T I H F DP++L+
Sbjct: 280 HTNTFIPKVIAEARNYELTRNETSRKLSEFFWH-TMIDHHTFAPGCSSDKEHYFDPRKLS 338
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
+L T E+C TYNMLK+SRHLF WT + + ADYYER+L N +LG Q+ E G++ Y L
Sbjct: 339 QHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFL 397
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL GS K S T +SFWCC G+G E+ +K G++IY+ G+Y+ +I S+
Sbjct: 398 PLLSGSHKLYS-----TKENSFWCCVGSGFENHAKFGEAIYYHNN---QGIYVNLFIPSQ 449
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ WK + + Q+ + + R TL + + T++ LR P+W S K ++NG
Sbjct: 450 VTWKEKGLTIRQETE--FPQEETTRFTLQAENP---VRTTIYLRYPSW--SKDVKVSVNG 502
Query: 578 QDLPLPSP-GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + + G+++++T+ W D+++ P+ ++ E D+ P+ A A+LYGP VLAG
Sbjct: 503 KKISVKQKSGSYIAITREWKDGDQISATYPMQIKLETTPDN-PDKA---ALLYGPLVLAG 558
>gi|336415976|ref|ZP_08596314.1| hypothetical protein HMPREF1017_03422 [Bacteroides ovatus
3_8_47FAA]
gi|335939879|gb|EGN01751.1| hypothetical protein HMPREF1017_03422 [Bacteroides ovatus
3_8_47FAA]
Length = 644
Score = 305 bits (781), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 195/540 (36%), Positives = 295/540 (54%), Gaps = 49/540 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
LK+V L R + + A T++ DV++L+ +FR A + A E
Sbjct: 50 LKDVRLLPSRFRDNMLRDSAWMTSI------DVNRLLHSFRTNAGVFAGREGGYMTVKKL 103
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA LM+A+T +E K K ++V+ L Q + +GYLSA+
Sbjct: 104 GGWESLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAW 163
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY YADN +AL + T M ++ YN+ +K
Sbjct: 164 PEELINRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALIIVTRMGDWAYNK----LK 219
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
S E + E GG+N+ Y L+ IT D ++ LA F + L DD+
Sbjct: 220 PLSEETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTK 279
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS----------DPKRLA 397
H+NT IP VI YE+T ++ ++ + T I H F DPK+L+
Sbjct: 280 HTNTFIPKVIAEARNYELTRNETSRKLSEFFWH-TMIDHHTFAPGCSSDKEHYFDPKKLS 338
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
+L T E+C TYNMLK+SRHLF WT + + ADYYER+L N +LG Q+ E G++ Y L
Sbjct: 339 QHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFL 397
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL GS K S T +SFWCC G+G E+ +K G++IY+ G+Y+ +I S+
Sbjct: 398 PLLSGSHKLYS-----TKENSFWCCVGSGFENHAKFGEAIYYHNN---QGIYVNLFIPSQ 449
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ WK + + Q+ + + R TL + + T++ LR P+W S K ++NG
Sbjct: 450 VTWKEKGLTIRQETE--FPQEETTRFTLQAENP---VRTTIYLRYPSW--SKDVKVSVNG 502
Query: 578 QDLPLPSP-GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + + G+++++T+ W D+++ P+ ++ E D+ P+ A A+LYGP VLAG
Sbjct: 503 KKIFVKQKSGSYIAITREWKDGDQISATYPMQIKLETTPDN-PDKA---ALLYGPLVLAG 558
>gi|423295661|ref|ZP_17273788.1| hypothetical protein HMPREF1070_02453 [Bacteroides ovatus
CL03T12C18]
gi|392672370|gb|EIY65839.1| hypothetical protein HMPREF1070_02453 [Bacteroides ovatus
CL03T12C18]
Length = 644
Score = 305 bits (780), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 194/540 (35%), Positives = 295/540 (54%), Gaps = 49/540 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-------PY 167
LK+V L R + + A T++ DV++L+ +FR A + A E
Sbjct: 50 LKDVRLLPSRFRDNMLRDSAWMTSI------DVNRLLHSFRTNAGVFAGREGGYMTVKKL 103
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE CELRGH GH LSA LM+A+T +E K K ++V+ L Q + +GYLSA+
Sbjct: 104 GGWESLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAW 163
Query: 228 PTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P E +R VWAP+YT+HK+ +GL+DQY YADN +AL + T + ++ YN+ +K
Sbjct: 164 PEELINRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALTIVTRVGDWAYNK----LK 219
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
S E + E GG+N+ Y L+ IT D ++ LA F + L DD+
Sbjct: 220 PLSEETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTK 279
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS----------DPKRLA 397
H+NT IP VI YE+T ++ ++ + T I H F DPK+L+
Sbjct: 280 HTNTFIPKVIAEARNYELTRNETSRKLSEFFWH-TMIDHHTFAPGCSSDKEHYFDPKKLS 338
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
+L T E+C TYNMLK+SRHLF WT + + ADYYER+L N +LG Q+ E G++ Y L
Sbjct: 339 QHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFL 397
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL GS K S T +SFWCC G+G E+ +K G++IY+ G+Y+ +I S+
Sbjct: 398 PLLSGSHKLYS-----TKENSFWCCVGSGFENHAKFGEAIYYHNN---QGIYVNLFIPSQ 449
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ WK + + Q+ + + R TL + + T++ LR P+W S K ++NG
Sbjct: 450 VTWKEKGLTIRQETE--FPQEETTRFTLQAENP---VRTTIYLRYPSW--SKDVKVSVNG 502
Query: 578 QDLPLPSP-GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + + G+++++T+ W D+++ P+ ++ E D+ P+ A A+LYGP VLAG
Sbjct: 503 KKISVKQKSGSYIAITREWKDGDQISATYPMQIKLETTPDN-PDKA---ALLYGPLVLAG 558
>gi|302844990|ref|XP_002954034.1| hypothetical protein VOLCADRAFT_106211 [Volvox carteri f.
nagariensis]
gi|300260533|gb|EFJ44751.1| hypothetical protein VOLCADRAFT_106211 [Volvox carteri f.
nagariensis]
Length = 1160
Score = 304 bits (778), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 165/363 (45%), Positives = 220/363 (60%), Gaps = 34/363 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLL-MLDVDKLVWNFRKTARLPAPGEPY-GGWEE 172
++ +L DVRL S R ++ N +YLL MLD D+L+W+FRKTA LP PG+PY WE+
Sbjct: 30 IEPFALSDVRLLDTSHQIRYERLNAKYLLEMLDPDRLLWSFRKTAGLPTPGQPYIASWED 89
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIG-SGYLSAFPTEQ 231
P CELRGHFVGHYLSA +L +AST N + +++ +VS L Q+ +G GYLSAFP+E
Sbjct: 90 PGCELRGHFVGHYLSALSLAYASTGNIAFHTRLALMVSELGKVQQALGLGGYLSAFPSEF 149
Query: 232 FDRLEALIPVWAPYYTI-----------HKILAGLLDQYTYADNAEALRMTTWMVEYFYN 280
FDR+EAL PVWAPYYTI HKI+AGL+D Y EAL M + MV Y +N
Sbjct: 150 FDRVEALKPVWAPYYTIPIAPFPDTTQIHKIIAGLVDAYELGGQKEALAMASRMVAYHWN 209
Query: 281 RVQNVIKKYSIERHWQ-TLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
R Q +I E HW LN E GGMN++LY++ IT+DP HL A LF+KP F+ +
Sbjct: 210 RTQALIASKGRE-HWNGVLNCEFGGMNEILYRMHRITKDPTHLEFARLFEKPFFMKPMVN 268
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQLESSGTNIGHFN 388
D + H+NTH+ V G Y+ GD+ + H + G+N
Sbjct: 269 NFDILESLHANTHLAQVAGFAEAYDTVGDEAARNATRNFFDIVTTHHSFATGGSN--DHE 326
Query: 389 FKSDPKRLASNLDSN-----TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG 443
F P R+A ++ T+E+CT YN+LK++R LFRWT +AYAD+YER+L NG+LG
Sbjct: 327 FWQAPDRMADSVIKQKDAVETQETCTQYNILKIARSLFRWTGNVAYADFYERALLNGILG 386
Query: 444 IQR 446
R
Sbjct: 387 TAR 389
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 75/220 (34%), Positives = 119/220 (54%), Gaps = 33/220 (15%)
Query: 450 PGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY---- 505
PGV +YL PL G SK + HHWG P SFWCCYGT +ES +KL DSIYF++
Sbjct: 486 PGVFLYLTPLGTGQSKSDNIHHWGFPYHSFWCCYGTVVESHAKLADSIYFKDMNPQQGGP 545
Query: 506 ---------PGVYIIQYISSRLDWKSGQIVVNQKVD---PVVSWDPYLRV-TLTFSSKGS 552
P +YI Q + S++ W + + + D P + +R L+ ++ GS
Sbjct: 546 SDPSAPKLPPRLYINQLVPSKVTWHELGLRITTEADMFAPGPAATAQIRFDPLSAAAAGS 605
Query: 553 GLTT--SLNLRIPTWTSSNGAKAT----------LNGQ---DLP-LPSPGNFLSVTKTWS 596
L+ +L +R+P W + A T +NGQ P P PG++ VT+ WS
Sbjct: 606 QLSAMFTLMVRVPEWAAREAASGTAGRGRGISIGVNGQSWTSCPGAPVPGSYCQVTRQWS 665
Query: 597 SDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ D ++++LP+ + + ++RP+Y+ +QA++ GP+V+AG
Sbjct: 666 TGDVVSLRLPMRWWLKPLPENRPQYSGLQAVMMGPFVMAG 705
>gi|29827685|ref|NP_822319.1| protein [Streptomyces avermitilis MA-4680]
gi|29604785|dbj|BAC68854.1| putative secreted protein [Streptomyces avermitilis MA-4680]
Length = 854
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 198/528 (37%), Positives = 281/528 (53%), Gaps = 48/528 (9%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWA 194
Q+ N YL +D+D+L+ FR LP+ EP GGWE P ELRGH GH LS AL A
Sbjct: 77 QRRNSAYLRFVDIDRLLHTFRTNVGLPSDAEPCGGWEGPGVELRGHSTGHLLSGLALAHA 136
Query: 195 STHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEALIPVWAPYYTIH 249
ST E+L++K +V+AL+ CQ G+GYLSAFP FDRLEA VWAPYYTIH
Sbjct: 137 STGEEALRDKGRRLVAALAECQSAAPAAGFGTGYLSAFPESFFDRLEAGSGVWAPYYTIH 196
Query: 250 KILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVL 309
KI+AGL++QY +AL + + R K S E+ + L E GGMNDVL
Sbjct: 197 KIMAGLVEQYRLVGVGQALEVVLRQARWVDERT----AKLSYEQMQRVLETEFGGMNDVL 252
Query: 310 YKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQM-------- 361
L +T DP+ L +A F LA D ++G H+NT IP ++G+
Sbjct: 253 ADLHALTGDPRWLDVAERFTHARVFDPLAGNQDKLAGLHANTQIPKMVGALRLWEEGRAD 312
Query: 362 RYEVTGD---QLHKEGHQLESSGTNIGH-FNFKSDPKRLASNLDSNTEESCTTYNMLKVS 417
RY + Q+ + H G + G F+ +P +A L NT E+C +YNMLK++
Sbjct: 313 RYRTVAENFWQIVTDHHTYVIGGNSNGEAFH---EPDVIAGQLSDNTCENCNSYNMLKLT 369
Query: 418 RHL-FRWTKEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSKERSYHHWGTP 475
R L F DYYER+L N +LG Q +E G IY LAPGS K + P
Sbjct: 370 RLLHFHAPDRTDLLDYYERTLLNQMLGEQDPDSEHGFAIYYTGLAPGSFKRQPSFMGPDP 429
Query: 476 S------DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQ 529
D+F C +GTG+E+ +K D++Y +G+ + + ++ S + W++ I Q
Sbjct: 430 DVYSTDYDNFSCDHGTGMETPAKFADTVY-SHDGR--SLRVNLFVPSEVVWRAKGISWRQ 486
Query: 530 KVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPSPGNF 588
+ TLT SS + L +R+P+W + GA+ATLNG+ LP P PG++
Sbjct: 487 ----TTRFPDRSSTTLTVSSGRA--AHRLLIRVPSWAA--GARATLNGRALPDRPQPGSW 538
Query: 589 LSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
L++ + W + D++ + LP+ EA DD +QA+++GP VLAG
Sbjct: 539 LALERVWRTGDRVEVSLPMRTAVEATPDD----PDVQAVVHGPVVLAG 582
>gi|333382563|ref|ZP_08474231.1| hypothetical protein HMPREF9455_02397 [Dysgonomonas gadei ATCC
BAA-286]
gi|332828505|gb|EGK01205.1| hypothetical protein HMPREF9455_02397 [Dysgonomonas gadei ATCC
BAA-286]
Length = 644
Score = 302 bits (774), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 204/563 (36%), Positives = 299/563 (53%), Gaps = 58/563 (10%)
Query: 98 KIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKT 157
KIK P +V S L DVRL DS + + +++L L VD+L+ +FR T
Sbjct: 30 KIKQPLNGEVKAFS------FDLKDVRL-LDSPFRQNMERESKWILSLGVDRLLHSFRNT 82
Query: 158 ARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVS 210
A + A E GGWE CELRGH +GH +S A ++AST +E K K ++V+
Sbjct: 83 AGVYAGREGGYMTIKKLGGWESLDCELRGHSIGHIMSGLAYLYASTGDERYKIKADSLVA 142
Query: 211 ALSACQK---EIG-SGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAE 266
L+ Q E G GY+SA+P +R A VWAP+YT+HK+ AGL+DQY Y DN E
Sbjct: 143 GLAEVQDILIENGQKGYISAYPENLINRNIAGKSVWAPWYTLHKVYAGLIDQYLYCDNKE 202
Query: 267 ALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAH 326
AL + + Y ++ + S E+ L E GG+N+ Y L+ IT +P+H A
Sbjct: 203 ALDIMKEAASWAYQKLMPL----SEEQRALMLRNEFGGVNEAFYNLYAITGNPEHKKSAE 258
Query: 327 LFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQ 376
F + LA D+ H+NT IP VIG YE+ + K+ HQ
Sbjct: 259 FFYHADVIDPLAEHKADLYFKHANTFIPKVIGEARNYELHNSERSKDIANFFWNTVIDHQ 318
Query: 377 LESSGTNIGHFNF-KSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
+G N F SD ++ NL T+E+C T NMLK++RHLF W YADYYER
Sbjct: 319 TYCTGGNSHKEKFIHSD--SISKNLTGYTQETCNTNNMLKLTRHLFCWDANAKYADYYER 376
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +LG Q+ + G++ Y LP+ PG+ K S TP +SFWCC GTG E+ +K G+
Sbjct: 377 ALYNHILG-QQDPQSGMVAYFLPMLPGAHKVYS-----TPENSFWCCVGTGFENHAKYGE 430
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
+IY+ + G+Y+ +I S L WK I + Q+ ++ + LT ++ +
Sbjct: 431 AIYYHDNN---GLYVNLFIPSELTWKEKGIKIKQE----TAFPEEGNICLTVTTD-KDIK 482
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLR-TEA 613
+ LR P+WTS+ + +NG+ + SP ++++ +TW + DK+ + P+ L TE
Sbjct: 483 MPVYLRYPSWTSN--VEVKVNGKKTKIKQSPSGYITIDRTWKNGDKIEVHYPMHLYLTET 540
Query: 614 IQDDRPEYASIQAILYGPYVLAG 636
+D P+ A AI+YGP VLAG
Sbjct: 541 --NDNPDKA---AIMYGPLVLAG 558
>gi|332880745|ref|ZP_08448418.1| hypothetical protein HMPREF9074_04196 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357045883|ref|ZP_09107513.1| hypothetical protein HMPREF9441_01526 [Paraprevotella clara YIT
11840]
gi|332681379|gb|EGJ54303.1| hypothetical protein HMPREF9074_04196 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355530889|gb|EHH00292.1| hypothetical protein HMPREF9441_01526 [Paraprevotella clara YIT
11840]
Length = 618
Score = 302 bits (773), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 194/581 (33%), Positives = 294/581 (50%), Gaps = 66/581 (11%)
Query: 89 LFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVD 148
LF W + +++ G+ V + E L HDV L S + R + N +L L+ D
Sbjct: 9 LFLWVAV--RMEAGGKMAVSPSATEMLLPFPSHDVELASSWVKQR-EDLNTAFLRSLEPD 65
Query: 149 KLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAV 208
+L+ NFR A LP+ +P GWE P LRGHFVGHYLSA + + + L + V
Sbjct: 66 RLLHNFRVNAGLPSVAKPLEGWESPGVGLRGHFVGHYLSAVSALVERYEDAGLARNLEKV 125
Query: 209 VSALSACQKEIGSGYLSAFPTEQFDRLEA-LIPVWAPYYTIHKILAGLLDQYTYADNAEA 267
V + ACQ+ G+GYLSAFP + LE VWAPYYT+HKI+ GLLD Y N +A
Sbjct: 126 VEGMYACQQAHGNGYLSAFPETDIEVLETRFTGVWAPYYTLHKIMQGLLDVYLRTGNEKA 185
Query: 268 LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN----EEAGGMNDVLYKLFCITQDPKHLM 323
M + Y +R + + ++ R T + E GGMN+VLY+L+C++ P++L
Sbjct: 186 YAMVEGLAGYV-DRRMSKLDPATVARMMYTADANPQNEMGGMNEVLYQLYCVSGKPRYLE 244
Query: 324 LAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTN 383
LA LFD FL L D +SG H+NTHI +V G RYE TG++ + G + +
Sbjct: 245 LASLFDPSWFLEPLVRNEDILSGLHANTHIALVNGFARRYESTGEECY--GKSVANFWNM 302
Query: 384 IGHFNFK-----------------------SDPKRLASNLDSNTEESCTTYNMLKVSRHL 420
+ HF+ +P L + L ESC T+N +++ L
Sbjct: 303 LMHFHAYVNGTSSGPRPNVTTETSLTAEHWGEPCHLCNTLTKGIAESCVTHNTQRLNASL 362
Query: 421 FRWTKEIAYADYYERSLTNGVLGIQ-RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSF 479
F WT YAD Y N VL +Q R T G +Y LPL GS + ++Y + F
Sbjct: 363 FSWTGNPCYADVYMNMFYNAVLPVQSRST--GAYVYHLPL--GSPRHKAY----MADNDF 414
Query: 480 WCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK----VDPVV 535
CC G+ E+F+KL + IY+ ++ VY+ Y+ S++ W ++ + Q V+P+V
Sbjct: 415 KCCSGSCAEAFAKLNNGIYYHDDS---AVYVNLYVPSKVHWADKKVGLEQAGGFPVEPIV 471
Query: 536 SWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKT 594
+ +R + F LNL IP WT +GA +NG+ +P P +FL +++
Sbjct: 472 DFTVSVRRPVDF---------VLNLFIPAWT--DGAVVYVNGEKQEMPVRPSSFLKLSRR 520
Query: 595 WSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
W+ D++ I+ R +++ P+ ++ A+ YGP +LA
Sbjct: 521 WADGDRVRIEFRYAFRLQSM----PDKENMLAVFYGPMLLA 557
>gi|374984433|ref|YP_004959928.1| secreted protein [Streptomyces bingchenggensis BCW-1]
gi|297155085|gb|ADI04797.1| secreted protein [Streptomyces bingchenggensis BCW-1]
Length = 875
Score = 301 bits (772), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 197/526 (37%), Positives = 281/526 (53%), Gaps = 44/526 (8%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWA 194
Q N YL +D+D+L+ FR L + +P GGWE P+ ELRGH GH LS AL +A
Sbjct: 99 QSRNTAYLRYVDIDRLLHTFRLNVGLASSAQPCGGWESPTTELRGHSTGHLLSGLALSYA 158
Query: 195 STHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEALIPVWAPYYTIH 249
+T + +L +K +VSAL+ACQ + G GYLSAFP FDRLE+ VWAPYYTIH
Sbjct: 159 NTGDTALLDKGRKLVSALAACQAKSPAAGYGQGYLSAFPENFFDRLESGSGVWAPYYTIH 218
Query: 250 KILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVL 309
KI+AGL+DQ+ A NAEAL +VE V K ++ + L E GGMN+VL
Sbjct: 219 KIMAGLVDQHRLAGNAEALD----VVERQAAWVDTRTGKLGYDQMQRVLQTEFGGMNEVL 274
Query: 310 YKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGS--------QM 361
L IT D + L +A F LA D ++G H+NT IP ++G+
Sbjct: 275 ADLHAITGDTRWLRVAERFTHARVFDPLARNEDQLAGLHANTQIPKMVGALRLWEQGLNS 334
Query: 362 RYEVTGDQLHK--EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRH 419
RY G+ K H G N F +P +A+ L +N E+C +YNMLK++R
Sbjct: 335 RYRTIGENFWKIVTDHHTYVIGGNSNGEAFH-EPDAIAAQLSNNCCENCNSYNMLKLTRL 393
Query: 420 L-FRWTKEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSKERSY------HH 471
+ F DYYER+L N +LG Q + G IY LAPG+ K++ +
Sbjct: 394 IHFHAPDRTDLLDYYERTLFNQMLGEQDPDSAHGFNIYYTGLAPGAFKQQPSFMGTDPNQ 453
Query: 472 WGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKV 531
+ T ++F C +G+G+E+ +K D+IY + + + +I S L W+ I Q
Sbjct: 454 YSTDYNNFSCDHGSGMETQAKFADTIYTYADRS---LLVNLFIPSELRWQEKAITWRQN- 509
Query: 532 DPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPSPGNFLS 590
+ TLT +S + L L +RIP W + GA+A LNG LP P PG++L
Sbjct: 510 ---TGFPDQQTTTLTVASGAASL--ELRVRIPAWAT--GARAALNGTTLPDQPKPGSWLV 562
Query: 591 VTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ ++W + D++ + LP+ L+ + DD P+ +QA+LYGP VLAG
Sbjct: 563 IDRSWKAGDRVDVTLPMALKLDPTPDD-PD---VQAVLYGPVVLAG 604
>gi|325106457|ref|YP_004276111.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324975305|gb|ADY54289.1| protein of unknown function DUF1680 [Pedobacter saltans DSM 12145]
Length = 648
Score = 298 bits (763), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 194/574 (33%), Positives = 305/574 (53%), Gaps = 53/574 (9%)
Query: 89 LFSWAMLYRKIKNPGQF--KVPE--RSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLM 144
LF AM + + PGQ K+ + R + L DVRL + ++ + ++L+
Sbjct: 13 LFPIAMFAQSVY-PGQHRNKITKHLRGDVKVYSFDLKDVRLLPSAFRDNMERDS-KWLMS 70
Query: 145 LDVDKLVWNFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTH 197
LDV++L+ +FR TA + + E GGWE C+LRGH GH +SA + ++AST
Sbjct: 71 LDVNRLLHSFRNTAGVFSSKEGGYMTIKKLGGWESLDCDLRGHTTGHIMSALSYLYASTG 130
Query: 198 NESLKEKMSAVVSALSACQ---KEIG-SGYLSAFPTEQFDRLEALIPVWAPYYTIHKILA 253
+E K K ++V+ L+ Q ++G +G++SAFP +R A +WAP+YT+HKI A
Sbjct: 131 DERYKIKSDSIVNGLAEVQYALTKVGQNGFISAFPENFINRNIAGQSIWAPWYTLHKIYA 190
Query: 254 GLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLF 313
GL+DQY Y N +AL + T + Y ++ + + E+ L E GG N+ Y L+
Sbjct: 191 GLIDQYLYCGNEKALDIMTKAASWAYQKLMPLTE----EQRATMLRNEFGGTNEAFYNLY 246
Query: 314 CITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE 373
IT +P+HL LA F L LA + D+ H+NT IP +IG YE+ D+ K+
Sbjct: 247 AITGNPEHLKLAEFFYHNAVLDPLAERKSDLYFKHANTFIPKLIGEARNYELNADKRSKD 306
Query: 374 ----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRW 423
HQ +G N F K ++ NL T+E+C + NMLK++RHLF W
Sbjct: 307 VATFFWDEVVNHQTYCTGGNSHKEKFIHTDK-VSENLTGYTQETCNSNNMLKLTRHLFSW 365
Query: 424 TKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCY 483
YAD+YER+L N +LG Q+ + G++ Y LPL PG SY + T +SFWCC
Sbjct: 366 DANPKYADFYERALYNHILG-QQDPQTGMVAYFLPLLPG-----SYKVYSTAENSFWCCV 419
Query: 484 GTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRV 543
GTG E+ +K G++IY+ +Y+ +I S L W + + Q+ V +++
Sbjct: 420 GTGFENHAKYGEAIYYHNN---TNLYVNLFIPSELTWNEKGVKLKQET--VFPESDLVKL 474
Query: 544 TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLT 602
T+ ++K +LNLR P W S G + +NG+ + + P +++ + +TW + D++
Sbjct: 475 TVQ-TAKSQKF--ALNLRYPYWAS--GVQVKINGKAVKVKQVPSSYIVIDRTWKNGDQII 529
Query: 603 IQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
I+ P++L D+ A++YGP VLAG
Sbjct: 530 IKYPMSLHLAEANDN----VDKAAVMYGPLVLAG 559
>gi|399029634|ref|ZP_10730435.1| hypothetical protein PMI10_02273 [Flavobacterium sp. CF136]
gi|398072450|gb|EJL63666.1| hypothetical protein PMI10_02273 [Flavobacterium sp. CF136]
Length = 642
Score = 295 bits (754), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 183/553 (33%), Positives = 298/553 (53%), Gaps = 46/553 (8%)
Query: 103 GQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPA 162
G+ K+ + + +L DV+L DS ++++ + +L+ +F+ A + +
Sbjct: 31 GKLKMDDTKNVKVLGFNLQDVKL-LDSPFKDNMMRESKWIMDISTKRLLHSFKTNAGVFS 89
Query: 163 PGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSAC 215
E GGWE C+LRGH GH LS AL++A+T + K K ++V+ L
Sbjct: 90 SQEGGYFTVDKLGGWESLDCDLRGHSTGHILSGLALLYAATGEKMYKIKADSLVTGLDEV 149
Query: 216 QKEIG-SGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
QK + +GYLSAFP DR A VWAP+YT HK+ +GL+DQY Y D+ AL + M
Sbjct: 150 QKVLNQNGYLSAFPQNLIDRAIAGKSVWAPWYTQHKLFSGLMDQYLYCDSEPALEIVKGM 209
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
++ Y +++++ E + L E GGMND Y L+ IT + K+ LA F L
Sbjct: 210 ADWAYEKLKSLTN----EERKRMLRNEFGGMNDSFYALYEITAESKYKFLAEFFYHEDAL 265
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNI 384
L + D+++ H+NT+IP +IG YE+ G ++E H +G+N
Sbjct: 266 DPLLNKTDNLNKKHANTYIPKLIGISRDYELEGGSKNREIPEFFWNTVVNHHTFVTGSNS 325
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
F +P L+ +L T ESC YNMLK++RHL+ +I Y DYYE++L N +LG
Sbjct: 326 DKEKF-FEPDHLSEHLSGFTGESCNVYNMLKLTRHLYGVNPQIKYVDYYEKALYNHILG- 383
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ + G++ Y LP+ PG+ K S TP +SFWCC G+G E+ +K G+ IY+ ++
Sbjct: 384 QQDPKTGMVAYFLPMMPGAHKVYS-----TPENSFWCCVGSGFENQAKYGEFIYYHDK-- 436
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
G+Y+ +I S L+WK I+V Q+ S+ TLT S+K ++ +++R P+
Sbjct: 437 --GLYVNLFIPSELNWKEKGIIVKQE----TSFPNVGSTTLTLSTKNP-VSMPISIRYPS 489
Query: 565 WTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYAS 623
W + GA+ +NG+ + PG+++++ + WS D++ + + ++ P+ +
Sbjct: 490 WAA--GAEVKVNGKKQIINVKPGSYITLERKWSDGDRIEVSFGIQIKLAPT----PDNPN 543
Query: 624 IQAILYGPYVLAG 636
+ A+ YGP VLAG
Sbjct: 544 VVAVTYGPIVLAG 556
>gi|413926259|gb|AFW66191.1| hypothetical protein ZEAMMB73_605676 [Zea mays]
gi|413952505|gb|AFW85154.1| hypothetical protein ZEAMMB73_422486 [Zea mays]
Length = 250
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 144/240 (60%), Positives = 174/240 (72%), Gaps = 13/240 (5%)
Query: 361 MRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCT 409
MRYEVTGD L+K+ H + GT+ G F +DPKRLA L + EESCT
Sbjct: 1 MRYEVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEF--WTDPKRLAGTLSTENEESCT 58
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSY 469
TYNMLKVSR+LFRWTKEIAYADYYER+L NGVL IQRGT+PGVMIY+LP APG SK SY
Sbjct: 59 TYNMLKVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSY 118
Query: 470 HHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQ 529
H WGT DSFWCCYGTGIESFSKLGDSIYFEE+G P + IIQYI S +WK+ + V Q
Sbjct: 119 HGWGTKYDSFWCCYGTGIESFSKLGDSIYFEEKGDPPALNIIQYIPSTYNWKAAGLTVTQ 178
Query: 530 KVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFL 589
++ + S D YL+++ + S+ SG T ++N RIP+WT ++GA ATLNG+DL SPG +
Sbjct: 179 QIKTLSSSDQYLQISFSISANTSGQTANINFRIPSWTFADGAGATLNGKDLGSISPGKIV 238
>gi|330995449|ref|ZP_08319354.1| hypothetical protein HMPREF9442_00414 [Paraprevotella xylaniphila
YIT 11841]
gi|329575517|gb|EGG57055.1| hypothetical protein HMPREF9442_00414 [Paraprevotella xylaniphila
YIT 11841]
Length = 618
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 196/569 (34%), Positives = 296/569 (52%), Gaps = 68/569 (11%)
Query: 103 GQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLE--YLLMLDVDKLVWNFRKTARL 160
G KV S L+ S DV L + W Q+ +L+ YL ++ D+L+ NFR TA L
Sbjct: 21 GNGKVESPSVVELRPFSGKDVELEAS---WIKQREDLDVAYLQSVEADRLLHNFRVTAGL 77
Query: 161 PAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIG 220
P+ +P GWE P LRGHF GHYLSA +++ + +++ +V L CQ+ G
Sbjct: 78 PSLAKPLEGWESPGVGLRGHFTGHYLSALSVLAERYGDGWASQRLEYMVDELYKCQQAHG 137
Query: 221 SGYLSAFPTEQFDRLEA-LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFY 279
+GYLSAFP + F+ LE VWAPYYT+HKIL GLLD YT N +A M + Y
Sbjct: 138 NGYLSAFPEKDFETLETRFTGVWAPYYTLHKILQGLLDAYTKTGNRKAYGMVEALAGYVE 197
Query: 280 NRVQNVIKKYSIERHWQTL----NEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
R+ + + IER T+ EAG MN+ LY+L+ I+ +P+HL LA FD FL
Sbjct: 198 GRMAKLSPE-RIERMMYTVEANPQNEAGAMNEALYELYGISGNPRHLALAACFDPAWFLE 256
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNI 384
L D ++G H+NTHI +V G RYEVTG++ +K+ GH +GT+
Sbjct: 257 PLVRNEDILAGLHANTHIVLVNGFARRYEVTGEEKYKKAAMQFWDILQRGHAY-VNGTSS 315
Query: 385 GHFNFKS-----------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYY 433
G + +P L + L ESC T+N K+S +LF WT + YAD Y
Sbjct: 316 GPRPVVTTRTSLTAEHWGEPGHLCNTLTREIAESCVTHNTQKLSAYLFGWTGDPCYADAY 375
Query: 434 ERSLTNGVLGIQ-RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSK 492
+ NG L +Q R T G +Y LPL GS + + Y + F+CC G+ E+F+K
Sbjct: 376 MNTFYNGALPVQSRST--GAYVYHLPL--GSPRNKKY----LKDNDFFCCSGSCAEAFAK 427
Query: 493 LGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK----VDPVVSWDPYLRVTLTFS 548
L IY+ ++ V++ Y+ S L W S ++ + Q + P+ + +R ++F
Sbjct: 428 LNSGIYYHDDS---AVFVNLYVPSELHWTSKKVELEQTGGFPLQPIADFTVSVRRPVSF- 483
Query: 549 SKGSGLTTSLNLRIPTWTSSNGAKATLNG--QDLPLPSPGNFLSVTKTWSSDDKLTIQLP 606
+LNL +P W + G +NG QD+P+ P +FL +++ W+ D++ +
Sbjct: 484 --------TLNLFVPAW--AEGTVVYVNGEKQDMPV-RPSSFLRISRRWADGDRVRMDFR 532
Query: 607 LTLRTEAIQDDRPEYASIQAILYGPYVLA 635
R +++ P+ ++ A+ YGP +LA
Sbjct: 533 YAFRLQSM----PDKENMFAVFYGPMLLA 557
>gi|329849035|ref|ZP_08264063.1| hypothetical protein ABI_21080 [Asticcacaulis biprosthecum C19]
gi|328844098|gb|EGF93667.1| hypothetical protein ABI_21080 [Asticcacaulis biprosthecum C19]
Length = 773
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 184/537 (34%), Positives = 277/537 (51%), Gaps = 58/537 (10%)
Query: 132 WR-AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASA 190
WR A N YLL L+ D+L+ NF K+A L G+ YGGWE + + GH +GHYL+A
Sbjct: 45 WRDAVDANGHYLLSLEPDRLLHNFHKSAGLAPKGDIYGGWE--NMGIAGHSLGHYLTALG 102
Query: 191 LMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA------------- 237
L +A T + + K K+ VS ++ QK G GY+ E+ +L+
Sbjct: 103 LAYAQTRDPAYKAKLDYTVSEMAIIQKAHGDGYIGGTTVERDGKLQDGKIVYEEVRKHVI 162
Query: 238 ------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSI 291
L W P YT HK+ AGLLD + YA+N +AL++ M +Y V+ S
Sbjct: 163 TSHGFDLNGGWVPLYTWHKVHAGLLDAHRYANNGQALKIAIGMSDYLIG----VLGDLSD 218
Query: 292 ERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNT 351
E + L E GG+N+ +++ T D ++L A L LA + D++ G H+NT
Sbjct: 219 EEMQKVLAAEHGGLNETYAEMYVRTGDKRYLDTARRIYHKAVLTPLAQRRDELEGKHANT 278
Query: 352 HIPIVIGSQMRYEVTGDQLHKEG-----------HQLESSGTNIG-HFNFKSDPKRLASN 399
IP +IG YEVTGD+ + + H G + G HF P +L+
Sbjct: 279 QIPKLIGLARLYEVTGDKAYGDTASYFWDRVIHHHSYVIGGNSAGEHFGA---PDKLSGR 335
Query: 400 LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL 459
LD T ESC TYNMLK++RHL++W + A+ DYYER+ N +L Q + G +Y +PL
Sbjct: 336 LDDKTCESCNTYNMLKLTRHLYQWQPDAAWFDYYERAHLNHILAHQ-DPQTGAFVYFVPL 394
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
A GS + S TP SFWCC G+G+ES +K GDSI++ + G VY +I S L
Sbjct: 395 ASGSQRLYS-----TPDTSFWCCVGSGMESHAKHGDSIWWRQAGGGDTVYANLFIPSELS 449
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
W + D ++ +P VT T + +G+ T L +R+P W ++G + ++NG++
Sbjct: 450 WTDKATKIALSGD-ILKGEP---VTFTVTPQGTADFT-LAIRVPKW--ADGPRLSVNGKN 502
Query: 580 LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
PL ++ V + W + D + + LP L+ E + P+ + A + GP V+AG
Sbjct: 503 TPLLVKNGYVRVRRAWKAGDTVVLTLPHALKVETM----PDNPRLAAFIKGPMVMAG 555
>gi|345851934|ref|ZP_08804893.1| secreted protein [Streptomyces zinciresistens K42]
gi|345636594|gb|EGX58142.1| secreted protein [Streptomyces zinciresistens K42]
Length = 867
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 203/572 (35%), Positives = 284/572 (49%), Gaps = 62/572 (10%)
Query: 110 RSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGG 169
R L L +VRL ++T+ YLL +D D+L+ FR TA LP+ +P GG
Sbjct: 58 RGTPALDAFGLSEVRLLESPFLANMRRTS-AYLLFVDADRLLHTFRLTAGLPSSAQPCGG 116
Query: 170 WEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS-----GYL 224
WE P +LRGH GH LSA A A T + EK A+V+AL+ CQ+ + GYL
Sbjct: 117 WEAPDVQLRGHTTGHLLSALAQAHAHTGERAYAEKGRALVAALAECQRAAPAAGFTRGYL 176
Query: 225 SAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEAL----RMTTWM----VE 276
SAFP F RLEA WAPYYT+HKI+AGLLDQY A + +AL M W
Sbjct: 177 SAFPESVFARLEAGGKPWAPYYTLHKIMAGLLDQYLLAGDRQALDVLREMAAWAEARTAP 236
Query: 277 YFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGL 336
Y ++QNV++ E GGMNDVL +L+ T DP HL A FD
Sbjct: 237 LPYPQMQNVLRV------------EFGGMNDVLMRLYLETGDPAHLRTARRFDHEDLYAP 284
Query: 337 LALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGH 386
LA D+++G H+NT I ++G+ YE TGD + + H + G N
Sbjct: 285 LAAGRDELAGRHANTEIAKIVGTVPSYEATGDTRYLDIADTFWTTVVRHHSYAIGGNSNQ 344
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIA-YADYYERSLTNGVLGIQ 445
F P + S L T E+C +YNMLK+ R LF + A Y D+YE +L N +LG Q
Sbjct: 345 ELF-GPPDEIVSRLSDVTCENCNSYNMLKLGRGLFLHRPDRAGYMDHYEWTLYNQMLGEQ 403
Query: 446 R-GTEPGVMIYLLPLAPGSSKERSYHHWGTPS------DSFWCCYGTGIESFSKLGDSIY 498
+ G + Y L GS +E P D+F C +GTG+E+ +K DS+Y
Sbjct: 404 DPASAHGFVTYYTGLWAGSRREPKAGLGSAPGSYSSDYDNFSCDHGTGLETHTKFADSVY 463
Query: 499 FEEEGKYPGV---YIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
F G GV Y+ +I S + W+ + V QK S+ R LT + +
Sbjct: 464 FRSRGTRDGVPSLYVNLFIPSEVRWRQTGVTVRQK----TSYPSEGRTRLTVVAGRARF- 518
Query: 556 TSLNLRIPTWTSSNGAKATL--NGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTE 612
+L +RIP+W + G +A L NG+ + PG + +V +TW + D + + LP
Sbjct: 519 -ALRIRIPSWVAGTGREAVLEVNGRGVAARLRPGTYATVERTWHTGDTVDLTLP----RR 573
Query: 613 AIQDDRPEYASIQAILYGPYVLAGHSIGDWDI 644
+ P+ ++++ YGP VLAG GD D+
Sbjct: 574 PVWTAAPDNPQVRSVSYGPLVLAGE-YGDDDL 604
>gi|374324035|ref|YP_005077164.1| hypothetical protein HPL003_21035 [Paenibacillus terrae HPL-003]
gi|357203044|gb|AET60941.1| hypothetical protein HPL003_21035 [Paenibacillus terrae HPL-003]
Length = 767
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 190/576 (32%), Positives = 292/576 (50%), Gaps = 51/576 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEP 173
+KE HDVRL +S A L+Y+ +D D++++NFR TA + G +P GW+ P
Sbjct: 191 VKEFKGHDVRLEKESEFGAAMDRFLQYVRSVDDDQMLYNFRATAAVDTKGAQPMTGWDAP 250
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI------GSGYLSAF 227
C L+GH GHYLSA AL + +T + +L K+ +V+ L CQ + G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYNATEDSALLGKIQYMVAELGKCQTALSEQAGYGRGFLSAY 310
Query: 228 PTEQFDRLEALI---PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
EQF+ LE +WAPYYT+HKI+AGLLD Y A EAL + + + +NR+
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALEICDKLGHWLHNRLSR 370
Query: 285 VIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ ++ + + W + E GGMN+VL KL+ IT +L+ A FD + D
Sbjct: 371 LPRE-QLHKMWSLYIAGEFGGMNEVLAKLYAITSHEHYLITAKYFDNEKLFLPMKENVDT 429
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLH-----------KEGHQLESSGTNIGHFNFKSD 392
+ H+N HIP VIG+ +EV G++ + + H G G +
Sbjct: 430 LGNMHANQHIPQVIGALKLFEVAGEKAYFKIAENFWTMVTQRHIYSIGGA--GETEMFRE 487
Query: 393 PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP-G 451
P +A L T E+C +YNMLK+++ LF++ Y DYYE++L N +L + + G
Sbjct: 488 PDAIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEG 547
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
Y +PLAPGS K+ H CC+GTG+E+ K ++IYF +E + +Y+
Sbjct: 548 GSTYFMPLAPGSIKKFDTH-------ENTCCHGTGLENHFKYQEAIYFYDEDR---LYVN 597
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
YI S+LDW + + QK D + + G T+L RIP W S
Sbjct: 598 LYIPSQLDWSEQGLSLIQKRDQSSLEKAHFYIE-------GGTETTLMFRIPDWVSEP-V 649
Query: 572 KATLNGQDL-PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+ +NG+ L +L + K W +D++ + LP +LR + +D + ++ YG
Sbjct: 650 QVKINGEPCRDLEYEHGYLKLRKVW-KEDEIELTLPRSLRLASAPNDH----TFMSLTYG 704
Query: 631 PYVLAGHSIGDWDITESATSLSDWITPIPASYNSQL 666
PYVLA S G+ D S +++ I +S L
Sbjct: 705 PYVLAAIS-GEQDYISWTYSEQEFLEQIIPQKDSPL 739
>gi|256394133|ref|YP_003115697.1| hypothetical protein Caci_4996 [Catenulispora acidiphila DSM 44928]
gi|256360359|gb|ACU73856.1| protein of unknown function DUF1680 [Catenulispora acidiphila DSM
44928]
Length = 846
Score = 288 bits (737), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 202/520 (38%), Positives = 273/520 (52%), Gaps = 50/520 (9%)
Query: 139 LEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHN 198
L YL +D D+L++ FR T + P GGWE+P+ ELRGH GH +SA A +AST +
Sbjct: 84 LAYLRFVDPDRLLYMFRTTVGIATSASPCGGWEDPTEELRGHSTGHIMSALAQAYASTGD 143
Query: 199 ESLKEKMSAVVSALSACQKEIG-----SGYLSAFPTEQFDRLEALIPVWAPYYTIHKILA 253
+LK K VS+L+ACQ +GYLSAFP FDRLE+ VWAPYYTIHKI+A
Sbjct: 144 STLKSKGDYFVSSLAACQAASPAAGFHTGYLSAFPESFFDRLESGQSVWAPYYTIHKIMA 203
Query: 254 GLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLF 313
GLLDQY A N +AL + M + R + S + L E GGM +VL L+
Sbjct: 204 GLLDQYLVAGNTQALTVLKGMAAWVKTRTDPL----SHSQMQAVLQTEFGGMPEVLAHLY 259
Query: 314 CITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLH-- 371
+T D L A FD LA D ++GFH+NT +P +IG+ Y TG +
Sbjct: 260 QVTGDANTLTAAQRFDHAQIEDPLAAGTDQLAGFHANTQVPKIIGALREYLATGTARYLT 319
Query: 372 --------KEGHQL-ESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFR 422
GH + E G + G + F++ P +AS L + T E C TYN LK+SR LF
Sbjct: 320 IAQNFWAITTGHHMYEIGGFSNGEY-FQT-PNAIASQLSNTTCEVCVTYNELKLSRGLFF 377
Query: 423 W-TKEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFW 480
AY DYYER L N VLG Q + G + Y PL PG K S + + F
Sbjct: 378 TDPTRAAYLDYYERGLFNTVLGQQDPASSHGFVCYYTPLQPGGYKTYSNDY-----NDFT 432
Query: 481 CCYGTGIESFSKLGDSIYFEEEGKYPG--VYIIQYISSRLDWKSGQIVVNQKVD-PVVSW 537
C +GTG+ES +K DSIYF Y G +Y+ +I+S+L W I V Q P S
Sbjct: 433 CDHGTGMESNTKYADSIYF-----YNGETLYVNLFIASQLAWPGRAITVRQDTTFPAASS 487
Query: 538 DPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSS 597
R+T+T G+G +L +R+P+W S K Q+L +PG +L++ +TW+S
Sbjct: 488 S---RLTIT----GAG-HIALKIRVPSWCSGMTVKVNGTLQNL-TATPGTYLTIDRTWAS 538
Query: 598 DDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
D + + LP L DD +++Q + YG VLAG
Sbjct: 539 GDVVDLALPAKLTFVPAPDD----STVQVVKYGGIVLAGQ 574
>gi|393783247|ref|ZP_10371422.1| hypothetical protein HMPREF1071_02290 [Bacteroides salyersiae
CL02T12C01]
gi|392669526|gb|EIY63014.1| hypothetical protein HMPREF1071_02290 [Bacteroides salyersiae
CL02T12C01]
Length = 1022
Score = 287 bits (735), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 192/584 (32%), Positives = 298/584 (51%), Gaps = 80/584 (13%)
Query: 115 LKEVSLH--DVRLGSDSMHWRAQQTNLEYLL-MLDVDKLVWNFRKTARLPAPGEPYGGWE 171
+K S H +RL DS A + ++L+ L D+ + F A LP G YGGWE
Sbjct: 47 IKAYSFHLKQIRL-LDSPFKTAMNADRKWLMETLKPDRFLHRFHANAGLPTKGTIYGGWE 105
Query: 172 EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ 231
+ + G GHY+SA ++++A+T E +K ++ +S L CQ + G+GY+ A P E
Sbjct: 106 --NTDQSGFSFGHYISALSMLYATTGEEDIKIRLDYCISELKRCQDKRGTGYVGAIPNE- 162
Query: 232 FDRL-----EALIP--------VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYF 278
D+L + +I VW P+Y +HK+ +GL+D Y + +N A + + ++
Sbjct: 163 -DKLWDDVSKGIIDGRNFNLNNVWVPWYNLHKLWSGLIDAYIFGENETAKTIVIALTDWA 221
Query: 279 YNRVQNVIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLL 337
++ +++ E WQ L E GGMND LY ++ IT D +HL +A+ F L L
Sbjct: 222 CDKFKDLT-----EEQWQNILTCEHGGMNDALYNVYAITGDTRHLEIANKFYHKKVLDPL 276
Query: 338 ALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLES---------------SGT 382
+ + ++++G H+NT IP VIG YE+TG+Q H H + S +
Sbjct: 277 SKRKNELAGLHANTQIPKVIGISRSYELTGNQDH---HTISSYFWHTVTHEHSYCIGGNS 333
Query: 383 NIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL 442
N HF +P +L+ L + T E+C TYNMLK++RHLF W D+YER+L N +L
Sbjct: 334 NYEHF---VEPGKLSGELSNKTTETCNTYNMLKLTRHLFAWNPSAELMDFYERALYNHIL 390
Query: 443 GIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEE 502
Q E G++ Y +PLA S K ++ ++FWCC GTG E+ K + IY E
Sbjct: 391 ASQN-PETGMVCYCVPLAANSQK-----NYCNAENNFWCCVGTGFENHVKYAEQIYSHNE 444
Query: 503 GKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRI 562
+ +YI YI S LDW + + Q + P T ++ T + ++R
Sbjct: 445 NE---LYINLYIPSELDWSEKNMKLKQTNN-----FPDTDNTTITITETVPQTLTFHVRF 496
Query: 563 PTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
P W S G +NG + S PG+++S+T+ W ++DK+ I LP TL E + D+ +
Sbjct: 497 PNWVQS-GYSIKINGTEQVFNSTPGSYVSITREWKTNDKIEINLPKTLTKEQLLGDKYK- 554
Query: 622 ASIQAILYGPYVLAGHSIGDWDITESA--------TSLSDWITP 657
A L GP VLAG + DIT++ ++SDW+TP
Sbjct: 555 ---TAFLNGPIVLAGKT----DITQTPPVFIRHENKNISDWMTP 591
>gi|300785310|ref|YP_003765601.1| hypothetical protein AMED_3413 [Amycolatopsis mediterranei U32]
gi|384148599|ref|YP_005531415.1| hypothetical protein RAM_17360 [Amycolatopsis mediterranei S699]
gi|399537193|ref|YP_006549855.1| hypothetical protein AMES_3374 [Amycolatopsis mediterranei S699]
gi|299794824|gb|ADJ45199.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340526753|gb|AEK41958.1| hypothetical protein RAM_17360 [Amycolatopsis mediterranei S699]
gi|398317963|gb|AFO76910.1| hypothetical protein AMES_3374 [Amycolatopsis mediterranei S699]
Length = 740
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 197/517 (38%), Positives = 265/517 (51%), Gaps = 46/517 (8%)
Query: 139 LEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHN 198
L Y +D D+L+ FR A L + +P GGWE P ELRGH GH LS A +A+T +
Sbjct: 68 LAYFRFVDADRLLHTFRLNAGLASSAQPCGGWESPGTELRGHSTGHLLSGLAQAYANTGD 127
Query: 199 ESLKEKMSAVVSALSACQ-----KEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILA 253
+ K K +V+AL+ACQ + +GYLSAFP FDRLE+ VWAPYYT+HKI+A
Sbjct: 128 TAHKTKGDYLVNALAACQAAAPGRGFHAGYLSAFPENFFDRLESGQSVWAPYYTLHKIMA 187
Query: 254 GLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLF 313
GLLDQY A N +AL + + R + S+ + L E GGM +VL L+
Sbjct: 188 GLLDQYLLAGNQQALDVLLRKAAWTKTRTDPL----SVTQMQAALRTEFGGMPEVLTNLY 243
Query: 314 CITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE 373
+T D HL A FD L LA D +SGFH+NT IP ++G+ Y TG +++
Sbjct: 244 QVTGDANHLATAQRFDHAQILDPLAANQDRLSGFHANTQIPKILGAIREYHATGTTRYRD 303
Query: 374 ----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRW 423
H G N F++ P +AS L T E C TYNMLK++R LF
Sbjct: 304 IAVNFWRIVLDHHTYVIGGNSDGEYFQA-PDAIASQLSDTTCEVCNTYNMLKLTRQLFFT 362
Query: 424 TKEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCC 482
Y DYYE +L N +LG Q + G + Y PL G K + + D F C
Sbjct: 363 NPAPEYMDYYELALFNQILGEQDPDSSHGFVTYYTPLRAGGIKTYANDY-----DDFTCD 417
Query: 483 YGTGIESFSKLGDSIYFEEEGKYPG--VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPY 540
+GTG+ES +K DS+YF + G +Y+ +I+S L W I V Q S
Sbjct: 418 HGTGMESQTKFADSVYF-----FTGETLYVNLFIASVLTWPGRGITVRQDTTFPASSGTK 472
Query: 541 LRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDK 600
L + GSG +L LRIP WTS GA +NG PSPG+F ++ +TW++ D
Sbjct: 473 LTI------GGSG-HIALKLRIPKWTS--GAVVKVNGVAQGSPSPGSFCTIDRTWAAGDV 523
Query: 601 LTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+ + +P +L DD AS+ A YG VLAG
Sbjct: 524 VDVSVPASLTFPRANDD----ASVGAAKYGAIVLAGQ 556
>gi|429199615|ref|ZP_19191363.1| hypothetical protein STRIP9103_08616 [Streptomyces ipomoeae 91-03]
gi|428664699|gb|EKX63974.1| hypothetical protein STRIP9103_08616 [Streptomyces ipomoeae 91-03]
Length = 655
Score = 286 bits (733), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 188/542 (34%), Positives = 277/542 (51%), Gaps = 54/542 (9%)
Query: 127 SDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEPSCELRGHFVGHY 185
D + R + LEY D+++ FR A L G P GGWE LRGH+ GH+
Sbjct: 4 GDGVFRRKRDLMLEYARSYPADRILAVFRANAGLDTRGARPPGGWETSDGNLRGHYGGHF 63
Query: 186 LSASALMWASTHNESLKEKMSAVVSALSACQKEIGS---------GYLSAFPTEQFDRLE 236
L+ A +A T +LK K+ +V AL+ CQ+ + G+L+A+P QF LE
Sbjct: 64 LTLVAQAYADTREAALKAKLDYLVGALAECQRTLAERGNPRPSHPGFLAAYPETQFILLE 123
Query: 237 ALIP---VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIER 293
+ +WAPYYT HKI+ GLLD +T A NAEAL + + M ++ ++R+ + K ++R
Sbjct: 124 SYTTYPTIWAPYYTCHKIMRGLLDAHTLAGNAEALTVASKMGDWVHSRLGR-LPKAQLDR 182
Query: 294 HWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTH 352
W + E GGMN+V+ L+ +T +HL A FD L A D + G H+N H
Sbjct: 183 MWSIYIAGEYGGMNEVMADLYALTGRAEHLAAARCFDNTALLDACAEDRDILDGRHANQH 242
Query: 353 IPIVIGSQMRYEVTGDQLHKE----------GHQLES-SGTNIGHFNFKSDPKRLASNLD 401
IP G ++ TG++ + + GH+ S GT G D +A+ LD
Sbjct: 243 IPQFTGYLRMFDHTGEERYADAARNFWGMVAGHRTYSLGGTGQGEMFRARDA--VAATLD 300
Query: 402 SNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ---RGTEPGVMIYLLP 458
E+C TYNMLK+SR LF + AY D+YER LTN +L + R T+ + Y +
Sbjct: 301 DKNAETCATYNMLKLSRQLFFRDPDPAYMDHYERGLTNHILASRRDARSTDGPEVTYFVG 360
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
+ PG +E Y + GT CC GTG+E+ +K DS+YF +Y+ Y++S L
Sbjct: 361 MGPGVVRE--YGNIGT------CCGGTGMENHTKYQDSVYFRSADG-GALYVNLYLASTL 411
Query: 519 DWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
W IVV Q D P TLTF G T L LRIP+W ++ G T+NG
Sbjct: 412 RWPERGIVVEQTSDFPAEGVR-----TLTFREGGG--TLDLKLRIPSW-ATEGVTVTVNG 463
Query: 578 QDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + PG +L+++++W D++ I P LR E DD ++Q++ +GP +L
Sbjct: 464 VRQRVEAVPGTYLTLSRSWQRGDRVAISTPYRLRIERALDD----PAVQSVFHGPVLLVA 519
Query: 637 HS 638
S
Sbjct: 520 RS 521
>gi|332880466|ref|ZP_08448140.1| hypothetical protein HMPREF9074_03915 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357046164|ref|ZP_09107794.1| hypothetical protein HMPREF9441_01810 [Paraprevotella clara YIT
11840]
gi|332681454|gb|EGJ54377.1| hypothetical protein HMPREF9074_03915 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355531170|gb|EHH00573.1| hypothetical protein HMPREF9441_01810 [Paraprevotella clara YIT
11840]
Length = 641
Score = 286 bits (733), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 188/552 (34%), Positives = 285/552 (51%), Gaps = 44/552 (7%)
Query: 103 GQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPA 162
GQF+V + + L DVRL + + +++ + D+L+ FR TA + A
Sbjct: 30 GQFRVSVQVPLAAESFDLQDVRLLPGRFRDNMMRDS-AWMVSIGADRLLHGFRTTAGVFA 88
Query: 163 PGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSAC 215
E GGWE CELRGH GH LSA ALM+A+T ++ K K ++V+ L+
Sbjct: 89 GREGGYMTVKKLGGWESLDCELRGHTTGHVLSALALMYAATGSDVFKMKGDSLVAGLAEV 148
Query: 216 QKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMV 275
Q GYLSA+P E +R VWAP+YT+HK+ +GL+DQY YA NA+AL + M
Sbjct: 149 QAAGTGGYLSAYPEELINRNIRGESVWAPWYTLHKLFSGLIDQYLYARNAQALDVVRKMG 208
Query: 276 EYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
++ Y +++ + + E + + E GG+N+ Y L+ +T D ++ LA F +
Sbjct: 209 DWAYGKLRPLPE----EMRRKMIRNEFGGINESFYNLYALTGDERYRWLAGFFYHNDVID 264
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS---- 391
L Q DD+ H+NT IP V+ YE+TGD K + T IG F
Sbjct: 265 PLKEQRDDLGTKHTNTFIPKVLAEARNYELTGDGDSKALSEFFWH-TMIGRHTFAPGCSS 323
Query: 392 ------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ 445
DP + ++ T E+C TYNMLK+SRHLF W ADYYER+L N +LG Q
Sbjct: 324 DKEHYFDPDEFSKHISGYTGETCCTYNMLKLSRHLFCWEASPEVADYYERALYNHILG-Q 382
Query: 446 RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
+ G++ Y LPL G+ K S TP +SFWCC G+G ES +K +SIY+ E
Sbjct: 383 QDPATGMVSYFLPLQSGTHKVYS-----TPENSFWCCVGSGFESHAKYAESIYYRGED-- 435
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
+Y+ +I S L WK + + Q+ + R+TL + ++ LR P+W
Sbjct: 436 -CLYVNLFIPSELAWKEKGLNLRQETR--FPEEETTRLTLALETP---RRLAVKLRYPSW 489
Query: 566 TSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
+ + +NG+ + + PG+++++ + W D++ + P+ L E + D+
Sbjct: 490 SGRPTVR--VNGKSVRVKQHPGSYITLDRRWEDGDRIEVTYPMRLAMERMPDN----PHK 543
Query: 625 QAILYGPYVLAG 636
A+LYGP VLAG
Sbjct: 544 GALLYGPIVLAG 555
>gi|330997549|ref|ZP_08321396.1| hypothetical protein HMPREF9442_02496 [Paraprevotella xylaniphila
YIT 11841]
gi|329570407|gb|EGG52138.1| hypothetical protein HMPREF9442_02496 [Paraprevotella xylaniphila
YIT 11841]
Length = 622
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 186/549 (33%), Positives = 282/549 (51%), Gaps = 48/549 (8%)
Query: 106 KVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE 165
KVP + F L DVRL + ++ +++ + VD+L+ FR TA + A E
Sbjct: 21 KVPLAAESF----ELQDVRLLPGRFRDNMMRDSV-WMVSIGVDRLLHGFRTTAGIFAGRE 75
Query: 166 -------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE 218
GGWE CELRGH GH+LSA +LM+A+T +E K K ++V+ L+ Q
Sbjct: 76 GGYMTVKKLGGWESLDCELRGHTTGHFLSALSLMYAATGSEVFKLKGDSLVAGLAEVQVA 135
Query: 219 IGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYF 278
+G+GYLSAFP E +R VWAP+YT+HKI +GL+DQY YA N +AL + M ++
Sbjct: 136 LGNGYLSAFPEELINRNIRATSVWAPWYTLHKIFSGLIDQYLYAGNTQALEVVRKMGDWA 195
Query: 279 YNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLA 338
Y + +K S E + + E GG+N+ Y L+ +T D ++ LA F + L
Sbjct: 196 YAK----LKPLSEETRRKMIRNEFGGVNESFYNLYALTGDERYKWLAGFFYHNEVIDPLK 251
Query: 339 LQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK-----------EGHQLESSGTNIGHF 387
Q DD+ H+NT IP V+ YE+TGD K + H ++
Sbjct: 252 AQKDDLGTKHTNTFIPKVLAEARNYELTGDADSKALSEFFWHTMIDRHTFAPGCSSDKEH 311
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
F +D + +++ T E+C TYNMLK+SRHLF W ADYYER+L N +LG Q+
Sbjct: 312 YFPTD--KFTAHISGYTGETCCTYNMLKLSRHLFCWDASPEVADYYERALYNHILG-QQD 368
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
G++ Y LPL G+ + S TP +SFWCC G+G E+ +K ++IY+ + G
Sbjct: 369 PASGMVAYFLPLQTGTHRVYS-----TPENSFWCCVGSGFENHAKYAEAIYYHDRD---G 420
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+++ +I S + W+ +V+ Q + +VT T T + LR P+W S
Sbjct: 421 IFVNLFIPSEVKWREKGLVLRQD----TRFPEEGKVTFTVGLDEPKQLT-VRLRYPSW-S 474
Query: 568 SNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAI 627
S + + PG+++ +++ W D++ + LR E P+ A+
Sbjct: 475 SEVSVKVNGKKVKVRQKPGSYILLSRRWKDGDRIEADYAMGLRLERT----PDGTERGAL 530
Query: 628 LYGPYVLAG 636
LYGP VLAG
Sbjct: 531 LYGPVVLAG 539
>gi|374372949|ref|ZP_09630610.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
gi|373235025|gb|EHP54817.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
Length = 653
Score = 286 bits (731), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 183/556 (32%), Positives = 285/556 (51%), Gaps = 44/556 (7%)
Query: 100 KNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTAR 159
++ G+F + +R + L +V+L DS +LL + + L+ +F A
Sbjct: 37 QHEGKFAIKDRLKPAVYSFDLSEVKL-LDSRFKENMLREQHWLLAISLKSLLHSFYTNAG 95
Query: 160 LPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSAL 212
+ E Y GWE CELRGH GH LS ALM+AST + K K ++ AL
Sbjct: 96 MYDANEGGYDEIKKYAGWESMDCELRGHSTGHILSGLALMYASTGEQIYKSKGDTIIKAL 155
Query: 213 SACQKEIG-SGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMT 271
+A QK + +GY+SAFP E +R VWAP+YT+HKILAG+LDQY Y +N +AL +
Sbjct: 156 AAIQKTLNQNGYISAFPQEFINRNIRGEKVWAPWYTLHKILAGVLDQYLYCNNDQALDIA 215
Query: 272 TWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKP 331
+ Y ++ + + + L E GGMN+V + L+ IT D K L + F
Sbjct: 216 KNFSAWAYKKLHPL----TAGQRTLMLRNEFGGMNEVFFNLYAITGDEKDKWLGNFFYDN 271
Query: 332 CFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQ----------LHKEGHQLESSG 381
L L D++ G H+NT+IP ++G YE+ G+ H ++G
Sbjct: 272 RMLDPLKAGIDNLKGAHANTYIPKLLGVTRDYEIEGNAGGDAVVRFFWQRVTTHHSFATG 331
Query: 382 TNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGV 441
+N +F P ++++L T ESC YNMLK++RHL+ + + YADYYE++L N +
Sbjct: 332 SNSDREHF-FQPDAISTHLTGYTGESCNVYNMLKLTRHLYIHSGNVKYADYYEKALFNHI 390
Query: 442 LGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
LG Q+ G++ Y LP+ PG+ K S TP SFWCC GTG E+ +K G+ IY+
Sbjct: 391 LG-QQDPATGMIAYFLPMLPGAHKVYS-----TPDSSFWCCVGTGFENQAKYGEGIYYHT 444
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
+ +YI +I S L+WK + Q+ D ++ T+ + ++N+R
Sbjct: 445 QND---LYINLFIPSDLNWKEKSFRLMQQTK--FPEDGNMKFTI---DEAPEFPLTINIR 496
Query: 562 IPTWTSSNGAKATLNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
P W + T+NG+ + + + ++S+ + W +D++ + + LRT D+
Sbjct: 497 YPDWVAGR-PTITINGRSIKIEQAADSYISIKRIWKKNDRIEVNYRMQLRTIPANDN--- 552
Query: 621 YASIQAILYGPYVLAG 636
S+ AI YGP VLAG
Sbjct: 553 -PSVAAIAYGPVVLAG 567
>gi|33113961|gb|AAP94583.1| putative protein [Zea mays]
Length = 786
Score = 285 bits (729), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 135/212 (63%), Positives = 161/212 (75%), Gaps = 4/212 (1%)
Query: 166 PYGGWEEP----SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS 221
P W P +L GHFVGHYL A+A MWASTHN++L KMS +V+AL CQK++G
Sbjct: 461 PTSDWRSPGRFLDVQLWGHFVGHYLGATAKMWASTHNDTLNAKMSYIVNALYDCQKKMGI 520
Query: 222 GYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNR 281
GYLSAFP+E F +EA+ VWAPYYTIHKI+ GLLDQYT A N+ AL M MV YF +R
Sbjct: 521 GYLSAFPSEFFVWVEAITSVWAPYYTIHKIMQGLLDQYTVAGNSVALVMVVKMVNYFSDR 580
Query: 282 VQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQA 341
V+NVI+ YSIE HW++LNE+ GGMNDV Y+L+ I D KHL LA LFDKPCFLGLLA Q
Sbjct: 581 VKNVIQNYSIETHWESLNEKTGGMNDVFYQLYTIMNDTKHLTLAPLFDKPCFLGLLAGQD 640
Query: 342 DDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE 373
D ISGFHSNT IP+ IG+QMRY+VTGD L+K+
Sbjct: 641 DSISGFHSNTRIPVAIGAQMRYKVTGDPLYKQ 672
>gi|334364979|ref|ZP_08513951.1| conserved hypothetical protein [Alistipes sp. HGB5]
gi|313158812|gb|EFR58195.1| conserved hypothetical protein [Alistipes sp. HGB5]
Length = 778
Score = 285 bits (729), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 186/541 (34%), Positives = 296/541 (54%), Gaps = 50/541 (9%)
Query: 118 VSLHDVRL-GSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCE 176
V L+DVR+ G +H AQ+ + +L +D D+ + FR A L YGGWE C
Sbjct: 45 VPLNDVRITGGPFLH--AQEMDRRWLDSMDPDRYLSGFRSEAGLEPKAPRYGGWESAGCS 102
Query: 177 LRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--FDR 234
GH GH+LSA+A+M+A+T + +L +K++ + L+ CQ++ G+G L+ F + F
Sbjct: 103 --GHGFGHFLSAAAMMYAATGDRALLDKINYSIDGLAECQQKEGTGLLAGFERSRALFAE 160
Query: 235 LEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNV 285
LE L W P+YT+HK+ AGL+D Y NA+AL T +V F + + +
Sbjct: 161 LERGDIRSQGFDLNGGWVPFYTLHKMYAGLVDVCRYTPNAKAL---TVLVR-FADWLDGL 216
Query: 286 IKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDIS 345
+ K S E+ + L E GG+ + L ++ +T + K+L LA FD L LA D +
Sbjct: 217 VAKLSDEQMDKILICEHGGITESLADIYVLTGERKYLELARRFDHREILRPLAAGVDSLP 276
Query: 346 GFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKR 395
G H+NT IP ++G+ YE +GD+ ++ G + G N + +F + P
Sbjct: 277 GKHANTQIPKIVGAVREYECSGDERYRRIADYFWHRVVGFHSYAIGGNSEYEHFGA-PGM 335
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
LA+ L T E+C TYNMLK+++HL++ + ADYYER+L N +L Q + G++ Y
Sbjct: 336 LANRLSDGTCETCNTYNMLKLTKHLYQLDPTVRRADYYERALYNQILASQ-NPDDGMVCY 394
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
+ P+ G K + P DSFWCC G+G+E+ ++ G+ IYF + + +Y+ YI
Sbjct: 395 MSPMGSGHRK-----GFCLPFDSFWCCVGSGMENHARYGEFIYFTDARE--NLYVNLYIP 447
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S LDWKS + V Q D S + LRV ++ + + LNLR P W ++ G + T+
Sbjct: 448 STLDWKSRGVKVEQLTDFPCSDEVRLRVEMSGAQR-----FVLNLRYPEW-AAEGYELTV 501
Query: 576 NGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
NG+ + + PG+++SV + W S D++ L +L +E I D ++++A YGP VL
Sbjct: 502 NGRPVKQKAKPGSYISVNRKWRSGDEVRFVLRQSLHSEPIPGD----STLRAYFYGPVVL 557
Query: 635 A 635
+
Sbjct: 558 S 558
>gi|375148455|ref|YP_005010896.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361062501|gb|AEW01493.1| protein of unknown function DUF1680 [Niastella koreensis GR20-10]
Length = 786
Score = 285 bits (728), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 188/542 (34%), Positives = 292/542 (53%), Gaps = 52/542 (9%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
+L DV+L D +A + ++ YL +++ D+L+ +FR+ A L GE YGGWE L
Sbjct: 46 NLQDVQL-LDGPFKKAMEADVRYLQVIEPDRLLADFREHAGLKPKGEHYGGWEHSG--LA 102
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ------- 231
GH +GHYLSA A+ +A++H++ K++ +V L+ CQ + +GY+ A P E
Sbjct: 103 GHTLGHYLSACAMHYAASHDKQFLGKVNYIVDELAECQPK-RNGYVGAIPKEDSMWAEVE 161
Query: 232 ----FDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
R L W+P+YT+HKI+AGLLD Y Y DN +AL + T M ++ + ++N +
Sbjct: 162 KGNIHSRGFDLNGAWSPWYTVHKIMAGLLDAYLYCDNKKALAVETGMADWTAHLLRN-LP 220
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
S++R L E GGMNDVL + +T + K+L L++ F L LALQ D + G
Sbjct: 221 DSSLQR---MLFCEYGGMNDVLNNTYALTGEKKYLDLSYKFHDKRILDSLALQKDILPGK 277
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRL 396
HSNT IP VIG RYE+T + K H G + ++ + +L
Sbjct: 278 HSNTQIPKVIGCIRRYELTAGEKDKTIGDFFWQTVVNDHTYAPGGNS--NYEYLGPAGQL 335
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
L NT E+C TYNMLK++RHLF + DYYER+L N +L Q + G+M Y
Sbjct: 336 NETLTDNTMETCNTYNMLKLTRHLFALQPTASLMDYYERALYNHILSSQDHST-GMMCYF 394
Query: 457 LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
+PL G+ KE S ++F CC G+G+E+ K G++IY+ +G +Y+ +I+S
Sbjct: 395 VPLRMGTQKEFS-----DSFNTFTCCVGSGMENHVKYGETIYY--QGADGSLYVNLFIAS 447
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
RL WK +VV Q+ + Y+R+ + + + +L +R P W + G +N
Sbjct: 448 RLTWKEKGVVVEQQTQ--LPESNYIRLAIKAARP---VAFTLRIRNPYW-AKQGVWIAVN 501
Query: 577 GQDLPLPSPG--NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
G++ PG + ++T+TW + D + ++ L L T ++ P+ + AI YGP VL
Sbjct: 502 GKEQTNLQPGADGYFTITRTWKTGDAVIVKPSLQLYTRSM----PDNPNRLAIFYGPLVL 557
Query: 635 AG 636
AG
Sbjct: 558 AG 559
>gi|289773961|ref|ZP_06533339.1| secreted protein [Streptomyces lividans TK24]
gi|289704160|gb|EFD71589.1| secreted protein [Streptomyces lividans TK24]
Length = 854
Score = 284 bits (726), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 200/578 (34%), Positives = 283/578 (48%), Gaps = 44/578 (7%)
Query: 110 RSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGG 169
R G L+ L VRL DS + YL +D D+L+ FR LP+ EP GG
Sbjct: 46 RPGPLLEPFPLSAVRL-LDSPFLANMRRTCAYLRFVDPDRLLHTFRLNVGLPSAAEPCGG 104
Query: 170 WEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS-----GYL 224
WE P +LRGH GH LSA A A T + +K +VSAL+ CQ+ + GYL
Sbjct: 105 WEAPDVQLRGHTTGHLLSALAQAHAGTGETAYADKARLLVSALAECQRAAPAAGFHRGYL 164
Query: 225 SAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
SAFP FD+LEA WAPYYT+HKI+AGLLDQY + N EA + M + R
Sbjct: 165 SAFPESVFDQLEAGGKPWAPYYTLHKIMAGLLDQYRLSGNREAFDVLLEMAAWTEARTAP 224
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S ER L E GGMNDVL +L T DP HL A FD LA D++
Sbjct: 225 L----SRERMQSVLKVEFGGMNDVLARLHLETGDPVHLRTARRFDHDELYAPLAAGRDEL 280
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPK 394
+G H+NT I V+G+ YE TGD+ + + H + G N F P
Sbjct: 281 AGRHANTEIAKVVGAVPAYEATGDRRYLDIADTFWTTVVRHHSYAIGGNSNQELF-GPPD 339
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKE-IAYADYYERSLTNGVLGIQR-GTEPGV 452
+AS L T E+C +YNMLK+ R LFR E Y D+YE +L N +L Q + G
Sbjct: 340 EIASRLSEVTCENCNSYNMLKLGRDLFRHDPERTEYLDHYEWTLYNQMLAEQDPDSAHGF 399
Query: 453 MIYLLPLAPGSSKERSYHHWGTPS------DSFWCCYGTGIESFSKLGDSIYFEEEG-KY 505
+ Y L GS +E P D+F C +GTG+E+ +K D++YF G +
Sbjct: 400 VTYYTGLWAGSRREPKGGLGSAPGSYSGDYDNFSCDHGTGLETHTKFADTVYFRTPGTRR 459
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
P +++ ++ S + W + + Q D + R+T+T G +L +R+P W
Sbjct: 460 PALHVNLFVPSEVCWDDLGVTLRQDTD--MPTGDRTRLTVT----GGEARFALRIRVPGW 513
Query: 566 TSSNGAKA--TLNGQDL-PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYA 622
++ +A T+NG+ PG + +VT+ W + D++ + LP + P+
Sbjct: 514 LAAGDGRAGLTVNGRRTGGRLEPGTYTTVTRHWRTGDRVELVLPRV----PVWRPAPDNP 569
Query: 623 SIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPA 660
++A+ YGP VLAG + GD +T D + P
Sbjct: 570 QVKAVSYGPLVLAG-AYGDTPLTTLPAVRPDTLRRTPG 606
>gi|390452646|ref|ZP_10238174.1| hypothetical protein PpeoK3_01345 [Paenibacillus peoriae KCTC 3763]
Length = 767
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 198/589 (33%), Positives = 301/589 (51%), Gaps = 54/589 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEP 173
+KE V L +S A L+++ ++ D++++NFR+ A + G +P GW+ P
Sbjct: 191 VKEFKGQKVSLERESEFEAAMNRFLQFVRSVNDDQMLYNFREAAAIDTKGAQPMTGWDAP 250
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI------GSGYLSAF 227
C L+GH GHYLSA AL + +T + +L K+ +V L CQ + G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYNATEDSALLGKIQYMVVELGKCQTALSEQAGYGRGFLSAY 310
Query: 228 PTEQFDRLEALI---PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
EQF+ LE +WAPYYT+HKI+AGLLD Y A EAL + + + +NR+
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALDICDKLGHWLHNRLGR 370
Query: 285 VIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ ++ + + W + E GGMN+VL KL+ IT + +LM A FD + D
Sbjct: 371 LPRE-QLHKMWSLYIAGEFGGMNEVLAKLYAITGNKNYLMTAKYFDNEKLFLPMKENVDT 429
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLH-----------KEGHQLESSGTNIGHFNFKSD 392
+ H+N HIP VIG+ +EV GD+ + + H GT G +
Sbjct: 430 LGNTHANQHIPQVIGALKLFEVAGDEAYFNIAENFWTMVTQSHIYPIGGT--GETEMFRE 487
Query: 393 PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP-G 451
P +A L T E+C +YNMLK+++ LF++ Y DYYE++L N +L + + G
Sbjct: 488 PDAIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEG 547
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
Y +PLAPGS K+ H CC+GTG+E+ K ++IYF +E + +Y+
Sbjct: 548 GSTYFMPLAPGSIKKFDTHENT-------CCHGTGLENHFKYQEAIYFHDEDR---LYVN 597
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
YI SRLDW + + QK D D T+ F +G TT L RIP W S
Sbjct: 598 LYIPSRLDWSDQGLSLVQKRDS----DGL--ETVRFYIEGVPETT-LMFRIPDWISEP-V 649
Query: 572 KATLNGQDL-PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+ +NG+ L +L + K W D+ + + LP +LR D P+ +++++ YG
Sbjct: 650 QVKINGEPCRDLEYEDGYLKLRKVWKKDE-IELTLPCSLRLA----DAPDDHTLKSLAYG 704
Query: 631 PYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFV 679
PYVLA S G+ D S +++ I +S L TF + + KFV
Sbjct: 705 PYVLAAIS-GEQDYISWTYSEQEFLKQIIQQKDSPL-TFVLD--SIKFV 749
>gi|332663228|ref|YP_004446016.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332332042|gb|AEE49143.1| protein of unknown function DUF1680 [Haliscomenobacter hydrossis
DSM 1100]
Length = 791
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 188/548 (34%), Positives = 294/548 (53%), Gaps = 55/548 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L D+RL S + A + + YLL ++ D+L+ F A LP YGGWE S L G
Sbjct: 50 LEDLRLLPGSAFYNAMEKDAAYLLKIESDRLLHRFYANAGLPTKAPVYGGWE--SEGLSG 107
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-----QFDR 234
H +GHYLSA ALM+A + +E E+++ +V L+ CQ +GY+ A P E Q R
Sbjct: 108 HTLGHYLSACALMYAGSKDEKYLERVNYLVQELARCQVARKTGYVGAIPKEDSIFAQVAR 167
Query: 235 LEA------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
+ L W+P+YTIHK++AGL D Y Y +N +AL++ M ++ +V+ K
Sbjct: 168 GDIRSSGFDLNGGWSPWYTIHKVMAGLADAYLYTNNDQALQVLRGMSDW----TASVVDK 223
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
+ + + L E GGMN++L ++ T + K+L L++ F + L+ + D + G H
Sbjct: 224 LNDPQRQKMLKCEYGGMNEILANVYAFTGEKKYLDLSYKFYDDFVMEPLSKKIDPLPGKH 283
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRLA 397
SNT++P IGS +YE+TG+ + H G + ++ + D +L
Sbjct: 284 SNTNVPKAIGSARQYELTGNTRDQTIASFFWETMVHNHTYVIGGNS--NYEYCGDAGKLN 341
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
L NT E+C TYNMLK++RHLF W ADYYER+L N +L Q E G+M Y +
Sbjct: 342 DRLSDNTCETCNTYNMLKLTRHLFCWQPSAELADYYERALYNHILASQH-PETGMMTYFV 400
Query: 458 PLAPGSSKERS--YHHWGTPSDSFWCCYGTGIESFSKLGDSIYFE-EEGKYPGVYIIQYI 514
PL GS KE S +H +F CC G+G+E+ K +SIY+ ++G +Y+ +I
Sbjct: 401 PLRMGSKKEFSNEFH-------TFTCCVGSGMENHVKYTESIYYRGQDGN--SLYLNLFI 451
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
S L+WK + + Q+ + +VTL+F+ S +LNLR P W ++ +
Sbjct: 452 PSELNWKERGLTLRQE----TKFPQDGKVTLSFTCAKSQ-KLALNLRRPWWMKAD-WQIK 505
Query: 575 LNGQDL-PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NG+ + P+ + + + W + DKL +++P+ L TE++ P+ + A LYGP V
Sbjct: 506 VNGKAVQPVAGTNGYYVLNRRWKNGDKLELEMPMQLYTESM----PDNPNRIAFLYGPLV 561
Query: 634 LAGHSIGD 641
LAG +GD
Sbjct: 562 LAGQ-LGD 568
>gi|375308750|ref|ZP_09774033.1| hypothetical protein WG8_2558 [Paenibacillus sp. Aloe-11]
gi|375079377|gb|EHS57602.1| hypothetical protein WG8_2558 [Paenibacillus sp. Aloe-11]
Length = 770
Score = 282 bits (721), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 186/550 (33%), Positives = 285/550 (51%), Gaps = 54/550 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEP 173
+KE + V L +S A L+++ ++ D++++NFR+ A + G +P GW+ P
Sbjct: 191 VKEFTGPKVSLERESEFAAAMNRFLQFVRSVNDDQMLYNFREAAAIDTKGAQPMTGWDAP 250
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI------GSGYLSAF 227
C L+GH GHYLSA AL + +T + +L K+ +V+ L CQ + G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYHATEDSALLGKIQYMVAELGKCQTALSEQAGYGRGFLSAY 310
Query: 228 PTEQFDRLEALI---PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
EQF+ LE +WAPYYT+HKI+AGLLD Y A EAL + + + ++R+
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALDICDKLGHWLHSRLSR 370
Query: 285 VIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ ++ + + W + E GGMN+ L KL+ IT + +LM A FD + D
Sbjct: 371 LPRE-QLHKMWSLYIAGEFGGMNEALAKLYAITGNENYLMTAKYFDNAKLFLPMKENVDT 429
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLH-----------KEGHQLESSGTNIGHFNFKSD 392
+ H+N HIP VIG+ +EV GD+ + + H GT G +
Sbjct: 430 LGNMHANQHIPQVIGALKLFEVAGDKAYFNIAENFWTMVTQSHIYPIGGT--GETEMFRE 487
Query: 393 PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP-G 451
P +A L T E+C +YNMLK+++ LF++ Y DYYE++L N +L + + G
Sbjct: 488 PDAIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEG 547
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
Y +PLAPGS K+ H CC+GTG+E+ K ++IYF +E + +Y+
Sbjct: 548 GSTYFMPLAPGSIKKFDTHENT-------CCHGTGLENHFKYQEAIYFHDEDR---LYVN 597
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
YI SRLDW I + QK D T+ F +G G T+L RIP W S
Sbjct: 598 LYIPSRLDWSEQGISLMQKRDRDG------LETVRFYIEG-GPETTLMFRIPDWVSEP-V 649
Query: 572 KATLNG---QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAIL 628
+ +NG +DL +L + K W D+ + + LP +LR D P+ +++++
Sbjct: 650 QVKINGVPCRDLEYEH--GYLKLRKVWKKDE-IELTLPCSLRLA----DAPDDHTLKSLT 702
Query: 629 YGPYVLAGHS 638
YGPYVLA S
Sbjct: 703 YGPYVLAAIS 712
>gi|427385120|ref|ZP_18881625.1| hypothetical protein HMPREF9447_02658 [Bacteroides oleiciplenus YIT
12058]
gi|425727288|gb|EKU90148.1| hypothetical protein HMPREF9447_02658 [Bacteroides oleiciplenus YIT
12058]
Length = 778
Score = 281 bits (719), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 196/615 (31%), Positives = 312/615 (50%), Gaps = 81/615 (13%)
Query: 123 VRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFV 182
+RL S A N E+LL L D+L+ FR A L GE YGGWE S + GH +
Sbjct: 44 LRLLPGSPFKHAMDKNGEWLLDLSPDRLLHRFRLNAGLTPKGEIYGGWE--SRGVSGHTL 101
Query: 183 GHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPV- 241
GHYLSA A+M+A++ ++ KE++ +V L+ CQ +GY+ P E D++ A +
Sbjct: 102 GHYLSACAMMYAASGDKRFKERVDYIVKELAECQDARKTGYVGGIPDE--DKIWAEVSSG 159
Query: 242 ------------WAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNV 285
W P+YT+HK+ AGL+D Y YA + +A +++ W V F + +
Sbjct: 160 DIRSQGFDLNGGWVPWYTLHKLWAGLIDAYRYAGSEQAKEVGTKLSDWAVRSFGDLSEED 219
Query: 286 IKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDIS 345
+K L E GGMN+ ++ IT + +L LA F L L Q D++
Sbjct: 220 FQK--------MLACEFGGMNESFADMYAITGNESYLKLARQFYHKAILDPLKEQRDELE 271
Query: 346 GFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLES----------SGTNIGHFNFK--SDP 393
G HSNT +P +IG YE+TGD K+ H + + + N G+ N++ P
Sbjct: 272 GKHSNTQVPKIIGEARLYELTGD---KDMHTIATFYWDRIVNHHTYVNGGNSNYEHLGKP 328
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
L L T E+C TYNMLK+++HLF W + AY DYYE++L N +L Q + G++
Sbjct: 329 DCLNDRLSPFTSETCNTYNMLKLTKHLFSWDPQAAYMDYYEQALYNHILASQN-PDDGMV 387
Query: 454 IYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y +PL G+ KE S T DSFWCC +GIE+ K +S++F+ K G+++ +
Sbjct: 388 CYSVPLESGTKKEFS-----TRFDSFWCCVASGIENHVKYAESVFFQSV-KDGGLFVNLF 441
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
I + L+WK + V K++ + D ++++ KG L++R P W ++ G K
Sbjct: 442 IPTSLNWKEKGMEV--KLETQLPADNKVQISF----KGKSKEFPLHIRYPRW-ATQGIKV 494
Query: 574 TLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
TLNG++ + +PG++ ++ W +D +L I++P+ L T ++ P+ A I YGP
Sbjct: 495 TLNGKEEKVTGTPGSYFTLQGEWDTDTQLVIEIPMELYTVSM----PDNADRMGIFYGPV 550
Query: 633 VLAG----HSIGDWDI---TESATSLSDWITPIPASYNSQLITFTQE-YGNTKFVLT--- 681
+LA + +DI S+ I P+P + +TFT N + +L
Sbjct: 551 LLAAPLGTGELQAYDIPCFISDTESIVQSIAPVP----DKPLTFTANTTANAQLLLVPFY 606
Query: 682 ---NSNQSITMEKFP 693
++ ++FP
Sbjct: 607 TIHGQKHAVYFDRFP 621
>gi|251798261|ref|YP_003012992.1| hypothetical protein Pjdr2_4282 [Paenibacillus sp. JDR-2]
gi|247545887|gb|ACT02906.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 758
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 182/551 (33%), Positives = 280/551 (50%), Gaps = 46/551 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
L L+ V+L S+ A Q L+YL DVD+L+ FR+T+ L + Y GWE +
Sbjct: 10 LNHFELNRVKLYSE-YQTNAFQKELDYLRSYDVDRLLAGFRETSGLQPKADKYPGWE--N 66
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
E+RGH +GHYL+A + +A T + L EK+ +V+ L+ Q+E +GYLSAFP FD
Sbjct: 67 TEIRGHTLGHYLTAVSQAYAQTQDSGLLEKLKYLVAELAEAQQE--NGYLSAFPETLFDN 124
Query: 235 LEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
+E P W P+YT+HKI+AGL+ Y +A + + + ++ +R + +S E
Sbjct: 125 VENRKPAWVPWYTMHKIIAGLIAVYQATKLQQAYEVVSRLGDWVADRACS----WSEELQ 180
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
L E GGMND +Y L+ +T + HL AH FD+ L D + G H+NT IP
Sbjct: 181 ATVLAVEYGGMNDCMYDLYKLTGNNLHLEAAHKFDEISLFEALREGKDVLKGKHANTMIP 240
Query: 355 IVIGSQMRYEVTG--------------DQLHKEGHQLESSGTNIGHFNFKSDPKRLASNL 400
IG+ RY G D + L + HF +P L
Sbjct: 241 KFIGALNRYLTLGESERGYLEAAVNFWDTVVYHHSYLTGGNSECEHF---GEPDILDGKR 297
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
T E+C +YNMLK+++ LF+ T+ YAD+YER+ N +L Q E G+ +Y P+A
Sbjct: 298 SDVTCETCNSYNMLKLTKELFKLTQNSKYADFYERTYINAILSSQ-NPETGMTMYFQPMA 356
Query: 461 PGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW 520
G K S +P + FWCC GTG+ESF+KL DSIYF + +Y+ Q+ SSRLDW
Sbjct: 357 TGYFKIYS-----SPFEHFWCCTGTGMESFTKLNDSIYFHLD---HNLYVNQFYSSRLDW 408
Query: 521 KSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDL 580
Q VV Q P+ + S ++++R+P+W + LNG+ +
Sbjct: 409 TEQQTVVTQTTSL-----PHSDLVHFTVGTDSPKRLAIHIRVPSWAAGE-VDILLNGETV 462
Query: 581 PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIG 640
P ++ + + W D + ++P+ + ++ P+ + + YGP VL+ ++G
Sbjct: 463 PASVQQQYVVLDRIWKDGDTIEARIPMKVSFSSL----PDAPHVIGLQYGPIVLSA-ALG 517
Query: 641 DWDITESATSL 651
D+ ES T +
Sbjct: 518 KEDMVESRTGV 528
>gi|21218915|ref|NP_624694.1| hypothetical protein SCO0371 [Streptomyces coelicolor A3(2)]
gi|5881940|emb|CAB55733.1| putative secreted protein [Streptomyces coelicolor A3(2)]
Length = 869
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 203/610 (33%), Positives = 289/610 (47%), Gaps = 57/610 (9%)
Query: 110 RSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGG 169
R G L+ L VRL DS + YL +D D+L+ FR LP+ EP GG
Sbjct: 61 RPGPLLEPFPLSAVRL-LDSPFLANMRRTCAYLRFVDPDRLLHTFRLNVGLPSAAEPCGG 119
Query: 170 WEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS-----GYL 224
WE P +LRGH GH LSA A A T + +K +VSAL+ CQ+ + GYL
Sbjct: 120 WEAPDVQLRGHTTGHLLSALAQAHAGTGETAYADKARLLVSALAECQRAAPAAGFHRGYL 179
Query: 225 SAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
SAFP FD+LEA WAPYYT+HKI+AGLLDQY + N EA + M + R
Sbjct: 180 SAFPESVFDQLEAGGKPWAPYYTLHKIMAGLLDQYRLSGNREAFDVLLEMAAWTEARTAP 239
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S ER L E GGMNDVL +L T DP HL A FD LA D++
Sbjct: 240 L----SRERMQSVLKVEFGGMNDVLARLHLETGDPVHLRTARRFDHDELYAPLAAGRDEL 295
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPK 394
+G H+NT I V+G+ YE TGD+ + + H + G N F P
Sbjct: 296 AGRHANTEIAKVVGAVPAYEATGDRRYLDIADTFWTTVVRHHSYAIGGNSNQELF-GPPD 354
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKE-IAYADYYERSLTNGVLGIQR-GTEPGV 452
+AS L T E+C +YNMLK+ R LFR E Y D+YE +L N +L Q + G
Sbjct: 355 EIASRLSEVTCENCNSYNMLKLGRDLFRHDPERTEYLDHYEWTLYNQMLAEQDPDSAHGF 414
Query: 453 MIYLLPLAPGSSKERSYHHWGTPS------DSFWCCYGTGIESFSKLGDSIYFEEEG-KY 505
+ Y L GS +E P D+F C +GTG+E+ +K D++YF G +
Sbjct: 415 VTYYTGLWAGSRREPKGGLGSAPGSYSGDYDNFSCDHGTGLETHTKFADTVYFRTPGTRR 474
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
P +++ ++ S + W + + Q D + R+T+T G +L +R+ W
Sbjct: 475 PALHVNLFVPSEVCWDDLGVTLRQDTD--MPTGDRTRLTVT----GGEARFALRIRVAGW 528
Query: 566 TSSNGAKA--TLNGQDL-PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYA 622
++ +A T+NG+ PG + +VT+ W + D++ + LP + P+
Sbjct: 529 LAAGDGRAGLTVNGRRTGGRLEPGTYTTVTRHWRTGDRVELVLPRV----PVWRPAPDNP 584
Query: 623 SIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTN 682
++A+ YGP VLAG + GD +T D + P T+F
Sbjct: 585 QVKAVSYGPLVLAG-AYGDTPLTTLPAVRPDTLRRTPGE-------------PTRFTAVA 630
Query: 683 SNQSITMEKF 692
+ I + F
Sbjct: 631 DGRRIPLRPF 640
>gi|326204047|ref|ZP_08193908.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
gi|325985814|gb|EGD46649.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
Length = 743
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 196/642 (30%), Positives = 311/642 (48%), Gaps = 65/642 (10%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A + +EYL D DKL+ F T L E Y GWE + E+RGH +GHYL+A A +
Sbjct: 14 AFKKEIEYLEAFDCDKLLSCFYITKGLTPKAENYRGWE--NTEIRGHTMGHYLTALAQAY 71
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILA 253
++T++ + E++ ++ LS CQ E SGYLSAFP E FDR+E P+W P+YT+HKI+
Sbjct: 72 SATNDSKIYERLQYLMKELSLCQFE--SGYLSAFPEEFFDRVENRKPIWVPWYTMHKIIT 129
Query: 254 GLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLF 313
GL+ Y A AL++ + + E+ ++R K++ E H L E GGMND +Y+L+
Sbjct: 130 GLISVYKLAKIETALKIVSRLGEWVFSRTD----KWTPEIHANVLAVEYGGMNDCMYELY 185
Query: 314 CITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQ---- 369
I+ + KH AH+FD+ + D ++ H+NT IP +G+ RY G++
Sbjct: 186 KISGNEKHCTAAHMFDEIELFKEIHDGKDILNNRHANTTIPKFLGALNRYLAIGEEEQFY 245
Query: 370 ---------LHKEGHQLESSG-TNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRH 419
+ H + G + HF +P L + S E+C TYNMLK++R
Sbjct: 246 LDTCKEFWSIVTNNHSYVTGGNSEWEHF---GEPGILDAERTSTNCETCNTYNMLKMTRE 302
Query: 420 LFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSF 479
LF+ T YAD+YE + TN +L Q + G+ +Y P+ G K +G P + F
Sbjct: 303 LFKITGNKKYADFYENTFTNAILSSQ-NPDTGMTMYFQPMETGYFKV-----YGKPFEHF 356
Query: 480 WCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDP 539
WCC GTG+E+F+KL +SIYF EE + +Y+ Y S+ L+W+ + + Q D + D
Sbjct: 357 WCCTGTGMENFTKLNNSIYFYEEDR---LYVNMYYSTELNWEEKGVKLTQNSD-IPGTD- 411
Query: 540 YLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDD 599
R T ++ +G +L +RIPTW + G K +N + + +TW +D
Sbjct: 412 --RAGFTIKAE-TGAEFTLCMRIPTW--AKGVKINVNNNLSIFTEERGYALIHRTWKDND 466
Query: 600 KLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIP 659
+ I + + + P+ + A YGP VL+ +G ++ ES T + I
Sbjct: 467 TVEIIFKIEPQLSTL----PDNPNAVAFTYGPVVLSA-GLGADEMEESTTGVMVTIPSKH 521
Query: 660 ASYNSQLITFTQEY---------------GNTKFVLTNSNQSITMEKFPKSGTDAALHAT 704
L+ Q G +F L +++ + P + +
Sbjct: 522 VEIKDYLVIMNQSVDEWKKDIALNLKKAEGKLEFRLNGTDEDGRLVFTPHYRQHSQRYGI 581
Query: 705 FRLILNDSSGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETD 746
+ L++ D S LN +I + +E S + IQ D
Sbjct: 582 YWLLVEDGS----DELNKYIDEKKKVEDIKSAEIDSIQIGND 619
>gi|371778346|ref|ZP_09484668.1| hypothetical protein AnHS1_13085 [Anaerophaga sp. HS1]
Length = 796
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 188/565 (33%), Positives = 288/565 (50%), Gaps = 63/565 (11%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A + N + LL + D+L+ +FR+ A L + YGGWE S L GH +GHYLSA ++M+
Sbjct: 63 ASKLNEKILLNYEPDRLLAHFREQAHLKPKAQHYGGWEGES--LTGHSLGHYLSACSMMY 120
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--FDRLEA----------LIPV 241
+T NE ++++ +V+ L QK G GYL AF + F+ A L +
Sbjct: 121 KTTGNEEFLKRVNYIVNELDTVQKAHGDGYLGAFDNGKKIFEEEIANGNIRSAGFDLNGI 180
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEE 301
WAP YT HKI+AGL+D Y N +AL + ++ + V+N+ S E + L+ E
Sbjct: 181 WAPIYTQHKIMAGLMDAYKLCGNKKALEVEQKFADWLGSIVENL----SHEEIQKMLHCE 236
Query: 302 AGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQM 361
GG+N+ +LF +T + ++L +A LF L LA D + G H+NT IP +IG
Sbjct: 237 HGGINEAYAELFAVTGNERYLKIARLFHHEAVLDPLAKGIDILPGHHANTQIPKIIGLSR 296
Query: 362 RYEVTGDQLHKEG----------HQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTY 411
YE+TGD ++ H +G N H F P L++ L SNT E+C Y
Sbjct: 297 LYELTGDTTDRKTAQFFWERVVYHHSYVTGGNGDHEYF-GPPDTLSNRLSSNTTETCNVY 355
Query: 412 NMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHH 471
NMLK+S HLF+W E ADYYER+L N +L Q + G +IY L L G K H
Sbjct: 356 NMLKLSNHLFKWEAEAEVADYYERALFNHILSSQH-PQSGHVIYNLSLEMGGHK-----H 409
Query: 472 WGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKV 531
+ P F CC GTG+E+ +K +IYF + + +++ Q+I+SRL+WK + + Q
Sbjct: 410 YQNPF-GFTCCVGTGMENHAKYPKNIYFHNDRE---LFVSQFIASRLNWKEKGLKLTQN- 464
Query: 532 DPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLS 590
+ + + F + + L +R P W + G T+NG+ + P +F++
Sbjct: 465 ---TRYPDEQKTSFIFECE-KPVDLILQIRYPYW-AEKGMIVTVNGKKVSYSQKPQSFVA 519
Query: 591 VTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATS 650
+ + W + DK+ + P +LR EA+ D++ A++YGP VLAG +G D ++
Sbjct: 520 IHREWKTGDKVEVSFPFSLRLEAMPDNKDRV----ALMYGPLVLAG-QLGPVDDPKANDP 574
Query: 651 L------------SDWITPIPASYN 663
L W P+P N
Sbjct: 575 LYVPVLMVEDRNPQSWTIPVPDEPN 599
>gi|427386203|ref|ZP_18882400.1| hypothetical protein HMPREF9447_03433 [Bacteroides oleiciplenus YIT
12058]
gi|425726590|gb|EKU89454.1| hypothetical protein HMPREF9447_03433 [Bacteroides oleiciplenus YIT
12058]
Length = 616
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 172/536 (32%), Positives = 277/536 (51%), Gaps = 55/536 (10%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALM 192
+ ++ N+ +L LD D+L+ NFR TA LP+ EP GWE P LRGHFVGHYLSA + +
Sbjct: 48 QREELNITFLKSLDPDRLLHNFRVTAGLPSNAEPLEGWESPKIGLRGHFVGHYLSAVSSL 107
Query: 193 WASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA-LIPVWAPYYTIHKI 251
+ L E++ ++ L CQ+ G+ YLSAFP + FD LEA VWAPYYT +K+
Sbjct: 108 VEKYKDLELVERLRYMIDELCKCQQSFGNSYLSAFPDKDFDALEAKFTGVWAPYYTYNKV 167
Query: 252 LAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN----EEAGGMND 307
+ GLLD YT+ N +A M M Y NR+ + + +IE+ T++ E G MN+
Sbjct: 168 MQGLLDAYTHTGNQKAYDMLLDMAAYVDNRMSKLSGE-TIEKMLYTVDANPQNEPGAMNE 226
Query: 308 VLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTG 367
VLYKL+ I+++PKHL LA +FD+ F+ LA D +SG HSNTH+ +V G RY +TG
Sbjct: 227 VLYKLYKISRNPKHLALAEIFDRNWFITPLAENKDILSGLHSNTHLVLVNGFAQRYSITG 286
Query: 368 DQLHKEG----------HQLESSGTNIG--------------HFNFKSDPKRLASNLDSN 403
+ + + ++GT+ G H+ P L + L
Sbjct: 287 ESKYYAASTNFWDMLISQHVYANGTSSGPRPNATTRTSVTAEHWGV---PGHLCNTLTKE 343
Query: 404 TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGS 463
ESC ++N K++ +F WT YAD Y + N VL Q G +Y LPL GS
Sbjct: 344 IAESCVSHNTQKLTSSIFTWTAAPKYADAYMNTFYNAVLASQ-SAHTGAYMYHLPL--GS 400
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
+ + Y + F CC G+ E++S+L IY+ ++ +++ ++ S ++WK
Sbjct: 401 PRNKKY----LKDNDFACCSGSSAEAYSRLNSGIYYHDDS---ALWVNLFVPSEVNWKEK 453
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
+ + Q + + + T S+K + +L L IP+W + A+ +NG+ +
Sbjct: 454 NVRLEQNGN----FPKDTNICFTISTK-KKVGFALKLFIPSW--AKNAEVYINGEKQEIE 506
Query: 584 S-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
+ P +++ + + W D++ + + + D++ + ++ YGP +LA S
Sbjct: 507 TFPSSYIDLNRNWRDKDEVKLIFHYDFHLKTMPDNK----DVLSLFYGPMLLAFES 558
>gi|256377207|ref|YP_003100867.1| hypothetical protein Amir_3107 [Actinosynnema mirum DSM 43827]
gi|255921510|gb|ACU37021.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 771
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 186/533 (34%), Positives = 273/533 (51%), Gaps = 55/533 (10%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWEEPSCELRGHFVGHYLSASALMW 193
Q L YL +D D+L++NFR RL G P GWE P R H GH+L+A A W
Sbjct: 66 QNRALSYLRFVDPDRLLYNFRANHRLSTAGAAPLAGWEAPDFPFRTHSQGHFLTAWAQAW 125
Query: 194 ASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEALIPVWAPYYTI 248
A + + +++ + +V+ L+ CQ +GYLS FP D LEA P YY +
Sbjct: 126 AVLGDTTSRDRANHLVAELAKCQANNAAAGFTAGYLSGFPESDLDALEAGTPKAVSYYAL 185
Query: 249 HKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGG 304
HK LAGLLD + + + +A LR W V++ R + + +++R L E GG
Sbjct: 186 HKTLAGLLDVWRHLGSTQARDVLLRFAGW-VDWRTAR----LSQATMQR---VLATEFGG 237
Query: 305 MNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYE 364
MN VL L+ T D + L A FD LA D ++G H+NT +P IG+ Y+
Sbjct: 238 MNAVLADLYQQTGDARWLATAQRFDHAAAFDPLAANQDRLNGLHANTQVPKWIGAAREYK 297
Query: 365 VTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNML 414
TG +++ G N +F++ P +A++L ++T E+C TYNML
Sbjct: 298 ATGTTRYRDIATNAWNITVAAHTYVIGGNSQAEHFRA-PNAIAAHLATDTAEACNTYNML 356
Query: 415 KVSRHLFRWTKE---IAYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSKERSYH 470
K++R L W E AY D+YER+L N ++G Q + G + Y L PG + R+
Sbjct: 357 KLTREL--WLLEPTKAAYFDFYERALLNHLIGQQNPADAHGHICYFTGLNPGHRRGRTGP 414
Query: 471 HWG-----TPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
WG T +FWCC GTGIE+ +KL DSIYF + + + Y S L W I
Sbjct: 415 AWGGGTWSTDYSTFWCCQGTGIETNTKLADSIYFRDGTT---LTVNLYTPSTLTWSERGI 471
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PS 584
V Q ++ TLT + SG T + LRIP WTS GA +NG + +
Sbjct: 472 TVTQS----TTYPASDTTTLTVTGSASGSWT-MRLRIPAWTS--GATVAVNGTPQNVAAA 524
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
PG++ S+T++W+SDD +T++LP+ + T P+ ++ A+ YGP VLAG+
Sbjct: 525 PGSYASLTRSWTSDDTVTLRLPMRVTTAPA----PDNPNVVAVTYGPVVLAGN 573
>gi|440694505|ref|ZP_20877120.1| hypothetical protein STRTUCAR8_01091 [Streptomyces turgidiscabies
Car8]
gi|440283503|gb|ELP70762.1| hypothetical protein STRTUCAR8_01091 [Streptomyces turgidiscabies
Car8]
Length = 747
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 186/555 (33%), Positives = 282/555 (50%), Gaps = 57/555 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEP 173
++ L V LG D + R + LE+ D+++ FR A L G +P GGWE
Sbjct: 85 VQPFPLDQVALG-DGVFRRKRDLMLEFARSYPADRILAVFRANAGLDTRGAQPPGGWETA 143
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS---------GYL 224
LRGHF GH+L+ A +A T +LK K+ +V+AL CQ+ + G+L
Sbjct: 144 DGNLRGHFGGHFLTLVAQAYADTREAALKTKLDYLVTALGECQQALADHGSPRPSHPGFL 203
Query: 225 SAFPTEQFDRLEALIP---VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNR 281
+A+P QF LE+ +WAPYYT HKI+ G LD +T N +AL + + M ++ ++R
Sbjct: 204 AAYPETQFILLESYTTYPTIWAPYYTCHKIMRGFLDAHTLTGNQQALTIASKMGDWVHSR 263
Query: 282 VQNVIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQ 340
+ + + ++R W + E GGMN+VL L+ +T +HL A FD L A
Sbjct: 264 LSR-LPQAQLDRMWSIYIAGEYGGMNEVLADLYALTGRAEHLAAARCFDNTALLDACADN 322
Query: 341 ADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ-----------LESSGTNIGHFNF 389
D + G H+N HIP G ++ TG+ + + GT G F
Sbjct: 323 RDILDGRHANQHIPQFTGYIRLFDHTGEAEYATAARNFWGMVAGPRTYSLGGTGQGEM-F 381
Query: 390 KSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTE 449
++ +A+ L N E+C TYNMLK+SR LF T + AY DYYE+ LTN +L +R
Sbjct: 382 RAR-NAIAATLGDNNAETCATYNMLKLSRQLFFHTPDPAYMDYYEKGLTNHILASRRDAR 440
Query: 450 PGV---MIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE-EGKY 505
V + Y + + PG +E Y + GT CC GTG+E+ +K DS+YF +G
Sbjct: 441 STVSPEVTYFVGMGPGVVRE--YDNTGT------CCGGTGMENHTKYQDSVYFRSADGN- 491
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRV-TLTFSSKGSGLTTSLNLRIPT 564
+Y+ Y++S L W +V++Q D P V TLTF G L L LR+P+
Sbjct: 492 -ALYVNLYLASTLRWPERGLVIDQTSD-----FPGEGVRTLTFREGGGSL--DLKLRVPS 543
Query: 565 WTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYAS 623
W ++ G T+NG + PG++L++++ W D++T+ P LR E DD +
Sbjct: 544 W-ATGGFTVTVNGVPQQTAAVPGSYLTLSRNWQRGDRITVSAPYRLRIERALDD----PT 598
Query: 624 IQAILYGPYVLAGHS 638
+Q++ YGP +L S
Sbjct: 599 VQSLFYGPVLLVARS 613
>gi|456393067|gb|EMF58410.1| putative glycosylase [Streptomyces bottropensis ATCC 25435]
Length = 714
Score = 276 bits (706), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 192/533 (36%), Positives = 276/533 (51%), Gaps = 60/533 (11%)
Query: 139 LEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEPSCELRGHFVGHYLSASALMWASTH 197
L Y D+++ FR A L G P GGWE LRGH+ GH+L+ A +A T
Sbjct: 75 LNYARSYPADRILAVFRANAGLDTRGARPPGGWETSDGNLRGHYGGHFLTLVAQAYADTR 134
Query: 198 NESLKEKMSAVVSALSACQK---EIGS------GYLSAFPTEQFDRLE--ALIP-VWAPY 245
+LK K+ +V AL CQ E GS G+L+A+P QF LE A P +WAPY
Sbjct: 135 EAALKSKLDQLVGALGECQAALAERGSPRPSHPGFLAAYPETQFILLESYATYPTIWAPY 194
Query: 246 YTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT-LNEEAGG 304
YT HKI+ GLLD +T A NA+AL + + M ++ ++R+ + + +ER W + E GG
Sbjct: 195 YTCHKIMRGLLDAHTLAGNAQALTIVSRMGDWVHSRL-GALPRAQLERMWSLYIAGEYGG 253
Query: 305 MNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYE 364
MN+VL L+ +T +HL A FD L A D + G H+N HIP G ++
Sbjct: 254 MNEVLADLYALTGKAEHLAAARCFDNTALLDACAQDRDILDGRHANQHIPQFTGYLRLFD 313
Query: 365 VTGDQLHKEGHQ-----------LESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNM 413
TG++ + E + GT G FK+ +A+ LD E+C TYNM
Sbjct: 314 ETGEERYAEAARNFWGMVAGPRTYSLGGTGQGEM-FKARGA-IAATLDDKNAETCATYNM 371
Query: 414 LKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGT----EPGVMIYLLPLAPGSSKERSY 469
LK+SRHLF + A DYYER LTN +L +R T P V Y + + PG +E Y
Sbjct: 372 LKLSRHLFFREPDAARMDYYERGLTNHILASRRDTASTSSPEV-TYFVGMGPGVVRE--Y 428
Query: 470 HHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE-EGKYPGVYIIQYISSRLDWKSGQIVVN 528
+ GT CC GTG+E+ +K DS+YF +G +Y+ Y++S L W +VV
Sbjct: 429 GNTGT------CCGGTGMENHTKYQDSVYFRSADGN--ALYVNLYLASTLRWPERGLVVE 480
Query: 529 QKVDPVVSWDPYLRV-TLTFSS-KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSP 585
Q S P V TLTF +G T L LR+P+W ++ G T+NG + +P
Sbjct: 481 Q-----TSAYPAEGVRTLTFREVRG---TLDLRLRVPSW-ATGGFTVTVNGVRQQVEATP 531
Query: 586 GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
G++L++++ W D++ I P LR E DD ++Q++ +GP +L S
Sbjct: 532 GSYLTLSRNWRRGDRVGISAPYRLRVERALDD----PTVQSVFFGPLLLVAQS 580
>gi|333380462|ref|ZP_08472153.1| hypothetical protein HMPREF9455_00319 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826457|gb|EGJ99286.1| hypothetical protein HMPREF9455_00319 [Dysgonomonas gadei ATCC
BAA-286]
Length = 790
Score = 275 bits (704), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 183/559 (32%), Positives = 284/559 (50%), Gaps = 68/559 (12%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
SL DVRL DS A+ + +YLL L D+L+ F + + L E Y WE + L
Sbjct: 29 SLKDVRL-LDSPFKHAEDLDKQYLLELKADRLLSPFLRESGLTPKAESYTNWE--NTGLD 85
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ------- 231
GH GHYLSA +LM+AST ++ +KE++ +VS L CQ +GY+ P +
Sbjct: 86 GHIGGHYLSALSLMYASTGDKQIKERLDYMVSELKRCQDANDNGYIGGVPGGKAIWEEVA 145
Query: 232 --------FDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFY 279
FD L W P Y IHK AGL D Y YA++ A ++MT W +
Sbjct: 146 NGNIRAGGFD----LNGKWVPLYNIHKTYAGLRDAYLYANSDMAKEMLIKMTDWAI---- 197
Query: 280 NRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
N++ K S E+ L E GG+N+ + IT D K+L LAH F L L
Sbjct: 198 ----NLVSKLSEEQIQDMLRSEHGGLNETFADVAAITGDKKYLKLAHQFSHQLVLNPLLN 253
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HF 387
D ++G H+NT IP V+G + +V G++ E + +E +IG HF
Sbjct: 254 HEDKLTGMHANTQIPKVLGFKRIADVEGNESWSEASRFFWETVVEHRSVSIGGNSVGEHF 313
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
N +D R+ +++ E+C TYNML++S+ L++ +++ Y DYYER+L N +L Q
Sbjct: 314 NPTNDFSRVIKSIEG--PETCNTYNMLRLSKMLYQTSQDEKYMDYYERALYNHILSTQ-N 370
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
E G +Y + PG Y + P SFWCC G+GIE+ +K G+ IY + +
Sbjct: 371 PEQGGFVYFTQMRPG-----HYRVYSQPQTSFWCCVGSGIENHAKYGEMIYAHTDNE--- 422
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+Y+ +I SRL+WK + + Q+ S+ + L + + + T L LR P W
Sbjct: 423 LYVNLFIPSRLNWKEKKTEIIQE----NSFPDEAKTQLIINPEKTAAFT-LKLRYPVWVK 477
Query: 568 SNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
G K ++NG+D P+ P +++S+ + W DK+ +++P+ + E + P+ ++ +
Sbjct: 478 KWGLKVSVNGKDYPVSQDPASYISIDRKWKKGDKVVVEMPMRITVEQL----PDKSNYYS 533
Query: 627 ILYGPYVLAGHSIGDWDIT 645
I YGP LA + G D+T
Sbjct: 534 IFYGPVTLAAKT-GTEDMT 551
>gi|256376951|ref|YP_003100611.1| hypothetical protein Amir_2836 [Actinosynnema mirum DSM 43827]
gi|255921254|gb|ACU36765.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 614
Score = 275 bits (703), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 188/522 (36%), Positives = 266/522 (50%), Gaps = 46/522 (8%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALM 192
R + YL LD D+L+ FR+ L + P GGWE P+ ELRGH GH LSA A
Sbjct: 66 RNESRTHAYLKFLDPDRLLHTFRRNVGLASGATPCGGWESPTTELRGHSTGHVLSALAQA 125
Query: 193 WASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEALIPVWAPYYT 247
ST + + K K +V+ L+ACQ +GYLSAFP DR+EA VWAPYYT
Sbjct: 126 HTSTGDTAFKTKSDYLVAGLAACQDRAAAAGFNTGYLSAFPESFIDRVEARQQVWAPYYT 185
Query: 248 IHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMND 307
+HKILAGLLD + +A+AL + T + R + + + L E GGMN+
Sbjct: 186 LHKILAGLLDAHQLTGSAQALTVLTRKAAWVAWRNGRLTQA----QRQAMLGTEFGGMNE 241
Query: 308 VLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTG 367
VL L+ +T DP HL A FD LA D +SGFH+NT IP +G+ Y TG
Sbjct: 242 VLANLYQLTGDPLHLTAARYFDHAQVFDPLAAGRDALSGFHANTQIPKALGAIREYHATG 301
Query: 368 DQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKV 416
+ +++ H G + G + FK +P R+AS L +T E C T+NMLK+
Sbjct: 302 ETRYRDIARNFWNFVVGAHTYAIGGNSNGEY-FK-NPGRIASELSDSTCECCNTHNMLKL 359
Query: 417 SRHLFRWTK-EIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSKERSYHHWGT 474
+R LFR D++E++L N +LG Q + G Y +PL G + S +
Sbjct: 360 TRQLFRTEPGRPELFDFHEKALYNHLLGAQNPDSAHGHHSYYVPLRAGGQRTFSNDY--- 416
Query: 475 PSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPV 534
F CC+GTG+E+ +K DSIYF +++ +I S L W I V Q D
Sbjct: 417 --QDFTCCHGTGMETNTKHRDSIYFHGGET---LWVNLFIPSTLTWPGRGITVRQ--DTG 469
Query: 535 VSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKT 594
++T+T S + L LR+P W + GA+ LNG + +PG + + +T
Sbjct: 470 FPDTASTKLTITGSGR-----VDLRLRVPAW--ATGARLRLNGAPV-AATPGGYARIDRT 521
Query: 595 WSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
W+S D + + LP+ L E+ DD + Q + +GP VLAG
Sbjct: 522 WASGDTVELTLPMALTRESAPDD----PAAQVVKHGPIVLAG 559
>gi|189464178|ref|ZP_03012963.1| hypothetical protein BACINT_00515 [Bacteroides intestinalis DSM
17393]
gi|189437968|gb|EDV06953.1| hypothetical protein BACINT_00515 [Bacteroides intestinalis DSM
17393]
Length = 777
Score = 275 bits (703), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 194/603 (32%), Positives = 304/603 (50%), Gaps = 60/603 (9%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
S+ DVRL DS A N +++ LD+D+L+ NFRK A L EPYG WE S +
Sbjct: 40 SIQDVRL-LDSPFLHAMNQNEQWMKELDLDRLLSNFRKNANLKPKAEPYGSWE--SMGIA 96
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLE 236
GH +GH L+A + +A+T +E+ K K+ VV+ L +CQ +G++ P + F ++
Sbjct: 97 GHTLGHLLTAMSQHYAATGDETFKAKIDYVVNELDSCQMNFVNGFIGGMPGGDKVFKEVK 156
Query: 237 ALI---------PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
I +W P+Y HK + GL D Y A N A ++ + +Y + +VI
Sbjct: 157 KGIIRSMGFDLNGIWVPWYNEHKTMMGLNDAYLLAGNETAKKVLINLSDY----LADVIA 212
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
S E+ LN E GGMN+ +++ +T D K L ++ F LA D + G
Sbjct: 213 PLSEEQMQTMLNCEYGGMNEAFAQMYALTGDKKFLDASYAFYHKRLQDKLAEGVDVLQGL 272
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRL 396
HSNT IP +IGS +YE+TG+ +E H + G ++G + S P +L
Sbjct: 273 HSNTQIPKLIGSARQYELTGNHRDEEIARFSWETIVHHHSYANGGNSMGEY--LSVPDKL 330
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
+ L +NT E+C TYNMLK++ HL+ WT ++ Y DYYER+L N +L Q E G + Y
Sbjct: 331 NNRLGTNTCETCNTYNMLKLTAHLYEWTNDVQYLDYYERALYNHILASQH-PETGNVCYF 389
Query: 457 LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
L L G+ K +G+ ++F CC G+G E+ SK G +IY GK + I YI S
Sbjct: 390 LSLGMGTHK-----GFGSRHNNFSCCMGSGFENHSKYGGAIYSYVPGK-EMMNINLYIPS 443
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
L WK + + D + + +V + S ++NLR P W + + A +N
Sbjct: 444 VLTWKEKSLKLRMTTD----YPEHGKVVIKLEET-SKEPLTINLRRPVWAAGDVA-IRIN 497
Query: 577 GQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G + S PG+F+S+ + W +D + + LP+ L T ++ P+ +A+ YGP +LA
Sbjct: 498 GSKQKVESVPGSFISLHRKWKKNDVIELILPMPLYTVSM----PDNVDRRAVFYGPTILA 553
Query: 636 G------HSIGDWDI-TESATSLSDWITPIPASYNSQLITFTQEYGNTKFV----LTNSN 684
G +GD + SL+++I I + S + T N K + + + N
Sbjct: 554 GTFGTEKRKMGDIPVFVSEEKSLTNYIKKISDTSVSFVTTLPGGPDNVKMLPFYKVADEN 613
Query: 685 QSI 687
Q++
Sbjct: 614 QTV 616
>gi|436837799|ref|YP_007323015.1| hypothetical protein FAES_4423 [Fibrella aestuarina BUZ 2]
gi|384069212|emb|CCH02422.1| hypothetical protein FAES_4423 [Fibrella aestuarina BUZ 2]
Length = 781
Score = 275 bits (703), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 183/538 (34%), Positives = 279/538 (51%), Gaps = 62/538 (11%)
Query: 128 DSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLS 187
DS A + + +LL L D+L+ FR A L YGGWE S L GH +GHYLS
Sbjct: 52 DSPFKTAMEADTRFLLNLQPDRLLAQFRAHAGLAPKAAKYGGWE--SSGLAGHSLGHYLS 109
Query: 188 ASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA---------- 237
A AL +A+T++ ++++ +V L+ CQ+ +GY+ A P E E
Sbjct: 110 ALALQYAATNDPEYLKRVNYIVDELADCQRARKTGYVGAIPREDTVFAEVAQGNIRSRGF 169
Query: 238 -LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQ 296
L W+P+YT+HK++AGLLD Y YA N +AL +T M ++ +K + E+ +
Sbjct: 170 DLNGAWSPWYTVHKVMAGLLDAYLYAHNDKALAVTVGMADW----TGETLKNLTDEQVQK 225
Query: 297 TLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIV 356
L E GGMNDVL ++ +T + K+L L++ F L LA Q D + G H+NT +P +
Sbjct: 226 MLLCEYGGMNDVLANIYALTGNKKYLDLSYKFHDRVVLDSLAHQKDILPGRHANTQVPKL 285
Query: 357 IGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEE 406
IG+ RYE+TG Q H + G N ++ + S P +L L NT E
Sbjct: 286 IGTIRRYELTGSQPDLAMSDFFWKTVVNHHTYAPGGN-SNYEYLSTPDQLTDKLTDNTME 344
Query: 407 SCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKE 466
+C T+NMLK++RHLF AY DYYER+L N +L Q + G++ Y +PL G+ K
Sbjct: 345 TCNTHNMLKLTRHLFALQPNAAYMDYYERALYNHILASQH-HKTGMVCYFVPLRMGTRK- 402
Query: 467 RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW--KSGQ 524
H+ + F CC GTG+E+ K G+SI+F +G +++ +I S L+W K +
Sbjct: 403 ----HFSDEEEDFTCCVGTGMENHVKYGESIFF--KGADQSLFVNLFIPSELNWAEKGLR 456
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS------NGAKATLNGQ 578
+ +N + DP +R+T+ + K + L + LR P W + NG AT Q
Sbjct: 457 LTLNANLPA----DPTVRLTVQ-ADKPTKL--PIRLRKPYWLAGPMQVRVNGKAATSTVQ 509
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
D ++ + + W + D + + LP +LR + P+ + QA YGP +LAG
Sbjct: 510 D-------GYVVIDQRWKTGDVVELTLPASLRAMPM----PDNIARQAFFYGPVLLAG 556
>gi|188991168|ref|YP_001903178.1| hypothetical protein xccb100_1772 [Xanthomonas campestris pv.
campestris str. B100]
gi|167732928|emb|CAP51124.1| Putative secreted protein [Xanthomonas campestris pv. campestris]
Length = 791
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 198/608 (32%), Positives = 292/608 (48%), Gaps = 66/608 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 IRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + S +V+ L+ CQ +G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTDDAHCRTRASYLVAELARCQAHVGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVAL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGIFAALDDTQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHTVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C++YNMLK++RHL++W + AY DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCSSYNMLKLTRHLYQWGPQAAYFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ P+ G ++ W +P D FWCC G+G+E+ ++ GDSIY+E+
Sbjct: 400 QQHPRTGMFTYMTPMLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWEDG-- 452
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GV I Y+ SR+ +G + P V+L + + T L+LR+P
Sbjct: 453 -QGVAINLYVPSRVRNAAGLDMTLHSALPAQG-----SVSLRIDAAPAAQRT-LSLRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W ++ + LNG + + +L VT+TW D L + L + LR EA DD P + S
Sbjct: 506 WAAAPVLQ--LNGAVVDAAAVDGYLRVTRTWHPGDTLNLSLQMPLRLEATPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSN 684
+L GP VLA D+ ++AT S TP + L G +V ++
Sbjct: 562 --VLRGPLVLAA------DLGDAATPWSG-KTPALIGGDEVLQQLQPAAGQGSYVYSDGA 612
Query: 685 QSITMEKF 692
Q F
Sbjct: 613 QQWRFSPF 620
>gi|115399582|ref|XP_001215378.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114192261|gb|EAU33961.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 614
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 195/552 (35%), Positives = 276/552 (50%), Gaps = 59/552 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWEEP 173
L E+SL D R + Q+ L YL +D ++L+ NFR +L G GGW+ P
Sbjct: 31 LSELSLGDGRFLDN------QERTLSYLKFVDTERLLLNFRANHKLDTKGAVANGGWDAP 84
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFP 228
+ R H GH+L+A A +A + +E+ + VS L+ CQ +GYLS FP
Sbjct: 85 TFPFRTHVQGHFLTAWAQCYAVLGDTDCQERATYFVSELAKCQANNEAAGFKTGYLSGFP 144
Query: 229 TEQFDRLEA--LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVI 286
FD LEA L PYY IHK LAGLLD + + A + + + R +
Sbjct: 145 ESDFDALEAGTLNNGNVPYYNIHKTLAGLLDVWRLVGDTTARDVLLALAGWVDTRTSAL- 203
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
S + L E GGMNDVL L+ T D K L A FD LA D ++G
Sbjct: 204 ---SEAQMQSVLGTEFGGMNDVLADLYHQTSDEKWLKTAQRFDHAAVFDPLAANEDQLNG 260
Query: 347 FHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRL 396
H+NT +P IG+ Y+ TGD + + + G N +F + P +
Sbjct: 261 LHANTQVPKWIGAVREYKATGDTRYLDIARNAWTITVNAHTYAIGANSQAEHFHA-PNAI 319
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWT---KEIAYADYYERSLTNGVLGIQRGTEP-GV 452
A LDS+T E+C +YNMLK++R L WT + Y D+YE +L N +LG Q + G
Sbjct: 320 AQYLDSDTAEACNSYNMLKLTREL--WTLDPENTTYFDFYENALLNHLLGQQNPADSHGH 377
Query: 453 MIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGV 508
+ Y L PG ++ W T DSFWCC GT +E+ +KL DSI+F + +
Sbjct: 378 ITYFTSLNPGGNRGVGPAWGGGTWSTDYDSFWCCQGTALETNTKLMDSIFFHSDS---AL 434
Query: 509 YIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS 568
Y+ Q+I S L W + V Q VS T+T G+G L +RIP+WTS+
Sbjct: 435 YVNQFIPSVLTWSEKGVKVTQSTTFPVS------DTITLDIDGNG-DWELYVRIPSWTSN 487
Query: 569 NGAKATLNGQ---DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
A T+NG+ D+ + SPG++ + +TW+S DK+ IQLP+ LRT DD S+
Sbjct: 488 --AAITINGEQVTDVDV-SPGSYAKIARTWASGDKVQIQLPMHLRTVPANDD----PSLM 540
Query: 626 AILYGPYVLAGH 637
AI YGP +L+G+
Sbjct: 541 AIAYGPVILSGN 552
>gi|220928663|ref|YP_002505572.1| hypothetical protein Ccel_1236 [Clostridium cellulolyticum H10]
gi|110588920|gb|ABG76968.1| CBM22- and dockerin-containing enzyme [Clostridium cellulolyticum
H10]
gi|219998991|gb|ACL75592.1| protein of unknown function DUF1680 [Clostridium cellulolyticum
H10]
Length = 955
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 183/573 (31%), Positives = 287/573 (50%), Gaps = 54/573 (9%)
Query: 113 EFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEE 172
E LK+ + V++ +D+ + A + YL +D ++L+ F+KTA L YGGWE
Sbjct: 33 ELLKQFDMEQVKI-TDTYYVNALNKEVAYLQAIDPNRLLVGFKKTAGLSTTYSYYGGWEN 91
Query: 173 PSCELRGHFVGHYLSASALMWASTH-----NESLKEKMSAVVSALSACQKEIGSGYLSAF 227
+ ++GH +GHY+SA A + +T N LK ++ ++S L ACQ + G+GYL A
Sbjct: 92 NTL-IQGHTMGHYMSALAQAYKNTKSDPTVNADLKSRIDLIISELQACQNKNGNGYLFAT 150
Query: 228 PTEQFDRLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNV 285
P QFD +E A W P+YT+HKI++GLLD Y + N AL + T + + Y RV
Sbjct: 151 PATQFDVVEGKASGSSWVPWYTMHKIMSGLLDIYKFGGNQTALTIATNLGNWIYKRVN-- 208
Query: 286 IKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDIS 345
+ + L E GGMND LY+L+ +T + HL AH FD+ +A + +
Sbjct: 209 --AWDSATQSRVLGVEYGGMNDCLYELYKLTGNGNHLTAAHKFDENSLFNTIAAGTNVLP 266
Query: 346 GFHSNTHIPIVIGSQMRYEVTG---DQLHKEGHQLES---------SGTNIGHFNFKSDP 393
G H+NT IP IG+ RY G K Q + +G N F+ D
Sbjct: 267 GKHANTTIPKFIGALNRYSTLGTSESSYLKAAQQFWAIVLKDHTYVTGGNSEDERFR-DA 325
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
+L + D+ E+C NMLK+++ LF+ T ++ YADYYE +L N ++ Q E G+
Sbjct: 326 GKLDAYRDNVNNETCNVNNMLKLTKELFKATGDVKYADYYENALINEIMASQN-PETGMA 384
Query: 454 IYLLPLAPGSSKERS--YHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
Y + G K S ++H FWCC GTG+E+F+KL DS+Y+ +Y+
Sbjct: 385 TYFKAMGTGYFKVFSSQFNH-------FWCCTGTGMENFTKLNDSLYYNNGSD---LYVN 434
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS-NG 570
Y+SS L+W + + Q+ + +S +VT T +S S + R P W ++
Sbjct: 435 MYLSSTLNWSEKGLSLTQQANLPLS----DKVTFTINSASSS-EVKIKFRSPAWIAAGQN 489
Query: 571 AKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NG + + +L V++ W + D + + LP +R + D + A YG
Sbjct: 490 ITVKVNGTPINVDKANGYLDVSRVWQTGDTVELTLPTEVRVSRLTDS----PNTVAFTYG 545
Query: 631 PYVLAGHSIGDWDITESATSLSDWITPIPASYN 663
P VL+ +G TES T+ S + + A+ N
Sbjct: 546 PVVLSA-GLG----TESMTTQSHGVQVLKATKN 573
>gi|302867043|ref|YP_003835680.1| hypothetical protein Micau_2566 [Micromonospora aurantiaca ATCC
27029]
gi|302569902|gb|ADL46104.1| protein of unknown function DUF1680 [Micromonospora aurantiaca ATCC
27029]
Length = 917
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 180/529 (34%), Positives = 274/529 (51%), Gaps = 46/529 (8%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY-GGWEEPSCELRGHFVGHYLSASALMW 193
Q + YL +DV++L++NFR RL G GGW+ P+ R H GH+L+A A W
Sbjct: 71 QNRTVAYLRFVDVNRLLYNFRANHRLSTGGAAANGGWDAPNFPFRTHMQGHFLTAWAQAW 130
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGS-----GYLSAFPTEQFDRLEA--LIPVWAPYY 246
A + + ++K +V+ L+ CQ G+ GYLS FP F LEA L PYY
Sbjct: 131 AVLGDTTCRDKALTMVAELARCQANNGAAGFSAGYLSGFPESDFTALEARTLSNGNVPYY 190
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN 306
IHK LAGLLD + + +A + + + R + + + L E GGMN
Sbjct: 191 CIHKTLAGLLDVWRLIGSTQARDVLLALAGWVDQRT----GRLTSAQMQAMLGTEFGGMN 246
Query: 307 DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVT 366
VL L+ T D + L +A FD LA +D ++G H+NT +P IG+ Y+ T
Sbjct: 247 AVLTDLYQQTGDGRWLTVAQRFDHAAVFNPLAANSDQLNGLHANTQVPKWIGAAREYKAT 306
Query: 367 GDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKV 416
G +++ G + G N +F++ P +A L ++T E+C TYNMLK+
Sbjct: 307 GVTRYRDIAANAWAITVGAHTYAIGGNSQAEHFRA-PNAIAGYLRNDTCEACNTYNMLKL 365
Query: 417 SRHLFRWTKE-IAYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----ERSYH 470
+R L++ + +AYAD+YER+L N ++G Q + G + Y PL PG +
Sbjct: 366 TRELWQLDPDRVAYADFYERALLNHMIGQQNPADAHGHVTYFTPLNPGGRRGVGPAWGGG 425
Query: 471 HWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK 530
W T +SFWCC GTG+E+ + L D+IYF + + ++ S L W I V Q
Sbjct: 426 TWSTDYNSFWCCQGTGLETNTTLADAIYFHNGTT---LTVNLFVPSVLTWSQRGITVTQA 482
Query: 531 VD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNF 588
PV +T+T S GS ++ +RIP WTS GA ++NG + + PG++
Sbjct: 483 TSYPV---GDTTTLTVTGSVAGS---WTMRIRIPAWTS--GASVSVNGVAAGIAATPGSY 534
Query: 589 LSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+T+ W+S D +T++LP+ + T A DD A++QA+ YGP VL+G+
Sbjct: 535 AVLTRAWTSGDTVTVRLPMRVTTVAANDD----AAVQAVTYGPVVLSGN 579
>gi|315506549|ref|YP_004085436.1| hypothetical protein ML5_5828 [Micromonospora sp. L5]
gi|315413168|gb|ADU11285.1| protein of unknown function DUF1680 [Micromonospora sp. L5]
Length = 917
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 180/529 (34%), Positives = 274/529 (51%), Gaps = 46/529 (8%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY-GGWEEPSCELRGHFVGHYLSASALMW 193
Q + YL +DV++L++NFR RL G GGW+ P+ R H GH+L+A A W
Sbjct: 71 QNRTVAYLRFVDVNRLLYNFRANHRLSTGGAAANGGWDAPNFPFRTHMQGHFLTAWAQAW 130
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGS-----GYLSAFPTEQFDRLEA--LIPVWAPYY 246
A + + ++K +V+ L+ CQ G+ GYLS FP F LEA L PYY
Sbjct: 131 AVLGDTTCRDKALTMVAELARCQANNGAAGFSAGYLSGFPESDFTALEARTLSNGNVPYY 190
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN 306
IHK LAGLLD + + +A + + + R + + + L E GGMN
Sbjct: 191 CIHKTLAGLLDVWRLIGSTQARDVLLALAGWVDQRT----GRLTSAQMQAMLGTEFGGMN 246
Query: 307 DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVT 366
VL L+ T D + L +A FD LA +D ++G H+NT +P IG+ Y+ T
Sbjct: 247 AVLTDLYQQTGDGRWLTVAQRFDHAAVFNPLAANSDQLNGLHANTQVPKWIGAAREYKAT 306
Query: 367 GDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKV 416
G +++ G + G N +F++ P +A L ++T E+C TYNMLK+
Sbjct: 307 GVTRYRDIAANAWAITVGAHTYAIGGNSQAEHFRA-PNAIAGYLRNDTCEACNTYNMLKL 365
Query: 417 SRHLFRWTKE-IAYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----ERSYH 470
+R L++ + +AYAD+YER+L N ++G Q + G + Y PL PG +
Sbjct: 366 TRELWQLDPDRVAYADFYERALLNHMIGQQNPADAHGHVTYFTPLNPGGRRGVGPAWGGG 425
Query: 471 HWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK 530
W T +SFWCC GTG+E+ + L D+IYF + + ++ S L W I V Q
Sbjct: 426 TWSTDYNSFWCCQGTGLETNTTLADAIYFHNGTT---LTVNLFVPSVLTWSQRGITVTQA 482
Query: 531 VD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNF 588
PV +T+T S GS ++ +RIP WTS GA ++NG + + PG++
Sbjct: 483 TSYPV---GDTTTLTVTGSVAGS---WTMRIRIPAWTS--GASVSVNGVAAGIAATPGSY 534
Query: 589 LSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+T+ W+S D +T++LP+ + T A DD A++QA+ YGP VL+G+
Sbjct: 535 AVLTRAWTSGDTVTVRLPMRVTTVAANDD----AAVQAVTYGPVVLSGN 579
>gi|374983575|ref|YP_004959070.1| putative glycosylase [Streptomyces bingchenggensis BCW-1]
gi|297154227|gb|ADI03939.1| putative glycosylase [Streptomyces bingchenggensis BCW-1]
Length = 713
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 190/555 (34%), Positives = 281/555 (50%), Gaps = 57/555 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEP 173
++ L V LG D + R + L Y D+++ FR A L G P GGWE
Sbjct: 51 IRPFPLDGVTLG-DGVFRRKRDLMLGYARSYPADRILAVFRANAGLDTRGARPPGGWETS 109
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS---------GYL 224
LRGH+ GH+L+ A +A T +LK K+ +V AL CQK + GYL
Sbjct: 110 DGNLRGHYGGHFLTLIAQAYADTREAALKTKLDYLVGALGECQKALADHGSPIPSHPGYL 169
Query: 225 SAFPTEQFDRLEALIP---VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNR 281
+A+P QF LE+ +WAPYYT HKI+ GLLD +T N +AL++ + M ++ ++R
Sbjct: 170 AAYPETQFILLESYTTYPTIWAPYYTCHKIMRGLLDAHTLGGNQQALQIASGMGDWVHSR 229
Query: 282 VQNVIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQ 340
+ + + +ER W + E GGMN+VL L+ +T +HL A FD L A
Sbjct: 230 LGH-LPAAQLERMWSIYIAGEYGGMNEVLADLYALTGRAEHLAAARCFDNTALLKACAEN 288
Query: 341 ADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLES-SGTNIGHFNF 389
D + G H+N HIP G ++ T Q + G ++ S GT G F
Sbjct: 289 RDILEGRHANQHIPQFTGYLRLFDHTAKQEYSSAARNFWGMVTGSRMYSLGGTGQGEM-F 347
Query: 390 KSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR--- 446
++ +A+ LD E+C TYNMLK++R LF + AY DYYER LTN +L +R
Sbjct: 348 RARGA-IAATLDDKNAETCATYNMLKLTRQLFFHQPDPAYMDYYERGLTNHILASRRDAA 406
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE-EGKY 505
T+ + Y + + PG +E + + GT CC GTG+E+ +K DS+YF +G
Sbjct: 407 ATDSPEVTYFVGMGPGVRRE--FDNTGT------CCGGTGMENHTKYQDSVYFRSADGN- 457
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
+Y+ Y++S L W V+ Q D P TLTF +GSG L LR+P
Sbjct: 458 -ALYVNLYLASTLRWPERGFVIEQSSDFPAEGVR-----TLTF-REGSG-RLDLRLRVPA 509
Query: 565 WTSSNGAKATLNG-QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYAS 623
W ++ G T+NG + PG++LS+++ W D++ I P +LR E DD +
Sbjct: 510 WATA-GFTVTVNGVRQRAEAEPGSYLSLSRDWRPGDRVRISAPNSLRIERALDD----PT 564
Query: 624 IQAILYGPYVLAGHS 638
+Q++ YGP +L S
Sbjct: 565 VQSVFYGPVLLTAQS 579
>gi|337746495|ref|YP_004640657.1| hypothetical protein KNP414_02226 [Paenibacillus mucilaginosus
KNP414]
gi|336297684|gb|AEI40787.1| hypothetical protein KNP414_02226 [Paenibacillus mucilaginosus
KNP414]
Length = 749
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 187/543 (34%), Positives = 281/543 (51%), Gaps = 57/543 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
LH VR+ S + A + N YLL L+ D+L+ FR+ A L Y GWE S + G
Sbjct: 8 LHKVRIESGPLK-HAMELNASYLLNLEADRLLSRFREYAGLAPKAPHYEGWE--SRGISG 64
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA 237
H +GHYLS ALM+AST E L +++ VV L CQ+ GSG++S P E F ++A
Sbjct: 65 HTLGHYLSGCALMYASTGREELLSRVNYVVEELQQCQRADGSGFISGIPRGKELFQEVKA 124
Query: 238 ---------LIPVWAPYYTIHKILAGLLDQYTYADNAEAL----RMTTWMVEYFYNRVQN 284
L W P YT+HK+ AGL D Y A + +AL ++ W+ +
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLAGSRKALEIEIKLGLWL--------DD 176
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
V S E+ + L+ E GGMN+VL L + D + L LA F LG +A + D +
Sbjct: 177 VFSGLSHEQVQRVLHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAERKDTL 236
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPK 394
G H+NT IP +IG+ +YEVTG++ + H G N + +F +P
Sbjct: 237 GGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHF-GEPD 295
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI 454
+L L T E+C TYNMLK++RHLF+W AYADYYER++ N +LG Q+ + G +
Sbjct: 296 KLNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILGSQQPVD-GRVC 354
Query: 455 YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
Y + L G K + + + F CC G+G+ES S G +IYF +++ Q++
Sbjct: 355 YFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSLYGSAIYFHNG---SALFVNQFV 406
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
S ++W+ + + Q+ ++ R L + G T ++ +R P+W G
Sbjct: 407 PSTVEWEEQGVRLTQE----TAFPENGRGVLRIRTAKPG-TFAVKVRYPSWAEP-GISVK 460
Query: 575 LNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NGQ + + PG +++V + W D L P+TLR E++ D+ P+ A+LYGP V
Sbjct: 461 VNGQAVSADARPGGYVTVEREWQDGDTLEYDFPMTLRIESMPDN-PDRI---ALLYGPLV 516
Query: 634 LAG 636
LAG
Sbjct: 517 LAG 519
>gi|345302361|ref|YP_004824263.1| hypothetical protein Rhom172_0482 [Rhodothermus marinus
SG0.5JP17-172]
gi|345111594|gb|AEN72426.1| protein of unknown function DUF1680 [Rhodothermus marinus
SG0.5JP17-172]
Length = 641
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 193/558 (34%), Positives = 286/558 (51%), Gaps = 63/558 (11%)
Query: 110 RSGEFLKEVSL--HDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY 167
RS E L+ + VRL DS A Q ++ YL LD D+L+ FR+ A L Y
Sbjct: 31 RSRERLRAFAFPPRAVRL-LDSPFLEAMQRDVAYLFELDPDRLLSRFRRFAGLEPKAPEY 89
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE S + GH +GHYLSA ++ +A+T +E + ++ +VS L+ Q+ G+GY+ A
Sbjct: 90 GGWE--SQGISGHTLGHYLSALSMYYAATGDEKARARIDYIVSELAEVQRAHGNGYVGAI 147
Query: 228 PTEQFDRLEALIP--------------VWAPYYTIHKILAGLLDQYTYADNAEALRMTTW 273
P + DRL A I W P+YT+HKI GL+D Y Y N +AL + T
Sbjct: 148 P--EGDRLWAEIARGEIWQAEPFSLNGAWVPWYTMHKIFQGLIDAYWYGGNEQALEVVTR 205
Query: 274 MVEYFYNRVQNVIKKYSIERHWQ-TLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPC 332
+ ++ Y +N+ WQ L E GGMN+ L L+ IT +PKH L+ F
Sbjct: 206 LADWAYETTKNLTPA-----QWQQMLRTEHGGMNEALANLYSITGNPKHRELSQKFYHAA 260
Query: 333 FLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTG-DQLHKEG---------HQLESSGT 382
L LA +++G H+NT IP VIG +YE+ G D L H G
Sbjct: 261 VLSPLARGIPNLTGLHANTQIPKVIGVVRQYELIGSDSLRAVAEFFWEEVVQHHTYVIGG 320
Query: 383 NIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE-IAYADYYERSLTNGV 441
N + +F LA+ L T E+C TYNML+++RHLF E + Y D+YER+L N +
Sbjct: 321 NSQNEHFGPR-DSLANRLGEGTAETCNTYNMLRLTRHLFALHPEKVRYVDFYERALYNHI 379
Query: 442 LGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
L Q + G+ Y + L PG K + TP +SFWCC GTG+E+ K + IYF
Sbjct: 380 LASQ-DPKHGMFTYYMSLRPGHFKT-----YATPENSFWCCVGTGMENHVKYNEFIYF-- 431
Query: 502 EGKYPG--VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
Y G +Y+ +I S L+W+ + + + ++ RV L F + +
Sbjct: 432 ---YNGDTLYVNLFIPSELNWERRALRLRLE----TAFPESNRVRLDFDPEVPQRLV-VK 483
Query: 560 LRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
+R P+W + + + +NG+ + S PG++L++ + W D++ I LP+ LR E + D+
Sbjct: 484 VRHPSW-AQDALEVRINGEVQSVTSRPGSYLTLARLWQPGDEVEITLPMRLRVETMPDNP 542
Query: 619 PEYASIQAILYGPYVLAG 636
+ AILYGP VLAG
Sbjct: 543 DRF----AILYGPIVLAG 556
>gi|376260258|ref|YP_005146978.1| hypothetical protein [Clostridium sp. BNL1100]
gi|373944252|gb|AEY65173.1| hypothetical protein Clo1100_0916 [Clostridium sp. BNL1100]
Length = 952
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 186/583 (31%), Positives = 289/583 (49%), Gaps = 58/583 (9%)
Query: 105 FKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG 164
V S E LK+ + V++ +D+ + A + YL +D ++L+ F+K A L
Sbjct: 25 LSVSAASVEALKQFDMEQVKI-TDAYYVNAFNKEVAYLRAIDPNRLLVGFKKAAGLSTTY 83
Query: 165 EPYGGWEEPSCELRGHFVGHYLSASALMWASTH-----NESLKEKMSAVVSALSACQKEI 219
YGGWE + ++GH +GHY+SA A + +T N LK ++ ++S L ACQ +
Sbjct: 84 SYYGGWENNTL-IQGHTMGHYMSALAQAYKNTKSDATVNADLKSRIDLIISELQACQNKN 142
Query: 220 GSGYLSAFPTEQFDRLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEY 277
G+GYL A P QFD +E A W P+YT+HKI++GLLD Y + N AL + T + +
Sbjct: 143 GNGYLFATPVTQFDVVEGKASGSSWVPWYTMHKIMSGLLDVYKFEGNQTALTIATNLGNW 202
Query: 278 FYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLL 337
Y RV + + L E GGMND LY+L+ +T + HL AH FD+ +
Sbjct: 203 IYKRVN----AWDSATQSKVLGVEYGGMNDCLYELYKLTGNSNHLTAAHKFDETSLFNTI 258
Query: 338 ALQADDISGFHSNTHIPIVIGSQMRYEVTG--------------DQLHKEGHQLESSGTN 383
A + + G H+NT IP IG+ RY G + + K+ + +
Sbjct: 259 AAGTNVLPGKHANTTIPKFIGALNRYRTLGTTESSYLTAAQQFWNIVLKDHTYVTGGNSE 318
Query: 384 IGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG 443
HF +L + D+ E+C NMLK++R LF+ T ++ YADYYE +L N ++
Sbjct: 319 DEHFRAAG---KLDAYRDNVNNETCNVNNMLKLTRELFKVTGDVKYADYYENALINEIMA 375
Query: 444 IQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEG 503
Q E G+ Y + G K S D FWCC GTG+E+F+KL DS+Y+
Sbjct: 376 SQN-PETGMATYFKAMGTGYFKVFSSQF-----DHFWCCTGTGMENFTKLNDSLYYNNGS 429
Query: 504 KYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIP 563
+Y+ Y+SS L+W + + Q+ + +S +VT T +S S + R P
Sbjct: 430 D---LYVNMYLSSILNWSEKGLSLTQQANLPLS----DKVTFTINSAPSS-EVKIKFRSP 481
Query: 564 TWTSSNGAKAT--LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
+W ++ G AT +NG + + +L V++ W + D + + LP +R + D+
Sbjct: 482 SWIAA-GQTATVKVNGTSINIAKVNGYLDVSRVWQAGDTVELTLPTEVRVSRLTDN---- 536
Query: 622 ASIQAILYGPYVL-AGHSIGDWDITESATSLSDWITPIPASYN 663
+ A YGP VL AG I ES T+ S + + A+ N
Sbjct: 537 PNAVAFTYGPVVLSAGLGI------ESMTTQSHGVQVLKATKN 573
>gi|333381736|ref|ZP_08473415.1| hypothetical protein HMPREF9455_01581 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829665|gb|EGK02311.1| hypothetical protein HMPREF9455_01581 [Dysgonomonas gadei ATCC
BAA-286]
Length = 775
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 189/551 (34%), Positives = 290/551 (52%), Gaps = 61/551 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
LK SL DVRL S S A + ++LL + D+ + FR + L YGGWE S
Sbjct: 35 LKPFSLSDVRLTS-SPFMSAMSLDEKWLLSFEPDRFLSGFRSESGLQPKAPKYGGWE--S 91
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIG-SGYLSAFP----- 228
+ G GHYLSA ++M+AST NE L +++ ++ L +CQ+ G +G ++AFP
Sbjct: 92 QGVAGQTFGHYLSALSMMYASTGNEQLNDRIKYSINELDSCQQAFGMNGIVAAFPRAKGL 151
Query: 229 ----------TEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYF 278
TE FD L W P Y++HK+ AGL+D Y Y N +A ++ + +
Sbjct: 152 FTEISTGDIRTEGFD----LNGGWVPLYSMHKLFAGLIDVYEYTGNKQAYKIYINLAD-- 205
Query: 279 YNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLA 338
V ++ S E+ + L E GG+N+ L +++ +T + K+L LA + L L+
Sbjct: 206 --GVDKMLSGLSDEQIQKILICEHGGINESLAEVYALTGNKKYLNLATRLNHKAVLDPLS 263
Query: 339 LQADDISGFHSNTHIPIVIGSQMRYEVTG-DQLHKEGH-----QLESSGTNIG------H 386
D+++G H+NT IP VIG YE+TG D L K + S IG H
Sbjct: 264 KGVDELAGKHANTQIPKVIGVIREYELTGNDDLFKTAEFFWNTVVHSHSYVIGGNSEAEH 323
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
F R + T E+C TYNMLK+++HLF +I ADYYER+L N +L Q
Sbjct: 324 FGVAG---RTYDRITDKTCENCNTYNMLKLTKHLFSLQPDIQKADYYERALYNQILASQ- 379
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
+ G++ Y+ PLA GS + S TP DSFWCC GTG+E+ ++ G+ IYF ++ K
Sbjct: 380 NPQDGMVCYMSPLAAGSRRGFS-----TPFDSFWCCVGTGLENHARYGEFIYFSDKDK-- 432
Query: 507 GVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
++I +I S+LDWK +V+ Q + ++ V +K + T +N+R P W
Sbjct: 433 NLFINLFIPSKLDWKDRNMVIEQ----ITNFPESDTVRYKIKAKKTQEFT-VNIRYPLW- 486
Query: 567 SSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
+ +G +NG+ + + SPGN++ +T+ W ++D + LP L +EA D +++
Sbjct: 487 AQDGFSLFVNGKRVEINSSPGNYIQLTRKWKNNDDICYVLPKRLLSEAALGD----TNLR 542
Query: 626 AILYGPYVLAG 636
A LYGP VL+
Sbjct: 543 AYLYGPIVLSA 553
>gi|386723005|ref|YP_006189331.1| hypothetical protein B2K_12670 [Paenibacillus mucilaginosus K02]
gi|384090130|gb|AFH61566.1| hypothetical protein B2K_12670 [Paenibacillus mucilaginosus K02]
Length = 749
Score = 273 bits (697), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 187/543 (34%), Positives = 281/543 (51%), Gaps = 57/543 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
LH VR+ S + A + N YLL L+ D+L+ FR+ A L Y GWE S + G
Sbjct: 8 LHKVRIESGPLK-HAMELNASYLLNLEADRLLSRFREYAGLAPKAPHYEGWE--SRGISG 64
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA 237
H +GHYLS ALM+AST E L +++ VV L CQ+ GSG++S P E F+ ++A
Sbjct: 65 HTLGHYLSGCALMYASTGREELLSRVNYVVEELEQCQRADGSGFISGIPRGKELFEEVKA 124
Query: 238 ---------LIPVWAPYYTIHKILAGLLDQYTYADNAEAL----RMTTWMVEYFYNRVQN 284
L W P YT+HK+ AGL D Y + +AL ++ W+ +
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLTGSRKALEIEIKLGLWL--------DD 176
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
V S E+ + L+ E GGMN+VL L + D + L LA F LG +A + D +
Sbjct: 177 VFSGLSHEQVQRVLHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAERKDTL 236
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPK 394
G H+NT IP +IG+ +YEVTG++ + H G N + +F +P
Sbjct: 237 GGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHF-GEPD 295
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI 454
+L L T E+C TYNMLK++RHLF+W AYADYYER++ N +L Q+ + G +
Sbjct: 296 KLNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILASQQPVD-GRVC 354
Query: 455 YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
Y + L G K + + + F CC G+G+ES S G +IYF +++ Q++
Sbjct: 355 YFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSLYGSAIYFHSGST---LFVNQFV 406
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
S +DW+ + + Q+ S+ R L + G T ++ +R P+W + G
Sbjct: 407 PSTVDWEEQGVRLTQE----TSFPENGRGVLRIRTAKPG-TFAVKVRYPSW-AEPGISVK 460
Query: 575 LNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NGQ + + PG +++V + W D L P+TLR E++ D+ P+ A+LYGP V
Sbjct: 461 VNGQAVSADARPGGYVTVEREWQDGDTLEYDFPMTLRIESMPDN-PDRI---ALLYGPLV 516
Query: 634 LAG 636
LAG
Sbjct: 517 LAG 519
>gi|325281981|ref|YP_004254523.1| hypothetical protein Odosp_3391 [Odoribacter splanchnicus DSM
20712]
gi|324313790|gb|ADY34343.1| protein of unknown function DUF1680 [Odoribacter splanchnicus DSM
20712]
Length = 782
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 187/566 (33%), Positives = 293/566 (51%), Gaps = 51/566 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
+K L DVRL DS A N ++L +D+D+L+ NF K A L GE YG WE S
Sbjct: 40 VKYFGLKDVRL-LDSPFKNAMDRNAAWMLEMDMDRLLSNFLKNAGLEPKGESYGSWE--S 96
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQF 232
+ GH +GHYLSA A +AST +E K+++ +V L +CQ+ +G++ P F
Sbjct: 97 MGIAGHTLGHYLSAVAQQYASTGDERFKQRVDYIVHELDSCQQYFVNGFIGGMPGGDRVF 156
Query: 233 DRLEALI---------PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
+++ I +W P+Y HK + GL D Y A N A ++ + +Y +
Sbjct: 157 KQVKKGIIRSAGFDLNGLWVPWYNEHKTMMGLNDAYLLAGNKTAKKVLVNLADYLVD--- 213
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
V+ + E+ LN E GGMN+ L +++ +T D K+L ++ F + LA D
Sbjct: 214 -VLAGLTDEQVQTMLNCEFGGMNEALAQVYALTGDKKYLDASYRFYHRRLMEPLAEGKDI 272
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSD 392
+ G HSNT IP +IGS +YE+TG+ + H + G + G + S
Sbjct: 273 LPGLHSNTQIPKIIGSARQYELTGNPKDERIAEFFWTTMVNHHSYANGGNSSGEY--LST 330
Query: 393 PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV 452
P +L L +T E+C TYNMLK+SRHL+ WT + Y D+YE++L N +L Q E G+
Sbjct: 331 PDKLNDRLTHSTCETCNTYNMLKLSRHLYEWTGDPKYLDFYEKALYNHILASQH-PETGM 389
Query: 453 MIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
Y +PLA G+ K+ + +SF CC G+G E+ SK G +IY +++
Sbjct: 390 TCYFVPLAMGTRKD-----FCDKYNSFTCCMGSGFENHSKYGGAIYSHGSDD-RSLFVNL 443
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
YI S L WK + KV + RVTL +G +LNLR P W + G
Sbjct: 444 YIPSVLTWKEKGL----KVRLETVYPENGRVTLKV-VEGERQPLALNLRYPVW-AGEGIV 497
Query: 573 ATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+NG + S PG+F+++ + W + D++ + +P+ L T+ + P+ A +A+ YGP
Sbjct: 498 VKVNGTKQKITSKPGSFVTLERKWKAGDRIELNIPMNLYTKEM----PDNADRRAVFYGP 553
Query: 632 YVLAGHSIGDWDITESATSLSDWITP 657
+LAG ++G+ +I E + +++P
Sbjct: 554 TLLAG-ALGEKEI-EPIRGVPVFVSP 577
>gi|116182754|ref|XP_001221226.1| hypothetical protein CHGG_02005 [Chaetomium globosum CBS 148.51]
gi|88186302|gb|EAQ93770.1| hypothetical protein CHGG_02005 [Chaetomium globosum CBS 148.51]
Length = 797
Score = 272 bits (695), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 180/529 (34%), Positives = 272/529 (51%), Gaps = 46/529 (8%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEPSCELRGHFVGHYLSASALMW 193
Q + YL +DV++L++NFR RL G GGW+ P+ R H GHYL+A A +
Sbjct: 48 QNRTVSYLKWVDVNRLLYNFRANHRLSTQGASANGGWDAPNFPFRTHAQGHYLTAWAFCY 107
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGS-----GYLSAFPTEQFDRLEA--LIPVWAPYY 246
AS + +++ + V+ L+ CQK G+ GYLS FP +F LEA L PYY
Sbjct: 108 ASLRDTECRDRAAYFVAELAKCQKNNGAAGFSAGYLSGFPESEFAALEARTLNNGNVPYY 167
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN 306
IHK +AGLLD + + + A + + + +R K S ++ L E GGMN
Sbjct: 168 AIHKTMAGLLDVWRHLGDTNARDVLLALAGWVDSRT----GKLSYQQMQSMLGTEFGGMN 223
Query: 307 DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVT 366
DVL L T+D + L +A FD LA D ++G H+NT +P IG+ + Y+ T
Sbjct: 224 DVLADLHKQTKDERWLKVAQRFDHAAVFDPLAAGRDQLNGLHANTQVPKWIGAALEYKAT 283
Query: 367 GDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKV 416
G +++ G + G N +F+ P +A L +T E+C TYNML++
Sbjct: 284 GSTRYRDIAKNAWELTVGAHTYAIGGNSQAEHFRP-PNAIAGYLQKDTAEACNTYNMLRL 342
Query: 417 SRHLFRW-TKEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSK----ERSYH 470
+R L+ AY D+YER+L N +LG Q + G + Y PL PG +
Sbjct: 343 TRELWPLDAASTAYFDFYERALLNHLLGQQDPASHHGHVTYFTPLNPGGRRGVGPAWGGG 402
Query: 471 HWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK 530
W T DSFWCC GT +E+ +KL DSIYF +E +++ + S L W + + V Q
Sbjct: 403 TWSTDYDSFWCCQGTALETNTKLMDSIYFHDEA---ALFVNLFTPSVLKWAAQNVTVTQA 459
Query: 531 VD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNF 588
D P TLT + G + L +RIP+WT+ A+ ++NG+ + + PG +
Sbjct: 460 TDFPAGD-----TTTLTIGGQ-PGESWDLFVRIPSWTTDQ-AEISVNGEKANIDTKPGTY 512
Query: 589 LSVT-KTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + W + DK+T++LP+TLRT D+ ++ A+ YGP VL+G
Sbjct: 513 AVIQDRAWKAGDKVTVRLPMTLRTVPANDN----PNVAAVAYGPVVLSG 557
>gi|443291943|ref|ZP_21031037.1| Conserved secreted hypothetical protein [Micromonospora lupini str.
Lupac 08]
gi|385885131|emb|CCH19144.1| Conserved secreted hypothetical protein [Micromonospora lupini str.
Lupac 08]
Length = 778
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 182/535 (34%), Positives = 270/535 (50%), Gaps = 58/535 (10%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY-GGWEEPSCELRGHFVGHYLSASALMW 193
Q L YL +DVD++++NFR RL G GGW+ P+ R H GH+L+A A +
Sbjct: 69 QNRTLNYLRFVDVDRMLYNFRANHRLSTNGAATNGGWDAPNFPFRTHMQGHFLTAWAQAY 128
Query: 194 ASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEA--LIPVWAPYY 246
A + + ++K + +V+ L+ CQ G+GYLS FP F LEA L PYY
Sbjct: 129 AVLGDTTCRDKANYMVAELAKCQANNGAAGFGAGYLSGFPESDFSALEARTLSNGNVPYY 188
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN 306
IHK LAGLLD + Y N +A + + + R + S + L E GGMN
Sbjct: 189 CIHKTLAGLLDVWRYTGNTQARTVLLALAGWVDTRT----SRLSSSQMQSMLGTEFGGMN 244
Query: 307 DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVT 366
DVL +++ +T D + L A FD LA D ++G H+NT +P +G+ ++ T
Sbjct: 245 DVLTEIYQMTGDSRWLTTAQRFDHASVFNPLANNQDQLNGLHANTQVPKWVGAAREFKAT 304
Query: 367 GDQLHKEGHQLESSGTNIG---------------HFNFKSDPKRLASNLDSNTEESCTTY 411
G +++ + S+ NI HF P +A L ++T E C TY
Sbjct: 305 GTTRYRD---IASNAWNITVRAHTYVIGGNSQAEHFRA---PNAIAGYLSNDTCEQCNTY 358
Query: 412 NMLKVSRHLFRWT-KEIAYADYYERSLTNGVLGIQRGTE-PGVMIYLLPLAPGSSK---- 465
NMLK++R L+ Y DYYER+ N ++G Q + G + Y PL PG +
Sbjct: 359 NMLKLTRELWLLDPSRTDYFDYYERATINHLIGAQNPADSKGHITYFTPLKPGGRRGVGP 418
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ--YISSRLDWKSG 523
W T +SFWCC GTG+E +KL DSIYF Y G + ++ S L+W
Sbjct: 419 AWGGGTWSTDYNSFWCCQGTGVEINTKLMDSIYF-----YSGTTLTVNLFVPSELNWSQR 473
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
I V Q VS L + T S + S+ +RIP WT NGA ++NG + +
Sbjct: 474 GITVTQSTTYPVSDTTTLTLGGTMSG-----SWSVRVRIPAWT--NGATVSVNGVEQSVA 526
Query: 584 -SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+PG++ +VT+TW++ D +T++LP+ + + D+ +SI A+ YGP VLAG+
Sbjct: 527 TTPGSYATVTRTWAAGDTITVRLPMRVVVQPTNDN----SSIAAVTYGPSVLAGN 577
>gi|379720404|ref|YP_005312535.1| hypothetical protein PM3016_2500 [Paenibacillus mucilaginosus 3016]
gi|378569076|gb|AFC29386.1| hypothetical protein PM3016_2500 [Paenibacillus mucilaginosus 3016]
Length = 749
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 186/543 (34%), Positives = 281/543 (51%), Gaps = 57/543 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
LH VR+ S + A + N YLL L+ D+L+ FR+ A L Y GWE S + G
Sbjct: 8 LHKVRIESGPLK-HAMELNASYLLNLEADRLLSRFREYAGLEPKAPHYEGWE--SRGISG 64
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA 237
H +GHYLS ALM+AST E L +++ VV L CQ+ GSG++S P E F ++A
Sbjct: 65 HTLGHYLSGCALMYASTGREELLSRVNYVVEELQQCQRADGSGFISGIPRGKELFQEVKA 124
Query: 238 ---------LIPVWAPYYTIHKILAGLLDQYTYADNAEAL----RMTTWMVEYFYNRVQN 284
L W P YT+HK+ AGL D Y A + +AL ++ W+ +
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLAGSRKALEIEIKLGLWL--------DD 176
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
V S E+ + L+ E GGMN+VL L + D + L LA F LG +A + D +
Sbjct: 177 VFSGLSHEQVQRVLHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAERKDTL 236
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPK 394
G H+NT IP +IG+ +YEVTG++ + H G N + +F +P
Sbjct: 237 GGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHF-GEPD 295
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI 454
+L L T E+C TYNMLK++RHLF+W AYADYYER++ N +L Q+ + G +
Sbjct: 296 KLNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILASQQPVD-GRVC 354
Query: 455 YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
Y + L G K + + + F CC G+G+ES S G +IYF +++ Q++
Sbjct: 355 YFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSLYGSAIYFHSG---SALFVNQFV 406
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
S ++W+ + + Q+ ++ R L + G T ++ +R P+W + G
Sbjct: 407 PSTVEWEEQGVRLTQE----TAFPENGRGVLRIRTAKPG-TFAVKVRYPSW-AEPGISVK 460
Query: 575 LNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NGQ + + PG +++V + W D L P+TLR E++ D+ P+ A+LYGP V
Sbjct: 461 VNGQAVSADARPGGYVTVEREWQDGDTLEYDFPMTLRIESMPDN-PDRI---ALLYGPLV 516
Query: 634 LAG 636
LAG
Sbjct: 517 LAG 519
>gi|307110572|gb|EFN58808.1| hypothetical protein CHLNCDRAFT_56904 [Chlorella variabilis]
Length = 937
Score = 271 bits (692), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 149/346 (43%), Positives = 196/346 (56%), Gaps = 14/346 (4%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ SL V+L +D +YLL L+ D+L++NFRK A LP PG YGGWE
Sbjct: 26 IQGFSLAVVQLAADGEFADNFNMTSQYLLALEPDRLLFNFRKNAGLPTPGASYGGWEWSE 85
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
E+RG F+GHY+SA A T ++ +V L Q G+GYLSAFP FDR
Sbjct: 86 SEVRGQFIGHYMSAVAFAALHTGRTEFYDRSKLMVHELKKVQDAFGNGYLSAFPESHFDR 145
Query: 235 LEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
LEAL PVWAPYY IHKI+AGLLDQ+ A EAL+M M YF R Q V + +
Sbjct: 146 LEALQPVWAPYYVIHKIMAGLLDQHQLAGTDEALKMAEQMASYFCGRAQRVRENNGEDYW 205
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
++ L E GGMN+VLY LF +T D H AH FDKP F L D + G H+NTH+
Sbjct: 206 YRCLENEFGGMNEVLYNLFAVTADDHHAECAHWFDKPVFYRPLVEGTDPLPGLHANTHLA 265
Query: 355 IVIGSQMRYEVTGDQ-----------LHKEGHQLESSGTN-IGHFNFKSDPKRLASNLDS 402
V G RYE GD+ L + H + G+N + + +N D+
Sbjct: 266 QVQGFAARYEHLGDEEAMAAVRNFFALILQHHTFSTGGSNWYERWGNEDSLAEAINNTDA 325
Query: 403 N--TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
+ TEESCT YN+LK++R+LFR T + A AD+YER++ N V+GIQ+
Sbjct: 326 SRITEESCTQYNILKLARYLFRHTGDPALADFYERAILNDVIGIQK 371
Score = 104 bits (259), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 95/208 (45%), Gaps = 30/208 (14%)
Query: 450 PGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVY 509
PGV IY LPL G K +WGTP D+FWCCYGT +ESFS L SIYF+ PG
Sbjct: 456 PGVYIYYLPLGVGHDK-----NWGTPWDTFWCCYGTAVESFSSLAGSIYFKH---MPGTA 507
Query: 510 IIQYISSRLDWKS-GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS 568
S + Q+ VNQ V V W L V + + LN R+P W
Sbjct: 508 PSASSSGPTAAEDLPQLFVNQMVSSSVHWR-ELGVEGSANGDKPQAQFVLNWRVPGWAKG 566
Query: 569 NGAKATLNGQD---------------LPLPSP-----GNFLSVTKTWSSDDKLTIQLPLT 608
+ +NG++ L P F S+ TWS D + +P+
Sbjct: 567 DEVMLRVNGKEYLECAQGAAAAAHDALGFQPPQFGAGARFCSLGSTWSDGDVVEADMPMW 626
Query: 609 LRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ TE + D R S++AI+ GP+V+AG
Sbjct: 627 VVTEDLNDSRKAMQSLKAIMMGPFVMAG 654
>gi|21231831|ref|NP_637748.1| hypothetical protein XCC2394 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66768042|ref|YP_242804.1| hypothetical protein XC_1718 [Xanthomonas campestris pv. campestris
str. 8004]
gi|21113547|gb|AAM41672.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66573374|gb|AAY48784.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 791
Score = 271 bits (692), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 193/568 (33%), Positives = 280/568 (49%), Gaps = 65/568 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 IRAVPLAQVRL-MPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT----- 229
+ GH +GHYLSA ALM A T + + + S +V+ L+ CQ G GY++ F
Sbjct: 108 --IAGHTLGHYLSALALMHAQTDDAQCRTRASYLVAELARCQAHAGDGYVAGFTRKNAAG 165
Query: 230 ------EQFDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
E FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 QIESGREVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVGL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y V +V+ +++ L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGYL-QAVFSVLDDAQLQK---VLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P +A L T E C++YNMLK++RHL++W + AY DYYER+L N V+
Sbjct: 341 GDREYFQQPDSIARFLTEQTCEHCSSYNMLKLTRHLYQWGPQAAYFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ P+ G ++ W +P D FWCC G+G+E+ ++ GDSIY+E+
Sbjct: 400 QQHPRTGMFTYMTPMLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWEDG-- 452
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GV I Y+ SR+ +G + P V+L + + T L+LR+P
Sbjct: 453 -QGVAINLYVPSRVRNAAGLDMTLHSALPAQG-----SVSLRIDAAPAAQRT-LSLRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W ++ + LNG + + +L VT+ W D L + L + LR EA DD P + S
Sbjct: 506 WAAAPVLQ--LNGAVVDAAAVDGYLRVTRIWHPGDTLNLSLQMPLRLEATPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLS 652
+L GP VLA D+ ++AT S
Sbjct: 562 --VLRGPLVLAA------DLGDAATPWS 581
>gi|374991816|ref|YP_004967311.1| secreted protein [Streptomyces bingchenggensis BCW-1]
gi|297162468|gb|ADI12180.1| secreted protein [Streptomyces bingchenggensis BCW-1]
Length = 858
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 180/529 (34%), Positives = 267/529 (50%), Gaps = 46/529 (8%)
Query: 136 QTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWAS 195
+ L YL +D ++L+ FR +LP+ +P GGWE P+ LRGH GH LSA A A
Sbjct: 75 RRTLAYLRFVDPERLLHTFRLNVQLPSTAQPCGGWEAPNVLLRGHSTGHLLSALAFAHAH 134
Query: 196 THNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEALIPVWAPYYTIHK 250
T ++ +K +V+AL+ CQ +GYLSAFP FD LEA WAPYYTIHK
Sbjct: 135 TGEQTYADKARGIVAALAECQAASPGAGYRTGYLSAFPERIFDELEAGGKPWAPYYTIHK 194
Query: 251 ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLY 310
I+AGLLDQ+ + N +AL + M + +R + + +++R L E GGMN+VL
Sbjct: 195 IMAGLLDQHRLSGNDQALEVLRGMAAWVDSRTAP-LDEATMQR---LLGVEFGGMNEVLA 250
Query: 311 KLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQL 370
L+ +T DP HL A FD G L D++ G H+NT I ++G+ Y TGD
Sbjct: 251 GLYLVTGDPVHLRTARRFDHQSLYGPLDEGRDELDGRHANTEIAKIVGAAEEYRATGDPR 310
Query: 371 H-----------KEGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRH 419
+ H G + + F P ++ S L +T E+C +YNMLK+ R
Sbjct: 311 YLRIARNFWDIVVRDHSYVIGGNS--NQEFFGPPGQIVSRLSEDTCENCNSYNMLKIGRQ 368
Query: 420 LF-RWTKEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSKERSYHHWGTPS- 476
LF AY D+YE +L N +LG Q ++ G + Y L GS ++ P
Sbjct: 369 LFLHEPGRAAYMDHYEWTLYNQMLGEQDPDSDHGFVTYYTGLWAGSRRQPKGGLGSAPGS 428
Query: 477 -----DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW-KSGQIVVNQK 530
D+F C +GTG+E+ +K D+IYF +E +Y+ +I S + W + G +V +
Sbjct: 429 YSGDYDNFSCDHGTGMETHTKFADTIYFRDE-HAGALYVNLFIPSEVTWAERGFRLVQRS 487
Query: 531 VDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL---PSPGN 587
P V LT + G L +L +R+P W + G +A + P+ P PG
Sbjct: 488 GYPDTD-----TVRLTVAEGGGRL--ALKVRVPGWLADAGPRARVLVAGRPVDATPVPGR 540
Query: 588 FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+L++ + W + D + + P E + P+ I+A+ YGP VLAG
Sbjct: 541 YLTLDRRWRTGDTVELTFP----RELVWRPAPDNPHIKAVSYGPLVLAG 585
>gi|408393860|gb|EKJ73118.1| hypothetical protein FPSE_06731 [Fusarium pseudograminearum CS3096]
Length = 623
Score = 270 bits (690), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 193/565 (34%), Positives = 285/565 (50%), Gaps = 66/565 (11%)
Query: 106 KVPERSGEF-LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG 164
KV + + F L +VSL D R + Q + YLL +D D+L++ FRK L G
Sbjct: 26 KVSDLADAFELSDVSLTDSRWMDN------QGRTVNYLLSIDPDRLLYVFRKNHGLDTKG 79
Query: 165 EPY-GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQK---EIG 220
GGW+ P R H GH+LSA + +A+ N+ + S V L+ CQ ++G
Sbjct: 80 AAKNGGWDAPDFPFRSHVQGHFLSAWSNCYATLGNKECGSRASYFVKELAKCQANNAKVG 139
Query: 221 --SGYLSAFPTEQFDRLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTT 272
SGYLS FP + ++E L PYY IHK LAGLLD Y + +A L + +
Sbjct: 140 FTSGYLSGFPESEITKVEDRTLSSGNVPYYAIHKTLAGLLDVYRRVGDNDAKTVMLSLAS 199
Query: 273 WMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPC 332
W V K S + Q + E GGMN+VL + TQD K L +A FD
Sbjct: 200 W--------VDARTGKLSYAKMQQMMQTEFGGMNEVLADIAYYTQDNKWLKVAQRFDHAA 251
Query: 333 FLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQ-------------LHKEGHQLES 379
L D +SG H+NT +P IG+ Y+V+GD+ +HK + +
Sbjct: 252 IFDPLQNNVDKLSGLHANTQVPKWIGALREYKVSGDKKYLDIGRNAWDLTVHKHTYAI-- 309
Query: 380 SGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWT-KEIAYADYYERSLT 438
G N +F+ +P +A L +T E+C TYNMLK++R L+ + +Y DYYE +L
Sbjct: 310 -GGNSQAEHFR-EPNAIAKYLTKDTCEACNTYNMLKLTRELWALNPTDASYFDYYENALM 367
Query: 439 NGVLGIQRGTEP-GVMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKL 493
N +LG Q + G + Y PL PG + W T +SFWCC G+GIE+ +KL
Sbjct: 368 NHLLGQQNPKDSHGHVTYFTPLTPGGRRGVGPAWGGGTWSTDYNSFWCCQGSGIETNTKL 427
Query: 494 GDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSG 553
DSIYF + +Y+ + S+L+W Q V + + + + + T G
Sbjct: 428 MDSIYFHTKDT---LYVNLFTPSKLNWSQ------QGVSIIQTTEYPQKDSSTLQIGGKA 478
Query: 554 LTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTE 612
T +L +RIP+WTS A +NGQ + + +PG + VT+ W+S DK+TI LP++LRT
Sbjct: 479 GTWTLAVRIPSWTSK--ASIQVNGQSVNVNTTPGKYALVTRNWNSGDKVTITLPMSLRTI 536
Query: 613 AIQDDRPEYASIQAILYGPYVLAGH 637
A D+ + + A+ +GP +LA +
Sbjct: 537 AANDN----SQVAAVAFGPVILAAN 557
>gi|326203856|ref|ZP_08193718.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
gi|325985954|gb|EGD46788.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
Length = 854
Score = 270 bits (689), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 180/561 (32%), Positives = 282/561 (50%), Gaps = 45/561 (8%)
Query: 107 VPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEP 166
V S + L+ + V + +D+ A + YL +D ++L+ +R+TA L
Sbjct: 30 VSAESVDKLQPFDMEQVNI-TDTYLANAFNKEISYLQSIDPNRLLVGYRQTAGLSTSYSK 88
Query: 167 YGGWEEPSCELRGHFVGHYLSASALMWASTH-----NESLKEKMSAVVSALSACQKEIGS 221
YGGWE + L+GH +GHY+SA A + +T N +K+++ ++S L CQ + G
Sbjct: 89 YGGWE--NTPLKGHTLGHYMSALAQAYKNTKSNATVNADMKKRIDLIISELQQCQNKRGD 146
Query: 222 GYLSAFPTEQFDRLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFY 279
GY+ A EQF+ +E A +WAP+YT+HKI++GL+ Y N AL + + + ++ Y
Sbjct: 147 GYIYAETPEQFNVVEGKATGTLWAPWYTMHKIMSGLISIYELEGNPTALTVASKLGDWIY 206
Query: 280 NRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
NRV + + L E GGMND L +L+ +T HL A F++P L +A
Sbjct: 207 NRVN----AWDSATQAKVLGVEYGGMNDCLIELYKLTGKSNHLAAAKKFEEPSLLNTIAS 262
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQ-------LHKEGHQLESSGTNIGHFNFKSD 392
+ ++G H+NT IP IG+ RY G + + + T + N + +
Sbjct: 263 GNNVLAGKHANTTIPKFIGAINRYRTLGTSEASYLTAAQQFWNMVIRDHTYVTGGNSQWE 322
Query: 393 PKRLASNLDSNTE----ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGT 448
R A LD + E+C +YNMLK++R LF+ T ++ YAD+YERS N +L Q
Sbjct: 323 AFRAAGKLDQYRDEVNNETCNSYNMLKLTRELFQVTGDVKYADFYERSFINEILASQN-P 381
Query: 449 EPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGV 508
E G+ Y P+ G K S P D+FWCC GTG+E+F+KL DSIYF +
Sbjct: 382 ETGMTTYFKPMGTGYFKVFS-----KPFDNFWCCTGTGMENFTKLNDSIYFNNGSD---L 433
Query: 509 YIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS 568
Y+ YISS L+W + + QK D +S VT T S S + R P W ++
Sbjct: 434 YVNMYISSTLNWSEKGLSLTQKADVPLS----DTVTFTIDSAPSS-EVKIKFRSPYWVAA 488
Query: 569 N-GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAI 627
+ +NG + +L V++ W DKL + +P ++ D++ ++ A
Sbjct: 489 DKKVTVKVNGSSVNASVVNGYLDVSRVWKVGDKLELTIPAEVQISRCTDNQ----NVAAF 544
Query: 628 LYGPYVLAGHSIGDWDITESA 648
YGP VL +G+ +T S+
Sbjct: 545 TYGPVVLCA-GLGNESMTTSS 564
>gi|339021543|ref|ZP_08645591.1| hypothetical protein ATPR_1899 [Acetobacter tropicalis NBRC 101654]
gi|338751393|dbj|GAA08895.1| hypothetical protein ATPR_1899 [Acetobacter tropicalis NBRC 101654]
Length = 799
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 196/604 (32%), Positives = 293/604 (48%), Gaps = 71/604 (11%)
Query: 111 SGEFLKEVSLHDVRLGSDSMHW-RAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGG 169
+GE + V L DVRL HW A ++N YLL L D+L+ NFR+ A LP GE YGG
Sbjct: 40 AGESVTPVPLQDVRLLPS--HWLDAVESNRAYLLSLSADRLLHNFRRQAGLPPKGEVYGG 97
Query: 170 WEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT 229
WE + + GH +GHYLSA ALM+A T + + +++ +V L+ Q + G GY++ F
Sbjct: 98 WENDT--IAGHTLGHYLSALALMYAQTGDTECRRRVAYIVQELAIVQDKWGDGYVAGFTR 155
Query: 230 EQ-----------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALR 269
++ F +E L W+P Y IHK AGL D TY + AL
Sbjct: 156 KEKDGTITDGKVIFAEMEKGDIRSGGFDLNGAWSPLYNIHKTFAGLFDAQTYCQDPNALA 215
Query: 270 MTTWM---VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA- 325
+ + E FY+++ + + + L E GG+N+ +L T D K L LA
Sbjct: 216 VAVKLGGFFEAFYSKLTDAQLQ-------KVLTCEYGGLNESFAELAARTGDAKWLRLAK 268
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------H 375
+D+P L+A + DD++ H+NT IP +IG EV+ D + G H
Sbjct: 269 RTYDRPVLDPLMA-RHDDLANRHANTQIPKLIGLGRIAEVSRDAHWQVGPRFFWQAVTQH 327
Query: 376 QLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
G N F S+P ++ ++ T E C TYNMLK++R L+ W + A DYYER
Sbjct: 328 HSYVIGGNADREYF-SEPDTISQHITEQTCEHCNTYNMLKLTRQLYTWQPDSALFDYYER 386
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+ N VL + G+ Y+ P +E W TP+DSFWCC GTG+ES +K G+
Sbjct: 387 AHLNHVLAAH-DPQTGMFTYMTPTITAGVRE-----WSTPTDSFWCCVGTGMESHAKHGE 440
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPY-LRVTLTFSSKGSGL 554
SI++E +++ YI SR+ W + K PY +VTL +
Sbjct: 441 SIWWEGAET---LFVNLYIPSRVQWARKNVSWRMKTR-----YPYDGQVTLKVEDVKAPE 492
Query: 555 TTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
+L LR+P W + T+NGQ + G +L + +TW + D + + LPL LRTEA
Sbjct: 493 PFALALRVPGWVKGD-LSLTVNGQSVSATPSGGYLMLNRTWHAGDTVALTLPLALRTEAP 551
Query: 615 QDDRPEYASIQAILYGPYVLAGH---SIGDWDITESATSLSDWITPIPASYNSQLITFTQ 671
E + ++L+GP VLA + +D + A SD + + + + T
Sbjct: 552 V----EAPHLVSLLHGPMVLAADLASAEAPYDAMDPALVTSDVVRDLAPVAGQEAVYRTT 607
Query: 672 EYGN 675
+ G
Sbjct: 608 QAGR 611
>gi|350267868|ref|YP_004879175.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus subtilis
subsp. spizizenii TU-B-10]
gi|349600755|gb|AEP88543.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus subtilis
subsp. spizizenii TU-B-10]
Length = 761
Score = 269 bits (688), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 177/540 (32%), Positives = 281/540 (52%), Gaps = 54/540 (10%)
Query: 130 MHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEP-YGGWEEPSCELRGHFVGHYLSA 188
M + +Q EYLL LDVD+L+ + A L P +P YGGWE + E+ GH +GH+LSA
Sbjct: 10 MFYDSQMKGKEYLLFLDVDRLLAPCYE-AVLQTPKKPRYGGWE--AKEIAGHSIGHWLSA 66
Query: 189 SALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD-------RLE--ALI 239
++ M+ ++ +E LK K V+ LS Q+ GY+S F FD R++ +L
Sbjct: 67 ASAMYQASGDEELKRKAEYAVNELSHIQQFDEEGYVSGFSRACFDEVFSGDFRVDHFSLG 126
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
W P+Y+IHK+ AGL+D Y N ALR+ + ++ + + + + E+ + L
Sbjct: 127 GSWVPWYSIHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRMLI 182
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGS 359
E GGMN+ + LF +T++ +L LA F L LA D++ G H+NT IP VIG+
Sbjct: 183 CEHGGMNEAMADLFMLTKNKAYLELAERFCHRAILQPLAEGKDELEGKHANTQIPKVIGA 242
Query: 360 QMRYEVTGDQLHKEG-----------HQLESSGTNIG-HFNFKSDPKRLASNLDSNTEES 407
Y++TG++ ++ G +IG HF + + L T E+
Sbjct: 243 AKLYDITGNEAYRNAALFFWEQVVYQRSYAIGGNSIGEHFGAEG-----SEELGVTTAET 297
Query: 408 CTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKER 467
C TYNMLK++ HLFRW E + DYYE +L N +L Q + G+ Y + PG K
Sbjct: 298 CNTYNMLKLTGHLFRWFHEARFMDYYENALYNHILASQ-DPDSGMKTYFVSTQPGHFKV- 355
Query: 468 SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVV 527
+ +P DSFWCC GTG+E+ ++ IY ++ +Y+ +I S+++ + Q+++
Sbjct: 356 ----YCSPEDSFWCCTGTGMENPARYTQHIYDIDQDD---LYVNLFIPSQINMQEKQLII 408
Query: 528 NQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGN 587
Q+ P T K G+ +L++RIP WT+ G KA +NG+ +
Sbjct: 409 TQETSF-----PAAEKTRLVVKKADGVPMTLHIRIPYWTNG-GLKAAVNGKRIQSVEKNG 462
Query: 588 FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITES 647
+L + K W++ D + I LP+ L +DD + ++YGP VLAG ++G D E+
Sbjct: 463 YLVIHKHWNTGDCIEIDLPMKLHIYQAKDDPKK----SVLMYGPVVLAG-ALGREDFPET 517
>gi|268316049|ref|YP_003289768.1| hypothetical protein Rmar_0478 [Rhodothermus marinus DSM 4252]
gi|262333583|gb|ACY47380.1| protein of unknown function DUF1680 [Rhodothermus marinus DSM 4252]
Length = 641
Score = 269 bits (687), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 191/558 (34%), Positives = 284/558 (50%), Gaps = 63/558 (11%)
Query: 110 RSGEFLKEVSL--HDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY 167
RS E L+ + VRL DS A Q ++ YL LD D+L+ FR+ A L Y
Sbjct: 31 RSRERLRAFAFPPRAVRL-LDSPFLEAMQRDVAYLFELDPDRLLSRFRRFAGLEPKAPEY 89
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF 227
GGWE S + GH +GHYLSA ++ +A+T +E + ++ +VS L+ Q+ G+GY+ A
Sbjct: 90 GGWE--SQGISGHTLGHYLSALSMYYAATGDEKARARIDYIVSELAEVQRAHGNGYVGAI 147
Query: 228 PTEQFDRLEALIP--------------VWAPYYTIHKILAGLLDQYTYADNAEALRMTTW 273
P + DRL A I W P+YT+HKI GL+D Y Y + +AL + T
Sbjct: 148 P--EGDRLWAEIARGEIWQAEPFSLNGAWVPWYTMHKIFQGLIDAYWYGGSEQALEVVTR 205
Query: 274 MVEYFYNRVQNVIKKYSIERHWQ-TLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPC 332
+ ++ Y +N+ WQ L E GGMN+ L L+ IT +PKH L+ F
Sbjct: 206 LADWAYETTKNLTPA-----QWQQMLRTEHGGMNEALANLYSITGNPKHRELSEKFYHAA 260
Query: 333 FLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTG-DQLHKEG---------HQLESSGT 382
L L+ +++G H+NT IP VIG +YE+ G D L H G
Sbjct: 261 VLSPLSRGIPNLTGLHANTQIPKVIGVVRQYELIGSDSLRAVAEFFWEEVVQHHTYVIGG 320
Query: 383 NIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE-IAYADYYERSLTNGV 441
N + +F LA+ L T E+C TYNML+++RHLF E + Y D+YER+L N +
Sbjct: 321 NSQNEHFGPR-DSLANRLGEGTAETCNTYNMLRLTRHLFALHPEKVRYVDFYERALYNHI 379
Query: 442 LGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
L Q + G+ Y + L PG K + TP SFWCC GTG+E+ K + IYF
Sbjct: 380 LASQ-DPKRGMFTYYMSLRPGHFKT-----YATPEHSFWCCVGTGMENHVKYNEFIYF-- 431
Query: 502 EGKYPG--VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
Y G +Y+ +I S L+W+ + + + ++ RV L F + +
Sbjct: 432 ---YNGDTLYVNLFIPSELNWERRALRLRLE----TAFPESNRVRLDFDPEVPQRLV-VK 483
Query: 560 LRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
+R P+W + + +NG+ + S PG++L++ + W D++ I LP+ LR E + D+
Sbjct: 484 VRHPSW-AQDALDVRINGEVQSVTSRPGSYLTLARVWQPGDEVEITLPMRLRVETMPDNP 542
Query: 619 PEYASIQAILYGPYVLAG 636
+ AILYGP VLAG
Sbjct: 543 DRF----AILYGPIVLAG 556
>gi|383779461|ref|YP_005464027.1| hypothetical protein AMIS_42910 [Actinoplanes missouriensis 431]
gi|381372693|dbj|BAL89511.1| hypothetical protein AMIS_42910 [Actinoplanes missouriensis 431]
Length = 777
Score = 269 bits (687), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 180/531 (33%), Positives = 272/531 (51%), Gaps = 52/531 (9%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY-GGWEEPSCELRGHFVGHYLSASALMW 193
Q + YL +DV+++++ FR RL G GGW+ P+ R H GH+L+A A +
Sbjct: 70 QNRTMNYLRFVDVNRMLYVFRANHRLSTAGAAANGGWDAPNFPFRSHMQGHFLTAWAQAY 129
Query: 194 ASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEALIPVWAPYYTI 248
A T + + ++K +V+ L+ CQ +GYLS FP D +E+ P+ YY I
Sbjct: 130 AYTGDTTCRDKADYMVAELAKCQANNAVAGFNAGYLSGFPESDLDAVESGKPIAVSYYCI 189
Query: 249 HKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGG 304
HK LAGLLD + N +A L++ W V++ R+ S + TL E GG
Sbjct: 190 HKTLAGLLDVWRLIGNTQAKDVLLKLAGW-VDWRTGRL-------SYSQMQTTLQTEFGG 241
Query: 305 MNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYE 364
MN+VL L+ T D + L +A FD LA D+++G H+NT+IP +G+ ++
Sbjct: 242 MNEVLANLYQQTGDARWLRVAQRFDHAAIFDPLAANRDELNGKHANTNIPKWVGAIREFK 301
Query: 365 VTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNML 414
TG +++ G + G N +FK+ P +A L ++T E C TYNML
Sbjct: 302 ATGTTRYRDIAGNAWNITVGAHTYAIGGNSQAEHFKA-PNAIAGYLTNDTCEQCNTYNML 360
Query: 415 KVSRHLFRWTKEIA-YADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----ERS 468
K++R L++ A Y D+YE +L N ++G Q + G + Y PL G +
Sbjct: 361 KLTRELWQLDPNRAGYFDFYENALYNHLIGAQNPADSHGHITYFTPLKAGGRRGVGPAWG 420
Query: 469 YHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVN 528
W T +SFWCC GTGIE+ +KL DSIYF + + Y+ S L+W + V
Sbjct: 421 GGTWSTDYNSFWCCQGTGIETNTKLMDSIYFRGGTT---LTVNLYVPSTLNWSERGLTVT 477
Query: 529 QKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP-SPG 586
Q PV T T S SG + + RIP W + GA +NG + + +PG
Sbjct: 478 QTTAYPVGD-----TSTFTLSGSVSG-SWGIRFRIPAWAA--GATIAVNGANQNITVTPG 529
Query: 587 NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
++ +VT+TW+ D +T++LP+ + +A D+ A IQAI YGP VLAG+
Sbjct: 530 SYATVTRTWADGDTITVRLPMRVIIKAANDN----ADIQAITYGPSVLAGN 576
>gi|427384240|ref|ZP_18880745.1| hypothetical protein HMPREF9447_01778 [Bacteroides oleiciplenus YIT
12058]
gi|425727501|gb|EKU90360.1| hypothetical protein HMPREF9447_01778 [Bacteroides oleiciplenus YIT
12058]
Length = 777
Score = 268 bits (685), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 184/573 (32%), Positives = 292/573 (50%), Gaps = 56/573 (9%)
Query: 116 KEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC 175
K + DVRL +S A N +++ LD+D+L+ NFRK A L EPY WE S
Sbjct: 37 KYFGIQDVRL-LESPFLHAMNQNEQWMKELDLDRLLSNFRKNANLRPKAEPYDSWE--SM 93
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFD 233
+ GH +GH L+A + +A+T +E+ K K+ VV+ L +CQ +G++ P + F
Sbjct: 94 GIAGHTLGHLLTAMSQHYAATGDETFKTKIDYVVNELDSCQMNFVNGFIGGMPGGDKVFK 153
Query: 234 RLEALI---------PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
++ I +W P+Y HK + GL D Y A N A ++ + +Y + +
Sbjct: 154 EVKKGIIRSMGFDLNGIWVPWYNEHKTMMGLNDAYLLAGNETAKKVLINLSDY----LAD 209
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
VI + E+ LN E GGMN+ +++ +T D K+L ++ F LA D +
Sbjct: 210 VIAPLNEEQMQTMLNCEYGGMNEAFAQVYALTGDEKYLDASYAFYHKRLQDKLAEGIDAL 269
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDP 393
G HSNT IP +IGS +YE+TG+Q ++ H + G ++G + S P
Sbjct: 270 QGLHSNTQIPKLIGSARQYELTGNQRDEKIARFSWETIVLHHSYANGGNSMGEY--LSVP 327
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
+L+ L SNT E+C TYNMLK++ HL+ WT ++ Y DYYER+L N +L Q E G +
Sbjct: 328 DKLSDRLGSNTCETCNTYNMLKLTGHLYEWTNDVQYLDYYERALYNHILASQH-PETGNV 386
Query: 454 IYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y L L G+ K +G+ ++F CC G+G E+ SK G +IY GK + I Y
Sbjct: 387 CYFLSLGMGTHK-----GFGSRHNNFSCCMGSGFENHSKYGGTIYSYVPGK-EMININLY 440
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
I S L WK + + D + + ++ + S + ++NLR P W + +
Sbjct: 441 IPSVLTWKEKSLKLRMTTD----YPEHGKIVIKLEET-SKQSLTINLRRPAWATGD-VVV 494
Query: 574 TLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
+NG + +PG+F+S+ W +D + + LP+ L T ++ P+ A +A+ YGP
Sbjct: 495 RINGSKQKVGNTPGSFISLHHRWKKNDVIELILPMPLYTVSM----PDNADRRAVFYGPT 550
Query: 633 VLAG------HSIGDWDI-TESATSLSDWITPI 658
+LAG +GD + SL+++I I
Sbjct: 551 ILAGTFGTEKRKMGDIPVFVSEEKSLTNYIKKI 583
>gi|300785876|ref|YP_003766167.1| hypothetical protein AMED_3987 [Amycolatopsis mediterranei U32]
gi|384149186|ref|YP_005532002.1| hypothetical protein RAM_20325 [Amycolatopsis mediterranei S699]
gi|399537759|ref|YP_006550421.1| hypothetical protein AMES_3940 [Amycolatopsis mediterranei S699]
gi|299795390|gb|ADJ45765.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340527340|gb|AEK42545.1| hypothetical protein RAM_20325 [Amycolatopsis mediterranei S699]
gi|398318529|gb|AFO77476.1| hypothetical protein AMES_3940 [Amycolatopsis mediterranei S699]
Length = 775
Score = 268 bits (685), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 192/552 (34%), Positives = 285/552 (51%), Gaps = 57/552 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLE-YLLMLDVDKLVWNFRKTARLPAPGEPY-GGWEEPSCEL 177
L VRL + W Q + YL +DV++L++ FR RL G GGW+ PS
Sbjct: 57 LGQVRL--TASRWLDNQNRTQNYLRFVDVNRLLYVFRANHRLSTGGAATNGGWDAPSFPF 114
Query: 178 RGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS-----GYLSAFPTEQF 232
R H GH+L+A A +WA T + + ++K + +V+ L+ CQ G+ GYLS FP F
Sbjct: 115 RSHVQGHFLTAWAQLWAVTGDTTSRDKATTMVAELAKCQANNGAAGFSAGYLSGFPEADF 174
Query: 233 DRLEA--LIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVI 286
D LEA L PYY IHK +AGLLD + Y + +A L + W V
Sbjct: 175 DNLEAGRLSNGNVPYYCIHKTMAGLLDVWRYIGSTQARDVLLNLAGW--------VDRRT 226
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
+ S + LN E GGMNDVL L+ T D + L A FD LA D ++G
Sbjct: 227 ARLSTSQLQSVLNTEFGGMNDVLADLYQYTGDARWLTAAQRFDHAAVFDPLAANRDQLNG 286
Query: 347 FHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRL 396
H+NT +P IG+ Y+ TG +++ G + G N +F++ P +
Sbjct: 287 LHANTQVPKWIGAAREYKATGTTRYRDIATNAWNITVGAHTYAIGGNSQAEHFRA-PNAI 345
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIA-YADYYERSLTNGVLGIQRGTEP-GVMI 454
A+ L+ +T ESC TYNMLK++R L + A ADYYER+L N ++G Q + G +
Sbjct: 346 AAYLNQDTCESCNTYNMLKLTRELIALYPDRADLADYYERALLNQMIGQQNPADSHGHIT 405
Query: 455 YLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
Y L PG + W T DSFWCC GTG+E+ +KL DSIYF + + +
Sbjct: 406 YFSSLNPGGRRGLGPAWGGGTWSTDYDSFWCCQGTGLETQTKLADSIYFYNDTT---LTV 462
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
++ S L W I V Q S+ TLT + SG T ++ +RIP WT+ G
Sbjct: 463 NLFLPSVLTWTQRGITVTQ----TTSFPASDTSTLTVTGSVSG-TWAMRIRIPGWTT--G 515
Query: 571 AKATLNG--QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAIL 628
A ++NG Q++ +PG++ +++++W+S D +T++LP+ + +A + A++ A+
Sbjct: 516 ATISVNGVAQNVAT-TPGSYATLSRSWASGDAVTVRLPMKVALKAAN----DNANVAAVT 570
Query: 629 YGPYVLAGHSIG 640
YGP VLAG+ G
Sbjct: 571 YGPVVLAGNYSG 582
>gi|376260753|ref|YP_005147473.1| hypothetical protein [Clostridium sp. BNL1100]
gi|373944747|gb|AEY65668.1| hypothetical protein Clo1100_1435 [Clostridium sp. BNL1100]
Length = 743
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 178/543 (32%), Positives = 279/543 (51%), Gaps = 49/543 (9%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A + +EYL D DKL+ F KT L + Y GWE+ E+RGH +GHYL+A A +
Sbjct: 14 AFKKEIEYLESFDCDKLLSCFYKTKGLAPKAKNYHGWED--TEIRGHTMGHYLTALAQAY 71
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILA 253
++T++ + E++ ++ LS CQ E SGYLSAFP E FDR+E PVW P+YT+HKI+
Sbjct: 72 SATNDSKIYERLQYLLKELSLCQFE--SGYLSAFPEEFFDRVENRKPVWVPWYTMHKIIT 129
Query: 254 GLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLF 313
GL+ Y AL + + + ++ ++R K++ E H L E GGMND LY+L+
Sbjct: 130 GLISVYKLTKIETALNIVSGLGDWVFSRTD----KWTPEIHANVLAVEYGGMNDCLYELY 185
Query: 314 CITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQ---- 369
IT + KH AH+FD+ + D ++ H+NT IP +G+ R+ G++
Sbjct: 186 KITGNEKHSAAAHMFDEIELFKEIHDGKDILNNRHANTTIPKFLGALNRFLAIGEEEQFY 245
Query: 370 ---------LHKEGHQLESSG-TNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRH 419
+ H + G + HF +P L + S E+C TYNMLK++R
Sbjct: 246 LDTCKEFWSIVTNNHSYVTGGNSEWEHF---GEPNILDAERTSTNCETCNTYNMLKMTRV 302
Query: 420 LFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSF 479
LF+ T + YAD+YE + N +L Q + G+ +Y P+A G K S P + F
Sbjct: 303 LFKITGDKKYADFYENTFINAILSSQ-NPDTGMTMYFQPMATGYFKVYS-----KPFEHF 356
Query: 480 WCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDP 539
WCC GTG+E+F+KL +SIYF EE + +Y+ Y S+ L+W+ + + Q D + D
Sbjct: 357 WCCTGTGMENFTKLNNSIYFHEEDR---LYVNMYYSTLLNWEEKCVRITQNSD-IPGTD- 411
Query: 540 YLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDD 599
R + ++ T L LRIPTW + +N + + +TW +D
Sbjct: 412 --RASFIIEAETETEFT-LCLRIPTW--AKDVNINVNKNPSLFTEERGYALINRTWKDND 466
Query: 600 KLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIP 659
T+++ + E + P+ + A YGP VL+ +G + +S T + + IP
Sbjct: 467 --TVEINFKIEPELVS--LPDNPNAVAFTYGPVVLSA-GLGTDKMEKSTTGI---MVRIP 518
Query: 660 ASY 662
+ +
Sbjct: 519 SKH 521
>gi|407923357|gb|EKG16430.1| Six-hairpin glycosidase-like protein [Macrophomina phaseolina MS6]
Length = 612
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 183/558 (32%), Positives = 278/558 (49%), Gaps = 52/558 (9%)
Query: 109 ERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY- 167
E +G + VRL SD Q+ YL +D+D+L++N+R T L G
Sbjct: 18 EEAGVLAYPFDISQVRL-SDGRWQENQERTRTYLKFVDLDRLLYNYRATHGLSTNGAASN 76
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSG 222
GGW+ P R H GH+L+A W++T + +++ + L CQ+ +G
Sbjct: 77 GGWDAPDFPFRSHAQGHFLTAWVQCWSTTGDTECRDRAVQFTAELLKCQENNEAAGFTAG 136
Query: 223 YLSAFPTEQFDRLEA--LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYN 280
YLS FP +FD LE L PYY +HK++AGLLD + + A + + +
Sbjct: 137 YLSGFPESEFDALEGRTLSNGNVPYYVVHKLMAGLLDVWRGIGDLTARDVLLALAGWVDA 196
Query: 281 RVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQ 340
R +N I ++R QT E GGM++VL ++ + D + L +A F+ L LA
Sbjct: 197 RTEN-ISYGDMQRILQT---EFGGMSEVLADIYYQSGDSRWLTVAQRFEHAAVLTPLANN 252
Query: 341 ADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIG-HFN 388
D ++G H+NT +P IG+ Y+ TG+ + + H G + HF
Sbjct: 253 RDQLNGLHANTQVPKWIGAAREYKATGNTTYYDIARNAWDITVRAHTYAIGGNSQAEHFR 312
Query: 389 FKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE---IAYADYYERSLTNGVLGIQ 445
P +A L ++T ESC +YNMLK++R L WT E AY DYYER+L N ++G Q
Sbjct: 313 ---PPNAIAGYLTADTAESCNSYNMLKLTREL--WTTEPSSSAYFDYYERTLMNHLVGQQ 367
Query: 446 RGTEP-GVMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFE 500
+P G + Y L PG + W T DSFWCC GTG+E+ +KL DSIYF
Sbjct: 368 DPEDPHGHVTYFNSLQPGGVRGVGPAWGGGTWSTDYDSFWCCQGTGVETNTKLMDSIYF- 426
Query: 501 EEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNL 560
+G +Y+ + S LDW+ + V Q V+ + L+V G+ + +
Sbjct: 427 RDGDSSALYVNLFAPSVLDWRQRAVTVTQTTSFPVTDNTTLQV------AGAAGAWDMAI 480
Query: 561 RIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
RIP WTS GA+ +NG+ + + PG + ++++ W+S D +T+ LP+ R DD
Sbjct: 481 RIPDWTS--GAEILVNGESANVAAEPGTYATISRDWASGDTVTVTLPMGFRLVPANDD-- 536
Query: 620 EYASIQAILYGPYVLAGH 637
SI A+ YGP +L G+
Sbjct: 537 --TSIAALAYGPVILCGN 552
>gi|429858822|gb|ELA33628.1| secreted protein [Colletotrichum gloeosporioides Nara gc5]
Length = 623
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 185/558 (33%), Positives = 278/558 (49%), Gaps = 74/558 (13%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAP-GEPYGGWEEP 173
+ +V+L RL + Q L YL +DV++L++NFRK L + GGW+ P
Sbjct: 44 MSQVTLSSGRLFDN------QARTLTYLKWVDVERLLYNFRKNHGLSTNNAQANGGWDAP 97
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFP 228
R HF GH+L+A A +A H+ K++ + + L CQ +GYLS FP
Sbjct: 98 DFPFRTHFQGHFLNAWAFCYAQLHDTECKDRATYFAAELKKCQANNANVGFNTGYLSGFP 157
Query: 229 TEQFDRLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWM----VEYF 278
+ +E +L PYY IHK +AGLLD + + + A L M W+ +
Sbjct: 158 ESEITAVEDRSLSNGNVPYYAIHKTMAGLLDVWRHIGDTNARDVLLEMAAWVDLRTGKLT 217
Query: 279 YNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLA 338
Y ++QN+ ++ E GGMN+V+ +F T D + L +A FD LA
Sbjct: 218 YAQMQNM------------MSTEFGGMNEVMADIFHQTGDQRWLTVAQRFDHAAIFDPLA 265
Query: 339 LQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIG-H 386
D ++G H+NT +P IG+ Y+ TG +++ H G + H
Sbjct: 266 SNQDSLNGLHANTQVPKWIGASREYKATGTSRYQDIARNAWNITVSAHSYAIGGNSQAEH 325
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIA-YADYYERSLTNGVLGIQ 445
F P +A L+S+T E+C TYNMLK++R L+ Y D+YER+L N +LG Q
Sbjct: 326 FRL---PNAIAGFLNSDTCEACNTYNMLKLTRELWLTNPSATHYFDFYERALLNHLLGQQ 382
Query: 446 RGTEP-GVMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFE 500
++ G + Y PL PG + W T DSFWCC GTG+E+ +KL DSIYF
Sbjct: 383 DPSDSHGHITYFTPLNPGGRRGVGPAWGGGTWSTDYDSFWCCQGTGLETNTKLMDSIYFY 442
Query: 501 EEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRV-TLTFSSKGSGLTTSLN 559
+ +Y+ ++ S L W + V Q D + R T T GSG T L
Sbjct: 443 DNS---ALYVNLFVPSVLRWTQRGVTVTQTTD-------FPRGDTTTLKVSGSGQWT-LR 491
Query: 560 LRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
+RIP+WTS GA+ T+NGQ + S G + ++ +TW+ D + + LP+ L+T A D+
Sbjct: 492 VRIPSWTS--GAQVTVNGQAVTATS-GAYAAIDRTWADGDTVVVTLPMKLQTIAANDN-- 546
Query: 620 EYASIQAILYGPYVLAGH 637
SI A+ +GP +L+G+
Sbjct: 547 --PSIAALAFGPVILSGN 562
>gi|46113732|ref|XP_383116.1| hypothetical protein FG02940.1 [Gibberella zeae PH-1]
Length = 1393
Score = 266 bits (680), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 185/547 (33%), Positives = 275/547 (50%), Gaps = 54/547 (9%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWEEPSCELR 178
L DV L +DS Q + YLL +D D+L++ FRK L G GGW+ P R
Sbjct: 36 LSDVSL-TDSRWMDNQGRTVNYLLSIDPDRLLYVFRKNHGLDTKGATKNGGWDAPDFPFR 94
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFD 233
H GH+L+A + +A+ N+ + S V L+ CQ + SGYLS FP +
Sbjct: 95 SHVQGHFLTAWSNCYATLGNKECGSRASYFVKELAKCQAKNAKAGFTSGYLSGFPESEIA 154
Query: 234 RLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSI 291
++E L PYY IHK LAGLLD Y + +A + + + R K S
Sbjct: 155 KVENRTLNNGNVPYYAIHKTLAGLLDVYRRVGDNDAKAVMLSLAGWVDTRT----GKLSY 210
Query: 292 ERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNT 351
+ Q + E GGMN+VL + TQD K L +A FD L D +SG H+NT
Sbjct: 211 AQMQQMMQTEFGGMNEVLADIAYYTQDNKWLKVAQRFDHAAIFDPLQNNVDKLSGLHANT 270
Query: 352 HIPIVIGSQMRYEVTGDQ-------------LHKEGHQLESSGTNIGHFNFKSDPKRLAS 398
+P IG+ Y+V+GD+ +HK + + + HF DP +A
Sbjct: 271 QVPKWIGALREYKVSGDKKYLDIGRNAWDLTVHKHTYAI-GGNSQAEHFR---DPDAIAK 326
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWT-KEIAYADYYERSLTNGVLGIQRGTEP-GVMIYL 456
L S+T E+C TYNMLK++R L+ + +Y D+YE +L N +LG Q + G + Y
Sbjct: 327 YLTSDTCEACNTYNMLKLTRELWALDPSDASYFDFYENALMNHLLGQQNPKDNHGHVTYF 386
Query: 457 LPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
PL PG + W T +SFWCC G+GIE+ +KL DSIYF + +Y+
Sbjct: 387 TPLNPGGRRGVGPAWGGGTWSTDYNSFWCCQGSGIETNTKLMDSIYFHTKDT---LYVNL 443
Query: 513 YISSRLDWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+ S+L+W Q+ + Q + P + + T G T +L +RIP+WTS A
Sbjct: 444 FTPSKLNWSQQQVSIIQTTEYP-------QKDSSTLQIGGKAGTWTLAVRIPSWTSK--A 494
Query: 572 KATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NGQ + + +PG + V + W+S DK+T+ LP++LRT A D+ + + A+ +G
Sbjct: 495 SIQVNGQSVNVNATPGKYALVKRNWNSGDKVTVTLPMSLRTIAANDN----SQVAAVAFG 550
Query: 631 PYVLAGH 637
P +LA +
Sbjct: 551 PVILAAN 557
>gi|367031082|ref|XP_003664824.1| hypothetical protein MYCTH_55581 [Myceliophthora thermophila ATCC
42464]
gi|347012095|gb|AEO59579.1| hypothetical protein MYCTH_55581 [Myceliophthora thermophila ATCC
42464]
Length = 608
Score = 266 bits (680), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 183/557 (32%), Positives = 284/557 (50%), Gaps = 57/557 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEP 173
+ VSL D R + Q + YL +DVD+L++NFR L G GGW+ P
Sbjct: 12 MSAVSLIDSRWTDN------QNRTVTYLKWVDVDRLLYNFRANHGLSTQGARQNGGWDAP 65
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFP 228
R H GH+L+A + +AS +++ +++ + V+ L+ CQ G+GYLS FP
Sbjct: 66 DFPFRTHVQGHFLTAWSHCYASLRDDACRDRATYFVAELAKCQANNDAVGFGAGYLSGFP 125
Query: 229 TEQFDRLEA--LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVI 286
+FD LEA L PYY IHK +AGLLD + + + A + + + +R
Sbjct: 126 ESEFDALEARTLSNGNVPYYAIHKTMAGLLDVWRHVGDTTARDVLLALAGWVDSRT---- 181
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
+ S E+ L E GGMNDVL +L T DP+ L +A FD LA + D + G
Sbjct: 182 GRLSYEQMQAVLGTEFGGMNDVLTELSLQTGDPRWLEVAQRFDHAAVFDPLASRQDRLDG 241
Query: 347 FHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIG-HFNFKSDPK 394
H+NT +P IG+ + Y+ TG +++ H G + HF+ +P
Sbjct: 242 LHANTQVPKWIGAVLEYKATGTARYRDIAANAWNFTVGAHSYAIGGNSQAEHFH---EPD 298
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWT-KEIAYADYYERSLTNGVLGIQRGTEP-GV 452
+A L +T E+C TYNML+++R L+ AY D+YER+L N +LG Q +P G
Sbjct: 299 AIAKYLLEDTAEACNTYNMLRLTRELWMLDPASTAYFDFYERALLNHLLGQQNPADPHGH 358
Query: 453 MIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFE------EE 502
+ Y PL PG + W T DSFWCC GT +E+ +KL DSIY+ ++
Sbjct: 359 VTYFTPLNPGGRRGVGPAWGGGTWSTDYDSFWCCQGTALETNTKLMDSIYWHDDDDDADD 418
Query: 503 GKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRI 562
+++ + S L W + + Q+ D +TLT + +G +++RI
Sbjct: 419 DGAANLWVNLFTPSVLRWTERGVTLTQETAFPAGSD---TITLTVGGEPTG-GWDMHVRI 474
Query: 563 PTWTSSNGAKATLNGQDLPLPS--PGNFLSVT-KTWSSDDKLTIQLPLTLRTEAIQDDRP 619
P+WT+S GA+ +NG+ + + PG ++S+ + W + D +T++LP+TLRT A D+
Sbjct: 475 PSWTTS-GAEVLVNGEKAGVAAAVPGTYVSIRGRDWKAGDVVTVRLPMTLRTVAANDN-- 531
Query: 620 EYASIQAILYGPYVLAG 636
+ A+ YGP VL+G
Sbjct: 532 --PGVAALAYGPVVLSG 546
>gi|325927064|ref|ZP_08188334.1| hypothetical protein XPE_2335 [Xanthomonas perforans 91-118]
gi|325542563|gb|EGD14035.1| hypothetical protein XPE_2335 [Xanthomonas perforans 91-118]
Length = 791
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 193/610 (31%), Positives = 284/610 (46%), Gaps = 70/610 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +V L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVGL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D+++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RHL++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+++
Sbjct: 400 QQHPRTGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWQDG-- 452
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVY+ Y+ S + +G + P LR+ + + +L LR+P
Sbjct: 453 -QGVYVNLYVPSMVHDAAGLDMTLHSALPEQG-SASLRIDAAPAEQ-----RTLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W + LNGQ + + +L +T+ W D L++ + LR EA DD P + S
Sbjct: 506 WAQQ--PRLQLNGQPVDTAASDGYLRITRVWQRGDTLSLAFDMPLRLEATSDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ--LITFTQEYGNTKFVLTN 682
+L GP VLA D+ ++A W PA Q L G T FV +
Sbjct: 562 --VLRGPLVLA------VDLGDAAKP---WSGKTPALIGGQDILQRLQPVPGKTAFVYND 610
Query: 683 SNQSITMEKF 692
Q + F
Sbjct: 611 GVQQWQLSPF 620
>gi|451851952|gb|EMD65250.1| hypothetical protein COCSADRAFT_141970 [Cochliobolus sativus
ND90Pr]
Length = 620
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 187/553 (33%), Positives = 286/553 (51%), Gaps = 62/553 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQT-NLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEE 172
L +V+L + R W+ + L YL ++VD+L++NFR T +L G +P GGW+
Sbjct: 39 LSQVALSNSR-------WKDNENRTLNYLKFVNVDRLLYNFRATHKLSTNGAQPNGGWDA 91
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIG-----SGYLSAF 227
P+ R H GHYL+A +A+ + + K++ + V L+ CQ G GYLS F
Sbjct: 92 PNFPFRSHVQGHYLTAWVNCYATLRDSTCKDRAAYFVQELAKCQANNGVAGFSPGYLSGF 151
Query: 228 PTEQFDRLEA--LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNV 285
P +F LEA L PYY +HK +AGLLD + + +A + + + R
Sbjct: 152 PESEFAALEAGKLTGGNVPYYAVHKTMAGLLDAWRIIGDQKARDVLLALAGWVDGRT--- 208
Query: 286 IKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDIS 345
KK S + L E GGMNDVL +++ +T + + L +A FD LA + D +S
Sbjct: 209 -KKLSTAQMQTMLGTEFGGMNDVLAEIYQLTGNKQWLTVAQRFDHAKVFDPLANKQDQLS 267
Query: 346 GFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIG-HFNFKSDP 393
G H+NT +P IG+ Y+ TG + + + H G + HF P
Sbjct: 268 GNHANTQVPKWIGAAREYKSTGTKRYLDIARNAWDFTINAHTYAIGGNSQAEHF---RPP 324
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE---IAYADYYERSLTNGVLGIQRGTEP 450
++++ L ++T E C TYNMLK++R L WT + Y DYYER+L N +LG Q +
Sbjct: 325 NQISNFLTNDTAEQCNTYNMLKLTRDL--WTTDPTSTKYFDYYERALINHLLGAQNAADN 382
Query: 451 -GVMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
G + Y PL G + W T +SFWCC GT +E+ +KL DSIYF +
Sbjct: 383 HGHITYFTPLRSGGRRGVGPAWGGGTWSTDYNSFWCCQGTALETNTKLMDSIYFYDNS-- 440
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
+Y+ + S LDWK + + Q + L+VT G+G ++ +RIP+W
Sbjct: 441 -ALYVNLFTPSTLDWKQRNVKITQVTTFPIGDTTTLKVT------GTG-NWAMKIRIPSW 492
Query: 566 TSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
TS GA +LNGQ + + PG++ ++++ W S D +T++LP+ LRT A + A+I
Sbjct: 493 TS--GATISLNGQASGVAANPGSYATLSRNWVSGDTVTVKLPMKLRTVAAN----DNANI 546
Query: 625 QAILYGPYVLAGH 637
AI YGP +L+G+
Sbjct: 547 AAIAYGPTILSGN 559
>gi|427384529|ref|ZP_18881034.1| hypothetical protein HMPREF9447_02067 [Bacteroides oleiciplenus YIT
12058]
gi|425727790|gb|EKU90649.1| hypothetical protein HMPREF9447_02067 [Bacteroides oleiciplenus YIT
12058]
Length = 777
Score = 265 bits (677), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 175/532 (32%), Positives = 267/532 (50%), Gaps = 61/532 (11%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A++ YLL L+ D+ + FR A L Y GWE S + G +GHY+SA A+ +
Sbjct: 51 AEEKEATYLLELEPDRFLSGFRSEAGLVPKAPKYEGWE--SLGVAGQTLGHYMSACAMYY 108
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--FDRLEA---------LIPVW 242
A++ +E +K+ +++ L +CQ+ G+GYL+A P + F + A L W
Sbjct: 109 ATSGDERFLQKLEYIINELDSCQQANGNGYLAATPGGKKIFAEVSAGNIYSQGFDLNGGW 168
Query: 243 APYYTIHKILAGLLDQYTYADNAEALR----MTTWMVEYFYNRVQNVIKKYSIERHWQTL 298
P Y +HK+LAGL+D Y YA + +ALR + WM FY+ ++ ++K L
Sbjct: 169 VPLYVMHKVLAGLIDAYQYARSEQALRIAEKLADWMYGTFYHLTEDQMQK--------VL 220
Query: 299 NEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLGLLALQADDISGFHSNTHIPIVI 357
E GGMN+ L L+ T++ K L+LA FD + LA+ DD+ G H+NT +P +I
Sbjct: 221 ACEFGGMNEALANLYAYTKNDKFLLLAQRFDNHKAIMDSLAIGVDDLEGKHANTQVPKMI 280
Query: 358 GSQMRYEVTGDQLHK-----------EGHQLESSGTNIG-HFNFKSDPKRLASNLDSNTE 405
G+ YE+TG + + H + G + G HF P++L L ++
Sbjct: 281 GAARLYELTGSKRDSSIASFFWHTVVDNHSYVNGGNSDGEHF---GTPRKLNERLSTSNT 337
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
E+C TYNMLK++RHLF W Y+ YYER++ N +L Q + G+ Y PL G K
Sbjct: 338 ETCNTYNMLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGKK 396
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
+ +P SF CC G+G+E+ K GD IY EG +++ +I SRL W + +
Sbjct: 397 -----GYLSPFQSFCCCSGSGMENHVKYGDFIY--SEGSDSSLFVNLFIPSRLTWTARDL 449
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSP 585
+V Q D S L V + LR P W S K +NG+ + L +
Sbjct: 450 IVTQDTDIPSSNKTVLTVKTEMPQ-----SVVFRLRYPEWAESMSLK--VNGKSVSLKAS 502
Query: 586 G-NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
G N++S+ + W +DKL I + T A+ D+ + YGP +LAG
Sbjct: 503 GNNYVSIEREWKDNDKLEITFGIKFYTVAMPDNEKRV----GLFYGPVLLAG 550
>gi|357032903|ref|ZP_09094838.1| tat twin-arginine translocation pathway signal sequence domain
protein [Gluconobacter morbifer G707]
gi|356413894|gb|EHH67546.1| tat twin-arginine translocation pathway signal sequence domain
protein [Gluconobacter morbifer G707]
Length = 790
Score = 265 bits (677), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 189/596 (31%), Positives = 300/596 (50%), Gaps = 62/596 (10%)
Query: 111 SGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGW 170
SG + + L +VRL S A + N YLL L+ D+L+ NFRK A LP G YGGW
Sbjct: 35 SGADVTPIPLSNVRL-LPSPWLEAVERNRIYLLSLEADRLLHNFRKQAGLPPKGALYGGW 93
Query: 171 EEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE 230
E S + GH +GHYLSA ALM+A T + + +E+++ +V L QK+ G GY++ F +
Sbjct: 94 E--SDTIAGHTLGHYLSALALMYAQTDDAACRERVAYIVQELVVVQKQWGDGYVAGFTRK 151
Query: 231 Q-----------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM 270
+ F +EA L W+P Y IHK AGLLD + Y +AL +
Sbjct: 152 EKNGALVDGKRIFAEIEAGDIRSSGFDLNGAWSPLYNIHKTFAGLLDAHIYCHCDQALNV 211
Query: 271 TTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAH-LFD 329
+ ++ ++ K + + + L E GG+N+ +L T D + L LA+ ++D
Sbjct: 212 AVGLGQF----LKAFFGKLTDAQMQKVLTCEYGGLNESFAELAARTGDEEWLRLAYRIYD 267
Query: 330 KPCFLGLLALQADDISGFHSNTHIPIVIG-------SQMRYEVTGDQLHKEG---HQLES 379
+P L+ + DD++ H+NT IP ++G SQ R+ +TG Q + H
Sbjct: 268 RPVLDPLME-ERDDLANRHANTQIPKLVGLARIAEVSQNRHWMTGPQFFWKAVTRHHSYV 326
Query: 380 SGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTN 439
G N F S+P ++ ++ T E C TYNMLK++R + + A DYYER+ N
Sbjct: 327 IGGNADREYF-SEPDTISQHITEQTCEHCNTYNMLKLTRQCYASNPQAALFDYYERAHLN 385
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L + G+ Y+ P +E W TP++SFWCC GTG+ES +K GDSI++
Sbjct: 386 HILAAH-DPQTGMFTYMTPTITAGVRE-----WSTPTESFWCCVGTGMESHAKHGDSIWW 439
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ E +++ YI SR+ W V+ K++ D RV+L S + L
Sbjct: 440 QREET---LFVNLYIPSRMVWDRKD--VSWKMETGYPHDG--RVSLLLEDLNSPVAFRLA 492
Query: 560 LRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
LR+P W + +NG+D+P ++ + + WS+ D + + LP+T+RTE+ DD
Sbjct: 493 LRVPGWVREP-IQVAVNGRDVPATPSDGYIVLDRKWSAGDHVVLDLPMTVRTESPVDD-- 549
Query: 620 EYASIQAILYGPYVLAGH---SIGDWDITESATSLSDWITP-IPASYNSQLITFTQ 671
+ + +L GP V+A + G +D + A D +PA+ + + T+
Sbjct: 550 --SKLVTVLRGPMVMAADLAPAGGVYDAVDPAVVTDDLTQDLVPAAGQASVFRTTR 603
>gi|78048280|ref|YP_364455.1| hypothetical protein XCV2724 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78036710|emb|CAJ24403.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 791
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 193/610 (31%), Positives = 284/610 (46%), Gaps = 70/610 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +V L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVSL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D+++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RHL++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+++
Sbjct: 400 QQHPRTGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWQDG-- 452
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVY+ Y+ S + +G + P LR+ + + +L LR+P
Sbjct: 453 -QGVYVNLYVPSMVHDAAGLDMTLHSALPEQG-SASLRIDAAPAEQ-----RTLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W + LNGQ + + +L +T+ W D L++ + LR EA DD P + S
Sbjct: 506 WAQQ--PRLQLNGQPVDTAASDGYLRITRVWQRGDTLSLAFDMPLRLEATSDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ--LITFTQEYGNTKFVLTN 682
+L GP VLA D+ ++A W PA Q L G T FV +
Sbjct: 562 --VLRGPLVLA------VDLGDAAKP---WSGKTPALIGGQDILQRLQPVPGKTAFVYND 610
Query: 683 SNQSITMEKF 692
Q + F
Sbjct: 611 GVQQWQLSPF 620
>gi|325919533|ref|ZP_08181551.1| hypothetical protein XGA_0490 [Xanthomonas gardneri ATCC 19865]
gi|325549987|gb|EGD20823.1| hypothetical protein XGA_0490 [Xanthomonas gardneri ATCC 19865]
Length = 791
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 185/557 (33%), Positives = 272/557 (48%), Gaps = 60/557 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A QTN YL+ L+ D+L+ NF A L YGGWE +
Sbjct: 49 IRAVPLAQVRL-TPSLFLDALQTNRRYLMRLEPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +V+ L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAHYLVAELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 KIESGRAVFDELKKGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVGL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q V + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQAVFSALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P + L T E C +YNMLK++RHL++W + + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSTSKFLTEQTCEHCASYNMLKLTRHLYQWGPQAEFFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ P+ G ++ W +P D FWCC G+G+E+ ++ GDSIY+++
Sbjct: 400 QQHPRTGMFTYMTPMLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWQDG-- 452
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVY+ Y+ S + +G + + P LRV + + +L LR+P
Sbjct: 453 -QGVYVNLYVPSSVRDAAGLDMTLRSTMPEQG-SASLRVDAAPAEQ-----RTLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W S + LNGQ + +L +T+ W + D L + + LR EA DD P + S
Sbjct: 506 WAQSPVLQ--LNGQPVGAAVSDGYLRITRVWRAGDTLDLSFEMPLRLEAAADD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGD 641
+L GP VLA +GD
Sbjct: 562 --VLRGPLVLAA-DLGD 575
>gi|427411824|ref|ZP_18902026.1| hypothetical protein HMPREF9718_04500 [Sphingobium yanoikuyae ATCC
51230]
gi|425710114|gb|EKU73137.1| hypothetical protein HMPREF9718_04500 [Sphingobium yanoikuyae ATCC
51230]
Length = 802
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 172/533 (32%), Positives = 265/533 (49%), Gaps = 54/533 (10%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A + N YLL L+ D+L+ NFRK A L G YGGWE + + GH +GHYL+A ALM
Sbjct: 63 AVEGNRLYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWENDT--IAGHTLGHYLTALALMH 120
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA---------------- 237
A T + + + ++ L+ACQ G GY++ F + D +E
Sbjct: 121 AQTGDAECARRAAYIIDELAACQAAAGDGYVAGFTRRRDDVIEDGRLIFPEIMRGDIRSA 180
Query: 238 ---LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
L W P+Y HK+ AGL D T+ N++A + + Y + V K +
Sbjct: 181 GFDLNGCWVPFYNWHKLFAGLFDAETHLGNSQARGVALALAAY----IDGVFAKLDDAQV 236
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
Q L+ E GG+N+ +L T DP+ L LA L LA + + + H+NT IP
Sbjct: 237 QQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDPLAQRQNSLPWIHANTQIP 296
Query: 355 IVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFK----------SDPKRLASNLDSNT 404
+IG +E+TG+ T +G +++ DP ++ ++ T
Sbjct: 297 KLIGLARLHEITGNAADAIAANFFWE-TVVGQYSYVIGGNADREYFPDPGTISKHITEQT 355
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSS 464
ESC +YNMLK++RHL+ W E DYYER+ N +L Q G+ Y++PL GS
Sbjct: 356 CESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQNPAT-GMFAYMVPLMSGSH 414
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ-YISSRLDWKSG 523
+ W P D FWCC G+G+ES +K G+SI++E+ + + I YI S DW +
Sbjct: 415 RV-----WSEPFDDFWCCVGSGMESHAKHGESIWWEDADRPADMLIANLYIPSEADWAAR 469
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
+ +++ +D ++ +++ ++ T L LRIP W GA+ +NG LP P
Sbjct: 470 GAKL--RIETGYPFDGHIALSIPKLARAGRFT--LALRIPGW--CQGARIAVNGTPLPAP 523
Query: 584 SPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+ + + + W + D++T+ LP+ LR EA DD A A+L+GP VLA
Sbjct: 524 RIADGYALIGRKWKAGDQVTLDLPMALRVEATPDD----ARTIALLHGPVVLA 572
>gi|374322441|ref|YP_005075570.1| hypothetical protein HPL003_12970 [Paenibacillus terrae HPL-003]
gi|357201450|gb|AET59347.1| hypothetical protein HPL003_12970 [Paenibacillus terrae HPL-003]
Length = 774
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 186/567 (32%), Positives = 275/567 (48%), Gaps = 63/567 (11%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALM 192
+A + N YLL L D+L+ FR+ A L Y GWE S + GH +GHYLSA ++M
Sbjct: 28 QAMELNRSYLLELQPDRLLARFREYAGLSTKAPQYEGWEAMS--ISGHTLGHYLSACSMM 85
Query: 193 WASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA---------LIPV 241
+AST + KE + L CQ+ G GY+S P E F+ + A L
Sbjct: 86 YASTGDNRFKEIAHYITDELDVCQEAHGDGYVSGIPGGKELFEEVSAGNIRSKGFDLNGA 145
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEE 301
WAP YT+HK+ AGL D Y +AL + + ++ + ++ S E+ Q + E
Sbjct: 146 WAPLYTLHKLFAGLRDAYHLTGCNKALLVERKLADW----LGGILTPMSDEQMQQMMFCE 201
Query: 302 AGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQM 361
GGMN+VL L+ T + +L LA F L L+ Q D + G H+NT IP +IG
Sbjct: 202 YGGMNEVLADLYADTGEESYLRLAECFWHKLVLDPLSSQEDCLQGIHANTQIPKLIGLAK 261
Query: 362 RYEVTGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTT 410
YE+T D + + H G + G + F + P L + +T E+C T
Sbjct: 262 EYELTNDTKRRATVEFFWDRVVDHHSYVIGGNSFGEY-FGA-PGGLNDRIGPHTTETCNT 319
Query: 411 YNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYH 470
YNMLK++ HLF+W AD+YER L N +L Q GV Y L LA G K
Sbjct: 320 YNMLKLTSHLFQWNVSAKEADFYERGLFNHILASQDPVHGGV-TYFLSLAMGGHK----- 373
Query: 471 HWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK 530
H+ + D F CC GTG+E+ + G IYF + K +Y+ Q+I+S L+WK + + Q
Sbjct: 374 HFESKFDDFTCCVGTGMENHASYGSGIYFHDHDK---LYVNQFIASTLEWKDTGVTLKQS 430
Query: 531 VDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFL 589
+ L + +K L +R P W + G +NG++ + S PG+F+
Sbjct: 431 TSYPDTDHTTLEIQCDQPAK-----FMLLVRYPYW-AEKGITIRVNGKEQSVVSEPGSFV 484
Query: 590 SVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWD------ 643
S+ +TW D + + +P++LR E + D+ P+ A A++YGP VLAG +G D
Sbjct: 485 SIARTWIDGDVVEVTIPMSLRLEQMPDN-PDRA---AVMYGPLVLAG-DLGPIDDPKAKD 539
Query: 644 ------ITESATSLSDWITPIPASYNS 664
L WI P+ N+
Sbjct: 540 FLYTPVFIPGTDELDTWIQPVEGKTNT 566
>gi|296331240|ref|ZP_06873712.1| hypothetical protein BSU6633_09061 [Bacillus subtilis subsp.
spizizenii ATCC 6633]
gi|296151355|gb|EFG92232.1| hypothetical protein BSU6633_09061 [Bacillus subtilis subsp.
spizizenii ATCC 6633]
Length = 761
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 173/540 (32%), Positives = 278/540 (51%), Gaps = 54/540 (10%)
Query: 130 MHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSAS 189
M + +Q EYLL LDVD+L+ + YGGWE + E+ GH +GH+LSA+
Sbjct: 10 MFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAGHSIGHWLSAA 67
Query: 190 ALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD-------RLE--ALIP 240
+ M+ ++ +E LK K V+ LS Q+ GY+S F FD R++ +L
Sbjct: 68 SAMYQASGDEKLKRKAEYAVNELSHIQQFDEEGYISGFSRACFDEVFSGDFRVDHFSLGG 127
Query: 241 VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNE 300
W P+Y++HK+ AGL+D Y N ALR+ + ++ + + + + E+ + L
Sbjct: 128 SWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRMLIC 183
Query: 301 EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
E GGMN+ + L+ +T++ +L LA F L LA D++ G H+NT IP VIG+
Sbjct: 184 EHGGMNEAMADLYILTKNKSYLDLAERFCHRAILQPLAEGKDELEGKHANTQIPKVIGAA 243
Query: 361 MRYEVTGDQLHKE-----------GHQLESSGTNIG-HFNFKSDPKRLASNLDSNTEESC 408
Y++TG++ ++ G +IG HF + + L T E+C
Sbjct: 244 KLYDITGNEAYRNPALFFWEQVVYQRSYAIGGNSIGEHFGAEG-----SEELGVTTAETC 298
Query: 409 TTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERS 468
TYNMLK++ HLFRW E + DYYE +L N +L Q E G+ Y + PG K
Sbjct: 299 NTYNMLKLTGHLFRWFHEARFTDYYENALYNHILSSQ-DPESGMKTYFVSTQPGHFKV-- 355
Query: 469 YHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVN 528
+ +P DSFWCC GTG+E+ ++ +IY ++ +Y+ +I S+++ + Q+++
Sbjct: 356 ---YCSPEDSFWCCTGTGMENPARYTQNIYHLDQDD---LYVNLFIPSQINVREKQMIIT 409
Query: 529 QKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA-KATLNGQDLPLPSPGN 587
Q+ P T K G+ +L +RIP WT NG+ KA +NG+ +
Sbjct: 410 QETSF-----PAANKTKLVVKKADGVPMTLQIRIPYWT--NGSLKAVVNGKRVQSVEKNG 462
Query: 588 FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITES 647
+L++ K W++ D + I LP+ L +DD + ++YGP VLAG ++G D E+
Sbjct: 463 YLAIHKHWNTGDCIEIDLPMKLHIYQAKDDPKK----SVLMYGPVVLAG-ALGREDFPET 517
>gi|384428325|ref|YP_005637684.1| hypothetical protein XCR_2693 [Xanthomonas campestris pv. raphani
756C]
gi|341937427|gb|AEL07566.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
756C]
Length = 791
Score = 264 bits (675), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 190/576 (32%), Positives = 277/576 (48%), Gaps = 68/576 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 IRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +V+ L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTDDAQCRTRARYLVAELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVAL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T + L LA
Sbjct: 226 AGY----LQGIFAALDDTQLQKVLSCEFGGLNESFVELHVRTGHAQWLALAQRLHHHAVF 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C++YNMLK++RHL+RW + AY DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCSSYNMLKLTRHLYRWGPQAAYFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ P+ G ++ W +P D FWCC G+G+E+ ++ GDSIY+E+
Sbjct: 400 QQHPRTGMFTYMTPMLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWEDG-- 452
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GV I Y+ SR+ +G + P V+L + + T L+LR+P
Sbjct: 453 -QGVAINLYVPSRVRNAAGLDMTLHSALPAQG-----SVSLRIDAAPAAQRT-LSLRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W ++ + LNG + +L VT+ W D L + L + LR EA DD P + S
Sbjct: 506 WAATPVLQ--LNGAVVDAAPVDGYLRVTRIWHPGDTLDLSLHMPLRLEATPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPA 660
+L GP VLA D+ ++AT W PA
Sbjct: 562 --LLRGPLVLAA------DLGDAATP---WSGKTPA 586
>gi|302422424|ref|XP_003009042.1| secreted protein [Verticillium albo-atrum VaMs.102]
gi|261352188|gb|EEY14616.1| secreted protein [Verticillium albo-atrum VaMs.102]
Length = 635
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 183/531 (34%), Positives = 269/531 (50%), Gaps = 51/531 (9%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEPSCELRGHFVGHYLSASALMW 193
Q L Y+ +DVD+L++ FR+T LP G +P GGW+ P R HF GH+L+A + W
Sbjct: 65 QDRTLNYIKFVDVDRLLYVFRQTHGLPLQGAQPNGGWDAPDFPFRSHFQGHFLNAWSYCW 124
Query: 194 ASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLE--ALIPVWAPYY 246
A +E+ +++ S + L+ CQ GYLS FP + + +E L PYY
Sbjct: 125 AVLRDEACRDRASYFATELAKCQGNNDKAGFNPGYLSGFPESEIEAVEKRTLSNGNVPYY 184
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN 306
+IHK +AGLLD + + + A + M + R K S + ++ E GGMN
Sbjct: 185 SIHKTMAGLLDVWRHIGDETARDVLLGMAGWVDLRT----GKLSYSQMQTMMSTEFGGMN 240
Query: 307 DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVT 366
+V+ +F T D + L +A FD LA D ++G H+NT +P IG+ Y+ T
Sbjct: 241 EVMADIFHQTGDERWLTVAQRFDHASVFDPLAGNRDSLNGLHANTQVPKWIGAAREYKAT 300
Query: 367 G-----DQLHKEGHQLESSGTNIGHFNFKSD----PKRLASNLDSNTEESCTTYNMLKVS 417
G D H + + T N +S+ P +AS LD +T E+C TYNMLK++
Sbjct: 301 GTTRYSDIAHNAWNITVQAHTYAIGANSQSEHFRPPNAIASYLDEDTAEACNTYNMLKLT 360
Query: 418 RHLFRWTKEIA---YADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----ERSY 469
R L W + + Y D+YE++L N +G Q + G + Y L PG +
Sbjct: 361 REL--WVMDPSNSKYFDFYEQALINHAIGQQDPSSAHGHVTYFTSLNPGGHRGVGPAWGG 418
Query: 470 HHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQ 529
W T + WCC GT +E+ +KL DSIYF +E +Y+ Y SRL+W ++ V Q
Sbjct: 419 GTWSTDYGTAWCCQGTALETNTKLMDSIYFYDESS---LYVNLYAPSRLNWTQRKVTVLQ 475
Query: 530 KVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS--PG 586
+ D P L+ T T + KG G L LRIP W S GA +NGQ L PG
Sbjct: 476 ETDFP-------LQETSTLTVKGGG-DWDLRLRIPIW--SKGATIAINGQALDGVETVPG 525
Query: 587 NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+ ++ ++W +D +TI LP+ L T + DD P S+ A+ YGP VLA +
Sbjct: 526 TYATIKRSWGEEDIVTITLPMALHTISA-DDEP---SVAALAYGPVVLAAN 572
>gi|381203003|ref|ZP_09910112.1| hypothetical protein SyanX_20925 [Sphingobium yanoikuyae XLDN2-5]
Length = 790
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 170/533 (31%), Positives = 265/533 (49%), Gaps = 54/533 (10%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A + N YLL L+ D+L+ NFRK A L G YGGWE + + GH +GHYL+A ALM
Sbjct: 51 AVEGNRRYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWENDT--IAGHTLGHYLTALALMH 108
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA---------------- 237
A T + + + +++ L+ CQ G GY++ F + D +E
Sbjct: 109 AQTGDAECARRAAYIIAELAECQAAAGDGYVAGFTRRRDDVIEDGRLIFPEIMRGDIRSA 168
Query: 238 ---LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
L W P+Y HK+ AGL D ++ N++A + + Y + V K +
Sbjct: 169 GFDLNGCWVPFYNWHKLFAGLFDAESHLGNSQARGVALALAAY----IDGVFAKLDDAQV 224
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
Q L+ E GG+N+ +L T DP+ L LA L LA + + + H+NT IP
Sbjct: 225 QQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDPLAQRQNSLPWIHANTQIP 284
Query: 355 IVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFK----------SDPKRLASNLDSNT 404
+IG +E+TG+ T +G +++ DP ++ ++ T
Sbjct: 285 KLIGLARLHEITGNAADAIAANFFWE-TVVGQYSYVIGGNADREYFPDPGTISKHITEQT 343
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSS 464
ESC +YNMLK++RHL+ W E DYYER+ N +L Q G+ Y++PL GS
Sbjct: 344 CESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQNPAT-GMFAYMVPLMSGSH 402
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ-YISSRLDWKSG 523
+ W P D FWCC G+G+ES +K G+SI++E+ + + I YI S DW +
Sbjct: 403 RV-----WSEPFDDFWCCVGSGMESHAKHGESIWWEDADRPADMLIANLYIPSEADWAAR 457
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
+ +++ +D ++ +++ ++ T L LRIP W GA+ +NG LP P
Sbjct: 458 GAKL--RIESGYPFDGHIALSIPKLARAGRFT--LALRIPGWC--QGARVAVNGTPLPAP 511
Query: 584 SPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+ + + + W + D++T+ LP+ LR EA DD A A+L+GP VLA
Sbjct: 512 RIADGYALIDRKWKAGDQVTLDLPMALRIEATPDD----ARTIALLHGPVVLA 560
>gi|383779543|ref|YP_005464109.1| hypothetical protein AMIS_43730 [Actinoplanes missouriensis 431]
gi|381372775|dbj|BAL89593.1| hypothetical protein AMIS_43730 [Actinoplanes missouriensis 431]
Length = 799
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 174/516 (33%), Positives = 261/516 (50%), Gaps = 43/516 (8%)
Query: 139 LEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHN 198
+ YL +D+D+++ FR TA LP+ EP GGWE P+ +LRGH GH LS A +
Sbjct: 61 VAYLRFVDLDRMLHMFRVTAGLPSAAEPLGGWEAPTVQLRGHTTGHLLSGLAQAAYHLDD 120
Query: 199 ESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQ 258
LK + +A+V L ACQ +GYLSAFP FD+LEA WAPYYTIHKI AGLLDQ
Sbjct: 121 RDLKARSAALVDGLKACQAP--NGYLSAFPETIFDQLEAGKNPWAPYYTIHKIFAGLLDQ 178
Query: 259 YTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQD 318
+ N AL + M ++ +RV + + E+ + L+ E GGMN+ L+ +T +
Sbjct: 179 HRLLGNTTALDVARRMADWVGSRVSKLTR----EQMQKVLHVEFGGMNESFVNLYRVTGE 234
Query: 319 PKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----- 373
HL LA FD L+ + D ++G H+NT IP V+G+ Y+ TG H+
Sbjct: 235 AAHLELARAFDHDEIFVPLSEKRDTLAGRHANTDIPKVVGAAAMYQATGSDYHRTIATYF 294
Query: 374 -----GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRW-TKEI 427
H G N + F P ++ S L NT E+C TYNMLK++ L+
Sbjct: 295 WDQVVRHHSYVIGGN-SNAEFFGPPGQVVSQLGENTCENCNTYNMLKLTERLYAIDPSRT 353
Query: 428 AYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSKERSYHHWGTPSD------SFW 480
Y DY+E +L N +LG Q + G + Y L+ +S++ P +F
Sbjct: 354 DYLDYHEWALINQMLGEQDPDSAHGNVTYYTGLSSTASRKGKEGLVSDPGSYSSDYGNFS 413
Query: 481 CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPY 540
C +G+G+E+ +K + IY + + +I S ++ +I +N PY
Sbjct: 414 CDHGSGLETHTKFAEPIYDTSRDT---LSVKLFIPSETTFRGAKIQINTMF-------PY 463
Query: 541 LRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDK 600
R T+ G+G +L +RIP+W + +NG+ +P PG F ++ + W D
Sbjct: 464 -RETVRLRVDGTGAPFTLRVRIPSWVRDPALR--VNGKPVPA-HPGRFATIRRVWRRGDV 519
Query: 601 LTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+T+ LP RT + P+ ++ A+ YGP VLAG
Sbjct: 520 VTLHLP--FRTRWLPA--PDNPAVHALTYGPLVLAG 551
>gi|289661682|ref|ZP_06483263.1| putative secreted protein, partial [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 756
Score = 263 bits (673), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 192/610 (31%), Positives = 284/610 (46%), Gaps = 70/610 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-MPSLFLDALNTNRRYLMRLQPDRLLHNFVLYAGLDPQAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +V L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVGL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D+++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLVTQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RHL++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+++
Sbjct: 400 QQHPRSGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWQDG-- 452
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GV++ Y+ S + +G + P LR+ + + +L LR+P
Sbjct: 453 -QGVFVNLYVPSTVRDAAGLDMTLHSALPEQG-SASLRIDAAPAEQ-----RTLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W + LNGQ + + +L +T+ W D L++ + LR EA DD P + S
Sbjct: 506 WAQQ--PRLQLNGQPVDSAASDGYLRITRVWQRGDTLSLAFDMPLRLEATPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ--LITFTQEYGNTKFVLTN 682
+L GP VLA D+ ++A W + PA Q L G T FV +
Sbjct: 562 --VLRGPLVLA------VDLGDAAKP---WSSKTPALIGGQDILQRLQPVPGKTAFVYND 610
Query: 683 SNQSITMEKF 692
Q + F
Sbjct: 611 GAQQWQLSPF 620
>gi|384418897|ref|YP_005628257.1| hypothetical protein XOC_1936 [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353461810|gb|AEQ96089.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 791
Score = 263 bits (673), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 194/610 (31%), Positives = 285/610 (46%), Gaps = 70/610 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRIRAGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DN +AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQVAVGL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDTASGAAARFFWHTVTDHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RH+++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHVYQWGPQAELFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ P+ G ++ W +P D FWCC G+G+E+ ++ GDSIY+ ++G+
Sbjct: 400 QQHPRTGMFTYMTPMLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYW-QDGQ 453
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVYI Y+ S + +G + P LR+ ++ +L LR+P
Sbjct: 454 --GVYINLYVPSTVRDAAGLDMTLHSALPEQG-SALLRIDAAPPAQ-----RTLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W + LNGQ + + +L +T+ W D L++ + LR EA DD P + S
Sbjct: 506 WAQQ--PRLQLNGQPVDTAASDGYLRITRVWQRGDTLSLSFDMPLRLEATPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ--LITFTQEYGNTKFVLTN 682
+L GP VLA D+ ++A W PA Q L G T FV T+
Sbjct: 562 --VLRGPLVLA------VDLGDAAKP---WSGKTPALIGGQDILQRLQPAPGKTAFVYTD 610
Query: 683 SNQSITMEKF 692
Q F
Sbjct: 611 GAQQWQFSPF 620
>gi|390456441|ref|ZP_10241969.1| hypothetical protein PpeoK3_20683 [Paenibacillus peoriae KCTC 3763]
Length = 759
Score = 263 bits (671), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 171/547 (31%), Positives = 284/547 (51%), Gaps = 50/547 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYG-GWEEP 173
L ++S V L S+ AQ L++LL ++ D++++NFRK A L P GW+
Sbjct: 185 LHDISTQKVHLEGPSLLKTAQNRRLQFLLTVNDDQMLYNFRKAAGLDTLNAPAMIGWDSD 244
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS------GYLSAF 227
L+GH GHYLSA AL +AST NE +++K++ ++ L+ Q + G+LSA+
Sbjct: 245 DSLLKGHTTGHYLSALALCYASTGNERIRQKLAYLIDELNKVQLAFEADDRYHYGFLSAY 304
Query: 228 PTEQFDRLEALI---PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
EQFD LE +WAPYYT+HKI AGLLD Y A AL + + ++ YNR+ +
Sbjct: 305 SEEQFDLLEVYTRYPEIWAPYYTLHKIFAGLLDSYHIAGIELALVIADKVGDWIYNRL-S 363
Query: 285 VIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
V+ + +++ W + E GG+N+ L +L+ TQ H+ A LFD + D
Sbjct: 364 VLPQEQLKKMWGLYIAGEYGGINESLAELYTYTQKEHHIAAAKLFDNDRLFFPMEQHVDA 423
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSD 392
+ G H+N HIP ++G+ +E TG+Q + + H GT G FK
Sbjct: 424 LGGMHANQHIPQIVGAFKIFEATGEQKYYDIAKFFWESVVNAHIYSIGGTGEGEM-FKQ- 481
Query: 393 PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV 452
P ++ ++L +T E+C +YNMLK+++ L+ + ++ Y DYYER++ N +L G
Sbjct: 482 PYQIGAHLTEHTAETCASYNMLKLTKQLYVYENDVKYMDYYERTMINHILSSTDHECLGA 541
Query: 453 MIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
Y +P + G K G ++ CC+GTG+E+ K ++I+FE+ +Y+
Sbjct: 542 STYFMPTSSGGQK-------GYDEEN-SCCHGTGLENHFKYAEAIFFEDA---DSLYVNL 590
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRV-TLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
++ S L+ ++ + V Q V + + + + + TLT T+L +RIP W
Sbjct: 591 FVPSALNDEAKGLQVVQSVPEIFNGEVEIHIETLT--------RTNLRVRIPYWHQGE-V 641
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
A +N + +L +++ W+ D++T++ LR E P+ A I ++ +GP
Sbjct: 642 TAFVNHTKVNTVEENGYLVLSQKWNKGDQVTMKFTPRLRLERT----PDKADIASLAFGP 697
Query: 632 YVLAGHS 638
Y+LA S
Sbjct: 698 YILAAVS 704
>gi|354583886|ref|ZP_09002783.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
gi|353197148|gb|EHB62641.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
Length = 778
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 174/543 (32%), Positives = 271/543 (49%), Gaps = 55/543 (10%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
+Q+T YLL LDVD+L+ + A L YGGWEE + GH +GH+LSA+A M
Sbjct: 27 SQETGKGYLLHLDVDRLMAPCYEAASLEPKKPRYGGWEE--TPIAGHSIGHWLSAAAAMI 84
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLE---------ALIPVWAP 244
+T +E L +K+ V+ L+ Q GY+S FP + FD + +L W P
Sbjct: 85 DATSDEELLKKLVYAVNELAYVQSHDKDGYVSGFPRDCFDIVFTGDFEVHNFSLAGSWVP 144
Query: 245 YYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGG 304
+Y++HKI AGL+D Y +AL + + ++ + + + E+ + L E GG
Sbjct: 145 WYSLHKIFAGLIDAYRLTGIEQALEVVIRLADW----AKKGTDRLTDEQFQRMLICEHGG 200
Query: 305 MNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYE 364
MND + L+ +T + +L LA F L LA D++ G H+NT IP VIG+ YE
Sbjct: 201 MNDTMADLYRLTNNHAYLELAIRFCHRAILEPLARGVDELEGKHANTQIPKVIGAAKLYE 260
Query: 365 VTGDQLHKEGHQL------ESSGTNIG------HFNFKSDPKRLASNLDSNTEESCTTYN 412
+TGD +++ + + IG HF + K L T E+C TYN
Sbjct: 261 ITGDDFYRKAAEFFWKEVTRNRSYIIGGNSIFEHFRAANQEK-----LGVETAETCNTYN 315
Query: 413 MLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHW 472
MLK++ HLF W+++ Y D+YER+L N +L Q + G+ +Y + PG K +
Sbjct: 316 MLKLTDHLFGWSQDAEYMDFYERALYNHILASQ-DPDTGMKMYFVSTEPGHFKV-----Y 369
Query: 473 GTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVD 532
GT SFWCC GTG+E+ ++ IY +Y+ +I+S+ + Q+V+ Q+ +
Sbjct: 370 GTAEHSFWCCTGTGMENPARYTHEIYHATSN---AIYVNLFIASKATFDDHQVVIRQETE 426
Query: 533 PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVT 592
P T + L +RIP WT+ A +NG ++ + +L++
Sbjct: 427 F-----PKQSRTRLIIEEAKAAHFKLRIRIPQWTAG-AVTAVVNGSEIYADAEPGYLNIE 480
Query: 593 KTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG----HSIGDWDITESA 648
+ W++ D + + LP+ LR +DD A ILYGP VLAG + D DI ++
Sbjct: 481 RDWNAGDTIEVTLPMELRLYHAKDD----AKKVGILYGPIVLAGALGTEAFPDSDIVDNH 536
Query: 649 TSL 651
T L
Sbjct: 537 TKL 539
>gi|330467876|ref|YP_004405619.1| hypothetical protein VAB18032_19585 [Verrucosispora maris
AB-18-032]
gi|328810847|gb|AEB45019.1| hypothetical protein VAB18032_19585 [Verrucosispora maris
AB-18-032]
Length = 913
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 196/613 (31%), Positives = 303/613 (49%), Gaps = 63/613 (10%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEP-YGGWEEPSCELRGHFVGHYLSASALMW 193
Q L YL +DV++L++NFR RL G GGWE P+ R H GH+L+A + MW
Sbjct: 67 QNRTLNYLRFVDVNRLLYNFRANHRLSTAGAAALGGWEAPTFPFRTHSQGHFLTAWSHMW 126
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGS-----GYLSAFPTEQFDRLEA--LIPVWAPYY 246
A + + ++K + +V+ L+ CQ + GYL +P F +EA L PYY
Sbjct: 127 AVLGDTTCRDKANYMVAELAKCQANNAAAGFNPGYLCGYPESDFTAVEARTLNNGNVPYY 186
Query: 247 TIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
TIHK L GLLD + + N +A L + W V++ R+ + + L E
Sbjct: 187 TIHKTLVGLLDVWRHIGNNQARDVLLALAGW-VDWRTGRLSSAQMQ-------AMLGTEF 238
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMR 362
GGMN VL L+ T D + L +A FD LA D ++G H+NT IP IG+
Sbjct: 239 GGMNAVLTDLYQQTGDARWLTVAQRFDHAAVFNPLAANQDQLNGLHANTQIPKWIGAARE 298
Query: 363 YEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYN 412
++ TG +++ + + G N +F++ P ++ L ++T E C TYN
Sbjct: 299 FKATGTTRYRDIASNAWNLTVNTRTYAIGGNSQAEHFRA-PNAISGYLRNDTCEHCNTYN 357
Query: 413 MLKVSRHLFRWT-KEIAYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----E 466
MLK++R L+ +AY D+YER+L N ++G Q + G + Y PL PG +
Sbjct: 358 MLKLTRELWLLDPNRVAYFDFYERALLNHLIGAQNPADNHGHITYFTPLQPGGRRGVGPA 417
Query: 467 RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIV 526
W T +SFWCC GTG+E+ + L DSIYF + + ++ S L+W I
Sbjct: 418 WGGGTWSTDYNSFWCCQGTGLENNTTLMDSIYFHNGST---LTVNLFMPSVLNWSQRGIT 474
Query: 527 VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG--QDLPLPS 584
V Q S L VT T G + ++ +RIP WT A ++NG Q++ +
Sbjct: 475 VTQSTSYPASDTSTLTVTGTV-----GGSWTMRIRIPAWTQD--ATVSVNGTVQNIAT-T 526
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI 644
PG + S+T+TW+S D +T++LP+ + E D+ S+ A+ YGP VL+G+ G+
Sbjct: 527 PGTYASLTRTWTSGDTVTVRLPMRVVVEPTNDN----PSVVALTYGPAVLSGN-YGN--- 578
Query: 645 TESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLT---NSNQSITMEKFPKSGTDAAL 701
+ ++L T +S +TFT NT+ L +++ + G+
Sbjct: 579 -TALSALPALATASVTRTSSTALTFTATANNTQVNLLPFYDAHGHNYTVYWSSGGSSGPA 637
Query: 702 HATFRLILNDSSG 714
ATFRL+ N +SG
Sbjct: 638 QATFRLV-NAASG 649
>gi|402080566|gb|EJT75711.1| hypothetical protein GGTG_05643 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 640
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 188/560 (33%), Positives = 277/560 (49%), Gaps = 58/560 (10%)
Query: 111 SGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGG 169
+G+ L + LGS Q L Y+ ++VD+L++NFR R+ G + G
Sbjct: 44 TGDSALAFPLSQLSLGSGRFR-ENQDRALTYIKSVNVDRLLYNFRANHRVSTNGAQSNKG 102
Query: 170 WEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSGYL 224
W+ P R HF GH+L+A A +A+ + + ++ + V+ L+ CQ +GYL
Sbjct: 103 WDAPDFPFRTHFQGHFLTAWAQCYATLGDATCRDHANYFVAELAKCQNNNAAAGFKAGYL 162
Query: 225 SAFPTEQFDRLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYF 278
S FP + D++E L PYY IHK +AGLLD + + +A LRM W
Sbjct: 163 SGFPESEIDKVEQRTLSNGNVPYYAIHKTMAGLLDVWRVMGSTQARDVLLRMAGW----- 217
Query: 279 YNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLA 338
V S ++ L E GGMN+VL +F T D + + A FD LA
Sbjct: 218 ---VDTRTAALSYQQMQNMLGTEFGGMNEVLADVFHQTGDARWIKTARRFDHAAVFDPLA 274
Query: 339 LQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFN 388
D +SG H+NT +P IG+ Y+ T ++ ++ + G N +
Sbjct: 275 QGQDRLSGLHANTQVPKWIGAAREYKATKEERYRTVARAAWNFTVAAHTYAIGGNSQSEH 334
Query: 389 FKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE---IAYADYYERSLTNGVLGIQ 445
F+S P +A L +T E+C +YNMLK++R L W + AY D+YER+L N +LG Q
Sbjct: 335 FRS-PNAIAGYLAKDTAEACNSYNMLKLTREL--WLADPSAAAYFDFYERALLNHMLGQQ 391
Query: 446 R-GTEPGVMIYLLPLAPGSSKERSYHHWG-----TPSDSFWCCYGTGIESFSKLGDSIYF 499
+ G + Y PL PG + WG T DSFWCC GTGIE+ +KL DSIYF
Sbjct: 392 DPRSAHGHVTYFTPLNPGGRRGVG-PAWGGGTYSTDYDSFWCCQGTGIETNTKLMDSIYF 450
Query: 500 EEEGKYPGVYIIQYISSRLDW-KSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSL 558
+Y+ +ISS + W + G +VV Q ++ TL S G G T L
Sbjct: 451 RGRDDAT-LYVNLFISSSVKWTQKGGVVVTQ----TTTFPKSDTTTLDVSGAGGGRWT-L 504
Query: 559 NLRIPTWTSSNGAKATLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+R+P+W + A T+NGQ + S PG + S+T+ W + DK+ ++LP+ L T A D
Sbjct: 505 AVRVPSWVAGQ-AVITVNGQAVQGVSTAPGTYASITRDWQAGDKVVVRLPMRLYTIAAND 563
Query: 617 DRPEYASIQAILYGPYVLAG 636
D + A+ YGP VL+G
Sbjct: 564 D----MGLVAVAYGPAVLSG 579
>gi|398384929|ref|ZP_10542957.1| hypothetical protein PMI04_02662 [Sphingobium sp. AP49]
gi|397722209|gb|EJK82754.1| hypothetical protein PMI04_02662 [Sphingobium sp. AP49]
Length = 802
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 171/533 (32%), Positives = 263/533 (49%), Gaps = 54/533 (10%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A + N YLL L+ D+L+ NFRK A L G YGGWE + + GH +GHYL+A ALM
Sbjct: 63 AVEGNRLYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWENDT--IAGHTLGHYLTALALMH 120
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA---------------- 237
A T + + + ++ L+ACQ G GY++ F + D +E
Sbjct: 121 AQTGDAECARRAAYIIDELAACQAAAGDGYVAGFTRRRDDVIEDGRLIFPEIMRGDIRSA 180
Query: 238 ---LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
L W P+Y HK+ AGL D + N++A + + Y + V K +
Sbjct: 181 GFDLNGCWVPFYNWHKLFAGLFDAEAHLGNSQARGVALALAAY----IDGVFAKLDDAQV 236
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
Q L+ E GG+N+ +L T DP+ L LA L LA + + + H+NT IP
Sbjct: 237 QQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDPLAQRQNSLPWIHANTQIP 296
Query: 355 IVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFK----------SDPKRLASNLDSNT 404
+IG +E+TG+ T +G +++ DP ++ ++ T
Sbjct: 297 KLIGLARLHEITGNAADAIAANFFWE-TVVGQYSYVIGGNADREYFPDPGTISKHITEQT 355
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSS 464
ESC +YNMLK++RHL+ W E DYYER+ N +L Q G+ Y++PL GS
Sbjct: 356 CESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQNPAT-GMFAYMVPLMSGSH 414
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ-YISSRLDWKSG 523
+ W P D FWCC G+G+ES +K G+SI++E+ + + I YI S DW +
Sbjct: 415 RV-----WSEPFDDFWCCVGSGMESHAKHGESIWWEDTDRPADMLIANLYIPSEADWAAR 469
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
+ +++ +D ++ +++ ++ T L LRIP W GA+ +NG LP P
Sbjct: 470 GAKL--RIETGYPFDGHIALSIPTLARAGRFT--LALRIPGW--CQGARVAVNGTPLPTP 523
Query: 584 S-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+ + + W + D++T+ LP+ LR EA DD A A+L+GP VLA
Sbjct: 524 RIVDGYALIDRKWKAGDQVTLDLPMALRVEATPDD----ARTIALLHGPVVLA 572
>gi|195643412|gb|ACG41174.1| hypothetical protein [Zea mays]
gi|413926261|gb|AFW66193.1| hypothetical protein ZEAMMB73_983510 [Zea mays]
Length = 262
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 140/237 (59%), Positives = 170/237 (71%), Gaps = 11/237 (4%)
Query: 23 AAQAKECTNAYPELASHTFRS--NLLSSKNESYIKQI-----HSHNDHLTPSDDSAWLSL 75
A+ K CTNA+P L SHT R+ L + ++ I H HLTP+D+S W+SL
Sbjct: 27 GAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQHLTPTDESTWMSL 86
Query: 76 MPRKILREEEQDELFSWAMLYRKIKNPGQFKVPE-RSGEFLKEVSLHDVRLGSDSMHWRA 134
MPR+ LR EE F W MLYR+++ G P +G FL E SLHDVRL SM+WRA
Sbjct: 87 MPRRALRREEA---FDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDVRLEPGSMYWRA 143
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWA 194
QQTNLEYLL+LDVD+LVW+FRK A L APG PYGGWE P +LRGHFVGHYLSA+A MWA
Sbjct: 144 QQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVGHYLSATAKMWA 203
Query: 195 STHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKI 251
STHN++L KMS+VV AL CQK++G+GYLSAFP++ FD LEA+ VWAPYYTIHK+
Sbjct: 204 STHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWAPYYTIHKV 260
>gi|374992736|ref|YP_004968231.1| hypothetical protein SBI_09982 [Streptomyces bingchenggensis BCW-1]
gi|297163388|gb|ADI13100.1| hypothetical protein SBI_09982 [Streptomyces bingchenggensis BCW-1]
Length = 733
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 179/525 (34%), Positives = 270/525 (51%), Gaps = 48/525 (9%)
Query: 140 EYLLMLDVDKLVWNFRKTARLPAPGEP-YGGWEEPSCELRGHFVGHYLSASALMWASTHN 198
YL +D D+L++NFR RLP G GGW+ P+ R H GH+L+A A ++A T +
Sbjct: 27 NYLRFVDADRLLYNFRANHRLPTNGAASNGGWDGPTFPFRTHVQGHFLTAWAQVYAVTGD 86
Query: 199 ESLKEKMSAVVSALSACQKEIGS-----GYLSAFPTEQFDRLEA--LIPVWAPYYTIHKI 251
+ ++K + +V+ L+ CQ G+ GYLS FP F LEA L PYY IHKI
Sbjct: 87 TTCRDKAAYMVAELAKCQANNGAAGFNGGYLSGFPESDFSALEAGTLSNGNVPYYVIHKI 146
Query: 252 LAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYK 311
LAGLLD + + + +A M + + R + S ++ TL E GGMN VL
Sbjct: 147 LAGLLDVWRHMGSTQARDMLLSLAGWVDWRT----GRLSGQQMQSTLGTEFGGMNAVLSD 202
Query: 312 LFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLH 371
L+ T D + L A FD LA D ++G H+NT +P IG+ Y+ TG +
Sbjct: 203 LYLQTSDSRWLTTAQRFDHGAVFDPLASNQDRLNGLHANTQVPKWIGAAREYKATGTTRY 262
Query: 372 KE-----------GHQLESSGTNIG-HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRH 419
++ H G + HF P +A+ L+ + ESC TYNML ++R
Sbjct: 263 RDIATNAWNICVNAHTYVIGGNSQAEHF---RPPNAIAAYLNQDACESCNTYNMLTLTRE 319
Query: 420 LFRWTKE-IAYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----ERSYHHWG 473
LF + +A DYYER+ N ++G Q + G + Y PL PG + W
Sbjct: 320 LFTLDPDRVALFDYYERAWLNQMIGQQNPADNHGHVTYFTPLNPGGRRGVGPAWGGGTWS 379
Query: 474 TPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDP 533
T DSFWCC GTG+E +KL DS+YF + + + ++ S L+W I V Q
Sbjct: 380 TDYDSFWCCQGTGLEMHTKLMDSVYFSSDTT---LIVNLFVPSVLNWSQRGITVTQTTSY 436
Query: 534 VVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVT 592
VS L+VT S T ++ +RIP+WT+ GA ++NG + +PG++ ++T
Sbjct: 437 PVSDTTTLQVTGNLSG-----TWAMRIRIPSWTA--GATISVNGTTQNITTTPGSYATLT 489
Query: 593 KTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
++W+S D +T++LP+ + I + A++ A+ YGP VL+G+
Sbjct: 490 RSWTSGDTVTVRLPMRI----IMRAANDNANVAAVTYGPVVLSGN 530
>gi|418517157|ref|ZP_13083324.1| hypothetical protein MOU_10158 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410706214|gb|EKQ64677.1| hypothetical protein MOU_10158 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 791
Score = 262 bits (669), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 195/610 (31%), Positives = 285/610 (46%), Gaps = 70/610 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVGL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGIFAALDAAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RHL++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAKLFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+ ++G+
Sbjct: 400 QQHPRTGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYW-QDGQ 453
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVY+ Y+ S + +G + P LR+ ++ +L LR+P
Sbjct: 454 --GVYVNLYVPSTVRDAAGLNMTLHSALPEQG-SASLRIDGAPPAQ-----RTLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
WT LNGQ + + +L +T+ W D L++ + LR E+ DD P + S
Sbjct: 506 WTQQ--PHLQLNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ--LITFTQEYGNTKFVLTN 682
+L GP VLA D+ ++A W PA Q L G FV T+
Sbjct: 562 --VLRGPLVLA------VDLGDAAKP---WSGKTPALIGGQEVLQRLQPAPGKPAFVYTD 610
Query: 683 SNQSITMEKF 692
Q F
Sbjct: 611 GAQQWQFSPF 620
>gi|294624781|ref|ZP_06703443.1| secreted protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB
11122]
gi|292600913|gb|EFF44988.1| secreted protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB
11122]
Length = 791
Score = 261 bits (668), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 184/567 (32%), Positives = 274/567 (48%), Gaps = 61/567 (10%)
Query: 99 IKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTA 158
++ P Q + G F + V L VRL + S+ A TN YL+ L+ D+L+ NF A
Sbjct: 35 LRFPAQASAAQ-PGSF-RAVPLAQVRL-TPSLFLDALHTNRRYLMRLEPDRLLHNFVLYA 91
Query: 159 RLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE 218
L YGGWE + + GH +GHYLSA ALM A T + + + +V+ L+ CQ
Sbjct: 92 GLDPKAPAYGGWEADT--IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVAELARCQAH 149
Query: 219 IGSGYLSAFPTEQ-----------FDRLEA---------LIPVWAPYYTIHKILAGLLDQ 258
G GY++ F + FD L L WAP YT HK+ AGLLD
Sbjct: 150 AGDGYVAGFTRKNAAGKIESGRAVFDELRRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDV 209
Query: 259 YTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQD 318
+ + DNA+AL++ + Y +Q + + + L+ E GG+N+ +L T D
Sbjct: 210 HAHCDNAQALQVAVSLAGY----LQGIFAALDDAQLQKVLSCEFGGLNESFVELHVRTGD 265
Query: 319 PKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG---- 374
+ L LA L L Q D++ HSNT+IP +IG YEVTGD
Sbjct: 266 AQWLALAQRLHHHAVLDPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFF 325
Query: 375 ------HQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIA 428
H G N G + P ++ + T E C +YNMLK++RHL++W +
Sbjct: 326 WHTVTDHHTYVIGGN-GDREYFQQPDSISKFVTEQTCEHCASYNMLKLTRHLYQWGPQAE 384
Query: 429 YADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIE 488
+ DYYER+L N VL Q+ G+ Y+ P+ G ++ W +P D FWCC G+G+E
Sbjct: 385 FFDYYERTLLNHVLA-QQHPRTGMFTYMTPMLAGEARA-----WSSPFDDFWCCVGSGME 438
Query: 489 SFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFS 548
+ ++ GDSIY+++ GVY+ Y+ S + +G + + P LR+ + +
Sbjct: 439 AHAQFGDSIYWQDG---QGVYVNLYVPSSVRDAAGLDMTLRSTMPEQG-SASLRIDVAPA 494
Query: 549 SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLT 608
+ L LR+P W S + LNGQ + +L + + W + D LT+ +
Sbjct: 495 EQ-----RMLALRLPGWAQS--PRLQLNGQPVDTTVNEGYLRIARFWRAGDTLTLSFEMP 547
Query: 609 LRTEAIQDDRPEYASIQAILYGPYVLA 635
LR EA DD P + S +L GP VLA
Sbjct: 548 LRLEATTDD-PAWVS---VLRGPLVLA 570
>gi|346725400|ref|YP_004852069.1| hypothetical protein XACM_2511 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346650147|gb|AEO42771.1| hypothetical protein XACM_2511 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 791
Score = 261 bits (668), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 194/610 (31%), Positives = 285/610 (46%), Gaps = 70/610 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +V L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRKDAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAMGL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTDDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D+++ HSNT+IP +IG YEVTG+ H G N
Sbjct: 282 DPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGNAASGAAARFFWHTVTDHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RHL++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+ ++G+
Sbjct: 400 QQHPRSGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYW-QDGQ 453
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVY+ Y+ S + +G + P LR+ + + +L LR+P
Sbjct: 454 --GVYVNLYVPSMVHDAAGLDMTLHSALPEQG-SASLRIDAAPAEQ-----RTLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W + LNGQ + +L +T+TW D L++ + LR EA DD P + S
Sbjct: 506 WAKQ--PRLQLNGQPVDSTVSDGYLRITRTWQRGDTLSLAFDMPLRLEATPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ--LITFTQEYGNTKFVLTN 682
+L GP VLA +GD + W PA Q L G T FV +
Sbjct: 562 --VLRGPLVLA-VDLGD--------ASKPWSGKTPALIGGQDILQRLQPVPGKTAFVYND 610
Query: 683 SNQSITMEKF 692
Q + F
Sbjct: 611 GVQQWQLSPF 620
>gi|325915124|ref|ZP_08177450.1| hypothetical protein XVE_1336 [Xanthomonas vesicatoria ATCC 35937]
gi|325538646|gb|EGD10316.1| hypothetical protein XVE_1336 [Xanthomonas vesicatoria ATCC 35937]
Length = 791
Score = 261 bits (668), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 190/601 (31%), Positives = 287/601 (47%), Gaps = 66/601 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 MRAVPLAQVRL-TPSLFLDALNTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCATRAAYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + NA+AL++ +
Sbjct: 166 QIESGRAVFDELKKGKIDSAPFYLNGSWAPLYTWHKLFAGLLDVHAHCGNAQALQVAVGL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + Q L+ E GG+N+ +L T D + L LA +
Sbjct: 226 AGY----LQGIFAALNDAQLQQVLSCEFGGLNESFVELHVQTDDAQWLALAQRLHHHAVI 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWQTVTDHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RHL++W + + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAVHFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+E+
Sbjct: 400 QQHPRTGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWEDG-- 452
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GV++ Y+ S + +G + + P VTL + + T L LR+P
Sbjct: 453 -QGVFVNLYVPSTVRDAAGFALSLRSTLPERG-----EVTLQIDAAPAAART-LALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W + + +NGQ L +L + + W++ D +++QL + LR E DD P +
Sbjct: 506 WAGAFTLQ--VNGQLQTLQPVDGYLRIERVWAAGDTVSLQLGMPLRLEPTSDD-PAWV-- 560
Query: 625 QAILYGPYVLA---GHSIGDWDITESATSLSDWI----TPIPASYNSQLITFTQEYGNTK 677
++ GP VLA G + WD T D + P+PA + Q Q++ +
Sbjct: 561 -VVMRGPLVLAADLGDAATPWDNTTPVLIGGDEVLQRLQPLPAHGHYQYSDGAQQWRLSP 619
Query: 678 F 678
F
Sbjct: 620 F 620
>gi|375308065|ref|ZP_09773352.1| hypothetical protein WG8_1877 [Paenibacillus sp. Aloe-11]
gi|375080396|gb|EHS58617.1| hypothetical protein WG8_1877 [Paenibacillus sp. Aloe-11]
Length = 759
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 175/562 (31%), Positives = 287/562 (51%), Gaps = 56/562 (9%)
Query: 98 KIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKT 157
K++N + K P+ G +S V L S+ AQ L++LL ++ D++++NFRK
Sbjct: 174 KVENKSK-KAPQLHG-----ISTQKVHLEGPSLLKSAQNRRLQFLLTVNDDQMLYNFRKA 227
Query: 158 ARLPAPGEPYG-GWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQ 216
A L P GW+ L+GH GHYLSA AL +AST NE + +K++ +V L+ Q
Sbjct: 228 ASLDTLNAPAMIGWDSDESLLKGHTTGHYLSALALCYASTGNERIHQKLAYLVDELNKVQ 287
Query: 217 KEIGS------GYLSAFPTEQFDRLEALI---PVWAPYYTIHKILAGLLDQYTYADNAEA 267
+ G+LSA+ EQFD LE +WAPYYT+HKILAGLLD Y A A
Sbjct: 288 LAFEADDRYHYGFLSAYSEEQFDLLEVYTRYPEIWAPYYTLHKILAGLLDSYHIAGIELA 347
Query: 268 LRMTTWMVEYFYNRVQNVIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAH 326
L + + ++ YNR+ +V+ +++ W + E GG+N+ L +LF TQ H+ A
Sbjct: 348 LAIADKVGDWIYNRL-SVLPHEQLKKMWGLYIAGEFGGINESLAELFTYTQKEHHIAAAK 406
Query: 327 LFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GH 375
LFD + Q D + H+N HIP ++G+ +E TG+Q + + H
Sbjct: 407 LFDNDRLFFPMEQQVDALGAMHANQHIPQIVGAFKIFEATGEQKYYDIAKFFWESVVNAH 466
Query: 376 QLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
GT G FK P ++ ++L +T E+C +YN+LK+++ L+ + + Y DYYER
Sbjct: 467 IYSIGGTGEGEM-FKQ-PHKIGTHLTEHTAETCASYNLLKLTKQLYVYENDAKYMDYYER 524
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
++ N +L G Y +P +PG K G ++ CC+GTG+E+ K +
Sbjct: 525 TMLNHILSSTDHECLGASTYFMPTSPGGQK-------GYDEEN-SCCHGTGLENHFKYAE 576
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRV-TLTFSSKGSGL 554
+I+FE+ +Y+ ++ + L+ + + V Q V + + + + + TLT
Sbjct: 577 AIFFED---VDSLYVNLFVPAALNDEGKGLQVVQSVPEIFNGEVEIHIETLT-------- 625
Query: 555 TTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
T+L +RIP W +N + +L +++ W+ D++T++ LR E
Sbjct: 626 RTNLRVRIPYWHQGE-ITTFVNHTKVNTIEENGYLVLSQEWNKGDQVTMKFTPRLRLE-- 682
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
P+ A I ++ +GPY+LA
Sbjct: 683 --HTPDKADIASLAFGPYILAA 702
>gi|445497812|ref|ZP_21464667.1| hypothetical protein Jab_2c14210 [Janthinobacterium sp. HH01]
gi|444787807|gb|ELX09355.1| hypothetical protein Jab_2c14210 [Janthinobacterium sp. HH01]
Length = 789
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 180/546 (32%), Positives = 274/546 (50%), Gaps = 49/546 (8%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
L+ L DVRLG DS AQ+T+L YLL ++ D+L+ F + A LP YG WE S
Sbjct: 29 LQLFPLADVRLG-DSPFLEAQRTDLHYLLEMEPDRLLAPFLREAGLPPKQPSYGNWE--S 85
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT----- 229
L GH GHYLSA ALM+AST +E + +++ V+ L CQ+ G+GY+ P
Sbjct: 86 TGLDGHLGGHYLSALALMYASTGDEEVLRRLNYFVAELKRCQERNGNGYIGGIPDGSAAW 145
Query: 230 EQFDRLEALIP------VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
+ R E + W P+Y +HK+ AGL D Y YA NA+A M M ++
Sbjct: 146 QAIARGELHVDNFSVNGKWVPWYNLHKVYAGLRDAYAYAGNADARAMLVSMSDW----AL 201
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ S E+ L E GGMN+VL + +T K++ LA F L L D
Sbjct: 202 ELTSHLSEEQMQAMLRSEHGGMNEVLADVAQMTGQKKYMDLAVRFSHQAILRPLEEGKDQ 261
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNIGHFNFKSDP 393
++G H+NT IP VIG + ++TG + ++ H+ + G N +F D
Sbjct: 262 LTGLHANTQIPKVIGFKHIGDMTGRRDWQQAAQFFWQTVRDHRTVAIGGNSVKEHFHDDR 321
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
L + E+C TYNMLK++ LF + +Y DYYER+L N +L QR + G
Sbjct: 322 DFLPMVDEVEGPETCNTYNMLKLTELLFLGDAKGSYTDYYERALYNHILSSQR-PDSGGF 380
Query: 454 IYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
+Y P+ P Y + + WCC G+GIES +K G+ IY + +Y+ +
Sbjct: 381 VYFTPMRP-----NHYRVYSQVDKAMWCCVGSGIESHAKYGEFIYAHRGDQ---LYVNLF 432
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
I S L+W+S + + Q + R T+T +GS T + +R P W + +
Sbjct: 433 IPSTLNWRSQGVTITQ----ANRFPDEDRSTITV--QGSKAFT-MKIRYPEWVARGALRI 485
Query: 574 TLNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
T+NG+ +P + + ++S+ + W DK+ IQLP+ E + P+ ++ A+L+GP
Sbjct: 486 TVNGKPVPADAGADRYVSLRRIWRDGDKVDIQLPMKTHLEQM----PDKSNYYAVLHGPI 541
Query: 633 VLAGHS 638
VLA +
Sbjct: 542 VLAAKT 547
>gi|386847956|ref|YP_006265969.1| Glucan endo-1,3-beta-glucosidase [Actinoplanes sp. SE50/110]
gi|359835460|gb|AEV83901.1| Glucan endo-1,3-beta-glucosidase [Actinoplanes sp. SE50/110]
Length = 765
Score = 261 bits (666), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 199/611 (32%), Positives = 293/611 (47%), Gaps = 78/611 (12%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKT-ARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
Q L YL +D D+L++NFR R GGW+ P R H GH+L+A A W
Sbjct: 65 QTRTLNYLRFVDADRLLYNFRANHGRSTGGAAANGGWDAPDFPFRTHVQGHFLTAWAQAW 124
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA--LIPVWAPYYTIHKI 251
A+ + + +++ + +V+ L+ CQ +GYLS FP F LEA L PYY +HK
Sbjct: 125 AALGDTTCRDRANYMVAELAKCQAA--NGYLSGFPESDFTALEAGTLSNGNVPYYCVHKT 182
Query: 252 LAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMND 307
LAGLLD + +A LR+ W V + + + L E GGMN+
Sbjct: 183 LAGLLDVWRLIGGTQARDVLLRLAGW--------VDTRTARLTTSQMQAMLGTEFGGMNE 234
Query: 308 VLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTG 367
VL ++ T D + L A FD LA AD ++G H+NT +P +G+ Y+ TG
Sbjct: 235 VLADIYQQTGDGRWLATAQRFDHAAVFTPLAAGADQLNGLHANTQVPKWVGAVREYKATG 294
Query: 368 DQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVS 417
+++ G + G N +F++ P +A L ++T E C +YNMLK++
Sbjct: 295 TTRYRDIGLNAWNITTGAHTYAIGGNSQAEHFRA-PNAIAGYLTNDTCEHCNSYNMLKLT 353
Query: 418 RHLFRWTKE---IAYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----ERSY 469
R L W + AY D+YER+L N ++G Q + G + Y PL PG +
Sbjct: 354 REL--WLTDPDRAAYFDFYERALLNHLIGAQNPADSHGHITYFTPLRPGGRRGVGPAWGG 411
Query: 470 HHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ--YISSRLDWKSGQIVV 527
W T SFWCC GTG+E+ +KL +SIYF + G + + S L W I V
Sbjct: 412 GTWSTDYASFWCCQGTGVETNTKLMESIYF-----FSGTTLTVNLFTPSVLSWAERGITV 466
Query: 528 NQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPG 586
Q VS TLT S SG T S+ +RIP WT+ GA +NG + +PG
Sbjct: 467 TQATAYPVS----DTTTLTVSGTPSG-TWSIRVRIPGWTT--GATLAVNGVAQGVGATPG 519
Query: 587 NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE 646
+ +VT+ W++ D LT++LP+ + + D+ ++QAI YGP VL G+ G
Sbjct: 520 GYATVTRAWAAGDVLTVRLPMRVIMQPAADN----PAVQAITYGPVVLCGNYGG------ 569
Query: 647 SATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKS-GTDAALHATF 705
T+LS S N I T G+ F T + ++++ FP + G D A++
Sbjct: 570 --TTLS-----AHPSLNVSSIARTGS-GSLAFTATANGATVSLGPFPDAQGFDYAVY--- 618
Query: 706 RLILNDSSGSE 716
N SG E
Sbjct: 619 ---WNTGSGGE 626
>gi|298246853|ref|ZP_06970658.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
gi|297549512|gb|EFH83378.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
Length = 600
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 171/531 (32%), Positives = 274/531 (51%), Gaps = 52/531 (9%)
Query: 136 QTNLEYLLMLDVDKLVWNFRKTARL----PAPGEPYGGWEEPSCELRGHFVGHYLSASAL 191
+ N Y+L L L+ N A L P + + GWE P+C+LRGHF+GH+LSA+A
Sbjct: 25 ELNRAYMLSLKSTNLLQNHYGEAGLWNPPQQPTDCHRGWESPTCQLRGHFLGHWLSAAAR 84
Query: 192 MWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKI 251
+ AST + +K K +V+ L+ CQ+E+ ++ + P + D + VWAP+YT+HK
Sbjct: 85 LVASTGDTEIKGKADFIVAELARCQQEMEGEWIGSIPEKYLDWIARGKRVWAPHYTLHKT 144
Query: 252 LAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYK 311
L GL D Y N +AL + ++F+ ++S E+ L+ E GGM +V
Sbjct: 145 LMGLYDMYEIGQNEQALDILIHWADWFHRWT----GQFSREQMDDILDVETGGMLEVWAN 200
Query: 312 LFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLH 371
L+ +T +HL L +D+ L D ++ H+NT IP V G+ +EVTG+Q
Sbjct: 201 LYGVTNRQEHLDLIRRYDRSRLFDRLLAGEDVLTYMHANTTIPEVHGAARAWEVTGEQRW 260
Query: 372 KEGHQ--LESSGTNIGHF--------NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLF 421
++ + + T+ G+F P +L L +E CT YN+++++ +LF
Sbjct: 261 RDIVEAYWRLAVTDRGYFCTGGQTSDEVWCPPHQLGGQLGPENQEHCTVYNLMRLANYLF 320
Query: 422 RWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWC 481
RWT ++ YADYYER+ NG+L Q+ + G++ Y LPL G +K WGTP++ FWC
Sbjct: 321 RWTGDVVYADYYERNFYNGILA-QQNAQTGMVAYYLPLETGGTKV-----WGTPTNDFWC 374
Query: 482 CYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK--SGQIVVN----------- 528
C+GT +++ + IYF + G+ + QYI SRL W +++V
Sbjct: 375 CHGTLVQAQASHTRDIYFTND---EGLVVSQYIPSRLQWHHDGSEVIVTLESKAHNVYAL 431
Query: 529 --QKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP-SP 585
+ P + P TL+ + + T L LR+P W + T+NG+ +P +P
Sbjct: 432 KAPREQPRQTSHP--EYTLSVNCEQPTEYT-LTLRLPWWLADE-PMITINGERQRVPHTP 487
Query: 586 GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
++ + +TW +DKLTI LP L+ + P + + A + GP VLAG
Sbjct: 488 SSYYHIRRTW-HNDKLTILLPKALQIVPL----PGASDMMAFMDGPIVLAG 533
>gi|407790778|ref|ZP_11137869.1| Acetyl-CoA carboxylase, biotin carboxylase [Gallaecimonas
xiamenensis 3-C-1]
gi|407202325|gb|EKE72317.1| Acetyl-CoA carboxylase, biotin carboxylase [Gallaecimonas
xiamenensis 3-C-1]
Length = 780
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 184/555 (33%), Positives = 276/555 (49%), Gaps = 67/555 (12%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
L+ + L +VRL +AQ TN YL LD D+L+ FR A LP P YG WE +
Sbjct: 20 LETLPLQEVRLLPSPFK-QAQDTNRHYLDSLDPDRLLAPFRAEAGLPQPKPGYGNWE--A 76
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP------ 228
L GH GHYLSA +LM+AST + +L ++ ++ L CQ ++G+GY+ P
Sbjct: 77 DGLGGHMGGHYLSALSLMYASTGDPALLARLQYMLDELKKCQDKLGTGYIGGVPGGSALW 136
Query: 229 --TEQFD---RLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRM-------TTWMVE 276
Q D L L W P+Y +HK+ AGL D Y Y +A+AL M T W+VE
Sbjct: 137 QQIHQGDIQADLFTLNQKWVPWYNLHKLYAGLRDAYRYTGSAQALAMWIKLSDWTDWLVE 196
Query: 277 YFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGL 336
S E+ L E GGMN+V L+ IT K+L LA F + L
Sbjct: 197 GL-----------SDEQMQAMLVTEYGGMNEVFADLYEITGQDKYLQLAKRFSQQQLLQP 245
Query: 337 LALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----HQ-LESSGTNIG----- 385
LA D ++G H+NT IP VIG + +V+GD+ HQ +E IG
Sbjct: 246 LAHGQDQLNGLHANTQIPKVIGFERIAQVSGDRAMGAAADYFWHQVVEQRTVAIGGNSVR 305
Query: 386 -HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
HF+ K D + ++ E+C +YNMLK++R L++ + Y YYER+L N +L
Sbjct: 306 EHFHPKDDFSSMVEEVEG--PETCNSYNMLKLARLLYQRQGGLDYLAYYERALYNHILAS 363
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q + G ++Y P+ P Y + + WCC G+GIES SK G IY ++
Sbjct: 364 QH-PDDGGLVYFTPMRP-----NHYRVYSQADKAMWCCVGSGIESHSKYGAMIYATDQS- 416
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
+YI +I SRLDW + ++ +D D + +T +S + L +R P+
Sbjct: 417 --ALYINLFIPSRLDWTEKGVKLS--LDTRFPDDDSVFITFEQAS-----SLPLKIRYPS 467
Query: 565 WTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYAS 623
W + + +NG + + PG +LS+ W D+++++LP+ L E + P+ ++
Sbjct: 468 WVKAGQLELRVNGTPRAVTAKPGQYLSLAGQWQKGDQISLKLPMALSLEQM----PDQSN 523
Query: 624 IQAILYGPYVLAGHS 638
A+L+GP VLA +
Sbjct: 524 YYAVLFGPIVLAAKT 538
>gi|373958137|ref|ZP_09618097.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373894737|gb|EHQ30634.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 789
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 184/542 (33%), Positives = 283/542 (52%), Gaps = 55/542 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L DVRL +S +A + + YLL ++ D+L+ FR + L G+ YGGWE S L G
Sbjct: 52 LQDVRL-LESPFKQAMEKDAAYLLSVEPDRLLSGFRSHSGLTPKGKMYGGWE--SSGLAG 108
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALI 239
H +GHYLSA ++ +AS+ N E+++ +V L CQ +GY+ A P E D + A I
Sbjct: 109 HTLGHYLSAISMQYASSRNPQFLERVNYIVKELKECQVARKTGYIGAIPKE--DTIWAEI 166
Query: 240 PV-------------WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVI 286
W+P+YT+HK++AGLLD Y Y +NAEAL + M ++ +QN+
Sbjct: 167 KKGDIRSRGFDLNGGWSPWYTVHKVMAGLLDAYLYCNNAEALNICKGMGDWTGELLQNL- 225
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
+ E+ L E GGM + L L+ IT + +L ++ F L L+ D + G
Sbjct: 226 ---NDEQIQSMLLCEYGGMAETLVNLYAITGNKAYLATSYKFYDKRILNPLSENKDILPG 282
Query: 347 FHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKR 395
HSNT IP VI S RYE+TG++ ++ H + G + ++ + S+P +
Sbjct: 283 KHSNTQIPKVIASARRYELTGEKKDEDISVNFWNIITKDHSYATGGNS--NYEYLSEPDK 340
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
L L NT E+C TYNMLK++RHLF A DYYE++L N +L Q + G+M Y
Sbjct: 341 LNDKLTENTTETCNTYNMLKLTRHLFSVNPSAALMDYYEKALYNHILASQNHDD-GMMCY 399
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
+PL G KE S +P D+F CC G+G+E+ K +SIY+ G +Y+ +I
Sbjct: 400 FVPLRMGGKKEYS-----SPFDTFTCCVGSGMENHVKYNESIYY--RGNDGSLYVNLFIP 452
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S L WK I + Q+ + P VT + + +L +R P W + K +
Sbjct: 453 SVLTWKEKGITLTQQNN-----FPASDVTTFVINSTKPVNFALKIRKPKWAGNCLIK--V 505
Query: 576 NGQ-DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
NG+ + + +L + + W ++DK+ P ++ TEAI P+ + +A+ YGP +L
Sbjct: 506 NGKAGITTTNEQGYLVINRLWKNNDKIEFVTPESIYTEAI----PDNINRKALFYGPVLL 561
Query: 635 AG 636
AG
Sbjct: 562 AG 563
>gi|346970201|gb|EGY13653.1| secreted protein [Verticillium dahliae VdLs.17]
Length = 634
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 178/533 (33%), Positives = 268/533 (50%), Gaps = 55/533 (10%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEPSCELRGHFVGHYLSASALMW 193
Q L Y+ +DVD+L++ FR+T LP G +P GGW+ P R HF GH+L+A + W
Sbjct: 65 QDRTLSYIKFVDVDRLLYVFRQTHGLPLQGAQPNGGWDAPDFPFRSHFQGHFLNAWSYCW 124
Query: 194 ASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLE--ALIPVWAPYY 246
A +E +++ S + L+ CQ GYLS FP + + LE L PYY
Sbjct: 125 AVLRDEECRDRASYFATELAKCQANNEQAGFNPGYLSGFPESEIEALEKRTLSNGNVPYY 184
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN 306
+IHK +AGLLD + + + A + M + R K S + ++ E GGMN
Sbjct: 185 SIHKTMAGLLDVWRHIGDETARDVLLGMAGWVDLRT----GKLSYSQMQTMMSTEFGGMN 240
Query: 307 DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVT 366
+V+ +F T D + L +A FD LA D ++G H+NT +P IG+ Y+ T
Sbjct: 241 EVMADIFHQTGDERWLTVAQRFDHASVFDPLAGNRDSLNGLHANTQVPKWIGAAREYKAT 300
Query: 367 GDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLK 415
G + + H + G N +F+ P +AS LD +T E+C TYNMLK
Sbjct: 301 GTTRYSDIARNAWNITVQAHTY-AIGANSQSEHFRP-PNAIASYLDEDTAEACNTYNMLK 358
Query: 416 VSRHLFRWTKEIA---YADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----ER 467
++R L W + + Y D+YE++L N +G Q + G + Y L PG +
Sbjct: 359 LTREL--WVMDPSNSKYFDFYEQALINHAIGQQDPSSAHGHVTYFTSLNPGGHRGVGPAW 416
Query: 468 SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVV 527
W T + WCC GT +E+ +KL DSIYF +E +Y+ Y S+L+W ++ V
Sbjct: 417 GGGTWSTDYGTAWCCQGTALETNTKLMDSIYFYDESS---LYVNLYAPSKLNWTQRKVTV 473
Query: 528 NQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP--LPS 584
Q+ + P L+ T T + KG G L +RIP W S GA +NGQ L +
Sbjct: 474 LQETEFP-------LQDTSTLTVKGGG-DWDLRVRIPMW--SKGATIAINGQALDGVEAA 523
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
PG + ++ ++W +D +TI LP+ L T + D+ S+ A+ YGP VLA +
Sbjct: 524 PGTYATIKRSWGEEDIVTITLPMALHTISANDE----PSVAALAYGPVVLAAN 572
>gi|294667526|ref|ZP_06732741.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292602646|gb|EFF46082.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 791
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 194/608 (31%), Positives = 283/608 (46%), Gaps = 66/608 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A QTN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-TPSLFLDALQTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + +NA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCENAQALQVAVAL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q V + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGVFAALDDAQLQKALSCEFGGLNESFVELHVQTGDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D ++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLIAQRDALAHQHSNTNIPKLIGLAREYEVTGDPASGAAARFFWHTVTDHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RHL++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+++
Sbjct: 400 QQHPRTGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWQDG-- 452
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVYI Y+ S + +G + P LR+ ++ L LR+P
Sbjct: 453 -QGVYINLYVPSTVRDAAGLNMTLHSALPEQG-SASLRIDGAPPAQ-----RMLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W + LNGQ + + +L +T+ W D L + + LR EA DD P + S
Sbjct: 506 WAQQ--PRLRLNGQPVDGSASDGYLRLTRVWQPGDTLQLSFDMPLRLEATPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSN 684
+L+GP VLA D+ ++A S TP L G T F ++
Sbjct: 562 --VLHGPLVLA------VDLGDAAKPWSG-KTPTLIGGQDILQRLQPVPGKTAFTYSDGA 612
Query: 685 QSITMEKF 692
Q + F
Sbjct: 613 QQWQLSPF 620
>gi|390993493|ref|ZP_10263643.1| TAT (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|372551771|emb|CCF70618.1| TAT (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas axonopodis pv. punicae str. LMG
859]
Length = 791
Score = 259 bits (663), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 195/610 (31%), Positives = 284/610 (46%), Gaps = 70/610 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDTQCRTRTRYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVAL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q V + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGVFAALEDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RHL++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+ ++G+
Sbjct: 400 QQHPRTGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYW-QDGQ 453
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVY+ Y+ S + +G + P LR+ ++ +L LR+P
Sbjct: 454 --GVYVNLYVPSTVRDAAGLNMTLHSALPEQG-SASLRIDGAPPAQ-----RTLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W LNGQ + + +L +T+ W D L++ + LR E+ DD P + S
Sbjct: 506 WAQQ--PHLQLNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ--LITFTQEYGNTKFVLTN 682
+L GP VLA D+ ++A W PA Q L G FV T+
Sbjct: 562 --VLRGPLVLA------VDLGDAAKP---WSGKTPALIGGQEVLQRLQPAPGKPAFVYTD 610
Query: 683 SNQSITMEKF 692
Q F
Sbjct: 611 GAQQWQFSPF 620
>gi|418520534|ref|ZP_13086583.1| hypothetical protein WS7_05828 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410703915|gb|EKQ62403.1| hypothetical protein WS7_05828 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 791
Score = 259 bits (663), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 194/609 (31%), Positives = 285/609 (46%), Gaps = 68/609 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDTQCRTRTRYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPSPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVAL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q V + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGVFAALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----HQLESSGTNI----G 385
L Q D++ HSNT+IP +IG YEVTGD H + T + G
Sbjct: 282 DPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVIGGNG 341
Query: 386 HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ 445
+ P ++ L T E C +YNMLK++RHL++W + DYYER+L N V+ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 446 RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+ ++G+
Sbjct: 401 QHPRTGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYW-QDGQ- 453
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
GVY+ Y+ S + +G + P LR+ ++ +L LR+P W
Sbjct: 454 -GVYVNLYVPSTVRDAAGLNMTLHSALPKQG-SASLRIDGAPPAQ-----RTLALRVPGW 506
Query: 566 TSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
LNGQ + + +L +T+ W D L++ + LR E+ DD P + S
Sbjct: 507 AQQ--PHLQLNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PAWVS-- 561
Query: 626 AILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ--LITFTQEYGNTKFVLTNS 683
+L GP VLA D+ ++A W PA Q L G FV T+
Sbjct: 562 -VLRGPLVLA------VDLGDAAKP---WSGKTPALIGGQEVLQRLQPAPGKPAFVYTDG 611
Query: 684 NQSITMEKF 692
Q F
Sbjct: 612 AQQWQFSPF 620
>gi|238059692|ref|ZP_04604401.1| secreted protein [Micromonospora sp. ATCC 39149]
gi|237881503|gb|EEP70331.1| secreted protein [Micromonospora sp. ATCC 39149]
Length = 740
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 182/529 (34%), Positives = 265/529 (50%), Gaps = 46/529 (8%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEP-YGGWEEPSCELRGHFVGHYLSASALMW 193
Q L YL +DVD+L++NFR RL G GGW+ PS R H GH+L+A A +
Sbjct: 32 QNRTLSYLRFVDVDRLLYNFRANHRLSTNGAASNGGWDAPSFPFRTHVQGHFLTAWAQAY 91
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGS-----GYLSAFPTEQFDRLEA--LIPVWAPYY 246
A + + ++K + +V+ L+ CQ G+ GYLS FP F LEA L PYY
Sbjct: 92 AVLGDTTCRDKANYMVAELAKCQANNGAAGFTAGYLSGFPESDFTALEARTLSNGNVPYY 151
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN 306
IHK L GLLD + Y N +A + + + R + S + L E GGMN
Sbjct: 152 CIHKTLLGLLDVWRYIGNTQARSVLLALAGWVDTRT----ARLSSSQMQAMLGTEFGGMN 207
Query: 307 DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVT 366
+ L L+ T D + L +A FD LA +D ++G H+NT +P IG+ Y+ T
Sbjct: 208 EALADLYQQTGDGRWLTVAQRFDHAAVFNPLAANSDQLNGLHANTQVPKWIGAAREYKAT 267
Query: 367 GDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKV 416
G +++ + G N +F++ P +A L ++T E C T NMLK+
Sbjct: 268 GTTRYRDIASNAWNMTVNAHTYAIGGNSQAEHFRA-PNAIAGYLTNDTCEHCNTVNMLKL 326
Query: 417 SRHLFRWT-KEIAYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----ERSYH 470
+R L+ + AY DY+ER+L N V+G Q + G + Y PL PG +
Sbjct: 327 TRELWLIDPNQAAYFDYFERALANHVIGAQNPADGHGHVTYFTPLKPGGRRGVGPAWGGG 386
Query: 471 HWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK 530
W T DSFWCC GTGIE ++L DSIYF + + + S L+W I V Q
Sbjct: 387 TWSTDYDSFWCCQGTGIEINTRLMDSIYFHNGTT---LTVNLFAPSTLNWSQRGITVTQS 443
Query: 531 VD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNF 588
+ PV TLT S SG + S+ +RIP W S GA +NG + +PG++
Sbjct: 444 TNYPVGD-----TTTLTLSGTMSG-SWSIRVRIPAWAS--GATIAVNGATQSVATTPGSY 495
Query: 589 LSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+VT+TW+S D +T++LP+ + + A++ A+ YGP VL G+
Sbjct: 496 ATVTRTWASGDTITVRLPM----RVVLSPANDNAAVAAVTYGPMVLCGN 540
>gi|255075873|ref|XP_002501611.1| predicted protein [Micromonas sp. RCC299]
gi|226516875|gb|ACO62869.1| predicted protein [Micromonas sp. RCC299]
Length = 1214
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 194/664 (29%), Positives = 288/664 (43%), Gaps = 156/664 (23%)
Query: 129 SMHWRAQQTNLEYL-LMLDVDKLVWNFRKTARLPA-------PGE--------------- 165
+H AQ+ N YL ++D +L+ NFR A LP P E
Sbjct: 188 GVHLDAQRLNARYLTAVVDPRRLLANFRVVAGLPPETIPDRHPTETVAPYCDVGSGLSYA 247
Query: 166 --PYGGWEEPSCELRGHFVGHYLSASALMWASTHNES----------------------- 200
P WE P CELRGHF GHYLSA A + A +
Sbjct: 248 EHPGACWEAPDCELRGHFAGHYLSALAFVAAGAGDRPNTSPDRTSSSDHLSDPEYVTGHQ 307
Query: 201 --------LKEKMSAVVSALSACQKEIG--SGYLSAFPTEQFDRLEALIPVWAPYYTIHK 250
+E + V L+ Q G +GY+SAFP E DR A+ WAPYYT+HK
Sbjct: 308 SDVATARHAREMLDRFVDGLATAQASSGTSAGYVSAFPEEVLDRQGAVGGAWAPYYTLHK 367
Query: 251 ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHW---------QTLNEE 301
I GL+D + A NA+AL + + RV +I++ HW E
Sbjct: 368 IGQGLMDAHVVAGNAKALDVLKGLANAVLTRVMGLIQQRGAS-HWFGGALEYSKAAFGAE 426
Query: 302 AGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQM 361
+GG N++ ++L+ +T + ++ LA LFD P FLG + D ++ H+N H PI +G+
Sbjct: 427 SGGFNELAWRLYQLTGNGDYVTLASLFDHPTFLGRMRAGGDGLTREHANFHEPIAMGAYS 486
Query: 362 RYEVTGD-----------QLHKEGHQLESSGTNIGHFNFKSDPKRLASNLDSN-TEESCT 409
RYE+TGD +L ++ + GT G +++ P RL + S T+E+CT
Sbjct: 487 RYEITGDTESRRAFRNFIELLRDTRSYATGGTCDGE-RWQA-PGRLERIIVSTETQETCT 544
Query: 410 TYNMLKVSRHL---FRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKE 466
N +++ F + +ADY ER+ +G +G+QR +PG ++Y PL G SK
Sbjct: 545 QVNFERLANAAVASFGEAEARDWADYSERASLHGPVGLQR--KPGELLYTTPLGVGVSKG 602
Query: 467 RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY--FEEEGKYPG-----------VYIIQY 513
RS H WG P +FWCCYGTG+E+ ++L D ++ E PG VYI +
Sbjct: 603 RSGHGWGRPDAAFWCCYGTGVEALARLQDGVFWRLEAGATVPGDDTSSTTATDVVYIARV 662
Query: 514 ISSRL-DWKSGQIVVNQKVDPVVSWDPYLR-------------------VTLTFSSKGSG 553
+S + W + VDP P R V +T ++G
Sbjct: 663 TTSAVATWDEKGVTTRVSVDPFNVGGPVQREGGRDGRRRRGTAGFFASAVAITVHAEGRN 722
Query: 554 LTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPG----------------------NFLSV 591
TS+ +++P W + G++ TLNG+ + + G + V
Sbjct: 723 EPTSIRVKLPRW-AGGGSRITLNGERVRCENGGDSSSSEDSDSDSDSDSDSDSDSGWCDV 781
Query: 592 TKTWSSDDKLTIQLPLTLRTEAI--QDDRPEY-----------ASIQAILYGPYVLAGHS 638
T+ W D L P+ +R E + D P + + AI+ GPYVLA
Sbjct: 782 TRVWRKTDLLRASFPIVVRAEPLLGSDLTPGFGTGSNQRLDGKGARHAIVAGPYVLAALG 841
Query: 639 IGDW 642
G W
Sbjct: 842 PGAW 845
>gi|255936447|ref|XP_002559250.1| Pc13g08250 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583870|emb|CAP91894.1| Pc13g08250 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 627
Score = 258 bits (660), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 180/534 (33%), Positives = 273/534 (51%), Gaps = 57/534 (10%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAP-GEPYGGWEEPSCELRGHFVGHYLSASALMW 193
Q L+YL +DVD+L++ FR T L P GGW+ P R H GH+LSA A +
Sbjct: 58 QDRTLKYLKEIDVDRLLYVFRATHGLSTQQATPNGGWDAPDFPFRSHVQGHFLSAWAQCY 117
Query: 194 ASTHNESLKEKMSAVVSALSACQ---KEIG--SGYLSAFPTEQFDRLE--ALIPVWAPYY 246
A +++ ++ + L+ CQ K +G GY+S FP +F +LE L PYY
Sbjct: 118 AVLRDQTCYDRAIYFAAELAKCQANNKAVGFTDGYVSGFPESEFAKLENDTLTNGNVPYY 177
Query: 247 TIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
+HK LAGLLD + ++ + L + +W V + +S + L E
Sbjct: 178 AVHKTLAGLLDIWRLTNDTTSRDILLSLASW--------VDKRTEPFSYAAMQKLLQTEF 229
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMR 362
GGMN+V+ ++ T D + L +A FD LA D++ G H+NT +P IG+ +
Sbjct: 230 GGMNEVMADIYHQTGDERWLTVAQRFDHAVIFDPLAANKDELDGLHANTQVPKWIGAARQ 289
Query: 363 YEVTGD-----------QLHKEGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTY 411
Y+ TG+ +++ + H G + +F++ P +A+ L ++T E+C +Y
Sbjct: 290 YKATGESRYLDIARNAWEINVKSHTYAIGGNSQAE-HFRA-PNAIAAYLTNDTCEACNSY 347
Query: 412 NMLKVSRHLFRW-TKEIAYADYYERSLTNGVLGIQRGTE-PGVMIYLLPLAPGSSK---- 465
NMLK++R L+ + AY D+YE SL N +LG Q + G + Y PL G +
Sbjct: 348 NMLKLTRELWLLDSDNSAYFDFYENSLLNHLLGQQDPHDHHGHITYFTPLNAGGRRGVGP 407
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
W T DSFWCC GT +E+ +KL DSIYF + ++I ++SS L W I
Sbjct: 408 AWGGGTWSTDYDSFWCCQGTALETNTKLMDSIYFYNDST---LFINLFMSSVLKWPEMGI 464
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP--LP 583
+ Q V L V+ GSG T +N+RIP W SS A+ TLNG+ L
Sbjct: 465 TLKQSTTYPVGDTSKLEVS------GSGAWT-MNIRIPAWASS--AELTLNGEALSDVKA 515
Query: 584 SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+PG + +++TW+ D + I+ P+TLRT A D+ +S+ AI YGP VL G+
Sbjct: 516 APGKYAQISRTWADGDVIEIRFPMTLRTVAANDN----SSMVAIAYGPTVLCGN 565
>gi|58582735|ref|YP_201751.1| hypothetical protein XOO3112 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|188577523|ref|YP_001914452.1| hypothetical protein PXO_01470 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|58427329|gb|AAW76366.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|188521975|gb|ACD59920.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
PXO99A]
Length = 783
Score = 258 bits (660), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 193/610 (31%), Positives = 283/610 (46%), Gaps = 70/610 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 41 VRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 99
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +VS L+ CQ G GY++ F +
Sbjct: 100 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRKNAAG 157
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DN +AL++ +
Sbjct: 158 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQVAVGL 217
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T D + L LA L
Sbjct: 218 AGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 273
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 274 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGN- 332
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++ H+++W + DYYER+L N V+
Sbjct: 333 GDREYFQQPDSISKCLTEQTCEHCASYNMLKLTCHVYQWGPQAELFDYYERTLLNHVMA- 391
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ P+ G ++ W +P D FWCC G+G+E+ ++ GDSIY+ ++G+
Sbjct: 392 QQHPRTGMFTYMTPMLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYW-QDGQ 445
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVYI Y+ S + +G + P LR+ + L LR+P
Sbjct: 446 --GVYINLYVPSTVRDAAGLDMTLHSALPEQG-SASLRIDAAPPEQ-----RMLALRVPG 497
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W + LNGQ + + +L +T+ W D L++ + LR EA DD P + S
Sbjct: 498 WAQQ--PRLQLNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLEATPDD-PAWVS- 553
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ--LITFTQEYGNTKFVLTN 682
+L GP VLA D+ ++A W PA Q L GNT FV +
Sbjct: 554 --VLRGPLVLA------VDLGDAAKP---WSGKTPALIGGQDILQRLQPVPGNTAFVYND 602
Query: 683 SNQSITMEKF 692
Q + F
Sbjct: 603 GLQQWQLSPF 612
>gi|84624616|ref|YP_451988.1| hypothetical protein XOO_2959 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84368556|dbj|BAE69714.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 791
Score = 258 bits (660), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 193/610 (31%), Positives = 283/610 (46%), Gaps = 70/610 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DN +AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQVAVGL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++ H+++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKCLTEQTCEHCASYNMLKLTCHVYQWCPQAELFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ P+ G ++ W +P D FWCC G+G+E+ ++ GDSIY+ ++G+
Sbjct: 400 QQHPRTGMFTYMTPMLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYW-QDGQ 453
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVYI Y+ S + +G + P LR+ + L LR+P
Sbjct: 454 --GVYINLYVPSTVRDAAGLDMTLHSALPEQG-SASLRIDAAPPEQ-----RMLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W + LNGQ + + +L +T+ W D L++ + LR EA DD P + S
Sbjct: 506 WAQQ--PRLQLNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLEATPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ--LITFTQEYGNTKFVLTN 682
+L GP VLA D+ ++A W PA Q L GNT FV +
Sbjct: 562 --VLRGPLVLA------VDLGDAAKP---WSGKTPALIGGQDILQRLQPVPGNTAFVYND 610
Query: 683 SNQSITMEKF 692
Q + F
Sbjct: 611 GLQQWQLSPF 620
>gi|325836901|ref|ZP_08166283.1| hypothetical protein HMPREF9402_1694 [Turicibacter sp. HGF1]
gi|325491107|gb|EGC93399.1| hypothetical protein HMPREF9402_1694 [Turicibacter sp. HGF1]
Length = 763
Score = 258 bits (659), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 179/538 (33%), Positives = 275/538 (51%), Gaps = 60/538 (11%)
Query: 123 VRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFV 182
VRL DS+ +Q +YLL LDV++L+ + A P YGGWE S E++GH +
Sbjct: 6 VRLEKDSLFEISQAVGKQYLLDLDVERLLAPIYEGASQIPPKPSYGGWE--SLEIKGHSI 63
Query: 183 GHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLE------ 236
GHYLSA A M+ +T + LKE+M ++ S Q+ GYL F + F+++
Sbjct: 64 GHYLSALACMYEATKDLELKERMDYIIETFSLLQR--ADGYLGGFLSTPFEQVFTGEFHV 121
Query: 237 ---ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIER 293
+L W P+Y+IHKI AGL+D Y N EAL + + ++ Y + + S E+
Sbjct: 122 DHFSLSHYWVPWYSIHKIYAGLMDAYQIGKNVEALNILKKLADWAYEGSRLM----SDEQ 177
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHI 353
+ L E GGMN+V+ +L+ ITQD ++L LA F + + LA DD+ G H+NT I
Sbjct: 178 FQRMLICEYGGMNEVMAELYEITQDERYLYLAKRFTQHLIMDPLAAGVDDLQGRHANTQI 237
Query: 354 PIVIGSQMRYEVTGDQLHKEGHQL---------------ESSGTNIGHFNFKSDPKRLAS 398
P V+G+ YEVTGD + + SSG + G SD + L+
Sbjct: 238 PKVLGAAKLYEVTGDDYYYRVAKFFFETVVLHRSYVIGGNSSGEHFG----PSDTEPLS- 292
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLP 458
E+C TYNM+K++++LF+WTK+ Y D+ ER+ N +L Q G IY
Sbjct: 293 ---REAAETCNTYNMIKLAKYLFKWTKDSKYIDFIERATYNHILASQ-DPHTGCKIYFTS 348
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
PG K +GT DSFWCC GTG+E+ + I+F+E+ + Y+ +++S
Sbjct: 349 NYPGHFKV-----YGTKEDSFWCCTGTGMENPGRYTHHIFFKEDEDF---YVNLFMASSF 400
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+ Q+ V + D +S V L F + + L ++ +R+P W ++ + GQ
Sbjct: 401 VKEDEQLKVVLQTDFPIS----NVVKLVF-EEANQLFLNVKIRVPYWLNA-PIEVRFKGQ 454
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
G +L ++ T+ +DD++ I LP+ L E + D P A +YGP VLA
Sbjct: 455 SYEANGQG-YLMISDTFHADDEIEIVLPMGLH-EYVSMDDPHKV---AFMYGPVVLAA 507
>gi|312621677|ref|YP_004023290.1| hypothetical protein Calkro_0576 [Caldicellulosiruptor
kronotskyensis 2002]
gi|312202144|gb|ADQ45471.1| protein of unknown function DUF1680 [Caldicellulosiruptor
kronotskyensis 2002]
Length = 588
Score = 258 bits (659), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 169/570 (29%), Positives = 288/570 (50%), Gaps = 46/570 (8%)
Query: 125 LGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPA----PGEPYGGWEEPSCELRGH 180
L ++S +R + N Y+L L + L+ NF + L + P + +GGWE P+C+LRGH
Sbjct: 15 LLNESEFYRRFEINRNYMLSLKTENLLQNFYLESGLVSWSFLPQDIHGGWESPTCQLRGH 74
Query: 181 FVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIP 240
F+GH+LSA+A ++A+ +E +K K +++ L CQ+E G ++ + P + F+ +
Sbjct: 75 FLGHWLSAAAKIYANFGDEEIKGKADYIINELEKCQRENGGEWVGSIPEKYFEWMARGKY 134
Query: 241 VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNE 300
VWAP+YT+HK GL+D Y YA N +AL + +FY ++S E+ L+
Sbjct: 135 VWAPHYTVHKTFMGLVDMYKYASNQKALEIADKWANWFYRWS----GQFSREKMDDILDY 190
Query: 301 EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
E GGM ++ +L+ IT+D K+ L + + L + D ++G H+NT IP + G+
Sbjct: 191 ETGGMLEIWAELYDITKDSKYKDLMERYYRGRLFDRLLMGEDVLTGKHANTTIPEIHGAA 250
Query: 361 MRYEVTGDQLHK------------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESC 408
+E+TG++ + E + G +G + +++ + L + +E C
Sbjct: 251 RVWEITGEEKFRKIVESYWKEAVDERGYFCTGGQTLGE--VWTPKQKIKNYLGTTNQEHC 308
Query: 409 TTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERS 468
YNM++++ LFRWT + Y+DY ER++ NG+ QR + G++ Y LPL PGS K
Sbjct: 309 VVYNMIRLAEFLFRWTGDKRYSDYIERNIYNGLFAQQR-LKDGMVTYYLPLMPGSQK--- 364
Query: 469 YHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ---I 525
WGTP++ FWCC+GT +++ + D IY++ + G+ I Q+I S + WK + I
Sbjct: 365 --RWGTPTNDFWCCHGTLVQAHTIYNDLIYYKSQN---GIVISQFIPSSVTWKDDKGNDI 419
Query: 526 VVNQKVDPVVSWDPYL----RVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP 581
+ Q + Y + + K S + L +R P W + +NG
Sbjct: 420 TITQYFERKHGSFAYTAEKDEIYIEIQCK-SPVEFELAIRKPWWAKK--VEIEINGNSYY 476
Query: 582 LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGD 641
++ +T+ W +++K+ I + T ++ DD P+ A + GP VLAG
Sbjct: 477 AADDSPYIQLTQRW-NNEKIKITFYKAVETCSMPDD-PQQV---AFMIGPVVLAGLCERR 531
Query: 642 WDITESATSLSDWITPIPASYNSQLITFTQ 671
I + + I PI L+ TQ
Sbjct: 532 RKIYIGERKIEEIIVPIDKRGYGPLLYTTQ 561
>gi|21243263|ref|NP_642845.1| hypothetical protein XAC2530 [Xanthomonas axonopodis pv. citri str.
306]
gi|21108798|gb|AAM37381.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 791
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 194/610 (31%), Positives = 284/610 (46%), Gaps = 70/610 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + +NA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCENAQALQVAVAL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q V + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGVFAALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RHL++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+ ++G+
Sbjct: 400 QQHPRTGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYW-QDGQ 453
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVY+ Y+ S + +G + P LR+ ++ +L LR+P
Sbjct: 454 --GVYVNLYVPSTVRDAAGLNMTLHSALPEQG-SASLRIDGAPPAQ-----RTLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W LNGQ + + +L +T+ W D L++ + LR E+ DD P + S
Sbjct: 506 WAQQ--PHLQLNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ--LITFTQEYGNTKFVLTN 682
+L GP VLA D+ ++A W PA Q L G FV T+
Sbjct: 562 --VLRGPLVLA------VDLGDAAKP---WSGKTPALIGGQEVLQRLQPAPGKPAFVYTD 610
Query: 683 SNQSITMEKF 692
Q F
Sbjct: 611 GAQQWQFSPF 620
>gi|381170950|ref|ZP_09880102.1| Tat (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas citri pv. mangiferaeindicae LMG
941]
gi|380688673|emb|CCG36589.1| Tat (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas citri pv. mangiferaeindicae LMG
941]
Length = 791
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 183/557 (32%), Positives = 271/557 (48%), Gaps = 60/557 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL + S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-TPSLFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DNA+AL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVDL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGIFSVLDDTQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D+++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLIAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHAVTDHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RHL++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+ ++G+
Sbjct: 400 QQHPRTGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYW-QDGQ 453
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVY+ Y+ S + +G + P LR+ ++ +L LR+P
Sbjct: 454 --GVYVNLYVPSTVRDAAGLNMTLHSALPEQG-SASLRIDGAPPAQ-----RTLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W LNGQ + + +L +T+ W D L++ + LR E+ DD P + S
Sbjct: 506 WAQQ--PHLQLNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLESTPDD-PAWVS- 561
Query: 625 QAILYGPYVLAGHSIGD 641
+L GP VLA +GD
Sbjct: 562 --VLRGPLVLAA-DLGD 575
>gi|337745980|ref|YP_004640142.1| hypothetical protein KNP414_01710 [Paenibacillus mucilaginosus
KNP414]
gi|336297169|gb|AEI40272.1| protein of unknown function DUF1680 [Paenibacillus mucilaginosus
KNP414]
Length = 636
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 184/583 (31%), Positives = 287/583 (49%), Gaps = 86/583 (14%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARL-------------- 160
+KE+S VRL + R + N Y++ L + L+ NF A L
Sbjct: 1 MKELSSGRVRLAPGPLQARLE-LNKRYVMSLTNENLLRNFYLEAGLWSYSGNGGTTSATT 59
Query: 161 ---PAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQK 217
P + GWE P+CELRGH +GH+LSA+A ++ T + +K K +V+ L+ CQ+
Sbjct: 60 TSTDGPEHWHWGWESPTCELRGHIMGHWLSAAATIYGQTQDGLVKAKADYIVAELARCQE 119
Query: 218 EIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEY 277
G +L+AFP R+ VWAP+YTIHK+L GL D Y A +A AL + T M +
Sbjct: 120 ANGGEWLAAFPESYMHRIARGKYVWAPHYTIHKLLMGLYDMYRLAGSAAALELMTNMAAW 179
Query: 278 FYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLL 337
FY ++ E L+ E GGM + L+ +T HL L +D+ F L
Sbjct: 180 FYRWTDG----FTREEMDDLLDLETGGMLETWADLYGVTGSGAHLELVRRYDRRRFFDAL 235
Query: 338 ALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK-------------EGHQLESSGTNI 384
D ++ H+NT IP ++G+ +EVTG++ ++ G+ +G N
Sbjct: 236 LEGRDVLTNKHANTQIPEILGAARAWEVTGEERYRRIVEAFWRCAVSERGYTATGAGDNG 295
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
+ + + +A+ L + +E C YNM+++++ L RWT + AYADY+ER NGVL
Sbjct: 296 ELWMPQGE---MAARLGAG-QEHCCNYNMMRLAQVLLRWTGDPAYADYWERRFVNGVLAH 351
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q G E G++ Y + L GS K WGTP+ FWCC+GT +++ + I+ EEE
Sbjct: 352 QHG-ETGMISYFIGLGAGSRKT-----WGTPTGHFWCCHGTLMQANASYEGQIFMEEE-- 403
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKV--------DPVVSWD------------PYLRV- 543
G+ + Q++ S+L+++ G + ++ +P+ SW P + V
Sbjct: 404 -DGLAVCQWLPSKLEYEIGGTAIRLRIEQDGQHGLEPLSSWSVTGMTAITRADLPPIPVH 462
Query: 544 -------TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS---PGNFLSVTK 593
LTF ++ +T L +R+P W S T+NG+ PL P F+ + +
Sbjct: 463 RPDRFMYRLTFEAE-RAVTFKLRMRLPWWLSGE-PVITVNGE-APLQGELKPSTFVELER 519
Query: 594 TWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
W S D +T++LP L+ EA+ P A L GP VLAG
Sbjct: 520 EWKSGDTITVELPKGLKAEAL----PGEPGTVAFLDGPIVLAG 558
>gi|379719928|ref|YP_005312059.1| hypothetical protein PM3016_2010 [Paenibacillus mucilaginosus 3016]
gi|378568600|gb|AFC28910.1| hypothetical protein PM3016_2010 [Paenibacillus mucilaginosus 3016]
Length = 641
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 184/583 (31%), Positives = 287/583 (49%), Gaps = 86/583 (14%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARL-------------- 160
+KE+S VRL + R + N Y++ L + L+ NF A L
Sbjct: 6 MKELSSGRVRLAPGPLQARLE-LNKRYVMSLTNENLLRNFYLEAGLWSYSGNGGTTSATT 64
Query: 161 ---PAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQK 217
P + GWE P+CELRGH +GH+LSA+A ++ T + +K K +V+ L+ CQ+
Sbjct: 65 TSTDGPEHWHWGWESPTCELRGHIMGHWLSAAATIYGQTQDGLVKAKADYIVAELARCQE 124
Query: 218 EIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEY 277
G +L+AFP R+ VWAP+YTIHK+L GL D Y A +A AL + T M +
Sbjct: 125 ANGGEWLAAFPESYMHRIARGKYVWAPHYTIHKLLMGLYDMYRLAGSAAALELMTNMAAW 184
Query: 278 FYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLL 337
FY ++ E L+ E GGM + L+ +T HL L +D+ F L
Sbjct: 185 FYRWTDG----FTREEMDDLLDLETGGMLETWADLYGVTGSGAHLELVRRYDRRRFFDAL 240
Query: 338 ALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK-------------EGHQLESSGTNI 384
D ++ H+NT IP ++G+ +EVTG++ ++ G+ +G N
Sbjct: 241 LEGRDVLTNKHANTQIPEILGAARAWEVTGEERYRRIVEAFWRCAVSERGYTATGAGDNG 300
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
+ + + +A+ L + +E C YNM+++++ L RWT + AYADY+ER NGVL
Sbjct: 301 ELWMPQGE---MAARLGAG-QEHCCNYNMMRLAQVLLRWTGDPAYADYWERRFVNGVLAH 356
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q G E G++ Y + L GS K WGTP+ FWCC+GT +++ + I+ EEE
Sbjct: 357 QHG-ETGMISYFIGLGAGSRKT-----WGTPTGHFWCCHGTLMQANASYEGQIFMEEE-- 408
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKV--------DPVVSWD------------PYLRV- 543
G+ + Q++ S+L+++ G + ++ +P+ SW P + V
Sbjct: 409 -DGLAVCQWLPSKLEYEIGGTAIRLRIEQDGQHGLEPLSSWSVTGMTAITRADLPPIPVH 467
Query: 544 -------TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS---PGNFLSVTK 593
LTF ++ +T L +R+P W S T+NG+ PL P F+ + +
Sbjct: 468 RPDRFMYRLTFEAE-RAVTFKLRMRLPWWLSGE-PVITVNGE-APLQGELKPSTFVELER 524
Query: 594 TWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
W S D +T++LP L+ EA+ P A L GP VLAG
Sbjct: 525 EWKSGDTITVELPKGLKAEAL----PGEPGTVAFLDGPIVLAG 563
>gi|398305096|ref|ZP_10508682.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus vallismortis
DV1-F-3]
Length = 762
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 166/528 (31%), Positives = 268/528 (50%), Gaps = 51/528 (9%)
Query: 130 MHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSAS 189
M + +Q EYLL LDVD+L+ + YGGWE + E+ GH VGH+LSA+
Sbjct: 10 MFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAGHSVGHWLSAA 67
Query: 190 ALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD-------RLE--ALIP 240
+ M+ ++ +E LK K + V+ LS Q+ GY+S F FD R++ +L
Sbjct: 68 SAMYRASGDEELKRKTAYAVNELSHIQQFDQEGYVSGFSRACFDEVFSGDFRVDHFSLGG 127
Query: 241 VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNE 300
W P+Y++HK+ AGL+D Y N ALR+ + ++ + + + + E+ + L
Sbjct: 128 SWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLNDEQFQRMLIC 183
Query: 301 EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
E GGMN+ + L+ +T++ +L LA F L LA D++ G H+NT IP VIG+
Sbjct: 184 EHGGMNEAMADLYMLTKNKAYLELAERFCHRAILQPLAEGKDELEGKHANTQIPKVIGAA 243
Query: 361 MRYEVTGDQLHKEG-----------HQLESSGTNIG-HFNFKSDPKRLASNLDSNTEESC 408
Y++TG++ ++ G +IG HF + + L T E+C
Sbjct: 244 KLYDITGNEAYRNAALFFWEQVVYQRSYAIGGNSIGEHFGAEG-----SEELGVTTAETC 298
Query: 409 TTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERS 468
TYNMLK++ HLFRW +E + DYYE +L N +L Q + G+ Y + PG K
Sbjct: 299 NTYNMLKLTAHLFRWFQESKFMDYYENALYNHILASQ-DPDSGMKTYFVSTQPGHFKV-- 355
Query: 469 YHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVN 528
+ +P DSFWCC GTG+E+ ++ IY + +Y+ +I S++ + +++
Sbjct: 356 ---YCSPEDSFWCCTGTGMENPARYTKHIYHIDRDD---LYVNLFIPSQIHVREKHMLIA 409
Query: 529 QKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNF 588
Q+ P T K G+ +L++RIP W + G KA +NG+ + +
Sbjct: 410 QETSF-----PAAEQTRLMVKKADGVPMALHIRIPYW-AHGGLKAAVNGKRIQPVEKNGY 463
Query: 589 LSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
L + K W++ D + + LP+ L +DD + ++YGP VLAG
Sbjct: 464 LVIHKHWNTGDCIEVDLPMKLHLYQAKDDPKK----NVLMYGPVVLAG 507
>gi|389647349|ref|XP_003721306.1| hypothetical protein MGG_09030 [Magnaporthe oryzae 70-15]
gi|351638698|gb|EHA46563.1| hypothetical protein MGG_09030 [Magnaporthe oryzae 70-15]
Length = 680
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 180/557 (32%), Positives = 270/557 (48%), Gaps = 60/557 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEP 173
L +V+L+ R + Q L Y+ +D+++L++NFR + G + GGW+ P
Sbjct: 86 LSQVTLNQGRFRDN------QDRTLTYIKFVDLNRLLYNFRANHGVSTNGAQANGGWDAP 139
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFP 228
R H GH+L+A A +A ++ + + V L+ CQ +GYLS FP
Sbjct: 140 DFPFRSHIQGHFLTAWANCYAVLKDQECRSRAEQFVEELAKCQDNNAAAGFQAGYLSGFP 199
Query: 229 TEQFDRLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVI 286
+E L PYY IHK +AGLLD + + +A + M + R
Sbjct: 200 ESDITAVEQRTLTNGNVPYYAIHKTMAGLLDVWRNVGSTKAKDVLVKMAGWVDTRT---- 255
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
+ S + + E GGM++VL +F T D + L +A FD L LA D + G
Sbjct: 256 ARLSYAQMQSMMGTEFGGMSEVLADMFHQTGDERWLTVARRFDHAAVLDPLARSQDSLDG 315
Query: 347 FHSNTHIPIVIGSQMRYEVTGDQLH-----------KEGHQLESSGTNIG-HFNFKSDPK 394
H+NT +P IG+ Y+ T DQ + E H G + HF P
Sbjct: 316 LHANTQVPKWIGAAREYKATKDQRYLDIARNAWDFTVEAHTYAIGGNSQSEHFR---PPN 372
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFR-----WTKEIAYADYYERSLTNGVLGIQR-GT 448
+A L +T E+C TYNMLK++R LF + A D+YER+L N +LG Q G
Sbjct: 373 AIAGYLLHDTAEACNTYNMLKLTRELFMHDAAPGMNDTAKFDFYERALLNHLLGQQDPGD 432
Query: 449 EPGVMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
G + Y PL PG + W T +SFWCC GTGIE+ +KL DSIYF
Sbjct: 433 GHGHVTYFTPLNPGGRRGVGPAWGGGTWSTDYESFWCCQGTGIETNTKLMDSIYFRSRDN 492
Query: 505 YPGVYIIQYISSRLDW--KSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRI 562
+Y+ +I S + W + G +V + P+ TLT S G G T L++RI
Sbjct: 493 N-ALYVNLFIPSSVQWSDRDGVVVTQETEFPLGD-----ATTLTVSGAGGGRWT-LSVRI 545
Query: 563 PTWTSSNGAKATLNGQDLP---LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
P+W + GA+ ++NGQ + +PG + ++T+ W+ DK+T++LP+ L T A DD
Sbjct: 546 PSWVAG-GAEVSVNGQKVGGDVRTTPGGYAAITREWAVGDKVTVRLPMKLHTVAANDD-- 602
Query: 620 EYASIQAILYGPYVLAG 636
++ A+ YGP +L+G
Sbjct: 603 --PTLVALAYGPAILSG 617
>gi|393782435|ref|ZP_10370619.1| hypothetical protein HMPREF1071_01487 [Bacteroides salyersiae
CL02T12C01]
gi|392673263|gb|EIY66726.1| hypothetical protein HMPREF1071_01487 [Bacteroides salyersiae
CL02T12C01]
Length = 781
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 183/577 (31%), Positives = 284/577 (49%), Gaps = 77/577 (13%)
Query: 118 VSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCEL 177
+S+ +VRL A + + ++L+ L D+ + F + A Y GWE+ S
Sbjct: 47 ISISEVRLLQGPFK-AAMEADRKWLMSLQPDRFLHRFHENAGFTPKAPMYDGWEDSS--Q 103
Query: 178 RGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRL-- 235
G GHYLSA ++++A+T + L ++ ++ + CQ IG+GY++A P DRL
Sbjct: 104 SGFSFGHYLSAMSMLYAATGDNELLGRIEYSINEIRKCQLAIGTGYVAAIPDG--DRLWN 161
Query: 236 ----EALIP-------VWAPYYTIHKILAGLLDQYTYAD----NAEALRMTTWMVEYFYN 280
+ + P WAP+Y +HK+ +G +D Y Y A+ +T W + F +
Sbjct: 162 ELVADKIEPGGSWINGFWAPWYNLHKLWSGFIDVYLYTGVETAKTVAIELTDWACDKFRD 221
Query: 281 RVQNVIKKYSIERHWQTL-NEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
+ WQ + + E GGMND LY ++ IT + ++L LA F + L+
Sbjct: 222 MTDD---------QWQRMISCETGGMNDALYNMYAITGNLRYLQLADKFYHYSVMEPLSQ 272
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK-----------EGHQLESSG-TNIGHF 387
Q D+++G H+NT IP V G YE+ G + K + H G +N HF
Sbjct: 273 QRDELNGLHANTQIPKVTGIARSYELRGREKDKTIATFFWNTVLKKHTYCIGGNSNYEHF 332
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
P L L T E+C TYNMLK++ HLF W + Y DYYER+L N +L Q
Sbjct: 333 ---GKPGELF--LSDKTTETCNTYNMLKLTGHLFAWEPKAEYMDYYERALYNHILASQ-N 386
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
E G+++Y LPLA S KE S TP SFWCC GTG E+ K + IY E E
Sbjct: 387 HETGMVVYSLPLAYASFKEFS-----TPEHSFWCCVGTGFENHVKYAEGIYSESEND--- 438
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+YI +++SRL+W+ +++ Q+ + S L + S T +L++R P W +
Sbjct: 439 LYINLFVASRLNWRRKGMIIEQQTEFPESDKSSLILRCAKSQ-----TLTLHIRYPQWAT 493
Query: 568 SNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
+ G +N + + PG+++S+ + W DK+ I++P +L E + D ++ A
Sbjct: 494 T-GYTIKVNDKIQEIEKKPGSYISLNRLWKDGDKIEIEMPKSLHKEVLPGDEHKF----A 548
Query: 627 ILYGPYVLAGHSIGDWD------ITESATSLSDWITP 657
L GP VLAG D D + + + L DWI P
Sbjct: 549 FLNGPIVLAGEM--DLDERKIVFLEKKDSELRDWIQP 583
>gi|300777572|ref|ZP_07087430.1| acetyl-CoA carboxylase [Chryseobacterium gleum ATCC 35910]
gi|300503082|gb|EFK34222.1| acetyl-CoA carboxylase [Chryseobacterium gleum ATCC 35910]
Length = 791
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 174/541 (32%), Positives = 268/541 (49%), Gaps = 51/541 (9%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L VRL S+S+ +A + + +YL+ L+ D+L+ + K A L Y WE + L G
Sbjct: 29 LETVRL-SESVFSKAMKADHKYLMALEPDRLLAPYLKEAGLKPKANNYPNWE--NTGLDG 85
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLE--- 236
H GHY+SA +LM+AST +++++E+++ ++S L CQK GY+S P + E
Sbjct: 86 HIGGHYISALSLMYASTGDKAIQERINYMISELERCQKASPDGYISGIPNGKKIWKEIKQ 145
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
L W P Y IHK+ +GL D Y YA N +A M + ++ N V N+
Sbjct: 146 GNIRASGFGLNDRWVPLYNIHKLYSGLRDAYWYAKNEKAKAMLIKLTDWMANEVSNL--- 202
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
S E+ L E GG+N+V ++ IT D K+L LAH F L L D ++G H
Sbjct: 203 -SDEQIQDMLRSEHGGLNEVFADVYEITHDQKYLKLAHRFSHQAILSPLLTGEDKLTGLH 261
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHKEGHQL------ESSGTNIG------HFNFKSDPKRL 396
+NT IP VIG + ++ + E + IG HFN +D +
Sbjct: 262 ANTQIPKVIGYKRIADLENNTSWSNAADFFWHNVTEKRSSVIGGNSVSEHFNPVNDFSSM 321
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
+++ E+C TYNMLK+++ L+ E Y DYYE++L N +L + + G +Y
Sbjct: 322 IKSIEG--PETCNTYNMLKLTKELYATLPESYYIDYYEKALYNHILSTE-NHDHGGFVYF 378
Query: 457 LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P+ PG Y + P SFWCC G+GIE+ +K G+ IY + +Y+ +I S
Sbjct: 379 TPMRPG-----HYRVYSQPQTSFWCCVGSGIENHAKYGEMIYARSDKD---LYVNLFIPS 430
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
L WK +V+ Q V ++ TL F + G L LR P WT+ + K +N
Sbjct: 431 TLTWKQQNVVLRQ----VNNFPEAPETTLIFDAAGKS-EFDLKLRCPEWTTPSEVKILVN 485
Query: 577 G-QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G Q+ + ++TK W D + + LP+ L E + P++++ A YGP VLA
Sbjct: 486 GKQERVQRGSDGYFTLTKKWKKGDVVKMTLPMQLSAEQL----PDHSNYYAFKYGPVVLA 541
Query: 636 G 636
Sbjct: 542 A 542
>gi|86196151|gb|EAQ70789.1| hypothetical protein MGCH7_ch7g196 [Magnaporthe oryzae 70-15]
gi|440463815|gb|ELQ33359.1| hypothetical protein OOU_Y34scaffold00969g44 [Magnaporthe oryzae
Y34]
gi|440485206|gb|ELQ65183.1| hypothetical protein OOW_P131scaffold00516g8 [Magnaporthe oryzae
P131]
Length = 633
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 180/557 (32%), Positives = 270/557 (48%), Gaps = 60/557 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEP 173
L +V+L+ R + Q L Y+ +D+++L++NFR + G + GGW+ P
Sbjct: 39 LSQVTLNQGRFRDN------QDRTLTYIKFVDLNRLLYNFRANHGVSTNGAQANGGWDAP 92
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFP 228
R H GH+L+A A +A ++ + + V L+ CQ +GYLS FP
Sbjct: 93 DFPFRSHIQGHFLTAWANCYAVLKDQECRSRAEQFVEELAKCQDNNAAAGFQAGYLSGFP 152
Query: 229 TEQFDRLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVI 286
+E L PYY IHK +AGLLD + + +A + M + R
Sbjct: 153 ESDITAVEQRTLTNGNVPYYAIHKTMAGLLDVWRNVGSTKAKDVLVKMAGWVDTRT---- 208
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
+ S + + E GGM++VL +F T D + L +A FD L LA D + G
Sbjct: 209 ARLSYAQMQSMMGTEFGGMSEVLADMFHQTGDERWLTVARRFDHAAVLDPLARSQDSLDG 268
Query: 347 FHSNTHIPIVIGSQMRYEVTGDQLH-----------KEGHQLESSGTNIG-HFNFKSDPK 394
H+NT +P IG+ Y+ T DQ + E H G + HF P
Sbjct: 269 LHANTQVPKWIGAAREYKATKDQRYLDIARNAWDFTVEAHTYAIGGNSQSEHFR---PPN 325
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFR-----WTKEIAYADYYERSLTNGVLGIQR-GT 448
+A L +T E+C TYNMLK++R LF + A D+YER+L N +LG Q G
Sbjct: 326 AIAGYLLHDTAEACNTYNMLKLTRELFMHDAAPGMNDTAKFDFYERALLNHLLGQQDPGD 385
Query: 449 EPGVMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
G + Y PL PG + W T +SFWCC GTGIE+ +KL DSIYF
Sbjct: 386 GHGHVTYFTPLNPGGRRGVGPAWGGGTWSTDYESFWCCQGTGIETNTKLMDSIYFRSRDN 445
Query: 505 YPGVYIIQYISSRLDW--KSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRI 562
+Y+ +I S + W + G +V + P+ TLT S G G T L++RI
Sbjct: 446 N-ALYVNLFIPSSVQWSDRDGVVVTQETEFPLGD-----ATTLTVSGAGGGRWT-LSVRI 498
Query: 563 PTWTSSNGAKATLNGQDLP---LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
P+W + GA+ ++NGQ + +PG + ++T+ W+ DK+T++LP+ L T A DD
Sbjct: 499 PSWVAG-GAEVSVNGQKVGGDVRTTPGGYAAITREWAVGDKVTVRLPMKLHTVAANDD-- 555
Query: 620 EYASIQAILYGPYVLAG 636
++ A+ YGP +L+G
Sbjct: 556 --PTLVALAYGPAILSG 570
>gi|385677991|ref|ZP_10051919.1| hypothetical protein AATC3_18830 [Amycolatopsis sp. ATCC 39116]
Length = 886
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 189/543 (34%), Positives = 288/543 (53%), Gaps = 48/543 (8%)
Query: 116 KEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC 175
+ + L VRL DS + + + + YL +D D+L+ FR TA LP+ EP GGWE P
Sbjct: 35 RPLELGRVRL-LDSRYRQNMERTVAYLRFVDADRLLHMFRVTAGLPSTAEPCGGWEAPDI 93
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTE 230
+LRGH GH LS AL A+T + L K +++V+AL+ CQ GYLSAFP
Sbjct: 94 QLRGHTTGHLLSGLALAAANTGDTELAAKGASIVAALAECQAAAPAAGFTEGYLSAFPER 153
Query: 231 QFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYS 290
F LEA VWAPYYTIHKI+AGLLDQY N +AL + M + R+ N+ +
Sbjct: 154 AFADLEAGKVVWAPYYTIHKIMAGLLDQYRLLGNRQALDVLLGMARWARARMANLTR--- 210
Query: 291 IERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSN 350
E + L+ E GGMN+ L L +T D +HL A LFD L+ + D ++G H+N
Sbjct: 211 -EAQQKVLHTEFGGMNETLASLALVTGDRQHLETAKLFDHDEIFVPLSQRRDTLAGRHAN 269
Query: 351 THIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNL 400
T I ++G+ + ++ TG++ ++ H G N + F P ++ S L
Sbjct: 270 TDIAKIVGAAVEWDATGEEYYRTIATYFWDQVVHHHTYVIGGN-ANAEFFGPPDQIVSQL 328
Query: 401 DSNTEESCTTYNMLKVSRHLF-RWTKEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLP 458
NT E+C +YNMLK+SR LF R Y DY E +L N +LG Q + G + Y
Sbjct: 329 GENTCENCNSYNMLKLSRLLFLRDPSRTDYLDYSEWTLLNQMLGEQDPDSAHGFVTYYTG 388
Query: 459 LAPGS---SKERSYHHWGTPSD---SFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
L PG+ KE GT S +F C +GTG+E+ K ++IY+ + G+++ Q
Sbjct: 389 LVPGAQRKGKEGVVSDPGTYSSDYGNFTCDHGTGLETHVKYAENIYYAADD---GLWVNQ 445
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
+I S +D+ +I +++ +D +R+ ++ G+G +L +RIP+W + A+
Sbjct: 446 FIPSEVDYGGVRI----RLETEYPYDETVRLHVS----GAG-AFALRVRIPSWATH--AR 494
Query: 573 ATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
+NG+ + PG F V + W D + ++LP+T++ P+ ++ A+ YGP
Sbjct: 495 LFVNGEAM-RAEPGRFAVVGRRWRDGDVVELRLPMTVQWRPA----PDNPAVHALTYGPL 549
Query: 633 VLA 635
VLA
Sbjct: 550 VLA 552
>gi|329847073|ref|ZP_08262101.1| hypothetical protein ABI_01350 [Asticcacaulis biprosthecum C19]
gi|328842136|gb|EGF91705.1| hypothetical protein ABI_01350 [Asticcacaulis biprosthecum C19]
Length = 800
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 188/603 (31%), Positives = 286/603 (47%), Gaps = 75/603 (12%)
Query: 102 PGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLP 161
P KVP + V L DVRL S A + N +YL+ L D+++ N+ K A LP
Sbjct: 34 PNPTKVPAAA----TAVPLSDVRL-LPSPFLTAVEANTKYLMFLSPDRMLHNYHKFAGLP 88
Query: 162 APGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS 221
GE YGGWE S + G +GHYLSA +L++A T + + ++ +++ L+ Q G
Sbjct: 89 VKGEIYGGWE--SDTIAGEALGHYLSALSLLYAQTGHAEARTRIEYIIAELAKVQAAHGD 146
Query: 222 GYLSAF-----------PTEQFDRLEA---------LIPVWAPYYTIHKILAGLLDQYTY 261
GY + F E F + A L W P+Y HK+ AGL+D TY
Sbjct: 147 GYAAGFMRKRKDASIVDGKEIFAEIMAGDIRSAGFDLNGCWVPFYNWHKLFAGLMDAQTY 206
Query: 262 ADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKH 321
A + + + Y ++ V + E+ + L+ E GG+N+ +L+ T+DP+
Sbjct: 207 AGIDAGIPVAVALGGY----IEKVFAALNDEQVQKVLDCEHGGINESFAELYTRTKDPRW 262
Query: 322 LMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG------- 374
L LA L L D ++ H+NT +P ++G YE+TG +++
Sbjct: 263 LALAERIYHHRILDPLTAGEDKLANNHANTQVPKLVGLARLYEITGKPGYRKASSFFWDR 322
Query: 375 ----HQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYA 430
H G + F +P +A ++ T ESC TYNMLK++RHL+ WT A+
Sbjct: 323 VVNHHSFAIGGNADREYFF--EPDTIAKHITEQTCESCNTYNMLKLTRHLYAWTPNAAWF 380
Query: 431 DYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESF 490
DYYER+ N ++ Q E G+ Y++PL G+ +E S TP DSFWCC +GIES
Sbjct: 381 DYYERAHLNHIMAHQN-PETGMFAYMVPLMSGTGREYS-----TPEDSFWCCVLSGIESH 434
Query: 491 SKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSK 550
SK GDSIY++ + +++ +I S+L W + + +D + +T SS
Sbjct: 435 SKHGDSIYWQSDDT---LFVNLFIPSKLTWNKAAFELTTQ----YPYDSRVAFKVTQSSG 487
Query: 551 GSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
T + +RIP W S+ +NG+ + + +TW + D +T+ LPL LR
Sbjct: 488 AKAFTVA--VRIPGWAKSH--TLLVNGKPALAAIDKGYALIRRTWKAGDVVTLDLPLELR 543
Query: 611 TEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLI-TF 669
E D + A+L GP VLA D E + W PA S L+ +F
Sbjct: 544 FEGTAGDD----KVVALLRGPMVLA----ADLGAIEDS-----WQGDAPALVGSDLLGSF 590
Query: 670 TQE 672
T E
Sbjct: 591 TPE 593
>gi|342872240|gb|EGU74628.1| hypothetical protein FOXB_14856 [Fusarium oxysporum Fo5176]
Length = 616
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 183/552 (33%), Positives = 266/552 (48%), Gaps = 59/552 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEP 173
L +VSL D R + Q L YLL +D D+L++ FRK + G + GGW+ P
Sbjct: 34 LTQVSLTDSRWMDN------QNRTLNYLLSVDPDRLLYVFRKNHGVDTKGAQTNGGWDAP 87
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFP 228
R H GH+LSA +AS + + + V L+ CQ GYLS FP
Sbjct: 88 DFPFRSHVQGHFLSAWTQCYASAGVKECGSRATYFVQELAKCQANNAKAGFNKGYLSGFP 147
Query: 229 TEQFDRLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRV 282
++E L PYY IHK LAGLLD Y + A L + +W V
Sbjct: 148 ESDITKVEDRTLNNGNVPYYAIHKTLAGLLDVYRRLGDQTAKDTMLSLASW--------V 199
Query: 283 QNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQAD 342
K S + L E GGMN+VL + T+D K L +A FD L D
Sbjct: 200 DTRTSKLSYNQMQSMLQTEFGGMNEVLADIAFYTKDAKWLKVAQRFDHAVIFDPLQQNVD 259
Query: 343 DISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSD 392
+SG H+NT +P IG+ Y+V GD+ + + + G N +F++
Sbjct: 260 KLSGLHANTQLPKWIGALREYKVGGDKKYLDIGRNAWNMVVNKHTYAIGGNSQAEHFRA- 318
Query: 393 PKRLASNLDSNTEESCTTYNMLKVSRHLFRWT-KEIAYADYYERSLTNGVLGIQR-GTEP 450
P +A L +T E+C +YNMLK++R L+ + +Y D+YE++L N +LG Q ++
Sbjct: 319 PDAIAGFLTDDTCEACNSYNMLKLTRELWALNPTDASYFDFYEKALLNHLLGQQDPSSDH 378
Query: 451 GVMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
G + Y PL G + W T +SFWCC GTG+E+ +KL DSIYF
Sbjct: 379 GHVTYFTPLKAGGRRGVGPAWGGGTWSTDYNSFWCCQGTGVETNTKLMDSIYFHTSDT-- 436
Query: 507 GVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
+Y+ + S+L+W ++ V Q D S T TF G +L +RIP+WT
Sbjct: 437 -LYVNLFTPSKLNWSQKKVSVTQTTDFPES------DTSTFKISGDTSEWTLAVRIPSWT 489
Query: 567 SSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
S A +NGQ + PG + + + W S D +T+QLP++L T A DD+ ++
Sbjct: 490 SK--ASIKVNGQAANVAVQPGKYALIKRQWKSGDTVTVQLPMSLHTVAANDDQ----TLG 543
Query: 626 AILYGPYVLAGH 637
AI +GP +LAG+
Sbjct: 544 AIAFGPVILAGN 555
>gi|399071242|ref|ZP_10749941.1| hypothetical protein PMI01_00976 [Caulobacter sp. AP07]
gi|398043612|gb|EJL36503.1| hypothetical protein PMI01_00976 [Caulobacter sp. AP07]
Length = 789
Score = 256 bits (653), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 177/533 (33%), Positives = 263/533 (49%), Gaps = 56/533 (10%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A + N YLL L D+ + NF A LPA GE YGGWE S + GH +GHY+SA +M+
Sbjct: 53 AVEVNRAYLLRLSPDRFLHNFMTFAGLPAKGEIYGGWE--SDTIAGHTLGHYVSALVVMY 110
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ-----FDRLEALIPV------- 241
T + + + +V L+ Q + G GY+ A ++ D E V
Sbjct: 111 EQTGDVECRRRADYIVGELARAQAKRGDGYIGALQRKRKDGTVVDGEEIFAEVMKGDIRS 170
Query: 242 --------WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIER 293
W+P YT+HK AGLLD + N +AL + + YF + V + E+
Sbjct: 171 GGFDLNGSWSPLYTVHKTFAGLLDVHRAWGNQQALDVAVGLGGYF----ERVFAALNDEQ 226
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA-HLFDKPCFLGLLALQADDISGFHSNTH 352
L E GG+N+ +L+ T D + L++A ++D+ L+A Q D ++ FH+NT
Sbjct: 227 MQTLLGCEYGGLNESYAELYARTGDRRWLVVAERIYDRKVLDPLVA-QQDKLANFHANTQ 285
Query: 353 IPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNIGHFNFKSDPKRLASNLDS 402
+P +IG YE+TG H G N F ++P +A+++
Sbjct: 286 VPKLIGLGRLYELTGKPQDAAAARFFWNTVTQHHSYVIGGNADREYF-AEPDTIAAHISE 344
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
T E C TYNMLK++R L+ W E A DYYER+ N V+ Q + G Y+ PL G
Sbjct: 345 QTCEHCNTYNMLKLTRQLYSWRPEGALFDYYERAHLNHVMAAQN-PKTGGFTYMTPLLTG 403
Query: 463 SSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
+ + S + D+FWCC GTG+ES +K G+SI++E EG + + YI + WK+
Sbjct: 404 ADRGYSTNE----DDAFWCCVGTGMESHAKHGESIFWEGEG---ALLVNLYIPAEAQWKA 456
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL 582
+ ++D ++P R+TL +K T + LR+P W S AK ++NGQ +
Sbjct: 457 RGAAL--RLDTRYPFEPESRLTLAKLAKPGRFT--IALRVPAWAGSE-AKVSVNGQVVTP 511
Query: 583 PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G + V + W D + I LPL LR EA D AS A++ GP VLA
Sbjct: 512 EMAGGYALVDRRWREGDVVAITLPLGLRLEATPGD----ASTVAVVRGPMVLA 560
>gi|312135764|ref|YP_004003102.1| hypothetical protein Calow_1766 [Caldicellulosiruptor owensensis
OL]
gi|311775815|gb|ADQ05302.1| protein of unknown function DUF1680 [Caldicellulosiruptor
owensensis OL]
Length = 587
Score = 256 bits (653), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 174/583 (29%), Positives = 288/583 (49%), Gaps = 58/583 (9%)
Query: 99 IKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTA 158
+K QF +P R+ L SDS +++ + N Y+L L + L+ NF +
Sbjct: 1 MKEQKQFLIPLRAS------------LYSDSEYYKRFKLNRSYMLSLKTENLLQNFYLES 48
Query: 159 RLPA----PGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSA 214
+ + P + +GGWE P+C+LRGHF+GH+LSA+A ++A+ +E +K K +V L
Sbjct: 49 GIMSWSFLPQDIHGGWESPTCQLRGHFLGHWLSAAARIYANFGDEEIKGKADYIVDELER 108
Query: 215 CQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
CQKE G ++ + P + F+ + VWAP+YT+HK GL+D Y Y N +AL +
Sbjct: 109 CQKENGGEWVGSIPEKYFEWMARGKWVWAPHYTVHKTFMGLVDMYKYTSNQKALEIVDRW 168
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
+FY ++S E+ L+ E GGM ++ +L+ IT+D K+ L + +
Sbjct: 169 ANWFYRWS----GQFSREKMDDILDYETGGMLEIWAELYNITKDIKYRDLMERYYRGRLF 224
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK------------EGHQLESSGT 382
L D ++G H+NT IP + G+ +EVTG++ + E + G
Sbjct: 225 DRLLNGEDVLTGRHANTTIPEIHGAARVWEVTGEEKFRKIVESYWREAVEERGYFCTGGQ 284
Query: 383 NIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL 442
+G + +++ + L +E C YNM++++ LFRWT + Y+DY ER++ NG+
Sbjct: 285 TLGE--VWTPKQKIKNYLGPTNQEHCVVYNMIRLAEFLFRWTGDKRYSDYIERNIYNGLF 342
Query: 443 GIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEE 502
QR + G++ Y LPL PGS K WGTP++ FWCC+GT +++ + D IY++ +
Sbjct: 343 AQQR-LKDGMVTYFLPLMPGSQK-----RWGTPTNDFWCCHGTLVQAHTIYNDIIYYKGQ 396
Query: 503 GKYPGVYIIQYISSRLDWKSGQ---IVVNQKVDPVVSWDPYL----RVTLTFSSKGSGLT 555
G+ I Q+I S + WK + I + Q Y + + K +
Sbjct: 397 N---GIVISQFIPSFVTWKDDKGNDITIKQYYGRRQESFAYTAKKDEICIEIQCKNP-IE 452
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQ 615
L +R P W + +N +++ + + W ++DK+ I T+ T +
Sbjct: 453 FELAIRKPWWAMK--IEVAVNEDLYYSIDDSSYIQLMQRW-NNDKVKITFYKTVETCPMP 509
Query: 616 DDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPI 658
DD P+ A + GP VLAG IT + + D I PI
Sbjct: 510 DD-PQQV---AFMIGPVVLAGLCENRKKITINGKEIKDVIIPI 548
>gi|293375008|ref|ZP_06621302.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
gi|292646370|gb|EFF64386.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
Length = 763
Score = 256 bits (653), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 178/538 (33%), Positives = 274/538 (50%), Gaps = 60/538 (11%)
Query: 123 VRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFV 182
VRL DS+ +Q +YLL LDV++L+ + A P YGGWE S E++GH +
Sbjct: 6 VRLEKDSLFEISQAVGKQYLLDLDVERLLAPIYEGASQIPPKPSYGGWE--SLEIKGHSI 63
Query: 183 GHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLE------ 236
GHYLSA M+ +T + LKE+M ++ S Q+ GYL F + F+++
Sbjct: 64 GHYLSALTCMYEATKDLELKERMDYIIETFSLLQR--ADGYLGGFLSTPFEQVFTGEFHV 121
Query: 237 ---ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIER 293
+L W P+Y+IHKI AGL+D Y N EAL + + ++ Y + + S E+
Sbjct: 122 DHFSLSHYWVPWYSIHKIYAGLMDAYQIGKNVEALNILKKLADWAYEGSRLM----SDEQ 177
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHI 353
+ L E GGMN+V+ +L+ ITQD ++L LA F + + LA DD+ G H+NT I
Sbjct: 178 FQRMLICEYGGMNEVMAELYEITQDERYLYLAKRFTQHLIMDPLAAGVDDLQGRHANTQI 237
Query: 354 PIVIGSQMRYEVTGDQLHKEGHQL---------------ESSGTNIGHFNFKSDPKRLAS 398
P V+G+ YEVTGD + + SSG + G SD + L+
Sbjct: 238 PKVLGAAKLYEVTGDDYYYRVAKFFFETVVLHRSYVIGGNSSGEHFG----PSDTEALS- 292
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLP 458
E+C TYNM+K++++LF+WTK+ Y D+ ER+ N +L Q G IY
Sbjct: 293 ---REAAETCNTYNMIKLAKYLFKWTKDSKYIDFIERATYNHILASQ-DPHTGCKIYFTS 348
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
PG K +GT DSFWCC GTG+E+ + I+F+E+ + Y+ +++S
Sbjct: 349 NYPGHFKV-----YGTKEDSFWCCTGTGMENPGRYTHHIFFKEDEDF---YVNLFMASSF 400
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+ Q+ V + D +S V L F + + L ++ +R+P W ++ + GQ
Sbjct: 401 VKEDEQLKVVLQTDFPIS----NVVKLVF-EEANQLFLNVKIRVPYWLNA-PIEVRFKGQ 454
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
G +L ++ T+ +DD++ I LP+ L E + D P A +YGP VLA
Sbjct: 455 SYEGNGQG-YLMISDTFHADDEIEIVLPMGLH-EYVSMDDPHKV---AFMYGPVVLAA 507
>gi|224536588|ref|ZP_03677127.1| hypothetical protein BACCELL_01463 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521844|gb|EEF90949.1| hypothetical protein BACCELL_01463 [Bacteroides cellulosilyticus
DSM 14838]
Length = 777
Score = 256 bits (653), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 170/528 (32%), Positives = 265/528 (50%), Gaps = 53/528 (10%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A++ YLL L+ D+ + FR A L Y GWE S + G +GHYLSA A+ +
Sbjct: 51 AEEKETAYLLELEPDRFLSGFRSEAGLVPKAPKYEGWE--SLGVAGQTLGHYLSACAMYY 108
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--FDRLEA---------LIPVW 242
A++ +E +++ ++ L +CQ+ G GYL+A P + F + A L W
Sbjct: 109 ATSGDERFLQRLEYTINELDSCQQANGDGYLAATPDGKRIFKEVSAGKIYSQGFDLNGGW 168
Query: 243 APYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
P Y +HK+LAGL+D Y YA N AL + + + Y Q++ + E+ + L E
Sbjct: 169 VPLYVMHKVLAGLIDTYQYAHNERALVVAEKLANWMYGTFQHLTE----EQMQKVLACEF 224
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLGLLALQADDISGFHSNTHIPIVIGSQM 361
GGMN+ L L+ T++ K L LA FD + LA+ DD+ G H+NT +P +IG+
Sbjct: 225 GGMNEALANLYACTKNEKFLALAQRFDNHKAIMDSLAVGVDDLEGKHANTQVPKIIGAAR 284
Query: 362 RYEVTGDQLHK-----------EGHQLESSGTNIG-HFNFKSDPKRLASNLDSNTEESCT 409
YE+TG + + H + G + G HF P +L L ++ E+C
Sbjct: 285 LYELTGSKRDSAIASFFWHTVVQNHSYVNGGNSDGEHF---GTPGQLNERLSTSNTETCN 341
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSY 469
TYNMLK++RHLF W Y+ YYER++ N +L Q + G+ Y PL G K
Sbjct: 342 TYNMLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGKK---- 396
Query: 470 HHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQ 529
+ +P SF CC G+G+E+ K GD IY EG +++ +I S+L+W +++V Q
Sbjct: 397 -GYLSPFQSFCCCSGSGMENHVKYGDFIY--SEGSDSSLWVNLFIPSQLNWTDRKMIVTQ 453
Query: 530 KVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGN-F 588
D + S D + LT ++ S + LR P W S + +NG + + N +
Sbjct: 454 DTD-IPSSD---KTVLTVKTEKS-QSVIFRLRYPEWAES--MRIKVNGSSVSFEASNNSY 506
Query: 589 LSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+S+ + W +DK+ I + T ++ D+ I YGP +LAG
Sbjct: 507 VSIEREWKDNDKIEITFKIKFYTVSMPDNEKRV----GIFYGPVLLAG 550
>gi|146301615|ref|YP_001196206.1| hypothetical protein Fjoh_3876 [Flavobacterium johnsoniae UW101]
gi|146156033|gb|ABQ06887.1| protein of unknown function DUF1680 [Flavobacterium johnsoniae
UW101]
Length = 765
Score = 255 bits (652), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 187/619 (30%), Positives = 291/619 (47%), Gaps = 80/619 (12%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
+K L +VRL D +AQ +L+Y+L L+ DKL+ + A LP YG WE S
Sbjct: 27 MKTFPLQEVRL-EDGPFKKAQDVDLKYILALNPDKLLAPYLIDAGLPVKSTRYGNWE--S 83
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--F 232
L GH GHYLSA ++M+AST N LK ++ ++S L+ CQ + G+GY+ P + +
Sbjct: 84 LGLDGHIAGHYLSALSMMYASTGNPELKNRLDYMISELARCQDKNGNGYVGGIPQGKVFW 143
Query: 233 DRLE---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFY 279
DR+ L W P Y IHK+ AGL D Y Y N +A +++ W +E
Sbjct: 144 DRIHKGDIDGSSFGLNNTWVPIYNIHKLFAGLNDAYQYTGNQQAKEVLIKLGDWFIE--- 200
Query: 280 NRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
+IK S ++ + L E GG+N+ L+ IT+D K+L A + FL L
Sbjct: 201 -----MIKPLSDDQIQKILKTEHGGINESFADLYLITKDKKYLETAQKISQKSFLESLIK 255
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQLESSGTNIG-HF 387
+ D ++G H+NT IP VIG + ++ D+ E + G ++ HF
Sbjct: 256 KEDKLTGLHANTQIPKVIGFEKIASISADKEWSEAVTFFWDNVTQKRSVAFGGNSVSEHF 315
Query: 388 NFKSDPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
N +D + L SN E+C +YNM ++S+ LF +E+ Y D+YER+L N +L Q
Sbjct: 316 NPVND---FSGMLKSNEGPETCNSYNMERLSKALFLEKQEMNYLDFYERTLYNHILSSQH 372
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY--FEEEGK 504
E G +Y P+ P Y + P S WCC G+G+E+ +K G+ IY F+E
Sbjct: 373 -PEKGGFVYFTPIRPN-----HYRVYSQPETSMWCCVGSGLENHTKYGELIYSHFDE--- 423
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
V++ +I+S L+W IV+ Q+ PY T + T LN+R P
Sbjct: 424 --AVFVNLFIASTLNWNEKGIVIEQRTKF-----PYENSTEIVLNLKKAKTFDLNIRRPK 476
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W + Q L P ++S+ + W S D + I+ E + P+ ++
Sbjct: 477 WAENFRVFINDKEQKTEL-KPSGYISLKRKWKSKDHVRIEFETKTHLEQL----PDGSNW 531
Query: 625 QAILYGPYVLAGHSIGD------WDITESATSLSDWITPIPASY-----NSQLITFTQEY 673
A + GP VLA + + D + S P+ +Y + ++ +E
Sbjct: 532 SAFVNGPIVLAAKTSKEALDGLFADDSRMGHVASGKYMPMDKAYALVGEKASYVSRLKEL 591
Query: 674 GNTKFVLTNSNQSITMEKF 692
GN +F L S+ +E F
Sbjct: 592 GNMRFAL----DSLELEPF 606
>gi|383644433|ref|ZP_09956839.1| hypothetical protein SeloA3_13744 [Sphingomonas elodea ATCC 31461]
Length = 746
Score = 255 bits (652), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 179/575 (31%), Positives = 276/575 (48%), Gaps = 66/575 (11%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A + N LL L+ D+L+ NFRK A L G+ YGGWE S + GH +GHYL+A LMW
Sbjct: 14 AVEVNHRALLQLEPDRLLHNFRKYAGLEPKGKLYGGWE--SDTIAGHTLGHYLTALVLMW 71
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRL----EALIP--------- 240
T + ++ + +V+ L+ Q + G+GY+ A ++ D E + P
Sbjct: 72 QQTGDPEMRRRADYIVAELAEAQAKRGTGYVGALGRKRKDGTIVDGEEIFPEIMRGEIKS 131
Query: 241 -------VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIER 293
W+P YT+HK+ AGLLD + NA+AL++T + YF + V + +
Sbjct: 132 GGFDLNGSWSPLYTVHKVFAGLLDVHAGWGNAQALQVTLGLAGYF----EKVFAALNDAQ 187
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHI 353
Q L E GG+N+ +L+ T+D + +++A LG L D ++ FH+NT +
Sbjct: 188 MQQMLGCEYGGLNESYAELYARTRDARWMVVAKRLYDDRVLGPLKAGEDKLANFHANTQV 247
Query: 354 PIVIGSQMRYEVTGDQ----------LHKEGHQLESSGTNIGHFNFKSDPKRLASNLDSN 403
P +IG +E+TGD GH G N F S P +A ++
Sbjct: 248 PKLIGLARIHELTGDAGDATAARFFWERVTGHHSYVIGGNADREYF-SAPDSIAQHITDQ 306
Query: 404 TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGS 463
T E C TYNMLK++ HLF W DYYER+ N V+ Q + G Y+ PL G+
Sbjct: 307 TCEHCNTYNMLKLTSHLFAWQPNGVLFDYYERAHLNHVMAAQN-PKTGGFTYMTPLMSGA 365
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
++ S + D+FWCC G+G+ES +K G++ +++ EG + + YI + +DWK+
Sbjct: 366 ERQYSQPN----EDAFWCCIGSGLESHAKHGEAAFWQGEG---ALLVNLYIPAEIDWKA- 417
Query: 524 QIVVNQKVDPVV--SWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP 581
QK V+ ++ TL ++ LR+P W A T+NG+
Sbjct: 418 -----QKAKLVLDTAYPFEGTATLKVEQLARAARFAIALRVPGWAEGK-AVVTVNGKPGD 471
Query: 582 LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH---S 638
+ V ++W DD + I LP+ LR EA D S A+L GP VLAG +
Sbjct: 472 AVFDRGYAIVARSWKRDDTIAISLPMALRLEAAPGDD----STVAVLRGPMVLAGDLGPT 527
Query: 639 IGDWDITESATSLSDWI-----TPIPASYNSQLIT 668
W+ + A +D + P PA + ++ I
Sbjct: 528 STPWNAGDPALVGTDLLAAFTPAPEPAVFETRGIV 562
>gi|289668636|ref|ZP_06489711.1| putative secreted protein [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 793
Score = 255 bits (651), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 179/551 (32%), Positives = 262/551 (47%), Gaps = 59/551 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ V L VRL S+ A TN YL+ L D+L+ NF A L YGGWE +
Sbjct: 49 VRAVPLAQVRL-MPSLFLDALNTNRRYLMRLQPDRLLHNFVLYAGLDPQAPAYGGWEADT 107
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA ALM A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 232 --------FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
FD L+ L WAP YT HK+ AGLLD + + DN +AL++ +
Sbjct: 166 KIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNVQALQVAVSL 225
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y +Q + + + L+ E GG+N+ +L T D + L LA L
Sbjct: 226 AGY----LQGIFSALDDAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHAVL 281
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNI 384
L Q D++ HSNT+IP +IG YEVTGD H G N
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGN- 340
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G + P ++ L T E C +YNMLK++RH+++W + DYYER+L N V+
Sbjct: 341 GDREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHVYQWGPQAELFDYYERTLLNHVMA- 399
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q+ G+ Y+ PL G ++ W +P D FWCC G+G+E+ ++ GDSIY+++
Sbjct: 400 QQHPRTGMFTYMTPLLAGEARG-----WSSPFDDFWCCVGSGMEAHAQFGDSIYWQDG-- 452
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
GVYI Y+ S + +G + P LR+ ++ +L LR+P
Sbjct: 453 -QGVYINLYVPSTVRDAAGLDMTLHSALPEQG-SASLRIDAAPPAQ-----RTLALRVPG 505
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W LNGQ + + +L +T+ W D L++ + LR E DD P + S
Sbjct: 506 WVQQ--PHLQLNGQPVDGSASDGYLRITRVWQPGDTLSLSFDMPLRLETTPDD-PAWVS- 561
Query: 625 QAILYGPYVLA 635
+L GP VLA
Sbjct: 562 --VLRGPLVLA 570
>gi|402300545|ref|ZP_10820034.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus alcalophilus
ATCC 27647]
gi|401724312|gb|EJS97686.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus alcalophilus
ATCC 27647]
Length = 761
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 165/530 (31%), Positives = 279/530 (52%), Gaps = 51/530 (9%)
Query: 128 DSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLS 187
D + + ++YLL LD+D+LV F + A L + YGGWEE + GH +GH+LS
Sbjct: 8 DGIFKESADKGMDYLLFLDIDRLVAPFYEAASLAPKKQRYGGWEETG--ISGHSLGHWLS 65
Query: 188 ASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA---------L 238
A+A M+ +T N +LK+K++ + L Q ++ FP+ F+++ L
Sbjct: 66 AAAYMYRNTMNRALKDKINKAIDELEYIQSVHDRNFIGGFPSTCFEKVFTGNFEVDHFTL 125
Query: 239 IPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTL 298
W P+Y++HK+ AGL+D Y N +AL + T + ++ V++ + + + + L
Sbjct: 126 AGHWVPWYSMHKLFAGLIDVYKLVKNEKALSVVTKLADW----VESGTVRLTEAQFQKML 181
Query: 299 NEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIG 358
E GGMNDV+ +L+ +TQ+ +L LA F + L L+ + D + G H+NT IP VIG
Sbjct: 182 ICEHGGMNDVMAELYLLTQNQTYLQLAIRFCEQQILEPLSNRRDLLEGKHANTQIPKVIG 241
Query: 359 SQMRYEVTGDQLHK--------EGHQLES---SGTNIG-HFNFKSDPKRLASNLDSNTEE 406
+ Y++T ++ +K E ++ S G +I HF SD L T E
Sbjct: 242 AAKLYDITKEEKYKTAATFFWQEVTRVRSYIIGGNSINEHFGRVSD-----ETLGVQTTE 296
Query: 407 SCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKE 466
+C TYNMLK++ HLF W ++ Y D+YER+L N +L Q + G+ Y + PG K
Sbjct: 297 TCNTYNMLKLTAHLFLWEQKSEYYDFYERALYNHILASQ-DPDSGMKAYFVSTEPGHFK- 354
Query: 467 RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIV 526
YH +P DSFWCC GTG+E+ ++ + IY++ + + +++ +I+S+L + ++
Sbjct: 355 -VYH---SPEDSFWCCTGTGMENPTRYSEHIYYQRDDE---LFVNLFIASQLQLEEKELR 407
Query: 527 VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPG 586
+ + D S L+V +G G S++LRIP W + +N + L
Sbjct: 408 LKLETDFPHSGRVQLKV-----EEGDGRFLSIHLRIPYWINGK-VSIFVNKKQTFLTDKK 461
Query: 587 NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
++++++ W + D++ + PL L + +DD + +YGP VLAG
Sbjct: 462 GYVTLSRRWKAGDRVEVDFPLGLHSYIAKDD----PNKVGFMYGPIVLAG 507
>gi|386837867|ref|YP_006242925.1| hypothetical protein SHJG_1777 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374098168|gb|AEY87052.1| hypothetical protein SHJG_1777 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451791159|gb|AGF61208.1| hypothetical protein SHJGH_1542 [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 769
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 170/529 (32%), Positives = 265/529 (50%), Gaps = 46/529 (8%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEPSCELRGHFVGHYLSASALMW 193
Q YL +DVD+L++NFR RL G GGW+ P+ R H GH+L+A A ++
Sbjct: 66 QDRAAAYLRFVDVDRLLYNFRANHRLSTGGASATGGWDAPTFPFRSHVQGHFLTAWAQLY 125
Query: 194 ASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEA--LIPVWAPYY 246
A T + ++K +V+ L+ CQ G+GYLS +P F LEA L PYY
Sbjct: 126 AVTGDAVARDKALYMVAELAKCQANNGAAGFGAGYLSGYPESDFTALEAGTLRNGNVPYY 185
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN 306
T+HK ++GLLD + + + +A + + + R + + + L E GGMN
Sbjct: 186 TVHKTMSGLLDVWRHLGSTQARDVLLALAGWVDART----GRLTTAQMQAVLGTEFGGMN 241
Query: 307 DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVT 366
VL L+ T D + L +A FD LA D ++G H+NT +P IG+ Y+ T
Sbjct: 242 AVLADLYQQTGDARWLTVAQRFDHAAVFDPLAANQDALAGLHANTQVPKWIGAVRAYKAT 301
Query: 367 GDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKV 416
G +++ G + G N +F++ P +A+ L +T ESC + NML +
Sbjct: 302 GITRYRDIATNAWNHCVGSHTYAIGGNSQAEHFRA-PNAIAAYLADDTCESCNSVNMLTL 360
Query: 417 SRHLFRWTKE-IAYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----ERSYH 470
+R LF T + +A DYYE++ N ++G Q +P G + Y PL PG +
Sbjct: 361 TRELFTLTPDRVALFDYYEQAWLNHIIGNQNPADPHGHITYFTPLRPGGRRGVGPAWGGG 420
Query: 471 HWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK 530
W T +FWCC GTG+E ++L DS+YF + + ++ S L W I V Q
Sbjct: 421 TWSTDYTTFWCCQGTGVEIHTRLMDSVYFHSGTT---LTVNMFVPSVLTWTQRGITVTQT 477
Query: 531 VDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG--QDLPLPSPGNF 588
S LRVT G T ++ +RIP WT+ GA ++NG Q++P + G++
Sbjct: 478 TSYPASDTTTLRVTGDV-----GGTWAMRVRIPGWTT--GASVSVNGVVQNIPAAT-GSY 529
Query: 589 LSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
++ + W+S D +T++LP+ D+ ++ A+ YGP VLAG+
Sbjct: 530 ATLDRAWASGDTVTVRLPMRTALRPANDN----PNVSAVTYGPVVLAGN 574
>gi|325106128|ref|YP_004275782.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324974976|gb|ADY53960.1| protein of unknown function DUF1680 [Pedobacter saltans DSM 12145]
Length = 782
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 171/554 (30%), Positives = 266/554 (48%), Gaps = 54/554 (9%)
Query: 110 RSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGG 169
+S L+ L +V+L D + A+Q +L+Y+L +D+DKL+ + + A L + YG
Sbjct: 22 QSNTTLQTFPLQEVKL-LDGIFKNAEQVDLKYILSMDMDKLLAPYLREAGLSEKAKSYGN 80
Query: 170 WEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT 229
WE + L GH GHYLSA +LM+AST N + +++ +S L CQ G GYL P
Sbjct: 81 WE--NSGLDGHIGGHYLSALSLMYASTKNPDINKRIDYYLSELKRCQDANGDGYLGGVPD 138
Query: 230 EQF-------DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWM 274
+ +++A L W P Y IHK+ AGL D + Y N A +++ W
Sbjct: 139 GKAMWRDISDGKIDAATFSLNKKWVPLYNIHKVFAGLYDAWVYTGNNTAKDMFIKLCDWA 198
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
F N + I+ Q L E GG+N+ + +T K++ LA F L
Sbjct: 199 TTTFGNLNEQQIQ--------QMLKSEHGGINESFADAYKLTGQQKYMDLALKFSHKAIL 250
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVT-GDQLHKEG---------HQLESSGTNI 384
L Q D ++G H+NT IP VIG + E+ D HK + + G N
Sbjct: 251 DPLRNQEDKLTGIHANTQIPKVIGFEKISEIEHKDDWHKAATFFWDNVVYKRTVAIGGNS 310
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
+F + D E+C TYNM+K+S+ L+ + E Y DY E++L N +L
Sbjct: 311 VREHFHPINNFMPMIEDIEGPETCNTYNMIKLSKALYNQSGETKYIDYIEKALYNHILSS 370
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q E G +Y P+ P Y + P S WCC G+G+E+ +K G+ IY +
Sbjct: 371 QH-PEKGGFVYFTPMRP-----NHYRVYSQPETSMWCCVGSGLENHAKYGEFIYAHND-- 422
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
+++ +I S LDWK +I + Q + + +++T + ++N+RIP
Sbjct: 423 -KDLFVNLFIPSELDWKEKKIKITQTTNFPEEGNTSIKLTEIKNE-----NFNINIRIPN 476
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W S N +NG+ + G ++++ K W D++ I LPL+ R E + D P YAS
Sbjct: 477 WASENDISVKINGKQIQPIVEGKYITLNKKWKKGDEINIDLPLSNRIEQMPDGLP-YAS- 534
Query: 625 QAILYGPYVLAGHS 638
I YGP +LA +
Sbjct: 535 --IFYGPILLAAKT 546
>gi|150003078|ref|YP_001297822.1| hypothetical protein BVU_0490 [Bacteroides vulgatus ATCC 8482]
gi|149931502|gb|ABR38200.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 783
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 172/545 (31%), Positives = 267/545 (48%), Gaps = 61/545 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
+ DVRL + A+ ++ YLL +D D+L+ + K A L E Y WE + L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWE--NTGLDG 89
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLE- 236
H GHYLSA + M+A+T N+ +K ++ ++S L CQ G GYL P + + +E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
L W P Y IHKI AGL D D+ EA +++T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMIR-------- 201
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
++ K S E+ + L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LVSKLSDEQIQEMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ G++ E + + IG HF+ D
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L+ + ++ + DYYER+L N +L Q + G
Sbjct: 322 ---FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG 378
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ ++ G+ IY ++ +Y+
Sbjct: 379 -FVYFTPMRAG-----HYRVYSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVN 429
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W QI + ++ TL S + +L RIP WT
Sbjct: 430 LFIPSTLRWGDTQI------EQQTAFPDEEGSTLVISPEKGKKEFTLLFRIPEWTKPEAL 483
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ ++NG+ + ++S+ +TWS DK+ ++LP+ LR A+ D Y +ILYGP
Sbjct: 484 RLSVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGP 539
Query: 632 YVLAG 636
VLA
Sbjct: 540 IVLAA 544
>gi|373955475|ref|ZP_09615435.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373892075|gb|EHQ27972.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 782
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 164/543 (30%), Positives = 275/543 (50%), Gaps = 50/543 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
LK L +V+L + A+ +L+Y++ L DKL+ + + A L E Y WE +
Sbjct: 24 LKTFRLQEVKLLPGIFN-DAENADLKYMMQLSPDKLLAPYLREAGLKPKAESYTNWE--N 80
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP------ 228
L GH GHYLSA A+M+AST ++ ++++ +++ L CQ + G+GY+ P
Sbjct: 81 SGLDGHIGGHYLSALAMMYASTGDKQALDRLNYMIAELKICQDKNGNGYVGGVPGSKELW 140
Query: 229 --TEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVI 286
Q D + A+ W P+Y IHK AGL D YTYA N A M ++F ++
Sbjct: 141 AAVMQGD-VGAINKKWVPFYNIHKTFAGLRDAYTYAGNETAKVMLIKFADWFVMIATSI- 198
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
+ ++ + L E GG+N+VL ++ +T D K+L A+ F L L D ++
Sbjct: 199 ---TPQKMQEMLKTEHGGVNEVLADVYALTGDKKYLTAAYSFSHQAILEPLEQGQDKLNN 255
Query: 347 FHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSDPK 394
H+NT IP VIG + +VT D + + Q ++ IG HFN +D
Sbjct: 256 LHANTQIPKVIGFKRISDVTADSNYNKAAQFFWQTVVQHRTVAIGGNSVREHFNPSNDFS 315
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI 454
+ + E+C TYNMLK++ L+ ++Y DYYER+L N +L +R G +
Sbjct: 316 SMITT--EQGPETCNTYNMLKLTEDLYLSDPRVSYIDYYERALYNHILSTER--PGGGFV 371
Query: 455 YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
Y P+ PG Y + P S WCC G+G+E+ +K G+ IY ++ V++ +I
Sbjct: 372 YFTPMRPG-----HYRVYSQPQTSMWCCVGSGMENHAKYGEMIYAHDQNN---VFVNLFI 423
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
S L+WK +V+ Q + + + ++T ++ G ++N+R P+W + K T
Sbjct: 424 PSTLNWKQKGLVLTQHTN----FPEEEKTSITINAVRPG-AFAINIRYPSWVHTGALKVT 478
Query: 575 LNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NG + + + + ++S+ + W D + + LP+ TE + P+ + +A+L+GP V
Sbjct: 479 VNGTPIKVSAKSSAYVSINRVWKKGDVIGVTLPMQTTTEQL----PDGLNYEAVLHGPIV 534
Query: 634 LAG 636
LA
Sbjct: 535 LAA 537
>gi|15614440|ref|NP_242743.1| hypothetical protein BH1877 [Bacillus halodurans C-125]
gi|10174495|dbj|BAB05596.1| BH1877 [Bacillus halodurans C-125]
Length = 758
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 162/540 (30%), Positives = 279/540 (51%), Gaps = 56/540 (10%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
S+ +V+L + + + +Q+ + +L LD+D+L+ + + A LP YGGWEE E+R
Sbjct: 3 SIENVKL-TKGLFYNSQKKGNDVILALDIDRLLAPYYEAANLPPKKRSYGGWEER--EIR 59
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA- 237
GH +GH+LSA+A M+ +T +++L E++ V L+ Q ++G Y+ FD + +
Sbjct: 60 GHSLGHWLSAAAAMYETTGDKALLERIDRAVQELATIQDDVG--YVGGVKRAHFDEMFSG 117
Query: 238 --------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKY 289
+ W P+Y +HK+ AGL+D + ++ AL + T + ++ + +
Sbjct: 118 EFQVGHFNIAGTWVPWYNLHKLFAGLIDVHQLTGHSLALTVVTKLADW----AKKGTDQL 173
Query: 290 SIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHS 349
+ ++ + L E GGMN+ + L+ +T +L LA F L LA D++ G H+
Sbjct: 174 TDDQFQRMLICEHGGMNEAMADLYTLTGHKDYLQLAIRFCHWAVLEPLANGIDELEGKHA 233
Query: 350 NTHIPIVIGSQMRYEVTGD------------QLHKEGHQLESSGTNIGHFNFKSDPKRLA 397
NT IP VIG+ +E+TGD Q+ + + +N HF +
Sbjct: 234 NTQIPKVIGAAKLFEITGDDTYRAIAEFFWRQVTNDRSYIIGGNSNSEHFGPAN-----K 288
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
L T E+C TYNMLK++ HLFRW + DYYE++L N +L Q + G+ Y +
Sbjct: 289 ETLGVETAETCNTYNMLKLTEHLFRWNRSSQLMDYYEKALYNHILASQ-DPDSGMKTYFV 347
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
L PG K S + +SFWCC+GTG+E+ ++ +IY ++ +Y+ +++S
Sbjct: 348 SLQPGHFKVYS-----SLEESFWCCFGTGLENPARYTRTIYDRDDRH---IYVNLFMASE 399
Query: 518 LDWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+ K Q+ + Q+ + P R LTF K G++ L++R+P W + A +N
Sbjct: 400 IHLKDLQVQIRQETNFPETD-----RTKLTF-VKADGVSIKLHIRVPEWVAGP-VTARIN 452
Query: 577 GQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
G++ S ++L++ + W D++ + LP+ LR +DD + I+YGP VLAG
Sbjct: 453 GKETFSESGADYLTIEREWQKGDEIEVHLPMELRIYEAKDDSHKV----GIMYGPIVLAG 508
>gi|423223548|ref|ZP_17210017.1| hypothetical protein HMPREF1062_02203 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638305|gb|EIY32149.1| hypothetical protein HMPREF1062_02203 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 777
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 169/528 (32%), Positives = 264/528 (50%), Gaps = 53/528 (10%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A++ YLL L+ D+ + FR A L Y GWE S + G +GHYLSA A+ +
Sbjct: 51 AEEKETAYLLELEPDRFLSGFRSEAGLVPKAPKYEGWE--SLGVAGQTLGHYLSACAMYY 108
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--FDRLEA---------LIPVW 242
A++ +E +++ ++ L +CQ+ G GYL+A P + F + A L W
Sbjct: 109 ATSGDERFLQRLEYTINELDSCQQANGDGYLAATPDGKRIFKEVSAGKIYSQGFDLNGGW 168
Query: 243 APYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
P Y +HK+LAGL+D Y YA N AL + + + Y Q++ + E+ + L E
Sbjct: 169 VPLYVMHKVLAGLIDTYQYAHNERALAVAEKLANWMYGTFQHLTE----EQMQKVLACEF 224
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLGLLALQADDISGFHSNTHIPIVIGSQM 361
GGMN+ L L+ T++ K L LA FD + LA+ DD+ G H+NT +P +IG+
Sbjct: 225 GGMNEALANLYACTKNEKFLALAQRFDNHKAIMDSLAVGVDDLEGKHANTQVPKIIGAAR 284
Query: 362 RYEVTGDQLHK-----------EGHQLESSGTNIG-HFNFKSDPKRLASNLDSNTEESCT 409
YE+TG + + H + G + G HF P +L L ++ E+C
Sbjct: 285 LYELTGSKRDSAIASFFWHTVVQNHSYVNGGNSDGEHF---GTPGQLNERLSTSNTETCN 341
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSY 469
TYNMLK++RHLF W Y+ YYER++ N +L Q + G+ Y PL G K
Sbjct: 342 TYNMLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGKK---- 396
Query: 470 HHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQ 529
+ +P SF CC G+G+E+ K GD IY EG +++ +I S+L+W +++V Q
Sbjct: 397 -GYLSPFQSFCCCSGSGMENHVKYGDFIY--SEGSDSSLWVNLFIPSQLNWTDRKMIVTQ 453
Query: 530 KVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGN-F 588
D + S D + LT ++ + LR P W S + +NG + + N +
Sbjct: 454 DTD-IPSSD---KTVLTVKTE-KPQSVIFRLRYPEWAES--MRIRVNGSSVSFEASNNSY 506
Query: 589 LSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+S+ + W +DK+ I + T ++ D+ I YGP +LAG
Sbjct: 507 VSIEREWKDNDKIEITFKIKFYTVSMPDNEKRV----GIFYGPVLLAG 550
>gi|423313734|ref|ZP_17291670.1| hypothetical protein HMPREF1058_02282 [Bacteroides vulgatus
CL09T03C04]
gi|392684669|gb|EIY77993.1| hypothetical protein HMPREF1058_02282 [Bacteroides vulgatus
CL09T03C04]
Length = 783
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 172/545 (31%), Positives = 266/545 (48%), Gaps = 61/545 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
+ DVRL + A+ ++ YLL +D D+L+ + K A L E Y WE + L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWE--NTGLDG 89
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLE- 236
H GHYLSA + M+A+T N+ +K ++ ++S L CQ G GYL P + + +E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
L W P Y IHKI AGL D D+ EA +++T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMIR-------- 201
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
++ K S E+ L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ G++ E + + IG HF+ D
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L+ + ++ + DYYER+L N +L Q + G
Sbjct: 322 ---FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG 378
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ ++ G+ IY ++ +Y+
Sbjct: 379 -FVYFTPMRAG-----HYRVYSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVN 429
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W QI + ++ TL S + +L RIP WT
Sbjct: 430 LFIPSTLRWGDTQI------EQQTAFPDEEGSTLVISPEKGKKEFTLLFRIPEWTKPEAL 483
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ ++NG+ + ++S+ +TWS DK+ ++LP+ LR A+ D Y +ILYGP
Sbjct: 484 RLSVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGP 539
Query: 632 YVLAG 636
VLA
Sbjct: 540 IVLAA 544
>gi|388259955|ref|ZP_10137121.1| hypothetical protein O59_001162 [Cellvibrio sp. BR]
gi|387936316|gb|EIK42881.1| hypothetical protein O59_001162 [Cellvibrio sp. BR]
Length = 803
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 177/565 (31%), Positives = 274/565 (48%), Gaps = 90/565 (15%)
Query: 122 DVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHF 181
DV+L DS +AQ TN +YL+ LD +KL+ FR+ A LP E YG WE S L GH
Sbjct: 31 DVQL-LDSPFLQAQNTNKDYLMALDTEKLLAPFRREAGLPFK-ETYGNWE--STGLDGHM 86
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP------------- 228
GHY++A AL++A+T ++ + ++++ V++ L CQ ++GSGY+ P
Sbjct: 87 GGHYVTALALLYAATKDDVVLQRLNYVIAELKKCQDKLGSGYIGGIPDSNTMWSEIARGD 146
Query: 229 --TEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRV 282
+ F E W P+Y +HKI AGL D Y YA N +A +R++ W +E
Sbjct: 147 IRADNFSTNER----WVPWYNLHKIYAGLRDAYLYAGNEDAKKMLVRLSDWTIE------ 196
Query: 283 QNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQAD 342
+ KK S E+ L E GGMN+V + IT D K+L LA F L L Q D
Sbjct: 197 --LTKKLSPEQMQTMLRTEHGGMNEVFVDVAEITGDKKYLKLAEAFSHQAILQPLEKQQD 254
Query: 343 DISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ----------LESSGTNIG------H 386
++G H+NT IP +IG ++ D H E ++ IG H
Sbjct: 255 QLTGLHANTQIPKIIG----FKKVADATHNESWNKAAEFFWQTVVDKRTVAIGGNSVKEH 310
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE--------------IAYADY 432
F+ D + +++ E+C TYNMLK+++ LF +++ + Y DY
Sbjct: 311 FHDSHDFTAMIEDVEG--PETCNTYNMLKLTQLLFLSSRDNSAADMKKSKNNPAMKYVDY 368
Query: 433 YERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSK 492
YER+L N +L Q + G ++Y + P ++ S H D WCC G+GIES SK
Sbjct: 369 YERALYNHILSSQH-PQTGGLVYFTSMRPNHYRKYSQVH-----DGMWCCVGSGIESHSK 422
Query: 493 LGDSIYFEE-EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKG 551
+ IY + + K P V++ +I SR+ W I Q + T
Sbjct: 423 YAEFIYARDLDKKIPEVFLNLFIPSRMTWAEQGISFTQNTQ-------FPDAETTELVME 475
Query: 552 SGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLR 610
+ L LR P W + + +NG+ + + PG+++++ + W DK+ + LP+ R
Sbjct: 476 TSKRFRLQLRYPRWVEAGQLQLRVNGKTVSVKQQPGDYIALERRWKKGDKVQLALPMKPR 535
Query: 611 TEAIQDDRPEYASIQAILYGPYVLA 635
E + P+ ++ A+L+GP VLA
Sbjct: 536 LEKL----PDGSNYYAVLHGPIVLA 556
>gi|319640591|ref|ZP_07995310.1| hypothetical protein HMPREF9011_00907 [Bacteroides sp. 3_1_40A]
gi|345517952|ref|ZP_08797412.1| acetyl-CoA carboxylase [Bacteroides sp. 4_3_47FAA]
gi|254835150|gb|EET15459.1| acetyl-CoA carboxylase [Bacteroides sp. 4_3_47FAA]
gi|317387761|gb|EFV68621.1| hypothetical protein HMPREF9011_00907 [Bacteroides sp. 3_1_40A]
Length = 783
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 172/545 (31%), Positives = 266/545 (48%), Gaps = 61/545 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
+ DVRL + A+ ++ YLL +D D+L+ + K A L E Y WE + L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWE--NTGLDG 89
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLE- 236
H GHYLSA + M+A+T N+ +K ++ ++S L CQ G GYL P + + +E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
L W P Y IHKI AGL D D+ EA +++T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMIR-------- 201
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
++ K S E+ L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ G++ E + + IG HF+ D
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L+ + ++ + DYYER+L N +L Q + G
Sbjct: 322 ---FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG 378
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ ++ G+ IY ++ +Y+
Sbjct: 379 -FVYFTPMRAG-----HYRVYSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVN 429
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W QI + ++ TL S + +L RIP WT
Sbjct: 430 LFIPSTLRWGDTQI------EQQTAFPDEEGSTLVISPEKGKKEFTLLFRIPEWTKPEAL 483
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ ++NG+ + ++S+ +TWS DK+ ++LP+ LR A+ D Y +ILYGP
Sbjct: 484 RLSVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGP 539
Query: 632 YVLAG 636
VLA
Sbjct: 540 IVLAA 544
>gi|427384528|ref|ZP_18881033.1| hypothetical protein HMPREF9447_02066 [Bacteroides oleiciplenus YIT
12058]
gi|425727789|gb|EKU90648.1| hypothetical protein HMPREF9447_02066 [Bacteroides oleiciplenus YIT
12058]
Length = 1145
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 176/531 (33%), Positives = 268/531 (50%), Gaps = 61/531 (11%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
AQQ + ++LL LD D+L+ F K A LP GE YGGWEE RG Y+SA A+MW
Sbjct: 421 AQQLDAKWLLSLDPDRLLHRFHKNAGLPPKGENYGGWEEHRGGGRGLGH--YMSACAMMW 478
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSA-------------FPTEQFDRLEALIP 240
AST K++ V++ L CQK G+GY+ + + FD ++P
Sbjct: 479 ASTGEPEFKQRTDYVINELERCQKARGTGYIGSVEDSIWTQVGRGDIRSTGFDLNGGIVP 538
Query: 241 VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT-LN 299
++ +HK+ AGL D Y Y N +A + + ++ Y + N+ + WQ L
Sbjct: 539 ----WFILHKLFAGLYDIYIYTGNEKAKTVLVNLCDWAYRQFGNLN-----DEQWQKMLA 589
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGS 359
E GGM +VL ++ I D K+L ++H FD F L+ Q D ++G H+NT IP V+G
Sbjct: 590 CEHGGMLEVLANVYSIVGDKKYLDMSHWFDHKQFFSPLSHQVDSLAGLHANTQIPKVVGL 649
Query: 360 QMRYEVTGDQLHK-----------EGHQLESSGTNIG-HFNFKSDPKRLASN-LDSNTEE 406
+ R+++T + K + H G G HF PK + SN L T E
Sbjct: 650 ERRHQLTHSEEDKVKSHFFWETVVKNHTYCIGGNGDGEHFG----PKGILSNRLSDRTAE 705
Query: 407 SCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKE 466
+C TYNMLK+++ L T + Y DYYE++L N +L Q E G+ Y +PL G K
Sbjct: 706 TCNTYNMLKLTKMLLAETGDTKYGDYYEKALYNHILASQ-NPETGMTTYYVPLVAGGKKG 764
Query: 467 RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIV 526
S + ++F CC GTG E+ ++ G++IYF +G+ + + YI S L W+ I
Sbjct: 765 YS-----SAFETFTCCVGTGFENHARYGEAIYF--KGRKNNLLVNLYIPSALTWEETGIT 817
Query: 527 VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-P 585
+ Q+ +++ +V T +S SL R+P WT++ + +NG+ + P P
Sbjct: 818 IRQE----GAYEKNGKVKFTINSSKPK-KASLFFRMPYWTTAK-TEVKVNGRKIDNPVIP 871
Query: 586 GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
G +L +T W +D + I + + TE P+ + AI YGP VLAG
Sbjct: 872 GMYLEITGEWKKNDIIEIHFDMPVYTEPT----PDNPNRLAIKYGPLVLAG 918
>gi|373954098|ref|ZP_09614058.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373890698|gb|EHQ26595.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 787
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 174/540 (32%), Positives = 275/540 (50%), Gaps = 49/540 (9%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
+L DV+L +S +A + + YLL ++ D+L+ FR + L G+ Y GWE S L
Sbjct: 49 NLKDVKL-LNSPFKQAMEVDAAYLLSIEPDRLLSGFRAHSGLKPKGKMYEGWE--SSGLA 105
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA- 237
GH +GHYLSA ++ +A+T + ++++ +V L CQ +GY+ A P E E
Sbjct: 106 GHTLGHYLSAISMHYAATRDPEFLKRVNYIVKELGECQVARKTGYVGAIPKEDTVWAEVA 165
Query: 238 ----------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
L W+P+YT+HK++AGLLD + Y ++ +AL + M ++ +K
Sbjct: 166 KGDIRSRGFDLNGGWSPWYTVHKVMAGLLDAFLYCNSTQALHVCKGMADW----TGETLK 221
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
E+ + L E GGM + L L+ I + K+L L++ F L LA Q D + G
Sbjct: 222 NLDDEKLQKMLLCEYGGMAETLVNLYAINGNKKYLDLSYKFYDKRILDPLANQQDILPGK 281
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRL 396
HSNT IP +I S RYE+ GD+ K H + G + ++ + S+P +L
Sbjct: 282 HSNTQIPKIIASARRYELNGDKKDKAIAEFFWETIVNNHSYATGGNS--NYEYLSEPNKL 339
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
L NT E+C TYNMLK++RHLF DYYE++L N +L Q E G+M Y
Sbjct: 340 NDKLTENTTETCNTYNMLKLTRHLFALEPSAKLMDYYEKALYNHILASQ-NHETGMMCYF 398
Query: 457 LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
+PL G KE S +P D+F CC G+G+E+ K +SIYF G +Y+ +I S
Sbjct: 399 VPLRMGGKKEYS-----SPFDTFTCCVGSGMENHVKYNESIYF--RGADGSLYVNLFIPS 451
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
L+WK + + Q+ + P T + + ++ +R P W +
Sbjct: 452 VLNWKEKGLSITQESNL-----PQSDKTTLTVTTLKPVAMAIRVRKPKWADNTTVGVNGK 506
Query: 577 GQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
Q + + G +L + + W ++DK+ +P + TEA+ P+ A+ +A+ YGP +LAG
Sbjct: 507 KQQVTADAQG-YLVINRKWKNNDKIEFIMPENIHTEAM----PDNANRRAVFYGPVLLAG 561
>gi|192360871|ref|YP_001981311.1| hypothetical protein CJA_0803 [Cellvibrio japonicus Ueda107]
gi|190687036|gb|ACE84714.1| conserved hypothetical protein [Cellvibrio japonicus Ueda107]
Length = 802
Score = 253 bits (647), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 188/561 (33%), Positives = 275/561 (49%), Gaps = 69/561 (12%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
L+ L VRL +S AQ TN +YL+ LDV+KL+ FR+ A LP E YG WE S
Sbjct: 31 LELFPLEQVRL-LESPFLAAQNTNKQYLMALDVEKLLAPFRREAGLPYK-ETYGNWE--S 86
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT----- 229
L GH GHY+SA AL +AST + ++ ++ V++ L CQ + G+GYL+ P
Sbjct: 87 TGLDGHIGGHYISALALTYASTGDPAVLARLEYVITELKKCQDKNGNGYLAGLPEGAGIW 146
Query: 230 EQFDRLE------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
++ R + + W P+Y +HK AGL D Y Y N A M E+ +
Sbjct: 147 QEIARGDIRADNFSTNERWVPWYNLHKTFAGLRDAYRYTGNETAKAMLVAFSEWTWA--- 203
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ K S E+ L+ E GGMNDV + IT D ++L LA F L L + D
Sbjct: 204 -LTKDLSDEQMQTLLHTEHGGMNDVFVDVADITGDKRYLHLAERFSHRAILQPLLEKRDA 262
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ----------LESSGTNIG------HF 387
++G H+NT IP VIG ++ GD Q + IG HF
Sbjct: 263 LTGLHANTQIPKVIG----FKRVGDAEQLAEWQSAAEFFWETVVNKRSVAIGGNSVREHF 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
+ + + + +++ E+C TYNMLK++ LF Y DYYER+L N +LG Q
Sbjct: 319 HPQDNFHSMIEDVEG--PETCNTYNMLKLTEQLFLDNPLGKYGDYYERALYNHILGSQH- 375
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK--- 504
+ G +Y P+ P + S H D WCC G+G+ES SK + IY K
Sbjct: 376 PQTGGFVYFTPMRPNHYRVYSQVH-----DGMWCCVGSGLESHSKYAEFIYARGMKKSAG 430
Query: 505 -----YPGVYIIQYISSRLDWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSL 558
P VY+ +I S+L+WK I + Q+ P V P + L S + +L
Sbjct: 431 WFARNIPQVYVNLFIPSQLNWKETGIRLRQENQFPDV---PETSIVLESSGR-----FTL 482
Query: 559 NLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDD 617
+LR P W ++ + +NG+ + S PGN+L++ + W DKL I+LP+ E++
Sbjct: 483 HLRYPQWVEADTLQLRINGKVEKISSQPGNYLAIERRWKKGDKLDIRLPMKPHLESL--- 539
Query: 618 RPEYASIQAILYGPYVLAGHS 638
P+ +S A+LYGP VLA +
Sbjct: 540 -PDGSSYYAVLYGPIVLAAKT 559
>gi|305676227|ref|YP_003867899.1| hypothetical protein BSUW23_17775, partial [Bacillus subtilis
subsp. spizizenii str. W23]
gi|305414471|gb|ADM39590.1| hypothetical protein BSUW23_17775 [Bacillus subtilis subsp.
spizizenii str. W23]
Length = 497
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 163/510 (31%), Positives = 262/510 (51%), Gaps = 49/510 (9%)
Query: 130 MHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSAS 189
M + +Q EYLL LDVD+L+ + YGGWE + E+ GH +GH+LSA+
Sbjct: 10 MFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAGHSIGHWLSAA 67
Query: 190 ALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD-------RLE--ALIP 240
+ M+ ++ +E LK K V+ LS Q+ GY+S F FD R++ +L
Sbjct: 68 SAMYQASGDEKLKRKAEYAVNELSHIQQFDEEGYISGFSRACFDEVFSGDFRVDHFSLGG 127
Query: 241 VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNE 300
W P+Y++HK+ AGL+D Y N ALR+ + ++ + + + + E+ + L
Sbjct: 128 SWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRMLIC 183
Query: 301 EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
E GGMN+ + L+ +T++ +L LA F L LA D++ G H+NT IP VIG+
Sbjct: 184 EHGGMNEAMADLYILTKNKSYLDLAERFCHRAILQPLAEGKDELEGKHANTQIPKVIGAA 243
Query: 361 MRYEVTGDQLHKE-----------GHQLESSGTNIG-HFNFKSDPKRLASNLDSNTEESC 408
Y++TG++ ++ G +IG HF + + L T E+C
Sbjct: 244 KLYDITGNEAYRNPALFFWEQVVYQRSYAIGGNSIGEHFGAEG-----SEELGVTTAETC 298
Query: 409 TTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERS 468
TYNMLK++ HLFRW E + DYYE +L N +L Q E G+ Y + PG K
Sbjct: 299 NTYNMLKLTGHLFRWFHEARFTDYYENALYNHILSSQ-DPESGMKTYFVSTQPGHFKV-- 355
Query: 469 YHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVN 528
+ +P DSFWCC GTG+E+ ++ +IY ++ +Y+ +I S+++ + Q+++
Sbjct: 356 ---YCSPEDSFWCCTGTGMENPARYTQNIYHLDQDD---LYVNLFIPSQINVREKQMIIT 409
Query: 529 QKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA-KATLNGQDLPLPSPGN 587
Q+ P T K G+ +L +RIP WT NG+ KA +NG+ +
Sbjct: 410 QETSF-----PAANKTKLVVKKADGVPMTLQIRIPYWT--NGSLKAVVNGKRVQSVEKNG 462
Query: 588 FLSVTKTWSSDDKLTIQLPLTLRTEAIQDD 617
+L++ K W++ D + I LP+ L +DD
Sbjct: 463 YLAIHKHWNTGDCIEIDLPMKLHIYQAKDD 492
>gi|290954983|ref|YP_003486165.1| hypothetical protein SCAB_3871 [Streptomyces scabiei 87.22]
gi|260644509|emb|CBG67594.1| putative secreted protein [Streptomyces scabiei 87.22]
Length = 768
Score = 253 bits (645), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 179/532 (33%), Positives = 269/532 (50%), Gaps = 54/532 (10%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY-GGWEEPSCELRGHFVGHYLSASALMW 193
Q YL +DVD+L++NFR RL G GGW+ P+ R H GH+L+A A ++
Sbjct: 66 QDRTRNYLRFVDVDRLLYNFRANHRLSTAGAAATGGWDAPTFPFRTHVQGHFLTAWAQLY 125
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIG-----SGYLSAFPTEQFDRLE--ALIPVWAPYY 246
A T + + ++K + +V+ L+ CQ G +GYLS +P F LE L PYY
Sbjct: 126 AVTGDTTCRDKATRMVAELAKCQANNGAAGFNTGYLSGYPESDFTALEQRTLSNGNVPYY 185
Query: 247 TIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
TIHK LAGLLD + + + +A L + W V++ R+ ++ L E
Sbjct: 186 TIHKTLAGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRLTG-------QQMQAMLQTEF 237
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMR 362
GGMN VL L+ T D + L A FD LA D +SG H+NT +P IG+
Sbjct: 238 GGMNAVLTDLYQQTGDARWLTAARRFDHAAVFDPLASNQDRLSGLHANTQVPKWIGAARE 297
Query: 363 YEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYN 412
Y+ TG +++ + G N +F++ P +A L+ +T ESC T+N
Sbjct: 298 YKATGTTRYRDIATNAWSITVAAHTYAIGGNSQAEHFRA-PNAIAGFLNQDTCESCNTFN 356
Query: 413 MLKVSRHLFRWT-KEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSK----E 466
ML ++R LF A DYYER+ N ++G Q + G + Y PL PG +
Sbjct: 357 MLVLTRELFALDPNRAALFDYYERAWLNQMIGQQNPADDHGHVTYFTPLRPGGRRGVGPA 416
Query: 467 RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIV 526
W T +FWCC GTG+E ++L DS+Y+ + + + ++ S L W I
Sbjct: 417 WGGGTWSTDYGTFWCCQGTGLEMHTRLMDSVYYRSDTT---LIVNMFVPSVLTWSERGIT 473
Query: 527 VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG--QDLPLPS 584
V Q D LRVT + G T ++ LRIP WTS GA ++NG QD+ +
Sbjct: 474 VTQTTDYPAGDTTTLRVTGSV-----GGTWAMRLRIPGWTS--GATISVNGTAQDIAT-T 525
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
PG++ ++T++W+S D +T++LP+ + + + A+I AI YGP VL+G
Sbjct: 526 PGSYATLTRSWTSGDTVTVRLPMRI----VMRAANDNANIAAITYGPVVLSG 573
>gi|336321977|ref|YP_004601945.1| hypothetical protein Celgi_2884 [[Cellvibrio] gilvus ATCC 13127]
gi|336105558|gb|AEI13377.1| protein of unknown function DUF1680 [[Cellvibrio] gilvus ATCC
13127]
Length = 781
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 189/581 (32%), Positives = 286/581 (49%), Gaps = 80/581 (13%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY-GGWEE- 172
++ +L +V LG +S+ RAQQ ++ VD+++ FR+ A L G GGWEE
Sbjct: 86 VRPFNLTEVSLG-ESVFTRAQQQMVDLARAYPVDRVLVVFRRNANLDVRGASAPGGWEEL 144
Query: 173 -PSCE---------------------LRGHFVGHYLSASALMWASTHNESLKEKMSAVVS 210
P+ + LRGH+ GH+LS A+ +A+T ++++ +K+ V
Sbjct: 145 GPAPDEQRWGPAEYVRGQNTRGAGGLLRGHYGGHFLSMLAMAYATTGDQAILDKVDDFVD 204
Query: 211 ALSACQKEIGS-------GYLSAFPTEQFDRLEALIP---VWAPYYTIHKILAGLLDQYT 260
L C+ + + G+L+A+ QF LEA P +WAP+YT HKILAGL+D Y
Sbjct: 205 GLEECRAALAATGKYSHPGFLAAYGEWQFSALEAYAPYGEIWAPWYTCHKILAGLIDAYR 264
Query: 261 YADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDP 319
Y +A AL++ + + + R+ + +ER W + EAGGMND L L+ ++
Sbjct: 265 YTGSALALQLAEGLGRWTHARLSACTPE-QLERMWGIYIGGEAGGMNDALVDLYTLSAAA 323
Query: 320 KH---LMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE--- 373
L A LFD + A D ++G H+N HIP +G TGD +
Sbjct: 324 DRDDFLAAAALFDLRSLVTACAQDRDTLNGKHANMHIPTFVGYAKLGAWTGDATYTAATR 383
Query: 374 --------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK 425
G GT G ++ +A ++ ESC YNMLKV+R LF +
Sbjct: 384 NFFGMIVPGRMYAHGGTGEGEMWGPAN--TVAGDIGPRNAESCAAYNMLKVARTLFFEQQ 441
Query: 426 EIAYADYYERSLTNGVLGIQR---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCC 482
+ AY DYYER++ N +LG +R T +Y+ P+ PG+ KE + GT CC
Sbjct: 442 DPAYMDYYERTVLNHILGGKRDQASTTSPQNLYMFPVGPGARKEYGNGNIGT------CC 495
Query: 483 YGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLR 542
GTG+ES K DSI+F +++ Y+ S L W S + + Q+ D LR
Sbjct: 496 GGTGLESPVKYQDSIWFRSADD-SALWVNLYVPSELRWTSRGLRIVQEGDYPNDETVTLR 554
Query: 543 VTLTFSSKGSGLTTSLNLRIPTWTSS-----NGAKATLNGQDLPLPSPGNFLSVTKTWSS 597
+ ++G+G L LR+P W +S NG AT+ +PG +LSV +TW++
Sbjct: 555 I-----AEGAG-ELDLRLRVPAWATSFVVAVNG--ATVASTAAGTATPGTYLSVDRTWAA 606
Query: 598 DDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
D++TI L L LR E DRP+ IQ++ GP VL+ S
Sbjct: 607 GDQVTITLALPLRAEPTI-DRPD---IQSLQRGPVVLSALS 643
>gi|392964292|ref|ZP_10329713.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
gi|387847187|emb|CCH51757.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
Length = 739
Score = 252 bits (644), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 169/553 (30%), Positives = 273/553 (49%), Gaps = 63/553 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ +L +VRL S +AQ +L+Y+L L+ DKL+ + A LP + YG WE S
Sbjct: 1 MQPFTLQEVRLTSGPFK-QAQDVDLKYILALNPDKLLAPYLIDAGLPLKAQRYGNWE--S 57
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--F 232
L GH GHYLSA A+M+AST LK+++ ++ L+ CQ + G+GY+ P + +
Sbjct: 58 VGLDGHIGGHYLSALAMMYASTGEPELKKRLDYMIGELARCQAKNGNGYVGGIPQGKVFW 117
Query: 233 DRLE---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
DR+ L W P Y IHK+ AGL D Y YA N +A ++ + ++F
Sbjct: 118 DRIHKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDAYAYAGNGQAKQVLIGLGDWFVE--- 174
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+IK S E+ Q L E GG+N+ L+ +T D K+L A L L Q D
Sbjct: 175 -LIKPLSDEQIQQVLRTEHGGINETFADLYILTNDKKYLETAQRLSHRALLYPLLEQQDK 233
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQLESSGTNIG-HFNFKS 391
++G H+NT IP VIG + +TG E + G ++ HFN +
Sbjct: 234 LTGLHANTQIPKVIGFEKIATLTGKTDWSEAAMYFWRNVSQTRSVAFGGNSVREHFNPTT 293
Query: 392 DPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
D + L SN E+C ++NML++S+ LF +++Y D+YER+L N +L Q E
Sbjct: 294 D---FSQVLRSNQGPETCNSFNMLRLSKALFLDKNDVSYLDFYERTLYNHILSSQH-PEK 349
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G +Y P+ P Y + S WCC G+G+E+ +K G+ IY +++
Sbjct: 350 GGFVYFTPIRPN-----HYRVYSQSETSMWCCVGSGLENHTKYGELIYSHSTND---LFV 401
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS-- 568
+I S L+WK + +NQ+ + PY T + S+ +R P W +
Sbjct: 402 NLFIPSTLNWKEKGVRLNQRTN-----FPYENGTELVVQQAKPQVFSVQIRYPKWAENLE 456
Query: 569 ---NGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
NG + +NG+ P ++++++ W + D +T++ + R E + P+ ++
Sbjct: 457 VLVNGKQQAVNGK------PSEYVAISRKWKAGDIITVRFKTSTRLEQL----PDGSNWA 506
Query: 626 AILYGPYVLAGHS 638
A ++GP VLA +
Sbjct: 507 AFVHGPIVLAAKT 519
>gi|383641951|ref|ZP_09954357.1| hypothetical protein SchaN1_14318 [Streptomyces chartreusis NRRL
12338]
Length = 768
Score = 252 bits (643), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 182/569 (31%), Positives = 280/569 (49%), Gaps = 58/569 (10%)
Query: 102 PGQFKVPERSGEFLKEVSLHDVRLGSDSM---HWRAQQTNLE-YLLMLDVDKLVWNFRKT 157
P +P + VS H LG + W Q YL +DVD+L++NFR
Sbjct: 31 PAHAAIPPARADI--GVSAHPFELGQVRLTASRWLDNQDRTRNYLRFVDVDRLLYNFRAN 88
Query: 158 ARLPAPGEPY-GGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQ 216
RL G GGW+ P R H GH+L+A A ++A T + + ++K + +V+ L+ CQ
Sbjct: 89 HRLSTNGAAANGGWDAPDFPFRTHVQGHFLTAWAQLYAVTGDTTCRDKATTMVAELAKCQ 148
Query: 217 KE-----IGSGYLSAFPTEQFDRLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEA-- 267
+GYLS +P F LE L PYYTIHK L GLLD + + + +A
Sbjct: 149 ANNSTAGFNAGYLSGYPESDFTALEQRTLSNGNVPYYTIHKTLVGLLDVWRHIGSTQARD 208
Query: 268 --LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA 325
L + W V++ R+ S ++ L E GGMN VL L+ T D + L +A
Sbjct: 209 VLLALAGW-VDWRTGRL-------SGQQMQAMLQTEFGGMNTVLTDLYQQTGDARWLTVA 260
Query: 326 HLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GH 375
FD LA D +SG H+NT +P IG+ Y+ TG +++
Sbjct: 261 RRFDHAAVFDPLAAGQDQLSGLHANTQVPKWIGAAREYKATGTTRYRDIATNAWNICVNS 320
Query: 376 QLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWT-KEIAYADYYE 434
+ G N +F++ P +A L+ +T ESC T+NML ++R LF +A DYYE
Sbjct: 321 HTYAIGGNSQAEHFRA-PNAIAGFLNKDTCESCNTFNMLTLTRELFALDPNRVALFDYYE 379
Query: 435 RSLTNGVLGIQR-GTEPGVMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIES 489
R+ N ++G Q + G + Y PL PG + W T +FWCC GTG+E
Sbjct: 380 RAWLNQMIGQQNPADDHGHVTYFTPLNPGGRRGVGPAWGGGTWSTDYGTFWCCQGTGLEM 439
Query: 490 FSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSS 549
++L DSIYF + + + ++ S L+W I V Q S+ TL +
Sbjct: 440 HTRLMDSIYFRSDNT---LIVNMFVPSVLNWSERGITVTQ----TTSYPNSDTTTLHVTG 492
Query: 550 KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLT 608
SG T ++ +RIP+WT+ GA ++NG + +PG++ +++++W+S D +T++LP+
Sbjct: 493 NASG-TWAMRIRIPSWTT--GATVSVNGVAQTITTTPGSYATLSRSWASGDTVTVRLPM- 548
Query: 609 LRTEAIQDDRPEYASIQAILYGPYVLAGH 637
I + A++ AI YGP VL+G+
Sbjct: 549 ---RVIMRAANDNANVAAITYGPVVLSGN 574
>gi|113970330|ref|YP_734123.1| hypothetical protein Shewmr4_1993 [Shewanella sp. MR-4]
gi|113885014|gb|ABI39066.1| protein of unknown function DUF1680 [Shewanella sp. MR-4]
Length = 795
Score = 252 bits (643), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 173/551 (31%), Positives = 283/551 (51%), Gaps = 60/551 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
L + L+DVRL + AQQT+L Y++ +D ++L+ +RK A + + Y WE +
Sbjct: 28 LTPIPLNDVRLTAGPF-LHAQQTDLAYIMSMDPERLLAPYRKEAGIATTADNYPNWE--N 84
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
L GH GHYLSA ALM+A+T ++++ E+++ +V+ L CQ+ G+GY+ P D+
Sbjct: 85 TGLDGHIGGHYLSALALMYAATGDQAVLERLNYMVAELEKCQQAHGNGYVGGVP--HGDK 142
Query: 235 L---------EA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNR 281
L EA L W P+Y +HK+ AGL D Y Y N A +M ++ +
Sbjct: 143 LWQQVAAGHIEADLFTLNQSWVPWYNLHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDL 202
Query: 282 VQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQA 341
+N+ E+ L E GG+N+ L ++ IT K+L LA+ + L L
Sbjct: 203 SRNLTD----EQLQLMLRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQHQ 258
Query: 342 DDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG---------HQLESS--GTNIG-HFNF 389
+ ++G H+NT IP ++G E++ ++ E HQ S G ++ HF+
Sbjct: 259 EKLTGLHANTQIPKIVGVARIAELSHNKAWLESADYFWQQVVHQRTVSIGGNSVREHFHP 318
Query: 390 KSDPKRLASNLDS-NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGT 448
D +S LDS E+C TYNMLK+S+ L+ +++ Y DYYER+L N +L Q
Sbjct: 319 SED---FSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-P 374
Query: 449 EPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGV 508
+ G ++Y P+ P Y + + +S WCC G+GIE+ +K G+ IY EE+ +
Sbjct: 375 QTGGLVYFTPMRPD-----HYRVYSSAQESMWCCVGSGIENHAKYGELIYAEEDNN---L 426
Query: 509 YIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS 568
++ ++ S ++WK+ I ++QK P + + + T LNLR PTW
Sbjct: 427 FVNLFVDSEVNWKAKGISLSQKTQ-----FPDDNTSQMIIHQEADFT--LNLRYPTWAKG 479
Query: 569 NGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAI 627
+ ++NG+ P+ G ++ +T+ W D +TI LP+ + E + D Y ++
Sbjct: 480 D-VTVSINGEPQRFTPTQGQYIPLTRHWRKGDSVTITLPMDISLEQLPDKTAYY----SV 534
Query: 628 LYGPYVLAGHS 638
LYGP VLA +
Sbjct: 535 LYGPIVLAAKT 545
>gi|294775898|ref|ZP_06741397.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|294450267|gb|EFG18768.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 783
Score = 252 bits (643), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 173/546 (31%), Positives = 265/546 (48%), Gaps = 61/546 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
+ DVRL + A+ ++ YLL +D D+L+ + K A L E Y WE + L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWE--NTGLDG 89
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLE- 236
H GHYLSA + M+A+T N+ +K ++ ++S L CQ G GYL P + + +E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIED 149
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
L W P Y IHKI AGL D N EA +++T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTGNKEAKEMLVKLTDWMIR-------- 201
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
++ K S E+ L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ G++ E + + IG HF+ D
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L+ + + + DYYER+L N +L Q + G
Sbjct: 322 ---FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHFMDYYERALYNHILSTQDPVQGG 378
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ ++ G+ IY ++ +Y+
Sbjct: 379 -FVYFTPMRAG-----HYRVYSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVN 429
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W G I + Q+ ++ TL S + +L RIP WT
Sbjct: 430 LFIPSTLRW--GDIQIEQQ----TAFPDEEETTLVISPEKGKKEFTLLFRIPEWTKPEAL 483
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
++NG+ + ++S+ +TWS DK+ ++LP+ LR A+ D Y +ILYGP
Sbjct: 484 CLSVNGKRQNVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGP 539
Query: 632 YVLAGH 637
VLA
Sbjct: 540 IVLAAR 545
>gi|169596765|ref|XP_001791806.1| hypothetical protein SNOG_01152 [Phaeosphaeria nodorum SN15]
gi|111069681|gb|EAT90801.1| hypothetical protein SNOG_01152 [Phaeosphaeria nodorum SN15]
Length = 620
Score = 251 bits (642), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 183/553 (33%), Positives = 279/553 (50%), Gaps = 62/553 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQT-NLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEE 172
L +VSL + R W+ + L YL ++VD+L++NFR T +L G +P GGW+
Sbjct: 39 LSQVSLSNSR-------WKDNENRTLNYLKAVNVDRLLYNFRATHKLSTNGAQPNGGWDA 91
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIG-----SGYLSAF 227
P+ R H GHYL+A +A+ + K + S V L+ CQ G +GYLS F
Sbjct: 92 PNFPFRSHAQGHYLTAWVHCYATLRDNECKNRASYFVQELAKCQANNGAAQFSTGYLSGF 151
Query: 228 PTEQFDRLEA--LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNV 285
P +F LEA L PYY +HK +AGLLD + + +A + + + R
Sbjct: 152 PESEFVALEAGQLKGGNVPYYAVHKTMAGLLDAWRIIGDTKARDVLLALAGWVDGRT--- 208
Query: 286 IKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDIS 345
KK S + L E GGMNDVL ++ +T + + L +A FD LA D +S
Sbjct: 209 -KKLSSSQMQTMLGTEFGGMNDVLAAIYQLTGNQQWLTVAQRFDHASQFDPLANNQDRLS 267
Query: 346 GFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIG-HFNFKSDP 393
G H+NT +P IG+ Y+ TG + + + H G + HF P
Sbjct: 268 GNHANTQVPKWIGAAREYKSTGTKRYLDIAKNAWDFTINAHTYAIGGNSQAEHFR---PP 324
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE---IAYADYYERSLTNGVLGIQRGTEP 450
++++ L ++T E C TYNMLK++R L WT + Y DYYER+L N +LG Q T+
Sbjct: 325 NQISNFLTNDTAEQCNTYNMLKLTRDL--WTTDPSSTKYFDYYERALINHLLGAQNPTDN 382
Query: 451 -GVMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
G + Y PL G + W T +SFWCC GT +E+ +KL DSIYF +
Sbjct: 383 HGHITYFTPLKSGGRRGIGPAWGGGTWSTDYNSFWCCQGTALETNTKLMDSIYFYDSS-- 440
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
+Y+ + S LDWK + ++Q S T + ++ +RIP+W
Sbjct: 441 -ALYVNLFTPSTLDWKQRSVKISQVTTFPAS-------DTTTLTVTGTGNWAMKIRIPSW 492
Query: 566 TSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
TS GA ++N Q + + PG++ ++++ W S D +T++LP+ LRT A + A+I
Sbjct: 493 TS--GATISINRQASGVAANPGSYATLSRDWKSGDIVTVKLPMKLRTVAAN----DNANI 546
Query: 625 QAILYGPYVLAGH 637
A+ +GP +L+G+
Sbjct: 547 AAVAFGPVILSGN 559
>gi|251795999|ref|YP_003010730.1| hypothetical protein Pjdr2_1987 [Paenibacillus sp. JDR-2]
gi|247543625|gb|ACT00644.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 626
Score = 251 bits (642), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 168/523 (32%), Positives = 253/523 (48%), Gaps = 72/523 (13%)
Query: 160 LPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
+ P + GWE +CELRGH +GH+LSA+A ++A T + +K K +V L CQ+
Sbjct: 62 MNGPEHWHWGWESVTCELRGHIMGHWLSAAAQIYAQTSDALVKAKADYIVEELVRCQEAN 121
Query: 220 GSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFY 279
G +L+AFP R+ VWAP+YTIHK+L GL D Y A N +ALR+ + ++FY
Sbjct: 122 GGEWLAAFPESYMHRIAKGSFVWAPHYTIHKLLMGLYDMYAIAGNEQALRVMRGIADWFY 181
Query: 280 NRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
N +S E + L+ E GGM +V L+ IT++ KHL L +D+ F L
Sbjct: 182 KWTGN----FSQEEMDELLDLETGGMLEVWADLYGITKEDKHLNLVKRYDRRRFFDALLE 237
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK-------------EGHQLESSGTNIGH 386
D ++ H+NT IP ++G+ +EVTG+ ++ G+ +G N
Sbjct: 238 GQDVLTNKHANTQIPEILGAARAWEVTGEDRYRRIVEAFWRLAVTDRGYVATGAGDNGEL 297
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
+ + + + S L +E C YNM++++ L RWT + AYADY+ER NGVL Q
Sbjct: 298 WMPRGE---MGSRLGVG-QEHCCNYNMMRLAHVLLRWTGDPAYADYWERRFYNGVLAHQH 353
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
G + G++ Y L + GS K WGTP+ FWCC+GT +++ + I+ E+E
Sbjct: 354 G-DTGMISYFLGMGAGSKKS-----WGTPTQHFWCCHGTLMQANAAYESQIFMEDEN--- 404
Query: 507 GVYIIQYISSRL-------------------------DWKSGQIVVNQKVD--PVVSWDP 539
G+ I Q+I S L +W + KVD P+ P
Sbjct: 405 GIAICQWIPSELQLSRADGNLRIRIEQDGQYGVYPLNNWSVKGMTAITKVDMPPIPEHRP 464
Query: 540 YLRVTLTFSSKGSGLTTSLNLRIPTWTSS------NGAKATLNGQDLPLPSPGNFLSVTK 593
V T L LR+P W S NG++ N P ++ ++ +
Sbjct: 465 DRFVYTVTIGLEHASTFELKLRLPWWLSGPPVIRVNGSQVEQNEA-----KPSSYTAIAR 519
Query: 594 TWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
WS+ D +T++LP TL E + D YA GP V+AG
Sbjct: 520 EWSNGDVVTVELPKTLTMEPLPGDTGTYAFFD----GPIVMAG 558
>gi|399074049|ref|ZP_10750795.1| hypothetical protein PMI01_01867 [Caulobacter sp. AP07]
gi|398040822|gb|EJL33912.1| hypothetical protein PMI01_01867 [Caulobacter sp. AP07]
Length = 775
Score = 251 bits (642), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 181/579 (31%), Positives = 281/579 (48%), Gaps = 65/579 (11%)
Query: 89 LFSWAMLYRKIKNPGQFKVPERSGEFLKE-VSLHDVRLGSDSMHWRAQQTNLEYLLMLDV 147
L S AM + +PG P +G + E V V L S+ +AQ N YL+ L
Sbjct: 13 LASSAMAFVGAASPG-LAAP--AGRVVAEPVPARHVAL-KPSIFQQAQAANRAYLVSLSA 68
Query: 148 DKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSA 207
D+L+ NF + A L YGGWE S + GH +GHYL+A AL A T + L ++++
Sbjct: 69 DRLLHNFHQGAGLSVKAPVYGGWEAQS--IAGHTLGHYLTACALQVAGTGDPVLSDRLTY 126
Query: 208 VVSALSACQKEIGSGYL----------SAFPTEQFDRLE---------ALIPVWAPYYTI 248
+V+ L+ Q G GY+ +A + F+ L +L W P YT
Sbjct: 127 IVAELARVQAAHGDGYVGGTTRWGQSDAAGGKQVFEELRRGDIRASRFSLNDGWVPIYTW 186
Query: 249 HKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDV 308
HK+ AGLLD + A AL + + YF +++ S + Q L E GG+N+
Sbjct: 187 HKVHAGLLDAHRLAGTPRALAVAVGLAGYF----ATIVEGLSDAQVQQILITEHGGINEA 242
Query: 309 LYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGD 368
+ + +T D + L +A L +A D+++G H+NT IP VIG YEV GD
Sbjct: 243 YAETYALTGDERWLKVARRLRHKAVLDPIAEGRDELAGLHANTQIPKVIGLARLYEVGGD 302
Query: 369 -----------QLHKEGHQLESSG-TNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKV 416
Q+ E H G ++ HF P +A ++ T E+C TYNMLK+
Sbjct: 303 PAEARAARFFHQVVTENHSYVIGGNSDREHFG---KPNEIARHMAETTCEACNTYNMLKL 359
Query: 417 SRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPS 476
+R L+ W A DYYER+ N ++ QR ++ G+ +Y +P+A G RSY TP
Sbjct: 360 TRRLWSWAPNGALFDYYERAQLNHIMAHQRPSD-GMFVYFMPMAAGG--RRSYS---TPE 413
Query: 477 DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVS 536
DSFWCC G+G+ES +K DSI++ +Y+ ++ SRLD G ++ +D
Sbjct: 414 DSFWCCVGSGMESHAKHADSIWWRGGDT---LYLNLFLPSRLDLPDGDFAID--LDTRYP 468
Query: 537 WDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWS 596
+ +R+++ + + LR+P W ++ K +NG + P + + + W
Sbjct: 469 AEGLVRLSVV---RAPSAEREIALRLPAWCAAPLVK--VNGAAIGRPGRDGYARLKRRWK 523
Query: 597 SDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+ D++ + LP+ LR E DD ++ A + GP VLA
Sbjct: 524 AGDRIELVLPMHLRAEPTPDD----PNLVAFVSGPLVLA 558
>gi|117920524|ref|YP_869716.1| acetyl-CoA carboxylase, biotin carboxylase [Shewanella sp. ANA-3]
gi|117612856|gb|ABK48310.1| acetyl-CoA carboxylase, biotin carboxylase [Shewanella sp. ANA-3]
Length = 795
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 171/550 (31%), Positives = 282/550 (51%), Gaps = 58/550 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
L + L+DVRL + AQQT+L Y++ +D ++L+ +RK A + + Y WE +
Sbjct: 28 LTPIPLNDVRLTAGPF-LHAQQTDLAYIMSMDPERLLAPYRKAAGIATTADNYPNWE--N 84
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
L GH GHYLSA ALM+A+T ++++ +++ +V+ L CQ+ G+GY+ P D+
Sbjct: 85 TGLDGHIGGHYLSALALMYAATGDQAVLSRLNYMVAELEKCQQAHGNGYVGGVP--HGDK 142
Query: 235 L---------EA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNR 281
L EA L W P+Y +HK+ AGL D Y Y N A +M ++ +
Sbjct: 143 LWQQVAAGHIEADLFTLNQSWVPWYNVHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDL 202
Query: 282 VQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQA 341
+N+ S E+ L E GG+N+ L ++ IT K+L LA+ + L L
Sbjct: 203 SRNL----SDEQLQLMLRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQHQ 258
Query: 342 DDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG---------HQLESS--GTNIGHFNFK 390
D ++G H+NT IP ++G E++ ++ E HQ S G ++ +
Sbjct: 259 DKLTGLHANTQIPKIVGVARIAELSNNKEWLESADYFWQQVVHQRTVSIGGNSVREYFHP 318
Query: 391 SDPKRLASNLDS-NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTE 449
S+ +S LDS E+C TYNMLK+S+ L+ +++ Y DYYER+L N +L Q +
Sbjct: 319 SE--DFSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-PQ 375
Query: 450 PGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVY 509
G ++Y P+ P Y + + +S WCC G+GIE+ +K G+ IY EE+ ++
Sbjct: 376 TGGLVYFTPMRPD-----HYRVYSSAQESMWCCVGSGIENHAKYGELIYAEEDNN---LF 427
Query: 510 IIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN 569
+ ++ S + WK+ I ++QK P + + + T LNLR PTW
Sbjct: 428 VNLFVDSEVHWKAKGISLSQKTQ-----FPDDNTSQMIIHQEADFT--LNLRYPTWAKGE 480
Query: 570 GAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAIL 628
++NG+ P+ G ++ +T+ W D +TI LP+ + E + P+ ++ ++L
Sbjct: 481 -VTVSINGEPQRFTPTQGQYIPLTRHWRKGDSVTITLPMDISLEQL----PDKSAYYSVL 535
Query: 629 YGPYVLAGHS 638
YGP VLA +
Sbjct: 536 YGPIVLAAKT 545
>gi|313204495|ref|YP_004043152.1| hypothetical protein Palpr_2030 [Paludibacter propionicigenes WB4]
gi|312443811|gb|ADQ80167.1| protein of unknown function DUF1680 [Paludibacter propionicigenes
WB4]
Length = 788
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 180/578 (31%), Positives = 286/578 (49%), Gaps = 54/578 (9%)
Query: 85 EQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLM 144
+++ +F+ A+ + NP F + ++ + DVRL ++S A+ ++ YLL
Sbjct: 3 KKNLIFNLAVALLCLVNP--FAANAQLAAKVESFPVSDVRL-TESPFKHAEDMDINYLLG 59
Query: 145 LDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEK 204
LD D+L+ + K L E Y WE + L GH GHYLSA + M+A+T N +KE+
Sbjct: 60 LDADRLMAPYLKGGGLTPKAENYPNWE--NTGLDGHIGGHYLSALSYMYAATGNTRIKER 117
Query: 205 MSAVVSALSACQKEIGSGYLSAFPTEQ--FDRLE---------ALIPVWAPYYTIHKILA 253
+ ++ L Q G GYL P + +D ++ L W P Y IHK A
Sbjct: 118 LDYSLNELKRAQDAAGDGYLGGTPNGRKIWDEIKKGTINASSFGLNGGWVPLYNIHKTYA 177
Query: 254 GLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLF 313
GL D Y + A M + ++ YN V + E L E GG+N+V +
Sbjct: 178 GLRDAYLQGGSLLAKDMLIKLTDWMYNTVSGLTDAQVQE----MLKSEHGGLNEVFADVA 233
Query: 314 CITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE 373
IT + K+L LAH F L LL D ++G H+NT IP VIG + ++ G++ +
Sbjct: 234 SITGNKKYLELAHKFSHQTLLQLLLQHQDKLTGMHANTQIPKVIGFKRIADLEGNKDWSD 293
Query: 374 GHQ------LESSGTNIG------HFNFKSDPKRLASNLDSNT-EESCTTYNMLKVSRHL 420
+++ +IG HF+ SD S +S E+C TYNML++++ L
Sbjct: 294 AASFFWKTVVDNRSVSIGGNSVREHFH-PSD--NFTSMFESEQGPETCNTYNMLRLTKLL 350
Query: 421 FRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFW 480
F+ + E ++ DYYER+L N +L Q + G +Y P+ G Y + P SFW
Sbjct: 351 FQTSGEASFMDYYERALYNHILSTQDPIQGG-FVYFTPMRAGH-----YRVYSQPQTSFW 404
Query: 481 CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPY 540
CC G+G+E+ ++ G+ IY ++ +Y+ +I S L WK+ I + Q+ + +
Sbjct: 405 CCVGSGLENHARYGEMIYGFKDND---LYVNLFIPSVLTWKAKNIRIEQQNN----FAKQ 457
Query: 541 LRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDK 600
+ +K + L T L++R P W N K ++NGQ P+ +LS+T+ WS DK
Sbjct: 458 EAADIIVDAKKTALFT-LHIRKPEWVKDNDLKVSVNGQSTPVTIKDGYLSITRNWSKGDK 516
Query: 601 LTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
+ ++LP+ LR D+ EY + LYGPYVLA +
Sbjct: 517 VHLELPMQLRAVTTPDNAQEY----SFLYGPYVLAAKT 550
>gi|374321589|ref|YP_005074718.1| hypothetical protein HPL003_08660 [Paenibacillus terrae HPL-003]
gi|357200598|gb|AET58495.1| hypothetical protein HPL003_08660 [Paenibacillus terrae HPL-003]
Length = 755
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 183/546 (33%), Positives = 281/546 (51%), Gaps = 55/546 (10%)
Query: 116 KEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC 175
K LH V + S + + A + N YLL L+ D+L+ FR+ A L Y GWE
Sbjct: 6 KAFDLHKVSIDSGPL-YHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWEARG- 63
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFD 233
+ GH +GHYLS ALM+AST +E L E+++ VV+ L CQ G+GY+S P E F+
Sbjct: 64 -ISGHTLGHYLSGCALMFASTGDERLLERVNYVVNELEICQNNHGNGYISGIPRGKELFE 122
Query: 234 RLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
++A L W P YT+HK+ AGL D + A + +AL+M + ++ +++
Sbjct: 123 EVKAGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLARHPKALQMEIKLGDW----LED 178
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
V K + ++ Q L+ E GGMN+VL L + + + L LA F L LA D +
Sbjct: 179 VFKGLNDDQVQQVLHCEFGGMNEVLTDLAEHSGEERFLRLAERFYHGEVLNDLADSRDTL 238
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQ-------------LHKEGHQLESSGTNIGHFNFKS 391
+G H+NT IP +IG+ +YE+TG +HK + + + N HF
Sbjct: 239 AGRHANTQIPKIIGAARQYEMTGKPQYADLSRFFWERVVHKHSYVIGGNSYN-EHF---G 294
Query: 392 DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+P +L L T E+C TYNMLK++RH+F W AYADYYER++ N +L Q+ + G
Sbjct: 295 EPGKLNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-G 353
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+ Y + L G K + + D F CC G+G+ES S G +IYF +Y+
Sbjct: 354 RVCYFVSLEMGGHKS-----FNSQYDDFTCCVGSGMESHSMYGTAIYFHTP---ETIYVN 405
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
QY+ S + W+ + + Q+ + R TL SK L T + LR P W + G
Sbjct: 406 QYVPSTVTWEEMDVQLKQE----TLFPQNGRGTLRVISKEPKLFT-IKLRCPHW-AEQGM 459
Query: 572 KATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NG++ + P +++ + + W+ D + +P+T+R E + P+ A +YG
Sbjct: 460 MIKINGEEYATEACPTSYVVIEREWNDADTIEYDIPMTVRIEEM----PDNPRRIAFMYG 515
Query: 631 PYVLAG 636
P VLAG
Sbjct: 516 PLVLAG 521
>gi|302872476|ref|YP_003841112.1| hypothetical protein COB47_1852 [Caldicellulosiruptor obsidiansis
OB47]
gi|302575335|gb|ADL43126.1| protein of unknown function DUF1680 [Caldicellulosiruptor
obsidiansis OB47]
Length = 587
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 174/564 (30%), Positives = 281/564 (49%), Gaps = 60/564 (10%)
Query: 125 LGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPA----PGEPYGGWEEPSCELRGH 180
L SDS ++ + + Y+ L + L+ NF + + + P + +GGWE P+C+LRGH
Sbjct: 15 LHSDSEYYNRFKLDRNYIASLKTENLLQNFYLESGIMSWSFLPQDIHGGWESPTCQLRGH 74
Query: 181 FVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIP 240
F+GH+LSA+A ++AS +E +K K +V L CQKE G ++ + P + F+ +
Sbjct: 75 FLGHWLSAAARIYASFGDEEIKGKADYIVDELERCQKENGGEWVGSIPEKYFEWMARGKW 134
Query: 241 VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNE 300
VWAP+YT+HK GL+D Y Y N +AL + +FY ++S E+ L+
Sbjct: 135 VWAPHYTVHKTFMGLVDMYKYTSNQKALEIADRWANWFYRWS----GQFSREKMDDILDY 190
Query: 301 EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
E GGM ++ +L+ IT+D K+ L + + L D ++G H+NT IP + G+
Sbjct: 191 ETGGMLEIWAELYNITKDSKYKELMERYYRGRLFDRLLNGEDVLTGRHANTTIPEIHGAA 250
Query: 361 MRYEVTGDQLHK------------EGHQLESSGTNIGHFNFKSDPK-RLASNLDSNTEES 407
+EVTG++ + E + G +G PK R+ + L +E
Sbjct: 251 RVWEVTGEEKFRKIVESYWREAVEERGYFCTGGQTLGEV---WTPKHRIRNYLGPTNQEH 307
Query: 408 CTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKER 467
C YNM++++ LFRWT + Y+DY ER++ NG+ QR + G++ Y LPL PGS K
Sbjct: 308 CVVYNMIRLAEFLFRWTGDKKYSDYIERNIYNGLFAQQR-LKDGMVTYFLPLMPGSQK-- 364
Query: 468 SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVV 527
WGTP++ FWCC+GT +++ + D IY++ GV I Q+I S + WK
Sbjct: 365 ---RWGTPTNDFWCCHGTLVQAHTIYNDIIYYKTPN---GVVISQFIPSFVTWK------ 412
Query: 528 NQKVDPVVSWDPYLRVTLTFSSKGSG------------LTTSLNLRIPTWTSSNGAKATL 575
+ K + + Y R +F+ + L +R P W + +
Sbjct: 413 DDKGNGITIKQYYGRRQESFAYTAEKDEICIEVQCKDPIEFELAIRKPWWAKK--IEVAV 470
Query: 576 NGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
N +DL +++ +T+ W+S DK+ I T+ T + DD P+ A + GP VL
Sbjct: 471 N-EDLNYGVDDSSYIKLTRRWNS-DKIKITFYKTVETCPMPDD-PQQV---AFMVGPVVL 524
Query: 635 AGHSIGDWDITESATSLSDWITPI 658
AG I + + + I PI
Sbjct: 525 AGLCERRRKIYINGRKIEEVIVPI 548
>gi|114047478|ref|YP_738028.1| hypothetical protein Shewmr7_1982 [Shewanella sp. MR-7]
gi|113888920|gb|ABI42971.1| protein of unknown function DUF1680 [Shewanella sp. MR-7]
Length = 795
Score = 251 bits (640), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 173/551 (31%), Positives = 282/551 (51%), Gaps = 60/551 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
L + L+DVRL + AQQT+L Y++ +D ++L+ +RK A + + Y WE +
Sbjct: 28 LTPIPLNDVRLTAGPF-LHAQQTDLAYIMSMDPERLLAPYRKEAGIATTADNYPNWE--N 84
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
L GH GHYLSA ALM+A+T ++++ E+++ +V+ L CQ+ G+GY+ P D+
Sbjct: 85 TGLDGHIGGHYLSALALMYAATGDQAVLERLNYMVAELEKCQQAHGNGYVGGVP--HGDK 142
Query: 235 L---------EA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNR 281
L EA L W P+Y +HK+ AGL D Y Y N A +M ++ +
Sbjct: 143 LWQQVAAGHIEADLFTLNQSWVPWYNLHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDL 202
Query: 282 VQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQA 341
+N+ E+ L E GG+N+ L ++ IT K+L LA+ + L L
Sbjct: 203 SRNLTD----EQLQLMLRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQHQ 258
Query: 342 DDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG---------HQLESS--GTNIG-HFNF 389
D ++ H+NT IP ++G E++ ++ E HQ S G ++ HF+
Sbjct: 259 DKLTRLHANTQIPKIVGVARIAELSHNKAWLESADYFWQQVVHQRTVSIGGNSVREHFHP 318
Query: 390 KSDPKRLASNLDS-NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGT 448
D +S LDS E+C TYNMLK+S+ L+ +++ Y DYYER+L N +L Q
Sbjct: 319 SED---FSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-P 374
Query: 449 EPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGV 508
+ G ++Y P+ P Y + + +S WCC G+GIE+ +K G+ IY EE+ +
Sbjct: 375 QTGGLVYFTPMRPD-----HYRVYSSAQESMWCCVGSGIENHAKYGELIYAEEDNN---L 426
Query: 509 YIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS 568
++ ++ S ++WK+ I ++QK P + + + T LNLR PTW
Sbjct: 427 FVNLFVDSEVNWKAKGISLSQKTQ-----FPDDNTSQMIIHQEADFT--LNLRYPTWAKG 479
Query: 569 NGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAI 627
+ ++NG+ P+ G ++ +T+ W D +TI LP+ + E + D Y ++
Sbjct: 480 D-VTVSINGEPQRFTPTQGQYIPLTRHWRKGDSVTITLPMDISLEQLPDKTAYY----SV 534
Query: 628 LYGPYVLAGHS 638
LYGP VLA +
Sbjct: 535 LYGPIVLAAKT 545
>gi|393718114|ref|ZP_10338041.1| hypothetical protein SechA1_00115 [Sphingomonas echinoides ATCC
14820]
Length = 789
Score = 251 bits (640), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 190/590 (32%), Positives = 282/590 (47%), Gaps = 67/590 (11%)
Query: 118 VSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCEL 177
+ L VRL S + A + N YLL L D+L+ NFR A L GE YGGWE S +
Sbjct: 39 LPLSAVRL-RPSDYATAVEVNRAYLLRLSADRLLHNFRAYAGLKPKGEVYGGWE--SDTI 95
Query: 178 RGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ-----F 232
GH +GHY+SA L+ T + K + +V L+ Q G+GY+ A ++
Sbjct: 96 AGHTLGHYMSALVLLHEQTGDAQAKRRADYIVDELADAQAARGNGYIGAMQRKRKDGTVV 155
Query: 233 DRLEALIPV---------------WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEY 277
D +E + W+P+YT+HK+ AGLLD + NA+AL + Y
Sbjct: 156 DAIEIFPEIIKGDIRSGGFDLNGAWSPFYTVHKLFAGLLDIHASWGNAKALSVAIAFAGY 215
Query: 278 FYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA-HLFDKPCFLGL 336
F + V + L E GG+N+ +LF T+D K L +A L+D+ L
Sbjct: 216 F----EPVFAALDDAQMQTMLGTEYGGLNESFAELFARTKDRKWLAIAERLYDRKVLDPL 271
Query: 337 LALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNIGH 386
A Q D ++ FH+NT +P +IG +E+TG+ H G N
Sbjct: 272 TAGQ-DKLANFHANTQVPKLIGLARIHELTGEPAKAAAPRFFWQAVTKHHSYVIGGNADR 330
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
F S+P ++ ++ T E C TYNMLK++R L+ W + A DYYER+ N V+ Q
Sbjct: 331 EYF-SEPDSISRHITEQTCEHCNTYNMLKLTRQLYSWQPDGALFDYYERAHLNHVMAAQD 389
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
G Y+ PL G+ + S + D+FWCC GTG+ES +K G+SI++E EG
Sbjct: 390 PKTAG-FTYMTPLLTGAVRGYST----SADDAFWCCVGTGMESHAKHGESIFWEGEG--- 441
Query: 507 GVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
+ + YI + W++ + +D ++P +TLT ++ ++ LR+P W
Sbjct: 442 ALLVNLYIPADATWRARGATLT--LDTRYPFEPTSTLTLTQLARPGRF--AIALRVPGWA 497
Query: 567 SSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQ-DDRPEYASIQ 625
+ A +NGQ + + V + W + D + I LPL LR EA DDR
Sbjct: 498 AGK-AVVRVNGQPVTPSFASGYAIVERRWKAGDSVAITLPLELRIEATPGDDR-----TV 551
Query: 626 AILYGPYVLA---GHSIGDWDITESATSLSDWI-----TPIPASYNSQLI 667
AIL GP VLA G + GDW + A +D + + PASY + I
Sbjct: 552 AILRGPMVLAADLGTTEGDWTSPDPALVGTDLLASFRPSATPASYTTSGI 601
>gi|408527846|emb|CCK26020.1| secreted protein [Streptomyces davawensis JCM 4913]
Length = 731
Score = 250 bits (638), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 181/556 (32%), Positives = 275/556 (49%), Gaps = 57/556 (10%)
Query: 111 SGEFLKEVSLHDVRLGSDSMHWRAQQTNL-EYLLMLDVDKLVWNFRKTARLPAPGEPY-G 168
+G + +L VRL + W Q YL +DVD+L++NFR +L G G
Sbjct: 8 AGVLAQPFALGQVRL--TAGRWLDNQNRTGNYLRFVDVDRLLYNFRANHKLSTNGAAANG 65
Query: 169 GWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSGY 223
GW+ P R H GH+L+A A ++A T + + ++K + +V+ L+ CQ GY
Sbjct: 66 GWDAPDFPFRTHIQGHFLTAWAQLYAVTGDTTCRDKATYMVAELAKCQANNSAAGFSPGY 125
Query: 224 LSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFY 279
LS +P F LE YYTIHK LAGLLD + + + +A L + W V++
Sbjct: 126 LSGYPEANFTALEQGTKGDVLYYTIHKTLAGLLDVWRHIGSTQARDVLLALAGW-VDWRT 184
Query: 280 NRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
R+ + E+ L E GGMN VL L T D + L +A FD LA
Sbjct: 185 GRLTS-------EQMQNMLRIEFGGMNAVLTDLHVRTGDARWLAVAQRFDHAAVFDPLAA 237
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE------GHQLESSGTNIG------HF 387
D ++G H+NT +P IG+ Y+ TG +++ L+S IG HF
Sbjct: 238 NQDKLNGLHANTQVPKWIGAAREYKATGTTRYRDIATNAWNITLDSHTYAIGGNSQAEHF 297
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE-IAYADYYERSLTNGVLGIQR 446
P +A L+ +T ESC T+NML ++R LF + A DYYER+ N ++G Q
Sbjct: 298 RA---PHAIAGFLNKDTCESCNTFNMLVLTRELFELDPDRAALFDYYERAWLNQMIGQQN 354
Query: 447 -GTEPGVMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
+ G + Y PL PG + W T +FWCC GTG+E ++L DSIY+
Sbjct: 355 PADDHGHVTYFTPLNPGGRRGVGPAWGGGTWSTDYGTFWCCQGTGLEMNTRLMDSIYYRR 414
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
+ + + ++ S L W I V Q S L+VT +G T ++ +R
Sbjct: 415 DDT---LIVNLFVPSVLTWPERGITVTQTTSYPNSDTTTLKVT-----GNAGGTWAMRIR 466
Query: 562 IPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
IP+WT+ GA ++NG + +PG++ ++++ WSS D +T++LP+ + A DD P
Sbjct: 467 IPSWTT--GASISVNGVAQTVATTPGSYATLSRAWSSGDTVTVRLPMRIILRA-ADDNP- 522
Query: 621 YASIQAILYGPYVLAG 636
++ A+ YGP VL+G
Sbjct: 523 --NVTAVTYGPVVLSG 536
>gi|290955577|ref|YP_003486759.1| hypothetical protein SCAB_10131 [Streptomyces scabiei 87.22]
gi|260645103|emb|CBG68189.1| putative secreted protein [Streptomyces scabiei 87.22]
Length = 786
Score = 250 bits (638), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 179/562 (31%), Positives = 279/562 (49%), Gaps = 57/562 (10%)
Query: 107 VPERS--GEFLKEVSLHDVRLGSDSMHWRAQQTNLE-YLLMLDVDKLVWNFRKTARLPAP 163
P R+ G L VRL + W Q + YL +DVD+L++NFR T +L
Sbjct: 56 APARTDIGVLAHPFELGQVRL--TASRWLDNQNRTQNYLRFIDVDRLLYNFRATHKLSTN 113
Query: 164 GE-PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE---- 218
G P GGW+ P+ R H GH+L+A A ++A T + + ++K + +V+ L+ CQ
Sbjct: 114 GATPNGGWDAPNFGFRTHIQGHFLTAWAQLYAVTGDTTCRDKATRMVAELAKCQANNSAA 173
Query: 219 -IGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTW 273
+GYLS +P F LE YYTIHK L GLLD + + +A L + W
Sbjct: 174 GFNTGYLSGYPESNFTALEQGTSGEVLYYTIHKTLTGLLDVWRLIGSTQARDVLLALAGW 233
Query: 274 MVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCF 333
V++ R+ ++ L E GGMN VL L+ T D + L +A FD
Sbjct: 234 -VDWRTGRLTG-------QQMQTMLRIEFGGMNTVLTDLYQQTGDARWLTVAQRFDHAAV 285
Query: 334 LGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTN 383
LA D ++G H+NT +P IG+ Y+ TG +++ + G N
Sbjct: 286 FDPLAANQDKLNGLHANTQVPKWIGAAREYKATGTTRYRDIATNAWNITVAAHTYAIGGN 345
Query: 384 IGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE-IAYADYYERSLTNGVL 442
+F++ P +A L+++T ESC T NML ++R L+ + + DYYER+ N ++
Sbjct: 346 SQAEHFRA-PNAIAGFLNNDTCESCNTVNMLTLTRELYTLDPDRVELFDYYERAWLNQMI 404
Query: 443 GIQR-GTEPGVMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSI 497
G Q + G + Y PL PG + W T SFWCC GTG+E ++L DSI
Sbjct: 405 GQQNPADDHGHVTYFTPLKPGGRRGVGPALGGGTWSTDYGSFWCCQGTGLEMHTRLMDSI 464
Query: 498 YFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTS 557
YF + + + ++ S L W I V Q S L+VT + S T +
Sbjct: 465 YFHNDTT---LTVNMFVPSVLTWTERGITVTQTTTYPTSDTTTLQVTGSVSG-----TWA 516
Query: 558 LNLRIPTWTSSNGAKATLNG--QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQ 615
+ +RIP WT+ GA ++NG Q++ +PG++ ++ ++W+S D +T++LP+ +
Sbjct: 517 MRIRIPGWTT--GAAVSVNGVAQNIT-TTPGSYATLNRSWTSGDTVTVRLPMRIGIRPAN 573
Query: 616 DDRPEYASIQAILYGPYVLAGH 637
D+ A++ AI YGP VL+G+
Sbjct: 574 DN----ANVAAITYGPVVLSGN 591
>gi|297203356|ref|ZP_06920753.1| secreted protein [Streptomyces sviceus ATCC 29083]
gi|297148382|gb|EDY55480.2| secreted protein [Streptomyces sviceus ATCC 29083]
Length = 723
Score = 250 bits (638), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 175/533 (32%), Positives = 267/533 (50%), Gaps = 54/533 (10%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWEEPSCELRGHFVGHYLSASALMW 193
Q YL +DVD+L++NFR RL G GGW+ P R H GH+L+A A ++
Sbjct: 21 QNRTQNYLRFVDVDRLLYNFRANHRLSTNGAVATGGWDAPDFPFRTHVQGHFLTAWAQLY 80
Query: 194 ASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLE--ALIPVWAPYY 246
A + + ++K + +V+ L+ CQ +GYLS +P F LE L PYY
Sbjct: 81 AVSGDTVCRDKATYMVAELAKCQANNSAAGFSAGYLSGYPESDFTALEQRTLSNGNVPYY 140
Query: 247 TIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
TIHK LAGLLD + + + +A L + W V++ R+ S ++ L E
Sbjct: 141 TIHKTLAGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRL-------SGQQMQTMLQTEF 192
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMR 362
GGMN VL L+ T D + L A FD LA D +SG H+NT +P IG+
Sbjct: 193 GGMNTVLTDLYQQTGDARWLTAARRFDHAAVFDPLASGQDQLSGLHANTQVPKWIGAARE 252
Query: 363 YEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYN 412
Y+ TG +++ + G N +F++ P +A L+ +T ESC T N
Sbjct: 253 YKATGTTRYRDIATNAWNFTVNAHTYAIGGNSQAEHFRA-PNAIAGYLNKDTCESCNTVN 311
Query: 413 MLKVSRHLFRWT-KEIAYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----E 466
ML ++R LF A DYYE++ N ++G Q + G + Y PL PG +
Sbjct: 312 MLTLTRELFALDPNRAALFDYYEQAWLNQMIGQQNPADGHGHVTYFTPLNPGGRRGVGPA 371
Query: 467 RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIV 526
W T +FWCC GTG+E ++L DS+YF + + + ++ S L+W I
Sbjct: 372 WGGGTWSTDYGTFWCCQGTGLEMHTRLMDSLYFRSDDT---LIVNLFVPSVLNWSERGIT 428
Query: 527 VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG--QDLPLPS 584
V Q S L+VT S T ++ +RIP WT+ GA ++NG QD+ +
Sbjct: 429 VTQTTSYPNSDTTTLQVTGNVSG-----TWAMRIRIPGWTA--GATISVNGTRQDIT-TT 480
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
PG++ ++T++W+S D +T++LP+ + A D+ ++ AI YGP VL+G+
Sbjct: 481 PGSYATLTRSWTSGDTVTVRLPMRVVMRAANDN----PNVAAITYGPVVLSGN 529
>gi|390943351|ref|YP_006407112.1| hypothetical protein Belba_1756 [Belliella baltica DSM 15883]
gi|390416779|gb|AFL84357.1| hypothetical protein Belba_1756 [Belliella baltica DSM 15883]
Length = 785
Score = 249 bits (637), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 171/543 (31%), Positives = 276/543 (50%), Gaps = 51/543 (9%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L VRL DS AQ+ + +Y+L +DVD+L+ + K A + E YG WE+ L G
Sbjct: 32 LDQVRL-LDSPFKNAQEVDKKYILEMDVDRLLAPYMKDAGIEWIAENYGNWEDTG--LDG 88
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ-------F 232
H GHYLSA ++M+AST + +K ++ ++ L Q + +GY+ P Q
Sbjct: 89 HIGGHYLSALSMMYASTGDIEIKSRLDYMIEQLKLAQDKNANGYIGGVPNGQKIWEEIRV 148
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
++A L W P Y IHKI AGL D Y A A+A M + ++FY+ + +
Sbjct: 149 GNIKAGSFSLNDRWVPLYNIHKIYAGLKDAYLIAGIADAKPMLIALSDWFYD----LTEG 204
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
+S + + L E GG+N+V + +T +PK+L LA L L+ + D+++G H
Sbjct: 205 FSEAQFQEILISEHGGLNEVFADVSAMTGNPKYLELAKKMSHNLILDPLSKRQDNLTGMH 264
Query: 349 SNTHIPIVIGSQMRYEVTGD-QLHKEGHQLESSGTN-----IG------HFNFKSDPKRL 396
+NT IP VIG Q +++ + + + + TN IG HF+ K D +
Sbjct: 265 ANTQIPKVIGFQRIAQLSDEAKWNNSATYFWENVTNQRSVSIGGNSVREHFHPKDDFSPM 324
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
S+ E+C TYNM+++S LF + + Y DYYER+L N +L Q T+ G +Y
Sbjct: 325 LSS--DQGPETCNTYNMMRLSEKLFESSPDRKYIDYYERALYNHILSSQHPTKGG-FVYF 381
Query: 457 LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P+ P + Y + P ++FWCC G+G+E+ +K G IY +E + +++ +I+S
Sbjct: 382 TPMRP-----QHYRVYSQPHENFWCCVGSGLENHAKYGQVIYAHKEDE---LFVNLFIAS 433
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
L W+ I + QK D S TL F KG L +R P W + +N
Sbjct: 434 ELSWEEKGIKLTQKTDFPFS----ESTTLQFDHKGKK-EFKLKIRYPDWVKGGAMEVKVN 488
Query: 577 GQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G+ P+ S ++ + + W S D++++ LP++ + E + D P +AS ++GP VLA
Sbjct: 489 GKSFPISLSKDGYVVIDRKWKSKDQVSVTLPMSTKVEYLADGSP-WAS---FVHGPIVLA 544
Query: 636 GHS 638
+
Sbjct: 545 AET 547
>gi|347738800|ref|ZP_08870212.1| hypothetical protein AZA_89687 [Azospirillum amazonense Y2]
gi|346918071|gb|EGY00199.1| hypothetical protein AZA_89687 [Azospirillum amazonense Y2]
Length = 804
Score = 249 bits (636), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 175/559 (31%), Positives = 271/559 (48%), Gaps = 54/559 (9%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A NL YL L+ D+L+ NFR A L G YGGWE + + GH +GHYLSA +LM
Sbjct: 53 AVDANLAYLHSLEADRLLHNFRSGAGLQPKGAAYGGWEGDT--IAGHTLGHYLSALSLMH 110
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLE----------------- 236
A T + K ++ +V+ L+ CQK G GY++ F ++ D +E
Sbjct: 111 AQTGDAECKRRVDYIVAELAECQKAQGDGYVAGFTRKRGDIVEDGKVVFDELRRGEIRSA 170
Query: 237 --ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
L W P Y HK+ GL D T N +AL + + Y + V + E+
Sbjct: 171 GFDLNGCWVPLYNWHKLYTGLFDAQTLCGNTQALDVGVKLGGY----IDEVFSHLNDEQV 226
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
+ L+ E GG+N+ +L+ T D + L+LA L L+ D+++ H+NT IP
Sbjct: 227 QKVLDCEHGGINESFAELYARTGDRRWLLLAERLYHAKVLVPLSEGRDELANIHANTQIP 286
Query: 355 IVIGSQMRYEVTGDQLHKEGHQL--ESSGTNIGHF-------NFKSDPKRLASNLDSNTE 405
+IG E+TG + H + ++ TN + + +P+ ++ ++ T
Sbjct: 287 KLIGLARLAELTGSERHAKASAFFWQTVTTNHSYVIGGNADREYFQEPRSISRHITEQTC 346
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
E C +YNMLK++R L+ + Y D+YER+ N VL Q+ G+ Y+ PL GS++
Sbjct: 347 EGCNSYNMLKLTRLLYARQADAHYFDFYERAHLNHVLA-QQNPATGMFTYMTPLMSGSAR 405
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
E S TP++ FWCC GTG+ES +K G+S+Y+ + V + YI S L W
Sbjct: 406 EFS-----TPTEDFWCCVGTGMESHAKHGESVYWRRGAEDLAVNL--YIPSTLTWGERGA 458
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSP 585
V VD + V LT + T +++ RIP W + GA +NG+ L
Sbjct: 459 V----VDLDTRYPEAETVLLTLKALKRPATFAVSFRIPAWCT--GATLAVNGKPQDLVVQ 512
Query: 586 GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDIT 645
+ V + W + D + ++LP+ LR E+ DD A A L+GP VLA +G +
Sbjct: 513 NGYAVVRREWKAGDAVALRLPMALRLESTNDD----ADTVAFLHGPLVLAA-DLGAAPKS 567
Query: 646 ESATSLSDWITPIPASYNS 664
E+ T S TP+ ++
Sbjct: 568 EAPTG-SPQPTPVSDAFQG 585
>gi|310639749|ref|YP_003944507.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus polymyxa
SC2]
gi|386038950|ref|YP_005957904.1| hypothetical protein PPM_0260 [Paenibacillus polymyxa M1]
gi|309244699|gb|ADO54266.1| Acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus polymyxa
SC2]
gi|343094988|emb|CCC83197.1| DUF1680 domain containing protein [Paenibacillus polymyxa M1]
Length = 751
Score = 249 bits (636), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 182/542 (33%), Positives = 276/542 (50%), Gaps = 55/542 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
LH V + S + + A + N YLL L+ D+L+ FR+ A L Y GWE + G
Sbjct: 8 LHKVSIDSGPL-YHAMELNTTYLLSLEPDRLLSRFREYAGLEPKAAHYEGWEARG--ISG 64
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA 237
H +GHYLS ALM+AST ++ L E+++ V+ L CQ G+GY+S P E F+ ++A
Sbjct: 65 HTLGHYLSGCALMFASTGDKRLLERVNYVIDELEICQNSHGNGYISGIPRGKEIFEEVKA 124
Query: 238 ---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
L W P YT+HK+ AGL D + A + +AL M + ++ +++V +
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLAHHPKALAMEIQLGDW----LEDVFQG 180
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
S E+ Q L+ E GGMN+VL L + + + L LA F L LA D ++G H
Sbjct: 181 LSDEQVQQVLHCEFGGMNEVLTDLAEHSGEKRFLNLAERFYHGEVLNDLADSRDTLAGRH 240
Query: 349 SNTHIPIVIGSQMRYEVTGDQL-------------HKEGHQLESSGTNIGHFNFKSDPKR 395
+NT IP +IG+ ++EVTG L HK + + + N HF +P +
Sbjct: 241 ANTQIPKIIGAARQFEVTGKPLYADLSRFFWDRVVHKHSYVIGGNSYN-EHF---GEPGK 296
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
L L T E+C TYNMLK++RH+F W AYADYYER++ N +L Q+ + G + Y
Sbjct: 297 LNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-GRVCY 355
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
+ L G K + + + F CC G+G+ES S G +IYF +Y+ QY+
Sbjct: 356 FVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTANT---IYVNQYVP 407
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S + W I + Q+ + R TL SK T + LR P W + G K +
Sbjct: 408 STVTWDEMNIQLKQE----TLFPQNGRGTLHLISKEPKFFT-IKLRCPHW-AEQGMKIKI 461
Query: 576 NGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
NG++ + P +++ + + W D + +P+T+R E + P+ A +YGP VL
Sbjct: 462 NGEEYAAEACPTSYIVIEREWKDGDTVEYDIPMTVRVEEM----PDNPRRIAFMYGPLVL 517
Query: 635 AG 636
AG
Sbjct: 518 AG 519
>gi|408357216|ref|YP_006845747.1| hypothetical protein AXY_18530 [Amphibacillus xylanus NBRC 15112]
gi|407727987|dbj|BAM47985.1| hypothetical protein AXY_18530 [Amphibacillus xylanus NBRC 15112]
Length = 755
Score = 249 bits (636), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 174/534 (32%), Positives = 265/534 (49%), Gaps = 61/534 (11%)
Query: 129 SMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSA 188
M +QQ EYLL LD+D+L+ + YGGWE S E+ GH +GH+LSA
Sbjct: 9 GMFKESQQKGKEYLLYLDIDRLIAPCYEAVGQEPRAPRYGGWE--SMEIAGHSIGHWLSA 66
Query: 189 SALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD-------RLE--ALI 239
++LM+ T + LK K+ + L+ Q GY+S FP + FD R++ L
Sbjct: 67 ASLMYNVTGDLLLKHKIDYAIDELAHVQAFDPEGYVSGFPRDCFDEVFTGEFRVDNFGLG 126
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHW 295
W P+Y+IHKI AGL+D Y A N +A ++++ W + K + E+
Sbjct: 127 GSWVPWYSIHKIYAGLVDAYRLASNEKAKTVLVKLSNW--------ADQGLSKLNDEQFQ 178
Query: 296 QTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPI 355
+ L E GGMN+ + ++ IT D + L LA F+ L L DD++G H+NT IP
Sbjct: 179 RMLICEFGGMNETMADVYEITGDKRFLKLAERFNHKAVLDPLIEGIDDLAGKHANTQIPK 238
Query: 356 VIGSQMRYEVTG------------DQLHKEGHQLESSGTNIGHFN-FKSDPKRLASNLDS 402
VIG+ Y++TG DQ+ +N HF ++P + S
Sbjct: 239 VIGAAKLYDMTGKEEYQKLSRFFWDQVVYHRSYAFGGNSNAEHFGPVDTEPLGIIST--- 295
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
E+C TYNMLK++ HLF W + Y DYYE +L N +LG Q E G+ Y +P PG
Sbjct: 296 ---ETCNTYNMLKLTEHLFDWQPDSRYMDYYENALYNHILGSQ-DPESGMKSYFIPTEPG 351
Query: 463 SSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
K + +P +SFWCC G+G+E+ ++ +IY K +Y+ +I S L
Sbjct: 352 HFKV-----YCSPDNSFWCCTGSGMENPARYTKNIYTR---KADSLYVNLFIPSTLTIAE 403
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL 582
+ Q+ D +D + T+ +G+G ++ LR P W + A +NG+ + L
Sbjct: 404 KDLQFIQETD--FPYDETVHFTV---KEGNGERLTVYLRKPNWLAGEMA-LQINGEPVAL 457
Query: 583 PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + + W +D +T QLP+ LRT + D+PE +A YGP +LAG
Sbjct: 458 ELVNGYYEIDRKWYKNDTVTFQLPMGLRTYTAK-DQPEK---KAFFYGPILLAG 507
>gi|284036341|ref|YP_003386271.1| hypothetical protein Slin_1422 [Spirosoma linguale DSM 74]
gi|283815634|gb|ADB37472.1| protein of unknown function DUF1680 [Spirosoma linguale DSM 74]
Length = 760
Score = 248 bits (634), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 168/549 (30%), Positives = 272/549 (49%), Gaps = 55/549 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ +L DV++ AQ +L+Y+L L+ +KL+ + A LP YG WE S
Sbjct: 22 MQPFALQDVKVTGGPFK-NAQDVDLKYILALNPNKLLAPYLIDAGLPEKAPRYGNWE--S 78
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--F 232
L GH GHYLSA A+M+AST N K+++ +V L+ CQ + G+GY+ P + +
Sbjct: 79 SGLDGHIGGHYLSALAMMYASTGNAETKKRLDYMVDQLAQCQAKNGNGYVGGIPQGKVFW 138
Query: 233 DRLE---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
+R+ L W P Y IHK+ AGL D Y YA N +A ++ + ++F
Sbjct: 139 ERIHKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDAYEYAGNQQAKQVLIGLGDWFV---- 194
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+IK S E+ Q L E GG+N+ L+ +T+D K+L A L L + D
Sbjct: 195 ELIKPLSDEQIQQVLRTEHGGINETFADLYILTKDQKYLETAQRISHRAILDPLIDKQDK 254
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGD-----------QLHKEGHQLESSGTNIG-HFNFKS 391
++G H+NT IP VIG + +TG Q + + G ++ HFN +
Sbjct: 255 LTGLHANTQIPKVIGFEKIATLTGKSDWSDAAQYFWQNVSQTRSVAFGGNSVREHFNPTT 314
Query: 392 DPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
D +L L SN E+C ++NML++S+ LF +++Y D+YER++ N +L Q E
Sbjct: 315 DFSQL---LRSNQGPETCNSFNMLRLSKALFLDKNDVSYLDFYERTMYNHILSSQH-PEK 370
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G +Y P+ P Y + P S WCC G+GIE+ +K G+ IY +++
Sbjct: 371 GGFVYFTPIRPN-----HYRVYSQPETSMWCCVGSGIENHTKYGELIYSHSAND---LFV 422
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+I S ++W ++ + Q+ PY + SLN+R P W +
Sbjct: 423 NLFIPSTVNWADKKLKLTQQTQ-----FPYQNQSELIIETSRPQELSLNIRYPKWAEN-- 475
Query: 571 AKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
+ +NG+ P+ P ++++V + W S DK+T++ T R E + P+ ++ A +
Sbjct: 476 LEVLVNGKAQPVTGKPASYVAVNRKWKSGDKVTVRFKTTTRLEQL----PDGSNWAAFVN 531
Query: 630 GPYVLAGHS 638
GP VLA +
Sbjct: 532 GPIVLAAKT 540
>gi|237708621|ref|ZP_04539102.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
gi|229457321|gb|EEO63042.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
Length = 783
Score = 248 bits (634), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 171/546 (31%), Positives = 267/546 (48%), Gaps = 61/546 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
+ DVRL + A+ ++ YLL +D D+L+ + K A L E Y WE + L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWE--NTGLDG 89
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLE- 236
H GHYLSA + M+A+T N+ +K ++ ++S L CQ G GYL P + + +E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
L W P Y IHK+ AGL D + EA +++T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+I K S E+ L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ G++ E + ++ IG HF+ D
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L+ + + DYYER+L N +L Q + G
Sbjct: 322 ---FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDSVQGG 378
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ ++ G+ IY ++ +Y+
Sbjct: 379 -FVYFTPMRAG-----HYRVYSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVN 429
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W G I + Q+ ++ TL S + +L R+P WT+
Sbjct: 430 LFIPSTLRW--GDIHIEQQ----TAFPDEEGTTLAVSPEKGEKEFTLLFRVPEWTNPEAL 483
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ ++NG+ + ++S+ +TWS DK+ ++LP+ LR A+ D Y +ILYGP
Sbjct: 484 RLSVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGP 539
Query: 632 YVLAGH 637
VLA
Sbjct: 540 IVLAAQ 545
>gi|423242461|ref|ZP_17223569.1| hypothetical protein HMPREF1065_04192 [Bacteroides dorei
CL03T12C01]
gi|392639254|gb|EIY33080.1| hypothetical protein HMPREF1065_04192 [Bacteroides dorei
CL03T12C01]
Length = 783
Score = 248 bits (633), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 171/546 (31%), Positives = 267/546 (48%), Gaps = 61/546 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
+ DVRL + A+ ++ YLL +D D+L+ + K A L E Y WE + L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWE--NTGLDG 89
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLE- 236
H GHYLSA + M+A+T N+ +K ++ ++S L CQ G GYL P + + +E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
L W P Y IHK+ AGL D + EA +++T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+I K S E+ L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ G++ E + ++ IG HF+ D
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L+ + + DYYER+L N +L Q + G
Sbjct: 322 ---FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG 378
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ ++ G+ IY ++ +Y+
Sbjct: 379 -FVYFTPMRAG-----HYRVYSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVN 429
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W G I + Q+ ++ TL S + +L R+P WT+
Sbjct: 430 LFIPSTLRW--GDIHIEQQ----TAFPDEEGTTLAVSPEKGEKEFTLLFRVPEWTNPEAL 483
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ ++NG+ + ++S+ +TWS DK+ ++LP+ LR A+ D Y +ILYGP
Sbjct: 484 RLSVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGP 539
Query: 632 YVLAGH 637
VLA
Sbjct: 540 IVLAAQ 545
>gi|347528202|ref|YP_004834949.1| hypothetical protein SLG_18170 [Sphingobium sp. SYK-6]
gi|345136883|dbj|BAK66492.1| hypothetical protein SLG_18170 [Sphingobium sp. SYK-6]
Length = 805
Score = 248 bits (633), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 173/535 (32%), Positives = 254/535 (47%), Gaps = 55/535 (10%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A N YLL L+ D+L+ NF A L GE YGGWE + + GH +GHY++A ALM
Sbjct: 61 AVDANRRYLLQLEPDRLLHNFLVHAGLEPKGEAYGGWEGDT--IAGHTLGHYMTALALMH 118
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLE---ALIP---------- 240
A T + + +V L QK G GY++ F D +E A+ P
Sbjct: 119 AQTGDAECARRALYIVDELERAQKASGDGYVAGFTRRNGDVVEDGKAIFPEIMAGDIRSA 178
Query: 241 ------VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
W P+Y HK+ AGL D T+ + +A+ + + Y ++ V +
Sbjct: 179 GFDLNGCWVPFYNWHKLYAGLFDIQTWIGSDKAIPIAVSLSGY----IEKVFASLDDTQL 234
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
L+ E GG+N+ +L T DP+ L LA L L+ + + H+NT IP
Sbjct: 235 QTVLDCEHGGINESFAELHVRTGDPRWLALAERIRHRKVLDPLSRGENSLPWIHANTQIP 294
Query: 355 IVIGSQMRYEVTGDQLHKEG---------HQLESSGTNIGHFNFKSDPKRLASNLDSNTE 405
VIG +E+TG H H+ + DP ++ ++ T
Sbjct: 295 KVIGLARLHEITGRADHAIAARYFWDTVVHRYSYVIGGNADREYFPDPDTVSRHITEQTC 354
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
ESC TYNMLK++RHL+ W E + DYYER+ N +L QR T+ G+ Y++PL G+ +
Sbjct: 355 ESCNTYNMLKLTRHLYAWRPEASLFDYYERAHINHILAQQR-TDNGMFAYMVPLMSGTHR 413
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEG-KYPGVYIIQ--YISSRLDWKS 522
W P DSFWCC G+GIES SK G+SI++EE+ + G ++ YI SR W +
Sbjct: 414 A-----WSDPFDSFWCCVGSGIESHSKHGESIWWEEDDQRRAGEALVANLYIPSRTQWSA 468
Query: 523 -GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP 581
G +V + P +D + + LT +K T +L LRIP W +NG+
Sbjct: 469 RGATLVMETAYP---FDGEIDIALTELAKPG--TFTLALRIPAWCDEPA--VLINGKAWK 521
Query: 582 LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
++++ + W D + + LP+ LR E DD S A L GP VLA
Sbjct: 522 ATPADGYIAIKRPWKRGDSIRLSLPMKLRMEPTPDD----PSTVAFLRGPVVLAA 572
>gi|345513549|ref|ZP_08793069.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
gi|229437570|gb|EEO47647.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
Length = 783
Score = 248 bits (633), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 171/546 (31%), Positives = 267/546 (48%), Gaps = 61/546 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
+ DVRL + A+ ++ YLL +D D+L+ + K A L E Y WE + L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGMDPDRLLAPYLKEAGLFPKAENYTNWE--NTGLDG 89
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLE- 236
H GHYLSA + M+A+T N+ +K ++ ++S L CQ G GYL P + + +E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
L W P Y IHK+ AGL D + EA +++T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMIR-------- 201
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+I K S E+ L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ G++ E + ++ IG HF+ D
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L+ + + DYYER+L N +L Q + G
Sbjct: 322 ---FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG 378
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ ++ G+ IY ++ +Y+
Sbjct: 379 -FVYFTPMRAG-----HYRVYSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVN 429
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W G I + Q+ ++ TL S + +L R+P WT+
Sbjct: 430 LFIPSTLRW--GDIHIEQQ----TAFPDEEGTTLAVSPEKGEKEFTLLFRVPEWTNPEAL 483
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ ++NG+ + ++S+ +TWS DK+ ++LP+ LR A+ D Y +ILYGP
Sbjct: 484 RLSVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGP 539
Query: 632 YVLAGH 637
VLA
Sbjct: 540 IVLAAQ 545
>gi|212691787|ref|ZP_03299915.1| hypothetical protein BACDOR_01282 [Bacteroides dorei DSM 17855]
gi|212665688|gb|EEB26260.1| hypothetical protein BACDOR_01282 [Bacteroides dorei DSM 17855]
Length = 783
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 171/546 (31%), Positives = 267/546 (48%), Gaps = 61/546 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
+ DVRL + A+ ++ YLL +D D+L+ + K A L E Y WE + L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWE--NTGLDG 89
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLE- 236
H GHYLSA + M+A+T N+ +K ++ ++S L CQ G GYL P + + +E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
L W P Y IHK+ AGL D + EA +++T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+I K S E+ L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ G++ E + ++ IG HF+ D
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L+ + + DYYER+L N +L Q + G
Sbjct: 322 ---FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG 378
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ ++ G+ IY ++ +Y+
Sbjct: 379 -FVYFTPMRAG-----HYRVYSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVN 429
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W G I + Q+ ++ TL S + +L R+P WT+
Sbjct: 430 LFIPSTLRW--GDIHIEQQ----TAFPDEEGTTLAVSPEKGEKEFALLFRVPEWTNPEAL 483
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ ++NG+ + ++S+ +TWS DK+ ++LP+ LR A+ D Y +ILYGP
Sbjct: 484 RLSVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGP 539
Query: 632 YVLAGH 637
VLA
Sbjct: 540 IVLAAQ 545
>gi|265755220|ref|ZP_06089990.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|423231114|ref|ZP_17217517.1| hypothetical protein HMPREF1063_03337 [Bacteroides dorei
CL02T00C15]
gi|423246788|ref|ZP_17227840.1| hypothetical protein HMPREF1064_04046 [Bacteroides dorei
CL02T12C06]
gi|263234362|gb|EEZ19952.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|392629229|gb|EIY23239.1| hypothetical protein HMPREF1063_03337 [Bacteroides dorei
CL02T00C15]
gi|392634665|gb|EIY28581.1| hypothetical protein HMPREF1064_04046 [Bacteroides dorei
CL02T12C06]
Length = 783
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 171/546 (31%), Positives = 267/546 (48%), Gaps = 61/546 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
+ DVRL + A+ ++ YLL +D D+L+ + K A L E Y WE + L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWE--NTGLDG 89
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLE- 236
H GHYLSA + M+A+T N+ +K ++ ++S L CQ G GYL P + + +E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
L W P Y IHK+ AGL D + EA +++T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+I K S E+ L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ G++ E + ++ IG HF+ D
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L+ + + DYYER+L N +L Q + G
Sbjct: 322 ---FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG 378
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ ++ G+ IY ++ +Y+
Sbjct: 379 -FVYFTPMRAG-----HYRVYSQPQTSFWCCVGSGMENHARYGEMIYGHKDNN---LYVN 429
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W G I + Q+ ++ TL S + +L R+P WT+
Sbjct: 430 LFIPSTLRW--GDIHIEQQ----TAFPDEEGTTLAVSPEKGEKEFTLLFRVPEWTNPEAL 483
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ ++NG+ + ++S+ +TWS DK+ ++LP+ LR A+ D Y +ILYGP
Sbjct: 484 RLSVNGEQQKVTVKEGYVSLNRTWSKGDKVRLELPMHLRAIALPDGSANY----SILYGP 539
Query: 632 YVLAGH 637
VLA
Sbjct: 540 IVLAAQ 545
>gi|418466296|ref|ZP_13037222.1| secreted protein [Streptomyces coelicoflavus ZG0656]
gi|371553101|gb|EHN80323.1| secreted protein [Streptomyces coelicoflavus ZG0656]
Length = 773
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 172/527 (32%), Positives = 257/527 (48%), Gaps = 43/527 (8%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY-GGWEEPSCELRGHFVGHYLSASALMW 193
Q L YL +DVD+L+ NFR RL G GGWE P R H GH+L+A A +
Sbjct: 68 QSRTLSYLRFVDVDRLLHNFRANHRLSTNGAAATGGWEAPDFPFRSHVQGHFLTAWAQAY 127
Query: 194 ASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFPTEQFDRLEA--LIPVWAPYY 246
A T + + ++K +V+ L+ CQ G+GYLS +P F LE+ L PYY
Sbjct: 128 AVTGDTACRDKALYMVAELAKCQANNGAAGFGTGYLSGYPESDFAALESGTLNNGNVPYY 187
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN 306
TIHK LAGLL+ + + A + + + R + S R L E GGMN
Sbjct: 188 TIHKTLAGLLEVWRLLGSTRARDVLLALAGWVDRRT----GRLSTTRMQAVLGTEFGGMN 243
Query: 307 DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVT 366
VL L T D + L +A FD LA D ++G H+NT +P IG+ Y+ T
Sbjct: 244 AVLTDLCQQTGDTRWLAVAQRFDHAAVFDPLAANQDRLAGLHANTQVPKWIGAVREYKAT 303
Query: 367 GDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKV 416
G +++ + G N +F+ P +A++L ++T ESC T NML +
Sbjct: 304 GSTRYRDIATNAWNMCVTTHTYAVGGNSQAEHFRP-PNAIAAHLANDTCESCNTVNMLGL 362
Query: 417 SRHLFRWTKEIA-YADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK----ERSYH 470
+R LF + + A DYYE++ N ++G Q +P G + Y PL PG +
Sbjct: 363 TRELFALSPDRAELFDYYEQAWLNHMIGQQNPADPHGHVTYFTPLKPGGRRGVGPAWGGG 422
Query: 471 HWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK 530
W T +FWCC GTG+E ++L DS+YF + G V + ++ S L W I V Q
Sbjct: 423 TWSTDYTTFWCCQGTGLEMHTRLMDSVYFHDGGTTLTVNL--FVPSVLTWAERGITVTQS 480
Query: 531 VDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG-QDLPLPSPGNFL 589
S LR+T + T ++ +RIP WT+ GA ++NG + +PG +
Sbjct: 481 TSYPASDTTTLRITGDAAG-----TWAMRVRIPGWTT--GAVVSVNGVRQHVTAAPGTYA 533
Query: 590 SVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
++ + W S D +T++LP+ DD ++ A+ +GP VL+G
Sbjct: 534 TLDRAWDSGDTVTVRLPMRTVVRPANDD----PAVGAVTHGPVVLSG 576
>gi|90020425|ref|YP_526252.1| Acetyl-CoA carboxylase, biotin carboxylase [Saccharophagus
degradans 2-40]
gi|89950025|gb|ABD80040.1| protein of unknown function DUF1680 [Saccharophagus degradans 2-40]
Length = 803
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 174/545 (31%), Positives = 271/545 (49%), Gaps = 48/545 (8%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L DVRL DS AQ N+EY+L L DKL+ F K A LP E YG WE S L G
Sbjct: 36 LADVRL-LDSPFKHAQDKNVEYVLALQPDKLLAPFLKEAGLPVKAENYGNWE--SQGLDG 92
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--FDRLE- 236
H GHYL+A +L +A+T ++ L ++++ +++ L Q + +GY+ + +D +
Sbjct: 93 HIGGHYLTALSLAYAATGDKRLLDRLNYMLNELERAQNKNSNGYIGGVRNGKALWDNIAK 152
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
AL W P+Y +HKI AGL D Y Y + +A M + E+ + +
Sbjct: 153 GDIRADLFALNDYWVPWYNLHKIYAGLRDAYIYTGSEQAKAMLIGLGEWTIALTAD-LND 211
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
IE+ L E GGMN+V + IT D ++L LA F L L + D ++G H
Sbjct: 212 EQIEK---MLTTEYGGMNEVFADMAAITGDKRYLSLAKQFSHKKILNPLLQKRDALNGLH 268
Query: 349 SNTHIPIVIGSQMRYEVTGD-QLHKEG-----HQLESSGTNIG------HFNFKSDPKRL 396
+NT IP V+G Q E+TGD + HK H + + IG HF+ D +
Sbjct: 269 ANTQIPKVVGYQRVAELTGDEEWHKAADYFWHHVVNNRTVAIGGNSVREHFHDSEDFAPM 328
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
++++ E+C TYNMLK+SR LF + Y DY+ER+L N +L Q E G ++Y
Sbjct: 329 INDVEG--PETCNTYNMLKLSRMLFSVNPSVDYVDYFERALYNHILSSQH-PETGGLVYF 385
Query: 457 LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P+ P + Y + + WCC G+GIE+ K G+ IY ++ +Y+ +I+S
Sbjct: 386 TPMRP-----QHYRMYSQVDTAMWCCVGSGIENHVKYGEFIYAKQNN---NLYVNLFIAS 437
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT--SLNLRIPTWTSSNGAKAT 574
L W+ + + Q+ S L V L K S ++++R P W +
Sbjct: 438 TLVWQEKGVHLTQENTFPDSNRTTLTVALDSKVKSSKKHAKFTMHIRYPRWAQAGKVVVK 497
Query: 575 LNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NG+ + + + G ++ + + W + D + + LP+ + EA+ D Y A+LYGP V
Sbjct: 498 VNGKPINVKAKAGEYIEINRRWHNGDNVELSLPMNIALEALPDQSDYY----AVLYGPIV 553
Query: 634 LAGHS 638
LA +
Sbjct: 554 LAAKT 558
>gi|217973327|ref|YP_002358078.1| hypothetical protein Sbal223_2153 [Shewanella baltica OS223]
gi|217498462|gb|ACK46655.1| protein of unknown function DUF1680 [Shewanella baltica OS223]
Length = 792
Score = 247 bits (631), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 170/555 (30%), Positives = 282/555 (50%), Gaps = 67/555 (12%)
Query: 118 VSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCEL 177
+ L+DVR+ + AQQT+L Y++ +D ++L+ +RK A + E Y WE+ L
Sbjct: 23 IPLNDVRITAGPF-LHAQQTDLHYIMSMDPERLLAPYRKDAGIATTAENYPNWEDTG--L 79
Query: 178 RGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT-----EQF 232
GH GHYLSA ALM+A+T ++++ +++ +V+ L CQ+ G+GYL P +Q
Sbjct: 80 DGHIGGHYLSALALMYAATSDKAVLARLNYMVAELEKCQQAHGNGYLGGVPNSRKLWQQI 139
Query: 233 D--RLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVI 286
+ ++EA L W P+Y +HK+ +GL D + Y +N A +M +F + + ++
Sbjct: 140 EQGKIEADLFTLNQAWVPWYNVHKVFSGLRDAHLYTNNPTAKKMLV----HFADWMLHLS 195
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
K S E+ L E GG+N+ L ++ IT K+L LA + L L D ++G
Sbjct: 196 NKLSDEQLQLMLRTEYGGLNETLADVYVITGQDKYLALAKRYTDQSLLQPLLHHEDKLTG 255
Query: 347 FHSNTHIPIVIGSQMRYEVTGDQLHKEG------HQLESSGTNIG------HFNFKSDPK 394
H+NT IP ++G E++ +++ + + +IG HF+ D
Sbjct: 256 LHANTQIPKIVGVARIAELSNNKVWLDSADFFWQQVVHKRTVSIGGNSVREHFHPSDD-- 313
Query: 395 RLASNLDS-NTEESCTTYNMLKVSRHLF------RWTKEIAYADYYERSLTNGVLGIQRG 447
+S L+S E+C TYNMLK+S+ L+ ++AY +YYER+L N +L Q
Sbjct: 314 -FSSMLESAEGPETCNTYNMLKLSKLLYENKLLDENKADLAYIEYYERALYNHILSSQH- 371
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
E G ++Y P+ P Y + + S WCC G+GIE+ +K G+ IY E +
Sbjct: 372 PENGGLVYFTPMRPD-----HYRVYSSAQQSMWCCVGSGIENHAKYGELIYASEGDDF-- 424
Query: 508 VYIIQYISSRLDWKSGQIVVNQKV---DPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
Y+ ++ S + W+ I + QK D S +TL ++ +LN+R P
Sbjct: 425 -YVNLFVDSEVHWQEKGITLTQKTLFPDANTS-----EITLDKDAQ-----FALNVRYPQ 473
Query: 565 WTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYAS 623
W N ++NGQ + G ++ + + W DK++I LP+T+ E I P+ +S
Sbjct: 474 WVQHNDLTLSINGQAQKFNAVAGQYIKIKRQWHKGDKISITLPMTVTLEQI----PDRSS 529
Query: 624 IQAILYGPYVLAGHS 638
++LYGP VLA +
Sbjct: 530 YYSVLYGPIVLAAKT 544
>gi|427403045|ref|ZP_18894042.1| hypothetical protein HMPREF9710_03638 [Massilia timonae CCUG 45783]
gi|425718056|gb|EKU81008.1| hypothetical protein HMPREF9710_03638 [Massilia timonae CCUG 45783]
Length = 781
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 172/548 (31%), Positives = 268/548 (48%), Gaps = 56/548 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L VRLG AQ TNL YL+ ++ D+L+ F + A L YG WE S L G
Sbjct: 25 LSAVRLGPGPF-LDAQTTNLNYLMAMEPDRLLAPFLREAGLQPRQPSYGNWE--STGLDG 81
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ-------F 232
H GHYLSA ALM AST ++ +++ V+ L Q+ G GYL P +
Sbjct: 82 HMGGHYLSALALMHASTGDQEALRRLNYFVAELKRAQQANGDGYLGGIPGGRQAWRDIAA 141
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
+LEA + W P+Y +HK+ AGL D Y YA N +A M + ++ + K
Sbjct: 142 GKLEADNFSVNGKWVPWYNLHKVYAGLRDAYRYAGNEDAKAMLVQLSDW----ALALSAK 197
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
S E+ L E GGMN++ + +T + K+L LA F L LA + D ++G H
Sbjct: 198 LSPEQMQTMLRSEHGGMNEIFVDVAEMTGERKYLDLALAFSHQAVLQPLARKQDQLTGLH 257
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSDPKRL 396
+NT IP VIG + ++TG Q E + ++ IG HF+ D +
Sbjct: 258 ANTQIPKVIGFKRIADMTGRQDMGEAARFFWQTVVDKRTVAIGGNSVKEHFHSTDDFDPM 317
Query: 397 ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
++ E+C TYNMLK++ LFR ++ Y+DYYER+L N +L QR G +Y
Sbjct: 318 VHEVEG--PETCNTYNMLKLTGMLFRSEQKGMYSDYYERALYNHILSSQR--PEGGFVYF 373
Query: 457 LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P+ P Y + WCC G+GIES +K G+ IY ++ +++ +++S
Sbjct: 374 TPMRPN-----HYRVYSQVDKGMWCCVGSGIESHAKYGEFIYARDKDT---LFVNLFVAS 425
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
LDWK + V Q ++ LT +G ++ +R P W + +N
Sbjct: 426 TLDWKDKGVRVTQ----ATTFPDADTTRLTVDGEGR---FTMKIRYPAWVAPGRMAVRVN 478
Query: 577 GQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G ++ + + PG + ++ + W D++ ++LP+T E + P ++ A+L+GP VLA
Sbjct: 479 GAEVKIDARPGGYATIARAWRKGDRVDVRLPMTTHLEQM----PGRSNYYAVLHGPVVLA 534
Query: 636 GHS--IGD 641
+ +GD
Sbjct: 535 ARTRMVGD 542
>gi|399025507|ref|ZP_10727503.1| hypothetical protein PMI13_03476 [Chryseobacterium sp. CF314]
gi|398077884|gb|EJL68831.1| hypothetical protein PMI13_03476 [Chryseobacterium sp. CF314]
Length = 791
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 173/556 (31%), Positives = 274/556 (49%), Gaps = 62/556 (11%)
Query: 112 GEFLKEVS---LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYG 168
G+ K V+ L+ V L S+S+ +A QT+ +Y+L +D D+L+ + K A L Y
Sbjct: 18 GQMKKNVNYFPLNKVHL-SESVFSKAMQTDEKYILSMDADRLLAPYLKEAGLKPKKANYP 76
Query: 169 GWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP 228
WE + L GH GHY+SA ALM+AST + +K+++ ++ L CQ +GYLS P
Sbjct: 77 NWE--NTGLDGHIGGHYISALALMYASTGDAKVKQRLDYMIDELERCQNLSENGYLSGVP 134
Query: 229 TEQFDRLE-----------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTW 273
+ E L W P Y IHKI +GL D Y YAD+ +A +R+T W
Sbjct: 135 NGKKIWKEIAGGNIRAATFGLNDRWVPLYNIHKIYSGLRDAYWYADSGKAKKMLIRLTDW 194
Query: 274 MVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCF 333
MV +V+ I+ L E GG+N+V ++ IT++PK+L LAH F
Sbjct: 195 MVGEV-----SVLSDAQIQ---NMLRSEHGGLNEVFADVYDITKNPKYLRLAHRFSHLAI 246
Query: 334 LGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQL------ESSGTNIG-- 385
L L D +G H+NT IP VIG + ++ ++ + IG
Sbjct: 247 LNPLLNGEDKFTGIHANTQIPKVIGFKRIADLENNKEWSNAADFFWINVTQKRSAVIGGN 306
Query: 386 ----HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGV 441
HFN +D + +++ E+C TYNMLK+S+ L+ + +Y DYYER+L N +
Sbjct: 307 SVSEHFNPINDFSGMIKSIEG--PETCNTYNMLKLSKELYATNPKSSYIDYYERALYNHI 364
Query: 442 LGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
L Q E G +Y P+ PG Y + P SFWCC G+G+E+ +K G+ IY
Sbjct: 365 LSTQ-NPEKGGFVYFTPMRPG-----HYRVYSQPETSFWCCVGSGMENHAKYGEMIYAHS 418
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
+ +Y+ +I S L W ++V+ Q+ + S L + S ++ LR
Sbjct: 419 D---EDLYVNLFIPSILKWSEKKMVLRQENNFPESASTKLIFDVVSKS-----DINMKLR 470
Query: 562 IPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
P W+ ++ ++N +++ +P + SV + W D + +++P+ L E + P+
Sbjct: 471 APEWSDASQITISVNHKNINVPIDAEGYFSVKRKWKKGDVIEMKMPMHLSAEQL----PD 526
Query: 621 YASIQAILYGPYVLAG 636
++ A YGP VLA
Sbjct: 527 HSDYFAFKYGPIVLAA 542
>gi|198275797|ref|ZP_03208328.1| hypothetical protein BACPLE_01972 [Bacteroides plebeius DSM 17135]
gi|198271426|gb|EDY95696.1| hypothetical protein BACPLE_01972 [Bacteroides plebeius DSM 17135]
Length = 796
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 168/545 (30%), Positives = 260/545 (47%), Gaps = 55/545 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L DV L D AQ+ NL+ L+ DVD+L+ F K A LP EP+ W L G
Sbjct: 35 LGDVEL-LDGPFKHAQELNLKVLMEYDVDRLLAPFLKEAGLPLKAEPFPNW----AGLDG 89
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ-------F 232
H GHYLSA A+ +A+T NE +++M ++ L CQ+ G GY+ P +
Sbjct: 90 HVGGHYLSAMAMNYAATGNEECRKRMEYMLGELKRCQESNGDGYIGGVPNGKELWADIKN 149
Query: 233 DRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEAL----RMTTWMVEYFYNRVQNVIKK 288
++E++ WAP+Y +HKI AGL D + Y N EAL R+ W V +V +
Sbjct: 150 GKVESIWKYWAPWYNVHKIFAGLRDAWMYTGNKEALDMFLRLCDWGV--------SVTEG 201
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
S + Q L E GGM+++ + IT K+L A F + D++ H
Sbjct: 202 LSDNQMEQMLANEFGGMDEIFADAYQITGKKKYLTTAKRFSHRWLFDSMVAHKDNLDNIH 261
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHKEGHQL----------ESSGTNIGHFNFKSDPKRLAS 398
+NT IP VIG Q EV GD + + + G N F S +
Sbjct: 262 ANTQIPKVIGYQRIAEVCGDNQYMDAADFFWNIVACKRSLALGGNSRREYFSSMDDFRSH 321
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLP 458
D ESC TYNMLK++ LFR T + Y D+YE++L N +L Q G + +
Sbjct: 322 VEDREGPESCNTYNMLKLTEGLFRMTGKAVYVDFYEKALYNHILSTQHPKHGGYVYFT-- 379
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
S++ Y + P+ + WCC GTG+E+ K G+ IY +++ +ISSRL
Sbjct: 380 ----SARPAHYRVYSKPNSAMWCCVGTGMENHGKYGEFIYTHSS---DSLFVNLFISSRL 432
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+W+ ++ + Q+ + + R+T+ S G L LR P W + G + NG+
Sbjct: 433 NWEQEKVTITQETN--FPDEETSRLTVKLKS-GESCHFKLLLRRPAWVTE-GYEVKCNGK 488
Query: 579 DLPLP---SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+ + + +++ + + W DK+ + LP+ +R E +Q + AI+ GP +L
Sbjct: 489 VVDVSEKVAGSSYICIDRKWKDGDKVEVSLPMKMRLETLQGE----DDFVAIMRGP-ILM 543
Query: 636 GHSIG 640
G S+G
Sbjct: 544 GASVG 548
>gi|182415028|ref|YP_001820094.1| hypothetical protein Oter_3214 [Opitutus terrae PB90-1]
gi|177842242|gb|ACB76494.1| protein of unknown function DUF1680 [Opitutus terrae PB90-1]
Length = 844
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 187/565 (33%), Positives = 276/565 (48%), Gaps = 63/565 (11%)
Query: 108 PERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY 167
PE E L L VRL + A + N YLL LD D+L+ FR+ A LPA +PY
Sbjct: 69 PETPAEILP---LASVRLLEGGPFFTAVKANRTYLLALDADRLLAPFRREAGLPALAQPY 125
Query: 168 GGWEEPSCELRGHFVGHYLSASALMWASTHNE---SLKEKMSAVVSALSACQKEIGSGYL 224
G WE S L GH GHYLSA A M A+ H+ L+ ++ +V+ L ACQ G+GY+
Sbjct: 126 GNWE--SGGLDGHTAGHYLSALAHMIAAGHDTPEGELRRRLDHMVAELKACQDANGNGYV 183
Query: 225 SAFPT--EQFDRLEA-----LIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTW 273
P E + R+ A + W P+Y +HK AGL D + N A +R+ W
Sbjct: 184 GGVPGSHELWQRVAAGDVTAVNRKWVPWYNLHKTFAGLRDAWLQTGNTTARDVLVRLGDW 243
Query: 274 MVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCF 333
V + + E+ + L +E GGMN+VL ++ IT D K+L A F+
Sbjct: 244 CVA--------LTSPLTDEQMQRMLAQEHGGMNEVLADIYAITGDKKYLTAAERFNHHAV 295
Query: 334 LGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTN 383
L L D+++G H+NT IP V+G + +TGD+ G H+ + G N
Sbjct: 296 LDPLEQHRDELTGKHANTQIPKVVGLERIATLTGDKAADSGARFFWETVTQHRSVAFGGN 355
Query: 384 --IGHFNFKSDPKRLASNL-DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG 440
HFN DP + L E+C TYNML+++ LF E AYADYYER+L N
Sbjct: 356 SVSEHFN---DPHNFHALLVHREGPETCNTYNMLRLTEGLFASAPEAAYADYYERALFNH 412
Query: 441 VLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFE 500
+L PG +Y P+ P Y + P FWCC GTG+E+ K G+ IY
Sbjct: 413 ILASINPDHPG-YVYFTPIRP-----NHYRVYSQPDQGFWCCVGTGMENPGKYGEFIYAR 466
Query: 501 EEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNL 560
+ GV++ +I+S L + + Q+ D ++TL + T +L++
Sbjct: 467 ---AHDGVFVNLFIASELTVAPLGLTLRQQT--AFPDDERSQLTLKLAQP---QTFTLHV 518
Query: 561 RIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
R P W ++ T+NG+ + + S P +++++ + W D++ I+ P+ E + D P
Sbjct: 519 RQPGWVAAGTFTLTVNGEPVAVTSAPSSYVTIHREWRDGDRVEIRFPMHTSIEGLPDGSP 578
Query: 620 EYASIQAILYGPYVLAGHSIGDWDI 644
Y AIL GP VLA H G W++
Sbjct: 579 WY----AILRGPIVLA-HPAGTWEL 598
>gi|399030291|ref|ZP_10730797.1| hypothetical protein PMI10_02670 [Flavobacterium sp. CF136]
gi|398071797|gb|EJL63044.1| hypothetical protein PMI10_02670 [Flavobacterium sp. CF136]
Length = 771
Score = 246 bits (627), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 166/553 (30%), Positives = 275/553 (49%), Gaps = 63/553 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++E L +++L S AQ +L+YLL L+ D+L+ + +A +P + YG WE +
Sbjct: 34 MQEFKLQEIKLTSGPFK-NAQNVDLKYLLDLNPDRLLAPYLISAGIPTKADRYGNWE--N 90
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--F 232
L GH GHYL+A ++M+AST N+ +K ++ ++S L+ CQ++ G+GY+ P + +
Sbjct: 91 IGLDGHIGGHYLAALSMMYASTGNKEIKSRLDYMISELALCQEKDGTGYVGGIPEGKVFW 150
Query: 233 DRLE---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFY 279
DR+ L W P Y IHK+ AGL+D Y Y N +A +++ W +E
Sbjct: 151 DRIHKGDIDGSGFGLNNTWVPIYNIHKLFAGLIDAYNYTGNEKAKEIVIKLGDWFIE--- 207
Query: 280 NRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
+I+ S E+ + L E GG+N+ L+ IT++ K+L A + L L
Sbjct: 208 -----LIRPLSDEQIQKILKTEHGGINESFADLYSITKNKKYLETAEKLSQKAILDPLIK 262
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQL------ESSGTNIG------HF 387
+ D ++G H+NT IP VIG + +++ ++ + Q E G HF
Sbjct: 263 KEDKLTGLHANTQIPKVIGFEKIGKLSDNKQWSDAAQFFWMNVTEKRTVAFGGNSVAEHF 322
Query: 388 NFKSDPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
N +D + L SN E+C +YNM ++S+ LF ++Y D+YER+L N +L Q
Sbjct: 323 NPIND---FSGMLKSNQGPETCNSYNMERLSKALFLDKNNVSYLDFYERTLYNHILSSQE 379
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
G +Y P+ P Y + P S WCC GTG+E+ SK G+ IY E
Sbjct: 380 PNRGG-FVYFTPIRP-----NHYRVYSQPETSMWCCVGTGLENHSKYGELIYSHSE---R 430
Query: 507 GVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
+++ +I S L+WK I + Q ++ + L + S + LN+R P W
Sbjct: 431 DIFVNLFIPSTLNWKEKGIELEQTTK--FPYENNTEIVLKLKNPKSFV---LNIRYPKWA 485
Query: 567 SSNGAKATLNGQ-DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
++ + +NG+ P N++S+ + W S DK+TI + E + P+ ++
Sbjct: 486 TN--FEILVNGKLQKAEAKPTNYVSMARKWKSGDKITIAFKTSTHLEKL----PDGSNWA 539
Query: 626 AILYGPYVLAGHS 638
A + GP VLA +
Sbjct: 540 AFVNGPIVLAAKT 552
>gi|395803808|ref|ZP_10483051.1| hypothetical protein FF52_18073 [Flavobacterium sp. F52]
gi|395434079|gb|EJG00030.1| hypothetical protein FF52_18073 [Flavobacterium sp. F52]
Length = 760
Score = 245 bits (626), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 166/553 (30%), Positives = 274/553 (49%), Gaps = 63/553 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
+K L +V+L D AQ +L+Y+L LD DKL+ + +RLP + YG WE +
Sbjct: 22 MKLFDLSEVKL-KDGPFKNAQDVDLKYILALDPDKLLAPYLLESRLPPKADRYGNWE--N 78
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--F 232
L GH GHYLSA ALM+ ST N+ LK+++ ++S L+ CQ + G+GY+ P + +
Sbjct: 79 IGLDGHIGGHYLSALALMYKSTGNKELKDRLDYMLSELARCQAKNGNGYVGGIPQGKVFW 138
Query: 233 DRLE---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFY 279
DR+ L W P Y IHK+ AGL D Y Y + +A +++ W +E
Sbjct: 139 DRIHKGDIDGSSFGLNNTWVPIYNIHKLFAGLTDAYQYTGSEQAKDIVIKLGDWFIE--- 195
Query: 280 NRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
+I+ S E+ + L E GG+N+ L+ IT+D K+L A L L
Sbjct: 196 -----LIRPLSDEQIQKVLATEHGGINESFADLYIITKDKKYLETAEKLSHKALLNPLLQ 250
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ-----------LESSGTNIG-HF 387
+ D ++G H+NT IP V+G + ++ ++ +G Q + G ++ HF
Sbjct: 251 KEDKLTGLHANTQIPKVVGFEKIAALSDNKEWSDGVQFFWNNVTQKRTVAFGGNSVAEHF 310
Query: 388 NFKSDPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
N +D + + SN E+C +YNM ++++ LF ++ Y D+YER+L N +L Q
Sbjct: 311 NPVND---FSGMVKSNEGPETCNSYNMERLAKALFLDKNDVHYLDFYERTLYNHILSSQH 367
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
E G +Y P+ P Y + P S WCC GTG+E+ +K G+ IY +
Sbjct: 368 -PEKGGFVYFTPIRPN-----HYRVYSQPQTSMWCCVGTGLENHTKYGELIYSHTQS--- 418
Query: 507 GVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
+++ +I S L WK + + Q + PY T +LN+R P W
Sbjct: 419 DLFVNLFIPSVLKWKENGVELEQNTNF-----PYENQTELVLKLKKTKNFALNIRYPKWA 473
Query: 567 SSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
+ + +NG++ + S P ++S++K W + DK+ ++ ++ E + P+ ++
Sbjct: 474 EN--FEIFVNGKEQKIASQPSEYVSISKKWKTGDKIIVRFKTSIHLENL----PDGSNWS 527
Query: 626 AILYGPYVLAGHS 638
A + GP VLA +
Sbjct: 528 AFVKGPIVLAAKT 540
>gi|325679069|ref|ZP_08158663.1| hypothetical protein CUS_6624 [Ruminococcus albus 8]
gi|324109193|gb|EGC03415.1| hypothetical protein CUS_6624 [Ruminococcus albus 8]
Length = 791
Score = 245 bits (626), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 181/565 (32%), Positives = 272/565 (48%), Gaps = 71/565 (12%)
Query: 127 SDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWEEPSCELRGHFVGHY 185
+D A + YLL D D+L+ FR+TA L G Y GWE+ + GH VGHY
Sbjct: 17 TDEYCANAFNKEIAYLLSFDTDRLLAGFRETAGLDMRGAVRYSGWEDDL--IGGHCVGHY 74
Query: 186 LSASALMWAS-----THNESLKEKMSAVVSALSACQKEIGSGYLSAFPT-------EQFD 233
++A A +AS + ++L + L CQ+ +G+G++ QFD
Sbjct: 75 MTAVAQAYASLQEGDSRRDALYKLAVTTTDGLKECQQALGTGFIFGAKIIDKNNVEAQFD 134
Query: 234 RLEA-----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
+E + W PYYT+HKILAG +D Y A + + + ++ Y RV +
Sbjct: 135 NVEKNLSNIMTQAWVPYYTLHKILAGAIDIYRLTGYENAKTVASRLGDWVYRRVS----R 190
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLGLLALQADDISGF 347
+S E L E GGMND LY+L+ +T +H + AH FD+ P F + A + ++
Sbjct: 191 WSEETQRTVLGIEYGGMNDCLYELYAVTGKEEHAIAAHCFDEVPLFENVYAGTENALNNK 250
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGHQLES---------------------SGTNIGH 386
H+NT IP +G+ RY + D G +++ +G N
Sbjct: 251 HANTTIPKFLGALKRYAIL-DGRTVNGETVDAGRYLGYAERFWDMVVQKHSYITGGNSEW 309
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
+F D A ++N E +C TYNMLK+SR LF T E YADYYE + N +L Q
Sbjct: 310 EHFGCDYVLDAERTNANCE-TCNTYNMLKLSRLLFEITGEKKYADYYENTFINAILSSQN 368
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
E G+ Y P+A G K S TP FWCC G+G+E+F+KLGDSIYF E
Sbjct: 369 -PETGMSTYFQPMASGYFKVYS-----TPYTKFWCCTGSGMENFTKLGDSIYFTEGN--- 419
Query: 507 GVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
+ + QYISS +W + V Q D + + D T F G G SL LR+P W
Sbjct: 420 ALIVNQYISSSAEWSEKGVKVEQMTD-IPNSD-----TAKFMIHGKG-GISLKLRLPDWL 472
Query: 567 SSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
+ + A T++G+ G + V+ + + I+LP+ +R ++ D++ Y
Sbjct: 473 AGD-AVITVDGKAYDADINGGYAEVSGI-ADGSVVEIKLPMEVRAHSLPDNKNTY----G 526
Query: 627 ILYGPYVLAGHSIGDWDITESATSL 651
YGP VL+ +G ++T++ T +
Sbjct: 527 FRYGPIVLSAR-LGTAEMTDTMTGI 550
>gi|404254065|ref|ZP_10958033.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas sp. PAMC
26621]
Length = 646
Score = 245 bits (625), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 187/589 (31%), Positives = 277/589 (47%), Gaps = 66/589 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEE-- 172
L+ L DV LG AQ+ YLL LD D+++ FR A L YGGWE
Sbjct: 46 LQPFDLADVDLGEGPF-LHAQRKTEAYLLSLDPDRMLHAFRVNAGLKPKAAVYGGWESDP 104
Query: 173 --PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP-- 228
+GH +GHYLSA AL + ST + ++++ + L+ACQ SG + AFP
Sbjct: 105 IWADINCQGHTLGHYLSACALAYRSTRKPAFRQRIDHIARELAACQDAAKSGLVCAFPKG 164
Query: 229 ---TEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNR 281
R +A+ V P+YT+HK+ AGL D AD+AE+ LR+ W V
Sbjct: 165 PALVAAHLRGDAITGV--PWYTLHKVFAGLRDATLLADSAESRAVLLRLADWAV------ 216
Query: 282 VQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQ 340
V + + ++T+ E E GGMN+V L+ +T +P + +A F L LA
Sbjct: 217 ---VATRPLSDAQFETMLETEHGGMNEVFADLYLMTGNPDYRTMAERFSHKALLTPLAAG 273
Query: 341 ADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGH-------QLESSGTNIGH------F 387
D + G H+NT +P ++G Q +E TG + E L S GH F
Sbjct: 274 RDQLDGLHANTQLPKIVGFQRVFEATGTPHYHEAAAFFWRTVALTRSFATGGHGDNEHFF 333
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
K + S + E+C +NMLK++R LF + YADYYER+L NG+L Q
Sbjct: 334 PMAEFDKHVFS---AKGSETCGQHNMLKLTRALFLQDPQAEYADYYERTLYNGILASQ-D 389
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
+ G++ Y PG K YH TP SFWCC GTG+E+ K DSIYF ++
Sbjct: 390 PDTGMVTYFQGARPGYMK--LYH---TPEHSFWCCTGTGMENHVKYRDSIYFHDD---KA 441
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+Y+ ++ S + W+ + + Q+ + P T + +L LR P W+
Sbjct: 442 LYVNLFVPSAVRWREKGVALRQE-----TRFPDAPTTTLHWTVERPTDVTLQLRHPRWSR 496
Query: 568 SNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
S A +NG + +PG+++ + +TW S D + ++L + E + D P I A
Sbjct: 497 S--AIVLVNGVEAARSDTPGSYVKLARTWHSGDTVELRLAM----EVVPDQAPAAPDIVA 550
Query: 627 ILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGN 675
YGP VLAG +G + A + + YN+ L+T GN
Sbjct: 551 FSYGPMVLAG-VLGREGLAPGADVIVNERK--YGEYNAGLVTVPTLVGN 596
>gi|332882274|ref|ZP_08449902.1| hypothetical protein HMPREF9074_05700 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|332679658|gb|EGJ52627.1| hypothetical protein HMPREF9074_05700 [Capnocytophaga sp. oral
taxon 329 str. F0087]
Length = 786
Score = 245 bits (625), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 172/544 (31%), Positives = 270/544 (49%), Gaps = 62/544 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L+DVRL + S A+ ++ YLL LD D+L+ + K A L + Y WE + L G
Sbjct: 32 LNDVRL-TQSPFKHAEDLDVRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWE--NTGLDG 88
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT-----EQFDR 234
H GHY+SA + M+A+T +E +K+++ ++S L Q G GYL P E +
Sbjct: 89 HIGGHYVSALSYMYAATGDEEIKQRLDYMLSELKRAQDAAGDGYLCGAPNGRKIWEAVSK 148
Query: 235 LE------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
+ L W P Y IHK AGL D Y A + EA +++T WM+ N
Sbjct: 149 GDIQASSFGLNGGWVPLYNIHKTYAGLRDAYLLAGSKEARDMLVKLTDWMM--------N 200
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ K S E+ L E GG+N+V + +T +L LA F L L D +
Sbjct: 201 LTKDLSDEQIQDMLRSEHGGLNEVFADVADLTGKDNYLQLARRFSHREILDPLLEHEDRL 260
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ GD+ + + +E +IG HF+ D
Sbjct: 261 TGKHANTQIPKVIGYKRIADLQGDEGWDDAARFFWETVVERRSISIGGNSVREHFHPSED 320
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L++ + ++ Y DYYER+L N +L + G
Sbjct: 321 ---FSSMLTSEQGPETCNTYNMLRLTKMLYQTSADVHYMDYYERALYNHILSTIDPVQGG 377
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ +K G+ IY E + +Y+
Sbjct: 378 -FVYFTPMRSGH-----YRVYSQPQTSFWCCVGSGMENHAKYGEMIYGHSEDE---LYVN 428
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W G++ V Q ++ PY T S G ++ R+P WT +
Sbjct: 429 LFIPSVLQW--GKVRVEQ-----LTGFPYEEATTLHLSCGKAKEFTVKFRVPEWTDVSQM 481
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ T+NG P+ G +++V++ W+ D++ + LP++LR A+ D Y + +YGP
Sbjct: 482 ELTVNGTAQPVSVSGGYVTVSRKWADGDEVRLTLPMSLRVAALPDGSDNY----SFMYGP 537
Query: 632 YVLA 635
VLA
Sbjct: 538 IVLA 541
>gi|357046482|ref|ZP_09108109.1| hypothetical protein HMPREF9441_02134 [Paraprevotella clara YIT
11840]
gi|355530721|gb|EHH00127.1| hypothetical protein HMPREF9441_02134 [Paraprevotella clara YIT
11840]
Length = 762
Score = 245 bits (625), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 172/544 (31%), Positives = 270/544 (49%), Gaps = 62/544 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L+DVRL + S A+ ++ YLL LD D+L+ + K A L + Y WE + L G
Sbjct: 8 LNDVRL-TQSPFKHAEDLDVRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWE--NTGLDG 64
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT-----EQFDR 234
H GHY+SA + M+A+T +E +K+++ ++S L Q G GYL P E +
Sbjct: 65 HIGGHYVSALSYMYAATGDEEIKQRLDYMLSELKRAQDAAGDGYLCGAPNGRKIWEAVSK 124
Query: 235 LE------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
+ L W P Y IHK AGL D Y A + EA +++T WM+ N
Sbjct: 125 GDIQASSFGLNGGWVPLYNIHKTYAGLRDAYLLAGSKEARDMLVKLTDWMM--------N 176
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ K S E+ L E GG+N+V + +T +L LA F L L D +
Sbjct: 177 LTKDLSDEQIQDMLRSEHGGLNEVFADVADLTGKDNYLQLARRFSHREILDPLLEHEDRL 236
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ GD+ + + +E +IG HF+ D
Sbjct: 237 TGKHANTQIPKVIGYKRIADLQGDEGWDDAARFFWETVVERRSISIGGNSVREHFHPSED 296
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L++ + ++ Y DYYER+L N +L + G
Sbjct: 297 ---FSSMLTSEQGPETCNTYNMLRLTKMLYQTSADVHYMDYYERALYNHILSTIDPVQGG 353
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ +K G+ IY E + +Y+
Sbjct: 354 -FVYFTPMRSGH-----YRVYSQPQTSFWCCVGSGMENHAKYGEMIYGHSEDE---LYVN 404
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W G++ V Q ++ PY T S G ++ R+P WT +
Sbjct: 405 LFIPSVLQW--GKVRVEQ-----LTGFPYEEATTLHLSCGKAKEFTVKFRVPEWTDVSQM 457
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ T+NG P+ G +++V++ W+ D++ + LP++LR A+ D Y + +YGP
Sbjct: 458 ELTVNGTAQPVSVSGGYVTVSRKWADGDEVRLTLPMSLRVAALPDGSDNY----SFMYGP 513
Query: 632 YVLA 635
VLA
Sbjct: 514 IVLA 517
>gi|395493738|ref|ZP_10425317.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas sp. PAMC
26617]
Length = 646
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 179/554 (32%), Positives = 263/554 (47%), Gaps = 71/554 (12%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEE-- 172
L+ L DV LG AQ+ YLL LD D+++ FR A L YGGWE
Sbjct: 46 LQPFDLADVDLGEGPF-LHAQRKTEAYLLSLDPDRMLHAFRVNAGLKPKAAVYGGWESDP 104
Query: 173 --PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP-- 228
+GH +GHYLSA AL + ST + ++++ + L+ACQ SG + AFP
Sbjct: 105 IWADINCQGHTLGHYLSACALAYRSTRKPAFRQRIDHIARELAACQDAARSGLVCAFPKG 164
Query: 229 ---TEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNR 281
R +A+ V P+YT+HK+ AGL D AD+AE+ LR+ W V
Sbjct: 165 PALVAAHLRGDAITGV--PWYTLHKVFAGLRDATLMADSAESRAVLLRLADWAV------ 216
Query: 282 VQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQ 340
V + + ++T+ E E GGMN+V L+ +T +P + +A F L LA
Sbjct: 217 ---VATRPLSDAQFETMLETEHGGMNEVFADLYLMTGNPDYRTMAERFSHKALLTPLAAG 273
Query: 341 ADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGH-------QLESSGTNIGH------F 387
D + G H+NT +P ++G Q +E TG + E L S GH F
Sbjct: 274 RDQLDGLHANTQLPKIVGFQRVFEATGTPHYHEAAAFFWRTVALTRSFATGGHGDNEHFF 333
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
K + S + E+C +NMLK++R LF + YADYYER+L NG+L Q
Sbjct: 334 PMAEFDKHVFS---AKGSETCGQHNMLKLTRALFLQDPQAEYADYYERTLYNGILASQ-D 389
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
+ G++ Y PG K YH TP SFWCC GTG+E+ K DSIYF ++
Sbjct: 390 PDTGMVTYFQGARPGYMK--LYH---TPEHSFWCCTGTGMENHVKYRDSIYFHDDK---A 441
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+Y+ ++ S + W+ + + Q+ + P T + +L LR P W+
Sbjct: 442 LYVNLFVPSAVRWREKGVALRQE-----TRFPDAPTTTLHWTVERPTDVTLQLRHPRWSR 496
Query: 568 S-----NGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYA 622
S NG +A + +PG+++ + +TW S D + ++L + E + D P
Sbjct: 497 SAIVLVNGVEAARSD------TPGSYVKLARTWHSGDTVELRLAM----EVVPDQAPAAP 546
Query: 623 SIQAILYGPYVLAG 636
I A YGP VLAG
Sbjct: 547 DIVAFSYGPMVLAG 560
>gi|392554933|ref|ZP_10302070.1| Acetyl-CoA carboxylase, biotin carboxylase [Pseudoalteromonas
undina NCIMB 2128]
Length = 816
Score = 244 bits (622), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 177/547 (32%), Positives = 267/547 (48%), Gaps = 60/547 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
L++VSL S S AQQTN+ YLL L D+L+ + + A + YG WE+
Sbjct: 51 LEQVSL------SASPFLHAQQTNVRYLLALHPDQLLAPYLREAGIEPKASSYGNWEDSG 104
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQF-- 232
L GH GHYLSA +L WA+T +E LK ++ +++ L Q ++ GYL P Q
Sbjct: 105 --LDGHIGGHYLSALSLAWAATGDEELKRRLDYMLNELQRAQ-QVNDGYLGGIPNGQAMW 161
Query: 233 ---------DRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
L +L W P Y I KI GL D Y A + +A M + E+F N
Sbjct: 162 QQIHDGNIKADLFSLNDRWVPLYNIDKIFHGLRDAYLIAGSEQAKTMLFGLGEWFLN--- 218
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ K S E+ Q L E GG+N V + I D ++L LA F + L + D
Sbjct: 219 -LTSKLSDEQIQQMLYSEYGGLNAVFADMATIGNDKRYLKLARQFTHHSIVDPLLKKQDK 277
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQL------ESSGTNIG------HFNFKS 391
++G H+NT IP +IG E + D+ ++G + IG HF+ K
Sbjct: 278 LTGLHANTQIPKIIGMLKVAETSDDEAWQQGADYFWQTVTKERSVAIGGNSVREHFHDKK 337
Query: 392 DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
D + +++ E+C TYNM+K+S+ LF T + Y +YYER+ N +L Q E G
Sbjct: 338 DFTAMVEDVEG--PETCNTYNMMKLSKLLFLKTADTRYLEYYERATYNHILSSQH-PEHG 394
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
++Y P+ PG Y + + DS WCC G+GIE+ SK G+ IY + + +++
Sbjct: 395 GLVYFTPMRPG-----HYRMYSSVQDSMWCCVGSGIENHSKYGELIYSKNDDN---LWVN 446
Query: 512 QYISSRLDWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSS--KGSGLTTSLNLRIPTWTSS 568
+ISS LDW+ + V Q+ P + VTL F++ K L++R P+W +
Sbjct: 447 LFISSTLDWQQQGLKVTQQSHFPDAN-----NVTLVFNTLDKKDNSPAQLHIRKPSWITG 501
Query: 569 NGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAIL 628
+ + LNG+ + + + ++ W DKLT L L TE + D + Y A+L
Sbjct: 502 D-LQFKLNGKPINATAEQGYYAIKHDWHDGDKLTFTLAPKLYTEQLPDGQDYY----AVL 556
Query: 629 YGPYVLA 635
YGP V+A
Sbjct: 557 YGPVVMA 563
>gi|404450474|ref|ZP_11015456.1| hypothetical protein A33Q_14151 [Indibacter alkaliphilus LW1]
gi|403763872|gb|EJZ24792.1| hypothetical protein A33Q_14151 [Indibacter alkaliphilus LW1]
Length = 782
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 169/562 (30%), Positives = 285/562 (50%), Gaps = 53/562 (9%)
Query: 105 FKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG 164
F+ + G+ ++ L V+L DS RAQ+ + +Y+L +DVD+L+ + K A L
Sbjct: 18 FQQAKAQGDQVQFFDLRQVKL-KDSPFKRAQEVDKKYILEMDVDRLLAPYMKEAGLTWSA 76
Query: 165 EPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYL 224
+ YG WE + L GH GHYLSA +LM+AST + + +++ ++ L Q + G GYL
Sbjct: 77 DNYGNWE--NTGLDGHIGGHYLSALSLMFASTGDPEINKRLDYMLEQLKHAQDQSGDGYL 134
Query: 225 SAFP--TEQFDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTW 273
S P + ++ L++ L W P Y IHKI AGL D Y A M
Sbjct: 135 SGVPYGRKIWNELKSGKINAGNFSLNDRWVPLYNIHKIFAGLRDAYWIGGKEIAKPMLVS 194
Query: 274 MVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCF 333
+ ++F + + ++ ++ + L E GG+N+V + +T D K+L LA
Sbjct: 195 LSDWFLD----LTDGFTEDQFQEMLISEHGGLNEVFADVAVMTGDSKYLSLAKKMSHNAI 250
Query: 334 LGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQ-LHKEG-----HQLESSGTNIG-- 385
L L + D+++G H+NT IP VIG Q +V+ DQ LH+ + + +IG
Sbjct: 251 LQPLKEEKDELNGLHANTQIPKVIGFQRIAQVSKDQNLHQASDFFWKNVVYQRSVSIGGN 310
Query: 386 ----HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGV 441
HF+ SD + S+ E+C TYNM+++S LF+ + Y DYYER++ N +
Sbjct: 311 SVREHFHPTSDFSSMLSS--EQGPETCNTYNMMRLSEMLFQLAPDRKYIDYYERAVFNHI 368
Query: 442 LGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
L Q + G +Y + P + Y + P ++FWCC G+G+E+ +K G +IY
Sbjct: 369 LSTQHPKKGG-FVYFTSMRP-----QHYRVYSQPHENFWCCVGSGLENHAKYGQAIY--- 419
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVT-LTFSSKGSGLTTSLNL 560
+ +Y+ +I+S LDW+ I + Q D PY + +TFS KG + +L +
Sbjct: 420 AYRKDDLYLNLFIASELDWEEKGIKLIQNTDF-----PYKDESEITFSHKGKK-SFNLKI 473
Query: 561 RIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
R P W + T+NG+ + + ++++ + W+S DK+ ++LP+ + E + P
Sbjct: 474 RYPNWVKEGMLEVTINGEQVEVSVDRHGYITLNREWTSKDKINLKLPMETKAERL----P 529
Query: 620 EYASIQAILYGPYVLAGHSIGD 641
+ ++ + +GP VL + D
Sbjct: 530 DGSNWVSFSHGPIVLGAKTGAD 551
>gi|94494954|ref|ZP_01301535.1| hypothetical protein SKA58_00635 [Sphingomonas sp. SKA58]
gi|94425220|gb|EAT10240.1| hypothetical protein SKA58_00635 [Sphingomonas sp. SKA58]
Length = 665
Score = 243 bits (619), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 184/552 (33%), Positives = 261/552 (47%), Gaps = 67/552 (12%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWE-EP 173
LK + DV L D AQ+ YLL L D+++ NFR A L YGGWE EP
Sbjct: 64 LKPFDMADVTL-DDGPFLHAQRMTETYLLRLQPDRMLHNFRINAGLKPKAPVYGGWESEP 122
Query: 174 S---CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE 230
+ GH +GHYLSA AL + ST + K+++ + S L+ACQK SG + AFP
Sbjct: 123 TWAEINCHGHTLGHYLSACALAYRSTRDRRFKQRLDYIASELAACQKAAHSGLICAFPDG 182
Query: 231 QFDRLEALI-------PVWA-PYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYF 278
AL+ P+ P+YT+HKI AGL D AD+ EA LR+ W V
Sbjct: 183 -----PALVAAHINGEPITGVPWYTLHKIYAGLRDAALLADSREAREVLLRLADWGVV-- 235
Query: 279 YNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLA 338
+ S + L E GGMN++ L+ +T ++ LA F + L
Sbjct: 236 ------ATRPLSDAQFEAMLATEHGGMNEIYADLYAMTGKEEYRTLARRFSHKAVMEPLV 289
Query: 339 LQADDISGFHSNTHIPIVIGSQMRYEVTGDQ-------------LHKEGHQLESSGTNIG 385
D + G H+NT +P ++G Q YE TGD H G N
Sbjct: 290 AGKDLLDGMHANTQVPKIVGFQRVYEETGDDRYAKAADFFFRTVAHTRSFATGGHGDN-E 348
Query: 386 HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ 445
HF +D + + + E+C +NMLK++R LF + YADYYER+L NG+L Q
Sbjct: 349 HFFAMADFE--SHVFSAKGSETCCQHNMLKLARLLFMQDPQADYADYYERTLYNGILASQ 406
Query: 446 RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
+ G+ Y PG K YH TP DSFWCC GTG+E+ K DSIYF ++
Sbjct: 407 -DPDSGMATYFQGARPGYMK--LYH---TPEDSFWCCTGTGMENHVKYRDSIYFHDDRS- 459
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
+Y+ ++ S + W + Q + L+ TL + + +L+LR P W
Sbjct: 460 --LYVSLFLPSAVQWADKGARLEQATSFPDTPSTSLKWTLR-----TPVEIALHLRHPRW 512
Query: 566 TSSNGAKATLNGQD-LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
+ + A +NG++ L +PG FL VT+ W D++ + L + E+ P +I
Sbjct: 513 SPT--ATVRVNGREVLRSTAPGRFLEVTRLWRDGDRVELTLDMMPGVESA----PAAPNI 566
Query: 625 QAILYGPYVLAG 636
A YGP VLAG
Sbjct: 567 VAFTYGPLVLAG 578
>gi|333378944|ref|ZP_08470671.1| hypothetical protein HMPREF9456_02266 [Dysgonomonas mossii DSM
22836]
gi|332885756|gb|EGK06002.1| hypothetical protein HMPREF9456_02266 [Dysgonomonas mossii DSM
22836]
Length = 787
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 176/555 (31%), Positives = 274/555 (49%), Gaps = 53/555 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
+K L D+ L DS RAQ + +YLL LD D+L+ F + A L E Y WE +
Sbjct: 26 IKYFDLKDITL-LDSPFKRAQDLDKKYLLDLDADRLLAPFIREAGLQKKAESYTNWE--N 82
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--F 232
L GH GHY+SA ALM+AST ++ +K+++ ++S L CQ E G+GY+ P + +
Sbjct: 83 TGLDGHIGGHYVSALALMYASTGDQQIKDRLDYMISELKRCQDENGNGYIGGVPGGKAIW 142
Query: 233 DRLE---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
D + L W P Y IHK AGL D Y A N A M M ++ V
Sbjct: 143 DEIAKGDIQASGFGLNNRWVPLYNIHKTYAGLRDAYLIAGNETAKDMLIKMTDWAVKLVS 202
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
N+ S E+ L E GG+N+ + ITQ+ K+L LAH F L L D
Sbjct: 203 NL----SEEQIQDMLRSEHGGLNETFADVAVITQNEKYLKLAHQFSHQLILNPLLAHEDK 258
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKS 391
++G H+NT IP V+G + ++ G++ E + +E IG HF+ +
Sbjct: 259 LTGLHANTQIPKVLGFKRIADIEGNESWSEASRFFWETVVEHRSVCIGGNSVREHFHPTN 318
Query: 392 DPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
D +S + SN E+C TYNML++S+ ++ + + Y DYYE++L N +L Q +
Sbjct: 319 D---FSSMITSNEGPETCNTYNMLRLSKMFYQTSLDKKYIDYYEKALYNHILSSQ-NPQT 374
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G ++Y + PG Y + P S WCC G+GIES +K G+ IY +Y+
Sbjct: 375 GGLVYFTQMRPG-----HYRVYSQPQTSMWCCVGSGIESHAKYGEMIYAHTSD---ALYV 426
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+I S L+WK + + Q D + +T+ K ++ +R P+W
Sbjct: 427 NLFIPSLLNWKDRNVEIVQ--DNKFPDESKTEITVNPKKKSE---FTVYVRYPSWVEKGT 481
Query: 571 AKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
K LNG+ P ++ + +TW D+++++LP+T+ E + P+ ++ + YG
Sbjct: 482 MKIKLNGKTYPGVEKDGYIGIKRTWQKGDRISVELPMTIVAEQL----PDKSNYYSFRYG 537
Query: 631 PYVLAGHSIGDWDIT 645
P VLA + G D+T
Sbjct: 538 PIVLAAKT-GVEDMT 551
>gi|308067040|ref|YP_003868645.1| hypothetical protein PPE_00225 [Paenibacillus polymyxa E681]
gi|305856319|gb|ADM68107.1| DUF1680 domain containing protein [Paenibacillus polymyxa E681]
Length = 752
Score = 242 bits (617), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 182/546 (33%), Positives = 279/546 (51%), Gaps = 55/546 (10%)
Query: 116 KEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC 175
K LH VR+ S + A + N YLL L+ D+L+ FR+ A L Y GWE
Sbjct: 4 KAFDLHKVRIDSGPL-LHAMELNTAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWEARG- 61
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFD 233
+ GH +GHYLS ALM+AST +E L E+++ VV L CQ G+GY+S P E F+
Sbjct: 62 -ISGHTLGHYLSGCALMFASTGDERLLERVNYVVDELEICQNSHGNGYISGIPRGKEIFE 120
Query: 234 RLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
++A L W P YT+HK+ AGL D + A + +AL + + N +++
Sbjct: 121 EVKAGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLPAHHPKALSIEIKLG----NWLED 176
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
V++ ++ Q L+ E GGMN+VL L + + + L LA F L LA D +
Sbjct: 177 VLQGLDDDQVQQVLHCEFGGMNEVLTDLAEHSGEERFLSLAERFYHGEVLNDLADSQDTL 236
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQ-------------LHKEGHQLESSGTNIGHFNFKS 391
+G H+NT IP +IG+ ++E+TG +HK + + + N HF
Sbjct: 237 AGRHANTQIPKIIGAARQFEMTGKPQYADLSRFFWDRVVHKHSYVIGGNSYN-EHF---G 292
Query: 392 DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+P +L L T E+C TYNMLK++RH+F W AYADYYER++ N +L Q+ + G
Sbjct: 293 EPGKLNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-G 351
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+ Y + L G K + + + F CC G+G+ES S G +IYF +Y+
Sbjct: 352 RVCYFVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTPET---IYVN 403
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
QY+ S + W ++ V K D + + R TL SK + ++ LR P W + G
Sbjct: 404 QYVPSTVTWD--EMGVQLKQDTLFPQNG--RGTLRVISK-EPKSFAIKLRCPHW-AEQGM 457
Query: 572 KATLNGQD-LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NG+ + P +++ + + WS+ D + +P+T+R E + P+ A +YG
Sbjct: 458 MIKINGEKYVTEACPTSYVVMEREWSNGDTIEYDIPMTVRVEEM----PDNPRRVAFMYG 513
Query: 631 PYVLAG 636
P VLAG
Sbjct: 514 PLVLAG 519
>gi|224540696|ref|ZP_03681235.1| hypothetical protein BACCELL_05610 [Bacteroides cellulosilyticus
DSM 14838]
gi|224517692|gb|EEF86797.1| hypothetical protein BACCELL_05610 [Bacteroides cellulosilyticus
DSM 14838]
Length = 782
Score = 241 bits (616), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 167/549 (30%), Positives = 272/549 (49%), Gaps = 60/549 (10%)
Query: 116 KEVS---LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEE 172
+EVS L DV+L +S +AQQT+L Y++ ++ D+L+ F + A L Y WE
Sbjct: 24 QEVSYFPLQDVKL-LESPFLQAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWE- 81
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TE 230
+ L GH GHY+SA ++M+A+T + ++ +++ +++ L Q+ +G+G++ P +
Sbjct: 82 -NTGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQ 140
Query: 231 QFDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEY 277
+ ++A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLVALTDWMID- 199
Query: 278 FYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLL 337
+ + ++ L E GG+N+ + IT D K+L LA F L L
Sbjct: 200 -------ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPL 252
Query: 338 ALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNIGHF 387
D ++G H+NT IP VIG + ++ DQ H+ G N
Sbjct: 253 VKDEDRLTGMHANTQIPKVIGYKRIADLAQDQNWDHAARFFWNTVVNHRSVCIGGNSVRE 312
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
+F + D E+C TYNML++++ L++ + +I +ADYYER+L N +L Q+
Sbjct: 313 HFHPADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNHILASQQP 372
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
T+ G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY +
Sbjct: 373 TKGG-FVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYAHAKDT--- 423
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+Y+ +I SRL WK +I + Q+ RV K SL LR P+W
Sbjct: 424 LYVNLFIPSRLTWKDKKITLVQETRFPDEEQIRFRV-----EKSKKKAFSLKLRYPSW-- 476
Query: 568 SNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
+ GA ++NG+ PG +L++ + W + D++T+ +P+ + E I P+ + A
Sbjct: 477 AKGASVSVNGKVQETNAQPGEYLTIHRKWKAGDEITLNMPMQVALEQI----PDRENFYA 532
Query: 627 ILYGPYVLA 635
+YGP VLA
Sbjct: 533 FMYGPIVLA 541
>gi|431795908|ref|YP_007222812.1| hypothetical protein Echvi_0518 [Echinicola vietnamensis DSM 17526]
gi|430786673|gb|AGA76802.1| hypothetical protein Echvi_0518 [Echinicola vietnamensis DSM 17526]
Length = 784
Score = 241 bits (615), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 171/547 (31%), Positives = 268/547 (48%), Gaps = 53/547 (9%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L VRL S S AQQ ++ Y+ ++VD+L+ + A + + Y WE + L G
Sbjct: 33 LDQVRL-SPSPFLNAQQVDMTYMKAMEVDRLLAPYMLEAGVDWAADRYPNWE--NTGLDG 89
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT-----EQFDR 234
H GHYLSA A+M+AST + +K +M +V L+ Q + G+GY+ P E+ +
Sbjct: 90 HIGGHYLSALAMMYASTGDAEMKRRMDYMVEQLAMAQAKNGNGYVGGIPGGMAMWEEIGQ 149
Query: 235 LE------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
E +L W P Y IHKI AGL D Y NA+A + + ++FY + K
Sbjct: 150 GEIDAGGFSLNQKWVPLYNIHKIYAGLRDAYLIGGNAQAKEVLLDLTDWFY----ELTKG 205
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
+ E+ Q L E GG+N+V + IT + K+L LA L L Q D ++G H
Sbjct: 206 LTDEQFQQMLVSEHGGLNEVFADVAAITGEAKYLELAKKMSHEWLLEPLEEQEDKLTGMH 265
Query: 349 SNTHIPIVIGSQMRYEVTGDQLH-KEGHQ------LESSGTNIG------HFNFKSDPKR 395
+NT IP VIG Q R GD +E +E+ IG HF+ + D
Sbjct: 266 ANTQIPKVIGFQ-RVAQEGDLAEWQEAADFFWHTVVENRTVAIGGNSVREHFHPEDDFSP 324
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
+ S+ + E+C TYNML++S LF + Y D++ER L N +L Q E G +Y
Sbjct: 325 MVSS--NQGPETCNTYNMLRLSEQLFMSNPQAEYVDFFERGLYNHILSSQH-PEKGGFVY 381
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P+ P Y + P FWCC G+G+E+ +K G+ IY E + +YI +I
Sbjct: 382 FTPMRP-----EHYRVYSQPQQGFWCCVGSGLENHAKYGEFIYAHSEEE---LYINLFIP 433
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S L+W+ +V+ Q + +P + TF + LR P+W + + ++
Sbjct: 434 SELNWEEKGMVLTQTNN--FPEEP--QSVFTFEMD-KARKMPVKLRYPSWVAEGALQVSV 488
Query: 576 NGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
NG+ + SP +++++ + W D+L ++LP+ ++ E + P+ + A +YGP VL
Sbjct: 489 NGRPFEVNASPSSYITINRKWKDGDRLEVKLPMEMQWEQL----PDGSDWGAFVYGPIVL 544
Query: 635 AGHSIGD 641
A D
Sbjct: 545 AAMEGSD 551
>gi|423224675|ref|ZP_17211143.1| hypothetical protein HMPREF1062_03329 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392635115|gb|EIY29021.1| hypothetical protein HMPREF1062_03329 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 782
Score = 241 bits (614), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 167/549 (30%), Positives = 272/549 (49%), Gaps = 60/549 (10%)
Query: 116 KEVS---LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEE 172
+EVS L DV+L +S +AQQT+L Y++ ++ D+L+ F + A L Y WE
Sbjct: 24 QEVSYFPLQDVKL-LESPFLQAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWE- 81
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TE 230
+ L GH GHY+SA ++M+A+T + ++ +++ +++ L Q+ +G+G++ P +
Sbjct: 82 -NTGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQ 140
Query: 231 QFDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEY 277
+ ++A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLVALTDWMID- 199
Query: 278 FYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLL 337
+ + ++ L E GG+N+ + IT D K+L LA F L L
Sbjct: 200 -------ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPL 252
Query: 338 ALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNIGHF 387
D ++G H+NT IP VIG + ++ DQ H+ G N
Sbjct: 253 VKDEDCLTGMHANTQIPKVIGYKRIADLAQDQNWDHAARFFWNTVVNHRSVCIGGNSVRE 312
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
+F + D E+C TYNML++++ L++ + +I +ADYYER+L N +L Q+
Sbjct: 313 HFHPADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNHILASQQP 372
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
T+ G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY +
Sbjct: 373 TKGG-FVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYAHAKDT--- 423
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+Y+ +I SRL WK +I + Q+ RV K SL LR P+W
Sbjct: 424 LYVNLFIPSRLTWKEKKITLVQETRFPDEEQIRFRV-----EKSKKKAFSLKLRYPSW-- 476
Query: 568 SNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
+ GA ++NG+ PG +L++ + W + D++T+ +P+ + E I P+ + A
Sbjct: 477 AKGASVSVNGKVQETNAQPGEYLTIHRKWKAGDEITLNMPMQVALEQI----PDRENFYA 532
Query: 627 ILYGPYVLA 635
+YGP VLA
Sbjct: 533 FMYGPIVLA 541
>gi|336319285|ref|YP_004599253.1| hypothetical protein Celgi_0157 [[Cellvibrio] gilvus ATCC 13127]
gi|336102866|gb|AEI10685.1| protein of unknown function DUF1680 [[Cellvibrio] gilvus ATCC
13127]
Length = 1577
Score = 240 bits (613), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 191/610 (31%), Positives = 284/610 (46%), Gaps = 82/610 (13%)
Query: 84 EEQDELFSWAMLYRKIKNPGQFK-----VPERSGEF-LKEVSLHDVRLGSDSMHWRAQQT 137
+E+D + R + P + VP E L++ L D+ L +D+ A
Sbjct: 331 DEEDATVTLTATVRYLGGPAVTRTFTVTVPADLTEHALQDSGLEDLYL-TDAYLTNAAAK 389
Query: 138 NLEYLLMLDVDKLVWN-FRKTARLPAPGEPYGGWEEPSC-ELRGHFVGHYLSASALMWAS 195
EYLL L +K ++ +R P YGGWE RGH GHY+SA + +++
Sbjct: 390 EHEYLLSLSSEKFLYEWYRNVGLTPTTTSGYGGWERSDVTNFRGHAFGHYMSALSQSYSA 449
Query: 196 THNES----LKEKMSAVVSALSACQKEIGS------GYLSAFPTEQFDRLEAL----IPV 241
T + + L E++ V+ L+ Q + GY+SAFP D ++ V
Sbjct: 450 TADATTKAALLEQVEDAVAGLTLVQDTYAAAHPASAGYVSAFPESALDAVDGTGTTTDKV 509
Query: 242 WAPYYTIHKILAGLLDQYTY---ADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTL 298
P+Y +HK+LAGLLD + Y A A+AL + + EY Y R+ + + + L
Sbjct: 510 LVPWYNLHKVLAGLLDIHDYVGGATGAQALDIASQFGEYTYQRISRLTDRTRM------L 563
Query: 299 NEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIG 358
E GGMND LY+L+ +T DP A FD+ LA D ++G H+NT IP +IG
Sbjct: 564 RTEYGGMNDALYRLYDLTDDPHVKTAAEAFDETALFTQLAAGQDVLNGKHANTTIPKLIG 623
Query: 359 SQMRYEV---TGDQLHK-----------------------EGHQLESSGTNIGHFNFKSD 392
+ RY V D+L H ++G+N +F D
Sbjct: 624 ALKRYTVFTSDADRLASLTEAERAQLPTYLAAAEEFWQITVDHHTYATGSNSQSEHFH-D 682
Query: 393 PKRL-------ASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ 445
P L ++ T E+C YNMLK+SR LF+ TK++ YA YYE + N VL Q
Sbjct: 683 PDSLHEFATQQGETGNAQTSETCNEYNMLKLSRELFKLTKDVKYAHYYENTFINTVLASQ 742
Query: 446 RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
+ G+ Y P+A G +R Y P FWCC GTG+ESFSKLGDS+YF +
Sbjct: 743 N-PDTGMTTYFQPMAAG--YDRIYSM---PYTEFWCCTGTGMESFSKLGDSMYFTDRRS- 795
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
VY+ + SSR D+ + + Q+ D RV + + TT L LR+P W
Sbjct: 796 --VYVTMFFSSRFDYAEQNLRLTQEADLPSDDTVTFRVAAIDGDQVADGTT-LRLRVPQW 852
Query: 566 TSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
A T+NG+ + P V + ++ D +T ++P+ ++ A D+ P +A
Sbjct: 853 I-DGAATLTVNGEAV-TPQVVRGFVVLEGVAAGDVITYRMPMKVQAHAAPDN-PTWA--- 906
Query: 626 AILYGPYVLA 635
A YGP VL+
Sbjct: 907 AFSYGPVVLS 916
>gi|16126789|ref|NP_421353.1| hypothetical protein CC_2550 [Caulobacter crescentus CB15]
gi|221235569|ref|YP_002518006.1| membrane-bound glycosyl hydrolase [Caulobacter crescentus NA1000]
gi|13424115|gb|AAK24521.1| conserved hypothetical protein [Caulobacter crescentus CB15]
gi|220964742|gb|ACL96098.1| membrane-bound glycosyl hydrolase [Caulobacter crescentus NA1000]
Length = 786
Score = 240 bits (613), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 169/537 (31%), Positives = 268/537 (49%), Gaps = 57/537 (10%)
Query: 129 SMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSA 188
S+ +AQ N YL+ L D+L+ NF A LP YGGWE S + GH +GHYLSA
Sbjct: 59 SIFAQAQGANRAYLVSLQPDRLLHNFHLGAGLPVKAPVYGGWEAQS--IAGHTLGHYLSA 116
Query: 189 SALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLS-------AFPT------EQFDRL 235
AL A+ + L ++++ V+ L+ Q G GY+ A P E+ R
Sbjct: 117 CALQVANDGDPVLSQRLAYTVAQLARVQAAHGDGYVGGTTRWGQADPVGGKAVFEELRRG 176
Query: 236 E------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKY 289
+ +L W P YT HKI AGLLD + A AL + + Y +++
Sbjct: 177 DIRANRFSLNDGWVPIYTWHKIHAGLLDAHRLAATPGALDVALGLAGYL----ATILEGL 232
Query: 290 SIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHS 349
+ ++ L E GG+ + + + +T DP+ L +A + LA D+++G H+
Sbjct: 233 NDDQVQAILVAEHGGLCEAYAETYALTGDPRWLNIARRLRHRELVDPLAQGRDELAGLHA 292
Query: 350 NTHIPIVIGSQMRYEVTGDQLHKEG----HQLESS------GTNIGHFNFKSDPKRLASN 399
NT IP +IG YEV GD HQ + G N +F P +A+
Sbjct: 293 NTQIPKIIGLARLYEVAGDPAEARTARFFHQTVTRRHSYAIGGNSDREHF-GPPDAIATR 351
Query: 400 LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL 459
L T E+C +YNMLK++R L+ W + A D YER+ N ++ QR ++ G+ +Y +P+
Sbjct: 352 LSETTCEACNSYNMLKLTRRLWSWAPDGALFDDYERAQLNHIMAHQRPSD-GMFVYFMPM 410
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
A G RSY TP DSFWCC G+G+ES +K DSI++ +Y+ +I+SRLD
Sbjct: 411 AAGG--RRSYS---TPEDSFWCCVGSGMESHAKHADSIWWRGGQT---LYLNLFIASRLD 462
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
++ +D + +T+T + +G + LR+P W ++ + ++NG
Sbjct: 463 LPGDDFAID--LDTAFPQSGQVDLTVTRAPRG---LREIALRLPAWCAA--PRLSVNGAP 515
Query: 580 LPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
P+ + G+ + +++ W + D++T+ LP+ +R E DD ++ A L GP VLA
Sbjct: 516 TPIQTRGDGYARLSRRWKAGDRVTLMLPMAVRAEPTPDD----PNLVAFLSGPLVLA 568
>gi|375306379|ref|ZP_09771677.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus sp.
Aloe-11]
gi|375081632|gb|EHS59842.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus sp.
Aloe-11]
Length = 753
Score = 240 bits (613), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 178/546 (32%), Positives = 272/546 (49%), Gaps = 63/546 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
LH V + S + + A + N YLL L+ D+L+ FR+ A L Y GWE + G
Sbjct: 10 LHKVSIDSGPL-YHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWEARG--ISG 66
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA 237
H +GHYLS +LM+A+T +E L E++S V+ L CQ G+GY+S P E F+ ++A
Sbjct: 67 HTLGHYLSGCSLMYAATGDERLLERVSYVIDELEICQNNHGNGYISGIPRGKEIFEEVKA 126
Query: 238 ---------LIPVWAPYYTIHKILAGLLDQYTYADNAEAL----RMTTWMVEYFYNRVQN 284
L W P YT+HK+ AGL D + A + +AL ++ W+ ++
Sbjct: 127 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLAHHPKALPIEIKLGAWL--------ED 178
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
V + E+ + L+ E GGMN+VL L + + + L LA F L LA D +
Sbjct: 179 VFRGLDDEQMQRVLHCEFGGMNEVLTDLAEHSGEERFLKLAERFYHGEVLNDLADSRDTL 238
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQ-------------LHKEGHQLESSGTNIGHFNFKS 391
+G H+NT IP +IG+ +YEVTG +HK + + + N HF
Sbjct: 239 AGRHANTQIPKIIGAARQYEVTGKPHYADLSRFFWDRVVHKHSYVIGGNSYN-EHF---G 294
Query: 392 DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+P +L L T E+C TYNMLK++RH+F W AYADYYER++ N +L Q+ + G
Sbjct: 295 EPGKLNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-G 353
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+ Y + L G K + + + F CC G+G+ES S G +IYF +Y+
Sbjct: 354 RVCYFVSLEMGGHKT-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTPQT---IYVN 405
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
QY+ S + W + + Q+ + LRV S K T + LR P W + G
Sbjct: 406 QYVPSTVTWDDMDVQLKQETLFPQTGRGTLRV---ISKKPQSFT--IKLRCPHW-AEQGM 459
Query: 572 KATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NG+ + P +++ + + W D + +P+T+R E + P+ A +YG
Sbjct: 460 IIKINGEAFTAEACPTSYVVIEREWKDGDTVEYDIPMTVRIEEM----PDNPRRIAFMYG 515
Query: 631 PYVLAG 636
P VLAG
Sbjct: 516 PLVLAG 521
>gi|390456178|ref|ZP_10241706.1| hypothetical protein PpeoK3_19346 [Paenibacillus peoriae KCTC 3763]
Length = 753
Score = 240 bits (612), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 177/542 (32%), Positives = 272/542 (50%), Gaps = 55/542 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
LH V + S + A + N YLL L+ D+L+ FR+ A L Y GWE + G
Sbjct: 10 LHKVSIDSGPL-CHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWEARG--ISG 66
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA 237
H +GHYLS +LM+AST +E L E+++ V+ L CQ G+GY+S P E F+ ++A
Sbjct: 67 HTLGHYLSGCSLMYASTGDERLLERVNYVIDELEICQNSHGNGYISGIPRGKEIFEEVKA 126
Query: 238 ---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
L W P YT+HK+ AGL D Y + +AL M + ++ +++V +
Sbjct: 127 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLVHHPKALPMEIKLGDW----LEDVFRG 182
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
E+ + L+ E GGMN+VL L + + + L LA F L LA D ++G H
Sbjct: 183 LDDEQMQRVLHCEFGGMNEVLTDLAEHSGEERFLKLAERFYHGEVLNDLADSRDTLAGRH 242
Query: 349 SNTHIPIVIGSQMRYEVTGDQ-------------LHKEGHQLESSGTNIGHFNFKSDPKR 395
+NT IP +IG+ +YEVTG +HK + + + N HF +P +
Sbjct: 243 ANTQIPKIIGAARQYEVTGKPHYADLSRFFWDRVVHKHSYVIGGNSYN-EHF---GEPGK 298
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
L L T E+C TYNMLK++RH+F W AYADYYER++ N +L Q+ + G + Y
Sbjct: 299 LNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-GRVCY 357
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
+ L G K + + + F CC G+G+ES S G +IYF +Y+ QY+
Sbjct: 358 FVSLEMGGHKS-----FNSQYEDFTCCVGSGMESHSMYGTAIYFHTPQT---IYVNQYVP 409
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S + W + + Q+ + R TL SK + ++ LR P W + G +
Sbjct: 410 STVTWDEMDVQLKQE----TLFPQTGRGTLCVISKKP-QSFTIKLRCPYW-AEQGMIIKI 463
Query: 576 NGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
NG+ + P +++ + + W D + +P+T+R E + P+ A +YGP VL
Sbjct: 464 NGEAFAAEACPTSYVVIEREWKDGDTVEYDIPMTVRIEEM----PDNPRRIAFMYGPLVL 519
Query: 635 AG 636
AG
Sbjct: 520 AG 521
>gi|226325822|ref|ZP_03801340.1| hypothetical protein COPCOM_03635 [Coprococcus comes ATCC 27758]
gi|225205946|gb|EEG88300.1| hypothetical protein COPCOM_03635 [Coprococcus comes ATCC 27758]
Length = 761
Score = 239 bits (611), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 166/543 (30%), Positives = 271/543 (49%), Gaps = 51/543 (9%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEP-YGGWEEPSCELR 178
L VRL +++++ Q+ EYLL +D D++++NFRK L G P GW+E SC+L+
Sbjct: 198 LGQVRLKEGTLYYKYQKLMEEYLLGIDDDQMLYNFRKATGLDTKGAPPMTGWDEESCKLK 257
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS------GYLSAFPTEQF 232
GH GHYLS AL +A+T N +K++ +V+ L CQ + G+LSA+ EQF
Sbjct: 258 GHTTGHYLSGIALAFAATGNLKFLDKVNYMVAELKKCQDAFAATGKYHRGFLSAYSEEQF 317
Query: 233 DRLEALIP---VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKY 289
D LE +WAPYYT+ KI++GL D + A N A + M ++ Y+R+ + K+
Sbjct: 318 DLLEVYTKYPEIWAPYYTLDKIMSGLYDCHVLAGNETAKEILDLMGDWVYDRLSRLPKE- 376
Query: 290 SIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
++++ W + E GGM + K++ +T HL A LF+ + + D + H
Sbjct: 377 TLDKMWAMYIAGEFGGMLGTMVKVYELTGKENHLKAAKLFENEKLFYPMEEECDTLEDMH 436
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHKE-----------GHQLESSGTNIGHFNFKSDPKRLA 397
+N HIP +IG+ Y TGD+++ E GH G +G
Sbjct: 437 ANQHIPQIIGAMDLYRATGDEIYWEIGKNFWNIVTGGHTYCIGG--VGETEMFHRANTTC 494
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
S L ESC +YNML+++ LF +T+ DYY+ +L N +L G Y L
Sbjct: 495 SYLTDKAAESCASYNMLRLTSQLFEYTRSGNLMDYYDNTLRNHILTSSSHKCDGGTTYFL 554
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL PG KE + +S CC+GTG+ES + ++IY ++E +YI + S
Sbjct: 555 PLGPGGRKE-----FFLSENS--CCHGTGMESRFRYMENIYAQDE---DALYINLLVDSV 604
Query: 518 LDWKSGQIVVN-QKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
L ++G+ ++ Q VD + + + K L + IP W + ++N
Sbjct: 605 LTDENGKTMIELQSVDE----EGVMEIRCQKDQK-----KVLKIHIPAWGQKD-FNVSVN 654
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G+ L + + +L + + D + ++LP+ R + D++ + A + + YGPY+LA
Sbjct: 655 GKVLANTALHDGYLVIDADPKAGDVIRLELPMEFR---VLDNKSDAAFVN-LAYGPYILA 710
Query: 636 GHS 638
S
Sbjct: 711 ALS 713
>gi|336425130|ref|ZP_08605160.1| hypothetical protein HMPREF0994_01166 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013039|gb|EGN42928.1| hypothetical protein HMPREF0994_01166 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 628
Score = 239 bits (611), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 164/548 (29%), Positives = 260/548 (47%), Gaps = 70/548 (12%)
Query: 141 YLLMLDVDKLVWNFRKTARLPAPGEP----YGGWEEPSCELRGHFVGHYLSASALMWAST 196
Y++ L+ L+ NF + E +GGWE P+C+LRGHF+GH+LSA+A+ + +T
Sbjct: 32 YMMHLENRFLLLNFNLESGRDTSAEAIEGMHGGWEFPTCQLRGHFLGHWLSAAAMHYHAT 91
Query: 197 HNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLL 256
+ LK K +V L+ CQKE G + + P + R+ VWAP+YTIHK+ GLL
Sbjct: 92 GDRELKAKADTLVEELAECQKENGGKWAAPIPEKYLYRIAEGKQVWAPHYTIHKVFMGLL 151
Query: 257 DQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCIT 316
D Y YA NA AL + ++FY+ K +S + L+ E GGM ++ +L+ IT
Sbjct: 152 DMYEYAGNAIALEIAENFADWFYDWT----KDFSRDEMDDILDFETGGMLEIWVQLYAIT 207
Query: 317 QDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK---- 372
K+ L + + L D ++ H+NT IP +IG Y+VTGD+ +
Sbjct: 208 GKDKYAALMERYYRGRLFDPLLKGEDVLTNMHANTTIPEIIGCARAYDVTGDEKWRKIAE 267
Query: 373 --------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWT 424
+ Q + G G S K+L + L +E CT YNM++++ LFRW+
Sbjct: 268 NYWDLAVTQRGQYATGGQTCG--EIWSPKKKLGARLGLKGQEHCTVYNMIRLAGFLFRWS 325
Query: 425 KEIAYADYYERSLTNGVLG-------IQRG-TEP----GVMIYLLPLAPGSSKERSYHHW 472
+ AY DY E+ L NG++ + G T P G++ Y LP+ G K W
Sbjct: 326 LDPAYLDYQEKLLYNGLMAQAYWQSNLSHGFTSPYPSKGLLTYFLPMQAGGRK-----GW 380
Query: 473 GTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW--KSGQIVVNQK 530
+ + F+CC+GT +++ + IY++ E +YI QY+ S++ + ++ + QK
Sbjct: 381 SSKTGDFFCCHGTLVQANAAFNRGIYYQSEDS---LYICQYLDSQVSFSVNDSRVTILQK 437
Query: 531 VDPVV----------SWDPYLRVTLTFSSKGSGLT------------TSLNLRIPTWTSS 568
DP+ + L T + S+ L +L LRIP W +
Sbjct: 438 ADPLTGSSHLASTSSARQSVLEDTRKYPSQPDCLVPCLKMELEKETEMTLQLRIPGWLAG 497
Query: 569 NGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAIL 628
+ + F+ + + W D + I LP ++T + PE + A L
Sbjct: 498 EAVILINDTEVYRSNDSCLFVPLKRVWKDGDIIRILLPKAVKTFPL----PEDENTVAFL 553
Query: 629 YGPYVLAG 636
YGP VLAG
Sbjct: 554 YGPVVLAG 561
>gi|322433089|ref|YP_004210338.1| hypothetical protein AciX9_4244 [Granulicella tundricola MP5ACTX9]
gi|321165316|gb|ADW71020.1| protein of unknown function DUF1680 [Granulicella tundricola
MP5ACTX9]
Length = 800
Score = 239 bits (611), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 172/567 (30%), Positives = 270/567 (47%), Gaps = 60/567 (10%)
Query: 102 PGQFKVPERSGEFLKEVSL--HDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTAR 159
P F P LK V L + VRL + +AQ + +YLL L ++++ R+ A
Sbjct: 19 PSAFCAPAPHKVQLKAVPLPLNSVRLTGGPLK-KAQDLDAQYLLELQPERMLAFLRQRAG 77
Query: 160 LPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
L A + YGGW+ P +L GH GHYLSA ++M+A+T + KE+ V+ L Q
Sbjct: 78 LEAKAQGYGGWDGPGRQLTGHIAGHYLSAISMMYATTGDVRFKERADEFVAELQTIQNAQ 137
Query: 220 GSGYLSAF-------PTEQFDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYAD 263
G GY+ A +F L L +W+P+Y HK+ AGL D Y
Sbjct: 138 GDGYIGALLDAKGVDGKVKFQDLSKGEIKSGGFDLDGLWSPWYVEHKLFAGLRDAYHLTG 197
Query: 264 NAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLM 323
+ AL + F V+ ++K + ++ + L E GGMN+VL L+ T D + +
Sbjct: 198 DRTALEVEI----EFAGWVEGILKNLNEDQIQRMLATEFGGMNEVLADLYADTNDTRWMK 253
Query: 324 LAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG--------- 374
L+ F+ + L+ D ++G H+NT+IP +IG RYE TGD+ K+G
Sbjct: 254 LSDKFEHHAIVDPLSQGQDILAGKHANTNIPKMIGELARYEYTGDE--KDGKAANFFFDE 311
Query: 375 ----HQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYA 430
H + G G + P ++ +D T ESC YNM+K++R LF + YA
Sbjct: 312 VSLHHSFATGGD--GKNEYFGQPDKMNDMIDGRTAESCAAYNMIKMARTLFSLDPQARYA 369
Query: 431 DYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESF 490
D+ ER+ N +LG Q + G + Y++P+ G H + +SF CC G+ +E+
Sbjct: 370 DFVERADLNAILGGQD-PDDGRVSYMVPVGRGVQ-----HEYQNKFESFTCCVGSQMETH 423
Query: 491 SKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSK 550
+ IY E K +++ QY + +DW S + + D + L++T
Sbjct: 424 AFHAYGIYNESGNK---LWVSQYDPTTVDWASQGVKLEMVTDLPMGDTATLKMT-----S 475
Query: 551 GSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTL 609
G +L LR P W +S G +NG L + P ++ + + W D + + LP TL
Sbjct: 476 GQSKVFTLALRRPYWATS-GFAVKVNGVLLKNVSGPDTYIEINRRWKVGDAVEVVLPKTL 534
Query: 610 RTEAIQDDRPEYASIQAILYGPYVLAG 636
R E + P+ + AI++GP VLAG
Sbjct: 535 RKEPL----PDNPNRMAIMWGPLVLAG 557
>gi|330996333|ref|ZP_08320217.1| hypothetical protein HMPREF9442_01300 [Paraprevotella xylaniphila
YIT 11841]
gi|329573383|gb|EGG54994.1| hypothetical protein HMPREF9442_01300 [Paraprevotella xylaniphila
YIT 11841]
Length = 811
Score = 239 bits (610), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 169/544 (31%), Positives = 265/544 (48%), Gaps = 62/544 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L+DVRL A+ ++ YLL LD D+L+ + K A L + Y WE + L G
Sbjct: 57 LNDVRLTQGPFK-HAEDLDIRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWE--NTGLDG 113
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--FDRLE- 236
H GHY+SA A M+A+T NE +K+++ ++S Q G GYL P + +D +
Sbjct: 114 HIGGHYVSALAYMYAATGNEEIKQRLDYMLSEWKRAQDAAGDGYLCGAPNGRKIWDAVSK 173
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQN 284
L W P Y IHK AGL D Y A A+A +++T WM+ N
Sbjct: 174 GDIQASSFGLNGGWVPLYNIHKTYAGLRDAYVVAGCAQAKDMLVKLTDWMM--------N 225
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ K S E+ L E GG+N+V + +T ++ LA F L L Q D +
Sbjct: 226 LTKDLSDEQIQDMLRSEHGGLNEVFADVADLTGKDGYMQLARRFSHREILDPLLKQEDQL 285
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSD 392
+G H+NT IP VIG + ++ GD+ + + ++ +IG HF+ D
Sbjct: 286 TGKHANTQIPKVIGYKRIADLEGDESWDDAARFFWKTVVDQRSISIGGNSVREHFHPSED 345
Query: 393 PKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+S L S E+C TYNML++++ L++ + + Y DYYER+L N +L + G
Sbjct: 346 ---FSSMLTSEQGPETCNTYNMLRLTKMLYQTSADAHYMDYYERALYNHILSTIDPVQGG 402
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+Y P+ G Y + P SFWCC G+G+E+ +K G+ IY +Y+
Sbjct: 403 -FVYFTPMRSGH-----YRVYSQPQTSFWCCVGSGMENHAKYGEMIYAHGGDD---LYVN 453
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L W G++ V Q+ PY T S T ++ R+P WT ++
Sbjct: 454 LFIPSVLQW--GKVRVEQRTSF-----PYEEATTLRLSCSKAKTFTVKFRVPEWTDASRM 506
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ T+NG P+ G +++V++ W+ D++ + LP++LR + D Y + +YGP
Sbjct: 507 ELTVNGTAQPVSVSGGYVAVSRKWTDGDEVRLTLPMSLRAVVLPDGSDNY----SFMYGP 562
Query: 632 YVLA 635
VLA
Sbjct: 563 VVLA 566
>gi|239627978|ref|ZP_04671009.1| secreted protein [Clostridiales bacterium 1_7_47_FAA]
gi|239518124|gb|EEQ57990.1| secreted protein [Clostridiales bacterium 1_7_47FAA]
Length = 822
Score = 239 bits (610), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 169/566 (29%), Positives = 269/566 (47%), Gaps = 62/566 (10%)
Query: 117 EVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWEEPSC 175
EV VRL + W AQ+ + +LL +D D++++NFR A L G P GW+ P C
Sbjct: 225 EVPAGSVRLSEGTRFWDAQERMIRWLLSVDDDQMLYNFRSAAGLDVRGAGPMTGWDAPEC 284
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI-----GSGYLSAFPTE 230
L+GH GHYLS AL + LK+K++ +V+AL+ CQK + G+LSA+ +
Sbjct: 285 NLKGHTTGHYLSGLALACSVHGQPELKDKINYMVNALAECQKALEAKGCAKGFLSAYSEQ 344
Query: 231 QFDRLEALI---PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
QFD LE +WAPYYT+ KI++GL D Y A + EA + T + ++ Y R+ +
Sbjct: 345 QFDLLEVYTRYPEIWAPYYTLDKIMSGLYDCYCLAGSKEAFHLLTGLGDWIYGRLSR-LS 403
Query: 288 KYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
+ +++ W + E GGM V+ +L+ T D ++ A F + D +
Sbjct: 404 RAQLDKMWSMYIAGEFGGMISVMVRLYRETGDGRYRRAALFFRNEKLFYPMEENVDTLKD 463
Query: 347 FHSNTHIPIVIGSQMRYEVTGD-----------QLHKEGHQLESSGTNIGHFNFKSDPKR 395
H+N HIP IG+ Y+ G Q+ H+ G +G +P
Sbjct: 464 MHANQHIPQAIGALELYKAGGGKRYLAIARNFWQMVVRSHEYSIGG--VGETEMFHEPGD 521
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
+A + + ESC +YN+++++ LF + + DYYE L N +L G Y
Sbjct: 522 IAHYMTDKSAESCASYNLMRLTFGLFGLSPDSRKMDYYENVLYNHILSSASHKADGGTTY 581
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
+P+ PG KE + T ++ CC+GTG+ES + +IY E K VY+ YI
Sbjct: 582 FMPVRPGGRKE-----FNTSENT--CCHGTGLESRFRYIRNIYAAGEDKKE-VYVNLYIP 633
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN------ 569
S LD + G + K++ R+ TF+ G ++ LRIP W +
Sbjct: 634 SELDMEDGWKL---KLEEDARTQGGYRI--TFNGPKDGGERTVALRIPCWAGEDWDIRIH 688
Query: 570 -----GAKA---------TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQ 615
GA+A T Q + S G ++ + + W DD++ I+LP R
Sbjct: 689 TVHPEGAEADGLAKTDAVTEASQGFTVDSDG-YVRIRRQWMPDDRMEIRLPFRFRKLPA- 746
Query: 616 DDRPEYASIQAILYGPYVLAGHSIGD 641
P+ ++ ++ YGPY+LA + G+
Sbjct: 747 ---PDGSAYSSVAYGPYILAALNDGE 769
>gi|338209455|ref|YP_004646426.1| hypothetical protein Runsl_5734 [Runella slithyformis DSM 19594]
gi|336308918|gb|AEI52019.1| protein of unknown function DUF1680 [Runella slithyformis DSM
19594]
Length = 760
Score = 239 bits (610), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 177/614 (28%), Positives = 292/614 (47%), Gaps = 70/614 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ SL +V++ + AQ +L Y+L L+ DKL+ + A LP E YG WE S
Sbjct: 22 MQSFSLQEVKVTGGAFK-NAQDVDLRYILSLNPDKLLAPYLIDAGLPLKAERYGNWE--S 78
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--F 232
L GH GHYLSA A+M+AST N LK+++ ++ L+ CQ + G+GY+ P + +
Sbjct: 79 SGLDGHIGGHYLSALAMMYASTGNAELKKRLDYMIDQLAQCQAKNGNGYVGGIPQGKVFW 138
Query: 233 DRLE---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
+R+ L W P Y IHK+ AGL D Y + N +A ++ + ++F
Sbjct: 139 ERIYKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDSYEFGGNQQAKQVLIGLGDWF----A 194
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+I+ S ++ Q L E GGMN+ L+ +T++ K+L A L L + D
Sbjct: 195 ELIRPLSDDQIQQILRTEHGGMNEAFADLYILTKNQKYLETAQRISHRAILNPLVQKQDK 254
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ-----------LESSGTNIG-HFNFKS 391
++G H+NT IP VIG + +T + E + + G ++ HFN +
Sbjct: 255 LTGLHANTQIPKVIGFEKIAMLTENAKWSEAARYFWQNVSQTRTVAFGGNSVREHFNPTN 314
Query: 392 DPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
D +S L SN E+C ++NML++S+ LF + +Y D+YER+L N +L Q +
Sbjct: 315 D---FSSMLKSNQGPETCNSFNMLRLSKALFLDKNDPSYLDFYERTLYNHILSSQH-PQK 370
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G +Y P+ P Y + P S WCC G+G+E+ +K + IY +++
Sbjct: 371 GGFVYFTPIRPN-----HYRVYSQPETSMWCCVGSGLENHTKYSELIYSHSAND---LFV 422
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+I S L WK I + Q + PY + +LN+R P W ++
Sbjct: 423 NLFIPSTLHWKEKSIQLTQATEF-----PYKNQSEFVLKLAKSQAFTLNIRYPKW--ADD 475
Query: 571 AKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
+ +NG+ P + P N++ + + W + DKL+++ + E + P+ ++ A ++
Sbjct: 476 VEVMVNGKLYPTSAQPSNYIGIRRKWKTGDKLSVRFTTSTHLEYL----PDGSNWAAFVH 531
Query: 630 GPYVLAGH-SIGDW-----DITESATSLSDWITPIPASY-----NSQLITFTQEYGNTKF 678
GP VLA S D D + + PI +Y I+ + GN KF
Sbjct: 532 GPIVLAAKTSTADLVGLFADDSRMGHETKGKLYPIDKAYMLIGDTDTYISKVKSVGNLKF 591
Query: 679 VLTNSNQSITMEKF 692
L S+T++ F
Sbjct: 592 SL----DSLTLQPF 601
>gi|268609237|ref|ZP_06142964.1| hypothetical protein RflaF_07037 [Ruminococcus flavefaciens FD-1]
Length = 1082
Score = 239 bits (610), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 182/614 (29%), Positives = 290/614 (47%), Gaps = 76/614 (12%)
Query: 102 PGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLP 161
P F G + + S+ DV++ +D A + ++YLL D ++L+ FR+ A L
Sbjct: 27 PAVFTANAADGSRISDFSISDVKM-TDDYCTNAFEKEMKYLLSFDTERLLAGFRENAGLS 85
Query: 162 APG-EPYGGWEEPSCELRGHFVGHYLSASALMW-----ASTHNESLKEKMSAVVSALSAC 215
G + YGGWE + + GH VGHYL+A A + S ++L ++M ++ + AC
Sbjct: 86 TNGAKRYGGWENTN--IAGHCVGHYLTALAQAYQNPNVTSDQKDALYKRMKTLIDGMQAC 143
Query: 216 QK--EIGSGYLSAFPT-------EQFDRLEA-----LIPVWAPYYTIHKILAGLLDQYTY 261
Q+ G+L A P QFDR+E W P+YT+HK++AG++D Y
Sbjct: 144 QQHPRGKKGFLWAAPVPSDGNVERQFDRVEIGKANIFDDAWVPWYTMHKLIAGIVDVYNA 203
Query: 262 ADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKH 321
A A + + + ++ YNR +S + L+ E GGMND +Y L+ IT H
Sbjct: 204 TQYAPAKDVGSALGDWVYNRCSG----WSQQTRNTVLSIEYGGMNDCMYDLYRITGKDSH 259
Query: 322 LMLAHLFDKPCFLGLLALQADDI-SGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESS 380
AH+FD+ ++ D+ +G H+NT IP IG+ RY V D G ++++S
Sbjct: 260 AAAAHVFDEDALFQKVSNGGRDVLNGRHANTTIPKFIGALKRYMVL-DGKTVNGQKVDAS 318
Query: 381 ---------------------GTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRH 419
G N +F D A + N E +C +YNMLK+SR
Sbjct: 319 AYLKYAENFWDMVTTHHTYITGGNSEWEHFGKDDILDAERTNCNCE-TCNSYNMLKLSRE 377
Query: 420 LFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSF 479
LF+ T + Y D+YE + N +L Q E G+ Y P+A G K S T D F
Sbjct: 378 LFKITHDSKYMDFYENTYYNSILSSQN-PETGMTTYFQPMATGYFKVYS-----TQWDKF 431
Query: 480 WCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDP 539
WCC G+G+ESF+KLGD+IY + +Y+ Y SS ++W + + Q+ S P
Sbjct: 432 WCCTGSGMESFTKLGDTIYMHDN---DSLYVNFYQSSVINWAEKNVSITQE-----STIP 483
Query: 540 YLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDD 599
++ F+ KGS L RIP W ++NG + + V+ ++S+ D
Sbjct: 484 -DGASVKFTIKGSS-DLDLRFRIPDWIDGT-MGVSVNGTKYSYKTVNGYADVSGSFSNGD 540
Query: 600 KLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWIT-PI 658
+ + +P +R + P+ + YGP VL+ +G D+ +T + W+T P
Sbjct: 541 VIELTVPSKVRAYPL----PDSPDVYGFKYGPLVLSAE-LGKDDMKTDSTGM--WVTIPK 593
Query: 659 PASYNSQLITFTQE 672
S+ I +++
Sbjct: 594 DKKVASETIKISKQ 607
>gi|409196987|ref|ZP_11225650.1| Acetyl-CoA carboxylase, biotin carboxylase [Marinilabilia
salmonicolor JCM 21150]
Length = 788
Score = 239 bits (610), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 167/539 (30%), Positives = 263/539 (48%), Gaps = 49/539 (9%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L VRL DS A+Q N +Y+ D D+L+ F A L YG WE L G
Sbjct: 30 LSAVRL-LDSPFKHAEQLNEKYVFAHDPDRLLAPFLIDAGLEPKAPGYGNWE--GSGLNG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLE--- 236
H GHYL++ ALM AST NE +E++ ++ L+ CQ+ G+GY+ P Q E
Sbjct: 87 HIGGHYLTSLALMVASTGNEEAQERLDYMIEELARCQEANGNGYVGGIPGGQPMWAEIAK 146
Query: 237 --------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
+L W P Y IHK+ AGL D + YA +AL + + ++F +V
Sbjct: 147 GNIDAGGFSLNGKWVPLYNIHKLFAGLHDAWKYAGKEKALEILIQLTDWFI----DVNSG 202
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
S E+ + L E GG+N+V ++ IT + K+L LA + L L D ++G H
Sbjct: 203 LSDEQIQEILVSEHGGLNEVFADVYDITGEDKYLTLARQYSHRSILEPLLNHEDKLTGLH 262
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNIGHFNFKSDPKRLAS 398
+NT IP V+G E+ GD + ++ + G N H +F +S
Sbjct: 263 ANTQIPKVVGFMRVGELAGDSAWIDASDFFWNTVVSNRTITIGGNSTHEHFHP-VDDFSS 321
Query: 399 NLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
++S E+C TYNMLK+S+ L+ + ++ Y DYYE++L N +L Q E G ++Y
Sbjct: 322 MVESRQGPETCNTYNMLKLSKQLYLYKNDLRYVDYYEQALYNHILSSQH-PEHGGLVYFT 380
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
P+ P + Y + P ++FWCC G+GIE+ K G+ IY + V++ +I S
Sbjct: 381 PMRP-----QHYRVYSNPEETFWCCVGSGIENHEKYGELIYAHSDDD---VFVNLFIPSE 432
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
L+W+ + + QK + + L+V L + ++ +R P W K T+NG
Sbjct: 433 LNWEEKGLKLTQKTNFPDNEQTTLKVELP-----EARSFTIGIRYPQWMKEGEMKVTVNG 487
Query: 578 QDL-PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+ +PG + V + W D++T+ L + E + D+ P +I +GP+VLA
Sbjct: 488 KRARGGGAPGAYYQVKREWQDGDEITVNLKMHTSGEYLPDNSP----FLSIKHGPFVLA 542
>gi|436835729|ref|YP_007320945.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
gi|384067142|emb|CCH00352.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
Length = 760
Score = 239 bits (609), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 164/549 (29%), Positives = 266/549 (48%), Gaps = 55/549 (10%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ +L DV+L AQ + Y+L L+ DKL+ + A LP YG WE S
Sbjct: 22 MQPFALQDVKLTGGPFK-NAQDVDQRYILALNPDKLLAPYLIDAGLPVKAPRYGNWE--S 78
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--F 232
L GH GHYLSA A+++AST + LK+++ +V L+ CQ + G+GY+ P + +
Sbjct: 79 SGLDGHIGGHYLSALAMLYASTGDAELKKRLDYMVDQLAQCQAKNGNGYVGGIPQGKVFW 138
Query: 233 DRLE---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
+R+ L W P Y IHK+ AGL D Y YA N +A ++ + ++F
Sbjct: 139 ERIHKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDAYEYAGNQQAKQVLIGLGDWFV---- 194
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+IK S E+ Q L E GG+N+ L+ +T D K+L A L L + D
Sbjct: 195 ELIKPLSDEQIQQVLRTEHGGINETFADLYILTNDKKYLETAQRISHRAILEPLLAKQDK 254
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGD-----------QLHKEGHQLESSGTNIG-HFNFKS 391
++G H+NT IP VIG + + G Q + + G ++ HFN +
Sbjct: 255 LTGLHANTQIPKVIGFEKIAMLAGKPDWSDAATYFWQNVSQHRSVAFGGNSVREHFNPTT 314
Query: 392 DPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
D + L SN E+C ++NML++S+ LF ++ Y D+YER+L N +L Q E
Sbjct: 315 D---FSQVLRSNQGPETCNSFNMLRLSKALFLDKSDVTYLDFYERALYNHILSSQH-PEK 370
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G +Y P+ P Y + P S WCC G+GIE+ +K G+ IY +++
Sbjct: 371 GGFVYFTPIRPN-----HYRVYSQPETSMWCCVGSGIENHTKYGELIYSHSAND---LFV 422
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+I S ++W + + Q+ + PY + SLN+R P W +
Sbjct: 423 NLFIPSTVNWADKNVKLTQRTE-----FPYKNESDLVIETTKPQEFSLNIRYPKWAEN-- 475
Query: 571 AKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
+NG+ + +P +++V + W + DK+T++ + R E + P+ ++ A ++
Sbjct: 476 LVVLVNGKAQAVADAPAGYVAVARKWRAGDKVTVRFNTSTRLEQL----PDGSNWSAFVH 531
Query: 630 GPYVLAGHS 638
GP VLA +
Sbjct: 532 GPIVLAAKT 540
>gi|386820708|ref|ZP_10107924.1| putative glycosyl hydrolase of unknown function (DUF1680)
[Joostella marina DSM 19592]
gi|386425814|gb|EIJ39644.1| putative glycosyl hydrolase of unknown function (DUF1680)
[Joostella marina DSM 19592]
Length = 1018
Score = 238 bits (607), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 184/606 (30%), Positives = 281/606 (46%), Gaps = 102/606 (16%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAP--GEPYGGWEE 172
L EVSL G +S + + L + D ++ FR T P P EP G W+
Sbjct: 375 LDEVSLDVDTHGHESKFIENRDKFISTLAQTNPDAFLYMFRNTFGQPQPDAAEPLGVWDS 434
Query: 173 PSCELRGHFVGHYLSASALMWAST-HNESLK----EKMSAVVSAL--------------- 212
+LRGH GHYL+A A +AST +++SL+ +KM +V+ L
Sbjct: 435 QETKLRGHATGHYLTAIAQAYASTGYDKSLQNNFADKMEYMVNTLYKLAQMSGNPKTKDG 494
Query: 213 --SACQKEI-------------------------GSGYLSAFPTEQFDRLE-------AL 238
A E+ G G++SA+P +QF LE
Sbjct: 495 SYVANPTEVPPGPGKSNYDSDLSEDGIRTDYWNWGEGFISAYPPDQFIMLENGATYGGQQ 554
Query: 239 IPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTL 298
VWAPYYT+HKILAGLLD Y + N +AL + M + Y R+ + + I + +
Sbjct: 555 TQVWAPYYTLHKILAGLLDIYEVSGNKKALEVAEGMGSWVYARLNELPTETLISMWNRYI 614
Query: 299 NEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLG------LLALQADDISGFHSNT 351
E GGMN+V+ +L+ +T + K+L +A LFD F G LA D G H+N
Sbjct: 615 AGEFGGMNEVMARLYRLTDEEKYLQVAQLFDNIKVFYGDANHSNGLAKNVDTFRGLHANQ 674
Query: 352 HIPIVIGSQMRYEVTGDQ----------LHKEGHQLESSGTNIGHFN------FKSDPKR 395
HIP ++G+ Y + + + S G G N F S P
Sbjct: 675 HIPQIVGAIEMYRDSNTAEYYRIADNFWFKSKNDYMYSIGGVAGARNPANAECFISQPAT 734
Query: 396 LASNLDS--NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
+ N S E+C TYNMLK++R+LF + + Y DYYER L N +L P
Sbjct: 735 IYENGLSAGGQNETCATYNMLKLTRNLFLFDQRAEYMDYYERGLYNHILASVAEKTPA-N 793
Query: 454 IYLLPLAPGSSKERSYHHWGTPS-DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
Y +PL PGS K H+G P F CC GT IES +KL +SIYF+ + +Y+
Sbjct: 794 TYHVPLRPGSVK-----HFGNPDMKGFTCCNGTAIESSTKLQNSIYFKSV-ENDALYVNL 847
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
Y+ S L W ++ + QK + + ++T+ + K L +R+P W ++ G
Sbjct: 848 YVPSTLHWAEKKLTITQKT--AFPKEDFTQLTINGNGK-----FDLKVRVPNW-ATKGFI 899
Query: 573 ATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+NG++ + + PG++L++ +TW D + +++P E+I D + +I ++ YGP
Sbjct: 900 VKINGKEEKVEAIPGSYLTLNRTWKDGDTVELKMPFQFHLESIMDQQ----NIASLFYGP 955
Query: 632 YVLAGH 637
+L
Sbjct: 956 ILLVAQ 961
>gi|374313035|ref|YP_005059465.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358755045|gb|AEU38435.1| protein of unknown function DUF1680 [Granulicella mallensis
MP5ACTX8]
Length = 798
Score = 238 bits (607), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 170/552 (30%), Positives = 260/552 (47%), Gaps = 56/552 (10%)
Query: 115 LKEVSL--HDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEE 172
LK V L VRL + RAQ + +YLL L ++++ R+ A L E YGGW+
Sbjct: 32 LKAVPLPFSSVRLTGGPLK-RAQDLDAQYLLDLQPERMLARLRQRANLAPKAEGYGGWDG 90
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF----- 227
+L GH GHYLSA ++M+A+T + K + V+ L Q G GY+ A
Sbjct: 91 DGRQLTGHIAGHYLSAISMMYATTGDVRFKNRADDFVTELQNIQNAQGDGYIGALLDAKG 150
Query: 228 --PTEQFDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
+F L L +W+P+Y HK+ AGL D Y N +AL +
Sbjct: 151 VDGKVRFQDLSKGEIHSGGFDLNGLWSPWYVEHKLFAGLRDAYHLTGNRKALDVEI---- 206
Query: 277 YFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGL 336
F + ++ S E+ + L E GGMN+VL L+ T DP+ L L+ F+ +
Sbjct: 207 KFAGWAETIVGHLSDEQLQRMLATEFGGMNEVLADLYADTNDPRWLKLSDKFEHHAIVDP 266
Query: 337 LALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLH-----------KEGHQLESSGTNIG 385
L+ D ++G H+NT IP +IG RY TGD+ E H + G G
Sbjct: 267 LSRGQDILAGKHANTQIPKMIGELARYVYTGDETDGKAAMFFFDEVSEHHSFATGGD--G 324
Query: 386 HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ 445
+ P ++ +D T ESC YNM+K++R LF + YAD+ ER+ N +LG Q
Sbjct: 325 KNEYFGQPDKMNDMIDGRTAESCAAYNMIKMARDLFSLDPQARYADFIERADLNAILGGQ 384
Query: 446 RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
E G + Y++P+ G H + +SF CC G+ +E+ + IY E K
Sbjct: 385 D-PEDGRVSYMVPVGRGVQ-----HEYQDKFESFTCCVGSQMETHAFHAYGIYSESGNK- 437
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
+++ QY + +DW S + + + + L++T G ++ LR P W
Sbjct: 438 --LWVSQYDPTTVDWASQGMKLEMVTNLPMGDSAALKIT-----SGKTKVFTIALRRPYW 490
Query: 566 TSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
+ G +NG+ L S P ++ + + W D + I LP TLR EA+ P+ +
Sbjct: 491 VGA-GFSVKVNGETLQNTSTPDTYIEINRKWKVGDTVEIVLPKTLRKEAL----PDNPNR 545
Query: 625 QAILYGPYVLAG 636
AI++GP VLAG
Sbjct: 546 MAIMWGPLVLAG 557
>gi|332185536|ref|ZP_08387284.1| tat (twin-arginine translocation) pathway signal sequence domain
protein [Sphingomonas sp. S17]
gi|332014514|gb|EGI56571.1| tat (twin-arginine translocation) pathway signal sequence domain
protein [Sphingomonas sp. S17]
Length = 639
Score = 238 bits (606), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 177/550 (32%), Positives = 261/550 (47%), Gaps = 63/550 (11%)
Query: 115 LKEVSLHDVRL-GSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWE-E 172
++ + DV L G +H AQ+ YL+ L D+L+ NFR A L YGGWE E
Sbjct: 42 VQPFDMADVTLDGGPFLH--AQRMTEAYLMRLQPDRLLANFRANAGLKPKAPAYGGWESE 99
Query: 173 P---SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP- 228
P GH +GHYLSA AL + +T ++ ++++ + + L+ACQK GSG + AFP
Sbjct: 100 PEWADINCHGHTLGHYLSACALAYRATKDKRYRQRIDYIANELAACQKASGSGLVCAFPK 159
Query: 229 ----TEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYN 280
R E + V P+YT+HK+ AGL D AD+ + R+ W V
Sbjct: 160 GPALVAAHLRGEPITGV--PWYTLHKVYAGLRDSVQLADSEPSRGVLFRLADWGVV---- 213
Query: 281 RVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQ 340
K S E+ + L E GGMN++ L+ +T + + +A F + + LA
Sbjct: 214 ----ATKPLSDEQFEKMLETEYGGMNEIYADLYFMTGNEDYRRVAERFSQKAIMNPLAQG 269
Query: 341 ADDISGFHSNTHIPIVIGSQMRYEVTGDQ-------------LHKEGHQLESSGTNIGHF 387
D + G H+NT IP +IG Q +E TGD H G + HF
Sbjct: 270 RDYLDGMHANTQIPKIIGFQRVFEATGDDKYHNAAAFFWRTVAHTRAFATGGHG-DAEHF 328
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
+D + + E+C +NMLK++R LF YADYYER+L NG+L Q
Sbjct: 329 FAMADFDKHV--FSAKGSETCCQHNMLKLTRALFLRDPRAEYADYYERTLYNGILASQ-D 385
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
+ G+ Y PG K YH TP DSFWCC GTG+E+ K DSIYF ++
Sbjct: 386 PDSGMATYFQGARPGYMK--LYH---TPEDSFWCCTGTGMENHVKYRDSIYFHDDR---A 437
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+Y+ +I S + W V+ Q + + R L ++ +L LR P W+
Sbjct: 438 LYVNLFIPSTVTWADKGAVLTQATTFPDAANTQFRWKLRQPTE-----LTLKLRHPKWSP 492
Query: 568 SNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
+ A +NG ++ PG++ +T+TW + D + ++L + E + P I A
Sbjct: 493 T--ATLLVNGAEVSHSDKPGSYAELTRTWKTGDTVEMRLVM----EPAVESAPAAPEIVA 546
Query: 627 ILYGPYVLAG 636
YGP VLAG
Sbjct: 547 FTYGPLVLAG 556
>gi|302897238|ref|XP_003047498.1| hypothetical protein NECHADRAFT_97856 [Nectria haematococca mpVI
77-13-4]
gi|256728428|gb|EEU41785.1| hypothetical protein NECHADRAFT_97856 [Nectria haematococca mpVI
77-13-4]
Length = 626
Score = 237 bits (605), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 168/526 (31%), Positives = 255/526 (48%), Gaps = 52/526 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEP 173
L EV+L D R + Q L YLL +D D+L++ FR L G + GGW+ P
Sbjct: 42 LSEVTLTDSRWMDN------QNRTLTYLLSVDPDRLLYVFRANHGLDTKGAQKNGGWDAP 95
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-----IGSGYLSAFP 228
R H GH+L+A + +A+ NE + + L CQ GYLS FP
Sbjct: 96 DFPFRSHIQGHFLTAWSQCYATLRNEECGSRATYFAKELGKCQANNEKANFTEGYLSGFP 155
Query: 229 TEQFDRLE--ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVI 286
+ +E L PYY IHK LAGLLD + + +A + + + R
Sbjct: 156 ESEITAVEKRTLNNGNVPYYAIHKTLAGLLDVHRLVGDEDAKDVMLALAGWVDTRT---- 211
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
KK + ++ + E GGMN+VL + D K L +A FD L D +SG
Sbjct: 212 KKLTYDQMQAMMQTEFGGMNEVLADIAYYIGDKKWLEVAQRFDHATIFDPLEKGQDKLSG 271
Query: 347 FHSNTHIPIVIGSQMRYEVTGDQ-------------LHKEGHQLESSGTNIGHFNFKSDP 393
H+NT +P IG+ Y+V+G Q +HK + + G N +F++ P
Sbjct: 272 LHANTQVPKWIGAIREYKVSGLQKYLDIGRNAWDLTVHKHTYAI---GGNSQAEHFRA-P 327
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWT-KEIAYADYYERSLTNGVLGIQRGTE-PG 451
+A LD++T E+C TYNMLK++R L+ + ++ D+YE +L N +LG Q + G
Sbjct: 328 DAIAEYLDNDTCEACNTYNMLKLTRELWVMDPSDASFFDFYENALMNHLLGQQNPEDHHG 387
Query: 452 VMIYLLPLAPGSSK----ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
+ Y PL PG + W T DSFWCC G+GIE+ +KL DSIYF ++
Sbjct: 388 HITYFTPLNPGGRRGVGPAWGGGTWSTDYDSFWCCQGSGIETNTKLMDSIYFHDD---ET 444
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+Y+ + S+LDW +I + Q D + TL ++G ++ +R+P+WTS
Sbjct: 445 LYVNLFTPSQLDWSDRKISITQSTD----FPERDTTTLKVGNQGENNEWTMAIRVPSWTS 500
Query: 568 SNGAKATLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRT 611
A +NG+ + G + + + WSS D +T+ LP++LRT
Sbjct: 501 K--ASIKINGEAVEGVDIESGKYAIIKRKWSSGDAVTVTLPMSLRT 544
>gi|317476834|ref|ZP_07936077.1| hypothetical protein HMPREF1016_03061 [Bacteroides eggerthii
1_2_48FAA]
gi|316907009|gb|EFV28720.1| hypothetical protein HMPREF1016_03061 [Bacteroides eggerthii
1_2_48FAA]
Length = 781
Score = 237 bits (605), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 168/543 (30%), Positives = 270/543 (49%), Gaps = 59/543 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L D++L +S +AQQT+L Y++ ++ D+L+ F + A L Y WE + L G
Sbjct: 30 LQDIKL-LESPFLQAQQTDLHYIMAMNPDRLLAPFLREAGLAPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP---------TE 230
H GHY+SA ++M+A+T + ++ +++ +++ L Q+ +G+G++ P E
Sbjct: 87 HIGGHYISALSMMYAATGDTTVYNRLNYMLNELHRAQQAVGNGFIGGTPGSLQLWKEIKE 146
Query: 231 QFDRLEA--LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
R E+ L W P Y IHK AGL D Y YA + A +M T WM
Sbjct: 147 GNIRPESFSLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIALTDWMA--------G 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ + ++ L E GG+N++ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLTEQQMQDMLRSEHGGLNEIFADVADITGDKKYLELARRFSHKTLLEPLIGGEDHL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNIGHFNFKSDPK 394
+G H+NT IP VIG + ++T + + H+ G N +F
Sbjct: 259 TGMHANTQIPKVIGYKRIADLTQNDAWDQAARFFWNTVVNHRSVCIGGNSVREHFHPADN 318
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI 454
+ D E+C TYNML++++ LF+ + +I +ADYYER+L N +L Q+ + G +
Sbjct: 319 FTSMLNDVQGPETCNTYNMLRLTKMLFQTSPDIRFADYYERALYNHILASQQPAKGG-FV 377
Query: 455 YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
Y P+ G Y + P S WCC G+G+E+ +K G+ IY E +Y+ +I
Sbjct: 378 YFTPMRSG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYAHAEDT---LYVNLFI 429
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
SRL WK ++ + Q + + +R + S+K T SL R P+W + GA +
Sbjct: 430 PSRLTWKEQKLTLVQ--ESRFPDEAQIRFRIEKSNKK---TFSLKFRYPSW--AKGASVS 482
Query: 575 LNG--QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
+NG QD+ PG +L+V + W + D++T+ LP+ + E I D Y A +YGP
Sbjct: 483 VNGKVQDIN-AQPGEYLTVRRKWKAGDEITLNLPMQVTLEQIPDQEHFY----AFMYGPI 537
Query: 633 VLA 635
VLA
Sbjct: 538 VLA 540
>gi|291544094|emb|CBL17203.1| Uncharacterized protein conserved in bacteria [Ruminococcus
champanellensis 18P13]
Length = 1075
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 193/667 (28%), Positives = 313/667 (46%), Gaps = 85/667 (12%)
Query: 89 LFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVD 148
+ S AML I + +++ SL D+ + +D+ A +EYLL D D
Sbjct: 10 MLSVAMLAGSITQLPAATTASAADIAIEDFSLADLTM-TDAYTVNAFSKEVEYLLSFDTD 68
Query: 149 KLVWNFRKTARLPAPG-EPYGGWEEPSCELRGHFVGHYLSASALMW-----ASTHNESLK 202
+L+ FR+ A+L G + Y GWE + + GH VGHYL+A A + + +L+
Sbjct: 69 RLLCGFRENAKLDTKGAKRYAGWE--NTLIAGHSVGHYLTAVAQAYQNPTLTAAQRSALE 126
Query: 203 EKMSAVVSALSACQKEIGS--GYLSAFPTE-------QFDRLEA-----LIPVWAPYYTI 248
K+ A++ + CQ+ G+L A + QFD +E + W P+YT+
Sbjct: 127 GKIKALLDGMRVCQQNSKGKPGFLWAGQIKNANNVEVQFDLVEQGKTNIINESWVPWYTM 186
Query: 249 HKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDV 308
HKI+ GL+D Y N A + + + ++ YNR K+S + H L+ E GGMND
Sbjct: 187 HKIVQGLVDVYNATGNETAKTIASDLGDWTYNRAS----KWSAQTHNTVLSIEYGGMNDC 242
Query: 309 LYKLFCITQDPKHLMLAHLFDKPCF-LGLLALQADDISGFHSNTHIPIVIGSQMRY---- 363
LY+L+ IT H + AH FD+ +L + ++ H+NT IP IG+ RY
Sbjct: 243 LYELYEITGKDTHAVAAHYFDETNLHEAVLKGGRNVLTNKHANTTIPKFIGALKRYIVLD 302
Query: 364 --EVTGDQLHKE--------------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEES 407
V G+++ H +G N +F D + N E +
Sbjct: 303 GKTVNGEKIDASRYLEYAEAFWDMVTTHHTYITGGNSEWEHFGEDDILDKERTNCNCE-T 361
Query: 408 CTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKER 467
C +YNMLK+SR LF+ T + Y D+YE + N +L Q E G+ Y P+A G K
Sbjct: 362 CNSYNMLKLSRELFKITGDRKYMDFYEGTYYNSILSSQN-PESGMTTYFQPMATGYFKVY 420
Query: 468 SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVV 527
S +P DSFWCC G+G+ESF+KLGD++Y +Y+ Y SS L+W+ ++ +
Sbjct: 421 S-----SPYDSFWCCTGSGMESFTKLGDTMYMHSGNT---LYVNMYQSSVLNWEDQKVKI 472
Query: 528 NQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGN 587
Q + S T F+ GSG + RIP+W + A +NG + +
Sbjct: 473 TQDSNIPES------DTAKFTIDGSG-SLDFRFRIPSWKAGKMTIA-VNGTKYTYKTVND 524
Query: 588 FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITES 647
+ VT + + D +++ +P E + + P+ ++ YGP VL+ +G ++ +S
Sbjct: 525 YAQVTGDFKTGDVISVTIP----AEVVAYNLPDNKAVYGFKYGPVVLSAE-LGTENMEKS 579
Query: 648 ATSLSDWIT-PIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFR 706
+T + W+T P +SQ IT ++E + + N + +K + +
Sbjct: 580 STGM--WVTIPKDPIGSSQNITISKEGQSVTSFMAEINDHLVKDK-----------NSLK 626
Query: 707 LILNDSS 713
LND+S
Sbjct: 627 FTLNDTS 633
>gi|406027774|ref|YP_006726606.1| hypothetical protein LBUCD034_2040 [Lactobacillus buchneri CD034]
gi|405126263|gb|AFS01024.1| hypothetical protein LBUCD034_2040 [Lactobacillus buchneri CD034]
Length = 803
Score = 236 bits (603), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 191/583 (32%), Positives = 272/583 (46%), Gaps = 101/583 (17%)
Query: 127 SDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEPS-CELRGHFVGH 184
SD RAQQ ++YLL LD + + F + A + + G Y GWE RGHF GH
Sbjct: 13 SDPEIARAQQMTVKYLLALDPKRFLVTFDQVAGIDSGGVTGYQGWERTDGLNFRGHFFGH 72
Query: 185 YLSASALMWASTHNESLKE----KMSAVVSALSACQKEIG------SGYLSAFPTEQFDR 234
YLSA + +T + ++++ K+ V+ L + Q +GY+SAF D
Sbjct: 73 YLSALSQAILATEDNAIRQQLLDKLRLGVNGLQSAQAAYAKKHPESAGYVSAFREVALDE 132
Query: 235 LEAL-IP------VWAPYYTIHKILAGLLDQYTYADNAE------ALRMTTWMVEYFYNR 281
+E +P V P+Y +HK+LAGLL N + AL+ Y + R
Sbjct: 133 VEGREVPKDEKENVLVPWYNLHKVLAGLLAVNVNLQNIDPLLSEKALKSAHQFGLYVFKR 192
Query: 282 VQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQA 341
+ + Q L E GGMND LY+LF +T D + L A FD+ LA
Sbjct: 193 INQLADPT------QMLKIEYGGMNDALYELFDLTDDKRMLTAATYFDETTLFKQLAKGD 246
Query: 342 DDISGFHSNTHIPIVIGSQMRYEVTGD---------------------------QLHKEG 374
D ++G H+NT IP +IG+ RYE D Q+ +
Sbjct: 247 DVLAGKHANTTIPKLIGALHRYESLHDVKRADQYLSPEEKGSLNMYLKAAVNFWQIVIDD 306
Query: 375 HQLESSGTNIG-HFNFKSDPKRLASNL----DSNTEESCTTYNMLKVSRHLFRWTKEIAY 429
H + G + HF+ +P +L + + T E+C TYNMLK+SR LFR T + Y
Sbjct: 307 HTYVTGGNSQSEHFH---EPGQLFHDAVLEDGATTCETCNTYNMLKLSRELFRVTGDKKY 363
Query: 430 ADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIES 489
DYYE++ TN +LG Q G+M Y P+A G +K + P D FWCC GTGIES
Sbjct: 364 LDYYEQTYTNAILGSQ-NPNTGMMTYFQPMAAGYTKV-----YNRPFDEFWCCTGTGIES 417
Query: 490 FSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLT--- 546
F+KLGDS YF + +Y+ Y S+ L S + + ++VD +V LT
Sbjct: 418 FTKLGDSYYFRSGDQ---LYLSLYFSNVLRLDSRNLQMTEQVDRKAG-----KVHLTVVK 469
Query: 547 FSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDK---LTI 603
S+ S T +L LR P W + AK ++G + +F W D+ T+
Sbjct: 470 IRSQDSAGTINLKLRNPAWLVQS-AKLAVDGISQQMDQNADF------WEIDNAGPGTTV 522
Query: 604 QLPLTLRTEAIQ-DDRPEYASIQAILYGPYVLAG----HSIGD 641
L + + E +Q D P Y + + YGPYVLAG HSI D
Sbjct: 523 DLEMPMSLEMVQTKDNPHYLAFK---YGPYVLAGQLGKHSIND 562
>gi|218129947|ref|ZP_03458751.1| hypothetical protein BACEGG_01530 [Bacteroides eggerthii DSM 20697]
gi|217988057|gb|EEC54382.1| hypothetical protein BACEGG_01530 [Bacteroides eggerthii DSM 20697]
Length = 781
Score = 236 bits (603), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 168/543 (30%), Positives = 270/543 (49%), Gaps = 59/543 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L D++L +S +AQQT+L Y++ ++ D+L+ F + A L Y WE + L G
Sbjct: 30 LQDIKL-LESPFLQAQQTDLYYIMAMNPDRLLAPFLREAGLAPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP---------TE 230
H GHY+SA ++M+A+T + ++ +++ +++ L Q+ +G+G++ P E
Sbjct: 87 HIGGHYISALSMMYAATGDTTVYNRLNYMLNELHRAQQAVGNGFIGGTPGSLQLWKEIKE 146
Query: 231 QFDRLEA--LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
R E+ L W P Y IHK AGL D Y YA + A +M T WM
Sbjct: 147 GSIRPESFSLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIALTDWMA--------G 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ + ++ L E GG+N++ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLTEQQMQDMLRSEHGGLNEIFADVADITGDKKYLELARRFSHKTLLEPLIGGEDHL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNIGHFNFKSDPK 394
+G H+NT IP VIG + ++T + + H+ G N +F
Sbjct: 259 TGMHANTQIPKVIGYKRIADLTQNDAWDQAARFFWNTVVNHRSVCIGGNSVREHFHPADN 318
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI 454
+ D E+C TYNML++++ LF+ + +I +ADYYER+L N +L Q+ + G +
Sbjct: 319 FTSMLNDVQGPETCNTYNMLRLTKMLFQTSPDIRFADYYERALYNHILASQQPAKGG-FV 377
Query: 455 YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
Y P+ G Y + P S WCC G+G+E+ +K G+ IY E +Y+ +I
Sbjct: 378 YFTPMRSG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYAHAEDT---LYVNLFI 429
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
SRL WK ++ + Q + + +R + S+K T SL R P+W + GA +
Sbjct: 430 PSRLTWKEQKLTLVQ--ESRFPDEAQIRFRIEKSNKK---TFSLKFRYPSW--AKGASVS 482
Query: 575 LNG--QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
+NG QD+ PG +L+V + W + D++T+ LP+ + E I D Y A +YGP
Sbjct: 483 VNGKVQDIN-AQPGEYLTVRRKWKAGDEITLNLPMQVTLEQIPDQEHFY----AFMYGPI 537
Query: 633 VLA 635
VLA
Sbjct: 538 VLA 540
>gi|315499577|ref|YP_004088380.1| hypothetical protein Astex_2584 [Asticcacaulis excentricus CB 48]
gi|315417589|gb|ADU14229.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
48]
Length = 791
Score = 236 bits (603), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 174/581 (29%), Positives = 279/581 (48%), Gaps = 58/581 (9%)
Query: 85 EQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLM 144
+D L S A L I G+ + + + + + L DVRL A N YLL
Sbjct: 9 RRDTLTSTAALLAGISVSGRAGAND-TYDSVTSLPLSDVRLLPSPFK-TAVDVNEAYLLS 66
Query: 145 LDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEK 204
++ D+L+ N+RK A L E YGGWE + + GH +GHYLSA +LM A T N +LK +
Sbjct: 67 VNPDRLLHNYRKFAGLTPKAELYGGWERDT--IAGHSLGHYLSAISLMHAQTGNAALKLR 124
Query: 205 MSAVVSALSACQKEIGSGYLSAFP-----------TEQFDRLEA---------LIPVWAP 244
+ ++ L+ Q G GY++ F E F L A L W P
Sbjct: 125 AAYIIDELALVQGAHGDGYVAGFTRKRKDGRVVDGKEIFPELMAGDIRSAGFDLNGCWVP 184
Query: 245 YYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGG 304
Y HK+ +GL D T+ +AL + + Y + V + + ++ LN E GG
Sbjct: 185 LYNWHKLYSGLFDAQTFCGYDKALTVAVGLGVY----IDKVFRALTDDQVQTVLNCEFGG 240
Query: 305 MNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYE 364
+ND +L+ T++P+ L LA + L D ++ H+NT +P ++G +E
Sbjct: 241 LNDSFAELYRRTENPRWLALAQRLHHKRIIDPLTAGEDKLANNHANTQVPKLLGEATLFE 300
Query: 365 VTGDQLHKEG----------HQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNML 414
VTG++ +++ H G N F +P ++ ++ T E C TYNML
Sbjct: 301 VTGNENNRKAASFFWERVVNHHSYVIGGNADREYF-FEPDTISKHITEATCEHCNTYNML 359
Query: 415 KVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGT 474
K++RHL+ W + Y DY+ER+ N VL Q+ + G+ Y+ PL G+++ S
Sbjct: 360 KLTRHLYGWEPDARYFDYFERAHFNHVLA-QQNPKTGMFSYMTPLFTGAARGFS-----D 413
Query: 475 PSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPV 534
P D++ CC+G+G+ES +K G+SI+++ +++ YI + W + + ++D
Sbjct: 414 PVDNWTCCHGSGMESHAKHGESIFWQSSDT---LFVNLYIPATARWATKG--AHLRLDTG 468
Query: 535 VSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKT 594
+D + + SS L LR+P W A TLN + + G +L + +
Sbjct: 469 YPYDG--NIVFSLSSLRRPTKFKLALRVPAWAKR--ADLTLNNKPVKATRDGGYLVIDRA 524
Query: 595 WSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
W+ D + + LPL LR EA +DD + A+L GP VLA
Sbjct: 525 WAVGDTVRLSLPLDLRFEATRDD----GKVVAVLRGPLVLA 561
>gi|251798256|ref|YP_003012987.1| hypothetical protein Pjdr2_4277 [Paenibacillus sp. JDR-2]
gi|247545882|gb|ACT02901.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 605
Score = 236 bits (602), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 174/567 (30%), Positives = 266/567 (46%), Gaps = 71/567 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +VRL D R + Y+ D+++L+ F+ A + + EP GGWE P C LRG
Sbjct: 7 LDEVRLTDDVFASRREHAKT-YIREFDLERLMHTFKINAGISSTAEPLGGWEAPDCGLRG 65
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD--RLEA 237
HFVGHYLSA A H+ +LK +V + AC + SGYLSAF E+ D LE
Sbjct: 66 HFVGHYLSACAKFAYGDHDGTLKTMADEIVDVMQACAQP--SGYLSAFEEEKLDVLELEE 123
Query: 238 LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT 297
VWAPYYT+HKI+ GL+D Y Y N +AL + + Y R + + HW+
Sbjct: 124 NRDVWAPYYTLHKIMQGLIDCYVYLQNTQALELAVNLAHYIRRRFEYL-------SHWKI 176
Query: 298 --------LN--EEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
LN E GG+ D LY L+ +T D L LAHLFD+ +L LA D +
Sbjct: 177 DGILRCTKLNPVNEFGGLGDSLYTLYELTGDAALLGLAHLFDRDYWLWPLAEGRDVLEDL 236
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEGH---------QLESSGTNI--------GHFNFK 390
H+NTH+P+++ RY++ + +K+ + ++G N G + K
Sbjct: 237 HANTHLPMILACMHRYKIREEDSYKKSALHFYDFLMGRTFANGNNSSKATAFIQGGVSEK 296
Query: 391 SDP----KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
++ LA L ESC +N K+ L W+ EI Y D+ E N +L
Sbjct: 297 AEHWGGYGELADALTGGESESCCAHNTEKIVERLLEWSPEIGYLDHLESLKYNAILN-SA 355
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
+ G+ Y PL + K+ S P SFWCC G+GIE+ S+L +I+F
Sbjct: 356 SAKTGLSQYHQPLGTNAVKKFS-----EPYHSFWCCTGSGIEAMSELQKNIWFRNGN--- 407
Query: 507 GVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
+ + ++SS+ WK IV++Q+ S+ L L F + + LR+ +
Sbjct: 408 AILLNAFVSSKAAWKERGIVIHQR----TSFPDSLISALHFETD-----EPVELRM-MFK 457
Query: 567 SSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
N + + L ++ V + + + D++ I++ +LR + E A
Sbjct: 458 EKAIKNIRFNDEGIHLQKEEGYIVVERLFRNGDRMDIEIEASLRLIPLPGSEAE----SA 513
Query: 627 ILYGPYVLAGHSIGDWDITESATSLSD 653
+LYG +LA +GD + T +SD
Sbjct: 514 LLYGNVLLA--RVGD---EQPLTGISD 535
>gi|334144880|ref|YP_004538089.1| acetyl-CoA carboxylase, biotin carboxylase [Novosphingobium sp.
PP1Y]
gi|333936763|emb|CCA90122.1| acetyl-CoA carboxylase, biotin carboxylase [Novosphingobium sp.
PP1Y]
Length = 651
Score = 236 bits (601), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 178/553 (32%), Positives = 259/553 (46%), Gaps = 69/553 (12%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEE-- 172
L+ L DV L + AQ+ YLL L D+L+ NFR A L YGGWE
Sbjct: 50 LEPFDLSDVTL-EEGPFLHAQRLTEAYLLRLQPDRLLHNFRVNAGLAPRAAVYGGWESDE 108
Query: 173 --PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE 230
GH +GHYLSA AL + ST++ K+++ + + L+ACQK GSG + AFP
Sbjct: 109 IWADINCHGHTLGHYLSACALAFRSTNDRRFKQRVDYIANELAACQKATGSGLVCAFPDG 168
Query: 231 --------QFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYF 278
+ D++ + P+YT+HK+ AGL D AD+ + +R+ W V
Sbjct: 169 PALLTAHLRGDKITGV-----PWYTLHKVYAGLRDGALLADSTVSREVLIRLADWGV--- 220
Query: 279 YNRVQNVIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLL 337
V + + ++T L E GGMN+V L+ +T + + L+ F + L
Sbjct: 221 ------VATRPLTDGQFETMLATEHGGMNEVYADLYAMTGNEDYRELSQRFSHKAVMDPL 274
Query: 338 ALQADDISGFHSNTHIPIVIGSQMRYEVTGDQ-------------LHKEGHQLESSGTNI 384
D + G H+NT +P ++G Q YE+TGD H G N
Sbjct: 275 VQGRDLLDGMHANTQVPKIVGFQRVYEITGDDRYAQAANFFFRTVAHTRSFATGGHGDN- 333
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
HF +D R + E+C +NMLK++R LF YADYYER+L NG+L
Sbjct: 334 EHFFAMADFDRHV--FSAKGSETCCQHNMLKLARLLFMQDPNADYADYYERTLYNGILAS 391
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q + G++ Y PG K YH TP SFWCC GTG+E+ K DSIYF +E
Sbjct: 392 Q-DPDSGMVTYFQGARPGYMK--LYH---TPEHSFWCCTGTGMENHVKYRDSIYFHDERS 445
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
+Y+ ++ S + WK + Q+ L+ L +K +L LR P
Sbjct: 446 ---LYVNLFVPSSVAWKEKGAELIQRTAFPEKPTTGLQWKLRAPAK-----IALQLRHPR 497
Query: 565 WTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYAS 623
W S A +NGQ++ + G+++ V +TW D++ +QL + E + P
Sbjct: 498 W--SRTAVVRVNGQEVARSATAGSYVEVARTWKDGDRVELQLEM----EPTVESAPAAPD 551
Query: 624 IQAILYGPYVLAG 636
I A YGP VLAG
Sbjct: 552 IVAFTYGPIVLAG 564
>gi|346226219|ref|ZP_08847361.1| Acetyl-CoA carboxylase, biotin carboxylase [Anaerophaga
thermohalophila DSM 12881]
Length = 795
Score = 236 bits (601), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 164/529 (31%), Positives = 260/529 (49%), Gaps = 56/529 (10%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A+ N +Y++ D D+L+ F A L YG WE S L GHF GHYL++ +LM
Sbjct: 49 AEALNEQYVMAHDPDRLLAPFLIDAGLEPKAPGYGNWE--SSGLNGHFGGHYLTSLSLMI 106
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLE-----------ALIPVW 242
AST NE +E+++ ++ L+ CQ+ G+GY+ P Q E +L W
Sbjct: 107 ASTGNEEARERLNYMIDELARCQEANGNGYVGGVPGGQDMWAEIAKGNIDAGNFSLNGKW 166
Query: 243 APYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHWQTL 298
P Y IHK+ AGL D + YA N +A +++T W ++ + I++ + H
Sbjct: 167 VPLYNIHKLYAGLRDAWLYAGNEKAREILIKLTDWCIDLTAALSDDQIQEMLVSEH---- 222
Query: 299 NEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIG 358
GG+N+V ++ IT D K+L LA F L L D ++G H+NT IP VIG
Sbjct: 223 ----GGLNEVFADVYDITGDEKYLELARRFSHREILEPLLQHEDRLTGLHANTQIPKVIG 278
Query: 359 SQMRYEVTGDQLHKEG----------HQLESSGTNIGHFNFKSDPKRLASNLDSNT-EES 407
E+T D + ++ + G N H +F +S ++S E+
Sbjct: 279 YMRIAELTHDSAWIDASDFFWNTVVNNRTITIGGNSTHEHFHP-VDDFSSMIESRQGPET 337
Query: 408 CTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKER 467
C TYNMLK+S+HLF + ++ Y DYYE++L N +L Q G ++Y P+ P R
Sbjct: 338 CNTYNMLKLSKHLFLYKNDLKYIDYYEQALYNHILSSQHPGHGG-LVYFTPMRP-----R 391
Query: 468 SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVV 527
Y + P ++FWCC G+GIE+ K G+ IY ++ V++ +I S L+WK + +
Sbjct: 392 HYRVYSNPEETFWCCVGSGIENHEKYGELIYAHDD---EDVFVNLFIPSELNWKEKGLKL 448
Query: 528 NQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PG 586
QK + LRV L S + + +R P W + + T+NG + + G
Sbjct: 449 VQKNNFPDIEKSTLRVELDESDE-----FIVGIRCPAWANPGEMEVTVNGNSVNGEAVSG 503
Query: 587 NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+ V++ W D + + LP+ + + D P Y S +++GP+VL
Sbjct: 504 QYFLVSRKWDDGDVIEVHLPMHTFGKYLPDKSP-YLS---LMHGPFVLG 548
>gi|302340651|ref|YP_003805857.1| hypothetical protein Spirs_4187 [Spirochaeta smaragdinae DSM 11293]
gi|301637836|gb|ADK83263.1| protein of unknown function DUF1680 [Spirochaeta smaragdinae DSM
11293]
Length = 764
Score = 236 bits (601), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 163/537 (30%), Positives = 259/537 (48%), Gaps = 47/537 (8%)
Query: 123 VRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWEEPSCELRGHF 181
V L S+ Q +++L+ D D++++NFR A + G P GW+ PSC LRGH
Sbjct: 196 VMLKEGSVFCDEQDKMIQHLIDTDDDQMLYNFRVAAGVDTRGALPMTGWDAPSCNLRGHT 255
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI-----GSGYLSAFPTEQFDRLE 236
GHYLS+ AL W+ T L +K+ ++ +LS CQ + G+LSA+ QFD LE
Sbjct: 256 TGHYLSSLALGWSVTKKTELMDKIVYLIESLSECQNALEERGCSKGFLSAYSERQFDLLE 315
Query: 237 ALIP---VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIER 293
P +WAPYYT+ KI++GL D Y+ AD++ AL + M ++ Y R+ + + +++
Sbjct: 316 TYTPYPTIWAPYYTLDKIMSGLYDCYSLADSSLALNILCKMGDWVYERLSR-LSRNQLDK 374
Query: 294 HWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTH 352
W + E GGM V+ KL+ +T+ +L A+ FD + D + H+N H
Sbjct: 375 MWSMYIAGEFGGMISVMVKLYTLTKKKTYLQTAYYFDNEKLFYPMQENIDTLKDMHANQH 434
Query: 353 IPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDS 402
IP ++G+ YE G + + + S G IG +P + + +
Sbjct: 435 IPQIMGAVELYEADGSGRYYDIAKNFWNIVTASHVYSIG-GIGETEMFHEPNEIMTYITD 493
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
T ESC +YN+L+++ LF E D+YE L N +L G Y +PL PG
Sbjct: 494 KTAESCASYNILRLTGQLFALEPERRKMDFYETVLYNHILSSFSHKSDGGTTYFMPLRPG 553
Query: 463 SSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
KE + T ++ CC+G+G+E+ + IY + +YI YI S ++W++
Sbjct: 554 GHKE-----FNTKENT--CCHGSGLETRFRYVQDIY---ACNHDTLYINLYIPSAVEWEN 603
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD-LP 581
+I D T F SG +L RIP W + + K T+N Q+ +
Sbjct: 604 FRIEQTTASDAA--------GTFIFLIHSSGW-RNLAFRIPHW-AEDEYKVTINNQESVE 653
Query: 582 LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
+ + + + W D++ I P R + D +P YA + YGPY+LA S
Sbjct: 654 EMAQDGYFYLHRDWREGDRIEILTPYHFRKLPVPDGKP-YA---CMAYGPYILAALS 706
>gi|404451488|ref|ZP_11016452.1| hypothetical protein A33Q_19258 [Indibacter alkaliphilus LW1]
gi|403762834|gb|EJZ23856.1| hypothetical protein A33Q_19258 [Indibacter alkaliphilus LW1]
Length = 1019
Score = 234 bits (597), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 215/742 (28%), Positives = 329/742 (44%), Gaps = 131/742 (17%)
Query: 3 KWMCSIGFFKFLLTFLLIVSAAQAKECTNAYPELAS---HTFRSNLLSSKNESYIKQIHS 59
+ + SI F F I + + ++ YPE + + F SN+ K E+ + +
Sbjct: 245 RQVASIYFNAFRDVNQNIAHSKKVEDDLPDYPEDEAKLYNVFLSNVEDIKVETEVGSLPR 304
Query: 60 HNDHLTPSDDSAWLSLMPRKILREEEQDELFSWAMLYR-KIKNPG--------------- 103
H+ S + R I + +EL S LY K K PG
Sbjct: 305 LPSHVKGSYVDDLNGPLVRVIWPAPKDNELVSKVGLYTVKGKVPGTDFEPVATVSVKAKT 364
Query: 104 QFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQ--QTNLEYLLML---DVDKLVWNFRKTA 158
P++ E K LH + L D + + + ++LL L D + ++ FR
Sbjct: 365 NSSPPQQKLELFK---LHQINLEEDQTGQKTKFIENRDKFLLTLAETDPNSFLYMFRHAF 421
Query: 159 RLPAP--GEPYGGWEEPSCELRGHFVGHYLSASALMWAST-HNESLKE----KMSAVVSA 211
P P P G W+ +LRGH GHYL+A A +AST ++E L++ KM +V+
Sbjct: 422 DQPQPENAVPLGVWDSQETKLRGHATGHYLTAIAQAYASTGYDEVLQQNFLDKMDYMVNV 481
Query: 212 LSACQK----------------------------------------EIGSGYLSAFPTEQ 231
L K G GY+SA+P +Q
Sbjct: 482 LYDLSKLSGNKVNGKGNEDPVLVPKGPGKSDFDSDLSDEGIRSDYWNWGKGYISAYPPDQ 541
Query: 232 FDRLEALI-------PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
F LE +WAPYYT+HKILAGL+D Y + N +AL + M E+ Y R+ +
Sbjct: 542 FIMLEKGATYGGQKNQIWAPYYTLHKILAGLIDIYKVSGNEKALEIAKGMGEWVYTRL-D 600
Query: 285 VIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLG------L 336
+ + ++ + W T + E GGMN+ + L+ ITQDP+ L A LFD F G
Sbjct: 601 ALPQETLIKMWNTYIAGEFGGMNETMATLYEITQDPRFLKGAQLFDNIQMFFGDAEYSHG 660
Query: 337 LALQADDISGFHSNTHIPIVIGSQMRYEVTG-DQLHKEGHQ---------LESSGTNIGH 386
LA D G H+N HIP V+GS Y V+ D+ + + S G G
Sbjct: 661 LAKNVDTFRGLHANQHIPQVVGSLEMYRVSAKDEYFRVADNYWFKAVNDYMYSIGGVAGA 720
Query: 387 FN------FKSDPKRLASNLDSN--TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLT 438
N F ++P L N S+ E+C TYNMLK++ +LF + + DY+ER L
Sbjct: 721 RNPANAECFIAEPATLYENGFSSGGQNETCATYNMLKLTGNLFLFEQRGELMDYFERGLY 780
Query: 439 NGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY 498
N +L P Y +PL PGS K H F CC GT IES +KL SIY
Sbjct: 781 NHILASVAEDSPA-NTYHVPLRPGSIK----HFGNAKMTGFTCCNGTSIESNTKLQQSIY 835
Query: 499 FE--EEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT 556
++ EE VY+ +I S LDW+ I + Q S+ + L +G +
Sbjct: 836 YKSIEEN---AVYVNLFIPSTLDWEERNIKIKQ----ATSFPKEDKTQLLVEGEGEFV-- 886
Query: 557 SLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQ 615
L+LR+P+W + G ++NG+++ L PG+++++++ W DK+ +++P + +
Sbjct: 887 -LHLRVPSW-ARKGYHVSINGKEIQLDVKPGSYIAISRFWEDGDKVDLRMPFDFYLDPVM 944
Query: 616 DDRPEYASIQAILYGPYVLAGH 637
D +I ++ YGP +LA
Sbjct: 945 DQ----PNIASLFYGPILLAAQ 962
>gi|379726800|ref|YP_005318985.1| hypothetical protein MPD5_0184 [Melissococcus plutonius DAT561]
gi|376317703|dbj|BAL61490.1| hypothetical protein MPD5_0184 [Melissococcus plutonius DAT561]
Length = 883
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 181/566 (31%), Positives = 267/566 (47%), Gaps = 94/566 (16%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFRKTARL-PAPGEPYGGWEEP-SCELRGHFVGHYLSASA 190
+AQ+ + YLL LDV K ++ F K A + P Y GWE RGHF GH+LSA A
Sbjct: 18 KAQENVIHYLLSLDVQKFLFEFYKVAGMKPLTESGYQGWERSDQVNFRGHFFGHFLSALA 77
Query: 191 LMWASTHNESLKEKM----SAVVSALSACQKEIG------SGYLSAFPTEQFDRLEA--L 238
L + + LK+K+ ++ L A QK +GY+SAF D +E +
Sbjct: 78 LSYQAEKQPILKKKIHQQIKTAITGLKAVQKNYAKQHPEHAGYISAFKEVALDEVEGKPV 137
Query: 239 IP-----VWAPYYTIHKILAGLLD------QYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P V +Y +HKILAGLL+ + + EAL + +W +Y Y R+ N+
Sbjct: 138 DPKEKENVLVSWYNLHKILAGLLEVNISLKEVDSQLSKEALFIASWFGDYIYKRMMNLTD 197
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
K Q L E GGMND LY LF +TQ +H + A FD+ LA + + G
Sbjct: 198 KN------QMLTIEYGGMNDALYCLFELTQKKEHAIAATYFDEDNLFNQLANDENVLPGK 251
Query: 348 HSNTHIPIVIGSQMRYEVTGD--------------------------QLHKEGHQLESSG 381
H+NT IP +IG+ RY V Q+ + H + G
Sbjct: 252 HANTTIPKLIGALKRYMVFQSEDLSAWLSNEEKEHLMSYFKAAEKFWQIVVDNHTYCTGG 311
Query: 382 TNIG-HFNFKSDPKRLASNLDSN----TEESCTTYNMLKVSRHLFRWTKEIAYADYYERS 436
+ HF+ +P L + + T E+C T+NMLK++R L+ TK Y DYYE +
Sbjct: 312 NSQSEHFH---EPNELFYDSEIRQGDCTCETCNTHNMLKLTRKLYECTKNPKYLDYYETT 368
Query: 437 LTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDS 496
N +L Q ++ G+M+Y P+ G +K + P D FWCC GTGIESFSKL D+
Sbjct: 369 YINAILASQ-NSKTGMMMYFQPMGAGYNKV-----YNRPYDEFWCCSGTGIESFSKLADT 422
Query: 497 IYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTL---TFSSKGSG 553
YF+E + +++ Y S+ L K + + QK D VT+ T + K
Sbjct: 423 YYFKENNR---LFVNLYFSNTLKLKENNLKIIQKTDRKNG-----NVTIDLKTLTDKNII 474
Query: 554 LTTSLNLRIPTWTSS---NGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
L LR+P W K LN + P G F +++ +++D++ +++ L+
Sbjct: 475 QPLQLALRLPNWAKQVTIKKGKKLLNYE----PHLG-FAYLSELVTANDQIILEMEQELQ 529
Query: 611 TEAIQDDRPEYASIQAILYGPYVLAG 636
D P+ A+ A YGPY+LAG
Sbjct: 530 LL----DTPDNANYIAFKYGPYILAG 551
>gi|332685731|ref|YP_004455505.1| hypothetical protein MPTP_0197 [Melissococcus plutonius ATCC 35311]
gi|332369740|dbj|BAK20696.1| hypothetical protein MPTP_0197 [Melissococcus plutonius ATCC 35311]
Length = 883
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 179/563 (31%), Positives = 264/563 (46%), Gaps = 88/563 (15%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFRKTARL-PAPGEPYGGWEEP-SCELRGHFVGHYLSASA 190
+AQ+ + YLL LDV K ++ F K A + P Y GWE RGHF GH+LSA A
Sbjct: 18 KAQENVIHYLLSLDVQKFLFEFYKVAGMKPLTESGYQGWERSDQVNFRGHFFGHFLSALA 77
Query: 191 LMWASTHNESLKEKM----SAVVSALSACQKEIG------SGYLSAFPTEQFDRLEA--L 238
L + + LK+K+ ++ L A QK +GY+SAF D +E +
Sbjct: 78 LSYQAEKQPILKKKIHQQIKTAITGLKAIQKNYAKQHPEHAGYISAFKEVALDEVEGKPV 137
Query: 239 IP-----VWAPYYTIHKILAGLLD------QYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
P V P+Y +HKILAGLL+ + + EAL + +W +Y Y R+ N+
Sbjct: 138 DPKEKENVLVPWYNLHKILAGLLEVNISLKEVDSQLSKEALFIASWFGDYIYKRMMNLTD 197
Query: 288 KYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGF 347
K Q L E GGMND LY LF +TQ +H + A FD+ LA + + G
Sbjct: 198 KN------QMLTIEYGGMNDALYYLFELTQKKEHAIAATYFDEDNLFNQLANDENVLPGK 251
Query: 348 HSNTHIPIVIGSQMRYEVTGD--------------------------QLHKEGHQLESSG 381
H+NT IP +IG+ RY V Q+ + H + G
Sbjct: 252 HANTTIPKLIGALKRYMVFQSEDLSAWLSNEEKEHLMSYFKAAENFWQIVVDNHTYCTGG 311
Query: 382 TNIG-HFNFKSDPKRLASNLDSN----TEESCTTYNMLKVSRHLFRWTKEIAYADYYERS 436
+ HF+ P L + + T E+C T+NMLK++R L+ TK+ Y DYYE +
Sbjct: 312 NSQSEHFH---GPNELFYDSEIRQGDCTCETCNTHNMLKLTRKLYECTKDPKYLDYYETT 368
Query: 437 LTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDS 496
N +L Q ++ G+M+Y P+ G +K + P D FWCC GTGIESFSKL D+
Sbjct: 369 YINAILASQ-NSKTGMMMYFQPMGAGYNKV-----YNRPYDEFWCCSGTGIESFSKLADT 422
Query: 497 IYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTL---TFSSKGSG 553
YF+E + +++ Y S+ L K + + QK D VT+ T + K
Sbjct: 423 YYFKENNR---LFVNLYFSNTLKLKENNLKIIQKTDRKNG-----NVTIDLKTLTDKNII 474
Query: 554 LTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEA 613
L LR+P W K + L S F ++ +++D++ +++ L+
Sbjct: 475 QPLQLALRLPNWAKQVTIKK--GKKLLNYKSHLGFAYLSGLVTANDQIILEMEQELQLL- 531
Query: 614 IQDDRPEYASIQAILYGPYVLAG 636
D P+ + A YGPY+LAG
Sbjct: 532 ---DTPDNTNYIAFKYGPYILAG 551
>gi|326798346|ref|YP_004316165.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326549110|gb|ADZ77495.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
Length = 1022
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 187/607 (30%), Positives = 282/607 (46%), Gaps = 104/607 (17%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKT--ARLPAPGEPYGGWEE 172
L +VSL G + + + L D + ++ FR + P P G W+
Sbjct: 379 LDQVSLEADAHGHKTKFIENRDKFINTLAATDPNSFLYMFRHAFGQKQPEGARPLGVWDS 438
Query: 173 PSCELRGHFVGHYLSASALMWAST-HNESLK----EKMSAVV------SALSACQKEIGS 221
+LRGH GHYL+A A +A T ++++L+ EKM +V S LS KE G
Sbjct: 439 QETKLRGHATGHYLTAIAQAYAGTGYDKALQAKFAEKMEYMVNTLYELSQLSGKPKEAGG 498
Query: 222 ------------------------------------GYLSAFPTEQFDRLEALIP----- 240
G++SA+P +QF LE
Sbjct: 499 IHVSDPTAVPYGPGKTEYDSDFSDEGIRTDYWNWGEGFISAYPPDQFIMLERGAKYGGQK 558
Query: 241 --VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT- 297
VWAPYYT+HKILAGL+D Y + N +AL + T M ++ Y R+ + + ++ + W T
Sbjct: 559 NQVWAPYYTLHKILAGLMDVYEVSGNKKALEIATGMGDWVYARLSKLPTE-TLIKMWNTY 617
Query: 298 LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLG------LLALQADDISGFHSN 350
+ E GGMN+V+ +L+ IT P +L A LFD F G LA D G H+N
Sbjct: 618 IAGEFGGMNEVMARLYRITNKPNYLKTAQLFDNIKMFYGDASHSHGLAKNVDTFRGLHAN 677
Query: 351 THIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFN------FKSDPK 394
HIP ++GS Y V+ + ++ + S G G N F S P
Sbjct: 678 QHIPQIVGSIEMYRVSNNPVYYSIADNFWYKVVNDYMYSIGGVAGARNPANAECFISQPA 737
Query: 395 RLASNLDS--NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV 452
L N S E+C TYNMLK++ LF + + DYYER L N +L P
Sbjct: 738 TLYENGFSAGGQNETCATYNMLKLTSDLFLFDQRPELMDYYERGLYNHILASVAEDSP-A 796
Query: 453 MIYLLPLAPGSSKERSYHHWGTPS-DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
Y +PL PGS K+ +G P F CC GT IES +KL +SIYF+ + +Y+
Sbjct: 797 NTYHVPLRPGSIKQ-----FGNPHMTGFTCCNGTAIESSTKLQNSIYFKSKDN-DALYVN 850
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L+W +I V Q D + + R+T+ KG G +++R+P W ++ G
Sbjct: 851 LFIPSTLEWAERKITVQQTTD--FPNEDHTRLTI----KGGG-KFDMHVRVPGW-ATKGF 902
Query: 572 KATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NG+D L + PG++L +++ W D + +Q+P + + D + +I ++ YG
Sbjct: 903 FVRVNGKDQKLEAKPGSYLKISRNWKDGDVVDLQMPFQFHLDPVMDQQ----NIASLFYG 958
Query: 631 PYVLAGH 637
P +LA
Sbjct: 959 PILLAAQ 965
>gi|408369881|ref|ZP_11167661.1| hypothetical protein I215_03228 [Galbibacter sp. ck-I2-15]
gi|407744935|gb|EKF56502.1| hypothetical protein I215_03228 [Galbibacter sp. ck-I2-15]
Length = 1011
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 201/664 (30%), Positives = 301/664 (45%), Gaps = 113/664 (17%)
Query: 105 FKVPER--SGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPA 162
+ PER + L +V L+ G + + + L D D ++ FR +
Sbjct: 356 LEAPERMVTSFKLSQVHLNKDSKGRGTKFIENRDKFVNTLAKTDPDSFLYMFRNAFGVSQ 415
Query: 163 P--GEPYGGWEEPSCELRGHFVGHYLSASALMWAST-HNESLKE----KMSAVVSALSAC 215
P +P G W+ +LRGH GHYL+A A +AS+ ++E LKE KM+ +V L
Sbjct: 416 PQDAKPLGVWDSQETKLRGHATGHYLTAIAQAYASSSYDEQLKELFAQKMNYMVETLYDL 475
Query: 216 QK------------------------------------------EIGSGYLSAFPTEQFD 233
K G+GY+SA+P +QF
Sbjct: 476 SKLSGQPINSGGEHVSDPTKVPFGPGKTDYNSDLSEQGIRNDYWNWGTGYISAYPPDQFI 535
Query: 234 RLEALIP-------VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVI 286
LE+ +WAPYYT+HKILAGLLD Y + N +AL + M ++ R+ +
Sbjct: 536 MLESGATYGGQNDQIWAPYYTLHKILAGLLDVYEISGNKKALSVAQGMGDWVSARMVELP 595
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLG------LLAL 339
I + + E GGMN+V+ +L+ +T +L +A LFD F G LA
Sbjct: 596 TSTLISMWNRYIAGEYGGMNEVMARLYRLTGTESYLKVAGLFDNIKMFYGDAQHTHGLAK 655
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLH---------KEGHQ-LESSGTNIGHFN- 388
D G HSN HIP ++G+ Y T + + K H + S G G N
Sbjct: 656 NVDTFRGLHSNQHIPQIVGALEMYRDTDEVEYFKIADNFWFKATHDYMYSIGGVAGARNP 715
Query: 389 -----FKSDPKRLASNLDSN--TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGV 441
F P L N S+ E+C TYNMLK++R LF + + DYYER L N +
Sbjct: 716 ANAECFPVQPATLYENGFSSGGQNETCATYNMLKLTRDLFFFEPKAQLMDYYERGLYNHI 775
Query: 442 LGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPS-DSFWCCYGTGIESFSKLGDSIYFE 500
L P Y +PL PGS K H+G P F CC GT IES +KL +SIYF+
Sbjct: 776 LASVAKDSP-ANTYHVPLLPGSVK-----HFGNPDMTGFTCCNGTAIESSTKLQNSIYFK 829
Query: 501 EEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNL 560
+ +Y+ +I S L W I + Q V S+ TL + KG L L
Sbjct: 830 GKDN-KSLYVNLFIPSTLHWTERNIEIQQ----VTSFPKEDNTTLKVTGKGR---FDLKL 881
Query: 561 RIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
R+P W ++NG ++NG+++ + +PG++LS+ + W + D + + +P R E + D +
Sbjct: 882 RVPNW-ATNGYHVSINGKEMDIQVTPGSYLSIDRKWKNGDIIELSMPFDFRLEPVMDQQ- 939
Query: 620 EYASIQAILYGPYVLAGHS---IGDW-DITESATSLSDWITPIPAS--YNSQLITFT--- 670
+I ++ YGP +LA + W +T A + +I P++ +N + I F
Sbjct: 940 ---NIASLFYGPVLLAAQEESPLTHWRKVTFDAEQIGKFIKGDPSTLEFNYKGIEFKPFY 996
Query: 671 QEYG 674
Q YG
Sbjct: 997 QSYG 1000
>gi|86142285|ref|ZP_01060795.1| hypothetical protein MED217_11584 [Leeuwenhoekiella blandensis
MED217]
gi|85831037|gb|EAQ49494.1| hypothetical protein MED217_11584 [Leeuwenhoekiella blandensis
MED217]
Length = 793
Score = 233 bits (594), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 165/533 (30%), Positives = 256/533 (48%), Gaps = 61/533 (11%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALM 192
A T+ Y+ LD D+L+ F + A L + Y WE + L GH GHY+SA ++
Sbjct: 43 EAALTDFNYIQALDADRLLAPFLREAGLEPKADSYTNWE--NTGLDGHTAGHYISALSMY 100
Query: 193 WASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPV----------- 241
+AST + KE + ++ L QK G+GY+ P D L A I
Sbjct: 101 YASTGDPKAKEMLEYALAELDRVQKSNGNGYIGGVPGS--DALWAEIKAGKINAGSFSLN 158
Query: 242 --WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
W P Y IHK GL D + +A+ +A RM + ++F + + S + L
Sbjct: 159 DKWVPLYNIHKTFNGLKDAWIHAELPQAKRMLIELTDWFLD----ITADLSEAQIQDMLR 214
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGS 359
E GG+N+V +++ IT D K+L LA F + L LA D ++G H+NT IP IG
Sbjct: 215 SEHGGLNEVFAEVYAITSDKKYLKLAEDFSQHALLKPLAANEDILTGMHANTQIPKFIGF 274
Query: 360 QMRYEVTGDQLHKEGHQLESS---------GTNIG------HFNFKSDPKRLASNLDSNT 404
+ ++ + K+ H S+ +IG HFN D + S+
Sbjct: 275 E---RISQLEEAKDYHDAASNFFDNVTTRRSISIGGNSVREHFNPVDDFSSVVSS--EQG 329
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSS 464
ESC TYNMLK+S+ LF T E Y D+YER L N +L Q G +Y P+ PG
Sbjct: 330 PESCNTYNMLKLSKLLFEDTSEEHYIDFYERGLYNHILSSQ--NPDGGFVYFTPIRPG-- 385
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
Y + P SFWCC G+G+E+ +K + IY ++E K +Y+ +I S ++W+
Sbjct: 386 ---HYRVYSQPETSFWCCVGSGMENHTKYNELIYAKKEDK---LYVNLFIPSEVNWEEKN 439
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-P 583
+ QK + P +T + +L LR P W ++ K +N + +
Sbjct: 440 ATLTQKTN-----FPEEALTELIWNSRKKTKATLMLRYPQWVNAGELKVYVNDKLEKIDA 494
Query: 584 SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+PG+++S+ + W + D++ ++LP+ L E + DD Y S++ YGP VLA
Sbjct: 495 TPGSYVSLERKWKNGDRIKMELPMHLSLEELPDDSG-YVSVK---YGPIVLAA 543
>gi|423303007|ref|ZP_17281028.1| hypothetical protein HMPREF1057_04169 [Bacteroides finegoldii
CL09T03C10]
gi|408470336|gb|EKJ88871.1| hypothetical protein HMPREF1057_04169 [Bacteroides finegoldii
CL09T03C10]
Length = 801
Score = 233 bits (593), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 171/565 (30%), Positives = 261/565 (46%), Gaps = 63/565 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L+DV+L D AQ N LL DVD+L+ F A L E + W L G
Sbjct: 34 LNDVQL-LDGPFKHAQDLNRSVLLEYDVDRLLAPFLIEAGLEPKAEKFPNWPG----LDG 88
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA-- 237
H GHYLSA A+ + + E K +M ++S L CQ+ G GY+ P + E
Sbjct: 89 HVAGHYLSAMAMNYRAGGGEEFKRRMEYILSELYRCQQANGDGYIGGIPNGKAGWKEIKK 148
Query: 238 -----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQNVIKK 288
+ WAP+Y +HK+ AGL D + YAD+ A +M W + VI
Sbjct: 149 GNVGIIWKYWAPWYNLHKLYAGLRDAWLYADSELAKKMFLDYCDWGI--------GVISG 200
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
+ E+ Q LN E GGMN+V + I+ D K+L A F + D++ H
Sbjct: 201 LNDEQMEQMLNNEFGGMNEVFADAYQISGDTKYLDAAKRFSHKWLFESMRDGKDNLDNKH 260
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHKEGHQLESS------------------GTNIGHFNFK 390
+NT +P +G Q E++ Q + G ++ + G N +F
Sbjct: 261 ANTQVPKAVGYQRVAELS-VQAKRSGDAVDYTRAAYFFWQTVTANRSLAFGGNSRREHFP 319
Query: 391 SDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
D L+ D ESC TYNML+++ LFR + AYAD+YER+L N +L Q
Sbjct: 320 DDADYLSYVDDREGPESCNTYNMLRLTEGLFRKDPKAAYADFYERALFNHILSTQHPVHG 379
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G +Y P P Y + P+++ WCC GTG+E+ K G+ IY +Y+
Sbjct: 380 GY-VYFTPARPA-----HYRVYSAPNEAMWCCVGTGMENHGKYGEFIYAHTGDS---LYV 430
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+ISSRL+WK +I + Q S+ + LT ++K S L +R P W
Sbjct: 431 NLFISSRLEWKKRRISLTQ----TTSFPDEGKTCLTITAKKS-TKFPLFVRKPGWVGDGK 485
Query: 571 AKATLNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
T+NG+ + + N + ++ + W + D + +Q+P+ +R E ++ PEY AI+
Sbjct: 486 VIITVNGKSIETTTAANSYYTINRKWKNGDVVEVQMPMNIRIEELK-HHPEYI---AIMR 541
Query: 630 GPYVLAGHSIGDWDITESATSLSDW 654
GP +L G ++G ++ S W
Sbjct: 542 GP-ILLGANVGKENLNGLVASDHRW 565
>gi|189466409|ref|ZP_03015194.1| hypothetical protein BACINT_02784 [Bacteroides intestinalis DSM
17393]
gi|189434673|gb|EDV03658.1| hypothetical protein BACINT_02784 [Bacteroides intestinalis DSM
17393]
Length = 789
Score = 233 bits (593), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 165/556 (29%), Positives = 269/556 (48%), Gaps = 67/556 (12%)
Query: 116 KEVS---LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEE 172
+EVS L DV+L +S +AQQT+L Y++ ++ D+L+ F + A L Y WE
Sbjct: 24 QEVSYFPLQDVKL-LESPFLQAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWE- 81
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TE 230
+ L GH GHY+SA ++M+A+T + ++ +++ +++ L Q+ +G+G++ P +
Sbjct: 82 -NTGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLAELHRAQQAVGTGFIGGTPGSLQ 140
Query: 231 QFDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEY 277
+ ++A L W P Y IHK AGL D Y YA + A M T WM++
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSNLAREMLIALTDWMID- 199
Query: 278 FYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLL 337
+ + ++ L E GG+N+ + IT D K+L LA F L L
Sbjct: 200 -------ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPL 252
Query: 338 ALQADDISGFHSNTHIPIVIGSQMRYEVTGDQL---HKE--------------GHQLESS 380
D ++G H+NT IP VIG + ++ D H H+
Sbjct: 253 VKDEDRLTGMHANTQIPKVIGYKRIADLAQDDKDWNHASEWDHAARFFWNTVVNHRSVCI 312
Query: 381 GTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG 440
G N +F + D E+C TYNML++++ L++ + +I +ADYYER+L N
Sbjct: 313 GGNSVREHFHPADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNH 372
Query: 441 VLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFE 500
+L Q+ E G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 373 ILASQQ-PEKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYAH 426
Query: 501 EEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNL 560
+Y+ +I SRL W+ ++ + Q+ RV K SL L
Sbjct: 427 TNDT---LYVNLFIPSRLTWQEKKVTLVQETRFPDEEQIRFRV-----EKSRKKAFSLKL 478
Query: 561 RIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
R P+W + GA ++NG+ PG +L++ + W + D++T+ +P+ + E I P
Sbjct: 479 RYPSW--AKGASVSVNGKVQETNAQPGEYLTIHRKWKAGDEITLNMPMQVALEQI----P 532
Query: 620 EYASIQAILYGPYVLA 635
+ + A +YGP VLA
Sbjct: 533 DRENFYAFMYGPIVLA 548
>gi|443629445|ref|ZP_21113773.1| putative Secreted protein [Streptomyces viridochromogenes Tue57]
gi|443337063|gb|ELS51377.1| putative Secreted protein [Streptomyces viridochromogenes Tue57]
Length = 941
Score = 233 bits (593), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 150/438 (34%), Positives = 224/438 (51%), Gaps = 42/438 (9%)
Query: 222 GYLSAFPTEQFDRLEA-----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE+ VWAPYYT HKIL G+LD Y D+A AL + + M +
Sbjct: 390 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGVLDAYLATDDARALDLASGMCD 449
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ Y+R+ + + +++R W + E GG+ + + L IT +HL LA LFD +
Sbjct: 450 WMYSRLSK-LPEATLQRMWGLFSSGEFGGIVEAVCDLHTITGKAEHLALAQLFDLDRLID 508
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLES-SGTNI 384
A D + G H+N HIPI G Y+ TG+Q + + H++ GT+
Sbjct: 509 NCAANTDILDGLHANQHIPIFTGYLRLYDATGEQRYLDAARNFWGMVVPHRMYGIGGTST 568
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G F D +A + + E+C YNMLK+SR LF ++ Y DYYER+L N VLG
Sbjct: 569 GEFWKARDV--IAGTISATNAETCCAYNMLKLSRTLFFHEQQPKYMDYYERALFNQVLGS 626
Query: 445 QR---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
++ E ++ Y + L PG ++ TP CC GTG+ES +K DS+YF+
Sbjct: 627 KQDKADAEKPLVTYFIGLTPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYFKA 680
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
+Y+ Y SRL W + V Q ++ TLT G +L LR
Sbjct: 681 A-DGSALYVNLYSPSRLAWAEKGVTVTQ----TTAFPREQGTTLTIG--GGSAAFALRLR 733
Query: 562 IPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
+P+W ++ G + T+NG + P PG++ +V++TW S D + I +P LR E DD
Sbjct: 734 VPSWATA-GFRVTVNGSAVSGTPKPGSYFTVSRTWRSGDTVRISMPFRLRVEKAIDD--- 789
Query: 621 YASIQAILYGPYVLAGHS 638
S+Q + YGP L G +
Sbjct: 790 -PSLQTLFYGPVNLVGRN 806
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 56/110 (50%), Gaps = 6/110 (5%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWE-- 171
++ +L DV L + +Q L++ DV++L+ FR A L G GGWE
Sbjct: 51 VQPFALDDVAL-RPGLFADKRQLMLDHARGYDVNRLLQVFRANAGLSTGGAVAPGGWEGL 109
Query: 172 --EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
E + LRGH+ GH+L+ + +A T + +++ +V AL+ ++ +
Sbjct: 110 DGEANGNLRGHYTGHFLTMLSQAYAGTGEQVFVDRIRTMVGALTEVREAL 159
>gi|332185145|ref|ZP_08386894.1| hypothetical protein SUS17_217 [Sphingomonas sp. S17]
gi|332014869|gb|EGI56925.1| hypothetical protein SUS17_217 [Sphingomonas sp. S17]
Length = 782
Score = 233 bits (593), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 178/593 (30%), Positives = 272/593 (45%), Gaps = 70/593 (11%)
Query: 107 VPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEP 166
+P+++ F L VRL S++ A +TN YL LD D+L+ NFR A L
Sbjct: 24 LPDKAEPF----PLSAVRL-RPSIYATAVETNRRYLYRLDPDRLLHNFRLYAGLKPKAPI 78
Query: 167 YGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSA 226
YGGWE S + GH +GHY+SA L W T + ++ + +VS L+ Q + G+GY+ A
Sbjct: 79 YGGWE--SDTIAGHTLGHYMSALVLTWQQTGDTEMRRRADYIVSELAEAQAKRGTGYVGA 136
Query: 227 FPTEQFD----------------RLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAE 266
++ D ++++ L W+P YT+HK+ AGLLD + NA+
Sbjct: 137 LGRKRADGTIVDGEEIFHEIMAGKIKSGGFDLNGSWSPLYTVHKLFAGLLDIHGGWGNAQ 196
Query: 267 ALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAH 326
AL + + YF V R L E GG+N+ +L+ T D + L LA
Sbjct: 197 ALDVAVKLGGYF----ARVFAALDDARLQDVLGCEYGGLNESFAELYQRTGDRQWLALAE 252
Query: 327 LFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQL----------HKEGHQ 376
L L D ++ H+NT +P +IG +E+T + GH
Sbjct: 253 RIYDNKVLDPLVAGKDQLANLHANTQVPKLIGLARIHEITAAPAPAAGARFFWENVTGHH 312
Query: 377 LESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERS 436
G N F S+P +A ++ T E C +YNMLK++RHL+ W + DYYER+
Sbjct: 313 SYVIGGNADREYF-SEPDTIARHITEQTCEHCNSYNMLKLTRHLYGWQPDGRLFDYYERA 371
Query: 437 LTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDS 496
N V+ Q G Y+ PL G ++E S D+FWCC G+G+ES +K G+S
Sbjct: 372 HLNHVMAAQHPVHAG-FTYMTPLMTGMAREFSTDK----DDAFWCCVGSGMESHAKHGES 426
Query: 497 IYFEEEGKYPGVYIIQYISSRLDW-KSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT 555
I+++ +++ YI + W K G +V P+ L FS
Sbjct: 427 IFWQGGDT---LFVNLYIPAEARWDKRGAVVTLDTAYPMDG-----AAKLAFSRLDRAGR 478
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQ 615
+ LR+P W + A +NGQ + + V + W + D + I+LPL LR E
Sbjct: 479 FPVALRVPGWANGQAA-VEVNGQPVTPVFERGYAVVDRRWKTGDTVAIRLPLDLRVEPTP 537
Query: 616 DDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLIT 668
D S+ A++ GP V+A D+ + T W +P PA + +T
Sbjct: 538 GDD----SVVAVVRGPMVMAA------DLGPTTTP---WDSPDPAMVGANPLT 577
>gi|291544618|emb|CBL17727.1| Uncharacterized protein conserved in bacteria [Ruminococcus
champanellensis 18P13]
Length = 597
Score = 232 bits (592), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 166/547 (30%), Positives = 266/547 (48%), Gaps = 46/547 (8%)
Query: 138 NLEYLLMLDVDKLVWNFRKTARLPAP---GEPYGGWEEPSCELRGHFVGHYLSASALMWA 194
N YL+ L + L+ NF A + E + GWE P+C+LRGHF+GH+LSA+AL+ A
Sbjct: 24 NRAYLMELKSENLLQNFLLEAGVRTDRDVTEMHLGWESPTCQLRGHFLGHWLSAAALLIA 83
Query: 195 STHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAG 254
+ LK K+ ++ AL+ CQ+ G ++ + P + F++L+ +W+P YT+HK L G
Sbjct: 84 QNQDRELKAKLDTIIDALARCQELNGGRWIGSIPEKYFEKLKKNEYIWSPQYTLHKTLLG 143
Query: 255 LLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFC 314
L YA N AL + +++ + +++K H + E GGM +V L+
Sbjct: 144 LYHSALYAKNQVALEILGRAADWYLEWTEKMMQK---NPH-AVYSGEEGGMLEVWAGLYQ 199
Query: 315 ITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGD----QL 370
+T+D ++L LA + P G LA D +S H+N IP G+ YE+TGD +L
Sbjct: 200 LTEDERYLTLAQRYAHPSIFGRLADGEDPLSNCHANASIPWAHGAAKMYEITGDAAWLEL 259
Query: 371 HKEGHQLESS--------GTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFR 422
K Q S G N G F P++L L T+E CT YNM++++ +LF
Sbjct: 260 VKRFWQCAVSDRDAFCTGGQNSGEFWIP--PRKLGMFLGERTQEFCTVYNMVRLADYLFC 317
Query: 423 WTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCC 482
+T Y DY E +L NG L Q+ G+ Y LP+ GS K+ WG+ + FWCC
Sbjct: 318 FTGAHEYLDYIENNLYNGFLA-QQNKYTGMPAYFLPMKAGSVKK-----WGSKTKDFWCC 371
Query: 483 YGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVD-----PVVSW 537
+GT +++ + ++ ++ + + + QYI+S + + + + Q VD S+
Sbjct: 372 HGTTVQAHTIYPQLCWYADKEQ-NRLILAQYINSVCKF-NAHVTITQSVDMKYYNDGASF 429
Query: 538 DP-----YLRVTLTFSSKGSGLTT-SLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSV 591
D R + K +L+LRIP W + +NGQ + S F +
Sbjct: 430 DERDDSRMFRWYIKLHVKAEQPERFTLSLRIPAWVAGELV-ILVNGQHAEVESVNGFAEL 488
Query: 592 TKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSL 651
+ W DD + + P L T ++ P+ + A GP VLAG D I +
Sbjct: 489 DRVW-EDDTVNLYFPAALTTCSL----PDMPQLLAFREGPIVLAGLCESDRGIYLAQNDP 543
Query: 652 SDWITPI 658
+ +TP+
Sbjct: 544 TSALTPV 550
>gi|329847096|ref|ZP_08262124.1| tat twin-arginine translocation pathway signal sequence domain
protein [Asticcacaulis biprosthecum C19]
gi|328842159|gb|EGF91728.1| tat twin-arginine translocation pathway signal sequence domain
protein [Asticcacaulis biprosthecum C19]
Length = 795
Score = 232 bits (592), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 168/548 (30%), Positives = 266/548 (48%), Gaps = 57/548 (10%)
Query: 118 VSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCEL 177
++L DVRL A N YLL L+ D+ + N+RK A L E YGGWE + +
Sbjct: 44 LALGDVRLLPSPFK-TALDVNHTYLLTLEPDRFLHNYRKGAGLTPKAEKYGGWENDT--I 100
Query: 178 RGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--------- 228
GH +GHYLSA +LM+A T + +LK + + V+ L+ Q G GY++ F
Sbjct: 101 AGHSLGHYLSAISLMYAQTGDATLKARAAYVIDELALIQGMQGDGYVAGFTRKRPDGTIV 160
Query: 229 --TEQFDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEY 277
E F ++A L W P Y HK+ GL D T+ + + + T + Y
Sbjct: 161 DGKELFAEIKAGDIRSAGFDLNGCWVPLYNWHKLYTGLFDAQTFCGLNKGVVVATGLGHY 220
Query: 278 FYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLL 337
+ +V + ++ Q LN E GG+N+ +L T D + L LA L +
Sbjct: 221 ----IDSVFAALNDDQVQQVLNCEFGGLNESFAELHARTGDARWLTLAERMHHNRVLDPM 276
Query: 338 ALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK----------EGHQLESSGTNIGHF 387
+ D ++ HSNT IP V+G YE+TG + GH G N G
Sbjct: 277 IKREDKLANIHSNTTIPKVLGLARLYEITGKADYHTASDFFWERVTGHHSYVIGGN-GDR 335
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
+ +P ++ ++ T E C TYNML+++R L+ W + + DY+ER+ N VL Q+
Sbjct: 336 EYFFEPDTISRHITEATCEHCATYNMLRLTRFLYSWQPDASRFDYFERAHLNHVLS-QQN 394
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
+ G+ Y+ PL G+ ER + P D++ CC+GTG+ES ++ +SI+++
Sbjct: 395 PKTGMFSYMTPLFTGA--ERGF---SDPVDNWTCCHGTGMESHARHAESIWWQSADT--- 446
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+++ YI S W + + ++D +D +++ +T + + L LR+P W
Sbjct: 447 LFVNLYIPSTAQWTTKG--ASLRMDTGYPYDGGVKLAVTALRRPTRF--KLALRVPGWAK 502
Query: 568 SNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAI 627
+ A TLNG+ G +L + + W + DK+ + LPL LR EA D+ I A+
Sbjct: 503 T--AAVTLNGKPAQAVRDGGYLVIDRVWQAGDKIALDLPLDLRLEATSDN----TGIVAV 556
Query: 628 LYGPYVLA 635
L GP VLA
Sbjct: 557 LRGPMVLA 564
>gi|408533805|emb|CCK31979.1| secreted protein [Streptomyces davawensis JCM 4913]
Length = 943
Score = 232 bits (591), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 149/438 (34%), Positives = 226/438 (51%), Gaps = 42/438 (9%)
Query: 222 GYLSAFPTEQFDRLEA-----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE+ VWAPYYT HKIL G+LD Y D+A AL + + M +
Sbjct: 392 GFLAAYPETQFIDLESRTSSDYTKVWAPYYTAHKILRGVLDAYLATDDARALDLASGMAD 451
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ ++R+ + + +++R W + E GG+ + + L IT +HL LA LFD +
Sbjct: 452 WMHSRLSK-LPEATLQRMWGLFSSGEFGGIVEAICDLHAITGKAEHLALARLFDLDRLID 510
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLES-SGTNI 384
A D + G H+N HIPI G Y+ TG+Q + + H++ GT+
Sbjct: 511 SCAANTDILDGLHANQHIPIFTGYLRLYDATGEQRYLDAARNFWGMVVPHRMYGIGGTST 570
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G F D +A + + T E+C YN+LK+SR LF Y DYYER+L N VLG
Sbjct: 571 GEFWKARD--VIAGTISATTAETCCAYNLLKLSRTLFFHEPSPKYMDYYERALYNQVLGS 628
Query: 445 QR---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
++ E ++ Y + L PG ++ TP CC GTG+ES +K DS+YF
Sbjct: 629 KQDKPDAEKPLVTYFIGLTPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYFTT 682
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
+ +Y+ Y SRL+W + V Q ++ TLT G + L LR
Sbjct: 683 D-DGSALYVNLYSPSRLNWADKGVTVTQ----ATAFPQEQGTTLTIG--GGSASFELRLR 735
Query: 562 IPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
+P+W ++ G + T+NG+ + P+PG++ +V++TW S D + I +P LR E DD
Sbjct: 736 VPSWATA-GFRVTVNGRAVSGTPAPGSYFAVSRTWRSGDTVRISMPFRLRAEKALDD--- 791
Query: 621 YASIQAILYGPYVLAGHS 638
S+Q + YGP L G +
Sbjct: 792 -PSLQTLCYGPVNLVGRN 808
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 57/114 (50%), Gaps = 14/114 (12%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLP-----APGEPYGG 169
+K +L V LG + ++ L++ DVD+L+ FR A LP APG G
Sbjct: 53 VKPFALDQVTLGQ-GLFADKRELMLDHARGYDVDRLLQVFRANAGLPTGDAVAPG----G 107
Query: 170 WE----EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
WE E + LRGH+ GH+++ A WA T + +++ ++ AL+ + +
Sbjct: 108 WEGLDGEANGNLRGHYTGHFMTMLAQAWAGTGEQVFADRLRTMIGALTEVRAAL 161
>gi|325299889|ref|YP_004259806.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
gi|324319442|gb|ADY37333.1| protein of unknown function DUF1680 [Bacteroides salanitronis DSM
18170]
Length = 797
Score = 232 bits (591), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 164/533 (30%), Positives = 248/533 (46%), Gaps = 52/533 (9%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALM 192
+A N++ L D D+L+ + K A LP+ E + WE L GH GHYLSA A+
Sbjct: 43 QACDLNVKTLKQYDTDRLLAPYLKEAGLPSKAEGFSNWEG----LDGHVGGHYLSALAIH 98
Query: 193 WASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QFDRLEALIPVWAPY 245
+A+T + +++M +VS L CQ+ G+GY+ P Q + + W P+
Sbjct: 99 YAATGDAECRQRMDYMVSELKRCQEAHGNGYIGGVPDGERLWKEIQQGNVGLIWKYWVPW 158
Query: 246 YTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGM 305
Y +HK AGL D + Y N EA +M + ++ VI S E+ Q L E GGM
Sbjct: 159 YNLHKTYAGLRDAWAYGGNEEARQMFLDLCDWGLT----VIAPLSDEQMEQMLENEFGGM 214
Query: 306 NDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEV 365
++V + +T D K+L A F L +A D++ H+NT +P V+G Q E+
Sbjct: 215 DEVYADAYEMTGDVKYLDAAKRFSHHWLLDSMAAGIDNLDNKHANTQVPKVVGYQRIAEL 274
Query: 366 TGDQLHKEGHQLESS-----------------GTNIGHFNFKSDPKRLASNLDSNTEESC 408
+ H E L G N +F L+ D ESC
Sbjct: 275 SARSGHTEDAALYRKASEFFWQTVVETRSLALGGNSRREHFAPAEDCLSYVYDREGPESC 334
Query: 409 TTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERS 468
T NMLK++ LFR E YADYYER++ N +L Q E G +Y P P
Sbjct: 335 NTNNMLKLTEGLFRLNPEARYADYYERAVLNHILSTQH-PEHGGYVYFTPARPA-----H 388
Query: 469 YHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVN 528
Y + P+ + WCC GTG+E+ K G+ IY E + +Y+ +I+S LDW + +
Sbjct: 389 YRVYSAPNSAMWCCVGTGMENHGKYGELIYTHTENE---LYVNLFIASELDWAERGVRII 445
Query: 529 QKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGN 587
Q+ + V LT ++ + L +R P W + +A LNGQD S +
Sbjct: 446 QE----TKFPDEESVRLTIRTE-KPMKFKLLIRHPHWCRTGAMQAVLNGQDYAAASVSSS 500
Query: 588 FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIG 640
++ + + W DK+ ++LP+++ E + P AIL GP VL G +G
Sbjct: 501 YIEIERIWKDGDKVQLELPMSVSVEEL----PNVPQYIAILRGP-VLLGARMG 548
>gi|160882548|ref|ZP_02063551.1| hypothetical protein BACOVA_00499 [Bacteroides ovatus ATCC 8483]
gi|156112129|gb|EDO13874.1| hypothetical protein BACOVA_00499 [Bacteroides ovatus ATCC 8483]
Length = 801
Score = 232 bits (591), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 171/565 (30%), Positives = 260/565 (46%), Gaps = 63/565 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L DV+L D AQ N LL DVD+L+ F A L E + W L G
Sbjct: 34 LSDVQL-LDGPFKHAQDLNRSVLLEYDVDRLLAPFLIEAGLKPKAEKFPNWPG----LDG 88
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA-- 237
H GHYLSA A+ + + E K +M ++S L CQ+ G GY+ P + E
Sbjct: 89 HVAGHYLSAMAMNYRAGDGEEFKRRMEYMLSELYKCQQANGDGYIGGIPNGKAGWKEIKK 148
Query: 238 -----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQNVIKK 288
+ WAP+Y +HK+ AGL D + YAD+ A +M W + VI
Sbjct: 149 GNVGIIWKYWAPWYNLHKLYAGLRDAWLYADSELAKKMFLDYCDWGI--------GVISG 200
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
+ E+ Q LN E GGMN+V + I+ D K+L A F + D++ H
Sbjct: 201 LNDEQMEQMLNNEFGGMNEVFADAYQISGDTKYLDAAKRFSHKWLFESMRDGKDNLDNKH 260
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHKEGHQLESS------------------GTNIGHFNFK 390
+NT +P +G Q E++ Q + G ++ + G N +F
Sbjct: 261 ANTQVPKAVGYQRVAELS-VQAKRSGDAVDYTRAAYFFWQTVTANRSLAFGGNSRREHFP 319
Query: 391 SDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
D L+ D ESC TYNML+++ LFR + AYAD+YER+L N +L Q
Sbjct: 320 DDADYLSYVDDREGPESCNTYNMLRLTEGLFRKDPKAAYADFYERALFNHILSTQHPVHG 379
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G +Y P P Y + P+++ WCC GTG+E+ K G+ IY +Y+
Sbjct: 380 GY-VYFTPARPA-----HYRVYSAPNEAMWCCVGTGMENHGKYGEFIYAHTGDS---LYV 430
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+ISSRL+WK +I + Q S+ + LT ++K S L +R P W
Sbjct: 431 NLFISSRLEWKKRRISLTQ----TTSFPNEGKTCLTITAKKS-TKFPLFVRKPGWVGDGK 485
Query: 571 AKATLNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
T+NG+ + + N + ++ + W + D + +Q+P+ +R E ++ PEY AI+
Sbjct: 486 VIITVNGKSIETTTAANSYYTINRKWKNGDVVEVQMPMNIRIEELK-HHPEYI---AIMR 541
Query: 630 GPYVLAGHSIGDWDITESATSLSDW 654
GP +L G ++G ++ S W
Sbjct: 542 GP-ILLGANVGKENLNGLVASDHRW 565
>gi|408357351|ref|YP_006845882.1| hypothetical protein AXY_19880 [Amphibacillus xylanus NBRC 15112]
gi|407728122|dbj|BAM48120.1| hypothetical protein AXY_19880 [Amphibacillus xylanus NBRC 15112]
Length = 622
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 170/576 (29%), Positives = 275/576 (47%), Gaps = 82/576 (14%)
Query: 116 KEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFR-KTARLPA---PGEPYGGWE 171
K V++HD L R + N YL+ L D L++N+R + R P + +GGWE
Sbjct: 7 KNVTVHDGDLK------RREAANKSYLMSLTNDNLLFNYRVEAGRFHGREIPKDAHGGWE 60
Query: 172 EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ 231
P C++RGHF+GH+LSA+AL + + + LK K +VS L+ CQK+ G ++ P +
Sbjct: 61 TPVCQIRGHFLGHWLSAAALHYHQSGDLELKVKADLIVSELAECQKDNGGQWVGPIPEKY 120
Query: 232 FDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSI 291
+ +WAP Y +HK+ GL+D Y+Y N +AL + ++F K++
Sbjct: 121 LHWIAEGKNIWAPQYNLHKLFMGLIDMYSYTGNQQALDIADNFADWFVKWS----GKFTR 176
Query: 292 ERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNT 351
E+ L+ E GGM +V L IT K+ L + + L D ++ H+NT
Sbjct: 177 EQFDDILDVETGGMLEVWADLLEITGHDKYKFLLDRYYRQRLFQPLLEGKDPLTNMHANT 236
Query: 352 HIPIVIGSQMRYEVTGDQLH------------KEGHQLESSGTNIGHFNFKSDPK-RLAS 398
IP V+G YEVTGD E L + G G PK ++ +
Sbjct: 237 TIPEVLGCARAYEVTGDNRWLDIVKAYWNCAVTERGTLATGGNTSGEVWM---PKMKIKA 293
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ-------RGTEP- 450
L +E CT YNM++++ LF+ TK+ AY Y E +L NG++ GT
Sbjct: 294 RLGDKNQEHCTVYNMIRLADFLFQQTKDPAYGQYIEYNLYNGIMAQAYYQSYHVAGTGKN 353
Query: 451 ----GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
G++ Y LP+ G KE W + ++SF+CC+GT +++ + L IY++++ +
Sbjct: 354 HPWTGLLTYFLPMKAGLYKE-----WSSETNSFFCCHGTMVQANATLNRGIYYQDQDQ-- 406
Query: 507 GVYIIQYISSRLDWKSG--QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT-------- 556
+Y+ QY +S L+ G ++ + Q D ++S ++ + S +T+
Sbjct: 407 -IYVSQYFNSELETTIGSDRVRIKQSQD-IMSGSLLDSSSIAGQQRLSEITSIHENTPDF 464
Query: 557 ---------------SLNLRIPTWTSSNGAKATLNGQDLPLPSPGN-FLSVTKTWSSDDK 600
+L LRIP W + A LNG+ + + + F +T+ WS DK
Sbjct: 465 KKYDFTIQLDQKKTFTLGLRIPEWIMKD-ASIYLNGELIGKTNDSSAFYKLTREWSDGDK 523
Query: 601 LTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
++I P+ +R + DD + A YGP VLAG
Sbjct: 524 VSITFPIGIRFIQLPDD----LNTGAFRYGPDVLAG 555
>gi|399033094|ref|ZP_10732120.1| hypothetical protein PMI10_04007 [Flavobacterium sp. CF136]
gi|398068528|gb|EJL59944.1| hypothetical protein PMI10_04007 [Flavobacterium sp. CF136]
Length = 1019
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 189/624 (30%), Positives = 284/624 (45%), Gaps = 104/624 (16%)
Query: 99 IKNPGQFKVPERSGEFLK--EVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRK 156
+K + PER E K +V L+D G + + L L D D ++ FR
Sbjct: 358 VKEAKETATPERKLEVFKLDQVVLNDNLDGHHTKFMENRDKFLTTLATTDPDSFLYMFRN 417
Query: 157 T--ARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNE-----SLKEKMSAVV 209
P EP G W+ +LRGH GHYL+A A +AST + + K+KM +V
Sbjct: 418 AFGQEQPKEAEPLGVWDTQETKLRGHATGHYLTAIAQAYASTGYDKTLQANFKDKMEYMV 477
Query: 210 SAL------SACQKEIGS------------------------------------GYLSAF 227
+ L S KE G G++SA+
Sbjct: 478 NTLYDLEQLSGKPKEAGGKFVSDPTAIPFGPGKTNYDSDLSAEGIRTDYWNWGKGFISAY 537
Query: 228 PTEQFDRLE-------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYN 280
P +QF LE +WAPYYT+HKILAGL+D Y + N +AL M ++ Y
Sbjct: 538 PPDQFIMLENGATYGGQKTQIWAPYYTLHKILAGLMDVYEVSGNEKALETAKGMGDWVYA 597
Query: 281 RVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLG---- 335
R++ + + I + + E GGMN+ + +L+ IT+DP +L +A LFD F G
Sbjct: 598 RMKKLPTETLISMWNRYIAGEFGGMNEAMARLYRITKDPHYLEVAQLFDNIKVFYGDANH 657
Query: 336 --LLALQADDISGFHSNTHIPIVIGS-QM--------RYEVTGDQLHKEGHQ-LESSGTN 383
LA D G H+N HIP ++G+ +M Y V + +K + + S G
Sbjct: 658 SHGLAKNVDTFRGLHANQHIPQIMGALEMYRDSNTPDYYRVADNFWYKTVNDYMYSIGGV 717
Query: 384 IGHFN------FKSDPKRLASNLDSN--TEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
G N F S P + N S+ E+C TYNMLK++ LF + + DYYER
Sbjct: 718 AGARNPANAECFISQPATIYENGFSSGGQNETCATYNMLKLTGDLFLYEQRGELMDYYER 777
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPS-DSFWCCYGTGIESFSKLG 494
L N +L P Y +PL PGS K+ +G P F CC GT IES +K
Sbjct: 778 GLYNHILSSVAENSP-ANTYHVPLRPGSVKQ-----FGNPHMTGFTCCNGTAIESNTKFQ 831
Query: 495 DSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGL 554
+SIYF+ +Y+ Y+ S L W I V Q D + + ++T+ KG+G
Sbjct: 832 NSIYFKSADN-NSLYVNLYVPSTLKWTEKNITVKQTTD--FPNEDFTKLTI----KGNG- 883
Query: 555 TTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEA 613
L +R+P W ++ G +NG+ + + PG++L++ K W D + +++P E
Sbjct: 884 KFDLKVRVPHW-ATKGFFVKINGKSEKVKAQPGSYLTLNKKWKDGDVIELRMPFQFHLEP 942
Query: 614 IQDDRPEYASIQAILYGPYVLAGH 637
+ D + +I ++ YGP +LA
Sbjct: 943 VMDQQ----NIASLFYGPILLAAQ 962
>gi|254444174|ref|ZP_05057650.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
gi|198258482|gb|EDY82790.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
Length = 788
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 163/532 (30%), Positives = 254/532 (47%), Gaps = 61/532 (11%)
Query: 136 QTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWAS 195
+ ++ Y+L D D+L+ F A L E YG WE S L GH GH+LSA A +
Sbjct: 47 EADVTYVLAHDPDRLLAPFLTAAGLEPKAEKYGNWE--SSGLDGHSAGHFLSAYATLSLQ 104
Query: 196 THNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ------------FDRLEALIPVWA 243
+ N L+E++ ++ L+ CQ IG+GYL P Q DR +L W
Sbjct: 105 SDNPLLRERLDYMLDELTRCQDAIGTGYLGGVPNSQEFTTRLFAGEIKADRF-SLNGAWV 163
Query: 244 PYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
P+Y +HK AGL D + AD+ +A + + W V K + E+ + L
Sbjct: 164 PWYNLHKTYAGLKDAWLVADSEKAKNILIALADWTVA--------ATAKLTDEQMQEMLY 215
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGS 359
E GGMN++ L+ TQD ++L LA+ F L L D ++GFH+NT IP VIG
Sbjct: 216 TEHGGMNEIFADLYLHTQDQRYLELAYRFTHHELLDPLLENQDKLTGFHANTQIPKVIGY 275
Query: 360 QMRYEVTGDQ-LHKE---------GHQLESSGTN--IGHFNFKSDPKRLASNLDSNTEES 407
Q D+ LH+ H+ S G N HF+ D + + + + E+
Sbjct: 276 QRTALAAQDEKLHQASQFFWDTVVNHRSVSIGGNSVREHFHPADDFRSMLESREG--PET 333
Query: 408 CTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKER 467
C T+NML+++ LF A DYYER+L N +L Q E G ++Y P P R
Sbjct: 334 CNTHNMLRLTTLLFEAEPTAALTDYYERALYNHILSAQH-PETGGLVYFTPQRP-----R 387
Query: 468 SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVV 527
Y + P ++FWCC G+GIE+ + + IY + +++ +++S L+W+ + +
Sbjct: 388 HYRVYSVPENAFWCCVGSGIENPGRYSEFIYAHTDD---ALFVNLFLASSLNWQEKGLRL 444
Query: 528 NQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGN 587
Q + P T + +L +R P WT ++ + TLN + + + N
Sbjct: 445 TQSTN-----FPQTASTELTIDQAPKKKLTLKIRRPAWT-TDAFQITLNDKPVKTKTNAN 498
Query: 588 -FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
+ S+T+ W + D L++ LP+ + E I D P Y + LYGP VLA +
Sbjct: 499 GYASLTRKWKTGDTLSVALPMQVHVEQIPDHSPFY----SFLYGPIVLAAKT 546
>gi|380694971|ref|ZP_09859830.1| hypothetical protein BfaeM_13572 [Bacteroides faecis MAJ27]
Length = 802
Score = 229 bits (585), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 173/562 (30%), Positives = 262/562 (46%), Gaps = 73/562 (12%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
SL DV+L S S +AQQT+L Y+L LD D+L F + A L Y WE + L
Sbjct: 29 SLQDVKLLS-SPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWE--NTGLD 85
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLE 236
GH GHYLSA ++M+A+T + ++ +++ +++ L Q+ +G+G++ P + + ++
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 237 A---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQ 283
A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ S + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDSQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPLIKDEDR 257
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQL---HKE--------------GHQLESSGTNIGH 386
++G H+NT IP VIG + EV+ D H H+ G N
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKDDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEI--------AYADYYERSLT 438
+F + D E+C TYNML++++ L++ + ++ Y DYYER+L
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 439 NGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY 498
N +L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIY 431
Query: 499 FEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSL 558
+ +Y+ +I S+L+WK + + Q+ LR+ K S +L
Sbjct: 432 AHRQDT---LYVNLFIPSQLNWKEQGVTLTQETLFPDDGKVTLRI-----DKASKKKLTL 483
Query: 559 NLRIPTWTSSNGAKA-TLNGQDLPL---PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
+RIP W S+ A T+NGQ P +L + + W D +T LP+ + E I
Sbjct: 484 MIRIPGWAGSSKDYAITINGQKKKYAIRPGVSTYLPIHRKWKKGDVITFNLPMEVSLEQI 543
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
D + Y A LYGP VLA
Sbjct: 544 PDKKDYY----AFLYGPIVLAA 561
>gi|371776971|ref|ZP_09483293.1| Acetyl-CoA carboxylase, biotin carboxylase [Anaerophaga sp. HS1]
Length = 794
Score = 229 bits (585), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 165/546 (30%), Positives = 265/546 (48%), Gaps = 51/546 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
+K L VRL DS A++ N +Y++ D D+++ F A L + YG WE
Sbjct: 31 VKSFPLSYVRL-LDSPFKHAEELNEKYVMAHDPDRILAPFLIDAGLKPKAQGYGNWE--G 87
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
L GHF GHYL++ +LM AST +E ++++ +V L+ CQK G+GY+ P Q
Sbjct: 88 SGLNGHFGGHYLTSLSLMIASTGSEEARKRLDYMVDQLARCQKANGNGYVGGIPGGQAMW 147
Query: 235 LE-----------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
E +L W P Y IHK+ AGL D + A N +A + + ++F N +
Sbjct: 148 AEIAKGNINAGNFSLNGKWVPLYNIHKLFAGLRDAWLLAQNKKAKEVLINLTDWFLNLTK 207
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
N+ ++ + L E GG+N+V ++ IT + +L LA F L L Q D
Sbjct: 208 NLTD----DQIQKMLVSEHGGLNEVFADVYDITGNENYLKLARRFSHQAILRPLLQQKDQ 263
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNIGHFNFKSDP 393
++G H+NT IP VIG E+ D ++ S G N H +F +
Sbjct: 264 LTGLHANTQIPKVIGFMRIGELAHDTAWINAADFFWNTVVQNRTVSIGGNSTHEHFHA-V 322
Query: 394 KRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV 452
+S ++S E+C TYNMLK+S+ LF + ++ Y DYYE++L N +L Q G
Sbjct: 323 DDFSSMIESRQGPETCNTYNMLKLSKQLFLFKNDLKYIDYYEQALYNHILSSQHPLHGG- 381
Query: 453 MIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
++Y + P R Y + P +FWCC G+GIE+ K G+ IY ++ VY+
Sbjct: 382 LVYFTSMRP-----RHYRVYSRPEQTFWCCVGSGIENHEKYGELIYAHDD---ENVYVNL 433
Query: 513 YISSRLDWKSGQI-VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L WK Q+ +V + P + ++T+ + + +R P WT
Sbjct: 434 FIPSILHWKEKQLKLVQENHFPDID-----KITIRVEPQ-RKTEFVVGIRCPAWTRPEDM 487
Query: 572 KATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NG+ + PG++ + + W +D + + LP+ + + D P Y S +++G
Sbjct: 488 NVLVNGKAFKGKAIPGHYFLIRRYWEKNDVIEVHLPMHTYGKFLPDGSP-YLS---LMHG 543
Query: 631 PYVLAG 636
P+VLA
Sbjct: 544 PFVLAA 549
>gi|359453850|ref|ZP_09243152.1| hypothetical protein P20495_1902 [Pseudoalteromonas sp. BSi20495]
gi|358049097|dbj|GAA79401.1| hypothetical protein P20495_1902 [Pseudoalteromonas sp. BSi20495]
Length = 816
Score = 229 bits (585), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 165/528 (31%), Positives = 257/528 (48%), Gaps = 54/528 (10%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
AQQTN+ YLL L D+L+ + + A + YG WE+ L GH GHYLS+ +L W
Sbjct: 64 AQQTNVRYLLALYPDQLLAPYLREAGIEQKAPSYGNWEDTG--LDGHIGGHYLSSLSLAW 121
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQF-----------DRLEALIPVW 242
A+T +E LK ++ +++ L Q ++ GYL P Q L +L W
Sbjct: 122 AATGDEELKRRLDYMLNELQRAQ-QVNDGYLGGIPDGQAMWQQIHDGNIKADLFSLNDRW 180
Query: 243 APYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
P Y I KI GL D Y A + +A M + E+F N + K S E+ Q L E
Sbjct: 181 VPLYNIDKIFHGLRDAYLIAGSEQAKTMLFDLGEWFLN----LTAKLSDEQIQQMLYSEY 236
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMR 362
GG+N V + I D ++L LA F + L + D ++G H+NT IP +IG
Sbjct: 237 GGLNAVFADMATIGNDKRYLKLARQFTHNNIIDPLLEKQDKLTGLHANTQIPKIIGMLKV 296
Query: 363 YEVTGDQLHKEG-----------HQLESSGTNIG-HFNFKSDPKRLASNLDSNTEESCTT 410
E + D+ ++G + G ++ HF+ K+D + +++ E+C T
Sbjct: 297 AEASDDKAWQQGADYFWQTVTKQRSVAIGGNSVSEHFHDKNDFTPMVEDVEG--PETCNT 354
Query: 411 YNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYH 470
YNM+K+S+ LF T + Y +YYER+ N +L Q E G ++Y + PG Y
Sbjct: 355 YNMMKLSKLLFLKTADTRYLEYYERATYNHILSSQH-PEHGGLVYFTSMRPG-----HYR 408
Query: 471 HWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW-KSGQIVVNQ 529
+ + DS WCC G+GIE+ SK G+ IY + + +++ +I S LDW + G V Q
Sbjct: 409 MYSSVQDSMWCCVGSGIENHSKYGEQIYSKNDDN---LWVNLFIPSTLDWQQQGLKVTQQ 465
Query: 530 KVDPVVSWDPYLRVTLTFSS--KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGN 587
+ P + +TL ++ K + L++R P+W + + LNG+ + +
Sbjct: 466 SLFPDAN-----NITLVINTLDKKHISSAQLHIRKPSWVTDE-LQFELNGKAINATAEQG 519
Query: 588 FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+ ++ W D LT L L TE + D + Y A+LYGP V+A
Sbjct: 520 YYAIKHDWHDGDNLTFTLAPKLYTEQLPDGQDYY----AVLYGPVVMA 563
>gi|256423606|ref|YP_003124259.1| hypothetical protein Cpin_4617 [Chitinophaga pinensis DSM 2588]
gi|256038514|gb|ACU62058.1| protein of unknown function DUF1680 [Chitinophaga pinensis DSM
2588]
Length = 1025
Score = 229 bits (585), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 186/633 (29%), Positives = 291/633 (45%), Gaps = 108/633 (17%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKT--ARLPAPGEPYGGWEE 172
L +V+L + G ++ + + L D + ++ FR + P +P W+
Sbjct: 382 LGQVALKNDAHGHETQFVENRDKFIRTLATTDPNSFLYMFRHAFGRQQPEGAKPLDVWDS 441
Query: 173 PSCELRGHFVGHYLSASALMWASTH-----NESLKEKMSAVV------SALSACQKEIGS 221
+LRGH GHYL+A A +AST ++ ++KM+ +V S LS KE G
Sbjct: 442 QDTKLRGHATGHYLTAIAQAYASTGYDKTLQQNFEQKMAYMVNTLYELSLLSGNPKETGG 501
Query: 222 ------------------------------------GYLSAFPTEQFDRLEALIP----- 240
G++SA+P +QF LE
Sbjct: 502 VAVSDPTAVPYGPGKSGYDSDLSNEGIRNDYWNWGKGFISAYPPDQFIMLEKGAKYGGQK 561
Query: 241 --VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT- 297
+WAPYYT+HKILAGL+D Y + N +AL + T M ++ Y R+ +V + ++ + W T
Sbjct: 562 NQIWAPYYTLHKILAGLMDVYEVSGNQKALTVATGMGDWVYARLSHVPQD-TLIKMWNTY 620
Query: 298 LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLG------LLALQADDISGFHSN 350
+ E GGMN+ + +L+ IT ++L A LFD F G LA D G H+N
Sbjct: 621 IAGEFGGMNEAMARLYLITGKQQYLQTAQLFDNIRVFFGDTAHSHGLAKNVDIFRGLHAN 680
Query: 351 THIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFN------FKSDPK 394
HIP ++GS Y + + + + + S G G N F S P
Sbjct: 681 QHIPQIVGSIEMYRASNNPEYYKIADNFWYKAVNDYMYSIGGVAGARNPANAECFISQPA 740
Query: 395 RLASNLDSN--TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV 452
L N S+ E+C TYNMLK++ LF + + + DYYER+L N +L P
Sbjct: 741 TLYENGFSSGGQNETCATYNMLKLTSDLFLFDQRAEFMDYYERALYNHILASVAKDNP-A 799
Query: 453 MIYLLPLAPGSSKERSYHHWGTPS-DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
Y +PL PG+ K+ +G P F CC GT IES +KL ++IYF+ +Y+
Sbjct: 800 NTYHVPLRPGAIKQ-----FGNPDMTGFTCCNGTAIESNTKLQNTIYFKSRDN-QALYVN 853
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
YI S L W + + Q D D L + KG+G +N+R+P W ++ G
Sbjct: 854 LYIPSTLQWTERNVTIEQTTDFPKEDDTRLTI------KGNG-QFDINVRVPGW-ATKGF 905
Query: 572 KATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NG++ L + PG +L++ + W D + +++P + + D + +I ++ YG
Sbjct: 906 FVKINGKEQALTAKPGTYLTIRRQWKDGDIIDLKMPFRFHLDPVMDQQ----NIASLFYG 961
Query: 631 PYVLA---GHSIGDW-DITESATSLSDWITPIP 659
P +LA G + DW IT +A +S I P
Sbjct: 962 PILLAAQEGEARKDWRKITLNADDISKSIKGDP 994
>gi|265753023|ref|ZP_06088592.1| acetyl-CoA carboxylase [Bacteroides sp. 3_1_33FAA]
gi|263236209|gb|EEZ21704.1| acetyl-CoA carboxylase [Bacteroides sp. 3_1_33FAA]
Length = 797
Score = 229 bits (584), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 178/585 (30%), Positives = 279/585 (47%), Gaps = 73/585 (12%)
Query: 96 YRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFR 155
Y +++ + VP L EV L +DS +A + YLL LDVD+L+ + R
Sbjct: 25 YEQVRKAPRVHVPVWQSFALSEVEL------TDSYFKKAMDLHKGYLLSLDVDRLIPHVR 78
Query: 156 KTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSAC 215
++ L G+ YGGWE+ G GHY+SA A+M+AST ++L +K++ ++ L C
Sbjct: 79 RSVGLQGKGDNYGGWEKHG----GCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQEC 134
Query: 216 QKEIGSGYLSAFPTEQFDRLEAL---IPVWAP---------------YYTIHKILAGLLD 257
QK+ G+ + L+ L + + P +Y IHKILAGL D
Sbjct: 135 QKQTPDGWFITGKRGKEGYLQLLQGNVVLNQPDETGQPWNYNQNGNSWYCIHKILAGLRD 194
Query: 258 QYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQ 317
Y YA +A + + ++ + ++ + + TL+ E GGMN+V ++ IT
Sbjct: 195 AYVYAGCRQAKDILMPLADF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITG 250
Query: 318 DPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG--- 374
D K L A F+ + +A D + G H+N IP +G YE + + ++ +
Sbjct: 251 DKKFLQTAERFNHINVIYPIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARN 310
Query: 375 --------HQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE 426
H L G + + P + LD + E+C TYNMLK+SR LF +
Sbjct: 311 FWNIVIKDHTLAIGGNSC--YERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGD 368
Query: 427 IAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTG 486
Y +YYE +L N +L Q PG + Y L PGS K+ S TP DSFWCC GTG
Sbjct: 369 YKYLNYYEHALYNHILASQDPDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTG 423
Query: 487 IESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYL----R 542
+E+ SK +SIYF++ + + + YI SRL WK + ++ D Y
Sbjct: 424 MENHSKYAESIYFKDNQE---LLVNLYIPSRLHWKEKGL--------KLTLDTYFPESDT 472
Query: 543 VTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKL 601
VT+ GS T +L R P W S + A +NG+ + G+++ + + S D +
Sbjct: 473 VTVRMDEIGS-YTGTLLFRYPDWVSGD-AVVRINGEPAQTEAHKGSYIRLLDSVKSGDVI 530
Query: 602 TIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE 646
T+ L + +D+ P + S ++YGP +LAG +G D+ E
Sbjct: 531 TLVFTRNLYIDYAKDE-PHFGS---VMYGPILLAG-GLGTDDMPE 570
>gi|319786479|ref|YP_004145954.1| hypothetical protein Psesu_0871 [Pseudoxanthomonas suwonensis 11-1]
gi|317464991|gb|ADV26723.1| protein of unknown function DUF1680 [Pseudoxanthomonas suwonensis
11-1]
Length = 806
Score = 229 bits (584), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 173/554 (31%), Positives = 272/554 (49%), Gaps = 54/554 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
L+ L DVRLG D R+ NL YL LD D+L+ FR A LP+P Y WE S
Sbjct: 35 LQAFPLEDVRLG-DGAFARSSALNLRYLAALDPDRLLAPFRIEAGLPSPAPKYPNWE--S 91
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--F 232
L GH GHYLSA A A+ + ++ ++ +V+ALS Q G GY+ P + +
Sbjct: 92 MGLDGHTAGHYLSALAQQ-AAQGSAGMRRRLDYMVAALSQVQAANGDGYVGGVPNGRVLW 150
Query: 233 DRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
+R+ + L W P+Y +HK AGL D + A NA+A + ++ V
Sbjct: 151 NRIASGDFQAESFSLEGAWVPFYNLHKTYAGLRDAWLLAGNAQARDVLVRFADWAGALVA 210
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
N + ++R L+ E GGMN+VL ++ IT D ++L LA F L L + D
Sbjct: 211 N-LDDTQLQR---VLDTEHGGMNEVLADVYAITGDRRYLALARRFSHRAILDPLLRREDR 266
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGD-------QLHKEGHQLESSGTNIG-----HFNFKS 391
+ G H+NT IP VIG E+ GD Q E L S G HFN
Sbjct: 267 LDGLHANTQIPKVIGFARIGELDGDVEWIEAAQFFWERVALHRSIAFGGNSTREHFNPAD 326
Query: 392 DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
D + ++ + E+C +YNML+++ L R + +AD+YER+L N +L Q + G
Sbjct: 327 DFSGMIASREG--PETCNSYNMLRLTLLLERLRPDPRHADFYERALFNHILSTQH-PDHG 383
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
++Y P+ P R Y + P + FWCC G+G+E+ + G Y +E + +
Sbjct: 384 GLVYFTPIRP-----RHYRVYSQPQECFWCCVGSGMENHGRHGAFAYTHDESS---LRVN 435
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
Y+ S L W+ +V+ Q+ + R L ++ + +L LR P W +
Sbjct: 436 LYLDSELHWRERGLVLRQR----TRFPEEPRSVLEVATPRPQV-FALELRHPHWLAGP-L 489
Query: 572 KATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+ LNG+ P+ SP ++ + + W D++ ++LP++ R E++ P+ + A+++G
Sbjct: 490 RVKLNGRRWPVESSPSSYARIERQWQDGDRIEVELPMSTRIESL----PDGSDWVAVMHG 545
Query: 631 PYVLAGHSIGDWDI 644
P +LA S G+ DI
Sbjct: 546 PLMLAARS-GEEDI 558
>gi|302670053|ref|YP_003830013.1| glycoside hydrolase [Butyrivibrio proteoclasticus B316]
gi|302394526|gb|ADL33431.1| glycoside hydrolase [Butyrivibrio proteoclasticus B316]
Length = 780
Score = 229 bits (584), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 177/558 (31%), Positives = 269/558 (48%), Gaps = 67/558 (12%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEP 173
LKE L V + +D A ++ YL LD ++L+ F + A L Y GWE
Sbjct: 1 MLKEFDLTQVCV-NDEYCANALNKDVAYLKSLDPERLLAGFYENAGLTPKKIRYSGWE-- 57
Query: 174 SCELRGHFVGHYLSASALMWAS--THNESLK---EKMSAVVSALSACQKE--------IG 220
+ + GH +GHYL+A+A +A+ T E K + + +V L CQ+ G
Sbjct: 58 NMLIGGHTLGHYLTAAAQGYANPGTRKEDKKALFDIIKTLVDGLLECQEHSQGKKGFVFG 117
Query: 221 SGYLSAFPTE-QFDRLE-----ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
+ + + E QFD +E + W P+YT+HKIL GL+ + + AL++ +
Sbjct: 118 AIIMDSNNVELQFDHVEHGRTNIITESWVPWYTMHKILDGLVSTFVFTGYEPALKVAEGI 177
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
++ YNR +S E H L+ E GGMND LYKL+ +T +HL AH FD+
Sbjct: 178 GDWTYNRASG----WSEETHKTVLSIEYGGMNDALYKLYRLTGKKEHLEAAHAFDEEELF 233
Query: 335 GLLAL-QADDISGFHSNTHIPIVIGSQMRYEVTGD-------------QLHKEGHQLESS 380
+A A+ ++ H+NT IP +G+ RY GD + E H +
Sbjct: 234 KKVATGDANVLNNRHANTTIPKFLGALQRYMTLGDVAGEYLTYVQKFWDMVVERHTYATG 293
Query: 381 G-TNIGHF--NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSL 437
G + HF +F D +R N E+C TYNMLK+SR LFR T + YADYYE +
Sbjct: 294 GNSEWEHFGEDFVLDAERTNCN-----NETCNTYNMLKMSRDLFRITGDKKYADYYENTF 348
Query: 438 TNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSI 497
N +L Q E G+ +Y P+A G Y +GTP D FWCC GTG+E+F+KL DSI
Sbjct: 349 INAILSSQN-PESGMTMYFQPMATGY-----YKVYGTPFDKFWCCTGTGMENFTKLNDSI 402
Query: 498 YFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTS 557
YF ++ V + YISS + ++ + QK S P L + + T
Sbjct: 403 YFLDD---ESVIVNMYISSVVCDSKKKLTLTQK-----SLIPKGNTALFTINLEEPVKTK 454
Query: 558 LNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDD 617
L R+P W + KA +G+ + G + +V +T++ D Q+ ++ +
Sbjct: 455 LRFRVPDWAVNATCKALSSGKTYQAEADG-YFTVEETFNDGD----QIEISFEMHTVVKR 509
Query: 618 RPEYASIQAILYGPYVLA 635
P+ ++ A YGP +L+
Sbjct: 510 LPDCENVFAFKYGPVLLS 527
>gi|220928430|ref|YP_002505339.1| hypothetical protein Ccel_0997 [Clostridium cellulolyticum H10]
gi|219998758|gb|ACL75359.1| protein of unknown function DUF1680 [Clostridium cellulolyticum
H10]
Length = 597
Score = 229 bits (583), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 159/554 (28%), Positives = 270/554 (48%), Gaps = 66/554 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
+ +L ++L SD ++T +Y+ D+++L+ FRK A + + EP GGWE
Sbjct: 2 FENFNLDKIKL-SDKYFSVRRETAKKYVNDFDINRLMHTFRKNAGIESLAEPLGGWESEE 60
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
C LRGHFVGH+LSA + S +++ LK K +V ++ C E +GYLSAF E D
Sbjct: 61 CNLRGHFVGHFLSACSKFAFSDNDDCLKTKADNIVKIMAECASE--NGYLSAFGEEMLDI 118
Query: 235 LEALIP--VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIE 292
LE VWAPYYT+HKIL GL+D Y + +N AL + + Y R + +
Sbjct: 119 LETEEDRGVWAPYYTLHKILQGLVDCYLFLNNKTALSLAVNLAHYIRRRFERL------- 171
Query: 293 RHWQT--------LN--EEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQAD 342
+W+T +N E GG+ DVLY L+ IT D K LA +F++ F+G LA D
Sbjct: 172 SYWKTDGILRCTRVNPVNEFGGIGDVLYSLYEITGDRKIFDLADIFNRDYFIGNLAADRD 231
Query: 343 DISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ-----------------LESSGTNIG 385
+ H+NTH+P+VI + R+ +TG+ +K Q +++ G
Sbjct: 232 VLEDLHANTHLPMVISAIHRFNLTGEYKYKHAAQNFYKYLLGRTFVNGNSSSKATSFKKG 291
Query: 386 HFNFKSD----PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGV 441
+ KS+ L ++L ESC +N K+ + LF WT++ + ++ E N V
Sbjct: 292 EVSEKSEHWGAHNHLENSLTGGESESCCAHNTEKIVQQLFAWTEDERFLEHLEILKYNAV 351
Query: 442 LGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
L T G+ Y P+ G K ++ D+FWCC GTGIE+ S++ +I+F++
Sbjct: 352 LN-STSTVTGLSQYQQPMGTGVKK-----NFSGLFDTFWCCTGTGIEAMSEIQKNIWFKD 405
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
+ + + +I+S + W + + Q P V++ S + ++ +L LR
Sbjct: 406 KDT---LLLNMFIASTVQWDEKNVKIVQNTAY-----PDNTVSVLTVSTSNPVSFTLMLR 457
Query: 562 IPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
S +NG+ + ++ + + ++++D + I++ +L ++ +
Sbjct: 458 -----KSQVKSVKINGKSFNFIADNGYIYIKRIFNNNDTIEIEIDSSLHLIQLKGSENK- 511
Query: 622 ASIQAILYGPYVLA 635
A++Y +LA
Sbjct: 512 ---AAVMYDRILLA 522
>gi|317057297|ref|YP_004105764.1| hypothetical protein Rumal_2655 [Ruminococcus albus 7]
gi|315449566|gb|ADU23130.1| protein of unknown function DUF1680 [Ruminococcus albus 7]
Length = 602
Score = 229 bits (583), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 169/547 (30%), Positives = 270/547 (49%), Gaps = 60/547 (10%)
Query: 136 QTNLEYLLMLDVDKLVWNF----------RKTARLPAPGEPYGGWEEPSCELRGHFVGHY 185
+ N YL LD L+ N R+ P E + GWE P+C+LRGHF+GH+
Sbjct: 22 ELNKRYLKELDTVCLMQNHYLEAGIILPDRQVISEPEKAELHWGWESPACQLRGHFLGHW 81
Query: 186 LSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPY 245
+SA+A++ AS + L+ K+ +V L CQ+ G ++ + P + F +E+ +W+P
Sbjct: 82 MSAAAMLSASDGDAELRAKLVKIVDELERCQQRNGGKWVGSIPEKYFKLMESEEYIWSPQ 141
Query: 246 YTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGM 305
YT+HK L GL+D Y +A +AL + + +++ +V K E GGM
Sbjct: 142 YTMHKTLMGLVDAYRFAGIQKALDIADRLADWYIEWAASVEKTAPF----TVFKGEQGGM 197
Query: 306 NDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEV 365
+ L+ +T DPK+ L ++ + L + ++ H+N IP+ G+ Y++
Sbjct: 198 LEEWCILYELTNDPKYRKLMDIYRENGLYHKLEQHREALTDDHANASIPLSHGAARMYDI 257
Query: 366 TGDQLHK------------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNM 413
TG++ K E ++G N G F P + S L +E CT YNM
Sbjct: 258 TGEERWKIITDEFWRQAVTERGMFATTGANSGEFWVP--PHSMGSYLGDTDQEFCTVYNM 315
Query: 414 LKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWG 473
++++ L+R T + YADY ER+L NG L Q+ G+ Y LPL+ GS K+ WG
Sbjct: 316 VRLADFLYRRTGDTVYADYIERALYNGFLA-QQNMHSGMPAYFLPLSSGSRKK-----WG 369
Query: 474 TPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS--RLDWKSGQIVVNQ-- 529
+ FWCC+GT +++ + I++ E+ + + QYI S LD +I V+Q
Sbjct: 370 SKRHDFWCCHGTMVQAQTLYPQLIWYTEDST---LTVAQYIPSEAELDIGGKKIKVSQCT 426
Query: 530 ---KVDPVVSWD-----PYLRVTLTFSSKGSGLT-TSLNLRIPTWTSSNGAKATLNGQDL 580
++ V +D R ++ F K T +L LR+P W + + ++G +
Sbjct: 427 ELKNLNNQVFFDEDEGGEKSRWSIRFDIKCDEPTFFTLWLRMPKWLNGR-PQLIIDGGSV 485
Query: 581 PLPSPGNFLSVTKTWSSDDKLTIQLPL--TLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
N+L++++TW +D TIQL L TL TE + D PE A A+L GP VLAG +
Sbjct: 486 QADIADNYLTISRTWHND---TIQLLLIPTLYTEPLA-DMPETA---ALLDGPIVLAGMT 538
Query: 639 IGDWDIT 645
D IT
Sbjct: 539 DKDAGIT 545
>gi|315498357|ref|YP_004087161.1| hypothetical protein Astex_1338 [Asticcacaulis excentricus CB 48]
gi|315416369|gb|ADU13010.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
48]
Length = 797
Score = 229 bits (583), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 175/586 (29%), Positives = 275/586 (46%), Gaps = 74/586 (12%)
Query: 116 KEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC 175
+ + L VRL S A + N YLL L D+ ++N+ K A +P GE YGGWE S
Sbjct: 39 RPIPLTQVRL-LPSPFLEAVEANRRYLLFLSPDRFLYNYHKFAGMPVKGEIYGGWE--SD 95
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF-------- 227
+ G +GHYLSA +LM A T + ++ ++S L Q G GY++ F
Sbjct: 96 TIAGEGLGHYLSALSLMHAQTGDNECVARIHYIISELEKVQAAHGDGYVAGFMRKRKDGS 155
Query: 228 ---PTEQFDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMV 275
E F + A L W P+Y HK+ AGLLD Y + + +
Sbjct: 156 IVDGKEIFPEIMAGDIRSAGFDLNGCWVPFYNWHKLFAGLLDAQAYCGVDRGIPVAEKLG 215
Query: 276 EYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
Y ++ V + + L+ E GG+N+ +L+ T +P+ L L+ L
Sbjct: 216 GY----IEMVFAALDDAQTQKVLDCEHGGINESFAELYSRTNNPRWLKLSERLYHHRMLD 271
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESS---GTNIGHFNFKS- 391
LA + D ++ H+NT +P +IG YE+T K +Q SS + H +F
Sbjct: 272 PLAAREDKLANNHANTQVPKLIGLARLYELT----QKPQYQTASSFFWERVVNHHSFVIG 327
Query: 392 ---------DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL 442
+P +++++ T ESC TYNMLK++RHL+ W+ + A+ DYYER+ N +L
Sbjct: 328 GNADREYFFEPDTISAHITEQTCESCNTYNMLKLTRHLYSWSPKAAWFDYYERAHLNHML 387
Query: 443 GIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEE 502
Q + G+ Y++PL G+++ S +SFWCC +GIE+ SK GDSIY+ +E
Sbjct: 388 AHQN-PKTGMFTYMMPLMSGAARGFS-----DEENSFWCCVLSGIETHSKHGDSIYWHQE 441
Query: 503 GKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYL-RVTLTFSSKGSGLTTSLNLR 561
+++ +I S+++W + + + PY +V L S T ++ +R
Sbjct: 442 KT---LFVNLFIPSKVNWAEQKAAFE-----LTTKYPYEGQVALKLSQLSGAKTFTVAVR 493
Query: 562 IPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
IP W ++ + +NG+ + +T+ W + D +T+ LPL LR E D
Sbjct: 494 IPGWAEASTLQ--VNGKPALAKMNDGYALITRKWRAGDVVTLDLPLKLRFETAAGDN--- 548
Query: 622 ASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLI 667
+ A+L GP VLA +G D W PA S LI
Sbjct: 549 -KVVALLRGPMVLAA-DLGPAD--------QPWGGDAPALVGSDLI 584
>gi|212695367|ref|ZP_03303495.1| hypothetical protein BACDOR_04914 [Bacteroides dorei DSM 17855]
gi|212662096|gb|EEB22670.1| hypothetical protein BACDOR_04914 [Bacteroides dorei DSM 17855]
Length = 807
Score = 229 bits (583), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 178/585 (30%), Positives = 278/585 (47%), Gaps = 73/585 (12%)
Query: 96 YRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFR 155
Y +++ + VP L EV L +DS +A + YLL LDVD+L+ + R
Sbjct: 35 YEQVRKAPRVHVPVWQSFALSEVEL------TDSYFKKAMDLHKGYLLSLDVDRLIPHVR 88
Query: 156 KTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSAC 215
++ L G+ YGGWE+ G GHY+SA A+M+AST ++L +K++ ++ L C
Sbjct: 89 RSVGLQGKGDNYGGWEKHG----GCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQEC 144
Query: 216 QKEIGSGYLSAFPTEQFDRLEAL---IPVWAP---------------YYTIHKILAGLLD 257
QK+ G+ + L+ L + + P +Y IHKILAGL D
Sbjct: 145 QKQTPDGWFITGKRGKEGYLQLLQGNVVLNQPDETGQPWNYNQNGNSWYCIHKILAGLRD 204
Query: 258 QYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQ 317
Y YA +A + + ++ + ++ + + TL+ E GGMN+V ++ IT
Sbjct: 205 AYVYAGCRQAKDILMPLADF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITG 260
Query: 318 DPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG--- 374
D K L A F+ + +A D + G H+N IP +G YE + + ++ +
Sbjct: 261 DKKFLQTAERFNHINVIYPIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARN 320
Query: 375 --------HQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE 426
H L G + + P + LD + E+C TYNMLK+SR LF +
Sbjct: 321 FWNIVIKDHTLAIGGNSC--YERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGD 378
Query: 427 IAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTG 486
Y +YYE +L N +L Q PG + Y L PGS K+ S TP DSFWCC GTG
Sbjct: 379 YKYLNYYEHALYNHILASQDPDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTG 433
Query: 487 IESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYL----R 542
+E+ SK +SIYF++ + + + YI SRL WK + ++ D Y
Sbjct: 434 MENHSKYAESIYFKDNQE---LLVNLYIPSRLHWKEKGL--------KLTLDTYFPESDT 482
Query: 543 VTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKL 601
VT+ GS T L R P W S + A +NG+ + G+++ + + S D +
Sbjct: 483 VTVRMDEIGS-YTGMLLFRYPDWVSGD-AVVRINGKPAQTEAHKGSYIRLLDSVKSGDVI 540
Query: 602 TIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE 646
T+ L + +D+ P + S ++YGP +LAG +G D+ E
Sbjct: 541 TLVFTRNLYIDYAKDE-PHFGS---VMYGPILLAG-GLGTDDMPE 580
>gi|345513939|ref|ZP_08793454.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
gi|423241465|ref|ZP_17222578.1| hypothetical protein HMPREF1065_03201 [Bacteroides dorei
CL03T12C01]
gi|229435753|gb|EEO45830.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
gi|392641358|gb|EIY35135.1| hypothetical protein HMPREF1065_03201 [Bacteroides dorei
CL03T12C01]
Length = 797
Score = 228 bits (582), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 178/585 (30%), Positives = 278/585 (47%), Gaps = 73/585 (12%)
Query: 96 YRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFR 155
Y +++ + VP L EV L +DS +A + YLL LDVD+L+ + R
Sbjct: 25 YEQVRKAPRVHVPVWQSFALSEVEL------TDSYFKKAMDLHKGYLLSLDVDRLIPHVR 78
Query: 156 KTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSAC 215
++ L G+ YGGWE+ G GHY+SA A+M+AST ++L +K++ ++ L C
Sbjct: 79 RSVGLQGKGDNYGGWEKHG----GCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQEC 134
Query: 216 QKEIGSGYLSAFPTEQFDRLEAL---IPVWAP---------------YYTIHKILAGLLD 257
QK+ G+ + L+ L + + P +Y IHKILAGL D
Sbjct: 135 QKQTPDGWFITGKRGKEGYLQLLQGNVVLNQPDETGQPWNYNQNGNSWYCIHKILAGLRD 194
Query: 258 QYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQ 317
Y YA +A + + ++ + ++ + + TL+ E GGMN+V ++ IT
Sbjct: 195 AYVYAGCRQAKDILMPLADF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITG 250
Query: 318 DPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG--- 374
D K L A F+ + +A D + G H+N IP +G YE + + ++ +
Sbjct: 251 DKKFLQTAERFNHINVIYPIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARN 310
Query: 375 --------HQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE 426
H L G + + P + LD + E+C TYNMLK+SR LF +
Sbjct: 311 FWNIVIKDHTLAIGGNSC--YERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGD 368
Query: 427 IAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTG 486
Y +YYE +L N +L Q PG + Y L PGS K+ S TP DSFWCC GTG
Sbjct: 369 YKYLNYYEHALYNHILASQDPDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTG 423
Query: 487 IESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYL----R 542
+E+ SK +SIYF++ + + + YI SRL WK + ++ D Y
Sbjct: 424 MENHSKYAESIYFKDNQE---LLVNLYIPSRLHWKEKGL--------KLTLDTYFPESDT 472
Query: 543 VTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKL 601
VT+ GS T L R P W S + A +NG+ + G+++ + + S D +
Sbjct: 473 VTVRMDEIGS-YTGMLLFRYPDWVSGD-AVVRINGKPAQTEAHKGSYIRLLDSVKSGDVI 530
Query: 602 TIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE 646
T+ L + +D+ P + S ++YGP +LAG +G D+ E
Sbjct: 531 TLVFTRNLYIDYAKDE-PHFGS---VMYGPILLAG-GLGTDDMPE 570
>gi|433676676|ref|ZP_20508761.1| hypothetical protein BN444_00795 [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430818203|emb|CCP39076.1| hypothetical protein BN444_00795 [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 807
Score = 228 bits (581), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 171/551 (31%), Positives = 266/551 (48%), Gaps = 61/551 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
LK+V+L S+ + QTN YLL L+ D+L+ NF + A LP GE YGGWE +
Sbjct: 65 LKQVTL------KPSLFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGEVYGGWEGDT 118
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA A M A T + +L++++ +V+ L+ Q + GY+ +
Sbjct: 119 --IAGHTLGHYLSALAKMHAQTRDAALRQRIDYIVAELARAQAKDPDGYVGGLTRKNDKG 176
Query: 232 --------FDRLEALI---------PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
F+ + I W+P YT+HK+ AGLLD + A NA+AL++ +
Sbjct: 177 AIDNGKLVFEEVRRGIIKGSKFNLNGSWSPLYTVHKLFAGLLDAHELAGNAQALQVLLPL 236
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y + V + L+ E GG+N+ +L T DP+ + L +
Sbjct: 237 AGY----LGGVFDALDHAQMQALLDTEFGGLNESYIELGARTGDPRWIALGKRLRHEKVI 292
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNI 384
A D++ H+NT +P IG ++EV GD GH G N
Sbjct: 293 DPAAAGRDELPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTGHYSYVIGGNA 352
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
F+ +P +A+ L T E C +YNMLK++RHL++WT + Y DYYER+L N +
Sbjct: 353 DREYFQ-EPDTIAAFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAA 411
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q G+ Y+ P+ G ER + DSFWCC G+G+E+ ++ GDSIY+++
Sbjct: 412 QH-PATGMFTYMTPMIGGG--ERGF---SDKFDSFWCCVGSGMEAHAQFGDSIYWQDAAS 465
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
+Y+ YI S LDW + + ++D V + +R+ L + G+ L LR+P
Sbjct: 466 ---LYVNLYIPSTLDWPERDLAL--ELDSGVPDNGKVRLQLRCA--GARTPRRLLLRLPA 518
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W G LNG+ + +L++ + W S D + + L + LR E D A
Sbjct: 519 WC-QGGYTLRLNGKAQRGTAADGYLALERRWRSGDMIELDLAMPLRLEHAAGD----ADT 573
Query: 625 QAILYGPYVLA 635
++ GP LA
Sbjct: 574 VVVMRGPLALA 584
>gi|285018715|ref|YP_003376426.1| hypothetical protein XALc_1948 [Xanthomonas albilineans GPE PC73]
gi|283473933|emb|CBA16434.1| conserved hypothetical protein [Xanthomonas albilineans GPE PC73]
Length = 810
Score = 228 bits (581), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 167/531 (31%), Positives = 252/531 (47%), Gaps = 57/531 (10%)
Query: 136 QTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWAS 195
QTN YLL L+ D+L+ NF + A LP G YGGWE + + GH +GHYLSA + M A
Sbjct: 82 QTNRRYLLELEPDRLLHNFLQYAGLPPKGAVYGGWEGDT--IAGHTLGHYLSALSKMHAQ 139
Query: 196 THNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD--RLEALIPV------------ 241
T + SL+ ++ +V+ L+ Q + GY+ F T + D ++E V
Sbjct: 140 TRDSSLRTRIDYIVAELARAQAQDPDGYVGGF-TRKNDNGKIEGGKAVLEDLRRGIIKGG 198
Query: 242 -------WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
W+P YT HK+ AGLLD + NA+AL + + YF V +
Sbjct: 199 KFNLNGSWSPLYTQHKLFAGLLDAHALGGNAQALTVLVKVAGYF----AGVFDALDHAQM 254
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
L+ E GG+N+ +L T + + + + LA D + H+NT +P
Sbjct: 255 QTLLDTEFGGLNESFIELGARTGQERWIAIGKRLRHEKIIDPLAAGHDVLPHIHANTQVP 314
Query: 355 IVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNT 404
IG ++EV GD H G N F+ +P +A L T
Sbjct: 315 KFIGEARQFEVAGDADAAAAARFFWETVTAHYSYVIGGNSDREYFQ-EPDSIAGFLTEQT 373
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSS 464
E C +YNMLK++RHL++WT + Y DYYER+L N + Q G+ Y+ P+ G
Sbjct: 374 CEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAAQHPAT-GMFTYMTPMISGG- 431
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER + DSFWCC G+G+E+ ++ GD+IY+++E +Y+ YI SRLDW
Sbjct: 432 -ERGFSE---KFDSFWCCVGSGMEAHAQFGDAIYWQDEA---ALYVNLYIPSRLDWSERD 484
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS 584
+ + ++D V + +V L G+ L LR+P W + LNG+ L
Sbjct: 485 LAL--ELDSGVPENG--KVRLQVLRAGARAPRRLLLRVPAWCQGS-YTLRLNGKPLRRTP 539
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+L++ + W S D + ++L LR E D PE ++ GP LA
Sbjct: 540 IDGYLALERDWRSGDVIELELATPLRLEHAAGD-PESV---VVMRGPLALA 586
>gi|262405235|ref|ZP_06081785.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|345508054|ref|ZP_08787694.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
gi|229444700|gb|EEO50491.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
gi|262356110|gb|EEZ05200.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
Length = 801
Score = 228 bits (581), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 192/710 (27%), Positives = 322/710 (45%), Gaps = 89/710 (12%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEP 173
+ E + DV+L D + A++ N+E LL DVD+L+ +RK A L + Y W+
Sbjct: 27 YKNEFPIADVKL-LDGVFKHARELNIEVLLKYDVDRLLAPYRKEAGLTERKKTYPNWDG- 84
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSAC-------QKEIGSGYLSA 226
L GH GHYLSA ++ +A+T N+ +M ++S L C E GY+
Sbjct: 85 ---LDGHVAGHYLSAMSMNYAATGNKECGRRMEYMISELQLCLEANAINNTEWAIGYIGG 141
Query: 227 FPTEQ-----FDRLEALI--PVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMV 275
FP + F + + I WAP+Y +HK+ AGL D + Y +N +A L+ W +
Sbjct: 142 FPNSKNLWSTFKKGDLRIYNSAWAPFYNLHKMYAGLRDAWLYCNNKQAKTLFLKFCDWAI 201
Query: 276 EYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
++ + E+ L E GGMN++L + IT + K+L+ A + + L
Sbjct: 202 --------SITDDLNEEQMQTVLKMEYGGMNEILADAYQITGNKKYLVAAKRYSQNILLD 253
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIG 385
L+ D++ H+NT IP IG E++GD + G++ + G N
Sbjct: 254 PLSQGIDNLDNKHANTQIPKFIGFARIAELSGDTKYTNASRFSWETITGNRSLAFGGNSR 313
Query: 386 HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ 445
+F S D + ESC +YNMLK++ LFR YADYYER++ N +L Q
Sbjct: 314 REHFPSVTSCSDYINDVDGPESCNSYNMLKLTEDLFRMQPSAHYADYYERTVFNHILSTQ 373
Query: 446 RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
E G +Y S++ R Y + P+++ WCC GTG+E+ SK IY +
Sbjct: 374 H-PEHGGYVYFT-----SARPRHYRVYSAPNEAMWCCVGTGMENHSKYNQFIYTHSDD-- 425
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPY-LRVTLTFSSKGSGLTTSLNLRIPT 564
+++ +I+S L+WK+ +I + Q+ + PY R LT + S L +R P
Sbjct: 426 -SLFVNLFIASELNWKNKKISLRQETNF-----PYEERTKLTVTKASSPF--KLMIRYPG 477
Query: 565 WTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYAS 623
W K ++NG+ + + P +++ + + W+ D + ++LP+ E + P +
Sbjct: 478 WVDKGALKVSVNGKSMNYSALPSSYICIDRKWNKGDVVEVELPMRSTIEHL----PNVPN 533
Query: 624 IQAILYGPYVLAGHSIGDWDITESATSLSDW-------ITPIPAS----------YNSQL 666
A ++GP +L G G D+ W + P+ + S+L
Sbjct: 534 YIAFMHGP-ILLGAKTGTEDLRGLIAGDGRWGQYPSGKLLPVDQAPILIVDDMENITSKL 592
Query: 667 ITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLIL-NDSSGSEFSSLNDFIG 725
+ E + K + +N SI ++ P + A + + L L N + SL+
Sbjct: 593 VPIKNEPLHFKANIKAAN-SIDIKLEPFANIHDARYMMYWLTLTNKGYQTYIDSLSTIEK 651
Query: 726 KSVMLEP----FDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDG 771
+ ++LE F +PG Q ETD +++ S + F A +G
Sbjct: 652 EKIILEKLTVDFVAPGEQ--QPETDHKILQEKSRTGNANQQFFREASSEG 699
>gi|430751026|ref|YP_007213934.1| hypothetical protein Theco_2852 [Thermobacillus composti KWC4]
gi|430734991|gb|AGA58936.1| hypothetical protein Theco_2852 [Thermobacillus composti KWC4]
Length = 621
Score = 228 bits (581), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 158/562 (28%), Positives = 255/562 (45%), Gaps = 82/562 (14%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFR----KTARLPAPGEPYGGWEEPSCELRGHFVGHYLSA 188
R +Q N YL+ L+ D L++N+R + + P +GGWE P C+LRGHF+GH+LSA
Sbjct: 18 RREQANRAYLMKLNSDSLLFNYRLEAGRYSGREIPPWAHGGWESPVCQLRGHFLGHWLSA 77
Query: 189 SALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTI 248
+A+ + +T + LK K ++ L+ CQK+ G + P + + A +WAP Y +
Sbjct: 78 AAIHYHATGDAELKAKADGIIDELAECQKDNGGQWAGPIPEKYLHWIAAGKAIWAPQYNL 137
Query: 249 HKILAGLLDQYTYADNAEAL----RMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGG 304
HK+ GL+D + YA N +AL R W VE+ +++ ++ L+ E GG
Sbjct: 138 HKLFMGLVDSFQYAGNQKALDIADRFADWFVEW--------SGRFTRDQFDDILDVETGG 189
Query: 305 MNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYE 364
M +V L IT + K+ L + + L D ++ H+NT IP V+G YE
Sbjct: 190 MLEVWADLLHITGNGKYKTLLERYYRGRLFQPLLEGKDPLTNMHANTTIPEVLGCARAYE 249
Query: 365 VTGDQLH------------KEGHQLESSGTNIGHFNFKSDPK-RLASNLDSNTEESCTTY 411
VTGD E L + G G PK ++ + L +E CT Y
Sbjct: 250 VTGDSRWMDVVKAYWNCAVTERGFLATGGQTSGEVWM---PKMKMKARLGDKNQEHCTVY 306
Query: 412 NMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTE------------PGVMIYLLPL 459
NM++++ LFR T + YA Y E +L NGV+ E G++ Y LP+
Sbjct: 307 NMMRLAEFLFRHTGDPGYAQYREYNLYNGVMAQTYYREYALNGNPHNHPGTGLLTYFLPM 366
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL- 518
G K+ W T + SF+CC+GT +++ + IY+++ +YI QY +S +
Sbjct: 367 KAGLRKD-----WSTETSSFFCCHGTMVQANAAWNRGIYYQDRDD---IYICQYFNSEMT 418
Query: 519 -DWKSGQIVVNQKVDPV-----------------------VSWDPYLRVTLTFSSKGSGL 554
+ G++ + Q DP+ + PY + +
Sbjct: 419 TEINGGELRIIQTQDPMNGNSMTSSNTAGYQSINEVAAIHENLPPYRKYDFVIRTSVQ-Q 477
Query: 555 TTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
+++ RIP W S+ + F + + W DK+++ LP+ +R +
Sbjct: 478 PFAIHFRIPEWIMSDAVLYVNDEFHGKTSDSTRFYPIRRVWRDGDKISVLLPIGIRFVPL 537
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
DD + A YGP VLAG
Sbjct: 538 PDDE----NTGAFRYGPEVLAG 555
>gi|383640258|ref|ZP_09952664.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas elodea
ATCC 31461]
Length = 652
Score = 228 bits (581), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 178/553 (32%), Positives = 259/553 (46%), Gaps = 69/553 (12%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWE-EP 173
L+ + DV LG AQ+ YLL L+ D+L+ FR A L YGGWE +P
Sbjct: 51 LQPFDMADVTLGEGPF-LHAQRATEAYLLRLEPDRLLHQFRVNAGLEPKAPAYGGWESDP 109
Query: 174 ---SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP-- 228
+GH +GHYLSA AL + +T ++++ + + L ACQ SG ++AFP
Sbjct: 110 LWSDIHCQGHTLGHYLSACALAYRATGEARYRQRVDYIATELGACQDAAKSGLVTAFPKG 169
Query: 229 ---TEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNR 281
R E + V P+YT+HK+ AGL D AD+ A LR+ W V
Sbjct: 170 AALVSAHLRGEKITGV--PWYTLHKVYAGLRDGALLADSEPARATLLRLADWGVV----- 222
Query: 282 VQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQA 341
+ S L E GGMN++ L+ +T ++ +A F L LA
Sbjct: 223 ---ASRPLSDAEFEAMLETEHGGMNEIYADLYFMTGKEEYRAIARRFSHKALLAPLARAQ 279
Query: 342 DDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG--------HQLESSGTNIGHFNFKSDP 393
D + G H+NT +P V+G Q YE TGD +++ Q S T GH D
Sbjct: 280 DHLDGLHANTQVPKVVGFQRVYEATGDAAYRDAAAFFWKTVAQTRSFATG-GH----GDN 334
Query: 394 KRLASNLDSNTE-------ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
+ + D T E+C +NMLK++R LF + AYADYYER+L NG+L Q
Sbjct: 335 EHFFAMADFETHVFSAKGSETCCQHNMLKLTRALFLHDPDPAYADYYERTLYNGILASQ- 393
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
+ G+ Y PG K YH TP SFWCC GTG+E+ K DSIYF +
Sbjct: 394 DPDSGMATYFQGARPGYMK--LYH---TPEHSFWCCTGTGMENHVKYRDSIYFHDAST-- 446
Query: 507 GVYIIQYISSRLDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSK-GSGLTTSLNLRIPT 564
+Y+ ++ S L W+ G ++V + P V T T + + +L+LR P
Sbjct: 447 -LYVNLFLPSTLRWRDKGAVLVQETRFPEVP-------TTTLRWRLDKPVDVTLSLRHPG 498
Query: 565 WTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYAS 623
W+ + A +NG+ +PG+ +++ + W D + +QL + E + P
Sbjct: 499 WSRT--ATVRVNGKVAARSVAPGSRIALPRNWRDGDVVELQLVM----EPGVERAPAAPD 552
Query: 624 IQAILYGPYVLAG 636
+ A YGP VLAG
Sbjct: 553 VVAFTYGPLVLAG 565
>gi|294646986|ref|ZP_06724603.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294806386|ref|ZP_06765229.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|292637657|gb|EFF56058.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294446401|gb|EFG15025.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 813
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 163/561 (29%), Positives = 269/561 (47%), Gaps = 64/561 (11%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEP 173
+ E + DV+L D + A++ N+E LL DVD+L+ +RK A L + Y W+
Sbjct: 39 YKNEFPIADVKL-LDGVFKHARELNIEVLLKYDVDRLLAPYRKEAGLTERKKTYPNWDG- 96
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSAC-------QKEIGSGYLSA 226
L GH GHYLSA ++ +A+T N+ +M ++S L C E GY+
Sbjct: 97 ---LDGHVAGHYLSAMSMNYAATGNKECGRRMEYMISELQLCLEANAINNTEWAIGYIGG 153
Query: 227 FPTEQ-----FDRLEALI--PVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMV 275
FP + F + + I WAP+Y +HK+ AGL D + Y +N +A L+ W +
Sbjct: 154 FPNSKNLWSTFKKGDLRIYNSAWAPFYNLHKMYAGLRDAWLYCNNKQAKTLFLKFCDWAI 213
Query: 276 EYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
++ + E+ L E GGMN++L + IT + K+L+ A + + L
Sbjct: 214 --------SITDDLNEEQMQTVLKMEYGGMNEILADAYQITGNKKYLVAAKRYSQNILLD 265
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIG 385
L+ D++ H+NT IP IG E++GD + G++ + G N
Sbjct: 266 PLSQGIDNLDNKHANTQIPKFIGFARIAELSGDTKYTNASRFSWETITGNRSLAFGGNSR 325
Query: 386 HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ 445
+F S D + ESC +YNMLK++ LFR YADYYER++ N +L Q
Sbjct: 326 REHFPSVTSCSDYINDVDGPESCNSYNMLKLTEDLFRMQPSAHYADYYERTVFNHILSTQ 385
Query: 446 RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
E G +Y S++ R Y + P+++ WCC GTG+E+ SK IY +
Sbjct: 386 H-PEHGGYVYFT-----SARPRHYRVYSAPNEAMWCCVGTGMENHSKYNQFIYTHSDD-- 437
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPY-LRVTLTFSSKGSGLTTSLNLRIPT 564
+++ +I+S L+WK+ +I + Q+ + PY R LT + S L +R P
Sbjct: 438 -SLFVNLFIASELNWKNKKISLRQETNF-----PYEERTKLTVTKASSPF--KLMIRYPG 489
Query: 565 WTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYAS 623
W K ++NG+ + + P +++ + + W+ D + ++LP+ E + P +
Sbjct: 490 WVDKGALKVSVNGKSMNYSALPSSYICIDRKWNKGDVVEVELPMRSTIEHL----PNVPN 545
Query: 624 IQAILYGPYVLAGHSIGDWDI 644
A ++GP +L G G D+
Sbjct: 546 YIAFMHGP-ILLGAKTGTEDL 565
>gi|328956144|ref|YP_004373477.1| hypothetical protein Corgl_1563 [Coriobacterium glomerans PW2]
gi|328456468|gb|AEB07662.1| protein of unknown function DUF1680 [Coriobacterium glomerans PW2]
Length = 751
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 185/642 (28%), Positives = 296/642 (46%), Gaps = 74/642 (11%)
Query: 40 TFRSNLLSSKNESYIKQIHSHNDHLTPSD----------DSAWLSLMPRKILREEEQDEL 89
TF +L +N+ +K + H P + ++A L+P+ ++ + +
Sbjct: 83 TFEVKILEERNKIDVKTVFPIELHHEPGETFYMPQAVAVETALGELLPQYVVWDGGEKRH 142
Query: 90 FSWAMLYRKIKNPGQFKVPERSG--------------EFLKEVSLHDVRLGSDSMHWRAQ 135
+ LY + VP R + ++ ++L VRL + AQ
Sbjct: 143 YEVPGLYEITGHIDASDVPVRGSVVVEPGVTITSMRSKKMRPINLTCVRLAPGTPAAAAQ 202
Query: 136 QTNLEYLLMLDVDKLVWNFRKTARLPAPGEP-YGGWEEPSCELRGHFVGHYLSASALMWA 194
Q L +L +D D+++ NFR+ A + G P GW+ P LRGH GHYLSA AL WA
Sbjct: 203 QRRLSFLKQVDDDQMLINFRRAAHMDTKGAPEMIGWDTPDSNLRGHTTGHYLSALALAWA 262
Query: 195 STHNESLKEKMSAVVSALSACQKE------IGSGYLSAFPTEQFDRLEALIP---VWAPY 245
+T +E++ K+S +V +L Q I G+LSA+ QFD LE P +WAPY
Sbjct: 263 ATGDETVHSKLSYMVHSLGEVQAAFRGQPGIHEGFLSAYDESQFDLLERYTPYPEIWAPY 322
Query: 246 YTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT-LNEEAGG 304
YT+HKILAGLLD Y YA N +AL + + + YNR+ + +++ W + E GG
Sbjct: 323 YTLHKILAGLLDSYRYAGNRQALEIAIGVGHWVYNRLSQ-LDPIQLKKMWAMYIAGEFGG 381
Query: 305 MNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYE 364
MN+ L L IT + + A FD + + D + H+N HIP VIG+ Y
Sbjct: 382 MNESLAMLGAITGEESFVKAARFFDNDKLIFPALQKVDALGTLHANQHIPQVIGALSLYG 441
Query: 365 VTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNML 414
VT ++ + + H + + G G P +A+ +D + ESC +YNM+
Sbjct: 442 VTHEESYYQVAEFFWHSVVAHHIYAFG-GTGDGEMFQQPCEIAAKIDEFSAESCASYNMI 500
Query: 415 KVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGT 474
K++R L+ + Y E L N +L G Y + PG+ K G
Sbjct: 501 KLTRDLYEYEPTADKMAYCENVLINHILSSTDHEGTGGSTYFMETQPGARK-------GF 553
Query: 475 PSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPV 534
+++ CC+GTG+ES G SIY++ EG+ + + Y++S L + +D
Sbjct: 554 DTEN-SCCHGTGLESQFMYGQSIYYQGEGQ---LIVALYLASHLKTDDTDVT----IDCD 605
Query: 535 VSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKT 594
+ +R+ + L L LR P W S+ ++NG + +++V +
Sbjct: 606 FNHPETVRIAI------GRLEGKLVLRHPDW--SDRMTVSINGAAARIAEKDGYVTVEDS 657
Query: 595 WSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ D++T++L LR DD + AI YGP+VLA
Sbjct: 658 LAPGDEITVRLNPELRLIPTPDD----PNRVAIGYGPFVLAA 695
>gi|383123086|ref|ZP_09943771.1| hypothetical protein BSIG_0174 [Bacteroides sp. 1_1_6]
gi|251841821|gb|EES69901.1| hypothetical protein BSIG_0174 [Bacteroides sp. 1_1_6]
Length = 802
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 173/562 (30%), Positives = 269/562 (47%), Gaps = 75/562 (13%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
SL DV+L S S +AQQT+L Y+L LD D+L F + A L Y WE + L
Sbjct: 29 SLQDVKLLS-SPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWE--NTGLD 85
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLE 236
GH GHYLSA ++M+A+T + ++ +++ +++ L Q+ +G+G++ P + + ++
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 237 A---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQ 283
A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ S + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFFHKVILDPLIKNEDR 257
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQL---HKE--------------GHQLESSGTNIGH 386
++G H+NT IP VIG + EV+ D H H+ G N
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKDDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEI--------AYADYYERSLT 438
+F + D E+C TYNML++++ L++ + ++ Y DYYER+L
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 439 NGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY 498
N +L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIY 431
Query: 499 FEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSL 558
++ +Y+ +I S+L+WK + + Q+ + D +VTL K + +L
Sbjct: 432 AHQQDT---LYVNLFIPSQLNWKEQGVTLTQET--LFPDDE--KVTLRI-DKAAKKNLTL 483
Query: 559 NLRIPTWT-SSNGAKATLNGQ----DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEA 613
+RIP W +S G + T+NG+ D+ + +L + + W D +T LP+ + E
Sbjct: 484 MIRIPEWAGNSKGYEITINGKKHLSDIQTGA-STYLPIRRKWKKGDMITFHLPMKVSLEQ 542
Query: 614 IQDDRPEYASIQAILYGPYVLA 635
I D + Y A LYGP VLA
Sbjct: 543 IPDKKDYY----AFLYGPIVLA 560
>gi|295133987|ref|YP_003584663.1| glycosyl hydrolase [Zunongwangia profunda SM-A87]
gi|294982002|gb|ADF52467.1| putative glycosyl hydrolase [Zunongwangia profunda SM-A87]
Length = 794
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 163/553 (29%), Positives = 269/553 (48%), Gaps = 71/553 (12%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
+L DV+L + + A T+L+Y+L ++ D+L+ F + A L E Y WE + L
Sbjct: 35 NLKDVKLHT-GLFEEAMYTDLDYILQMEPDRLLAPFLREAGLQPKAESYPNWE--NTGLD 91
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLE-- 236
GH GHYL+A A M+AS ++ ++++ ++ L Q G+GY+ P + E
Sbjct: 92 GHIGGHYLTALAQMYASAGSDEALQRLNYMIGELKKAQDANGNGYVGGIPDSERIWKEIS 151
Query: 237 ---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQ 283
+L W P Y IHK AGL D Y A N EA +M T WM++ N +
Sbjct: 152 EGKINAGGFSLNGGWVPLYNIHKTYAGLRDAYLIAGNEEAKQMLIDLTDWMIDITANLSE 211
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
I+ + L E GG+N+ ++ +T D K+L LA+ F + L L + D
Sbjct: 212 AQIQ--------EMLKSEHGGLNETFADVYKMTGDKKYLDLAYAFTQKQVLDPLEHEKDI 263
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTG--DQLHKEGHQ---------LESSGTNIG------H 386
++G H+NT IP VIG YE DQ +K+ H + + +IG H
Sbjct: 264 LNGMHANTQIPKVIG----YETIAALDQ-NKDYHNAATYFWENVVNNRTVSIGGNSVREH 318
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
F+ D + +++ E+C TYNMLK+S LF E Y D+YE+ L N +L Q
Sbjct: 319 FHPADDFSSMINSVQG--PETCNTYNMLKLSEKLFLANPEEKYIDFYEQGLYNHILSSQH 376
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
G +Y P+ PG Y + P S WCC G+G+E+ K + IY +
Sbjct: 377 PE--GGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHGKYNEMIYAHSDD--- 426
Query: 507 GVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
+Y+ +I S ++W+ + Q+ D + ++ + K LT +N R P+W
Sbjct: 427 ALYVNLFIPSEVNWEDKNFKLIQETDFPNAETASFKIE---TQKPQKLT--INFRYPSW- 480
Query: 567 SSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
+ G +N + + PG+++S+T+ W DD+++++LP+ + +E + P+ + +
Sbjct: 481 AGEGFDVQVNDKKVKFDKKPGSYISITRKWEDDDQISMRLPMNITSERL----PDGSDYE 536
Query: 626 AILYGPYVLAGHS 638
++ YGP VLA +
Sbjct: 537 SLKYGPLVLAAKT 549
>gi|29345759|ref|NP_809262.1| hypothetical protein BT_0349 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337652|gb|AAO75456.1| Acetyl-CoA carboxylase, biotin carboxylase [Bacteroides
thetaiotaomicron VPI-5482]
Length = 802
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 173/562 (30%), Positives = 269/562 (47%), Gaps = 75/562 (13%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
SL DV+L S S +AQQT+L Y+L LD D+L F + A L Y WE + L
Sbjct: 29 SLQDVKLLS-SPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWE--NTGLD 85
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLE 236
GH GHYLSA ++M+A+T + ++ +++ +++ L Q+ +G+G++ P + + ++
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 237 A---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQ 283
A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ S + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKVILDPLIKNEDR 257
Query: 344 ISGFHSNTHIPIVIGSQMRYEVT---GDQLHKE--------------GHQLESSGTNIGH 386
++G H+NT IP VIG + EV+ D H H+ G N
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKNDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEI--------AYADYYERSLT 438
+F + D E+C TYNML++++ L++ + ++ Y DYYER+L
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 439 NGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY 498
N +L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIY 431
Query: 499 FEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSL 558
++ +Y+ +I S+L+WK + + Q+ + D +VTL K + +L
Sbjct: 432 AHQQDT---LYVNLFIPSQLNWKEQGVTLTQET--LFPDDE--KVTLRI-DKAAKKNLTL 483
Query: 559 NLRIPTWT-SSNGAKATLNGQ----DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEA 613
+RIP W +S G + T+NG+ D+ + +L + + W D +T LP+ + E
Sbjct: 484 MIRIPEWAGNSKGYEITINGKKHLSDIQTGA-STYLPIRRKWKKGDMITFHLPMKVSLEQ 542
Query: 614 IQDDRPEYASIQAILYGPYVLA 635
I D + Y A LYGP VLA
Sbjct: 543 IPDKKDYY----AFLYGPIVLA 560
>gi|261407096|ref|YP_003243337.1| hypothetical protein GYMC10_3284 [Paenibacillus sp. Y412MC10]
gi|261283559|gb|ACX65530.1| protein of unknown function DUF1680 [Paenibacillus sp. Y412MC10]
Length = 622
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 170/563 (30%), Positives = 255/563 (45%), Gaps = 84/563 (14%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFR-KTARLPA---PGEPYGGWEEPSCELRGHFVGHYLSA 188
R ++ N YL+ LD L++N+ + R P +GGWE P C+LRGHF+GH+LS
Sbjct: 18 RRERANRSYLMKLDSGHLLFNYHLEAGRFHGRTIPEGAHGGWETPVCQLRGHFLGHWLSG 77
Query: 189 SALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTI 248
+AL + + + LK K+ A+V L CQ++ G ++ P + + + +WAP Y
Sbjct: 78 AALHYEESGDIELKAKLDAIVHELHECQRDNGGQWVGPIPEKYLHWIASGKSIWAPQYNC 137
Query: 249 HKILAGLLDQYTYADNAEAL----RMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGG 304
HKIL GL+D + YA N +AL R W VE+ ++ E+ L+ E GG
Sbjct: 138 HKILMGLVDAWQYAGNRQALDIVDRFADWFVEW--------SGTFTREQFDDILDVETGG 189
Query: 305 MNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYE 364
M +V L IT K+ +L + + L D ++ H+NT IP V+G YE
Sbjct: 190 MLEVWADLLHITGADKYRVLLDRYYRSRLFQPLLEGKDPLTNMHANTTIPEVLGCARAYE 249
Query: 365 VTGDQLH------------KEGHQLESSGTNIGHFNFKSDPK-RLASNLDSNTEESCTTY 411
VTGD E L + G G PK ++ + L +E CT Y
Sbjct: 250 VTGDDRWLSIVQAYWNCAVTERGSLATGGQTAGEVWM---PKMKMKARLGDKNQEHCTVY 306
Query: 412 NMLKVSRHLFRWTKEIAYADYYERSLTNGVL-----------GIQRG-TEPGVMIYLLPL 459
NM++++ LFR + + YA Y E +L NG++ G Q G++ Y LP+
Sbjct: 307 NMIRLADFLFRQSGDPTYAQYIEYNLYNGIMAQAYYQEYGLTGSQHNYPRTGLLTYFLPM 366
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
G KE W T +DSF+CC+GT +++ + IY+++ VYI QY S LD
Sbjct: 367 KAGLRKE-----WSTETDSFFCCHGTMVQANAAWNMGIYYQDGDI---VYISQYFDSELD 418
Query: 520 WKSGQIVVN---------------------QKVDPVVSWD---PYLRVTLTFSSKGSGLT 555
++ Q ++ S + P R S + T
Sbjct: 419 ASIAGTLIRIVQTQDKMSGSLLSSSNTAGYQAINDTASINENIPTFRKYDFIVSAAAPTT 478
Query: 556 TSLNLRIPTWTSSNGAKATLNG--QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEA 613
+L RIP W + GA +N Q L S NF + + W D ++I LP+ +R
Sbjct: 479 FTLRFRIPEWIMA-GASVYVNDVLQGTTLDSE-NFYDIHRAWKEGDTVSIMLPIGIRFVP 536
Query: 614 IQDDRPEYASIQAILYGPYVLAG 636
+ DD A YGP VLAG
Sbjct: 537 LPDDE----RTGAFRYGPEVLAG 555
>gi|359776490|ref|ZP_09279799.1| hypothetical protein ARGLB_045_00070 [Arthrobacter globiformis NBRC
12137]
gi|359306199|dbj|GAB13628.1| hypothetical protein ARGLB_045_00070 [Arthrobacter globiformis NBRC
12137]
Length = 1025
Score = 227 bits (578), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 144/431 (33%), Positives = 218/431 (50%), Gaps = 43/431 (9%)
Query: 222 GYLSAFPTEQFDRLEA-----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE+ VWAPYYT HKIL GLLD YT +AL + T + +
Sbjct: 391 GFLAAYPETQFIELESRTTPDYFRVWAPYYTAHKILKGLLDAYTATAEPKALDLATGLCD 450
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ ++R+ + +R W + E GG+ + + + + + P+HL LA FD +
Sbjct: 451 WMHSRLSKLTPAVR-QRMWGIFSSGEYGGVVEAILETYGHSGKPEHLELAKYFDLDSLID 509
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ-----------LESSGTNI 384
A D ++G H+N HIPI G + Y TG++ + + GT+
Sbjct: 510 ACAQDKDILAGLHANQHIPIFTGLVLMYNATGEERYLAAARNFWTMVVPTRMFSIGGTSQ 569
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G F + D R+A+ L++ ESC YNMLK+SR LF + AY DYYER+L N VLG
Sbjct: 570 GEFWKERD--RIAATLNATDAESCCAYNMLKLSRELFFREQNPAYMDYYERALFNQVLGS 627
Query: 445 QRGTEPG---VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
++ E + Y + L PG+ ++ TP CC GTG+ES +K DS+YF
Sbjct: 628 KQDKESAELPLATYFIGLQPGAVRDF------TPKQGTTCCEGTGLESATKYQDSVYF-T 680
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
G +Y+ Y+ S L W + + V Q+ S+ R TL + G L LR
Sbjct: 681 AGDGSALYVNLYMPSTLRWAAKNVTVTQQ----TSYPFEQRTTLQVAGSGQ---FELRLR 733
Query: 562 IPTWTSSNGAKATLNGQ-DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
+P W ++ G +NG +PG +LS+ + W + D + +++P TLR E DD
Sbjct: 734 VPAWATA-GFTVRVNGAVTEAAATPGTYLSIARAWKNGDTVDVEMPFTLRAERALDD--- 789
Query: 621 YASIQAILYGP 631
S+Q ++YGP
Sbjct: 790 -PSVQTLMYGP 799
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 55/113 (48%), Gaps = 9/113 (7%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARL-PAPGE---PYGGW 170
++ L DV LG + R ++ L + D + V FR A L P G P GGW
Sbjct: 49 VRPFKLSDVSLGP-GVFARKRELILNFARGYDERRYVNVFRANAGLRPLDGVVPLPAGGW 107
Query: 171 E----EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
E E + LRGHF GH++S A +A T E K+ +V++L C++ +
Sbjct: 108 EGLDGEANGNLRGHFTGHHMSMLAQAYAGTGEEVFGTKLRNLVASLHECRQAL 160
>gi|298384655|ref|ZP_06994215.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
gi|298262934|gb|EFI05798.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
Length = 802
Score = 227 bits (578), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 173/562 (30%), Positives = 269/562 (47%), Gaps = 75/562 (13%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
SL DV+L S S +AQQT+L Y+L LD D+L F + A L Y WE + L
Sbjct: 29 SLQDVKLLS-SPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWE--NTGLD 85
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLE 236
GH GHYLSA ++M+A+T + ++ +++ +++ L Q+ +G+G++ P + + ++
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 237 A---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQ 283
A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ S + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKVILDRLIKNEDR 257
Query: 344 ISGFHSNTHIPIVIGSQMRYEVT---GDQLHKE--------------GHQLESSGTNIGH 386
++G H+NT IP VIG + EV+ D H H+ G N
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKNDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEI--------AYADYYERSLT 438
+F + D E+C TYNML++++ L++ + ++ Y DYYER+L
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 439 NGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY 498
N +L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIY 431
Query: 499 FEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSL 558
++ +Y+ +I S+L+WK + + Q+ + D +VTL K + +L
Sbjct: 432 AHQQDT---LYVNLFIPSQLNWKEQGVTLTQET--LFPDDE--KVTLRI-DKAAKKKLTL 483
Query: 559 NLRIPTWT-SSNGAKATLNGQ----DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEA 613
+RIP W +S G + T+NG+ D+ + +L + + W D +T LP+ + E
Sbjct: 484 MIRIPEWAGNSKGYEITINGKKHLSDIQAGT-STYLPLRRKWKKGDVITFHLPMKVSLEQ 542
Query: 614 IQDDRPEYASIQAILYGPYVLA 635
I D + Y A LYGP VLA
Sbjct: 543 IPDKKDYY----AFLYGPIVLA 560
>gi|315498334|ref|YP_004087138.1| hypothetical protein Astex_1314 [Asticcacaulis excentricus CB 48]
gi|315416346|gb|ADU12987.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
48]
Length = 774
Score = 226 bits (577), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 173/556 (31%), Positives = 272/556 (48%), Gaps = 84/556 (15%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L VRL S+ + + N YLL L D+ + NFRK A L GE YGGWE + + G
Sbjct: 38 LSQVRL-KPSIFLTSIEANQRYLLSLSPDRFLHNFRKGAGLEPKGEVYGGWE--ARGIAG 94
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSA------------- 226
H +GHYLS +LM+A T +++ + V+S L Q + GY
Sbjct: 95 HSLGHYLSGLSLMYAQTGKPEFRDRAAHVLSELKTIQAKHSDGYAGGTTVGRNGQEVDGK 154
Query: 227 ----------FPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
T FD L W P YT HK+ AG LD + YA A+AL + T + +
Sbjct: 155 VVYEELRKGDIRTSGFD----LNGGWVPLYTYHKVFAGALDAHQYAGLADALIVATGLGD 210
Query: 277 YFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGL 336
Y + +++ S + + L E GG+ + +L+ T++ + L L+ +
Sbjct: 211 Y----LGTILESLSDAQIQEILRAEHGGLTESYAELYARTKNQRWLTLSQRLRHRAIVDP 266
Query: 337 LALQADDISGFHSNTHIPIVIGSQMRYEVTGD-----------QLHKEGHQLESSGTNIG 385
LA D+++G H+NT IP ++GS +E+T + Q H G N
Sbjct: 267 LAAGHDELAGKHANTQIPKIVGSARLFELTQNADDARIARFFWQTVSRDHSYVIGG-NSD 325
Query: 386 HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ 445
H +F + P++LAS LD T E+C +YNML+++RHL+ W+ + A D+YER+ N ++ Q
Sbjct: 326 HEHFGA-PRQLASRLDQQTCEACNSYNMLRLTRHLYGWSGDAALFDFYERTHLNHIMS-Q 383
Query: 446 RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
+ + G+ Y LA G + S P++ FWCC G+G+ES SK G+SIY++ +
Sbjct: 384 QDPQTGMFTYFTGLASGLGRVHS-----DPTNDFWCCVGSGMESHSKHGESIYWK---RG 435
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
GV + Y +S L+ Q+ +++ + +T+ + K +L+LR+P W
Sbjct: 436 EGVAVNLYYASTLNAPETQL----EMETAFPLSDQVVITVHKAPK------ALDLRVPGW 485
Query: 566 TSS-----NGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
+ NG KA GQ G +L +T + D++ + L + +R EA+ DD
Sbjct: 486 CDTPVLRVNG-KAAGVGQ-------GGYLRLTGL-KNGDRIELCLAMHVRVEAMPDD--- 533
Query: 621 YASIQAILYGPYVLAG 636
A + A L GP VLAG
Sbjct: 534 -AKLIAFLSGPLVLAG 548
>gi|312133546|ref|YP_004000885.1| protein [Bifidobacterium longum subsp. longum BBMN68]
gi|322690281|ref|YP_004219851.1| hypothetical protein BLLJ_0089 [Bifidobacterium longum subsp.
longum JCM 1217]
gi|311772796|gb|ADQ02284.1| Hypothetical protein BBMN68_1283 [Bifidobacterium longum subsp.
longum BBMN68]
gi|320455137|dbj|BAJ65759.1| conserved hypothetical protein [Bifidobacterium longum subsp.
longum JCM 1217]
Length = 800
Score = 226 bits (577), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 181/601 (30%), Positives = 276/601 (45%), Gaps = 94/601 (15%)
Query: 107 VPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARL-PAPG- 164
P G + L +V + S+S+ RA++ L+Y VD+ + FR A L P
Sbjct: 78 APALPGWKVAPFPLRNVAITSNSVFDRAKEGMLDYARNYPVDRWLVCFRAQANLLPKDNT 137
Query: 165 -EPYGGWEE-PSCEL--------------------------RGHFVGHYLSASALMWAST 196
+P GGWE PS L RGHF GH L + +A T
Sbjct: 138 TQPSGGWENFPSGSLDKAVEQQWGDAEYTRGQNKNGADGLLRGHFAGHALHMLSQAYAET 197
Query: 197 HNESLKEKMSAVVSALSACQKEIGS------------GYLSAFPTEQFDRLEALIP---V 241
E++ K++ VS L C+ + G+L+A+ QF LE P +
Sbjct: 198 GEEAILNKINEFVSGLKECRDSLREMKYNGKARYSHPGFLAAYGEWQFKALEEYAPYGEI 257
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT-LNE 300
WAP+YT HKILAGL+ Y +A NA+AL + + + Y R+ K +++ W +
Sbjct: 258 WAPWYTEHKILAGLIAAYEFAGNADALDLAEGIGHWTYARLSKCTKT-QLQKMWDIYIGG 316
Query: 301 EAGGMNDVLYKLFCITQDPKH---LMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVI 357
E GGMND L L+ +++D L + FD + D ++ H+N HIP +
Sbjct: 317 EYGGMNDSLVDLYNVSKDKDRSEFLKASAFFDTDKLIDNCGAGVDILNNLHANQHIPQFV 376
Query: 358 G---------------SQMRY--EVTGD-QLHKEGHQLESSGTNIGHFNFKSDPKRLASN 399
G ++ RY V G + G GT G +A +
Sbjct: 377 GYAKDAAMGDADIDADARARYLKAVEGYWGMIVPGRMYAHGGT--GEGEMWGPAHTVAGD 434
Query: 400 LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ-RGTEPGVMI---- 454
+ ESC YNMLKV+R+LF ++ AY DYYER++ N +LG + R + G +
Sbjct: 435 IGKRNAESCAAYNMLKVARYLFFIEQKPAYMDYYERTILNHILGGKSRDLDSGTALTPGN 494
Query: 455 -YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y+ P+ P + KE + GT CC GT +ES SK DSIYF +Y+ +
Sbjct: 495 CYMYPVNPATQKEYGDGNIGT------CCGGTALESHSKYQDSIYFHSTDNKE-LYVNLF 547
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+S LDW + + Q+ + + +++T + K + + +RIP W S GAK
Sbjct: 548 TASTLDWTDTGLKLAQETN--YPEEETSTISITAAPKSA---VTFRIRIPAW--SKGAKI 600
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NG+ + + G + +V +W DK+ + +PL LRTE+ DDR + IQ + YGP V
Sbjct: 601 EVNGKAIDGVTAGEYATVAGSWKVGDKIVVTIPLQLRTEST-DDRKD---IQTLFYGPTV 656
Query: 634 L 634
L
Sbjct: 657 L 657
>gi|423230906|ref|ZP_17217310.1| hypothetical protein HMPREF1063_03130 [Bacteroides dorei
CL02T00C15]
gi|423244617|ref|ZP_17225692.1| hypothetical protein HMPREF1064_01898 [Bacteroides dorei
CL02T12C06]
gi|392630026|gb|EIY24028.1| hypothetical protein HMPREF1063_03130 [Bacteroides dorei
CL02T00C15]
gi|392641466|gb|EIY35242.1| hypothetical protein HMPREF1064_01898 [Bacteroides dorei
CL02T12C06]
Length = 797
Score = 226 bits (576), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 178/586 (30%), Positives = 279/586 (47%), Gaps = 75/586 (12%)
Query: 96 YRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFR 155
Y +++ + VP L EV L +DS +A + YLL LDVD+L+ + R
Sbjct: 25 YEQVRKAPRVHVPVWQSFALSEVEL------TDSYFKKAMDLHKGYLLSLDVDRLIPHVR 78
Query: 156 KTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSAC 215
++ L G+ YGGWE+ G GHY+SA A+M+AST ++L +K++ ++ L C
Sbjct: 79 RSVGLQGKGDNYGGWEKHG----GCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQEC 134
Query: 216 QKEIGSGYLSAFPTEQFDRLEAL---IPVWAP---------------YYTIHKILAGLLD 257
QK+ G+ + L+ L + + P +Y IHKILAGL D
Sbjct: 135 QKQTPDGWFITGKRGKEGYLQLLQGNVVLNQPDETGQPWNYNQNGNSWYCIHKILAGLRD 194
Query: 258 QYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQ 317
Y YA +A + + ++ + ++ + + TL+ E GGMN+V ++ IT
Sbjct: 195 AYVYAGCRQAKDILMPLADF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITG 250
Query: 318 DPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG--- 374
D K L A F+ + +A D + G H+N IP +G YE + + ++ +
Sbjct: 251 DKKFLQTAERFNHINVIYPIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARN 310
Query: 375 --------HQLESSGTNI-GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK 425
H L G + F + + LD + E+C TYNMLK+SR LF
Sbjct: 311 FWNIVIKDHTLAIGGNSCYERFGVLGEESK---RLDYTSAETCNTYNMLKLSRQLFMLDG 367
Query: 426 EIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGT 485
+ Y +YYE +L N +L Q PG + Y L PGS K+ S TP DSFWCC GT
Sbjct: 368 DYKYLNYYEHALYNHILASQDPDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGT 422
Query: 486 GIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYL---- 541
G+E+ SK +SIYF++ + + + YI SRL WK + ++ D Y
Sbjct: 423 GMENHSKYAESIYFKDNQE---LLVNLYIPSRLHWKEKGL--------KLTLDTYFPESD 471
Query: 542 RVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDK 600
VT+ GS T +L R P W S + A +NG+ + G+++ + + S D
Sbjct: 472 TVTVRMDEIGS-YTGTLLFRYPDWVSGD-AVVRINGEPAQTEAHKGSYIRLLDSVKSGDV 529
Query: 601 LTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE 646
+T+ L + +D+ P + S ++YGP +LAG +G D+ E
Sbjct: 530 ITLVFTRNLYIDYAKDE-PHFGS---VMYGPILLAG-GLGTDDMPE 570
>gi|160883737|ref|ZP_02064740.1| hypothetical protein BACOVA_01709 [Bacteroides ovatus ATCC 8483]
gi|423297720|ref|ZP_17275780.1| hypothetical protein HMPREF1070_04445 [Bacteroides ovatus
CL03T12C18]
gi|156110822|gb|EDO12567.1| hypothetical protein BACOVA_01709 [Bacteroides ovatus ATCC 8483]
gi|392665078|gb|EIY58610.1| hypothetical protein HMPREF1070_04445 [Bacteroides ovatus
CL03T12C18]
Length = 800
Score = 226 bits (575), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 168/560 (30%), Positives = 267/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L LD D+L+ F + A L Y WE + L G
Sbjct: 30 LQNVKL-LDSPFLQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L+ Q+ +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHNLILDPLIKEEDKL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + E++ D + H+ G N
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYNML++++ L++ + + Y +YYER+L N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +Y+ +I S+L WK I++ Q+ LR+ K +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRINEAPKKK-----RTLM 484
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + + GN +L +++ W D +T LP+ + E I D
Sbjct: 485 IRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWEKGDVITFHLPMKVSVEQIPD 544
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 545 KKDYY----AFLYGPIVLAA 560
>gi|293370109|ref|ZP_06616674.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292634837|gb|EFF53361.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 800
Score = 226 bits (575), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 171/560 (30%), Positives = 268/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L LD D+L+ F + A L Y WE + L G
Sbjct: 30 LQNVKL-LDSPFLQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L+ Q+ +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQL---HKE--------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + EV+ D H H+ G N
Sbjct: 259 TGMHANTQIPKVIGYKRIAEVSQDDKTWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYNML++++ L++ + + Y +YYER+L N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +Y+ +I S+L WK I++ Q+ + +VTL T L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQE----TRFPDDDKVTLRIDEAPKKKRT-LM 484
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + + GN +L +++ W D +T LP+ + E I D
Sbjct: 485 IRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIPD 544
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 545 KKDYY----AFLYGPIVLAA 560
>gi|419849455|ref|ZP_14372501.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
gi|419852148|ref|ZP_14375044.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
gi|386411767|gb|EIJ26479.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
gi|386411993|gb|EIJ26692.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
Length = 800
Score = 226 bits (575), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 180/601 (29%), Positives = 277/601 (46%), Gaps = 94/601 (15%)
Query: 107 VPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARL-PAPG- 164
P G + L +V + S+S+ RA++ L+Y VD+ + FR A L P
Sbjct: 78 APALPGWKVAPFPLRNVAITSNSVFDRAKEGMLDYARNYPVDRWLVCFRAQANLLPKDNT 137
Query: 165 -EPYGGWE-------EPSCE--------------------LRGHFVGHYLSASALMWAST 196
+P GGWE + + E LRGHF GH L + +A T
Sbjct: 138 TQPSGGWENFPNGSLDKAVEQQWGDAEYTRGQNKNGADGLLRGHFAGHALHMLSQAYAET 197
Query: 197 HNESLKEKMSAVVSALSACQKEIGS------------GYLSAFPTEQFDRLEALIP---V 241
E++ K++ VS L C+ + G+L+A+ QF LE P +
Sbjct: 198 GEEAILNKINEFVSGLKECRDSLREMKYNGKARYSHPGFLAAYGEWQFKALEEYAPYGEI 257
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT-LNE 300
WAP+YT HKILAGL+ Y +A NA+AL + + + Y R+ K +++ W +
Sbjct: 258 WAPWYTEHKILAGLIAAYEFAGNADALDLAEGIGHWTYARLSKCTKT-QLQKMWDIYIGG 316
Query: 301 EAGGMNDVLYKLFCITQDPKH---LMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVI 357
E GGMND L L+ +++D L + FD + D ++ H+N HIP +
Sbjct: 317 EYGGMNDSLVDLYNVSKDKDRSEFLKASAFFDTDKLIDNCGAGVDILNNLHANQHIPQFV 376
Query: 358 G---------------SQMRY--EVTGD-QLHKEGHQLESSGTNIGHFNFKSDPKRLASN 399
G ++ RY V G + G GT G +A +
Sbjct: 377 GYAKDAAMGDADIDADARARYLKAVEGYWGMIVPGRMYAHGGT--GEGEMWGPAHTVAGD 434
Query: 400 LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ-RGTEPGVMI---- 454
+ ESC YNMLKV+R+LF ++ AY DYYER++ N +LG + R + G +
Sbjct: 435 IGKRNAESCAAYNMLKVARYLFFIEQKPAYMDYYERTILNHILGGKSRDLDSGTALTPGN 494
Query: 455 -YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y+ P+ P + KE + GT CC GT +ES SK DSIYF +Y+ +
Sbjct: 495 CYMYPVNPATQKEYGDGNIGT------CCGGTALESHSKYQDSIYFHSTDNKE-LYVNLF 547
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+S LDW + + Q+ + + +++T + K + + +RIP W S GAK
Sbjct: 548 TASTLDWTDTGLKLAQETN--YPEEETSTISITAAPKSA---VTFRIRIPAW--SKGAKI 600
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NG+ + + G + +V +W DK+ + +PL LRTE+ DDR + IQ + YGP V
Sbjct: 601 EVNGKAIDGVTAGEYATVAGSWKVGDKIVVTIPLQLRTEST-DDRKD---IQTLFYGPTV 656
Query: 634 L 634
L
Sbjct: 657 L 657
>gi|302561993|ref|ZP_07314335.1| secreted protein [Streptomyces griseoflavus Tu4000]
gi|302479611|gb|EFL42704.1| secreted protein [Streptomyces griseoflavus Tu4000]
Length = 950
Score = 225 bits (574), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 149/440 (33%), Positives = 223/440 (50%), Gaps = 42/440 (9%)
Query: 222 GYLSAFPTEQFDRLEALIP-----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE++ VWAPYYT HKIL GLLD Y D+ AL + + M +
Sbjct: 399 GFLAAYPETQFIALESMTGSDYTRVWAPYYTAHKILRGLLDAYLATDDERALDLASGMCD 458
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ + R+ +V+ +++R W + E GG+ + + L +T P+HL LA LFD +
Sbjct: 459 WMHARL-SVLPAATLQRMWGLFSSGEFGGIVEAVCDLHALTGRPEHLALARLFDLDRLID 517
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLES-SGTNI 384
A D + G H+N HIP+ G ++ TG+Q + H+ + GT+
Sbjct: 518 ACAADTDVLEGLHANQHIPVFTGLVRLHDETGEQRYLTAAKNFWGMVVPHRTYAIGGTSS 577
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G F +K+ +A + T ESC YNMLK+SR LF ++ AY DYYER+L N VLG
Sbjct: 578 GEF-WKAR-GVIAGTIGDTTAESCCAYNMLKLSRALFFHEQDPAYMDYYERTLYNQVLGS 635
Query: 445 QR---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
++ E ++ Y + L PG ++ TP CC GTG+ES +K DS+YF +
Sbjct: 636 KQDRPDAEKPLVTYFVGLTPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYFAK 689
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
+Y+ Y SRL W + V Q + TLT + T L LR
Sbjct: 690 A-DGSALYVNLYSDSRLAWAEKGVTVTQS----TRYPEEQGSTLTIGGGRASFT--LLLR 742
Query: 562 IPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
+P+W ++ G + T+NG+ +P P PG + V+++W D + I +P LR E DD
Sbjct: 743 VPSWATA-GFRVTVNGRAVPGAPVPGRYFGVSRSWRDGDTVRISVPFRLRVEKAPDD--- 798
Query: 621 YASIQAILYGPYVLAGHSIG 640
+QA+ GP L G
Sbjct: 799 -PGLQALFLGPVCLVARRPG 817
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 55/112 (49%), Gaps = 6/112 (5%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWE-- 171
++ L DV LG + ++ L++ DV++L+ FR A L G GGWE
Sbjct: 60 VRPFGLEDVTLGPGVFAAK-RRLMLDHARGYDVNRLLQVFRANAGLSTRGAVAPGGWEGL 118
Query: 172 --EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS 221
E + LRGH+ GH+L+ A ST + +++ VV AL ++ + S
Sbjct: 119 DGEANGNLRGHYTGHFLTMLAQAHRSTGEQVFADRIDTVVGALVEVREALRS 170
>gi|354580825|ref|ZP_08999729.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
gi|353201153|gb|EHB66606.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
Length = 623
Score = 225 bits (574), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 162/557 (29%), Positives = 258/557 (46%), Gaps = 72/557 (12%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFR-KTARLPA---PGEPYGGWEEPSCELRGHFVGHYLSA 188
R ++ N YL+ LD L++N++ + R P +GGWE P C+LRGHF+GH+LS
Sbjct: 18 RRERANRSYLMKLDSGHLLFNYQLEAGRFHGRTIPEGAHGGWETPVCQLRGHFLGHWLSG 77
Query: 189 SALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTI 248
+A+ + + + LK K+ A+V L CQ++ G ++ P + + +WAP Y +
Sbjct: 78 AAMHYEKSGDMELKAKLDAIVQELHECQRDNGGQWVGPIPEKYLHWIARGKSIWAPQYNL 137
Query: 249 HKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDV 308
HKIL GL+D + YA N +AL + ++F N ++ E+ L+ E GGM +V
Sbjct: 138 HKILMGLVDAWQYAGNRQALDIVDRFADWFVNWSGT----FTREQFDDILDVETGGMLEV 193
Query: 309 LYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGD 368
L IT K+ +L + + L D ++ H+NT IP V+G YEVTGD
Sbjct: 194 WADLLHITGADKYRVLLERYYRSRLFQPLLEGKDPLTNMHANTTIPEVLGCARAYEVTGD 253
Query: 369 QLH------------KEGHQLESSGTNIGHFNFKSDPK-RLASNLDSNTEESCTTYNMLK 415
E L + G G PK ++ + L +E CT YNM++
Sbjct: 254 DRWLSIVQAYWKCAVTERGSLATGGQTAGEVWM---PKMKMKARLGDKNQEHCTVYNMIR 310
Query: 416 VSRHLFRWTKEIAYADYYERSLTNGVL-----------GIQ-RGTEPGVMIYLLPLAPGS 463
++ LFR T + +YA Y E +L NG++ G Q + G++ Y LP+ G
Sbjct: 311 LAEFLFRQTGDPSYAQYIEYNLYNGIMAQAYYQEYGLTGSQHKHPHTGLLTYFLPMKAGL 370
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL----- 518
KE W T +DSF+CC+GT +++ + IY+ ++G+ +YI QY S L
Sbjct: 371 RKE-----WSTETDSFFCCHGTMVQANAAWNKGIYY-QDGEI--IYISQYFDSELRTSID 422
Query: 519 ----------DWKSGQIVVN------QKVDPVVSWD---PYLRVTLTFSSKGSGLTTSLN 559
D SG ++ + Q ++ + + P R S + T +L
Sbjct: 423 GTDIQIVQTQDKMSGSLLSSSNTAGYQAINDTAATNENMPAFRKYDFIVSTAAPTTFTLR 482
Query: 560 LRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
RIP W + + + +F + + W D ++I LP+ +R + DD
Sbjct: 483 FRIPEWIMAEVSVYVNDRLQGTTRDSSSFYDIHRAWKEGDTVSIMLPIGIRFVPLPDDE- 541
Query: 620 EYASIQAILYGPYVLAG 636
A YGP VLAG
Sbjct: 542 ---RTGAFRYGPEVLAG 555
>gi|255691978|ref|ZP_05415653.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides
finegoldii DSM 17565]
gi|260622387|gb|EEX45258.1| hypothetical protein BACFIN_07051 [Bacteroides finegoldii DSM
17565]
Length = 800
Score = 225 bits (574), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 169/562 (30%), Positives = 272/562 (48%), Gaps = 76/562 (13%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L DV+L DS +AQQT+L Y+L L+ D+L+ F + A L Y WE + L G
Sbjct: 30 LQDVKL-LDSPFLQAQQTDLHYILALNPDRLLAPFLREAGLTPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA 237
H GHYLSA ++M+A+T + ++ +++ ++ L Q+ +G+G++ P + + ++A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAIYNRLNYMLDELYRAQQAVGTGFIGGTPGSLQLWKEIKA 146
Query: 238 ---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
L W P Y IHK AGL D Y Y + +A RM T WM++
Sbjct: 147 GNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYTGSDQARRMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S ++ L E G+N+ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLSDQQIQDMLRSEHSGLNETFADVAAITGDKKYLELARRFSHKIILDPLIKDKDRL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQL---HKE----------GHQLESSGTNIG------ 385
+G H+NT IP VIG + E++ D H E + + IG
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAEEWDHAARFFWNTVVNNRSVCIGGNSVRE 318
Query: 386 HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSL 437
HF+ + + + D E+C TYNML++++ L++ + + Y +YYER+L
Sbjct: 319 HFHPADNFTSMIN--DVQGPETCNTYNMLRLTKMLYQNSHNPCNINEPDPNYINYYERAL 376
Query: 438 TNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSI 497
N +L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ I
Sbjct: 377 YNHILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFI 430
Query: 498 YFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTS 557
Y ++ +Y+ +I S+L+WK +++ Q+ + +VTL K S +
Sbjct: 431 YAHQKDT---LYVNLFIPSQLNWKEQGVILTQE----TRFPDDNKVTLRI-DKASKKQRT 482
Query: 558 LNLRIPTWTS-SNGAKATLNGQDLPLPS-PGN-FLSVTKTWSSDDKLTIQLPLTLRTEAI 614
L +RIP W + S+ ++NG+ P+ GN +L +++ W D +T LP+ + E I
Sbjct: 483 LMIRIPEWANQSSNYSISINGKKETFPTKKGNQYLPLSRKWKKGDVITFNLPMKVTIEQI 542
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
D + Y A LYGP VLA
Sbjct: 543 PDKKDYY----AFLYGPIVLAA 560
>gi|299146241|ref|ZP_07039309.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
gi|298516732|gb|EFI40613.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
Length = 800
Score = 225 bits (574), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 169/560 (30%), Positives = 268/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L LD D+L+ F + A L Y WE + L G
Sbjct: 30 LQNVKL-LDSPFLQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L+ Q+ +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + E++ D + H+ G N
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYNML++++ L++ + + Y +YYER+L N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +Y+ +I S+L WK I++ Q+ + +VTL T L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQE----TRFPDDDKVTLRIDEAPKKKRT-LM 484
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + + GN +L +++ W D +T LP+ + E I D
Sbjct: 485 IRIPEWANQSKGYSVSINGKRKIFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIPD 544
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 545 KKDYY----AFLYGPIVLAA 560
>gi|295085157|emb|CBK66680.1| Uncharacterized protein conserved in bacteria [Bacteroides
xylanisolvens XB1A]
Length = 800
Score = 225 bits (574), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 169/560 (30%), Positives = 268/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L LD D+L+ F + A L Y WE + L G
Sbjct: 30 LQNVKL-LDSPFLQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L+ Q+ +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + E++ D + H+ G N
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYNML++++ L++ + + Y +YYER+L N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +Y+ +I S+L WK I++ Q+ + +VTL T L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQE----TRFPDDDKVTLRIDEAPKKKRT-LM 484
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + + GN +L +++ W D +T LP+ + E I D
Sbjct: 485 IRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIPD 544
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 545 KKDYY----AFLYGPIVLAA 560
>gi|331702303|ref|YP_004399262.1| hypothetical protein Lbuc_1953 [Lactobacillus buchneri NRRL
B-30929]
gi|329129646|gb|AEB74199.1| protein of unknown function DUF1680 [Lactobacillus buchneri NRRL
B-30929]
Length = 803
Score = 225 bits (574), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 200/642 (31%), Positives = 291/642 (45%), Gaps = 117/642 (18%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEPS-CELRGHFVGHYLSA-SA 190
AQQ ++YLL LD + + F + A + + G Y GWE RGHF GHYLSA S
Sbjct: 20 AQQMTVKYLLALDPKRFLVTFDEVAGIDSGGVTGYQGWERTDGLNFRGHFFGHYLSALSQ 79
Query: 191 LMWASTHN---ESLKEKMSAVVSALSACQKEIG------SGYLSAFPTEQFDRLEAL-IP 240
+ A+ N + L +K+ V+ L + Q +GY+SAF D +E +P
Sbjct: 80 AILATEENDIRQQLLDKLRLGVNGLQSAQAAYAKSHPDSAGYVSAFREVALDEVEGREVP 139
Query: 241 ------VWAPYYTIHKILAGLLDQYTYADNAE------ALRMTTWMVEYFYNRVQNVIKK 288
V P+Y +HK+LAGLL + AL++ Y + R+ +
Sbjct: 140 KDEKENVLVPWYNLHKVLAGLLAVKVNLQGIDPLLSEKALKIAHQFGIYVFKRLNQLADP 199
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
Q L E GGMND LY+LF +T D + L A FD+ LA D ++G H
Sbjct: 200 T------QMLKIEYGGMNDALYELFDLTDDKRMLTAATYFDETALFKQLAEGDDVLAGKH 253
Query: 349 SNTHIPIVIGSQMRYEVTGD---------------------------QLHKEGHQLESSG 381
+NT IP +IG+ RYE D Q+ + H + G
Sbjct: 254 ANTTIPKLIGALHRYESLHDVKRADQYLSPEEKGSLNMYLKAAVNFWQIVVDDHTYVTGG 313
Query: 382 TNIG-HFNFKSDPKRLASNL----DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERS 436
+ HF+ +P +L + + T E+C TYNMLK+SR LFR T + Y DYYE++
Sbjct: 314 NSQSEHFH---EPGQLFHDAVLEDGATTCETCNTYNMLKLSRELFRVTGDKKYLDYYEQT 370
Query: 437 LTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDS 496
TN +LG Q G+M Y P+A G +K + P D FWCC GTGIE+F+KLGDS
Sbjct: 371 YTNAILGSQ-NPNTGMMTYFQPMAAGYTKV-----YNRPFDEFWCCTGTGIENFTKLGDS 424
Query: 497 IYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFS---SKGSG 553
F + +Y+ Y S+ L S + + ++VD +V LT + S+ S
Sbjct: 425 YDFMSGDQ---LYLSLYFSNVLRLDSNNLQMTEQVDRKTG-----KVHLTVAKLRSQDSA 476
Query: 554 LTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDK-----LTIQLPLT 608
+L LR P W + AK ++G + +F W D+ + +++P++
Sbjct: 477 GAINLKLRNPAWLVQS-AKLAVDGISQQVDQNADF------WEIDNAGPGTTVDLEIPMS 529
Query: 609 LRTEAIQDDRPEYASIQAILYGPYVLAG----HSIGDWDITESATSLSDWITPIPA---- 660
L+ +D+ P Y + + YGPYVLAG H I D +S +P+
Sbjct: 530 LKMVQTKDN-PHYVAFK---YGPYVLAGQLGKHHINDDRPNGVLVRISTHDQAVPSTLTT 585
Query: 661 ---------SYNSQLITFTQEYGNTKFVLTNSNQSITMEKFP 693
S NSQ + T E NT F L N S T+ P
Sbjct: 586 GMDWHDWQQSLNSQAVVDT-ETTNTLFELKLPNTSETITFVP 626
>gi|423287556|ref|ZP_17266407.1| hypothetical protein HMPREF1069_01450 [Bacteroides ovatus
CL02T12C04]
gi|392672671|gb|EIY66138.1| hypothetical protein HMPREF1069_01450 [Bacteroides ovatus
CL02T12C04]
Length = 800
Score = 225 bits (573), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 168/560 (30%), Positives = 267/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L LD D+L+ F + A L Y WE + L G
Sbjct: 30 LQNVKL-LDSPFLQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L+ Q+ +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHNLILDPLIKEEDKL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + E++ D + H+ G N
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYNML++++ L++ + + Y +YYER+L N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +Y+ +I S+L WK I++ Q+ LR+ K +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRIDEAPKKK-----RTLM 484
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + + GN +L +++ W D +T LP+ + E I D
Sbjct: 485 IRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWEKGDVITFHLPMKVSVEQIPD 544
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 545 KKDYY----AFLYGPIVLAA 560
>gi|336405535|ref|ZP_08586212.1| hypothetical protein HMPREF0127_03525 [Bacteroides sp. 1_1_30]
gi|335937406|gb|EGM99306.1| hypothetical protein HMPREF0127_03525 [Bacteroides sp. 1_1_30]
Length = 800
Score = 225 bits (573), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 168/560 (30%), Positives = 267/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L LD D+L+ F + A L Y WE + L G
Sbjct: 30 LQNVKL-LDSPFLQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L+ Q+ +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLAHQMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + E++ D + H+ G N
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYNML++++ L++ + + Y +YYER+L N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +Y+ +I S+L WK I++ Q+ LR+ K +L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRIDEAPKKK-----RTLM 484
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + + GN +L +++ W D +T LP+ + E I D
Sbjct: 485 IRIPEWANQSKGYSISINGKRKMFVMAKGNQYLPLSRKWKKGDVITFHLPMKVSVEQIPD 544
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 545 KKDYY----AFLYGPIVLAA 560
>gi|451820300|ref|YP_007456501.1| acetyl-CoA carboxylase, biotin carboxylase [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451786279|gb|AGF57247.1| acetyl-CoA carboxylase, biotin carboxylase [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 766
Score = 225 bits (573), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 161/534 (30%), Positives = 261/534 (48%), Gaps = 43/534 (8%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
SL VRL + +Q +Y+L LDVD+ + + L + Y GWE + +
Sbjct: 10 SLSKVRL-LEGFFKTSQDLGEKYILSLDVDRFLAPCYEAHGLEPKKKRYSGWEARA--IS 66
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEAL 238
GH +GH++SA A+ + +T NE LK+ + VS LS Q+ G GY+ F +
Sbjct: 67 GHSLGHFMSALAVTYQATGNEELKKILDYAVSELSHIQQVTGRGYIGGLVETPFVEIIDG 126
Query: 239 IPV--------WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYS 290
+ W P+Y+IHKI GL+D Y A+N+EAL + V F + +++ + S
Sbjct: 127 TNIGKFDINGYWVPWYSIHKIYKGLIDAYELAENSEALNV----VVNFADWAVSILNQMS 182
Query: 291 IERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSN 350
E+ L E GGMN + KL+ T + +L A F + L DD+ G H+N
Sbjct: 183 DEQVQAMLECEHGGMNHIFAKLYGFTCNSIYLDTAVRFSHKAIVEPLEQCVDDLQGKHAN 242
Query: 351 THIPIVIG-SQMRYEVTGDQLHKEGHQ------LESSGTNIGHFNFKSDPKRL-ASNLDS 402
T IP +IG +++ + + +K Q + IG + K + + +L
Sbjct: 243 TQIPKIIGIAEIYNQEHAYEKYKTAAQFFWNTVVNRRSYVIGGNSLKEHFEAIDMESLGI 302
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
T ESC T+NML +++ LF W AY DYYE +L N ++G Q G Y L PG
Sbjct: 303 KTAESCNTHNMLLLTKLLFSWNHYSAYMDYYENALFNHIIGTQ-DCHTGNKTYFTSLLPG 361
Query: 463 SSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
Y + T ++WCC GTG+E+ K ++IYF+E+ +Y+ +ISS+ DW++
Sbjct: 362 -----HYRIYSTKDTAWWCCTGTGMENPGKYAEAIYFQEQ---DDLYVNLFISSQFDWEA 413
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL 582
+ + Q+ + PY + +G ++N+R+P+W +S A +NG+D +
Sbjct: 414 KGLTIRQESNL-----PYSDTVILKIIEGKA-EANINIRVPSWITSELV-AVVNGKDRFV 466
Query: 583 PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+L+V+ W +++ I P+ + +D+ A A YGP VLAG
Sbjct: 467 QREKGYLTVSGAWDKGNEIRITFPMAVSKYTSKDN----AGKIAFTYGPVVLAG 516
>gi|302873208|ref|YP_003841841.1| hypothetical protein Clocel_0296 [Clostridium cellulovorans 743B]
gi|307688627|ref|ZP_07631073.1| hypothetical protein Ccel74_10733 [Clostridium cellulovorans 743B]
gi|302576065|gb|ADL50077.1| protein of unknown function DUF1680 [Clostridium cellulovorans
743B]
Length = 607
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 156/556 (28%), Positives = 270/556 (48%), Gaps = 52/556 (9%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG--------- 164
LK ++ +++L S+ N YL+ + L+ NF A + PG
Sbjct: 1 MLKPINTKNIKLLP-SIFKERYDLNRNYLINVKNQGLLQNFYLEAGIILPGLQVLHNPDT 59
Query: 165 -EPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGY 223
E + GW+ P+C+LRGHF+GH+LSA+A ++ S + LK K+ ++ L CQ+ G +
Sbjct: 60 DEIHWGWDAPTCQLRGHFLGHWLSAAASIFVSEQDHELKAKLDKIIDELIKCQELNGGEW 119
Query: 224 LSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
+ P + F +LE VW+P Y +HK+L GL++ Y ++ +AL + + ++
Sbjct: 120 IGPIPEKYFQKLENSHHVWSPQYVMHKVLMGLMNSYIDTNSDKALAILDKLSNWYIKWTD 179
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+++ I+ E GM +V ++ IT + K+L LA + P L D
Sbjct: 180 DML----IKNPRAIYGGEEAGMLEVWITMYEITAEEKYLELAKKYSNPRIFRDLEAGRDT 235
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ--LESSGTNIGHF--------NFKSDP 393
++ H+N IP G+ YEVTGD+ ++ + +++ T+ G++ + + P
Sbjct: 236 LTNCHANASIPWSHGAAKLYEVTGDEKWRKITEAFWKNAVTDRGYYCSGGQGAGEYWTPP 295
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
+L L + +E CT YNM++ + +L++WT + ++ADY E +L NG L Q+ G+
Sbjct: 296 FKLGLFLSDSNQEFCTVYNMIRTASYLYKWTGDTSFADYIELNLYNGFLA-QQNKYTGMP 354
Query: 454 IYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y LPL GS K+ WGT + FWCC+GT +++ + IYFE++ + + + QY
Sbjct: 355 TYFLPLGAGSKKK-----WGTETRDFWCCHGTMVQAQTLYNSLIYFEDKER---LVVSQY 406
Query: 514 ISSRLDW--KSGQIVVNQKVDPVVSWDPYL----------RVTLTFS-SKGSGLTTSLNL 560
I S L W + I + Q+V+ D R +L F + + +L+
Sbjct: 407 IPSELKWNYNNTDITIQQRVNMKYYNDLAFFDERDESQMSRWSLKFQVAAEKNESFTLSF 466
Query: 561 RIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
R+P W + N + L ++++ + WS D+ L I P L + P+
Sbjct: 467 RVPKWVKELPSVTINNEKIDDLTVDEGYINIKREWSQDEVL-IYFPCRLEISPL----PD 521
Query: 621 YASIQAILYGPYVLAG 636
A + GP VLAG
Sbjct: 522 MPDTFAFMEGPIVLAG 537
>gi|237722208|ref|ZP_04552689.1| acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
gi|229448018|gb|EEO53809.1| acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
Length = 800
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 168/560 (30%), Positives = 265/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L LD D+L+ F + A L Y WE + L G
Sbjct: 30 LQNVKL-LDSPFLQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L+ Q+ +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYSRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKDEDKL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + E++ D + H+ G N
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYNML++++ L++ + + Y +YYER+L N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNNTQEPDPNYVNYYERALYN 378
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
++ +Y+ +I S+L WK I + Q+ LR+ K +L
Sbjct: 433 HQKDT---LYVNLFIPSQLTWKEQGITLTQETRFPDDGKVTLRIDEAHKKK-----RTLM 484
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + GN +L +++ W D +T LP+ + E I D
Sbjct: 485 IRIPEWANQSKGYSVSINGKRKIFVMGKGNQYLPLSRKWKKGDVVTFNLPMKVTMEQIPD 544
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 545 KKDYY----AFLYGPIVLAA 560
>gi|189467200|ref|ZP_03015985.1| hypothetical protein BACINT_03584 [Bacteroides intestinalis DSM
17393]
gi|189435464|gb|EDV04449.1| beta-lactamase [Bacteroides intestinalis DSM 17393]
Length = 720
Score = 224 bits (572), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 143/404 (35%), Positives = 217/404 (53%), Gaps = 40/404 (9%)
Query: 248 IHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMND 307
+HK+ +GL+ QY YADN +AL + T M + YN+ +K + + E GG+N+
Sbjct: 1 MHKLFSGLIYQYLYADNKQALEVVTRMGNWTYNK----LKPLDESTRKRMIRNEFGGVNE 56
Query: 308 VLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTG 367
Y L+ IT D ++ LA F + L Q DD+ H+NT IP V+ YE+T
Sbjct: 57 SFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDLGTKHTNTFIPKVLTEARNYELTQ 116
Query: 368 DQLHKEGHQLESS--GTNIGHFNFKS----------DPKRLASNLDSNTEESCTTYNMLK 415
D + +L T I H F DP++L+ +L T E+C TYNMLK
Sbjct: 117 DN---DSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQLSKHLTGYTGETCCTYNMLK 173
Query: 416 VSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTP 475
+SRHLF WT + ADYYER+L N +LG Q+ E G++ Y LPL GS K S T
Sbjct: 174 LSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSYFLPLLSGSHKVYS-----TR 227
Query: 476 SDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVV 535
+SFWCC G+G E+ +K G++IY+ + G+Y+ +I S ++WK+ I + Q+
Sbjct: 228 ENSFWCCVGSGFENHAKYGEAIYYHNDQ---GIYVNLFIPSEVNWKAKGITLRQE----T 280
Query: 536 SWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKT 594
++ LT + +TT++ LR P+W S K +NG+ + + PG+++ VT+
Sbjct: 281 AFPAEENTALTIQTDKP-VTTTIYLRYPSW--SKNVKVNVNGKKVSVKQKPGSYIPVTRQ 337
Query: 595 WSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
W D++ P++L+ E D+ P+ A+LYGP VLAG S
Sbjct: 338 WKDGDRIEANYPMSLQLETTPDN-PQKG---ALLYGPLVLAGES 377
>gi|336417295|ref|ZP_08597620.1| hypothetical protein HMPREF1017_04728 [Bacteroides ovatus
3_8_47FAA]
gi|335936275|gb|EGM98208.1| hypothetical protein HMPREF1017_04728 [Bacteroides ovatus
3_8_47FAA]
Length = 800
Score = 224 bits (572), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 169/560 (30%), Positives = 268/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L LD D+L+ F + A L Y WE + L G
Sbjct: 30 LQNVKL-LDSPFLQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L+ Q+ +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 147 GKIHAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + E++ D + H+ G N
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYNML++++ L++ + + Y +YYER+L N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +Y+ +I S+L WK I++ Q+ + +VTL T L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGIILRQE----TRFPDDDKVTLRIDEAPKKKRT-LM 484
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + + GN +L +++ W D +T LP+ + E I D
Sbjct: 485 IRIPEWANQSKGYSVSINGKRKMFVMAKGNQYLPLSRKWKKGDVITFNLPMRVSMEQIPD 544
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 545 KKDYY----AFLYGPIVLAA 560
>gi|427386394|ref|ZP_18882591.1| hypothetical protein HMPREF9447_03624 [Bacteroides oleiciplenus YIT
12058]
gi|425726434|gb|EKU89299.1| hypothetical protein HMPREF9447_03624 [Bacteroides oleiciplenus YIT
12058]
Length = 792
Score = 224 bits (572), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 162/545 (29%), Positives = 259/545 (47%), Gaps = 75/545 (13%)
Query: 133 RAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALM 192
+AQQT+L Y+L ++ D+L+ F + A L Y WE + L GH GHY+SA ++M
Sbjct: 42 QAQQTDLHYILAMEPDRLLAPFLREAGLAPKAPSYTNWE--NTGLDGHIGGHYISALSMM 99
Query: 193 WASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE---------------QFDRLEA 237
+A+T + ++ +++ ++ L Q+ +G+G++ P FD
Sbjct: 100 YAATGDTAVYNRLNYMLDELHRAQQAVGTGFIGGTPGSLQLWKEIKEGNIRAGGFD---- 155
Query: 238 LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQNVIKKYSIER 293
L W P Y IHK AGL D Y YA + A M T WM+ + + ++
Sbjct: 156 LNSKWVPLYNIHKTYAGLRDAYLYAGSDLAREMLIALTDWMI--------GITAGLTDQQ 207
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHI 353
L E GG+N+ + IT D K+L LA F L L D ++G H+NT I
Sbjct: 208 MQDMLRSEHGGLNETFADVAAITGDKKYLELARRFSHKVILDPLIKDEDRLTGMHANTQI 267
Query: 354 PIVIGSQMRYEVTGDQL---------HKE--------GHQLESSGTN--IGHFNFKSDPK 394
P VIG + E++ D H H+ G N HF+ +D
Sbjct: 268 PKVIGYKRIAELSQDDNVWNHATEWDHAARFFWNTVVNHRSVCIGGNSVREHFHPANDFS 327
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI 454
+ ++++ E+C TYNML++++ L++ + + +ADYYER+L N +L Q + G +
Sbjct: 328 PMLNDIEG--PETCNTYNMLRLTKMLYQDSPDSRFADYYERALYNHILASQE-PDKGGFV 384
Query: 455 YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
Y P+ PG Y + P S WCC G+G+E+ +K G+ IY ++ +Y+ +I
Sbjct: 385 YFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYAHQKDT---LYVNLFI 436
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT-SSNGAKA 573
S+L WK + + Q+ + LR+ K S ++++R P W SS G
Sbjct: 437 PSQLTWKEKGVSLVQETRFPDNGQVTLRI-----DKASKKAFTISIRQPEWADSSKGYNL 491
Query: 574 TLNGQDLPLPSPGN--FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+NG++ + N +LSV + W D +T LP+ ++ E I D Y A LYGP
Sbjct: 492 KVNGKEQSSATATNSGYLSVNRKWKKGDVVTFTLPMQIKMEQIPDKENYY----AFLYGP 547
Query: 632 YVLAG 636
VLA
Sbjct: 548 IVLAA 552
>gi|307109022|gb|EFN57261.1| hypothetical protein CHLNCDRAFT_143813 [Chlorella variabilis]
Length = 349
Score = 224 bits (571), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 123/251 (49%), Positives = 150/251 (59%), Gaps = 9/251 (3%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
SL DV+L S + R + N EYLL L+ D+L++NFRKTA LPAPG YGGWE E+R
Sbjct: 27 SLADVQLARGSEYARNFEQNSEYLLALEPDRLLYNFRKTAGLPAPGASYGGWEWSGVEIR 86
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEAL 238
GHFVGHYLSA AL + L+E+ +VS L Q G+GYLSAFP FDRLEAL
Sbjct: 87 GHFVGHYLSALALATLHSGRPELRERCGVMVSELKKVQDAAGTGYLSAFPESHFDRLEAL 146
Query: 239 IPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHW-QT 297
PV HKILAGLLDQ+ A AL M +F RV+ V+ + HW +
Sbjct: 147 QPV-------HKILAGLLDQHRLVGTAGALGAARRMASHFCARVRAVVAANGTD-HWHRV 198
Query: 298 LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVI 357
L E GGMN+ LY L+ IT+ P+H AH FDKP F LA D + G H+NTH+ V
Sbjct: 199 LEVEFGGMNEALYNLYAITKSPEHAECAHFFDKPAFFRPLAEGRDPLPGLHANTHMAQVP 258
Query: 358 GSQMRYEVTGD 368
G RYE+ GD
Sbjct: 259 GFTARYELLGD 269
>gi|440730056|ref|ZP_20910155.1| hypothetical protein A989_02030 [Xanthomonas translucens DAR61454]
gi|440379682|gb|ELQ16270.1| hypothetical protein A989_02030 [Xanthomonas translucens DAR61454]
Length = 807
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 169/551 (30%), Positives = 263/551 (47%), Gaps = 61/551 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
LK+V+L S+ + QTN YLL L+ D+L+ NF + A LP GE YGGWE +
Sbjct: 65 LKQVTL------KPSLFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGEVYGGWEGDT 118
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
+ GH +GHYLSA A M A T + +L++++ +V+ L+ Q + GY+ +
Sbjct: 119 --IAGHTLGHYLSALAKMHAQTRDAALRQRIDYIVAELARAQAKDPDGYVGGLTRKNDKG 176
Query: 232 --------FDRLEALI---------PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
F+ + I W+P YT+HK+ AGLLD + A NA+AL++ +
Sbjct: 177 AIDNGKLVFEEVRRGIIKGSKFNLNGSWSPLYTVHKLFAGLLDAHALAGNAQALQVLLPL 236
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
Y + V + L+ E GG+N+ +L T DP+ + L +
Sbjct: 237 AGY----LGGVFDALDHAQMQTLLDTEFGGLNESYIELGARTGDPRWIALGKRLRHEKVI 292
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNI 384
A D++ H+NT +P IG ++EV GD GH G N
Sbjct: 293 DPAAAGRDELPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTGHYSYVIGGNA 352
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
F+ +P +A+ L T E C +YNMLK++RHL++WT + Y DYYER+L N +
Sbjct: 353 DREYFQ-EPDTIAAFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAA 411
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGK 504
Q G+ Y+ P+ G ER + DSFWCC G+G+E+ ++ GDSIY+++
Sbjct: 412 QH-PATGMFTYMTPMISGG--ERGF---SDKFDSFWCCVGSGMEAHAQFGDSIYWQDA-- 463
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
+Y+ YI S LDW + + ++D V + +V L G+ L LR+P
Sbjct: 464 -VSLYVNLYIPSTLDWPERDLTL--ELDSGVPDNG--KVRLQLRRAGARTPRRLLLRLPA 518
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W +NG+ + +L++ + W S D + + L + LR E D A
Sbjct: 519 WC-QGAYTLRVNGKSQRGTAADGYLALERQWRSGDVIELDLAMPLRLEHAAGD----ADT 573
Query: 625 QAILYGPYVLA 635
++ GP LA
Sbjct: 574 VVVMRGPLALA 584
>gi|237711613|ref|ZP_04542094.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
gi|229454308|gb|EEO60029.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
Length = 770
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 176/576 (30%), Positives = 276/576 (47%), Gaps = 70/576 (12%)
Query: 106 KVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE 165
K P + +L +V L +DS +A + YLL LDVD+L+ + R++ L G+
Sbjct: 3 KAPRVHVPVWQSFALSEVEL-TDSYFKKAMDLHKGYLLSLDVDRLIPHVRRSVGLQGKGD 61
Query: 166 PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLS 225
YGGWE+ G GHY+SA A+M+AST ++L +K++ ++ L CQK+ G+
Sbjct: 62 NYGGWEKHG----GCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFI 117
Query: 226 AFPTEQFDRLEAL---IPVWAP---------------YYTIHKILAGLLDQYTYADNAEA 267
+ L+ L + + P +Y IHKILAGL D Y YA +A
Sbjct: 118 TGKRGKEGYLQLLQGNVVLNQPDETGQPWNYNQNGNSWYCIHKILAGLRDAYVYAGCRQA 177
Query: 268 LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHL 327
+ + ++ + ++ + + TL+ E GGMN+V ++ IT D K L A
Sbjct: 178 KDILMPLADF----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAER 233
Query: 328 FDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQ 376
F+ + +A D + G H+N IP +G YE + + ++ + H
Sbjct: 234 FNHINVIYPIANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHT 293
Query: 377 LESSGTNI-GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
L G + F + + LD + E+C TYNMLK+SR LF + Y +YYE
Sbjct: 294 LAIGGNSCYERFGVLGEESK---RLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEH 350
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGD 495
+L N +L Q PG + Y L PGS K+ S TP DSFWCC GTG+E+ SK +
Sbjct: 351 ALYNHILASQDPDMPGCVTYYTSLLPGSFKQYS-----TPFDSFWCCVGTGMENHSKYAE 405
Query: 496 SIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYL----RVTLTFSSKG 551
SIYF++ + + + YI SRL WK + ++ D Y VT+ G
Sbjct: 406 SIYFKDNQE---LLVNLYIPSRLHWKEKGL--------KLTLDTYFPESDTVTVRMDEIG 454
Query: 552 SGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLR 610
S T +L R P W S + A +NG+ + G+++ + + S D +T+ L
Sbjct: 455 S-YTGTLLFRYPDWVSGD-AVVRINGEPAQTEAHKGSYIRLLDSVKSGDVITLVFTRNLY 512
Query: 611 TEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE 646
+ +D+ P + S ++YGP +LAG +G D+ E
Sbjct: 513 IDYAKDE-PHFGS---VMYGPILLAG-GLGTDDMPE 543
>gi|423299329|ref|ZP_17277354.1| hypothetical protein HMPREF1057_00495 [Bacteroides finegoldii
CL09T03C10]
gi|408473138|gb|EKJ91660.1| hypothetical protein HMPREF1057_00495 [Bacteroides finegoldii
CL09T03C10]
Length = 800
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 169/562 (30%), Positives = 271/562 (48%), Gaps = 76/562 (13%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L DV+L DS +AQQT+L Y+L L+ D+L+ F + A L Y WE + L G
Sbjct: 30 LQDVKL-LDSPFLQAQQTDLHYILALNPDRLLAPFLREAGLTPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA 237
H GHYLSA ++M+A+T + ++ +++ ++ L Q+ +G+G++ P + + ++A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAIYNRLNYMLDELYRAQQAVGTGFIGGTPGSLQLWKEIKA 146
Query: 238 ---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
L W P Y IHK AGL D Y Y + A M T WM++
Sbjct: 147 GNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYTGSDRARLMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S ++ L E GG+N+ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLSDQQIQDMLRSEHGGLNETFADVAAITGDKKYLELARRFSHKIILDPLIKDEDRL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQL---HKE----------GHQLESSGTNIG------ 385
+G H+NT IP VIG + E++ D H E + + IG
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAEEWDHAARFFWNTVVNNRSVCIGGNSVRE 318
Query: 386 HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSL 437
HF+ + + + D E+C TYNML++++ L++ + + Y +YYER+L
Sbjct: 319 HFHPADNFTSMIN--DVQGPETCNTYNMLRLTKMLYQNSHNPCNINEPDPNYINYYERAL 376
Query: 438 TNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSI 497
N +L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ I
Sbjct: 377 YNHILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFI 430
Query: 498 YFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTS 557
Y ++ +Y+ +I S+L+WK +++ Q+ + +VTL K S +
Sbjct: 431 YAHQKDT---LYVNLFIPSQLNWKEQGVILTQE----TRFPDDNKVTLRI-DKASKKQRT 482
Query: 558 LNLRIPTWTS-SNGAKATLNGQDLPLPS-PGN-FLSVTKTWSSDDKLTIQLPLTLRTEAI 614
L +RIP W + S+ ++NG+ P+ GN +L +++ W D +T LP+ + E I
Sbjct: 483 LMIRIPEWANQSSNYSISINGKKETFPTKKGNQYLPLSRKWKKGDVITFNLPMKVTIEQI 542
Query: 615 QDDRPEYASIQAILYGPYVLAG 636
D + Y A LYGP VLA
Sbjct: 543 PDKKDYY----AFLYGPIVLAA 560
>gi|423213125|ref|ZP_17199654.1| hypothetical protein HMPREF1074_01186 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694381|gb|EIY87609.1| hypothetical protein HMPREF1074_01186 [Bacteroides xylanisolvens
CL03T12C04]
Length = 800
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 169/560 (30%), Positives = 267/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L LD D+L+ F + A L Y WE + L G
Sbjct: 30 LQNVKL-LDSPFLQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L+ Q+ +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + E++ D + H+ G N
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYNML++++ L++ + + Y +YYER+L N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +Y+ +I S+L WK I + Q+ + +VTL T L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGITLTQE----TCFPDDGKVTLRIDEAPKKKHT-LM 484
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + + GN +L +++ W D +T LP+ + E I D
Sbjct: 485 IRIPEWANQSKGYSVSINGKRKMFIMAKGNQYLPLSRKWKKGDVVTFHLPMKVSVEQIPD 544
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 545 KKDYY----AFLYGPIVLAA 560
>gi|298484121|ref|ZP_07002288.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
gi|298269711|gb|EFI11305.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
Length = 776
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 169/560 (30%), Positives = 267/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L LD D+L+ F + A L Y WE + L G
Sbjct: 6 LQNVKL-LDSPFLQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 62
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L+ Q+ +G+G++ P +
Sbjct: 63 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 122
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 123 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 174
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 175 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 234
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + E++ D + H+ G N
Sbjct: 235 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 294
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYNML++++ L++ + + Y +YYER+L N
Sbjct: 295 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 354
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 355 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 408
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +Y+ +I S+L WK I + Q+ + +VTL T L
Sbjct: 409 YRKDT---LYVNLFIPSQLTWKEQGITLTQE----TCFPDDGKVTLRIDEAPKKKRT-LM 460
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + + GN +L +++ W D +T LP+ + E I D
Sbjct: 461 IRIPEWANQSKGYSVSINGKRKMFIMAKGNQYLPLSRKWKKGDVVTFHLPMKVSVEQIPD 520
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 521 KKDYY----AFLYGPIVLAA 536
>gi|126348374|emb|CAJ90096.1| conserved hypothetical protein [Streptomyces ambofaciens ATCC
23877]
Length = 942
Score = 223 bits (568), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 147/435 (33%), Positives = 223/435 (51%), Gaps = 50/435 (11%)
Query: 222 GYLSAFPTEQFDRLEALIP-----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE++ VWAPYYT HKIL GLLD + + AL + + + +
Sbjct: 391 GFLAAYPETQFVELESMTGSDYTRVWAPYYTAHKILRGLLDAHLATGDGRALDLASGLCD 450
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ Y+R+ + +++R W + E GG+ + + L +T + HL LA LFD +
Sbjct: 451 WMYSRLSK-LPAATLQRMWGLFSSGEFGGIVEAICDLHAVTGEAHHLALARLFDLDRLID 509
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLES-SGTNI 384
A D + G H+N HIPI G ++ TG++ + H++ + GT+
Sbjct: 510 ACAADDDVLDGLHANQHIPIFTGLVRLHDATGEERYLTAAKNFWGMVVPHRMYAIGGTST 569
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G F D +A L + T ESC YNMLK+SR LF ++ AY DYYER+L N VLG
Sbjct: 570 GEFWQARDV--IAGTLGATTAESCCAYNMLKLSRTLFFHEQDPAYMDYYERALYNQVLGS 627
Query: 445 QR---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF-E 500
++ E ++ Y + L PG ++ TP CC GTG+ES +K DS+YF
Sbjct: 628 KQDAADAEKPLVTYFVGLTPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYFAA 681
Query: 501 EEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLR---VTLTFSSKGSGLTTS 557
+G +Y+ Y S L W + V Q D Y R TLT G + +
Sbjct: 682 ADGN--ALYVNLYSRSTLTWAERGVTVTQDTD-------YPREQGSTLTLG--GGSASFA 730
Query: 558 LNLRIPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
L LR+P W ++ G + T+NG +P +PG++ +V++TW D + +++P LR E D
Sbjct: 731 LRLRVPAWATA-GFRVTVNGHAVPGTATPGSYFTVSRTWRRGDTVRVRVPFRLRVEKALD 789
Query: 617 DRPEYASIQAILYGP 631
D S+QA+ GP
Sbjct: 790 D----PSLQALFLGP 800
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 57/110 (51%), Gaps = 6/110 (5%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWE-- 171
++ L DV LG + ++ L++ DVD+L+ FR A L G GGWE
Sbjct: 52 VRPFGLEDVTLGR-GVFADKRRLMLDHARGYDVDRLLQVFRANAGLSTLGAVAPGGWEGL 110
Query: 172 --EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
E + LRGH+ GH+L+ A T E E+++++V+AL+ ++ +
Sbjct: 111 DGEANGNLRGHYTGHFLTMLAQAHRGTGEEVFAERITSMVTALTEVRESL 160
>gi|440700043|ref|ZP_20882328.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
gi|440277439|gb|ELP65547.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
Length = 934
Score = 223 bits (568), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 146/438 (33%), Positives = 220/438 (50%), Gaps = 42/438 (9%)
Query: 222 GYLSAFPTEQFDRLEALI-----PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE++ VWAPYYT HKIL GLLD Y D++ AL + + M +
Sbjct: 383 GFLAAYPETQFIALESMTSGDYTKVWAPYYTAHKILKGLLDAYLATDDSRALDLASGMCD 442
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ Y+R+ + +++R W + E GG+ + + L+ IT +HL LA LFD +
Sbjct: 443 WMYSRLSK-LPDATLQRMWGIFSSGEFGGIVETIVDLYTITNKAEHLALAKLFDLDTLID 501
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ-----------LESSGTNI 384
A D ++G H+N HIPI G Y+ TG+ + + GT+
Sbjct: 502 ACAANTDTLNGLHANQHIPIFTGYVRLYDATGEARYLTAAKNFWGMVIPQRMYGIGGTST 561
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G F +K+ +A + E+C YN+LK+SR LF ++ Y DYYER+L N VLG
Sbjct: 562 GEF-WKAR-GVIAGTVSDTNAETCCAYNLLKLSRTLFFHEQDPKYMDYYERALYNQVLGS 619
Query: 445 QR---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
++ E ++ Y + L PG ++ TP CC GTG+ES +K DS+YF+
Sbjct: 620 KQDKADAEKPLVTYFIGLNPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYFKS 673
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
+Y+ Y S L W + V Q + + TLT G +L LR
Sbjct: 674 ADG-GSLYVNLYSPSTLTWAEKGVTVTQTTE----YPKEQGTTLTIG--GGSAAFALRLR 726
Query: 562 IPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
+P W ++ G + T+NGQ + P G++ +V++TW S D + I +P LR E DD
Sbjct: 727 VPLWATA-GFQVTVNGQAVSGTPVAGSYFAVSRTWQSGDVVRISVPFRLRVEKALDD--- 782
Query: 621 YASIQAILYGPYVLAGHS 638
S+Q + YGP L S
Sbjct: 783 -PSLQTLFYGPVNLVARS 799
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 59/110 (53%), Gaps = 6/110 (5%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWE-- 171
L+ L DV LG + +Q L++ DV++L+ FR A L G GGWE
Sbjct: 44 LRPFELKDVALGQGVFASK-RQLMLDHGRGYDVNRLLQVFRANAGLSTGGAVAPGGWEGL 102
Query: 172 --EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
E + LRGH+ GH+LS + +AST +++ ++++ +V AL+ + +
Sbjct: 103 DGEANGNLRGHYTGHFLSMLSQAYASTRDQAYADRIATMVGALTDVRAAL 152
>gi|395772531|ref|ZP_10453046.1| glycosylase [Streptomyces acidiscabies 84-104]
Length = 828
Score = 223 bits (568), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 147/440 (33%), Positives = 228/440 (51%), Gaps = 45/440 (10%)
Query: 221 SGYLSAFPTEQFDRLEALIP-----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMV 275
+G+L+A+P QF +LE++ VWAPYYT HKIL GLLD Y +A AL + M
Sbjct: 339 AGFLAAYPETQFIQLESMTASDYSKVWAPYYTAHKILRGLLDAYAATGDARALDLAGGMA 398
Query: 276 EYFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
++ ++R+ + +++R W + E GG+ + L L+ +T +HL LA LFD +
Sbjct: 399 DWMHSRLSK-LPGATLQRMWGLFSSGEFGGIVEALCDLYDLTGKGEHLALARLFDLDRLI 457
Query: 335 GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLES-SGTN 383
A D + G H+N HIPI G Y+ TG++ + H++ S GT+
Sbjct: 458 DACAANTDVLDGLHANQHIPIFTGYLRLYDATGEERYLAAARNFWDMVVPHRMYSIGGTS 517
Query: 384 IGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG 443
F D +A + + ESC YNMLK+SR LF ++ Y DYYER+L N VLG
Sbjct: 518 DAEFWRARDV--VAGAISGASAESCCAYNMLKLSRALFLHAQDAKYMDYYERALFNQVLG 575
Query: 444 IQRGT---EPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF- 499
+R E ++ Y L L PG ++ TP CC GTG+ES +K D++YF
Sbjct: 576 SKRDVADAEKPLVTYFLGLNPGHVRDY------TPKQGTTCCEGTGLESATKYQDTVYFV 629
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+G +Y+ + S L+W + + V Q + P+ + T T + +G GL +
Sbjct: 630 AADGS--SLYVNLFSPSTLEWAAKGVRVVQD-----TAFPFEQGT-TLTVRGGGL-FEMR 680
Query: 560 LRIPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
LR+P W + +G + +NGQ + P PG++ V++ W D + +++P +R E DD
Sbjct: 681 LRVPVW-AVDGFRVFVNGQAVSGSPMPGSYFGVSREWRDGDVVRVEVPFRMRVERTPDD- 738
Query: 619 PEYASIQAILYGPYVLAGHS 638
+S+QA+ YGP L S
Sbjct: 739 ---SSVQAVFYGPVNLVARS 755
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 50/90 (55%), Gaps = 5/90 (5%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWE----EPSCELRGHFVGHYLSAS 189
+Q L++ DV++L+ FR A L G GGWE E + LRGH+ GH+L+
Sbjct: 26 RQLMLDHARGYDVNRLLQVFRANAGLATLGAVAPGGWEGLDGEANGNLRGHYTGHFLTML 85
Query: 190 ALMWASTHNESLKEKMSAVVSALSACQKEI 219
+ +AST +E EK+ +V AL+ ++ +
Sbjct: 86 SQAYASTGDEVYAEKIRTIVGALTESREAL 115
>gi|262407626|ref|ZP_06084174.1| acetyl-CoA carboxylase [Bacteroides sp. 2_1_22]
gi|294644495|ref|ZP_06722254.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294808396|ref|ZP_06767149.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|345511903|ref|ZP_08791442.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
gi|262354434|gb|EEZ03526.1| acetyl-CoA carboxylase [Bacteroides sp. 2_1_22]
gi|292640162|gb|EFF58421.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294444324|gb|EFG13038.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|345453983|gb|EEO49450.2| acetyl-CoA carboxylase [Bacteroides sp. D1]
Length = 800
Score = 223 bits (567), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 168/560 (30%), Positives = 267/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L LD D+L+ F + A L Y WE + L G
Sbjct: 30 LQNVKL-LDSPFLQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L+ Q+ +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLHKE-----------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + E++ D + H+ G N
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYN+L++++ L++ + + Y +YYER+L N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNILRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +Y+ +I S+L WK I + Q+ + +VTL T L
Sbjct: 433 YRKDT---LYVNLFIPSQLTWKEQGITLTQE----TCFPDDGKVTLRIDEAPKKKRT-LM 484
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + + GN +L +++ W D +T LP+ + E I D
Sbjct: 485 IRIPEWANQSKGYSVSINGKRKMFIMAKGNQYLPLSRKWKKGDVVTFHLPMKVSVEQIPD 544
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 545 KKDYY----AFLYGPIVLAA 560
>gi|452750721|ref|ZP_21950468.1| Putative glycosyl hydrolase of unknown function (DUF1680) [alpha
proteobacterium JLT2015]
gi|451961915|gb|EMD84324.1| Putative glycosyl hydrolase of unknown function (DUF1680) [alpha
proteobacterium JLT2015]
Length = 744
Score = 222 bits (566), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 169/584 (28%), Positives = 265/584 (45%), Gaps = 65/584 (11%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A + N EYL+ LD D+L+ N+R +A L G+ YGGWE S + GH +GHYLSA AL
Sbjct: 9 AVERNREYLMSLDPDRLLHNYRTSAGLAPKGDVYGGWE--SDTIAGHTLGHYLSALALTH 66
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAF----PTEQFDRLEALIP--------- 240
A T +E + + +V L+ Q G GY++ F P + + + P
Sbjct: 67 AQTGDEESCRRANYIVGELATVQAAHGDGYVAGFTRKRPDGEIVDGKEIFPEIMAGDIRS 126
Query: 241 -------VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIER 293
W P Y HK+ GL D N AL + + +Y + + E+
Sbjct: 127 AGFDLNGCWVPLYNWHKLYTGLYDVADLCGNRTALPIAVALGDY----IDRMFAALDDEQ 182
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHI 353
L E GG+N+ +L+ T + + L L L L D ++ FH+NT +
Sbjct: 183 VQTVLACEYGGLNESFAELYARTGERRWLRLGERIYDNKVLDPLTRGEDRLANFHANTQV 242
Query: 354 PIVIGSQMRYEVTG------------DQLHKEGHQLESSGTNIGHFNFKSDPKRLASNLD 401
P +IG YE+T D + K + + +F S+P ++ ++
Sbjct: 243 PKLIGLARLYELTSKPAQGAAAEFFWDTVTKRHSYVIGGNADREYF---SEPNSISKHIT 299
Query: 402 SNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAP 461
T E C +YNMLK++RHL+ W A D+YER+ N +L Q+ E G Y+ PL
Sbjct: 300 EQTCEHCNSYNMLKLTRHLYSWRPRSALFDFYERAHLNHILS-QQHPETGGFSYMTPLMS 358
Query: 462 GSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK 521
G+++E Y G D+FWCC GTG+ES +K GDSI+++ + + + YI + +W+
Sbjct: 359 GTARE--YSEPG--KDAFWCCVGTGMESHAKHGDSIFWQGDD---ALIVNLYIPAAANWR 411
Query: 522 SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP 581
V + + LTF+ + LR+P W S +NG+ +
Sbjct: 412 PRGASVRLE----TRYPEEGSANLTFTELAKPGRFPVALRVPAWAES--VDVRVNGKAVA 465
Query: 582 LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA---GHS 638
+++V++ W + D+L I +P+ LR E DD + A+L GP VLA G +
Sbjct: 466 AKVEDGYVTVSRRWQAGDRLAIAMPMRLRIEPTADD----PDMIALLRGPMVLAADLGPA 521
Query: 639 IGDWDITESATSLSDWITPIPASYNSQLITFTQ---EYGNTKFV 679
++D A SD + S TQ G+ +FV
Sbjct: 522 EEEFDGAAPALVGSDLLAKFVPEAGSATAFATQGIGRPGDMRFV 565
>gi|295133234|ref|YP_003583910.1| hypothetical protein ZPR_1378 [Zunongwangia profunda SM-A87]
gi|294981249|gb|ADF51714.1| putative secreted protein [Zunongwangia profunda SM-A87]
Length = 1016
Score = 222 bits (566), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 178/606 (29%), Positives = 267/606 (44%), Gaps = 102/606 (16%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKT--ARLPAPGEPYGGWEE 172
L +VSL G ++ + + L + D ++ FR P +P G W+
Sbjct: 373 LDQVSLESNTNGQNTKFIENRDKFINTLAQTNPDSFLYMFRNAFGQEQPVGAKPLGVWDT 432
Query: 173 PSCELRGHFVGHYLSASALMWASTH-----NESLKEKMSAVVSALSACQ----------- 216
+LRGH GHYL+A A +AST ++ +KM +V+ L
Sbjct: 433 QETKLRGHATGHYLTAIAQAYASTGYDKALQQNFADKMEYMVNTLYQLSQMSGKPAEEGG 492
Query: 217 --------------KEI-----------------GSGYLSAFPTEQFDRLE-------AL 238
KEI G G++SA+P +QF LE
Sbjct: 493 DFNANPTAVPMGPGKEIYSSDLSEEGIRTDYWNWGEGFISAYPPDQFIMLENGAVYGTEE 552
Query: 239 IPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTL 298
+WAPYYT+HKILAGL+D Y + N +AL + M ++ Y R+ + I + +
Sbjct: 553 TKIWAPYYTLHKILAGLMDIYEVSGNEKALAVAEGMGDWVYARLSELPTDTLISMWNRYI 612
Query: 299 NEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLG------LLALQADDISGFHSNT 351
E GGMN+ + +L+ IT +L A LFD F G LA D G H+N
Sbjct: 613 AGEFGGMNEAMARLYRITGKDTYLETARLFDNIKVFFGDANHSHGLAKNVDTFRGLHANQ 672
Query: 352 HIPIVIGSQMRYEVTGDQ----------LHKEGHQLESSGTNIGHFN------FKSDPKR 395
HIP ++G+ Y + + + S G G N F + P
Sbjct: 673 HIPQIVGALEMYRDSDKPEYFNVADNFWVKATNDYMYSIGGVAGARNPANAECFIAQPGT 732
Query: 396 LASNLDS--NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
L N S E+C TYNMLK++R+LF + + DYYER L N +L P
Sbjct: 733 LYENGLSAGGQNETCATYNMLKLTRNLFLYEQRPELMDYYERGLYNHILASVAEDSP-AN 791
Query: 454 IYLLPLAPGSSKERSYHHWGTPS-DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
Y +PL PGS K +G P+ F CC GT +ES +KL +SIYF+ +Y+
Sbjct: 792 TYHVPLRPGSKKS-----FGNPNMTGFTCCNGTALESSTKLQNSIYFKGADN-KALYVNL 845
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
Y+ S L W I + Q+ + + LT + KG L LR+P W ++NG
Sbjct: 846 YVPSTLHWHEKNIELTQETN----FPKEDHTKLTINGKGK---FDLKLRVPGW-ATNGFT 897
Query: 573 ATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+NG+D + +PG +LS+++ W D + +Q+P + I D + +I ++ YGP
Sbjct: 898 VKINGKDQKVKATPGTYLSLSRKWKDGDTVELQMPFGFYLDPIMDQQ----NIASLFYGP 953
Query: 632 YVLAGH 637
+LA
Sbjct: 954 VLLAAQ 959
>gi|120435050|ref|YP_860736.1| hypothetical protein GFO_0692 [Gramella forsetii KT0803]
gi|117577200|emb|CAL65669.1| conserved hypothetical protein, membrane or secreted [Gramella
forsetii KT0803]
Length = 796
Score = 222 bits (566), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 167/552 (30%), Positives = 266/552 (48%), Gaps = 62/552 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
LK DV+L DS A +LEY+L LD D+L+ F K A L E Y WE +
Sbjct: 34 LKLFPHEDVQL-LDSPFRDAMLVDLEYILKLDPDRLLAPFLKEAGLETKVESYPNWE--N 90
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQF 232
L GH GHYL+A +LM+A+T N+ + E+++ ++ L Q + GY+ P E +
Sbjct: 91 TGLDGHIGGHYLTALSLMYAATGNQEVLERLNYMLDELQKVQ-QANVGYIGGVPDSKELW 149
Query: 233 DRLE---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFY 279
++ +L W P Y IHK AGL D Y A A + ++ WM+E
Sbjct: 150 QQISEGNINAGSFSLNDRWVPLYNIHKTYAGLRDAYQIAGIERAKTMLIDLSDWMLE--- 206
Query: 280 NRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
V S E+ + L E GG+N+ ++ IT + K+L LA+ F + L L
Sbjct: 207 -----VTSDLSEEQIQELLISEYGGLNETFADVYEITGEKKYLDLAYAFSQKELLKPLED 261
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG------HQLESSGTNIG------HF 387
D ++G H+NT IP VIG Q + ++ +++ + + IG HF
Sbjct: 262 DQDVLTGMHANTQIPKVIGFQTIAALNDNREYRDAASFFWDNVVNERSVAIGGNSVREHF 321
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
+ K D + S++ E+C TYNMLK+S LF Y DYYE++L N +L Q
Sbjct: 322 HPKDDFSTMMSSVQG--PETCNTYNMLKLSEKLFLTEANEKYVDYYEQALYNHILSSQH- 378
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
E G +Y P+ PG Y + P SFWCC G+G+E+ K + IY E +
Sbjct: 379 PEKGGFVYFTPMRPG-----HYRVYSQPETSFWCCVGSGLENHGKYNEFIYAHTENE--- 430
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+Y+ +I S L+W+ + + QK + + + L + +L LR PTW
Sbjct: 431 LYVNLFIPSILNWEEKGLKLTQKTEFPNEETSKISINLKEVEE-----FTLMLRYPTW-- 483
Query: 568 SNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
+ G +N + + L + PG+++S+ + W+ D++ +Q+P+ + + + D + A
Sbjct: 484 AKGFNILVNQEKVELNNEPGSYVSIKREWTDGDEIELQIPMNISSVGLPDGSNNF----A 539
Query: 627 ILYGPYVLAGHS 638
+ YGP VL +
Sbjct: 540 LKYGPLVLGAKT 551
>gi|374712027|gb|AEZ64557.1| putative secreted protein [Streptomyces chromofuscus]
Length = 933
Score = 222 bits (565), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 147/441 (33%), Positives = 221/441 (50%), Gaps = 48/441 (10%)
Query: 222 GYLSAFPTEQFDRLEALIP-----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE++ VWAPYYT HKIL GLLD + Y D+ AL + + + +
Sbjct: 382 GFLAAYPETQFITLESMTSSDYGVVWAPYYTAHKILRGLLDAHLYTDDPRALDLASGLCD 441
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ Y+R+ + +++R W + E GG+ + + L +T P+HL LA LFD +
Sbjct: 442 WMYSRLSR-LPASTLQRMWGIFSSGEFGGLVEAVCDLHALTGKPEHLALARLFDLDSLID 500
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ-----------LESSGTNI 384
A D + G H+N HIPI G ++ TG+ + + GT+
Sbjct: 501 ACAANRDVLDGLHANQHIPIFTGLLRLHDATGEARYLAAAKNFWDMVVPTRMYGIGGTST 560
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G F +A + + T ESC YNMLK+SR LF ++ Y DYYER+L N VLG
Sbjct: 561 GEF--WRGRGSVAGTISATTAESCCAYNMLKLSRLLFFHEQDPKYMDYYERALYNQVLGS 618
Query: 445 QRGT---EPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
++ T E ++ Y + L PG ++ TP CC GTG+ES +K DS+YF +
Sbjct: 619 KQDTADAEKPLVTYFIGLTPGHVRDY------TPKAGTTCCEGTGMESATKYQDSVYFRK 672
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLR-VTLTFSSKGSGLTTSLNL 560
+Y+ Y +S L W I V Q D Y R T + G L L
Sbjct: 673 ADDSV-LYVNLYSASTLTWAERGITVTQTTD-------YPREQGSTLTIGGGSAAFELRL 724
Query: 561 RIPTWTSSNGAKATLNG---QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDD 617
R+P+W + G + T+NG Q PL PG++ +V++TW D + +++P LR E DD
Sbjct: 725 RVPSWADA-GFQVTVNGTAVQGKPL--PGSYFAVSRTWRGGDIVRVRVPFRLRVEPTPDD 781
Query: 618 RPEYASIQAILYGPYVLAGHS 638
++Q++ +GP L S
Sbjct: 782 ----PALQSLFHGPVNLVARS 798
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 57/110 (51%), Gaps = 6/110 (5%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWE-- 171
L+ L DV LG + ++ L++ DVD+L+ FR A L G GGWE
Sbjct: 44 LRPFDLKDVTLGP-GIFATKRRFMLDHGRGYDVDRLLQVFRANAGLSTRGAVAPGGWEGL 102
Query: 172 --EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
E + LRGH+ GH+L+ A + ST ++ +++ ++V AL+ + +
Sbjct: 103 DGEANGNLRGHYTGHFLTMLAQSYGSTGDQVYADRIRSMVDALTEVRSAL 152
>gi|383112514|ref|ZP_09933306.1| hypothetical protein BSGG_0614 [Bacteroides sp. D2]
gi|313693079|gb|EFS29914.1| hypothetical protein BSGG_0614 [Bacteroides sp. D2]
Length = 800
Score = 222 bits (565), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 168/560 (30%), Positives = 264/560 (47%), Gaps = 72/560 (12%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +V+L DS +AQQT+L Y+L L+ D+L+ F + A L Y WE + L G
Sbjct: 30 LQNVKL-LDSPFLQAQQTDLHYILALNPDRLLAPFLREAGLQPKAPSYTNWE--NTGLDG 86
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF 232
H GHYLSA ++M+A+T + ++ +++ +++ L Q+ +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKDIKA 146
Query: 233 DRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRM----TTWMVEYFYNRVQN 284
++ A L W P Y IHK AGL D Y YA + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARKMLIDLTDWMID-------- 198
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ S E+ L E GG+N+ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKLILDPLIKDEDKL 258
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQL---HKE--------------GHQLESSGTNIGHF 387
+G H+NT IP VIG + E++ D H H+ G N
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKSWSHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTK--------EIAYADYYERSLTN 439
+F + D E+C TYNML++++ L++ + + Y +YYER+L N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNNTQEPDPNYVNYYERALYN 378
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
+L Q + G +Y P+ PG Y + P S WCC G+G+E+ +K G+ IY
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGLENHTKYGEFIYA 432
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +YI +I S+L WK + + Q+ LR+ K +L
Sbjct: 433 HQRDT---LYINLFIPSQLTWKEQGVTLTQETRFPDDGKVTLRIDEAPKKK-----RTLM 484
Query: 560 LRIPTWTS-SNGAKATLNGQ-DLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
+RIP W + S G ++NG+ + + + GN +L +++ W D +T LP+ + E I D
Sbjct: 485 IRIPEWANQSKGYSISINGKRKIFIMAKGNQYLPLSRKWKKGDVITFNLPMRVSMEQIPD 544
Query: 617 DRPEYASIQAILYGPYVLAG 636
+ Y A LYGP VLA
Sbjct: 545 KKDYY----AFLYGPIVLAA 560
>gi|312131189|ref|YP_003998529.1| hypothetical protein Lbys_2513 [Leadbetterella byssophila DSM
17132]
gi|311907735|gb|ADQ18176.1| protein of unknown function DUF1680 [Leadbetterella byssophila DSM
17132]
Length = 737
Score = 222 bits (565), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 167/549 (30%), Positives = 262/549 (47%), Gaps = 68/549 (12%)
Query: 116 KEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC 175
+ + L+ V+L + + AQ +L+Y+L LD DKL+ +R A L E YG WE S
Sbjct: 18 QNIPLNQVKL-KEGVFKNAQDVDLKYILALDPDKLLAPYRIDAGLEKKAERYGNWE--SS 74
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--FD 233
L GH GHYLSA A+++AS+ LK+++ +VS L+ACQK+ G+GY+ P + ++
Sbjct: 75 GLDGHIGGHYLSALAMLYASSGEPELKKRLDYMVSELAACQKKNGNGYVGGIPQGKVFWE 134
Query: 234 RLE---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTT----WMVEYFYN 280
R+ L W P Y IHK+ AGL D Y + N EAL + T WM+E F
Sbjct: 135 RIGKGDIDGSSFGLNNTWVPLYNIHKLFAGLYDAYHFTGNNEALTVLTGLSDWMIELFSA 194
Query: 281 RVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQ 340
++K L E GG+N+ ++ T + K+L A F + FL +
Sbjct: 195 LTDEQVEK--------VLRTEHGGLNEAFLDVYSATGEQKYLRAAERFTQKAFLQPMIEG 246
Query: 341 ADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNIGHFNFK 390
D ++G H+NT IP ++G++ +VT +Q +G H+ + G N +F
Sbjct: 247 KDILTGLHANTQIPKMVGAEKISQVTKNQDWHKGASYFWDNVALHRSVAFGGNSYREHF- 305
Query: 391 SDPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTE 449
+ R L++N E+C +YNMLK+S+ L+ T + Y D+YE++L N +L Q E
Sbjct: 306 HELDRFDKMLETNQGPETCNSYNMLKLSKALYESTGDNKYLDFYEKTLFNHILSSQH-PE 364
Query: 450 PGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVY 509
G +Y P+ P Y + P S WCC GTG+E+ +K G+ I+ G +
Sbjct: 365 KGGFVYFTPIRP-----NHYRVYSQPETSMWCCVGTGLENHTKYGEMIFSRRAGV---LQ 416
Query: 510 IIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN 569
+ I+++L+ S + ++ K PY T G ++ RIP W
Sbjct: 417 VNLLIAAKLEGHS--VTLDTKY-------PY-ENTAVLRVDGE---KTVKWRIPAWMDE- 462
Query: 570 GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
K T+NG+ + F T ++ L+ Q + Q+ P A Y
Sbjct: 463 -VKFTVNGKKVNPKMESGFAVFTGLKKAEIHLSFQPKMG------QEFLPNDQKWAAFTY 515
Query: 630 GPYVLAGHS 638
GP VLA +
Sbjct: 516 GPLVLAAET 524
>gi|295132897|ref|YP_003583573.1| glycosyl hydrolase [Zunongwangia profunda SM-A87]
gi|294980912|gb|ADF51377.1| putative glycosyl hydrolase [Zunongwangia profunda SM-A87]
Length = 797
Score = 222 bits (565), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 168/563 (29%), Positives = 258/563 (45%), Gaps = 61/563 (10%)
Query: 105 FKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG 164
KV + + E L +V L D A+ N+ LL DVD+L+ +RK A L
Sbjct: 21 LKVSAQEKLYTNEFPLENVTL-LDGKFKNARDLNMSVLLQYDVDRLLAPYRKEAGLEPRK 79
Query: 165 EPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQ-------K 217
Y WE L GH GHYLSA A+ +A+T N+ +M+ ++ L CQ
Sbjct: 80 PSYPNWEG----LDGHIGGHYLSALAMNYAATDNQEFLARMNYMLKELRECQLANTKKHP 135
Query: 218 EIGSGYLSAFPTEQ-----FDR--LEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRM 270
E G GY+ FP + F + E WAP+Y +HK+ AGL D + YAD+ +A M
Sbjct: 136 EWGVGYVGGFPNSEALWSSFKKGNFEKYNSAWAPFYNLHKMYAGLRDAWLYADSEKAKEM 195
Query: 271 ----TTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAH 326
W + + K S E+ LN E GGM +V + IT + K+L A
Sbjct: 196 FLDFCDWGI--------TLTKDLSHEQMQSVLNMEHGGMPEVYADAYQITGEKKYLEAAK 247
Query: 327 LFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQ-LHKEGHQLESSGTNIG 385
+ L L+ D++ H+NT IP +G + EV GD+ K G + T
Sbjct: 248 RYSHEQVLHPLSKGIDNLDNKHANTQIPKFVGFERIAEVDGDEKFAKAGSYFWETVTKNR 307
Query: 386 HFNFKSDPKR-----LASNLDSNTE----ESCTTYNMLKVSRHLFRWTKEIAYADYYERS 436
F + ++ ++++D E ESC +YNMLK++ LFR E YADYYER+
Sbjct: 308 SLAFGGNSRKEHFPSTSASIDYINEDDGPESCNSYNMLKLTEDLFRVNPEAKYADYYERT 367
Query: 437 LTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDS 496
L N +L Q + G +Y P P R Y + P ++ WCC GTG+E+ K
Sbjct: 368 LYNHILSTQH-PQHGGYVYFTPARP-----RHYRIYSAPEEAMWCCVGTGMENHGKYNQF 421
Query: 497 IYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT 556
IY + +YI +I S L+W+ + + Q+ + L++T +G+
Sbjct: 422 IYTHQGD---SLYINLFIPSELNWEKQGVKIRQETNFPSEEGTSLKIT-----EGTA-EF 472
Query: 557 SLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQ 615
L LR P W K +N +++ L P +++ + + W D + + LP+ E +
Sbjct: 473 PLFLRYPGWIKEGEMKIKINSEEIELIGKPSSYVKIDRNWQKGDIVDVSLPMHNHMERLP 532
Query: 616 DDRPEYASIQAILYGPYVLAGHS 638
+ P+Y A +GP +L S
Sbjct: 533 -NVPQYV---AFFHGPILLGAPS 551
>gi|86140890|ref|ZP_01059449.1| putative secreted protein [Leeuwenhoekiella blandensis MED217]
gi|85832832|gb|EAQ51281.1| putative secreted protein [Leeuwenhoekiella blandensis MED217]
Length = 1004
Score = 221 bits (563), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 185/628 (29%), Positives = 284/628 (45%), Gaps = 102/628 (16%)
Query: 93 AMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVW 152
A + K P + V + + L EV+L++ LG S + ++ L + D ++
Sbjct: 338 ATVLVKAVQPSKTPVRKLTSFALNEVNLNNTSLGDHSKFIENRNKFIDTLAQTNPDSFLY 397
Query: 153 NFRKT--ARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTH-----NESLKEKM 205
FR P P G W+ +LRGH GHYL+A A +AST ++ ++KM
Sbjct: 398 MFRNAFGQEQPEGATPLGVWDTQETKLRGHATGHYLTAIAQAYASTGYDKALQKNFEDKM 457
Query: 206 SAVVSAL------------------------------SACQKEI------------GSGY 223
+ +V+ L +A ++ G G+
Sbjct: 458 NYMVNTLYDLSQLSGKPKTEGGAYVEDPSSVPPGPGSTAYTSDLSEDGIRTDYWNWGKGF 517
Query: 224 LSAFPTEQFDRLE-------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
+SA+P +QF LE VWAPYYT+HKILAGL+D Y + N +AL++ M
Sbjct: 518 ISAYPPDQFIMLEHGAKYGGQETQVWAPYYTLHKILAGLIDVYEVSGNPKALQVAEGMAA 577
Query: 277 YFYNRVQNVIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFL 334
+ + R+ + + I W T + E GG+N+ L L IT ++L A LFD F
Sbjct: 578 WVHTRLSKLPTETLITM-WNTYIAGELGGINESLAHLHRITGKSEYLETAKLFDNIKVFY 636
Query: 335 G------LLALQADDISGFHSNTHIPIVIG---------SQMRYEVTGDQLHKEGHQ-LE 378
G LA D G H+N HIP ++G S Y + + +K + +
Sbjct: 637 GDAEHTHGLAKNVDTYRGLHANQHIPQIMGALELYRNSNSPEYYHIADNFWYKTKNDYMY 696
Query: 379 SSGTNIGHFN------FKSDPKRLASNLDS--NTEESCTTYNMLKVSRHLFRWTKEIAYA 430
S G G N F + P L N S E+C TYNMLK++R LF + ++
Sbjct: 697 SIGGVAGARNPANAECFVAQPATLYENGLSAGGQNETCGTYNMLKLTRGLFFYNQQPELM 756
Query: 431 DYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESF 490
DYYE++L N +L P Y +PL PGS K+ S F CC GT IES
Sbjct: 757 DYYEQALYNQILASVAENSPA-NTYHIPLRPGSRKQFS----NADMSGFTCCNGTAIESS 811
Query: 491 SKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSK 550
+KL +SIYF+ +Y+ ++ S L WK +V+ Q+ S+ LT + K
Sbjct: 812 TKLQNSIYFKSVDN-KALYVNLFVPSTLTWKEQDVVITQE----TSFPREDHTKLTVNGK 866
Query: 551 GSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTL 609
G LNLRIP W ++ G + +NG+ + G++LS+ + W + D + +++P T
Sbjct: 867 GK---FELNLRIPGWATA-GVELKINGKTQKIAIEAGSYLSLDRKWKNGDTIELKMPFTF 922
Query: 610 RTEAIQDDRPEYASIQAILYGPYVLAGH 637
+ I D +I ++ YGP +LA
Sbjct: 923 HLDPIMDQE----NIASLFYGPVLLAAQ 946
>gi|410638732|ref|ZP_11349285.1| acetyl-CoA carboxylase, biotin carboxylase [Glaciecola lipolytica
E3]
gi|410141260|dbj|GAC16490.1| acetyl-CoA carboxylase, biotin carboxylase [Glaciecola lipolytica
E3]
Length = 818
Score = 221 bits (563), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 168/548 (30%), Positives = 259/548 (47%), Gaps = 62/548 (11%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
L++VS+ D AQQTN+ YLL + DKL+ + + A L + YG WE +
Sbjct: 54 LQQVSIFDGPFA------HAQQTNVGYLLAIQPDKLLAPYLREAGLEPKVDSYGNWE--N 105
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--F 232
L GH GHYLSA +L WA+T + LK ++ +++ L Q G GYL P + +
Sbjct: 106 TGLDGHIGGHYLSALSLAWAATQDTELKRRLDYMLNELQKAQNANG-GYLGGIPNGKVMW 164
Query: 233 DRLE---------ALIPVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFY 279
D ++ +L W P Y I KI GL D Y A++ +A L + WM++
Sbjct: 165 DEIKQGNIKADLFSLNDRWVPLYNIDKIFHGLRDAYLIANSEQAKTMLLSLGQWMLD--- 221
Query: 280 NRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
V S E+ Q L E GG+N+V + I+ D +L LA F + L
Sbjct: 222 -----VTNNLSDEQIQQMLYSEHGGLNEVFADMSTISGDKAYLELARKFSHKRIIDPLVA 276
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQL------ESSGTNIG------HF 387
D+++G H+NT IP +IG+ ++ D+ KE + + IG HF
Sbjct: 277 HKDELNGLHANTQIPKIIGALKVAQLNNDESWKEAARFFWETVTKQRSVAIGGNSVREHF 336
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
+ +D + D E+C TYNM+K+S+ LF T + Y DYYER+ N +L Q
Sbjct: 337 HDAADFSPMVE--DPEGPETCNTYNMIKLSKLLFLQTADTRYLDYYERATYNHILSSQH- 393
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
E G ++Y + PG Y + + DS WCC G+GIE+ SK G+ IY
Sbjct: 394 PEHGGLVYFTSMRPG-----HYRMYSSVQDSMWCCVGSGIENHSKYGELIY---SHSVDN 445
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+ + +ISS L W + + + S + +++ + K G LN+R P W S
Sbjct: 446 LSVNLFISSTLRWPEKGLKLTLETQFPDSQNVVIKLH-QLAEKQMG-EFVLNIRKPAWFS 503
Query: 568 SNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAI 627
+ + NG+ + ++ + + W D+L+ +L L TE + D + Y A+
Sbjct: 504 HDISMFK-NGEKINYVENEGYIQIQQNWQDGDELSFELAAGLSTEQLPDGQNYY----AV 558
Query: 628 LYGPYVLA 635
LYGP VLA
Sbjct: 559 LYGPVVLA 566
>gi|224537186|ref|ZP_03677725.1| hypothetical protein BACCELL_02063 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521241|gb|EEF90346.1| hypothetical protein BACCELL_02063 [Bacteroides cellulosilyticus
DSM 14838]
Length = 805
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 159/546 (29%), Positives = 249/546 (45%), Gaps = 54/546 (9%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L DVR+ + A N++ LL D D+L+ F + A LP E YG WE+ L G
Sbjct: 31 LGDVRITAGPFK-HACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEKDG--LDG 87
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT-----EQFDR 234
H GHYL+A A+ +A+T N K++M +VS + Q+ G G + FP E+ +
Sbjct: 88 HIGGHYLTALAIHYAATGNLECKKRMDYMVSEFARVQQANGDGSICGFPNSKKFAEEIRK 147
Query: 235 LEALI--PVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKK 288
I W +Y +HK AGL D + Y N +A L+ W V+ N +
Sbjct: 148 GNVGIVWNYWVAWYNMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISN-----LDD 202
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
+ER L+ E GGMN+V + +T +PK+L A F +A + D++ H
Sbjct: 203 RQMER---MLDNEFGGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMARRIDNLDNKH 259
Query: 349 SNTHIPIVIGSQMRYE------------VTGDQLHKE---GHQLESSGTNIGHFNFKSDP 393
+NT +P +G Q E +T + E H+ S G N +F
Sbjct: 260 ANTQVPKAVGYQRVAELNSKIAPDYNDFMTAAEFFWETVVSHRSLSLGGNSRGEHFPEAG 319
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
K + ESC T NMLK++ LFR ++ YAD+YER++ N +L Q E G
Sbjct: 320 KCSDYMHERQGPESCNTNNMLKLTEGLFRMHPKVEYADFYERAMYNHILSTQH-PEHGGY 378
Query: 454 IYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
+Y P P Y + P + WCC GTG+E+ K G IY + +Y+ +
Sbjct: 379 VYFTPACPS-----HYRVYSAPGKAMWCCVGTGMENHGKYGQFIYTHDMAD-NALYVNLF 432
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
I S L+WK +I + Q+ D P T + L +R P+W +
Sbjct: 433 IPSELNWKEKKIKIVQETD-----FPNEEGTTLTVNPSKATQFKLLIRYPSWVEQGKMQV 487
Query: 574 TLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
NG D + PG+++++ + WS D + ++ P+T++ E + P + +I+ GP
Sbjct: 488 VCNGVDYAKSAQPGSYIAIDRQWSKGDVVEVKTPMTVKIEEL----PNVPNAISIMRGPI 543
Query: 633 VLAGHS 638
+L +
Sbjct: 544 LLGART 549
>gi|427383714|ref|ZP_18880434.1| hypothetical protein HMPREF9447_01467 [Bacteroides oleiciplenus YIT
12058]
gi|425728419|gb|EKU91277.1| hypothetical protein HMPREF9447_01467 [Bacteroides oleiciplenus YIT
12058]
Length = 791
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 157/527 (29%), Positives = 247/527 (46%), Gaps = 49/527 (9%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A N++ LL DVD+L+ F K A L GE + WE L GH GHYLSA A+ +
Sbjct: 46 ACDLNVQILLQYDVDRLLAPFLKEAGLQPKGESFPNWEG----LDGHVGGHYLSALAIHY 101
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QFDRLEALIPVWAPYY 246
A+T N K++M ++S L CQ++ GY+ P + + + W P+Y
Sbjct: 102 AATGNVDCKKRMEYMISELKRCQQKHADGYVGGVPDGMKVWNEIKKGNVGIVWKYWVPWY 161
Query: 247 TIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN 306
+HKI AGL D + Y N EA M + ++ +I + E+ Q L E GGM+
Sbjct: 162 NLHKIYAGLRDAWIYGGNEEARMMFLELCDW----GMTIIAPLNDEQMEQMLANEFGGMD 217
Query: 307 DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVT 366
+V + +T D K+L A F L +A Q D++ H+NT +P V+G Q E+
Sbjct: 218 EVYADAYQMTGDMKYLNTAKRFSHKWLLDSMAAQVDNLDNKHANTQVPKVVGYQRIAELG 277
Query: 367 GDQLHKEGHQ------LESSGTNIG------HFNFKSDPKRLASNLDSNTEESCTTYNML 414
D+ ++ + + + ++G HF D K D ESC T NML
Sbjct: 278 HDKKYEVATEYFWNTVVYNRSLSLGGNSRREHFAAADDCKSYVE--DREGPESCNTNNML 335
Query: 415 KVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGT 474
K++ LFR E YAD+YER++ N +L Q E G +Y P Y +
Sbjct: 336 KLTEGLFRMHPEARYADFYERAMYNHILSTQH-PEHGGYVYFTSARPA-----HYRVYSA 389
Query: 475 PSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPV 534
P+ + WCC GTG+E+ K G+ IY + +++ +++S L+WK I + Q+
Sbjct: 390 PNSAMWCCVGTGMENHGKYGEFIYTH---AHDSLFVNLFVASELNWKEKGITLIQETRFP 446
Query: 535 VSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTK 593
L + + +K L +R P W N K G+D SP +++ + +
Sbjct: 447 DEESSRLTIRVKKPTK-----FKLLVRHPWWADGNDMKVLCKGKDYASGSSPSSYIVIER 501
Query: 594 TWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIG 640
TW + D + I P+ + EA+ P + +I+ GP +L G +G
Sbjct: 502 TWKNGDVVDITTPMKVHIEAL----PNVSEYISIMRGP-ILLGARMG 543
>gi|423223044|ref|ZP_17209513.1| hypothetical protein HMPREF1062_01699 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640313|gb|EIY34115.1| hypothetical protein HMPREF1062_01699 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 805
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 159/546 (29%), Positives = 248/546 (45%), Gaps = 54/546 (9%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L DVR+ + A N++ LL D D+L+ F + A LP E YG WE+ L G
Sbjct: 31 LGDVRITAGPFK-HACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEKDG--LDG 87
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT-----EQFDR 234
H GHYL+A A+ +A+T N K++M +VS + Q+ G G + FP E+ +
Sbjct: 88 HIGGHYLTALAIHYAATGNLECKKRMDYMVSEFARVQQANGDGSICGFPNSKKFAEEIRK 147
Query: 235 LEALI--PVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKK 288
I W +Y +HK AGL D + Y N +A L+ W V+ N +
Sbjct: 148 GNVGIVWNYWVAWYNMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISN-----LDD 202
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
+ER L+ E GGMN+V + +T +PK+L A F +A D++ H
Sbjct: 203 RQMER---MLDNEFGGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMARHIDNLDNKH 259
Query: 349 SNTHIPIVIGSQMRYE------------VTGDQLHKE---GHQLESSGTNIGHFNFKSDP 393
+NT +P +G Q E +T + E H+ S G N +F
Sbjct: 260 ANTQVPKAVGYQRVAELNSKTAPDYNDFMTAAEFFWETVVSHRSLSLGGNSRGEHFPEAG 319
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
K + ESC T NMLK++ LFR ++ YAD+YER++ N +L Q E G
Sbjct: 320 KCSDYMHERQGPESCNTNNMLKLTEGLFRMHPKVEYADFYERAMYNHILSTQH-PEHGGY 378
Query: 454 IYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
+Y P P Y + P + WCC GTG+E+ K G IY + +Y+ +
Sbjct: 379 VYFTPACPS-----HYRVYSAPGKAMWCCVGTGMENHGKYGQFIYTHDMAD-NALYVNLF 432
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
I S L+WK +I + Q+ D P T + L +R P+W +
Sbjct: 433 IPSELNWKEKKIKIVQETD-----FPNEEGTTLTVNPSKATQFKLLIRYPSWVEQGKMQV 487
Query: 574 TLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
NG D + PG+++++ + WS D + ++ P+T++ E + P + +I+ GP
Sbjct: 488 VCNGVDYAKSAQPGSYIAIDRQWSKGDVVEVKTPMTVKIEEL----PNVPNAISIMRGPI 543
Query: 633 VLAGHS 638
+L +
Sbjct: 544 LLGART 549
>gi|380512705|ref|ZP_09856112.1| hypothetical protein XsacN4_15862 [Xanthomonas sacchari NCPPB 4393]
Length = 799
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 162/531 (30%), Positives = 246/531 (46%), Gaps = 57/531 (10%)
Query: 136 QTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWAS 195
QTN YLL L+ D+L+ NF + A LP G YGGWE + + GH +GHYLSA A M A
Sbjct: 74 QTNRRYLLELEPDRLLHNFLQYAGLPPKGAVYGGWEGDT--IAGHTLGHYLSALAKMHAQ 131
Query: 196 THNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEA------------------ 237
T + L+E++ +V+ L+ Q + GY+ F T + D+ E
Sbjct: 132 TRDPVLRERIDYIVAELARAQAQDPDGYVGGF-TRKNDKGEIEGGKAVLEDVRRGIIKGS 190
Query: 238 ---LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERH 294
L W+P YT HK+ AGLLD + A + +AL + + Y V +
Sbjct: 191 KFNLNGSWSPLYTQHKLFAGLLDAHALAGSKQALEVLLPLAAY----TAGVFDALDHAQM 246
Query: 295 WQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIP 354
L+ E GG+N+ +L T D + + + + A D++ H+NT +P
Sbjct: 247 QTLLDTEFGGLNESYIELGARTGDARWVAIGKRLRHEKVIDPAAAGRDELPHIHANTQVP 306
Query: 355 IVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNFKSDPKRLASNLDSNT 404
IG ++EV GD H G N F+ +P +A+ L T
Sbjct: 307 KFIGEARQFEVAGDADAAAAARFFWETVTAHYSYVIGGNADREYFQ-EPDTIAAFLTEQT 365
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSS 464
E C +YNMLK++RHL++WT + Y DYYER+L N + Q G+ Y+ P+ G
Sbjct: 366 CEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAAQHPAT-GMFTYMTPMISGG- 423
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER + DSFWCC G+G+E+ ++ GD+IY+++ +Y+ YI SRLDW
Sbjct: 424 -ERGF---SDKFDSFWCCVGSGMEAHAQFGDAIYWQDATS---LYVNLYIPSRLDWTERD 476
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS 584
+ + ++D V + +V L G L LR+P W A +NG
Sbjct: 477 LAL--ELDSGVPDNG--KVRLQVLRAGQRAPRRLLLRVPAWCQGRYA-LRVNGSPARAAL 531
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+L++ + W + D + + L LR E D A ++ GP LA
Sbjct: 532 VDGYLTLERDWRAGDVIDLDLATPLRLEHAAGD----ADTVVVMRGPLALA 578
>gi|383641062|ref|ZP_09953468.1| glycosylase [Streptomyces chartreusis NRRL 12338]
Length = 900
Score = 219 bits (558), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 150/467 (32%), Positives = 228/467 (48%), Gaps = 49/467 (10%)
Query: 222 GYLSAFPTEQFDRLEA-----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE+ VWAPYYT HKIL GLLD Y D+ AL + + M +
Sbjct: 349 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGLLDAYGATDDDRALDLASGMCD 408
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ ++R+ + + +++R W + E GG+ + + L IT +HL LA LFD +
Sbjct: 409 WMHSRLSK-LPESTLQRMWGIFSSGEFGGIVEAVCDLHTITGKAEHLALAQLFDLDRLID 467
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLES-SGTNI 384
A D + G H+N HIPI G Y+ TG++ + H++ GT+
Sbjct: 468 ACAANTDILDGLHANQHIPIFTGYVRLYDETGEERYLTSAKNFWDMVVPHRMYGIGGTST 527
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
F D +A + + T E+C YNMLK+SR LF ++ Y DYYER+L N VLG
Sbjct: 528 QEFWKARDV--IAGTISATTAETCCAYNMLKLSRTLFFHEQDPKYMDYYERALYNQVLGS 585
Query: 445 QR---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
++ E ++ Y + L PG ++ TP CC GTG+ES +K DS+YF
Sbjct: 586 KQDKPDAEKPLVTYFIGLTPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYF-A 638
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
+ +Y+ Y S L W + V Q + TL F + T L LR
Sbjct: 639 KADGSALYVNLYSPSTLTWAEKGVTVTQ----TTGFPEEQGSTLAFGGGRASFT--LRLR 692
Query: 562 IPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
+P+W ++ G + T+NG+ + P PGN+ V++TW + D + I +P R E DD
Sbjct: 693 VPSWATA-GFRVTVNGRAVSGTPKPGNYFEVSRTWRAGDTVRIAMPFRTRVEKALDD--- 748
Query: 621 YASIQAILYGPYVLAGH-------SIGDWDITESATSLSDWITPIPA 660
S+Q + +GP L +G + + LS +TP+P
Sbjct: 749 -PSLQTLFHGPVNLVARDAATEYLKVGLYRDAGLSGDLSHSLTPVPG 794
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 54/110 (49%), Gaps = 6/110 (5%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWE-- 171
++ +L DV L + ++ L++ DV++L+ FR A LP G GGWE
Sbjct: 10 VQPFALEDVAL-RPGLFAEKRRLMLDHARGYDVNRLLQVFRANAGLPTGGAVAPGGWEGL 68
Query: 172 --EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
E + LRGH+ GH+L+ A + T +++ +V AL+ + +
Sbjct: 69 DGEANGNLRGHYTGHFLTMLAQAYRGTKERVFADRIGTMVGALTEVRAAL 118
>gi|290958971|ref|YP_003490153.1| glycosylase [Streptomyces scabiei 87.22]
gi|260648497|emb|CBG71608.1| putative secreted glycosylase [Streptomyces scabiei 87.22]
Length = 936
Score = 219 bits (558), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 143/438 (32%), Positives = 222/438 (50%), Gaps = 41/438 (9%)
Query: 222 GYLSAFPTEQFDRLEALIP-----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE++ VWAPYYT HKIL GLLD Y + D+ AL + + + +
Sbjct: 384 GFLAAYPETQFIELESMTSGDYTRVWAPYYTAHKILRGLLDAYLHVDDERALDLASGLCD 443
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ Y+R+ + +++R W + E GG+ + + L+ IT HL LA LFD +
Sbjct: 444 WMYSRLSK-LPDATLQRMWGIFSSGEYGGLVEAIVDLYAITGKADHLALARLFDLDKLID 502
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE------GHQLESSGTNIGHFN- 388
A D + G H+N HIPI G Y+VTG+ + G + IG +
Sbjct: 503 ACAANTDTLDGLHANQHIPIFTGLVRLYDVTGEARYLSAAKNFWGMVIPQRMYGIGGTST 562
Query: 389 --FKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
F +A + E+C YN+LK+SR LF ++ Y DYYER+L N VLG ++
Sbjct: 563 AEFWKARGAVAGTISDTNAETCCAYNLLKLSRSLFFHEQDPKYMDYYERALLNQVLGSKQ 622
Query: 447 ---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEG 503
E ++ Y + L PG ++ TP CC GTG+ES +K DS+YF
Sbjct: 623 DKADAEKPLVTYFIGLEPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYFARAD 676
Query: 504 KYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT-SLNLRI 562
+Y+ Y ++ LDW + + + Q D Y R T + G G ++ LR+
Sbjct: 677 G-SALYVNLYSAATLDWSAKGVTIAQSTD-------YPREQGTTITVGGGGAAFAMRLRV 728
Query: 563 PTWTSSNGAKATLNGQDLP-LPSPGNFLSV-TKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
P+W ++ G + T+NG + P PG++ ++ ++TW D + + +P LRTE DD+
Sbjct: 729 PSWATA-GFRVTVNGGVVDGTPDPGSYFTIPSRTWDDGDVVRVSIPFRLRTEKALDDQ-- 785
Query: 621 YASIQAILYGPYVLAGHS 638
S+Q + YGP L G +
Sbjct: 786 --SLQTLFYGPVNLVGRN 801
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 60/121 (49%), Gaps = 6/121 (4%)
Query: 107 VPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE- 165
VP S ++ L DV LG + ++ L++ DVD+L+ FR A L G
Sbjct: 37 VPTPSAWSVRPFELKDVTLGQ-GLFAEKRRLMLDHGRGYDVDRLLQVFRANAGLSTKGAV 95
Query: 166 PYGGWE----EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS 221
GGWE E + LRGH+ GH+L+ A A T + +++ ++ AL+ ++ + +
Sbjct: 96 APGGWEGLDGEANGNLRGHYTGHFLTMLAQAHAGTRDTVYSDRIRYMIGALAEVREALRT 155
Query: 222 G 222
G
Sbjct: 156 G 156
>gi|429195121|ref|ZP_19187172.1| hypothetical protein STRIP9103_04852 [Streptomyces ipomoeae 91-03]
gi|428669175|gb|EKX68147.1| hypothetical protein STRIP9103_04852 [Streptomyces ipomoeae 91-03]
Length = 936
Score = 219 bits (557), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 142/440 (32%), Positives = 223/440 (50%), Gaps = 45/440 (10%)
Query: 222 GYLSAFPTEQFDRLEALIP-----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE++ VWAPYYT HKIL GLLD Y D+A AL + + + +
Sbjct: 384 GFLAAYPETQFIELESMTSGDYTRVWAPYYTAHKILRGLLDAYLNVDDARALDLASGLCD 443
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ Y+R+ + +++R W + E GG+ + + L+ IT +HL LA LFD +
Sbjct: 444 WMYSRLSK-LPDATLQRMWGIFSSGEFGGLVEAIVDLYTITGKAEHLALARLFDLDKLID 502
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ-----------LESSGTNI 384
A D + G H+N HIPI G Y+ TG+ + + GT+
Sbjct: 503 ACAANTDTLDGLHANQHIPIFTGLARLYDATGEVRYLTAAKNFWGMVVPPRMYGIGGTST 562
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G F +K+ +A + E+C YN+LK+SR LF ++ Y DYYER+L N VLG
Sbjct: 563 GEF-WKAR-GVIAGTISDTNAETCCAYNLLKLSRTLFFHEQDPKYMDYYERALLNQVLGS 620
Query: 445 QR---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
++ E ++ Y + L PG ++ TP CC GTG+ES +K DS+YF +
Sbjct: 621 KQDKTDAEKPLVTYFIGLKPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYFTK 674
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT-SLNL 560
+Y+ Y ++ L+W + + V Q D Y R + + G G L L
Sbjct: 675 ADG-SALYVNLYSATTLNWSAKGVTVTQTTD-------YPREQGSTITIGGGSAAFELRL 726
Query: 561 RIPTWTSSNGAKATLNGQDLP-LPSPGNFLSV-TKTWSSDDKLTIQLPLTLRTEAIQDDR 618
R+P+W ++ G + T+NG + P+ G++ ++ ++TW D + + +P LR E DD
Sbjct: 727 RVPSWATA-GFRVTVNGGAVSGTPTAGSYFTISSRTWRGGDVVRVTMPFRLRVEKALDD- 784
Query: 619 PEYASIQAILYGPYVLAGHS 638
S+Q + YGP L G +
Sbjct: 785 ---PSLQTLFYGPVNLVGRN 801
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 56/110 (50%), Gaps = 6/110 (5%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWE-- 171
++ L DV LG + +Q L++ DVD+L+ FR A L G GGWE
Sbjct: 45 VRPFELKDVTLG-QGLFAGKRQLMLDHGRGYDVDRLLQVFRANAGLSTKGAVAPGGWEGL 103
Query: 172 --EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
E + LRGH+ GH+L+ A +AST + +K+ +V AL+ + +
Sbjct: 104 DGEANGNLRGHYTGHFLTTLAQAYASTADTVYADKIRYMVGALTEVRAAL 153
>gi|317476510|ref|ZP_07935758.1| hypothetical protein HMPREF1016_02741 [Bacteroides eggerthii
1_2_48FAA]
gi|316907322|gb|EFV29028.1| hypothetical protein HMPREF1016_02741 [Bacteroides eggerthii
1_2_48FAA]
Length = 793
Score = 218 bits (556), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 162/547 (29%), Positives = 256/547 (46%), Gaps = 66/547 (12%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A+ N+ LL + D+L+ +RK A L E Y W+ L GH GHYL+A A+
Sbjct: 42 ARDLNINTLLKYNCDRLLAPYRKEAGLTPKAECYPNWDG----LDGHVGGHYLTAMAIN- 96
Query: 194 ASTHNESLKEKMSAVVSALSACQK-------EIGSGYLSAFPTEQ-----FDRLEALI-- 239
A+T NE +++M ++ ++ C + E G GY+ P Q F + + +
Sbjct: 97 AATGNEECRKRMEYIIKEIAECAEANRKNHPEWGVGYMGGMPNSQNIWSNFKKGDFRVYS 156
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHW 295
WAP+Y +HK+ AGL D + Y N +A L+ W ++ V S ++
Sbjct: 157 GSWAPFYNLHKMYAGLRDAWLYCGNEQAKDLFLQFCDWAID--------VTSNLSDKQME 208
Query: 296 QTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPI 355
Q L E GGMN+VL + IT + K+L A F L + D + H+NT +P
Sbjct: 209 QMLGNEHGGMNEVLADAYAITHEQKYLDCAKRFSHKQLFTPLLQRQDCLDNLHANTQVPK 268
Query: 356 VIGSQMRYEVTGDQLHK----------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTE 405
IG + E++G++ + G + + G N +F + + D +
Sbjct: 269 AIGFERISELSGNEDYHMASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDIDGP 328
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
ESC T NMLK++ +L R E YADYYE + N +L Q G +Y P P
Sbjct: 329 ESCNTNNMLKLTENLHRRNPEARYADYYELATFNHILSTQHPKHGGY-VYFTPARP---- 383
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
R Y ++ P+++ WCC GTG+E+ K G IY +++ Y +S+LDWK I
Sbjct: 384 -RHYRNYSAPNEAMWCCVGTGMENHGKYGQFIYTHVGD---ALFVNLYAASQLDWKKRGI 439
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPS 584
+ Q+ S + L +T +G G +L +R P W K ++NGQ + +
Sbjct: 440 TLRQETTFPYSENSTLTIT-----EGKG-AFNLMVRYPEWVHPGEFKVSVNGQSVDVITG 493
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI 644
P +++S+ + W D + I P+ + ++ P+Y A +YGP +L G G
Sbjct: 494 PSSYVSINRKWKKGDVVNISFPMHASLRYLPNE-PQYV---AFMYGP-ILLGMKTG---- 544
Query: 645 TESATSL 651
TES TSL
Sbjct: 545 TESMTSL 551
>gi|297191370|ref|ZP_06908768.1| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
gi|197720620|gb|EDY64528.1| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
Length = 942
Score = 218 bits (556), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 152/467 (32%), Positives = 233/467 (49%), Gaps = 50/467 (10%)
Query: 222 GYLSAFPTEQFDRLEALIP-----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE++ VWAPYYT HKIL GLLD + + AL + + M +
Sbjct: 393 GFLAAYPETQFITLESMTSPDYTVVWAPYYTAHKILKGLLDAHLSTGDVRALDLASGMCD 452
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ ++R+ ++ + R W + E GGM + + + +T +HL LA +FD +
Sbjct: 453 WMHSRLA-LLPSATRRRMWGLFSSGEYGGMVEAVVDVHSLTGRAEHLELARMFDLDPLID 511
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ-----------LESSGTNI 384
A D +SG H+N HIPI G ++ TG++ + + GT+
Sbjct: 512 ACAENRDVLSGLHANQHIPIFTGLIRLHDATGEERYLTAARNFWDMVVPTRMYGIGGTST 571
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
G F D +A L T E+C +NMLK+SR LF ++ YAD+YER+L N +LG
Sbjct: 572 G--EFWRDAGVIAGTLGDTTAETCCAHNMLKLSRLLFLHEQDPKYADHYERTLFNQILGS 629
Query: 445 QR---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
++ E +M Y + LAPG+ ++ TP CC GTGIES +K DS+YF
Sbjct: 630 KQDLADAELPLMTYFIGLAPGAVRDF------TPKQGTTCCEGTGIESATKYQDSVYFRT 683
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
G+Y+ Y++S LDW + V Q LR+ GSG T L+LR
Sbjct: 684 R-DGSGLYVNLYMASTLDWTDRGVRVTQTTRFPYEQGSTLRIA------GSG-TFDLHLR 735
Query: 562 IPTWTSSNGAKATLNGQ-DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
+P W + G +NG+ +PG++L+V++ W D + I +P TLRTE DD
Sbjct: 736 VPHWADA-GFFVRVNGRAHHGGAAPGSYLTVSRAWRDGDTVEISMPFTLRTEPALDDH-- 792
Query: 621 YASIQAILYGP-YVLAGHS------IGDWDITESATSLSDWITPIPA 660
+Q ++YGP +++A H G + + L +TP+P
Sbjct: 793 --DVQCLMYGPVHLVARHEQREFLRFGLFPSASLSGDLVQALTPVPG 837
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 5/86 (5%)
Query: 139 LEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWE----EPSCELRGHFVGHYLSASALMW 193
L++ DV +L+ FR A L G GGWE E LRGHF GH+LS + +
Sbjct: 77 LDFGRSYDVHRLLQVFRANAGLSTRGAVAPGGWEGLDGEARGNLRGHFTGHFLSMLSQAY 136
Query: 194 ASTHNESLKEKMSAVVSALSACQKEI 219
ST + +K+ +V L+ C++ +
Sbjct: 137 VSTREQVFADKIGTMVDGLAECREAL 162
>gi|189464749|ref|ZP_03013534.1| hypothetical protein BACINT_01093 [Bacteroides intestinalis DSM
17393]
gi|189437023|gb|EDV06008.1| hypothetical protein BACINT_01093 [Bacteroides intestinalis DSM
17393]
Length = 805
Score = 218 bits (556), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 160/546 (29%), Positives = 249/546 (45%), Gaps = 54/546 (9%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L DVR+ + A N++ LL D D+L+ F + A LP E YG WE+ L G
Sbjct: 31 LGDVRITAGPFK-HACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEKDG--LDG 87
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT-----EQFDR 234
H GHYLSA A+ +A+T N+ K++M +VS + Q+ G + FP E+ +
Sbjct: 88 HIGGHYLSALAIHYAATGNQECKKRMDYMVSEFARVQQANDDGSICGFPNSKKFAEEIRK 147
Query: 235 LEALI--PVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKK 288
I W +Y +HK AGL D + Y N +A L+ W V+ N +
Sbjct: 148 GNVGIVWNYWVAWYNMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISN-----LDD 202
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
+ER L+ E GGMN+V + +T +PK+L A F + + D++ H
Sbjct: 203 RQMER---MLDNEFGGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMTRRIDNLDNKH 259
Query: 349 SNTHIPIVIGSQMRYE------------VTGDQLHKEG---HQLESSGTNIGHFNFKSDP 393
+NT +P +G Q E +T + E H+ S G N +F
Sbjct: 260 ANTQVPKAVGYQRVAELNSKTASDYNEFMTAAEFFWETVVFHRSLSLGGNSRGEHFPEAG 319
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
K + ESC T NMLK++ LFR ++ YAD+YER+L N +L Q E G
Sbjct: 320 KCSDYMHERQGPESCNTNNMLKLTEGLFRIHPKVEYADFYERALYNHILSTQH-PEHGGY 378
Query: 454 IYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
+Y P P Y + P ++ WCC GTG+E+ K G IY + +Y+ +
Sbjct: 379 VYFTPACPS-----HYRVYSAPGEAMWCCVGTGMENHGKYGQFIYTHDTVD-NALYVNLF 432
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
I S L+WK +I + Q+ D P T + L +R P+W +
Sbjct: 433 IPSELNWKEKKIKIVQETDF-----PNEEGTTLTVNPSKATQFKLLIRYPSWVEQGKMQV 487
Query: 574 TLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
+G D + PG+++++ + WS D + I+ P+T+R E + P + +I+ GP
Sbjct: 488 VCDGVDYAKNAQPGSYIAIDRQWSKGDVVEIKTPMTVRIEEL----PNVPNAISIMRGPI 543
Query: 633 VLAGHS 638
+L +
Sbjct: 544 LLGART 549
>gi|302549595|ref|ZP_07301937.1| secreted protein [Streptomyces viridochromogenes DSM 40736]
gi|302467213|gb|EFL30306.1| secreted protein [Streptomyces viridochromogenes DSM 40736]
Length = 943
Score = 218 bits (556), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 150/467 (32%), Positives = 229/467 (49%), Gaps = 49/467 (10%)
Query: 222 GYLSAFPTEQFDRLEA-----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE+ VWAPYYT HKIL GLLD YT D+ AL + + M +
Sbjct: 392 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGLLDAYTATDDDRALDLASGMCD 451
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ ++R+ + + +++R W + E GG+ + + L +T +HL LA LFD +
Sbjct: 452 WMHSRLSK-LPESTLQRMWGIFSSGEFGGIVEAICDLHTLTGKAEHLALAQLFDLDRLIE 510
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLES-SGTNI 384
A D + G H+N HIPI G Y+ TG++ + H++ GT+
Sbjct: 511 ACAANTDILDGLHANQHIPIFTGYVRLYDETGEERYLRSAKNFWDMVVPHRMYGIGGTST 570
Query: 385 GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI 444
F D +A + + T E+C YNMLK+SR LF ++ Y DYYER+L N VLG
Sbjct: 571 QEFWKARDV--IAGTISATTAETCCAYNMLKLSRTLFFHEQDPKYMDYYERALYNQVLGS 628
Query: 445 QR---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE 501
++ E ++ Y + L PG ++ TP CC GTG+ES +K DS+YF
Sbjct: 629 KQDKPDVEKPLVTYFIGLTPGHVRDY------TPKQGTTCCEGTGMESATKYQDSVYF-A 681
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
+ +Y+ Y S L W + V Q S+ TLT + T L LR
Sbjct: 682 QADGSALYVNLYSPSTLTWAEKGVTVTQS----TSFPREQGSTLTLGGGRASFT--LRLR 735
Query: 562 IPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPE 620
+P+W ++ G T+NG+ + P PG++ V++TW + D + I +P R E DD
Sbjct: 736 VPSWATA-GFGVTVNGRAVSGTPRPGSYFDVSRTWRAGDTVRIAMPFRTRVEKALDD--- 791
Query: 621 YASIQAILYGPYVLAGH-------SIGDWDITESATSLSDWITPIPA 660
S+Q + +GP L +G + + LS +TP+P
Sbjct: 792 -PSLQTLFHGPVNLVARDSATEYLKVGLYRDAGLSGDLSHSLTPVPG 837
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 56/110 (50%), Gaps = 6/110 (5%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWE-- 171
++ L DV LG + +Q L++ DV++L+ FR A L G GGWE
Sbjct: 53 VRPFGLEDVSLGR-GVFADKRQLMLDHARGYDVNRLLQVFRANAGLATGGAVAPGGWEGL 111
Query: 172 --EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
E + LRGH+ GH+L+ A + ST + +++ AVV AL+ + +
Sbjct: 112 DGEANGNLRGHYTGHFLTMLAQAYRSTKEQVFADRIGAVVGALTEVRAAL 161
>gi|344201935|ref|YP_004787078.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
gi|343953857|gb|AEM69656.1| protein of unknown function DUF1680 [Muricauda ruestringensis DSM
13258]
Length = 1022
Score = 218 bits (555), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 176/606 (29%), Positives = 277/606 (45%), Gaps = 102/606 (16%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKT--ARLPAPGEPYGGWEE 172
L +VSL+ G + + + L+ + D ++ FR P +P G W+
Sbjct: 379 LDQVSLNADAHGQQTKFIENRDKFINTLVQTNPDSFLYMFRNAFGQEQPEGAKPLGVWDS 438
Query: 173 PSCELRGHFVGHYLSASALMWAST-HNESLK----EKMSAVVSAL-----------SACQ 216
+LRGH GHYL+A A +AST ++++L+ +KM+ +V L A
Sbjct: 439 QETKLRGHATGHYLTAIAQAYASTGYDKALQANFADKMNYMVDVLYQLSQMSGQSAKAGG 498
Query: 217 KEI-------------------------------GSGYLSAFPTEQFDRLE-------AL 238
+ + G G++SA+P +QF LE
Sbjct: 499 EHVADPTAVPPGPGKSTYDSDLSENGIRTDYWNWGEGFISAYPPDQFIMLENGATYGTQP 558
Query: 239 IPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT- 297
VWAPYYT+HKILAGL+D Y + N +AL + M ++ Y R+ + I W T
Sbjct: 559 TQVWAPYYTLHKILAGLMDIYEVSGNEKALEIAKGMGDWVYARLSQLPTDTLISM-WNTY 617
Query: 298 LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLG------LLALQADDISGFHSN 350
+ E GGMN+ + +L IT +P++L +A LFD F G LA D G H+N
Sbjct: 618 IAGEFGGMNEAMARLDRITDEPRYLKVAQLFDNIKMFFGDAEHSHGLARNVDSFRGLHAN 677
Query: 351 THIPIVIG---------SQMRYEVTGDQLHKEGHQ-LESSGTNIGHFN------FKSDPK 394
HIP ++G S Y+V + +K + + S G G N F + P
Sbjct: 678 QHIPQIVGALEIYRDSESPEYYQVADNFWYKAKNDYMYSIGGVAGARNPTNAECFIAQPA 737
Query: 395 RLASNLDSN--TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV 452
L N S+ E+C TYNMLK++++LF + + DYYER L N +L P
Sbjct: 738 TLYENGFSSGGQNETCATYNMLKLTKNLFLFDQRTELMDYYERGLYNHILASVAEDSP-A 796
Query: 453 MIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
Y +PL PGS K + F CC GT +ES +KL +SIYF+ + +Y+
Sbjct: 797 NTYHVPLRPGSVK----RFGNSDMTGFTCCNGTALESSTKLQNSIYFKSQDN-STLYVNL 851
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
++ S L W I V QK ++ LT KG LN+R+P W ++ G
Sbjct: 852 FVPSTLKWAEKDITVEQK----TAFPKEDNTQLTIKGKGK---FDLNIRVPQW-ATKGFF 903
Query: 573 ATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+NG++ + + PG +L++++ W D + +++P + + D + +I ++ YGP
Sbjct: 904 VKINGKEEKVEAKPGTYLTLSRKWKDGDVIDLKMPFQFHLDPVMDQQ----NIASLFYGP 959
Query: 632 YVLAGH 637
+L
Sbjct: 960 VLLVAQ 965
>gi|312131938|ref|YP_003999278.1| hypothetical protein Lbys_3265 [Leadbetterella byssophila DSM
17132]
gi|311908484|gb|ADQ18925.1| protein of unknown function DUF1680 [Leadbetterella byssophila DSM
17132]
Length = 1004
Score = 218 bits (554), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 195/653 (29%), Positives = 293/653 (44%), Gaps = 113/653 (17%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKT--ARLPAPGEPYGGWEE 172
L V+L R D+ + ++ L D + ++ FR + P +P G W+
Sbjct: 361 LSAVTLEADRHQHDTKFIENRDKFIQGLAKTDPNSFLYMFRHAFGQKQPEGAKPLGVWDS 420
Query: 173 PSCELRGHFVGHYLSASALMWAST-HNESLKE----KMSAVV------SALSACQK---- 217
+ +LRGH GHYL+A A +AST ++++L+ KM +V S LS K
Sbjct: 421 QNTKLRGHATGHYLTAIAQAYASTGYDKNLQANFAGKMDQLVNTLYELSRLSGTPKVQGG 480
Query: 218 --------------------------------EIGSGYLSAFPTEQFDRLEALIP----- 240
G GY+SA+P +QF LE
Sbjct: 481 EAVADPTKVPMGPGKTEYDSDLTDEGIRTDYWNWGKGYISAYPPDQFIMLEQGAKYGGQK 540
Query: 241 --VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT- 297
VWAPYYT+HKILAGL+D Y + N +AL + M E+ + R+ + + ++ + W T
Sbjct: 541 NQVWAPYYTLHKILAGLMDVYEVSGNKKALDVAVGMSEWVHARLA-ALPQDTLIKMWNTY 599
Query: 298 LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLG------LLALQADDISGFHSN 350
+ E GGMN+ + +LF +T++ K L A LFD F G LA D G H+N
Sbjct: 600 IAGEYGGMNESMARLFFLTKNEKFLKTAQLFDNIKMFYGDASHSHGLARNVDTFRGLHAN 659
Query: 351 THIPIVIGSQMRYEVTGDQ---------LHKE-GHQLESSGTNIGHFN------FKSDPK 394
HIP ++GS Y V+ + H+ + S G G N F + P
Sbjct: 660 QHIPQIVGSIEMYAVSQNPDYYFIAENFWHRTVSDYMYSIGGVAGARNPANAECFIAQPA 719
Query: 395 RLASN--LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV 452
+ N E+C TYNMLK++ LF + ++ Y DYYER L N +L P
Sbjct: 720 TIYENGFSQGGQNETCATYNMLKLTSSLFMFDQKAEYMDYYERGLYNHILASVAKDSP-A 778
Query: 453 MIYLLPLAPGSSKERSYHHWGTPS-DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
Y +PL PGS K+ +G P+ F CC GT IES +KL +SIYF+ +Y+
Sbjct: 779 NTYHVPLRPGSIKQ-----FGNPNMTGFTCCNGTAIESNTKLQNSIYFKSLDN-STLYVN 832
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L+W+ I V Q LR+ +G+G L +R+P W + G
Sbjct: 833 LFIPSTLNWEEKGIKVVQTTSFPKEDQTKLRI------EGNG-KFDLQVRVPGW-AKKGF 884
Query: 572 KATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NG+ + +PG++ +++TW + D L I +P + + D+P AS + YG
Sbjct: 885 VVKINGKKQKIKATPGSYAKISRTWKNGDVLEITMPFEFHLDYVM-DQPNIAS---LFYG 940
Query: 631 PYVLAGHSI---GDW-DITESATSLSDWITPIPASY-----NSQLITFTQEYG 674
P +LA +W +T A LS I P + Q F + YG
Sbjct: 941 PVLLAAQETEARKEWRQVTFDAKDLSKNIKGNPETLEFTIDGVQFKPFYESYG 993
>gi|224537183|ref|ZP_03677722.1| hypothetical protein BACCELL_02060 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521238|gb|EEF90343.1| hypothetical protein BACCELL_02060 [Bacteroides cellulosilyticus
DSM 14838]
Length = 790
Score = 218 bits (554), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 159/547 (29%), Positives = 257/547 (46%), Gaps = 66/547 (12%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A+ N+E LL D D+L+ +RK A L + Y W+ L GH GHYL+A A+
Sbjct: 43 ARDLNIETLLKYDCDRLMAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMAIN- 97
Query: 194 ASTHNESLKEKMSAVVSALSACQK-------EIGSGYLSAFPTEQF-------DRLEALI 239
A+T NE +++M ++S ++ C + + G GY+ P Q
Sbjct: 98 AATGNEECRKRMEYIISEIAECAEANSKNHPQWGIGYMGGMPNSQNIWNGFKDGDFRVYS 157
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHW 295
WAP+Y +HK+ AGL D + Y N +A L+ W + ++ S E+
Sbjct: 158 GSWAPFYNLHKMYAGLRDAWLYCGNEQAKSLFLQFCNWAI--------HITSGLSDEQME 209
Query: 296 QTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPI 355
+ L E GGMN+VL + IT + K+L A F ++ + D + H+NT +P
Sbjct: 210 RMLGNEHGGMNEVLADAYAITHEQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQVPK 269
Query: 356 VIGSQMRYEVTGDQLHK----------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTE 405
VIG + E++G++ + G + + G N +F + + D +
Sbjct: 270 VIGFERISELSGNEDYHVASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDIDGP 329
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
ESC T NMLK++ L R E YADYYE + N +L Q E G +Y P P
Sbjct: 330 ESCNTNNMLKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTPARP---- 384
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
R Y ++ P+++ WCC GTG+E+ K G IY +++ Y +S+LDWK I
Sbjct: 385 -RHYRNYSAPNEAMWCCVGTGMENHGKYGQFIYTHAGD---ALFVNLYAASQLDWKERGI 440
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPS 584
+ Q+ + PY + ++G G T +L +R P W K ++NG+ + +
Sbjct: 441 TLRQE-----TAFPYSENSTITIAEGKG-TFNLMVRYPGWVHPGEFKVSVNGKPVDIITG 494
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI 644
P +++S+ + W D + I P+ + ++ P+Y A+++GP +L G G
Sbjct: 495 PSSYVSINRKWKKGDVVNINFPMHSSLRYLPNE-PQYV---ALMHGP-ILLGMKTG---- 545
Query: 645 TESATSL 651
TES SL
Sbjct: 546 TESMASL 552
>gi|423223047|ref|ZP_17209516.1| hypothetical protein HMPREF1062_01702 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640316|gb|EIY34118.1| hypothetical protein HMPREF1062_01702 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 790
Score = 217 bits (552), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 159/547 (29%), Positives = 256/547 (46%), Gaps = 66/547 (12%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A+ N+E LL D D+L+ +RK A L + Y W+ L GH GHYL+A A+
Sbjct: 43 ARDLNIETLLKYDCDRLMAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMAIN- 97
Query: 194 ASTHNESLKEKMSAVVSALSACQK-------EIGSGYLSAFPTEQF-------DRLEALI 239
A+T NE +++M ++S ++ C + + G GY+ P Q
Sbjct: 98 AATGNEECRKRMEYIISEIAECAEANCKNHPQWGVGYMGGMPNSQNIWNGFKDGDFRVYS 157
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHW 295
WAP+Y +HK+ AGL D + Y N +A L+ W + ++ S E+
Sbjct: 158 GSWAPFYNLHKMYAGLRDAWLYCGNEQAKSLFLQFCNWAI--------HITSGLSDEQME 209
Query: 296 QTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPI 355
+ L E GGMN+VL + IT + K+L A F ++ + D + H+NT +P
Sbjct: 210 RMLGNEHGGMNEVLADAYAITHEQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQVPK 269
Query: 356 VIGSQMRYEVTGDQLHK----------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTE 405
VIG + E++G++ + G + + G N +F + + D +
Sbjct: 270 VIGFERISELSGNEDYHVASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDIDGP 329
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
ESC T NMLK++ L R E YADYYE + N +L Q E G +Y P P
Sbjct: 330 ESCNTNNMLKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTPARP---- 384
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
R Y ++ P+++ WCC GTG+E+ K G IY +++ Y +S+LDWK I
Sbjct: 385 -RHYRNYSAPNEAMWCCVGTGMENHGKYGQFIYTHAGD---ALFVNLYAASQLDWKERGI 440
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPS 584
+ Q+ + PY + ++G G T +L +R P W K ++NG+ +
Sbjct: 441 TLRQE-----TAFPYSENSTITIAEGKG-TFNLMVRYPGWVHPGEFKVSVNGKPADIITG 494
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI 644
P +++S+ + W D + I P+ + ++ P+Y A+++GP +L G G
Sbjct: 495 PSSYVSINRKWKKGDVVNINFPMHSSLRYLPNE-PQYV---ALMHGP-ILLGMKTG---- 545
Query: 645 TESATSL 651
TES SL
Sbjct: 546 TESMASL 552
>gi|431799831|ref|YP_007226735.1| hypothetical protein Echvi_4552 [Echinicola vietnamensis DSM 17526]
gi|430790596|gb|AGA80725.1| putative glycosyl hydrolase of unknown function (DUF1680)
[Echinicola vietnamensis DSM 17526]
Length = 1042
Score = 217 bits (552), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 183/618 (29%), Positives = 272/618 (44%), Gaps = 108/618 (17%)
Query: 107 VPERSGEF--LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKT--ARLPA 162
VPE+S E L VSL G S + + L + D ++ FR PA
Sbjct: 388 VPEQSLEAFGLDAVSLETDIHGHSSKFIENRDKFISTLAGTNPDDFLYMFRNAFGQEQPA 447
Query: 163 PGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNES-----LKEKMSAVVSALSACQK 217
P G W+ +LRGH GHYL+A A +AST ++ +KM+ +V+ L +
Sbjct: 448 GAVPLGVWDSQETKLRGHATGHYLTAIAQAYASTGYDTALQANFADKMAYMVNTLYNLSQ 507
Query: 218 EIGS------------------------------------------GYLSAFPTEQFDRL 235
G GY+SA+P +QF L
Sbjct: 508 MAGKPSAEADGHNADPTAVPMGPGKDFYDSDLSEEGIRTDYWNWGEGYISAYPPDQFIML 567
Query: 236 EALIP-------VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
E VWAPYYT+HKILAGL+D Y + N +AL + M + R+ +
Sbjct: 568 EHGAKYGGQKDQVWAPYYTLHKILAGLMDIYEVSGNEKALSVAKGMGTWVAARLDKLPTS 627
Query: 289 YSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDK-PCFLG------LLALQ 340
I W T + E GGMN+ + +L+ IT ++L A LFD F G LA
Sbjct: 628 TLISM-WNTYIAGEFGGMNEAMARLYRITGSSRYLAAAKLFDNITVFYGNADHDHGLAKN 686
Query: 341 ADDISGFHSNTHIPIVIGSQMRYE-----------------VTGDQLHKEGHQLESSGTN 383
D G H+N HIP ++G+ Y T D ++ G + + T
Sbjct: 687 VDTFRGLHANQHIPQIMGALEMYRDTESAPYFHIADNFWHIATNDYMYSIG-GVAGARTP 745
Query: 384 IGHFNFKSDPKRLASNLDS--NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGV 441
F ++P L S E+C TYNMLK+SR+LF + ++ AY DYYER L N +
Sbjct: 746 ANAECFTTEPATLYEFGFSAGGQNETCATYNMLKLSRNLFLFQQDPAYMDYYERGLYNHI 805
Query: 442 LGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPS-DSFWCCYGTGIESFSKLGDSIYFE 500
L P Y +PL PGS K+ +G P F CC GT IES +KL +SIYF+
Sbjct: 806 LASVAKDSP-ANTYHVPLRPGSIKQ-----FGNPKMKGFTCCNGTAIESSTKLQNSIYFK 859
Query: 501 EEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNL 560
+Y+ ++ S L WK + + Q ++ LT KG + L +
Sbjct: 860 SVDDQ-SLYVNLFVPSTLHWKERNLTIVQS----TAFPKEDHTRLTVQGKGKFV---LKI 911
Query: 561 RIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
R+P W ++ G K ++NG+ + + PG + ++ + W + D + I +P E + D +
Sbjct: 912 RVPQW-ATEGIKVSINGKPAQVDAVPGTYATIQRKWKNGDTIDINIPFQFHLEPVMDQQ- 969
Query: 620 EYASIQAILYGPYVLAGH 637
+I ++ YGP +LA
Sbjct: 970 ---NIASLFYGPVLLAAQ 984
>gi|393782713|ref|ZP_10370896.1| hypothetical protein HMPREF1071_01764 [Bacteroides salyersiae
CL02T12C01]
gi|392672940|gb|EIY66406.1| hypothetical protein HMPREF1071_01764 [Bacteroides salyersiae
CL02T12C01]
Length = 796
Score = 216 bits (549), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 173/561 (30%), Positives = 258/561 (45%), Gaps = 59/561 (10%)
Query: 106 KVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE 165
KVP + SL DV+L S + A + YLL LDVD+L+ + R+ L E
Sbjct: 28 KVPCTHTPVWQSFSLSDVKLTS-GIFKGAMDLHKGYLLSLDVDRLIPHVRRNVGLTGKNE 86
Query: 166 PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI------ 219
YGGWE G GHY+SA A+M+AST + ++++ ++ L CQ++
Sbjct: 87 NYGGWETHG----GCTYGHYMSACAMMYASTGEKIFRDRLEYMMDELKECQQQTQDGWFI 142
Query: 220 -----GSGYLSAFPTEQF-DRLEALIPVWA------PYYTIHKILAGLLDQYTYADNAEA 267
GY E F +R + W +Y IHK+LAGL D Y YA +A
Sbjct: 143 SGERAKEGYRKLLHGEVFLNRPDETKQPWNYNQNGNSWYCIHKVLAGLRDVYLYAGIQKA 202
Query: 268 LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHL 327
+ + ++ + N K + TL+ E GGMN+V ++ T D K+L A
Sbjct: 203 KEILMPLADFIADIALNSNK----DLFQSTLSVEQGGMNEVFTDIYAFTGDYKYLETACR 258
Query: 328 FDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQ 376
F+ + +A D + G H+N IP IG Y +++++ H
Sbjct: 259 FNHINVIYPVANGEDVLFGRHANDQIPKFIGVAKEYAYDTKEIYRKAAENFWDMVVNNHT 318
Query: 377 LESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERS 436
L G + + P + LD ++ E+C TYNMLK+SR LF + Y +YYE +
Sbjct: 319 LAIGGNSC--YERFGMPGEESKRLDYSSAETCNTYNMLKLSRLLFMMNGDYKYLNYYEHA 376
Query: 437 LTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDS 496
L N +L Q G + Y L PGS K+ S TP DSFWCC GTG+E+ +K +S
Sbjct: 377 LYNHILASQDPDMAGCVTYYTSLLPGSFKQYS-----TPYDSFWCCVGTGMENHAKYAES 431
Query: 497 IYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT 556
IYF+ + I YI S L+WK + D S +++ KG +
Sbjct: 432 IYFKNGN---SLLINLYIPSELNWKEQGFRLRLDTDFPES----DTISVCVVDKGR-FSG 483
Query: 557 SLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQ 615
S+ LR P W N + LNG+ + L ++ + + S D + I LP L +
Sbjct: 484 SVMLRYPEWVEGN-PEMMLNGRPVKLEYGKKEYIRLPDSIKSGDTIKIVLPRKLSVRYAK 542
Query: 616 DDRPEYASIQAILYGPYVLAG 636
D+ P + S I+YGP +LAG
Sbjct: 543 DE-PHFGS---IMYGPILLAG 559
>gi|238061684|ref|ZP_04606393.1| secreted protein [Micromonospora sp. ATCC 39149]
gi|237883495|gb|EEP72323.1| secreted protein [Micromonospora sp. ATCC 39149]
Length = 933
Score = 216 bits (549), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 140/437 (32%), Positives = 216/437 (49%), Gaps = 41/437 (9%)
Query: 222 GYLSAFPTEQFDRLEALIP-----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE++ VWAPYYT HKIL G+LD Y + AL + T M +
Sbjct: 382 GFLAAYPETQFITLESMTASDYAKVWAPYYTAHKILQGILDAYLNTGDERALDLATGMCD 441
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ ++R+ + +++R W + E GG+ + + + IT P HL LA LFD +
Sbjct: 442 WMHSRLSK-LPAATLQRMWGLFSSGEFGGIVETICDVHRITGSPNHLALARLFDLNSLID 500
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIGH--- 386
A D I+G H+N HIPI G ++ TG+Q + + + + +IG
Sbjct: 501 AAAAGTDTITGLHANQHIPIFTGLLRLHDETGEQRYLNAARNFWPMVVPTRMYSIGGTST 560
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
F +P +A +L E+C YN+LK+SR LF ++ Y DYYER+L N +LG +R
Sbjct: 561 VEFWKEPGAIAGSLSDTNAETCCAYNLLKLSRTLFLHEQDPKYMDYYERALYNQILGSKR 620
Query: 447 ---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFE-EE 502
E ++ Y + L PG ++ TP CC GTG+ES +K D++Y + +
Sbjct: 621 DLADAEKPLVTYFIGLVPGHVRDY------TPKQGTTCCEGTGMESATKYQDTVYLDTAD 674
Query: 503 GKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRI 562
G+ +Y+ Y SS+L W I + Q + ++V G T L LR+
Sbjct: 675 GR--ALYVNLYSSSKLTWARRGITLTQTTRYPFEQNTTIKV-------GGNATFELRLRV 725
Query: 563 PTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
P W + K +NG+ P +PG++ V + W + D + + +P LR E DD
Sbjct: 726 PGWVKGD-FKVYVNGRRAPGKATPGSYFPVARRWRAGDTVRVHIPFQLRVEKALDD---- 780
Query: 622 ASIQAILYGPYVLAGHS 638
S Q + YGP L S
Sbjct: 781 PSTQTLFYGPVNLVARS 797
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 6/110 (5%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWE-- 171
L+ L +V L D + R + LE+ +VD+L+ FR A L G GWE
Sbjct: 49 LRPFPLGEVAL-RDGVFARKRDLMLEHARGYNVDRLLQVFRANAGLDTLGAVAPSGWEGL 107
Query: 172 --EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
E + LRGH+ GH+L+ A + ST ++ +K+ +V AL + +
Sbjct: 108 DGEANGNLRGHYTGHFLTMLAQAYGSTGDKVFADKLKYMVGALVEARAAL 157
>gi|189464752|ref|ZP_03013537.1| hypothetical protein BACINT_01096 [Bacteroides intestinalis DSM
17393]
gi|189437026|gb|EDV06011.1| hypothetical protein BACINT_01096 [Bacteroides intestinalis DSM
17393]
Length = 790
Score = 214 bits (546), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 158/547 (28%), Positives = 260/547 (47%), Gaps = 66/547 (12%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A+ N+E LL D D+L+ +RK A L + Y W+ L GH GHYL+A A+
Sbjct: 43 ARDLNIETLLKYDCDRLIAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMAIN- 97
Query: 194 ASTHNESLKEKMSAVVSALSACQK-------EIGSGYLSAFPTEQ-----FDRLEALI-- 239
A+T NE +++M +++ ++ C + + G GY+ P Q F + +
Sbjct: 98 AATGNEECRKRMEYIINEIAECAEANYKNHPKWGVGYMGGMPNSQNIWSGFKNGDFRVYS 157
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHW 295
WAP+Y +HK+ AGL D + Y N +A L+ W ++ + S E+
Sbjct: 158 GSWAPFYNLHKMYAGLRDAWLYCGNEQAKTLFLQFCNWAID--------ITSGLSDEQME 209
Query: 296 QTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPI 355
+ L E GGMN+VL + IT++ K+L A F ++ + D + H+NT +P
Sbjct: 210 RMLGNEHGGMNEVLADAYAITREQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQVPK 269
Query: 356 VIGSQMRYEVTGDQLHK----------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTE 405
VIG + E++G++ + G + + G N +F + + D +
Sbjct: 270 VIGFERISELSGNEDYHMASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDIDGP 329
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
ESC T N+LK++ L R E YADYYE + N +L Q E G +Y P P
Sbjct: 330 ESCNTNNILKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTPARP---- 384
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
R Y ++ P+++ WCC GTG+E+ K G IY +++ Y +S+LDWK I
Sbjct: 385 -RHYRNYSAPNEAMWCCVGTGMENHGKYGQFIYTHVGD---ALFVNLYAASQLDWKERGI 440
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPS 584
+ Q+ + PY + ++G G T +L +R P W K ++NG+ + +
Sbjct: 441 TLRQE-----TAFPYSENSTITIAEGKG-TFNLMVRYPGWVHPGEFKVSVNGKPVDIITG 494
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI 644
P +++S+ + W D + I P+ + ++ P+Y A ++GP +L G G
Sbjct: 495 PSSYVSINRKWKKGDVVNINFPMHSSLRYLPNE-PQYI---AFMHGP-ILLGMKTG---- 545
Query: 645 TESATSL 651
TES SL
Sbjct: 546 TESMASL 552
>gi|326801658|ref|YP_004319477.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326552422|gb|ADZ80807.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
Length = 790
Score = 213 bits (543), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 157/545 (28%), Positives = 251/545 (46%), Gaps = 52/545 (9%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
++ L +RL + AQ+T+L Y+L L+ D+L+ + + A L YG WE
Sbjct: 33 MESFPLASIRLADGPLK-DAQETDLRYILALNPDRLLAPYLREAGLEPKASSYGNWENTG 91
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE---- 230
L GH GHYLSA +LM A+T N +++++++ ++S L CQ + GY+ P
Sbjct: 92 --LDGHIGGHYLSALSLMAAATGNHAIQDRLTYMLSELKRCQDQDSDGYVGGIPGGKQMW 149
Query: 231 ---QFDRLEA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
+ ++EA L W P Y IHK+ AGL+D Y Y N A +M + +++
Sbjct: 150 NDIKRGKIEAQSFSLNGKWVPIYNIHKLFAGLIDAYRYTGNEHARQMVLKLGKWWL---- 205
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+V + E+ L E GG+N+V L I+ D K+L +A L L D+
Sbjct: 206 SVFGGLTDEQIQTILRSEHGGINEVFADLAQISGDQKYLTMAKRLSHRAILQPLIAGKDE 265
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQLESSGTN--IGHFNFK 390
++G H+NT IP VIG + + D + H+ S G N HF+
Sbjct: 266 LTGLHANTQIPKVIGFE-KIAALADSMSWANAARFFWETVVEHRTVSIGGNSESEHFHAL 324
Query: 391 SDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
+ ++ S+ + E+C TYNM+K+S+ LF + + DYYER+ N +L Q E
Sbjct: 325 NSFGKMLSSREG--PETCNTYNMMKLSKDLFLQGPDRKFIDYYERATYNHILSSQHPKEG 382
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G +Y P+ P Y + FWCC G+G+E+ K G+ IY G+ +YI
Sbjct: 383 G-FVYFTPMRPN-----HYRVYSQAQACFWCCVGSGLENHGKYGELIY-THSGQ--DLYI 433
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+I S L W+ I + Q+ PY + + + T S+ +R P W
Sbjct: 434 NLFIPSTLKWQEQGISLTQRTRF-----PYEQKSSVTIEVANPKTFSVFIRKPKWLGKQP 488
Query: 571 AKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NG+ + +L + + W +T LP+ + E + P + YG
Sbjct: 489 INLLVNGKQISYQEDKGYLKINRKWVGQSIITFNLPMQINAELLPSGEPWV----SYTYG 544
Query: 631 PYVLA 635
P VLA
Sbjct: 545 PIVLA 549
>gi|383777661|ref|YP_005462227.1| hypothetical protein AMIS_24910 [Actinoplanes missouriensis 431]
gi|381370893|dbj|BAL87711.1| hypothetical protein AMIS_24910 [Actinoplanes missouriensis 431]
Length = 939
Score = 213 bits (543), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 150/487 (30%), Positives = 237/487 (48%), Gaps = 44/487 (9%)
Query: 171 EEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE 230
EE S ELRG+ + + + + S ++ +AV++ + +G+L+A+P
Sbjct: 350 EEISGELRGNLAWYRFDETE--GTTVADASGRDWDAAVITGVGGAPGPSHAGFLAAYPET 407
Query: 231 QFDRLEALI---PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIK 287
QF LE L +WAPYYT HKI+ GLLD +T NA AL + M E+ ++R+ + +
Sbjct: 408 QFVLLEQLTTYPAIWAPYYTCHKIMRGLLDAHTLGGNATALDVVRGMGEWAHSRLSKLPR 467
Query: 288 KYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
+ ++R W + E GGMN+V+ L +T + L A FD L D + G
Sbjct: 468 E-QLDRMWALYIAGEYGGMNEVMVDLATLTGNKTFLETARFFDNTKLLADCVADIDSLDG 526
Query: 347 FHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQLESSGTNIGHFNFKSDPKR 395
H+N HIP +G YE D+ ++ GT G F+
Sbjct: 527 KHANQHIPQFLGYLRLYENGADKTYRTAAANFFDMVVPHRTYMHGGTGQGEV-FRKRDVI 585
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG----TEPG 451
S +++ ESC YNMLKV+R+LF + + DYYE++L N +L +R T+P
Sbjct: 586 AGSIVNTTNAESCAAYNMLKVARNLFSHAPDGRFMDYYEKALVNQILASRRDVDSTTDP- 644
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
++ Y++P+ PG+ R Y + GT CC GTG+E+ +K D+I+F K +Y+
Sbjct: 645 LVTYMVPVGPGA--RRGYGNIGT------CCGGTGLENHTKYQDTIWF-RSAKSDTLYVN 695
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
YI S L+W + ++ V Q D S P +T+T S++ L LR+P+W + +
Sbjct: 696 LYIPSTLNWAAKKLTVTQTGDYPRS--PETTLTITGSAR-----LDLRLRVPSWADDDFS 748
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ ++S+ + W S D +T+ P L E DD S+QA+LYGP
Sbjct: 749 VTVNSKIQRVRAGRDGYVSLDRHWRSGDTITVSSPYRLHVERALDD----PSLQALLYGP 804
Query: 632 YVLAGHS 638
L S
Sbjct: 805 LALVAKS 811
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 139 LEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEEPSCELRGHFVGHYLSASALMWASTH 197
L Y D D++V NFR A L G +P GGW++ + LRGH+ GH++S A WA T
Sbjct: 89 LAYARAYDADRIVSNFRTAAGLDNRGAQPPGGWDDATGNLRGHYSGHFISMLAQAWADTG 148
Query: 198 NESLKEKMSAVVSALSACQ 216
KEK+ +V+AL CQ
Sbjct: 149 EAIFKEKLDYIVTALKECQ 167
>gi|302539859|ref|ZP_07292201.1| putative secreted protein [Streptomyces hygroscopicus ATCC 53653]
gi|302457477|gb|EFL20570.1| putative secreted protein [Streptomyces himastatinicus ATCC 53653]
Length = 940
Score = 212 bits (539), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 140/433 (32%), Positives = 216/433 (49%), Gaps = 40/433 (9%)
Query: 222 GYLSAFPTEQFDRLEALIP-----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVE 276
G+L+A+P QF LE++ VWAPYYT HKIL GLLD + +A AL + M +
Sbjct: 389 GFLAAYPETQFITLESMTSGDYTVVWAPYYTAHKILRGLLDAHLATGDARALDLAMGMCD 448
Query: 277 YFYNRVQNVIKKYSIERHWQTLNE-EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLG 335
+ Y+R+ + + +++R W + E GG+ + + L+ ++ +HL LA LFD +
Sbjct: 449 WMYSRLSK-LPRSTLQRMWGIFSSGEFGGIVEAICDLYALSGKAQHLALARLFDLDKLID 507
Query: 336 LLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG---H 386
A D + G H+N HIPI G Y+ T ++ + + + + IG +
Sbjct: 508 ACAAGDDTLDGLHANQHIPIFTGLVRLYDETEEERYLTAAKNFWDMVVPTRMYGIGGTSN 567
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR 446
F +A L T E+C YNMLK+SR LF ++ AY DYYER+L N VLG ++
Sbjct: 568 REFWGARGAIAKTLSDTTAETCCAYNMLKLSRMLFFHEQDPAYMDYYERALYNQVLGSKQ 627
Query: 447 ---GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEG 503
E ++ Y + L PG ++ TP CC GTG+ES +K DS+YF+
Sbjct: 628 DRADAEKPLVTYFIGLVPGHVRDY------TPKAGTTCCEGTGMESATKYQDSVYFKRAD 681
Query: 504 KYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLR-VTLTFSSKGSGLTTSLNLRI 562
+Y+ Y S L W I V Q Y R T + +G L LR+
Sbjct: 682 G-TALYVNLYSPSTLTWAEKGITVTQSTG-------YPREQGSTLTVRGRTAAFDLRLRV 733
Query: 563 PTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
P W +++G + T+NG+ + +PG++ SV++TW D + + +P LR E DD
Sbjct: 734 PAW-ATDGFRVTVNGRAVKGTWTPGSYASVSRTWRDGDTVRVDIPFRLRVEKALDD---- 788
Query: 622 ASIQAILYGPYVL 634
+Q + +GP L
Sbjct: 789 PRVQTLFHGPVNL 801
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 59/119 (49%), Gaps = 7/119 (5%)
Query: 106 KVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE 165
KVP + L+ + DV L + S+ +Q L++ DVD+L+ FR A L G
Sbjct: 42 KVPA-AAWTLRPFNPEDVALRT-SVFTAKRQLMLDFGRGYDVDRLLQVFRANAGLSTRGA 99
Query: 166 -PYGGWE----EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
GGWE E + LRGHF GH+L+ + + T + +K+ +V AL ++ +
Sbjct: 100 VAPGGWEGLDGEANGNLRGHFTGHFLTMLSQAYTGTGEKVYADKIRHMVGALDEVREAL 158
>gi|373463723|ref|ZP_09555310.1| hypothetical protein HMPREF9104_01016 [Lactobacillus kisonensis
F0435]
gi|371763942|gb|EHO52383.1| hypothetical protein HMPREF9104_01016 [Lactobacillus kisonensis
F0435]
Length = 747
Score = 211 bits (538), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 165/576 (28%), Positives = 277/576 (48%), Gaps = 75/576 (13%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWEEP 173
+K VS ++V+ +S + N+ ++L L D+L++N+R A L G P WE P
Sbjct: 22 MKPVSYYNVKYLPNSTLKEKFERNVNWMLSLTPDQLLYNYRINAGLDTKGATPLTVWESP 81
Query: 174 SCELRGHFVGHYLSASALMWASTHN-------ESLKEKMSAVVSALSACQKEIGS----- 221
RGHF GHYLS ++ + +N LK++++ +V L CQ++ +
Sbjct: 82 DWFFRGHFTGHYLSGASRSFVELNNMEDTKEANELKDRVNKIVDGLKECQEKFDTFEEFP 141
Query: 222 GYLSAFPTEQFDRLEALI---PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYF 278
GYL+A P+++FD +E L + PYY + K++ GL+D Y +A N AL +T M YF
Sbjct: 142 GYLAAEPSKRFDDVEKLRFNGNHYVPYYAVQKLMDGLMDAYEFAGNQTALELTMNMTHYF 201
Query: 279 YNRVQNVIKKY---SIERHW------QTLNEEAGGMNDVLYKLFCITQDPKHLM--LAHL 327
R++ + + I+ W ++E G M+ L +L+ IT + + LA
Sbjct: 202 EKRMERLTPEQINAMIDTRWYQGKGHYVYHQEFGAMHRTLLRLYEITDKKQKDIFDLAQK 261
Query: 328 FDKPCFLGLLALQADDISGF---HSNTHIPIVIGSQMRYEVTGDQLHK-----------E 373
FD+ F +L + DD G+ H+NT + G Y VTGD+ +K +
Sbjct: 262 FDRKWFRDML-INNDDELGYYSCHANTELVCAEGMLEYYHVTGDENYKKGVVNYMNWMHD 320
Query: 374 GHQLESSGTN-----IGHFNFKSD----PKRLASNLDSNTEESCTTYNMLKVSRHLFRWT 424
GH+L + G + ++ S+ P+ +L ESC ++++ +S LF T
Sbjct: 321 GHELPTKGISGRSAYPAPADYGSELYDYPEMFFKHLSMLNGESCCSHDLNFLSSELFADT 380
Query: 425 KEIAYADYYERSLTNGVLGIQRGTEPGVMIYL--LPLAPGSSKERSYHHWGTPSDSFWCC 482
K+ D YE N ++ Q+ + + YL L +AP S+KE Y H G FWCC
Sbjct: 381 KDATLLDDYEIRFINAIMA-QQNNDSAIAEYLYNLSVAPNSTKE--YSHTG-----FWCC 432
Query: 483 YGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLR 542
G+G E S L D IY+ ++ +Y+ QY S LD K + V Q D +
Sbjct: 433 TGSGTERHSTLVDGIYYTDK---KDIYVGQYFDSILDLKDQGVTVTQ--DSHYPEQHFAH 487
Query: 543 VTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLT 602
+T+ ++K T + LR+P W S +++G+++ F+++ +TW ++T
Sbjct: 488 ITVE-AAKSQEFT--VYLRVPKW--SRNTTISVDGENVDAEPKNGFVAIKRTWGKKAEIT 542
Query: 603 IQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
+ LR + + D + + AI YGP +LA +
Sbjct: 543 VNFDFELRYQTLAD---RFNRV-AIYYGPILLAAQT 574
>gi|374992692|ref|YP_004968187.1| hypothetical protein SBI_09938 [Streptomyces bingchenggensis BCW-1]
gi|297163344|gb|ADI13056.1| hypothetical protein SBI_09938 [Streptomyces bingchenggensis BCW-1]
Length = 769
Score = 211 bits (537), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 166/543 (30%), Positives = 252/543 (46%), Gaps = 69/543 (12%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
AQ T L+YLL LD D+L+ R+ A LP E YG WE S L GH VGH LS +ALM
Sbjct: 19 AQATALDYLLSLDTDRLLAPLRREAGLPPVAESYGNWE--SSGLDGHTVGHALSGAALMS 76
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA---------LIPVW 242
A T + + + +V + CQ +G+GY+ P + R+ A L W
Sbjct: 77 AVTDDPRPRAMVDRLVQGVVECQDALGTGYVGGVPDGVRLWQRVAAGQVERDSFELGGAW 136
Query: 243 APYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
P+Y +HK+ AGLLD Y + + AL + +++ V + H L E
Sbjct: 137 VPWYNLHKLFAGLLDAYRHTGSEPALTAVRRLADWW----GRVAAGMDDDTHEAMLRTEF 192
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMR 362
GGM +VL L +T ++ LA F L L D + G H+NT I V+G Q
Sbjct: 193 GGMCEVLADLAEVTGTDRYAALARRFLDQSLLRPLCEHRDVLDGMHANTQIAKVVGYQRL 252
Query: 363 YEVTGDQLHKEG----------HQLESSGTN--IGHFNFKSDPKRLASNLDS-NTEESCT 409
EV D ++ H+ S G N H + + D +S L S E+C
Sbjct: 253 GEVVDDPGLRDAARFFWQAMTRHRTVSFGGNSVREHLHPRDD---FSSALQSPEGPETCN 309
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSKERS 468
TYNMLK+SR LF + D+YER+ N +L +P G ++Y P+ PG
Sbjct: 310 TYNMLKLSRALFLERPDTEVLDHYERATVNHILS---SLQPKGGLVYFTPVRPG-----H 361
Query: 469 YHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVN 528
Y TP + FWCC GTG+E+ +K G+ +Y E +++ +I+SRL +V+
Sbjct: 362 YRVVSTPQNCFWCCVGTGLENHAKYGELVYTTEGDD---LFVNLFIASRLSRPEQNLVLE 418
Query: 529 QKVDPVVSWDPYLRVTLTFSSKGSGLTT-SLNLRIPTWTSSNGAKATLNG---QDLPLP- 583
Q +D +R+ + +G+ T +++R+P W + +NG +D P P
Sbjct: 419 QTG--TAPYDEEVRLVV----RGAPATPLPIHIRVPGWHEGT-PQIRINGAPPEDGPGPL 471
Query: 584 --------SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
P ++ + + W D +T++L + E + D P + S + +GP VLA
Sbjct: 472 TTRRAAGGQPLTYVRLERQWCEGDTVTMRLRPRISAELLPDGSP-WVSYR---FGPSVLA 527
Query: 636 GHS 638
S
Sbjct: 528 AES 530
>gi|256378728|ref|YP_003102388.1| hypothetical protein Amir_4712 [Actinosynnema mirum DSM 43827]
gi|255923031|gb|ACU38542.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 881
Score = 208 bits (529), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 191/644 (29%), Positives = 302/644 (46%), Gaps = 101/644 (15%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWEE- 172
L+ L DV L D + RA L + VD+++ FR A L G P G WE+
Sbjct: 9 LEPFPLRDVEL-LDGVQSRAAGQMLHLARVFPVDRVLAVFRANAGLDTRGALPPGNWEDF 67
Query: 173 -------------------PSCEL-RGHFVGHYLSASALMWASTHNESLKEKMSAVVSAL 212
P+ L RGH+ GH+LS AL AST ESL+ K +V+ L
Sbjct: 68 GHPDERPWSAEEYPGAGVAPTASLLRGHYAGHFLSMVALAHASTGEESLRAKAWEIVAGL 127
Query: 213 SACQKEIGS-------GYLSAFPTEQFDRLEALIP---VWAPYYTIHKILAGLLDQYTYA 262
+ + + + G+L+A+ QF RLE L P +WAPYYT HKI+AGLLD + +
Sbjct: 128 AEVRDALAATGRYSHPGFLAAYGEWQFSRLEDLAPYGEIWAPYYTCHKIMAGLLDAHEHT 187
Query: 263 DNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKH 321
+ +AL + M + RV +++ ++R W + E GGMN+ L L IT +
Sbjct: 188 GSEQALELAVGMGHWVAGRVLR-LERAHLQRMWSLYIAGEFGGMNESLAALHRITGEEVF 246
Query: 322 LMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE-------- 373
L A F+ L A D + G H+N H+P+++G +Y+ TG+ + +
Sbjct: 247 LRAAAAFELDHLLEGAAQGRDLLDGMHANQHLPMLVGHLDQYDATGETRYLDAVTALWDQ 306
Query: 374 ---GHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYA 430
G GT G +D +A + ESC TYN+LK++R LF T + Y
Sbjct: 307 VVPGRTFAHGGTGEGELWGPAD--TVAGFIGRRNAESCATYNLLKIARSLFARTGDARYP 364
Query: 431 DYYERSLTNGVLGIQRGTEPGV---MIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGI 487
+Y ER+ N ++G + + V ++Y+ P+ G+ +E Y + GT CC GTG+
Sbjct: 365 EYAERAWLNHMVGSRADLDSDVSPEVVYMYPVDAGAVRE--YDNVGT------CCGGTGL 416
Query: 488 ESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTF 547
E+ K D ++F GK + + +++ SR+ G V + P RV + F
Sbjct: 417 ETHVKHQDWVWFHAPGK---LVVARHVPSRVTLPGGGSVALRTGYPRDG-----RVVVEF 468
Query: 548 SSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
+ SG L+LR+P+W + A ++G+ +PL + G F +++ + D++ + LPL
Sbjct: 469 DADFSG---ELHLRVPSWAT---AGYLVDGERVPL-TDGGFAVLSRDFRRGDEVELVLPL 521
Query: 608 TLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPI-PASY---N 663
LR + DD P S++ GP VL ++AT L P+ PA++ +
Sbjct: 522 PLRLVSTVDD-PTLVSVE---LGPTVLLARD-------DAATVL-----PVSPAAFRGLD 565
Query: 664 SQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRL 707
L+ + ++ F +T E SG DA HA RL
Sbjct: 566 GSLVGYERDGDLVSF------GGLTFEP-AWSGGDARYHAYLRL 602
>gi|294675240|ref|YP_003575856.1| hypothetical protein PRU_2607 [Prevotella ruminicola 23]
gi|294471633|gb|ADE81022.1| conserved hypothetical protein [Prevotella ruminicola 23]
Length = 788
Score = 207 bits (528), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 155/549 (28%), Positives = 260/549 (47%), Gaps = 55/549 (10%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEP 173
+ E L DV L + + A+ N+E LL D D+L+ + K A L G+ Y W+
Sbjct: 17 YANEFPLGDVTLLNGPLK-HARDLNIETLLKYDNDRLLAPYLKEAGLTPKGKSYPNWDG- 74
Query: 174 SCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSAC-------QKEIGSGYLSA 226
L GH GHYL+A A+ A+T ++ +++M +S L AC + G GY+
Sbjct: 75 ---LDGHVGGHYLTAMAIN-AATGSQECRKRMEYWISELQACADANAKNHPDWGRGYVGG 130
Query: 227 FPTEQFDRL---------EALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEY 277
P DR+ W P+Y IHK+ AGL D + Y N +A ++ ++
Sbjct: 131 VPGS--DRIWSNFKKGNFGPYFGAWVPFYNIHKMYAGLRDAWVYCGNEQAKKLFLGFCDW 188
Query: 278 FYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLL 337
+ N+ +ER L+ E GGMN+VL + IT + K+L +A F L L
Sbjct: 189 AIDLTANLTDA-QMER---ALDTEHGGMNEVLADAYAITGEQKYLDVARRFSHRRLLNPL 244
Query: 338 ALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK----------EGHQLESSGTNIGHF 387
+ D + H+NT +P VIG + E++GD+ + G + + G N
Sbjct: 245 MQRRDVLDNMHANTQVPKVIGFERIAELSGDEAYHTAGAYFWDIVTGERTLAFGGNSRRE 304
Query: 388 NFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
+F S D + ESC T NMLK++ L R E YAD++E + N +L Q
Sbjct: 305 HFPSREACQDFVQDIDGPESCNTNNMLKLTEDLHRRNPEARYADFFELATFNHILSTQH- 363
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
E G +Y S++ R Y ++ P+++ WCC GTG+E+ K IY
Sbjct: 364 PEHGGYVYFT-----SARPRHYRNYSAPNEAMWCCVGTGMENHGKYNQFIYTHSGD---A 415
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+++ +++S L+WK+ I + Q+ + R+T+T SS + T + +R P W
Sbjct: 416 LFVNLFVASELNWKAKGITLRQETS--FPYSENSRITITQSSN-TKQPTPIMVRYPGWVK 472
Query: 568 SNGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
+NG+ + + + P +++++ + W D + IQ P+ + + + P+Y A
Sbjct: 473 PGQFSVKVNGKPVSIVTGPSSYVAINRQWKKGDVIDIQFPMYNSVKYLP-NLPQYI---A 528
Query: 627 ILYGPYVLA 635
+++GP +LA
Sbjct: 529 LMHGPIMLA 537
>gi|159491178|ref|XP_001703550.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280474|gb|EDP06232.1| predicted protein [Chlamydomonas reinhardtii]
Length = 226
Score = 207 bits (526), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 107/197 (54%), Positives = 138/197 (70%), Gaps = 4/197 (2%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLL-MLDVDKLVWNFRKTARLPAPGEPY-GGWEE 172
++ + L DVRL ++ R ++ N +YLL ML+ D+L+W+FRKT+ LP PG PY WE+
Sbjct: 28 IEPLPLSDVRLLDTALQARYEKLNAKYLLDMLEPDRLLWSFRKTSGLPTPGTPYIASWED 87
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQF 232
P CELRGHFVGHYLSA +L A T N + K ++ +VS L Q+++G+GYLSAFPTE F
Sbjct: 88 PGCELRGHFVGHYLSALSLALAGTGNSAFKTRLDLMVSELGKVQEKLGTGYLSAFPTEFF 147
Query: 233 DRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIE 292
DR+EAL PVWAPYYTIHKI+AGL+D + A + AL M T MV+Y +NR Q VI E
Sbjct: 148 DRVEALKPVWAPYYTIHKIIAGLVDAHELAGHPSALAMATRMVDYHWNRTQAVIAAKGRE 207
Query: 293 RHWQ-TLNEEAGGMNDV 308
HW LN E GGMN+V
Sbjct: 208 -HWNAVLNCEFGGMNEV 223
>gi|408500683|ref|YP_006864602.1| acetyl-CoA carboxylase [Bifidobacterium asteroides PRL2011]
gi|408465507|gb|AFU71036.1| acetyl-CoA carboxylase [Bifidobacterium asteroides PRL2011]
Length = 807
Score = 206 bits (524), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 166/557 (29%), Positives = 255/557 (45%), Gaps = 62/557 (11%)
Query: 123 VRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFV 182
VRL S++ AQQ +YLL LD D+L+ +R+ A L A +PY WE S L GH
Sbjct: 26 VRLTPGSIYADAQQAGADYLLSLDPDRLLAPYRREAGLTATADPYPNWE--SMGLDGHIG 83
Query: 183 GHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA--- 237
GHYLS A W S E+ + +++ L CQ+ G G+L P E F L
Sbjct: 84 GHYLSGLAAYWQSLQTWPFLERATRMLTGLLECQEASGDGFLGGMPHSAELFRNLREGHV 143
Query: 238 ------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMV----EYFYNRVQNVIK 287
L+ W P Y +HK+ AGLLD + A M MV +++ + N+
Sbjct: 144 QAQSFDLLGSWVPLYNLHKLFAGLLDCWQSFQTKGASEMARVMVLRLADWWCDLADNID- 202
Query: 288 KYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPKHLMLAH-LFDKPCFLGLLALQADDIS 345
E+ +QT L E GG+N+ +L+ +T ++L A L D+P F LA+ D ++
Sbjct: 203 ----EQDFQTMLTCEYGGLNEAFARLYQLTGKDRYLRQARRLTDRP-FFEPLAVGKDQLT 257
Query: 346 GFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------LESSGTNIG------HFNFKSDP 393
G H+NT IP V+G + E+TGDQ + ++ +IG HFN D
Sbjct: 258 GLHANTQIPKVLGYERLAEITGDQAFRTAVDTFWHGVVDKRTVSIGAHSISEHFNPPDDF 317
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
+ ++ + E+C +YNM K++ L+ T + Y D+YER L N ++ E G
Sbjct: 318 SAMVTSREG--LETCNSYNMAKLALRLYDRTGQARYLDFYERVLVNHLVSTVGIREHG-F 374
Query: 454 IYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG-----V 508
+Y P+ P R Y + + SFWCC GTG+E+ ++ G I+ GK PG +
Sbjct: 375 VYFTPMRP-----RHYRVYSSAQRSFWCCVGTGLENHARYGAMIFERRPGKDPGQESESL 429
Query: 509 YIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS 568
+ +I + LDW + V+ P R+ L + S T L++R P W
Sbjct: 430 AVNLFIPASLDWSQRGLRVSLAYAPGPGTTNLGRIDLEADDQ-SQQTLDLDIRHPWWVED 488
Query: 569 NG-----AKATLNGQDLPLPSPGN--FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
+A + + S GN F + TW+ + L L R + P+
Sbjct: 489 ADYRIAQGQANMTVEPAKPDSEGNPRFDHLHLTWTG----RVSLELCHRVRVTAEPLPDG 544
Query: 622 ASIQAILYGPYVLAGHS 638
+ ++L G V+A S
Sbjct: 545 SDWVSLLRGVKVMAARS 561
>gi|332662487|ref|YP_004445275.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332331301|gb|AEE48402.1| protein of unknown function DUF1680 [Haliscomenobacter hydrossis
DSM 1100]
Length = 793
Score = 206 bits (524), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 151/534 (28%), Positives = 252/534 (47%), Gaps = 61/534 (11%)
Query: 134 AQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
A+ N++ LL D+D+L+ +RK A LP Y W+ L GH GHYLSA A M
Sbjct: 45 ARDLNIQTLLQYDIDRLLNPYRKEAGLPEKAASYPNWDG----LDGHVGGHYLSAMA-MN 99
Query: 194 ASTHNESLKEKMSAVVSALSACQKE-------IGSGYLSAFP-------TEQFDRLEALI 239
A+T N +++++ ++S L ACQ+ G GYL P T + +AL
Sbjct: 100 AATGNAECRKRLAYMLSELKACQEAHALKHPAWGIGYLGGVPKSAEIWSTFKNGDFKALR 159
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFYNRVQNVIKKYSIERHW 295
W P+Y +HK+ +GL D + Y + A L W + N + ++
Sbjct: 160 AAWVPWYNVHKLYSGLRDAWLYTGDETAKTLFLDFCDWGIAITANLSEAQMQS------- 212
Query: 296 QTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPI 355
L+ E GGMN++ + +T D K+L A F L +++ D++ H+NT +P
Sbjct: 213 -MLDIEHGGMNEIFADAYQMTGDEKYLKAAKGFSHQALLDPMSMGKDNLDNKHANTQVPK 271
Query: 356 VIGSQMRYEVTG-DQLHKEGHQLESSGTNIGHFNFKSDPKR-----LASNLD----SNTE 405
+G Q E++ D+ K G + T+ + +R +A+ D
Sbjct: 272 AVGFQRIAELSKEDKYAKAGRFFWETVTSKRSLALGGNSRREFFPSIAAGRDFVHDVEGP 331
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
ESC +YNMLK++ LFR Y DYYER+L N +L Q E G +Y P P
Sbjct: 332 ESCNSYNMLKLTEELFRANPSGHYIDYYERTLYNHILSTQH-PEHGGYVYFTPARP---- 386
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
R Y + P+ WCC G+G+E+ K IY +++ +++ +I+S L+W++ I
Sbjct: 387 -RHYRVYSAPNQGMWCCVGSGMENHGKYNQLIYTQQKDS---LFLNLFIASALNWRAKGI 442
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PS 584
V+ Q+ + + + LT + + T L +R P+W + + +N + + S
Sbjct: 443 VLKQQTN----FPEEEQTKLTITEGRARFT--LMIRYPSWVQAGALQIRVNNKRVTYTTS 496
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
P ++++ + W D + I LP+ E + + PEY A+L+GP +L +
Sbjct: 497 PSAYVAIKRLWKKGDVVQIVLPMRNTLEHLT-NAPEYV---ALLHGPILLGAKT 546
>gi|419850639|ref|ZP_14373619.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
gi|419851584|ref|ZP_14374510.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
gi|386408481|gb|EIJ23391.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
gi|386413301|gb|EIJ27914.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
Length = 1834
Score = 204 bits (518), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 173/607 (28%), Positives = 264/607 (43%), Gaps = 119/607 (19%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEE 172
+L E + +V + + + A + +EYLL + D+L+ FR A L G + YGGWE
Sbjct: 223 YLSEQGMENVTVADEYLQ-NAGKKEVEYLLSFEPDRLLVEFRAQAGLDTKGAKNYGGWEN 281
Query: 173 PSCELR------------GHFVGHYLSASALMWAST-----HNESLKEKMSAVVSALSAC 215
E R GHFVGH++SA++ ST L ++AVV +
Sbjct: 282 GPDESRNPDGSSKPGRFTGHFVGHWISAASQAQRSTFATADQKAQLSANLTAVVKGIREA 341
Query: 216 QKE------IGSGYLSAFPTEQFDRLEALIP-----VWAPYYTIHKILAGLLDQYTYADN 264
Q+ +G+ AF +++P + P+Y +HK+ AG++ Y Y+ +
Sbjct: 342 QEAYAKKDTANAGFFPAFSA-------SVVPNGGGGLIVPFYNLHKVEAGMVQAYDYSTD 394
Query: 265 AE--------ALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCIT 316
AE A+ W+V + S L E GGMND LY++ I
Sbjct: 395 AETRETAKAAAVDFAKWVVNW-----------KSAHASTDMLRTEYGGMNDALYQVAEIA 443
Query: 317 QDPKH---LMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRY---------- 363
L AHLFD+ LA D ++G H+NT IP + G+ RY
Sbjct: 444 DASDKQTVLTAAHLFDETALFQKLANGQDPLNGLHANTTIPKLTGAMQRYVAYTEDEDLY 503
Query: 364 ---------EVTG----------DQLHKEGHQLESSGTNIGHFNFKSDP-KRLASNLDSN 403
E+T D + K+ + + HF+ + K N D N
Sbjct: 504 NSLSADERGELTSLYLKAAQNFFDIVVKDHTYVNGGNSQSEHFHVAGELWKDATQNGDQN 563
Query: 404 -------TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
T E+C YNMLK++R LF+ TK+ Y++YYE + N ++ Q E G+ Y
Sbjct: 564 GGYRNFSTVETCNEYNMLKLARILFQVTKDSKYSEYYEHTFINAIVASQN-PETGMTTYF 622
Query: 457 LPLAPGSSK-------ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVY 509
P+ G K + +G +WCC GTGIE+F+KL DS YF +E VY
Sbjct: 623 QPMKAGYPKVFGITGTDYDADWFGGAIGEYWCCQGTGIENFAKLNDSFYFTDENN---VY 679
Query: 510 IIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN 569
+ + SS + + Q + + D +TF G+G + +L LR+P W +N
Sbjct: 680 VNMFWSSTYTDTRHNLTITQTANVPKTED------VTFEVSGTG-SANLKLRVPDWAITN 732
Query: 570 GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
G K ++G + L N VT K+T LP L+T D++ ++ + Q Y
Sbjct: 733 GVKLVVDGTEQALTKDENGW-VTVAIKDGAKITYTLPAKLQTIDAADNK-DWVAFQ---Y 787
Query: 630 GPYVLAG 636
GP VLAG
Sbjct: 788 GPVVLAG 794
>gi|393782707|ref|ZP_10370890.1| hypothetical protein HMPREF1071_01758 [Bacteroides salyersiae
CL02T12C01]
gi|392672934|gb|EIY66400.1| hypothetical protein HMPREF1071_01758 [Bacteroides salyersiae
CL02T12C01]
Length = 1293
Score = 202 bits (513), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 155/559 (27%), Positives = 258/559 (46%), Gaps = 72/559 (12%)
Query: 123 VRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFV 182
VRLG + +A N+ YL DV++L+ K + YGG + +
Sbjct: 450 VRLGEGRLK-QAMDKNITYLKSFDVNRLLAQTFKYNLGIDDYKLYGGANDAT-------F 501
Query: 183 GHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSA--FPTEQFDRL--EAL 238
HYLSA ++ +A+T +E L ++++ +V + Q +G G S PT F ++ E +
Sbjct: 502 AHYLSAISMGYAATGDEDLLQRVNHMVDVMIQAQDVMGDGLYSNNDAPTWGFYKMAKEKV 561
Query: 239 IPVWA---------------PYYTIHKILAGLLDQYTYADNAEA----LRMTTWMVEYFY 279
I + P+Y HK A D Y YA N A ++ W+V +
Sbjct: 562 ITPYGWDENGHPWGNNNIGFPFYAHHKAFAAFRDAYIYAGNENARVAFVKFCEWLVMWMQ 621
Query: 280 NRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLAL 339
N + ++K L E GGM +VL + ++ K L A F + F ++
Sbjct: 622 NFTDDNLQK--------MLESEHGGMVEVLSDAYALSGKIKFLDAARRFTRDNFAAAMSG 673
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFNF 389
DD+SG HSN H+P+ +G+ + Y +GD+ + H +G N + F
Sbjct: 674 NRDDLSGRHSNFHVPMAVGAAIHYLYSGDERSGKTAHNFFHIVHDHHTLCNGGNGNNERF 733
Query: 390 KSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTE 449
+ P L L E+C++YNMLK+++ LF + Y DYYE ++ N +L I
Sbjct: 734 GT-PDLLTYRLGQRGPETCSSYNMLKLAKDLFCQEGDTEYLDYYENTMWNHILAILSPRS 792
Query: 450 PGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVY 509
+ Y + L PG+ K S + + WCC GTG+ES +K D+IYF+ + G+
Sbjct: 793 DAGVCYHVNLKPGTFKMYSDLY-----SNLWCCVGTGMESHAKYVDAIYFKGD---IGIL 844
Query: 510 IIQYISSRLDWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS 568
+ + S L+W+ + + + D PV + V L + GS + +R P+W
Sbjct: 845 VNLFTPSTLNWEETGLKLTMETDFPVTN-----NVKLIINESGS-FNKDICIRYPSWVEE 898
Query: 569 NGAKATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAI 627
G T+NG + + PG + ++ +W++ D++ I +P LR + DD ++ AI
Sbjct: 899 GGIAITINGAKQKISAKPGEIIKLSSSWAAGDEILITIPCKLRLVDLPDD----INVSAI 954
Query: 628 LYGPYVLAGH--SIGDWDI 644
YGP +LA + +G DI
Sbjct: 955 FYGPVLLAANMGEVGQSDI 973
>gi|322692034|ref|YP_004221604.1| cell surface protein [Bifidobacterium longum subsp. longum JCM
1217]
gi|320456890|dbj|BAJ67512.1| putative cell surface protein [Bifidobacterium longum subsp. longum
JCM 1217]
Length = 1984
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 172/602 (28%), Positives = 261/602 (43%), Gaps = 109/602 (18%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG-EPYGGWEE 172
+L E + +V + + + A + +EYLL + D+L+ FR A L G + YGGWE
Sbjct: 373 YLSEQGMENVTVADEYLQ-NAGKKEVEYLLSFEPDRLLVEFRAQAGLDTKGAKNYGGWEN 431
Query: 173 PSCELR------------GHFVGHYLSASALMWAST-----HNESLKEKMSAVVSALSAC 215
E R GHFVGH++SA++ ST L ++AVV +
Sbjct: 432 GPDESRNPDGSSKPGRFTGHFVGHWISAASQAQRSTFATADQKAQLSANLTAVVKGIREA 491
Query: 216 QKE------IGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAE--- 266
Q+ +G+ AF + V P+Y +HK+ AG++ Y Y+ +AE
Sbjct: 492 QEAYAKKDTANAGFFPAFSASVVPNGGGGLIV--PFYNLHKVEAGMVQAYDYSTDAETRE 549
Query: 267 -----ALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKH 321
A+ W+V + S L E GGMND LY++ I
Sbjct: 550 TAKAAAVDFAKWVVNW-----------KSAHASTDMLRTEYGGMNDALYQVAEIADASDK 598
Query: 322 ---LMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRY-----------EVTG 367
L AHLFD+ LA D ++G H+NT IP + G+ RY ++
Sbjct: 599 QTVLTAAHLFDETALFQKLANGQDPLNGLHANTTIPKLTGAMQRYVAYTEDEDLYNSLSA 658
Query: 368 DQLHK-----------------EGHQLESSGTNIG-HFNFKSDP-KRLASNLDSN----- 403
D+ K + H + G + HF+ + K N D N
Sbjct: 659 DERGKLTSLYLKAAQNFFDIVVKDHTYVNGGNSQSEHFHVAGELWKDATQNGDQNGGYRN 718
Query: 404 --TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAP 461
T E+C YNMLK++R LF+ TK+ Y++YYE + N ++ Q E G+ Y P+
Sbjct: 719 FSTVETCNEYNMLKLARILFQVTKDSKYSEYYEHTFINAIVASQN-PETGMTTYFQPMKA 777
Query: 462 GSSK-------ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
G K + +G +WCC GTGIE+F+KL DS YF +E VY+ +
Sbjct: 778 GYPKVFGITGTDYDADWFGGAIGEYWCCQGTGIENFAKLNDSFYFTDENN---VYVNMFW 834
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
SS + + Q + + D +TF G+G + +L LR+P W +NG K
Sbjct: 835 SSTYTDTRHNLTITQTANVPKTED------VTFEVSGTG-SANLKLRVPDWAITNGVKLV 887
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
++G + L N VT K+T LP L+ D++ ++ + Q YGP VL
Sbjct: 888 VDGTEQALTKDENGW-VTVAIKDGAKITYTLPAKLQAIDAADNK-DWVAFQ---YGPVVL 942
Query: 635 AG 636
AG
Sbjct: 943 AG 944
>gi|302818287|ref|XP_002990817.1| hypothetical protein SELMODRAFT_429245 [Selaginella moellendorffii]
gi|300141378|gb|EFJ08090.1| hypothetical protein SELMODRAFT_429245 [Selaginella moellendorffii]
Length = 226
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 97/150 (64%), Positives = 116/150 (77%), Gaps = 4/150 (2%)
Query: 171 EEPSCELRGHFVG----HYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSA 226
EE SC L+ HYLSASA+ WASTHN ++ E M+AVV+AL+ CQ +IG+GYLSA
Sbjct: 8 EEISCHLKQQTACKDKRHYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSA 67
Query: 227 FPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVI 286
FPT FDR EAL VWAPYYTIHKI+AGLLDQYTYA N+ A M M +YF +RV+ VI
Sbjct: 68 FPTSLFDRFEALESVWAPYYTIHKIMAGLLDQYTYAANSFAFEMLLGMTDYFGSRVERVI 127
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCIT 316
+KYSIERHWQ+LNEE GGMNDVLY+++ IT
Sbjct: 128 EKYSIERHWQSLNEETGGMNDVLYRVYQIT 157
>gi|357472913|ref|XP_003606741.1| hypothetical protein MTR_4g065110 [Medicago truncatula]
gi|355507796|gb|AES88938.1| hypothetical protein MTR_4g065110 [Medicago truncatula]
Length = 203
Score = 198 bits (504), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 102/172 (59%), Positives = 124/172 (72%), Gaps = 9/172 (5%)
Query: 11 FKFLLTFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHLTPSDDS 70
F F+ L++ KECTN + SHTFR L +SKNE++ K++ SH H+TP+D+S
Sbjct: 6 FMFMFMALMLRGCVTIKECTNIPTQ--SHTFRYELFASKNETWKKEVMSHY-HVTPTDES 62
Query: 71 AWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSDSM 130
AW +L+PRKIL EE Q + WA++YRKIKN G FK P FLKEV L DVRL S+
Sbjct: 63 AWATLLPRKILSEENQHD---WALMYRKIKNLGVFKPPVG---FLKEVPLGDVRLLEGSI 116
Query: 131 HWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFV 182
H AQQTNLEYLLMLDVD+L+W+FRKTA LP PG PYGGWEEP+ ELRGHFV
Sbjct: 117 HAVAQQTNLEYLLMLDVDRLIWSFRKTAGLPTPGNPYGGWEEPNTELRGHFV 168
>gi|261415299|ref|YP_003248982.1| hypothetical protein Fisuc_0892 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|385790233|ref|YP_005821356.1| hypothetical protein FSU_1340 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|261371755|gb|ACX74500.1| protein of unknown function DUF1680 [Fibrobacter succinogenes
subsp. succinogenes S85]
gi|302327243|gb|ADL26444.1| conserved hypothetical protein [Fibrobacter succinogenes subsp.
succinogenes S85]
Length = 897
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 153/546 (28%), Positives = 251/546 (45%), Gaps = 58/546 (10%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
+L DV+L + R Q N+E LL DVD+L+ F + A + + W L
Sbjct: 36 ALSDVQLLDGVLKER-QDLNVETLLSYDVDRLLAPFYEEAGMKPKASKFPNW----AGLD 90
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS-----GYLSAFPTEQFD 233
GH +GHYLSA A+ +A + +KE++ ++ L Q + GY+S P +
Sbjct: 91 GHVLGHYLSALAMHYADNDDVQVKERLEYILKELKTIQDQNSKDNNFKGYISGVPNGKQM 150
Query: 234 RLE-------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVI 286
L+ A W P+Y IHK+ AGL D Y YA +A M + ++ + N +
Sbjct: 151 WLKMKNGDAGAQNGYWVPWYNIHKLYAGLRDAYVYAGYEQAKTMFLALCDWGIT-ITNGL 209
Query: 287 KKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG 346
++ Q L E GGM +V + +T+D K+L A + L ++ D+++
Sbjct: 210 NDSKMQ---QMLGTEHGGMPEVYADAYKLTKDEKYLNAAKKWSHQWLLNPMSQGNDNLTN 266
Query: 347 FHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQLESSGTNIG-HFNFKSDPK 394
H+NT +P V+G E++GD+ +K+G + G +I HF ++ K
Sbjct: 267 VHANTQVPKVVGFARIAELSGDEKYKKGSDFFWQTVVNKRSIAIGGNSISEHFPALNNHK 326
Query: 395 RLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI 454
+ + ESC TYNMLK++ LF + Y D+YER+L N +L T G +
Sbjct: 327 KFIEEREG--PESCNTYNMLKLTERLFNIKHDAHYTDFYERALFNHILSTIHPTHGG-YV 383
Query: 455 YLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
Y P P R Y + + WCC G+G+E+ +K IY +++ +Y+ +
Sbjct: 384 YFTPARP-----RHYRVYSKVNAGMWCCVGSGMENPAKYNQFIYTKDK---DALYVNLFA 435
Query: 515 SSRLDWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+S L+WK + + Q+ P + F+ GSG + +R P W K
Sbjct: 436 ASILNWKDKSVKIKQETAFPKGE-------SSKFTITGSG-EFDMQIRHPYWVKEGAFKV 487
Query: 574 TLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
+NG + S P +++S K+W S D + + P+ E D P A+L+GP
Sbjct: 488 IVNGDTVVKKSTPSSYVSAGKSWKSGDVVEVLYPMYTHVE----DLPGVTDYVALLHGPI 543
Query: 633 VLAGHS 638
VL+ +
Sbjct: 544 VLSAKT 549
>gi|332669733|ref|YP_004452741.1| hypothetical protein Celf_1219 [Cellulomonas fimi ATCC 484]
gi|332338771|gb|AEE45354.1| protein of unknown function DUF1680 [Cellulomonas fimi ATCC 484]
Length = 752
Score = 196 bits (498), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 169/540 (31%), Positives = 240/540 (44%), Gaps = 50/540 (9%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L DVRL D AQ+T+L YLL LD +L+ FR+ A LP EPYG WE S L G
Sbjct: 6 LSDVRL-LDGPFRDAQRTDLAYLLRLDPQRLLAPFRREAGLPPLAEPYGNWE--SMGLDG 62
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA 237
H GH LSA++L+WA+T + E +A+V L ACQ+ +G+GY+ P F+R+ A
Sbjct: 63 HTGGHALSAASLLWAATGDPRTAELAAALVDGLDACQEALGTGYVGGVPHGVALFERIAA 122
Query: 238 ---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
L W P+Y +HK +AGL+D YA A R +V F V
Sbjct: 123 GEVSADSFGLNGAWVPWYNLHKTVAGLVDAVRYAPAGTAERARR-VVLRFAEWWLGVAAG 181
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
+ L E GGM + L +T +A F L L D + G H
Sbjct: 182 LDDAQFAAMLRTEFGGMCEAFADLAALTGRDDLRAMAVRFADRTLLDPLLDGRDALDGLH 241
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHKEGHQ-----------LESSGTNIG-HFNFKSDPKRL 396
+NT I V+G E GD + + L G ++G HF+ D
Sbjct: 242 ANTQIAKVVGWAALAEQDGDGGWERAARTFWDAVTTHRSLVFGGDSVGEHFHPVDD---F 298
Query: 397 ASNLDS-NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
+ L S ESC T NML+++R L + D+ ER+L N VL Q G +Y
Sbjct: 299 SGALTSPEGPESCNTANMLELTRRLLLRRPDPTLLDFAERALVNHVLSAQH--PDGGFVY 356
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P P Y + P D FWCC GTG+E++++LG+ + +G V++ +
Sbjct: 357 FTPARP-----DHYRVYSQPEDGFWCCVGTGLETYARLGE-LALATQGDDLIVHL--PVP 408
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
R W + + + + P TLT G ++ +R P W + A T+
Sbjct: 409 VRATWGDAVVTLRSPYPDLSAAAP---TTLTLDLPGP-RRFAVRVRRPAWVGGDLAL-TV 463
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G G +LSVT+TW D LT + P + E + P+ + A GP VLA
Sbjct: 464 GGAPADATDDGTYLSVTRTWHDGDVLTWEHPARVVAERL----PDGSDWVAFRRGPVVLA 519
>gi|384109447|ref|ZP_10010323.1| hypothetical protein MSI_18910 [Treponema sp. JC4]
gi|383868978|gb|EID84601.1| hypothetical protein MSI_18910 [Treponema sp. JC4]
Length = 727
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 155/549 (28%), Positives = 256/549 (46%), Gaps = 71/549 (12%)
Query: 123 VRLGSDSMHWRAQQTNLEYLLMLDVDKLVW-NFRKTARLPAPGEPYGGWEEPSCELRGHF 181
+ L DS+ ++Q+ LEY+L + D+++ +R + P YGGWE +++GH
Sbjct: 6 INLEKDSLFEKSQRLGLEYVLEYEPDRMLAPCYRALGKNPCAIN-YGGWENR--QIQGHM 62
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLE----- 236
+GHYLSA + + T + KEK+ + + Q++ GY P++ FD++
Sbjct: 63 LGHYLSALSGFYYQTGKQDAKEKLDYTIDLIKELQRK--DGYFGGIPSDSFDKVFYSGGN 120
Query: 237 ------ALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYS 290
+L W P+Y+IHKI AGL+D Y Y N +AL++ M ++ N +N + S
Sbjct: 121 FEVERFSLAGWWVPWYSIHKIYAGLIDAYVYGGNEDALQIVFKMADWAINGTKN-LSDSS 179
Query: 291 IERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSN 350
I++ L E GGM V L+ IT + K+L A + + + + D + G+H+N
Sbjct: 180 IQK---MLTCEHGGMCKVFADLYGITGNKKYLSEAERWIHHEIIDPASKKEDKLQGYHAN 236
Query: 351 THIPIVIGSQMRYEVTGDQLHKEGHQL--ESSGTN----IG------HFNFKSDPKRLAS 398
T IP IG YE+TG ++ + E+ N IG HF +
Sbjct: 237 TQIPKFIGIARLYELTGKSEYRTAAEFFFETVTKNRSYAIGGNSKGEHFG-----REFEE 291
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLP 458
L +T E+C TYNML+++ H+F W K AD+YE +L N +L Q + G Y +
Sbjct: 292 PLMRDTCETCNTYNMLELAEHIFAWNKTSDIADFYENALYNHILASQ-DPQTGAKTYFVS 350
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFE-EEGKYPGVYIIQYISSR 517
+ G K H ++ WCC GTG+E+ S+ I + ++ Y ++I + +
Sbjct: 351 MQQGFHKVYCSH-----DNAMWCCTGTGLENPSRYNRFIACDFDDVLYINLFIPATVETE 405
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
WK KV+ +D +++ + K + L +R P W KA +G
Sbjct: 406 DGWKV-------KVETDFPYDAAVKIKVLERGKEN---KGLKVRKPGWADKMAEKAGEDG 455
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
GN SS+ ++ + LP+ L +D + A+ YGP VLA
Sbjct: 456 ----YIDFGNL-------SSESEIELSLPMKLSIYKAKDHSGNF----AVKYGPLVLAA- 499
Query: 638 SIGDWDITE 646
+G+ D+ E
Sbjct: 500 DLGNEDLPE 508
>gi|393782709|ref|ZP_10370892.1| hypothetical protein HMPREF1071_01760 [Bacteroides salyersiae
CL02T12C01]
gi|392672936|gb|EIY66402.1| hypothetical protein HMPREF1071_01760 [Bacteroides salyersiae
CL02T12C01]
Length = 673
Score = 194 bits (494), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 163/589 (27%), Positives = 257/589 (43%), Gaps = 96/589 (16%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTA--RLPAPGEP------ 166
+ L +VRL R Q + +Y+ L+ D+ + FR+ A + + G P
Sbjct: 34 FRSFGLDEVRLKDREFKLR-QNHDFDYIRTLEPDRYLSPFRRNAGIEVDSKGIPVDNTKH 92
Query: 167 YGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE-------I 219
Y GWE L GHYLSA ++M+ T + +L K++ ++ L+ Q+ +
Sbjct: 93 YDGWEF----LGSSTFGHYLSAISMMYKVTGDTTLLHKINYIIDELNFIQRNPSYENENL 148
Query: 220 GSGYLSAFPTEQ------------FDRLEA--LIPVWAP--------------------- 244
G L AF ++ +D L + AP
Sbjct: 149 RHGALVAFDRDRHKHVREPNFLRTYDELRQGQVNLTSAPDNRGATVENVYFKTFYWLSGG 208
Query: 245 --YYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
+YT HKI AG+ D Y Y N +A ++ ++ V +K + + L E
Sbjct: 209 LSWYTNHKIYAGIRDAYLYTGNPKAKKVFLSFCDW----ACWVTEKLTDHAFARMLYSEH 264
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLFDK-----PCFLGLLALQADDISGFHSNTHIPIVI 357
G MN++L + + + K+L A F++ PC G + A+ IS H+N IP
Sbjct: 265 GAMNEMLTDAYAFSGERKYLDCAFRFNEQETMVPCIDGDIKKIAETISHTHANAQIPQFY 324
Query: 358 GSQMRYEVTGDQLHK----------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEES 407
G +E TGD L K +Q +G N F++ P + + + + E+
Sbjct: 325 GLIKEFEYTGDSLFKVAAENFFKYVTNYQSFVTGGNSEWEQFRA-PGNIMAQVTRRSGET 383
Query: 408 CTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKER 467
C TYNMLK+++ LF T + Y +Y ER+L N +L ++PG Y L L PG K
Sbjct: 384 CNTYNMLKIAKGLFELTGDTLYLNYMERALYNHILPSIHTSQPGAFTYFLSLEPGYFKTF 443
Query: 468 SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVV 527
S P DS WCC GTG+E+ +K G+ IYF E + VY+ +++S L W+ +
Sbjct: 444 S-----RPYDSHWCCVGTGMENHAKYGEFIYFHHEKE---VYVNLFVASALCWEKEGFQM 495
Query: 528 NQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGN 587
D D R+ + G +L +RIP W G K +NG+ + +
Sbjct: 496 ETITDFPYESDVRFRIL-----QNKGRIATLKIRIPRWAKEVGVK--VNGKMIKYKNRDG 548
Query: 588 FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+L + K W D + + LP+ LR E + P + A YGP +LAG
Sbjct: 549 YLKLEKLWKIGDLVELTLPMYLRKEYV----PNCSDKFAFFYGPVLLAG 593
>gi|300726603|ref|ZP_07060044.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Prevotella
bryantii B14]
gi|299776135|gb|EFI72704.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Prevotella
bryantii B14]
Length = 832
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 166/591 (28%), Positives = 266/591 (45%), Gaps = 77/591 (13%)
Query: 107 VPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARL------ 160
V +S + L DV+L M A + N LL DVD+L+ F + A L
Sbjct: 12 VQAQSQIYPNHFDLQDVQLLDGPMK-SAMEINFNTLLAYDVDRLLTPFIRQAGLHEGRYA 70
Query: 161 --PAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSA----VVSALSA 214
+ W +L GH GHYLSA A+ +A+ + + KE++ + ++ L
Sbjct: 71 DWQKKHPNFKNWGGDGFDLSGHIGGHYLSALAMAYAACQDAATKERLQSRLLYMIDVLKD 130
Query: 215 CQKEIGS------GYLSAFP-TEQFDRL-EALIPV------WAPYYTIHKILAGLLDQYT 260
CQ G++ P E +++L + I W P+Y HK++AGL D Y
Sbjct: 131 CQNSFDQNTTGLYGFIGGQPINEDWEKLYQGDISGIWQHRGWVPFYCEHKVMAGLRDAYL 190
Query: 261 YADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPK 320
YA N +A M M ++ +I K S + L E GG+N+ + + I +D +
Sbjct: 191 YAHNQDAKLMLKKMADW----CTQLIAKVSDADMQKMLTIEHGGINESMADCYAIFKDTR 246
Query: 321 HLMLAHLFDKPCFL-GLLALQADDISGFHSNTHIPIVIG---------SQMRYEVTGDQL 370
+L A + + L GL +L A + H+NT +P IG + ++Y
Sbjct: 247 YLEAAKKYSQREMLEGLQSLNATFLDNRHANTQVPKYIGFERIVEEDPAALQYATAASNF 306
Query: 371 HKE--GHQLESSGTNI--GHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE 426
++ H+ G N HF K++ R NL+ ESC T NMLK+S L T +
Sbjct: 307 WQDVAHHRTVCIGGNSISEHFLSKTNSNRYIDNLEG--PESCNTNNMLKLSEMLSDRTHD 364
Query: 427 IAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTG 486
YAD+YE ++ N +L Q + G +Y L P + Y + P+ WCC GTG
Sbjct: 365 AGYADFYEYAMWNHILSTQ-DPQTGGYVYFTTLRP-----QGYRIYSVPNQGMWCCVGTG 418
Query: 487 IESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIV--VNQKVDPVVSWDPYLRVT 544
+E+ SK G +Y + + +Y+ + +S+LD K ++ N +P + T
Sbjct: 419 MENHSKYGHFVYTHDGDR--TLYVNLFTASKLDGKKFKLTQQTNYPYEP--------KTT 468
Query: 545 LTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG--QDLPLPSPGN--FLSVTKTWSSDDK 600
+T G ++ +R P WT+S+ + +NG Q L +PS G + ++ + W D
Sbjct: 469 ITIEKSGR---YAIAIRRPWWTTSD-YRIQVNGQTQQLNIPSAGTSAYATLERKWKKGDV 524
Query: 601 LTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSL 651
+T+ +P+TLR EA P Y A YGP +L + + AT L
Sbjct: 525 ITVDIPMTLRQEAC----PNYEDYIAFEYGPILLGAQTTSQNEAEARATGL 571
>gi|396489945|ref|XP_003843216.1| hypothetical protein LEMA_P073260.1 [Leptosphaeria maculans JN3]
gi|312219795|emb|CBX99737.1| hypothetical protein LEMA_P073260.1 [Leptosphaeria maculans JN3]
Length = 748
Score = 192 bits (488), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 164/577 (28%), Positives = 262/577 (45%), Gaps = 91/577 (15%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE--PYGGWE 171
++ L+ V LG + + Q +++ D + + F K A P GGWE
Sbjct: 45 LVRPFRLNQVHLGEGLLQEKRDQIK-DFVRTYDERRFLVLFNKVAGRANITNLSPPGGWE 103
Query: 172 EPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGS-------GYL 224
+ L GH+ GHY+SA + + KEK+ +V+ L+ACQ+ GYL
Sbjct: 104 DGGL-LSGHWTGHYMSALSQAYIDKGESIFKEKLDWMVAELAACQEAYTEYKQPTHLGYL 162
Query: 225 SAFPTEQFDRL----------EALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWM 274
A P + RL WA +YT HKI+ GLLD Y A+N +AL + M
Sbjct: 163 GALPEDTVLRLGPPRFAVYGSNISTDTWAGWYTQHKIMRGLLDAYYNANNTQALDIVIKM 222
Query: 275 VEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL 334
++ + + + + E GG N+V +++ +T + KHL A FD L
Sbjct: 223 ADWAHLALTDTY-----------IAGEFGGANEVFPEIYALTGEEKHLQTAKAFDNRESL 271
Query: 335 GLLALQADDI--------------SGFHSNTHIPIVIGSQMRYEVTGDQLHKEG------ 374
A+ DI H+NTH+P IG YE TG +
Sbjct: 272 FSAAVSDQDILVMTPERKPGRRRRERLHANTHVPQFIGYLRIYEHTGSNEYLLAAKNFFG 331
Query: 375 ----HQLESSGTNIGHF-NFKSDPK------RLASNLDSNTEESCTTYNMLKVSRHLFRW 423
H+ +SG+ G+ F ++P+ +A+++ E+C TYN L ++R+LF
Sbjct: 332 WVVPHREFASGSTGGNVPGFSANPELFQNRDNIANSIADEGAETCITYNTLNLARNLFLD 391
Query: 424 TKEIAYADYYERSLTNGVLGIQRGTEPGV---MIYLLPLAPGSSKERSYHHWGTPSDSFW 480
Y D+ ER L N + G + T + Y PL+PG +E Y + GT
Sbjct: 392 EHNATYMDHCERGLFNMIAGSRVDTSNNSDPQLTYFQPLSPGFGRE--YGNTGT------ 443
Query: 481 CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPY 540
CC GTG+ES +K +++Y P ++I +I S L W + Q+ + +
Sbjct: 444 CCGGTGMESHTKYQETVYL-RSAHSPVLWINLFIPSTLHWMERGFAIKQETN----FPRE 498
Query: 541 LRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG--QDLPLPSPGNFLSVTKTWSSD 598
LT + +G+ + + LR+P W NG T+NG Q P +LS+ + W ++
Sbjct: 499 GSTKLTIAGEGALV---IKLRVPGWV-RNGFAVTINGEAQATKNVQPSTYLSLKRIWKTN 554
Query: 599 DKLTIQLPLTLRTE-AIQDDRPEYASIQAILYGPYVL 634
D + +Q+PL++RTE AI DRP+ QA+++GP +L
Sbjct: 555 DVIEVQMPLSIRTERAI--DRPD---TQAVMWGPVLL 586
>gi|296129045|ref|YP_003636295.1| hypothetical protein Cfla_1194 [Cellulomonas flavigena DSM 20109]
gi|296020860|gb|ADG74096.1| protein of unknown function DUF1680 [Cellulomonas flavigena DSM
20109]
Length = 749
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 165/565 (29%), Positives = 247/565 (43%), Gaps = 106/565 (18%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELR 178
L VRL +D + +AQ+T LEYLL LD D+L+ FR+ A LP EPYG WE S L
Sbjct: 12 GLRAVRL-TDGLFAQAQRTALEYLLGLDPDRLLAPFRREAGLPPVAEPYGSWE--SLGLD 68
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP---------- 228
GH GH LSA++L WA+T ++ A+V L CQ +G+GY+ P
Sbjct: 69 GHIGGHALSAASLQWAATGDDRAAGMAHALVDGLVLCQDALGTGYVGGLPGGVALWESVA 128
Query: 229 -----TEQFDRLEALIPVWAPYYTIHKILAGLLD--QYTYADNA-----EALRMTTWMVE 276
FD L W P+Y +HK AGL+D +Y AD A A+R+ W V
Sbjct: 129 SGGAEAGTFD----LGGAWVPWYNVHKTYAGLIDAARYAPADVAVRAMRAAVRLGDWGVA 184
Query: 277 YFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGL 336
+R+ + + L E GGM + L +T D ++ LA F LG
Sbjct: 185 -LSDRLDDAAFA-------RMLRTEFGGMCEAYGDLAALTGDARYAALARRFADESLLGP 236
Query: 337 LALQADDISGFHSNTHIPIVIG-----------SQMRYEVTGDQLHKEGHQLESSGTNIG 385
L D++ G H+NT + V+G + +R + L GH +
Sbjct: 237 LRESRDELDGLHANTQVAKVVGWPAIGEADAALAFVRTVLDHRTLVLGGHSVAE------ 290
Query: 386 HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ 445
+F P+R ++ + ESC T N+L+V R L+ T ++A D ER L N VL Q
Sbjct: 291 --HFTPRPERHVTHREG--PESCNTANLLEVERRLYERTGDVALLDAAERQLVNHVLSAQ 346
Query: 446 RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
G +Y P PG Y + T WCC GT +E++++LG+ Y
Sbjct: 347 H--PDGGFVYFTPARPG-----HYRVYSTRDACMWCCVGTALETYARLGELAYA------ 393
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT--------- 556
++VN V P +P LRV L + + TT
Sbjct: 394 --------------LCGHDLLVNLPV-PSTLEEPGLRVRLDSTYPRALATTHATLTVDVD 438
Query: 557 -----SLNLRIPTWTSSNGAKATLNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLR 610
+++LR P+W + A T++G +P + + +++V +TW + + L +L
Sbjct: 439 APTDLAVHLRRPSWARGDLAP-TVDGVGVPATAERDGYVTVRRTWRAGEVLAWRLVAGPA 497
Query: 611 TEAIQDDRPEYASIQAILYGPYVLA 635
E + D A+ +GP LA
Sbjct: 498 AERLPGDD----GWVALRWGPVALA 518
>gi|365852804|ref|ZP_09393150.1| hypothetical protein HMPREF9103_01934 [Lactobacillus parafarraginis
F0439]
gi|363714017|gb|EHL97570.1| hypothetical protein HMPREF9103_01934 [Lactobacillus parafarraginis
F0439]
Length = 728
Score = 191 bits (485), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 156/594 (26%), Positives = 268/594 (45%), Gaps = 73/594 (12%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGE-PYGGWEEP 173
+K VS ++V +S + N+ ++L L D+L++N+RK A L G P WE P
Sbjct: 5 MKPVSYYNVEYLPNSTLKEKFERNINWMLSLTPDQLLYNYRKNAGLDTKGATPLTVWESP 64
Query: 174 SCELRGHFVGHYLSASALMWASTHNES--------LKEKMSAVVSALSACQKEIGS---- 221
RGHF GHYLS ++ + N LK ++ +V+ L Q ++
Sbjct: 65 DFFFRGHFTGHYLSGASKTFVELTNTDEKDPQAVELKNRVDLIVTGLKEVQDKLSETSEF 124
Query: 222 -GYLSAFPTEQFDRLEALI---PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEY 277
GYL+A P ++FD LE L + PYY I K++ GL+D Y Y N AL++ + Y
Sbjct: 125 PGYLAAEPEKRFDNLEKLRFNGNHYVPYYAIQKLMDGLMDAYQYTGNQTALQLVKNLTSY 184
Query: 278 FYNRVQNVIKKY---SIERHW------QTLNEEAGGMNDVLYKLFCIT--QDPKHLMLAH 326
R+ + + ++ W ++E G M+ L +L+ +T ++ LA
Sbjct: 185 VEKRMAKLTPERISAMLDTRWYQGSGQYIFHQEFGAMHRTLLRLYELTGKKEQDVFDLAE 244
Query: 327 LFDKPCFLGLLALQADDISGF--HSNTHIPIVIGSQMRYEVTGDQLHKE----------- 373
FD+ F +L D + + HSNT + G Y VTGD +K+
Sbjct: 245 KFDRKWFRDMLINNEDKLGYYSMHSNTELVCAEGMLEYYHVTGDDQYKKGVENYMDWMHT 304
Query: 374 GHQLESSGTN-----IGHFNFKSD----PKRLASNLDSNTEESCTTYNMLKVSRHLFRWT 424
GH+L + G + ++ S+ P+ +L ESC ++++ +S LF T
Sbjct: 305 GHELPTKGISGRSAYPAPADYGSELYDYPEMFFKHLSKLNGESCCSHDLNYLSSELFADT 364
Query: 425 KEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYG 484
K+ + YE N ++ Q+ + + YL L+ + + Y G FWCC G
Sbjct: 365 KDPVLMNDYEIRFINAIMA-QQNNDSAIAEYLYNLSVAPNSVKHYDRGG-----FWCCVG 418
Query: 485 TGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVT 544
+G E S L D IY+++ +Y+ QY S L+ K + V Q D + +T
Sbjct: 419 SGTERHSTLVDGIYYQDND---DIYVAQYFDSILNLKDQGVKVTQ--DAHYPDQHFAHIT 473
Query: 545 LTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQ 604
+ + + T + +R+P W++ T++G+ + + F+++ + WS ++TI
Sbjct: 474 VE-TEQPKDFT--IYVRVPKWSAE--TTITVDGKAVKVQPENGFVAIKRNWSKKSEITIN 528
Query: 605 LPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPI 658
LR + + D + I AI YGP +LA D+ S S +++ +
Sbjct: 529 FDFQLRYQVLAD---RFNRI-AIYYGPILLAAQKA---DLPASTVSAKEYLNDL 575
>gi|212693864|ref|ZP_03301992.1| hypothetical protein BACDOR_03386 [Bacteroides dorei DSM 17855]
gi|212663396|gb|EEB23970.1| hypothetical protein BACDOR_03386 [Bacteroides dorei DSM 17855]
Length = 811
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 163/658 (24%), Positives = 293/658 (44%), Gaps = 80/658 (12%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC--- 175
SL +VR+ +D Q + +YLL L+ D+L+ FR+ A L +PY WE
Sbjct: 37 SLSEVRI-TDKYFKHIQDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPFWESEDVWGG 95
Query: 176 -ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
L GH +G Y+S+ ++M+ +T+++ + ++++ +V+ L CQK G GYL A +
Sbjct: 96 GPLAGHILGFYMSSMSMMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLLATVNGKQVF 155
Query: 232 -------FDRLEALI-PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
F LI W P Y ++KI+ GL Y +A R+ M ++F V
Sbjct: 156 EDMIDGDFTTSNPLINQTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGMADWFGYEVL 215
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ + +I++ L E G +N+ ++ IT D K+L A + L+ D
Sbjct: 216 DKLNHENIQK---MLVCEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMWVPLSKGEDI 272
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQLESSGTNIGHFNFKSD 392
++G+H+NT IP G Y T ++ + + H + G + G F+
Sbjct: 273 LNGWHANTQIPKFTGFNAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNSTGEHFFEES 332
Query: 393 --PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
K++ ESC + NM++++ L++ + DYYER L N +L E
Sbjct: 333 MFEKKIPQ---YGGPESCNSVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA-NYDPEE 388
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G+ +Y P+ PG Y +GT SFWCC GTG E+ +K IY ++ +Y+
Sbjct: 389 GMCVYYTPMRPG-----HYKIYGTRYHSFWCCTGTGFEAPAKFAKMIYAHKDN---SLYV 440
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+I+S LDW I++ Q + P TL S L +RIP W +
Sbjct: 441 NMFIASTLDWNEKNIMITQSTNF-----PDEDQTLLTIKSSSTQQIDLKIRIPFWIKNKS 495
Query: 571 AKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
+N + + + S ++++++ WS D++ + L +++ A+ Y
Sbjct: 496 MVVRVNNKIVKGIKSEKGYVTISREWSDGDEIKVTFTPLLEIVPLKNSE----RYLAMTY 551
Query: 630 GPYVLA----GHSIGDWDITESATSLSDWITPI---PASYNSQLITFTQEYGNTKFV--- 679
GP VLA +IG + ++S+ + P+ P + T + GN + V
Sbjct: 552 GPIVLATKIDNTNIGKEEFRHERKTVSNVMIPMSDTPVLFG----TLNEIKGNIRRVVGK 607
Query: 680 ----LTNSNQSITMEKFPKSGTDAALHATFRLILNDS--------SGSEFSSLNDFIG 725
+ N + +++ P + + + +A + + ++D GS + ++N +G
Sbjct: 608 ELLFIYNPKEGKSVKLVPYNRINFSRYAIYMIHVDDKEEYIKTVWDGSYYVNMNQNLG 665
>gi|29348320|ref|NP_811823.1| hypothetical protein BT_2911 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383124515|ref|ZP_09945178.1| hypothetical protein BSIG_1739 [Bacteroides sp. 1_1_6]
gi|29340224|gb|AAO78017.1| putative Acetyl-CoA carboxylase, biotin carboxylase [Bacteroides
thetaiotaomicron VPI-5482]
gi|251841333|gb|EES69414.1| hypothetical protein BSIG_1739 [Bacteroides sp. 1_1_6]
Length = 655
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 165/552 (29%), Positives = 255/552 (46%), Gaps = 60/552 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC---- 175
L +VRL DS Q+ EYLL L+ D L+ +R A LP+ PY GWE
Sbjct: 48 LREVRL-LDSPFLDLQRKGKEYLLWLNPDSLLHFYRIEAGLPSKAAPYAGWESQDVWGAG 106
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYL-------SAFP 228
LRG F+G YLS+ ++M+ ST ++ L +++ V+ L CQK G+L F
Sbjct: 107 PLRGGFLGFYLSSVSMMYQSTDDKRLLKRLKYVLKELELCQKAGKDGFLLGLKDGRKLFA 166
Query: 229 TEQFDRLEALIP----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
+++ P WAP Y I+K+L GL YT EAL + + ++F +V +
Sbjct: 167 EVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCQMEEALPILIRLADWFGYQVLD 226
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
+ I+R L E G +N+ + + +T + + L A + G L+ D +
Sbjct: 227 KLTDDQIQR---LLICEHGSINESYVEAYELTGEKRFLDWARRLNDHAMWGPLSEGKDIL 283
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQ-----------LHKEGHQLESSGTNIGHFNFKSDP 393
G+H+NT IP G Y+ TGD+ + + H G + G F P
Sbjct: 284 FGWHANTQIPKFTGFHKYYQFTGDERFLTAATNFWNIVTQNHTWVIGGNSTGEHFF---P 340
Query: 394 KRLASN--LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
K ++ L E+C + NML+++ LF + A A YYER L N +L E G
Sbjct: 341 KEEFADRVLLVGGPETCNSVNMLRLTESLFCQYPDAAKASYYERVLFNHILS-AYDPEKG 399
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY---PGV 508
+ Y + PG Y + + SFWCC TG+ES +KL IY + P +
Sbjct: 400 MCCYFTSMRPG-----HYRIYASRDSSFWCCGHTGLESPAKLSKFIYSHSKRIIDGDPDI 454
Query: 509 YIIQYISSRLDWKSGQI-VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+ +I S L WK I ++ Q P +V+ + K L +R P W
Sbjct: 455 RVNLFIPSILFWKEKGIELIQQNRLPESE-----QVSFMLNLKKKQ-ELILRIRKPDW-- 506
Query: 568 SNGAKATLNGQ-DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQ-DDRPEYASIQ 625
++ +NG+ + P+ + V +TW+ +K+ +QLP+ + E++ DR YA
Sbjct: 507 ADKVTFIINGKVEYPILDKDGYWVVNRTWARKNKIILQLPMHVYVESLMGSDR--YA--- 561
Query: 626 AILYGPYVLAGH 637
A+LYGPYVLAG
Sbjct: 562 ALLYGPYVLAGR 573
>gi|389638620|ref|XP_003716943.1| acetyl-CoA carboxylase [Magnaporthe oryzae 70-15]
gi|351642762|gb|EHA50624.1| acetyl-CoA carboxylase [Magnaporthe oryzae 70-15]
Length = 1018
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 143/474 (30%), Positives = 225/474 (47%), Gaps = 78/474 (16%)
Query: 222 GYLSAFPTEQFDRL----------EALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMT 271
GYL A P + RL WAP+YT HKI+ GLLD Y +N++AL++
Sbjct: 390 GYLGALPEDTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQALQVV 449
Query: 272 TWMVEYFYNRVQNVIKKYSIERHWQTLNE-----------EAGGMNDVLYKLFCITQDPK 320
T M ++ + + K ++ + T ++ E GG N+V +++ +T DPK
Sbjct: 450 TRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTGDPK 509
Query: 321 HLMLAHLFDKPCFLGLLALQADDI--------------SGFHSNTHIPIVIGSQMRYEVT 366
HL A FD L A+ DDI H+NTH+P IG +E
Sbjct: 510 HLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIFEQG 569
Query: 367 GDQLHKEG----------HQLESSGTNIGHFNFKSDPKRL-------ASNLDSNTEESCT 409
G Q + + H+ +SG G++ +D L A+ + N E+CT
Sbjct: 570 GGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAETCT 629
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV----MIYLLPLAPGSSK 465
YNMLK++R+LF Y D YER L N + G + T + Y PL PGS+
Sbjct: 630 AYNMLKLARNLFLHNHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPGSN- 688
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
R Y + GT CC GTG+ES +K +++Y +++ Y+ S L W+ I
Sbjct: 689 -RDYGNTGT------CCGGTGLESHTKYQETVYL-RSADGSALWVNLYVPSTLTWEEKGI 740
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW--TSSNGAKATLNGQDL--- 580
V Q+ D ++ T+T SS+ L + LR+P W + G ++NG+
Sbjct: 741 TVRQET--AFPRDDTVKFTVTTSSRQEPL--DMKLRVPAWIQKTPGGFNVSINGEQFRPG 796
Query: 581 PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
P+PG++++V++TW++ D + I++P +R E DRP+ QAI++GP +L
Sbjct: 797 ETPTPGSYMTVSRTWATGDVVEIKMPFAVRIERAP-DRPD---TQAIMWGPLLL 846
Score = 46.6 bits (109), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAP-GEPY-GGWEE 172
++ L VRLG + + + +L D + + F A P P G P GGWE+
Sbjct: 31 VRPFRLDQVRLGEGLLQEKRDRIKT-FLREYDERRFLILFNNQAGRPNPAGLPVPGGWED 89
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
L GH+ GH+++A + +A E K K+ +V L+ACQ I
Sbjct: 90 GGL-LSGHWAGHFMTALSQAFADQGEELYKTKLDWMVKELAACQDAI 135
>gi|265751351|ref|ZP_06087414.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263238247|gb|EEZ23697.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 791
Score = 190 bits (483), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 163/658 (24%), Positives = 293/658 (44%), Gaps = 80/658 (12%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC--- 175
SL +VR+ +D Q + +YLL L+ D+L+ FR+ A L +PY WE
Sbjct: 17 SLSEVRI-TDKYFKHIQDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPFWESEDVWGG 75
Query: 176 -ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
L GH +G Y+S+ ++M+ +T+++ + ++++ +V+ L CQK G GYL A +
Sbjct: 76 GPLAGHILGFYMSSMSMMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLLATVNGKQVF 135
Query: 232 -------FDRLEALI-PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
F LI W P Y ++KI+ GL Y +A R+ M ++F V
Sbjct: 136 EDMIDGDFTTSNPLINQTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGMADWFGYEVL 195
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ + +I++ L E G +N+ ++ IT D K+L A + L+ D
Sbjct: 196 DKLNHENIQK---MLVCEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMWVPLSKGEDI 252
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQLESSGTNIGHFNFKSD 392
++G+H+NT IP G Y T ++ + + H + G + G F+
Sbjct: 253 LNGWHANTQIPKFTGFNAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNSTGEHFFEES 312
Query: 393 --PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
K++ ESC + NM++++ L++ + DYYER L N +L E
Sbjct: 313 MFEKKIPQ---YGGPESCNSVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA-NYDPEE 368
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G+ +Y P+ PG Y +GT SFWCC GTG E+ +K IY ++ +Y+
Sbjct: 369 GMCVYYTPMRPG-----HYKIYGTRYHSFWCCTGTGFEAPAKFAKMIYAHKDN---SLYV 420
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+I+S LDW I++ Q + P TL S L +RIP W +
Sbjct: 421 NMFIASTLDWNEKNIMITQSTNF-----PDEDQTLLTIKSSSTQQIDLKIRIPFWIKNKS 475
Query: 571 AKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
+N + + + S ++++++ WS D++ + L +++ A+ Y
Sbjct: 476 MVVRVNNKIVKGIKSEKGYVTISREWSDGDEIKVTFTPLLEIVPLKNSE----RYLAMTY 531
Query: 630 GPYVLA----GHSIGDWDITESATSLSDWITPI---PASYNSQLITFTQEYGNTKFV--- 679
GP VLA +IG + ++S+ + P+ P + T + GN + V
Sbjct: 532 GPIVLATKIDNTNIGKEEFRHERKTVSNVMIPMSDTPVLFG----TLNEIKGNIRRVVGK 587
Query: 680 ----LTNSNQSITMEKFPKSGTDAALHATFRLILNDS--------SGSEFSSLNDFIG 725
+ N + +++ P + + + +A + + ++D GS + ++N +G
Sbjct: 588 ELLFIYNPKEGKSVKLVPYNRINFSRYAIYMIHVDDKEEYIKTVWDGSYYVNMNQNLG 645
>gi|423228769|ref|ZP_17215175.1| hypothetical protein HMPREF1063_00995 [Bacteroides dorei
CL02T00C15]
gi|423247580|ref|ZP_17228629.1| hypothetical protein HMPREF1064_04835 [Bacteroides dorei
CL02T12C06]
gi|392631910|gb|EIY25877.1| hypothetical protein HMPREF1064_04835 [Bacteroides dorei
CL02T12C06]
gi|392635508|gb|EIY29407.1| hypothetical protein HMPREF1063_00995 [Bacteroides dorei
CL02T00C15]
Length = 811
Score = 190 bits (483), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 163/658 (24%), Positives = 293/658 (44%), Gaps = 80/658 (12%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC--- 175
SL +VR+ +D Q + +YLL L+ D+L+ FR+ A L +PY WE
Sbjct: 37 SLSEVRI-TDKYFKYIQDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPFWESEDVWGG 95
Query: 176 -ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--- 231
L GH +G Y+S+ ++M+ +T+++ + ++++ +V+ L CQK G GYL A +
Sbjct: 96 GPLAGHILGFYMSSMSMMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLLATVNGKQVF 155
Query: 232 -------FDRLEALI-PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
F LI W P Y ++KI+ GL Y +A R+ M ++F V
Sbjct: 156 EDMIDGDFTTSNPLINQTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGMADWFGYEVL 215
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ + +I++ L E G +N+ ++ IT D K+L A + L+ D
Sbjct: 216 DKLNHENIQK---MLVCEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMWVPLSKGEDI 272
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQLESSGTNIGHFNFKSD 392
++G+H+NT IP G Y T ++ + + H + G + G F+
Sbjct: 273 LNGWHANTQIPKFTGFNAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNSTGEHFFEES 332
Query: 393 --PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEP 450
K++ ESC + NM++++ L++ + DYYER L N +L E
Sbjct: 333 MFEKKIPQ---YGGPESCNSVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA-NYDPEE 388
Query: 451 GVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYI 510
G+ +Y P+ PG Y +GT SFWCC GTG E+ +K IY ++ +Y+
Sbjct: 389 GMCVYYTPMRPG-----HYKIYGTRYHSFWCCTGTGFEAPAKFAKMIYAHKDN---SLYV 440
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+I+S LDW I++ Q + P TL S L +RIP W +
Sbjct: 441 NMFIASTLDWNEKNIMITQSTNF-----PDEDQTLLTIKSSSTQQIDLKIRIPFWIKNKS 495
Query: 571 AKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
+N + + + S ++++++ WS D++ + L +++ A+ Y
Sbjct: 496 MVVRVNNKIVKGIKSEKGYVTISREWSDGDEIKVTFTPLLEIVPLKNSE----RYLAMTY 551
Query: 630 GPYVLA----GHSIGDWDITESATSLSDWITPI---PASYNSQLITFTQEYGNTKFV--- 679
GP VLA +IG + ++S+ + P+ P + T + GN + V
Sbjct: 552 GPIVLATKIDNTNIGKEEFRHERKTVSNVMIPMSDTPVLFG----TLNEIKGNIRRVVGK 607
Query: 680 ----LTNSNQSITMEKFPKSGTDAALHATFRLILNDS--------SGSEFSSLNDFIG 725
+ N + +++ P + + + +A + + ++D GS + ++N +G
Sbjct: 608 ELLFIYNPKEGKSVKLVPYNRINFSRYAIYMIHVDDKEEYIKTVWDGSYYVNMNQNLG 665
>gi|440483441|gb|ELQ63839.1| acetyl-CoA carboxylase [Magnaporthe oryzae P131]
Length = 1055
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 143/474 (30%), Positives = 225/474 (47%), Gaps = 78/474 (16%)
Query: 222 GYLSAFPTEQFDRL----------EALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMT 271
GYL A P + RL WAP+YT HKI+ GLLD Y +N++AL++
Sbjct: 427 GYLGALPEDTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQALQVV 486
Query: 272 TWMVEYFYNRVQNVIKKYSIERHWQTLNE-----------EAGGMNDVLYKLFCITQDPK 320
T M ++ + + K ++ + T ++ E GG N+V +++ +T DPK
Sbjct: 487 TRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTGDPK 546
Query: 321 HLMLAHLFDKPCFLGLLALQADDI--------------SGFHSNTHIPIVIGSQMRYEVT 366
HL A FD L A+ DDI H+NTH+P IG +E
Sbjct: 547 HLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIFEQG 606
Query: 367 GDQLHKEG----------HQLESSGTNIGHFNFKSDPKRL-------ASNLDSNTEESCT 409
G Q + + H+ +SG G++ +D L A+ + N E+CT
Sbjct: 607 GGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAETCT 666
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV----MIYLLPLAPGSSK 465
YNMLK++R+LF Y D YER L N + G + T + Y PL PGS+
Sbjct: 667 AYNMLKLARNLFLHDHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPGSN- 725
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
R Y + GT CC GTG+ES +K +++Y +++ Y+ S L W+ I
Sbjct: 726 -RDYGNTGT------CCGGTGLESHTKYQETVYLRSA-DGSALWVNLYVPSTLTWEEKGI 777
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW--TSSNGAKATLNGQDL--- 580
V Q+ D ++ T+T SS+ L + LR+P W + G ++NG+
Sbjct: 778 TVRQET--AFPRDDTVKFTVTTSSRQEPL--DMKLRVPAWIQKTPGGFNVSINGEQFRPG 833
Query: 581 PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
P+PG++++V++TW++ D + I++P +R E DRP+ QAI++GP +L
Sbjct: 834 ETPTPGSYMTVSRTWATGDVVEIKMPFAVRIERAP-DRPD---TQAIMWGPLLL 883
Score = 46.2 bits (108), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAP-GEPY-GGWEE 172
++ L VRLG + + + +L D + + F A P P G P GGWE+
Sbjct: 68 VRPFRLDQVRLGEGLLQEKRDRIKT-FLREYDERRFLILFNNQAGRPNPAGLPVPGGWED 126
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
L GH+ GH+++A + +A E K K+ +V L+ACQ I
Sbjct: 127 GGL-LSGHWAGHFMTALSQAFADQGEELYKTKLDWMVKELAACQDAI 172
>gi|440466410|gb|ELQ35678.1| acetyl-CoA carboxylase [Magnaporthe oryzae Y34]
Length = 1055
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 143/474 (30%), Positives = 225/474 (47%), Gaps = 78/474 (16%)
Query: 222 GYLSAFPTEQFDRL----------EALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMT 271
GYL A P + RL WAP+YT HKI+ GLLD Y +N++AL++
Sbjct: 427 GYLGALPEDTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQALQVV 486
Query: 272 TWMVEYFYNRVQNVIKKYSIERHWQTLNE-----------EAGGMNDVLYKLFCITQDPK 320
T M ++ + + K ++ + T ++ E GG N+V +++ +T DPK
Sbjct: 487 TRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTGDPK 546
Query: 321 HLMLAHLFDKPCFLGLLALQADDI--------------SGFHSNTHIPIVIGSQMRYEVT 366
HL A FD L A+ DDI H+NTH+P IG +E
Sbjct: 547 HLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIFEQG 606
Query: 367 GDQLHKEG----------HQLESSGTNIGHFNFKSDPKRL-------ASNLDSNTEESCT 409
G Q + + H+ +SG G++ +D L A+ + N E+CT
Sbjct: 607 GGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAETCT 666
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV----MIYLLPLAPGSSK 465
YNMLK++R+LF Y D YER L N + G + T + Y PL PGS+
Sbjct: 667 AYNMLKLARNLFLHDHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPGSN- 725
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
R Y + GT CC GTG+ES +K +++Y +++ Y+ S L W+ I
Sbjct: 726 -RDYGNTGT------CCGGTGLESHTKYQETVYLRSA-DGSALWVNLYVPSTLTWEEKGI 777
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW--TSSNGAKATLNGQDL--- 580
V Q+ D ++ T+T SS+ L + LR+P W + G ++NG+
Sbjct: 778 TVRQET--AFPRDDTVKFTVTTSSRQEPL--DMKLRVPAWIQKTPGGFNVSINGEQFRPG 833
Query: 581 PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
P+PG++++V++TW++ D + I++P +R E DRP+ QAI++GP +L
Sbjct: 834 ETPTPGSYMTVSRTWATGDVVEIKMPFAVRIERAP-DRPD---TQAIMWGPLLL 883
Score = 46.2 bits (108), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAP-GEPY-GGWEE 172
++ L VRLG + + + +L D + + F A P P G P GGWE+
Sbjct: 68 VRPFRLDQVRLGEGLLQEKRDRIKT-FLREYDERRFLILFNNQAGRPNPAGLPVPGGWED 126
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
L GH+ GH+++A + +A E K K+ +V L+ACQ I
Sbjct: 127 GGL-LSGHWAGHFMTALSQAFADQGEELYKTKLDWMVKELAACQDAI 172
>gi|423219866|ref|ZP_17206362.1| hypothetical protein HMPREF1061_03135 [Bacteroides caccae
CL03T12C61]
gi|392625071|gb|EIY19149.1| hypothetical protein HMPREF1061_03135 [Bacteroides caccae
CL03T12C61]
Length = 655
Score = 189 bits (479), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 163/550 (29%), Positives = 253/550 (46%), Gaps = 56/550 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC---- 175
L ++RL SD QQ EYLL L+ D L+ +R A L + PY GWE
Sbjct: 48 LKEIRL-SDGPFLDLQQKGKEYLLWLNPDSLLHFYRIEAGLSSKAGPYAGWESQDVWGAG 106
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFD 233
LRG F+G YLS+ ++M+ ST + L ++ V+ L CQ+ G+L E F
Sbjct: 107 PLRGGFLGFYLSSVSMMYQSTGDRELLRRLKYVLKELKLCQEAGKDGFLLGVKGGRELFR 166
Query: 234 RLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
+ + + WAP Y I+K+L GL YT D EAL + + ++F ++
Sbjct: 167 EVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCDLKEALPILVRLADWFGSQ--- 223
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
V+ K + E+ Q L E G +N+ +++ +T + L A + L+ D +
Sbjct: 224 VLDKLTDEQIQQLLICEHGSINESYVEVYELTGQKRFLDWARRLNDRAMWVPLSEGKDVL 283
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQ-----------LHKEGHQLESSGTNIGHFNFKSDP 393
G+H+NT IP G Y TGD+ + K+ H G + G +F S
Sbjct: 284 FGWHANTQIPKFTGFHKYYMFTGDRAFLLAATNFWNIVKQNHTWVIGGNSTGE-HFFSKK 342
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
+ + L + E+C + NML+++ LF + A YYER+L N +L + G+
Sbjct: 343 EFIDRMLHISGPETCNSVNMLRLTEALFMQQPDATKAAYYERTLFNHILSAYDPVK-GMC 401
Query: 454 IYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY---FEEEGKYPGVYI 510
Y + PG Y + + SFWCC TG+ES +KLG IY + + +
Sbjct: 402 CYFTSMRPG-----HYRIYASRDSSFWCCGHTGLESPAKLGKFIYSHKVTNRHQEKDIRV 456
Query: 511 IQYISSRLDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN 569
+I S L WK G ++ Q P +V LT + K L +R P WT +
Sbjct: 457 NLFIPSILSWKEEGVELIQQSRIPESE-----QVDLTLNLKKKQ-KLILRIRKPDWT--D 508
Query: 570 GAKATLNG-QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQD-DRPEYASIQAI 627
A +NG ++ PL + + + W + +T++LP+ + TE + DR A+
Sbjct: 509 KATFIINGEEEQPLLGSDGYWIIDRVWERKNVITLRLPMHIYTENLTGTDR-----YVAL 563
Query: 628 LYGPYVLAGH 637
LYGPYVLAG
Sbjct: 564 LYGPYVLAGR 573
>gi|265753026|ref|ZP_06088595.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263236212|gb|EEZ21707.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 808
Score = 188 bits (478), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 163/599 (27%), Positives = 261/599 (43%), Gaps = 72/599 (12%)
Query: 104 QFKVPERSGEFLKEVSLHDVRL-GSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPA 162
+ KV +G+ + SL +VRL SD H N Y+L L+ D+L+ FR+ A L
Sbjct: 23 KVKVEPVNGDKISLFSLKEVRLLDSDFKH--IMDLNHAYMLSLEPDRLLSWFRREAGLTP 80
Query: 163 PGEPYGGWEEPSCE----LRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE 218
+PY WE L GH +G YLS ++M+ ST + ++ ++S ++ LS CQ+
Sbjct: 81 KAQPYPFWESEYMNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQA 140
Query: 219 IGSGYLSAFPT------------EQFDRLEALI-----PVWAPYYTIHKILAGLLDQYTY 261
G GYL PT F I W P Y ++KI+ GL Y
Sbjct: 141 GGDGYL--LPTICGRAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMR 198
Query: 262 ADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKH 321
D +A + M ++F +VI K S + + L E G +N+ ++ IT + K+
Sbjct: 199 CDLLQAKEILVKMADWF---GYSVIDKLSHDDLQKLLVCEHGSINESFIDVYQITGEEKY 255
Query: 322 LMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG------- 374
L A + ++ D + G+H+NT IP G + Y ++
Sbjct: 256 LKWAQRLNDEDMWVPMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDT 315
Query: 375 ----HQLESSGTNIGHFNFKSDPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAY 429
H G + G F P+ ++ N ESC + NML+++ L+ E+
Sbjct: 316 VVRKHTWVMGGNSTGEHFFA--PEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEK 373
Query: 430 ADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIES 489
DYYE+ L N +L + G+ +Y + PG Y +GT DSFWCC GTG E
Sbjct: 374 VDYYEKVLFNHILA-NYDPDQGMCVYYTSMKPGH-----YKIYGTKYDSFWCCTGTGFEQ 427
Query: 490 FSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSS 549
+K G IY + +Y+ +I S + W G + + P +LT S
Sbjct: 428 TAKFGQMIYAHTDD---ALYVNMFIPSVVTWNKGVSIHQETAFPDEG-----VTSLTVSG 479
Query: 550 KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLT 608
+ +L +R P W S+ +NG+ + + + ++S+ + W DK+ I+LP+
Sbjct: 480 EA---VFNLKIRCPYWVGSSSLNVIVNGKREKIKAGMDGYVSINRQWKDGDKVRIELPMK 536
Query: 609 LRTEAIQDDRPEYASIQAILYGPYVLAGH------SIGDWDITESATSLSDW-ITPIPA 660
L + E A A+ YGP VLA S D+ S ++ D+ + +PA
Sbjct: 537 LEIVPLN----EAAHYLALKYGPIVLAARISDEHLSKDDFRSARSTVAMKDYPVIDVPA 591
>gi|153805786|ref|ZP_01958454.1| hypothetical protein BACCAC_00022 [Bacteroides caccae ATCC 43185]
gi|149130463|gb|EDM21669.1| hypothetical protein BACCAC_00022 [Bacteroides caccae ATCC 43185]
Length = 659
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 163/550 (29%), Positives = 252/550 (45%), Gaps = 56/550 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC---- 175
L ++RL SD QQ EYLL L+ D L+ +R A L + PY GWE
Sbjct: 52 LKEIRL-SDGPFLDLQQKGKEYLLWLNPDSLLHFYRIEAGLSSKAGPYAGWESQDVWGAG 110
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFD 233
LRG F+G YLS+ ++M+ ST + L ++ V+ L CQ+ G+L E F
Sbjct: 111 PLRGGFLGFYLSSVSMMYQSTGDRELLRRLKYVLKELKLCQEAGKDGFLLGVKGGRELFR 170
Query: 234 RLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
+ + + WAP Y I+K+L GL YT D EAL + + ++F ++
Sbjct: 171 EVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCDLKEALPILVRLADWFGSQ--- 227
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
V+ K + E+ Q L E G +N+ +++ +T + L A + L+ D +
Sbjct: 228 VLDKLTDEQIQQLLICEHGSINESYVEVYELTGQKRFLDWARRLNDRAMWVPLSEGKDVL 287
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQ-----------LHKEGHQLESSGTNIGHFNFKSDP 393
G H+NT IP G Y TGD+ + K+ H G + G +F S
Sbjct: 288 FGGHANTQIPKFTGFHKYYMFTGDRAFLLAATNFWNIVKQNHTWVIGGNSTGE-HFFSKK 346
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
+ + L + E+C + NML+++ LF + A YYER+L N +L + G+
Sbjct: 347 EFIDRMLHISGPETCNSVNMLRLTEALFMQQPDATKAAYYERTLFNHILSAYDPVK-GMC 405
Query: 454 IYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY---FEEEGKYPGVYI 510
Y + PG Y + + SFWCC TG+ES +KLG IY + + +
Sbjct: 406 CYFTSMRPG-----HYRIYASRDSSFWCCGHTGLESPAKLGKFIYSHKVTNRHQEKDIRV 460
Query: 511 IQYISSRLDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN 569
+I S L WK G ++ Q P +V LT + K L +R P WT +
Sbjct: 461 NLFIPSILSWKEEGVELIQQSRIPESE-----QVDLTLNLKKKQ-KLILRIRKPDWT--D 512
Query: 570 GAKATLNG-QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQD-DRPEYASIQAI 627
A +NG ++ PL + + + W + +T++LP+ + TE + DR A+
Sbjct: 513 KATFIINGEEEQPLLGSDGYWIIDRVWERKNVITLRLPMHIYTENLTGTDR-----YVAL 567
Query: 628 LYGPYVLAGH 637
LYGPYVLAG
Sbjct: 568 LYGPYVLAGR 577
>gi|336428272|ref|ZP_08608256.1| hypothetical protein HMPREF0994_04262 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336006508|gb|EGN36542.1| hypothetical protein HMPREF0994_04262 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 601
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 132/414 (31%), Positives = 202/414 (48%), Gaps = 31/414 (7%)
Query: 121 HDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARL-----PAPGEPYGGWEEPSC 175
VRL DS R Q N + LL L+ ++ A L P + GWE P+
Sbjct: 11 QQVRL-LDSEIRRRFQVNEDLLLRYQSKDLLRSYYFEAGLWKDNSENPKIEHWGWEGPTS 69
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRL 235
E+RGHFVGH+LSA+A+ +AS N L + ++ L CQK G ++ A P +Q
Sbjct: 70 EIRGHFVGHWLSAAAITYASDGNRELLGRAEYMLDELERCQKANGGEWIGAIPEKQLRWT 129
Query: 236 EALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHW 295
E P Y +HKI+ GL+D Y YA N +AL + ++FY V+++ +R
Sbjct: 130 EEGRNFGVPLYNLHKIIMGLIDMYVYAGNCKALEIVGHFADWFYRWVKDI----PTDRMD 185
Query: 296 QTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLF-DKPCFLGLLALQADDISGFHSNTHIP 354
+ E GG+ + +L+ IT + K+ +L F +P F LL D ++ H+NT IP
Sbjct: 186 IIMETETGGILEEWCRLYEITGEEKYQVLMEKFLRRPLFHALLE-NKDVLTNMHANTTIP 244
Query: 355 IVIGSQMRYEVTG--DQLHKEGHQLESSGTNIGHFNFKSD--------PKRLASNLDSNT 404
++G YEVTG + L + + T G F P + L
Sbjct: 245 EILGIARMYEVTGNPEYLKAVKNYWSIAVTKRGGFVTGGQTSGEVWIPPFHIRERLGKLN 304
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSS 464
+E C YNM++++ L+++T +I + +Y E +L NG+L Q+ G Y LP+ GS
Sbjct: 305 QEHCAVYNMMRLAEFLYQYTGDIEFENYRELNLYNGILA-QQNPNTGAAAYYLPMQAGSR 363
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
K W T SFWCC G+GI++ + G IY E + + + + Q+I S L
Sbjct: 364 K-----IWSTEKKSFWCCCGSGIQAGASHGMGIYAENKNQ---IAVNQFIPSVL 409
>gi|212695364|ref|ZP_03303492.1| hypothetical protein BACDOR_04911 [Bacteroides dorei DSM 17855]
gi|345513936|ref|ZP_08793451.1| hypothetical protein BSEG_01968 [Bacteroides dorei 5_1_36/D4]
gi|423230909|ref|ZP_17217313.1| hypothetical protein HMPREF1063_03133 [Bacteroides dorei
CL02T00C15]
gi|423241462|ref|ZP_17222575.1| hypothetical protein HMPREF1065_03198 [Bacteroides dorei
CL03T12C01]
gi|423244620|ref|ZP_17225695.1| hypothetical protein HMPREF1064_01901 [Bacteroides dorei
CL02T12C06]
gi|212662093|gb|EEB22667.1| hypothetical protein BACDOR_04911 [Bacteroides dorei DSM 17855]
gi|229435750|gb|EEO45827.1| hypothetical protein BSEG_01968 [Bacteroides dorei 5_1_36/D4]
gi|392630029|gb|EIY24031.1| hypothetical protein HMPREF1063_03133 [Bacteroides dorei
CL02T00C15]
gi|392641355|gb|EIY35132.1| hypothetical protein HMPREF1065_03198 [Bacteroides dorei
CL03T12C01]
gi|392641469|gb|EIY35245.1| hypothetical protein HMPREF1064_01901 [Bacteroides dorei
CL02T12C06]
Length = 808
Score = 186 bits (473), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 165/602 (27%), Positives = 265/602 (44%), Gaps = 78/602 (12%)
Query: 104 QFKVPERSGEFLKEVSLHDVRL-GSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPA 162
+ KV +G+ + SL +VRL SD H N Y+L L+ D+L+ FR+ A L
Sbjct: 23 KVKVEPVNGDKISLFSLKEVRLLDSDFKH--IMDLNHAYMLSLEPDRLLSWFRREAGLTP 80
Query: 163 PGEPYGGWEEPSCE----LRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKE 218
+PY WE L GH +G YLS ++M+ ST + ++ ++S ++ LS CQ+
Sbjct: 81 KAQPYPFWESEYMNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQA 140
Query: 219 IGSGYLSAFPT------------EQFDRLEALI-----PVWAPYYTIHKILAGLLDQYTY 261
G GYL PT F I W P Y ++KI+ GL Y
Sbjct: 141 GGDGYL--LPTICGRAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMR 198
Query: 262 ADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKH 321
D +A + M ++F +VI K S + + L E G +N+ ++ IT + K+
Sbjct: 199 CDLLQAKEILVKMADWF---GYSVIDKLSHDDLQKLLVCEHGSINESFIDVYQITGEEKY 255
Query: 322 LMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG------- 374
L A + ++ D + G+H+NT IP G + Y ++
Sbjct: 256 LKWAQRLNDEDMWVPMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDT 315
Query: 375 ----HQLESSGTNIGHFNFKSDPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAY 429
H G + G F P+ ++ N ESC + NML+++ L+ E+
Sbjct: 316 VVRKHTWVMGGNSTGEHFFA--PEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEK 373
Query: 430 ADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIES 489
DYYE+ L N +L + G+ +Y + PG Y +GT DSFWCC GTG E
Sbjct: 374 VDYYEKVLFNHILA-NYDPDQGMCVYYTSMKPGH-----YKIYGTKYDSFWCCTGTGFEQ 427
Query: 490 FSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKV---DPVVSWDPYLRVTLT 546
+K G IY + +Y+ +I S + W G I ++Q+ D V+ +LT
Sbjct: 428 TAKFGQMIYAHTDD---ALYVNMFIPSVVTWDKG-ISIHQETAFPDEGVT-------SLT 476
Query: 547 FSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQL 605
S + +L +R P W S+ +NG+ + + + ++S+ + W DK+ I+L
Sbjct: 477 VSGEA---VFNLKIRCPYWVGSSSLNVIVNGKREKIKAGVDGYVSINRQWKDGDKVRIEL 533
Query: 606 PLTLRTEAIQDDRPEYASIQAILYGPYVLAGH------SIGDWDITESATSLSDW-ITPI 658
P+ L + E A+ YGP VLA S D+ S ++ D+ + +
Sbjct: 534 PMKLEIVPLN----EATHYLALKYGPIVLAARISDEHLSKDDFRSARSTVAMKDYPVIDV 589
Query: 659 PA 660
PA
Sbjct: 590 PA 591
>gi|237711616|ref|ZP_04542097.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|229454311|gb|EEO60032.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
Length = 780
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 163/595 (27%), Positives = 261/595 (43%), Gaps = 78/595 (13%)
Query: 111 SGEFLKEVSLHDVRL-GSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGG 169
+G+ + SL +VRL SD H N Y+L L+ D+L+ FR+ A L +PY
Sbjct: 2 NGDKISLFSLKEVRLLDSDFKH--IMDLNHAYMLSLEPDRLLSWFRREAGLTPKAQPYPF 59
Query: 170 WEEPSCE----LRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLS 225
WE L GH +G YLS ++M+ ST + ++ ++S ++ LS CQ+ G GYL
Sbjct: 60 WESEYMNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQAGGDGYL- 118
Query: 226 AFPT------------EQFDRLEALI-----PVWAPYYTIHKILAGLLDQYTYADNAEAL 268
PT F I W P Y ++KI+ GL Y D +A
Sbjct: 119 -LPTICGRAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMRCDLLQAK 177
Query: 269 RMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLF 328
+ M ++F +VI K S + + L E G +N+ ++ IT + K+L A
Sbjct: 178 EILVKMADWF---GYSVIDKLSHDDLQKLLVCEHGSINESFIDVYQITGEEKYLKWAQRL 234
Query: 329 DKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG-----------HQL 377
+ ++ D + G+H+NT IP G + Y ++ H
Sbjct: 235 NDEDMWVPMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDTVVRKHTW 294
Query: 378 ESSGTNIGHFNFKSDPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAYADYYERS 436
G + G F P+ ++ N ESC + NML+++ L+ E+ DYYE+
Sbjct: 295 VMGGNSTGEHFFA--PEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEKVDYYEKV 352
Query: 437 LTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDS 496
L N +L + G+ +Y + PG Y +GT DSFWCC GTG E +K G
Sbjct: 353 LFNHILA-NYDPDQGMCVYYTSMKPGH-----YKIYGTKYDSFWCCTGTGFEQTAKFGQM 406
Query: 497 IYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKV---DPVVSWDPYLRVTLTFSSKGSG 553
IY + +Y+ +I S + W G I ++Q+ D V+ +LT S +
Sbjct: 407 IYAHTDD---ALYVNMFIPSVVTWDKG-ISIHQETAFPDEGVT-------SLTVSGEA-- 453
Query: 554 LTTSLNLRIPTWTSSNGAKATLNGQDLPLPSP-GNFLSVTKTWSSDDKLTIQLPLTLRTE 612
+L +R P W S+ +NG+ + + ++S+ + W DK+ I+LP+ L
Sbjct: 454 -VFNLKIRCPYWVGSSSLNVIVNGKREKIKAGVDGYVSINRQWKDGDKVRIELPMKLEIV 512
Query: 613 AIQDDRPEYASIQAILYGPYVLAGH------SIGDWDITESATSLSDW-ITPIPA 660
+ E A+ YGP VLA S D+ S ++ D+ + +PA
Sbjct: 513 PLN----EATHYLALKYGPIVLAARISDEHLSKDDFRSARSTVAMKDYPVIDVPA 563
>gi|427384823|ref|ZP_18881328.1| hypothetical protein HMPREF9447_02361 [Bacteroides oleiciplenus YIT
12058]
gi|425728084|gb|EKU90943.1| hypothetical protein HMPREF9447_02361 [Bacteroides oleiciplenus YIT
12058]
Length = 813
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 151/556 (27%), Positives = 249/556 (44%), Gaps = 64/556 (11%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L +VRL S + A Q + +YLL D+++++ RK +P + Y G +P+ R
Sbjct: 43 LSEVRLLPGSPFYHAMQVSQQYLLDADIERMLNGRRKEVGIPEK-KAYPGSNQPAGT-RA 100
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSA-----FPTEQFDR 234
HY+S ++LM+A T + ++++ ++ L+ S Y P + +
Sbjct: 101 TDWHHYISGTSLMYAQTGDRRFLDRVNYLIDELAMLDNRKDSLYRVQGKKLELPYAKLMK 160
Query: 235 LEALIP------------VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRV 282
E L+ W P+Y HK A D Y Y DN +AL + E V
Sbjct: 161 GELLLNSPDEAGYPWGGLCWIPFYWQHKEFAAYRDAYLYCDNLKALNLWIKQAE----PV 216
Query: 283 QNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQAD 342
I K + + L+ E GG+N V L+ +T D ++L ++ + + +A D
Sbjct: 217 TEFILKVNPDLFEGFLDIENGGINAVFADLYALTGDERYLAVSMKLNHQKVILNIANGKD 276
Query: 343 DISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ----------LESSGTNIGHFNFKSD 392
+ G H+N +P G+ +Y++TGD++ ++ Q + G N + F
Sbjct: 277 VLYGRHANFQLPAFEGTARQYQLTGDEVCRKATQNFAGIYYRDHMNCIGGNSCYERFGRS 336
Query: 393 PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV 452
+ + L S + E+C TYNM+K++ + F T ++ + DY+ER+L N +L Q GV
Sbjct: 337 GE-ITKRLGSTSSETCNTYNMMKIALNTFESTGDLHHMDYFERALYNHILASQDPETGGV 395
Query: 453 MIYLLPLAPGSSKERSYHHWGTPSDSF-----WCCYGTGIESFSKLGDSIYFEEEGKYPG 507
Y + L PG K SY SD F WCC GTG+E+ SK G+ IYF +
Sbjct: 396 TYYTM-LLPGGFK--SY------SDRFNIEGIWCCVGTGMENHSKYGECIYF---NNHQS 443
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
+Y+ +I S L+WK + + Q+ D P TLT G+ + +R P W
Sbjct: 444 LYVNLFIPSELNWKEKNLHLKQETDFPQGDC-----TTLTILESGA-YNHPIYIRYPHWA 497
Query: 567 SSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
+N ++ PL G ++ + W + D++ I++ T R EA DD +
Sbjct: 498 GRE-VSVRINDEEYPLHAQAGEYIRLQHPWKTGDRIRIEMKQTFRLEAAPDD----PFMN 552
Query: 626 AILYGPYVLAGHSIGD 641
I GP A D
Sbjct: 553 VIFRGPIAYAAQLGAD 568
>gi|444305788|ref|ZP_21141565.1| hypothetical protein G205_09453 [Arthrobacter sp. SJCon]
gi|443481842|gb|ELT44760.1| hypothetical protein G205_09453 [Arthrobacter sp. SJCon]
Length = 444
Score = 180 bits (456), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 138/417 (33%), Positives = 195/417 (46%), Gaps = 42/417 (10%)
Query: 128 DSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLS 187
DS +AQ T++ Y+L LD D+L + A L E YG WE S L GH GHYLS
Sbjct: 18 DSPFRQAQDTSVRYILSLDADRLFAPYLHEAGLVRAAEAYGNWE--SDGLGGHIGGHYLS 75
Query: 188 ASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT-----EQFDRLEA----- 237
A ++A+T N L K+ A V L CQ G GY+ P ++ R E
Sbjct: 76 GCARLYAATGNAELLAKVRAAVVILGNCQAAHGDGYVGGVPRGGDLGQELARGEVDADLF 135
Query: 238 -LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQ 296
L W P Y +HK LAGLLD +A + EAL + + ++ RV + + E +
Sbjct: 136 TLNGRWVPLYNLHKTLAGLLDARVFAGSGEALDIAVGLAGWWL-RVSAHLADDAFE---E 191
Query: 297 TLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIV 356
L+ E GGMN+ L+ +T ++L A F L LA D + G H+NT IP V
Sbjct: 192 VLHAEFGGMNEAFALLWELTGREEYLREARRFSHRALLDPLAAGQDLLDGLHANTQIPKV 251
Query: 357 IGSQMRYEVT--GDQLHKEGHQLES----SGTNIG------HFNFKSDPKRLASNLDSNT 404
+G T D H ES +IG HF+ SD + D
Sbjct: 252 VGYARLAGPTHDADLAHACDIFWESVVSRRSVSIGGNSVREHFHPASDFSPMVQ--DPQG 309
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQR-GTEPGVMIYLLPLAPGS 463
E+C TYNMLK+++ F + A D++ER+ N +L Q GT G ++Y P+ PG
Sbjct: 310 PETCNTYNMLKLAKLRFEAHGDAAAVDFFERATYNHILSSQHPGT--GGLVYFTPMRPG- 366
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW 520
Y + +S WCC G+G+E+ ++ G+ IY + + YI S LDW
Sbjct: 367 ----HYRVYSRAQESMWCCVGSGLENHARYGELIYSRAGND---LLVNLYIPSTLDW 416
>gi|330467692|ref|YP_004405435.1| glycosylase [Verrucosispora maris AB-18-032]
gi|328810663|gb|AEB44835.1| glycosylase [Verrucosispora maris AB-18-032]
Length = 1126
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 140/472 (29%), Positives = 223/472 (47%), Gaps = 83/472 (17%)
Query: 222 GYLSAFPTEQFDRL----------EALIPVWAPYYTIHKILAGLLDQYTYADNAEAL--- 268
GYL A P + RL A WAP+YT HKI+ GLLD Y + DNA AL
Sbjct: 416 GYLGAIPEDAVLRLGPPRWAVYGSNATTNTWAPWYTQHKIMRGLLDAYYHTDNATALDVV 475
Query: 269 -RMTTW------MVEYFYNRVQNVIKKYSIERHWQT-LNEEAGGMNDVLYKLFCITQDPK 320
+M W + + + I + ++ W + E GG N+V +++ +T D K
Sbjct: 476 VKMAGWAHLALTIGDKNHPAYTGPITRDNLNYMWDLYIAGETGGANEVFPEIYALTGDQK 535
Query: 321 HLMLAHLFDKPCFLGLLALQADDI--------------SGFHSNTHIPIVIGSQMRYEVT 366
HL A LFD L ++ DI H+N+H+P +G YE +
Sbjct: 536 HLETAKLFDNRESLFDACVENRDILVVTPQNNPGRRRPDRLHANSHVPQFVGYLRVYEHS 595
Query: 367 GDQLHKEG----------HQLESSGTNIGHF-------NFKSDPKRLASNLDSNTEESCT 409
GD + + H++ ++G G++ + +A+++ E+CT
Sbjct: 596 GDTEYFQAAKNFYGMVVPHRMYANGGTGGNYPGSNNNIELFQNRGNIANSIAQGGAETCT 655
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGT----EPGVMIYLLPLAPGSSK 465
TYN+LK++R+LF + AY DYYER L N + G + T P V Y PL PG++
Sbjct: 656 TYNLLKLARNLFFHEHDAAYLDYYERGLINQIAGSRADTTTVSNPQVT-YFQPLTPGAN- 713
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE-EGKYPGVYIIQYISSRLDWKSGQ 524
R Y + GT CC GTG+E+ +K ++IYF+ +G +++ Y++S L W
Sbjct: 714 -RGYGNTGT------CCGGTGVENHTKYQETIYFKSADGDT--LWVNLYVASTLTWAERD 764
Query: 525 IVVNQKVDPVVSWDPYLRVTLT-FSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
+ Q+ D Y R T + GSG + LR+P W G T+NG +
Sbjct: 765 FTITQQTD-------YPRADRTRLTVDGSG-PLDIKLRVPGWVRK-GFFVTINGLAQQVT 815
Query: 584 SPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
+ N +L++++TW D + I++P ++R E DRP+ Q++ +GP +L
Sbjct: 816 ATANSYLTLSRTWQRGDVIEIRMPFSIRIERAL-DRPD---TQSVFWGPVLL 863
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 51/116 (43%), Gaps = 4/116 (3%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPG--EPYGGWEE 172
++ L DV LG D + + YL LD + + F A P P GGWE+
Sbjct: 62 VRPFRLRDVTLG-DGLFQEKRDRMKNYLRQLDERRFLVLFNNQAGRPNPAGVTAPGGWED 120
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP 228
L GH+ GH ++A A +A K K+ +V L+ACQ I + S P
Sbjct: 121 GGL-LSGHWAGHVMTALAQGYADHGEPIFKSKLDWIVDELAACQTAITARMGSGGP 175
>gi|257068350|ref|YP_003154605.1| hypothetical protein Bfae_11690 [Brachybacterium faecium DSM 4810]
gi|256559168|gb|ACU85015.1| uncharacterized conserved protein [Brachybacterium faecium DSM
4810]
Length = 752
Score = 176 bits (446), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 145/532 (27%), Positives = 232/532 (43%), Gaps = 40/532 (7%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRG 179
L VRL + + AQ+T+LEYLL L+ ++L+ FR+ A + PYG WE S L G
Sbjct: 12 LESVRL-REGLFAAAQRTDLEYLLGLEAERLLAPFRREAGIATTAAPYGNWE--SMGLDG 68
Query: 180 HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRLEA 237
H GH L+A++LMWA+T +E E +V L CQ +G+GY+ P E + ++
Sbjct: 69 HIGGHALAAASLMWAATGDERAAELARQLVEGLRECQARLGTGYVGGIPGGAELWAQIRT 128
Query: 238 LIP---------VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKK 288
+ W P+Y +HK AGL++ +A A ++ + + ++
Sbjct: 129 IASQAQTWDLGGAWVPWYNLHKTFAGLIEAVRHAPAGTA-SCALEVLRGLGDWGARLGEQ 187
Query: 289 YSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFH 348
E + L E GGM L IT + +H +A F L L D++ G H
Sbjct: 188 LDDEAFARMLRTEFGGMCAAYADLAEITGEERHARMARRFADESLLAPLRAGRDELDGMH 247
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHK----EGHQLESSGTNIGHFNFKSDPKRLASNLDSNT 404
+NT I VIG E + E L G ++ +F ++P LA D
Sbjct: 248 ANTQIAKVIGWPALGETAAAETFVRTVLERRTLAFGGNSVAE-HFTAEP--LAHVTDREG 304
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSS 464
ESC T NML+ + L+ D ER L VL Q G +Y P PG
Sbjct: 305 PESCNTVNMLEAEQRLYEHGGGPWLFDAIERQLVGHVLSAQH--PEGGFVYFTPARPG-- 360
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
Y + T + WCC GTG+E +++ G + + G + + + + L W+ Q
Sbjct: 361 ---HYRVYSTRENGMWCCVGTGLEVYARTGRFTFAAQGGD---LLVNLPLPASLRWEE-Q 413
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS 584
+ P P VTL + ++++R+P W ++ +++GQD+ +
Sbjct: 414 GIAAHLDSPYPRPAPETPVTLRIEADAPS-DVAVHVRVPAWATTP-PTVSVDGQDVTAHA 471
Query: 585 P-GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+++V + W + L TL + P S ++ +GP VLA
Sbjct: 472 ELDGYVTVRRRWQGGEVLR----WTLHAGPSWEPLPGEDSWGSLRWGPVVLA 519
>gi|402081502|gb|EJT76647.1| acetyl-CoA carboxylase [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1032
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 132/471 (28%), Positives = 212/471 (45%), Gaps = 75/471 (15%)
Query: 222 GYLSAFPTEQFDRL----------EALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMT 271
GYL A P + RL +A WAP+YT HKI+ GLLD Y +N +AL +
Sbjct: 404 GYLGALPEDTVLRLGPPRWAIYGGDAATNTWAPWYTQHKIMRGLLDAYYNTNNTQALDVV 463
Query: 272 TWMVEYFYNRVQNVIKKY----------SIERHWQT-LNEEAGGMNDVLYKLFCITQDPK 320
M ++ + + K Y + R W + E+GG N+V +L+ +T D +
Sbjct: 464 VKMADWAHLALTIGDKNYPGYTGNLTRDDLNRMWDLYIAGESGGANEVFPELYELTGDSR 523
Query: 321 HLMLAHLFDKPCFLGLLALQADDI--------------SGFHSNTHIPIVIGSQMRYEVT 366
HL A FD L A++ DI H+N H+P IG +E +
Sbjct: 524 HLETAKAFDNRASLFDAAVEDRDILVLTRDKNPGPRRTDRLHANMHVPQFIGYLRIFEQS 583
Query: 367 GDQLHKEG----------HQLESSGTNIGHFNFKSDPKRL-------ASNLDSNTEESCT 409
+Q + + H+ +SG G++ ++ + A+ + N E+CT
Sbjct: 584 REQDYLDAARNFYSWVFPHRQFASGGTGGNYPGSNNNAEMFQNRGNIANAIAENGAETCT 643
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV---MIYLLPLAPGSSKE 466
TYNMLK++R+LF Y D YER L N + G + T + Y PL PG+S
Sbjct: 644 TYNMLKLARNLFMHEHNATYMDGYERGLFNMIAGSRADTATTADPQLTYFQPLTPGAS-- 701
Query: 467 RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIV 526
R Y + GT CC G+G+ES +K +++Y +++ ++ S L W
Sbjct: 702 RDYGNTGT------CCGGSGLESHTKYQETVYLRSA-DGSALWVNLFVPSTLTWGEKAFS 754
Query: 527 VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL---P 583
+ Q ++ LT ++ G G + LR+P W T+NG+ P P
Sbjct: 755 LRQD----TAFPRADSTKLTVTAAGGGGPLDIKLRVPAWAQRGTVTVTVNGEADPAAQTP 810
Query: 584 SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
PG +L++ + W + D + +++P +R E DRP+ QA++ GP +L
Sbjct: 811 LPGTYLTLARAWRAGDTIEMRMPFRVRVERAP-DRPD---TQALMRGPVLL 857
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 4/107 (3%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPY--GGWEE 172
++ L VRLG + + +T ++L D + + F K A P+ G GGWE+
Sbjct: 45 VRPFRLDQVRLGDGLLQEKRDRTK-DFLREFDERRFLVLFNKQAGRPSAGGVAVPGGWED 103
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEI 219
L GH+ GHY++A + +A E K K+ +V L+ACQK I
Sbjct: 104 GGL-LSGHWAGHYMTALSQAYADQGEEVFKAKLDWMVQELAACQKAI 149
>gi|423223251|ref|ZP_17209720.1| hypothetical protein HMPREF1062_01906 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392639352|gb|EIY33177.1| hypothetical protein HMPREF1062_01906 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 643
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 153/547 (27%), Positives = 247/547 (45%), Gaps = 54/547 (9%)
Query: 119 SLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC--- 175
SL DVRL +S QQ EYLL L+ D L+ +R A L Y GWE
Sbjct: 41 SLEDVRL-LESPFLDLQQKGKEYLLWLNPDSLLHFYRIEAGLQPKARAYAGWESQDVWGA 99
Query: 176 -ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYL-------SAF 227
LRG F+G YLS+ ++M+ +T ++ L +++ V++ L CQK G+L F
Sbjct: 100 GPLRGGFLGFYLSSVSMMYQATGDKELLKRLQYVLNELELCQKAGKDGFLLGIKDGRKLF 159
Query: 228 PTEQFDRLEALIP----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ 283
+++ P WAP Y I+K+L GL Y +AL M + ++F +V
Sbjct: 160 SEVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYAQCGQEKALPMMIRLADWFGYQVL 219
Query: 284 NVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADD 343
+ + ++R L E G +N+ +++ +T + + L A + L+ D
Sbjct: 220 DKLTDEQVQR---LLVCEHGSINESFVEIYKLTGEIRFLEWAGRLNDRAMWVPLSEGKDI 276
Query: 344 ISGFHSNTHIPIVIGSQMRYEVTGDQ-----------LHKEGHQLESSGTNIG-HFNFKS 391
+ G+H+NT IP G + YE TGD+ + + H G + G HF K
Sbjct: 277 LFGWHANTQIPKFTGFEKYYEATGDKRLLNAAMNFWDIVNQNHTWVIGGNSTGEHFFPKK 336
Query: 392 DPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+ + L E+C + NML+++ LF + + A YYER L N +L + G
Sbjct: 337 EFEERV--LLKGGPETCNSVNMLRLTETLFSYQPDAKKAAYYERVLFNHILSAYDPVK-G 393
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
+ Y + PG Y + + SFWCC TG+ES +KLG IY ++G G+ +
Sbjct: 394 MCCYFTSMRPG-----HYRIYASRDSSFWCCGHTGLESPAKLGKFIYSRDKG---GIRVN 445
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+I S L K + + Q S R+ L T +L +R P W +
Sbjct: 446 LFIPSVLTSKELGMELAQYSHMPESDKVEFRLNLQDER-----TLTLRIRRPDWAKN--P 498
Query: 572 KATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NG++ + + + + + W +++ ++LP+ TE + A+LYG
Sbjct: 499 ILVINGKEEAIDTDTSGYWVLDRKWKKKNRIILKLPMEPYTENLVGS----DKYVALLYG 554
Query: 631 PYVLAGH 637
PYVLAG
Sbjct: 555 PYVLAGR 561
>gi|336404182|ref|ZP_08584880.1| hypothetical protein HMPREF0127_02193 [Bacteroides sp. 1_1_30]
gi|335943510|gb|EGN05349.1| hypothetical protein HMPREF0127_02193 [Bacteroides sp. 1_1_30]
Length = 650
Score = 175 bits (444), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 163/551 (29%), Positives = 250/551 (45%), Gaps = 58/551 (10%)
Query: 120 LHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSC---- 175
L++VRL DS QQ EYLL L+ D L+ +R A LP + Y GWE +
Sbjct: 39 LNEVRL-LDSPFLTLQQKGKEYLLWLNPDSLLHFYRVEAGLPPKADAYAGWESQNVWGAG 97
Query: 176 ELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSA-------FP 228
LRG F+G YLS+ ++M ST ++ L +++ V+ L CQ G+L F
Sbjct: 98 PLRGGFLGFYLSSVSMMHQSTGDKELLKRLKYVLKELKLCQDAGKDGFLLGIKDGRMLFK 157
Query: 229 TEQFDRLEALIP----VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQN 284
+++ P WAP Y I+K+L GL YT EAL M + ++F
Sbjct: 158 EVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCGLEEALPMMIRLADWF---GYQ 214
Query: 285 VIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
V+ K S E+ + L E G +N+ + + +T + L A L+ D +
Sbjct: 215 VLDKLSDEQIQKLLVCEHGSINESYVEAYELTGQKRFLDWARRLHDRAMWVPLSEGKDIL 274
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQ-----------LHKEGHQLESSGTNIGHFNFKSDP 393
G+H+NT IP G Y TGD+ + H G + G F P
Sbjct: 275 YGWHANTQIPKFTGFHKYYMFTGDKRFLTAATNFWNIVNRNHTWVIGGNSTGEHFF---P 331
Query: 394 KRLASN--LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
K ++ L E+C + NML+++ LF + A YYER L N +L + G
Sbjct: 332 KEEFADRLLLKGGPETCNSVNMLRLTESLFSQYPDAVKASYYERVLFNHILSAY-DPKKG 390
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE---EGKYPGV 508
+ Y + PG Y + + SFWCC TG+ES +KLG IY + + +
Sbjct: 391 MCCYFTSMRPG-----HYRIYASRDSSFWCCGHTGLESPAKLGKFIYSHKATNRKEEKEI 445
Query: 509 YIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS 568
+ +I S L W G + + Q+ + + D RV LT + K L +R P W +
Sbjct: 446 RVNLFIPSVLTWHEGGVELVQR-NRLPDSD---RVELTMNLKKKQRLI-LWIRKPDW--A 498
Query: 569 NGAKATLNG--QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
+ A +NG + L L + G ++ + K W+ +++++QLP+ TE + A
Sbjct: 499 DKATLIINGKAEQLLLGNDGYWM-IDKVWNRKNRISLQLPMHTYTENLIGT----GRYVA 553
Query: 627 ILYGPYVLAGH 637
+LYGPYVLAG
Sbjct: 554 LLYGPYVLAGR 564
>gi|227509161|ref|ZP_03939210.1| possible acetyl-CoA carboxylase, biotin carboxylase [Lactobacillus
brevis subsp. gravesensis ATCC 27305]
gi|227191368|gb|EEI71435.1| possible acetyl-CoA carboxylase, biotin carboxylase [Lactobacillus
brevis subsp. gravesensis ATCC 27305]
Length = 606
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 129/372 (34%), Positives = 181/372 (48%), Gaps = 54/372 (14%)
Query: 298 LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVI 357
L E GGMND LY LF IT+D +HL A FD+ LA D + G H+NT IP ++
Sbjct: 2 LKVEYGGMNDALYHLFSITKDERHLTAATYFDEVELFKDLAAAKDVLPGKHANTTIPKLL 61
Query: 358 GSQMRYEV------TGDQLHKE--------------------GHQLESSGTNIGHFNFKS 391
G+ RYE+ G L+++ H ++G N +F
Sbjct: 62 GAIRRYEIFDDPQMAGQYLYEKDQKQLPIYLKAAENFWRIVINHHTYATGGNSQSEHF-H 120
Query: 392 DPKRLASNL----DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
DP +L + + T E+C T+NMLK+SR LFR T + Y DYY+R+ +N +LG Q
Sbjct: 121 DPNQLYHDAVIEDGATTCETCNTHNMLKLSRELFRVTGDKKYLDYYDRTYSNAILGSQ-N 179
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
+ G+M Y P+A G K + P D FWCC GTGIESF+KLGDS YF+E
Sbjct: 180 PKTGMMTYFQPMAAGYRKV-----FNRPYDEFWCCTGTGIESFTKLGDSYYFKEG---QT 231
Query: 508 VYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT---SLNLRIPT 564
+Y Y S++L + ++ +VD V V LT S T+ ++ R P
Sbjct: 232 LYATGYFSNQLSLPKENLKLDMQVDRKVG-----AVKLTVSKLIDNKTSEPLNVKFRHPD 286
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W S N + P F+ V K D + I L +TL + D++ +Y S+
Sbjct: 287 W-SHGRLSVKKNQKTQPNNETFGFVEVKKLVPG-DVIEINLSMTLTVGSTPDNQ-QYISL 343
Query: 625 QAILYGPYVLAG 636
+ YGPYVLAG
Sbjct: 344 K---YGPYVLAG 352
>gi|357472937|ref|XP_003606753.1| hypothetical protein MTR_4g065240 [Medicago truncatula]
gi|355507808|gb|AES88950.1| hypothetical protein MTR_4g065240 [Medicago truncatula]
Length = 184
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 87/183 (47%), Positives = 123/183 (67%), Gaps = 8/183 (4%)
Query: 11 FKFLLTFLLIVSAAQAKECTNAYPELASHTFRSNLLSSKNESYIKQIHSHNDHL--TPSD 68
F ++ L++ A +KEC N P+ SHT R+ L++SKNE++ K++ + H+ TPSD
Sbjct: 4 FVYVFLALILCGCANSKECINNLPQ--SHTLRTELMASKNETWKKEVMMYQSHVHVTPSD 61
Query: 69 DSAWLSLMPRKILREEEQDELFSWAMLYRKIKNPGQFKVPERSGEFLKEVSLHDVRLGSD 128
+SAW ++P+++ +E+ + + R++KN K P FLKEV L DVRL
Sbjct: 62 ESAWQEMIPKEMFLTQEKPNVIG-LLSNREMKNADVSKPPVG---FLKEVPLGDVRLLEG 117
Query: 129 SMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSA 188
S+H +AQ+TNLEYLLMLDVD+L+W+FRK A LP PG PYGGWE+P ELRGHFVG +SA
Sbjct: 118 SIHAQAQKTNLEYLLMLDVDRLIWSFRKMAGLPTPGAPYGGWEKPDQELRGHFVGCNVSA 177
Query: 189 SAL 191
+ L
Sbjct: 178 TLL 180
>gi|336397986|ref|ZP_08578786.1| protein of unknown function DUF1680 [Prevotella multisaccharivorax
DSM 17128]
gi|336067722|gb|EGN56356.1| protein of unknown function DUF1680 [Prevotella multisaccharivorax
DSM 17128]
Length = 943
Score = 169 bits (427), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 167/646 (25%), Positives = 269/646 (41%), Gaps = 131/646 (20%)
Query: 109 ERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYG 168
E E + L DV + D+ + + + DV + ++N+R T + G
Sbjct: 118 EEKKEIAQTFPLSDVTINGDNRLTHNRDEAIAAICSWDVTQQLYNYRDTYNMSTEGYKVA 177
Query: 169 -GWEEPSCELRGHFVGHYLSASALMWASTHNES----LKEKMSAVVSALSACQKEI---- 219
GW+ P +L+GH GHY+SA A +A T + LK+ ++ +V+ L ACQ++
Sbjct: 178 DGWDSPDTKLKGHGSGHYMSAIAQAYAVTKDPQQKAILKKNITRMVNELRACQEKTFVWN 237
Query: 220 --------------------------------------GSGYLSAFPTEQFDRLEALIP- 240
G GY++A P++ +E P
Sbjct: 238 DSLGRYWEARDFAPESELKNMKGTWAAFDEYKKHPEKYGYGYINAIPSQHCALIEMYRPY 297
Query: 241 -----VWAPYYTIHKILAGLLDQYTYADNAE--------ALRMTTWMVEYFYNRVQNVIK 287
VWAPYYTIHK LAGL+D T D+ E A M W+ + R
Sbjct: 298 NNSDWVWAPYYTIHKELAGLIDIATLFDDKEVAAKALLIAKDMGLWVWNRMHYRTYVKAD 357
Query: 288 KYSIERHWQTLNE----------EAGGMNDVLYKLFCI----TQDPKHLMLAHLFDKPCF 333
ER + N E GGM + L +L + T + L A FD P F
Sbjct: 358 GTQEERRAKPGNRYEMWDMYIAGEVGGMQESLSRLSEMVSNSTDKARLLEAAQCFDAPKF 417
Query: 334 LGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLH-----------KEGHQLESSGT 382
LA DDI H+N HIP+++G+ Y+ D +H +G + ++G
Sbjct: 418 YEPLAKNIDDIRTRHANQHIPMIVGALRSYKSNHD-IHYYNVADNFWHLVQGRYMYATG- 475
Query: 383 NIGHFNFKSDPK----RLASN--------LDSNTEESCTTYNMLKVSRHLFRWTKEIA-Y 429
+G+ P +A+N + N E+C TYN+LK+++ L + + A
Sbjct: 476 GVGNGEMFRQPYTQVLSMATNGMQEGEAMANPNLNETCCTYNLLKLTKDLNVYNPDDAEL 535
Query: 430 ADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIES 489
DYYER L N ++G +P A G + + + G + CC GTG E+
Sbjct: 536 MDYYERGLYNQIVG---SLDPDHYAVTYQYAVGLNATKPF---GNETPQSTCCGGTGSEN 589
Query: 490 FSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSS 549
+K + YF + +++ Y+ + L W+ I + Q +W P R + +
Sbjct: 590 HTKYQQAAYFHNDST---LWVCLYMPTTLQWRDKGITLEQD----CTW-PAQRSVIRL-T 640
Query: 550 KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVT-KTWSSDDKLTIQLPL 607
KG G T L LR+P W ++ G + LNG+ + P ++++++ W+ D+L I +P
Sbjct: 641 KGEGNFT-LKLRVPYW-ATRGFEILLNGKPVQHHYQPSSYVTISGHHWTVSDRLEIIMPF 698
Query: 608 TLRTEAIQDDRP-EYASIQAI----------LYGPYVLAGHSIGDW 642
+ E D P + AS I +YGP + G + W
Sbjct: 699 STHIEYGADKLPAKVASADGIPLKSAWTGVVMYGPLCMTGTNATTW 744
>gi|261879318|ref|ZP_06005745.1| acetyl-CoA carboxylase [Prevotella bergensis DSM 17361]
gi|270334148|gb|EFA44934.1| acetyl-CoA carboxylase [Prevotella bergensis DSM 17361]
Length = 839
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 150/568 (26%), Positives = 246/568 (43%), Gaps = 79/568 (13%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARL--------PAPGEP 166
L EV+L D L + A N++ L+ DVD+L+ F + A L +
Sbjct: 34 LDEVTLLDSPLKT------AMDLNIKMLMQYDVDRLLTPFIRQAGLHTGRYADWQSRHPN 87
Query: 167 YGGWEEPSCELRGHFVGHYLSASALMWASTHNES----LKEKMSAVVSALSACQKEIGS- 221
+ W + +L GH GHY+SA A+ +A+ H+ + +KE++ ++ L CQ +
Sbjct: 88 FMNWGGNNFDLSGHVGGHYVSALAMAYAACHDTATKARIKERLDYMIDVLKDCQDAYDTN 147
Query: 222 -----GYLSAFPTEQFDRLEALIPV--------WAPYYTIHKILAGLLDQYTYADNAEAL 268
G++ P + + W P+Y HK+LAGL D Y Y N A
Sbjct: 148 TEGLYGFIGGQPINDMWKKMYAGDISSFRQHRGWVPFYCQHKVLAGLRDAYLYTGNTTAR 207
Query: 269 RMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLF 328
+ + ++ N V N+ S L+ E GGMN+ L + + D K+L A +
Sbjct: 208 DLFRKLADWSVNLVSNL----SDATMQTVLDTEHGGMNETLADAYTLFGDSKYLAAARKY 263
Query: 329 DKPCFL-GLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLH------------KEGH 375
L G+ + H+NT +P IG + E +
Sbjct: 264 SHQTMLNGMQTPNPTFLDNRHANTQVPKYIGFERVAEEDPTATTYATAASNFWDDVAQNR 323
Query: 376 QLESSGTNIG-HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYE 434
+ G ++G HF + R +LD ESC T NM+K+S + T + YAD+YE
Sbjct: 324 TVCIGGNSVGEHFLSVGNSNRYIDHLDG--PESCNTNNMMKLSEMMADRTHDARYADFYE 381
Query: 435 RSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLG 494
++ N +L Q T G +Y L P + Y + ++ WCC GTG+E+ SK G
Sbjct: 382 YAMYNHILSTQDPTTGGY-VYFTTLRP-----QGYRIYSKVNEGMWCCVGTGMENHSKYG 435
Query: 495 DSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPY-LRVTLTFSSKGSG 553
+Y + VYI + +S+LD K ++ Q+ PY R +T G
Sbjct: 436 HFVYTHDADT--AVYINLFTASKLDNK--HFMLTQETAY-----PYEQRTKITVGKSG-- 484
Query: 554 LTTSLNLRIPTWTSSNGAKATLNGQDLP---LPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
T ++ +R P WT+++ ++NG P L ++ + + W + D +T+ LP++LR
Sbjct: 485 -TYTIAVRHPWWTTAD-YSISVNGTKQPLDVLQGQASYCRLKRAWKAGDVITVDLPMSLR 542
Query: 611 TEAIQDDRPEYASIQAILYGPYVLAGHS 638
P Y+ A YGP +L +
Sbjct: 543 VAEC----PNYSDYIAFEYGPVLLGAQT 566
>gi|340347550|ref|ZP_08670658.1| hypothetical protein HMPREF9136_1656 [Prevotella dentalis DSM 3688]
gi|339609246|gb|EGQ14121.1| hypothetical protein HMPREF9136_1656 [Prevotella dentalis DSM 3688]
Length = 1007
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 166/652 (25%), Positives = 276/652 (42%), Gaps = 137/652 (21%)
Query: 102 PGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLP 161
PGQ E SL DV L D+ + L + DV + ++N+R T L
Sbjct: 162 PGQ--------EMAHAFSLADVTLDGDNRLTHNRDEALREICSWDVSQQLYNYRDTYGLS 213
Query: 162 APGEPYG-GWEEPSCELRGHFVGHYLSASALMWASTHNES----LKEKMSAVVSALSACQ 216
G GW+ P +L+GH GHY+SA A +A T + L++ ++ +V+ L ACQ
Sbjct: 214 TDGYTRSDGWDSPDTKLKGHGSGHYMSAIAQAYAVTKDPRQKAILRKNITRMVNELRACQ 273
Query: 217 KEI------------------------------------------GSGYLSAFPTEQFDR 234
++ G GY++A P +
Sbjct: 274 EKTFVFDKALNRYWEARDFAPEEELRGLKGTWEAFDEYKKHPEKYGYGYINAIPAQHCAL 333
Query: 235 LEALIP------VWAPYYTIHKILAGLLDQYTYADNA----EALRMTTWMVEYFYNRV-- 282
+E VWAPYY++HK LAGL+D TY D+ +AL M + +NR+
Sbjct: 334 IEMYRAYNNSDWVWAPYYSVHKQLAGLIDIATYFDDKAICDKALLTAKDMGLWVWNRMHY 393
Query: 283 QNVIKKYSIERHWQT------------LNEEAGGMNDVLYKLFCITQDP----KHLMLAH 326
+ +K+ E ++ + E GGM++ L +L + DP K + A
Sbjct: 394 RTYVKEDGTEAERRSKPGNRYEMWDMYIAGEVGGMSESLARLSEMVSDPGEKAKLIEAAG 453
Query: 327 LFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK----------EGHQ 376
FD P F L+ DDI H+N HIP+++G+ Y+ + + +G
Sbjct: 454 CFDAPKFYNPLSKNVDDIRTRHANQHIPMIVGALRSYKTNKNPFYYHLSQNFWHLVQGRY 513
Query: 377 LESSGTNIGHFNFKSDPK----RLASN--------LDSNTEESCTTYNMLKVSRHLFRWT 424
+ ++G +G+ P +A+N + + E+C TYN+LK++ L +
Sbjct: 514 MYATG-GVGNGEMFRQPYTQILSMATNGMQEGERQANPDINETCCTYNLLKLTSDLNCYN 572
Query: 425 KEIA-YADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCY 483
+ A Y DYYER L N ++G P A G + + + G + CC
Sbjct: 573 PDDARYMDYYERGLYNQIVG---SLNPDKYETCYQYAVGLNATKPF---GNETPQSTCCG 626
Query: 484 GTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRV 543
GTG E+ +K + YF +++ Y+ + L WK+ + + Q+ +W P
Sbjct: 627 GTGSENHTKYQAAAYFANTHT---LWVGLYMPTTLHWKAKGLTIRQE----CAW-PAQHT 678
Query: 544 TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKT-WSSDDKL 601
+ ++G G T L LR+P W ++ G + +NG+ + L P +++++ KT W + D +
Sbjct: 679 AIQI-AEGKGEFT-LKLRVPYW-ATGGFEVKVNGKKVKQLFRPSSYVALEKTRWKAGDVV 735
Query: 602 TIQLPLTLRTE----------AIQDDRP-EYASIQAILYGPYVLAGHSIGDW 642
I +P T E A D P A + ++YGP + G W
Sbjct: 736 EIDMPFTKHIEYGADKLTSEVASMDGTPLRTAWVGTLMYGPLAMTGTGSAIW 787
>gi|433653573|ref|YP_007297427.1| hypothetical protein Prede_2696 [Prevotella dentalis DSM 3688]
gi|433304106|gb|AGB29921.1| hypothetical protein Prede_2696 [Prevotella dentalis DSM 3688]
Length = 986
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 166/652 (25%), Positives = 276/652 (42%), Gaps = 137/652 (21%)
Query: 102 PGQFKVPERSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLP 161
PGQ E SL DV L D+ + L + DV + ++N+R T L
Sbjct: 141 PGQ--------EMAHAFSLADVTLDGDNRLTHNRDEALREICSWDVSQQLYNYRDTYGLS 192
Query: 162 APGEPYG-GWEEPSCELRGHFVGHYLSASALMWASTHNES----LKEKMSAVVSALSACQ 216
G GW+ P +L+GH GHY+SA A +A T + L++ ++ +V+ L ACQ
Sbjct: 193 TDGYTRSDGWDSPDTKLKGHGSGHYMSAIAQAYAVTKDPRQKAILRKNITRMVNELRACQ 252
Query: 217 KEI------------------------------------------GSGYLSAFPTEQFDR 234
++ G GY++A P +
Sbjct: 253 EKTFVFDKALNRYWEARDFAPEEELRGLKGTWEAFDEYKKHPEKYGYGYINAIPAQHCAL 312
Query: 235 LEALIP------VWAPYYTIHKILAGLLDQYTYADNA----EALRMTTWMVEYFYNRV-- 282
+E VWAPYY++HK LAGL+D TY D+ +AL M + +NR+
Sbjct: 313 IEMYRAYNNSDWVWAPYYSVHKQLAGLIDIATYFDDKAICDKALLTAKDMGLWVWNRMHY 372
Query: 283 QNVIKKYSIERHWQT------------LNEEAGGMNDVLYKLFCITQDP----KHLMLAH 326
+ +K+ E ++ + E GGM++ L +L + DP K + A
Sbjct: 373 RTYVKEDGTEAERRSKPGNRYEMWDMYIAGEVGGMSESLARLSEMVSDPGEKAKLIEAAG 432
Query: 327 LFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHK----------EGHQ 376
FD P F L+ DDI H+N HIP+++G+ Y+ + + +G
Sbjct: 433 CFDAPKFYNPLSKNVDDIRTRHANQHIPMIVGALRSYKTNKNPFYYHLSQNFWHLVQGRY 492
Query: 377 LESSGTNIGHFNFKSDPK----RLASN--------LDSNTEESCTTYNMLKVSRHLFRWT 424
+ ++G +G+ P +A+N + + E+C TYN+LK++ L +
Sbjct: 493 MYATG-GVGNGEMFRQPYTQILSMATNGMQEGERQANPDINETCCTYNLLKLTSDLNCYN 551
Query: 425 KEIA-YADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCY 483
+ A Y DYYER L N ++G P A G + + + G + CC
Sbjct: 552 PDDARYMDYYERGLYNQIVG---SLNPDKYETCYQYAVGLNATKPF---GNETPQSTCCG 605
Query: 484 GTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRV 543
GTG E+ +K + YF +++ Y+ + L WK+ + + Q+ +W P
Sbjct: 606 GTGSENHTKYQAAAYFANTHT---LWVGLYMPTTLHWKAKGLTIRQE----CAW-PAQHT 657
Query: 544 TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKT-WSSDDKL 601
+ ++G G T L LR+P W ++ G + +NG+ + L P +++++ KT W + D +
Sbjct: 658 AIQI-AEGKGEFT-LKLRVPYW-ATGGFEVKVNGKKVKQLFRPSSYVALEKTRWKAGDVV 714
Query: 602 TIQLPLTLRTE----------AIQDDRP-EYASIQAILYGPYVLAGHSIGDW 642
I +P T E A D P A + ++YGP + G W
Sbjct: 715 EIDMPFTKHIEYGADKLTSEVASMDGTPLRTAWVGTLMYGPLAMTGTGSAIW 766
>gi|433651701|ref|YP_007278080.1| hypothetical protein Prede_0695 [Prevotella dentalis DSM 3688]
gi|433302234|gb|AGB28050.1| hypothetical protein Prede_0695 [Prevotella dentalis DSM 3688]
Length = 1032
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 164/584 (28%), Positives = 261/584 (44%), Gaps = 86/584 (14%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWE--E 172
L EV+L D + A + N + LL D D+L+ F + A L Y GW+
Sbjct: 27 LSEVTLFDSPFKT------AMELNFKVLLDYDADRLLAPFVRQAGLNTG--DYAGWQTLH 78
Query: 173 PSC--------ELRGHFVGHYLSASALMWASTHNES----LKEKMSAVVSALSACQK--- 217
P+ +L GH GHYLSA AL +A+ + LK+++ ++ L CQ
Sbjct: 79 PNFANWGGNGFDLSGHVGGHYLSALALAYAACRDAGMKARLKQRLEYMLKVLKDCQDAYD 138
Query: 218 ---EIGSGYLSAFP-TEQFDRLEA-------LIPVWAPYYTIHKILAGLLDQYTYADNAE 266
E G++ P E + +L A + W P+Y HK+LAGL D Y YA N E
Sbjct: 139 GNTEGLRGFIGGQPINEAWKKLYAGDVSGFRSVRGWVPFYCQHKVLAGLRDAYVYAGNKE 198
Query: 267 ALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAH 326
A M + ++ NV+ + L+ E GGMN+ L + + D K++ A
Sbjct: 199 AREMFRKLADWSV----NVVARLDNAAMQSVLDTEHGGMNESLADAYTLFGDQKYMDAAQ 254
Query: 327 LFDKPCFLGLLALQ-ADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLES------ 379
+ L + +Q A + H+NT +P IG + E G +L K+ ++L +
Sbjct: 255 KYSHQTMLNGMQMQNATFLDNRHANTQVPKYIGFERIGEQGGSELQKK-YELAAGNFWND 313
Query: 380 ---------SGTNIG-HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAY 429
G ++ HF ++ R +LD ESC + NMLK+S L T + Y
Sbjct: 314 VALNRTVCIGGNSVAEHFLSAANSHRYIDHLDG--PESCNSNNMLKLSEMLSDNTHDARY 371
Query: 430 ADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIES 489
AD+YE + N +L Q + G +Y L P + Y + + WCC GTG+E+
Sbjct: 372 ADFYEYTTWNHILSTQD-PKTGGYVYFTTLRP-----QGYRIYSQVNQGMWCCVGTGMEN 425
Query: 490 FSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSS 549
SK G +Y + +Y+ + +S+L + + + Q+ ++P R+T+
Sbjct: 426 HSKYGHFVYTHDGDSV--IYVNLFTASKL--ANAKFALTQQT--AYPYEPQTRITI---D 476
Query: 550 KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL---PSPGNFLSVTKTWSSDDKLTIQLP 606
KG T L +R P WT+ G +NG+ + P + +T+ W D +T+ LP
Sbjct: 477 KGGSYT--LAVRHPWWTTE-GYAILVNGEKQQVAVTPGKAGYARLTRKWKRGDVVTVALP 533
Query: 607 LTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATS 650
+ LRT P Y A YGP +LA + D T++ T+
Sbjct: 534 MQLRTVEC----PNYTDYVAFEYGPLLLAAQTTA-VDATDADTT 572
>gi|340345934|ref|ZP_08669064.1| acetyl-CoA carboxylase [Prevotella dentalis DSM 3688]
gi|339612921|gb|EGQ17717.1| acetyl-CoA carboxylase [Prevotella dentalis DSM 3688]
Length = 1039
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 164/584 (28%), Positives = 261/584 (44%), Gaps = 86/584 (14%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWE--E 172
L EV+L D + A + N + LL D D+L+ F + A L Y GW+
Sbjct: 34 LSEVTLFDSPFKT------AMELNFKVLLDYDADRLLAPFVRQAGLNTG--DYAGWQTLH 85
Query: 173 PSC--------ELRGHFVGHYLSASALMWASTHNES----LKEKMSAVVSALSACQK--- 217
P+ +L GH GHYLSA AL +A+ + LK+++ ++ L CQ
Sbjct: 86 PNFANWGGNGFDLSGHVGGHYLSALALAYAACRDAGMKARLKQRLEYMLKVLKDCQDAYD 145
Query: 218 ---EIGSGYLSAFP-TEQFDRLEA-------LIPVWAPYYTIHKILAGLLDQYTYADNAE 266
E G++ P E + +L A + W P+Y HK+LAGL D Y YA N E
Sbjct: 146 GNTEGLRGFIGGQPINEAWKKLYAGDVSGFRSVRGWVPFYCQHKVLAGLRDAYVYAGNKE 205
Query: 267 ALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAH 326
A M + ++ NV+ + L+ E GGMN+ L + + D K++ A
Sbjct: 206 AREMFRKLADWSV----NVVARLDNAAMQSVLDTEHGGMNESLADAYTLFGDQKYMDAAQ 261
Query: 327 LFDKPCFLGLLALQ-ADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLES------ 379
+ L + +Q A + H+NT +P IG + E G +L K+ ++L +
Sbjct: 262 KYSHQTMLNGMQMQNATFLDNRHANTQVPKYIGFERIGEQGGSELQKK-YELAAGNFWND 320
Query: 380 ---------SGTNIG-HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAY 429
G ++ HF ++ R +LD ESC + NMLK+S L T + Y
Sbjct: 321 VALNRTVCIGGNSVAEHFLSAANSHRYIDHLDG--PESCNSNNMLKLSEMLSDNTHDARY 378
Query: 430 ADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIES 489
AD+YE + N +L Q + G +Y L P + Y + + WCC GTG+E+
Sbjct: 379 ADFYEYTTWNHILSTQD-PKTGGYVYFTTLRP-----QGYRIYSQVNQGMWCCVGTGMEN 432
Query: 490 FSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSS 549
SK G +Y + +Y+ + +S+L + + + Q+ ++P R+T+
Sbjct: 433 HSKYGHFVYTHDGDSV--IYVNLFTASKL--ANAKFALTQQT--AYPYEPQTRITI---D 483
Query: 550 KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL---PSPGNFLSVTKTWSSDDKLTIQLP 606
KG T L +R P WT+ G +NG+ + P + +T+ W D +T+ LP
Sbjct: 484 KGGSYT--LAVRHPWWTTE-GYAILVNGEKQQVAVTPGKAGYARLTRKWKRGDVVTVALP 540
Query: 607 LTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATS 650
+ LRT P Y A YGP +LA + D T++ T+
Sbjct: 541 MQLRTVEC----PNYTDYVAFEYGPLLLAAQTTA-VDATDADTT 579
>gi|256831608|ref|YP_003160335.1| hypothetical protein Jden_0363 [Jonesia denitrificans DSM 20603]
gi|256685139|gb|ACV08032.1| protein of unknown function DUF1680 [Jonesia denitrificans DSM
20603]
Length = 744
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 147/536 (27%), Positives = 240/536 (44%), Gaps = 66/536 (12%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWA 194
+ T L+Y L LD +LV +R+ + LP YG WE + L GH +GH LSA L +A
Sbjct: 20 RNTALDYTLALDPQRLVAPYRRESGLPLLAPSYGNWE--NSGLDGHTLGHVLSA--LAYA 75
Query: 195 S-TH---NESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ--FDRLE---------ALI 239
S TH + +E++ +V+ + CQ +G+GY+ P + ++R+ L
Sbjct: 76 SVTHTPRSAEARERLEWLVAQVQECQAAVGTGYVGGIPQGRALWERIGNGDVDADSFGLH 135
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
W P+Y +HK+ AGL+D A A A + + ++ V + E+ L
Sbjct: 136 GAWVPWYNLHKVFAGLVDAGWVAGVAVARDVVVGLANWWLR----VAARLRDEQFQAMLV 191
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLF-DKPCFLGLLALQADDISGFHSNTHIPIVIG 358
E G +N L T D ++L +A F D+ F L+A + D + G H+NT I +G
Sbjct: 192 TEFGAINGAFADLAVHTGDARYLEMAKRFTDRALFDALVAGE-DPLVGLHANTQIAKALG 250
Query: 359 --------SQMRYEVTGDQLHK---EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEES 407
Y V ++ H L G ++ + DP A + ES
Sbjct: 251 WARVALAGGGREYLVAARRVWDVVVRDHTLSFGGNSV-REHCAGDP--WAPFVSEQGPES 307
Query: 408 CTTYNMLKVSRHLFRWTKEI-AYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK 465
C T+NML+++ L + D+ E +L N V+ P G +Y P P +
Sbjct: 308 CNTHNMLRLTGALLELGESPRPLVDFVEVALMNHVV---SSVHPEGGFVYFTPARPQHYR 364
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
S H + FWCC GTG+E K G+ +Y + G+++ ++S +W S +
Sbjct: 365 VYSQVH-----ECFWCCVGTGMEHLMKNGELVYSPDA---TGLFVHLGVASVGEWASRGV 416
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSP 585
V Q P D + V + +G G ++++R+P W T+ D + +
Sbjct: 417 RVRQ---PWTLDDAGITVGIDAVGQGEG-EFAIHVRVPGWVDG---PVTVRVNDAVISTR 469
Query: 586 ---GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
+++VT+ WS+ D+L + LP TLR + P + S Q GP+VLA +
Sbjct: 470 VEHSGYVTVTRVWSAGDRLDVSLPATLRLRPAPRNAP-FVSFQK---GPWVLAARA 521
>gi|297725075|ref|NP_001174901.1| Os06g0612950 [Oryza sativa Japonica Group]
gi|255677224|dbj|BAH93629.1| Os06g0612950 [Oryza sativa Japonica Group]
Length = 198
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 86/167 (51%), Positives = 106/167 (63%), Gaps = 14/167 (8%)
Query: 27 KECTNAYPELASHTFRSNLLSSKNESYI-KQIHSHNDHLTPSDDSAWLSLMPRKILREEE 85
KECTN +L+SHT R+ L SS + ++ + H DHL P+D++AW+ LMP E
Sbjct: 23 KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMPLAAASASE 82
Query: 86 QDELFSWAMLYRKIKNPG-----QFKVPERSGEFLKEVSLHDVRL----GSDSMHWRAQQ 136
F WAMLYR +K FL+EVSLHDVRL G D ++ RAQQ
Sbjct: 83 ----FDWAMLYRSLKGAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQQ 138
Query: 137 TNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVG 183
TNLEYLL+L+VD+LVW+FR A LPAPG+PYGGWE P ELRGHFVG
Sbjct: 139 TNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVG 185
>gi|345514178|ref|ZP_08793691.1| hypothetical protein BSEG_03388 [Bacteroides dorei 5_1_36/D4]
gi|229437170|gb|EEO47247.1| hypothetical protein BSEG_03388 [Bacteroides dorei 5_1_36/D4]
Length = 1118
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 159/636 (25%), Positives = 271/636 (42%), Gaps = 129/636 (20%)
Query: 118 VSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYG-GWEEPSCE 176
+ L++V++ ++ + ++ ++ DV + ++N+R T L G GW+ P +
Sbjct: 151 IPLNNVKIDGNNRLTSNRDLAIKEIISWDVSQQLYNYRDTYGLSTEGYTRSDGWDSPETK 210
Query: 177 LRGHFVGHYLSASALMWAS----THNESLKEKMSAVVSALSACQKEI------------- 219
L+GH GHY+SA AL +A+ +H E L+ ++ +V+ L CQ+
Sbjct: 211 LKGHGSGHYMSALALAYAAATNPSHKEILRRNITRMVNELRECQERTFVWSEELGRYLEA 270
Query: 220 -----------------------------GSGYLSAFPTEQFDRLEALIP------VWAP 244
G GYL+A P +E VWAP
Sbjct: 271 RDFAPEEELKKMKGTWEAFDEHKTKWATYGYGYLNAIPPHHPALIEMYRAYNNSDWVWAP 330
Query: 245 YYTIHKILAGLLDQYTYADNA----EALRMTTWMVEYFYNRV--QNVIKKYSIERHWQT- 297
YY+IHK LAGL+D TY D+ +AL + M + +NR+ + +KK + +T
Sbjct: 331 YYSIHKQLAGLIDIATYMDDKSIADKALLIAKDMGLWVWNRMHYRTYVKKDGTQEERRTR 390
Query: 298 -----------LNEEAGGMNDVLYKLFCITQDPKH----LMLAHLFDKPCFLGLLALQAD 342
+ E GGM + L +L + P+ + ++ FD P F L+ D
Sbjct: 391 PGNRYEMWNMYIAGEVGGMGESLARLSEMVSAPEEKARLIEASNCFDSPAFYEPLSKNID 450
Query: 343 DISGFHSNTHIPIVIGSQMRYEVTGDQLHK----------EGHQLESSGTNIGHFNFKSD 392
DI H+N HIP++IG+ Y D + +G S+G +G+
Sbjct: 451 DIRNRHANQHIPMIIGALRSYLSNNDTFYYHVSHNFWNLIQGRYRYSTG-GVGNGEMFRQ 509
Query: 393 P----KRLASNLDSNTE--------ESCTTYNMLKVSRHLFRWTKEIA-YADYYERSLTN 439
P +A N S E E+C TYN+LK+++ L + + A Y DYYER+L N
Sbjct: 510 PYTQIVSMAMNGVSEGESHSNPHINETCCTYNLLKLTKDLNCFNPDDARYMDYYERTLYN 569
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
++G E Y + +SK WG + CC GTG E+ K ++ YF
Sbjct: 570 QIIG-SLHPEHYQTTYQYAVGLNASKP-----WGNETPQSTCCGGTGSENHVKYQEATYF 623
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +++ Y+ + L W+ I + Q+ W P T+ ++ + ++
Sbjct: 624 VSDNT---LWVALYMPTTLHWEEKNITLQQE----CLW-PAKSSTIKVTAGEARF--AMK 673
Query: 560 LRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSV-TKTWSSDDKLTIQLPLTLRTEAIQDD 617
LR+P W +++G LNG + P ++ + + W +D + I +P T + D
Sbjct: 674 LRVPYW-ATDGFDVKLNGISIATHYQPCSYAVIPARQWKENDIVEITMPFTKHIDYGPDK 732
Query: 618 RP-----------EYASIQAILYGPYVLAGHSIGDW 642
P E A + ++YGP+ + I +W
Sbjct: 733 LPAKIASKDGHQLETAWVGTLMYGPFAMTATDITNW 768
>gi|150003704|ref|YP_001298448.1| hypothetical protein BVU_1135 [Bacteroides vulgatus ATCC 8482]
gi|149932128|gb|ABR38826.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 1116
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 158/636 (24%), Positives = 271/636 (42%), Gaps = 129/636 (20%)
Query: 118 VSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYG-GWEEPSCE 176
+ L++V++ ++ + ++ ++ DV + ++N+R T L G GW+ P +
Sbjct: 149 IPLNNVKINGNNRLTSNRDLAIKEIISWDVSQQLYNYRDTYGLSTEGYTRSDGWDSPETK 208
Query: 177 LRGHFVGHYLSASALMWAS----THNESLKEKMSAVVSALSACQKEI------------- 219
L+GH GHY+SA AL +A+ +H E L+ ++ +V+ L CQ+
Sbjct: 209 LKGHGSGHYMSALALAYAAATNPSHKEILRRNITRMVNELRECQERTFVWSEELGRYLEA 268
Query: 220 -----------------------------GSGYLSAFPTEQFDRLEALIP------VWAP 244
G GYL+A P +E VWAP
Sbjct: 269 RDFAPEEELKKMKGTWEAFDEHKTKWATYGYGYLNAIPPHHPALIEMYRAYNNSDWVWAP 328
Query: 245 YYTIHKILAGLLDQYTYADNA----EALRMTTWMVEYFYNRV--QNVIKKYSIERHWQT- 297
YY+IHK LAGL+D TY D+ +AL + M + +NR+ + +KK + +T
Sbjct: 329 YYSIHKQLAGLIDIATYMDDKSIADKALLIAKDMGLWVWNRMHYRTYVKKDGTQEERRTH 388
Query: 298 -----------LNEEAGGMNDVLYKLFCITQDPKH----LMLAHLFDKPCFLGLLALQAD 342
+ E GGM + L +L + P+ + ++ FD P F L+ D
Sbjct: 389 PGNRYEMWNMYIAGEVGGMGESLARLSEMVSAPEEKARLIEASNCFDSPAFYEPLSKNID 448
Query: 343 DISGFHSNTHIPIVIGSQMRYEVTGDQLHK----------EGHQLESSGTNIGHFNFKSD 392
DI H+N HIP++IG+ Y D + +G S+G +G+
Sbjct: 449 DIRNRHANQHIPMIIGALRSYLSNNDTFYYHVSHNFWNLIQGRYRYSTG-GVGNGEMFRQ 507
Query: 393 P----KRLASNLDSNTE--------ESCTTYNMLKVSRHLFRWTKEIA-YADYYERSLTN 439
P +A N S E E+C YN+LK+++ L + + A Y DYYER+L N
Sbjct: 508 PYTQIVSMAMNGVSEGESHSNPHINETCCAYNLLKLTKDLNCFNPDDARYMDYYERTLYN 567
Query: 440 GVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYF 499
++G E Y + +SK WG + CC GTG E+ K ++ YF
Sbjct: 568 QIIG-SLHPEHYQTTYQYAVGLNASKP-----WGNETPQSTCCGGTGSENHVKYQEATYF 621
Query: 500 EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
+ +++ Y+ + L W+ I + Q+ W P T+ ++ + ++
Sbjct: 622 VSDNT---LWVALYMPTTLHWEEKNITLQQE----CLW-PAKSSTIKVTAGEARF--AMK 671
Query: 560 LRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSV-TKTWSSDDKLTIQLPLTLRTEAIQDD 617
LR+P W +++G LNG + P ++ + T+ W +D + I +P T + D
Sbjct: 672 LRVPYW-ATDGFDVKLNGISIATHYQPCSYAVIPTRQWKENDIVEITMPFTKHIDYGPDK 730
Query: 618 RP-----------EYASIQAILYGPYVLAGHSIGDW 642
P E A + +++GP+ + I +W
Sbjct: 731 LPAEIASKDGHQLETAWVGTLMHGPFAMTATDITNW 766
>gi|297606173|ref|NP_001058068.2| Os06g0613000 [Oryza sativa Japonica Group]
gi|255677225|dbj|BAF19982.2| Os06g0613000, partial [Oryza sativa Japonica Group]
Length = 279
Score = 156 bits (394), Expect = 5e-35, Method: Composition-based stats.
Identities = 113/281 (40%), Positives = 149/281 (53%), Gaps = 45/281 (16%)
Query: 616 DDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITP------------------ 657
DDRPEY+SIQA+L+GP++LAG + G+ + S S S +TP
Sbjct: 4 DDRPEYSSIQAVLFGPHLLAGLTHGNQTVKTSNDSNSG-LTPGVWEVNATHAAAAVAVWV 62
Query: 658 --IPASYNSQLITFTQEYGNTK----FVLTNS--NQSITMEKFPKSGTDAALHATFRLIL 709
+ S NSQL+T TQ G+ + FVL+ S + ++TM++ P +G+DA +HATFR
Sbjct: 63 TPVSQSLNSQLVTLTQRDGDAQAAAAFVLSVSIADGALTMQESPVAGSDACVHATFRAYH 122
Query: 710 NDSSGSEFSSLNDFI-GKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAG 768
+ S S + + G+ V LEPFD PGM V D L V A + F+ VAG
Sbjct: 123 SPSGASAIDAATGRLQGRDVALEPFDRPGMAVT-----DALSVGRPGPA---TRFNAVAG 174
Query: 769 LDGGDRTVSLESETYKGCFV------YTA---VNLQSSESTKLGCISESTEAGFNNAASF 819
LDG TVSLE T GCFV Y A + + T G + + F AASF
Sbjct: 175 LDGLPGTVSLELATRPGCFVAAPTTAYLAGAKAQVSCRKPTAAGGGEDDDDTAFRRAASF 234
Query: 820 VIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYFD 860
L YHP+SF A G +RNFLL PL SL+DE YTVYF+
Sbjct: 235 TQAAPLRLYHPLSFSATGTDRNFLLEPLQSLQDEFYTVYFN 275
>gi|225351247|ref|ZP_03742270.1| hypothetical protein BIFPSEUDO_02839 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
gi|225158703|gb|EEG71945.1| hypothetical protein BIFPSEUDO_02839 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
Length = 853
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 201/812 (24%), Positives = 319/812 (39%), Gaps = 132/812 (16%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAP----GEP--- 166
L+ V L VRL H+ AQQ YLL LDVD+L++ FR+ A LP P G P
Sbjct: 5 ILERVPLQQVRL-LPGEHFDAQQAGARYLLDLDVDRLLYPFRREAGLPQPTDADGNPVTS 63
Query: 167 YGGWEEPSCELRGHFVGHYLSAS-ALMWASTHNESLKEKMSAVVSALSACQKEIGS---- 221
Y WEE L GH GHYLSA + + ++ + VV + CQ+
Sbjct: 64 YPNWEETG--LDGHIAGHYLSACVGFAQVADDPQPFIDRAATVVRSWHECQQSFAGDAVM 121
Query: 222 -GYLSAFPTEQ--FDRLEA---------LIPVWAPYYTIHKILAGLLDQYTYADNAEALR 269
GY+ P + F RL A + W P Y +HK AGLLD T+AD A
Sbjct: 122 RGYVGGVPDSRTVFGRLAAGDVESQNFSMNDAWVPMYNVHKTFAGLLD--TWADFASIDE 179
Query: 270 MTTWMVEY-------FYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHL 322
T+ + ++ R+ + + +R L E GGM + +L+ T + ++
Sbjct: 180 QTSQLARTVVLDLADWWCRIAEPLDDETFDR---ILVSEFGGMCESFAELYARTGEERYH 236
Query: 323 MLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQ------ 376
++A F LA D ++G H+NT IP V+G + + D+
Sbjct: 237 VMADRFKDHAIFDPLAQGEDVLTGMHANTQIPKVLGWERLGAICNDEQADAATNTFWDSV 296
Query: 377 LESSGTNIG------HFNFKSDPKRLASNLDSNT-EESCTTYNMLKVSRHLFRWTKEIAY 429
+ +IG HF+ D +S ++S E+C +YNM K++ L+ + Y
Sbjct: 297 VHHRSVSIGAHSVSEHFHPTDD---FSSMIESREGPETCNSYNMSKLAERLWLRSGSADY 353
Query: 430 ADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIES 489
++YER L N +L +PG +Y P+ + + Y + TP + FWCC G+G+E+
Sbjct: 354 INFYERVLENHLLSTINPKQPG-FVYFTPM-----RSQHYRAYSTPQECFWCCVGSGLEN 407
Query: 490 FSKLGDSIYF------------------------------EEEGKYPGVYIIQYISSRLD 519
++ G IY E + + + YI S D
Sbjct: 408 HARYGRLIYALQRPAAQDSADSAAAGFASSAAETGNTVSNNAEAEATRLLVNLYIDSTFD 467
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSK--------GSGLTTSLNLRIPTWTSSNGA 571
+ + Q+ + Y VT T S G T+L LR P W G
Sbjct: 468 CPEQGLRITQRAARIEDGVDYT-VTFTLESTAEHVPDTPGGLRETTLFLRRPWWAEHYGV 526
Query: 572 KATLNGQDLPLPS-----PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
P+ P +L + W+ ++ ++L + E + D P +
Sbjct: 527 MEATCAVCTLDPARTNDIPEGYLPLRLRWNGVAEVVMRLRPRITVERMPDGSPWV----S 582
Query: 627 ILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQS 686
+ GP V+A S D D + + + ++ I LI+ GN ++
Sbjct: 583 FMKGPKVMALAS--DSDDMDGEFADAGRMSHIATGPLRPLISMPIINGNPVKACAQVSR- 639
Query: 687 ITMEKFPKSGTDAALHATFRLILNDSSGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETD 746
+ T AA + R +L D EFSS++ SV L D + ++ +
Sbjct: 640 ----PYVHGLTVAATDVSGRTMLFDM--HEFSSMHG-CRYSVYLPVADDGNVCALRAQLA 692
Query: 747 D--------ELVVTDSFIA--QGSSVFHLVAG---LDGGDRTVSLESETYKGCFVYTAVN 793
D E V D+ Q S + H +G + G D T+ G F Y
Sbjct: 693 DIDARQAASEQTVVDTIACGQQQSEIDHRYSGDNDMMGADGTLHWRRALAGGEFQYAMRG 752
Query: 794 LQSSESTKLGCISESTEAGFNNAASFVIEKGL 825
+ ++ I++S E+ N A V+ GL
Sbjct: 753 RGQAHRLEIEVIADSAESDGENTAYEVMLDGL 784
>gi|82523843|emb|CAI78585.1| hypothetical protein [uncultured candidate division OP8 bacterium]
Length = 766
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 113/385 (29%), Positives = 170/385 (44%), Gaps = 81/385 (21%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAP--GEPYGGWE 171
L V L+ G +++ + + L L ++ D ++NFR LP P GGW+
Sbjct: 378 LLGRVVLNRDAAGRETLFMKNRDKFLSTLAEVNPDNFLYNFRDAFGLPQPEGAVQLGGWD 437
Query: 172 EPSCELRGHFVGHYLSASALMWA-STHNESLK----EKMSAVVSAL-------------- 212
+ + LRGH GHYLSA A +A S ++ +L+ +KM+ ++ L
Sbjct: 438 DQTTRLRGHASGHYLSALAQAYAGSVYDSALQANFLQKMNYMIDTLYDLAQKSGRPVESG 497
Query: 213 SAC---------------------QKEI-------GSGYLSAFPTEQFDRLE-------A 237
C QK + G G++SA+P +QF LE
Sbjct: 498 GLCNPDPTTVPSGPGKSGYDSDLSQKGLRHDYWNWGVGFISAYPPDQFIMLEQGATYGGT 557
Query: 238 LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQT 297
+WAPYYT+HKILAGLLD Y N +AL++ M + R+Q V + I +
Sbjct: 558 NAQIWAPYYTLHKILAGLLDCYEVGGNPKALQIAEGMGGWALKRLQAVPEATRIAMWSRY 617
Query: 298 LNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFL-------GLLALQADDISGFHSN 350
+ E GGMN+V+ +LF +T L A LFD F LA D + G H+N
Sbjct: 618 IAGEYGGMNEVMARLFRLTGKRDFLACAKLFDNTNFFFGNAGREHGLAKNVDTVRGRHAN 677
Query: 351 THIPIVIGSQMRYEVTGDQLHKE----------GHQLESSGTNIGHFN------FKSDPK 394
HIP +IG+ Y +G+ ++ E H + + G G N F ++P
Sbjct: 678 QHIPQIIGTLETYRGSGEPVYHEIAENFWEIARNHYMYNIGGVGGAKNPRNAECFTAEPD 737
Query: 395 RLASNLDS--NTEESCTTYNMLKVS 417
+N S E+C TYN+LK +
Sbjct: 738 TQFANGFSMDGQNETCATYNLLKCA 762
>gi|427409221|ref|ZP_18899423.1| hypothetical protein HMPREF9718_01897 [Sphingobium yanoikuyae ATCC
51230]
gi|425711354|gb|EKU74369.1| hypothetical protein HMPREF9718_01897 [Sphingobium yanoikuyae ATCC
51230]
Length = 616
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 145/591 (24%), Positives = 257/591 (43%), Gaps = 64/591 (10%)
Query: 110 RSGEFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGG 169
R E LKE V+L + + YL LD D+++ FR+ A LPAPG GG
Sbjct: 52 RGTEVLKEFPYGAVQLTGGVVKDHYDHIHAHYL-ALDNDRVLKVFRQQAGLPAPGPDMGG 110
Query: 170 WEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT 229
W + + G G Y+S A + A+T ++++ K++A+V + + Y
Sbjct: 111 WYDRDGFVPGLAFGQYMSGLARIGATTGDKAVHAKVAALVQGFGEFITKTRNPYAGPKAQ 170
Query: 230 EQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKY 289
+Q WA YT+ K + GL+D Y + +A + +E + + I
Sbjct: 171 DQ----------WAA-YTMDKYVVGLIDAYRLSGVEQAKTLLPITIE----KCRPYISPV 215
Query: 290 SIERHWQT--LNEEAGGMNDVLYKLFCITQDPKHLMLA--HLFDKPCFLGLLALQADDIS 345
S +R + +E +++ L+ + IT K+ +A +L +K F L A Q D +
Sbjct: 216 SRDRIGKVDPPYDETYVLSENLFHVADITGQDKYRQMAIHYLLNKEWFDPLAAGQ-DVLP 274
Query: 346 GFHSNTHIPIVIGSQMRYEVTGDQLHK----------EGHQLESSGTNIGHFNFKSDPKR 395
H+ +H + Y GD+ ++ E + S G + +
Sbjct: 275 TKHAYSHTIALSSGAQAYLHLGDEKYRKALVNAWTYMEPQRFASGGWGPEEQFVELHQGK 334
Query: 396 LASNLDSNT---EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV 452
LA++L S+ E C ++ +K++R+L R+T E Y D ER+L N +L + G
Sbjct: 335 LAASLKSSKAHFETPCGSFADMKLARYLVRFTGEPVYGDGLERTLYNTMLATRLPDSDGG 394
Query: 453 MIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
Y G++ E+ Y+H P CC GT ++ + ++YF ++ + +
Sbjct: 395 YPYYSNY--GAAAEKLYYHQKWP-----CCSGTLVQGVADYVLNLYFHDDN---ALVVNM 444
Query: 513 YISSRLDWK--SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+ S + W G + V Q+ + + LT ++ G+G ++ LRIP W + G
Sbjct: 445 FAPSTVKWDRPGGAVQVEQQTN----YPAEDTTRLTVTAPGNG-RFAMKLRIPAW--AKG 497
Query: 571 AKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
A+ +NG + PG + +TW + D + + LP LRT +I D P+ I A++ G
Sbjct: 498 AQLRVNGAAQGV-QPGTLAVIDRTWKAGDMVELTLPQALRTLSIDDKNPD---IAAVMRG 553
Query: 631 PYVLAGHSIGDW-DITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVL 680
+ G + W + + +L + P+P S + + E G V
Sbjct: 554 AVMYVG--LNPWTGVEDQPLALPASLKPVPGSS----LNYAMETGGRNLVF 598
>gi|302547294|ref|ZP_07299636.1| putative secreted protein [Streptomyces hygroscopicus ATCC 53653]
gi|302464912|gb|EFL28005.1| putative secreted protein [Streptomyces himastatinicus ATCC 53653]
Length = 740
Score = 133 bits (334), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 88/249 (35%), Positives = 127/249 (51%), Gaps = 28/249 (11%)
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG----TEPG 451
+A+ LD E+C TYNMLK+SR LF + AY DYYER LTN +L +R T P
Sbjct: 375 IAATLDGKNAETCATYNMLKLSRQLFFREPDAAYMDYYERGLTNHILASRRDAPSTTSPE 434
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
V Y + + PG +E Y + GT CC GTG+E+ +K DS+YF +Y+
Sbjct: 435 V-TYFVGMGPGVRRE--YDNTGT------CCGGTGMENHTKYQDSVYFRSADGT-ALYVN 484
Query: 512 QYISSRLDWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
++S L W V+ Q D P TLTF G L + LR+P W ++ G
Sbjct: 485 LALASTLRWPERGFVIEQTGDYPAEGVR-----TLTFREGGGRL--EVKLRVPAW-ATGG 536
Query: 571 AKATLNG-QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
T+NG + PG++L++++ W D++ I P LR E DD ++Q++ Y
Sbjct: 537 FTVTVNGVRQRGKAVPGSYLTLSRDWRRGDRIRISAPYRLRIERALDD----PAVQSVFY 592
Query: 630 GPYVLAGHS 638
GP +L S
Sbjct: 593 GPVLLVARS 601
>gi|237718517|ref|ZP_04548998.1| LOW QUALITY PROTEIN: acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
gi|229452224|gb|EEO58015.1| LOW QUALITY PROTEIN: acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
Length = 502
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 86/275 (31%), Positives = 135/275 (49%), Gaps = 20/275 (7%)
Query: 381 GTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG 440
G N +F D L+ D ESC TYNML+++ LFR YAD+YER+L N
Sbjct: 11 GGNSRREHFPDDTDYLSYVDDREGPESCNTYNMLRLTEGLFRMNPTADYADFYERALFNH 70
Query: 441 VLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFE 500
+L Q E G +Y P P Y + P+++ WCC GTG+E+ K G+ IY
Sbjct: 71 ILSTQH-PEHGGYVYFTPARPA-----HYRVYSAPNEAMWCCVGTGMENHGKYGEFIYAH 124
Query: 501 EEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNL 560
+Y+ +ISSRL+WK +I + Q S+ + LT ++K S L +
Sbjct: 125 TGD---SLYVNLFISSRLEWKKRRISLTQ----TTSFPNEGKTCLTITAKKS-TKFPLFV 176
Query: 561 RIPTWTSSNGAKATLNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
R P W T+NG+ + + N + ++ + W + D + +Q+P+ +R E ++ P
Sbjct: 177 RKPGWVGDGKVIITVNGKSIETTTAANSYYTINRKWKNGDVVEVQMPMNIRIEELK-HHP 235
Query: 620 EYASIQAILYGPYVLAGHSIGDWDITESATSLSDW 654
EY AI+ GP +L G ++G ++ S W
Sbjct: 236 EYI---AIMRGP-ILLGANVGKENLNGLVASDHRW 266
>gi|310794204|gb|EFQ29665.1| hypothetical protein GLRG_04809 [Glomerella graminicola M1.001]
Length = 436
Score = 125 bits (314), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 101/339 (29%), Positives = 146/339 (43%), Gaps = 46/339 (13%)
Query: 135 QQTNLEYLLMLDVDKLVWNFRKTARL-PAPGEPYGGWEEPSCELRGHFVGHYLSASALMW 193
Q L YL +DVD+L++ FRK L +P GW+ P R H GH+L+A A +
Sbjct: 59 QARTLVYLKWIDVDRLLYVFRKNHGLYTNNAQPNAGWDAPDFPFRSHVQGHFLNAWAFCY 118
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILA 253
A + K + + + L CQ T + PYY IHK +A
Sbjct: 119 AQLQDSECKRRATYFAAELKKCQHN---------NTNSRN---------VPYYAIHKTMA 160
Query: 254 GLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLF 313
GLLD + + A + M + R K + ++ + GGMN+VL L
Sbjct: 161 GLLDVWRLIGDTNARDVLLAMAAWVDLRT----GKLTYQQMQDMMGTVFGGMNEVLADLC 216
Query: 314 CITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPI-----VIGSQMRYEVTGD 368
T D + + +A FD LA D +SG H+NT + S Y + G+
Sbjct: 217 RQTGDQRWVTVAQRFDHAAIFNPLASNQDSLSGLHANTQDIARNAWNITVSAHSYAIGGN 276
Query: 369 QLHKEGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE-I 427
Q E HF P +A L S+T E+C TYNMLK++ L+ +
Sbjct: 277 S------QAE-------HFRL---PNAIAGFLTSDTCEACNTYNMLKLTGELWLTNPDTT 320
Query: 428 AYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK 465
Y D+YER+L N +LG Q + G + Y PL PG +
Sbjct: 321 TYFDFYERALLNHLLGQQDPSNSHGHVTYFTPLNPGGRR 359
>gi|94967351|ref|YP_589399.1| hypothetical protein Acid345_0320 [Candidatus Koribacter versatilis
Ellin345]
gi|94549401|gb|ABF39325.1| Protein of unknown function DUF1680 [Candidatus Koribacter
versatilis Ellin345]
Length = 607
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 125/536 (23%), Positives = 234/536 (43%), Gaps = 68/536 (12%)
Query: 138 NLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS----------CELRGHFVGHYLS 187
N + L LD D+L+ FR+ A LPAPGE GGW + + + GH +G Y+S
Sbjct: 58 NHAFFLKLDEDRLLKVFRQKAGLPAPGEDMGGWYDLTGFDLAKGDFHGFVPGHTLGQYVS 117
Query: 188 ASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYT 247
A A +A+T +E K K+ +V A + S + + + RL P YT
Sbjct: 118 ALARCYAATGSEETKAKVHRLVKGYGATLDDKAS-FFAGY------RL--------PAYT 162
Query: 248 IHKILAGLLDQYTYADNAEAL----RMTTWMVEYFYNRVQNVIKKYSIERHWQTLN-EEA 302
K+ GL+D + +A + +A+ ++T M++Y + + ++ + ++ +E+
Sbjct: 163 YDKLSCGLIDAHEFAHDPDAMAIHEKLTRGMLQYLPEKALSRAEQRARPHKDESFTWDES 222
Query: 303 GGMNDVLYKLFCITQDPKHLMLAHLF-DKPCFLGLLALQADDISGFHSNTHIPIVIGSQM 361
+ + L+ + T + + L F + + L+ + ++G H+ +H+ +
Sbjct: 223 YTLPENLFLAYRRTGNKFYRELGTRFLEDDTYFNPLSEGINVLAGEHAYSHMNAFCSAMQ 282
Query: 362 RYEVTGDQLHKEG----------HQLESSGTNIGHFNFKSDPKRLASNLD---SNTEESC 408
Y + H++ + G + + +L +L+ S+ E C
Sbjct: 283 AYLTLDSERHRKAARNGFRMVAEQSFATGGWGPSEAFVEFNKGQLGDSLEKSHSSFETPC 342
Query: 409 TTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERS 468
Y K++R+L + + Y D ER + N VLG + G Y A + ++
Sbjct: 343 GAYAHFKLTRYLLQTDGDSTYGDSMERVMYNTVLGAKPIQPDGTSFYYSDYA--TVGKKV 400
Query: 469 YHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS--GQIV 526
YH +D + CC GT + + SIY + GV + ++ S L WK+ G
Sbjct: 401 YH-----NDKWPCCSGTLPQVAADYHISIYLKATD---GVCVNLFVPSTLIWKASDGSCK 452
Query: 527 VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-P 585
+ Q+ +R T + +L +RIP W +S A +NGQ + + P
Sbjct: 453 LTQETKYPFETSVAMRFATT-----QPVEQTLYIRIPAWVTSEPA-LRVNGQRTDVAAKP 506
Query: 586 GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGD 641
G F ++ +TW D++ + LP+ + + ++ + A+++GP VL +IGD
Sbjct: 507 GAFAAIRRTWKDGDRIDLDLPMGFELQPVDG---QHEKLVALVHGPLVL--FAIGD 557
>gi|94967195|ref|YP_589243.1| hypothetical protein Acid345_0164 [Candidatus Koribacter versatilis
Ellin345]
gi|94549245|gb|ABF39169.1| conserved hypothetical protein [Candidatus Koribacter versatilis
Ellin345]
Length = 602
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 129/555 (23%), Positives = 231/555 (41%), Gaps = 78/555 (14%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWE--E 172
L E DV L S+ +H R Q + L+ L+ D L+ FR P PG GGW +
Sbjct: 37 LDEFGYGDVSLESE-LHNRQFQNTHDVLMGLEDDALLKPFRAMVGQPPPGRDLGGWYCFD 95
Query: 173 PSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQF 232
P+ VG +A+ W S + S + V + + +S +F
Sbjct: 96 PNYNPNDVGVGFAPTATFGQWISALSRSYALRPDPAVRDKVIRLNRLYAQTISP----EF 151
Query: 233 DRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIE 292
L+ P Y K++ GL+D + Y + +AL++ +E + ++ +++E
Sbjct: 152 YGLKNRFPA----YCYDKLVCGLIDAHQYVGDPDALKI----LERTTDTATPLLPGHAVE 203
Query: 293 RH--WQTLNE------EAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDI 344
W+++ + E+ +++ L+ + ++ L + + LA D+
Sbjct: 204 HGTVWRSVKDDGYTWDESYTISENLFLAYRRGAGDRYRALGKQYLDDTYYNPLAEGRSDL 263
Query: 345 SGFHSNTHIPIVIGSQMRYEVTGDQLH----KEGHQL------ESSGTNIGHFNFKSDPK 394
G H+ +H+ + + Y GD+ + K G + G +
Sbjct: 264 EGRHAYSHVNSLCSAMQAYLTLGDEKYFRAAKNGFDFVLAQSYATGGWGADETLRAPNSP 323
Query: 395 RLASNLDS---NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG 451
+A +L + E C +Y K++R+L R T++ Y D ER + N +LG
Sbjct: 324 EVAKSLTGTHHSFETPCGSYAHFKLTRYLLRVTRDSRYGDSMERVMYNTILGA------- 376
Query: 452 VMIYLLPLAPGSS---------KERSYHHWGTPSDSFW-CCYGTGIESFSKLGDSIYFEE 501
LPL P K ++H D+ W CC GT + + G S Y +
Sbjct: 377 -----LPLMPDGRTFYYSDYNFKGSKFYH-----DARWPCCSGTMPQIATDYGISTYLRD 426
Query: 502 EGKYPGVYIIQYISSRLDWK--SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
G+Y+ YI S + W+ Q+ + QK +DP + + L+ + + ++
Sbjct: 427 PQ---GIYVNLYIPSTVRWQQDGAQVSLTQKT--AYPFDPVVEIELSTTKQRE---FEVH 478
Query: 560 LRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
LRIP W A +NG+ +P F ++ +TW + D++ ++LPL R E + +R
Sbjct: 479 LRIPAWAEQ--ASIEVNGKREGVPVAERFATIRRTWKNGDRIQLELPLKNRLEPLNRER- 535
Query: 620 EYASIQAILYGPYVL 634
A + A+L GP VL
Sbjct: 536 --AKLVALLNGPLVL 548
>gi|336429869|ref|ZP_08609826.1| hypothetical protein HMPREF0994_05832 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336001322|gb|EGN31460.1| hypothetical protein HMPREF0994_05832 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 606
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 143/604 (23%), Positives = 235/604 (38%), Gaps = 111/604 (18%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
LK+ +V L +S+ R ++ E L + D L++ FR A L APGE GW
Sbjct: 4 LKDFRYRNVEL-KNSLWERQRRETAETYLAIPNDSLLYYFRTLAGLEAPGEGLTGWYGNG 62
Query: 175 CELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
G L A A ++A T + LKEK + C +A + FD
Sbjct: 63 AST----FGQKLGAFAKLYAVTGDYRLKEKAVYLAEGWGKC---------AAANKKVFDC 109
Query: 235 LEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIER- 293
+ Y K+L G LD Y + L + + + R + I + ++
Sbjct: 110 NDT--------YVYEKLLGGFLDMYENLGYEKGLAYCSGLTDSAAARFKRDIPRDGLQGP 161
Query: 294 --------HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDIS 345
W TL E LY+ + +T + K+L A +D L + I
Sbjct: 162 ELCENNMIEWYTLPEN-------LYRAYQLTGEQKYLDFAQEWDYTYLWDKLNNKDSAIG 214
Query: 346 GFHSNTHIPIVIGSQMRYEVTGDQLH-----------KEGHQLESSGTNIGHFNFKS--- 391
H+ + + + + M YEVTG + + E H + G F
Sbjct: 215 PRHAYSQVNSLSSAAMAYEVTGKKYYLDAIENGYTEITERHTYATGGYGPAECLFAEEEG 274
Query: 392 ----------DPKR-----------LASNLDS--NTEESCTTYNMLKVSRHLFRWTKEIA 428
DP R L D+ + E SC + + K+ +L R T +
Sbjct: 275 FLGEMLKDSWDPTRKSPVYRNFGGGLVGRNDNWGSCEVSCCAWAVFKICNYLLRITGKAK 334
Query: 429 YADYYERSLTNGVLGIQRGTEPG-VMIYLLPLAPGSSKE-RSYHHWGTPSDSFW-CCYGT 485
Y + E+ L NGV G G VM Y G+ K + G ++ W CC GT
Sbjct: 335 YGAWAEQMLINGVAGQPPIDSQGHVMYYADYFVDGAVKSVQDRRLQGNGANFEWQCCTGT 394
Query: 486 GIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK-SGQIVV-----NQKVDPVVSWDP 539
+ ++ + +Y+ +E G+Y+ QY+ SR ++ G+ V + V P+ +
Sbjct: 395 FPQDVAEYANMLYYTDE---EGIYVSQYMKSRAEFTIRGEKAVLENCSEEDVSPIRRFRI 451
Query: 540 YLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSD 598
R L F ++ RIP W + +NG+D L P P ++ + + W D
Sbjct: 452 QTRGELPF---------RISFRIPHWAKGEN-RILVNGEDSGLEPLPDSWAVLERVWQED 501
Query: 599 DKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSI----GDWDITESATSLSDW 654
D +T+ P +L + + + + I A+++GP VLA + GD + E +W
Sbjct: 502 DVITVTCPFSLAFKPVDEKNKD---IAALMFGPVVLAADKMTLFDGDMEKPE------EW 552
Query: 655 ITPI 658
IT +
Sbjct: 553 ITCV 556
>gi|336425065|ref|ZP_08605095.1| hypothetical protein HMPREF0994_01101 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336012974|gb|EGN42863.1| hypothetical protein HMPREF0994_01101 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 575
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 141/583 (24%), Positives = 230/583 (39%), Gaps = 95/583 (16%)
Query: 115 LKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPS 174
KEV+L ++ M + L + L + D ++ R++A PAPG Y GW S
Sbjct: 6 FKEVTL------NEGMMKKVLDETLAFYLKIPNDNILKYMRESAGKPAPGIFYTGWYPNS 59
Query: 175 CELRG-HFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD 233
RG +G +LSA + M+A + +E+ ++K + C Y SA T F
Sbjct: 60 ---RGIALIGQWLSAYSRMYAISGDEAFRQKAVYLADEFWDC-------YESAQHTAPFL 109
Query: 234 RLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRV--QNVIKKYSI 291
+ +Y + K+L D + Y A +++++ + + +N+ S
Sbjct: 110 TSRS-------HYDVEKLLRAHCDLFLYCKYPCAKERAGYLIDFAADNLTAENIFGDNST 162
Query: 292 ERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISG----- 346
E W TL E + F I + P+ +A F+ F L AD S
Sbjct: 163 E--WYTLAES-------FWDAFEILEIPRAQQMAERFEYREFWDLFYKDADPFSKRPQAG 213
Query: 347 -----FHSNTHIPIVIGSQMRYEVTGDQLH-------------KEGHQLESSGTNIGHFN 388
H+ +H+ YE+T +E G N H
Sbjct: 214 LYSEFCHAYSHVNSFNSCAKAYEMTKSPYFLKSLRSFYRFMQTEEVMATGGYGPNYEHLM 273
Query: 389 FKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGT 448
K+ + E C TY ++ ++L R+T E Y ++ E L N T
Sbjct: 274 PKNRIIDALRTGHDSFETQCDTYAAFRLCKYLTRFTDEPEYGNWVESLLYNAAAATIPMT 333
Query: 449 EPGVMIYL--LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
E G +IY + G K R D + CC GT +++ IYFE +G+
Sbjct: 334 EEGNIIYYSDYNMYAGYKKNR--------QDGWTCCTGTRPLLVAEIQRLIYFEGDGE-- 383
Query: 507 GVYIIQYISSRLDW--KSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPT 564
+YI QYI S L W I + Q+ + L ++L+ S+ ++ R+P
Sbjct: 384 -LYISQYIPSTLHWNRNGNDISIRQETGFPEGKETTLILSLSCSA-----AFPIHFRLPG 437
Query: 565 WTSSNGAKATLNGQDLPLPS---PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
W S + ++ ++PLP+ +L++ W D+LTI LP + ++ P
Sbjct: 438 WLS---GEMKVSCNNVPLPATVDKNGWLTIHSEWKEGDRLTISLPAEVWMHSLD---PVK 491
Query: 622 ASIQAILYGPYVLAGHSIG-----DWDITESATSLSDWITPIP 659
A LYGP VLA G DW SL++ + P+P
Sbjct: 492 NGPNAFLYGPVVLAADYSGIQTPNDW---MDVQSLTEKMKPVP 531
>gi|256375993|ref|YP_003099653.1| hypothetical protein Amir_1859 [Actinosynnema mirum DSM 43827]
gi|255920296|gb|ACU35807.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 736
Score = 109 bits (272), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 87/301 (28%), Positives = 134/301 (44%), Gaps = 55/301 (18%)
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGS 359
+EAG L L T P+HL A +FD + A D ++G H+N HIPI G
Sbjct: 273 DEAG---PALRDLRARTGKPEHLAPARMFDLDALIDACAENRDVLAGLHANQHIPIFTGL 329
Query: 360 QMRYEVTGDQLHKEGHQ-----------LESSGTNIGHFNFKSDPKRLASNLDSNTEESC 408
E TG+Q + + + GT+ G F +++ P +A L + E+C
Sbjct: 330 VRLREATGEQRYLDAARNFWDMVVPRRLYRIGGTSTGEF-WRA-PGVIAETLADDNAETC 387
Query: 409 TTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG---VMIYLLPLAPGSSK 465
+NMLK+ R LF N +LG ++ +M Y + LAPGS +
Sbjct: 388 CAHNMLKLGRALF-----------------NQILGSKQDAPSADVPLMTYFIGLAPGSVR 430
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
+ TP CC GTG+ES +K DS+YF +E +Y+ + + W I
Sbjct: 431 DF------TPEQGATCCEGTGLESAAKYQDSVYFHDEKT---LYVNLFAPTTAHWNETTI 481
Query: 526 VVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSP 585
P+ R T + G G ++ +R+P+W + GA A+LNG+ L +P+
Sbjct: 482 TRGAHF-------PHERGT-SPGIGGKGGRVTIKVRVPSW--ARGASASLNGRPLAVPAA 531
Query: 586 G 586
G
Sbjct: 532 G 532
>gi|224072775|ref|XP_002303875.1| predicted protein [Populus trichocarpa]
gi|222841307|gb|EEE78854.1| predicted protein [Populus trichocarpa]
Length = 103
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/131 (47%), Positives = 75/131 (57%), Gaps = 31/131 (23%)
Query: 560 LRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
+RIPTWT GA+ +N TW Q+P + DDRP
Sbjct: 1 MRIPTWTHLEGAETVIND---------------STW--------QIPAS-------DDRP 30
Query: 620 EYASIQAILYGPYVLAGHSIGDWDITE-SATSLSDWITPIPASYNSQLITFTQEYGNTKF 678
EYASIQAILYGPY+ AGH+ DWDI SA SLS+W TPIPA+YN L+TF+Q+ N F
Sbjct: 31 EYASIQAILYGPYLFAGHTTADWDIKNVSADSLSEWSTPIPAAYNDHLVTFSQKSRNPTF 90
Query: 679 VLTNSNQSITM 689
L NSN IT+
Sbjct: 91 FLINSNHIITV 101
>gi|413954826|gb|AFW87475.1| hypothetical protein ZEAMMB73_309562 [Zea mays]
Length = 161
Score = 106 bits (264), Expect = 6e-20, Method: Composition-based stats.
Identities = 72/182 (39%), Positives = 98/182 (53%), Gaps = 32/182 (17%)
Query: 689 MEKFPKSG--TDAALHATFRLILNDSSGSEFSSLNDFIGKSVMLEPFDSPGMLVIQHETD 746
M + PK G T+AA+HATFRL+ +G+ + MLEP D PGM+V
Sbjct: 1 MLQRPKDGGGTEAAVHATFRLVPQGGAGAG---------AAAMLEPLDMPGMVVT----- 46
Query: 747 DELVVTDSFIAQGSS--VFHLVAGLDGGDRTVSLESETYKGCFVYTAVNLQSSESTKLGC 804
D L V A+ SS F++V GL G +VSLE + GCF+ + E ++GC
Sbjct: 47 DRLTVA----AEKSSGAAFNVVPGLAGAPGSVSLELASRPGCFL-----VGGGEKVQVGC 97
Query: 805 ISESTE-----AGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLLSLRDESYTVYF 859
+ + A F +ASF + L YHP+SF A+G R+FLL PL +LRDE YTVYF
Sbjct: 98 AGGAQQKRGDGAWFRRSASFARGEPLRRYHPMSFAARGVRRSFLLEPLFTLRDEFYTVYF 157
Query: 860 DF 861
+
Sbjct: 158 NL 159
>gi|225874351|ref|YP_002755810.1| hypothetical protein ACP_2792 [Acidobacterium capsulatum ATCC
51196]
gi|225791337|gb|ACO31427.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
51196]
Length = 611
Score = 106 bits (264), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 124/537 (23%), Positives = 216/537 (40%), Gaps = 80/537 (14%)
Query: 136 QTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCE----------LRGHFVGHY 185
Q N + L LD D L+ FR+ A LPAPG GGW S E + GH G Y
Sbjct: 62 QANHAFFLALDEDALLKPFRERAGLPAPGPQMGGWYNFSKEFDPPNNMTGYIPGHSFGQY 121
Query: 186 LSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPY 245
LS A +A+T ++ K K+ +V + + + + +P P
Sbjct: 122 LSGLARAYAATGDQPTKAKVHRLVRGFA---EAVSPKFYDDYPL--------------PC 164
Query: 246 YTIHKILAGLLDQYTYADNAEALR--------MTTWMVEYFYNRVQNVIKKY-SIERHWQ 296
YT K GL+D + +A + AL + ++ + R + + + +I W
Sbjct: 165 YTFDKSNCGLIDAHQFAGDPNALHALSRALDAVMPYLPSHALTRPEMAARPHPNIAFTW- 223
Query: 297 TLNEEAGGMNDVLYKLFCITQDPKHLMLAHLF--DKPCFLGLLALQADDISGFHSNTHIP 354
+E+ + + + + + D K+L++A F DK + LA + + H+ +H+
Sbjct: 224 ---DESYTLPENFFLAYKRSGDEKYLVMAQRFLQDK-SYFDPLAEGDNVLPHQHAYSHVN 279
Query: 355 IVIGSQMRYEVTGDQLH----KEGHQ------LESSGTNIGHFNFKSDPKRLASNL---D 401
+ + Y V G + H + G Q + G + L +L
Sbjct: 280 ALNSASQAYLVLGSEKHLRAARNGFQFVLDQSFATGGWGPNETFVEPGSGGLYKSLTETH 339
Query: 402 SNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAP 461
++ E C Y KV+R+L R T + Y D E+ L N +LG + G Y
Sbjct: 340 ASFETPCGAYGHFKVTRYLMRITGDSRYGDSMEQVLYNTILGAMPLEQGGFSFYYSDYNN 399
Query: 462 GSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK 521
++K W CC GT + + G S YF G+Y+ ++ SR ++
Sbjct: 400 YAAKNYYPEQWP-------CCSGTFPQVTADYGISSYFHSP---EGLYVNLFVPSRAKFQ 449
Query: 522 SG--QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT-SLNLRIPTWTSSNGAKATLNGQ 578
G + + Q+ D ++V +G T S+ LR+P W + G T+NG+
Sbjct: 450 IGGARFSLEQRTHYPYENDIAMQV------RGDNPQTFSIALRVPAW-AGKGTSITVNGR 502
Query: 579 DLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
PG F+ + + W D++ + L + + P+ ++++ GP L
Sbjct: 503 KAEAEVKPGTFVRLHREWKDGDRIEYSIDRPLSLQPVDAQHPDTVALRS---GPLAL 556
>gi|557474|gb|AAA50392.1| ORF1, partial [Bacteroides ovatus]
Length = 436
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 102/211 (48%), Gaps = 21/211 (9%)
Query: 429 YADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIE 488
Y +YYER+L N +L Q + G +Y P+ PG Y + P S WCC G+G+E
Sbjct: 4 YVNYYERALYNHILASQE-PDKGGFVYFTPMRPGH-----YRVYSQPETSMWCCVGSGLE 57
Query: 489 SFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFS 548
+ +K G+ IY + +Y+ +I S+L WK I++ Q+ LR+
Sbjct: 58 NHTKYGEFIYAYRKDT---LYVNLFIPSQLTWKEQGIILTQETRFPDDGKVTLRINEAPK 114
Query: 549 SKGSGLTTSLNLRIPTWTS-SNGAKATLNGQD--LPLPSPGNFLSVTKTWSSDDKLTIQL 605
K +L +RIP W + S G ++NG+ +P +L +++ W D +T L
Sbjct: 115 KK-----RTLMIRIPEWANQSKGYSVSINGKRKMFVMPKGNQYLPLSRKWEKGDVITFHL 169
Query: 606 PLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
P+ + E I D + Y A LYGP VLA
Sbjct: 170 PMKVSVEQIPDKKDYY----AFLYGPIVLAA 196
>gi|224072771|ref|XP_002303873.1| predicted protein [Populus trichocarpa]
gi|222841305|gb|EEE78852.1| predicted protein [Populus trichocarpa]
Length = 103
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 60/131 (45%), Positives = 73/131 (55%), Gaps = 31/131 (23%)
Query: 560 LRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
+RIPTWT GA+ +N TW Q+P + DDRP
Sbjct: 1 MRIPTWTHLEGAETVIND---------------STW--------QIPAS-------DDRP 30
Query: 620 EYASIQAILYGPYVLAGHSIGDWDITE-SATSLSDWITPIPASYNSQLITFTQEYGNTKF 678
EYASIQAILYGP + AGH+ DWDI SA SL +W TPIPA+YN L+TF+Q+ N F
Sbjct: 31 EYASIQAILYGPSLFAGHTTADWDIKNVSADSLPEWSTPIPAAYNDHLVTFSQKSRNPNF 90
Query: 679 VLTNSNQSITM 689
L NSN IT+
Sbjct: 91 FLINSNHIITV 101
>gi|255624614|ref|XP_002540501.1| hypothetical protein RCOM_2107350 [Ricinus communis]
gi|223495313|gb|EEF21882.1| hypothetical protein RCOM_2107350 [Ricinus communis]
Length = 208
Score = 102 bits (254), Expect = 9e-19, Method: Composition-based stats.
Identities = 66/195 (33%), Positives = 99/195 (50%), Gaps = 15/195 (7%)
Query: 183 GHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQF-------DRL 235
GHYLSA A+M A+T +E ++E++ VV+ L CQ G+GY+ P +L
Sbjct: 3 GHYLSALAMMVAATGDEQVRERLDYVVAELKRCQAANGNGYIGGVPGGAAAWRDIAQGKL 62
Query: 236 EA----LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSI 291
A + W P+Y +HK AGL D YTYA N +A M + ++ ++ S
Sbjct: 63 HADNFSVNGKWVPWYNLHKTFAGLRDAYTYAGNQDAHAMLIALCDWTLELTSHL----SD 118
Query: 292 ERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNT 351
E+ + E GGMN+VL + +T K++ LA F L L D ++G H+NT
Sbjct: 119 EQMQSMMRAEHGGMNEVLADVAQMTGQQKYMDLAIRFSHQALLRPLEEGKDQLTGLHANT 178
Query: 352 HIPIVIGSQMRYEVT 366
IP VIG + ++T
Sbjct: 179 QIPKVIGFKRIGDIT 193
>gi|229818564|ref|YP_002880090.1| hypothetical protein Bcav_0062 [Beutenbergia cavernae DSM 12333]
gi|229564477|gb|ACQ78328.1| protein of unknown function DUF1680 [Beutenbergia cavernae DSM
12333]
Length = 596
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 123/555 (22%), Positives = 214/555 (38%), Gaps = 108/555 (19%)
Query: 140 EYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNE 199
E L + D +V FR A LPAPG P GW + + G ++S A + +
Sbjct: 42 ETYLGMSPDDVVHGFRLQAGLPAPGNPMTGWSSRTSQ---PTFGQWVSGLARLGVTAGVA 98
Query: 200 SLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQY 259
++ +V A +A + G + Y K++ GL D
Sbjct: 99 EASQRAVDLVDAFAATVGDDGDARMG-------------------LYGYEKLVCGLADTA 139
Query: 260 TYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDP 319
YA + +AL + E+ + + R + N+ AGG ++ +
Sbjct: 140 LYAGHEDALALLGRTAEW-------ASRTFERARPAASPNDFAGG------RIGPASH-- 184
Query: 320 KHLMLAHLFDKPCFLGLLALQADDISGF-----------------------------HSN 350
M + F + + G LA D + F H+
Sbjct: 185 ARTMEWYTFAENLYRGWLAGADDAVREFASEWHYDAYWDRFLTPPPPGQPWDVPTWLHAY 244
Query: 351 THIPIVIGSQMRYEVTGD----QLHKEGHQL-------ESSGTNIGHFNFKSDPKRLASN 399
+H+ + YEVTG+ + + H + G D L +
Sbjct: 245 SHVNTFASAAAAYEVTGEVRYLDILRNAHTYLTTTQTYATGGYGPSELTLPED-GSLGRS 303
Query: 400 LDSNTEES---CTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYL 456
++ T+ + C ++ K+S L + T E YAD+ E+ + +G+ + G Y
Sbjct: 304 IEWRTDTAEIVCGSWAAFKLSSALLKHTGEARYADWVEQLVYSGIGAVTPVRPGGRTPYY 363
Query: 457 LPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
L G + + HW D + CC GT +++ S L D +YF ++ G+ + Y+ S
Sbjct: 364 QDLRLGIATK--LPHW----DDWPCCSGTYLQAVSHLPDLVYFGDDDG--GLAVALYVPS 415
Query: 517 RLDWKSG--QIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ W+S + + Q+ PV T T + GSG L LR+P W S G +
Sbjct: 416 TVSWESAGSTVTLTQRTAFPVED-------TSTITVGGSG-RFRLRLRVPPW--SEGFRV 465
Query: 574 TLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
++NG + + +PG++ + + W+ D +T+ L LR + P A +GP
Sbjct: 466 SVNGVAVDGVATPGDWFVLERDWADGDVVTVTLGAGLRVLPVDRWHPNRV---AFAHGPV 522
Query: 633 VLAGHSIGDWDITES 647
VLA ++ DW + S
Sbjct: 523 VLAQNA--DWTMPMS 535
>gi|380482670|emb|CCF41095.1| secreted protein [Colletotrichum higginsianum]
Length = 246
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/233 (31%), Positives = 108/233 (46%), Gaps = 56/233 (24%)
Query: 413 MLKVSRHLFRWT--KEIAYADYYERSLTNGVLGIQRGTEP-GVMIYLLPLAPGSSK---- 465
MLK++R L+ + AY D+YER+L N +LG Q ++ G + Y PL PG +
Sbjct: 1 MLKLTRELWLTSPGTTTAYFDFYERALLNHLLGQQDPSDDHGHVTYFTPLNPGGRRGVGP 60
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQI 525
W T DSFWCC GTG+E+ +KL DSIYF + +Y+ +I S L+W +
Sbjct: 61 AWGGGTWSTDYDSFWCCQGTGLETNTKLTDSIYFYDASA---LYVNLFIPSVLEWTQRGV 117
Query: 526 VVNQKVDPVVSWDPYLRV-TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS 584
V Q + + R T T G+G T S+ +RIP+W +S GA
Sbjct: 118 TVTQTTE-------FPRGDTTTLKVAGAG-TWSMRVRIPSW-ASGGA------------- 155
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
QLP+ L DD ++ A+ +GP +L+G+
Sbjct: 156 -------------------QLPMKLHVIPANDD----PNVAALAFGPVILSGN 185
>gi|284043399|ref|YP_003393739.1| hypothetical protein Cwoe_1938 [Conexibacter woesei DSM 14684]
gi|283947620|gb|ADB50364.1| protein of unknown function DUF1680 [Conexibacter woesei DSM 14684]
Length = 711
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 130/566 (22%), Positives = 232/566 (40%), Gaps = 103/566 (18%)
Query: 113 EFLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEE 172
+ L ++ V LG D R + + D L++ FR APG P GW
Sbjct: 13 KILTAMNYQGVELG-DCRQRRQLEEACATFAGVSNDALLYPFRIRKGSWAPGIPLRGWYG 71
Query: 173 PSCELRGHF--VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE 230
G F +G + + A ++A+T EK A++ +E G G+LS+
Sbjct: 72 -----EGLFNNLGQFFTLYARLYAATGEHRFAEKALALLDGWEETIEEDG-GFLSSHFAG 125
Query: 231 QFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEAL----RMTTWMVEYFYNRVQNVI 286
+ Y+ K++ GLLD + Y + AL R++ WM R
Sbjct: 126 TVE------------YSYDKLVCGLLDLHEYVGSERALPVLERVSRWM-----QRHGGSS 168
Query: 287 KKYSIER----HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCF--------L 334
K Y+ W TL E L + + +T DP + LA+ + F +
Sbjct: 169 KPYAWSGMGPLEWYTLPE-------YLLRAYAVTSDPLYRELANAYRYDEFYDALLERDV 221
Query: 335 GLLALQADDISGFH-SNTHIPIVIGSQMRYEVTGDQLHKE----GHQL--ESSGTNIGHF 387
G L +AD+ F+ +++H + + YE TGD + + G++L ES G F
Sbjct: 222 GALMRRADEARNFYQAHSHANTLNSAAAVYETTGDPRYLDVLTAGYELLRESQTFATGMF 281
Query: 388 N----FKSDPKRLA--SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGV 441
F +R+ + + + E +C ++ M+++ RHL T E + D+ E ++ NG+
Sbjct: 282 GPLEAFMKPRQRVEVLHSEEGHAEVACPSWAMMRLVRHLIELTGEAQFGDWMELNVYNGI 341
Query: 442 LGIQRGTEPGVMIYLLPLAPGSSKE--------RSYHHWGTPSDSFWCCYGTGIESFSKL 493
G+ P A G + + R+ WG + CC T + ++
Sbjct: 342 -----GSAPPTR------ADGRATQYFADYGLDRATKTWGV---EWSCCSTTSGINMAEY 387
Query: 494 GDSIYFE-EEGKYPGVYIIQYISSRLDWKSGQIVVNQK----VDPVVSWDPYLRVTLTFS 548
+ IY+ + + +Y+ ++ +D + + Q+ VD V++D +RV
Sbjct: 388 VNQIYYAGPDALHVCLYLPSSVTCEID--GATLWLTQRTAYPVDERVAFD--VRVERP-- 441
Query: 549 SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLT 608
L ++ R+P WT+ + TL+G+ + + +V +TW D + + LP+
Sbjct: 442 -----LRGTIAFRVPAWTAGE-PRLTLDGEPVEHVVRDGWATVERTWEDGDAIELTLPME 495
Query: 609 LRTEAIQDDRPEYASIQAILYGPYVL 634
L ++ A A+ YGP VL
Sbjct: 496 LAVLPVEPATD--AGPVALRYGPVVL 519
>gi|374374779|ref|ZP_09632437.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
gi|373231619|gb|EHP51414.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
Length = 614
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 125/562 (22%), Positives = 229/562 (40%), Gaps = 81/562 (14%)
Query: 179 GHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEAL 238
G VG YL A+A W T N +LK +M + + L + ++ GYL + + +
Sbjct: 89 GEHVGKYLEAAANTWIITKNAALKTQMDRIFNEL--IKTQLPDGYLGTYLPDSY------ 140
Query: 239 IPVWAPYYT-IHKI-LAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQ 296
W + +HK L GLL Y + AL + + + ++ + I +
Sbjct: 141 ---WTSWDVWVHKYDLVGLLAYYRVTGDRRALTAAVKVGDLLLKNIGDLPGQKDIIKTGS 197
Query: 297 TLNEEAGGMNDVLYKLFCITQDPKHL----MLAHLFDKPCFLGLLAL-----QADDISGF 347
+ A + D + L+ T D ++L + +D P ++ Q D ++
Sbjct: 198 HVGMAATSVIDPMTDLYQWTGDRRYLDFCKYIIKAYDHPAGPSIVTTLLKEKQVDKVANG 257
Query: 348 HSNTHIPIVIGSQMRYEVTGDQLHKEG----------HQLESSGTNIGHFNFKSDPKRLA 397
+ + ++G Y +TGD+ + + +L +GT H F D L
Sbjct: 258 KAYEMLSNLVGIIKLYRLTGDEKYLQACRNAFDDIAAKRLFVTGTTSDHERFMPD-NILQ 316
Query: 398 SNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLL 457
++ ++ E C T ++ + LF T ++ Y + E+S+ N +LG + E G + Y
Sbjct: 317 ADTAAHMGEGCVTTTWIQFNVQLFAITGDLKYYNEIEKSVYNHLLGAEN-PETGCVSYYT 375
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL G R + CC + + L + + + P V + +
Sbjct: 376 PLI-GIKPYRC---------NITCCLSSVPRGIA-LIPYLNYGKLNNRPTVLLYE----A 420
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKG---------SGLTTSLNLRIPTWTSS 568
D K + + PV L++ TF +G S +L LR+P W +
Sbjct: 421 ADIKDRVVTAGGRETPVA-----LQINTTFPKEGKATIKVALPSAARFALQLRVPAW--A 473
Query: 569 NGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTI--QLPLTLRTEAIQDDRPEYASIQA 626
NG KA + G+ + + + + W+ ++ + I ++P+T + Y + A
Sbjct: 474 NGFKAVIAGKTYTAQA-NELVVIDRNWARENIIAISFEIPVT-----VLQGGASYPNYIA 527
Query: 627 ILYGPYVL-AGHSIG-DWDITESA--TSLSDWITPIPASYNSQLITFTQEYGNTKFVLTN 682
I GP VL A S+ +DIT++A T ++ +T PA +Q I Q Y T TN
Sbjct: 528 IKRGPQVLSADQSLNPSFDITKTAFRTPVAVQLTSTPAKLPAQWIG-KQAYSVTFKTGTN 586
Query: 683 SNQSITMEKFP---KSGTDAAL 701
Q + + + ++G DA++
Sbjct: 587 KEQPVLLVPYAEASQTGGDASV 608
>gi|361069271|gb|AEW08947.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
Length = 75
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/75 (50%), Positives = 51/75 (68%)
Query: 789 YTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLL 848
Y A + Q ++ +L C T+ FN A+SF G ++YHPISF+A+GA R +LLAPLL
Sbjct: 1 YGAESYQVGQAVELRCKPLVTDLAFNRASSFTWNTGFAKYHPISFIARGARRAYLLAPLL 60
Query: 849 SLRDESYTVYFDFQS 863
+ RDESYTVYF+ S
Sbjct: 61 TYRDESYTVYFNITS 75
>gi|383146477|gb|AFG54937.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146481|gb|AFG54941.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
Length = 75
Score = 80.5 bits (197), Expect = 4e-12, Method: Composition-based stats.
Identities = 37/75 (49%), Positives = 51/75 (68%)
Query: 789 YTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLL 848
Y A + Q ++ +L C T+ FN A+SF G ++YHPISF+A+GA R +LLAPLL
Sbjct: 1 YGAESYQVGQAVELRCKPLVTDLAFNRASSFTWNTGFAKYHPISFIARGARRAYLLAPLL 60
Query: 849 SLRDESYTVYFDFQS 863
+ RDESYTVYF+ +
Sbjct: 61 AYRDESYTVYFNITA 75
>gi|189467199|ref|ZP_03015984.1| hypothetical protein BACINT_03583 [Bacteroides intestinalis DSM
17393]
gi|189435463|gb|EDV04448.1| hypothetical protein BACINT_03583 [Bacteroides intestinalis DSM
17393]
Length = 175
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 57/94 (60%), Gaps = 7/94 (7%)
Query: 148 DKLVWNFRKTARLPAPGE-------PYGGWEEPSCELRGHFVGHYLSASALMWASTHNES 200
++L+ +FR A + A E GGWE CELRGH GH LSA ALM+AST +E
Sbjct: 75 NRLLHSFRDNAGVFAGREGGDMTVKKLGGWESLDCELRGHTTGHLLSAYALMYASTGSEI 134
Query: 201 LKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDR 234
K K ++V+ L+ Q +G+GYLSA+P E +R
Sbjct: 135 FKLKGDSLVTGLAEVQAALGNGYLSAYPEELINR 168
>gi|383146472|gb|AFG54932.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146473|gb|AFG54933.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146474|gb|AFG54934.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146475|gb|AFG54935.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146476|gb|AFG54936.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146478|gb|AFG54938.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146479|gb|AFG54939.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146480|gb|AFG54940.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146482|gb|AFG54942.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146483|gb|AFG54943.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146484|gb|AFG54944.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146485|gb|AFG54945.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146486|gb|AFG54946.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146487|gb|AFG54947.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146488|gb|AFG54948.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
gi|383146489|gb|AFG54949.1| Pinus taeda anonymous locus CL2380Contig1_03 genomic sequence
Length = 75
Score = 79.0 bits (193), Expect = 1e-11, Method: Composition-based stats.
Identities = 36/75 (48%), Positives = 51/75 (68%)
Query: 789 YTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLL 848
Y A + Q ++ +L C T+ FN A+SF G ++YHPISF+A+GA R +LLAPLL
Sbjct: 1 YGAESYQVGQAVELRCKPLVTDLAFNRASSFTWNTGFAKYHPISFIARGARRAYLLAPLL 60
Query: 849 SLRDESYTVYFDFQS 863
+ +DESYTVYF+ +
Sbjct: 61 AYKDESYTVYFNITA 75
>gi|357472929|ref|XP_003606749.1| hypothetical protein MTR_4g065190 [Medicago truncatula]
gi|355507804|gb|AES88946.1| hypothetical protein MTR_4g065190 [Medicago truncatula]
Length = 111
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 64/134 (47%), Gaps = 24/134 (17%)
Query: 729 MLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVAGLDGGDRTVSLESETYKGCFV 788
MLEPFD PGM V + L++ DS SSVF G R +S +
Sbjct: 1 MLEPFDLPGMTVSHQGPEKPLIIVDSSHGGPSSVFSC------GTRIGWTKSNN-----I 49
Query: 789 YTAVNLQSSESTKLGCISESTEAGFNNAASFVIEKGLSEYHPISFVAKGANRNFLLAPLL 848
+ L + FV KGL +YHPISFVAKGAN+NFLL PL
Sbjct: 50 FRITKLLLKLVLTKQLV-------------FVSGKGLRQYHPISFVAKGANQNFLLDPLF 96
Query: 849 SLRDESYTVYFDFQ 862
+ RDE YTVYF+ Q
Sbjct: 97 NFRDEHYTVYFNIQ 110
>gi|423223914|ref|ZP_17210383.1| hypothetical protein HMPREF1062_02569 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392637516|gb|EIY31383.1| hypothetical protein HMPREF1062_02569 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 664
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 94/208 (45%), Gaps = 21/208 (10%)
Query: 400 LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL 459
L N E+C T + +++++ L T E YAD ER + N V Q E GV Y
Sbjct: 339 LSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHVFAAQ-DCESGVCRY--HT 395
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
AP SK Y H P CC +G S L IY E E ++ YI QY+ S+
Sbjct: 396 APNGSKPDGYFH--GPD----CCTASGHRIISMLPTFIYAEREKEF---YINQYMPSQYT 446
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
K + ++ + LT S+ +LNLRIP+W K +NG++
Sbjct: 447 GKDFAFEITG------NYPESENMQLTIVSE-KARNKTLNLRIPSWCEHPEIK--VNGEN 497
Query: 580 LPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
+ PG +L + + W+ DK++I P+
Sbjct: 498 IADVKPGTYLKLPRKWTKGDKVSITFPM 525
>gi|224537087|ref|ZP_03677626.1| hypothetical protein BACCELL_01964 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521314|gb|EEF90419.1| hypothetical protein BACCELL_01964 [Bacteroides cellulosilyticus
DSM 14838]
Length = 664
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 64/208 (30%), Positives = 96/208 (46%), Gaps = 21/208 (10%)
Query: 400 LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL 459
L N E+C T + +++++ L T E YAD ER + N V Q E GV Y
Sbjct: 339 LSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHVFAAQ-DCESGVCRY--HT 395
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
AP SK Y H P CC +G S L IY E+ ++ YI QYI S+
Sbjct: 396 APNGSKPDGYFH--GPD----CCTASGHRIISMLPTFIYAEKGKEF---YINQYIPSQYT 446
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
K + ++ + LT S+ + T LNLRIP+W K +NG++
Sbjct: 447 GKDFAFEITG------NYPESENMQLTIVSEKAKNKT-LNLRIPSWCEHPEIK--VNGEN 497
Query: 580 LPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
+ PG +L +++ W+ DK++I P+
Sbjct: 498 IADVKPGAYLKLSRKWTKGDKVSITFPM 525
>gi|340619901|ref|YP_004738354.1| hypothetical protein zobellia_3937 [Zobellia galactanivorans]
gi|339734698|emb|CAZ98075.1| Conserved hypothetical periplasmic protein [Zobellia
galactanivorans]
Length = 629
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 108/488 (22%), Positives = 194/488 (39%), Gaps = 74/488 (15%)
Query: 176 ELRGHFVGH--YLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFD 233
E+ G F+G + AS + A +H+ + E + +V + +++ +GY + E+
Sbjct: 78 EVVGAFIGMGMLIDASVRLAAYSHDPKMMEIKNEIVDKV--IDEQLKNGYSGFYKPER-- 133
Query: 234 RLEALIPVW-----APYYTIHK---ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNV 285
RL W + IH+ I+ GL Y N +L+ ++ +
Sbjct: 134 RL------WNSQGGGDNWDIHEMAFIIDGLTSDYELFGNKRSLKAAIKTADFIMEHWHEM 187
Query: 286 IKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA------HLFDKPCFLGLLAL 339
Y+ E L+ G++ +++L+ T + + L + + +D +G
Sbjct: 188 PDDYAAEVDMHVLDT---GIDWAIFRLYKTTGEKRFLNFSEKTKSLYQWDTKIEIG---- 240
Query: 340 QADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKSDPKRLAS- 398
+ +SG H + + + Y TG++ +L N F D ++
Sbjct: 241 RRPGVSG-HMFAYFAMCMAQIELYRYTGNK------ELLQQTENAMRFFLAEDGLTISGS 293
Query: 399 ---------NLDSNTE--ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRG 447
+ D E E+C T +V L R T + Y D ER++ NG+ G Q
Sbjct: 294 AGQREIWTDDQDGENELGETCATAYQTRVYESLLRLTGKAEYGDLIERTVYNGLFGAQ-S 352
Query: 448 TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPG 507
+ G + Y P ER Y+ + CC G S+L +Y+ +
Sbjct: 353 PDGGKLRYYTPF----EGERHYYDV-----EYMCCPGNFRRIISELPGMVYYRSKEDGVA 403
Query: 508 VYIIQYISSRLDWKSGQIV-VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
V + +R++ G V V QK S+ RV L+ S + T L+LRIP+W
Sbjct: 404 VNLYAQSEARVELNDGITVDVQQK----TSYPTSGRVELSVSPNKAS-TFPLSLRIPSWA 458
Query: 567 SSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
A +NG+ PG F+ +T+ W+S D++ + P+ +R R +
Sbjct: 459 KE--ATIMVNGEKWQGEIKPGTFVDITRKWTSKDRVLLDFPMDIR---FIKGRKRNSGRV 513
Query: 626 AILYGPYV 633
A++ GP V
Sbjct: 514 ALMRGPIV 521
>gi|427384256|ref|ZP_18880761.1| hypothetical protein HMPREF9447_01794 [Bacteroides oleiciplenus YIT
12058]
gi|425727517|gb|EKU90376.1| hypothetical protein HMPREF9447_01794 [Bacteroides oleiciplenus YIT
12058]
Length = 662
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 62/214 (28%), Positives = 97/214 (45%), Gaps = 33/214 (15%)
Query: 400 LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL 459
L N E+C T + +++++ L T E YAD ER + N V Q E GV Y
Sbjct: 339 LSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHVFAAQ-DCENGVCRY--HT 395
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
AP SK Y H P CC +G S L IY E+ ++ Y+ QY+ S+ +
Sbjct: 396 APNGSKPDGYFH--GPD----CCTASGHRIISMLPTFIYAEKGKEF---YVNQYMPSQYN 446
Query: 520 WK------SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
K +G ++ ++ V+ S K T +NLRIP+W + K
Sbjct: 447 GKDFAFSITGNYPESENMELVIE-----------SEKAKNKT--INLRIPSWCEN--PKV 491
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
++NG+ + PG +L +++ W DK+ I P+
Sbjct: 492 SVNGEAVADIKPGTYLKLSRKWGKGDKINIIFPM 525
>gi|332881627|ref|ZP_08449275.1| hypothetical protein HMPREF9074_05065 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357045708|ref|ZP_09107342.1| hypothetical protein HMPREF9441_01351 [Paraprevotella clara YIT
11840]
gi|332680266|gb|EGJ53215.1| hypothetical protein HMPREF9074_05065 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355531373|gb|EHH00772.1| hypothetical protein HMPREF9441_01351 [Paraprevotella clara YIT
11840]
Length = 586
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 92/208 (44%), Gaps = 21/208 (10%)
Query: 400 LDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL 459
+ N E+C T + +++++ L T E YAD ER + N V Q E G Y
Sbjct: 265 VSGNVVETCATMSWMQLTQMLLELTGESKYADAMERLMMNHVFAAQ-DCESGTCRY--HT 321
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
AP +K Y H P CC +G S L + ++ E GK YI QY+ SR D
Sbjct: 322 APNGTKPHDYFH--GPD----CCTASGHRIISLL-PTFFYAENGK--DFYINQYLPSRYD 372
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
K ++ S V SSK LNLRIP+W + + ++NG+
Sbjct: 373 GKDFAFEISGNYPESES-----MVLTVLSSKNK--NKILNLRIPSWCKA--PEVSVNGER 423
Query: 580 LPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
+ G +L++T+ W DK+ I P+
Sbjct: 424 VSGIEAGKYLAITRKWEKGDKIGITFPM 451
>gi|330998039|ref|ZP_08321870.1| hypothetical protein HMPREF9442_02974 [Paraprevotella xylaniphila
YIT 11841]
gi|329569340|gb|EGG51120.1| hypothetical protein HMPREF9442_02974 [Paraprevotella xylaniphila
YIT 11841]
Length = 661
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 120/285 (42%), Gaps = 39/285 (13%)
Query: 339 LQADDISGF-HSNTHIPIVIGSQMRYEVTGDQ------------LHKEGHQLESSGTNIG 385
L D++ + HS+T +G Y +TGD+ +HK + +
Sbjct: 270 LGVDELQPYVHSHTFQMNFMGFLRLYRITGDKSLFRKVEGAWEDIHKRQMYITGGVSVAE 329
Query: 386 HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQ 445
H+ + N E+C T + +++++ L T E YAD ER + N V Q
Sbjct: 330 HYEHG-----YVKPVSGNVVETCATMSWMQLTQMLLELTGESKYADAMERLMMNHVFAAQ 384
Query: 446 RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKY 505
E G Y AP +K SY H P CC +G S L +Y E ++
Sbjct: 385 -DCETGTCRY--HTAPNGTKPASYFH--GPD----CCTASGHRIISMLPTFMYAERGKEF 435
Query: 506 PGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
++ QY+ S K ++ + + LT S+ + LNLRIP+W
Sbjct: 436 ---FVNQYLPSHYIGKDFAFQISGNYPEAEN------MELTVLSE-KAVDRVLNLRIPSW 485
Query: 566 TSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
+ + ++NG+++ PG +L +++ WS DK++I P+ R
Sbjct: 486 CKA--PRVSVNGKNVIGVEPGTYLKISRKWSKGDKVSIVFPMEER 528
>gi|365847237|ref|ZP_09387726.1| hypothetical protein HMPREF0880_01230 [Yokenella regensburgei ATCC
43003]
gi|364572491|gb|EHM50031.1| hypothetical protein HMPREF0880_01230 [Yokenella regensburgei ATCC
43003]
Length = 659
Score = 72.4 bits (176), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 63/239 (26%), Positives = 100/239 (41%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DTVYAESCASIGLMMFARRMLEMEADSRYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG +Y + +YI YI
Sbjct: 396 VHPKSLKFNHIYDHIKPVRQRWFGCACCPPNIARVLTSLGHYLYTSRD---EALYINLYI 452
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + + W +V++T S + + +L LRIP W + A+
Sbjct: 453 GNSVEIPVAGHALRLHISGDYPWQE--QVSITVESPDT-VNHTLALRIPDWCVN--AQVM 507
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+++PL +L +T+ W DKL + LP+ +R A AI GP V
Sbjct: 508 LNGEEIPLLPHKGYLHITRDWQEGDKLLLTLPMPVRRVYANPLMRHAAGKIAIQRGPLV 566
>gi|237719720|ref|ZP_04550201.1| predicted protein [Bacteroides sp. 2_2_4]
gi|229450989|gb|EEO56780.1| predicted protein [Bacteroides sp. 2_2_4]
Length = 663
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 68/273 (24%), Positives = 113/273 (41%), Gaps = 38/273 (13%)
Query: 348 HSNTHIPIVIGSQMRYEVTGDQ------------LHKEGHQLESSGTNIGHFNFKSDPKR 395
HS+T +G Y +TGD+ +HK + + H+
Sbjct: 282 HSHTFQMNFMGFLRLYRITGDKSLFRKVAGAWDDIHKRQMYITGGVSVAEHYEHD----- 336
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY 455
+ + E+C T + +++++ L T E YAD ER + N V Q E G Y
Sbjct: 337 YVKPISGHVVETCATMSWMQLTQMLLELTGESKYADAMERLMINHVFAAQ-DCETGSCRY 395
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
AP SK Y H P CC +G S L +Y E+ ++ Y+ QY+
Sbjct: 396 --HTAPNGSKPHGYFH--GPD----CCTASGHRIISMLPTFMYAEKGKEF---YVNQYVP 444
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S+ K+ ++ V + + LT +S+ LNLRIP+W + ++
Sbjct: 445 SQYAGKAFSFEISGNYPEVEN------MELTVTSERVA-DRVLNLRIPSWCEK--PQVSV 495
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLT 608
NG+ + PG +L +++ W DK+ I P+
Sbjct: 496 NGEKMAGVQPGTYLKISRKWVKGDKVCIVFPMV 528
>gi|284122982|ref|ZP_06386886.1| protein of unknown function DUF1680 [Candidatus Poribacteria sp.
WGA-A3]
gi|283829311|gb|EFC33713.1| protein of unknown function DUF1680 [Candidatus Poribacteria sp.
WGA-A3]
Length = 577
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 107/479 (22%), Positives = 191/479 (39%), Gaps = 90/479 (18%)
Query: 187 SASALMWASTH-NESLKEKMSAVVSALSACQKEIGSGYLSAF-----PTEQFDRLEALIP 240
+AS +W TH N + + ++ V++ ++ACQ+ GYL+++ PT+++ L +
Sbjct: 21 AASYTLW--THPNPTWEPELDEVIAKIAACQQP--DGYLNSYFTLVEPTKRWQNLGMMHE 76
Query: 241 VWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNE 300
+ Y + + Y L + + N K+ + H
Sbjct: 77 L----YCAGHLFEAAVAHYQATGKQTLLDVACRFADLIDNTF-GFDKRDGLPGH------ 125
Query: 301 EAGGMNDVLYKLFCITQDPKHLMLAHLF------------------DKPCFLGLLA---L 339
G+ L KL +T +P+++ LA F D P LG
Sbjct: 126 --EGIELALVKLARVTGEPRYMALAEYFVTRRGHSPSIFEKELENPDLPGGLGAYQHHFT 183
Query: 340 QADDISGFHSNTHIPI-----VIGSQMR----YEVTGDQLHKEG-----HQLESSGTNIG 385
+ G ++ H+PI +G +R Y D ++ G + LE+ N+G
Sbjct: 184 RDGKYEGHYAQAHLPIQEQTECVGHAVRAMYLYSGAADIAYETGDSAITNALEALWQNVG 243
Query: 386 ---HFNFKSDPKRLASNLDSNTE--------ESCTTYNMLKVSRHLFRWTKEIAYADYYE 434
+ P ++ E E+C + ++ + +F E + D E
Sbjct: 244 KRLYITGGVGPSGHNEGFTTDYELPNFSAYAETCASIGLIFWAHRMFLLRAESRFVDVLE 303
Query: 435 RSLTNGVL-GIQ-RGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSK 492
+L NG L GI GT Y PLA S +R H W + CC +
Sbjct: 304 TALYNGALSGISLDGTG---FFYQNPLA--SHGDRHRHEWFGCA----CCPPNIARLLAS 354
Query: 493 LGDSIYFEEEGKYPGVYIIQYISSRLD-WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKG 551
+G IY E E G+Y+ Y+S D +G + V + W + +T+T ++
Sbjct: 355 VGQYIYAESE---EGIYVNLYVSITADAIAAGNVPVRLTQETDYPWAGDVTLTITPTTP- 410
Query: 552 SGLTTSLNLRIPTWTSSNGAKATLNGQ-DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTL 609
+ +LNLRIP W + +NG+ D P+ +L++T+ W + D++ +QLP+ +
Sbjct: 411 --VPFTLNLRIPGWCDQ--CEVRVNGEADNSQPNATGYLTITREWRAGDRVQLQLPMPV 465
>gi|238910286|ref|ZP_04654123.1| hypothetical protein SentesTe_04004 [Salmonella enterica subsp.
enterica serovar Tennessee str. CDC07-0191]
Length = 651
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 61/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPRSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ G + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEIPVGNGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|409730702|ref|ZP_11272263.1| hypothetical protein Hham1_15864 [Halococcus hamelinensis 100A6]
gi|448723717|ref|ZP_21706233.1| hypothetical protein C447_11225 [Halococcus hamelinensis 100A6]
gi|445787256|gb|EMA38004.1| hypothetical protein C447_11225 [Halococcus hamelinensis 100A6]
Length = 639
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 63/258 (24%), Positives = 111/258 (43%), Gaps = 24/258 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + ++ + T + YAD ER+L NG L G+ G E Y PL SS
Sbjct: 335 ETCAAIGSVFWNQRMLERTGDAKYADLIERTLYNGFLAGV--GLEGKEFFYENPLE--SS 390
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
+ W T + CC F+ LG +Y ++ +++ QY+ SR+ + G
Sbjct: 391 GDHHRKGWFTCA----CCPPNAARLFASLGGYLYGDDGDD---LFVHQYVGSRVSTEVGG 443
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS 584
V+ V+ + W + + +T S G + +L LR+P W S G +NG+ +
Sbjct: 444 TAVDLDVETDLPWSGDVSLDVTAS---EGESFALRLRVPAW--SEGTTVEVNGESVDAAV 498
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDI 644
+L++ + W +DD + + T++T A + A+ GP V +
Sbjct: 499 EDGYLALDREW-TDDTVELTFEQTVQTVRAHPAVEADAGLVAVERGPLVYC------LEA 551
Query: 645 TESATSLSDWITPIPASY 662
T++ L ++ P Y
Sbjct: 552 TDNDRPLHQYVLPTDGEY 569
>gi|198242542|ref|YP_002217640.1| hypothetical protein SeD_A4064 [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gi|375121158|ref|ZP_09766325.1| Six-hairpin glycosidase-like domain protein [Salmonella enterica
subsp. enterica serovar Dublin str. SD3246]
gi|445143487|ref|ZP_21386535.1| hypothetical protein SEEDSL_010877 [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
gi|445149123|ref|ZP_21388948.1| hypothetical protein SEEDHWS_023670 [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
gi|197937058|gb|ACH74391.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gi|326625425|gb|EGE31770.1| Six-hairpin glycosidase-like domain protein [Salmonella enterica
subsp. enterica serovar Dublin str. SD3246]
gi|444848141|gb|ELX73271.1| hypothetical protein SEEDSL_010877 [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
gi|444858418|gb|ELX83404.1| hypothetical protein SEEDHWS_023670 [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
Length = 651
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVGNGALKLRIGGNYPWQEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|438041968|ref|ZP_20855782.1| hypothetical protein SEEE5646_18016 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-5646]
gi|435321796|gb|ELO94162.1| hypothetical protein SEEE5646_18016 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-5646]
Length = 646
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVGNGALKLRIGGNYPWQEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|205354717|ref|YP_002228518.1| hypothetical protein SG3751 [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|375125607|ref|ZP_09770771.1| hypothetical protein SG9_3828 [Salmonella enterica subsp. enterica
serovar Gallinarum str. SG9]
gi|445130406|ref|ZP_21381321.1| hypothetical protein SEEG9184_014572 [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
gi|205274498|emb|CAR39532.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|326629857|gb|EGE36200.1| hypothetical protein SG9_3828 [Salmonella enterica subsp. enterica
serovar Gallinarum str. SG9]
gi|444852215|gb|ELX77297.1| hypothetical protein SEEG9184_014572 [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
Length = 651
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVGNGALKLRIGGNYPWQEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|378957466|ref|YP_005214953.1| hypothetical protein SPUL_3886 [Salmonella enterica subsp. enterica
serovar Gallinarum/pullorum str. RKS5078]
gi|438120755|ref|ZP_20872004.1| hypothetical protein SEEP9120_00215 [Salmonella enterica subsp.
enterica serovar Pullorum str. ATCC 9120]
gi|357208077|gb|AET56123.1| hypothetical protein SPUL_3886 [Salmonella enterica subsp. enterica
serovar Gallinarum/pullorum str. RKS5078]
gi|434943466|gb|ELL49584.1| hypothetical protein SEEP9120_00215 [Salmonella enterica subsp.
enterica serovar Pullorum str. ATCC 9120]
Length = 651
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVGNGALKLRIGGNYPWQEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|207858916|ref|YP_002245567.1| hypothetical protein SEN3501 [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|421357264|ref|ZP_15807576.1| hypothetical protein SEEE3139_04408 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|421362069|ref|ZP_15812325.1| hypothetical protein SEEE0166_05558 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|421368596|ref|ZP_15818785.1| hypothetical protein SEEE0631_15493 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|421370704|ref|ZP_15820867.1| hypothetical protein SEEE0424_03357 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|421376619|ref|ZP_15826719.1| hypothetical protein SEEE3076_10306 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|421379882|ref|ZP_15829946.1| hypothetical protein SEEE4917_03813 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|421387196|ref|ZP_15837201.1| hypothetical protein SEEE6622_17936 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|421388833|ref|ZP_15838818.1| hypothetical protein SEEE6670_03439 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|421393233|ref|ZP_15843178.1| hypothetical protein SEEE6426_02862 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|421400876|ref|ZP_15850758.1| hypothetical protein SEEE6437_19177 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|421404698|ref|ZP_15854538.1| hypothetical protein SEEE7246_15620 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|421408356|ref|ZP_15858156.1| hypothetical protein SEEE7250_11351 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|421414364|ref|ZP_15864109.1| hypothetical protein SEEE1427_18818 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|421418252|ref|ZP_15867957.1| hypothetical protein SEEE2659_15744 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|421423488|ref|ZP_15873147.1| hypothetical protein SEEE1757_19367 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|421427667|ref|ZP_15877286.1| hypothetical protein SEEE5101_17714 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|421429796|ref|ZP_15879391.1| hypothetical protein SEEE8B1_05655 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|421437646|ref|ZP_15887162.1| hypothetical protein SEEE5518_21932 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|421438534|ref|ZP_15888029.1| hypothetical protein SEEE1618_03571 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|421443523|ref|ZP_15892964.1| hypothetical protein SEEE3079_05727 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|436605457|ref|ZP_20513395.1| hypothetical protein SEE22704_08488 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|436694238|ref|ZP_20518150.1| hypothetical protein SEE30663_08486 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|436803411|ref|ZP_20525841.1| hypothetical protein SEECHS44_21392 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|436810025|ref|ZP_20529267.1| hypothetical protein SEEE1882_15710 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|436816420|ref|ZP_20533798.1| hypothetical protein SEEE1884_15817 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|436832038|ref|ZP_20536533.1| hypothetical protein SEEE1594_06730 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|436849358|ref|ZP_20540514.1| hypothetical protein SEEE1566_04035 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|436858888|ref|ZP_20547165.1| hypothetical protein SEEE1580_15161 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|436862962|ref|ZP_20549538.1| hypothetical protein SEEE1543_04463 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|436874233|ref|ZP_20556894.1| hypothetical protein SEEE1441_19202 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|436876728|ref|ZP_20558061.1| hypothetical protein SEEE1810_02403 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|436886249|ref|ZP_20562678.1| hypothetical protein SEEE1558_02954 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|436893215|ref|ZP_20567194.1| hypothetical protein SEEE1018_02862 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|436900848|ref|ZP_20571778.1| hypothetical protein SEEE1010_03453 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|436913977|ref|ZP_20579179.1| hypothetical protein SEEE1729_18338 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|436919198|ref|ZP_20582051.1| hypothetical protein SEEE0895_09976 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|436928295|ref|ZP_20587740.1| hypothetical protein SEEE0899_15810 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|436937155|ref|ZP_20592450.1| hypothetical protein SEEE1457_16920 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|436944088|ref|ZP_20596699.1| hypothetical protein SEEE1747_15796 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|436953454|ref|ZP_20601804.1| hypothetical protein SEEE0968_18795 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|436962937|ref|ZP_20605560.1| hypothetical protein SEEE1444_14849 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|436967670|ref|ZP_20607424.1| hypothetical protein SEEE1445_01381 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|436978926|ref|ZP_20612901.1| hypothetical protein SEEE1559_06556 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|436995892|ref|ZP_20619592.1| hypothetical protein SEEE1565_17627 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|437011806|ref|ZP_20624610.1| hypothetical protein SEEE1808_20431 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|437019323|ref|ZP_20627061.1| hypothetical protein SEEE1811_09865 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|437026609|ref|ZP_20629868.1| hypothetical protein SEEE0956_01180 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|437041181|ref|ZP_20635197.1| hypothetical protein SEEE1455_05273 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|437051574|ref|ZP_20641455.1| hypothetical protein SEEE1575_14346 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|437056616|ref|ZP_20644024.1| hypothetical protein SEEE1725_04709 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|437067549|ref|ZP_20650399.1| hypothetical protein SEEE1745_14184 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|437073604|ref|ZP_20653177.1| hypothetical protein SEEE1791_05329 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|437082599|ref|ZP_20658441.1| hypothetical protein SEEE1795_09343 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|437089107|ref|ZP_20661970.1| hypothetical protein SEEE6709_04610 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|437103922|ref|ZP_20666960.1| hypothetical protein SEEE9058_06907 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|437126597|ref|ZP_20674605.1| hypothetical protein SEEE0816_23102 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|437131843|ref|ZP_20677676.1| hypothetical protein SEEE0819_15610 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|437136794|ref|ZP_20680031.1| hypothetical protein SEEE3072_04644 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|437143889|ref|ZP_20684687.1| hypothetical protein SEEE3089_05314 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|437154248|ref|ZP_20690986.1| hypothetical protein SEEE9163_14382 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|437162604|ref|ZP_20696211.1| hypothetical protein SEEE151_18100 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|437166884|ref|ZP_20698338.1| hypothetical protein SEEEN202_06125 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|437178010|ref|ZP_20704356.1| hypothetical protein SEEE3991_14003 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|437183055|ref|ZP_20707414.1| hypothetical protein SEEE3618_06829 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|437198906|ref|ZP_20711454.1| hypothetical protein SEEE1831_04503 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|437262882|ref|ZP_20719212.1| hypothetical protein SEEE2490_17657 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|437271416|ref|ZP_20723680.1| hypothetical protein SEEEL909_17675 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|437275478|ref|ZP_20725823.1| hypothetical protein SEEEL913_05569 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|437291505|ref|ZP_20731569.1| hypothetical protein SEEE4941_12100 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|437304204|ref|ZP_20733917.1| hypothetical protein SEEE7015_01180 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|437324305|ref|ZP_20739563.1| hypothetical protein SEEE7927_06991 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|437339496|ref|ZP_20744149.1| hypothetical protein SEEECHS4_07410 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|437430625|ref|ZP_20755828.1| hypothetical protein SEEE2217_21365 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|437447211|ref|ZP_20758929.1| hypothetical protein SEEE4018_14080 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|437464509|ref|ZP_20763586.1| hypothetical protein SEEE6211_14683 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|437474444|ref|ZP_20766236.1| hypothetical protein SEEE4441_05197 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|437490700|ref|ZP_20771023.1| hypothetical protein SEEE4647_06743 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|437518116|ref|ZP_20778521.1| hypothetical protein SEEE9845_22471 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|437563498|ref|ZP_20786805.1| hypothetical protein SEEE0116_18550 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|437572857|ref|ZP_20789281.1| hypothetical protein SEEE1117_07915 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|437593902|ref|ZP_20795526.1| hypothetical protein SEEE1392_17150 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|437607245|ref|ZP_20800160.1| hypothetical protein SEEE0268_17650 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|437617397|ref|ZP_20802955.1| hypothetical protein SEEE0316_08756 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|437653610|ref|ZP_20810238.1| hypothetical protein SEEE0436_23184 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|437661278|ref|ZP_20812888.1| hypothetical protein SEEE1319_12828 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|437677654|ref|ZP_20817320.1| hypothetical protein SEEE4481_12502 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|437691966|ref|ZP_20820894.1| hypothetical protein SEEE6297_07188 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|437707522|ref|ZP_20825711.1| hypothetical protein SEEE4220_08863 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|437725054|ref|ZP_20829741.1| hypothetical protein SEEE1616_05992 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 16-16]
gi|437789741|ref|ZP_20837126.1| hypothetical protein SEEE2651_21208 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 76-2651]
gi|437814063|ref|ZP_20842185.1| hypothetical protein SEEE3944_22067 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|437862553|ref|ZP_20847967.1| hypothetical protein SEEE5621_05022 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
gi|438086893|ref|ZP_20859191.1| hypothetical protein SEEE2625_08240 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|438102729|ref|ZP_20865150.1| hypothetical protein SEEE1976_15564 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|438113496|ref|ZP_20869671.1| hypothetical protein SEEE3407_15758 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|445168673|ref|ZP_21394919.1| hypothetical protein SEE8A_021969 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|445186279|ref|ZP_21399191.1| hypothetical protein SE20037_06257 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|445231881|ref|ZP_21405859.1| hypothetical protein SEE10_020946 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|445237706|ref|ZP_21407161.1| hypothetical protein SEE436_026260 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 436]
gi|445333559|ref|ZP_21414841.1| hypothetical protein SEE18569_008126 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|445345844|ref|ZP_21418446.1| hypothetical protein SEE13_018274 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|445356148|ref|ZP_21421740.1| hypothetical protein SEE23_007297 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
gi|206710719|emb|CAR35080.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|395984836|gb|EJH94014.1| hypothetical protein SEEE0631_15493 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|395991902|gb|EJI01024.1| hypothetical protein SEEE0166_05558 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|395992120|gb|EJI01241.1| hypothetical protein SEEE3139_04408 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|396001983|gb|EJI10994.1| hypothetical protein SEEE3076_10306 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|396004947|gb|EJI13927.1| hypothetical protein SEEE4917_03813 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|396005988|gb|EJI14959.1| hypothetical protein SEEE0424_03357 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|396010336|gb|EJI19249.1| hypothetical protein SEEE6622_17936 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|396017969|gb|EJI26832.1| hypothetical protein SEEE6426_02862 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|396018877|gb|EJI27737.1| hypothetical protein SEEE6670_03439 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|396022763|gb|EJI31575.1| hypothetical protein SEEE6437_19177 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|396025631|gb|EJI34407.1| hypothetical protein SEEE7246_15620 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|396028864|gb|EJI37623.1| hypothetical protein SEEE7250_11351 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|396036970|gb|EJI45625.1| hypothetical protein SEEE1427_18818 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|396037577|gb|EJI46226.1| hypothetical protein SEEE1757_19367 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|396038879|gb|EJI47511.1| hypothetical protein SEEE2659_15744 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|396049784|gb|EJI58322.1| hypothetical protein SEEE5518_21932 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|396050924|gb|EJI59443.1| hypothetical protein SEEE5101_17714 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|396058175|gb|EJI66643.1| hypothetical protein SEEE8B1_05655 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|396070205|gb|EJI78534.1| hypothetical protein SEEE3079_05727 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|396072341|gb|EJI80651.1| hypothetical protein SEEE1618_03571 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|434956555|gb|ELL50284.1| hypothetical protein SEECHS44_21392 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|434966085|gb|ELL58983.1| hypothetical protein SEEE1882_15710 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|434972090|gb|ELL64574.1| hypothetical protein SEE22704_08488 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|434972217|gb|ELL64683.1| hypothetical protein SEEE1884_15817 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|434981889|gb|ELL73751.1| hypothetical protein SEEE1594_06730 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|434987983|gb|ELL79584.1| hypothetical protein SEEE1580_15161 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|434988731|gb|ELL80315.1| hypothetical protein SEEE1566_04035 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|434997520|gb|ELL88761.1| hypothetical protein SEEE1441_19202 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|434998217|gb|ELL89439.1| hypothetical protein SEEE1543_04463 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|435000158|gb|ELL91309.1| hypothetical protein SEE30663_08486 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|435010814|gb|ELM01577.1| hypothetical protein SEEE1810_02403 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|435012005|gb|ELM02695.1| hypothetical protein SEEE1558_02954 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|435018866|gb|ELM09311.1| hypothetical protein SEEE1018_02862 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|435022069|gb|ELM12420.1| hypothetical protein SEEE1010_03453 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|435023777|gb|ELM14017.1| hypothetical protein SEEE1729_18338 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|435030256|gb|ELM20297.1| hypothetical protein SEEE0895_09976 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|435034856|gb|ELM24713.1| hypothetical protein SEEE0899_15810 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|435036430|gb|ELM26251.1| hypothetical protein SEEE1457_16920 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|435040717|gb|ELM30470.1| hypothetical protein SEEE1747_15796 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|435048135|gb|ELM37702.1| hypothetical protein SEEE0968_18795 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|435049092|gb|ELM38627.1| hypothetical protein SEEE1444_14849 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|435060990|gb|ELM50227.1| hypothetical protein SEEE1445_01381 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|435062727|gb|ELM51908.1| hypothetical protein SEEE1565_17627 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|435064420|gb|ELM53549.1| hypothetical protein SEEE1808_20431 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|435069121|gb|ELM58130.1| hypothetical protein SEEE1559_06556 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|435080300|gb|ELM68982.1| hypothetical protein SEEE1811_09865 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|435086361|gb|ELM74900.1| hypothetical protein SEEE0956_01180 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|435086388|gb|ELM74926.1| hypothetical protein SEEE1455_05273 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|435092283|gb|ELM80650.1| hypothetical protein SEEE1575_14346 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|435095779|gb|ELM84062.1| hypothetical protein SEEE1745_14184 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|435097290|gb|ELM85551.1| hypothetical protein SEEE1725_04709 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|435108390|gb|ELM96357.1| hypothetical protein SEEE1791_05329 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|435109351|gb|ELM97304.1| hypothetical protein SEEE1795_09343 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|435115756|gb|ELN03511.1| hypothetical protein SEEE0816_23102 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|435115924|gb|ELN03677.1| hypothetical protein SEEE6709_04610 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|435121957|gb|ELN09480.1| hypothetical protein SEEE9058_06907 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|435123743|gb|ELN11235.1| hypothetical protein SEEE0819_15610 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|435136035|gb|ELN23136.1| hypothetical protein SEEE3072_04644 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|435139610|gb|ELN26601.1| hypothetical protein SEEE3089_05314 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|435139761|gb|ELN26742.1| hypothetical protein SEEE9163_14382 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|435143085|gb|ELN29964.1| hypothetical protein SEEE151_18100 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|435152694|gb|ELN39323.1| hypothetical protein SEEEN202_06125 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|435153800|gb|ELN40397.1| hypothetical protein SEEE3991_14003 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|435161457|gb|ELN47685.1| hypothetical protein SEEE2490_17657 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|435162986|gb|ELN49124.1| hypothetical protein SEEE3618_06829 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|435169890|gb|ELN55648.1| hypothetical protein SEEEL909_17675 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|435174737|gb|ELN60178.1| hypothetical protein SEEEL913_05569 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|435181699|gb|ELN66752.1| hypothetical protein SEEE4941_12100 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|435188330|gb|ELN73047.1| hypothetical protein SEEE7015_01180 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|435194134|gb|ELN78592.1| hypothetical protein SEEE7927_06991 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|435195768|gb|ELN80158.1| hypothetical protein SEEECHS4_07410 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|435199033|gb|ELN83153.1| hypothetical protein SEEE2217_21365 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|435209540|gb|ELN92853.1| hypothetical protein SEEE4018_14080 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|435217080|gb|ELN99522.1| hypothetical protein SEEE6211_14683 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|435220781|gb|ELO03061.1| hypothetical protein SEEE1831_04503 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|435224213|gb|ELO06185.1| hypothetical protein SEEE4441_05197 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|435228101|gb|ELO09552.1| hypothetical protein SEEE9845_22471 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|435229852|gb|ELO11187.1| hypothetical protein SEEE4647_06743 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|435237063|gb|ELO17777.1| hypothetical protein SEEE0116_18550 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|435247221|gb|ELO27192.1| hypothetical protein SEEE1117_07915 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|435251581|gb|ELO31186.1| hypothetical protein SEEE1392_17150 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|435253937|gb|ELO33352.1| hypothetical protein SEEE0268_17650 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|435260557|gb|ELO39749.1| hypothetical protein SEEE0316_08756 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|435264830|gb|ELO43722.1| hypothetical protein SEEE0436_23184 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|435268721|gb|ELO47301.1| hypothetical protein SEEE1319_12828 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|435274894|gb|ELO52988.1| hypothetical protein SEEE4481_12502 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|435280067|gb|ELO57793.1| hypothetical protein SEEE6297_07188 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|435290984|gb|ELO67872.1| hypothetical protein SEEE4220_08863 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|435293025|gb|ELO69762.1| hypothetical protein SEEE1616_05992 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 16-16]
gi|435295196|gb|ELO71717.1| hypothetical protein SEEE2651_21208 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 76-2651]
gi|435295991|gb|ELO72414.1| hypothetical protein SEEE3944_22067 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|435318636|gb|ELO91560.1| hypothetical protein SEEE2625_08240 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|435323736|gb|ELO95733.1| hypothetical protein SEEE1976_15564 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|435329624|gb|ELP01026.1| hypothetical protein SEEE3407_15758 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|435336306|gb|ELP06273.1| hypothetical protein SEEE5621_05022 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
gi|444862919|gb|ELX87757.1| hypothetical protein SEE10_020946 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|444864401|gb|ELX89201.1| hypothetical protein SEE8A_021969 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|444869705|gb|ELX94276.1| hypothetical protein SE20037_06257 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|444875839|gb|ELY00033.1| hypothetical protein SEE18569_008126 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|444878778|gb|ELY02892.1| hypothetical protein SEE13_018274 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|444887218|gb|ELY10942.1| hypothetical protein SEE23_007297 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
gi|444891559|gb|ELY14803.1| hypothetical protein SEE436_026260 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 436]
Length = 651
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVGNGALKLRIGGNYPWQEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|417369073|ref|ZP_12140391.1| secreted protein [Salmonella enterica subsp. enterica serovar
Hvittingfoss str. A4-620]
gi|353585087|gb|EHC45022.1| secreted protein [Salmonella enterica subsp. enterica serovar
Hvittingfoss str. A4-620]
Length = 651
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVGNGALKLRIGGNYPWQEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|421448505|ref|ZP_15897898.1| hypothetical protein SEEE6482_08181 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|396073159|gb|EJI81465.1| hypothetical protein SEEE6482_08181 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
Length = 651
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVGNGALKLRIGGNYPWQEQVKIAIDSVQP---VLHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|197261863|ref|ZP_03161937.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gi|197240118|gb|EDY22738.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
Length = 651
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVGNGALKLRIGGNYPWQEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|168260569|ref|ZP_02682542.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Hadar str. RI_05P066]
gi|205350487|gb|EDZ37118.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Hadar str. RI_05P066]
Length = 651
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVGNGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|437530472|ref|ZP_20780573.1| hypothetical protein SEEE9317_09781, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 648899 3-17]
gi|435244046|gb|ELO24278.1| hypothetical protein SEEE9317_09781, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 648899 3-17]
Length = 349
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 27 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 85
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 86 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 142
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + + +L LR+P W AK T
Sbjct: 143 GNSMEIPVGNGALKLRIGGNYPWQEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 197
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 198 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 256
>gi|298248099|ref|ZP_06971904.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
gi|297550758|gb|EFH84624.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
Length = 638
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 123/528 (23%), Positives = 205/528 (38%), Gaps = 75/528 (14%)
Query: 153 NFRKTARLPAPGEPYGGWEEPSCELRGHF-----VGHYLSASALMWASTHNESLKEKMSA 207
NFR+ A G E P +G F V ++ A A A+ +E L+ +
Sbjct: 70 NFRRAA---------GQVESP---FQGRFFNDSDVYKWVEAVAWTLAAEKDEKLEALVDE 117
Query: 208 VVSALSACQKEIGSGYLSAFPT-EQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAE 266
V+ ++A Q E GYL+ + T E D+ + V Y ++ + +
Sbjct: 118 VIGLIAAAQGE--DGYLNTYFTFENADKRWTDLQVMHELYCAGHLIQAAVAHHRATGKTT 175
Query: 267 ALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAH 326
L + T +Y + V K+ H + + L +L T + ++L LA
Sbjct: 176 LLDVATRFADYI-DSVFGPGKRPGTCGHPE--------IEMALVELARDTGEERYLKLAQ 226
Query: 327 LF------------DKPCFLGLLAL-QADDISGFHSNTHIPIVIGSQMRYEVTGDQ--LH 371
F KP + Q D++ G H+ + + G+ Y TG+Q LH
Sbjct: 227 FFIDNRGQQPPIISGKPYYQDHAPFRQQDEVVG-HAVRALYLYAGATDAYTETGEQALLH 285
Query: 372 K--------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRW 423
+ H++ +G ++ ++ + D E+C + + L
Sbjct: 286 AINALWADLQQHKVYVTGGVGSRYDGEAVGESYELPNDQAYTETCAAIAHIMWAWRLLLL 345
Query: 424 TKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCC 482
T YAD E +L NG+L GI E Y PLA + R +GT CC
Sbjct: 346 TGNALYADAMELTLYNGMLAGISLDGE--SYFYQNPLA-DRGRHRRQPWFGTA-----CC 397
Query: 483 YGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ-IVVNQKVDPVVSWDPYL 541
+ L IY + +++ Y SS + + Q V+ K W+
Sbjct: 398 PPNVARLLASLPGYIYTTSDAD---LWVHLYTSSEANVRLPQGSVLKCKQTSNYPWEG-- 452
Query: 542 RVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDK 600
++ L+ K + LNLRIP W ++GA ++NG+ LP P PG++ + +TW D+
Sbjct: 453 KIKLSIEPKQANAIFGLNLRIPAW--AHGATVSVNGETLPPPIQPGSYYRIERTWQPGDQ 510
Query: 601 LTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL----AGHSIGDWDI 644
+ + LPL +R A+L GP V + H WD+
Sbjct: 511 VELVLPLLMRAVTSHPYISNNNGRVALLRGPLVYCVEQSDHEADVWDL 558
>gi|417329582|ref|ZP_12114395.1| secreted protein [Salmonella enterica subsp. enterica serovar
Adelaide str. A4-669]
gi|353564565|gb|EHC30601.1| secreted protein [Salmonella enterica subsp. enterica serovar
Adelaide str. A4-669]
Length = 651
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++T+ + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWQEQVKITIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|168465016|ref|ZP_02698908.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
gi|418762014|ref|ZP_13318148.1| hypothetical protein SEEN185_06317 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35185]
gi|418768178|ref|ZP_13324234.1| hypothetical protein SEEN199_22735 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35199]
gi|418769292|ref|ZP_13325327.1| hypothetical protein SEEN539_08791 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21539]
gi|418774344|ref|ZP_13330315.1| hypothetical protein SEEN953_04837 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 33953]
gi|418782301|ref|ZP_13338167.1| hypothetical protein SEEN188_15334 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35188]
gi|418784431|ref|ZP_13340269.1| hypothetical protein SEEN559_22215 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21559]
gi|418804570|ref|ZP_13360175.1| hypothetical protein SEEN202_19624 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35202]
gi|419790711|ref|ZP_14316381.1| hypothetical protein SEENLE01_19942 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|419795154|ref|ZP_14320760.1| hypothetical protein SEENLE15_09022 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|195632371|gb|EDX50855.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
gi|392613400|gb|EIW95860.1| hypothetical protein SEENLE01_19942 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|392613862|gb|EIW96317.1| hypothetical protein SEENLE15_09022 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|392732968|gb|EIZ90175.1| hypothetical protein SEEN199_22735 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35199]
gi|392738037|gb|EIZ95186.1| hypothetical protein SEEN185_06317 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35185]
gi|392740729|gb|EIZ97848.1| hypothetical protein SEEN539_08791 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21539]
gi|392744606|gb|EJA01653.1| hypothetical protein SEEN188_15334 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35188]
gi|392751846|gb|EJA08794.1| hypothetical protein SEEN953_04837 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 33953]
gi|392754775|gb|EJA11691.1| hypothetical protein SEEN559_22215 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21559]
gi|392770727|gb|EJA27452.1| hypothetical protein SEEN202_19624 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35202]
Length = 651
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++T+ + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWQEQVKITI---DSVQPVRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|418846200|ref|ZP_13400973.1| hypothetical protein SEEN443_06909 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19443]
gi|418858162|ref|ZP_13412783.1| hypothetical protein SEEN470_08679 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19470]
gi|418865229|ref|ZP_13419709.1| hypothetical protein SEEN536_04836 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19536]
gi|418867555|ref|ZP_13422012.1| hypothetical protein SEEN176_24132 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 4176]
gi|392811425|gb|EJA67435.1| hypothetical protein SEEN443_06909 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19443]
gi|392828511|gb|EJA84203.1| hypothetical protein SEEN536_04836 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19536]
gi|392834500|gb|EJA90106.1| hypothetical protein SEEN470_08679 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19470]
gi|392839395|gb|EJA94937.1| hypothetical protein SEEN176_24132 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 4176]
Length = 651
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAI---DSVQPVRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG D+ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLDVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|161616753|ref|YP_001590718.1| hypothetical protein SPAB_04572 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|161366117|gb|ABX69885.1| hypothetical protein SPAB_04572 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
Length = 651
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG D+ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLDVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|16766964|ref|NP_462579.1| hypothetical protein STM3679 [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gi|167990915|ref|ZP_02572014.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|374978319|ref|ZP_09719662.1| secreted protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. TN061786]
gi|378447048|ref|YP_005234680.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. D23580]
gi|378452556|ref|YP_005239916.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. 14028S]
gi|378701566|ref|YP_005183524.1| hypothetical protein SL1344_3644 [Salmonella enterica subsp.
enterica serovar Typhimurium str. SL1344]
gi|378986276|ref|YP_005249432.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|378990981|ref|YP_005254145.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. UK-1]
gi|379702940|ref|YP_005244668.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. ST4/74]
gi|383498313|ref|YP_005399002.1| hypothetical protein UMN798_3997 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
gi|422027921|ref|ZP_16374245.1| hypothetical protein B571_18397 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|422032964|ref|ZP_16379054.1| hypothetical protein B572_18529 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|427555556|ref|ZP_18929550.1| hypothetical protein B576_18491 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|427573106|ref|ZP_18934155.1| hypothetical protein B577_17365 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|427594481|ref|ZP_18939063.1| hypothetical protein B573_17940 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|427618885|ref|ZP_18943976.1| hypothetical protein B574_18451 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|427642409|ref|ZP_18948833.1| hypothetical protein B575_18572 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|427657950|ref|ZP_18953577.1| hypothetical protein B578_18134 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|427663174|ref|ZP_18958453.1| hypothetical protein B579_18597 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|427679110|ref|ZP_18963359.1| hypothetical protein B580_18826 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|427801169|ref|ZP_18968792.1| hypothetical protein B581_21834 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm5]
gi|16422244|gb|AAL22538.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gi|205330807|gb|EDZ17571.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|261248827|emb|CBG26680.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. D23580]
gi|267995935|gb|ACY90820.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. 14028S]
gi|301160215|emb|CBW19737.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. SL1344]
gi|312914705|dbj|BAJ38679.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|321226733|gb|EFX51783.1| secreted protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. TN061786]
gi|323132039|gb|ADX19469.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. ST4/74]
gi|332990528|gb|AEF09511.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. UK-1]
gi|380465134|gb|AFD60537.1| hypothetical protein UMN798_3997 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
gi|414013156|gb|EKS97053.1| hypothetical protein B571_18397 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|414014140|gb|EKS97993.1| hypothetical protein B576_18491 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|414014578|gb|EKS98419.1| hypothetical protein B572_18529 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|414027997|gb|EKT11199.1| hypothetical protein B577_17365 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|414029273|gb|EKT12434.1| hypothetical protein B573_17940 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|414031641|gb|EKT14688.1| hypothetical protein B574_18451 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|414042773|gb|EKT25304.1| hypothetical protein B575_18572 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|414043221|gb|EKT25734.1| hypothetical protein B578_18134 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|414047893|gb|EKT30155.1| hypothetical protein B579_18597 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|414056107|gb|EKT37949.1| hypothetical protein B580_18826 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|414062669|gb|EKT43947.1| hypothetical protein B581_21834 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm5]
Length = 651
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRRRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG D+ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLDVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|284172576|ref|YP_003405958.1| protein of unknown function DUF1680 [Haloterrigena turkmenica DSM
5511]
gi|284017336|gb|ADB63285.1| protein of unknown function DUF1680 [Haloterrigena turkmenica DSM
5511]
Length = 636
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 63/236 (26%), Positives = 96/236 (40%), Gaps = 22/236 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLPLAPGS 463
E+C + ++ LF + E YAD ER+L NG L G+ GTE Y PL
Sbjct: 339 ETCAAIGSVYWNQRLFELSGEAKYADLIERTLYNGFLAGVSLDGTE---FFYENPLESDG 395
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
R W T + CC + LG+ +Y + + +Y+ QY+ S +
Sbjct: 396 DHHRK--GWFTCA----CCPPNAARLLASLGEYVYSQRDS---AIYVNQYLGSSVTTAVD 446
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
V D + W +T G + L LRIP W S + T+NG+ + P
Sbjct: 447 GATVELSQDSSLPWSG----EVTVDVDADGASVPLRLRIPEWAES--STVTVNGESVETP 500
Query: 584 SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSI 639
S G +L + + W DD++ + T+ D A A+ GP V +I
Sbjct: 501 SEG-YLEIERVW-DDDRIELTFEQTVTRLEAHPDVAADAGRVALKRGPLVYCLEAI 554
>gi|291086404|ref|ZP_06355701.2| putative cytoplasmic protein [Citrobacter youngae ATCC 29220]
gi|291068139|gb|EFE06248.1| putative cytoplasmic protein [Citrobacter youngae ATCC 29220]
Length = 659
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 61/239 (25%), Positives = 99/239 (41%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G IY + +YI Y+
Sbjct: 396 VHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARILTSIGHYIYTPRQD---ALYINLYV 452
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ V+ ++ W + +VT+ S + +L LR+P W S+ +
Sbjct: 453 GNSMEVPVADGVLKLRISGNYPW--HEQVTIAIESP-QPVKHTLALRLPDWCSA--PQVL 507
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNGQ + +L +++TW D L++ LP+ +R A AI GP V
Sbjct: 508 LNGQPVAQDIRKGYLHISRTWQEGDTLSLTLPMPVRRVYGNPLVRHVAGKVAIQRGPLV 566
>gi|317048885|ref|YP_004116533.1| hypothetical protein Pat9b_2677 [Pantoea sp. At-9b]
gi|316950502|gb|ADU69977.1| protein of unknown function DUF1680 [Pantoea sp. At-9b]
Length = 651
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 93/217 (42%), Gaps = 17/217 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 PGSSKERSYHH---WGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
K S++H P W CC + LG IY E +YI Y
Sbjct: 388 V-HPKTLSFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTPRE---EALYINLY 443
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ + L+ G+ + +++ W VT+T S + +L LR+P W + +
Sbjct: 444 VGNSLEVPVGEQTLRLRINGNFPWQE--TVTITIDSP-QPVQHTLALRLPDW--CDAPQV 498
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
TLN + +L + ++WS D LT+ LP+ +R
Sbjct: 499 TLNDAAVASDIRKGYLHINRSWSEGDTLTLTLPMPVR 535
>gi|419730921|ref|ZP_14257856.1| hypothetical protein SEEH1579_14647 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|419735086|ref|ZP_14261970.1| hypothetical protein SEEH1563_04074 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|419740253|ref|ZP_14266986.1| hypothetical protein SEEH1573_01156 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|419743535|ref|ZP_14270200.1| hypothetical protein SEEH1566_22821 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|419746688|ref|ZP_14273264.1| hypothetical protein SEEH1565_23798 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|381293311|gb|EIC34483.1| hypothetical protein SEEH1579_14647 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|381295529|gb|EIC36640.1| hypothetical protein SEEH1573_01156 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|381295907|gb|EIC37016.1| hypothetical protein SEEH1563_04074 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|381312020|gb|EIC52830.1| hypothetical protein SEEH1566_22821 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|381320971|gb|EIC61499.1| hypothetical protein SEEH1565_23798 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
Length = 651
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPRSLKFNHIYEHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEVPVENGALKLRIGGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|168818493|ref|ZP_02830493.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Weltevreden str. HI_N05-537]
gi|409247363|ref|YP_006888062.1| hypothetical protein SENTW_3778 [Salmonella enterica subsp.
enterica serovar Weltevreden str. 2007-60-3289-1]
gi|205344524|gb|EDZ31288.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Weltevreden str. HI_N05-537]
gi|320088097|emb|CBY97859.1| hypothetical protein SENTW_3778 [Salmonella enterica subsp.
enterica serovar Weltevreden str. 2007-60-3289-1]
Length = 651
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W + AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPA--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|417521365|ref|ZP_12183078.1| secreted protein [Salmonella enterica subsp. enterica serovar
Uganda str. R8-3404]
gi|353641628|gb|EHC86306.1| secreted protein [Salmonella enterica subsp. enterica serovar
Uganda str. R8-3404]
Length = 651
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEVPVENGALKLRIGGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|167549076|ref|ZP_02342835.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
gi|205325554|gb|EDZ13393.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
Length = 651
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPRSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEVPVENGALKLRIGGNYPWHEQMKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|437834770|ref|ZP_20845077.1| hypothetical protein SEEERB17_013756 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SARB17]
gi|435300940|gb|ELO76997.1| hypothetical protein SEEERB17_013756 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SARB17]
Length = 651
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEVPVENGALKLRISGNYPWHEQVKIAIDSVQP---VHHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|416425586|ref|ZP_11692369.1| hypothetical protein SEEM315_05068 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|416430384|ref|ZP_11695001.1| hypothetical protein SEEM971_12506 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|416437565|ref|ZP_11698915.1| hypothetical protein SEEM973_00650 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|416443382|ref|ZP_11702995.1| hypothetical protein SEEM974_19090 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|416450281|ref|ZP_11707410.1| hypothetical protein SEEM201_09767 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|416460310|ref|ZP_11714693.1| hypothetical protein SEEM202_18350 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|416463475|ref|ZP_11715992.1| hypothetical protein SEEM954_09563 [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|416480379|ref|ZP_11722779.1| hypothetical protein SEEM054_17193 [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|416487797|ref|ZP_11725654.1| hypothetical protein SEEM675_20283 [Salmonella enterica subsp.
enterica serovar Montevideo str. OH_2009072675]
gi|416501897|ref|ZP_11732445.1| hypothetical protein SEEM965_19843 [Salmonella enterica subsp.
enterica serovar Montevideo str. CASC_09SCPH15965]
gi|416504577|ref|ZP_11733224.1| hypothetical protein SEEM031_12180 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|416517070|ref|ZP_11739340.1| hypothetical protein SEEM710_04917 [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|416543079|ref|ZP_11752034.1| hypothetical protein SEEM19N_14012 [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|416562276|ref|ZP_11762033.1| hypothetical protein SEEM42N_02114 [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
gi|416573654|ref|ZP_11767961.1| hypothetical protein SEEM41H_07852 [Salmonella enterica subsp.
enterica serovar Montevideo str. 4441 H]
gi|416578850|ref|ZP_11770886.1| hypothetical protein SEEM801_22092 [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|416584544|ref|ZP_11774245.1| hypothetical protein SEEM507_12719 [Salmonella enterica subsp.
enterica serovar Montevideo str. MD_MDA09249507]
gi|416589552|ref|ZP_11777137.1| hypothetical protein SEEM877_17791 [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|416607005|ref|ZP_11788219.1| hypothetical protein SEEM180_02276 [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|416611569|ref|ZP_11790943.1| hypothetical protein SEEM600_22079 [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|416624752|ref|ZP_11798278.1| hypothetical protein SEEM581_05149 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|416626628|ref|ZP_11798711.1| hypothetical protein SEEM501_13368 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|416644435|ref|ZP_11806741.1| hypothetical protein SEEM460_09301 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|416648059|ref|ZP_11808823.1| hypothetical protein SEEM020_002973 [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|416658271|ref|ZP_11814206.1| hypothetical protein SEEM6152_18571 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|416668027|ref|ZP_11818653.1| hypothetical protein SEEM0077_13550 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|416681176|ref|ZP_11823586.1| hypothetical protein SEEM0047_06329 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB102109-0047]
gi|416694001|ref|ZP_11826910.1| hypothetical protein SEEM0055_00612 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|416708995|ref|ZP_11833799.1| hypothetical protein SEEM0052_07867 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|416712890|ref|ZP_11836552.1| hypothetical protein SEEM3312_21482 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|416721065|ref|ZP_11842596.1| hypothetical protein SEEM5258_14872 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|416722793|ref|ZP_11843619.1| hypothetical protein SEEM1156_02237 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|416729527|ref|ZP_11848104.1| hypothetical protein SEEM9199_08226 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|416741866|ref|ZP_11855415.1| hypothetical protein SEEM8282_16962 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|416745954|ref|ZP_11857573.1| hypothetical protein SEEM8283_19676 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|416755322|ref|ZP_11861983.1| hypothetical protein SEEM8284_15210 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008284]
gi|416763125|ref|ZP_11866955.1| hypothetical protein SEEM8285_00470 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|416771775|ref|ZP_11872954.1| hypothetical protein SEEM8287_16934 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|418485126|ref|ZP_13054112.1| hypothetical protein SEEM906_06936 [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|418491104|ref|ZP_13057631.1| hypothetical protein SEEM5278_11244 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|418494659|ref|ZP_13061110.1| hypothetical protein SEEM5318_03413 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|418499800|ref|ZP_13066201.1| hypothetical protein SEEM5320_09457 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|418503417|ref|ZP_13069781.1| hypothetical protein SEEM5321_21740 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|418508996|ref|ZP_13075294.1| hypothetical protein SEEM5327_04049 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|418525130|ref|ZP_13091112.1| hypothetical protein SEEM8286_03017 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
gi|322613936|gb|EFY10872.1| hypothetical protein SEEM315_05068 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|322620305|gb|EFY17173.1| hypothetical protein SEEM971_12506 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|322625311|gb|EFY22138.1| hypothetical protein SEEM973_00650 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|322630022|gb|EFY26795.1| hypothetical protein SEEM974_19090 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|322634213|gb|EFY30948.1| hypothetical protein SEEM201_09767 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|322635886|gb|EFY32595.1| hypothetical protein SEEM202_18350 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|322643086|gb|EFY39661.1| hypothetical protein SEEM954_09563 [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|322644583|gb|EFY41119.1| hypothetical protein SEEM054_17193 [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|322650825|gb|EFY47217.1| hypothetical protein SEEM675_20283 [Salmonella enterica subsp.
enterica serovar Montevideo str. OH_2009072675]
gi|322653011|gb|EFY49346.1| hypothetical protein SEEM965_19843 [Salmonella enterica subsp.
enterica serovar Montevideo str. CASC_09SCPH15965]
gi|322659974|gb|EFY56214.1| hypothetical protein SEEM19N_14012 [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|322663307|gb|EFY59511.1| hypothetical protein SEEM801_22092 [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|322668793|gb|EFY64946.1| hypothetical protein SEEM507_12719 [Salmonella enterica subsp.
enterica serovar Montevideo str. MD_MDA09249507]
gi|322674404|gb|EFY70497.1| hypothetical protein SEEM877_17791 [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|322680894|gb|EFY76928.1| hypothetical protein SEEM180_02276 [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|322687170|gb|EFY83143.1| hypothetical protein SEEM600_22079 [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|323192129|gb|EFZ77362.1| hypothetical protein SEEM581_05149 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|323200633|gb|EFZ85707.1| hypothetical protein SEEM501_13368 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|323201343|gb|EFZ86409.1| hypothetical protein SEEM460_09301 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|323211827|gb|EFZ96659.1| hypothetical protein SEEM6152_18571 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|323216186|gb|EGA00914.1| hypothetical protein SEEM0077_13550 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|323220409|gb|EGA04863.1| hypothetical protein SEEM0047_06329 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB102109-0047]
gi|323226266|gb|EGA10481.1| hypothetical protein SEEM0055_00612 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|323228386|gb|EGA12517.1| hypothetical protein SEEM0052_07867 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|323234207|gb|EGA18295.1| hypothetical protein SEEM3312_21482 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|323237192|gb|EGA21259.1| hypothetical protein SEEM5258_14872 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|323244711|gb|EGA28715.1| hypothetical protein SEEM1156_02237 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|323249192|gb|EGA33110.1| hypothetical protein SEEM9199_08226 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|323250689|gb|EGA34569.1| hypothetical protein SEEM8282_16962 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|323257564|gb|EGA41251.1| hypothetical protein SEEM8283_19676 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|323262273|gb|EGA45834.1| hypothetical protein SEEM8284_15210 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008284]
gi|323266172|gb|EGA49663.1| hypothetical protein SEEM8285_00470 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|323268806|gb|EGA52264.1| hypothetical protein SEEM8287_16934 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|363557827|gb|EHL42031.1| hypothetical protein SEEM031_12180 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|363561441|gb|EHL45559.1| hypothetical protein SEEM710_04917 [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|363571665|gb|EHL55571.1| hypothetical protein SEEM41H_07852 [Salmonella enterica subsp.
enterica serovar Montevideo str. 4441 H]
gi|363573358|gb|EHL57244.1| hypothetical protein SEEM42N_02114 [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
gi|366056585|gb|EHN20901.1| hypothetical protein SEEM906_06936 [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|366061420|gb|EHN25666.1| hypothetical protein SEEM5318_03413 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|366063348|gb|EHN27567.1| hypothetical protein SEEM5278_11244 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|366069988|gb|EHN34105.1| hypothetical protein SEEM5320_09457 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|366073016|gb|EHN37095.1| hypothetical protein SEEM5321_21740 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|366078850|gb|EHN42847.1| hypothetical protein SEEM5327_04049 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|366830119|gb|EHN56993.1| hypothetical protein SEEM020_002973 [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|372206701|gb|EHP20203.1| hypothetical protein SEEM8286_03017 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
Length = 651
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEVPVENGALKLRISGNYPWHEQVKIAI---DSVQPVHHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|224585478|ref|YP_002639277.1| hypothetical protein SPC_3759 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gi|224470006|gb|ACN47836.1| hypothetical protein SPC_3759 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
Length = 651
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEVPVENGALKLRIGGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|421844899|ref|ZP_16278055.1| hypothetical protein D186_07681 [Citrobacter freundii ATCC 8090 =
MTCC 1658]
gi|411773762|gb|EKS57290.1| hypothetical protein D186_07681 [Citrobacter freundii ATCC 8090 =
MTCC 1658]
gi|455645502|gb|EMF24562.1| hypothetical protein H262_06439 [Citrobacter freundii GTC 09479]
Length = 651
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 97/218 (44%), Gaps = 19/218 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTPRQD---ALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWD-PYL-RVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
+ ++ VVN + +S D P+ +V +T S S + +L LR+P W S+ +
Sbjct: 445 GNSMEVP----VVNGSLKLRISGDYPWHEQVKITIESPRS-VYHTLALRLPDWCSA--PQ 497
Query: 573 ATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNGQ + +L +++TW D L++ LP+ +R
Sbjct: 498 VLLNGQPVEQDIRKGYLHISRTWQEGDTLSLTLPMPVR 535
>gi|416597563|ref|ZP_11782144.1| hypothetical protein SEEM867_21594 [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
gi|322678388|gb|EFY74449.1| hypothetical protein SEEM867_21594 [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
Length = 651
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEVPVENGALKLRISGNYPWHEQVKIAI---DSVQPVHHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|168241855|ref|ZP_02666787.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL486]
gi|194451278|ref|YP_002047708.1| hypothetical protein SeHA_C4002 [Salmonella enterica subsp.
enterica serovar Heidelberg str. SL476]
gi|386593352|ref|YP_006089752.1| hypothetical protein SU5_04156 [Salmonella enterica subsp. enterica
serovar Heidelberg str. B182]
gi|421571246|ref|ZP_16016925.1| hypothetical protein CFSAN00322_13684 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00322]
gi|421575202|ref|ZP_16020815.1| hypothetical protein CFSAN00325_10357 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00325]
gi|421579160|ref|ZP_16024730.1| hypothetical protein CFSAN00326_07107 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00326]
gi|421586317|ref|ZP_16031800.1| hypothetical protein CFSAN00328_20048 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00328]
gi|194409582|gb|ACF69801.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL476]
gi|205339076|gb|EDZ25840.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL486]
gi|383800393|gb|AFH47475.1| DUF1680 Glycosyl hydrolase [Salmonella enterica subsp. enterica
serovar Heidelberg str. B182]
gi|402521555|gb|EJW28891.1| hypothetical protein CFSAN00322_13684 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00322]
gi|402522242|gb|EJW29566.1| hypothetical protein CFSAN00325_10357 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00325]
gi|402523131|gb|EJW30450.1| hypothetical protein CFSAN00326_07107 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00326]
gi|402529042|gb|EJW36291.1| hypothetical protein CFSAN00328_20048 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00328]
Length = 651
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPRSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEVPVENGALKLRIGGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|200389015|ref|ZP_03215627.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Virchow str. SL491]
gi|199606113|gb|EDZ04658.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Virchow str. SL491]
Length = 651
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPRSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEVPVENGALKLRIGGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|418511390|ref|ZP_13077652.1| hypothetical protein SEEPO729_11665 [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
gi|366084797|gb|EHN48695.1| hypothetical protein SEEPO729_11665 [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
Length = 651
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|417376625|ref|ZP_12145767.1| secreted protein [Salmonella enterica subsp. enterica serovar
Inverness str. R8-3668]
gi|353592514|gb|EHC50495.1| secreted protein [Salmonella enterica subsp. enterica serovar
Inverness str. R8-3668]
Length = 651
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|417514299|ref|ZP_12178139.1| secreted protein [Salmonella enterica subsp. enterica serovar
Senftenberg str. A4-543]
gi|353634280|gb|EHC80885.1| secreted protein [Salmonella enterica subsp. enterica serovar
Senftenberg str. A4-543]
Length = 651
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|197247483|ref|YP_002148608.1| hypothetical protein SeAg_B3893 [Salmonella enterica subsp.
enterica serovar Agona str. SL483]
gi|440762586|ref|ZP_20941641.1| hypothetical protein F434_06500 [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|440769697|ref|ZP_20948654.1| hypothetical protein F514_18649 [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|440774815|ref|ZP_20953701.1| hypothetical protein F515_20523 [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|197211186|gb|ACH48583.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Agona str. SL483]
gi|436412179|gb|ELP10122.1| hypothetical protein F515_20523 [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|436414203|gb|ELP12135.1| hypothetical protein F514_18649 [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|436422862|gb|ELP20686.1| hypothetical protein F434_06500 [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
Length = 651
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|395228933|ref|ZP_10407251.1| cytoplasmic protein [Citrobacter sp. A1]
gi|424732388|ref|ZP_18160966.1| sugar (glycoside-pentoside-hexuronide) transporter [Citrobacter sp.
L17]
gi|394717639|gb|EJF23323.1| cytoplasmic protein [Citrobacter sp. A1]
gi|422893047|gb|EKU32896.1| sugar (glycoside-pentoside-hexuronide) transporter [Citrobacter sp.
L17]
Length = 651
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 97/218 (44%), Gaps = 19/218 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G IY + +YI Y+
Sbjct: 388 VHPKSLKLNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTPRQD---ALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWD-PYL-RVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
+ ++ VVN + +S D P+ +V +T S S + +L LR+P W S+ +
Sbjct: 445 GNSMEVP----VVNGSLKLRISGDYPWHEQVKITIESPQS-VYHTLALRLPDWCSA--PQ 497
Query: 573 ATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNGQ + +L +++TW D L++ LP+ +R
Sbjct: 498 VLLNGQPIEQDIRKGYLHISRTWQEGDTLSLTLPMPVR 535
>gi|417353052|ref|ZP_12130092.1| secreted protein [Salmonella enterica subsp. enterica serovar
Gaminara str. A4-567]
gi|353564767|gb|EHC30749.1| secreted protein [Salmonella enterica subsp. enterica serovar
Gaminara str. A4-567]
Length = 651
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAI---DSVQPVRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|204928680|ref|ZP_03219879.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Javiana str. GA_MM04042433]
gi|452122524|ref|YP_007472772.1| hypothetical protein CFSAN001992_15250 [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
gi|204322113|gb|EDZ07311.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Javiana str. GA_MM04042433]
gi|451911528|gb|AGF83334.1| hypothetical protein CFSAN001992_15250 [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
Length = 651
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAI---DSVQPVRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|416529897|ref|ZP_11744588.1| hypothetical protein SEEM010_12041 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
gi|416538915|ref|ZP_11749679.1| hypothetical protein SEEM030_12964 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB30]
gi|416553241|ref|ZP_11757602.1| hypothetical protein SEEM29N_15108 [Salmonella enterica subsp.
enterica serovar Montevideo str. 29N]
gi|417470705|ref|ZP_12166835.1| secreted protein [Salmonella enterica subsp. enterica serovar
Montevideo str. S5-403]
gi|353624652|gb|EHC73633.1| secreted protein [Salmonella enterica subsp. enterica serovar
Montevideo str. S5-403]
gi|363551713|gb|EHL36026.1| hypothetical protein SEEM010_12041 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
gi|363561277|gb|EHL45405.1| hypothetical protein SEEM030_12964 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB30]
gi|363563119|gb|EHL47199.1| hypothetical protein SEEM29N_15108 [Salmonella enterica subsp.
enterica serovar Montevideo str. 29N]
Length = 651
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSIYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIY---TPRADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAI---DSVQPVRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|417344582|ref|ZP_12124897.1| secreted protein [Salmonella enterica subsp. enterica serovar
Baildon str. R6-199]
gi|417542477|ref|ZP_12193911.1| secreted protein [Salmonella enterica subsp. enterica serovar
Wandsworth str. A4-580]
gi|353658599|gb|EHC98734.1| secreted protein [Salmonella enterica subsp. enterica serovar
Wandsworth str. A4-580]
gi|357953998|gb|EHJ80341.1| secreted protein [Salmonella enterica subsp. enterica serovar
Baildon str. R6-199]
Length = 651
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPRSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RAHALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEVPVENGALKLRIGGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|168232522|ref|ZP_02657580.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CDC 191]
gi|194471797|ref|ZP_03077781.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
gi|194458161|gb|EDX47000.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
gi|205333286|gb|EDZ20050.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CDC 191]
Length = 651
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|417386570|ref|ZP_12151238.1| secreted protein [Salmonella enterica subsp. enterica serovar
Johannesburg str. S5-703]
gi|353602920|gb|EHC58138.1| secreted protein [Salmonella enterica subsp. enterica serovar
Johannesburg str. S5-703]
Length = 651
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSIYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|378580796|ref|ZP_09829449.1| secreted protein [Pantoea stewartii subsp. stewartii DC283]
gi|377816535|gb|EHT99637.1| secreted protein [Pantoea stewartii subsp. stewartii DC283]
Length = 651
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 55/216 (25%), Positives = 92/216 (42%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S + P W CC + LG IY E ++I YI
Sbjct: 388 VHPKSLPFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTPRED---ALFINLYI 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+R++ G + ++ + W VT+T S + +L LR+P W +S + T
Sbjct: 445 GNRVEIPVGNQTLGLRISGNLPWQE--TVTITIDST-QPVNHALALRLPDWCAS--PQIT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
NG ++ + +L + + W D +T+ LP+ +R
Sbjct: 500 CNGTEVNEAARKGYLYLNRHWQEGDTVTLTLPMPVR 535
>gi|417337268|ref|ZP_12119473.1| secreted protein [Salmonella enterica subsp. enterica serovar
Alachua str. R6-377]
gi|353565179|gb|EHC31033.1| secreted protein [Salmonella enterica subsp. enterica serovar
Alachua str. R6-377]
Length = 651
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPRSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|375003535|ref|ZP_09727874.1| hypothetical protein SEENIN0B_03911 [Salmonella enterica subsp.
enterica serovar Infantis str. SARB27]
gi|353074450|gb|EHB40211.1| hypothetical protein SEENIN0B_03911 [Salmonella enterica subsp.
enterica serovar Infantis str. SARB27]
Length = 651
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSVEIPVENGALKLRIGGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLALPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|255034442|ref|YP_003085063.1| hypothetical protein Dfer_0635 [Dyadobacter fermentans DSM 18053]
gi|254947198|gb|ACT91898.1| protein of unknown function DUF1680 [Dyadobacter fermentans DSM
18053]
Length = 656
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/276 (23%), Positives = 119/276 (43%), Gaps = 36/276 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + M+ ++ + T + Y D ERSL NG L G+ + Y PL+ +
Sbjct: 335 ETCASVGMVFWNQRMNALTGDAKYIDVLERSLYNGALDGLSLTGDR--FFYGNPLSSIGN 392
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
RS +GT CC + +GD IY + +GK +++ ++ S ++ G+
Sbjct: 393 NARS-AWFGTA-----CCPSNIARLVASVGDYIYGKADGK---IWVNLFVGSNTTFQVGK 443
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS--------------NG 570
V ++ W+ +R+ +T K + +LN+RIP W + NG
Sbjct: 444 TAVPLQMSTDYPWNGSIRIKVTPPQK---VKYALNVRIPGWAAGTPVPGGLYNFAAAGNG 500
Query: 571 -AKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
+ LNG+ + S + + +TW + D++ ++LP+ +R + + AI
Sbjct: 501 RVEVLLNGKSVNYQSDKGYAVIDRTWQNGDEIEVRLPMDVRQVKARAEVKADEGRIAIQR 560
Query: 630 GPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQ 665
GP V ++A + + + P A+Y Q
Sbjct: 561 GPIVYCVEG------ADNAGEVWNLLVPANAAYTIQ 590
>gi|417394187|ref|ZP_12156450.1| secreted protein [Salmonella enterica subsp. enterica serovar
Minnesota str. A4-603]
gi|353606439|gb|EHC60665.1| secreted protein [Salmonella enterica subsp. enterica serovar
Minnesota str. A4-603]
Length = 651
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSVEIPVENGALKLRIGGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|168235286|ref|ZP_02660344.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. SL480]
gi|194737873|ref|YP_002116613.1| hypothetical protein SeSA_A3877 [Salmonella enterica subsp.
enterica serovar Schwarzengrund str. CVM19633]
gi|194713375|gb|ACF92596.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. CVM19633]
gi|197291306|gb|EDY30658.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. SL480]
Length = 651
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSVEIPVENGALKLRIGGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|417361434|ref|ZP_12135327.1| secreted protein [Salmonella enterica subsp. enterica serovar Give
str. S5-487]
gi|353584072|gb|EHC44282.1| secreted protein [Salmonella enterica subsp. enterica serovar Give
str. S5-487]
Length = 651
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSIYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSVEIPVENGALKLRIGGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|430751377|ref|YP_007214285.1| hypothetical protein Theco_3232 [Thermobacillus composti KWC4]
gi|430735342|gb|AGA59287.1| hypothetical protein Theco_3232 [Thermobacillus composti KWC4]
Length = 672
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 125/282 (44%), Gaps = 23/282 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ ESC + ++ S+ + + + Y D ER+L N L G+ + + + L +
Sbjct: 336 DTAYAESCASIGLIMFSKRMLQIEAKGEYGDVMERALYNTELAGMSQDGKRYFYVNPLEV 395
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + + H P W CC + LG +Y + + + VY YI
Sbjct: 396 WPEACRSNPGKHHVKPVRQRWFGCACCPPNIARLIASLGGYVY-DVDAESGIVYTHLYIG 454
Query: 516 --SRLD-------WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT-SLNLRIPTW 565
+RL+ G +VV Q+ + WD V LT + + GLT +L LR+P W
Sbjct: 455 GEARLNVGKEGGGHDGGTVVVRQETN--YPWDGA--VMLTVTPEAGGLTAFTLALRLPGW 510
Query: 566 TSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
+ ++ + +NG+ + + + + W D + ++L +T+R A + + A
Sbjct: 511 SRTS--EIAVNGERIAPEVRDGYAYICRDWQPGDTVELKLDMTIRLLAARPEVRADAGRV 568
Query: 626 AILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLI 667
AI GP V S + SA ++ D TP+ A+Y++QL+
Sbjct: 569 AIQRGPLVYCLESADNPGGPLSALAI-DTQTPLTATYDAQLL 609
>gi|435854425|ref|YP_007315744.1| hypothetical protein Halha_1714 [Halobacteroides halobius DSM 5150]
gi|433670836|gb|AGB41651.1| hypothetical protein Halha_1714 [Halobacteroides halobius DSM 5150]
Length = 647
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 99/218 (45%), Gaps = 17/218 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C ++ + + + YAD ER+L NGVL G+ + E + L +
Sbjct: 327 DTAYAETCAAIGLMFWAHRMLHLDLDSQYADVMERALYNGVLSGMSQDGEKFFYVNPLEV 386
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYI 514
P + +ER P+ W CC + +G+ IY +E+ Y +Y
Sbjct: 387 WPEACEERKDKEHVKPTRQKWFGCACCPPNIARLLASIGEYIYSTDEQAAYIHLYTASVT 446
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+D S + ++Q+ D WD + +T+ + + +L LRIP W S A+
Sbjct: 447 EFEIDGTS--VELDQETD--YPWDENITITVNPREE---VEFTLALRIPDWCES--AELK 497
Query: 575 LNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLR 610
+NG+ L L S ++ V ++WS D++ + L + ++
Sbjct: 498 VNGRTLELDSIIDNGYVEVNRSWSKGDQIELVLAMPVK 535
>gi|423105419|ref|ZP_17093121.1| hypothetical protein HMPREF9686_04025 [Klebsiella oxytoca 10-5242]
gi|376380736|gb|EHS93479.1| hypothetical protein HMPREF9686_04025 [Klebsiella oxytoca 10-5242]
Length = 653
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI YI
Sbjct: 388 VHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTPHDD---ALYINLYI 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + SS + +L LR+P W + + T
Sbjct: 445 GNSVEIPVGNEALRLRISGNYPWQEQVKIVIDSSSP---VNHTLALRLPDWC--DKPQVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG + +L ++ W D L + LP+ +R
Sbjct: 500 LNGAPVTQDVRKGYLHISHLWQEGDTLQLTLPMPVR 535
>gi|402843427|ref|ZP_10891823.1| putative glycosyhydrolase [Klebsiella sp. OBRC7]
gi|402277059|gb|EJU26151.1| putative glycosyhydrolase [Klebsiella sp. OBRC7]
Length = 653
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI YI
Sbjct: 388 VHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTPHDD---ALYINLYI 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + SS + +L LR+P W + + T
Sbjct: 445 GNSVEIPVGNEALRLRISGNYPWQEQVKIVIDSSSP---VNHTLALRLPDWC--DKPQVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG + +L ++ W D L + LP+ +R
Sbjct: 500 LNGAPVTQDVRKGYLHISHLWREGDTLQLTLPMPVR 535
>gi|237728888|ref|ZP_04559369.1| conserved hypothetical protein [Citrobacter sp. 30_2]
gi|226909510|gb|EEH95428.1| conserved hypothetical protein [Citrobacter sp. 30_2]
Length = 651
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/216 (23%), Positives = 90/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTPRQD---ALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W ++ +
Sbjct: 445 GNSMEVPVADGSLKLRISGDYPWHEQVKIAI---ESPQSIYHTLALRLPDWCTA--PQVL 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNGQ + +L +++TW D L++ LP+ +R
Sbjct: 500 LNGQPVEQDIRKGYLHISRTWQEGDTLSLTLPMPVR 535
>gi|365102501|ref|ZP_09332802.1| hypothetical protein HMPREF9428_03810 [Citrobacter freundii
4_7_47CFAA]
gi|363646229|gb|EHL85477.1| hypothetical protein HMPREF9428_03810 [Citrobacter freundii
4_7_47CFAA]
Length = 651
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/216 (23%), Positives = 90/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTPRQD---ALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W ++ +
Sbjct: 445 GNSMEVPVADGSLKLRISGDYPWHEQVKIAI---ESPQSIYHTLALRLPDWCTA--PQVL 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNGQ + +L +++TW D L++ LP+ +R
Sbjct: 500 LNGQPVEQDIRKGYLHISRTWQEGDTLSLTLPMPVR 535
>gi|448408500|ref|ZP_21574295.1| hypothetical protein C475_08191 [Halosimplex carlsbadense 2-9-1]
gi|445674355|gb|ELZ26899.1| hypothetical protein C475_08191 [Halosimplex carlsbadense 2-9-1]
Length = 637
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/238 (26%), Positives = 101/238 (42%), Gaps = 27/238 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + ++ LF + AYAD ER+L NG L G+ G + Y+ PLA
Sbjct: 338 ETCAAVGSVFWNQRLFELEPDPAYADLIERTLYNGFLAGV--GMDGEEFFYVNPLASDGD 395
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
RS W T + CC F+ LG +Y G+ +Y+ QY+ S L
Sbjct: 396 HHRS--GWFTCA----CCPPNAARLFASLGQYVYSTTGGE---LYVTQYVGSDLSTTVEG 446
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS 584
V + + WD V + + G+ +NLRIP W ++ A T++G ++
Sbjct: 447 TAVELDQESALPWDG--EVAIEVDADGA---VPVNLRIPEW--ADEATVTVDGDEVSHDG 499
Query: 585 PGNFLSVTKTWSS---DDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSI 639
G F+ V + W+ + +Q L A++ D A A+ GP V ++
Sbjct: 500 SG-FVRVEREWNGQWVELTFEMQSELVAAHPAVEAD----AGRVAVRRGPLVYCAEAV 552
>gi|56415571|ref|YP_152646.1| hypothetical protein SPA3530 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|197364498|ref|YP_002144135.1| hypothetical protein SSPA3296 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
gi|56129828|gb|AAV79334.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|197095975|emb|CAR61560.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
Length = 651
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSLEVPVENGALKLRIGGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +++ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTISLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|291618364|ref|YP_003521106.1| hypothetical Protein PANA_2811 [Pantoea ananatis LMG 20103]
gi|291153394|gb|ADD77978.1| Hypothetical Protein PANA_2811 [Pantoea ananatis LMG 20103]
Length = 659
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 101/483 (20%), Positives = 179/483 (37%), Gaps = 75/483 (15%)
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQF--DRLEALI 239
V +L A A + L++ V+ ++A Q E GYL+ + T + DR L
Sbjct: 82 VAKWLEAVAWSLCQKPDAELEKTADEVIELIAAAQCE--DGYLNTYFTVKAPQDRWTNLA 139
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
Y H I AG+ A R +V + + +V + H +
Sbjct: 140 ECHELYCAGHMIEAGVAFY-----QATGKRRLLEVVCRLADHIDSVFGPEEHQLHGYPGH 194
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLALQADDISGFHSNTHIP 354
E + L +L+ +TQ P++L L + F +P F + + S +H T+ P
Sbjct: 195 PE---IELALMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWH--TYGP 249
Query: 355 IVIGSQMRY----EVTGDQLHKEGHQLESS--GTNIGHF-------NFKSDPKRLASNL- 400
+ Y + +Q H GH + T + H + D RL N+
Sbjct: 250 AWMVKDKAYSQAHQPLAEQQHAVGHAVRFVYLMTGVAHLARLSQDEGKRQDCLRLWHNMA 309
Query: 401 ---------------------------DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYY 433
D+ ESC + ++ +R + + YAD
Sbjct: 310 QRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAESCASIGLMMFARRMLEMEADSQYADVM 369
Query: 434 ERSLTNGVLGIQRGTEPGVMIYLLPL--APGSSKERSYHHWGTPSDSFW----CCYGTGI 487
ER+L N VLG + Y+ PL P + + P W CC
Sbjct: 370 ERALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIA 428
Query: 488 ESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTF 547
+ LG IY E ++I Y+ +R+D G + ++ W+ + +++
Sbjct: 429 RLLTSLGHYIYTPHEN---ALFINLYVGNRVDVPVGDRTLGIRISGNFPWEETVTISVDV 485
Query: 548 SSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
+ + +L LR+P W + + + NG+ + + +L + + W D LT+ LP+
Sbjct: 486 TQP---VKHTLALRLPDWCEA--PQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPM 540
Query: 608 TLR 610
+R
Sbjct: 541 PVR 543
>gi|418817745|ref|ZP_13373230.1| hypothetical protein SEEN538_23719 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
gi|392787738|gb|EJA44277.1| hypothetical protein SEEN538_23719 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
Length = 651
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 94/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLNFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAI---DSVQPVRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|194444786|ref|YP_002042927.1| hypothetical protein SNSL254_A3957 [Salmonella enterica subsp.
enterica serovar Newport str. SL254]
gi|418790980|ref|ZP_13346748.1| hypothetical protein SEEN447_08030 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19447]
gi|418795399|ref|ZP_13351104.1| hypothetical protein SEEN449_11603 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19449]
gi|418798645|ref|ZP_13354319.1| hypothetical protein SEEN567_19023 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19567]
gi|418806870|ref|ZP_13362440.1| hypothetical protein SEEN550_01554 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21550]
gi|418811033|ref|ZP_13366570.1| hypothetical protein SEEN513_20561 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22513]
gi|418819963|ref|ZP_13375400.1| hypothetical protein SEEN425_16720 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|418824033|ref|ZP_13379418.1| hypothetical protein SEEN462_22404 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22462]
gi|418832501|ref|ZP_13387442.1| hypothetical protein SEEN486_20353 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N18486]
gi|418834359|ref|ZP_13389267.1| hypothetical protein SEEN543_19458 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N1543]
gi|418839823|ref|ZP_13394654.1| hypothetical protein SEEN554_21236 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21554]
gi|418851856|ref|ZP_13406562.1| hypothetical protein SEEN978_01684 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 37978]
gi|418853203|ref|ZP_13407898.1| hypothetical protein SEEN593_02318 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
gi|194403449|gb|ACF63671.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL254]
gi|392756265|gb|EJA13162.1| hypothetical protein SEEN447_08030 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19447]
gi|392758783|gb|EJA15648.1| hypothetical protein SEEN449_11603 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19449]
gi|392766123|gb|EJA22905.1| hypothetical protein SEEN567_19023 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19567]
gi|392780719|gb|EJA37371.1| hypothetical protein SEEN513_20561 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22513]
gi|392782028|gb|EJA38666.1| hypothetical protein SEEN550_01554 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21550]
gi|392793888|gb|EJA50323.1| hypothetical protein SEEN425_16720 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|392797650|gb|EJA53956.1| hypothetical protein SEEN486_20353 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N18486]
gi|392805302|gb|EJA61433.1| hypothetical protein SEEN543_19458 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N1543]
gi|392811613|gb|EJA67613.1| hypothetical protein SEEN554_21236 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21554]
gi|392816063|gb|EJA71993.1| hypothetical protein SEEN978_01684 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 37978]
gi|392825252|gb|EJA81005.1| hypothetical protein SEEN462_22404 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22462]
gi|392827750|gb|EJA83452.1| hypothetical protein SEEN593_02318 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
Length = 651
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 94/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLNFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +T+ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|386016685|ref|YP_005934975.1| hypothetical protein PAJ_2099 [Pantoea ananatis AJ13355]
gi|327394757|dbj|BAK12179.1| hypothetical protein PAJ_2099 [Pantoea ananatis AJ13355]
Length = 659
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 101/483 (20%), Positives = 179/483 (37%), Gaps = 75/483 (15%)
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQF--DRLEALI 239
V +L A A + L++ V+ ++A Q E GYL+ + T + DR L
Sbjct: 82 VAKWLEAVAWSLCQKPDAELEKTADEVIELIAAAQCE--DGYLNTYFTVKAPQDRWTNLA 139
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
Y H I AG+ A R +V + + +V + H +
Sbjct: 140 ECHELYCAGHMIEAGVAFY-----QATGKRRLLEVVCRLADHIDSVFGPEEHQLHGYPGH 194
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLALQADDISGFHSNTHIP 354
E + L +L+ +TQ P++L L + F +P F + + S +H T+ P
Sbjct: 195 PE---IELALMRLYEVTQQPRYLALVNTFVSQRGTQPHFYDIEYEKRGQTSYWH--TYGP 249
Query: 355 IVIGSQMRY----EVTGDQLHKEGHQLESS--GTNIGHF-------NFKSDPKRLASNL- 400
+ Y + +Q H GH + T + H + D RL N+
Sbjct: 250 AWMVKDKAYSQAHQPLAEQQHAVGHAVRFVYLMTGVAHLARLSQDEGKRQDCLRLWHNMA 309
Query: 401 ---------------------------DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYY 433
D+ ESC + ++ +R + + YAD
Sbjct: 310 QRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAESCASIGLMMFARRMLEMEADSQYADVM 369
Query: 434 ERSLTNGVLGIQRGTEPGVMIYLLPL--APGSSKERSYHHWGTPSDSFW----CCYGTGI 487
ER+L N VLG + Y+ PL P + + P W CC
Sbjct: 370 ERALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIA 428
Query: 488 ESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTF 547
+ LG IY E ++I Y+ +R+D G + ++ W+ + +++
Sbjct: 429 RLLTSLGHYIYTPHEN---ALFINLYVGNRVDVPVGDRTLGIRISGNFPWEETVTISVDV 485
Query: 548 SSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
+ + +L LR+P W + + + NG+ + + +L + + W D LT+ LP+
Sbjct: 486 TQP---VKHTLALRLPDWCEA--PQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPM 540
Query: 608 TLR 610
+R
Sbjct: 541 PVR 543
>gi|16762630|ref|NP_458247.1| hypothetical protein STY4117 [Salmonella enterica subsp. enterica
serovar Typhi str. CT18]
gi|29144119|ref|NP_807461.1| hypothetical protein t3840 [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|213052815|ref|ZP_03345693.1| hypothetical protein Salmoneentericaenterica_07808 [Salmonella
enterica subsp. enterica serovar Typhi str. E00-7866]
gi|213428126|ref|ZP_03360876.1| hypothetical protein SentesTyphi_22630 [Salmonella enterica subsp.
enterica serovar Typhi str. E02-1180]
gi|213650623|ref|ZP_03380676.1| hypothetical protein SentesTy_27330 [Salmonella enterica subsp.
enterica serovar Typhi str. J185]
gi|213854603|ref|ZP_03382843.1| hypothetical protein SentesT_11074 [Salmonella enterica subsp.
enterica serovar Typhi str. M223]
gi|289826027|ref|ZP_06545185.1| hypothetical protein Salmonellentericaenterica_11725 [Salmonella
enterica subsp. enterica serovar Typhi str. E98-3139]
gi|378962007|ref|YP_005219493.1| hypothetical protein STBHUCCB_40530 [Salmonella enterica subsp.
enterica serovar Typhi str. P-stx-12]
gi|25333173|pir||AG0977 conserved hypothetical protein STY4117 [imported] - Salmonella
enterica subsp. enterica serovar Typhi (strain CT18)
gi|16504936|emb|CAD07947.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi]
gi|29139756|gb|AAO71321.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|374355879|gb|AEZ47640.1| hypothetical protein STBHUCCB_40530 [Salmonella enterica subsp.
enterica serovar Typhi str. P-stx-12]
Length = 651
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYIYTP---RADALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 445 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +++ LP+ +R A AI GP V
Sbjct: 500 LNGLEVEQDIRKGYLHIRRTWQEGDTISLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 558
>gi|417432692|ref|ZP_12161408.1| secreted protein [Salmonella enterica subsp. enterica serovar
Mississippi str. A4-633]
gi|353614176|gb|EHC66091.1| secreted protein [Salmonella enterica subsp. enterica serovar
Mississippi str. A4-633]
Length = 352
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ P+
Sbjct: 30 DSVYAESCASIGLMMFARQMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPME 88
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G IY + +YI Y+
Sbjct: 89 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYIYTP---RADALYINMYV 145
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L+ + ++ W +++ + + +L LR+P W AK T
Sbjct: 146 GNSLEVPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 200
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +++ LP+ +R A AI GP V
Sbjct: 201 LNGLEVEQDIRKGYLHIRRTWQEGDTISLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 259
>gi|213418442|ref|ZP_03351508.1| hypothetical protein Salmonentericaenterica_11358 [Salmonella
enterica subsp. enterica serovar Typhi str. E01-6750]
Length = 385
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 63 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 121
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G IY + +YI Y+
Sbjct: 122 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYIYTP---RADALYINMYV 178
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 179 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 233
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +++ LP+ +R A AI GP V
Sbjct: 234 LNGLEVEQDIRKGYLHIRRTWQEGDTISLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 292
>gi|336239737|ref|XP_003342727.1| hypothetical protein SMAC_10375 [Sordaria macrospora k-hell]
Length = 159
Score = 63.9 bits (154), Expect = 3e-07, Method: Composition-based stats.
Identities = 33/87 (37%), Positives = 51/87 (58%), Gaps = 2/87 (2%)
Query: 138 NLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTH 197
N YLL LD ++L+ NF +A LPAP YGGWE + GH +GH+LSA AL A++
Sbjct: 71 NRRYLLDLDPERLLHNFYISAGLPAPKPVYGGWEAQG--IAGHSLGHWLSACALTVANSG 128
Query: 198 NESLKEKMSAVVSALSACQKEIGSGYL 224
+ ++ ++ + ++ Q G GY+
Sbjct: 129 DAAIAARLDHALKEMARIQAAHGDGYV 155
>gi|386078433|ref|YP_005991958.1| hypothetical protein [Pantoea ananatis PA13]
gi|354987614|gb|AER31738.1| hypothetical protein PAGR_g1212 [Pantoea ananatis PA13]
Length = 651
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 101/483 (20%), Positives = 179/483 (37%), Gaps = 75/483 (15%)
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQF--DRLEALI 239
V +L A A + L++ V+ ++A Q E GYL+ + T + DR L
Sbjct: 74 VAKWLEAVAWSLCQKPDAELEKTADEVIELIAAAQCE--DGYLNTYFTVKAPQDRWTNLA 131
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
Y H I AG+ A R +V + + +V + H +
Sbjct: 132 ECHELYCAGHMIEAGVAFY-----QATGKRRLLEVVCRLADHIDSVFGPEEHQLHGYPGH 186
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLALQADDISGFHSNTHIP 354
E + L +L+ +TQ P++L L + F +P F + + S +H T+ P
Sbjct: 187 PE---IELALMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWH--TYGP 241
Query: 355 IVIGSQMRY----EVTGDQLHKEGHQLESS--GTNIGHF-------NFKSDPKRLASNL- 400
+ Y + +Q H GH + T + H + D RL N+
Sbjct: 242 AWMVKDKAYSQAHQPLAEQQHAVGHAVRFVYLMTGVAHLARLSQDEGKRQDCLRLWHNMA 301
Query: 401 ---------------------------DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYY 433
D+ ESC + ++ +R + + YAD
Sbjct: 302 QRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAESCASIGLMMFARRMLEMEADSQYADVM 361
Query: 434 ERSLTNGVLGIQRGTEPGVMIYLLPL--APGSSKERSYHHWGTPSDSFW----CCYGTGI 487
ER+L N VLG + Y+ PL P + + P W CC
Sbjct: 362 ERALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIA 420
Query: 488 ESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTF 547
+ LG IY E ++I Y+ +R+D G + ++ W+ + +++
Sbjct: 421 RLLTSLGHYIYTPHEN---ALFINLYVGNRVDVPVGDRTLGIRISGNFPWEETVTISVDV 477
Query: 548 SSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
+ + +L LR+P W + + + NG+ + + +L + + W D LT+ LP+
Sbjct: 478 TQP---VKHTLALRLPDWCEA--PQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPM 532
Query: 608 TLR 610
+R
Sbjct: 533 PVR 535
>gi|213582277|ref|ZP_03364103.1| hypothetical protein SentesTyph_14169 [Salmonella enterica subsp.
enterica serovar Typhi str. E98-0664]
Length = 380
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 58 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 116
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G IY + +YI Y+
Sbjct: 117 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYIYTP---RADALYINMYV 173
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +++ + + +L LR+P W AK T
Sbjct: 174 GNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRLPDWCPE--AKVT 228
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG ++ +L + +TW D +++ LP+ +R A AI GP V
Sbjct: 229 LNGLEVEQDIRKGYLHIRRTWQEGDTISLTLPMPVRRVYGNPLARHVAGKVAIQRGPLV 287
>gi|347530932|ref|YP_004837695.1| hypothetical protein RHOM_03205 [Roseburia hominis A2-183]
gi|345501080|gb|AEN95763.1| hypothetical protein RHOM_03205 [Roseburia hominis A2-183]
Length = 646
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 101/240 (42%), Gaps = 19/240 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D N E+C + ++ +R++ + K YAD ER+L NG++ G+Q + + L +
Sbjct: 331 DMNYAETCASIGLVFFARNMLKTEKNGRYADVMERALYNGIISGMQLDGKRFFYVNPLEV 390
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
PG S E + P W CC + + LG + E+E VY ++
Sbjct: 391 NPGVSGEIFGYKHVIPERPGWYACACCPPNLVRMVTSLGKYAWDEDE---TAVYSHLFLG 447
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
I +V+ W+ VT S+K L T L + IP + + T+
Sbjct: 448 QEAALGKADI----RVESAYPWEG--SVTYHVSAKIDELFT-LAIHIPAYVKD--LRVTV 498
Query: 576 NGQ--DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
NG+ D +L +++ W SDD++ + PL +R E A++ GP V
Sbjct: 499 NGEAFDTAGEIRDGYLYISRKWGSDDQVELHFPLPVRKIYASTHVREDVGCVALMRGPVV 558
>gi|397660575|ref|YP_006501277.1| hypothetical protein A225_5616 [Klebsiella oxytoca E718]
gi|394348582|gb|AFN34703.1| putative secreted protein [Klebsiella oxytoca E718]
Length = 653
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 89/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTPHDDV---LYINLYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + SS + +L LR+P W + + T
Sbjct: 445 GNSVEIPVGNEALRLRISGNYPWQEQVKIVIDSSSP---VNHTLALRLPDWC--DKPQVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG + +L ++ W D L + LP+ +R
Sbjct: 500 LNGAPVTQDVRKGYLHISHLWQEGDTLQLTLPMPVR 535
>gi|375257948|ref|YP_005017118.1| hypothetical protein KOX_05730 [Klebsiella oxytoca KCTC 1686]
gi|365907426|gb|AEX02879.1| hypothetical protein KOX_05730 [Klebsiella oxytoca KCTC 1686]
Length = 653
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 89/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTPHDDV---LYINLYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + SS + +L LR+P W + + T
Sbjct: 445 GNSVEIPVGNEALRLRISGNYPWQEQVKIVIDSSSP---VNHTLALRLPDWC--DKPQVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG + +L ++ W D L + LP+ +R
Sbjct: 500 LNGAPVTQDVRKGYLHISHLWQEGDTLQLTLPMPVR 535
>gi|423126346|ref|ZP_17114025.1| hypothetical protein HMPREF9694_03037 [Klebsiella oxytoca 10-5250]
gi|376397918|gb|EHT10548.1| hypothetical protein HMPREF9694_03037 [Klebsiella oxytoca 10-5250]
Length = 653
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI Y+
Sbjct: 388 VNPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTPHDD---ALYINLYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + ++ W +++ + SS + +L LR+P W + + T
Sbjct: 445 GNSVEIPVGNEALRLRISGNYPWQEQVKIVVDSSSP---VHHTLALRLPDWC--DKPQVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG + +L ++ W D L + LP+ +R A + A+ GP V
Sbjct: 500 LNGVPVTQDVRKGYLHISHLWQEGDTLQLTLPMPVRRIYGNPLVRHQAGLVAVQRGPLV 558
>gi|397166966|ref|ZP_10490409.1| hypothetical protein Y71_0972 [Enterobacter radicincitans DSM
16656]
gi|396091112|gb|EJI88679.1| hypothetical protein Y71_0972 [Enterobacter radicincitans DSM
16656]
Length = 651
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 55/216 (25%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 PGSSKERSYHHWG--TPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
R H + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKTLRFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTPHQD---ALYINLYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G V+ +V W +V + S + +L LR+P W + + T
Sbjct: 445 GNSIEVPVGDKVLRLRVSGNFPWQE--KVMIAVESPLP-VQHTLALRMPDW--CDAPQVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG + +L + + W D LT+ LP+ +R
Sbjct: 500 LNGVAVEKAVHKGYLHIHRLWQEGDTLTLTLPMPVR 535
>gi|436834929|ref|YP_007320145.1| hypothetical protein FAES_1542 [Fibrella aestuarina BUZ 2]
gi|384066342|emb|CCG99552.1| hypothetical protein FAES_1542 [Fibrella aestuarina BUZ 2]
Length = 636
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 112/528 (21%), Positives = 206/528 (39%), Gaps = 85/528 (16%)
Query: 142 LLMLDVDKLVWNFRKTARLPAPGEPYGGWEEPSCELRGHFVGHYLSASALMWASTHNESL 201
+L +VD+LV FR E C + F G + +++ L + L
Sbjct: 68 ILAQNVDRLVAPFRDRT-------------ETRC-WQSEFWGKWFTSAVLAYRYRPEPQL 113
Query: 202 KEKMSAVVSALSACQKEIG--SGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQY 259
K + V+ L A Q G Y +Q+D +W Y L GLL Y
Sbjct: 114 KNVLDKAVADLLATQTPDGYIGNYADTSHLQQWD-------IWGRKY----CLLGLLAYY 162
Query: 260 TYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDP 319
++ +L + + ++ N + +K + + A + + + L+ T D
Sbjct: 163 DLTNDKRSLNAASKVTDHLINELS--ARKALLVKQGNHRGMAATSVLEPVCLLYSRTADK 220
Query: 320 KHLMLAHL----FDKPCFLGLLALQADDIS--------------GFHSNTHIPIVIGSQM 361
++L A ++ P L+A D++ G + + G
Sbjct: 221 RYLAFAETIVQQWESPEGPQLIAKADVDVANRFPKPKNWFGWEQGQKAYEMMSCYEGLLE 280
Query: 362 RYEVTGDQLHKEGHQ------------LESSGTNIGHFNFKSDPKRLASNLDSNTEESCT 409
Y +TG +K + L SG+++ + + L+ N + +E+C
Sbjct: 281 LYRLTGKPAYKAAVEKTWQNIRDTEINLAGSGSSVECWFGGKALQTLSIN---HYQETCV 337
Query: 410 TYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSY 469
T +K+S+ L R T + YAD E++ N +LG + Y PL+ +
Sbjct: 338 TATWIKLSQQLLRLTGDARYADAIEQTYYNALLGSMKADGSDWTKYT-PLS--GQRLEGG 394
Query: 470 HHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR--LDWKSGQIV- 526
G + CC +G L ++ + GV + Y + GQ V
Sbjct: 395 EQCGMGLN---CCVASGPRGLFTLPQTVVMS---RADGVQVNFYAEGTYLANTPGGQSVS 448
Query: 527 VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPG 586
+ Q+ D VS L ++L + + ++ +RIP W+ + T+NGQ +P G
Sbjct: 449 LRQQTDYPVSGQSTLHLSLPKTE-----SFTVRVRIPAWSVQ--STVTVNGQAVPTVVAG 501
Query: 587 NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
++++ +TW + D+L++ L + R + D P++ AI+ GP VL
Sbjct: 502 EYVAIKRTWQTGDQLSLTLDMRGRVVRL-GDMPQHL---AIVRGPVVL 545
>gi|429083191|ref|ZP_19146237.1| COG3533 secreted protein [Cronobacter condimenti 1330]
gi|426548006|emb|CCJ72278.1| COG3533 secreted protein [Cronobacter condimenti 1330]
Length = 651
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 54/216 (25%), Positives = 90/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
DS ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DSVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + + P W CC + LG IY + +YI Y+
Sbjct: 388 VHPKTLCLNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIY---TPRPDALYINLYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G+ V+ +V W +V + S + +L LR+P W + + T
Sbjct: 445 GNSIEVPVGENVLRLRVSGNFPWQE--KVVIAIDSPLP-VQHTLALRMPDWC--DAPQVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG ++ +L + + W D LT+ LP+ +R
Sbjct: 500 LNGIEVEKSVRKGYLHIPRVWREGDTLTLTLPMPVR 535
>gi|146295756|ref|YP_001179527.1| hypothetical protein [Caldicellulosiruptor saccharolyticus DSM
8903]
gi|145409332|gb|ABP66336.1| protein of unknown function DUF1680 [Caldicellulosiruptor
saccharolyticus DSM 8903]
Length = 653
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 100/240 (41%), Gaps = 22/240 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI--QRGTEPGVMIYLLPLA--P 461
E+C + ++ + + R Y D ER+L N ++G Q G + Y+ PL P
Sbjct: 337 ETCASVGLVFFAHRMNRIKPHRKYYDVVERALYNTIIGAMSQDGKK---YFYVNPLEVFP 393
Query: 462 GSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
++R H P W CC + +G IY + +Y+ YI S
Sbjct: 394 KEVEKRFDRHHVKPERQPWFGCACCPPNVARLLASIGKYIYLYNNNE---IYVNLYIGSE 450
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSG-LTTSLNLRIPTWTSSNGAKATLN 576
++ ++ NQKV + + F +G + +LNLRIP+W K +N
Sbjct: 451 SEF----LINNQKVKIIQDSGYPFNDEVNFKIITNGEMYFTLNLRIPSWCDKFEIK--IN 504
Query: 577 GQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G+ L ++S+T+ W SDD++ I LP L+ E AI+ GP V
Sbjct: 505 GELLTGFSLKDGYVSITRGWKSDDRIEIILPTQLKRVYSNPLVRENIGKVAIVKGPVVFC 564
>gi|449310077|ref|YP_007442433.1| hypothetical protein CSSP291_17885 [Cronobacter sakazakii SP291]
gi|449100110|gb|AGE88144.1| hypothetical protein CSSP291_17885 [Cronobacter sakazakii SP291]
Length = 655
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 332 DTVYAESCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 390
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + K + P W CC + LG IY E ++I YI
Sbjct: 391 VHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTARED---ALFINLYI 447
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ + G + ++ W +R+ + + +L LR+P W + +
Sbjct: 448 GNNVQLPVGDSTLRLRISGDFPWHEEVRIHI---DSPRPVEHTLALRLPDW--CDAPRVM 502
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+ +L +T+TW D LT+ LP+ +R
Sbjct: 503 LNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 538
>gi|423122678|ref|ZP_17110362.1| hypothetical protein HMPREF9690_04684 [Klebsiella oxytoca 10-5246]
gi|376391959|gb|EHT04626.1| hypothetical protein HMPREF9690_04684 [Klebsiella oxytoca 10-5246]
Length = 653
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 104/481 (21%), Positives = 179/481 (37%), Gaps = 71/481 (14%)
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQF--DRLEALI 239
V +L A A + L++ V+ ++A Q E GYL+ + T + +R L
Sbjct: 74 VAKWLEAVAWSLCQKPDAELEKTADEVIELVAAAQCE--DGYLNTYFTVKAPAERWTNLA 131
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
Y H I AG+ A R +V + + NV + H +
Sbjct: 132 ECHELYCAGHMIEAGVA-----FFQATGKRRLLEVVCRLADHIDNVFGPGDNQLHGYPGH 186
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLALQADDISGF------- 347
E + L +L+ ITQ+P++L L + F +P F + + S +
Sbjct: 187 PE---IELALMRLYDITQEPRYLALVNYFVEERGTQPHFYDIEYEKRGKTSYWNTYGPAW 243
Query: 348 ------HSNTHIPI-----VIGSQMR--YEVTG---------------DQLHKEGHQLES 379
+S H PI IG +R Y +TG D L + +
Sbjct: 244 MVMDKPYSQAHQPISEQPVAIGHAVRFVYLMTGVAHLARLSQDEGKRQDCLRLWNNMAQR 303
Query: 380 SGTNIGHFNFKSDPKRLASNLDSNTE----ESCTTYNMLKVSRHLFRWTKEIAYADYYER 435
G +S + +S+ D + ESC + ++ +R + + YAD ER
Sbjct: 304 QLYITGGIGSQSSGEAFSSDYDLPNDTVYAESCASIGLMMFARRMLEMEADSQYADVMER 363
Query: 436 SLTNGVLGIQRGTEPGVMIYLLPLA--PGSSKERSYHHWGTPSDSFW----CCYGTGIES 489
+L N VLG + Y+ PL P S K + P W CC
Sbjct: 364 ALYNTVLG-GMALDGKHFFYVNPLEVHPTSLKFNHIYDHVKPVRQRWFGCACCPPNIARV 422
Query: 490 FSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSS 549
+ LG IY + +YI Y+ + + G + ++ W +++ +
Sbjct: 423 LTSLGHYIYTPHQD---ALYINLYVGNSAEIPVGDETLRLRISGNYPWQEQVKIAV---D 476
Query: 550 KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTL 609
+ + +L LR+P W + + TLNG+ + +L ++ W D L + LP+ +
Sbjct: 477 SPTPINHTLALRLPDWC--DNPQVTLNGKPVAQDVRKGYLHISHRWQEGDTLLLTLPMPV 534
Query: 610 R 610
R
Sbjct: 535 R 535
>gi|421728042|ref|ZP_16167199.1| hypothetical protein KOXM_22161 [Klebsiella oxytoca M5al]
gi|410371224|gb|EKP25948.1| hypothetical protein KOXM_22161 [Klebsiella oxytoca M5al]
Length = 653
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + LG IY + +YI YI
Sbjct: 388 VHPKSLKFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTPHDD---ALYINLYI 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ + G + ++ W +++ + SS + +L LR+P W + + T
Sbjct: 445 GNSAEIPVGNEALRLRISGNYPWQEQVQIVIDSSSP---VHHTLALRLPDWC--DKPQVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG + +L ++ W D L + LP+ +R
Sbjct: 500 LNGAPVTQDVRKGYLYISHLWQEGDTLLLTLPMPVR 535
>gi|403743937|ref|ZP_10953416.1| hypothetical protein URH17368_0706 [Alicyclobacillus hesperidum
URH17-3-68]
gi|403122527|gb|EJY56741.1| hypothetical protein URH17368_0706 [Alicyclobacillus hesperidum
URH17-3-68]
Length = 712
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 60/215 (27%), Positives = 93/215 (43%), Gaps = 22/215 (10%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA--PGS 463
E+C + ++ + + R + YAD ER+L N V+G + Y+ PLA P +
Sbjct: 384 ETCASIGLIFFANRMIRISPRREYADVMERALYNVVIG-SMALDGKHYCYVNPLALWPPA 442
Query: 464 SKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYF--EEEGKYPGVYIIQYISSR 517
+ + P W CC LGD IY EE+GK VY+ YI S
Sbjct: 443 NIQNPDRKHVKPVRQAWFGCACCPPNVARLMMSLGDYIYTIDEEKGK---VYVHLYIGSE 499
Query: 518 LDWKSG--QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
+ G +IV+ Q D + W RV + + SL LRIP+W + +
Sbjct: 500 ASFSVGGRKIVLIQ--DSEMPWQG--RVKFRVALGEGPVNFSLALRIPSWCADT-PSVRV 554
Query: 576 NGQDLPLPS---PGNFLSVTKTWSSDDKLTIQLPL 607
NG L + S ++ + +TW+ D L + LP+
Sbjct: 555 NGNLLSIASVTTKDGYIEIERTWTDGDVLELDLPM 589
>gi|156935976|ref|YP_001439892.1| hypothetical protein ESA_03870 [Cronobacter sakazakii ATCC BAA-894]
gi|156534230|gb|ABU79056.1| hypothetical protein ESA_03870 [Cronobacter sakazakii ATCC BAA-894]
Length = 655
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 332 DTVYAESCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 390
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + K + P W CC + LG IY E ++I YI
Sbjct: 391 VHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTARED---ALFINLYI 447
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ + G + ++ W +R+ + + +L LR+P W + +
Sbjct: 448 GNNVQLPVGDSTLRLRISGDFPWHEEVRIHI---DSPRPVEHTLALRLPDW--CDAPRVM 502
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+ +L +T+TW D LT+ LP+ +R
Sbjct: 503 LNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 538
>gi|429121562|ref|ZP_19182182.1| COG3533 secreted protein [Cronobacter sakazakii 680]
gi|426323943|emb|CCK12919.1| COG3533 secreted protein [Cronobacter sakazakii 680]
Length = 655
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 332 DTVYAESCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 390
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + K + P W CC + LG IY E ++I YI
Sbjct: 391 VHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTARED---ALFINLYI 447
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ + G + ++ W +R+ + + +L LR+P W + +
Sbjct: 448 GNNVQLPVGDSTLRLRISGDFPWHEEVRIHI---DSPRPVEHTLALRLPDW--CDAPRVM 502
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+ +L +T+TW D LT+ LP+ +R
Sbjct: 503 LNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 538
>gi|379722221|ref|YP_005314352.1| hypothetical protein PM3016_4439 [Paenibacillus mucilaginosus 3016]
gi|386724962|ref|YP_006191288.1| hypothetical protein B2K_22975 [Paenibacillus mucilaginosus K02]
gi|378570893|gb|AFC31203.1| hypothetical protein PM3016_4439 [Paenibacillus mucilaginosus 3016]
gi|384092087|gb|AFH63523.1| hypothetical protein B2K_22975 [Paenibacillus mucilaginosus K02]
Length = 660
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 92/389 (23%), Positives = 142/389 (36%), Gaps = 81/389 (20%)
Query: 308 VLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLALQADDISGFHSNTHIPIVIGSQMR 362
L KL+ T + ++L LA F +P FL Q D S + + +PI QM
Sbjct: 195 ALVKLYGATGEERYLKLAQFFIDERGTEPNFLVEECRQRDGYSHW-AKKKLPIPTAEQMA 253
Query: 363 YEVTGDQLHKEGHQLESSGTNIGH-------FNFKSDPKRLASNL--------------- 400
Y +Q HK Q + T +GH + +D RL +
Sbjct: 254 Y----NQAHKPVRQQD---TAVGHSVRAVYMYTAMADLARLTGDAELLEACRRLWDNTTK 306
Query: 401 --------------------------DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYE 434
D+ E+C + ++ +R + + + YAD E
Sbjct: 307 KQMYITGGIGSTHHGEAFSFDYDLPNDTVYAETCASIGLIFFARRMLQLEAKSEYADVLE 366
Query: 435 RSLTNGVLG--IQRGTEPGVMIYLLPLA--PGSSKERSYHHWGTPSDSFW----CCYGTG 486
R+L N V+G Q G Y+ PL P +S++ H W CC
Sbjct: 367 RALYNNVIGSMSQDGKH---YFYVNPLEVWPKASEQNPGRHHVKAVRQPWFGCSCCPPNV 423
Query: 487 IESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK--SGQIVVNQKVDPVVSWDPYLRVT 544
S L D IY G+ VY +I S +K +GQ+ + Q + + W+ R
Sbjct: 424 ARLLSSLNDYIYSASAGENT-VYTHLFIGSEASFKLAAGQVALKQ--ESRLPWEGCARFE 480
Query: 545 LTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQ 604
LT + +L LRIP+W S A+ +NG + VT+ W++ D +
Sbjct: 481 LTAVPEAP---VTLALRIPSW-SGGRAELRINGAAEAYEVENGYAVVTRRWTAGDVVEWA 536
Query: 605 LPLTLRTEAIQDDRPEYASIQAILYGPYV 633
L + A + A I GP V
Sbjct: 537 PALQAQLTAAHPEIRANAGRAVIERGPLV 565
>gi|389842783|ref|YP_006344867.1| hypothetical protein ES15_3783 [Cronobacter sakazakii ES15]
gi|387853259|gb|AFK01357.1| hypothetical protein ES15_3783 [Cronobacter sakazakii ES15]
Length = 655
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 332 DTVYAESCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 390
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + K + P W CC + LG IY E ++I YI
Sbjct: 391 VHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTARED---ALFINLYI 447
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ + G + ++ W +R+ + + +L LR+P W + +
Sbjct: 448 GNDVQLPVGDSTLRLRISGDFPWHEEVRIHI---DSPRPVEHTLALRLPDW--CDAPRVM 502
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+ +L +T+TW D LT+ LP+ +R
Sbjct: 503 LNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVR 538
>gi|262381468|ref|ZP_06074606.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262296645|gb|EEY84575.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 623
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 111/256 (43%), Gaps = 31/256 (12%)
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
+T E+C T+ +++ + T YAD E+++ N +L + + Y
Sbjct: 318 HTMETCVTFTWMQICDRMLGLTGNSLYADQIEKAMYNALLASLKADASQIAKY------- 370
Query: 463 SSKERSYHHWGTPSDSFW--CCYGTGIESFSKLGDSIYFEEEGKYPGV--YIIQYISSRL 518
S + H G CC G +F+ + Y + G+ V Y + L
Sbjct: 371 -SPLEGWRHEGEEQCGMHINCCNANGPRAFAMIPQFAY-QVNGRRIDVNLYAASSVEVEL 428
Query: 519 DWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
D K ++ + Q+ D P+ D +R+ + K S T +L RIP W S ++NG
Sbjct: 429 D-KKTRVSMTQETDYPI---DGQVRIVVE-PEKTSDFTIAL--RIPAW--SERTVVSVNG 479
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+ L G +L + +TW D++T++L + R + + QAI+ GP VLA
Sbjct: 480 EPLTDLLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARD 532
Query: 638 S-IGDWDITESATSLS 652
S D D+ E++ +S
Sbjct: 533 SRFKDGDVDEASVIVS 548
>gi|354603632|ref|ZP_09021629.1| hypothetical protein HMPREF9450_00544 [Alistipes indistinctus YIT
12060]
gi|353348727|gb|EHB92995.1| hypothetical protein HMPREF9450_00544 [Alistipes indistinctus YIT
12060]
Length = 630
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 60/254 (23%), Positives = 109/254 (42%), Gaps = 45/254 (17%)
Query: 394 KRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVM 453
+R+ + + E+C T +++ HL T + YAD ER++ N +L +G +
Sbjct: 316 RRMQTTPAYSMMETCVTMTWMQLCGHLLELTHDPLYADQIERTVYNALLAALKGDGSQIA 375
Query: 454 IYLLPL----APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKL--------GDSIYFEE 501
Y PL +PG + + + CC G +F+ + D+++
Sbjct: 376 KYS-PLEGVRSPGGPQCGMHVN---------CCNMNGPRAFAMIPELMATCAADTLFVNL 425
Query: 502 EGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLR 561
G+ S++ G++++ Q+ + + V LT + + S ++ +R
Sbjct: 426 YGES---------VSKVPLAGGEVILRQQTN----YPEQGSVELTVNPRKS-REFAVAVR 471
Query: 562 IPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
IP W S T+NGQ + PG++L+V++TW DK+ + + R E
Sbjct: 472 IPAW--SKITMVTVNGQAVADVRPGSYLTVSRTWKEGDKIALNFDMRGRLT-------EL 522
Query: 622 ASIQAILYGPYVLA 635
QAI GP VLA
Sbjct: 523 NGYQAIERGPVVLA 536
>gi|283787780|ref|YP_003367645.1| hypothetical protein ROD_42311 [Citrobacter rodentium ICC168]
gi|282951234|emb|CBG90928.1| conserved hypothetical protein [Citrobacter rodentium ICC168]
Length = 651
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 91/216 (42%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASVGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S + P W CC + +G IY + +YI Y+
Sbjct: 388 VHPKSLSFNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTP---RPEALYINLYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W + +VT+ S S + +L LR+P W AK
Sbjct: 445 GNSMELPLAGGTLRLRISGDYPW--HEQVTIAVDSPQS-IHHTLALRLPDWCPQ--AKVA 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ ++ +T++W D L + LP+ +R
Sbjct: 500 LNGEEVAQDIRKGYIHITRSWQEGDTLRLTLPMPVR 535
>gi|429117671|ref|ZP_19178589.1| COG3533 secreted protein [Cronobacter sakazakii 701]
gi|426320800|emb|CCK04702.1| COG3533 secreted protein [Cronobacter sakazakii 701]
Length = 372
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 94/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 49 DTVYAESCASIGLIMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 107
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + K + P W CC + LG IY E ++I YI
Sbjct: 108 VHPKTLKFNHIYDHVKPVRQRWFGCACCPPNIARLLTSLGHYIYTARED---ALFINLYI 164
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ + G + ++ W +R+ + + +L LR+P W + +
Sbjct: 165 GNNVQLPVGDSTLRLRISGDFPWHEEVRIHI---DSPRPVEHTLALRLPDW--CDAPRVM 219
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+ +L +T+TW D LT+ LP+ +R A AI GP +
Sbjct: 220 LNGRPCEGDIRKGYLWLTRTWHEGDTLTLTLPMPVRRVYGNPLVRHVAGKVAIQRGPLI 278
>gi|251797630|ref|YP_003012361.1| hypothetical protein Pjdr2_3643 [Paenibacillus sp. JDR-2]
gi|247545256|gb|ACT02275.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 645
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 95/235 (40%), Gaps = 10/235 (4%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA--PGS 463
E+C + ++ +R + + + YAD ER+L N VLG + Y+ PL P +
Sbjct: 324 ETCASIGLIFWARRMLQLEAKSEYADVMERALYNNVLG-SMAKDGKHFFYVNPLEVWPEA 382
Query: 464 SKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSRL 518
S + P W CC L + IY E+G V++
Sbjct: 383 SAKSPDKFHVKPVRQKWFGCSCCPPNVARLLGSLDEYIYDVSEDGSTVRVHLFIGSEVAF 442
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+ + +IV+NQK + + W+ + ++ + L LRIP W SS A +NG+
Sbjct: 443 ETEGKKIVLNQKSE--LPWNGQVEFKVSLQEDKGDVPFMLALRIPNWFSSKEALLKINGE 500
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+ + +V + W D++ LP+ + A A AI GP V
Sbjct: 501 TVRYHVDKGYATVYRVWQDGDRVEWLLPIETQLIAANPLIRADAGKAAIQRGPLV 555
>gi|323344406|ref|ZP_08084631.1| hypothetical protein HMPREF0663_11167 [Prevotella oralis ATCC
33269]
gi|323094533|gb|EFZ37109.1| hypothetical protein HMPREF0663_11167 [Prevotella oralis ATCC
33269]
Length = 627
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/237 (27%), Positives = 102/237 (43%), Gaps = 31/237 (13%)
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSS 464
+E+C T +K+SR L T YAD E+SL N +LG + Y PL+
Sbjct: 324 QETCVTATWIKLSRQLLMLTGNSKYADAIEQSLYNALLGAMKSDGSDWAKYT-PLS--GQ 380
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE-EGKY-----PGVYIIQYISSRL 518
+ + G + CC +G + + + +G PG Y +Q
Sbjct: 381 RLQGSEQCGMGLN---CCTASGPRGLFIIPQTAVMQSIKGAVINLYIPGTYTLQSP---- 433
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
K +I++ Q+ D + V + F K + T L+LRIP W S K TLNG
Sbjct: 434 --KGQEIIITQQGD----YPQTGTVRIAFKVKQTEEFT-LSLRIPEW--SKDTKVTLNGN 484
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEA-IQDDRPEYASIQAILYGPYVL 634
D+ G++L + + WS D ++L L +R + + P+Y AI GP VL
Sbjct: 485 DVVPAHNGSYLQINRKWSDGDH--VELVLDMRAQLHFMGENPQYL---AITRGPVVL 536
>gi|301309993|ref|ZP_07215932.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|423340426|ref|ZP_17318165.1| hypothetical protein HMPREF1059_04090 [Parabacteroides distasonis
CL09T03C24]
gi|300831567|gb|EFK62198.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|409227861|gb|EKN20757.1| hypothetical protein HMPREF1059_04090 [Parabacteroides distasonis
CL09T03C24]
Length = 623
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/256 (24%), Positives = 111/256 (43%), Gaps = 31/256 (12%)
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
+T E+C T+ +++ + T YAD E+++ N +L + + Y
Sbjct: 318 HTMETCVTFTWMQICDRMLGLTGNSLYADQIEKAMYNALLASLKADASQIAKY------- 370
Query: 463 SSKERSYHHWGTPSDSFW--CCYGTGIESFSKLGDSIYFEEEGKYPGV--YIIQYISSRL 518
S + H G CC G +F+ + ++ G+ V Y + L
Sbjct: 371 -SPLEGWRHEGEEQCGMHINCCNANGPRAFAMI-PRFAYQVNGRRIDVNLYAASSVEVEL 428
Query: 519 DWKSGQIVVNQKVD-PVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
D K ++ + Q+ D P+ D +R+ + K S T +L RIP W S ++NG
Sbjct: 429 D-KKTRVSMTQETDYPI---DGQVRIVVE-PEKTSDFTIAL--RIPAW--SERTVVSVNG 479
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+ L G +L + +TW D++T++L + R + + QAI+ GP VLA
Sbjct: 480 EPLTDLLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARD 532
Query: 638 S-IGDWDITESATSLS 652
S D D+ E++ +S
Sbjct: 533 SRFKDGDVDEASVIVS 548
>gi|189467307|ref|ZP_03016092.1| hypothetical protein BACINT_03695 [Bacteroides intestinalis DSM
17393]
gi|189435571|gb|EDV04556.1| hypothetical protein BACINT_03695 [Bacteroides intestinalis DSM
17393]
Length = 611
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 104/246 (42%), Gaps = 41/246 (16%)
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA-- 460
+T E+C T+ +++ L T YAD E+SL N ++ + + Y P+
Sbjct: 309 HTMETCVTFTWIQLCDKLLALTGNPFYADQIEKSLYNALMAALKDDASQIAKYS-PMEGH 367
Query: 461 PGSSKERSYHHWGTPSDSFWCCYGTGIESFS--------KLGDSIYFEEEGKYPGVYIIQ 512
+E+ H CC G +F+ K+G+ +Y G
Sbjct: 368 RCEGEEQCGMHIN-------CCNANGPRAFALIPDFAVKKMGNEVYVNYYGD-------- 412
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
+S+ L+ +++V Q VS + +T+ + + L+LR+P W++
Sbjct: 413 -MSASLENGHNKVLVKQHTTYPVS--NVIDITIDVTKEN---VFGLHLRVPVWSAQT--V 464
Query: 573 ATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
TLNG++L PG + ++T+ W D + I L + R E +QAI+ GP
Sbjct: 465 ITLNGEELKDICPGTYHAITRKWKKGDHIQIILDMPARL-------LEQNQMQAIVRGPI 517
Query: 633 VLAGHS 638
VLA S
Sbjct: 518 VLARDS 523
>gi|238023985|ref|YP_002908217.1| hypothetical protein [Burkholderia glumae BGR1]
gi|237878650|gb|ACR30982.1| Hypothetical protein bglu_2g05390 [Burkholderia glumae BGR1]
Length = 655
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 54/243 (22%), Positives = 105/243 (43%), Gaps = 22/243 (9%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ +R + ++E YAD ER+L N VL GI G + Y+ PL
Sbjct: 330 DTAYTETCASVGLVFFARRMLEASRESGYADVLERALYNTVLAGI--GLDGRSFFYVNPL 387
Query: 460 APGSSKERSYHHWG--TPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
+ R H + P W CC + L +Y ++ +Y+ Y
Sbjct: 388 ETHPAGIRGNHKYEHVKPVRQRWFGCACCPPNVARLIASLDQYVYLVDDSI---IYVNLY 444
Query: 514 IS--SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
++ +RL+ + ++ + Q+ + W LR+ + + G ++ +R+P W ++
Sbjct: 445 VAGEARLNAGTSRVTLRQQGN--YPWRGDLRIVV---EQADGFDGTIAVRLPDWCAA--P 497
Query: 572 KATLNGQDLPLPSP-GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+ +NG + + +L + + W D + + LP+T+R A A+ G
Sbjct: 498 EVRVNGDTVACSAAVDGYLHLPRVWHDGDTIELVLPMTVRRLTGHGKLRHAAGKVAVQRG 557
Query: 631 PYV 633
P V
Sbjct: 558 PIV 560
>gi|373462448|ref|ZP_09554170.1| hypothetical protein HMPREF9944_02434 [Prevotella maculosa OT 289]
gi|371948225|gb|EHO66109.1| hypothetical protein HMPREF9944_02434 [Prevotella maculosa OT 289]
Length = 932
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 112/264 (42%), Gaps = 24/264 (9%)
Query: 376 QLESSGTNIGHFNFKSDPK-RLASNLDSNTEESCTTYNMLKVS-RHLFRWTKEIAYADYY 433
Q+ G ++ +F+ PK + +NL +N E+C + + ++ R L W + YA
Sbjct: 618 QIPGGGISLCE-HFECRPKSHVLTNLPNNIYETCGSVFWIDLNHRFLQLWPTKERYASEI 676
Query: 434 ERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKL 493
E+SL N V Q E G + Y + Y+ CC + L
Sbjct: 677 EKSLYNVVFAAQ--GENGCIRYFNQVNDAKYPAMCYNT---------CCEIQATALYGML 725
Query: 494 GDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWD-PYLRVTLTFSSKGS 552
+Y GV++ + +S +D+K V +Q V + PY S
Sbjct: 726 PQYVYSVAPD---GVFVNLFSASDIDFK----VKDQPVKLTMKTQFPYSNQVALRVSADR 778
Query: 553 GLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTE 612
+T + +RIP W + G +N + + PG+++ + +TW +D++T LP+T E
Sbjct: 779 PVTMKVRVRIPEW-AKGGVVLRVNDRKVKTGMPGSYVEIDRTWKDNDEITWSLPMTWSYE 837
Query: 613 A-IQDDRPEYASIQAILYGPYVLA 635
I R A+ A YGP ++A
Sbjct: 838 KYIGATRIAGATRYAFFYGPMLMA 861
>gi|378766201|ref|YP_005194662.1| hypothetical protein PANA5342_1232 [Pantoea ananatis LMG 5342]
gi|365185675|emb|CCF08625.1| hypothetical protein PANA5342_1232 [Pantoea ananatis LMG 5342]
Length = 651
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 100/483 (20%), Positives = 178/483 (36%), Gaps = 75/483 (15%)
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQF--DRLEALI 239
V +L A A + L++ V+ ++A Q E GYL+ + T + DR L
Sbjct: 74 VAKWLEAVAWSLCQKPDAELEKTADEVIELIAAAQCE--DGYLNTYFTVKAPQDRWTNLA 131
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
Y H I AG+ A R +V + + +V + H +
Sbjct: 132 ECHELYCAGHMIEAGVAFY-----QATGKRRLLEVVCRLADHIDSVFGPEEHQLHGYPGH 186
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLALQADDISGFHSNTHIP 354
E + L +L+ +TQ P++L L + F +P F + + S +H T+ P
Sbjct: 187 PE---IELALMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWH--TYGP 241
Query: 355 IVIGSQMRY----EVTGDQLHKEGHQLESS--GTNIGHF-------NFKSDPKRLASNL- 400
+ Y + +Q H GH + T + H + D RL N+
Sbjct: 242 AWMVKDKAYSQAHQPLAEQQHAVGHAVRFVYLMTGVAHLARLSQDEGKRQDCLRLWHNMA 301
Query: 401 ---------------------------DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYY 433
D+ ESC + ++ +R + + YAD
Sbjct: 302 QRQLYITGGIGSQSSGEAFSSDYDLPNDTVYAESCASIGLMMFARRMLEMEADSQYADVM 361
Query: 434 ERSLTNGVLGIQRGTEPGVMIYLLPL--APGSSKERSYHHWGTPSDSFW----CCYGTGI 487
ER+L N VLG + Y+ PL P + + P W CC
Sbjct: 362 ERALYNTVLG-GMALDGKHFFYVNPLEVQPKTLHFNHLYDHVKPVRQRWFGCACCPPNIA 420
Query: 488 ESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTF 547
+ LG IY + ++I Y+ +R+D G + + W+ + +++
Sbjct: 421 RLLTSLGHYIYTPHQN---ALFINLYVGNRVDVPVGDRTLGIHISGNFPWEETVTISVDA 477
Query: 548 SSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
+ + +L LR+P W + + + NG+ + + +L + + W D LT+ LP+
Sbjct: 478 TQP---VKHTLALRLPDWCEA--PQVSCNGEVVTDRARKGYLYIERIWQEGDTLTLTLPM 532
Query: 608 TLR 610
+R
Sbjct: 533 PVR 535
>gi|432865910|ref|ZP_20088760.1| hypothetical protein A311_04524 [Escherichia coli KTE146]
gi|431401839|gb|ELG85171.1| hypothetical protein A311_04524 [Escherichia coli KTE146]
Length = 654
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 95/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + + T
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+++ +L +T+ W D L + LP+ +R A AI GP V
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVRRVYGNPLMRHVAGKVAIQRGPLV 558
>gi|337749269|ref|YP_004643431.1| hypothetical protein KNP414_05037 [Paenibacillus mucilaginosus
KNP414]
gi|336300458|gb|AEI43561.1| protein of unknown function DUF1680 [Paenibacillus mucilaginosus
KNP414]
Length = 660
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 92/389 (23%), Positives = 141/389 (36%), Gaps = 81/389 (20%)
Query: 308 VLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLALQADDISGFHSNTHIPIVIGSQMR 362
L KL+ T + ++L LA F +P FL Q D S + + +PI QM
Sbjct: 195 ALVKLYGATGEERYLKLAQFFIDERGTEPNFLVEECRQRDGYSHW-AKKKLPIPTAEQMA 253
Query: 363 YEVTGDQLHKEGHQLESSGTNIGH-------FNFKSDPKRLASNL--------------- 400
Y +Q HK Q + T +GH + +D RL +
Sbjct: 254 Y----NQAHKPVRQQD---TAVGHSVRAVYMYTAMADLARLTGDAELLEACRRLWDNTTK 306
Query: 401 --------------------------DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYE 434
D+ E+C + ++ +R + + + YAD E
Sbjct: 307 KQMYITGGIGSTHHGEAFSFDYDLPNDTVYAETCASIGLIFFARRMLQLEAKSEYADVLE 366
Query: 435 RSLTNGVLG--IQRGTEPGVMIYLLPLA--PGSSKERSYHHWGTPSDSFW----CCYGTG 486
R+L N V+G Q G Y+ PL P +S++ H W CC
Sbjct: 367 RALYNNVIGSMSQDGKH---YFYVNPLEVWPKASEQNPGRHHVKAVRQPWFGCSCCPPNV 423
Query: 487 IESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW--KSGQIVVNQKVDPVVSWDPYLRVT 544
S L D IY G VY +I S + +GQ+ + Q + + W+ R
Sbjct: 424 ARLLSSLNDYIYSASPGDNT-VYTHLFIGSEASFTLAAGQVALKQ--ESRLPWEGCARFE 480
Query: 545 LTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQ 604
LT + +L LRIP+W S A+ +NG + VT+ W++ D +
Sbjct: 481 LTAVPEAP---VTLALRIPSW-SGGRAELRINGAAEAYEVENGYAVVTRRWTAGDVVEWA 536
Query: 605 LPLTLRTEAIQDDRPEYASIQAILYGPYV 633
L + A + A AI GP V
Sbjct: 537 PALQAQLTAAHPEIRANAGRAAIERGPLV 565
>gi|386626404|ref|YP_006146132.1| hypothetical protein CE10_4140 [Escherichia coli O7:K1 str. CE10]
gi|349740140|gb|AEQ14846.1| hypothetical protein CE10_4140 [Escherichia coli O7:K1 str. CE10]
Length = 573
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + + T
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQIT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|345297339|ref|YP_004826697.1| hypothetical protein Entas_0157 [Enterobacter asburiae LF7a]
gi|345091276|gb|AEN62912.1| protein of unknown function DUF1680 [Enterobacter asburiae LF7a]
Length = 649
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 57/240 (23%), Positives = 99/240 (41%), Gaps = 17/240 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 PGSSKERSYHHW---GTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
K S++H P W CC + LG IY E ++I Y
Sbjct: 388 V-HPKTLSFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTVRED---ALFINLY 443
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ + + G + ++ W +++ +T +T +L LR+P W ++ +
Sbjct: 444 VGNDVAIPVGDRKLQLRISGNYPWHEQVKIDITSPVP---VTHTLALRLPDWCAN--PEI 498
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+ + +L +T+ W D +T+ LP+ +R + A A+ GP V
Sbjct: 499 ALNGEVITGEVTRGYLYLTRRWQEGDAITLTLPMPVRRLYGNPQVRQQAGKVALQRGPLV 558
>gi|373958292|ref|ZP_09618252.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373894892|gb|EHQ30789.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 679
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 129/293 (44%), Gaps = 31/293 (10%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + + + T + YAD E +L NG+L GI T P + +P
Sbjct: 361 ETCASVGNVLWNWRMLQLTGKAQYADVMELTLYNGMLSGISLNGKKFLYTNPLSVSDDMP 420
Query: 459 LAPGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISS 516
SK+R Y + SD CC I + +++G+ Y ++G + +Y +S+
Sbjct: 421 FQQRWSKDRVDYIGY---SD---CCPPNVIRTIAEIGNYAYSISDKGVWVNLYGGNNLST 474
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+L +I ++Q+ D WD + + L ++ SL LRIP W S GA T+N
Sbjct: 475 QLLKDGSKIKLSQQTD--YPWDGKISIAL---NEVPAKAFSLFLRIPGWCGS-GASVTVN 528
Query: 577 GQDL-PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G+ + + +PG + + W + DK+ + LP+ ++ E + A+ GP V
Sbjct: 529 GKAVNTILTPGQYAEINGKWHAGDKIELLLPMPVKMIEANPLVEEVRNQIAVKRGPVVYC 588
Query: 636 GHSIG-DWDITESATSLSDWITPIPASY---NSQLITFTQEYGNTKFVLTNSN 684
S G D + SLS I +P NS ++ N L N+N
Sbjct: 589 VESAGMPKDKKVFSLSLSSKINLVPQKIVIDNSDIVAL-----NGNATLENAN 636
>gi|293413020|ref|ZP_06655688.1| conserved hypothetical protein [Escherichia coli B354]
gi|291468667|gb|EFF11160.1| conserved hypothetical protein [Escherichia coli B354]
Length = 656
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + + T
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCAQ--PQVT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVGQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|417116562|ref|ZP_11967423.1| putative glycosyhydrolase [Escherichia coli 1.2741]
gi|422801520|ref|ZP_16850016.1| hypothetical protein ERJG_02686 [Escherichia coli M863]
gi|323965978|gb|EGB61421.1| hypothetical protein ERJG_02686 [Escherichia coli M863]
gi|386139106|gb|EIG80261.1| putative glycosyhydrolase [Escherichia coli 1.2741]
Length = 656
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + + T
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQIT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|389805630|ref|ZP_10202778.1| hypothetical protein UUA_00270 [Rhodanobacter thiooxydans LCS2]
gi|388447325|gb|EIM03335.1| hypothetical protein UUA_00270 [Rhodanobacter thiooxydans LCS2]
Length = 607
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 130/323 (40%), Gaps = 50/323 (15%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
E+C++ ++++R L T E YA+ ER+ N +LG Q Y+ P
Sbjct: 303 ETCSSLAWIQLNRELLAITGEARYAEEIERTGYNDLLGAQAPNGEDWCYYVFP------N 356
Query: 466 ERSYHHWGTPSDSFW-CCYGTGIESFSKLGDSIYFEEEGKYPGV--YIIQYISSRLDWKS 522
R H ++W CC +G + +L Y ++ V Y S LD +
Sbjct: 357 GRRVH------TTYWRCCKSSGAMALEELPALAYARDDDGAIAVNLYGAGSASFALD-GA 409
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL 582
G++ + Q D LR+ + G + +L LRIP+W A +NG+D +
Sbjct: 410 GELRIEQHTAYPYPDDVRLRIAV-----GRPMRFTLKLRIPSWAKD--ATLVINGEDAGV 462
Query: 583 P-SPGNFLSVTKTWSSDDKLTIQLPLTLR-----TEAIQDDR-PEYASI---------QA 626
SPG++ + + W D+L + P+ R +Q+ R P+ + + A
Sbjct: 463 ALSPGHYAVLEREWHDGDELVARFPMQPRLHRAVNRNVQESRAPDGSEVCQEVLHFEYAA 522
Query: 627 ILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITF--TQEYGNTKFVLTNSN 684
+ GP V A I + + E+ +P + Q +T Q G + L +
Sbjct: 523 VTCGPLVYATGLIDGFKVEETLR--------LPDAPPQQWLTLQGAQADGVPRITL-DPG 573
Query: 685 QSITMEKFPKSGTDAALHATFRL 707
+E P GT + ++RL
Sbjct: 574 YRAPLEFTPYFGTGGRVDGSWRL 596
>gi|170681898|ref|YP_001745874.1| hypothetical protein EcSMS35_3909 [Escherichia coli SMS-3-5]
gi|170519616|gb|ACB17794.1| conserved hypothetical protein [Escherichia coli SMS-3-5]
Length = 656
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + + T
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQIT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|416899982|ref|ZP_11929388.1| hypothetical protein ECSTEC7V_4230 [Escherichia coli STEC_7v]
gi|327251242|gb|EGE62935.1| hypothetical protein ECSTEC7V_4230 [Escherichia coli STEC_7v]
Length = 656
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + + T
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQIT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|150007964|ref|YP_001302707.1| hypothetical protein BDI_1325 [Parabacteroides distasonis ATCC
8503]
gi|149936388|gb|ABR43085.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 623
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 108/255 (42%), Gaps = 29/255 (11%)
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
+T E+C T+ +++ + T YAD E+++ N +L + + Y
Sbjct: 318 HTMETCVTFTWMQICDRMLGLTGNSLYADQIEKAMYNALLASLKADASQIAKY------- 370
Query: 463 SSKERSYHHWGTPSDSFW--CCYGTGIESFSKLGDSIYFEEEGKYPGV--YIIQYISSRL 518
S + H G CC G +F+ + Y + G+ V Y + L
Sbjct: 371 -SPLEGWRHEGEEQCGMHINCCNANGPRAFAMIPQFAY-QINGRRIDVNLYAASSVEVEL 428
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
D K+ + + P+ D +R+ + K S T +L RIP W S ++NG+
Sbjct: 429 DKKTRVSMTQETNYPI---DGQVRIVVE-PEKTSDFTIAL--RIPAW--SERTVVSVNGE 480
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
L G +L + +TW D++T++L + R + + QAI+ GP VLA S
Sbjct: 481 PLTDLLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARDS 533
Query: 639 -IGDWDITESATSLS 652
D D+ E++ +S
Sbjct: 534 RFKDGDVDEASVIVS 548
>gi|256840863|ref|ZP_05546371.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|256738135|gb|EEU51461.1| conserved hypothetical protein [Parabacteroides sp. D13]
Length = 625
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 108/255 (42%), Gaps = 29/255 (11%)
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
+T E+C T+ +++ + T YAD E+++ N +L + + Y
Sbjct: 320 HTMETCVTFTWMQICDRMLGLTGNSLYADQIEKAMYNALLASLKADASQIAKY------- 372
Query: 463 SSKERSYHHWGTPSDSFW--CCYGTGIESFSKLGDSIYFEEEGKYPGV--YIIQYISSRL 518
S + H G CC G +F+ + Y + G+ V Y + L
Sbjct: 373 -SPLEGWRHEGEEQCGMHINCCNANGPRAFAMIPQFAY-QINGRRIDVNLYAASSVEVEL 430
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
D K+ + + P+ D +R+ + K S T +L RIP W S ++NG+
Sbjct: 431 DKKTRVSMTQETNYPI---DGQVRIVVE-PEKTSDFTIAL--RIPAW--SERTVVSVNGE 482
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
L G +L + +TW D++T++L + R + + QAI+ GP VLA S
Sbjct: 483 PLTDLLAGAYLPIHRTWEKGDEITVELDMRARLVELNE-------AQAIVRGPLVLARDS 535
Query: 639 -IGDWDITESATSLS 652
D D+ E++ +S
Sbjct: 536 RFKDGDVDEASVIVS 550
>gi|190333374|gb|ACE73687.1| hypothetical protein [Geobacillus stearothermophilus]
Length = 642
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 58/243 (23%), Positives = 106/243 (43%), Gaps = 22/243 (9%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ +R + + YAD ER+L NG + G+ + + L +
Sbjct: 322 DTAYAETCASIALVFWTRRMLELEMDGKYADVMERALYNGTISGMDLDGKKFFYVNPLEV 381
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFE-EEGKYPGVYIIQYI 514
P + + H P W CC + +G IY + + + +Y+ I
Sbjct: 382 WPKACERHDKRH-VKPVRQKWFSCACCPPNLARLIASIGHYIYLQTSDALFVHLYVGSDI 440
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ +D +S +I+ WD +R+T++ S G +L LRIP W GA+ T
Sbjct: 441 QTEIDGRSVKIMQETN----YPWDGTVRLTVSPESAGE---FTLGLRIPGW--CRGAEVT 491
Query: 575 LNGQD---LPLPSPGNFLSVTKTWSSDDKLTIQLPLTL-RTEAIQDDRPEYASIQAILYG 630
+NG+ +PL G + + + W D++ + P+ + R +A R + A+ G
Sbjct: 492 INGEKVDIVPLIKKG-YAYIRRVWQQGDEVKLYFPMPVERIKAHPQVRANAGKV-ALQRG 549
Query: 631 PYV 633
P V
Sbjct: 550 PIV 552
>gi|430748744|ref|YP_007211652.1| hypothetical protein Theco_0434 [Thermobacillus composti KWC4]
gi|430732709|gb|AGA56654.1| hypothetical protein Theco_0434 [Thermobacillus composti KWC4]
Length = 806
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 56/227 (24%), Positives = 91/227 (40%), Gaps = 14/227 (6%)
Query: 387 FNFKSD-PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GI 444
F F +D P LA E+C + ++ +R + R YAD ER+L N VL G+
Sbjct: 309 FTFDNDLPNDLA------YAETCASIVLIFWARRMLRLEARSEYADVMERALYNTVLAGM 362
Query: 445 QRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFE 500
R + + L + P +S + P W CC + L D IY
Sbjct: 363 ARDGKHFFYVNPLEVWPEASLKNPDRRHVKPIRQKWFGCSCCPPNVARLLASLDDYIYDI 422
Query: 501 EEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNL 560
+E V++ YI S + + V + WD + L+ S G + +L L
Sbjct: 423 DEAA-GRVHVHLYIGSEARFAAAGREVTLHQRSGLPWDGTVTFGLSVSG-GGAVRLALAL 480
Query: 561 RIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
R+P W + +NG+ P + V + W+ D+ +LP+
Sbjct: 481 RVPDWFQTAEPVLAVNGEACPYRMEKGYAVVEREWADGDRAEWRLPM 527
>gi|422783824|ref|ZP_16836607.1| hypothetical protein ERFG_04064 [Escherichia coli TW10509]
gi|323975001|gb|EGB70110.1| hypothetical protein ERFG_04064 [Escherichia coli TW10509]
Length = 656
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSHYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + + T
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQIT 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|423115429|ref|ZP_17103120.1| hypothetical protein HMPREF9689_03177 [Klebsiella oxytoca 10-5245]
gi|376381515|gb|EHS94252.1| hypothetical protein HMPREF9689_03177 [Klebsiella oxytoca 10-5245]
Length = 655
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 93/239 (38%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+ N VLG + Y+ PL
Sbjct: 334 DTAYGESCASIGLMMFARRMLEMEGDAHYADVMERAFYNTVLG-GMALDGKHFFYVNPLE 392
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S + P W CC + +G ++ + ++I Y
Sbjct: 393 TYPKSIPHNHIYDHIKPVRQRWFGCACCPPNIARTLVAIGHYLFTP---RRDALFINFYA 449
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
S + + K+ WD V +TFS + +L LR+P W + +
Sbjct: 450 GSEAQFTINDQPLALKISGNYPWDE--EVNITFSHP-QAIQHTLALRLPEWCEA--PQVL 504
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NG+ +L +T+ W D +T++LP+TLR A AI GP V
Sbjct: 505 INGEAAQGEQLKGYLHITRQWQQGDIITLRLPMTLRRVYANPLVRHNAGKVAIQRGPLV 563
>gi|354725692|ref|ZP_09039907.1| hypothetical protein EmorL2_22781 [Enterobacter mori LMG 25706]
Length = 649
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 101/240 (42%), Gaps = 17/240 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADGHYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 PGSSKERSYHHW---GTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
K +++H P W CC + LG IY + ++I Y
Sbjct: 388 V-HPKTLAFNHIFDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTVRQD---ALFINLY 443
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ + + G + ++ W +++ +T ++ +T +L LR+P W ++
Sbjct: 444 VGNDVAIPVGDETLALRISGNYPWHEQVKIDITSTAP---VTHTLALRLPDWGAT--PDV 498
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+ + +L +T++W D +T+ LP+ +R + A A+ GP V
Sbjct: 499 LLNGEAVTGEISRGYLYLTRSWQEGDVITLTLPMPVRRVYGNPQVRQQAGKVALQRGPLV 558
>gi|440285639|ref|YP_007338404.1| hypothetical protein D782_0140 [Enterobacteriaceae bacterium strain
FGI 57]
gi|440045161|gb|AGB76219.1| hypothetical protein D782_0140 [Enterobacteriaceae bacterium strain
FGI 57]
Length = 652
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 49/216 (22%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S + P W CC + +G IY + +Y+ Y+
Sbjct: 388 VHPKSLNFNHIYDHVKPVRQRWFGCACCPPNIARVLTSIGHYIYTPRD---EALYVNLYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ G + + W +++T+ S + +L LR+P W + +
Sbjct: 445 GNSVEIPVGNETLRLTISGNYPWQEQIKITI---DSPSPVQHTLALRLPDWCVN--PRVI 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG +L +++ W D LT+ LP+ +R
Sbjct: 500 LNGDAAEGTVEKGYLHLSRRWQEGDTLTLTLPMPIR 535
>gi|423230660|ref|ZP_17217064.1| hypothetical protein HMPREF1063_02884 [Bacteroides dorei
CL02T00C15]
gi|423244371|ref|ZP_17225446.1| hypothetical protein HMPREF1064_01652 [Bacteroides dorei
CL02T12C06]
gi|392630310|gb|EIY24303.1| hypothetical protein HMPREF1063_02884 [Bacteroides dorei
CL02T00C15]
gi|392641945|gb|EIY35717.1| hypothetical protein HMPREF1064_01652 [Bacteroides dorei
CL02T12C06]
Length = 811
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 122/286 (42%), Gaps = 48/286 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + YAD ER+L NGV+ G+ + Y PL
Sbjct: 340 ETCASIANVYWNHRMFLATGDAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 397
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER HW + CC G I F + Y+ + VY+ +I S+ D ++
Sbjct: 398 HER--QHWFGCA----CCLGN-ITRF--MASVPYYMYATQGNDVYVNLFIQSKADIETES 448
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
+N + WD + + +T + +L +RIP W ++ A+A
Sbjct: 449 NKINVEQTTGYPWDGKISIAVTPEKEQE---FALRVRIPGWAQDAPVPTDLYSFTDKAQA 505
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYASIQA 626
++NG + + ++ + W + D + I LP+ +R + ++DDR + A
Sbjct: 506 YSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVKANDQVEDDRGKL----A 561
Query: 627 ILYGP--YVLAGHSIGDWDITESATSLSDWI---TPIPASYNSQLI 667
I GP + L G D +T + +I TP+ ASY++ L+
Sbjct: 562 IERGPIMFCLEGQDQAD------STVFNKFIPDGTPMEASYDAGLL 601
>gi|423109493|ref|ZP_17097188.1| hypothetical protein HMPREF9687_02739 [Klebsiella oxytoca 10-5243]
gi|376382227|gb|EHS94961.1| hypothetical protein HMPREF9687_02739 [Klebsiella oxytoca 10-5243]
Length = 655
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 93/239 (38%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+ N VLG + Y+ PL
Sbjct: 334 DTAYGESCASIGLMMFARRMLEMEGDAHYADVMERAFYNTVLG-GMALDGKHFFYVNPLE 392
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S + P W CC + +G ++ + ++I Y
Sbjct: 393 TYPKSIPHNHIYDHIKPVRQRWFGCACCPPNIARTLVAIGHYLFTP---RRDALFINFYA 449
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
S + + K+ WD V +TFS + +L LR+P W + +
Sbjct: 450 GSEAQFTINDQPLALKISGNYPWDE--EVNITFSHP-QAVQHTLALRLPEWCEA--PQVL 504
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NG+ +L +T+ W D +T++LP+TLR A AI GP V
Sbjct: 505 INGEAAQGEQLKGYLHITRQWQQGDIITLRLPMTLRRVYANPLVRHNAGKVAIQRGPLV 563
>gi|408673627|ref|YP_006873375.1| protein of unknown function DUF1680 [Emticicia oligotrophica DSM
17448]
gi|387855251|gb|AFK03348.1| protein of unknown function DUF1680 [Emticicia oligotrophica DSM
17448]
Length = 652
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 59/259 (22%), Positives = 110/259 (42%), Gaps = 36/259 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAP-GS 463
E+C + M+ ++ + T E Y D ERSL NG L G+ + Y PLA G
Sbjct: 331 ETCASVGMVFWNQRMNALTGESKYIDVLERSLYNGALDGLSLSGDR--FFYGNPLASIGR 388
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
R + +GT CC + LGD IY + E G+++ ++ S + K G
Sbjct: 389 HARREW--FGTA-----CCPSNIARLVASLGDYIYGKSEN---GIWVNLFVGSNTNIKLG 438
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL-------- 575
+ ++ + +++++ S+K +L++RIP+WT++ L
Sbjct: 439 NTEILTSIETNYPLNGKVKISMNPSTK---TKYTLHVRIPSWTTNEPVAGNLYHYLGNYA 495
Query: 576 -------NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAIL 628
NG+ + + + + WS+ D ++ +LP+ +R +++ + A+
Sbjct: 496 ANIAMMVNGRKIDYKIENGYAIIDREWSAGDIVSFELPMDVRKIVARNELKQDNDRMALQ 555
Query: 629 YGPYVLAGHSIGD----WD 643
GP V I + WD
Sbjct: 556 RGPLVYCVEGIDNEGKAWD 574
>gi|296100552|ref|YP_003610698.1| hypothetical protein ECL_00181 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
gi|295055011|gb|ADF59749.1| hypothetical protein ECL_00181 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
Length = 651
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 54/239 (22%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSRYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + + P W CC + LG IY + ++I Y+
Sbjct: 388 VHPRTLAFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIY---TVRPDALFINLYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ + G + ++ W + + + + +T +L LR+P W ++ +
Sbjct: 445 GNEVTIPVGDETLKLRISGNYPWQEEVNIEI---ASPVPVTHTLALRLPDWCAN--PHVS 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+ + +L +T+ W D LT+ LP+ +R + A A+ GP V
Sbjct: 500 LNGEGMTGEVSRGYLHLTRRWQEGDTLTLTLPMPVRRVYGHPQVRQQAGKVALQRGPLV 558
>gi|423299822|ref|ZP_17277847.1| hypothetical protein HMPREF1057_00988 [Bacteroides finegoldii
CL09T03C10]
gi|408473631|gb|EKJ92153.1| hypothetical protein HMPREF1057_00988 [Bacteroides finegoldii
CL09T03C10]
Length = 698
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/216 (28%), Positives = 94/216 (43%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWKS-GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
WK G++ + Q+ D WD +RVTL + + +G T SL LRIP W A T+N
Sbjct: 494 -TWKGKGEVALTQETD--YPWDGNVRVTLDKAPRKAG-TFSLFLRIPEWCEK--ATLTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
GQ L + + N + V + W D +L + +P+ L
Sbjct: 548 GQPLQVNAKANSYAEVNRAWKKGDVVELVMNMPVRL 583
>gi|345514174|ref|ZP_08793688.1| six-hairpin glycosidase, partial [Bacteroides dorei 5_1_36/D4]
gi|345456089|gb|EEO48255.2| six-hairpin glycosidase [Bacteroides dorei 5_1_36/D4]
Length = 810
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 122/286 (42%), Gaps = 48/286 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + YAD ER+L NGV+ G+ + Y PL
Sbjct: 340 ETCASIANVYWNHRMFLATGDAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 397
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER HW + CC G I F + Y+ + VY+ +I S+ D ++
Sbjct: 398 HER--QHWFGCA----CCPGN-ITRF--MASVPYYMYATQGNDVYVNLFIQSKADIETES 448
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
+N + WD + + +T + +L +RIP W ++ A+A
Sbjct: 449 NKINVEQTTGYPWDGKISIAVTPEKEQE---FALRVRIPGWAQDAPVPTDLYSFTDKAQA 505
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYASIQA 626
++NG + + ++ + W + D + I LP+ +R + ++DDR + A
Sbjct: 506 YSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVKANDQVEDDRGKL----A 561
Query: 627 ILYGP--YVLAGHSIGDWDITESATSLSDWI---TPIPASYNSQLI 667
I GP + L G D +T + +I TP+ ASY++ L+
Sbjct: 562 IERGPIMFCLEGQDQAD------STVFNKFIPDGTPMEASYDADLL 601
>gi|159041539|ref|YP_001540791.1| hypothetical protein Cmaq_0969 [Caldivirga maquilingensis IC-167]
gi|157920374|gb|ABW01801.1| protein of unknown function DUF1680 [Caldivirga maquilingensis
IC-167]
Length = 634
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/257 (26%), Positives = 104/257 (40%), Gaps = 25/257 (9%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D E+C + + + T + YAD E +L N L GI + Y+ PL
Sbjct: 320 DRAYSETCAAVANVMWNYRMLLATGDAKYADIMELALYNAALAGIS--LDGKSYFYVNPL 377
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
A R +H P CC + L IY GV+I YI+S
Sbjct: 378 A-----NRGWHR-RQPWFDVACCPPNIARLIASLPGYIYSTSSD---GVWIHLYIASEAK 428
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG-- 577
+V KV+ WD ++VT+ S + ++ LRIP W S G K +NG
Sbjct: 429 VNLNGGIVELKVNTDYPWDGEVKVTVNPSKEDE---FTIYLRIPGW--SRGGKLLINGVE 483
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
Q + L P +L V +TW S D++ +++P+++ A + AI GP V
Sbjct: 484 QGVEL-KPSTYLGVKRTWRSGDEVILRIPMSIELIASHPHVLANTARVAIKRGPLVYCLE 542
Query: 638 SIGD-----WDITESAT 649
+ + WDI T
Sbjct: 543 QVDNPGVDVWDIVLKRT 559
>gi|152968091|ref|YP_001363875.1| hypothetical protein Krad_4148 [Kineococcus radiotolerans SRS30216]
gi|151362608|gb|ABS05611.1| protein of unknown function DUF1680 [Kineococcus radiotolerans
SRS30216]
Length = 652
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 102/237 (43%), Gaps = 21/237 (8%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGS- 463
E+C ++ + + T E YAD ER+L N L G+ + L L G+
Sbjct: 333 ETCAAIGSVQWTWRMLLATGEARYADLVERTLYNAFLPGVSLAGTEYFYVNALQLRHGAF 392
Query: 464 -SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEE-GKYPGVYIIQYISSRLDWK 521
+ERS H P CC + + S L + GV + Q+ + ++
Sbjct: 393 AEEERSVAHGRRPWFDCACCPPNIMRTLSSLDAYVATSSATDGVAGVQVHQFTTGTIEAA 452
Query: 522 SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP 581
+ V WD +RV +T + L LR+P W + GA AT++G+ +
Sbjct: 453 GAALSVTTDY----PWDGTVRVEVTATPG----EFELALRVPAW--AQGATATVDGEAVA 502
Query: 582 LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY---GPYVLA 635
+ +PG +L V + ++ D + + LP+T+R + + P +++ + GP V A
Sbjct: 503 V-TPGEYLRVRRDFAVGDVVELVLPMTVR---VVEADPRVDAVRGCVVVERGPLVYA 555
>gi|255012840|ref|ZP_05284966.1| hypothetical protein B2_02969 [Bacteroides sp. 2_1_7]
gi|410102232|ref|ZP_11297159.1| hypothetical protein HMPREF0999_00931 [Parabacteroides sp. D25]
gi|409238954|gb|EKN31742.1| hypothetical protein HMPREF0999_00931 [Parabacteroides sp. D25]
Length = 618
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 105/244 (43%), Gaps = 23/244 (9%)
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLL 457
NLD+ E +C + M+ ++ + + T + Y D ERSL NG L GI G + Y+
Sbjct: 330 NLDAYCE-TCASVGMVLWNQRMNQLTGDSKYIDVLERSLYNGALAGISLGGDR--FFYVN 386
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL R W + CC +G+ IY + +++ YI +
Sbjct: 387 PLESKGDHHR--QEWYGCA----CCPSQLSRFLPSIGNYIYASSDD---ALWVNLYIGNT 437
Query: 518 LDWKSGQ--IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
+ G+ I++ Q+ D WD +++T++ S L + LRIP W + ++
Sbjct: 438 GQIRIGETDILLTQETD--YPWDGSVKLTISTSQP---LEKEIRLRIPDWCKT--YDLSI 490
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
NG+ + +P + +V K W S D + + + + + A E +AI GP V
Sbjct: 491 NGKRINVPKEKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFDKRAIQRGPLVYC 549
Query: 636 GHSI 639
I
Sbjct: 550 MEEI 553
>gi|222530205|ref|YP_002574087.1| hypothetical protein Athe_2242 [Caldicellulosiruptor bescii DSM
6725]
gi|222457052|gb|ACM61314.1| protein of unknown function DUF1680 [Caldicellulosiruptor bescii
DSM 6725]
Length = 652
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/245 (22%), Positives = 101/245 (41%), Gaps = 20/245 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG--IQRGTEPGVMIYLLP 458
D+ E+C + ++ + L + Y D ER+L N V+G Q G + Y+ P
Sbjct: 332 DTAYAETCASVGLIFFAHRLNKIEPHAKYYDVVERALYNTVIGSMSQDGKK---YFYVNP 388
Query: 459 LA--PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
L P ++R H P W CC + LG +Y + G+Y+
Sbjct: 389 LEVYPKEVEKRFDRHHVKPERQPWFGCACCPPNVARLLASLGRYVY---SYNHDGIYVNL 445
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
YI S + + G I V + ++ +++ L S + L LRIP W S +
Sbjct: 446 YIGSSVQVEVGGIKVLLQQVSSYPFEDMVKIDLKPSKEAR---FKLYLRIPGWCES--YE 500
Query: 573 ATLNG-QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+NG ++ P P ++ + + W +D++ +++P ++ + A++ GP
Sbjct: 501 VYVNGKKEEPEEPPSGYVCIERLWKENDQVVLKIPTEVKMVSSHPQVRSNVGKVAVVKGP 560
Query: 632 YVLAG 636
V
Sbjct: 561 VVFCA 565
>gi|423313151|ref|ZP_17291087.1| hypothetical protein HMPREF1058_01699 [Bacteroides vulgatus
CL09T03C04]
gi|392686365|gb|EIY79671.1| hypothetical protein HMPREF1058_01699 [Bacteroides vulgatus
CL09T03C04]
Length = 811
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 122/286 (42%), Gaps = 48/286 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + YAD ER+L NGV+ G+ + Y PL
Sbjct: 340 ETCASIANVYWNHRMFLATGDAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 397
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER HW + CC G I F + Y+ + VY+ +I S+ D ++
Sbjct: 398 HER--QHWFGCA----CCPGN-ITRF--MASVPYYMYATQGNDVYVNLFIQSKADIETES 448
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
+N + WD + + +T + +L +RIP W ++ A+A
Sbjct: 449 NKINVEQTTGYPWDGKISIAVTPEKEQE---FALRVRIPGWAQDAPVPTDLYSFTDKAQA 505
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYASIQA 626
++NG + + ++ + W + D + I LP+ +R + ++DDR + A
Sbjct: 506 YSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVKANDQVEDDRGKL----A 561
Query: 627 ILYGP--YVLAGHSIGDWDITESATSLSDWI---TPIPASYNSQLI 667
I GP + L G D +T + +I TP+ ASY++ L+
Sbjct: 562 IERGPIMFCLEGQDQAD------STVFNKFIPDGTPMEASYDAGLL 601
>gi|237711356|ref|ZP_04541837.1| six-hairpin glycosidase [Bacteroides sp. 9_1_42FAA]
gi|229454051|gb|EEO59772.1| six-hairpin glycosidase [Bacteroides sp. 9_1_42FAA]
Length = 806
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 122/286 (42%), Gaps = 48/286 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + YAD ER+L NGV+ G+ + Y PL
Sbjct: 335 ETCASIANVYWNHRMFLATGDAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 392
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER HW + CC G I F + Y+ + VY+ +I S+ D ++
Sbjct: 393 HER--QHWFGCA----CCPGN-ITRF--MASVPYYMYATQGNDVYVNLFIQSKADIETES 443
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
+N + WD + + +T + +L +RIP W ++ A+A
Sbjct: 444 NKINVEQTTGYPWDGKISIAVTPEKEQE---FALRVRIPGWAQDAPVPTDLYSFTDKAQA 500
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYASIQA 626
++NG + + ++ + W + D + I LP+ +R + ++DDR + A
Sbjct: 501 YSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVKANDQVEDDRGKL----A 556
Query: 627 ILYGP--YVLAGHSIGDWDITESATSLSDWI---TPIPASYNSQLI 667
I GP + L G D +T + +I TP+ ASY++ L+
Sbjct: 557 IERGPIMFCLEGQDQAD------STVFNKFIPDGTPMEASYDAGLL 596
>gi|432817355|ref|ZP_20051112.1| hypothetical protein A1Y1_03761 [Escherichia coli KTE115]
gi|431361237|gb|ELG47834.1| hypothetical protein A1Y1_03761 [Escherichia coli KTE115]
Length = 656
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|432682342|ref|ZP_19917698.1| hypothetical protein A1YW_04093 [Escherichia coli KTE143]
gi|431217316|gb|ELF14895.1| hypothetical protein A1YW_04093 [Escherichia coli KTE143]
Length = 659
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|301020201|ref|ZP_07184325.1| conserved hypothetical protein [Escherichia coli MS 69-1]
gi|300398864|gb|EFJ82402.1| conserved hypothetical protein [Escherichia coli MS 69-1]
Length = 664
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 94/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + K + P W CC + +G +Y E +YI Y
Sbjct: 396 VHPKTLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 452
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 453 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 507
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+++ +L +T+ W D L + LP+ +R A AI GP V
Sbjct: 508 LNGEEVGQDIRKGYLHITREWQEGDTLNLTLPMPVRRVYGNPQVRHVAGKVAIQRGPLV 566
>gi|294777480|ref|ZP_06742931.1| putative lipoprotein [Bacteroides vulgatus PC510]
gi|294448548|gb|EFG17097.1| putative lipoprotein [Bacteroides vulgatus PC510]
Length = 811
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 122/286 (42%), Gaps = 48/286 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + YAD ER+L NGV+ G+ + Y PL
Sbjct: 340 ETCASIANVYWNHRMFLATGDAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 397
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER HW + CC G I F + Y+ + VY+ +I S+ D ++
Sbjct: 398 HER--QHWFGCA----CCPGN-ITRF--MASVPYYMYATQGNDVYVNLFIQSKADIETES 448
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
+N + WD + + +T + +L +RIP W ++ A+A
Sbjct: 449 NKINVEQTTGYPWDGKISIAVTPEKEQE---FALRVRIPGWAQDAPVPTDLYSFTDKAQA 505
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYASIQA 626
++NG + + ++ + W + D + I LP+ +R + ++DDR + A
Sbjct: 506 YSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVKANDQVEDDRGKL----A 561
Query: 627 ILYGP--YVLAGHSIGDWDITESATSLSDWI---TPIPASYNSQLI 667
I GP + L G D +T + +I TP+ ASY++ L+
Sbjct: 562 IERGPIIFCLEGQDQAD------STVFNKFIPDGTPMEASYDAGLL 601
>gi|300937197|ref|ZP_07152048.1| conserved hypothetical protein [Escherichia coli MS 21-1]
gi|300457729|gb|EFK21222.1| conserved hypothetical protein [Escherichia coli MS 21-1]
Length = 667
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 396 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 452
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 453 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 507
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 508 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 543
>gi|319640078|ref|ZP_07994805.1| hypothetical protein HMPREF9011_00402 [Bacteroides sp. 3_1_40A]
gi|345517097|ref|ZP_08796575.1| six-hairpin glycosidase [Bacteroides sp. 4_3_47FAA]
gi|254833866|gb|EET14175.1| six-hairpin glycosidase [Bacteroides sp. 4_3_47FAA]
gi|317388356|gb|EFV69208.1| hypothetical protein HMPREF9011_00402 [Bacteroides sp. 3_1_40A]
Length = 811
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 122/286 (42%), Gaps = 48/286 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + YAD ER+L NGV+ G+ + Y PL
Sbjct: 340 ETCASIANVYWNHRMFLATGDAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 397
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER HW + CC G I F + Y+ + VY+ +I S+ D ++
Sbjct: 398 HER--QHWFGCA----CCPGN-ITRF--MASVPYYMYATQGNDVYVNLFIQSKADIETES 448
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
+N + WD + + +T + +L +RIP W ++ A+A
Sbjct: 449 NKINVEQTTGYPWDGKISIAVTPEKEQE---FALRVRIPGWAQDAPVPTDLYSFTDKAQA 505
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYASIQA 626
++NG + + ++ + W + D + I LP+ +R + ++DDR + A
Sbjct: 506 YSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVKANDQVEDDRGKL----A 561
Query: 627 ILYGP--YVLAGHSIGDWDITESATSLSDWI---TPIPASYNSQLI 667
I GP + L G D +T + +I TP+ ASY++ L+
Sbjct: 562 IERGPIMFCLEGQDQAD------STVFNKFIPDGTPMEASYDAGLL 601
>gi|150003698|ref|YP_001298442.1| hypothetical protein BVU_1129 [Bacteroides vulgatus ATCC 8482]
gi|149932122|gb|ABR38820.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 811
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 122/286 (42%), Gaps = 48/286 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + YAD ER+L NGV+ G+ + Y PL
Sbjct: 340 ETCASIANVYWNHRMFLATGDAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 397
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER HW + CC G I F + Y+ + VY+ +I S+ D ++
Sbjct: 398 HER--QHWFGCA----CCPGN-ITRF--MASVPYYMYATQGNDVYVNLFIQSKADIETES 448
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
+N + WD + + +T + +L +RIP W ++ A+A
Sbjct: 449 NKINVEQTTGYPWDGKISIAVTPEKEQE---FALRVRIPGWAQDAPVPTDLYSFTDKAQA 505
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYASIQA 626
++NG + + ++ + W + D + I LP+ +R + ++DDR + A
Sbjct: 506 YSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVKANDQVEDDRGKL----A 561
Query: 627 ILYGP--YVLAGHSIGDWDITESATSLSDWI---TPIPASYNSQLI 667
I GP + L G D +T + +I TP+ ASY++ L+
Sbjct: 562 IERGPIMFCLEGQDQAD------STVFNKFIPDGTPMEASYDAGLL 601
>gi|300822009|ref|ZP_07102152.1| conserved hypothetical protein [Escherichia coli MS 119-7]
gi|331679667|ref|ZP_08380337.1| putative cytoplasmic protein [Escherichia coli H591]
gi|300525372|gb|EFK46441.1| conserved hypothetical protein [Escherichia coli MS 119-7]
gi|331072839|gb|EGI44164.1| putative cytoplasmic protein [Escherichia coli H591]
Length = 667
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 396 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 452
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 453 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 507
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 508 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 543
>gi|432672680|ref|ZP_19908201.1| hypothetical protein A1Y7_04240 [Escherichia coli KTE119]
gi|431207880|gb|ELF06125.1| hypothetical protein A1Y7_04240 [Escherichia coli KTE119]
Length = 656
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|416822592|ref|ZP_11895028.1| hypothetical protein ECO5905_02787 [Escherichia coli O55:H7 str.
USDA 5905]
gi|425251470|ref|ZP_18644405.1| hypothetical protein EC5905_5091 [Escherichia coli 5905]
gi|320661682|gb|EFX29097.1| hypothetical protein ECO5905_02787 [Escherichia coli O55:H7 str.
USDA 5905]
gi|408161718|gb|EKH89653.1| hypothetical protein EC5905_5091 [Escherichia coli 5905]
Length = 656
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|297520697|ref|ZP_06939083.1| hypothetical protein EcolOP_23892 [Escherichia coli OP50]
Length = 563
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 94/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 233 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 291
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 292 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 348
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 349 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 403
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+++ +L +T+ W D L + LP+ +R A AI GP V
Sbjct: 404 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVRRVYGNPLVRHVAGKVAIQRGPLV 462
>gi|432836527|ref|ZP_20070058.1| hypothetical protein A1YO_03904 [Escherichia coli KTE136]
gi|431382143|gb|ELG66487.1| hypothetical protein A1YO_03904 [Escherichia coli KTE136]
Length = 659
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|423240714|ref|ZP_17221828.1| hypothetical protein HMPREF1065_02451 [Bacteroides dorei
CL03T12C01]
gi|392643676|gb|EIY37425.1| hypothetical protein HMPREF1065_02451 [Bacteroides dorei
CL03T12C01]
Length = 811
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 118/282 (41%), Gaps = 40/282 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + YAD ER+L NGV+ G+ + Y PL
Sbjct: 340 ETCASIANVYWNHRMFLATGDAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 397
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER HW + CC G I F + Y+ + VY+ +I S+ D ++
Sbjct: 398 HER--QHWFGCA----CCPGN-ITRF--MASVPYYMYATQGNDVYVNLFIQSKADIETES 448
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
+N + WD + + +T + +L +RIP WT ++ A+A
Sbjct: 449 NKINVEQTTGYPWDGKISIAVTPEKEQE---FALRVRIPGWTQDAPVPTDLYSFTDKAQA 505
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
++NG + + ++ + W + D + I LP+ +R D + AI G
Sbjct: 506 YSISVNGFKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVKANDQVEDDHGKLAIERG 565
Query: 631 P--YVLAGHSIGDWDITESATSLSDWI---TPIPASYNSQLI 667
P + L G D +T + +I TP+ ASY++ L+
Sbjct: 566 PIMFCLEGQDQAD------STVFNKFIPDGTPMEASYDADLL 601
>gi|254163510|ref|YP_003046618.1| hypothetical protein ECB_03438 [Escherichia coli B str. REL606]
gi|253975411|gb|ACT41082.1| conserved hypothetical protein [Escherichia coli B str. REL606]
Length = 659
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|432949979|ref|ZP_20144543.1| hypothetical protein A153_04333 [Escherichia coli KTE196]
gi|433045129|ref|ZP_20232605.1| hypothetical protein WIG_03662 [Escherichia coli KTE117]
gi|431453768|gb|ELH34151.1| hypothetical protein A153_04333 [Escherichia coli KTE196]
gi|431552786|gb|ELI26734.1| hypothetical protein WIG_03662 [Escherichia coli KTE117]
Length = 659
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|432451832|ref|ZP_19694088.1| hypothetical protein A13W_02801 [Escherichia coli KTE193]
gi|433035497|ref|ZP_20223187.1| hypothetical protein WIC_04061 [Escherichia coli KTE112]
gi|430977578|gb|ELC94414.1| hypothetical protein A13W_02801 [Escherichia coli KTE193]
gi|431546634|gb|ELI21027.1| hypothetical protein WIC_04061 [Escherichia coli KTE112]
Length = 656
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|419864579|ref|ZP_14387018.1| hypothetical protein ECO9340_21841 [Escherichia coli O103:H25 str.
CVM9340]
gi|388339862|gb|EIL06180.1| hypothetical protein ECO9340_21841 [Escherichia coli O103:H25 str.
CVM9340]
Length = 659
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|416342142|ref|ZP_11676508.1| hypothetical protein ECoL_01429 [Escherichia coli EC4100B]
gi|419280237|ref|ZP_13822479.1| hypothetical protein ECDEC10E_4232 [Escherichia coli DEC10E]
gi|419347353|ref|ZP_13888721.1| hypothetical protein ECDEC13A_3946 [Escherichia coli DEC13A]
gi|419351812|ref|ZP_13893141.1| hypothetical protein ECDEC13B_3790 [Escherichia coli DEC13B]
gi|419357284|ref|ZP_13898530.1| hypothetical protein ECDEC13C_4352 [Escherichia coli DEC13C]
gi|419362259|ref|ZP_13903466.1| hypothetical protein ECDEC13D_4067 [Escherichia coli DEC13D]
gi|419367374|ref|ZP_13908523.1| hypothetical protein ECDEC13E_4123 [Escherichia coli DEC13E]
gi|419377671|ref|ZP_13918688.1| hypothetical protein ECDEC14B_4282 [Escherichia coli DEC14B]
gi|419383008|ref|ZP_13923950.1| hypothetical protein ECDEC14C_4191 [Escherichia coli DEC14C]
gi|419388306|ref|ZP_13929174.1| hypothetical protein ECDEC14D_4146 [Escherichia coli DEC14D]
gi|425424537|ref|ZP_18805687.1| hypothetical protein EC01288_3892 [Escherichia coli 0.1288]
gi|432535989|ref|ZP_19772946.1| hypothetical protein A193_04433 [Escherichia coli KTE234]
gi|432811308|ref|ZP_20045165.1| hypothetical protein A1WM_02459 [Escherichia coli KTE101]
gi|320201393|gb|EFW75974.1| hypothetical protein ECoL_01429 [Escherichia coli EC4100B]
gi|378125150|gb|EHW86553.1| hypothetical protein ECDEC10E_4232 [Escherichia coli DEC10E]
gi|378182886|gb|EHX43534.1| hypothetical protein ECDEC13A_3946 [Escherichia coli DEC13A]
gi|378195992|gb|EHX56482.1| hypothetical protein ECDEC13C_4352 [Escherichia coli DEC13C]
gi|378196853|gb|EHX57338.1| hypothetical protein ECDEC13B_3790 [Escherichia coli DEC13B]
gi|378199461|gb|EHX59926.1| hypothetical protein ECDEC13D_4067 [Escherichia coli DEC13D]
gi|378210031|gb|EHX70398.1| hypothetical protein ECDEC13E_4123 [Escherichia coli DEC13E]
gi|378215636|gb|EHX75932.1| hypothetical protein ECDEC14B_4282 [Escherichia coli DEC14B]
gi|378224949|gb|EHX85150.1| hypothetical protein ECDEC14C_4191 [Escherichia coli DEC14C]
gi|378228861|gb|EHX89012.1| hypothetical protein ECDEC14D_4146 [Escherichia coli DEC14D]
gi|408341050|gb|EKJ55523.1| hypothetical protein EC01288_3892 [Escherichia coli 0.1288]
gi|431057624|gb|ELD67052.1| hypothetical protein A193_04433 [Escherichia coli KTE234]
gi|431360470|gb|ELG47081.1| hypothetical protein A1WM_02459 [Escherichia coli KTE101]
Length = 656
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|300920475|ref|ZP_07136906.1| conserved hypothetical protein [Escherichia coli MS 115-1]
gi|300412519|gb|EFJ95829.1| conserved hypothetical protein [Escherichia coli MS 115-1]
Length = 664
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 396 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 452
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 453 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 507
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 508 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 543
>gi|432604420|ref|ZP_19840650.1| hypothetical protein A1U5_04274 [Escherichia coli KTE66]
gi|431137800|gb|ELE39645.1| hypothetical protein A1U5_04274 [Escherichia coli KTE66]
Length = 654
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|422768624|ref|ZP_16822348.1| hypothetical protein ERCG_03884 [Escherichia coli E1520]
gi|323934869|gb|EGB31251.1| hypothetical protein ERCG_03884 [Escherichia coli E1520]
Length = 659
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|417141197|ref|ZP_11984110.1| putative glycosyhydrolase [Escherichia coli 97.0259]
gi|417310126|ref|ZP_12096949.1| hypothetical protein PPECC33_35210 [Escherichia coli PCN033]
gi|338768332|gb|EGP23129.1| hypothetical protein PPECC33_35210 [Escherichia coli PCN033]
gi|386155687|gb|EIH12037.1| putative glycosyhydrolase [Escherichia coli 97.0259]
Length = 654
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|331655213|ref|ZP_08356212.1| putative cytoplasmic protein [Escherichia coli M718]
gi|331047228|gb|EGI19306.1| putative cytoplasmic protein [Escherichia coli M718]
Length = 664
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 396 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 452
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 453 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 507
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 508 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 543
>gi|251786831|ref|YP_003001135.1| ybl149 [Escherichia coli BL21(DE3)]
gi|242379104|emb|CAQ33906.1| ybl149 [Escherichia coli BL21(DE3)]
Length = 667
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 396 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 452
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 453 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 507
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 508 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 543
>gi|422829813|ref|ZP_16877977.1| hypothetical protein ESNG_02482 [Escherichia coli B093]
gi|371607765|gb|EHN96330.1| hypothetical protein ESNG_02482 [Escherichia coli B093]
Length = 659
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|432752040|ref|ZP_19986617.1| hypothetical protein WEQ_03462 [Escherichia coli KTE29]
gi|431293661|gb|ELF83953.1| hypothetical protein WEQ_03462 [Escherichia coli KTE29]
Length = 659
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|422334703|ref|ZP_16415708.1| hypothetical protein HMPREF0986_04202 [Escherichia coli 4_1_47FAA]
gi|432871119|ref|ZP_20091498.1| hypothetical protein A313_02338 [Escherichia coli KTE147]
gi|373244312|gb|EHP63799.1| hypothetical protein HMPREF0986_04202 [Escherichia coli 4_1_47FAA]
gi|431408324|gb|ELG91511.1| hypothetical protein A313_02338 [Escherichia coli KTE147]
Length = 654
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDMLNLTLPMPVR 535
>gi|295098715|emb|CBK87805.1| Uncharacterized protein conserved in bacteria [Enterobacter cloacae
subsp. cloacae NCTC 9394]
Length = 657
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 95/240 (39%), Gaps = 17/240 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DTVYAESCASIGLMMFARRMLEMEADGHYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ-Y 513
P + + P W CC + LG IY P +I Y
Sbjct: 396 VHPKTLAFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTVR----PDALLINLY 451
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ + + G ++ ++ W +++ +T + +L LR+P W +
Sbjct: 452 VGNDVAIPVGDNILQLRISGNYPWHEQVKIEITSPVP---VIHTLALRLPDWCAE--PAV 506
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+LNGQ + +L + ++W D LT+ LP+ +R + A A+ GP V
Sbjct: 507 SLNGQAITGEVSRGYLYLNRSWQEGDTLTLTLPMPVRRVYGNPQVRQQAGKVALQRGPLV 566
>gi|419924680|ref|ZP_14442556.1| hypothetical protein EC54115_16625 [Escherichia coli 541-15]
gi|388389076|gb|EIL50615.1| hypothetical protein EC54115_16625 [Escherichia coli 541-15]
Length = 659
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|425263519|ref|ZP_18655509.1| hypothetical protein ECEC96038_4736 [Escherichia coli EC96038]
gi|408177761|gb|EKI04521.1| hypothetical protein ECEC96038_4736 [Escherichia coli EC96038]
Length = 656
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|417631018|ref|ZP_12281252.1| hypothetical protein ECSTECMHI813_3969 [Escherichia coli
STEC_MHI813]
gi|345370297|gb|EGX02275.1| hypothetical protein ECSTECMHI813_3969 [Escherichia coli
STEC_MHI813]
Length = 656
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|422836105|ref|ZP_16884154.1| hypothetical protein ESOG_03755 [Escherichia coli E101]
gi|371609666|gb|EHN98200.1| hypothetical protein ESOG_03755 [Escherichia coli E101]
Length = 656
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|194435948|ref|ZP_03068051.1| conserved hypothetical protein [Escherichia coli 101-1]
gi|253771579|ref|YP_003034410.1| hypothetical protein ECBD_0148 [Escherichia coli
'BL21-Gold(DE3)pLysS AG']
gi|254290260|ref|YP_003056008.1| hypothetical protein ECD_03438 [Escherichia coli BL21(DE3)]
gi|422788952|ref|ZP_16841686.1| hypothetical protein ERGG_04098 [Escherichia coli H489]
gi|442600526|ref|ZP_21018201.1| Putative glycosyl hydrolase of unknown function (DUF1680)
[Escherichia coli O5:K4(L):H4 str. ATCC 23502]
gi|194425491|gb|EDX41475.1| conserved hypothetical protein [Escherichia coli 101-1]
gi|253322623|gb|ACT27225.1| protein of unknown function DUF1680 [Escherichia coli
'BL21-Gold(DE3)pLysS AG']
gi|253979567|gb|ACT45237.1| conserved hypothetical protein [Escherichia coli BL21(DE3)]
gi|323959403|gb|EGB55063.1| hypothetical protein ERGG_04098 [Escherichia coli H489]
gi|441650536|emb|CCQ03630.1| Putative glycosyl hydrolase of unknown function (DUF1680)
[Escherichia coli O5:K4(L):H4 str. ATCC 23502]
Length = 659
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|168785451|ref|ZP_02810458.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC869]
gi|261224895|ref|ZP_05939176.1| hypothetical protein EscherichiacoliO157_09907 [Escherichia coli
O157:H7 str. FRIK2000]
gi|261254205|ref|ZP_05946738.1| hypothetical protein EscherichiacoliO157EcO_00065 [Escherichia coli
O157:H7 str. FRIK966]
gi|419100283|ref|ZP_13645472.1| hypothetical protein ECDEC4D_4507 [Escherichia coli DEC4D]
gi|420277651|ref|ZP_14779931.1| hypothetical protein ECPA40_4914 [Escherichia coli PA40]
gi|421826457|ref|ZP_16261810.1| hypothetical protein ECFRIK920_4874 [Escherichia coli FRIK920]
gi|424092641|ref|ZP_17828567.1| hypothetical protein ECFRIK1996_4807 [Escherichia coli FRIK1996]
gi|424105524|ref|ZP_17840261.1| hypothetical protein ECFRIK1990_4921 [Escherichia coli FRIK1990]
gi|424470965|ref|ZP_17920770.1| hypothetical protein ECPA41_4862 [Escherichia coli PA41]
gi|424496110|ref|ZP_17943684.1| hypothetical protein ECTW09195_4922 [Escherichia coli TW09195]
gi|425182551|ref|ZP_18580237.1| hypothetical protein ECFRIK1999_4970 [Escherichia coli FRIK1999]
gi|425195581|ref|ZP_18592342.1| hypothetical protein ECNE1487_5176 [Escherichia coli NE1487]
gi|425208438|ref|ZP_18604226.1| hypothetical protein ECFRIK2001_5178 [Escherichia coli FRIK2001]
gi|425245279|ref|ZP_18638577.1| hypothetical protein ECMA6_4977 [Escherichia coli MA6]
gi|428949368|ref|ZP_19021633.1| hypothetical protein EC881467_4850 [Escherichia coli 88.1467]
gi|428973751|ref|ZP_19044065.1| hypothetical protein EC900039_4634 [Escherichia coli 90.0039]
gi|429004396|ref|ZP_19072475.1| hypothetical protein EC950183_4871 [Escherichia coli 95.0183]
gi|429035002|ref|ZP_19100516.1| hypothetical protein EC960939_4827 [Escherichia coli 96.0939]
gi|429069551|ref|ZP_19132995.1| hypothetical protein EC990672_4784 [Escherichia coli 99.0672]
gi|189374407|gb|EDU92823.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC869]
gi|377938510|gb|EHV02277.1| hypothetical protein ECDEC4D_4507 [Escherichia coli DEC4D]
gi|390638393|gb|EIN17905.1| hypothetical protein ECFRIK1996_4807 [Escherichia coli FRIK1996]
gi|390660758|gb|EIN38450.1| hypothetical protein ECFRIK1990_4921 [Escherichia coli FRIK1990]
gi|390756526|gb|EIO26037.1| hypothetical protein ECPA40_4914 [Escherichia coli PA40]
gi|390764034|gb|EIO33252.1| hypothetical protein ECPA41_4862 [Escherichia coli PA41]
gi|390824028|gb|EIO90037.1| hypothetical protein ECTW09195_4922 [Escherichia coli TW09195]
gi|408064841|gb|EKG99322.1| hypothetical protein ECFRIK920_4874 [Escherichia coli FRIK920]
gi|408095070|gb|EKH28064.1| hypothetical protein ECFRIK1999_4970 [Escherichia coli FRIK1999]
gi|408106180|gb|EKH38296.1| hypothetical protein ECNE1487_5176 [Escherichia coli NE1487]
gi|408119214|gb|EKH50301.1| hypothetical protein ECFRIK2001_5178 [Escherichia coli FRIK2001]
gi|408157817|gb|EKH85958.1| hypothetical protein ECMA6_4977 [Escherichia coli MA6]
gi|427205698|gb|EKV75938.1| hypothetical protein EC881467_4850 [Escherichia coli 88.1467]
gi|427225134|gb|EKV93792.1| hypothetical protein EC900039_4634 [Escherichia coli 90.0039]
gi|427256997|gb|EKW23140.1| hypothetical protein EC950183_4871 [Escherichia coli 95.0183]
gi|427281172|gb|EKW45506.1| hypothetical protein EC960939_4827 [Escherichia coli 96.0939]
gi|427316599|gb|EKW78533.1| hypothetical protein EC990672_4784 [Escherichia coli 99.0672]
Length = 656
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|432491369|ref|ZP_19733231.1| hypothetical protein A171_03302 [Escherichia coli KTE213]
gi|432841396|ref|ZP_20074855.1| hypothetical protein A1YQ_04362 [Escherichia coli KTE140]
gi|433205327|ref|ZP_20389073.1| hypothetical protein WGY_03902 [Escherichia coli KTE95]
gi|431018040|gb|ELD31485.1| hypothetical protein A171_03302 [Escherichia coli KTE213]
gi|431386628|gb|ELG70584.1| hypothetical protein A1YQ_04362 [Escherichia coli KTE140]
gi|431716416|gb|ELJ80548.1| hypothetical protein WGY_03902 [Escherichia coli KTE95]
Length = 654
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|415831195|ref|ZP_11516965.1| hypothetical protein ECOK1357_3952 [Escherichia coli OK1357]
gi|323182744|gb|EFZ68146.1| hypothetical protein ECOK1357_3952 [Escherichia coli OK1357]
Length = 659
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|293417024|ref|ZP_06659661.1| hypothetical protein ECDG_04192 [Escherichia coli B185]
gi|291431600|gb|EFF04585.1| hypothetical protein ECDG_04192 [Escherichia coli B185]
Length = 656
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|284034063|ref|YP_003383994.1| hypothetical protein Kfla_6192 [Kribbella flavida DSM 17836]
gi|283813356|gb|ADB35195.1| protein of unknown function DUF1680 [Kribbella flavida DSM 17836]
Length = 637
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 122/322 (37%), Gaps = 37/322 (11%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D E+C ++ + + T YAD ER L NG L G+ G + Y+ PL
Sbjct: 323 DRAYAETCAAIGGVQWAWRMLLATGNAFYADAIERMLYNGFLAGVSLGGDE--YFYVNPL 380
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
+ E + W CC + + S L + +G + + QY
Sbjct: 381 QLRGAAEPDGNRSPAHGRRGWFDCACCPPNIMRTLSSLDGYLASTTDGA---IQLHQYAE 437
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
+ V +VD W+ ++VT+ + +L LRIP W ATL
Sbjct: 438 GAVAADLPAGTVELQVDTEYPWNGSIKVTVQQTPD---TPWALELRIPGWAEG----ATL 490
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
NG+ + G + V +TW++ D + +QLP+ RT A A+ GP V A
Sbjct: 491 NGKPV---DAGRYARVEQTWATGDTVELQLPMATRTVAADPRIDAVRGCVALERGPLVYA 547
Query: 636 GHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKS 695
+ + T + D + A +T T E G L + +T E P +
Sbjct: 548 VEQV------DQQTDVDDLHLLVGAP-----VTATHEPG-----LLDGVTVLTTEGRPGT 591
Query: 696 GTDAALHATFRLILNDSSGSEF 717
H +R L+DS G E
Sbjct: 592 -AHTPDHWPYRPGLDDSVGDEV 612
>gi|392977054|ref|YP_006475642.1| hypothetical protein A3UG_00935 [Enterobacter cloacae subsp.
dissolvens SDM]
gi|392322987|gb|AFM57940.1| hypothetical protein A3UG_00935 [Enterobacter cloacae subsp.
dissolvens SDM]
Length = 651
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 53/239 (22%), Positives = 96/239 (40%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSRYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + + P W CC + LG IY + ++I ++
Sbjct: 388 VHPRTLAFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIY---TVRPDALFINLFV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ + G + ++ W + + + + +T +L LR+P W ++ +
Sbjct: 445 GNEVTIPVGDETLKLRISGNYPWQKEVNIEI---ASPVPVTHTLALRLPDWCAN--PHVS 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+ + +L +T+ W D LT+ LP+ +R + A A+ GP V
Sbjct: 500 LNGEGMTGEVSRGYLHLTRRWQEGDTLTLTLPMPVRRVYGHPQVRQQAGKVALQRGPLV 558
>gi|387609318|ref|YP_006098174.1| hypothetical protein EC042_3892 [Escherichia coli 042]
gi|419917404|ref|ZP_14435664.1| hypothetical protein ECKD2_05705 [Escherichia coli KD2]
gi|284923618|emb|CBG36715.1| conserved hypothetical protein [Escherichia coli 042]
gi|388394341|gb|EIL55642.1| hypothetical protein ECKD2_05705 [Escherichia coli KD2]
Length = 656
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 94/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKTLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+++ +L +T+ W D L + LP+ +R A AI GP V
Sbjct: 500 LNGEEVGQDIRKGYLHITREWQEGDTLNLTLPMPVRRVYGNPQVRHVAGKVAIQRGPLV 558
>gi|193068520|ref|ZP_03049482.1| conserved hypothetical protein [Escherichia coli E110019]
gi|331670421|ref|ZP_08371260.1| putative cytoplasmic protein [Escherichia coli TA271]
gi|332282156|ref|ZP_08394569.1| conserved hypothetical protein [Shigella sp. D9]
gi|417222825|ref|ZP_12026265.1| putative glycosyhydrolase [Escherichia coli 96.154]
gi|417267012|ref|ZP_12054373.1| putative glycosyhydrolase [Escherichia coli 3.3884]
gi|417604475|ref|ZP_12255039.1| hypothetical protein ECSTEC94C_4306 [Escherichia coli STEC_94C]
gi|418040528|ref|ZP_12678768.1| hypothetical protein ECW26_09970 [Escherichia coli W26]
gi|419926997|ref|ZP_14444741.1| hypothetical protein EC5411_02255 [Escherichia coli 541-1]
gi|423707870|ref|ZP_17682250.1| hypothetical protein ESTG_02341 [Escherichia coli B799]
gi|432378754|ref|ZP_19621737.1| hypothetical protein WCQ_03650 [Escherichia coli KTE12]
gi|432482897|ref|ZP_19724846.1| hypothetical protein A15U_04037 [Escherichia coli KTE210]
gi|432676705|ref|ZP_19912149.1| hypothetical protein A1YU_03256 [Escherichia coli KTE142]
gi|433200343|ref|ZP_20384227.1| hypothetical protein WGW_03892 [Escherichia coli KTE94]
gi|192958171|gb|EDV88612.1| conserved hypothetical protein [Escherichia coli E110019]
gi|331062483|gb|EGI34403.1| putative cytoplasmic protein [Escherichia coli TA271]
gi|332104508|gb|EGJ07854.1| conserved hypothetical protein [Shigella sp. D9]
gi|345347843|gb|EGW80147.1| hypothetical protein ECSTEC94C_4306 [Escherichia coli STEC_94C]
gi|383476508|gb|EID68447.1| hypothetical protein ECW26_09970 [Escherichia coli W26]
gi|385709502|gb|EIG46500.1| hypothetical protein ESTG_02341 [Escherichia coli B799]
gi|386202627|gb|EII01618.1| putative glycosyhydrolase [Escherichia coli 96.154]
gi|386229370|gb|EII56725.1| putative glycosyhydrolase [Escherichia coli 3.3884]
gi|388408480|gb|EIL68825.1| hypothetical protein EC5411_02255 [Escherichia coli 541-1]
gi|430896388|gb|ELC18632.1| hypothetical protein WCQ_03650 [Escherichia coli KTE12]
gi|431003915|gb|ELD19148.1| hypothetical protein A15U_04037 [Escherichia coli KTE210]
gi|431210613|gb|ELF08667.1| hypothetical protein A1YU_03256 [Escherichia coli KTE142]
gi|431717675|gb|ELJ81769.1| hypothetical protein WGW_03892 [Escherichia coli KTE94]
Length = 659
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|15804123|ref|NP_290162.1| hypothetical protein Z5002 [Escherichia coli O157:H7 str. EDL933]
gi|15833713|ref|NP_312486.1| hypothetical protein ECs4459 [Escherichia coli O157:H7 str. Sakai]
gi|168746875|ref|ZP_02771897.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4113]
gi|168753398|ref|ZP_02778405.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4401]
gi|168759671|ref|ZP_02784678.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4501]
gi|168765993|ref|ZP_02791000.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4486]
gi|168772459|ref|ZP_02797466.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4196]
gi|168779729|ref|ZP_02804736.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4076]
gi|168797417|ref|ZP_02822424.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC508]
gi|195935108|ref|ZP_03080490.1| hypothetical protein EscherichcoliO157_01410 [Escherichia coli
O157:H7 str. EC4024]
gi|208809591|ref|ZP_03251928.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208813747|ref|ZP_03255076.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208821480|ref|ZP_03261800.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209399472|ref|YP_002273062.1| hypothetical protein ECH74115_4952 [Escherichia coli O157:H7 str.
EC4115]
gi|217324274|ref|ZP_03440358.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254795534|ref|YP_003080371.1| hypothetical protein ECSP_4573 [Escherichia coli O157:H7 str.
TW14359]
gi|291284953|ref|YP_003501771.1| hypothetical protein G2583_4318 [Escherichia coli O55:H7 str.
CB9615]
gi|387508986|ref|YP_006161242.1| hypothetical protein ECO55CA74_20675 [Escherichia coli O55:H7 str.
RM12579]
gi|387884760|ref|YP_006315062.1| hypothetical protein CDCO157_4196 [Escherichia coli Xuzhou21]
gi|416315758|ref|ZP_11659571.1| hypothetical protein ECoA_05479 [Escherichia coli O157:H7 str.
1044]
gi|416320011|ref|ZP_11662563.1| hypothetical protein ECoD_02794 [Escherichia coli O157:H7 str.
EC1212]
gi|416330228|ref|ZP_11669265.1| hypothetical protein ECF_04238 [Escherichia coli O157:H7 str. 1125]
gi|416778240|ref|ZP_11875812.1| hypothetical protein ECO5101_02775 [Escherichia coli O157:H7 str.
G5101]
gi|416789533|ref|ZP_11880657.1| hypothetical protein ECO9389_12801 [Escherichia coli O157:H- str.
493-89]
gi|416801447|ref|ZP_11885596.1| hypothetical protein ECO2687_16571 [Escherichia coli O157:H- str. H
2687]
gi|416812344|ref|ZP_11890513.1| hypothetical protein ECO7815_03795 [Escherichia coli O55:H7 str.
3256-97]
gi|416832964|ref|ZP_11900127.1| hypothetical protein ECOSU61_16870 [Escherichia coli O157:H7 str.
LSU-61]
gi|419047735|ref|ZP_13594666.1| hypothetical protein ECDEC3A_4594 [Escherichia coli DEC3A]
gi|419053393|ref|ZP_13600259.1| hypothetical protein ECDEC3B_4713 [Escherichia coli DEC3B]
gi|419059343|ref|ZP_13606144.1| hypothetical protein ECDEC3C_4952 [Escherichia coli DEC3C]
gi|419064888|ref|ZP_13611608.1| hypothetical protein ECDEC3D_4717 [Escherichia coli DEC3D]
gi|419071821|ref|ZP_13617428.1| hypothetical protein ECDEC3E_4925 [Escherichia coli DEC3E]
gi|419077685|ref|ZP_13623186.1| hypothetical protein ECDEC3F_4810 [Escherichia coli DEC3F]
gi|419082821|ref|ZP_13628266.1| hypothetical protein ECDEC4A_4458 [Escherichia coli DEC4A]
gi|419088700|ref|ZP_13634051.1| hypothetical protein ECDEC4B_4655 [Escherichia coli DEC4B]
gi|419094624|ref|ZP_13639902.1| hypothetical protein ECDEC4C_4545 [Escherichia coli DEC4C]
gi|419106234|ref|ZP_13651356.1| hypothetical protein ECDEC4E_4577 [Escherichia coli DEC4E]
gi|419111620|ref|ZP_13656671.1| hypothetical protein ECDEC4F_4462 [Escherichia coli DEC4F]
gi|419117157|ref|ZP_13662166.1| hypothetical protein ECDEC5A_4358 [Escherichia coli DEC5A]
gi|419122875|ref|ZP_13667817.1| hypothetical protein ECDEC5B_4720 [Escherichia coli DEC5B]
gi|419128272|ref|ZP_13673144.1| hypothetical protein ECDEC5C_4407 [Escherichia coli DEC5C]
gi|419133720|ref|ZP_13678547.1| hypothetical protein ECDEC5D_4501 [Escherichia coli DEC5D]
gi|419138882|ref|ZP_13683672.1| hypothetical protein ECDEC5E_4420 [Escherichia coli DEC5E]
gi|420271748|ref|ZP_14774099.1| hypothetical protein ECPA22_4925 [Escherichia coli PA22]
gi|420283060|ref|ZP_14785292.1| hypothetical protein ECTW06591_4436 [Escherichia coli TW06591]
gi|420288947|ref|ZP_14791129.1| hypothetical protein ECTW10246_5014 [Escherichia coli TW10246]
gi|420294768|ref|ZP_14796878.1| hypothetical protein ECTW11039_4946 [Escherichia coli TW11039]
gi|420300624|ref|ZP_14802667.1| hypothetical protein ECTW09109_5140 [Escherichia coli TW09109]
gi|420306468|ref|ZP_14808456.1| hypothetical protein ECTW10119_5262 [Escherichia coli TW10119]
gi|420311766|ref|ZP_14813694.1| hypothetical protein ECEC1738_4805 [Escherichia coli EC1738]
gi|420317423|ref|ZP_14819294.1| hypothetical protein ECEC1734_4754 [Escherichia coli EC1734]
gi|421814567|ref|ZP_16250269.1| hypothetical protein EC80416_4344 [Escherichia coli 8.0416]
gi|421821215|ref|ZP_16256686.1| hypothetical protein EC100821_5204 [Escherichia coli 10.0821]
gi|421833209|ref|ZP_16268489.1| hypothetical protein ECPA7_5406 [Escherichia coli PA7]
gi|423727615|ref|ZP_17701493.1| hypothetical protein ECPA31_4706 [Escherichia coli PA31]
gi|424079832|ref|ZP_17816792.1| hypothetical protein ECFDA505_4757 [Escherichia coli FDA505]
gi|424086239|ref|ZP_17822721.1| hypothetical protein ECFDA517_5074 [Escherichia coli FDA517]
gi|424099319|ref|ZP_17834587.1| hypothetical protein ECFRIK1985_5028 [Escherichia coli FRIK1985]
gi|424112173|ref|ZP_17846397.1| hypothetical protein EC93001_4874 [Escherichia coli 93-001]
gi|424118115|ref|ZP_17851944.1| hypothetical protein ECPA3_4889 [Escherichia coli PA3]
gi|424124302|ref|ZP_17857602.1| hypothetical protein ECPA5_4745 [Escherichia coli PA5]
gi|424130447|ref|ZP_17863346.1| hypothetical protein ECPA9_4920 [Escherichia coli PA9]
gi|424136776|ref|ZP_17869217.1| hypothetical protein ECPA10_5070 [Escherichia coli PA10]
gi|424143329|ref|ZP_17875187.1| hypothetical protein ECPA14_4913 [Escherichia coli PA14]
gi|424149721|ref|ZP_17881088.1| hypothetical protein ECPA15_5031 [Escherichia coli PA15]
gi|424155573|ref|ZP_17886500.1| hypothetical protein ECPA24_4636 [Escherichia coli PA24]
gi|424255558|ref|ZP_17892047.1| hypothetical protein ECPA25_4614 [Escherichia coli PA25]
gi|424334046|ref|ZP_17897955.1| hypothetical protein ECPA28_4952 [Escherichia coli PA28]
gi|424452012|ref|ZP_17903674.1| hypothetical protein ECPA32_4774 [Escherichia coli PA32]
gi|424458199|ref|ZP_17909303.1| hypothetical protein ECPA33_4772 [Escherichia coli PA33]
gi|424464678|ref|ZP_17915033.1| hypothetical protein ECPA39_4850 [Escherichia coli PA39]
gi|424477467|ref|ZP_17926776.1| hypothetical protein ECPA42_4929 [Escherichia coli PA42]
gi|424483230|ref|ZP_17932202.1| hypothetical protein ECTW07945_4773 [Escherichia coli TW07945]
gi|424489411|ref|ZP_17937952.1| hypothetical protein ECTW09098_4852 [Escherichia coli TW09098]
gi|424502761|ref|ZP_17949642.1| hypothetical protein ECEC4203_4848 [Escherichia coli EC4203]
gi|424509021|ref|ZP_17955394.1| hypothetical protein ECEC4196_4900 [Escherichia coli EC4196]
gi|424516380|ref|ZP_17960994.1| hypothetical protein ECTW14313_4703 [Escherichia coli TW14313]
gi|424522562|ref|ZP_17966668.1| hypothetical protein ECTW14301_4626 [Escherichia coli TW14301]
gi|424528439|ref|ZP_17972147.1| hypothetical protein ECEC4421_4687 [Escherichia coli EC4421]
gi|424534588|ref|ZP_17977927.1| hypothetical protein ECEC4422_4816 [Escherichia coli EC4422]
gi|424540646|ref|ZP_17983581.1| hypothetical protein ECEC4013_4958 [Escherichia coli EC4013]
gi|424546791|ref|ZP_17989143.1| hypothetical protein ECEC4402_4833 [Escherichia coli EC4402]
gi|424552999|ref|ZP_17994833.1| hypothetical protein ECEC4439_4786 [Escherichia coli EC4439]
gi|424559188|ref|ZP_18000588.1| hypothetical protein ECEC4436_4739 [Escherichia coli EC4436]
gi|424565524|ref|ZP_18006519.1| hypothetical protein ECEC4437_4898 [Escherichia coli EC4437]
gi|424571655|ref|ZP_18012193.1| hypothetical protein ECEC4448_4800 [Escherichia coli EC4448]
gi|424577810|ref|ZP_18017853.1| hypothetical protein ECEC1845_4760 [Escherichia coli EC1845]
gi|424583627|ref|ZP_18023264.1| hypothetical protein ECEC1863_4496 [Escherichia coli EC1863]
gi|425100295|ref|ZP_18503019.1| hypothetical protein EC34870_4837 [Escherichia coli 3.4870]
gi|425106397|ref|ZP_18508705.1| hypothetical protein EC52239_4795 [Escherichia coli 5.2239]
gi|425112407|ref|ZP_18514320.1| hypothetical protein EC60172_4950 [Escherichia coli 6.0172]
gi|425128335|ref|ZP_18529494.1| hypothetical protein EC80586_5102 [Escherichia coli 8.0586]
gi|425134077|ref|ZP_18534919.1| hypothetical protein EC82524_4714 [Escherichia coli 8.2524]
gi|425140695|ref|ZP_18541067.1| hypothetical protein EC100833_5124 [Escherichia coli 10.0833]
gi|425146362|ref|ZP_18546346.1| hypothetical protein EC100869_4617 [Escherichia coli 10.0869]
gi|425152482|ref|ZP_18552087.1| hypothetical protein EC880221_4757 [Escherichia coli 88.0221]
gi|425158354|ref|ZP_18557610.1| hypothetical protein ECPA34_4912 [Escherichia coli PA34]
gi|425164699|ref|ZP_18563578.1| hypothetical protein ECFDA506_5110 [Escherichia coli FDA506]
gi|425170445|ref|ZP_18568910.1| hypothetical protein ECFDA507_4850 [Escherichia coli FDA507]
gi|425176495|ref|ZP_18574606.1| hypothetical protein ECFDA504_4775 [Escherichia coli FDA504]
gi|425188821|ref|ZP_18586085.1| hypothetical protein ECFRIK1997_5034 [Escherichia coli FRIK1997]
gi|425202058|ref|ZP_18598257.1| hypothetical protein ECNE037_5169 [Escherichia coli NE037]
gi|425214195|ref|ZP_18609587.1| hypothetical protein ECPA4_4926 [Escherichia coli PA4]
gi|425220319|ref|ZP_18615273.1| hypothetical protein ECPA23_4795 [Escherichia coli PA23]
gi|425226960|ref|ZP_18621418.1| hypothetical protein ECPA49_5019 [Escherichia coli PA49]
gi|425233121|ref|ZP_18627153.1| hypothetical protein ECPA45_4973 [Escherichia coli PA45]
gi|425239047|ref|ZP_18632758.1| hypothetical protein ECTT12B_4670 [Escherichia coli TT12B]
gi|425257257|ref|ZP_18649759.1| hypothetical protein ECCB7326_4836 [Escherichia coli CB7326]
gi|425269512|ref|ZP_18661133.1| hypothetical protein EC5412_4766 [Escherichia coli 5412]
gi|425296972|ref|ZP_18687122.1| hypothetical protein ECPA38_4622 [Escherichia coli PA38]
gi|425313655|ref|ZP_18702824.1| hypothetical protein ECEC1735_4763 [Escherichia coli EC1735]
gi|425319635|ref|ZP_18708414.1| hypothetical protein ECEC1736_4708 [Escherichia coli EC1736]
gi|425325746|ref|ZP_18714090.1| hypothetical protein ECEC1737_4710 [Escherichia coli EC1737]
gi|425332099|ref|ZP_18719925.1| hypothetical protein ECEC1846_4816 [Escherichia coli EC1846]
gi|425338276|ref|ZP_18725622.1| hypothetical protein ECEC1847_4840 [Escherichia coli EC1847]
gi|425344593|ref|ZP_18731474.1| hypothetical protein ECEC1848_4958 [Escherichia coli EC1848]
gi|425350429|ref|ZP_18736886.1| hypothetical protein ECEC1849_4723 [Escherichia coli EC1849]
gi|425356701|ref|ZP_18742759.1| hypothetical protein ECEC1850_4950 [Escherichia coli EC1850]
gi|425362661|ref|ZP_18748298.1| hypothetical protein ECEC1856_4773 [Escherichia coli EC1856]
gi|425368889|ref|ZP_18753993.1| hypothetical protein ECEC1862_4786 [Escherichia coli EC1862]
gi|425375193|ref|ZP_18759826.1| hypothetical protein ECEC1864_4920 [Escherichia coli EC1864]
gi|425388083|ref|ZP_18771633.1| hypothetical protein ECEC1866_4680 [Escherichia coli EC1866]
gi|425394775|ref|ZP_18777875.1| hypothetical protein ECEC1868_4993 [Escherichia coli EC1868]
gi|425400871|ref|ZP_18783568.1| hypothetical protein ECEC1869_4939 [Escherichia coli EC1869]
gi|425406963|ref|ZP_18789176.1| hypothetical protein ECEC1870_4738 [Escherichia coli EC1870]
gi|425413349|ref|ZP_18795102.1| hypothetical protein ECNE098_4927 [Escherichia coli NE098]
gi|425419660|ref|ZP_18800921.1| hypothetical protein ECFRIK523_4776 [Escherichia coli FRIK523]
gi|425430935|ref|ZP_18811535.1| hypothetical protein EC01304_4902 [Escherichia coli 0.1304]
gi|428955440|ref|ZP_19027224.1| hypothetical protein EC881042_4795 [Escherichia coli 88.1042]
gi|428961439|ref|ZP_19032721.1| hypothetical protein EC890511_4747 [Escherichia coli 89.0511]
gi|428968048|ref|ZP_19038750.1| hypothetical protein EC900091_5143 [Escherichia coli 90.0091]
gi|428980186|ref|ZP_19049993.1| hypothetical protein EC902281_4755 [Escherichia coli 90.2281]
gi|428985972|ref|ZP_19055354.1| hypothetical protein EC930055_4673 [Escherichia coli 93.0055]
gi|428992156|ref|ZP_19061135.1| hypothetical protein EC930056_4727 [Escherichia coli 93.0056]
gi|428998047|ref|ZP_19066631.1| hypothetical protein EC940618_4635 [Escherichia coli 94.0618]
gi|429010405|ref|ZP_19077843.1| hypothetical protein EC951288_4497 [Escherichia coli 95.1288]
gi|429016933|ref|ZP_19083806.1| hypothetical protein EC950943_4910 [Escherichia coli 95.0943]
gi|429022675|ref|ZP_19089186.1| hypothetical protein EC960428_4625 [Escherichia coli 96.0428]
gi|429028846|ref|ZP_19094826.1| hypothetical protein EC960427_4802 [Escherichia coli 96.0427]
gi|429041099|ref|ZP_19106187.1| hypothetical protein EC960932_4872 [Escherichia coli 96.0932]
gi|429046954|ref|ZP_19111657.1| hypothetical protein EC960107_4684 [Escherichia coli 96.0107]
gi|429052309|ref|ZP_19116869.1| hypothetical protein EC970003_4427 [Escherichia coli 97.0003]
gi|429057821|ref|ZP_19122084.1| hypothetical protein EC971742_4298 [Escherichia coli 97.1742]
gi|429063366|ref|ZP_19127341.1| hypothetical protein EC970007_4185 [Escherichia coli 97.0007]
gi|429070723|ref|ZP_19134102.1| hypothetical protein EC990678_5252 [Escherichia coli 99.0678]
gi|429081416|ref|ZP_19144532.1| hypothetical protein EC990713_5250 [Escherichia coli 99.0713]
gi|429828751|ref|ZP_19359758.1| hypothetical protein EC960109_4876 [Escherichia coli 96.0109]
gi|429835191|ref|ZP_19365469.1| hypothetical protein EC970010_4835 [Escherichia coli 97.0010]
gi|444927256|ref|ZP_21246521.1| hypothetical protein EC09BKT78844_4882 [Escherichia coli
09BKT078844]
gi|444932846|ref|ZP_21251863.1| hypothetical protein EC990814_4226 [Escherichia coli 99.0814]
gi|444938322|ref|ZP_21257070.1| hypothetical protein EC990815_4259 [Escherichia coli 99.0815]
gi|444943914|ref|ZP_21262410.1| hypothetical protein EC990816_4314 [Escherichia coli 99.0816]
gi|444949405|ref|ZP_21267701.1| hypothetical protein EC990839_4245 [Escherichia coli 99.0839]
gi|444955079|ref|ZP_21273151.1| hypothetical protein EC990848_4357 [Escherichia coli 99.0848]
gi|444960466|ref|ZP_21278295.1| hypothetical protein EC991753_4289 [Escherichia coli 99.1753]
gi|444965679|ref|ZP_21283249.1| hypothetical protein EC991775_4151 [Escherichia coli 99.1775]
gi|444971675|ref|ZP_21289020.1| hypothetical protein EC991793_4596 [Escherichia coli 99.1793]
gi|444976975|ref|ZP_21294065.1| hypothetical protein EC991805_4182 [Escherichia coli 99.1805]
gi|444982346|ref|ZP_21299247.1| hypothetical protein ECATCC700728_4176 [Escherichia coli ATCC
700728]
gi|444988560|ref|ZP_21305317.1| hypothetical protein ECPA11_5184 [Escherichia coli PA11]
gi|444993068|ref|ZP_21309704.1| hypothetical protein ECPA19_4335 [Escherichia coli PA19]
gi|444998301|ref|ZP_21314794.1| hypothetical protein ECPA13_4095 [Escherichia coli PA13]
gi|445004788|ref|ZP_21321157.1| hypothetical protein ECPA2_5358 [Escherichia coli PA2]
gi|445004922|ref|ZP_21321282.1| hypothetical protein ECPA47_5269 [Escherichia coli PA47]
gi|445015398|ref|ZP_21331479.1| hypothetical protein ECPA48_5173 [Escherichia coli PA48]
gi|445015754|ref|ZP_21331819.1| hypothetical protein ECPA8_5306 [Escherichia coli PA8]
gi|445021071|ref|ZP_21337012.1| hypothetical protein EC71982_5249 [Escherichia coli 7.1982]
gi|445028321|ref|ZP_21344063.1| hypothetical protein EC991781_1757 [Escherichia coli 99.1781]
gi|445031935|ref|ZP_21347574.1| hypothetical protein EC991762_5373 [Escherichia coli 99.1762]
gi|445042200|ref|ZP_21357565.1| hypothetical protein ECPA35_4500 [Escherichia coli PA35]
gi|445043905|ref|ZP_21359240.1| hypothetical protein EC34880_0888 [Escherichia coli 3.4880]
gi|445052978|ref|ZP_21367995.1| hypothetical protein EC950083_4258 [Escherichia coli 95.0083]
gi|445061011|ref|ZP_21373522.1| hypothetical protein EC990670_4482 [Escherichia coli 99.0670]
gi|452968310|ref|ZP_21966537.1| hypothetical protein EC4009_RS06445 [Escherichia coli O157:H7 str.
EC4009]
gi|12518318|gb|AAG58726.1|AE005584_8 orf; hypothetical protein [Escherichia coli O157:H7 str. EDL933]
gi|13363934|dbj|BAB37882.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|187771563|gb|EDU35407.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4196]
gi|188018366|gb|EDU56488.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4113]
gi|189002301|gb|EDU71287.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4076]
gi|189358833|gb|EDU77252.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4401]
gi|189364486|gb|EDU82905.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4486]
gi|189369459|gb|EDU87875.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4501]
gi|189380134|gb|EDU98550.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC508]
gi|208729392|gb|EDZ78993.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208735024|gb|EDZ83711.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208741603|gb|EDZ89285.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209160872|gb|ACI38305.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4115]
gi|217320495|gb|EEC28919.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254594934|gb|ACT74295.1| hypothetical protein ECSP_4573 [Escherichia coli O157:H7 str.
TW14359]
gi|290764826|gb|ADD58787.1| hypothetical protein G2583_4318 [Escherichia coli O55:H7 str.
CB9615]
gi|320191367|gb|EFW66017.1| hypothetical protein ECoD_02794 [Escherichia coli O157:H7 str.
EC1212]
gi|320639897|gb|EFX09491.1| hypothetical protein ECO5101_02775 [Escherichia coli O157:H7 str.
G5101]
gi|320645061|gb|EFX14085.1| hypothetical protein ECO9389_12801 [Escherichia coli O157:H- str.
493-89]
gi|320650327|gb|EFX18810.1| hypothetical protein ECO2687_16571 [Escherichia coli O157:H- str. H
2687]
gi|320655901|gb|EFX23824.1| hypothetical protein ECO7815_03795 [Escherichia coli O55:H7 str.
3256-97 TW 07815]
gi|320666706|gb|EFX33689.1| hypothetical protein ECOSU61_16870 [Escherichia coli O157:H7 str.
LSU-61]
gi|326337419|gb|EGD61254.1| hypothetical protein ECoA_05479 [Escherichia coli O157:H7 str.
1044]
gi|326339944|gb|EGD63751.1| hypothetical protein ECF_04238 [Escherichia coli O157:H7 str. 1125]
gi|374360980|gb|AEZ42687.1| hypothetical protein ECO55CA74_20675 [Escherichia coli O55:H7 str.
RM12579]
gi|377889685|gb|EHU54145.1| hypothetical protein ECDEC3A_4594 [Escherichia coli DEC3A]
gi|377889783|gb|EHU54242.1| hypothetical protein ECDEC3B_4713 [Escherichia coli DEC3B]
gi|377903272|gb|EHU67570.1| hypothetical protein ECDEC3C_4952 [Escherichia coli DEC3C]
gi|377907386|gb|EHU71622.1| hypothetical protein ECDEC3D_4717 [Escherichia coli DEC3D]
gi|377908341|gb|EHU72558.1| hypothetical protein ECDEC3E_4925 [Escherichia coli DEC3E]
gi|377918108|gb|EHU82161.1| hypothetical protein ECDEC3F_4810 [Escherichia coli DEC3F]
gi|377924259|gb|EHU88215.1| hypothetical protein ECDEC4A_4458 [Escherichia coli DEC4A]
gi|377927762|gb|EHU91677.1| hypothetical protein ECDEC4B_4655 [Escherichia coli DEC4B]
gi|377939056|gb|EHV02814.1| hypothetical protein ECDEC4C_4545 [Escherichia coli DEC4C]
gi|377944467|gb|EHV08170.1| hypothetical protein ECDEC4E_4577 [Escherichia coli DEC4E]
gi|377954643|gb|EHV18202.1| hypothetical protein ECDEC4F_4462 [Escherichia coli DEC4F]
gi|377957760|gb|EHV21288.1| hypothetical protein ECDEC5A_4358 [Escherichia coli DEC5A]
gi|377962943|gb|EHV26395.1| hypothetical protein ECDEC5B_4720 [Escherichia coli DEC5B]
gi|377970279|gb|EHV33643.1| hypothetical protein ECDEC5C_4407 [Escherichia coli DEC5C]
gi|377972443|gb|EHV35793.1| hypothetical protein ECDEC5D_4501 [Escherichia coli DEC5D]
gi|377981006|gb|EHV44266.1| hypothetical protein ECDEC5E_4420 [Escherichia coli DEC5E]
gi|386798218|gb|AFJ31252.1| hypothetical protein CDCO157_4196 [Escherichia coli Xuzhou21]
gi|390639210|gb|EIN18690.1| hypothetical protein ECFDA505_4757 [Escherichia coli FDA505]
gi|390639622|gb|EIN19093.1| hypothetical protein ECFDA517_5074 [Escherichia coli FDA517]
gi|390657072|gb|EIN34899.1| hypothetical protein ECFRIK1985_5028 [Escherichia coli FRIK1985]
gi|390657374|gb|EIN35192.1| hypothetical protein EC93001_4874 [Escherichia coli 93-001]
gi|390674723|gb|EIN50894.1| hypothetical protein ECPA3_4889 [Escherichia coli PA3]
gi|390678199|gb|EIN54182.1| hypothetical protein ECPA5_4745 [Escherichia coli PA5]
gi|390682075|gb|EIN57859.1| hypothetical protein ECPA9_4920 [Escherichia coli PA9]
gi|390693074|gb|EIN67718.1| hypothetical protein ECPA10_5070 [Escherichia coli PA10]
gi|390697368|gb|EIN71789.1| hypothetical protein ECPA14_4913 [Escherichia coli PA14]
gi|390698263|gb|EIN72649.1| hypothetical protein ECPA15_5031 [Escherichia coli PA15]
gi|390712206|gb|EIN85163.1| hypothetical protein ECPA22_4925 [Escherichia coli PA22]
gi|390719137|gb|EIN91871.1| hypothetical protein ECPA25_4614 [Escherichia coli PA25]
gi|390720026|gb|EIN92739.1| hypothetical protein ECPA24_4636 [Escherichia coli PA24]
gi|390725222|gb|EIN97742.1| hypothetical protein ECPA28_4952 [Escherichia coli PA28]
gi|390738126|gb|EIO09345.1| hypothetical protein ECPA31_4706 [Escherichia coli PA31]
gi|390738929|gb|EIO10125.1| hypothetical protein ECPA32_4774 [Escherichia coli PA32]
gi|390742351|gb|EIO13360.1| hypothetical protein ECPA33_4772 [Escherichia coli PA33]
gi|390761275|gb|EIO30571.1| hypothetical protein ECPA39_4850 [Escherichia coli PA39]
gi|390765920|gb|EIO35069.1| hypothetical protein ECPA42_4929 [Escherichia coli PA42]
gi|390779851|gb|EIO47565.1| hypothetical protein ECTW06591_4436 [Escherichia coli TW06591]
gi|390786558|gb|EIO54065.1| hypothetical protein ECTW07945_4773 [Escherichia coli TW07945]
gi|390787899|gb|EIO55372.1| hypothetical protein ECTW10246_5014 [Escherichia coli TW10246]
gi|390793629|gb|EIO60962.1| hypothetical protein ECTW11039_4946 [Escherichia coli TW11039]
gi|390801428|gb|EIO68486.1| hypothetical protein ECTW09098_4852 [Escherichia coli TW09098]
gi|390804995|gb|EIO71943.1| hypothetical protein ECTW09109_5140 [Escherichia coli TW09109]
gi|390814183|gb|EIO80763.1| hypothetical protein ECTW10119_5262 [Escherichia coli TW10119]
gi|390823323|gb|EIO89388.1| hypothetical protein ECEC4203_4848 [Escherichia coli EC4203]
gi|390828114|gb|EIO93799.1| hypothetical protein ECEC4196_4900 [Escherichia coli EC4196]
gi|390841966|gb|EIP05848.1| hypothetical protein ECTW14313_4703 [Escherichia coli TW14313]
gi|390843557|gb|EIP07344.1| hypothetical protein ECTW14301_4626 [Escherichia coli TW14301]
gi|390848287|gb|EIP11762.1| hypothetical protein ECEC4421_4687 [Escherichia coli EC4421]
gi|390858717|gb|EIP21090.1| hypothetical protein ECEC4422_4816 [Escherichia coli EC4422]
gi|390863135|gb|EIP25287.1| hypothetical protein ECEC4013_4958 [Escherichia coli EC4013]
gi|390867335|gb|EIP29163.1| hypothetical protein ECEC4402_4833 [Escherichia coli EC4402]
gi|390875728|gb|EIP36731.1| hypothetical protein ECEC4439_4786 [Escherichia coli EC4439]
gi|390881173|gb|EIP41787.1| hypothetical protein ECEC4436_4739 [Escherichia coli EC4436]
gi|390890973|gb|EIP50619.1| hypothetical protein ECEC4437_4898 [Escherichia coli EC4437]
gi|390892686|gb|EIP52258.1| hypothetical protein ECEC4448_4800 [Escherichia coli EC4448]
gi|390898319|gb|EIP57592.1| hypothetical protein ECEC1738_4805 [Escherichia coli EC1738]
gi|390906250|gb|EIP65153.1| hypothetical protein ECEC1734_4754 [Escherichia coli EC1734]
gi|390916344|gb|EIP74812.1| hypothetical protein ECEC1863_4496 [Escherichia coli EC1863]
gi|390916988|gb|EIP75422.1| hypothetical protein ECEC1845_4760 [Escherichia coli EC1845]
gi|408062465|gb|EKG96971.1| hypothetical protein ECPA7_5406 [Escherichia coli PA7]
gi|408066781|gb|EKH01227.1| hypothetical protein ECPA34_4912 [Escherichia coli PA34]
gi|408077084|gb|EKH11298.1| hypothetical protein ECFDA506_5110 [Escherichia coli FDA506]
gi|408080700|gb|EKH14758.1| hypothetical protein ECFDA507_4850 [Escherichia coli FDA507]
gi|408088919|gb|EKH22258.1| hypothetical protein ECFDA504_4775 [Escherichia coli FDA504]
gi|408101414|gb|EKH33866.1| hypothetical protein ECFRIK1997_5034 [Escherichia coli FRIK1997]
gi|408112898|gb|EKH44512.1| hypothetical protein ECNE037_5169 [Escherichia coli NE037]
gi|408125331|gb|EKH55940.1| hypothetical protein ECPA4_4926 [Escherichia coli PA4]
gi|408135214|gb|EKH65012.1| hypothetical protein ECPA23_4795 [Escherichia coli PA23]
gi|408137363|gb|EKH67065.1| hypothetical protein ECPA49_5019 [Escherichia coli PA49]
gi|408144386|gb|EKH73624.1| hypothetical protein ECPA45_4973 [Escherichia coli PA45]
gi|408152571|gb|EKH81000.1| hypothetical protein ECTT12B_4670 [Escherichia coli TT12B]
gi|408171077|gb|EKH98219.1| hypothetical protein ECCB7326_4836 [Escherichia coli CB7326]
gi|408180941|gb|EKI07530.1| hypothetical protein EC5412_4766 [Escherichia coli 5412]
gi|408214152|gb|EKI38607.1| hypothetical protein ECPA38_4622 [Escherichia coli PA38]
gi|408224415|gb|EKI48128.1| hypothetical protein ECEC1735_4763 [Escherichia coli EC1735]
gi|408235748|gb|EKI58682.1| hypothetical protein ECEC1736_4708 [Escherichia coli EC1736]
gi|408239233|gb|EKI61987.1| hypothetical protein ECEC1737_4710 [Escherichia coli EC1737]
gi|408244183|gb|EKI66641.1| hypothetical protein ECEC1846_4816 [Escherichia coli EC1846]
gi|408252867|gb|EKI74491.1| hypothetical protein ECEC1847_4840 [Escherichia coli EC1847]
gi|408256804|gb|EKI78168.1| hypothetical protein ECEC1848_4958 [Escherichia coli EC1848]
gi|408263244|gb|EKI84109.1| hypothetical protein ECEC1849_4723 [Escherichia coli EC1849]
gi|408271922|gb|EKI92038.1| hypothetical protein ECEC1850_4950 [Escherichia coli EC1850]
gi|408274623|gb|EKI94619.1| hypothetical protein ECEC1856_4773 [Escherichia coli EC1856]
gi|408283205|gb|EKJ02419.1| hypothetical protein ECEC1862_4786 [Escherichia coli EC1862]
gi|408289130|gb|EKJ07907.1| hypothetical protein ECEC1864_4920 [Escherichia coli EC1864]
gi|408304578|gb|EKJ22002.1| hypothetical protein ECEC1868_4993 [Escherichia coli EC1868]
gi|408305359|gb|EKJ22756.1| hypothetical protein ECEC1866_4680 [Escherichia coli EC1866]
gi|408316515|gb|EKJ32784.1| hypothetical protein ECEC1869_4939 [Escherichia coli EC1869]
gi|408321867|gb|EKJ37871.1| hypothetical protein ECEC1870_4738 [Escherichia coli EC1870]
gi|408324176|gb|EKJ40122.1| hypothetical protein ECNE098_4927 [Escherichia coli NE098]
gi|408334438|gb|EKJ49326.1| hypothetical protein ECFRIK523_4776 [Escherichia coli FRIK523]
gi|408343399|gb|EKJ57802.1| hypothetical protein EC01304_4902 [Escherichia coli 0.1304]
gi|408545930|gb|EKK23352.1| hypothetical protein EC52239_4795 [Escherichia coli 5.2239]
gi|408546745|gb|EKK24159.1| hypothetical protein EC34870_4837 [Escherichia coli 3.4870]
gi|408547047|gb|EKK24447.1| hypothetical protein EC60172_4950 [Escherichia coli 6.0172]
gi|408564499|gb|EKK40604.1| hypothetical protein EC80586_5102 [Escherichia coli 8.0586]
gi|408576191|gb|EKK51804.1| hypothetical protein EC100833_5124 [Escherichia coli 10.0833]
gi|408579122|gb|EKK54601.1| hypothetical protein EC82524_4714 [Escherichia coli 8.2524]
gi|408588994|gb|EKK63538.1| hypothetical protein EC100869_4617 [Escherichia coli 10.0869]
gi|408594205|gb|EKK68496.1| hypothetical protein EC880221_4757 [Escherichia coli 88.0221]
gi|408599378|gb|EKK73290.1| hypothetical protein EC80416_4344 [Escherichia coli 8.0416]
gi|408606541|gb|EKK79968.1| hypothetical protein EC100821_5204 [Escherichia coli 10.0821]
gi|427201963|gb|EKV72321.1| hypothetical protein EC881042_4795 [Escherichia coli 88.1042]
gi|427202497|gb|EKV72822.1| hypothetical protein EC890511_4747 [Escherichia coli 89.0511]
gi|427218432|gb|EKV87442.1| hypothetical protein EC900091_5143 [Escherichia coli 90.0091]
gi|427221712|gb|EKV90524.1| hypothetical protein EC902281_4755 [Escherichia coli 90.2281]
gi|427238946|gb|EKW06445.1| hypothetical protein EC930056_4727 [Escherichia coli 93.0056]
gi|427239084|gb|EKW06577.1| hypothetical protein EC930055_4673 [Escherichia coli 93.0055]
gi|427243369|gb|EKW10745.1| hypothetical protein EC940618_4635 [Escherichia coli 94.0618]
gi|427258569|gb|EKW24654.1| hypothetical protein EC950943_4910 [Escherichia coli 95.0943]
gi|427260727|gb|EKW26692.1| hypothetical protein EC951288_4497 [Escherichia coli 95.1288]
gi|427273802|gb|EKW38469.1| hypothetical protein EC960428_4625 [Escherichia coli 96.0428]
gi|427276260|gb|EKW40835.1| hypothetical protein EC960427_4802 [Escherichia coli 96.0427]
gi|427289537|gb|EKW53075.1| hypothetical protein EC960932_4872 [Escherichia coli 96.0932]
gi|427296261|gb|EKW59321.1| hypothetical protein EC960107_4684 [Escherichia coli 96.0107]
gi|427298383|gb|EKW61393.1| hypothetical protein EC970003_4427 [Escherichia coli 97.0003]
gi|427308631|gb|EKW70996.1| hypothetical protein EC971742_4298 [Escherichia coli 97.1742]
gi|427311712|gb|EKW73893.1| hypothetical protein EC970007_4185 [Escherichia coli 97.0007]
gi|427324889|gb|EKW86347.1| hypothetical protein EC990713_5250 [Escherichia coli 99.0713]
gi|427336056|gb|EKW97058.1| hypothetical protein EC990678_5252 [Escherichia coli 99.0678]
gi|429251455|gb|EKY36050.1| hypothetical protein EC960109_4876 [Escherichia coli 96.0109]
gi|429252515|gb|EKY37047.1| hypothetical protein EC970010_4835 [Escherichia coli 97.0010]
gi|444535665|gb|ELV15735.1| hypothetical protein EC990814_4226 [Escherichia coli 99.0814]
gi|444536994|gb|ELV16959.1| hypothetical protein EC09BKT78844_4882 [Escherichia coli
09BKT078844]
gi|444545831|gb|ELV24637.1| hypothetical protein EC990815_4259 [Escherichia coli 99.0815]
gi|444555151|gb|ELV32633.1| hypothetical protein EC990839_4245 [Escherichia coli 99.0839]
gi|444555319|gb|ELV32789.1| hypothetical protein EC990816_4314 [Escherichia coli 99.0816]
gi|444560365|gb|ELV37532.1| hypothetical protein EC990848_4357 [Escherichia coli 99.0848]
gi|444569733|gb|ELV46300.1| hypothetical protein EC991753_4289 [Escherichia coli 99.1753]
gi|444573453|gb|ELV49819.1| hypothetical protein EC991775_4151 [Escherichia coli 99.1775]
gi|444577174|gb|ELV53320.1| hypothetical protein EC991793_4596 [Escherichia coli 99.1793]
gi|444588184|gb|ELV63570.1| hypothetical protein ECPA11_5184 [Escherichia coli PA11]
gi|444589994|gb|ELV65310.1| hypothetical protein EC991805_4182 [Escherichia coli 99.1805]
gi|444590079|gb|ELV65394.1| hypothetical protein ECATCC700728_4176 [Escherichia coli ATCC
700728]
gi|444604008|gb|ELV78694.1| hypothetical protein ECPA13_4095 [Escherichia coli PA13]
gi|444604410|gb|ELV79084.1| hypothetical protein ECPA19_4335 [Escherichia coli PA19]
gi|444611225|gb|ELV85574.1| hypothetical protein ECPA2_5358 [Escherichia coli PA2]
gi|444618641|gb|ELV92715.1| hypothetical protein ECPA48_5173 [Escherichia coli PA48]
gi|444634620|gb|ELW08085.1| hypothetical protein ECPA47_5269 [Escherichia coli PA47]
gi|444639829|gb|ELW13128.1| hypothetical protein ECPA8_5306 [Escherichia coli PA8]
gi|444646552|gb|ELW19556.1| hypothetical protein EC991781_1757 [Escherichia coli 99.1781]
gi|444649874|gb|ELW22742.1| hypothetical protein EC71982_5249 [Escherichia coli 7.1982]
gi|444652152|gb|ELW24923.1| hypothetical protein ECPA35_4500 [Escherichia coli PA35]
gi|444655466|gb|ELW28079.1| hypothetical protein EC991762_5373 [Escherichia coli 99.1762]
gi|444660513|gb|ELW32876.1| hypothetical protein EC950083_4258 [Escherichia coli 95.0083]
gi|444666637|gb|ELW38700.1| hypothetical protein EC34880_0888 [Escherichia coli 3.4880]
gi|444667586|gb|ELW39621.1| hypothetical protein EC990670_4482 [Escherichia coli 99.0670]
Length = 656
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|417243728|ref|ZP_12038126.1| putative glycosyhydrolase [Escherichia coli 9.0111]
gi|386211280|gb|EII21745.1| putative glycosyhydrolase [Escherichia coli 9.0111]
Length = 654
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|417588723|ref|ZP_12239485.1| hypothetical protein ECSTECC16502_4394 [Escherichia coli
STEC_C165-02]
gi|345331722|gb|EGW64181.1| hypothetical protein ECSTECC16502_4394 [Escherichia coli
STEC_C165-02]
Length = 654
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVRGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDMLNLTLPMPVR 535
>gi|432487351|ref|ZP_19729258.1| hypothetical protein A15Y_03854 [Escherichia coli KTE212]
gi|433175488|ref|ZP_20359993.1| hypothetical protein WGQ_03753 [Escherichia coli KTE232]
gi|431013718|gb|ELD27447.1| hypothetical protein A15Y_03854 [Escherichia coli KTE212]
gi|431688314|gb|ELJ53849.1| hypothetical protein WGQ_03753 [Escherichia coli KTE232]
Length = 656
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPLENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|365968450|ref|YP_004950011.1| hypothetical protein EcWSU1_00150 [Enterobacter cloacae EcWSU1]
gi|365747363|gb|AEW71590.1| hypothetical protein EcWSU1_00150 [Enterobacter cloacae EcWSU1]
Length = 667
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/217 (23%), Positives = 93/217 (42%), Gaps = 17/217 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 345 DTVYAESCASIGLMMFARRMLEMEADSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 403
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQY 513
P + + P W CC + LG +Y ++ + +Y+
Sbjct: 404 VHPKTLAFNHVYDHVKPVRQRWFGCACCPPNIARVLTSLGHYLYTVRQDALFINLYVGND 463
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
++ +D + Q+ ++ W + + +T + +T +L LR+P W +S
Sbjct: 464 VAIPVDEGTLQL----RISGNYPWQEEVNIEVTSPAP---VTHTLALRLPDWCAS--PAM 514
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
+LNG+ + +L +T+ W D LT+ LP+ +R
Sbjct: 515 SLNGERVTGDVSRGYLYLTRRWQEGDTLTLTLPMPVR 551
>gi|422975185|ref|ZP_16976637.1| hypothetical protein ESRG_03271 [Escherichia coli TA124]
gi|371595315|gb|EHN84166.1| hypothetical protein ESRG_03271 [Escherichia coli TA124]
Length = 654
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLCLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|265752762|ref|ZP_06088331.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263235948|gb|EEZ21443.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 811
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 118/282 (41%), Gaps = 40/282 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + YAD ER+L NGV+ G+ + Y PL
Sbjct: 340 ETCASIANVYWNHRMFLATGDAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 397
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER HW + CC G I F + Y+ + VY+ +I S+ D ++
Sbjct: 398 HER--QHWFGCA----CCPGN-ITRF--MASVPYYMYATQGNDVYVNLFIQSKADIETES 448
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
+N + WD + + +T + +L +RIP WT ++ A+A
Sbjct: 449 NKINVEQTTGYPWDGKISIAVTPEKEQE---FALRVRIPGWTQDAPVPTDLYSFTDKAQA 505
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
++NG + + ++ + W + D + I LP+ +R D + AI G
Sbjct: 506 YSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVKANDQVEDDHGKLAIERG 565
Query: 631 P--YVLAGHSIGDWDITESATSLSDWI---TPIPASYNSQLI 667
P + L G D +T + +I TP+ AS+++ L+
Sbjct: 566 PIMFCLEGQDQAD------STVFNKFIPDGTPMEASFHADLL 601
>gi|150009918|ref|YP_001304661.1| hypothetical protein BDI_3335 [Parabacteroides distasonis ATCC
8503]
gi|423333683|ref|ZP_17311464.1| hypothetical protein HMPREF1075_03115 [Parabacteroides distasonis
CL03T12C09]
gi|149938342|gb|ABR45039.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
gi|409226993|gb|EKN19895.1| hypothetical protein HMPREF1075_03115 [Parabacteroides distasonis
CL03T12C09]
Length = 617
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 65/289 (22%), Positives = 116/289 (40%), Gaps = 31/289 (10%)
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLL 457
NLD+ E +C + M+ ++ + ++T + Y D ERS+ NG L GI E Y+
Sbjct: 328 NLDAYCE-TCASVGMVLWNQRMNQFTGDSKYIDVLERSMYNGALAGIS--LEGDRFFYVN 384
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL R + CC +G+ IY +++ YI +
Sbjct: 385 PLESKGDHHRQAWY------GCACCPSQISRFLPSIGNYIYGTSN---EAIWVNLYIGNS 435
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ + V + + WD +++T+T S+ L + LRIP+W ++NG
Sbjct: 436 TEINTDNTNVTLRQETNYPWDGTVKLTVTPSNP---LKKEIRLRIPSWCEQ--YTLSVNG 490
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
Q + P+ + + K W D +++ + + ++ + +AI GP V
Sbjct: 491 QLVKAPTEKGYAVLNKEWKQGDVISLSMEMPVKLMTADPRVKQNIGKRAIQRGPLVYCME 550
Query: 638 SIG---DWDITESATSLS----------DWITPIPASYNSQLITFTQEY 673
+ D+D + A + S + IT I A+ N IT Y
Sbjct: 551 EVDNPQDFDNLKIAANTSFNAQFNPKLLNGITTIKATTNELAITLIPYY 599
>gi|326802069|ref|YP_004319888.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326552833|gb|ADZ81218.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
Length = 659
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/256 (24%), Positives = 102/256 (39%), Gaps = 34/256 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + M+ ++ + T E Y D ERSL NG L G+ Y PLA
Sbjct: 335 ETCASVGMVFWNQRMNLLTGEAKYFDILERSLYNGALDGLSYSGNR--FFYGNPLASHGG 392
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR--LDWKS 522
RS +GT CC LGD IY + V++ ++ S+ +
Sbjct: 393 YGRS-EWFGTA-----CCPSNIARLVESLGDYIYAHSD---KAVWVNLFVGSKAAIPLSQ 443
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW---------------TS 567
G + + Q+ D +RVT K L++RIP W T+
Sbjct: 444 GTVEIAQQTGYPWQGDVNIRVTPDRKRK-----FPLHIRIPGWLLGQPAPGDTYRFLDTT 498
Query: 568 SNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAI 627
N +NG+++P ++ + + W +D ++IQ+PL ++ A D + A+
Sbjct: 499 ENKYTLQVNGKNVPYHIEKGYVVIDRIWDKNDAVSIQMPLEVKKIAANDQVVANKNRIAL 558
Query: 628 LYGPYVLAGHSIGDWD 643
GP V + + D
Sbjct: 559 QRGPLVYCVEQVDNQD 574
>gi|331665212|ref|ZP_08366113.1| putative cytoplasmic protein [Escherichia coli TA143]
gi|432767960|ref|ZP_20002352.1| hypothetical protein A1S9_00751 [Escherichia coli KTE50]
gi|432964211|ref|ZP_20153463.1| hypothetical protein A15E_04412 [Escherichia coli KTE202]
gi|433065055|ref|ZP_20251959.1| hypothetical protein WIO_03878 [Escherichia coli KTE125]
gi|331057722|gb|EGI29708.1| putative cytoplasmic protein [Escherichia coli TA143]
gi|431321992|gb|ELG09585.1| hypothetical protein A1S9_00751 [Escherichia coli KTE50]
gi|431469844|gb|ELH49772.1| hypothetical protein A15E_04412 [Escherichia coli KTE202]
gi|431578217|gb|ELI50831.1| hypothetical protein WIO_03878 [Escherichia coli KTE125]
Length = 654
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGNSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|315644006|ref|ZP_07897176.1| hypothetical protein PVOR_00550 [Paenibacillus vortex V453]
gi|315280381|gb|EFU43670.1| hypothetical protein PVOR_00550 [Paenibacillus vortex V453]
Length = 653
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 109/517 (21%), Positives = 198/517 (38%), Gaps = 85/517 (16%)
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLEALI 239
V +L A+A A + L+E++ ++ ++A Q+ GYL+ + T E R L
Sbjct: 79 VAKWLEAAAYSLAIHPDPKLEEQVDQLIDLVAAAQQP--DGYLNTYFTVKEPEKRWTNLT 136
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
Y H + AG+ Y + L + + +Y + +V + H +
Sbjct: 137 DCHELYCAGHMMEAGVA-HYLATGKRKLLDVVCRLADY----IDSVFGPEDGKIHGFDGH 191
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLALQADDISGFHSN---- 350
+E + L KL+ +T++P++L L+ F +P F L + F+S+
Sbjct: 192 QE---IELALVKLYEVTREPRYLSLSQYFIDVRGTEPHFF-LQEWEQRGRKSFYSSVANP 247
Query: 351 -------THIPI-----VIGSQMR----YEVTGD--------------------QLHKEG 374
+H+P+ +G +R Y D +HK+
Sbjct: 248 PHLPYHQSHLPVREQREAVGHSVRAVYMYTAMADLAARTKDPALLEACENLWFNMVHKQM 307
Query: 375 HQLESSGTNIGHFNFKSDPKRLASNLDSNT--EESCTTYNMLKVSRHLFRWTKEIAYADY 432
+ G+ F +D +L ++T E+C + ++ +R + + YAD
Sbjct: 308 YITGGIGSTHHGEAFTTD-----YDLPNDTVYAETCASIGLIFFARRMLELAPKSEYADV 362
Query: 433 YERSLTNGVLG--IQRGTEPGVMIYLLPLA--PGSSKERSYHHWGTPSDSFW----CCYG 484
ER+L N V+G Q G Y+ PL P + + P W CC
Sbjct: 363 MERALFNTVIGSMAQDGRH---FFYVNPLEVWPAACRHNPGKFHVKPVRPGWFACACCPP 419
Query: 485 TGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVT 544
S LG+ +Y E +Y Y+ + G + V + + W+ VT
Sbjct: 420 NVARLLSSLGEYVYTMNEDT---LYTHLYMGGEASVQFGDVPVKVIQNSALPWNG--DVT 474
Query: 545 LTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS--PGNFLSVTKTWSSDDKLT 602
LT + + ++ LR+P W S A LNG+D+ + ++ + + W+ D L
Sbjct: 475 LTIQPE-KAVEWTVALRMPDW-SRGKADLRLNGEDVSIEDVMKDGYVYIKRVWAPGDTLE 532
Query: 603 IQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSI 639
++L + + + A AI GP V S+
Sbjct: 533 LELSMEIHQVRANPNIRANAGKAAIQRGPLVYCLESV 569
>gi|429738051|ref|ZP_19271876.1| hypothetical protein HMPREF9151_00303 [Prevotella saccharolytica
F0055]
gi|429161156|gb|EKY03584.1| hypothetical protein HMPREF9151_00303 [Prevotella saccharolytica
F0055]
Length = 603
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 118/285 (41%), Gaps = 43/285 (15%)
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA---- 460
+E+C T +K+SR L T YAD E+SL N +LG R Y PL+
Sbjct: 298 QETCVTATWIKLSRQLLMLTGNSKYADAIEQSLYNALLGAMRPDGSDWAKYT-PLSGQRL 356
Query: 461 PGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEE-EGKY-----PGVYIIQYI 514
PGS + CC +G + + + EG PG Y +Q
Sbjct: 357 PGSEQ---------CGMGLNCCTASGPRGLFVIPQTAVMQSSEGAVVNLYIPGTYTLQSP 407
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
++ +V Q P + + F ++ T L+LRIP W+ + +
Sbjct: 408 KNKT-----VTLVQQGEYPKTG-----NMRIVFQAQQPEEMT-LSLRIPAWSKTT--RVA 454
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
+NGQ++ G++L + + WS+ D++ + + + + + + P+Y AI GP VL
Sbjct: 455 VNGQEVSAVRSGSYLQINRQWSAGDRVELTMDMQAQLHFMGTN-PQYL---AITRGPVVL 510
Query: 635 AGHS-IGDWDITESATSLSDW-----ITPIPASYNSQLITFTQEY 673
+ + D+ T D +TP+ A + +TF ++
Sbjct: 511 THDARLSGADVQAVITPAEDKNGHLELTPVTAKDPNIWMTFKAQF 555
>gi|432720730|ref|ZP_19955692.1| hypothetical protein WCK_04370 [Escherichia coli KTE9]
gi|432794804|ref|ZP_20028883.1| hypothetical protein A1US_04040 [Escherichia coli KTE78]
gi|432796321|ref|ZP_20030359.1| hypothetical protein A1UU_01028 [Escherichia coli KTE79]
gi|431259905|gb|ELF52266.1| hypothetical protein WCK_04370 [Escherichia coli KTE9]
gi|431336741|gb|ELG23843.1| hypothetical protein A1US_04040 [Escherichia coli KTE78]
gi|431348554|gb|ELG35405.1| hypothetical protein A1UU_01028 [Escherichia coli KTE79]
Length = 654
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 89/216 (41%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKTLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ ++ +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGMLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVGQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|266624999|ref|ZP_06117934.1| putative cytoplasmic protein, partial [Clostridium hathewayi DSM
13479]
gi|288863113|gb|EFC95411.1| putative cytoplasmic protein [Clostridium hathewayi DSM 13479]
Length = 323
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/238 (22%), Positives = 95/238 (39%), Gaps = 15/238 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ +R + + + YAD ER L NGVL G+ + + L +
Sbjct: 3 DTAYAETCASVGLVFFARRMLQIRPDAQYADVMERVLYNGVLSGMALDGKSFFYVNPLEV 62
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + P W CC S +G Y E+E ++I YI
Sbjct: 63 VPEACHRDERKSHVKPVRQKWFGCACCPPNVARLLSSVGSYAYTEKEDT---IFIHLYIG 119
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
+ L + + K+ W+ + V + KG ++ IP W + + +
Sbjct: 120 AILKKQINGKEMEVKIQSEFPWNGKVNVYV----KGVREVCTIAFHIPEWGEAYQL-SKI 174
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
NG + + +L VTK W ++++ +Q P+ +R E A++ GP V
Sbjct: 175 NGATIKVKE--RYLYVTKKWEEEEEIHLQFPMEVRLIEANPFVRENIGKNAVMRGPLV 230
>gi|256420772|ref|YP_003121425.1| hypothetical protein Cpin_1728 [Chitinophaga pinensis DSM 2588]
gi|256035680|gb|ACU59224.1| protein of unknown function DUF1680 [Chitinophaga pinensis DSM
2588]
Length = 675
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 104/485 (21%), Positives = 186/485 (38%), Gaps = 75/485 (15%)
Query: 169 GWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP 228
GWEE L G YL A+ LK+K+ V+ Q++ SGY
Sbjct: 82 GWEETPYWLDGALPLAYLLDDAV---------LKDKVLRYVNWTMDHQRK--SGYFGPLT 130
Query: 229 TEQFDR---LEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNV 285
+ R ++A + ++ +L QY A E R+ +M YF R Q
Sbjct: 131 NAEITRQVDIDAAHAAEGEDWWPKMVMLKVLQQYYSA--TEDKRVIKFMSRYF--RYQLE 186
Query: 286 IKKYSIERHWQTLNEEAGGMNDVLYK-LFCITQDPKHLMLAHLFDKPCFLGLLALQADD- 343
K + W + G N ++ + L+ IT+D L LA ++ F D
Sbjct: 187 ALKVAPVGKWTEWAQSRGAENVMMAQWLYSITEDDYLLELAETIEQQSFPWTTWFGNRDW 246
Query: 344 ---ISGFHSNTH------IPIVIGSQ---MRYEVTGDQLH----KEGHQ--LESSGTNIG 385
+ + +NT + + +G + + Y+ TG Q + + G Q + G +G
Sbjct: 247 VINTTTYRNNTQWMNRHAVNVAMGLKAPAVNYQRTGKQEYLQHLRTGWQDLMTIHGLPMG 306
Query: 386 HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGV---- 441
F+ D L N + E C + ++ T ++ Y D E+ N +
Sbjct: 307 IFSGDED---LNGNDPTQGVELCAIVEAMYSLENISAITGDVFYMDALEKMAFNALPTQT 363
Query: 442 -----------LGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESF 490
+ Q GV + LP +R + + CC + +
Sbjct: 364 TDDYNEKQYFQVANQLQISKGVFNFSLPF------DREMCNVLGARSGYTCCLANMHQGW 417
Query: 491 SKLGDSIYFEEEGKYPGVYIIQY----ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLT 546
+K ++++ GK GV ++Y +++ + K + + + D + + ++ +
Sbjct: 418 TKYTSHLWYQTSGK--GVAALEYGPCVMTAEVGKKHRDVTITEVTDYPFNEEIRFQIAIK 475
Query: 547 FSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLP 606
++ L LRIP W N A LNGQ L G +++ + W D+LT+QLP
Sbjct: 476 KETE-----FPLQLRIPAW--CNEAVILLNGQPLRKDKGGQIITIEREWQDKDELTLQLP 528
Query: 607 LTLRT 611
+T+ T
Sbjct: 529 MTITT 533
>gi|383189042|ref|YP_005199170.1| hypothetical protein Rahaq2_1140 [Rahnella aquatilis CIP 78.65 =
ATCC 33071]
gi|371587300|gb|AEX51030.1| hypothetical protein Rahaq2_1140 [Rahnella aquatilis CIP 78.65 =
ATCC 33071]
Length = 657
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + + YAD ER+L N VL G+ + + L +
Sbjct: 334 DTAYTETCASIGLMMFANRMLQMDADSRYADVMERALYNTVLAGMALDGKHFFYVNPLEV 393
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P S + P W CC + LG IY + GV I YI
Sbjct: 394 HPKSIPFNHIYDHVKPVRQRWFGCACCPPNIARLLASLGHYIYTQRPD---GVDINLYIG 450
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S +D G + K W RV + + L +L LR+P W S + TL
Sbjct: 451 SDVDATIGGKALRLKQSGGYPWAE--RVLIEIDTD-QPLEATLALRLPDWCGS--PQVTL 505
Query: 576 NGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTL 609
NG L L S +L +T+ W D++ + LP+ +
Sbjct: 506 NGHPLELASLTQRGYLRLTQEWQKGDRIEMTLPMPV 541
>gi|331675072|ref|ZP_08375829.1| putative cytoplasmic protein [Escherichia coli TA280]
gi|331067981|gb|EGI39379.1| putative cytoplasmic protein [Escherichia coli TA280]
Length = 662
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DTVYAESCASIGLMMFARRMLEMEGNSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 396 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 452
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 453 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 507
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 508 LNGEEVEQDIRKGYLHITREWQEGDMLNLTLPMPVR 543
>gi|432394191|ref|ZP_19637011.1| hypothetical protein WE9_04517 [Escherichia coli KTE21]
gi|430914340|gb|ELC35436.1| hypothetical protein WE9_04517 [Escherichia coli KTE21]
Length = 656
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRISGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVGQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|334121751|ref|ZP_08495800.1| protein of hypothetical function DUF1680 [Enterobacter hormaechei
ATCC 49162]
gi|333392772|gb|EGK63868.1| protein of hypothetical function DUF1680 [Enterobacter hormaechei
ATCC 49162]
Length = 657
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/240 (22%), Positives = 96/240 (40%), Gaps = 17/240 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DTVYAESCASIGLMMFARRMLEMEADGHYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ-Y 513
P + + P W CC + LG IY P +I Y
Sbjct: 396 VHPKTLAFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTVR----PDALLINLY 451
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ + + G ++ ++ W +++ +T +T +L LR+P W +
Sbjct: 452 VGNDVAIPVGDNILQLRISGNYPWHEQVKIEITSPVP---VTHTLALRLPDWCAE--PAV 506
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+LNG+ + +L + ++W D L++ LP+ +R + A A+ GP V
Sbjct: 507 SLNGEAITGEVSRGYLYLNRSWQEGDTLSLTLPMPVRRVYGNPQVRQQAGKVALQRGPLV 566
>gi|299145521|ref|ZP_07038589.1| putative cytoplasmic protein [Bacteroides sp. 3_1_23]
gi|298516012|gb|EFI39893.1| putative cytoplasmic protein [Bacteroides sp. 3_1_23]
Length = 698
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 98/230 (42%), Gaps = 29/230 (12%)
Query: 393 PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG---- 447
P +L +N N E+C + + + T + YAD E L N VL GI
Sbjct: 370 PYQLPNNTAHN--ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKY 427
Query: 448 --TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGK 504
T P + LP KER T S +CC + + + + Y EG
Sbjct: 428 FYTNPLRISADLPYTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGI 481
Query: 505 YPGVYIIQYISSRLDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIP 563
Y +Y +++ WK G++ + Q+ D WD +RVTL + G T SL LRIP
Sbjct: 482 YCNLYGANTLTT--TWKEKGEVALTQETD--YPWDGNVRVTLDKVPRKVG-TFSLFLRIP 536
Query: 564 TWTSSNGAKATL--NGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLR 610
W KATL NGQ L + + N + V + W D + + + + +R
Sbjct: 537 EWCE----KATLRVNGQPLQVNAKANSYAEVNRAWKKGDIVELMMDMPVR 582
>gi|336416221|ref|ZP_08596557.1| hypothetical protein HMPREF1017_03665 [Bacteroides ovatus
3_8_47FAA]
gi|335938952|gb|EGN00831.1| hypothetical protein HMPREF1017_03665 [Bacteroides ovatus
3_8_47FAA]
Length = 698
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 98/230 (42%), Gaps = 29/230 (12%)
Query: 393 PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG---- 447
P +L +N N E+C + + + T + YAD E L N VL GI
Sbjct: 370 PYQLPNNTAHN--ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKY 427
Query: 448 --TEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGK 504
T P + LP KER T S +CC + + + + Y EG
Sbjct: 428 FYTNPLRISADLPYTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGI 481
Query: 505 YPGVYIIQYISSRLDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIP 563
Y +Y +++ WK G++ + Q+ D WD +RVTL + G T SL LRIP
Sbjct: 482 YCNLYGANTLTT--TWKEKGEVALTQETD--YPWDGNVRVTLDKVPRKVG-TFSLFLRIP 536
Query: 564 TWTSSNGAKATL--NGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLR 610
W KATL NGQ L + + N + V + W D + + + + +R
Sbjct: 537 EWCE----KATLRVNGQPLQVNAKANSYAEVNRAWKKGDIVELMMDMPVR 582
>gi|432618844|ref|ZP_19854944.1| hypothetical protein A1UM_04296 [Escherichia coli KTE75]
gi|431151056|gb|ELE52093.1| hypothetical protein A1UM_04296 [Escherichia coli KTE75]
Length = 659
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKTLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|331685249|ref|ZP_08385835.1| putative cytoplasmic protein [Escherichia coli H299]
gi|450194438|ref|ZP_21892361.1| hypothetical protein A364_18755 [Escherichia coli SEPT362]
gi|331077620|gb|EGI48832.1| putative cytoplasmic protein [Escherichia coli H299]
gi|449316669|gb|EMD06777.1| hypothetical protein A364_18755 [Escherichia coli SEPT362]
Length = 656
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKTLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|333378296|ref|ZP_08470027.1| hypothetical protein HMPREF9456_01622 [Dysgonomonas mossii DSM
22836]
gi|332883272|gb|EGK03555.1| hypothetical protein HMPREF9456_01622 [Dysgonomonas mossii DSM
22836]
Length = 826
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 63/261 (24%), Positives = 113/261 (43%), Gaps = 43/261 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + +F K+ Y D E SL N VL G+ E Y+ PLA +
Sbjct: 349 ETCAAVGNVFFNHRMFLLEKDGKYMDVAEVSLLNNVLAGVN--LEGNKFFYVNPLASDGT 406
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK--S 522
+RSY +GT CC ++ +Y + + ++ Y S++D+ S
Sbjct: 407 VDRSYW-FGTA-----CCPTNLARLIPQISGLMYAHTDNE---IFCSFYTGSKVDFALTS 457
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS------------NG 570
G++ + QK + +D + LT + + + T S+ +RIPTW S N
Sbjct: 458 GKVALEQKTN--YPFDE--SIVLTVNPEKNDQTFSIKMRIPTWVGSQFVPGKLYSYVDNN 513
Query: 571 AKA-----------TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR-TEAIQDDR 618
+KA L+ + + F+S+++ W DK+ ++LP+ +R + AI + +
Sbjct: 514 SKAWELYINDKKVGNLSFKKGEVSLDKGFVSISRKWKKGDKVELKLPMPVRYSHAINEVK 573
Query: 619 PEYASIQAILYGPYVLAGHSI 639
+ + AI GP V +
Sbjct: 574 ADNDRV-AITRGPLVYCAEGV 593
>gi|218707221|ref|YP_002414740.1| hypothetical protein ECUMN_4099 [Escherichia coli UMN026]
gi|293407210|ref|ZP_06651134.1| conserved hypothetical protein [Escherichia coli FVEC1412]
gi|298382958|ref|ZP_06992553.1| hypothetical protein ECFG_04115 [Escherichia coli FVEC1302]
gi|419934131|ref|ZP_14451275.1| hypothetical protein EC5761_10490 [Escherichia coli 576-1]
gi|432355611|ref|ZP_19598877.1| hypothetical protein WCA_04610 [Escherichia coli KTE2]
gi|432403987|ref|ZP_19646731.1| hypothetical protein WEK_04194 [Escherichia coli KTE26]
gi|432428252|ref|ZP_19670733.1| hypothetical protein A139_03650 [Escherichia coli KTE181]
gi|432462951|ref|ZP_19705084.1| hypothetical protein A15I_03831 [Escherichia coli KTE204]
gi|432477946|ref|ZP_19719933.1| hypothetical protein A15Q_04149 [Escherichia coli KTE208]
gi|432519807|ref|ZP_19756986.1| hypothetical protein A17U_02789 [Escherichia coli KTE228]
gi|432539967|ref|ZP_19776859.1| hypothetical protein A195_03603 [Escherichia coli KTE235]
gi|432633483|ref|ZP_19869403.1| hypothetical protein A1UW_03877 [Escherichia coli KTE80]
gi|432643180|ref|ZP_19879004.1| hypothetical protein A1W1_04060 [Escherichia coli KTE83]
gi|432668175|ref|ZP_19903747.1| hypothetical protein A1Y3_04798 [Escherichia coli KTE116]
gi|432772362|ref|ZP_20006675.1| hypothetical protein A1SG_00431 [Escherichia coli KTE54]
gi|432889014|ref|ZP_20102658.1| hypothetical protein A31C_04403 [Escherichia coli KTE158]
gi|432915187|ref|ZP_20120514.1| hypothetical protein A13Q_04153 [Escherichia coli KTE190]
gi|433020828|ref|ZP_20208923.1| hypothetical protein WI7_03757 [Escherichia coli KTE105]
gi|433055258|ref|ZP_20242416.1| hypothetical protein WIK_04060 [Escherichia coli KTE122]
gi|433069946|ref|ZP_20256714.1| hypothetical protein WIQ_03826 [Escherichia coli KTE128]
gi|433160742|ref|ZP_20345560.1| hypothetical protein WKU_03818 [Escherichia coli KTE177]
gi|433180460|ref|ZP_20364837.1| hypothetical protein WGM_04099 [Escherichia coli KTE82]
gi|218434318|emb|CAR15240.1| conserved hypothetical protein [Escherichia coli UMN026]
gi|291426021|gb|EFE99055.1| conserved hypothetical protein [Escherichia coli FVEC1412]
gi|298276794|gb|EFI18312.1| hypothetical protein ECFG_04115 [Escherichia coli FVEC1302]
gi|388409694|gb|EIL69966.1| hypothetical protein EC5761_10490 [Escherichia coli 576-1]
gi|430872588|gb|ELB96188.1| hypothetical protein WCA_04610 [Escherichia coli KTE2]
gi|430923400|gb|ELC44137.1| hypothetical protein WEK_04194 [Escherichia coli KTE26]
gi|430951024|gb|ELC70250.1| hypothetical protein A139_03650 [Escherichia coli KTE181]
gi|430986214|gb|ELD02797.1| hypothetical protein A15I_03831 [Escherichia coli KTE204]
gi|431002149|gb|ELD17675.1| hypothetical protein A15Q_04149 [Escherichia coli KTE208]
gi|431048059|gb|ELD58044.1| hypothetical protein A17U_02789 [Escherichia coli KTE228]
gi|431067015|gb|ELD75632.1| hypothetical protein A195_03603 [Escherichia coli KTE235]
gi|431167666|gb|ELE67931.1| hypothetical protein A1UW_03877 [Escherichia coli KTE80]
gi|431177575|gb|ELE77497.1| hypothetical protein A1W1_04060 [Escherichia coli KTE83]
gi|431198006|gb|ELE96833.1| hypothetical protein A1Y3_04798 [Escherichia coli KTE116]
gi|431323599|gb|ELG11078.1| hypothetical protein A1SG_00431 [Escherichia coli KTE54]
gi|431413832|gb|ELG96595.1| hypothetical protein A31C_04403 [Escherichia coli KTE158]
gi|431436255|gb|ELH17862.1| hypothetical protein A13Q_04153 [Escherichia coli KTE190]
gi|431526942|gb|ELI03673.1| hypothetical protein WI7_03757 [Escherichia coli KTE105]
gi|431566044|gb|ELI39087.1| hypothetical protein WIK_04060 [Escherichia coli KTE122]
gi|431578915|gb|ELI51501.1| hypothetical protein WIQ_03826 [Escherichia coli KTE128]
gi|431673865|gb|ELJ40054.1| hypothetical protein WKU_03818 [Escherichia coli KTE177]
gi|431697952|gb|ELJ63031.1| hypothetical protein WGM_04099 [Escherichia coli KTE82]
Length = 654
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPLRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLCLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|336427168|ref|ZP_08607172.1| hypothetical protein HMPREF0994_03178 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336010021|gb|EGN40008.1| hypothetical protein HMPREF0994_03178 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 687
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 66/322 (20%), Positives = 117/322 (36%), Gaps = 38/322 (11%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL- 459
DS E+C + ++ +R + YAD E++L NG+L + Y+ PL
Sbjct: 355 DSAYAETCASIGLVFFARRMLEIKASSKYADVMEKALYNGILS-GMALDGKSFFYVNPLE 413
Query: 460 ---APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
ER +H P W CC S + Y E E +Y+
Sbjct: 414 SLPEACHKDERKFHV--KPVRQKWFGCACCPPNIARLLSSIASYAYTEAED---ALYVHL 468
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS---N 569
Y+ S L+ G ++ ++ WD + + + L RIP W SS N
Sbjct: 469 YMGSVLEKDCGGKKLDIRISSDFPWDGKVMAEINAEEP---VACRLAFRIPGWCSSYTLN 525
Query: 570 GAKATLNGQDLPLPS-----PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
G K G+ + +L + + W+ +KL + P+ +R E
Sbjct: 526 GQKGLEEGETVTADGETRQVKDGYLIIDRVWNGGEKLELDFPMEVRLMQADARVREDIGK 585
Query: 625 QAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSN 684
A+ GP V + + + D ++ S P+P + + I G +T
Sbjct: 586 AAVTRGPIV---YCMEEADNGKNLQLYSLAEDPVPQAVQEEKI------GQRMVTITTKG 636
Query: 685 QSITMEKFPKSGTDAALHATFR 706
+ + P++ D L+ ++
Sbjct: 637 KKLV----PQAEEDGELYREYK 654
>gi|300898699|ref|ZP_07117012.1| conserved hypothetical protein [Escherichia coli MS 198-1]
gi|300357662|gb|EFJ73532.1| conserved hypothetical protein [Escherichia coli MS 198-1]
Length = 662
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 337 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 395
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 396 VHPKSLKFNHIYDHVKPLRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 452
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 453 GNSMEVPVENGTLCLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 507
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 508 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 543
>gi|269839244|ref|YP_003323936.1| hypothetical protein Tter_2215 [Thermobaculum terrenum ATCC
BAA-798]
gi|269790974|gb|ACZ43114.1| protein of unknown function DUF1680 [Thermobaculum terrenum ATCC
BAA-798]
Length = 638
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 63/266 (23%), Positives = 111/266 (41%), Gaps = 27/266 (10%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + T + YAD E +L N VL GI + + Y PL +
Sbjct: 327 ETCAAIGSVMWNWRMLLLTADARYADLIEHTLYNAVLPGIS--LDGALYFYQNPLEDEGT 384
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR--LDWKS 522
R W + CC + + LG Y G+++ Y R L +
Sbjct: 385 HRR--QEWFGCA----CCPPNVARTLASLGGYFYSTSRD---GIWVHLYSEGRAKLGLQD 435
Query: 523 G-QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP 581
G +++++Q W + + L + L + LRIP+W + +NG+D
Sbjct: 436 GREVLLSQHTS--YPWSGEVAIRLEQVPEEGEL--GIYLRIPSWCERG--EVAINGEDAA 489
Query: 582 LP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIG 640
P +PG +L + +TW + D++ ++LP+T+R E A AI+ GP + S
Sbjct: 490 TPITPGTYLELRRTWRAGDEVRLRLPMTVRRLEAHPYLSEDAGRVAIMRGPILYCIESAD 549
Query: 641 DWDITESATSLSDWITPIPASYNSQL 666
+ L D + P A+++ +L
Sbjct: 550 N-----PGVDLRDVLLPRDAAFSEEL 570
>gi|421075310|ref|ZP_15536325.1| protein of unknown function DUF1680 [Pelosinus fermentans JBW45]
gi|392526752|gb|EIW49863.1| protein of unknown function DUF1680 [Pelosinus fermentans JBW45]
Length = 650
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 97/241 (40%), Gaps = 17/241 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + + +R + + E YAD E+ L NG+L G+ + + L +
Sbjct: 328 DTVYGETCASIGAVFFARRMLEISPEGEYADVIEKELFNGILSGMSMDGKSFFYVNPLEV 387
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P +SK+ HH W CC F+ LG IY K +++ YI
Sbjct: 388 VPEASKKDQLHHHVEVERQKWFGCACCPPNIARLFASLGSYIY-SYSAKSNTLWLHLYIG 446
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
L VN V WD + +T++ + + LRIP W + + +
Sbjct: 447 GELTHTFDSQEVNFTVATNYPWDEDVEITVSLAESKE---FTYALRIPGWCKA--YEVNV 501
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDD---RPEYASIQAILYGPY 632
NG+ P + + + W + D I L + E +Q + R + + A++ GP
Sbjct: 502 NGEKTNAPIVNGYAYLQREWKNGD--VIHLHFAMPIEVMQANPRVREDLGKV-AMMRGPI 558
Query: 633 V 633
V
Sbjct: 559 V 559
>gi|417487787|ref|ZP_12172639.1| secreted protein [Salmonella enterica subsp. enterica serovar
Rubislaw str. A4-653]
gi|353632529|gb|EHC79566.1| secreted protein [Salmonella enterica subsp. enterica serovar
Rubislaw str. A4-653]
Length = 663
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 59/251 (23%), Positives = 95/251 (37%), Gaps = 27/251 (10%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERS------------LTNGVLGIQRGT 448
DS ESC + ++ +R + + YAD ER+ L N VLG
Sbjct: 329 DSIYAESCASIGLMMFARRMLEMEADSQYADVMERAREYADVMERARALYNTVLG-GMAL 387
Query: 449 EPGVMIYLLPLA--PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEE 502
+ Y+ PL P S K + P W CC + LG IY
Sbjct: 388 DGKHFFYVNPLEVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSLGHYIYTP-- 445
Query: 503 GKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRI 562
+ +YI Y+ + ++ + ++ W +++ + + +L LR+
Sbjct: 446 -RADALYINMYVGNSMEIPVENGALKLRISGNYPWHEQVKIAIDSVQP---VRHTLALRL 501
Query: 563 PTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYA 622
P W AK TLNG ++ +L + +TW D +T+ LP+ +R A
Sbjct: 502 PDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMPVRRVYGNPLARHVA 559
Query: 623 SIQAILYGPYV 633
AI GP V
Sbjct: 560 GKVAIQRGPLV 570
>gi|332666559|ref|YP_004449347.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332335373|gb|AEE52474.1| protein of unknown function DUF1680 [Haliscomenobacter hydrossis
DSM 1100]
Length = 656
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 103/248 (41%), Gaps = 37/248 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + M+ ++ + R T + + D E+SL NG L G+ + Y PLA +
Sbjct: 335 ETCASVGMVFWNQRMNRLTGQTKFIDVLEKSLYNGALDGLSLAGDR--FFYGNPLASSGT 392
Query: 465 KERSYHHW-GTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR--LDWK 521
R W GT CC + LGD IY + +Y+ ++ S +D
Sbjct: 393 HFR--REWFGTA-----CCPSNIARLIASLGDYIYASDP---QSIYVNLFVGSNTTIDLA 442
Query: 522 SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN-GAKA------- 573
G++ + Q+ + W +++T+ S +L +R+P W N GA A
Sbjct: 443 KGKVEIRQETE--YPWKGLIKLTVNPEKAQS---FALKIRLPGWAKGNPGAGALYKFLDE 497
Query: 574 --------TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
+NGQ L +L V + W+ D + + L + +R +D+ + +
Sbjct: 498 GPTNFATLKVNGQAQNLKLDNGYLIVERNWNKGDVVELNLAMPIRRVVARDEVKDNENRM 557
Query: 626 AILYGPYV 633
A+ GP V
Sbjct: 558 ALQRGPLV 565
>gi|387831475|ref|YP_003351412.1| hypothetical protein ECSF_3422 [Escherichia coli SE15]
gi|432399540|ref|ZP_19642313.1| hypothetical protein WEI_04484 [Escherichia coli KTE25]
gi|432408662|ref|ZP_19651364.1| hypothetical protein WEO_03868 [Escherichia coli KTE28]
gi|432502151|ref|ZP_19743901.1| hypothetical protein A177_04267 [Escherichia coli KTE216]
gi|432696461|ref|ZP_19931652.1| hypothetical protein A31I_03948 [Escherichia coli KTE162]
gi|432725058|ref|ZP_19959971.1| hypothetical protein WE1_04112 [Escherichia coli KTE17]
gi|432729639|ref|ZP_19964512.1| hypothetical protein WE3_04113 [Escherichia coli KTE18]
gi|432743329|ref|ZP_19978043.1| hypothetical protein WEE_04046 [Escherichia coli KTE23]
gi|432922799|ref|ZP_20125572.1| hypothetical protein A133_04523 [Escherichia coli KTE173]
gi|432929459|ref|ZP_20130509.1| hypothetical protein A135_04587 [Escherichia coli KTE175]
gi|432983040|ref|ZP_20171809.1| hypothetical protein A15W_04187 [Escherichia coli KTE211]
gi|432992699|ref|ZP_20181347.1| hypothetical protein A179_04489 [Escherichia coli KTE217]
gi|433098416|ref|ZP_20284583.1| hypothetical protein WK3_03618 [Escherichia coli KTE139]
gi|433107854|ref|ZP_20293813.1| hypothetical protein WK7_03720 [Escherichia coli KTE148]
gi|433112834|ref|ZP_20298684.1| hypothetical protein WK9_03711 [Escherichia coli KTE150]
gi|281180632|dbj|BAI56962.1| conserved hypothetical protein [Escherichia coli SE15]
gi|430912702|gb|ELC33874.1| hypothetical protein WEI_04484 [Escherichia coli KTE25]
gi|430926036|gb|ELC46624.1| hypothetical protein WEO_03868 [Escherichia coli KTE28]
gi|431025819|gb|ELD38905.1| hypothetical protein A177_04267 [Escherichia coli KTE216]
gi|431231105|gb|ELF26873.1| hypothetical protein A31I_03948 [Escherichia coli KTE162]
gi|431262277|gb|ELF54267.1| hypothetical protein WE1_04112 [Escherichia coli KTE17]
gi|431270780|gb|ELF61923.1| hypothetical protein WE3_04113 [Escherichia coli KTE18]
gi|431281486|gb|ELF72389.1| hypothetical protein WEE_04046 [Escherichia coli KTE23]
gi|431435293|gb|ELH16905.1| hypothetical protein A133_04523 [Escherichia coli KTE173]
gi|431440867|gb|ELH22195.1| hypothetical protein A135_04587 [Escherichia coli KTE175]
gi|431488798|gb|ELH68428.1| hypothetical protein A15W_04187 [Escherichia coli KTE211]
gi|431490717|gb|ELH70325.1| hypothetical protein A179_04489 [Escherichia coli KTE217]
gi|431612416|gb|ELI81663.1| hypothetical protein WK3_03618 [Escherichia coli KTE139]
gi|431623752|gb|ELI92378.1| hypothetical protein WK7_03720 [Escherichia coli KTE148]
gi|431625172|gb|ELI93765.1| hypothetical protein WK9_03711 [Escherichia coli KTE150]
Length = 657
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCIQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|386621273|ref|YP_006140853.1| hypothetical protein ECNA114_3739 [Escherichia coli NA114]
gi|432423998|ref|ZP_19666535.1| hypothetical protein A137_04431 [Escherichia coli KTE178]
gi|432560859|ref|ZP_19797513.1| hypothetical protein A1S7_04513 [Escherichia coli KTE49]
gi|432707936|ref|ZP_19943011.1| hypothetical protein WCG_01214 [Escherichia coli KTE6]
gi|432891143|ref|ZP_20103901.1| hypothetical protein A31K_00995 [Escherichia coli KTE165]
gi|333971774|gb|AEG38579.1| Hypothetical protein ECNA114_3739 [Escherichia coli NA114]
gi|430941626|gb|ELC61768.1| hypothetical protein A137_04431 [Escherichia coli KTE178]
gi|431088585|gb|ELD94458.1| hypothetical protein A1S7_04513 [Escherichia coli KTE49]
gi|431254890|gb|ELF48151.1| hypothetical protein WCG_01214 [Escherichia coli KTE6]
gi|431430258|gb|ELH12090.1| hypothetical protein A31K_00995 [Escherichia coli KTE165]
Length = 657
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCIQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|417664178|ref|ZP_12313758.1| secreted protein [Escherichia coli AA86]
gi|330909651|gb|EGH38165.1| secreted protein [Escherichia coli AA86]
Length = 657
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCIQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|432855232|ref|ZP_20083284.1| hypothetical protein A1YY_03450 [Escherichia coli KTE144]
gi|431397569|gb|ELG81016.1| hypothetical protein A1YY_03450 [Escherichia coli KTE144]
Length = 654
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGKLCLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|256419143|ref|YP_003119796.1| hypothetical protein Cpin_0089 [Chitinophaga pinensis DSM 2588]
gi|256034051|gb|ACU57595.1| protein of unknown function DUF1680 [Chitinophaga pinensis DSM
2588]
Length = 677
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 87/386 (22%), Positives = 153/386 (39%), Gaps = 45/386 (11%)
Query: 251 ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN-DVL 309
++ +L QY A + R+ T + YF ++ N + K+ ++ HW + GG N V+
Sbjct: 163 VMLKVLKQYYSATGDK--RVITLLTNYFRYQL-NELPKHPLD-HWSFWGKYRGGDNLMVV 218
Query: 310 YKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQ 369
Y L+ IT D L LA L K F A D+ + H + + ++ Q
Sbjct: 219 YWLYNITGDKFLLDLAELVHKQTFDYTEAFLHGDLLRRPFSIH-GVNLAQGIKEPGIYYQ 277
Query: 370 LHKEGHQLESSGTNIGHFNFKSD--------PKRLASNLDSNTEESCTTYNMLKVSRHLF 421
H E L++ T F + + L N + E CT M+ +
Sbjct: 278 QHPEKKYLDALQTGFKDLRFYNGMAHGLYGGDEALHGNNPTQGSELCTAVEMMFSLESIL 337
Query: 422 RWTKEIAYADYYERSLTNGVLG-----------IQRGTEPGVMIYLLPLAPGSSKERSYH 470
T ++AYAD+ E+ N + Q+ + Y+ + +
Sbjct: 338 EITGDVAYADHLEKIAFNALPAQVFENFIDRQYFQQANQVMATRYV--------RNFDQN 389
Query: 471 HWGTPS-----DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG-Q 524
H GT + CC + + K ++++ K G+ + Y S + G Q
Sbjct: 390 HAGTDVCYGLLTGYPCCTSNMHQGWPKFTQNLWYATADK--GIAALVYAPSTVTTYVGEQ 447
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS 584
V+ K + + +R T + S K S ++ +LR+P W A +NGQ S
Sbjct: 448 TPVSFKEETAYPFGESVRFTFSTSKKTSAVSFPFHLRVPAWCKQ--ATIKVNGQVF-QQS 504
Query: 585 PGN-FLSVTKTWSSDDKLTIQLPLTL 609
PGN + + ++W S D + + LP+ +
Sbjct: 505 PGNQIVKIERSWKSGDIVELILPMHI 530
>gi|312621510|ref|YP_004023123.1| hypothetical protein Calkro_0404 [Caldicellulosiruptor
kronotskyensis 2002]
gi|312201977|gb|ADQ45304.1| protein of unknown function DUF1680 [Caldicellulosiruptor
kronotskyensis 2002]
Length = 652
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 54/245 (22%), Positives = 101/245 (41%), Gaps = 20/245 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG--IQRGTEPGVMIYLLP 458
D+ E+C + ++ + L + Y D ER+L N V+G Q G + Y+ P
Sbjct: 332 DTAYAETCASVGLIFFAHRLNKIEPHAKYYDVVERALYNTVIGSMSQDGKK---YFYVNP 388
Query: 459 LA--PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
L P ++R P W CC + LG IY + G+Y+
Sbjct: 389 LEVYPKEVEKRFDRRHVKPERQPWFGCACCPPNVARLLASLGRYIY---SYNHEGIYVNL 445
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
YI S + + G + V + ++ +++ L S + L LRIP+W S +
Sbjct: 446 YIGSSVQVEVGGVKVLLQQMSSYPFEDIVKIDLKPSKEAR---FKLYLRIPSWCES--YE 500
Query: 573 ATLNG-QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+NG ++ P P ++ + + W +D++ +++P ++ + A++ GP
Sbjct: 501 VYVNGKKEEPEEPPSGYVCIERLWKENDQVILKIPTEVKMVSSHPQVRSNVGKVAVVKGP 560
Query: 632 YVLAG 636
V
Sbjct: 561 VVFCA 565
>gi|402306205|ref|ZP_10825256.1| putative glycosyhydrolase [Prevotella sp. MSX73]
gi|400379972|gb|EJP32801.1| putative glycosyhydrolase [Prevotella sp. MSX73]
Length = 816
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 72/281 (25%), Positives = 111/281 (39%), Gaps = 46/281 (16%)
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGS 463
+E+C + + + +F T E Y D YER+L NGVL G+ + Y PL
Sbjct: 346 QETCASIANVFWNYRMFLATGEGKYVDVYERALYNGVLSGVSLSGDK--FFYDNPLESMG 403
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
ER HW + CC G + F + G +Y+ YI D +G
Sbjct: 404 QHER--QHWFGCA----CCPGN-VTRFVASVPQYQYAVRGS--DIYVNLYIQGTAD-VNG 453
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT--------------SSN 569
+ Q P WD +T+T K S +L RIP W SS
Sbjct: 454 VRLAQQTRYP---WDG--DITVTVDPKRS-RRFALRFRIPGWAGACPVGTNLYHFADSSR 507
Query: 570 GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEA----IQDDRPEYASIQ 625
+NG+++ ++ + + W D++ I LP+ +R A ++DDR +Y
Sbjct: 508 PFTVKVNGREIAGEPVDGYMVIDRRWMRGDRVEISLPMEVRRVAANDNVEDDRGKY---- 563
Query: 626 AILYGP--YVLAGHSIGDWDITESATSLSDWITPIPASYNS 664
A+ GP Y L G + + + L PI A Y +
Sbjct: 564 ALERGPIVYCLEGRDQAHSTVFDKSVRLD---APIRADYRA 601
>gi|402489910|ref|ZP_10836703.1| hypothetical protein RCCGE510_19298 [Rhizobium sp. CCGE 510]
gi|401811249|gb|EJT03618.1| hypothetical protein RCCGE510_19298 [Rhizobium sp. CCGE 510]
Length = 640
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 102/237 (43%), Gaps = 30/237 (12%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + ++ + + + YAD E++L NG L G+ T+ Y PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALPGLS--TDGKTFFYDNPLESAGK 391
Query: 465 KER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
R +HH P CC + +G +Y + + V++ ++RL +G
Sbjct: 392 HHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVADDEI-AVHLYGESTARLKLANG 443
Query: 524 -----QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
Q N D V++ L+ TF+ L+LRIP W ++GA ++NG+
Sbjct: 444 AEGELQQTTNYPWDGAVAFTTRLKTPATFA---------LSLRIPDW--ADGATLSVNGE 492
Query: 579 DLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
L L + + + + W+ D++ + LPL LR + + A A++ GP V
Sbjct: 493 MLDLNANIRDGYARIDRQWADGDRVALHLPLALRPQYANPKVRQDAGRVALMRGPLV 549
>gi|329930292|ref|ZP_08283894.1| hypothetical protein HMPREF9412_2909 [Paenibacillus sp. HGF5]
gi|328935161|gb|EGG31645.1| hypothetical protein HMPREF9412_2909 [Paenibacillus sp. HGF5]
Length = 626
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/177 (24%), Positives = 81/177 (45%), Gaps = 11/177 (6%)
Query: 478 SFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSW 537
+F CC + + KL ++ +++ G+ + Y + G+ V+ +V+ +
Sbjct: 361 NFGCCTANMHQGWPKLASHLWMKDQED--GLVAVSYAPCTVRTTVGRQGVSAEVEVTGEY 418
Query: 538 DPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSS 597
RV + S + + ++LRIP W + TLNG++LP+ + + + +TW S
Sbjct: 419 PFKDRVQIHLSLE-RAESFPISLRIPAWC--DHPVITLNGRELPIQAESGYAKIVQTWQS 475
Query: 598 DDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDW 654
D L + LP+ ++TE+ R YA+ +I GP V +W + DW
Sbjct: 476 GDLLELYLPMEVKTES----RSMYAT--SITRGPLVYVLPVKENWQMIRQREMFHDW 526
>gi|423286830|ref|ZP_17265681.1| hypothetical protein HMPREF1069_00724 [Bacteroides ovatus
CL02T12C04]
gi|392674368|gb|EIY67816.1| hypothetical protein HMPREF1069_00724 [Bacteroides ovatus
CL02T12C04]
Length = 698
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 62/216 (28%), Positives = 93/216 (43%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
WK G++ + Q+ D W+ +RVTL + +G T SL LRIP W A T+N
Sbjct: 494 -TWKDKGELALTQETD--YPWEGKVRVTLDRVPRKAG-TFSLFLRIPEWCEK--ATLTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
GQ L + N + V +TW D +L + +P+ L
Sbjct: 548 GQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|329927011|ref|ZP_08281398.1| hypothetical protein HMPREF9412_4716 [Paenibacillus sp. HGF5]
gi|328938722|gb|EGG35099.1| hypothetical protein HMPREF9412_4716 [Paenibacillus sp. HGF5]
Length = 658
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 112/525 (21%), Positives = 196/525 (37%), Gaps = 76/525 (14%)
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLEALI 239
V +L A+A A+ + L+E++ ++ ++ Q+ GYL+ + T E R L
Sbjct: 79 VAKWLEAAAYSLATHPDPKLEEQVDGLIDLVADAQQP--DGYLNTYFTVKEPEKRWTNLT 136
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
Y H I AG+ A R +V + + V + H +
Sbjct: 137 DCHELYCAGHMIEAGVAHY-----RATGKRKLLDVVCRLADHIDTVFGPEDGKIHGFDGH 191
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLALQADDISGFHSNTHIP 354
+E + L KL+ +TQ+P++L L+ F +P F Q S + S H P
Sbjct: 192 QE---IELALVKLYEVTQEPRYLSLSQYFIDERGTEPHFFLQEWEQRGKKSFYRSVLHAP 248
Query: 355 IVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS---DPKRLAS------NL----- 400
+ Q V +Q GH + + + + DP L + N+
Sbjct: 249 HLAYHQSHLPVR-EQKEAVGHSVRAVYMYTAMADLAARTKDPALLEACDTLWRNMVHKQM 307
Query: 401 -----------------------DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSL 437
D+ E+C + ++ ++ + + + + YAD ER+L
Sbjct: 308 YITGGIGSTHHGEAFTTDYDLPNDTVYSETCASIGLIFFAQRMLQLSPKSEYADVMERAL 367
Query: 438 TNGVLG--IQRGTEPGVMIYLLPLA--PGSSKERSYHHWGTPSDSFW----CCYGTGIES 489
N V+G Q G Y+ PL P + + P W CC
Sbjct: 368 FNTVIGSMAQDGRH---FFYVNPLEVWPAACRYNPGKAHVKPVRPGWFACACCPPNVARL 424
Query: 490 FSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSS 549
S LG+ +Y + +Y YI + + G + V + + WD VTLT
Sbjct: 425 LSSLGEYVYTMNDDT---LYAHLYIGGEAEVRFGDVPVKVMQNSALPWDG--DVTLTLQP 479
Query: 550 KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP--SPGNFLSVTKTWSSDDKLTIQLPL 607
+ + ++ LRIP W S A +NGQ++ + + + V + W+ D + + +
Sbjct: 480 E-QAVEWTVALRIPDW-SRGKAGLRVNGQEMNVEDITQDGYACVKRVWAPGDTVELAFSM 537
Query: 608 TLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLS 652
+ + A AI GP V S+ D + S+ SL+
Sbjct: 538 EIHQVRANPNIRGNAGKAAIQRGPLVYCLESV-DHGVPVSSLSLA 581
>gi|424897290|ref|ZP_18320864.1| hypothetical protein Rleg4DRAFT_3241 [Rhizobium leguminosarum bv.
trifolii WSM2297]
gi|393181517|gb|EJC81556.1| hypothetical protein Rleg4DRAFT_3241 [Rhizobium leguminosarum bv.
trifolii WSM2297]
Length = 640
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 101/237 (42%), Gaps = 20/237 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + YAD E++L NG L G+ T+ Y PL
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALPGLS--TDGKTFFYDNPL 386
Query: 460 APGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
R +HH P CC + +G +Y + + V++ ++RL
Sbjct: 387 ESAGKHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVADDEI-AVHLYGESTTRL 438
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+G V Q+V WD + T +L+LRIP W + GA ++NG+
Sbjct: 439 KLANGAEVELQQVTNY-PWDGAVAFTTRLEKPAR---FALSLRIPDW--AEGATLSVNGE 492
Query: 579 DLPLPSP--GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
L L + + + + W+ D + + LPL+LR + + A A++ GP V
Sbjct: 493 KLDLAATMRDGYARIDRQWADGDSVALHLPLSLRPQYANPKVRQDAGRVALMRGPLV 549
>gi|212692449|ref|ZP_03300577.1| hypothetical protein BACDOR_01945 [Bacteroides dorei DSM 17855]
gi|212665028|gb|EEB25600.1| hypothetical protein BACDOR_01945 [Bacteroides dorei DSM 17855]
Length = 811
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 70/284 (24%), Positives = 120/284 (42%), Gaps = 44/284 (15%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + YAD ER+L NGV+ G+ + Y PL
Sbjct: 340 ETCASIANVYWNHRMFLATGDAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 397
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW--KS 522
ER HW + CC G I F + Y+ + VY+ YI S+ D +S
Sbjct: 398 HER--QHWFGCA----CCPGN-ITRF--VASVPYYMYATQGNDVYVNLYIQSKADIETES 448
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGA 571
+I V Q D W+ + +++T + +L +RIP W ++ A
Sbjct: 449 NKINVEQTTD--YPWNGKISISVTPEKEQE---FALRVRIPGWAQDAPVPTDLYSFTDKA 503
Query: 572 KA---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAIL 628
+A ++NG + + ++ + W + D + I LP+ +R D + AI
Sbjct: 504 QAYSISVNGSKVNAKQYDGYATLVRNWKAGDVVEINLPMEVRRVKANDQVEDDHGKLAIE 563
Query: 629 YGP--YVLAGHSIGDWDITESATSLSDWI---TPIPASYNSQLI 667
GP + L G D +T + +I TP+ AS+++ L+
Sbjct: 564 RGPIMFCLEGQDQAD------STVFNKFIPDGTPMEASFHADLL 601
>gi|154495303|ref|ZP_02034308.1| hypothetical protein PARMER_04360 [Parabacteroides merdae ATCC
43184]
gi|423722505|ref|ZP_17696681.1| hypothetical protein HMPREF1078_00744 [Parabacteroides merdae
CL09T00C40]
gi|154085227|gb|EDN84272.1| hypothetical protein PARMER_04360 [Parabacteroides merdae ATCC
43184]
gi|409242350|gb|EKN35113.1| hypothetical protein HMPREF1078_00744 [Parabacteroides merdae
CL09T00C40]
Length = 625
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 68/279 (24%), Positives = 110/279 (39%), Gaps = 57/279 (20%)
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
+T E+C T+ +++ L + T YADY E ++ N ++ + + Y
Sbjct: 318 HTMETCVTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKY------- 370
Query: 463 SSKERSYHHWGTPSDSFW--CCYGTGIESFSKLGDSIY--------------FEEEGKYP 506
S + H G CC G +F+ + Y E E P
Sbjct: 371 -SPLEGWRHEGEEQCGMHINCCNANGPRAFAMIPQFAYQVQDDCVRVNFYAPSEAELVLP 429
Query: 507 GVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
G ++ + ++ QI + +VDP +K + T +L RIP W
Sbjct: 430 GKKPVRLKQTTDYPRTDQIEI--EVDP---------------AKETAFTIAL--RIPAW- 469
Query: 567 SSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
S A ++NGQ G +L V + W D++T++L L R E QA
Sbjct: 470 -SKIAVVSVNGQPQDGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQA 521
Query: 627 ILYGPYVLAGHS-IGDWDITESATSLSD----WITPIPA 660
I+ GP VLA S GD + E++ +S +TP+ A
Sbjct: 522 IVRGPIVLARDSRFGDGFVDEASVVVSKDGYVALTPVKA 560
>gi|427384245|ref|ZP_18880750.1| hypothetical protein HMPREF9447_01783 [Bacteroides oleiciplenus YIT
12058]
gi|425727506|gb|EKU90365.1| hypothetical protein HMPREF9447_01783 [Bacteroides oleiciplenus YIT
12058]
Length = 811
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 69/301 (22%), Positives = 121/301 (40%), Gaps = 45/301 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + +F T YAD ER+L NGV+ G+ + Y PL
Sbjct: 340 ETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 397
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER HW + CC G + + +Y + +Y+ YI S+ D +
Sbjct: 398 HER--QHWFGCA----CCPGNVTRFMASVPYYMYATQGND---IYVNLYIQSKADLNTDS 448
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW-------------TSSNGA 571
V + W+ + + +T + +L RIP W T GA
Sbjct: 449 NNVALEQTTEYPWEGKVSILVTPEKEQE---FALRFRIPGWAQDAPVPTDLYSFTDKAGA 505
Query: 572 KA-TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYASIQA 626
+ ++NG+ + + ++++TW + D + I LP+ +R + ++DDR + A
Sbjct: 506 YSISVNGKKVNAKQYDGYATISRTWKAGDVVEISLPMDVRRIKANDNVEDDRGKL----A 561
Query: 627 ILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQS 686
I GP + D T + D TP+ A+Y++ L+ N VLT + +
Sbjct: 562 IERGPIMFCLEGKDQADSTVFNKFIPD-ATPMEAAYDANLL-------NGVVVLTGNAKE 613
Query: 687 I 687
+
Sbjct: 614 V 614
>gi|253575972|ref|ZP_04853305.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
gi|251844547|gb|EES72562.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
Length = 637
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 57/247 (23%), Positives = 109/247 (44%), Gaps = 26/247 (10%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
E+C + +F T+E Y D +E+ + N +LG + Y PL K
Sbjct: 317 ETCANIGNAMWAMRMFNLTQEPKYMDAFEKVVYNSLLG-SMTLDGHHFCYTNPLETRGGK 375
Query: 466 ERSYH-----HWGTP---SDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
++H H+ T + + +CC + + ++L Y + G+YI Y +
Sbjct: 376 LFNHHSPQTQHFRTARWFTHTCYCCPPQVLRTIARLHQWAYGQSN---DGLYIHLYSGNE 432
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLT-TSLNLRIPTWTSSNGAKATLN 576
L+ + + + + D T++ + S T TS++LRIP W ++GA +N
Sbjct: 433 LN---TTLSSGETLSLTMKSDFPAEETISITINNSLNTETSIHLRIPQW--ADGATVKVN 487
Query: 577 GQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEA----IQDDRPEYASIQAILYGPY 632
G G + + + W ++D++ + LP+ ++ A +++DR + A +YGP+
Sbjct: 488 GVQQGDVEAGTYHELKRKWQANDQIELLLPMRVKRIAANPMVEEDRGQVA----FMYGPF 543
Query: 633 VLAGHSI 639
V SI
Sbjct: 544 VYCLESI 550
>gi|255691741|ref|ZP_05415416.1| putative cytoplasmic protein [Bacteroides finegoldii DSM 17565]
gi|260622626|gb|EEX45497.1| hypothetical protein BACFIN_06788 [Bacteroides finegoldii DSM
17565]
Length = 700
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 95/218 (43%), Gaps = 29/218 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 383 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 442
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER+ + S +CC + + + + Y EG Y +Y +++
Sbjct: 443 YTLRWPKERTEYI------SCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 495
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL- 575
WK G++ + Q+ D WD +RVTL + +G T SL LRIP W KATL
Sbjct: 496 -TWKEKGEVALTQETD--YPWDGNIRVTLDKVPRKAG-TFSLFLRIPEWCE----KATLR 547
Query: 576 -NGQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
NGQ L + + N + V + W D +L + +P+ L
Sbjct: 548 VNGQPLQVNAKANSYAEVNRAWKKGDVVELVMDMPVRL 585
>gi|261409833|ref|YP_003246074.1| hypothetical protein GYMC10_6062 [Paenibacillus sp. Y412MC10]
gi|261286296|gb|ACX68267.1| protein of unknown function DUF1680 [Paenibacillus sp. Y412MC10]
Length = 658
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 111/525 (21%), Positives = 195/525 (37%), Gaps = 76/525 (14%)
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLEALI 239
V +L A+A A+ + L+E++ ++ ++ Q+ GYL+ + T E R L
Sbjct: 79 VAKWLEAAAYSLATHRDPKLEEQVDELIDLVADAQQP--DGYLNTYFTVKEPEKRWTNLT 136
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLN 299
Y H I AG+ A R +V + + V + H +
Sbjct: 137 DCHELYCAGHMIEAGVAHY-----RATGKRKLLDVVCRLADHIDTVFGPEDGKIHGFDGH 191
Query: 300 EEAGGMNDVLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLALQADDISGFHSNTHIP 354
+E + L KL+ +TQ+P++L L+ F +P F Q S + S H P
Sbjct: 192 QE---IELALVKLYEVTQEPRYLSLSQYFIDERGTEPHFFLQEWEQRGKKSFYRSVLHAP 248
Query: 355 IVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKS---DPKRLAS------NL----- 400
+ Q V +Q GH + + + + DP L + N+
Sbjct: 249 HLAYHQSHLPVR-EQKEAVGHSVRAVYMYTAMADLAARTKDPALLEACDTLWRNMVHKQM 307
Query: 401 -----------------------DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSL 437
D+ E+C + ++ ++ + + + + YAD ER+L
Sbjct: 308 YITGGIGSTHHGEAFTTDYDLPNDTVYSETCASIGLIFFAQRMLQLSPKSEYADVMERAL 367
Query: 438 TNGVLG--IQRGTEPGVMIYLLPLA--PGSSKERSYHHWGTPSDSFW----CCYGTGIES 489
N V+G Q G Y+ PL P + + P W CC
Sbjct: 368 FNTVIGSMAQDGRH---FFYVNPLEVWPAACRHNPGKAHVKPVRPGWFACACCPPNVARL 424
Query: 490 FSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSS 549
S LG+ +Y + +Y YI + + G + V + + WD VT T
Sbjct: 425 LSSLGEYVYTMNDDT---LYAHLYIGGEAEVRFGDVPVKVMQNSTLPWDG--DVTFTLQP 479
Query: 550 KGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP--SPGNFLSVTKTWSSDDKLTIQLPL 607
+ + ++ LRIP W S A +NGQ++ + + + V + W+ D + + +
Sbjct: 480 E-QAVEWTVALRIPDW-SRGKAGLRVNGQEMNVEDITQDGYACVKRVWAPGDTVELAFSM 537
Query: 608 TLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLS 652
+ + A AI GP V S+ D + S+ SL+
Sbjct: 538 EIHQVRANPNIRGNAGKAAIQRGPLVYCLESV-DHGVPVSSLSLA 581
>gi|432545326|ref|ZP_19782157.1| hypothetical protein A197_03919 [Escherichia coli KTE236]
gi|432550808|ref|ZP_19787564.1| hypothetical protein A199_04285 [Escherichia coli KTE237]
gi|432623948|ref|ZP_19859963.1| hypothetical protein A1UO_03835 [Escherichia coli KTE76]
gi|431071355|gb|ELD79491.1| hypothetical protein A197_03919 [Escherichia coli KTE236]
gi|431077175|gb|ELD84442.1| hypothetical protein A199_04285 [Escherichia coli KTE237]
gi|431156242|gb|ELE56979.1| hypothetical protein A1UO_03835 [Escherichia coli KTE76]
Length = 654
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKTLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCIQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|408372126|ref|ZP_11169874.1| hypothetical protein I215_14481 [Galbibacter sp. ck-I2-15]
gi|407742435|gb|EKF54034.1| hypothetical protein I215_14481 [Galbibacter sp. ck-I2-15]
Length = 664
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 106/253 (41%), Gaps = 37/253 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLPLAPGS 463
E+C + + L T ++ Y D ERSL NG+L GI GTE + P A S
Sbjct: 360 ETCAAIGDVYWNHRLHNLTGDVKYMDVLERSLYNGLLSGISLSGTE-----FFYPNALES 414
Query: 464 SKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS--SR 517
++ G+ + W CC I L + +Y +++ +++ Y++ ++
Sbjct: 415 DGTYKFNR-GSCTRQEWFDCSCCPTNMIRFLPSLPELVYSKKDDT---IFVNLYVANQAQ 470
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL-- 575
+D S +V++Q+ + WD + T+T + + +L LRIP W + TL
Sbjct: 471 IDLPSTSLVIDQQTN--YPWDGLVNFTVTPEKEAN---FTLKLRIPGWLRNEVLPGTLYQ 525
Query: 576 -------------NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYA 622
N Q + ++++ + W + L++ LP+ R D +
Sbjct: 526 YKDDMTSEFELKINDQLVDATLKDGYITINRDWKKGETLSLNLPMQPREVITNDKVEDNL 585
Query: 623 SIQAILYGPYVLA 635
A+ YGP V A
Sbjct: 586 GKLALEYGPIVYA 598
>gi|315607261|ref|ZP_07882261.1| conserved hypothetical protein [Prevotella buccae ATCC 33574]
gi|315250964|gb|EFU30953.1| conserved hypothetical protein [Prevotella buccae ATCC 33574]
Length = 813
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 72/281 (25%), Positives = 110/281 (39%), Gaps = 46/281 (16%)
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGS 463
+E+C + + + +F T E Y D YER+L NGVL G+ + Y PL
Sbjct: 343 QETCASIANVFWNYRMFLATGEGKYVDVYERALYNGVLSGVSLSGDK--FFYDNPLESMG 400
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
ER HW + CC G + F + G +Y+ YI D +G
Sbjct: 401 QHER--QHWFGCA----CCPGN-VTRFVASVPQYQYAVRGS--DIYVNLYIQGTAD-VNG 450
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT--------------SSN 569
+ Q P WD +T+T K S +L RIP W SS
Sbjct: 451 VRLAQQTRYP---WDG--DITVTVDPKRS-RRFALRFRIPGWAGACPVGTNLYHFADSSR 504
Query: 570 GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEA----IQDDRPEYASIQ 625
+NG+ + ++ + + W D++ I LP+ +R A ++DDR +Y
Sbjct: 505 PFTVKVNGRKIAGEPVDGYMVIDRRWMRGDRVEISLPMEVRRVAANDNVEDDRGKY---- 560
Query: 626 AILYGP--YVLAGHSIGDWDITESATSLSDWITPIPASYNS 664
A+ GP Y L G + + + L PI A Y +
Sbjct: 561 ALERGPIVYCLEGRDQAHSTVFDKSVRLD---APIRADYRA 598
>gi|298374271|ref|ZP_06984229.1| cytoplasmic protein [Bacteroides sp. 3_1_19]
gi|301307792|ref|ZP_07213748.1| putative cytoplasmic protein [Bacteroides sp. 20_3]
gi|423337089|ref|ZP_17314833.1| hypothetical protein HMPREF1059_00758 [Parabacteroides distasonis
CL09T03C24]
gi|298268639|gb|EFI10294.1| cytoplasmic protein [Bacteroides sp. 3_1_19]
gi|300834135|gb|EFK64749.1| putative cytoplasmic protein [Bacteroides sp. 20_3]
gi|409238277|gb|EKN31070.1| hypothetical protein HMPREF1059_00758 [Parabacteroides distasonis
CL09T03C24]
Length = 618
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 60/244 (24%), Positives = 104/244 (42%), Gaps = 23/244 (9%)
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLL 457
NLD+ E +C + M+ ++ + + T + Y D ERSL NG L GI G + Y+
Sbjct: 330 NLDAYCE-TCASVGMVLWNQRMNQLTGDSKYIDILERSLYNGALAGISLGGDR--FFYVN 386
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL R W + CC +G+ IY + +++ YI +
Sbjct: 387 PLESKGDHHR--QEWYGCA----CCPSQLSRFLPSIGNYIYASSD---DALWVNLYIGNT 437
Query: 518 LDWKSGQ--IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
+ G+ I++ Q+ D WD +++T++ S L + LRIP W + ++
Sbjct: 438 GQIRIGETDILLTQETD--YPWDGSVKLTISTSQP---LEKEIRLRIPNWCKT--YDLSI 490
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
NG+ + + + +V K W S D + + + + + A E +AI GP V
Sbjct: 491 NGKRINVSEKKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFGKRAIQRGPLVYC 549
Query: 636 GHSI 639
I
Sbjct: 550 MEEI 553
>gi|423288216|ref|ZP_17267067.1| hypothetical protein HMPREF1069_02110 [Bacteroides ovatus
CL02T12C04]
gi|392671105|gb|EIY64581.1| hypothetical protein HMPREF1069_02110 [Bacteroides ovatus
CL02T12C04]
Length = 666
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 71/307 (23%), Positives = 135/307 (43%), Gaps = 34/307 (11%)
Query: 369 QLHKEGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIA 428
+H+ G + + T H F P +L ++ N E+C T+ S LF T
Sbjct: 319 NVHRGGSETPRNATECVHEAF-GFPYQLQNSTAYN--ETCATFYGAYYSWRLFMLTGNPM 375
Query: 429 YADYYERSLTNGV--LGIQRGTE--PGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYG 484
Y D E++ N + +G+ + V+ + P S + +H T + CC
Sbjct: 376 YLDVMEKAFYNNLSSMGLDGKSYFYTNVLRWYGKQHPLLSLD--FHQRWTEECTCVCCPT 433
Query: 485 TGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK-SGQIVVNQKVDPVVSWDPYLRV 543
+ + ++ D Y ++E +++ Y S+ +D K +G+ V ++V WD ++
Sbjct: 434 SLVRFLAETKDYAYAKDEN---SLFVTLYGSNEIDTKINGKNVRFEQVTNY-PWDD--KI 487
Query: 544 TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTI 603
+ + + SL LRIP W + GA +NG D+P+ + G F V + W S DK+ +
Sbjct: 488 EMNYKGDKNA-EFSLKLRIPAW--AIGATLKVNGIDMPI-NTGVFAVVNRKWKSGDKVEL 543
Query: 604 QLPLTLRTEAIQDDRPEYASIQ---AILYGP--YVLAGHSIGDWDITESATSLSDWITPI 658
LP+ + + P+ ++ A+ YGP Y + G + + + D + P+
Sbjct: 544 VLPM---KPILNEGNPKVEEVRNQLAVSYGPLTYCVEGIDL------PNKVKIEDILLPV 594
Query: 659 PASYNSQ 665
A ++ +
Sbjct: 595 DAKFDVK 601
>gi|262275690|ref|ZP_06053499.1| putative cytoplasmic protein [Grimontia hollisae CIP 101886]
gi|262219498|gb|EEY70814.1| putative cytoplasmic protein [Grimontia hollisae CIP 101886]
Length = 660
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/216 (22%), Positives = 96/216 (44%), Gaps = 19/216 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ E+C + +L + + + + Y D ER+L N +L + Y+ PL
Sbjct: 336 DTAYTETCASVGLLMFANRMLQIESDGEYGDIMERALYNTILA-GMALDGKHFFYVNPLE 394
Query: 461 PGSSKERSYHHWG--TPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ-Y 513
+ H + P W CC + + LG I+ +E V ++ +
Sbjct: 395 VTPKVIHANHKYDHVKPVRQAWFGCSCCPTNVARTLASLGQYIFTVKED----VALLNLF 450
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
IS+ + Q + +D + + + + +++ +G ++ +RIP+W ++ A
Sbjct: 451 ISNEAKLELNQQPITLSIDANIPQSDKVSINVKDANQVNG---TIAVRIPSWCAN--MSA 505
Query: 574 TLNGQ--DLPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
TLNG+ D+ S +L +T TW++ DK+ + LP+
Sbjct: 506 TLNGKAIDVNADSKRGYLYITNTWNTGDKIEVTLPM 541
>gi|261420102|ref|YP_003253784.1| hypothetical protein GYMC61_2720 [Geobacillus sp. Y412MC61]
gi|319766914|ref|YP_004132415.1| hypothetical protein [Geobacillus sp. Y412MC52]
gi|261376559|gb|ACX79302.1| protein of unknown function DUF1680 [Geobacillus sp. Y412MC61]
gi|317111780|gb|ADU94272.1| protein of unknown function DUF1680 [Geobacillus sp. Y412MC52]
Length = 640
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/242 (23%), Positives = 102/242 (42%), Gaps = 20/242 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ +R + + YAD ER+L NG + G+ + + L +
Sbjct: 320 DTVYTETCASIALVFWARRMLELEMDGKYADVMERALYNGTISGMDLDGKRFFYVNPLEV 379
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + + H P W CC + + IY + +++ Y+
Sbjct: 380 WPKACERHDKRH-VKPVRQKWFSCACCPPNLARLIASISHYIYSQTSD---ALFVHLYVG 435
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S + + G V + WD +R+T+ S S +L LRIP W GA+ T+
Sbjct: 436 SDIQTEMGGRSVEIVQETNYPWDGKVRLTI---SPESAQEFTLGLRIPGW--GRGAEVTI 490
Query: 576 NGQDL---PLPSPGNFLSVTKTWSSDDKLTIQLPLTL-RTEAIQDDRPEYASIQAILYGP 631
NG+++ PL G + + + W D++ + P+ + R +A R + A+ GP
Sbjct: 491 NGENVDIAPLTKKG-YAYIRRVWRQGDEMVLHFPMPVERIKAHPQVRANIGKV-ALQRGP 548
Query: 632 YV 633
V
Sbjct: 549 IV 550
>gi|189464183|ref|ZP_03012968.1| hypothetical protein BACINT_00520 [Bacteroides intestinalis DSM
17393]
gi|189437973|gb|EDV06958.1| hypothetical protein BACINT_00520 [Bacteroides intestinalis DSM
17393]
Length = 812
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 68/292 (23%), Positives = 118/292 (40%), Gaps = 44/292 (15%)
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLL 457
N +N E+C + + +F T YAD ER+L NGV+ G+ + Y
Sbjct: 334 NNHTNYCETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVISGVSLSGDK--FFYDN 391
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL ER HW + CC G + + +Y + +Y+ YI S+
Sbjct: 392 PLESMGQHER--QHWFGCA----CCPGNVTRFMASVPYYMYATQGND---IYVNLYIQSK 442
Query: 518 LDWK--SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW---------- 565
D S I + Q + W+ + + +T + +L RIP W
Sbjct: 443 ADLNTDSNNIALEQTTE--YPWEGKVSILVTPEKEQE---FALRFRIPGWAQDAPVPTDL 497
Query: 566 ---TSSNGAKA-TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEY 621
T GA + ++NG+ + + ++++TW D + I LP+ +R D+ +
Sbjct: 498 YSFTDKAGAYSISVNGKKVNAKQYDGYATISRTWKVGDVVEINLPMDVRRIKANDNVEDD 557
Query: 622 ASIQAILYGP--YVLAGHSIGDWDITESATSLSDWI---TPIPASYNSQLIT 668
AI GP + L G D +T + +I TP+ ++Y++ L+
Sbjct: 558 CGKLAIERGPIMFCLEGKDQAD------STVFNKFIPDGTPMASAYDANLLN 603
>gi|435854457|ref|YP_007315776.1| hypothetical protein Halha_1747 [Halobacteroides halobius DSM 5150]
gi|433670868|gb|AGB41683.1| hypothetical protein Halha_1747 [Halobacteroides halobius DSM 5150]
Length = 655
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/215 (22%), Positives = 90/215 (41%), Gaps = 24/215 (11%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + T E +AD ER+L NG L G+ + Y+ PL
Sbjct: 344 DTAYAETCAAVGSMMWNQRMLKLTGEACFADIIERTLYNGFLSGVSLTGDK--FFYVNPL 401
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS--SR 517
+ R W S CC + L IY + E ++I QYIS +
Sbjct: 402 ESDGTHHRK--GWFKVS----CCPPNIARFLASLEKYIYLKNE---DCIFINQYISGKGK 452
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ ++++ Q D WD + + + + +L+LRIP W A +N
Sbjct: 453 VSIAEEEVIIRQ--DTAYPWDDKVNIKINLKNPSE---FTLSLRIPDWCQE--ASLQINN 505
Query: 578 QDLPLPSPGN---FLSVTKTWSSDDKLTIQLPLTL 609
Q L + S N + + + W + D++ ++ + +
Sbjct: 506 QSLEIESIINDNGYAQIRRKWRNGDQIRLEFAMPI 540
>gi|288925304|ref|ZP_06419239.1| cytoplasmic protein [Prevotella buccae D17]
gi|288338069|gb|EFC76420.1| cytoplasmic protein [Prevotella buccae D17]
Length = 813
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 96/237 (40%), Gaps = 37/237 (15%)
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGS 463
+E+C + + + +F T E Y D YER+L NGVL G+ + Y PL
Sbjct: 343 QETCASIANVFWNYRMFLATGEGKYVDVYERALYNGVLSGVSLSGDK--FFYDNPLESMG 400
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
ER HW + CC G + F + G +Y+ YI D +G
Sbjct: 401 QHER--QHWFGCA----CCPGN-VTRFVASVPQYQYAVRGS--DIYVNLYIQGTAD-VNG 450
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT--------------SSN 569
+ Q P WD +T+T K S +L RIP W SS
Sbjct: 451 VRLAQQTRYP---WDG--DITVTVDPKRS-RRFALRFRIPGWAGACPVGTNLYHFADSSR 504
Query: 570 GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEA----IQDDRPEYA 622
+NG+++ ++ + + W D++ I LP+ +R A ++DDR +YA
Sbjct: 505 PFTVKVNGREIAGEPVDGYMVIDRRWMRGDRVEISLPMEVRRVAANDNVEDDRGKYA 561
>gi|262382782|ref|ZP_06075919.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262295660|gb|EEY83591.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 618
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/242 (23%), Positives = 100/242 (41%), Gaps = 19/242 (7%)
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLL 457
NLD+ E +C + M+ ++ + + T + Y D ERSL NG L GI G + Y+
Sbjct: 330 NLDAYCE-TCASVGMVLWNQRMNQLTGDSKYIDVLERSLYNGALAGISLGGDR--FFYVN 386
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL R W + CC +G+ IY + +++ YI +
Sbjct: 387 PLESKGDHHR--QEWYGCA----CCPSQLSRFLPSIGNYIYASSD---DALWVNLYIGNT 437
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ G+ + + WD +++T++ S L + LRIP W + ++NG
Sbjct: 438 GQIRIGETDIQLTQETDYPWDGSVKLTISTSQP---LEKEIRLRIPNWCKT--YDLSING 492
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+ + + + +V K W S D + + + + + A E +AI GP V
Sbjct: 493 KRINVSEEKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFGKRAIQRGPLVYCME 551
Query: 638 SI 639
I
Sbjct: 552 EI 553
>gi|448238166|ref|YP_007402224.1| AraN-like protein [Geobacillus sp. GHH01]
gi|445207008|gb|AGE22473.1| AraN-like protein [Geobacillus sp. GHH01]
Length = 643
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 58/243 (23%), Positives = 103/243 (42%), Gaps = 22/243 (9%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ +R + + YAD ER+L NG + G+ + + L +
Sbjct: 323 DTAYAETCASIALVFWARRMLELETDGKYADVMERALYNGTISGMDLDGKKFFYVNPLEV 382
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFE-EEGKYPGVYIIQYI 514
P + + H P W CC + +G IY + + + +Y+ I
Sbjct: 383 WPKACERHDKRH-VKPVRQKWFSCACCPPNLARLIASIGHYIYSQTSDALFVHLYVGSDI 441
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ L +S +IV WD +R+T+ S G ++ LRIP W GA T
Sbjct: 442 RTELGGRSVEIVQETN----YPWDGTVRLTVLPESAGE---FTIGLRIPGW--CRGATLT 492
Query: 575 LNGQD---LPLPSPGNFLSVTKTWSSDDKLTIQLPLTL-RTEAIQDDRPEYASIQAILYG 630
+NG+ +PL G + + + W D++ + P+ + R +A R + A+ G
Sbjct: 493 INGEKVDMVPLIQKG-YAYIKRIWKKGDQVELVFPMPVERIKAHPQVRANAGKV-ALQRG 550
Query: 631 PYV 633
P V
Sbjct: 551 PIV 553
>gi|312126770|ref|YP_003991644.1| hypothetical protein Calhy_0533 [Caldicellulosiruptor
hydrothermalis 108]
gi|311776789|gb|ADQ06275.1| protein of unknown function DUF1680 [Caldicellulosiruptor
hydrothermalis 108]
Length = 654
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/245 (21%), Positives = 100/245 (40%), Gaps = 20/245 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG--IQRGTEPGVMIYLLP 458
D+ E+C + ++ + L + Y D ER+L N V+G Q G + Y+ P
Sbjct: 332 DAAYAETCASVGLIFFAHRLNKIEPHAKYYDVVERALYNTVIGSMSQDGKK---YFYVNP 388
Query: 459 LA--PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
L P ++R H P W CC + LG +Y + G+Y+
Sbjct: 389 LEVYPKEVEKRFDRHHVKPERQPWFGCACCPPNVARLLASLGRYVY---SYNHDGIYVNL 445
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
YI S + + G + V + ++ +++ L S + L LRIP W + +
Sbjct: 446 YIGSSVQVEVGGVKVLLQQVSSYPFEDMVKIDLKPSKEAR---FKLYLRIPGWCEN--YE 500
Query: 573 ATLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+NG+ + P ++ + + W +D++ +++P ++ + A++ GP
Sbjct: 501 VYVNGKKEEMQKLPSGYVCIERLWKENDQVVLKIPTEVKMVSSHPQVRSNVGKVAVVKGP 560
Query: 632 YVLAG 636
V
Sbjct: 561 VVFCA 565
>gi|295084107|emb|CBK65630.1| Uncharacterized protein conserved in bacteria [Bacteroides
xylanisolvens XB1A]
Length = 698
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 99/244 (40%), Gaps = 23/244 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
WK G++ + Q+ D W+ +RVTL + +G SL LRIP W A T+N
Sbjct: 494 -TWKDKGELALTQETD--YPWEGKVRVTLDRVPRKAG-AFSLFLRIPEWCEK--ATLTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
GQ L + N + V +TW D + + + + +R E + + GP V
Sbjct: 548 GQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRLLEAHPLAEEIRNQAVVKRGPLVYC 607
Query: 636 GHSI 639
S+
Sbjct: 608 LESM 611
>gi|424886647|ref|ZP_18310255.1| hypothetical protein Rleg10DRAFT_4682 [Rhizobium leguminosarum bv.
trifolii WSM2012]
gi|393175998|gb|EJC76040.1| hypothetical protein Rleg10DRAFT_4682 [Rhizobium leguminosarum bv.
trifolii WSM2012]
Length = 640
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 59/242 (24%), Positives = 104/242 (42%), Gaps = 30/242 (12%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + YAD E++L NG L G+ T+ Y PL
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALPGLS--TDGKTFFYDNPL 386
Query: 460 APGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
R +HH P CC + +G +Y + + V++ ++RL
Sbjct: 387 ESAGKHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVADDEI-AVHLYGESTARL 438
Query: 519 DWKSG-----QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+G Q V N D V++ L+ F+ L+LRIP W + GA
Sbjct: 439 KLANGAEVELQQVTNYPWDGAVAFATKLKTPARFA---------LSLRIPDW--AEGATL 487
Query: 574 TLNGQDLPLPSP--GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
++NG+ L L + + + + W+ D++ + LPL+LR + + A A++ GP
Sbjct: 488 SVNGERLDLGATMRDGYARLDRQWADGDRVDLFLPLSLRPQYANPKVRQDAGRVALMRGP 547
Query: 632 YV 633
V
Sbjct: 548 LV 549
>gi|255531160|ref|YP_003091532.1| hypothetical protein Phep_1254 [Pedobacter heparinus DSM 2366]
gi|255344144|gb|ACU03470.1| protein of unknown function DUF1680 [Pedobacter heparinus DSM 2366]
Length = 684
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 87/418 (20%), Positives = 161/418 (38%), Gaps = 60/418 (14%)
Query: 251 ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLY 310
I+ ++ QY A E++ +M +YF N + +KK I + W ++ G N ++
Sbjct: 167 IMLKVIQQYYSATQDESV--IPFMTKYF-NYQKEALKKCPIGK-WSEWSQSRGTDNVMMV 222
Query: 311 K-LFCITQDPKHLMLAHLFDKPCFLG----------LLALQADDISGFHSNTHIPIVIGS 359
+ L+ T+D L LA L + F + A + + S + + +G
Sbjct: 223 QWLYGHTKDESLLELAGLINSQSFAWSQWFGGRDWVINAAARPNGKKWMSRHGVNVAMGL 282
Query: 360 Q---MRYEVTGDQLHKEGHQ------LESSGTNIGHFNFKSDPKRLASNLDSNTEESCTT 410
+ + ++ TGD + + + + G G F+ D L N + E C T
Sbjct: 283 KDPAINFQRTGDSTYLKSLKTVFNDLMTLHGLPNGIFSADED---LHGNQPTQGTELCAT 339
Query: 411 YNMLKVSRHLFRWTKEIAYADYYERSLTNGV---------------LGIQRGTEPGVMIY 455
+ + T + Y D ER N + + Q GV +
Sbjct: 340 VEAMYSLEEIINITGDTHYIDALERMTFNAMPSQTTDDYHEKQYFQMANQIEISRGVFAF 399
Query: 456 LLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
LP +R + + CCY + ++K +++ + E G+ + Y
Sbjct: 400 TLPF------DRKMNCVLGAKSGYTCCYVNMHQGWTKFSQNLWHKTEN---GLAALIYGP 450
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
+ L K G + ++ V ++ ++ S K + LRIPTW A +
Sbjct: 451 NTLSTKVGAQQTDVTIEEVTNYPFEDQINFNLSLK-KAVAFPFQLRIPTWCKE--AVILI 507
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
NG+ G ++V +TW + D+LT+QLP+ + D+ +A+ GP V
Sbjct: 508 NGKIYSKEKGGKIITVNRTWQNKDRLTLQLPMEIAVSEWADNS------RAVERGPLV 559
>gi|293371493|ref|ZP_06617913.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292633530|gb|EFF52093.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 698
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 92/216 (42%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
WK G++ + Q+ D W+ +RVTL + +G SL LRIP W A T+N
Sbjct: 494 -TWKDKGELALTQETD--YPWEGKVRVTLDRVPRKAG-AFSLFLRIPEWCEK--ATLTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
GQ L + N + V +TW D +L + +P+ L
Sbjct: 548 GQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|218261883|ref|ZP_03476568.1| hypothetical protein PRABACTJOHN_02239 [Parabacteroides johnsonii
DSM 18315]
gi|218223731|gb|EEC96381.1| hypothetical protein PRABACTJOHN_02239 [Parabacteroides johnsonii
DSM 18315]
Length = 625
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/259 (23%), Positives = 101/259 (38%), Gaps = 37/259 (14%)
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
+T E+C T+ +++ L + T YADY E ++ N ++ + + Y
Sbjct: 318 HTMETCVTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKY------- 370
Query: 463 SSKERSYHHWGTPSDSFW--CCYGTGIESFSKL-GDSIYFEEEGKYPGVYIIQYISSRLD 519
S + H G CC G +F+ + G + +++ Y L
Sbjct: 371 -SPLEGWRHEGEEQCGMHINCCNANGPRAFAMIPGFAYQVQDDCVRVNFYAPSEAELVLP 429
Query: 520 WKSG----QIVVNQKVDPV-VSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
K Q + D + + DP T T + LRIP W S A +
Sbjct: 430 GKKSVWLRQTTEYPRTDQIEIEVDPTKETTFTIA-----------LRIPAW--SKIATVS 476
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
+NG+ G +L V + W D++T++L L R E QAI+ GP VL
Sbjct: 477 VNGRPEAGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQAIVRGPLVL 529
Query: 635 AGHS-IGDWDITESATSLS 652
A S GD + E++ +S
Sbjct: 530 ARDSRFGDGSVDEASVVVS 548
>gi|336402464|ref|ZP_08583200.1| hypothetical protein HMPREF0127_00513 [Bacteroides sp. 1_1_30]
gi|335948631|gb|EGN10334.1| hypothetical protein HMPREF0127_00513 [Bacteroides sp. 1_1_30]
Length = 698
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 92/216 (42%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
WK G++ + Q+ D W+ +RVTL + +G SL LRIP W A T+N
Sbjct: 494 -TWKDKGELALTQETD--YPWEGKVRVTLDRVPRKAG-AFSLFLRIPEWCEK--ATLTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
GQ L + N + V +TW D +L + +P+ L
Sbjct: 548 GQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|384256908|ref|YP_005400842.1| hypothetical protein Q7S_05115 [Rahnella aquatilis HX2]
gi|380752884|gb|AFE57275.1| hypothetical protein Q7S_05115 [Rahnella aquatilis HX2]
Length = 657
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + + YAD ER+L N VL G+ + + L +
Sbjct: 334 DTAYTETCASIGLMMFANRMLQMDSDSRYADVMERALYNTVLAGMALDGKHFFYVNPLEV 393
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P S + P W CC + LG IY + GV I YI
Sbjct: 394 HPKSIPFNHIYDHVKPVRQRWFGCACCPPNIARLLASLGHYIYTQRPD---GVDINLYIG 450
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S ++ G + K W + + + L +L LR+P W +S + TL
Sbjct: 451 SDVEATIGGKALRLKQSGGYPWAEGVLIEIDTDQP---LEATLALRLPDWCAS--PQVTL 505
Query: 576 NGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTL 609
NG L L S +L +T+ W D++ + LP+ +
Sbjct: 506 NGNPLELTSLTQRGYLRLTQEWQKGDRIEMMLPMPV 541
>gi|423343638|ref|ZP_17321351.1| hypothetical protein HMPREF1077_02781 [Parabacteroides johnsonii
CL02T12C29]
gi|409214660|gb|EKN07669.1| hypothetical protein HMPREF1077_02781 [Parabacteroides johnsonii
CL02T12C29]
Length = 625
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/259 (23%), Positives = 101/259 (38%), Gaps = 37/259 (14%)
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
+T E+C T+ +++ L + T YADY E ++ N ++ + + Y
Sbjct: 318 HTMETCVTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKY------- 370
Query: 463 SSKERSYHHWGTPSDSFW--CCYGTGIESFSKL-GDSIYFEEEGKYPGVYIIQYISSRLD 519
S + H G CC G +F+ + G + +++ Y L
Sbjct: 371 -SPLEGWRHEGEEQCGMHINCCNANGPRAFAMIPGFAYQVQDDCVRVNFYAPSEAELVLP 429
Query: 520 WKSG----QIVVNQKVDPV-VSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
K Q + D + + DP T T + LRIP W S A +
Sbjct: 430 GKKSVWLRQTTEYPRTDQIEIEVDPTKETTFTIA-----------LRIPAW--SKIATVS 476
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
+NG+ G +L V + W D++T++L L R E QAI+ GP VL
Sbjct: 477 VNGRPEAGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQAIVRGPLVL 529
Query: 635 AGHS-IGDWDITESATSLS 652
A S GD + E++ +S
Sbjct: 530 ARDSRFGDGSVDEASVVVS 548
>gi|261407601|ref|YP_003243842.1| hypothetical protein GYMC10_3802 [Paenibacillus sp. Y412MC10]
gi|261284064|gb|ACX66035.1| protein of unknown function DUF1680 [Paenibacillus sp. Y412MC10]
Length = 626
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/177 (23%), Positives = 80/177 (45%), Gaps = 11/177 (6%)
Query: 478 SFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSW 537
+F CC + + KL ++ +++ GV + Y + G+ V+ ++ +
Sbjct: 361 NFGCCTANMHQGWPKLASHLWMKDQED--GVVAVSYAPCTVRTTVGRQGVSAEIAVTGEY 418
Query: 538 DPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSS 597
R+ + S + + ++LRIP W + TLNG+++P+ + + + +TW S
Sbjct: 419 PFKDRIQIHLSLE-RAESFRISLRIPAWC--DHPVITLNGREMPIQAESGYAEIMQTWQS 475
Query: 598 DDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDW 654
D L + LP+ ++TE+ R YA+ +I GP V +W + DW
Sbjct: 476 GDLLELYLPMEVKTES----RSMYAT--SITRGPLVYVLPVKENWQMIRQREMFHDW 526
>gi|420349607|ref|ZP_14850981.1| hypothetical protein SB96558_4565 [Shigella boydii 965-58]
gi|391265984|gb|EIQ24949.1| hypothetical protein SB96558_4565 [Shigella boydii 965-58]
Length = 656
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + L + +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLSMPVR 535
>gi|194430977|ref|ZP_03063270.1| conserved hypothetical protein [Shigella dysenteriae 1012]
gi|417675158|ref|ZP_12324583.1| hypothetical protein SD15574_4764 [Shigella dysenteriae 155-74]
gi|194420432|gb|EDX36508.1| conserved hypothetical protein [Shigella dysenteriae 1012]
gi|332084488|gb|EGI89683.1| hypothetical protein SD15574_4764 [Shigella dysenteriae 155-74]
Length = 656
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + L + +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLSMPVR 535
>gi|332980748|ref|YP_004462189.1| hypothetical protein Mahau_0144 [Mahella australiensis 50-1 BON]
gi|332698426|gb|AEE95367.1| protein of unknown function DUF1680 [Mahella australiensis 50-1
BON]
Length = 647
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/240 (22%), Positives = 96/240 (40%), Gaps = 14/240 (5%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ E+C + ++ + +F+ ++ Y D ER+L N V + Y+ PL
Sbjct: 326 DTAYTETCASIGLIFWAHRMFKMDQDAKYIDVMERALYNTVFA-SMSLDGKRYFYVNPLE 384
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P +R H W CC + +G +Y +E K +++ Y+
Sbjct: 385 VWPEVCHKREDHRHVKTERQKWYDCACCPPNIARLLTSIGKYVYALDEDK-NMLFVNLYM 443
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
++ + + + D V WD + T+T + +T SL RIP W K
Sbjct: 444 DGQVKFNLNDKEIMLEQDTVYPWDGSISFTVT---SNTPVTFSLAFRIPDWCKKWSIK-- 498
Query: 575 LNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NGQ++ + +T+ W + DK+ + L + + + A AI GP V
Sbjct: 499 INGQEIQEHEKNKGYAVITRAWVAGDKVELMLDMPVMMMRANPEVRADAGKVAIQRGPVV 558
>gi|294643636|ref|ZP_06721438.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294808056|ref|ZP_06766829.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|292641013|gb|EFF59229.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294444697|gb|EFG13391.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 698
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 92/216 (42%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
WK G++ + Q+ D W+ +RVTL + +G SL LRIP W A T+N
Sbjct: 494 -TWKDKGKLALTQETD--YPWEGKVRVTLDRVPRKAG-AFSLFLRIPEWCEK--ATLTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
GQ L + N + V +TW D +L + +P+ L
Sbjct: 548 GQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|416288023|ref|ZP_11649060.1| hypothetical protein SGB_04738 [Shigella boydii ATCC 9905]
gi|320178140|gb|EFW53118.1| hypothetical protein SGB_04738 [Shigella boydii ATCC 9905]
Length = 656
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + L + +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLSMPVR 535
>gi|448238160|ref|YP_007402218.1| hypothetical protein GHH_c19490 [Geobacillus sp. GHH01]
gi|445207002|gb|AGE22467.1| hypothetical protein GHH_c19490 [Geobacillus sp. GHH01]
Length = 640
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/242 (23%), Positives = 102/242 (42%), Gaps = 20/242 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ +R + + YAD ER+L NG + G+ + + L +
Sbjct: 320 DTVYAETCASIALVFWARRMLELEMDGKYADVMERALYNGTISGMDLDGKRFFYVNPLEV 379
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + + H P W CC + +G IY + +++ Y+
Sbjct: 380 WPKACERHDKRH-VKPVRQKWFSCACCPPNLARLIASIGHYIYSQTSD---ALFVHLYVG 435
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S + + G V + WD +R+T+ S S +L LRIP W GA+ T+
Sbjct: 436 SNIQTEIGGRSVEIVQETNYPWDGTVRLTI---SPESAQEFTLGLRIPGW--CRGAEVTI 490
Query: 576 NGQDL---PLPSPGNFLSVTKTWSSDDKLTIQLPLTL-RTEAIQDDRPEYASIQAILYGP 631
NG+++ PL G + + + W D++ + + + R +A R + A+ GP
Sbjct: 491 NGENVDIAPLTKKG-YAYIRRVWRQGDEMVLHFSMPVERIKAHPQVRANAGKV-ALQRGP 548
Query: 632 YV 633
V
Sbjct: 549 IV 550
>gi|423142165|ref|ZP_17129803.1| hypothetical protein SEHO0A_03740 [Salmonella enterica subsp.
houtenae str. ATCC BAA-1581]
gi|379050094|gb|EHY67987.1| hypothetical protein SEHO0A_03740 [Salmonella enterica subsp.
houtenae str. ATCC BAA-1581]
Length = 651
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/216 (21%), Positives = 85/216 (39%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSRYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + + P W CC + LG +Y + +YI Y+
Sbjct: 388 VHPKTLTFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYLY---TPRNEALYINMYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + ++ W + +T+ S L +L LR+P W +
Sbjct: 445 GNSVEIPLENGALKLRISGNYPWQEQITITVESSQP---LRHTLALRLPEWCPQ--PQVE 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
+NGQ + +L + + W D + + LP+ +R
Sbjct: 500 VNGQPVEQDIRKGYLHIQRDWQEGDTIALTLPMPVR 535
>gi|423296614|ref|ZP_17274699.1| hypothetical protein HMPREF1070_03364 [Bacteroides ovatus
CL03T12C18]
gi|392670337|gb|EIY63822.1| hypothetical protein HMPREF1070_03364 [Bacteroides ovatus
CL03T12C18]
Length = 698
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 92/216 (42%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
WK G++ + Q+ D W+ +RVTL + +G T SL LRIP W T+N
Sbjct: 494 -TWKDKGELALTQETD--YPWEGKVRVTLDRVPRKAG-TFSLFLRIPEWCEK--TTLTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
GQ L + N + V +TW D +L + +P+ L
Sbjct: 548 GQPLQTNTKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|374385208|ref|ZP_09642716.1| hypothetical protein HMPREF9449_01102 [Odoribacter laneus YIT
12061]
gi|373226413|gb|EHP48739.1| hypothetical protein HMPREF9449_01102 [Odoribacter laneus YIT
12061]
Length = 614
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 95/229 (41%), Gaps = 17/229 (7%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + M+ ++ + E Y D ER++ NG L GI + Y+ PLA S
Sbjct: 332 ETCASVGMVFWNQRMNMLKGESRYEDVLERAMYNGALAGISLSGDR--FFYVNPLAS-SG 388
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
K +GT CC +G+ IY E V++ YI S + ++
Sbjct: 389 KHHRKAWYGTA-----CCPSQISRFLPSVGNYIYALSENT---VWVNLYIGSETEVETSG 440
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS 584
+ V K + + WD VT + + S + LRIP W K +NGQ
Sbjct: 441 VTVALKQETLYPWDG--NVTFYVNPRESK-DFKMKLRIPAWCEKYVVK--VNGQIEEGKK 495
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
++ + + W++ D + + + +T++ A A +A+ GP V
Sbjct: 496 EKGYVVIDRLWAAGDVMELNMNMTVKVVAADPRVKANAGKRALQRGPLV 544
>gi|380695298|ref|ZP_09860157.1| hypothetical protein BfaeM_15227 [Bacteroides faecis MAJ27]
Length = 698
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 93/216 (43%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YA+ E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYAELVETCLYNSVLSGISLDGKRYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y +EG Y +Y ++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLNDEGIYCNLYGANTLT-- 492
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+ WK G+IV+ Q+ D WD +RV L + +G SL RIP W A T+N
Sbjct: 493 IHWKDKGEIVLTQETD--YPWDGNVRVRLNKLPRKAG-AFSLFFRIPEWCEK--ATLTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
G+ + + + N + V + W D +LT+ +P+ L
Sbjct: 548 GEPVQIAAKANTYAEVNRIWKKGDMAELTMDMPVRL 583
>gi|298385749|ref|ZP_06995307.1| hypothetical protein HMPREF9007_02451 [Bacteroides sp. 1_1_14]
gi|298261890|gb|EFI04756.1| hypothetical protein HMPREF9007_02451 [Bacteroides sp. 1_1_14]
Length = 698
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 93/216 (43%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+WK G++ + Q+ D W+ +RVTL + +G SL RIP W A T+N
Sbjct: 494 -NWKDKGELALVQETD--YPWEGNVRVTLNKVPRKAG-AFSLFFRIPEWCGK--AALTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
GQ + + + N + V +TW D +L + +P+ L
Sbjct: 548 GQPVSMNAKANTYAEVNRTWKKGDVVELVMDMPVCL 583
>gi|298481311|ref|ZP_06999504.1| hypothetical protein HMPREF0106_01752 [Bacteroides sp. D22]
gi|298272515|gb|EFI14083.1| hypothetical protein HMPREF0106_01752 [Bacteroides sp. D22]
Length = 698
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 92/216 (42%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTTI 494
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
WK G++ + Q+ D W+ +RVTL + +G SL LRIP W A T+N
Sbjct: 495 --WKDKGELALTQETD--YPWEGKVRVTLDRVPRKAG-AFSLFLRIPEWCEK--ATLTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
GQ L + N + V +TW D +L + +P+ L
Sbjct: 548 GQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|417691895|ref|ZP_12341101.1| hypothetical protein SB521682_4174 [Shigella boydii 5216-82]
gi|332085042|gb|EGI90222.1| hypothetical protein SB521682_4174 [Shigella boydii 5216-82]
Length = 656
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/216 (23%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHLFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ + +T+ W D L + L + +R
Sbjct: 500 LNGEEVEQDIRKGYFHITREWQEGDTLNLTLSMPVR 535
>gi|209551193|ref|YP_002283110.1| hypothetical protein Rleg2_3619 [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|209536949|gb|ACI56884.1| protein of unknown function DUF1680 [Rhizobium leguminosarum bv.
trifolii WSM2304]
Length = 640
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 101/237 (42%), Gaps = 30/237 (12%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + ++ + + + YAD E++L NG L G+ T+ Y PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALPGLS--TDGKTFFYDNPLESAGK 391
Query: 465 KER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
R +HH P CC + +G +Y + + V++ ++RL +G
Sbjct: 392 HHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVADDEI-AVHLYGESTARLKLANG 443
Query: 524 -----QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
Q N D V++ L+ F+ L+LRIP W + GA ++NG+
Sbjct: 444 AEVELQQTTNYPWDGAVTFATRLKAPAKFA---------LSLRIPDW--AEGATLSVNGE 492
Query: 579 DLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
L L + + + + W+ D++ + LPL+LR + + A A++ GP V
Sbjct: 493 MLDLAANIRDGYARIDRQWTDGDRVALSLPLSLRPQYANPKVRQDAGRVALMRGPLV 549
>gi|256838374|ref|ZP_05543884.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|256739293|gb|EEU52617.1| conserved hypothetical protein [Parabacteroides sp. D13]
Length = 618
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 103/244 (42%), Gaps = 23/244 (9%)
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLL 457
NLD+ E +C + M+ ++ + + T + Y D ERSL NG L GI G + Y+
Sbjct: 330 NLDAYCE-TCASVGMVLWNQRMNQLTGDSKYIDVLERSLYNGALAGISLGGDR--FFYVN 386
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PL R W + CC +G+ IY + +++ YI +
Sbjct: 387 PLESKGDHHR--QEWYGCA----CCPSQLSRFLPSIGNYIYASSD---DALWVNLYIGNT 437
Query: 518 LDWKSGQ--IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
+ G+ I++ Q+ D WD +++T++ S L + LRIP W + ++
Sbjct: 438 GQIRIGETDILLTQETD--YPWDGSVKLTISTSQP---LEKEIRLRIPNWCKT--YDLSI 490
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
NG+ + + + +V K W S D + + + + + A E + I GP V
Sbjct: 491 NGKRINVSEEKGY-AVIKDWKSQDVIALDMDMPVEIVAADPHVKENFGKRVIQRGPLVYC 549
Query: 636 GHSI 639
I
Sbjct: 550 MEEI 553
>gi|410096807|ref|ZP_11291792.1| hypothetical protein HMPREF1076_00970 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225424|gb|EKN18343.1| hypothetical protein HMPREF1076_00970 [Parabacteroides goldsteinii
CL02T12C30]
Length = 675
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 99/472 (20%), Positives = 179/472 (37%), Gaps = 48/472 (10%)
Query: 251 ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN-DVL 309
+L ++ QY A + R+T +M YF R Q + +W E N +
Sbjct: 160 VLLKIMQQYYSATGDK--RVTDFMTRYF--RYQLETLPSTPLGNWTFWAEYRACDNLQAV 215
Query: 310 YKLFCITQDPKHLMLAHLFDKPCFLGL-LALQADDISGFHSNTHIPIVIGSQMRYEVTGD 368
Y L+ IT D L L HL K + + + L DD++ F NT + + ++ V
Sbjct: 216 YWLYNITGDAFLLDLGHLLHKQSYDFVDMFLNRDDLTRF--NTIHCVNLAQGIKEPVIYY 273
Query: 369 QLHKEGHQLESSGTNIGHF-NFKSDPKR-------LASNLDSNTEESCTTYNMLKVSRHL 420
Q H + L++ + P+ L N + E C+ ++ +
Sbjct: 274 QQHPDKKYLDAVKKGFADIRQYNGQPQGMYGGDEGLHGNNPTQGSELCSAVELMYSLEKI 333
Query: 421 FRWTKEIAYADYYERSLTNGVLG-----------IQRGTEPGVMIYLLPLAPGSSKERSY 469
T ++A+ D+ ER N + Q+ + + + ++ +
Sbjct: 334 MEITGDLAFTDHLERIAFNALPTQVTDDFMDKQYFQQANQVMITRHAHNFYEDANHAETD 393
Query: 470 HHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG---QIV 526
+GT + + CC+ + + K S+++ G+ + Y S + K G +I
Sbjct: 394 IIYGTRT-GYPCCFSNMHQGWPKFTQSLWYATPDN--GIAALAYSPSEVTAKVGNGCKIK 450
Query: 527 VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPG 586
+ ++ D +++T+ K + L+LRIP W A T+NG
Sbjct: 451 ITEET--CYPMDDKIQLTIRLLDKTKEIAFPLHLRIPGWCKE--ATVTVNGVPESTAKGN 506
Query: 587 NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITE 646
+ + +TW S D++ + LP+ + T Y + A+ GP V A W+ E
Sbjct: 507 SVAIIRRTWKSGDQVLLHLPMEVSTSKW------YENSVAVERGPLVYALKMDEKWEKKE 560
Query: 647 SATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTN--SNQSITMEKFPKSG 696
D IT SY YG F N N +T++K ++G
Sbjct: 561 FK---GDEITQFGKSYYEVTSPTKWNYGIVAFDPDNMQENFQVTIDKSKQAG 609
>gi|294673046|ref|YP_003573662.1| hypothetical protein PRU_0271 [Prevotella ruminicola 23]
gi|294472095|gb|ADE81484.1| conserved hypothetical protein [Prevotella ruminicola 23]
Length = 774
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 100/245 (40%), Gaps = 35/245 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + +F T E Y D ER+L N VL G+ + Y PL
Sbjct: 307 ETCAAIANVYWNYRMFLATGESKYIDVCERALYNNVLSGVSLSGDK--FFYDNPLESDGE 364
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER W + CC G I F + +GK +++ Y + K G
Sbjct: 365 HER--QKWFGCA----CCPGN-ITRFVASVPGYIYARQGK--DIFVNLYAQGKA--KIGN 413
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS-----------NGAK- 572
I + Q D WD +R+ +T KGSG ++ LR+P+W + + AK
Sbjct: 414 IELEQTTD--YPWDGKIRIKVT---KGSG-KFAIKLRVPSWLKTSPTNNDLYQYQDKAKT 467
Query: 573 --ATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
++NG+ L P +++ ++++W D + + P+ +R D+ + A G
Sbjct: 468 YSVSVNGKAL-YPENRDYIEISRSWKKGDTIELDFPMDVRRIVANDNAEDDRGKVAFERG 526
Query: 631 PYVLA 635
P V
Sbjct: 527 PIVFC 531
>gi|384538328|ref|YP_005722412.1| hypothetical protein SM11_pD0078 [Sinorhizobium meliloti SM11]
gi|336036981|gb|AEH82911.1| hypothetical protein SM11_pD0078 [Sinorhizobium meliloti SM11]
Length = 640
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 59/243 (24%), Positives = 102/243 (41%), Gaps = 32/243 (13%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI------ 454
D+ E+C + ++ + + + YAD E++L NG L PG+ I
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFF 381
Query: 455 YLLPLAPGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y PL R +HH P CC + +G +Y E + V++
Sbjct: 382 YDNPLESTGRHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGE 433
Query: 514 ISSRLDWKSG-QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
++RL SG ++ + Q+ + W+ + T +L+LRIP W + GA
Sbjct: 434 STARLKLASGAEVELRQETN--YPWEGAIAFTTKLDRPAK---FALSLRIPEWAA--GAT 486
Query: 573 ATLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
++NG L L + G + + + WS D++ + LPL LR + + A++ G
Sbjct: 487 LSVNGTMLDLSAHLTGGYARIEREWSDGDRVALYLPLALRPQYANPKVRQDVGRVALMRG 546
Query: 631 PYV 633
P V
Sbjct: 547 PLV 549
>gi|304316161|ref|YP_003851306.1| hypothetical protein Tthe_0663 [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
gi|302777663|gb|ADL68222.1| protein of unknown function DUF1680 [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
Length = 673
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 98/239 (41%), Gaps = 14/239 (5%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+N E+C + ++ + + + + Y+D ER+L N V+ G+ + + L +
Sbjct: 353 DTNYSETCASVGLVFFAHRMLQIDPDRQYSDVMERALYNTVISGMSLDGKKFFYVNPLEV 412
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + ++ + W CC + LG IY K V++ Y+
Sbjct: 413 WPEACEKNKVKSHVKYTRQPWFGCACCPPNIARLLTSLGKYIY---SKKAKEVFVHLYVD 469
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S L K + VN K WD ++ + SK T L++RIP W K
Sbjct: 470 SELKEKISESEVNIKQSTQYPWDE--KIIIDIDSKKETEFT-LSIRIPGWCKEAKVKVNN 526
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDD-KLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
N DL + + + W D ++ + +P+ +R +A + R + + AI GP V
Sbjct: 527 NEIDLDSVMEKGYAKINRRWKHDSLEIYLSMPV-MRIKANPNVREDEGKV-AIQRGPIV 583
>gi|218195658|gb|EEC78085.1| hypothetical protein OsI_17564 [Oryza sativa Indica Group]
Length = 640
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/240 (22%), Positives = 94/240 (39%), Gaps = 17/240 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 320 DTVYAESCASIGLMMFARRMLEMEADSHYADVMERALYNTVLG-GMALDGKHFFYVNPLE 378
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ-Y 513
P + + P W CC + LG IY P +I Y
Sbjct: 379 VHPKTLAFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTVR----PDALLINLY 434
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ + + + + + ++ W + + +T +T +L LR+P W +
Sbjct: 435 VGNDVAIQIDENTLRLRISGNYPWQDQVTIEITSPVP---VTHTLALRLPDWCAE--PAV 489
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+LNG+ + +L + + W D LT+ LP+ +R + A A+ GP V
Sbjct: 490 SLNGERVTGEVVRGYLYLNRCWHEGDTLTLTLPMPVRRVYGNPQVRQQAGKVALQRGPLV 549
>gi|322831792|ref|YP_004211819.1| hypothetical protein Rahaq_1069 [Rahnella sp. Y9602]
gi|321166993|gb|ADW72692.1| protein of unknown function DUF1680 [Rahnella sp. Y9602]
Length = 657
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + + YAD ER+L N VL G+ + + L +
Sbjct: 334 DTAYTETCASIGLMMFANRMLQMDSDSRYADVMERALYNTVLAGMALDGKHFFYVNPLEV 393
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P S + P W CC + LG IY + GV I YI
Sbjct: 394 HPKSIPFNHIYDHVKPVRQRWFGCACCPPNIARLLASLGHYIYTQRPD---GVDINLYIG 450
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S ++ G + K W + + + L +L LR+P W S + TL
Sbjct: 451 SDVEATIGGKALRLKQSGGYPWAEGVLIEIDTDQP---LEATLALRLPDWCVS--PQVTL 505
Query: 576 NGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTL 609
NG L L S +L +T+ W D++ + LP+ +
Sbjct: 506 NGNPLELTSLTQRGYLRLTQEWQKGDRIEMMLPMPV 541
>gi|401761699|ref|YP_006576706.1| hypothetical protein ECENHK_00925 [Enterobacter cloacae subsp.
cloacae ENHKU01]
gi|400173233|gb|AFP68082.1| hypothetical protein ECENHK_00925 [Enterobacter cloacae subsp.
cloacae ENHKU01]
Length = 649
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/240 (22%), Positives = 94/240 (39%), Gaps = 17/240 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMEADSHYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ-Y 513
P + + P W CC + LG IY P +I Y
Sbjct: 388 VHPKTLAFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTVR----PDALLINLY 443
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ + + + + + ++ W + + +T +T +L LR+P W +
Sbjct: 444 VGNDVAIQIDENTLRLRISGNYPWQDQVTIEITSPVP---VTHTLALRLPDWCAE--PAV 498
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+LNG+ + +L + + W D LT+ LP+ +R + A A+ GP V
Sbjct: 499 SLNGERVTGEVVRGYLYLNRCWHEGDTLTLTLPMPVRRVYGNPQVRQQAGKVALQRGPLV 558
>gi|384534128|ref|YP_005716792.1| hypothetical protein [Sinorhizobium meliloti BL225C]
gi|433610342|ref|YP_007193803.1| hypothetical protein C770_GR4pD0078 [Sinorhizobium meliloti GR4]
gi|333816304|gb|AEG08971.1| protein of unknown function DUF1680 [Sinorhizobium meliloti BL225C]
gi|429555284|gb|AGA10204.1| hypothetical protein C770_GR4pD0078 [Sinorhizobium meliloti GR4]
Length = 640
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/243 (24%), Positives = 102/243 (41%), Gaps = 32/243 (13%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI------ 454
D+ E+C + ++ + + + YAD E++L NG L PG+ I
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFF 381
Query: 455 YLLPLAPGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y PL R +HH P CC + +G +Y E + V++
Sbjct: 382 YDNPLESTGRHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGE 433
Query: 514 ISSRLDWKSG-QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
++RL SG ++ + Q+ + W+ + T +L+LRIP W + GA
Sbjct: 434 STARLKLASGAEVELRQETN--YPWEGAIAFTTKLDRPAK---FALSLRIPEWAA--GAT 486
Query: 573 ATLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
++NG L L + G + + + WS D++ + LPL LR + + A++ G
Sbjct: 487 LSVNGTMLDLSAHLTGGYARIEREWSDGDRVALYLPLALRPQYANPKVRQDVGRVALMRG 546
Query: 631 PYV 633
P V
Sbjct: 547 PLV 549
>gi|56962984|ref|YP_174711.1| hypothetical protein ABC1212 [Bacillus clausii KSM-K16]
gi|56909223|dbj|BAD63750.1| conserved hypothetical protein [Bacillus clausii KSM-K16]
Length = 641
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 97/430 (22%), Positives = 159/430 (36%), Gaps = 57/430 (13%)
Query: 308 VLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLALQADDISGF-------HSNTHIPI 355
L KL+ + D ++L LA F +P F A + + F +S +H+P+
Sbjct: 190 ALLKLYRVKGDRRYLRLAQFFIEERGKEPHFFDDEAKKRGEDGTFWYSGRYEYSQSHLPV 249
Query: 356 -----VIGSQMRY------------EVTGDQLHKEGHQLESSGTN--------IGHFNFK 390
G +R E +QL K L + TN IG F
Sbjct: 250 RQQQEATGHAVRAVYMYTAMADLANETDDEQLAKVCRTLWDNVTNQQMYITGGIGSAEF- 308
Query: 391 SDPKRLASNL--DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG 447
+ A +L D E+C + ++ ++++ + Y D ER+L NG + GIQ
Sbjct: 309 GEAFTFAYDLPNDLAYTETCASIGLVFWAKNMLELEADSRYGDVMERALYNGTISGIQLD 368
Query: 448 TEPGVMIYLLPLAPGSSKER-SYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEG 503
+ L + P ++K R H T ++ CC + +G IY
Sbjct: 369 GTKFFYVNPLEVWPQAAKHRHDLKHVKTERQPWFGCACCPPNIARLLASIGQYIY---TT 425
Query: 504 KYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIP 563
K +I YI + G V K+ W V L + S T L RIP
Sbjct: 426 KNQTGFIHLYIGNESTLTIGSGEVGLKMKSSFPWKG--EVGLEVNPDTSRPFT-LAFRIP 482
Query: 564 TWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYAS 623
+W +N + T+NG + + + V +TW D ++IQ PL + + A
Sbjct: 483 SW--ANDYQLTVNGHFVDVEVRDGYAYVERTWQKGDHISIQFPLETKVIYAHPEVRANAG 540
Query: 624 IQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLI--TFTQEYGNTKFVLT 681
A+ GP V + +S I AS+++ + E + V
Sbjct: 541 KIALQRGPIVFCAEEADNGSNLQSVAIRCQ--ENIDASFDTDRLNGVIVLEGKGVRTVTA 598
Query: 682 NSNQSITMEK 691
N+N S+ + K
Sbjct: 599 NANGSLYLAK 608
>gi|418401306|ref|ZP_12974836.1| hypothetical protein SM0020_14414 [Sinorhizobium meliloti
CCNWSX0020]
gi|359504683|gb|EHK77215.1| hypothetical protein SM0020_14414 [Sinorhizobium meliloti
CCNWSX0020]
Length = 640
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/243 (24%), Positives = 101/243 (41%), Gaps = 32/243 (13%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI------ 454
D+ E+C + ++ + + + YAD E++L NG L PG+ I
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFF 381
Query: 455 YLLPLAPGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y PL R +HH P CC + +G +Y E + V++
Sbjct: 382 YDNPLESTGRHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGE 433
Query: 514 ISSRLDWKSG-QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
++RL SG ++ + Q+ + W+ + T L+LRIP W + GA
Sbjct: 434 STARLKLASGAEVELRQETN--YPWEGAIAFTTKLDRPAK---FELSLRIPEWAA--GAT 486
Query: 573 ATLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
++NG L L + G + + + WS D++ + LPL LR + + A++ G
Sbjct: 487 LSVNGTMLDLSAHLTGGYARIEREWSDGDRVALYLPLALRPQYANPKVRQDVGRVALMRG 546
Query: 631 PYV 633
P V
Sbjct: 547 PLV 549
>gi|150376304|ref|YP_001312900.1| hypothetical protein Smed_4162 [Sinorhizobium medicae WSM419]
gi|150030851|gb|ABR62967.1| protein of unknown function DUF1680 [Sinorhizobium medicae WSM419]
Length = 640
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 102/248 (41%), Gaps = 32/248 (12%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI------ 454
D+ E+C + ++ + + + YAD E++L NG L PG+ I
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGKTFF 381
Query: 455 YLLPLAPGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y PL R +HH P CC + +G +Y E + V++
Sbjct: 382 YDNPLESTGRHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGE 433
Query: 514 ISSRLDWKSG-QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
++RL +G ++ + Q + WD + T +L+LRIP W + GA
Sbjct: 434 SAARLKLANGAEVELRQATN--YPWDGAIAFTARLDRPAR---FALSLRIPEWAA--GAT 486
Query: 573 ATLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
++NG L L + + + + WS D++ + LPLTLR + + A++ G
Sbjct: 487 LSVNGSMLDLSAHLADGYARIEREWSDGDRVALYLPLTLRPQYANPKVRQDVGRVALMRG 546
Query: 631 PYVLAGHS 638
P V +
Sbjct: 547 PLVYCAEA 554
>gi|431797074|ref|YP_007223978.1| hypothetical protein Echvi_1703 [Echinicola vietnamensis DSM 17526]
gi|430787839|gb|AGA77968.1| hypothetical protein Echvi_1703 [Echinicola vietnamensis DSM 17526]
Length = 679
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 127/287 (44%), Gaps = 40/287 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLPLAPGS 463
E+C + + + + T E Y D E +L N +L GI +GTE Y PL+ +
Sbjct: 361 ETCANIGNVLWNWRMLQLTGEAKYMDVIELNLYNSILSGISLQGTE---FFYTNPLS--A 415
Query: 464 SKERSYH-HWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
K+ YH W + + CC + +++ + Y E G+Y+ Y S++L
Sbjct: 416 KKDLPYHLRWPNTREGYIALSNCCPPNVARTLAEVANYAYSTTE---DGLYVNLYGSNKL 472
Query: 519 D--WKSGQ-IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
GQ +++NQ WD + + + + K S+ LRIP W A T+
Sbjct: 473 QTTLADGQELLINQSTS--YPWDETISLDIEKAPKDD---YSVFLRIPGWCHE--ASVTV 525
Query: 576 NGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
NG++ + + G ++ + ++W D++T+ L + ++ + A+ GP V
Sbjct: 526 NGEEQHMDLAAGQYVEINRSWKKGDQVTLTLAMPVQYLEANPLVEQARGQVAVKRGPVVY 585
Query: 635 --------AGHSIGDWDITESATSLSDWITPIPASY-NSQLITFTQE 672
AG S+ D I +LS+ ++P + NS+LI+ T E
Sbjct: 586 CVESMDLPAGKSVDDVVI-----ALSEELSPEAFTIGNSELISLTGE 627
>gi|424916536|ref|ZP_18339900.1| hypothetical protein Rleg9DRAFT_4102 [Rhizobium leguminosarum bv.
trifolii WSM597]
gi|392852712|gb|EJB05233.1| hypothetical protein Rleg9DRAFT_4102 [Rhizobium leguminosarum bv.
trifolii WSM597]
Length = 640
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 104/242 (42%), Gaps = 30/242 (12%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + YAD E++L NG L G+ T+ Y PL
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALPGLS--TDGKTFFYDNPL 386
Query: 460 -APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
+ G +HH P CC + +G +Y + + V++ ++RL
Sbjct: 387 ESVGKHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVADDEI-AVHLYGESTARL 438
Query: 519 DWKSGQIV-----VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+G V N D V++ L+ F+ L+LRIP W + GA
Sbjct: 439 KLANGADVELEQTTNYPWDGAVAFTTRLKTPAKFA---------LSLRIPDW--AEGATL 487
Query: 574 TLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
++NG+ L L + + + + W+ D++ + LPL+LR + + A A++ GP
Sbjct: 488 SVNGEMLDLAANIRDGYARIDRQWADGDRVALSLPLSLRPQYANPKVRQDAGRVALMRGP 547
Query: 632 YV 633
V
Sbjct: 548 LV 549
>gi|423345501|ref|ZP_17323190.1| hypothetical protein HMPREF1060_00862 [Parabacteroides merdae
CL03T12C32]
gi|409223287|gb|EKN16224.1| hypothetical protein HMPREF1060_00862 [Parabacteroides merdae
CL03T12C32]
Length = 625
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 65/274 (23%), Positives = 106/274 (38%), Gaps = 47/274 (17%)
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
+T E+C T+ +++ L + T YADY E ++ N ++ + + Y
Sbjct: 318 HTMETCVTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMASLKADASQIAKY------- 370
Query: 463 SSKERSYHHWGTPSDSFW--CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW 520
S + H G CC G +F+ + Y ++ V + Y S +
Sbjct: 371 -SPLEGWRHEGEEQCGMHINCCNANGPRAFAMIPQFAYQVQDD---CVRVNFYAPSEAEL 426
Query: 521 --------KSGQIVVNQKVDPV-VSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+ Q + D + + DP T + LRIP W S A
Sbjct: 427 VLPDKKPVRLKQTTDYPRTDQIEIEVDPAKETAFTIA-----------LRIPAW--SKIA 473
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
++NGQ G +L V + W D++T++L L R E QAI+ GP
Sbjct: 474 VVSVNGQPQDGVLQGAYLPVNRKWKKGDRITVKLDLRARLV-------ERNQAQAIVRGP 526
Query: 632 YVLAGHS-IGDWDITESATSLSD----WITPIPA 660
VLA S GD + E++ +S +TP+ A
Sbjct: 527 IVLARDSRFGDGFVDEASVVVSKDGYVELTPVKA 560
>gi|110807746|ref|YP_691266.1| hypothetical protein SFV_3953 [Shigella flexneri 5 str. 8401]
gi|418259896|ref|ZP_12882543.1| hypothetical protein SF660363_4453 [Shigella flexneri 6603-63]
gi|424840119|ref|ZP_18264756.1| hypothetical protein SF5M90T_3865 [Shigella flexneri 5a str. M90T]
gi|110617294|gb|ABF05961.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
gi|383469171|gb|EID64192.1| hypothetical protein SF5M90T_3865 [Shigella flexneri 5a str. M90T]
gi|397894067|gb|EJL10519.1| hypothetical protein SF660363_4453 [Shigella flexneri 6603-63]
Length = 659
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ES + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESYASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|420368547|ref|ZP_14869294.1| hypothetical protein SF123566_9752 [Shigella flexneri 1235-66]
gi|391322141|gb|EIQ78842.1| hypothetical protein SF123566_9752 [Shigella flexneri 1235-66]
Length = 659
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 87/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ES + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESYASIGLMMFARRMLEMEGDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S K + P W CC + +G +Y E +YI Y
Sbjct: 388 VHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIARVLTSIGHYLYTPRED---ALYINIYA 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ ++ + +V W +VT+ S + +L LR+P W + +
Sbjct: 445 GNSMEVPVENGTLRLRVSGNYPWQE--QVTIAVESP-QPVRHTLALRLPDWCTQ--PQII 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
LNG+++ +L +T+ W D L + LP+ +R
Sbjct: 500 LNGEEVEQDIRKGYLHITREWQEGDTLNLTLPMPVR 535
>gi|354581746|ref|ZP_09000649.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
gi|353200363|gb|EHB65823.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
Length = 657
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 66/264 (25%), Positives = 104/264 (39%), Gaps = 22/264 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG--IQRGTEPGVMIYLLP 458
D+ E+C + ++ +R + + + +AD ER+L N V+G Q GT Y+ P
Sbjct: 331 DTVYAETCASIGLIFFARRMLELSPKSEFADVMERALYNTVIGSMAQDGTH---FFYVNP 387
Query: 459 LA--PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGK-YPGVYII 511
L P + + H P W CC + LG+ +Y E + +YI
Sbjct: 388 LEVWPDACRHNPGKHHVKPVRPGWFACACCPPNVARLLTSLGEYVYTSNEDTLFAHLYIG 447
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+ L + + V Q + + W VT T S + T L LRIP W A
Sbjct: 448 GEAAVSL--RGNAVKVKQTSE--LPWSG--NVTFTIESPQTAEWT-LALRIPGWCRGQ-A 499
Query: 572 KATLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
+NG++L + +T+ W+S D L + L L + A AI
Sbjct: 500 VIRVNGEELKASGLIREGYAYITRAWASGDTLELALSLDILQVRAHPLVRANAGKAAIQR 559
Query: 630 GPYVLAGHSIGDWDITESATSLSD 653
GP V SI + + T +D
Sbjct: 560 GPLVYCWESIDNGAPISAVTLAAD 583
>gi|433654337|ref|YP_007298045.1| hypothetical protein Thethe_00658 [Thermoanaerobacterium
thermosaccharolyticum M0795]
gi|433292526|gb|AGB18348.1| hypothetical protein Thethe_00658 [Thermoanaerobacterium
thermosaccharolyticum M0795]
Length = 647
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 64/311 (20%), Positives = 125/311 (40%), Gaps = 36/311 (11%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+N E+C + ++ + + + + Y+D ER+L N V+ G+ + + L +
Sbjct: 327 DTNYSETCASVGLVFFAHRMLQIDPDRQYSDVMERALYNTVISGMSLDGKKFFYVNPLEV 386
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + ++ + W CC + LG IY K +++ Y+
Sbjct: 387 WPEACEKNKVKSHVKYTRQPWFGCACCPPNIARLLTSLGKYIY---SKKNKEIFVHLYVD 443
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S L K + VN K WD + + + + +L+LRIP W AK +
Sbjct: 444 SELKEKISESQVNIKQSTQYPWDEKIDIEVDCEEETE---FTLSLRIPGWCKE--AKIKI 498
Query: 576 NGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPL-TLRTEAIQDDRPEYASIQAILYGPY 632
N +++ L S + + + W DK+ I + +R +A + R + + AI GP
Sbjct: 499 NNEEIDLNSVMAKGYAKINRIWKH-DKIEIYFSMPVMRIKANPNVREDEGKV-AIQRGPI 556
Query: 633 VLAGHSIGDWDITESATSLSDWITPIPASYN------------SQLITFTQEYGNTKFVL 680
V I ++ +L++ + P + + + + F ++Y N L
Sbjct: 557 VYCLEEI------DNGKNLNNIVLPTDSKFEIKTDKDLNNVCVIETVAFREKYENWNDEL 610
Query: 681 TNSNQSITMEK 691
S+ ++ EK
Sbjct: 611 YKSDVKVSYEK 621
>gi|261341800|ref|ZP_05969658.1| hypothetical protein ENTCAN_08284 [Enterobacter cancerogenus ATCC
35316]
gi|288316173|gb|EFC55111.1| putative cytoplasmic protein [Enterobacter cancerogenus ATCC 35316]
Length = 651
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 53/239 (22%), Positives = 94/239 (39%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 329 DTVYAESCASIGLMMFARRMLEMETDSQYADVMERALYNTVLG-GMALDGKHFFYVNPLE 387
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P + + P W CC + LG IY ++I Y+
Sbjct: 388 VHPKTLAFNHIYDHVKPVRQRWFGCACCPPNIARVLTSLGHYIYTLHPET---LFINLYV 444
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ + G + ++ W + + + + +T +L LR+P W + + +
Sbjct: 445 GNDIAVPVGDQQLQLRISGNYPWHEQVNIEI---ASPVPVTHTLALRLPDWCEN--PEVS 499
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG + +L + ++W D LT+ LP+ +R + A A+ GP V
Sbjct: 500 LNGAAVTGEVSRGYLYLRRSWQEGDVLTLTLPMPVRRVYGNPQVRQQAGKVALQRGPLV 558
>gi|386724368|ref|YP_006190694.1| hypothetical protein B2K_19810, partial [Paenibacillus
mucilaginosus K02]
gi|384091493|gb|AFH62929.1| hypothetical protein B2K_19810 [Paenibacillus mucilaginosus K02]
Length = 380
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 88/215 (40%), Gaps = 21/215 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG--IQRGTEPGVMIYLLPLA--P 461
E+C + ++ +R + R + YAD ER+L V+G GT Y+ PL P
Sbjct: 58 ETCASVGLIFFARRMLRLHRNSRYADVLERALYKTVIGGLSLDGTR---FFYVNPLEVYP 114
Query: 462 GS-SKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
K ++Y H ++ CC + LG+ IY EE VY+ YI R
Sbjct: 115 DVLGKNKNYSHIKAQRQGWFSCACCPPNAARLLASLGEYIYTAEEDT---VYVELYIGGR 171
Query: 518 LDWK-SGQIV-VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
++ GQ+V ++Q+ D + +T S + +L LR P+W+ K
Sbjct: 172 VEIPLGGQVVGIDQQSDYTAEGTTRIEIT-----AASSVRFTLALRFPSWSDHAVVKTGD 226
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
Q+ ++ V W+ + I + +R
Sbjct: 227 QVQEYLHGDEDGYIRVEGEWAGTKTVEISFSMPVR 261
>gi|365837320|ref|ZP_09378689.1| hypothetical protein HMPREF0454_03564 [Hafnia alvei ATCC 51873]
gi|364562052|gb|EHM39922.1| hypothetical protein HMPREF0454_03564 [Hafnia alvei ATCC 51873]
Length = 665
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 91/239 (38%), Gaps = 18/239 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ + + + + YAD ER+L N VLG + Y+ PL
Sbjct: 347 DTVYAESCASIGLMMFANRMLQMEGDSQYADVMERALYNTVLG-GMALDGRHFFYVNPLE 405
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S + P W CC + +G IY + + +YI Y+
Sbjct: 406 VHPKSIPFNHIYDHVKPIRQRWFGCACCPPNIARILTSIGHYIYTQ---RSDALYINLYV 462
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ +G + P WD + V + L +L LR+P W +
Sbjct: 463 GNETHLDNGLKIAISGNYP---WDENVSVHIRTEKP---LHQTLALRMPEWCEKPSVQ-- 514
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+ +L +T+ W D+L I LP+ +R A AI GP V
Sbjct: 515 LNGKTCEGLLKRGYLHITREWHDGDRLEIVLPMPVRRVYGNPLLRHVAGKVAIQRGPLV 573
>gi|89067251|ref|ZP_01154764.1| hypothetical protein OG2516_10441 [Oceanicola granulosus HTCC2516]
gi|89046820|gb|EAR52874.1| hypothetical protein OG2516_10441 [Oceanicola granulosus HTCC2516]
Length = 633
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 56/238 (23%), Positives = 99/238 (41%), Gaps = 25/238 (10%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + M+ + + + YAD E +L N L G+ R E L
Sbjct: 327 DTAYAETCASVAMVFWAARMLNLDLDGQYADILELALYNNALAGLSRDGEHYFYDNKL-- 384
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
+ S+H W W CC + + Y E + V++ +
Sbjct: 385 ----ESDGSHHRWA------WHECPCCTMNVSRLVASVAGYFYGVAETEI-AVHLYGGAT 433
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
+ L G++ + + D WD +R+ L +G+ T +L+LR+P W +GA A++
Sbjct: 434 ATLPVAGGRVTLTETSD--YPWDGAVRIAL--EPEGT-RTFTLSLRVPGW--CHGATASV 486
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
NG+ L + +L +T+ W+ D + + LP+ D + A A+ GP V
Sbjct: 487 NGEALEVAPERGYLKITRDWAPGDVVELNLPMQAERLYAHPDVRQDAGRVALRRGPLV 544
>gi|340346785|ref|ZP_08669904.1| hypothetical protein HMPREF9136_0902 [Prevotella dentalis DSM 3688]
gi|433652020|ref|YP_007278399.1| hypothetical protein Prede_1032 [Prevotella dentalis DSM 3688]
gi|339611002|gb|EGQ15842.1| hypothetical protein HMPREF9136_0902 [Prevotella dentalis DSM 3688]
gi|433302553|gb|AGB28369.1| hypothetical protein Prede_1032 [Prevotella dentalis DSM 3688]
Length = 663
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 105/470 (22%), Positives = 182/470 (38%), Gaps = 76/470 (16%)
Query: 183 GHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVW 242
G +L ++ L + ++ L +K V+ + Q+ GYL A + + + I
Sbjct: 89 GKWLESAYLSAIQSGDKELLDKAKKVLHRIIGSQES--DGYLGA-TAKSYRSPQRPIRGM 145
Query: 243 APYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFY--------------------NRV 282
PY ++ + Y + EAL+ + EYF NR
Sbjct: 146 DPY-ELYFVFHAFETIYEETGDKEALKAVEKLAEYFLTYFGPGKLEFWPSKTLRAPENRH 204
Query: 283 QNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHL----------FDKPC 332
Q + + H + E + D + +L+ IT ++L A +D
Sbjct: 205 QTLNGQSDFAGHSVHYSWEGTLLCDPIARLYTITGKKRYLDWAKWVVGNIDKWSGWDAFS 264
Query: 333 FLGLLA---LQADDISGF-HSNTHIPIVIGSQMRYEVTGDQ--LHK-EG--------HQL 377
L +A L D + + H++T +G Y++TGD+ L K EG
Sbjct: 265 RLDSIADGKLGVDQLQPYVHAHTFQMNFMGFLRLYQITGDRSLLRKVEGAWNDIYRRQMY 324
Query: 378 ESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSL 437
+ G ++ K K L+ N+ E+C T + +++++ L T + YAD E+ +
Sbjct: 325 ITGGVSVAEHYEKGYVKPLSGNI----IETCATMSWMQLTQMLLELTGDTKYADAIEKIM 380
Query: 438 TNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSI 497
N V Q G Y AP K Y H P CC +G S L +
Sbjct: 381 LNHVFAAQDALS-GTCRY--HTAPNGFKPDGYFH--GPD----CCTASGHRIISLL-PTF 430
Query: 498 YFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTS 557
++ E+GK YI Q + + +++ I N + VS + V +K
Sbjct: 431 FYAEKGK--SFYINQLLPA--NYRGKAIDFNISGNYPVSDSVVIDVNRMQGNK------- 479
Query: 558 LNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPL 607
L +R+P W + T+NG+ + G + V K WS D++ + LP+
Sbjct: 480 LFIRVPAW--CDNPSITVNGKPQGNVAAGKYYVVNKKWSKGDRIVMHLPM 527
>gi|423214778|ref|ZP_17201306.1| hypothetical protein HMPREF1074_02838 [Bacteroides xylanisolvens
CL03T12C04]
gi|423294029|ref|ZP_17272156.1| hypothetical protein HMPREF1070_00821 [Bacteroides ovatus
CL03T12C18]
gi|392676837|gb|EIY70260.1| hypothetical protein HMPREF1070_00821 [Bacteroides ovatus
CL03T12C18]
gi|392692684|gb|EIY85921.1| hypothetical protein HMPREF1074_02838 [Bacteroides xylanisolvens
CL03T12C04]
Length = 621
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 57/252 (22%), Positives = 106/252 (42%), Gaps = 31/252 (12%)
Query: 404 TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL---- 459
T E+C T+ +++ L T YA+ +E ++ N ++ + + Y PL
Sbjct: 312 TMETCVTFTYMQLCHRLLCKTGNSFYAEEFEHTMYNALMATMKNDGSQISKYS-PLEGRR 370
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKL-GDSIYFEEEGKYPGVYIIQYISSRL 518
PG +E+ H CC G F+ + + ++ Y +Y+ + L
Sbjct: 371 QPG--EEQCGMHIN-------CCNANGPRGFALIPKTACTIKDNHIYLNLYLPLQATISL 421
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+ K ++ +N + D + + + + K +L LRIPT KA +NG+
Sbjct: 422 N-KKNKVHLNVESDYPIHGKVNVNIGVQKKEK-----FTLALRIPTQIEK--MKAYINGE 473
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
+ + G +L + + W + DK+T+ + + + + QAI+ GP + A S
Sbjct: 474 EQEITHKGGYLYIERIWENADKVTLDFKIETKVVKLNNS-------QAIVRGPLLFARDS 526
Query: 639 -IGDWDITESAT 649
D DI E AT
Sbjct: 527 RFNDGDIDECAT 538
>gi|160882339|ref|ZP_02063342.1| hypothetical protein BACOVA_00287 [Bacteroides ovatus ATCC 8483]
gi|156112253|gb|EDO13998.1| hypothetical protein BACOVA_00287 [Bacteroides ovatus ATCC 8483]
Length = 698
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL- 575
WK G++ + Q+ D W+ +RVTL + +G SL LRIP W KATL
Sbjct: 494 -TWKDKGELALTQETD--YPWEGKVRVTLDRVPRKAG-AFSLFLRIPEWCE----KATLA 545
Query: 576 -NGQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
NGQ L + N + V +TW D +L + +P+ L
Sbjct: 546 VNGQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|237720781|ref|ZP_04551262.1| six-hairpin glycosidase [Bacteroides sp. 2_2_4]
gi|229449616|gb|EEO55407.1| six-hairpin glycosidase [Bacteroides sp. 2_2_4]
Length = 698
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 60/216 (27%), Positives = 91/216 (42%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YAD E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYADLVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
WK G++ + Q+ D W+ +RVTL + +G SL LRIP W T+N
Sbjct: 494 -TWKDKGELTLTQETD--YPWEGKVRVTLDRVPRKAG-AFSLFLRIPEWCEK--TTLTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
GQ L + N + V +TW D +L + +P+ L
Sbjct: 548 GQPLQTNAKANSYAEVNRTWKKGDVVELVMDMPVRL 583
>gi|383111125|ref|ZP_09931943.1| hypothetical protein BSGG_2229 [Bacteroides sp. D2]
gi|313694694|gb|EFS31529.1| hypothetical protein BSGG_2229 [Bacteroides sp. D2]
Length = 621
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 57/252 (22%), Positives = 106/252 (42%), Gaps = 31/252 (12%)
Query: 404 TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL---- 459
T E+C T+ +++ L T YA+ +E ++ N ++ + + Y PL
Sbjct: 312 TMETCVTFTYMQLCHRLLCKTGNSFYAEEFEHTMYNALMATMKNDGSQISKYS-PLEGRR 370
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKL-GDSIYFEEEGKYPGVYIIQYISSRL 518
PG +E+ H CC G F+ + + ++ Y +Y+ + L
Sbjct: 371 QPG--EEQCGMHIN-------CCNANGPRGFALIPKTACTIKDNHIYLNLYLPLQATISL 421
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+ K ++ +N + D + + + + K +L LRIPT KA +NG+
Sbjct: 422 N-KKNKVHLNVESDYPIHGKVNVNIGVQKKEK-----FTLALRIPTQIEK--MKAYINGE 473
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
+ + G +L + + W + DK+T+ + + + + QAI+ GP + A S
Sbjct: 474 EQEITHKGGYLYIERIWENADKVTLDFKIETKVVKLNNS-------QAIVRGPLLFARDS 526
Query: 639 -IGDWDITESAT 649
D DI E AT
Sbjct: 527 RFNDGDIDECAT 538
>gi|336417454|ref|ZP_08597777.1| hypothetical protein HMPREF1017_04885 [Bacteroides ovatus
3_8_47FAA]
gi|335935949|gb|EGM97896.1| hypothetical protein HMPREF1017_04885 [Bacteroides ovatus
3_8_47FAA]
Length = 621
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 57/252 (22%), Positives = 106/252 (42%), Gaps = 31/252 (12%)
Query: 404 TEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL---- 459
T E+C T+ +++ L T YA+ +E ++ N ++ + + Y PL
Sbjct: 312 TMETCVTFTYMQLCHRLLCKTGNSFYAEEFEHTMYNALMATMKNDGSQISKYS-PLEGRR 370
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKL-GDSIYFEEEGKYPGVYIIQYISSRL 518
PG +E+ H CC G F+ + + ++ Y +Y+ + L
Sbjct: 371 QPG--EEQCGMHIN-------CCNANGPRGFALIPKTACTIKDNHIYLNLYLPLQATISL 421
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+ K ++ +N + D + + + + K +L LRIPT KA +NG+
Sbjct: 422 N-KKNKVHLNVESDYPIHGKVNVNIGVQKKEK-----FTLALRIPTQIEK--MKAYINGE 473
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHS 638
+ + G +L + + W + DK+T+ + + + + QAI+ GP + A S
Sbjct: 474 EQEITHKGGYLYIERIWENADKVTLDFKIETKVVKLNNS-------QAIVRGPLLFARDS 526
Query: 639 -IGDWDITESAT 649
D DI E AT
Sbjct: 527 RFNDGDIDECAT 538
>gi|116254107|ref|YP_769945.1| hypothetical protein RL4374 [Rhizobium leguminosarum bv. viciae
3841]
gi|115258755|emb|CAK09861.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 640
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/238 (23%), Positives = 103/238 (43%), Gaps = 22/238 (9%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + YAD E++L NG L G+ T+ Y PL
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALPGLS--TDGKTFFYDNPL 386
Query: 460 APGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
R +HH P CC + +G +Y + + V++ ++RL
Sbjct: 387 ESAGKHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVSDNEI-AVHLYGESTARL 438
Query: 519 DWKSG-QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+G ++ + Q + W+ + T +L+LRIP W + GA ++NG
Sbjct: 439 KLANGAEVELEQTTN--YPWEGAVAFTTRLEKPAK---FALSLRIPDW--AEGATLSVNG 491
Query: 578 Q--DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+ DL ++ + + W++ D++ + LPL LR + + A A++ GP V
Sbjct: 492 EMLDLNANMRDGYIRIDREWAAGDRVALYLPLALRPQYANPKVRQDAGRVALMRGPLV 549
>gi|29346413|ref|NP_809916.1| hypothetical protein BT_1003 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29338309|gb|AAO76110.1| Six-hairpin glycosidase [Bacteroides thetaiotaomicron VPI-5482]
Length = 698
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 58/216 (26%), Positives = 93/216 (43%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YA+ E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYAELVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+WK G++ + Q+ D W+ +RVTL + +G SL RIP W A T+N
Sbjct: 494 -NWKDKGELALVQETD--YPWEGNVRVTLNKVPRKAG-AFSLFFRIPEWCGK--AALTVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
GQ + + + N + V +TW D +L + +P+ L
Sbjct: 548 GQPVSMNAKANTYAEVNRTWKKGDVVELVMDMPVCL 583
>gi|23465020|ref|NP_695623.1| hypothetical protein BL0422 [Bifidobacterium longum NCC2705]
gi|23325624|gb|AAN24259.1| narrowly conserved hypothetical protein [Bifidobacterium longum
NCC2705]
gi|291517556|emb|CBK71172.1| Uncharacterized protein conserved in bacteria [Bifidobacterium
longum subsp. longum F8]
Length = 658
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 57/248 (22%), Positives = 104/248 (41%), Gaps = 19/248 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG-VLGIQRGTEPGVMIYLLPL 459
D+ E+C + M ++ + + YAD E+ L NG + GI + + L
Sbjct: 333 DTMYGETCASVAMSMFAQQMLDLEPKGEYADVLEKELFNGSIAGISLDGKQYYYVNALET 392
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P HH + ++ CC + + IY E +G V Q+I++
Sbjct: 393 TPDGLDNPDRHHVLSHRVDWFGCACCPANIARLIASVDRYIYTERDGG-KTVLSHQFIAN 451
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+ D+ SG + V Q+ D WD ++ T++ + + + LRIP W S T+N
Sbjct: 452 KADFASG-LTVEQRSD--FPWDSHVEYTVSLPASAADSSVRFGLRIPGW-SLGSYTLTVN 507
Query: 577 GQDLPLPSPGNFLS--VTKTWSSDDKLTIQLPLTLRTEAIQDD---RPEYASIQAILYGP 631
G+ P+ G+ V ++ D L I L L + + ++ + R + + A++ GP
Sbjct: 508 GK----PAVGSLEDGFVYLVVNAGDTLEIALELDMSVKFVRANSRVRSDAGQV-AVMRGP 562
Query: 632 YVLAGHSI 639
V +
Sbjct: 563 LVYCAEQV 570
>gi|239622627|ref|ZP_04665658.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis CCUG 52486]
gi|322688383|ref|YP_004208117.1| hypothetical protein BLIF_0192 [Bifidobacterium longum subsp.
infantis 157F]
gi|239514624|gb|EEQ54491.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis CCUG 52486]
gi|320459719|dbj|BAJ70339.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis 157F]
Length = 658
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 106/250 (42%), Gaps = 23/250 (9%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG-VLGIQRGTEPGVMIYLLPL 459
D+ E+C + M ++ + + YAD E+ L NG + GI + + L
Sbjct: 333 DTMYGETCASVAMSMFAQQMLDLEPKGEYADVLEKELFNGSIAGISLDGKQYYYVNALET 392
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYII--QYI 514
P HH + ++ CC + + IY E +G G ++ Q+I
Sbjct: 393 TPDGLDNPDRHHVLSHRVDWFGCACCPANIARLIASVDRYIYTERDG---GKIVLSHQFI 449
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+++ D+ SG + V Q+ D WD ++ T++ + + + LRIP W S T
Sbjct: 450 ANKADFASG-LTVEQRSD--FPWDSHVEYTVSLPASAADSSVRFGLRIPGW-SLGSYTLT 505
Query: 575 LNGQDLPLPSPGNFLS--VTKTWSSDDKLTIQLPLTLRTEAIQDD---RPEYASIQAILY 629
+NG+ P+ G+ V ++ D L I L L + + ++ + R + + A++
Sbjct: 506 VNGK----PAVGSLEDGFVYLVVNAGDTLEIALELDMSVKFVRANSRVRSDAGQV-AVMR 560
Query: 630 GPYVLAGHSI 639
GP V +
Sbjct: 561 GPLVYCAEQV 570
>gi|227545698|ref|ZP_03975747.1| protein of hypothetical function DUF1680 [Bifidobacterium longum
subsp. longum ATCC 55813]
gi|227213814|gb|EEI81653.1| protein of hypothetical function DUF1680 [Bifidobacterium longum
subsp. infantis ATCC 55813]
Length = 668
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 57/248 (22%), Positives = 104/248 (41%), Gaps = 19/248 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG-VLGIQRGTEPGVMIYLLPL 459
D+ E+C + M ++ + + YAD E+ L NG + GI + + L
Sbjct: 343 DTMYGETCASVAMSMFAQQMLDLEPKGEYADVLEKELFNGSIAGISLDGKQYYYVNALET 402
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P HH + ++ CC + + IY E +G V Q+I++
Sbjct: 403 TPDGLDNPDRHHVLSHRVDWFGCACCPANIARLIASVDRYIYTERDGG-KTVLSHQFIAN 461
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+ D+ SG + V Q+ D WD ++ T++ + + + LRIP W S T+N
Sbjct: 462 KADFASG-LTVEQRSD--FPWDSHVEYTVSLPASAADSSVRFGLRIPGW-SLGSYTLTVN 517
Query: 577 GQDLPLPSPGNFLS--VTKTWSSDDKLTIQLPLTLRTEAIQDD---RPEYASIQAILYGP 631
G+ P+ G+ V ++ D L I L L + + ++ + R + + A++ GP
Sbjct: 518 GK----PAVGSLEDGFVYLVVNAGDTLEIALELDMSVKFVRANSRVRSDAGQV-AVMRGP 572
Query: 632 YVLAGHSI 639
V +
Sbjct: 573 LVYCAEQV 580
>gi|317482736|ref|ZP_07941749.1| hypothetical protein HMPREF0177_01144 [Bifidobacterium sp.
12_1_47BFAA]
gi|316915859|gb|EFV37268.1| hypothetical protein HMPREF0177_01144 [Bifidobacterium sp.
12_1_47BFAA]
Length = 658
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 57/248 (22%), Positives = 104/248 (41%), Gaps = 19/248 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG-VLGIQRGTEPGVMIYLLPL 459
D+ E+C + M ++ + + YAD E+ L NG + GI + + L
Sbjct: 333 DTMYGETCASVAMSMFAQQMLDLEPKGEYADVLEKELFNGSIAGISLDGKQYYYVNALET 392
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P HH + ++ CC + + IY E +G V Q+I++
Sbjct: 393 TPDGLDNPDRHHVLSHRVDWFGCACCPANIARLIASVDRYIYTERDGG-KTVLSHQFIAN 451
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+ D+ SG + V Q+ D WD ++ T++ + + + LRIP W S T+N
Sbjct: 452 KADFASG-LTVEQRSD--FPWDSHVEYTVSLPASAADSSVRFGLRIPGW-SLGSYTLTVN 507
Query: 577 GQDLPLPSPGNFLS--VTKTWSSDDKLTIQLPLTLRTEAIQDD---RPEYASIQAILYGP 631
G+ P+ G+ V ++ D L I L L + + ++ + R + + A++ GP
Sbjct: 508 GK----PAVGSLEDGFVYLVVNAGDTLEIALELDMSVKFVRANSRVRSDAGQV-AVMRGP 562
Query: 632 YVLAGHSI 639
V +
Sbjct: 563 LVYCAEQV 570
>gi|417109929|ref|ZP_11963472.1| hypothetical protein RHECNPAF_800032 [Rhizobium etli CNPAF512]
gi|327188729|gb|EGE55928.1| hypothetical protein RHECNPAF_800032 [Rhizobium etli CNPAF512]
Length = 640
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 56/237 (23%), Positives = 101/237 (42%), Gaps = 20/237 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + YAD E++L NG L G+ T+ Y PL
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALPGLS--TDGKTFFYDNPL 386
Query: 460 APGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
R +HH P CC + +G +Y + + V++ ++RL
Sbjct: 387 ESAGKHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAIADDEI-AVHLYGESTTRL 438
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+G V Q+ W+ + T +L+LRIP W ++GA ++NG+
Sbjct: 439 KLANGAAVELQQATNY-PWEGAVAFTTRLEKPAK---FALSLRIPDW--ADGATLSVNGE 492
Query: 579 --DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
DL + + + + W D++ + LPL+LR + + A A++ GP V
Sbjct: 493 KLDLGAATRDGYARIDRQWVDGDRVDLFLPLSLRPQYANPKVRQDAGRVALMRGPLV 549
>gi|190893687|ref|YP_001980229.1| hypothetical protein RHECIAT_CH0004122 [Rhizobium etli CIAT 652]
gi|190698966|gb|ACE93051.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
Length = 640
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/237 (23%), Positives = 101/237 (42%), Gaps = 20/237 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + YAD E++L NG L G+ T+ Y PL
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALPGLS--TDGKTFFYDNPL 386
Query: 460 APGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
R +HH P CC + +G +Y + + V++ ++RL
Sbjct: 387 ESAGKHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVADDEI-AVHLYGESTTRL 438
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+G V Q+ W+ + T +L+LRIP W ++GA ++NG+
Sbjct: 439 KLANGAAVELQQATNY-PWEGAVAFTTRLEKPAK---FALSLRIPDW--ADGATLSVNGE 492
Query: 579 --DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
DL + + + + W D++ + LPL+LR + + A A++ GP V
Sbjct: 493 KLDLGAVTRDGYARIDRQWVDGDRVDLFLPLSLRPQYANPKVRQDAGRVALMRGPLV 549
>gi|16265291|ref|NP_438083.1| hypothetical protein SM_b20631 [Sinorhizobium meliloti 1021]
gi|15141431|emb|CAC49943.1| conserved hypothetical protein [Sinorhizobium meliloti 1021]
Length = 640
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 105/244 (43%), Gaps = 34/244 (13%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI------ 454
D+ E+C + ++ + + + YAD E++L NG L PG+ I
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFF 381
Query: 455 YLLPLAPGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y PL R +HH P CC + +G +Y E + V++
Sbjct: 382 YDNPLESTGRHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGE 433
Query: 514 ISSRLDWKSG-QIVVNQKVDPVVSWDPYLRVTLTFSSK-GSGLTTSLNLRIPTWTSSNGA 571
++RL SG ++ + Q+ + W+ + F++K +L+LRIP W + GA
Sbjct: 434 STARLKLASGAEVELRQETN--YPWEG----AIAFATKLDRPAKFALSLRIPEWAA--GA 485
Query: 572 KATLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
++NG L L + G + + + WS D++ + LPL +R + + A++
Sbjct: 486 TLSVNGTMLDLSAHLTGGYARIEREWSDGDRVALYLPLAIRPQYANPKVRQDVGRVALMR 545
Query: 630 GPYV 633
GP V
Sbjct: 546 GPLV 549
>gi|410100001|ref|ZP_11294966.1| hypothetical protein HMPREF1076_04144 [Parabacteroides goldsteinii
CL02T12C30]
gi|409216556|gb|EKN09540.1| hypothetical protein HMPREF1076_04144 [Parabacteroides goldsteinii
CL02T12C30]
Length = 618
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/285 (22%), Positives = 114/285 (40%), Gaps = 27/285 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + M+ + + + T + Y D ERS+ NGVL GI + Y+ PL
Sbjct: 336 ETCASVGMVFWNHRMNQITGDAKYIDILERSMYNGVLAGISLSGDR--FFYVNPLESKGD 393
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSRLDWKSG 523
R W + CC +G+ IY ++ + +YI ++R
Sbjct: 394 HHR--QEWYGCA----CCPSQLSRFLPTIGNYIYAISDDALWVNLYIGN--TTRFTLNDD 445
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
+++ Q+ + WD +++T+ S L + LRIP W + T+NG+++ L
Sbjct: 446 NVILRQETN--YPWDGSVKLTV---SSTKDLDKEIRLRIPGWCKN--YTITINGKEVGLS 498
Query: 584 SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWD 643
+ ++ W D +++ + + + E+ E +AI GP V +
Sbjct: 499 QEKGY-AIVYDWKPGDMISLDMDMPVEVESADPLVTENIGKRAIQRGPLVYCAEETDNSA 557
Query: 644 ITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSIT 688
+ T SD T S+ + L+ G N QSIT
Sbjct: 558 YFDRLTLTSD--TEYHTSFEAGLLN-----GVKTINAKNEQQSIT 595
>gi|86359423|ref|YP_471315.1| hypothetical protein RHE_CH03841 [Rhizobium etli CFN 42]
gi|86283525|gb|ABC92588.1| hypothetical conserved protein [Rhizobium etli CFN 42]
Length = 640
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/233 (22%), Positives = 101/233 (43%), Gaps = 22/233 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + ++ + + + YAD E++L NG L G+ T+ Y PL
Sbjct: 334 ETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALPGLS--TDGKTFFYDNPLESAGK 391
Query: 465 KER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
R +HH P CC + +G +Y + + V++ ++RL +G
Sbjct: 392 HHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVADDEI-AVHLYGESTARLKLANG 443
Query: 524 -QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL 582
++ + Q + WD + T + +L+LRIP W + GA ++NG + L
Sbjct: 444 AEVELEQATN--YPWDGAVAFTAKLAKSAK---FALSLRIPDW--AEGASLSVNGTGVEL 496
Query: 583 PS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+ ++ + + W+ D++ + LP+ LR + + A A++ GP V
Sbjct: 497 GAHLRDGYIRIEREWAHGDRVALDLPMALRPQYANPKVRQDAGRVALMRGPLV 549
>gi|334320143|ref|YP_004556772.1| hypothetical protein [Sinorhizobium meliloti AK83]
gi|407722785|ref|YP_006842446.1| hypothetical protein BN406_05164 [Sinorhizobium meliloti Rm41]
gi|334097882|gb|AEG55892.1| protein of unknown function DUF1680 [Sinorhizobium meliloti AK83]
gi|407322845|emb|CCM71446.1| hypothetical protein BN406_05164 [Sinorhizobium meliloti Rm41]
Length = 640
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 105/244 (43%), Gaps = 34/244 (13%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI------ 454
D+ E+C + ++ + + + YAD E++L NG L PG+ I
Sbjct: 329 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGRTFF 381
Query: 455 YLLPLAPGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y PL R +HH P CC + +G +Y E + V++
Sbjct: 382 YDNPLESTGRHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVAEDEI-AVHLYGE 433
Query: 514 ISSRLDWKSG-QIVVNQKVDPVVSWDPYLRVTLTFSSK-GSGLTTSLNLRIPTWTSSNGA 571
++RL SG ++ + Q+ + W+ + F++K +L+LRIP W + GA
Sbjct: 434 STARLKLASGAEVELRQETN--YPWEG----AIAFATKLDRPAKFALSLRIPEWAA--GA 485
Query: 572 KATLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY 629
++NG L L + G + + + WS D++ + LPL +R + + A++
Sbjct: 486 TLSVNGTMLDLSAHLTGGYARIEREWSDGDRVALYLPLAIRPQYANPKVRQDVGRVALMR 545
Query: 630 GPYV 633
GP V
Sbjct: 546 GPLV 549
>gi|392965453|ref|ZP_10330872.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
gi|387844517|emb|CCH52918.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
Length = 650
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/262 (22%), Positives = 106/262 (40%), Gaps = 38/262 (14%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D E+C L + +F T + Y D +ER L NG L G+ E Y+ PL
Sbjct: 340 DVAYAETCAAVANLLWNHRMFLLTGQSKYMDVFERVLYNGFLAGVS--LEGDKFFYVNPL 397
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
A S +R ++ + W CC + L +Y + V++ +++
Sbjct: 398 A--SDGKRKFNVGVAAERAPWFGTSCCPTNVVRFLPSLPGYVYAVKNND---VFVNLFLT 452
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN------ 569
+ + G+ V + WD VT+T S + + L +RIP WT
Sbjct: 453 NSSELTVGKTPVQVQQQTNYPWDG--AVTMTVSPR-NAQAFDLLVRIPGWTLGKPMPGNL 509
Query: 570 -------GAKATL--NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQD 616
GA +L NG+ +P+ + +++TW D++ +++ + +R + ++D
Sbjct: 510 YSYRRNIGATPSLKVNGKAVPVKMDNGYARISRTWKPGDRVELRMEMPVREVIANQQVKD 569
Query: 617 DRPEYASIQAILYGPYVLAGHS 638
D A AI GP V +
Sbjct: 570 D----AGRVAIERGPIVYCAEA 587
>gi|374374966|ref|ZP_09632624.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
gi|373231806|gb|EHP51601.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
Length = 629
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/247 (21%), Positives = 90/247 (36%), Gaps = 46/247 (18%)
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPG 462
++ E+C T +K+ L R T + +A+ ER+ N +LG ++P
Sbjct: 326 HSNETCVTATWMKLCLQLLRTTGDAKWANEIERTFYNALLGA-----------MMPDG-- 372
Query: 463 SSKERSYHHWGTPSD--------------SFWCCYGTGIESFSKLGDSIYFEEEGKYPGV 508
H W +D CC G L + G+
Sbjct: 373 -------HTWNKYTDLRGVKYLGENQCGMDINCCIANGPRGLMVLPKEAFMINAA---GI 422
Query: 509 YIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS 568
+ Y ++ GQ N+ V+ P + G L +L LRIP W++
Sbjct: 423 AVNFYGTASATLSVGQ---NKVTLNTVTEYPKNGAVTIIVNPGKPLDFNLQLRIPEWSAH 479
Query: 569 NGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAIL 628
++NG + PG + ++ +TW D + +Q + +R + D Y +
Sbjct: 480 T--NISINGVAVDNAVPGKYTAIKRTWKQGDIVKLQFQMDVRQYFVPGDSTRY----CLQ 533
Query: 629 YGPYVLA 635
YGP VLA
Sbjct: 534 YGPLVLA 540
>gi|325103091|ref|YP_004272745.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324971939|gb|ADY50923.1| protein of unknown function DUF1680 [Pedobacter saltans DSM 12145]
Length = 673
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 115/494 (23%), Positives = 187/494 (37%), Gaps = 117/494 (23%)
Query: 186 LSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTE-------QF-DRLEA 237
+ A A ++AST ++ L E M ++ ++ Q+E G Y A + QF DRL
Sbjct: 106 IEAVASLYASTKDKKLDEMMDKAIAVIAKSQREDGYIYTKAMIDQRKTGVKNQFEDRLS- 164
Query: 238 LIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEY---FYNRVQNVIKKYSI-ER 293
+ Y H + AG + Y L + +Y FY + + + +I
Sbjct: 165 ----FEAYNIGHLMTAGCV-HYRATGKKNLLNVAIKATDYLYKFYKQASPTLARNAICPS 219
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLA-HLFDKPCFLGLLALQADDISGFHSNTH 352
H+ + E ++ D ++L LA HL D G + DD +
Sbjct: 220 HYMGVVE-----------MYRTLGDKRYLELAKHLID---IKGEIEDGTDD-----NQDR 260
Query: 353 IPI-----VIGSQMR-----------YEVTGD-----QLHK------------------- 372
IP V+G +R Y TGD QLHK
Sbjct: 261 IPFRKQEKVMGHAVRANYLYAGVADVYAETGDRTLISQLHKMWNDVTQHKMYITGGCGSL 320
Query: 373 ------EGHQLESSGTNIGHFNFKSD---PKRLASNLDSNTEESCTTYNMLKVSRHLFRW 423
+G E H + D P A N E+C + + + +
Sbjct: 321 YDGVSPDGTVYEPPIVQKVHQAYGRDYQLPNFTAHN------ETCANIGNVLWNWRMLQL 374
Query: 424 TKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLPLAPGSSKERSYHHWGTPS 476
+ YAD E +L N VL GI T P LP SKER + S
Sbjct: 375 EGDAKYADVMELALYNSVLSGISLDGKRFLYTNPLSYSDNLPFKQRWSKERV--EYIKLS 432
Query: 477 DSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVV 535
+ CC + + +++ + Y +G Y +Y +S++LD S + Q P
Sbjct: 433 N---CCPPNTVRTIAEVSNYAYSISNKGVYVNLYGSNNLSTKLDDGSTIKLTQQTEYP-- 487
Query: 536 SWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ--DLPLPSPGNFLSVTK 593
W+ + +T++ S K S+ +RIP W +N AK ++NG+ D + S G +L + +
Sbjct: 488 -WEGRVAITISESKKSP---FSIFMRIPGW--ANSAKVSINGKSVDADIKS-GQYLELNR 540
Query: 594 TWSSDDKLTIQLPL 607
W D++ + LP+
Sbjct: 541 NWKKGDQIVLNLPM 554
>gi|375146847|ref|YP_005009288.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361060893|gb|AEV99884.1| protein of unknown function DUF1680 [Niastella koreensis GR20-10]
Length = 674
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/222 (25%), Positives = 94/222 (42%), Gaps = 37/222 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C L + + + + YAD E L NG+L GI + Y PL+
Sbjct: 359 ETCANIGNLLWNWRMLLLSGDAKYADVMELELYNGILSGIS--LDGNNFFYTNPLS---- 412
Query: 465 KERSYHHWGTPSDSFW-------------CCYGTGIESFSKLGDSIY-FEEEGKYPGVYI 510
H P W CC + + +++GD Y +G + +Y
Sbjct: 413 -----HSADYPYTLRWQEAGRVPYIKLSNCCPPNTVRTMAEVGDYAYTTSNKGLWVHLYG 467
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
IS++L+ S + Q P WD +++ T+T K SL LRIP W +
Sbjct: 468 ANKISTKLEDGSALEMTQQSNYP---WDGHIKFTVT---KAEAKAFSLYLRIPGW--CDK 519
Query: 571 AKATLNGQDLPLPS-PGNFLSVTKTWSSDD--KLTIQLPLTL 609
A T+NG+ + P+ P ++ + + W + D +L + +P+TL
Sbjct: 520 AALTVNGKPVTGPNKPATYVELNRAWKAGDVVELNLSMPVTL 561
>gi|317492212|ref|ZP_07950641.1| hypothetical protein HMPREF0864_01405 [Enterobacteriaceae bacterium
9_2_54FAA]
gi|316919551|gb|EFV40881.1| hypothetical protein HMPREF0864_01405 [Enterobacteriaceae bacterium
9_2_54FAA]
Length = 661
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 90/239 (37%), Gaps = 18/239 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ ESC + ++ + + + + YAD ER+L N VLG + Y+ PL
Sbjct: 343 DTVYAESCASIGLMMFANRMLQMEGDSQYADVMERALYNTVLG-GMALDGRHFFYVNPLE 401
Query: 461 --PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
P S + P W CC + +G IY + + +YI Y+
Sbjct: 402 VHPKSIPFNHIYDHVKPIRQRWFGCACCPPNIARILTSIGHYIYTQ---RSDALYINLYV 458
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+ +G + P WD + V + L +L LR+P W +
Sbjct: 459 GNETLLDNGLKIAISGNYP---WDENVSVHIRTEKP---LHQTLALRMPEWCEK--PRVQ 510
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG+ +L + + W D+L I LP+ +R A AI GP V
Sbjct: 511 LNGETCEDLLQRGYLHIAREWQDGDRLEIVLPMPVRRVYGNPLLRHVAGKVAIQRGPLV 569
>gi|410725713|ref|ZP_11364076.1| hypothetical protein A370_02153 [Clostridium sp. Maddingley
MBC34-26]
gi|410601724|gb|EKQ56224.1| hypothetical protein A370_02153 [Clostridium sp. Maddingley
MBC34-26]
Length = 648
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/239 (20%), Positives = 98/239 (41%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ +R + + + YAD E++L NGV+ G+ + L +
Sbjct: 328 DTIYAETCASIGLVFFARRMLEISPKSKYADIMEKALYNGVISGMSLDGTKFFYVNPLEV 387
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYI 514
P SS++ W CC + +G Y +E + +Y+ I
Sbjct: 388 VPESSEKDHLRAHVKVERQKWFGCACCPPNLARLLASIGSYAYSIKENTMFMHLYMGGEI 447
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
++ L + V KV+ WD +++TL + + + +RIP W + K
Sbjct: 448 TTNLSNNN----VAFKVETNYPWDENVKITLNIKEE---INFEVAIRIPEWCGNYNIK-- 498
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+NG+D+ + + + W + D + + + + + + E A++ GP V
Sbjct: 499 VNGEDVEYKIIYGYAYIDRVWKNADAIDVDFKMPVEVMSANVNVRENIGKVAVMRGPIV 557
>gi|384202264|ref|YP_005588011.1| hypothetical protein BLNIAS_02509 [Bifidobacterium longum subsp.
longum KACC 91563]
gi|338755271|gb|AEI98260.1| hypothetical protein BLNIAS_02509 [Bifidobacterium longum subsp.
longum KACC 91563]
Length = 658
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/242 (23%), Positives = 103/242 (42%), Gaps = 19/242 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG-VLGIQRGTEPGVMIYLLPL 459
D+ E+C + M ++ + + YAD E+ L NG + GI + + L
Sbjct: 333 DTMYGETCASVAMSMFAQQMLDLEPKGEYADVLEKELFNGSIAGISLDGKQYYYVNALET 392
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P HH + ++ CC + + IY E +G V Q+I++
Sbjct: 393 TPDGLDNPDRHHVLSHRVDWFGCACCPANIARLIASVDRYIYTERDGG-KTVLSHQFIAN 451
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+ D+ SG + V Q+ D WD ++ T++ + + + LRIP W S T+N
Sbjct: 452 KADFASG-LTVEQRSD--FPWDSHVEYTVSLPASAADSSVRFGLRIPGW-SLGSYTLTVN 507
Query: 577 GQDLPLPSPGNFLS--VTKTWSSDDKLTIQLPLTLRTEAIQDD---RPEYASIQAILYGP 631
G+ P+ G+ + ++ D L I L L + + ++ + R + + A++ GP
Sbjct: 508 GK----PAVGSLEDGFIYLVVNAGDTLEIALELDMSVKFVRANSRVRSDAGQV-AVMRGP 562
Query: 632 YV 633
V
Sbjct: 563 LV 564
>gi|222082345|ref|YP_002541710.1| hypothetical protein Arad_8964 [Agrobacterium radiobacter K84]
gi|221727024|gb|ACM30113.1| conserved hypothetical protein [Agrobacterium radiobacter K84]
Length = 643
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/236 (22%), Positives = 96/236 (40%), Gaps = 18/236 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
+S E+C + ++ + + YAD E++L NG + + Y PL
Sbjct: 328 ESAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GLSLDGKTFFYENPLE 386
Query: 461 PGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
G R ++HH P CC + +G +Y + + V++ +R+
Sbjct: 387 SGGKHHRWTWHH--CP-----CCPPNIARLLASIGSYMYAAADNEI-AVHLYGESKARVP 438
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ- 578
SG + V + WD +R + +L+LRIP W ++GA +NG
Sbjct: 439 LASG-VTVELAQETRYPWDGAIRFEVNPDRNAR---FALSLRIPEW--ADGATLAVNGVP 492
Query: 579 -DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
DL + + + + W + D++ + +PL RT + A A++ GP V
Sbjct: 493 VDLSAVTIDGYARIERDWQAGDRVDLNIPLIPRTLFANPKVRQDAGRAALMRGPLV 548
>gi|291540943|emb|CBL14054.1| Uncharacterized protein conserved in bacteria [Roseburia
intestinalis XB6B4]
Length = 650
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 90/207 (43%), Gaps = 18/207 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D N ESC + + + + TK+ YAD E++L N VL GI + + L +
Sbjct: 329 DRNYSESCASIGLAMFGNRMAQITKDAKYADIVEKALYNTVLAGIAMDGKSFFYVNPLEV 388
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + ER+ P W CC + + LG IY +E +YI YIS
Sbjct: 389 WPDNCIERTSMEHVKPVRQKWFGVACCPPNIARTLASLGQYIYGADEN---SLYINLYIS 445
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLR---VTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
S+ ++++ + V+ +L+ VT+ S+ + T L LRIP +T
Sbjct: 446 SQT-----KLLIGETETEVIMESSFLKDGTVTVHLESEKASKGT-LALRIPGYTKEFTVW 499
Query: 573 ATLNGQDLPLPSPGNFLSVTKTWSSDD 599
+ + PL G +L +T +S++
Sbjct: 500 RGVQRIETPLIKKG-YLMITDLAASEE 525
>gi|398379890|ref|ZP_10538009.1| hypothetical protein PMI03_03641 [Rhizobium sp. AP16]
gi|397721906|gb|EJK82452.1| hypothetical protein PMI03_03641 [Rhizobium sp. AP16]
Length = 643
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/236 (22%), Positives = 96/236 (40%), Gaps = 18/236 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
+S E+C + ++ + + YAD E++L NG + + Y PL
Sbjct: 328 ESAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GLSLDGKTFFYENPLE 386
Query: 461 PGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
G R ++HH P CC + +G +Y + + V++ +R+
Sbjct: 387 SGGKHHRWTWHH--CP-----CCPPNIARLLASIGSYMYAAADNEI-AVHLYGESKARVP 438
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ- 578
SG + V + WD +R + +L+LRIP W ++GA +NG
Sbjct: 439 LASG-VTVELAQETRYPWDGAIRFEVNPDRNAR---FALSLRIPEW--ADGATLAVNGVP 492
Query: 579 -DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
DL + + + + W + D++ + +PL RT + A A++ GP V
Sbjct: 493 VDLSAVTIDGYARIERDWQAGDRVDLNIPLIPRTLFANPKVRQDAGRAALMRGPLV 548
>gi|424872619|ref|ZP_18296281.1| hypothetical protein Rleg5DRAFT_4131 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393168320|gb|EJC68367.1| hypothetical protein Rleg5DRAFT_4131 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 648
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/238 (23%), Positives = 103/238 (43%), Gaps = 22/238 (9%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + YAD E++L NG L G+ T+ Y PL
Sbjct: 337 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALPGLS--TDGKTFFYDNPL 394
Query: 460 APGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
R +HH P CC + +G +Y + + V++ ++RL
Sbjct: 395 ESAGKHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVSDNEI-AVHLYGESTARL 446
Query: 519 DWKSG-QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+G ++ + Q + W+ + T +L+LRIP W + GA ++NG
Sbjct: 447 KLANGAEVELEQTTN--YPWEGAVAFTTRLEKPAR---FALSLRIPDW--AEGATLSVNG 499
Query: 578 QDLPLPSP--GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+ L L + + + + W++ D++ + LPL LR + + A A++ GP V
Sbjct: 500 EMLDLNANMYDGYARIDREWAAGDRVALYLPLALRPQYANPKVRQDAGRVALMRGPLV 557
>gi|298247044|ref|ZP_06970849.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
gi|297549703|gb|EFH83569.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
Length = 639
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 82/369 (22%), Positives = 142/369 (38%), Gaps = 61/369 (16%)
Query: 308 VLYKLFCITQDPKHLMLAHLF-----DKPCFLGLLA-LQADDISGF------HSNTHIPI 355
L KL+ +T + ++L L+ F +P + A L+ DD F ++ +H+PI
Sbjct: 199 ALVKLYRVTGEKRYLNLSQYFVDERGKQPHYFDEEAHLRGDDPRDFWAQTYEYNQSHVPI 258
Query: 356 -----VIGSQMR----YEVTGDQLHKEGHQ--LESSGTNIGHFNFKSDPKRLASNLDSNT 404
V+G +R Y D L KE + L +G + H + S + + S
Sbjct: 259 REQREVVGHAVRAMYLYSAVAD-LVKERYDESLFQTGERLWH-HLVSKRLYITGGIGSTA 316
Query: 405 E-----------------ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQR 446
+ ESC + ++ + L + + YAD ER+L NG+L GI
Sbjct: 317 KNEGFTEDYDLPNLTAYAESCASIGLVMWNHRLLQLDADSRYADLLERALYNGMLSGI-- 374
Query: 447 GTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP 506
+ Y+ PL R W + CC + LG +Y +
Sbjct: 375 SLDGSKYFYVNPLESKGDHHRV--GWFKCA----CCPPNIARTLMSLGQYVYTVSDTD-- 426
Query: 507 GVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT 566
++ YI + G V + + WD + + + LNLRIP W
Sbjct: 427 -IFTHLYIQGTGELSVGGHNVKVEQETKYPWDGAISLKMELDEPAD---FGLNLRIPGWC 482
Query: 567 SSNGAKATLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
+ A+ +LNG+ + L ++ + + W S D++ + L + + D E +
Sbjct: 483 QA--AQLSLNGEAIALDDHLQKGYVRIERRWQSGDQIVLNLAMPVMRVYAHPDIRENSDR 540
Query: 625 QAILYGPYV 633
A+ GP V
Sbjct: 541 VALQRGPLV 549
>gi|241206592|ref|YP_002977688.1| hypothetical protein Rleg_3907 [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|240860482|gb|ACS58149.1| protein of unknown function DUF1680 [Rhizobium leguminosarum bv.
trifolii WSM1325]
Length = 648
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/243 (22%), Positives = 103/243 (42%), Gaps = 32/243 (13%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI------ 454
D+ E+C + ++ + + + YAD E++L NG L PG+ I
Sbjct: 337 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGAL-------PGLSIDGKTFF 389
Query: 455 YLLPLAPGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
Y PL R +HH P CC + +G +Y + + V++
Sbjct: 390 YDNPLESAGKHHRWKWHH--CP-----CCPPNIARLVTSIGSYMYAVSDNEI-AVHLYGE 441
Query: 514 ISSRLDWKSG-QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
++RL +G ++ + Q + W+ + T +L+LR+P W ++GA
Sbjct: 442 STARLKLANGAEVELEQTTN--YPWEGAVAFTTRLEKPAK---FALSLRVPDW--ADGAT 494
Query: 573 ATLNGQ--DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
++NG+ DL + + + W++ D++ + LPL LR + + A A++ G
Sbjct: 495 LSVNGEMLDLNANMRDGYARIDREWAAGDRVALYLPLALRPQYANPKVRQDAGRVALMRG 554
Query: 631 PYV 633
P V
Sbjct: 555 PLV 557
>gi|312135914|ref|YP_004003252.1| hypothetical protein Calow_1923 [Caldicellulosiruptor owensensis
OL]
gi|311775965|gb|ADQ05452.1| protein of unknown function DUF1680 [Caldicellulosiruptor
owensensis OL]
Length = 652
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/242 (19%), Positives = 98/242 (40%), Gaps = 14/242 (5%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG-IQRGTEPGVMIYLLPL 459
D+ E+C + ++ + L R Y D ER+L N V+G + + + + L +
Sbjct: 332 DAAYAETCASVGLIFFAHRLNRIEPHAKYYDAVERALYNTVIGSMSQDGKKYFYVNPLEV 391
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYI 514
P ++R P W CC + LG IY + +E +Y+ YI
Sbjct: 392 YPKEVEKRFDRRHVKPERQPWFGCACCPPNVARLLASLGRYIYSYNQE----EIYVNLYI 447
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
S + + G V + + ++ +++ L S + L LRIP+W
Sbjct: 448 GSSVQVEVGSAKVLLQQESGYPFEDMVKIDLKTSKEAR---FKLYLRIPSWCEKYEVYVN 504
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVL 634
+++ P ++ + + W+ ++++ +++P ++ + S A++ GP V
Sbjct: 505 EKKEEMQ-KLPSGYVCIERLWTENNQVVLKIPTEVKMVSSHPQVRSNVSKVAVVKGPVVF 563
Query: 635 AG 636
Sbjct: 564 CA 565
>gi|116625572|ref|YP_827728.1| hypothetical protein Acid_6519 [Candidatus Solibacter usitatus
Ellin6076]
gi|116228734|gb|ABJ87443.1| protein of unknown function DUF1680 [Candidatus Solibacter usitatus
Ellin6076]
Length = 631
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/291 (20%), Positives = 109/291 (37%), Gaps = 48/291 (16%)
Query: 478 SFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSW 537
+F CC + + KL S++ G + Y + SG + + ++ D
Sbjct: 383 NFGCCTANMHQGWPKLAASLWMATNDG--GFAAVAYGPGEV--TSGGVTIEERTD----- 433
Query: 538 DPYLR-VTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWS 596
P+ V+L + S L LRIP W +NGA +NGQ PG F V + W
Sbjct: 434 YPFRENVSLLVKTDKS---FPLVLRIPAW--ANGATVAVNGQQQAGVKPGAFFRVQRAWR 488
Query: 597 SDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWIT 656
+ D++ + P+ +R + + + ++ GP V + +W + SDW
Sbjct: 489 AGDRVELHFPMAVRMSSW------FNNSTSVERGPLVYSLRIGENWHKIKQTGPSSDWEV 542
Query: 657 PIPASYNSQLITFTQEYGNTKFVLTNSNQSITMEKFPKSGTDAALHATFRLILNDSSGSE 716
+N L+ K T + I + F + + A R + E
Sbjct: 543 YPSTPWNYALV---------KGAFTAVERPIERQPFRAESSPVEITAKARRL------PE 587
Query: 717 FSSLNDFIGKSVMLEPFDSPGMLVIQHETDDELVVTDSFIAQGSSVFHLVA 767
++ ++ DSPG+L + T T + + G++ + A
Sbjct: 588 WTLVD------------DSPGVLPVSPVTSKRPEETITLVPYGAAKLRITA 626
>gi|270339568|ref|ZP_06005245.2| conserved hypothetical protein [Prevotella bergensis DSM 17361]
gi|270334558|gb|EFA45344.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
Length = 813
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 114/281 (40%), Gaps = 48/281 (17%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLPLAPGS 463
E+C + + + +F T + Y D YER+L NGVL G+ G E Y PL S
Sbjct: 344 ETCASIANVYWNYRMFLATGDAKYVDVYERALYNGVLSGVSLSGKE---FFYDNPLE--S 398
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
+ + W + CC G + F + G +++ YI + D
Sbjct: 399 MGQHARQAWFGCA----CCPGN-VTRFVASVPQYQYATRGN--DIFVNLYIQGKADINGV 451
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS-----------NGAK 572
Q+ WD + + ++ + T ++ RIP W + + AK
Sbjct: 452 QLTQTTN----YPWDGNISIQVSPKRRS---TFAIRFRIPGWAHNKPVSTNLYHFIDKAK 504
Query: 573 ---ATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYASIQ 625
LNG + ++ +++ W D++ I+LP+ +R + ++DDR +
Sbjct: 505 PYAVKLNGDVVDATLEDGYVVISRKWKKGDRVEIELPMDVRRVQANDNVEDDRGKI---- 560
Query: 626 AILYGP--YVLAGHSIGDWDITESATSLSDWITPIPASYNS 664
A+ GP + L G D + +L+ TPI ASY+S
Sbjct: 561 ALERGPVMFCLEGKDQSDNTVFNKIITLT---TPITASYHS 598
>gi|160878749|ref|YP_001557717.1| hypothetical protein Cphy_0591 [Clostridium phytofermentans ISDg]
gi|160427415|gb|ABX40978.1| protein of unknown function DUF1680 [Clostridium phytofermentans
ISDg]
Length = 646
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 65/269 (24%), Positives = 105/269 (39%), Gaps = 41/269 (15%)
Query: 394 KRLASNLD----SNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGT 448
+R +N D SN E+C + + R + + T +Y D ER+L N VL GI
Sbjct: 314 ERFTANYDLPNNSNYSETCASIGLALFGRRMAQITHNASYMDVVERALYNTVLAGIAMDG 373
Query: 449 EPGVMIYLLPLAPGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGK 504
+ + L + PG+ +R+ P W CC + + LG+ IYF +E
Sbjct: 374 KSFFYVNPLEVWPGNCIKRTSKEHVKPIRQPWFGVACCPPNVARTLASLGEYIYFYDEN- 432
Query: 505 YPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGS---------GLT 555
+++ +IS NQ + + + LR+ F G G
Sbjct: 433 --SIWVNLFIS------------NQTTVKLQNREATLRLATRFPYDGKVHMEVDGEEGFC 478
Query: 556 TSLNLRIPTWTSSNGAKATLNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLRTEAI 614
L +RIP + +NG +L N +L + T S K TI + TL+ I
Sbjct: 479 GKLYIRIPEYAKEYC--VFVNGLELTQKEITNGYLEIEITSS---KKTIDMEFTLKPRMI 533
Query: 615 QDD--RPEYASIQAILYGPYVLAGHSIGD 641
+ + E AI+ GP V + +
Sbjct: 534 RANPLVKEDIGKVAIMKGPLVYCMEEVDN 562
>gi|397691075|ref|YP_006528329.1| six-hairpin glycosidase [Melioribacter roseus P3M]
gi|395812567|gb|AFN75316.1| six-hairpin glycosidase [Melioribacter roseus P3M]
Length = 643
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 98/247 (39%), Gaps = 29/247 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
E+C + + LF T + YAD ER+L NG++ G + P S
Sbjct: 338 ETCAAIGSVYWNYRLFEMTGDSKYADVIERTLYNGLIS---GISLDGKNFFYPNPLESDG 394
Query: 466 ERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK 521
E ++ G + W CC I L IY + VY+ ++ S+ D +
Sbjct: 395 EYKFNM-GACTRQPWFDCSCCPTNLIRFIPSLPGLIYSVDRD---SVYVNLFVGSKADIE 450
Query: 522 SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS------------- 568
G N ++ S+ +VTL + + T L +RIP W+ +
Sbjct: 451 LGN--KNVRIIQKTSYPLDYKVTLNIEPQAATQFT-LKIRIPGWSRNIPLPGDLYRYANK 507
Query: 569 -NGA-KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
NG + +NG++ L + +TK W DK+ + LP ++ + E + A
Sbjct: 508 QNGKIRLLVNGEEQSLNISSGYAVITKLWEKGDKVDLILPKEVKKVLANEKVKENRNKVA 567
Query: 627 ILYGPYV 633
I GP+V
Sbjct: 568 IELGPFV 574
>gi|251796469|ref|YP_003011200.1| hypothetical protein Pjdr2_2459 [Paenibacillus sp. JDR-2]
gi|247544095|gb|ACT01114.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 659
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/243 (24%), Positives = 90/243 (37%), Gaps = 20/243 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG--IQRGTEPGVMIYLLP 458
D+ E+C + ++ ++ + + + YAD ER+L N V+G Q G Y+ P
Sbjct: 333 DTVYAETCASIGLIFFAQRMLKLEAKSEYADVLERALYNNVVGSMSQDGKH---YFYVNP 389
Query: 459 LA--PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
L P +S++ H W CC S L D IY +Y
Sbjct: 390 LEVWPQASEKNPGRHHVKAERQKWFGCSCCPPNVARLLSSLNDYIYTVSAANNT-IYTHL 448
Query: 513 YISS--RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNG 570
+I S R + +G + + Q+ + W Y R G + LRIP+W S
Sbjct: 449 FIGSVARFELAAGSVSLKQQSQ--LPWKGYTRFEF---DDVPGAAFTFALRIPSW-SRGK 502
Query: 571 AKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
A +NGQ + V + W D + L + A A AI G
Sbjct: 503 AVLNINGQAAEYTEENGYALVNRNWQQGDVAEWEPALEAQLTAAHPQIRANAGKVAIERG 562
Query: 631 PYV 633
P V
Sbjct: 563 PLV 565
>gi|325261850|ref|ZP_08128588.1| putative cytoplasmic protein [Clostridium sp. D5]
gi|324033304|gb|EGB94581.1| putative cytoplasmic protein [Clostridium sp. D5]
Length = 643
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 97/248 (39%), Gaps = 32/248 (12%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL- 459
D E+C ++ +R + + YAD ER+L NGVLG G + Y+ PL
Sbjct: 324 DRAYAETCAAVGLVFWARKMLNIALDGNYADVMERALYNGVLG-GMGRDGRHFFYVNPLE 382
Query: 460 -APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEG-KYPGVY---I 510
PG S + + P W CC + LG + E G Y +Y I
Sbjct: 383 VVPGISGQVPGYEHVRPVRPRWYACACCPPNIARLLASLGKYAWGEAPGFVYSHLYLGGI 442
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS-- 568
+R+ WK+ V + R+ + + T+L +RIP W S
Sbjct: 443 FHAAQNRISWKT-----------VTDYPWEGRILYEVYNSENEEQTALVIRIPGWCPSYS 491
Query: 569 ---NGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
NG + T NG + + ++++ + W D + +QL + ++ E
Sbjct: 492 LSVNGKECT-NGHE----NRQGYITIKRAWKKGDTVCLQLSMEIKRIYANLMVREDTGCI 546
Query: 626 AILYGPYV 633
A++ GP V
Sbjct: 547 ALMRGPLV 554
>gi|326789389|ref|YP_004307210.1| hypothetical protein Clole_0260 [Clostridium lentocellum DSM 5427]
gi|326540153|gb|ADZ82012.1| protein of unknown function DUF1680 [Clostridium lentocellum DSM
5427]
Length = 638
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 88/218 (40%), Gaps = 19/218 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLP 458
D+ E+C ++ +R + K YAD ER+L N VL G+Q GT+ Y+ P
Sbjct: 323 DTAYAETCAAIGLIFFARKMIDLEKNNEYADIMERALYNCVLAGMQLDGTK---FFYVNP 379
Query: 459 LA--PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQ 512
L PG S E H P W CC S +G + EE VY
Sbjct: 380 LESIPGISGEAVTHRHALPQRPKWFTCACCPPNVARLLSSMGRYAWSEEGNT---VYSHL 436
Query: 513 YISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
+I LD ++ K+ S+ +V F + +L +R+P W S
Sbjct: 437 FIGGTLDLTD---TLHGKIKVETSYPYGNQVRYRFEPNDESMDLTLAIRLPLW--SENTS 491
Query: 573 ATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
L+ + ++ +TK ++ +D +T+ + ++
Sbjct: 492 IMLDEKKANYEIRNGYVYLTKAFTQEDMVTVTFDMNVK 529
>gi|302883148|ref|XP_003040476.1| hypothetical protein NECHADRAFT_44741 [Nectria haematococca mpVI
77-13-4]
gi|256721360|gb|EEU34763.1| hypothetical protein NECHADRAFT_44741 [Nectria haematococca mpVI
77-13-4]
Length = 645
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 70/165 (42%), Gaps = 21/165 (12%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIY---LLPLAPG 462
E+C T+ ++ + R + YAD E +L NG LG + G Y +L G
Sbjct: 339 ETCATFALINWCARMLRLDLDAEYADVMEVALYNGFLGAV--NQDGDAFYYENVLRTRKG 396
Query: 463 SSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
KERS W + CC + LG IY ++ V I QYI S L
Sbjct: 397 EFKERS--KWFGVA----CCPPNVAKLLGNLGSLIY-SQDASTNLVAIHQYIDSELKIPE 449
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS 567
+++ QK D + WD + S +GS +L LRIP+W
Sbjct: 450 SGVIIRQKTD--MPWDG----QVVLSIQGSA---NLALRIPSWAK 485
>gi|291535675|emb|CBL08787.1| Uncharacterized protein conserved in bacteria [Roseburia
intestinalis M50/1]
Length = 650
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 89/207 (42%), Gaps = 18/207 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D N ESC + + + + TK+ YAD E++L N VL GI + + L +
Sbjct: 329 DRNYSESCASIGLAMFGNRMAQITKDAKYADIVEKALYNTVLAGIAMDGKSFFYVNPLEV 388
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + ER+ P W CC + + LG IY +E +YI YIS
Sbjct: 389 WPDNCIERTSMEHVKPVRQKWFGVACCPPNIARTLASLGQYIYGADEN---SLYINLYIS 445
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLR---VTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
S+ ++++ + V+ +L+ VT+ S+ + T L LRIP +T
Sbjct: 446 SQT-----KLLIGETETEVIMESSFLKDGTVTVHLESEKASKGT-LALRIPGYTKEFTVW 499
Query: 573 ATLNGQDLPLPSPGNFLSVTKTWSSDD 599
+ PL G +L +T +S++
Sbjct: 500 RGTQKIETPLIKKG-YLMITDLAASEE 525
>gi|405380414|ref|ZP_11034253.1| hypothetical protein PMI11_04239 [Rhizobium sp. CF142]
gi|397323106|gb|EJJ27505.1| hypothetical protein PMI11_04239 [Rhizobium sp. CF142]
Length = 642
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 57/238 (23%), Positives = 97/238 (40%), Gaps = 23/238 (9%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLP 458
+S E+C + ++ + + YAD E++L NG + G+ GT Y P
Sbjct: 329 ESAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMAGLSLDGTR---FFYENP 385
Query: 459 LAPGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
L R +HH P CC + +G +Y E + V++ +R
Sbjct: 386 LESAGKHHRWIWHH--CP-----CCPPNIARLLASVGSYMYAIAEDEI-AVHLYGESKAR 437
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
D ++ ++Q+ WD + LT +L+LRIP W + G ++NG
Sbjct: 438 FDLAGAKVELSQQTR--YPWDGAIHFDLTLDRPAH---FALSLRIPEW--AEGVALSVNG 490
Query: 578 QDLPLPSPG--NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+ L L S + + + W S DK+ + +PL R + A A++ GP V
Sbjct: 491 EKLDLQSTTVEGYARIERDWKSGDKVDLSIPLAARKLFANPLVRQDAGRTALMRGPLV 548
>gi|298247843|ref|ZP_06971648.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
gi|297550502|gb|EFH84368.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
Length = 643
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 104/258 (40%), Gaps = 33/258 (12%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
E+C + + L + E + D E++L NGV+ + + Y PLA
Sbjct: 328 ETCAAIASVMWNWRLLQARPEARFTDVIEQTLYNGVIA-GSSLDGKLYFYQNPLADRGKH 386
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS--SRLDWKSG 523
R P CC + L Y E G+++ Y S +++ SG
Sbjct: 387 RRQ------PWFDTACCPPNIARLLASLPGYFYSTSE---EGIWLHLYASNTAQIPLASG 437
Query: 524 Q-IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP- 581
+ I + Q+ + WD + V L +L +RIP W + GA+ +N Q +
Sbjct: 438 EAITIEQQTN--YPWDEEIGVRLQMREAQD---FTLFVRIPAWAT--GAQIQVNKQPVEG 490
Query: 582 -LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ---AILYGPYV---- 633
PG + + +TW DK+TI LPL +R + + P S + AI GP V
Sbjct: 491 LAIKPGTYAQLNRTWQPGDKVTIVLPLEVR---LLESHPHVTSNRGRVAIARGPLVYCLE 547
Query: 634 -LAGHSIGDWDITESATS 650
+ S+ WDI S +
Sbjct: 548 QVDHGSVDVWDIVLSGQT 565
>gi|383122644|ref|ZP_09943336.1| hypothetical protein BSIG_0612 [Bacteroides sp. 1_1_6]
gi|251842259|gb|EES70339.1| hypothetical protein BSIG_0612 [Bacteroides sp. 1_1_6]
Length = 698
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 92/216 (42%), Gaps = 25/216 (11%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C + + + T + YA+ E L N VL GI T P + LP
Sbjct: 381 ETCANIGNMLFNWRMLEVTGDAKYAELVETCLYNSVLSGISLDGKKYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSR 517
KER T S +CC + + + + Y EG Y +Y +++
Sbjct: 441 YTLRWPKER------TEYISCFCCPPNTLRTLCQAQNYAYTLSPEGIYCNLYGANTLTT- 493
Query: 518 LDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+WK G++ + Q+ D W+ +RVTL + +G SL RIP W A +N
Sbjct: 494 -NWKDKGELALVQETD--YPWEGNIRVTLDKVPRKAG-AFSLFFRIPEWCGK--AALIVN 547
Query: 577 GQDLPLPSPGN-FLSVTKTWSSDD--KLTIQLPLTL 609
GQ + + + N + V +TW D +L + +P+ L
Sbjct: 548 GQPVSMNAKANTYAEVNRTWKKGDVVELVMDMPVCL 583
>gi|344201929|ref|YP_004787072.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
gi|343953851|gb|AEM69650.1| protein of unknown function DUF1680 [Muricauda ruestringensis DSM
13258]
Length = 656
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 56/264 (21%), Positives = 101/264 (38%), Gaps = 43/264 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
E+C + + L T ++ Y D ER+L NG++ G + P A S
Sbjct: 347 ETCAAIGDVYWNHRLHNLTGDVKYFDVIERTLYNGLIS---GLSLDGQKFFYPNALESDG 403
Query: 466 ERSYHHWG-TPSDSFWC-CYGTGIESF---------SKLGDSIYFEEEGKYPGVYIIQYI 514
++ T D F C C T + F SK D+IY V +
Sbjct: 404 VYKFNQGACTRKDWFDCSCCPTNVIRFLPAMPGLIYSKTDDTIY---------VNLYAAN 454
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN----- 569
+ ++ K + ++Q+ WD +++ + + KG ++ R+P W +
Sbjct: 455 GATVNLKDRAVKLSQETK--YPWDGKVKLMVDPTEKGK---FTIKFRVPGWARNKVLPGN 509
Query: 570 ----------GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
K +LNG++L L + + ++ K W D + ++ P+ +R
Sbjct: 510 LYQYATVINKKNKISLNGEELDLQAGDGYFTIAKEWEKGDVVELEFPMEVRKVEANQLVE 569
Query: 620 EYASIQAILYGPYVLAGHSIGDWD 643
E ++ YGP V A I + D
Sbjct: 570 ENKDKMSLEYGPMVYAVEEIDNKD 593
>gi|380510716|ref|ZP_09854123.1| hypothetical protein XsacN4_05853 [Xanthomonas sacchari NCPPB 4393]
Length = 660
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 46/216 (21%), Positives = 88/216 (40%), Gaps = 19/216 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL- 459
D+ ESC + ++ + + + + YAD ER+L N VLG + Y+ PL
Sbjct: 334 DTAYNESCASIGLMMFANRMLQLAPDGRYADVMERALYNTVLG-GMALDGRHFFYVNPLE 392
Query: 460 --APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
P ++ H P W CC + LG +Y + +Y+ Y
Sbjct: 393 VHPPTLHGNHTFDHV-KPVRQRWFGCACCPPNIARVLTSLGHYLYTRHDDT---LYVNLY 448
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ S ++ G ++ + W + + S+ + +L LR+P W + +
Sbjct: 449 VGSDARFEVGGQILTLRQRGEYPWQDTIDFDVACSAP---MDAALALRLPDWCQA--PQL 503
Query: 574 TLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPL 607
LNG+ + + + + + + W S D L ++LP+
Sbjct: 504 LLNGEPVAIEAHRQHGYCVLRRRWQSGDTLQLRLPM 539
>gi|410616495|ref|ZP_11327487.1| hypothetical protein GPLA_0709 [Glaciecola polaris LMG 21857]
gi|410164204|dbj|GAC31625.1| hypothetical protein GPLA_0709 [Glaciecola polaris LMG 21857]
Length = 659
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 111/262 (42%), Gaps = 27/262 (10%)
Query: 409 TTYN--MLKVSRHLFRW-----TKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAP 461
T YN +S +F W T E +AD E L N + + TE Y PL
Sbjct: 336 TAYNETCANISNAMFNWRLLGITGEAKHADVIELVLHNSAM-VGISTEGDKYFYANPLRM 394
Query: 462 G-SSKERSYHHWGTPSDS------FWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQY 513
+E S H T S +CC + + +++ Y + G ++
Sbjct: 395 NFGQREYSDHCDCTESPDREAYIECFCCPPNLVRTIAQVSAWAYSLTDVGLAVNLFGSNA 454
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
++++L + ++Q+ D WD +V L S L + +RIP+W + GA
Sbjct: 455 LNTKL-LDGSTLRLSQQTD--FPWDG--KVALKIEECKSALF-DIQIRIPSW--AKGATL 506
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
++NG+ +P+ G + + + W + D +T+ +P+ ++ E + A+ GP V
Sbjct: 507 SVNGETIPVVEAGQYTKIERQWQAGDNITLNMPMDIQFVEGHPRIEEIRNQVAVKRGPLV 566
Query: 634 LAGHSIGDWDITESATSLSDWI 655
+ I DI ES++ L +I
Sbjct: 567 ---YCIETPDIPESSSILDMYI 585
>gi|284039567|ref|YP_003389497.1| hypothetical protein Slin_4720 [Spirosoma linguale DSM 74]
gi|283818860|gb|ADB40698.1| protein of unknown function DUF1680 [Spirosoma linguale DSM 74]
Length = 655
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 60/265 (22%), Positives = 102/265 (38%), Gaps = 42/265 (15%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D E+C + + +F T E Y D +ER L NG L G+ E Y+ PL
Sbjct: 339 DVAYAETCAAVANMLWNHRMFLLTGESKYMDVFERVLYNGFLAGVS--LEGDSFFYVNPL 396
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
A S +R ++ + + W CC + L +Y K ++I +++
Sbjct: 397 A--SDGKRKFNVGQAATRAPWFGTSCCPTNVVRFLPSLPGYVY---ATKGDNLFINLFLT 451
Query: 516 --SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
S+L + + Q+ + WD + +T+ T ++ LR+P W S
Sbjct: 452 NQSKLSVNGKSVQIRQETN--YPWDGNVAITV---QPKLAQTFTIQLRLPGWASGTPMPG 506
Query: 574 TL---------------NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAI 614
L NG+ +P + +++TW D+L L + +R E +
Sbjct: 507 YLYEYVNTTAKTPVLLVNGKPVPYKIENGYARISRTWKPGDRLEWTLDMPVREVKANEQV 566
Query: 615 QDDRPEYASIQAILYGPYVLAGHSI 639
DDR + AI GP V +
Sbjct: 567 TDDRKKV----AIERGPLVYCAEGV 587
>gi|308067034|ref|YP_003868639.1| hypothetical protein PPE_00219 [Paenibacillus polymyxa E681]
gi|305856313|gb|ADM68101.1| DUF1680 domain containing protein [Paenibacillus polymyxa E681]
Length = 647
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 105/245 (42%), Gaps = 23/245 (9%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
DS E+C + + + + R + YAD ER+L NG + G+ G + + L +
Sbjct: 331 DSMYCETCASVGLAFWANRMLRLAPDRKYADVLERALYNGTISGMDLGGKRFFYVNPLEV 390
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P + H T ++ CC + + D++Y + + +Y YI+S
Sbjct: 391 NPFQKSRKDQEHVKTERQKWFFCACCPPNLARMIASVEDNMYTQTDDT---LYTHLYIAS 447
Query: 517 RLDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT-SLNLRIPTWTSSNGAKAT 574
+++ SGQ V + WD LTFS + T LRIP W A+
Sbjct: 448 KVNMTLSGQEVEITQTHH-YPWD----ADLTFSIHVTEPTPFKWALRIPGWCKQ--AEVK 500
Query: 575 LNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ---AILYG 630
+NG+ + L ++ + +TW D +T+ L + + E I+ + P+ + Q A+ G
Sbjct: 501 VNGETISLDRLEKGYIEIQRTWKDGDVVTLHLAMPV--ERIRSN-PQVSMNQQQIALQRG 557
Query: 631 PYVLA 635
P V
Sbjct: 558 PVVFC 562
>gi|373456252|ref|ZP_09548019.1| protein of unknown function DUF1680 [Caldithrix abyssi DSM 13497]
gi|371717916|gb|EHO39687.1| protein of unknown function DUF1680 [Caldithrix abyssi DSM 13497]
Length = 676
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 122/591 (20%), Positives = 216/591 (36%), Gaps = 70/591 (11%)
Query: 139 LEYLLMLDVDKLVWNFRKTARLPAP-----GEPYGGWEEPSCELRGHFVGHYLSASALMW 193
LEY L L + L + + R P G GWE L G Y+
Sbjct: 60 LEYQLKLAANGLTGHLDEVWRDVGPDNGWLGGSGDGWERGPYWLDGLVPLAYI------- 112
Query: 194 ASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFP--TEQFDRL---------EALIPVW 242
+++L +K + + Q+E GY P T FD E + W
Sbjct: 113 --LKDKTLIKKAKKWIEYILTHQQE--DGYFGPLPDSTRVFDNTKWGRRQAWQEKVKQDW 168
Query: 243 APYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEA 302
P+ + K++ TY + + R+ +M YF +++N IK+ ++ +W +
Sbjct: 169 WPHMIVLKVMQ------TYYEATQDERVLDFMRRYFQYQMKN-IKEKPLD-YWTHWAKSR 220
Query: 303 GGMNDV-LYKLFCITQDPKHLMLAHLFDKPCFLGLLALQA---DDISGFHSNTHIPIV-I 357
GG N +Y L+ T D L L + + ++ D + NT + I
Sbjct: 221 GGENLASIYWLYNHTGDAFLLDLGKIIFEQTLDWTQRFESANPQDWNWHGVNTAMGIKQP 280
Query: 358 GSQMRYEVTGDQLHKEGHQLESSGTNIGH-FNFKSDPKRLASNLDSNTEESCTTYNMLKV 416
G +Y L +E + G + + + LA ESCT +
Sbjct: 281 GVWYQYSKDERYLKAVKTGIEKLMKHHGQVYGLWAADELLAGKDPVRGTESCTVVEYMFS 340
Query: 417 SRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTP- 475
+ + + + Y D ER N + + Y LA +R +H++ T
Sbjct: 341 LETMLQISGDAEYGDILERVALNALPAFLKPGHTARQYY--QLANQVICDRGWHNFSTKH 398
Query: 476 ---------SDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIV 526
+ CC + + K ++++ + G+ + Y S + + ++
Sbjct: 399 GETELLFGLETGYGCCTANYHQGWPKYVMNLWYATQDN--GLAALVYAPSEV---TARVA 453
Query: 527 VNQKVDPVVSWDPYLRVTLTFSSKGS-GLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSP 585
N +V V D + + F K S G+ +LRIP W + A +NG+ P
Sbjct: 454 DNVEVTFVEETDYPFKERIKFICKKSNGVAFPFHLRIPEW--CDNAVVFVNGKVYGKPQA 511
Query: 586 GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDIT 645
G+ VT+ W D L + LP+ +R + A+ GP V A +W
Sbjct: 512 GSITKVTRRWKKGDVLELYLPMKIRISYW------FQRSAAVERGPLVFALGLNEEWKKI 565
Query: 646 ESATSLSDWITPIPASYNSQLITFTQEYGNTKFVL---TNSNQSITMEKFP 693
+D+ +N L+ ++ +T F++ T NQ T++ P
Sbjct: 566 GGKEPYADYEVLPKDPWNYGLLRNYVDHPDTTFIVKEFTVKNQPWTLKNAP 616
>gi|237808692|ref|YP_002893132.1| hypothetical protein Tola_1947 [Tolumonas auensis DSM 9187]
gi|237500953|gb|ACQ93546.1| protein of unknown function DUF1680 [Tolumonas auensis DSM 9187]
Length = 655
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/216 (21%), Positives = 88/216 (40%), Gaps = 15/216 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + Y D ER+L N VL G+ + + L +
Sbjct: 331 DTAYTETCASIGLMMFANRMLQLDTNSKYGDVMERALYNTVLAGMALDGKHFFYVNPLEV 390
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P S + + P+ W CC +G+ IY K GV + YI
Sbjct: 391 HPKSIQHNHIYDHVKPTRQQWFGCACCPPNIARIIGSIGNYIY---SIKDDGVLVNLYIG 447
Query: 516 SR--LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
++ ++ GQ+++ Q + W +++ + S L T + LRIP W S
Sbjct: 448 NKTHIELPQGQLLLEQNGN--YPWQDSIQIDV---SPTMPLRTKIALRIPDWCHSPILFI 502
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTL 609
Q+L + + + W + D++ + LP+ +
Sbjct: 503 NDQQQELESIISQGYAEIDRIWKAGDRIRLSLPMDV 538
>gi|333994236|ref|YP_004526849.1| hypothetical protein TREAZ_1028 [Treponema azotonutricium ZAS-9]
gi|333736667|gb|AEF82616.1| conserved hypothetical protein [Treponema azotonutricium ZAS-9]
Length = 675
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 53/239 (22%), Positives = 92/239 (38%), Gaps = 14/239 (5%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + + +R + + ++AD E +L NG++ G+ + + L +
Sbjct: 352 DTVYAETCASIGLAFFARRMLSIAPKGSFADVLETALYNGIISGMSLDGKSFFYVNPLEV 411
Query: 460 AP-GSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYI 514
P + K+R H ++ CC S LG IY ++ Y ++I
Sbjct: 412 IPEANEKDRIRRHVKGVRQKWFACACCPPNLARIISSLGSYIYSVKDNALYTHLFIGSTA 471
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
++L K V K++ W+ +RV F G G R+P W S
Sbjct: 472 KAQLSGKE----VTVKLETSYPWEEKVRV--DFQVPGEGAKFDYAFRLPGWCRS--CSVE 523
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LNG + +++ W S D L+I + + E + AI GP V
Sbjct: 524 LNGAKADYKKADGYAIISREWKSGDSLSIVFDMPVNFVEANPKVRENSGKLAITRGPVV 582
>gi|257413449|ref|ZP_05591656.1| putative cytoplasmic protein [Roseburia intestinalis L1-82]
gi|257203499|gb|EEV01784.1| putative cytoplasmic protein [Roseburia intestinalis L1-82]
Length = 523
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 77/174 (44%), Gaps = 17/174 (9%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D N ESC + + + + TK+ YAD E++L N VL GI + + L +
Sbjct: 329 DRNYSESCASIGLAMFGNRMAQITKDAKYADIVEKALYNTVLAGIAMDGKSFFYVNPLEV 388
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + ER+ P W CC + + LG IY +E +YI YIS
Sbjct: 389 WPDNCIERTSMEHVKPVRQKWFGVACCPPNIARTLASLGQYIYGADEN---SLYINLYIS 445
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLR---VTLTFSSKGSGLTTSLNLRIPTWT 566
S+ ++++ + V+ +L+ VT+ S+ + T L LRIP +T
Sbjct: 446 SQT-----KLLIGETETEVIMESSFLKDGTVTVHLESEKASKGT-LALRIPGYT 493
>gi|212715353|ref|ZP_03323481.1| hypothetical protein BIFCAT_00247 [Bifidobacterium catenulatum DSM
16992 = JCM 1194]
gi|212661728|gb|EEB22303.1| hypothetical protein BIFCAT_00247 [Bifidobacterium catenulatum DSM
16992 = JCM 1194]
Length = 727
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 62/287 (21%), Positives = 107/287 (37%), Gaps = 24/287 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ ESC + +R + + YAD E +L N L G+ + + L +
Sbjct: 369 DTAYSESCAAIALAFFARRMLEIQPKSEYADVMESALYNTTLAGMALDGKSFFYVNPLEV 428
Query: 460 APGSS--KERSYHHWGTPSDSFW----CC---YGTGIESFSKLGDSIYFEEEGKYPGVYI 510
P + ER +H P W CC +ES + ++ + Y +Y+
Sbjct: 429 VPEACHRDERKFH--VKPVRQKWFGCACCPPNIARMVESVQQYAYTVADDASTLYVHLYM 486
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT---SLNLRIPTWTS 567
+S++L G V+ +V + W+ +T+T S G +L LR+P W
Sbjct: 487 GGVVSAKL----GGSDVSLEVRAGMPWNGAGAITVTLPSSDEGQVPESFALALRLPAWAG 542
Query: 568 SNGAKATLNG-----QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYA 622
A +++ + + +L +T TW D + P+ +R A E A
Sbjct: 543 GESAADSIHATGEKDSRITRTTRDGYLYLTGTWRDGDVIDFDFPMPVRMIAANPLVREDA 602
Query: 623 SIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITF 669
A + GP + D + ++ I P S ITF
Sbjct: 603 GKVAFIRGPLAYCAEGTDNGDNLHLLHADAETIAADPDSVKVNEITF 649
>gi|227509160|ref|ZP_03939209.1| conserved hypothetical protein, partial [Lactobacillus brevis
subsp. gravesensis ATCC 27305]
gi|227191367|gb|EEI71434.1| conserved hypothetical protein [Lactobacillus brevis subsp.
gravesensis ATCC 27305]
Length = 106
Score = 49.7 bits (117), Expect = 0.007, Method: Composition-based stats.
Identities = 35/102 (34%), Positives = 49/102 (48%), Gaps = 17/102 (16%)
Query: 177 LRGHFVGHYLSASALMWASTHNE----SLKEKMSAVVSALSACQKEIG------SGYLSA 226
RGHF GHYLSA + S ++ L K+ + L Q+ +GY+SA
Sbjct: 1 FRGHFFGHYLSALSQAIDSVSDDDTRSQLLSKLRIGIEGLFRAQQAYAKSHPQSAGYVSA 60
Query: 227 FPTEQFDRLEA-LIP------VWAPYYTIHKILAGLLDQYTY 261
F D +E +P V P+Y +HKILAGL+D Y +
Sbjct: 61 FREVALDEVEGKRVPESEKENVIVPWYNLHKILAGLIDGYEH 102
>gi|338212418|ref|YP_004656473.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336306239|gb|AEI49341.1| protein of unknown function DUF1680 [Runella slithyformis DSM
19594]
Length = 618
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 55/231 (23%), Positives = 95/231 (41%), Gaps = 19/231 (8%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAP-GS 463
E+C + M+ ++ + ++ E Y D ERSL NG L G+Q + Y+ PLA G
Sbjct: 331 ETCASVGMVFWNQRMNLYSGEAKYVDVLERSLYNGALAGVQ--LTGNLFFYVNPLASFGL 388
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
R ++ GT CC +G IY E +++ Y+ S + G
Sbjct: 389 HHRRPWY--GTA-----CCPSNVSRLMPSVGGYIYNTSENT---LWVNLYVGSETEVMLG 438
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDL-PL 582
V W + + S + +L LRIP W + +NG+ + L
Sbjct: 439 NHKVKFAKKTNYPWAGEVEIKAIPDSSKADF--ALKLRIPAWCDKYTVE--INGKPVEKL 494
Query: 583 PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+++V +TW+ +D L +++ + ++ A +AI GP V
Sbjct: 495 TVDKGYVTVARTWAKNDVLKLRMDMPVKVVAADPRVKANEGKRAIQRGPLV 545
>gi|257067398|ref|YP_003153653.1| hypothetical protein Bfae_01840 [Brachybacterium faecium DSM 4810]
gi|256558216|gb|ACU84063.1| uncharacterized conserved protein [Brachybacterium faecium DSM
4810]
Length = 643
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 58/238 (24%), Positives = 100/238 (42%), Gaps = 22/238 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL---APG 462
E+C + S L+ T + YAD+ ER L N V+ + + Y PL PG
Sbjct: 332 ETCAGIAAIMFSWRLYLATGGVEYADFIERVLYN-VVAVSPSPDGRAFFYSNPLHQREPG 390
Query: 463 SSKERSYHHWGTPS-DSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
S S + S + W CC + + + DS + +G+ G+ ++QY S
Sbjct: 391 DSASSSVNMRAEGSTRAPWFDVSCCPTNVARTLASV-DSFFAATDGE--GLTLLQYASGT 447
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ + V+ + + + LT T L LR+P+W ++GA T+
Sbjct: 448 YRTPALTVAVHTE------YPAQGAIALTVLDAAEDPAT-LRLRVPSW--ADGAALTVGS 498
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+ + +PG + VT+TW + +++ + LP+ R A+ GP VLA
Sbjct: 499 EPVRTVTPG-WSEVTRTWRAGERVLLDLPVVPRFSWPHPRIDAVRGTVAVERGPLVLA 555
>gi|160932013|ref|ZP_02079405.1| hypothetical protein CLOLEP_00846 [Clostridium leptum DSM 753]
gi|156869055|gb|EDO62427.1| hypothetical protein CLOLEP_00846 [Clostridium leptum DSM 753]
Length = 643
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 47/238 (19%), Positives = 94/238 (39%), Gaps = 13/238 (5%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + AY D E++L NGVL G+ + + L +
Sbjct: 324 DTAYAETCAAVAVCFFAQRMMKISPSGAYGDVLEQALYNGVLSGMALDGKSFFYVNPLEV 383
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + ++ P W CC F+ +G ++F + +Y Y++
Sbjct: 384 VPEACQKDQRKKHVKPIRQKWFACACCPPNLARLFASIGGYLHFI---RAETLYTNLYVT 440
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S ++ + + +D +D + ++L+ + S +RIP W + +
Sbjct: 441 STSEFTFQGLPIKLHMDSAYPFDEKIHISLSLPRP---MEFSYAVRIPAWCADY--HVLI 495
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
NG+ FL + + W D++ + L + +R E AI GP V
Sbjct: 496 NGKICAGTLKDGFLYLHRCWRDGDEVELTLSMPVRVVRANSLVRENIGKSAICRGPIV 553
>gi|198274386|ref|ZP_03206918.1| hypothetical protein BACPLE_00531 [Bacteroides plebeius DSM 17135]
gi|198272752|gb|EDY97021.1| hypothetical protein BACPLE_00531 [Bacteroides plebeius DSM 17135]
Length = 821
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 61/279 (21%), Positives = 111/279 (39%), Gaps = 34/279 (12%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + +F T + Y D ER+L NGV+ G+ + Y PL
Sbjct: 350 ETCAAIANVYWNYRMFLATGDSKYVDVLERALYNGVISGVSLSGDK--FFYDNPLESMGE 407
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER W + CC G + + Y ++ +Y+ YI + + ++
Sbjct: 408 HER--QRWFGCA----CCPGNVTRFMASVPSYAYATQQND---IYVNLYIQGKAEMQTAD 458
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
V + W+ + + +T +G ++ LRIP WT ++ AK
Sbjct: 459 NKVTLEQTTEYPWNGKVTIKVTPEKEGK---FAIRLRIPGWTKAAPVASDLYAYTDAAKK 515
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYG 630
+NG + ++ +TW + D + +++P+ +R D + A+ G
Sbjct: 516 YTLKVNGSATRGAEGDGYETIVRTWKAGDVIELEMPMDVRRIKANDKVEVDRGMVALERG 575
Query: 631 P--YVLAGHSIGDWDITESATSLSDWITPIPASYNSQLI 667
P + L G D I + +D TPI ASY++ L+
Sbjct: 576 PIMFCLEGKDQPD-SIVFNKFIPND--TPIEASYDANLL 611
>gi|297545103|ref|YP_003677405.1| hypothetical protein Tmath_1689 [Thermoanaerobacter mathranii
subsp. mathranii str. A3]
gi|296842878|gb|ADH61394.1| protein of unknown function DUF1680 [Thermoanaerobacter mathranii
subsp. mathranii str. A3]
Length = 648
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 50/241 (20%), Positives = 102/241 (42%), Gaps = 17/241 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C ++ + + + + YAD ER+L N V+ G+ + + L +
Sbjct: 326 DTVYAETCAAIGLVFFAHRMLQIDPDRRYADVMERALYNSVISGMSLDGKKYFYVNPLEV 385
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + ++ + W CC + LG IY + + +Y+ Y+
Sbjct: 386 WPEACEKNKVKAHVKYTRQPWFKCACCPPNLARLLASLGKYIYSIRDNE---LYVHLYVD 442
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
S + K + V + + WD + + + + L +L LRIP W AK ++
Sbjct: 443 SEVQTKISENEVKVRQETEYPWDGRIVINILPERE---LDFTLALRIPGWCKD--AKVSV 497
Query: 576 NGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLT-LRTEAIQDDRPEYASIQAILYGPY 632
NG+++ + + + + W D++ + L +T +R +A + R + + AI GP
Sbjct: 498 NGEEIDISGIMDKGYAKIKRLWKPGDRIELLLSMTVMRVKANPNVREDEGRV-AIQRGPV 556
Query: 633 V 633
+
Sbjct: 557 I 557
>gi|284036949|ref|YP_003386879.1| hypothetical protein Slin_2035 [Spirosoma linguale DSM 74]
gi|283816242|gb|ADB38080.1| protein of unknown function DUF1680 [Spirosoma linguale DSM 74]
Length = 678
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 90/426 (21%), Positives = 173/426 (40%), Gaps = 54/426 (12%)
Query: 211 ALSACQKEIGSGYLSAFPTE---QFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEA 267
A+++ Q G L+ +P E Q D + W P + KIL QY A +
Sbjct: 130 AINSQQSNGYFGPLTDYPQEAGVQRDNCQD----WWPKMVMLKIL----KQYYSATQDQ- 180
Query: 268 LRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN-DVLYKLFCITQDPKHLMLAH 326
R+ M YF +++ + K+ ++ HW GG N V+Y L+ T D L LA
Sbjct: 181 -RVIKLMTNYFKYQLRE-LPKHPLD-HWTFWARYRGGDNLMVVYWLYNHTGDAFLLQLAD 237
Query: 327 LFDKPCFLGLLALQADDISGFHSNTH-IPIVIGSQ---MRYEVTGDQLHKEGHQLESSGT 382
L K F + ++ + H + + G + + Y+ DQ + + ++
Sbjct: 238 LLHKQTFDYTNSFLNTNLLSQQGSIHCVNLAQGFKEPLIYYQQHPDQKYVKA--VDKGLA 295
Query: 383 NIGHFN-----FKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER-- 435
++ HFN + L N + E C+ M+ + T +AYAD E+
Sbjct: 296 DLRHFNGMAHGLYGGDEALHGNNPTQGSELCSAVEMMFSLESMLNITGRVAYADQLEKIA 355
Query: 436 ------SLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSY--HHWGTPS-----DSFWCC 482
+T+ +G Q + ++ + R++ +H GT + CC
Sbjct: 356 FNALPAQVTDDFMGRQYFQQANQVML-------TRHVRNFDQNHGGTDVCMGLLTGYPCC 408
Query: 483 YGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK-SGQIVVNQKVDPVVSWDPYL 541
+ + K ++++ K G+ + + S ++ + +G V + +D +
Sbjct: 409 TSNMHQGWPKFTQNLWYATPDK--GLAALVFSPSEVNAQVAGGNAVTFTEETNYPFDETI 466
Query: 542 RVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKL 601
+ TLT + + L ++RIP W + A T+NG+ + ++V ++W S D +
Sbjct: 467 KFTLTTDKQATSLAFPFHMRIPAWCTK--ATITVNGRVWKETTGNQIVTVNRSWKSGDVV 524
Query: 602 TIQLPL 607
+ LP+
Sbjct: 525 ELHLPM 530
>gi|451817780|ref|YP_007453981.1| hypothetical protein Cspa_c09510 [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451783759|gb|AGF54727.1| hypothetical protein Cspa_c09510 [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 662
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/238 (24%), Positives = 99/238 (41%), Gaps = 21/238 (8%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA--PGS 463
E+C + + + + + YAD E +L N ++G + Y+ PL P +
Sbjct: 344 ETCASVGLAFFAHRMLMIEPKSEYADVMESALYNTIIG-GMAQDGKSFFYVNPLEVNPEA 402
Query: 464 SKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSRL 518
++ H P W CC + + LG IY EE Y +YI S L
Sbjct: 403 CEKNPTKHHVKPRRQKWFTCACCPPNITRTLTSLGQYIYTVNEETIYTNLYIGGEASISL 462
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+I + Q+ D W +++ + F+ + T L LRIP+W AK +N Q
Sbjct: 463 --ADNEIKLIQETD--YPWKEEIKIKV-FTEEEIKFT--LALRIPSWCPE--AKIKVNNQ 513
Query: 579 --DLPLPSPGNFLSVTKTWSSDDKLTIQLPL-TLRTEAIQDDRPEYASIQAILYGPYV 633
D+ + + + + W + D++ + L + LR +A R + + AI GP V
Sbjct: 514 VVDIEERTLNGYAMINREWKASDEIVLILKMPILRMKANPLVRADIGKV-AIQRGPLV 570
>gi|258512866|ref|YP_003186300.1| hypothetical protein Aaci_2907 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius DSM 446]
gi|257479592|gb|ACV59911.1| protein of unknown function DUF1680 [Alicyclobacillus
acidocaldarius subsp. acidocaldarius DSM 446]
Length = 659
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 49/239 (20%), Positives = 96/239 (40%), Gaps = 18/239 (7%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG--IQRGTEPGVMIYLLPLA--P 461
E+C + ++ ++ + YAD ER+L N V+G Q G Y+ PL P
Sbjct: 334 ETCASVGLIFFAKRMLELAPRSEYADVMERALYNTVIGSMAQDGKH---YCYVNPLEVWP 390
Query: 462 GSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
+++E P+ W CC LGD +Y E + +Y+ +I S
Sbjct: 391 RANEENPDRRHVRPTRQAWFGCACCPPNVARLLMSLGDYVYSWHEA-HRTLYVHLHIGSS 449
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
++W + + W + + ++ S ++ +RIP W + +NG
Sbjct: 450 VEWDLDGSRAQVALASSLPWRGEMSLRMSVSHGPRRF--AIAVRIPGWCAGK-PSVRVNG 506
Query: 578 QDLP---LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
Q L + + + + +++ D++ ++ P+ R + + + AI GP V
Sbjct: 507 QPLARSEVCMENGYAVIEREFANGDEVALEFPMEARWVVGHPELRAVSGMVAIERGPLV 565
>gi|332882008|ref|ZP_08449643.1| hypothetical protein HMPREF9074_05437 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357048166|ref|ZP_09109720.1| hypothetical protein HMPREF9441_03769 [Paraprevotella clara YIT
11840]
gi|332679932|gb|EGJ52894.1| hypothetical protein HMPREF9074_05437 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355528749|gb|EHG98227.1| hypothetical protein HMPREF9441_03769 [Paraprevotella clara YIT
11840]
Length = 818
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 62/249 (24%), Positives = 98/249 (39%), Gaps = 41/249 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + Y D ER+L NGV+ G+ + Y PL
Sbjct: 341 ETCASIANVYWNHRMFLATGDSRYEDVLERALYNGVISGVSLSGD--RFFYDNPLESMGQ 398
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS--RLDWKS 522
ER W + CC G + + + +Y +GK V++ YI S L
Sbjct: 399 HER--QAWFGCA----CCPGNVTRFMASVPNYMY-ATQGK--DVFVNLYIQSTAHLSTSQ 449
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT--------------SS 568
+I + Q D WD +R+T+ K T +L RIP W
Sbjct: 450 NKIEIRQTTD--YPWDGKIRMTVHPEKK---QTFALRCRIPGWAQDRPVPTDLYHYTGKG 504
Query: 569 NGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTL-RTEA---IQDDRPEYASI 624
G +NG+D + + + W D + + P+ + R EA ++DDR +
Sbjct: 505 KGYTIQVNGKDAEFRVENGYAVILRKWKKGDTVQLDFPMDVRRVEARGEVEDDRGK---- 560
Query: 625 QAILYGPYV 633
AI GP V
Sbjct: 561 AAIERGPIV 569
>gi|302809111|ref|XP_002986249.1| hypothetical protein SELMODRAFT_425170 [Selaginella moellendorffii]
gi|300146108|gb|EFJ12780.1| hypothetical protein SELMODRAFT_425170 [Selaginella moellendorffii]
Length = 192
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 12/74 (16%)
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPV 241
GHYLSA+A +WASTHN +K++M A+V+ L+ CQ + S P F L
Sbjct: 7 AGHYLSATAKLWASTHNAEVKKRMDALVNILAECQ---AASRKSELPVNLFQFLS----- 58
Query: 242 WAPYYTIHKILAGL 255
+ +I+AGL
Sbjct: 59 ----LELFQIMAGL 68
>gi|256394126|ref|YP_003115690.1| hypothetical protein Caci_4989 [Catenulispora acidiphila DSM 44928]
gi|256360352|gb|ACU73849.1| protein of unknown function DUF1680 [Catenulispora acidiphila DSM
44928]
Length = 647
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 51/234 (21%), Positives = 90/234 (38%), Gaps = 24/234 (10%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMI------YLLPL 459
E+C ++ + + T E Y+D ER+L N VL PGV + Y PL
Sbjct: 329 ETCAAIASVQWNWRMALLTGEAKYSDLAERTLYNAVL-------PGVSLDGTRWFYANPL 381
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
+ G +++ C L ++ G G+ + QY + +
Sbjct: 382 QVRDEHLDRHGDHGVSRKAWFRCACCPPNVMRLLASLPHYFVSGDADGIQLHQYATGSYE 441
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
+G + +V+ W + VT+ G +L+LR+P W + +A +NG
Sbjct: 442 AVAGTV----RVETGYPWSGGIAVTIE-----RGGEWTLSLRVPGWCAD--VEAGVNGVA 490
Query: 580 LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+ P +L + + W D +++ L + +R A AI GP V
Sbjct: 491 VDTVVPDGWLRIRRAWQPGDVVSLNLAMPIRLTAADPRVDAVRGCAAIERGPLV 544
>gi|340619112|ref|YP_004737565.1| hypothetical protein zobellia_3147 [Zobellia galactanivorans]
gi|339733909|emb|CAZ97286.1| Conserved hypothetical periplasmic protein [Zobellia
galactanivorans]
Length = 681
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 70/160 (43%), Gaps = 14/160 (8%)
Query: 478 SFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD---WKSGQIVVNQKVDPV 534
S +CC I + +K+ Y E G+++ Y S+ LD I + Q+ +
Sbjct: 438 SVFCCPPNIIRTIAKMHTYAYSTSE---KGIWVNLYGSNVLDTDLADGSNIKLTQESN-- 492
Query: 535 VSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTK 593
WD +++T+ K +L LRIP W + GA +NG+ P G++ V +
Sbjct: 493 YPWDGNIKITIDSKKKKE---YALMLRIPAW--AEGANIKVNGEKQDQSPKAGSYAEVNR 547
Query: 594 TWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
W D + ++LP+ R + E + A+ GP V
Sbjct: 548 KWKKGDVVELELPMAPRLITADPNVEETRNQVAVKRGPIV 587
>gi|315647722|ref|ZP_07900823.1| hypothetical protein PVOR_20464 [Paenibacillus vortex V453]
gi|315276368|gb|EFU39711.1| hypothetical protein PVOR_20464 [Paenibacillus vortex V453]
Length = 621
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 43/178 (24%), Positives = 75/178 (42%), Gaps = 14/178 (7%)
Query: 478 SFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ-IVVNQKVDPVVS 536
+F CC + + KL ++ ++ + G+ + Y + GQ + V +V
Sbjct: 361 NFGCCTANMHQGWPKLTSHLWMKD--REEGLAAVSYAPCTVRTTVGQGVAVVVEVRGEYP 418
Query: 537 WDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWS 596
+ +++ L+ S L+LRIP W + TLNG L + + + W
Sbjct: 419 FKDRVQIKLSLERPES---FPLSLRIPAWC--DHPVITLNGHKLEFQVTSGYARLVQNWQ 473
Query: 597 SDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDW 654
S D+L I LP+ +RT + R YA+ +I GP V +W + + DW
Sbjct: 474 SGDRLDIHLPMEVRTSS----RSMYAA--SIERGPLVYVLPVKENWQMIQQRDMFHDW 525
>gi|218291237|ref|ZP_03495221.1| protein of unknown function DUF1680 [Alicyclobacillus
acidocaldarius LAA1]
gi|218238839|gb|EED06050.1| protein of unknown function DUF1680 [Alicyclobacillus
acidocaldarius LAA1]
Length = 659
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 52/239 (21%), Positives = 98/239 (41%), Gaps = 18/239 (7%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG--IQRGTEPGVMIYLLPLA--P 461
E+C + ++ ++ + + + YAD ER+L N V+G Q G Y+ PL P
Sbjct: 334 ETCASVGLIFFAKRMLDLSPKAEYADVIERALYNTVIGSMAQDGKH---YCYVNPLDVWP 390
Query: 462 GSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
+++E P+ W CC LGD +Y E + +Y+ +I S
Sbjct: 391 RANEENPDRRHVRPTRQAWFGCACCPPNVARLLMSLGDYVYSWHEA-HRTLYVHLHIGSN 449
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ W+ + W +L S G ++ +RI W + A +NG
Sbjct: 450 VAWELDGSRAQVAQASGLPWRG--ETSLCVSIAGEPRRFAIAVRILGWCAREPA-IRVNG 506
Query: 578 QDLP---LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
Q L + + ++ + +++ D++ ++LP+ R + + + AI GP V
Sbjct: 507 QPLAQTDVRMEDGYAAIEREFANGDEVVLELPMAARFVVSHPELRATSGMVAIERGPLV 565
>gi|154495096|ref|ZP_02034101.1| hypothetical protein PARMER_04143 [Parabacteroides merdae ATCC
43184]
gi|423725062|ref|ZP_17699202.1| hypothetical protein HMPREF1078_03096 [Parabacteroides merdae
CL09T00C40]
gi|154085646|gb|EDN84691.1| hypothetical protein PARMER_04143 [Parabacteroides merdae ATCC
43184]
gi|409235418|gb|EKN28236.1| hypothetical protein HMPREF1078_03096 [Parabacteroides merdae
CL09T00C40]
Length = 679
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 92/437 (21%), Positives = 172/437 (39%), Gaps = 41/437 (9%)
Query: 197 HNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLL 256
++++L EK+ + A QK +GY P D L A + ++ ++
Sbjct: 113 NDQALIEKVQPWIEWTLASQKP--NGYFG--PDTDRDYEPGLQRNNAQDWWPKMVMLKVM 168
Query: 257 DQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN-DVLYKLFCI 315
QY A + R+ +M YF ++ + K + W E+ GG N V+Y L+ I
Sbjct: 169 QQYYTA--TQDRRVIDFMTRYFRYQLDELPK--NPLGKWTFWGEQRGGDNLMVVYWLYNI 224
Query: 316 TQDPKHLMLAHLFDKPCF-LGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG 374
T D L L L K F + L + + HS + + G + + Q K+
Sbjct: 225 TGDKFLLDLGELIHKQTFNWTDIFLNQNHLRRQHSLHCVNLAQG--FKEPIVYYQQGKDS 282
Query: 375 HQLESSGTNIGHFNFK--------SDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE 426
Q++++ + + L + E CT M+ + T +
Sbjct: 283 KQIQATRQAVNDIRHTIGLPTGLWGGDELLRFGKPTTGSELCTAVEMMYSLETILEVTGD 342
Query: 427 IAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSD--------- 477
+ +ADY ER N L Q + Y + R + + TP D
Sbjct: 343 MQWADYLERVAYNA-LPTQVTDDYSARQYYQQTN-QIAVTREWREFSTPHDDTDLLFGEL 400
Query: 478 -SFWCCYGTGIESFSKLGDSIYF--EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPV 534
+ CC + + K ++++ + G ++ +++R+ +G I VN K +
Sbjct: 401 TGYPCCTSNLHQGWPKFVQNLWYATADNGLASLLFAPSQVTARV---AGGIEVNLKEETA 457
Query: 535 VSWDPYLRVTLTFSSKG-SGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVT 592
++ +R ++F+ K + +LRIP W K LNG+ L + + PG +
Sbjct: 458 YPFEETVRYHVSFTDKKVKKVFFPFHLRIPGWCKQPVVK--LNGKPLTVDAYPGTVTRIN 515
Query: 593 KTWSSDDKLTIQLPLTL 609
+ W D L+++LP+ +
Sbjct: 516 REWKEGDILSLELPMEV 532
>gi|421589478|ref|ZP_16034616.1| hypothetical protein RCCGEPOP_11663, partial [Rhizobium sp. Pop5]
gi|403705566|gb|EJZ21118.1| hypothetical protein RCCGEPOP_11663, partial [Rhizobium sp. Pop5]
Length = 299
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 90/210 (42%), Gaps = 22/210 (10%)
Query: 429 YADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSSKER-SYHHWGTPSDSFWCCYGTG 486
YAD E++L NG L G+ T+ Y PL R +HH P CC
Sbjct: 16 YADIMEQALYNGALPGLS--TDGKTFFYDNPLESAGKHHRWKWHH--CP-----CCPPNI 66
Query: 487 IESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG-QIVVNQKVDPVVSWDPYLRVTL 545
+ +G +Y + + V++ ++RL +G ++ + Q + WD + T
Sbjct: 67 ARLVTSIGSYMYAVADDEI-AVHLYGESTARLKLANGAEVELEQATN--YPWDGAVAFTT 123
Query: 546 TFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ--DLPLPSPGNFLSVTKTWSSDDKLTI 603
+ +L+LRIP W + GA ++NG DL + + + W+ D++ +
Sbjct: 124 RLTKPAR---FALSLRIPDW--AEGATLSVNGAMLDLGAHVRDGYARINREWADGDRVAL 178
Query: 604 QLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
LPL LR + + A A++ GP V
Sbjct: 179 YLPLALRPQYANPKVRQDAGRVALMRGPLV 208
>gi|218260014|ref|ZP_03475493.1| hypothetical protein PRABACTJOHN_01154 [Parabacteroides johnsonii
DSM 18315]
gi|218224797|gb|EEC97447.1| hypothetical protein PRABACTJOHN_01154 [Parabacteroides johnsonii
DSM 18315]
Length = 816
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 55/255 (21%), Positives = 98/255 (38%), Gaps = 33/255 (12%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + ++ +F T + Y D ER+L NGV+ G+ + Y PL
Sbjct: 338 ETCASIANVYWNQRMFLATGDAKYIDVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 395
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER+ P CC G + + +Y + +Y+ Y+ S
Sbjct: 396 HERA------PWFGCACCPGNVTRFMASVPKYMYATQGNS---LYVNLYVGSESRVALAN 446
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT---------- 574
V D WD +++T++ K S SL LRIP+WT + +
Sbjct: 447 DTVTLVQDTEYPWDGLVKLTVS-PRKASSF--SLKLRIPSWTGNEPVPGSDLYTYIKRDR 503
Query: 575 ------LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAIL 628
+NG L + ++ + + W D + +++P+ +R + + A+
Sbjct: 504 EPCAVFVNGTPLKEKAHHGYVVIEREWEPGDVIELRMPMDVRRVKAHEKVRADQGLLAVE 563
Query: 629 YGP--YVLAGHSIGD 641
GP Y L G + D
Sbjct: 564 RGPVVYCLEGVDMPD 578
>gi|383110943|ref|ZP_09931761.1| hypothetical protein BSGG_2048 [Bacteroides sp. D2]
gi|313694513|gb|EFS31348.1| hypothetical protein BSGG_2048 [Bacteroides sp. D2]
Length = 684
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 60/116 (51%), Gaps = 11/116 (9%)
Query: 544 TLTFS-SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKL 601
++ FS S G +T LRIP+WT GA+ +NG+ + + P G +L + + WS+ D++
Sbjct: 463 SIAFSVSTGEKVTFPFYLRIPSWTK--GAEVRVNGKKVNVAPVAGKYLCIHREWSNGDRV 520
Query: 602 TIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA---GHSIGDWDITESATSLSDW 654
+ LP++L Q ++ + ++ YGP L+ + D E+A S W
Sbjct: 521 ELTLPMSLSMRTWQVNK----NSVSVDYGPLTLSLKIAEKYVEKDSRETAIGDSKW 572
>gi|393781505|ref|ZP_10369700.1| hypothetical protein HMPREF1071_00568 [Bacteroides salyersiae
CL02T12C01]
gi|392676568|gb|EIY70000.1| hypothetical protein HMPREF1071_00568 [Bacteroides salyersiae
CL02T12C01]
Length = 696
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 64/267 (23%), Positives = 107/267 (40%), Gaps = 39/267 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRG------TEPGVMIYLLP 458
E+C L + +F+ + Y D E L N +L GI T P + LP
Sbjct: 381 ETCANIGNLLFNWRMFQTSGNARYVDIVENCLYNSILSGISLDGKRYFYTNPLRISADLP 440
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
K+R T S +CC + + ++ + +Y + GV+ Y S L
Sbjct: 441 YTLRWPKQR------TEYISCFCCPPNTLRTLCEVQNYVYTLSD---EGVWCNLYGGSEL 491
Query: 519 D--WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
D W I + Q+ D WD + +TL + L SL LR+P W + KATL
Sbjct: 492 DTEWMGNHIQLLQETD--YPWDGAVSITLKEVPEKKPL--SLFLRVPEWCT----KATLA 543
Query: 577 GQDLPLPS---PGNFLSVTKTWSSDDKLTIQL---PLTLRTEAIQDDRPEYASIQAILYG 630
D+P+ + G + + + W D++ + P+ L + + + E + A+ G
Sbjct: 544 VNDVPVTTDLKAGTYAEIKRIWKKGDRVAFVMGMEPVLLESHPLVE---ETRNQVAVKRG 600
Query: 631 PYVLAGHSIGDWDITESATSLSDWITP 657
P V S+ E+ + D + P
Sbjct: 601 PVVYCLESMD----VEAGKRIDDILIP 623
>gi|325298731|ref|YP_004258648.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
gi|324318284|gb|ADY36175.1| protein of unknown function DUF1680 [Bacteroides salanitronis DSM
18170]
Length = 666
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 100/250 (40%), Gaps = 43/250 (17%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + V+ LF + + Y D ERSL NGVL GI + G Y PL
Sbjct: 335 ETCAAIGNVYVNHRLFLFHGDAKYYDVLERSLYNGVLSGIS--LDGGRFFYPNPLESAGG 392
Query: 465 KERSYHHWGTPSDSFWCCYGTGIES--FSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
ER S C + + ++ GDS+Y V + +S +
Sbjct: 393 YERKAWFGCACCPSNLCRFLPSVPGYMYATRGDSLY---------VNLFMEGTSEIQVGK 443
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGA 571
+I + Q+ +D +R+TL KGSG +R+P WT ++G
Sbjct: 444 RKISIRQQT--AYPFDGNIRLTL---QKGSG-EFVWKVRVPGWTRGEVVPGGLYRFADGK 497
Query: 572 KAT----LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYAS 623
+ + +NG+ + + S+++ W D + + +T R E ++ DR
Sbjct: 498 QTSYSVKVNGEKVEGSIEKGYFSISRRWKKGDVVEVSFDMTPRLVLADEKVEADR----G 553
Query: 624 IQAILYGPYV 633
+ AI GP V
Sbjct: 554 MLAIERGPLV 563
>gi|320161641|ref|YP_004174866.1| hypothetical protein ANT_22400 [Anaerolinea thermophila UNI-1]
gi|319995495|dbj|BAJ64266.1| hypothetical protein ANT_22400 [Anaerolinea thermophila UNI-1]
Length = 664
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 56/260 (21%), Positives = 101/260 (38%), Gaps = 38/260 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
E+C + + L + T + Y++ +E L N + G + +Y PL
Sbjct: 353 ETCAALASMFWNWELAQITGKARYSELFEWQLYNAA-SVGMGLDGTTYLYNNPLTCRGGV 411
Query: 466 ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK---- 521
ER P + CC +F+ LGD +Y + G+ +Y+ QY+SS L +
Sbjct: 412 ERR------PWYAVPCCPSNLSRTFAWLGDYLYSAKPGR---LYVHQYLSSDLPAQEIPC 462
Query: 522 --SGQIVVNQKVDPVVSWDPYLRVTLT---FSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
++ ++ ++D + W ++ + L + LR+P+W + + TLN
Sbjct: 463 ANGNRVRLSLQMDSQLPWHGHVVLRLRRWEVLDPDQPAPLEILLRLPSWAEN--PRLTLN 520
Query: 577 GQDLPL-----------------PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
GQ L L P FL +++ W+ D L ++ L +R
Sbjct: 521 GQPLFLQIPQPQQDGEPPADGYDPRQAVFLPLSQPWAEGDTLELRFDLPIRLRHAAPRLR 580
Query: 620 EYASIQAILYGPYVLAGHSI 639
A+ GP V S+
Sbjct: 581 SRRGKVAVTRGPLVYCAESL 600
>gi|291455931|ref|ZP_06595321.1| putative cytoplasmic protein [Bifidobacterium breve DSM 20213 = JCM
1192]
gi|291382340|gb|EFE89858.1| putative cytoplasmic protein [Bifidobacterium breve DSM 20213 = JCM
1192]
Length = 626
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 49/231 (21%), Positives = 94/231 (40%), Gaps = 7/231 (3%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG-VLGIQRGTEPGVMIYLLPL 459
D+ E+C + M ++ + + YAD E+ L NG + GI + + L
Sbjct: 305 DTMYGETCASVAMSMFAQQMLDLEPKGEYADVLEKKLFNGSIAGISLDGKQYYYVNALET 364
Query: 460 AP-GSSKERSYHHWGTPSDSFWC-CYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
P G + +H D F C C T I D + E V Q+I+++
Sbjct: 365 TPDGLANPDRHHVLSHRVDWFGCACCPTNIAQLIASVDRYIYTERDGGKTVLSHQFITNK 424
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
++ SG + V Q+ D W+ ++ T++ + + + LRIP W+ + A T+NG
Sbjct: 425 AEFASG-LTVEQRSD--FPWNGHVEYTVSLPASATDSSVRFGLRIPGWSLGSYA-LTVNG 480
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAIL 628
+ F+ + +L + + ++ D + A ++ +L
Sbjct: 481 KSAVAQPEDGFVYLMVNAGDTLELDMSVKFVRANSRVRSDAGQVAVMRGLL 531
>gi|383777558|ref|YP_005462124.1| hypothetical protein AMIS_23880 [Actinoplanes missouriensis 431]
gi|381370790|dbj|BAL87608.1| hypothetical protein AMIS_23880 [Actinoplanes missouriensis 431]
Length = 496
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 65/269 (24%), Positives = 103/269 (38%), Gaps = 46/269 (17%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLL-P 458
D E+C + +++ L T ++ YAD ER L NG+ G+ + G + P
Sbjct: 176 DRAYAETCASVASFQLAWRLLLATGDVRYADEMERVLLNGIAAGV---SADGTAFFTANP 232
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
L + R P CC + L + G G+ + Y S L
Sbjct: 233 LQARTGLTRQ------PPQPGACCPSAVSALMASLPGHV---ATGDNSGIQLHLYGSGAL 283
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
I V+ + WD + VT+T SS G +L LR P W + + T+NG
Sbjct: 284 RSADRAIDVSTRY----PWDEQITVTVTESS---GEPWTLALRAPAWCAD--LRLTVNGT 334
Query: 579 DLPLPSPG------NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
P+P +L + +TW D++T+ L + R A A++ GP
Sbjct: 335 ----PAPARRLVEKGYLRLHRTWHPGDQITLTLAMPARRVAAHPRVDATRGAAALVRGPL 390
Query: 633 V-------------LAGHSIGDWDITESA 648
V LAG ++ D ++ SA
Sbjct: 391 VYCLEQADLPVSGKLAGATVDDVELDPSA 419
>gi|326799752|ref|YP_004317571.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326550516|gb|ADZ78901.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
Length = 679
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 110/482 (22%), Positives = 184/482 (38%), Gaps = 93/482 (19%)
Query: 186 LSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGY----LSAFPTEQFDRLEALIPV 241
L A A ++A T + +L +KM V+ ++ Q+E G Y + T ++ E +
Sbjct: 110 LEAVASLYAVTKDPALDKKMDEVIKTIALSQREDGYIYTLSMIQQRKTGVKNQFEDRLSF 169
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEY---FYNRVQNVIKKYSI-ERHWQT 297
A Y I ++ Y L + +Y FY + + +I H+
Sbjct: 170 EA--YNIGHLMTAACVHYRATGKRNLLDVAIKATDYLYRFYKSASPTLARNAICPSHYMG 227
Query: 298 LNEEAGGMNDVLYKLFCITQDPKHLMLA-HLFDKPCFLGLLALQADDISGFHSNTHIPI- 355
+ E ++ D ++L LA HL D G + DD + IP
Sbjct: 228 VVE-----------MYRTLGDKRYLELAKHLID---IKGQIEDGTDD-----NQDRIPFR 268
Query: 356 ----VIGSQMR-----------YEVTGD-----QLHK-----EGHQLESSGTNIGHFNFK 390
V+G +R Y TGD QLHK H++ +G ++
Sbjct: 269 EQQKVMGHAVRANYLYAGVADVYAETGDTSLFNQLHKMWTDVTSHKMYITGGCGSLYDGV 328
Query: 391 S------DPKRLAS------------NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADY 432
S DPK + N ++ E NML R L T +AD
Sbjct: 329 SPDGTSYDPKEVQKIHQAYGRDYQLPNFTAHNETCANIGNMLWNWRMLLL-TGNAKFADV 387
Query: 433 YERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSFW----CCYGTGI 487
E +L N VL GI E +Y PLA S K W + CC +
Sbjct: 388 LELALYNSVLSGISLDGER--FLYTNPLA-YSDKLPFKQRWSKDRVPYIALSNCCPPNVV 444
Query: 488 ESFSKLGDSIY-FEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLT 546
+ +++ + Y +EG + +Y + + L G + + Q+ WD ++V +
Sbjct: 445 RTLAEVHNYFYSISDEGIWINLYGGSELKTSLP-NGGTVKLKQET--AYPWDGAIKVVVE 501
Query: 547 FSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDL-PLPSPGNFLSVTKTWSSDDKLTIQL 605
+ K SL LRIP W ++ A +NGQD+ + PG++ + + W D + +++
Sbjct: 502 EAVKDD---FSLFLRIPGW--ADQAMIQVNGQDVDKVLKPGSYTMIRRKWKKGDVVFLKM 556
Query: 606 PL 607
P+
Sbjct: 557 PM 558
>gi|372209243|ref|ZP_09497045.1| hypothetical protein FbacS_03931 [Flavobacteriaceae bacterium S85]
Length = 671
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 67/271 (24%), Positives = 111/271 (40%), Gaps = 26/271 (9%)
Query: 373 EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADY 432
+ H SS ++ H F ++ + NL + E N + S + E YAD
Sbjct: 324 QTHHGVSSHVDMVHEGFINE--YMMPNLTAYNETCANVCNSM-FSYRMLGLHGEAKYADV 380
Query: 433 YERSLTNGVL-GIQRGTEPGVMIYLLPLA-------PGSSKERSYHHWGTPSDSFWCCYG 484
E L N L GI E Y PL PG+ E P +CC
Sbjct: 381 MELVLFNSALSGIS--IEGKDYFYANPLRVSHKGHDPGNDTEFDMRR---PYIPCFCCPP 435
Query: 485 TGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRV 543
+ + +KL Y G +Y +++ L S +V Q P W+ +V
Sbjct: 436 NLVRTIAKLSGWAYSLTTNGVAVNLYGGNKLTTTLLDGSKLELVQQSGYP---WNG--KV 490
Query: 544 TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP-SPGNFLSVTKTWSSDDKLT 602
TL K + +R+P W + G++ +NG+ + LP G+++++ + WS +DK+T
Sbjct: 491 TLIIK-KAKKEAFDIKIRVPEW--AKGSQIQINGKAVSLPVKAGSYVTLHQKWSKNDKIT 547
Query: 603 IQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+Q+P+ ++ E + AI GP V
Sbjct: 548 LQMPMEIKLLEGNPLIEEVRNQIAIKRGPVV 578
>gi|336251952|ref|YP_004585920.1| hypothetical protein Halxa_0515 [Halopiger xanaduensis SH-6]
gi|335339876|gb|AEH39114.1| protein of unknown function DUF1680 [Halopiger xanaduensis SH-6]
Length = 636
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 92/213 (43%), Gaps = 25/213 (11%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLP 458
D+ E+C + +R +F T + YAD ER+L NG L G+ GTE Y
Sbjct: 330 DTAYAETCAAIGSVFWNRRMFELTGDAKYADLIERTLYNGFLAGVSLDGTE---FFYDNR 386
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
L S R W + CC F+ L +Y + + +Y+ QY+ S
Sbjct: 387 LESDGSHGR--QGWFDCA----CCPPNVARLFASLERYLYTVDGRE---LYVNQYVESTA 437
Query: 519 --DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
++ V Q D WD VT+ + T ++LR+P W A +N
Sbjct: 438 TPTVDDAELEVAQTTD--YPWDS--EVTIDVEAPEPTQAT-ISLRVPEWCDE--ASIEVN 490
Query: 577 GQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTL 609
G+ +P+ G ++S+ +TW DD++T +++
Sbjct: 491 GEPIPVDGDG-YVSLERTW-DDDRITATFEMSV 521
>gi|322690403|ref|YP_004219973.1| hypothetical protein BLLJ_0211 [Bifidobacterium longum subsp.
longum JCM 1217]
gi|320455259|dbj|BAJ65881.1| conserved hypothetical protein [Bifidobacterium longum subsp.
longum JCM 1217]
gi|346706304|dbj|BAK79118.1| beta-L-arabinofuranosidase [Bifidobacterium longum subsp. longum]
Length = 658
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 103/248 (41%), Gaps = 19/248 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG-VLGIQRGTEPGVMIYLLPL 459
D+ E+C + M ++ + + YAD E+ L NG + GI + + L
Sbjct: 333 DTMYGETCASVAMSMFAQQMLDLEPKGEYADVLEKELFNGSIAGISLDGKQYYYVNALET 392
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P HH + ++ CC + + IY E +G V Q+I++
Sbjct: 393 TPDGLDNPDRHHVLSHRVDWFGCACCPANIARLIASVDRYIYTERDGG-KTVLSHQFIAN 451
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
++ SG + V Q+ + WD ++ T++ + + + LRIP W S T+N
Sbjct: 452 TAEFASG-LTVEQRSN--FPWDGHVEYTVSLPASATDSSVRFGLRIPGW-SRGSYTLTVN 507
Query: 577 GQDLPLPSPGNFLS--VTKTWSSDDKLTIQLPLTLRTEAIQDD---RPEYASIQAILYGP 631
G+ P+ G+ V ++ D L I L L + + ++ + R + + A++ GP
Sbjct: 508 GK----PAVGSLEDGFVYLVVNAGDTLEIALELDMSVKFVRANSRVRSDAGQV-AVMRGP 562
Query: 632 YVLAGHSI 639
V +
Sbjct: 563 LVYCAEQV 570
>gi|312133430|ref|YP_004000769.1| protein [Bifidobacterium longum subsp. longum BBMN68]
gi|311772660|gb|ADQ02148.1| Hypothetical protein BBMN68_1167 [Bifidobacterium longum subsp.
longum BBMN68]
Length = 658
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 103/248 (41%), Gaps = 19/248 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG-VLGIQRGTEPGVMIYLLPL 459
D+ E+C + M ++ + + YAD E+ L NG + GI + + L
Sbjct: 333 DTMYGETCASVAMSMFAQQMLDLEPKGEYADVLEKELFNGSIAGISLDGKQYYYVNALET 392
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P HH + ++ CC + + IY E +G V Q+I++
Sbjct: 393 TPDGLDNPDRHHVLSHRVDWFGCACCPANIARLIASVDRYIYTERDGG-KTVLSHQFIAN 451
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
++ SG + V Q+ + WD ++ T++ + + + LRIP W S T+N
Sbjct: 452 TAEFASG-LTVEQRSN--FPWDGHVEYTVSLPASATDSSVRFGLRIPGW-SRGSYTLTVN 507
Query: 577 GQDLPLPSPGNFLS--VTKTWSSDDKLTIQLPLTLRTEAIQDD---RPEYASIQAILYGP 631
G+ P+ G+ V ++ D L I L L + + ++ + R + + A++ GP
Sbjct: 508 GK----PAVGSLEDGFVYLVVNAGDTLEIALELDMSVKFVRANSRVRSDAGQV-AVMRGP 562
Query: 632 YVLAGHSI 639
V +
Sbjct: 563 LVYCAEQV 570
>gi|256421765|ref|YP_003122418.1| hypothetical protein Cpin_2738 [Chitinophaga pinensis DSM 2588]
gi|256036673|gb|ACU60217.1| protein of unknown function DUF1680 [Chitinophaga pinensis DSM
2588]
Length = 680
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 95/222 (42%), Gaps = 30/222 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C L +R + T + Y D E +L N +L G+ + Y PLA +S
Sbjct: 358 ETCANIGNLLWNRRMLELTGDAKYGDIVELTLYNSILSGVS--MDGADFFYTNPLA--AS 413
Query: 465 KERSYH-HWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
++ Y W + CC + + +++ + Y ++ G+YI Y ++L
Sbjct: 414 RDFPYQLRWMGGRQPYIALSNCCPPNTVRTIAEVSNYFYSLDD---KGIYIDLYGGNQLK 470
Query: 520 --WKSGQIV-VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
K G + + Q+ D WD + +T+ + LRIP W G T+N
Sbjct: 471 TTLKDGSTLSLEQETD--YPWDGTINITI---KDAPAHPFDIALRIPGWCQRAGI--TIN 523
Query: 577 GQDL-----PLPSPGNFLSVTKTWSSDDK--LTIQLPLTLRT 611
G+ + P +P ++ + + W S DK LT+ +P TL T
Sbjct: 524 GKPVGQTATPSITPASYHKLNRQWKSGDKITLTLDMPATLIT 565
>gi|251797570|ref|YP_003012301.1| hypothetical protein Pjdr2_3583 [Paenibacillus sp. JDR-2]
gi|247545196|gb|ACT02215.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 674
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 89/239 (37%), Gaps = 15/239 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ E+C M S +LF T E Y D E + N VL R + Y PL
Sbjct: 354 DNGYLETCAGVGMGFFSWNLFLATGESRYIDKLETIIYNIVLA-GRSMDGHKYFYENPLV 412
Query: 461 PGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW 520
R H S CC ++ +L IY +GK G +I YI S +
Sbjct: 413 SKGGHNRWEWH------SCPCCPPMIMKLMPELASYIY-AYDGK--GAFINLYIGSESEL 463
Query: 521 KSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDL 580
G + V K W + +T+T L LRIP W + +N Q
Sbjct: 464 LIGDVPVTVKQQTNYPWSGAVGITVTPERDAE---FDLRLRIPEWCGQYAIR--VNDQAA 518
Query: 581 PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSI 639
+ + + WS D++ ++L + + + + +A AI GP + S+
Sbjct: 519 NYELENGYAVLHRVWSPGDRIQLELDMPVHLVEVHPNVTTHADKAAIRRGPVLYCLESV 577
>gi|306824190|ref|ZP_07457561.1| protein of hypothetical function DUF1680 [Bifidobacterium dentium
ATCC 27679]
gi|309801097|ref|ZP_07695227.1| conserved hypothetical protein [Bifidobacterium dentium JCVIHMP022]
gi|304552578|gb|EFM40494.1| protein of hypothetical function DUF1680 [Bifidobacterium dentium
ATCC 27679]
gi|308222323|gb|EFO78605.1| conserved hypothetical protein [Bifidobacterium dentium JCVIHMP022]
Length = 721
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 61/287 (21%), Positives = 106/287 (36%), Gaps = 24/287 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ ESC + +R + + YAD E +L N L G+ + + L +
Sbjct: 363 DTAYSESCAAIALAFFARRMLEIQPKSEYADVMESALYNTTLAGMALDGKSFFYVNPLEV 422
Query: 460 APGSS--KERSYHHWGTPSDSFW----CC---YGTGIESFSKLGDSIYFEEEGKYPGVYI 510
P + ER +H P W CC +ES + ++ + Y +Y+
Sbjct: 423 VPEACHRDERKFH--VKPVRQKWFGCACCPPNIARMVESVQQYAYTVADDASTLYVHLYM 480
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT---SLNLRIPTWTS 567
+S++L G V+ +V + W+ +T+T S G +L LR+P W
Sbjct: 481 GGVVSAKL----GGSDVSLEVRAGMPWNGAGAITVTLPSSDEGQVPESFALALRLPAWAG 536
Query: 568 SNGAKATLNGQD-----LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYA 622
A +++ + +L +T TW D + P+ +R A E A
Sbjct: 537 GESAADSIHAMGEKDSRITRTIRDGYLYLTGTWRDGDVIDFDFPMPVRMIAANPLVREDA 596
Query: 623 SIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITF 669
A + GP + D + ++ I P + ITF
Sbjct: 597 GKVAFIRGPLAYCAEGTDNGDNLHLLHADAETIAADPDAVKVNEITF 643
>gi|325282251|ref|YP_004254793.1| hypothetical protein Odosp_3669 [Odoribacter splanchnicus DSM
20712]
gi|324314060|gb|ADY34613.1| protein of unknown function DUF1680 [Odoribacter splanchnicus DSM
20712]
Length = 796
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 63/254 (24%), Positives = 100/254 (39%), Gaps = 50/254 (19%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + + LF T E Y D ER+L NGV+ G+ + Y PL S
Sbjct: 337 ETCASISNVYWNYRLFLLTGESKYYDVLERALYNGVISGVS--LDGKRYFYDNPLMSDGS 394
Query: 465 KERSYHHWGTPSDSFWC-CYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
+RS W F C C + I F + G +++ Y+ + G
Sbjct: 395 HDRS--EW------FGCSCCPSNITRFMPSIPGYVYAVRGN--TLFVNLYMGN-----EG 439
Query: 524 QIV-----VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT---- 574
QI V K + W+ +++TL S S +L LRIP W T
Sbjct: 440 QITLEGQPVRIKQETRYPWEGRIKLTLDHSPASS---FTLALRIPGWVQQQPLPGTLYTY 496
Query: 575 -----------LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRT----EAIQDDRP 619
LNG+ + + + W +D++ + LP+ +R + DDR
Sbjct: 497 LDKDTPSYTISLNGKTVKPEVRNGYALLRGDWKGNDQIVLNLPMQVRKVIADPQVIDDRN 556
Query: 620 EYASIQAILYGPYV 633
+Y A++YGP V
Sbjct: 557 KY----ALIYGPIV 566
>gi|423348680|ref|ZP_17326362.1| hypothetical protein HMPREF1060_04034 [Parabacteroides merdae
CL03T12C32]
gi|409213201|gb|EKN06225.1| hypothetical protein HMPREF1060_04034 [Parabacteroides merdae
CL03T12C32]
Length = 679
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 91/437 (20%), Positives = 171/437 (39%), Gaps = 41/437 (9%)
Query: 197 HNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLL 256
++++L EK+ + A QK +GY P D L A + ++ ++
Sbjct: 113 NDQALIEKVQPWIEWTLASQKP--NGYFG--PDTDRDYEPGLQRNNAQDWWPKMVMLKVM 168
Query: 257 DQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN-DVLYKLFCI 315
QY A + R+ +M YF ++ + K + W E+ GG N V+Y L+ I
Sbjct: 169 QQYYTA--TQDRRVIDFMTRYFRYQLDELPK--NPLGKWTFWGEQRGGDNLMVVYWLYNI 224
Query: 316 TQDPKHLMLAHLFDKPCF-LGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG 374
T D L L L K F + L + + HS + + G + + Q K+
Sbjct: 225 TGDKFLLDLGELIHKQTFNWTDIFLNQNHLRRQHSLHCVNLAQG--FKEPIVYYQQGKDS 282
Query: 375 HQLESSGTNIGHFNFK--------SDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE 426
Q++++ + + L + E CT M+ + T +
Sbjct: 283 KQIQATRQAVNDIRHTIGLPTGLWGGDELLRFGKPTTGSELCTAVEMMYSLETILEVTGD 342
Query: 427 IAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSD--------- 477
+ +ADY ER N L Q + Y + R + + TP D
Sbjct: 343 MQWADYLERVAYNA-LPTQVTDDYSARQYYQQTN-QIAVTREWREFSTPHDDTDLLFGEL 400
Query: 478 -SFWCCYGTGIESFSKLGDSIYF--EEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPV 534
+ CC + + K ++++ + G ++ +++R+ +G I VN K +
Sbjct: 401 TGYPCCTSNLHQGWPKFVQNLWYATADNGLASLLFAPSQVTARV---AGGIEVNLKEETA 457
Query: 535 VSWDPYLRVTLTFSSKG-SGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVT 592
++ +R ++F+ K + +LRIP W K NG+ L + + PG +
Sbjct: 458 YPFEETVRYHVSFTDKKVKKVFFPFHLRIPGWCKQPVVK--FNGKPLTVDAYPGTVTRIN 515
Query: 593 KTWSSDDKLTIQLPLTL 609
+ W D L+++LP+ +
Sbjct: 516 REWKEGDILSLELPMEV 532
>gi|239624187|ref|ZP_04667218.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA]
gi|239520573|gb|EEQ60439.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA]
Length = 701
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 86/378 (22%), Positives = 144/378 (38%), Gaps = 36/378 (9%)
Query: 274 MVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCF 333
+ YF N + E Q + E GG +L K F + Q P L AHL +
Sbjct: 229 LAAYFLNERGKQPYFFEEEARQQGRDPEDGGPKGILGKSF-LAQGPYALFQAHLPVREQM 287
Query: 334 LG-----LLALQADDISGFHSNTHIPIVIGSQMRY--EVTGDQLHKEGHQLESSGTNIGH 386
LA ++ S T + + +R VT +++ G G +
Sbjct: 288 TAEGHAVRLAYMGAGMADVASETGDKSLWQACVRLWDNVTSKRMYITGGIGSQDGCERFN 347
Query: 387 FNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ 445
F+++ P + + E+C + M+ + + + Y D ER+L NGVL G+
Sbjct: 348 FDYQL-PN------EESYHETCASIAMVMWGFRMLQVAPDRRYGDVMERALYNGVLSGVS 400
Query: 446 RGTEPGVMIYLLPLAPGSSKERSYHHWGT-PSDSFW----CCYGTGIESFSKLGDSIY-- 498
+ L P ++R + P W CC LG Y
Sbjct: 401 LSGDRFFYANHLAAHPEMFRDRIIRNPRMFPERQRWFAVSCCPMNLARLLESLGGYQYTQ 460
Query: 499 --FEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT 556
E+ G+ V++ Q ++ + + ++V+ Q+ D W + V + G+
Sbjct: 461 GKLEDGGQAVYVHLYQEGTADIRVRDKKVVIRQETD--YPWQGDILVMVGTDLDGA---W 515
Query: 557 SLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLT-LRTEAIQ 615
+L LRIP W+ + L +D + +L V K WS + L + LP+ + EA
Sbjct: 516 TLALRIPEWS----GQPVLETEDAEVWEDRGYLYVRKDWSKNGHLHLSLPMQPVLMEAHP 571
Query: 616 DDRPEYASIQAILYGPYV 633
R + AI YGP V
Sbjct: 572 GVRMDCGKA-AIQYGPLV 588
>gi|421598168|ref|ZP_16041640.1| hypothetical protein BCCGELA001_11816 [Bradyrhizobium sp.
CCGE-LA001]
gi|404269708|gb|EJZ33916.1| hypothetical protein BCCGELA001_11816 [Bradyrhizobium sp.
CCGE-LA001]
Length = 276
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 35/153 (22%), Positives = 63/153 (41%), Gaps = 8/153 (5%)
Query: 481 CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPY 540
CC F+ +G IY + +Y+ YI + + G + +++ W+
Sbjct: 39 CCPPNIARLFTSVGHYIYTP---RSEALYVNLYIGNSVAIAVGGHTLRLRMNGNYPWEDL 95
Query: 541 LRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDK 600
+ + + +T +L LR+P W S+ K LNG+ + +L + +TW D+
Sbjct: 96 VEIAVESEQP---ITHTLALRLPEWCSAPEVK--LNGEPVNCEPRKGYLHIHRTWRKGDR 150
Query: 601 LTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+QLP+ R A AI GP +
Sbjct: 151 CKLQLPMKSRRVYGHPQLRHLAGKVAIQRGPLI 183
>gi|419849270|ref|ZP_14372326.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
gi|419852420|ref|ZP_14375295.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
gi|386410676|gb|EIJ25451.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
gi|386412392|gb|EIJ27063.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
Length = 658
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 103/248 (41%), Gaps = 19/248 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG-VLGIQRGTEPGVMIYLLPL 459
D+ E+C + M ++ + + YAD E+ L NG + GI + + L
Sbjct: 333 DTMYGETCASVAMSMFAQQMLDLEPKGEYADVLEKELFNGSIAGISLDGKQYYYVNALET 392
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P HH + ++ CC + + IY E +G V Q+I++
Sbjct: 393 TPDGLDNPDRHHVLSHRVDWFGCACCPANIARLIASVDRYIYTERDGG-KTVLSHQFIAN 451
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
++ SG + V Q+ + WD ++ T++ + + + LRIP W S T+N
Sbjct: 452 TAEFASG-LTVEQRSN--FPWDGHVEYTVSLPASATDSSVRFGLRIPGW-SRGSYTLTVN 507
Query: 577 GQDLPLPSPGNFLS--VTKTWSSDDKLTIQLPLTLRTEAIQDD---RPEYASIQAILYGP 631
G+ P+ G+ V ++ D L I L L + + ++ + R + + A++ GP
Sbjct: 508 GK----PAVGSLEDGFVYLVVNAGDTLEIALELDMSVKFVRANSRVRSDAGQV-AVMRGP 562
Query: 632 YVLAGHSI 639
V +
Sbjct: 563 LVYCAEQV 570
>gi|419848449|ref|ZP_14371547.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
1-6B]
gi|419854628|ref|ZP_14377413.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
44B]
gi|386407624|gb|EIJ22591.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
1-6B]
gi|386417540|gb|EIJ32018.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
44B]
Length = 658
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 103/248 (41%), Gaps = 19/248 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNG-VLGIQRGTEPGVMIYLLPL 459
D+ E+C + M ++ + + YAD E+ L NG + GI + + L
Sbjct: 333 DTMYGETCASVAMSMFAQQMLDLEPKGEYADVLEKELFNGSIAGISLDGKQYYYVNALET 392
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P HH + ++ CC + + IY E +G V Q+I++
Sbjct: 393 TPDGLDNPDRHHVLSHRVDWFGCACCPANIARLIASVDRYIYTERDGG-KTVLSHQFIAN 451
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
++ SG + V Q+ + WD ++ T++ + + + LRIP W S T+N
Sbjct: 452 TAEFASG-LTVEQRSN--FPWDGHVEYTVSLPASATDSSVRFGLRIPGW-SRGSYTLTVN 507
Query: 577 GQDLPLPSPGNFLS--VTKTWSSDDKLTIQLPLTLRTEAIQDD---RPEYASIQAILYGP 631
G+ P+ G+ V ++ D L I L L + + ++ + R + + A++ GP
Sbjct: 508 GK----PAVGSLEDGFVYLVVNAGDTLEIALELDMSVKFVRANSRVRSDAGQV-AVMRGP 562
Query: 632 YVLAGHSI 639
V +
Sbjct: 563 LVYCAEQV 570
>gi|154495095|ref|ZP_02034100.1| hypothetical protein PARMER_04142 [Parabacteroides merdae ATCC
43184]
gi|423725063|ref|ZP_17699203.1| hypothetical protein HMPREF1078_03097 [Parabacteroides merdae
CL09T00C40]
gi|154085645|gb|EDN84690.1| hypothetical protein PARMER_04142 [Parabacteroides merdae ATCC
43184]
gi|409235419|gb|EKN28237.1| hypothetical protein HMPREF1078_03097 [Parabacteroides merdae
CL09T00C40]
Length = 617
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 52/237 (21%), Positives = 100/237 (42%), Gaps = 21/237 (8%)
Query: 399 NLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLL 457
NLD+ E +C + M+ ++ + ++T + Y D ERS+ NG L G+ + Y+
Sbjct: 329 NLDAYCE-TCASVGMVYWNQRMNQFTGDSKYIDVLERSMYNGALAGVSLAGDR--FFYVN 385
Query: 458 PLAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISS 516
PL R + CC +G+ IY ++ + ++I
Sbjct: 386 PLESNGDHHRQAWY------GCACCPSQISRFLPSIGNYIYGTSDKALWVNLFIGNTTEV 439
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+D K ++V+ Q+ D WD +++T+T L L +RIP W S ++N
Sbjct: 440 TIDGK--KVVMKQETD--YPWDGLVKLTVTSEQP---LGKELRIRIPGWCKS--YTLSVN 490
Query: 577 GQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
G + + + +V K W + D + + + + + + + +A+ GP V
Sbjct: 491 GNKVDSTTDKGY-TVIKEWKTGDLIVLNMDMPVEKVSADPRVRQNTGKRALQRGPLV 546
>gi|189462782|ref|ZP_03011567.1| hypothetical protein BACCOP_03480 [Bacteroides coprocola DSM 17136]
gi|189430398|gb|EDU99382.1| hypothetical protein BACCOP_03480 [Bacteroides coprocola DSM 17136]
Length = 578
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 66/313 (21%), Positives = 122/313 (38%), Gaps = 54/313 (17%)
Query: 393 PKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPG 451
P+ + N D+ E+C + + +F K+ Y D E +L N VL G+ +
Sbjct: 97 PEYVLPNKDA-YNETCAAVGNVMFNYRMFLTKKDARYVDVAEVALYNNVLAGVN--LDGN 153
Query: 452 VMIYLLPLAPGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPG 507
Y+ PL + R+ + G S W CC ++ +Y +
Sbjct: 154 KFFYVNPL---EADARNAFNQGLKGRSPWFGTACCPSNIARLIPQIPGMMYAHTDND--- 207
Query: 508 VYIIQY--ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW 565
+Y Y S+ + G++ + Q + +D +R + + S +++ RIPTW
Sbjct: 208 IYCTFYAGTSTVVPLSDGKVTIKQTTN--YPFDESVRFEI--KPEQSKQKFAMHFRIPTW 263
Query: 566 TSSNGA---------------KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
K LNG+++ + F+++ + W S D + +QLP+ +R
Sbjct: 264 AGKQFVPGKLYHYLNDKPAEWKVLLNGKEVSVKPHKGFVTIERAWKSGDLVELQLPMLVR 323
Query: 611 -TEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASY---NSQL 666
+AI + + I GP V S+ + +PASY S+
Sbjct: 324 YNKAISQVEADIDRV-CITRGPLVYCAESVDN--------------VAMPASYVVNPSED 368
Query: 667 ITFTQEYGNTKFV 679
I+ T+ G K++
Sbjct: 369 ISITKGAGALKYI 381
>gi|171741882|ref|ZP_02917689.1| hypothetical protein BIFDEN_00978 [Bifidobacterium dentium ATCC
27678]
gi|283456925|ref|YP_003361489.1| hypothetical protein BDP_2104 [Bifidobacterium dentium Bd1]
gi|171277496|gb|EDT45157.1| hypothetical protein BIFDEN_00978 [Bifidobacterium dentium ATCC
27678]
gi|283103559|gb|ADB10665.1| Conserved hypothetical protein [Bifidobacterium dentium Bd1]
Length = 721
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 61/287 (21%), Positives = 106/287 (36%), Gaps = 24/287 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ ESC + +R + + YAD E +L N L G+ + + L +
Sbjct: 363 DTAYSESCAAIALAFFARRMLEIQPKSEYADVMESALYNTTLAGMALDGKSFFYVNPLEV 422
Query: 460 APGSS--KERSYHHWGTPSDSFW----CC---YGTGIESFSKLGDSIYFEEEGKYPGVYI 510
P + ER +H P W CC +ES + ++ + Y +Y+
Sbjct: 423 VPEACHRDERKFH--VKPVRQKWFGCACCPPNIARMVESVQQYAYTVADDASTLYVHLYM 480
Query: 511 IQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT---SLNLRIPTWTS 567
+S++L G V+ +V + W+ +T+T S G +L LR+P W
Sbjct: 481 GGVVSAKL----GGSDVSLEVRAGMPWNGAGAITVTLPSSDEGQVPEPFALALRLPAWAG 536
Query: 568 SNGAKATLNGQD-----LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYA 622
A +++ + +L +T TW D + P+ +R A E A
Sbjct: 537 GESAADSIHAAGEKDSRITRTIRDGYLYLTGTWRDGDVIDFDFPMPVRMIAANPLVREDA 596
Query: 623 SIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITF 669
A + GP + D + ++ I P + ITF
Sbjct: 597 GKVAFIRGPLAYCAEGTDNGDNLHLLHADAETIAADPDAVKVNEITF 643
>gi|386822341|ref|ZP_10109556.1| hypothetical protein JoomaDRAFT_0361 [Joostella marina DSM 19592]
gi|386423587|gb|EIJ37418.1| hypothetical protein JoomaDRAFT_0361 [Joostella marina DSM 19592]
Length = 684
Score = 47.4 bits (111), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 70/155 (45%), Gaps = 17/155 (10%)
Query: 481 CCYGTGIESFSKLGDSIYFE--EEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWD 538
CCY + ++K ++F+ E G +Y IS+++ K+ +IV+ + D
Sbjct: 420 CCYVNMHQGWTKFTQHLWFKNKEGGLAALIYSPNTISTKI--KNQEIVIKENTSYPFGED 477
Query: 539 PYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSD 598
+T G + ++ RIP W N A T+NG+ + + +++ +TW +
Sbjct: 478 VNFEITT-----GKEIDFPMDFRIPKW--CNNASITVNGEKVIFEKNKSIVTINRTWENG 530
Query: 599 DKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
D + + LP+ ++ ++ +AI GP V
Sbjct: 531 DLIKLSLPMEVKVSQWAENS------RAIERGPLV 559
>gi|423344367|ref|ZP_17322079.1| hypothetical protein HMPREF1077_03509 [Parabacteroides johnsonii
CL02T12C29]
gi|409212765|gb|EKN05799.1| hypothetical protein HMPREF1077_03509 [Parabacteroides johnsonii
CL02T12C29]
Length = 816
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 55/257 (21%), Positives = 103/257 (40%), Gaps = 37/257 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + ++ +F T + Y D ER+L NGV+ G+ + Y PL
Sbjct: 338 ETCASIANVYWNQRMFLATGDAKYIDVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 395
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS--SRLDWKS 522
ER+ P CC G + + +Y + +Y+ Y+ SR+ +
Sbjct: 396 HERA------PWFGCACCPGNVTRFMASVPKYMYATQGNS---LYVNLYVGSESRVALAN 446
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT-------- 574
+ + Q + WD +++T++ K S SL LRIP+WT + +
Sbjct: 447 DTVTLVQNTE--YPWDGLVKLTVS-PRKASSF--SLKLRIPSWTGNEPVPGSDLYTYIKR 501
Query: 575 --------LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
+NG L + ++ + + W D + +++P+ +R + + A
Sbjct: 502 DREPCAVFVNGTPLKEKAHHGYVVIEREWEPGDVIELRMPMDVRRVKAHEKVRADQGLLA 561
Query: 627 ILYGP--YVLAGHSIGD 641
+ GP Y L G + D
Sbjct: 562 VERGPVVYCLEGVDMPD 578
>gi|340619113|ref|YP_004737566.1| hypothetical protein zobellia_3148 [Zobellia galactanivorans]
gi|339733910|emb|CAZ97287.1| Conserved hypothetical protein [Zobellia galactanivorans]
Length = 656
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 91/213 (42%), Gaps = 22/213 (10%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLPLAP-G 462
E+C S + E YAD E L N L GI G E Y PL
Sbjct: 335 ETCANLCNAMFSYRMLNLKAEAKYADIVELVLYNSALSGISVSGKE---YFYANPLRMLN 391
Query: 463 SSKERSYHHWGT------PSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYIS 515
++++ + H T P S +CC + + + + + Y E G +Y ++
Sbjct: 392 NTRDYNAHENVTETPNREPYLSCFCCPPNLVRTIATVSEWAYSLSENGISVNLYGANHLD 451
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
+RL I V+Q+ W+ +++ + + S++LRIP W + +K TL
Sbjct: 452 TRL-LDDSPIKVSQET--AYPWEGRVKLNI---EECKTEAFSISLRIPKWAKN--SKLTL 503
Query: 576 NGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPL 607
NG++L L PG+F + + W D L + +P+
Sbjct: 504 NGEELTMLLEPGSFAHIERNWKKGDVLILDMPM 536
>gi|160932141|ref|ZP_02079532.1| hypothetical protein CLOLEP_00975 [Clostridium leptum DSM 753]
gi|156868743|gb|EDO62115.1| hypothetical protein CLOLEP_00975 [Clostridium leptum DSM 753]
Length = 705
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 58/245 (23%), Positives = 96/245 (39%), Gaps = 25/245 (10%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL- 459
D+ E+C + ++ + + + + Y D ER+L N VLG + Y+ PL
Sbjct: 384 DTAYAETCASIGLIFFAHRMLQMDMDSRYGDVMERALYNVVLG-SASRDGKRFFYVNPLE 442
Query: 460 ----APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
A G + ++ + P W CC + L +Y +E +Y
Sbjct: 443 VWPKACGGNPDKQHV---KPVRQKWFGCACCPPNVARLMASLNQYLYSTDEDT---IYTH 496
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
YIS K + K + WD +++ T+ + L SL LR+P W +
Sbjct: 497 LYISGEAGIKIAGGEMRLKQESSYPWDGHIKFTVLSALPEDEL--SLGLRLPGWCRN--W 552
Query: 572 KATLNGQDLPLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILY- 629
NG+ +P P +L V W D T++L L + E +Q + A I +
Sbjct: 553 SVLFNGKPVPRPVVQKGYLKVAAHWHEGD--TVELRLEMPVECLQANPQVRADAGKIAFQ 610
Query: 630 -GPYV 633
GP V
Sbjct: 611 RGPLV 615
>gi|160934492|ref|ZP_02081878.1| hypothetical protein CLOLEP_03364 [Clostridium leptum DSM 753]
gi|156865945|gb|EDO59317.1| hypothetical protein CLOLEP_03364 [Clostridium leptum DSM 753]
Length = 650
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 95/234 (40%), Gaps = 16/234 (6%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA--PGS 463
ESC + ++ ++ + T E Y D ER+L N VLG E Y+ PL P +
Sbjct: 334 ESCASVGLMMFAQRMASLTGEAVYYDVVERALCNTVLG-GISKEGKRYFYVNPLEVWPQN 392
Query: 464 SKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
+ P W CC + + LG IY + E +Y+ Q+ISS
Sbjct: 393 CLASTSMAHVKPVRQKWFGCACCPPNIARTLASLGQYIYAQSED---SLYVNQFISSSSA 449
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
+ G + +D D +R+T + L L +RIP + K +NG+D
Sbjct: 450 VEIGGQEIEFSMDSTYMKDGAVRITAKCGKREEALY--LRVRIPEYFKKPTLK--VNGKD 505
Query: 580 LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
L + + ++ L ++ L A ++ R + + AI+ GPYV
Sbjct: 506 ATLKLEQGYAVIPLEELTEVCLQGEI-LPRFVAANRNVRADMGRL-AIMKGPYV 557
>gi|149276410|ref|ZP_01882554.1| hypothetical protein PBAL39_01782 [Pedobacter sp. BAL39]
gi|149232930|gb|EDM38305.1| hypothetical protein PBAL39_01782 [Pedobacter sp. BAL39]
Length = 670
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 80/392 (20%), Positives = 152/392 (38%), Gaps = 43/392 (10%)
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEE 301
W P + KIL QY Y+ A+ R+ M YF +++ + K+ HW
Sbjct: 153 WWPKMVMLKILK----QY-YSATADP-RVIKLMTAYFRFQLKELPSKHL--DHWSFWARY 204
Query: 302 AGGMNDVL-YKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
GG N ++ Y L+ IT D L L L + F A ++ S+ H + +
Sbjct: 205 RGGDNLMMVYWLYNITGDAFLLDLGELLHRQTFDFTNAFANTNMLSSLSSIHT-VNLAQG 263
Query: 361 MRYEVTGDQLHKEGHQLESSG---------TNIGHFNFKSDPKRLASNLDSNTEESCTTY 411
M+ V Q HK+ L++ + H + D + L N + E CT
Sbjct: 264 MKEPVIYYQQHKDQKYLDAVDKGLADIRKYNGMAHGGYGGD-EALHGNNPTQGLELCTAV 322
Query: 412 NMLKVSRHLFRWTKEIAYADYYER--------SLTNGVLGIQRGTEPGVMIYLLPLAPGS 463
M+ + T + +YAD E+ +T+ + Q + + +
Sbjct: 323 EMMFSLESMLEITGKTSYADKLEKLAFNALPAQVTDDFMARQYYQQANQV-----MVTRG 377
Query: 464 SKERSYHHWGTPS-----DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
++ +H GT F CC + + K +++++ + + G+ + Y S +
Sbjct: 378 TRNFEQNHNGTDVCYGLLTGFPCCTSNMHQGWPKFTQNLWYKTDDQ--GIAALVYAPSEV 435
Query: 519 DWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ + I + K ++ +R TL + L+ +LRIP W A +NG
Sbjct: 436 HAQVANGIEIFFKEQTNYPFEERIRFTLEMPKRIKNLSFPFHLRIPEWCKR--ATVKING 493
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTL 609
+ +++ W++ D + + LP+ +
Sbjct: 494 NTWKEVDGNQVVKISRQWNTGDVVELLLPMEI 525
>gi|153852636|ref|ZP_01994073.1| hypothetical protein DORLON_00046 [Dorea longicatena DSM 13814]
gi|149754278|gb|EDM64209.1| hypothetical protein DORLON_00046 [Dorea longicatena DSM 13814]
Length = 649
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 38/173 (21%), Positives = 77/173 (44%), Gaps = 11/173 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
+ N E+C + + R + + TK+ +Y D ER+L N +L GI + + + L +
Sbjct: 328 NCNYSETCASIGLALFGRRMAQITKDASYMDMVERALYNTLLSGIAQDGKSFFYVNPLEV 387
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + +R+ P W CC + + +G IYF ++ Y+ YIS
Sbjct: 388 WPDNCIDRTSKEHVKPVRQKWFGVACCPPNIARTLASMGQYIYFTDKNT---AYVNLYIS 444
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS 568
+ + + + +++ ++ ++R+ +T +G L LRIP + +
Sbjct: 445 NEAQIELEEGALKIQIESDLTNTGHIRMAITPDGEGE---HRLALRIPDYVKT 494
>gi|398351289|ref|YP_006396753.1| cytoplasmic protein [Sinorhizobium fredii USDA 257]
gi|390126615|gb|AFL49996.1| putative cytoplasmic protein [Sinorhizobium fredii USDA 257]
Length = 937
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 54/242 (22%), Positives = 97/242 (40%), Gaps = 19/242 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + +AD E++L NG L G+ + Y PL
Sbjct: 624 DTAYAETCASVALVFWASRMLGRGPNRRFADIMEQALYNGALSGLS--LDGKTFFYDNPL 681
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
R H + CC + +G +Y + V++ ++RL+
Sbjct: 682 ESTGKHHRWRWH------NCPCCPPNIARLVASVGAYMYGVATDEI-AVHLYGESTARLE 734
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ- 578
+ + Q + W+ + + L +L+LRIP W ++GA ++NG
Sbjct: 735 LDGSNVTLRQVTN--YPWEGAVSIRLELEEP---RQFALSLRIPEW--ADGASISVNGSG 787
Query: 579 -DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
DL + + + + WS D ++I LPL LR + + A A+L GP V
Sbjct: 788 IDLEHVTLDGYARIEREWSDGDAVSIDLPLKLRPQFANPKVRQDAGRIALLRGPLVYCAE 847
Query: 638 SI 639
I
Sbjct: 848 EI 849
>gi|354583084|ref|ZP_09001984.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
gi|353198501|gb|EHB63971.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
Length = 626
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 39/177 (22%), Positives = 76/177 (42%), Gaps = 11/177 (6%)
Query: 478 SFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSW 537
+F CC + + KL ++ +++ + G+ + Y + G+ V ++ +
Sbjct: 361 NFGCCTANMHQGWPKLAAHLWMKDQEE--GLVAVSYAPCTVMTTVGRHDVAAVIEVTGEY 418
Query: 538 DPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSS 597
R+ + S + + L+LRIP W + TLNG++LP + + + W +
Sbjct: 419 PFKDRIRIHMSLE-RAESFPLSLRIPAWC--DDPVITLNGRELPFQVESGYARIVQHWQN 475
Query: 598 DDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDW 654
D+L + LP+ +R + R YA+ +I GP V +W + DW
Sbjct: 476 GDRLELHLPMEVRLVS----RNMYAT--SIERGPLVYVLPVKENWQMIRQRDMFHDW 526
>gi|310639743|ref|YP_003944501.1| hypothetical protein [Paenibacillus polymyxa SC2]
gi|386038944|ref|YP_005957898.1| hypothetical protein PPM_0254 [Paenibacillus polymyxa M1]
gi|309244693|gb|ADO54260.1| hypothetical protein PPSC2_c0275 [Paenibacillus polymyxa SC2]
gi|343094982|emb|CCC83191.1| hypothetical protein PPM_0254 [Paenibacillus polymyxa M1]
Length = 647
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 49/216 (22%), Positives = 92/216 (42%), Gaps = 17/216 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
DS E+C + + + + R + YAD ER+L NG + G+ + + L +
Sbjct: 331 DSMYCETCASVGLAFWANRMLRLAPDRKYADVLERALYNGTISGMDLDGKRFFYVNPLEV 390
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
P + H T ++ CC + + D++Y + E +Y YI+S
Sbjct: 391 NPFQKSRKDQEHVKTERQKWFFCACCPPNLARMIASVEDNMYTQTEDT---LYTHLYIAS 447
Query: 517 RLDWK-SGQ-IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKAT 574
+++ SGQ I + Q WD L +++ + + LRIP W A+
Sbjct: 448 KVNMTLSGQEIEITQTHH--YPWDADLALSIHVTEPTA---FKWALRIPGWCKQ--AEVK 500
Query: 575 LNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTL 609
+NG+ + L ++ + +TW D +T+ L + +
Sbjct: 501 VNGEVISLDHLEKGYVEIQRTWKDGDMVTLHLAMPV 536
>gi|154486968|ref|ZP_02028375.1| hypothetical protein BIFADO_00805 [Bifidobacterium adolescentis
L2-32]
gi|154084831|gb|EDN83876.1| hypothetical protein BIFADO_00805 [Bifidobacterium adolescentis
L2-32]
Length = 660
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 94/215 (43%), Gaps = 22/215 (10%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLPLA--P 461
E+C + ML + L + AD E+ L NGVL G+Q GT Y+ PL P
Sbjct: 344 ETCASVAMLFYGKSLMETKPRGSVADVMEKELFNGVLSGVQLDGTR---YFYVNPLEADP 400
Query: 462 GSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISS 516
+SK + W CC + L +Y +GK VY Q++++
Sbjct: 401 AASKGNPTKAHILTRRAGWFDCACCPANLGRLIASLDQYLYTVSNDGKT--VYAHQFVAN 458
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFS-SKGSGLTTSLNLRIPTWTSSNGAKATL 575
+ +++ G + + W +TF S +GL + +RIP W S +
Sbjct: 459 KTEFEDGFTIEQTQAGDEYPWSG----DITFHVSNPNGLDKKVAVRIPQW--SKDYTLEV 512
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
NG+ + LP F++V + ++D ++ + L +++R
Sbjct: 513 NGEAVELPVVDGFVTVDAS-AADTEIHLVLDMSVR 546
>gi|375085154|ref|ZP_09731863.1| hypothetical protein HMPREF9454_00474 [Megamonas funiformis YIT
11815]
gi|374567570|gb|EHR38783.1| hypothetical protein HMPREF9454_00474 [Megamonas funiformis YIT
11815]
Length = 654
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 53/247 (21%), Positives = 105/247 (42%), Gaps = 23/247 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPG-S 463
E+C + ++ + ++ + + YAD E++L N V+ G+ + + L + P S
Sbjct: 338 ETCASIGLIFFANNMLKLDVDSQYADIMEKALYNTVIDGMALDGKHFFYVNPLEVVPQLS 397
Query: 464 SKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW 520
K+ H T +++ CC S L + +Y ++ +Y Y+S++ D+
Sbjct: 398 HKDPGKSHVKTVRPAWFGCACCPPNLARLLSSLDEYMYTVKDDV---IYSNLYVSNKSDF 454
Query: 521 KSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDL 580
K V++ + WD ++T +S+ T L LRIP+W +N LNG++
Sbjct: 455 KINNQVISIEEITDYPWDG--KITFKVNSEA---TFKLGLRIPSW--ANRYLFKLNGKEF 507
Query: 581 PLPSPGNFLSVTKTWSSDD----KLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
+ + +TW D + I+ +++D Y + AI GP +
Sbjct: 508 TPKIEKGYAIIDRTWEKGDIVIFDIQIEANFVCANPLVRED---YGKV-AIQRGPIIYCA 563
Query: 637 HSIGDWD 643
+ + D
Sbjct: 564 EGVDNGD 570
>gi|212716839|ref|ZP_03324967.1| hypothetical protein BIFCAT_01782 [Bifidobacterium catenulatum DSM
16992 = JCM 1194]
gi|212660124|gb|EEB20699.1| hypothetical protein BIFCAT_01782 [Bifidobacterium catenulatum DSM
16992 = JCM 1194]
Length = 660
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 94/215 (43%), Gaps = 22/215 (10%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLPLA--P 461
E+C + ML + L + AD E+ L NGVL G+Q GT Y+ PL P
Sbjct: 344 ETCASVAMLFYGKSLMETKPRGSVADVMEKELFNGVLSGVQLDGTR---YFYVNPLEADP 400
Query: 462 GSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISS 516
+SK + W CC + L +Y +GK VY Q++++
Sbjct: 401 AASKGNPTKAHILTRRAGWFDCACCPANLGRLITSLDQYLYTVSNDGKT--VYAHQFVAN 458
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFS-SKGSGLTTSLNLRIPTWTSSNGAKATL 575
+ +++ G + + W +TF S +GL + +RIP W S +
Sbjct: 459 KTEFEDGFTIEQTQAGDEYPWSG----DITFHVSNPNGLDKKVAVRIPQW--SKDYTLEV 512
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
NG+ + LP F++V + ++D ++ + L +++R
Sbjct: 513 NGEAVELPVVDGFVTVDAS-AADTEIHLVLDMSVR 546
>gi|255035900|ref|YP_003086521.1| hypothetical protein Dfer_2133 [Dyadobacter fermentans DSM 18053]
gi|254948656|gb|ACT93356.1| protein of unknown function DUF1680 [Dyadobacter fermentans DSM
18053]
Length = 673
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 105/244 (43%), Gaps = 25/244 (10%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLA---- 460
E+C + + + + T E YAD E +L N VL GI + +Y PLA
Sbjct: 357 ETCANIGNVLWNWRMLQITGEAKYADIVELALYNSVLSGISLKGDK--FLYTNPLAYSDA 414
Query: 461 -PGSSK-ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
P + E+ + + S+ CC + + +++ Y + GV+ Y ++
Sbjct: 415 LPFKQRWEKDRQAYISKSN---CCPPNTVRTVAEVSQYAYSLSDA---GVFFNLYGGNKF 468
Query: 519 D--WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
K GQ+ + Q D W+ + +TL + K + SL RIP W S+ A +N
Sbjct: 469 QTAVKGGQLQLTQVTD--YPWNGKISITLDQAPKDA---LSLFFRIPGWCSN--ASMVIN 521
Query: 577 GQ-DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G+ + + G++ + +TW S DK+ + L + ++ E + A+ GP V
Sbjct: 522 GKKETAKLASGSYAELRRTWKSGDKIELMLEMPVKLIESNPLVEETRNQVAVKRGPVVYC 581
Query: 636 GHSI 639
S+
Sbjct: 582 VESV 585
>gi|378763347|ref|YP_005191963.1| hypothetical protein SFHH103_05359 [Sinorhizobium fredii HH103]
gi|365182975|emb|CCE99824.1| hypothetical protein SFHH103_05359 [Sinorhizobium fredii HH103]
Length = 879
Score = 47.0 bits (110), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 51/242 (21%), Positives = 97/242 (40%), Gaps = 19/242 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + +AD E++L NG L G+ + Y PL
Sbjct: 566 DTAYAETCASVALVFWASRMLGRGPNRRFADIMEQALYNGALSGLS--LDGKTFFYDNPL 623
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
R H + CC + +G +Y + V++ + RL+
Sbjct: 624 ESTGKHHRWKWH------NCPCCPPNIARLVASVGAYMYGVAAEEI-AVHLYGESTVRLE 676
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ- 578
+ + Q + WD + + L +L+LRIP W ++GA+ +NG
Sbjct: 677 VGGSDVTLQQVTN--YPWDGAVSIKLDLKEP---RQFALSLRIPEW--ADGARIAINGSS 729
Query: 579 -DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
DL + + + W++ D ++++LPL LR + + A A++ GP V
Sbjct: 730 VDLDAVMTDGYARIERQWANGDAVSLELPLQLRPQYANPKVRQDAGRVALMRGPLVYCAE 789
Query: 638 SI 639
+
Sbjct: 790 EV 791
>gi|423214410|ref|ZP_17200938.1| hypothetical protein HMPREF1074_02470 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692825|gb|EIY86061.1| hypothetical protein HMPREF1074_02470 [Bacteroides xylanisolvens
CL03T12C04]
Length = 679
Score = 47.0 bits (110), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 98/444 (22%), Positives = 167/444 (37%), Gaps = 57/444 (12%)
Query: 198 NESLKEKMSAVVSALSACQKEIG-------SGYLSAFPTEQFDRLEALIPVWAPYYTIHK 250
N+ LK+K+ + A QK G GY P Q D W P + K
Sbjct: 112 NKELKQKVQPWIEWTLASQKPNGYFGPDTDKGYE---PGLQRDNARD----WWPKMVVLK 164
Query: 251 ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN-DVL 309
I+ QY A + R+ +M YF +++ + K + W E+ GG N ++
Sbjct: 165 IM----QQYYSATKDQ--RVIPFMTNYFKYQLEELPK--NPLGKWTFWAEQRGGDNLMIV 216
Query: 310 YKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQMRYEVTGDQ 369
Y L+ IT D L L L + D+ + H + + + Q
Sbjct: 217 YWLYNITGDKFLLELGELLNSQNVNWTDVFTKDNHLYRQHSLHC-VNLAQGFKQPTVYYQ 275
Query: 370 LHKEGHQLESS-----------GTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSR 418
K+ LE++ GT IG + + R + + E CT M+
Sbjct: 276 QSKDKENLEAAEKAMKTIRNTIGTPIGLWA-GDELIRFGDPIYGS--ELCTAVEMMYSLE 332
Query: 419 HLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDS 478
++ T + +AD ER N L Q + Y + + YH++ TP +
Sbjct: 333 NMLEITGNMQWADQLERIAYNA-LPTQISDDAQARQYYQQVN-QIAVVNDYHNFSTPHEG 390
Query: 479 ----------FWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK-SGQIVV 527
+ CC + + K +++ GV + Y SS + + + I+V
Sbjct: 391 TDNLFGTLTGYPCCSSNLHQGWPKFVQHLWYATVDN--GVAALVYASSEVKMQVANNILV 448
Query: 528 NQKVDPVVSWDPYLRVTLTFSSKG-SGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPG 586
N K + +D + ++T+ K T +LR+P W LNGQ + G
Sbjct: 449 NIKEETYYPFDETVSFSITYPDKKIKKATFPFHLRVPEWCKK--PIVNLNGQTIKTDVTG 506
Query: 587 -NFLSVTKTWSSDDKLTIQLPLTL 609
+ + + W +DK+TI+ P T+
Sbjct: 507 ERMIILNREWQQNDKITIEFPATI 530
>gi|423348679|ref|ZP_17326361.1| hypothetical protein HMPREF1060_04033 [Parabacteroides merdae
CL03T12C32]
gi|409213200|gb|EKN06224.1| hypothetical protein HMPREF1060_04033 [Parabacteroides merdae
CL03T12C32]
Length = 617
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 49/230 (21%), Positives = 96/230 (41%), Gaps = 20/230 (8%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + M+ ++ + ++T + Y D ERS+ NG L G+ + Y+ PL
Sbjct: 335 ETCASVGMVYWNQRMNQFTGDSKYIDVLERSMYNGALAGVSLAGDR--FFYVNPLESNGD 392
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSRLDWKSG 523
R + CC +G+ IY ++ + ++I +D K
Sbjct: 393 HHRQAWY------GCACCPSQISRFLPSIGNYIYGTSDKALWVNLFIGNTTEVTIDGK-- 444
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
++V+ Q+ D WD +++T+T L L +RIP W S ++NG +
Sbjct: 445 KVVMKQETD--YPWDGLVKLTVTSEQP---LGKELRIRIPGWCKS--YTLSVNGNKVDST 497
Query: 584 SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+ + +V K W + D + + + + + + + +A+ GP V
Sbjct: 498 TDKGY-TVIKEWKTGDLIVLNMDMPVEKVSADPRVRQNTGKRALQRGPLV 546
>gi|418468281|ref|ZP_13039095.1| hypothetical protein SMCF_2011 [Streptomyces coelicoflavus ZG0656]
gi|371551122|gb|EHN78456.1| hypothetical protein SMCF_2011 [Streptomyces coelicoflavus ZG0656]
Length = 796
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 69/143 (48%), Gaps = 19/143 (13%)
Query: 477 DSFWCC---YGTGIESFSK---LGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK 530
D++ CC YG G F++ LG + G +Y +++ + ++ V +
Sbjct: 386 DNYRCCPHNYGMGWPYFTEELWLGTP----DRGLAAAMYAPSRVTAAVGADGTRVTVTED 441
Query: 531 VDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLS 590
D +D + +T++ + + L+LRIP W G + +NG+ +P F+
Sbjct: 442 TD--YPFDDTITLTVSGPRR---VAFPLSLRIPGW--CEGPQVRVNGRPVPAADGPAFVR 494
Query: 591 VTKTWSSDDKLTIQLP--LTLRT 611
V +TWS D++T++LP TLR+
Sbjct: 495 VERTWSDGDRVTLRLPQRTTLRS 517
>gi|374385207|ref|ZP_09642715.1| hypothetical protein HMPREF9449_01101 [Odoribacter laneus YIT
12061]
gi|373226412|gb|EHP48738.1| hypothetical protein HMPREF9449_01101 [Odoribacter laneus YIT
12061]
Length = 679
Score = 46.6 bits (109), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 90/424 (21%), Positives = 168/424 (39%), Gaps = 60/424 (14%)
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQ-NVIKKYSIERHWQTLNE 300
W P + KIL QY A E R+ +M +YF R Q N + + +W E
Sbjct: 156 WWPRMVVLKIL----QQYYSATGDE--RVIAFMTQYF--RYQWNTLPTVPLG-NWTFWAE 206
Query: 301 EAGGMN-DVLYKLFCITQDPKHLMLAHLFDKPCFLGL-LALQADDISGFHSNTHIPIVIG 358
N +Y L+ IT D L L L + + L + L DD++ ++ + + G
Sbjct: 207 YRACDNLQAVYWLYNITGDAFLLDLGKLLHRQGYDYLDMFLYRDDLTRINTIHCVNLAQG 266
Query: 359 SQ---MRYEVTGDQLHKEGHQLESSGTNIGHFNFKSD-----PKRLASNLDSNTEESCTT 410
+ + Y+ D+ + + ++ + +I F+ + + L N + E C+
Sbjct: 267 IKEPVIYYQQETDERYLQA--VKKAFKDIRQFHGQPQGMYGGDEALHGNNPTQGSELCSA 324
Query: 411 YNMLKVSRHLFRWTKEIAYADYYER--------SLTNGVLGIQRGTEPG-VMIYLLPLAP 461
++ + T ++ +AD+ E+ +T+ + Q +P VMI
Sbjct: 325 VELMYSLEKMLEITADVQFADHLEKIAFNALPTQITDDFMARQYFQQPNQVMI------- 377
Query: 462 GSSKERSYHHWGTPSDSFW-------CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
+ +R++ +D + CC + + K ++++ K +
Sbjct: 378 -TRHKRNFDIDHGETDLVYGLLSGYPCCSSNMHQGWPKFTQNLWYATADKGMAALVYSPS 436
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTF---SSKGSGLTTSLNLRIPTWTSSNGA 571
R GQ V + + D R+ +F +K G+T L+LRIP W A
Sbjct: 437 VVRAKVADGQ-TVEIREETFYPMDD--RINFSFHLLENKKKGVTFPLHLRIPAWCRE--A 491
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ +NG+ L +T+ W +D+LT+ LP+ + T+ Y + A+ GP
Sbjct: 492 RIEINGKLLKTAGGNRIEVITRHWKEEDQLTLVLPMQVTTDTW------YENSIAVERGP 545
Query: 632 YVLA 635
V A
Sbjct: 546 LVYA 549
>gi|423290501|ref|ZP_17269350.1| hypothetical protein HMPREF1069_04393 [Bacteroides ovatus
CL02T12C04]
gi|392665888|gb|EIY59411.1| hypothetical protein HMPREF1069_04393 [Bacteroides ovatus
CL02T12C04]
Length = 684
Score = 46.6 bits (109), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 55/110 (50%), Gaps = 10/110 (9%)
Query: 549 SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPL 607
S G + LRIP+WT GA+ +NG+ + + P G +L + + W++ D++ + LP+
Sbjct: 469 STGEKVAFPFYLRIPSWTK--GAEVRVNGKKVSVTPVAGKYLCINREWANGDRVELTLPM 526
Query: 608 TLRTEAIQDDRPEYASIQAILYGPYVLA---GHSIGDWDITESATSLSDW 654
+L Q ++ + ++ YGP L+ + D E+A S W
Sbjct: 527 SLSMRTWQVNK----NSVSVDYGPLTLSLKIAEKYVEKDSRETAIGDSKW 572
>gi|417534741|ref|ZP_12188420.1| secreted protein, partial [Salmonella enterica subsp. enterica
serovar Urbana str. R8-2977]
gi|353658157|gb|EHC98420.1| secreted protein, partial [Salmonella enterica subsp. enterica
serovar Urbana str. R8-2977]
Length = 289
Score = 46.6 bits (109), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 79/205 (38%), Gaps = 15/205 (7%)
Query: 435 RSLTNGVLGIQRGTEPGVMIYLLPLA--PGSSKERSYHHWGTPSDSFW----CCYGTGIE 488
R+L N VLG + Y+ PL P S K + P W CC
Sbjct: 1 RALYNTVLG-GMALDGKHFFYVNPLEVHPKSLKFNHIYDHVKPIRQRWFGCACCPPNIAR 59
Query: 489 SFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFS 548
+ LG IY + +YI Y+ + ++ + ++ W +++ +
Sbjct: 60 VLTSLGHYIYTP---RADALYINMYVGNSMEIPVENGALKLRISGNYPWHEQVKIAIDSV 116
Query: 549 SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLT 608
+ +L LR+P W AK TLNG ++ +L + +TW D +T+ LP+
Sbjct: 117 QP---VRHTLALRLPDWCPE--AKVTLNGLEVEQDIRKGYLHIRRTWQEGDTITLTLPMP 171
Query: 609 LRTEAIQDDRPEYASIQAILYGPYV 633
+R A AI GP V
Sbjct: 172 VRRVYGNPLARHVAGKVAIQRGPLV 196
>gi|302672069|ref|YP_003832029.1| glycoside hydrolase [Butyrivibrio proteoclasticus B316]
gi|302396542|gb|ADL35447.1| glycoside hydrolase [Butyrivibrio proteoclasticus B316]
Length = 648
Score = 46.6 bits (109), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 56/239 (23%), Positives = 93/239 (38%), Gaps = 20/239 (8%)
Query: 402 SNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA- 460
+N E+C + M+ + + K +Y D ER L N +L E Y+ PL
Sbjct: 330 TNYCETCASVGMMMFGQRMAALKKNASYYDTVERVLYNTILAAMN-LEGDRYFYVNPLEM 388
Query: 461 -PGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P E +Y P+ W CC + + L +Y +E G+YI Q+IS
Sbjct: 389 IPQFCTENTYMDHVKPARQKWFSVACCPPNLARTLASLSQYLYACDE---KGIYINQFIS 445
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGL-TTSLNLRIPTWTSSNGAKAT 574
S L V N + V L T S L T + +R+P + +
Sbjct: 446 STLS------VDNSGQEIFVELKSALLTDGTVDIGISTLQATDIRIRVPAYAKD--MEIA 497
Query: 575 LNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
L+G+ L + N+ +V ++ + + + R A + A A+++GP V
Sbjct: 498 LDGEKLSYIADNNY-AVIALKGGKHRIELNMGIHPRFVAADHNVRADAGKVAVMHGPMV 555
>gi|448391565|ref|ZP_21566711.1| hypothetical protein C477_10858 [Haloterrigena salina JCM 13891]
gi|445665886|gb|ELZ18561.1| hypothetical protein C477_10858 [Haloterrigena salina JCM 13891]
Length = 637
Score = 46.6 bits (109), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 44/210 (20%), Positives = 80/210 (38%), Gaps = 14/210 (6%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSS- 464
E+C + + +F+ + ++ Y + ER+L NG L + Y PL G
Sbjct: 319 ETCAAVGSVFWNHRMFQLSGDVQYPELVERTLYNGFLA-GLSLDATEFFYANPLEVGPDG 377
Query: 465 ---KERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
+ + + ++ CC + LG IY + P VY+ Q++ S
Sbjct: 378 HALADENPDRFSNQRQGWFDCACCPPNAARLIASLGRYIYARATDE-PAVYVNQFVGSEA 436
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
V + + + W VTLT +L +R+P W S AT+ G+
Sbjct: 437 ALTIDDTDVRLRQESALPWAG--DVTLTV-DPAEPTDFALRVRVPEWCSD--VTATVAGE 491
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLT 608
+ ++ V + W D+LT+ +
Sbjct: 492 SRSVEPDDGYIEVAREWEDGDELTVTFGMA 521
>gi|390456185|ref|ZP_10241713.1| hypothetical protein PpeoK3_19381 [Paenibacillus peoriae KCTC 3763]
Length = 647
Score = 46.6 bits (109), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 44/215 (20%), Positives = 90/215 (41%), Gaps = 15/215 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
DS E+C + + + + R + + YAD ER+L NG + G+ + + L +
Sbjct: 331 DSMYCETCASVGLAFWANRMLRLSPDRKYADVLERALYNGTISGMDLDGQRFFYVNPLEV 390
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFE-EEGKYPGVYIIQYIS 515
P + H T ++ CC + + D+IY + + Y +YI ++
Sbjct: 391 NPHQKSRKDQEHVKTERQKWFFCACCPPNLARMIASVEDNIYTQTADTLYTHLYIAGKVN 450
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
L + +I + WD L ++ + S + LRIP W A+ +
Sbjct: 451 LNLSGQEVEITQTHR----YPWDADLSFSIHVAEPTS---FTWALRIPGWCKQ--AEVKV 501
Query: 576 NGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTL 609
NG+ + L ++ + ++W+ D +++ L + +
Sbjct: 502 NGEAISLDHLAKGYVEIQRSWNDGDVVSLHLAMPV 536
>gi|271965305|ref|YP_003339501.1| hypothetical protein [Streptosporangium roseum DSM 43021]
gi|270508480|gb|ACZ86758.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021]
Length = 654
Score = 46.6 bits (109), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 57/248 (22%), Positives = 93/248 (37%), Gaps = 24/248 (9%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL- 459
D E+C + ++ L T ++ YAD ER++ N VL E Y PL
Sbjct: 299 DRAYSETCAGIGSIMLAHRLLLATGDVRYADLAERTMFN-VLATSPALEGRSFFYANPLH 357
Query: 460 --APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
P + E S W CC +++ L + + GV I +
Sbjct: 358 VRVPAAPPEGMNPAAEGGLRSPWFTVSCCPNNIARTYASLAAYVATSDAS---GVQIHHH 414
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ + G ++ +V+ W VT+ GSG ++LR+P W S GA+
Sbjct: 415 TPAEIH-HEGLVL---RVETGYPWS--GEVTVRVVRGGSG---RISLRVPPWAS--GARI 463
Query: 574 TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+ G P+P+ + W D++ + LP+T R A+ GP V
Sbjct: 464 SHGGTTRPVPA--GYAVAEGRWRPGDEIRLHLPMTPRWTYPDRRVDAVRGCAAVERGPLV 521
Query: 634 LAGHSIGD 641
S+ D
Sbjct: 522 YCAESVKD 529
>gi|409439808|ref|ZP_11266847.1| conserved hypothetical protein [Rhizobium mesoamericanum STM3625]
gi|408748645|emb|CCM78028.1| conserved hypothetical protein [Rhizobium mesoamericanum STM3625]
Length = 637
Score = 46.2 bits (108), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 47/214 (21%), Positives = 85/214 (39%), Gaps = 19/214 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
+S E+C + ++ + + YAD E++L NG + + Y PL
Sbjct: 329 ESAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GLSLDGKKFFYENPLE 387
Query: 461 PGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
R +HH P CC + +G +Y E + + + Y R
Sbjct: 388 SAGKHHRWIWHH--CP-----CCPPNIARLLASIGSYMYGVAEDE---IAVHLYGEGRAR 437
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
+K G V W +R+ + ++ + +++LRIP W +NGA +NG+
Sbjct: 438 FKIGGTDVELTQKTRYPWHGAVRLDIKLNAP---VLFAISLRIPEW--ANGATLAVNGEA 492
Query: 580 LPLPSP--GNFLSVTKTWSSDDKLTIQLPLTLRT 611
+ L S + + + W DK+ + +PL R
Sbjct: 493 IDLGSADVDGYARIEREWRDGDKIDLNIPLETRA 526
>gi|118587171|ref|ZP_01544600.1| hypothetical protein OENOO_61069 [Oenococcus oeni ATCC BAA-1163]
gi|118432450|gb|EAV39187.1| hypothetical protein OENOO_61069 [Oenococcus oeni ATCC BAA-1163]
Length = 658
Score = 46.2 bits (108), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 119/536 (22%), Positives = 195/536 (36%), Gaps = 114/536 (21%)
Query: 177 LRGHFVG---------HYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSA- 226
++GH G +L A+A +E LK+ ++ +S Q++ GYLS
Sbjct: 73 MKGHHYGFPFQDTDVYKWLEAAAYSLKYNPDEDLKKITDGLIDLISEAQED--DGYLSTE 130
Query: 227 ----FPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRV 282
+P +F RL+ + Y H I AG++ Y N +AL + M
Sbjct: 131 FQIDYPDRKFKRLKQSHEL---YTMGHYIEAGVV-YYQITGNEKALNIAKKMAN------ 180
Query: 283 QNVIKKYSIERHWQTLNEEAGGMND------VLYKLFCITQDPKHLMLAHLF------DK 330
I+ ++ N + G + L +L+ T++ K+L LAH F DK
Sbjct: 181 -------CIDSNFGLENGKIPGYDGHPEIELALSRLYETTREEKYLKLAHYFLNQRGKDK 233
Query: 331 PCFLGLLALQA-----DDISGF----------------------HSNTHIPIVIGSQMRY 363
F + D I G H+ + + G
Sbjct: 234 NFFDNQIKEDGASSDRDLIDGMRDFPLSYYQASKPIEDQKTADGHAVRVVYLCTGMAYVA 293
Query: 364 EVTGDQLHKEG---------HQLESSGTNIGH------FNFKSDPKRLASNLDSNTEESC 408
+TGDQ E H+ NIG F + D D+ E+C
Sbjct: 294 RLTGDQQLLEACHRFWKDIVHRRMYITGNIGSTTTGEAFTYDYDLPN-----DTMYGETC 348
Query: 409 TTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA--PGSSKE 466
+ + +R + + Y D E+ L NG L + Y+ PL P +SK
Sbjct: 349 ASVGLSFFARQMLAIEAKGEYGDILEKELFNGALA-GMALDGKHFFYVNPLEADPIASKY 407
Query: 467 R--SYHHWGTPSDSFWC-CYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
H +D F C C + + D + G + Q+IS+ + +G
Sbjct: 408 NPGKKHVLTKRADWFGCACCPSNVARLVASVDKYIYTVNGD--TILSHQFISNNAQFGNG 465
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
I V+Q D W + + ++ L L +RIP+W S N +NG+ + L
Sbjct: 466 -IEVSQ--DNHFPWSGEIHYEINNPNQ---LAFKLGIRIPSW-SRNKFGLKINGKKIDLA 518
Query: 584 SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP---EYASIQAILYGPYVLAG 636
S F+ + +D+ LT+ L L + T+ ++ Y I A+ GP V A
Sbjct: 519 SEDGFIYIN---VNDESLTVDLSLDMNTKFMRSSNKVSSNYGKI-AVQRGPIVYAA 570
>gi|448360425|ref|ZP_21549056.1| hypothetical protein C481_00200 [Natrialba asiatica DSM 12278]
gi|445653038|gb|ELZ05910.1| hypothetical protein C481_00200 [Natrialba asiatica DSM 12278]
Length = 674
Score = 46.2 bits (108), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 85/222 (38%), Gaps = 26/222 (11%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ E+C + +R LF +T YAD ER+L N VL + R + Y LA
Sbjct: 343 DTAYAETCAAIGSVFWNRRLFEFTGRARYADLIERTLYNAVL-VGRSRDGTEFFYDNRLA 401
Query: 461 PGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFE-EEGKYPGVYIIQYISSRLD 519
+ R W + CC + LG +Y E +Y+ QYI S
Sbjct: 402 SDGNHHR--QEWFECA----CCPPNIARVLAALGRYLYATGGESDERCLYVNQYIGSSAT 455
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
G VV W+ VTL + +L LR+P+W + +NG+
Sbjct: 456 ATIGDTVVELDQTSGFPWNG--EVTLDV-EPATPTEFALRLRVPSWCEDVSIR--VNGEA 510
Query: 580 LPLP------------SPGNFLSVTKTWSSDD-KLTIQLPLT 608
+P + +L + + W D ++T ++P+
Sbjct: 511 VPTALGDDDSGRNGERTDDGYLVIEREWDGDRVEITFEVPVV 552
>gi|330996652|ref|ZP_08320530.1| hypothetical protein HMPREF9442_01617 [Paraprevotella xylaniphila
YIT 11841]
gi|329572724|gb|EGG54357.1| hypothetical protein HMPREF9442_01617 [Paraprevotella xylaniphila
YIT 11841]
Length = 816
Score = 46.2 bits (108), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 98/249 (39%), Gaps = 41/249 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + +F T + Y D ER+L NGV+ G+ + Y PL S
Sbjct: 341 ETCASIANVYWNHRMFLATGDSRYEDILERALYNGVISGVSLSGD--RFFYDNPLE--SM 396
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
+ W + CC G + + + +Y +GK V++ YI S + Q
Sbjct: 397 GQHGRQAWFGCA----CCPGNVTRFMASVPNYMY-ATQGK--DVFVNLYIQSTASLSTSQ 449
Query: 525 --IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWT--------------SS 568
I + Q D WD +R+ + K T +L RIP W
Sbjct: 450 NKIEIRQTTD--YPWDGNIRLAVHPEKK---QTFALRCRIPGWAQGRPVPTDLYHYTGKG 504
Query: 569 NGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTL-RTEA---IQDDRPEYASI 624
G +NG+D+ + + + W D + + P+ + R EA ++DDR +
Sbjct: 505 KGYTIQVNGKDVDFHVENGYAVILRKWKKGDTVQLDFPMDVRRVEARVEVEDDRGK---- 560
Query: 625 QAILYGPYV 633
AI GP V
Sbjct: 561 AAIERGPIV 569
>gi|312135930|ref|YP_004003268.1| hypothetical protein Calow_1942 [Caldicellulosiruptor owensensis
OL]
gi|311775981|gb|ADQ05468.1| protein of unknown function DUF1680 [Caldicellulosiruptor
owensensis OL]
Length = 658
Score = 46.2 bits (108), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 103/483 (21%), Positives = 184/483 (38%), Gaps = 87/483 (18%)
Query: 182 VGHYLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPT--EQFDRLEALI 239
V +L A++ + + +NE L K++ V+ + Q E GY++ + T E +R L
Sbjct: 85 VYKWLEAASYVLEANYNEDLDRKVNEVIDLIEKAQWE--DGYINTYFTIKEPQNRWTNLQ 142
Query: 240 PVWAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRV---QNVIKKYSIERHWQ 296
Y H I A + Y N L + ++ N + +K Y + +
Sbjct: 143 ECHELYCAGHLIEAAVA-YYLATGNDRLLNIARKFADHINNVFGPDEGKLKGYPGHQEIE 201
Query: 297 TLNEEAGGMNDVLYKLFCITQDPKHLMLAHLF-----DKPCFL-------GLLALQADDI 344
L KL+ +T+D ++L LA F +P + G I
Sbjct: 202 L----------ALIKLYEVTKDERYLNLARYFIEERGKEPYYFDIEWEKRGRTEHWPGLI 251
Query: 345 SGF---HSNTHIPI-----VIGSQMR----YEVTGD--QLHKEGHQLESSGT-------- 382
F ++ TH+P+ +G +R Y D ++ K+ LE+
Sbjct: 252 RNFGREYAQTHLPVRKQKEAVGHAVRATYMYSAMADIARITKDEELLETCKALFKDIVTR 311
Query: 383 ------NIG------HFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYA 430
IG F+F+ D D E+C + ++ + +F Y
Sbjct: 312 KMYITGGIGASAHGESFSFEYDLPN-----DRAYAETCASVGLIFFAHRMFLVDHNSYYY 366
Query: 431 DYYERSLTNGVLGIQRGTEPGVMIYLLPLA--PGSSKER-SYHHWGTPSDSFW---CCYG 484
D E+ L N ++G + Y+ PL P + ++R H P ++ CC
Sbjct: 367 DVIEQILYNNIIG-SMSLDGRSYFYVNPLEVIPKACEKRWDTQHVKVPRQRWFGCACCPP 425
Query: 485 TGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWD-PYLRV 543
S +G IY E + +Y+ YIS+ + G+ KV +++ D P+
Sbjct: 426 NVARLLSSIGKYIYAYSENE---LYVNLYISNEYEVDIGE----NKVKIILNSDYPFGDN 478
Query: 544 TLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDL-PLPSPGNFLSVTKTWSSDDKLT 602
L + + L L LRIP W K +NG++ ++ + KTW ++D++
Sbjct: 479 VLLRINVKNPLAFDLKLRIPKWCVE--YKVFVNGKEENNYKKEKEYVVINKTWKNNDEIF 536
Query: 603 IQL 605
+ L
Sbjct: 537 LNL 539
>gi|149197213|ref|ZP_01874265.1| hypothetical protein LNTAR_12426 [Lentisphaera araneosa HTCC2155]
gi|149139759|gb|EDM28160.1| hypothetical protein LNTAR_12426 [Lentisphaera araneosa HTCC2155]
Length = 799
Score = 46.2 bits (108), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 55/256 (21%), Positives = 98/256 (38%), Gaps = 35/256 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + +F ++ +Y D E SL N L G+ E Y+ PL +
Sbjct: 329 ETCAAIANVFFNYRMFLLHRDASYFDVAEVSLLNNSLAGVN--MEGDKFFYVNPLE--AD 384
Query: 465 KERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS--RL 518
+R ++H G S W CC ++ +Y E + ++ + Y S L
Sbjct: 385 GQRLFNH-GNAGRSHWFDCACCPSNIARLMPQVSGYMYATSEDE---IFSLLYAGSDVSL 440
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN---GA---- 571
D +G++ + Q+ + ++ ++ L + LRIP+W N GA
Sbjct: 441 DLANGKVSLKQETE--YPFEGKVKFDLDMDEDSE---FTFKLRIPSWARDNFLPGALYKY 495
Query: 572 --------KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYAS 623
+NG + F S+ +TWS D + + LP+ + +
Sbjct: 496 ISKPNENWTVKINGAAVQCTLDRGFASIRRTWSKGDVVELDLPMPIMSSVCDTRVDANVG 555
Query: 624 IQAILYGPYVLAGHSI 639
A+ GP VLA +
Sbjct: 556 RIALTRGPLVLAAEEV 571
>gi|448418968|ref|ZP_21580124.1| hypothetical protein C474_15274 [Halosarcina pallida JCM 14848]
gi|445675954|gb|ELZ28481.1| hypothetical protein C474_15274 [Halosarcina pallida JCM 14848]
Length = 642
Score = 45.8 bits (107), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 58/241 (24%), Positives = 94/241 (39%), Gaps = 47/241 (19%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLPLAPGS 463
E+C + ++ L T E YAD ER+L NG L G+ GT Y PL S
Sbjct: 342 ETCAAIGSIFWNQRLLELTGEAKYADLIERTLYNGFLAGVSLDGTR---FFYENPLE--S 396
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
S + W T + CC F+ LG +Y +G + + QY+ S + G
Sbjct: 397 SGDHHRKGWFTCA----CCPPNAARLFASLGRYVYSNVDGV---LTVNQYVGSTVTTTVG 449
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
V + W VTLT + + + LR+P W + A +++G++
Sbjct: 450 GTEVELTQSSSLPWSG--EVTLTVDADEA---VPIRLRVPAWATD--ASVSIDGEEAERS 502
Query: 584 SPGNFLSVTKTWSSDDKLTIQL-------------------------PLTLRTEAIQDDR 618
G ++ + W+ D++T++ PL EA+ +DR
Sbjct: 503 DDGAYVELDGEWNG-DRITVRFGQETELVRAHPAVESDAGRVAVERGPLVYCAEAVDNDR 561
Query: 619 P 619
P
Sbjct: 562 P 562
>gi|375306375|ref|ZP_09771673.1| hypothetical protein WG8_0195 [Paenibacillus sp. Aloe-11]
gi|375081628|gb|EHS59838.1| hypothetical protein WG8_0195 [Paenibacillus sp. Aloe-11]
Length = 647
Score = 45.8 bits (107), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 44/215 (20%), Positives = 89/215 (41%), Gaps = 15/215 (6%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
DS E+C + + + + R + + YAD ER+L NG + G+ + + L +
Sbjct: 331 DSMYCETCASVGLAFWANRMLRLSPDRKYADVLERALYNGTISGMDLDGKRFFYVNPLEV 390
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFE-EEGKYPGVYIIQYIS 515
P + H T ++ CC + + D IY + ++ Y +YI ++
Sbjct: 391 NPHQKSRKDQEHVKTERQKWFFCACCPPNLARMIASVEDHIYTQTDDTLYTHLYIAGKVN 450
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
L ++ +I + WD L ++ + S + LRIP W A+ +
Sbjct: 451 LNLSGQAVEITQTHR----YPWDADLSFSIHVTEPAS---FTWALRIPGWCKQ--AEVKV 501
Query: 576 NGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTL 609
NG+ + L + + + W+ D +++ L + +
Sbjct: 502 NGEVISLDHLAKGYAEIQRIWNDGDVVSLHLAMPV 536
>gi|384136953|ref|YP_005519667.1| hypothetical protein TC41_3269 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
gi|339291038|gb|AEJ45148.1| protein of unknown function DUF1680 [Alicyclobacillus
acidocaldarius subsp. acidocaldarius Tc-4-1]
Length = 632
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 49/241 (20%), Positives = 98/241 (40%), Gaps = 22/241 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG--IQRGTEPGVMIYLLPLA--P 461
E+C + ++ ++ + AYAD ER+L N ++G Q G Y+ PL P
Sbjct: 307 ETCASVGLIFFAKRMLDLAPRSAYADVMERALYNTIIGSMAQDGKH---YCYVNPLEVWP 363
Query: 462 GSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
+++E P+ W CC L D +Y E + +Y+ +I S
Sbjct: 364 RANEENPDRRHVRPTRQAWFGCACCPPNVARLLMSLEDYVYSWHEA-HRTLYVHLHIGSS 422
Query: 518 LDWKSGQIVVNQKVDPVVSW--DPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
++W + + W + LRV+++ + +L +RIP W + +
Sbjct: 423 VEWDLDGSRAQVTMTSGLPWRGEASLRVSMSDGPR----RFALAIRIPGWCAGE-PSLRV 477
Query: 576 NGQDLP---LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
NG+ + + + + + ++ D++ ++ P+ R + + + AI GP
Sbjct: 478 NGKPIAESEVCLKNGYAVIERAFTDGDEVALEFPMEARWVVGHPELRAVSGMAAIERGPL 537
Query: 633 V 633
V
Sbjct: 538 V 538
>gi|227820086|ref|YP_002824057.1| hypothetical protein NGR_b18560 [Sinorhizobium fredii NGR234]
gi|227339085|gb|ACP23304.1| putative cytoplasmic protein [Sinorhizobium fredii NGR234]
Length = 640
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 50/242 (20%), Positives = 97/242 (40%), Gaps = 19/242 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + +AD E++L NG + G+ + Y PL
Sbjct: 327 DTAYAETCASVALVFWASRMLGRGPNRRFADIMEQALYNGAISGLS--LDGKTFFYDNPL 384
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
R H P CC + +G +Y + V++ + RL+
Sbjct: 385 ESTGKHHRWKWH-NCP-----CCPPNIARLVASVGAYMYGVAADEI-AVHLYGESTVRLE 437
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
Q+ + Q + W+ + + + +L+LRIP W ++GA+ +NG
Sbjct: 438 LGGSQVTLRQVTN--YPWEGAVSIRIELDEPRH---FALSLRIPEW--ADGARVAVNGSS 490
Query: 580 LPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+ L + + + WS D++++ LPL LR + + A A++ GP V
Sbjct: 491 IDLDGVMTDGYALIEREWSDGDEISLDLPLRLRPQYANPKVRQDAGRVALMRGPLVYCAE 550
Query: 638 SI 639
+
Sbjct: 551 EV 552
>gi|383763276|ref|YP_005442258.1| hypothetical protein CLDAP_23210 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381383544|dbj|BAM00361.1| hypothetical protein CLDAP_23210 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 636
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 49/212 (23%), Positives = 86/212 (40%), Gaps = 29/212 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLPLAPGS 463
E+C ++ + L ++ E YAD E++L NG + G+ RG Y+ PLA
Sbjct: 329 ETCAAIALILWNHRLLQFAGEGKYADVMEQTLYNGFISGVSLRGDS---FFYVNPLASNG 385
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
S R TP CC + LG+ +Y EG G+++ Y +
Sbjct: 386 SHHR------TPWFECPCCPPNVGRILASLGNYLYSTGEG---GLWVHFYAQNSARTTVD 436
Query: 524 QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS-----NGAKATLNGQ 578
V +++ WD +++ +T + +L LRIP W NGA A +
Sbjct: 437 GTEVGLRLESRYPWDGAVKLMITPAQPQR---FTLYLRIPGWCDRWSLRVNGAAADARVE 493
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR 610
+ ++ +TW D + + L + ++
Sbjct: 494 R-------GYAAIERTWQPGDVVALDLAMPVQ 518
>gi|29348940|ref|NP_812443.1| hypothetical protein BT_3531 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340847|gb|AAO78637.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 687
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 44/77 (57%), Gaps = 7/77 (9%)
Query: 560 LRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
LRIP+WT GA+ +NG+ + + P G +L + + W+ DK+ + LP++L Q ++
Sbjct: 483 LRIPSWTE--GAEVRVNGKKISVKPVSGKYLCIEREWADGDKVEMTLPMSLSMRTWQVNK 540
Query: 619 PEYASIQAILYGPYVLA 635
+ ++ YGP L+
Sbjct: 541 ----NSVSVDYGPLTLS 553
>gi|333381634|ref|ZP_08473313.1| hypothetical protein HMPREF9455_01479 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829563|gb|EGK02209.1| hypothetical protein HMPREF9455_01479 [Dysgonomonas gadei ATCC
BAA-286]
Length = 821
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 75/375 (20%), Positives = 144/375 (38%), Gaps = 64/375 (17%)
Query: 309 LYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPI-----VIGSQMRY 363
L KL+ +T D K+L +A F G + +S H+PI ++G +R
Sbjct: 222 LVKLYSVTDDKKYLDMARYFVDETGRGTDGHRLSP----YSQDHMPILEQEEIVGHAVRA 277
Query: 364 E-----VTGDQLHKEGHQLESSGTN------------IGHFNFKSDPKRLASNLD----S 402
VT + H+L + IG ++ + + + +
Sbjct: 278 GYLYSGVTDVASMQHDHKLFDAVNRVWDNMASKKLYIIGGIGSRAQGEGFGPDYELNNFN 337
Query: 403 NTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAP 461
N E+C + + ++ +F T E Y D ER+L NG++ G+ + Y PLA
Sbjct: 338 NYCETCASIANVYWNQRMFLATGESKYVDILERALYNGLIAGVSLSGDK--FFYGNPLAS 395
Query: 462 GSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI--SSRLD 519
ER+ P CC G + + Y + +Y+ ++ +S++
Sbjct: 396 DGGFERA------PWFGCACCPGNVTRFMASVPGYAYAVNKKD---IYVNLFVEGNSKIK 446
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSS----------- 568
+ ++ + QK W + + + ++K ++ +RIP W
Sbjct: 447 VDNNEVELVQKTK--YPWQGEVEIEVNPAAKEK---FTMLVRIPGWAKGQPVPSDLYQYV 501
Query: 569 NGAKA----TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
+GAK ++NGQD G + + + W + DK++I + + +R + +
Sbjct: 502 DGAKPEVKISVNGQDAKKKIRGGYAVIEREWKAGDKISIHMDMPVRRVQAHKEVKYDEGL 561
Query: 625 QAILYGPYVLAGHSI 639
++ GP V SI
Sbjct: 562 LSMERGPIVYGLESI 576
>gi|440223623|ref|YP_007337019.1| hypothetical protein RTCIAT899_PC03365 [Rhizobium tropici CIAT 899]
gi|440042495|gb|AGB74473.1| hypothetical protein RTCIAT899_PC03365 [Rhizobium tropici CIAT 899]
Length = 643
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 52/237 (21%), Positives = 95/237 (40%), Gaps = 20/237 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
+S E+C + ++ + + YAD E +L NG + G+ + + Y PL
Sbjct: 328 ESAYAETCASVGLVFWANRMLGRGPNRRYADIMEVALYNGAMAGLSQDGK--TFFYENPL 385
Query: 460 APGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
R ++HH P CC + +G +Y + + V++ +R+
Sbjct: 386 ESAGKHHRWTWHH--CP-----CCPPNIARLLASVGSYMYAAADNEI-AVHLYGESKARV 437
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+G + V + WD +R + + +L+LRIP W + GA +NG
Sbjct: 438 PL-AGGVTVQLSQETRYPWDGAIRFEV---NPDRAAKFALSLRIPEW--AEGATLAINGA 491
Query: 579 --DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
DL + + + + W + D + + LPL RT + A ++ GP V
Sbjct: 492 SVDLATVTVDGYARIEREWQAGDSVDLTLPLIPRTLFANPKVRQDAGRATLMRGPLV 548
>gi|383124478|ref|ZP_09945142.1| hypothetical protein BSIG_3498 [Bacteroides sp. 1_1_6]
gi|251839029|gb|EES67113.1| hypothetical protein BSIG_3498 [Bacteroides sp. 1_1_6]
Length = 687
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 44/77 (57%), Gaps = 7/77 (9%)
Query: 560 LRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
LRIP+WT GA+ +NG+ + + P G +L + + W+ DK+ + LP++L Q ++
Sbjct: 483 LRIPSWTE--GAEVRVNGKKISVKPVSGKYLCIEREWADGDKVEMTLPMSLSMRTWQVNK 540
Query: 619 PEYASIQAILYGPYVLA 635
+ ++ YGP L+
Sbjct: 541 ----NSVSVDYGPLTLS 553
>gi|313147858|ref|ZP_07810051.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|313136625|gb|EFR53985.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
Length = 678
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 88/414 (21%), Positives = 159/414 (38%), Gaps = 41/414 (9%)
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEE 301
W P + KIL QY A N + R+ +M YF +++ + +K +W E
Sbjct: 159 WWPRMVMLKIL----QQYYSATNDQ--RVIRFMTNYFRYQLKTLPEK--PLGNWTFWAEF 210
Query: 302 AGGMN-DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
N +Y L+ IT D L L L K F + + D+ ++ + + G +
Sbjct: 211 RACDNLQAVYWLYNITGDSFLLDLGKLIHKQSFSFVDMVNRGDLRRINTIHCVNLAQGIK 270
Query: 361 ---MRYEVTGDQLHKEGHQLESSGTNIGHFNFKSD-----PKRLASNLDSNTEESCTTYN 412
+ Y+ D+ + + ++ + ++I F+ + + L +N + E C+
Sbjct: 271 EPVIYYQQEPDKAYLDA--VKRAFSDIRQFHGQPQGMYGGDEALHANNPTQGSELCSAVE 328
Query: 413 MLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLP----LAPGSSKERS 468
++ + T +I +AD+ ER N L Q + Y + +
Sbjct: 329 LMYSLEKMVEITGDIDFADHLERIAFN-ALPTQISDDFMTKQYFQQANQVMVTRHRRNFD 387
Query: 469 YHHWGTPS-----DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
H GT + + CC + + K S+++ G+ + Y S + K
Sbjct: 388 QDHGGTDNCFGLLTGYPCCASNMHQGWPKFTQSLWYATPDG--GLAVTAYAPSEVTAKVA 445
Query: 524 Q-IVVNQKVDPVVSWDPYLRVTL-TFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP 581
+ +V D D + TL + K + +L LRIP W G ++NGQ L
Sbjct: 446 EGCMVTFCEDTYYPMDDKISFTLQSMDKKRKEVNFALQLRIPKWCKQAGI--SVNGQLLQ 503
Query: 582 LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G V + W D++ + LP+ + + Y + AI GP V A
Sbjct: 504 HVEGGRMAVVDRIWKKGDRVELHLPMEVTADTW------YENSVAIERGPLVFA 551
>gi|380693342|ref|ZP_09858201.1| hypothetical protein BfaeM_05087 [Bacteroides faecis MAJ27]
Length = 687
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 45/88 (51%), Gaps = 7/88 (7%)
Query: 549 SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPL 607
S G + LRIP+WT GA+ +NG+ + P G +L + + W DK+ + LP+
Sbjct: 472 STGEKVNFPFYLRIPSWTE--GAEVRVNGKKISAKPVSGKYLCIEREWEDGDKVEMTLPM 529
Query: 608 TLRTEAIQDDRPEYASIQAILYGPYVLA 635
+L Q ++ + ++ YGP L+
Sbjct: 530 SLSMRTWQVNK----NSVSVDYGPLTLS 553
>gi|167537610|ref|XP_001750473.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771013|gb|EDQ84687.1| predicted protein [Monosiga brevicollis MX1]
Length = 2823
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 70/172 (40%), Gaps = 21/172 (12%)
Query: 114 FLKEVSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARLPAPGEPYGGWEEP 173
F EV +V L S+ RA N+ YLL D L++ FR P P GW+
Sbjct: 93 FQVEVPTSNVTLTPGSVLRRAFDANIIYLLGHPTDDLLYFFRLRNGNPNPPGQCWGWD-- 150
Query: 174 SCELRGHFVGHYLSASALM--WASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQ 231
LRG G +L S + W N +L+ +M VV+ + Q++ GY F +
Sbjct: 151 -ANLRGSLAGEFLMGSGGISRWPMA-NATLRARMDEVVAGI--LQEQEADGYAMGFARNE 206
Query: 232 FDRLEALIPVWA---PYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYN 280
W P Y + GLL+ A N +AL + + +F N
Sbjct: 207 ---------TWTHENPDYVTSWVTHGLLEA-AIAGNEQALPLIRRHLNWFNN 248
>gi|403252781|ref|ZP_10919089.1| hypothetical protein EMP_03370 [Thermotoga sp. EMP]
gi|402811987|gb|EJX26468.1| hypothetical protein EMP_03370 [Thermotoga sp. EMP]
Length = 644
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 95/234 (40%), Gaps = 17/234 (7%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGI--QRGTEPGVMIYLLPLAPGS 463
ESC L + + + E +AD E L N +LG GT+ L + P
Sbjct: 329 ESCAAVGNLLWTWRMLKIFGEARFADIVELVLYNAILGAISLDGTKFFYTNTLRQVNP-P 387
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK-- 521
K R W + + C+ + S+ + G+++ Y +++L K
Sbjct: 388 FKLR----WSRKREPYITCFCCPPNVVRTIAQSVTYAYTTSKDGIWVNLYGTNKLRVKLA 443
Query: 522 -SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDL 580
+ I + Q + W+ Y+++ L KG+ + LRIP W S ++N Q +
Sbjct: 444 TNTHIALAQYSE--YPWNGYIKIVLE-EIKGNP-NFKIYLRIPGW--SRNVNVSVNRQGI 497
Query: 581 PLP-SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
PG +LS+ K W D + + +PL ++ E + AI+ GP V
Sbjct: 498 KKDIVPGTYLSLEKNWEEGDVIEMDIPLEVKLIEAHPLVEECRNQVAIMRGPIV 551
>gi|317474351|ref|ZP_07933625.1| hypothetical protein HMPREF1016_00604 [Bacteroides eggerthii
1_2_48FAA]
gi|316909032|gb|EFV30712.1| hypothetical protein HMPREF1016_00604 [Bacteroides eggerthii
1_2_48FAA]
Length = 619
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 60/273 (21%), Positives = 103/273 (37%), Gaps = 29/273 (10%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + M+ + + ++T + Y D ERS+ NG L GI + Y+ PL
Sbjct: 336 ETCASVGMVLWNHRMNQFTGDSKYIDVLERSMYNGALAGISLNGDR--FFYVNPL----- 388
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
E H P CC +G+ IY + +++ YI + +
Sbjct: 389 -ESKGDHHRLPWYGCACCPSQLSRFLPSIGNYIYGISDN---AIWVNLYIGNVAEVNVDG 444
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS 584
+ V K + W+ ++ T+ + + L LRIP W +NG+ +
Sbjct: 445 VQVTMKEETKYPWNGRIKFTINADEE---INKELRLRIPGWCKK--YNLFINGKKVKKLR 499
Query: 585 PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI--QAILYGPYVLAGHSIGDW 642
V W+S D I+L + E ++ D +I +AI GP V +
Sbjct: 500 IDKGYVVIADWNSGD--NIELDFDMPVEVVKSDVRVKQNIGKRAIQRGPLVYCIEDAQNK 557
Query: 643 DITESATSLSDWITP---IPASYNSQLITFTQE 672
D E +I+P +N L+ Q+
Sbjct: 558 DTIEGI-----YISPKTSFKTDFNVNLLNGVQQ 585
>gi|269926240|ref|YP_003322863.1| hypothetical protein Tter_1126 [Thermobaculum terrenum ATCC
BAA-798]
gi|269789900|gb|ACZ42041.1| protein of unknown function DUF1680 [Thermobaculum terrenum ATCC
BAA-798]
Length = 628
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 53/208 (25%), Positives = 91/208 (43%), Gaps = 25/208 (12%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLG--IQRGTEPGVMIYLLPLAPGS 463
E+C + + L + YAD E +L N VL Q G + Y PLA
Sbjct: 325 ETCAAIASIMWNWRLLLLEGDPKYADLIEHTLYNAVLPSIAQSGDK---YFYENPLA--- 378
Query: 464 SKERSYHHWGTPSDSFWC-CYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS--RLDW 520
Y+ T S+ F C C I + K V+I QY+ S R+
Sbjct: 379 ----DYYALHTRSEWFECACCPPNIARLIASLPGYLYSTANK--AVWIHQYVPSINRVQI 432
Query: 521 KSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDL 580
+ G+ + V+ W+ +R+ + + + +LNLRIP+W+ S ++ TL +
Sbjct: 433 E-GEDELEFAVETNYPWEDEIRIKIL-----TNMHCTLNLRIPSWSQS--SEITLPNNEH 484
Query: 581 PLPSPGNFLSVTKTWSSDDKLTIQLPLT 608
+ GN+ ++ + W++ D LT++L L+
Sbjct: 485 LQAAGGNYFTIERHWNAGDLLTLRLDLS 512
>gi|229822407|ref|YP_002883933.1| hypothetical protein Bcav_3930 [Beutenbergia cavernae DSM 12333]
gi|229568320|gb|ACQ82171.1| protein of unknown function DUF1680 [Beutenbergia cavernae DSM
12333]
Length = 640
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 55/238 (23%), Positives = 95/238 (39%), Gaps = 24/238 (10%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D E+C ++ +R + + Y D ER+L NGV+ G+ + Y PL
Sbjct: 334 DCAYAETCAAIGLVFWARRMASLSGSAQYVDVLERALYNGVIAGVSADGQK--FFYENPL 391
Query: 460 AP-GSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYP-GVYIIQYISSR 517
A GS+ R + CC + LG +Y +Y+ ++ R
Sbjct: 392 ASDGSAVRRDWFDCA-------CCPPNLARLEASLGSYVYAASADSLAVDLYVGSTVARR 444
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
L + + Q D V LT SS + SL LR P+W + G ++NG
Sbjct: 445 L--GGADVRLRQSSSSPAGGD----VALTVSSSAPAV-WSLLLRAPSW--ARGTAVSVNG 495
Query: 578 Q--DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+ D + G ++++ + W+ D++ + + +R A A+ YGP+V
Sbjct: 496 EATDAVVGEDG-YVTLRREWADGDRVDVAFDVEVRRLYASTHVAADAGRTALAYGPFV 552
>gi|212717058|ref|ZP_03325186.1| hypothetical protein BIFCAT_02005 [Bifidobacterium catenulatum DSM
16992 = JCM 1194]
gi|212660046|gb|EEB20621.1| hypothetical protein BIFCAT_02005 [Bifidobacterium catenulatum DSM
16992 = JCM 1194]
Length = 657
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 50/237 (21%), Positives = 95/237 (40%), Gaps = 10/237 (4%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + M +R + YAD ER L NG + GI + + L
Sbjct: 333 DTMYGETCASVAMSMFARQMLLLEPNGEYADVLERELFNGAIAGISLDGKQYYYVNALET 392
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
+P S HH + ++ CC + + +Y E +G V Q+I++
Sbjct: 393 SPDGSDNPDRHHVLSHRVDWFGCACCPANVARLIASVDRYVYTERDGGRT-VLAHQFIAN 451
Query: 517 RLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
+ + SG + V Q+ D W+ ++ + ++ + + +RIPTW++ + A T +
Sbjct: 452 QASFDSG-LHVEQRSD--FPWNGHIEYMVELPAEAAD-SVRFGVRIPTWSADSYA-LTCD 506
Query: 577 GQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
G + F+ + + + L + +R A A++ GP V
Sbjct: 507 GVAVKTAPENGFVYFAVAPGTALHVVLDLDMAVRLVRANSHVRCDAGRVAVMRGPLV 563
>gi|395803606|ref|ZP_10482850.1| hypothetical protein FF52_17068 [Flavobacterium sp. F52]
gi|395434160|gb|EJG00110.1| hypothetical protein FF52_17068 [Flavobacterium sp. F52]
Length = 682
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 69/293 (23%), Positives = 116/293 (39%), Gaps = 28/293 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
E+C + + + + T + YAD E +L N VL E +Y PL S
Sbjct: 367 ETCANIGNVLWNWRMLQITGDAKYADIVELALYNSVLS-GMNLEGDKFLYNNPL--NVSN 423
Query: 466 ERSYHH-WGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDW 520
+ +H WG + + CC + +++G+ Y + G+Y+ Y S+ L+
Sbjct: 424 DLPFHQRWGNVREGYIALSNCCAPNVTRTVAEVGNYAYNLSKD---GLYVNLYGSNTLNT 480
Query: 521 KS--GQIV-VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
K+ G+ + + Q+ + WD +VTL L LRIP W S N + N
Sbjct: 481 KTLNGETLEIEQQTN--YPWDG--KVTLKILKAPKDLQNFF-LRIPGW-SQNAEVSVNNS 534
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+ G +L + + W D + + +P+ + E + A+ GP V
Sbjct: 535 KISDKIVSGTYLKLNQKWKKGDVIELNMPMPVELMEANPLVEEVKNQVAVKRGPLVYCLE 594
Query: 638 SIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSITME 690
S D + TS++D I + NS T E N K V + I +
Sbjct: 595 S----DQLPANTSVNDVILNL----NSDFKTDFTELKNRKLVTIKATSKIAAD 639
>gi|298386662|ref|ZP_06996217.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|298260336|gb|EFI03205.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 687
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 44/77 (57%), Gaps = 7/77 (9%)
Query: 560 LRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
LRIP+WT GA+ +NG+ + + P G +L + + W+ DK+ + LP++L Q ++
Sbjct: 483 LRIPSWTE--GAEVRVNGKKISVKPVSGKYLCIEREWADGDKVEMTLPMSLSMRMWQVNK 540
Query: 619 PEYASIQAILYGPYVLA 635
+ ++ YGP L+
Sbjct: 541 ----NSVSVDYGPLTLS 553
>gi|319951999|ref|YP_004163266.1| hypothetical protein [Cellulophaga algicola DSM 14237]
gi|319420659|gb|ADV47768.1| protein of unknown function DUF1680 [Cellulophaga algicola DSM
14237]
Length = 699
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 42/81 (51%), Gaps = 3/81 (3%)
Query: 560 LRIPTWTSSNGAKATLNGQDLP-LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
LRIP W + G+K +NG++ L +PG + ++ +TW ++D + + LPL +
Sbjct: 527 LRIPEW--AEGSKIMINGKESEILATPGTYATLNRTWKANDTIRLDLPLAINFIEGHGRI 584
Query: 619 PEYASIQAILYGPYVLAGHSI 639
E + AI GP V S+
Sbjct: 585 EEVRNQVAIKRGPVVYCLESV 605
>gi|424665928|ref|ZP_18102964.1| hypothetical protein HMPREF1205_01803 [Bacteroides fragilis HMW
616]
gi|404574181|gb|EKA78932.1| hypothetical protein HMPREF1205_01803 [Bacteroides fragilis HMW
616]
Length = 678
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 88/414 (21%), Positives = 158/414 (38%), Gaps = 41/414 (9%)
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEE 301
W P + KIL QY A N + R+ +M YF +++ + +K +W E
Sbjct: 159 WWPRMVMLKIL----QQYYSATNDQ--RVIRFMTNYFRYQLKTLPEK--PLGNWTFWAEF 210
Query: 302 AGGMN-DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
N +Y L+ IT D L L L K F + + D+ ++ + + G +
Sbjct: 211 RACDNLQAVYWLYNITGDSFLLDLGKLIHKQSFSFVDMVNRGDLRRINTIHCVNLAQGIK 270
Query: 361 ---MRYEVTGDQLHKEGHQLESSGTNIGHFNFKSD-----PKRLASNLDSNTEESCTTYN 412
+ Y+ D+ + + ++ + ++I F+ + + L N + E C+
Sbjct: 271 EPVIYYQQEPDKAYLDA--VKRAFSDIRQFHGQPQGMYGGDEALHGNNPTQGSELCSAVE 328
Query: 413 MLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLP----LAPGSSKERS 468
++ + T +I +AD+ ER N L Q + Y + +
Sbjct: 329 LMYSLEKMVEITGDIDFADHLERIAFN-ALPTQISDDFMTKQYFQQANQVMVTRHRRNFD 387
Query: 469 YHHWGTPS-----DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
H GT + + CC + + K S+++ G+ + Y S + K
Sbjct: 388 QDHGGTDNCFGLLTGYPCCASNMHQGWPKFTQSLWYATPDG--GLAVTAYAPSEVTAKVA 445
Query: 524 Q-IVVNQKVDPVVSWDPYLRVTL-TFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP 581
+ +V D D + TL + K + +L LRIP W G ++NGQ L
Sbjct: 446 EGCMVTFCEDTYYPMDDKISFTLQSMDKKRKEVNFALQLRIPKWCKQAGI--SVNGQLLQ 503
Query: 582 LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G V + W D++ + LP+ + + Y + AI GP V A
Sbjct: 504 HVEGGRMAVVDRIWKKGDRVELHLPMEVTADTW------YENSVAIERGPLVFA 551
>gi|436837570|ref|YP_007322786.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
gi|384068983|emb|CCH02193.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
Length = 683
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 64/134 (47%), Gaps = 9/134 (6%)
Query: 481 CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ-IVVNQKVDPVVSWDP 539
CC + +++Y G+ ++ Y +S + K G V K + ++
Sbjct: 404 CCQHNHANGWVYYAENLYMATPDN--GLAVVLYNASEVTAKVGNGSAVTLKQETSYPFEE 461
Query: 540 YLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPS-PGNFLSVTKTWSSD 598
+R+T+ + + L LR+P W S+ + +NG+ +P+ + G ++ +T TW S
Sbjct: 462 QVRLTVQAARPTA---FPLYLRVPAWCSNPTVR--VNGRAVPVTAKAGQYIVLTDTWQSG 516
Query: 599 DKLTIQLPLTLRTE 612
DK+T+ LP+ LR
Sbjct: 517 DKITLDLPMRLRVR 530
>gi|423303854|ref|ZP_17281853.1| hypothetical protein HMPREF1072_00793 [Bacteroides uniformis
CL03T00C23]
gi|423307425|ref|ZP_17285415.1| hypothetical protein HMPREF1073_00165 [Bacteroides uniformis
CL03T12C37]
gi|392686852|gb|EIY80152.1| hypothetical protein HMPREF1072_00793 [Bacteroides uniformis
CL03T00C23]
gi|392690034|gb|EIY83305.1| hypothetical protein HMPREF1073_00165 [Bacteroides uniformis
CL03T12C37]
Length = 663
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 52/239 (21%), Positives = 99/239 (41%), Gaps = 31/239 (12%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + ++ LF + Y D ER+L NG++ G+ + G Y PL+
Sbjct: 339 ETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLISGVS--LDGGKFFYPNPLSCDGK 396
Query: 465 KERSYHHWGTPSDSFWC-CYGTGIESF-SKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
+ H T F C C + I F L +Y ++ + VY+ ++S+R + K
Sbjct: 397 YHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQ---VYVNLFLSNRAELKL 453
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN------------- 569
+ V + + W+ +RV + ++G+ L ++N+RIP W +
Sbjct: 454 NEKKVVLEQETGYPWNGDIRVKV---AQGN-LPFTMNIRIPGWVRGSVLPSDLYSYADDL 509
Query: 570 --GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRT----EAIQDDRPEYA 622
G + +NG+++ +L + + W D + + + R E + DR A
Sbjct: 510 KLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMQPRVVKANEKVVADRGRVA 568
>gi|115376362|ref|ZP_01463600.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
gi|310821528|ref|YP_003953886.1| hypothetical protein STAUR_4279 [Stigmatella aurantiaca DW4/3-1]
gi|115366641|gb|EAU65638.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
gi|309394600|gb|ADO72059.1| conserved uncharacterized protein MerU [Stigmatella aurantiaca
DW4/3-1]
Length = 940
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 69/154 (44%), Gaps = 16/154 (10%)
Query: 543 VTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLT 602
+TL+ + G T L LRIP W ++ + +NG +P+ + S T+TW++ D +T
Sbjct: 455 ITLSLAMTGPA-TFPLQLRIPAWCTA--PELRINGATVPVSGGPRYASTTRTWANGDTVT 511
Query: 603 IQLPL--TLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDWITPIPA 660
++LP+ T+RT P + ++ +GP + +W T + +
Sbjct: 512 LRLPMRPTVRTW------PAQHNAVSVNHGPLTFSLRITENWVQTGGTAQWPQYDVHAGS 565
Query: 661 SYNSQL-----ITFTQEYGNTKFVLTNSNQSITM 689
S+N L I+ T GN T +N I +
Sbjct: 566 SWNYGLVPGAAISVTTGVGNLADPFTPANAPIRL 599
>gi|365865404|ref|ZP_09405054.1| putative secreted protein [Streptomyces sp. W007]
gi|364005161|gb|EHM26251.1| putative secreted protein [Streptomyces sp. W007]
Length = 408
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 5/77 (6%)
Query: 543 VTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLT 602
VTL+ +S L L LR+P W + + +NGQ + P+ F V +TWSS DK+T
Sbjct: 137 VTLSLTSPKP-LRFPLVLRVPAWCADPEIR--VNGQRVAAPAGPAFTRVERTWSSGDKVT 193
Query: 603 IQLP--LTLRTEAIQDD 617
++LP T+RT A D
Sbjct: 194 LRLPQRTTVRTWADNHD 210
>gi|340346782|ref|ZP_08669901.1| hypothetical protein HMPREF9136_0899 [Prevotella dentalis DSM 3688]
gi|433652017|ref|YP_007278396.1| hypothetical protein Prede_1029 [Prevotella dentalis DSM 3688]
gi|339610999|gb|EGQ15839.1| hypothetical protein HMPREF9136_0899 [Prevotella dentalis DSM 3688]
gi|433302550|gb|AGB28366.1| hypothetical protein Prede_1029 [Prevotella dentalis DSM 3688]
Length = 1163
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 91/247 (36%), Gaps = 36/247 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + +F E Y D ERSL NGVL GI G + Y PL
Sbjct: 347 ETCAAIANIYWNWRMFLTYGESKYYDVIERSLYNGVLSGIGLGGDH--FFYPNPLESTGG 404
Query: 465 KERSYHHWGTPSDSFWC-CYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS--SRLDWK 521
RS W F C C + + F + +G VY+ ++ + +
Sbjct: 405 YSRS--AW------FGCACCPSNLCRFIPSVPGYVYACQGN--SVYVNLFVQGHASIGLA 454
Query: 522 SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA---------- 571
+G + + Q WD RVTLT S L +R+P W S
Sbjct: 455 NGNMQIAQTTG--YPWDG--RVTLTVSHAPES-EVKLMIRVPGWAKSQPVPSRLYHYLQP 509
Query: 572 -----KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQA 626
K TLNG + +++V++ W D L + P+ +R D + A
Sbjct: 510 QKPSLKLTLNGTAVDYHEEKGYIAVSRQWHDGDALQVNFPMEVRRVVANDSVAADRGMVA 569
Query: 627 ILYGPYV 633
+ GP V
Sbjct: 570 LERGPIV 576
>gi|423281130|ref|ZP_17260041.1| hypothetical protein HMPREF1203_04258 [Bacteroides fragilis HMW
610]
gi|404583294|gb|EKA87975.1| hypothetical protein HMPREF1203_04258 [Bacteroides fragilis HMW
610]
Length = 678
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 88/414 (21%), Positives = 158/414 (38%), Gaps = 41/414 (9%)
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEE 301
W P + KIL QY A N + R+ +M YF +++ + +K +W E
Sbjct: 159 WWPRMVMLKIL----QQYYSATNDQ--RVIRFMTNYFRYQLKTLPEK--PLGNWTFWAEF 210
Query: 302 AGGMN-DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
N +Y L+ IT D L L L K F + + D+ ++ + + G +
Sbjct: 211 RACDNLQAVYWLYNITGDSFLLDLGKLIHKQSFSFVDMVNRGDLRRINTIHCVNLAQGIK 270
Query: 361 ---MRYEVTGDQLHKEGHQLESSGTNIGHFNFKSD-----PKRLASNLDSNTEESCTTYN 412
+ Y+ D+ + + ++ + ++I F+ + + L N + E C+
Sbjct: 271 EPVIYYQQEPDKAYLDA--VKRAFSDIRQFHGQPQGMYGGDEALHGNNPTQGSELCSAVE 328
Query: 413 MLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLP----LAPGSSKERS 468
++ + T +I +AD+ ER N L Q + Y + +
Sbjct: 329 LMYSLEKMVEITGDIDFADHLERIAFN-ALPTQISDDFMTKQYFQQANQVMVTRHRRNFD 387
Query: 469 YHHWGTPS-----DSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
H GT + + CC + + K S+++ G+ + Y S + K
Sbjct: 388 QDHGGTDNCFGLLTGYPCCASNMHQGWPKFTQSLWYATPDG--GLAVTAYAPSEVTAKVA 445
Query: 524 Q-IVVNQKVDPVVSWDPYLRVTL-TFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP 581
+ +V D D + TL + K + +L LRIP W G ++NGQ L
Sbjct: 446 EGCMVTFCEDTYYPMDDKISFTLQSMDKKRKEVNFALQLRIPKWCKQAGI--SVNGQLLQ 503
Query: 582 LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
G V + W D++ + LP+ + + Y + AI GP V A
Sbjct: 504 HVEGGRMAVVDRIWRKGDRVELHLPMEVTADTW------YENSVAIERGPLVFA 551
>gi|393782812|ref|ZP_10370994.1| hypothetical protein HMPREF1071_01862 [Bacteroides salyersiae
CL02T12C01]
gi|392672197|gb|EIY65667.1| hypothetical protein HMPREF1071_01862 [Bacteroides salyersiae
CL02T12C01]
Length = 675
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/134 (23%), Positives = 64/134 (47%), Gaps = 6/134 (4%)
Query: 478 SFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK-SGQIVVNQKVDPVVS 536
F CC + + KL +++F G+ + Y S++ K +G + V+ + +
Sbjct: 399 GFPCCTSNLHQGWPKLVQNLWFATYDN--GIAALVYAPSKVTAKVAGNVTVDIEENTGYP 456
Query: 537 WDPYLRVTLTFSSKGSGLTT-SLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTW 595
+D +R + F K + +LRIP W + +NG+ + N + +TW
Sbjct: 457 FDEIIRFKMNFPDKKARTARFPFHLRIPEWCEKPVIR--VNGEVVSCVPVANIAVLERTW 514
Query: 596 SSDDKLTIQLPLTL 609
S+D++T++LP+++
Sbjct: 515 KSNDEVTLELPMSV 528
>gi|410866647|ref|YP_006981258.1| hypothetical protein PACID_21170 [Propionibacterium acidipropionici
ATCC 4875]
gi|410823288|gb|AFV89903.1| hypothetical protein PACID_21170 [Propionibacterium acidipropionici
ATCC 4875]
Length = 632
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 91/244 (37%), Gaps = 22/244 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL------ 459
E+C + V+ L T +I+ AD ER+L N V R + Y PL
Sbjct: 319 ETCAGIGSVMVAWRLLLATGDISLADVIERTLYNVVAASPR-LDGRAFFYTNPLHQRVRA 377
Query: 460 ---APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
A R+ P CC + ++LG + G+ ++QY +
Sbjct: 378 EEVADDRPSPRAEAQLRAPWFEVSCCPTNVSRTLAQLGAYLAITSAD---GLQLLQYAAG 434
Query: 517 RLDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
R+ G V +VD D + VT+ + G L LRIP W + GA T+
Sbjct: 435 RISTALPGGGHVTVRVDTHYPDDGRIAVTVEQAPAGP---WQLTLRIPRW--AGGATVTV 489
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
GQ +P + +S + D + + LP+ R A+ GP VL
Sbjct: 490 GGQTRTAEAPAHVVS---GLVAGDTVVLDLPMAPRFTFPDPRIDAVRGSVAVERGPLVLC 546
Query: 636 GHSI 639
S+
Sbjct: 547 AESV 550
>gi|359411024|ref|ZP_09203489.1| protein of unknown function DUF1680 [Clostridium sp. DL-VIII]
gi|357169908|gb|EHI98082.1| protein of unknown function DUF1680 [Clostridium sp. DL-VIII]
Length = 665
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 49/220 (22%), Positives = 91/220 (41%), Gaps = 23/220 (10%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + ++ + Y D E+ L N V+ G+ + + L +
Sbjct: 341 DTMYSETCASVGLIFFAYNMLKNDPLSIYGDVMEKCLYNSVISGMALDGKHFFYVNPLEV 400
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P +S++ P+ W CC + + LG IY +YI YIS
Sbjct: 401 NPEASEKDPTKSHVKPTRPAWFGCACCPPNVARTLTSLGKYIYTVSNST---LYIHLYIS 457
Query: 516 SRLDWKSGQIVVNQKV----DPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+ +S +V N K+ + W + ++L + + SL RIP W +S
Sbjct: 458 N----ESNILVYNNKISVKQETSYPWSENITISL---AGEENVNLSLAFRIPEWCNSYSI 510
Query: 572 KATLNGQDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLR 610
K ++P S N + +T+TWS D + I + ++
Sbjct: 511 KV---NSEIPEYSICNGYAYITRTWSKSDIIEIHFKMEIQ 547
>gi|326781063|ref|ZP_08240328.1| protein of unknown function DUF1680 [Streptomyces griseus
XylebKG-1]
gi|326661396|gb|EGE46242.1| protein of unknown function DUF1680 [Streptomyces griseus
XylebKG-1]
Length = 814
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 42/73 (57%), Gaps = 5/73 (6%)
Query: 543 VTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLT 602
VTL+ ++ L L LR+P W S + +NGQ + PS F + +TWSS D++T
Sbjct: 464 VTLSLTAPKP-LAFPLVLRVPAWCSDPDIR--VNGQRVAAPSGPAFTRIERTWSSGDRVT 520
Query: 603 IQLP--LTLRTEA 613
++LP T+RT A
Sbjct: 521 LRLPQRTTVRTWA 533
>gi|270295877|ref|ZP_06202077.1| six-hairpin glycosidase [Bacteroides sp. D20]
gi|270273281|gb|EFA19143.1| six-hairpin glycosidase [Bacteroides sp. D20]
Length = 663
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 52/239 (21%), Positives = 99/239 (41%), Gaps = 31/239 (12%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + ++ LF + Y D ER+L NG++ G+ + G Y PL+
Sbjct: 339 ETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLISGVS--LDGGKFFYPNPLSCDGK 396
Query: 465 KERSYHHWGTPSDSFWC-CYGTGIESF-SKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
+ H T F C C + I F L +Y ++ + VY+ ++S+R + K
Sbjct: 397 YHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQ---VYVNLFLSNRAELKL 453
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN------------- 569
+ V + + W+ +RV + ++G+ L ++N+RIP W +
Sbjct: 454 NEKKVVLEQETGYPWNGDIRVKV---AQGN-LPFTMNIRIPGWVRGSVLPSDLYSYADDL 509
Query: 570 --GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRT----EAIQDDRPEYA 622
G + +NG+++ +L + + W D + + + R E + DR A
Sbjct: 510 KLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMHPRVVKANEKVVADRGRVA 568
>gi|283456555|ref|YP_003361119.1| hypothetical protein BDP_1703 [Bifidobacterium dentium Bd1]
gi|283103189|gb|ADB10295.1| Conserved hypothetical protein [Bifidobacterium dentium Bd1]
Length = 586
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 53/242 (21%), Positives = 87/242 (35%), Gaps = 9/242 (3%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + M +SR + + YAD ER L NG + GI + + L
Sbjct: 263 DTMYGETCASVGMSMLSRQMLLLEPKGEYADVLERELFNGAIAGISLDGKQYYYVNALES 322
Query: 460 APGSSKERSYHH-WGTPSDSFWC-CYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
P HH D F C C I D + E V Q+I++
Sbjct: 323 TPDGLDNPDRHHVLSHRVDWFGCACCPANIARLIASVDRYMYTERDGGKTVLSHQFIANE 382
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ SG VV + P W ++ + + +RIP+W S+N ++G
Sbjct: 383 ATFDSGLYVVQRSDMP---WSGHVEFEVNLAEGAQ--PVRFGVRIPSW-SANAYALAVDG 436
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+ F+ +LT+ L ++++ A AI+ GP V
Sbjct: 437 EPCEKNVEDGFVYFDVFAGQTLRLTLDLDMSVKLIRANSHVRSDAGKVAIMRGPLVYCAE 496
Query: 638 SI 639
+
Sbjct: 497 QV 498
>gi|433678396|ref|ZP_20510262.1| hypothetical protein BN444_02464 [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430816487|emb|CCP40741.1| hypothetical protein BN444_02464 [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 664
Score = 44.3 bits (103), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 48/242 (19%), Positives = 91/242 (37%), Gaps = 19/242 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL- 459
D+ ESC + ++ + + + + YAD ER+L N VL + Y+ PL
Sbjct: 334 DTAYNESCASIGLMMFANRMLQLAPDSRYADVMERALYNTVLA-GMALDGRHFFYVNPLE 392
Query: 460 --APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
P + H P W CC + LG +Y + +Y+ Y
Sbjct: 393 VHPPTVHGNHGFDHV-KPVRQRWFGCACCPPNIARVLTSLGHYLYTRRDDT---LYVNLY 448
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ S + G + + W + +++ + + +L LR+P W + +
Sbjct: 449 VGSDAAFDVGGQTLTLRQRGEYPWQEQVELSVDCDAP---VEAALALRLPDWCRA--PQL 503
Query: 574 TLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
LNG+ + + + + + + W D L + LP+ + + A A+ GP
Sbjct: 504 RLNGEAVAIAAHLQHGYCVLRRRWQRGDTLHLHLPMPVMRVSGHPRVRHLAGKVALQRGP 563
Query: 632 YV 633
V
Sbjct: 564 LV 565
>gi|146301833|ref|YP_001196424.1| hypothetical protein Fjoh_4097 [Flavobacterium johnsoniae UW101]
gi|146156251|gb|ABQ07105.1| protein of unknown function DUF1680 [Flavobacterium johnsoniae
UW101]
Length = 672
Score = 44.3 bits (103), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 64/289 (22%), Positives = 108/289 (37%), Gaps = 39/289 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + + + T + YAD E +L N VL G+ E +Y PL S
Sbjct: 357 ETCANIGNVLWNWRMLQITGDAKYADIIELALYNSVLSGMDLEGEK--FLYNNPL--NVS 412
Query: 465 KERSYHH-WGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
+ +H WG + + CC + +++G+ Y + G+Y+ Y S++L
Sbjct: 413 NDLPFHQRWGNEREGYIALSNCCAPNVTRTIAEVGNYAYNISK---EGLYVNLYGSNQLK 469
Query: 520 WKS---GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN 576
KS +I + Q+ + WD ++TL L LRIP W S A+ +N
Sbjct: 470 TKSLNGEEIEIEQQTN--YPWDG--KITLKIVKAPKDLQNFF-LRIPGW--SQNAEILIN 522
Query: 577 GQDL-PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+ G +L + + W D + + P+ + E + A+ GP V
Sbjct: 523 NSKINDKIVSGTYLKLNQKWKKGDVIELNFPMPVELMEANPLVEEVKNQVAVKRGPLVYC 582
Query: 636 GHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSN 684
L P S N + + F+L N N
Sbjct: 583 ---------------LESDQLPAKVSVNDVALNLKSNFATNNFILNNRN 616
>gi|429218465|ref|YP_007180109.1| hypothetical protein Deipe_0766 [Deinococcus peraridilitoris DSM
19664]
gi|429129328|gb|AFZ66343.1| hypothetical protein Deipe_0766 [Deinococcus peraridilitoris DSM
19664]
Length = 689
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 56/249 (22%), Positives = 89/249 (35%), Gaps = 21/249 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL- 459
D+ E+C + ++ +R + + YAD ER+L N VLG + Y+ PL
Sbjct: 345 DTVYAETCASIGLIFFARRMLQLEPRGEYADVMERALYNTVLG-SMSMDGRHYFYVNPLE 403
Query: 460 -----APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI 514
+ G+ R P CC S LG+ +Y + VY ++
Sbjct: 404 VWPAASAGNPGRRHVKATRQPWFGCSCCPPNVARLLSSLGEYLYQVSDDDRT-VYAHLFV 462
Query: 515 SSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSS---------KGSGLTT-SLNLRIPT 564
S + V + + + W R T T S G G L LR+P
Sbjct: 463 GSIVTLSVAGHDVTLRQESSLPWSG--RATFTIGSLAAREPRGQHGPGEAAFQLALRVPA 520
Query: 565 WTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASI 624
W + + +NG+D + V + W D + LP+ + + A
Sbjct: 521 WRAGE-PQLRVNGEDAAYNVNDGYALVDRAWREGDTVEWILPMAAQLMTAHPNVRANAGR 579
Query: 625 QAILYGPYV 633
AI GP V
Sbjct: 580 VAIQRGPLV 588
>gi|171742352|ref|ZP_02918159.1| hypothetical protein BIFDEN_01462 [Bifidobacterium dentium ATCC
27678]
gi|171277966|gb|EDT45627.1| hypothetical protein BIFDEN_01462 [Bifidobacterium dentium ATCC
27678]
Length = 656
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 53/242 (21%), Positives = 87/242 (35%), Gaps = 9/242 (3%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + M +SR + + YAD ER L NG + GI + + L
Sbjct: 333 DTMYGETCASVGMSMLSRQMLLLEPKGEYADVLERELFNGAIAGISLDGKQYYYVNALES 392
Query: 460 APGSSKERSYHH-WGTPSDSFWC-CYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
P HH D F C C I D + E V Q+I++
Sbjct: 393 TPDGLDNPDRHHVLSHRVDWFGCACCPANIARLIASVDRYMYTERDGGKTVLSHQFIANE 452
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
+ SG VV + P W ++ + + +RIP+W S+N ++G
Sbjct: 453 ATFDSGLYVVQRSDMP---WSGHVEFEVNLAEGAQ--PVRFGVRIPSW-SANAYALAVDG 506
Query: 578 QDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGH 637
+ F+ +LT+ L ++++ A AI+ GP V
Sbjct: 507 EPCEKNVEDGFVYFDVFAGQTLRLTLDLDMSVKLIRANSHVRSDAGKVAIMRGPLVYCAE 566
Query: 638 SI 639
+
Sbjct: 567 QV 568
>gi|160890885|ref|ZP_02071888.1| hypothetical protein BACUNI_03330 [Bacteroides uniformis ATCC 8492]
gi|156859884|gb|EDO53315.1| hypothetical protein BACUNI_03330 [Bacteroides uniformis ATCC 8492]
Length = 663
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 52/239 (21%), Positives = 99/239 (41%), Gaps = 31/239 (12%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + ++ LF + Y D ER+L NG++ G+ + G Y PL+
Sbjct: 339 ETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLISGVS--LDGGKFFYPNPLSCDGK 396
Query: 465 KERSYHHWGTPSDSFWC-CYGTGIESF-SKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
+ H T F C C + I F L +Y ++ + VY+ ++S+R + K
Sbjct: 397 YHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQ---VYVNLFLSNRAELKL 453
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN------------- 569
+ V + + W+ +RV + ++G+ L ++N+RIP W +
Sbjct: 454 NEKKVVLEQETGYPWNGDIRVKV---AQGN-LPFTMNIRIPGWVRGSVLPSDLYSYADDL 509
Query: 570 --GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRT----EAIQDDRPEYA 622
G + +NG+++ +L + + W D + + + R E + DR A
Sbjct: 510 KLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMHPRVVKANEKVVADRGRVA 568
>gi|251798052|ref|YP_003012783.1| hypothetical protein Pjdr2_4067 [Paenibacillus sp. JDR-2]
gi|247545678|gb|ACT02697.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 622
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 87/415 (20%), Positives = 144/415 (34%), Gaps = 54/415 (13%)
Query: 269 RMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDV-LYKLFCITQDPKHLMLAHL 327
R+ +M YF +++ + ER + GG N + +Y L+ T DP + LA L
Sbjct: 135 RVIPFMTNYFRYQLKQLP-----ERPLADWAKARGGDNLISVYWLYNRTGDPFLMELAQL 189
Query: 328 FDKPCFLGLLALQADDISG-------------FHSNTHIPIVIGS----QMRYEVTGDQL 370
L +Q +D G F H+ V S ++Y +TGD+
Sbjct: 190 ---------LIVQTEDWKGLYEQYPYWYRQTSFDHRVHVVNVAMSFKQPALQYLLTGDET 240
Query: 371 HKEG--HQLESSGTNIGHFN--FKSDPKRLASNLDSNTEESCTTYNMLKVSRHLFRWTKE 426
K + S G N F D + LA S E C+ + +L R T +
Sbjct: 241 DKAVVYKAINSVMACHGQVNGMFSGD-EWLAGTHPSQGTELCSVVEYMYSLENLIRITGD 299
Query: 427 IAYADYYERSLTNGVLG-------IQRGTEPGVMIYLLPLAPGSSKERSYHHWGTPSDSF 479
+ D E+ N + + + + I ++ + + F
Sbjct: 300 GFFGDILEKIAYNALPAAISPDWKVHQYDQQANQIMCTHAKRNWTENNNEANLFGVEPHF 359
Query: 480 WCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDP 539
CC + + KL ++ EG G+ I Y + G + V + P
Sbjct: 360 GCCTANMHQGWPKLAARLWMASEGG--GIAAISYAPCLVTAALGSDKKTKAEIQVETSYP 417
Query: 540 YLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDD 599
+ S ++ LRIP W +NG+ PL F+S+ + W +D
Sbjct: 418 FRDTVNIKVGLESSAAFAMKLRIPAWCEE--PVLQINGEPYPLQPVNGFVSIERIWMPED 475
Query: 600 KLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDWDITESATSLSDW 654
+L + LP R + P + YGP +LA W + DW
Sbjct: 476 ELLLTLP---RHATLI---PRANGAAGVQYGPLMLAIPVKEQWQKHRTYPPYHDW 524
>gi|160887789|ref|ZP_02068792.1| hypothetical protein BACUNI_00192 [Bacteroides uniformis ATCC 8492]
gi|423304369|ref|ZP_17282368.1| hypothetical protein HMPREF1072_01308 [Bacteroides uniformis
CL03T00C23]
gi|423310517|ref|ZP_17288501.1| hypothetical protein HMPREF1073_03251 [Bacteroides uniformis
CL03T12C37]
gi|156862731|gb|EDO56162.1| hypothetical protein BACUNI_00192 [Bacteroides uniformis ATCC 8492]
gi|392681688|gb|EIY75045.1| hypothetical protein HMPREF1073_03251 [Bacteroides uniformis
CL03T12C37]
gi|392684698|gb|EIY78021.1| hypothetical protein HMPREF1072_01308 [Bacteroides uniformis
CL03T00C23]
Length = 688
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 89/434 (20%), Positives = 168/434 (38%), Gaps = 51/434 (11%)
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEE 301
W P + KIL QY A N + R+ +M +YF ++ + +K HW + E
Sbjct: 171 WWPRMVVLKIL----QQYYSATNDK--RVVAFMTKYFRYQLNTLPQKPL--GHWSSWAEF 222
Query: 302 AGGMN-DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDISGFHSNTHIPIVIGSQ 360
N +Y L+ +T + L L HL + F + + D+ + + + G +
Sbjct: 223 RACDNLQAVYWLYNLTGEDFLLELGHLLHRQSFSFIDMVDRGDLRRPCTIHCVNLAQGIK 282
Query: 361 ---MRYEVTGDQLHKEGHQLESSGTNIGHFNFKSD-----PKRLASNLDSNTEESCTTYN 412
+ Y+ D+ + + ++ +I F+ + + L N + E C+
Sbjct: 283 EPIIYYQQDTDRKYIDA--VKEGFRDIRRFHGQPQGMYGGDEALHGNNPTQGSELCSAVE 340
Query: 413 MLKVSRHLFRWTKEIAYADYYER--------SLTNGVLGIQRGTEPG-VMIYLLPLAPGS 463
++ + T +I +AD+ ER +++ + Q +P VM+
Sbjct: 341 LMYSLEKMVEITGDIDFADHLERIAFNALPAQISDDFMTKQYFQQPNQVMVTRHRRNFDQ 400
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSG 523
E + +GT + + CC+ + + K +++ G+ I Y S + G
Sbjct: 401 DHEGTDLAFGTLT-GYPCCFSNMHQGWPKFTQHLWYATPDN--GIAAIVYSPSEVTANVG 457
Query: 524 QIVVNQKVDPVVSWDPYL----RVTLTFS---SKGSGLTTSLNLRIPTWTSSNGAKATLN 576
V V+S D Y ++T T +K + +LR+P W A+ +N
Sbjct: 458 D-----NVPVVISEDTYYPMDHQITFTIKEVRNKVKQVKFPFHLRVPKWCKQ--AEIRVN 510
Query: 577 GQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAG 636
G+ G V + W +DK+ + LP+ + T Y + +I GP V A
Sbjct: 511 GKMEQTVKGGKIAIVDRIWKRNDKIELYLPMEVFTSTW------YENAVSIERGPLVYAL 564
Query: 637 HSIGDWDITESATS 650
+W+ E S
Sbjct: 565 KMEENWEKKEFKDS 578
>gi|318062606|ref|ZP_07981327.1| putative secreted protein [Streptomyces sp. SA3_actG]
gi|318081209|ref|ZP_07988541.1| putative secreted protein [Streptomyces sp. SA3_actF]
Length = 812
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 69/147 (46%), Gaps = 15/147 (10%)
Query: 477 DSFWCC---YGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDP 533
D + CC YG G F++ ++ G+ + Y + + K+G V
Sbjct: 400 DQYRCCPHNYGMGWPWFAQ---ELWLATPDN--GLAAVMYAPNEVRAKAGADATEVTVST 454
Query: 534 VVSWDPYLRVTLTFSSK-GSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVT 592
++ P+ TLTF+ + + L LR+P W ++ + T+NG P+ F +V+
Sbjct: 455 DTAY-PFGD-TLTFTVRTPRPVAFPLRLRVPAWCAA--PELTVNGAKSTAPAGPAFTTVS 510
Query: 593 KTWSSDDKLTIQLP--LTLRTEAIQDD 617
+TW D + ++LP +T+RT A Q D
Sbjct: 511 RTWQDGDTVRLRLPQRVTVRTWAAQHD 537
>gi|317479689|ref|ZP_07938812.1| hypothetical protein HMPREF1007_01928 [Bacteroides sp. 4_1_36]
gi|316904142|gb|EFV25973.1| hypothetical protein HMPREF1007_01928 [Bacteroides sp. 4_1_36]
Length = 647
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 52/239 (21%), Positives = 99/239 (41%), Gaps = 31/239 (12%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + ++ LF + Y D ER+L NG++ G+ + G Y PL+
Sbjct: 339 ETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLISGVS--LDGGKFFYPNPLSCDGK 396
Query: 465 KERSYHHWGTPSDSFWC-CYGTGIESF-SKLGDSIYFEEEGKYPGVYIIQYISSRLDWKS 522
+ H T F C C + I F L +Y ++ + VY+ ++S+R + K
Sbjct: 397 YHFNADHTITRQPWFGCACCPSNISRFIPSLPGYVYAVKDNQ---VYVNLFLSNRAELKL 453
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN------------- 569
+ V + + W+ +RV + ++G+ L ++N+RIP W +
Sbjct: 454 NEKKVVLEQETGYPWNGDIRVKV---AQGN-LPFTMNIRIPGWVRGSVLPSDLYSYADDL 509
Query: 570 --GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRT----EAIQDDRPEYA 622
G + +NG+++ +L + + W D + + + R E + DR A
Sbjct: 510 KLGYRVLVNGEEVTGELRKGYLRIDRKWKKGDVVEVHFDMHPRVVKANEKVVADRGRVA 568
>gi|343085566|ref|YP_004774861.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342354100|gb|AEL26630.1| protein of unknown function DUF1680 [Cyclobacterium marinum DSM
745]
Length = 690
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 57/276 (20%), Positives = 111/276 (40%), Gaps = 22/276 (7%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPG--VMIYLLPLAPGS 463
E+C + + + T + +AD E SL N VL GT+ G Y PL
Sbjct: 373 ETCANIGNVLWNHRMLLVTGDSRFADILELSLFNSVLS---GTDLGGTNFNYTNPLRVDK 429
Query: 464 SKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSRL 518
++ W + + CC + + ++ + Y + G +Y + + L
Sbjct: 430 DLPFTFR-WNKVREPYISKSNCCPPNVVRTVAETHNYAYALSDNGLVVNLYGSNELKTSL 488
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+ + Q+ D WD +++++ + + +++LR+P W S A+ T+NG+
Sbjct: 489 P-NGSSLELKQETD--YPWDGKIKLSIQKTGQDP---LAIDLRVPAWASQ--AEITVNGE 540
Query: 579 D-LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP--YVLA 635
P G++ S+ + W D + + LP+T R E + A++ GP Y +
Sbjct: 541 KSKEKPIAGSYFSLVRQWEKGDVIELNLPMTARLMEANPLVEETRNQVAVVRGPIVYCIE 600
Query: 636 GHSIGDWDITESATSLSDWITPIPASYNSQLITFTQ 671
+ D I + + TP+ +TF +
Sbjct: 601 SSDLQDARIFDVELPAAIQFTPVIKMVKGASLTFLE 636
>gi|256838606|ref|ZP_05544116.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|256739525|gb|EEU52849.1| conserved hypothetical protein [Parabacteroides sp. D13]
Length = 675
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 52/256 (20%), Positives = 105/256 (41%), Gaps = 31/256 (12%)
Query: 396 LASNLDSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYER--------SLTNGVLGIQRG 447
L N + E C+ ++ + T ++ + D+ ER +T+ + Q
Sbjct: 309 LHGNNPTQGSELCSAVELMYSLEKMMEITGDLTFTDHLERIAFNALPTQITDDFMNKQYF 368
Query: 448 TEPGVMIYLLPLAPGSSKERSYHH-----WGTPSDSFWCCYGTGIESFSKLGDSIYFEEE 502
+ + ++ P + E ++H +GT + + CC+ +++ K S+++
Sbjct: 369 QQANQI--MITRHPHNFYEDAHHAATDIIYGTRT-GYPCCFSNMHQAWPKFTQSLWYATP 425
Query: 503 GKYPGVYIIQYISSRLDWKSG---QIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLN 559
K G+ + Y S + + G +I + + D D +R T+ S+ +T +
Sbjct: 426 DK--GIAALAYSPSEVVAQVGDGHEISIIE--DTYYPMDDKIRFTIRLSNSVKEVTFPFH 481
Query: 560 LRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRP 619
LRIP W GA T+NG + + + + W D++ + LP+ + +
Sbjct: 482 LRIPEWCK--GAAVTINGITDSINGGSDMAILHRPWKDGDQVILSLPMKVESSRW----- 534
Query: 620 EYASIQAILYGPYVLA 635
Y + AI GP V A
Sbjct: 535 -YENSVAIERGPLVYA 549
>gi|340619115|ref|YP_004737568.1| hypothetical protein zobellia_3150 [Zobellia galactanivorans]
gi|339733912|emb|CAZ97289.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
Length = 694
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 57/134 (42%), Gaps = 10/134 (7%)
Query: 527 VNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPG 586
+ QK D WD +++T+ + + LRIP+W + G + +NG + PG
Sbjct: 502 LTQKTD--YPWDGAVKITV---DECKAEAFEVLLRIPSW--AKGTQIKVNGTKVAKAQPG 554
Query: 587 NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIG---DWD 643
F + + W+ D++TI +P+ + E + A+ GP V S D
Sbjct: 555 TFAKIERQWAEGDEITIDMPMETKFIEGHPRIEEVRNQVALKRGPVVYCIESADLPEKTD 614
Query: 644 ITESATSLSDWITP 657
IT S +TP
Sbjct: 615 ITNVYLSSKKQLTP 628
>gi|302521079|ref|ZP_07273421.1| conserved hypothetical protein [Streptomyces sp. SPB78]
gi|302429974|gb|EFL01790.1| conserved hypothetical protein [Streptomyces sp. SPB78]
Length = 812
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 69/147 (46%), Gaps = 15/147 (10%)
Query: 477 DSFWCC---YGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDP 533
D + CC YG G F++ ++ G+ + Y + + K+G V
Sbjct: 400 DQYRCCPHNYGMGWPWFAQ---ELWLATPDN--GLAAVMYAPNEVRAKAGTDATEVTVST 454
Query: 534 VVSWDPYLRVTLTFSSK-GSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVT 592
++ P+ TLTF+ + + L LR+P W ++ + T+NG P+ F +V+
Sbjct: 455 DTAY-PFGD-TLTFTVRTPRPVAFPLRLRVPAWCAA--PELTVNGAKSTAPAGPAFTTVS 510
Query: 593 KTWSSDDKLTIQLP--LTLRTEAIQDD 617
+TW D + ++LP +T+RT A Q D
Sbjct: 511 RTWQDGDTVRLRLPQRVTVRTWAAQHD 537
>gi|386820698|ref|ZP_10107914.1| hypothetical protein JoomaDRAFT_2662 [Joostella marina DSM 19592]
gi|386425804|gb|EIJ39634.1| hypothetical protein JoomaDRAFT_2662 [Joostella marina DSM 19592]
Length = 660
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 57/275 (20%), Positives = 114/275 (41%), Gaps = 45/275 (16%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQ-RGTEPGVMIYLLPLAPGS 463
E+C + + L T + Y D ER+L NG++ G+ GT+ + P A S
Sbjct: 358 ETCAAIGDVYWNHRLHNMTGNVKYFDVIERTLYNGLISGLSLNGTQ-----FFYPNALES 412
Query: 464 SKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR-- 517
++ G + W CC I L IY + V++ Y +++
Sbjct: 413 DGVYKFNQ-GACTRKDWFDCSCCPTNVIRFIPSLPGLIYSKTSDT---VFVNLYAANQAT 468
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL-- 575
+ + I + Q+ W+ +++T+T + ++ LRIP W + TL
Sbjct: 469 IGLEETAIAITQETS--YPWNGSVKLTVTPETASD---FTIKLRIPGWARNEVLPGTLYS 523
Query: 576 -------------NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDR 618
NG+ + ++++T+ W + +++++P+ +R E +++DR
Sbjct: 524 YKEKIKAVPEVKVNGELVEATIDNGYITLTRNWKKGETISLEIPMKVREVLANEKVEEDR 583
Query: 619 PEYASIQAILYGPYVLAGHSIGDWDITESATSLSD 653
+ A + YGP V A I + + ++ T +D
Sbjct: 584 GKIA----LEYGPIVYAVEEIDNKNNFDAITISND 614
>gi|399041428|ref|ZP_10736483.1| hypothetical protein PMI09_04045 [Rhizobium sp. CF122]
gi|398060198|gb|EJL52027.1| hypothetical protein PMI09_04045 [Rhizobium sp. CF122]
Length = 640
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 50/236 (21%), Positives = 91/236 (38%), Gaps = 19/236 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
+S E+C + ++ + + YAD E++L NG + + Y PL
Sbjct: 329 ESAYAETCASVGLVFWANRMLGRGPNRRYADIMEQALYNGAMA-GLSLDGKTFFYENPLE 387
Query: 461 PGSSKER-SYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
R +HH P CC + +G +Y E + V++ +R
Sbjct: 388 SAGKHHRWIWHH--CP-----CCPPNIARLLASIGSYMYGVAEDEI-AVHLYGEGRARFK 439
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
+ + QK W + + S +++LRIP W +NGA +NG+
Sbjct: 440 MAGADVALTQKTR--YPWHGAVHFDIKTSKPAQ---FAVSLRIPGW--ANGATLAVNGEA 492
Query: 580 LPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+ + S + + + W DK+ + +PL R+ + A A++ GP V
Sbjct: 493 IDIGSVDVDGYARIEREWRDGDKIDLDIPLEARSLWANPLVRQDAGRAALMRGPLV 548
>gi|440731554|ref|ZP_20911563.1| hypothetical protein A989_09226 [Xanthomonas translucens DAR61454]
gi|440372448|gb|ELQ09250.1| hypothetical protein A989_09226 [Xanthomonas translucens DAR61454]
Length = 664
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 43/216 (19%), Positives = 83/216 (38%), Gaps = 19/216 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL- 459
D+ ESC + ++ + + + + YAD ER+L N VL + Y+ PL
Sbjct: 334 DTAYNESCASIGLMMFANRMLQLAPDSRYADVMERALYNTVLA-GMALDGRHFFYVNPLE 392
Query: 460 --APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
P + H P W CC + LG +Y + +Y+ Y
Sbjct: 393 VHPPTVHGNHGFDHV-KPVRQRWFGCACCPPNIARVLTSLGHYLYTRRDDT---LYVNLY 448
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ S + G + + W + +++ + + +L LR+P W + +
Sbjct: 449 VGSDAAFDVGGQTLTLRQRGEYPWQEQVELSVDCDAP---VEAALALRLPDWCRA--PQL 503
Query: 574 TLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPL 607
LNG+ + + + + + + W D L + LP+
Sbjct: 504 RLNGEAVAIAAHLQHGYCVLRRRWQRGDTLHLHLPM 539
>gi|182440394|ref|YP_001828113.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC
13350]
gi|178468910|dbj|BAG23430.1| putative secreted protein [Streptomyces griseus subsp. griseus NBRC
13350]
Length = 814
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 42/73 (57%), Gaps = 5/73 (6%)
Query: 543 VTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLT 602
VTL+ ++ L L LR+P W + + +NGQ + PS F + +TWSS D++T
Sbjct: 464 VTLSLTAPKP-LAFPLVLRVPAWCADPDIR--VNGQRVAAPSGPAFTRIERTWSSGDRVT 520
Query: 603 IQLP--LTLRTEA 613
++LP T+RT A
Sbjct: 521 LRLPQRTTVRTWA 533
>gi|225018685|ref|ZP_03707877.1| hypothetical protein CLOSTMETH_02635, partial [Clostridium
methylpentosum DSM 5476]
gi|224948545|gb|EEG29754.1| hypothetical protein CLOSTMETH_02635 [Clostridium methylpentosum
DSM 5476]
Length = 1108
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 59/256 (23%), Positives = 98/256 (38%), Gaps = 46/256 (17%)
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGV-----MIY--LL 457
+E+C + +K + T + YAD E++ N +LG +G V +Y
Sbjct: 529 QETCISVTWMKFCEKMLSITGDPIYADQIEKTAYNALLGAMQGPNAQVDDVCSTLYWDYF 588
Query: 458 PLAPGSSK-ERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
L G+ E H G S CC +GI G P I+ +
Sbjct: 589 TLYNGTRHHEFGGHIEGVDS----CCSASGISGL------------GVIPLAQIMNSAAG 632
Query: 517 RLD--WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT---------SLNLRIPTW 565
+ + G + N V +D V + +G ++ LRIP W
Sbjct: 633 PVINLYSPGSMAANTPSGNKVRFD----VDTNYPVEGEIKMVVQPDVQEQFTVKLRIPAW 688
Query: 566 TSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQ 625
+ K +NG + PG FL + +TW D TI++ + RT ++ + + + +
Sbjct: 689 SEQTVVK--VNGAEQKDVVPGTFLELNRTWKPGD--TIEISMDFRTWIVESPKGKGSDTE 744
Query: 626 ---AILYGPYVLAGHS 638
A++ GP VLA S
Sbjct: 745 GNIALVRGPVVLARDS 760
>gi|332669318|ref|YP_004452326.1| hypothetical protein Celf_0799 [Cellulomonas fimi ATCC 484]
gi|332338356|gb|AEE44939.1| protein of unknown function DUF1680 [Cellulomonas fimi ATCC 484]
Length = 634
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 59/243 (24%), Positives = 94/243 (38%), Gaps = 24/243 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL---APG 462
E+C + V+ L T E +AD ER+L N V+ + Y PL PG
Sbjct: 326 ETCAGVASVMVAWRLLLATGEARWADVVERTLYN-VVATSPAQDGQAFFYTNPLHKRVPG 384
Query: 463 SSKE------RSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISS 516
S+ + R+ P CC + + LG + + GV + QY +
Sbjct: 385 SAADPDQVSARALSRLRAPWFEVSCCPTNVARTLASLGAYLATTTDD---GVQLHQYAPA 441
Query: 517 RLDWKSGQ-IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
R+ G + +V D + V +T + +G L+LR+P+W ATL
Sbjct: 442 RIATTLGDGRPIGLEVATGYPHDGDVVVRVTQAPEGE---VGLSLRVPSWAVG---AATL 495
Query: 576 NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
+G P G V + ++ D++ + LP+ R D A+ GP VL
Sbjct: 496 DGA----PVEGGVAVVRRVFAVGDEVRLSLPVEPRVTTPDDRIDAVRGCVAVERGPLVLC 551
Query: 636 GHS 638
S
Sbjct: 552 AES 554
>gi|224537077|ref|ZP_03677616.1| hypothetical protein BACCELL_01954 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521304|gb|EEF90409.1| hypothetical protein BACCELL_01954 [Bacteroides cellulosilyticus
DSM 14838]
Length = 811
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 50/236 (21%), Positives = 93/236 (39%), Gaps = 33/236 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + +F T YAD ER+L NGV+ G+ + Y PL
Sbjct: 340 ETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 397
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER W + CC G + + +Y + +Y+ YI S+ + +
Sbjct: 398 HER--QQWFGCA----CCPGNVTRFMASVPFYMYATQGND---IYVNLYIQSKAELNTET 448
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
V + WD + +++ + +L +RIP W ++ AKA
Sbjct: 449 NNVKLEQITTYPWDGKVSISVNPEKEQE---FALRVRIPGWAQDAPVPTDLYSFTDKAKA 505
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYA 622
++NG+ + + ++ W + D + I P+ +R + ++DDR + A
Sbjct: 506 YTISINGKKVNATQLDGYATILHDWKTGDVVEINFPMDVRRVKANDNVEDDRGKLA 561
>gi|373954097|ref|ZP_09614057.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373890697|gb|EHQ26594.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 800
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 53/239 (22%), Positives = 93/239 (38%), Gaps = 38/239 (15%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + +F + Y D ER+L NG+L G+ + Y PLA
Sbjct: 335 ETCAAIGNVYWNNRMFLLHGDAKYIDVLERTLYNGLLSGVSLSGD--RFFYPNPLASMFQ 392
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYI--SSRLDWKS 522
+RS W + + CC L +Y + + +Y+ ++ SS + S
Sbjct: 393 HQRS--AWISCA----CCISNMTRFLPSLPGYVYAKNKND---LYVNLFMSNSSNIKLAS 443
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL------- 575
G + + Q+ D W + +T+ K + T L +RIP W L
Sbjct: 444 GNVNIVQQTD--YPWKGQVDMTIN-PVKTTDFT--LRVRIPGWAKQQPVPGNLYSFMDKT 498
Query: 576 --------NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPL----TLRTEAIQDDRPEYA 622
NG+ + + + + W DK+++ LPL L + ++DDR +A
Sbjct: 499 PLPVVIYINGKATSFVTEKGYAVLKRNWKKGDKVSLALPLETEKVLANDKVKDDRGRFA 557
>gi|355670901|ref|ZP_09057548.1| hypothetical protein HMPREF9469_00585 [Clostridium citroniae
WAL-17108]
gi|354815817|gb|EHF00407.1| hypothetical protein HMPREF9469_00585 [Clostridium citroniae
WAL-17108]
Length = 647
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 65/279 (23%), Positives = 109/279 (39%), Gaps = 38/279 (13%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D N ESC + M + + T E Y D ER+L N VL GI + + L +
Sbjct: 325 DCNYSESCASIGMAMFGQRMGNITGEAKYYDVVERALYNTVLAGIALDGKSFFYVNPLEV 384
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P + R+ P W CC + + LG IY ++ +Y+ +IS
Sbjct: 385 WPDNCIPRTSREHVKPVRQKWFGVACCPPNIARTLASLGQYIYGADQNS---LYVNLFIS 441
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKG---SGLTTSLNLRIPTWTSSNGAK 572
++ G ++ ++ WD +++ + KG SG+ L +RIP + S
Sbjct: 442 NQTSVDLGGREISVQMQTRFPWD----MSVDIACKGVPASGI--RLAVRIPDYAGSFTVT 495
Query: 573 ATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYASIQA-I 627
Q L + ++ T D L I++ R ++ D + A ++ I
Sbjct: 496 KAGTQQPLAFSREKGYAVISLT--EDAGLRIEMDAKARFVRSNPLVRADSGKVALVRGPI 553
Query: 628 LY-------GP-----YVLAGHSIGD--WDITESATSLS 652
+Y GP YV +G I + WD+ T L+
Sbjct: 554 VYCLEEVDNGPNLAAVYVDSGTEIKEEKWDLMGEITGLT 592
>gi|374985914|ref|YP_004961409.1| hypothetical protein SBI_03157 [Streptomyces bingchenggensis BCW-1]
gi|297156566|gb|ADI06278.1| hypothetical protein SBI_03157 [Streptomyces bingchenggensis BCW-1]
Length = 644
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 43/214 (20%), Positives = 82/214 (38%), Gaps = 20/214 (9%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA--PGS 463
E+C ++ + +F T + Y D ER L N + + Y PL P
Sbjct: 315 ETCAAIGTMQWAWRMFLATGDARYPDVLERVLYN-AFAVGLSADGRAFFYDNPLQRRPDH 373
Query: 464 SKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
+ G P W CC + ++L D + E G+ + + Y + +D
Sbjct: 374 EQRSGAEEGGEPLRQAWFSCPCCPPNVVRWMAQLADFLVAERPGE---LLVAGYAQAGVD 430
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLN--G 577
+ + WD +R+T+ + ++LR+P W + T+ G
Sbjct: 431 GAEAALDMATGY----PWDGEVRLTV---RRAPDEPYRISLRVPGWADPGQVRLTVGTAG 483
Query: 578 QDLPLPSPGN-FLSVTKTWSSDDKLTIQLPLTLR 610
++ + +L+V + W D+L + LP+ +R
Sbjct: 484 EETAAGDVSDGWLTVERRWRPGDELRLSLPMPVR 517
>gi|354604714|ref|ZP_09022703.1| hypothetical protein HMPREF9450_01618 [Alistipes indistinctus YIT
12060]
gi|353347293|gb|EHB91569.1| hypothetical protein HMPREF9450_01618 [Alistipes indistinctus YIT
12060]
Length = 623
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 98/456 (21%), Positives = 169/456 (37%), Gaps = 76/456 (16%)
Query: 201 LKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAP------YYTIHKILAG 254
L+ V+ ++A Q+ GY++ + T L L W Y H I AG
Sbjct: 117 LRRTADQWVAKIAAAQQP--DGYINTYYT-----LTGLDKRWTDMDKHEMYCAGHMIEAG 169
Query: 255 LLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFC 314
+ D L ++T MV + N +RHW +EE + L KL+
Sbjct: 170 IAYLLATGDRT-LLEVSTRMVGHMMNEFG------PGKRHWVPGHEE---IELALAKLYS 219
Query: 315 ITQDPKHLMLAHLFDKPCFLG-----------------LLALQADDISGFHSNTHIPIVI 357
+T +PK+L A + G + + DI+G H+ + +
Sbjct: 220 VTGEPKYLEFARWLLEERGHGYGRNEEGTWNAAYYQDSIPVSRMTDITG-HAVRCMYLFC 278
Query: 358 GSQMRYEVTGDQLHK-----------EGHQLESSGTNIGHFNFKSDPKRLASNLDSNTEE 406
G ++GD +++ + + + G H N NL++ E
Sbjct: 279 GMADMSMLSGDTVYRAALDRVWDDVVQRNMYITGGIGSSHQNEGFTEDYDLPNLEAYCE- 337
Query: 407 SCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL-APGSS 464
+C + M+ + + R + YAD ER+L NG L GI + Y+ PL + G
Sbjct: 338 TCASVGMVLWNARMNRLKGDAKYADVMERALYNGALAGIS--LDGKRFFYVNPLESKGDH 395
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL---DWK 521
++++ CC +G IY V++ Y+ S
Sbjct: 396 HRKAWYGCA-------CCPSQLSRFLPSIGSYIYSHSLDS-DTVWVNLYLGSNAAIPTQD 447
Query: 522 SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP 581
+ V+ Q W+ R+T+ S + L LRIP W ++ +NG+
Sbjct: 448 GSRFVLTQTTR--YPWEGNARITV--SEAPGKIRKELRLRIPGWCKNH--TLWVNGELFD 501
Query: 582 LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDD 617
P+ + V ++W D+ I L L + TE + D
Sbjct: 502 HPTDKGYAVVNRSWKKGDR--IDLSLAMPTEVVAAD 535
>gi|67538270|ref|XP_662909.1| hypothetical protein AN5305.2 [Aspergillus nidulans FGSC A4]
gi|40743275|gb|EAA62465.1| hypothetical protein AN5305.2 [Aspergillus nidulans FGSC A4]
gi|259485256|tpe|CBF82133.1| TPA: DUF1680 domain protein (AFU_orthologue; AFUA_1G08910)
[Aspergillus nidulans FGSC A4]
Length = 629
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 77/176 (43%), Gaps = 25/176 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA--PGS 463
E+C + ++ + + + + YAD E L NG LG G + G Y PL G
Sbjct: 336 ETCACFALIIWCQRMLQLDLDAKYADVMEVGLYNGFLG-AVGLDGGSFYYQNPLRTYTGH 394
Query: 464 SKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIY-FEEEGKYPGVYIIQYISSRLDWKS 522
KERS W + CC + + IY F+++ V I YI S
Sbjct: 395 PKERS--EWFEVA----CCPPNVAKLLGSMESLIYSFKDD----LVAIHLYIESDFTVPE 444
Query: 523 GQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
+VV+QK + S D + S KG TT+L LRIPTW + G +++ G+
Sbjct: 445 TGVVVSQKTNMPWSGD------VEISVKG---TTALALRIPTW--AEGYSSSVQGE 489
>gi|255038580|ref|YP_003089201.1| hypothetical protein Dfer_4835 [Dyadobacter fermentans DSM 18053]
gi|254951336|gb|ACT96036.1| protein of unknown function DUF1680 [Dyadobacter fermentans DSM
18053]
Length = 648
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 56/266 (21%), Positives = 94/266 (35%), Gaps = 40/266 (15%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLA 460
D+ E+C + + ++ T E Y D +ER L NG LG G + Y+ P++
Sbjct: 338 DNAYAETCAAIANMLWNHKMYLRTGEAKYMDVFERVLYNGFLG-GMGVKGNTFFYVNPMS 396
Query: 461 --------PGSSKERSYHHW-GTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYII 511
GS R H W GT CC T + F + +G V +
Sbjct: 397 SNGKNDFNKGSGAVR--HEWFGTA-----CC-PTNVSRFLPSMPGYMYATQGNALVVNLF 448
Query: 512 QYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGA 571
+ + + + ++Q+ W +R+ + G+ L++RIP W +
Sbjct: 449 GDTKANITLPATAVQISQQTQ--YPWQGNIRIQVDPEKSGA---FPLHIRIPGWATGQAI 503
Query: 572 KATL---------------NGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQD 616
L NG+ +L + +TW D + + L + +R +
Sbjct: 504 PGDLYSYEDKLAKPVTVQINGKKADAAIENGYLKLNRTWKKGDVVELVLDMPVRRVISNE 563
Query: 617 DRPEYASIQAILYGP--YVLAGHSIG 640
AI GP Y GH G
Sbjct: 564 KLTANKGKVAIERGPVLYCAEGHDNG 589
>gi|423223926|ref|ZP_17210395.1| hypothetical protein HMPREF1062_02581 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392637372|gb|EIY31243.1| hypothetical protein HMPREF1062_02581 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 820
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 50/236 (21%), Positives = 93/236 (39%), Gaps = 33/236 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + + +F T YAD ER+L NGV+ G+ + Y PL
Sbjct: 349 ETCAAIANVYWNHRMFLATGNAKYADVLERALYNGVISGVSLSGDK--FFYDNPLESMGQ 406
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
ER W + CC G + + +Y + +Y+ YI S+ + +
Sbjct: 407 HER--QQWFGCA----CCPGNVTRFMASVPFYMYATQGND---IYVNLYIQSKAELNTET 457
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTS-----------SNGAKA 573
V + WD + +++ + +L +RIP W ++ AKA
Sbjct: 458 NNVKLEQITTYPWDGKVSISVNPEKEQE---FALRVRIPGWAQDAPVPTDLYSFTDKAKA 514
Query: 574 ---TLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYA 622
++NG+ + + ++ W + D + I P+ +R + ++DDR + A
Sbjct: 515 YTISINGKKVNATQLDGYATILHDWKTGDIVEINFPMDVRRVKANDNVEDDRGKLA 570
>gi|395771959|ref|ZP_10452474.1| hypothetical protein Saci8_19398 [Streptomyces acidiscabies 84-104]
Length = 654
Score = 43.5 bits (101), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 113/535 (21%), Positives = 191/535 (35%), Gaps = 94/535 (17%)
Query: 153 NFRKTARLPAPGE--PYGGWEEPSCELRGHFVGHYLSASALMWASTHNESLKEKMSAVVS 210
NFR A L G P G + + V +L A+ A T +E+L ++ A+V
Sbjct: 59 NFRAAAALRTDGADTPSGTGFSGDFQFQDSDVYKWLEAACWQLADTPDETLATEVEAIVE 118
Query: 211 ALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAPYYTIHKILAGLLDQYTYADN------ 264
++A Q+E GYL + + +L P P + AG L Q A +
Sbjct: 119 LIAAAQRE--DGYL-----QTYYQLGGGTPWTEPGWGHELYCAGHLIQAAVAHHRATGSD 171
Query: 265 ---AEALRMTTWMVEYFY--NRVQNVIKKYSIERHWQTLNEEAGGMNDVLYKLFCITQDP 319
A A R+ + F +V+ V +E L +L T +
Sbjct: 172 RLLAVARRLADHIDSVFGPGKQVETVCGHPEVE--------------TALVELHRTTDEK 217
Query: 320 KHLMLAHLFDKPCFLGLLALQAD-----DISGFHSNTHIPIVIGSQMRYEVTGDQLHKEG 374
++L LA F + G L+ AD D + H PI EVTG + +
Sbjct: 218 RYLDLARYFLERRGHGTLSSGADRGHDRDPGPEYWQDHTPIRAAD----EVTGHAVRQLY 273
Query: 375 HQLESSGT--NIGHFNFKSDPKRLASNL----------------------------DSNT 404
++ G ++ +RL ++ D
Sbjct: 274 LLAGAADLAAETGDTELRTALERLWRDMVTTKTYLTGAVGSRHDWEAFGDAHELPADRAY 333
Query: 405 EESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSS 464
E+C + S + T E Y+D ER+L NG L G + +Y+ PL
Sbjct: 334 AETCAAIASVHFSWRMALLTGEARYSDLVERTLFNGFLA-GAGLDGRTWLYVNPL---HR 389
Query: 465 KERSYHHWG------TPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
+ RS+ G TP CC + + L + ++ G+ + QY +
Sbjct: 390 RARSHERPGDQTAHRTPWFRCACCPPNVMRLLAGLPHYLATADDS---GLQLHQYATG-- 444
Query: 519 DWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ 578
G + +V W+ VT+T + L +L+LR+P W + + T+NG
Sbjct: 445 --VYGGDGLTVRVTTEYPWEGT--VTVTVDEAPTALPRTLSLRLPAWCADH--TLTVNGT 498
Query: 579 DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
+ + +L +T+ ++ D + + L + R A+ GP V
Sbjct: 499 TVEDGADSGWLRITRAFTPGDTVRLDLAMPARLTVPSSRVDAVRGCAAVERGPLV 553
>gi|290962053|ref|YP_003493235.1| hypothetical protein SCAB_77341 [Streptomyces scabiei 87.22]
gi|260651579|emb|CBG74703.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length = 654
Score = 43.5 bits (101), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 104/501 (20%), Positives = 180/501 (35%), Gaps = 92/501 (18%)
Query: 185 YLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAP 244
+L A+ A T +E+L ++ A+V ++A Q+E GYL + + +L IP P
Sbjct: 93 WLEAACWQLADTPDETLATEVEAIVELIAAAQRE--DGYL-----QTYYQLGGGIPWTEP 145
Query: 245 YYTIHKILAGLLDQYTYADN---------AEALRMTTWMVEYFY--NRVQNVIKKYSIER 293
+ AG L Q A + A A R+ + F +V V +E
Sbjct: 146 GWGHELYCAGHLIQAAVAHHRATGSDRLLAVARRLADHIDSVFGPGKQVDTVCGHPEVE- 204
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQAD-----DISGFH 348
L +L T + ++L LA F + G L+ AD D +
Sbjct: 205 -------------TALVELHRTTDEKRYLDLARYFLERRGHGTLSSGADRGHDRDPGPEY 251
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGT--NIGHFNFKSDPKRLASNL------ 400
H P+ EVTG + + ++ G ++ +RL ++
Sbjct: 252 WQDHTPVRAAD----EVTGHAVRQLYLLAGAADLAAETGDTELRTALERLWRDMVTTKTY 307
Query: 401 ----------------------DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLT 438
D E+C + S + T E Y+D ER+L
Sbjct: 308 LTGAVGSRHDWEAFGDAHELPADRAYAETCAAIASVHFSWRMALLTGEARYSDLVERTLF 367
Query: 439 NGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWG------TPSDSFWCCYGTGIESFSK 492
NG L G + +Y+ PL + RS+ G TP CC + +
Sbjct: 368 NGFLA-GAGLDGRTWLYVNPL---HRRARSHERPGDQTAHRTPWFRCACCPPNVMRLLAG 423
Query: 493 LGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGS 552
L + ++ G+ + QY + G + +V W+ VT+T +
Sbjct: 424 LPHYLATADDS---GLQLHQYATG----VYGGDGLTVRVTTEYPWEGT--VTVTVDEAPT 474
Query: 553 GLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTE 612
L +L+LR+P W + + T+NG + + +L +T+ ++ D + + L + R
Sbjct: 475 ALPRTLSLRLPAWCADH--TLTVNGTTVEDGADSGWLRITRAFTPGDTVRLDLAMPARLT 532
Query: 613 AIQDDRPEYASIQAILYGPYV 633
A+ GP V
Sbjct: 533 VPSSRVDAVRGCAAVERGPLV 553
>gi|440699526|ref|ZP_20881821.1| hypothetical protein STRTUCAR8_01370 [Streptomyces turgidiscabies
Car8]
gi|440277899|gb|ELP65960.1| hypothetical protein STRTUCAR8_01370 [Streptomyces turgidiscabies
Car8]
Length = 654
Score = 43.5 bits (101), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 104/501 (20%), Positives = 180/501 (35%), Gaps = 92/501 (18%)
Query: 185 YLSASALMWASTHNESLKEKMSAVVSALSACQKEIGSGYLSAFPTEQFDRLEALIPVWAP 244
+L A+ A T +E+L ++ A+V ++A Q+E GYL + + +L IP P
Sbjct: 93 WLEAACWQLADTPDETLATEVEAIVELIAAAQRE--DGYL-----QTYYQLGGGIPWTEP 145
Query: 245 YYTIHKILAGLLDQYTYADN---------AEALRMTTWMVEYFY--NRVQNVIKKYSIER 293
+ AG L Q A + A A R+ + F +V V +E
Sbjct: 146 GWGHELYCAGHLIQAAVAHHRATGSDRLLAVARRLADHIDSVFGPGKQVDTVCGHPEVE- 204
Query: 294 HWQTLNEEAGGMNDVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQAD-----DISGFH 348
L +L T + ++L LA F + G L+ AD D +
Sbjct: 205 -------------TALVELHRTTDEKRYLDLARYFLERRGHGTLSSGADRGHDRDPGPEY 251
Query: 349 SNTHIPIVIGSQMRYEVTGDQLHKEGHQLESSGT--NIGHFNFKSDPKRLASNL------ 400
H P+ EVTG + + ++ G ++ +RL ++
Sbjct: 252 WQDHTPVRAAD----EVTGHAVRQLYLLAGAADLAAETGDTELRTALERLWRDMVTTKTY 307
Query: 401 ----------------------DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLT 438
D E+C + S + T E Y+D ER+L
Sbjct: 308 LTGAVGSRHDWEAFGDAHELPADRAYAETCAAIASVHFSWRMALLTGEARYSDLVERTLF 367
Query: 439 NGVLGIQRGTEPGVMIYLLPLAPGSSKERSYHHWG------TPSDSFWCCYGTGIESFSK 492
NG L G + +Y+ PL + RS+ G TP CC + +
Sbjct: 368 NGFLA-GAGLDGRTWLYVNPL---HRRARSHERPGDQTAHRTPWFRCACCPPNVMRLLAG 423
Query: 493 LGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGS 552
L + ++ G+ + QY + G + +V W+ VT+T +
Sbjct: 424 LPHYLATADDS---GLQLHQYATG----VYGGDGLTVRVTTEYPWEGT--VTVTVDEAPT 474
Query: 553 GLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTE 612
L +L+LR+P W + + T+NG + + +L +T+ ++ D + + L + R
Sbjct: 475 ALPRTLSLRLPAWCADH--TLTVNGTTVEDGADSGWLRITRAFTPGDTVRLDLAMPARLT 532
Query: 613 AIQDDRPEYASIQAILYGPYV 633
A+ GP V
Sbjct: 533 VPSSRVDAVRGCAAVERGPLV 553
>gi|326802068|ref|YP_004319887.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326552832|gb|ADZ81217.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
Length = 696
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 45/193 (23%), Positives = 83/193 (43%), Gaps = 17/193 (8%)
Query: 481 CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDPVVSWDPY 540
CC + + KL +++++ GV + Y S + + + D +D
Sbjct: 435 CCTANMHQGWPKLVQNLWYQTADG--GVAALLYGPSHVKAQVNGQPIEISEDTYYPFDE- 491
Query: 541 LRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQ-DLPLPSPGNFLSVTKTWSSDD 599
R+ T SK L+ +LRIP W + A+ +NG+ PG+ + +++ W + D
Sbjct: 492 -RIHFTIHSK-KDLSFPFHLRIPHWAKN--AQIKINGELSNEAVKPGSIVKISRLWKNGD 547
Query: 600 KLTIQLPLTLRTEAIQDDRPEYASIQ-AILYGPYVLAGHSIGDWDITESATSLSDWITPI 658
++T+ LP+ + T +A + A+ GP V A DW D++
Sbjct: 548 QITLVLPMQIETS-------RWAELSVAVERGPLVYALKIDEDWRKVNDGDYFGDYLEVH 600
Query: 659 PAS-YNSQLITFT 670
P S +N L++ T
Sbjct: 601 PKSDWNFGLLSKT 613
>gi|423294214|ref|ZP_17272341.1| hypothetical protein HMPREF1070_01006 [Bacteroides ovatus
CL03T12C18]
gi|392676116|gb|EIY69555.1| hypothetical protein HMPREF1070_01006 [Bacteroides ovatus
CL03T12C18]
Length = 684
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 53/110 (48%), Gaps = 10/110 (9%)
Query: 549 SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDL-PLPSPGNFLSVTKTWSSDDKLTIQLPL 607
S G + LRIP+WT A+ +NG+ + P G +L + + W++ D++ + LP+
Sbjct: 469 STGEKVAFPFYLRIPSWTQK--AEVRVNGKKVSAAPVAGKYLCINREWANGDRVELTLPM 526
Query: 608 TLRTEAIQDDRPEYASIQAILYGPYVLA---GHSIGDWDITESATSLSDW 654
+L Q ++ + ++ YGP L+ + D E+A S W
Sbjct: 527 SLSMRTWQVNK----NSVSVDYGPLTLSLKIAEKYVEKDSRETAIGDSKW 572
>gi|336404541|ref|ZP_08585236.1| hypothetical protein HMPREF0127_02549 [Bacteroides sp. 1_1_30]
gi|335942338|gb|EGN04185.1| hypothetical protein HMPREF0127_02549 [Bacteroides sp. 1_1_30]
Length = 704
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 53/110 (48%), Gaps = 10/110 (9%)
Query: 549 SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDL-PLPSPGNFLSVTKTWSSDDKLTIQLPL 607
S G + LRIP+WT A+ +NG+ + P G +L + + W++ D++ + LP+
Sbjct: 489 STGEKVAFPFYLRIPSWTQK--AEVRVNGKKVSAAPVAGKYLCINREWANGDRVELTLPM 546
Query: 608 TLRTEAIQDDRPEYASIQAILYGPYVLA---GHSIGDWDITESATSLSDW 654
+L Q ++ + ++ YGP L+ + D E+A S W
Sbjct: 547 SLSMRTWQVNK----NSVSVDYGPLTLSLKIAEKYVEKDSRETAIGDSKW 592
>gi|424792517|ref|ZP_18218744.1| hypothetical protein XTG29_01554 [Xanthomonas translucens pv.
graminis ART-Xtg29]
gi|422797058|gb|EKU25452.1| hypothetical protein XTG29_01554 [Xanthomonas translucens pv.
graminis ART-Xtg29]
Length = 664
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 43/216 (19%), Positives = 82/216 (37%), Gaps = 19/216 (8%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPL- 459
D+ ESC + ++ + + + + YAD ER+L N VL + Y+ PL
Sbjct: 334 DTAYNESCASIGLMMFANRMLQLAPDSRYADVMERALYNTVLA-GMALDGRHFFYVNPLE 392
Query: 460 --APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQY 513
P + H P W CC + LG +Y + +Y+ Y
Sbjct: 393 VHPPTVHGNHGFDHV-KPVRQRWFGCACCPPNIARVVTSLGHYLYTRRDDT---LYVNLY 448
Query: 514 ISSRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKA 573
+ S + G + + W + +++ + + L LR+P W + +
Sbjct: 449 VGSDAAFDVGGQTLTLRQRGEYPWQEQVELSMDCDAP---IEAGLALRLPDWCRA--PQL 503
Query: 574 TLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPL 607
LNG+ + + + + + + W D L + LP+
Sbjct: 504 QLNGEAVAIAAHLQHGYCVLRQRWQRGDTLHLHLPM 539
>gi|297204508|ref|ZP_06921905.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|197710567|gb|EDY54601.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length = 638
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 53/237 (22%), Positives = 86/237 (36%), Gaps = 20/237 (8%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLA---- 460
E+C ++ S + T + Y+D ER+L NG L G+ E +Y+ PL
Sbjct: 317 ETCAAIASIQWSWRMALLTGDTRYSDLIERTLFNGFLAGVSLDGE--RWLYVNPLQVRDG 374
Query: 461 ---PGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSR 517
PG + W + CC + + L ++ G+ I QY++ R
Sbjct: 375 HTDPGGDQSARRTRWFRCA----CCPPNVMRLLASL---EHYLASSDGSGLQIHQYVTGR 427
Query: 518 LDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG 577
G V + W + T + + +LRIP W + +
Sbjct: 428 YTGDLGGTPVAVSAETDYPWQGT--IAFTVEETPADRPWTFSLRIPQWCGTYRVRCADTA 485
Query: 578 QD-LPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYV 633
D P +L + +TWS D++ ++L L R A AI GP V
Sbjct: 486 YDETDAPVTDGWLRLERTWSPGDRVVLELSLAPRLTAADPRVDAVRGCVAIERGPLV 542
>gi|218678364|ref|ZP_03526261.1| hypothetical protein RetlC8_05602 [Rhizobium etli CIAT 894]
Length = 345
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 54/234 (23%), Positives = 99/234 (42%), Gaps = 17/234 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C + ++ + + + YAD E++L NG L G+ T+ Y PL
Sbjct: 126 DTAYAETCASVGLVFWASRMLGRGPDRRYADIMEQALYNGALPGLS--TDGKTFFYDNPL 183
Query: 460 APGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLD 519
GS+ + HH + +G +Y + + V++ ++RL
Sbjct: 184 --GSAGK---HHPLENGIIAPAARPNIARLVTSIGSYMYAVADDEI-AVHLYGESTTRLK 237
Query: 520 WKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQD 579
+G V Q+ WD + T +L+LRIP W + GA ++NG+
Sbjct: 238 LANGAAVELQQATNY-PWDGAVAFTTRLEKPAK---FALSLRIPDW--AEGATLSVNGEK 291
Query: 580 LPLPSP--GNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
L L + + + + W+ D++ + LPL+LR + + A A++ GP
Sbjct: 292 LDLGAAVRDGYARIDRQWADGDRVDLFLPLSLRPQYANPKVRQDAGRVALMRGP 345
>gi|150009917|ref|YP_001304660.1| hypothetical protein BDI_3334 [Parabacteroides distasonis ATCC
8503]
gi|423333684|ref|ZP_17311465.1| hypothetical protein HMPREF1075_03116 [Parabacteroides distasonis
CL03T12C09]
gi|149938341|gb|ABR45038.1| putative exported protein [Parabacteroides distasonis ATCC 8503]
gi|409226994|gb|EKN19896.1| hypothetical protein HMPREF1075_03116 [Parabacteroides distasonis
CL03T12C09]
Length = 683
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 57/266 (21%), Positives = 97/266 (36%), Gaps = 29/266 (10%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVLGIQRGTEPGVMIYLLPLAPGSSK 465
E CT M+ + T ++ +ADY ER N L Q + Y +
Sbjct: 326 ELCTAVEMMFSLEEMLEITGDVQWADYLERVAYNA-LPTQVTDDYSARQYYQQTNQ-VAV 383
Query: 466 ERSYHHWGTPSD----------SFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
R + ++ TP D + CC + + KL ++++ G+ + Y
Sbjct: 384 TREWRNFSTPHDDTDILFGELTGYPCCTSNLHQGWPKLVQNLWYATADN--GIAALVYAP 441
Query: 516 SRLDWK-SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTT-SLNLRIPTWTSSNGAKA 573
S + K + + V + + +D L F K ++RIP W N
Sbjct: 442 SSVKAKVANGVTVQIEEETAYPFDETLHFKFAFEDKKIKRAFFPFHIRIPAW--CNQPVI 499
Query: 574 TLNGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPY 632
LNG+++ + + PG + + W D LT++LP+ + Y I GP
Sbjct: 500 KLNGENVVVDAYPGEIARINREWKQGDVLTVELPMQVAASRW------YGGSAVIERGPL 553
Query: 633 VLAGHSIGDWDIT----ESATSLSDW 654
V A W+ E A +W
Sbjct: 554 VYALKMNEKWEKKTFEGEKAAQYGNW 579
>gi|440750208|ref|ZP_20929452.1| putative secreted protein [Mariniradius saccharolyticus AK6]
gi|436481249|gb|ELP37430.1| putative secreted protein [Mariniradius saccharolyticus AK6]
Length = 667
Score = 43.1 bits (100), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 39/78 (50%), Gaps = 8/78 (10%)
Query: 558 LNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDD 617
+LRIP W + K TLNGQ + + + +TW + DK+T+ LP+ L+T
Sbjct: 472 FHLRIPAW--AKDPKITLNGQAVDFVATNQVAVLNRTWKNGDKVTLTLPMELKTSTW--- 526
Query: 618 RPEYASIQAILYGPYVLA 635
Y + +I GP V +
Sbjct: 527 ---YKGMVSIERGPLVFS 541
>gi|423223921|ref|ZP_17210390.1| hypothetical protein HMPREF1062_02576 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392637419|gb|EIY31288.1| hypothetical protein HMPREF1062_02576 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 801
Score = 43.1 bits (100), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 51/222 (22%), Positives = 85/222 (38%), Gaps = 30/222 (13%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + V+ LF E Y D ER+L NG++ G+ + G Y PL
Sbjct: 338 ETCAAIGNVYVNYRLFLLHGESKYYDVLERTLYNGLISGVS--LDGGGFFYPNPL----- 390
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
E H P CC L IY ++ VY+ ++S+ D K G
Sbjct: 391 -ESMGQHQRQPWFGCACCPSNICRFIPSLPGYIYAVKDKD---VYVNLFMSNTSDLKVGG 446
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTW-----------TSSNGAK- 572
V+ + W+ + + + +S G +L +RIP W T S+G +
Sbjct: 447 KAVSIEQTTKYPWNGDITIGINKNSAGP---FNLKVRIPGWVRGQVVPSDLYTYSDGKRL 503
Query: 573 ---ATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRT 611
+NG+ + + + + W DK+ + + RT
Sbjct: 504 KYTVKVNGEAVQNELKDGYFCIDRRWKKGDKVEVHFDMEPRT 545
>gi|336407814|ref|ZP_08588310.1| hypothetical protein HMPREF1018_00325 [Bacteroides sp. 2_1_56FAA]
gi|335944893|gb|EGN06710.1| hypothetical protein HMPREF1018_00325 [Bacteroides sp. 2_1_56FAA]
Length = 687
Score = 43.1 bits (100), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 560 LRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
LRIP+WT GA +NG+ + P G + + + W +D++ IQLP+ L Q ++
Sbjct: 483 LRIPSWTE--GATIFVNGKKVAANPEAGQYACINREWKDNDQVEIQLPMQLSMRTWQVNK 540
Query: 619 PEYASIQAILYGPYVLAGHSIGDWDITES-ATSLSD 653
+ ++ YGP ++ D+ +S AT++ D
Sbjct: 541 ----NSVSVDYGPLTMSLKIDEDYVKKDSRATAIGD 572
>gi|60679875|ref|YP_210019.1| hypothetical protein BF0282 [Bacteroides fragilis NCTC 9343]
gi|423269824|ref|ZP_17248796.1| hypothetical protein HMPREF1079_01878 [Bacteroides fragilis
CL05T00C42]
gi|423272722|ref|ZP_17251669.1| hypothetical protein HMPREF1080_00322 [Bacteroides fragilis
CL05T12C13]
gi|60491309|emb|CAH06057.1| putative exported protein [Bacteroides fragilis NCTC 9343]
gi|392700670|gb|EIY93832.1| hypothetical protein HMPREF1079_01878 [Bacteroides fragilis
CL05T00C42]
gi|392708636|gb|EIZ01742.1| hypothetical protein HMPREF1080_00322 [Bacteroides fragilis
CL05T12C13]
Length = 687
Score = 43.1 bits (100), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 560 LRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
LRIP+WT GA +NG+ + P G + + + W +D++ IQLP+ L Q ++
Sbjct: 483 LRIPSWTE--GATIFVNGKKVAANPEAGQYACINREWKDNDQVEIQLPMQLSMRTWQVNK 540
Query: 619 PEYASIQAILYGPYVLAGHSIGDWDITES-ATSLSD 653
+ ++ YGP ++ D+ +S AT++ D
Sbjct: 541 ----NSVSVDYGPLTMSLKIDEDYVKKDSRATAIGD 572
>gi|53711625|ref|YP_097617.1| hypothetical protein BF0334 [Bacteroides fragilis YCH46]
gi|265765010|ref|ZP_06093285.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
gi|423248287|ref|ZP_17229303.1| hypothetical protein HMPREF1066_00313 [Bacteroides fragilis
CL03T00C08]
gi|423253236|ref|ZP_17234167.1| hypothetical protein HMPREF1067_00811 [Bacteroides fragilis
CL03T12C07]
gi|423259330|ref|ZP_17240253.1| hypothetical protein HMPREF1055_02530 [Bacteroides fragilis
CL07T00C01]
gi|423263698|ref|ZP_17242701.1| hypothetical protein HMPREF1056_00388 [Bacteroides fragilis
CL07T12C05]
gi|52214490|dbj|BAD47083.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|263254394|gb|EEZ25828.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
gi|387776910|gb|EIK39010.1| hypothetical protein HMPREF1055_02530 [Bacteroides fragilis
CL07T00C01]
gi|392657136|gb|EIY50773.1| hypothetical protein HMPREF1067_00811 [Bacteroides fragilis
CL03T12C07]
gi|392660394|gb|EIY54008.1| hypothetical protein HMPREF1066_00313 [Bacteroides fragilis
CL03T00C08]
gi|392707120|gb|EIZ00240.1| hypothetical protein HMPREF1056_00388 [Bacteroides fragilis
CL07T12C05]
Length = 687
Score = 43.1 bits (100), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 560 LRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
LRIP+WT GA +NG+ + P G + + + W +D++ IQLP+ L Q ++
Sbjct: 483 LRIPSWTE--GATIFVNGKKVAANPEAGQYACINREWKDNDQVEIQLPMQLSMRTWQVNK 540
Query: 619 PEYASIQAILYGPYVLAGHSIGDWDITES-ATSLSD 653
+ ++ YGP ++ D+ +S AT++ D
Sbjct: 541 ----NSVSVDYGPLTMSLKIDEDYVKKDSRATAIGD 572
>gi|375356719|ref|YP_005109491.1| hypothetical protein BF638R_0339 [Bacteroides fragilis 638R]
gi|383116630|ref|ZP_09937378.1| hypothetical protein BSHG_1295 [Bacteroides sp. 3_2_5]
gi|251948094|gb|EES88376.1| hypothetical protein BSHG_1295 [Bacteroides sp. 3_2_5]
gi|301161400|emb|CBW20940.1| putative exported protein [Bacteroides fragilis 638R]
Length = 687
Score = 43.1 bits (100), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 560 LRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
LRIP+WT GA +NG+ + P G + + + W +D++ IQLP+ L Q ++
Sbjct: 483 LRIPSWTE--GATIFVNGKKVAANPEAGQYACINREWKDNDQVEIQLPMQLSMRTWQVNK 540
Query: 619 PEYASIQAILYGPYVLAGHSIGDWDITES-ATSLSD 653
+ ++ YGP ++ D+ +S AT++ D
Sbjct: 541 ----NSVSVDYGPLTMSLKIDEDYVKKDSRATAIGD 572
>gi|333025235|ref|ZP_08453299.1| putative secreted protein [Streptomyces sp. Tu6071]
gi|332745087|gb|EGJ75528.1| putative secreted protein [Streptomyces sp. Tu6071]
Length = 812
Score = 43.1 bits (100), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 68/147 (46%), Gaps = 15/147 (10%)
Query: 477 DSFWCC---YGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQKVDP 533
D + CC YG G F++ ++ G+ + Y + + K G V
Sbjct: 400 DQYRCCPHNYGMGWPWFAQ---ELWLATPDN--GLAAVMYAPNEVRAKVGADATEVTVST 454
Query: 534 VVSWDPYLRVTLTFSSK-GSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLPSPGNFLSVT 592
++ P+ TLTF+ + + L LR+P W ++ + T+NG P+ F +V+
Sbjct: 455 DTAY-PFGD-TLTFTVRTPRPVAFPLRLRVPAWCAA--PELTVNGAKSTAPAGPAFTTVS 510
Query: 593 KTWSSDDKLTIQLP--LTLRTEAIQDD 617
+TW D + ++LP +T+RT A Q D
Sbjct: 511 RTWQDGDTVRLRLPQRVTVRTWAAQHD 537
>gi|270290499|ref|ZP_06196724.1| hypothetical protein HMPREF9024_00684 [Pediococcus acidilactici
7_4]
gi|270281280|gb|EFA27113.1| hypothetical protein HMPREF9024_00684 [Pediococcus acidilactici
7_4]
Length = 664
Score = 43.1 bits (100), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 48/240 (20%), Positives = 98/240 (40%), Gaps = 15/240 (6%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + M ++ + + YAD E+ L NG L G+ + + L P +S
Sbjct: 353 ETCASVGMAFFAKQMLNIKAKGEYADILEKELFNGALSGMSLDGKHFFYVNPLEADPEAS 412
Query: 465 KER--SYHHWGTPSDSFWC-CYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWK 521
++ H +D F C C + D + +G + Q+I++R +++
Sbjct: 413 RKNPGKSHVLTHRADWFGCACCPANLARLITSIDKYIYTLDGD--TILSHQFIANRAEFE 470
Query: 522 SGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLP 581
+G +V P WD + + ++ L +RIP+W S N LNG+ +
Sbjct: 471 NGISIVQNNNYP---WDGDIHYVI---KDPKNISFRLGIRIPSW-SKNNINIVLNGKKVI 523
Query: 582 LPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGD 641
L F+ + D ++ + L ++++ + + + A+ GP + A + +
Sbjct: 524 LEVEDGFVYL--DIEKDTQIDVDLDMSVKFMQSSNRVSQNINKLAVQRGPIIYAAEEVDN 581
>gi|423282411|ref|ZP_17261296.1| hypothetical protein HMPREF1204_00834 [Bacteroides fragilis HMW
615]
gi|404581979|gb|EKA86674.1| hypothetical protein HMPREF1204_00834 [Bacteroides fragilis HMW
615]
Length = 687
Score = 43.1 bits (100), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 560 LRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
LRIP+WT GA +NG+ + P G + + + W +D++ IQLP+ L Q ++
Sbjct: 483 LRIPSWTE--GATIFVNGKKVAANPEAGQYACINREWKDNDQVEIQLPMQLSMRTWQVNK 540
Query: 619 PEYASIQAILYGPYVLAGHSIGDWDITES-ATSLSD 653
+ ++ YGP ++ D+ +S AT++ D
Sbjct: 541 ----NSVSVDYGPLTMSLKIDEDYVKKDSRATAIGD 572
>gi|323345036|ref|ZP_08085260.1| hypothetical protein HMPREF0663_11796 [Prevotella oralis ATCC
33269]
gi|323094306|gb|EFZ36883.1| hypothetical protein HMPREF0663_11796 [Prevotella oralis ATCC
33269]
Length = 695
Score = 43.1 bits (100), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 52/109 (47%), Gaps = 7/109 (6%)
Query: 528 NQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATLNG-QDLPLPSPG 586
N+KV + D + F+ G + LRIP+WT N A+ ++NG + P G
Sbjct: 458 NKKVTITETTDYPFSDKICFTISKGGGRFPIYLRIPSWT--NNAEVSINGVKQNAEPVSG 515
Query: 587 NFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLA 635
++ + W D +T+ +P+TL Q ++ + +I YGP L+
Sbjct: 516 KYIRMVYNWKKGDVITLHVPMTLHIRRWQVNK----NSASIDYGPLTLS 560
>gi|270295052|ref|ZP_06201253.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270274299|gb|EFA20160.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 688
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 91/439 (20%), Positives = 162/439 (36%), Gaps = 61/439 (13%)
Query: 242 WAPYYTIHKILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEE 301
W P + KIL QY A N + R+ +M +YF ++ + +K HW + E
Sbjct: 171 WWPRMVVLKIL----QQYYSATNDK--RVVAFMTKYFRYQLNTLPQKPL--GHWSSWAEF 222
Query: 302 AGGMN-DVLYKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDIS--------GFHSNTH 352
N +Y L+ +T + L L HL + F + + D+
Sbjct: 223 RACDNLQAVYWLYNLTGEDFLLELGHLLHRQSFSFIDMVDRGDLRRPCTIHCVNLAQGIK 282
Query: 353 IPIVIGSQMRYEVTGDQLHKEGHQLESSGTNIGHFNFKSD-----PKRLASNLDSNTEES 407
PI+ Q D K ++ +I F+ + + L N + E
Sbjct: 283 EPIIYYLQ-------DTDRKYIDAVKEGFRDIRRFHGQPQGMYGGDEALHGNNPTQGSEL 335
Query: 408 CTTYNMLKVSRHLFRWTKEIAYADYYER--------SLTNGVLGIQRGTEPG-VMIYLLP 458
C+ ++ + T +I +AD+ ER +++ + Q +P VM+
Sbjct: 336 CSAVELMYSLEKMVEITGDIDFADHLERIAFNALPAQISDDFMTKQYFQQPNQVMVTRHR 395
Query: 459 LAPGSSKERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRL 518
E + +GT + + CC+ + + K +++ G+ I Y S +
Sbjct: 396 RNFDQDHEGTDLAFGTLT-GYPCCFSNMHQGWPKFTQHLWYATPDN--GIAAIVYSPSEV 452
Query: 519 DWKSGQIVVNQKVDPVVSWDPYL----RVTLTFS---SKGSGLTTSLNLRIPTWTSSNGA 571
G V V+S D Y ++T T +K + +LR+P W A
Sbjct: 453 TANVGD-----NVPVVISEDTYYPMDHQITFTIKEVRNKVKQVKFPFHLRVPKWCKQ--A 505
Query: 572 KATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+ +NG+ G V + W +DK+ + LP+ + T Y + +I GP
Sbjct: 506 EIRVNGKMEQTVKGGKIAIVDRIWKRNDKIELYLPMEVFTSTW------YENAVSIERGP 559
Query: 632 YVLAGHSIGDWDITESATS 650
V A +W+ E S
Sbjct: 560 LVYALKMEENWEKKEFKDS 578
>gi|302875896|ref|YP_003844529.1| hypothetical protein Clocel_3075 [Clostridium cellulovorans 743B]
gi|307689330|ref|ZP_07631776.1| hypothetical protein Ccel74_14336 [Clostridium cellulovorans 743B]
gi|302578753|gb|ADL52765.1| protein of unknown function DUF1680 [Clostridium cellulovorans
743B]
Length = 648
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 61/296 (20%), Positives = 114/296 (38%), Gaps = 22/296 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
D+ E+C ++ ++ + + ++ YAD ER+L N V G+ + L +
Sbjct: 330 DTVYSETCAAIGLIFFAQRMLKLDQDRKYADVLERALYNTVTSGMALDGRHFFYVNPLEV 389
Query: 460 APGSSKERSYHHWGTPSDSFW----CCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYIS 515
P +S++ W CC + LG IY E ++ YI
Sbjct: 390 QPEASEKSPIKRHVKAERQKWYGCACCPPNVARLLTSLGQYIYTESNDT---IFTHLYIG 446
Query: 516 SRLDWKSGQIVVNQK---VDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAK 572
S+ D+ VN K V ++ + T F + T LRIP W + K
Sbjct: 447 SKADF-----TVNNKKVTVKQTTNYPSEGKATFVFDMSENNEFT-FALRIPEWCKN--YK 498
Query: 573 ATLNGQDL-PLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGP 631
+N ++ L +L +T+ + + D + I + + A A AI GP
Sbjct: 499 IFINNEEYRELDLNKGYLYITREFLNSDVVEISMEIETVLVASNPLVRANAGKVAICRGP 558
Query: 632 YVLAGHSIGDWDITESATSLSDWITPIPASYNSQLITFTQEYGNTKFVLTNSNQSI 687
V I + ++ L D P+ YN +++ E + +++++ +Q +
Sbjct: 559 LVYCLEEID--NCKNLSSILIDTSKPVKEQYNPEVLGGAIELKASGYIVSSESQDL 612
>gi|224536979|ref|ZP_03677518.1| hypothetical protein BACCELL_01855 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521418|gb|EEF90523.1| hypothetical protein BACCELL_01855 [Bacteroides cellulosilyticus
DSM 14838]
Length = 678
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 87/419 (20%), Positives = 155/419 (36%), Gaps = 51/419 (12%)
Query: 251 ILAGLLDQYTYADNAEALRMTTWMVEYFYNRVQNVIKKYSIERHWQTLNEEAGGMN-DVL 309
++ +L QY A N E R+ T+M +YF ++ + +K HW E N +
Sbjct: 166 VMLKILQQYYSATNDE--RIITFMTKYFRYQLNTLPQKPL--GHWSFWAEFRACDNLQAV 221
Query: 310 YKLFCITQDPKHLMLAHLFDKPCFLGLLALQADDIS--------GFHSNTHIPIVIGSQM 361
Y L+ +T + L L HL + + + + D+ PI+ Q
Sbjct: 222 YWLYNLTGEAFLLELGHLLHQQSYSFVDMVNRGDLRRICTIHCVNLAQGIKEPIIYYQQD 281
Query: 362 RYEVTGDQLHKEGHQ--LESSGTNIGHFNFKSDPKRLASNLDSNTEESCTTYNMLKVSRH 419
D + K G Q + G G + + L N + E C ++
Sbjct: 282 TNPKYIDAV-KRGFQDIRQFHGQPQGMY---GGDEALHGNNPTQGSELCAAVELMYSLEK 337
Query: 420 LFRWTKEIAYADYYER--------SLTNGVLGIQRGTEPG-VMIYLLPLAPGSSKERSYH 470
+ T +I +AD+ ER +++ + Q +P +M+ E +
Sbjct: 338 MVEITGDIDFADHLERIAFNALPTQISDDFMIKQYFQQPNQIMVTRHRRNFDQDHEGTDI 397
Query: 471 HWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQIVVNQK 530
+GT + + CC+ + + K +++ G+ Y S + K G
Sbjct: 398 TFGTLT-GYPCCFSNMHQGWPKFTQHLWYATPDN--GIAAFTYSPSEVTAKVGN-----N 449
Query: 531 VDPVVSWDPYL----RVTLTFS---SKGSGLTTSLNLRIPTWTSSNGAKATLNGQDLPLP 583
V V+S D Y R++ T +K + L+LRIP W A+ +NG+
Sbjct: 450 VSVVISEDTYYPMDNRISFTIKEVKNKTKQVEFPLHLRIPKWCKR--AEIIVNGKAEQYI 507
Query: 584 SPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDRPEYASIQAILYGPYVLAGHSIGDW 642
G + + W +D + + LP+ + T Y + I GP V A +W
Sbjct: 508 EGGRIAVINRIWKRNDNVELHLPMEVSTSTW------YENAVTIERGPLVYALKIKENW 560
>gi|195607558|gb|ACG25609.1| hypothetical protein [Zea mays]
Length = 49
Score = 43.1 bits (100), Expect = 0.72, Method: Composition-based stats.
Identities = 20/26 (76%), Positives = 21/26 (80%)
Query: 391 SDPKRLASNLDSNTEESCTTYNMLKV 416
SD KRLA L + TEESCTTYNMLKV
Sbjct: 6 SDRKRLAVALPTETEESCTTYNMLKV 31
>gi|393780984|ref|ZP_10369185.1| hypothetical protein HMPREF1071_00053 [Bacteroides salyersiae
CL02T12C01]
gi|392677319|gb|EIY70736.1| hypothetical protein HMPREF1071_00053 [Bacteroides salyersiae
CL02T12C01]
Length = 672
Score = 43.1 bits (100), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 49/237 (20%), Positives = 90/237 (37%), Gaps = 35/237 (14%)
Query: 406 ESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPLAPGSS 464
E+C + ++ LF + Y D ER+L NG++ G+ + G Y PLA
Sbjct: 338 ETCAAIGSVYMNYRLFLLHGDAKYFDVLERTLYNGLISGVS--LDGGSFFYPNPLASDGG 395
Query: 465 KERSYHHWGTPSDSFWCCYGTGIESFSKLGDSIYFEEEGKYPGVYIIQYISSRLDWKSGQ 524
R P CC L +Y ++ + VY+ ++S+R + K
Sbjct: 396 YSRK------PWFGCACCPSNISRFIPSLPGYVYAVKDRQ---VYVNLFLSNRAELKVND 446
Query: 525 IVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSN--------------- 569
V + + W +R+ + ++ G +N+RIP W +
Sbjct: 447 KKVVLEQETSYPWKGDIRLKVLQGNQPFG----MNVRIPGWVRGSVLPSDLYAYADHQQP 502
Query: 570 GAKATLNGQDLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLR----TEAIQDDRPEYA 622
+ +NGQ++ +L++ + W +D + I + R E + DR A
Sbjct: 503 AYRVMVNGQEVEGELHNGYLTIDRKWKKNDVVEIHFDMLPRLVKANEKVAADRGRVA 559
>gi|237720334|ref|ZP_04550815.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|229450085|gb|EEO55876.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
Length = 684
Score = 43.1 bits (100), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 51/99 (51%), Gaps = 10/99 (10%)
Query: 560 LRIPTWTSSNGAKATLNGQDLPL-PSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAIQDDR 618
LRIP+WT A+ +NG+ + + P G +L + + W++ D++ + LP++L Q ++
Sbjct: 480 LRIPSWTQK--AEVRVNGKKVSVTPVAGKYLCINREWANGDRVELTLPMSLSMRTWQVNK 537
Query: 619 PEYASIQAILYGPYVLA---GHSIGDWDITESATSLSDW 654
+ ++ YGP L+ + D E+A S W
Sbjct: 538 ----NSVSVDYGPLTLSLKIAEKYVEKDSRETAIGDSKW 572
>gi|218675303|ref|ZP_03524972.1| hypothetical protein RetlG_29862 [Rhizobium etli GR56]
Length = 175
Score = 43.1 bits (100), Expect = 0.74, Method: Composition-based stats.
Identities = 24/79 (30%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 557 SLNLRIPTWTSSNGAKATLNGQDLPLPS--PGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
+L+LRIP W + GA ++NG L L + + + + W+ D++ + LPL+LR +
Sbjct: 8 ALSLRIPDW--AEGATLSVNGTMLDLSTHIRDGYARIDRQWADGDRVALHLPLSLRPQYA 65
Query: 615 QDDRPEYASIQAILYGPYV 633
+ A A++ GP V
Sbjct: 66 NPKVRQDAGRVALMRGPLV 84
>gi|227509159|ref|ZP_03939208.1| hypothetical protein HMPREF0496_1322, partial [Lactobacillus brevis
subsp. gravesensis ATCC 27305]
gi|227191395|gb|EEI71462.1| hypothetical protein HMPREF0496_1322 [Lactobacillus brevis subsp.
gravesensis ATCC 27305]
Length = 63
Score = 42.7 bits (99), Expect = 0.77, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 118 VSLHDVRLGSDSMHWRAQQTNLEYLLMLDVDKLVWNFRKTARL-PAPGEPYGGWE 171
+ L DVR+ SD AQ+ + YLL LD + ++ F + + L P +PYGGWE
Sbjct: 5 IPLKDVRI-SDPEILNAQRNAVHYLLTLDPSRFLYGFNQVSGLKPVAAKPYGGWE 58
>gi|374321585|ref|YP_005074714.1| hypothetical protein HPL003_08640 [Paenibacillus terrae HPL-003]
gi|357200594|gb|AET58491.1| hypothetical protein HPL003_08640 [Paenibacillus terrae HPL-003]
Length = 647
Score = 42.7 bits (99), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 43/213 (20%), Positives = 88/213 (41%), Gaps = 15/213 (7%)
Query: 401 DSNTEESCTTYNMLKVSRHLFRWTKEIAYADYYERSLTNGVL-GIQRGTEPGVMIYLLPL 459
DS E+C + + + + R + YAD ER+L NG + G+ + + L +
Sbjct: 331 DSMYCETCASVGLAFWANRMLRLAPDRKYADVLERALYNGTISGMDLDGKRFFYVNPLEV 390
Query: 460 APGSSKERSYHHWGTPSDSFW---CCYGTGIESFSKLGDSIYFE-EEGKYPGVYIIQYIS 515
P + H T ++ CC + + D++Y + E+ Y +YI ++
Sbjct: 391 NPFQKSRKDQEHVKTERQKWFFCACCPPNLARMIASVEDNMYTQTEDTLYTHLYIAGKVN 450
Query: 516 SRLDWKSGQIVVNQKVDPVVSWDPYLRVTLTFSSKGSGLTTSLNLRIPTWTSSNGAKATL 575
L + +I + W+ L ++ + S + LRIP W A+ +
Sbjct: 451 LTLSGQEVEITQTHR----YPWNADLSFSIHVAEPTS---FTWALRIPGWCKH--AEVQV 501
Query: 576 NGQDLPLPS-PGNFLSVTKTWSSDDKLTIQLPL 607
NG+ + L ++ + + W+ D +++ L +
Sbjct: 502 NGEAISLDHLEKGYVEIQRIWNDGDVVSLHLAM 534
>gi|218508305|ref|ZP_03506183.1| hypothetical protein RetlB5_12284 [Rhizobium etli Brasil 5]
Length = 177
Score = 42.7 bits (99), Expect = 0.81, Method: Composition-based stats.
Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 4/79 (5%)
Query: 557 SLNLRIPTWTSSNGAKATLNGQ--DLPLPSPGNFLSVTKTWSSDDKLTIQLPLTLRTEAI 614
+L+LRIP W ++GA ++NG+ DL + + + + W D++ + LPL+LR +
Sbjct: 10 ALSLRIPDW--ADGATLSVNGEKLDLGAATRDGYARIDRQWVDGDRVDLFLPLSLRPQYA 67
Query: 615 QDDRPEYASIQAILYGPYV 633
+ A A++ GP V
Sbjct: 68 NPKVRQDAGRVALMRGPLV 86
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.403
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,062,623,141
Number of Sequences: 23463169
Number of extensions: 603589412
Number of successful extensions: 1312247
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 498
Number of HSP's successfully gapped in prelim test: 619
Number of HSP's that attempted gapping in prelim test: 1307452
Number of HSP's gapped (non-prelim): 1621
length of query: 863
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 711
effective length of database: 8,792,793,679
effective search space: 6251676305769
effective search space used: 6251676305769
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)