BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011051
(494 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224053368|ref|XP_002297785.1| predicted protein [Populus trichocarpa]
gi|222845043|gb|EEE82590.1| predicted protein [Populus trichocarpa]
Length = 858
Score = 825 bits (2130), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/483 (79%), Positives = 428/483 (88%), Gaps = 3/483 (0%)
Query: 1 MKNFVFKVLVLFLSCWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHLTPT 60
MK + ++VL + C KECTN+ QL+SHTFRY LLSS+NETWK+E+++HYHLTPT
Sbjct: 1 MKGLIV-LVVLSMLCGFGTSKECTNTPTQLSSHTFRYALLSSENETWKEEMFAHYHLTPT 59
Query: 61 DDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWR 120
DDSAW+NLLPRK+L E DE+SW M+YR +K+P K +G+FLKEVSLH+V+LDPSS+HW+
Sbjct: 60 DDSAWANLLPRKILREEDEYSWAMMYRNLKSP--LKSSGNFLKEVSLHNVRLDPSSIHWQ 117
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQQTNLEYLLMLDVDSLVWSF+KTAG T G AY GWE P CELRGHFVGHYLSASA MW
Sbjct: 118 AQQTNLEYLLMLDVDSLVWSFRKTAGLSTPGTAYGGWEAPNCELRGHFVGHYLSASAQMW 177
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILA 240
ASTHN L+++M+AVVSALS CQ KMGSGYLSAFPSE FDRFEA+KPVWAPYYTIHKILA
Sbjct: 178 ASTHNDILEKQMSAVVSALSSCQEKMGSGYLSAFPSELFDRFEAIKPVWAPYYTIHKILA 237
Query: 241 GLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLY 300
GLLDQYTFADN QALKM KWMV+YFYNRV+NVIT +SVERH+ SLNEETGGMNDVLY+L+
Sbjct: 238 GLLDQYTFADNAQALKMVKWMVDYFYNRVRNVITNFSVERHYQSLNEETGGMNDVLYKLF 297
Query: 301 TITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKV 360
+IT DPKHL+LAHLFDKPCFLGLLAVQA+DISGFHANTHIP+VIG+QMRYE+TGDPLYK
Sbjct: 298 SITGDPKHLVLAHLFDKPCFLGLLAVQAEDISGFHANTHIPIVIGAQMRYEITGDPLYKD 357
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWT 420
GTFFMDIVN+SH YATGGTS EFWSDPKRLASTL TENEESCTTYNMLKVSRHLFRWT
Sbjct: 358 IGTFFMDIVNSSHSYATGGTSVSEFWSDPKRLASTLQTENEESCTTYNMLKVSRHLFRWT 417
Query: 421 KEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYG 480
KEM YADYYERALTNGVL IQRGTEPGVMIYMLP G SK KSYHGWGT + +FWCCYG
Sbjct: 418 KEMAYADYYERALTNGVLGIQRGTEPGVMIYMLPQHPGSSKGKSYHGWGTLYDTFWCCYG 477
Query: 481 TGL 483
TG+
Sbjct: 478 TGI 480
>gi|225435510|ref|XP_002285548.1| PREDICTED: uncharacterized protein LOC100246702 [Vitis vinifera]
Length = 864
Score = 818 bits (2112), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/487 (79%), Positives = 421/487 (86%), Gaps = 6/487 (1%)
Query: 1 MKNFVFK----VLVLFLSCWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYH 56
MK FV V+ F+ C L KECTN QL+SH+FRYELL+S NE+WK E++ HYH
Sbjct: 1 MKVFVLSEVLIVVFAFVLCGCVLGKECTNVPTQLSSHSFRYELLASNNESWKAEMFQHYH 60
Query: 57 LTPTDDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSS 116
L TDDSAWSNLLPRK+L E DEFSW M+YR MKN DG +FLKE+SLHDV+LD S
Sbjct: 61 LIHTDDSAWSNLLPRKLLREEDEFSWAMMYRNMKNYDGSN--SNFLKEMSLHDVRLDSDS 118
Query: 117 LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSAS 176
LH RAQQTNL+YLL+LDVD LVWSF+KTAG T G Y GWE P ELRGHFVGHY+SAS
Sbjct: 119 LHGRAQQTNLDYLLILDVDRLVWSFRKTAGLSTPGLPYGGWEAPNVELRGHFVGHYMSAS 178
Query: 177 AHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIH 236
A MWASTHN TLKEKM+AVVSAL+ CQ KMG+GYLSAFPSE FDRFEA+KPVWAPYYTIH
Sbjct: 179 AQMWASTHNDTLKEKMSAVVSALATCQEKMGTGYLSAFPSELFDRFEAIKPVWAPYYTIH 238
Query: 237 KILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVL 296
KILAGLLDQYTFA N+QALKM WMVE+FY RVQNVIT YS+ERHW SLNEETGGMNDVL
Sbjct: 239 KILAGLLDQYTFAGNSQALKMMTWMVEHFYKRVQNVITMYSLERHWLSLNEETGGMNDVL 298
Query: 297 YRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP 356
YRLY+IT D KHL+LAHLFDKPCFLGLLAVQAD ISGFHANTHIPVVIGSQMRYEVTGDP
Sbjct: 299 YRLYSITGDQKHLVLAHLFDKPCFLGLLAVQADSISGFHANTHIPVVIGSQMRYEVTGDP 358
Query: 357 LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
LYK GTFFMDIVN+SH YATGGTS GEFWSDPKRLASTL ENEESCTTYNMLKVSRHL
Sbjct: 359 LYKAIGTFFMDIVNSSHSYATGGTSVGEFWSDPKRLASTLQRENEESCTTYNMLKVSRHL 418
Query: 417 FRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFW 476
FRWTKE+VYADYYERALTNGVLSIQRGT+PGVMIYMLPLGRGDSKA+SYHGWGT+F SFW
Sbjct: 419 FRWTKEVVYADYYERALTNGVLSIQRGTDPGVMIYMLPLGRGDSKARSYHGWGTKFDSFW 478
Query: 477 CCYGTGL 483
CCYGTG+
Sbjct: 479 CCYGTGI 485
>gi|224075776|ref|XP_002304762.1| predicted protein [Populus trichocarpa]
gi|222842194|gb|EEE79741.1| predicted protein [Populus trichocarpa]
Length = 858
Score = 816 bits (2107), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/481 (79%), Positives = 424/481 (88%), Gaps = 2/481 (0%)
Query: 3 NFVFKVLVLFLSCWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHLTPTDD 62
N + + ++ + C + KECTN QL+SH+FRYELLSS+NETWK+E++ HYHL PTDD
Sbjct: 2 NGLLVLAMVSMLCSFGISKECTNIPTQLSSHSFRYELLSSQNETWKEEMFEHYHLIPTDD 61
Query: 63 SAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQ 122
SAWS+LLPRK+L E DE SW M+YR +K+P K +G+FL E+SLH+V+LDPSS+HW+AQ
Sbjct: 62 SAWSSLLPRKILREEDEHSWEMMYRNLKSP--LKSSGNFLNEMSLHNVRLDPSSIHWKAQ 119
Query: 123 QTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWAS 182
QTNLEYLLMLDV++LVWSF+KTAGS T GKAY GWE P ELRGHFVGHYLSASA MWAS
Sbjct: 120 QTNLEYLLMLDVNNLVWSFRKTAGSSTPGKAYGGWEKPDSELRGHFVGHYLSASAQMWAS 179
Query: 183 THNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGL 242
THN TLK+KM+AVVSALS CQ KMG+GYLSAFPSE FDRFEA+KPVWAPYYTIHKILAGL
Sbjct: 180 THNETLKKKMSAVVSALSACQVKMGTGYLSAFPSELFDRFEAIKPVWAPYYTIHKILAGL 239
Query: 243 LDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTI 302
LDQYT ADN QALKM KWMV+YFYNRV+NVIT YSVERH+ SLNEETGGMNDVLY+L++I
Sbjct: 240 LDQYTLADNAQALKMVKWMVDYFYNRVRNVITNYSVERHYLSLNEETGGMNDVLYKLFSI 299
Query: 303 TQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTG 362
T DPKHL+LAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIG+QMRYE+TGDPLYK G
Sbjct: 300 TGDPKHLVLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGAQMRYEITGDPLYKDIG 359
Query: 363 TFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKE 422
FFMD+VN+SH YATGGTS EFWSDPKRLASTL TENEESCTTYNMLKVSRHLFRWTKE
Sbjct: 360 AFFMDVVNSSHSYATGGTSVSEFWSDPKRLASTLQTENEESCTTYNMLKVSRHLFRWTKE 419
Query: 423 MVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTG 482
M YADYYERALTNGVL IQRGTEPGVMIYMLP G SKAKSYHGWGT + SFWCCYGTG
Sbjct: 420 MAYADYYERALTNGVLGIQRGTEPGVMIYMLPQYPGSSKAKSYHGWGTSYDSFWCCYGTG 479
Query: 483 L 483
+
Sbjct: 480 I 480
>gi|297746357|emb|CBI16413.3| unnamed protein product [Vitis vinifera]
Length = 767
Score = 815 bits (2104), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/487 (79%), Positives = 421/487 (86%), Gaps = 6/487 (1%)
Query: 1 MKNFVFK----VLVLFLSCWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYH 56
MK FV V+ F+ C L KECTN QL+SH+FRYELL+S NE+WK E++ HYH
Sbjct: 1 MKVFVLSEVLIVVFAFVLCGCVLGKECTNVPTQLSSHSFRYELLASNNESWKAEMFQHYH 60
Query: 57 LTPTDDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSS 116
L TDDSAWSNLLPRK+L E DEFSW M+YR MKN DG +FLKE+SLHDV+LD S
Sbjct: 61 LIHTDDSAWSNLLPRKLLREEDEFSWAMMYRNMKNYDGSN--SNFLKEMSLHDVRLDSDS 118
Query: 117 LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSAS 176
LH RAQQTNL+YLL+LDVD LVWSF+KTAG T G Y GWE P ELRGHFVGHY+SAS
Sbjct: 119 LHGRAQQTNLDYLLILDVDRLVWSFRKTAGLSTPGLPYGGWEAPNVELRGHFVGHYMSAS 178
Query: 177 AHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIH 236
A MWASTHN TLKEKM+AVVSAL+ CQ KMG+GYLSAFPSE FDRFEA+KPVWAPYYTIH
Sbjct: 179 AQMWASTHNDTLKEKMSAVVSALATCQEKMGTGYLSAFPSELFDRFEAIKPVWAPYYTIH 238
Query: 237 KILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVL 296
KILAGLLDQYTFA N+QALKM WMVE+FY RVQNVIT YS+ERHW SLNEETGGMNDVL
Sbjct: 239 KILAGLLDQYTFAGNSQALKMMTWMVEHFYKRVQNVITMYSLERHWLSLNEETGGMNDVL 298
Query: 297 YRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP 356
YRLY+IT D KHL+LAHLFDKPCFLGLLAVQAD ISGFHANTHIPVVIGSQMRYEVTGDP
Sbjct: 299 YRLYSITGDQKHLVLAHLFDKPCFLGLLAVQADSISGFHANTHIPVVIGSQMRYEVTGDP 358
Query: 357 LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
LYK GTFFMDIVN+SH YATGGTS GEFWSDPKRLASTL ENEESCTTYNMLKVSRHL
Sbjct: 359 LYKAIGTFFMDIVNSSHSYATGGTSVGEFWSDPKRLASTLQRENEESCTTYNMLKVSRHL 418
Query: 417 FRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFW 476
FRWTKE+VYADYYERALTNGVLSIQRGT+PGVMIYMLPLGRGDSKA+SYHGWGT+F SFW
Sbjct: 419 FRWTKEVVYADYYERALTNGVLSIQRGTDPGVMIYMLPLGRGDSKARSYHGWGTKFDSFW 478
Query: 477 CCYGTGL 483
CCYGTG+
Sbjct: 479 CCYGTGI 485
>gi|449448754|ref|XP_004142130.1| PREDICTED: uncharacterized protein LOC101207833 [Cucumis sativus]
Length = 868
Score = 794 bits (2051), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/469 (78%), Positives = 408/469 (86%)
Query: 15 CWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHLTPTDDSAWSNLLPRKML 74
C KECTN+ QL SHTFRYELLSS N TWKKE++SHYHLTPTDD AWSNLLPRKML
Sbjct: 22 CNCDSLKECTNTPTQLGSHTFRYELLSSGNVTWKKELFSHYHLTPTDDFAWSNLLPRKML 81
Query: 75 SETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDV 134
E +E++W M+YR+MKN DG ++ G LKE+SLHDV+LDP+SLH AQ TNL+YLLMLDV
Sbjct: 82 KEENEYNWEMMYRQMKNKDGLRIPGGMLKEISLHDVRLDPNSLHGTAQTTNLKYLLMLDV 141
Query: 135 DSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTA 194
D L+WSF+KTAG PT G+ Y GWE CELRGHFVGHYLSASA MWAST N LKEKM+A
Sbjct: 142 DRLLWSFRKTAGLPTPGEPYVGWEKSDCELRGHFVGHYLSASAQMWASTGNSVLKEKMSA 201
Query: 195 VVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA 254
+VS L+ CQ+KMG+GYLSAFPSE+FDRFEA++PVWAPYYTIHKILAGLLDQYTFA N+QA
Sbjct: 202 LVSGLATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAGLLDQYTFAGNSQA 261
Query: 255 LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHL 314
LKM WMVEYFYNRVQNVI KY+VERH+ SLNEETGGMNDVLYRLY IT + KHLLLAHL
Sbjct: 262 LKMVTWMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYRITGNTKHLLLAHL 321
Query: 315 FDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHG 374
FDKPCFLGLLAVQA+DISGFH NTHIP+V+GSQMRYEVTGDPLYK T+FMDIVN+SH
Sbjct: 322 FDKPCFLGLLAVQAEDISGFHVNTHIPIVVGSQMRYEVTGDPLYKEISTYFMDIVNSSHS 381
Query: 375 YATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALT 434
YATGGTS EFW DPKRLA LGTE EESCTTYNMLKVSR+LF+WTKE+ YADYYERALT
Sbjct: 382 YATGGTSVHEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTKEIAYADYYERALT 441
Query: 435 NGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
NGVLSIQRGT+PGVMIYMLPLG G SKA SYHGWGT F SFWCCYGTG+
Sbjct: 442 NGVLSIQRGTDPGVMIYMLPLGSGSSKAISYHGWGTPFESFWCCYGTGI 490
>gi|297807305|ref|XP_002871536.1| hypothetical protein ARALYDRAFT_909245 [Arabidopsis lyrata subsp.
lyrata]
gi|297317373|gb|EFH47795.1| hypothetical protein ARALYDRAFT_909245 [Arabidopsis lyrata subsp.
lyrata]
Length = 862
Score = 760 bits (1963), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/483 (73%), Positives = 408/483 (84%), Gaps = 6/483 (1%)
Query: 4 FVFKVLVLFLSCWVALC--KECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHLTPTD 61
+ +++L + +V +C KECTN+ QL+SHTFR ELL SKNET K E++SHYHLTPTD
Sbjct: 5 LIITIVLLLYTSFVLVCVAKECTNTPTQLSSHTFRSELLQSKNETLKTELFSHYHLTPTD 64
Query: 62 DSAWSNLLPRKML-SETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWR 120
D+AWS LLPRKML E DEF+WTM+YR K+ + +G+FLKEVSLHDV+LDP+S H R
Sbjct: 65 DAAWSTLLPRKMLKEEADEFAWTMLYRTFKDSNS---SGNFLKEVSLHDVRLDPNSFHGR 121
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQQTNLEYLLMLDVD L WSF+K AG G Y GWE P ELRGHFVGHYLSA+A+MW
Sbjct: 122 AQQTNLEYLLMLDVDGLAWSFRKEAGLDAPGDHYGGWEKPDSELRGHFVGHYLSATAYMW 181
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILA 240
ASTHN TLKEKM+A+VSALSECQ K G+GYLSAFPS FDRFEA+ PVWAPYYTIHKI+A
Sbjct: 182 ASTHNDTLKEKMSALVSALSECQQKSGTGYLSAFPSSFFDRFEAITPVWAPYYTIHKIIA 241
Query: 241 GLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLY 300
GL+DQY A N+QAL+M M +YFY RV+NVI KYSVERHW SLNEETGGMND+LY+LY
Sbjct: 242 GLVDQYKLAGNSQALQMATGMADYFYGRVRNVIRKYSVERHWQSLNEETGGMNDILYQLY 301
Query: 301 TITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKV 360
+IT D K+LLLAHLFDKPCFLG+LA+QADDISGFH+NTHIP+V+GSQ RYE+TGDPL+K
Sbjct: 302 SITGDSKYLLLAHLFDKPCFLGVLAIQADDISGFHSNTHIPIVVGSQQRYEITGDPLHKE 361
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWT 420
FFMDIVNASH YATGGTS EFW +PKR+A+TL TENEESCTTYNMLKVSR+LFRWT
Sbjct: 362 ISIFFMDIVNASHSYATGGTSVSEFWQNPKRMATTLQTENEESCTTYNMLKVSRNLFRWT 421
Query: 421 KEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYG 480
KE+ YADYYERALTNGVL IQRGT+PG+MIYMLPLG+G SKA +YHGWGT + SFWCCYG
Sbjct: 422 KEVSYADYYERALTNGVLGIQRGTQPGLMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYG 481
Query: 481 TGL 483
TG+
Sbjct: 482 TGI 484
>gi|15239944|ref|NP_196799.1| uncharacterized protein [Arabidopsis thaliana]
gi|7630051|emb|CAB88259.1| putative protein [Arabidopsis thaliana]
gi|26451123|dbj|BAC42665.1| unknown protein [Arabidopsis thaliana]
gi|332004451|gb|AED91834.1| uncharacterized protein [Arabidopsis thaliana]
Length = 861
Score = 759 bits (1959), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/479 (74%), Positives = 405/479 (84%), Gaps = 6/479 (1%)
Query: 8 VLVLFLSCWV--ALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHLTPTDDSAW 65
L+L+ S +V ++ KECTN+ QL+SHTFR ELL SKNET K E++SHYHLTP DDSAW
Sbjct: 10 ALLLYTSSFVLVSVAKECTNTPTQLSSHTFRSELLQSKNETLKTELFSHYHLTPADDSAW 69
Query: 66 SNLLPRKML-SETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQT 124
S+LLPRKML E DEF+WTM+YRK K+ + +G+FLK+VSLHDV+LDP S HWRAQQT
Sbjct: 70 SSLLPRKMLKEEADEFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPDSFHWRAQQT 126
Query: 125 NLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTH 184
NLEYLLMLDVD L WSF+K AG G Y GWE P ELRGHFVGHYLSA+A+MWASTH
Sbjct: 127 NLEYLLMLDVDGLAWSFRKEAGLDAPGDYYGGWERPDSELRGHFVGHYLSATAYMWASTH 186
Query: 185 NVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLD 244
N TLKEKM+A+VSALSECQ K G+GYLSAFPS FDRFEA+ PVWAPYYTIHKILAGL+D
Sbjct: 187 NDTLKEKMSALVSALSECQQKSGTGYLSAFPSSFFDRFEAITPVWAPYYTIHKILAGLVD 246
Query: 245 QYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQ 304
QY A N+QALKM M +YFY RV+NVI KYSVERHW SLNEETGGMNDVLY+LY+IT
Sbjct: 247 QYKLAGNSQALKMATGMADYFYGRVRNVIRKYSVERHWQSLNEETGGMNDVLYQLYSITG 306
Query: 305 DPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTF 364
D K+LLLAHLFDKPCFLG+LA+QADDISGFHANTHIP+V+GSQ RYE+TGD L+K F
Sbjct: 307 DSKYLLLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEISMF 366
Query: 365 FMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMV 424
FMDI NASH YATGGTS EFW DPKR+A+ L TENEESCTTYNMLKVSR+LFRWTKE+
Sbjct: 367 FMDIFNASHSYATGGTSVSEFWQDPKRMATALQTENEESCTTYNMLKVSRNLFRWTKEVS 426
Query: 425 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
YADYYERALTNGVL IQRGT+PG+MIYMLPLG+G SKA +YHGWGT + SFWCCYGTG+
Sbjct: 427 YADYYERALTNGVLGIQRGTQPGLMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYGTGI 485
>gi|356541181|ref|XP_003539059.1| PREDICTED: uncharacterized protein LOC100781521 [Glycine max]
Length = 854
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/484 (74%), Positives = 402/484 (83%), Gaps = 4/484 (0%)
Query: 1 MKNFVFKVLVLFLSCWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHLTPT 60
M+ FVF V V L C KECTN Q SHTFRYELL SKN TWK EV HYHLTPT
Sbjct: 1 MEAFVF-VFVAILLCGCVAAKECTNIPTQ--SHTFRYELLMSKNATWKAEVMDHYHLTPT 57
Query: 61 DDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWR 120
D++ W++LLPRK LSE ++ W ++YRK+KN FK FLKEV L DV+L S+H R
Sbjct: 58 DETVWADLLPRKFLSEQNQHDWGVMYRKIKNMGVFKSGEGFLKEVPLQDVRLHKDSIHAR 117
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQQTNLEYLLMLDVDSL+WSF+KTAG T G Y GWE P ELRGHFVGHYLSASA MW
Sbjct: 118 AQQTNLEYLLMLDVDSLIWSFRKTAGLSTPGTPYGGWEGPEVELRGHFVGHYLSASALMW 177
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILA 240
AST N TLK+KM+++V+ LS CQ K+G+GYLSAFPSE FDRFE ++PVWAPYYTIHKILA
Sbjct: 178 ASTQNDTLKQKMSSLVAGLSACQEKIGTGYLSAFPSEFFDRFETVQPVWAPYYTIHKILA 237
Query: 241 GLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLY 300
GLLDQ+TFA N QALKM WMV+YFYNRVQNVITKY+V RH+ SLNEETGGMNDVLYRLY
Sbjct: 238 GLLDQHTFAGNPQALKMVTWMVDYFYNRVQNVITKYTVNRHYESLNEETGGMNDVLYRLY 297
Query: 301 TITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKV 360
+IT D KHL+LAHLFDKPCFLGLLA+QA+DI+ FHANTHIPVV+GSQMRYE+TGDPLYK
Sbjct: 298 SITGDSKHLVLAHLFDKPCFLGLLAMQANDIANFHANTHIPVVVGSQMRYEITGDPLYKQ 357
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL-GTENEESCTTYNMLKVSRHLFRW 419
GTFFMD+VN+SH YATGGTS EFWSDPKR+A L TENEESCTTYNMLKVSRHLFRW
Sbjct: 358 IGTFFMDLVNSSHSYATGGTSVSEFWSDPKRIADNLRTTENEESCTTYNMLKVSRHLFRW 417
Query: 420 TKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCY 479
TKE+ YADYYERALTNGVLSIQRGT+PGVMIYMLPLG SKA++ H WGT+F SFWCCY
Sbjct: 418 TKEVSYADYYERALTNGVLSIQRGTDPGVMIYMLPLGFAVSKARTGHSWGTQFDSFWCCY 477
Query: 480 GTGL 483
GTG+
Sbjct: 478 GTGI 481
>gi|356541912|ref|XP_003539416.1| PREDICTED: uncharacterized protein LOC100783150 [Glycine max]
Length = 854
Score = 746 bits (1927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/484 (73%), Positives = 401/484 (82%), Gaps = 4/484 (0%)
Query: 1 MKNFVFKVLVLFLSCWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHLTPT 60
M+ VF LV L C KECTN Q SHTFRYELL S N TWK EV HYHLTPT
Sbjct: 1 MEALVF-ALVAILLCGCDAAKECTNIPTQ--SHTFRYELLMSTNATWKAEVMDHYHLTPT 57
Query: 61 DDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWR 120
D++AW++LLPRK+LSE ++ W ++YRK+KN FK FLKEV L DV+L S+H R
Sbjct: 58 DETAWADLLPRKLLSEQNQHDWGVMYRKIKNMGVFKSGEGFLKEVPLQDVRLHKDSIHGR 117
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQQTNLEYLLMLDVDSL+WSF+KTA T G Y GWE P ELRGHFVGHYLSASA MW
Sbjct: 118 AQQTNLEYLLMLDVDSLIWSFRKTAALSTPGTPYGGWEGPEVELRGHFVGHYLSASALMW 177
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILA 240
AST N TLK+KM+++V+ LS CQ K+G+GYLSAFPSE FDRFEA++PVWAPYYTIHKILA
Sbjct: 178 ASTQNDTLKQKMSSLVAGLSACQEKIGTGYLSAFPSEFFDRFEAVQPVWAPYYTIHKILA 237
Query: 241 GLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLY 300
GLLDQ+TFA N QALKM WMV+YFYNRVQNVITKY+V RH+ S+NEETGGMNDVLYRLY
Sbjct: 238 GLLDQHTFAGNPQALKMVTWMVDYFYNRVQNVITKYTVNRHYQSMNEETGGMNDVLYRLY 297
Query: 301 TITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKV 360
+IT D KHL+LAHLFDKPCFLGLLAVQA+DI+ HANTHIP+V+GSQMRYE+TGDPLYK
Sbjct: 298 SITGDSKHLVLAHLFDKPCFLGLLAVQANDIADLHANTHIPIVVGSQMRYEITGDPLYKQ 357
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL-GTENEESCTTYNMLKVSRHLFRW 419
GTFFMD+VN+SH YATGGTS EFWSDPKR+A L TENEESCTTYNMLKVSRHLFRW
Sbjct: 358 IGTFFMDLVNSSHSYATGGTSVREFWSDPKRIADNLRTTENEESCTTYNMLKVSRHLFRW 417
Query: 420 TKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCY 479
TKE+ YADYYERALTNGVLSIQRGT+PGVMIYMLPLG SKA++ H WGT+F SFWCCY
Sbjct: 418 TKEVSYADYYERALTNGVLSIQRGTDPGVMIYMLPLGFAVSKARTGHSWGTQFDSFWCCY 477
Query: 480 GTGL 483
GTG+
Sbjct: 478 GTGI 481
>gi|297807309|ref|XP_002871538.1| hypothetical protein ARALYDRAFT_350453 [Arabidopsis lyrata subsp.
lyrata]
gi|297317375|gb|EFH47797.1| hypothetical protein ARALYDRAFT_350453 [Arabidopsis lyrata subsp.
lyrata]
Length = 860
Score = 745 bits (1923), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/482 (72%), Positives = 403/482 (83%), Gaps = 6/482 (1%)
Query: 5 VFKVLVLFLSCWVALC--KECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHLTPTDD 62
+ + +L + +V +C KECT+ +L+SHT R ELL S+NET K E+ SHYHLTPTDD
Sbjct: 6 IITIALLLFTSFVLVCVAKECTDIPTKLSSHTLRSELLQSQNETLKTELSSHYHLTPTDD 65
Query: 63 SAWSNLLPRKML-SETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWRA 121
+AWS LLPRKML ETD+F+WTM+YRK K+ + +G+FLK+VSLHDV+LDPSS HWRA
Sbjct: 66 AAWSTLLPRKMLKEETDDFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPSSFHWRA 122
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWA 181
QQTNLEYLLML+VD L +SF+K AG G Y GWE P ELRGHFVGHYLSA+A+MWA
Sbjct: 123 QQTNLEYLLMLNVDGLAYSFRKVAGLDAPGVPYGGWEKPDSELRGHFVGHYLSATAYMWA 182
Query: 182 STHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAG 241
STHN TLK KM+A+VSAL+ECQ K G+GYLSAFPS FDRFEA+ VWAPYYTIHKILAG
Sbjct: 183 STHNDTLKTKMSALVSALAECQQKSGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAG 242
Query: 242 LLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYT 301
L+DQY A NTQALKM M +YFY RVQNVI KYSVERHW SLNEETGGMNDVLY+LY+
Sbjct: 243 LVDQYKLAGNTQALKMATGMADYFYGRVQNVIRKYSVERHWLSLNEETGGMNDVLYQLYS 302
Query: 302 ITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVT 361
IT+D K+L LAHLFDKPCFLG+LA+QADDISGFHANTHIP+V+GSQ RYE+TGD L+K
Sbjct: 303 ITRDSKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEI 362
Query: 362 GTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTK 421
FFMDIVNASH YATGGTS EFW DPKR+A+TL TENEESCTTYNMLKVSR+LFRWTK
Sbjct: 363 SMFFMDIVNASHSYATGGTSVKEFWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWTK 422
Query: 422 EMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
E+ YADYYERALTNGVL IQRGT+PG MIYMLPLG+G SKA +YHGWGT + SFWCCYGT
Sbjct: 423 EVSYADYYERALTNGVLGIQRGTQPGRMIYMLPLGQGVSKAVTYHGWGTPYDSFWCCYGT 482
Query: 482 GL 483
G+
Sbjct: 483 GI 484
>gi|30684197|ref|NP_196800.2| uncharacterized protein [Arabidopsis thaliana]
gi|28393685|gb|AAO42255.1| unknown protein [Arabidopsis thaliana]
gi|332004452|gb|AED91835.1| uncharacterized protein [Arabidopsis thaliana]
Length = 865
Score = 742 bits (1916), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/487 (72%), Positives = 401/487 (82%), Gaps = 7/487 (1%)
Query: 1 MKNFVFKVLVLFLSC---WVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHL 57
MK+ V + L L V L KECT+ +L+SHT R ELL S+N K E +SHYHL
Sbjct: 6 MKSGVIITIALLLYTSFLLVCLAKECTDIPTKLSSHTLRSELLQSQNANLKSEEFSHYHL 65
Query: 58 TPTDDSAWSNLLPRKML-SETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSS 116
TPTDDSAWS LLPRKML ETD+F+WTM+YRK K+ + +G+FLK+VSLHDV+LDPSS
Sbjct: 66 TPTDDSAWSTLLPRKMLKEETDDFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPSS 122
Query: 117 LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSAS 176
HWRAQQTNLEYLLMLDVD L ++F+K AG G Y GWE P ELRGHFVGHYLSA+
Sbjct: 123 FHWRAQQTNLEYLLMLDVDGLAYNFRKEAGLNAPGVPYGGWEKPDSELRGHFVGHYLSAT 182
Query: 177 AHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIH 236
A+MWASTHN TLK KMTA+VSAL+ECQ K G+GYLSAFPS FDRFEA+ VWAPYYTIH
Sbjct: 183 AYMWASTHNETLKAKMTALVSALAECQQKYGTGYLSAFPSSFFDRFEAITHVWAPYYTIH 242
Query: 237 KILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVL 296
KILAGL+DQY A NTQALKM M +YFY RVQNVI KYSVERHW SLNEETGGMNDVL
Sbjct: 243 KILAGLVDQYKLAGNTQALKMATGMADYFYGRVQNVIKKYSVERHWLSLNEETGGMNDVL 302
Query: 297 YRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP 356
Y+LY+IT+D K+L LAHLFDKPCFLG+LA+QADDISGFHANTHIP+V+GSQ RYE+TGD
Sbjct: 303 YQLYSITRDSKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDL 362
Query: 357 LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
L+K FFMDIVNASH YATGGTS EFW DPKR+A+TL TENEESCTTYNMLKVSR+L
Sbjct: 363 LHKEIPMFFMDIVNASHSYATGGTSVKEFWQDPKRMATTLQTENEESCTTYNMLKVSRNL 422
Query: 417 FRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFW 476
FRWTKE+ YADYYERALTNGVL IQRGT+PG MIYMLPLG+G SKA +YHGWGT + SFW
Sbjct: 423 FRWTKEVSYADYYERALTNGVLGIQRGTDPGRMIYMLPLGKGVSKAVTYHGWGTPYDSFW 482
Query: 477 CCYGTGL 483
CCYGTG+
Sbjct: 483 CCYGTGI 489
>gi|7630052|emb|CAB88260.1| putative protein [Arabidopsis thaliana]
Length = 860
Score = 742 bits (1916), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/487 (72%), Positives = 401/487 (82%), Gaps = 7/487 (1%)
Query: 1 MKNFVFKVLVLFLSC---WVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHL 57
MK+ V + L L V L KECT+ +L+SHT R ELL S+N K E +SHYHL
Sbjct: 1 MKSGVIITIALLLYTSFLLVCLAKECTDIPTKLSSHTLRSELLQSQNANLKSEEFSHYHL 60
Query: 58 TPTDDSAWSNLLPRKML-SETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSS 116
TPTDDSAWS LLPRKML ETD+F+WTM+YRK K+ + +G+FLK+VSLHDV+LDPSS
Sbjct: 61 TPTDDSAWSTLLPRKMLKEETDDFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPSS 117
Query: 117 LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSAS 176
HWRAQQTNLEYLLMLDVD L ++F+K AG G Y GWE P ELRGHFVGHYLSA+
Sbjct: 118 FHWRAQQTNLEYLLMLDVDGLAYNFRKEAGLNAPGVPYGGWEKPDSELRGHFVGHYLSAT 177
Query: 177 AHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIH 236
A+MWASTHN TLK KMTA+VSAL+ECQ K G+GYLSAFPS FDRFEA+ VWAPYYTIH
Sbjct: 178 AYMWASTHNETLKAKMTALVSALAECQQKYGTGYLSAFPSSFFDRFEAITHVWAPYYTIH 237
Query: 237 KILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVL 296
KILAGL+DQY A NTQALKM M +YFY RVQNVI KYSVERHW SLNEETGGMNDVL
Sbjct: 238 KILAGLVDQYKLAGNTQALKMATGMADYFYGRVQNVIKKYSVERHWLSLNEETGGMNDVL 297
Query: 297 YRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP 356
Y+LY+IT+D K+L LAHLFDKPCFLG+LA+QADDISGFHANTHIP+V+GSQ RYE+TGD
Sbjct: 298 YQLYSITRDSKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDL 357
Query: 357 LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
L+K FFMDIVNASH YATGGTS EFW DPKR+A+TL TENEESCTTYNMLKVSR+L
Sbjct: 358 LHKEIPMFFMDIVNASHSYATGGTSVKEFWQDPKRMATTLQTENEESCTTYNMLKVSRNL 417
Query: 417 FRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFW 476
FRWTKE+ YADYYERALTNGVL IQRGT+PG MIYMLPLG+G SKA +YHGWGT + SFW
Sbjct: 418 FRWTKEVSYADYYERALTNGVLGIQRGTDPGRMIYMLPLGKGVSKAVTYHGWGTPYDSFW 477
Query: 477 CCYGTGL 483
CCYGTG+
Sbjct: 478 CCYGTGI 484
>gi|359478753|ref|XP_002283032.2| PREDICTED: uncharacterized protein LOC100250068 [Vitis vinifera]
Length = 874
Score = 742 bits (1916), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/470 (74%), Positives = 387/470 (82%), Gaps = 3/470 (0%)
Query: 15 CWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHY-HLTPTDDSAWSNLLPRKM 73
C L K+CTNS L+SHT RYELL SKNE+ K E +HY +L TD S W LPRK
Sbjct: 19 CGCGLGKKCTNSGSPLSSHTLRYELLFSKNESRKAEALAHYSNLIRTDGSGWLTSLPRKA 78
Query: 74 LSETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLD 133
L E DEFS M Y+ MK+ DG FLKE SLHDV+L SLHWRAQQTNLEYLLMLD
Sbjct: 79 LREEDEFSRAMKYQTMKSYDGSN--SKFLKEFSLHDVRLGSDSLHWRAQQTNLEYLLMLD 136
Query: 134 VDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMT 193
D LVWSF++TAG PT Y GWE P ELRGHFVGHYLSASA MWASTHN +LKEKM+
Sbjct: 137 ADRLVWSFRRTAGLPTPCSPYGGWESPDGELRGHFVGHYLSASAQMWASTHNESLKEKMS 196
Query: 194 AVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQ 253
AVV AL ECQ KMG+GYLSAFPSE FDRFEAL+ VWAPYYTIHKILAGLLDQYT N Q
Sbjct: 197 AVVCALGECQKKMGTGYLSAFPSELFDRFEALEEVWAPYYTIHKILAGLLDQYTLGGNAQ 256
Query: 254 ALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAH 313
ALKM WMVEYFYNRVQNVI+ YS+ERHW SLNEETGGMND LY LY IT D KH +LAH
Sbjct: 257 ALKMVTWMVEYFYNRVQNVISSYSIERHWLSLNEETGGMNDFLYNLYRITGDQKHFVLAH 316
Query: 314 LFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASH 373
LFDKPCFLGLLA+QADDISGFHANTHIP+V+G+QMRYE+TGDPLYK G FF+D VN+SH
Sbjct: 317 LFDKPCFLGLLAMQADDISGFHANTHIPIVVGAQMRYEITGDPLYKTIGAFFIDTVNSSH 376
Query: 374 GYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERAL 433
YATGGTS EFWSDPKR+A+TL TEN ESCTTYNMLKVSR+LFRWTKE+ YADYYERAL
Sbjct: 377 SYATGGTSVDEFWSDPKRMATTLQTENAESCTTYNMLKVSRNLFRWTKEVAYADYYERAL 436
Query: 434 TNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
TNG+LSIQRGT+PGVM+YMLPLG G+SKA+SYHGWGT+F SFWCCYGTG+
Sbjct: 437 TNGILSIQRGTDPGVMLYMLPLGHGNSKARSYHGWGTKFHSFWCCYGTGI 486
>gi|297811349|ref|XP_002873558.1| hypothetical protein ARALYDRAFT_488069 [Arabidopsis lyrata subsp.
lyrata]
gi|297319395|gb|EFH49817.1| hypothetical protein ARALYDRAFT_488069 [Arabidopsis lyrata subsp.
lyrata]
Length = 860
Score = 741 bits (1912), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/487 (72%), Positives = 405/487 (83%), Gaps = 7/487 (1%)
Query: 1 MKNFVFKVLVLFL-SCWVALC--KECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHL 57
MK+ V + L L + ++ +C KECT+ +L+SHT ELL S N+T K E++SHYHL
Sbjct: 1 MKSGVIITIALLLYTSFLLVCVAKECTDIPTKLSSHTLNSELLQSHNKTLKTELFSHYHL 60
Query: 58 TPTDDSAWSNLLPRKML-SETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSS 116
TPTDD+AWS LLPRKML ETDEF+WTM+YRK K+ + G+FLK+VSLHDV+LDP+S
Sbjct: 61 TPTDDAAWSTLLPRKMLKEETDEFAWTMLYRKFKDSNS---VGNFLKDVSLHDVRLDPNS 117
Query: 117 LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSAS 176
HWRAQQTNLEYLLMLDVD L +SF+K AG +G Y GWE P ELRGHFVGHYLSA+
Sbjct: 118 FHWRAQQTNLEYLLMLDVDGLAYSFRKVAGLDASGVPYGGWEKPDSELRGHFVGHYLSAT 177
Query: 177 AHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIH 236
AHMWASTHN TLK KM+A+VSAL+ECQ K G+GYLSAFPS FDRFEA+ VWAPYYTIH
Sbjct: 178 AHMWASTHNDTLKAKMSALVSALAECQQKSGTGYLSAFPSSFFDRFEAITHVWAPYYTIH 237
Query: 237 KILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVL 296
KILAGL+DQY A N QALKM M +YFY RV+NVITKYSVERH+ SLNEETGGMNDVL
Sbjct: 238 KILAGLVDQYKLAGNIQALKMATGMADYFYGRVRNVITKYSVERHYQSLNEETGGMNDVL 297
Query: 297 YRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP 356
Y+LY+IT+D K+L LAHLFDKPCFLG+LA+QADDISGFHANTHIP+V+GSQ RYE+TGD
Sbjct: 298 YQLYSITRDSKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDL 357
Query: 357 LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
L+K FFMDI+NASH YATGGTS EFW DPKR+A+TL TENEESCTTYNMLKVSR+L
Sbjct: 358 LHKEISMFFMDIINASHSYATGGTSVREFWQDPKRMATTLQTENEESCTTYNMLKVSRNL 417
Query: 417 FRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFW 476
FRWTKE+ YADYYERALTNGVL IQRGT+PG MIYMLPLG+G SKA +YHGWGT + SFW
Sbjct: 418 FRWTKEVSYADYYERALTNGVLGIQRGTQPGRMIYMLPLGQGVSKAVTYHGWGTPYDSFW 477
Query: 477 CCYGTGL 483
CCYGTG+
Sbjct: 478 CCYGTGI 484
>gi|357472931|ref|XP_003606750.1| hypothetical protein MTR_4g065200 [Medicago truncatula]
gi|355507805|gb|AES88947.1| hypothetical protein MTR_4g065200 [Medicago truncatula]
Length = 646
Score = 732 bits (1889), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/484 (73%), Positives = 399/484 (82%), Gaps = 4/484 (0%)
Query: 1 MKNFVFKVLVLFLSCWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHLTPT 60
MK FVF + + L VA KEC N+ PQ SHTFRYEL +SKNETWKKEV SHYHLTPT
Sbjct: 1 MKVFVFMFMAIMLFGCVA-GKECMNNLPQ--SHTFRYELWASKNETWKKEVMSHYHLTPT 57
Query: 61 DDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWR 120
D+SAW++LLPRK+LSE ++ W YR+MKN D K FLKEV L DV+L S+H +
Sbjct: 58 DESAWADLLPRKLLSEENQRDWAAKYREMKNADLSKPPVGFLKEVPLGDVRLLEGSIHAQ 117
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQ+TNLEYLLMLDVDSL+WSF+KTAG PT G Y GWEDP+ ELRGHFVGHYLSASA MW
Sbjct: 118 AQKTNLEYLLMLDVDSLIWSFRKTAGLPTPGTPYGGWEDPSIELRGHFVGHYLSASALMW 177
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILA 240
AST N L EKM+A+VS LS CQ K+G+GYLSAFP+E FDR EAL+ WAPYYTIHKILA
Sbjct: 178 ASTKNDNLNEKMSALVSGLSACQEKIGTGYLSAFPTELFDRVEALQYAWAPYYTIHKILA 237
Query: 241 GLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLY 300
GLLDQYT N QALKM WMV+YFYNRV NVI K +V H+ SLNEE GGMNDVLYRLY
Sbjct: 238 GLLDQYTIGGNPQALKMVTWMVDYFYNRVMNVIQKLTVNGHYQSLNEEAGGMNDVLYRLY 297
Query: 301 TITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKV 360
+IT+D KHL+LAHLFDKPCFLG+LAVQA+DI+ FHANTHIP+V+GSQ+RYEVTGDPLYK
Sbjct: 298 SITRDSKHLVLAHLFDKPCFLGVLAVQANDIANFHANTHIPIVVGSQLRYEVTGDPLYKD 357
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL-GTENEESCTTYNMLKVSRHLFRW 419
G FFMDIVN+SH YATGGTS EFW+DPKR+A L TENEESCTTYNMLKVSRHLFRW
Sbjct: 358 IGAFFMDIVNSSHTYATGGTSVREFWNDPKRIADNLKSTENEESCTTYNMLKVSRHLFRW 417
Query: 420 TKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCY 479
TKE+ YADYYERALTNGVLSIQRGT+PGVMIYMLPLG G SKAK+ GWG F++FWCCY
Sbjct: 418 TKEVSYADYYERALTNGVLSIQRGTDPGVMIYMLPLGLGVSKAKTDKGWGNPFNTFWCCY 477
Query: 480 GTGL 483
GTG+
Sbjct: 478 GTGI 481
>gi|356557388|ref|XP_003546998.1| PREDICTED: uncharacterized protein LOC100815634 [Glycine max]
Length = 841
Score = 696 bits (1795), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/484 (70%), Positives = 396/484 (81%), Gaps = 12/484 (2%)
Query: 4 FVFKVLVLFLSCWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHLTPTDDS 63
F F +V++ C A KECTN+ Q SHTFRY+L +S NETW + SH HLT DD
Sbjct: 5 FAFVAIVVW-GC--AAGKECTNNDAQ--SHTFRYQLSTSTNETW--NIMSHNHLTTKDDH 57
Query: 64 AWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGD---FLKEVSLHDVKLDPSSLHWR 120
++LLPRK+L E ++ + M+ RK++ K FLK VSLHDV+L+ S+H +
Sbjct: 58 LLADLLPRKLLKEENQRNLDML-RKIEKVGVLKPPQQPQGFLKPVSLHDVRLNQGSIHAQ 116
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQ+TNLEYLLML+VD L+WSF+KTAG PT G Y GWEDP ELRGHFVGHYLSASA MW
Sbjct: 117 AQRTNLEYLLMLNVDRLLWSFRKTAGLPTPGTPYGGWEDPKMELRGHFVGHYLSASALMW 176
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILA 240
ASTHN +LK+KM+A+V+ LS CQ K+G+GYLSAFPSE FDR EA K VWAPYYT HKILA
Sbjct: 177 ASTHNDSLKKKMSALVANLSICQEKIGTGYLSAFPSEFFDRLEATKYVWAPYYTTHKILA 236
Query: 241 GLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLY 300
GLLDQ++ A+N QALKM WMV+YFYNRVQNVITK+S+ RH+ SLNEETGGMNDVLY+LY
Sbjct: 237 GLLDQHSIAENPQALKMVTWMVDYFYNRVQNVITKFSISRHYQSLNEETGGMNDVLYKLY 296
Query: 301 TITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKV 360
+IT DP+HLLLAHLFDKPCFLGLLAV+A+DI+ FHANTHIPV++GSQMRYEVTGDPLYK
Sbjct: 297 SITGDPRHLLLAHLFDKPCFLGLLAVKANDIAHFHANTHIPVIVGSQMRYEVTGDPLYKE 356
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL-GTENEESCTTYNMLKVSRHLFRW 419
GT FMD+VN+SH YATGGTS EFWSDPKR+A TL T+NEESCTTYNMLKVSRHLF W
Sbjct: 357 IGTLFMDLVNSSHTYATGGTSVNEFWSDPKRMADTLESTDNEESCTTYNMLKVSRHLFTW 416
Query: 420 TKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCY 479
TK++ YADYYERALTNGVLSIQRGTEPGVMIYMLP GRG SKAK+Y GWGT+F SFWCCY
Sbjct: 417 TKKVSYADYYERALTNGVLSIQRGTEPGVMIYMLPQGRGVSKAKTYFGWGTKFDSFWCCY 476
Query: 480 GTGL 483
GTG+
Sbjct: 477 GTGI 480
>gi|226497412|ref|NP_001145969.1| uncharacterized protein LOC100279496 precursor [Zea mays]
gi|223945575|gb|ACN26871.1| unknown [Zea mays]
Length = 879
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/490 (67%), Positives = 379/490 (77%), Gaps = 14/490 (2%)
Query: 8 VLVLFLSCWV--ALCKECTNSFPQLASHTFRY--ELLSSKNETWKKEVYSHY------HL 57
V+V+ L+ A K CTN+FP L SHT R +L T + + H+ HL
Sbjct: 16 VVVMLLAAGFRGAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQHL 75
Query: 58 TPTDDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFK----LAGDFLKEVSLHDVKLD 113
TPTD+S W +L+PR+ L + F W M+YR+++ G AG FL E SLHDV+L+
Sbjct: 76 TPTDESTWMSLMPRRALRREEAFDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDVRLE 135
Query: 114 PSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYL 173
P S++WRAQQTNLEYLL+LDVD LVWSF+K AG G Y GWE P +LRGHFVGHYL
Sbjct: 136 PGSMYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVGHYL 195
Query: 174 SASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYY 233
SA+A MWASTHN TL KM++VV AL +CQ KMG+GYLSAFPS+ FD EA+K VWAPYY
Sbjct: 196 SATAKMWASTHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWAPYY 255
Query: 234 TIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMN 293
TIHKI+ GLLDQYT A N+ AL M M YF +RV+NVI YS+ERHW SLNEETGGMN
Sbjct: 256 TIHKIMQGLLDQYTVAGNSMALDMVIKMANYFSDRVKNVIQNYSIERHWESLNEETGGMN 315
Query: 294 DVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVT 353
DVLY+LYTIT D KHL LAHLFDKPCFLGLLAVQAD ISGFH+NTHIPVVIG+QMRYEVT
Sbjct: 316 DVLYQLYTITHDMKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRYEVT 375
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVS 413
GDPLYK +FFMD +N+SH YATGGTSAGEFW+DPKRLA TL TENEESCTTYNMLKVS
Sbjct: 376 GDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTYNMLKVS 435
Query: 414 RHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFS 473
R+LFRWTKE+ YADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA SYHGWGT++
Sbjct: 436 RNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHGWGTKYD 495
Query: 474 SFWCCYGTGL 483
SFWCCYGTG+
Sbjct: 496 SFWCCYGTGI 505
>gi|219885159|gb|ACL52954.1| unknown [Zea mays]
Length = 879
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/490 (67%), Positives = 379/490 (77%), Gaps = 14/490 (2%)
Query: 8 VLVLFLSCWV--ALCKECTNSFPQLASHTFRY--ELLSSKNETWKKEVYSHY------HL 57
V+V+ L+ A K CTN+FP L SHT R +L T + + H+ HL
Sbjct: 16 VVVMLLAAGFRGAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQHL 75
Query: 58 TPTDDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFK----LAGDFLKEVSLHDVKLD 113
TPTD+S W +L+PR+ L + F W M+YR+++ G AG FL E SLHDV+L+
Sbjct: 76 TPTDESTWMSLMPRRALRREEAFDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDVRLE 135
Query: 114 PSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYL 173
P S++WRAQQTNLEYLL+LDVD LVWSF+K AG G Y GWE P +LRGHFVGHYL
Sbjct: 136 PGSMYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVGHYL 195
Query: 174 SASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYY 233
SA+A MWASTHN TL KM++VV AL +CQ KMG+GYLSAFPS+ FD EA+K VWAPYY
Sbjct: 196 SATAKMWASTHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWAPYY 255
Query: 234 TIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMN 293
TIHKI+ GLLDQYT A N+ AL M M YF +RV+NVI YS+ERHW SLNEETGGMN
Sbjct: 256 TIHKIMQGLLDQYTVAGNSMALDMVIKMANYFSDRVKNVIQNYSIERHWESLNEETGGMN 315
Query: 294 DVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVT 353
DVLY+LYTIT D KHL LAHLFDKPCFLGLLAVQAD ISGFH+NTHIPVVIG+QMRYEVT
Sbjct: 316 DVLYQLYTITHDMKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRYEVT 375
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVS 413
GDPLYK +FFMD +N+SH YATGGTSAGEFW+DPKRLA TL TENEESCTTYNMLKVS
Sbjct: 376 GDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTYNMLKVS 435
Query: 414 RHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFS 473
R+LFRWTKE+ YADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA SYHGWGT++
Sbjct: 436 RNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHGWGTKYD 495
Query: 474 SFWCCYGTGL 483
SFWCCYGTG+
Sbjct: 496 SFWCCYGTGI 505
>gi|326495110|dbj|BAJ85651.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 868
Score = 663 bits (1711), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/477 (66%), Positives = 368/477 (77%), Gaps = 16/477 (3%)
Query: 21 KECTNSFPQLASHTFRYELLSSKNETWKKEVYSH---YHLTPTDDSAWSNLLPRKMLS-- 75
K CTN+FP S E +++ + H HLTPTD+SAW L+PR+ LS
Sbjct: 24 KVCTNTFPSSDSVATHAERAAAQLRLPAGHGHGHDHEQHLTPTDESAWMELMPRRSLSGG 83
Query: 76 -----ETDEFSWTMIYRKMKNP----DGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNL 126
+ F W M+YR+++ DG AG FL E SLHDV+L P +++W+AQQTNL
Sbjct: 84 GGSTPPREAFDWLMLYRRLRGGAAAVDG--PAGPFLSEASLHDVRLQPGTIYWQAQQTNL 141
Query: 127 EYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNV 186
EYLL+LD D LVWSF+ AG G Y GWE P ELRGHFVGHYLSA+A MWASTHN
Sbjct: 142 EYLLLLDTDRLVWSFRTQAGLTATGTPYGGWEGPNVELRGHFVGHYLSATAKMWASTHND 201
Query: 187 TLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQY 246
TL+ KM++VV L +CQ KMG+GYLSAFPSE FDR EAL VWAPYYTIHK++ GLLDQY
Sbjct: 202 TLRAKMSSVVDVLYDCQKKMGTGYLSAFPSEFFDRAEALTTVWAPYYTIHKVMQGLLDQY 261
Query: 247 TFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDP 306
T A N++AL+M M YF +RV+N+I KYS+ERHW SLNEETGGMNDVLY+LYTIT D
Sbjct: 262 TVAGNSKALEMVVGMANYFSDRVKNIIQKYSIERHWASLNEETGGMNDVLYQLYTITDDL 321
Query: 307 KHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFM 366
KHL LAHLFDKPCFLGLLA+QAD ISGFH+NTHIPVV+G+QMRYEVTGD LYK T FM
Sbjct: 322 KHLTLAHLFDKPCFLGLLALQADSISGFHSNTHIPVVVGAQMRYEVTGDVLYKQIATSFM 381
Query: 367 DIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYA 426
D++N+SH YATGGTSAGEFWSDPKRLA+TL TEN ESCTTYNMLKVSR+LFRWTKE+ YA
Sbjct: 382 DMINSSHSYATGGTSAGEFWSDPKRLAATLSTENAESCTTYNMLKVSRNLFRWTKEIAYA 441
Query: 427 DYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
DYYERAL NGVLSIQRGT+PGVMIYMLP G SKA SYHGWGT++ SFWCCYGTG+
Sbjct: 442 DYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVSYHGWGTKYDSFWCCYGTGI 498
>gi|242060854|ref|XP_002451716.1| hypothetical protein SORBIDRAFT_04g006520 [Sorghum bicolor]
gi|241931547|gb|EES04692.1| hypothetical protein SORBIDRAFT_04g006520 [Sorghum bicolor]
Length = 888
Score = 663 bits (1711), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/483 (67%), Positives = 373/483 (77%), Gaps = 20/483 (4%)
Query: 21 KECTNSFPQL-ASHTFRY--ELLSSKNETWKKEVYS----------HYHLTPTDDSAWSN 67
K CTN+FP L +SHT R +L T + V HLTPTD+S W +
Sbjct: 33 KSCTNAFPGLTSSHTERAAAQLQRGPPATALQPVVHRHGHDHDHGHEQHLTPTDESTWMS 92
Query: 68 LLPRKMLSETDEFSWTMIYRKMKNPDGFKL-------AGDFLKEVSLHDVKLDPSSLHWR 120
L+PR+ L + F W M+YRK++ AG FL + SLHDV+L+P SL+WR
Sbjct: 93 LMPRRALRREEAFDWLMLYRKLRGATAGGAPRRPGVAAGTFLSDASLHDVRLEPGSLYWR 152
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQQTNLEYLL+LDVD LVWSF+K AG G Y GWE P ELRGHFVGHYLSA+A MW
Sbjct: 153 AQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPDVELRGHFVGHYLSATAKMW 212
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILA 240
ASTHN TL KM++V+ ALS+CQ KMG+GYLSAFP+E FDR EA+KPVWAPYYTIHKI+
Sbjct: 213 ASTHNDTLNAKMSSVIDALSDCQKKMGTGYLSAFPTEFFDRVEAIKPVWAPYYTIHKIMQ 272
Query: 241 GLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLY 300
GLLDQYT A N++AL M M YF +RV+NVI KYS+ERHW SLNEETGGMNDVLY+LY
Sbjct: 273 GLLDQYTVAGNSKALDMVVNMANYFSDRVKNVIQKYSIERHWESLNEETGGMNDVLYQLY 332
Query: 301 TITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKV 360
TIT D KHL LAHLFDKPCFLGLLAVQAD ISGFH+NTHIPVVIG+QMRYEVTGDPLYK
Sbjct: 333 TITNDLKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRYEVTGDPLYKQ 392
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWT 420
+FFMD +N+SH YATGGTSAGEFW+DPK LA TL TENEESCTTYNMLK+SR+LFRWT
Sbjct: 393 IASFFMDTINSSHSYATGGTSAGEFWTDPKHLAGTLSTENEESCTTYNMLKISRNLFRWT 452
Query: 421 KEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYG 480
KE+ YADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA SYH WGT++ SFWCCYG
Sbjct: 453 KEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHSWGTKYDSFWCCYG 512
Query: 481 TGL 483
TG+
Sbjct: 513 TGI 515
>gi|357139358|ref|XP_003571249.1| PREDICTED: uncharacterized protein LOC100841742 [Brachypodium
distachyon]
Length = 883
Score = 653 bits (1684), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/510 (63%), Positives = 382/510 (74%), Gaps = 31/510 (6%)
Query: 5 VFKVLVLFLSCWV---ALCKECTNSFPQL--ASHTFRY--ELLSSKNETWKKEV------ 51
F V+ + L+ V A K CTN+FP ASHT R +L ++++E +
Sbjct: 5 AFGVVAVLLATAVLRGAEAKVCTNTFPASGSASHTERAAAQLRAAESEDAALRLPGLVDH 64
Query: 52 -YSH-YHLTPTDDSAWSNLLPRKMLSET---------DEFSWTMIYRKMKNP-DG----- 94
+ H HL PTD+SAW L+PR++L+ + F W M+YRK++ DG
Sbjct: 65 GHGHEQHLIPTDESAWMALMPRRLLAGGAGGNGAPPREAFDWLMLYRKLRGGGDGAIDGP 124
Query: 95 -FKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA 153
AG FL E SLHDV+L P +++W+AQQTNLEYLL+LD D LVWSF+ AG P G
Sbjct: 125 AAAAAGPFLSEASLHDVRLQPGTVYWQAQQTNLEYLLLLDADRLVWSFRTQAGLPATGTP 184
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
Y GWE P+ ELRGHFVGHYL+A+A MWASTHN TL+ KM++V+ L +CQ KMG GYLSA
Sbjct: 185 YGGWEGPSVELRGHFVGHYLTAAAKMWASTHNDTLRTKMSSVIDTLYDCQKKMGMGYLSA 244
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP+E FDR EAL VWAPYYTIHKI+ GLLDQYT A +++AL+M M +YF RV+NVI
Sbjct: 245 FPTEFFDRAEALTTVWAPYYTIHKIMQGLLDQYTVAGSSKALEMVVGMADYFSGRVKNVI 304
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
KYS+ERHW SLNEETGGMNDVLY+LY IT D KHL LAHLFDKPCFLGLLAVQAD ISG
Sbjct: 305 QKYSIERHWASLNEETGGMNDVLYQLYAITNDLKHLTLAHLFDKPCFLGLLAVQADSISG 364
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
FH+NTHIPVVIG+QMRYEVTGD LYK + FMD++N+SH YATGGTSAGEFW DPKRLA
Sbjct: 365 FHSNTHIPVVIGAQMRYEVTGDVLYKQIASSFMDMINSSHSYATGGTSAGEFWYDPKRLA 424
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
+TL TENEESCTTYNMLKVSR+LFRWTKE+ YADYYERAL NGVLSIQRGT+PGVMIYML
Sbjct: 425 ATLSTENEESCTTYNMLKVSRNLFRWTKEISYADYYERALINGVLSIQRGTDPGVMIYML 484
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
P G SKA YHGWGT + SFWCCYGTG+
Sbjct: 485 PQAPGRSKAVGYHGWGTLYDSFWCCYGTGI 514
>gi|125538467|gb|EAY84862.1| hypothetical protein OsI_06226 [Oryza sativa Indica Group]
Length = 891
Score = 645 bits (1663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/493 (65%), Positives = 371/493 (75%), Gaps = 30/493 (6%)
Query: 21 KECTNSFPQL-ASHTFR------------------YELLSSKNETWKKEVYSHYHLTPTD 61
K+CTN FP L ASHT R +LL + HLTPTD
Sbjct: 26 KDCTNGFPGLTASHTERAAAAAELRPDGEVEAARVLDLLLPHGHGHGDDHDGDRHLTPTD 85
Query: 62 DSAWSNLLPRKMLSET------DEFSWTMIYRKMKNPDGFKLAGD-----FLKEVSLHDV 110
+S W +L+PR++L+ D F W M+YR ++ A L E SLHDV
Sbjct: 86 ESTWMSLMPRRLLASPASSPRRDAFDWLMLYRNLRGSGSGAGAIAASGGALLAEASLHDV 145
Query: 111 KLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVG 170
+L P +++W+AQQTNLEYLL+LDVD LVWSF+ AG P +G Y GWE P ELRGHFVG
Sbjct: 146 RLQPGTVYWQAQQTNLEYLLLLDVDRLVWSFRTQAGLPASGAPYGGWEGPGVELRGHFVG 205
Query: 171 HYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWA 230
HYLSA+A MWASTHN TL+ KM++VV AL +CQ KMGSGYLSAFPSE FDR E++K VWA
Sbjct: 206 HYLSATAKMWASTHNDTLQAKMSSVVDALHDCQKKMGSGYLSAFPSEFFDRVESIKAVWA 265
Query: 231 PYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETG 290
PYYTIHKI+ GLLDQYT A N++AL + M YF +RV+NVI KYS+ERHW SLNEE+G
Sbjct: 266 PYYTIHKIMQGLLDQYTVAGNSKALDLVVGMANYFSDRVKNVIQKYSIERHWASLNEESG 325
Query: 291 GMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRY 350
GMNDVLY+LYTIT D KHL LAHLFDKPCFLGLLAVQAD ISGFH+NTHIPVVIG+QMRY
Sbjct: 326 GMNDVLYQLYTITNDQKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRY 385
Query: 351 EVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNML 410
EVTGD LYK TFFMD +N+SH YATGGTSAGEFW++PKRLA TL TENEESCTTYNML
Sbjct: 386 EVTGDLLYKQIATFFMDTINSSHSYATGGTSAGEFWTNPKRLADTLSTENEESCTTYNML 445
Query: 411 KVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGT 470
KVSR+LFRWTKE+ YADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA SYHGWGT
Sbjct: 446 KVSRNLFRWTKELSYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVSYHGWGT 505
Query: 471 RFSSFWCCYGTGL 483
++ SFWCCYGTG+
Sbjct: 506 KYDSFWCCYGTGI 518
>gi|115444811|ref|NP_001046185.1| Os02g0195500 [Oryza sativa Japonica Group]
gi|49388119|dbj|BAD25250.1| unknown protein [Oryza sativa Japonica Group]
gi|113535716|dbj|BAF08099.1| Os02g0195500 [Oryza sativa Japonica Group]
gi|125581152|gb|EAZ22083.1| hypothetical protein OsJ_05746 [Oryza sativa Japonica Group]
Length = 891
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/493 (65%), Positives = 370/493 (75%), Gaps = 30/493 (6%)
Query: 21 KECTNSFPQL-ASHTFR------------------YELLSSKNETWKKEVYSHYHLTPTD 61
K+CTN FP L ASHT R +LL + HLTPTD
Sbjct: 26 KDCTNGFPGLTASHTERAAAAAEQRPDGEVEAARVLDLLLPHGHGHGDDHDGDRHLTPTD 85
Query: 62 DSAWSNLLPRKMLSET------DEFSWTMIYRKMKNPDGFKLAGD-----FLKEVSLHDV 110
+S W +L+PR++L+ D F W M+YR ++ A L E SLHDV
Sbjct: 86 ESTWMSLMPRRLLASPVSSPRRDAFDWLMLYRNLRGSGSGAGAIAASGGALLAEASLHDV 145
Query: 111 KLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVG 170
+L P +++W+AQQTNLEYLL+LDVD LVWSF+ AG P +G Y GWE P ELRGHFVG
Sbjct: 146 RLQPGTVYWQAQQTNLEYLLLLDVDRLVWSFRTQAGLPASGAPYGGWEGPGVELRGHFVG 205
Query: 171 HYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWA 230
HYLSA+A MWASTHN TL KM++VV AL +CQ KMGSGYLSAFPSE FDR E++K VWA
Sbjct: 206 HYLSATAKMWASTHNDTLLAKMSSVVDALHDCQKKMGSGYLSAFPSEFFDRVESIKAVWA 265
Query: 231 PYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETG 290
PYYTIHKI+ GLLDQYT A N++AL + M YF +RV+NVI KYS+ERHW SLNEE+G
Sbjct: 266 PYYTIHKIMQGLLDQYTVAGNSKALDLVVGMANYFSDRVKNVIQKYSIERHWASLNEESG 325
Query: 291 GMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRY 350
GMNDVLY+LYTIT D KHL LAHLFDKPCFLGLLAVQAD ISGFH+NTHIPVVIG+QMRY
Sbjct: 326 GMNDVLYQLYTITNDQKHLTLAHLFDKPCFLGLLAVQADSISGFHSNTHIPVVIGAQMRY 385
Query: 351 EVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNML 410
EVTGD LYK TFFMD +N+SH YATGGTSAGEFW++PKRLA TL TENEESCTTYNML
Sbjct: 386 EVTGDLLYKQIATFFMDTINSSHSYATGGTSAGEFWTNPKRLADTLSTENEESCTTYNML 445
Query: 411 KVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGT 470
KVSR+LFRWTKE+ YADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA SYHGWGT
Sbjct: 446 KVSRNLFRWTKELSYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGRSKAVSYHGWGT 505
Query: 471 RFSSFWCCYGTGL 483
++ SFWCCYGTG+
Sbjct: 506 KYDSFWCCYGTGI 518
>gi|51090917|dbj|BAD35522.1| hypothetical protein [Oryza sativa Japonica Group]
gi|51090951|dbj|BAD35554.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 883
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 326/480 (67%), Positives = 372/480 (77%), Gaps = 22/480 (4%)
Query: 21 KECTNSFPQLASHTFRYELLSSKNETWK-KEVYSHY-HLTPTDDSAWSNLLPRKMLSETD 78
KECTN QL+SHT R L SS W+ +E Y H HL PTD++AW +L+P S +
Sbjct: 23 KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMPLAAASAS- 81
Query: 79 EFSWTMIYRKMKNPDGFKLAGD-----------FLKEVSLHDVKLD----PSSLHWRAQQ 123
EF W M+YR +K G +AGD FL+EVSLHDV+LD ++ RAQQ
Sbjct: 82 EFDWAMLYRSLK---GAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQQ 138
Query: 124 TNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWAST 183
TNLEYLL+L+VD LVWSF+ AG P GK Y GWE P ELRGHFVGHYLSA+A MWAST
Sbjct: 139 TNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVGHYLSAAAKMWAST 198
Query: 184 HNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLL 243
HN TL KM AVV AL +CQ G+GYLSAFP+E FDRFEA++PVWAPYYTIH I+ GLL
Sbjct: 199 HNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIH-IMQGLL 257
Query: 244 DQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTIT 303
DQ+T A N +AL M M +YF RV++VI +Y++ERHW SLNEETGGMNDVLY+LYTIT
Sbjct: 258 DQHTVAGNGKALGMVVAMADYFAGRVRSVIQRYTIERHWTSLNEETGGMNDVLYQLYTIT 317
Query: 304 QDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGT 363
+D +HL+LAHLFDKPCFLGLLAVQAD +SGFHANTHIPVVIG QMRYEVTGDPLYK T
Sbjct: 318 KDQRHLVLAHLFDKPCFLGLLAVQADSLSGFHANTHIPVVIGGQMRYEVTGDPLYKEIAT 377
Query: 364 FFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEM 423
FFMDIVN+SH YATGGTS EFWS+PK LA L TE EESCTTYNMLKVSRHLFRWTKE+
Sbjct: 378 FFMDIVNSSHSYATGGTSVSEFWSNPKHLAEALTTETEESCTTYNMLKVSRHLFRWTKEI 437
Query: 424 VYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
YADYYERAL NGVLSIQRG +PGVMIYMLP G G SKA SYHGWGT+++SFWCCYGTG+
Sbjct: 438 AYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAVSYHGWGTQYNSFWCCYGTGI 497
>gi|357123866|ref|XP_003563628.1| PREDICTED: uncharacterized protein LOC100829886 [Brachypodium
distachyon]
Length = 850
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 317/480 (66%), Positives = 366/480 (76%), Gaps = 13/480 (2%)
Query: 17 VALCKECTNSFPQLASHTFRYELLSSKN-ETWKKEV--YSHYHLTPTDDSAWSNL-LPRK 72
+A+ KECTN QL+SHT R L + E W+ + H H++PTD++ W +L P
Sbjct: 1 MAVAKECTNVPTQLSSHTVRARLQGDPSAEEWRLRALFHDHAHVSPTDEATWMDLRAPLA 60
Query: 73 MLSETDEFSWTMIYRKMKNPDGFKLAGD---FLKEVSLHDVKLD--PSSLHWRAQQTNLE 127
+ T+E W M+YR +K A FL+EV L DV+LD +++ RAQQTNLE
Sbjct: 61 SSAATEESGWAMLYRALKGSASGGSASAAAGFLEEVPLQDVRLDMEEDAVYGRAQQTNLE 120
Query: 128 YLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVT 187
YLL+LDVD L+WSF+ AG P GK Y GWE ELRGHFVGHYLSA+A WASTHN T
Sbjct: 121 YLLLLDVDRLLWSFRTQAGLPAPGKPYGGWEGADVELRGHFVGHYLSAAAKTWASTHNGT 180
Query: 188 LKEKMTAVVSALSECQNKM----GSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLL 243
L KM+AVV AL ECQ G+GYLSAFP+E FDRFEA++PVWAPYYT+HKI+ GLL
Sbjct: 181 LAAKMSAVVDALHECQQAAAANGGNGYLSAFPAEFFDRFEAIQPVWAPYYTVHKIMQGLL 240
Query: 244 DQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTIT 303
DQ+T A N +AL M M YF RV++VI ++ +ERHW SLNEETGGMNDVLY+LYTIT
Sbjct: 241 DQHTVAGNGKALAMAVAMAGYFGGRVRSVIQRHGIERHWTSLNEETGGMNDVLYQLYTIT 300
Query: 304 QDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGT 363
D +HL+LAHLFDKPCFLGLLAVQAD ++GFHANTHIPVV+G QMRYEVTGDPLYK T
Sbjct: 301 NDQRHLVLAHLFDKPCFLGLLAVQADSLTGFHANTHIPVVVGGQMRYEVTGDPLYKEIST 360
Query: 364 FFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEM 423
FFMDIVN SH YATGGTS EFWSDPKRLASTL TENEESCTTYNMLKVSRHLFRWTKE+
Sbjct: 361 FFMDIVNTSHSYATGGTSVSEFWSDPKRLASTLTTENEESCTTYNMLKVSRHLFRWTKEI 420
Query: 424 VYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
YADYYERAL NGVLSIQRG +PGVMIYMLP G G SKA SYHGWGT++ SFWCCYGTG+
Sbjct: 421 AYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAVSYHGWGTQYDSFWCCYGTGI 480
>gi|297746368|emb|CBI16424.3| unnamed protein product [Vitis vinifera]
Length = 741
Score = 617 bits (1592), Expect = e-174, Method: Compositional matrix adjust.
Identities = 282/353 (79%), Positives = 309/353 (87%)
Query: 131 MLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKE 190
MLD D LVWSF++TAG PT Y GWE P ELRGHFVGHYLSASA MWASTHN +LKE
Sbjct: 1 MLDADRLVWSFRRTAGLPTPCSPYGGWESPDGELRGHFVGHYLSASAQMWASTHNESLKE 60
Query: 191 KMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFAD 250
KM+AVV AL ECQ KMG+GYLSAFPSE FDRFEAL+ VWAPYYTIHKILAGLLDQYT
Sbjct: 61 KMSAVVCALGECQKKMGTGYLSAFPSELFDRFEALEEVWAPYYTIHKILAGLLDQYTLGG 120
Query: 251 NTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLL 310
N QALKM WMVEYFYNRVQNVI+ YS+ERHW SLNEETGGMND LY LY IT D KH +
Sbjct: 121 NAQALKMVTWMVEYFYNRVQNVISSYSIERHWLSLNEETGGMNDFLYNLYRITGDQKHFV 180
Query: 311 LAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVN 370
LAHLFDKPCFLGLLA+QADDISGFHANTHIP+V+G+QMRYE+TGDPLYK G FF+D VN
Sbjct: 181 LAHLFDKPCFLGLLAMQADDISGFHANTHIPIVVGAQMRYEITGDPLYKTIGAFFIDTVN 240
Query: 371 ASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYE 430
+SH YATGGTS EFWSDPKR+A+TL TEN ESCTTYNMLKVSR+LFRWTKE+ YADYYE
Sbjct: 241 SSHSYATGGTSVDEFWSDPKRMATTLQTENAESCTTYNMLKVSRNLFRWTKEVAYADYYE 300
Query: 431 RALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
RALTNG+LSIQRGT+PGVM+YMLPLG G+SKA+SYHGWGT+F SFWCCYGTG+
Sbjct: 301 RALTNGILSIQRGTDPGVMLYMLPLGHGNSKARSYHGWGTKFHSFWCCYGTGI 353
>gi|242096362|ref|XP_002438671.1| hypothetical protein SORBIDRAFT_10g024070 [Sorghum bicolor]
gi|241916894|gb|EER90038.1| hypothetical protein SORBIDRAFT_10g024070 [Sorghum bicolor]
Length = 887
Score = 613 bits (1580), Expect = e-173, Method: Compositional matrix adjust.
Identities = 324/495 (65%), Positives = 365/495 (73%), Gaps = 30/495 (6%)
Query: 19 LCKECTNSFPQLASHTFRYELLSSKNET-WKKEVYSHYHLTPTDDSAWSNLLP------- 70
+ KECTN +L+SHT R L +S W+ H HL PTD++AW +L+P
Sbjct: 27 MAKECTNIPTELSSHTVRARLQASPGAAEWRWRELFHEHLNPTDEAAWMDLMPPPPRGGL 86
Query: 71 ---------RKMLSETDEFSWTMIYRKMKNPD----------GFKLAGDFLKEVSLHDVK 111
E +E W M+YR +K G AG FL+EVSLHDV+
Sbjct: 87 QTAAAADAGHHHHQEEEELDWVMLYRSLKGQQVVVGGAVPASGAAAAGPFLEEVSLHDVR 146
Query: 112 LDPS---SLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHF 168
LDP + + RAQ+TNLEYLL+LDVD LVWSF+ A P G+ Y GWE P ELRGHF
Sbjct: 147 LDPDGDDAAYGRAQRTNLEYLLLLDVDRLVWSFRSQAALPAPGEPYGGWEKPDSELRGHF 206
Query: 169 VGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPV 228
VGHYLSA+A MWASTHN TL KM+AVV AL ECQ G+GYLSAFP+E FDRFEA+KPV
Sbjct: 207 VGHYLSATAKMWASTHNGTLAGKMSAVVDALDECQRAAGTGYLSAFPAEFFDRFEAIKPV 266
Query: 229 WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEE 288
WAPYYTIHKI+ GLLDQ+ A N +AL M M +YF RV+NVI +YS+ERHW SLNEE
Sbjct: 267 WAPYYTIHKIMQGLLDQHVVAGNGKALGMVVAMADYFAGRVRNVIRRYSIERHWTSLNEE 326
Query: 289 TGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQM 348
TGGMNDVLY+LYTIT D +HL+LAHLFDKPCFLGLLAVQAD +S FHANTHIPVVIG QM
Sbjct: 327 TGGMNDVLYQLYTITHDQRHLVLAHLFDKPCFLGLLAVQADSLSNFHANTHIPVVIGGQM 386
Query: 349 RYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYN 408
RYEVTGDPLYK TFFMD VN+SH YATGGTS EFWSDPKRLA L TE EESCTTYN
Sbjct: 387 RYEVTGDPLYKEIATFFMDTVNSSHAYATGGTSVSEFWSDPKRLAEALTTETEESCTTYN 446
Query: 409 MLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGW 468
MLKVSRHLFRWTKE+ YADYYERAL NGVLSIQRG +PGVMIYMLP G G SKAKSYHGW
Sbjct: 447 MLKVSRHLFRWTKEVAYADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAKSYHGW 506
Query: 469 GTRFSSFWCCYGTGL 483
GT+ SFWCCYGTG+
Sbjct: 507 GTQNESFWCCYGTGI 521
>gi|302818405|ref|XP_002990876.1| hypothetical protein SELMODRAFT_20509 [Selaginella moellendorffii]
gi|300141437|gb|EFJ08149.1| hypothetical protein SELMODRAFT_20509 [Selaginella moellendorffii]
Length = 755
Score = 608 bits (1567), Expect = e-171, Method: Compositional matrix adjust.
Identities = 286/383 (74%), Positives = 323/383 (84%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
FL+ VSLHDV+L P S AQQTNL+YLLMLDVD+LV+SF+ TAG +G AY GWE P
Sbjct: 1 FLEAVSLHDVRLLPDSWQAIAQQTNLDYLLMLDVDNLVYSFRTTAGLNASGSAYGGWELP 60
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD 220
T ELRGHFVGHYLSASA WASTHN+T+ E M AVV+AL+ECQ K+G+GYLSAFP+ FD
Sbjct: 61 TSELRGHFVGHYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSAFPTSLFD 120
Query: 221 RFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVER 280
RFEAL+ VWAPYYTIHKI+AGLLDQYT+A N+ A +M M +YF +RV+ VI KYS+ER
Sbjct: 121 RFEALESVWAPYYTIHKIMAGLLDQYTYAANSLAFEMLLGMTDYFGSRVERVIEKYSIER 180
Query: 281 HWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHI 340
HW SLNEETGGMNDVLYR+Y IT D KHL LAHLFDKPCFLGLLAV+AD ISGFHANTHI
Sbjct: 181 HWQSLNEETGGMNDVLYRVYQITGDAKHLKLAHLFDKPCFLGLLAVRADSISGFHANTHI 240
Query: 341 PVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN 400
P+VIG+Q+RYEV GD LYK +FM IV++SH YATGGTSAGEFWSDP RL TLGTEN
Sbjct: 241 PIVIGAQLRYEVVGDKLYKDLSEYFMKIVSSSHTYATGGTSAGEFWSDPSRLGDTLGTEN 300
Query: 401 EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
EESCTTYNMLKV+R+LFRWTK+M YAD+YERAL NGVL+IQRG EPGVMIYMLPL G S
Sbjct: 301 EESCTTYNMLKVARNLFRWTKQMHYADFYERALINGVLTIQRGKEPGVMIYMLPLAPGSS 360
Query: 461 KAKSYHGWGTRFSSFWCCYGTGL 483
KA SYHGWGT FSSFWCCYGT +
Sbjct: 361 KATSYHGWGTPFSSFWCCYGTAI 383
>gi|302785087|ref|XP_002974315.1| hypothetical protein SELMODRAFT_30650 [Selaginella moellendorffii]
gi|300157913|gb|EFJ24537.1| hypothetical protein SELMODRAFT_30650 [Selaginella moellendorffii]
Length = 755
Score = 605 bits (1559), Expect = e-170, Method: Compositional matrix adjust.
Identities = 284/383 (74%), Positives = 323/383 (84%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
FL VSLHDV+L P S AQQTNL+YLLMLDVD+LV+SF+ TAG +G AY GWE P
Sbjct: 1 FLGAVSLHDVRLLPDSWQAIAQQTNLDYLLMLDVDNLVYSFRTTAGLNASGSAYGGWELP 60
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD 220
T ELRGHFVGHYLSASA WASTHN+T+ E M AVV+AL+ECQ K+G+GYLSAFP+ FD
Sbjct: 61 TSELRGHFVGHYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSAFPTSLFD 120
Query: 221 RFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVER 280
RFEAL+ VWAPYYTIHKI+AGLLDQYT+A N+ A +M M +YF +RV+ VI KYS+ER
Sbjct: 121 RFEALESVWAPYYTIHKIMAGLLDQYTYAANSLAFEMLLGMTDYFGSRVEMVIEKYSIER 180
Query: 281 HWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHI 340
HW SLNEETGGMNDVLYR+Y IT D KHL LAHLFDKPCFLGLLAV+AD ISGFHANTHI
Sbjct: 181 HWQSLNEETGGMNDVLYRIYQITGDAKHLKLAHLFDKPCFLGLLAVRADSISGFHANTHI 240
Query: 341 PVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN 400
P+VIG+Q+RYEV GD LYK +FM IV++SH YATGGTS+GEFWS+P RL TLGTEN
Sbjct: 241 PIVIGAQLRYEVVGDKLYKDLSEYFMKIVSSSHTYATGGTSSGEFWSNPNRLGDTLGTEN 300
Query: 401 EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
EESCTTYNMLKV+R+LFRWTK+M YAD+YERAL NGVL+IQRG EPGVMIYMLPL G S
Sbjct: 301 EESCTTYNMLKVARNLFRWTKQMHYADFYERALINGVLTIQRGKEPGVMIYMLPLAPGSS 360
Query: 461 KAKSYHGWGTRFSSFWCCYGTGL 483
KAKSYHGWGT F+SFWCCYGT +
Sbjct: 361 KAKSYHGWGTPFTSFWCCYGTAI 383
>gi|218198543|gb|EEC80970.1| hypothetical protein OsI_23693 [Oryza sativa Indica Group]
Length = 905
Score = 595 bits (1534), Expect = e-167, Method: Compositional matrix adjust.
Identities = 314/506 (62%), Positives = 359/506 (70%), Gaps = 52/506 (10%)
Query: 21 KECTNSFPQLASHTFRYELLSSKNETWK-KEVYSHY-HLTPTDDSAWSNLLPRKMLSETD 78
KECTN QL+SHT R L SS W+ +E Y H HL PTD++AW +L+P S +
Sbjct: 23 KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMPLAAASAS- 81
Query: 79 EFSWTMIYRKMKNPDGFKLAGD-----------FLKEVSLHDVKLD----PSSLHWRAQQ 123
EF W M+YR +K G +AGD FL+EVSLHDV+LD ++ RAQQ
Sbjct: 82 EFDWAMLYRSLK---GAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQQ 138
Query: 124 TNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWAST 183
TNLEYLL+L+VD LVWSF+ AG P GK Y GWE P ELRGHFVGHYLSA+A MWAST
Sbjct: 139 TNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVGHYLSAAAKMWAST 198
Query: 184 HNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHK------ 237
HN TL KM AVV AL +CQ G+GYLSAFP+E FDRFEA++PVWAPYYTIHK
Sbjct: 199 HNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKARNATQ 258
Query: 238 --------------------ILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYS 277
I+ GLLDQ+T A N +AL M M +YF RV++VI +Y+
Sbjct: 259 SICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGKALGMVVAMADYFAGRVRSVIQRYT 318
Query: 278 VERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHAN 337
+ERHW SLNEETGGMNDVLY+L T + F + CFLGLLAVQAD +SGFHAN
Sbjct: 319 IERHWTSLNEETGGMNDVLYQLKT-----EAFGAGSSFRQACFLGLLAVQADSLSGFHAN 373
Query: 338 THIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG 397
THIPVVIG QMRYEVTGDPLYK TFFMDIVN+SH YATGGTS EFWS+PK LA L
Sbjct: 374 THIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVSEFWSNPKHLAEALT 433
Query: 398 TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 457
TE EESCTTYNMLKVSRHLFRWTKE+ YADYYERAL NGVLSIQRG +PGVMIYMLP G
Sbjct: 434 TETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMIYMLPQGP 493
Query: 458 GDSKAKSYHGWGTRFSSFWCCYGTGL 483
G SKA SYHGWGT+++SFWCCYGTG+
Sbjct: 494 GRSKAVSYHGWGTQYNSFWCCYGTGI 519
>gi|449531121|ref|XP_004172536.1| PREDICTED: uncharacterized LOC101224273, partial [Cucumis sativus]
Length = 366
Score = 571 bits (1472), Expect = e-160, Method: Compositional matrix adjust.
Identities = 268/345 (77%), Positives = 300/345 (86%)
Query: 15 CWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHLTPTDDSAWSNLLPRKML 74
C KECTN+ QL SHTFRYELLSS N TWKKE++SHYHLTPTDD AWSNLLPRKML
Sbjct: 22 CNCDSLKECTNTPTQLGSHTFRYELLSSGNVTWKKELFSHYHLTPTDDFAWSNLLPRKML 81
Query: 75 SETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDV 134
E +E++W M+YR+MKN DG ++ G LKE+SLHDV+LDP+SLH AQ TNL+YLLMLDV
Sbjct: 82 KEENEYNWEMMYRQMKNKDGLRIPGGMLKEISLHDVRLDPNSLHGTAQTTNLKYLLMLDV 141
Query: 135 DSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTA 194
D L+WSF+KTAG PT G+ Y GWE CELRGHFVGHYLSASA MWAST N LKEKM+A
Sbjct: 142 DRLLWSFRKTAGLPTPGEPYVGWEKSDCELRGHFVGHYLSASAQMWASTGNSVLKEKMSA 201
Query: 195 VVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA 254
+VS L+ CQ+KMG+GYLSAFPSE+FDRFEA++PVWAPYYTIHKILAGLLDQYTFA N+QA
Sbjct: 202 LVSGLATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAGLLDQYTFAGNSQA 261
Query: 255 LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHL 314
LKM WMVEYFYNRVQNVI KY+VERH+ SLNEETGGMNDVLYRLY IT + KHLLLAHL
Sbjct: 262 LKMVTWMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYRITGNTKHLLLAHL 321
Query: 315 FDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYK 359
FDKPCFLGLLAVQA+DISGFH NTHIP+V+GSQMRYEVTGDPLYK
Sbjct: 322 FDKPCFLGLLAVQAEDISGFHVNTHIPIVVGSQMRYEVTGDPLYK 366
>gi|168021740|ref|XP_001763399.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685534|gb|EDQ71929.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 757
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 255/383 (66%), Positives = 303/383 (79%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
LK+VSLH V+L S + AQ TNL+YLL LDVD+++WSF+K + G+ Y GWE P
Sbjct: 1 LLKDVSLHKVRLGADSPQFMAQNTNLQYLLELDVDNMMWSFRKVSNLNAPGQPYGGWESP 60
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD 220
ELRGHFVGHYLSASA MWASTHN L EKM A++ AL ECQ +G+GYLSAFPSE FD
Sbjct: 61 ASELRGHFVGHYLSASALMWASTHNEVLHEKMNALLGALKECQMSIGTGYLSAFPSEFFD 120
Query: 221 RFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVER 280
RFEA++ VWAPYYTIHKI+AGLLDQY A + AL M M YFY RV+ VI K+++ER
Sbjct: 121 RFEAIEYVWAPYYTIHKIMAGLLDQYLLAGSKDALDMVVEMANYFYKRVKTVIEKFTIER 180
Query: 281 HWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHI 340
HW SLNEETGGMNDVLYRLYT+T D KHL LAHLFDKPCFLG LA+QAD +SGFH+NTHI
Sbjct: 181 HWRSLNEETGGMNDVLYRLYTVTGDNKHLELAHLFDKPCFLGPLALQADHLSGFHSNTHI 240
Query: 341 PVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN 400
P+V+G+QMRYEVT D +Y+ +FM IVN+SH YATGGTS EFW+D R TL TEN
Sbjct: 241 PIVVGAQMRYEVTSDLIYRSIAEYFMGIVNSSHSYATGGTSVSEFWTDSMRQGDTLHTEN 300
Query: 401 EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
+E+CTTYNMLK++R LFRWTK++ Y DYY+RAL NG+L QRG +PGVMIYMLP+G G S
Sbjct: 301 QETCTTYNMLKIARTLFRWTKDIKYMDYYDRALINGILGTQRGQQPGVMIYMLPMGPGVS 360
Query: 461 KAKSYHGWGTRFSSFWCCYGTGL 483
K +SYHGWG +F+SFWCCYGT +
Sbjct: 361 KGRSYHGWGNKFNSFWCCYGTAI 383
>gi|302788790|ref|XP_002976164.1| hypothetical protein SELMODRAFT_104205 [Selaginella moellendorffii]
gi|300156440|gb|EFJ23069.1| hypothetical protein SELMODRAFT_104205 [Selaginella moellendorffii]
Length = 797
Score = 542 bits (1396), Expect = e-151, Method: Compositional matrix adjust.
Identities = 253/383 (66%), Positives = 298/383 (77%), Gaps = 1/383 (0%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
L+ SLH V++D SL + QQTNLEYLLMLDVDSL +SF+ +G PT G Y GWE P
Sbjct: 22 LLEGSSLHKVRIDADSLQGKGQQTNLEYLLMLDVDSLAYSFRNNSGLPTKGVPYGGWEAP 81
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD 220
ELRGHFVGHYLSA+A MWASTHN LK +M +V L ECQ K+G+GYLSAFP F
Sbjct: 82 DQELRGHFVGHYLSATAKMWASTHNEELKRRMDHLVDILDECQQKIGTGYLSAFPLNLFT 141
Query: 221 RFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVER 280
RFE +PVWAPYYTIHKI+AGLLDQYT A N +AL+M WM +YF RV+N I KYS++
Sbjct: 142 RFETYRPVWAPYYTIHKIMAGLLDQYTEAGNMKALRMVIWMAQYFSKRVENYIEKYSIQA 201
Query: 281 HWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHI 340
H+ +LNEETGGMNDVLY LY IT DP+HL LAHLFDKPCFLG LA+Q D +SGFHANTHI
Sbjct: 202 HFQALNEETGGMNDVLYDLYKITGDPQHLKLAHLFDKPCFLGPLALQQDTLSGFHANTHI 261
Query: 341 PVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN 400
P++IG+Q RYE+TGD + K TFFMD VN+SH + TGGTS EFW DP R+AS+LG +
Sbjct: 262 PILIGAQKRYELTGDQVSKELVTFFMDAVNSSHRFVTGGTSDNEFWKDPNRMASSLGKDV 321
Query: 401 EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
EESC++YNMLK++R+LFRWTKE Y DYYER + NGVL+IQRG EPGVMIYMLP+G G +
Sbjct: 322 EESCSSYNMLKIARNLFRWTKEASYMDYYERLILNGVLTIQRG-EPGVMIYMLPMGPGMA 380
Query: 461 KAKSYHGWGTRFSSFWCCYGTGL 483
K S GWG F SFWCCYGTG+
Sbjct: 381 KTSSTMGWGDPFDSFWCCYGTGI 403
>gi|302769588|ref|XP_002968213.1| hypothetical protein SELMODRAFT_89765 [Selaginella moellendorffii]
gi|300163857|gb|EFJ30467.1| hypothetical protein SELMODRAFT_89765 [Selaginella moellendorffii]
Length = 797
Score = 540 bits (1392), Expect = e-151, Method: Compositional matrix adjust.
Identities = 252/383 (65%), Positives = 298/383 (77%), Gaps = 1/383 (0%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
L+ SLH V++D SL + QQTNLEYLLMLDVDSL +SF+ +G PT G Y GWE P
Sbjct: 22 LLEGSSLHKVRIDADSLQGKGQQTNLEYLLMLDVDSLAYSFRNNSGLPTKGVPYGGWEAP 81
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD 220
ELRGHFVGHYLSA+A MWASTHN LK +M +V L ECQ K+G+GYLSAFP F
Sbjct: 82 DQELRGHFVGHYLSATAKMWASTHNEELKRRMDHLVDILDECQQKIGTGYLSAFPLNLFT 141
Query: 221 RFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVER 280
RFE +PVWAPYYTIHKI+AGLLDQYT A N +AL+M WM +YF RV+N I KYS++
Sbjct: 142 RFETYRPVWAPYYTIHKIMAGLLDQYTEAGNMKALRMVIWMAQYFSKRVENYIEKYSIQA 201
Query: 281 HWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHI 340
H+ +LNEETGGMNDVLY LY IT DP+HL LAHLFDKPCFLG LA+Q D +SGFHANTHI
Sbjct: 202 HFQALNEETGGMNDVLYDLYKITGDPQHLKLAHLFDKPCFLGPLALQQDTLSGFHANTHI 261
Query: 341 PVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN 400
P++IG+Q RYE+TGD + K TFFMD VN+SH + TGGTS EFW DP R+AS+LG +
Sbjct: 262 PILIGAQKRYELTGDQVSKELVTFFMDAVNSSHRFVTGGTSDNEFWKDPNRMASSLGKDV 321
Query: 401 EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
EESC++YNMLK++R+LFRWTK+ Y DYYER + NGVL+IQRG EPGVMIYMLP+G G +
Sbjct: 322 EESCSSYNMLKIARNLFRWTKDASYMDYYERLILNGVLTIQRG-EPGVMIYMLPMGPGMA 380
Query: 461 KAKSYHGWGTRFSSFWCCYGTGL 483
K S GWG F SFWCCYGTG+
Sbjct: 381 KTSSTMGWGDPFDSFWCCYGTGI 403
>gi|125556053|gb|EAZ01659.1| hypothetical protein OsI_23694 [Oryza sativa Indica Group]
Length = 898
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 249/454 (54%), Positives = 316/454 (69%), Gaps = 16/454 (3%)
Query: 44 NETWKKEVYSHYHLTPTDDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGDFLK 103
N+T + HL +++ W LLPR+ DE W +YR + G + AG FL
Sbjct: 44 NDTQGRHSDGLPHLNQAEEATWMGLLPRRA-GPRDELDWLALYRSITRGGGGEPAG-FLS 101
Query: 104 EVSLHDVKLDP--SSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
SLHDV++DP ++++W+ QQTNLEYLL LD D L W+F++ A P G+ Y GWE P
Sbjct: 102 PASLHDVRVDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPIVGEPYGGWEAPD 161
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
+LRGHF GHYLSA+AHMWASTHN L+EKMT VV L CQ KM +GYLSA+P FD
Sbjct: 162 GQLRGHFTGHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESMFDA 221
Query: 222 FEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERH 281
++ L W+PYYTIHKI+ GLLDQYT A N + L++ WM +YF RV+ +I +YS++RH
Sbjct: 222 YDELAEAWSPYYTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSIQRH 281
Query: 282 WNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIP 341
W ++NEETGG NDV+Y+LY IT++ KHL +AHLFDKPCFLG L + DDISG H NTH+P
Sbjct: 282 WEAINEETGGFNDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNTHVP 341
Query: 342 VVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG-TEN 400
V++G+Q RYEV GD LYK TFF D+VN+SH +ATGGTS E W DPKRL + + N
Sbjct: 342 VIVGAQKRYEVVGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWHDPKRLVDEIKISSN 401
Query: 401 EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
EE+C TYN+LKVSR+LFRWTKE Y D+YER L NG++ QRG EPGVMIY LP+G G S
Sbjct: 402 EETCATYNLLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGPGRS 461
Query: 461 KA-----------KSYHGWGTRFSSFWCCYGTGL 483
K+ K+ GWG ++FWCCYGTG+
Sbjct: 462 KSISGMPTSGLPPKNPGGWGNANATFWCCYGTGI 495
>gi|125597849|gb|EAZ37629.1| hypothetical protein OsJ_21963 [Oryza sativa Japonica Group]
Length = 902
Score = 530 bits (1365), Expect = e-148, Method: Compositional matrix adjust.
Identities = 249/457 (54%), Positives = 317/457 (69%), Gaps = 19/457 (4%)
Query: 44 NETWKKEVYSHYHLTPTDDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGD--- 100
N+T + HL +++ W LLPR+ DE W +YR + G + G+
Sbjct: 45 NDTQGRHSDGLPHLNQAEEATWMGLLPRRA-GPRDELDWLALYRSITR-GGGDVGGEPAG 102
Query: 101 FLKEVSLHDVKLDP--SSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWE 158
FL SLHDV++DP ++++W+ QQTNLEYLL LD D L W+F++ A PT G+ Y GWE
Sbjct: 103 FLSPASLHDVRVDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWE 162
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
P +LRGHF GHYLSA+AHMWASTHN L+EKMT VV L CQ KM +GYLSA+P
Sbjct: 163 APDGQLRGHFTGHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESM 222
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
FD ++ L W+PYYTIHKI+ GLLDQYT A N + L++ WM +YF RV+ +I +YS+
Sbjct: 223 FDAYDELAEAWSPYYTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSI 282
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
+RHW ++NEETGG NDV+Y+LY IT++ KHL +AHLFDKPCFLG L + DDISG H NT
Sbjct: 283 QRHWEAINEETGGFNDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNT 342
Query: 339 HIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG- 397
H+PV++G+Q RYEV GD LYK TFF D+VN+SH +ATGGTS E W DPKRL +
Sbjct: 343 HVPVIVGAQKRYEVVGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWHDPKRLVDEIKI 402
Query: 398 TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 457
+ NEE+C TYN+LKVSR+LFRWTKE Y D+YER L NG++ QRG EPGVMIY LP+G
Sbjct: 403 SSNEETCATYNLLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGP 462
Query: 458 GDSKA-----------KSYHGWGTRFSSFWCCYGTGL 483
G SK+ K+ GWG ++FWCCYGTG+
Sbjct: 463 GRSKSISGMPTSGLPPKNPGGWGNANATFWCCYGTGI 499
>gi|51090918|dbj|BAD35523.1| unknown protein [Oryza sativa Japonica Group]
gi|51090952|dbj|BAD35555.1| unknown protein [Oryza sativa Japonica Group]
Length = 902
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 249/457 (54%), Positives = 317/457 (69%), Gaps = 19/457 (4%)
Query: 44 NETWKKEVYSHYHLTPTDDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGD--- 100
N+T + HL +++ W LLPR+ DE W +YR + G + G+
Sbjct: 45 NDTQGRHSDGLPHLNQAEEATWMGLLPRRA-GPRDELDWLALYRSITR-GGGDVGGEPAG 102
Query: 101 FLKEVSLHDVKLDP--SSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWE 158
FL SLHDV++DP ++++W+ QQTNLEYLL LD D L W+F++ A PT G+ Y GWE
Sbjct: 103 FLSPASLHDVRVDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWE 162
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
P +LRGHF GHYLSA+AHMWASTHN L+EKMT VV L CQ KM +GYLSA+P
Sbjct: 163 APDGQLRGHFTGHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESM 222
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
FD ++ L W+PYYTIHKI+ GLLDQYT A N + L++ WM +YF RV+ +I +YS+
Sbjct: 223 FDAYDELAEAWSPYYTIHKIMQGLLDQYTLAGNPKGLEIVVWMTDYFSTRVKKLIQEYSI 282
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
+RHW ++NEETGG NDV+Y+LY IT++ KHL +AHLFDKPCFLG L + DDISG H NT
Sbjct: 283 QRHWEAINEETGGFNDVMYQLYAITKNQKHLTMAHLFDKPCFLGPLGLHDDDISGLHVNT 342
Query: 339 HIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG- 397
H+PV++G+Q RYEV GD LYK TFF D+VN+SH +ATGGTS E W DPKRL +
Sbjct: 343 HVPVIVGAQKRYEVVGDQLYKEIATFFFDVVNSSHTFATGGTSTMEHWHDPKRLVDEIKI 402
Query: 398 TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 457
+ NEE+C TYN+LKVSR+LFRWTKE Y D+YER L NG++ QRG EPGVMIY LP+G
Sbjct: 403 SSNEETCATYNLLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEPGVMIYFLPMGP 462
Query: 458 GDSKA-----------KSYHGWGTRFSSFWCCYGTGL 483
G SK+ K+ GWG ++FWCCYGTG+
Sbjct: 463 GRSKSISGMPTSGLPPKNPGGWGNANATFWCCYGTGI 499
>gi|293331149|ref|NP_001170532.1| uncharacterized protein LOC100384546 precursor [Zea mays]
gi|238005884|gb|ACR33977.1| unknown [Zea mays]
gi|413954824|gb|AFW87473.1| hypothetical protein ZEAMMB73_711416 [Zea mays]
Length = 902
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 256/464 (55%), Positives = 320/464 (68%), Gaps = 24/464 (5%)
Query: 44 NETWKKEVYSHYHLTPTDDSAWSNLLPRKMLSETD-EFSWTMIYRKMKNPDG-----FKL 97
N+T + HLTPT+++ W +LLPR++ EF W +YR + DG K
Sbjct: 45 NDTKGRHDDGLPHLTPTEEATWMSLLPRRLRGGGRAEFDWLALYRSLTRGDGPDGGAGKA 104
Query: 98 AG--DFLKEVSLHDVKLDP----SSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG 151
AG L SLHDV+L SS++WRAQQTNLEYLL LD D L W+F++ AG PT G
Sbjct: 105 AGPEGLLSPASLHDVRLHGDGSLSSMYWRAQQTNLEYLLYLDPDRLTWTFRQQAGLPTVG 164
Query: 152 KAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYL 211
Y GWE P +LRGHFVGHYLSASAH WA+THN TL+E+M VV L CQ KMG+GYL
Sbjct: 165 DPYGGWEAPDGQLRGHFVGHYLSASAHAWAATHNGTLRERMARVVDILHACQKKMGTGYL 224
Query: 212 SAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQN 271
SA+P FD +E L W+PYYT HKI+ GLLDQYT A N + L + M +YF NRV+N
Sbjct: 225 SAYPETMFDLYEQLDEAWSPYYTTHKIMQGLLDQYTLASNEKGLDVVLRMADYFSNRVKN 284
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
++ ++++RHW ++NEETGG NDV+Y+LYTIT+D KHL +AHLFDKPCFLG L + DDI
Sbjct: 285 LVQIHTIQRHWEAMNEETGGFNDVMYQLYTITRDQKHLTMAHLFDKPCFLGPLGLHKDDI 344
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
SG H NTH+PV++G+Q RYEV GD LYK T+ D+VN+SH +ATGGTS E W DPKR
Sbjct: 345 SGLHVNTHLPVLVGAQKRYEVVGDRLYKDISTYLFDVVNSSHTFATGGTSTMEHWHDPKR 404
Query: 392 LASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
L + + NEE+C TYN LKVSR+LFRWTKE YAD+YER L NG++ QRGT+PGVM+
Sbjct: 405 LVDEIKISSNEETCATYNFLKVSRNLFRWTKEAKYADHYERLLINGIMGNQRGTQPGVML 464
Query: 451 YMLPLGRGDSKA-----------KSYHGWGTRFSSFWCCYGTGL 483
Y LP+G G SK+ K+ GWG +FWCCYGTG+
Sbjct: 465 YFLPMGPGRSKSVSGQSPSGLPPKNPGGWGGPNDTFWCCYGTGI 508
>gi|242096364|ref|XP_002438672.1| hypothetical protein SORBIDRAFT_10g024080 [Sorghum bicolor]
gi|241916895|gb|EER90039.1| hypothetical protein SORBIDRAFT_10g024080 [Sorghum bicolor]
Length = 933
Score = 519 bits (1337), Expect = e-144, Method: Compositional matrix adjust.
Identities = 250/471 (53%), Positives = 313/471 (66%), Gaps = 43/471 (9%)
Query: 56 HLTPTDDSAW-------SNLLPRKMLSETDEFSWTMIYRKMK---NPD-----GFKLAGD 100
HLTPT+++ W EF W +YR + PD G G+
Sbjct: 55 HLTPTEEATWMALLPRRLRGGGGGGARARAEFDWLALYRSLTRGGGPDDDADAGKPGPGE 114
Query: 101 FLKEVSLHDVKL----------------DPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKT 144
L SLHDV+L ++++W+AQQTNLEYLL LD D L W+F++
Sbjct: 115 LLTPASLHDVRLHGDDDDDDRVLTGSSSSSAAMYWQAQQTNLEYLLYLDPDRLTWTFRRQ 174
Query: 145 AGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQN 204
AG PT G Y GWE P +LRGHF GHYLSASAHMWA+THN TL+E+MT VV L +CQ
Sbjct: 175 AGLPTVGDPYGGWEAPGGQLRGHFTGHYLSASAHMWAATHNSTLRERMTRVVDILYDCQK 234
Query: 205 KMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEY 264
KMG+GYL+A+P FD +E L W+PYYTIHKI+ GLLDQY A N + L + WM +Y
Sbjct: 235 KMGTGYLAAYPETMFDLYEQLDEAWSPYYTIHKIMQGLLDQYMLASNKKGLDVVVWMTDY 294
Query: 265 FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
F NRV+N+I KY+++RHW ++NEETGG NDV+Y+LYTIT++ KHL +AHLFDKPCFLG L
Sbjct: 295 FSNRVKNLIQKYTIQRHWEAMNEETGGFNDVMYQLYTITKNQKHLTMAHLFDKPCFLGPL 354
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+ DDISG H NTH+PV+IG+Q RYEV GD LYK T+ D+VN+SH +ATGGTS E
Sbjct: 355 GLHKDDISGLHVNTHLPVIIGTQKRYEVVGDHLYKDISTYLFDVVNSSHTFATGGTSTME 414
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
W DPKRL + + NEE+C TYN LKVSR+LFRWTKE YAD+YER L NG++ QRG
Sbjct: 415 HWHDPKRLVDEIKISSNEETCATYNFLKVSRNLFRWTKEAKYADHYERLLINGIMGNQRG 474
Query: 444 TEPGVMIYMLPLGRGDSKA-----------KSYHGWGTRFSSFWCCYGTGL 483
T+PGVM+Y LP+G G SK+ K+ GWG +FWCCYGTG+
Sbjct: 475 TQPGVMLYFLPMGPGRSKSVSGLSPSGLPPKNPGGWGGPNDTFWCCYGTGI 525
>gi|326520888|dbj|BAJ92807.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 683
Score = 499 bits (1286), Expect = e-139, Method: Compositional matrix adjust.
Identities = 232/308 (75%), Positives = 257/308 (83%), Gaps = 3/308 (0%)
Query: 179 MWASTHNVTLKEKMTAVVSALSECQN---KMGSGYLSAFPSEQFDRFEALKPVWAPYYTI 235
MWASTHN TL KM+AVV AL CQ G+GYLSAFP+E FDRFEA+KPVWAPYYTI
Sbjct: 1 MWASTHNGTLAGKMSAVVDALHACQQAPANGGAGYLSAFPAEFFDRFEAIKPVWAPYYTI 60
Query: 236 HKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDV 295
HKI+ GLLDQYT A N +AL M M YF RV++VI ++S+ERHW SLNEETGGMNDV
Sbjct: 61 HKIMQGLLDQYTVAGNGKALAMVVAMAGYFGERVRSVIQRHSIERHWTSLNEETGGMNDV 120
Query: 296 LYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGD 355
LY+LY IT D +HL+LAHLFDKPCFLGLLAVQAD +S FHANTHIP+V+G QMRYEVTGD
Sbjct: 121 LYQLYAITNDQRHLVLAHLFDKPCFLGLLAVQADSLSDFHANTHIPIVVGGQMRYEVTGD 180
Query: 356 PLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRH 415
PLYK TFFM++VN+SH YATGGTS EFW DPKRLA TL TENEESCTTYNMLKVSRH
Sbjct: 181 PLYKEIATFFMNVVNSSHSYATGGTSVSEFWFDPKRLAETLTTENEESCTTYNMLKVSRH 240
Query: 416 LFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSF 475
LFRWTKE+ YADYYERAL NGV SIQRG +PGVMIYMLP G G SKA SYHGWGT++ SF
Sbjct: 241 LFRWTKEIAYADYYERALINGVQSIQRGRDPGVMIYMLPQGPGRSKALSYHGWGTQYDSF 300
Query: 476 WCCYGTGL 483
WCCYGTG+
Sbjct: 301 WCCYGTGI 308
>gi|297606169|ref|NP_001058067.2| Os06g0612900 [Oryza sativa Japonica Group]
gi|255677223|dbj|BAF19981.2| Os06g0612900 [Oryza sativa Japonica Group]
Length = 717
Score = 476 bits (1225), Expect = e-131, Method: Compositional matrix adjust.
Identities = 234/331 (70%), Positives = 261/331 (78%), Gaps = 26/331 (7%)
Query: 179 MWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHK- 237
MWASTHN TL KM AVV AL +CQ G+GYLSAFP+E FDRFEA++PVWAPYYTIHK
Sbjct: 1 MWASTHNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKA 60
Query: 238 -------------------------ILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNV 272
I+ GLLDQ+T A N +AL M M +YF RV++V
Sbjct: 61 RNATQSICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGKALGMVVAMADYFAGRVRSV 120
Query: 273 ITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDIS 332
I +Y++ERHW SLNEETGGMNDVLY+LYTIT+D +HL+LAHLFDKPCFLGLLAVQAD +S
Sbjct: 121 IQRYTIERHWTSLNEETGGMNDVLYQLYTITKDQRHLVLAHLFDKPCFLGLLAVQADSLS 180
Query: 333 GFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRL 392
GFHANTHIPVVIG QMRYEVTGDPLYK TFFMDIVN+SH YATGGTS EFWS+PK L
Sbjct: 181 GFHANTHIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVSEFWSNPKHL 240
Query: 393 ASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYM 452
A L TE EESCTTYNMLKVSRHLFRWTKE+ YADYYERAL NGVLSIQRG +PGVMIYM
Sbjct: 241 AEALTTETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMIYM 300
Query: 453 LPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
LP G G SKA SYHGWGT+++SFWCCYGTG+
Sbjct: 301 LPQGPGRSKAVSYHGWGTQYNSFWCCYGTGI 331
>gi|125556048|gb|EAZ01654.1| hypothetical protein OsI_23690 [Oryza sativa Indica Group]
Length = 466
Score = 474 bits (1219), Expect = e-131, Method: Compositional matrix adjust.
Identities = 234/331 (70%), Positives = 261/331 (78%), Gaps = 26/331 (7%)
Query: 179 MWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHK- 237
MWASTHN TL KM AVV AL +CQ G+GYLSAFP+E FDRFEA++PVWAPYYTIHK
Sbjct: 1 MWASTHNGTLAGKMAAVVDALHDCQAAAGTGYLSAFPAEFFDRFEAIRPVWAPYYTIHKA 60
Query: 238 -------------------------ILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNV 272
I+ GLLDQ+T A N +AL M M +YF RV++V
Sbjct: 61 RNATQSICISTMAMNLICSCKCLNEIMQGLLDQHTVAGNGRALGMVVAMADYFAGRVRSV 120
Query: 273 ITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDIS 332
I +Y++ERHW SLNEETGGMNDVLY+LYTIT+D +HL+LAHLFDKPCFLGLLAVQAD +S
Sbjct: 121 IQRYTIERHWTSLNEETGGMNDVLYQLYTITKDQRHLVLAHLFDKPCFLGLLAVQADSLS 180
Query: 333 GFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRL 392
GFHANTHIPVVIG QMRYEVTGDPLYK TFFMDIVN+SH YATGGTS EFWS+PK L
Sbjct: 181 GFHANTHIPVVIGGQMRYEVTGDPLYKEIATFFMDIVNSSHSYATGGTSVSEFWSNPKHL 240
Query: 393 ASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYM 452
A L TE EESCTTYNMLKVSRHLFRWTKE+ YADYYERAL NGVLSIQRG +PGVMIYM
Sbjct: 241 AEALTTETEESCTTYNMLKVSRHLFRWTKEIAYADYYERALINGVLSIQRGRDPGVMIYM 300
Query: 453 LPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
LP G G SKA SYHGWGT+++SFWCCYGTG+
Sbjct: 301 LPQGPGRSKAVSYHGWGTQYNSFWCCYGTGI 331
>gi|357472933|ref|XP_003606751.1| hypothetical protein MTR_4g065220 [Medicago truncatula]
gi|355507806|gb|AES88948.1| hypothetical protein MTR_4g065220 [Medicago truncatula]
Length = 593
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 202/300 (67%), Positives = 236/300 (78%), Gaps = 8/300 (2%)
Query: 192 MTAVVSALSECQNKMGSGYLSAFPSEQF-DRFEALKPVWAPYYTIHKIL------AGLLD 244
M+A+VS LS CQ K +G + F + L+ WAPYYTIHK+ LD
Sbjct: 1 MSALVSGLSACQEKNWNGISVCISNRVFLIELKNLEYAWAPYYTIHKLFDFDRSWLAFLD 60
Query: 245 QYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQ 304
QYT A N Q LKM WMV+YFYNRV NVI K++V RH+ SLNEE GGMND+LYRLY++T+
Sbjct: 61 QYTIAGNPQGLKMVTWMVDYFYNRVMNVIQKFTVNRHYQSLNEEAGGMNDLLYRLYSLTR 120
Query: 305 DPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTF 364
DPKHL LAHLFDKPCFLG+LAVQ +DI+ FHANTHIP+V+G+Q+RYE+TGD YK G +
Sbjct: 121 DPKHLELAHLFDKPCFLGVLAVQGNDIADFHANTHIPIVVGAQLRYELTGDLHYKDIGQY 180
Query: 365 FMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEM 423
FMDIVN+SH YATGGTS GEFW +PKR+A L + E EESC+TYNMLKVSRHLFRWTKE+
Sbjct: 181 FMDIVNSSHAYATGGTSVGEFWRNPKRIADNLKSAETEESCSTYNMLKVSRHLFRWTKEV 240
Query: 424 VYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
YADYYERALTNGVLSIQRGT+PGVMIYMLPLG G SKA++Y WGT F SFWCCYGTG+
Sbjct: 241 TYADYYERALTNGVLSIQRGTDPGVMIYMLPLGLGVSKAQTYWKWGTPFDSFWCCYGTGI 300
>gi|357472921|ref|XP_003606745.1| hypothetical protein MTR_4g065150 [Medicago truncatula]
gi|355507800|gb|AES88942.1| hypothetical protein MTR_4g065150 [Medicago truncatula]
Length = 617
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 181/228 (79%), Positives = 203/228 (89%), Gaps = 1/228 (0%)
Query: 257 MTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFD 316
M WMV+YFY+RV NVI+KY+V RH+ SLNEETGGMNDVLY+LY++T D KHLLLAHLFD
Sbjct: 1 MVTWMVDYFYDRVVNVISKYTVNRHYQSLNEETGGMNDVLYKLYSVTGDSKHLLLAHLFD 60
Query: 317 KPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYA 376
KPCFLGLLAVQA+DI+ FHANTHIP+V+GSQMRYEVTGDPLY+ G+FFMDIVN+SH YA
Sbjct: 61 KPCFLGLLAVQANDIADFHANTHIPIVVGSQMRYEVTGDPLYREIGSFFMDIVNSSHSYA 120
Query: 377 TGGTSAGEFWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTN 435
TGGTS EFWS+PKR+A LG TENEESCTTYNMLKVSRHLFRWTKE+ YADYYERALTN
Sbjct: 121 TGGTSVREFWSNPKRIADNLGTTENEESCTTYNMLKVSRHLFRWTKEVTYADYYERALTN 180
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
GVL IQRGT+PGVMIYMLPLG G SKAK+ H WG F +FWCCYGTG+
Sbjct: 181 GVLGIQRGTDPGVMIYMLPLGIGVSKAKTGHSWGNPFDTFWCCYGTGI 228
>gi|255544804|ref|XP_002513463.1| conserved hypothetical protein [Ricinus communis]
gi|223547371|gb|EEF48866.1| conserved hypothetical protein [Ricinus communis]
Length = 759
Score = 363 bits (933), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 177/248 (71%), Positives = 190/248 (76%), Gaps = 32/248 (12%)
Query: 236 HKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDV 295
H +LAGLLDQY FADN QALKM WMVEYFYNRVQNVITKYSVERH+ SLNEETGGMNDV
Sbjct: 169 HFVLAGLLDQYIFADNAQALKMVNWMVEYFYNRVQNVITKYSVERHFLSLNEETGGMNDV 228
Query: 296 LYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGD 355
LY+L++IT +PKHL+LAHLFDKPCFLGLLAVQ
Sbjct: 229 LYKLFSITGEPKHLVLAHLFDKPCFLGLLAVQE--------------------------- 261
Query: 356 PLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRH 415
GTFFMDIVN+SH YATGGTS EFWSDPKRLASTL + EESCTTYNMLKVSRH
Sbjct: 262 -----IGTFFMDIVNSSHTYATGGTSDYEFWSDPKRLASTLNDQTEESCTTYNMLKVSRH 316
Query: 416 LFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSF 475
LFRWTKEM YADYYERALTNGVL IQRGTEPGVMIY+LP G SKA++ H WGT SF
Sbjct: 317 LFRWTKEMAYADYYERALTNGVLGIQRGTEPGVMIYLLPQNPGGSKARTIHKWGTPDDSF 376
Query: 476 WCCYGTGL 483
WCCYGTG+
Sbjct: 377 WCCYGTGI 384
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 113/173 (65%), Positives = 136/173 (78%), Gaps = 6/173 (3%)
Query: 1 MKNFV-FKVLVLFLS---CWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYH 56
MK FV F++LVL + C + KECTN QL+SHTFRY LLSS NE+ K+E+++HYH
Sbjct: 1 MKGFVVFELLVLVAASVLCGFGMSKECTNIPTQLSSHTFRYALLSSNNESLKQEMFAHYH 60
Query: 57 LTPTDDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSS 116
LTPTDDS WS+LLPRKML E DEF W M+Y+K+K+P + +G+FLKEVSLH+V+LD S
Sbjct: 61 LTPTDDSVWSSLLPRKMLKEEDEFDWAMMYKKLKSP--LQSSGNFLKEVSLHNVRLDLGS 118
Query: 117 LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFV 169
HWRAQQTNLEYLLML++D LVWSF+KTAG PT G AY GWE P ELRGHFV
Sbjct: 119 FHWRAQQTNLEYLLMLNLDRLVWSFRKTAGLPTPGTAYGGWEAPNVELRGHFV 171
>gi|159491176|ref|XP_001703549.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280473|gb|EDP06231.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1485
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 185/417 (44%), Positives = 237/417 (56%), Gaps = 56/417 (13%)
Query: 120 RAQQTNLEYLL-MLDVDSLVWSFQKTAGSPTAGKAYEG-WEDPTCELRGHFVGHYLSASA 177
R ++ N +YLL MLD D L+W F+K AG PT G+ Y G WEDP CELRGHFVGHYLSA +
Sbjct: 557 RYERINSKYLLDMLDADRLLWVFRKNAGLPTPGEPYVGSWEDPNCELRGHFVGHYLSALS 616
Query: 178 HMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHK 237
WA T N K ++ +VS L + Q K+G+GYLSAFP+ FDR E+L+ VWAPYYTIHK
Sbjct: 617 LAWAGTGNSAFKTRLDLMVSELGKVQEKLGTGYLSAFPTSWFDRVESLQAVWAPYYTIHK 676
Query: 238 ILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVL 296
I+AGL+D + A + AL M MV+Y +NR Q VI+K +HW + E E GGMN++L
Sbjct: 677 IIAGLVDAHELAGHPSALTMATRMVDYHWNRTQAVISKKGA-KHWQKVLEFEYGGMNEIL 735
Query: 297 YRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP 356
YRLY IT H A LFDK FLG +A D + HANTH+ ++G YE TG+P
Sbjct: 736 YRLYLITGKDDHRDFASLFDKTVFLGHMAAHDDVLYDLHANTHLAQIVGFAAGYEATGNP 795
Query: 357 LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
+ F +IV HGYATGGTS E W + + E+CT YNMLK++R L
Sbjct: 796 KLRTAVNNFFEIVVQHHGYATGGTSVFERWWGRRGRGPRNALKTHETCTQYNMLKIARQL 855
Query: 417 FRWTKEMVYADYYERALTNGVLSIQR---------------------------------- 442
F WT ++ YAD+YERA+ NG+ + R
Sbjct: 856 FMWTGDVYYADHYERAMVNGMWGVARLPADELPENGAAGAGGVDKGGQPVSPYTRFHDDE 915
Query: 443 ------------------GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
PGV +Y+LP+G G+SK+ + H WG F SFWCCYGT
Sbjct: 916 WMDYISFSKPKPEWNASDAAGPGVYLYLLPMGHGNSKSDNLHHWGFPFHSFWCCYGT 972
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 74/140 (52%), Gaps = 22/140 (15%)
Query: 308 HLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMD 367
H+ A LF+KP F + D + HANTH+ V G Y+ ++
Sbjct: 2 HMEFAQLFNKPFFRKPMEAGNDMLMNLHANTHLAQVAGFAEEYDTVDKRVF--------- 52
Query: 368 IVNASHGYATGGTSAGEFWSDPKRLASTL-----GTENEESCTTYNMLKVSRHLFRWTKE 422
ATGG++ EFW P LA ++ G E +E+CT YN+LK++R LFRWT +
Sbjct: 53 --------ATGGSTDHEFWQAPDELADSVLTQKHGVETQETCTQYNILKIARSLFRWTGD 104
Query: 423 MVYADYYERALTNGVLSIQR 442
+ YAD+YERAL NG+L R
Sbjct: 105 VRYADFYERALVNGILGTAR 124
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/36 (66%), Positives = 27/36 (75%)
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
PGV IY+LPLG G SK+ + H WG F SFWCCYGT
Sbjct: 195 PGVFIYLLPLGTGQSKSDNIHHWGFPFHSFWCCYGT 230
>gi|384252025|gb|EIE25502.1| DUF1680-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 648
Score = 338 bits (866), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 170/397 (42%), Positives = 238/397 (59%), Gaps = 14/397 (3%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG-WE 158
D ++ L + L+ SL +A N +Y+L L+ D L+ +F+ AG P++ + + G WE
Sbjct: 20 DIIQPFPLDQITLERDSLFDKALALNTDYMLQLNADQLLHTFRLNAGLPSSAQPFTGSWE 79
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
DP+CE+RG F+GHYLSA + + T N ++ ++T ++ L + Q + GYLSAFP E
Sbjct: 80 DPSCEVRGQFMGHYLSACSMLVNHTGNGKIESRLTYIIDELRKVQIALSGGYLSAFPEEH 139
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
F R ++L+ VWAP+Y IHKI+AGLLD + F AL+M K E+F +V+
Sbjct: 140 FVRLQSLQTVWAPFYVIHKIMAGLLDAHNFLGYDVALEMVKDEAEHFTRYYNDVVATNGT 199
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
E L E GGMN+VL+ LY +T DP+H+ LA F KP F L D + G HANT
Sbjct: 200 EHWLRMLEVEFGGMNEVLFNLYDVTGDPEHIRLAEAFTKPKFFEPLLQNTDPLPGLHANT 259
Query: 339 HIPVVIGSQMRYE-VTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL- 396
H+ V G R+E + D Y FF IV H +ATGG + E+W P++LA ++
Sbjct: 260 HLAQVNGFAARFEKASHDGSYAAVTNFF-SIVTRGHSFATGGNNDHEYWGPPRQLADSIL 318
Query: 397 --GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR--------GTEP 446
TE EE+CT YNMLK++R+LFRWT V+ADYYERA+ NG+L QR + P
Sbjct: 319 LHATETEETCTQYNMLKIARYLFRWTGAPVFADYYERAILNGLLGTQRMPADYSPHTSRP 378
Query: 447 GVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
GV+IY+LP+G G +K S GWG SFWCCYG+ +
Sbjct: 379 GVVIYLLPMGSGQTKGGSTRGWGDPLHSFWCCYGSSV 415
>gi|302844990|ref|XP_002954034.1| hypothetical protein VOLCADRAFT_106211 [Volvox carteri f.
nagariensis]
gi|300260533|gb|EFJ44751.1| hypothetical protein VOLCADRAFT_106211 [Volvox carteri f.
nagariensis]
Length = 1160
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 174/361 (48%), Positives = 224/361 (62%), Gaps = 21/361 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLL-MLDVDSLVWSFQKTAGSPTAGKAY-EGWED 159
++ +L DV+L +S R ++ N +YLL MLD D L+WSF+KTAG PT G+ Y WED
Sbjct: 30 IEPFALSDVRLLDTSHQIRYERLNAKYLLEMLDPDRLLWSFRKTAGLPTPGQPYIASWED 89
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMG-SGYLSAFPSEQ 218
P CELRGHFVGHYLSA + +AST N+ ++ +VS L + Q +G GYLSAFPSE
Sbjct: 90 PGCELRGHFVGHYLSALSLAYASTGNIAFHTRLALMVSELGKVQQALGLGGYLSAFPSEF 149
Query: 219 FDRFEALKPVWAPYYTI-----------HKILAGLLDQYTFADNTQALKMTKWMVEYFYN 267
FDR EALKPVWAPYYTI HKI+AGL+D Y +AL M MV Y +N
Sbjct: 150 FDRVEALKPVWAPYYTIPIAPFPDTTQIHKIIAGLVDAYELGGQKEALAMASRMVAYHWN 209
Query: 268 RVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
R Q +I E HWN LN E GGMN++LYR++ IT+DP HL A LF+KP F+ +
Sbjct: 210 RTQALIASKGRE-HWNGVLNCEFGGMNEILYRMHRITKDPTHLEFARLFEKPFFMKPMVN 268
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
D + HANTH+ V G Y+ GD + F DIV H +ATGG++ EFW
Sbjct: 269 NFDILESLHANTHLAQVAGFAEAYDTVGDEAARNATRNFFDIVTTHHSFATGGSNDHEFW 328
Query: 387 SDPKRLASTL-----GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
P R+A ++ E +E+CT YN+LK++R LFRWT + YAD+YERAL NG+L
Sbjct: 329 QAPDRMADSVIKQKDAVETQETCTQYNILKIARSLFRWTGNVAYADFYERALLNGILGTA 388
Query: 442 R 442
R
Sbjct: 389 R 389
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 26/36 (72%)
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
PGV +Y+ PLG G SK+ + H WG + SFWCCYGT
Sbjct: 486 PGVFLYLTPLGTGQSKSDNIHHWGFPYHSFWCCYGT 521
>gi|33113961|gb|AAP94583.1| putative protein [Zea mays]
Length = 786
Score = 298 bits (763), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 145/215 (67%), Positives = 163/215 (75%), Gaps = 4/215 (1%)
Query: 157 WEDP----TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLS 212
W P +L GHFVGHYL A+A MWASTHN TL KM+ +V+AL +CQ KMG GYLS
Sbjct: 465 WRSPGRFLDVQLWGHFVGHYLGATAKMWASTHNDTLNAKMSYIVNALYDCQKKMGIGYLS 524
Query: 213 AFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNV 272
AFPSE F EA+ VWAPYYTIHKI+ GLLDQYT A N+ AL M MV YF +RV+NV
Sbjct: 525 AFPSEFFVWVEAITSVWAPYYTIHKIMQGLLDQYTVAGNSVALVMVVKMVNYFSDRVKNV 584
Query: 273 ITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDIS 332
I YS+E HW SLNE+TGGMNDV Y+LYTI D KHL LA LFDKPCFLGLLA Q D IS
Sbjct: 585 IQNYSIETHWESLNEKTGGMNDVFYQLYTIMNDTKHLTLAPLFDKPCFLGLLAGQDDSIS 644
Query: 333 GFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMD 367
GFH+NT IPV IG+QMRY+VTGDPLYK +FFMD
Sbjct: 645 GFHSNTRIPVAIGAQMRYKVTGDPLYKQIASFFMD 679
>gi|225872906|ref|YP_002754363.1| Tat pathway signal sequence domain-containing protein
[Acidobacterium capsulatum ATCC 51196]
gi|225794208|gb|ACO34298.1| Tat pathway signal sequence domain protein [Acidobacterium
capsulatum ATCC 51196]
Length = 644
Score = 284 bits (726), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 165/387 (42%), Positives = 217/387 (56%), Gaps = 25/387 (6%)
Query: 103 KEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC 162
K+ + V++ L A + N +YL ++ D L+ +F+ TAG PT+ + GWE P C
Sbjct: 56 KDFPMTQVRMRDGVLK-NALEINRQYLYLVPNDRLLHTFRLTAGLPTSAEPLGGWEAPDC 114
Query: 163 ELRGHFVG-HYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
ELRGHF G HYLSA A M+AST + +K K A+V+ L++CQ GYLSAFP+ FDR
Sbjct: 115 ELRGHFAGGHYLSACALMYASTGDEKIKAKGDALVAELAKCQQP--DGYLSAFPASFFDR 172
Query: 222 FEALKPVWAPYYTIHKILAGLLDQYTFADNTQAL----KMTKWMVEYFYNRVQNVITKYS 277
+ VWAP+YT HKI+AG LD Y N QAL +M W +EY TK
Sbjct: 173 LRHYQKVWAPFYTYHKIMAGHLDMYVHTGNQQALETCKRMADWAIEY---------TKPI 223
Query: 278 VERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHA 336
W L E GGMN+V + LY +T + K+ L F+ LA + D ++G HA
Sbjct: 224 PADQWQRMLLVEQGGMNEVSFNLYAVTGEKKYRDLGFRFEHKLIFDPLAKREDHLAGNHA 283
Query: 337 NTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL 396
NT+IP VIG+ YEV D Y FF V + H YATGGTS GEFW P LA L
Sbjct: 284 NTNIPKVIGAARGYEVADDKRYHTIAEFFWGAVTSQHAYATGGTSDGEFWHKPGTLAEHL 343
Query: 397 GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLG 456
G EE C +YNM+K+SRHL+ WT + DYYER + N + Q G+++Y + L
Sbjct: 344 GPAAEECCCSYNMMKLSRHLYGWTGDPRIFDYYERLMYNVRIGTQ--DPKGMLMYYVSLK 401
Query: 457 RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G K +GT F +FWCC GTG+
Sbjct: 402 PGYWKT-----FGTPFDAFWCCTGTGV 423
>gi|390957656|ref|YP_006421413.1| hypothetical protein Terro_1782 [Terriglobus roseus DSM 18391]
gi|390412574|gb|AFL88078.1| hypothetical protein Terro_1782 [Terriglobus roseus DSM 18391]
Length = 635
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 166/401 (41%), Positives = 215/401 (53%), Gaps = 22/401 (5%)
Query: 88 KMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGS 147
+ + PD L + V+L R+ N +YL L VD L+ SF+ TAG
Sbjct: 29 QARRPDAMLQIDGRLSPFPMSAVRLLDGEFK-RSADVNEKYLDSLQVDRLLHSFRLTAGI 87
Query: 148 PTAGKAYEGWEDPTCELRGHFVG-HYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
++ K Y GWE P ELRGHF G HYLSA A A N TL+EK A+V+ L+ CQ
Sbjct: 88 TSSAKPYGGWEIPNGELRGHFAGGHYLSAVAFASAGAGNTTLREKGNALVAGLAACQKAN 147
Query: 207 GSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALK----MTKWMV 262
G+GYLSA+P E F R K VWAP+YT HKI+AGL+D YT N ALK M W
Sbjct: 148 GNGYLSAYPPELFQRLALGKQVWAPFYTYHKIMAGLVDMYTQTGNEDALKVAEGMAGWSS 207
Query: 263 EYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
YF + S + L E GGMN+VL LY++T ++L A F++P FL
Sbjct: 208 AYFAD--------MSDAQRQGILRIEYGGMNEVLVNLYSLTGKERYLSQARKFEQPTFLD 259
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
LA D++ G HANT IP +IG+ YE TGD Y+ ++F+D V ++H YA G TS
Sbjct: 260 PLAAHRDELQGLHANTSIPKIIGAARMYEATGDRRYQEIASYFLDDVLSAHTYAIGNTSD 319
Query: 383 GEFWSDPK-RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E W P LA +L +N E C YN++K+ RHL WT + + D YER L N L Q
Sbjct: 320 DEHWRTPAGSLAGSLSLKNAECCVAYNLMKLERHLSAWTGDARWMDAYERTLFNARLGTQ 379
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTG 482
G+ Y PL G + +G+ SFWCC GTG
Sbjct: 380 DAA--GLKQYFFPLAAG-----YWRVYGSPEESFWCCTGTG 413
>gi|383316642|ref|YP_005377484.1| hypothetical protein [Frateuria aurantia DSM 6220]
gi|379043746|gb|AFC85802.1| hypothetical protein Fraau_1370 [Frateuria aurantia DSM 6220]
Length = 651
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 158/393 (40%), Positives = 214/393 (54%), Gaps = 24/393 (6%)
Query: 96 KLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYE 155
++A D L+ +L V L P A N YL L VD L +F + AG P+ +
Sbjct: 53 EMARDSLQAFALDQVTLSPGPFA-EAAAINARYLHQLPVDRLAHNFLRQAGLPSTAQPLG 111
Query: 156 GWEDPTCELRGHFVG-HYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAF 214
GWE P CELRGHF G H+LSA+A +WA+T + TLK++ +V+ L+ CQ GYLSAF
Sbjct: 112 GWESPECELRGHFCGGHWLSAAALVWATTADRTLKQRADELVAILARCQRS--DGYLSAF 169
Query: 215 PSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMT----KWMVEYFYNRVQ 270
P F+R + VWAP+YT+HKIL G LD Y A N QAL + W V + R
Sbjct: 170 PDSFFERLSHGQKVWAPFYTLHKILCGHLDMYMHAGNQQALDIATGLGDWTVHWLNGRSD 229
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+ + L E GGMND L LY IT + ++L AH FD+ L LA D+
Sbjct: 230 AQMNEI--------LRTEYGGMNDALCELYAITGNGRYLDAAHRFDQASLLDPLAAHRDE 281
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSD-P 389
+ G H+NT +P +IG+ RYE+TG+ Y+ F + ++ + YA GG+S EFW++ P
Sbjct: 282 LKGLHSNTQLPKIIGAARRYELTGEQRYRRMAEFGWETISGTRCYANGGSSNDEFWNNGP 341
Query: 390 KRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
L LG E C YN+LK++RH++ WT + DYYER L N L Q G+
Sbjct: 342 DDLHDQLGVAAAECCVAYNLLKLTRHVYGWTGDPRAFDYYERNLYNARLGTQ--DPAGMK 399
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTG 482
+Y PL G SY + + SFWCC GTG
Sbjct: 400 LYYYPLAPG-----SYKYFNSPLHSFWCCTGTG 427
>gi|307110572|gb|EFN58808.1| hypothetical protein CHLNCDRAFT_56904 [Chlorella variabilis]
Length = 937
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 141/321 (43%), Positives = 187/321 (58%), Gaps = 5/321 (1%)
Query: 127 EYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNV 186
+YLL L+ D L+++F+K AG PT G +Y GWE E+RG F+GHY+SA A T
Sbjct: 51 QYLLALEPDRLLFNFRKNAGLPTPGASYGGWEWSESEVRGQFIGHYMSAVAFAALHTGRT 110
Query: 187 TLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQY 246
++ +V L + Q+ G+GYLSAFP FDR EAL+PVWAPYY IHKI+AGLLDQ+
Sbjct: 111 EFYDRSKLMVHELKKVQDAFGNGYLSAFPESHFDRLEALQPVWAPYYVIHKIMAGLLDQH 170
Query: 247 TFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDP 306
A +ALKM + M YF R Q V + + L E GGMN+VLY L+ +T D
Sbjct: 171 QLAGTDEALKMAEQMASYFCGRAQRVRENNGEDYWYRCLENEFGGMNEVLYNLFAVTADD 230
Query: 307 KHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFM 366
H AH FDKP F L D + G HANTH+ V G RYE GD F
Sbjct: 231 HHAECAHWFDKPVFYRPLVEGTDPLPGLHANTHLAQVQGFAARYEHLGDEEAMAAVRNFF 290
Query: 367 DIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN-----EESCTTYNMLKVSRHLFRWTK 421
++ H ++TGG++ E W + LA + + EESCT YN+LK++R+LFR T
Sbjct: 291 ALILQHHTFSTGGSNWYERWGNEDSLAEAINNTDASRITEESCTQYNILKLARYLFRHTG 350
Query: 422 EMVYADYYERALTNGVLSIQR 442
+ AD+YERA+ N V+ IQ+
Sbjct: 351 DPALADFYERAILNDVIGIQK 371
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 30/57 (52%), Gaps = 9/57 (15%)
Query: 427 DYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
D Y A N V + PGV IY LPLG G K WGT + +FWCCYGT +
Sbjct: 441 DPYAAAHANSV----QPAGPGVYIYYLPLGVGHDK-----NWGTPWDTFWCCYGTAV 488
>gi|116620365|ref|YP_822521.1| hypothetical protein Acid_1242 [Candidatus Solibacter usitatus
Ellin6076]
gi|116223527|gb|ABJ82236.1| protein of unknown function DUF1680 [Candidatus Solibacter usitatus
Ellin6076]
Length = 664
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 166/403 (41%), Positives = 222/403 (55%), Gaps = 38/403 (9%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWE--- 158
L+ + V+L P A + N Y+ L D L+ +F+ AG P++ + GWE
Sbjct: 64 LQPFPMSQVRLLPGPF-LDAAEWNRGYMNRLPADRLLHAFRLNAGLPSSAQPLGGWEIYV 122
Query: 159 DPTC--------ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMG-SG 209
+PT ELRGHFVGH+LSASA ++AS + K K +V+ L++CQ K+G SG
Sbjct: 123 EPTPGKRINSEGELRGHFVGHFLSASAQLYASMGDKDAKAKADYIVAELAKCQQKLGPSG 182
Query: 210 YLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALK----MTKWMVEYF 265
YLSAFP E FDR +A KPVWAP+YTIHKI+AG+ D YT A N QAL+ M+ W E+
Sbjct: 183 YLSAFPIEWFDRLDARKPVWAPFYTIHKIMAGMFDMYTLAGNQQALQVLEGMSNWADEW- 241
Query: 266 YNRVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
T E H L E GGMN+VLY L +T + + F K F L
Sbjct: 242 --------TASKSEAHMQDILRTEYGGMNEVLYNLAAVTGNDRWAKAGDRFTKKEFFNPL 293
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE 384
A++ D ++G H NTHIP VIG+ RYE++ D + +F V + Y T GTS GE
Sbjct: 294 ALRNDALTGLHVNTHIPQVIGAAARYEISSDMRFHDVADYFWYEVVTARSYVTEGTSNGE 353
Query: 385 FW-SDPKRLASTL--GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVL-SI 440
W + P+ LA+ L E C +YNMLK++RHL+ W + Y DYYERAL N L +I
Sbjct: 354 GWLTQPRMLAAELKRSVATAECCCSYNMLKLTRHLYGWKPDPAYFDYYERALFNHRLGTI 413
Query: 441 QRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Q T G Y L L G ++ + T SFWCC G+G+
Sbjct: 414 QPKT--GYTQYYLSLTPG-----AWKTFNTEDKSFWCCTGSGV 449
>gi|427385118|ref|ZP_18881623.1| hypothetical protein HMPREF9447_02656 [Bacteroides oleiciplenus YIT
12058]
gi|425727286|gb|EKU90146.1| hypothetical protein HMPREF9447_02656 [Bacteroides oleiciplenus YIT
12058]
Length = 629
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 150/364 (41%), Positives = 198/364 (54%), Gaps = 11/364 (3%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHM 179
RA + + +L DV+ + +F+ TAG T + GWE CELRGH GH LSA + M
Sbjct: 60 RAMEVDQRWLKEADVNRFLHAFRVTAGLATGAQNLGGWESLDCELRGHTTGHLLSALSLM 119
Query: 180 WASTHNVTLKEKMTAVVSALSECQNKMG-SGYLSAFPSEQFDRFEALKPVWAPYYTIHKI 238
+AST + + K +V L+ECQ +G +GYLSAFP DR + VWAP+YT+HK+
Sbjct: 120 YASTGDEQYRTKGAELVKGLAECQQTLGKNGYLSAFPEYFIDRAIKEEIVWAPFYTLHKV 179
Query: 239 LAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYR 298
AGLLDQYT N QAL + M ++ YN+++ + + + LN E GGM + Y
Sbjct: 180 YAGLLDQYTLCGNQQALDVLTGMCDWAYNKLKPL----TPTQLQGMLNSEFGGMPETFYN 235
Query: 299 LYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLY 358
LY +T + +H LA +F L LA + D ++G H NT IP V+G YE+TG+P
Sbjct: 236 LYALTGNARHKELAEMFYHNSILDPLAARRDSLAGIHVNTQIPKVLGEARGYEMTGNPQS 295
Query: 359 KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFR 418
FF + V H Y TGG S E +S P L+ L E+C TYNMLK++RHLF
Sbjct: 296 ATIANFFWEAVVGDHTYVTGGNSDKEIFSKPGILSDQLSENTTETCNTYNMLKLTRHLFT 355
Query: 419 WTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCC 478
W ADYYERAL N +LS Q E G + Y L G K Y F CC
Sbjct: 356 WDASPARADYYERALYNHILSSQN-PETGGVTYYHTLHPGSCKKFHY-----PFRDNTCC 409
Query: 479 YGTG 482
GTG
Sbjct: 410 VGTG 413
>gi|329957171|ref|ZP_08297738.1| hypothetical protein HMPREF9445_02614 [Bacteroides clarus YIT
12056]
gi|328523439|gb|EGF50538.1| hypothetical protein HMPREF9445_02614 [Bacteroides clarus YIT
12056]
Length = 694
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 151/389 (38%), Positives = 215/389 (55%), Gaps = 18/389 (4%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG-------SPTAGKAY 154
++ L DV+L PS + ++ ++ +DV+ L+ SF+ AG K Y
Sbjct: 96 VESFDLQDVRLLPSRFRDNMLRDSV-WMTSIDVNRLIHSFRTNAGIWAGREGGYVTVKKY 154
Query: 155 EGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAF 214
GWE CELRGH GH LSA M+A+T + K K ++V+ L + Q+ +G+GYLSAF
Sbjct: 155 GGWESLDCELRGHTTGHLLSAYGLMYAATGSEIFKLKGDSIVTELGKVQDALGNGYLSAF 214
Query: 215 PSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
P E +R + VWAP+YT+HK+ +GL+DQY +ADN QAL + M ++ Y++++ +
Sbjct: 215 PEELINRNIKGQSVWAPWYTLHKLFSGLIDQYLYADNAQALAVVTKMGDWAYDKLKPL-- 272
Query: 275 KYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
S E + E GG+N+ Y LY +T D ++ LAH F + L Q DD+
Sbjct: 273 --SEETRRRMIRNEFGGINESFYNLYAVTGDERYRWLAHFFYHNDVIDPLKEQNDDLGTK 330
Query: 335 HANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAS 394
H NT IP V+ YE+TGD K FF + H +A G +S E + D KR +
Sbjct: 331 HTNTFIPKVLAEARNYELTGDKDSKALSDFFWHTMIDHHTFAPGCSSQKEHYFDTKRFSH 390
Query: 395 TLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLP 454
L E+C TYNMLK+SRHLF W + ADYYERAL N +L Q+ + G++ Y LP
Sbjct: 391 FLNGYTGETCCTYNMLKLSRHLFCWQPDARIADYYERALYNHILG-QQDPQTGMVCYFLP 449
Query: 455 LGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
L G K S T+ +SFWCC G+G
Sbjct: 450 LLSGAHKVYS-----TKENSFWCCVGSGF 473
>gi|330995449|ref|ZP_08319354.1| hypothetical protein HMPREF9442_00414 [Paraprevotella xylaniphila
YIT 11841]
gi|329575517|gb|EGG57055.1| hypothetical protein HMPREF9442_00414 [Paraprevotella xylaniphila
YIT 11841]
Length = 618
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 157/400 (39%), Positives = 213/400 (53%), Gaps = 32/400 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLE--YLLMLDVDSLVWSFQKTAGSPTAGKAYEGWED 159
L+ S DV+L+ S W Q+ +L+ YL ++ D L+ +F+ TAG P+ K EGWE
Sbjct: 33 LRPFSGKDVELEAS---WIKQREDLDVAYLQSVEADRLLHNFRVTAGLPSLAKPLEGWES 89
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF 219
P LRGHF GHYLSA + + + +++ +V L +CQ G+GYLSAFP + F
Sbjct: 90 PGVGLRGHFTGHYLSALSVLAERYGDGWASQRLEYMVDELYKCQQAHGNGYLSAFPEKDF 149
Query: 220 DRFEA-LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
+ E VWAPYYT+HKIL GLLD YT N +A M + + Y R+ ++ +
Sbjct: 150 ETLETRFTGVWAPYYTLHKILQGLLDAYTKTGNRKAYGMVEALAGYVEGRMAK-LSPERI 208
Query: 279 ERHWNSL----NEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
ER ++ E G MN+ LY LY I+ +P+HL LA FD FL L D ++G
Sbjct: 209 ERMMYTVEANPQNEAGAMNEALYELYGISGNPRHLALAACFDPAWFLEPLVRNEDILAGL 268
Query: 335 HANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA------------ 382
HANTHI +V G RYEVTG+ YK F DI+ H Y G +S
Sbjct: 269 HANTHIVLVNGFARRYEVTGEEKYKKAAMQFWDILQRGHAYVNGTSSGPRPVVTTRTSLT 328
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ- 441
E W +P L +TL E ESC T+N K+S +LF WT + YAD Y NG L +Q
Sbjct: 329 AEHWGEPGHLCNTLTREIAESCVTHNTQKLSAYLFGWTGDPCYADAYMNTFYNGALPVQS 388
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
R T G +Y LPL G + K Y + + F+CC G+
Sbjct: 389 RST--GAYVYHLPL--GSPRNKKY----LKDNDFFCCSGS 420
>gi|298483785|ref|ZP_07001958.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
gi|298270079|gb|EFI11667.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
Length = 642
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 156/390 (40%), Positives = 215/390 (55%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV+L PS + ++ ++ +DV+ L+ SF+ AG AG K
Sbjct: 44 VESFDLKDVRLLPSRFRDNMLRDSV-WMTSIDVNRLLHSFRTNAGV-FAGREGGYMTVKK 101
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A M+A+T + K K ++V+ L+E QN + GYLSA
Sbjct: 102 LGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKGDSLVNGLTEVQNALKGGYLSA 161
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP E +R K VWAP+YT+HK+ +GL+DQY +ADN QALK M ++ YN+++ +
Sbjct: 162 FPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQQALKTVTKMGDWAYNKLKPL- 220
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
S E + E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 221 ---SEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLAEYFYHNDVIDPLKELRDDLGT 277
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP VI YE+T + K FF + H +A G +S E + DPK+ +
Sbjct: 278 KHTNTFIPKVIAEARNYELTENETSKKLSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFS 337
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 338 KHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVTYFL 396
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + T+ +SFWCC G+G
Sbjct: 397 PLLSGSHKL-----YSTKENSFWCCVGSGF 421
>gi|427386203|ref|ZP_18882400.1| hypothetical protein HMPREF9447_03433 [Bacteroides oleiciplenus YIT
12058]
gi|425726590|gb|EKU89454.1| hypothetical protein HMPREF9447_03433 [Bacteroides oleiciplenus YIT
12058]
Length = 616
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 149/393 (37%), Positives = 214/393 (54%), Gaps = 26/393 (6%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
S +V L S + R ++ N+ +L LD D L+ +F+ TAG P+ + EGWE P LR
Sbjct: 35 SNEEVTLKSSWIKQR-EELNITFLKSLDPDRLLHNFRVTAGLPSNAEPLEGWESPKIGLR 93
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEA- 224
GHFVGHYLSA + + ++ L E++ ++ L +CQ G+ YLSAFP + FD EA
Sbjct: 94 GHFVGHYLSAVSSLVEKYKDLELVERLRYMIDELCKCQQSFGNSYLSAFPDKDFDALEAK 153
Query: 225 LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS 284
VWAPYYT +K++ GLLD YT N +A M M Y NR+ ++ ++E+ +
Sbjct: 154 FTGVWAPYYTYNKVMQGLLDAYTHTGNQKAYDMLLDMAAYVDNRMSK-LSGETIEKMLYT 212
Query: 285 LN----EETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHI 340
++ E G MN+VLY+LY I+++PKHL LA +FD+ F+ LA D +SG H+NTH+
Sbjct: 213 VDANPQNEPGAMNEVLYKLYKISRNPKHLALAEIFDRNWFITPLAENKDILSGLHSNTHL 272
Query: 341 PVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA------------GEFWSD 388
+V G RY +TG+ Y T F D++ + H YA G +S E W
Sbjct: 273 VLVNGFAQRYSITGESKYYAASTNFWDMLISQHVYANGTSSGPRPNATTRTSVTAEHWGV 332
Query: 389 PKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGV 448
P L +TL E ESC ++N K++ +F WT YAD Y N VL+ Q G
Sbjct: 333 PGHLCNTLTKEIAESCVSHNTQKLTSSIFTWTAAPKYADAYMNTFYNAVLASQ-SAHTGA 391
Query: 449 MIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
+Y LPL G + K Y + + F CC G+
Sbjct: 392 YMYHLPL--GSPRNKKY----LKDNDFACCSGS 418
>gi|265752243|ref|ZP_06088036.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263237035|gb|EEZ22505.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 640
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 150/392 (38%), Positives = 217/392 (55%), Gaps = 24/392 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
+K L DV+L PS + ++ ++ ++VD L+ SF+ AG AG K
Sbjct: 42 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGV-FAGREGGYMTVKK 99
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA M+A+T + + K ++VS L+E QN +G+GYLSA
Sbjct: 100 LGGWESLDCELRGHTTGHLLSAYGLMYAATGSKLFRHKGDSLVSGLAEVQNALGNGYLSA 159
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ--N 271
+P E +R VWAP+YT+HK+ +GL+DQY ++DN +AL++ M ++ Y++++ +
Sbjct: 160 YPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVIRMADWAYHKLKPLD 219
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
T+ + R+ E GG+N+ Y LY IT D +H LA F + L DD+
Sbjct: 220 ETTRQKMIRN------EFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDL 273
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
H NT IP VI YE+T D + FF + H +A G +S E + DP R
Sbjct: 274 GTKHTNTFIPKVIAEARNYELTEDENSRKLSDFFWHTMIDHHTFAPGCSSDKEHYFDPAR 333
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
+ + E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ + G++ Y
Sbjct: 334 FSKHVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTY 392
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
LPL G K + T+ +SFWCC G+G
Sbjct: 393 FLPLLSGSHKV-----YSTKENSFWCCVGSGF 419
>gi|262407449|ref|ZP_06083997.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|262354257|gb|EEZ03349.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
Length = 642
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 156/390 (40%), Positives = 213/390 (54%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV+L PS + + ++ +DV L+ SF+ AG AG K
Sbjct: 44 VESFDLKDVRLLPSRFRDNMLRDS-AWMTSIDVSRLLHSFRTNAGV-FAGREGGYMTVKK 101
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A M+A+T + K K ++V+ L+E QN + GYLSA
Sbjct: 102 LGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKGDSLVNGLTEVQNALKGGYLSA 161
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP E +R K VWAP+YT+HK+ +GL+DQY +ADN QALK M ++ YN+++ +
Sbjct: 162 FPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQQALKTVTKMGDWAYNKLKPL- 220
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
S E + E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 221 ---SEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLAEYFYHNDVIDPLKELRDDLGT 277
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP VI YE+T + K FF + H +A G +S E + DPK+ +
Sbjct: 278 KHTNTFIPKVIAEARNYELTENETSKKLSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFS 337
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 338 KHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVTYFL 396
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + T+ +SFWCC G+G
Sbjct: 397 PLLSGSHKL-----YSTKENSFWCCVGSGF 421
>gi|326204047|ref|ZP_08193908.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
gi|325985814|gb|EGD46649.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
Length = 743
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 146/374 (39%), Positives = 207/374 (55%), Gaps = 18/374 (4%)
Query: 112 LDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGH 171
+DP ++ A + +EYL D D L+ F T G + Y GWE+ E+RGH +GH
Sbjct: 7 IDPYLVN--AFKKEIEYLEAFDCDKLLSCFYITKGLTPKAENYRGWEN--TEIRGHTMGH 62
Query: 172 YLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAP 231
YL+A A +++T++ + E++ ++ LS CQ SGYLSAFP E FDR E KP+W P
Sbjct: 63 YLTALAQAYSATNDSKIYERLQYLMKELSLCQ--FESGYLSAFPEEFFDRVENRKPIWVP 120
Query: 232 YYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGG 291
+YT+HKI+ GL+ Y A ALK+ + E+ ++R K++ E H N L E GG
Sbjct: 121 WYTMHKIITGLISVYKLAKIETALKIVSRLGEWVFSRTD----KWTPEIHANVLAVEYGG 176
Query: 292 MNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYE 351
MND +Y LY I+ + KH AH+FD+ + D ++ HANT IP +G+ RY
Sbjct: 177 MNDCMYELYKISGNEKHCTAAHMFDEIELFKEIHDGKDILNNRHANTTIPKFLGALNRYL 236
Query: 352 VTGDP--LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
G+ Y T F IV +H Y TGG S E + +P L + + N E+C TYNM
Sbjct: 237 AIGEEEQFYLDTCKEFWSIVTNNHSYVTGGNSEWEHFGEPGILDAERTSTNCETCNTYNM 296
Query: 410 LKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWG 469
LK++R LF+ T YAD+YE TN +LS Q + G+ +Y P+ G K +G
Sbjct: 297 LKMTRELFKITGNKKYADFYENTFTNAILSSQ-NPDTGMTMYFQPMETGYFKV-----YG 350
Query: 470 TRFSSFWCCYGTGL 483
F FWCC GTG+
Sbjct: 351 KPFEHFWCCTGTGM 364
>gi|345512540|ref|ZP_08792066.1| hypothetical protein BSEG_01611 [Bacteroides dorei 5_1_36/D4]
gi|423229086|ref|ZP_17215491.1| hypothetical protein HMPREF1063_01311 [Bacteroides dorei
CL02T00C15]
gi|423244926|ref|ZP_17226000.1| hypothetical protein HMPREF1064_02206 [Bacteroides dorei
CL02T12C06]
gi|345456387|gb|EEO45470.2| hypothetical protein BSEG_01611 [Bacteroides dorei 5_1_36/D4]
gi|392634839|gb|EIY28751.1| hypothetical protein HMPREF1063_01311 [Bacteroides dorei
CL02T00C15]
gi|392640967|gb|EIY34758.1| hypothetical protein HMPREF1064_02206 [Bacteroides dorei
CL02T12C06]
Length = 646
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 150/392 (38%), Positives = 217/392 (55%), Gaps = 24/392 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
+K L DV+L PS + ++ ++ ++VD L+ SF+ AG AG K
Sbjct: 48 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGV-FAGREGGYMTVKK 105
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA M+A+T + + K ++VS L+E QN +G+GYLSA
Sbjct: 106 LGGWESLDCELRGHTTGHLLSAYGLMYAATGSKLFRHKGDSLVSGLAEVQNALGNGYLSA 165
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ--N 271
+P E +R VWAP+YT+HK+ +GL+DQY ++DN +AL++ M ++ Y++++ +
Sbjct: 166 YPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVIRMADWAYHKLKPLD 225
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
T+ + R+ E GG+N+ Y LY IT D +H LA F + L DD+
Sbjct: 226 ETTRQKMIRN------EFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDL 279
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
H NT IP VI YE+T D + FF + H +A G +S E + DP R
Sbjct: 280 GTKHTNTFIPKVIAEARNYELTEDENSRKLSDFFWHTMIDHHTFAPGCSSDKEHYFDPAR 339
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
+ + E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ + G++ Y
Sbjct: 340 FSKHVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTY 398
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
LPL G K + T+ +SFWCC G+G
Sbjct: 399 FLPLLSGSHKV-----YSTKENSFWCCVGSGF 425
>gi|345512074|ref|ZP_08791613.1| hypothetical protein BSAG_00984 [Bacteroides sp. D1]
gi|229443482|gb|EEO49273.1| hypothetical protein BSAG_00984 [Bacteroides sp. D1]
Length = 640
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 156/390 (40%), Positives = 213/390 (54%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV+L PS + + ++ +DV L+ SF+ AG AG K
Sbjct: 42 VESFDLKDVRLLPSRFRDNMLRDS-AWMTSIDVSRLLHSFRTNAGV-FAGREGGYMTVKK 99
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A M+A+T + K K ++V+ L+E QN + GYLSA
Sbjct: 100 LGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKGDSLVNGLTEVQNALKGGYLSA 159
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP E +R K VWAP+YT+HK+ +GL+DQY +ADN QALK M ++ YN+++ +
Sbjct: 160 FPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQQALKTVTKMGDWAYNKLKPL- 218
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
S E + E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 219 ---SEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLAEYFYHNDVIDPLKELRDDLGT 275
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP VI YE+T + K FF + H +A G +S E + DPK+ +
Sbjct: 276 KHTNTFIPKVIAEARNYELTENETSKKLSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFS 335
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 336 KHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVTYFL 394
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + T+ +SFWCC G+G
Sbjct: 395 PLLSGSHKL-----YSTKENSFWCCVGSGF 419
>gi|294810816|ref|ZP_06769462.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|294442004|gb|EFG10825.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 642
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 156/390 (40%), Positives = 213/390 (54%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV+L PS + + ++ +DV L+ SF+ AG AG K
Sbjct: 44 VESFDLKDVRLLPSRFRDNMLRDS-AWMTSIDVSRLLHSFRTNAGV-FAGREGGYMTVKK 101
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A M+A+T + K K ++V+ L+E QN + GYLSA
Sbjct: 102 LGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKGDSLVNGLTEVQNALKGGYLSA 161
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP E +R K VWAP+YT+HK+ +GL+DQY +ADN QALK M ++ YN+++ +
Sbjct: 162 FPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQQALKTVTKMGDWAYNKLKPL- 220
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
S E + E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 221 ---SEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLAEYFYHNDVIDPLKELRDDLGT 277
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP VI YE+T + K FF + H +A G +S E + DPK+ +
Sbjct: 278 KHTNTFIPKVIAEARNYELTENETSKKLSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFS 337
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 338 KHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVTYFL 396
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + T+ +SFWCC G+G
Sbjct: 397 PLLSGSHKL-----YSTKENSFWCCVGSGF 421
>gi|299146414|ref|ZP_07039482.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
gi|298516905|gb|EFI40786.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
Length = 642
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 156/390 (40%), Positives = 213/390 (54%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV+L PS + + ++ +DV L+ SF+ AG AG K
Sbjct: 44 VESFDLKDVRLLPSRFRDNMLRDS-AWMTSIDVSRLLHSFRTNAGV-FAGREGGYMTVKK 101
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A M+A+T + K K ++V+ L+E QN + GYLSA
Sbjct: 102 LGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKGDSLVNGLTEVQNALKGGYLSA 161
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP E +R K VWAP+YT+HK+ +GL+DQY +ADN QALK M ++ YN+++ +
Sbjct: 162 FPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQQALKTVTKMGDWAYNKLKPL- 220
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
S E + E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 221 ---SEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLAEYFYHNDVIDPLKELRDDLGT 277
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP VI YE+T + K FF + H +A G +S E + DPK+ +
Sbjct: 278 KHTNTFIPKVIAEARNYELTENETSKKLSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFS 337
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 338 KHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVTYFL 396
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + T+ +SFWCC G+G
Sbjct: 397 PLLSGSHKL-----YSTKENSFWCCVGSGF 421
>gi|298246853|ref|ZP_06970658.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
gi|297549512|gb|EFH83378.1| protein of unknown function DUF1680 [Ktedonobacter racemifer DSM
44963]
Length = 600
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 144/379 (37%), Positives = 211/379 (55%), Gaps = 20/379 (5%)
Query: 110 VKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG------SPTAGKAYEGWEDPTCE 163
V L P L RA+ N Y+L L +L+ + AG PT + GWE PTC+
Sbjct: 13 VTLQPGPLKKRAE-LNRAYMLSLKSTNLLQNHYGEAGLWNPPQQPT--DCHRGWESPTCQ 69
Query: 164 LRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFE 223
LRGHF+GH+LSA+A + AST + +K K +V+ L+ CQ +M ++ + P + D
Sbjct: 70 LRGHFLGHWLSAAARLVASTGDTEIKGKADFIVAELARCQQEMEGEWIGSIPEKYLDWIA 129
Query: 224 ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWN 283
K VWAP+YT+HK L GL D Y N QAL + ++F+ ++S E+ +
Sbjct: 130 RGKRVWAPHYTLHKTLMGLYDMYEIGQNEQALDILIHWADWFHRWT----GQFSREQMDD 185
Query: 284 SLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVV 343
L+ ETGGM +V LY +T +HL L +D+ L D ++ HANT IP V
Sbjct: 186 ILDVETGGMLEVWANLYGVTNRQEHLDLIRRYDRSRLFDRLLAGEDVLTYMHANTTIPEV 245
Query: 344 IGSQMRYEVTGDPLYKVTGTFFMDIVNASHGY-ATGGTSAGEFWSDPKRLASTLGTENEE 402
G+ +EVTG+ ++ + + GY TGG ++ E W P +L LG EN+E
Sbjct: 246 HGAARAWEVTGEQRWRDIVEAYWRLAVTDRGYFCTGGQTSDEVWCPPHQLGGQLGPENQE 305
Query: 403 SCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKA 462
CT YN+++++ +LFRWT ++VYADYYER NG+L+ Q+ + G++ Y LPL G +K
Sbjct: 306 HCTVYNLMRLANYLFRWTGDVVYADYYERNFYNGILA-QQNAQTGMVAYYLPLETGGTKV 364
Query: 463 KSYHGWGTRFSSFWCCYGT 481
WGT + FWCC+GT
Sbjct: 365 -----WGTPTNDFWCCHGT 378
>gi|294646892|ref|ZP_06724513.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|292637837|gb|EFF56234.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
Length = 640
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 156/390 (40%), Positives = 213/390 (54%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV+L PS + + ++ +DV L+ SF+ AG AG K
Sbjct: 42 VESFDLKDVRLLPSRFRDNMLRDS-AWMTSIDVSRLLHSFRTNAGV-FAGREGGYMTVKK 99
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A M+A+T + K K ++V+ L+E QN + GYLSA
Sbjct: 100 LGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKGDSLVNGLTEVQNALKGGYLSA 159
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP E +R K VWAP+YT+HK+ +GL+DQY +ADN QALK M ++ YN+++ +
Sbjct: 160 FPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQQALKTVTKMGDWAYNKLKPL- 218
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
S E + E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 219 ---SEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLAEYFYHNDVIDPLKELRDDLGT 275
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP VI YE+T + K FF + H +A G +S E + DPK+ +
Sbjct: 276 KHTNTFIPKVIAEARNYELTENETSKKLSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFS 335
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 336 KHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVTYFL 394
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + T+ +SFWCC G+G
Sbjct: 395 PLLSGSHKL-----YSTKENSFWCCVGSGF 419
>gi|423287825|ref|ZP_17266676.1| hypothetical protein HMPREF1069_01719 [Bacteroides ovatus
CL02T12C04]
gi|392671840|gb|EIY65311.1| hypothetical protein HMPREF1069_01719 [Bacteroides ovatus
CL02T12C04]
Length = 643
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 153/390 (39%), Positives = 217/390 (55%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L D++L PS + + ++ +DV+ L+ SF+ AG AG K
Sbjct: 44 VESFDLKDIRLLPSRFRDNMLRDS-AWMTSIDVNRLLHSFRTNAGV-FAGREGGYMTVKK 101
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A ++A+T + K K ++V+ L+E QN + GYLSA
Sbjct: 102 LGGWESLDCELRGHTTGHLLSAYALIYAATGSEIFKLKGDSLVNGLTEVQNALKGGYLSA 161
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP E +R K VWAP+YT+HK+ +GL+DQY +ADN QALK+ M ++ YN+++++
Sbjct: 162 FPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNLQALKVVTKMGDWAYNKLKSL- 220
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
+ E + E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 221 ---TEETRKLMIRNEFGGINESFYNLYAITGDERYRWLAEYFYHNDVIDPLKELRDDLGT 277
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP VI YE+T + + FF + H +A G +S E + DPK+L+
Sbjct: 278 KHTNTFIPKVIAEARSYELTRNETSRKLSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLS 337
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 338 QHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFL 396
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + T+ +SFWCC G+G
Sbjct: 397 PLLSGSHKL-----YSTKENSFWCCVGSGF 421
>gi|336404833|ref|ZP_08585521.1| hypothetical protein HMPREF0127_02834 [Bacteroides sp. 1_1_30]
gi|335940654|gb|EGN02520.1| hypothetical protein HMPREF0127_02834 [Bacteroides sp. 1_1_30]
Length = 640
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 156/390 (40%), Positives = 213/390 (54%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV+L PS + + ++ +DV L+ SF+ AG AG K
Sbjct: 42 VESFDLKDVRLLPSRFRDNMLRDS-AWMTSIDVSRLLHSFRTNAGV-FAGREGGYMTVKK 99
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A M+A+T + K K ++V+ L+E QN + GYLSA
Sbjct: 100 LGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKGDSLVNGLTEVQNALKGGYLSA 159
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP E +R K VWAP+YT+HK+ +GL+DQY +ADN QALK M ++ YN+++ +
Sbjct: 160 FPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQQALKTVTKMGDWAYNKLKPL- 218
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
S E + E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 219 ---SEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLAEYFYHNDVIDPLKELRDDLGT 275
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP VI YE+T + K FF + H +A G +S E + DPK+ +
Sbjct: 276 KHTNTFIPKVIAEARNYELTENETSKKLSEFFWHTMIDHHTFAPGCSSDKEHFFDPKKFS 335
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 336 KHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVTYFL 394
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + T+ +SFWCC G+G
Sbjct: 395 PLLSGSHKL-----YSTKENSFWCCVGSGF 419
>gi|376260753|ref|YP_005147473.1| hypothetical protein [Clostridium sp. BNL1100]
gi|373944747|gb|AEY65668.1| hypothetical protein Clo1100_1435 [Clostridium sp. BNL1100]
Length = 743
Score = 251 bits (641), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 146/374 (39%), Positives = 205/374 (54%), Gaps = 18/374 (4%)
Query: 112 LDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGH 171
+DP ++ A + +EYL D D L+ F KT G K Y GWED E+RGH +GH
Sbjct: 7 IDPYLVN--AFKKEIEYLESFDCDKLLSCFYKTKGLAPKAKNYHGWED--TEIRGHTMGH 62
Query: 172 YLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAP 231
YL+A A +++T++ + E++ ++ LS CQ SGYLSAFP E FDR E KPVW P
Sbjct: 63 YLTALAQAYSATNDSKIYERLQYLLKELSLCQ--FESGYLSAFPEEFFDRVENRKPVWVP 120
Query: 232 YYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGG 291
+YT+HKI+ GL+ Y AL + + ++ ++R K++ E H N L E GG
Sbjct: 121 WYTMHKIITGLISVYKLTKIETALNIVSGLGDWVFSRTD----KWTPEIHANVLAVEYGG 176
Query: 292 MNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYE 351
MND LY LY IT + KH AH+FD+ + D ++ HANT IP +G+ R+
Sbjct: 177 MNDCLYELYKITGNEKHSAAAHMFDEIELFKEIHDGKDILNNRHANTTIPKFLGALNRFL 236
Query: 352 VTGDP--LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
G+ Y T F IV +H Y TGG S E + +P L + + N E+C TYNM
Sbjct: 237 AIGEEEQFYLDTCKEFWSIVTNNHSYVTGGNSEWEHFGEPNILDAERTSTNCETCNTYNM 296
Query: 410 LKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWG 469
LK++R LF+ T + YAD+YE N +LS Q + G+ +Y P+ G K +
Sbjct: 297 LKMTRVLFKITGDKKYADFYENTFINAILSSQ-NPDTGMTMYFQPMATGYFKV-----YS 350
Query: 470 TRFSSFWCCYGTGL 483
F FWCC GTG+
Sbjct: 351 KPFEHFWCCTGTGM 364
>gi|160883345|ref|ZP_02064348.1| hypothetical protein BACOVA_01314 [Bacteroides ovatus ATCC 8483]
gi|156111329|gb|EDO13074.1| hypothetical protein BACOVA_01314 [Bacteroides ovatus ATCC 8483]
Length = 643
Score = 251 bits (640), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 153/390 (39%), Positives = 216/390 (55%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L D++L PS + + ++ +DV+ L+ SF+ AG AG K
Sbjct: 44 VESFDLKDIRLLPSRFRDNMLRDS-AWMTSIDVNRLLHSFRTNAGV-FAGREGGYMTVKK 101
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A ++A+T + K K ++V+ L+E QN + GYLSA
Sbjct: 102 LGGWESLDCELRGHTTGHLLSAYALIYAATGSEIFKLKGDSLVNGLTEVQNALKGGYLSA 161
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP E +R K VWAP+YT+HK+ +GL+DQY +ADN QALK+ M ++ YN+++ +
Sbjct: 162 FPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNLQALKVVTKMGDWAYNKLKPL- 220
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
+ E + E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 221 ---TEETRKLMIRNEFGGINESFYNLYAITGDERYRWLAEYFYHNDVIDPLKELRDDLGT 277
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP VI YE+T + + FF + H +A G +S E + DPK+L+
Sbjct: 278 KHTNTFIPKVIAEARNYELTRNETSRKLSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLS 337
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 338 QHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFL 396
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + T+ +SFWCC G+G
Sbjct: 397 PLLSGAHKL-----YSTKENSFWCCVGSGF 421
>gi|423313782|ref|ZP_17291717.1| hypothetical protein HMPREF1058_02329 [Bacteroides vulgatus
CL09T03C04]
gi|392684317|gb|EIY77645.1| hypothetical protein HMPREF1058_02329 [Bacteroides vulgatus
CL09T03C04]
Length = 640
Score = 251 bits (640), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 149/392 (38%), Positives = 217/392 (55%), Gaps = 24/392 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
+K L DV+L PS + ++ ++ ++VD L+ SF+ AG AG K
Sbjct: 42 VKSFDLKDVRLLPSRFRENMMRDSV-WMASIEVDRLLHSFRTNAGV-FAGREGGYMTVKK 99
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA M+A+T + K+K ++V+ L+E Q +G+GYLSA
Sbjct: 100 LGGWESLDCELRGHTTGHLLSAYGLMYAATGSEQFKQKGDSLVNGLAEVQTALGNGYLSA 159
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ--N 271
+P E +R VWAP+YT+HK+ +GL+DQY ++DN +AL++ M ++ Y++++ +
Sbjct: 160 YPEELINRNICGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVVRMADWAYHKLKPLD 219
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
T+ + R+ E GG+N+ Y LY IT D +H LA F + L DD+
Sbjct: 220 ETTRQKMIRN------EFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDL 273
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
H NT IP VI YE+T D + FF + H +A G +S E + DP R
Sbjct: 274 GTKHTNTFIPKVIAEARNYELTEDENSRKLSDFFWHTMIDHHTFAPGCSSDKEHYFDPAR 333
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
+ + E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ + G++ Y
Sbjct: 334 FSKHVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTY 392
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
LPL G K + T+ +SFWCC G+G
Sbjct: 393 FLPLLSGSHKV-----YSTKENSFWCCVGSGF 419
>gi|319643216|ref|ZP_07997844.1| hypothetical protein HMPREF9011_03445 [Bacteroides sp. 3_1_40A]
gi|345520493|ref|ZP_08799881.1| hypothetical protein BSFG_01473 [Bacteroides sp. 4_3_47FAA]
gi|254835017|gb|EET15326.1| hypothetical protein BSFG_01473 [Bacteroides sp. 4_3_47FAA]
gi|317385120|gb|EFV66071.1| hypothetical protein HMPREF9011_03445 [Bacteroides sp. 3_1_40A]
Length = 640
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 149/392 (38%), Positives = 217/392 (55%), Gaps = 24/392 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
+K L DV+L PS + ++ ++ ++VD L+ SF+ AG AG K
Sbjct: 42 VKSFDLKDVRLLPSRFRENMMRDSV-WMASIEVDRLLHSFRTNAGV-FAGREGGYMTVKK 99
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA M+A+T + K+K ++V+ L+E Q +G+GYLSA
Sbjct: 100 LGGWESLDCELRGHTTGHLLSAYGLMYAATGSEQFKQKGDSLVNGLAEVQTALGNGYLSA 159
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ--N 271
+P E +R VWAP+YT+HK+ +GL+DQY ++DN +AL++ M ++ Y++++ +
Sbjct: 160 YPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVVRMADWAYHKLKPLD 219
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
T+ + R+ E GG+N+ Y LY IT D +H LA F + L DD+
Sbjct: 220 ETTRQKMIRN------EFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDL 273
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
H NT IP VI YE+T D + FF + H +A G +S E + DP R
Sbjct: 274 GTKHTNTFIPKVIAEARNYELTEDENSRKLSDFFWHTMIDHHTFAPGCSSDKEHYFDPAR 333
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
+ + E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ + G++ Y
Sbjct: 334 FSKHVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTY 392
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
LPL G K + T+ +SFWCC G+G
Sbjct: 393 FLPLLSGSHKV-----YSTKENSFWCCVGSGF 419
>gi|427386207|ref|ZP_18882404.1| hypothetical protein HMPREF9447_03437 [Bacteroides oleiciplenus YIT
12058]
gi|425726247|gb|EKU89112.1| hypothetical protein HMPREF9447_03437 [Bacteroides oleiciplenus YIT
12058]
Length = 641
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 148/390 (37%), Positives = 215/390 (55%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L D++L PS + +L ++ + + L+ SF+ AG AG K
Sbjct: 43 VQSFDLKDIRLLPSRFRDNMMRDSL-WMTSIATNRLLHSFRNNAGV-FAGREGGYMTVKK 100
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CE+RGH GH LSA A M+A++ + K K ++VS L+E Q+ +G+GYLSA
Sbjct: 101 LGGWESLDCEIRGHTTGHLLSAYALMYAASGSEIFKLKGDSLVSGLAEVQDALGNGYLSA 160
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
+P E +R VWAP+YT+HK+ +GL+DQY + DN QALK+ M ++ YN+++ +
Sbjct: 161 YPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALKVVTRMGDWAYNKLKPL- 219
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
E + E GG+N+ Y LY IT D ++ LA+ F + L Q DD+
Sbjct: 220 ---DEETRKRMIRNEFGGVNESFYNLYAITGDERYHWLANFFYHNDVIDPLKEQRDDLGT 276
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP V+ YE+T + + FF + A H +A G +S E + DP++ +
Sbjct: 277 KHTNTFIPKVLAEARNYELTQNAESRTLTDFFWHTMIAHHTFAPGCSSDKEHYFDPQQFS 336
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G+ Y L
Sbjct: 337 KHLTGYTGETCCTYNMLKLSRHLFCWTGDASIADYYERALYNHILG-QQDPETGMFSYFL 395
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + T+ +SFWCC G+G
Sbjct: 396 PLLSGSHKV-----YSTQENSFWCCVGSGF 420
>gi|325106457|ref|YP_004276111.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324975305|gb|ADY54289.1| protein of unknown function DUF1680 [Pedobacter saltans DSM 12145]
Length = 648
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 154/399 (38%), Positives = 220/399 (55%), Gaps = 23/399 (5%)
Query: 97 LAGDF-LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG---SPTAG- 151
L GD + L DV+L PS+ ++ + ++L+ LDV+ L+ SF+ TAG S G
Sbjct: 36 LRGDVKVYSFDLKDVRLLPSAFRDNMERDS-KWLMSLDVNRLLHSFRNTAGVFSSKEGGY 94
Query: 152 ---KAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQ---NK 205
K GWE C+LRGH GH +SA ++++AST + K K ++V+ L+E Q K
Sbjct: 95 MTIKKLGGWESLDCDLRGHTTGHIMSALSYLYASTGDERYKIKSDSIVNGLAEVQYALTK 154
Query: 206 MG-SGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEY 264
+G +G++SAFP +R A + +WAP+YT+HKI AGL+DQY + N +AL + +
Sbjct: 155 VGQNGFISAFPENFINRNIAGQSIWAPWYTLHKIYAGLIDQYLYCGNEKALDIMTKAASW 214
Query: 265 FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
Y ++ + + E+ L E GG N+ Y LY IT +P+HL LA F L L
Sbjct: 215 AYQKLMPL----TEEQRATMLRNEFGGTNEAFYNLYAITGNPEHLKLAEFFYHNAVLDPL 270
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE 384
A + D+ HANT IP +IG YE+ D K TFF D V Y TGG S E
Sbjct: 271 AERKSDLYFKHANTFIPKLIGEARNYELNADKRSKDVATFFWDEVVNHQTYCTGGNSHKE 330
Query: 385 FWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT 444
+ +++ L +E+C + NMLK++RHLF W YAD+YERAL N +L Q+
Sbjct: 331 KFIHTDKVSENLTGYTQETCNSNNMLKLTRHLFSWDANPKYADFYERALYNHILG-QQDP 389
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G++ Y LPL G SY + T +SFWCC GTG
Sbjct: 390 QTGMVAYFLPLLPG-----SYKVYSTAENSFWCCVGTGF 423
>gi|423212948|ref|ZP_17199477.1| hypothetical protein HMPREF1074_01009 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694204|gb|EIY87432.1| hypothetical protein HMPREF1074_01009 [Bacteroides xylanisolvens
CL03T12C04]
Length = 642
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 156/390 (40%), Positives = 211/390 (54%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV L PS + + ++ +DV L+ SF+ AG AG K
Sbjct: 44 VESFDLKDVCLLPSRFRDNMLRDS-AWMTSIDVSRLLHSFRTNAGV-FAGREGGYMTVKK 101
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A M+A+T + K K ++V+ L+E QN + GYLSA
Sbjct: 102 LGGWESLDCELRGHTTGHLLSAYALMYAATGSEIFKLKGDSLVNGLTEVQNALKGGYLSA 161
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP E +R K VWAP+YT+HK+ +GL+DQY +ADN QALK M ++ YN+++ +
Sbjct: 162 FPEELINRNIRGKSVWAPWYTLHKLYSGLIDQYLYADNQQALKTVTKMGDWAYNKLKPL- 220
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
S E + E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 221 ---SEETRKLMIRNEFGGVNESFYNLYAITGDERYRWLAEYFYHNDVIDPLKELRDDLGT 277
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP VI YE+T + K FF + H +A G +S E + DPK +
Sbjct: 278 KHTNTFIPKVIAEARNYELTENETSKKLSEFFWHTMIDHHTFAPGCSSDKEHFFDPKNFS 337
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 338 KHLTGYTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVTYFL 396
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + T+ +SFWCC G+G
Sbjct: 397 PLLSGSHKL-----YSTKENSFWCCVGSGF 421
>gi|413926260|gb|AFW66192.1| hypothetical protein ZEAMMB73_605676 [Zea mays]
gi|413952504|gb|AFW85153.1| hypothetical protein ZEAMMB73_422486 [Zea mays]
Length = 510
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 111/136 (81%), Positives = 121/136 (88%)
Query: 348 MRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTY 407
MRYEVTGDPLYK +FFMD +N+SH YATGGTSAGEFW+DPKRLA TL TENEESCTTY
Sbjct: 1 MRYEVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTY 60
Query: 408 NMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHG 467
NMLKVSR+LFRWTKE+ YADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA SYHG
Sbjct: 61 NMLKVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHG 120
Query: 468 WGTRFSSFWCCYGTGL 483
WGT++ SFWCCYGTG+
Sbjct: 121 WGTKYDSFWCCYGTGI 136
>gi|150002728|ref|YP_001297472.1| hypothetical protein BVU_0120 [Bacteroides vulgatus ATCC 8482]
gi|294776982|ref|ZP_06742443.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|149931152|gb|ABR37850.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
gi|294449230|gb|EFG17769.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 640
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 148/392 (37%), Positives = 217/392 (55%), Gaps = 24/392 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
+K L DV+L PS + ++ ++ ++V+ L+ SF+ AG AG K
Sbjct: 42 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVNRLLHSFRTNAGV-FAGREGGYMTVKK 99
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA M+A+T + K+K ++V+ L+E Q +G+GYLSA
Sbjct: 100 LGGWESLDCELRGHTTGHLLSAYGLMYAATGSEQFKQKGDSLVNGLAEVQTALGNGYLSA 159
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ--N 271
+P E +R VWAP+YT+HK+ +GL+DQY ++DN +AL++ M ++ Y++++ +
Sbjct: 160 YPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEVVIRMADWAYHKLKPLD 219
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
T+ + R+ E GG+N+ Y LY IT D +H LA F + L DD+
Sbjct: 220 ETTRQKMIRN------EFGGVNESFYNLYAITGDERHRWLAQFFYHNEVIDPLKELRDDL 273
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
H NT IP VI YE+T D + FF + H +A G +S E + DP R
Sbjct: 274 GTKHTNTFIPKVIAEARNYELTEDENSRKLSDFFWHTMIDHHTFAPGCSSDKEHYFDPAR 333
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
+ + E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ + G++ Y
Sbjct: 334 FSKHVSGYTGETCCTYNMLKLSRHLFCWTADAAIADYYERALYNHILG-QQDPQTGMVTY 392
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
LPL G K + T+ +SFWCC G+G
Sbjct: 393 FLPLLSGSHKV-----YSTKENSFWCCVGSGF 419
>gi|298384470|ref|ZP_06994030.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
gi|298262749|gb|EFI05613.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
Length = 641
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 149/360 (41%), Positives = 201/360 (55%), Gaps = 19/360 (5%)
Query: 132 LDVDSLVWSFQKTAGSPTAG--------KAYEGWEDPTCELRGHFVGHYLSASAHMWAST 183
LDV+ L+ SF+ AG AG K GWE CELRGH GH LSA A M+A+T
Sbjct: 73 LDVNRLLHSFRTNAGV-FAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAAT 131
Query: 184 HNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLL 243
+ K K ++V+ L+E QN + GYLSA+P E +R K VWAP+YT+HK+ +GL+
Sbjct: 132 GSEIFKLKGDSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYSGLI 191
Query: 244 DQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTIT 303
DQY +ADN QAL + M ++ YN+++ + S E + E GG+N+ Y LY IT
Sbjct: 192 DQYLYADNQQALSVVTKMGDWAYNKLKPL----SEETRRLMIRNEFGGINESFYNLYAIT 247
Query: 304 QDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGT 363
D ++ LA F + L DD+ H NT IP VI YE+T + K
Sbjct: 248 GDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKKLSE 307
Query: 364 FFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEM 423
FF + H +A G +S E + DPK+ + L E+C TYNMLK+SRHLF WT +
Sbjct: 308 FFWHTMIDHHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWTGDS 367
Query: 424 VYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
ADYYERAL N +L Q+ E G++ Y LPL G K + T+ +SFWCC G+G
Sbjct: 368 SIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKL-----YSTKENSFWCCVGSGF 421
>gi|29345547|ref|NP_809050.1| hypothetical protein BT_0137 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337439|gb|AAO75244.1| Acetyl-CoA carboxylase-like protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 641
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 149/360 (41%), Positives = 201/360 (55%), Gaps = 19/360 (5%)
Query: 132 LDVDSLVWSFQKTAGSPTAG--------KAYEGWEDPTCELRGHFVGHYLSASAHMWAST 183
LDV+ L+ SF+ AG AG K GWE CELRGH GH LSA A M+A+T
Sbjct: 73 LDVNRLLHSFRTNAGV-FAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAAT 131
Query: 184 HNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLL 243
+ K K ++V+ L+E QN + GYLSA+P E +R K VWAP+YT+HK+ +GL+
Sbjct: 132 GSEIFKLKGDSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYSGLI 191
Query: 244 DQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTIT 303
DQY +ADN QAL + M ++ YN+++ + S E + E GG+N+ Y LY IT
Sbjct: 192 DQYLYADNQQALSVVTKMGDWAYNKLKPL----SEETRRLMIRNEFGGINESFYNLYAIT 247
Query: 304 QDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGT 363
D ++ LA F + L DD+ H NT IP VI YE+T + K
Sbjct: 248 GDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKKLSE 307
Query: 364 FFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEM 423
FF + H +A G +S E + DPK+ + L E+C TYNMLK+SRHLF WT +
Sbjct: 308 FFWHTMIDHHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWTGDS 367
Query: 424 VYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
ADYYERAL N +L Q+ E G++ Y LPL G K + T+ +SFWCC G+G
Sbjct: 368 SIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKL-----YSTKENSFWCCVGSGF 421
>gi|383123868|ref|ZP_09944538.1| hypothetical protein BSIG_4114 [Bacteroides sp. 1_1_6]
gi|251838901|gb|EES66986.1| hypothetical protein BSIG_4114 [Bacteroides sp. 1_1_6]
Length = 641
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 149/360 (41%), Positives = 201/360 (55%), Gaps = 19/360 (5%)
Query: 132 LDVDSLVWSFQKTAGSPTAG--------KAYEGWEDPTCELRGHFVGHYLSASAHMWAST 183
LDV+ L+ SF+ AG AG K GWE CELRGH GH LSA A M+A+T
Sbjct: 73 LDVNRLLHSFRTNAGV-FAGREGGYMTVKKLGGWESLDCELRGHTTGHLLSAYALMYAAT 131
Query: 184 HNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLL 243
+ K K ++V+ L+E QN + GYLSA+P E +R K VWAP+YT+HK+ +GL+
Sbjct: 132 GSEIFKLKGDSLVNGLTEVQNALKGGYLSAYPEELINRNIQGKSVWAPWYTLHKLYSGLI 191
Query: 244 DQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTIT 303
DQY +ADN QAL + M ++ YN+++ + S E + E GG+N+ Y LY IT
Sbjct: 192 DQYLYADNQQALSVVTKMGDWAYNKLKPL----SEETRRLMIRNEFGGINESFYNLYAIT 247
Query: 304 QDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGT 363
D ++ LA F + L DD+ H NT IP VI YE+T + K
Sbjct: 248 GDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNTFIPKVIAEARNYELTQNETSKKLSE 307
Query: 364 FFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEM 423
FF + H +A G +S E + DPK+ + L E+C TYNMLK+SRHLF WT +
Sbjct: 308 FFWHTMIDHHTFAPGCSSDKEHFFDPKKCSKHLTGYTGETCCTYNMLKLSRHLFCWTGDS 367
Query: 424 VYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
ADYYERAL N +L Q+ E G++ Y LPL G K + T+ +SFWCC G+G
Sbjct: 368 SIADYYERALYNHILG-QQDPETGMVTYFLPLLSGSHKL-----YSTKENSFWCCVGSGF 421
>gi|212690961|ref|ZP_03299089.1| hypothetical protein BACDOR_00451 [Bacteroides dorei DSM 17855]
gi|212666193|gb|EEB26765.1| hypothetical protein BACDOR_00451 [Bacteroides dorei DSM 17855]
Length = 646
Score = 248 bits (632), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 147/392 (37%), Positives = 216/392 (55%), Gaps = 24/392 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV+L PS + ++ ++ ++VD L+ SF+ AG AG K
Sbjct: 48 VRSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGV-FAGREGGYMTVKK 105
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA M+A+T + K K ++VS L+E QN +G+GYLSA
Sbjct: 106 LGGWESLDCELRGHTTGHLLSAYGLMYAATGSELFKHKGDSLVSGLAEVQNALGNGYLSA 165
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ--N 271
+P E +R VWAP+YT+HK+ +GL+DQY ++DN +AL++ M ++ Y++++ +
Sbjct: 166 YPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEIVTRMADWAYHKLKPLD 225
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+T+ + R+ E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 226 EVTRRKMIRN------EFGGINESFYNLYAITGDERYRWLARFFYHNEVIDPLKELRDDL 279
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
H NT IP V+ YE+T D + FF + H +A G +S E + DP
Sbjct: 280 GTKHTNTFIPKVLAEARNYELTEDEDSRKLSGFFWHTMIDRHTFAPGCSSDKEHYFDPDH 339
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
+ + E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ G++ Y
Sbjct: 340 FSKHISGYTGETCCTYNMLKLSRHLFCWTADAAVADYYERALYNHILG-QQDPHTGMVTY 398
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
LPL G K + T+ +SFWCC G+G
Sbjct: 399 FLPLLSGSHKV-----YSTKENSFWCCVGSGF 425
>gi|195643412|gb|ACG41174.1| hypothetical protein [Zea mays]
gi|413926261|gb|AFW66193.1| hypothetical protein ZEAMMB73_983510 [Zea mays]
Length = 262
Score = 248 bits (632), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 134/246 (54%), Positives = 166/246 (67%), Gaps = 12/246 (4%)
Query: 5 VFKVLVLFLSCWVALCKECTNSFPQLASHTFRY--ELLSSKNETWKKEVYSHY------H 56
+ V++L A K CTN+FP L SHT R +L T + + H+ H
Sbjct: 15 IVVVMLLAAGFRGAEGKSCTNAFPGLTSHTERAAAQLRPGPPATVLQPIIHHHRHGREQH 74
Query: 57 LTPTDDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFK----LAGDFLKEVSLHDVKL 112
LTPTD+S W +L+PR+ L + F W M+YR+++ G AG FL E SLHDV+L
Sbjct: 75 LTPTDESTWMSLMPRRALRREEAFDWLMLYRELRGGGGSARPGVAAGAFLSEASLHDVRL 134
Query: 113 DPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHY 172
+P S++WRAQQTNLEYLL+LDVD LVWSF+K AG G Y GWE P +LRGHFVGHY
Sbjct: 135 EPGSMYWRAQQTNLEYLLLLDVDRLVWSFRKQAGLTAPGTPYGGWEGPGIQLRGHFVGHY 194
Query: 173 LSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPY 232
LSA+A MWASTHN TL KM++VV AL +CQ KMG+GYLSAFPS+ FD EA+K VWAPY
Sbjct: 195 LSATAKMWASTHNDTLNAKMSSVVDALYDCQKKMGTGYLSAFPSDFFDCLEAIKSVWAPY 254
Query: 233 YTIHKI 238
YTIHK+
Sbjct: 255 YTIHKV 260
>gi|237722400|ref|ZP_04552881.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448210|gb|EEO54001.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
Length = 644
Score = 247 bits (631), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 152/385 (39%), Positives = 213/385 (55%), Gaps = 20/385 (5%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KAYEGWE 158
L DV+L PS + + ++ +DV+ L+ SF+ AG AG K GWE
Sbjct: 50 LKDVRLLPSRFRDNMLRDS-AWMTSIDVNRLLHSFRTNAGV-FAGREGGYMTVKKLGGWE 107
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
CELRGH GH LSA M+A+T + K K ++V+ L E QN + +GYLSA+P E
Sbjct: 108 SLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEEL 167
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
+R K VWAP+YT+HK+ +GL+DQY +ADN +AL + M ++ YN+++ + S
Sbjct: 168 INRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALTIVTRMGDWAYNKLKPL----SE 223
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
E + E GG+N+ Y LY+IT D ++ LA F + L DD+ H NT
Sbjct: 224 ETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNT 283
Query: 339 HIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT 398
IP VI YE+T + + FF + H +A G +S E + DPK+L+ L
Sbjct: 284 FIPKVIAEARNYELTRNETSRKLSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTG 343
Query: 399 ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 458
E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y LPL G
Sbjct: 344 YTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSG 402
Query: 459 DSKAKSYHGWGTRFSSFWCCYGTGL 483
K + T+ +SFWCC G+G
Sbjct: 403 SHKL-----YSTKENSFWCCVGSGF 422
>gi|383115004|ref|ZP_09935763.1| hypothetical protein BSGG_0819 [Bacteroides sp. D2]
gi|313693284|gb|EFS30119.1| hypothetical protein BSGG_0819 [Bacteroides sp. D2]
Length = 643
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 152/385 (39%), Positives = 213/385 (55%), Gaps = 20/385 (5%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KAYEGWE 158
L DV+L PS + + ++ +DV+ L+ SF+ AG AG K GWE
Sbjct: 50 LKDVRLLPSRFRDNMLRDS-AWMTSIDVNRLLHSFRTNAGV-FAGREGGYMTVKKLGGWE 107
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
CELRGH GH LSA M+A+T + K K ++V+ L E QN + +GYLSA+P E
Sbjct: 108 SLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEEL 167
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
+R K VWAP+YT+HK+ +GL+DQY +ADN +AL + M ++ YN+++ + S
Sbjct: 168 INRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALTIVTRMGDWAYNKLKPL----SE 223
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
E + E GG+N+ Y LY+IT D ++ LA F + L DD+ H NT
Sbjct: 224 ETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNT 283
Query: 339 HIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT 398
IP VI YE+T + + FF + H +A G +S E + DPK+L+ L
Sbjct: 284 FIPKVIAEARNYELTRNETSRKLSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTG 343
Query: 399 ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 458
E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y LPL G
Sbjct: 344 YTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSG 402
Query: 459 DSKAKSYHGWGTRFSSFWCCYGTGL 483
K + T+ +SFWCC G+G
Sbjct: 403 SHKL-----YSTKENSFWCCVGSGF 422
>gi|255692201|ref|ZP_05415876.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides
finegoldii DSM 17565]
gi|260622065|gb|EEX44936.1| hypothetical protein BACFIN_07304 [Bacteroides finegoldii DSM
17565]
Length = 644
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 152/385 (39%), Positives = 213/385 (55%), Gaps = 20/385 (5%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KAYEGWE 158
L DV+L PS + + ++ +DV+ L+ SF+ AG AG K GWE
Sbjct: 50 LKDVRLLPSRFRDNMLRDS-AWMTSIDVNRLLHSFRTNAGV-FAGREGGYMTVKKLGGWE 107
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
CELRGH GH LSA M+A+T + K K ++V+ L E QN + +GYLSA+P E
Sbjct: 108 SLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEEL 167
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
+R K VWAP+YT+HK+ +GL+DQY +ADN +AL + M ++ YN+++ + S
Sbjct: 168 INRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALIIVTRMGDWAYNKLKPL----SE 223
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
E + E GG+N+ Y LY+IT D ++ LA F + L DD+ H NT
Sbjct: 224 ETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNT 283
Query: 339 HIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT 398
IP VI YE+T + + FF + H +A G +S E + DPK+L+ L
Sbjct: 284 FIPKVIAEARNYELTRNETSRKLSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTG 343
Query: 399 ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 458
E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y LPL G
Sbjct: 344 YTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSG 402
Query: 459 DSKAKSYHGWGTRFSSFWCCYGTGL 483
K + T+ +SFWCC G+G
Sbjct: 403 SHKL-----YSTKENSFWCCVGSGF 422
>gi|336415976|ref|ZP_08596314.1| hypothetical protein HMPREF1017_03422 [Bacteroides ovatus
3_8_47FAA]
gi|335939879|gb|EGN01751.1| hypothetical protein HMPREF1017_03422 [Bacteroides ovatus
3_8_47FAA]
Length = 644
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 152/385 (39%), Positives = 213/385 (55%), Gaps = 20/385 (5%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KAYEGWE 158
L DV+L PS + + ++ +DV+ L+ SF+ AG AG K GWE
Sbjct: 50 LKDVRLLPSRFRDNMLRDS-AWMTSIDVNRLLHSFRTNAGV-FAGREGGYMTVKKLGGWE 107
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
CELRGH GH LSA M+A+T + K K ++V+ L E QN + +GYLSA+P E
Sbjct: 108 SLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEEL 167
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
+R K VWAP+YT+HK+ +GL+DQY +ADN +AL + M ++ YN+++ + S
Sbjct: 168 INRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALIIVTRMGDWAYNKLKPL----SE 223
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
E + E GG+N+ Y LY+IT D ++ LA F + L DD+ H NT
Sbjct: 224 ETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNT 283
Query: 339 HIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT 398
IP VI YE+T + + FF + H +A G +S E + DPK+L+ L
Sbjct: 284 FIPKVIAEARNYELTRNETSRKLSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTG 343
Query: 399 ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 458
E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y LPL G
Sbjct: 344 YTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSG 402
Query: 459 DSKAKSYHGWGTRFSSFWCCYGTGL 483
K + T+ +SFWCC G+G
Sbjct: 403 SHKL-----YSTKENSFWCCVGSGF 422
>gi|224539132|ref|ZP_03679671.1| hypothetical protein BACCELL_04034 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519254|gb|EEF88359.1| hypothetical protein BACCELL_04034 [Bacteroides cellulosilyticus
DSM 14838]
Length = 641
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 151/390 (38%), Positives = 217/390 (55%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV+L PS + ++ ++ + + L+ SF+ AG AG K
Sbjct: 43 VESFDLKDVRLLPSRFRDNMMRDSV-WMTSIATNRLLHSFRNNAGV-FAGREGGYMTVKK 100
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A M+AST + K K ++V+ L+E Q +G+GYLSA
Sbjct: 101 LGGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYLSA 160
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
+P E +R VWAP+YT+HK+ +GL+DQY + DN QAL++ M ++ YN+++ +
Sbjct: 161 YPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALEVVTRMGDWAYNKLK-PL 219
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
+ + +R + E GG+N+ Y LY IT D ++ LA F + L Q DD+
Sbjct: 220 DEPTRKR---MIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDLGT 276
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP V+ YE+T D + FF + H +A G +S E + DP++L+
Sbjct: 277 KHTNTFIPKVLAEARNYELTQDNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQLS 336
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 337 KHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSYFL 395
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + TR +SFWCC G+G
Sbjct: 396 PLLSGSHKV-----YSTRENSFWCCVGSGF 420
>gi|423222645|ref|ZP_17209115.1| hypothetical protein HMPREF1062_01301 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392641932|gb|EIY35705.1| hypothetical protein HMPREF1062_01301 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 641
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 151/390 (38%), Positives = 217/390 (55%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV+L PS + ++ ++ + + L+ SF+ AG AG K
Sbjct: 43 VESFDLKDVRLLPSRFRDNMMRDSV-WMTSIATNRLLHSFRNNAGV-FAGREGGYMTIKK 100
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A M+AST + K K ++V+ L+E Q +G+GYLSA
Sbjct: 101 LGGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYLSA 160
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
+P E +R VWAP+YT+HK+ +GL+DQY + DN QAL++ M ++ YN+++ +
Sbjct: 161 YPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYTDNKQALEVVTRMGDWAYNKLK-PL 219
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
+ + +R + E GG+N+ Y LY IT D ++ LA F + L Q DD+
Sbjct: 220 DEPTRKR---MIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDLGT 276
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP V+ YE+T D + FF + H +A G +S E + DP++L+
Sbjct: 277 KHTNTFIPKVLAEARNYELTQDNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQLS 336
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 337 KHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSYFL 395
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + TR +SFWCC G+G
Sbjct: 396 PLLSGSHKV-----YSTRENSFWCCVGSGF 420
>gi|413926259|gb|AFW66191.1| hypothetical protein ZEAMMB73_605676 [Zea mays]
gi|413952505|gb|AFW85154.1| hypothetical protein ZEAMMB73_422486 [Zea mays]
Length = 250
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 111/136 (81%), Positives = 121/136 (88%)
Query: 348 MRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTY 407
MRYEVTGDPLYK +FFMD +N+SH YATGGTSAGEFW+DPKRLA TL TENEESCTTY
Sbjct: 1 MRYEVTGDPLYKQIASFFMDTINSSHSYATGGTSAGEFWTDPKRLAGTLSTENEESCTTY 60
Query: 408 NMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHG 467
NMLKVSR+LFRWTKE+ YADYYERAL NGVLSIQRGT+PGVMIYMLP G SKA SYHG
Sbjct: 61 NMLKVSRNLFRWTKEIAYADYYERALINGVLSIQRGTDPGVMIYMLPQAPGHSKAVSYHG 120
Query: 468 WGTRFSSFWCCYGTGL 483
WGT++ SFWCCYGTG+
Sbjct: 121 WGTKYDSFWCCYGTGI 136
>gi|293369447|ref|ZP_06616030.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292635445|gb|EFF53954.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 644
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 151/385 (39%), Positives = 213/385 (55%), Gaps = 20/385 (5%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KAYEGWE 158
L DV+L PS + + ++ +DV+ L+ SF+ AG AG K GWE
Sbjct: 50 LKDVRLLPSRFRDNMLRDS-AWMTSIDVNRLLHSFRTNAGV-FAGREGGYMTVKKLGGWE 107
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
CELRGH GH LSA M+A+T + K K ++V+ L E QN + +GYLSA+P E
Sbjct: 108 SLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEEL 167
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
+R K VWAP+YT+HK+ +GL+DQY +ADN +AL + M ++ YN+++ + S
Sbjct: 168 INRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALTIVTRMGDWAYNKLKPL----SE 223
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
E + E GG+N+ Y LY+IT D ++ LA F + L DD+ H NT
Sbjct: 224 ETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNT 283
Query: 339 HIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT 398
IP VI YE+T + + FF + H +A G +S E + DP++L+ L
Sbjct: 284 FIPKVIAEARNYELTRNETSRKLSEFFWHTMIDHHTFAPGCSSDKEHYFDPRKLSQHLTG 343
Query: 399 ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 458
E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y LPL G
Sbjct: 344 YTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSG 402
Query: 459 DSKAKSYHGWGTRFSSFWCCYGTGL 483
K + T+ +SFWCC G+G
Sbjct: 403 SHKL-----YSTKENSFWCCVGSGF 422
>gi|237712552|ref|ZP_04543033.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|229453873|gb|EEO59594.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
Length = 640
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 147/392 (37%), Positives = 215/392 (54%), Gaps = 24/392 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
+K L DV+L PS + ++ ++ ++VD L+ SF+ AG AG K
Sbjct: 42 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGV-FAGREGGYMTVKK 99
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA M+A+T + K K ++VS L+E QN +G+GYLSA
Sbjct: 100 LGGWESLDCELRGHTTGHLLSAYGLMYAATGSELFKHKGDSLVSGLAEVQNALGNGYLSA 159
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ--N 271
+P E +R VWAP+YT+HK+ +GL+DQY ++DN +AL++ M ++ Y++++ +
Sbjct: 160 YPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEIVTRMADWAYHKLKPLD 219
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+T+ + R+ E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 220 EVTRRKMIRN------EFGGINESFYNLYAITGDERYRWLARFFYHNEVIDPLKELRDDL 273
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
H NT IP V+ YE+T D + FF + H +A G +S E + DP
Sbjct: 274 GTKHTNTFIPKVLAEARNYELTEDEDSRKLSGFFWHTMIDRHTFAPGCSSDKEHYFDPDH 333
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
+ + E+C TYNMLK+S HLF WT + ADYYERAL N +L Q+ G++ Y
Sbjct: 334 FSKHISGYTGETCCTYNMLKLSSHLFCWTADAAVADYYERALYNHILG-QQDPHTGMVTY 392
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
LPL G K + T+ +SFWCC G+G
Sbjct: 393 FLPLLSGSHKV-----YSTKENSFWCCVGSGF 419
>gi|423295661|ref|ZP_17273788.1| hypothetical protein HMPREF1070_02453 [Bacteroides ovatus
CL03T12C18]
gi|392672370|gb|EIY65839.1| hypothetical protein HMPREF1070_02453 [Bacteroides ovatus
CL03T12C18]
Length = 644
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 151/385 (39%), Positives = 213/385 (55%), Gaps = 20/385 (5%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KAYEGWE 158
L DV+L PS + + ++ +DV+ L+ SF+ AG AG K GWE
Sbjct: 50 LKDVRLLPSRFRDNMLRDS-AWMTSIDVNRLLHSFRTNAGV-FAGREGGYMTVKKLGGWE 107
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
CELRGH GH LSA M+A+T + K K ++V+ L E QN + +GYLSA+P E
Sbjct: 108 SLDCELRGHTTGHMLSALGLMYAATGSEIFKLKGDSLVNGLEEVQNALKNGYLSAWPEEL 167
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
+R K VWAP+YT+HK+ +GL+DQY +ADN +AL + + ++ YN+++ + S
Sbjct: 168 INRNIQGKGVWAPWYTLHKLFSGLIDQYLYADNKKALTIVTRVGDWAYNKLKPL----SE 223
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
E + E GG+N+ Y LY+IT D ++ LA F + L DD+ H NT
Sbjct: 224 ETRKLMIRNEFGGINESFYNLYSITGDERYRWLAEYFYHNDVIDPLKELRDDLGTKHTNT 283
Query: 339 HIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT 398
IP VI YE+T + + FF + H +A G +S E + DPK+L+ L
Sbjct: 284 FIPKVIAEARNYELTRNETSRKLSEFFWHTMIDHHTFAPGCSSDKEHYFDPKKLSQHLTG 343
Query: 399 ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 458
E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y LPL G
Sbjct: 344 YTGETCCTYNMLKLSRHLFCWTGDSSIADYYERALYNHILG-QQDPETGMVAYFLPLLSG 402
Query: 459 DSKAKSYHGWGTRFSSFWCCYGTGL 483
K + T+ +SFWCC G+G
Sbjct: 403 SHKL-----YSTKENSFWCCVGSGF 422
>gi|424790951|ref|ZP_18217449.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
pv. graminis ART-Xtg29]
gi|422797791|gb|EKU25992.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
pv. graminis ART-Xtg29]
Length = 651
Score = 245 bits (626), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 137/365 (37%), Positives = 197/365 (53%), Gaps = 14/365 (3%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVG-HYLSASAH 178
+A+ + YL+ + D L+ +F+ AG + + GWE P CE+RGHF G HYLSA A
Sbjct: 74 QARDRDRRYLMSIPNDRLLHTFRLVAGLDSQAEPLGGWESPHCEIRGHFAGGHYLSACAL 133
Query: 179 MWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKI 238
++A+T + LK+K A+V+ L+ CQ GY+ A+PS +DR + VW P YT HKI
Sbjct: 134 LYAATGDAALKDKADALVAELARCQR--ADGYIGAYPSSFYDRLGRHEEVWVPIYTAHKI 191
Query: 239 LAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYR 298
LAG LD A N QAL+ + F + + + + + L E GG++ L
Sbjct: 192 LAGHLDMARHAGNAQALRT----AQRFADWLGAWMDGFDDAQWQRILGVEFGGVHASLLE 247
Query: 299 LYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLY 358
LY ++ D K+ A +++ L LA Q D ++G HANT IP ++ + YE+ G P
Sbjct: 248 LYLLSGDAKYQRWATRYEQASLLEPLAQQRDALAGLHANTQIPKIVAAARAYEIDGAPRQ 307
Query: 359 KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFR 418
+ FF V+ H Y TGG S E + P A L + E C +YNMLK++RHL+
Sbjct: 308 RQIAEFFWRTVSGHHAYCTGGVSDYEMFGKPDHFAGHLSGHSHECCCSYNMLKLTRHLYT 367
Query: 419 WTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCC 478
W + DYYER L N L Q E G+M+Y +P+ G K + T F+SFWCC
Sbjct: 368 WQPDAALMDYYERVLFNARLGTQ--DEAGMMMYFVPMDAGYWKL-----YNTPFASFWCC 420
Query: 479 YGTGL 483
GTG+
Sbjct: 421 TGTGV 425
>gi|423239921|ref|ZP_17221036.1| hypothetical protein HMPREF1065_01659 [Bacteroides dorei
CL03T12C01]
gi|392644910|gb|EIY38644.1| hypothetical protein HMPREF1065_01659 [Bacteroides dorei
CL03T12C01]
Length = 646
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 147/392 (37%), Positives = 214/392 (54%), Gaps = 24/392 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
+K L DV+L PS + ++ ++ ++VD L+ SF+ AG AG K
Sbjct: 48 VKSFDLKDVRLLPSRFRENMMRDSM-WMASIEVDRLLHSFRTNAGV-FAGREGGYMTVKK 105
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA M+A+T + K K ++VS L E QN +G+GYLSA
Sbjct: 106 LGGWESLDCELRGHTTGHLLSAYGLMYAATGSELFKHKGDSLVSGLVEVQNALGNGYLSA 165
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ--N 271
+P E +R VWAP+YT+HK+ +GL+DQY ++DN +AL++ M ++ Y++++ +
Sbjct: 166 YPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYSDNQKALEIVTRMADWAYHKLKPLD 225
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+T+ + R+ E GG+N+ Y LY IT D ++ LA F + L DD+
Sbjct: 226 EVTRRKMIRN------EFGGINESFYNLYAITGDERYRWLARFFYHNEVIDPLKELRDDL 279
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
H NT IP V+ YE+T D + FF + H +A G +S E + DP
Sbjct: 280 GTKHTNTFIPKVLAEARNYELTEDEDSRKLSGFFWHTMIDRHTFAPGCSSDKEHYFDPDH 339
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
+ + E+C TYNMLK+S HLF WT + ADYYERAL N +L Q+ G++ Y
Sbjct: 340 FSKHISGYTGETCCTYNMLKLSSHLFCWTADAAVADYYERALYNHILG-QQDPHTGMVTY 398
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
LPL G K + T+ +SFWCC G+G
Sbjct: 399 FLPLLSGSHKV-----YSTKENSFWCCVGSGF 425
>gi|270296104|ref|ZP_06202304.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|423303646|ref|ZP_17281645.1| hypothetical protein HMPREF1072_00585 [Bacteroides uniformis
CL03T00C23]
gi|423307631|ref|ZP_17285621.1| hypothetical protein HMPREF1073_00371 [Bacteroides uniformis
CL03T12C37]
gi|270273508|gb|EFA19370.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|392688010|gb|EIY81301.1| hypothetical protein HMPREF1072_00585 [Bacteroides uniformis
CL03T00C23]
gi|392689500|gb|EIY82777.1| hypothetical protein HMPREF1073_00371 [Bacteroides uniformis
CL03T12C37]
Length = 641
Score = 244 bits (623), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 150/390 (38%), Positives = 215/390 (55%), Gaps = 20/390 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KA 153
++ L DV+L PS + + ++ + + L+ F+ AG AG K
Sbjct: 43 VESFDLKDVRLLPSRFRDNMMRDS-AWMTSIATNRLLHGFRNNAGV-FAGREGGYMTVKK 100
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
GWE CELRGH GH LSA A M+AST + K K ++V+ L+E Q +G+GYLSA
Sbjct: 101 LGGWESLDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYLSA 160
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
+P E +R VWAP+YT+HK+ +GL+DQY +ADN AL++ M ++ YN+++ +
Sbjct: 161 YPEELINRNIRGTSVWAPWYTLHKLFSGLIDQYLYADNKPALEVVTRMGDWAYNKLK-PL 219
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
+ + +R + E GG+N+ Y LY IT D ++ LA F + L Q DD+
Sbjct: 220 DEATRKR---MIRNEFGGVNESFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDLGT 276
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H NT IP V+ YE+T D + FF + H +A G +S E + DP++L+
Sbjct: 277 KHTNTFIPKVLAEARNYELTQDNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQLS 336
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
L E+C TYNMLK+SRHLF WT + ADYYERAL N +L Q+ E G++ Y L
Sbjct: 337 KHLTGYTGETCCTYNMLKLSRHLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSYFL 395
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL G K + TR +SFWCC G+G
Sbjct: 396 PLLSGSHKV-----YSTRENSFWCCVGSGF 420
>gi|255075873|ref|XP_002501611.1| predicted protein [Micromonas sp. RCC299]
gi|226516875|gb|ACO62869.1| predicted protein [Micromonas sp. RCC299]
Length = 1214
Score = 244 bits (623), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 158/466 (33%), Positives = 228/466 (48%), Gaps = 80/466 (17%)
Query: 102 LKEVSLHDVKLDPSSL------HWRAQQTNLEYL-LMLDVDSLVWSFQKTAGSPT----- 149
L VSL + P+++ H AQ+ N YL ++D L+ +F+ AG P
Sbjct: 168 LSSVSLQPDAVPPANVLHGAGVHLDAQRLNARYLTAVVDPRRLLANFRVVAGLPPETIPD 227
Query: 150 --------------AGKAYEG-----WEDPTCELRGHFVGHYLSASAHMWASTHN----- 185
+G +Y WE P CELRGHF GHYLSA A + A +
Sbjct: 228 RHPTETVAPYCDVGSGLSYAEHPGACWEAPDCELRGHFAGHYLSALAFVAAGAGDRPNTS 287
Query: 186 ---------------VT-----------LKEKMTAVVSALSECQNKMG--SGYLSAFPSE 217
VT +E + V L+ Q G +GY+SAFP E
Sbjct: 288 PDRTSSSDHLSDPEYVTGHQSDVATARHAREMLDRFVDGLATAQASSGTSAGYVSAFPEE 347
Query: 218 QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYS 277
DR A+ WAPYYT+HKI GL+D + A N +AL + K + RV +I +
Sbjct: 348 VLDRQGAVGGAWAPYYTLHKIGQGLMDAHVVAGNAKALDVLKGLANAVLTRVMGLIQQRG 407
Query: 278 VERHW---------NSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQA 328
HW + E+GG N++ +RLY +T + ++ LA LFD P FLG +
Sbjct: 408 AS-HWFGGALEYSKAAFGAESGGFNELAWRLYQLTGNGDYVTLASLFDHPTFLGRMRAGG 466
Query: 329 DDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSD 388
D ++ HAN H P+ +G+ RYE+TGD + F++++ + YATGGT GE W
Sbjct: 467 DGLTREHANFHEPIAMGAYSRYEITGDTESRRAFRNFIELLRDTRSYATGGTCDGERWQA 526
Query: 389 PKRLASTL-GTENEESCTTYNMLKVSRHL---FRWTKEMVYADYYERALTNGVLSIQRGT 444
P RL + TE +E+CT N +++ F + +ADY ERA +G + +QR
Sbjct: 527 PGRLERIIVSTETQETCTQVNFERLANAAVASFGEAEARDWADYSERASLHGPVGLQR-- 584
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGLGFIRWIQ 490
+PG ++Y PLG G SK +S HGWG ++FWCCYGTG+ + +Q
Sbjct: 585 KPGELLYTTPLGVGVSKGRSGHGWGRPDAAFWCCYGTGVEALARLQ 630
>gi|433678837|ref|ZP_20510648.1| hypothetical protein BN444_02893 [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430816044|emb|CCP41169.1| hypothetical protein BN444_02893 [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 648
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/366 (37%), Positives = 198/366 (54%), Gaps = 16/366 (4%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVG-HYLSASAH 178
+A++ N YL+ + L+ +F+ AG + + GWE P CELRGHF G HYLSA A
Sbjct: 71 QARERNRRYLMSIPNARLLHNFRLVAGLSSDAEPLGGWESPKCELRGHFAGGHYLSACAL 130
Query: 179 MWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKI 238
++A+T + LK+K A+V+ L+ CQ + GYL A+P+ + R + VW P YT HKI
Sbjct: 131 LYAATSDAALKDKADALVAELARCQRQ--DGYLGAYPAAFYARLRRGEDVWVPLYTAHKI 188
Query: 239 LAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMNDVLY 297
LAG LD A N QAL+ + ++ + W L E GG+ + L
Sbjct: 189 LAGHLDMARHAGNAQALRSAQRFADWL-----GAWMDGCDDAQWQHILGVEFGGVQESLL 243
Query: 298 RLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPL 357
LY ++ DPK+ A + +P L LA Q D ++G HANT IP ++ + YE+ G+P
Sbjct: 244 ELYLLSGDPKYQRWAARYAQPALLEPLAQQRDALAGLHANTQIPKIVAAARAYEIGGEPR 303
Query: 358 YKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLF 417
+ FF V+ H Y TGGTS E + P A L + E C +YNMLK++RHL+
Sbjct: 304 QRDIAAFFWRTVSGHHAYCTGGTSDYELFGKPDHFAGRLSGHSHECCCSYNMLKLTRHLY 363
Query: 418 RWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWC 477
W + DYYER L N L Q E G+++Y +P+ G K + T F+SFWC
Sbjct: 364 TWQPDAALMDYYERVLFNARLGTQ--DEAGMLMYFVPMDAGYWKL-----YNTPFASFWC 416
Query: 478 CYGTGL 483
C GTG+
Sbjct: 417 CTGTGV 422
>gi|256394133|ref|YP_003115697.1| hypothetical protein Caci_4996 [Catenulispora acidiphila DSM 44928]
gi|256360359|gb|ACU73856.1| protein of unknown function DUF1680 [Catenulispora acidiphila DSM
44928]
Length = 846
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 154/365 (42%), Positives = 190/365 (52%), Gaps = 16/365 (4%)
Query: 126 LEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHN 185
L YL +D D L++ F+ T G T+ GWEDPT ELRGH GH +SA A +AST +
Sbjct: 84 LAYLRFVDPDRLLYMFRTTVGIATSASPCGGWEDPTEELRGHSTGHIMSALAQAYASTGD 143
Query: 186 VTLKEKMTAVVSALSECQNKMG-----SGYLSAFPSEQFDRFEALKPVWAPYYTIHKILA 240
TLK K VS+L+ CQ +GYLSAFP FDR E+ + VWAPYYTIHKI+A
Sbjct: 144 STLKSKGDYFVSSLAACQAASPAAGFHTGYLSAFPESFFDRLESGQSVWAPYYTIHKIMA 203
Query: 241 GLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLY 300
GLLDQY A NTQAL + K M + R + S + L E GGM +VL LY
Sbjct: 204 GLLDQYLVAGNTQALTVLKGMAAWVKTRTDPL----SHSQMQAVLQTEFGGMPEVLAHLY 259
Query: 301 TITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKV 360
+T D L A FD LA D ++GFHANT +P +IG+ Y TG Y
Sbjct: 260 QVTGDANTLTAAQRFDHAQIEDPLAAGTDQLAGFHANTQVPKIIGALREYLATGTARYLT 319
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL-FRW 419
F I H Y GG S GE++ P +AS L E C TYN LK+SR L F
Sbjct: 320 IAQNFWAITTGHHMYEIGGFSNGEYFQTPNAIASQLSNTTCEVCVTYNELKLSRGLFFTD 379
Query: 420 TKEMVYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCC 478
Y DYYER L N VL Q + G + Y PL G Y + ++ F C
Sbjct: 380 PTRAAYLDYYERGLFNTVLGQQDPASSHGFVCYYTPLQPG-----GYKTYSNDYNDFTCD 434
Query: 479 YGTGL 483
+GTG+
Sbjct: 435 HGTGM 439
>gi|395774802|ref|ZP_10455317.1| protein [Streptomyces acidiscabies 84-104]
Length = 818
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 148/374 (39%), Positives = 200/374 (53%), Gaps = 16/374 (4%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWA 181
Q+ N YL +D+D L+ +F+ G P+ + GWE P ELRGH GH LS A A
Sbjct: 43 QRRNTAYLRFVDLDRLLHTFRLNVGLPSTAQPCSGWEGPNVELRGHSTGHLLSGLALTHA 102
Query: 182 STHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQFDRFEALKPVWAPYYTIH 236
+T + L++K +V+AL+ECQ +GYLSAFP FDR EA VWAPYYT+H
Sbjct: 103 NTGDTELRDKGRRLVAALAECQAASPAAGFNAGYLSAFPESFFDRLEAGTGVWAPYYTLH 162
Query: 237 KILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVL 296
KI+AGL+DQY + N QAL + ++ R + S ER L+ E GGMNDVL
Sbjct: 163 KIMAGLVDQYRLSGNEQALDVVLRKGDWVDRRTAGL----SYERMQRVLDTEFGGMNDVL 218
Query: 297 YRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP 356
L+ IT D + L +A F LA D ++G HANT IP ++G+ +E D
Sbjct: 219 ADLHEITGDARWLAVAERFTHARVFDPLARGEDRLAGLHANTQIPKMVGALRMWEEGLDV 278
Query: 357 LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
Y+ G F IV H Y GG S GE + +P +A L E+C +YNMLK++R L
Sbjct: 279 RYRTIGENFWRIVTGHHTYVIGGNSNGEAFHEPDVIAGQLSDSTCENCNSYNMLKLTRLL 338
Query: 417 -FRWTKEMVYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGDSKAKSY-----HGWG 469
F DYYERAL N +L Q G+E G IY L G +K + +
Sbjct: 339 HFHAPGRTDLLDYYERALFNQMLGEQDPGSEHGYNIYYTGLAPGSAKRQPSFMSPEDAYS 398
Query: 470 TRFSSFWCCYGTGL 483
T +++F C +GTG+
Sbjct: 399 TDYTNFSCDHGTGM 412
>gi|330997549|ref|ZP_08321396.1| hypothetical protein HMPREF9442_02496 [Paraprevotella xylaniphila
YIT 11841]
gi|329570407|gb|EGG52138.1| hypothetical protein HMPREF9442_02496 [Paraprevotella xylaniphila
YIT 11841]
Length = 622
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 147/385 (38%), Positives = 209/385 (54%), Gaps = 20/385 (5%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KAYEGWE 158
L DV+L P + ++ +++ + VD L+ F+ TAG AG K GWE
Sbjct: 31 LQDVRLLPGRFRDNMMRDSV-WMVSIGVDRLLHGFRTTAGI-FAGREGGYMTVKKLGGWE 88
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
CELRGH GH+LSA + M+A+T + K K ++V+ L+E Q +G+GYLSAFP E
Sbjct: 89 SLDCELRGHTTGHFLSALSLMYAATGSEVFKLKGDSLVAGLAEVQVALGNGYLSAFPEEL 148
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
+R VWAP+YT+HKI +GL+DQY +A NTQAL++ + M ++ Y +++ + S
Sbjct: 149 INRNIRATSVWAPWYTLHKIFSGLIDQYLYAGNTQALEVVRKMGDWAYAKLKPL----SE 204
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
E + E GG+N+ Y LY +T D ++ LA F + L Q DD+ H NT
Sbjct: 205 ETRRKMIRNEFGGVNESFYNLYALTGDERYKWLAGFFYHNEVIDPLKAQKDDLGTKHTNT 264
Query: 339 HIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT 398
IP V+ YE+TGD K FF + H +A G +S E + + + +
Sbjct: 265 FIPKVLAEARNYELTGDADSKALSEFFWHTMIDRHTFAPGCSSDKEHYFPTDKFTAHISG 324
Query: 399 ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 458
E+C TYNMLK+SRHLF W ADYYERAL N +L Q+ G++ Y LPL G
Sbjct: 325 YTGETCCTYNMLKLSRHLFCWDASPEVADYYERALYNHILG-QQDPASGMVAYFLPLQTG 383
Query: 459 DSKAKSYHGWGTRFSSFWCCYGTGL 483
+ S T +SFWCC G+G
Sbjct: 384 THRVYS-----TPENSFWCCVGSGF 403
>gi|440732599|ref|ZP_20912422.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
DAR61454]
gi|440368630|gb|ELQ05659.1| Tat pathway signal sequence domain protein [Xanthomonas translucens
DAR61454]
Length = 652
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 139/366 (37%), Positives = 197/366 (53%), Gaps = 16/366 (4%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVG-HYLSASAH 178
+A++ N YL+ + L+ +F+ AG + + GWE P CELRGHF G HYLSA A
Sbjct: 75 QARERNRRYLMSIPNARLLHNFRLVAGLSSDAEPLGGWESPKCELRGHFAGGHYLSACAL 134
Query: 179 MWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKI 238
++A+T + LK+K A+V+ L+ CQ + GYL A+P+ + R + VW P YT HKI
Sbjct: 135 LYAATGDAALKDKADALVAELARCQRQ--DGYLGAYPAAFYARLRRGEDVWVPLYTAHKI 192
Query: 239 LAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMNDVLY 297
LAG LD A N QAL+ + ++ + W L E GG+ + L
Sbjct: 193 LAGHLDMARHAGNAQALRSAQRFADWL-----GAWMDGCDDAQWQHILGVEFGGVQESLL 247
Query: 298 RLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPL 357
LY ++ DPK+ A + +P L LA Q D ++G HANT IP ++ + YE+ DP
Sbjct: 248 ELYLLSGDPKYQRWAARYAQPALLEPLAQQRDALAGLHANTQIPKIVAAARAYEIGRDPR 307
Query: 358 YKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLF 417
+ FF V+ H Y TGGTS E + P A L + E C +YNMLK++RHL+
Sbjct: 308 QRDVAAFFWRTVSGHHAYCTGGTSDYELFGKPDHFAGRLSGHSHECCCSYNMLKLTRHLY 367
Query: 418 RWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWC 477
W + DYYER L N L Q E G+++Y +P+ G K + T F+SFWC
Sbjct: 368 TWQPDAALMDYYERVLFNARLGTQ--DEAGMLMYFVPMDAGYWKL-----YNTPFASFWC 420
Query: 478 CYGTGL 483
C GTG+
Sbjct: 421 CTGTGV 426
>gi|220928663|ref|YP_002505572.1| hypothetical protein Ccel_1236 [Clostridium cellulolyticum H10]
gi|110588920|gb|ABG76968.1| CBM22- and dockerin-containing enzyme [Clostridium cellulolyticum
H10]
gi|219998991|gb|ACL75592.1| protein of unknown function DUF1680 [Clostridium cellulolyticum
H10]
Length = 955
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 146/393 (37%), Positives = 206/393 (52%), Gaps = 21/393 (5%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWED 159
+ LK+ + VK+ + + A + YL +D + L+ F+KTAG T Y GWE+
Sbjct: 33 ELLKQFDMEQVKI-TDTYYVNALNKEVAYLQAIDPNRLLVGFKKTAGLSTTYSYYGGWEN 91
Query: 160 PTCELRGHFVGHYLSASAHMWASTH-----NVTLKEKMTAVVSALSECQNKMGSGYLSAF 214
T ++GH +GHY+SA A + +T N LK ++ ++S L CQNK G+GYL A
Sbjct: 92 NTL-IQGHTMGHYMSALAQAYKNTKSDPTVNADLKSRIDLIISELQACQNKNGNGYLFAT 150
Query: 215 PSEQFDRFE--ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNV 272
P+ QFD E A W P+YT+HKI++GLLD Y F N AL + + + Y RV
Sbjct: 151 PATQFDVVEGKASGSSWVPWYTMHKIMSGLLDIYKFGGNQTALTIATNLGNWIYKRVN-- 208
Query: 273 ITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDIS 332
+ L E GGMND LY LY +T + HL AH FD+ +A + +
Sbjct: 209 --AWDSATQSRVLGVEYGGMNDCLYELYKLTGNGNHLTAAHKFDENSLFNTIAAGTNVLP 266
Query: 333 GFHANTHIPVVIGSQMRYEVTG--DPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
G HANT IP IG+ RY G + Y F IV H Y TGG S E + D
Sbjct: 267 GKHANTTIPKFIGALNRYSTLGTSESSYLKAAQQFWAIVLKDHTYVTGGNSEDERFRDAG 326
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+L + N E+C NMLK+++ LF+ T ++ YADYYE AL N +++ Q E G+
Sbjct: 327 KLDAYRDNVNNETCNVNNMLKLTKELFKATGDVKYADYYENALINEIMASQN-PETGMAT 385
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y +G G K + ++F+ FWCC GTG+
Sbjct: 386 YFKAMGTGYFKV-----FSSQFNHFWCCTGTGM 413
>gi|116625830|ref|YP_827986.1| hypothetical protein Acid_6783 [Candidatus Solibacter usitatus
Ellin6076]
gi|116228992|gb|ABJ87701.1| protein of unknown function DUF1680 [Candidatus Solibacter usitatus
Ellin6076]
Length = 675
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 148/423 (34%), Positives = 218/423 (51%), Gaps = 29/423 (6%)
Query: 71 RKMLSETDEFSWTMIY-RKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYL 129
R + ET F + + RK+ P + + V+L P S + +Q+ N Y+
Sbjct: 38 RPLAPETPAFETPLEFTRKIVTPRA--------EPFPMPQVRLLPGSAYHDSQEWNRGYM 89
Query: 130 LMLDVDSLVWSFQKTAGSPT-AGKAYEGWEDP-----TCELRGHFVGHYLSASAHMWAST 183
L D L+ +F+ AG P + K GWE P + ELRGHF GH+LSASA + ++
Sbjct: 90 ERLAADRLLHTFRANAGLPVGSAKPLGGWEQPENGQRSSELRGHFAGHFLSASAQL-SAN 148
Query: 184 HNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLL 243
+ + K +V+ ++ CQ K+G YLSAFP+ +DR + VWAP+YTIHKI+AG+
Sbjct: 149 GDKNAQSKGDFMVAEMARCQQKLGGKYLSAFPTTWWDRLGKGERVWAPFYTIHKIMAGMF 208
Query: 244 DQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTIT 303
D Y+ A N QAL++ + M + + E L E GG+ + LYRL T
Sbjct: 209 DMYSLAGNQQALEVLEGMAAW----ADEWTAPKAAEHMQQILTIEFGGIAETLYRLAAAT 264
Query: 304 QDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGT 363
+ + F K FL LA + D++ G H NTHIP V+ + RY+++GD +
Sbjct: 265 DQDRWGRVGDRFQKKSFLNPLAARRDELRGLHVNTHIPQVMAAARRYDLSGDMRFHDVAD 324
Query: 364 FFMDIVNASHGYATGGTSAGEFW-SDPKRLAS--TLGTENEESCTTYNMLKVSRHLFRWT 420
+F V + Y TGGTS E W + P+RLA+ L E C YNMLK++RHL+ W
Sbjct: 325 YFFSEVAGARTYVTGGTSNAEAWLAPPRRLATELKLSVNTAECCCAYNMLKLARHLYSWD 384
Query: 421 KEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYG 480
+ Y DYYE L N + R + G+ Y L L G ++ + T +FWCC G
Sbjct: 385 PKPSYFDYYEHLLLNHRIGTIR-PKVGLTQYYLSLTPG-----AWKTFNTEDQTFWCCTG 438
Query: 481 TGL 483
+G+
Sbjct: 439 SGV 441
>gi|251795999|ref|YP_003010730.1| hypothetical protein Pjdr2_1987 [Paenibacillus sp. JDR-2]
gi|247543625|gb|ACT00644.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 626
Score = 241 bits (614), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 133/327 (40%), Positives = 182/327 (55%), Gaps = 12/327 (3%)
Query: 156 GWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP 215
GWE TCELRGH +GH+LSA+A ++A T + +K K +V L CQ G +L+AFP
Sbjct: 71 GWESVTCELRGHIMGHWLSAAAQIYAQTSDALVKAKADYIVEELVRCQEANGGEWLAAFP 130
Query: 216 SEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITK 275
R VWAP+YTIHK+L GL D Y A N QAL++ + + ++FY N
Sbjct: 131 ESYMHRIAKGSFVWAPHYTIHKLLMGLYDMYAIAGNEQALRVMRGIADWFYKWTGN---- 186
Query: 276 YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
+S E L+ ETGGM +V LY IT++ KHL L +D+ F L D ++ H
Sbjct: 187 FSQEEMDELLDLETGGMLEVWADLYGITKEDKHLNLVKRYDRRRFFDALLEGQDVLTNKH 246
Query: 336 ANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGY-ATGGTSAGEFWSDPKRLAS 394
ANT IP ++G+ +EVTG+ Y+ F + GY ATG GE W + S
Sbjct: 247 ANTQIPEILGAARAWEVTGEDRYRRIVEAFWRLAVTDRGYVATGAGDNGELWMPRGEMGS 306
Query: 395 TLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLP 454
LG +E C YNM++++ L RWT + YADY+ER NGVL+ Q G + G++ Y L
Sbjct: 307 RLGV-GQEHCCNYNMMRLAHVLLRWTGDPAYADYWERRFYNGVLAHQHG-DTGMISYFLG 364
Query: 455 LGRGDSKAKSYHGWGTRFSSFWCCYGT 481
+G G K+ WGT FWCC+GT
Sbjct: 365 MGAGSKKS-----WGTPTQHFWCCHGT 386
>gi|332663228|ref|YP_004446016.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332332042|gb|AEE49143.1| protein of unknown function DUF1680 [Haliscomenobacter hydrossis
DSM 1100]
Length = 791
Score = 240 bits (613), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 140/388 (36%), Positives = 209/388 (53%), Gaps = 23/388 (5%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L D++L P S + A + + YLL ++ D L+ F AG PT Y GWE L G
Sbjct: 50 LEDLRLLPGSAFYNAMEKDAAYLLKIESDRLLHRFYANAGLPTKAPVYGGWESEG--LSG 107
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-----QFDR 221
H +GHYLSA A M+A + + E++ +V L+ CQ +GY+ A P E Q R
Sbjct: 108 HTLGHYLSACALMYAGSKDEKYLERVNYLVQELARCQVARKTGYVGAIPKEDSIFAQVAR 167
Query: 222 FEA------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITK 275
+ L W+P+YTIHK++AGL D Y + +N QAL++ + M ++ +V+ K
Sbjct: 168 GDIRSSGFDLNGGWSPWYTIHKVMAGLADAYLYTNNDQALQVLRGMSDW----TASVVDK 223
Query: 276 YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
+ + L E GGMN++L +Y T + K+L L++ F + L+ + D + G H
Sbjct: 224 LNDPQRQKMLKCEYGGMNEILANVYAFTGEKKYLDLSYKFYDDFVMEPLSKKIDPLPGKH 283
Query: 336 ANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAST 395
+NT++P IGS +YE+TG+ + +FF + + +H Y GG S E+ D +L
Sbjct: 284 SNTNVPKAIGSARQYELTGNTRDQTIASFFWETMVHNHTYVIGGNSNYEYCGDAGKLNDR 343
Query: 396 LGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPL 455
L E+C TYNMLK++RHLF W ADYYERAL N +L+ Q E G+M Y +PL
Sbjct: 344 LSDNTCETCNTYNMLKLTRHLFCWQPSAELADYYERALYNHILASQH-PETGMMTYFVPL 402
Query: 456 GRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G K + F +F CC G+G+
Sbjct: 403 RMGSKKE-----FSNEFHTFTCCVGSGM 425
>gi|312621677|ref|YP_004023290.1| hypothetical protein Calkro_0576 [Caldicellulosiruptor
kronotskyensis 2002]
gi|312202144|gb|ADQ45471.1| protein of unknown function DUF1680 [Caldicellulosiruptor
kronotskyensis 2002]
Length = 588
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/372 (34%), Positives = 214/372 (57%), Gaps = 15/372 (4%)
Query: 115 SSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG----SPTAGKAYEGWEDPTCELRGHFVG 170
S +R + N Y+L L ++L+ +F +G S + GWE PTC+LRGHF+G
Sbjct: 18 ESEFYRRFEINRNYMLSLKTENLLQNFYLESGLVSWSFLPQDIHGGWESPTCQLRGHFLG 77
Query: 171 HYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWA 230
H+LSA+A ++A+ + +K K +++ L +CQ + G ++ + P + F+ K VWA
Sbjct: 78 HWLSAAAKIYANFGDEEIKGKADYIINELEKCQRENGGEWVGSIPEKYFEWMARGKYVWA 137
Query: 231 PYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETG 290
P+YT+HK GL+D Y +A N +AL++ +FY ++S E+ + L+ ETG
Sbjct: 138 PHYTVHKTFMGLVDMYKYASNQKALEIADKWANWFYRWS----GQFSREKMDDILDYETG 193
Query: 291 GMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRY 350
GM ++ LY IT+D K+ L + + L + D ++G HANT IP + G+ +
Sbjct: 194 GMLEIWAELYDITKDSKYKDLMERYYRGRLFDRLLMGEDVLTGKHANTTIPEIHGAARVW 253
Query: 351 EVTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
E+TG+ + K+ +++ + V+ + TGG + GE W+ +++ + LGT N+E C YNM
Sbjct: 254 EITGEEKFRKIVESYWKEAVDERGYFCTGGQTLGEVWTPKQKIKNYLGTTNQEHCVVYNM 313
Query: 410 LKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWG 469
++++ LFRWT + Y+DY ER + NG+ + QR + G++ Y LPL G K WG
Sbjct: 314 IRLAEFLFRWTGDKRYSDYIERNIYNGLFAQQR-LKDGMVTYYLPLMPGSQKR-----WG 367
Query: 470 TRFSSFWCCYGT 481
T + FWCC+GT
Sbjct: 368 TPTNDFWCCHGT 379
>gi|376260258|ref|YP_005146978.1| hypothetical protein [Clostridium sp. BNL1100]
gi|373944252|gb|AEY65173.1| hypothetical protein Clo1100_0916 [Clostridium sp. BNL1100]
Length = 952
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 145/391 (37%), Positives = 202/391 (51%), Gaps = 21/391 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
LK+ + VK+ + + A + YL +D + L+ F+K AG T Y GWE+ T
Sbjct: 35 LKQFDMEQVKI-TDAYYVNAFNKEVAYLRAIDPNRLLVGFKKAAGLSTTYSYYGGWENNT 93
Query: 162 CELRGHFVGHYLSASAHMWASTH-----NVTLKEKMTAVVSALSECQNKMGSGYLSAFPS 216
++GH +GHY+SA A + +T N LK ++ ++S L CQNK G+GYL A P
Sbjct: 94 L-IQGHTMGHYMSALAQAYKNTKSDATVNADLKSRIDLIISELQACQNKNGNGYLFATPV 152
Query: 217 EQFDRFE--ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
QFD E A W P+YT+HKI++GLLD Y F N AL + + + Y RV
Sbjct: 153 TQFDVVEGKASGSSWVPWYTMHKIMSGLLDVYKFEGNQTALTIATNLGNWIYKRVN---- 208
Query: 275 KYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
+ L E GGMND LY LY +T + HL AH FD+ +A + + G
Sbjct: 209 AWDSATQSKVLGVEYGGMNDCLYELYKLTGNSNHLTAAHKFDETSLFNTIAAGTNVLPGK 268
Query: 335 HANTHIPVVIGSQMRYEVTG--DPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRL 392
HANT IP IG+ RY G + Y F +IV H Y TGG S E + +L
Sbjct: 269 HANTTIPKFIGALNRYRTLGTTESSYLTAAQQFWNIVLKDHTYVTGGNSEDEHFRAAGKL 328
Query: 393 ASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYM 452
+ N E+C NMLK++R LF+ T ++ YADYYE AL N +++ Q E G+ Y
Sbjct: 329 DAYRDNVNNETCNVNNMLKLTRELFKVTGDVKYADYYENALINEIMASQN-PETGMATYF 387
Query: 453 LPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+G G K + ++F FWCC GTG+
Sbjct: 388 KAMGTGYFKV-----FSSQFDHFWCCTGTGM 413
>gi|374324035|ref|YP_005077164.1| hypothetical protein HPL003_21035 [Paenibacillus terrae HPL-003]
gi|357203044|gb|AET60941.1| hypothetical protein HPL003_21035 [Paenibacillus terrae HPL-003]
Length = 767
Score = 238 bits (607), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 138/394 (35%), Positives = 208/394 (52%), Gaps = 20/394 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDP 160
+KE HDV+L+ S A L+Y+ +D D ++++F+ TA T G + GW+ P
Sbjct: 191 VKEFKGHDVRLEKESEFGAAMDRFLQYVRSVDDDQMLYNFRATAAVDTKGAQPMTGWDAP 250
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM------GSGYLSAF 214
C L+GH GHYLSA A + +T + L K+ +V+ L +CQ + G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYNATEDSALLGKIQYMVAELGKCQTALSEQAGYGRGFLSAY 310
Query: 215 PSEQFDRFE---ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQN 271
EQF+ E +WAPYYT+HKI+AGLLD Y A +AL++ + + +NR+
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALEICDKLGHWLHNRLSR 370
Query: 272 VITKYSVERHWN-SLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+ + + + W+ + E GGMN+VL +LY IT +L+ A FD + D
Sbjct: 371 -LPREQLHKMWSLYIAGEFGGMNEVLAKLYAITSHEHYLITAKYFDNEKLFLPMKENVDT 429
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
+ HAN HIP VIG+ +EV G+ Y F +V H Y+ GG E + +P
Sbjct: 430 LGNMHANQHIPQVIGALKLFEVAGEKAYFKIAENFWTMVTQRHIYSIGGAGETEMFREPD 489
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP-GVM 449
+A L + E+C +YNMLK+++ LF++ Y DYYE+AL N +L+ + + G
Sbjct: 490 AIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEGGS 549
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y +PL G K H CC+GTGL
Sbjct: 550 TYFMPLAPGSIKKFDTH-------ENTCCHGTGL 576
>gi|374372949|ref|ZP_09630610.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
gi|373235025|gb|EHP54817.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
Length = 653
Score = 238 bits (606), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 150/390 (38%), Positives = 204/390 (52%), Gaps = 24/390 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-------KAY 154
L EV L D + + L R Q +LL + + SL+ SF AG A K Y
Sbjct: 57 LSEVKLLDSRFKENML--REQH----WLLAISLKSLLHSFYTNAGMYDANEGGYDEIKKY 110
Query: 155 EGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMG-SGYLSA 213
GWE CELRGH GH LS A M+AST K K ++ AL+ Q + +GY+SA
Sbjct: 111 AGWESMDCELRGHSTGHILSGLALMYASTGEQIYKSKGDTIIKALAAIQKTLNQNGYISA 170
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP E +R + VWAP+YT+HKILAG+LDQY + +N QAL + K + Y ++ +
Sbjct: 171 FPQEFINRNIRGEKVWAPWYTLHKILAGVLDQYLYCNNDQALDIAKNFSAWAYKKLHPL- 229
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
+ + L E GGMN+V + LY IT D K L + F L L D++ G
Sbjct: 230 ---TAGQRTLMLRNEFGGMNEVFFNLYAITGDEKDKWLGNFFYDNRMLDPLKAGIDNLKG 286
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
HANT+IP ++G YE+ G+ FF V H +ATG S E + P ++
Sbjct: 287 AHANTYIPKLLGVTRDYEIEGNAGGDAVVRFFWQRVTTHHSFATGSNSDREHFFQPDAIS 346
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
+ L ESC YNMLK++RHL+ + + YADYYE+AL N +L Q+ G++ Y L
Sbjct: 347 THLTGYTGESCNVYNMLKLTRHLYIHSGNVKYADYYEKALFNHILG-QQDPATGMIAYFL 405
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
P+ G K + T SSFWCC GTG
Sbjct: 406 PMLPGAHKV-----YSTPDSSFWCCVGTGF 430
>gi|333382563|ref|ZP_08474231.1| hypothetical protein HMPREF9455_02397 [Dysgonomonas gadei ATCC
BAA-286]
gi|332828505|gb|EGK01205.1| hypothetical protein HMPREF9455_02397 [Dysgonomonas gadei ATCC
BAA-286]
Length = 644
Score = 237 bits (605), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 148/389 (38%), Positives = 207/389 (53%), Gaps = 24/389 (6%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KAYEGWE 158
L DV+L S ++ + +++L L VD L+ SF+ TAG AG K GWE
Sbjct: 46 LKDVRLLDSPFRQNMERES-KWILSLGVDRLLHSFRNTAGV-YAGREGGYMTIKKLGGWE 103
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM----GSGYLSAF 214
CELRGH +GH +S A+++AST + K K ++V+ L+E Q+ + GY+SA+
Sbjct: 104 SLDCELRGHSIGHIMSGLAYLYASTGDERYKIKADSLVAGLAEVQDILIENGQKGYISAY 163
Query: 215 PSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
P +R A K VWAP+YT+HK+ AGL+DQY + DN +AL + K + Y ++ +
Sbjct: 164 PENLINRNIAGKSVWAPWYTLHKVYAGLIDQYLYCDNKEALDIMKEAASWAYQKLMPL-- 221
Query: 275 KYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
S E+ L E GG+N+ Y LY IT +P+H A F + LA D+
Sbjct: 222 --SEEQRALMLRNEFGGVNEAFYNLYAITGNPEHKKSAEFFYHADVIDPLAEHKADLYFK 279
Query: 335 HANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAS 394
HANT IP VIG YE+ K FF + V Y TGG S E + ++
Sbjct: 280 HANTFIPKVIGEARNYELHNSERSKDIANFFWNTVIDHQTYCTGGNSHKEKFIHSDSISK 339
Query: 395 TLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLP 454
L +E+C T NMLK++RHLF W YADYYERAL N +L Q+ + G++ Y LP
Sbjct: 340 NLTGYTQETCNTNNMLKLTRHLFCWDANAKYADYYERALYNHILG-QQDPQSGMVAYFLP 398
Query: 455 LGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G K + T +SFWCC GTG
Sbjct: 399 MLPGAHKV-----YSTPENSFWCCVGTGF 422
>gi|325836901|ref|ZP_08166283.1| hypothetical protein HMPREF9402_1694 [Turicibacter sp. HGF1]
gi|325491107|gb|EGC93399.1| hypothetical protein HMPREF9402_1694 [Turicibacter sp. HGF1]
Length = 763
Score = 237 bits (605), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 147/384 (38%), Positives = 204/384 (53%), Gaps = 27/384 (7%)
Query: 110 VKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFV 169
V+L+ SL +Q +YLL LDV+ L+ + A +Y GWE + E++GH +
Sbjct: 6 VRLEKDSLFEISQAVGKQYLLDLDVERLLAPIYEGASQIPPKPSYGGWE--SLEIKGHSI 63
Query: 170 GHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAF---PSEQF------- 219
GHYLSA A M+ +T ++ LKE+M ++ S Q GYL F P EQ
Sbjct: 64 GHYLSALACMYEATKDLELKERMDYIIETFSLLQR--ADGYLGGFLSTPFEQVFTGEFHV 121
Query: 220 DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVE 279
D F +L W P+Y+IHKI AGL+D Y N +AL + K + ++ Y + + S E
Sbjct: 122 DHF-SLSHYWVPWYSIHKIYAGLMDAYQIGKNVEALNILKKLADWAYEGSRLM----SDE 176
Query: 280 RHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTH 339
+ L E GGMN+V+ LY ITQD ++L LA F + + LA DD+ G HANT
Sbjct: 177 QFQRMLICEYGGMNEVMAELYEITQDERYLYLAKRFTQHLIMDPLAAGVDDLQGRHANTQ 236
Query: 340 IPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTE 399
IP V+G+ YEVTGD Y FF + V Y GG S+GE + L E
Sbjct: 237 IPKVLGAAKLYEVTGDDYYYRVAKFFFETVVLHRSYVIGGNSSGEHFGPSD--TEPLSRE 294
Query: 400 NEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGD 459
E+C TYNM+K++++LF+WTK+ Y D+ ERA N +L+ Q G IY G
Sbjct: 295 AAETCNTYNMIKLAKYLFKWTKDSKYIDFIERATYNHILASQ-DPHTGCKIYFTSNYPGH 353
Query: 460 SKAKSYHGWGTRFSSFWCCYGTGL 483
K +GT+ SFWCC GTG+
Sbjct: 354 FKV-----YGTKEDSFWCCTGTGM 372
>gi|345011855|ref|YP_004814209.1| hypothetical protein [Streptomyces violaceusniger Tu 4113]
gi|344038204|gb|AEM83929.1| protein of unknown function DUF1680 [Streptomyces violaceusniger Tu
4113]
Length = 849
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 147/375 (39%), Positives = 199/375 (53%), Gaps = 17/375 (4%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWA 181
Q N YL +D+D L+ +F+ G +A + GWE PT ELRGH GH LS A +A
Sbjct: 72 QSRNTAYLRFVDIDRLLHTFRLNVGLSSAAQPCGGWESPTTELRGHSTGHLLSGLALTYA 131
Query: 182 STHNVTLKEKMTAVVSALSECQNKM-----GSGYLSAFPSEQFDRFEALKPVWAPYYTIH 236
+T + ++K A+VSAL+ CQ + G GYLSAFP FDR EA VWAPYYTIH
Sbjct: 132 ATGDTAPRDKGRALVSALAACQARSPAAGYGQGYLSAFPESFFDRLEAGTGVWAPYYTIH 191
Query: 237 KILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVL 296
KI+AGL+DQY A N +AL+ + R K S ++ L E GGMNDVL
Sbjct: 192 KIMAGLVDQYRLAGNAEALQTVLRQAAWVDTRTG----KLSYDQMQRVLQTEFGGMNDVL 247
Query: 297 YRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP 356
L+ IT D + L +A F LA D ++G HANT IP ++G+ +E D
Sbjct: 248 ADLHEITGDSRWLKVAERFTHARVFDPLARNEDRLAGLHANTQIPKMVGAMRLWEEGLDS 307
Query: 357 LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
Y+ G F IV H Y GG S GE + +P +A+ L E+C +YNMLK++R +
Sbjct: 308 RYRTIGENFWKIVTDHHTYVIGGNSNGEAFHEPDAIAAQLSDNACENCNSYNMLKLTRLI 367
Query: 417 -FRWTKEMVYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGDSKAK-SYHG-----W 468
F + DYYER L N +L Q + G IY L G K + S+ G +
Sbjct: 368 HFHAPERTDLLDYYERTLLNQMLGEQDPDSAHGFNIYYTGLAPGSFKQQPSFMGTDPNQY 427
Query: 469 GTRFSSFWCCYGTGL 483
T + +F C +G+G+
Sbjct: 428 STDYDNFSCDHGSGM 442
>gi|336425130|ref|ZP_08605160.1| hypothetical protein HMPREF0994_01166 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013039|gb|EGN42928.1| hypothetical protein HMPREF0994_01166 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 628
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 134/371 (36%), Positives = 207/371 (55%), Gaps = 26/371 (7%)
Query: 128 YLLMLDVDSLVWSFQKTAGSPTAGKAYEG----WEDPTCELRGHFVGHYLSASAHMWAST 183
Y++ L+ L+ +F +G T+ +A EG WE PTC+LRGHF+GH+LSA+A + +T
Sbjct: 32 YMMHLENRFLLLNFNLESGRDTSAEAIEGMHGGWEFPTCQLRGHFLGHWLSAAAMHYHAT 91
Query: 184 HNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLL 243
+ LK K +V L+ECQ + G + + P + R K VWAP+YTIHK+ GLL
Sbjct: 92 GDRELKAKADTLVEELAECQKENGGKWAAPIPEKYLYRIAEGKQVWAPHYTIHKVFMGLL 151
Query: 244 DQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTIT 303
D Y +A N AL++ + ++FY+ ++ +S + + L+ ETGGM ++ +LY IT
Sbjct: 152 DMYEYAGNAIALEIAENFADWFYDWTKD----FSRDEMDDILDFETGGMLEIWVQLYAIT 207
Query: 304 QDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGT 363
K+ L + + L D ++ HANT IP +IG Y+VTGD ++
Sbjct: 208 GKDKYAALMERYYRGRLFDPLLKGEDVLTNMHANTTIPEIIGCARAYDVTGDEKWRKIAE 267
Query: 364 FFMDIVNASHG-YATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKE 422
+ D+ G YATGG + GE WS K+L + LG + +E CT YNM++++ LFRW+ +
Sbjct: 268 NYWDLAVTQRGQYATGGQTCGEIWSPKKKLGARLGLKGQEHCTVYNMIRLAGFLFRWSLD 327
Query: 423 MVYADYYERALTNGVLS-------IQRG-TEP----GVMIYMLPLGRGDSKAKSYHGWGT 470
Y DY E+ L NG+++ + G T P G++ Y LP+ G K GW +
Sbjct: 328 PAYLDYQEKLLYNGLMAQAYWQSNLSHGFTSPYPSKGLLTYFLPMQAGGRK-----GWSS 382
Query: 471 RFSSFWCCYGT 481
+ F+CC+GT
Sbjct: 383 KTGDFFCCHGT 393
>gi|332880745|ref|ZP_08448418.1| hypothetical protein HMPREF9074_04196 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357045883|ref|ZP_09107513.1| hypothetical protein HMPREF9441_01526 [Paraprevotella clara YIT
11840]
gi|332681379|gb|EGJ54303.1| hypothetical protein HMPREF9074_04196 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355530889|gb|EHH00292.1| hypothetical protein HMPREF9441_01526 [Paraprevotella clara YIT
11840]
Length = 618
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 150/402 (37%), Positives = 200/402 (49%), Gaps = 28/402 (6%)
Query: 98 AGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGW 157
A + L HDV+L S + R + N +L L+ D L+ +F+ AG P+ K EGW
Sbjct: 29 ATEMLLPFPSHDVELASSWVKQR-EDLNTAFLRSLEPDRLLHNFRVNAGLPSVAKPLEGW 87
Query: 158 EDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE 217
E P LRGHFVGHYLSA + + + L + VV + CQ G+GYLSAFP
Sbjct: 88 ESPGVGLRGHFVGHYLSAVSALVERYEDAGLARNLEKVVEGMYACQQAHGNGYLSAFPET 147
Query: 218 QFDRFEA-LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNV---- 272
+ E VWAPYYT+HKI+ GLLD Y N +A M + + Y R+ +
Sbjct: 148 DIEVLETRFTGVWAPYYTLHKIMQGLLDVYLRTGNEKAYAMVEGLAGYVDRRMSKLDPAT 207
Query: 273 ITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDIS 332
+ + N N E GGMN+VLY+LY ++ P++L LA LFD FL L D +S
Sbjct: 208 VARMMYTADANPQN-EMGGMNEVLYQLYCVSGKPRYLELASLFDPSWFLEPLVRNEDILS 266
Query: 333 GFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA---------- 382
G HANTHI +V G RYE TG+ Y + F +++ H Y G +S
Sbjct: 267 GLHANTHIALVNGFARRYESTGEECYGKSVANFWNMLMHFHAYVNGTSSGPRPNVTTETS 326
Query: 383 --GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSI 440
E W +P L +TL ESC T+N +++ LF WT YAD Y N VL +
Sbjct: 327 LTAEHWGEPCHLCNTLTKGIAESCVTHNTQRLNASLFSWTGNPCYADVYMNMFYNAVLPV 386
Query: 441 Q-RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
Q R T G +Y LPLG KA + F CC G+
Sbjct: 387 QSRST--GAYVYHLPLGSPRHKAYMAD------NDFKCCSGS 420
>gi|293375008|ref|ZP_06621302.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
gi|292646370|gb|EFF64386.1| conserved hypothetical protein [Turicibacter sanguinis PC909]
Length = 763
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 147/384 (38%), Positives = 204/384 (53%), Gaps = 27/384 (7%)
Query: 110 VKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFV 169
V+L+ SL +Q +YLL LDV+ L+ + A +Y GWE + E++GH +
Sbjct: 6 VRLEKDSLFEISQAVGKQYLLDLDVERLLAPIYEGASQIPPKPSYGGWE--SLEIKGHSI 63
Query: 170 GHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAF---PSEQF------- 219
GHYLSA M+ +T ++ LKE+M ++ S Q GYL F P EQ
Sbjct: 64 GHYLSALTCMYEATKDLELKERMDYIIETFSLLQR--ADGYLGGFLSTPFEQVFTGEFHV 121
Query: 220 DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVE 279
D F +L W P+Y+IHKI AGL+D Y N +AL + K + ++ Y + + S E
Sbjct: 122 DHF-SLSHYWVPWYSIHKIYAGLMDAYQIGKNVEALNILKKLADWAYEGSRLM----SDE 176
Query: 280 RHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTH 339
+ L E GGMN+V+ LY ITQD ++L LA F + + LA DD+ G HANT
Sbjct: 177 QFQRMLICEYGGMNEVMAELYEITQDERYLYLAKRFTQHLIMDPLAAGVDDLQGRHANTQ 236
Query: 340 IPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTE 399
IP V+G+ YEVTGD Y FF + V Y GG S+GE + A L E
Sbjct: 237 IPKVLGAAKLYEVTGDDYYYRVAKFFFETVVLHRSYVIGGNSSGEHFGPSDTEA--LSRE 294
Query: 400 NEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGD 459
E+C TYNM+K++++LF+WTK+ Y D+ ERA N +L+ Q G IY G
Sbjct: 295 AAETCNTYNMIKLAKYLFKWTKDSKYIDFIERATYNHILASQ-DPHTGCKIYFTSNYPGH 353
Query: 460 SKAKSYHGWGTRFSSFWCCYGTGL 483
K +GT+ SFWCC GTG+
Sbjct: 354 FKV-----YGTKEDSFWCCTGTGM 372
>gi|390456441|ref|ZP_10241969.1| hypothetical protein PpeoK3_20683 [Paenibacillus peoriae KCTC 3763]
Length = 759
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 139/393 (35%), Positives = 211/393 (53%), Gaps = 20/393 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPT-AGKAYEGWEDP 160
L ++S V L+ SL AQ L++LL ++ D ++++F+K AG T A GW+
Sbjct: 185 LHDISTQKVHLEGPSLLKTAQNRRLQFLLTVNDDQMLYNFRKAAGLDTLNAPAMIGWDSD 244
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQ------NKMGSGYLSAF 214
L+GH GHYLSA A +AST N +++K+ ++ L++ Q ++ G+LSA+
Sbjct: 245 DSLLKGHTTGHYLSALALCYASTGNERIRQKLAYLIDELNKVQLAFEADDRYHYGFLSAY 304
Query: 215 PSEQFDRFEALK---PVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQN 271
EQFD E +WAPYYT+HKI AGLLD Y A AL + + ++ YNR+ +
Sbjct: 305 SEEQFDLLEVYTRYPEIWAPYYTLHKIFAGLLDSYHIAGIELALVIADKVGDWIYNRL-S 363
Query: 272 VITKYSVERHWN-SLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
V+ + +++ W + E GG+N+ L LYT TQ H+ A LFD + D
Sbjct: 364 VLPQEQLKKMWGLYIAGEYGGINESLAELYTYTQKEHHIAAAKLFDNDRLFFPMEQHVDA 423
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
+ G HAN HIP ++G+ +E TG+ Y FF + V +H Y+ GGT GE + P
Sbjct: 424 LGGMHANQHIPQIVGAFKIFEATGEQKYYDIAKFFWESVVNAHIYSIGGTGEGEMFKQPY 483
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
++ + L E+C +YNMLK+++ L+ + ++ Y DYYER + N +LS G
Sbjct: 484 QIGAHLTEHTAETCASYNMLKLTKQLYVYENDVKYMDYYERTMINHILSSTDHECLGAST 543
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y +P G K G+ S CC+GTGL
Sbjct: 544 YFMPTSSGGQK-----GYDEENS---CCHGTGL 568
>gi|251798261|ref|YP_003012992.1| hypothetical protein Pjdr2_4282 [Paenibacillus sp. JDR-2]
gi|247545887|gb|ACT02906.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 758
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 146/384 (38%), Positives = 200/384 (52%), Gaps = 17/384 (4%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L L+ VKL S A Q L+YL DVD L+ F++T+G Y GWE+
Sbjct: 10 LNHFELNRVKL-YSEYQTNAFQKELDYLRSYDVDRLLAGFRETSGLQPKADKYPGWEN-- 66
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
E+RGH +GHYL+A + +A T + L EK+ +V+ L+E Q + +GYLSAFP FD
Sbjct: 67 TEIRGHTLGHYLTAVSQAYAQTQDSGLLEKLKYLVAELAEAQQE--NGYLSAFPETLFDN 124
Query: 222 FEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERH 281
E KP W P+YT+HKI+AGL+ Y QA ++ + ++ +R +S E
Sbjct: 125 VENRKPAWVPWYTMHKIIAGLIAVYQATKLQQAYEVVSRLGDWVADRA----CSWSEELQ 180
Query: 282 WNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIP 341
L E GGMND +Y LY +T + HL AH FD+ L D + G HANT IP
Sbjct: 181 ATVLAVEYGGMNDCMYDLYKLTGNNLHLEAAHKFDEISLFEALREGKDVLKGKHANTMIP 240
Query: 342 VVIGSQMRYEVTGDPL--YKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTE 399
IG+ RY G+ Y F D V H Y TGG S E + +P L
Sbjct: 241 KFIGALNRYLTLGESERGYLEAAVNFWDTVVYHHSYLTGGNSECEHFGEPDILDGKRSDV 300
Query: 400 NEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGD 459
E+C +YNMLK+++ LF+ T+ YAD+YER N +LS Q E G+ +Y P+ G
Sbjct: 301 TCETCNSYNMLKLTKELFKLTQNSKYADFYERTYINAILSSQ-NPETGMTMYFQPMATGY 359
Query: 460 SKAKSYHGWGTRFSSFWCCYGTGL 483
K + + F FWCC GTG+
Sbjct: 360 FKI-----YSSPFEHFWCCTGTGM 378
>gi|332880466|ref|ZP_08448140.1| hypothetical protein HMPREF9074_03915 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357046164|ref|ZP_09107794.1| hypothetical protein HMPREF9441_01810 [Paraprevotella clara YIT
11840]
gi|332681454|gb|EGJ54377.1| hypothetical protein HMPREF9074_03915 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355531170|gb|EHH00573.1| hypothetical protein HMPREF9441_01810 [Paraprevotella clara YIT
11840]
Length = 641
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 145/385 (37%), Positives = 202/385 (52%), Gaps = 20/385 (5%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--------KAYEGWE 158
L DV+L P + + +++ + D L+ F+ TAG AG K GWE
Sbjct: 47 LQDVRLLPGRFRDNMMRDS-AWMVSIGADRLLHGFRTTAGV-FAGREGGYMTVKKLGGWE 104
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
CELRGH GH LSA A M+A+T + K K ++V+ L+E Q GYLSA+P E
Sbjct: 105 SLDCELRGHTTGHVLSALALMYAATGSDVFKMKGDSLVAGLAEVQAAGTGGYLSAYPEEL 164
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
+R + VWAP+YT+HK+ +GL+DQY +A N QAL + + M ++ Y +++ +
Sbjct: 165 INRNIRGESVWAPWYTLHKLFSGLIDQYLYARNAQALDVVRKMGDWAYGKLRPL----PE 220
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
E + E GG+N+ Y LY +T D ++ LA F + L Q DD+ H NT
Sbjct: 221 EMRRKMIRNEFGGINESFYNLYALTGDERYRWLAGFFYHNDVIDPLKEQRDDLGTKHTNT 280
Query: 339 HIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT 398
IP V+ YE+TGD K FF + H +A G +S E + DP + +
Sbjct: 281 FIPKVLAEARNYELTGDGDSKALSEFFWHTMIGRHTFAPGCSSDKEHYFDPDEFSKHISG 340
Query: 399 ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 458
E+C TYNMLK+SRHLF W ADYYERAL N +L Q+ G++ Y LPL G
Sbjct: 341 YTGETCCTYNMLKLSRHLFCWEASPEVADYYERALYNHILG-QQDPATGMVSYFLPLQSG 399
Query: 459 DSKAKSYHGWGTRFSSFWCCYGTGL 483
K S T +SFWCC G+G
Sbjct: 400 THKVYS-----TPENSFWCCVGSGF 419
>gi|312135764|ref|YP_004003102.1| hypothetical protein Calow_1766 [Caldicellulosiruptor owensensis
OL]
gi|311775815|gb|ADQ05302.1| protein of unknown function DUF1680 [Caldicellulosiruptor
owensensis OL]
Length = 587
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 129/372 (34%), Positives = 209/372 (56%), Gaps = 15/372 (4%)
Query: 115 SSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG----SPTAGKAYEGWEDPTCELRGHFVG 170
S +++ + N Y+L L ++L+ +F +G S + GWE PTC+LRGHF+G
Sbjct: 18 DSEYYKRFKLNRSYMLSLKTENLLQNFYLESGIMSWSFLPQDIHGGWESPTCQLRGHFLG 77
Query: 171 HYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWA 230
H+LSA+A ++A+ + +K K +V L CQ + G ++ + P + F+ K VWA
Sbjct: 78 HWLSAAARIYANFGDEEIKGKADYIVDELERCQKENGGEWVGSIPEKYFEWMARGKWVWA 137
Query: 231 PYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETG 290
P+YT+HK GL+D Y + N +AL++ +FY ++S E+ + L+ ETG
Sbjct: 138 PHYTVHKTFMGLVDMYKYTSNQKALEIVDRWANWFYRWS----GQFSREKMDDILDYETG 193
Query: 291 GMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRY 350
GM ++ LY IT+D K+ L + + L D ++G HANT IP + G+ +
Sbjct: 194 GMLEIWAELYNITKDIKYRDLMERYYRGRLFDRLLNGEDVLTGRHANTTIPEIHGAARVW 253
Query: 351 EVTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
EVTG+ + K+ +++ + V + TGG + GE W+ +++ + LG N+E C YNM
Sbjct: 254 EVTGEEKFRKIVESYWREAVEERGYFCTGGQTLGEVWTPKQKIKNYLGPTNQEHCVVYNM 313
Query: 410 LKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWG 469
++++ LFRWT + Y+DY ER + NG+ + QR + G++ Y LPL G K WG
Sbjct: 314 IRLAEFLFRWTGDKRYSDYIERNIYNGLFAQQR-LKDGMVTYFLPLMPGSQKR-----WG 367
Query: 470 TRFSSFWCCYGT 481
T + FWCC+GT
Sbjct: 368 TPTNDFWCCHGT 379
>gi|29827685|ref|NP_822319.1| protein [Streptomyces avermitilis MA-4680]
gi|29604785|dbj|BAC68854.1| putative secreted protein [Streptomyces avermitilis MA-4680]
Length = 854
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 147/375 (39%), Positives = 196/375 (52%), Gaps = 17/375 (4%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWA 181
Q+ N YL +D+D L+ +F+ G P+ + GWE P ELRGH GH LS A A
Sbjct: 77 QRRNSAYLRFVDIDRLLHTFRTNVGLPSDAEPCGGWEGPGVELRGHSTGHLLSGLALAHA 136
Query: 182 STHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQFDRFEALKPVWAPYYTIH 236
ST L++K +V+AL+ECQ+ G+GYLSAFP FDR EA VWAPYYTIH
Sbjct: 137 STGEEALRDKGRRLVAALAECQSAAPAAGFGTGYLSAFPESFFDRLEAGSGVWAPYYTIH 196
Query: 237 KILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVL 296
KI+AGL++QY QAL++ + R K S E+ L E GGMNDVL
Sbjct: 197 KIMAGLVEQYRLVGVGQALEVVLRQARWVDERT----AKLSYEQMQRVLETEFGGMNDVL 252
Query: 297 YRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP 356
L+ +T DP+ L +A F LA D ++G HANT IP ++G+ +E
Sbjct: 253 ADLHALTGDPRWLDVAERFTHARVFDPLAGNQDKLAGLHANTQIPKMVGALRLWEEGRAD 312
Query: 357 LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
Y+ F IV H Y GG S GE + +P +A L E+C +YNMLK++R L
Sbjct: 313 RYRTVAENFWQIVTDHHTYVIGGNSNGEAFHEPDVIAGQLSDNTCENCNSYNMLKLTRLL 372
Query: 417 -FRWTKEMVYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGDSKAK-SYHG-----W 468
F DYYER L N +L Q +E G IY L G K + S+ G +
Sbjct: 373 HFHAPDRTDLLDYYERTLLNQMLGEQDPDSEHGFAIYYTGLAPGSFKRQPSFMGPDPDVY 432
Query: 469 GTRFSSFWCCYGTGL 483
T + +F C +GTG+
Sbjct: 433 STDYDNFSCDHGTGM 447
>gi|345851934|ref|ZP_08804893.1| secreted protein [Streptomyces zinciresistens K42]
gi|345636594|gb|EGX58142.1| secreted protein [Streptomyces zinciresistens K42]
Length = 867
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 156/397 (39%), Positives = 206/397 (51%), Gaps = 22/397 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L L +V+L S ++T+ YLL +D D L+ +F+ TAG P++ + GWE P
Sbjct: 63 LDAFGLSEVRLLESPFLANMRRTS-AYLLFVDADRLLHTFRLTAGLPSSAQPCGGWEAPD 121
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS-----GYLSAFPS 216
+LRGH GH LSA A A T EK A+V+AL+ECQ + GYLSAFP
Sbjct: 122 VQLRGHTTGHLLSALAQAHAHTGERAYAEKGRALVAALAECQRAAPAAGFTRGYLSAFPE 181
Query: 217 EQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKY 276
F R EA WAPYYT+HKI+AGLLDQY A + QAL + + M + R +
Sbjct: 182 SVFARLEAGGKPWAPYYTLHKIMAGLLDQYLLAGDRQALDVLREMAAWAEARTAPL---- 237
Query: 277 SVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHA 336
+ N L E GGMNDVL RLY T DP HL A FD LA D+++G HA
Sbjct: 238 PYPQMQNVLRVEFGGMNDVLMRLYLETGDPAHLRTARRFDHEDLYAPLAAGRDELAGRHA 297
Query: 337 NTHIPVVIGSQMRYEVTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAST 395
NT I ++G+ YE TGD Y + TF+ +V H YA GG S E + P + S
Sbjct: 298 NTEIAKIVGTVPSYEATGDTRYLDIADTFWTTVVR-HHSYAIGGNSNQELFGPPDEIVSR 356
Query: 396 LGTENEESCTTYNMLKVSRHLFRWTKEMV-YADYYERALTNGVLSIQR-GTEPGVMIYML 453
L E+C +YNMLK+ R LF + Y D+YE L N +L Q + G + Y
Sbjct: 357 LSDVTCENCNSYNMLKLGRGLFLHRPDRAGYMDHYEWTLYNQMLGEQDPASAHGFVTYYT 416
Query: 454 PLGRGDSKAKSYHGWGTR-------FSSFWCCYGTGL 483
L G S+ + G G+ + +F C +GTGL
Sbjct: 417 GLWAG-SRREPKAGLGSAPGSYSSDYDNFSCDHGTGL 452
>gi|383779543|ref|YP_005464109.1| hypothetical protein AMIS_43730 [Actinoplanes missouriensis 431]
gi|381372775|dbj|BAL89593.1| hypothetical protein AMIS_43730 [Actinoplanes missouriensis 431]
Length = 799
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 146/383 (38%), Positives = 203/383 (53%), Gaps = 17/383 (4%)
Query: 110 VKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFV 169
V+L PS +T + YL +D+D ++ F+ TAG P+A + GWE PT +LRGH
Sbjct: 46 VRLLPSRFLDNMNRT-VAYLRFVDLDRMLHMFRVTAGLPSAAEPLGGWEAPTVQLRGHTT 104
Query: 170 GHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVW 229
GH LS A + LK + A+V L CQ +GYLSAFP FD+ EA K W
Sbjct: 105 GHLLSGLAQAAYHLDDRDLKARSAALVDGLKACQAP--NGYLSAFPETIFDQLEAGKNPW 162
Query: 230 APYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
APYYTIHKI AGLLDQ+ NT AL + + M ++ +RV +K + E+ L+ E
Sbjct: 163 APYYTIHKIFAGLLDQHRLLGNTTALDVARRMADWVGSRV----SKLTREQMQKVLHVEF 218
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GGMN+ LY +T + HL LA FD L+ + D ++G HANT IP V+G+
Sbjct: 219 GGMNESFVNLYRVTGEAAHLELARAFDHDEIFVPLSEKRDTLAGRHANTDIPKVVGAAAM 278
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
Y+ TG ++ T+F D V H Y GG S EF+ P ++ S LG E+C TYNM
Sbjct: 279 YQATGSDYHRTIATYFWDQVVRHHSYVIGGNSNAEFFGPPGQVVSQLGENTCENCNTYNM 338
Query: 410 LKVSRHLFRWTKEMV-YADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGDSKAKSYHG 467
LK++ L+ Y DY+E AL N +L Q + G + Y L S+ K G
Sbjct: 339 LKLTERLYAIDPSRTDYLDYHEWALINQMLGEQDPDSAHGNVTYYTGLSSTASR-KGKEG 397
Query: 468 -------WGTRFSSFWCCYGTGL 483
+ + + +F C +G+GL
Sbjct: 398 LVSDPGSYSSDYGNFSCDHGSGL 420
>gi|427384529|ref|ZP_18881034.1| hypothetical protein HMPREF9447_02067 [Bacteroides oleiciplenus YIT
12058]
gi|425727790|gb|EKU90649.1| hypothetical protein HMPREF9447_02067 [Bacteroides oleiciplenus YIT
12058]
Length = 777
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 141/383 (36%), Positives = 200/383 (52%), Gaps = 40/383 (10%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A++ YLL L+ D + F+ AG YEGWE + + G +GHY+SA A +
Sbjct: 51 AEEKEATYLLELEPDRFLSGFRSEAGLVPKAPKYEGWE--SLGVAGQTLGHYMSACAMYY 108
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP---------------SEQFDRFEAL 225
A++ + +K+ +++ L CQ G+GYL+A P S+ FD L
Sbjct: 109 ATSGDERFLQKLEYIINELDSCQQANGNGYLAATPGGKKIFAEVSAGNIYSQGFD----L 164
Query: 226 KPVWAPYYTIHKILAGLLDQYTFADNTQAL----KMTKWMVEYFYNRVQNVITKYSVERH 281
W P Y +HK+LAGL+D Y +A + QAL K+ WM FY+ ++ + K
Sbjct: 165 NGGWVPLYVMHKVLAGLIDAYQYARSEQALRIAEKLADWMYGTFYHLTEDQMQKV----- 219
Query: 282 WNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDK-PCFLGLLAVQADDISGFHANTHI 340
L E GGMN+ L LY T++ K LLLA FD + LA+ DD+ G HANT +
Sbjct: 220 ---LACEFGGMNEALANLYAYTKNDKFLLLAQRFDNHKAIMDSLAIGVDDLEGKHANTQV 276
Query: 341 PVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN 400
P +IG+ YE+TG +FF V +H Y GG S GE + P++L L T N
Sbjct: 277 PKMIGAARLYELTGSKRDSSIASFFWHTVVDNHSYVNGGNSDGEHFGTPRKLNERLSTSN 336
Query: 401 EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
E+C TYNMLK++RHLF W Y+ YYERA+ N +L+ Q + G+ Y PL G
Sbjct: 337 TETCNTYNMLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGK 395
Query: 461 KAKSYHGWGTRFSSFWCCYGTGL 483
K G+ + F SF CC G+G+
Sbjct: 396 K-----GYLSPFQSFCCCSGSGM 413
>gi|379720404|ref|YP_005312535.1| hypothetical protein PM3016_2500 [Paenibacillus mucilaginosus 3016]
gi|378569076|gb|AFC29386.1| hypothetical protein PM3016_2500 [Paenibacillus mucilaginosus 3016]
Length = 749
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 147/392 (37%), Positives = 206/392 (52%), Gaps = 32/392 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
LH V+++ L A + N YLL L+ D L+ F++ AG YEGWE + + G
Sbjct: 8 LHKVRIESGPLK-HAMELNASYLLNLEADRLLSRFREYAGLEPKAPHYEGWE--SRGISG 64
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--SEQFDRFEA 224
H +GHYLS A M+AST L ++ VV L +CQ GSG++S P E F +A
Sbjct: 65 HTLGHYLSGCALMYASTGREELLSRVNYVVEELQQCQRADGSGFISGIPRGKELFQEVKA 124
Query: 225 ---------LKPVWAPYYTIHKILAGLLDQYTFADNTQAL----KMTKWMVEYFYNRVQN 271
L W P YT+HK+ AGL D Y A + +AL K+ W+ +
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLAGSRKALEIEIKLGLWL--------DD 176
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
V + S E+ L+ E GGMN+VL L + D + L LA F LG +A + D +
Sbjct: 177 VFSGLSHEQVQRVLHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAERKDTL 236
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
G HANT IP +IG+ +YEVTG+ Y FF D V H Y GG S E + +P +
Sbjct: 237 GGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHFGEPDK 296
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
L LG E+C TYNMLK++RHLF+W YADYYERA+ N +L+ Q+ + G + Y
Sbjct: 297 LNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILASQQPVD-GRVCY 355
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ L G K+ + +++ F CC G+G+
Sbjct: 356 FVSLEMGGHKS-----FNSQYEDFTCCVGSGM 382
>gi|21218915|ref|NP_624694.1| hypothetical protein SCO0371 [Streptomyces coelicolor A3(2)]
gi|5881940|emb|CAB55733.1| putative secreted protein [Streptomyces coelicolor A3(2)]
Length = 869
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 159/408 (38%), Positives = 206/408 (50%), Gaps = 26/408 (6%)
Query: 93 DGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK 152
+G G L+ L V+L S ++T YL +D D L+ +F+ G P+A +
Sbjct: 57 NGAHRPGPLLEPFPLSAVRLLDSPFLANMRRT-CAYLRFVDPDRLLHTFRLNVGLPSAAE 115
Query: 153 AYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS---- 208
GWE P +LRGH GH LSA A A T +K +VSAL+ECQ +
Sbjct: 116 PCGGWEAPDVQLRGHTTGHLLSALAQAHAGTGETAYADKARLLVSALAECQRAAPAAGFH 175
Query: 209 -GYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYN 267
GYLSAFP FD+ EA WAPYYT+HKI+AGLLDQY + N +A + M +
Sbjct: 176 RGYLSAFPESVFDQLEAGGKPWAPYYTLHKIMAGLLDQYRLSGNREAFDVLLEMAAWTEA 235
Query: 268 RVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQ 327
R + S ER + L E GGMNDVL RL+ T DP HL A FD LA
Sbjct: 236 RTAPL----SRERMQSVLKVEFGGMNDVLARLHLETGDPVHLRTARRFDHDELYAPLAAG 291
Query: 328 ADDISGFHANTHIPVVIGSQMRYEVTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFW 386
D+++G HANT I V+G+ YE TGD Y + TF+ +V H YA GG S E +
Sbjct: 292 RDELAGRHANTEIAKVVGAVPAYEATGDRRYLDIADTFWTTVVR-HHSYAIGGNSNQELF 350
Query: 387 SDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMV-YADYYERALTNGVLSIQR-GT 444
P +AS L E+C +YNMLK+ R LFR E Y D+YE L N +L+ Q +
Sbjct: 351 GPPDEIASRLSEVTCENCNSYNMLKLGRDLFRHDPERTEYLDHYEWTLYNQMLAEQDPDS 410
Query: 445 EPGVMIYML---------PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G + Y P G S SY G + +F C +GTGL
Sbjct: 411 AHGFVTYYTGLWAGSRREPKGGLGSAPGSYSG---DYDNFSCDHGTGL 455
>gi|300785310|ref|YP_003765601.1| hypothetical protein AMED_3413 [Amycolatopsis mediterranei U32]
gi|384148599|ref|YP_005531415.1| hypothetical protein RAM_17360 [Amycolatopsis mediterranei S699]
gi|399537193|ref|YP_006549855.1| hypothetical protein AMES_3374 [Amycolatopsis mediterranei S699]
gi|299794824|gb|ADJ45199.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340526753|gb|AEK41958.1| hypothetical protein RAM_17360 [Amycolatopsis mediterranei S699]
gi|398317963|gb|AFO76910.1| hypothetical protein AMES_3374 [Amycolatopsis mediterranei S699]
Length = 740
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 147/364 (40%), Positives = 189/364 (51%), Gaps = 15/364 (4%)
Query: 126 LEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHN 185
L Y +D D L+ +F+ AG ++ + GWE P ELRGH GH LS A +A+T +
Sbjct: 68 LAYFRFVDADRLLHTFRLNAGLASSAQPCGGWESPGTELRGHSTGHLLSGLAQAYANTGD 127
Query: 186 VTLKEKMTAVVSALSECQ-----NKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILA 240
K K +V+AL+ CQ +GYLSAFP FDR E+ + VWAPYYT+HKI+A
Sbjct: 128 TAHKTKGDYLVNALAACQAAAPGRGFHAGYLSAFPENFFDRLESGQSVWAPYYTLHKIMA 187
Query: 241 GLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLY 300
GLLDQY A N QAL + + R + SV + +L E GGM +VL LY
Sbjct: 188 GLLDQYLLAGNQQALDVLLRKAAWTKTRTDPL----SVTQMQAALRTEFGGMPEVLTNLY 243
Query: 301 TITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKV 360
+T D HL A FD L LA D +SGFHANT IP ++G+ Y TG Y+
Sbjct: 244 QVTGDANHLATAQRFDHAQILDPLAANQDRLSGFHANTQIPKILGAIREYHATGTTRYRD 303
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWT 420
F IV H Y GG S GE++ P +AS L E C TYNMLK++R LF
Sbjct: 304 IAVNFWRIVLDHHTYVIGGNSDGEYFQAPDAIASQLSDTTCEVCNTYNMLKLTRQLFFTN 363
Query: 421 KEMVYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCY 479
Y DYYE AL N +L Q + G + Y PL G K + + F C +
Sbjct: 364 PAPEYMDYYELALFNQILGEQDPDSSHGFVTYYTPLRAGGIKT-----YANDYDDFTCDH 418
Query: 480 GTGL 483
GTG+
Sbjct: 419 GTGM 422
>gi|289773961|ref|ZP_06533339.1| secreted protein [Streptomyces lividans TK24]
gi|289704160|gb|EFD71589.1| secreted protein [Streptomyces lividans TK24]
Length = 854
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 159/408 (38%), Positives = 206/408 (50%), Gaps = 26/408 (6%)
Query: 93 DGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK 152
+G G L+ L V+L S ++T YL +D D L+ +F+ G P+A +
Sbjct: 42 NGAHRPGPLLEPFPLSAVRLLDSPFLANMRRT-CAYLRFVDPDRLLHTFRLNVGLPSAAE 100
Query: 153 AYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS---- 208
GWE P +LRGH GH LSA A A T +K +VSAL+ECQ +
Sbjct: 101 PCGGWEAPDVQLRGHTTGHLLSALAQAHAGTGETAYADKARLLVSALAECQRAAPAAGFH 160
Query: 209 -GYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYN 267
GYLSAFP FD+ EA WAPYYT+HKI+AGLLDQY + N +A + M +
Sbjct: 161 RGYLSAFPESVFDQLEAGGKPWAPYYTLHKIMAGLLDQYRLSGNREAFDVLLEMAAWTEA 220
Query: 268 RVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQ 327
R + S ER + L E GGMNDVL RL+ T DP HL A FD LA
Sbjct: 221 RTAPL----SRERMQSVLKVEFGGMNDVLARLHLETGDPVHLRTARRFDHDELYAPLAAG 276
Query: 328 ADDISGFHANTHIPVVIGSQMRYEVTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFW 386
D+++G HANT I V+G+ YE TGD Y + TF+ +V H YA GG S E +
Sbjct: 277 RDELAGRHANTEIAKVVGAVPAYEATGDRRYLDIADTFWTTVVR-HHSYAIGGNSNQELF 335
Query: 387 SDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMV-YADYYERALTNGVLSIQR-GT 444
P +AS L E+C +YNMLK+ R LFR E Y D+YE L N +L+ Q +
Sbjct: 336 GPPDEIASRLSEVTCENCNSYNMLKLGRDLFRHDPERTEYLDHYEWTLYNQMLAEQDPDS 395
Query: 445 EPGVMIYML---------PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G + Y P G S SY G + +F C +GTGL
Sbjct: 396 AHGFVTYYTGLWAGSRREPKGGLGSAPGSYSG---DYDNFSCDHGTGL 440
>gi|375148455|ref|YP_005010896.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361062501|gb|AEW01493.1| protein of unknown function DUF1680 [Niastella koreensis GR20-10]
Length = 786
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 148/389 (38%), Positives = 209/389 (53%), Gaps = 25/389 (6%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
+L DV+L +A + ++ YL +++ D L+ F++ AG G+ Y GWE L
Sbjct: 46 NLQDVQLLDGPFK-KAMEADVRYLQVIEPDRLLADFREHAGLKPKGEHYGGWEH--SGLA 102
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ------- 218
GH +GHYLSA A +A++H+ K+ +V L+ECQ K +GY+ A P E
Sbjct: 103 GHTLGHYLSACAMHYAASHDKQFLGKVNYIVDELAECQPKR-NGYVGAIPKEDSMWAEVE 161
Query: 219 ----FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
R L W+P+YT+HKI+AGLLD Y + DN +AL + M ++ + ++N +
Sbjct: 162 KGNIHSRGFDLNGAWSPWYTVHKIMAGLLDAYLYCDNKKALAVETGMADWTAHLLRN-LP 220
Query: 275 KYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
S++R L E GGMNDVL Y +T + K+L L++ F L LA+Q D + G
Sbjct: 221 DSSLQR---MLFCEYGGMNDVLNNTYALTGEKKYLDLSYKFHDKRILDSLALQKDILPGK 277
Query: 335 HANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAS 394
H+NT IP VIG RYE+T K G FF V H YA GG S E+ +L
Sbjct: 278 HSNTQIPKVIGCIRRYELTAGEKDKTIGDFFWQTVVNDHTYAPGGNSNYEYLGPAGQLNE 337
Query: 395 TLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLP 454
TL E+C TYNMLK++RHLF DYYERAL N +LS Q + G+M Y +P
Sbjct: 338 TLTDNTMETCNTYNMLKLTRHLFALQPTASLMDYYERALYNHILSSQDHST-GMMCYFVP 396
Query: 455 LGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
L G K + F++F CC G+G+
Sbjct: 397 LRMGTQKE-----FSDSFNTFTCCVGSGM 420
>gi|337746495|ref|YP_004640657.1| hypothetical protein KNP414_02226 [Paenibacillus mucilaginosus
KNP414]
gi|336297684|gb|AEI40787.1| hypothetical protein KNP414_02226 [Paenibacillus mucilaginosus
KNP414]
Length = 749
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 147/392 (37%), Positives = 205/392 (52%), Gaps = 32/392 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
LH V+++ L A + N YLL L+ D L+ F++ AG YEGWE + + G
Sbjct: 8 LHKVRIESGPLK-HAMELNASYLLNLEADRLLSRFREYAGLAPKAPHYEGWE--SRGISG 64
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--SEQFDRFEA 224
H +GHYLS A M+AST L ++ VV L +CQ GSG++S P E F +A
Sbjct: 65 HTLGHYLSGCALMYASTGREELLSRVNYVVEELQQCQRADGSGFISGIPRGKELFQEVKA 124
Query: 225 ---------LKPVWAPYYTIHKILAGLLDQYTFADNTQAL----KMTKWMVEYFYNRVQN 271
L W P YT+HK+ AGL D Y A + +AL K+ W+ +
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLAGSRKALEIEIKLGLWL--------DD 176
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
V + S E+ L+ E GGMN+VL L + D + L LA F LG +A + D +
Sbjct: 177 VFSGLSHEQVQRVLHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAERKDTL 236
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
G HANT IP +IG+ +YEVTG+ Y FF D V H Y GG S E + +P +
Sbjct: 237 GGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHFGEPDK 296
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
L LG E+C TYNMLK++RHLF+W YADYYERA+ N +L Q+ + G + Y
Sbjct: 297 LNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILGSQQPVD-GRVCY 355
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ L G K+ + +++ F CC G+G+
Sbjct: 356 FVSLEMGGHKS-----FNSQYEDFTCCVGSGM 382
>gi|386723005|ref|YP_006189331.1| hypothetical protein B2K_12670 [Paenibacillus mucilaginosus K02]
gi|384090130|gb|AFH61566.1| hypothetical protein B2K_12670 [Paenibacillus mucilaginosus K02]
Length = 749
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 146/392 (37%), Positives = 206/392 (52%), Gaps = 32/392 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
LH V+++ L A + N YLL L+ D L+ F++ AG YEGWE + + G
Sbjct: 8 LHKVRIESGPLK-HAMELNASYLLNLEADRLLSRFREYAGLAPKAPHYEGWE--SRGISG 64
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--SEQFDRFEA 224
H +GHYLS A M+AST L ++ VV L +CQ GSG++S P E F+ +A
Sbjct: 65 HTLGHYLSGCALMYASTGREELLSRVNYVVEELEQCQRADGSGFISGIPRGKELFEEVKA 124
Query: 225 ---------LKPVWAPYYTIHKILAGLLDQYTFADNTQAL----KMTKWMVEYFYNRVQN 271
L W P YT+HK+ AGL D Y + +AL K+ W+ +
Sbjct: 125 GDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLTGSRKALEIEIKLGLWL--------DD 176
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
V + S E+ L+ E GGMN+VL L + D + L LA F LG +A + D +
Sbjct: 177 VFSGLSHEQVQRVLHCEFGGMNEVLTDLAVHSGDDRFLKLAERFWHGEVLGDIAERKDTL 236
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
G HANT IP +IG+ +YEVTG+ Y FF D V H Y GG S E + +P +
Sbjct: 237 GGRHANTQIPKIIGAARQYEVTGEERYAGISRFFWDRVVNHHSYVIGGNSYNEHFGEPDK 296
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
L LG E+C TYNMLK++RHLF+W YADYYERA+ N +L+ Q+ + G + Y
Sbjct: 297 LNDRLGEGTCETCNTYNMLKLTRHLFQWDALAAYADYYERAMFNHILASQQPVD-GRVCY 355
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ L G K+ + +++ F CC G+G+
Sbjct: 356 FVSLEMGGHKS-----FNSQYEDFTCCVGSGM 382
>gi|302872476|ref|YP_003841112.1| hypothetical protein COB47_1852 [Caldicellulosiruptor obsidiansis
OB47]
gi|302575335|gb|ADL43126.1| protein of unknown function DUF1680 [Caldicellulosiruptor
obsidiansis OB47]
Length = 587
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 130/375 (34%), Positives = 207/375 (55%), Gaps = 15/375 (4%)
Query: 112 LDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG----SPTAGKAYEGWEDPTCELRGH 167
L S ++ + + Y+ L ++L+ +F +G S + GWE PTC+LRGH
Sbjct: 15 LHSDSEYYNRFKLDRNYIASLKTENLLQNFYLESGIMSWSFLPQDIHGGWESPTCQLRGH 74
Query: 168 FVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKP 227
F+GH+LSA+A ++AS + +K K +V L CQ + G ++ + P + F+ K
Sbjct: 75 FLGHWLSAAARIYASFGDEEIKGKADYIVDELERCQKENGGEWVGSIPEKYFEWMARGKW 134
Query: 228 VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNE 287
VWAP+YT+HK GL+D Y + N +AL++ +FY ++S E+ + L+
Sbjct: 135 VWAPHYTVHKTFMGLVDMYKYTSNQKALEIADRWANWFYRWS----GQFSREKMDDILDY 190
Query: 288 ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQ 347
ETGGM ++ LY IT+D K+ L + + L D ++G HANT IP + G+
Sbjct: 191 ETGGMLEIWAELYNITKDSKYKELMERYYRGRLFDRLLNGEDVLTGRHANTTIPEIHGAA 250
Query: 348 MRYEVTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTT 406
+EVTG+ + K+ +++ + V + TGG + GE W+ R+ + LG N+E C
Sbjct: 251 RVWEVTGEEKFRKIVESYWREAVEERGYFCTGGQTLGEVWTPKHRIRNYLGPTNQEHCVV 310
Query: 407 YNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYH 466
YNM++++ LFRWT + Y+DY ER + NG+ + QR + G++ Y LPL G K
Sbjct: 311 YNMIRLAEFLFRWTGDKKYSDYIERNIYNGLFAQQR-LKDGMVTYFLPLMPGSQKR---- 365
Query: 467 GWGTRFSSFWCCYGT 481
WGT + FWCC+GT
Sbjct: 366 -WGTPTNDFWCCHGT 379
>gi|393783247|ref|ZP_10371422.1| hypothetical protein HMPREF1071_02290 [Bacteroides salyersiae
CL02T12C01]
gi|392669526|gb|EIY63014.1| hypothetical protein HMPREF1071_02290 [Bacteroides salyersiae
CL02T12C01]
Length = 1022
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 134/368 (36%), Positives = 191/368 (51%), Gaps = 33/368 (8%)
Query: 132 LDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEK 191
L D + F AG PT G Y GWE+ + G GHY+SA + ++A+T +K +
Sbjct: 79 LKPDRFLHRFHANAGLPTKGTIYGGWEN--TDQSGFSFGHYISALSMLYATTGEEDIKIR 136
Query: 192 MTAVVSALSECQNKMGSGYLSAFPSEQF-----------DRFEALKPVWAPYYTIHKILA 240
+ +S L CQ+K G+GY+ A P+E R L VW P+Y +HK+ +
Sbjct: 137 LDYCISELKRCQDKRGTGYVGAIPNEDKLWDDVSKGIIDGRNFNLNNVWVPWYNLHKLWS 196
Query: 241 GLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHW-NSLNEETGGMNDV 295
GL+D Y F +N A + +T W + F K E W N L E GGMND
Sbjct: 197 GLIDAYIFGENETAKTIVIALTDWACDKF---------KDLTEEQWQNILTCEHGGMNDA 247
Query: 296 LYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGD 355
LY +Y IT D +HL +A+ F L L+ + ++++G HANT IP VIG YE+TG+
Sbjct: 248 LYNVYAITGDTRHLEIANKFYHKKVLDPLSKRKNELAGLHANTQIPKVIGISRSYELTGN 307
Query: 356 PLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRH 415
+ ++F V H Y GG S E + +P +L+ L + E+C TYNMLK++RH
Sbjct: 308 QDHHTISSYFWHTVTHEHSYCIGGNSNYEHFVEPGKLSGELSNKTTETCNTYNMLKLTRH 367
Query: 416 LFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSF 475
LF W D+YERAL N +L+ Q E G++ Y +PL A S + ++F
Sbjct: 368 LFAWNPSAELMDFYERALYNHILASQN-PETGMVCYCVPLA-----ANSQKNYCNAENNF 421
Query: 476 WCCYGTGL 483
WCC GTG
Sbjct: 422 WCCVGTGF 429
>gi|329849035|ref|ZP_08264063.1| hypothetical protein ABI_21080 [Asticcacaulis biprosthecum C19]
gi|328844098|gb|EGF93667.1| hypothetical protein ABI_21080 [Asticcacaulis biprosthecum C19]
Length = 773
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 147/397 (37%), Positives = 201/397 (50%), Gaps = 34/397 (8%)
Query: 107 LHDVKLDPSSLHWR-AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
L V L PS WR A N YLL L+ D L+ +F K+AG G Y GWE+ +
Sbjct: 35 LEAVTLMPSV--WRDAVDANGHYLLSLEPDRLLHNFHKSAGLAPKGDIYGGWEN--MGIA 90
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEAL 225
GH +GHYL+A +A T + K K+ VS ++ Q G GY+ E+ + +
Sbjct: 91 GHSLGHYLTALGLAYAQTRDPAYKAKLDYTVSEMAIIQKAHGDGYIGGTTVERDGKLQDG 150
Query: 226 KPV-------------------WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFY 266
K V W P YT HK+ AGLLD + +A+N QALK+ M +Y
Sbjct: 151 KIVYEEVRKHVITSHGFDLNGGWVPLYTWHKVHAGLLDAHRYANNGQALKIAIGMSDYLI 210
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
V+ S E L E GG+N+ +Y T D ++L A L LA
Sbjct: 211 G----VLGDLSDEEMQKVLAAEHGGLNETYAEMYVRTGDKRYLDTARRIYHKAVLTPLAQ 266
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
+ D++ G HANT IP +IG YEVTGD Y T ++F D V H Y GG SAGE +
Sbjct: 267 RRDELEGKHANTQIPKLIGLARLYEVTGDKAYGDTASYFWDRVIHHHSYVIGGNSAGEHF 326
Query: 387 SDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP 446
P +L+ L + ESC TYNMLK++RHL++W + + DYYERA N +L+ Q +
Sbjct: 327 GAPDKLSGRLDDKTCESCNTYNMLKLTRHLYQWQPDAAWFDYYERAHLNHILAHQ-DPQT 385
Query: 447 GVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y +PL G + S T +SFWCC G+G+
Sbjct: 386 GAFVYFVPLASGSQRLYS-----TPDTSFWCCVGSGM 417
>gi|337745980|ref|YP_004640142.1| hypothetical protein KNP414_01710 [Paenibacillus mucilaginosus
KNP414]
gi|336297169|gb|AEI40272.1| protein of unknown function DUF1680 [Paenibacillus mucilaginosus
KNP414]
Length = 636
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 147/398 (36%), Positives = 209/398 (52%), Gaps = 30/398 (7%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG----SPTAGKAYE-- 155
+KE+S V+L P L R + N Y++ L ++L+ +F AG S G
Sbjct: 1 MKELSSGRVRLAPGPLQARLE-LNKRYVMSLTNENLLRNFYLEAGLWSYSGNGGTTSATT 59
Query: 156 -----------GWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQN 204
GWE PTCELRGH +GH+LSA+A ++ T + +K K +V+ L+ CQ
Sbjct: 60 TSTDGPEHWHWGWESPTCELRGHIMGHWLSAAATIYGQTQDGLVKAKADYIVAELARCQE 119
Query: 205 KMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEY 264
G +L+AFP R K VWAP+YTIHK+L GL D Y A + AL++ M +
Sbjct: 120 ANGGEWLAAFPESYMHRIARGKYVWAPHYTIHKLLMGLYDMYRLAGSAAALELMTNMAAW 179
Query: 265 FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
FY R + T+ ++ + L+ ETGGM + LY +T HL L +D+ F L
Sbjct: 180 FY-RWTDGFTREEMD---DLLDLETGGMLETWADLYGVTGSGAHLELVRRYDRRRFFDAL 235
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGY-ATGGTSAG 383
D ++ HANT IP ++G+ +EVTG+ Y+ F + GY ATG G
Sbjct: 236 LEGRDVLTNKHANTQIPEILGAARAWEVTGEERYRRIVEAFWRCAVSERGYTATGAGDNG 295
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E W +A+ LG +E C YNM+++++ L RWT + YADY+ER NGVL+ Q G
Sbjct: 296 ELWMPQGEMAARLGA-GQEHCCNYNMMRLAQVLLRWTGDPAYADYWERRFVNGVLAHQHG 354
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
E G++ Y + LG G K WGT FWCC+GT
Sbjct: 355 -ETGMISYFIGLGAGSRKT-----WGTPTGHFWCCHGT 386
>gi|379719928|ref|YP_005312059.1| hypothetical protein PM3016_2010 [Paenibacillus mucilaginosus 3016]
gi|378568600|gb|AFC28910.1| hypothetical protein PM3016_2010 [Paenibacillus mucilaginosus 3016]
Length = 641
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 147/398 (36%), Positives = 209/398 (52%), Gaps = 30/398 (7%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG----SPTAGKAYE-- 155
+KE+S V+L P L R + N Y++ L ++L+ +F AG S G
Sbjct: 6 MKELSSGRVRLAPGPLQARLE-LNKRYVMSLTNENLLRNFYLEAGLWSYSGNGGTTSATT 64
Query: 156 -----------GWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQN 204
GWE PTCELRGH +GH+LSA+A ++ T + +K K +V+ L+ CQ
Sbjct: 65 TSTDGPEHWHWGWESPTCELRGHIMGHWLSAAATIYGQTQDGLVKAKADYIVAELARCQE 124
Query: 205 KMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEY 264
G +L+AFP R K VWAP+YTIHK+L GL D Y A + AL++ M +
Sbjct: 125 ANGGEWLAAFPESYMHRIARGKYVWAPHYTIHKLLMGLYDMYRLAGSAAALELMTNMAAW 184
Query: 265 FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
FY R + T+ ++ + L+ ETGGM + LY +T HL L +D+ F L
Sbjct: 185 FY-RWTDGFTREEMD---DLLDLETGGMLETWADLYGVTGSGAHLELVRRYDRRRFFDAL 240
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGY-ATGGTSAG 383
D ++ HANT IP ++G+ +EVTG+ Y+ F + GY ATG G
Sbjct: 241 LEGRDVLTNKHANTQIPEILGAARAWEVTGEERYRRIVEAFWRCAVSERGYTATGAGDNG 300
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E W +A+ LG +E C YNM+++++ L RWT + YADY+ER NGVL+ Q G
Sbjct: 301 ELWMPQGEMAARLGA-GQEHCCNYNMMRLAQVLLRWTGDPAYADYWERRFVNGVLAHQHG 359
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
E G++ Y + LG G K WGT FWCC+GT
Sbjct: 360 -ETGMISYFIGLGAGSRKT-----WGTPTGHFWCCHGT 391
>gi|256376951|ref|YP_003100611.1| hypothetical protein Amir_2836 [Actinosynnema mirum DSM 43827]
gi|255921254|gb|ACU36765.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 614
Score = 231 bits (588), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 145/373 (38%), Positives = 194/373 (52%), Gaps = 20/373 (5%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHM 179
R + YL LD D L+ +F++ G + GWE PT ELRGH GH LSA A
Sbjct: 66 RNESRTHAYLKFLDPDRLLHTFRRNVGLASGATPCGGWESPTTELRGHSTGHVLSALAQA 125
Query: 180 WASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQFDRFEALKPVWAPYYT 234
ST + K K +V+ L+ CQ++ +GYLSAFP DR EA + VWAPYYT
Sbjct: 126 HTSTGDTAFKTKSDYLVAGLAACQDRAAAAGFNTGYLSAFPESFIDRVEARQQVWAPYYT 185
Query: 235 IHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMND 294
+HKILAGLLD + + QAL + + R + + + L E GGMN+
Sbjct: 186 LHKILAGLLDAHQLTGSAQALTVLTRKAAWVAWRNG----RLTQAQRQAMLGTEFGGMNE 241
Query: 295 VLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTG 354
VL LY +T DP HL A FD LA D +SGFHANT IP +G+ Y TG
Sbjct: 242 VLANLYQLTGDPLHLTAARYFDHAQVFDPLAAGRDALSGFHANTQIPKALGAIREYHATG 301
Query: 355 DPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSR 414
+ Y+ F + V +H YA GG S GE++ +P R+AS L E C T+NMLK++R
Sbjct: 302 ETRYRDIARNFWNFVVGAHTYAIGGNSNGEYFKNPGRIASELSDSTCECCNTHNMLKLTR 361
Query: 415 HLFR---WTKEMVYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGDSKAKSYHGWGT 470
LFR E+ D++E+AL N +L Q + G Y +PL G + S
Sbjct: 362 QLFRTEPGRPELF--DFHEKALYNHLLGAQNPDSAHGHHSYYVPLRAGGQRTFS-----N 414
Query: 471 RFSSFWCCYGTGL 483
+ F CC+GTG+
Sbjct: 415 DYQDFTCCHGTGM 427
>gi|427385120|ref|ZP_18881625.1| hypothetical protein HMPREF9447_02658 [Bacteroides oleiciplenus YIT
12058]
gi|425727288|gb|EKU90148.1| hypothetical protein HMPREF9447_02658 [Bacteroides oleiciplenus YIT
12058]
Length = 778
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 142/393 (36%), Positives = 198/393 (50%), Gaps = 39/393 (9%)
Query: 110 VKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFV 169
++L P S A N E+LL L D L+ F+ AG G+ Y GWE + + GH +
Sbjct: 44 LRLLPGSPFKHAMDKNGEWLLDLSPDRLLHRFRLNAGLTPKGEIYGGWE--SRGVSGHTL 101
Query: 170 GHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ----------- 218
GHYLSA A M+A++ + KE++ +V L+ECQ+ +GY+ P E
Sbjct: 102 GHYLSACAMMYAASGDKRFKERVDYIVKELAECQDARKTGYVGGIPDEDKIWAEVSSGDI 161
Query: 219 ----FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQ 270
FD L W P+YT+HK+ AGL+D Y +A + QA K++ W V F +
Sbjct: 162 RSQGFD----LNGGWVPWYTLHKLWAGLIDAYRYAGSEQAKEVGTKLSDWAVRSFGD--- 214
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
S E L E GGMN+ +Y IT + +L LA F L L Q D+
Sbjct: 215 -----LSEEDFQKMLACEFGGMNESFADMYAITGNESYLKLARQFYHKAILDPLKEQRDE 269
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
+ G H+NT +P +IG YE+TGD TF+ D + H Y GG S E P
Sbjct: 270 LEGKHSNTQVPKIIGEARLYELTGDKDMHTIATFYWDRIVNHHTYVNGGNSNYEHLGKPD 329
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
L L E+C TYNMLK+++HLF W + Y DYYE+AL N +L+ Q + G++
Sbjct: 330 CLNDRLSPFTSETCNTYNMLKLTKHLFSWDPQAAYMDYYEQALYNHILASQN-PDDGMVC 388
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y +PL G K + TRF SFWCC +G+
Sbjct: 389 YSVPLESGTKKE-----FSTRFDSFWCCVASGI 416
>gi|326203856|ref|ZP_08193718.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
gi|325985954|gb|EGD46788.1| protein of unknown function DUF1680 [Clostridium papyrosolvens DSM
2782]
Length = 854
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 139/393 (35%), Positives = 206/393 (52%), Gaps = 22/393 (5%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWED 159
D L+ + V + + L A + YL +D + L+ +++TAG T+ Y GWE+
Sbjct: 36 DKLQPFDMEQVNITDTYLA-NAFNKEISYLQSIDPNRLLVGYRQTAGLSTSYSKYGGWEN 94
Query: 160 PTCELRGHFVGHYLSASAHMWASTH-----NVTLKEKMTAVVSALSECQNKMGSGYLSAF 214
L+GH +GHY+SA A + +T N +K+++ ++S L +CQNK G GY+ A
Sbjct: 95 --TPLKGHTLGHYMSALAQAYKNTKSNATVNADMKKRIDLIISELQQCQNKRGDGYIYAE 152
Query: 215 PSEQFDRFE--ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNV 272
EQF+ E A +WAP+YT+HKI++GL+ Y N AL + + ++ YNRV
Sbjct: 153 TPEQFNVVEGKATGTLWAPWYTMHKIMSGLISIYELEGNPTALTVASKLGDWIYNRVN-- 210
Query: 273 ITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDIS 332
+ L E GGMND L LY +T HL A F++P L +A + ++
Sbjct: 211 --AWDSATQAKVLGVEYGGMNDCLIELYKLTGKSNHLAAAKKFEEPSLLNTIASGNNVLA 268
Query: 333 GFHANTHIPVVIGSQMRYEVTG--DPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
G HANT IP IG+ RY G + Y F ++V H Y TGG S E +
Sbjct: 269 GKHANTTIPKFIGAINRYRTLGTSEASYLTAAQQFWNMVIRDHTYVTGGNSQWEAFRAAG 328
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+L N E+C +YNMLK++R LF+ T ++ YAD+YER+ N +L+ Q E G+
Sbjct: 329 KLDQYRDEVNNETCNSYNMLKLTRELFQVTGDVKYADFYERSFINEILASQN-PETGMTT 387
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+G G K + F +FWCC GTG+
Sbjct: 388 YFKPMGTGYFKV-----FSKPFDNFWCCTGTGM 415
>gi|423223548|ref|ZP_17210017.1| hypothetical protein HMPREF1062_02203 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638305|gb|EIY32149.1| hypothetical protein HMPREF1062_02203 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 777
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 138/379 (36%), Positives = 196/379 (51%), Gaps = 32/379 (8%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A++ YLL L+ D + F+ AG YEGWE + + G +GHYLSA A +
Sbjct: 51 AEEKETAYLLELEPDRFLSGFRSEAGLVPKAPKYEGWE--SLGVAGQTLGHYLSACAMYY 108
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP---------------SEQFDRFEAL 225
A++ + +++ ++ L CQ G GYL+A P S+ FD L
Sbjct: 109 ATSGDERFLQRLEYTINELDSCQQANGDGYLAATPDGKRIFKEVSAGKIYSQGFD----L 164
Query: 226 KPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSL 285
W P Y +HK+LAGL+D Y +A N +AL + + + + Y Q++ + E+ L
Sbjct: 165 NGGWVPLYVMHKVLAGLIDTYQYAHNERALAVAEKLANWMYGTFQHL----TEEQMQKVL 220
Query: 286 NEETGGMNDVLYRLYTITQDPKHLLLAHLFDK-PCFLGLLAVQADDISGFHANTHIPVVI 344
E GGMN+ L LY T++ K L LA FD + LAV DD+ G HANT +P +I
Sbjct: 221 ACEFGGMNEALANLYACTKNEKFLALAQRFDNHKAIMDSLAVGVDDLEGKHANTQVPKII 280
Query: 345 GSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESC 404
G+ YE+TG +FF V +H Y GG S GE + P +L L T N E+C
Sbjct: 281 GAARLYELTGSKRDSAIASFFWHTVVQNHSYVNGGNSDGEHFGTPGQLNERLSTSNTETC 340
Query: 405 TTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKS 464
TYNMLK++RHLF W Y+ YYERA+ N +L+ Q + G+ Y PL G K
Sbjct: 341 NTYNMLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGKK--- 396
Query: 465 YHGWGTRFSSFWCCYGTGL 483
G+ + F SF CC G+G+
Sbjct: 397 --GYLSPFQSFCCCSGSGM 413
>gi|224536588|ref|ZP_03677127.1| hypothetical protein BACCELL_01463 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521844|gb|EEF90949.1| hypothetical protein BACCELL_01463 [Bacteroides cellulosilyticus
DSM 14838]
Length = 777
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 138/379 (36%), Positives = 196/379 (51%), Gaps = 32/379 (8%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A++ YLL L+ D + F+ AG YEGWE + + G +GHYLSA A +
Sbjct: 51 AEEKETAYLLELEPDRFLSGFRSEAGLVPKAPKYEGWE--SLGVAGQTLGHYLSACAMYY 108
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP---------------SEQFDRFEAL 225
A++ + +++ ++ L CQ G GYL+A P S+ FD L
Sbjct: 109 ATSGDERFLQRLEYTINELDSCQQANGDGYLAATPDGKRIFKEVSAGKIYSQGFD----L 164
Query: 226 KPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSL 285
W P Y +HK+LAGL+D Y +A N +AL + + + + Y Q++ + E+ L
Sbjct: 165 NGGWVPLYVMHKVLAGLIDTYQYAHNERALVVAEKLANWMYGTFQHL----TEEQMQKVL 220
Query: 286 NEETGGMNDVLYRLYTITQDPKHLLLAHLFDK-PCFLGLLAVQADDISGFHANTHIPVVI 344
E GGMN+ L LY T++ K L LA FD + LAV DD+ G HANT +P +I
Sbjct: 221 ACEFGGMNEALANLYACTKNEKFLALAQRFDNHKAIMDSLAVGVDDLEGKHANTQVPKII 280
Query: 345 GSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESC 404
G+ YE+TG +FF V +H Y GG S GE + P +L L T N E+C
Sbjct: 281 GAARLYELTGSKRDSAIASFFWHTVVQNHSYVNGGNSDGEHFGTPGQLNERLSTSNTETC 340
Query: 405 TTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKS 464
TYNMLK++RHLF W Y+ YYERA+ N +L+ Q + G+ Y PL G K
Sbjct: 341 NTYNMLKLTRHLFSWQSLPEYSAYYERAVFNHILASQN-PDDGMCTYYTPLISGGKK--- 396
Query: 465 YHGWGTRFSSFWCCYGTGL 483
G+ + F SF CC G+G+
Sbjct: 397 --GYLSPFQSFCCCSGSGM 413
>gi|373954098|ref|ZP_09614058.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373890698|gb|EHQ26595.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 787
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 144/389 (37%), Positives = 209/389 (53%), Gaps = 24/389 (6%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
+L DVKL S +A + + YLL ++ D L+ F+ +G GK YEGWE + L
Sbjct: 49 NLKDVKLLNSPFK-QAMEVDAAYLLSIEPDRLLSGFRAHSGLKPKGKMYEGWE--SSGLA 105
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF------ 219
GH +GHYLSA + +A+T + +++ +V L ECQ +GY+ A P E
Sbjct: 106 GHTLGHYLSAISMHYAATRDPEFLKRVNYIVKELGECQVARKTGYVGAIPKEDTVWAEVA 165
Query: 220 -----DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
R L W+P+YT+HK++AGLLD + + ++TQAL + K M ++ ++N+
Sbjct: 166 KGDIRSRGFDLNGGWSPWYTVHKVMAGLLDAFLYCNSTQALHVCKGMADWTGETLKNL-- 223
Query: 275 KYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
E+ L E GGM + L LY I + K+L L++ F L LA Q D + G
Sbjct: 224 --DDEKLQKMLLCEYGGMAETLVNLYAINGNKKYLDLSYKFYDKRILDPLANQQDILPGK 281
Query: 335 HANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAS 394
H+NT IP +I S RYE+ GD K FF + + +H YATGG S E+ S+P +L
Sbjct: 282 HSNTQIPKIIASARRYELNGDKKDKAIAEFFWETIVNNHSYATGGNSNYEYLSEPNKLND 341
Query: 395 TLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLP 454
L E+C TYNMLK++RHLF DYYE+AL N +L+ Q E G+M Y +P
Sbjct: 342 KLTENTTETCNTYNMLKLTRHLFALEPSAKLMDYYEKALYNHILASQ-NHETGMMCYFVP 400
Query: 455 LGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
L G K + + F +F CC G+G+
Sbjct: 401 LRMGGKKE-----YSSPFDTFTCCVGSGM 424
>gi|21231831|ref|NP_637748.1| hypothetical protein XCC2394 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66768042|ref|YP_242804.1| hypothetical protein XC_1718 [Xanthomonas campestris pv. campestris
str. 8004]
gi|21113547|gb|AAM41672.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66573374|gb|AAY48784.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 791
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 142/402 (35%), Positives = 204/402 (50%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 IRAVPLAQVRLMPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS----- 216
+ GH +GHYLSA A M A T + + + + +V+ L+ CQ G GY++ F
Sbjct: 108 --IAGHTLGHYLSALALMHAQTDDAQCRTRASYLVAELARCQAHAGDGYVAGFTRKNAAG 165
Query: 217 ------EQFDRFE--ALKPV-------WAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
E FD + ++P+ WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGREVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVGL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q V + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQAVFSVLDDAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF + V H Y GG
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P +A L + E C++YNMLK++RHL++W + Y DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSIARFLTEQTCEHCSSYNMLKLTRHLYQWGPQAAYFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM P+ G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|374984433|ref|YP_004959928.1| secreted protein [Streptomyces bingchenggensis BCW-1]
gi|297155085|gb|ADI04797.1| secreted protein [Streptomyces bingchenggensis BCW-1]
Length = 875
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 150/405 (37%), Positives = 209/405 (51%), Gaps = 18/405 (4%)
Query: 92 PDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG 151
P G A ++ L V L PS+ Q N YL +D+D L+ +F+ G ++
Sbjct: 70 PRGRARALTGVRPFPLGAVTLLPSAFK-DNQSRNTAYLRYVDIDRLLHTFRLNVGLASSA 128
Query: 152 KAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM----- 206
+ GWE PT ELRGH GH LS A +A+T + L +K +VSAL+ CQ K
Sbjct: 129 QPCGGWESPTTELRGHSTGHLLSGLALSYANTGDTALLDKGRKLVSALAACQAKSPAAGY 188
Query: 207 GSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFY 266
G GYLSAFP FDR E+ VWAPYYTIHKI+AGL+DQ+ A N +AL + + +
Sbjct: 189 GQGYLSAFPENFFDRLESGSGVWAPYYTIHKIMAGLVDQHRLAGNAEALDVVERQAAWVD 248
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
R K ++ L E GGMN+VL L+ IT D + L +A F LA
Sbjct: 249 TRTG----KLGYDQMQRVLQTEFGGMNEVLADLHAITGDTRWLRVAERFTHARVFDPLAR 304
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
D ++G HANT IP ++G+ +E + Y+ G F IV H Y GG S GE +
Sbjct: 305 NEDQLAGLHANTQIPKMVGALRLWEQGLNSRYRTIGENFWKIVTDHHTYVIGGNSNGEAF 364
Query: 387 SDPKRLASTLGTENEESCTTYNMLKVSRHL-FRWTKEMVYADYYERALTNGVLSIQR-GT 444
+P +A+ L E+C +YNMLK++R + F DYYER L N +L Q +
Sbjct: 365 HEPDAIAAQLSNNCCENCNSYNMLKLTRLIHFHAPDRTDLLDYYERTLFNQMLGEQDPDS 424
Query: 445 EPGVMIYMLPLGRGDSKAK-SYHG-----WGTRFSSFWCCYGTGL 483
G IY L G K + S+ G + T +++F C +G+G+
Sbjct: 425 AHGFNIYYTGLAPGAFKQQPSFMGTDPNQYSTDYNNFSCDHGSGM 469
>gi|375308065|ref|ZP_09773352.1| hypothetical protein WG8_1877 [Paenibacillus sp. Aloe-11]
gi|375080396|gb|EHS58617.1| hypothetical protein WG8_1877 [Paenibacillus sp. Aloe-11]
Length = 759
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 138/393 (35%), Positives = 207/393 (52%), Gaps = 20/393 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPT-AGKAYEGWEDP 160
L +S V L+ SL AQ L++LL ++ D ++++F+K A T A GW+
Sbjct: 185 LHGISTQKVHLEGPSLLKSAQNRRLQFLLTVNDDQMLYNFRKAASLDTLNAPAMIGWDSD 244
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQ------NKMGSGYLSAF 214
L+GH GHYLSA A +AST N + +K+ +V L++ Q ++ G+LSA+
Sbjct: 245 ESLLKGHTTGHYLSALALCYASTGNERIHQKLAYLVDELNKVQLAFEADDRYHYGFLSAY 304
Query: 215 PSEQFDRFEAL---KPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQN 271
EQFD E +WAPYYT+HKILAGLLD Y A AL + + ++ YNR+ +
Sbjct: 305 SEEQFDLLEVYTRYPEIWAPYYTLHKILAGLLDSYHIAGIELALAIADKVGDWIYNRL-S 363
Query: 272 VITKYSVERHWN-SLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
V+ +++ W + E GG+N+ L L+T TQ H+ A LFD + Q D
Sbjct: 364 VLPHEQLKKMWGLYIAGEFGGINESLAELFTYTQKEHHIAAAKLFDNDRLFFPMEQQVDA 423
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
+ HAN HIP ++G+ +E TG+ Y FF + V +H Y+ GGT GE + P
Sbjct: 424 LGAMHANQHIPQIVGAFKIFEATGEQKYYDIAKFFWESVVNAHIYSIGGTGEGEMFKQPH 483
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
++ + L E+C +YN+LK+++ L+ + + Y DYYER + N +LS G
Sbjct: 484 KIGTHLTEHTAETCASYNLLKLTKQLYVYENDAKYMDYYERTMLNHILSSTDHECLGAST 543
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y +P G K G+ S CC+GTGL
Sbjct: 544 YFMPTSPGGQK-----GYDEENS---CCHGTGL 568
>gi|374991816|ref|YP_004967311.1| secreted protein [Streptomyces bingchenggensis BCW-1]
gi|297162468|gb|ADI12180.1| secreted protein [Streptomyces bingchenggensis BCW-1]
Length = 858
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 144/377 (38%), Positives = 193/377 (51%), Gaps = 23/377 (6%)
Query: 123 QTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWAS 182
+ L YL +D + L+ +F+ P+ + GWE P LRGH GH LSA A A
Sbjct: 75 RRTLAYLRFVDPERLLHTFRLNVQLPSTAQPCGGWEAPNVLLRGHSTGHLLSALAFAHAH 134
Query: 183 THNVTLKEKMTAVVSALSECQNKMG-----SGYLSAFPSEQFDRFEALKPVWAPYYTIHK 237
T T +K +V+AL+ECQ +GYLSAFP FD EA WAPYYTIHK
Sbjct: 135 TGEQTYADKARGIVAALAECQAASPGAGYRTGYLSAFPERIFDELEAGGKPWAPYYTIHK 194
Query: 238 ILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLY 297
I+AGLLDQ+ + N QAL++ + M + +R + + +++R L E GGMN+VL
Sbjct: 195 IMAGLLDQHRLSGNDQALEVLRGMAAWVDSRTAP-LDEATMQR---LLGVEFGGMNEVLA 250
Query: 298 RLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPL 357
LY +T DP HL A FD G L D++ G HANT I ++G+ Y TGDP
Sbjct: 251 GLYLVTGDPVHLRTARRFDHQSLYGPLDEGRDELDGRHANTEIAKIVGAAEEYRATGDPR 310
Query: 358 YKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLF 417
Y F DIV H Y GG S EF+ P ++ S L + E+C +YNMLK+ R LF
Sbjct: 311 YLRIARNFWDIVVRDHSYVIGGNSNQEFFGPPGQIVSRLSEDTCENCNSYNMLKIGRQLF 370
Query: 418 -RWTKEMVYADYYERALTNGVLSIQ-RGTEPGVMIY---------MLPLGRGDSKAKSYH 466
Y D+YE L N +L Q ++ G + Y P G S SY
Sbjct: 371 LHEPGRAAYMDHYEWTLYNQMLGEQDPDSDHGFVTYYTGLWAGSRRQPKGGLGSAPGSYS 430
Query: 467 GWGTRFSSFWCCYGTGL 483
G + +F C +GTG+
Sbjct: 431 G---DYDNFSCDHGTGM 444
>gi|399029634|ref|ZP_10730435.1| hypothetical protein PMI10_02273 [Flavobacterium sp. CF136]
gi|398072450|gb|EJL63666.1| hypothetical protein PMI10_02273 [Flavobacterium sp. CF136]
Length = 642
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 145/404 (35%), Positives = 212/404 (52%), Gaps = 24/404 (5%)
Query: 88 KMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG- 146
KM + K+ G +L DVKL S + + ++++ + L+ SF+ AG
Sbjct: 34 KMDDTKNVKVLG-----FNLQDVKLLDSPFKDNMMRES-KWIMDISTKRLLHSFKTNAGV 87
Query: 147 -SPTAGKAYE-----GWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALS 200
S G + GWE C+LRGH GH LS A ++A+T K K ++V+ L
Sbjct: 88 FSSQEGGYFTVDKLGGWESLDCDLRGHSTGHILSGLALLYAATGEKMYKIKADSLVTGLD 147
Query: 201 ECQNKMG-SGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTK 259
E Q + +GYLSAFP DR A K VWAP+YT HK+ +GL+DQY + D+ AL++ K
Sbjct: 148 EVQKVLNQNGYLSAFPQNLIDRAIAGKSVWAPWYTQHKLFSGLMDQYLYCDSEPALEIVK 207
Query: 260 WMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPC 319
M ++ Y +++++ E L E GGMND Y LY IT + K+ LA F
Sbjct: 208 GMADWAYEKLKSLTN----EERKRMLRNEFGGMNDSFYALYEITAESKYKFLAEFFYHED 263
Query: 320 FLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGG 379
L L + D+++ HANT+IP +IG YE+ G + FF + V H + TG
Sbjct: 264 ALDPLLNKTDNLNKKHANTYIPKLIGISRDYELEGGSKNREIPEFFWNTVVNHHTFVTGS 323
Query: 380 TSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLS 439
S E + +P L+ L ESC YNMLK++RHL+ ++ Y DYYE+AL N +L
Sbjct: 324 NSDKEKFFEPDHLSEHLSGFTGESCNVYNMLKLTRHLYGVNPQIKYVDYYEKALYNHILG 383
Query: 440 IQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Q+ + G++ Y LP+ G K + T +SFWCC G+G
Sbjct: 384 -QQDPKTGMVAYFLPMMPGAHKV-----YSTPENSFWCCVGSGF 421
>gi|390452646|ref|ZP_10238174.1| hypothetical protein PpeoK3_01345 [Paenibacillus peoriae KCTC 3763]
Length = 767
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 136/394 (34%), Positives = 205/394 (52%), Gaps = 20/394 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDP 160
+KE V L+ S A L+++ ++ D ++++F++ A T G + GW+ P
Sbjct: 191 VKEFKGQKVSLERESEFEAAMNRFLQFVRSVNDDQMLYNFREAAAIDTKGAQPMTGWDAP 250
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM------GSGYLSAF 214
C L+GH GHYLSA A + +T + L K+ +V L +CQ + G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYNATEDSALLGKIQYMVVELGKCQTALSEQAGYGRGFLSAY 310
Query: 215 PSEQFDRFE---ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQN 271
EQF+ E +WAPYYT+HKI+AGLLD Y A +AL + + + +NR+
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALDICDKLGHWLHNRLGR 370
Query: 272 VITKYSVERHWN-SLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+ + + + W+ + E GGMN+VL +LY IT + +L+ A FD + D
Sbjct: 371 -LPREQLHKMWSLYIAGEFGGMNEVLAKLYAITGNKNYLMTAKYFDNEKLFLPMKENVDT 429
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
+ HAN HIP VIG+ +EV GD Y F +V SH Y GGT E + +P
Sbjct: 430 LGNTHANQHIPQVIGALKLFEVAGDEAYFNIAENFWTMVTQSHIYPIGGTGETEMFREPD 489
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP-GVM 449
+A L + E+C +YNMLK+++ LF++ Y DYYE+AL N +L+ + + G
Sbjct: 490 AIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEGGS 549
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y +PL G K H CC+GTGL
Sbjct: 550 TYFMPLAPGSIKKFDTH-------ENTCCHGTGL 576
>gi|291544618|emb|CBL17727.1| Uncharacterized protein conserved in bacteria [Ruminococcus
champanellensis 18P13]
Length = 597
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 131/381 (34%), Positives = 204/381 (53%), Gaps = 15/381 (3%)
Query: 105 VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYE---GWEDPT 161
+ + + L P RA N YL+ L ++L+ +F AG T E GWE PT
Sbjct: 5 IQIENTYLLPGLFKERAD-INRAYLMELKSENLLQNFLLEAGVRTDRDVTEMHLGWESPT 63
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
C+LRGHF+GH+LSA+A + A + LK K+ ++ AL+ CQ G ++ + P + F++
Sbjct: 64 CQLRGHFLGHWLSAAALLIAQNQDRELKAKLDTIIDALARCQELNGGRWIGSIPEKYFEK 123
Query: 222 FEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERH 281
+ + +W+P YT+HK L GL +A N AL++ +++ + ++ K +
Sbjct: 124 LKKNEYIWSPQYTLHKTLLGLYHSALYAKNQVALEILGRAADWYLEWTEKMMQKNPHAVY 183
Query: 282 WNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIP 341
+ E GGM +V LY +T+D ++L LA + P G LA D +S HAN IP
Sbjct: 184 ----SGEEGGMLEVWAGLYQLTEDERYLTLAQRYAHPSIFGRLADGEDPLSNCHANASIP 239
Query: 342 VVIGSQMRYEVTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN 400
G+ YE+TGD + ++ F+ V+ + TGG ++GEFW P++L LG
Sbjct: 240 WAHGAAKMYEITGDAAWLELVKRFWQCAVSDRDAFCTGGQNSGEFWIPPRKLGMFLGERT 299
Query: 401 EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
+E CT YNM++++ +LF +T Y DY E L NG L+ Q+ G+ Y LP+
Sbjct: 300 QEFCTVYNMVRLADYLFCFTGAHEYLDYIENNLYNGFLA-QQNKYTGMPAYFLPM----- 353
Query: 461 KAKSYHGWGTRFSSFWCCYGT 481
KA S WG++ FWCC+GT
Sbjct: 354 KAGSVKKWGSKTKDFWCCHGT 374
>gi|302548275|ref|ZP_07300617.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Streptomyces
hygroscopicus ATCC 53653]
gi|302465893|gb|EFL28986.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Streptomyces
himastatinicus ATCC 53653]
Length = 849
Score = 228 bits (582), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 143/375 (38%), Positives = 198/375 (52%), Gaps = 17/375 (4%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWA 181
Q N YL +D++ L+ +F+ G ++ + GWE PT ELRGH GH LS A +A
Sbjct: 72 QSRNTAYLRFVDINRLLHTFRLNVGIASSAQPCGGWESPTTELRGHSTGHLLSGLALTYA 131
Query: 182 STHNVTLKEKMTAVVSALSECQNKMGS-----GYLSAFPSEQFDRFEALKPVWAPYYTIH 236
+T + L +K +VSAL+ CQ K + GYLSAFP FDR EA VWAPYYTIH
Sbjct: 132 NTGDTALLDKSRKLVSALAACQAKSPAAGYRTGYLSAFPENFFDRLEAGSGVWAPYYTIH 191
Query: 237 KILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVL 296
KI+AGL+DQY A N +AL+ + R + S ++ L E GGMNDVL
Sbjct: 192 KIMAGLVDQYRLAGNAEALETVLRQAAWVDTRT----ARLSYDQMQRVLETEYGGMNDVL 247
Query: 297 YRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP 356
L+ IT D + L +A F L+ D ++G HANT IP ++G+ +E D
Sbjct: 248 ADLHAITGDSRWLRVAERFTHARVFDPLSRNEDRLAGLHANTQIPKMVGALRLWEEGLDS 307
Query: 357 LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
Y+ G F IV H Y GG S GE + +P +A+ L E+C +YNMLK++R +
Sbjct: 308 RYRTIGENFWKIVTDHHTYVIGGNSNGEAFHEPDAIAAQLSGSCCENCNSYNMLKLARLI 367
Query: 417 -FRWTKEMVYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGDSKAK-SYHG-----W 468
F + DYYER L N +L Q + G IY L G K + S+ G +
Sbjct: 368 HFHAPERTDLLDYYERTLFNQMLGEQDPDSAHGFNIYYTGLAPGSFKQQPSFMGPDPNQY 427
Query: 469 GTRFSSFWCCYGTGL 483
T + +F C +G+G+
Sbjct: 428 STDYDNFSCDHGSGM 442
>gi|188991168|ref|YP_001903178.1| hypothetical protein xccb100_1772 [Xanthomonas campestris pv.
campestris str. B100]
gi|167732928|emb|CAP51124.1| Putative secreted protein [Xanthomonas campestris pv. campestris]
Length = 791
Score = 228 bits (581), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 204/402 (50%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 IRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + + +V+ L+ CQ +G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTDDAHCRTRASYLVAELARCQAHVGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFE--ALKPV-------WAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + ++P+ WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVAL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFAALDDTQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHTVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF + V H Y GG
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C++YNMLK++RHL++W + Y DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCSSYNMLKLTRHLYQWGPQAAYFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM P+ G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|334364979|ref|ZP_08513951.1| conserved hypothetical protein [Alistipes sp. HGB5]
gi|313158812|gb|EFR58195.1| conserved hypothetical protein [Alistipes sp. HGB5]
Length = 778
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 141/391 (36%), Positives = 206/391 (52%), Gaps = 26/391 (6%)
Query: 105 VSLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCE 163
V L+DV++ LH AQ+ + +L +D D + F+ AG Y GWE C
Sbjct: 45 VPLNDVRITGGPFLH--AQEMDRRWLDSMDPDRYLSGFRSEAGLEPKAPRYGGWESAGCS 102
Query: 164 LRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--FDR 221
GH GH+LSA+A M+A+T + L +K+ + L+ECQ K G+G L+ F + F
Sbjct: 103 --GHGFGHFLSAAAMMYAATGDRALLDKINYSIDGLAECQQKEGTGLLAGFERSRALFAE 160
Query: 222 FEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNV 272
E L W P+YT+HK+ AGL+D + N +AL + + F + + +
Sbjct: 161 LERGDIRSQGFDLNGGWVPFYTLHKMYAGLVDVCRYTPNAKALTV----LVRFADWLDGL 216
Query: 273 ITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDIS 332
+ K S E+ L E GG+ + L +Y +T + K+L LA FD L LA D +
Sbjct: 217 VAKLSDEQMDKILICEHGGITESLADIYVLTGERKYLELARRFDHREILRPLAAGVDSLP 276
Query: 333 GFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRL 392
G HANT IP ++G+ YE +GD Y+ +F V H YA GG S E + P L
Sbjct: 277 GKHANTQIPKIVGAVREYECSGDERYRRIADYFWHRVVGFHSYAIGGNSEYEHFGAPGML 336
Query: 393 ASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYM 452
A+ L E+C TYNMLK+++HL++ + ADYYERAL N +L+ Q + G++ YM
Sbjct: 337 ANRLSDGTCETCNTYNMLKLTKHLYQLDPTVRRADYYERALYNQILASQ-NPDDGMVCYM 395
Query: 453 LPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
P+G G K G+ F SFWCC G+G+
Sbjct: 396 SPMGSGHRK-----GFCLPFDSFWCCVGSGM 421
>gi|302873208|ref|YP_003841841.1| hypothetical protein Clocel_0296 [Clostridium cellulovorans 743B]
gi|307688627|ref|ZP_07631073.1| hypothetical protein Ccel74_10733 [Clostridium cellulovorans 743B]
gi|302576065|gb|ADL50077.1| protein of unknown function DUF1680 [Clostridium cellulovorans
743B]
Length = 607
Score = 227 bits (579), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 131/391 (33%), Positives = 203/391 (51%), Gaps = 22/391 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG----------SPTAG 151
LK ++ ++KL PS R N YL+ + L+ +F AG +P
Sbjct: 2 LKPINTKNIKLLPSIFKERYD-LNRNYLINVKNQGLLQNFYLEAGIILPGLQVLHNPDTD 60
Query: 152 KAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYL 211
+ + GW+ PTC+LRGHF+GH+LSA+A ++ S + LK K+ ++ L +CQ G ++
Sbjct: 61 EIHWGWDAPTCQLRGHFLGHWLSAAASIFVSEQDHELKAKLDKIIDELIKCQELNGGEWI 120
Query: 212 SAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQN 271
P + F + E VW+P Y +HK+L GL++ Y ++ +AL + + ++ +
Sbjct: 121 GPIPEKYFQKLENSHHVWSPQYVMHKVLMGLMNSYIDTNSDKALAILDKLSNWYIKWTDD 180
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
++ K + E GM +V +Y IT + K+L LA + P L D +
Sbjct: 181 MLIKNPRAIY----GGEEAGMLEVWITMYEITAEEKYLELAKKYSNPRIFRDLEAGRDTL 236
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
+ HAN IP G+ YEVTGD + K+T F+ + V Y +GG AGE+W+ P
Sbjct: 237 TNCHANASIPWSHGAAKLYEVTGDEKWRKITEAFWKNAVTDRGYYCSGGQGAGEYWTPPF 296
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+L L N+E CT YNM++ + +L++WT + +ADY E L NG L+ Q+ G+
Sbjct: 297 KLGLFLSDSNQEFCTVYNMIRTASYLYKWTGDTSFADYIELNLYNGFLA-QQNKYTGMPT 355
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
Y LPLG G K WGT FWCC+GT
Sbjct: 356 YFLPLGAGSKKK-----WGTETRDFWCCHGT 381
>gi|325281981|ref|YP_004254523.1| hypothetical protein Odosp_3391 [Odoribacter splanchnicus DSM
20712]
gi|324313790|gb|ADY34343.1| protein of unknown function DUF1680 [Odoribacter splanchnicus DSM
20712]
Length = 782
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 140/397 (35%), Positives = 206/397 (51%), Gaps = 32/397 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
+K L DV+L S A N ++L +D+D L+ +F K AG G++Y WE +
Sbjct: 40 VKYFGLKDVRLLDSPFK-NAMDRNAAWMLEMDMDRLLSNFLKNAGLEPKGESYGSWE--S 96
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP------ 215
+ GH +GHYLSA A +AST + K+++ +V L CQ +G++ P
Sbjct: 97 MGIAGHTLGHYLSAVAQQYASTGDERFKQRVDYIVHELDSCQQYFVNGFIGGMPGGDRVF 156
Query: 216 ---------SEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFY 266
S FD L +W P+Y HK + GL D Y A N A K+ + +Y
Sbjct: 157 KQVKKGIIRSAGFD----LNGLWVPWYNEHKTMMGLNDAYLLAGNKTAKKVLVNLADYLV 212
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
+ V+ + E+ LN E GGMN+ L ++Y +T D K+L ++ F + LA
Sbjct: 213 D----VLAGLTDEQVQTMLNCEFGGMNEALAQVYALTGDKKYLDASYRFYHRRLMEPLAE 268
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
D + G H+NT IP +IGS +YE+TG+P + FF + H YA GG S+GE+
Sbjct: 269 GKDILPGLHSNTQIPKIIGSARQYELTGNPKDERIAEFFWTTMVNHHSYANGGNSSGEYL 328
Query: 387 SDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP 446
S P +L L E+C TYNMLK+SRHL+ WT + Y D+YE+AL N +L+ Q E
Sbjct: 329 STPDKLNDRLTHSTCETCNTYNMLKLSRHLYEWTGDPKYLDFYEKALYNHILASQH-PET 387
Query: 447 GVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G+ Y +PL G K + +++SF CC G+G
Sbjct: 388 GMTCYFVPLAMGTRKD-----FCDKYNSFTCCMGSGF 419
>gi|294667526|ref|ZP_06732741.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292602646|gb|EFF46082.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 791
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 142/402 (35%), Positives = 199/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A QTN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLTPS-LFLDALQTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + +N QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCENAQALQVAVAL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q V + +L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGVFAALDDAQLQKALSCEFGGLNESFVELHVQTGDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D ++ H+NT+IP +IG YEVTGDP FF V H Y GG
Sbjct: 282 DPLIAQRDALAHQHSNTNIPKLIGLAREYEVTGDPASGAAARFFWHTVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|375308750|ref|ZP_09774033.1| hypothetical protein WG8_2558 [Paenibacillus sp. Aloe-11]
gi|375079377|gb|EHS57602.1| hypothetical protein WG8_2558 [Paenibacillus sp. Aloe-11]
Length = 770
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/394 (34%), Positives = 206/394 (52%), Gaps = 20/394 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDP 160
+KE + V L+ S A L+++ ++ D ++++F++ A T G + GW+ P
Sbjct: 191 VKEFTGPKVSLERESEFAAAMNRFLQFVRSVNDDQMLYNFREAAAIDTKGAQPMTGWDAP 250
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM------GSGYLSAF 214
C L+GH GHYLSA A + +T + L K+ +V+ L +CQ + G G+LSA+
Sbjct: 251 ECNLKGHTTGHYLSALALAYHATEDSALLGKIQYMVAELGKCQTALSEQAGYGRGFLSAY 310
Query: 215 PSEQFDRFE---ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQN 271
EQF+ E +WAPYYT+HKI+AGLLD Y A +AL + + + ++R+
Sbjct: 311 SEEQFNLLEQYTTYPEIWAPYYTLHKIMAGLLDCYQLAGQREALDICDKLGHWLHSRLSR 370
Query: 272 VITKYSVERHWN-SLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+ + + + W+ + E GGMN+ L +LY IT + +L+ A FD + D
Sbjct: 371 -LPREQLHKMWSLYIAGEFGGMNEALAKLYAITGNENYLMTAKYFDNAKLFLPMKENVDT 429
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
+ HAN HIP VIG+ +EV GD Y F +V SH Y GGT E + +P
Sbjct: 430 LGNMHANQHIPQVIGALKLFEVAGDKAYFNIAENFWTMVTQSHIYPIGGTGETEMFREPD 489
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP-GVM 449
+A L + E+C +YNMLK+++ LF++ Y DYYE+AL N +L+ + + G
Sbjct: 490 AIAGFLTDKTAETCASYNMLKLTKELFQFNPRKTYMDYYEKALYNHILASENSQKAEGGS 549
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y +PL G K H CC+GTGL
Sbjct: 550 TYFMPLAPGSIKKFDTH-------ENTCCHGTGL 576
>gi|374322441|ref|YP_005075570.1| hypothetical protein HPL003_12970 [Paenibacillus terrae HPL-003]
gi|357201450|gb|AET59347.1| hypothetical protein HPL003_12970 [Paenibacillus terrae HPL-003]
Length = 774
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 141/375 (37%), Positives = 189/375 (50%), Gaps = 23/375 (6%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHM 179
+A + N YLL L D L+ F++ AG T YEGWE + GH +GHYLSA + M
Sbjct: 28 QAMELNRSYLLELQPDRLLARFREYAGLSTKAPQYEGWE--AMSISGHTLGHYLSACSMM 85
Query: 180 WASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFEA---------LKPV 228
+AST + KE + L CQ G GY+S P E F+ A L
Sbjct: 86 YASTGDNRFKEIAHYITDELDVCQEAHGDGYVSGIPGGKELFEEVSAGNIRSKGFDLNGA 145
Query: 229 WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEE 288
WAP YT+HK+ AGL D Y +AL + + + ++ + ++T S E+ + E
Sbjct: 146 WAPLYTLHKLFAGLRDAYHLTGCNKALLVERKLADW----LGGILTPMSDEQMQQMMFCE 201
Query: 289 TGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQM 348
GGMN+VL LY T + +L LA F L L+ Q D + G HANT IP +IG
Sbjct: 202 YGGMNEVLADLYADTGEESYLRLAECFWHKLVLDPLSSQEDCLQGIHANTQIPKLIGLAK 261
Query: 349 RYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYN 408
YE+T D + T FF D V H Y GG S GE++ P L +G E+C TYN
Sbjct: 262 EYELTNDTKRRATVEFFWDRVVDHHSYVIGGNSFGEYFGAPGGLNDRIGPHTTETCNTYN 321
Query: 409 MLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGW 468
MLK++ HLF+W AD+YER L N +L+ Q GV Y L L G K +
Sbjct: 322 MLKLTSHLFQWNVSAKEADFYERGLFNHILASQDPVHGGV-TYFLSLAMGGHKH-----F 375
Query: 469 GTRFSSFWCCYGTGL 483
++F F CC GTG+
Sbjct: 376 ESKFDDFTCCVGTGM 390
>gi|300785876|ref|YP_003766167.1| hypothetical protein AMED_3987 [Amycolatopsis mediterranei U32]
gi|384149186|ref|YP_005532002.1| hypothetical protein RAM_20325 [Amycolatopsis mediterranei S699]
gi|399537759|ref|YP_006550421.1| hypothetical protein AMES_3940 [Amycolatopsis mediterranei S699]
gi|299795390|gb|ADJ45765.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340527340|gb|AEK42545.1| hypothetical protein RAM_20325 [Amycolatopsis mediterranei S699]
gi|398318529|gb|AFO77476.1| hypothetical protein AMES_3940 [Amycolatopsis mediterranei S699]
Length = 775
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 148/396 (37%), Positives = 197/396 (49%), Gaps = 29/396 (7%)
Query: 107 LHDVKLDPSSLHWRAQQTNLE-YLLMLDVDSLVWSFQKTAGSPTAGKAYEG-WEDPTCEL 164
L V+L S W Q + YL +DV+ L++ F+ T G A G W+ P+
Sbjct: 57 LGQVRLTAS--RWLDNQNRTQNYLRFVDVNRLLYVFRANHRLSTGGAATNGGWDAPSFPF 114
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS-----GYLSAFPSEQF 219
R H GH+L+A A +WA T + T ++K T +V+ L++CQ G+ GYLS FP F
Sbjct: 115 RSHVQGHFLTAWAQLWAVTGDTTSRDKATTMVAELAKCQANNGAAGFSAGYLSGFPEADF 174
Query: 220 DRFEA--LKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVI 273
D EA L PYY IHK +AGLLD + + +TQA L + W V
Sbjct: 175 DNLEAGRLSNGNVPYYCIHKTMAGLLDVWRYIGSTQARDVLLNLAGW--------VDRRT 226
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
+ S + + LN E GGMNDVL LY T D + L A FD LA D ++G
Sbjct: 227 ARLSTSQLQSVLNTEFGGMNDVLADLYQYTGDARWLTAAQRFDHAAVFDPLAANRDQLNG 286
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
HANT +P IG+ Y+ TG Y+ T +I +H YA GG S E + P +A
Sbjct: 287 LHANTQVPKWIGAAREYKATGTTRYRDIATNAWNITVGAHTYAIGGNSQAEHFRAPNAIA 346
Query: 394 STLGTENEESCTTYNMLKVSRHLFR-WTKEMVYADYYERALTNGVLSIQRGTEP-GVMIY 451
+ L + ESC TYNMLK++R L + ADYYERAL N ++ Q + G + Y
Sbjct: 347 AYLNQDTCESCNTYNMLKLTRELIALYPDRADLADYYERALLNQMIGQQNPADSHGHITY 406
Query: 452 MLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
L RG A W T + SFWCC GTGL
Sbjct: 407 FSSLNPGGRRGLGPAWGGGTWSTDYDSFWCCQGTGL 442
>gi|456393067|gb|EMF58410.1| putative glycosylase [Streptomyces bottropensis ATCC 25435]
Length = 714
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 137/376 (36%), Positives = 190/376 (50%), Gaps = 28/376 (7%)
Query: 126 LEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCELRGHFVGHYLSASAHMWASTH 184
L Y D ++ F+ AG T G + GWE LRGH+ GH+L+ A +A T
Sbjct: 75 LNYARSYPADRILAVFRANAGLDTRGARPPGGWETSDGNLRGHYGGHFLTLVAQAYADTR 134
Query: 185 NVTLKEKMTAVVSALSECQNKMGS---------GYLSAFPSEQF---DRFEALKPVWAPY 232
LK K+ +V AL ECQ + G+L+A+P QF + + +WAPY
Sbjct: 135 EAALKSKLDQLVGALGECQAALAERGSPRPSHPGFLAAYPETQFILLESYATYPTIWAPY 194
Query: 233 YTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWN-SLNEETGG 291
YT HKI+ GLLD +T A N QAL + M ++ ++R+ + + +ER W+ + E GG
Sbjct: 195 YTCHKIMRGLLDAHTLAGNAQALTIVSRMGDWVHSRL-GALPRAQLERMWSLYIAGEYGG 253
Query: 292 MNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYE 351
MN+VL LY +T +HL A FD L A D + G HAN HIP G ++
Sbjct: 254 MNEVLADLYALTGKAEHLAAARCFDNTALLDACAQDRDILDGRHANQHIPQFTGYLRLFD 313
Query: 352 VTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLK 411
TG+ Y F +V Y+ GGT GE + +A+TL +N E+C TYNMLK
Sbjct: 314 ETGEERYAEAARNFWGMVAGPRTYSLGGTGQGEMFKARGAIAATLDDKNAETCATYNMLK 373
Query: 412 VSRHLFRWTKEMVYADYYERALTNGVLSIQRGT----EPGVMIYMLPLGRGDSKAKSYHG 467
+SRHLF + DYYER LTN +L+ +R T P V + +G G + Y
Sbjct: 374 LSRHLFFREPDAARMDYYERGLTNHILASRRDTASTSSPEVTYF---VGMGPGVVREYGN 430
Query: 468 WGTRFSSFWCCYGTGL 483
GT CC GTG+
Sbjct: 431 TGT------CCGGTGM 440
>gi|189464178|ref|ZP_03012963.1| hypothetical protein BACINT_00515 [Bacteroides intestinalis DSM
17393]
gi|189437968|gb|EDV06953.1| hypothetical protein BACINT_00515 [Bacteroides intestinalis DSM
17393]
Length = 777
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 141/394 (35%), Positives = 207/394 (52%), Gaps = 34/394 (8%)
Query: 106 SLHDVKL-DPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCEL 164
S+ DV+L D LH A N +++ LD+D L+ +F+K A + Y WE + +
Sbjct: 40 SIQDVRLLDSPFLH--AMNQNEQWMKELDLDRLLSNFRKNANLKPKAEPYGSWE--SMGI 95
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--------- 215
GH +GH L+A + +A+T + T K K+ VV+ L CQ +G++ P
Sbjct: 96 AGHTLGHLLTAMSQHYAATGDETFKAKIDYVVNELDSCQMNFVNGFIGGMPGGDKVFKEV 155
Query: 216 ------SEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRV 269
S FD L +W P+Y HK + GL D Y A N A K+ + +Y +
Sbjct: 156 KKGIIRSMGFD----LNGIWVPWYNEHKTMMGLNDAYLLAGNETAKKVLINLSDY----L 207
Query: 270 QNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQAD 329
+VI S E+ LN E GGMN+ ++Y +T D K L ++ F LA D
Sbjct: 208 ADVIAPLSEEQMQTMLNCEYGGMNEAFAQMYALTGDKKFLDASYAFYHKRLQDKLAEGVD 267
Query: 330 DISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDP 389
+ G H+NT IP +IGS +YE+TG+ + F + + H YA GG S GE+ S P
Sbjct: 268 VLQGLHSNTQIPKLIGSARQYELTGNHRDEEIARFSWETIVHHHSYANGGNSMGEYLSVP 327
Query: 390 KRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+L + LGT E+C TYNMLK++ HL+ WT ++ Y DYYERAL N +L+ Q E G +
Sbjct: 328 DKLNNRLGTNTCETCNTYNMLKLTAHLYEWTNDVQYLDYYERALYNHILASQH-PETGNV 386
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y L LG G ++ G+G+R ++F CC G+G
Sbjct: 387 CYFLSLGMG-----THKGFGSRHNNFSCCMGSGF 415
>gi|345302361|ref|YP_004824263.1| hypothetical protein Rhom172_0482 [Rhodothermus marinus
SG0.5JP17-172]
gi|345111594|gb|AEN72426.1| protein of unknown function DUF1680 [Rhodothermus marinus
SG0.5JP17-172]
Length = 641
Score = 224 bits (572), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 138/377 (36%), Positives = 196/377 (51%), Gaps = 27/377 (7%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A Q ++ YL LD D L+ F++ AG Y GWE + + GH +GHYLSA + +
Sbjct: 56 AMQRDVAYLFELDPDRLLSRFRRFAGLEPKAPEYGGWE--SQGISGHTLGHYLSALSMYY 113
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS-----EQFDRFE-------ALKPV 228
A+T + + ++ +VS L+E Q G+GY+ A P + R E +L
Sbjct: 114 AATGDEKARARIDYIVSELAEVQRAHGNGYVGAIPEGDRLWAEIARGEIWQAEPFSLNGA 173
Query: 229 WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNE 287
W P+YT+HKI GL+D Y + N QAL++ + ++ Y +N+ W L
Sbjct: 174 WVPWYTMHKIFQGLIDAYWYGGNEQALEVVTRLADWAYETTKNLTPA-----QWQQMLRT 228
Query: 288 ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQ 347
E GGMN+ L LY+IT +PKH L+ F L LA +++G HANT IP VIG
Sbjct: 229 EHGGMNEALANLYSITGNPKHRELSQKFYHAAVLSPLARGIPNLTGLHANTQIPKVIGVV 288
Query: 348 MRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTY 407
+YE+ G + FF + V H Y GG S E + LA+ LG E+C TY
Sbjct: 289 RQYELIGSDSLRAVAEFFWEEVVQHHTYVIGGNSQNEHFGPRDSLANRLGEGTAETCNTY 348
Query: 408 NMLKVSRHLFRWTKEMV-YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYH 466
NML+++RHLF E V Y D+YERAL N +L+ Q + G+ Y + L G K
Sbjct: 349 NMLRLTRHLFALHPEKVRYVDFYERALYNHILASQ-DPKHGMFTYYMSLRPGHFKT---- 403
Query: 467 GWGTRFSSFWCCYGTGL 483
+ T +SFWCC GTG+
Sbjct: 404 -YATPENSFWCCVGTGM 419
>gi|310639749|ref|YP_003944507.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus polymyxa
SC2]
gi|386038950|ref|YP_005957904.1| hypothetical protein PPM_0260 [Paenibacillus polymyxa M1]
gi|309244699|gb|ADO54266.1| Acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus polymyxa
SC2]
gi|343094988|emb|CCC83197.1| DUF1680 domain containing protein [Paenibacillus polymyxa M1]
Length = 751
Score = 224 bits (572), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 146/389 (37%), Positives = 208/389 (53%), Gaps = 24/389 (6%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
LH V +D L + A + N YLL L+ D L+ F++ AG YEGWE +
Sbjct: 7 DLHKVSIDSGPL-YHAMELNTTYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--ARGIS 63
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--SEQFDRFE 223
GH +GHYLS A M+AST + L E++ V+ L CQN G+GY+S P E F+ +
Sbjct: 64 GHTLGHYLSGCALMFASTGDKRLLERVNYVIDELEICQNSHGNGYISGIPRGKEIFEEVK 123
Query: 224 A---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
A L W P YT+HK+ AGL D + A + +AL M + ++ +++V
Sbjct: 124 AGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLAHHPKALAMEIQLGDW----LEDVFQ 179
Query: 275 KYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
S E+ L+ E GGMN+VL L + + + L LA F L LA D ++G
Sbjct: 180 GLSDEQVQQVLHCEFGGMNEVLTDLAEHSGEKRFLNLAERFYHGEVLNDLADSRDTLAGR 239
Query: 335 HANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAS 394
HANT IP +IG+ ++EVTG PLY FF D V H Y GG S E + +P +L
Sbjct: 240 HANTQIPKIIGAARQFEVTGKPLYADLSRFFWDRVVHKHSYVIGGNSYNEHFGEPGKLND 299
Query: 395 TLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLP 454
LG E+C TYNMLK++RH+F W YADYYERA+ N +L+ Q+ + G + Y +
Sbjct: 300 RLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-GRVCYFVS 358
Query: 455 LGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
L G K+ + +++ F CC G+G+
Sbjct: 359 LEMGGHKS-----FNSQYEDFTCCVGSGM 382
>gi|440694505|ref|ZP_20877120.1| hypothetical protein STRTUCAR8_01091 [Streptomyces turgidiscabies
Car8]
gi|440283503|gb|ELP70762.1| hypothetical protein STRTUCAR8_01091 [Streptomyces turgidiscabies
Car8]
Length = 747
Score = 224 bits (572), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 140/399 (35%), Positives = 200/399 (50%), Gaps = 32/399 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDP 160
L +V+L D + R + LE+ D ++ F+ AG T G + GWE
Sbjct: 90 LDQVALGD------GVFRRKRDLMLEFARSYPADRILAVFRANAGLDTRGAQPPGGWETA 143
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS---------GYL 211
LRGHF GH+L+ A +A T LK K+ +V+AL ECQ + G+L
Sbjct: 144 DGNLRGHFGGHFLTLVAQAYADTREAALKTKLDYLVTALGECQQALADHGSPRPSHPGFL 203
Query: 212 SAFPSEQF---DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNR 268
+A+P QF + + +WAPYYT HKI+ G LD +T N QAL + M ++ ++R
Sbjct: 204 AAYPETQFILLESYTTYPTIWAPYYTCHKIMRGFLDAHTLTGNQQALTIASKMGDWVHSR 263
Query: 269 VQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQ 327
+ + + ++R W+ + E GGMN+VL LY +T +HL A FD L A
Sbjct: 264 LSR-LPQAQLDRMWSIYIAGEYGGMNEVLADLYALTGRAEHLAAARCFDNTALLDACADN 322
Query: 328 ADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWS 387
D + G HAN HIP G ++ TG+ Y F +V Y+ GGT GE +
Sbjct: 323 RDILDGRHANQHIPQFTGYIRLFDHTGEAEYATAARNFWGMVAGPRTYSLGGTGQGEMFR 382
Query: 388 DPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPG 447
+A+TLG N E+C TYNMLK+SR LF T + Y DYYE+ LTN +L+ +R
Sbjct: 383 ARNAIAATLGDNNAETCATYNMLKLSRQLFFHTPDPAYMDYYEKGLTNHILASRRDARST 442
Query: 448 V---MIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
V + Y + +G G + Y GT CC GTG+
Sbjct: 443 VSPEVTYFVGMGPG--VVREYDNTGT------CCGGTGM 473
>gi|384428325|ref|YP_005637684.1| hypothetical protein XCR_2693 [Xanthomonas campestris pv. raphani
756C]
gi|341937427|gb|AEL07566.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
756C]
Length = 791
Score = 224 bits (572), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 138/402 (34%), Positives = 200/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 IRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +V+ L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTDDAQCRTRARYLVAELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFE--ALKPV-------WAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + ++P+ WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIEPLPFYLNGSWAPLYTWHKLFAGLLDVHVHCDNAQALQVAVAL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + L+ E GG+N+ L+ T + L LA
Sbjct: 226 AGY----LQGIFAALDDTQLQKVLSCEFGGLNESFVELHVRTGHAQWLALAQRLHHHAVF 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF + V H Y GG
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWNTVTDHHSYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C++YNMLK++RHL+RW + Y DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCSSYNMLKLTRHLYRWGPQAAYFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM P+ G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|325919533|ref|ZP_08181551.1| hypothetical protein XGA_0490 [Xanthomonas gardneri ATCC 19865]
gi|325549987|gb|EGD20823.1| hypothetical protein XGA_0490 [Xanthomonas gardneri ATCC 19865]
Length = 791
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 140/402 (34%), Positives = 199/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A QTN YL+ L+ D L+ +F AG AY GWE T
Sbjct: 49 IRAVPLAQVRLTPS-LFLDALQTNRRYLMRLEPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +V+ L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAHYLVAELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 KIESGRAVFDELKKGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVGL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q V + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQAVFSALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P + L + E C +YNMLK++RHL++W + + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSTSKFLTEQTCEHCASYNMLKLTRHLYQWGPQAEFFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM P+ G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|373958137|ref|ZP_09618097.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373894737|gb|EHQ30634.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 789
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 141/388 (36%), Positives = 206/388 (53%), Gaps = 24/388 (6%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L DV+L S +A + + YLL ++ D L+ F+ +G GK Y GWE + L G
Sbjct: 52 LQDVRLLESPFK-QAMEKDAAYLLSVEPDRLLSGFRSHSGLTPKGKMYGGWE--SSGLAG 108
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF------- 219
H +GHYLSA + +AS+ N E++ +V L ECQ +GY+ A P E
Sbjct: 109 HTLGHYLSAISMQYASSRNPQFLERVNYIVKELKECQVARKTGYIGAIPKEDTIWAEIKK 168
Query: 220 ----DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITK 275
R L W+P+YT+HK++AGLLD Y + +N +AL + K M ++ +QN+
Sbjct: 169 GDIRSRGFDLNGGWSPWYTVHKVMAGLLDAYLYCNNAEALNICKGMGDWTGELLQNL--- 225
Query: 276 YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
+ E+ + L E GGM + L LY IT + +L ++ F L L+ D + G H
Sbjct: 226 -NDEQIQSMLLCEYGGMAETLVNLYAITGNKAYLATSYKFYDKRILNPLSENKDILPGKH 284
Query: 336 ANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAST 395
+NT IP VI S RYE+TG+ + F +I+ H YATGG S E+ S+P +L
Sbjct: 285 SNTQIPKVIASARRYELTGEKKDEDISVNFWNIITKDHSYATGGNSNYEYLSEPDKLNDK 344
Query: 396 LGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPL 455
L E+C TYNMLK++RHLF DYYE+AL N +L+ Q + G+M Y +PL
Sbjct: 345 LTENTTETCNTYNMLKLTRHLFSVNPSAALMDYYEKALYNHILASQNHDD-GMMCYFVPL 403
Query: 456 GRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G K + + F +F CC G+G+
Sbjct: 404 RMGGKKE-----YSSPFDTFTCCVGSGM 426
>gi|325915124|ref|ZP_08177450.1| hypothetical protein XVE_1336 [Xanthomonas vesicatoria ATCC 35937]
gi|325538646|gb|EGD10316.1| hypothetical protein XVE_1336 [Xanthomonas vesicatoria ATCC 35937]
Length = 791
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 197/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 MRAVPLAQVRLTPS-LFLDALNTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCATRAAYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + N QAL++ +
Sbjct: 166 QIESGRAVFDELKKGKIDSAPFYLNGSWAPLYTWHKLFAGLLDVHAHCGNAQALQVAVGL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + + L+ E GG+N+ L+ T D + L LA +
Sbjct: 226 AGY----LQGIFAALNDAQLQQVLSCEFGGLNESFVELHVQTDDAQWLALAQRLHHHAVI 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWQTVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RHL++W + V+ DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAVHFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|427384240|ref|ZP_18880745.1| hypothetical protein HMPREF9447_01778 [Bacteroides oleiciplenus YIT
12058]
gi|425727501|gb|EKU90360.1| hypothetical protein HMPREF9447_01778 [Bacteroides oleiciplenus YIT
12058]
Length = 777
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 139/397 (35%), Positives = 209/397 (52%), Gaps = 34/397 (8%)
Query: 103 KEVSLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
K + DV+L S LH A N +++ LD+D L+ +F+K A + Y+ WE +
Sbjct: 37 KYFGIQDVRLLESPFLH--AMNQNEQWMKELDLDRLLSNFRKNANLRPKAEPYDSWE--S 92
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP------ 215
+ GH +GH L+A + +A+T + T K K+ VV+ L CQ +G++ P
Sbjct: 93 MGIAGHTLGHLLTAMSQHYAATGDETFKTKIDYVVNELDSCQMNFVNGFIGGMPGGDKVF 152
Query: 216 ---------SEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFY 266
S FD L +W P+Y HK + GL D Y A N A K+ + +Y
Sbjct: 153 KEVKKGIIRSMGFD----LNGIWVPWYNEHKTMMGLNDAYLLAGNETAKKVLINLSDY-- 206
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
+ +VI + E+ LN E GGMN+ ++Y +T D K+L ++ F LA
Sbjct: 207 --LADVIAPLNEEQMQTMLNCEYGGMNEAFAQVYALTGDEKYLDASYAFYHKRLQDKLAE 264
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
D + G H+NT IP +IGS +YE+TG+ + F + + H YA GG S GE+
Sbjct: 265 GIDALQGLHSNTQIPKLIGSARQYELTGNQRDEKIARFSWETIVLHHSYANGGNSMGEYL 324
Query: 387 SDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP 446
S P +L+ LG+ E+C TYNMLK++ HL+ WT ++ Y DYYERAL N +L+ Q E
Sbjct: 325 SVPDKLSDRLGSNTCETCNTYNMLKLTGHLYEWTNDVQYLDYYERALYNHILASQH-PET 383
Query: 447 GVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G + Y L LG G ++ G+G+R ++F CC G+G
Sbjct: 384 GNVCYFLSLGMG-----THKGFGSRHNNFSCCMGSGF 415
>gi|436837799|ref|YP_007323015.1| hypothetical protein FAES_4423 [Fibrella aestuarina BUZ 2]
gi|384069212|emb|CCH02422.1| hypothetical protein FAES_4423 [Fibrella aestuarina BUZ 2]
Length = 781
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 141/375 (37%), Positives = 201/375 (53%), Gaps = 25/375 (6%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAG-SPTAGKAYEGWEDPTCELRGHFVGHYLSASAHM 179
A + + +LL L D L+ F+ AG +P A K Y GWE + L GH +GHYLSA A
Sbjct: 58 AMEADTRFLLNLQPDRLLAQFRAHAGLAPKAAK-YGGWE--SSGLAGHSLGHYLSALALQ 114
Query: 180 WASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF-----------DRFEALKPV 228
+A+T++ +++ +V L++CQ +GY+ A P E R L
Sbjct: 115 YAATNDPEYLKRVNYIVDELADCQRARKTGYVGAIPREDTVFAEVAQGNIRSRGFDLNGA 174
Query: 229 WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEE 288
W+P+YT+HK++AGLLD Y +A N +AL +T M ++ ++N +T V++ L E
Sbjct: 175 WSPWYTVHKVMAGLLDAYLYAHNDKALAVTVGMADWTGETLKN-LTDEQVQKM---LLCE 230
Query: 289 TGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQM 348
GGMNDVL +Y +T + K+L L++ F L LA Q D + G HANT +P +IG+
Sbjct: 231 YGGMNDVLANIYALTGNKKYLDLSYKFHDRVVLDSLAHQKDILPGRHANTQVPKLIGTIR 290
Query: 349 RYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYN 408
RYE+TG FF V H YA GG S E+ S P +L L E+C T+N
Sbjct: 291 RYELTGSQPDLAMSDFFWKTVVNHHTYAPGGNSNYEYLSTPDQLTDKLTDNTMETCNTHN 350
Query: 409 MLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGW 468
MLK++RHLF Y DYYERAL N +L+ Q + G++ Y +PL G K +
Sbjct: 351 MLKLTRHLFALQPNAAYMDYYERALYNHILASQH-HKTGMVCYFVPLRMGTRKH-----F 404
Query: 469 GTRFSSFWCCYGTGL 483
F CC GTG+
Sbjct: 405 SDEEEDFTCCVGTGM 419
>gi|325927064|ref|ZP_08188334.1| hypothetical protein XPE_2335 [Xanthomonas perforans 91-118]
gi|325542563|gb|EGD14035.1| hypothetical protein XPE_2335 [Xanthomonas perforans 91-118]
Length = 791
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 197/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +V L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVGL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D+++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|226325822|ref|ZP_03801340.1| hypothetical protein COPCOM_03635 [Coprococcus comes ATCC 27758]
gi|225205946|gb|EEG88300.1| hypothetical protein COPCOM_03635 [Coprococcus comes ATCC 27758]
Length = 761
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 130/388 (33%), Positives = 201/388 (51%), Gaps = 19/388 (4%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA-YEGWEDPTCELR 165
L V+L +L+++ Q+ EYLL +D D ++++F+K G T G GW++ +C+L+
Sbjct: 198 LGQVRLKEGTLYYKYQKLMEEYLLGIDDDQMLYNFRKATGLDTKGAPPMTGWDEESCKLK 257
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS------GYLSAFPSEQF 219
GH GHYLS A +A+T N+ +K+ +V+ L +CQ+ + G+LSA+ EQF
Sbjct: 258 GHTTGHYLSGIALAFAATGNLKFLDKVNYMVAELKKCQDAFAATGKYHRGFLSAYSEEQF 317
Query: 220 DRFEAL---KPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKY 276
D E +WAPYYT+ KI++GL D + A N A ++ M ++ Y+R+ + K
Sbjct: 318 DLLEVYTKYPEIWAPYYTLDKIMSGLYDCHVLAGNETAKEILDLMGDWVYDRLSR-LPKE 376
Query: 277 SVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
++++ W + E GGM + ++Y +T HL A LF+ + + D + H
Sbjct: 377 TLDKMWAMYIAGEFGGMLGTMVKVYELTGKENHLKAAKLFENEKLFYPMEEECDTLEDMH 436
Query: 336 ANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAST 395
AN HIP +IG+ Y TGD +Y G F +IV H Y GG E + S
Sbjct: 437 ANQHIPQIIGAMDLYRATGDEIYWEIGKNFWNIVTGGHTYCIGGVGETEMFHRANTTCSY 496
Query: 396 LGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPL 455
L + ESC +YNML+++ LF +T+ DYY+ L N +L+ G Y LPL
Sbjct: 497 LTDKAAESCASYNMLRLTSQLFEYTRSGNLMDYYDNTLRNHILTSSSHKCDGGTTYFLPL 556
Query: 456 GRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G G K S CC+GTG+
Sbjct: 557 GPGGRKE-------FFLSENSCCHGTGM 577
>gi|78048280|ref|YP_364455.1| hypothetical protein XCV2724 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78036710|emb|CAJ24403.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 791
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 197/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +V L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVSL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D+++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|289661682|ref|ZP_06483263.1| putative secreted protein, partial [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 756
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 197/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLMPS-LFLDALNTNRRYLMRLQPDRLLHNFVLYAGLDPQAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +V L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVGL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D+++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLVTQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRSGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|418517157|ref|ZP_13083324.1| hypothetical protein MOU_10158 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410706214|gb|EKQ64677.1| hypothetical protein MOU_10158 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 791
Score = 221 bits (564), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 140/402 (34%), Positives = 196/402 (48%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVGL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFAALDAAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAKLFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|317057297|ref|YP_004105764.1| hypothetical protein Rumal_2655 [Ruminococcus albus 7]
gi|315449566|gb|ADU23130.1| protein of unknown function DUF1680 [Ruminococcus albus 7]
Length = 602
Score = 221 bits (564), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 127/372 (34%), Positives = 197/372 (52%), Gaps = 25/372 (6%)
Query: 123 QTNLEYLLMLDVDSLVWSFQKTAG----------SPTAGKAYEGWEDPTCELRGHFVGHY 172
+ N YL LD L+ + AG P + + GWE P C+LRGHF+GH+
Sbjct: 22 ELNKRYLKELDTVCLMQNHYLEAGIILPDRQVISEPEKAELHWGWESPACQLRGHFLGHW 81
Query: 173 LSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPY 232
+SA+A + AS + L+ K+ +V L CQ + G ++ + P + F E+ + +W+P
Sbjct: 82 MSAAAMLSASDGDAELRAKLVKIVDELERCQQRNGGKWVGSIPEKYFKLMESEEYIWSPQ 141
Query: 233 YTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVER--HWNSLNEETG 290
YT+HK L GL+D Y FA +AL + + +++ + SVE+ + E G
Sbjct: 142 YTMHKTLMGLVDAYRFAGIQKALDIADRLADWY------IEWAASVEKTAPFTVFKGEQG 195
Query: 291 GMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRY 350
GM + LY +T DPK+ L ++ + L + ++ HAN IP+ G+ Y
Sbjct: 196 GMLEEWCILYELTNDPKYRKLMDIYRENGLYHKLEQHREALTDDHANASIPLSHGAARMY 255
Query: 351 EVTGDPLYK-VTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
++TG+ +K +T F+ V +AT G ++GEFW P + S LG ++E CT YNM
Sbjct: 256 DITGEERWKIITDEFWRQAVTERGMFATTGANSGEFWVPPHSMGSYLGDTDQEFCTVYNM 315
Query: 410 LKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWG 469
++++ L+R T + VYADY ERAL NG L+ Q+ G+ Y LPL G K WG
Sbjct: 316 VRLADFLYRRTGDTVYADYIERALYNGFLA-QQNMHSGMPAYFLPLSSGSRKK-----WG 369
Query: 470 TRFSSFWCCYGT 481
++ FWCC+GT
Sbjct: 370 SKRHDFWCCHGT 381
>gi|418520534|ref|ZP_13086583.1| hypothetical protein WS7_05828 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410703915|gb|EKQ62403.1| hypothetical protein WS7_05828 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 791
Score = 221 bits (564), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 141/402 (35%), Positives = 196/402 (48%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDTQCRTRTRYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPSPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVAL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q V + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGVFAALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|374321589|ref|YP_005074718.1| hypothetical protein HPL003_08660 [Paenibacillus terrae HPL-003]
gi|357200598|gb|AET58495.1| hypothetical protein HPL003_08660 [Paenibacillus terrae HPL-003]
Length = 755
Score = 221 bits (563), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 144/392 (36%), Positives = 210/392 (53%), Gaps = 24/392 (6%)
Query: 103 KEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC 162
K LH V +D L + A + N YLL L+ D L+ F++ AG YEGWE
Sbjct: 6 KAFDLHKVSIDSGPL-YHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--AR 62
Query: 163 ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--SEQFD 220
+ GH +GHYLS A M+AST + L E++ VV+ L CQN G+GY+S P E F+
Sbjct: 63 GISGHTLGHYLSGCALMFASTGDERLLERVNYVVNELEICQNNHGNGYISGIPRGKELFE 122
Query: 221 RFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQN 271
+A L W P YT+HK+ AGL D + A + +AL+M + ++ +++
Sbjct: 123 EVKAGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLARHPKALQMEIKLGDW----LED 178
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
V + ++ L+ E GGMN+VL L + + + L LA F L LA D +
Sbjct: 179 VFKGLNDDQVQQVLHCEFGGMNEVLTDLAEHSGEERFLRLAERFYHGEVLNDLADSRDTL 238
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP +IG+ +YE+TG P Y FF + V H Y GG S E + +P +
Sbjct: 239 AGRHANTQIPKIIGAARQYEMTGKPQYADLSRFFWERVVHKHSYVIGGNSYNEHFGEPGK 298
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
L LG E+C TYNMLK++RH+F W YADYYERA+ N +L+ Q+ + G + Y
Sbjct: 299 LNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-GRVCY 357
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ L G K+ + +++ F CC G+G+
Sbjct: 358 FVSLEMGGHKS-----FNSQYDDFTCCVGSGM 384
>gi|381170950|ref|ZP_09880102.1| Tat (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas citri pv. mangiferaeindicae LMG
941]
gi|380688673|emb|CCG36589.1| Tat (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas citri pv. mangiferaeindicae LMG
941]
Length = 791
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 140/402 (34%), Positives = 198/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVDL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFSVLDDTQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D+++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLIAQRDELAHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHAVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|346725400|ref|YP_004852069.1| hypothetical protein XACM_2511 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346650147|gb|AEO42771.1| hypothetical protein XACM_2511 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 791
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 138/402 (34%), Positives = 197/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +V L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVGELARCQAHAGDGYVAGFTRKDAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAMGL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFSALDEAQLQKVLSCEFGGLNESFVELHVRTDDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D+++ H+NT+IP +IG YEVTG+ FF V H Y GG
Sbjct: 282 DPLVAQRDELAHQHSNTNIPKLIGLAREYEVTGNAASGAAARFFWHTVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRSGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|390993493|ref|ZP_10263643.1| TAT (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|372551771|emb|CCF70618.1| TAT (twin-arginine translocation) pathway signal sequence domain
protein [Xanthomonas axonopodis pv. punicae str. LMG
859]
Length = 791
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 141/402 (35%), Positives = 196/402 (48%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDTQCRTRTRYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVAL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q V + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGVFAALEDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|146301615|ref|YP_001196206.1| hypothetical protein Fjoh_3876 [Flavobacterium johnsoniae UW101]
gi|146156033|gb|ABQ06887.1| protein of unknown function DUF1680 [Flavobacterium johnsoniae
UW101]
Length = 765
Score = 221 bits (562), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 137/398 (34%), Positives = 198/398 (49%), Gaps = 33/398 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
+K L +V+L+ +AQ +L+Y+L L+ D L+ + AG P Y WE +
Sbjct: 27 MKTFPLQEVRLEDGPFK-KAQDVDLKYILALNPDKLLAPYLIDAGLPVKSTRYGNWE--S 83
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF-- 219
L GH GHYLSA + M+AST N LK ++ ++S L+ CQ+K G+GY+ P +
Sbjct: 84 LGLDGHIAGHYLSALSMMYASTGNPELKNRLDYMISELARCQDKNGNGYVGGIPQGKVFW 143
Query: 220 DRFE---------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFY 266
DR L W P Y IHK+ AGL D Y + N QA +K+ W +E
Sbjct: 144 DRIHKGDIDGSSFGLNNTWVPIYNIHKLFAGLNDAYQYTGNQQAKEVLIKLGDWFIE--- 200
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
+I S ++ L E GG+N+ LY IT+D K+L A + FL L
Sbjct: 201 -----MIKPLSDDQIQKILKTEHGGINESFADLYLITKDKKYLETAQKISQKSFLESLIK 255
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
+ D ++G HANT IP VIG + ++ D + TFF D V A GG S E +
Sbjct: 256 KEDKLTGLHANTQIPKVIGFEKIASISADKEWSEAVTFFWDNVTQKRSVAFGGNSVSEHF 315
Query: 387 SDPKRLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
+ + L + E E+C +YNM ++S+ LF +EM Y D+YER L N +LS Q E
Sbjct: 316 NPVNDFSGMLKSNEGPETCNSYNMERLSKALFLEKQEMNYLDFYERTLYNHILSSQH-PE 374
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P+ + Y + +S WCC G+GL
Sbjct: 375 KGGFVYFTPI-----RPNHYRVYSQPETSMWCCVGSGL 407
>gi|302340651|ref|YP_003805857.1| hypothetical protein Spirs_4187 [Spirochaeta smaragdinae DSM 11293]
gi|301637836|gb|ADK83263.1| protein of unknown function DUF1680 [Spirochaeta smaragdinae DSM
11293]
Length = 764
Score = 221 bits (562), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 131/392 (33%), Positives = 207/392 (52%), Gaps = 19/392 (4%)
Query: 110 VKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWEDPTCELRGHF 168
V L S+ Q +++L+ D D ++++F+ AG T G GW+ P+C LRGH
Sbjct: 196 VMLKEGSVFCDEQDKMIQHLIDTDDDQMLYNFRVAAGVDTRGALPMTGWDAPSCNLRGHT 255
Query: 169 VGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM-----GSGYLSAFPSEQFDRFE 223
GHYLS+ A W+ T L +K+ ++ +LSECQN + G+LSA+ QFD E
Sbjct: 256 TGHYLSSLALGWSVTKKTELMDKIVYLIESLSECQNALEERGCSKGFLSAYSERQFDLLE 315
Query: 224 ALKP---VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVER 280
P +WAPYYT+ KI++GL D Y+ AD++ AL + M ++ Y R+ +++ +++
Sbjct: 316 TYTPYPTIWAPYYTLDKIMSGLYDCYSLADSSLALNILCKMGDWVYERLSR-LSRNQLDK 374
Query: 281 HWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTH 339
W+ + E GGM V+ +LYT+T+ +L A+ FD + D + HAN H
Sbjct: 375 MWSMYIAGEFGGMISVMVKLYTLTKKKTYLQTAYYFDNEKLFYPMQENIDTLKDMHANQH 434
Query: 340 IPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTE 399
IP ++G+ YE G Y F +IV ASH Y+ GG E + +P + + + +
Sbjct: 435 IPQIMGAVELYEADGSGRYYDIAKNFWNIVTASHVYSIGGIGETEMFHEPNEIMTYITDK 494
Query: 400 NEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGD 459
ESC +YN+L+++ LF E D+YE L N +LS G Y +PL G
Sbjct: 495 TAESCASYNILRLTGQLFALEPERRKMDFYETVLYNHILSSFSHKSDGGTTYFMPLRPGG 554
Query: 460 SKAKSYHGWGTRFSSFWCCYGTGL-GFIRWIQ 490
K + T+ ++ CC+G+GL R++Q
Sbjct: 555 HKE-----FNTKENT--CCHGSGLETRFRYVQ 579
>gi|294624781|ref|ZP_06703443.1| secreted protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB
11122]
gi|292600913|gb|EFF44988.1| secreted protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB
11122]
Length = 791
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 198/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
+ V L V+L PS L A TN YL+ L+ D L+ +F AG AY GWE T
Sbjct: 49 FRAVPLAQVRLTPS-LFLDALHTNRRYLMRLEPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +V+ L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVAELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FD--RFEALKPV-------WAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD R + P WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 KIESGRAVFDELRRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNAQALQVAVSL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFAALDDAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ + + E C +YNMLK++RHL++W + + DYYER L N VL+ Q
Sbjct: 342 DREYFQQPDSISKFVTEQTCEHCASYNMLKLTRHLYQWGPQAEFFDYYERTLLNHVLA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM P+ G+++A W + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPMLAGEARA-----WSSPFDDFWCCVGSGM 437
>gi|268316049|ref|YP_003289768.1| hypothetical protein Rmar_0478 [Rhodothermus marinus DSM 4252]
gi|262333583|gb|ACY47380.1| protein of unknown function DUF1680 [Rhodothermus marinus DSM 4252]
Length = 641
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 136/377 (36%), Positives = 195/377 (51%), Gaps = 27/377 (7%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A Q ++ YL LD D L+ F++ AG Y GWE + + GH +GHYLSA + +
Sbjct: 56 AMQRDVAYLFELDPDRLLSRFRRFAGLEPKAPEYGGWE--SQGISGHTLGHYLSALSMYY 113
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS-----EQFDRFE-------ALKPV 228
A+T + + ++ +VS L+E Q G+GY+ A P + R E +L
Sbjct: 114 AATGDEKARARIDYIVSELAEVQRAHGNGYVGAIPEGDRLWAEIARGEIWQAEPFSLNGA 173
Query: 229 WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNE 287
W P+YT+HKI GL+D Y + + QAL++ + ++ Y +N+ W L
Sbjct: 174 WVPWYTMHKIFQGLIDAYWYGGSEQALEVVTRLADWAYETTKNLTPA-----QWQQMLRT 228
Query: 288 ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQ 347
E GGMN+ L LY+IT +PKH L+ F L L+ +++G HANT IP VIG
Sbjct: 229 EHGGMNEALANLYSITGNPKHRELSEKFYHAAVLSPLSRGIPNLTGLHANTQIPKVIGVV 288
Query: 348 MRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTY 407
+YE+ G + FF + V H Y GG S E + LA+ LG E+C TY
Sbjct: 289 RQYELIGSDSLRAVAEFFWEEVVQHHTYVIGGNSQNEHFGPRDSLANRLGEGTAETCNTY 348
Query: 408 NMLKVSRHLFRWTKEMV-YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYH 466
NML+++RHLF E V Y D+YERAL N +L+ Q + G+ Y + L G K
Sbjct: 349 NMLRLTRHLFALHPEKVRYVDFYERALYNHILASQ-DPKRGMFTYYMSLRPGHFKT---- 403
Query: 467 GWGTRFSSFWCCYGTGL 483
+ T SFWCC GTG+
Sbjct: 404 -YATPEHSFWCCVGTGM 419
>gi|333381736|ref|ZP_08473415.1| hypothetical protein HMPREF9455_01581 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829665|gb|EGK02311.1| hypothetical protein HMPREF9455_01581 [Dysgonomonas gadei ATCC
BAA-286]
Length = 775
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 146/399 (36%), Positives = 208/399 (52%), Gaps = 35/399 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
LK SL DV+L SS A + ++LL + D + F+ +G Y GWE +
Sbjct: 35 LKPFSLSDVRL-TSSPFMSAMSLDEKWLLSFEPDRFLSGFRSESGLQPKAPKYGGWE--S 91
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMG-SGYLSAFP----- 215
+ G GHYLSA + M+AST N L +++ ++ L CQ G +G ++AFP
Sbjct: 92 QGVAGQTFGHYLSALSMMYASTGNEQLNDRIKYSINELDSCQQAFGMNGIVAAFPRAKGL 151
Query: 216 ----------SEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYF 265
+E FD L W P Y++HK+ AGL+D Y + N QA K+ + +
Sbjct: 152 FTEISTGDIRTEGFD----LNGGWVPLYSMHKLFAGLIDVYEYTGNKQAYKIYINLAD-- 205
Query: 266 YNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLA 325
V +++ S E+ L E GG+N+ L +Y +T + K+L LA + L L+
Sbjct: 206 --GVDKMLSGLSDEQIQKILICEHGGINESLAEVYALTGNKKYLNLATRLNHKAVLDPLS 263
Query: 326 VQADDISGFHANTHIPVVIGSQMRYEVTG-DPLYKVTGTFFMDIVNASHGYATGGTSAGE 384
D+++G HANT IP VIG YE+TG D L+K T FF + V SH Y GG S E
Sbjct: 264 KGVDELAGKHANTQIPKVIGVIREYELTGNDDLFK-TAEFFWNTVVHSHSYVIGGNSEAE 322
Query: 385 FWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT 444
+ R + + E+C TYNMLK+++HLF ++ ADYYERAL N +L+ Q
Sbjct: 323 HFGVAGRTYDRITDKTCENCNTYNMLKLTKHLFSLQPDIQKADYYERALYNQILASQ-NP 381
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G++ YM PL G S G+ T F SFWCC GTGL
Sbjct: 382 QDGMVCYMSPLAAG-----SRRGFSTPFDSFWCCVGTGL 415
>gi|289668636|ref|ZP_06489711.1| putative secreted protein [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 793
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 197/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLMPS-LFLDALNTNRRYLMRLQPDRLLHNFVLYAGLDPQAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 KIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNVQALQVAVSL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFSALDDAQLQKVLSCEFGGLNESFVELHVRTGDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RH+++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHVYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|390456178|ref|ZP_10241706.1| hypothetical protein PpeoK3_19346 [Paenibacillus peoriae KCTC 3763]
Length = 753
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 144/389 (37%), Positives = 204/389 (52%), Gaps = 24/389 (6%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
LH V +D L A + N YLL L+ D L+ F++ AG YEGWE +
Sbjct: 9 DLHKVSIDSGPL-CHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--ARGIS 65
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--SEQFDRFE 223
GH +GHYLS + M+AST + L E++ V+ L CQN G+GY+S P E F+ +
Sbjct: 66 GHTLGHYLSGCSLMYASTGDERLLERVNYVIDELEICQNSHGNGYISGIPRGKEIFEEVK 125
Query: 224 A---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
A L W P YT+HK+ AGL D Y + +AL M + ++ +++V
Sbjct: 126 AGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAYLLVHHPKALPMEIKLGDW----LEDVFR 181
Query: 275 KYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
E+ L+ E GGMN+VL L + + + L LA F L LA D ++G
Sbjct: 182 GLDDEQMQRVLHCEFGGMNEVLTDLAEHSGEERFLKLAERFYHGEVLNDLADSRDTLAGR 241
Query: 335 HANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAS 394
HANT IP +IG+ +YEVTG P Y FF D V H Y GG S E + +P +L
Sbjct: 242 HANTQIPKIIGAARQYEVTGKPHYADLSRFFWDRVVHKHSYVIGGNSYNEHFGEPGKLND 301
Query: 395 TLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLP 454
LG E+C TYNMLK++RH+F W YADYYERA+ N +L+ Q+ + G + Y +
Sbjct: 302 RLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-GRVCYFVS 360
Query: 455 LGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
L G K+ + +++ F CC G+G+
Sbjct: 361 LEMGGHKS-----FNSQYEDFTCCVGSGM 384
>gi|429199615|ref|ZP_19191363.1| hypothetical protein STRIP9103_08616 [Streptomyces ipomoeae 91-03]
gi|428664699|gb|EKX63974.1| hypothetical protein STRIP9103_08616 [Streptomyces ipomoeae 91-03]
Length = 655
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 135/382 (35%), Positives = 193/382 (50%), Gaps = 28/382 (7%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCELRGHFVGHYLSASAH 178
R + LEY D ++ F+ AG T G + GWE LRGH+ GH+L+ A
Sbjct: 10 RKRDLMLEYARSYPADRILAVFRANAGLDTRGARPPGGWETSDGNLRGHYGGHFLTLVAQ 69
Query: 179 MWASTHNVTLKEKMTAVVSALSECQNKMGS---------GYLSAFPSEQF---DRFEALK 226
+A T LK K+ +V AL+ECQ + G+L+A+P QF + +
Sbjct: 70 AYADTREAALKAKLDYLVGALAECQRTLAERGNPRPSHPGFLAAYPETQFILLESYTTYP 129
Query: 227 PVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS-L 285
+WAPYYT HKI+ GLLD +T A N +AL + M ++ ++R+ + K ++R W+ +
Sbjct: 130 TIWAPYYTCHKIMRGLLDAHTLAGNAEALTVASKMGDWVHSRLGR-LPKAQLDRMWSIYI 188
Query: 286 NEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIG 345
E GGMN+V+ LY +T +HL A FD L A D + G HAN HIP G
Sbjct: 189 AGEYGGMNEVMADLYALTGRAEHLAAARCFDNTALLDACAEDRDILDGRHANQHIPQFTG 248
Query: 346 SQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCT 405
++ TG+ Y F +V Y+ GGT GE + +A+TL +N E+C
Sbjct: 249 YLRMFDHTGEERYADAARNFWGMVAGHRTYSLGGTGQGEMFRARDAVAATLDDKNAETCA 308
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE----PGVMIYMLPLGRGDSK 461
TYNMLK+SR LF + Y D+YER LTN +L+ +R P V + +G G
Sbjct: 309 TYNMLKLSRQLFFRDPDPAYMDHYERGLTNHILASRRDARSTDGPEVTYF---VGMGPGV 365
Query: 462 AKSYHGWGTRFSSFWCCYGTGL 483
+ Y GT CC GTG+
Sbjct: 366 VREYGNIGT------CCGGTGM 381
>gi|384418897|ref|YP_005628257.1| hypothetical protein XOC_1936 [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353461810|gb|AEQ96089.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 791
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 138/402 (34%), Positives = 197/402 (49%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRIRAGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQVAVGL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDTASGAAARFFWHTVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RH+++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHVYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM P+ G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|449522353|ref|XP_004168191.1| PREDICTED: uncharacterized protein LOC101224273 [Cucumis sativus]
Length = 495
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 99/118 (83%), Positives = 105/118 (88%)
Query: 366 MDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVY 425
MDIVN+SH YATGGTS EFW DPKRLA LGTE EESCTTYNMLKVSR+LF+WTKE+ Y
Sbjct: 1 MDIVNSSHSYATGGTSVHEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTKEIAY 60
Query: 426 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
ADYYERALTNGVLSIQRGT+PGVMIYMLPLG G SKA SYHGWGT F SFWCCYGTG+
Sbjct: 61 ADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKAISYHGWGTPFESFWCCYGTGI 118
>gi|21243263|ref|NP_642845.1| hypothetical protein XAC2530 [Xanthomonas axonopodis pv. citri str.
306]
gi|21108798|gb|AAM37381.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 791
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 140/402 (34%), Positives = 196/402 (48%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDTQCRTRTGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + +N QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCENAQALQVAVAL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q V + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGVFAALDDAQLQKVLSCEFGGLNESFVELHVQTGDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLIAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVADHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++RHL++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKFLTEQTCEHCASYNMLKLTRHLYQWGPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPLLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|375306379|ref|ZP_09771677.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus sp.
Aloe-11]
gi|375081632|gb|EHS59842.1| acetyl-CoA carboxylase, biotin carboxylase [Paenibacillus sp.
Aloe-11]
Length = 753
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 144/393 (36%), Positives = 205/393 (52%), Gaps = 32/393 (8%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
LH V +D L + A + N YLL L+ D L+ F++ AG YEGWE +
Sbjct: 9 DLHKVSIDSGPL-YHAMELNAAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--ARGIS 65
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--SEQFDRFE 223
GH +GHYLS + M+A+T + L E+++ V+ L CQN G+GY+S P E F+ +
Sbjct: 66 GHTLGHYLSGCSLMYAATGDERLLERVSYVIDELEICQNNHGNGYISGIPRGKEIFEEVK 125
Query: 224 A---------LKPVWAPYYTIHKILAGLLDQYTFADNTQAL----KMTKWMVEYFYNRVQ 270
A L W P YT+HK+ AGL D + A + +AL K+ W+ +
Sbjct: 126 AGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLLAHHPKALPIEIKLGAWL--------E 177
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+V E+ L+ E GGMN+VL L + + + L LA F L LA D
Sbjct: 178 DVFRGLDDEQMQRVLHCEFGGMNEVLTDLAEHSGEERFLKLAERFYHGEVLNDLADSRDT 237
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
++G HANT IP +IG+ +YEVTG P Y FF D V H Y GG S E + +P
Sbjct: 238 LAGRHANTQIPKIIGAARQYEVTGKPHYADLSRFFWDRVVHKHSYVIGGNSYNEHFGEPG 297
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+L LG E+C TYNMLK++RH+F W YADYYERA+ N +L+ Q+ + G +
Sbjct: 298 KLNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-GRVC 356
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y + L G K + +++ F CC G+G+
Sbjct: 357 YFVSLEMGGHKT-----FNSQYEDFTCCVGSGM 384
>gi|84624616|ref|YP_451988.1| hypothetical protein XOO_2959 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84368556|dbj|BAE69714.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 791
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 137/402 (34%), Positives = 196/402 (48%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 49 VRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 107
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +VS L+ CQ G GY++ F +
Sbjct: 108 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRKNAAG 165
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 166 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQVAVGL 225
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 226 AGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 281
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 282 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGNG 341
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++ H+++W + DYYER L N V++ Q
Sbjct: 342 DREYFQQPDSISKCLTEQTCEHCASYNMLKLTCHVYQWCPQAELFDYYERTLLNHVMA-Q 400
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM P+ G+++ GW + F FWCC G+G+
Sbjct: 401 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGM 437
>gi|322433089|ref|YP_004210338.1| hypothetical protein AciX9_4244 [Granulicella tundricola MP5ACTX9]
gi|321165316|gb|ADW71020.1| protein of unknown function DUF1680 [Granulicella tundricola
MP5ACTX9]
Length = 800
Score = 218 bits (555), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 133/397 (33%), Positives = 199/397 (50%), Gaps = 35/397 (8%)
Query: 105 VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCEL 164
+ L+ V+L L +AQ + +YLL L + ++ ++ AG + Y GW+ P +L
Sbjct: 37 LPLNSVRLTGGPLK-KAQDLDAQYLLELQPERMLAFLRQRAGLEAKAQGYGGWDGPGRQL 95
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAF---------- 214
GH GHYLSA + M+A+T +V KE+ V+ L QN G GY+ A
Sbjct: 96 TGHIAGHYLSAISMMYATTGDVRFKERADEFVAELQTIQNAQGDGYIGALLDAKGVDGKV 155
Query: 215 ----------PSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEY 264
S FD L +W+P+Y HK+ AGL D Y + AL++ +E
Sbjct: 156 KFQDLSKGEIKSGGFD----LDGLWSPWYVEHKLFAGLRDAYHLTGDRTALEVE---IE- 207
Query: 265 FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
F V+ ++ + ++ L E GGMN+VL LY T D + + L+ F+ + L
Sbjct: 208 FAGWVEGILKNLNEDQIQRMLATEFGGMNEVLADLYADTNDTRWMKLSDKFEHHAIVDPL 267
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+ D ++G HANT+IP +IG RYE TGD FF D V+ H +ATGG E
Sbjct: 268 SQGQDILAGKHANTNIPKMIGELARYEYTGDEKDGKAANFFFDEVSLHHSFATGGDGKNE 327
Query: 385 FWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT 444
++ P ++ + ESC YNM+K++R LF + YAD+ ERA N +L Q
Sbjct: 328 YFGQPDKMNDMIDGRTAESCAAYNMIKMARTLFSLDPQARYADFVERADLNAILGGQD-P 386
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
+ G + YM+P+GRG H + +F SF CC G+
Sbjct: 387 DDGRVSYMVPVGRG-----VQHEYQNKFESFTCCVGS 418
>gi|58582735|ref|YP_201751.1| hypothetical protein XOO3112 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|188577523|ref|YP_001914452.1| hypothetical protein PXO_01470 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|58427329|gb|AAW76366.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|188521975|gb|ACD59920.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
PXO99A]
Length = 783
Score = 218 bits (555), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 137/402 (34%), Positives = 196/402 (48%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ V L V+L PS L A TN YL+ L D L+ +F AG AY GWE T
Sbjct: 41 VRAVPLAQVRLTPS-LFLDALHTNRRYLMRLQPDRLLHNFVLYAGLDPKAPAYGGWEADT 99
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--- 218
+ GH +GHYLSA A M A T + + + +VS L+ CQ G GY++ F +
Sbjct: 100 --IAGHTLGHYLSALALMHAQTGDAQCRTRAGYLVSELARCQAHAGDGYVAGFTRKNAAG 157
Query: 219 --------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
FD + L WAP YT HK+ AGLLD + DN QAL++ +
Sbjct: 158 QIESGRAVFDELKRGKIDPAPFYLNGSWAPLYTWHKLFAGLLDVHAHCDNPQALQVAVGL 217
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
Y +Q + + + L+ E GG+N+ L+ T D + L LA L
Sbjct: 218 AGY----LQGIFSALDDTQLQKVLSCEFGGLNESFVELHVRTNDAQWLALAQRLHHHAVL 273
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L Q D++ H+NT+IP +IG YEVTGD FF V H Y GG
Sbjct: 274 DPLVAQRDELVHQHSNTNIPKLIGLAREYEVTGDAASGAAARFFWHTVTDHHTYVIGGNG 333
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ P ++ L + E C +YNMLK++ H+++W + DYYER L N V++ Q
Sbjct: 334 DREYFQQPDSISKCLTEQTCEHCASYNMLKLTCHVYQWGPQAELFDYYERTLLNHVMA-Q 392
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM P+ G+++ GW + F FWCC G+G+
Sbjct: 393 QHPRTGMFTYMTPMLAGEAR-----GWSSPFDDFWCCVGSGM 429
>gi|339021543|ref|ZP_08645591.1| hypothetical protein ATPR_1899 [Acetobacter tropicalis NBRC 101654]
gi|338751393|dbj|GAA08895.1| hypothetical protein ATPR_1899 [Acetobacter tropicalis NBRC 101654]
Length = 799
Score = 217 bits (553), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 141/408 (34%), Positives = 205/408 (50%), Gaps = 37/408 (9%)
Query: 98 AGDFLKEVSLHDVKLDPSSLHW-RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG 156
AG+ + V L DV+L PS HW A ++N YLL L D L+ +F++ AG P G+ Y G
Sbjct: 40 AGESVTPVPLQDVRLLPS--HWLDAVESNRAYLLSLSADRLLHNFRRQAGLPPKGEVYGG 97
Query: 157 WEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS 216
WE+ T + GH +GHYLSA A M+A T + + ++ +V L+ Q+K G GY++ F
Sbjct: 98 WENDT--IAGHTLGHYLSALALMYAQTGDTECRRRVAYIVQELAIVQDKWGDGYVAGFTR 155
Query: 217 EQ-----------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALK 256
++ F E L W+P Y IHK AGL D T+ + AL
Sbjct: 156 KEKDGTITDGKVIFAEMEKGDIRSGGFDLNGAWSPLYNIHKTFAGLFDAQTYCQDPNALA 215
Query: 257 MTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLA-HLF 315
+ + +F + +K + + L E GG+N+ L T D K L LA +
Sbjct: 216 VAVKLGGFF----EAFYSKLTDAQLQKVLTCEYGGLNESFAELAARTGDAKWLRLAKRTY 271
Query: 316 DKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGY 375
D+P L+A + DD++ HANT IP +IG EV+ D ++V FF V H Y
Sbjct: 272 DRPVLDPLMA-RHDDLANRHANTQIPKLIGLGRIAEVSRDAHWQVGPRFFWQAVTQHHSY 330
Query: 376 ATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTN 435
GG + E++S+P ++ + + E C TYNMLK++R L+ W + DYYERA N
Sbjct: 331 VIGGNADREYFSEPDTISQHITEQTCEHCNTYNMLKLTRQLYTWQPDSALFDYYERAHLN 390
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
VL+ + G+ YM P + W T SFWCC GTG+
Sbjct: 391 HVLAAH-DPQTGMFTYMTP-----TITAGVREWSTPTDSFWCCVGTGM 432
>gi|374983575|ref|YP_004959070.1| putative glycosylase [Streptomyces bingchenggensis BCW-1]
gi|297154227|gb|ADI03939.1| putative glycosylase [Streptomyces bingchenggensis BCW-1]
Length = 713
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 135/381 (35%), Positives = 192/381 (50%), Gaps = 26/381 (6%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCELRGHFVGHYLSASAH 178
R + L Y D ++ F+ AG T G + GWE LRGH+ GH+L+ A
Sbjct: 68 RKRDLMLGYARSYPADRILAVFRANAGLDTRGARPPGGWETSDGNLRGHYGGHFLTLIAQ 127
Query: 179 MWASTHNVTLKEKMTAVVSALSECQNKMGS---------GYLSAFPSEQF---DRFEALK 226
+A T LK K+ +V AL ECQ + GYL+A+P QF + +
Sbjct: 128 AYADTREAALKTKLDYLVGALGECQKALADHGSPIPSHPGYLAAYPETQFILLESYTTYP 187
Query: 227 PVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS-L 285
+WAPYYT HKI+ GLLD +T N QAL++ M ++ ++R+ + + +ER W+ +
Sbjct: 188 TIWAPYYTCHKIMRGLLDAHTLGGNQQALQIASGMGDWVHSRLGH-LPAAQLERMWSIYI 246
Query: 286 NEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIG 345
E GGMN+VL LY +T +HL A FD L A D + G HAN HIP G
Sbjct: 247 AGEYGGMNEVLADLYALTGRAEHLAAARCFDNTALLKACAENRDILEGRHANQHIPQFTG 306
Query: 346 SQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCT 405
++ T Y F +V S Y+ GGT GE + +A+TL +N E+C
Sbjct: 307 YLRLFDHTAKQEYSSAARNFWGMVTGSRMYSLGGTGQGEMFRARGAIAATLDDKNAETCA 366
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR---GTEPGVMIYMLPLGRGDSKA 462
TYNMLK++R LF + Y DYYER LTN +L+ +R T+ + Y + +G G
Sbjct: 367 TYNMLKLTRQLFFHQPDPAYMDYYERGLTNHILASRRDAAATDSPEVTYFVGMGPG--VR 424
Query: 463 KSYHGWGTRFSSFWCCYGTGL 483
+ + GT CC GTG+
Sbjct: 425 REFDNTGT------CCGGTGM 439
>gi|308067040|ref|YP_003868645.1| hypothetical protein PPE_00225 [Paenibacillus polymyxa E681]
gi|305856319|gb|ADM68107.1| DUF1680 domain containing protein [Paenibacillus polymyxa E681]
Length = 752
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 145/396 (36%), Positives = 207/396 (52%), Gaps = 32/396 (8%)
Query: 103 KEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC 162
K LH V++D L A + N YLL L+ D L+ F++ AG YEGWE
Sbjct: 4 KAFDLHKVRIDSGPL-LHAMELNTAYLLSLEPDRLLSRFREYAGLEPKAAHYEGWE--AR 60
Query: 163 ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--SEQFD 220
+ GH +GHYLS A M+AST + L E++ VV L CQN G+GY+S P E F+
Sbjct: 61 GISGHTLGHYLSGCALMFASTGDERLLERVNYVVDELEICQNSHGNGYISGIPRGKEIFE 120
Query: 221 RFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQAL----KMTKWMVEYFYN 267
+A L W P YT+HK+ AGL D + A + +AL K+ W+
Sbjct: 121 EVKAGDIRSQGFDLNGGWVPLYTMHKLFAGLRDAHLPAHHPKALSIEIKLGNWL------ 174
Query: 268 RVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQ 327
++V+ ++ L+ E GGMN+VL L + + + L LA F L LA
Sbjct: 175 --EDVLQGLDDDQVQQVLHCEFGGMNEVLTDLAEHSGEERFLSLAERFYHGEVLNDLADS 232
Query: 328 ADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWS 387
D ++G HANT IP +IG+ ++E+TG P Y FF D V H Y GG S E +
Sbjct: 233 QDTLAGRHANTQIPKIIGAARQFEMTGKPQYADLSRFFWDRVVHKHSYVIGGNSYNEHFG 292
Query: 388 DPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPG 447
+P +L LG E+C TYNMLK++RH+F W YADYYERA+ N +L+ Q+ + G
Sbjct: 293 EPGKLNDRLGEGTCETCNTYNMLKLTRHMFEWDAYAAYADYYERAMFNHILASQQPVD-G 351
Query: 448 VMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ Y + L G K+ + +++ F CC G+G+
Sbjct: 352 RVCYFVSLEMGGHKS-----FNSQYEDFTCCVGSGM 382
>gi|336428272|ref|ZP_08608256.1| hypothetical protein HMPREF0994_04262 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336006508|gb|EGN36542.1| hypothetical protein HMPREF0994_04262 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 601
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 132/383 (34%), Positives = 199/383 (51%), Gaps = 19/383 (4%)
Query: 108 HDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG-----SPTAGKAYEGWEDPTC 162
V+L S + R Q N + LL L+ S+ AG S + GWE PT
Sbjct: 11 QQVRLLDSEIR-RRFQVNEDLLLRYQSKDLLRSYYFEAGLWKDNSENPKIEHWGWEGPTS 69
Query: 163 ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRF 222
E+RGHFVGH+LSA+A +AS N L + ++ L CQ G ++ A P +Q
Sbjct: 70 EIRGHFVGHWLSAAAITYASDGNRELLGRAEYMLDELERCQKANGGEWIGAIPEKQLRWT 129
Query: 223 EALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHW 282
E + P Y +HKI+ GL+D Y +A N +AL++ ++FY V+++ T +R
Sbjct: 130 EEGRNFGVPLYNLHKIIMGLIDMYVYAGNCKALEIVGHFADWFYRWVKDIPT----DRMD 185
Query: 283 NSLNEETGGMNDVLYRLYTITQDPKH-LLLAHLFDKPCFLGLLAVQADDISGFHANTHIP 341
+ ETGG+ + RLY IT + K+ +L+ +P F LL D ++ HANT IP
Sbjct: 186 IIMETETGGILEEWCRLYEITGEEKYQVLMEKFLRRPLFHALLE-NKDVLTNMHANTTIP 244
Query: 342 VVIGSQMRYEVTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN 400
++G YEVTG+P Y K ++ V G+ TGG ++GE W P + LG N
Sbjct: 245 EILGIARMYEVTGNPEYLKAVKNYWSIAVTKRGGFVTGGQTSGEVWIPPFHIRERLGKLN 304
Query: 401 EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
+E C YNM++++ L+++T ++ + +Y E L NG+L+ Q+ G Y LP+ G
Sbjct: 305 QEHCAVYNMMRLAEFLYQYTGDIEFENYRELNLYNGILA-QQNPNTGAAAYYLPMQAGSR 363
Query: 461 KAKSYHGWGTRFSSFWCCYGTGL 483
K W T SFWCC G+G+
Sbjct: 364 KI-----WSTEKKSFWCCCGSGI 381
>gi|354580825|ref|ZP_08999729.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
gi|353201153|gb|EHB66606.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
Length = 623
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 129/379 (34%), Positives = 194/379 (51%), Gaps = 26/379 (6%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGS----PTAGKAYEGWEDPTCELRGHFVGHYLSA 175
R ++ N YL+ LD L++++Q AG A+ GWE P C+LRGHF+GH+LS
Sbjct: 18 RRERANRSYLMKLDSGHLLFNYQLEAGRFHGRTIPEGAHGGWETPVCQLRGHFLGHWLSG 77
Query: 176 SAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTI 235
+A + + ++ LK K+ A+V L ECQ G ++ P + K +WAP Y +
Sbjct: 78 AAMHYEKSGDMELKAKLDAIVQELHECQRDNGGQWVGPIPEKYLHWIARGKSIWAPQYNL 137
Query: 236 HKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDV 295
HKIL GL+D + +A N QAL + ++F N ++ E+ + L+ ETGGM +V
Sbjct: 138 HKILMGLVDAWQYAGNRQALDIVDRFADWFVNWSGT----FTREQFDDILDVETGGMLEV 193
Query: 296 LYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGD 355
L IT K+ +L + + L D ++ HANT IP V+G YEVTGD
Sbjct: 194 WADLLHITGADKYRVLLERYYRSRLFQPLLEGKDPLTNMHANTTIPEVLGCARAYEVTGD 253
Query: 356 PLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSR 414
+ + ++ V ATGG +AGE W ++ + LG +N+E CT YNM++++
Sbjct: 254 DRWLSIVQAYWKCAVTERGSLATGGQTAGEVWMPKMKMKARLGDKNQEHCTVYNMIRLAE 313
Query: 415 HLFRWTKEMVYADYYERALTNGVL------------SIQRGTEPGVMIYMLPLGRGDSKA 462
LFR T + YA Y E L NG++ S + G++ Y LP+ G K
Sbjct: 314 FLFRQTGDPSYAQYIEYNLYNGIMAQAYYQEYGLTGSQHKHPHTGLLTYFLPMKAGLRKE 373
Query: 463 KSYHGWGTRFSSFWCCYGT 481
W T SF+CC+GT
Sbjct: 374 -----WSTETDSFFCCHGT 387
>gi|374313035|ref|YP_005059465.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358755045|gb|AEU38435.1| protein of unknown function DUF1680 [Granulicella mallensis
MP5ACTX8]
Length = 798
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 136/401 (33%), Positives = 197/401 (49%), Gaps = 32/401 (7%)
Query: 103 KEVSLHDVKLDPSSLHW------RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG 156
++V L V L SS+ RAQ + +YLL L + ++ ++ A + Y G
Sbjct: 28 QKVQLKAVPLPFSSVRLTGGPLKRAQDLDAQYLLDLQPERMLARLRQRANLAPKAEGYGG 87
Query: 157 WEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAF-P 215
W+ +L GH GHYLSA + M+A+T +V K + V+ L QN G GY+ A
Sbjct: 88 WDGDGRQLTGHIAGHYLSAISMMYATTGDVRFKNRADDFVTELQNIQNAQGDGYIGALLD 147
Query: 216 SEQFD---RFEALKP------------VWAPYYTIHKILAGLLDQYTFADNTQALKMTKW 260
++ D RF+ L +W+P+Y HK+ AGL D Y N +AL +
Sbjct: 148 AKGVDGKVRFQDLSKGEIHSGGFDLNGLWSPWYVEHKLFAGLRDAYHLTGNRKALDVEI- 206
Query: 261 MVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCF 320
F + ++ S E+ L E GGMN+VL LY T DP+ L L+ F+
Sbjct: 207 ---KFAGWAETIVGHLSDEQLQRMLATEFGGMNEVLADLYADTNDPRWLKLSDKFEHHAI 263
Query: 321 LGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGT 380
+ L+ D ++G HANT IP +IG RY TGD FF D V+ H +ATGG
Sbjct: 264 VDPLSRGQDILAGKHANTQIPKMIGELARYVYTGDETDGKAAMFFFDEVSEHHSFATGGD 323
Query: 381 SAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSI 440
E++ P ++ + ESC YNM+K++R LF + YAD+ ERA N +L
Sbjct: 324 GKNEYFGQPDKMNDMIDGRTAESCAAYNMIKMARDLFSLDPQARYADFIERADLNAILGG 383
Query: 441 QRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
Q E G + YM+P+GRG H + +F SF CC G+
Sbjct: 384 QD-PEDGRVSYMVPVGRG-----VQHEYQDKFESFTCCVGS 418
>gi|367031082|ref|XP_003664824.1| hypothetical protein MYCTH_55581 [Myceliophthora thermophila ATCC
42464]
gi|347012095|gb|AEO59579.1| hypothetical protein MYCTH_55581 [Myceliophthora thermophila ATCC
42464]
Length = 608
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 135/376 (35%), Positives = 190/376 (50%), Gaps = 18/376 (4%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCELRGHFVGHYLSASAHMW 180
Q + YL +DVD L+++F+ G T G + GW+ P R H GH+L+A +H +
Sbjct: 26 QNRTVTYLKWVDVDRLLYNFRANHGLSTQGARQNGGWDAPDFPFRTHVQGHFLTAWSHCY 85
Query: 181 ASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQFDRFEA--LKPVWAPYY 233
AS + +++ T V+ L++CQ G+GYLS FP +FD EA L PYY
Sbjct: 86 ASLRDDACRDRATYFVAELAKCQANNDAVGFGAGYLSGFPESEFDALEARTLSNGNVPYY 145
Query: 234 TIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMN 293
IHK +AGLLD + +T A + + + +R + S E+ L E GGMN
Sbjct: 146 AIHKTMAGLLDVWRHVGDTTARDVLLALAGWVDSRTG----RLSYEQMQAVLGTEFGGMN 201
Query: 294 DVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVT 353
DVL L T DP+ L +A FD LA + D + G HANT +P IG+ + Y+ T
Sbjct: 202 DVLTELSLQTGDPRWLEVAQRFDHAAVFDPLASRQDRLDGLHANTQVPKWIGAVLEYKAT 261
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVS 413
G Y+ + +H YA GG S E + +P +A L + E+C TYNML+++
Sbjct: 262 GTARYRDIAANAWNFTVGAHSYAIGGNSQAEHFHEPDAIAKYLLEDTAEACNTYNMLRLT 321
Query: 414 RHLFRW-TKEMVYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGDSKAKSYHG 467
R L+ Y D+YERAL N +L Q +P G + Y PL RG A
Sbjct: 322 RELWMLDPASTAYFDFYERALLNHLLGQQNPADPHGHVTYFTPLNPGGRRGVGPAWGGGT 381
Query: 468 WGTRFSSFWCCYGTGL 483
W T + SFWCC GT L
Sbjct: 382 WSTDYDSFWCCQGTAL 397
>gi|383779461|ref|YP_005464027.1| hypothetical protein AMIS_42910 [Actinoplanes missouriensis 431]
gi|381372693|dbj|BAL89511.1| hypothetical protein AMIS_42910 [Actinoplanes missouriensis 431]
Length = 777
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 141/401 (35%), Positives = 200/401 (49%), Gaps = 25/401 (6%)
Query: 99 GDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG-W 157
G+ E V+L S L Q + YL +DV+ +++ F+ TAG A G W
Sbjct: 48 GNAASEFMPGQVRLTASRL-LDNQNRTMNYLRFVDVNRMLYVFRANHRLSTAGAAANGGW 106
Query: 158 EDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLS 212
+ P R H GH+L+A A +A T + T ++K +V+ L++CQ +GYLS
Sbjct: 107 DAPNFPFRSHMQGHFLTAWAQAYAYTGDTTCRDKADYMVAELAKCQANNAVAGFNAGYLS 166
Query: 213 AFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNR 268
FP D E+ KP+ YY IHK LAGLLD + NTQA LK+ W V++ R
Sbjct: 167 GFPESDLDAVESGKPIAVSYYCIHKTLAGLLDVWRLIGNTQAKDVLLKLAGW-VDWRTGR 225
Query: 269 VQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQA 328
+ S + +L E GGMN+VL LY T D + L +A FD LA
Sbjct: 226 L-------SYSQMQTTLQTEFGGMNEVLANLYQQTGDARWLRVAQRFDHAAIFDPLAANR 278
Query: 329 DDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSD 388
D+++G HANT+IP +G+ ++ TG Y+ +I +H YA GG S E +
Sbjct: 279 DELNGKHANTNIPKWVGAIREFKATGTTRYRDIAGNAWNITVGAHTYAIGGNSQAEHFKA 338
Query: 389 PKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMV-YADYYERALTNGVLSIQRGTEP- 446
P +A L + E C TYNMLK++R L++ Y D+YE AL N ++ Q +
Sbjct: 339 PNAIAGYLTNDTCEQCNTYNMLKLTRELWQLDPNRAGYFDFYENALYNHLIGAQNPADSH 398
Query: 447 GVMIYMLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G + Y PL RG A W T ++SFWCC GTG+
Sbjct: 399 GHITYFTPLKAGGRRGVGPAWGGGTWSTDYNSFWCCQGTGI 439
>gi|307109022|gb|EFN57261.1| hypothetical protein CHLNCDRAFT_143813 [Chlorella variabilis]
Length = 349
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 121/267 (45%), Positives = 156/267 (58%), Gaps = 10/267 (3%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
SL DV+L S + R + N EYLL L+ D L+++F+KTAG P G +Y GWE E+R
Sbjct: 27 SLADVQLARGSEYARNFEQNSEYLLALEPDRLLYNFRKTAGLPAPGASYGGWEWSGVEIR 86
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEAL 225
GHFVGHYLSA A + L+E+ +VS L + Q+ G+GYLSAFP FDR EAL
Sbjct: 87 GHFVGHYLSALALATLHSGRPELRERCGVMVSELKKVQDAAGTGYLSAFPESHFDRLEAL 146
Query: 226 KPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSL 285
+PV HKILAGLLDQ+ AL + M +F RV+ V+ + HW+ +
Sbjct: 147 QPV-------HKILAGLLDQHRLVGTAGALGAARRMASHFCARVRAVVAANGTD-HWHRV 198
Query: 286 NE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVI 344
E E GGMN+ LY LY IT+ P+H AH FDKP F LA D + G HANTH+ V
Sbjct: 199 LEVEFGGMNEALYNLYAITKSPEHAECAHFFDKPAFFRPLAEGRDPLPGLHANTHMAQVP 258
Query: 345 GSQMRYEVTGDPLYKV-TGTFFMDIVN 370
G RYE+ GD +V TFF ++
Sbjct: 259 GFTARYELLGDGEAQVAAATFFGTLLQ 285
>gi|408357216|ref|YP_006845747.1| hypothetical protein AXY_18530 [Amphibacillus xylanus NBRC 15112]
gi|407727987|dbj|BAM47985.1| hypothetical protein AXY_18530 [Amphibacillus xylanus NBRC 15112]
Length = 755
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 137/377 (36%), Positives = 192/377 (50%), Gaps = 31/377 (8%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHM 179
+QQ EYLL LD+D L+ + G Y GWE + E+ GH +GH+LSA++ M
Sbjct: 13 ESQQKGKEYLLYLDIDRLIAPCYEAVGQEPRAPRYGGWE--SMEIAGHSIGHWLSAASLM 70
Query: 180 WASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD-------RFE--ALKPVWA 230
+ T ++ LK K+ + L+ Q GY+S FP + FD R + L W
Sbjct: 71 YNVTGDLLLKHKIDYAIDELAHVQAFDPEGYVSGFPRDCFDEVFTGEFRVDNFGLGGSWV 130
Query: 231 PYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSLN 286
P+Y+IHKI AGL+D Y A N +A +K++ W ++K + E+ L
Sbjct: 131 PWYSIHKIYAGLVDAYRLASNEKAKTVLVKLSNW--------ADQGLSKLNDEQFQRMLI 182
Query: 287 EETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGS 346
E GGMN+ + +Y IT D + L LA F+ L L DD++G HANT IP VIG+
Sbjct: 183 CEFGGMNETMADVYEITGDKRFLKLAERFNHKAVLDPLIEGIDDLAGKHANTQIPKVIGA 242
Query: 347 QMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTT 406
Y++TG Y+ FF D V YA GG S E + LG + E+C T
Sbjct: 243 AKLYDMTGKEEYQKLSRFFWDQVVYHRSYAFGGNSNAEHFGPVD--TEPLGIISTETCNT 300
Query: 407 YNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYH 466
YNMLK++ HLF W + Y DYYE AL N +L Q E G+ Y +P G K
Sbjct: 301 YNMLKLTEHLFDWQPDSRYMDYYENALYNHILGSQ-DPESGMKSYFIPTEPGHFKV---- 355
Query: 467 GWGTRFSSFWCCYGTGL 483
+ + +SFWCC G+G+
Sbjct: 356 -YCSPDNSFWCCTGSGM 371
>gi|371778346|ref|ZP_09484668.1| hypothetical protein AnHS1_13085 [Anaerophaga sp. HS1]
Length = 796
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 137/375 (36%), Positives = 196/375 (52%), Gaps = 25/375 (6%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A + N + LL + D L+ F++ A + Y GWE + L GH +GHYLSA + M+
Sbjct: 63 ASKLNEKILLNYEPDRLLAHFREQAHLKPKAQHYGGWEGES--LTGHSLGHYLSACSMMY 120
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--FDRFEA----------LKPV 228
+T N +++ +V+ L Q G GYL AF + + F+ A L +
Sbjct: 121 KTTGNEEFLKRVNYIVNELDTVQKAHGDGYLGAFDNGKKIFEEEIANGNIRSAGFDLNGI 180
Query: 229 WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEE 288
WAP YT HKI+AGL+D Y N +AL++ + ++ + V+N+ S E L+ E
Sbjct: 181 WAPIYTQHKIMAGLMDAYKLCGNKKALEVEQKFADWLGSIVENL----SHEEIQKMLHCE 236
Query: 289 TGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQM 348
GG+N+ L+ +T + ++L +A LF L LA D + G HANT IP +IG
Sbjct: 237 HGGINEAYAELFAVTGNERYLKIARLFHHEAVLDPLAKGIDILPGHHANTQIPKIIGLSR 296
Query: 349 RYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYN 408
YE+TGD + T FF + V H Y TGG E++ P L++ L + E+C YN
Sbjct: 297 LYELTGDTTDRKTAQFFWERVVYHHSYVTGGNGDHEYFGPPDTLSNRLSSNTTETCNVYN 356
Query: 409 MLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGW 468
MLK+S HLF+W E ADYYERAL N +LS Q + G +IY L L G K +
Sbjct: 357 MLKLSNHLFKWEAEAEVADYYERALFNHILSSQH-PQSGHVIYNLSLEMGGHKH-----Y 410
Query: 469 GTRFSSFWCCYGTGL 483
F F CC GTG+
Sbjct: 411 QNPF-GFTCCVGTGM 424
>gi|427384528|ref|ZP_18881033.1| hypothetical protein HMPREF9447_02066 [Bacteroides oleiciplenus YIT
12058]
gi|425727789|gb|EKU90648.1| hypothetical protein HMPREF9447_02066 [Bacteroides oleiciplenus YIT
12058]
Length = 1145
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 139/393 (35%), Positives = 203/393 (51%), Gaps = 22/393 (5%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWED 159
D L+ L V+L PS AQQ + ++LL LD D L+ F K AG P G+ Y GWE+
Sbjct: 401 DQLEPFRLSQVRLLPSPFK-HAQQLDAKWLLSLDPDRLLHRFHKNAGLPPKGENYGGWEE 459
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-- 217
+GHY+SA A MWAST K++ V++ L CQ G+GY+ +
Sbjct: 460 HRGGG--RGLGHYMSACAMMWASTGEPEFKQRTDYVINELERCQKARGTGYIGSVEDSIW 517
Query: 218 -QFDRFEA------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ 270
Q R + L P++ +HK+ AGL D Y + N +A + + ++ Y +
Sbjct: 518 TQVGRGDIRSTGFDLNGGIVPWFILHKLFAGLYDIYIYTGNEKAKTVLVNLCDWAYRQFG 577
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
N+ + E+ L E GGM +VL +Y+I D K+L ++H FD F L+ Q D
Sbjct: 578 NL----NDEQWQKMLACEHGGMLEVLANVYSIVGDKKYLDMSHWFDHKQFFSPLSHQVDS 633
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
++G HANT IP V+G + R+++T KV FF + V +H Y GG GE +
Sbjct: 634 LAGLHANTQIPKVVGLERRHQLTHSEEDKVKSHFFWETVVKNHTYCIGGNGDGEHFGPKG 693
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
L++ L E+C TYNMLK+++ L T + Y DYYE+AL N +L+ Q E G+
Sbjct: 694 ILSNRLSDRTAETCNTYNMLKLTKMLLAETGDTKYGDYYEKALYNHILASQ-NPETGMTT 752
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y +PL G K G+ + F +F CC GTG
Sbjct: 753 YYVPLVAGGKK-----GYSSAFETFTCCVGTGF 780
>gi|256377207|ref|YP_003100867.1| hypothetical protein Amir_3107 [Actinosynnema mirum DSM 43827]
gi|255921510|gb|ACU37021.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 771
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 145/419 (34%), Positives = 203/419 (48%), Gaps = 32/419 (7%)
Query: 85 IYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTN-LEYLLMLDVDSLVWSFQK 143
+ R P G + V+L P W Q L YL +D D L+++F+
Sbjct: 30 VARAASVPPARPDIGAAASAFDVGQVRLTPG--RWMDNQNRALSYLRFVDPDRLLYNFRA 87
Query: 144 TAGSPTAGKA-YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSEC 202
TAG A GWE P R H GH+L+A A WA + T +++ +V+ L++C
Sbjct: 88 NHRLSTAGAAPLAGWEAPDFPFRTHSQGHFLTAWAQAWAVLGDTTSRDRANHLVAELAKC 147
Query: 203 QNKMGS-----GYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA--- 254
Q + GYLS FP D EA P YY +HK LAGLLD + +TQA
Sbjct: 148 QANNAAAGFTAGYLSGFPESDLDALEAGTPKAVSYYALHKTLAGLLDVWRHLGSTQARDV 207
Query: 255 -LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAH 313
L+ W V++ R +++ +++R L E GGMN VL LY T D + L A
Sbjct: 208 LLRFAGW-VDWRTAR----LSQATMQR---VLATEFGGMNAVLADLYQQTGDARWLATAQ 259
Query: 314 LFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASH 373
FD LA D ++G HANT +P IG+ Y+ TG Y+ T +I A+H
Sbjct: 260 RFDHAAAFDPLAANQDRLNGLHANTQVPKWIGAAREYKATGTTRYRDIATNAWNITVAAH 319
Query: 374 GYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKE---MVYADYYE 430
Y GG S E + P +A+ L T+ E+C TYNMLK++R L W E Y D+YE
Sbjct: 320 TYVIGGNSQAEHFRAPNAIAAHLATDTAEACNTYNMLKLTREL--WLLEPTKAAYFDFYE 377
Query: 431 RALTNGVLSIQRGTEP-GVMIYMLPLGRGDSKAKSYHGWG-----TRFSSFWCCYGTGL 483
RAL N ++ Q + G + Y L G + ++ WG T +S+FWCC GTG+
Sbjct: 378 RALLNHLIGQQNPADAHGHICYFTGLNPGHRRGRTGPAWGGGTWSTDYSTFWCCQGTGI 436
>gi|220928430|ref|YP_002505339.1| hypothetical protein Ccel_0997 [Clostridium cellulolyticum H10]
gi|219998758|gb|ACL75359.1| protein of unknown function DUF1680 [Clostridium cellulolyticum
H10]
Length = 597
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 141/409 (34%), Positives = 213/409 (52%), Gaps = 34/409 (8%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
+L +KL R ++T +Y+ D++ L+ +F+K AG + + GWE C LR
Sbjct: 6 NLDKIKLSDKYFSVR-RETAKKYVNDFDINRLMHTFRKNAGIESLAEPLGGWESEECNLR 64
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEAL 225
GHFVGH+LSA + S ++ LK K +V ++EC ++ +GYLSAF E D E
Sbjct: 65 GHFVGHFLSACSKFAFSDNDDCLKTKADNIVKIMAECASE--NGYLSAFGEEMLDILETE 122
Query: 226 --KPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNV-------ITKY 276
+ VWAPYYT+HKIL GL+D Y F +N AL + + Y R + + I +
Sbjct: 123 EDRGVWAPYYTLHKILQGLVDCYLFLNNKTALSLAVNLAHYIRRRFERLSYWKTDGILRC 182
Query: 277 SVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHA 336
+ N +NE GG+ DVLY LY IT D K LA +F++ F+G LA D + HA
Sbjct: 183 T---RVNPVNE-FGGIGDVLYSLYEITGDRKIFDLADIFNRDYFIGNLAADRDVLEDLHA 238
Query: 337 NTHIPVVIGSQMRYEVTGDPLYK---------VTGTFFMDIVNASHG--YATGGTS-AGE 384
NTH+P+VI + R+ +TG+ YK + G F++ ++S + G S E
Sbjct: 239 NTHLPMVISAIHRFNLTGEYKYKHAAQNFYKYLLGRTFVNGNSSSKATSFKKGEVSEKSE 298
Query: 385 FWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT 444
W L ++L ESC +N K+ + LF WT++ + ++ E N VL+ T
Sbjct: 299 HWGAHNHLENSLTGGESESCCAHNTEKIVQQLFAWTEDERFLEHLEILKYNAVLN-STST 357
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGLGFIRWIQLSL 493
G+ Y P+G G K++ G F +FWCC GTG+ + IQ ++
Sbjct: 358 VTGLSQYQQPMGTG--VKKNFSGL---FDTFWCCTGTGIEAMSEIQKNI 401
>gi|354583886|ref|ZP_09002783.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
gi|353197148|gb|EHB62641.1| protein of unknown function DUF1680 [Paenibacillus lactis 154]
Length = 778
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 136/372 (36%), Positives = 195/372 (52%), Gaps = 23/372 (6%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
+Q+T YLL LDVD L+ + A Y GWE+ + GH +GH+LSA+A M
Sbjct: 27 SQETGKGYLLHLDVDRLMAPCYEAASLEPKKPRYGGWEE--TPIAGHSIGHWLSAAAAMI 84
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD-----RFE----ALKPVWAP 231
+T + L +K+ V+ L+ Q+ GY+S FP + FD FE +L W P
Sbjct: 85 DATSDEELLKKLVYAVNELAYVQSHDKDGYVSGFPRDCFDIVFTGDFEVHNFSLAGSWVP 144
Query: 232 YYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGG 291
+Y++HKI AGL+D Y QAL++ + ++ + + + E+ L E GG
Sbjct: 145 WYSLHKIFAGLIDAYRLTGIEQALEVVIRLADW----AKKGTDRLTDEQFQRMLICEHGG 200
Query: 292 MNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYE 351
MND + LY +T + +L LA F L LA D++ G HANT IP VIG+ YE
Sbjct: 201 MNDTMADLYRLTNNHAYLELAIRFCHRAILEPLARGVDELEGKHANTQIPKVIGAAKLYE 260
Query: 352 VTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLK 411
+TGD Y+ FF V + Y GG S E + + LG E E+C TYNMLK
Sbjct: 261 ITGDDFYRKAAEFFWKEVTRNRSYIIGGNSIFEHFRAANQ--EKLGVETAETCNTYNMLK 318
Query: 412 VSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTR 471
++ HLF W+++ Y D+YERAL N +L+ Q + G+ +Y + G K +GT
Sbjct: 319 LTDHLFGWSQDAEYMDFYERALYNHILASQ-DPDTGMKMYFVSTEPGHFKV-----YGTA 372
Query: 472 FSSFWCCYGTGL 483
SFWCC GTG+
Sbjct: 373 EHSFWCCTGTGM 384
>gi|402300545|ref|ZP_10820034.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus alcalophilus
ATCC 27647]
gi|401724312|gb|EJS97686.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus alcalophilus
ATCC 27647]
Length = 761
Score = 214 bits (545), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 132/367 (35%), Positives = 196/367 (53%), Gaps = 23/367 (6%)
Query: 126 LEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHN 185
++YLL LD+D LV F + A + Y GWE+ + GH +GH+LSA+A+M+ +T N
Sbjct: 19 MDYLLFLDIDRLVAPFYEAASLAPKKQRYGGWEE--TGISGHSLGHWLSAAAYMYRNTMN 76
Query: 186 VTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR-----FEA----LKPVWAPYYTIH 236
LK+K+ + L Q+ ++ FPS F++ FE L W P+Y++H
Sbjct: 77 RALKDKINKAIDELEYIQSVHDRNFIGGFPSTCFEKVFTGNFEVDHFTLAGHWVPWYSMH 136
Query: 237 KILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVL 296
K+ AGL+D Y N +AL + + ++ V++ + + + L E GGMNDV+
Sbjct: 137 KLFAGLIDVYKLVKNEKALSVVTKLADW----VESGTVRLTEAQFQKMLICEHGGMNDVM 192
Query: 297 YRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP 356
LY +TQ+ +L LA F + L L+ + D + G HANT IP VIG+ Y++T +
Sbjct: 193 AELYLLTQNQTYLQLAIRFCEQQILEPLSNRRDLLEGKHANTQIPKVIGAAKLYDITKEE 252
Query: 357 LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
YK TFF V Y GG S E + + TLG + E+C TYNMLK++ HL
Sbjct: 253 KYKTAATFFWQEVTRVRSYIIGGNSINEHFG--RVSDETLGVQTTETCNTYNMLKLTAHL 310
Query: 417 FRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFW 476
F W ++ Y D+YERAL N +L+ Q + G+ Y + G K YH + SFW
Sbjct: 311 FLWEQKSEYYDFYERALYNHILASQ-DPDSGMKAYFVSTEPGHFKV--YH---SPEDSFW 364
Query: 477 CCYGTGL 483
CC GTG+
Sbjct: 365 CCTGTGM 371
>gi|430751026|ref|YP_007213934.1| hypothetical protein Theco_2852 [Thermobacillus composti KWC4]
gi|430734991|gb|AGA58936.1| hypothetical protein Theco_2852 [Thermobacillus composti KWC4]
Length = 621
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 129/383 (33%), Positives = 199/383 (51%), Gaps = 34/383 (8%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK----AYEGWEDPTCELRGHFVGHYLSA 175
R +Q N YL+ L+ DSL+++++ AG + + A+ GWE P C+LRGHF+GH+LSA
Sbjct: 18 RREQANRAYLMKLNSDSLLFNYRLEAGRYSGREIPPWAHGGWESPVCQLRGHFLGHWLSA 77
Query: 176 SAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTI 235
+A + +T + LK K ++ L+ECQ G + P + A K +WAP Y +
Sbjct: 78 AAIHYHATGDAELKAKADGIIDELAECQKDNGGQWAGPIPEKYLHWIAAGKAIWAPQYNL 137
Query: 236 HKILAGLLDQYTFADNTQAL----KMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGG 291
HK+ GL+D + +A N +AL + W VE+ +++ ++ + L+ ETGG
Sbjct: 138 HKLFMGLVDSFQYAGNQKALDIADRFADWFVEW--------SGRFTRDQFDDILDVETGG 189
Query: 292 MNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYE 351
M +V L IT + K+ L + + L D ++ HANT IP V+G YE
Sbjct: 190 MLEVWADLLHITGNGKYKTLLERYYRGRLFQPLLEGKDPLTNMHANTTIPEVLGCARAYE 249
Query: 352 VTGDPLYKVTGTFFMDIVNASHGY-ATGGTSAGEFWSDPKRLASTLGTENEESCTTYNML 410
VTGD + + + G+ ATGG ++GE W ++ + LG +N+E CT YNM+
Sbjct: 250 VTGDSRWMDVVKAYWNCAVTERGFLATGGQTSGEVWMPKMKMKARLGDKNQEHCTVYNMM 309
Query: 411 KVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE------------PGVMIYMLPLGRG 458
+++ LFR T + YA Y E L NGV++ E G++ Y LP+ G
Sbjct: 310 RLAEFLFRHTGDPGYAQYREYNLYNGVMAQTYYREYALNGNPHNHPGTGLLTYFLPMKAG 369
Query: 459 DSKAKSYHGWGTRFSSFWCCYGT 481
K W T SSF+CC+GT
Sbjct: 370 LRK-----DWSTETSSFFCCHGT 387
>gi|374992736|ref|YP_004968231.1| hypothetical protein SBI_09982 [Streptomyces bingchenggensis BCW-1]
gi|297163388|gb|ADI13100.1| hypothetical protein SBI_09982 [Streptomyces bingchenggensis BCW-1]
Length = 733
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 139/371 (37%), Positives = 187/371 (50%), Gaps = 18/371 (4%)
Query: 127 EYLLMLDVDSLVWSFQKTAGSPTAGKAYEG-WEDPTCELRGHFVGHYLSASAHMWASTHN 185
YL +D D L+++F+ PT G A G W+ PT R H GH+L+A A ++A T +
Sbjct: 27 NYLRFVDADRLLYNFRANHRLPTNGAASNGGWDGPTFPFRTHVQGHFLTAWAQVYAVTGD 86
Query: 186 VTLKEKMTAVVSALSECQNKMGS-----GYLSAFPSEQFDRFEA--LKPVWAPYYTIHKI 238
T ++K +V+ L++CQ G+ GYLS FP F EA L PYY IHKI
Sbjct: 87 TTCRDKAAYMVAELAKCQANNGAAGFNGGYLSGFPESDFSALEAGTLSNGNVPYYVIHKI 146
Query: 239 LAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYR 298
LAGLLD + +TQA M + + R + S ++ ++L E GGMN VL
Sbjct: 147 LAGLLDVWRHMGSTQARDMLLSLAGWVDWRTG----RLSGQQMQSTLGTEFGGMNAVLSD 202
Query: 299 LYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLY 358
LY T D + L A FD LA D ++G HANT +P IG+ Y+ TG Y
Sbjct: 203 LYLQTSDSRWLTTAQRFDHGAVFDPLASNQDRLNGLHANTQVPKWIGAAREYKATGTTRY 262
Query: 359 KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFR 418
+ T +I +H Y GG S E + P +A+ L + ESC TYNML ++R LF
Sbjct: 263 RDIATNAWNICVNAHTYVIGGNSQAEHFRPPNAIAAYLNQDACESCNTYNMLTLTRELFT 322
Query: 419 WTKEMVYA-DYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGDSKAKSYHGWGTRF 472
+ V DYYERA N ++ Q + G + Y PL RG A W T +
Sbjct: 323 LDPDRVALFDYYERAWLNQMIGQQNPADNHGHVTYFTPLNPGGRRGVGPAWGGGTWSTDY 382
Query: 473 SSFWCCYGTGL 483
SFWCC GTGL
Sbjct: 383 DSFWCCQGTGL 393
>gi|383644433|ref|ZP_09956839.1| hypothetical protein SeloA3_13744 [Sphingomonas elodea ATCC 31461]
Length = 746
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 136/394 (34%), Positives = 196/394 (49%), Gaps = 32/394 (8%)
Query: 110 VKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFV 169
++L PS + A + N LL L+ D L+ +F+K AG GK Y GWE T + GH +
Sbjct: 4 IRLRPSD-YASAVEVNHRALLQLEPDRLLHNFRKYAGLEPKGKLYGGWESDT--IAGHTL 60
Query: 170 GHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD--------- 220
GHYL+A MW T + ++ + +V+ L+E Q K G+GY+ A ++ D
Sbjct: 61 GHYLTALVLMWQQTGDPEMRRRADYIVAELAEAQAKRGTGYVGALGRKRKDGTIVDGEEI 120
Query: 221 -----RFEA------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRV 269
R E L W+P YT+HK+ AGLLD + N QAL++T + YF
Sbjct: 121 FPEIMRGEIKSGGFDLNGSWSPLYTVHKVFAGLLDVHAGWGNAQALQVTLGLAGYF---- 176
Query: 270 QNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQAD 329
+ V + + L E GG+N+ LY T+D + +++A LG L D
Sbjct: 177 EKVFAALNDAQMQQMLGCEYGGLNESYAELYARTRDARWMVVAKRLYDDRVLGPLKAGED 236
Query: 330 DISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDP 389
++ FHANT +P +IG +E+TGD FF + V H Y GG + E++S P
Sbjct: 237 KLANFHANTQVPKLIGLARIHELTGDAGDATAARFFWERVTGHHSYVIGGNADREYFSAP 296
Query: 390 KRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+A + + E C TYNMLK++ HLF W V DYYERA N V++ Q + G
Sbjct: 297 DSIAQHITDQTCEHCNTYNMLKLTSHLFAWQPNGVLFDYYERAHLNHVMAAQN-PKTGGF 355
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
YM PL G + S +FWCC G+GL
Sbjct: 356 TYMTPLMSGAERQYSQ----PNEDAFWCCIGSGL 385
>gi|302867043|ref|YP_003835680.1| hypothetical protein Micau_2566 [Micromonospora aurantiaca ATCC
27029]
gi|302569902|gb|ADL46104.1| protein of unknown function DUF1680 [Micromonospora aurantiaca ATCC
27029]
Length = 917
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 139/400 (34%), Positives = 194/400 (48%), Gaps = 24/400 (6%)
Query: 104 EVSLHDVKLDPSSLHWRA------QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG- 156
++ + DP + A Q + YL +DV+ L+++F+ T G A G
Sbjct: 47 DIGVSAYAFDPGQVRLTAGRWQDNQNRTVAYLRFVDVNRLLYNFRANHRLSTGGAAANGG 106
Query: 157 WEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS-----GYL 211
W+ P R H GH+L+A A WA + T ++K +V+ L+ CQ G+ GYL
Sbjct: 107 WDAPNFPFRTHMQGHFLTAWAQAWAVLGDTTCRDKALTMVAELARCQANNGAAGFSAGYL 166
Query: 212 SAFPSEQFDRFEA--LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRV 269
S FP F EA L PYY IHK LAGLLD + +TQA + + + R
Sbjct: 167 SGFPESDFTALEARTLSNGNVPYYCIHKTLAGLLDVWRLIGSTQARDVLLALAGWVDQRT 226
Query: 270 QNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQAD 329
+ + + L E GGMN VL LY T D + L +A FD LA +D
Sbjct: 227 G----RLTSAQMQAMLGTEFGGMNAVLTDLYQQTGDGRWLTVAQRFDHAAVFNPLAANSD 282
Query: 330 DISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDP 389
++G HANT +P IG+ Y+ TG Y+ I +H YA GG S E + P
Sbjct: 283 QLNGLHANTQVPKWIGAAREYKATGVTRYRDIAANAWAITVGAHTYAIGGNSQAEHFRAP 342
Query: 390 KRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMV-YADYYERALTNGVLSIQRGTEP-G 447
+A L + E+C TYNMLK++R L++ + V YAD+YERAL N ++ Q + G
Sbjct: 343 NAIAGYLRNDTCEACNTYNMLKLTRELWQLDPDRVAYADFYERALLNHMIGQQNPADAHG 402
Query: 448 VMIYMLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ Y PL RG A W T ++SFWCC GTGL
Sbjct: 403 HVTYFTPLNPGGRRGVGPAWGGGTWSTDYNSFWCCQGTGL 442
>gi|329847073|ref|ZP_08262101.1| hypothetical protein ABI_01350 [Asticcacaulis biprosthecum C19]
gi|328842136|gb|EGF91705.1| hypothetical protein ABI_01350 [Asticcacaulis biprosthecum C19]
Length = 800
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/400 (33%), Positives = 203/400 (50%), Gaps = 35/400 (8%)
Query: 105 VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCEL 164
V L DV+L PS A + N +YL+ L D ++ ++ K AG P G+ Y GWE T +
Sbjct: 46 VPLSDVRLLPSPF-LTAVEANTKYLMFLSPDRMLHNYHKFAGLPVKGEIYGGWESDT--I 102
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR--- 221
G +GHYLSA + ++A T + + ++ +++ L++ Q G GY + F ++ D
Sbjct: 103 AGEALGHYLSALSLLYAQTGHAEARTRIEYIIAELAKVQAAHGDGYAAGFMRKRKDASIV 162
Query: 222 ------------------FEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
F+ L W P+Y HK+ AGL+D T+A + + +
Sbjct: 163 DGKEIFAEIMAGDIRSAGFD-LNGCWVPFYNWHKLFAGLMDAQTYAGIDAGIPVAVALGG 221
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
Y ++ V + E+ L+ E GG+N+ LYT T+DP+ L LA L
Sbjct: 222 Y----IEKVFAALNDEQVQKVLDCEHGGINESFAELYTRTKDPRWLALAERIYHHRILDP 277
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
L D ++ HANT +P ++G YE+TG P Y+ +FF D V H +A GG +
Sbjct: 278 LTAGEDKLANNHANTQVPKLVGLARLYEITGKPGYRKASSFFWDRVVNHHSFAIGGNADR 337
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E++ +P +A + + ESC TYNMLK++RHL+ WT + DYYERA N +++ Q
Sbjct: 338 EYFFEPDTIAKHITEQTCESCNTYNMLKLTRHLYAWTPNAAWFDYYERAHLNHIMAHQN- 396
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E G+ YM+PL G + S T SFWCC +G+
Sbjct: 397 PETGMFAYMVPLMSGTGREYS-----TPEDSFWCCVLSGI 431
>gi|315506549|ref|YP_004085436.1| hypothetical protein ML5_5828 [Micromonospora sp. L5]
gi|315413168|gb|ADU11285.1| protein of unknown function DUF1680 [Micromonospora sp. L5]
Length = 917
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 139/400 (34%), Positives = 194/400 (48%), Gaps = 24/400 (6%)
Query: 104 EVSLHDVKLDPSSLHWRA------QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG- 156
++ + DP + A Q + YL +DV+ L+++F+ T G A G
Sbjct: 47 DIGVSAYAFDPGQVRLTAGRWQDNQNRTVAYLRFVDVNRLLYNFRANHRLSTGGAAANGG 106
Query: 157 WEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS-----GYL 211
W+ P R H GH+L+A A WA + T ++K +V+ L+ CQ G+ GYL
Sbjct: 107 WDAPNFPFRTHMQGHFLTAWAQAWAVLGDTTCRDKALTMVAELARCQANNGAAGFSAGYL 166
Query: 212 SAFPSEQFDRFEA--LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRV 269
S FP F EA L PYY IHK LAGLLD + +TQA + + + R
Sbjct: 167 SGFPESDFTALEARTLSNGNVPYYCIHKTLAGLLDVWRLIGSTQARDVLLALAGWVDQRT 226
Query: 270 QNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQAD 329
+ + + L E GGMN VL LY T D + L +A FD LA +D
Sbjct: 227 G----RLTSAQMQAMLGTEFGGMNAVLTDLYQQTGDGRWLTVAQRFDHAAVFNPLAANSD 282
Query: 330 DISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDP 389
++G HANT +P IG+ Y+ TG Y+ I +H YA GG S E + P
Sbjct: 283 QLNGLHANTQVPKWIGAAREYKATGVTRYRDIAANAWAITVGAHTYAIGGNSQAEHFRAP 342
Query: 390 KRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMV-YADYYERALTNGVLSIQRGTEP-G 447
+A L + E+C TYNMLK++R L++ + V YAD+YERAL N ++ Q + G
Sbjct: 343 NAIAGYLRNDTCEACNTYNMLKLTRELWQLDPDRVAYADFYERALLNHMIGQQNPADAHG 402
Query: 448 VMIYMLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ Y PL RG A W T ++SFWCC GTGL
Sbjct: 403 HVTYFTPLNPGGRRGVGPAWGGGTWSTDYNSFWCCQGTGL 442
>gi|251798256|ref|YP_003012987.1| hypothetical protein Pjdr2_4277 [Paenibacillus sp. JDR-2]
gi|247545882|gb|ACT02901.1| protein of unknown function DUF1680 [Paenibacillus sp. JDR-2]
Length = 605
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 142/414 (34%), Positives = 200/414 (48%), Gaps = 46/414 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +V+L R + Y+ D++ L+ +F+ AG + + GWE P C LRG
Sbjct: 7 LDEVRLTDDVFASRREHAKT-YIREFDLERLMHTFKINAGISSTAEPLGGWEAPDCGLRG 65
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD--RFEA 224
HFVGHYLSA A H+ TLK +V + C SGYLSAF E+ D E
Sbjct: 66 HFVGHYLSACAKFAYGDHDGTLKTMADEIVDVMQACAQP--SGYLSAFEEEKLDVLELEE 123
Query: 225 LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHW-- 282
+ VWAPYYT+HKI+ GL+D Y + NTQAL++ + Y R + + HW
Sbjct: 124 NRDVWAPYYTLHKIMQGLIDCYVYLQNTQALELAVNLAHYIRRRFEYL-------SHWKI 176
Query: 283 ---------NSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
N +NE GG+ D LY LY +T D L LAHLFD+ +L LA D +
Sbjct: 177 DGILRCTKLNPVNE-FGGLGDSLYTLYELTGDAALLGLAHLFDRDYWLWPLAEGRDVLED 235
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIV---------NASHGYA--TGGTS- 381
HANTH+P+++ RY++ + YK + F D + N+S A GG S
Sbjct: 236 LHANTHLPMILACMHRYKIREEDSYKKSALHFYDFLMGRTFANGNNSSKATAFIQGGVSE 295
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E W LA L ESC +N K+ L W+ E+ Y D+ E N +L+
Sbjct: 296 KAEHWGGYGELADALTGGESESCCAHNTEKIVERLLEWSPEIGYLDHLESLKYNAILN-S 354
Query: 442 RGTEPGVMIYMLPLGRGDSK--AKSYHGWGTRFSSFWCCYGTGLGFIRWIQLSL 493
+ G+ Y PLG K ++ YH SFWCC G+G+ + +Q ++
Sbjct: 355 ASAKTGLSQYHQPLGTNAVKKFSEPYH-------SFWCCTGSGIEAMSELQKNI 401
>gi|381203003|ref|ZP_09910112.1| hypothetical protein SyanX_20925 [Sphingobium yanoikuyae XLDN2-5]
Length = 790
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/400 (32%), Positives = 194/400 (48%), Gaps = 32/400 (8%)
Query: 103 KEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC 162
+ + L+ +L PS A + N YLL L+ D L+ +F+K AG G Y GWE+ T
Sbjct: 34 RALPLNATRLLPSPFA-DAVEGNRRYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWENDT- 91
Query: 163 ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRF 222
+ GH +GHYL+A A M A T + + +++ L+ECQ G GY++ F + D
Sbjct: 92 -IAGHTLGHYLTALALMHAQTGDAECARRAAYIIAELAECQAAAGDGYVAGFTRRRDDVI 150
Query: 223 EA-------------------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
E L W P+Y HK+ AGL D + N+QA + +
Sbjct: 151 EDGRLIFPEIMRGDIRSAGFDLNGCWVPFYNWHKLFAGLFDAESHLGNSQARGVALALAA 210
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
Y + V K + L+ E GG+N+ L+ T DP+ L LA L
Sbjct: 211 Y----IDGVFAKLDDAQVQQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDP 266
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
LA + + + HANT IP +IG +E+TG+ + FF + V + Y GG +
Sbjct: 267 LAQRQNSLPWIHANTQIPKLIGLARLHEITGNAADAIAANFFWETVVGQYSYVIGGNADR 326
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E++ DP ++ + + ESC +YNMLK++RHL+ W E DYYERA N +L+ Q
Sbjct: 327 EYFPDPGTISKHITEQTCESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQNP 386
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G+ YM+PL G S+ W F FWCC G+G+
Sbjct: 387 AT-GMFAYMVPLMSG-----SHRVWSEPFDDFWCCVGSGM 420
>gi|440730056|ref|ZP_20910155.1| hypothetical protein A989_02030 [Xanthomonas translucens DAR61454]
gi|440379682|gb|ELQ16270.1| hypothetical protein A989_02030 [Xanthomonas translucens DAR61454]
Length = 807
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 134/403 (33%), Positives = 201/403 (49%), Gaps = 35/403 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ + L V L PS L + QTN YLL L+ D L+ +F + AG P G+ Y GWE T
Sbjct: 60 VQALPLKQVTLKPS-LFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGEVYGGWEGDT 118
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
+ GH +GHYLSA A M A T + L++++ +V+ L+ Q K GY+ + + D+
Sbjct: 119 --IAGHTLGHYLSALAKMHAQTRDAALRQRIDYIVAELARAQAKDPDGYVGGL-TRKNDK 175
Query: 222 ---------FEA------------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKW 260
FE L W+P YT+HK+ AGLLD + A N QAL++
Sbjct: 176 GAIDNGKLVFEEVRRGIIKGSKFNLNGSWSPLYTVHKLFAGLLDAHALAGNAQALQVLLP 235
Query: 261 MVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCF 320
+ Y + V + L+ E GG+N+ L T DP+ + L
Sbjct: 236 LAGY----LGGVFDALDHAQMQTLLDTEFGGLNESYIELGARTGDPRWIALGKRLRHEKV 291
Query: 321 LGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGT 380
+ A D++ HANT +P IG ++EV GD FF + V + Y GG
Sbjct: 292 IDPAAAGRDELPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTGHYSYVIGGN 351
Query: 381 SAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSI 440
+ E++ +P +A+ L + E C +YNMLK++RHL++WT + Y DYYER L N ++
Sbjct: 352 ADREYFQEPDTIAAFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAA 411
Query: 441 QRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Q G+ YM P+ G + G+ +F SFWCC G+G+
Sbjct: 412 QHPAT-GMFTYMTPMISGGER-----GFSDKFDSFWCCVGSGM 448
>gi|386847956|ref|YP_006265969.1| Glucan endo-1,3-beta-glucosidase [Actinoplanes sp. SE50/110]
gi|359835460|gb|AEV83901.1| Glucan endo-1,3-beta-glucosidase [Actinoplanes sp. SE50/110]
Length = 765
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 133/375 (35%), Positives = 186/375 (49%), Gaps = 23/375 (6%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG-WEDPTCELRGHFVGHYLSASAHMW 180
Q L YL +D D L+++F+ G T G A G W+ P R H GH+L+A A W
Sbjct: 65 QTRTLNYLRFVDADRLLYNFRANHGRSTGGAAANGGWDAPDFPFRTHVQGHFLTAWAQAW 124
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEA--LKPVWAPYYTIHKI 238
A+ + T +++ +V+ L++CQ +GYLS FP F EA L PYY +HK
Sbjct: 125 AALGDTTCRDRANYMVAELAKCQ--AANGYLSGFPESDFTALEAGTLSNGNVPYYCVHKT 182
Query: 239 LAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMND 294
LAGLLD + TQA L++ W V + + + L E GGMN+
Sbjct: 183 LAGLLDVWRLIGGTQARDVLLRLAGW--------VDTRTARLTTSQMQAMLGTEFGGMNE 234
Query: 295 VLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTG 354
VL +Y T D + L A FD LA AD ++G HANT +P +G+ Y+ TG
Sbjct: 235 VLADIYQQTGDGRWLATAQRFDHAAVFTPLAAGADQLNGLHANTQVPKWVGAVREYKATG 294
Query: 355 DPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSR 414
Y+ G +I +H YA GG S E + P +A L + E C +YNMLK++R
Sbjct: 295 TTRYRDIGLNAWNITTGAHTYAIGGNSQAEHFRAPNAIAGYLTNDTCEHCNSYNMLKLTR 354
Query: 415 HLFRWTKE-MVYADYYERALTNGVLSIQRGTEP-GVMIYMLPL----GRGDSKAKSYHGW 468
L+ + Y D+YERAL N ++ Q + G + Y PL RG A W
Sbjct: 355 ELWLTDPDRAAYFDFYERALLNHLIGAQNPADSHGHITYFTPLRPGGRRGVGPAWGGGTW 414
Query: 469 GTRFSSFWCCYGTGL 483
T ++SFWCC GTG+
Sbjct: 415 STDYASFWCCQGTGV 429
>gi|399074049|ref|ZP_10750795.1| hypothetical protein PMI01_01867 [Caulobacter sp. AP07]
gi|398040822|gb|EJL33912.1| hypothetical protein PMI01_01867 [Caulobacter sp. AP07]
Length = 775
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 137/423 (32%), Positives = 200/423 (47%), Gaps = 33/423 (7%)
Query: 81 SWTMIYRKMKNPDGFKLAGDFLKE-VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVW 139
S M + +P AG + E V V L PS + +AQ N YL+ L D L+
Sbjct: 15 SSAMAFVGAASPGLAAPAGRVVAEPVPARHVALKPS-IFQQAQAANRAYLVSLSADRLLH 73
Query: 140 SFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSAL 199
+F + AG Y GWE + GH +GHYL+A A A T + L +++T +V+ L
Sbjct: 74 NFHQGAGLSVKAPVYGGWE--AQSIAGHTLGHYLTACALQVAGTGDPVLSDRLTYIVAEL 131
Query: 200 SECQNKMGSGYL----------SAFPSEQFDRFE---------ALKPVWAPYYTIHKILA 240
+ Q G GY+ +A + F+ +L W P YT HK+ A
Sbjct: 132 ARVQAAHGDGYVGGTTRWGQSDAAGGKQVFEELRRGDIRASRFSLNDGWVPIYTWHKVHA 191
Query: 241 GLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLY 300
GLLD + A +AL + + YF ++ S + L E GG+N+ Y
Sbjct: 192 GLLDAHRLAGTPRALAVAVGLAGYF----ATIVEGLSDAQVQQILITEHGGINEAYAETY 247
Query: 301 TITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKV 360
+T D + L +A L +A D+++G HANT IP VIG YEV GDP
Sbjct: 248 ALTGDERWLKVARRLRHKAVLDPIAEGRDELAGLHANTQIPKVIGLARLYEVGGDPAEAR 307
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWT 420
FF +V +H Y GG S E + P +A + E+C TYNMLK++R L+ W
Sbjct: 308 AARFFHQVVTENHSYVIGGNSDREHFGKPNEIARHMAETTCEACNTYNMLKLTRRLWSWA 367
Query: 421 KEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYG 480
DYYERA N +++ QR ++ G+ +Y +P+ G ++ S T SFWCC G
Sbjct: 368 PNGALFDYYERAQLNHIMAHQRPSD-GMFVYFMPMAAGGRRSYS-----TPEDSFWCCVG 421
Query: 481 TGL 483
+G+
Sbjct: 422 SGM 424
>gi|433676676|ref|ZP_20508761.1| hypothetical protein BN444_00795 [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430818203|emb|CCP39076.1| hypothetical protein BN444_00795 [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 807
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 134/403 (33%), Positives = 201/403 (49%), Gaps = 35/403 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ + L V L PS L + QTN YLL L+ D L+ +F + AG P G+ Y GWE T
Sbjct: 60 VQALPLKQVTLKPS-LFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGEVYGGWEGDT 118
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
+ GH +GHYLSA A M A T + L++++ +V+ L+ Q K GY+ + + D+
Sbjct: 119 --IAGHTLGHYLSALAKMHAQTRDAALRQRIDYIVAELARAQAKDPDGYVGGL-TRKNDK 175
Query: 222 ---------FEA------------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKW 260
FE L W+P YT+HK+ AGLLD + A N QAL++
Sbjct: 176 GAIDNGKLVFEEVRRGIIKGSKFNLNGSWSPLYTVHKLFAGLLDAHELAGNAQALQVLLP 235
Query: 261 MVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCF 320
+ Y + V + L+ E GG+N+ L T DP+ + L
Sbjct: 236 LAGY----LGGVFDALDHAQMQALLDTEFGGLNESYIELGARTGDPRWIALGKRLRHEKV 291
Query: 321 LGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGT 380
+ A D++ HANT +P IG ++EV GD FF + V + Y GG
Sbjct: 292 IDPAAAGRDELPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTGHYSYVIGGN 351
Query: 381 SAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSI 440
+ E++ +P +A+ L + E C +YNMLK++RHL++WT + Y DYYER L N ++
Sbjct: 352 ADREYFQEPDTIAAFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAA 411
Query: 441 QRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Q G+ YM P+ G + G+ +F SFWCC G+G+
Sbjct: 412 QHPAT-GMFTYMTPMIGGGER-----GFSDKFDSFWCCVGSGM 448
>gi|443291943|ref|ZP_21031037.1| Conserved secreted hypothetical protein [Micromonospora lupini str.
Lupac 08]
gi|385885131|emb|CCH19144.1| Conserved secreted hypothetical protein [Micromonospora lupini str.
Lupac 08]
Length = 778
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 134/380 (35%), Positives = 186/380 (48%), Gaps = 26/380 (6%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG-WEDPTCELRGHFVGHYLSASAHMW 180
Q L YL +DVD ++++F+ T G A G W+ P R H GH+L+A A +
Sbjct: 69 QNRTLNYLRFVDVDRMLYNFRANHRLSTNGAATNGGWDAPNFPFRTHMQGHFLTAWAQAY 128
Query: 181 ASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQFDRFEA--LKPVWAPYY 233
A + T ++K +V+ L++CQ G+GYLS FP F EA L PYY
Sbjct: 129 AVLGDTTCRDKANYMVAELAKCQANNGAAGFGAGYLSGFPESDFSALEARTLSNGNVPYY 188
Query: 234 TIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
IHK LAGLLD + + NTQA L + W V ++ S + + L E
Sbjct: 189 CIHKTLAGLLDVWRYTGNTQARTVLLALAGW--------VDTRTSRLSSSQMQSMLGTEF 240
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GGMNDVL +Y +T D + L A FD LA D ++G HANT +P +G+
Sbjct: 241 GGMNDVLTEIYQMTGDSRWLTTAQRFDHASVFNPLANNQDQLNGLHANTQVPKWVGAARE 300
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
++ TG Y+ + +I +H Y GG S E + P +A L + E C TYNM
Sbjct: 301 FKATGTTRYRDIASNAWNITVRAHTYVIGGNSQAEHFRAPNAIAGYLSNDTCEQCNTYNM 360
Query: 410 LKVSRHLFRWTKEMV-YADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGDSKAK 463
LK++R L+ Y DYYERA N ++ Q + G + Y PL RG A
Sbjct: 361 LKLTRELWLLDPSRTDYFDYYERATINHLIGAQNPADSKGHITYFTPLKPGGRRGVGPAW 420
Query: 464 SYHGWGTRFSSFWCCYGTGL 483
W T ++SFWCC GTG+
Sbjct: 421 GGGTWSTDYNSFWCCQGTGV 440
>gi|408357351|ref|YP_006845882.1| hypothetical protein AXY_19880 [Amphibacillus xylanus NBRC 15112]
gi|407728122|dbj|BAM48120.1| hypothetical protein AXY_19880 [Amphibacillus xylanus NBRC 15112]
Length = 622
Score = 211 bits (538), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 133/397 (33%), Positives = 206/397 (51%), Gaps = 34/397 (8%)
Query: 103 KEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGS----PTAGKAYEGWE 158
K V++HD L R + N YL+ L D+L+++++ AG A+ GWE
Sbjct: 7 KNVTVHDGDLK------RREAANKSYLMSLTNDNLLFNYRVEAGRFHGREIPKDAHGGWE 60
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
P C++RGHF+GH+LSA+A + + ++ LK K +VS L+ECQ G ++ P +
Sbjct: 61 TPVCQIRGHFLGHWLSAAALHYHQSGDLELKVKADLIVSELAECQKDNGGQWVGPIPEKY 120
Query: 219 FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
K +WAP Y +HK+ GL+D Y++ N QAL + ++F K++
Sbjct: 121 LHWIAEGKNIWAPQYNLHKLFMGLIDMYSYTGNQQALDIADNFADWFVKWS----GKFTR 176
Query: 279 ERHWNSLNEETGGMNDVLYRLYTIT-QDPKHLLLAHLFDKPCFLGLLAVQADDISGFHAN 337
E+ + L+ ETGGM +V L IT D LL + + F LL + D ++ HAN
Sbjct: 177 EQFDDILDVETGGMLEVWADLLEITGHDKYKFLLDRYYRQRLFQPLLEGK-DPLTNMHAN 235
Query: 338 THIPVVIGSQMRYEVTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL 396
T IP V+G YEVTGD + + ++ V ATGG ++GE W ++ + L
Sbjct: 236 TTIPEVLGCARAYEVTGDNRWLDIVKAYWNCAVTERGTLATGGNTSGEVWMPKMKIKARL 295
Query: 397 GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ-------RGTEP--- 446
G +N+E CT YNM++++ LF+ TK+ Y Y E L NG+++ GT
Sbjct: 296 GDKNQEHCTVYNMIRLADFLFQQTKDPAYGQYIEYNLYNGIMAQAYYQSYHVAGTGKNHP 355
Query: 447 --GVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGT 481
G++ Y LP+ KA Y W + +SF+CC+GT
Sbjct: 356 WTGLLTYFLPM-----KAGLYKEWSSETNSFFCCHGT 387
>gi|385677991|ref|ZP_10051919.1| hypothetical protein AATC3_18830 [Amycolatopsis sp. ATCC 39116]
Length = 886
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 149/394 (37%), Positives = 206/394 (52%), Gaps = 18/394 (4%)
Query: 103 KEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC 162
+ + L V+L S ++T + YL +D D L+ F+ TAG P+ + GWE P
Sbjct: 35 RPLELGRVRLLDSRYRQNMERT-VAYLRFVDADRLLHMFRVTAGLPSTAEPCGGWEAPDI 93
Query: 163 ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMG-----SGYLSAFPSE 217
+LRGH GH LS A A+T + L K ++V+AL+ECQ GYLSAFP
Sbjct: 94 QLRGHTTGHLLSGLALAAANTGDTELAAKGASIVAALAECQAAAPAAGFTEGYLSAFPER 153
Query: 218 QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYS 277
F EA K VWAPYYTIHKI+AGLLDQY N QAL + M + R+ N+ +
Sbjct: 154 AFADLEAGKVVWAPYYTIHKIMAGLLDQYRLLGNRQALDVLLGMARWARARMANL----T 209
Query: 278 VERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHAN 337
E L+ E GGMN+ L L +T D +HL A LFD L+ + D ++G HAN
Sbjct: 210 REAQQKVLHTEFGGMNETLASLALVTGDRQHLETAKLFDHDEIFVPLSQRRDTLAGRHAN 269
Query: 338 THIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG 397
T I ++G+ + ++ TG+ Y+ T+F D V H Y GG + EF+ P ++ S LG
Sbjct: 270 TDIAKIVGAAVEWDATGEEYYRTIATYFWDQVVHHHTYVIGGNANAEFFGPPDQIVSQLG 329
Query: 398 TENEESCTTYNMLKVSRHLF-RWTKEMVYADYYERALTNGVLSIQR-GTEPGVMIY---M 452
E+C +YNMLK+SR LF R Y DY E L N +L Q + G + Y +
Sbjct: 330 ENTCENCNSYNMLKLSRLLFLRDPSRTDYLDYSEWTLLNQMLGEQDPDSAHGFVTYYTGL 389
Query: 453 LPLGRGDSKAKSYHGWGT---RFSSFWCCYGTGL 483
+P + K GT + +F C +GTGL
Sbjct: 390 VPGAQRKGKEGVVSDPGTYSSDYGNFTCDHGTGL 423
>gi|393782435|ref|ZP_10370619.1| hypothetical protein HMPREF1071_01487 [Bacteroides salyersiae
CL02T12C01]
gi|392673263|gb|EIY66726.1| hypothetical protein HMPREF1071_01487 [Bacteroides salyersiae
CL02T12C01]
Length = 781
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 133/395 (33%), Positives = 201/395 (50%), Gaps = 36/395 (9%)
Query: 105 VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCEL 164
+S+ +V+L A + + ++L+ L D + F + AG Y+GWED +
Sbjct: 47 ISISEVRLLQGPFK-AAMEADRKWLMSLQPDRFLHRFHENAGFTPKAPMYDGWEDSS--Q 103
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ------ 218
G GHYLSA + ++A+T + L ++ ++ + +CQ +G+GY++A P
Sbjct: 104 SGFSFGHYLSAMSMLYAATGDNELLGRIEYSINEIRKCQLAIGTGYVAAIPDGDRLWNEL 163
Query: 219 -FDRFEA----LKPVWAPYYTIHKILAGLLDQYTFAD----NTQALKMTKWMVEYFYNRV 269
D+ E + WAP+Y +HK+ +G +D Y + T A+++T W + F +
Sbjct: 164 VADKIEPGGSWINGFWAPWYNLHKLWSGFIDVYLYTGVETAKTVAIELTDWACDKFRDMT 223
Query: 270 QNVITKYSVERHWNSL-NEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQA 328
+ W + + ETGGMND LY +Y IT + ++L LA F + L+ Q
Sbjct: 224 DD---------QWQRMISCETGGMNDALYNMYAITGNLRYLQLADKFYHYSVMEPLSQQR 274
Query: 329 DDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSD 388
D+++G HANT IP V G YE+ G K TFF + V H Y GG S E +
Sbjct: 275 DELNGLHANTQIPKVTGIARSYELRGREKDKTIATFFWNTVLKKHTYCIGGNSNYEHFGK 334
Query: 389 PKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGV 448
P L L + E+C TYNMLK++ HLF W + Y DYYERAL N +L+ Q E G+
Sbjct: 335 PGEL--FLSDKTTETCNTYNMLKLTGHLFAWEPKAEYMDYYERALYNHILASQ-NHETGM 391
Query: 449 MIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
++Y LPL S+ + T SFWCC GTG
Sbjct: 392 VVYSLPLAYA-----SFKEFSTPEHSFWCCVGTGF 421
>gi|427411824|ref|ZP_18902026.1| hypothetical protein HMPREF9718_04500 [Sphingobium yanoikuyae ATCC
51230]
gi|425710114|gb|EKU73137.1| hypothetical protein HMPREF9718_04500 [Sphingobium yanoikuyae ATCC
51230]
Length = 802
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/400 (32%), Positives = 191/400 (47%), Gaps = 32/400 (8%)
Query: 103 KEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC 162
+ + L +L PS A + N YLL L+ D L+ +F+K AG G Y GWE+ T
Sbjct: 46 RALPLQATRLLPSPFA-DAVEGNRLYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWENDT- 103
Query: 163 ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRF 222
+ GH +GHYL+A A M A T + + ++ L+ CQ G GY++ F + D
Sbjct: 104 -IAGHTLGHYLTALALMHAQTGDAECARRAAYIIDELAACQAAAGDGYVAGFTRRRDDVI 162
Query: 223 EA-------------------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
E L W P+Y HK+ AGL D T N+QA + +
Sbjct: 163 EDGRLIFPEIMRGDIRSAGFDLNGCWVPFYNWHKLFAGLFDAETHLGNSQARGVALALAA 222
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
Y + V K + L+ E GG+N+ L+ T DP+ L LA L
Sbjct: 223 Y----IDGVFAKLDDAQVQQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDP 278
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
LA + + + HANT IP +IG +E+TG+ + FF + V + Y GG +
Sbjct: 279 LAQRQNSLPWIHANTQIPKLIGLARLHEITGNAADAIAANFFWETVVGQYSYVIGGNADR 338
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E++ DP ++ + + ESC +YNMLK++RHL+ W E DYYERA N +L+ Q
Sbjct: 339 EYFPDPGTISKHITEQTCESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQNP 398
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G+ YM+PL G S+ W F FWCC G+G+
Sbjct: 399 AT-GMFAYMVPLMSG-----SHRVWSEPFDDFWCCVGSGM 432
>gi|399025507|ref|ZP_10727503.1| hypothetical protein PMI13_03476 [Chryseobacterium sp. CF314]
gi|398077884|gb|EJL68831.1| hypothetical protein PMI13_03476 [Chryseobacterium sp. CF314]
Length = 791
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/380 (34%), Positives = 193/380 (50%), Gaps = 24/380 (6%)
Query: 116 SLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSA 175
S+ +A QT+ +Y+L +D D L+ + K AG Y WE+ L GH GHY+SA
Sbjct: 37 SVFSKAMQTDEKYILSMDADRLLAPYLKEAGLKPKKANYPNWEN--TGLDGHIGGHYISA 94
Query: 176 SAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFE-----------A 224
A M+AST + +K+++ ++ L CQN +GYLS P+ + E
Sbjct: 95 LALMYASTGDAKVKQRLDYMIDELERCQNLSENGYLSGVPNGKKIWKEIAGGNIRAATFG 154
Query: 225 LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS 284
L W P Y IHKI +GL D Y +AD+ +A KM + ++ V +V++ ++ N
Sbjct: 155 LNDRWVPLYNIHKIYSGLRDAYWYADSGKAKKMLIRLTDWMVGEV-SVLSDAQIQ---NM 210
Query: 285 LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVI 344
L E GG+N+V +Y IT++PK+L LAH F L L D +G HANT IP VI
Sbjct: 211 LRSEHGGLNEVFADVYDITKNPKYLRLAHRFSHLAILNPLLNGEDKFTGIHANTQIPKVI 270
Query: 345 GSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT-ENEES 403
G + ++ + + FF V GG S E ++ + + + E E+
Sbjct: 271 GFKRIADLENNKEWSNAADFFWINVTQKRSAVIGGNSVSEHFNPINDFSGMIKSIEGPET 330
Query: 404 CTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAK 463
C TYNMLK+S+ L+ + Y DYYERAL N +LS Q E G +Y P+ G
Sbjct: 331 CNTYNMLKLSKELYATNPKSSYIDYYERALYNHILSTQ-NPEKGGFVYFTPMRPG----- 384
Query: 464 SYHGWGTRFSSFWCCYGTGL 483
Y + +SFWCC G+G+
Sbjct: 385 HYRVYSQPETSFWCCVGSGM 404
>gi|357472913|ref|XP_003606741.1| hypothetical protein MTR_4g065110 [Medicago truncatula]
gi|355507796|gb|AES88938.1| hypothetical protein MTR_4g065110 [Medicago truncatula]
Length = 203
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 107/172 (62%), Positives = 127/172 (73%), Gaps = 7/172 (4%)
Query: 1 MKNFVFKVLVLFLSCWVALC---KECTNSFPQLASHTFRYELLSSKNETWKKEVYSHYHL 57
MK FVF + +F++ + C KECTN Q SHTFRYEL +SKNETWKKEV SHYH+
Sbjct: 1 MKVFVF--MFMFMALMLRGCVTIKECTNIPTQ--SHTFRYELFASKNETWKKEVMSHYHV 56
Query: 58 TPTDDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSL 117
TPTD+SAW+ LLPRK+LSE ++ W ++YRK+KN FK FLKEV L DV+L S+
Sbjct: 57 TPTDESAWATLLPRKILSEENQHDWALMYRKIKNLGVFKPPVGFLKEVPLGDVRLLEGSI 116
Query: 118 HWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFV 169
H AQQTNLEYLLMLDVD L+WSF+KTAG PT G Y GWE+P ELRGHFV
Sbjct: 117 HAVAQQTNLEYLLMLDVDRLIWSFRKTAGLPTPGNPYGGWEEPNTELRGHFV 168
>gi|169596765|ref|XP_001791806.1| hypothetical protein SNOG_01152 [Phaeosphaeria nodorum SN15]
gi|111069681|gb|EAT90801.1| hypothetical protein SNOG_01152 [Phaeosphaeria nodorum SN15]
Length = 620
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 142/397 (35%), Positives = 197/397 (49%), Gaps = 25/397 (6%)
Query: 104 EVSLHDVKLDPSSLHWRAQQT-NLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPT 161
E L V L S+ W+ + L YL ++VD L+++F+ T T G + GW+ P
Sbjct: 36 EFDLSQVSL--SNSRWKDNENRTLNYLKAVNVDRLLYNFRATHKLSTNGAQPNGGWDAPN 93
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS-----GYLSAFPS 216
R H GHYL+A H +A+ + K + + V L++CQ G+ GYLS FP
Sbjct: 94 FPFRSHAQGHYLTAWVHCYATLRDNECKNRASYFVQELAKCQANNGAAQFSTGYLSGFPE 153
Query: 217 EQFDRFEA--LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
+F EA LK PYY +HK +AGLLD + +T+A + + + R +
Sbjct: 154 SEFVALEAGQLKGGNVPYYAVHKTMAGLLDAWRIIGDTKARDVLLALAGWVDGRTK---- 209
Query: 275 KYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
K S + L E GGMNDVL +Y +T + + L +A FD LA D +SG
Sbjct: 210 KLSSSQMQTMLGTEFGGMNDVLAAIYQLTGNQQWLTVAQRFDHASQFDPLANNQDRLSGN 269
Query: 335 HANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAS 394
HANT +P IG+ Y+ TG Y D +H YA GG S E + P ++++
Sbjct: 270 HANTQVPKWIGAAREYKSTGTKRYLDIAKNAWDFTINAHTYAIGGNSQAEHFRPPNQISN 329
Query: 395 TLGTENEESCTTYNMLKVSRHLFRWTKE---MVYADYYERALTNGVLSIQRGTEP-GVMI 450
L + E C TYNMLK++R L WT + Y DYYERAL N +L Q T+ G +
Sbjct: 330 FLTNDTAEQCNTYNMLKLTRDL--WTTDPSSTKYFDYYERALINHLLGAQNPTDNHGHIT 387
Query: 451 YMLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y PL RG A W T ++SFWCC GT L
Sbjct: 388 YFTPLKSGGRRGIGPAWGGGTWSTDYNSFWCCQGTAL 424
>gi|399071242|ref|ZP_10749941.1| hypothetical protein PMI01_00976 [Caulobacter sp. AP07]
gi|398043612|gb|EJL36503.1| hypothetical protein PMI01_00976 [Caulobacter sp. AP07]
Length = 789
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 135/400 (33%), Positives = 193/400 (48%), Gaps = 34/400 (8%)
Query: 105 VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCEL 164
+ L V+L PS + A + N YLL L D + +F AG P G+ Y GWE T +
Sbjct: 38 LPLSSVRLLPSD-YATAVEVNRAYLLRLSPDRFLHNFMTFAGLPAKGEIYGGWESDT--I 94
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR--- 221
GH +GHY+SA M+ T +V + + +V L+ Q K G GY+ A ++ D
Sbjct: 95 AGHTLGHYVSALVVMYEQTGDVECRRRADYIVGELARAQAKRGDGYIGALQRKRKDGTVV 154
Query: 222 ------------------FEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
F+ L W+P YT+HK AGLLD + N QAL + +
Sbjct: 155 DGEEIFAEVMKGDIRSGGFD-LNGSWSPLYTVHKTFAGLLDVHRAWGNQQALDVAVGLGG 213
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
YF + V + E+ L E GG+N+ LY T D + L++A L
Sbjct: 214 YF----ERVFAALNDEQMQTLLGCEYGGLNESYAELYARTGDRRWLVVAERIYDRKVLDP 269
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
L Q D ++ FHANT +P +IG YE+TG P FF + V H Y GG +
Sbjct: 270 LVAQQDKLANFHANTQVPKLIGLGRLYELTGKPQDAAAARFFWNTVTQHHSYVIGGNADR 329
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E++++P +A+ + + E C TYNMLK++R L+ W E DYYERA N V++ Q
Sbjct: 330 EYFAEPDTIAAHISEQTCEHCNTYNMLKLTRQLYSWRPEGALFDYYERAHLNHVMAAQN- 388
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G YM PL G + S + +FWCC GTG+
Sbjct: 389 PKTGGFTYMTPLLTGADRGYSTN----EDDAFWCCVGTGM 424
>gi|15614440|ref|NP_242743.1| hypothetical protein BH1877 [Bacillus halodurans C-125]
gi|10174495|dbj|BAB05596.1| BH1877 [Bacillus halodurans C-125]
Length = 758
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 133/387 (34%), Positives = 202/387 (52%), Gaps = 26/387 (6%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
S+ +VKL L + +Q+ + +L LD+D L+ + + A P ++Y GWE+ E+R
Sbjct: 3 SIENVKL-TKGLFYNSQKKGNDVILALDIDRLLAPYYEAANLPPKKRSYGGWEER--EIR 59
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR---- 221
GH +GH+LSA+A M+ +T + L E++ V L+ Q+ +G Y+ FD
Sbjct: 60 GHSLGHWLSAAAAMYETTGDKALLERIDRAVQELATIQDDVG--YVGGVKRAHFDEMFSG 117
Query: 222 -FEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKY 276
F+ + W P+Y +HK+ AGL+D + ++ AL + + ++ + + +T
Sbjct: 118 EFQVGHFNIAGTWVPWYNLHKLFAGLIDVHQLTGHSLALTVVTKLADW-AKKGTDQLTDD 176
Query: 277 SVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHA 336
+R L E GGMN+ + LYT+T +L LA F L LA D++ G HA
Sbjct: 177 QFQR---MLICEHGGMNEAMADLYTLTGHKDYLQLAIRFCHWAVLEPLANGIDELEGKHA 233
Query: 337 NTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL 396
NT IP VIG+ +E+TGD Y+ FF V Y GG S E + + TL
Sbjct: 234 NTQIPKVIGAAKLFEITGDDTYRAIAEFFWRQVTNDRSYIIGGNSNSEHFGPANK--ETL 291
Query: 397 GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLG 456
G E E+C TYNMLK++ HLFRW + DYYE+AL N +L+ Q + G+ Y + L
Sbjct: 292 GVETAETCNTYNMLKLTEHLFRWNRSSQLMDYYEKALYNHILASQ-DPDSGMKTYFVSLQ 350
Query: 457 RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G K + + SFWCC+GTGL
Sbjct: 351 PGHFKV-----YSSLEESFWCCFGTGL 372
>gi|333380462|ref|ZP_08472153.1| hypothetical protein HMPREF9455_00319 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826457|gb|EGJ99286.1| hypothetical protein HMPREF9455_00319 [Dysgonomonas gadei ATCC
BAA-286]
Length = 790
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 135/398 (33%), Positives = 200/398 (50%), Gaps = 41/398 (10%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
SL DV+L S A+ + +YLL L D L+ F + +G ++Y WE+ L
Sbjct: 29 SLKDVRLLDSPFK-HAEDLDKQYLLELKADRLLSPFLRESGLTPKAESYTNWEN--TGLD 85
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ------- 218
GH GHYLSA + M+AST + +KE++ +VS L CQ+ +GY+ P +
Sbjct: 86 GHIGGHYLSALSLMYASTGDKQIKERLDYMVSELKRCQDANDNGYIGGVPGGKAIWEEVA 145
Query: 219 --------FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFY 266
FD L W P Y IHK AGL D Y +A++ A +KMT W +
Sbjct: 146 NGNIRAGGFD----LNGKWVPLYNIHKTYAGLRDAYLYANSDMAKEMLIKMTDWAI---- 197
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
N+++K S E+ + L E GG+N+ + IT D K+L LAH F L L
Sbjct: 198 ----NLVSKLSEEQIQDMLRSEHGGLNETFADVAAITGDKKYLKLAHQFSHQLVLNPLLN 253
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
D ++G HANT IP V+G + +V G+ + FF + V + GG S GE +
Sbjct: 254 HEDKLTGMHANTQIPKVLGFKRIADVEGNESWSEASRFFWETVVEHRSVSIGGNSVGEHF 313
Query: 387 SDPKRLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
+ + + + E E+C TYNML++S+ L++ +++ Y DYYERAL N +LS Q E
Sbjct: 314 NPTNDFSRVIKSIEGPETCNTYNMLRLSKMLYQTSQDEKYMDYYERALYNHILSTQ-NPE 372
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y + G Y + +SFWCC G+G+
Sbjct: 373 QGGFVYFTQMRPG-----HYRVYSQPQTSFWCCVGSGI 405
>gi|261407096|ref|YP_003243337.1| hypothetical protein GYMC10_3284 [Paenibacillus sp. Y412MC10]
gi|261283559|gb|ACX65530.1| protein of unknown function DUF1680 [Paenibacillus sp. Y412MC10]
Length = 622
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 128/383 (33%), Positives = 193/383 (50%), Gaps = 34/383 (8%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGS----PTAGKAYEGWEDPTCELRGHFVGHYLSA 175
R ++ N YL+ LD L++++ AG A+ GWE P C+LRGHF+GH+LS
Sbjct: 18 RRERANRSYLMKLDSGHLLFNYHLEAGRFHGRTIPEGAHGGWETPVCQLRGHFLGHWLSG 77
Query: 176 SAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTI 235
+A + + ++ LK K+ A+V L ECQ G ++ P + + K +WAP Y
Sbjct: 78 AALHYEESGDIELKAKLDAIVHELHECQRDNGGQWVGPIPEKYLHWIASGKSIWAPQYNC 137
Query: 236 HKILAGLLDQYTFADNTQAL----KMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGG 291
HKIL GL+D + +A N QAL + W VE+ ++ E+ + L+ ETGG
Sbjct: 138 HKILMGLVDAWQYAGNRQALDIVDRFADWFVEW--------SGTFTREQFDDILDVETGG 189
Query: 292 MNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYE 351
M +V L IT K+ +L + + L D ++ HANT IP V+G YE
Sbjct: 190 MLEVWADLLHITGADKYRVLLDRYYRSRLFQPLLEGKDPLTNMHANTTIPEVLGCARAYE 249
Query: 352 VTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNML 410
VTGD + + ++ V ATGG +AGE W ++ + LG +N+E CT YNM+
Sbjct: 250 VTGDDRWLSIVQAYWNCAVTERGSLATGGQTAGEVWMPKMKMKARLGDKNQEHCTVYNMI 309
Query: 411 KVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE------------PGVMIYMLPLGRG 458
+++ LFR + + YA Y E L NG+++ E G++ Y LP+ G
Sbjct: 310 RLADFLFRQSGDPTYAQYIEYNLYNGIMAQAYYQEYGLTGSQHNYPRTGLLTYFLPMKAG 369
Query: 459 DSKAKSYHGWGTRFSSFWCCYGT 481
K W T SF+CC+GT
Sbjct: 370 LRKE-----WSTETDSFFCCHGT 387
>gi|290954983|ref|YP_003486165.1| hypothetical protein SCAB_3871 [Streptomyces scabiei 87.22]
gi|260644509|emb|CBG67594.1| putative secreted protein [Streptomyces scabiei 87.22]
Length = 768
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 141/380 (37%), Positives = 187/380 (49%), Gaps = 26/380 (6%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG-WEDPTCELRGHFVGHYLSASAHMW 180
Q YL +DVD L+++F+ TAG A G W+ PT R H GH+L+A A ++
Sbjct: 66 QDRTRNYLRFVDVDRLLYNFRANHRLSTAGAAATGGWDAPTFPFRTHVQGHFLTAWAQLY 125
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGS-----GYLSAFPSEQFDRFE--ALKPVWAPYY 233
A T + T ++K T +V+ L++CQ G+ GYLS +P F E L PYY
Sbjct: 126 AVTGDTTCRDKATRMVAELAKCQANNGAAGFNTGYLSGYPESDFTALEQRTLSNGNVPYY 185
Query: 234 TIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
TIHK LAGLLD + +TQA L + W V++ R+ ++ L E
Sbjct: 186 TIHKTLAGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRLTG-------QQMQAMLQTEF 237
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GGMN VL LY T D + L A FD LA D +SG HANT +P IG+
Sbjct: 238 GGMNAVLTDLYQQTGDARWLTAARRFDHAAVFDPLASNQDRLSGLHANTQVPKWIGAARE 297
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
Y+ TG Y+ T I A+H YA GG S E + P +A L + ESC T+NM
Sbjct: 298 YKATGTTRYRDIATNAWSITVAAHTYAIGGNSQAEHFRAPNAIAGFLNQDTCESCNTFNM 357
Query: 410 LKVSRHLFRWT-KEMVYADYYERALTNGVLSIQRGTEP-GVMIYMLPL----GRGDSKAK 463
L ++R LF DYYERA N ++ Q + G + Y PL RG A
Sbjct: 358 LVLTRELFALDPNRAALFDYYERAWLNQMIGQQNPADDHGHVTYFTPLRPGGRRGVGPAW 417
Query: 464 SYHGWGTRFSSFWCCYGTGL 483
W T + +FWCC GTGL
Sbjct: 418 GGGTWSTDYGTFWCCQGTGL 437
>gi|296331240|ref|ZP_06873712.1| hypothetical protein BSU6633_09061 [Bacillus subtilis subsp.
spizizenii ATCC 6633]
gi|296151355|gb|EFG92232.1| hypothetical protein BSU6633_09061 [Bacillus subtilis subsp.
spizizenii ATCC 6633]
Length = 761
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 132/376 (35%), Positives = 191/376 (50%), Gaps = 23/376 (6%)
Query: 117 LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSAS 176
+ + +Q EYLL LDVD L+ + Y GWE E+ GH +GH+LSA+
Sbjct: 10 MFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAGHSIGHWLSAA 67
Query: 177 AHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD-------RFE--ALKP 227
+ M+ ++ + LK K V+ LS Q GY+S F FD R + +L
Sbjct: 68 SAMYQASGDEKLKRKAEYAVNELSHIQQFDEEGYISGFSRACFDEVFSGDFRVDHFSLGG 127
Query: 228 VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNE 287
W P+Y++HK+ AGL+D Y N AL++ + ++ + + + + E+ L
Sbjct: 128 SWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRMLIC 183
Query: 288 ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQ 347
E GGMN+ + LY +T++ +L LA F L LA D++ G HANT IP VIG+
Sbjct: 184 EHGGMNEAMADLYILTKNKSYLDLAERFCHRAILQPLAEGKDELEGKHANTQIPKVIGAA 243
Query: 348 MRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTY 407
Y++TG+ Y+ FF + V YA GG S GE + + LG E+C TY
Sbjct: 244 KLYDITGNEAYRNPALFFWEQVVYQRSYAIGGNSIGEHFG--AEGSEELGVTTAETCNTY 301
Query: 408 NMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHG 467
NMLK++ HLFRW E + DYYE AL N +LS Q E G+ Y + G K
Sbjct: 302 NMLKLTGHLFRWFHEARFTDYYENALYNHILSSQ-DPESGMKTYFVSTQPGHFKV----- 355
Query: 468 WGTRFSSFWCCYGTGL 483
+ + SFWCC GTG+
Sbjct: 356 YCSPEDSFWCCTGTGM 371
>gi|305676227|ref|YP_003867899.1| hypothetical protein BSUW23_17775, partial [Bacillus subtilis
subsp. spizizenii str. W23]
gi|305414471|gb|ADM39590.1| hypothetical protein BSUW23_17775 [Bacillus subtilis subsp.
spizizenii str. W23]
Length = 497
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 132/376 (35%), Positives = 191/376 (50%), Gaps = 23/376 (6%)
Query: 117 LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSAS 176
+ + +Q EYLL LDVD L+ + Y GWE E+ GH +GH+LSA+
Sbjct: 10 MFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAGHSIGHWLSAA 67
Query: 177 AHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD-------RFE--ALKP 227
+ M+ ++ + LK K V+ LS Q GY+S F FD R + +L
Sbjct: 68 SAMYQASGDEKLKRKAEYAVNELSHIQQFDEEGYISGFSRACFDEVFSGDFRVDHFSLGG 127
Query: 228 VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNE 287
W P+Y++HK+ AGL+D Y N AL++ + ++ + + + + E+ L
Sbjct: 128 SWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRMLIC 183
Query: 288 ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQ 347
E GGMN+ + LY +T++ +L LA F L LA D++ G HANT IP VIG+
Sbjct: 184 EHGGMNEAMADLYILTKNKSYLDLAERFCHRAILQPLAEGKDELEGKHANTQIPKVIGAA 243
Query: 348 MRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTY 407
Y++TG+ Y+ FF + V YA GG S GE + + LG E+C TY
Sbjct: 244 KLYDITGNEAYRNPALFFWEQVVYQRSYAIGGNSIGEHFG--AEGSEELGVTTAETCNTY 301
Query: 408 NMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHG 467
NMLK++ HLFRW E + DYYE AL N +LS Q E G+ Y + G K
Sbjct: 302 NMLKLTGHLFRWFHEARFTDYYENALYNHILSSQ-DPESGMKTYFVSTQPGHFKV----- 355
Query: 468 WGTRFSSFWCCYGTGL 483
+ + SFWCC GTG+
Sbjct: 356 YCSPEDSFWCCTGTGM 371
>gi|373955475|ref|ZP_09615435.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
gi|373892075|gb|EHQ27972.1| protein of unknown function DUF1680 [Mucilaginibacter paludis DSM
18603]
Length = 782
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 135/391 (34%), Positives = 195/391 (49%), Gaps = 24/391 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
LK L +VKL P + A+ +L+Y++ L D L+ + + AG ++Y WE+
Sbjct: 24 LKTFRLQEVKLLPGIFN-DAENADLKYMMQLSPDKLLAPYLREAGLKPKAESYTNWEN-- 80
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE---- 217
L GH GHYLSA A M+AST + +++ +++ L CQ+K G+GY+ P
Sbjct: 81 SGLDGHIGGHYLSALAMMYASTGDKQALDRLNYMIAELKICQDKNGNGYVGGVPGSKELW 140
Query: 218 ----QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
Q D A+ W P+Y IHK AGL D YT+A N A K M+ F + +
Sbjct: 141 AAVMQGD-VGAINKKWVPFYNIHKTFAGLRDAYTYAGNETA----KVMLIKFADWFVMIA 195
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
T + ++ L E GG+N+VL +Y +T D K+L A+ F L L D ++
Sbjct: 196 TSITPQKMQEMLKTEHGGVNEVLADVYALTGDKKYLTAAYSFSHQAILEPLEQGQDKLNN 255
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
HANT IP VIG + +VT D Y FF V A GG S E ++ +
Sbjct: 256 LHANTQIPKVIGFKRISDVTADSNYNKAAQFFWQTVVQHRTVAIGGNSVREHFNPSNDFS 315
Query: 394 STLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYM 452
S + TE E+C TYNMLK++ L+ + Y DYYERAL N +LS +R G +Y
Sbjct: 316 SMITTEQGPETCNTYNMLKLTEDLYLSDPRVSYIDYYERALYNHILSTER--PGGGFVYF 373
Query: 453 LPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
P+ G Y + +S WCC G+G+
Sbjct: 374 TPMRPG-----HYRVYSQPQTSMWCCVGSGM 399
>gi|398384929|ref|ZP_10542957.1| hypothetical protein PMI04_02662 [Sphingobium sp. AP49]
gi|397722209|gb|EJK82754.1| hypothetical protein PMI04_02662 [Sphingobium sp. AP49]
Length = 802
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 130/400 (32%), Positives = 190/400 (47%), Gaps = 32/400 (8%)
Query: 103 KEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC 162
+ + L +L PS A + N YLL L+ D L+ +F+K AG G Y GWE+ T
Sbjct: 46 RALPLQATRLLPSPFA-DAVEGNRLYLLRLEPDRLLHNFRKHAGLTPKGAIYGGWENDT- 103
Query: 163 ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRF 222
+ GH +GHYL+A A M A T + + ++ L+ CQ G GY++ F + D
Sbjct: 104 -IAGHTLGHYLTALALMHAQTGDAECARRAAYIIDELAACQAAAGDGYVAGFTRRRDDVI 162
Query: 223 EA-------------------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
E L W P+Y HK+ AGL D N+QA + +
Sbjct: 163 EDGRLIFPEIMRGDIRSAGFDLNGCWVPFYNWHKLFAGLFDAEAHLGNSQARGVALALAA 222
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
Y + V K + L+ E GG+N+ L+ T DP+ L LA L
Sbjct: 223 Y----IDGVFAKLDDAQVQQVLDCEHGGINESFAELHARTGDPRWLALATRLRHRKVLDP 278
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
LA + + + HANT IP +IG +E+TG+ + FF + V + Y GG +
Sbjct: 279 LAQRQNSLPWIHANTQIPKLIGLARLHEITGNAADAIAANFFWETVVGQYSYVIGGNADR 338
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E++ DP ++ + + ESC +YNMLK++RHL+ W E DYYERA N +L+ Q
Sbjct: 339 EYFPDPGTISKHITEQTCESCNSYNMLKLTRHLYAWKPEARLFDYYERAHINHILAHQNP 398
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G+ YM+PL G S+ W F FWCC G+G+
Sbjct: 399 AT-GMFAYMVPLMSG-----SHRVWSEPFDDFWCCVGSGM 432
>gi|312131189|ref|YP_003998529.1| hypothetical protein Lbys_2513 [Leadbetterella byssophila DSM
17132]
gi|311907735|gb|ADQ18176.1| protein of unknown function DUF1680 [Leadbetterella byssophila DSM
17132]
Length = 737
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 133/397 (33%), Positives = 202/397 (50%), Gaps = 33/397 (8%)
Query: 103 KEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC 162
+ + L+ VKL + AQ +L+Y+L LD D L+ ++ AG + Y WE +
Sbjct: 18 QNIPLNQVKLK-EGVFKNAQDVDLKYILALDPDKLLAPYRIDAGLEKKAERYGNWE--SS 74
Query: 163 ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS-----E 217
L GH GHYLSA A ++AS+ LK+++ +VS L+ CQ K G+GY+ P E
Sbjct: 75 GLDGHIGGHYLSALAMLYASSGEPELKKRLDYMVSELAACQKKNGNGYVGGIPQGKVFWE 134
Query: 218 QFDRFE------ALKPVWAPYYTIHKILAGLLDQYTFADNTQALK----MTKWMVEYFYN 267
+ + + L W P Y IHK+ AGL D Y F N +AL ++ WM+E F
Sbjct: 135 RIGKGDIDGSSFGLNNTWVPLYNIHKLFAGLYDAYHFTGNNEALTVLTGLSDWMIELF-- 192
Query: 268 RVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQ 327
+ +T VE+ L E GG+N+ +Y+ T + K+L A F + FL +
Sbjct: 193 ---SALTDEQVEK---VLRTEHGGLNEAFLDVYSATGEQKYLRAAERFTQKAFLQPMIEG 246
Query: 328 ADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWS 387
D ++G HANT IP ++G++ +VT + + ++F D V A GG S E +
Sbjct: 247 KDILTGLHANTQIPKMVGAEKISQVTKNQDWHKGASYFWDNVALHRSVAFGGNSYREHFH 306
Query: 388 DPKRLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP 446
+ R L T + E+C +YNMLK+S+ L+ T + Y D+YE+ L N +LS Q E
Sbjct: 307 ELDRFDKMLETNQGPETCNSYNMLKLSKALYESTGDNKYLDFYEKTLFNHILSSQH-PEK 365
Query: 447 GVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P+ + Y + +S WCC GTGL
Sbjct: 366 GGFVYFTPI-----RPNHYRVYSQPETSMWCCVGTGL 397
>gi|398305096|ref|ZP_10508682.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus vallismortis
DV1-F-3]
Length = 762
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 134/386 (34%), Positives = 196/386 (50%), Gaps = 24/386 (6%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
+ DV L + + +Q EYLL LDVD L+ + Y GWE E+ G
Sbjct: 1 MEDVTL-LKGMFYDSQMKGKEYLLFLDVDRLLAPCYEAVSQTPKKPRYGGWE--AKEIAG 57
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD------ 220
H VGH+LSA++ M+ ++ + LK K V+ LS Q GY+S F FD
Sbjct: 58 HSVGHWLSAASAMYRASGDEELKRKTAYAVNELSHIQQFDQEGYVSGFSRACFDEVFSGD 117
Query: 221 -RFE--ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYS 277
R + +L W P+Y++HK+ AGL+D Y N AL++ + ++ + + + +
Sbjct: 118 FRVDHFSLGGSWVPWYSLHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLN 173
Query: 278 VERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHAN 337
E+ L E GGMN+ + LY +T++ +L LA F L LA D++ G HAN
Sbjct: 174 DEQFQRMLICEHGGMNEAMADLYMLTKNKAYLELAERFCHRAILQPLAEGKDELEGKHAN 233
Query: 338 THIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG 397
T IP VIG+ Y++TG+ Y+ FF + V YA GG S GE + + LG
Sbjct: 234 TQIPKVIGAAKLYDITGNEAYRNAALFFWEQVVYQRSYAIGGNSIGEHFG--AEGSEELG 291
Query: 398 TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 457
E+C TYNMLK++ HLFRW +E + DYYE AL N +L+ Q + G+ Y +
Sbjct: 292 VTTAETCNTYNMLKLTAHLFRWFQESKFMDYYENALYNHILASQ-DPDSGMKTYFVSTQP 350
Query: 458 GDSKAKSYHGWGTRFSSFWCCYGTGL 483
G K + + SFWCC GTG+
Sbjct: 351 GHFKV-----YCSPEDSFWCCTGTGM 371
>gi|380512705|ref|ZP_09856112.1| hypothetical protein XsacN4_15862 [Xanthomonas sacchari NCPPB 4393]
Length = 799
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 132/403 (32%), Positives = 198/403 (49%), Gaps = 35/403 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ + L V L PS L + QTN YLL L+ D L+ +F + AG P G Y GWE T
Sbjct: 54 VQALPLQQVTLKPS-LFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGAVYGGWEGDT 112
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
+ GH +GHYLSA A M A T + L+E++ +V+ L+ Q + GY+ F + + D+
Sbjct: 113 --IAGHTLGHYLSALAKMHAQTRDPVLRERIDYIVAELARAQAQDPDGYVGGF-TRKNDK 169
Query: 222 FEA---------------------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKW 260
E L W+P YT HK+ AGLLD + A + QAL++
Sbjct: 170 GEIEGGKAVLEDVRRGIIKGSKFNLNGSWSPLYTQHKLFAGLLDAHALAGSKQALEVLLP 229
Query: 261 MVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCF 320
+ Y V + L+ E GG+N+ L T D + + +
Sbjct: 230 LAAY----TAGVFDALDHAQMQTLLDTEFGGLNESYIELGARTGDARWVAIGKRLRHEKV 285
Query: 321 LGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGT 380
+ A D++ HANT +P IG ++EV GD FF + V A + Y GG
Sbjct: 286 IDPAAAGRDELPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTAHYSYVIGGN 345
Query: 381 SAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSI 440
+ E++ +P +A+ L + E C +YNMLK++RHL++WT + Y DYYER L N ++
Sbjct: 346 ADREYFQEPDTIAAFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAA 405
Query: 441 QRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Q G+ YM P+ G + G+ +F SFWCC G+G+
Sbjct: 406 QHPAT-GMFTYMTPMISGGER-----GFSDKFDSFWCCVGSGM 442
>gi|408393860|gb|EKJ73118.1| hypothetical protein FPSE_06731 [Fusarium pseudograminearum CS3096]
Length = 623
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 147/403 (36%), Positives = 193/403 (47%), Gaps = 29/403 (7%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQ-TNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG-W 157
D L DV L S W Q + YLL +D D L++ F+K G T G A G W
Sbjct: 29 DLADAFELSDVSLTDS--RWMDNQGRTVNYLLSIDPDRLLYVFRKNHGLDTKGAAKNGGW 86
Query: 158 EDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQN---KMG--SGYLS 212
+ P R H GH+LSA ++ +A+ N + + V L++CQ K+G SGYLS
Sbjct: 87 DAPDFPFRSHVQGHFLSAWSNCYATLGNKECGSRASYFVKELAKCQANNAKVGFTSGYLS 146
Query: 213 AFPSEQFDRFE--ALKPVWAPYYTIHKILAGLLDQYT-FADN---TQALKMTKWMVEYFY 266
FP + + E L PYY IHK LAGLLD Y DN T L + W
Sbjct: 147 GFPESEITKVEDRTLSSGNVPYYAIHKTLAGLLDVYRRVGDNDAKTVMLSLASW------ 200
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
V K S + + E GGMN+VL + TQD K L +A FD L
Sbjct: 201 --VDARTGKLSYAKMQQMMQTEFGGMNEVLADIAYYTQDNKWLKVAQRFDHAAIFDPLQN 258
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
D +SG HANT +P IG+ Y+V+GD Y G D+ H YA GG S E +
Sbjct: 259 NVDKLSGLHANTQVPKWIGALREYKVSGDKKYLDIGRNAWDLTVHKHTYAIGGNSQAEHF 318
Query: 387 SDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTK-EMVYADYYERALTNGVLSIQRGTE 445
+P +A L + E+C TYNMLK++R L+ + Y DYYE AL N +L Q +
Sbjct: 319 REPNAIAKYLTKDTCEACNTYNMLKLTRELWALNPTDASYFDYYENALMNHLLGQQNPKD 378
Query: 446 P-GVMIYMLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G + Y PL RG A W T ++SFWCC G+G+
Sbjct: 379 SHGHVTYFTPLTPGGRRGVGPAWGGGTWSTDYNSFWCCQGSGI 421
>gi|238059692|ref|ZP_04604401.1| secreted protein [Micromonospora sp. ATCC 39149]
gi|237881503|gb|EEP70331.1| secreted protein [Micromonospora sp. ATCC 39149]
Length = 740
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 133/376 (35%), Positives = 185/376 (49%), Gaps = 18/376 (4%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG-WEDPTCELRGHFVGHYLSASAHMW 180
Q L YL +DVD L+++F+ T G A G W+ P+ R H GH+L+A A +
Sbjct: 32 QNRTLSYLRFVDVDRLLYNFRANHRLSTNGAASNGGWDAPSFPFRTHVQGHFLTAWAQAY 91
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGS-----GYLSAFPSEQFDRFEA--LKPVWAPYY 233
A + T ++K +V+ L++CQ G+ GYLS FP F EA L PYY
Sbjct: 92 AVLGDTTCRDKANYMVAELAKCQANNGAAGFTAGYLSGFPESDFTALEARTLSNGNVPYY 151
Query: 234 TIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMN 293
IHK L GLLD + + NTQA + + + R + S + L E GGMN
Sbjct: 152 CIHKTLLGLLDVWRYIGNTQARSVLLALAGWVDTRT----ARLSSSQMQAMLGTEFGGMN 207
Query: 294 DVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVT 353
+ L LY T D + L +A FD LA +D ++G HANT +P IG+ Y+ T
Sbjct: 208 EALADLYQQTGDGRWLTVAQRFDHAAVFNPLAANSDQLNGLHANTQVPKWIGAAREYKAT 267
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVS 413
G Y+ + ++ +H YA GG S E + P +A L + E C T NMLK++
Sbjct: 268 GTTRYRDIASNAWNMTVNAHTYAIGGNSQAEHFRAPNAIAGYLTNDTCEHCNTVNMLKLT 327
Query: 414 RHLFRWT-KEMVYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGDSKAKSYHG 467
R L+ + Y DY+ERAL N V+ Q + G + Y PL RG A
Sbjct: 328 RELWLIDPNQAAYFDYFERALANHVIGAQNPADGHGHVTYFTPLKPGGRRGVGPAWGGGT 387
Query: 468 WGTRFSSFWCCYGTGL 483
W T + SFWCC GTG+
Sbjct: 388 WSTDYDSFWCCQGTGI 403
>gi|383641951|ref|ZP_09954357.1| hypothetical protein SchaN1_14318 [Streptomyces chartreusis NRRL
12338]
Length = 768
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 139/380 (36%), Positives = 186/380 (48%), Gaps = 26/380 (6%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG-WEDPTCELRGHFVGHYLSASAHMW 180
Q YL +DVD L+++F+ T G A G W+ P R H GH+L+A A ++
Sbjct: 66 QDRTRNYLRFVDVDRLLYNFRANHRLSTNGAAANGGWDAPDFPFRTHVQGHFLTAWAQLY 125
Query: 181 ASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQFDRFE--ALKPVWAPYY 233
A T + T ++K T +V+ L++CQ +GYLS +P F E L PYY
Sbjct: 126 AVTGDTTCRDKATTMVAELAKCQANNSTAGFNAGYLSGYPESDFTALEQRTLSNGNVPYY 185
Query: 234 TIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
TIHK L GLLD + +TQA L + W V++ R+ S ++ L E
Sbjct: 186 TIHKTLVGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRL-------SGQQMQAMLQTEF 237
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GGMN VL LY T D + L +A FD LA D +SG HANT +P IG+
Sbjct: 238 GGMNTVLTDLYQQTGDARWLTVARRFDHAAVFDPLAAGQDQLSGLHANTQVPKWIGAARE 297
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
Y+ TG Y+ T +I SH YA GG S E + P +A L + ESC T+NM
Sbjct: 298 YKATGTTRYRDIATNAWNICVNSHTYAIGGNSQAEHFRAPNAIAGFLNKDTCESCNTFNM 357
Query: 410 LKVSRHLFRWTKEMVYA-DYYERALTNGVLSIQR-GTEPGVMIYMLPLG----RGDSKAK 463
L ++R LF V DYYERA N ++ Q + G + Y PL RG A
Sbjct: 358 LTLTRELFALDPNRVALFDYYERAWLNQMIGQQNPADDHGHVTYFTPLNPGGRRGVGPAW 417
Query: 464 SYHGWGTRFSSFWCCYGTGL 483
W T + +FWCC GTGL
Sbjct: 418 GGGTWSTDYGTFWCCQGTGL 437
>gi|347738800|ref|ZP_08870212.1| hypothetical protein AZA_89687 [Azospirillum amazonense Y2]
gi|346918071|gb|EGY00199.1| hypothetical protein AZA_89687 [Azospirillum amazonense Y2]
Length = 804
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 134/396 (33%), Positives = 193/396 (48%), Gaps = 32/396 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L V+L PS A NL YL L+ D L+ +F+ AG G AY GWE T + G
Sbjct: 40 LSAVRLKPSPFKA-AVDANLAYLHSLEADRLLHNFRSGAGLQPKGAAYGGWEGDT--IAG 96
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALK 226
H +GHYLSA + M A T + K ++ +V+ L+ECQ G GY++ F ++ D E K
Sbjct: 97 HTLGHYLSALSLMHAQTGDAECKRRVDYIVAELAECQKAQGDGYVAGFTRKRGDIVEDGK 156
Query: 227 PV-------------------WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYN 267
V W P Y HK+ GL D T NTQAL + + Y
Sbjct: 157 VVFDELRRGEIRSAGFDLNGCWVPLYNWHKLYTGLFDAQTLCGNTQALDVGVKLGGY--- 213
Query: 268 RVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQ 327
+ V + + E+ L+ E GG+N+ LY T D + LLLA L L+
Sbjct: 214 -IDEVFSHLNDEQVQKVLDCEHGGINESFAELYARTGDRRWLLLAERLYHAKVLVPLSEG 272
Query: 328 ADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWS 387
D+++ HANT IP +IG E+TG + FF V +H Y GG + E++
Sbjct: 273 RDELANIHANTQIPKLIGLARLAELTGSERHAKASAFFWQTVTTNHSYVIGGNADREYFQ 332
Query: 388 DPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPG 447
+P+ ++ + + E C +YNMLK++R L+ + Y D+YERA N VL+ Q+ G
Sbjct: 333 EPRSISRHITEQTCEGCNSYNMLKLTRLLYARQADAHYFDFYERAHLNHVLA-QQNPATG 391
Query: 448 VMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ YM PL G ++ + T FWCC GTG+
Sbjct: 392 MFTYMTPLMSGSARE-----FSTPTEDFWCCVGTGM 422
>gi|451851952|gb|EMD65250.1| hypothetical protein COCSADRAFT_141970 [Cochliobolus sativus
ND90Pr]
Length = 620
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 135/386 (34%), Positives = 191/386 (49%), Gaps = 23/386 (5%)
Query: 115 SSLHWRAQQT-NLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCELRGHFVGHY 172
S+ W+ + L YL ++VD L+++F+ T T G + GW+ P R H GHY
Sbjct: 45 SNSRWKDNENRTLNYLKFVNVDRLLYNFRATHKLSTNGAQPNGGWDAPNFPFRSHVQGHY 104
Query: 173 LSASAHMWASTHNVTLKEKMTAVVSALSECQNKMG-----SGYLSAFPSEQFDRFEALKP 227
L+A + +A+ + T K++ V L++CQ G GYLS FP +F EA K
Sbjct: 105 LTAWVNCYATLRDSTCKDRAAYFVQELAKCQANNGVAGFSPGYLSGFPESEFAALEAGKL 164
Query: 228 VWA--PYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSL 285
PYY +HK +AGLLD + + +A + + + R + K S + L
Sbjct: 165 TGGNVPYYAVHKTMAGLLDAWRIIGDQKARDVLLALAGWVDGRTK----KLSTAQMQTML 220
Query: 286 NEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIG 345
E GGMNDVL +Y +T + + L +A FD LA + D +SG HANT +P IG
Sbjct: 221 GTEFGGMNDVLAEIYQLTGNKQWLTVAQRFDHAKVFDPLANKQDQLSGNHANTQVPKWIG 280
Query: 346 SQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCT 405
+ Y+ TG Y D +H YA GG S E + P ++++ L + E C
Sbjct: 281 AAREYKSTGTKRYLDIARNAWDFTINAHTYAIGGNSQAEHFRPPNQISNFLTNDTAEQCN 340
Query: 406 TYNMLKVSRHLFRWTKEMV---YADYYERALTNGVLSIQRGTEP-GVMIYMLPL----GR 457
TYNMLK++R L WT + Y DYYERAL N +L Q + G + Y PL R
Sbjct: 341 TYNMLKLTRDL--WTTDPTSTKYFDYYERALINHLLGAQNAADNHGHITYFTPLRSGGRR 398
Query: 458 GDSKAKSYHGWGTRFSSFWCCYGTGL 483
G A W T ++SFWCC GT L
Sbjct: 399 GVGPAWGGGTWSTDYNSFWCCQGTAL 424
>gi|423313734|ref|ZP_17291670.1| hypothetical protein HMPREF1058_02282 [Bacteroides vulgatus
CL09T03C04]
gi|392684669|gb|EIY77993.1| hypothetical protein HMPREF1058_02282 [Bacteroides vulgatus
CL09T03C04]
Length = 783
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 130/393 (33%), Positives = 199/393 (50%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
+ DV+L S A+ ++ YLL +D D L+ + K AG + Y WE+ L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE- 223
H GHYLSA ++M+A+T N +K ++ ++S L CQ+ G GYL P+ + + E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 224 --------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
L W P Y IHKI AGL D D+ +A +K+T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMIR-------- 201
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+++K S E+ + L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ G+ + +F + V GG S E +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 392 LASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S L +E E+C TYNML++++ L+ + ++ + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG-FV 380
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ +A Y + +SFWCC G+G+
Sbjct: 381 YFTPM-----RAGHYRVYSQPQTSFWCCVGSGM 408
>gi|319640591|ref|ZP_07995310.1| hypothetical protein HMPREF9011_00907 [Bacteroides sp. 3_1_40A]
gi|345517952|ref|ZP_08797412.1| acetyl-CoA carboxylase [Bacteroides sp. 4_3_47FAA]
gi|254835150|gb|EET15459.1| acetyl-CoA carboxylase [Bacteroides sp. 4_3_47FAA]
gi|317387761|gb|EFV68621.1| hypothetical protein HMPREF9011_00907 [Bacteroides sp. 3_1_40A]
Length = 783
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 130/393 (33%), Positives = 199/393 (50%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
+ DV+L S A+ ++ YLL +D D L+ + K AG + Y WE+ L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE- 223
H GHYLSA ++M+A+T N +K ++ ++S L CQ+ G GYL P+ + + E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 224 --------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
L W P Y IHKI AGL D D+ +A +K+T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMIR-------- 201
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+++K S E+ + L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ G+ + +F + V GG S E +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 392 LASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S L +E E+C TYNML++++ L+ + ++ + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG-FV 380
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ +A Y + +SFWCC G+G+
Sbjct: 381 YFTPM-----RAGHYRVYSQPQTSFWCCVGSGM 408
>gi|330467876|ref|YP_004405619.1| hypothetical protein VAB18032_19585 [Verrucosispora maris
AB-18-032]
gi|328810847|gb|AEB45019.1| hypothetical protein VAB18032_19585 [Verrucosispora maris
AB-18-032]
Length = 913
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 137/380 (36%), Positives = 188/380 (49%), Gaps = 26/380 (6%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA-YEGWEDPTCELRGHFVGHYLSASAHMW 180
Q L YL +DV+ L+++F+ TAG A GWE PT R H GH+L+A +HMW
Sbjct: 67 QNRTLNYLRFVDVNRLLYNFRANHRLSTAGAAALGGWEAPTFPFRTHSQGHFLTAWSHMW 126
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGS-----GYLSAFPSEQFDRFEA--LKPVWAPYY 233
A + T ++K +V+ L++CQ + GYL +P F EA L PYY
Sbjct: 127 AVLGDTTCRDKANYMVAELAKCQANNAAAGFNPGYLCGYPESDFTAVEARTLNNGNVPYY 186
Query: 234 TIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
TIHK L GLLD + N QA L + W V++ R+ + + L E
Sbjct: 187 TIHKTLVGLLDVWRHIGNNQARDVLLALAGW-VDWRTGRLSSAQMQAM-------LGTEF 238
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GGMN VL LY T D + L +A FD LA D ++G HANT IP IG+
Sbjct: 239 GGMNAVLTDLYQQTGDARWLTVAQRFDHAAVFNPLAANQDQLNGLHANTQIPKWIGAARE 298
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
++ TG Y+ + ++ + YA GG S E + P ++ L + E C TYNM
Sbjct: 299 FKATGTTRYRDIASNAWNLTVNTRTYAIGGNSQAEHFRAPNAISGYLRNDTCEHCNTYNM 358
Query: 410 LKVSRHLFRWTKEMV-YADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGDSKAK 463
LK++R L+ V Y D+YERAL N ++ Q + G + Y PL RG A
Sbjct: 359 LKLTRELWLLDPNRVAYFDFYERALLNHLIGAQNPADNHGHITYFTPLQPGGRRGVGPAW 418
Query: 464 SYHGWGTRFSSFWCCYGTGL 483
W T ++SFWCC GTGL
Sbjct: 419 GGGTWSTDYNSFWCCQGTGL 438
>gi|436835729|ref|YP_007320945.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
gi|384067142|emb|CCH00352.1| protein of unknown function DUF1680 [Fibrella aestuarina BUZ 2]
Length = 760
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 131/398 (32%), Positives = 193/398 (48%), Gaps = 33/398 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ +L DVKL AQ + Y+L L+ D L+ + AG P Y WE +
Sbjct: 22 MQPFALQDVKLTGGPFK-NAQDVDQRYILALNPDKLLAPYLIDAGLPVKAPRYGNWE--S 78
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS----- 216
L GH GHYLSA A ++AST + LK+++ +V L++CQ K G+GY+ P
Sbjct: 79 SGLDGHIGGHYLSALAMLYASTGDAELKKRLDYMVDQLAQCQAKNGNGYVGGIPQGKVFW 138
Query: 217 EQFDRFE------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFY 266
E+ + + L W P Y IHK+ AGL D Y +A N QA + + W VE
Sbjct: 139 ERIHKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDAYEYAGNQQAKQVLIGLGDWFVE--- 195
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
+I S E+ L E GG+N+ LY +T D K+L A L L
Sbjct: 196 -----LIKPLSDEQIQQVLRTEHGGINETFADLYILTNDKKYLETAQRISHRAILEPLLA 250
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
+ D ++G HANT IP VIG + + G P + T+F V+ A GG S E +
Sbjct: 251 KQDKLTGLHANTQIPKVIGFEKIAMLAGKPDWSDAATYFWQNVSQHRSVAFGGNSVREHF 310
Query: 387 SDPKRLASTL-GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
+ + L + E+C ++NML++S+ LF ++ Y D+YERAL N +LS Q E
Sbjct: 311 NPTTDFSQVLRSNQGPETCNSFNMLRLSKALFLDKSDVTYLDFYERALYNHILSSQH-PE 369
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P+ + Y + +S WCC G+G+
Sbjct: 370 KGGFVYFTPI-----RPNHYRVYSQPETSMWCCVGSGI 402
>gi|302818287|ref|XP_002990817.1| hypothetical protein SELMODRAFT_429245 [Selaginella moellendorffii]
gi|300141378|gb|EFJ08090.1| hypothetical protein SELMODRAFT_429245 [Selaginella moellendorffii]
Length = 226
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 98/150 (65%), Positives = 118/150 (78%), Gaps = 4/150 (2%)
Query: 158 EDPTCELRGHFVG----HYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
E+ +C L+ HYLSASA WASTHN+T+ E M AVV+AL+ECQ K+G+GYLSA
Sbjct: 8 EEISCHLKQQTACKDKRHYLSASAMTWASTHNLTIYENMNAVVAALAECQAKIGTGYLSA 67
Query: 214 FPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FP+ FDRFEAL+ VWAPYYTIHKI+AGLLDQYT+A N+ A +M M +YF +RV+ VI
Sbjct: 68 FPTSLFDRFEALESVWAPYYTIHKIMAGLLDQYTYAANSFAFEMLLGMTDYFGSRVERVI 127
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTIT 303
KYS+ERHW SLNEETGGMNDVLYR+Y IT
Sbjct: 128 EKYSIERHWQSLNEETGGMNDVLYRVYQIT 157
>gi|407790778|ref|ZP_11137869.1| Acetyl-CoA carboxylase, biotin carboxylase [Gallaecimonas
xiamenensis 3-C-1]
gi|407202325|gb|EKE72317.1| Acetyl-CoA carboxylase, biotin carboxylase [Gallaecimonas
xiamenensis 3-C-1]
Length = 780
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 195/402 (48%), Gaps = 41/402 (10%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L+ + L +V+L PS +AQ TN YL LD D L+ F+ AG P Y WE
Sbjct: 20 LETLPLQEVRLLPSPFK-QAQDTNRHYLDSLDPDRLLAPFRAEAGLPQPKPGYGNWE--A 76
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE---- 217
L GH GHYLSA + M+AST + L ++ ++ L +CQ+K+G+GY+ P
Sbjct: 77 DGLGGHMGGHYLSALSLMYASTGDPALLARLQYMLDELKKCQDKLGTGYIGGVPGGSALW 136
Query: 218 --------QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKM-------TKWMV 262
Q D F L W P+Y +HK+ AGL D Y + + QAL M T W+V
Sbjct: 137 QQIHQGDIQADLF-TLNQKWVPWYNLHKLYAGLRDAYRYTGSAQALAMWIKLSDWTDWLV 195
Query: 263 EYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
E S E+ L E GGMN+V LY IT K+L LA F + L
Sbjct: 196 EGL-----------SDEQMQAMLVTEYGGMNEVFADLYEITGQDKYLQLAKRFSQQQLLQ 244
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
LA D ++G HANT IP VIG + +V+GD +F V A GG S
Sbjct: 245 PLAHGQDQLNGLHANTQIPKVIGFERIAQVSGDRAMGAAADYFWHQVVEQRTVAIGGNSV 304
Query: 383 GEFWSDPKRLASTL-GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E + +S + E E+C +YNMLK++R L++ + Y YYERAL N +L+ Q
Sbjct: 305 REHFHPKDDFSSMVEEVEGPETCNSYNMLKLARLLYQRQGGLDYLAYYERALYNHILASQ 364
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G ++Y P+ + Y + + WCC G+G+
Sbjct: 365 H-PDDGGLVYFTPM-----RPNHYRVYSQADKAMWCCVGSGI 400
>gi|285018715|ref|YP_003376426.1| hypothetical protein XALc_1948 [Xanthomonas albilineans GPE PC73]
gi|283473933|emb|CBA16434.1| conserved hypothetical protein [Xanthomonas albilineans GPE PC73]
Length = 810
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 132/402 (32%), Positives = 195/402 (48%), Gaps = 33/402 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ + L V L PS L + QTN YLL L+ D L+ +F + AG P G Y GWE T
Sbjct: 62 VQALPLRQVTLKPS-LFLDSLQTNRRYLLELEPDRLLHNFLQYAGLPPKGAVYGGWEGDT 120
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD- 220
+ GH +GHYLSA + M A T + +L+ ++ +V+ L+ Q + GY+ F + +
Sbjct: 121 --IAGHTLGHYLSALSKMHAQTRDSSLRTRIDYIVAELARAQAQDPDGYVGGFTRKNDNG 178
Query: 221 RFEALKPV-------------------WAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
+ E K V W+P YT HK+ AGLLD + N QAL + +
Sbjct: 179 KIEGGKAVLEDLRRGIIKGGKFNLNGSWSPLYTQHKLFAGLLDAHALGGNAQALTVLVKV 238
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
YF V + L+ E GG+N+ L T + + + +
Sbjct: 239 AGYF----AGVFDALDHAQMQTLLDTEFGGLNESFIELGARTGQERWIAIGKRLRHEKII 294
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
LA D + HANT +P IG ++EV GD FF + V A + Y GG S
Sbjct: 295 DPLAAGHDVLPHIHANTQVPKFIGEARQFEVAGDADAAAAARFFWETVTAHYSYVIGGNS 354
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E++ +P +A L + E C +YNMLK++RHL++WT + Y DYYER L N ++ Q
Sbjct: 355 DREYFQEPDSIAGFLTEQTCEHCNSYNMLKLTRHLYQWTPQARYFDYYERTLHNHTMAAQ 414
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G+ YM P+ G + G+ +F SFWCC G+G+
Sbjct: 415 HPAT-GMFTYMTPMISGGER-----GFSEKFDSFWCCVGSGM 450
>gi|429858822|gb|ELA33628.1| secreted protein [Colletotrichum gloeosporioides Nara gc5]
Length = 623
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 133/380 (35%), Positives = 189/380 (49%), Gaps = 26/380 (6%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAG-SPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
Q L YL +DV+ L+++F+K G S +A GW+ P R HF GH+L+A A +
Sbjct: 58 QARTLTYLKWVDVERLLYNFRKNHGLSTNNAQANGGWDAPDFPFRTHFQGHFLNAWAFCY 117
Query: 181 ASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQFDRFE--ALKPVWAPYY 233
A H+ K++ T + L +CQ +GYLS FP + E +L PYY
Sbjct: 118 AQLHDTECKDRATYFAAELKKCQANNANVGFNTGYLSGFPESEITAVEDRSLSNGNVPYY 177
Query: 234 TIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
IHK +AGLLD + +T A L+M W V K + + N ++ E
Sbjct: 178 AIHKTMAGLLDVWRHIGDTNARDVLLEMAAW--------VDLRTGKLTYAQMQNMMSTEF 229
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GGMN+V+ ++ T D + L +A FD LA D ++G HANT +P IG+
Sbjct: 230 GGMNEVMADIFHQTGDQRWLTVAQRFDHAAIFDPLASNQDSLNGLHANTQVPKWIGASRE 289
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
Y+ TG Y+ +I ++H YA GG S E + P +A L ++ E+C TYNM
Sbjct: 290 YKATGTSRYQDIARNAWNITVSAHSYAIGGNSQAEHFRLPNAIAGFLNSDTCEACNTYNM 349
Query: 410 LKVSRHLFRWTKEMV-YADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGDSKAK 463
LK++R L+ Y D+YERAL N +L Q ++ G + Y PL RG A
Sbjct: 350 LKLTRELWLTNPSATHYFDFYERALLNHLLGQQDPSDSHGHITYFTPLNPGGRRGVGPAW 409
Query: 464 SYHGWGTRFSSFWCCYGTGL 483
W T + SFWCC GTGL
Sbjct: 410 GGGTWSTDYDSFWCCQGTGL 429
>gi|150003078|ref|YP_001297822.1| hypothetical protein BVU_0490 [Bacteroides vulgatus ATCC 8482]
gi|149931502|gb|ABR38200.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 783
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 130/393 (33%), Positives = 198/393 (50%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
+ DV+L S A+ ++ YLL +D D L+ + K AG + Y WE+ L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE- 223
H GHYLSA ++M+A+T N +K ++ ++S L CQ+ G GYL P+ + + E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 224 --------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
L W P Y IHKI AGL D D+ +A +K+T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTDSREAKEMLVKLTDWMIR-------- 201
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+++K S E+ L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LVSKLSDEQIQEMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHHTVLQPLLRQEDKL 261
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ G+ + +F + V GG S E +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 392 LASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S L +E E+C TYNML++++ L+ + ++ + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADVHFMDYYERALYNHILSTQDPVQGG-FV 380
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ +A Y + +SFWCC G+G+
Sbjct: 381 YFTPM-----RAGHYRVYSQPQTSFWCCVGSGM 408
>gi|294775898|ref|ZP_06741397.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|294450267|gb|EFG18768.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 783
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 130/393 (33%), Positives = 197/393 (50%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
+ DV+L S A+ ++ YLL +D D L+ + K AG + Y WE+ L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLSPKAENYTNWEN--TGLDG 89
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE- 223
H GHYLSA ++M+A+T N +K ++ ++S L CQ+ G GYL P+ + + E
Sbjct: 90 HIGGHYLSALSYMYAATGNKEIKARLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIED 149
Query: 224 --------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
L W P Y IHKI AGL D N +A +K+T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKIYAGLRDATLQTGNKEAKEMLVKLTDWMIR-------- 201
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+++K S E+ + L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LVSKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ G+ + +F + V GG S E +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVNHRSITIGGNSVREHFHPADD 321
Query: 392 LASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S L +E E+C TYNML++++ L+ + + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHFMDYYERALYNHILSTQDPVQGG-FV 380
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ +A Y + +SFWCC G+G+
Sbjct: 381 YFTPM-----RAGHYRVYSQPQTSFWCCVGSGM 408
>gi|347528202|ref|YP_004834949.1| hypothetical protein SLG_18170 [Sphingobium sp. SYK-6]
gi|345136883|dbj|BAK66492.1| hypothetical protein SLG_18170 [Sphingobium sp. SYK-6]
Length = 805
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 131/400 (32%), Positives = 192/400 (48%), Gaps = 32/400 (8%)
Query: 103 KEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC 162
+ + L +L PS + A N YLL L+ D L+ +F AG G+AY GWE T
Sbjct: 44 RPLPLSATRLLPSP-YADAVDANRRYLLQLEPDRLLHNFLVHAGLEPKGEAYGGWEGDT- 101
Query: 163 ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRF 222
+ GH +GHY++A A M A T + + +V L Q G GY++ F D
Sbjct: 102 -IAGHTLGHYMTALALMHAQTGDAECARRALYIVDELERAQKASGDGYVAGFTRRNGDVV 160
Query: 223 EALKPV-------------------WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
E K + W P+Y HK+ AGL D T+ + +A+ + +
Sbjct: 161 EDGKAIFPEIMAGDIRSAGFDLNGCWVPFYNWHKLYAGLFDIQTWIGSDKAIPIAVSLSG 220
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
Y ++ V + L+ E GG+N+ L+ T DP+ L LA L
Sbjct: 221 Y----IEKVFASLDDTQLQTVLDCEHGGINESFAELHVRTGDPRWLALAERIRHRKVLDP 276
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
L+ + + HANT IP VIG +E+TG + + +F D V + Y GG +
Sbjct: 277 LSRGENSLPWIHANTQIPKVIGLARLHEITGRADHAIAARYFWDTVVHRYSYVIGGNADR 336
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E++ DP ++ + + ESC TYNMLK++RHL+ W E DYYERA N +L+ QR
Sbjct: 337 EYFPDPDTVSRHITEQTCESCNTYNMLKLTRHLYAWRPEASLFDYYERAHINHILAQQR- 395
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
T+ G+ YM+PL G +A W F SFWCC G+G+
Sbjct: 396 TDNGMFAYMVPLMSGTHRA-----WSDPFDSFWCCVGSGI 430
>gi|46113732|ref|XP_383116.1| hypothetical protein FG02940.1 [Gibberella zeae PH-1]
Length = 1393
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 142/403 (35%), Positives = 190/403 (47%), Gaps = 29/403 (7%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQ-TNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG-W 157
D L DV L S W Q + YLL +D D L++ F+K G T G G W
Sbjct: 29 DLADAFELSDVSLTDS--RWMDNQGRTVNYLLSIDPDRLLYVFRKNHGLDTKGATKNGGW 86
Query: 158 EDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMG-----SGYLS 212
+ P R H GH+L+A ++ +A+ N + + V L++CQ K SGYLS
Sbjct: 87 DAPDFPFRSHVQGHFLTAWSNCYATLGNKECGSRASYFVKELAKCQAKNAKAGFTSGYLS 146
Query: 213 AFPSEQFDRFE--ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFY 266
FP + + E L PYY IHK LAGLLD Y + A L + W
Sbjct: 147 GFPESEIAKVENRTLNNGNVPYYAIHKTLAGLLDVYRRVGDNDAKAVMLSLAGW------ 200
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
V K S + + E GGMN+VL + TQD K L +A FD L
Sbjct: 201 --VDTRTGKLSYAQMQQMMQTEFGGMNEVLADIAYYTQDNKWLKVAQRFDHAAIFDPLQN 258
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
D +SG HANT +P IG+ Y+V+GD Y G D+ H YA GG S E +
Sbjct: 259 NVDKLSGLHANTQVPKWIGALREYKVSGDKKYLDIGRNAWDLTVHKHTYAIGGNSQAEHF 318
Query: 387 SDPKRLASTLGTENEESCTTYNMLKVSRHLFRW-TKEMVYADYYERALTNGVLSIQRGTE 445
DP +A L ++ E+C TYNMLK++R L+ + Y D+YE AL N +L Q +
Sbjct: 319 RDPDAIAKYLTSDTCEACNTYNMLKLTRELWALDPSDASYFDFYENALMNHLLGQQNPKD 378
Query: 446 P-GVMIYMLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G + Y PL RG A W T ++SFWCC G+G+
Sbjct: 379 NHGHVTYFTPLNPGGRRGVGPAWGGGTWSTDYNSFWCCQGSGI 421
>gi|198275797|ref|ZP_03208328.1| hypothetical protein BACPLE_01972 [Bacteroides plebeius DSM 17135]
gi|198271426|gb|EDY95696.1| hypothetical protein BACPLE_01972 [Bacteroides plebeius DSM 17135]
Length = 796
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 127/371 (34%), Positives = 180/371 (48%), Gaps = 22/371 (5%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQ+ NL+ L+ DVD L+ F K AG P + + W L GH GHYLSA A +
Sbjct: 48 AQELNLKVLMEYDVDRLLAPFLKEAGLPLKAEPFPNW----AGLDGHVGGHYLSAMAMNY 103
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ-------FDRFEALKPVWAPYY 233
A+T N +++M ++ L CQ G GY+ P+ + + E++ WAP+Y
Sbjct: 104 AATGNEECRKRMEYMLGELKRCQESNGDGYIGGVPNGKELWADIKNGKVESIWKYWAPWY 163
Query: 234 TIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMN 293
+HKI AGL D + + N +AL M + ++ + V S + L E GGM+
Sbjct: 164 NVHKIFAGLRDAWMYTGNKEALDMFLRLCDWGVS----VTEGLSDNQMEQMLANEFGGMD 219
Query: 294 DVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVT 353
++ Y IT K+L A F + D++ HANT IP VIG Q EV
Sbjct: 220 EIFADAYQITGKKKYLTTAKRFSHRWLFDSMVAHKDNLDNIHANTQIPKVIGYQRIAEVC 279
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL-GTENEESCTTYNMLKV 412
GD Y FF +IV A GG S E++S S + E ESC TYNMLK+
Sbjct: 280 GDNQYMDAADFFWNIVACKRSLALGGNSRREYFSSMDDFRSHVEDREGPESCNTYNMLKL 339
Query: 413 SRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRF 472
+ LFR T + VY D+YE+AL N +LS Q G + + ++ Y +
Sbjct: 340 TEGLFRMTGKAVYVDFYEKALYNHILSTQHPKHGGYVYFT------SARPAHYRVYSKPN 393
Query: 473 SSFWCCYGTGL 483
S+ WCC GTG+
Sbjct: 394 SAMWCCVGTGM 404
>gi|350267868|ref|YP_004879175.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus subtilis
subsp. spizizenii TU-B-10]
gi|349600755|gb|AEP88543.1| acetyl-CoA carboxylase, biotin carboxylase [Bacillus subtilis
subsp. spizizenii TU-B-10]
Length = 761
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 130/376 (34%), Positives = 191/376 (50%), Gaps = 23/376 (6%)
Query: 117 LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSAS 176
+ + +Q EYLL LDVD L+ + Y GWE E+ GH +GH+LSA+
Sbjct: 10 MFYDSQMKGKEYLLFLDVDRLLAPCYEAVLQTPKKPRYGGWE--AKEIAGHSIGHWLSAA 67
Query: 177 AHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD-------RFE--ALKP 227
+ M+ ++ + LK K V+ LS Q GY+S F FD R + +L
Sbjct: 68 SAMYQASGDEELKRKAEYAVNELSHIQQFDEEGYVSGFSRACFDEVFSGDFRVDHFSLGG 127
Query: 228 VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNE 287
W P+Y+IHK+ AGL+D Y N AL++ + ++ + + + + E+ L
Sbjct: 128 SWVPWYSIHKLFAGLIDTYRLTGNQTALRVVVKLADW----AKKGLDRLTDEQFQRMLIC 183
Query: 288 ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQ 347
E GGMN+ + L+ +T++ +L LA F L LA D++ G HANT IP VIG+
Sbjct: 184 EHGGMNEAMADLFMLTKNKAYLELAERFCHRAILQPLAEGKDELEGKHANTQIPKVIGAA 243
Query: 348 MRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTY 407
Y++TG+ Y+ FF + V YA GG S GE + + LG E+C TY
Sbjct: 244 KLYDITGNEAYRNAALFFWEQVVYQRSYAIGGNSIGEHFG--AEGSEELGVTTAETCNTY 301
Query: 408 NMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHG 467
NMLK++ HLFRW E + DYYE AL N +L+ Q + G+ Y + G K
Sbjct: 302 NMLKLTGHLFRWFHEARFMDYYENALYNHILASQ-DPDSGMKTYFVSTQPGHFKV----- 355
Query: 468 WGTRFSSFWCCYGTGL 483
+ + SFWCC GTG+
Sbjct: 356 YCSPEDSFWCCTGTGM 371
>gi|384109447|ref|ZP_10010323.1| hypothetical protein MSI_18910 [Treponema sp. JC4]
gi|383868978|gb|EID84601.1| hypothetical protein MSI_18910 [Treponema sp. JC4]
Length = 727
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 128/385 (33%), Positives = 194/385 (50%), Gaps = 27/385 (7%)
Query: 110 VKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFV 169
+ L+ SL ++Q+ LEY+L + D ++ + G Y GWE+ +++GH +
Sbjct: 6 INLEKDSLFEKSQRLGLEYVLEYEPDRMLAPCYRALGKNPCAINYGGWENR--QIQGHML 63
Query: 170 GHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR-------F 222
GHYLSA + + T KEK+ + + E Q K GY PS+ FD+ F
Sbjct: 64 GHYLSALSGFYYQTGKQDAKEKLDYTIDLIKELQRK--DGYFGGIPSDSFDKVFYSGGNF 121
Query: 223 E----ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSV 278
E +L W P+Y+IHKI AGL+D Y + N AL++ M ++ N +N ++ S+
Sbjct: 122 EVERFSLAGWWVPWYSIHKIYAGLIDAYVYGGNEDALQIVFKMADWAINGTKN-LSDSSI 180
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT 338
++ L E GGM V LY IT + K+L A + + + + D + G+HANT
Sbjct: 181 QKM---LTCEHGGMCKVFADLYGITGNKKYLSEAERWIHHEIIDPASKKEDKLQGYHANT 237
Query: 339 HIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT 398
IP IG YE+TG Y+ FF + V + YA GG S GE + + L
Sbjct: 238 QIPKFIGIARLYELTGKSEYRTAAEFFFETVTKNRSYAIGGNSKGEHFG--REFEEPLMR 295
Query: 399 ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 458
+ E+C TYNML+++ H+F W K AD+YE AL N +L+ Q + G Y + + +G
Sbjct: 296 DTCETCNTYNMLELAEHIFAWNKTSDIADFYENALYNHILASQ-DPQTGAKTYFVSMQQG 354
Query: 459 DSKAKSYHGWGTRFSSFWCCYGTGL 483
K H ++ WCC GTGL
Sbjct: 355 FHKVYCSHD-----NAMWCCTGTGL 374
>gi|290955577|ref|YP_003486759.1| hypothetical protein SCAB_10131 [Streptomyces scabiei 87.22]
gi|260645103|emb|CBG68189.1| putative secreted protein [Streptomyces scabiei 87.22]
Length = 786
Score = 204 bits (519), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 140/394 (35%), Positives = 191/394 (48%), Gaps = 27/394 (6%)
Query: 107 LHDVKLDPSSLHWRAQQTNLE-YLLMLDVDSLVWSFQKTAGSPTAGKAYEG-WEDPTCEL 164
L V+L S W Q + YL +DVD L+++F+ T T G G W+ P
Sbjct: 71 LGQVRLTAS--RWLDNQNRTQNYLRFIDVDRLLYNFRATHKLSTNGATPNGGWDAPNFGF 128
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQF 219
R H GH+L+A A ++A T + T ++K T +V+ L++CQ +GYLS +P F
Sbjct: 129 RTHIQGHFLTAWAQLYAVTGDTTCRDKATRMVAELAKCQANNSAAGFNTGYLSGYPESNF 188
Query: 220 DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITK 275
E YYTIHK L GLLD + +TQA L + W V++ R+
Sbjct: 189 TALEQGTSGEVLYYTIHKTLTGLLDVWRLIGSTQARDVLLALAGW-VDWRTGRLTG---- 243
Query: 276 YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
++ L E GGMN VL LY T D + L +A FD LA D ++G H
Sbjct: 244 ---QQMQTMLRIEFGGMNTVLTDLYQQTGDARWLTVAQRFDHAAVFDPLAANQDKLNGLH 300
Query: 336 ANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAST 395
ANT +P IG+ Y+ TG Y+ T +I A+H YA GG S E + P +A
Sbjct: 301 ANTQVPKWIGAAREYKATGTTRYRDIATNAWNITVAAHTYAIGGNSQAEHFRAPNAIAGF 360
Query: 396 LGTENEESCTTYNMLKVSRHLFRWTKEMVYA-DYYERALTNGVLSIQR-GTEPGVMIYML 453
L + ESC T NML ++R L+ + V DYYERA N ++ Q + G + Y
Sbjct: 361 LNNDTCESCNTVNMLTLTRELYTLDPDRVELFDYYERAWLNQMIGQQNPADDHGHVTYFT 420
Query: 454 PLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PL RG A W T + SFWCC GTGL
Sbjct: 421 PLKPGGRRGVGPALGGGTWSTDYGSFWCCQGTGL 454
>gi|218198541|gb|EEC80968.1| hypothetical protein OsI_23691 [Oryza sativa Indica Group]
Length = 759
Score = 204 bits (519), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 129/201 (64%), Gaps = 7/201 (3%)
Query: 44 NETWKKEVYSHYHLTPTDDSAWSNLLPRKMLSETDEFSWTMIYRKMKNPDGFKLAGD--- 100
N+T + HL +++ W LLPR+ DE W +YR + G + G+
Sbjct: 45 NDTQGRHSDGLPHLNQAEEATWMGLLPRRA-GPRDELDWLALYRSITR-GGGDVGGEPAG 102
Query: 101 FLKEVSLHDVKLDP--SSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWE 158
FL SLHDV++DP ++++W+ QQTNLEYLL LD D L W+F++ A PT G+ Y GWE
Sbjct: 103 FLSPASLHDVRVDPYGANMYWQGQQTNLEYLLYLDPDRLTWTFRQQAKLPTVGEPYGGWE 162
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
P +LRGHF GHYLSA+AHMWASTHN L+EKMT VV L CQ KM +GYLSA+P
Sbjct: 163 APDGQLRGHFTGHYLSAAAHMWASTHNDALREKMTKVVDILYSCQKKMNTGYLSAYPESM 222
Query: 219 FDRFEALKPVWAPYYTIHKIL 239
FD ++ L W+PYYTIHK +
Sbjct: 223 FDAYDELAEAWSPYYTIHKFI 243
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 74/108 (68%), Gaps = 12/108 (11%)
Query: 388 DPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP 446
DPKRL + + NEE+C TYN+LKVSR+LFRWTKE Y D+YER L NG++ QRG EP
Sbjct: 249 DPKRLVDEIKISSNEETCATYNLLKVSRNLFRWTKEGKYTDHYERLLINGIMGNQRGKEP 308
Query: 447 GVMIYMLPLGRGDSKA-----------KSYHGWGTRFSSFWCCYGTGL 483
GVMIY LP+G G SK+ K+ GWG ++FWCCYGTG+
Sbjct: 309 GVMIYFLPMGPGRSKSISGMPTSGLPPKNPGGWGNANATFWCCYGTGI 356
>gi|408527846|emb|CCK26020.1| secreted protein [Streptomyces davawensis JCM 4913]
Length = 731
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 143/409 (34%), Positives = 198/409 (48%), Gaps = 27/409 (6%)
Query: 92 PDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNL-EYLLMLDVDSLVWSFQKTAGSPTA 150
P AG + +L V+L ++ W Q YL +DVD L+++F+ T
Sbjct: 2 PAASAEAGVLAQPFALGQVRL--TAGRWLDNQNRTGNYLRFVDVDRLLYNFRANHKLSTN 59
Query: 151 GKAYEG-WEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS- 208
G A G W+ P R H GH+L+A A ++A T + T ++K T +V+ L++CQ +
Sbjct: 60 GAAANGGWDAPDFPFRTHIQGHFLTAWAQLYAVTGDTTCRDKATYMVAELAKCQANNSAA 119
Query: 209 ----GYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKW 260
GYLS +P F E YYTIHK LAGLLD + +TQA L + W
Sbjct: 120 GFSPGYLSGYPEANFTALEQGTKGDVLYYTIHKTLAGLLDVWRHIGSTQARDVLLALAGW 179
Query: 261 MVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCF 320
V++ R+ + E+ N L E GGMN VL L+ T D + L +A FD
Sbjct: 180 -VDWRTGRL-------TSEQMQNMLRIEFGGMNAVLTDLHVRTGDARWLAVAQRFDHAAV 231
Query: 321 LGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGT 380
LA D ++G HANT +P IG+ Y+ TG Y+ T +I SH YA GG
Sbjct: 232 FDPLAANQDKLNGLHANTQVPKWIGAAREYKATGTTRYRDIATNAWNITLDSHTYAIGGN 291
Query: 381 SAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYA-DYYERALTNGVLS 439
S E + P +A L + ESC T+NML ++R LF + DYYERA N ++
Sbjct: 292 SQAEHFRAPHAIAGFLNKDTCESCNTFNMLVLTRELFELDPDRAALFDYYERAWLNQMIG 351
Query: 440 IQR-GTEPGVMIYMLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Q + G + Y PL RG A W T + +FWCC GTGL
Sbjct: 352 QQNPADDHGHVTYFTPLNPGGRRGVGPAWGGGTWSTDYGTFWCCQGTGL 400
>gi|386837867|ref|YP_006242925.1| hypothetical protein SHJG_1777 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374098168|gb|AEY87052.1| hypothetical protein SHJG_1777 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451791159|gb|AGF61208.1| hypothetical protein SHJGH_1542 [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 769
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 132/376 (35%), Positives = 188/376 (50%), Gaps = 18/376 (4%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCELRGHFVGHYLSASAHMW 180
Q YL +DVD L+++F+ T G A GW+ PT R H GH+L+A A ++
Sbjct: 66 QDRAAAYLRFVDVDRLLYNFRANHRLSTGGASATGGWDAPTFPFRSHVQGHFLTAWAQLY 125
Query: 181 ASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQFDRFEA--LKPVWAPYY 233
A T + ++K +V+ L++CQ G+GYLS +P F EA L+ PYY
Sbjct: 126 AVTGDAVARDKALYMVAELAKCQANNGAAGFGAGYLSGYPESDFTALEAGTLRNGNVPYY 185
Query: 234 TIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMN 293
T+HK ++GLLD + +TQA + + + R + T + L E GGMN
Sbjct: 186 TVHKTMSGLLDVWRHLGSTQARDVLLALAGWVDARTGRLTTA----QMQAVLGTEFGGMN 241
Query: 294 DVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVT 353
VL LY T D + L +A FD LA D ++G HANT +P IG+ Y+ T
Sbjct: 242 AVLADLYQQTGDARWLTVAQRFDHAAVFDPLAANQDALAGLHANTQVPKWIGAVRAYKAT 301
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVS 413
G Y+ T + SH YA GG S E + P +A+ L + ESC + NML ++
Sbjct: 302 GITRYRDIATNAWNHCVGSHTYAIGGNSQAEHFRAPNAIAAYLADDTCESCNSVNMLTLT 361
Query: 414 RHLFRWTKEMVYA-DYYERALTNGVLSIQRGTEP-GVMIYMLPL----GRGDSKAKSYHG 467
R LF T + V DYYE+A N ++ Q +P G + Y PL RG A
Sbjct: 362 RELFTLTPDRVALFDYYEQAWLNHIIGNQNPADPHGHITYFTPLRPGGRRGVGPAWGGGT 421
Query: 468 WGTRFSSFWCCYGTGL 483
W T +++FWCC GTG+
Sbjct: 422 WSTDYTTFWCCQGTGV 437
>gi|116182754|ref|XP_001221226.1| hypothetical protein CHGG_02005 [Chaetomium globosum CBS 148.51]
gi|88186302|gb|EAQ93770.1| hypothetical protein CHGG_02005 [Chaetomium globosum CBS 148.51]
Length = 797
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 131/376 (34%), Positives = 187/376 (49%), Gaps = 18/376 (4%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCELRGHFVGHYLSASAHMW 180
Q + YL +DV+ L+++F+ T G A GW+ P R H GHYL+A A +
Sbjct: 48 QNRTVSYLKWVDVNRLLYNFRANHRLSTQGASANGGWDAPNFPFRTHAQGHYLTAWAFCY 107
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGS-----GYLSAFPSEQFDRFEA--LKPVWAPYY 233
AS + +++ V+ L++CQ G+ GYLS FP +F EA L PYY
Sbjct: 108 ASLRDTECRDRAAYFVAELAKCQKNNGAAGFSAGYLSGFPESEFAALEARTLNNGNVPYY 167
Query: 234 TIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMN 293
IHK +AGLLD + +T A + + + +R K S ++ + L E GGMN
Sbjct: 168 AIHKTMAGLLDVWRHLGDTNARDVLLALAGWVDSRT----GKLSYQQMQSMLGTEFGGMN 223
Query: 294 DVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVT 353
DVL L+ T+D + L +A FD LA D ++G HANT +P IG+ + Y+ T
Sbjct: 224 DVLADLHKQTKDERWLKVAQRFDHAAVFDPLAAGRDQLNGLHANTQVPKWIGAALEYKAT 283
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVS 413
G Y+ ++ +H YA GG S E + P +A L + E+C TYNML+++
Sbjct: 284 GSTRYRDIAKNAWELTVGAHTYAIGGNSQAEHFRPPNAIAGYLQKDTAEACNTYNMLRLT 343
Query: 414 RHLFRW-TKEMVYADYYERALTNGVLSIQR-GTEPGVMIYMLPLG----RGDSKAKSYHG 467
R L+ Y D+YERAL N +L Q + G + Y PL RG A
Sbjct: 344 RELWPLDAASTAYFDFYERALLNHLLGQQDPASHHGHVTYFTPLNPGGRRGVGPAWGGGT 403
Query: 468 WGTRFSSFWCCYGTGL 483
W T + SFWCC GT L
Sbjct: 404 WSTDYDSFWCCQGTAL 419
>gi|159491178|ref|XP_001703550.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280474|gb|EDP06232.1| predicted protein [Chlamydomonas reinhardtii]
Length = 226
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 107/197 (54%), Positives = 136/197 (69%), Gaps = 4/197 (2%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLL-MLDVDSLVWSFQKTAGSPTAGKAY-EGWED 159
++ + L DV+L ++L R ++ N +YLL ML+ D L+WSF+KT+G PT G Y WED
Sbjct: 28 IEPLPLSDVRLLDTALQARYEKLNAKYLLDMLEPDRLLWSFRKTSGLPTPGTPYIASWED 87
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF 219
P CELRGHFVGHYLSA + A T N K ++ +VS L + Q K+G+GYLSAFP+E F
Sbjct: 88 PGCELRGHFVGHYLSALSLALAGTGNSAFKTRLDLMVSELGKVQEKLGTGYLSAFPTEFF 147
Query: 220 DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVE 279
DR EALKPVWAPYYTIHKI+AGL+D + A + AL M MV+Y +NR Q VI E
Sbjct: 148 DRVEALKPVWAPYYTIHKIIAGLVDAHELAGHPSALAMATRMVDYHWNRTQAVIAAKGRE 207
Query: 280 RHWNS-LNEETGGMNDV 295
HWN+ LN E GGMN+V
Sbjct: 208 -HWNAVLNCEFGGMNEV 223
>gi|392964292|ref|ZP_10329713.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
gi|387847187|emb|CCH51757.1| protein of unknown function DUF1680 [Fibrisoma limi BUZ 3]
Length = 739
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 131/398 (32%), Positives = 193/398 (48%), Gaps = 33/398 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ +L +V+L +AQ +L+Y+L L+ D L+ + AG P + Y WE +
Sbjct: 1 MQPFTLQEVRLTSGPFK-QAQDVDLKYILALNPDKLLAPYLIDAGLPLKAQRYGNWE--S 57
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF-- 219
L GH GHYLSA A M+AST LK+++ ++ L+ CQ K G+GY+ P +
Sbjct: 58 VGLDGHIGGHYLSALAMMYASTGEPELKKRLDYMIGELARCQAKNGNGYVGGIPQGKVFW 117
Query: 220 DRFE---------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFY 266
DR L W P Y IHK+ AGL D Y +A N QA + + W VE
Sbjct: 118 DRIHKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDAYAYAGNGQAKQVLIGLGDWFVE--- 174
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
+I S E+ L E GG+N+ LY +T D K+L A L L
Sbjct: 175 -----LIKPLSDEQIQQVLRTEHGGINETFADLYILTNDKKYLETAQRLSHRALLYPLLE 229
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
Q D ++G HANT IP VIG + +TG + +F V+ + A GG S E +
Sbjct: 230 QQDKLTGLHANTQIPKVIGFEKIATLTGKTDWSEAAMYFWRNVSQTRSVAFGGNSVREHF 289
Query: 387 SDPKRLASTL-GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
+ + L + E+C ++NML++S+ LF ++ Y D+YER L N +LS Q E
Sbjct: 290 NPTTDFSQVLRSNQGPETCNSFNMLRLSKALFLDKNDVSYLDFYERTLYNHILSSQH-PE 348
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P+ + Y + +S WCC G+GL
Sbjct: 349 KGGFVYFTPI-----RPNHYRVYSQSETSMWCCVGSGL 381
>gi|399030291|ref|ZP_10730797.1| hypothetical protein PMI10_02670 [Flavobacterium sp. CF136]
gi|398071797|gb|EJL63044.1| hypothetical protein PMI10_02670 [Flavobacterium sp. CF136]
Length = 771
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 129/398 (32%), Positives = 195/398 (48%), Gaps = 33/398 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++E L ++KL AQ +L+YLL L+ D L+ + +AG PT Y WE+
Sbjct: 34 MQEFKLQEIKLTSGPFK-NAQNVDLKYLLDLNPDRLLAPYLISAGIPTKADRYGNWEN-- 90
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF-- 219
L GH GHYL+A + M+AST N +K ++ ++S L+ CQ K G+GY+ P +
Sbjct: 91 IGLDGHIGGHYLAALSMMYASTGNKEIKSRLDYMISELALCQEKDGTGYVGGIPEGKVFW 150
Query: 220 DRFE---------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFY 266
DR L W P Y IHK+ AGL+D Y + N +A +K+ W +E
Sbjct: 151 DRIHKGDIDGSGFGLNNTWVPIYNIHKLFAGLIDAYNYTGNEKAKEIVIKLGDWFIE--- 207
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
+I S E+ L E GG+N+ LY+IT++ K+L A + L L
Sbjct: 208 -----LIRPLSDEQIQKILKTEHGGINESFADLYSITKNKKYLETAEKLSQKAILDPLIK 262
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
+ D ++G HANT IP VIG + +++ + + FF V A GG S E +
Sbjct: 263 KEDKLTGLHANTQIPKVIGFEKIGKLSDNKQWSDAAQFFWMNVTEKRTVAFGGNSVAEHF 322
Query: 387 SDPKRLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
+ + L + + E+C +YNM ++S+ LF + Y D+YER L N +LS Q
Sbjct: 323 NPINDFSGMLKSNQGPETCNSYNMERLSKALFLDKNNVSYLDFYERTLYNHILSSQEPNR 382
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P+ + Y + +S WCC GTGL
Sbjct: 383 GG-FVYFTPI-----RPNHYRVYSQPETSMWCCVGTGL 414
>gi|325679069|ref|ZP_08158663.1| hypothetical protein CUS_6624 [Ruminococcus albus 8]
gi|324109193|gb|EGC03415.1| hypothetical protein CUS_6624 [Ruminococcus albus 8]
Length = 791
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 137/387 (35%), Positives = 186/387 (48%), Gaps = 41/387 (10%)
Query: 126 LEYLLMLDVDSLVWSFQKTAGSPTAGKA-YEGWEDPTCELRGHFVGHYLSASAHMWAS-- 182
+ YLL D D L+ F++TAG G Y GWED + GH VGHY++A A +AS
Sbjct: 29 IAYLLSFDTDRLLAGFRETAGLDMRGAVRYSGWEDDL--IGGHCVGHYMTAVAQAYASLQ 86
Query: 183 ---THNVTLKEKMTAVVSALSECQNKMGSGYLSAFP-------SEQFDRFEA-----LKP 227
+ L + L ECQ +G+G++ QFD E +
Sbjct: 87 EGDSRRDALYKLAVTTTDGLKECQQALGTGFIFGAKIIDKNNVEAQFDNVEKNLSNIMTQ 146
Query: 228 VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNE 287
W PYYT+HKILAG +D Y A + + ++ Y RV +++S E L
Sbjct: 147 AWVPYYTLHKILAGAIDIYRLTGYENAKTVASRLGDWVYRRV----SRWSEETQRTVLGI 202
Query: 288 ETGGMNDVLYRLYTITQDPKHLLLAHLFDK-PCFLGLLAVQADDISGFHANTHIPVVIGS 346
E GGMND LY LY +T +H + AH FD+ P F + A + ++ HANT IP +G+
Sbjct: 203 EYGGMNDCLYELYAVTGKEEHAIAAHCFDEVPLFENVYAGTENALNNKHANTTIPKFLGA 262
Query: 347 QMRYE------VTGDPL----YKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL 396
RY V G+ + Y F D+V H Y TGG S E + L +
Sbjct: 263 LKRYAILDGRTVNGETVDAGRYLGYAERFWDMVVQKHSYITGGNSEWEHFGCDYVLDAER 322
Query: 397 GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLG 456
N E+C TYNMLK+SR LF T E YADYYE N +LS Q E G+ Y P+
Sbjct: 323 TNANCETCNTYNMLKLSRLLFEITGEKKYADYYENTFINAILSSQN-PETGMSTYFQPMA 381
Query: 457 RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G K + T ++ FWCC G+G+
Sbjct: 382 SGYFKV-----YSTPYTKFWCCTGSGM 403
>gi|117920524|ref|YP_869716.1| acetyl-CoA carboxylase, biotin carboxylase [Shewanella sp. ANA-3]
gi|117612856|gb|ABK48310.1| acetyl-CoA carboxylase, biotin carboxylase [Shewanella sp. ANA-3]
Length = 795
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 129/396 (32%), Positives = 203/396 (51%), Gaps = 29/396 (7%)
Query: 102 LKEVSLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
L + L+DV+L LH AQQT+L Y++ +D + L+ ++K AG T Y WE+
Sbjct: 28 LTPIPLNDVRLTAGPFLH--AQQTDLAYIMSMDPERLLAPYRKAAGIATTADNYPNWEN- 84
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE--- 217
L GH GHYLSA A M+A+T + + ++ +V+ L +CQ G+GY+ P
Sbjct: 85 -TGLDGHIGGHYLSALALMYAATGDQAVLSRLNYMVAELEKCQQAHGNGYVGGVPHGDKL 143
Query: 218 ---------QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNR 268
+ D F L W P+Y +HK+ AGL D Y + N A KM ++ +
Sbjct: 144 WQQVAAGHIEADLF-TLNQSWVPWYNVHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDL 202
Query: 269 VQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQA 328
+N+ S E+ L E GG+N+ L +Y+IT K+L LA+ + L L
Sbjct: 203 SRNL----SDEQLQLMLRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQHQ 258
Query: 329 DDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSD 388
D ++G HANT IP ++G E++ + + + +F V + GG S E++
Sbjct: 259 DKLTGLHANTQIPKIVGVARIAELSNNKEWLESADYFWQQVVHQRTVSIGGNSVREYFHP 318
Query: 389 PKRLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPG 447
+ +S L + E E+C TYNMLK+S+ L+ +++ Y DYYERAL N +LS Q + G
Sbjct: 319 SEDFSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-PQTG 377
Query: 448 VMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
++Y P+ + Y + + S WCC G+G+
Sbjct: 378 GLVYFTPM-----RPDHYRVYSSAQESMWCCVGSGI 408
>gi|302670053|ref|YP_003830013.1| glycoside hydrolase [Butyrivibrio proteoclasticus B316]
gi|302394526|gb|ADL33431.1| glycoside hydrolase [Butyrivibrio proteoclasticus B316]
Length = 780
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 145/409 (35%), Positives = 202/409 (49%), Gaps = 43/409 (10%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
LKE L V ++ A ++ YL LD + L+ F + AG Y GWE+
Sbjct: 1 MLKEFDLTQVCVNDEYCA-NALNKDVAYLKSLDPERLLAGFYENAGLTPKKIRYSGWENM 59
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEK-----MTAVVSALSECQNK--------MG 207
+ GH +GHYL+A+A +A+ +K + +V L ECQ G
Sbjct: 60 L--IGGHTLGHYLTAAAQGYANPGTRKEDKKALFDIIKTLVDGLLECQEHSQGKKGFVFG 117
Query: 208 SGYLSAFPSE-QFDRFE-----ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
+ + + E QFD E + W P+YT+HKIL GL+ + F ALK+ + +
Sbjct: 118 AIIMDSNNVELQFDHVEHGRTNIITESWVPWYTMHKILDGLVSTFVFTGYEPALKVAEGI 177
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
++ YNR + +S E H L+ E GGMND LY+LY +T +HL AH FD+
Sbjct: 178 GDWTYNRA----SGWSEETHKTVLSIEYGGMNDALYKLYRLTGKKEHLEAAHAFDEEELF 233
Query: 322 GLLAV-QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTF------FMDIVNASHG 374
+A A+ ++ HANT IP +G+ RY GD V G + F D+V H
Sbjct: 234 KKVATGDANVLNNRHANTTIPKFLGALQRYMTLGD----VAGEYLTYVQKFWDMVVERHT 289
Query: 375 YATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALT 434
YATGG S E + + L + N E+C TYNMLK+SR LFR T + YADYYE
Sbjct: 290 YATGGNSEWEHFGEDFVLDAERTNCNNETCNTYNMLKMSRDLFRITGDKKYADYYENTFI 349
Query: 435 NGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
N +LS Q E G+ +Y P+ G Y +GT F FWCC GTG+
Sbjct: 350 NAILSSQN-PESGMTMYFQPMATG-----YYKVYGTPFDKFWCCTGTGM 392
>gi|328956144|ref|YP_004373477.1| hypothetical protein Corgl_1563 [Coriobacterium glomerans PW2]
gi|328456468|gb|AEB07662.1| protein of unknown function DUF1680 [Coriobacterium glomerans PW2]
Length = 751
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 140/393 (35%), Positives = 200/393 (50%), Gaps = 20/393 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYE-GWEDP 160
++ ++L V+L P + AQQ L +L +D D ++ +F++ A T G GW+ P
Sbjct: 182 MRPINLTCVRLAPGTPAAAAQQRRLSFLKQVDDDQMLINFRRAAHMDTKGAPEMIGWDTP 241
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNK------MGSGYLSAF 214
LRGH GHYLSA A WA+T + T+ K++ +V +L E Q + G+LSA+
Sbjct: 242 DSNLRGHTTGHYLSALALAWAATGDETVHSKLSYMVHSLGEVQAAFRGQPGIHEGFLSAY 301
Query: 215 PSEQFDRFEALKP---VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQN 271
QFD E P +WAPYYT+HKILAGLLD Y +A N QAL++ + + YNR+
Sbjct: 302 DESQFDLLERYTPYPEIWAPYYTLHKILAGLLDSYRYAGNRQALEIAIGVGHWVYNRLSQ 361
Query: 272 VITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+ +++ W + E GGMN+ L L IT + + A FD + + D
Sbjct: 362 -LDPIQLKKMWAMYIAGEFGGMNESLAMLGAITGEESFVKAARFFDNDKLIFPALQKVDA 420
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
+ HAN HIP VIG+ Y VT + Y FF V A H YA GGT GE + P
Sbjct: 421 LGTLHANQHIPQVIGALSLYGVTHEESYYQVAEFFWHSVVAHHIYAFGGTGDGEMFQQPC 480
Query: 391 RLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+A+ + + ESC +YNM+K++R L+ + Y E L N +LS G
Sbjct: 481 EIAAKIDEFSAESCASYNMIKLTRDLYEYEPTADKMAYCENVLINHILSSTDHEGTGGST 540
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y + G K G+ T S CC+GTGL
Sbjct: 541 YFMETQPGARK-----GFDTENS---CCHGTGL 565
>gi|402080566|gb|EJT75711.1| hypothetical protein GGTG_05643 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 640
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 131/383 (34%), Positives = 188/383 (49%), Gaps = 32/383 (8%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCELRGHFVGHYLSASAHMW 180
Q L Y+ ++VD L+++F+ T G ++ +GW+ P R HF GH+L+A A +
Sbjct: 67 QDRALTYIKSVNVDRLLYNFRANHRVSTNGAQSNKGWDAPDFPFRTHFQGHFLTAWAQCY 126
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGS-----GYLSAFPSEQFDRFE--ALKPVWAPYY 233
A+ + T ++ V+ L++CQN + GYLS FP + D+ E L PYY
Sbjct: 127 ATLGDATCRDHANYFVAELAKCQNNNAAAGFKAGYLSGFPESEIDKVEQRTLSNGNVPYY 186
Query: 234 TIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
IHK +AGLLD + +TQA L+M W V S ++ N L E
Sbjct: 187 AIHKTMAGLLDVWRVMGSTQARDVLLRMAGW--------VDTRTAALSYQQMQNMLGTEF 238
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GGMN+VL ++ T D + + A FD LA D +SG HANT +P IG+
Sbjct: 239 GGMNEVLADVFHQTGDARWIKTARRFDHAAVFDPLAQGQDRLSGLHANTQVPKWIGAARE 298
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
Y+ T + Y+ + A+H YA GG S E + P +A L + E+C +YNM
Sbjct: 299 YKATKEERYRTVARAAWNFTVAAHTYAIGGNSQSEHFRSPNAIAGYLAKDTAEACNSYNM 358
Query: 410 LKVSRHLFRWTKE---MVYADYYERALTNGVLSIQRGTEP-GVMIYMLPLGRGDSKAKSY 465
LK++R L W + Y D+YERAL N +L Q G + Y PL G +
Sbjct: 359 LKLTREL--WLADPSAAAYFDFYERALLNHMLGQQDPRSAHGHVTYFTPLNPGGRRGVG- 415
Query: 466 HGWG-----TRFSSFWCCYGTGL 483
WG T + SFWCC GTG+
Sbjct: 416 PAWGGGTYSTDYDSFWCCQGTGI 438
>gi|300777572|ref|ZP_07087430.1| acetyl-CoA carboxylase [Chryseobacterium gleum ATCC 35910]
gi|300503082|gb|EFK34222.1| acetyl-CoA carboxylase [Chryseobacterium gleum ATCC 35910]
Length = 791
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 129/393 (32%), Positives = 194/393 (49%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L V+L S+ +A + + +YL+ L+ D L+ + K AG Y WE+ L G
Sbjct: 29 LETVRLS-ESVFSKAMKADHKYLMALEPDRLLAPYLKEAGLKPKANNYPNWEN--TGLDG 85
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFE--- 223
H GHY+SA + M+AST + ++E++ ++S L CQ GY+S P+ + E
Sbjct: 86 HIGGHYISALSLMYASTGDKAIQERINYMISELERCQKASPDGYISGIPNGKKIWKEIKQ 145
Query: 224 --------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
L W P Y IHK+ +GL D Y +A N +A +K+T WM N
Sbjct: 146 GNIRASGFGLNDRWVPLYNIHKLYSGLRDAYWYAKNEKAKAMLIKLTDWMA--------N 197
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
++ S E+ + L E GG+N+V +Y IT D K+L LAH F L L D +
Sbjct: 198 EVSNLSDEQIQDMLRSEHGGLNEVFADVYEITHDQKYLKLAHRFSHQAILSPLLTGEDKL 257
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ + + FF V GG S E ++
Sbjct: 258 TGLHANTQIPKVIGYKRIADLENNTSWSNAADFFWHNVTEKRSSVIGGNSVSEHFNPVND 317
Query: 392 LASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S + + E E+C TYNMLK+++ L+ E Y DYYE+AL N +LS + + G +
Sbjct: 318 FSSMIKSIEGPETCNTYNMLKLTKELYATLPESYYIDYYEKALYNHILSTE-NHDHGGFV 376
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ G Y + +SFWCC G+G+
Sbjct: 377 YFTPMRPG-----HYRVYSQPQTSFWCCVGSGI 404
>gi|336321977|ref|YP_004601945.1| hypothetical protein Celgi_2884 [[Cellvibrio] gilvus ATCC 13127]
gi|336105558|gb|AEI13377.1| protein of unknown function DUF1680 [[Cellvibrio] gilvus ATCC
13127]
Length = 781
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 142/423 (33%), Positives = 199/423 (47%), Gaps = 54/423 (12%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWED- 159
L EVSL + S+ RAQQ ++ VD ++ F++ A G A GWE+
Sbjct: 91 LTEVSLGE------SVFTRAQQQMVDLARAYPVDRVLVVFRRNANLDVRGASAPGGWEEL 144
Query: 160 -PTCE---------------------LRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVS 197
P + LRGH+ GH+LS A +A+T + + +K+ V
Sbjct: 145 GPAPDEQRWGPAEYVRGQNTRGAGGLLRGHYGGHFLSMLAMAYATTGDQAILDKVDDFVD 204
Query: 198 ALSECQNKMGS-------GYLSAFPSEQFDRFEALKP---VWAPYYTIHKILAGLLDQYT 247
L EC+ + + G+L+A+ QF EA P +WAP+YT HKILAGL+D Y
Sbjct: 205 GLEECRAALAATGKYSHPGFLAAYGEWQFSALEAYAPYGEIWAPWYTCHKILAGLIDAYR 264
Query: 248 FADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDP 306
+ + AL++ + + + + R+ + T +ER W + E GGMND L LYT++
Sbjct: 265 YTGSALALQLAEGLGRWTHARL-SACTPEQLERMWGIYIGGEAGGMNDALVDLYTLSAAA 323
Query: 307 KH---LLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGT 363
L A LFD + A D ++G HAN HIP +G TGD Y
Sbjct: 324 DRDDFLAAAALFDLRSLVTACAQDRDTLNGKHANMHIPTFVGYAKLGAWTGDATYTAATR 383
Query: 364 FFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEM 423
F ++ YA GGT GE W +A +G N ESC YNMLKV+R LF ++
Sbjct: 384 NFFGMIVPGRMYAHGGTGEGEMWGPANTVAGDIGPRNAESCAAYNMLKVARTLFFEQQDP 443
Query: 424 VYADYYERALTNGVLSIQR---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYG 480
Y DYYER + N +L +R T +YM P+G G K GT CC G
Sbjct: 444 AYMDYYERTVLNHILGGKRDQASTTSPQNLYMFPVGPGARKEYGNGNIGT------CCGG 497
Query: 481 TGL 483
TGL
Sbjct: 498 TGL 500
>gi|418466296|ref|ZP_13037222.1| secreted protein [Streptomyces coelicoflavus ZG0656]
gi|371553101|gb|EHN80323.1| secreted protein [Streptomyces coelicoflavus ZG0656]
Length = 773
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 138/392 (35%), Positives = 192/392 (48%), Gaps = 21/392 (5%)
Query: 107 LHDVKLDPSSLHWRAQQT-NLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG-WEDPTCEL 164
L V+L PS W Q+ L YL +DVD L+ +F+ T G A G WE P
Sbjct: 54 LGAVRLTPS--RWLDNQSRTLSYLRFVDVDRLLHNFRANHRLSTNGAAATGGWEAPDFPF 111
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQF 219
R H GH+L+A A +A T + ++K +V+ L++CQ G+GYLS +P F
Sbjct: 112 RSHVQGHFLTAWAQAYAVTGDTACRDKALYMVAELAKCQANNGAAGFGTGYLSGYPESDF 171
Query: 220 DRFEA--LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYS 277
E+ L PYYTIHK LAGLL+ + +T+A + + + R + S
Sbjct: 172 AALESGTLNNGNVPYYTIHKTLAGLLEVWRLLGSTRARDVLLALAGWVDRRTG----RLS 227
Query: 278 VERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHAN 337
R L E GGMN VL L T D + L +A FD LA D ++G HAN
Sbjct: 228 TTRMQAVLGTEFGGMNAVLTDLCQQTGDTRWLAVAQRFDHAAVFDPLAANQDRLAGLHAN 287
Query: 338 THIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG 397
T +P IG+ Y+ TG Y+ T ++ +H YA GG S E + P +A+ L
Sbjct: 288 TQVPKWIGAVREYKATGSTRYRDIATNAWNMCVTTHTYAVGGNSQAEHFRPPNAIAAHLA 347
Query: 398 TENEESCTTYNMLKVSRHLFRWTKEMVYA-DYYERALTNGVLSIQRGTEP-GVMIYMLPL 455
+ ESC T NML ++R LF + + DYYE+A N ++ Q +P G + Y PL
Sbjct: 348 NDTCESCNTVNMLGLTRELFALSPDRAELFDYYEQAWLNHMIGQQNPADPHGHVTYFTPL 407
Query: 456 G----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
RG A W T +++FWCC GTGL
Sbjct: 408 KPGGRRGVGPAWGGGTWSTDYTTFWCCQGTGL 439
>gi|16126789|ref|NP_421353.1| hypothetical protein CC_2550 [Caulobacter crescentus CB15]
gi|221235569|ref|YP_002518006.1| membrane-bound glycosyl hydrolase [Caulobacter crescentus NA1000]
gi|13424115|gb|AAK24521.1| conserved hypothetical protein [Caulobacter crescentus CB15]
gi|220964742|gb|ACL96098.1| membrane-bound glycosyl hydrolase [Caulobacter crescentus NA1000]
Length = 786
Score = 201 bits (512), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 135/410 (32%), Positives = 193/410 (47%), Gaps = 33/410 (8%)
Query: 94 GFKLAGDFLK-EVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK 152
G ++ G L V V L PS + +AQ N YL+ L D L+ +F AG P
Sbjct: 37 GAEVGGRVLATPVPARHVTLKPS-IFAQAQGANRAYLVSLQPDRLLHNFHLGAGLPVKAP 95
Query: 153 AYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLS 212
Y GWE + GH +GHYLSA A A+ + L +++ V+ L+ Q G GY+
Sbjct: 96 VYGGWE--AQSIAGHTLGHYLSACALQVANDGDPVLSQRLAYTVAQLARVQAAHGDGYVG 153
Query: 213 -------AFPSEQFDRFEALK------------PVWAPYYTIHKILAGLLDQYTFADNTQ 253
A P FE L+ W P YT HKI AGLLD + A
Sbjct: 154 GTTRWGQADPVGGKAVFEELRRGDIRANRFSLNDGWVPIYTWHKIHAGLLDAHRLAATPG 213
Query: 254 ALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAH 313
AL + + Y ++ + ++ L E GG+ + Y +T DP+ L +A
Sbjct: 214 ALDVALGLAGYL----ATILEGLNDDQVQAILVAEHGGLCEAYAETYALTGDPRWLNIAR 269
Query: 314 LFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASH 373
+ LA D+++G HANT IP +IG YEV GDP T FF V H
Sbjct: 270 RLRHRELVDPLAQGRDELAGLHANTQIPKIIGLARLYEVAGDPAEARTARFFHQTVTRRH 329
Query: 374 GYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERAL 433
YA GG S E + P +A+ L E+C +YNMLK++R L+ W + D YERA
Sbjct: 330 SYAIGGNSDREHFGPPDAIATRLSETTCEACNSYNMLKLTRRLWSWAPDGALFDDYERAQ 389
Query: 434 TNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
N +++ QR ++ G+ +Y +P+ G ++ S T SFWCC G+G+
Sbjct: 390 LNHIMAHQRPSD-GMFVYFMPMAAGGRRSYS-----TPEDSFWCCVGSGM 433
>gi|445497812|ref|ZP_21464667.1| hypothetical protein Jab_2c14210 [Janthinobacterium sp. HH01]
gi|444787807|gb|ELX09355.1| hypothetical protein Jab_2c14210 [Janthinobacterium sp. HH01]
Length = 789
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 134/398 (33%), Positives = 194/398 (48%), Gaps = 33/398 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L+ L DV+L S AQ+T+L YLL ++ D L+ F + AG P +Y WE +
Sbjct: 29 LQLFPLADVRLGDSPF-LEAQRTDLHYLLEMEPDRLLAPFLREAGLPPKQPSYGNWE--S 85
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS----- 216
L GH GHYLSA A M+AST + + ++ V+ L CQ + G+GY+ P
Sbjct: 86 TGLDGHLGGHYLSALALMYASTGDEEVLRRLNYFVAELKRCQERNGNGYIGGIPDGSAAW 145
Query: 217 EQFDRFE------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFY 266
+ R E ++ W P+Y +HK+ AGL D Y +A N A + M+ W +E
Sbjct: 146 QAIARGELHVDNFSVNGKWVPWYNLHKVYAGLRDAYAYAGNADARAMLVSMSDWALE--- 202
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
+ + S E+ L E GGMN+VL + +T K++ LA F L L
Sbjct: 203 -----LTSHLSEEQMQAMLRSEHGGMNEVLADVAQMTGQKKYMDLAVRFSHQAILRPLEE 257
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
D ++G HANT IP VIG + ++TG ++ FF V A GG S E +
Sbjct: 258 GKDQLTGLHANTQIPKVIGFKHIGDMTGRRDWQQAAQFFWQTVRDHRTVAIGGNSVKEHF 317
Query: 387 SDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
D + + E E+C TYNMLK++ LF + Y DYYERAL N +LS QR +
Sbjct: 318 HDDRDFLPMVDEVEGPETCNTYNMLKLTELLFLGDAKGSYTDYYERALYNHILSSQR-PD 376
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P+ + Y + + WCC G+G+
Sbjct: 377 SGGFVYFTPM-----RPNHYRVYSQVDKAMWCCVGSGI 409
>gi|115399582|ref|XP_001215378.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114192261|gb|EAU33961.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 614
Score = 201 bits (511), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 139/396 (35%), Positives = 189/396 (47%), Gaps = 24/396 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWEDP 160
L E+SL D + Q+ L YL +D + L+ +F+ T G A GW+ P
Sbjct: 31 LSELSLGDGRF------LDNQERTLSYLKFVDTERLLLNFRANHKLDTKGAVANGGWDAP 84
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFP 215
T R H GH+L+A A +A + +E+ T VS L++CQ +GYLS FP
Sbjct: 85 TFPFRTHVQGHFLTAWAQCYAVLGDTDCQERATYFVSELAKCQANNEAAGFKTGYLSGFP 144
Query: 216 SEQFDRFEA--LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
FD EA L PYY IHK LAGLLD + +T A + + + R
Sbjct: 145 ESDFDALEAGTLNNGNVPYYNIHKTLAGLLDVWRLVGDTTARDVLLALAGWVDTRT---- 200
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
+ S + + L E GGMNDVL LY T D K L A FD LA D ++G
Sbjct: 201 SALSEAQMQSVLGTEFGGMNDVLADLYHQTSDEKWLKTAQRFDHAAVFDPLAANEDQLNG 260
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
HANT +P IG+ Y+ TGD Y I +H YA G S E + P +A
Sbjct: 261 LHANTQVPKWIGAVREYKATGDTRYLDIARNAWTITVNAHTYAIGANSQAEHFHAPNAIA 320
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKE-MVYADYYERALTNGVLSIQRGTEP-GVMIY 451
L ++ E+C +YNMLK++R L+ E Y D+YE AL N +L Q + G + Y
Sbjct: 321 QYLDSDTAEACNSYNMLKLTRELWTLDPENTTYFDFYENALLNHLLGQQNPADSHGHITY 380
Query: 452 MLPLGRGDSK--AKSYHG--WGTRFSSFWCCYGTGL 483
L G ++ ++ G W T + SFWCC GT L
Sbjct: 381 FTSLNPGGNRGVGPAWGGGTWSTDYDSFWCCQGTAL 416
>gi|388259955|ref|ZP_10137121.1| hypothetical protein O59_001162 [Cellvibrio sp. BR]
gi|387936316|gb|EIK42881.1| hypothetical protein O59_001162 [Cellvibrio sp. BR]
Length = 803
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 133/407 (32%), Positives = 197/407 (48%), Gaps = 50/407 (12%)
Query: 108 HDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGH 167
DV+L S +AQ TN +YL+ LD + L+ F++ AG P + Y WE + L GH
Sbjct: 30 RDVQLLDSPF-LQAQNTNKDYLMALDTEKLLAPFRREAGLPFK-ETYGNWE--STGLDGH 85
Query: 168 FVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE---------- 217
GHY++A A ++A+T + + +++ V++ L +CQ+K+GSGY+ P
Sbjct: 86 MGGHYVTALALLYAATKDDVVLQRLNYVIAELKKCQDKLGSGYIGGIPDSNTMWSEIARG 145
Query: 218 --QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ D F + W P+Y +HKI AGL D Y +A N A KM + W +E
Sbjct: 146 DIRADNF-STNERWVPWYNLHKIYAGLRDAYLYAGNEDAKKMLVRLSDWTIE-------- 196
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ K S E+ L E GGMN+V + IT D K+L LA F L L Q D +
Sbjct: 197 LTKKLSPEQMQTMLRTEHGGMNEVFVDVAEITGDKKYLKLAEAFSHQAILQPLEKQQDQL 256
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP +IG + + T + + FF V A GG S E + D
Sbjct: 257 TGLHANTQIPKIIGFKKVADATHNESWNKAAEFFWQTVVDKRTVAIGGNSVKEHFHDSHD 316
Query: 392 LASTL-GTENEESCTTYNMLKVSRHLFRWTKE--------------MVYADYYERALTNG 436
+ + E E+C TYNMLK+++ LF +++ M Y DYYERAL N
Sbjct: 317 FTAMIEDVEGPETCNTYNMLKLTQLLFLSSRDNSAADMKKSKNNPAMKYVDYYERALYNH 376
Query: 437 VLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+LS Q + G ++Y + + Y + WCC G+G+
Sbjct: 377 ILSSQH-PQTGGLVYFTSM-----RPNHYRKYSQVHDGMWCCVGSGI 417
>gi|390943351|ref|YP_006407112.1| hypothetical protein Belba_1756 [Belliella baltica DSM 15883]
gi|390416779|gb|AFL84357.1| hypothetical protein Belba_1756 [Belliella baltica DSM 15883]
Length = 785
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 123/375 (32%), Positives = 191/375 (50%), Gaps = 24/375 (6%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQ+ + +Y+L +DVD L+ + K AG + Y WED L GH GHYLSA + M+
Sbjct: 45 AQEVDKKYILEMDVDRLLAPYMKDAGIEWIAENYGNWED--TGLDGHIGGHYLSALSMMY 102
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFE-----------ALKPVW 229
AST ++ +K ++ ++ L Q+K +GY+ P+ Q E +L W
Sbjct: 103 ASTGDIEIKSRLDYMIEQLKLAQDKNANGYIGGVPNGQKIWEEIRVGNIKAGSFSLNDRW 162
Query: 230 APYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
P Y IHKI AGL D Y A A M + ++FY+ + +S + L E
Sbjct: 163 VPLYNIHKIYAGLKDAYLIAGIADAKPMLIALSDWFYDLTEG----FSEAQFQEILISEH 218
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GG+N+V + +T +PK+L LA L L+ + D+++G HANT IP VIG Q
Sbjct: 219 GGLNEVFADVSAMTGNPKYLELAKKMSHNLILDPLSKRQDNLTGMHANTQIPKVIGFQRI 278
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN-EESCTTYN 408
+++ + + + T+F + V + GG S E + + L ++ E+C TYN
Sbjct: 279 AQLSDEAKWNNSATYFWENVTNQRSVSIGGNSVREHFHPKDDFSPMLSSDQGPETCNTYN 338
Query: 409 MLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGW 468
M+++S LF + + Y DYYERAL N +LS Q T+ G +Y P+ + + Y +
Sbjct: 339 MMRLSEKLFESSPDRKYIDYYERALYNHILSSQHPTKGG-FVYFTPM-----RPQHYRVY 392
Query: 469 GTRFSSFWCCYGTGL 483
+FWCC G+GL
Sbjct: 393 SQPHENFWCCVGSGL 407
>gi|239627978|ref|ZP_04671009.1| secreted protein [Clostridiales bacterium 1_7_47_FAA]
gi|239518124|gb|EEQ57990.1| secreted protein [Clostridiales bacterium 1_7_47FAA]
Length = 822
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 123/390 (31%), Positives = 193/390 (49%), Gaps = 18/390 (4%)
Query: 104 EVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA-YEGWEDPTC 162
EV V+L + W AQ+ + +LL +D D ++++F+ AG G GW+ P C
Sbjct: 225 EVPAGSVRLSEGTRFWDAQERMIRWLLSVDDDQMLYNFRSAAGLDVRGAGPMTGWDAPEC 284
Query: 163 ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM-----GSGYLSAFPSE 217
L+GH GHYLS A + LK+K+ +V+AL+ECQ + G+LSA+ +
Sbjct: 285 NLKGHTTGHYLSGLALACSVHGQPELKDKINYMVNALAECQKALEAKGCAKGFLSAYSEQ 344
Query: 218 QFDRFEAL---KPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
QFD E +WAPYYT+ KI++GL D Y A + +A + + ++ Y R+ ++
Sbjct: 345 QFDLLEVYTRYPEIWAPYYTLDKIMSGLYDCYCLAGSKEAFHLLTGLGDWIYGRLSR-LS 403
Query: 275 KYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
+ +++ W+ + E GGM V+ RLY T D ++ A F + D +
Sbjct: 404 RAQLDKMWSMYIAGEFGGMISVMVRLYRETGDGRYRRAALFFRNEKLFYPMEENVDTLKD 463
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
HAN HIP IG+ Y+ G Y F +V SH Y+ GG E + +P +A
Sbjct: 464 MHANQHIPQAIGALELYKAGGGKRYLAIARNFWQMVVRSHEYSIGGVGETEMFHEPGDIA 523
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
+ ++ ESC +YN+++++ LF + + DYYE L N +LS G Y +
Sbjct: 524 HYMTDKSAESCASYNLMRLTFGLFGLSPDSRKMDYYENVLYNHILSSASHKADGGTTYFM 583
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
P+ G K + S CC+GTGL
Sbjct: 584 PVRPGGRKEFN-------TSENTCCHGTGL 606
>gi|113970330|ref|YP_734123.1| hypothetical protein Shewmr4_1993 [Shewanella sp. MR-4]
gi|113885014|gb|ABI39066.1| protein of unknown function DUF1680 [Shewanella sp. MR-4]
Length = 795
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 128/396 (32%), Positives = 202/396 (51%), Gaps = 29/396 (7%)
Query: 102 LKEVSLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
L + L+DV+L LH AQQT+L Y++ +D + L+ ++K AG T Y WE+
Sbjct: 28 LTPIPLNDVRLTAGPFLH--AQQTDLAYIMSMDPERLLAPYRKEAGIATTADNYPNWEN- 84
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE--- 217
L GH GHYLSA A M+A+T + + E++ +V+ L +CQ G+GY+ P
Sbjct: 85 -TGLDGHIGGHYLSALALMYAATGDQAVLERLNYMVAELEKCQQAHGNGYVGGVPHGDKL 143
Query: 218 ---------QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNR 268
+ D F L W P+Y +HK+ AGL D Y + N A KM ++ +
Sbjct: 144 WQQVAAGHIEADLF-TLNQSWVPWYNLHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDL 202
Query: 269 VQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQA 328
+N+ E+ L E GG+N+ L +Y+IT K+L LA+ + L L
Sbjct: 203 SRNLTD----EQLQLMLRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQHQ 258
Query: 329 DDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSD 388
+ ++G HANT IP ++G E++ + + + +F V + GG S E +
Sbjct: 259 EKLTGLHANTQIPKIVGVARIAELSHNKAWLESADYFWQQVVHQRTVSIGGNSVREHFHP 318
Query: 389 PKRLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPG 447
+ +S L + E E+C TYNMLK+S+ L+ +++ Y DYYERAL N +LS Q + G
Sbjct: 319 SEDFSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-PQTG 377
Query: 448 VMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
++Y P+ + Y + + S WCC G+G+
Sbjct: 378 GLVYFTPM-----RPDHYRVYSSAQESMWCCVGSGI 408
>gi|338209455|ref|YP_004646426.1| hypothetical protein Runsl_5734 [Runella slithyformis DSM 19594]
gi|336308918|gb|AEI52019.1| protein of unknown function DUF1680 [Runella slithyformis DSM
19594]
Length = 760
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 127/394 (32%), Positives = 197/394 (50%), Gaps = 25/394 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ SL +VK+ + AQ +L Y+L L+ D L+ + AG P + Y WE +
Sbjct: 22 MQSFSLQEVKVTGGAFK-NAQDVDLRYILSLNPDKLLAPYLIDAGLPLKAERYGNWE--S 78
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS----- 216
L GH GHYLSA A M+AST N LK+++ ++ L++CQ K G+GY+ P
Sbjct: 79 SGLDGHIGGHYLSALAMMYASTGNAELKKRLDYMIDQLAQCQAKNGNGYVGGIPQGKVFW 138
Query: 217 EQFDRFE------ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ 270
E+ + + L W P Y IHK+ AGL D Y F N QA ++ + ++F
Sbjct: 139 ERIYKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDSYEFGGNQQAKQVLIGLGDWF----A 194
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+I S ++ L E GGMN+ LY +T++ K+L A L L + D
Sbjct: 195 ELIRPLSDDQIQQILRTEHGGMNEAFADLYILTKNQKYLETAQRISHRAILNPLVQKQDK 254
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
++G HANT IP VIG + +T + + +F V+ + A GG S E ++
Sbjct: 255 LTGLHANTQIPKVIGFEKIAMLTENAKWSEAARYFWQNVSQTRTVAFGGNSVREHFNPTN 314
Query: 391 RLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+S L + + E+C ++NML++S+ LF + Y D+YER L N +LS Q + G
Sbjct: 315 DFSSMLKSNQGPETCNSFNMLRLSKALFLDKNDPSYLDFYERTLYNHILSSQH-PQKGGF 373
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+Y P+ + Y + +S WCC G+GL
Sbjct: 374 VYFTPI-----RPNHYRVYSQPETSMWCCVGSGL 402
>gi|393718114|ref|ZP_10338041.1| hypothetical protein SechA1_00115 [Sphingomonas echinoides ATCC
14820]
Length = 789
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 135/405 (33%), Positives = 195/405 (48%), Gaps = 44/405 (10%)
Query: 105 VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCEL 164
+ L V+L PS + A + N YLL L D L+ +F+ AG G+ Y GWE T +
Sbjct: 39 LPLSAVRLRPSD-YATAVEVNRAYLLRLSADRLLHNFRAYAGLKPKGEVYGGWESDT--I 95
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA----------- 213
GH +GHY+SA + T + K + +V L++ Q G+GY+ A
Sbjct: 96 AGHTLGHYMSALVLLHEQTGDAQAKRRADYIVDELADAQAARGNGYIGAMQRKRKDGTVV 155
Query: 214 -----FP--------SEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKW 260
FP S FD L W+P+YT+HK+ AGLLD + N +AL +
Sbjct: 156 DAIEIFPEIIKGDIRSGGFD----LNGAWSPFYTVHKLFAGLLDIHASWGNAKALSVAIA 211
Query: 261 MVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLA-HLFDKPC 319
YF + V + L E GG+N+ L+ T+D K L +A L+D+
Sbjct: 212 FAGYF----EPVFAALDDAQMQTMLGTEYGGLNESFAELFARTKDRKWLAIAERLYDRKV 267
Query: 320 FLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGG 379
L A Q D ++ FHANT +P +IG +E+TG+P FF V H Y GG
Sbjct: 268 LDPLTAGQ-DKLANFHANTQVPKLIGLARIHELTGEPAKAAAPRFFWQAVTKHHSYVIGG 326
Query: 380 TSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLS 439
+ E++S+P ++ + + E C TYNMLK++R L+ W + DYYERA N V++
Sbjct: 327 NADREYFSEPDSISRHITEQTCEHCNTYNMLKLTRQLYSWQPDGALFDYYERAHLNHVMA 386
Query: 440 IQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRF-SSFWCCYGTGL 483
Q G YM PL G + G+ T +FWCC GTG+
Sbjct: 387 AQDPKTAG-FTYMTPLLTG-----AVRGYSTSADDAFWCCVGTGM 425
>gi|284036341|ref|YP_003386271.1| hypothetical protein Slin_1422 [Spirosoma linguale DSM 74]
gi|283815634|gb|ADB37472.1| protein of unknown function DUF1680 [Spirosoma linguale DSM 74]
Length = 760
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 128/398 (32%), Positives = 194/398 (48%), Gaps = 33/398 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
++ +L DVK+ AQ +L+Y+L L+ + L+ + AG P Y WE +
Sbjct: 22 MQPFALQDVKVTGGPFK-NAQDVDLKYILALNPNKLLAPYLIDAGLPEKAPRYGNWE--S 78
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS----- 216
L GH GHYLSA A M+AST N K+++ +V L++CQ K G+GY+ P
Sbjct: 79 SGLDGHIGGHYLSALAMMYASTGNAETKKRLDYMVDQLAQCQAKNGNGYVGGIPQGKVFW 138
Query: 217 EQFDRFE------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFY 266
E+ + + L W P Y IHK+ AGL D Y +A N QA + + W VE
Sbjct: 139 ERIHKGDIDGSSFGLNNTWVPLYNIHKLFAGLRDAYEYAGNQQAKQVLIGLGDWFVE--- 195
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
+I S E+ L E GG+N+ LY +T+D K+L A L L
Sbjct: 196 -----LIKPLSDEQIQQVLRTEHGGINETFADLYILTKDQKYLETAQRISHRAILDPLID 250
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
+ D ++G HANT IP VIG + +TG + +F V+ + A GG S E +
Sbjct: 251 KQDKLTGLHANTQIPKVIGFEKIATLTGKSDWSDAAQYFWQNVSQTRSVAFGGNSVREHF 310
Query: 387 SDPKRLASTL-GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
+ + L + E+C ++NML++S+ LF ++ Y D+YER + N +LS Q E
Sbjct: 311 NPTTDFSQLLRSNQGPETCNSFNMLRLSKALFLDKNDVSYLDFYERTMYNHILSSQH-PE 369
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P+ + Y + +S WCC G+G+
Sbjct: 370 KGGFVYFTPI-----RPNHYRVYSQPETSMWCCVGSGI 402
>gi|407923357|gb|EKG16430.1| Six-hairpin glycosidase-like protein [Macrophomina phaseolina MS6]
Length = 612
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 137/409 (33%), Positives = 200/409 (48%), Gaps = 25/409 (6%)
Query: 92 PDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLE-YLLMLDVDSLVWSFQKTAGSPTA 150
P + AG + V+L S W+ Q YL +D+D L+++++ T G T
Sbjct: 14 PPAQEEAGVLAYPFDISQVRL--SDGRWQENQERTRTYLKFVDLDRLLYNYRATHGLSTN 71
Query: 151 GKAYEG-WEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNK---- 205
G A G W+ P R H GH+L+A W++T + +++ + L +CQ
Sbjct: 72 GAASNGGWDAPDFPFRSHAQGHFLTAWVQCWSTTGDTECRDRAVQFTAELLKCQENNEAA 131
Query: 206 -MGSGYLSAFPSEQFDRFEA--LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMV 262
+GYLS FP +FD E L PYY +HK++AGLLD + + A + +
Sbjct: 132 GFTAGYLSGFPESEFDALEGRTLSNGNVPYYVVHKLMAGLLDVWRGIGDLTARDVLLALA 191
Query: 263 EYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ R +N I+ ++R L E GGM++VL +Y + D + L +A F+ L
Sbjct: 192 GWVDARTEN-ISYGDMQR---ILQTEFGGMSEVLADIYYQSGDSRWLTVAQRFEHAAVLT 247
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
LA D ++G HANT +P IG+ Y+ TG+ Y DI +H YA GG S
Sbjct: 248 PLANNRDQLNGLHANTQVPKWIGAAREYKATGNTTYYDIARNAWDITVRAHTYAIGGNSQ 307
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKE---MVYADYYERALTNGVLS 439
E + P +A L + ESC +YNMLK++R L WT E Y DYYER L N ++
Sbjct: 308 AEHFRPPNAIAGYLTADTAESCNSYNMLKLTREL--WTTEPSSSAYFDYYERTLMNHLVG 365
Query: 440 IQRGTEP-GVMIY---MLPLG-RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Q +P G + Y + P G RG A W T + SFWCC GTG+
Sbjct: 366 QQDPEDPHGHVTYFNSLQPGGVRGVGPAWGGGTWSTDYDSFWCCQGTGV 414
>gi|237708621|ref|ZP_04539102.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
gi|229457321|gb|EEO63042.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
Length = 783
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 129/393 (32%), Positives = 196/393 (49%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
+ DV+L S A+ ++ YLL +D D L+ + K AG + Y WE+ L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE- 223
H GHYLSA ++M+A+T N +K ++ ++S L CQ+ G GYL P+ + + E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 224 --------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
L W P Y IHK+ AGL D + +A +K+T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+I+K S E+ + L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ G+ + +F + V GG S E +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 392 LASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S L +E E+C TYNML++++ L+ + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDSVQGG-FV 380
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ +A Y + +SFWCC G+G+
Sbjct: 381 YFTPM-----RAGHYRVYSQPQTSFWCCVGSGM 408
>gi|423242461|ref|ZP_17223569.1| hypothetical protein HMPREF1065_04192 [Bacteroides dorei
CL03T12C01]
gi|392639254|gb|EIY33080.1| hypothetical protein HMPREF1065_04192 [Bacteroides dorei
CL03T12C01]
Length = 783
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 129/393 (32%), Positives = 196/393 (49%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
+ DV+L S A+ ++ YLL +D D L+ + K AG + Y WE+ L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE- 223
H GHYLSA ++M+A+T N +K ++ ++S L CQ+ G GYL P+ + + E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 224 --------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
L W P Y IHK+ AGL D + +A +K+T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+I+K S E+ + L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ G+ + +F + V GG S E +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 392 LASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S L +E E+C TYNML++++ L+ + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ +A Y + +SFWCC G+G+
Sbjct: 381 YFTPM-----RAGHYRVYSQPQTSFWCCVGSGM 408
>gi|345513549|ref|ZP_08793069.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
gi|229437570|gb|EEO47647.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
Length = 783
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 129/393 (32%), Positives = 196/393 (49%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
+ DV+L S A+ ++ YLL +D D L+ + K AG + Y WE+ L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGMDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE- 223
H GHYLSA ++M+A+T N +K ++ ++S L CQ+ G GYL P+ + + E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 224 --------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
L W P Y IHK+ AGL D + +A +K+T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+I+K S E+ + L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ G+ + +F + V GG S E +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 392 LASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S L +E E+C TYNML++++ L+ + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ +A Y + +SFWCC G+G+
Sbjct: 381 YFTPM-----RAGHYRVYSQPQTSFWCCVGSGM 408
>gi|212691787|ref|ZP_03299915.1| hypothetical protein BACDOR_01282 [Bacteroides dorei DSM 17855]
gi|212665688|gb|EEB26260.1| hypothetical protein BACDOR_01282 [Bacteroides dorei DSM 17855]
Length = 783
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 129/393 (32%), Positives = 196/393 (49%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
+ DV+L S A+ ++ YLL +D D L+ + K AG + Y WE+ L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE- 223
H GHYLSA ++M+A+T N +K ++ ++S L CQ+ G GYL P+ + + E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 224 --------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
L W P Y IHK+ AGL D + +A +K+T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+I+K S E+ + L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ G+ + +F + V GG S E +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 392 LASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S L +E E+C TYNML++++ L+ + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ +A Y + +SFWCC G+G+
Sbjct: 381 YFTPM-----RAGHYRVYSQPQTSFWCCVGSGM 408
>gi|265755220|ref|ZP_06089990.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|423231114|ref|ZP_17217517.1| hypothetical protein HMPREF1063_03337 [Bacteroides dorei
CL02T00C15]
gi|423246788|ref|ZP_17227840.1| hypothetical protein HMPREF1064_04046 [Bacteroides dorei
CL02T12C06]
gi|263234362|gb|EEZ19952.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|392629229|gb|EIY23239.1| hypothetical protein HMPREF1063_03337 [Bacteroides dorei
CL02T00C15]
gi|392634665|gb|EIY28581.1| hypothetical protein HMPREF1064_04046 [Bacteroides dorei
CL02T12C06]
Length = 783
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 129/393 (32%), Positives = 196/393 (49%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
+ DV+L S A+ ++ YLL +D D L+ + K AG + Y WE+ L G
Sbjct: 33 VRDVRLTASPFK-HAEDMDIRYLLGIDPDRLLAPYLKEAGLFPKAENYTNWEN--TGLDG 89
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE- 223
H GHYLSA ++M+A+T N +K ++ ++S L CQ+ G GYL P+ + + E
Sbjct: 90 HIGGHYLSALSYMYAATGNQEIKVRLDYMISELKRCQDAAGDGYLCGVPNGRKMWKEIEE 149
Query: 224 --------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
L W P Y IHK+ AGL D + +A +K+T WM+
Sbjct: 150 GNIRASGFGLNDRWVPLYNIHKMYAGLRDATLQTGSKEAKEMLVKLTDWMI--------R 201
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+I+K S E+ + L E GG+N+ + IT D ++L LAH F L L Q D +
Sbjct: 202 LISKLSDEQIQDMLRSEHGGLNETFADVAAITGDKRYLKLAHQFSHQTVLQPLLKQEDKL 261
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ G+ + +F + V GG S E +
Sbjct: 262 TGMHANTQIPKVIGFKRIADLEGNRDWSEAARYFWETVVDHRSITIGGNSVREHFHPADD 321
Query: 392 LASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S L +E E+C TYNML++++ L+ + + DYYERAL N +LS Q + G +
Sbjct: 322 FSSMLTSEQGPETCNTYNMLRLTKMLYETSADAHLMDYYERALYNHILSTQDPVQGG-FV 380
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ +A Y + +SFWCC G+G+
Sbjct: 381 YFTPM-----RAGHYRVYSQPQTSFWCCVGSGM 408
>gi|114047478|ref|YP_738028.1| hypothetical protein Shewmr7_1982 [Shewanella sp. MR-7]
gi|113888920|gb|ABI42971.1| protein of unknown function DUF1680 [Shewanella sp. MR-7]
Length = 795
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 128/396 (32%), Positives = 201/396 (50%), Gaps = 29/396 (7%)
Query: 102 LKEVSLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
L + L+DV+L LH AQQT+L Y++ +D + L+ ++K AG T Y WE+
Sbjct: 28 LTPIPLNDVRLTAGPFLH--AQQTDLAYIMSMDPERLLAPYRKEAGIATTADNYPNWEN- 84
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE--- 217
L GH GHYLSA A M+A+T + + E++ +V+ L +CQ G+GY+ P
Sbjct: 85 -TGLDGHIGGHYLSALALMYAATGDQAVLERLNYMVAELEKCQQAHGNGYVGGVPHGDKL 143
Query: 218 ---------QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNR 268
+ D F L W P+Y +HK+ AGL D Y + N A KM ++ +
Sbjct: 144 WQQVAAGHIEADLF-TLNQSWVPWYNLHKVFAGLRDAYLYTQNPTAKKMLVGFADWMLDL 202
Query: 269 VQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQA 328
+N+ E+ L E GG+N+ L +Y+IT K+L LA+ + L L
Sbjct: 203 SRNLTD----EQLQLMLRTEYGGLNETLADVYSITGQNKYLNLANRYTDQSLLQPLLQHQ 258
Query: 329 DDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSD 388
D ++ HANT IP ++G E++ + + + +F V + GG S E +
Sbjct: 259 DKLTRLHANTQIPKIVGVARIAELSHNKAWLESADYFWQQVVHQRTVSIGGNSVREHFHP 318
Query: 389 PKRLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPG 447
+ +S L + E E+C TYNMLK+S+ L+ +++ Y DYYERAL N +LS Q + G
Sbjct: 319 SEDFSSMLDSVEGPETCNTYNMLKLSKLLYENKRDLRYIDYYERALYNHILSSQH-PQTG 377
Query: 448 VMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
++Y P+ + Y + + S WCC G+G+
Sbjct: 378 GLVYFTPM-----RPDHYRVYSSAQESMWCCVGSGI 408
>gi|217973327|ref|YP_002358078.1| hypothetical protein Sbal223_2153 [Shewanella baltica OS223]
gi|217498462|gb|ACK46655.1| protein of unknown function DUF1680 [Shewanella baltica OS223]
Length = 792
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 128/397 (32%), Positives = 203/397 (51%), Gaps = 31/397 (7%)
Query: 105 VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCEL 164
+ L+DV++ AQQT+L Y++ +D + L+ ++K AG T + Y WED L
Sbjct: 23 IPLNDVRITAGPF-LHAQQTDLHYIMSMDPERLLAPYRKDAGIATTAENYPNWED--TGL 79
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS-----EQF 219
GH GHYLSA A M+A+T + + ++ +V+ L +CQ G+GYL P+ +Q
Sbjct: 80 DGHIGGHYLSALALMYAATSDKAVLARLNYMVAELEKCQQAHGNGYLGGVPNSRKLWQQI 139
Query: 220 D--RFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVI 273
+ + EA L W P+Y +HK+ +GL D + + +N A KM + +F + + ++
Sbjct: 140 EQGKIEADLFTLNQAWVPWYNVHKVFSGLRDAHLYTNNPTAKKM----LVHFADWMLHLS 195
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
K S E+ L E GG+N+ L +Y IT K+L LA + L L D ++G
Sbjct: 196 NKLSDEQLQLMLRTEYGGLNETLADVYVITGQDKYLALAKRYTDQSLLQPLLHHEDKLTG 255
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
HANT IP ++G E++ + ++ + FF V + GG S E + +
Sbjct: 256 LHANTQIPKIVGVARIAELSNNKVWLDSADFFWQQVVHKRTVSIGGNSVREHFHPSDDFS 315
Query: 394 STL-GTENEESCTTYNMLKVSRHLF------RWTKEMVYADYYERALTNGVLSIQRGTEP 446
S L E E+C TYNMLK+S+ L+ ++ Y +YYERAL N +LS Q E
Sbjct: 316 SMLESAEGPETCNTYNMLKLSKLLYENKLLDENKADLAYIEYYERALYNHILSSQH-PEN 374
Query: 447 GVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G ++Y P+ + Y + + S WCC G+G+
Sbjct: 375 GGLVYFTPM-----RPDHYRVYSSAQQSMWCCVGSGI 406
>gi|315498357|ref|YP_004087161.1| hypothetical protein Astex_1338 [Asticcacaulis excentricus CB 48]
gi|315416369|gb|ADU13010.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
48]
Length = 797
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 125/401 (31%), Positives = 199/401 (49%), Gaps = 33/401 (8%)
Query: 103 KEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC 162
+ + L V+L PS A + N YLL L D ++++ K AG P G+ Y GWE T
Sbjct: 39 RPIPLTQVRLLPSPF-LEAVEANRRYLLFLSPDRFLYNYHKFAGMPVKGEIYGGWESDT- 96
Query: 163 ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAF-------- 214
+ G +GHYLSA + M A T + ++ ++S L + Q G GY++ F
Sbjct: 97 -IAGEGLGHYLSALSLMHAQTGDNECVARIHYIISELEKVQAAHGDGYVAGFMRKRKDGS 155
Query: 215 ---PSEQFDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMV 262
E F A L W P+Y HK+ AGLLD + + + + + +
Sbjct: 156 IVDGKEIFPEIMAGDIRSAGFDLNGCWVPFYNWHKLFAGLLDAQAYCGVDRGIPVAEKLG 215
Query: 263 EYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
Y ++ V + L+ E GG+N+ LY+ T +P+ L L+ L
Sbjct: 216 GY----IEMVFAALDDAQTQKVLDCEHGGINESFAELYSRTNNPRWLKLSERLYHHRMLD 271
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
LA + D ++ HANT +P +IG YE+T P Y+ +FF + V H + GG +
Sbjct: 272 PLAAREDKLANNHANTQVPKLIGLARLYELTQKPQYQTASSFFWERVVNHHSFVIGGNAD 331
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
E++ +P +++ + + ESC TYNMLK++RHL+ W+ + + DYYERA N +L+ Q
Sbjct: 332 REYFFEPDTISAHITEQTCESCNTYNMLKLTRHLYSWSPKAAWFDYYERAHLNHMLAHQN 391
Query: 443 GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM+PL G ++ G+ +SFWCC +G+
Sbjct: 392 -PKTGMFTYMMPLMSGAAR-----GFSDEENSFWCCVLSGI 426
>gi|90020425|ref|YP_526252.1| Acetyl-CoA carboxylase, biotin carboxylase [Saccharophagus
degradans 2-40]
gi|89950025|gb|ABD80040.1| protein of unknown function DUF1680 [Saccharophagus degradans 2-40]
Length = 803
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 133/390 (34%), Positives = 193/390 (49%), Gaps = 27/390 (6%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L DV+L S AQ N+EY+L L D L+ F K AG P + Y WE + L G
Sbjct: 36 LADVRLLDSPFK-HAQDKNVEYVLALQPDKLLAPFLKEAGLPVKAENYGNWE--SQGLDG 92
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF------- 219
H GHYL+A + +A+T + L +++ +++ L QNK +GY+ + +
Sbjct: 93 HIGGHYLTALSLAYAATGDKRLLDRLNYMLNELERAQNKNSNGYIGGVRNGKALWDNIAK 152
Query: 220 -----DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
D F AL W P+Y +HKI AGL D Y + + QA M + E+ +
Sbjct: 153 GDIRADLF-ALNDYWVPWYNLHKIYAGLRDAYIYTGSEQAKAMLIGLGEW----TIALTA 207
Query: 275 KYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
+ E+ L E GGMN+V + IT D ++L LA F L L + D ++G
Sbjct: 208 DLNDEQIEKMLTTEYGGMNEVFADMAAITGDKRYLSLAKQFSHKKILNPLLQKRDALNGL 267
Query: 335 HANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAS 394
HANT IP V+G Q E+TGD + +F V + A GG S E + D + A
Sbjct: 268 HANTQIPKVVGYQRVAELTGDEEWHKAADYFWHHVVNNRTVAIGGNSVREHFHDSEDFAP 327
Query: 395 TLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
+ E E+C TYNMLK+SR LF + Y DY+ERAL N +LS Q E G ++Y
Sbjct: 328 MINDVEGPETCNTYNMLKLSRMLFSVNPSVDYVDYFERALYNHILSSQH-PETGGLVYFT 386
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
P+ + + Y + ++ WCC G+G+
Sbjct: 387 PM-----RPQHYRMYSQVDTAMWCCVGSGI 411
>gi|332685731|ref|YP_004455505.1| hypothetical protein MPTP_0197 [Melissococcus plutonius ATCC 35311]
gi|332369740|dbj|BAK20696.1| hypothetical protein MPTP_0197 [Melissococcus plutonius ATCC 35311]
Length = 883
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 140/427 (32%), Positives = 200/427 (46%), Gaps = 59/427 (13%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG-SPTAGKAYEGWE-D 159
+K + + + +H +AQ+ + YLL LDV ++ F K AG P Y+GWE
Sbjct: 1 MKPIDTKAITIQDPYIH-KAQENVIHYLLSLDVQKFLFEFYKVAGMKPLTESGYQGWERS 59
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKM----TAVVSALSECQNKMG------SG 209
RGHF GH+LSA A + + LK+K+ ++ L Q +G
Sbjct: 60 DQVNFRGHFFGHFLSALALSYQAEKQPILKKKIHQQIKTAITGLKAIQKNYAKQHPEHAG 119
Query: 210 YLSAFPSEQFDRFEALKPV--------WAPYYTIHKILAGLLD------QYTFADNTQAL 255
Y+SAF D E KPV P+Y +HKILAGLL+ + + +AL
Sbjct: 120 YISAFKEVALDEVEG-KPVDPKEKENVLVPWYNLHKILAGLLEVNISLKEVDSQLSKEAL 178
Query: 256 KMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLF 315
+ W +Y Y R+ N+ K + L E GGMND LY L+ +TQ +H + A F
Sbjct: 179 FIASWFGDYIYKRMMNLTDKNQM------LTIEYGGMNDALYYLFELTQKKEHAIAATYF 232
Query: 316 DKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEV-TGDPL--------------YKV 360
D+ LA + + G HANT IP +IG+ RY V + L Y
Sbjct: 233 DEDNLFNQLANDENVLPGKHANTTIPKLIGALKRYMVFQSEDLSAWLSNEEKEHLMSYFK 292
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRL----ASTLGTENEESCTTYNMLKVSRHL 416
F IV +H Y TGG S E + P L G E+C T+NMLK++R L
Sbjct: 293 AAENFWQIVVDNHTYCTGGNSQSEHFHGPNELFYDSEIRQGDCTCETCNTHNMLKLTRKL 352
Query: 417 FRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFW 476
+ TK+ Y DYYE N +L+ Q ++ G+M+Y P+G G +K + + FW
Sbjct: 353 YECTKDPKYLDYYETTYINAILASQ-NSKTGMMMYFQPMGAGYNKV-----YNRPYDEFW 406
Query: 477 CCYGTGL 483
CC GTG+
Sbjct: 407 CCSGTGI 413
>gi|342872240|gb|EGU74628.1| hypothetical protein FOXB_14856 [Fusarium oxysporum Fo5176]
Length = 616
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 142/401 (35%), Positives = 188/401 (46%), Gaps = 34/401 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQT-NLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWED 159
L +VSL D + W Q L YLL +D D L++ F+K G T G + GW+
Sbjct: 34 LTQVSLTDSR-------WMDNQNRTLNYLLSVDPDRLLYVFRKNHGVDTKGAQTNGGWDA 86
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAF 214
P R H GH+LSA +AS + T V L++CQ GYLS F
Sbjct: 87 PDFPFRSHVQGHFLSAWTQCYASAGVKECGSRATYFVQELAKCQANNAKAGFNKGYLSGF 146
Query: 215 PSEQFDRFE--ALKPVWAPYYTIHKILAGLLDQYT-FADNTQA---LKMTKWMVEYFYNR 268
P + E L PYY IHK LAGLLD Y D T L + W
Sbjct: 147 PESDITKVEDRTLNNGNVPYYAIHKTLAGLLDVYRRLGDQTAKDTMLSLASW-------- 198
Query: 269 VQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQA 328
V +K S + + L E GGMN+VL + T+D K L +A FD L
Sbjct: 199 VDTRTSKLSYNQMQSMLQTEFGGMNEVLADIAFYTKDAKWLKVAQRFDHAVIFDPLQQNV 258
Query: 329 DDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSD 388
D +SG HANT +P IG+ Y+V GD Y G ++V H YA GG S E +
Sbjct: 259 DKLSGLHANTQLPKWIGALREYKVGGDKKYLDIGRNAWNMVVNKHTYAIGGNSQAEHFRA 318
Query: 389 PKRLASTLGTENEESCTTYNMLKVSRHLFRWTK-EMVYADYYERALTNGVLSIQR-GTEP 446
P +A L + E+C +YNMLK++R L+ + Y D+YE+AL N +L Q ++
Sbjct: 319 PDAIAGFLTDDTCEACNSYNMLKLTRELWALNPTDASYFDFYEKALLNHLLGQQDPSSDH 378
Query: 447 GVMIYMLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G + Y PL RG A W T ++SFWCC GTG+
Sbjct: 379 GHVTYFTPLKAGGRRGVGPAWGGGTWSTDYNSFWCCQGTGV 419
>gi|404254065|ref|ZP_10958033.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas sp. PAMC
26621]
Length = 646
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 137/398 (34%), Positives = 195/398 (48%), Gaps = 35/398 (8%)
Query: 102 LKEVSLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWE-D 159
L+ L DV L LH AQ+ YLL LD D ++ +F+ AG Y GWE D
Sbjct: 46 LQPFDLADVDLGEGPFLH--AQRKTEAYLLSLDPDRMLHAFRVNAGLKPKAAVYGGWESD 103
Query: 160 PT---CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS 216
P +GH +GHYLSA A + ST ++++ + L+ CQ+ SG + AFP
Sbjct: 104 PIWADINCQGHTLGHYLSACALAYRSTRKPAFRQRIDHIARELAACQDAAKSGLVCAFPK 163
Query: 217 -----EQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYN 267
R +A+ V P+YT+HK+ AGL D AD+ ++ L++ W V
Sbjct: 164 GPALVAAHLRGDAITGV--PWYTLHKVFAGLRDATLLADSAESRAVLLRLADWAV----- 216
Query: 268 RVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
V T+ + + ++ E E GGMN+V LY +T +P + +A F L LA
Sbjct: 217 ----VATRPLSDAQFETMLETEHGGMNEVFADLYLMTGNPDYRTMAERFSHKALLTPLAA 272
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE-F 385
D + G HANT +P ++G Q +E TG P Y FF V + +ATGG E F
Sbjct: 273 GRDQLDGLHANTQLPKIVGFQRVFEATGTPHYHEAAAFFWRTVALTRSFATGGHGDNEHF 332
Query: 386 WSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
+ + + E+C +NMLK++R LF + YADYYER L NG+L+ Q +
Sbjct: 333 FPMAEFDKHVFSAKGSETCGQHNMLKLTRALFLQDPQAEYADYYERTLYNGILASQ-DPD 391
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G++ Y G K YH T SFWCC GTG+
Sbjct: 392 TGMVTYF--QGARPGYMKLYH---TPEHSFWCCTGTGM 424
>gi|395493738|ref|ZP_10425317.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas sp. PAMC
26617]
Length = 646
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 137/398 (34%), Positives = 195/398 (48%), Gaps = 35/398 (8%)
Query: 102 LKEVSLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWE-D 159
L+ L DV L LH AQ+ YLL LD D ++ +F+ AG Y GWE D
Sbjct: 46 LQPFDLADVDLGEGPFLH--AQRKTEAYLLSLDPDRMLHAFRVNAGLKPKAAVYGGWESD 103
Query: 160 PT---CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS 216
P +GH +GHYLSA A + ST ++++ + L+ CQ+ SG + AFP
Sbjct: 104 PIWADINCQGHTLGHYLSACALAYRSTRKPAFRQRIDHIARELAACQDAARSGLVCAFPK 163
Query: 217 -----EQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYN 267
R +A+ V P+YT+HK+ AGL D AD+ ++ L++ W V
Sbjct: 164 GPALVAAHLRGDAITGV--PWYTLHKVFAGLRDATLMADSAESRAVLLRLADWAV----- 216
Query: 268 RVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
V T+ + + ++ E E GGMN+V LY +T +P + +A F L LA
Sbjct: 217 ----VATRPLSDAQFETMLETEHGGMNEVFADLYLMTGNPDYRTMAERFSHKALLTPLAA 272
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE-F 385
D + G HANT +P ++G Q +E TG P Y FF V + +ATGG E F
Sbjct: 273 GRDQLDGLHANTQLPKIVGFQRVFEATGTPHYHEAAAFFWRTVALTRSFATGGHGDNEHF 332
Query: 386 WSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
+ + + E+C +NMLK++R LF + YADYYER L NG+L+ Q +
Sbjct: 333 FPMAEFDKHVFSAKGSETCGQHNMLKLTRALFLQDPQAEYADYYERTLYNGILASQ-DPD 391
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G++ Y G K YH T SFWCC GTG+
Sbjct: 392 TGMVTYF--QGARPGYMKLYH---TPEHSFWCCTGTGM 424
>gi|357032903|ref|ZP_09094838.1| tat twin-arginine translocation pathway signal sequence domain
protein [Gluconobacter morbifer G707]
gi|356413894|gb|EHH67546.1| tat twin-arginine translocation pathway signal sequence domain
protein [Gluconobacter morbifer G707]
Length = 790
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 132/408 (32%), Positives = 198/408 (48%), Gaps = 37/408 (9%)
Query: 98 AGDFLKEVSLHDVKLDPSSLHW-RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG 156
+G + + L +V+L PS W A + N YLL L+ D L+ +F+K AG P G Y G
Sbjct: 35 SGADVTPIPLSNVRLLPSP--WLEAVERNRIYLLSLEADRLLHNFRKQAGLPPKGALYGG 92
Query: 157 WEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS 216
WE T + GH +GHYLSA A M+A T + +E++ +V L Q + G GY++ F
Sbjct: 93 WESDT--IAGHTLGHYLSALALMYAQTDDAACRERVAYIVQELVVVQKQWGDGYVAGFTR 150
Query: 217 EQ-----------FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALK 256
++ F EA L W+P Y IHK AGLLD + + QAL
Sbjct: 151 KEKNGALVDGKRIFAEIEAGDIRSSGFDLNGAWSPLYNIHKTFAGLLDAHIYCHCDQALN 210
Query: 257 MTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAH-LF 315
+ + ++ + K + + L E GG+N+ L T D + L LA+ ++
Sbjct: 211 VAVGLGQFL----KAFFGKLTDAQMQKVLTCEYGGLNESFAELAARTGDEEWLRLAYRIY 266
Query: 316 DKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGY 375
D+P L L + DD++ HANT IP ++G EV+ + + FF V H Y
Sbjct: 267 DRPV-LDPLMEERDDLANRHANTQIPKLVGLARIAEVSQNRHWMTGPQFFWKAVTRHHSY 325
Query: 376 ATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTN 435
GG + E++S+P ++ + + E C TYNMLK++R + + DYYERA N
Sbjct: 326 VIGGNADREYFSEPDTISQHITEQTCEHCNTYNMLKLTRQCYASNPQAALFDYYERAHLN 385
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ + G+ YM P + W T SFWCC GTG+
Sbjct: 386 HILAAH-DPQTGMFTYMTP-----TITAGVREWSTPTESFWCCVGTGM 427
>gi|182415028|ref|YP_001820094.1| hypothetical protein Oter_3214 [Opitutus terrae PB90-1]
gi|177842242|gb|ACB76494.1| protein of unknown function DUF1680 [Opitutus terrae PB90-1]
Length = 844
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 136/394 (34%), Positives = 193/394 (48%), Gaps = 31/394 (7%)
Query: 105 VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCEL 164
+ L V+L + A + N YLL LD D L+ F++ AG P + Y WE + L
Sbjct: 76 LPLASVRLLEGGPFFTAVKANRTYLLALDADRLLAPFRREAGLPALAQPYGNWE--SGGL 133
Query: 165 RGHFVGHYLSASAHMWASTHNVT---LKEKMTAVVSALSECQNKMGSGYLSAFPS--EQF 219
GH GHYLSA AHM A+ H+ L+ ++ +V+ L CQ+ G+GY+ P E +
Sbjct: 134 DGHTAGHYLSALAHMIAAGHDTPEGELRRRLDHMVAELKACQDANGNGYVGGVPGSHELW 193
Query: 220 DRFEA-----LKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQ 270
R A + W P+Y +HK AGL D + NT A +++ W V
Sbjct: 194 QRVAAGDVTAVNRKWVPWYNLHKTFAGLRDAWLQTGNTTARDVLVRLGDWCVA------- 246
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+ + + E+ L +E GGMN+VL +Y IT D K+L A F+ L L D+
Sbjct: 247 -LTSPLTDEQMQRMLAQEHGGMNEVLADIYAITGDKKYLTAAERFNHHAVLDPLEQHRDE 305
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
++G HANT IP V+G + +TGD FF + V A GG S E ++DP
Sbjct: 306 LTGKHANTQIPKVVGLERIATLTGDKAADSGARFFWETVTQHRSVAFGGNSVSEHFNDPH 365
Query: 391 RLASTL-GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+ L E E+C TYNML+++ LF E YADYYERAL N +L+ PG
Sbjct: 366 NFHALLVHREGPETCNTYNMLRLTEGLFASAPEAAYADYYERALFNHILASINPDHPG-Y 424
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+Y P+ + Y + FWCC GTG+
Sbjct: 425 VYFTPI-----RPNHYRVYSQPDQGFWCCVGTGM 453
>gi|383640258|ref|ZP_09952664.1| acetyl-CoA carboxylase, biotin carboxylase [Sphingomonas elodea
ATCC 31461]
Length = 652
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 138/415 (33%), Positives = 194/415 (46%), Gaps = 51/415 (12%)
Query: 93 DGFKLAGDFLKEVSLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG 151
DG +A L+ + DV L LH AQ+ YLL L+ D L+ F+ AG
Sbjct: 42 DGAPVAAPRLQPFDMADVTLGEGPFLH--AQRATEAYLLRLEPDRLLHQFRVNAGLEPKA 99
Query: 152 KAYEGWE-DP---TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMG 207
AY GWE DP +GH +GHYLSA A + +T ++++ + + L CQ+
Sbjct: 100 PAYGGWESDPLWSDIHCQGHTLGHYLSACALAYRATGEARYRQRVDYIATELGACQDAAK 159
Query: 208 SGYLSAFPSEQF---DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKW 260
SG ++AFP K P+YT+HK+ AGL D AD+ A L++ W
Sbjct: 160 SGLVTAFPKGAALVSAHLRGEKITGVPWYTLHKVYAGLRDGALLADSEPARATLLRLADW 219
Query: 261 MVEYFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPC 319
V V ++ + + ++ E E GGMN++ LY +T ++ +A F
Sbjct: 220 GV---------VASRPLSDAEFEAMLETEHGGMNEIYADLYFMTGKEEYRAIARRFSHKA 270
Query: 320 FLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGG 379
L LA D + G HANT +P V+G Q YE TGD Y+ FF V + +ATGG
Sbjct: 271 LLAPLARAQDHLDGLHANTQVPKVVGFQRVYEATGDAAYRDAAAFFWKTVAQTRSFATGG 330
Query: 380 TSAGE-FWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVL 438
E F++ + E+C +NMLK++R LF + YADYYER L NG+L
Sbjct: 331 HGDNEHFFAMADFETHVFSAKGSETCCQHNMLKLTRALFLHDPDPAYADYYERTLYNGIL 390
Query: 439 SIQ----------RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ Q +G PG M K YH T SFWCC GTG+
Sbjct: 391 ASQDPDSGMATYFQGARPGYM-------------KLYH---TPEHSFWCCTGTGM 429
>gi|336319285|ref|YP_004599253.1| hypothetical protein Celgi_0157 [[Cellvibrio] gilvus ATCC 13127]
gi|336102866|gb|AEI10685.1| protein of unknown function DUF1680 [[Cellvibrio] gilvus ATCC
13127]
Length = 1577
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 143/424 (33%), Positives = 200/424 (47%), Gaps = 55/424 (12%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG-SPTAGKAYEGWE-D 159
L++ L D+ L + L A + + EYLL L + ++ + + G +PT Y GWE
Sbjct: 368 LQDSGLEDLYLTDAYLTNAAAKEH-EYLLSLSSEKFLYEWYRNVGLTPTTTSGYGGWERS 426
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVT----LKEKMTAVVSALSECQN------KMGSG 209
RGH GHY+SA + +++T + T L E++ V+ L+ Q+ +G
Sbjct: 427 DVTNFRGHAFGHYMSALSQSYSATADATTKAALLEQVEDAVAGLTLVQDTYAAAHPASAG 486
Query: 210 YLSAFPSEQFDRFEAL----KPVWAPYYTIHKILAGLLDQYTF---ADNTQALKMTKWMV 262
Y+SAFP D + V P+Y +HK+LAGLLD + + A QAL +
Sbjct: 487 YVSAFPESALDAVDGTGTTTDKVLVPWYNLHKVLAGLLDIHDYVGGATGAQALDIASQFG 546
Query: 263 EYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
EY Y R+ + + + L E GGMND LYRLY +T DP A FD+
Sbjct: 547 EYTYQRISRLTDRTRM------LRTEYGGMNDALYRLYDLTDDPHVKTAAEAFDETALFT 600
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEV-TGD---------------PLYKVTGTFFM 366
LA D ++G HANT IP +IG+ RY V T D P Y F
Sbjct: 601 QLAAGQDVLNGKHANTTIPKLIGALKRYTVFTSDADRLASLTEAERAQLPTYLAAAEEFW 660
Query: 367 DIVNASHGYATGGTSAGEFWSDPKRL-------ASTLGTENEESCTTYNMLKVSRHLFRW 419
I H YATG S E + DP L T + E+C YNMLK+SR LF+
Sbjct: 661 QITVDHHTYATGSNSQSEHFHDPDSLHEFATQQGETGNAQTSETCNEYNMLKLSRELFKL 720
Query: 420 TKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCY 479
TK++ YA YYE N VL+ Q + G+ Y P+ G + S ++ FWCC
Sbjct: 721 TKDVKYAHYYENTFINTVLASQN-PDTGMTTYFQPMAAGYDRIYSMP-----YTEFWCCT 774
Query: 480 GTGL 483
GTG+
Sbjct: 775 GTGM 778
>gi|392554933|ref|ZP_10302070.1| Acetyl-CoA carboxylase, biotin carboxylase [Pseudoalteromonas
undina NCIMB 2128]
Length = 816
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 136/392 (34%), Positives = 190/392 (48%), Gaps = 30/392 (7%)
Query: 106 SLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCEL 164
+L V L S LH AQQTN+ YLL L D L+ + + AG +Y WED L
Sbjct: 50 ALEQVSLSASPFLH--AQQTNVRYLLALHPDQLLAPYLREAGIEPKASSYGNWED--SGL 105
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF----- 219
GH GHYLSA + WA+T + LK ++ +++ L Q ++ GYL P+ Q
Sbjct: 106 DGHIGGHYLSALSLAWAATGDEELKRRLDYMLNELQRAQ-QVNDGYLGGIPNGQAMWQQI 164
Query: 220 -------DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNV 272
D F +L W P Y I KI GL D Y A + QA M + E+F N +
Sbjct: 165 HDGNIKADLF-SLNDRWVPLYNIDKIFHGLRDAYLIAGSEQAKTMLFGLGEWFLN----L 219
Query: 273 ITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDIS 332
+K S E+ L E GG+N V + TI D ++L LA F + L + D ++
Sbjct: 220 TSKLSDEQIQQMLYSEYGGLNAVFADMATIGNDKRYLKLARQFTHHSIVDPLLKKQDKLT 279
Query: 333 GFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRL 392
G HANT IP +IG E + D ++ +F V A GG S E + D K
Sbjct: 280 GLHANTQIPKIIGMLKVAETSDDEAWQQGADYFWQTVTKERSVAIGGNSVREHFHDKKDF 339
Query: 393 ASTL-GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
+ + E E+C TYNM+K+S+ LF T + Y +YYERA N +LS Q E G ++Y
Sbjct: 340 TAMVEDVEGPETCNTYNMMKLSKLLFLKTADTRYLEYYERATYNHILSSQH-PEHGGLVY 398
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
P+ G Y + + S WCC G+G+
Sbjct: 399 FTPMRPG-----HYRMYSSVQDSMWCCVGSGI 425
>gi|346226219|ref|ZP_08847361.1| Acetyl-CoA carboxylase, biotin carboxylase [Anaerophaga
thermohalophila DSM 12881]
Length = 795
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 125/379 (32%), Positives = 185/379 (48%), Gaps = 32/379 (8%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A+ N +Y++ D D L+ F AG Y WE + L GHF GHYL++ + M
Sbjct: 49 AEALNEQYVMAHDPDRLLAPFLIDAGLEPKAPGYGNWE--SSGLNGHFGGHYLTSLSLMI 106
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFE-----------ALKPVW 229
AST N +E++ ++ L+ CQ G+GY+ P Q E +L W
Sbjct: 107 ASTGNEEARERLNYMIDELARCQEANGNGYVGGVPGGQDMWAEIAKGNIDAGNFSLNGKW 166
Query: 230 APYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSL 285
P Y IHK+ AGL D + +A N +A +K+T W ++ + I + V H
Sbjct: 167 VPLYNIHKLYAGLRDAWLYAGNEKAREILIKLTDWCIDLTAALSDDQIQEMLVSEH---- 222
Query: 286 NEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIG 345
GG+N+V +Y IT D K+L LA F L L D ++G HANT IP VIG
Sbjct: 223 ----GGLNEVFADVYDITGDEKYLELARRFSHREILEPLLQHEDRLTGLHANTQIPKVIG 278
Query: 346 SQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT-ENEESC 404
E+T D + FF + V + GG S E + +S + + + E+C
Sbjct: 279 YMRIAELTHDSAWIDASDFFWNTVVNNRTITIGGNSTHEHFHPVDDFSSMIESRQGPETC 338
Query: 405 TTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKS 464
TYNMLK+S+HLF + ++ Y DYYE+AL N +LS Q G ++Y P+ + +
Sbjct: 339 NTYNMLKLSKHLFLYKNDLKYIDYYEQALYNHILSSQHPGHGG-LVYFTPM-----RPRH 392
Query: 465 YHGWGTRFSSFWCCYGTGL 483
Y + +FWCC G+G+
Sbjct: 393 YRVYSNPEETFWCCVGSGI 411
>gi|431795908|ref|YP_007222812.1| hypothetical protein Echvi_0518 [Echinicola vietnamensis DSM 17526]
gi|430786673|gb|AGA76802.1| hypothetical protein Echvi_0518 [Echinicola vietnamensis DSM 17526]
Length = 784
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 135/391 (34%), Positives = 191/391 (48%), Gaps = 29/391 (7%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L V+L PS AQQ ++ Y+ ++VD L+ + AG A Y WE+ L G
Sbjct: 33 LDQVRLSPSPF-LNAQQVDMTYMKAMEVDRLLAPYMLEAGVDWAADRYPNWEN--TGLDG 89
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS-----EQFDR 221
H GHYLSA A M+AST + +K +M +V L+ Q K G+GY+ P E+ +
Sbjct: 90 HIGGHYLSALAMMYASTGDAEMKRRMDYMVEQLAMAQAKNGNGYVGGIPGGMAMWEEIGQ 149
Query: 222 FE------ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITK 275
E +L W P Y IHKI AGL D Y N QA ++ + ++FY + +
Sbjct: 150 GEIDAGGFSLNQKWVPLYNIHKIYAGLRDAYLIGGNAQAKEVLLDLTDWFYELTKGLTD- 208
Query: 276 YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
E+ L E GG+N+V + IT + K+L LA L L Q D ++G H
Sbjct: 209 ---EQFQQMLVSEHGGLNEVFADVAAITGEAKYLELAKKMSHEWLLEPLEEQEDKLTGMH 265
Query: 336 ANTHIPVVIGSQMRYEVTGD-PLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAS 394
ANT IP VIG Q R GD ++ FF V + A GG S E + P+ S
Sbjct: 266 ANTQIPKVIGFQ-RVAQEGDLAEWQEAADFFWHTVVENRTVAIGGNSVREHFH-PEDDFS 323
Query: 395 TLGTENE--ESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYM 452
+ + N+ E+C TYNML++S LF + Y D++ER L N +LS Q E G +Y
Sbjct: 324 PMVSSNQGPETCNTYNMLRLSEQLFMSNPQAEYVDFFERGLYNHILSSQH-PEKGGFVYF 382
Query: 453 LPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
P+ + + Y + FWCC G+GL
Sbjct: 383 TPM-----RPEHYRVYSQPQQGFWCCVGSGL 408
>gi|332185145|ref|ZP_08386894.1| hypothetical protein SUS17_217 [Sphingomonas sp. S17]
gi|332014869|gb|EGI56925.1| hypothetical protein SUS17_217 [Sphingomonas sp. S17]
Length = 782
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 133/405 (32%), Positives = 192/405 (47%), Gaps = 34/405 (8%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWED 159
D + L V+L PS ++ A +TN YL LD D L+ +F+ AG Y GWE
Sbjct: 26 DKAEPFPLSAVRLRPS-IYATAVETNRRYLYRLDPDRLLHNFRLYAGLKPKAPIYGGWES 84
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF 219
T + GH +GHY+SA W T + ++ + +VS L+E Q K G+GY+ A ++
Sbjct: 85 DT--IAGHTLGHYMSALVLTWQQTGDTEMRRRADYIVSELAEAQAKRGTGYVGALGRKRA 142
Query: 220 DR---------------------FEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMT 258
D F+ L W+P YT+HK+ AGLLD + N QAL +
Sbjct: 143 DGTIVDGEEIFHEIMAGKIKSGGFD-LNGSWSPLYTVHKLFAGLLDIHGGWGNAQALDVA 201
Query: 259 KWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKP 318
+ YF V R + L E GG+N+ LY T D + L LA
Sbjct: 202 VKLGGYF----ARVFAALDDARLQDVLGCEYGGLNESFAELYQRTGDRQWLALAERIYDN 257
Query: 319 CFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATG 378
L L D ++ HANT +P +IG +E+T P FF + V H Y G
Sbjct: 258 KVLDPLVAGKDQLANLHANTQVPKLIGLARIHEITAAPAPAAGARFFWENVTGHHSYVIG 317
Query: 379 GTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVL 438
G + E++S+P +A + + E C +YNMLK++RHL+ W + DYYERA N V+
Sbjct: 318 GNADREYFSEPDTIARHITEQTCEHCNSYNMLKLTRHLYGWQPDGRLFDYYERAHLNHVM 377
Query: 439 SIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ Q G YM PL G ++ S + +FWCC G+G+
Sbjct: 378 AAQHPVHAG-FTYMTPLMTGMAREFST----DKDDAFWCCVGSGM 417
>gi|395803808|ref|ZP_10483051.1| hypothetical protein FF52_18073 [Flavobacterium sp. F52]
gi|395434079|gb|EJG00030.1| hypothetical protein FF52_18073 [Flavobacterium sp. F52]
Length = 760
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 129/398 (32%), Positives = 190/398 (47%), Gaps = 33/398 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
+K L +VKL AQ +L+Y+L LD D L+ + + P Y WE+
Sbjct: 22 MKLFDLSEVKLKDGPFK-NAQDVDLKYILALDPDKLLAPYLLESRLPPKADRYGNWEN-- 78
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF-- 219
L GH GHYLSA A M+ ST N LK+++ ++S L+ CQ K G+GY+ P +
Sbjct: 79 IGLDGHIGGHYLSALALMYKSTGNKELKDRLDYMLSELARCQAKNGNGYVGGIPQGKVFW 138
Query: 220 DRFE---------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFY 266
DR L W P Y IHK+ AGL D Y + + QA +K+ W +E
Sbjct: 139 DRIHKGDIDGSSFGLNNTWVPIYNIHKLFAGLTDAYQYTGSEQAKDIVIKLGDWFIE--- 195
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
+I S E+ L E GG+N+ LY IT+D K+L A L L
Sbjct: 196 -----LIRPLSDEQIQKVLATEHGGINESFADLYIITKDKKYLETAEKLSHKALLNPLLQ 250
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
+ D ++G HANT IP V+G + ++ + + FF + V A GG S E +
Sbjct: 251 KEDKLTGLHANTQIPKVVGFEKIAALSDNKEWSDGVQFFWNNVTQKRTVAFGGNSVAEHF 310
Query: 387 SDPKRLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
+ + + + E E+C +YNM ++++ LF ++ Y D+YER L N +LS Q E
Sbjct: 311 NPVNDFSGMVKSNEGPETCNSYNMERLAKALFLDKNDVHYLDFYERTLYNHILSSQH-PE 369
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P+ + Y + +S WCC GTGL
Sbjct: 370 KGGFVYFTPI-----RPNHYRVYSQPQTSMWCCVGTGL 402
>gi|325106128|ref|YP_004275782.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324974976|gb|ADY53960.1| protein of unknown function DUF1680 [Pedobacter saltans DSM 12145]
Length = 782
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 131/405 (32%), Positives = 190/405 (46%), Gaps = 33/405 (8%)
Query: 95 FKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAY 154
F + L+ L +VKL + A+Q +L+Y+L +D+D L+ + + AG K+Y
Sbjct: 20 FAQSNTTLQTFPLQEVKL-LDGIFKNAEQVDLKYILSMDMDKLLAPYLREAGLSEKAKSY 78
Query: 155 EGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAF 214
WE+ L GH GHYLSA + M+AST N + +++ +S L CQ+ G GYL
Sbjct: 79 GNWEN--SGLDGHIGGHYLSALSLMYASTKNPDINKRIDYYLSELKRCQDANGDGYLGGV 136
Query: 215 PSEQF-------DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTK 259
P + + +A L W P Y IHK+ AGL D + + N A +K+
Sbjct: 137 PDGKAMWRDISDGKIDAATFSLNKKWVPLYNIHKVFAGLYDAWVYTGNNTAKDMFIKLCD 196
Query: 260 WMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPC 319
W F N + I + L E GG+N+ Y +T K++ LA F
Sbjct: 197 WATTTFGNLNEQQIQQM--------LKSEHGGINESFADAYKLTGQQKYMDLALKFSHKA 248
Query: 320 FLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGG 379
L L Q D ++G HANT IP VIG + E+ + TFF D V A GG
Sbjct: 249 ILDPLRNQEDKLTGIHANTQIPKVIGFEKISEIEHKDDWHKAATFFWDNVVYKRTVAIGG 308
Query: 380 TSAGEFWSDPKRLASTL-GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVL 438
S E + + E E+C TYNM+K+S+ L+ + E Y DY E+AL N +L
Sbjct: 309 NSVREHFHPINNFMPMIEDIEGPETCNTYNMIKLSKALYNQSGETKYIDYIEKALYNHIL 368
Query: 439 SIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
S Q E G +Y P+ + Y + +S WCC G+GL
Sbjct: 369 SSQH-PEKGGFVYFTPM-----RPNHYRVYSQPETSMWCCVGSGL 407
>gi|406027774|ref|YP_006726606.1| hypothetical protein LBUCD034_2040 [Lactobacillus buchneri CD034]
gi|405126263|gb|AFS01024.1| hypothetical protein LBUCD034_2040 [Lactobacillus buchneri CD034]
Length = 803
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 138/409 (33%), Positives = 187/409 (45%), Gaps = 57/409 (13%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPT-CELRGHFVGHYLSASA 177
RAQQ ++YLL LD + +F + AG + G Y+GWE RGHF GHYLSA +
Sbjct: 19 RAQQMTVKYLLALDPKRFLVTFDQVAGIDSGGVTGYQGWERTDGLNFRGHFFGHYLSALS 78
Query: 178 HMWASTHNVTLKE----KMTAVVSALSECQ------NKMGSGYLSAFPSEQFDRFEALK- 226
+T + +++ K+ V+ L Q + +GY+SAF D E +
Sbjct: 79 QAILATEDNAIRQQLLDKLRLGVNGLQSAQAAYAKKHPESAGYVSAFREVALDEVEGREV 138
Query: 227 ------PVWAPYYTIHKILAGLLDQYTFADNT------QALKMTKWMVEYFYNRVQNVIT 274
V P+Y +HK+LAGLL N +ALK Y + R+ +
Sbjct: 139 PKDEKENVLVPWYNLHKVLAGLLAVNVNLQNIDPLLSEKALKSAHQFGLYVFKRINQLAD 198
Query: 275 KYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
+ L E GGMND LY L+ +T D + L A FD+ LA D ++G
Sbjct: 199 PTQM------LKIEYGGMNDALYELFDLTDDKRMLTAATYFDETTLFKQLAKGDDVLAGK 252
Query: 335 HANTHIPVVIGSQMRYEVTGD----------------PLYKVTGTFFMDIVNASHGYATG 378
HANT IP +IG+ RYE D +Y F IV H Y TG
Sbjct: 253 HANTTIPKLIGALHRYESLHDVKRADQYLSPEEKGSLNMYLKAAVNFWQIVIDDHTYVTG 312
Query: 379 GTSAGEFWSDPKRLASTL----GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALT 434
G S E + +P +L G E+C TYNMLK+SR LFR T + Y DYYE+ T
Sbjct: 313 GNSQSEHFHEPGQLFHDAVLEDGATTCETCNTYNMLKLSRELFRVTGDKKYLDYYEQTYT 372
Query: 435 NGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
N +L Q G+M Y P+ G +K + F FWCC GTG+
Sbjct: 373 NAILGSQ-NPNTGMMTYFQPMAAGYTKV-----YNRPFDEFWCCTGTGI 415
>gi|256378728|ref|YP_003102388.1| hypothetical protein Amir_4712 [Actinosynnema mirum DSM 43827]
gi|255923031|gb|ACU38542.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 881
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 142/423 (33%), Positives = 204/423 (48%), Gaps = 46/423 (10%)
Query: 97 LAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG 156
LA L+ L DV+L + RA L + VD ++ F+ AG T G G
Sbjct: 4 LAPSALEPFPLRDVEL-LDGVQSRAAGQMLHLARVFPVDRVLAVFRANAGLDTRGALPPG 62
Query: 157 -WED--------------------PTCEL-RGHFVGHYLSASAHMWASTHNVTLKEKMTA 194
WED PT L RGH+ GH+LS A AST +L+ K
Sbjct: 63 NWEDFGHPDERPWSAEEYPGAGVAPTASLLRGHYAGHFLSMVALAHASTGEESLRAKAWE 122
Query: 195 VVSALSECQNKMGS-------GYLSAFPSEQFDRFEALKP---VWAPYYTIHKILAGLLD 244
+V+ L+E ++ + + G+L+A+ QF R E L P +WAPYYT HKI+AGLLD
Sbjct: 123 IVAGLAEVRDALAATGRYSHPGFLAAYGEWQFSRLEDLAPYGEIWAPYYTCHKIMAGLLD 182
Query: 245 QYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWN-SLNEETGGMNDVLYRLYTIT 303
+ + QAL++ M + RV + + ++R W+ + E GGMN+ L L+ IT
Sbjct: 183 AHEHTGSEQALELAVGMGHWVAGRVLR-LERAHLQRMWSLYIAGEFGGMNESLAALHRIT 241
Query: 304 QDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGT 363
+ L A F+ L A D + G HAN H+P+++G +Y+ TG+ Y T
Sbjct: 242 GEEVFLRAAAAFELDHLLEGAAQGRDLLDGMHANQHLPMLVGHLDQYDATGETRYLDAVT 301
Query: 364 FFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEM 423
D V +A GGT GE W +A +G N ESC TYN+LK++R LF T +
Sbjct: 302 ALWDQVVPGRTFAHGGTGEGELWGPADTVAGFIGRRNAESCATYNLLKIARSLFARTGDA 361
Query: 424 VYADYYERALTNGVLSIQRGTEPGV---MIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYG 480
Y +Y ERA N ++ + + V ++YM P+ G + Y GT CC G
Sbjct: 362 RYPEYAERAWLNHMVGSRADLDSDVSPEVVYMYPVDAG--AVREYDNVGT------CCGG 413
Query: 481 TGL 483
TGL
Sbjct: 414 TGL 416
>gi|379726800|ref|YP_005318985.1| hypothetical protein MPD5_0184 [Melissococcus plutonius DAT561]
gi|376317703|dbj|BAL61490.1| hypothetical protein MPD5_0184 [Melissococcus plutonius DAT561]
Length = 883
Score = 194 bits (494), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 139/427 (32%), Positives = 199/427 (46%), Gaps = 59/427 (13%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG-SPTAGKAYEGWE-D 159
+K + + + +H +AQ+ + YLL LDV ++ F K AG P Y+GWE
Sbjct: 1 MKPIDTKAITIQDPYIH-KAQENVIHYLLSLDVQKFLFEFYKVAGMKPLTESGYQGWERS 59
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKM----TAVVSALSECQNKMG------SG 209
RGHF GH+LSA A + + LK+K+ ++ L Q +G
Sbjct: 60 DQVNFRGHFFGHFLSALALSYQAEKQPILKKKIHQQIKTAITGLKAVQKNYAKQHPEHAG 119
Query: 210 YLSAFPSEQFDRFEALKPV--------WAPYYTIHKILAGLLD------QYTFADNTQAL 255
Y+SAF D E KPV +Y +HKILAGLL+ + + +AL
Sbjct: 120 YISAFKEVALDEVEG-KPVDPKEKENVLVSWYNLHKILAGLLEVNISLKEVDSQLSKEAL 178
Query: 256 KMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLF 315
+ W +Y Y R+ N+ K + L E GGMND LY L+ +TQ +H + A F
Sbjct: 179 FIASWFGDYIYKRMMNLTDKNQM------LTIEYGGMNDALYCLFELTQKKEHAIAATYF 232
Query: 316 DKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEV-TGDPL--------------YKV 360
D+ LA + + G HANT IP +IG+ RY V + L Y
Sbjct: 233 DEDNLFNQLANDENVLPGKHANTTIPKLIGALKRYMVFQSEDLSAWLSNEEKEHLMSYFK 292
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRL----ASTLGTENEESCTTYNMLKVSRHL 416
F IV +H Y TGG S E + +P L G E+C T+NMLK++R L
Sbjct: 293 AAEKFWQIVVDNHTYCTGGNSQSEHFHEPNELFYDSEIRQGDCTCETCNTHNMLKLTRKL 352
Query: 417 FRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFW 476
+ TK Y DYYE N +L+ Q ++ G+M+Y P+G G +K + + FW
Sbjct: 353 YECTKNPKYLDYYETTYINAILASQ-NSKTGMMMYFQPMGAGYNKV-----YNRPYDEFW 406
Query: 477 CCYGTGL 483
CC GTG+
Sbjct: 407 CCSGTGI 413
>gi|297203356|ref|ZP_06920753.1| secreted protein [Streptomyces sviceus ATCC 29083]
gi|297148382|gb|EDY55480.2| secreted protein [Streptomyces sviceus ATCC 29083]
Length = 723
Score = 194 bits (494), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 134/380 (35%), Positives = 182/380 (47%), Gaps = 26/380 (6%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWEDPTCELRGHFVGHYLSASAHMW 180
Q YL +DVD L+++F+ T G A GW+ P R H GH+L+A A ++
Sbjct: 21 QNRTQNYLRFVDVDRLLYNFRANHRLSTNGAVATGGWDAPDFPFRTHVQGHFLTAWAQLY 80
Query: 181 ASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQFDRFE--ALKPVWAPYY 233
A + + ++K T +V+ L++CQ +GYLS +P F E L PYY
Sbjct: 81 AVSGDTVCRDKATYMVAELAKCQANNSAAGFSAGYLSGYPESDFTALEQRTLSNGNVPYY 140
Query: 234 TIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
TIHK LAGLLD + +TQA L + W V++ R+ S ++ L E
Sbjct: 141 TIHKTLAGLLDVWRHIGSTQARDVLLALAGW-VDWRTGRL-------SGQQMQTMLQTEF 192
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GGMN VL LY T D + L A FD LA D +SG HANT +P IG+
Sbjct: 193 GGMNTVLTDLYQQTGDARWLTAARRFDHAAVFDPLASGQDQLSGLHANTQVPKWIGAARE 252
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNM 409
Y+ TG Y+ T + +H YA GG S E + P +A L + ESC T NM
Sbjct: 253 YKATGTTRYRDIATNAWNFTVNAHTYAIGGNSQAEHFRAPNAIAGYLNKDTCESCNTVNM 312
Query: 410 LKVSRHLFRWT-KEMVYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGDSKAK 463
L ++R LF DYYE+A N ++ Q + G + Y PL RG A
Sbjct: 313 LTLTRELFALDPNRAALFDYYEQAWLNQMIGQQNPADGHGHVTYFTPLNPGGRRGVGPAW 372
Query: 464 SYHGWGTRFSSFWCCYGTGL 483
W T + +FWCC GTGL
Sbjct: 373 GGGTWSTDYGTFWCCQGTGL 392
>gi|410638732|ref|ZP_11349285.1| acetyl-CoA carboxylase, biotin carboxylase [Glaciecola lipolytica
E3]
gi|410141260|dbj|GAC16490.1| acetyl-CoA carboxylase, biotin carboxylase [Glaciecola lipolytica
E3]
Length = 818
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 128/380 (33%), Positives = 185/380 (48%), Gaps = 35/380 (9%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQQTN+ YLL + D L+ + + AG +Y WE+ L GH GHYLSA + W
Sbjct: 67 AQQTNVGYLLAIQPDKLLAPYLREAGLEPKVDSYGNWEN--TGLDGHIGGHYLSALSLAW 124
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF------------DRFEALKPV 228
A+T + LK ++ +++ L + QN G GYL P+ + D F +L
Sbjct: 125 AATQDTELKRRLDYMLNELQKAQNANG-GYLGGIPNGKVMWDEIKQGNIKADLF-SLNDR 182
Query: 229 WAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNS 284
W P Y I KI GL D Y A++ QA L + +WM++ V S E+
Sbjct: 183 WVPLYNIDKIFHGLRDAYLIANSEQAKTMLLSLGQWMLD--------VTNNLSDEQIQQM 234
Query: 285 LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVI 344
L E GG+N+V + TI+ D +L LA F + L D+++G HANT IP +I
Sbjct: 235 LYSEHGGLNEVFADMSTISGDKAYLELARKFSHKRIIDPLVAHKDELNGLHANTQIPKII 294
Query: 345 GSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL-GTENEES 403
G+ ++ D +K FF + V A GG S E + D + + E E+
Sbjct: 295 GALKVAQLNNDESWKEAARFFWETVTKQRSVAIGGNSVREHFHDAADFSPMVEDPEGPET 354
Query: 404 CTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAK 463
C TYNM+K+S+ LF T + Y DYYERA N +LS Q E G ++Y + G
Sbjct: 355 CNTYNMIKLSKLLFLQTADTRYLDYYERATYNHILSSQH-PEHGGLVYFTSMRPG----- 408
Query: 464 SYHGWGTRFSSFWCCYGTGL 483
Y + + S WCC G+G+
Sbjct: 409 HYRMYSSVQDSMWCCVGSGI 428
>gi|413954825|gb|AFW87474.1| hypothetical protein ZEAMMB73_309562 [Zea mays]
Length = 483
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 96/118 (81%), Positives = 103/118 (87%)
Query: 366 MDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVY 425
MD VN+SH YATGGTS EFWS+PKRLA L TE EESCTTYNMLKVSRHLFRWTKE+ Y
Sbjct: 1 MDTVNSSHAYATGGTSVSEFWSNPKRLAEALTTETEESCTTYNMLKVSRHLFRWTKEIAY 60
Query: 426 ADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
ADYYERAL NGVLSIQRG +PGVMIYMLP G G SKAKSYHGWGT++ SFWCCYGTG+
Sbjct: 61 ADYYERALINGVLSIQRGRDPGVMIYMLPQGPGRSKAKSYHGWGTQYESFWCCYGTGI 118
>gi|313204495|ref|YP_004043152.1| hypothetical protein Palpr_2030 [Paludibacter propionicigenes WB4]
gi|312443811|gb|ADQ80167.1| protein of unknown function DUF1680 [Paludibacter propionicigenes
WB4]
Length = 788
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 131/389 (33%), Positives = 194/389 (49%), Gaps = 25/389 (6%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
+ DV+L S A+ ++ YLL LD D L+ + K G + Y WE+ L G
Sbjct: 36 VSDVRLTESPFK-HAEDMDINYLLGLDADRLMAPYLKGGGLTPKAENYPNWEN--TGLDG 92
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--FDRFE- 223
H GHYLSA ++M+A+T N +KE++ ++ L Q+ G GYL P+ + +D +
Sbjct: 93 HIGGHYLSALSYMYAATGNTRIKERLDYSLNELKRAQDAAGDGYLGGTPNGRKIWDEIKK 152
Query: 224 --------ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITK 275
L W P Y IHK AGL D Y + A M + ++ YN V +T
Sbjct: 153 GTINASSFGLNGGWVPLYNIHKTYAGLRDAYLQGGSLLAKDMLIKLTDWMYNTVSG-LTD 211
Query: 276 YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
V+ L E GG+N+V + +IT + K+L LAH F L LL D ++G H
Sbjct: 212 AQVQEM---LKSEHGGLNEVFADVASITGNKKYLELAHKFSHQTLLQLLLQHQDKLTGMH 268
Query: 336 ANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAST 395
ANT IP VIG + ++ G+ + +FF V + + GG S E + S
Sbjct: 269 ANTQIPKVIGFKRIADLEGNKDWSDAASFFWKTVVDNRSVSIGGNSVREHFHPSDNFTSM 328
Query: 396 LGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLP 454
+E E+C TYNML++++ LF+ + E + DYYERAL N +LS Q + G +Y P
Sbjct: 329 FESEQGPETCNTYNMLRLTKLLFQTSGEASFMDYYERALYNHILSTQDPIQGG-FVYFTP 387
Query: 455 LGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ +A Y + +SFWCC G+GL
Sbjct: 388 M-----RAGHYRVYSQPQTSFWCCVGSGL 411
>gi|315499577|ref|YP_004088380.1| hypothetical protein Astex_2584 [Asticcacaulis excentricus CB 48]
gi|315417589|gb|ADU14229.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
48]
Length = 791
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 130/404 (32%), Positives = 197/404 (48%), Gaps = 33/404 (8%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWED 159
D + + L DV+L PS A N YLL ++ D L+ +++K AG + Y GWE
Sbjct: 36 DSVTSLPLSDVRLLPSPFK-TAVDVNEAYLLSVNPDRLLHNYRKFAGLTPKAELYGGWER 94
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP---- 215
T + GH +GHYLSA + M A T N LK + ++ L+ Q G GY++ F
Sbjct: 95 DT--IAGHSLGHYLSAISLMHAQTGNAALKLRAAYIIDELALVQGAHGDGYVAGFTRKRK 152
Query: 216 -------SEQFDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTK 259
E F A L W P Y HK+ +GL D TF +AL +
Sbjct: 153 DGRVVDGKEIFPELMAGDIRSAGFDLNGCWVPLYNWHKLYSGLFDAQTFCGYDKALTVAV 212
Query: 260 WMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPC 319
+ Y ++V +T V+ LN E GG+ND LY T++P+ L LA
Sbjct: 213 GLGVYI-DKVFRALTDDQVQ---TVLNCEFGGLNDSFAELYRRTENPRWLALAQRLHHKR 268
Query: 320 FLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGG 379
+ L D ++ HANT +P ++G +EVTG+ + +FF + V H Y GG
Sbjct: 269 IIDPLTAGEDKLANNHANTQVPKLLGEATLFEVTGNENNRKAASFFWERVVNHHSYVIGG 328
Query: 380 TSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLS 439
+ E++ +P ++ + E C TYNMLK++RHL+ W + Y DY+ERA N VL+
Sbjct: 329 NADREYFFEPDTISKHITEATCEHCNTYNMLKLTRHLYGWEPDARYFDYFERAHFNHVLA 388
Query: 440 IQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Q+ + G+ YM PL G ++ G+ ++ CC+G+G+
Sbjct: 389 -QQNPKTGMFSYMTPLFTGAAR-----GFSDPVDNWTCCHGSGM 426
>gi|255936447|ref|XP_002559250.1| Pc13g08250 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583870|emb|CAP91894.1| Pc13g08250 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 627
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 127/376 (33%), Positives = 183/376 (48%), Gaps = 18/376 (4%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAG-SPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
Q L+YL +DVD L++ F+ T G S GW+ P R H GH+LSA A +
Sbjct: 58 QDRTLKYLKEIDVDRLLYVFRATHGLSTQQATPNGGWDAPDFPFRSHVQGHFLSAWAQCY 117
Query: 181 ASTHNVTLKEKMTAVVSALSECQ--NK---MGSGYLSAFPSEQFDRFE--ALKPVWAPYY 233
A + T ++ + L++CQ NK GY+S FP +F + E L PYY
Sbjct: 118 AVLRDQTCYDRAIYFAAELAKCQANNKAVGFTDGYVSGFPESEFAKLENDTLTNGNVPYY 177
Query: 234 TIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMN 293
+HK LAGLLD + ++T + + + + R + +S L E GGMN
Sbjct: 178 AVHKTLAGLLDIWRLTNDTTSRDILLSLASWVDKRTE----PFSYAAMQKLLQTEFGGMN 233
Query: 294 DVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVT 353
+V+ +Y T D + L +A FD LA D++ G HANT +P IG+ +Y+ T
Sbjct: 234 EVMADIYHQTGDERWLTVAQRFDHAVIFDPLAANKDELDGLHANTQVPKWIGAARQYKAT 293
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVS 413
G+ Y +I SH YA GG S E + P +A+ L + E+C +YNMLK++
Sbjct: 294 GESRYLDIARNAWEINVKSHTYAIGGNSQAEHFRAPNAIAAYLTNDTCEACNSYNMLKLT 353
Query: 414 RHLFRW-TKEMVYADYYERALTNGVLSIQRGTE-PGVMIYMLPLG----RGDSKAKSYHG 467
R L+ + Y D+YE +L N +L Q + G + Y PL RG A
Sbjct: 354 RELWLLDSDNSAYFDFYENSLLNHLLGQQDPHDHHGHITYFTPLNAGGRRGVGPAWGGGT 413
Query: 468 WGTRFSSFWCCYGTGL 483
W T + SFWCC GT L
Sbjct: 414 WSTDYDSFWCCQGTAL 429
>gi|268609237|ref|ZP_06142964.1| hypothetical protein RflaF_07037 [Ruminococcus flavefaciens FD-1]
Length = 1082
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 139/417 (33%), Positives = 202/417 (48%), Gaps = 46/417 (11%)
Query: 99 GDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGW 157
G + + S+ DVK+ A + ++YLL D + L+ F++ AG T G K Y GW
Sbjct: 37 GSRISDFSISDVKM-TDDYCTNAFEKEMKYLLSFDTERLLAGFRENAGLSTNGAKRYGGW 95
Query: 158 EDPTCELRGHFVGHYLSASAHMWASTHNVT------LKEKMTAVVSALSECQN--KMGSG 209
E+ + GH VGHYL+A A + + NVT L ++M ++ + CQ + G
Sbjct: 96 EN--TNIAGHCVGHYLTALAQAYQNP-NVTSDQKDALYKRMKTLIDGMQACQQHPRGKKG 152
Query: 210 YLSAFP-------SEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKM 257
+L A P QFDR E K W P+YT+HK++AG++D Y A +
Sbjct: 153 FLWAAPVPSDGNVERQFDRVEIGKANIFDDAWVPWYTMHKLIAGIVDVYNATQYAPAKDV 212
Query: 258 TKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDK 317
+ ++ YNR + +S + L+ E GGMND +Y LY IT H AH+FD+
Sbjct: 213 GSALGDWVYNRC----SGWSQQTRNTVLSIEYGGMNDCMYDLYRITGKDSHAAAAHVFDE 268
Query: 318 PCFLGLLAVQADDI-SGFHANTHIPVVIGSQMRY------EVTGDPL----YKVTGTFFM 366
++ D+ +G HANT IP IG+ RY V G + Y F
Sbjct: 269 DALFQKVSNGGRDVLNGRHANTTIPKFIGALKRYMVLDGKTVNGQKVDASAYLKYAENFW 328
Query: 367 DIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYA 426
D+V H Y TGG S E + L + N E+C +YNMLK+SR LF+ T + Y
Sbjct: 329 DMVTTHHTYITGGNSEWEHFGKDDILDAERTNCNCETCNSYNMLKLSRELFKITHDSKYM 388
Query: 427 DYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
D+YE N +LS Q E G+ Y P+ G K + T++ FWCC G+G+
Sbjct: 389 DFYENTYYNSILSSQN-PETGMTTYFQPMATGYFKV-----YSTQWDKFWCCTGSGM 439
>gi|409196987|ref|ZP_11225650.1| Acetyl-CoA carboxylase, biotin carboxylase [Marinilabilia
salmonicolor JCM 21150]
Length = 788
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 122/375 (32%), Positives = 186/375 (49%), Gaps = 24/375 (6%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A+Q N +Y+ D D L+ F AG Y WE L GH GHYL++ A M
Sbjct: 43 AEQLNEKYVFAHDPDRLLAPFLIDAGLEPKAPGYGNWE--GSGLNGHIGGHYLTSLALMV 100
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFE-----------ALKPVW 229
AST N +E++ ++ L+ CQ G+GY+ P Q E +L W
Sbjct: 101 ASTGNEEAQERLDYMIEELARCQEANGNGYVGGIPGGQPMWAEIAKGNIDAGGFSLNGKW 160
Query: 230 APYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
P Y IHK+ AGL D + +A +AL++ + ++F + V + S E+ L E
Sbjct: 161 VPLYNIHKLFAGLHDAWKYAGKEKALEILIQLTDWFID----VNSGLSDEQIQEILVSEH 216
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GG+N+V +Y IT + K+L LA + L L D ++G HANT IP V+G
Sbjct: 217 GGLNEVFADVYDITGEDKYLTLARQYSHRSILEPLLNHEDKLTGLHANTQIPKVVGFMRV 276
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT-ENEESCTTYN 408
E+ GD + FF + V ++ GG S E + +S + + + E+C TYN
Sbjct: 277 GELAGDSAWIDASDFFWNTVVSNRTITIGGNSTHEHFHPVDDFSSMVESRQGPETCNTYN 336
Query: 409 MLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGW 468
MLK+S+ L+ + ++ Y DYYE+AL N +LS Q E G ++Y P+ + + Y +
Sbjct: 337 MLKLSKQLYLYKNDLRYVDYYEQALYNHILSSQH-PEHGGLVYFTPM-----RPQHYRVY 390
Query: 469 GTRFSSFWCCYGTGL 483
+FWCC G+G+
Sbjct: 391 SNPEETFWCCVGSGI 405
>gi|317476834|ref|ZP_07936077.1| hypothetical protein HMPREF1016_03061 [Bacteroides eggerthii
1_2_48FAA]
gi|316907009|gb|EFV28720.1| hypothetical protein HMPREF1016_03061 [Bacteroides eggerthii
1_2_48FAA]
Length = 781
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 125/393 (31%), Positives = 195/393 (49%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L D+KL S +AQQT+L Y++ ++ D L+ F + AG +Y WE+ L G
Sbjct: 30 LQDIKLLESPF-LQAQQTDLHYIMAMNPDRLLAPFLREAGLAPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS---------E 217
H GHY+SA + M+A+T + T+ ++ +++ L Q +G+G++ P E
Sbjct: 87 HIGGHYISALSMMYAATGDTTVYNRLNYMLNELHRAQQAVGNGFIGGTPGSLQLWKEIKE 146
Query: 218 QFDRFEA--LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
R E+ L W P Y IHK AGL D Y +A + A +M T WM
Sbjct: 147 GNIRPESFSLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIALTDWMA--------G 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + + ++ + L E GG+N++ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLTEQQMQDMLRSEHGGLNEIFADVADITGDKKYLELARRFSHKTLLEPLIGGEDHL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++T + + FF + V GG S E +
Sbjct: 259 TGMHANTQIPKVIGYKRIADLTQNDAWDQAARFFWNTVVNHRSVCIGGNSVREHFHPADN 318
Query: 392 LASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
S L + E+C TYNML++++ LF+ + ++ +ADYYERAL N +L+ Q+ + G +
Sbjct: 319 FTSMLNDVQGPETCNTYNMLRLTKMLFQTSPDIRFADYYERALYNHILASQQPAK-GGFV 377
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ G Y + +S WCC G+GL
Sbjct: 378 YFTPMRSG-----HYRVYSQPETSMWCCVGSGL 405
>gi|404450474|ref|ZP_11015456.1| hypothetical protein A33Q_14151 [Indibacter alkaliphilus LW1]
gi|403763872|gb|EJZ24792.1| hypothetical protein A33Q_14151 [Indibacter alkaliphilus LW1]
Length = 782
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 129/400 (32%), Positives = 194/400 (48%), Gaps = 25/400 (6%)
Query: 96 KLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYE 155
K GD ++ L VKL S RAQ+ + +Y+L +DVD L+ + K AG + Y
Sbjct: 22 KAQGDQVQFFDLRQVKLKDSPFK-RAQEVDKKYILEMDVDRLLAPYMKEAGLTWSADNYG 80
Query: 156 GWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP 215
WE+ L GH GHYLSA + M+AST + + +++ ++ L Q++ G GYLS P
Sbjct: 81 NWEN--TGLDGHIGGHYLSALSLMFASTGDPEINKRLDYMLEQLKHAQDQSGDGYLSGVP 138
Query: 216 ----------SEQFDRFE-ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEY 264
S + + +L W P Y IHKI AGL D Y A M + ++
Sbjct: 139 YGRKIWNELKSGKINAGNFSLNDRWVPLYNIHKIFAGLRDAYWIGGKEIAKPMLVSLSDW 198
Query: 265 FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
F + + ++ ++ L E GG+N+V + +T D K+L LA L L
Sbjct: 199 FLD----LTDGFTEDQFQEMLISEHGGLNEVFADVAVMTGDSKYLSLAKKMSHNAILQPL 254
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+ D+++G HANT IP VIG Q +V+ D FF V + GG S E
Sbjct: 255 KEEKDELNGLHANTQIPKVIGFQRIAQVSKDQNLHQASDFFWKNVVYQRSVSIGGNSVRE 314
Query: 385 FWSDPKRLASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
+ +S L +E E+C TYNM+++S LF+ + Y DYYERA+ N +LS Q
Sbjct: 315 HFHPTSDFSSMLSSEQGPETCNTYNMMRLSEMLFQLAPDRKYIDYYERAVFNHILSTQHP 374
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G +Y + + + Y + +FWCC G+GL
Sbjct: 375 KKGG-FVYFTSM-----RPQHYRVYSQPHENFWCCVGSGL 408
>gi|295132897|ref|YP_003583573.1| glycosyl hydrolase [Zunongwangia profunda SM-A87]
gi|294980912|gb|ADF51377.1| putative glycosyl hydrolase [Zunongwangia profunda SM-A87]
Length = 797
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 132/397 (33%), Positives = 193/397 (48%), Gaps = 35/397 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L+ V+L D K A+ N+ LL DVD L+ ++K AG +Y WE
Sbjct: 36 LENVTLLDGKFK------NARDLNMSVLLQYDVDRLLAPYRKEAGLEPRKPSYPNWEG-- 87
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQ-------NKMGSGYLSAF 214
L GH GHYLSA A +A+T N +M ++ L ECQ + G GY+ F
Sbjct: 88 --LDGHIGGHYLSALAMNYAATDNQEFLARMNYMLKELRECQLANTKKHPEWGVGYVGGF 145
Query: 215 PSEQ-----FDR--FEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYN 267
P+ + F + FE WAP+Y +HK+ AGL D + +AD+ +A +M ++
Sbjct: 146 PNSEALWSSFKKGNFEKYNSAWAPFYNLHKMYAGLRDAWLYADSEKAKEMFLDFCDWGIT 205
Query: 268 RVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQ 327
+++ S E+ + LN E GGM +V Y IT + K+L A + L L+
Sbjct: 206 LTKDL----SHEQMQSVLNMEHGGMPEVYADAYQITGEKKYLEAAKRYSHEQVLHPLSKG 261
Query: 328 ADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE-FW 386
D++ HANT IP +G + EV GD + G++F + V + A GG S E F
Sbjct: 262 IDNLDNKHANTQIPKFVGFERIAEVDGDEKFAKAGSYFWETVTKNRSLAFGGNSRKEHFP 321
Query: 387 SDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP 446
S + + ESC +YNMLK++ LFR E YADYYER L N +LS Q +
Sbjct: 322 STSASIDYINEDDGPESCNSYNMLKLTEDLFRVNPEAKYADYYERTLYNHILSTQH-PQH 380
Query: 447 GVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P ++ + Y + + WCC GTG+
Sbjct: 381 GGYVYFTP-----ARPRHYRIYSAPEEAMWCCVGTGM 412
>gi|333378944|ref|ZP_08470671.1| hypothetical protein HMPREF9456_02266 [Dysgonomonas mossii DSM
22836]
gi|332885756|gb|EGK06002.1| hypothetical protein HMPREF9456_02266 [Dysgonomonas mossii DSM
22836]
Length = 787
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 134/399 (33%), Positives = 200/399 (50%), Gaps = 35/399 (8%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
+K L D+ L S RAQ + +YLL LD D L+ F + AG ++Y WE+
Sbjct: 26 IKYFDLKDITLLDSPFK-RAQDLDKKYLLDLDADRLLAPFIREAGLQKKAESYTNWEN-- 82
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ--F 219
L GH GHY+SA A M+AST + +K+++ ++S L CQ++ G+GY+ P + +
Sbjct: 83 TGLDGHIGGHYVSALALMYASTGDQQIKDRLDYMISELKRCQDENGNGYIGGVPGGKAIW 142
Query: 220 DRFE---------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFY 266
D L W P Y IHK AGL D Y A N A +KMT W V+
Sbjct: 143 DEIAKGDIQASGFGLNNRWVPLYNIHKTYAGLRDAYLIAGNETAKDMLIKMTDWAVK--- 199
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
+++ S E+ + L E GG+N+ + ITQ+ K+L LAH F L L
Sbjct: 200 -----LVSNLSEEQIQDMLRSEHGGLNETFADVAVITQNEKYLKLAHQFSHQLILNPLLA 254
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
D ++G HANT IP V+G + ++ G+ + FF + V GG S E +
Sbjct: 255 HEDKLTGLHANTQIPKVLGFKRIADIEGNESWSEASRFFWETVVEHRSVCIGGNSVREHF 314
Query: 387 SDPKRLASTLGTENE--ESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT 444
P S++ T NE E+C TYNML++S+ ++ + + Y DYYE+AL N +LS Q
Sbjct: 315 H-PTNDFSSMITSNEGPETCNTYNMLRLSKMFYQTSLDKKYIDYYEKALYNHILSSQ-NP 372
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G ++Y + G Y + +S WCC G+G+
Sbjct: 373 QTGGLVYFTQMRPG-----HYRVYSQPQTSMWCCVGSGI 406
>gi|331702303|ref|YP_004399262.1| hypothetical protein Lbuc_1953 [Lactobacillus buchneri NRRL
B-30929]
gi|329129646|gb|AEB74199.1| protein of unknown function DUF1680 [Lactobacillus buchneri NRRL
B-30929]
Length = 803
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 139/416 (33%), Positives = 189/416 (45%), Gaps = 59/416 (14%)
Query: 113 DPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPT-CELRGHFVG 170
DP H AQQ ++YLL LD + +F + AG + G Y+GWE RGHF G
Sbjct: 14 DPEIEH--AQQMTVKYLLALDPKRFLVTFDEVAGIDSGGVTGYQGWERTDGLNFRGHFFG 71
Query: 171 HYLSASAHMWASTHNVTLKE----KMTAVVSALSECQNKMG------SGYLSAFPSEQFD 220
HYLSA + +T +++ K+ V+ L Q +GY+SAF D
Sbjct: 72 HYLSALSQAILATEENDIRQQLLDKLRLGVNGLQSAQAAYAKSHPDSAGYVSAFREVALD 131
Query: 221 RFEALK-------PVWAPYYTIHKILAGLLDQYTFAD------NTQALKMTKWMVEYFYN 267
E + V P+Y +HK+LAGLL + +ALK+ Y +
Sbjct: 132 EVEGREVPKDEKENVLVPWYNLHKVLAGLLAVKVNLQGIDPLLSEKALKIAHQFGIYVFK 191
Query: 268 RVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQ 327
R+ + + L E GGMND LY L+ +T D + L A FD+ LA
Sbjct: 192 RLNQLADPTQM------LKIEYGGMNDALYELFDLTDDKRMLTAATYFDETALFKQLAEG 245
Query: 328 ADDISGFHANTHIPVVIGSQMRYEVTGD----------------PLYKVTGTFFMDIVNA 371
D ++G HANT IP +IG+ RYE D +Y F IV
Sbjct: 246 DDVLAGKHANTTIPKLIGALHRYESLHDVKRADQYLSPEEKGSLNMYLKAAVNFWQIVVD 305
Query: 372 SHGYATGGTSAGEFWSDPKRLASTLGTENE----ESCTTYNMLKVSRHLFRWTKEMVYAD 427
H Y TGG S E + +P +L E+ E+C TYNMLK+SR LFR T + Y D
Sbjct: 306 DHTYVTGGNSQSEHFHEPGQLFHDAVLEDGATTCETCNTYNMLKLSRELFRVTGDKKYLD 365
Query: 428 YYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
YYE+ TN +L Q G+M Y P+ G +K + F FWCC GTG+
Sbjct: 366 YYEQTYTNAILGSQ-NPNTGMMTYFQPMAAGYTKV-----YNRPFDEFWCCTGTGI 415
>gi|120435050|ref|YP_860736.1| hypothetical protein GFO_0692 [Gramella forsetii KT0803]
gi|117577200|emb|CAL65669.1| conserved hypothetical protein, membrane or secreted [Gramella
forsetii KT0803]
Length = 796
Score = 191 bits (485), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 139/399 (34%), Positives = 191/399 (47%), Gaps = 36/399 (9%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
LK DV+L S A +LEY+L LD D L+ F K AG T ++Y WE+
Sbjct: 34 LKLFPHEDVQLLDSPFR-DAMLVDLEYILKLDPDRLLAPFLKEAGLETKVESYPNWEN-- 90
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS----- 216
L GH GHYL+A + M+A+T N + E++ ++ L + Q + GY+ P
Sbjct: 91 TGLDGHIGGHYLTALSLMYAATGNQEVLERLNYMLDELQKVQ-QANVGYIGGVPDSKELW 149
Query: 217 EQFDRFE------ALKPVWAPYYTIHKILAGLLDQYTFAD----NTQALKMTKWMVEYFY 266
+Q +L W P Y IHK AGL D Y A T + ++ WM+E
Sbjct: 150 QQISEGNINAGSFSLNDRWVPLYNIHKTYAGLRDAYQIAGIERAKTMLIDLSDWMLE--- 206
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
V + S E+ L E GG+N+ +Y IT + K+L LA+ F + L L
Sbjct: 207 -----VTSDLSEEQIQELLISEYGGLNETFADVYEITGEKKYLDLAYAFSQKELLKPLED 261
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
D ++G HANT IP VIG Q + + Y+ +FF D V A GG S E +
Sbjct: 262 DQDVLTGMHANTQIPKVIGFQTIAALNDNREYRDAASFFWDNVVNERSVAIGGNSVREHF 321
Query: 387 SDPKRLASTL--GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT 444
PK ST+ + E+C TYNMLK+S LF Y DYYE+AL N +LS Q
Sbjct: 322 H-PKDDFSTMMSSVQGPETCNTYNMLKLSEKLFLTEANEKYVDYYEQALYNHILSSQH-P 379
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E G +Y P+ G Y + +SFWCC G+GL
Sbjct: 380 EKGGFVYFTPMRPG-----HYRVYSQPETSFWCCVGSGL 413
>gi|94494954|ref|ZP_01301535.1| hypothetical protein SKA58_00635 [Sphingomonas sp. SKA58]
gi|94425220|gb|EAT10240.1| hypothetical protein SKA58_00635 [Sphingomonas sp. SKA58]
Length = 665
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 138/406 (33%), Positives = 192/406 (47%), Gaps = 51/406 (12%)
Query: 102 LKEVSLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWE-D 159
LK + DV LD LH AQ+ YLL L D ++ +F+ AG Y GWE +
Sbjct: 64 LKPFDMADVTLDDGPFLH--AQRMTETYLLRLQPDRMLHNFRINAGLKPKAPVYGGWESE 121
Query: 160 PT---CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS 216
PT GH +GHYLSA A + ST + K+++ + S L+ CQ SG + AFP
Sbjct: 122 PTWAEINCHGHTLGHYLSACALAYRSTRDRRFKQRLDYIASELAACQKAAHSGLICAFPD 181
Query: 217 EQFDRFEAL--KPVWA-PYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRV 269
+ +P+ P+YT+HKI AGL D AD+ +A L++ W V
Sbjct: 182 GPALVAAHINGEPITGVPWYTLHKIYAGLRDAALLADSREAREVLLRLADWGV------- 234
Query: 270 QNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQA 328
V T+ + + + L E GGMN++ LY +T ++ LA F + L
Sbjct: 235 --VATRPLSDAQFEAMLATEHGGMNEIYADLYAMTGKEEYRTLARRFSHKAVMEPLVAGK 292
Query: 329 DDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE-FWS 387
D + G HANT +P ++G Q YE TGD Y FF V + +ATGG E F++
Sbjct: 293 DLLDGMHANTQVPKIVGFQRVYEETGDDRYAKAADFFFRTVAHTRSFATGGHGDNEHFFA 352
Query: 388 DPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ------ 441
+ + E+C +NMLK++R LF + YADYYER L NG+L+ Q
Sbjct: 353 MADFESHVFSAKGSETCCQHNMLKLARLLFMQDPQADYADYYERTLYNGILASQDPDSGM 412
Query: 442 ----RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+G PG M K YH T SFWCC GTG+
Sbjct: 413 ATYFQGARPGYM-------------KLYH---TPEDSFWCCTGTGM 442
>gi|332882274|ref|ZP_08449902.1| hypothetical protein HMPREF9074_05700 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|332679658|gb|EGJ52627.1| hypothetical protein HMPREF9074_05700 [Capnocytophaga sp. oral
taxon 329 str. F0087]
Length = 786
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 128/393 (32%), Positives = 195/393 (49%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L+DV+L S A+ ++ YLL LD D L+ + K AG Y WE+ L G
Sbjct: 32 LNDVRLTQSPFK-HAEDLDVRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWEN--TGLDG 88
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS-----EQFDR 221
H GHY+SA ++M+A+T + +K+++ ++S L Q+ G GYL P+ E +
Sbjct: 89 HIGGHYVSALSYMYAATGDEEIKQRLDYMLSELKRAQDAAGDGYLCGAPNGRKIWEAVSK 148
Query: 222 FE------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
+ L W P Y IHK AGL D Y A + +A +K+T WM+ N
Sbjct: 149 GDIQASSFGLNGGWVPLYNIHKTYAGLRDAYLLAGSKEARDMLVKLTDWMM--------N 200
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ S E+ + L E GG+N+V + +T +L LA F L L D +
Sbjct: 201 LTKDLSDEQIQDMLRSEHGGLNEVFADVADLTGKDNYLQLARRFSHREILDPLLEHEDRL 260
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ GD + FF + V + GG S E + +
Sbjct: 261 TGKHANTQIPKVIGYKRIADLQGDEGWDDAARFFWETVVERRSISIGGNSVREHFHPSED 320
Query: 392 LASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S L +E E+C TYNML++++ L++ + ++ Y DYYERAL N +LS + G +
Sbjct: 321 FSSMLTSEQGPETCNTYNMLRLTKMLYQTSADVHYMDYYERALYNHILSTIDPVQGG-FV 379
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ G Y + +SFWCC G+G+
Sbjct: 380 YFTPMRSG-----HYRVYSQPQTSFWCCVGSGM 407
>gi|357046482|ref|ZP_09108109.1| hypothetical protein HMPREF9441_02134 [Paraprevotella clara YIT
11840]
gi|355530721|gb|EHH00127.1| hypothetical protein HMPREF9441_02134 [Paraprevotella clara YIT
11840]
Length = 762
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 128/393 (32%), Positives = 195/393 (49%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L+DV+L S A+ ++ YLL LD D L+ + K AG Y WE+ L G
Sbjct: 8 LNDVRLTQSPFK-HAEDLDVRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWEN--TGLDG 64
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS-----EQFDR 221
H GHY+SA ++M+A+T + +K+++ ++S L Q+ G GYL P+ E +
Sbjct: 65 HIGGHYVSALSYMYAATGDEEIKQRLDYMLSELKRAQDAAGDGYLCGAPNGRKIWEAVSK 124
Query: 222 FE------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQN 271
+ L W P Y IHK AGL D Y A + +A +K+T WM+ N
Sbjct: 125 GDIQASSFGLNGGWVPLYNIHKTYAGLRDAYLLAGSKEARDMLVKLTDWMM--------N 176
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ S E+ + L E GG+N+V + +T +L LA F L L D +
Sbjct: 177 LTKDLSDEQIQDMLRSEHGGLNEVFADVADLTGKDNYLQLARRFSHREILDPLLEHEDRL 236
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++ GD + FF + V + GG S E + +
Sbjct: 237 TGKHANTQIPKVIGYKRIADLQGDEGWDDAARFFWETVVERRSISIGGNSVREHFHPSED 296
Query: 392 LASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
+S L +E E+C TYNML++++ L++ + ++ Y DYYERAL N +LS + G +
Sbjct: 297 FSSMLTSEQGPETCNTYNMLRLTKMLYQTSADVHYMDYYERALYNHILSTIDPVQGG-FV 355
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ G Y + +SFWCC G+G+
Sbjct: 356 YFTPMRSG-----HYRVYSQPQTSFWCCVGSGM 383
>gi|224540696|ref|ZP_03681235.1| hypothetical protein BACCELL_05610 [Bacteroides cellulosilyticus
DSM 14838]
gi|224517692|gb|EEF86797.1| hypothetical protein BACCELL_05610 [Bacteroides cellulosilyticus
DSM 14838]
Length = 782
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 126/400 (31%), Positives = 198/400 (49%), Gaps = 36/400 (9%)
Query: 103 KEVS---LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWED 159
+EVS L DVKL S +AQQT+L Y++ ++ D L+ F + AG +Y WE+
Sbjct: 24 QEVSYFPLQDVKLLESPF-LQAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWEN 82
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--E 217
L GH GHY+SA + M+A+T + + ++ +++ L Q +G+G++ P +
Sbjct: 83 --TGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQ 140
Query: 218 QFDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEY 264
+ +A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLVALTDWMID- 199
Query: 265 FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
+ + ++ + L E GG+N+ + IT D K+L LA F L L
Sbjct: 200 -------ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPL 252
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE 384
D ++G HANT IP VIG + ++ D + FF + V GG S E
Sbjct: 253 VKDEDRLTGMHANTQIPKVIGYKRIADLAQDQNWDHAARFFWNTVVNHRSVCIGGNSVRE 312
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
+ S L + E+C TYNML++++ L++ + ++ +ADYYERAL N +L+ Q+
Sbjct: 313 HFHPADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNHILASQQP 372
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
T+ G +Y P+ G Y + +S WCC G+GL
Sbjct: 373 TKGG-FVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 406
>gi|218129947|ref|ZP_03458751.1| hypothetical protein BACEGG_01530 [Bacteroides eggerthii DSM 20697]
gi|217988057|gb|EEC54382.1| hypothetical protein BACEGG_01530 [Bacteroides eggerthii DSM 20697]
Length = 781
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 125/393 (31%), Positives = 195/393 (49%), Gaps = 33/393 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L D+KL S +AQQT+L Y++ ++ D L+ F + AG +Y WE+ L G
Sbjct: 30 LQDIKLLESPF-LQAQQTDLYYIMAMNPDRLLAPFLREAGLAPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS---------E 217
H GHY+SA + M+A+T + T+ ++ +++ L Q +G+G++ P E
Sbjct: 87 HIGGHYISALSMMYAATGDTTVYNRLNYMLNELHRAQQAVGNGFIGGTPGSLQLWKEIKE 146
Query: 218 QFDRFEA--LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
R E+ L W P Y IHK AGL D Y +A + A +M T WM
Sbjct: 147 GSIRPESFSLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIALTDWMA--------G 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + + ++ + L E GG+N++ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLTEQQMQDMLRSEHGGLNEIFADVADITGDKKYLELARRFSHKTLLEPLIGGEDHL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
+G HANT IP VIG + ++T + + FF + V GG S E +
Sbjct: 259 TGMHANTQIPKVIGYKRIADLTQNDAWDQAARFFWNTVVNHRSVCIGGNSVREHFHPADN 318
Query: 392 LASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMI 450
S L + E+C TYNML++++ LF+ + ++ +ADYYERAL N +L+ Q+ + G +
Sbjct: 319 FTSMLNDVQGPETCNTYNMLRLTKMLFQTSPDIRFADYYERALYNHILASQQPAK-GGFV 377
Query: 451 YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y P+ G Y + +S WCC G+GL
Sbjct: 378 YFTPMRSG-----HYRVYSQPETSMWCCVGSGL 405
>gi|86196151|gb|EAQ70789.1| hypothetical protein MGCH7_ch7g196 [Magnaporthe oryzae 70-15]
gi|440463815|gb|ELQ33359.1| hypothetical protein OOU_Y34scaffold00969g44 [Magnaporthe oryzae
Y34]
gi|440485206|gb|ELQ65183.1| hypothetical protein OOW_P131scaffold00516g8 [Magnaporthe oryzae
P131]
Length = 633
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 135/415 (32%), Positives = 190/415 (45%), Gaps = 31/415 (7%)
Query: 91 NPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTA 150
+P F GD L V L+ Q L Y+ +D++ L+++F+ G T
Sbjct: 23 SPPVFTDTGDSALAFDLSQVTLNQGRFR-DNQDRTLTYIKFVDLNRLLYNFRANHGVSTN 81
Query: 151 G-KAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS- 208
G +A GW+ P R H GH+L+A A+ +A + + + V L++CQ+ +
Sbjct: 82 GAQANGGWDAPDFPFRSHIQGHFLTAWANCYAVLKDQECRSRAEQFVEELAKCQDNNAAA 141
Query: 209 ----GYLSAFPSEQFDRFE--ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMT 258
GYLS FP E L PYY IHK +AGLLD + +T+A +KM
Sbjct: 142 GFQAGYLSGFPESDITAVEQRTLTNGNVPYYAIHKTMAGLLDVWRNVGSTKAKDVLVKMA 201
Query: 259 KWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKP 318
W V + S + + + E GGM++VL ++ T D + L +A FD
Sbjct: 202 GW--------VDTRTARLSYAQMQSMMGTEFGGMSEVLADMFHQTGDERWLTVARRFDHA 253
Query: 319 CFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATG 378
L LA D + G HANT +P IG+ Y+ T D Y D +H YA G
Sbjct: 254 AVLDPLARSQDSLDGLHANTQVPKWIGAAREYKATKDQRYLDIARNAWDFTVEAHTYAIG 313
Query: 379 GTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFR-----WTKEMVYADYYERAL 433
G S E + P +A L + E+C TYNMLK++R LF + D+YERAL
Sbjct: 314 GNSQSEHFRPPNAIAGYLLHDTAEACNTYNMLKLTRELFMHDAAPGMNDTAKFDFYERAL 373
Query: 434 TNGVLSIQR-GTEPGVMIYMLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
N +L Q G G + Y PL RG A W T + SFWCC GTG+
Sbjct: 374 LNHLLGQQDPGDGHGHVTYFTPLNPGGRRGVGPAWGGGTWSTDYESFWCCQGTGI 428
>gi|389647349|ref|XP_003721306.1| hypothetical protein MGG_09030 [Magnaporthe oryzae 70-15]
gi|351638698|gb|EHA46563.1| hypothetical protein MGG_09030 [Magnaporthe oryzae 70-15]
Length = 680
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 135/415 (32%), Positives = 190/415 (45%), Gaps = 31/415 (7%)
Query: 91 NPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTA 150
+P F GD L V L+ Q L Y+ +D++ L+++F+ G T
Sbjct: 70 SPPVFTDTGDSALAFDLSQVTLNQGRFR-DNQDRTLTYIKFVDLNRLLYNFRANHGVSTN 128
Query: 151 G-KAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS- 208
G +A GW+ P R H GH+L+A A+ +A + + + V L++CQ+ +
Sbjct: 129 GAQANGGWDAPDFPFRSHIQGHFLTAWANCYAVLKDQECRSRAEQFVEELAKCQDNNAAA 188
Query: 209 ----GYLSAFPSEQFDRFE--ALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMT 258
GYLS FP E L PYY IHK +AGLLD + +T+A +KM
Sbjct: 189 GFQAGYLSGFPESDITAVEQRTLTNGNVPYYAIHKTMAGLLDVWRNVGSTKAKDVLVKMA 248
Query: 259 KWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKP 318
W V + S + + + E GGM++VL ++ T D + L +A FD
Sbjct: 249 GW--------VDTRTARLSYAQMQSMMGTEFGGMSEVLADMFHQTGDERWLTVARRFDHA 300
Query: 319 CFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATG 378
L LA D + G HANT +P IG+ Y+ T D Y D +H YA G
Sbjct: 301 AVLDPLARSQDSLDGLHANTQVPKWIGAAREYKATKDQRYLDIARNAWDFTVEAHTYAIG 360
Query: 379 GTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFR-----WTKEMVYADYYERAL 433
G S E + P +A L + E+C TYNMLK++R LF + D+YERAL
Sbjct: 361 GNSQSEHFRPPNAIAGYLLHDTAEACNTYNMLKLTRELFMHDAAPGMNDTAKFDFYERAL 420
Query: 434 TNGVLSIQR-GTEPGVMIYMLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
N +L Q G G + Y PL RG A W T + SFWCC GTG+
Sbjct: 421 LNHLLGQQDPGDGHGHVTYFTPLNPGGRRGVGPAWGGGTWSTDYESFWCCQGTGI 475
>gi|427383714|ref|ZP_18880434.1| hypothetical protein HMPREF9447_01467 [Bacteroides oleiciplenus YIT
12058]
gi|425728419|gb|EKU91277.1| hypothetical protein HMPREF9447_01467 [Bacteroides oleiciplenus YIT
12058]
Length = 791
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 127/371 (34%), Positives = 180/371 (48%), Gaps = 22/371 (5%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A N++ LL DVD L+ F K AG G+++ WE L GH GHYLSA A +
Sbjct: 46 ACDLNVQILLQYDVDRLLAPFLKEAGLQPKGESFPNWEG----LDGHVGGHYLSALAIHY 101
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALK-------PVWAPYY 233
A+T NV K++M ++S L CQ K GY+ P E K W P+Y
Sbjct: 102 AATGNVDCKKRMEYMISELKRCQQKHADGYVGGVPDGMKVWNEIKKGNVGIVWKYWVPWY 161
Query: 234 TIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMN 293
+HKI AGL D + + N +A M + ++ +I + E+ L E GGM+
Sbjct: 162 NLHKIYAGLRDAWIYGGNEEARMMFLELCDWG----MTIIAPLNDEQMEQMLANEFGGMD 217
Query: 294 DVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVT 353
+V Y +T D K+L A F L +A Q D++ HANT +P V+G Q E+
Sbjct: 218 EVYADAYQMTGDMKYLNTAKRFSHKWLLDSMAAQVDNLDNKHANTQVPKVVGYQRIAELG 277
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL-GTENEESCTTYNMLKV 412
D Y+V +F + V + + GG S E ++ S + E ESC T NMLK+
Sbjct: 278 HDKKYEVATEYFWNTVVYNRSLSLGGNSRREHFAAADDCKSYVEDREGPESCNTNNMLKL 337
Query: 413 SRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRF 472
+ LFR E YAD+YERA+ N +LS Q E G +Y ++ Y +
Sbjct: 338 TEGLFRMHPEARYADFYERAMYNHILSTQH-PEHGGYVYFT-----SARPAHYRVYSAPN 391
Query: 473 SSFWCCYGTGL 483
S+ WCC GTG+
Sbjct: 392 SAMWCCVGTGM 402
>gi|332185536|ref|ZP_08387284.1| tat (twin-arginine translocation) pathway signal sequence domain
protein [Sphingomonas sp. S17]
gi|332014514|gb|EGI56571.1| tat (twin-arginine translocation) pathway signal sequence domain
protein [Sphingomonas sp. S17]
Length = 639
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 138/406 (33%), Positives = 193/406 (47%), Gaps = 39/406 (9%)
Query: 96 KLAGDFLKEVSLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAY 154
+L ++ + DV LD LH AQ+ YL+ L D L+ +F+ AG AY
Sbjct: 36 RLPATVVQPFDMADVTLDGGPFLH--AQRMTEAYLMRLQPDRLLANFRANAGLKPKAPAY 93
Query: 155 EGWE------DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS 208
GWE D C GH +GHYLSA A + +T + ++++ + + L+ CQ GS
Sbjct: 94 GGWESEPEWADINCH--GHTLGHYLSACALAYRATKDKRYRQRIDYIANELAACQKASGS 151
Query: 209 GYLSAFPS-----EQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTK 259
G + AFP R E + V P+YT+HK+ AGL D AD+ + ++
Sbjct: 152 GLVCAFPKGPALVAAHLRGEPITGV--PWYTLHKVYAGLRDSVQLADSEPSRGVLFRLAD 209
Query: 260 WMVEYFYNRVQNVITK-YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKP 318
W V V TK S E+ L E GGMN++ LY +T + + +A F +
Sbjct: 210 WGV---------VATKPLSDEQFEKMLETEYGGMNEIYADLYFMTGNEDYRRVAERFSQK 260
Query: 319 CFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATG 378
+ LA D + G HANT IP +IG Q +E TGD Y FF V + +ATG
Sbjct: 261 AIMNPLAQGRDYLDGMHANTQIPKIIGFQRVFEATGDDKYHNAAAFFWRTVAHTRAFATG 320
Query: 379 GTSAGE-FWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGV 437
G E F++ + E+C +NMLK++R LF YADYYER L NG+
Sbjct: 321 GHGDAEHFFAMADFDKHVFSAKGSETCCQHNMLKLTRALFLRDPRAEYADYYERTLYNGI 380
Query: 438 LSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
L+ Q + G+ Y G K YH T SFWCC GTG+
Sbjct: 381 LASQ-DPDSGMATYF--QGARPGYMKLYH---TPEDSFWCCTGTGM 420
>gi|423224675|ref|ZP_17211143.1| hypothetical protein HMPREF1062_03329 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392635115|gb|EIY29021.1| hypothetical protein HMPREF1062_03329 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 782
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 126/400 (31%), Positives = 198/400 (49%), Gaps = 36/400 (9%)
Query: 103 KEVS---LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWED 159
+EVS L DVKL S +AQQT+L Y++ ++ D L+ F + AG +Y WE+
Sbjct: 24 QEVSYFPLQDVKLLESPF-LQAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWEN 82
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--E 217
L GH GHY+SA + M+A+T + + ++ +++ L Q +G+G++ P +
Sbjct: 83 --TGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQ 140
Query: 218 QFDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEY 264
+ +A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLVALTDWMID- 199
Query: 265 FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
+ + ++ + L E GG+N+ + IT D K+L LA F L L
Sbjct: 200 -------ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPL 252
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE 384
D ++G HANT IP VIG + ++ D + FF + V GG S E
Sbjct: 253 VKDEDCLTGMHANTQIPKVIGYKRIADLAQDQNWDHAARFFWNTVVNHRSVCIGGNSVRE 312
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
+ S L + E+C TYNML++++ L++ + ++ +ADYYERAL N +L+ Q+
Sbjct: 313 HFHPADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNHILASQQP 372
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
T+ G +Y P+ G Y + +S WCC G+GL
Sbjct: 373 TKGG-FVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 406
>gi|330996333|ref|ZP_08320217.1| hypothetical protein HMPREF9442_01300 [Paraprevotella xylaniphila
YIT 11841]
gi|329573383|gb|EGG54994.1| hypothetical protein HMPREF9442_01300 [Paraprevotella xylaniphila
YIT 11841]
Length = 811
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 129/394 (32%), Positives = 191/394 (48%), Gaps = 35/394 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L+DV+L A+ ++ YLL LD D L+ + K AG Y WE+ L G
Sbjct: 57 LNDVRLTQGPFK-HAEDLDIRYLLGLDPDRLLAPYLKGAGLEPKADNYTNWEN--TGLDG 113
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE--------- 217
H GHY+SA A+M+A+T N +K+++ ++S Q+ G GYL P+
Sbjct: 114 HIGGHYVSALAYMYAATGNEEIKQRLDYMLSEWKRAQDAAGDGYLCGAPNGRKIWDAVSK 173
Query: 218 ---QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQ 270
Q F L W P Y IHK AGL D Y A QA +K+T WM+
Sbjct: 174 GDIQASSF-GLNGGWVPLYNIHKTYAGLRDAYVVAGCAQAKDMLVKLTDWMM-------- 224
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
N+ S E+ + L E GG+N+V + +T ++ LA F L L Q D
Sbjct: 225 NLTKDLSDEQIQDMLRSEHGGLNEVFADVADLTGKDGYMQLARRFSHREILDPLLKQEDQ 284
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
++G HANT IP VIG + ++ GD + FF V + GG S E + +
Sbjct: 285 LTGKHANTQIPKVIGYKRIADLEGDESWDDAARFFWKTVVDQRSISIGGNSVREHFHPSE 344
Query: 391 RLASTLGTEN-EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+S L +E E+C TYNML++++ L++ + + Y DYYERAL N +LS + G
Sbjct: 345 DFSSMLTSEQGPETCNTYNMLRLTKMLYQTSADAHYMDYYERALYNHILSTIDPVQGG-F 403
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+Y P+ ++ Y + +SFWCC G+G+
Sbjct: 404 VYFTPM-----RSGHYRVYSQPQTSFWCCVGSGM 432
>gi|408500683|ref|YP_006864602.1| acetyl-CoA carboxylase [Bifidobacterium asteroides PRL2011]
gi|408465507|gb|AFU71036.1| acetyl-CoA carboxylase [Bifidobacterium asteroides PRL2011]
Length = 807
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 126/388 (32%), Positives = 185/388 (47%), Gaps = 24/388 (6%)
Query: 110 VKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFV 169
V+L P S++ AQQ +YLL LD D L+ +++ AG Y WE + L GH
Sbjct: 26 VRLTPGSIYADAQQAGADYLLSLDPDRLLAPYRREAGLTATADPYPNWE--SMGLDGHIG 83
Query: 170 GHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE------------ 217
GHYLS A W S E+ T +++ L ECQ G G+L P
Sbjct: 84 GHYLSGLAAYWQSLQTWPFLERATRMLTGLLECQEASGDGFLGGMPHSAELFRNLREGHV 143
Query: 218 QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYS 277
Q F+ L W P Y +HK+ AGLLD + A +M + MV + ++
Sbjct: 144 QAQSFDLLG-SWVPLYNLHKLFAGLLDCWQSFQTKGASEMARVMVLRLADWWCDLADNID 202
Query: 278 VERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAH-LFDKPCFLGLLAVQADDISGFHA 336
+ L E GG+N+ RLY +T ++L A L D+P F LAV D ++G HA
Sbjct: 203 EQDFQTMLTCEYGGLNEAFARLYQLTGKDRYLRQARRLTDRP-FFEPLAVGKDQLTGLHA 261
Query: 337 NTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL 396
NT IP V+G + E+TGD ++ F V + G S E ++ P ++ +
Sbjct: 262 NTQIPKVLGYERLAEITGDQAFRTAVDTFWHGVVDKRTVSIGAHSISEHFNPPDDFSAMV 321
Query: 397 GT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPL 455
+ E E+C +YNM K++ L+ T + Y D+YER L N ++S E G +Y P+
Sbjct: 322 TSREGLETCNSYNMAKLALRLYDRTGQARYLDFYERVLVNHLVSTVGIREHG-FVYFTPM 380
Query: 456 GRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ + Y + + SFWCC GTGL
Sbjct: 381 -----RPRHYRVYSSAQRSFWCCVGTGL 403
>gi|254444174|ref|ZP_05057650.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
gi|198258482|gb|EDY82790.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
Length = 788
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 124/378 (32%), Positives = 179/378 (47%), Gaps = 34/378 (8%)
Query: 123 QTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWAS 182
+ ++ Y+L D D L+ F AG + Y WE + L GH GH+LSA A +
Sbjct: 47 EADVTYVLAHDPDRLLAPFLTAAGLEPKAEKYGNWE--SSGLDGHSAGHFLSAYATLSLQ 104
Query: 183 THNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ------------FDRFEALKPVWA 230
+ N L+E++ ++ L+ CQ+ +G+GYL P+ Q DRF +L W
Sbjct: 105 SDNPLLRERLDYMLDELTRCQDAIGTGYLGGVPNSQEFTTRLFAGEIKADRF-SLNGAWV 163
Query: 231 PYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSLN 286
P+Y +HK AGL D + AD+ +A + + W V K + E+ L
Sbjct: 164 PWYNLHKTYAGLKDAWLVADSEKAKNILIALADWTVA--------ATAKLTDEQMQEMLY 215
Query: 287 EETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGS 346
E GGMN++ LY TQD ++L LA+ F L L D ++GFHANT IP VIG
Sbjct: 216 TEHGGMNEIFADLYLHTQDQRYLELAYRFTHHELLDPLLENQDKLTGFHANTQIPKVIGY 275
Query: 347 QMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT-ENEESCT 405
Q D FF D V + GG S E + S L + E E+C
Sbjct: 276 QRTALAAQDEKLHQASQFFWDTVVNHRSVSIGGNSVREHFHPADDFRSMLESREGPETCN 335
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSY 465
T+NML+++ LF DYYERAL N +LS Q E G ++Y P + + Y
Sbjct: 336 THNMLRLTTLLFEAEPTAALTDYYERALYNHILSAQH-PETGGLVYFTP-----QRPRHY 389
Query: 466 HGWGTRFSSFWCCYGTGL 483
+ ++FWCC G+G+
Sbjct: 390 RVYSVPENAFWCCVGSGI 407
>gi|326801658|ref|YP_004319477.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326552422|gb|ADZ80807.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
Length = 790
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 123/379 (32%), Positives = 179/379 (47%), Gaps = 32/379 (8%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQ+T+L Y+L L+ D L+ + + AG +Y WE+ L GH GHYLSA + M
Sbjct: 51 AQETDLRYILALNPDRLLAPYLREAGLEPKASSYGNWEN--TGLDGHIGGHYLSALSLMA 108
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ-------FDRFEA----LKPVW 229
A+T N +++++T ++S L CQ++ GY+ P + + EA L W
Sbjct: 109 AATGNHAIQDRLTYMLSELKRCQDQDSDGYVGGIPGGKQMWNDIKRGKIEAQSFSLNGKW 168
Query: 230 APYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKYSVERHWNSL 285
P Y IHK+ AGL+D Y + N A LK+ KW + F I L
Sbjct: 169 VPIYNIHKLFAGLIDAYRYTGNEHARQMVLKLGKWWLSVFGGLTDEQIQTI--------L 220
Query: 286 NEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIG 345
E GG+N+V L I+ D K+L +A L L D+++G HANT IP VIG
Sbjct: 221 RSEHGGINEVFADLAQISGDQKYLTMAKRLSHRAILQPLIAGKDELTGLHANTQIPKVIG 280
Query: 346 SQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT-ENEESC 404
+ + + FF + V + GG S E + L + E E+C
Sbjct: 281 FEKIAALADSMSWANAARFFWETVVEHRTVSIGGNSESEHFHALNSFGKMLSSREGPETC 340
Query: 405 TTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKS 464
TYNM+K+S+ LF + + DYYERA N +LS Q E G +Y P+ +
Sbjct: 341 NTYNMMKLSKDLFLQGPDRKFIDYYERATYNHILSSQHPKEGG-FVYFTPM-----RPNH 394
Query: 465 YHGWGTRFSSFWCCYGTGL 483
Y + + FWCC G+GL
Sbjct: 395 YRVYSQAQACFWCCVGSGL 413
>gi|334144880|ref|YP_004538089.1| acetyl-CoA carboxylase, biotin carboxylase [Novosphingobium sp.
PP1Y]
gi|333936763|emb|CCA90122.1| acetyl-CoA carboxylase, biotin carboxylase [Novosphingobium sp.
PP1Y]
Length = 651
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 135/398 (33%), Positives = 190/398 (47%), Gaps = 35/398 (8%)
Query: 102 LKEVSLHDVKLDPSS-LHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWE-- 158
L+ L DV L+ LH AQ+ YLL L D L+ +F+ AG Y GWE
Sbjct: 50 LEPFDLSDVTLEEGPFLH--AQRLTEAYLLRLQPDRLLHNFRVNAGLAPRAAVYGGWESD 107
Query: 159 ----DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAF 214
D C GH +GHYLSA A + ST++ K+++ + + L+ CQ GSG + AF
Sbjct: 108 EIWADINCH--GHTLGHYLSACALAFRSTNDRRFKQRVDYIANELAACQKATGSGLVCAF 165
Query: 215 PSEQF---DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYN 267
P K P+YT+HK+ AGL D AD+T + +++ W V
Sbjct: 166 PDGPALLTAHLRGDKITGVPWYTLHKVYAGLRDGALLADSTVSREVLIRLADWGV----- 220
Query: 268 RVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
V T+ + + + L E GGMN+V LY +T + + L+ F + L
Sbjct: 221 ----VATRPLTDGQFETMLATEHGGMNEVYADLYAMTGNEDYRELSQRFSHKAVMDPLVQ 276
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE-F 385
D + G HANT +P ++G Q YE+TGD Y FF V + +ATGG E F
Sbjct: 277 GRDLLDGMHANTQVPKIVGFQRVYEITGDDRYAQAANFFFRTVAHTRSFATGGHGDNEHF 336
Query: 386 WSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
++ + E+C +NMLK++R LF YADYYER L NG+L+ Q +
Sbjct: 337 FAMADFDRHVFSAKGSETCCQHNMLKLARLLFMQDPNADYADYYERTLYNGILASQ-DPD 395
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G++ Y G K YH T SFWCC GTG+
Sbjct: 396 SGMVTYF--QGARPGYMKLYH---TPEHSFWCCTGTGM 428
>gi|192360871|ref|YP_001981311.1| hypothetical protein CJA_0803 [Cellvibrio japonicus Ueda107]
gi|190687036|gb|ACE84714.1| conserved hypothetical protein [Cellvibrio japonicus Ueda107]
Length = 802
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 125/375 (33%), Positives = 182/375 (48%), Gaps = 25/375 (6%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQ TN +YL+ LDV+ L+ F++ AG P + Y WE + L GH GHY+SA A +
Sbjct: 49 AQNTNKQYLMALDVEKLLAPFRREAGLPYK-ETYGNWE--STGLDGHIGGHYISALALTY 105
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS-----EQFDRFE------ALKPVW 229
AST + + ++ V++ L +CQ+K G+GYL+ P ++ R + + W
Sbjct: 106 ASTGDPAVLARLEYVITELKKCQDKNGNGYLAGLPEGAGIWQEIARGDIRADNFSTNERW 165
Query: 230 APYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
P+Y +HK AGL D Y + N A M E+ + +++ S E+ L+ E
Sbjct: 166 VPWYNLHKTFAGLRDAYRYTGNETAKAMLVAFSEWTWALTKDL----SDEQMQTLLHTEH 221
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GGMNDV + IT D ++L LA F L L + D ++G HANT IP VIG +
Sbjct: 222 GGMNDVFVDVADITGDKRYLHLAERFSHRAILQPLLEKRDALTGLHANTQIPKVIGFKRV 281
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL-GTENEESCTTYN 408
+ ++ FF + V A GG S E + S + E E+C TYN
Sbjct: 282 GDAEQLAEWQSAAEFFWETVVNKRSVAIGGNSVREHFHPQDNFHSMIEDVEGPETCNTYN 341
Query: 409 MLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGW 468
MLK++ LF Y DYYERAL N +L Q + G +Y P+ + Y +
Sbjct: 342 MLKLTEQLFLDNPLGKYGDYYERALYNHILGSQH-PQTGGFVYFTPM-----RPNHYRVY 395
Query: 469 GTRFSSFWCCYGTGL 483
WCC G+GL
Sbjct: 396 SQVHDGMWCCVGSGL 410
>gi|86142285|ref|ZP_01060795.1| hypothetical protein MED217_11584 [Leeuwenhoekiella blandensis
MED217]
gi|85831037|gb|EAQ49494.1| hypothetical protein MED217_11584 [Leeuwenhoekiella blandensis
MED217]
Length = 793
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 124/381 (32%), Positives = 179/381 (46%), Gaps = 25/381 (6%)
Query: 115 SSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLS 174
S + A T+ Y+ LD D L+ F + AG +Y WE+ L GH GHY+S
Sbjct: 38 SGVFKEAALTDFNYIQALDADRLLAPFLREAGLEPKADSYTNWEN--TGLDGHTAGHYIS 95
Query: 175 ASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QFDRFEA--- 224
A + +AST + KE + ++ L Q G+GY+ P + + A
Sbjct: 96 ALSMYYASTGDPKAKEMLEYALAELDRVQKSNGNGYIGGVPGSDALWAEIKAGKINAGSF 155
Query: 225 -LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWN 283
L W P Y IHK GL D + A+ QA +M + ++F + + S + +
Sbjct: 156 SLNDKWVPLYNIHKTFNGLKDAWIHAELPQAKRMLIELTDWFLD----ITADLSEAQIQD 211
Query: 284 SLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVV 343
L E GG+N+V +Y IT D K+L LA F + L LA D ++G HANT IP
Sbjct: 212 MLRSEHGGLNEVFAEVYAITSDKKYLKLAEDFSQHALLKPLAANEDILTGMHANTQIPKF 271
Query: 344 IGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN-EE 402
IG + ++ Y + F D V + GG S E ++ +S + +E E
Sbjct: 272 IGFERISQLEEAKDYHDAASNFFDNVTTRRSISIGGNSVREHFNPVDDFSSVVSSEQGPE 331
Query: 403 SCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKA 462
SC TYNMLK+S+ LF T E Y D+YER L N +LS Q G +Y P+ G
Sbjct: 332 SCNTYNMLKLSKLLFEDTSEEHYIDFYERGLYNHILSSQNPD--GGFVYFTPIRPG---- 385
Query: 463 KSYHGWGTRFSSFWCCYGTGL 483
Y + +SFWCC G+G+
Sbjct: 386 -HYRVYSQPETSFWCCVGSGM 405
>gi|295133987|ref|YP_003584663.1| glycosyl hydrolase [Zunongwangia profunda SM-A87]
gi|294982002|gb|ADF52467.1| putative glycosyl hydrolase [Zunongwangia profunda SM-A87]
Length = 794
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 129/394 (32%), Positives = 193/394 (48%), Gaps = 34/394 (8%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
+L DVKL + L A T+L+Y+L ++ D L+ F + AG ++Y WE+ L
Sbjct: 35 NLKDVKLH-TGLFEEAMYTDLDYILQMEPDRLLAPFLREAGLQPKAESYPNWEN--TGLD 91
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP-SEQF----- 219
GH GHYL+A A M+AS + +++ ++ L + Q+ G+GY+ P SE+
Sbjct: 92 GHIGGHYLTALAQMYASAGSDEALQRLNYMIGELKKAQDANGNGYVGGIPDSERIWKEIS 151
Query: 220 -DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQ 270
+ A L W P Y IHK AGL D Y A N +A +M T WM++ N +
Sbjct: 152 EGKINAGGFSLNGGWVPLYNIHKTYAGLRDAYLIAGNEEAKQMLIDLTDWMIDITANLSE 211
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
I + L E GG+N+ +Y +T D K+L LA+ F + L L + D
Sbjct: 212 AQIQEM--------LKSEHGGLNETFADVYKMTGDKKYLDLAYAFTQKQVLDPLEHEKDI 263
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
++G HANT IP VIG + + + Y T+F + V + + GG S E +
Sbjct: 264 LNGMHANTQIPKVIGYETIAALDQNKDYHNAATYFWENVVNNRTVSIGGNSVREHFHPAD 323
Query: 391 RLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+S + + + E+C TYNMLK+S LF E Y D+YE+ L N +LS Q G
Sbjct: 324 DFSSMINSVQGPETCNTYNMLKLSEKLFLANPEEKYIDFYEQGLYNHILSSQHPE--GGF 381
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+Y P+ G Y + +S WCC G+GL
Sbjct: 382 VYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 410
>gi|423223047|ref|ZP_17209516.1| hypothetical protein HMPREF1062_01702 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640316|gb|EIY34118.1| hypothetical protein HMPREF1062_01702 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 790
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 125/378 (33%), Positives = 180/378 (47%), Gaps = 30/378 (7%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A+ N+E LL D D L+ ++K AG K Y W+ L GH GHYL+A A +
Sbjct: 43 ARDLNIETLLKYDCDRLMAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMA-IN 97
Query: 181 ASTHNVTLKEKMTAVVSALSECQN-------KMGSGYLSAFPSEQF-------DRFEALK 226
A+T N +++M ++S ++EC + G GY+ P+ Q F
Sbjct: 98 AATGNEECRKRMEYIISEIAECAEANCKNHPQWGVGYMGGMPNSQNIWNGFKDGDFRVYS 157
Query: 227 PVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLN 286
WAP+Y +HK+ AGL D + + N QA K + F N ++ + S E+ L
Sbjct: 158 GSWAPFYNLHKMYAGLRDAWLYCGNEQA----KSLFLQFCNWAIHITSGLSDEQMERMLG 213
Query: 287 EETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGS 346
E GGMN+VL Y IT + K+L A F ++ + D + HANT +P VIG
Sbjct: 214 NEHGGMNEVLADAYAITHEQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQVPKVIGF 273
Query: 347 QMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG-TENEESCT 405
+ E++G+ Y V +FF DIV A GG S E + + + ESC
Sbjct: 274 ERISELSGNEDYHVASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDIDGPESCN 333
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSY 465
T NMLK++ L R E YADYYE A N +LS Q E G +Y P ++ + Y
Sbjct: 334 TNNMLKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTP-----ARPRHY 387
Query: 466 HGWGTRFSSFWCCYGTGL 483
+ + WCC GTG+
Sbjct: 388 RNYSAPNEAMWCCVGTGM 405
>gi|427403045|ref|ZP_18894042.1| hypothetical protein HMPREF9710_03638 [Massilia timonae CCUG 45783]
gi|425718056|gb|EKU81008.1| hypothetical protein HMPREF9710_03638 [Massilia timonae CCUG 45783]
Length = 781
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 131/390 (33%), Positives = 182/390 (46%), Gaps = 28/390 (7%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L V+L P AQ TNL YL+ ++ D L+ F + AG +Y WE + L G
Sbjct: 25 LSAVRLGPGPF-LDAQTTNLNYLMAMEPDRLLAPFLREAGLQPRQPSYGNWE--STGLDG 81
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE--------- 217
H GHYLSA A M AST + ++ V+ L Q G GYL P
Sbjct: 82 HMGGHYLSALALMHASTGDQEALRRLNYFVAELKRAQQANGDGYLGGIPGGRQAWRDIAA 141
Query: 218 ---QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
+ D F ++ W P+Y +HK+ AGL D Y +A N A K M+ + +
Sbjct: 142 GKLEADNF-SVNGKWVPWYNLHKVYAGLRDAYRYAGNEDA----KAMLVQLSDWALALSA 196
Query: 275 KYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGF 334
K S E+ L E GGMN++ + +T + K+L LA F L LA + D ++G
Sbjct: 197 KLSPEQMQTMLRSEHGGMNEIFVDVAEMTGERKYLDLALAFSHQAVLQPLARKQDQLTGL 256
Query: 335 HANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLAS 394
HANT IP VIG + ++TG FF V A GG S E +
Sbjct: 257 HANTQIPKVIGFKRIADMTGRQDMGEAARFFWQTVVDKRTVAIGGNSVKEHFHSTDDFDP 316
Query: 395 TL-GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYML 453
+ E E+C TYNMLK++ LFR ++ +Y+DYYERAL N +LS QR G +Y
Sbjct: 317 MVHEVEGPETCNTYNMLKLTGMLFRSEQKGMYSDYYERALYNHILSSQR--PEGGFVYFT 374
Query: 454 PLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
P+ + Y + WCC G+G+
Sbjct: 375 PM-----RPNHYRVYSQVDKGMWCCVGSGI 399
>gi|224537183|ref|ZP_03677722.1| hypothetical protein BACCELL_02060 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521238|gb|EEF90343.1| hypothetical protein BACCELL_02060 [Bacteroides cellulosilyticus
DSM 14838]
Length = 790
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 125/378 (33%), Positives = 180/378 (47%), Gaps = 30/378 (7%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A+ N+E LL D D L+ ++K AG K Y W+ L GH GHYL+A A +
Sbjct: 43 ARDLNIETLLKYDCDRLMAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMA-IN 97
Query: 181 ASTHNVTLKEKMTAVVSALSECQN-------KMGSGYLSAFPSEQF-------DRFEALK 226
A+T N +++M ++S ++EC + G GY+ P+ Q F
Sbjct: 98 AATGNEECRKRMEYIISEIAECAEANSKNHPQWGIGYMGGMPNSQNIWNGFKDGDFRVYS 157
Query: 227 PVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLN 286
WAP+Y +HK+ AGL D + + N QA K + F N ++ + S E+ L
Sbjct: 158 GSWAPFYNLHKMYAGLRDAWLYCGNEQA----KSLFLQFCNWAIHITSGLSDEQMERMLG 213
Query: 287 EETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGS 346
E GGMN+VL Y IT + K+L A F ++ + D + HANT +P VIG
Sbjct: 214 NEHGGMNEVLADAYAITHEQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQVPKVIGF 273
Query: 347 QMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG-TENEESCT 405
+ E++G+ Y V +FF DIV A GG S E + + + ESC
Sbjct: 274 ERISELSGNEDYHVASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDIDGPESCN 333
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSY 465
T NMLK++ L R E YADYYE A N +LS Q E G +Y P ++ + Y
Sbjct: 334 TNNMLKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTP-----ARPRHY 387
Query: 466 HGWGTRFSSFWCCYGTGL 483
+ + WCC GTG+
Sbjct: 388 RNYSAPNEAMWCCVGTGM 405
>gi|189464752|ref|ZP_03013537.1| hypothetical protein BACINT_01096 [Bacteroides intestinalis DSM
17393]
gi|189437026|gb|EDV06011.1| hypothetical protein BACINT_01096 [Bacteroides intestinalis DSM
17393]
Length = 790
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 123/378 (32%), Positives = 181/378 (47%), Gaps = 30/378 (7%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A+ N+E LL D D L+ ++K AG K Y W+ L GH GHYL+A A +
Sbjct: 43 ARDLNIETLLKYDCDRLIAPYRKEAGLTPKAKCYPNWDG----LDGHVGGHYLTAMA-IN 97
Query: 181 ASTHNVTLKEKMTAVVSALSECQN-------KMGSGYLSAFPSEQF-------DRFEALK 226
A+T N +++M +++ ++EC K G GY+ P+ Q F
Sbjct: 98 AATGNEECRKRMEYIINEIAECAEANYKNHPKWGVGYMGGMPNSQNIWSGFKNGDFRVYS 157
Query: 227 PVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLN 286
WAP+Y +HK+ AGL D + + N QA K + F N ++ + S E+ L
Sbjct: 158 GSWAPFYNLHKMYAGLRDAWLYCGNEQA----KTLFLQFCNWAIDITSGLSDEQMERMLG 213
Query: 287 EETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGS 346
E GGMN+VL Y IT++ K+L A F ++ + D + HANT +P VIG
Sbjct: 214 NEHGGMNEVLADAYAITREQKYLDCAKRFSHKRLFTPMSQRQDCLDNMHANTQVPKVIGF 273
Query: 347 QMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG-TENEESCT 405
+ E++G+ Y + +FF DIV A GG S E + + + ESC
Sbjct: 274 ERISELSGNEDYHMASSFFWDIVTGERSLAFGGNSRREHFPAKDACMDFINDIDGPESCN 333
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSY 465
T N+LK++ L R E YADYYE A N +LS Q E G +Y P ++ + Y
Sbjct: 334 TNNILKLTEDLHRRNPEARYADYYELATFNHILSTQH-PEHGGYVYFTP-----ARPRHY 387
Query: 466 HGWGTRFSSFWCCYGTGL 483
+ + WCC GTG+
Sbjct: 388 RNYSAPNEAMWCCVGTGM 405
>gi|374992692|ref|YP_004968187.1| hypothetical protein SBI_09938 [Streptomyces bingchenggensis BCW-1]
gi|297163344|gb|ADI13056.1| hypothetical protein SBI_09938 [Streptomyces bingchenggensis BCW-1]
Length = 769
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 128/377 (33%), Positives = 178/377 (47%), Gaps = 29/377 (7%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQ T L+YLL LD D L+ ++ AG P ++Y WE + L GH VGH LS +A M
Sbjct: 19 AQATALDYLLSLDTDRLLAPLRREAGLPPVAESYGNWE--SSGLDGHTVGHALSGAALMS 76
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE------------QFDRFEALKPV 228
A T + + + +V + ECQ+ +G+GY+ P + D FE L
Sbjct: 77 AVTDDPRPRAMVDRLVQGVVECQDALGTGYVGGVPDGVRLWQRVAAGQVERDSFE-LGGA 135
Query: 229 WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEE 288
W P+Y +HK+ AGLLD Y + AL + + +++ V + H L E
Sbjct: 136 WVPWYNLHKLFAGLLDAYRHTGSEPALTAVRRLADWW----GRVAAGMDDDTHEAMLRTE 191
Query: 289 TGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQM 348
GGM +VL L +T ++ LA F L L D + G HANT I V+G Q
Sbjct: 192 FGGMCEVLADLAEVTGTDRYAALARRFLDQSLLRPLCEHRDVLDGMHANTQIAKVVGYQR 251
Query: 349 RYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT-ENEESCTTY 407
EV DP + FF + + GG S E +S L + E E+C TY
Sbjct: 252 LGEVVDDPGLRDAARFFWQAMTRHRTVSFGGNSVREHLHPRDDFSSALQSPEGPETCNTY 311
Query: 408 NMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP-GVMIYMLPLGRGDSKAKSYH 466
NMLK+SR LF + D+YERA N +LS +P G ++Y P+ G + S
Sbjct: 312 NMLKLSRALFLERPDTEVLDHYERATVNHILS---SLQPKGGLVYFTPVRPGHYRVVS-- 366
Query: 467 GWGTRFSSFWCCYGTGL 483
T + FWCC GTGL
Sbjct: 367 ---TPQNCFWCCVGTGL 380
>gi|371776971|ref|ZP_09483293.1| Acetyl-CoA carboxylase, biotin carboxylase [Anaerophaga sp. HS1]
Length = 794
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 118/375 (31%), Positives = 186/375 (49%), Gaps = 24/375 (6%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A++ N +Y++ D D ++ F AG + Y WE L GHF GHYL++ + M
Sbjct: 49 AEELNEKYVMAHDPDRILAPFLIDAGLKPKAQGYGNWE--GSGLNGHFGGHYLTSLSLMI 106
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFE-----------ALKPVW 229
AST + ++++ +V L+ CQ G+GY+ P Q E +L W
Sbjct: 107 ASTGSEEARKRLDYMVDQLARCQKANGNGYVGGIPGGQAMWAEIAKGNINAGNFSLNGKW 166
Query: 230 APYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
P Y IHK+ AGL D + A N +A ++ + ++F N +N +T +++ L E
Sbjct: 167 VPLYNIHKLFAGLRDAWLLAQNKKAKEVLINLTDWFLNLTKN-LTDDQIQK---MLVSEH 222
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GG+N+V +Y IT + +L LA F L L Q D ++G HANT IP VIG
Sbjct: 223 GGLNEVFADVYDITGNENYLKLARRFSHQAILRPLLQQKDQLTGLHANTQIPKVIGFMRI 282
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT-ENEESCTTYN 408
E+ D + FF + V + + GG S E + +S + + + E+C TYN
Sbjct: 283 GELAHDTAWINAADFFWNTVVQNRTVSIGGNSTHEHFHAVDDFSSMIESRQGPETCNTYN 342
Query: 409 MLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGW 468
MLK+S+ LF + ++ Y DYYE+AL N +LS Q G++ + + + Y +
Sbjct: 343 MLKLSKQLFLFKNDLKYIDYYEQALYNHILSSQHPLHGGLVYFT------SMRPRHYRVY 396
Query: 469 GTRFSSFWCCYGTGL 483
+FWCC G+G+
Sbjct: 397 SRPEQTFWCCVGSGI 411
>gi|359453850|ref|ZP_09243152.1| hypothetical protein P20495_1902 [Pseudoalteromonas sp. BSi20495]
gi|358049097|dbj|GAA79401.1| hypothetical protein P20495_1902 [Pseudoalteromonas sp. BSi20495]
Length = 816
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 127/376 (33%), Positives = 178/376 (47%), Gaps = 27/376 (7%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQQTN+ YLL L D L+ + + AG +Y WED L GH GHYLS+ + W
Sbjct: 64 AQQTNVRYLLALYPDQLLAPYLREAGIEQKAPSYGNWED--TGLDGHIGGHYLSSLSLAW 121
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF------------DRFEALKPV 228
A+T + LK ++ +++ L Q ++ GYL P Q D F +L
Sbjct: 122 AATGDEELKRRLDYMLNELQRAQ-QVNDGYLGGIPDGQAMWQQIHDGNIKADLF-SLNDR 179
Query: 229 WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEE 288
W P Y I KI GL D Y A + QA M + E+F N + K S E+ L E
Sbjct: 180 WVPLYNIDKIFHGLRDAYLIAGSEQAKTMLFDLGEWFLN----LTAKLSDEQIQQMLYSE 235
Query: 289 TGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQM 348
GG+N V + TI D ++L LA F + L + D ++G HANT IP +IG
Sbjct: 236 YGGLNAVFADMATIGNDKRYLKLARQFTHNNIIDPLLEKQDKLTGLHANTQIPKIIGMLK 295
Query: 349 RYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL-GTENEESCTTY 407
E + D ++ +F V A GG S E + D + E E+C TY
Sbjct: 296 VAEASDDKAWQQGADYFWQTVTKQRSVAIGGNSVSEHFHDKNDFTPMVEDVEGPETCNTY 355
Query: 408 NMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHG 467
NM+K+S+ LF T + Y +YYERA N +LS Q E G ++Y + G Y
Sbjct: 356 NMMKLSKLLFLKTADTRYLEYYERATYNHILSSQH-PEHGGLVYFTSMRPG-----HYRM 409
Query: 468 WGTRFSSFWCCYGTGL 483
+ + S WCC G+G+
Sbjct: 410 YSSVQDSMWCCVGSGI 425
>gi|319786479|ref|YP_004145954.1| hypothetical protein Psesu_0871 [Pseudoxanthomonas suwonensis 11-1]
gi|317464991|gb|ADV26723.1| protein of unknown function DUF1680 [Pseudoxanthomonas suwonensis
11-1]
Length = 806
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 130/394 (32%), Positives = 194/394 (49%), Gaps = 26/394 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L+ L DV+L + R+ NL YL LD D L+ F+ AG P+ Y WE +
Sbjct: 35 LQAFPLEDVRLGDGAFA-RSSALNLRYLAALDPDRLLAPFRIEAGLPSPAPKYPNWE--S 91
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF-- 219
L GH GHYLSA A A+ + ++ ++ +V+ALS+ Q G GY+ P+ +
Sbjct: 92 MGLDGHTAGHYLSALAQQ-AAQGSAGMRRRLDYMVAALSQVQAANGDGYVGGVPNGRVLW 150
Query: 220 -----DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ 270
F+A L+ W P+Y +HK AGL D + A N QA + ++ V
Sbjct: 151 NRIASGDFQAESFSLEGAWVPFYNLHKTYAGLRDAWLLAGNAQARDVLVRFADWAGALVA 210
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
N + ++R L+ E GGMN+VL +Y IT D ++L LA F L L + D
Sbjct: 211 N-LDDTQLQR---VLDTEHGGMNEVLADVYAITGDRRYLALARRFSHRAILDPLLRREDR 266
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
+ G HANT IP VIG E+ GD + FF + V A GG S E ++
Sbjct: 267 LDGLHANTQIPKVIGFARIGELDGDVEWIEAAQFFWERVALHRSIAFGGNSTREHFNPAD 326
Query: 391 RLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+ + + E E+C +YNML+++ L R + +AD+YERAL N +LS Q + G +
Sbjct: 327 DFSGMIASREGPETCNSYNMLRLTLLLERLRPDPRHADFYERALFNHILSTQH-PDHGGL 385
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+Y P+ + + Y + FWCC G+G+
Sbjct: 386 VYFTPI-----RPRHYRVYSQPQECFWCCVGSGM 414
>gi|452750721|ref|ZP_21950468.1| Putative glycosyl hydrolase of unknown function (DUF1680) [alpha
proteobacterium JLT2015]
gi|451961915|gb|EMD84324.1| Putative glycosyl hydrolase of unknown function (DUF1680) [alpha
proteobacterium JLT2015]
Length = 744
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 124/383 (32%), Positives = 174/383 (45%), Gaps = 31/383 (8%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
A + N EYL+ LD D L+ +++ +AG G Y GWE T + GH +GHYLSA A
Sbjct: 9 AVERNREYLMSLDPDRLLHNYRTSAGLAPKGDVYGGWESDT--IAGHTLGHYLSALALTH 66
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP-----------SEQFDRFEA----- 224
A T + + +V L+ Q G GY++ F E F A
Sbjct: 67 AQTGDEESCRRANYIVGELATVQAAHGDGYVAGFTRKRPDGEIVDGKEIFPEIMAGDIRS 126
Query: 225 ----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVER 280
L W P Y HK+ GL D N AL + + +Y + + E+
Sbjct: 127 AGFDLNGCWVPLYNWHKLYTGLYDVADLCGNRTALPIAVALGDY----IDRMFAALDDEQ 182
Query: 281 HWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHI 340
L E GG+N+ LY T + + L L L L D ++ FHANT +
Sbjct: 183 VQTVLACEYGGLNESFAELYARTGERRWLRLGERIYDNKVLDPLTRGEDRLANFHANTQV 242
Query: 341 PVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN 400
P +IG YE+T P FF D V H Y GG + E++S+P ++ + +
Sbjct: 243 PKLIGLARLYELTSKPAQGAAAEFFWDTVTKRHSYVIGGNADREYFSEPNSISKHITEQT 302
Query: 401 EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
E C +YNMLK++RHL+ W D+YERA N +LS Q+ E G YM PL G +
Sbjct: 303 CEHCNSYNMLKLTRHLYSWRPRSALFDFYERAHLNHILS-QQHPETGGFSYMTPLMSGTA 361
Query: 461 KAKSYHGWGTRFSSFWCCYGTGL 483
+ S G +FWCC GTG+
Sbjct: 362 REYSEPGK----DAFWCCVGTGM 380
>gi|291544094|emb|CBL17203.1| Uncharacterized protein conserved in bacteria [Ruminococcus
champanellensis 18P13]
Length = 1075
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 138/413 (33%), Positives = 200/413 (48%), Gaps = 44/413 (10%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDP 160
+++ SL D+ + + A +EYLL D D L+ F++ A T G K Y GWE+
Sbjct: 36 IEDFSLADLTM-TDAYTVNAFSKEVEYLLSFDTDRLLCGFRENAKLDTKGAKRYAGWENT 94
Query: 161 TCELRGHFVGHYLSASAHMW-----ASTHNVTLKEKMTAVVSALSECQ--NKMGSGYLSA 213
+ GH VGHYL+A A + + L+ K+ A++ + CQ +K G+L A
Sbjct: 95 L--IAGHSVGHYLTAVAQAYQNPTLTAAQRSALEGKIKALLDGMRVCQQNSKGKPGFLWA 152
Query: 214 FPSE-------QFDRFEA-----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
+ QFD E + W P+YT+HKI+ GL+D Y N A + +
Sbjct: 153 GQIKNANNVEVQFDLVEQGKTNIINESWVPWYTMHKIVQGLVDVYNATGNETAKTIASDL 212
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCF- 320
++ YNR +K+S + H L+ E GGMND LY LY IT H + AH FD+
Sbjct: 213 GDWTYNRA----SKWSAQTHNTVLSIEYGGMNDCLYELYEITGKDTHAVAAHYFDETNLH 268
Query: 321 LGLLAVQADDISGFHANTHIPVVIGSQMRY------EVTGDPL----YKVTGTFFMDIVN 370
+L + ++ HANT IP IG+ RY V G+ + Y F D+V
Sbjct: 269 EAVLKGGRNVLTNKHANTTIPKFIGALKRYIVLDGKTVNGEKIDASRYLEYAEAFWDMVT 328
Query: 371 ASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYE 430
H Y TGG S E + + L N E+C +YNMLK+SR LF+ T + Y D+YE
Sbjct: 329 THHTYITGGNSEWEHFGEDDILDKERTNCNCETCNSYNMLKLSRELFKITGDRKYMDFYE 388
Query: 431 RALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
N +LS Q E G+ Y P+ G K + + + SFWCC G+G+
Sbjct: 389 GTYYNSILSSQN-PESGMTTYFQPMATGYFKV-----YSSPYDSFWCCTGSGM 435
>gi|329847096|ref|ZP_08262124.1| tat twin-arginine translocation pathway signal sequence domain
protein [Asticcacaulis biprosthecum C19]
gi|328842159|gb|EGF91728.1| tat twin-arginine translocation pathway signal sequence domain
protein [Asticcacaulis biprosthecum C19]
Length = 795
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 125/399 (31%), Positives = 189/399 (47%), Gaps = 33/399 (8%)
Query: 105 VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCEL 164
++L DV+L PS A N YLL L+ D + +++K AG + Y GWE+ T +
Sbjct: 44 LALGDVRLLPSPFK-TALDVNHTYLLTLEPDRFLHNYRKGAGLTPKAEKYGGWENDT--I 100
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--------- 215
GH +GHYLSA + M+A T + TLK + V+ L+ Q G GY++ F
Sbjct: 101 AGHSLGHYLSAISLMYAQTGDATLKARAAYVIDELALIQGMQGDGYVAGFTRKRPDGTIV 160
Query: 216 --SEQFDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEY 264
E F +A L W P Y HK+ GL D TF + + + + Y
Sbjct: 161 DGKELFAEIKAGDIRSAGFDLNGCWVPLYNWHKLYTGLFDAQTFCGLNKGVVVATGLGHY 220
Query: 265 FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
+ +V + ++ LN E GG+N+ L+ T D + L LA L +
Sbjct: 221 ----IDSVFAALNDDQVQQVLNCEFGGLNESFAELHARTGDARWLTLAERMHHNRVLDPM 276
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+ D ++ H+NT IP V+G YE+TG Y FF + V H Y GG E
Sbjct: 277 IKREDKLANIHSNTTIPKVLGLARLYEITGKADYHTASDFFWERVTGHHSYVIGGNGDRE 336
Query: 385 FWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT 444
++ +P ++ + E C TYNML+++R L+ W + DY+ERA N VLS Q+
Sbjct: 337 YFFEPDTISRHITEATCEHCATYNMLRLTRFLYSWQPDASRFDYFERAHLNHVLS-QQNP 395
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ YM PL G + G+ ++ CC+GTG+
Sbjct: 396 KTGMFSYMTPLFTGAER-----GFSDPVDNWTCCHGTGM 429
>gi|302422424|ref|XP_003009042.1| secreted protein [Verticillium albo-atrum VaMs.102]
gi|261352188|gb|EEY14616.1| secreted protein [Verticillium albo-atrum VaMs.102]
Length = 635
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 128/394 (32%), Positives = 189/394 (47%), Gaps = 25/394 (6%)
Query: 107 LHDVKLDPSSLHW-RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCEL 164
+ V L+P W Q L Y+ +DVD L++ F++T G P G + GW+ P
Sbjct: 51 MSQVSLNPG--RWLENQDRTLNYIKFVDVDRLLYVFRQTHGLPLQGAQPNGGWDAPDFPF 108
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQ---NKMG--SGYLSAFPSEQF 219
R HF GH+L+A ++ WA + +++ + + L++CQ +K G GYLS FP +
Sbjct: 109 RSHFQGHFLNAWSYCWAVLRDEACRDRASYFATELAKCQGNNDKAGFNPGYLSGFPESEI 168
Query: 220 DRFE--ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYS 277
+ E L PYY+IHK +AGLLD + + A + M + R K S
Sbjct: 169 EAVEKRTLSNGNVPYYSIHKTMAGLLDVWRHIGDETARDVLLGMAGWVDLRT----GKLS 224
Query: 278 VERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHAN 337
+ ++ E GGMN+V+ ++ T D + L +A FD LA D ++G HAN
Sbjct: 225 YSQMQTMMSTEFGGMNEVMADIFHQTGDERWLTVAQRFDHASVFDPLAGNRDSLNGLHAN 284
Query: 338 THIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG 397
T +P IG+ Y+ TG Y +I +H YA G S E + P +AS L
Sbjct: 285 TQVPKWIGAAREYKATGTTRYSDIAHNAWNITVQAHTYAIGANSQSEHFRPPNAIASYLD 344
Query: 398 TENEESCTTYNMLKVSRHLFRWTKE---MVYADYYERALTNGVLSIQRGTEP-GVMIYML 453
+ E+C TYNMLK++R L W + Y D+YE+AL N + Q + G + Y
Sbjct: 345 EDTAEACNTYNMLKLTREL--WVMDPSNSKYFDFYEQALINHAIGQQDPSSAHGHVTYFT 402
Query: 454 PLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
L RG A W T + + WCC GT L
Sbjct: 403 SLNPGGHRGVGPAWGGGTWSTDYGTAWCCQGTAL 436
>gi|346970201|gb|EGY13653.1| secreted protein [Verticillium dahliae VdLs.17]
Length = 634
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 126/394 (31%), Positives = 186/394 (47%), Gaps = 25/394 (6%)
Query: 107 LHDVKLDPSSLHW-RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCEL 164
+ V L+P W Q L Y+ +DVD L++ F++T G P G + GW+ P
Sbjct: 51 MSQVSLNPG--RWLENQDRTLSYIKFVDVDRLLYVFRQTHGLPLQGAQPNGGWDAPDFPF 108
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAFPSEQF 219
R HF GH+L+A ++ WA + +++ + + L++CQ GYLS FP +
Sbjct: 109 RSHFQGHFLNAWSYCWAVLRDEECRDRASYFATELAKCQANNEQAGFNPGYLSGFPESEI 168
Query: 220 DRFE--ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYS 277
+ E L PYY+IHK +AGLLD + + A + M + R K S
Sbjct: 169 EALEKRTLSNGNVPYYSIHKTMAGLLDVWRHIGDETARDVLLGMAGWVDLRT----GKLS 224
Query: 278 VERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHAN 337
+ ++ E GGMN+V+ ++ T D + L +A FD LA D ++G HAN
Sbjct: 225 YSQMQTMMSTEFGGMNEVMADIFHQTGDERWLTVAQRFDHASVFDPLAGNRDSLNGLHAN 284
Query: 338 THIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG 397
T +P IG+ Y+ TG Y +I +H YA G S E + P +AS L
Sbjct: 285 TQVPKWIGAAREYKATGTTRYSDIARNAWNITVQAHTYAIGANSQSEHFRPPNAIASYLD 344
Query: 398 TENEESCTTYNMLKVSRHLFRWTKE---MVYADYYERALTNGVLSIQRGTEP-GVMIYML 453
+ E+C TYNMLK++R L W + Y D+YE+AL N + Q + G + Y
Sbjct: 345 EDTAEACNTYNMLKLTREL--WVMDPSNSKYFDFYEQALINHAIGQQDPSSAHGHVTYFT 402
Query: 454 PLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
L RG A W T + + WCC GT L
Sbjct: 403 SLNPGGHRGVGPAWGGGTWSTDYGTAWCCQGTAL 436
>gi|189466409|ref|ZP_03015194.1| hypothetical protein BACINT_02784 [Bacteroides intestinalis DSM
17393]
gi|189434673|gb|EDV03658.1| hypothetical protein BACINT_02784 [Bacteroides intestinalis DSM
17393]
Length = 789
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 126/407 (30%), Positives = 197/407 (48%), Gaps = 43/407 (10%)
Query: 103 KEVS---LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWED 159
+EVS L DVKL S +AQQT+L Y++ ++ D L+ F + AG +Y WE+
Sbjct: 24 QEVSYFPLQDVKLLESPF-LQAQQTDLHYIMAMEPDRLLAPFLREAGLTPKAPSYTNWEN 82
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--E 217
L GH GHY+SA + M+A+T + + ++ +++ L Q +G+G++ P +
Sbjct: 83 --TGLDGHIGGHYISALSMMYAATGDTAIYNRLNYMLAELHRAQQAVGTGFIGGTPGSLQ 140
Query: 218 QFDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEY 264
+ +A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 141 LWKEIKAGNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSNLAREMLIALTDWMID- 199
Query: 265 FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
+ + ++ + L E GG+N+ + IT D K+L LA F L L
Sbjct: 200 -------ITAGLTDQQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPL 252
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPL-------YKVTGTFFMDIVNASHGYAT 377
D ++G HANT IP VIG + ++ D + FF + V
Sbjct: 253 VKDEDRLTGMHANTQIPKVIGYKRIADLAQDDKDWNHASEWDHAARFFWNTVVNHRSVCI 312
Query: 378 GGTSAGEFWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNG 436
GG S E + S L + E+C TYNML++++ L++ + ++ +ADYYERAL N
Sbjct: 313 GGNSVREHFHPADNFTSMLNDVQGPETCNTYNMLRLTKMLYQTSPDIRFADYYERALYNH 372
Query: 437 VLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q+ E G +Y P+ G Y + +S WCC G+GL
Sbjct: 373 ILASQQ-PEKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 413
>gi|325299889|ref|YP_004259806.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
gi|324319442|gb|ADY37333.1| protein of unknown function DUF1680 [Bacteroides salanitronis DSM
18170]
Length = 797
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 124/379 (32%), Positives = 179/379 (47%), Gaps = 29/379 (7%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHM 179
+A N++ L D D L+ + K AG P+ + + WE L GH GHYLSA A
Sbjct: 43 QACDLNVKTLKQYDTDRLLAPYLKEAGLPSKAEGFSNWEG----LDGHVGGHYLSALAIH 98
Query: 180 WASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QFDRFEALKPVWAPY 232
+A+T + +++M +VS L CQ G+GY+ P Q + W P+
Sbjct: 99 YAATGDAECRQRMDYMVSELKRCQEAHGNGYIGGVPDGERLWKEIQQGNVGLIWKYWVPW 158
Query: 233 YTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGM 292
Y +HK AGL D + + N +A +M + ++ VI S E+ L E GGM
Sbjct: 159 YNLHKTYAGLRDAWAYGGNEEARQMFLDLCDWGLT----VIAPLSDEQMEQMLENEFGGM 214
Query: 293 NDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEV 352
++V Y +T D K+L A F L +A D++ HANT +P V+G Q E+
Sbjct: 215 DEVYADAYEMTGDVKYLDAAKRFSHHWLLDSMAAGIDNLDNKHANTQVPKVVGYQRIAEL 274
Query: 353 TGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR-LASTLGTENEESC 404
+ LY+ FF V + A GG S E ++ + L+ E ESC
Sbjct: 275 SARSGHTEDAALYRKASEFFWQTVVETRSLALGGNSRREHFAPAEDCLSYVYDREGPESC 334
Query: 405 TTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKS 464
T NMLK++ LFR E YADYYERA+ N +LS Q E G +Y P ++
Sbjct: 335 NTNNMLKLTEGLFRLNPEARYADYYERAVLNHILSTQH-PEHGGYVYFTP-----ARPAH 388
Query: 465 YHGWGTRFSSFWCCYGTGL 483
Y + S+ WCC GTG+
Sbjct: 389 YRVYSAPNSAMWCCVGTGM 407
>gi|380694971|ref|ZP_09859830.1| hypothetical protein BfaeM_13572 [Bacteroides faecis MAJ27]
Length = 802
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 132/409 (32%), Positives = 193/409 (47%), Gaps = 48/409 (11%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
SL DVKL SS +AQQT+L Y+L LD D L F + AG +Y WE+ L
Sbjct: 29 SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE 223
GH GHYLSA + M+A+T + + ++ +++ L Q +G+G++ P + + +
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 224 A---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQ 270
A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+ + S + + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDSQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKVILDPLIKDEDR 257
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAG 383
++G HANT IP VIG + EV+ D + FF + V GG S
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKDDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 384 EFWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEM--------VYADYYERALT 434
E + S L + E+C TYNML++++ L++ + ++ Y DYYERAL
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 435 NGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
N +LS Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|294646986|ref|ZP_06724603.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294806386|ref|ZP_06765229.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|292637657|gb|EFF56058.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294446401|gb|EFG15025.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 813
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 123/402 (30%), Positives = 188/402 (46%), Gaps = 38/402 (9%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
+ E + DVKL + A++ N+E LL DVD L+ ++K AG K Y W+
Sbjct: 39 YKNEFPIADVKL-LDGVFKHARELNIEVLLKYDVDRLLAPYRKEAGLTERKKTYPNWDG- 96
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSEC--QNKMGS-----GYLSA 213
L GH GHYLSA + +A+T N +M ++S L C N + + GY+
Sbjct: 97 ---LDGHVAGHYLSAMSMNYAATGNKECGRRMEYMISELQLCLEANAINNTEWAIGYIGG 153
Query: 214 FPSEQ-----FDR--FEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMV 262
FP+ + F + WAP+Y +HK+ AGL D + + +N QA LK W +
Sbjct: 154 FPNSKNLWSTFKKGDLRIYNSAWAPFYNLHKMYAGLRDAWLYCNNKQAKTLFLKFCDWAI 213
Query: 263 EYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
++ + E+ L E GGMN++L Y IT + K+L+ A + + L
Sbjct: 214 --------SITDDLNEEQMQTVLKMEYGGMNEILADAYQITGNKKYLVAAKRYSQNILLD 265
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
L+ D++ HANT IP IG E++GD Y F + + + A GG S
Sbjct: 266 PLSQGIDNLDNKHANTQIPKFIGFARIAELSGDTKYTNASRFSWETITGNRSLAFGGNSR 325
Query: 383 GEFWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E + + + + ESC +YNMLK++ LFR YADYYER + N +LS Q
Sbjct: 326 REHFPSVTSCSDYINDVDGPESCNSYNMLKLTEDLFRMQPSAHYADYYERTVFNHILSTQ 385
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G + + ++ + Y + + WCC GTG+
Sbjct: 386 HPEHGGYVYFT------SARPRHYRVYSAPNEAMWCCVGTGM 421
>gi|262405235|ref|ZP_06081785.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|345508054|ref|ZP_08787694.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
gi|229444700|gb|EEO50491.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
gi|262356110|gb|EEZ05200.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
Length = 801
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 123/402 (30%), Positives = 188/402 (46%), Gaps = 38/402 (9%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
+ E + DVKL + A++ N+E LL DVD L+ ++K AG K Y W+
Sbjct: 27 YKNEFPIADVKL-LDGVFKHARELNIEVLLKYDVDRLLAPYRKEAGLTERKKTYPNWDG- 84
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSEC--QNKMGS-----GYLSA 213
L GH GHYLSA + +A+T N +M ++S L C N + + GY+
Sbjct: 85 ---LDGHVAGHYLSAMSMNYAATGNKECGRRMEYMISELQLCLEANAINNTEWAIGYIGG 141
Query: 214 FPSEQ-----FDR--FEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMV 262
FP+ + F + WAP+Y +HK+ AGL D + + +N QA LK W +
Sbjct: 142 FPNSKNLWSTFKKGDLRIYNSAWAPFYNLHKMYAGLRDAWLYCNNKQAKTLFLKFCDWAI 201
Query: 263 EYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
++ + E+ L E GGMN++L Y IT + K+L+ A + + L
Sbjct: 202 --------SITDDLNEEQMQTVLKMEYGGMNEILADAYQITGNKKYLVAAKRYSQNILLD 253
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
L+ D++ HANT IP IG E++GD Y F + + + A GG S
Sbjct: 254 PLSQGIDNLDNKHANTQIPKFIGFARIAELSGDTKYTNASRFSWETITGNRSLAFGGNSR 313
Query: 383 GEFWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E + + + + ESC +YNMLK++ LFR YADYYER + N +LS Q
Sbjct: 314 REHFPSVTSCSDYINDVDGPESCNSYNMLKLTEDLFRMQPSAHYADYYERTVFNHILSTQ 373
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G + + ++ + Y + + WCC GTG+
Sbjct: 374 HPEHGGYVYFT------SARPRHYRVYSAPNEAMWCCVGTGM 409
>gi|265753023|ref|ZP_06088592.1| acetyl-CoA carboxylase [Bacteroides sp. 3_1_33FAA]
gi|263236209|gb|EEZ21704.1| acetyl-CoA carboxylase [Bacteroides sp. 3_1_33FAA]
Length = 797
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 130/400 (32%), Positives = 187/400 (46%), Gaps = 37/400 (9%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L EV L D S +A + YLL LDVD L+ +++ G G Y GWE
Sbjct: 44 LSEVELTD------SYFKKAMDLHKGYLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 94
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
+ G GHY+SA A M+AST L +K+ ++ L ECQ + G+ +
Sbjct: 95 -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 153
Query: 222 FEALK------------PVWA------PYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
+ L+ W +Y IHKILAGL D Y +A QA + + +
Sbjct: 154 LQLLQGNVVLNQPDETGQPWNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLAD 213
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
+ + ++ + + ++L+ E GGMN+V +Y+IT D K L A F+ +
Sbjct: 214 F----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIYP 269
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
+A D + G HAN IP +G YE + + +Y F +IV H A GG S
Sbjct: 270 IANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSCY 329
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E + P + L + E+C TYNMLK+SR LF + Y +YYE AL N +L+ Q
Sbjct: 330 ERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQDP 389
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PG + Y L G S+ + T F SFWCC GTG+
Sbjct: 390 DMPGCVTYYTSLLPG-----SFKQYSTPFDSFWCCVGTGM 424
>gi|212695367|ref|ZP_03303495.1| hypothetical protein BACDOR_04914 [Bacteroides dorei DSM 17855]
gi|212662096|gb|EEB22670.1| hypothetical protein BACDOR_04914 [Bacteroides dorei DSM 17855]
Length = 807
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 130/400 (32%), Positives = 187/400 (46%), Gaps = 37/400 (9%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L EV L D S +A + YLL LDVD L+ +++ G G Y GWE
Sbjct: 54 LSEVELTD------SYFKKAMDLHKGYLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 104
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
+ G GHY+SA A M+AST L +K+ ++ L ECQ + G+ +
Sbjct: 105 -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 163
Query: 222 FEALK------------PVWA------PYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
+ L+ W +Y IHKILAGL D Y +A QA + + +
Sbjct: 164 LQLLQGNVVLNQPDETGQPWNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLAD 223
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
+ + ++ + + ++L+ E GGMN+V +Y+IT D K L A F+ +
Sbjct: 224 F----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIYP 279
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
+A D + G HAN IP +G YE + + +Y F +IV H A GG S
Sbjct: 280 IANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSCY 339
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E + P + L + E+C TYNMLK+SR LF + Y +YYE AL N +L+ Q
Sbjct: 340 ERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQDP 399
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PG + Y L G S+ + T F SFWCC GTG+
Sbjct: 400 DMPGCVTYYTSLLPG-----SFKQYSTPFDSFWCCVGTGM 434
>gi|302897238|ref|XP_003047498.1| hypothetical protein NECHADRAFT_97856 [Nectria haematococca mpVI
77-13-4]
gi|256728428|gb|EEU41785.1| hypothetical protein NECHADRAFT_97856 [Nectria haematococca mpVI
77-13-4]
Length = 626
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 130/397 (32%), Positives = 181/397 (45%), Gaps = 26/397 (6%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTN-LEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWED 159
L EV+L D + W Q L YLL +D D L++ F+ G T G + GW+
Sbjct: 42 LSEVTLTDSR-------WMDNQNRTLTYLLSVDPDRLLYVFRANHGLDTKGAQKNGGWDA 94
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNK-----MGSGYLSAF 214
P R H GH+L+A + +A+ N + T L +CQ GYLS F
Sbjct: 95 PDFPFRSHIQGHFLTAWSQCYATLRNEECGSRATYFAKELGKCQANNEKANFTEGYLSGF 154
Query: 215 PSEQFDRFE--ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNV 272
P + E L PYY IHK LAGLLD + + A + + + R +
Sbjct: 155 PESEITAVEKRTLNNGNVPYYAIHKTLAGLLDVHRLVGDEDAKDVMLALAGWVDTRTK-- 212
Query: 273 ITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDIS 332
K + ++ + E GGMN+VL + D K L +A FD L D +S
Sbjct: 213 --KLTYDQMQAMMQTEFGGMNEVLADIAYYIGDKKWLEVAQRFDHATIFDPLEKGQDKLS 270
Query: 333 GFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRL 392
G HANT +P IG+ Y+V+G Y G D+ H YA GG S E + P +
Sbjct: 271 GLHANTQVPKWIGAIREYKVSGLQKYLDIGRNAWDLTVHKHTYAIGGNSQAEHFRAPDAI 330
Query: 393 ASTLGTENEESCTTYNMLKVSRHLFRWT-KEMVYADYYERALTNGVLSIQRGTE-PGVMI 450
A L + E+C TYNMLK++R L+ + + D+YE AL N +L Q + G +
Sbjct: 331 AEYLDNDTCEACNTYNMLKLTRELWVMDPSDASFFDFYENALMNHLLGQQNPEDHHGHIT 390
Query: 451 YMLPLG----RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y PL RG A W T + SFWCC G+G+
Sbjct: 391 YFTPLNPGGRRGVGPAWGGGTWSTDYDSFWCCQGSGI 427
>gi|357472937|ref|XP_003606753.1| hypothetical protein MTR_4g065240 [Medicago truncatula]
gi|355507808|gb|AES88950.1| hypothetical protein MTR_4g065240 [Medicago truncatula]
Length = 184
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 98/181 (54%), Positives = 120/181 (66%), Gaps = 8/181 (4%)
Query: 1 MKNFVFKVLVLFLSCWVALCKECTNSFPQLASHTFRYELLSSKNETWKKEVY---SHYHL 57
M+ FV+ L L L C A KEC N+ PQ SHT R EL++SKNETWKKEV SH H+
Sbjct: 1 MEAFVYVFLALIL-CGCANSKECINNLPQ--SHTLRTELMASKNETWKKEVMMYQSHVHV 57
Query: 58 TPTDDSAWSNLLPRKML--SETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPS 115
TP+D+SAW ++P++M E + R+MKN D K FLKEV L DV+L
Sbjct: 58 TPSDESAWQEMIPKEMFLTQEKPNVIGLLSNREMKNADVSKPPVGFLKEVPLGDVRLLEG 117
Query: 116 SLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSA 175
S+H +AQ+TNLEYLLMLDVD L+WSF+K AG PT G Y GWE P ELRGHFVG +SA
Sbjct: 118 SIHAQAQKTNLEYLLMLDVDRLIWSFRKMAGLPTPGAPYGGWEKPDQELRGHFVGCNVSA 177
Query: 176 S 176
+
Sbjct: 178 T 178
>gi|345513939|ref|ZP_08793454.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
gi|423241465|ref|ZP_17222578.1| hypothetical protein HMPREF1065_03201 [Bacteroides dorei
CL03T12C01]
gi|229435753|gb|EEO45830.1| acetyl-CoA carboxylase [Bacteroides dorei 5_1_36/D4]
gi|392641358|gb|EIY35135.1| hypothetical protein HMPREF1065_03201 [Bacteroides dorei
CL03T12C01]
Length = 797
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 130/400 (32%), Positives = 187/400 (46%), Gaps = 37/400 (9%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L EV L D S +A + YLL LDVD L+ +++ G G Y GWE
Sbjct: 44 LSEVELTD------SYFKKAMDLHKGYLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 94
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
+ G GHY+SA A M+AST L +K+ ++ L ECQ + G+ +
Sbjct: 95 -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 153
Query: 222 FEALK------------PVWA------PYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
+ L+ W +Y IHKILAGL D Y +A QA + + +
Sbjct: 154 LQLLQGNVVLNQPDETGQPWNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLAD 213
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
+ + ++ + + ++L+ E GGMN+V +Y+IT D K L A F+ +
Sbjct: 214 F----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIYP 269
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
+A D + G HAN IP +G YE + + +Y F +IV H A GG S
Sbjct: 270 IANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSCY 329
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E + P + L + E+C TYNMLK+SR LF + Y +YYE AL N +L+ Q
Sbjct: 330 ERFGVPGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQDP 389
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PG + Y L G S+ + T F SFWCC GTG+
Sbjct: 390 DMPGCVTYYTSLLPG-----SFKQYSTPFDSFWCCVGTGM 424
>gi|383123086|ref|ZP_09943771.1| hypothetical protein BSIG_0174 [Bacteroides sp. 1_1_6]
gi|251841821|gb|EES69901.1| hypothetical protein BSIG_0174 [Bacteroides sp. 1_1_6]
Length = 802
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 132/409 (32%), Positives = 193/409 (47%), Gaps = 48/409 (11%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
SL DVKL SS +AQQT+L Y+L LD D L F + AG +Y WE+ L
Sbjct: 29 SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE 223
GH GHYLSA + M+A+T + + ++ +++ L Q +G+G++ P + + +
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 224 A---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQ 270
A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+ + S + + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFFHKVILDPLIKNEDR 257
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAG 383
++G HANT IP VIG + EV+ D + FF + V GG S
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKDDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 384 EFWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEM--------VYADYYERALT 434
E + S L + E+C TYNML++++ L++ + ++ Y DYYERAL
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 435 NGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
N +LS Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|298384655|ref|ZP_06994215.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
gi|298262934|gb|EFI05798.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
1_1_14]
Length = 802
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 131/409 (32%), Positives = 193/409 (47%), Gaps = 48/409 (11%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
SL DVKL SS +AQQT+L Y+L LD D L F + AG +Y WE+ L
Sbjct: 29 SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE 223
GH GHYLSA + M+A+T + + ++ +++ L Q +G+G++ P + + +
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 224 A---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQ 270
A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+ + S + + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKVILDRLIKNEDR 257
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAG 383
++G HANT IP VIG + EV+ + + FF + V GG S
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKNDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 384 EFWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEM--------VYADYYERALT 434
E + S L + E+C TYNML++++ L++ + ++ Y DYYERAL
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 435 NGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
N +LS Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|332662487|ref|YP_004445275.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332331301|gb|AEE48402.1| protein of unknown function DUF1680 [Haliscomenobacter hydrossis
DSM 1100]
Length = 793
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 128/403 (31%), Positives = 187/403 (46%), Gaps = 48/403 (11%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L EVSL LD H A+ N++ LL D+D L+ ++K AG P +Y W+
Sbjct: 32 LAEVSL----LDGPFKH--ARDLNIQTLLQYDIDRLLNPYRKEAGLPEKAASYPNWDG-- 83
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM-------GSGYLSAF 214
L GH GHYLSA A M A+T N ++++ ++S L CQ G GYL
Sbjct: 84 --LDGHVGGHYLSAMA-MNAATGNAECRKRLAYMLSELKACQEAHALKHPAWGIGYLGGV 140
Query: 215 PSE-------QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVE 263
P + F+AL+ W P+Y +HK+ +GL D + + + A L W +
Sbjct: 141 PKSAEIWSTFKNGDFKALRAAWVPWYNVHKLYSGLRDAWLYTGDETAKTLFLDFCDWGIA 200
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
+ S + + L+ E GGMN++ Y +T D K+L A F L
Sbjct: 201 --------ITANLSEAQMQSMLDIEHGGMNEIFADAYQMTGDEKYLKAAKGFSHQALLDP 252
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
+++ D++ HANT +P +G Q E++ + Y G FF + V + A GG S
Sbjct: 253 MSMGKDNLDNKHANTQVPKAVGFQRIAELSKEDKYAKAGRFFWETVTSKRSLALGGNSRR 312
Query: 384 EFWSDPKRLAS---TLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSI 440
EF+ P A E ESC +YNMLK++ LFR Y DYYER L N +LS
Sbjct: 313 EFF--PSIAAGRDFVHDVEGPESCNSYNMLKLTEELFRANPSGHYIDYYERTLYNHILST 370
Query: 441 QRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Q E G +Y P ++ + Y + WCC G+G+
Sbjct: 371 QH-PEHGGYVYFTP-----ARPRHYRVYSAPNQGMWCCVGSGM 407
>gi|29345759|ref|NP_809262.1| hypothetical protein BT_0349 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337652|gb|AAO75456.1| Acetyl-CoA carboxylase, biotin carboxylase [Bacteroides
thetaiotaomicron VPI-5482]
Length = 802
Score = 181 bits (460), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 131/409 (32%), Positives = 193/409 (47%), Gaps = 48/409 (11%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
SL DVKL SS +AQQT+L Y+L LD D L F + AG +Y WE+ L
Sbjct: 29 SLQDVKL-LSSPFLQAQQTDLHYILALDPDRLSAPFLREAGLTPKAPSYTNWEN--TGLD 85
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFE 223
GH GHYLSA + M+A+T + + ++ +++ L Q +G+G++ P + + +
Sbjct: 86 GHIGGHYLSALSMMYAATGDTAIYHRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKEIK 145
Query: 224 A---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQ 270
A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 146 AGDIRAGGFSLNGKWVPLYNIHKTYAGLRDAYLYAHSDLARQMLIDLTDWMID------- 198
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+ + S + + L E GG+N+ + IT D K+L LA F L L D
Sbjct: 199 -ITSGLSDNQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKVILDPLIKNEDR 257
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPL-------YKVTGTFFMDIVNASHGYATGGTSAG 383
++G HANT IP VIG + EV+ + + FF + V GG S
Sbjct: 258 LNGMHANTQIPKVIGYKRVAEVSKNDKDWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVR 317
Query: 384 EFWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEM--------VYADYYERALT 434
E + S L + E+C TYNML++++ L++ + ++ Y DYYERAL
Sbjct: 318 EHFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSGDVDNSNKPDPRYVDYYERALY 377
Query: 435 NGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
N +LS Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 378 NHILSSQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|294675240|ref|YP_003575856.1| hypothetical protein PRU_2607 [Prevotella ruminicola 23]
gi|294471633|gb|ADE81022.1| conserved hypothetical protein [Prevotella ruminicola 23]
Length = 788
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 134/402 (33%), Positives = 188/402 (46%), Gaps = 39/402 (9%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
+ E L DV L L A+ N+E LL D D L+ + K AG GK+Y W+
Sbjct: 17 YANEFPLGDVTLLNGPLK-HARDLNIETLLKYDNDRLLAPYLKEAGLTPKGKSYPNWDG- 74
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM-------GSGYLSA 213
L GH GHYL+A A + A+T + +++M +S L C + G GY+
Sbjct: 75 ---LDGHVGGHYLTAMA-INAATGSQECRKRMEYWISELQACADANAKNHPDWGRGYVGG 130
Query: 214 FPSEQFDR---------FEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEY 264
P DR F W P+Y IHK+ AGL D + + N QA K+ ++
Sbjct: 131 VPGS--DRIWSNFKKGNFGPYFGAWVPFYNIHKMYAGLRDAWVYCGNEQAKKLFLGFCDW 188
Query: 265 FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLL 324
+ N +T +ER +L+ E GGMN+VL Y IT + K+L +A F L L
Sbjct: 189 AIDLTAN-LTDAQMER---ALDTEHGGMNEVLADAYAITGEQKYLDVARRFSHRRLLNPL 244
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+ D + HANT +P VIG + E++GD Y G +F DIV A GG S E
Sbjct: 245 MQRRDVLDNMHANTQVPKVIGFERIAELSGDEAYHTAGAYFWDIVTGERTLAFGGNSRRE 304
Query: 385 FWSDPKRLAS---TLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
+ P R A + ESC T NMLK++ L R E YAD++E A N +LS Q
Sbjct: 305 HF--PSREACQDFVQDIDGPESCNTNNMLKLTEDLHRRNPEARYADFFELATFNHILSTQ 362
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E G +Y ++ + Y + + WCC GTG+
Sbjct: 363 H-PEHGGYVYFT-----SARPRHYRNYSAPNEAMWCCVGTGM 398
>gi|315498334|ref|YP_004087138.1| hypothetical protein Astex_1314 [Asticcacaulis excentricus CB 48]
gi|315416346|gb|ADU12987.1| protein of unknown function DUF1680 [Asticcacaulis excentricus CB
48]
Length = 774
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 120/396 (30%), Positives = 187/396 (47%), Gaps = 32/396 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L V+L PS + + + N YLL L D + +F+K AG G+ Y GWE + G
Sbjct: 38 LSQVRLKPS-IFLTSIEANQRYLLSLSPDRFLHNFRKGAGLEPKGEVYGGWE--ARGIAG 94
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP----SEQFDR- 221
H +GHYLS + M+A T +++ V+S L Q K GY ++ D
Sbjct: 95 HSLGHYLSGLSLMYAQTGKPEFRDRAAHVLSELKTIQAKHSDGYAGGTTVGRNGQEVDGK 154
Query: 222 --FEALKPV------------WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYN 267
+E L+ W P YT HK+ AG LD + +A AL + + +Y
Sbjct: 155 VVYEELRKGDIRTSGFDLNGGWVPLYTYHKVFAGALDAHQYAGLADALIVATGLGDYL-- 212
Query: 268 RVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQ 327
++ S + L E GG+ + LY T++ + L L+ + LA
Sbjct: 213 --GTILESLSDAQIQEILRAEHGGLTESYAELYARTKNQRWLTLSQRLRHRAIVDPLAAG 270
Query: 328 ADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWS 387
D+++G HANT IP ++GS +E+T + FF V+ H Y GG S E +
Sbjct: 271 HDELAGKHANTQIPKIVGSARLFELTQNADDARIARFFWQTVSRDHSYVIGGNSDHEHFG 330
Query: 388 DPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPG 447
P++LAS L + E+C +YNML+++RHL+ W+ + D+YER N ++S Q+ + G
Sbjct: 331 APRQLASRLDQQTCEACNSYNMLRLTRHLYGWSGDAALFDFYERTHLNHIMS-QQDPQTG 389
Query: 448 VMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ Y L G + S + FWCC G+G+
Sbjct: 390 MFTYFTGLASGLGRVHS-----DPTNDFWCCVGSGM 420
>gi|293370109|ref|ZP_06616674.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292634837|gb|EFF53361.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 800
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 128/408 (31%), Positives = 193/408 (47%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L LD D L+ F + AG +Y WE+ L G
Sbjct: 30 LQNVKLLDSPF-LQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L+ Q +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + EV+ D + FF + V GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAEVSQDDKTWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|423303007|ref|ZP_17281028.1| hypothetical protein HMPREF1057_04169 [Bacteroides finegoldii
CL09T03C10]
gi|408470336|gb|EKJ88871.1| hypothetical protein HMPREF1057_04169 [Bacteroides finegoldii
CL09T03C10]
Length = 801
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 131/396 (33%), Positives = 179/396 (45%), Gaps = 38/396 (9%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L+DV+L AQ N LL DVD L+ F AG + + W L G
Sbjct: 34 LNDVQLLDGPFK-HAQDLNRSVLLEYDVDRLLAPFLIEAGLEPKAEKFPNWPG----LDG 88
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALK 226
H GHYLSA A + + K +M ++S L CQ G GY+ P+ + E K
Sbjct: 89 HVAGHYLSAMAMNYRAGGGEEFKRRMEYILSELYRCQQANGDGYIGGIPNGKAGWKEIKK 148
Query: 227 -------PVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQNVITK 275
WAP+Y +HK+ AGL D + +AD+ A KM W + VI+
Sbjct: 149 GNVGIIWKYWAPWYNLHKLYAGLRDAWLYADSELAKKMFLDYCDWGI--------GVISG 200
Query: 276 YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
+ E+ LN E GGMN+V Y I+ D K+L A F + D++ H
Sbjct: 201 LNDEQMEQMLNNEFGGMNEVFADAYQISGDTKYLDAAKRFSHKWLFESMRDGKDNLDNKH 260
Query: 336 ANTHIPVVIGSQMRYEVT------GDPL-YKVTGTFFMDIVNASHGYATGGTSAGE-FWS 387
ANT +P +G Q E++ GD + Y FF V A+ A GG S E F
Sbjct: 261 ANTQVPKAVGYQRVAELSVQAKRSGDAVDYTRAAYFFWQTVTANRSLAFGGNSRREHFPD 320
Query: 388 DPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPG 447
D L+ E ESC TYNML+++ LFR + YAD+YERAL N +LS Q G
Sbjct: 321 DADYLSYVDDREGPESCNTYNMLRLTEGLFRKDPKAAYADFYERALFNHILSTQHPVHGG 380
Query: 448 VMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+Y P ++ Y + + WCC GTG+
Sbjct: 381 Y-VYFTP-----ARPAHYRVYSAPNEAMWCCVGTGM 410
>gi|440700043|ref|ZP_20882328.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
gi|440277439|gb|ELP65547.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
Length = 934
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 106/284 (37%), Positives = 149/284 (52%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E++ VWAPYYT HKIL GLLD Y D+++AL + M +
Sbjct: 383 GFLAAYPETQFIALESMTSGDYTKVWAPYYTAHKILKGLLDAYLATDDSRALDLASGMCD 442
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ Y+R+ + +++R W + E GG+ + + LYTIT +HL LA LFD +
Sbjct: 443 WMYSRLSK-LPDATLQRMWGIFSSGEFGGIVETIVDLYTITNKAEHLALAKLFDLDTLID 501
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D ++G HAN HIP+ G Y+ TG+ Y F +V Y GGTS
Sbjct: 502 ACAANTDTLNGLHANQHIPIFTGYVRLYDATGEARYLTAAKNFWGMVIPQRMYGIGGTST 561
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
GEFW +A T+ N E+C YN+LK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 562 GEFWKARGVIAGTVSDTNAETCCAYNLLKLSRTLFFHEQDPKYMDYYERALYNQVLGSKQ 621
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E ++ Y + L G + Y T CC GTG+
Sbjct: 622 DKADAEKPLVTYFIGLNPG--HVRDY----TPKQGTTCCEGTGM 659
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 55/110 (50%), Gaps = 6/110 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
L+ L DV L + +Q L++ DV+ L+ F+ AG T G A GWE
Sbjct: 44 LRPFELKDVALGQGVFASK-RQLMLDHGRGYDVNRLLQVFRANAGLSTGGAVAPGGWEGL 102
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
+ LRGH+ GH+LS + +AST + +++ +V AL++ + +
Sbjct: 103 DGEANGNLRGHYTGHFLSMLSQAYASTRDQAYADRIATMVGALTDVRAAL 152
>gi|299146241|ref|ZP_07039309.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
gi|298516732|gb|EFI40613.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
3_1_23]
Length = 800
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 193/408 (47%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L LD D L+ F + AG +Y WE+ L G
Sbjct: 30 LQNVKLLDSPF-LQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L+ Q +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|295085157|emb|CBK66680.1| Uncharacterized protein conserved in bacteria [Bacteroides
xylanisolvens XB1A]
Length = 800
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 193/408 (47%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L LD D L+ F + AG +Y WE+ L G
Sbjct: 30 LQNVKLLDSPF-LQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L+ Q +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|160882548|ref|ZP_02063551.1| hypothetical protein BACOVA_00499 [Bacteroides ovatus ATCC 8483]
gi|156112129|gb|EDO13874.1| hypothetical protein BACOVA_00499 [Bacteroides ovatus ATCC 8483]
Length = 801
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 131/396 (33%), Positives = 179/396 (45%), Gaps = 38/396 (9%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L DV+L AQ N LL DVD L+ F AG + + W L G
Sbjct: 34 LSDVQLLDGPFK-HAQDLNRSVLLEYDVDRLLAPFLIEAGLKPKAEKFPNWPG----LDG 88
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALK 226
H GHYLSA A + + K +M ++S L +CQ G GY+ P+ + E K
Sbjct: 89 HVAGHYLSAMAMNYRAGDGEEFKRRMEYMLSELYKCQQANGDGYIGGIPNGKAGWKEIKK 148
Query: 227 -------PVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQNVITK 275
WAP+Y +HK+ AGL D + +AD+ A KM W + VI+
Sbjct: 149 GNVGIIWKYWAPWYNLHKLYAGLRDAWLYADSELAKKMFLDYCDWGI--------GVISG 200
Query: 276 YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
+ E+ LN E GGMN+V Y I+ D K+L A F + D++ H
Sbjct: 201 LNDEQMEQMLNNEFGGMNEVFADAYQISGDTKYLDAAKRFSHKWLFESMRDGKDNLDNKH 260
Query: 336 ANTHIPVVIGSQMRYEVT------GDPL-YKVTGTFFMDIVNASHGYATGGTSAGE-FWS 387
ANT +P +G Q E++ GD + Y FF V A+ A GG S E F
Sbjct: 261 ANTQVPKAVGYQRVAELSVQAKRSGDAVDYTRAAYFFWQTVTANRSLAFGGNSRREHFPD 320
Query: 388 DPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPG 447
D L+ E ESC TYNML+++ LFR + YAD+YERAL N +LS Q G
Sbjct: 321 DADYLSYVDDREGPESCNTYNMLRLTEGLFRKDPKAAYADFYERALFNHILSTQHPVHGG 380
Query: 448 VMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+Y P ++ Y + + WCC GTG+
Sbjct: 381 Y-VYFTP-----ARPAHYRVYSAPNEAMWCCVGTGM 410
>gi|336417295|ref|ZP_08597620.1| hypothetical protein HMPREF1017_04728 [Bacteroides ovatus
3_8_47FAA]
gi|335936275|gb|EGM98208.1| hypothetical protein HMPREF1017_04728 [Bacteroides ovatus
3_8_47FAA]
Length = 800
Score = 178 bits (452), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 193/408 (47%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L LD D L+ F + AG +Y WE+ L G
Sbjct: 30 LQNVKLLDSPF-LQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L+ Q +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 147 GKIHAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|336405535|ref|ZP_08586212.1| hypothetical protein HMPREF0127_03525 [Bacteroides sp. 1_1_30]
gi|335937406|gb|EGM99306.1| hypothetical protein HMPREF0127_03525 [Bacteroides sp. 1_1_30]
Length = 800
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 193/408 (47%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L LD D L+ F + AG +Y WE+ L G
Sbjct: 30 LQNVKLLDSPF-LQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L+ Q +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLAHQMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|298484121|ref|ZP_07002288.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
gi|298269711|gb|EFI11305.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides sp.
D22]
Length = 776
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 193/408 (47%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L LD D L+ F + AG +Y WE+ L G
Sbjct: 6 LQNVKLLDSPF-LQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 62
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L+ Q +G+G++ P +
Sbjct: 63 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 122
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 123 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 174
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 175 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 234
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V GG S E
Sbjct: 235 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 294
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 295 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 354
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 355 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 396
>gi|423213125|ref|ZP_17199654.1| hypothetical protein HMPREF1074_01186 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694381|gb|EIY87609.1| hypothetical protein HMPREF1074_01186 [Bacteroides xylanisolvens
CL03T12C04]
Length = 800
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 193/408 (47%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L LD D L+ F + AG +Y WE+ L G
Sbjct: 30 LQNVKLLDSPF-LQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L+ Q +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|423230906|ref|ZP_17217310.1| hypothetical protein HMPREF1063_03130 [Bacteroides dorei
CL02T00C15]
gi|423244617|ref|ZP_17225692.1| hypothetical protein HMPREF1064_01898 [Bacteroides dorei
CL02T12C06]
gi|392630026|gb|EIY24028.1| hypothetical protein HMPREF1063_03130 [Bacteroides dorei
CL02T00C15]
gi|392641466|gb|EIY35242.1| hypothetical protein HMPREF1064_01898 [Bacteroides dorei
CL02T12C06]
Length = 797
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 129/400 (32%), Positives = 186/400 (46%), Gaps = 37/400 (9%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L EV L D S +A + YLL LDVD L+ +++ G G Y GWE
Sbjct: 44 LSEVELTD------SYFKKAMDLHKGYLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 94
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
+ G GHY+SA A M+AST L +K+ ++ L ECQ + G+ +
Sbjct: 95 -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 153
Query: 222 FEALK------------PVWA------PYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
+ L+ W +Y IHKILAGL D Y +A QA + + +
Sbjct: 154 LQLLQGNVVLNQPDETGQPWNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLAD 213
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
+ + ++ + + ++L+ E GGMN+V +Y+IT D K L A F+ +
Sbjct: 214 F----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIYP 269
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
+A D + G HAN IP +G YE + + +Y F +IV H A GG S
Sbjct: 270 IANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSCY 329
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E + + L + E+C TYNMLK+SR LF + Y +YYE AL N +L+ Q
Sbjct: 330 ERFGVLGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQDP 389
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PG + Y L G S+ + T F SFWCC GTG+
Sbjct: 390 DMPGCVTYYTSLLPG-----SFKQYSTPFDSFWCCVGTGM 424
>gi|237711613|ref|ZP_04542094.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
gi|229454308|gb|EEO60029.1| acetyl-CoA carboxylase [Bacteroides sp. 9_1_42FAA]
Length = 770
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 129/400 (32%), Positives = 186/400 (46%), Gaps = 37/400 (9%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L EV L D S +A + YLL LDVD L+ +++ G G Y GWE
Sbjct: 17 LSEVELTD------SYFKKAMDLHKGYLLSLDVDRLIPHVRRSVGLQGKGDNYGGWE--- 67
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDR 221
+ G GHY+SA A M+AST L +K+ ++ L ECQ + G+ +
Sbjct: 68 -KHGGCTYGHYMSACAMMYASTGEKALLDKLNYMLDELQECQKQTPDGWFITGKRGKEGY 126
Query: 222 FEALK------------PVWA------PYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
+ L+ W +Y IHKILAGL D Y +A QA + + +
Sbjct: 127 LQLLQGNVVLNQPDETGQPWNYNQNGNSWYCIHKILAGLRDAYVYAGCRQAKDILMPLAD 186
Query: 264 YFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
+ + ++ + + ++L+ E GGMN+V +Y+IT D K L A F+ +
Sbjct: 187 F----ISHIALNSNRDLFQSTLSVEQGGMNEVFVDIYSITGDKKFLQTAERFNHINVIYP 242
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
+A D + G HAN IP +G YE + + +Y F +IV H A GG S
Sbjct: 243 IANGEDVLFGRHANDQIPKFMGVAREYEFSPNDIYYQAARNFWNIVIKDHTLAIGGNSCY 302
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
E + + L + E+C TYNMLK+SR LF + Y +YYE AL N +L+ Q
Sbjct: 303 ERFGVLGEESKRLDYTSAETCNTYNMLKLSRQLFMLDGDYKYLNYYEHALYNHILASQDP 362
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
PG + Y L G S+ + T F SFWCC GTG+
Sbjct: 363 DMPGCVTYYTSLLPG-----SFKQYSTPFDSFWCCVGTGM 397
>gi|383777661|ref|YP_005462227.1| hypothetical protein AMIS_24910 [Actinoplanes missouriensis 431]
gi|381370893|dbj|BAL87711.1| hypothetical protein AMIS_24910 [Actinoplanes missouriensis 431]
Length = 939
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 114/335 (34%), Positives = 170/335 (50%), Gaps = 21/335 (6%)
Query: 158 EDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE 217
E+ + ELRG+ + + + + + ++ AV++ + +G+L+A+P
Sbjct: 350 EEISGELRGNLAWYRFDETEGT--TVADASGRDWDAAVITGVGGAPGPSHAGFLAAYPET 407
Query: 218 QFDRFEAL---KPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVIT 274
QF E L +WAPYYT HKI+ GLLD +T N AL + + M E+ ++R+ +
Sbjct: 408 QFVLLEQLTTYPAIWAPYYTCHKIMRGLLDAHTLGGNATALDVVRGMGEWAHSRLSK-LP 466
Query: 275 KYSVERHWN-SLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
+ ++R W + E GGMN+V+ L T+T + L A FD L D + G
Sbjct: 467 REQLDRMWALYIAGEYGGMNEVMVDLATLTGNKTFLETARFFDNTKLLADCVADIDSLDG 526
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
HAN HIP +G YE D Y+ F D+V Y GGT GE + +A
Sbjct: 527 KHANQHIPQFLGYLRLYENGADKTYRTAAANFFDMVVPHRTYMHGGTGQGEVFRKRDVIA 586
Query: 394 -STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG----TEPGV 448
S + T N ESC YNMLKV+R+LF + + DYYE+AL N +L+ +R T+P +
Sbjct: 587 GSIVNTTNAESCAAYNMLKVARNLFSHAPDGRFMDYYEKALVNQILASRRDVDSTTDP-L 645
Query: 449 MIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ YM+P+G G + Y GT CC GTGL
Sbjct: 646 VTYMVPVGPG--ARRGYGNIGT------CCGGTGL 672
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 2/100 (2%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCEL 164
L V L PS + + L Y D D +V +F+ AG G + GW+D T L
Sbjct: 70 GLDQVDLLPSIFTEKRDRI-LAYARAYDADRIVSNFRTAAGLDNRGAQPPGGWDDATGNL 128
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQN 204
RGH+ GH++S A WA T KEK+ +V+AL ECQ+
Sbjct: 129 RGHYSGHFISMLAQAWADTGEAIFKEKLDYIVTALKECQD 168
>gi|160883737|ref|ZP_02064740.1| hypothetical protein BACOVA_01709 [Bacteroides ovatus ATCC 8483]
gi|423297720|ref|ZP_17275780.1| hypothetical protein HMPREF1070_04445 [Bacteroides ovatus
CL03T12C18]
gi|156110822|gb|EDO12567.1| hypothetical protein BACOVA_01709 [Bacteroides ovatus ATCC 8483]
gi|392665078|gb|EIY58610.1| hypothetical protein HMPREF1070_04445 [Bacteroides ovatus
CL03T12C18]
Length = 800
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 193/408 (47%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L LD D L+ F + AG +Y WE+ L G
Sbjct: 30 LQNVKLLDSPF-LQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L+ Q +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHNLILDPLIKEEDKL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|423287556|ref|ZP_17266407.1| hypothetical protein HMPREF1069_01450 [Bacteroides ovatus
CL02T12C04]
gi|392672671|gb|EIY66138.1| hypothetical protein HMPREF1069_01450 [Bacteroides ovatus
CL02T12C04]
Length = 800
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 193/408 (47%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L LD D L+ F + AG +Y WE+ L G
Sbjct: 30 LQNVKLLDSPF-LQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L+ Q +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHNLILDPLIKEEDKL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|237722208|ref|ZP_04552689.1| acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
gi|229448018|gb|EEO53809.1| acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
Length = 800
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 192/408 (47%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L LD D L+ F + AG +Y WE+ L G
Sbjct: 30 LQNVKLLDSPF-LQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L+ Q +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYSRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARQMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKDEDKL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNNTQEPDPNYVNYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|317476510|ref|ZP_07935758.1| hypothetical protein HMPREF1016_02741 [Bacteroides eggerthii
1_2_48FAA]
gi|316907322|gb|EFV29028.1| hypothetical protein HMPREF1016_02741 [Bacteroides eggerthii
1_2_48FAA]
Length = 793
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 124/402 (30%), Positives = 185/402 (46%), Gaps = 39/402 (9%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
+ E L V L +L A+ N+ LL + D L+ ++K AG + Y W+
Sbjct: 23 YPNEFPLSQVTLLEGTLK-SARDLNINTLLKYNCDRLLAPYRKEAGLTPKAECYPNWDG- 80
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQN-------KMGSGYLSA 213
L GH GHYL+A A + A+T N +++M ++ ++EC + G GY+
Sbjct: 81 ---LDGHVGGHYLTAMA-INAATGNEECRKRMEYIIKEIAECAEANRKNHPEWGVGYMGG 136
Query: 214 FPSEQ-----FDR--FEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMV 262
P+ Q F + F WAP+Y +HK+ AGL D + + N QA L+ W +
Sbjct: 137 MPNSQNIWSNFKKGDFRVYSGSWAPFYNLHKMYAGLRDAWLYCGNEQAKDLFLQFCDWAI 196
Query: 263 EYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ V + S ++ L E GGMN+VL Y IT + K+L A F
Sbjct: 197 D--------VTSNLSDKQMEQMLGNEHGGMNEVLADAYAITHEQKYLDCAKRFSHKQLFT 248
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
L + D + HANT +P IG + E++G+ Y + +FF DIV A GG S
Sbjct: 249 PLLQRQDCLDNLHANTQVPKAIGFERISELSGNEDYHMASSFFWDIVTGERSLAFGGNSR 308
Query: 383 GEFWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E + + + ESC T NMLK++ +L R E YADYYE A N +LS Q
Sbjct: 309 REHFPAKDACMDFINDIDGPESCNTNNMLKLTENLHRRNPEARYADYYELATFNHILSTQ 368
Query: 442 RGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P ++ + Y + + WCC GTG+
Sbjct: 369 HPKHGGY-VYFTP-----ARPRHYRNYSAPNEAMWCCVGTGM 404
>gi|373463723|ref|ZP_09555310.1| hypothetical protein HMPREF9104_01016 [Lactobacillus kisonensis
F0435]
gi|371763942|gb|EHO52383.1| hypothetical protein HMPREF9104_01016 [Lactobacillus kisonensis
F0435]
Length = 747
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 130/423 (30%), Positives = 201/423 (47%), Gaps = 50/423 (11%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA-YEGWEDP 160
+K VS ++VK P+S + N+ ++L L D L+++++ AG T G WE P
Sbjct: 22 MKPVSYYNVKYLPNSTLKEKFERNVNWMLSLTPDQLLYNYRINAGLDTKGATPLTVWESP 81
Query: 161 TCELRGHFVGHYLSASAHMWASTHNVT-------LKEKMTAVVSALSECQNKMGS----- 208
RGHF GHYLS ++ + +N+ LK+++ +V L ECQ K +
Sbjct: 82 DWFFRGHFTGHYLSGASRSFVELNNMEDTKEANELKDRVNKIVDGLKECQEKFDTFEEFP 141
Query: 209 GYLSAFPSEQFDRFEALK---PVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYF 265
GYL+A PS++FD E L+ + PYY + K++ GL+D Y FA N AL++T M YF
Sbjct: 142 GYLAAEPSKRFDDVEKLRFNGNHYVPYYAVQKLMDGLMDAYEFAGNQTALELTMNMTHYF 201
Query: 266 YNRVQNVITK----------YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLL--LAH 313
R++ + + Y + H+ ++E G M+ L RLY IT + + LA
Sbjct: 202 EKRMERLTPEQINAMIDTRWYQGKGHY-VYHQEFGAMHRTLLRLYEITDKKQKDIFDLAQ 260
Query: 314 LFDKPCFLGLLAVQADDISGF---HANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVN 370
FD+ F +L + DD G+ HANT + G Y VTGD YK +M+ ++
Sbjct: 261 KFDRKWFRDML-INNDDELGYYSCHANTELVCAEGMLEYYHVTGDENYKKGVVNYMNWMH 319
Query: 371 ASHGYATGGTS-----------AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRW 419
H T G S E + P+ L N ESC ++++ +S LF
Sbjct: 320 DGHELPTKGISGRSAYPAPADYGSELYDYPEMFFKHLSMLNGESCCSHDLNFLSSELFAD 379
Query: 420 TKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCY 479
TK+ D YE N +++ Q+ + + Y+ L + K Y G FWCC
Sbjct: 380 TKDATLLDDYEIRFINAIMA-QQNNDSAIAEYLYNLSVAPNSTKEYSHTG-----FWCCT 433
Query: 480 GTG 482
G+G
Sbjct: 434 GSG 436
>gi|404451488|ref|ZP_11016452.1| hypothetical protein A33Q_19258 [Indibacter alkaliphilus LW1]
gi|403762834|gb|EJZ23856.1| hypothetical protein A33Q_19258 [Indibacter alkaliphilus LW1]
Length = 1019
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 144/470 (30%), Positives = 219/470 (46%), Gaps = 86/470 (18%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQ--QTNLEYLLML---DVDSLVWSFQKTAGSPTAGKAYE- 155
L+ LH + L+ + + + ++LL L D +S ++ F+ P A
Sbjct: 373 LELFKLHQINLEEDQTGQKTKFIENRDKFLLTLAETDPNSFLYMFRHAFDQPQPENAVPL 432
Query: 156 -GWEDPTCELRGHFVGHYLSASAHMWAST-HNVTLKE----KMTAVVSALSECQ----NK 205
W+ +LRGH GHYL+A A +AST ++ L++ KM +V+ L + NK
Sbjct: 433 GVWDSQETKLRGHATGHYLTAIAQAYASTGYDEVLQQNFLDKMDYMVNVLYDLSKLSGNK 492
Query: 206 M------------------------------------GSGYLSAFPSEQFDRFEA----- 224
+ G GY+SA+P +QF E
Sbjct: 493 VNGKGNEDPVLVPKGPGKSDFDSDLSDEGIRSDYWNWGKGYISAYPPDQFIMLEKGATYG 552
Query: 225 --LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHW 282
+WAPYYT+HKILAGL+D Y + N +AL++ K M E+ Y R+ + + + ++ + W
Sbjct: 553 GQKNQIWAPYYTLHKILAGLIDIYKVSGNEKALEIAKGMGEWVYTRL-DALPQETLIKMW 611
Query: 283 NS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDK-PCFLGL------LAVQADDISGF 334
N+ + E GGMN+ + LY ITQDP+ L A LFD F G LA D G
Sbjct: 612 NTYIAGEFGGMNETMATLYEITQDPRFLKGAQLFDNIQMFFGDAEYSHGLAKNVDTFRGL 671
Query: 335 HANTHIPVVIGSQMRYEVTG-DPLYKVTGTFFMDIVNASHGYATGGTSAGE-------FW 386
HAN HIP V+GS Y V+ D ++V ++ VN + Y+ GG + F
Sbjct: 672 HANQHIPQVVGSLEMYRVSAKDEYFRVADNYWFKAVN-DYMYSIGGVAGARNPANAECFI 730
Query: 387 SDPKRL---ASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRG 443
++P L + G +N E+C TYNMLK++ +LF + + DY+ER L N +L+
Sbjct: 731 AEPATLYENGFSSGGQN-ETCATYNMLKLTGNLFLFEQRGELMDYFERGLYNHILASVAE 789
Query: 444 TEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGLGFIRWIQLSL 493
P Y +PL G K H + + F CC GT + +Q S+
Sbjct: 790 DSPA-NTYHVPLRPGSIK----HFGNAKMTGFTCCNGTSIESNTKLQQSI 834
>gi|262407626|ref|ZP_06084174.1| acetyl-CoA carboxylase [Bacteroides sp. 2_1_22]
gi|294644495|ref|ZP_06722254.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294808396|ref|ZP_06767149.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|345511903|ref|ZP_08791442.1| acetyl-CoA carboxylase [Bacteroides sp. D1]
gi|262354434|gb|EEZ03526.1| acetyl-CoA carboxylase [Bacteroides sp. 2_1_22]
gi|292640162|gb|EFF58421.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294444324|gb|EFG13038.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|345453983|gb|EEO49450.2| acetyl-CoA carboxylase [Bacteroides sp. D1]
Length = 800
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 126/408 (30%), Positives = 193/408 (47%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L LD D L+ F + AG +Y WE+ L G
Sbjct: 30 LQNVKLLDSPF-LQAQQTDLHYILALDPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L+ Q +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELNRAQQTVGTGFIGGTPGSLQLWKDIKA 146
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A +M T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYIYAGSDLARQMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L + D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLELARRFSHKLILDPLIKEEDKL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYN+L++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNILRLTKMLYQNSHNPNQTNEPDPNYVNYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|427386394|ref|ZP_18882591.1| hypothetical protein HMPREF9447_03624 [Bacteroides oleiciplenus YIT
12058]
gi|425726434|gb|EKU89299.1| hypothetical protein HMPREF9447_03624 [Bacteroides oleiciplenus YIT
12058]
Length = 792
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 119/391 (30%), Positives = 182/391 (46%), Gaps = 47/391 (12%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHM 179
+AQQT+L Y+L ++ D L+ F + AG +Y WE+ L GH GHY+SA + M
Sbjct: 42 QAQQTDLHYILAMEPDRLLAPFLREAGLAPKAPSYTNWEN--TGLDGHIGGHYISALSMM 99
Query: 180 WASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE---------------QFDRFEA 224
+A+T + + ++ ++ L Q +G+G++ P FD
Sbjct: 100 YAATGDTAVYNRLNYMLDELHRAQQAVGTGFIGGTPGSLQLWKEIKEGNIRAGGFD---- 155
Query: 225 LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQNVITKYSVER 280
L W P Y IHK AGL D Y +A + A +M T WM+ + + ++
Sbjct: 156 LNSKWVPLYNIHKTYAGLRDAYLYAGSDLAREMLIALTDWMI--------GITAGLTDQQ 207
Query: 281 HWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHI 340
+ L E GG+N+ + IT D K+L LA F L L D ++G HANT I
Sbjct: 208 MQDMLRSEHGGLNETFADVAAITGDKKYLELARRFSHKVILDPLIKDEDRLTGMHANTQI 267
Query: 341 PVVIGSQMRYEVTGDP-------LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
P VIG + E++ D + FF + V GG S E + +
Sbjct: 268 PKVIGYKRIAELSQDDNVWNHATEWDHAARFFWNTVVNHRSVCIGGNSVREHFHPANDFS 327
Query: 394 STLG-TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYM 452
L E E+C TYNML++++ L++ + + +ADYYERAL N +L+ Q + G +Y
Sbjct: 328 PMLNDIEGPETCNTYNMLRLTKMLYQDSPDSRFADYYERALYNHILASQE-PDKGGFVYF 386
Query: 453 LPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
P+ G Y + +S WCC G+GL
Sbjct: 387 TPMRPG-----HYRVYSQPETSMWCCVGSGL 412
>gi|383112514|ref|ZP_09933306.1| hypothetical protein BSGG_0614 [Bacteroides sp. D2]
gi|313693079|gb|EFS29914.1| hypothetical protein BSGG_0614 [Bacteroides sp. D2]
Length = 800
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 191/408 (46%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +VKL S +AQQT+L Y+L L+ D L+ F + AG +Y WE+ L G
Sbjct: 30 LQNVKLLDSPF-LQAQQTDLHYILALNPDRLLAPFLREAGLQPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF 219
H GHYLSA + M+A+T + + ++ +++ L Q +G+G++ P +
Sbjct: 87 HIGGHYLSALSMMYAATGDTAVYNRLNYMLNELHRAQQAVGTGFIGGTPGSLQLWKDIKA 146
Query: 220 DRFEA----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
+ A L W P Y IHK AGL D Y +A + A KM T WM++
Sbjct: 147 GKIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYAGSDLARKMLIDLTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S E+ + L E GG+N+ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLSDEQMQDMLRSEHGGLNETFADVAEITGDKKYLKLARRFSHKLILDPLIKDEDKL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGD-------PLYKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKSWSHAAEWDHAARFFWNTVVNHRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S L + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPSDNFTSMLNDVQGPETCNTYNMLRLTKMLYQNSHNPNNTQEPDPNYVNYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|374712027|gb|AEZ64557.1| putative secreted protein [Streptomyces chromofuscus]
Length = 933
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 104/284 (36%), Positives = 151/284 (53%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E++ VWAPYYT HKIL GLLD + + D+ +AL + + +
Sbjct: 382 GFLAAYPETQFITLESMTSSDYGVVWAPYYTAHKILRGLLDAHLYTDDPRALDLASGLCD 441
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ Y+R+ + +++R W + E GG+ + + L+ +T P+HL LA LFD +
Sbjct: 442 WMYSRLSR-LPASTLQRMWGIFSSGEFGGLVEAVCDLHALTGKPEHLALARLFDLDSLID 500
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D + G HAN HIP+ G ++ TG+ Y F D+V + Y GGTS
Sbjct: 501 ACAANRDVLDGLHANQHIPIFTGLLRLHDATGEARYLAAAKNFWDMVVPTRMYGIGGTST 560
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
GEFW +A T+ ESC YNMLK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 561 GEFWRGRGSVAGTISATTAESCCAYNMLKLSRLLFFHEQDPKYMDYYERALYNQVLGSKQ 620
Query: 443 GT---EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
T E ++ Y + L G + Y T + CC GTG+
Sbjct: 621 DTADAEKPLVTYFIGLTPG--HVRDY----TPKAGTTCCEGTGM 658
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 57/110 (51%), Gaps = 6/110 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
L+ L DV L P + ++ L++ DVD L+ F+ AG T G A GWE
Sbjct: 44 LRPFDLKDVTLGPGIFATK-RRFMLDHGRGYDVDRLLQVFRANAGLSTRGAVAPGGWEGL 102
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
+ LRGH+ GH+L+ A + ST + +++ ++V AL+E ++ +
Sbjct: 103 DGEANGNLRGHYTGHFLTMLAQSYGSTGDQVYADRIRSMVDALTEVRSAL 152
>gi|429195121|ref|ZP_19187172.1| hypothetical protein STRIP9103_04852 [Streptomyces ipomoeae 91-03]
gi|428669175|gb|EKX68147.1| hypothetical protein STRIP9103_04852 [Streptomyces ipomoeae 91-03]
Length = 936
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 105/284 (36%), Positives = 147/284 (51%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E++ VWAPYYT HKIL GLLD Y D+ +AL + + +
Sbjct: 384 GFLAAYPETQFIELESMTSGDYTRVWAPYYTAHKILRGLLDAYLNVDDARALDLASGLCD 443
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ Y+R+ + +++R W + E GG+ + + LYTIT +HL LA LFD +
Sbjct: 444 WMYSRLSK-LPDATLQRMWGIFSSGEFGGLVEAIVDLYTITGKAEHLALARLFDLDKLID 502
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D + G HAN HIP+ G Y+ TG+ Y F +V Y GGTS
Sbjct: 503 ACAANTDTLDGLHANQHIPIFTGLARLYDATGEVRYLTAAKNFWGMVVPPRMYGIGGTST 562
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
GEFW +A T+ N E+C YN+LK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 563 GEFWKARGVIAGTISDTNAETCCAYNLLKLSRTLFFHEQDPKYMDYYERALLNQVLGSKQ 622
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E ++ Y + L G + Y T CC GTG+
Sbjct: 623 DKTDAEKPLVTYFIGLKPG--HVRDY----TPKQGTTCCEGTGM 660
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 55/110 (50%), Gaps = 6/110 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
++ L DV L L +Q L++ DVD L+ F+ AG T G A GWE
Sbjct: 45 VRPFELKDVTLG-QGLFAGKRQLMLDHGRGYDVDRLLQVFRANAGLSTKGAVAPGGWEGL 103
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
+ LRGH+ GH+L+ A +AST + +K+ +V AL+E + +
Sbjct: 104 DGEANGNLRGHYTGHFLTTLAQAYASTADTVYADKIRYMVGALTEVRAAL 153
>gi|326798346|ref|YP_004316165.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326549110|gb|ADZ77495.1| protein of unknown function DUF1680 [Sphingobacterium sp. 21]
Length = 1022
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 158/548 (28%), Positives = 229/548 (41%), Gaps = 100/548 (18%)
Query: 20 CKECTNSFPQLASH---TFRYELLSSK------NETWKKEVYSHYHLTPTDDSAWSNLLP 70
K + P+L SH T+R + K T V T T ++L P
Sbjct: 296 VKTSIGNLPRLPSHIEGTYRQGINGPKVRVLWPAPTDNTAVLQAGRYTITGRVPGTDLQP 355
Query: 71 RKMLSETDEFSWTMIYRKMKNPDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLL 130
+ + T+I K + KLA L +VSL + + + L
Sbjct: 356 KAFV--------TVIEAKSSDIPSSKLAPFNLDQVSLEADAHGHKTKFIENRDKFINTLA 407
Query: 131 MLDVDSLVWSFQKTAGS--PTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTH---- 184
D +S ++ F+ G P + W+ +LRGH GHYL+A A +A T
Sbjct: 408 ATDPNSFLYMFRHAFGQKQPEGARPLGVWDSQETKLRGHATGHYLTAIAQAYAGTGYDKA 467
Query: 185 -NVTLKEKMTAVVSALSECQN--------------------------------------- 204
EKM +V+ L E
Sbjct: 468 LQAKFAEKMEYMVNTLYELSQLSGKPKEAGGIHVSDPTAVPYGPGKTEYDSDFSDEGIRT 527
Query: 205 ---KMGSGYLSAFPSEQFDRFE-------ALKPVWAPYYTIHKILAGLLDQYTFADNTQA 254
G G++SA+P +QF E VWAPYYT+HKILAGL+D Y + N +A
Sbjct: 528 DYWNWGEGFISAYPPDQFIMLERGAKYGGQKNQVWAPYYTLHKILAGLMDVYEVSGNKKA 587
Query: 255 LKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAH 313
L++ M ++ Y R+ + T+ ++ + WN+ + E GGMN+V+ RLY IT P +L A
Sbjct: 588 LEIATGMGDWVYARLSKLPTE-TLIKMWNTYIAGEFGGMNEVMARLYRITNKPNYLKTAQ 646
Query: 314 LFDK-PCFLG------LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPL-YKVTGTFF 365
LFD F G LA D G HAN HIP ++GS Y V+ +P+ Y + F+
Sbjct: 647 LFDNIKMFYGDASHSHGLAKNVDTFRGLHANQHIPQIVGSIEMYRVSNNPVYYSIADNFW 706
Query: 366 MDIVNASHGYATGGTSAGE-------FWSDPKRL---ASTLGTENEESCTTYNMLKVSRH 415
+VN + Y+ GG + F S P L + G +N E+C TYNMLK++
Sbjct: 707 YKVVN-DYMYSIGGVAGARNPANAECFISQPATLYENGFSAGGQN-ETCATYNMLKLTSD 764
Query: 416 LFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSF 475
LF + + DYYER L N +L+ P Y +PL G K + F
Sbjct: 765 LFLFDQRPELMDYYERGLYNHILASVAEDSP-ANTYHVPLRPGSIKQFG----NPHMTGF 819
Query: 476 WCCYGTGL 483
CC GT +
Sbjct: 820 TCCNGTAI 827
>gi|393782713|ref|ZP_10370896.1| hypothetical protein HMPREF1071_01764 [Bacteroides salyersiae
CL02T12C01]
gi|392672940|gb|EIY66406.1| hypothetical protein HMPREF1071_01764 [Bacteroides salyersiae
CL02T12C01]
Length = 796
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 127/396 (32%), Positives = 186/396 (46%), Gaps = 32/396 (8%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
SL DVKL S + A + YLL LDVD L+ ++ G + Y GWE
Sbjct: 41 SLSDVKL-TSGIFKGAMDLHKGYLLSLDVDRLIPHVRRNVGLTGKNENYGGWETHG---- 95
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQN-----------KMGSGYLSAF 214
G GHY+SA A M+AST ++++ ++ L ECQ + GY
Sbjct: 96 GCTYGHYMSACAMMYASTGEKIFRDRLEYMMDELKECQQQTQDGWFISGERAKEGYRKLL 155
Query: 215 PSEQF-DRFEALKPVWA------PYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYN 267
E F +R + K W +Y IHK+LAGL D Y +A +A ++ + ++
Sbjct: 156 HGEVFLNRPDETKQPWNYNQNGNSWYCIHKVLAGLRDVYLYAGIQKAKEILMPLADF--- 212
Query: 268 RVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQ 327
+ ++ + + ++L+ E GGMN+V +Y T D K+L A F+ + +A
Sbjct: 213 -IADIALNSNKDLFQSTLSVEQGGMNEVFTDIYAFTGDYKYLETACRFNHINVIYPVANG 271
Query: 328 ADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWS 387
D + G HAN IP IG Y +Y+ F D+V +H A GG S E +
Sbjct: 272 EDVLFGRHANDQIPKFIGVAKEYAYDTKEIYRKAAENFWDMVVNNHTLAIGGNSCYERFG 331
Query: 388 DPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPG 447
P + L + E+C TYNMLK+SR LF + Y +YYE AL N +L+ Q G
Sbjct: 332 MPGEESKRLDYSSAETCNTYNMLKLSRLLFMMNGDYKYLNYYEHALYNHILASQDPDMAG 391
Query: 448 VMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ Y L G S+ + T + SFWCC GTG+
Sbjct: 392 CVTYYTSLLPG-----SFKQYSTPYDSFWCCVGTGM 422
>gi|395772531|ref|ZP_10453046.1| glycosylase [Streptomyces acidiscabies 84-104]
Length = 828
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 106/285 (37%), Positives = 147/285 (51%), Gaps = 16/285 (5%)
Query: 208 SGYLSAFPSEQFDRFEAL-----KPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMV 262
+G+L+A+P QF + E++ VWAPYYT HKIL GLLD Y + +AL + M
Sbjct: 339 AGFLAAYPETQFIQLESMTASDYSKVWAPYYTAHKILRGLLDAYAATGDARALDLAGGMA 398
Query: 263 EYFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
++ ++R+ + +++R W + E GG+ + L LY +T +HL LA LFD +
Sbjct: 399 DWMHSRLSK-LPGATLQRMWGLFSSGEFGGIVEALCDLYDLTGKGEHLALARLFDLDRLI 457
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
A D + G HAN HIP+ G Y+ TG+ Y F D+V Y+ GGTS
Sbjct: 458 DACAANTDVLDGLHANQHIPIFTGYLRLYDATGEERYLAAARNFWDMVVPHRMYSIGGTS 517
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
EFW +A + + ESC YNMLK+SR LF ++ Y DYYERAL N VL +
Sbjct: 518 DAEFWRARDVVAGAISGASAESCCAYNMLKLSRALFLHAQDAKYMDYYERALFNQVLGSK 577
Query: 442 R---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
R E ++ Y L L G + Y T CC GTGL
Sbjct: 578 RDVADAEKPLVTYFLGLNPG--HVRDY----TPKQGTTCCEGTGL 616
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 56/110 (50%), Gaps = 6/110 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
L + L V L P L + +Q L++ DV+ L+ F+ AG T G A GWE
Sbjct: 7 LLPLPLDKVSLGPGLLADK-RQLMLDHARGYDVNRLLQVFRANAGLATLGAVAPGGWEGL 65
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
+ LRGH+ GH+L+ + +AST + EK+ +V AL+E + +
Sbjct: 66 DGEANGNLRGHYTGHFLTMLSQAYASTGDEVYAEKIRTIVGALTESREAL 115
>gi|255691978|ref|ZP_05415653.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Bacteroides
finegoldii DSM 17565]
gi|260622387|gb|EEX45258.1| hypothetical protein BACFIN_07051 [Bacteroides finegoldii DSM
17565]
Length = 800
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 124/408 (30%), Positives = 191/408 (46%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L DVKL S +AQQT+L Y+L L+ D L+ F + AG +Y WE+ L G
Sbjct: 30 LQDVKLLDSPF-LQAQQTDLHYILALNPDRLLAPFLREAGLTPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFEA 224
H GHYLSA + M+A+T + + ++ ++ L Q +G+G++ P + + +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAIYNRLNYMLDELYRAQQAVGTGFIGGTPGSLQLWKEIKA 146
Query: 225 ---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
L W P Y IHK AGL D Y + + QA +M T WM++
Sbjct: 147 GNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYTGSDQARRMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S ++ + L E G+N+ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLSDQQIQDMLRSEHSGLNETFADVAAITGDKKYLELARRFSHKIILDPLIKDKDRL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPL-------YKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V + GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAEEWDHAARFFWNTVVNNRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S + + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPADNFTSMINDVQGPETCNTYNMLRLTKMLYQNSHNPCNINEPDPNYINYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|399033094|ref|ZP_10732120.1| hypothetical protein PMI10_04007 [Flavobacterium sp. CF136]
gi|398068528|gb|EJL59944.1| hypothetical protein PMI10_04007 [Flavobacterium sp. CF136]
Length = 1019
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 193/433 (44%), Gaps = 83/433 (19%)
Query: 126 LEYLLMLDVDSLVWSFQKTAGS--PTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWAST 183
L L D DS ++ F+ G P + W+ +LRGH GHYL+A A +AST
Sbjct: 400 LTTLATTDPDSFLYMFRNAFGQEQPKEAEPLGVWDTQETKLRGHATGHYLTAIAQAYAST 459
Query: 184 H-----NVTLKEKMTAVVSAL----------SECQNKM---------------------- 206
K+KM +V+ L E K
Sbjct: 460 GYDKTLQANFKDKMEYMVNTLYDLEQLSGKPKEAGGKFVSDPTAIPFGPGKTNYDSDLSA 519
Query: 207 ----------GSGYLSAFPSEQFDRFE-------ALKPVWAPYYTIHKILAGLLDQYTFA 249
G G++SA+P +QF E +WAPYYT+HKILAGL+D Y +
Sbjct: 520 EGIRTDYWNWGKGFISAYPPDQFIMLENGATYGGQKTQIWAPYYTLHKILAGLMDVYEVS 579
Query: 250 DNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKH 308
N +AL+ K M ++ Y R++ + T+ ++ WN + E GGMN+ + RLY IT+DP +
Sbjct: 580 GNEKALETAKGMGDWVYARMKKLPTE-TLISMWNRYIAGEFGGMNEAMARLYRITKDPHY 638
Query: 309 LLLAHLFDK-PCFLGL------LAVQADDISGFHANTHIPVVIGSQMRYEVTGDP-LYKV 360
L +A LFD F G LA D G HAN HIP ++G+ Y + P Y+V
Sbjct: 639 LEVAQLFDNIKVFYGDANHSHGLAKNVDTFRGLHANQHIPQIMGALEMYRDSNTPDYYRV 698
Query: 361 TGTFFMDIVNASHGYATGGTSAGE-------FWSDPKRL---ASTLGTENEESCTTYNML 410
F+ VN + Y+ GG + F S P + + G +N E+C TYNML
Sbjct: 699 ADNFWYKTVN-DYMYSIGGVAGARNPANAECFISQPATIYENGFSSGGQN-ETCATYNML 756
Query: 411 KVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGT 470
K++ LF + + DYYER L N +LS P Y +PL G K
Sbjct: 757 KLTGDLFLYEQRGELMDYYERGLYNHILSSVAENSP-ANTYHVPLRPGSVK----QFGNP 811
Query: 471 RFSSFWCCYGTGL 483
+ F CC GT +
Sbjct: 812 HMTGFTCCNGTAI 824
>gi|290958971|ref|YP_003490153.1| glycosylase [Streptomyces scabiei 87.22]
gi|260648497|emb|CBG71608.1| putative secreted glycosylase [Streptomyces scabiei 87.22]
Length = 936
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 104/284 (36%), Positives = 145/284 (51%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E++ VWAPYYT HKIL GLLD Y D+ +AL + + +
Sbjct: 384 GFLAAYPETQFIELESMTSGDYTRVWAPYYTAHKILRGLLDAYLHVDDERALDLASGLCD 443
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ Y+R+ + +++R W + E GG+ + + LY IT HL LA LFD +
Sbjct: 444 WMYSRLSK-LPDATLQRMWGIFSSGEYGGLVEAIVDLYAITGKADHLALARLFDLDKLID 502
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D + G HAN HIP+ G Y+VTG+ Y F +V Y GGTS
Sbjct: 503 ACAANTDTLDGLHANQHIPIFTGLVRLYDVTGEARYLSAAKNFWGMVIPQRMYGIGGTST 562
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
EFW +A T+ N E+C YN+LK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 563 AEFWKARGAVAGTISDTNAETCCAYNLLKLSRSLFFHEQDPKYMDYYERALLNQVLGSKQ 622
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E ++ Y + L G + Y T CC GTG+
Sbjct: 623 DKADAEKPLVTYFIGLEPG--HVRDY----TPKQGTTCCEGTGM 660
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 55/113 (48%), Gaps = 6/113 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
++ L DV L L ++ L++ DVD L+ F+ AG T G A GWE
Sbjct: 45 VRPFELKDVTLG-QGLFAEKRRLMLDHGRGYDVDRLLQVFRANAGLSTKGAVAPGGWEGL 103
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSG 209
+ LRGH+ GH+L+ A A T + +++ ++ AL+E + + +G
Sbjct: 104 DGEANGNLRGHYTGHFLTMLAQAHAGTRDTVYSDRIRYMIGALAEVREALRTG 156
>gi|423299329|ref|ZP_17277354.1| hypothetical protein HMPREF1057_00495 [Bacteroides finegoldii
CL09T03C10]
gi|408473138|gb|EKJ91660.1| hypothetical protein HMPREF1057_00495 [Bacteroides finegoldii
CL09T03C10]
Length = 800
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 124/408 (30%), Positives = 191/408 (46%), Gaps = 48/408 (11%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L DVKL S +AQQT+L Y+L L+ D L+ F + AG +Y WE+ L G
Sbjct: 30 LQDVKLLDSPF-LQAQQTDLHYILALNPDRLLAPFLREAGLTPKAPSYTNWEN--TGLDG 86
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS--EQFDRFEA 224
H GHYLSA + M+A+T + + ++ ++ L Q +G+G++ P + + +A
Sbjct: 87 HIGGHYLSALSMMYAATGDTAIYNRLNYMLDELYRAQQAVGTGFIGGTPGSLQLWKEIKA 146
Query: 225 ---------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKM----TKWMVEYFYNRVQN 271
L W P Y IHK AGL D Y + + +A M T WM++
Sbjct: 147 GNIRAGGFDLNGKWVPLYNIHKTYAGLRDAYLYTGSDRARLMLIAFTDWMID-------- 198
Query: 272 VITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDI 331
+ + S ++ + L E GG+N+ + IT D K+L LA F L L D +
Sbjct: 199 ITSGLSDQQIQDMLRSEHGGLNETFADVAAITGDKKYLELARRFSHKIILDPLIKDEDRL 258
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDPL-------YKVTGTFFMDIVNASHGYATGGTSAGE 384
+G HANT IP VIG + E++ D + FF + V + GG S E
Sbjct: 259 TGMHANTQIPKVIGYKRIAELSQDDKNWNHAEEWDHAARFFWNTVVNNRSVCIGGNSVRE 318
Query: 385 FWSDPKRLASTLG-TENEESCTTYNMLKVSRHLFRWT--------KEMVYADYYERALTN 435
+ S + + E+C TYNML++++ L++ + + Y +YYERAL N
Sbjct: 319 HFHPADNFTSMINDVQGPETCNTYNMLRLTKMLYQNSHNPCNINEPDPNYINYYERALYN 378
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 379 HILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 420
>gi|302561993|ref|ZP_07314335.1| secreted protein [Streptomyces griseoflavus Tu4000]
gi|302479611|gb|EFL42704.1| secreted protein [Streptomyces griseoflavus Tu4000]
Length = 950
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 106/284 (37%), Positives = 149/284 (52%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E++ VWAPYYT HKIL GLLD Y D+ +AL + M +
Sbjct: 399 GFLAAYPETQFIALESMTGSDYTRVWAPYYTAHKILRGLLDAYLATDDERALDLASGMCD 458
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ + R+ +V+ +++R W + E GG+ + + L+ +T P+HL LA LFD +
Sbjct: 459 WMHARL-SVLPAATLQRMWGLFSSGEFGGIVEAVCDLHALTGRPEHLALARLFDLDRLID 517
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D + G HAN HIPV G ++ TG+ Y F +V YA GGTS+
Sbjct: 518 ACAADTDVLEGLHANQHIPVFTGLVRLHDETGEQRYLTAAKNFWGMVVPHRTYAIGGTSS 577
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
GEFW +A T+G ESC YNMLK+SR LF ++ Y DYYER L N VL ++
Sbjct: 578 GEFWKARGVIAGTIGDTTAESCCAYNMLKLSRALFFHEQDPAYMDYYERTLYNQVLGSKQ 637
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E ++ Y + L G + Y T CC GTG+
Sbjct: 638 DRPDAEKPLVTYFVGLTPG--HVRDY----TPKQGTTCCEGTGM 675
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 71/165 (43%), Gaps = 15/165 (9%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
++ L DV L P + ++ L++ DV+ L+ F+ AG T G A GWE
Sbjct: 60 VRPFGLEDVTLGPGVFAAK-RRLMLDHARGYDVNRLLQVFRANAGLSTRGAVAPGGWEGL 118
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPS 216
+ LRGH+ GH+L+ A ST +++ VV AL E + + S
Sbjct: 119 DGEANGNLRGHYTGHFLTMLAQAHRSTGEQVFADRIDTVVGALVEVREALRSEPAVLSTG 178
Query: 217 EQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
+F R A + V Y + A L D T AL ++ W+
Sbjct: 179 GRFGR--AAENVRGSYQYVDLPAAVL-------DGTPALTLSAWV 214
>gi|451820300|ref|YP_007456501.1| acetyl-CoA carboxylase, biotin carboxylase [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451786279|gb|AGF57247.1| acetyl-CoA carboxylase, biotin carboxylase [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 766
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 120/373 (32%), Positives = 177/373 (47%), Gaps = 25/373 (6%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
+Q +Y+L LDVD + + G K Y GWE + GH +GH++SA A +
Sbjct: 24 SQDLGEKYILSLDVDRFLAPCYEAHGLEPKKKRYSGWEARA--ISGHSLGHFMSALAVTY 81
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF---------DRFEALKPVWAP 231
+T N LK+ + VS LS Q G GY+ F +F+ + W P
Sbjct: 82 QATGNEELKKILDYAVSELSHIQQVTGRGYIGGLVETPFVEIIDGTNIGKFD-INGYWVP 140
Query: 232 YYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGG 291
+Y+IHKI GL+D Y A+N++AL + V F + +++ + S E+ L E GG
Sbjct: 141 WYSIHKIYKGLIDAYELAENSEALNV----VVNFADWAVSILNQMSDEQVQAMLECEHGG 196
Query: 292 MNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIG-SQMRY 350
MN + +LY T + +L A F + L DD+ G HANT IP +IG +++
Sbjct: 197 MNHIFAKLYGFTCNSIYLDTAVRFSHKAIVEPLEQCVDDLQGKHANTQIPKIIGIAEIYN 256
Query: 351 EVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNML 410
+ YK FF + V Y GG S E + +LG + ESC T+NML
Sbjct: 257 QEHAYEKYKTAAQFFWNTVVNRRSYVIGGNSLKEHFEAID--MESLGIKTAESCNTHNML 314
Query: 411 KVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGT 470
+++ LF W Y DYYE AL N ++ Q G Y L G Y + T
Sbjct: 315 LLTKLLFSWNHYSAYMDYYENALFNHIIGTQ-DCHTGNKTYFTSLLPG-----HYRIYST 368
Query: 471 RFSSFWCCYGTGL 483
+ +++WCC GTG+
Sbjct: 369 KDTAWWCCTGTGM 381
>gi|256423606|ref|YP_003124259.1| hypothetical protein Cpin_4617 [Chitinophaga pinensis DSM 2588]
gi|256038514|gb|ACU62058.1| protein of unknown function DUF1680 [Chitinophaga pinensis DSM
2588]
Length = 1025
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 136/433 (31%), Positives = 200/433 (46%), Gaps = 83/433 (19%)
Query: 126 LEYLLMLDVDSLVWSFQKTAG--SPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWAST 183
+ L D +S ++ F+ G P K + W+ +LRGH GHYL+A A +AST
Sbjct: 406 IRTLATTDPNSFLYMFRHAFGRQQPEGAKPLDVWDSQDTKLRGHATGHYLTAIAQAYAST 465
Query: 184 -HNVTLKE----KMTAVVSAL----------------------------------SECQN 204
++ TL++ KM +V+ L S+ N
Sbjct: 466 GYDKTLQQNFEQKMAYMVNTLYELSLLSGNPKETGGVAVSDPTAVPYGPGKSGYDSDLSN 525
Query: 205 KM--------GSGYLSAFPSEQFDRFEA-------LKPVWAPYYTIHKILAGLLDQYTFA 249
+ G G++SA+P +QF E +WAPYYT+HKILAGL+D Y +
Sbjct: 526 EGIRNDYWNWGKGFISAYPPDQFIMLEKGAKYGGQKNQIWAPYYTLHKILAGLMDVYEVS 585
Query: 250 DNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKH 308
N +AL + M ++ Y R+ +V + ++ + WN+ + E GGMN+ + RLY IT ++
Sbjct: 586 GNQKALTVATGMGDWVYARLSHV-PQDTLIKMWNTYIAGEFGGMNEAMARLYLITGKQQY 644
Query: 309 LLLAHLFDK-PCFLG------LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP-LYKV 360
L A LFD F G LA D G HAN HIP ++GS Y + +P YK+
Sbjct: 645 LQTAQLFDNIRVFFGDTAHSHGLAKNVDIFRGLHANQHIPQIVGSIEMYRASNNPEYYKI 704
Query: 361 TGTFFMDIVNASHGYATGGTSAGE-------FWSDPKRL---ASTLGTENEESCTTYNML 410
F+ VN + Y+ GG + F S P L + G +N E+C TYNML
Sbjct: 705 ADNFWYKAVN-DYMYSIGGVAGARNPANAECFISQPATLYENGFSSGGQN-ETCATYNML 762
Query: 411 KVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGT 470
K++ LF + + + DYYERAL N +L+ P Y +PL G K
Sbjct: 763 KLTSDLFLFDQRAEFMDYYERALYNHILASVAKDNP-ANTYHVPLRPGAIKQFG----NP 817
Query: 471 RFSSFWCCYGTGL 483
+ F CC GT +
Sbjct: 818 DMTGFTCCNGTAI 830
>gi|444305788|ref|ZP_21141565.1| hypothetical protein G205_09453 [Arthrobacter sp. SJCon]
gi|443481842|gb|ELT44760.1| hypothetical protein G205_09453 [Arthrobacter sp. SJCon]
Length = 444
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 122/377 (32%), Positives = 174/377 (46%), Gaps = 26/377 (6%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHM 179
+AQ T++ Y+L LD D L + AG A +AY WE + L GH GHYLS A +
Sbjct: 23 QAQDTSVRYILSLDADRLFAPYLHEAGLVRAAEAYGNWE--SDGLGGHIGGHYLSGCARL 80
Query: 180 WASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP-----SEQFDRFEA------LKPV 228
+A+T N L K+ A V L CQ G GY+ P ++ R E L
Sbjct: 81 YAATGNAELLAKVRAAVVILGNCQAAHGDGYVGGVPRGGDLGQELARGEVDADLFTLNGR 140
Query: 229 WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEE 288
W P Y +HK LAGLLD FA + +AL + + ++ RV + + E L+ E
Sbjct: 141 WVPLYNLHKTLAGLLDARVFAGSGEALDIAVGLAGWWL-RVSAHLADDAFE---EVLHAE 196
Query: 289 TGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQM 348
GGMN+ L+ +T ++L A F L LA D + G HANT IP V+G
Sbjct: 197 FGGMNEAFALLWELTGREEYLREARRFSHRALLDPLAAGQDLLDGLHANTQIPKVVGYAR 256
Query: 349 RYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL-GTENEESCTTY 407
T D F + V + + GG S E + + + + E+C TY
Sbjct: 257 LAGPTHDADLAHACDIFWESVVSRRSVSIGGNSVREHFHPASDFSPMVQDPQGPETCNTY 316
Query: 408 NMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR-GTEPGVMIYMLPLGRGDSKAKSYH 466
NMLK+++ F + D++ERA N +LS Q GT G ++Y P+ + Y
Sbjct: 317 NMLKLAKLRFEAHGDAAAVDFFERATYNHILSSQHPGT--GGLVYFTPM-----RPGHYR 369
Query: 467 GWGTRFSSFWCCYGTGL 483
+ S WCC G+GL
Sbjct: 370 VYSRAQESMWCCVGSGL 386
>gi|224537186|ref|ZP_03677725.1| hypothetical protein BACCELL_02063 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521241|gb|EEF90346.1| hypothetical protein BACCELL_02063 [Bacteroides cellulosilyticus
DSM 14838]
Length = 805
Score = 171 bits (433), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 123/394 (31%), Positives = 183/394 (46%), Gaps = 34/394 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L DV++ A N++ LL D D L+ F + AG P + Y WE L G
Sbjct: 31 LGDVRITAGPFK-HACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEKDG--LDG 87
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALK 226
H GHYL+A A +A+T N+ K++M +VS + Q G G + FP+ + E K
Sbjct: 88 HIGGHYLTALAIHYAATGNLECKKRMDYMVSEFARVQQANGDGSICGFPNSKKFAEEIRK 147
Query: 227 -------PVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITK 275
W +Y +HK AGL D + + N +A LK W V+ N +
Sbjct: 148 GNVGIVWNYWVAWYNMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISN-----LDD 202
Query: 276 YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
+ER L+ E GGMN+V + +T +PK+L A F +A + D++ H
Sbjct: 203 RQMER---MLDNEFGGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMARRIDNLDNKH 259
Query: 336 ANTHIPVVIGSQMRYEVTGD--PLYK---VTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
ANT +P +G Q E+ P Y FF + V + + GG S GE + +
Sbjct: 260 ANTQVPKAVGYQRVAELNSKIAPDYNDFMTAAEFFWETVVSHRSLSLGGNSRGEHFPEAG 319
Query: 391 RLASTL-GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+ + + + ESC T NMLK++ LFR ++ YAD+YERA+ N +LS Q E G
Sbjct: 320 KCSDYMHERQGPESCNTNNMLKLTEGLFRMHPKVEYADFYERAMYNHILSTQH-PEHGGY 378
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+Y P + Y + + WCC GTG+
Sbjct: 379 VYFTP-----ACPSHYRVYSAPGKAMWCCVGTGM 407
>gi|322692034|ref|YP_004221604.1| cell surface protein [Bifidobacterium longum subsp. longum JCM
1217]
gi|320456890|dbj|BAJ67512.1| putative cell surface protein [Bifidobacterium longum subsp. longum
JCM 1217]
Length = 1984
Score = 171 bits (432), Expect = 9e-40, Method: Composition-based stats.
Identities = 136/450 (30%), Positives = 202/450 (44%), Gaps = 77/450 (17%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWE 158
++L E + +V + L A + +EYLL + D L+ F+ AG T G K Y GWE
Sbjct: 372 NYLSEQGMENVTVADEYLQ-NAGKKEVEYLLSFEPDRLLVEFRAQAGLDTKGAKNYGGWE 430
Query: 159 DPTCELR------------GHFVGHYLSASAHMWAST-----HNVTLKEKMTAVVSALSE 201
+ E R GHFVGH++SA++ ST L +TAVV + E
Sbjct: 431 NGPDESRNPDGSSKPGRFTGHFVGHWISAASQAQRSTFATADQKAQLSANLTAVVKGIRE 490
Query: 202 CQ------NKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQAL 255
Q + +G+ AF + + P+Y +HK+ AG++ Y ++ + +
Sbjct: 491 AQEAYAKKDTANAGFFPAFSASVVP--NGGGGLIVPFYNLHKVEAGMVQAYDYSTDAETR 548
Query: 256 KMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTIT--QDPKHLLLA- 312
+ K F V N + ++ + L E GGMND LY++ I D + +L A
Sbjct: 549 ETAKAAAVDFAKWVVNWKSAHAST---DMLRTEYGGMNDALYQVAEIADASDKQTVLTAA 605
Query: 313 HLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRY-----------EVTGDPLYKVT 361
HLFD+ LA D ++G HANT IP + G+ RY ++ D K+T
Sbjct: 606 HLFDETALFQKLANGQDPLNGLHANTTIPKLTGAMQRYVAYTEDEDLYNSLSADERGKLT 665
Query: 362 GTF------FMDIVNASHGYATGGTS-------AGEFWSDPKRLASTLGTENE------- 401
+ F DIV H Y GG S AGE W D A+ G +N
Sbjct: 666 SLYLKAAQNFFDIVVKDHTYVNGGNSQSEHFHVAGELWKD----ATQNGDQNGGYRNFST 721
Query: 402 -ESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
E+C YNMLK++R LF+ TK+ Y++YYE N +++ Q E G+ Y P+ G
Sbjct: 722 VETCNEYNMLKLARILFQVTKDSKYSEYYEHTFINAIVASQ-NPETGMTTYFQPMKAGYP 780
Query: 461 KAKSYHG-------WGTRFSSFWCCYGTGL 483
K G +G +WCC GTG+
Sbjct: 781 KVFGITGTDYDADWFGGAIGEYWCCQGTGI 810
>gi|423223044|ref|ZP_17209513.1| hypothetical protein HMPREF1062_01699 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640313|gb|EIY34115.1| hypothetical protein HMPREF1062_01699 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 805
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 123/394 (31%), Positives = 182/394 (46%), Gaps = 34/394 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L DV++ A N++ LL D D L+ F + AG P + Y WE L G
Sbjct: 31 LGDVRITAGPFK-HACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEKDG--LDG 87
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALK 226
H GHYL+A A +A+T N+ K++M +VS + Q G G + FP+ + E K
Sbjct: 88 HIGGHYLTALAIHYAATGNLECKKRMDYMVSEFARVQQANGDGSICGFPNSKKFAEEIRK 147
Query: 227 -------PVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITK 275
W +Y +HK AGL D + + N +A LK W V+ N +
Sbjct: 148 GNVGIVWNYWVAWYNMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISN-----LDD 202
Query: 276 YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
+ER L+ E GGMN+V + +T +PK+L A F +A D++ H
Sbjct: 203 RQMER---MLDNEFGGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMARHIDNLDNKH 259
Query: 336 ANTHIPVVIGSQMRYEVTGD--PLYK---VTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
ANT +P +G Q E+ P Y FF + V + + GG S GE + +
Sbjct: 260 ANTQVPKAVGYQRVAELNSKTAPDYNDFMTAAEFFWETVVSHRSLSLGGNSRGEHFPEAG 319
Query: 391 RLASTL-GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+ + + + ESC T NMLK++ LFR ++ YAD+YERA+ N +LS Q E G
Sbjct: 320 KCSDYMHERQGPESCNTNNMLKLTEGLFRMHPKVEYADFYERAMYNHILSTQH-PEHGGY 378
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+Y P + Y + + WCC GTG+
Sbjct: 379 VYFTP-----ACPSHYRVYSAPGKAMWCCVGTGM 407
>gi|386820708|ref|ZP_10107924.1| putative glycosyl hydrolase of unknown function (DUF1680)
[Joostella marina DSM 19592]
gi|386425814|gb|EIJ39644.1| putative glycosyl hydrolase of unknown function (DUF1680)
[Joostella marina DSM 19592]
Length = 1018
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 133/430 (30%), Positives = 199/430 (46%), Gaps = 83/430 (19%)
Query: 129 LLMLDVDSLVWSFQKTAGSPT--AGKAYEGWEDPTCELRGHFVGHYLSASAHMWAST-HN 185
L + D+ ++ F+ T G P A + W+ +LRGH GHYL+A A +AST ++
Sbjct: 402 LAQTNPDAFLYMFRNTFGQPQPDAAEPLGVWDSQETKLRGHATGHYLTAIAQAYASTGYD 461
Query: 186 VTLK----EKMTAVVSALSECQNKMGS--------------------------------- 208
+L+ +KM +V+ L + G+
Sbjct: 462 KSLQNNFADKMEYMVNTLYKLAQMSGNPKTKDGSYVANPTEVPPGPGKSNYDSDLSEDGI 521
Query: 209 ---------GYLSAFPSEQFDRFE-------ALKPVWAPYYTIHKILAGLLDQYTFADNT 252
G++SA+P +QF E VWAPYYT+HKILAGLLD Y + N
Sbjct: 522 RTDYWNWGEGFISAYPPDQFIMLENGATYGGQQTQVWAPYYTLHKILAGLLDIYEVSGNK 581
Query: 253 QALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLL 311
+AL++ + M + Y R+ + T+ ++ WN + E GGMN+V+ RLY +T + K+L +
Sbjct: 582 KALEVAEGMGSWVYARLNELPTE-TLISMWNRYIAGEFGGMNEVMARLYRLTDEEKYLQV 640
Query: 312 AHLFDK-PCFLGL------LAVQADDISGFHANTHIPVVIGS-QMRYEVTGDPLYKVTGT 363
A LFD F G LA D G HAN HIP ++G+ +M + Y++
Sbjct: 641 AQLFDNIKVFYGDANHSNGLAKNVDTFRGLHANQHIPQIVGAIEMYRDSNTAEYYRIADN 700
Query: 364 FFMDIVNASHGYATGGTSAGE-------FWSDPKRL---ASTLGTENEESCTTYNMLKVS 413
F+ N + Y+ GG + F S P + + G +N E+C TYNMLK++
Sbjct: 701 FWFKSKN-DYMYSIGGVAGARNPANAECFISQPATIYENGLSAGGQN-ETCATYNMLKLT 758
Query: 414 RHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFS 473
R+LF + + Y DYYER L N +L+ P Y +PL G K H
Sbjct: 759 RNLFLFDQRAEYMDYYERGLYNHILASVAEKTPA-NTYHVPLRPGSVK----HFGNPDMK 813
Query: 474 SFWCCYGTGL 483
F CC GT +
Sbjct: 814 GFTCCNGTAI 823
>gi|419850639|ref|ZP_14373619.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
gi|419851584|ref|ZP_14374510.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
gi|386408481|gb|EIJ23391.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
gi|386413301|gb|EIJ27914.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
Length = 1834
Score = 170 bits (431), Expect = 1e-39, Method: Composition-based stats.
Identities = 136/450 (30%), Positives = 200/450 (44%), Gaps = 77/450 (17%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWE 158
++L E + +V + L A + +EYLL + D L+ F+ AG T G K Y GWE
Sbjct: 222 NYLSEQGMENVTVADEYLQ-NAGKKEVEYLLSFEPDRLLVEFRAQAGLDTKGAKNYGGWE 280
Query: 159 DPTCELR------------GHFVGHYLSASAHMWAST-----HNVTLKEKMTAVVSALSE 201
+ E R GHFVGH++SA++ ST L +TAVV + E
Sbjct: 281 NGPDESRNPDGSSKPGRFTGHFVGHWISAASQAQRSTFATADQKAQLSANLTAVVKGIRE 340
Query: 202 CQ------NKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQAL 255
Q + +G+ AF + + P+Y +HK+ AG++ Y ++ + +
Sbjct: 341 AQEAYAKKDTANAGFFPAFSASVVP--NGGGGLIVPFYNLHKVEAGMVQAYDYSTDAETR 398
Query: 256 KMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTIT--QDPKHLLLA- 312
+ K F V N + ++ + L E GGMND LY++ I D + +L A
Sbjct: 399 ETAKAAAVDFAKWVVNWKSAHAST---DMLRTEYGGMNDALYQVAEIADASDKQTVLTAA 455
Query: 313 HLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRY-----------EVTGD------ 355
HLFD+ LA D ++G HANT IP + G+ RY ++ D
Sbjct: 456 HLFDETALFQKLANGQDPLNGLHANTTIPKLTGAMQRYVAYTEDEDLYNSLSADERGELT 515
Query: 356 PLYKVTGTFFMDIVNASHGYATGGTS-------AGEFWSDPKRLASTLGTENE------- 401
LY F DIV H Y GG S AGE W D A+ G +N
Sbjct: 516 SLYLKAAQNFFDIVVKDHTYVNGGNSQSEHFHVAGELWKD----ATQNGDQNGGYRNFST 571
Query: 402 -ESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDS 460
E+C YNMLK++R LF+ TK+ Y++YYE N +++ Q E G+ Y P+ G
Sbjct: 572 VETCNEYNMLKLARILFQVTKDSKYSEYYEHTFINAIVASQ-NPETGMTTYFQPMKAGYP 630
Query: 461 KAKSYHG-------WGTRFSSFWCCYGTGL 483
K G +G +WCC GTG+
Sbjct: 631 KVFGITGTDYDADWFGGAIGEYWCCQGTGI 660
>gi|344201935|ref|YP_004787078.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
gi|343953857|gb|AEM69656.1| protein of unknown function DUF1680 [Muricauda ruestringensis DSM
13258]
Length = 1022
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 152/507 (29%), Positives = 224/507 (44%), Gaps = 89/507 (17%)
Query: 58 TPTDDSAWS---NLLPRKMLSETDEFSWTMIYRKMKNPDG---FKLAGDFLKEVSLHDVK 111
+PTD+S S N + +S TD + K G KL L +VSL+
Sbjct: 329 SPTDNSEVSKPGNYVVTGQVSGTDFQPKARVTVKASKESGTPSLKLDVFGLDQVSLNADA 388
Query: 112 LDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGS--PTAGKAYEGWEDPTCELRGHFV 169
+ + + L+ + DS ++ F+ G P K W+ +LRGH
Sbjct: 389 HGQQTKFIENRDKFINTLVQTNPDSFLYMFRNAFGQEQPEGAKPLGVWDSQETKLRGHAT 448
Query: 170 GHYLSASAHMWASTH-----NVTLKEKMTAVVSALSE----------------------- 201
GHYL+A A +AST +KM +V L +
Sbjct: 449 GHYLTAIAQAYASTGYDKALQANFADKMNYMVDVLYQLSQMSGQSAKAGGEHVADPTAVP 508
Query: 202 ------------CQNKM-------GSGYLSAFPSEQFDRFE-----ALKP--VWAPYYTI 235
+N + G G++SA+P +QF E +P VWAPYYT+
Sbjct: 509 PGPGKSTYDSDLSENGIRTDYWNWGEGFISAYPPDQFIMLENGATYGTQPTQVWAPYYTL 568
Query: 236 HKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMND 294
HKILAGL+D Y + N +AL++ K M ++ Y R+ + T + WN+ + E GGMN+
Sbjct: 569 HKILAGLMDIYEVSGNEKALEIAKGMGDWVYARLSQLPTDTLIS-MWNTYIAGEFGGMNE 627
Query: 295 VLYRLYTITQDPKHLLLAHLFDK-PCFLG------LLAVQADDISGFHANTHIPVVIGSQ 347
+ RL IT +P++L +A LFD F G LA D G HAN HIP ++G+
Sbjct: 628 AMARLDRITDEPRYLKVAQLFDNIKMFFGDAEHSHGLARNVDSFRGLHANQHIPQIVGAL 687
Query: 348 MRYEVTGDP-LYKVTGTFFMDIVNASHGYATGG-------TSAGEFWSDPKRL---ASTL 396
Y + P Y+V F+ N + Y+ GG T+A F + P L +
Sbjct: 688 EIYRDSESPEYYQVADNFWYKAKN-DYMYSIGGVAGARNPTNAECFIAQPATLYENGFSS 746
Query: 397 GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLG 456
G +N E+C TYNMLK++++LF + + DYYER L N +L+ P Y +PL
Sbjct: 747 GGQN-ETCATYNMLKLTKNLFLFDQRTELMDYYERGLYNHILASVAEDSP-ANTYHVPLR 804
Query: 457 RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G K + + F CC GT L
Sbjct: 805 PGSVKRFG----NSDMTGFTCCNGTAL 827
>gi|359776490|ref|ZP_09279799.1| hypothetical protein ARGLB_045_00070 [Arthrobacter globiformis NBRC
12137]
gi|359306199|dbj|GAB13628.1| hypothetical protein ARGLB_045_00070 [Arthrobacter globiformis NBRC
12137]
Length = 1025
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 103/284 (36%), Positives = 145/284 (51%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E+ VWAPYYT HKIL GLLD YT +AL + + +
Sbjct: 391 GFLAAYPETQFIELESRTTPDYFRVWAPYYTAHKILKGLLDAYTATAEPKALDLATGLCD 450
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ ++R+ +T +R W + E GG+ + + Y + P+HL LA FD +
Sbjct: 451 WMHSRLSK-LTPAVRQRMWGIFSSGEYGGVVEAILETYGHSGKPEHLELAKYFDLDSLID 509
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D ++G HAN HIP+ G + Y TG+ Y F +V + ++ GGTS
Sbjct: 510 ACAQDKDILAGLHANQHIPIFTGLVLMYNATGEERYLAAARNFWTMVVPTRMFSIGGTSQ 569
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
GEFW + R+A+TL + ESC YNMLK+SR LF + Y DYYERAL N VL ++
Sbjct: 570 GEFWKERDRIAATLNATDAESCCAYNMLKLSRELFFREQNPAYMDYYERALFNQVLGSKQ 629
Query: 443 GTEPG---VMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E + Y + L G + + T CC GTGL
Sbjct: 630 DKESAELPLATYFIGLQPGAVRDFTPKQGTT------CCEGTGL 667
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 50/113 (44%), Gaps = 9/113 (7%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG-------SPTAGKAY 154
++ L DV L P + R ++ L + D V F+ AG P +
Sbjct: 49 VRPFKLSDVSLGPG-VFARKRELILNFARGYDERRYVNVFRANAGLRPLDGVVPLPAGGW 107
Query: 155 EGWE-DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
EG + + LRGHF GH++S A +A T K+ +V++L EC+ +
Sbjct: 108 EGLDGEANGNLRGHFTGHHMSMLAQAYAGTGEEVFGTKLRNLVASLHECRQAL 160
>gi|443629445|ref|ZP_21113773.1| putative Secreted protein [Streptomyces viridochromogenes Tue57]
gi|443337063|gb|ELS51377.1| putative Secreted protein [Streptomyces viridochromogenes Tue57]
Length = 941
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 105/284 (36%), Positives = 147/284 (51%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEA-----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E+ VWAPYYT HKIL G+LD Y D+ +AL + M +
Sbjct: 390 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGVLDAYLATDDARALDLASGMCD 449
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ Y+R+ + + +++R W + E GG+ + + L+TIT +HL LA LFD +
Sbjct: 450 WMYSRLSK-LPEATLQRMWGLFSSGEFGGIVEAVCDLHTITGKAEHLALAQLFDLDRLID 508
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D + G HAN HIP+ G Y+ TG+ Y F +V Y GGTS
Sbjct: 509 NCAANTDILDGLHANQHIPIFTGYLRLYDATGEQRYLDAARNFWGMVVPHRMYGIGGTST 568
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
GEFW +A T+ N E+C YNMLK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 569 GEFWKARDVIAGTISATNAETCCAYNMLKLSRTLFFHEQQPKYMDYYERALFNQVLGSKQ 628
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E ++ Y + L G + Y T CC GTG+
Sbjct: 629 DKADAEKPLVTYFIGLTPG--HVRDY----TPKQGTTCCEGTGM 666
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 55/110 (50%), Gaps = 6/110 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
++ +L DV L P L +Q L++ DV+ L+ F+ AG T G A GWE
Sbjct: 51 VQPFALDDVALRPG-LFADKRQLMLDHARGYDVNRLLQVFRANAGLSTGGAVAPGGWEGL 109
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
+ LRGH+ GH+L+ + +A T +++ +V AL+E + +
Sbjct: 110 DGEANGNLRGHYTGHFLTMLSQAYAGTGEQVFVDRIRTMVGALTEVREAL 159
>gi|261415299|ref|YP_003248982.1| hypothetical protein Fisuc_0892 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|385790233|ref|YP_005821356.1| hypothetical protein FSU_1340 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|261371755|gb|ACX74500.1| protein of unknown function DUF1680 [Fibrobacter succinogenes
subsp. succinogenes S85]
gi|302327243|gb|ADL26444.1| conserved hypothetical protein [Fibrobacter succinogenes subsp.
succinogenes S85]
Length = 897
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 124/397 (31%), Positives = 184/397 (46%), Gaps = 40/397 (10%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELR 165
+L DV+L L R Q N+E LL DVD L+ F + AG + W L
Sbjct: 36 ALSDVQLLDGVLKER-QDLNVETLLSYDVDRLLAPFYEEAGMKPKASKFPNW----AGLD 90
Query: 166 GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS-----GYLSAFPSEQ-- 218
GH +GHYLSA A +A +V +KE++ ++ L Q++ GY+S P+ +
Sbjct: 91 GHVLGHYLSALAMHYADNDDVQVKERLEYILKELKTIQDQNSKDNNFKGYISGVPNGKQM 150
Query: 219 -----FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRV 269
A W P+Y IHK+ AGL D Y +A QA L + W + +
Sbjct: 151 WLKMKNGDAGAQNGYWVPWYNIHKLYAGLRDAYVYAGYEQAKTMFLALCDWGIT-----I 205
Query: 270 QNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQAD 329
N + +++ L E GGM +V Y +T+D K+L A + L ++ D
Sbjct: 206 TNGLNDSKMQQM---LGTEHGGMPEVYADAYKLTKDEKYLNAAKKWSHQWLLNPMSQGND 262
Query: 330 DISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW--- 386
+++ HANT +P V+G E++GD YK FF V A GG S E +
Sbjct: 263 NLTNVHANTQVPKVVGFARIAELSGDEKYKKGSDFFWQTVVNKRSIAIGGNSISEHFPAL 322
Query: 387 SDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP 446
++ K+ E ESC TYNMLK++ LF + Y D+YERAL N +LS T
Sbjct: 323 NNHKKFIEE--REGPESCNTYNMLKLTERLFNIKHDAHYTDFYERALFNHILSTIHPTHG 380
Query: 447 GVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P ++ + Y + + WCC G+G+
Sbjct: 381 G-YVYFTP-----ARPRHYRVYSKVNAGMWCCVGSGM 411
>gi|238061684|ref|ZP_04606393.1| secreted protein [Micromonospora sp. ATCC 39149]
gi|237883495|gb|EEP72323.1| secreted protein [Micromonospora sp. ATCC 39149]
Length = 933
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 101/284 (35%), Positives = 149/284 (52%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E++ VWAPYYT HKIL G+LD Y + +AL + M +
Sbjct: 382 GFLAAYPETQFITLESMTASDYAKVWAPYYTAHKILQGILDAYLNTGDERALDLATGMCD 441
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ ++R+ + +++R W + E GG+ + + ++ IT P HL LA LFD +
Sbjct: 442 WMHSRLSK-LPAATLQRMWGLFSSGEFGGIVETICDVHRITGSPNHLALARLFDLNSLID 500
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D I+G HAN HIP+ G ++ TG+ Y F +V + Y+ GGTS
Sbjct: 501 AAAAGTDTITGLHANQHIPIFTGLLRLHDETGEQRYLNAARNFWPMVVPTRMYSIGGTST 560
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
EFW +P +A +L N E+C YN+LK+SR LF ++ Y DYYERAL N +L +R
Sbjct: 561 VEFWKEPGAIAGSLSDTNAETCCAYNLLKLSRTLFLHEQDPKYMDYYERALYNQILGSKR 620
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E ++ Y + L G + Y T CC GTG+
Sbjct: 621 DLADAEKPLVTYFIGLVPG--HVRDY----TPKQGTTCCEGTGM 658
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 11/120 (9%)
Query: 92 PDGFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG 151
P +KL L EV+L D + R + LE+ +VD L+ F+ AG T G
Sbjct: 44 PPSWKLRPFPLGEVALRD------GVFARKRDLMLEHARGYNVDRLLQVFRANAGLDTLG 97
Query: 152 K-AYEGWE----DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
A GWE + LRGH+ GH+L+ A + ST + +K+ +V AL E + +
Sbjct: 98 AVAPSGWEGLDGEANGNLRGHYTGHFLTMLAQAYGSTGDKVFADKLKYMVGALVEARAAL 157
>gi|302549595|ref|ZP_07301937.1| secreted protein [Streptomyces viridochromogenes DSM 40736]
gi|302467213|gb|EFL30306.1| secreted protein [Streptomyces viridochromogenes DSM 40736]
Length = 943
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 104/284 (36%), Positives = 148/284 (52%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEA-----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E+ VWAPYYT HKIL GLLD YT D+ +AL + M +
Sbjct: 392 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGLLDAYTATDDDRALDLASGMCD 451
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ ++R+ + + +++R W + E GG+ + + L+T+T +HL LA LFD +
Sbjct: 452 WMHSRLSK-LPESTLQRMWGIFSSGEFGGIVEAICDLHTLTGKAEHLALAQLFDLDRLIE 510
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D + G HAN HIP+ G Y+ TG+ Y + F D+V Y GGTS
Sbjct: 511 ACAANTDILDGLHANQHIPIFTGYVRLYDETGEERYLRSAKNFWDMVVPHRMYGIGGTST 570
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
EFW +A T+ E+C YNMLK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 571 QEFWKARDVIAGTISATTAETCCAYNMLKLSRTLFFHEQDPKYMDYYERALYNQVLGSKQ 630
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E ++ Y + L G + Y T CC GTG+
Sbjct: 631 DKPDVEKPLVTYFIGLTPG--HVRDY----TPKQGTTCCEGTGM 668
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 6/110 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
++ L DV L + +Q L++ DV+ L+ F+ AG T G A GWE
Sbjct: 53 VRPFGLEDVSLG-RGVFADKRQLMLDHARGYDVNRLLQVFRANAGLATGGAVAPGGWEGL 111
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
+ LRGH+ GH+L+ A + ST +++ AVV AL+E + +
Sbjct: 112 DGEANGNLRGHYTGHFLTMLAQAYRSTKEQVFADRIGAVVGALTEVRAAL 161
>gi|383641062|ref|ZP_09953468.1| glycosylase [Streptomyces chartreusis NRRL 12338]
Length = 900
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 104/284 (36%), Positives = 147/284 (51%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEA-----LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E+ VWAPYYT HKIL GLLD Y D+ +AL + M +
Sbjct: 349 GFLAAYPETQFIDLESRTTSDYTKVWAPYYTAHKILRGLLDAYGATDDDRALDLASGMCD 408
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ ++R+ + + +++R W + E GG+ + + L+TIT +HL LA LFD +
Sbjct: 409 WMHSRLSK-LPESTLQRMWGIFSSGEFGGIVEAVCDLHTITGKAEHLALAQLFDLDRLID 467
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D + G HAN HIP+ G Y+ TG+ Y + F D+V Y GGTS
Sbjct: 468 ACAANTDILDGLHANQHIPIFTGYVRLYDETGEERYLTSAKNFWDMVVPHRMYGIGGTST 527
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
EFW +A T+ E+C YNMLK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 528 QEFWKARDVIAGTISATTAETCCAYNMLKLSRTLFFHEQDPKYMDYYERALYNQVLGSKQ 587
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E ++ Y + L G + Y T CC GTG+
Sbjct: 588 DKPDAEKPLVTYFIGLTPG--HVRDY----TPKQGTTCCEGTGM 625
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 55/110 (50%), Gaps = 6/110 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
++ +L DV L P L ++ L++ DV+ L+ F+ AG PT G A GWE
Sbjct: 10 VQPFALEDVALRPG-LFAEKRRLMLDHARGYDVNRLLQVFRANAGLPTGGAVAPGGWEGL 68
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
+ LRGH+ GH+L+ A + T +++ +V AL+E + +
Sbjct: 69 DGEANGNLRGHYTGHFLTMLAQAYRGTKERVFADRIGTMVGALTEVRAAL 118
>gi|126348374|emb|CAJ90096.1| conserved hypothetical protein [Streptomyces ambofaciens ATCC
23877]
Length = 942
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 103/284 (36%), Positives = 146/284 (51%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E++ VWAPYYT HKIL GLLD + + +AL + + +
Sbjct: 391 GFLAAYPETQFVELESMTGSDYTRVWAPYYTAHKILRGLLDAHLATGDGRALDLASGLCD 450
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ Y+R+ + +++R W + E GG+ + + L+ +T + HL LA LFD +
Sbjct: 451 WMYSRLSK-LPAATLQRMWGLFSSGEFGGIVEAICDLHAVTGEAHHLALARLFDLDRLID 509
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D + G HAN HIP+ G ++ TG+ Y F +V YA GGTS
Sbjct: 510 ACAADDDVLDGLHANQHIPIFTGLVRLHDATGEERYLTAAKNFWGMVVPHRMYAIGGTST 569
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
GEFW +A TLG ESC YNMLK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 570 GEFWQARDVIAGTLGATTAESCCAYNMLKLSRTLFFHEQDPAYMDYYERALYNQVLGSKQ 629
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E ++ Y + L G + Y T CC GTG+
Sbjct: 630 DAADAEKPLVTYFVGLTPG--HVRDY----TPKQGTTCCEGTGM 667
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 54/110 (49%), Gaps = 6/110 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
++ L DV L + ++ L++ DVD L+ F+ AG T G A GWE
Sbjct: 52 VRPFGLEDVTLG-RGVFADKRRLMLDHARGYDVDRLLQVFRANAGLSTLGAVAPGGWEGL 110
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
+ LRGH+ GH+L+ A T E++T++V+AL+E + +
Sbjct: 111 DGEANGNLRGHYTGHFLTMLAQAHRGTGEEVFAERITSMVTALTEVRESL 160
>gi|365852804|ref|ZP_09393150.1| hypothetical protein HMPREF9103_01934 [Lactobacillus parafarraginis
F0439]
gi|363714017|gb|EHL97570.1| hypothetical protein HMPREF9103_01934 [Lactobacillus parafarraginis
F0439]
Length = 728
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 125/428 (29%), Positives = 196/428 (45%), Gaps = 55/428 (12%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA-YEGWE 158
+ +K VS ++V+ P+S + N+ ++L L D L+++++K AG T G WE
Sbjct: 3 NIMKPVSYYNVEYLPNSTLKEKFERNINWMLSLTPDQLLYNYRKNAGLDTKGATPLTVWE 62
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHN--------VTLKEKMTAVVSALSECQNKMGS-- 208
P RGHF GHYLS ++ + N V LK ++ +V+ L E Q+K+
Sbjct: 63 SPDFFFRGHFTGHYLSGASKTFVELTNTDEKDPQAVELKNRVDLIVTGLKEVQDKLSETS 122
Query: 209 ---GYLSAFPSEQFDRFEALK---PVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMV 262
GYL+A P ++FD E L+ + PYY I K++ GL+D Y + N AL++ K +
Sbjct: 123 EFPGYLAAEPEKRFDNLEKLRFNGNHYVPYYAIQKLMDGLMDAYQYTGNQTALQLVKNLT 182
Query: 263 EYFYNRVQNVITKYSVERHWNSLN-------------EETGGMNDVLYRLYTITQDPKHL 309
Y V+ + K + ER L+ +E G M+ L RLY +T +
Sbjct: 183 SY----VEKRMAKLTPERISAMLDTRWYQGSGQYIFHQEFGAMHRTLLRLYELTGKKEQD 238
Query: 310 L--LAHLFDKPCFLGLLAVQADDIS--GFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFF 365
+ LA FD+ F +L D + H+NT + G Y VTGD YK +
Sbjct: 239 VFDLAEKFDRKWFRDMLINNEDKLGYYSMHSNTELVCAEGMLEYYHVTGDDQYKKGVENY 298
Query: 366 MDIVNASHGYATGGTS-----------AGEFWSDPKRLASTLGTENEESCTTYNMLKVSR 414
MD ++ H T G S E + P+ L N ESC ++++ +S
Sbjct: 299 MDWMHTGHELPTKGISGRSAYPAPADYGSELYDYPEMFFKHLSKLNGESCCSHDLNYLSS 358
Query: 415 HLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSS 474
LF TK+ V + YE N +++ Q+ + + Y+ L + K Y G
Sbjct: 359 ELFADTKDPVLMNDYEIRFINAIMA-QQNNDSAIAEYLYNLSVAPNSVKHYDRGG----- 412
Query: 475 FWCCYGTG 482
FWCC G+G
Sbjct: 413 FWCCVGSG 420
>gi|297191370|ref|ZP_06908768.1| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
gi|197720620|gb|EDY64528.1| secreted protein [Streptomyces pristinaespiralis ATCC 25486]
Length = 942
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 149/284 (52%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E++ VWAPYYT HKIL GLLD + + +AL + M +
Sbjct: 393 GFLAAYPETQFITLESMTSPDYTVVWAPYYTAHKILKGLLDAHLSTGDVRALDLASGMCD 452
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ ++R+ ++ + R W + E GGM + + ++++T +HL LA +FD +
Sbjct: 453 WMHSRLA-LLPSATRRRMWGLFSSGEYGGMVEAVVDVHSLTGRAEHLELARMFDLDPLID 511
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D +SG HAN HIP+ G ++ TG+ Y F D+V + Y GGTS
Sbjct: 512 ACAENRDVLSGLHANQHIPIFTGLIRLHDATGEERYLTAARNFWDMVVPTRMYGIGGTST 571
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
GEFW D +A TLG E+C +NMLK+SR LF ++ YAD+YER L N +L ++
Sbjct: 572 GEFWRDAGVIAGTLGDTTAETCCAHNMLKLSRLLFLHEQDPKYADHYERTLFNQILGSKQ 631
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E +M Y + L G + + T CC GTG+
Sbjct: 632 DLADAELPLMTYFIGLAPGAVRDFTPKQGTT------CCEGTGI 669
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 43/86 (50%), Gaps = 5/86 (5%)
Query: 126 LEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWEDPTCE----LRGHFVGHYLSASAHMW 180
L++ DV L+ F+ AG T G A GWE E LRGHF GH+LS + +
Sbjct: 77 LDFGRSYDVHRLLQVFRANAGLSTRGAVAPGGWEGLDGEARGNLRGHFTGHFLSMLSQAY 136
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKM 206
ST +K+ +V L+EC+ +
Sbjct: 137 VSTREQVFADKIGTMVDGLAECREAL 162
>gi|312133546|ref|YP_004000885.1| protein [Bifidobacterium longum subsp. longum BBMN68]
gi|322690281|ref|YP_004219851.1| hypothetical protein BLLJ_0089 [Bifidobacterium longum subsp.
longum JCM 1217]
gi|311772796|gb|ADQ02284.1| Hypothetical protein BBMN68_1283 [Bifidobacterium longum subsp.
longum BBMN68]
gi|320455137|dbj|BAJ65759.1| conserved hypothetical protein [Bifidobacterium longum subsp.
longum JCM 1217]
Length = 800
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 160/352 (45%), Gaps = 39/352 (11%)
Query: 164 LRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS------------GYL 211
LRGHF GH L + +A T + K+ VS L EC++ + G+L
Sbjct: 178 LRGHFAGHALHMLSQAYAETGEEAILNKINEFVSGLKECRDSLREMKYNGKARYSHPGFL 237
Query: 212 SAFPSEQFDRFEALKP---VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNR 268
+A+ QF E P +WAP+YT HKILAGL+ Y FA N AL + + + + Y R
Sbjct: 238 AAYGEWQFKALEEYAPYGEIWAPWYTEHKILAGLIAAYEFAGNADALDLAEGIGHWTYAR 297
Query: 269 VQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKH---LLLAHLFDKPCFLGLL 324
+ TK +++ W+ + E GGMND L LY +++D L + FD +
Sbjct: 298 LSKC-TKTQLQKMWDIYIGGEYGGMNDSLVDLYNVSKDKDRSEFLKASAFFDTDKLIDNC 356
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHG-------YAT 377
D ++ HAN HIP +G + + ++ V G YA
Sbjct: 357 GAGVDILNNLHANQHIPQFVGYAKDAAMGDADIDADARARYLKAVEGYWGMIVPGRMYAH 416
Query: 378 GGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGV 437
GGT GE W +A +G N ESC YNMLKV+R+LF ++ Y DYYER + N +
Sbjct: 417 GGTGEGEMWGPAHTVAGDIGKRNAESCAAYNMLKVARYLFFIEQKPAYMDYYERTILNHI 476
Query: 438 LSIQ-RGTEPGVMI-----YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
L + R + G + YM P+ K GT CC GT L
Sbjct: 477 LGGKSRDLDSGTALTPGNCYMYPVNPATQKEYGDGNIGT------CCGGTAL 522
>gi|419849455|ref|ZP_14372501.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
gi|419852148|ref|ZP_14375044.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
gi|386411767|gb|EIJ26479.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
2-2B]
gi|386411993|gb|EIJ26692.1| putative glycosyhydrolase [Bifidobacterium longum subsp. longum
35B]
Length = 800
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 160/352 (45%), Gaps = 39/352 (11%)
Query: 164 LRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS------------GYL 211
LRGHF GH L + +A T + K+ VS L EC++ + G+L
Sbjct: 178 LRGHFAGHALHMLSQAYAETGEEAILNKINEFVSGLKECRDSLREMKYNGKARYSHPGFL 237
Query: 212 SAFPSEQFDRFEALKP---VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNR 268
+A+ QF E P +WAP+YT HKILAGL+ Y FA N AL + + + + Y R
Sbjct: 238 AAYGEWQFKALEEYAPYGEIWAPWYTEHKILAGLIAAYEFAGNADALDLAEGIGHWTYAR 297
Query: 269 VQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKH---LLLAHLFDKPCFLGLL 324
+ TK +++ W+ + E GGMND L LY +++D L + FD +
Sbjct: 298 LSKC-TKTQLQKMWDIYIGGEYGGMNDSLVDLYNVSKDKDRSEFLKASAFFDTDKLIDNC 356
Query: 325 AVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHG-------YAT 377
D ++ HAN HIP +G + + ++ V G YA
Sbjct: 357 GAGVDILNNLHANQHIPQFVGYAKDAAMGDADIDADARARYLKAVEGYWGMIVPGRMYAH 416
Query: 378 GGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGV 437
GGT GE W +A +G N ESC YNMLKV+R+LF ++ Y DYYER + N +
Sbjct: 417 GGTGEGEMWGPAHTVAGDIGKRNAESCAAYNMLKVARYLFFIEQKPAYMDYYERTILNHI 476
Query: 438 LSIQ-RGTEPGVMI-----YMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
L + R + G + YM P+ K GT CC GT L
Sbjct: 477 LGGKSRDLDSGTALTPGNCYMYPVNPATQKEYGDGNIGT------CCGGTAL 522
>gi|189464749|ref|ZP_03013534.1| hypothetical protein BACINT_01093 [Bacteroides intestinalis DSM
17393]
gi|189437023|gb|EDV06008.1| hypothetical protein BACINT_01093 [Bacteroides intestinalis DSM
17393]
Length = 805
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 121/394 (30%), Positives = 178/394 (45%), Gaps = 34/394 (8%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L DV++ A N++ LL D D L+ F + AG P + Y WE L G
Sbjct: 31 LGDVRITAGPFK-HACDLNVKVLLQYDTDRLLAPFLREAGLPKKAETYGNWEKDG--LDG 87
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALK 226
H GHYLSA A +A+T N K++M +VS + Q G + FP+ + E K
Sbjct: 88 HIGGHYLSALAIHYAATGNQECKKRMDYMVSEFARVQQANDDGSICGFPNSKKFAEEIRK 147
Query: 227 -------PVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITK 275
W +Y +HK AGL D + + N +A LK W V+ N +
Sbjct: 148 GNVGIVWNYWVAWYNMHKTYAGLRDAWLYGKNEKAKKIFLKFCDWGVDVISN-----LDD 202
Query: 276 YSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFH 335
+ER L+ E GGMN+V + +T +PK+L A F + + D++ H
Sbjct: 203 RQMER---MLDNEFGGMNEVYADAWQMTGNPKYLDTAKRFSHKQIFDSMTRRIDNLDNKH 259
Query: 336 ANTHIPVVIGSQMRYEVTGDPL-----YKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
ANT +P +G Q E+ + FF + V + GG S GE + +
Sbjct: 260 ANTQVPKAVGYQRVAELNSKTASDYNEFMTAAEFFWETVVFHRSLSLGGNSRGEHFPEAG 319
Query: 391 RLASTL-GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+ + + + ESC T NMLK++ LFR ++ YAD+YERAL N +LS Q E G
Sbjct: 320 KCSDYMHERQGPESCNTNNMLKLTEGLFRIHPKVEYADFYERALYNHILSTQH-PEHGGY 378
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+Y P + Y + + WCC GTG+
Sbjct: 379 VYFTP-----ACPSHYRVYSAPGEAMWCCVGTGM 407
>gi|312131938|ref|YP_003999278.1| hypothetical protein Lbys_3265 [Leadbetterella byssophila DSM
17132]
gi|311908484|gb|ADQ18925.1| protein of unknown function DUF1680 [Leadbetterella byssophila DSM
17132]
Length = 1004
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 129/432 (29%), Positives = 189/432 (43%), Gaps = 81/432 (18%)
Query: 126 LEYLLMLDVDSLVWSFQKTAGS--PTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWAST 183
++ L D +S ++ F+ G P K W+ +LRGH GHYL+A A +AST
Sbjct: 385 IQGLAKTDPNSFLYMFRHAFGQKQPEGAKPLGVWDSQNTKLRGHATGHYLTAIAQAYAST 444
Query: 184 H-----NVTLKEKMTAVVSALSECQNKMGS------------------------------ 208
KM +V+ L E G+
Sbjct: 445 GYDKNLQANFAGKMDQLVNTLYELSRLSGTPKVQGGEAVADPTKVPMGPGKTEYDSDLTD 504
Query: 209 ------------GYLSAFPSEQFDRFEA-------LKPVWAPYYTIHKILAGLLDQYTFA 249
GY+SA+P +QF E VWAPYYT+HKILAGL+D Y +
Sbjct: 505 EGIRTDYWNWGKGYISAYPPDQFIMLEQGAKYGGQKNQVWAPYYTLHKILAGLMDVYEVS 564
Query: 250 DNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKH 308
N +AL + M E+ + R+ + + ++ + WN+ + E GGMN+ + RL+ +T++ K
Sbjct: 565 GNKKALDVAVGMSEWVHARLA-ALPQDTLIKMWNTYIAGEYGGMNESMARLFFLTKNEKF 623
Query: 309 LLLAHLFDK-PCFLG------LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVT 361
L A LFD F G LA D G HAN HIP ++GS Y V+ +P Y
Sbjct: 624 LKTAQLFDNIKMFYGDASHSHGLARNVDTFRGLHANQHIPQIVGSIEMYAVSQNPDYYFI 683
Query: 362 GTFFMDIVNASHGYATGGTSAGE-------FWSDPKRL---ASTLGTENEESCTTYNMLK 411
F + + Y+ GG + F + P + + G +N E+C TYNMLK
Sbjct: 684 AENFWHRTVSDYMYSIGGVAGARNPANAECFIAQPATIYENGFSQGGQN-ETCATYNMLK 742
Query: 412 VSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTR 471
++ LF + ++ Y DYYER L N +L+ P Y +PL G K
Sbjct: 743 LTSSLFMFDQKAEYMDYYERGLYNHILASVAKDSP-ANTYHVPLRPGSIK----QFGNPN 797
Query: 472 FSSFWCCYGTGL 483
+ F CC GT +
Sbjct: 798 MTGFTCCNGTAI 809
>gi|423223251|ref|ZP_17209720.1| hypothetical protein HMPREF1062_01906 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392639352|gb|EIY33177.1| hypothetical protein HMPREF1062_01906 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 643
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 126/394 (31%), Positives = 184/394 (46%), Gaps = 26/394 (6%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC--- 162
SL DV+L S QQ EYLL L+ DSL+ ++ AG +AY GWE
Sbjct: 41 SLEDVRLLESPF-LDLQQKGKEYLLWLNPDSLLHFYRIEAGLQPKARAYAGWESQDVWGA 99
Query: 163 -ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYL-------SAF 214
LRG F+G YLS+ + M+ +T + L +++ V++ L CQ G+L F
Sbjct: 100 GPLRGGFLGFYLSSVSMMYQATGDKELLKRLQYVLNELELCQKAGKDGFLLGIKDGRKLF 159
Query: 215 PSEQFDRFEALKP----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ 270
+ + P WAP Y I+K+L GL Y +AL M + ++F +V
Sbjct: 160 SEVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYAQCGQEKALPMMIRLADWFGYQVL 219
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
+ +T V+R L E G +N+ +Y +T + + L A + L+ D
Sbjct: 220 DKLTDEQVQR---LLVCEHGSINESFVEIYKLTGEIRFLEWAGRLNDRAMWVPLSEGKDI 276
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE-FWSDP 389
+ G+HANT IP G + YE TGD F DIVN +H + GG S GE F+
Sbjct: 277 LFGWHANTQIPKFTGFEKYYEATGDKRLLNAAMNFWDIVNQNHTWVIGGNSTGEHFFPKK 336
Query: 390 KRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+ L E+C + NML+++ LF + + A YYER L N +LS + G+
Sbjct: 337 EFEERVLLKGGPETCNSVNMLRLTETLFSYQPDAKKAAYYERVLFNHILSAYDPVK-GMC 395
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y + G Y + +R SSFWCC TGL
Sbjct: 396 CYFTSMRPG-----HYRIYASRDSSFWCCGHTGL 424
>gi|29348320|ref|NP_811823.1| hypothetical protein BT_2911 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383124515|ref|ZP_09945178.1| hypothetical protein BSIG_1739 [Bacteroides sp. 1_1_6]
gi|29340224|gb|AAO78017.1| putative Acetyl-CoA carboxylase, biotin carboxylase [Bacteroides
thetaiotaomicron VPI-5482]
gi|251841333|gb|EES69414.1| hypothetical protein BSIG_1739 [Bacteroides sp. 1_1_6]
Length = 655
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/398 (31%), Positives = 183/398 (45%), Gaps = 31/398 (7%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L+EV L D S Q+ EYLL L+ DSL+ ++ AG P+ Y GWE
Sbjct: 48 LREVRLLD------SPFLDLQRKGKEYLLWLNPDSLLHFYRIEAGLPSKAAPYAGWESQD 101
Query: 162 C----ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYL------ 211
LRG F+G YLS+ + M+ ST + L +++ V+ L CQ G+L
Sbjct: 102 VWGAGPLRGGFLGFYLSSVSMMYQSTDDKRLLKRLKYVLKELELCQKAGKDGFLLGLKDG 161
Query: 212 -SAFPSEQFDRFEALKP----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFY 266
F + + P WAP Y I+K+L GL YT +AL + + ++F
Sbjct: 162 RKLFAEVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCQMEEALPILIRLADWFG 221
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAV 326
+V + +T ++R L E G +N+ Y +T + + L A + G L+
Sbjct: 222 YQVLDKLTDDQIQR---LLICEHGSINESYVEAYELTGEKRFLDWARRLNDHAMWGPLSE 278
Query: 327 QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFW 386
D + G+HANT IP G Y+ TGD + T F +IV +H + GG S GE +
Sbjct: 279 GKDILFGWHANTQIPKFTGFHKYYQFTGDERFLTAATNFWNIVTQNHTWVIGGNSTGEHF 338
Query: 387 SDPKRLAS-TLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTE 445
+ A L E+C + NML+++ LF + A YYER L N +LS E
Sbjct: 339 FPKEEFADRVLLVGGPETCNSVNMLRLTESLFCQYPDAAKASYYERVLFNHILS-AYDPE 397
Query: 446 PGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G+ Y + G Y + +R SSFWCC TGL
Sbjct: 398 KGMCCYFTSMRPG-----HYRIYASRDSSFWCCGHTGL 430
>gi|189467200|ref|ZP_03015985.1| hypothetical protein BACINT_03584 [Bacteroides intestinalis DSM
17393]
gi|189435464|gb|EDV04449.1| beta-lactamase [Bacteroides intestinalis DSM 17393]
Length = 720
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/249 (38%), Positives = 139/249 (55%), Gaps = 10/249 (4%)
Query: 235 IHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMND 294
+HK+ +GL+ QY +ADN QAL++ M + YN+++ + + + +R + E GG+N+
Sbjct: 1 MHKLFSGLIYQYLYADNKQALEVVTRMGNWTYNKLK-PLDESTRKR---MIRNEFGGVNE 56
Query: 295 VLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTG 354
Y LY IT D ++ LA F + L Q DD+ H NT IP V+ YE+T
Sbjct: 57 SFYNLYAITGDERYQWLAEFFYHNDVIDPLKEQRDDLGTKHTNTFIPKVLTEARNYELTQ 116
Query: 355 DPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSR 414
D + FF + H +A G +S E + DP++L+ L E+C TYNMLK+SR
Sbjct: 117 DNDSRKLTDFFWHTMIDHHTFAPGCSSDKEHYFDPQQLSKHLTGYTGETCCTYNMLKLSR 176
Query: 415 HLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSS 474
HLF WT + ADYYERAL N +L Q+ E G++ Y LPL G K + TR +S
Sbjct: 177 HLFCWTGDAKVADYYERALYNHILG-QQDPETGMVSYFLPLLSGSHKV-----YSTRENS 230
Query: 475 FWCCYGTGL 483
FWCC G+G
Sbjct: 231 FWCCVGSGF 239
>gi|302539859|ref|ZP_07292201.1| putative secreted protein [Streptomyces hygroscopicus ATCC 53653]
gi|302457477|gb|EFL20570.1| putative secreted protein [Streptomyces himastatinicus ATCC 53653]
Length = 940
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 101/284 (35%), Positives = 146/284 (51%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E++ VWAPYYT HKIL GLLD + + +AL + M +
Sbjct: 389 GFLAAYPETQFITLESMTSGDYTVVWAPYYTAHKILRGLLDAHLATGDARALDLAMGMCD 448
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ Y+R+ + + +++R W + E GG+ + + LY ++ +HL LA LFD +
Sbjct: 449 WMYSRLSK-LPRSTLQRMWGIFSSGEFGGIVEAICDLYALSGKAQHLALARLFDLDKLID 507
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D + G HAN HIP+ G Y+ T + Y F D+V + Y GGTS
Sbjct: 508 ACAAGDDTLDGLHANQHIPIFTGLVRLYDETEEERYLTAAKNFWDMVVPTRMYGIGGTSN 567
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
EFW +A TL E+C YNMLK+SR LF ++ Y DYYERAL N VL ++
Sbjct: 568 REFWGARGAIAKTLSDTTAETCCAYNMLKLSRMLFFHEQDPAYMDYYERALYNQVLGSKQ 627
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E ++ Y + L G + Y T + CC GTG+
Sbjct: 628 DRADAEKPLVTYFIGLVPG--HVRDY----TPKAGTTCCEGTGM 665
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 53/110 (48%), Gaps = 6/110 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
L+ + DV L +S+ +Q L++ DVD L+ F+ AG T G A GWE
Sbjct: 50 LRPFNPEDVALR-TSVFTAKRQLMLDFGRGYDVDRLLQVFRANAGLSTRGAVAPGGWEGL 108
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
+ LRGHF GH+L+ + + T +K+ +V AL E + +
Sbjct: 109 DGEANGNLRGHFTGHFLTMLSQAYTGTGEKVYADKIRHMVGALDEVREAL 158
>gi|295133234|ref|YP_003583910.1| hypothetical protein ZPR_1378 [Zunongwangia profunda SM-A87]
gi|294981249|gb|ADF51714.1| putative secreted protein [Zunongwangia profunda SM-A87]
Length = 1016
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 133/430 (30%), Positives = 186/430 (43%), Gaps = 83/430 (19%)
Query: 129 LLMLDVDSLVWSFQKTAGS--PTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWAST--- 183
L + DS ++ F+ G P K W+ +LRGH GHYL+A A +AST
Sbjct: 400 LAQTNPDSFLYMFRNAFGQEQPVGAKPLGVWDTQETKLRGHATGHYLTAIAQAYASTGYD 459
Query: 184 ----HNVTLK-EKMTAVVSALSECQNKM-------------------------------- 206
N K E M + LS+ K
Sbjct: 460 KALQQNFADKMEYMVNTLYQLSQMSGKPAEEGGDFNANPTAVPMGPGKEIYSSDLSEEGI 519
Query: 207 -------GSGYLSAFPSEQFDRFE-------ALKPVWAPYYTIHKILAGLLDQYTFADNT 252
G G++SA+P +QF E +WAPYYT+HKILAGL+D Y + N
Sbjct: 520 RTDYWNWGEGFISAYPPDQFIMLENGAVYGTEETKIWAPYYTLHKILAGLMDIYEVSGNE 579
Query: 253 QALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLL 311
+AL + + M ++ Y R+ + T + WN + E GGMN+ + RLY IT +L
Sbjct: 580 KALAVAEGMGDWVYARLSELPTDTLISM-WNRYIAGEFGGMNEAMARLYRITGKDTYLET 638
Query: 312 AHLFDK-PCFLGL------LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLY-KVTGT 363
A LFD F G LA D G HAN HIP ++G+ Y + P Y V
Sbjct: 639 ARLFDNIKVFFGDANHSHGLAKNVDTFRGLHANQHIPQIVGALEMYRDSDKPEYFNVADN 698
Query: 364 FFMDIVNASHGYATGGTSAGE-------FWSDPKRL---ASTLGTENEESCTTYNMLKVS 413
F++ N + Y+ GG + F + P L + G +N E+C TYNMLK++
Sbjct: 699 FWVKATN-DYMYSIGGVAGARNPANAECFIAQPGTLYENGLSAGGQN-ETCATYNMLKLT 756
Query: 414 RHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFS 473
R+LF + + DYYER L N +L+ P Y +PL G K+ +
Sbjct: 757 RNLFLYEQRPELMDYYERGLYNHILASVAEDSP-ANTYHVPLRPGSKKSFG----NPNMT 811
Query: 474 SFWCCYGTGL 483
F CC GT L
Sbjct: 812 GFTCCNGTAL 821
>gi|431799831|ref|YP_007226735.1| hypothetical protein Echvi_4552 [Echinicola vietnamensis DSM 17526]
gi|430790596|gb|AGA80725.1| putative glycosyl hydrolase of unknown function (DUF1680)
[Echinicola vietnamensis DSM 17526]
Length = 1042
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 131/423 (30%), Positives = 181/423 (42%), Gaps = 81/423 (19%)
Query: 135 DSLVWSFQKTAGS--PTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTH-----NVT 187
D ++ F+ G P W+ +LRGH GHYL+A A +AST
Sbjct: 431 DDFLYMFRNAFGQEQPAGAVPLGVWDSQETKLRGHATGHYLTAIAQAYASTGYDTALQAN 490
Query: 188 LKEKMTAVVSAL---SECQNKM-------------------------------------- 206
+KM +V+ L S+ K
Sbjct: 491 FADKMAYMVNTLYNLSQMAGKPSAEADGHNADPTAVPMGPGKDFYDSDLSEEGIRTDYWN 550
Query: 207 -GSGYLSAFPSEQFDRFE-------ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMT 258
G GY+SA+P +QF E VWAPYYT+HKILAGL+D Y + N +AL +
Sbjct: 551 WGEGYISAYPPDQFIMLEHGAKYGGQKDQVWAPYYTLHKILAGLMDIYEVSGNEKALSVA 610
Query: 259 KWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDK 317
K M + R+ + T + WN+ + E GGMN+ + RLY IT ++L A LFD
Sbjct: 611 KGMGTWVAARLDKLPTSTLISM-WNTYIAGEFGGMNEAMARLYRITGSSRYLAAAKLFDN 669
Query: 318 -PCFLGL------LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVN 370
F G LA D G HAN HIP ++G+ Y T Y F I
Sbjct: 670 ITVFYGNADHDHGLAKNVDTFRGLHANQHIPQIMGALEMYRDTESAPYFHIADNFWHIAT 729
Query: 371 ASHGYATGGTSAGE-------FWSDPKRL---ASTLGTENEESCTTYNMLKVSRHLFRWT 420
+ Y+ GG + F ++P L + G +N E+C TYNMLK+SR+LF +
Sbjct: 730 NDYMYSIGGVAGARTPANAECFTTEPATLYEFGFSAGGQN-ETCATYNMLKLSRNLFLFQ 788
Query: 421 KEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYG 480
++ Y DYYER L N +L+ P Y +PL G K + F CC G
Sbjct: 789 QDPAYMDYYERGLYNHILASVAKDSP-ANTYHVPLRPGSIKQFG----NPKMKGFTCCNG 843
Query: 481 TGL 483
T +
Sbjct: 844 TAI 846
>gi|86140890|ref|ZP_01059449.1| putative secreted protein [Leeuwenhoekiella blandensis MED217]
gi|85832832|gb|EAQ51281.1| putative secreted protein [Leeuwenhoekiella blandensis MED217]
Length = 1004
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 139/463 (30%), Positives = 204/463 (44%), Gaps = 83/463 (17%)
Query: 96 KLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGS--PTAGKA 153
KL L EV+L++ L S + ++ L + DS ++ F+ G P
Sbjct: 354 KLTSFALNEVNLNNTSLGDHSKFIENRNKFIDTLAQTNPDSFLYMFRNAFGQEQPEGATP 413
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTH-----NVTLKEKM---------------- 192
W+ +LRGH GHYL+A A +AST ++KM
Sbjct: 414 LGVWDTQETKLRGHATGHYLTAIAQAYASTGYDKALQKNFEDKMNYMVNTLYDLSQLSGK 473
Query: 193 ---------------------TAVVSALSECQNKM-----GSGYLSAFPSEQFDRFE--- 223
TA S LSE + G G++SA+P +QF E
Sbjct: 474 PKTEGGAYVEDPSSVPPGPGSTAYTSDLSEDGIRTDYWNWGKGFISAYPPDQFIMLEHGA 533
Query: 224 ----ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVE 279
VWAPYYT+HKILAGL+D Y + N +AL++ + M + + R+ + T+ ++
Sbjct: 534 KYGGQETQVWAPYYTLHKILAGLIDVYEVSGNPKALQVAEGMAAWVHTRLSKLPTE-TLI 592
Query: 280 RHWNS-LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDK-PCFLGL------LAVQADDI 331
WN+ + E GG+N+ L L+ IT ++L A LFD F G LA D
Sbjct: 593 TMWNTYIAGELGGINESLAHLHRITGKSEYLETAKLFDNIKVFYGDAEHTHGLAKNVDTY 652
Query: 332 SGFHANTHIPVVIGSQMRYEVTGDP-LYKVTGTFFMDIVNASHGYATGGTSAGE------ 384
G HAN HIP ++G+ Y + P Y + F+ N + Y+ GG +
Sbjct: 653 RGLHANQHIPQIMGALELYRNSNSPEYYHIADNFWYKTKN-DYMYSIGGVAGARNPANAE 711
Query: 385 -FWSDPKRL---ASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSI 440
F + P L + G +N E+C TYNMLK++R LF + ++ DYYE+AL N +L+
Sbjct: 712 CFVAQPATLYENGLSAGGQN-ETCGTYNMLKLTRGLFFYNQQPELMDYYEQALYNQILAS 770
Query: 441 QRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
P Y +PL G K S S F CC GT +
Sbjct: 771 VAENSPA-NTYHIPLRPGSRKQFS----NADMSGFTCCNGTAI 808
>gi|408533805|emb|CCK31979.1| secreted protein [Streptomyces davawensis JCM 4913]
Length = 943
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 101/284 (35%), Positives = 143/284 (50%), Gaps = 16/284 (5%)
Query: 209 GYLSAFPSEQFDRFEALKP-----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVE 263
G+L+A+P QF E+ VWAPYYT HKIL G+LD Y D+ +AL + M +
Sbjct: 392 GFLAAYPETQFIDLESRTSSDYTKVWAPYYTAHKILRGVLDAYLATDDARALDLASGMAD 451
Query: 264 YFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLG 322
+ ++R+ + + +++R W + E GG+ + + L+ IT +HL LA LFD +
Sbjct: 452 WMHSRLSK-LPEATLQRMWGLFSSGEFGGIVEAICDLHAITGKAEHLALARLFDLDRLID 510
Query: 323 LLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSA 382
A D + G HAN HIP+ G Y+ TG+ Y F +V Y GGTS
Sbjct: 511 SCAANTDILDGLHANQHIPIFTGYLRLYDATGEQRYLDAARNFWGMVVPHRMYGIGGTST 570
Query: 383 GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
GEFW +A T+ E+C YN+LK+SR LF Y DYYERAL N VL ++
Sbjct: 571 GEFWKARDVIAGTISATTAETCCAYNLLKLSRTLFFHEPSPKYMDYYERALYNQVLGSKQ 630
Query: 443 ---GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E ++ Y + L G + Y T CC GTG+
Sbjct: 631 DKPDAEKPLVTYFIGLTPG--HVRDY----TPKQGTTCCEGTGM 668
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 53/110 (48%), Gaps = 6/110 (5%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGK-AYEGWE-- 158
+K +L V L L ++ L++ DVD L+ F+ AG PT A GWE
Sbjct: 53 VKPFALDQVTLG-QGLFADKRELMLDHARGYDVDRLLQVFRANAGLPTGDAVAPGGWEGL 111
Query: 159 --DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
+ LRGH+ GH+++ A WA T +++ ++ AL+E + +
Sbjct: 112 DGEANGNLRGHYTGHFMTMLAQAWAGTGEQVFADRLRTMIGALTEVRAAL 161
>gi|336404182|ref|ZP_08584880.1| hypothetical protein HMPREF0127_02193 [Bacteroides sp. 1_1_30]
gi|335943510|gb|EGN05349.1| hypothetical protein HMPREF0127_02193 [Bacteroides sp. 1_1_30]
Length = 650
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/378 (32%), Positives = 173/378 (45%), Gaps = 25/378 (6%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC----ELRGHFVGHYLSASA 177
QQ EYLL L+ DSL+ ++ AG P AY GWE LRG F+G YLS+ +
Sbjct: 53 QQKGKEYLLWLNPDSLLHFYRVEAGLPPKADAYAGWESQNVWGAGPLRGGFLGFYLSSVS 112
Query: 178 HMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA-------FPSEQFDRFEALKP--- 227
M ST + L +++ V+ L CQ+ G+L F + + P
Sbjct: 113 MMHQSTGDKELLKRLKYVLKELKLCQDAGKDGFLLGIKDGRMLFKEVASGKIKTNNPTVN 172
Query: 228 -VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLN 286
WAP Y I+K+L GL YT +AL M + ++F V+ K S E+ L
Sbjct: 173 GAWAPVYLINKMLLGLSAAYTQCGLEEALPMMIRLADWF---GYQVLDKLSDEQIQKLLV 229
Query: 287 EETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGS 346
E G +N+ Y +T + L A L+ D + G+HANT IP G
Sbjct: 230 CEHGSINESYVEAYELTGQKRFLDWARRLHDRAMWVPLSEGKDILYGWHANTQIPKFTGF 289
Query: 347 QMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTE-NEESCT 405
Y TGD + T F +IVN +H + GG S GE + + A L + E+C
Sbjct: 290 HKYYMFTGDKRFLTAATNFWNIVNRNHTWVIGGNSTGEHFFPKEEFADRLLLKGGPETCN 349
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSY 465
+ NML+++ LF + V A YYER L N +LS + G+ Y + G Y
Sbjct: 350 SVNMLRLTESLFSQYPDAVKASYYERVLFNHILSAY-DPKKGMCCYFTSMRPG-----HY 403
Query: 466 HGWGTRFSSFWCCYGTGL 483
+ +R SSFWCC TGL
Sbjct: 404 RIYASRDSSFWCCGHTGL 421
>gi|296129045|ref|YP_003636295.1| hypothetical protein Cfla_1194 [Cellulomonas flavigena DSM 20109]
gi|296020860|gb|ADG74096.1| protein of unknown function DUF1680 [Cellulomonas flavigena DSM
20109]
Length = 749
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/401 (30%), Positives = 173/401 (43%), Gaps = 51/401 (12%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
L+ V L D L +AQ+T LEYLL LD D L+ F++ AG P + Y WE +
Sbjct: 13 LRAVRLTD------GLFAQAQRTALEYLLGLDPDRLLAPFRREAGLPPVAEPYGSWE--S 64
Query: 162 CELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP------ 215
L GH GH LSA++ WA+T + A+V L CQ+ +G+GY+ P
Sbjct: 65 LGLDGHIGGHALSAASLQWAATGDDRAAGMAHALVDGLVLCQDALGTGYVGGLPGGVALW 124
Query: 216 ------SEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEY----- 264
+ F+ L W P+Y +HK AGL+D +A A++ + V
Sbjct: 125 ESVASGGAEAGTFD-LGGAWVPWYNVHKTYAGLIDAARYAPADVAVRAMRAAVRLGDWGV 183
Query: 265 -FYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
+R+ + L E GGM + L +T D ++ LA F LG
Sbjct: 184 ALSDRLDDAAFA-------RMLRTEFGGMCEAYGDLAALTGDARYAALARRFADESLLGP 236
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
L D++ G HANT + V+G E F+ V GG S
Sbjct: 237 LRESRDELDGLHANTQVAKVVGWPAIGEAD-------AALAFVRTVLDHRTLVLGGHSVA 289
Query: 384 E-FWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
E F P+R + E ESC T N+L+V R L+ T ++ D ER L N VLS Q
Sbjct: 290 EHFTPRPERHVTH--REGPESCNTANLLEVERRLYERTGDVALLDAAERQLVNHVLSAQH 347
Query: 443 GTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
G +Y P ++ Y + TR + WCC GT L
Sbjct: 348 --PDGGFVYFTP-----ARPGHYRVYSTRDACMWCCVGTAL 381
>gi|423219866|ref|ZP_17206362.1| hypothetical protein HMPREF1061_03135 [Bacteroides caccae
CL03T12C61]
gi|392625071|gb|EIY19149.1| hypothetical protein HMPREF1061_03135 [Bacteroides caccae
CL03T12C61]
Length = 655
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 125/399 (31%), Positives = 185/399 (46%), Gaps = 33/399 (8%)
Query: 102 LKEVSLHDVK-LDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
LKE+ L D LD QQ EYLL L+ DSL+ ++ AG + Y GWE
Sbjct: 48 LKEIRLSDGPFLD-------LQQKGKEYLLWLNPDSLLHFYRIEAGLSSKAGPYAGWESQ 100
Query: 161 TC----ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYL----- 211
LRG F+G YLS+ + M+ ST + L ++ V+ L CQ G+L
Sbjct: 101 DVWGAGPLRGGFLGFYLSSVSMMYQSTGDRELLRRLKYVLKELKLCQEAGKDGFLLGVKG 160
Query: 212 --SAFPSEQFDRFEALKP----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYF 265
F + + P WAP Y I+K+L GL YT D +AL + + ++F
Sbjct: 161 GRELFREVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCDLKEALPILVRLADWF 220
Query: 266 YNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLA 325
++V + +T +++ L E G +N+ +Y +T + L A + L+
Sbjct: 221 GSQVLDKLTDEQIQQ---LLICEHGSINESYVEVYELTGQKRFLDWARRLNDRAMWVPLS 277
Query: 326 VQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE- 384
D + G+HANT IP G Y TGD + + T F +IV +H + GG S GE
Sbjct: 278 EGKDVLFGWHANTQIPKFTGFHKYYMFTGDRAFLLAATNFWNIVKQNHTWVIGGNSTGEH 337
Query: 385 FWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT 444
F+S + + L E+C + NML+++ LF + A YYER L N +LS
Sbjct: 338 FFSKKEFIDRMLHISGPETCNSVNMLRLTEALFMQQPDATKAAYYERTLFNHILSAYDPV 397
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ Y + G Y + +R SSFWCC TGL
Sbjct: 398 K-GMCCYFTSMRPG-----HYRIYASRDSSFWCCGHTGL 430
>gi|153805786|ref|ZP_01958454.1| hypothetical protein BACCAC_00022 [Bacteroides caccae ATCC 43185]
gi|149130463|gb|EDM21669.1| hypothetical protein BACCAC_00022 [Bacteroides caccae ATCC 43185]
Length = 659
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 125/399 (31%), Positives = 184/399 (46%), Gaps = 33/399 (8%)
Query: 102 LKEVSLHDVK-LDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
LKE+ L D LD QQ EYLL L+ DSL+ ++ AG + Y GWE
Sbjct: 52 LKEIRLSDGPFLD-------LQQKGKEYLLWLNPDSLLHFYRIEAGLSSKAGPYAGWESQ 104
Query: 161 TC----ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYL----- 211
LRG F+G YLS+ + M+ ST + L ++ V+ L CQ G+L
Sbjct: 105 DVWGAGPLRGGFLGFYLSSVSMMYQSTGDRELLRRLKYVLKELKLCQEAGKDGFLLGVKG 164
Query: 212 --SAFPSEQFDRFEALKP----VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYF 265
F + + P WAP Y I+K+L GL YT D +AL + + ++F
Sbjct: 165 GRELFREVASGKIKTNNPTVNGAWAPVYLINKMLLGLSAAYTQCDLKEALPILVRLADWF 224
Query: 266 YNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLA 325
++V + +T +++ L E G +N+ +Y +T + L A + L+
Sbjct: 225 GSQVLDKLTDEQIQQ---LLICEHGSINESYVEVYELTGQKRFLDWARRLNDRAMWVPLS 281
Query: 326 VQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE- 384
D + G HANT IP G Y TGD + + T F +IV +H + GG S GE
Sbjct: 282 EGKDVLFGGHANTQIPKFTGFHKYYMFTGDRAFLLAATNFWNIVKQNHTWVIGGNSTGEH 341
Query: 385 FWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT 444
F+S + + L E+C + NML+++ LF + A YYER L N +LS
Sbjct: 342 FFSKKEFIDRMLHISGPETCNSVNMLRLTEALFMQQPDATKAAYYERTLFNHILSAYDPV 401
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+ Y + G Y + +R SSFWCC TGL
Sbjct: 402 K-GMCCYFTSMRPG-----HYRIYASRDSSFWCCGHTGL 434
>gi|423228769|ref|ZP_17215175.1| hypothetical protein HMPREF1063_00995 [Bacteroides dorei
CL02T00C15]
gi|423247580|ref|ZP_17228629.1| hypothetical protein HMPREF1064_04835 [Bacteroides dorei
CL02T12C06]
gi|392631910|gb|EIY25877.1| hypothetical protein HMPREF1064_04835 [Bacteroides dorei
CL02T12C06]
gi|392635508|gb|EIY29407.1| hypothetical protein HMPREF1063_00995 [Bacteroides dorei
CL02T00C15]
Length = 811
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 115/403 (28%), Positives = 186/403 (46%), Gaps = 26/403 (6%)
Query: 97 LAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG 156
L+ + + SL +V++ + Q + +YLL L+ D L+ F++ AG + Y
Sbjct: 28 LSKNRIDLFSLSEVRITDKYFKY-IQDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPF 86
Query: 157 WEDPTC----ELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLS 212
WE L GH +G Y+S+ + M+ +T++ + +++ +V+ L CQ G GYL
Sbjct: 87 WESEDVWGGGPLAGHILGFYMSSMSMMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLL 146
Query: 213 A-------FPSEQFDRFEALKPV----WAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
A F F P+ W P Y ++KI+ GL Y A ++ M
Sbjct: 147 ATVNGKQVFEDMIDGDFTTSNPLINQTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGM 206
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL 321
++F V + + ++++ L E G +N+ +Y IT D K+L A +
Sbjct: 207 ADWFGYEVLDKLNHENIQKM---LVCEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMW 263
Query: 322 GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
L+ D ++G+HANT IP G Y T + Y T F DIV H + GG S
Sbjct: 264 VPLSKGEDILNGWHANTQIPKFTGFNAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNS 323
Query: 382 AGEFWSDPKRLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSI 440
GE + + + ESC + NM++++ L++ + DYYER L N +L+
Sbjct: 324 TGEHFFEESMFEKKIPQYGGPESCNSVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA- 382
Query: 441 QRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
E G+ +Y P+ G Y +GTR+ SFWCC GTG
Sbjct: 383 NYDPEEGMCVYYTPMRPG-----HYKIYGTRYHSFWCCTGTGF 420
>gi|212693864|ref|ZP_03301992.1| hypothetical protein BACDOR_03386 [Bacteroides dorei DSM 17855]
gi|212663396|gb|EEB23970.1| hypothetical protein BACDOR_03386 [Bacteroides dorei DSM 17855]
Length = 811
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 175/378 (46%), Gaps = 25/378 (6%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC----ELRGHFVGHYLSASA 177
Q + +YLL L+ D L+ F++ AG + Y WE L GH +G Y+S+ +
Sbjct: 52 QDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPFWESEDVWGGGPLAGHILGFYMSSMS 111
Query: 178 HMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA-------FPSEQFDRFEALKPV-- 228
M+ +T++ + +++ +V+ L CQ G GYL A F F P+
Sbjct: 112 MMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLLATVNGKQVFEDMIDGDFTTSNPLIN 171
Query: 229 --WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLN 286
W P Y ++KI+ GL Y A ++ M ++F V + + ++++ L
Sbjct: 172 QTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGMADWFGYEVLDKLNHENIQKM---LV 228
Query: 287 EETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGS 346
E G +N+ +Y IT D K+L A + L+ D ++G+HANT IP G
Sbjct: 229 CEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMWVPLSKGEDILNGWHANTQIPKFTGF 288
Query: 347 QMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT-ENEESCT 405
Y T + Y T F DIV H + GG S GE + + + ESC
Sbjct: 289 NAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNSTGEHFFEESMFEKKIPQYGGPESCN 348
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSY 465
+ NM++++ L++ + DYYER L N +L+ E G+ +Y P+ G Y
Sbjct: 349 SVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA-NYDPEEGMCVYYTPMRPG-----HY 402
Query: 466 HGWGTRFSSFWCCYGTGL 483
+GTR+ SFWCC GTG
Sbjct: 403 KIYGTRYHSFWCCTGTGF 420
>gi|332669733|ref|YP_004452741.1| hypothetical protein Celf_1219 [Cellulomonas fimi ATCC 484]
gi|332338771|gb|AEE45354.1| protein of unknown function DUF1680 [Cellulomonas fimi ATCC 484]
Length = 752
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 123/387 (31%), Positives = 172/387 (44%), Gaps = 23/387 (5%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQ+T+L YLL LD L+ F++ AG P + Y WE + L GH GH LSA++ +W
Sbjct: 19 AQRTDLAYLLRLDPQRLLAPFRREAGLPPLAEPYGNWE--SMGLDGHTGGHALSAASLLW 76
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--SEQFDRFEA---------LKPVW 229
A+T + E A+V L CQ +G+GY+ P F+R A L W
Sbjct: 77 AATGDPRTAELAAALVDGLDACQEALGTGYVGGVPHGVALFERIAAGEVSADSFGLNGAW 136
Query: 230 APYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEET 289
P+Y +HK +AGL+D +A A + + +V F V + L E
Sbjct: 137 VPWYNLHKTVAGLVDAVRYAPAGTAERARR-VVLRFAEWWLGVAAGLDDAQFAAMLRTEF 195
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMR 349
GGM + L +T +A F L L D + G HANT I V+G
Sbjct: 196 GGMCEAFADLAALTGRDDLRAMAVRFADRTLLDPLLDGRDALDGLHANTQIAKVVGWAAL 255
Query: 350 YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT-ENEESCTTYN 408
E GD ++ F D V GG S GE + + L + E ESC T N
Sbjct: 256 AEQDGDGGWERAARTFWDAVTTHRSLVFGGDSVGEHFHPVDDFSGALTSPEGPESCNTAN 315
Query: 409 MLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGW 468
ML+++R L + D+ ERAL N VLS Q G +Y P ++ Y +
Sbjct: 316 MLELTRRLLLRRPDPTLLDFAERALVNHVLSAQH--PDGGFVYFTP-----ARPDHYRVY 368
Query: 469 GTRFSSFWCCYGTGL-GFIRWIQLSLS 494
FWCC GTGL + R +L+L+
Sbjct: 369 SQPEDGFWCCVGTGLETYARLGELALA 395
>gi|265751351|ref|ZP_06087414.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263238247|gb|EEZ23697.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 791
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 175/378 (46%), Gaps = 25/378 (6%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTC----ELRGHFVGHYLSASA 177
Q + +YLL L+ D L+ F++ AG + Y WE L GH +G Y+S+ +
Sbjct: 32 QDLDHQYLLTLEPDRLLSWFRREAGLTPKAQPYPFWESEDVWGGGPLAGHILGFYMSSMS 91
Query: 178 HMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA-------FPSEQFDRFEALKPV-- 228
M+ +T++ + +++ +V+ L CQ G GYL A F F P+
Sbjct: 92 MMYQTTNDKMILDRLNYIVNELLLCQKAHGDGYLLATVNGKQVFEDMIDGDFTTSNPLIN 151
Query: 229 --WAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLN 286
W P Y ++KI+ GL Y A ++ M ++F V + + ++++ L
Sbjct: 152 QTWEPVYIMNKIMLGLYGVYKRCHIQDAKRILMGMADWFGYEVLDKLNHENIQKM---LV 208
Query: 287 EETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGS 346
E G +N+ +Y IT D K+L A + L+ D ++G+HANT IP G
Sbjct: 209 CEHGSINESYIDVYKITGDKKYLEWAKKLNDEDMWVPLSKGEDILNGWHANTQIPKFTGF 268
Query: 347 QMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGT-ENEESCT 405
Y T + Y T F DIV H + GG S GE + + + ESC
Sbjct: 269 NAVYRYTNNKAYNDAATRFWDIVVQKHTWINGGNSTGEHFFEESMFEKKIPQYGGPESCN 328
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSY 465
+ NM++++ L++ + DYYER L N +L+ E G+ +Y P+ G Y
Sbjct: 329 SVNMMRLTESLYQTDGRVDRIDYYERVLYNHILA-NYDPEEGMCVYYTPMRPG-----HY 382
Query: 466 HGWGTRFSSFWCCYGTGL 483
+GTR+ SFWCC GTG
Sbjct: 383 KIYGTRYHSFWCCTGTGF 400
>gi|408369881|ref|ZP_11167661.1| hypothetical protein I215_03228 [Galbibacter sp. ck-I2-15]
gi|407744935|gb|EKF56502.1| hypothetical protein I215_03228 [Galbibacter sp. ck-I2-15]
Length = 1011
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 138/432 (31%), Positives = 194/432 (44%), Gaps = 87/432 (20%)
Query: 129 LLMLDVDSLVWSFQKTAG--SPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWAST-HN 185
L D DS ++ F+ G P K W+ +LRGH GHYL+A A +AS+ ++
Sbjct: 395 LAKTDPDSFLYMFRNAFGVSQPQDAKPLGVWDSQETKLRGHATGHYLTAIAQAYASSSYD 454
Query: 186 VTLKE----KMTAVV-------------------------------------SALSE--C 202
LKE KM +V S LSE
Sbjct: 455 EQLKELFAQKMNYMVETLYDLSKLSGQPINSGGEHVSDPTKVPFGPGKTDYNSDLSEQGI 514
Query: 203 QNKM---GSGYLSAFPSEQFDRFEA-------LKPVWAPYYTIHKILAGLLDQYTFADNT 252
+N G+GY+SA+P +QF E+ +WAPYYT+HKILAGLLD Y + N
Sbjct: 515 RNDYWNWGTGYISAYPPDQFIMLESGATYGGQNDQIWAPYYTLHKILAGLLDVYEISGNK 574
Query: 253 QALKMTKWMVEYFYNRVQNVITKYSVERHWNS-LNEETGGMNDVLYRLYTITQDPKHLLL 311
+AL + + M ++ R+ + T + WN + E GGMN+V+ RLY +T +L +
Sbjct: 575 KALSVAQGMGDWVSARMVELPTSTLISM-WNRYIAGEYGGMNEVMARLYRLTGTESYLKV 633
Query: 312 AHLFDK-PCFLGL------LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLY-KVTGT 363
A LFD F G LA D G H+N HIP ++G+ Y T + Y K+
Sbjct: 634 AGLFDNIKMFYGDAQHTHGLAKNVDTFRGLHSNQHIPQIVGALEMYRDTDEVEYFKIADN 693
Query: 364 FFMDIVNASHG--YATGGTSAGE-------FWSDPKRL---ASTLGTENEESCTTYNMLK 411
F+ A+H Y+ GG + F P L + G +N E+C TYNMLK
Sbjct: 694 FWF---KATHDYMYSIGGVAGARNPANAECFPVQPATLYENGFSSGGQN-ETCATYNMLK 749
Query: 412 VSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTR 471
++R LF + + DYYER L N +L+ P Y +PL G K H
Sbjct: 750 LTRDLFFFEPKAQLMDYYERGLYNHILASVAKDSP-ANTYHVPLLPGSVK----HFGNPD 804
Query: 472 FSSFWCCYGTGL 483
+ F CC GT +
Sbjct: 805 MTGFTCCNGTAI 816
>gi|237711616|ref|ZP_04542097.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|229454311|gb|EEO60032.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
Length = 780
Score = 154 bits (390), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 117/407 (28%), Positives = 182/407 (44%), Gaps = 30/407 (7%)
Query: 97 LAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEG 156
+ GD + SL +V+L S N Y+L L+ D L+ F++ AG + Y
Sbjct: 1 MNGDKISLFSLKEVRLLDSDFK-HIMDLNHAYMLSLEPDRLLSWFRREAGLTPKAQPYPF 59
Query: 157 WEDPTCE----LRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYL- 211
WE L GH +G YLS + M+ ST + + +++ ++ LS CQ G GYL
Sbjct: 60 WESEYMNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQAGGDGYLL 119
Query: 212 ------SAFPSEQFDRFEALKP--------VWAPYYTIHKILAGLLDQYTFADNTQALKM 257
+ F + F+ P W P Y ++KI+ GL Y D QA ++
Sbjct: 120 PTICGRAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMRCDLLQAKEI 179
Query: 258 TKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDK 317
M ++F +VI K S + L E G +N+ +Y IT + K+L A +
Sbjct: 180 LVKMADWF---GYSVIDKLSHDDLQKLLVCEHGSINESFIDVYQITGEEKYLKWAQRLND 236
Query: 318 PCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYAT 377
++ D + G+HANT IP G + Y + + FF D V H +
Sbjct: 237 EDMWVPMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDTVVRKHTWVM 296
Query: 378 GGTSAGEFWSDPKRLASTLGTE-NEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNG 436
GG S GE + P+ + ESC + NML+++ L+ E+ DYYE+ L N
Sbjct: 297 GGNSTGEHFFAPEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEKVDYYEKVLFNH 356
Query: 437 VLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+L+ + G+ +Y + G Y +GT++ SFWCC GTG
Sbjct: 357 ILA-NYDPDQGMCVYYTSMKPG-----HYKIYGTKYDSFWCCTGTGF 397
>gi|212695364|ref|ZP_03303492.1| hypothetical protein BACDOR_04911 [Bacteroides dorei DSM 17855]
gi|345513936|ref|ZP_08793451.1| hypothetical protein BSEG_01968 [Bacteroides dorei 5_1_36/D4]
gi|423230909|ref|ZP_17217313.1| hypothetical protein HMPREF1063_03133 [Bacteroides dorei
CL02T00C15]
gi|423241462|ref|ZP_17222575.1| hypothetical protein HMPREF1065_03198 [Bacteroides dorei
CL03T12C01]
gi|423244620|ref|ZP_17225695.1| hypothetical protein HMPREF1064_01901 [Bacteroides dorei
CL02T12C06]
gi|212662093|gb|EEB22667.1| hypothetical protein BACDOR_04911 [Bacteroides dorei DSM 17855]
gi|229435750|gb|EEO45827.1| hypothetical protein BSEG_01968 [Bacteroides dorei 5_1_36/D4]
gi|392630029|gb|EIY24031.1| hypothetical protein HMPREF1063_03133 [Bacteroides dorei
CL02T00C15]
gi|392641355|gb|EIY35132.1| hypothetical protein HMPREF1065_03198 [Bacteroides dorei
CL03T12C01]
gi|392641469|gb|EIY35245.1| hypothetical protein HMPREF1064_01901 [Bacteroides dorei
CL02T12C06]
Length = 808
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 117/405 (28%), Positives = 181/405 (44%), Gaps = 30/405 (7%)
Query: 99 GDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWE 158
GD + SL +V+L S N Y+L L+ D L+ F++ AG + Y WE
Sbjct: 31 GDKISLFSLKEVRLLDSDFK-HIMDLNHAYMLSLEPDRLLSWFRREAGLTPKAQPYPFWE 89
Query: 159 DPTCE----LRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYL--- 211
L GH +G YLS + M+ ST + + +++ ++ LS CQ G GYL
Sbjct: 90 SEYMNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQAGGDGYLLPT 149
Query: 212 ----SAFPSEQFDRFEALKP--------VWAPYYTIHKILAGLLDQYTFADNTQALKMTK 259
+ F + F+ P W P Y ++KI+ GL Y D QA ++
Sbjct: 150 ICGRAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMRCDLLQAKEILV 209
Query: 260 WMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPC 319
M ++F +VI K S + L E G +N+ +Y IT + K+L A +
Sbjct: 210 KMADWF---GYSVIDKLSHDDLQKLLVCEHGSINESFIDVYQITGEEKYLKWAQRLNDED 266
Query: 320 FLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGG 379
++ D + G+HANT IP G + Y + + FF D V H + GG
Sbjct: 267 MWVPMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDTVVRKHTWVMGG 326
Query: 380 TSAGEFWSDPKRLASTLGTE-NEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVL 438
S GE + P+ + ESC + NML+++ L+ E+ DYYE+ L N +L
Sbjct: 327 NSTGEHFFAPEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEKVDYYEKVLFNHIL 386
Query: 439 SIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ + G+ +Y + G Y +GT++ SFWCC GTG
Sbjct: 387 A-NYDPDQGMCVYYTSMKPG-----HYKIYGTKYDSFWCCTGTGF 425
>gi|265753026|ref|ZP_06088595.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263236212|gb|EEZ21707.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 808
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 117/405 (28%), Positives = 181/405 (44%), Gaps = 30/405 (7%)
Query: 99 GDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWE 158
GD + SL +V+L S N Y+L L+ D L+ F++ AG + Y WE
Sbjct: 31 GDKISLFSLKEVRLLDSDFK-HIMDLNHAYMLSLEPDRLLSWFRREAGLTPKAQPYPFWE 89
Query: 159 DPTCE----LRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYL--- 211
L GH +G YLS + M+ ST + + +++ ++ LS CQ G GYL
Sbjct: 90 SEYMNGHGPLPGHIMGFYLSGISMMYDSTGDTAILSRLSYILEELSLCQQAGGDGYLLPT 149
Query: 212 ----SAFPSEQFDRFEALKP--------VWAPYYTIHKILAGLLDQYTFADNTQALKMTK 259
+ F + F+ P W P Y ++KI+ GL Y D QA ++
Sbjct: 150 ICGRAIFENVLDGNFKTSNPFIETPYDKCWEPVYVMNKIMLGLYQVYMRCDLLQAKEILV 209
Query: 260 WMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPC 319
M ++F +VI K S + L E G +N+ +Y IT + K+L A +
Sbjct: 210 KMADWF---GYSVIDKLSHDDLQKLLVCEHGSINESFIDVYQITGEEKYLKWAQRLNDED 266
Query: 320 FLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGG 379
++ D + G+HANT IP G + Y + + FF D V H + GG
Sbjct: 267 MWVPMSEGKDILEGWHANTQIPKFTGFESVYRYDSNERFTTAARFFWDTVVRKHTWVMGG 326
Query: 380 TSAGEFWSDPKRLASTLGTE-NEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVL 438
S GE + P+ + ESC + NML+++ L+ E+ DYYE+ L N +L
Sbjct: 327 NSTGEHFFAPEEFEHRIELNGGPESCNSVNMLRLTESLYCDYAEVEKVDYYEKVLFNHIL 386
Query: 439 SIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ + G+ +Y + G Y +GT++ SFWCC GTG
Sbjct: 387 A-NYDPDQGMCVYYTSMKPG-----HYKIYGTKYDSFWCCTGTGF 425
>gi|427384823|ref|ZP_18881328.1| hypothetical protein HMPREF9447_02361 [Bacteroides oleiciplenus YIT
12058]
gi|425728084|gb|EKU90943.1| hypothetical protein HMPREF9447_02361 [Bacteroides oleiciplenus YIT
12058]
Length = 813
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 113/396 (28%), Positives = 183/396 (46%), Gaps = 31/396 (7%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRG 166
L +V+L P S + A Q + +YLL D++ ++ +K G P KAY G P R
Sbjct: 43 LSEVRLLPGSPFYHAMQVSQQYLLDADIERMLNGRRKEVGIPEK-KAYPGSNQPAG-TRA 100
Query: 167 HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA-----FPSEQFDR 221
HY+S ++ M+A T + +++ ++ L+ N+ S Y P + +
Sbjct: 101 TDWHHYISGTSLMYAQTGDRRFLDRVNYLIDELAMLDNRKDSLYRVQGKKLELPYAKLMK 160
Query: 222 FEAL--KP----------VWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRV 269
E L P W P+Y HK A D Y + DN +AL + W+ + V
Sbjct: 161 GELLLNSPDEAGYPWGGLCWIPFYWQHKEFAAYRDAYLYCDNLKALNL--WIKQA--EPV 216
Query: 270 QNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQAD 329
I K + + L+ E GG+N V LY +T D ++L ++ + + +A D
Sbjct: 217 TEFILKVNPDLFEGFLDIENGGINAVFADLYALTGDERYLAVSMKLNHQKVILNIANGKD 276
Query: 330 DISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDP 389
+ G HAN +P G+ +Y++TGD + + F I H GG S E +
Sbjct: 277 VLYGRHANFQLPAFEGTARQYQLTGDEVCRKATQNFAGIYYRDHMNCIGGNSCYERFGRS 336
Query: 390 KRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVM 449
+ LG+ + E+C TYNM+K++ + F T ++ + DY+ERAL N +L+ Q GV
Sbjct: 337 GEITKRLGSTSSETCNTYNMMKIALNTFESTGDLHHMDYFERALYNHILASQDPETGGVT 396
Query: 450 IYMLPLGRGDSKAKSYHGWGTRFS--SFWCCYGTGL 483
Y + L G + + RF+ WCC GTG+
Sbjct: 397 YYTMLLPGG------FKSYSDRFNIEGIWCCVGTGM 426
>gi|257068350|ref|YP_003154605.1| hypothetical protein Bfae_11690 [Brachybacterium faecium DSM 4810]
gi|256559168|gb|ACU85015.1| uncharacterized conserved protein [Brachybacterium faecium DSM
4810]
Length = 752
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 119/378 (31%), Positives = 172/378 (45%), Gaps = 37/378 (9%)
Query: 121 AQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
AQ+T+LEYLL L+ + L+ F++ AG T Y WE + L GH GH L+A++ MW
Sbjct: 25 AQRTDLEYLLGLEAERLLAPFRREAGIATTAAPYGNWE--SMGLDGHIGGHALAAASLMW 82
Query: 181 ASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFP--SEQFDRFEA---------LKPVW 229
A+T + E +V L ECQ ++G+GY+ P +E + + L W
Sbjct: 83 AATGDERAAELARQLVEGLRECQARLGTGYVGGIPGGAELWAQIRTIASQAQTWDLGGAW 142
Query: 230 APYYTIHKILAGLLDQYTFAD---NTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLN 286
P+Y +HK AGL++ A + AL++ + + ++ R+ + + R L
Sbjct: 143 VPWYNLHKTFAGLIEAVRHAPAGTASCALEVLRGLGDWG-ARLGEQLDDEAFAR---MLR 198
Query: 287 EETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGS 346
E GGM L IT + +H +A F L L D++ G HANT I VIG
Sbjct: 199 TEFGGMCAAYADLAEITGEERHARMARRFADESLLAPLRAGRDELDGMHANTQIAKVIG- 257
Query: 347 QMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE-FWSDPKRLASTLGTENEESCT 405
+ G+ T F+ V A GG S E F ++P LA E ESC
Sbjct: 258 ---WPALGETAAAET---FVRTVLERRTLAFGGNSVAEHFTAEP--LAHVTDREGPESCN 309
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSY 465
T NML+ + L+ D ER L VLS Q G +Y P ++ Y
Sbjct: 310 TVNMLEAEQRLYEHGGGPWLFDAIERQLVGHVLSAQH--PEGGFVYFTP-----ARPGHY 362
Query: 466 HGWGTRFSSFWCCYGTGL 483
+ TR + WCC GTGL
Sbjct: 363 RVYSTRENGMWCCVGTGL 380
>gi|340347550|ref|ZP_08670658.1| hypothetical protein HMPREF9136_1656 [Prevotella dentalis DSM 3688]
gi|339609246|gb|EGQ14121.1| hypothetical protein HMPREF9136_1656 [Prevotella dentalis DSM 3688]
Length = 1007
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 128/465 (27%), Positives = 194/465 (41%), Gaps = 94/465 (20%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCEL 164
SL DV LD + + L + DV +++++ T G T G +GW+ P +L
Sbjct: 171 SLADVTLDGDNRLTHNRDEALREICSWDVSQQLYNYRDTYGLSTDGYTRSDGWDSPDTKL 230
Query: 165 RGHFVGHYLSASAHMWASTHN----VTLKEKMTAVVSALSECQNKM-------------- 206
+GH GHY+SA A +A T + L++ +T +V+ L CQ K
Sbjct: 231 KGHGSGHYMSAIAQAYAVTKDPRQKAILRKNITRMVNELRACQEKTFVFDKALNRYWEAR 290
Query: 207 ----------------------------GSGYLSAFPSEQFDRFEALKP------VWAPY 232
G GY++A P++ E + VWAPY
Sbjct: 291 DFAPEEELRGLKGTWEAFDEYKKHPEKYGYGYINAIPAQHCALIEMYRAYNNSDWVWAPY 350
Query: 233 YTIHKILAGLLDQYTFADNT----QALKMTKWMVEYFYNRV--QNVITKYSVERHWNS-- 284
Y++HK LAGL+D T+ D+ +AL K M + +NR+ + + + E S
Sbjct: 351 YSVHKQLAGLIDIATYFDDKAICDKALLTAKDMGLWVWNRMHYRTYVKEDGTEAERRSKP 410
Query: 285 ----------LNEETGGMNDVLYRLYTITQDP----KHLLLAHLFDKPCFLGLLAVQADD 330
+ E GGM++ L RL + DP K + A FD P F L+ DD
Sbjct: 411 GNRYEMWDMYIAGEVGGMSESLARLSEMVSDPGEKAKLIEAAGCFDAPKFYNPLSKNVDD 470
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
I HAN HIP+++G+ Y+ +P Y F +V + YATGG GE + P
Sbjct: 471 IRTRHANQHIPMIVGALRSYKTNKNPFYYHLSQNFWHLVQGRYMYATGGVGNGEMFRQPY 530
Query: 391 RLASTLGTEN------------EESCTTYNMLKVSRHLFRWT-KEMVYADYYERALTNGV 437
++ T E+C TYN+LK++ L + + Y DYYER L N +
Sbjct: 531 TQILSMATNGMQEGERQANPDINETCCTYNLLKLTSDLNCYNPDDARYMDYYERGLYNQI 590
Query: 438 LSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTG 482
+ + Y +G +K +G CC GTG
Sbjct: 591 VG-SLNPDKYETCYQYAVGLNATKP-----FGNETPQSTCCGGTG 629
>gi|433653573|ref|YP_007297427.1| hypothetical protein Prede_2696 [Prevotella dentalis DSM 3688]
gi|433304106|gb|AGB29921.1| hypothetical protein Prede_2696 [Prevotella dentalis DSM 3688]
Length = 986
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 128/465 (27%), Positives = 194/465 (41%), Gaps = 94/465 (20%)
Query: 106 SLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCEL 164
SL DV LD + + L + DV +++++ T G T G +GW+ P +L
Sbjct: 150 SLADVTLDGDNRLTHNRDEALREICSWDVSQQLYNYRDTYGLSTDGYTRSDGWDSPDTKL 209
Query: 165 RGHFVGHYLSASAHMWASTHN----VTLKEKMTAVVSALSECQNKM-------------- 206
+GH GHY+SA A +A T + L++ +T +V+ L CQ K
Sbjct: 210 KGHGSGHYMSAIAQAYAVTKDPRQKAILRKNITRMVNELRACQEKTFVFDKALNRYWEAR 269
Query: 207 ----------------------------GSGYLSAFPSEQFDRFEALKP------VWAPY 232
G GY++A P++ E + VWAPY
Sbjct: 270 DFAPEEELRGLKGTWEAFDEYKKHPEKYGYGYINAIPAQHCALIEMYRAYNNSDWVWAPY 329
Query: 233 YTIHKILAGLLDQYTFADNT----QALKMTKWMVEYFYNRV--QNVITKYSVERHWNS-- 284
Y++HK LAGL+D T+ D+ +AL K M + +NR+ + + + E S
Sbjct: 330 YSVHKQLAGLIDIATYFDDKAICDKALLTAKDMGLWVWNRMHYRTYVKEDGTEAERRSKP 389
Query: 285 ----------LNEETGGMNDVLYRLYTITQDP----KHLLLAHLFDKPCFLGLLAVQADD 330
+ E GGM++ L RL + DP K + A FD P F L+ DD
Sbjct: 390 GNRYEMWDMYIAGEVGGMSESLARLSEMVSDPGEKAKLIEAAGCFDAPKFYNPLSKNVDD 449
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPK 390
I HAN HIP+++G+ Y+ +P Y F +V + YATGG GE + P
Sbjct: 450 IRTRHANQHIPMIVGALRSYKTNKNPFYYHLSQNFWHLVQGRYMYATGGVGNGEMFRQPY 509
Query: 391 RLASTLGTEN------------EESCTTYNMLKVSRHLFRWT-KEMVYADYYERALTNGV 437
++ T E+C TYN+LK++ L + + Y DYYER L N +
Sbjct: 510 TQILSMATNGMQEGERQANPDINETCCTYNLLKLTSDLNCYNPDDARYMDYYERGLYNQI 569
Query: 438 LSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTG 482
+ + Y +G +K +G CC GTG
Sbjct: 570 VG-SLNPDKYETCYQYAVGLNATKP-----FGNETPQSTCCGGTG 608
>gi|297725075|ref|NP_001174901.1| Os06g0612950 [Oryza sativa Japonica Group]
gi|255677224|dbj|BAH93629.1| Os06g0612950 [Oryza sativa Japonica Group]
Length = 198
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 87/167 (52%), Positives = 105/167 (62%), Gaps = 21/167 (12%)
Query: 21 KECTNSFPQLASHTFRYELLSSKNETWK-KEVYSHY-HLTPTDDSAWSNLLPRKMLSETD 78
KECTN QL+SHT R L SS W+ +E Y H HL PTD++AW +L+P S +
Sbjct: 23 KECTNIPTQLSSHTVRARLQSSSAAEWRWREEYFHGDHLNPTDEAAWMDLMPLAAASAS- 81
Query: 79 EFSWTMIYRKMKNPDGFKLAGD-----------FLKEVSLHDVKLD----PSSLHWRAQQ 123
EF W M+YR +K G +AGD FL+EVSLHDV+LD ++ RAQQ
Sbjct: 82 EFDWAMLYRSLK---GAAVAGDEGGGGGGGGFGFLEEVSLHDVRLDMDGGGDGVYGRAQQ 138
Query: 124 TNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVG 170
TNLEYLL+L+VD LVWSF+ AG P GK Y GWE P ELRGHFVG
Sbjct: 139 TNLEYLLLLEVDRLVWSFRTQAGLPAPGKPYGGWEGPDVELRGHFVG 185
>gi|345514178|ref|ZP_08793691.1| hypothetical protein BSEG_03388 [Bacteroides dorei 5_1_36/D4]
gi|229437170|gb|EEO47247.1| hypothetical protein BSEG_03388 [Bacteroides dorei 5_1_36/D4]
Length = 1118
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 128/466 (27%), Positives = 202/466 (43%), Gaps = 94/466 (20%)
Query: 105 VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCE 163
+ L++VK+D ++ + ++ ++ DV +++++ T G T G +GW+ P +
Sbjct: 151 IPLNNVKIDGNNRLTSNRDLAIKEIISWDVSQQLYNYRDTYGLSTEGYTRSDGWDSPETK 210
Query: 164 LRGHFVGHYLSASAHMWAS----THNVTLKEKMTAVVSALSECQNKM------------- 206
L+GH GHY+SA A +A+ +H L+ +T +V+ L ECQ +
Sbjct: 211 LKGHGSGHYMSALALAYAAATNPSHKEILRRNITRMVNELRECQERTFVWSEELGRYLEA 270
Query: 207 -----------------------------GSGYLSAFPS------EQFDRFEALKPVWAP 231
G GYL+A P E + + VWAP
Sbjct: 271 RDFAPEEELKKMKGTWEAFDEHKTKWATYGYGYLNAIPPHHPALIEMYRAYNNSDWVWAP 330
Query: 232 YYTIHKILAGLLDQYTFADNT----QALKMTKWMVEYFYNRV--QNVITKYSVERH---- 281
YY+IHK LAGL+D T+ D+ +AL + K M + +NR+ + + K +
Sbjct: 331 YYSIHKQLAGLIDIATYMDDKSIADKALLIAKDMGLWVWNRMHYRTYVKKDGTQEERRTR 390
Query: 282 -------WNS-LNEETGGMNDVLYRLYTITQDPKH----LLLAHLFDKPCFLGLLAVQAD 329
WN + E GGM + L RL + P+ + ++ FD P F L+ D
Sbjct: 391 PGNRYEMWNMYIAGEVGGMGESLARLSEMVSAPEEKARLIEASNCFDSPAFYEPLSKNID 450
Query: 330 DISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDP 389
DI HAN HIP++IG+ Y D Y F +++ + Y+TGG GE + P
Sbjct: 451 DIRNRHANQHIPMIIGALRSYLSNNDTFYYHVSHNFWNLIQGRYRYSTGGVGNGEMFRQP 510
Query: 390 KRLASTLG----TENE--------ESCTTYNMLKVSRHLFRWT-KEMVYADYYERALTNG 436
++ +E E E+C TYN+LK+++ L + + Y DYYER L N
Sbjct: 511 YTQIVSMAMNGVSEGESHSNPHINETCCTYNLLKLTKDLNCFNPDDARYMDYYERTLYNQ 570
Query: 437 VLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTG 482
++ E Y +G SK WG CC GTG
Sbjct: 571 IIG-SLHPEHYQTTYQYAVGLNASKP-----WGNETPQSTCCGGTG 610
>gi|393782709|ref|ZP_10370892.1| hypothetical protein HMPREF1071_01760 [Bacteroides salyersiae
CL02T12C01]
gi|392672936|gb|EIY66402.1| hypothetical protein HMPREF1071_01760 [Bacteroides salyersiae
CL02T12C01]
Length = 673
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 123/446 (27%), Positives = 187/446 (41%), Gaps = 85/446 (19%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA-------- 153
+ L +V+L R Q + +Y+ L+ D + F++ AG K
Sbjct: 34 FRSFGLDEVRLKDREFKLR-QNHDFDYIRTLEPDRYLSPFRRNAGIEVDSKGIPVDNTKH 92
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNK-------M 206
Y+GWE L GHYLSA + M+ T + TL K+ ++ L+ Q +
Sbjct: 93 YDGWE----FLGSSTFGHYLSAISMMYKVTGDTTLLHKINYIIDELNFIQRNPSYENENL 148
Query: 207 GSGYLSAFPSEQF---------------------------DRFEALKPVW---------- 229
G L AF ++ +R ++ V+
Sbjct: 149 RHGALVAFDRDRHKHVREPNFLRTYDELRQGQVNLTSAPDNRGATVENVYFKTFYWLSGG 208
Query: 230 APYYTIHKILAGLLDQYTFADNTQALKM-------TKWMVEYFYNRVQNVITKYSVERHW 282
+YT HKI AG+ D Y + N +A K+ W+ E +T ++ R
Sbjct: 209 LSWYTNHKIYAGIRDAYLYTGNPKAKKVFLSFCDWACWVTE--------KLTDHAFAR-- 258
Query: 283 NSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDK-----PCFLGLLAVQADDISGFHAN 337
L E G MN++L Y + + K+L A F++ PC G + A+ IS HAN
Sbjct: 259 -MLYSEHGAMNEMLTDAYAFSGERKYLDCAFRFNEQETMVPCIDGDIKKIAETISHTHAN 317
Query: 338 THIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG 397
IP G +E TGD L+KV F V + TGG S E + P + + +
Sbjct: 318 AQIPQFYGLIKEFEYTGDSLFKVAAENFFKYVTNYQSFVTGGNSEWEQFRAPGNIMAQVT 377
Query: 398 TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGR 457
+ E+C TYNMLK+++ LF T + +Y +Y ERAL N +L ++PG Y L L
Sbjct: 378 RRSGETCNTYNMLKIAKGLFELTGDTLYLNYMERALYNHILPSIHTSQPGAFTYFLSLEP 437
Query: 458 GDSKAKSYHGWGTRFSSFWCCYGTGL 483
G K + + S WCC GTG+
Sbjct: 438 GYFKT-----FSRPYDSHWCCVGTGM 458
>gi|393782707|ref|ZP_10370890.1| hypothetical protein HMPREF1071_01758 [Bacteroides salyersiae
CL02T12C01]
gi|392672934|gb|EIY66400.1| hypothetical protein HMPREF1071_01758 [Bacteroides salyersiae
CL02T12C01]
Length = 1293
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 170/387 (43%), Gaps = 43/387 (11%)
Query: 120 RAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHM 179
+A N+ YL DV+ L+ K K Y G D T HYLSA +
Sbjct: 459 QAMDKNITYLKSFDVNRLLAQTFKYNLGIDDYKLYGGANDAT-------FAHYLSAISMG 511
Query: 180 WASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA--FPSEQFDRFEALKPV--------- 228
+A+T + L +++ +V + + Q+ MG G S P+ F + K +
Sbjct: 512 YAATGDEDLLQRVNHMVDVMIQAQDVMGDGLYSNNDAPTWGFYKMAKEKVITPYGWDENG 571
Query: 229 --WA------PYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVITKY 276
W P+Y HK A D Y +A N A +K +W+V + N + + K
Sbjct: 572 HPWGNNNIGFPFYAHHKAFAAFRDAYIYAGNENARVAFVKFCEWLVMWMQNFTDDNLQKM 631
Query: 277 SVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHA 336
L E GGM +VL Y ++ K L A F + F ++ DD+SG H+
Sbjct: 632 --------LESEHGGMVEVLSDAYALSGKIKFLDAARRFTRDNFAAAMSGNRDDLSGRHS 683
Query: 337 NTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTL 396
N H+P+ +G+ + Y +GD T F IV+ H GG E + P L L
Sbjct: 684 NFHVPMAVGAAIHYLYSGDERSGKTAHNFFHIVHDHHTLCNGGNGNNERFGTPDLLTYRL 743
Query: 397 GTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLG 456
G E+C++YNMLK+++ LF + Y DYYE + N +L+I + Y + L
Sbjct: 744 GQRGPETCSSYNMLKLAKDLFCQEGDTEYLDYYENTMWNHILAILSPRSDAGVCYHVNL- 802
Query: 457 RGDSKAKSYHGWGTRFSSFWCCYGTGL 483
K ++ + +S+ WCC GTG+
Sbjct: 803 ----KPGTFKMYSDLYSNLWCCVGTGM 825
>gi|336397986|ref|ZP_08578786.1| protein of unknown function DUF1680 [Prevotella multisaccharivorax
DSM 17128]
gi|336067722|gb|EGN56356.1| protein of unknown function DUF1680 [Prevotella multisaccharivorax
DSM 17128]
Length = 943
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 134/479 (27%), Positives = 197/479 (41%), Gaps = 102/479 (21%)
Query: 98 AGDFLKEVS----LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-K 152
A + KE++ L DV ++ + + + + DV +++++ T T G K
Sbjct: 116 ANEEKKEIAQTFPLSDVTINGDNRLTHNRDEAIAAICSWDVTQQLYNYRDTYNMSTEGYK 175
Query: 153 AYEGWEDPTCELRGHFVGHYLSASAHMWASTHN----VTLKEKMTAVVSALSECQNKM-- 206
+GW+ P +L+GH GHY+SA A +A T + LK+ +T +V+ L CQ K
Sbjct: 176 VADGWDSPDTKLKGHGSGHYMSAIAQAYAVTKDPQQKAILKKNITRMVNELRACQEKTFV 235
Query: 207 ----------------------------------------GSGYLSAFPSEQFDRFEALK 226
G GY++A PS+ E +
Sbjct: 236 WNDSLGRYWEARDFAPESELKNMKGTWAAFDEYKKHPEKYGYGYINAIPSQHCALIEMYR 295
Query: 227 P------VWAPYYTIHKILAGLLDQYTFADN----TQALKMTKWMVEYFYNRVQ-NVITK 275
P VWAPYYTIHK LAGL+D T D+ +AL + K M + +NR+ K
Sbjct: 296 PYNNSDWVWAPYYTIHKELAGLIDIATLFDDKEVAAKALLIAKDMGLWVWNRMHYRTYVK 355
Query: 276 YS---VERHWNSLNE----------ETGGMNDVLYRLYTI----TQDPKHLLLAHLFDKP 318
ER N E GGM + L RL + T + L A FD P
Sbjct: 356 ADGTQEERRAKPGNRYEMWDMYIAGEVGGMQESLSRLSEMVSNSTDKARLLEAAQCFDAP 415
Query: 319 CFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATG 378
F LA DDI HAN HIP+++G+ Y+ D Y F +V + YATG
Sbjct: 416 KFYEPLAKNIDDIRTRHANQHIPMIVGALRSYKSNHDIHYYNVADNFWHLVQGRYMYATG 475
Query: 379 GTSAGEFWSDPKRLASTLGTEN------------EESCTTYNMLKVSRHLFRWTKEMV-Y 425
G GE + P ++ T E+C TYN+LK+++ L + +
Sbjct: 476 GVGNGEMFRQPYTQVLSMATNGMQEGEAMANPNLNETCCTYNLLKLTKDLNVYNPDDAEL 535
Query: 426 ADYYERALTNGVLSIQRGTEPG--VMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTG 482
DYYER L N ++ +P + Y +G +K +G CC GTG
Sbjct: 536 MDYYERGLYNQIVG---SLDPDHYAVTYQYAVGLNATKP-----FGNETPQSTCCGGTG 586
>gi|150003704|ref|YP_001298448.1| hypothetical protein BVU_1135 [Bacteroides vulgatus ATCC 8482]
gi|149932128|gb|ABR38826.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 1116
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 128/466 (27%), Positives = 201/466 (43%), Gaps = 94/466 (20%)
Query: 105 VSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG-KAYEGWEDPTCE 163
+ L++VK++ ++ + ++ ++ DV +++++ T G T G +GW+ P +
Sbjct: 149 IPLNNVKINGNNRLTSNRDLAIKEIISWDVSQQLYNYRDTYGLSTEGYTRSDGWDSPETK 208
Query: 164 LRGHFVGHYLSASAHMWAS----THNVTLKEKMTAVVSALSECQNKM------------- 206
L+GH GHY+SA A +A+ +H L+ +T +V+ L ECQ +
Sbjct: 209 LKGHGSGHYMSALALAYAAATNPSHKEILRRNITRMVNELRECQERTFVWSEELGRYLEA 268
Query: 207 -----------------------------GSGYLSAFPS------EQFDRFEALKPVWAP 231
G GYL+A P E + + VWAP
Sbjct: 269 RDFAPEEELKKMKGTWEAFDEHKTKWATYGYGYLNAIPPHHPALIEMYRAYNNSDWVWAP 328
Query: 232 YYTIHKILAGLLDQYTFADNT----QALKMTKWMVEYFYNR------VQNVITKYSVERH 281
YY+IHK LAGL+D T+ D+ +AL + K M + +NR V+ T+ H
Sbjct: 329 YYSIHKQLAGLIDIATYMDDKSIADKALLIAKDMGLWVWNRMHYRTYVKKDGTQEERRTH 388
Query: 282 -------WNS-LNEETGGMNDVLYRLYTITQDPKH----LLLAHLFDKPCFLGLLAVQAD 329
WN + E GGM + L RL + P+ + ++ FD P F L+ D
Sbjct: 389 PGNRYEMWNMYIAGEVGGMGESLARLSEMVSAPEEKARLIEASNCFDSPAFYEPLSKNID 448
Query: 330 DISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDP 389
DI HAN HIP++IG+ Y D Y F +++ + Y+TGG GE + P
Sbjct: 449 DIRNRHANQHIPMIIGALRSYLSNNDTFYYHVSHNFWNLIQGRYRYSTGGVGNGEMFRQP 508
Query: 390 KRLASTLG----TENE--------ESCTTYNMLKVSRHLFRWT-KEMVYADYYERALTNG 436
++ +E E E+C YN+LK+++ L + + Y DYYER L N
Sbjct: 509 YTQIVSMAMNGVSEGESHSNPHINETCCAYNLLKLTKDLNCFNPDDARYMDYYERTLYNQ 568
Query: 437 VLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTG 482
++ E Y +G SK WG CC GTG
Sbjct: 569 IIG-SLHPEHYQTTYQYAVGLNASKP-----WGNETPQSTCCGGTG 608
>gi|227509161|ref|ZP_03939210.1| possible acetyl-CoA carboxylase, biotin carboxylase [Lactobacillus
brevis subsp. gravesensis ATCC 27305]
gi|227191368|gb|EEI71435.1| possible acetyl-CoA carboxylase, biotin carboxylase [Lactobacillus
brevis subsp. gravesensis ATCC 27305]
Length = 606
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 84/219 (38%), Positives = 111/219 (50%), Gaps = 26/219 (11%)
Query: 285 LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVI 344
L E GGMND LY L++IT+D +HL A FD+ LA D + G HANT IP ++
Sbjct: 2 LKVEYGGMNDALYHLFSITKDERHLTAATYFDEVELFKDLAAAKDVLPGKHANTTIPKLL 61
Query: 345 GSQMRYEVTGD----------------PLYKVTGTFFMDIVNASHGYATGGTSAGEFWSD 388
G+ RYE+ D P+Y F IV H YATGG S E + D
Sbjct: 62 GAIRRYEIFDDPQMAGQYLYEKDQKQLPIYLKAAENFWRIVINHHTYATGGNSQSEHFHD 121
Query: 389 PKRLASTLGTENE----ESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT 444
P +L E+ E+C T+NMLK+SR LFR T + Y DYY+R +N +L Q
Sbjct: 122 PNQLYHDAVIEDGATTCETCNTHNMLKLSRELFRVTGDKKYLDYYDRTYSNAILGSQ-NP 180
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+ G+M Y P+ G K + + FWCC GTG+
Sbjct: 181 KTGMMTYFQPMAAGYRKV-----FNRPYDEFWCCTGTGI 214
>gi|82523843|emb|CAI78585.1| hypothetical protein [uncultured candidate division OP8 bacterium]
Length = 766
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 167/385 (43%), Gaps = 72/385 (18%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYE--GWE 158
L V L+ +L + + L L ++ D+ +++F+ G P A + GW+
Sbjct: 378 LLGRVVLNRDAAGRETLFMKNRDKFLSTLAEVNPDNFLYNFRDAFGLPQPEGAVQLGGWD 437
Query: 159 DPTCELRGHFVGHYLSASAHMWA-STHNVTLK----EKMTAVVSALSECQNKMGS----- 208
D T LRGH GHYLSA A +A S ++ L+ +KM ++ L + K G
Sbjct: 438 DQTTRLRGHASGHYLSALAQAYAGSVYDSALQANFLQKMNYMIDTLYDLAQKSGRPVESG 497
Query: 209 -------------------------------------GYLSAFPSEQFDRFE-------A 224
G++SA+P +QF E
Sbjct: 498 GLCNPDPTTVPSGPGKSGYDSDLSQKGLRHDYWNWGVGFISAYPPDQFIMLEQGATYGGT 557
Query: 225 LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNS 284
+WAPYYT+HKILAGLLD Y N +AL++ + M + R+Q V +
Sbjct: 558 NAQIWAPYYTLHKILAGLLDCYEVGGNPKALQIAEGMGGWALKRLQAVPEATRIAMWSRY 617
Query: 285 LNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFL-------GLLAVQADDISGFHAN 337
+ E GGMN+V+ RL+ +T L A LFD F LA D + G HAN
Sbjct: 618 IAGEYGGMNEVMARLFRLTGKRDFLACAKLFDNTNFFFGNAGREHGLAKNVDTVRGRHAN 677
Query: 338 THIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGE-------FWSDPK 390
HIP +IG+ Y +G+P+Y F +I + Y GG + F ++P
Sbjct: 678 QHIPQIIGTLETYRGSGEPVYHEIAENFWEIARNHYMYNIGGVGGAKNPRNAECFTAEPD 737
Query: 391 -RLASTLGTENE-ESCTTYNMLKVS 413
+ A+ + + E+C TYN+LK +
Sbjct: 738 TQFANGFSMDGQNETCATYNLLKCA 762
>gi|225351247|ref|ZP_03742270.1| hypothetical protein BIFPSEUDO_02839 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
gi|225158703|gb|EEG71945.1| hypothetical protein BIFPSEUDO_02839 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
Length = 853
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 122/415 (29%), Positives = 182/415 (43%), Gaps = 46/415 (11%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA------- 153
L+ V L V+L P H+ AQQ YLL LDVD L++ F++ AG P A
Sbjct: 5 ILERVPLQQVRLLPGE-HFDAQQAGARYLLDLDVDRLLYPFRREAGLPQPTDADGNPVTS 63
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVT-LKEKMTAVVSALSECQNKMGS---- 208
Y WE+ L GH GHYLSA + ++ VV + ECQ
Sbjct: 64 YPNWEE--TGLDGHIAGHYLSACVGFAQVADDPQPFIDRAATVVRSWHECQQSFAGDAVM 121
Query: 209 -GYLSAFPSEQ--FDRFEA---------LKPVWAPYYTIHKILAGLLDQYTFAD----NT 252
GY+ P + F R A + W P Y +HK AGLLD T+AD +
Sbjct: 122 RGYVGGVPDSRTVFGRLAAGDVESQNFSMNDAWVPMYNVHKTFAGLLD--TWADFASIDE 179
Query: 253 QALKMTKWMVEYFYN---RVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHL 309
Q ++ + +V + R+ + + +R L E GGM + LY T + ++
Sbjct: 180 QTSQLARTVVLDLADWWCRIAEPLDDETFDR---ILVSEFGGMCESFAELYARTGEERYH 236
Query: 310 LLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIV 369
++A F LA D ++G HANT IP V+G + + D F D V
Sbjct: 237 VMADRFKDHAIFDPLAQGEDVLTGMHANTQIPKVLGWERLGAICNDEQADAATNTFWDSV 296
Query: 370 NASHGYATGGTSAGEFWSDPKRLASTLGT-ENEESCTTYNMLKVSRHLFRWTKEMVYADY 428
+ G S E + +S + + E E+C +YNM K++ L+ + Y ++
Sbjct: 297 VHHRSVSIGAHSVSEHFHPTDDFSSMIESREGPETCNSYNMSKLAERLWLRSGSADYINF 356
Query: 429 YERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
YER L N +LS +PG +Y P+ +++ Y + T FWCC G+GL
Sbjct: 357 YERVLENHLLSTINPKQPG-FVYFTPM-----RSQHYRAYSTPQECFWCCVGSGL 405
>gi|261879318|ref|ZP_06005745.1| acetyl-CoA carboxylase [Prevotella bergensis DSM 17361]
gi|270334148|gb|EFA44934.1| acetyl-CoA carboxylase [Prevotella bergensis DSM 17361]
Length = 839
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 123/424 (29%), Positives = 181/424 (42%), Gaps = 71/424 (16%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA-------- 153
L EV+L D L A N++ L+ DVD L+ F + AG T A
Sbjct: 34 LDEVTLLDSPLKT------AMDLNIKMLMQYDVDRLLTPFIRQAGLHTGRYADWQSRHPN 87
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNVT----LKEKMTAVVSALSECQN----- 204
+ W +L GH GHY+SA A +A+ H+ +KE++ ++ L +CQ+
Sbjct: 88 FMNWGGNNFDLSGHVGGHYVSALAMAYAACHDTATKARIKERLDYMIDVLKDCQDAYDTN 147
Query: 205 ------------------KMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQY 246
KM +G +S+F + W P+Y HK+LAGL D Y
Sbjct: 148 TEGLYGFIGGQPINDMWKKMYAGDISSFRQHRG---------WVPFYCQHKVLAGLRDAY 198
Query: 247 TFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDP 306
+ NT A + + + ++ N V N+ S L+ E GGMN+ L YT+ D
Sbjct: 199 LYTGNTTARDLFRKLADWSVNLVSNL----SDATMQTVLDTEHGGMNETLADAYTLFGDS 254
Query: 307 KHLLLAHLFDKPCFL-GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTG--- 362
K+L A + L G+ + HANT +P IG + E DP
Sbjct: 255 KYLAAARKYSHQTMLNGMQTPNPTFLDNRHANTQVPKYIGFERVAEE--DPTATTYATAA 312
Query: 363 TFFMDIVNASHGYATGGTSAGEFW---SDPKRLASTLGTENEESCTTYNMLKVSRHLFRW 419
+ F D V + GG S GE + + R L + ESC T NM+K+S +
Sbjct: 313 SNFWDDVAQNRTVCIGGNSVGEHFLSVGNSNRYIDHL--DGPESCNTNNMMKLSEMMADR 370
Query: 420 TKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCY 479
T + YAD+YE A+ N +LS Q T G +Y L + + Y + WCC
Sbjct: 371 THDARYADFYEYAMYNHILSTQDPTTGGY-VYFTTL-----RPQGYRIYSKVNEGMWCCV 424
Query: 480 GTGL 483
GTG+
Sbjct: 425 GTGM 428
>gi|300726603|ref|ZP_07060044.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Prevotella
bryantii B14]
gi|299776135|gb|EFI72704.1| acetyl-CoA carboxylase, biotin carboxylase subunit [Prevotella
bryantii B14]
Length = 832
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 120/408 (29%), Positives = 178/408 (43%), Gaps = 44/408 (10%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA--------YEGWE 158
L DV+L + A + N LL DVD L+ F + AG A ++ W
Sbjct: 25 LQDVQLLDGPMK-SAMEINFNTLLAYDVDRLLTPFIRQAGLHEGRYADWQKKHPNFKNWG 83
Query: 159 DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTA----VVSALSECQNKMGS------ 208
+L GH GHYLSA A +A+ + KE++ + ++ L +CQN
Sbjct: 84 GDGFDLSGHIGGHYLSALAMAYAACQDAATKERLQSRLLYMIDVLKDCQNSFDQNTTGLY 143
Query: 209 GYLSAFP-SEQFDRFEA-------LKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKW 260
G++ P +E +++ W P+Y HK++AGL D Y +A N A M K
Sbjct: 144 GFIGGQPINEDWEKLYQGDISGIWQHRGWVPFYCEHKVMAGLRDAYLYAHNQDAKLMLKK 203
Query: 261 MVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCF 320
M ++ +I K S L E GG+N+ + Y I +D ++L A + +
Sbjct: 204 MADW----CTQLIAKVSDADMQKMLTIEHGGINESMADCYAIFKDTRYLEAAKKYSQREM 259
Query: 321 L-GLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPL-YKVTGTFFMDIVNASHGYATG 378
L GL ++ A + HANT +P IG + E L Y + F V G
Sbjct: 260 LEGLQSLNATFLDNRHANTQVPKYIGFERIVEEDPAALQYATAASNFWQDVAHHRTVCIG 319
Query: 379 GTSAGEFW---SDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTN 435
G S E + ++ R L E ESC T NMLK+S L T + YAD+YE A+ N
Sbjct: 320 GNSISEHFLSKTNSNRYIDNL--EGPESCNTNNMLKLSEMLSDRTHDAGYADFYEYAMWN 377
Query: 436 GVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+LS Q + G +Y L + + Y + WCC GTG+
Sbjct: 378 HILSTQ-DPQTGGYVYFTTL-----RPQGYRIYSVPNQGMWCCVGTGM 419
>gi|396489945|ref|XP_003843216.1| hypothetical protein LEMA_P073260.1 [Leptosphaeria maculans JN3]
gi|312219795|emb|CBX99737.1| hypothetical protein LEMA_P073260.1 [Leptosphaeria maculans JN3]
Length = 748
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 123/431 (28%), Positives = 177/431 (41%), Gaps = 67/431 (15%)
Query: 98 AGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSP--TAGKAYE 155
A ++ L+ V L L + Q +++ D + F K AG T
Sbjct: 42 ATALVRPFRLNQVHLGEGLLQEKRDQIK-DFVRTYDERRFLVLFNKVAGRANITNLSPPG 100
Query: 156 GWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGS------- 208
GWED L GH+ GHY+SA + + KEK+ +V+ L+ CQ
Sbjct: 101 GWEDGGL-LSGHWTGHYMSALSQAYIDKGESIFKEKLDWMVAELAACQEAYTEYKQPTHL 159
Query: 209 GYLSAFPSEQFDRFEALK----------PVWAPYYTIHKILAGLLDQYTFADNTQALKMT 258
GYL A P + R + WA +YT HKI+ GLLD Y A+NTQAL +
Sbjct: 160 GYLGALPEDTVLRLGPPRFAVYGSNISTDTWAGWYTQHKIMRGLLDAYYNANNTQALDIV 219
Query: 259 KWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKP 318
M ++ + + + + E GG N+V +Y +T + KHL A FD
Sbjct: 220 IKMADWAHLALTDTY-----------IAGEFGGANEVFPEIYALTGEEKHLQTAKAFDNR 268
Query: 319 CFLGLLAVQADDI--------------SGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTF 364
L AV DI HANTH+P IG YE TG Y +
Sbjct: 269 ESLFSAAVSDQDILVMTPERKPGRRRRERLHANTHVPQFIGYLRIYEHTGSNEYLLAAKN 328
Query: 365 FMDIVNASHGYATGGTSAG--------EFWSDPKRLASTLGTENEESCTTYNMLKVSRHL 416
F V +A+G T E + + +A+++ E E+C TYN L ++R+L
Sbjct: 329 FFGWVVPHREFASGSTGGNVPGFSANPELFQNRDNIANSIADEGAETCITYNTLNLARNL 388
Query: 417 FRWTKEMVYADYYERALTNGV----LSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRF 472
F Y D+ ER L N + + ++P + Y PL G + Y GT
Sbjct: 389 FLDEHNATYMDHCERGLFNMIAGSRVDTSNNSDP-QLTYFQPLSPG--FGREYGNTGT-- 443
Query: 473 SSFWCCYGTGL 483
CC GTG+
Sbjct: 444 ----CCGGTGM 450
>gi|256831608|ref|YP_003160335.1| hypothetical protein Jden_0363 [Jonesia denitrificans DSM 20603]
gi|256685139|gb|ACV08032.1| protein of unknown function DUF1680 [Jonesia denitrificans DSM
20603]
Length = 744
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 109/384 (28%), Positives = 165/384 (42%), Gaps = 44/384 (11%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWA 181
+ T L+Y L LD LV +++ +G P +Y WE+ L GH +GH LSA A+ +
Sbjct: 20 RNTALDYTLALDPQRLVAPYRRESGLPLLAPSYGNWEN--SGLDGHTLGHVLSALAYA-S 76
Query: 182 STH---NVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF------------DRFEALK 226
TH + +E++ +V+ + ECQ +G+GY+ P + D F L
Sbjct: 77 VTHTPRSAEARERLEWLVAQVQECQAAVGTGYVGGIPQGRALWERIGNGDVDADSF-GLH 135
Query: 227 PVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLN 286
W P+Y +HK+ AGL+D A + + + +V N V + E+ L
Sbjct: 136 GAWVPWYNLHKVFAGLVD----AGWVAGVAVARDVVVGLANWWLRVAARLRDEQFQAMLV 191
Query: 287 EETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGS 346
E G +N L T D ++L +A F L D + G HANT I +G
Sbjct: 192 TEFGAINGAFADLAVHTGDARYLEMAKRFTDRALFDALVAGEDPLVGLHANTQIAKALGW 251
Query: 347 QMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS-----AGEFWSDPKRLASTLGTENE 401
G Y V D+V H + GG S AG+ W A + +
Sbjct: 252 ARVALAGGGREYLVAARRVWDVVVRDHTLSFGGNSVREHCAGDPW------APFVSEQGP 305
Query: 402 ESCTTYNMLKVSRHLFRWTKE-MVYADYYERALTNGVLSIQRGTEP-GVMIYMLPLGRGD 459
ESC T+NML+++ L + D+ E AL N V+S P G +Y P
Sbjct: 306 ESCNTHNMLRLTGALLELGESPRPLVDFVEVALMNHVVS---SVHPEGGFVYFTP----- 357
Query: 460 SKAKSYHGWGTRFSSFWCCYGTGL 483
++ + Y + FWCC GTG+
Sbjct: 358 ARPQHYRVYSQVHECFWCCVGTGM 381
>gi|330467692|ref|YP_004405435.1| glycosylase [Verrucosispora maris AB-18-032]
gi|328810663|gb|AEB44835.1| glycosylase [Verrucosispora maris AB-18-032]
Length = 1126
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 141/322 (43%), Gaps = 56/322 (17%)
Query: 209 GYLSAFPSEQFDRF----------EALKPVWAPYYTIHKILAGLLDQYTFADNTQAL--- 255
GYL A P + R A WAP+YT HKI+ GLLD Y DN AL
Sbjct: 416 GYLGAIPEDAVLRLGPPRWAVYGSNATTNTWAPWYTQHKIMRGLLDAYYHTDNATALDVV 475
Query: 256 -KMTKW------MVEYFYNRVQNVITKYSVERHWN-SLNEETGGMNDVLYRLYTITQDPK 307
KM W + + + IT+ ++ W+ + ETGG N+V +Y +T D K
Sbjct: 476 VKMAGWAHLALTIGDKNHPAYTGPITRDNLNYMWDLYIAGETGGANEVFPEIYALTGDQK 535
Query: 308 HLLLAHLFDKPCFLGLLAVQADDI--------------SGFHANTHIPVVIGSQMRYEVT 353
HL A LFD L V+ DI HAN+H+P +G YE +
Sbjct: 536 HLETAKLFDNRESLFDACVENRDILVVTPQNNPGRRRPDRLHANSHVPQFVGYLRVYEHS 595
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAG--------EFWSDPKRLASTLGTENEESCT 405
GD Y F +V YA GGT E + + +A+++ E+CT
Sbjct: 596 GDTEYFQAAKNFYGMVVPHRMYANGGTGGNYPGSNNNIELFQNRGNIANSIAQGGAETCT 655
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT----EPGVMIYMLPLGRGDSK 461
TYN+LK++R+LF + Y DYYER L N + + T P V Y PL G ++
Sbjct: 656 TYNLLKLARNLFFHEHDAAYLDYYERGLINQIAGSRADTTTVSNPQVT-YFQPLTPGANR 714
Query: 462 AKSYHGWGTRFSSFWCCYGTGL 483
Y GT CC GTG+
Sbjct: 715 G--YGNTGT------CCGGTGV 728
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 48/114 (42%), Gaps = 8/114 (7%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAG--KAYEGWED 159
++ L DV L L + YL LD + F AG P A GWED
Sbjct: 62 VRPFRLRDVTLG-DGLFQEKRDRMKNYLRQLDERRFLVLFNNQAGRPNPAGVTAPGGWED 120
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQN----KMGSG 209
L GH+ GH ++A A +A K K+ +V L+ CQ +MGSG
Sbjct: 121 GGL-LSGHWAGHVMTALAQGYADHGEPIFKSKLDWIVDELAACQTAITARMGSG 173
>gi|310794204|gb|EFQ29665.1| hypothetical protein GLRG_04809 [Glomerella graminicola M1.001]
Length = 436
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/345 (29%), Positives = 151/345 (43%), Gaps = 47/345 (13%)
Query: 122 QQTNLEYLLMLDVDSLVWSFQKTAGSPTA-GKAYEGWEDPTCELRGHFVGHYLSASAHMW 180
Q L YL +DVD L++ F+K G T + GW+ P R H GH+L+A A +
Sbjct: 59 QARTLVYLKWIDVDRLLYVFRKNHGLYTNNAQPNAGWDAPDFPFRSHVQGHFLNAWAFCY 118
Query: 181 ASTHNVTLKEKMTAVVSALSECQ-NKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKIL 239
A + K + T + L +CQ N S + PYY IHK +
Sbjct: 119 AQLQDSECKRRATYFAAELKKCQHNNTNSRNV-------------------PYYAIHKTM 159
Query: 240 AGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRL 299
AGLLD + +T A + M + R K + ++ + + GGMN+VL L
Sbjct: 160 AGLLDVWRLIGDTNARDVLLAMAAWVDLRT----GKLTYQQMQDMMGTVFGGMNEVLADL 215
Query: 300 YTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYK 359
T D + + +A FD LA D +SG HANT ++ +
Sbjct: 216 CRQTGDQRWVTVAQRFDHAAIFNPLASNQDSLSGLHANTQ-----------DIARNA--- 261
Query: 360 VTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRW 419
+I ++H YA GG S E + P +A L ++ E+C TYNMLK++ L+
Sbjct: 262 ------WNITVSAHSYAIGGNSQAEHFRLPNAIAGFLTSDTCEACNTYNMLKLTGELWLT 315
Query: 420 TKE-MVYADYYERALTNGVLSIQRGTEP-GVMIYMLPLGRGDSKA 462
+ Y D+YERAL N +L Q + G + Y PL G +
Sbjct: 316 NPDTTTYFDFYERALLNHLLGQQDPSNSHGHVTYFTPLNPGGRRG 360
>gi|389638620|ref|XP_003716943.1| acetyl-CoA carboxylase [Magnaporthe oryzae 70-15]
gi|351642762|gb|EHA50624.1| acetyl-CoA carboxylase [Magnaporthe oryzae 70-15]
Length = 1018
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/322 (30%), Positives = 138/322 (42%), Gaps = 55/322 (17%)
Query: 209 GYLSAFPSEQFDRFEALK----------PVWAPYYTIHKILAGLLDQYTFADNTQALKMT 258
GYL A P + R + WAP+YT HKI+ GLLD Y +N+QAL++
Sbjct: 390 GYLGALPEDTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQALQVV 449
Query: 259 KWMVEYFYNRV----------QNVITKYSVERHWN-SLNEETGGMNDVLYRLYTITQDPK 307
M ++ + + + +T+ + W+ + E GG N+V +Y +T DPK
Sbjct: 450 TRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTGDPK 509
Query: 308 HLLLAHLFDKPCFLGLLAVQADDI--------------SGFHANTHIPVVIGSQMRYEVT 353
HL A FD L AV DDI HANTH+P IG +E
Sbjct: 510 HLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIFEQG 569
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAG--------EFWSDPKRLASTLGTENEESCT 405
G Y F V +A+GGT E + + +A+ +G E+CT
Sbjct: 570 GGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAETCT 629
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGV----MIYMLPLGRGDSK 461
YNMLK++R+LF Y D YER L N + + T + Y PL G +
Sbjct: 630 AYNMLKLARNLFLHNHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPGSN- 688
Query: 462 AKSYHGWGTRFSSFWCCYGTGL 483
+ Y GT CC GTGL
Sbjct: 689 -RDYGNTGT------CCGGTGL 703
>gi|440483441|gb|ELQ63839.1| acetyl-CoA carboxylase [Magnaporthe oryzae P131]
Length = 1055
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/322 (30%), Positives = 138/322 (42%), Gaps = 55/322 (17%)
Query: 209 GYLSAFPSEQFDRFEALK----------PVWAPYYTIHKILAGLLDQYTFADNTQALKMT 258
GYL A P + R + WAP+YT HKI+ GLLD Y +N+QAL++
Sbjct: 427 GYLGALPEDTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQALQVV 486
Query: 259 KWMVEYFYNRV----------QNVITKYSVERHWN-SLNEETGGMNDVLYRLYTITQDPK 307
M ++ + + + +T+ + W+ + E GG N+V +Y +T DPK
Sbjct: 487 TRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTGDPK 546
Query: 308 HLLLAHLFDKPCFLGLLAVQADDI--------------SGFHANTHIPVVIGSQMRYEVT 353
HL A FD L AV DDI HANTH+P IG +E
Sbjct: 547 HLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIFEQG 606
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAG--------EFWSDPKRLASTLGTENEESCT 405
G Y F V +A+GGT E + + +A+ +G E+CT
Sbjct: 607 GGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAETCT 666
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGV----MIYMLPLGRGDSK 461
YNMLK++R+LF Y D YER L N + + T + Y PL G +
Sbjct: 667 AYNMLKLARNLFLHDHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPGSN- 725
Query: 462 AKSYHGWGTRFSSFWCCYGTGL 483
+ Y GT CC GTGL
Sbjct: 726 -RDYGNTGT------CCGGTGL 740
>gi|440466410|gb|ELQ35678.1| acetyl-CoA carboxylase [Magnaporthe oryzae Y34]
Length = 1055
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/322 (30%), Positives = 138/322 (42%), Gaps = 55/322 (17%)
Query: 209 GYLSAFPSEQFDRFEALK----------PVWAPYYTIHKILAGLLDQYTFADNTQALKMT 258
GYL A P + R + WAP+YT HKI+ GLLD Y +N+QAL++
Sbjct: 427 GYLGALPEDTVLRLGPPRWAVYGGNQQTNTWAPWYTQHKIMRGLLDAYYNTNNSQALQVV 486
Query: 259 KWMVEYFYNRV----------QNVITKYSVERHWN-SLNEETGGMNDVLYRLYTITQDPK 307
M ++ + + + +T+ + W+ + E GG N+V +Y +T DPK
Sbjct: 487 TRMADWAHLALSIGDKNHADYKGNLTRDDLNYMWDLYIAGEFGGANEVFPEIYRLTGDPK 546
Query: 308 HLLLAHLFDKPCFLGLLAVQADDI--------------SGFHANTHIPVVIGSQMRYEVT 353
HL A FD L AV DDI HANTH+P IG +E
Sbjct: 547 HLETAKAFDNRESLFDAAVNDDDILVVRPQDRPGRRRPERLHANTHVPQFIGYMRIFEQG 606
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAG--------EFWSDPKRLASTLGTENEESCT 405
G Y F V +A+GGT E + + +A+ +G E+CT
Sbjct: 607 GGQEYFDAAKNFYGWVVPHREFASGGTGGNYPGSNDNPELFQNRGNIANAMGGNGAETCT 666
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGV----MIYMLPLGRGDSK 461
YNMLK++R+LF Y D YER L N + + T + Y PL G +
Sbjct: 667 AYNMLKLARNLFLHDHNATYMDTYERGLFNMIPGSRADTAGSAGDPQLTYFQPLTPGSN- 725
Query: 462 AKSYHGWGTRFSSFWCCYGTGL 483
+ Y GT CC GTGL
Sbjct: 726 -RDYGNTGT------CCGGTGL 740
>gi|402081502|gb|EJT76647.1| acetyl-CoA carboxylase [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1032
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 140/322 (43%), Gaps = 56/322 (17%)
Query: 209 GYLSAFPSEQFDRF----------EALKPVWAPYYTIHKILAGLLDQYTFADNTQAL--- 255
GYL A P + R +A WAP+YT HKI+ GLLD Y +NTQAL
Sbjct: 404 GYLGALPEDTVLRLGPPRWAIYGGDAATNTWAPWYTQHKIMRGLLDAYYNTNNTQALDVV 463
Query: 256 -KMTKW------MVEYFYNRVQNVITKYSVERHWN-SLNEETGGMNDVLYRLYTITQDPK 307
KM W + + Y +T+ + R W+ + E+GG N+V LY +T D +
Sbjct: 464 VKMADWAHLALTIGDKNYPGYTGNLTRDDLNRMWDLYIAGESGGANEVFPELYELTGDSR 523
Query: 308 HLLLAHLFDKPCFLGLLAVQADDI--------------SGFHANTHIPVVIGSQMRYEVT 353
HL A FD L AV+ DI HAN H+P IG +E +
Sbjct: 524 HLETAKAFDNRASLFDAAVEDRDILVLTRDKNPGPRRTDRLHANMHVPQFIGYLRIFEQS 583
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSA--------GEFWSDPKRLASTLGTENEESCT 405
+ Y F V +A+GGT E + + +A+ + E+CT
Sbjct: 584 REQDYLDAARNFYSWVFPHRQFASGGTGGNYPGSNNNAEMFQNRGNIANAIAENGAETCT 643
Query: 406 TYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT----EPGVMIYMLPLGRGDSK 461
TYNMLK++R+LF Y D YER L N + + T +P + Y PL G S
Sbjct: 644 TYNMLKLARNLFMHEHNATYMDGYERGLFNMIAGSRADTATTADP-QLTYFQPLTPGAS- 701
Query: 462 AKSYHGWGTRFSSFWCCYGTGL 483
+ Y GT CC G+GL
Sbjct: 702 -RDYGNTGT------CCGGSGL 716
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 4/107 (3%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAY--EGWED 159
++ L V+L L + +T ++L D + F K AG P+AG GWED
Sbjct: 45 VRPFRLDQVRLGDGLLQEKRDRTK-DFLREFDERRFLVLFNKQAGRPSAGGVAVPGGWED 103
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKM 206
L GH+ GHY++A + +A K K+ +V L+ CQ +
Sbjct: 104 GGL-LSGHWAGHYMTALSQAYADQGEEVFKAKLDWMVQELAACQKAI 149
>gi|433651701|ref|YP_007278080.1| hypothetical protein Prede_0695 [Prevotella dentalis DSM 3688]
gi|433302234|gb|AGB28050.1| hypothetical protein Prede_0695 [Prevotella dentalis DSM 3688]
Length = 1032
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 118/417 (28%), Positives = 182/417 (43%), Gaps = 55/417 (13%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA-------- 153
L EV+L D A + N + LL D D L+ F + AG T A
Sbjct: 27 LSEVTLFDSPFKT------AMELNFKVLLDYDADRLLAPFVRQAGLNTGDYAGWQTLHPN 80
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNV----TLKEKMTAVVSALSECQNKMGS- 208
+ W +L GH GHYLSA A +A+ + LK+++ ++ L +CQ+
Sbjct: 81 FANWGGNGFDLSGHVGGHYLSALALAYAACRDAGMKARLKQRLEYMLKVLKDCQDAYDGN 140
Query: 209 -----GYLSAFPSEQ---------FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA 254
G++ P + F +++ W P+Y HK+LAGL D Y +A N +A
Sbjct: 141 TEGLRGFIGGQPINEAWKKLYAGDVSGFRSVRG-WVPFYCQHKVLAGLRDAYVYAGNKEA 199
Query: 255 LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHL 314
+M + + ++ NV+ + + L+ E GGMN+ L YT+ D K++ A
Sbjct: 200 REMFRKLADWSV----NVVARLDNAAMQSVLDTEHGGMNESLADAYTLFGDQKYMDAAQK 255
Query: 315 FDKPCFLGLLAVQ-ADDISGFHANTHIPVVIGSQMRYEVTGDPLYK----VTGTFFMDIV 369
+ L + +Q A + HANT +P IG + E G L K G F+ D+
Sbjct: 256 YSHQTMLNGMQMQNATFLDNRHANTQVPKYIGFERIGEQGGSELQKKYELAAGNFWNDVA 315
Query: 370 NASHGYATGGTSAGEFW---SDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYA 426
+ GG S E + ++ R L + ESC + NMLK+S L T + YA
Sbjct: 316 -LNRTVCIGGNSVAEHFLSAANSHRYIDHL--DGPESCNSNNMLKLSEMLSDNTHDARYA 372
Query: 427 DYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
D+YE N +LS Q + G +Y L + + Y + WCC GTG+
Sbjct: 373 DFYEYTTWNHILSTQD-PKTGGYVYFTTL-----RPQGYRIYSQVNQGMWCCVGTGM 423
>gi|340345934|ref|ZP_08669064.1| acetyl-CoA carboxylase [Prevotella dentalis DSM 3688]
gi|339612921|gb|EGQ17717.1| acetyl-CoA carboxylase [Prevotella dentalis DSM 3688]
Length = 1039
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 118/417 (28%), Positives = 182/417 (43%), Gaps = 55/417 (13%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA-------- 153
L EV+L D A + N + LL D D L+ F + AG T A
Sbjct: 34 LSEVTLFDSPFKT------AMELNFKVLLDYDADRLLAPFVRQAGLNTGDYAGWQTLHPN 87
Query: 154 YEGWEDPTCELRGHFVGHYLSASAHMWASTHNV----TLKEKMTAVVSALSECQNKMGS- 208
+ W +L GH GHYLSA A +A+ + LK+++ ++ L +CQ+
Sbjct: 88 FANWGGNGFDLSGHVGGHYLSALALAYAACRDAGMKARLKQRLEYMLKVLKDCQDAYDGN 147
Query: 209 -----GYLSAFPSEQ---------FDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA 254
G++ P + F +++ W P+Y HK+LAGL D Y +A N +A
Sbjct: 148 TEGLRGFIGGQPINEAWKKLYAGDVSGFRSVRG-WVPFYCQHKVLAGLRDAYVYAGNKEA 206
Query: 255 LKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHL 314
+M + + ++ NV+ + + L+ E GGMN+ L YT+ D K++ A
Sbjct: 207 REMFRKLADWSV----NVVARLDNAAMQSVLDTEHGGMNESLADAYTLFGDQKYMDAAQK 262
Query: 315 FDKPCFLGLLAVQ-ADDISGFHANTHIPVVIGSQMRYEVTGDPLYK----VTGTFFMDIV 369
+ L + +Q A + HANT +P IG + E G L K G F+ D+
Sbjct: 263 YSHQTMLNGMQMQNATFLDNRHANTQVPKYIGFERIGEQGGSELQKKYELAAGNFWNDVA 322
Query: 370 NASHGYATGGTSAGEFW---SDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYA 426
+ GG S E + ++ R L + ESC + NMLK+S L T + YA
Sbjct: 323 -LNRTVCIGGNSVAEHFLSAANSHRYIDHL--DGPESCNSNNMLKLSEMLSDNTHDARYA 379
Query: 427 DYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
D+YE N +LS Q + G +Y L + + Y + WCC GTG+
Sbjct: 380 DFYEYTTWNHILSTQD-PKTGGYVYFTTL-----RPQGYRIYSQVNQGMWCCVGTGM 430
>gi|255624614|ref|XP_002540501.1| hypothetical protein RCOM_2107350 [Ricinus communis]
gi|223495313|gb|EEF21882.1| hypothetical protein RCOM_2107350 [Ricinus communis]
Length = 208
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 102/212 (48%), Gaps = 25/212 (11%)
Query: 170 GHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE------------ 217
GHYLSA A M A+T + ++E++ VV+ L CQ G+GY+ P
Sbjct: 3 GHYLSALAMMVAATGDEQVRERLDYVVAELKRCQAANGNGYIGGVPGGAAAWRDIAQGKL 62
Query: 218 QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQA----LKMTKWMVEYFYNRVQNVI 273
D F ++ W P+Y +HK AGL D YT+A N A + + W +E +
Sbjct: 63 HADNF-SVNGKWVPWYNLHKTFAGLRDAYTYAGNQDAHAMLIALCDWTLE--------LT 113
Query: 274 TKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG 333
+ S E+ + + E GGMN+VL + +T K++ LA F L L D ++G
Sbjct: 114 SHLSDEQMQSMMRAEHGGMNEVLADVAQMTGQQKYMDLAIRFSHQALLRPLEEGKDQLTG 173
Query: 334 FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFF 365
HANT IP VIG + ++T ++ FF
Sbjct: 174 LHANTQIPKVIGFKRIGDITSRDDWQRAAAFF 205
>gi|256375993|ref|YP_003099653.1| hypothetical protein Amir_1859 [Actinosynnema mirum DSM 43827]
gi|255920296|gb|ACU35807.1| protein of unknown function DUF1680 [Actinosynnema mirum DSM 43827]
Length = 736
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 49/123 (39%), Positives = 62/123 (50%)
Query: 295 VLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTG 354
L L T P+HL A +FD + A D ++G HAN HIP+ G E TG
Sbjct: 278 ALRDLRARTGKPEHLAPARMFDLDALIDACAENRDVLAGLHANQHIPIFTGLVRLREATG 337
Query: 355 DPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSR 414
+ Y F D+V Y GGTS GEFW P +A TL +N E+C +NMLK+ R
Sbjct: 338 EQRYLDAARNFWDMVVPRRLYRIGGTSTGEFWRAPGVIAETLADDNAETCCAHNMLKLGR 397
Query: 415 HLF 417
LF
Sbjct: 398 ALF 400
>gi|94967195|ref|YP_589243.1| hypothetical protein Acid345_0164 [Candidatus Koribacter versatilis
Ellin345]
gi|94549245|gb|ABF39169.1| conserved hypothetical protein [Candidatus Koribacter versatilis
Ellin345]
Length = 602
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 104/406 (25%), Positives = 170/406 (41%), Gaps = 42/406 (10%)
Query: 94 GFKLAGDFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKA 153
F ++ L E DV L+ S LH R Q + L+ L+ D+L+ F+ G P G+
Sbjct: 29 AFAISSVPLDEFGYGDVSLE-SELHNRQFQNTHDVLMGLEDDALLKPFRAMVGQPPPGRD 87
Query: 154 YEGWE--DPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYL 211
GW DP VG +A+ W S + + + V N++ + +
Sbjct: 88 LGGWYCFDPNYNPNDVGVGFAPTATFGQWISALSRSYALRPDPAVRDKVIRLNRLYAQTI 147
Query: 212 SAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQN 271
S F LK + P Y K++ GL+D + + + ALK+ +E +
Sbjct: 148 SP-------EFYGLKNRF-PAYCYDKLVCGLIDAHQYVGDPDALKI----LERTTDTATP 195
Query: 272 VITKYSVERH--WNSLNE------ETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGL 323
++ ++VE W S+ + E+ +++ L+ Y ++ L + +
Sbjct: 196 LLPGHAVEHGTVWRSVKDDGYTWDESYTISENLFLAYRRGAGDRYRALGKQYLDDTYYNP 255
Query: 324 LAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
LA D+ G HA +H+ + + Y GD Y D V A YATGG A
Sbjct: 256 LAEGRSDLEGRHAYSHVNSLCSAMQAYLTLGDEKYFRAAKNGFDFVLA-QSYATGGWGAD 314
Query: 384 EFW---SDPKRLASTLGTEN--EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVL 438
E + P+ S GT + E C +Y K++R+L R T++ Y D ER + N +L
Sbjct: 315 ETLRAPNSPEVAKSLTGTHHSFETPCGSYAHFKLTRYLLRVTRDSRYGDSMERVMYNTIL 374
Query: 439 SIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRF--SSFW-CCYGT 481
G P ++P GR Y+ G++F + W CC GT
Sbjct: 375 ----GALP-----LMPDGR-TFYYSDYNFKGSKFYHDARWPCCSGT 410
>gi|427409221|ref|ZP_18899423.1| hypothetical protein HMPREF9718_01897 [Sphingobium yanoikuyae ATCC
51230]
gi|425711354|gb|EKU74369.1| hypothetical protein HMPREF9718_01897 [Sphingobium yanoikuyae ATCC
51230]
Length = 616
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/366 (26%), Positives = 156/366 (42%), Gaps = 43/366 (11%)
Query: 130 LMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLK 189
L LD D ++ F++ AG P G GW D + G G Y+S A + A+T + +
Sbjct: 84 LALDNDRVLKVFRQQAGLPAPGPDMGGWYDRDGFVPGLAFGQYMSGLARIGATTGDKAVH 143
Query: 190 EKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFA 249
K+ A+V E K + Y +Q WA YT+ K + GL+D Y +
Sbjct: 144 AKVAALVQGFGEFITKTRNPYAGPKAQDQ----------WAA-YTMDKYVVGLIDAYRLS 192
Query: 250 DNTQALKMTKWMVEYFYNRVQNVITKYSVER--HWNSLNEETGGMNDVLYRLYTITQDPK 307
QA + +E + + I+ S +R + +ET +++ L+ + IT K
Sbjct: 193 GVEQAKTLLPITIE----KCRPYISPVSRDRIGKVDPPYDETYVLSENLFHVADITGQDK 248
Query: 308 HLLLA--HLFDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFF 365
+ +A +L +K F L A Q D + HA +H + Y GD Y+
Sbjct: 249 YRQMAIHYLLNKEWFDPLAAGQ-DVLPTKHAYSHTIALSSGAQAYLHLGDEKYRKA---- 303
Query: 366 MDIVNA-----SHGYATGGTSAGEFWSD--PKRLASTLGTEN---EESCTTYNMLKVSRH 415
+VNA +A+GG E + + +LA++L + E C ++ +K++R+
Sbjct: 304 --LVNAWTYMEPQRFASGGWGPEEQFVELHQGKLAASLKSSKAHFETPCGSFADMKLARY 361
Query: 416 LFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSF 475
L R+T E VY D ER L N +L+ + G Y G K + W
Sbjct: 362 LVRFTGEPVYGDGLERTLYNTMLATRLPDSDGGYPYYSNYGAAAEKLYYHQKWP------ 415
Query: 476 WCCYGT 481
CC GT
Sbjct: 416 -CCSGT 420
>gi|284043399|ref|YP_003393739.1| hypothetical protein Cwoe_1938 [Conexibacter woesei DSM 14684]
gi|283947620|gb|ADB50364.1| protein of unknown function DUF1680 [Conexibacter woesei DSM 14684]
Length = 711
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 92/376 (24%), Positives = 157/376 (41%), Gaps = 72/376 (19%)
Query: 135 DSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHF--VGHYLSASAHMWASTHNVTLKEKM 192
D+L++ F+ GS G GW G F +G + + A ++A+T EK
Sbjct: 47 DALLYPFRIRKGSWAPGIPLRGWYG-----EGLFNNLGQFFTLYARLYAATGEHRFAEKA 101
Query: 193 TAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNT 252
A++ E + G G+LS+ + + Y+ K++ GLLD + + +
Sbjct: 102 LALLDGWEETIEEDG-GFLSSHFAGTVE------------YSYDKLVCGLLDLHEYVGSE 148
Query: 253 QAL----KMTKWMVEYFYNRVQNVITKYSVERHWNSLNE-ETGGMNDVLYRLYTITQDPK 307
+AL ++++WM R Y+ W+ + E + + L R Y +T DP
Sbjct: 149 RALPVLERVSRWM-----QRHGGSSKPYA----WSGMGPLEWYTLPEYLLRAYAVTSDPL 199
Query: 308 HLLLAHLFDKPCF--------LGLLAVQADDISGFH-ANTHIPVVIGSQMRYEVTGDPLY 358
+ LA+ + F +G L +AD+ F+ A++H + + YE TGDP Y
Sbjct: 200 YRELANAYRYDEFYDALLERDVGALMRRADEARNFYQAHSHANTLNSAAAVYETTGDPRY 259
Query: 359 KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN---EESCTTYNMLKVSRH 415
T +++ S +ATG E + P++ L +E E +C ++ M+++ RH
Sbjct: 260 LDVLTAGYELLRESQTFATGMFGPLEAFMKPRQRVEVLHSEEGHAEVACPSWAMMRLVRH 319
Query: 416 LFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHG-------- 467
L T E + D+ E + NG+ S P R D +A Y
Sbjct: 320 LIELTGEAQFGDWMELNVYNGIGSA-------------PPTRADGRATQYFADYGLDRAT 366
Query: 468 --WGTRFSSFWCCYGT 481
WG +S CC T
Sbjct: 367 KTWGVEWS---CCSTT 379
>gi|302547294|ref|ZP_07299636.1| putative secreted protein [Streptomyces hygroscopicus ATCC 53653]
gi|302464912|gb|EFL28005.1| putative secreted protein [Streptomyces himastatinicus ATCC 53653]
Length = 740
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 67/134 (50%), Gaps = 13/134 (9%)
Query: 354 GDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVS 413
G+ Y F +V Y+ GGT GE + +A+TL +N E+C TYNMLK+S
Sbjct: 337 GETAYAAAARNFWGMVAGPRMYSLGGTGQGEMFRARNAIAATLDGKNAETCATYNMLKLS 396
Query: 414 RHLFRWTKEMVYADYYERALTNGVLSIQRG----TEPGVMIYMLPLGRGDSKAKSYHGWG 469
R LF + Y DYYER LTN +L+ +R T P V + +G G + Y G
Sbjct: 397 RQLFFREPDAAYMDYYERGLTNHILASRRDAPSTTSPEVTYF---VGMGPGVRREYDNTG 453
Query: 470 TRFSSFWCCYGTGL 483
T CC GTG+
Sbjct: 454 T------CCGGTGM 461
>gi|225874351|ref|YP_002755810.1| hypothetical protein ACP_2792 [Acidobacterium capsulatum ATCC
51196]
gi|225791337|gb|ACO31427.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
51196]
Length = 611
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 158/384 (41%), Gaps = 54/384 (14%)
Query: 123 QTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWE------DP----TCELRGHFVGHY 172
Q N + L LD D+L+ F++ AG P G GW DP T + GH G Y
Sbjct: 62 QANHAFFLALDEDALLKPFRERAGLPAPGPQMGGWYNFSKEFDPPNNMTGYIPGHSFGQY 121
Query: 173 LSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPY 232
LS A +A+T + K K+ +V +E + + +P P
Sbjct: 122 LSGLARAYAATGDQPTKAKVHRLVRGFAEA---VSPKFYDDYP--------------LPC 164
Query: 233 YTIHKILAGLLDQYTFADNTQAL-KMTKWMVEYFYNRVQNVITK--YSVERHWNSLN--E 287
YT K GL+D + FA + AL +++ + + +T+ + H N +
Sbjct: 165 YTFDKSNCGLIDAHQFAGDPNALHALSRALDAVMPYLPSHALTRPEMAARPHPNIAFTWD 224
Query: 288 ETGGMNDVLYRLYTITQDPKHLLLAHLF--DKPCFLGLLAVQADDISGFHANTHIPVVIG 345
E+ + + + Y + D K+L++A F DK + LA + + HA +H+ +
Sbjct: 225 ESYTLPENFFLAYKRSGDEKYLVMAQRFLQDK-SYFDPLAEGDNVLPHQHAYSHVNALNS 283
Query: 346 SQMRYEVTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDP------KRLASTLGT 398
+ Y V G + + F +++ S +ATGG E + +P K L T +
Sbjct: 284 ASQAYLVLGSEKHLRAARNGFQFVLDQS--FATGGWGPNETFVEPGSGGLYKSLTETHAS 341
Query: 399 ENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLPLGRG 458
E C Y KV+R+L R T + Y D E+ L N +L + G Y
Sbjct: 342 -FETPCGAYGHFKVTRYLMRITGDSRYGDSMEQVLYNTILGAMPLEQGGFSFYYSDY--N 398
Query: 459 DSKAKSYHGWGTRFSSFW-CCYGT 481
+ AK+Y + W CC GT
Sbjct: 399 NYAAKNY------YPEQWPCCSGT 416
>gi|189467199|ref|ZP_03015984.1| hypothetical protein BACINT_03583 [Bacteroides intestinalis DSM
17393]
gi|189435463|gb|EDV04448.1| hypothetical protein BACINT_03583 [Bacteroides intestinalis DSM
17393]
Length = 175
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 66/122 (54%), Gaps = 8/122 (6%)
Query: 107 LHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAG-------SPTAGKAYEGWED 159
L DV+L PS + ++ ++ + + L+ SF+ AG K GWE
Sbjct: 48 LKDVRLLPSRFRDNMMRDSV-WMTSIATNRLLHSFRDNAGVFAGREGGDMTVKKLGGWES 106
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF 219
CELRGH GH LSA A M+AST + K K ++V+ L+E Q +G+GYLSA+P E
Sbjct: 107 LDCELRGHTTGHLLSAYALMYASTGSEIFKLKGDSLVTGLAEVQAALGNGYLSAYPEELI 166
Query: 220 DR 221
+R
Sbjct: 167 NR 168
>gi|336429869|ref|ZP_08609826.1| hypothetical protein HMPREF0994_05832 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336001322|gb|EGN31460.1| hypothetical protein HMPREF0994_05832 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 606
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 104/424 (24%), Positives = 155/424 (36%), Gaps = 73/424 (17%)
Query: 100 DFLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWED 159
D LK+ +V+L +SL R ++ E L + DSL++ F+ AG G+ GW
Sbjct: 2 DRLKDFRYRNVELK-NSLWERQRRETAETYLAIPNDSLLYYFRTLAGLEAPGEGLTGWYG 60
Query: 160 PTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQF 219
G L A A ++A T + LKEK + +C +A + F
Sbjct: 61 NGAST----FGQKLGAFAKLYAVTGDYRLKEKAVYLAEGWGKC---------AAANKKVF 107
Query: 220 DRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVE 279
D + Y K+L G LD Y + L + + R + I + ++
Sbjct: 108 DCNDT--------YVYEKLLGGFLDMYENLGYEKGLAYCSGLTDSAAARFKRDIPRDGLQ 159
Query: 280 R---------HWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADD 330
W +L E LYR Y +T + K+L A +D L +
Sbjct: 160 GPELCENNMIEWYTLPEN-------LYRAYQLTGEQKYLDFAQEWDYTYLWDKLNNKDSA 212
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEF----- 385
I HA + + + + M YEVTG Y + H YATGG E
Sbjct: 213 IGPRHAYSQVNSLSSAAMAYEVTGKKYYLDAIENGYTEITERHTYATGGYGPAECLFAEE 272
Query: 386 -----------WSDPKRLAST--------LGTEN-----EESCTTYNMLKVSRHLFRWTK 421
W DP R + +G + E SC + + K+ +L R T
Sbjct: 273 EGFLGEMLKDSW-DPTRKSPVYRNFGGGLVGRNDNWGSCEVSCCAWAVFKICNYLLRITG 331
Query: 422 EMVYADYYERALTNGVLSIQRGTEPG-VMIYMLPLGRGDSKA---KSYHGWGTRFSSFWC 477
+ Y + E+ L NGV G VM Y G K+ + G G F + C
Sbjct: 332 KAKYGAWAEQMLINGVAGQPPIDSQGHVMYYADYFVDGAVKSVQDRRLQGNGANF-EWQC 390
Query: 478 CYGT 481
C GT
Sbjct: 391 CTGT 394
>gi|237718517|ref|ZP_04548998.1| LOW QUALITY PROTEIN: acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
gi|229452224|gb|EEO58015.1| LOW QUALITY PROTEIN: acetyl-CoA carboxylase [Bacteroides sp. 2_2_4]
Length = 502
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 58/116 (50%), Gaps = 7/116 (6%)
Query: 369 VNASHGYATGGTSAGE-FWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYAD 427
V A+ A GG S E F D L+ E ESC TYNML+++ LFR YAD
Sbjct: 2 VTANRSLAFGGNSRREHFPDDTDYLSYVDDREGPESCNTYNMLRLTEGLFRMNPTADYAD 61
Query: 428 YYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
+YERAL N +LS Q E G +Y P ++ Y + + WCC GTG+
Sbjct: 62 FYERALFNHILSTQH-PEHGGYVYFTP-----ARPAHYRVYSAPNEAMWCCVGTGM 111
>gi|336425065|ref|ZP_08605095.1| hypothetical protein HMPREF0994_01101 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336012974|gb|EGN42863.1| hypothetical protein HMPREF0994_01101 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 575
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 92/407 (22%), Positives = 159/407 (39%), Gaps = 57/407 (14%)
Query: 102 LKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT 161
KEV+L++ + + L + L + D+++ +++AG P G Y GW +
Sbjct: 6 FKEVTLNEGMMK------KVLDETLAFYLKIPNDNILKYMRESAGKPAPGIFYTGWYPNS 59
Query: 162 CELRG-HFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFD 220
RG +G +LSA + M+A + + ++K + +C Y SA + F
Sbjct: 60 ---RGIALIGQWLSAYSRMYAISGDEAFRQKAVYLADEFWDC-------YESAQHTAPFL 109
Query: 221 RFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRV--QNVITKYSV 278
+ +Y + K+L D + + A + +++++ + + +N+ S
Sbjct: 110 TSRS-------HYDVEKLLRAHCDLFLYCKYPCAKERAGYLIDFAADNLTAENIFGDNST 162
Query: 279 ERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISG----- 333
E W +L E + + I + P+ +A F+ F L AD S
Sbjct: 163 E--WYTLAES-------FWDAFEILEIPRAQQMAERFEYREFWDLFYKDADPFSKRPQAG 213
Query: 334 -----FHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSD 388
HA +H+ YE+T P + + F + ATGG
Sbjct: 214 LYSEFCHAYSHVNSFNSCAKAYEMTKSPYFLKSLRSFYRFMQTEEVMATGGYGPNYEHLM 273
Query: 389 PK-RLASTLGTEN---EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGT 444
PK R+ L T + E C TY ++ ++L R+T E Y ++ E L N + T
Sbjct: 274 PKNRIIDALRTGHDSFETQCDTYAAFRLCKYLTRFTDEPEYGNWVESLLYNAAAATIPMT 333
Query: 445 EPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFW-CCYGTGLGFIRWIQ 490
E G +IY S Y G+ W CC GT + IQ
Sbjct: 334 EEGNIIYY-------SDYNMYAGYKKNRQDGWTCCTGTRPLLVAEIQ 373
>gi|94967351|ref|YP_589399.1| hypothetical protein Acid345_0320 [Candidatus Koribacter versatilis
Ellin345]
gi|94549401|gb|ABF39325.1| Protein of unknown function DUF1680 [Candidatus Koribacter
versatilis Ellin345]
Length = 607
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 81/348 (23%), Positives = 141/348 (40%), Gaps = 37/348 (10%)
Query: 125 NLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPT----------CELRGHFVGHYLS 174
N + L LD D L+ F++ AG P G+ GW D T + GH +G Y+S
Sbjct: 58 NHAFFLKLDEDRLLKVFRQKAGLPAPGEDMGGWYDLTGFDLAKGDFHGFVPGHTLGQYVS 117
Query: 175 ASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYT 234
A A +A+T + K K+ +V GY + D+ P YT
Sbjct: 118 ALARCYAATGSEETKAKVHRLV-----------KGYGATLD----DKASFFAGYRLPAYT 162
Query: 235 IHKILAGLLDQYTFADNTQAL----KMTKWMVEYFYNRVQNVITKYSVERHWNSLN-EET 289
K+ GL+D + FA + A+ K+T+ M++Y + + + + S +E+
Sbjct: 163 YDKLSCGLIDAHEFAHDPDAMAIHEKLTRGMLQYLPEKALSRAEQRARPHKDESFTWDES 222
Query: 290 GGMNDVLYRLYTITQDPKHLLLAHLF-DKPCFLGLLAVQADDISGFHANTHIPVVIGSQM 348
+ + L+ Y T + + L F + + L+ + ++G HA +H+ +
Sbjct: 223 YTLPENLFLAYRRTGNKFYRELGTRFLEDDTYFNPLSEGINVLAGEHAYSHMNAFCSAMQ 282
Query: 349 RYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSD--PKRLASTLGTEN---EES 403
Y ++ +V A +ATGG E + + +L +L + E
Sbjct: 283 AYLTLDSERHRKAARNGFRMV-AEQSFATGGWGPSEAFVEFNKGQLGDSLEKSHSSFETP 341
Query: 404 CTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
C Y K++R+L + + Y D ER + N VL + G Y
Sbjct: 342 CGAYAHFKLTRYLLQTDGDSTYGDSMERVMYNTVLGAKPIQPDGTSFY 389
>gi|374374779|ref|ZP_09632437.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
gi|373231619|gb|EHP51414.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
Length = 614
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 130/314 (41%), Gaps = 25/314 (7%)
Query: 155 EGWEDPTCELR--GHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLS 212
EG++ + R G VG YL A+A+ W T N LK +M + + L + Q + GYL
Sbjct: 76 EGFQSRPGKQRWIGEHVGKYLEAAANTWIITKNAALKTQMDRIFNELIKTQ--LPDGYLG 133
Query: 213 AF-PSEQFDRFEALKPVWAPYYTIHKI-LAGLLDQYTFADNTQALKMTKWMVEYFYNRVQ 270
+ P + ++ VW +HK L GLL Y + +AL + + +
Sbjct: 134 TYLPDSYWTSWD----VW-----VHKYDLVGLLAYYRVTGDRRALTAAVKVGDLLLKNIG 184
Query: 271 NVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHL----LLAHLFDKPCFLGLLAV 326
++ + + + + + + D + LY T D ++L + +D P ++
Sbjct: 185 DLPGQKDIIKTGSHVGMAATSVIDPMTDLYQWTGDRRYLDFCKYIIKAYDHPAGPSIVTT 244
Query: 327 -----QADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTS 381
Q D ++ A + ++G Y +TGD Y D + A + TG TS
Sbjct: 245 LLKEKQVDKVANGKAYEMLSNLVGIIKLYRLTGDEKYLQACRNAFDDIAAKRLFVTGTTS 304
Query: 382 AGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E + L + E C T ++ + LF T ++ Y + E+++ N +L +
Sbjct: 305 DHERFMPDNILQADTAAHMGEGCVTTTWIQFNVQLFAITGDLKYYNEIEKSVYNHLLGAE 364
Query: 442 RGTEPGVMIYMLPL 455
E G + Y PL
Sbjct: 365 N-PETGCVSYYTPL 377
>gi|167537610|ref|XP_001750473.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771013|gb|EDQ84687.1| predicted protein [Monosiga brevicollis MX1]
Length = 2823
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 72/172 (41%), Gaps = 21/172 (12%)
Query: 101 FLKEVSLHDVKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDP 160
F EV +V L P S+ RA N+ YLL D L++ F+ G+P GW+
Sbjct: 93 FQVEVPTSNVTLTPGSVLRRAFDANIIYLLGHPTDDLLYFFRLRNGNPNPPGQCWGWD-- 150
Query: 161 TCELRGHFVGHYLSASAHM--WASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSEQ 218
LRG G +L S + W N TL+ +M VV+ + Q + GY F
Sbjct: 151 -ANLRGSLAGEFLMGSGGISRWPMA-NATLRARMDEVVAGI--LQEQEADGYAMGF---- 202
Query: 219 FDRFEALKPVWA---PYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYN 267
A W P Y + GLL + A N QAL + + + +F N
Sbjct: 203 -----ARNETWTHENPDYVTSWVTHGLL-EAAIAGNEQALPLIRRHLNWFNN 248
>gi|336239737|ref|XP_003342727.1| hypothetical protein SMAC_10375 [Sordaria macrospora k-hell]
Length = 159
Score = 52.8 bits (125), Expect = 4e-04, Method: Composition-based stats.
Identities = 33/102 (32%), Positives = 50/102 (49%), Gaps = 3/102 (2%)
Query: 110 VKLDPSSLHWRAQQTNLEYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFV 169
V L PS A N YLL LD + L+ +F +AG P Y GWE + GH +
Sbjct: 57 VTLQPSPFA-DAFAANRRYLLDLDPERLLHNFYISAGLPAPKPVYGGWEAQG--IAGHSL 113
Query: 170 GHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYL 211
GH+LSA A A++ + + ++ + ++ Q G GY+
Sbjct: 114 GHWLSACALTVANSGDAAIAARLDHALKEMARIQAAHGDGYV 155
>gi|302809111|ref|XP_002986249.1| hypothetical protein SELMODRAFT_425170 [Selaginella moellendorffii]
gi|300146108|gb|EFJ12780.1| hypothetical protein SELMODRAFT_425170 [Selaginella moellendorffii]
Length = 192
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 29/35 (82%)
Query: 169 VGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQ 203
GHYLSA+A +WASTHN +K++M A+V+ L+ECQ
Sbjct: 7 AGHYLSATAKLWASTHNAEVKKRMDALVNILAECQ 41
>gi|380482670|emb|CCF41095.1| secreted protein [Colletotrichum higginsianum]
Length = 246
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 43/82 (52%), Gaps = 7/82 (8%)
Query: 409 MLKVSRHLFRWTK--EMVYADYYERALTNGVLSIQRGTEP-GVMIYMLPLG----RGDSK 461
MLK++R L+ + Y D+YERAL N +L Q ++ G + Y PL RG
Sbjct: 1 MLKLTRELWLTSPGTTTAYFDFYERALLNHLLGQQDPSDDHGHVTYFTPLNPGGRRGVGP 60
Query: 462 AKSYHGWGTRFSSFWCCYGTGL 483
A W T + SFWCC GTGL
Sbjct: 61 AWGGGTWSTDYDSFWCCQGTGL 82
>gi|227509160|ref|ZP_03939209.1| conserved hypothetical protein, partial [Lactobacillus brevis
subsp. gravesensis ATCC 27305]
gi|227191367|gb|EEI71434.1| conserved hypothetical protein [Lactobacillus brevis subsp.
gravesensis ATCC 27305]
Length = 106
Score = 49.7 bits (117), Expect = 0.004, Method: Composition-based stats.
Identities = 34/100 (34%), Positives = 45/100 (45%), Gaps = 17/100 (17%)
Query: 164 LRGHFVGHYLSASAHMWASTHN----VTLKEKMTAVVSALSECQ------NKMGSGYLSA 213
RGHF GHYLSA + S + L K+ + L Q + +GY+SA
Sbjct: 1 FRGHFFGHYLSALSQAIDSVSDDDTRSQLLSKLRIGIEGLFRAQQAYAKSHPQSAGYVSA 60
Query: 214 FPSEQFDRFEALK-------PVWAPYYTIHKILAGLLDQY 246
F D E + V P+Y +HKILAGL+D Y
Sbjct: 61 FREVALDEVEGKRVPESEKENVIVPWYNLHKILAGLIDGY 100
>gi|340619901|ref|YP_004738354.1| hypothetical protein zobellia_3937 [Zobellia galactanivorans]
gi|339734698|emb|CAZ98075.1| Conserved hypothetical periplasmic protein [Zobellia
galactanivorans]
Length = 629
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 72/307 (23%), Positives = 121/307 (39%), Gaps = 34/307 (11%)
Query: 163 ELRGHFVGH--YLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMG-SGYLSAFPSEQF 219
E+ G F+G + AS + A +H+ + E +V + + Q K G SG+ P +
Sbjct: 78 EVVGAFIGMGMLIDASVRLAAYSHDPKMMEIKNEIVDKVIDEQLKNGYSGFYK--PERRL 135
Query: 220 DRFEALKPVWAPYYTIHK---ILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKY 276
+ W IH+ I+ GL Y N ++LK ++ + Y
Sbjct: 136 WNSQGGGDNW----DIHEMAFIIDGLTSDYELFGNKRSLKAAIKTADFIMEHWHEMPDDY 191
Query: 277 SVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLA------HLFDKPCFLGLLAVQADD 330
+ E + L+ G++ ++RLY T + + L + + +D +G +
Sbjct: 192 AAEVDMHVLDT---GIDWAIFRLYKTTGEKRFLNFSEKTKSLYQWDTKIEIG----RRPG 244
Query: 331 ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG--EFWSD 388
+SG H + + + Y TG+ M A G G SAG E W+D
Sbjct: 245 VSG-HMFAYFAMCMAQIELYRYTGNKELLQQTENAMRFFLAEDGLTISG-SAGQREIWTD 302
Query: 389 PKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGV 448
+ + LG E+C T +V L R T + Y D ER + NG+ Q + G
Sbjct: 303 DQDGENELG----ETCATAYQTRVYESLLRLTGKAEYGDLIERTVYNGLFGAQ-SPDGGK 357
Query: 449 MIYMLPL 455
+ Y P
Sbjct: 358 LRYYTPF 364
>gi|423223914|ref|ZP_17210383.1| hypothetical protein HMPREF1062_02569 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392637516|gb|EIY31383.1| hypothetical protein HMPREF1062_02569 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 664
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 48/156 (30%), Positives = 65/156 (41%), Gaps = 22/156 (14%)
Query: 334 FHANTHIPVVIGSQMRYEVTGDP--LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
FH N +G Y +TGD L KV+G + D ++ Y TGG S E +
Sbjct: 284 FHMN-----FMGFLRLYRITGDKTLLRKVSGAW--DDIHERQMYITGGVSVAEHYE--HD 334
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
L E+C T + +++++ L T E YAD ER + N V + Q E GV Y
Sbjct: 335 YVKPLSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHVFAAQ-DCESGVCRY 393
Query: 452 -MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGLGFI 486
P G SK Y F CC +G I
Sbjct: 394 HTAPNG---SKPDGY------FHGPDCCTASGHRII 420
>gi|224537087|ref|ZP_03677626.1| hypothetical protein BACCELL_01964 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521314|gb|EEF90419.1| hypothetical protein BACCELL_01964 [Bacteroides cellulosilyticus
DSM 14838]
Length = 664
Score = 45.8 bits (107), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 48/156 (30%), Positives = 65/156 (41%), Gaps = 22/156 (14%)
Query: 334 FHANTHIPVVIGSQMRYEVTGDP--LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
FH N +G Y +TGD L KV+G + D ++ Y TGG S E +
Sbjct: 284 FHMN-----FMGFLRLYRITGDKTLLRKVSGAW--DDIHERQMYITGGVSVAEHYE--HD 334
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
L E+C T + +++++ L T E YAD ER + N V + Q E GV Y
Sbjct: 335 YVKPLSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHVFAAQ-DCESGVCRY 393
Query: 452 -MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGLGFI 486
P G SK Y F CC +G I
Sbjct: 394 HTAPNG---SKPDGY------FHGPDCCTASGHRII 420
>gi|154495303|ref|ZP_02034308.1| hypothetical protein PARMER_04360 [Parabacteroides merdae ATCC
43184]
gi|423722505|ref|ZP_17696681.1| hypothetical protein HMPREF1078_00744 [Parabacteroides merdae
CL09T00C40]
gi|154085227|gb|EDN84272.1| hypothetical protein PARMER_04360 [Parabacteroides merdae ATCC
43184]
gi|409242350|gb|EKN35113.1| hypothetical protein HMPREF1078_00744 [Parabacteroides merdae
CL09T00C40]
Length = 625
Score = 45.1 bits (105), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 58/269 (21%), Positives = 109/269 (40%), Gaps = 41/269 (15%)
Query: 197 SALSECQNKMGSGYLSAF-PSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQAL 255
+A S ++ +GY+ + P Q +++ VW YT GL+ Y + + +AL
Sbjct: 105 AAESLMATQLPNGYIGNYAPEYQLQQWD----VWGRKYT----SLGLIAWYDLSGDKKAL 156
Query: 256 KMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLF 315
+ +V++ +V K + N + + + + + LY T++ ++L A
Sbjct: 157 EAACRVVDHLMTQVGP--GKVDIVSTGNYIGMPSSSVLEPVMYLYNRTKEKRYLDFAKYI 214
Query: 316 ----DKPCFLGLLAVQADDI----------------SGFHANTHIPVVIGSQMRYEVTGD 355
+ P L++ D+ +G A + G Y+VTG+
Sbjct: 215 VGQWETPGGPQLISKAIADVPVANRFPHPKTWFSRENGQKAYEMMSCYEGLLELYKVTGN 274
Query: 356 PLY-----KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNML 410
PLY K G + +N + G SA E W K + E+C T+ +
Sbjct: 275 PLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETCVTFTWM 329
Query: 411 KVSRHLFRWTKEMVYADYYERALTNGVLS 439
++ L + T +YADY E A+ N +++
Sbjct: 330 QLCNRLLQMTGNSLYADYMETAIYNALMA 358
>gi|427384256|ref|ZP_18880761.1| hypothetical protein HMPREF9447_01794 [Bacteroides oleiciplenus YIT
12058]
gi|425727517|gb|EKU90376.1| hypothetical protein HMPREF9447_01794 [Bacteroides oleiciplenus YIT
12058]
Length = 662
Score = 45.1 bits (105), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 48/156 (30%), Positives = 64/156 (41%), Gaps = 22/156 (14%)
Query: 334 FHANTHIPVVIGSQMRYEVTGDP--LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKR 391
FH N +G Y +TGD L KV G + D ++ Y TGG S E +
Sbjct: 284 FHMN-----FMGFLRLYRITGDKSLLRKVAGAW--DDIHERQMYITGGVSVAEHYE--HD 334
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
L E+C T + +++++ L T E YAD ER + N V + Q E GV Y
Sbjct: 335 YVKPLSGNIVETCATMSWMQLTQQLLELTGESKYADAMERLMINHVFAAQ-DCENGVCRY 393
Query: 452 -MLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGLGFI 486
P G SK Y F CC +G I
Sbjct: 394 HTAPNG---SKPDGY------FHGPDCCTASGHRII 420
>gi|423345501|ref|ZP_17323190.1| hypothetical protein HMPREF1060_00862 [Parabacteroides merdae
CL03T12C32]
gi|409223287|gb|EKN16224.1| hypothetical protein HMPREF1060_00862 [Parabacteroides merdae
CL03T12C32]
Length = 625
Score = 45.1 bits (105), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 350 YEVTGDPLY-----KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESC 404
Y+VTG+PLY K G + +N + G SA E W K + E+C
Sbjct: 269 YKVTGNPLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETC 323
Query: 405 TTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLS 439
T+ +++ L + T +YADY E A+ N +++
Sbjct: 324 VTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMA 358
>gi|423343638|ref|ZP_17321351.1| hypothetical protein HMPREF1077_02781 [Parabacteroides johnsonii
CL02T12C29]
gi|409214660|gb|EKN07669.1| hypothetical protein HMPREF1077_02781 [Parabacteroides johnsonii
CL02T12C29]
Length = 625
Score = 45.1 bits (105), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 350 YEVTGDPLY-----KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESC 404
Y+VTG+PLY K G + +N + G SA E W K + E+C
Sbjct: 269 YKVTGNPLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETC 323
Query: 405 TTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLS 439
T+ +++ L + T +YADY E A+ N +++
Sbjct: 324 VTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMA 358
>gi|195607558|gb|ACG25609.1| hypothetical protein [Zea mays]
Length = 49
Score = 45.1 bits (105), Expect = 0.096, Method: Composition-based stats.
Identities = 21/26 (80%), Positives = 21/26 (80%)
Query: 387 SDPKRLASTLGTENEESCTTYNMLKV 412
SD KRLA L TE EESCTTYNMLKV
Sbjct: 6 SDRKRLAVALPTETEESCTTYNMLKV 31
>gi|218261883|ref|ZP_03476568.1| hypothetical protein PRABACTJOHN_02239 [Parabacteroides johnsonii
DSM 18315]
gi|218223731|gb|EEC96381.1| hypothetical protein PRABACTJOHN_02239 [Parabacteroides johnsonii
DSM 18315]
Length = 625
Score = 44.7 bits (104), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 350 YEVTGDPLY-----KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESC 404
Y+VTG+PLY K G + +N + G SA E W K + E+C
Sbjct: 269 YKVTGNPLYLSVVEKTVGHIVREEINVA-----GSGSAFECWYGGKERQTQPTYHTMETC 323
Query: 405 TTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLS 439
T+ +++ L + T +YADY E A+ N +++
Sbjct: 324 VTFTWMQLCNRLLQMTGNSLYADYMETAIYNALMA 358
>gi|336404174|ref|ZP_08584872.1| hypothetical protein HMPREF0127_02185 [Bacteroides sp. 1_1_30]
gi|335943502|gb|EGN05341.1| hypothetical protein HMPREF0127_02185 [Bacteroides sp. 1_1_30]
Length = 669
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 112/302 (37%), Gaps = 36/302 (11%)
Query: 151 GKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGY 210
G +GWE RG + L A++ ++ TLKEK V Q G+
Sbjct: 84 GGTGDGWE------RGPYWIDGLLPLAYI---LNDQTLKEKALKWVEWCLNNQQDNGNFG 134
Query: 211 LSAFPSEQFDRF----EALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFY 266
P E +D+ + ++ W P I+ +L QY A T ++ +M+ YF
Sbjct: 135 PKPLP-ENYDKIWGVQQGMRDDWWP----KMIMLKVLQQYYMA--TGDKRVIDFMIRYFK 187
Query: 267 NRVQNVITKYSVERHWNSLNEETGGMN-DVLYRLYTITQDPKHLLLAHLFDK------PC 319
+ Q + KY + HW G N V+Y LY IT++ L L L +
Sbjct: 188 YQ-QETLPKYPLG-HWTFWANRRGADNLAVVYWLYNITKEKFLLELGELIHQQTYDWTEV 245
Query: 320 FLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGG 379
F G + + H + + Y+ D Y + + HG+ G
Sbjct: 246 FSGNVIRTLNPYPSLHCVNVAQGLKAPVIYYQQHPDEKYLSAVKEGLSALRDCHGFVNGM 305
Query: 380 TSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLS 439
E RL T+ E CT M+ + T ++ YADY E+ N VL
Sbjct: 306 YGGDE------RLHGNNPTQGSELCTAVEMMHSFESILPITGDVYYADYLEKIAYN-VLP 358
Query: 440 IQ 441
Q
Sbjct: 359 AQ 360
>gi|557474|gb|AAA50392.1| ORF1, partial [Bacteroides ovatus]
Length = 436
Score = 42.4 bits (98), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 6/59 (10%)
Query: 425 YADYYERALTNGVLSIQRGTEPGVMIYMLPLGRGDSKAKSYHGWGTRFSSFWCCYGTGL 483
Y +YYERAL N +L+ Q + G +Y P+ G Y + +S WCC G+GL
Sbjct: 4 YVNYYERALYNHILASQE-PDKGGFVYFTPMRPG-----HYRVYSQPETSMWCCVGSGL 56
>gi|237719720|ref|ZP_04550201.1| predicted protein [Bacteroides sp. 2_2_4]
gi|229450989|gb|EEO56780.1| predicted protein [Bacteroides sp. 2_2_4]
Length = 663
Score = 42.0 bits (97), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 61/154 (39%), Gaps = 15/154 (9%)
Query: 335 HANTHIPVVIGSQMRYEVTGDP--LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRL 392
H++T +G Y +TGD KV G + D ++ Y TGG S E +
Sbjct: 282 HSHTFQMNFMGFLRLYRITGDKSLFRKVAGAW--DDIHKRQMYITGGVSVAEHYE--HDY 337
Query: 393 ASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYM 452
+ E+C T + +++++ L T E YAD ER + N V + Q +
Sbjct: 338 VKPISGHVVETCATMSWMQLTQMLLELTGESKYADAMERLMINHVFAAQDCETGSCRYHT 397
Query: 453 LPLGRGDSKAKSYHGWGTRFSSFWCCYGTGLGFI 486
P G SK Y F CC +G I
Sbjct: 398 APNG---SKPHGY------FHGPDCCTASGHRII 422
>gi|397691075|ref|YP_006528329.1| six-hairpin glycosidase [Melioribacter roseus P3M]
gi|395812567|gb|AFN75316.1| six-hairpin glycosidase [Melioribacter roseus P3M]
Length = 643
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 64/219 (29%), Positives = 89/219 (40%), Gaps = 40/219 (18%)
Query: 296 LYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDIS----GFHANTHIPVV-----IGS 346
L +LY IT +++ LA F L ++ D + G +A HIP+V +G
Sbjct: 219 LIKLYQITGKKEYMELAKFF--------LDIRGDSTTHKLYGEYAQDHIPLVEQKEAVGH 270
Query: 347 QMR----YEVTGD--------PLYKVTGTFFMDIVNASHGYATGGTSA---GEFWSDPKR 391
+R Y D K T + ++VN Y TGG A GE + D
Sbjct: 271 AVRALYMYAAMTDIAVLHDDEDYRKAVFTLWDNVVN-KKTYITGGLGARHDGEAFGDDYE 329
Query: 392 LASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIY 451
L + T E+C + + LF T + YAD ER L NG++S + Y
Sbjct: 330 LPNL--TAYGETCAAIGSVYWNYRLFEMTGDSKYADVIERTLYNGLIS-GISLDGKNFFY 386
Query: 452 MLPLGRGDSKAKSYHGWGTRFSSFWC-CYGTGLGFIRWI 489
PL D + K G TR F C C T L IR+I
Sbjct: 387 PNPL-ESDGEYKFNMGACTRQPWFDCSCCPTNL--IRFI 422
>gi|150007964|ref|YP_001302707.1| hypothetical protein BDI_1325 [Parabacteroides distasonis ATCC
8503]
gi|149936388|gb|ABR43085.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 623
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 70/329 (21%), Positives = 128/329 (38%), Gaps = 49/329 (14%)
Query: 133 DVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKM 192
DVD LV F+ ++ T + F G ++ + + + L + +
Sbjct: 57 DVDHLVEPFRH--------------KEETLRWQSEFWGKWIQGAIASYRYDKDPELYKII 102
Query: 193 TAVVSALSECQNKMGSGYLSAFPSE-QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADN 251
+L E Q + +GY+ + E Q ++++ +W YT GL+ Y + +
Sbjct: 103 KNGAESLMETQ--LPNGYIGNYSEEAQLNQWD----IWGRKYT----ALGLIAYYDLSGD 152
Query: 252 TQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLL 311
+AL ++++ +V K ++ N + + + + + LY T+ K+L
Sbjct: 153 RKALDAACRVIDHLMTQVGP--GKVNIVTTGNYIGMPSSSVLEPVMYLYNRTRQDKYLDF 210
Query: 312 AHLFDK----PCFLGLLAVQADDI----------------SGFHANTHIPVVIGSQMRYE 351
A K P L++ DI +G A + G Y+
Sbjct: 211 AKYIVKQWETPEGPRLISKAIADIPVAGRFPHPKVWFSPENGQKAYEMMSCYEGLLELYK 270
Query: 352 VTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNML 410
VT +PLY V I+N A G SA E W K L + E+C T+ +
Sbjct: 271 VTKNPLYLSVVEKTMNHIINEEINVAGSG-SAFECWYGGKALQTYPTYHTMETCVTFTWM 329
Query: 411 KVSRHLFRWTKEMVYADYYERALTNGVLS 439
++ + T +YAD E+A+ N +L+
Sbjct: 330 QICDRMLGLTGNSLYADQIEKAMYNALLA 358
>gi|256840863|ref|ZP_05546371.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|256738135|gb|EEU51461.1| conserved hypothetical protein [Parabacteroides sp. D13]
Length = 625
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 70/329 (21%), Positives = 128/329 (38%), Gaps = 49/329 (14%)
Query: 133 DVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNVTLKEKM 192
DVD LV F+ ++ T + F G ++ + + + L + +
Sbjct: 59 DVDHLVEPFRH--------------KEETLRWQSEFWGKWIQGAIASYRYDKDPELYKII 104
Query: 193 TAVVSALSECQNKMGSGYLSAFPSE-QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADN 251
+L E Q + +GY+ + E Q ++++ +W YT GL+ Y + +
Sbjct: 105 KNGAESLMETQ--LPNGYIGNYSEEAQLNQWD----IWGRKYTA----LGLIAYYDLSGD 154
Query: 252 TQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLL 311
+AL ++++ +V K ++ N + + + + + LY T+ K+L
Sbjct: 155 RKALDAACRVIDHLMTQVGP--GKVNIVTTGNYIGMPSSSVLEPVMYLYNRTRQDKYLDF 212
Query: 312 AHLFDK----PCFLGLLAVQADDI----------------SGFHANTHIPVVIGSQMRYE 351
A K P L++ DI +G A + G Y+
Sbjct: 213 AKYIVKQWETPEGPRLISKAIADIPVAGRFPHPKVWFSPENGQKAYEMMSCYEGLLELYK 272
Query: 352 VTGDPLY-KVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNML 410
VT +PLY V I+N A G SA E W K L + E+C T+ +
Sbjct: 273 VTKNPLYLSVVEKTMNHIINEEINVAGSG-SAFECWYGGKALQTYPTYHTMETCVTFTWM 331
Query: 411 KVSRHLFRWTKEMVYADYYERALTNGVLS 439
++ + T +YAD E+A+ N +L+
Sbjct: 332 QICDRMLGLTGNSLYADQIEKAMYNALLA 360
>gi|281421440|ref|ZP_06252439.1| putative cytoplasmic protein [Prevotella copri DSM 18205]
gi|281404512|gb|EFB35192.1| putative cytoplasmic protein [Prevotella copri DSM 18205]
Length = 690
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 84/204 (41%), Gaps = 33/204 (16%)
Query: 296 LYRLYTITQDPKHLLLA-HLFDKPCFLGLLAVQADDISGFHANTHIPVV-----IGSQMR 349
L RLYT+T + K+L A +L D + G I ++ + +P++ +G +R
Sbjct: 238 LARLYTLTGEKKYLDEAKYLLD---YRG-----KTHIRNPYSQSQVPILEQKEAVGHAVR 289
Query: 350 Y-----------EVTGDPLY-KVTGTFFMDIVNASHGYATGGTSA---GEFWSDPKRLAS 394
+T D Y KV F +IV + Y TGG A GE + + L +
Sbjct: 290 AGYMYAGIADVAALTKDSAYMKVIDRIFENIVGKKY-YLTGGVGARHAGEAFGENYELPN 348
Query: 395 TLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYMLP 454
T E+C +M+ + +F E Y D ER L NGV+S + G Y P
Sbjct: 349 M--TAYNETCAAISMVYLFERMFLLHGESKYIDCMERTLYNGVIS-GMSMDGGRFFYPNP 405
Query: 455 LGRGDSKAKSYHGWGTRFSSFWCC 478
L A + G TR F C
Sbjct: 406 LSSDGKYAFNADGNTTRQPWFGCA 429
>gi|340346785|ref|ZP_08669904.1| hypothetical protein HMPREF9136_0902 [Prevotella dentalis DSM 3688]
gi|433652020|ref|YP_007278399.1| hypothetical protein Prede_1032 [Prevotella dentalis DSM 3688]
gi|339611002|gb|EGQ15842.1| hypothetical protein HMPREF9136_0902 [Prevotella dentalis DSM 3688]
gi|433302553|gb|AGB28369.1| hypothetical protein Prede_1032 [Prevotella dentalis DSM 3688]
Length = 663
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 70/164 (42%), Gaps = 20/164 (12%)
Query: 294 DVLYRLYTITQDPKHLLLAH-------------LFDKPCFLGLLAVQADDISGF-HANTH 339
D + RLYTIT ++L A F + + + D + + HA+T
Sbjct: 229 DPIARLYTITGKKRYLDWAKWVVGNIDKWSGWDAFSRLDSIADGKLGVDQLQPYVHAHTF 288
Query: 340 IPVVIGSQMRYEVTGDP--LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLG 397
+G Y++TGD L KV G + + + Y TGG S E + K L
Sbjct: 289 QMNFMGFLRLYQITGDRSLLRKVEGAW--NDIYRRQMYITGGVSVAEHYE--KGYVKPLS 344
Query: 398 TENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
E+C T + +++++ L T + YAD E+ + N V + Q
Sbjct: 345 GNIIETCATMSWMQLTQMLLELTGDTKYADAIEKIMLNHVFAAQ 388
>gi|436834929|ref|YP_007320145.1| hypothetical protein FAES_1542 [Fibrella aestuarina BUZ 2]
gi|384066342|emb|CCG99552.1| hypothetical protein FAES_1542 [Fibrella aestuarina BUZ 2]
Length = 636
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 48/218 (22%), Positives = 82/218 (37%), Gaps = 20/218 (9%)
Query: 239 LAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYR 298
L GLL Y ++ ++L + ++ N + K + + N + + +
Sbjct: 155 LLGLLAYYDLTNDKRSLNAASKVTDHLINELS--ARKALLVKQGNHRGMAATSVLEPVCL 212
Query: 299 LYTITQDPKHLLLAHLF----DKPCFLGLLAVQADDIS--------------GFHANTHI 340
LY+ T D ++L A + P L+A D++ G A +
Sbjct: 213 LYSRTADKRYLAFAETIVQQWESPEGPQLIAKADVDVANRFPKPKNWFGWEQGQKAYEMM 272
Query: 341 PVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTEN 400
G Y +TG P YK + + G S+ E W K L +
Sbjct: 273 SCYEGLLELYRLTGKPAYKAAVEKTWQNIRDTEINLAGSGSSVECWFGGKALQTLSINHY 332
Query: 401 EESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVL 438
+E+C T +K+S+ L R T + YAD E+ N +L
Sbjct: 333 QETCVTATWIKLSQQLLRLTGDARYADAIEQTYYNALL 370
>gi|301309993|ref|ZP_07215932.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|423340426|ref|ZP_17318165.1| hypothetical protein HMPREF1059_04090 [Parabacteroides distasonis
CL09T03C24]
gi|300831567|gb|EFK62198.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|409227861|gb|EKN20757.1| hypothetical protein HMPREF1059_04090 [Parabacteroides distasonis
CL09T03C24]
Length = 623
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 57/259 (22%), Positives = 105/259 (40%), Gaps = 33/259 (12%)
Query: 203 QNKMGSGYLSAFPSE-QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
+ ++ +GY+ + E Q ++++ +W YT GL+ Y + + +AL +
Sbjct: 111 ETQLPNGYIGNYSEEAQLNQWD----IWGRKYTA----LGLIAYYDLSGDRKALDAACRV 162
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDK---- 317
+++ +V K ++ N + + + + + LY T+ K+L A K
Sbjct: 163 IDHLMTQVGP--GKVNIVTTGNYIGMPSSSVLEPVMYLYNRTRQDKYLDFAKYIVKQWET 220
Query: 318 PCFLGLLAVQADDI----------------SGFHANTHIPVVIGSQMRYEVTGDPLY-KV 360
P L++ DI +G A + G Y+VT +PLY V
Sbjct: 221 PEGPRLISKAIADIPVAGRFPHPKVWFSPENGQKAYEMMSCYEGLLELYKVTKNPLYLSV 280
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWT 420
I+N A G SA E W K L + E+C T+ +++ + T
Sbjct: 281 VEKTMNHIINEEINVAGSG-SAFECWYGGKALQTYPTYHTMETCVTFTWMQICDRMLGLT 339
Query: 421 KEMVYADYYERALTNGVLS 439
+YAD E+A+ N +L+
Sbjct: 340 GNSLYADQIEKAMYNALLA 358
>gi|262381468|ref|ZP_06074606.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262296645|gb|EEY84575.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 623
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 57/259 (22%), Positives = 105/259 (40%), Gaps = 33/259 (12%)
Query: 203 QNKMGSGYLSAFPSE-QFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM 261
+ ++ +GY+ + E Q ++++ +W YT GL+ Y + + +AL +
Sbjct: 111 ETQLPNGYIGNYSEEAQLNQWD----IWGRKYTA----LGLIAYYDLSGDRKALDAACRV 162
Query: 262 VEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDK---- 317
+++ +V K ++ N + + + + + LY T+ K+L A K
Sbjct: 163 IDHLMTQVGP--GKVNIVTTGNYIGMPSSSVLEPVMYLYNRTRQDKYLDFAKYIVKQWET 220
Query: 318 PCFLGLLAVQADDI----------------SGFHANTHIPVVIGSQMRYEVTGDPLY-KV 360
P L++ DI +G A + G Y+VT +PLY V
Sbjct: 221 PEGPRLISKAIADIPVAGRFPHPKVWFSPENGQKAYEMMSCYEGLLELYKVTKNPLYLSV 280
Query: 361 TGTFFMDIVNASHGYATGGTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWT 420
I+N A G SA E W K L + E+C T+ +++ + T
Sbjct: 281 VEKTMNHIINEEINVAGSG-SAFECWYGGKALQTYPTYHTMETCVTFTWMQICDRMLGLT 339
Query: 421 KEMVYADYYERALTNGVLS 439
+YAD E+A+ N +L+
Sbjct: 340 GNSLYADQIEKAMYNALLA 358
>gi|429738051|ref|ZP_19271876.1| hypothetical protein HMPREF9151_00303 [Prevotella saccharolytica
F0055]
gi|429161156|gb|EKY03584.1| hypothetical protein HMPREF9151_00303 [Prevotella saccharolytica
F0055]
Length = 603
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 64/299 (21%), Positives = 106/299 (35%), Gaps = 33/299 (11%)
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAF-PSEQFDRFE 223
+ F G +++++ + + L + M V L Q+K GY+ + P ++
Sbjct: 53 QSEFWGKWMNSAVLAYRYQPSDQLLKTMKTAVDKLVATQDK--KGYIGNYAPQHHLQEWD 110
Query: 224 ALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWN 283
+W Y I GLLD Y + + +AL + ++ S+ R N
Sbjct: 111 ----IWGRKYCI----LGLLDYYGISKDKKALVAASREADCLMAELK--AGNASIVRMGN 160
Query: 284 SLNEETGGMNDVLYRLYTITQDPKHLLLAHLF-------DKPCFLGLLAVQADD------ 330
+ + LY T + K+L A D P + V +
Sbjct: 161 HHGMAASSVLKPICYLYAYTGNKKYLDFAQQIVREWETADGPQLISKADVPVGERFPKPD 220
Query: 331 -------ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAG 383
G A + G Y +TG+ YK + + TG SA
Sbjct: 221 YDNWYKWAQGQKAYEMMSCYEGLLELYRLTGNESYKAAVEKTWQSIMDTEINITGSGSAM 280
Query: 384 EFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQR 442
E W K++ +E+C T +K+SR L T YAD E++L N +L R
Sbjct: 281 ESWFGGKQVQYMPIKHYQETCVTATWIKLSRQLLMLTGNSKYADAIEQSLYNALLGAMR 339
>gi|229818564|ref|YP_002880090.1| hypothetical protein Bcav_0062 [Beutenbergia cavernae DSM 12333]
gi|229564477|gb|ACQ78328.1| protein of unknown function DUF1680 [Beutenbergia cavernae DSM
12333]
Length = 596
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 80/398 (20%), Positives = 131/398 (32%), Gaps = 96/398 (24%)
Query: 127 EYLLMLDVDSLVWSFQKTAGSPTAGKAYEGWEDPTCELRGHFVGHYLSASAHMWASTHNV 186
E L + D +V F+ AG P G GW T + G ++S A + +
Sbjct: 42 ETYLGMSPDDVVHGFRLQAGLPAPGNPMTGWSSRTSQ---PTFGQWVSGLARLGVTAGVA 98
Query: 187 TLKEKMTAVVSALSECQNKMGSGYLSAFPSEQFDRFEALKPVWAPYYTIHKILAGLLDQY 246
++ +V A + G + Y K++ GL D
Sbjct: 99 EASQRAVDLVDAFAATVGDDGDARMG-------------------LYGYEKLVCGLADTA 139
Query: 247 TFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHWNSLNEETGGMNDVLYRLYTITQDP 306
+A + AL + E+ + R S N+ GG T+
Sbjct: 140 LYAGHEDALALLGRTAEWASRTFERA-------RPAASPNDFAGGRIGPASHARTMEW-- 190
Query: 307 KHLLLAHLFDKPCFLGLLAVQADDISGF-----------------------------HAN 337
+ F + + G LA D + F HA
Sbjct: 191 ------YTFAENLYRGWLAGADDAVREFASEWHYDAYWDRFLTPPPPGQPWDVPTWLHAY 244
Query: 338 THIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASH-------GYATGGTSAGEF-WSDP 389
+H+ + YEVTG+ Y +DI+ +H YATGG E +
Sbjct: 245 SHVNTFASAAAAYEVTGEVRY-------LDILRNAHTYLTTTQTYATGGYGPSELTLPED 297
Query: 390 KRLASTLGTENEES---CTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEP 446
L ++ + + C ++ K+S L + T E YAD+ E+ + +G+
Sbjct: 298 GSLGRSIEWRTDTAEIVCGSWAAFKLSSALLKHTGEARYADWVEQLVYSGI--------- 348
Query: 447 GVMIYMLPLGRGDSKAKSYHGWGTRFSSF--W-CCYGT 481
G + + P GR G T+ + W CC GT
Sbjct: 349 GAVTPVRPGGRTPYYQDLRLGIATKLPHWDDWPCCSGT 386
>gi|331700589|ref|YP_004397548.1| hypothetical protein Lbuc_0204 [Lactobacillus buchneri NRRL
B-30929]
gi|329127932|gb|AEB72485.1| protein of unknown function DUF1680 [Lactobacillus buchneri NRRL
B-30929]
Length = 656
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 75/321 (23%), Positives = 126/321 (39%), Gaps = 53/321 (16%)
Query: 163 ELRGHFVG---------HYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
+++GH G +L A+A+ + N LK+ ++ +++ Q+ GYLS
Sbjct: 71 QMKGHHYGFPFQDTDVYKWLEAAAYSFGYHPNPDLKKITDNLIDLIADAQDD--DGYLST 128
Query: 214 F-----PSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM---VEYF 265
+ P +F R + + Y H I AG+ + N +AL + K M ++
Sbjct: 129 YFQIDAPERKFKRLQQSHEL---YTMGHYIEAGVAYHHETG-NEKALDIAKRMADCIDRN 184
Query: 266 YNRVQNVITKYS----VERHWNSLNEETGG---MNDVLYRLYTITQDPKHL--------- 309
+ + I Y +E + L EETG ++ Y L QDP
Sbjct: 185 FGLEEGKIPGYDGHPEIELALSRLYEETGEKRYLDLAHYFLNQRGQDPAFFEKQIQADGD 244
Query: 310 -----LLAHL--FDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP-LYKVT 361
L+ + F + +L ++ + HA + + G TGD L
Sbjct: 245 SPDRDLIPGMRDFTREYYLAAEPIKDQKVPHGHAVRVVYLCTGMAYVARYTGDKDLLAAC 304
Query: 362 GTFFMDIVNASHGYATGG---TSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFR 418
F+ DIV Y TG T+ GE ++ L + T+ E+C + M +R +
Sbjct: 305 DRFWNDIVK-RQMYITGNIGQTTTGEAFTYDYDLPND--TDYGETCASVGMSFFARQMLN 361
Query: 419 WTKEMVYADYYERALTNGVLS 439
+ YAD E+ L NG LS
Sbjct: 362 IHAKGEYADVLEKELFNGALS 382
>gi|406026101|ref|YP_006724933.1| hypothetical protein LBUCD034_0243 [Lactobacillus buchneri CD034]
gi|405124590|gb|AFR99350.1| hypothetical protein LBUCD034_0243 [Lactobacillus buchneri CD034]
Length = 656
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 75/321 (23%), Positives = 126/321 (39%), Gaps = 53/321 (16%)
Query: 163 ELRGHFVG---------HYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
+++GH G +L A+A+ + N LK+ ++ +++ Q+ GYLS
Sbjct: 71 QMKGHHYGFPFQDTDVYKWLEAAAYSFGYHPNPDLKKITDNLIDLIADAQDD--DGYLST 128
Query: 214 F-----PSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWM---VEYF 265
+ P +F R + + Y H I AG+ + N +AL + K M ++
Sbjct: 129 YFQIDAPERKFKRLQQSHEL---YTMGHYIEAGVAYHHETG-NEKALDIAKRMADCIDRN 184
Query: 266 YNRVQNVITKYS----VERHWNSLNEETGG---MNDVLYRLYTITQDPKHL--------- 309
+ + I Y +E + L EETG ++ Y L QDP
Sbjct: 185 FGLEEGKIPGYDGHPEIELALSRLYEETGEKRYLDLAHYFLNQRGQDPAFFEKQIQADGD 244
Query: 310 -----LLAHL--FDKPCFLGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDP-LYKVT 361
L+ + F + +L ++ + HA + + G TGD L
Sbjct: 245 SPDRDLIPGMRDFTREYYLAAEPIKDQKVPHGHAVRVVYLCTGMAYVARYTGDKDLLAAC 304
Query: 362 GTFFMDIVNASHGYATGG---TSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFR 418
F+ DIV Y TG T+ GE ++ L + T+ E+C + M +R +
Sbjct: 305 DRFWNDIVK-RQMYITGNIGQTTTGEAFTYDYDLPND--TDYGETCASVGMSFFARQMLN 361
Query: 419 WTKEMVYADYYERALTNGVLS 439
+ YAD E+ L NG LS
Sbjct: 362 IHAKGEYADVLEKELFNGALS 382
>gi|255532639|ref|YP_003093011.1| hypothetical protein Phep_2748 [Pedobacter heparinus DSM 2366]
gi|255345623|gb|ACU04949.1| protein of unknown function DUF1680 [Pedobacter heparinus DSM 2366]
Length = 684
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 73/312 (23%), Positives = 113/312 (36%), Gaps = 68/312 (21%)
Query: 173 LSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSAFPSE-------QF-DR--F 222
L A A M+AST++ L M ++ ++ Q G Y A + QF DR F
Sbjct: 118 LEAMASMYASTNDPKLDAMMDKAIAVIARSQRDDGYIYTKAMIEQRKTGSKNQFQDRLSF 177
Query: 223 EALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHW 282
EA Y I ++ Y T L + K EY YN Q ++ R+
Sbjct: 178 EA--------YNIGHLMTAACVHYRATGKTTLLNVAKKATEYLYNFYQKASP--ALARNA 227
Query: 283 NSLNEETGGMNDVLYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANT-HIP 341
+ G + +Y +DP++L LA L+A++ G N IP
Sbjct: 228 ICPSHYMG-----VIEMYRTIKDPRYLELAK--------HLIAIKGKIEDGTDDNQDRIP 274
Query: 342 VV-----IGSQMR-----------YEVTGDPLYKVTGTFFMDIVNASHGYATGGTSAGEF 385
+ +G +R Y TG+ T D VN Y TGG +
Sbjct: 275 FLQQTKAMGHAVRANYLYAGVADLYAETGNDSLMKTLNLMWDDVNQHKMYITGGCGSLYD 334
Query: 386 WSDP----------KRLASTLG--------TENEESCTTYNMLKVSRHLFRWTKEMVYAD 427
+ P +++ G T + E+C + + + + + + YAD
Sbjct: 335 GTSPDGTSYNPTEVQKIHQAFGRDFQLPNFTAHNETCANIGNVLWNWRMLQISGDAKYAD 394
Query: 428 YYERALTNGVLS 439
E AL N VLS
Sbjct: 395 VMELALHNSVLS 406
>gi|332881627|ref|ZP_08449275.1| hypothetical protein HMPREF9074_05065 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357045708|ref|ZP_09107342.1| hypothetical protein HMPREF9441_01351 [Paraprevotella clara YIT
11840]
gi|332680266|gb|EGJ53215.1| hypothetical protein HMPREF9074_05065 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355531373|gb|EHH00772.1| hypothetical protein HMPREF9441_01351 [Paraprevotella clara YIT
11840]
Length = 586
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 49/108 (45%), Gaps = 4/108 (3%)
Query: 335 HANTHIPVVIGSQMRYEVTGDP-LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRLA 393
H++T +G Y +TGD L++ + DI N Y TGG S E +
Sbjct: 206 HSHTFQMNFMGFLRLYRITGDKSLFRKVAGAWDDICNRQM-YITGGVSVAEHYE--HGYV 262
Query: 394 STLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQ 441
+ E+C T + +++++ L T E YAD ER + N V + Q
Sbjct: 263 KPVSGNVVETCATMSWMQLTQMLLELTGESKYADAMERLMMNHVFAAQ 310
>gi|330998039|ref|ZP_08321870.1| hypothetical protein HMPREF9442_02974 [Paraprevotella xylaniphila
YIT 11841]
gi|329569340|gb|EGG51120.1| hypothetical protein HMPREF9442_02974 [Paraprevotella xylaniphila
YIT 11841]
Length = 661
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 61/154 (39%), Gaps = 15/154 (9%)
Query: 335 HANTHIPVVIGSQMRYEVTGDP--LYKVTGTFFMDIVNASHGYATGGTSAGEFWSDPKRL 392
H++T +G Y +TGD KV G + + ++ Y TGG S E +
Sbjct: 280 HSHTFQMNFMGFLRLYRITGDKSLFRKVEGAW--EDIHKRQMYITGGVSVAEHYE--HGY 335
Query: 393 ASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLSIQRGTEPGVMIYM 452
+ E+C T + +++++ L T E YAD ER + N V + Q +
Sbjct: 336 VKPVSGNVVETCATMSWMQLTQMLLELTGESKYADAMERLMMNHVFAAQDCETGTCRYHT 395
Query: 453 LPLGRGDSKAKSYHGWGTRFSSFWCCYGTGLGFI 486
P G A +HG CC +G I
Sbjct: 396 AP--NGTKPASYFHGPD-------CCTASGHRII 420
>gi|227509468|ref|ZP_03939517.1| conserved hypothetical protein, partial [Lactobacillus brevis
subsp. gravesensis ATCC 27305]
gi|227191063|gb|EEI71130.1| conserved hypothetical protein [Lactobacillus brevis subsp.
gravesensis ATCC 27305]
Length = 267
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 70/173 (40%), Gaps = 39/173 (22%)
Query: 163 ELRGHFVG---------HYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMGSGYLSA 213
E+ GH G +L A+A+ + N LK+ ++ +++ Q+ GYLS
Sbjct: 71 EMTGHHYGFPFQDTDVYKWLEAAAYSFGYHPNPDLKQITDNLIDLIAKAQDD--DGYLST 128
Query: 214 F-----PSEQFDRFEALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNR 268
+ P +F R + + Y H I AG+ Y N +AL + K M
Sbjct: 129 YFQIDAPERKFKRLQQSHEL---YTMGHYIEAGVA-YYNATGNEKALDIAKRMAN----- 179
Query: 269 VQNVITKYSVERHWNSLNEETGGMND------VLYRLYTITQDPKHLLLAHLF 315
++ H+ + G + L RLY +TQD K+L LAH F
Sbjct: 180 --------CIDNHFGLEEGKIPGYDGHPEIELALSRLYEVTQDKKYLDLAHYF 224
>gi|386078433|ref|YP_005991958.1| hypothetical protein [Pantoea ananatis PA13]
gi|354987614|gb|AER31738.1| hypothetical protein PAGR_g1212 [Pantoea ananatis PA13]
Length = 651
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 73/187 (39%), Gaps = 51/187 (27%)
Query: 296 LYRLYTITQDPKHLLLAHLF-----DKPCFLGLLAVQADDISGFH-------------AN 337
L RLY +TQ P++L L + F +P F + + S +H +
Sbjct: 193 LMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 252
Query: 338 THIPVV-----IGSQMRYEVTGDPLYKVTGTFFMDIVNASHG-----------------Y 375
H P+ +G +R+ +Y +TG + ++ G Y
Sbjct: 253 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 306
Query: 376 ATGGT---SAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERA 432
TGG S+GE +S L + T ESC + ++ +R + + YAD ERA
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPND--TVYAESCASIGLMMFARRMLEMEADSQYADVMERA 364
Query: 433 LTNGVLS 439
L N VL
Sbjct: 365 LYNTVLG 371
>gi|378766201|ref|YP_005194662.1| hypothetical protein PANA5342_1232 [Pantoea ananatis LMG 5342]
gi|365185675|emb|CCF08625.1| hypothetical protein PANA5342_1232 [Pantoea ananatis LMG 5342]
Length = 651
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 73/187 (39%), Gaps = 51/187 (27%)
Query: 296 LYRLYTITQDPKHLLLAHLF-----DKPCFLGLLAVQADDISGFH-------------AN 337
L RLY +TQ P++L L + F +P F + + S +H +
Sbjct: 193 LMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 252
Query: 338 THIPVV-----IGSQMRYEVTGDPLYKVTGTFFMDIVNASHG-----------------Y 375
H P+ +G +R+ +Y +TG + ++ G Y
Sbjct: 253 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 306
Query: 376 ATGGT---SAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERA 432
TGG S+GE +S L + T ESC + ++ +R + + YAD ERA
Sbjct: 307 ITGGIGSQSSGEAFSSDYDLPND--TVYAESCASIGLMMFARRMLEMEADSQYADVMERA 364
Query: 433 LTNGVLS 439
L N VL
Sbjct: 365 LYNTVLG 371
>gi|291618364|ref|YP_003521106.1| hypothetical Protein PANA_2811 [Pantoea ananatis LMG 20103]
gi|291153394|gb|ADD77978.1| Hypothetical Protein PANA_2811 [Pantoea ananatis LMG 20103]
Length = 659
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 73/186 (39%), Gaps = 51/186 (27%)
Query: 296 LYRLYTITQDPKHLLLAHLF-----DKPCFLGLLAVQADDISGFH-------------AN 337
L RLY +TQ P++L L + F +P F + + S +H +
Sbjct: 201 LMRLYEVTQQPRYLALVNTFVTQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 260
Query: 338 THIPVV-----IGSQMRYEVTGDPLYKVTGTFFMDIVNASHG-----------------Y 375
H P+ +G +R+ +Y +TG + ++ G Y
Sbjct: 261 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 314
Query: 376 ATGGT---SAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERA 432
TGG S+GE +S L + T ESC + ++ +R + + YAD ERA
Sbjct: 315 ITGGIGSQSSGEAFSSDYDLPND--TVYAESCASIGLMMFARRMLEMEADSQYADVMERA 372
Query: 433 LTNGVL 438
L N VL
Sbjct: 373 LYNTVL 378
>gi|386016685|ref|YP_005934975.1| hypothetical protein PAJ_2099 [Pantoea ananatis AJ13355]
gi|327394757|dbj|BAK12179.1| hypothetical protein PAJ_2099 [Pantoea ananatis AJ13355]
Length = 659
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 73/186 (39%), Gaps = 51/186 (27%)
Query: 296 LYRLYTITQDPKHLLLAHLF-----DKPCFLGLLAVQADDISGFH-------------AN 337
L RLY +TQ P++L L + F +P F + + S +H +
Sbjct: 201 LMRLYEVTQQPRYLALVNTFVSQRGTQPHFYDIEYEKRGQTSYWHTYGPAWMVKDKAYSQ 260
Query: 338 THIPVV-----IGSQMRYEVTGDPLYKVTGTFFMDIVNASHG-----------------Y 375
H P+ +G +R+ +Y +TG + ++ G Y
Sbjct: 261 AHQPLAEQQHAVGHAVRF------VYLMTGVAHLARLSQDEGKRQDCLRLWHNMAQRQLY 314
Query: 376 ATGGT---SAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERA 432
TGG S+GE +S L + T ESC + ++ +R + + YAD ERA
Sbjct: 315 ITGGIGSQSSGEAFSSDYDLPND--TVYAESCASIGLMMFARRMLEMEADSQYADVMERA 372
Query: 433 LTNGVL 438
L N VL
Sbjct: 373 LYNTVL 378
>gi|374374966|ref|ZP_09632624.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
gi|373231806|gb|EHP51601.1| protein of unknown function DUF1680 [Niabella soli DSM 19437]
Length = 629
Score = 38.9 bits (89), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 59/300 (19%), Positives = 108/300 (36%), Gaps = 43/300 (14%)
Query: 165 RGHFVGHYLSASAHMWASTHNVTLKEKMTAVVSALSECQNKMG--SGYLSAFPSEQFDRF 222
+ F G +++++ + T + L + + V L Q G Y + +Q+D
Sbjct: 83 QSEFWGKWITSAIDAYNYTKDNRLLKAIQKGVEGLIATQTPDGYIGNYAPQYRLQQWD-- 140
Query: 223 EALKPVWAPYYTIHKILAGLLDQYTFADNTQALKMTKWMVEYFYNRVQNVITKYSVERHW 282
+W Y L GLL Y + ++L K + +Y + V Y+ + +
Sbjct: 141 -----IWGMKYC----LLGLLGYYNCTKDNRSLAAAKKLADYVISAV------YASGKPF 185
Query: 283 NSLNEETG----GMNDVLYRLYTITQDPKHLLLAHLF---------DKPCFLGLLAVQAD 329
N + G + + + LY IT +L A + GL +
Sbjct: 186 NEMGNHRGMAAASILEPVVLLYNITHQASYLKFADFIVASWSNPNASELIKKGLQQIPVG 245
Query: 330 D-----------ISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATG 378
D ++G A + G Y V P Y + + + TG
Sbjct: 246 DRFPTPAVWYGPMNGRKAYEMMSCYEGLMELYRVEKRPEYLEAIVNTAESIRKDEIFVTG 305
Query: 379 GTSAGEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVL 438
S+ E W + ++ +T + E+C T +K+ L R T + +A+ ER N +L
Sbjct: 306 SGSSMESWINGAKIQATPLRHSNETCVTATWMKLCLQLLRTTGDAKWANEIERTFYNALL 365
>gi|255034442|ref|YP_003085063.1| hypothetical protein Dfer_0635 [Dyadobacter fermentans DSM 18053]
gi|254947198|gb|ACT91898.1| protein of unknown function DUF1680 [Dyadobacter fermentans DSM
18053]
Length = 656
Score = 38.5 bits (88), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 75/182 (41%), Gaps = 28/182 (15%)
Query: 280 RHWNSLNEETGGMNDVLYRLYTITQDPKHLLLAHLF-------------------DKPCF 320
R W S ++E + L +LY +T + ++L LA F K C
Sbjct: 197 RPWVSGHQE---IELALMKLYHLTHEDRYLKLADWFLEQRGRGYGKGKIWDEWKDPKYCQ 253
Query: 321 LGLLAVQADDISGFHANTHIPVVIGSQMRYEVTGDPLYKVTGTFFMDIVNASHGYATGGT 380
+ Q +I+G HA + G+ VTGDP Y T + V + Y TGG
Sbjct: 254 DDVPVKQQKEITG-HAVRAMYQYTGAADVASVTGDPGYMNAMTAVWEDVVYRNMYLTGGI 312
Query: 381 SA---GEFWSDPKRLASTLGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGV 437
+ E ++D L + G E+C + M+ ++ + T + Y D ER+L NG
Sbjct: 313 GSSGHNEGFTDDYDLPN--GAAYSETCASVGMVFWNQRMNALTGDAKYIDVLERSLYNGA 370
Query: 438 LS 439
L
Sbjct: 371 LD 372
>gi|160890885|ref|ZP_02071888.1| hypothetical protein BACUNI_03330 [Bacteroides uniformis ATCC 8492]
gi|156859884|gb|EDO53315.1| hypothetical protein BACUNI_03330 [Bacteroides uniformis ATCC 8492]
Length = 663
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 66/164 (40%), Gaps = 30/164 (18%)
Query: 296 LYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVV-----IGSQMRY 350
L RLYT+T D K+L A F L A + +H PV+ +G +R
Sbjct: 223 LVRLYTVTGDKKYLDQAKFF-------LDARGTTARKDIYLQSHKPVLEQEEAVGHAVRA 275
Query: 351 -----------EVTGDPLY-KVTGTFFMDIVNASHGYATGGTSA---GEFWSDPKRLAST 395
+TGD Y K + +IV Y TGG A GE + D L +
Sbjct: 276 GYMYSGMADVAAITGDSSYIKAIDKIWENIVGKKI-YITGGIGARHTGEAFGDNYELPNL 334
Query: 396 LGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLS 439
T E+C + ++ LF + Y D ER L NG++S
Sbjct: 335 --TAYNETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLIS 376
>gi|423303854|ref|ZP_17281853.1| hypothetical protein HMPREF1072_00793 [Bacteroides uniformis
CL03T00C23]
gi|423307425|ref|ZP_17285415.1| hypothetical protein HMPREF1073_00165 [Bacteroides uniformis
CL03T12C37]
gi|392686852|gb|EIY80152.1| hypothetical protein HMPREF1072_00793 [Bacteroides uniformis
CL03T00C23]
gi|392690034|gb|EIY83305.1| hypothetical protein HMPREF1073_00165 [Bacteroides uniformis
CL03T12C37]
Length = 663
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 66/164 (40%), Gaps = 30/164 (18%)
Query: 296 LYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVV-----IGSQMRY 350
L RLYT+T D K+L A F L A + +H PV+ +G +R
Sbjct: 223 LVRLYTVTGDKKYLDQAKFF-------LDARGTTARKDIYLQSHKPVLEQEEAVGHAVRA 275
Query: 351 -----------EVTGDPLY-KVTGTFFMDIVNASHGYATGGTSA---GEFWSDPKRLAST 395
+TGD Y K + +IV Y TGG A GE + D L +
Sbjct: 276 GYMYSGMADVAAITGDSSYIKAIDKIWENIVGKKI-YITGGIGARHAGEAFGDNYELPNL 334
Query: 396 LGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLS 439
T E+C + ++ LF + Y D ER L NG++S
Sbjct: 335 --TAYNETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLIS 376
>gi|270295877|ref|ZP_06202077.1| six-hairpin glycosidase [Bacteroides sp. D20]
gi|270273281|gb|EFA19143.1| six-hairpin glycosidase [Bacteroides sp. D20]
Length = 663
Score = 38.1 bits (87), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 66/164 (40%), Gaps = 30/164 (18%)
Query: 296 LYRLYTITQDPKHLLLAHLFDKPCFLGLLAVQADDISGFHANTHIPVV-----IGSQMRY 350
L RLYT+T D K+L A F L A + +H PV+ +G +R
Sbjct: 223 LVRLYTVTGDKKYLDQAKFF-------LDARGTTARKDIYLQSHKPVLEQEEAVGHAVRA 275
Query: 351 -----------EVTGDPLY-KVTGTFFMDIVNASHGYATGGTSA---GEFWSDPKRLAST 395
+TGD Y K + +IV Y TGG A GE + D L +
Sbjct: 276 GYMYSGMADVAAITGDSSYIKAIDKIWENIVGKKI-YITGGIGARHAGEAFGDNYELPNL 334
Query: 396 LGTENEESCTTYNMLKVSRHLFRWTKEMVYADYYERALTNGVLS 439
T E+C + ++ LF + Y D ER L NG++S
Sbjct: 335 --TAYNETCAAIGNVYMNYRLFLLHGDSKYFDVLERTLYNGLIS 376
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.134 0.427
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,473,110,683
Number of Sequences: 23463169
Number of extensions: 362975581
Number of successful extensions: 695825
Number of sequences better than 100.0: 539
Number of HSP's better than 100.0 without gapping: 488
Number of HSP's successfully gapped in prelim test: 51
Number of HSP's that attempted gapping in prelim test: 693266
Number of HSP's gapped (non-prelim): 630
length of query: 494
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 347
effective length of database: 8,910,109,524
effective search space: 3091808004828
effective search space used: 3091808004828
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 79 (35.0 bits)