BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 009153
(542 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255540793|ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
gi|223550576|gb|EEF52063.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
Length = 809
Score = 911 bits (2355), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/576 (75%), Positives = 489/576 (84%), Gaps = 34/576 (5%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGG LPQSW QQL+LQKKIL R+YELGMNPVLPAFSGNVPAAL+N+FPSAKI
Sbjct: 234 MGNLHRWGGSLPQSWFFQQLILQKKILARMYELGMNPVLPAFSGNVPAALRNIFPSAKIA 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWFSVKSD RWCCTYLLDATDPLFIEIGRAFIEQQL+EYG TSHIYNCDTFDENTPP
Sbjct: 294 RLGNWFSVKSDLRWCCTYLLDATDPLFIEIGRAFIEQQLEEYGSTSHIYNCDTFDENTPP 353
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD P+YIS+LGAA++ GMQSGD+DAVWLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 354 VDDPKYISALGAAVFKGMQSGDNDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGRLVVL 413
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DLFAEVKPIW++S QFYGVPYIWCMLHNFAGN+EMYGILDSIA GPVEARTSEN+TM
Sbjct: 414 DLFAEVKPIWTSSYQFYGVPYIWCMLHNFAGNVEMYGILDSIASGPVEARTSENSTMVGV 473
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
AWIN YS RRYGRSVP+IQDAW++LYHTVY
Sbjct: 474 GMSMEGIEQNPVVYDLMSEMAFQHKKVDVKAWINLYSTRRYGRSVPSIQDAWDILYHTVY 533
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
NCTDGA DKNRDVIVAFPDV+P SV++ ++ GKPVS+ AVLK + SYDHPHLWYS
Sbjct: 534 NCTDGAYDKNRDVIVAFPDVNPFYFSVSQKRHHLNGKPVSRRAVLKENSDSYDHPHLWYS 593
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
TSEV+ ALELFI SG ELS S+TY YDL+DLTRQALAKY NELFL IIE+YQ ND +GV
Sbjct: 594 TSEVLHALELFITSGEELSGSSTYSYDLVDLTRQALAKYGNELFLKIIESYQANDGNGVA 653
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
S++FL+LVEDMD LL CH+GFLLGPWLESAKQLAQ++EQEKQ+EWNARTQITMWFDNT
Sbjct: 654 SRSQKFLDLVEDMDTLLGCHEGFLLGPWLESAKQLAQDQEQEKQFEWNARTQITMWFDNT 713
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++EASLL DYGNKYWSGLL+DYYGPRAAIYFKY+I+SLE+G F LKDWRREWIKLTN+W
Sbjct: 714 EDEASLLHDYGNKYWSGLLQDYYGPRAAIYFKYLIKSLENGKVFPLKDWRREWIKLTNEW 773
Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQGTG-VFDH 542
Q RN +PV+SNG+ALI S+WLY+KYL+ +DH
Sbjct: 774 QRSRNKFPVKSNGNALIISKWLYDKYLRNPDTTYDH 809
>gi|225457148|ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
Length = 813
Score = 911 bits (2355), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/578 (75%), Positives = 486/578 (84%), Gaps = 37/578 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWF+V +PRWCCTYLLDATDPLFIEIG+AFI+QQLKEYGRT HIYNCDTFDENTPP
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPP 354
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 355 VDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVL 414
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYGILD++A GPVEARTSEN+TM
Sbjct: 415 DLFAEVKPIWITSEQFYGVPYIWCMLHNFAGNIEMYGILDAVASGPVEARTSENSTMVGV 474
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
WI YS RRYG+SVP IQDAWN+LYHTVY
Sbjct: 475 GMSMEGIEQNPVVYDLMSEMAFQHSKVDVKVWIALYSTRRYGKSVPEIQDAWNILYHTVY 534
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTE----GKYQNYGKPVSKEAVLKSETSSYDHPH 323
NCTDG+ DKNRDVIVAFPD+DPS I + G Y YGK VS+ VLK T+S++ PH
Sbjct: 535 NCTDGSYDKNRDVIVAFPDIDPSFIPTPKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPH 594
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
LWYSTSEV AL LFIASG +L SNTYRYDL+DLTRQALAKYAN+LFL +IEAYQLND
Sbjct: 595 LWYSTSEVKDALGLFIASGGQLLGSNTYRYDLVDLTRQALAKYANQLFLEVIEAYQLNDV 654
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
G S++FLELVEDMD LLACHDGFLLGPWLESAKQLAQ+E+QE Q+EWNARTQITMW
Sbjct: 655 RGAACHSQKFLELVEDMDTLLACHDGFLLGPWLESAKQLAQDEQQEIQFEWNARTQITMW 714
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
FDNT++EASLLRDYGNKYWSGLLRDYYGPRAAIYFKY++ESLE+G+ F LKDWRREWIKL
Sbjct: 715 FDNTEDEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYLLESLETGNEFALKDWRREWIKL 774
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFD 541
TNDWQN RN YPV S+G+A+ TS+ LYNKYLQ ++D
Sbjct: 775 TNDWQNSRNAYPVRSSGNAIDTSRRLYNKYLQDPEIYD 812
>gi|224121634|ref|XP_002318632.1| predicted protein [Populus trichocarpa]
gi|222859305|gb|EEE96852.1| predicted protein [Populus trichocarpa]
Length = 812
Score = 903 bits (2334), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/575 (73%), Positives = 484/575 (84%), Gaps = 33/575 (5%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NLH WGGPLPQSW DQQLVLQKKIL R+YELGM PVLPAFSGNVPAAL+N+FPSAKIT
Sbjct: 238 MANLHRWGGPLPQSWFDQQLVLQKKILARMYELGMTPVLPAFSGNVPAALRNIFPSAKIT 297
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWFSV+SD RWCCTYLLDATDPLFIEIGRAFIEQQL EYG TSHIYNCDTFDENTPP
Sbjct: 298 RLGNWFSVRSDVRWCCTYLLDATDPLFIEIGRAFIEQQLTEYGSTSHIYNCDTFDENTPP 357
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLG +I+ GMQSGDS+AVWLMQGWLFSYDPFWRPPQ KALL+SVP+G+LVVL
Sbjct: 358 VDDPEYISSLGGSIFEGMQSGDSNAVWLMQGWLFSYDPFWRPPQTKALLHSVPIGRLVVL 417
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
DLFAEVKPIW+TS+QFYGVPYIWCMLHNFAGN+EMYG LDS+A GPVEARTSEN+TM
Sbjct: 418 DLFAEVKPIWNTSEQFYGVPYIWCMLHNFAGNLEMYGYLDSVASGPVEARTSENSTMVGV 477
Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
WI+ YS RRYGRSVP IQ+AWN+LYHTVY
Sbjct: 478 GMSMEGIEQNPVVYDLMSEMAFQKNKVDVKEWIDLYSARRYGRSVPTIQNAWNILYHTVY 537
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
NCTDGA DKNRDVIVAFPDV+P+++S+ +G++ K VS+ A L T SY+HPHLWYS
Sbjct: 538 NCTDGAYDKNRDVIVAFPDVNPNLVSMLQGRHHTDVKLVSRRAALIKNTDSYEHPHLWYS 597
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
T+EV+RALELFIA G+ELS S+TY YDL+DLTRQ LAKYANELFL +IEAY+L D+HGV
Sbjct: 598 TTEVVRALELFIAGGDELSGSSTYSYDLVDLTRQVLAKYANELFLKVIEAYRLKDSHGVA 657
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
S+ FL+LVED+D LLACH+GFLLGPWLESAKQLAQ+EEQ+ Q+EWNARTQITMW+DNT
Sbjct: 658 HQSQMFLDLVEDIDTLLACHEGFLLGPWLESAKQLAQDEEQQIQFEWNARTQITMWYDNT 717
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
+ EASLLRDYGNKYWSGLL+DYYGPRAAIYF ++ +SLE+G GF+LK WRREWIKLTN W
Sbjct: 718 EVEASLLRDYGNKYWSGLLKDYYGPRAAIYFNFLTQSLENGHGFQLKAWRREWIKLTNKW 777
Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH 542
Q R ++PVESNG+AL S+WLY+KYL +DH
Sbjct: 778 QKSRKIFPVESNGNALNISRWLYHKYLGNPDTYDH 812
>gi|356519003|ref|XP_003528164.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
Length = 812
Score = 892 bits (2306), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/567 (74%), Positives = 479/567 (84%), Gaps = 34/567 (5%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSW DQQL+LQKKIL R++ELGM PVLPAFSGNVPAAL+++FPSAKIT
Sbjct: 233 MGNLHGWGGPLPQSWFDQQLILQKKILARMFELGMTPVLPAFSGNVPAALKHIFPSAKIT 292
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWFSVK+D +WCCTYLLDATD LF+EIG+AFIE+QL+EYGRTSHIYNCDTFDENTPP
Sbjct: 293 RLGNWFSVKNDLKWCCTYLLDATDSLFVEIGKAFIEKQLQEYGRTSHIYNCDTFDENTPP 352
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLGAA + GMQSGD DAVWLMQGWLFSYDPFWRPPQMKALL+SVP+GKLVVL
Sbjct: 353 VDDPEYISSLGAATFKGMQSGDDDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGKLVVL 412
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYGILD+IA GP++ARTS N+TM
Sbjct: 413 DLFAEVKPIWVTSEQFYGVPYIWCMLHNFAGNIEMYGILDAIASGPIDARTSNNSTMVGV 472
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
AW++ YS RRYG+++P IQ+ WNVLYHT+Y
Sbjct: 473 GMSMEGIEQNPIVYDLMSEMAFQHKKVDVKAWVDMYSTRRYGQTLPLIQEGWNVLYHTIY 532
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
NCTDGA DKNRDVIVAFPDVDPS+ISV + + KP S ++K T S+D PHLWY
Sbjct: 533 NCTDGAYDKNRDVIVAFPDVDPSLISVQHEQSHHNDKPYSG-TIIKEITDSFDRPHLWYP 591
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
TSEVI ALELFI SG+ELS NTYRYDL+DLTRQ LAKYANELF +IEAYQ +D HG+
Sbjct: 592 TSEVIYALELFITSGDELSRCNTYRYDLVDLTRQVLAKYANELFFKVIEAYQSHDIHGMT 651
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
LS+RFL+LVED+D LLACHDGFLLGPWLESAKQLA NEEQE+Q+EWNARTQITMWFDN+
Sbjct: 652 LLSQRFLDLVEDLDTLLACHDGFLLGPWLESAKQLALNEEQERQFEWNARTQITMWFDNS 711
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
EEASLLRDYGNKYW+GLL DYYGPRAAIYFKY+ ESLESG+ F+L+ WRREWIKLTN+W
Sbjct: 712 DEEASLLRDYGNKYWNGLLHDYYGPRAAIYFKYLRESLESGEDFKLRGWRREWIKLTNEW 771
Query: 508 QNGRNVYPVESNGDALITSQWLYNKYL 534
Q RN++PVES+GDAL TS+WL+NKYL
Sbjct: 772 QKRRNIFPVESSGDALNTSRWLFNKYL 798
>gi|297733843|emb|CBI15090.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 879 bits (2272), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/611 (70%), Positives = 482/611 (78%), Gaps = 70/611 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWF+V +PRWCCTYLLDATDPLFIEIG+AFI+QQLKEYGRT HIYNCDTFDENTPP
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPP 354
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 355 VDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVL 414
Query: 181 DLFAEVKPIWSTSKQFYGVPYIW--------------------------------CMLHN 208
DLFAEVKPIW TS+QFYGVPYIW CMLHN
Sbjct: 415 DLFAEVKPIWITSEQFYGVPYIWKVTKSGRQQSLKFTNEKCCSFFRSHSPDSEVLCMLHN 474
Query: 209 FAGNIEMYGILDSIAFGPV--EARTSENTTMA---------------------------- 238
FAGNIEMYGILD++A GP+ A+ +E+ +
Sbjct: 475 FAGNIEMYGILDAVASGPILLRAKYAESAVVGVGMSMEGIEQNPVVYDLMSEMAFQHSKV 534
Query: 239 ----WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
WI YS RRYG+SVP IQDAWN+LYHTVYNCTDG+ DKNRDVIVAFPD+DPS I
Sbjct: 535 DVKVWIALYSTRRYGKSVPEIQDAWNILYHTVYNCTDGSYDKNRDVIVAFPDIDPSFIPT 594
Query: 295 TE----GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 350
+ G Y YGK VS+ VLK T+S++ PHLWYSTSEV AL LFIASG +L SNT
Sbjct: 595 PKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPHLWYSTSEVKDALGLFIASGGQLLGSNT 654
Query: 351 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 410
YRYDL+DLTRQALAKYAN+LFL +IEAYQLND G S++FLELVEDMD LLACHDGF
Sbjct: 655 YRYDLVDLTRQALAKYANQLFLEVIEAYQLNDVRGAACHSQKFLELVEDMDTLLACHDGF 714
Query: 411 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
LLGPWLESAKQLAQ+E+QE Q+EWNARTQITMWFDNT++EASLLRDYGNKYWSGLLRDYY
Sbjct: 715 LLGPWLESAKQLAQDEQQEIQFEWNARTQITMWFDNTEDEASLLRDYGNKYWSGLLRDYY 774
Query: 471 GPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
GPRAAIYFKY++ESLE+G+ F LKDWRREWIKLTNDWQN RN YPV S+G+A+ TS+ LY
Sbjct: 775 GPRAAIYFKYLLESLETGNEFALKDWRREWIKLTNDWQNSRNAYPVRSSGNAIDTSRRLY 834
Query: 531 NKYLQGTGVFD 541
NKYLQ ++D
Sbjct: 835 NKYLQDPEIYD 845
>gi|449441031|ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
Length = 808
Score = 851 bits (2199), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/568 (71%), Positives = 468/568 (82%), Gaps = 35/568 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLPQSW DQQL+LQKK++ R++ELGM PVLPAFSGN+PAA + ++P+AKIT
Sbjct: 236 MGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKIT 295
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWF+V SDPRWCCTYLLDA DPLF+EIG+AFIEQQ KEYGRTSH+YNCDTFDENTPP
Sbjct: 296 RLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPP 355
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD EYISSLG+AI+ GMQ+GDS+AVWLMQGW+FSYDPFWRP QMKALL+SVPLG+LVVL
Sbjct: 356 VDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVL 415
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
DL+AEVKPIW +S+QFYG+PYIWCMLHNFAGN+EMYGILDSIA GP+EAR+S +TM
Sbjct: 416 DLYAEVKPIWISSEQFYGIPYIWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGV 475
Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W+ QYSVRRYG VP+IQDAW+VLYHTVY
Sbjct: 476 GMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVY 535
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
NCTDGA DKNRDVIVAFPDVDPS I V +G S +V + + +++D PHLWY
Sbjct: 536 NCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHLWYP 593
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF I++AYQL+D +
Sbjct: 594 TSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMA 653
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQYEWNARTQITMWFDNT
Sbjct: 654 SLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNT 713
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
+EEASLLRDYGNKYWSGLL DYY PRAAIY K++ ES E+G F L +WRREWIKLTNDW
Sbjct: 714 EEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDW 773
Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQ 535
Q+ R +YPVESNGDAL TS WLYNKYLQ
Sbjct: 774 QSSRKIYPVESNGDALDTSHWLYNKYLQ 801
>gi|326515664|dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 829
Score = 779 bits (2011), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/573 (63%), Positives = 449/573 (78%), Gaps = 39/573 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y GM+PVLPAFSGN+PAAL+ FPSAK+T
Sbjct: 239 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVT 298
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 299 HLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 358
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + MQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 359 LSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 418
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL+AEVKP+W S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TM
Sbjct: 419 DLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 478
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W+ Y RRYG+SV +QDAW +L+ T+Y
Sbjct: 479 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLY 538
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAVLK-SETSSYDHP 322
NCTDG DKNRDVIVAFPDV+PS+I T G Y +NY +S+ V+K + +Y+ P
Sbjct: 539 NCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSKNYSTMLSENYVMKDAPNDAYEQP 597
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
H+WY T VI ALELF+ SG+E+S S+T+RYDL+DLTRQALAKYAN++FL II+ Y+ N+
Sbjct: 598 HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+ V L RFL LV+D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 658 VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQYEWNARTQITM 717
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFK++I SL+ + F L++WRREWI
Sbjct: 718 WFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFKHLISSLKKKEPFALEEWRREWIS 777
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
LTN+WQ+ R V+ + GDAL S+ L+ KYL+
Sbjct: 778 LTNNWQSDRKVFATTATGDALNISRALFTKYLR 810
>gi|222629680|gb|EEE61812.1| hypothetical protein OsJ_16433 [Oryza sativa Japonica Group]
Length = 1129
Score = 778 bits (2010), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/573 (63%), Positives = 442/573 (77%), Gaps = 39/573 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 537 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 596
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 597 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 656
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 657 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 716
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL+AEVKPIW S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TM
Sbjct: 717 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMVGV 776
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W+ Y RRYG+S+ +QDAW +LY T+Y
Sbjct: 777 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSMVGLQDAWKILYQTLY 836
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 322
NCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y+HP
Sbjct: 837 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 895
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
HLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 896 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 955
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+ V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 956 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQITM 1015
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFKY+I S+E + F L++WRREWI
Sbjct: 1016 WFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIYFKYLILSMEKKEPFALEEWRREWIS 1075
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
LTN+WQ+ V+P + GDAL S+ LY KYL
Sbjct: 1076 LTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 1108
>gi|326519955|dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 829
Score = 778 bits (2009), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/573 (63%), Positives = 448/573 (78%), Gaps = 39/573 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y GM+PVLPAFSGN+PAAL+ FPSAK+T
Sbjct: 239 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVT 298
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 299 HLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 358
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + MQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 359 LSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 418
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL+AEVKP W S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TM
Sbjct: 419 DLYAEVKPAWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 478
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W+ Y RRYG+SV +QDAW +L+ T+Y
Sbjct: 479 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLY 538
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAVLK-SETSSYDHP 322
NCTDG DKNRDVIVAFPDV+PS+I T G Y +NY +S+ V+K + +Y+ P
Sbjct: 539 NCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSKNYSTMLSENYVMKDAPNDAYEQP 597
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
H+WY T VI ALELF+ SG+E+S S+T+RYDL+DLTRQALAKYAN++FL II+ Y+ N+
Sbjct: 598 HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+ V L RFL LV+D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 658 VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQYEWNARTQITM 717
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFK++I SL+ + F L++WRREWI
Sbjct: 718 WFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFKHLISSLKKKEPFALEEWRREWIS 777
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
LTN+WQ+ R V+ + GDAL S+ L+ KYL+
Sbjct: 778 LTNNWQSDRKVFATTATGDALNISRALFTKYLR 810
>gi|218195716|gb|EEC78143.1| hypothetical protein OsI_17702 [Oryza sativa Indica Group]
Length = 829
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/573 (63%), Positives = 442/573 (77%), Gaps = 39/573 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 237 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 296
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 297 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 356
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 357 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 416
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL+AEVKPIW S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TM
Sbjct: 417 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMIGV 476
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W+ Y RRYG+S+ +QDAW +LY T+Y
Sbjct: 477 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSIVGLQDAWKILYQTLY 536
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 322
NCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y+HP
Sbjct: 537 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 595
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
HLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 596 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 655
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+ V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 656 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQITM 715
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFKY+I S+E + F L++WRREWI
Sbjct: 716 WFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIYFKYLILSMEKKEPFALEEWRREWIS 775
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
LTN+WQ+ V+P + GDAL S+ LY KYL
Sbjct: 776 LTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 808
>gi|414585092|tpg|DAA35663.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 831
Score = 773 bits (1996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/578 (63%), Positives = 445/578 (76%), Gaps = 39/578 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYNCDTFDENTPP
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYNCDTFDENTPP 359
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+GK++VL
Sbjct: 360 LSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGKMIVL 419
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL+AEVKP+W S Q YGVPYIWCMLHNFA + EMYG+LD++A GP++AR S+N+TM
Sbjct: 420 DLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGV 479
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W+ Y RRYG+ V +QDAW +LY T+Y
Sbjct: 480 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLY 539
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHP 322
NCTDG DKNRDVIVAFPDV+P +I+ T G + N Y SK + K +S +Y+HP
Sbjct: 540 NCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHP 598
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
HLWY T+ VI ALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+
Sbjct: 599 HLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNN 658
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+ V L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITM
Sbjct: 659 MNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITM 718
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
WFDNT+ +ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+ F LK+WRREWI
Sbjct: 719 WFDNTETKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWIS 778
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
LTN+WQ+ R V+ + GD L SQ LY KYL +
Sbjct: 779 LTNNWQSDRKVFSTTATGDPLNISQSLYTKYLSNADLL 816
>gi|38345908|emb|CAE04506.2| OSJNBb0059K02.16 [Oryza sativa Japonica Group]
Length = 829
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/573 (63%), Positives = 442/573 (77%), Gaps = 39/573 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 237 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 296
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 297 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 356
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 357 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 416
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL+AEVKPIW S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TM
Sbjct: 417 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMVGV 476
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W+ Y RRYG+S+ +QDAW +LY T+Y
Sbjct: 477 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSMVGLQDAWKILYQTLY 536
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 322
NCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y+HP
Sbjct: 537 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 595
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
HLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 596 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 655
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+ V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 656 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQITM 715
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFKY+I S+E + F L++WRREWI
Sbjct: 716 WFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIYFKYLILSMEKKEPFALEEWRREWIS 775
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
LTN+WQ+ V+P + GDAL S+ LY KYL
Sbjct: 776 LTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 808
>gi|357166414|ref|XP_003580702.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
distachyon]
Length = 829
Score = 771 bits (1990), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/573 (63%), Positives = 449/573 (78%), Gaps = 39/573 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y GM+PVLPAFSG++PAAL++ FPSAK+T
Sbjct: 238 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGSIPAALKSKFPSAKVT 297
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 298 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 357
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 358 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 417
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL+AEVKP+W S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TM
Sbjct: 418 DLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 477
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W+ Y RRYG+S+ +QDAW +L+ T+Y
Sbjct: 478 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLQVWVETYPTRRYGKSIVELQDAWRILHQTLY 537
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVLKSETS-SYDHP 322
NCTDG DKNRDVIVAFPDV+P +I T G + + K S K ++K E++ +Y+ P
Sbjct: 538 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLHTSASKMFSTMSAKSYLVKDESNDAYEQP 596
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
HLWY T+ VIRAL+LF+ G+E+S S+T+RYDL+DLTRQALAKYAN++F II++Y+ N+
Sbjct: 597 HLWYDTNVVIRALQLFLQYGDEVSDSSTFRYDLVDLTRQALAKYANQIFAKIIQSYKSNN 656
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+ V LS FL+LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 657 MNQVTTLSECFLDLVNDLDMLLASHEGFLLGPWLESAKGLARDQEQEIQYEWNARTQITM 716
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
WFDNT+ +ASLLRDY NKYWSGLL DYYGPRAAIYFKY+I SLE + F L++WRREWI
Sbjct: 717 WFDNTETKASLLRDYANKYWSGLLGDYYGPRAAIYFKYLILSLEKKEPFALEEWRREWIS 776
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
LTN+WQ+ R V+ + GDAL ++ LY KYL+
Sbjct: 777 LTNNWQSDRKVFATAATGDALNIARSLYMKYLR 809
>gi|218192858|gb|EEC75285.1| hypothetical protein OsI_11626 [Oryza sativa Indica Group]
Length = 812
Score = 692 bits (1785), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/570 (57%), Positives = 412/570 (72%), Gaps = 35/570 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVP+ + +FPSA IT
Sbjct: 243 MGNLHGWGGPLSQNWLDQQLTLQKKILSRMIELGMVPVLPSFSGNVPSVFKKLFPSANIT 302
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 303 KLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQAFIRQQMKEYGDITNIYNCDTFNENTPP 362
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLG+AIY M G+ DAVWLMQGWLF D FW+ PQMKALL+SVP GK++V
Sbjct: 363 TNEPAYISSLGSAIYEAMSRGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPTGKMIV 422
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYGILDSIA GP++ARTS N+TM
Sbjct: 423 LDLFADVKPIWQMSSQFYGVPYIWCMLHNFGGNIEMYGILDSIASGPIDARTSHNSTMVG 482
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ YS RRYG+S ++ AW +LYHT+
Sbjct: 483 VGMCMEGIEHNPVVYELMSEMAFRSQKVEVEDWLKIYSYRRYGQSNVEVEKAWGILYHTI 542
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 325
YNCTDG D N+D IV FPD+ P+ S K + + + SE S S HPHLW
Sbjct: 543 YNCTDGIADHNKDYIVQFPDISPNSFSSDVSKRKAISEVKKHRRFVLSEVSASLPHPHLW 602
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
YST E I+ALELF+ +GN+LS S TYRYDL+DLTRQ+L+K ANE++L+ + AY+ D++G
Sbjct: 603 YSTKEAIKALELFLNAGNDLSKSLTYRYDLVDLTRQSLSKLANEVYLDAMNAYRKKDSNG 662
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
+ +++FLEL+ D+D LLA D FLLGPWLE AK LA+ E + KQYEWNARTQ+TMW+D
Sbjct: 663 LNFYTKKFLELIVDIDTLLASDDNFLLGPWLEDAKSLARTENERKQYEWNARTQVTMWYD 722
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
NT+ E S L DY NK+WSGLL+ YY PRA+ YF + + L+ F+L++WR++WI +N
Sbjct: 723 NTKTEQSKLHDYANKFWSGLLKSYYLPRASKYFSRLTKGLQENQSFQLEEWRKDWIAYSN 782
Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
+WQ+G+ +Y V++ GDAL S L+ KY +
Sbjct: 783 EWQSGKELYAVKATGDALAISSSLFKKYFR 812
>gi|413955691|gb|AFW88340.1| hypothetical protein ZEAMMB73_315381 [Zea mays]
Length = 814
Score = 689 bits (1777), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 326/570 (57%), Positives = 410/570 (71%), Gaps = 35/570 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVPA +FPSA IT
Sbjct: 244 MGNLHGWGGPLSQNWLDQQLALQKKILSRMIELGMVPVLPSFSGNVPAIFAKLFPSANIT 303
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V ++P+WCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 304 RLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQAFIRQQIKEYGDVTNIYNCDTFNENTPP 363
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
D P YISSLG+AIY M G+ +AVWLMQGWLF D FW+ PQMKALL+SVP+GK++V
Sbjct: 364 TDEPAYISSLGSAIYEAMSRGNKNAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 423
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYGILDSI+ GP++ARTS N+TM
Sbjct: 424 LDLFADVKPIWKVSSQFYGVPYIWCMLHNFGGNIEMYGILDSISSGPIDARTSYNSTMIG 483
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ YS RRYG++ I+ AW LYHT+
Sbjct: 484 VGMCMEGIEHNPVVYELMSEMAFHNKKVEVEDWLKTYSCRRYGQANADIEKAWRYLYHTI 543
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS-YDHPHLW 325
YNCTDG D N+D IV FPD+ PS ++ K + + SE S PHLW
Sbjct: 544 YNCTDGIADHNKDYIVEFPDISPSSVTYQVSKRRGMSITRNHRRFFLSEVSGILPQPHLW 603
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
YST E ++ALELF+ +G+ S S TYRYDL+DLTRQ L+K ANE++L+ I YQ D+HG
Sbjct: 604 YSTKEAVKALELFLDAGSTFSESLTYRYDLVDLTRQCLSKLANEVYLDAISLYQKKDSHG 663
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
+ +R+FLE++ D+D LLA D FLLGPWLESAK LA E++ +QYEWNARTQ+TMW+D
Sbjct: 664 LNAHARKFLEIIVDIDTLLAADDNFLLGPWLESAKSLAITEKERQQYEWNARTQVTMWYD 723
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
NT+ E S L DY NK+WSGLL+ YY PRA+ YF Y+ SL+ F+L++WR++WI +N
Sbjct: 724 NTETEQSKLHDYANKFWSGLLKSYYLPRASKYFAYLTRSLQENRSFQLEEWRKDWISYSN 783
Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
+WQ+G+ VY V++ GDAL ++ LY KYL+
Sbjct: 784 EWQSGKEVYAVKATGDALAIARSLYRKYLR 813
>gi|222624949|gb|EEE59081.1| hypothetical protein OsJ_10898 [Oryza sativa Japonica Group]
Length = 812
Score = 687 bits (1772), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 326/569 (57%), Positives = 409/569 (71%), Gaps = 35/569 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVP+ + +FPSA IT
Sbjct: 243 MGNLHGWGGPLSQNWLDQQLTLQKKILSRMIELGMVPVLPSFSGNVPSVFKKLFPSANIT 302
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 303 KLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQAFIRQQMKEYGDITNIYNCDTFNENTPP 362
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLG+AIY M G+ DAVWLMQGWLF D FW+ PQMKALL+SVP GK++V
Sbjct: 363 TNEPAYISSLGSAIYEAMSRGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPTGKMIV 422
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYGILDSIA GP++ARTS N+TM
Sbjct: 423 LDLFADVKPIWQMSSQFYGVPYIWCMLHNFGGNIEMYGILDSIASGPIDARTSHNSTMVG 482
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ YS RRYG+S ++ AW +LYHT+
Sbjct: 483 VGMCMEGIEHNPVVYELMSEMAFRSQKVEVEDWLKIYSYRRYGQSNVEVEKAWGILYHTI 542
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 325
YNCTDG D N D IV FPD+ P+ S K + + + SE S S HPHLW
Sbjct: 543 YNCTDGIADHNNDYIVEFPDISPNSFSSDVSKRKAISEVKKHRRFVLSEVSASLPHPHLW 602
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
YST E I+ALELF+ +GN+LS S TYRYDL+DLTRQ+L+K ANE++L+ + AY+ D++G
Sbjct: 603 YSTKEAIKALELFLNAGNDLSKSLTYRYDLVDLTRQSLSKLANEVYLDAMNAYRKKDSNG 662
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
+ +++FLEL+ D+D LLA D FLLGPWLE AK LA+ E + KQYEWNARTQ+TMW+D
Sbjct: 663 LNFYTKKFLELIVDIDTLLASDDNFLLGPWLEDAKSLARTENERKQYEWNARTQVTMWYD 722
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
NT+ E S L DY NK+WSGLL+ YY PRA+ YF + + L+ F+L++W ++WI +N
Sbjct: 723 NTKTEQSKLHDYANKFWSGLLKSYYLPRASKYFSRLTKGLQENQSFQLEEWTKDWIAYSN 782
Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYL 534
+WQ+G+ +Y V++ GDAL S L+ KY
Sbjct: 783 EWQSGKELYAVKATGDALAISSSLFKKYF 811
>gi|357112065|ref|XP_003557830.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
distachyon]
Length = 809
Score = 680 bits (1755), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/571 (57%), Positives = 416/571 (72%), Gaps = 39/571 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL Q+WLD QL LQKKIL R+ ELGM PVLP+FSGNVP A + +FPSA IT
Sbjct: 240 MGNLHAWGGPLSQNWLDGQLALQKKILSRMTELGMVPVLPSFSGNVPVAFKKLFPSANIT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V DPRWCCTY+LD +D LFI++G AFI QQ+KEYG + IYNCDTF+ENTPP
Sbjct: 300 RLGEWNTVDGDPRWCCTYILDPSDALFIDVGHAFIRQQIKEYGDITSIYNCDTFNENTPP 359
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLG+AIY M SG+ DAVWLMQGWLF D FW+ PQMKALL+SVP+GK++V
Sbjct: 360 TNEPAYISSLGSAIYEAMSSGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 419
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFA+VKP+W S QFYGVPYIWCMLHNF GNIEMYGILDSI+ GP++ARTS +TM
Sbjct: 420 LDLFADVKPVWKMSSQFYGVPYIWCMLHNFGGNIEMYGILDSISSGPIDARTSYGSTMVG 479
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ YS RRYG+S I+ AW VLYHT+
Sbjct: 480 VGMTMEGIEHNPVVFELMSEMSFRSQKVEVEDWLKSYSYRRYGQSNVKIEKAWGVLYHTI 539
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA---VLKSETSSYDHPH 323
YNCTDG D NRD IV FPD+ PS S K + G P+ ++ L +++ HPH
Sbjct: 540 YNCTDGIADHNRDYIVEFPDMSPSSFSSHFSKQR--GMPIVRKHPRFFLSEVSANLPHPH 597
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
LWYST+E ++ALELF+ +GN+LS S T+RYDL+DLTRQ+L+K AN+++L+ +++Y+ ++
Sbjct: 598 LWYSTNEAVKALELFLNAGNDLSKSLTFRYDLVDLTRQSLSKLANKVYLDAMDSYKNKNS 657
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
G+ +++FLEL+ D+D LLA D FLLGPWLESAK LA +EE+ KQYEWNARTQ+TMW
Sbjct: 658 SGLNFHTKKFLELIVDIDILLASDDNFLLGPWLESAKSLAMSEEERKQYEWNARTQVTMW 717
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+DNT+ E S L DY NK+WSGLL++YY PRA+ YF + SL+ F+L++WRR+WI
Sbjct: 718 YDNTKTEQSHLHDYANKFWSGLLKNYYLPRASKYFTGLSRSLQENRSFQLEEWRRDWISY 777
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
+N+WQ+G +YPV++ GDAL S+ L+ KYL
Sbjct: 778 SNEWQSGEELYPVKAKGDALAISKSLFRKYL 808
>gi|297736304|emb|CBI24942.3| unnamed protein product [Vitis vinifera]
Length = 868
Score = 679 bits (1751), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/575 (56%), Positives = 408/575 (70%), Gaps = 42/575 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLD+QLVLQK+IL R+ ELGM PVLP+FSGNVP AL+ +FPSA IT
Sbjct: 294 MGNLHGWGGPLSQNWLDEQLVLQKQILCRMLELGMTPVLPSFSGNVPEALKKIFPSANIT 353
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI QQ+KEYG + IYNCDTF+EN+PP
Sbjct: 354 RLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQQIKEYGDVTDIYNCDTFNENSPP 413
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLGAAIY M GD D+VWLMQGWLF D FW+PPQMKALL+SVP GK+VV
Sbjct: 414 TNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSDSGFWKPPQMKALLHSVPFGKMVV 473
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMYGILD+++ GPV+AR S+N+TM
Sbjct: 474 LDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMYGILDAVSSGPVDARISKNSTMVG 533
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ YS RRYG++V ++ AW +LY T+
Sbjct: 534 VGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTYSYRRYGKAVHHVEAAWEILYRTI 593
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
YNCTDG D N D +V FPD DPS+ IS + Q + +L ETSS D
Sbjct: 594 YNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHIIQKILTQTGRRKILFQETSS-DL 652
Query: 322 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
P HLWYST EV+ AL LF+ +GNELS S+TYRYDL+DLTRQ L+K N+++L+ + A++
Sbjct: 653 PQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLVDLTRQVLSKLGNQVYLDAVIAFR 712
Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
DA S++F++LV+D+D LLA D FLLG WLESAK+LA N + +QYEWNARTQ
Sbjct: 713 QKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWLESAKKLAVNPREMEQYEWNARTQ 772
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
+TMWF T+ S L DY NK+WSGLL +YY PRA++YF Y+ ++L F+L++WRRE
Sbjct: 773 LTMWFYVTKTNQSKLHDYANKFWSGLLENYYLPRASMYFSYLAKALTENKNFKLEEWRRE 832
Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
WI +N WQ G+ +YPV + GD L S+ LY KY
Sbjct: 833 WISYSNKWQAGKELYPVRAKGDTLAISRALYEKYF 867
>gi|356534602|ref|XP_003535842.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
Length = 807
Score = 679 bits (1751), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/573 (56%), Positives = 400/573 (69%), Gaps = 39/573 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 230 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 289
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCTYLLD +DPLF+EIG AFI +Q+KEYG + IYNCDTF+EN+PP
Sbjct: 290 RLGDWNTVDGDPRWCCTYLLDPSDPLFVEIGEAFIRKQIKEYGDVTDIYNCDTFNENSPP 349
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ PEYIS+LGAA+Y G+ GD DAVWLMQGWLF D FW+PPQMKALL+SVP GK++V
Sbjct: 350 TNDPEYISNLGAAVYKGISKGDKDAVWLMQGWLFYSDSSFWKPPQMKALLHSVPFGKMIV 409
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFA+VKPIW S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S N+TM
Sbjct: 410 LDLFADVKPIWKNSFQFYGTPYIWCMLHNFGGNIEMYGTLDSISSGPVDARVSANSTMVG 469
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
WI Y RRYG+ + ++ AW +LYHT+
Sbjct: 470 VGMCMEGIEQNPIVYELMSEMAFRDKKVKVSEWIKSYCHRRYGKVIHQVESAWEILYHTI 529
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG----KPVSKEAVLKSETSSYDHP 322
YNCTDG D N D IV FPD +PS SVT G N P ++ + + S
Sbjct: 530 YNCTDGIADHNHDFIVMFPDWNPSTNSVT-GTSNNQKIYLLPPGNRRYLFQETLSDMPQA 588
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
HLWY + +VI+AL+LF+A G L+ S TYRYDL+DLTRQ L+K AN+++ + +YQ +
Sbjct: 589 HLWYPSDDVIKALQLFLAGGKNLAGSLTYRYDLVDLTRQVLSKLANQVYHKAVTSYQKKN 648
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+ S +FL+L++D+D LLA D FLLG WLESAK+LA N + KQYEWNARTQ+TM
Sbjct: 649 IEALQFHSNKFLQLIKDIDVLLASDDNFLLGTWLESAKKLAVNPSEIKQYEWNARTQVTM 708
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
WFD + S L DY NK+WSGLL YY PRA+ YF ++ ESL D F+L +WR++WI
Sbjct: 709 WFDTNETTQSKLHDYANKFWSGLLESYYLPRASTYFSHLTESLRQNDKFKLIEWRKQWIS 768
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
+N WQ G +YPV++ GDAL SQ LY KY Q
Sbjct: 769 QSNKWQEGNELYPVKAKGDALTISQALYEKYFQ 801
>gi|225450036|ref|XP_002273084.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
Length = 803
Score = 678 bits (1749), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/575 (56%), Positives = 408/575 (70%), Gaps = 42/575 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLD+QLVLQK+IL R+ ELGM PVLP+FSGNVP AL+ +FPSA IT
Sbjct: 229 MGNLHGWGGPLSQNWLDEQLVLQKQILCRMLELGMTPVLPSFSGNVPEALKKIFPSANIT 288
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI QQ+KEYG + IYNCDTF+EN+PP
Sbjct: 289 RLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQQIKEYGDVTDIYNCDTFNENSPP 348
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLGAAIY M GD D+VWLMQGWLF D FW+PPQMKALL+SVP GK+VV
Sbjct: 349 TNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSDSGFWKPPQMKALLHSVPFGKMVV 408
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMYGILD+++ GPV+AR S+N+TM
Sbjct: 409 LDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMYGILDAVSSGPVDARISKNSTMVG 468
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ YS RRYG++V ++ AW +LY T+
Sbjct: 469 VGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTYSYRRYGKAVHHVEAAWEILYRTI 528
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
YNCTDG D N D +V FPD DPS+ IS + Q + +L ETSS D
Sbjct: 529 YNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHIIQKILTQTGRRKILFQETSS-DL 587
Query: 322 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
P HLWYST EV+ AL LF+ +GNELS S+TYRYDL+DLTRQ L+K N+++L+ + A++
Sbjct: 588 PQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLVDLTRQVLSKLGNQVYLDAVIAFR 647
Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
DA S++F++LV+D+D LLA D FLLG WLESAK+LA N + +QYEWNARTQ
Sbjct: 648 QKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWLESAKKLAVNPREMEQYEWNARTQ 707
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
+TMWF T+ S L DY NK+WSGLL +YY PRA++YF Y+ ++L F+L++WRRE
Sbjct: 708 LTMWFYVTKTNQSKLHDYANKFWSGLLENYYLPRASMYFSYLAKALTENKNFKLEEWRRE 767
Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
WI +N WQ G+ +YPV + GD L S+ LY KY
Sbjct: 768 WISYSNKWQAGKELYPVRAKGDTLAISRALYEKYF 802
>gi|224106113|ref|XP_002314048.1| predicted protein [Populus trichocarpa]
gi|222850456|gb|EEE88003.1| predicted protein [Populus trichocarpa]
Length = 806
Score = 667 bits (1720), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/573 (54%), Positives = 404/573 (70%), Gaps = 39/573 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPAAL+ +FPSA IT
Sbjct: 233 MGNLHGWGGPLSQNWLDQQLCLQKQILSRMLELGMTPVLPSFSGNVPAALKKIFPSANIT 292
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V +PRWCCTYLL+ +DPLF+EIG AFI QQ+KEYG + IYNCDTF+EN+PP
Sbjct: 293 RLGDWNTVDKNPRWCCTYLLNPSDPLFVEIGEAFIRQQVKEYGDVTDIYNCDTFNENSPP 352
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
P YISSLGAA+Y M GD DAVWLMQGWLF D FW+PPQM+ALL+SVP GK++V
Sbjct: 353 TSDPAYISSLGAAVYKAMSRGDKDAVWLMQGWLFYSDSAFWKPPQMQALLHSVPFGKMIV 412
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFAE KPIW S QFYG PY+WC+LHNF GNIEMYGILD+I+ GPV+AR EN+TM
Sbjct: 413 LDLFAEAKPIWKNSSQFYGTPYVWCLLHNFGGNIEMYGILDAISSGPVDARIIENSTMVG 472
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ YS RRYG++V + AW++LYHT+
Sbjct: 473 VGMCMEGIEHNPVVYELMSEMAFRSGKPQVLEWLKTYSRRRYGKAVRQVVAAWDILYHTI 532
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV-----SKEAVLKSETSSYDH 321
YNCTDG D N D IV FPD DPS+ S + Q+ + + ++ + + +S +
Sbjct: 533 YNCTDGIADHNTDFIVKFPDWDPSLHSGSNISEQDNMRILLTSSGTRRFLFQETSSDFPE 592
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
HLWYST EVI+AL LF+ +GN+L+ S TYRYDL+DLTRQ L+K AN+++ + + A++
Sbjct: 593 AHLWYSTQEVIQALWLFLDAGNDLAGSPTYRYDLVDLTRQVLSKLANQVYRDAMIAFRRK 652
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
DA + ++FL++++D+D LLA D FLLG WLESAK+LA + K YEWNARTQ+T
Sbjct: 653 DARALNLHGQKFLQIIKDIDVLLASDDNFLLGTWLESAKKLAVDPNDMKLYEWNARTQVT 712
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 501
MW+D T+ S L DY NK+WSGLL DYY PRA+ YF ++++SLE F+L +WR+EWI
Sbjct: 713 MWYDTTKTNQSQLHDYANKFWSGLLEDYYLPRASTYFGHLMKSLEENKNFKLTEWRKEWI 772
Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
+N WQ +YPV++ GDAL ++ LY KY
Sbjct: 773 AFSNKWQADTKIYPVKAKGDALAIAKALYRKYF 805
>gi|357458267|ref|XP_003599414.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488462|gb|AES69665.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 832
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/599 (53%), Positives = 400/599 (66%), Gaps = 66/599 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353
Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
EYG + IYNCDTF+EN+PP P YIS+LGAA+Y G+ GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413
Query: 155 SYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473
Query: 214 EMYGILDSIAFGPVEARTSENTTMA---------------------------------WI 240
EMYG+LD+IA GPV+AR SEN+TM W+
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKINEWL 533
Query: 241 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 300
YS RRYG+++ + AW +LYHT+YN TDG D N D IV PD DPS +V G
Sbjct: 534 KSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHNHDYIVMLPDWDPS-AAVKSGMSN 592
Query: 301 NYGK-----PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 355
+ K P ++ + + + HLWY +VI+AL+LF+A G L S TYRYDL
Sbjct: 593 HQKKIYFLPPGNRRYLFQQTPAGMPQAHLWYPPEDVIKALQLFLAGGKNLKGSLTYRYDL 652
Query: 356 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 415
+DLTRQ L+K+AN++++ I ++Q + + S FLEL++D+D LLA D FLLG W
Sbjct: 653 VDLTRQVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELIKDIDLLLASDDNFLLGTW 712
Query: 416 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 475
L+SAK+LA N + KQYEWNARTQ+TMWFD + S L DY NK+WSG+L +YY PRA+
Sbjct: 713 LQSAKKLAVNPSELKQYEWNARTQVTMWFDTNETTQSKLHDYANKFWSGILENYYLPRAS 772
Query: 476 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
YF ++ ESL+ + F L +WR+EWI ++N WQ G +YPV++ GDAL SQ LY KY
Sbjct: 773 TYFSHLSESLKQNEKFNLTEWRKEWIPMSNKWQEGSELYPVKAKGDALTISQALYKKYF 831
>gi|357458271|ref|XP_003599416.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488464|gb|AES69667.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 807
Score = 651 bits (1680), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/594 (53%), Positives = 392/594 (65%), Gaps = 81/594 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353
Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
EYG + IYNCDTF+EN+PP P YIS+LGAA+Y G+ GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413
Query: 155 SYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473
Query: 214 EMYGILDSIAFGPVEARTSENTTMA---------------------------------WI 240
EMYG+LD+IA GPV+AR SEN+TM W+
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKINEWL 533
Query: 241 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 300
YS RRYG+++ + AW +LYHT+YN TDG D N D IV PD DPS + G Q
Sbjct: 534 KSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHNHDYIVMLPDWDPSAAVKSAGMPQ 593
Query: 301 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 360
HLWY +VI+AL+LF+A G L S TYRYDL+DLTR
Sbjct: 594 ---------------------AHLWYPPEDVIKALQLFLAGGKNLKGSLTYRYDLVDLTR 632
Query: 361 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 420
Q L+K+AN++++ I ++Q + + S FLEL++D+D LLA D FLLG WL+SAK
Sbjct: 633 QVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELIKDIDLLLASDDNFLLGTWLQSAK 692
Query: 421 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 480
+LA N + KQYEWNARTQ+TMWFD + S L DY NK+WSG+L +YY PRA+ YF +
Sbjct: 693 KLAVNPSELKQYEWNARTQVTMWFDTNETTQSKLHDYANKFWSGILENYYLPRASTYFSH 752
Query: 481 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
+ ESL+ + F L +WR+EWI ++N WQ G +YPV++ GDAL SQ LY KY
Sbjct: 753 LSESLKQNEKFNLTEWRKEWIPMSNKWQEGSELYPVKAKGDALTISQALYKKYF 806
>gi|15240689|ref|NP_196873.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|9758035|dbj|BAB08696.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|19423948|gb|AAL87291.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|21436231|gb|AAM51254.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|332004545|gb|AED91928.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
Length = 806
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/576 (54%), Positives = 400/576 (69%), Gaps = 43/576 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL ++WLD QL+LQK+IL R+ + GM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMTPVLPSFSGNVPSALRKIYPEANIT 290
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L NW +V D RWCCTYLL+ +DPLFIEIG AFI+QQ +EYG ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIKQQTEEYGEITNIYNCDTFNENTPP 350
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEYISSLGAA+Y M G+ +AVWLMQGWLFS D FW+PPQ+KALL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQLKALLHSVPFGKMIV 410
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL+AEVKPIW+ S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TM
Sbjct: 411 LDLYAEVKPIWNKSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ Y+ RRY + I+ AW +LYHTV
Sbjct: 471 VGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS-------KEAVLKSETSSY 319
YNCTDG D N D IV PD DPS SV + Q +S + + + +T+
Sbjct: 531 YNCTDGIADHNTDFIVKLPDWDPSS-SVQDDLKQKDSYMISTGPYETKRRVLFQDKTADL 589
Query: 320 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
HLWYST EVI+AL+LF+ +G++LS S TYRYD++DLTRQ L+K AN+++ + A+
Sbjct: 590 PKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYTEAVTAFV 649
Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
D + QLS +FLEL++DMD LLA D LLG WLESAK+LA+N ++ KQYEWNARTQ
Sbjct: 650 KKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQYEWNARTQ 709
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
+TMW+D+ S L DY NK+WSGLL DYY PRA +YF M++SL F+++ WRRE
Sbjct: 710 VTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRARLYFNEMLKSLRDKKIFKVEKWRRE 769
Query: 500 WIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKYL 534
WI +++ WQ + VYPV++ GDAL S+ L +KY
Sbjct: 770 WIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKYF 805
>gi|297807393|ref|XP_002871580.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
lyrata]
gi|297317417|gb|EFH47839.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
lyrata]
Length = 806
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/576 (55%), Positives = 403/576 (69%), Gaps = 43/576 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL ++WL+ QL+LQK+IL ++ +LGM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHTWGGPLSKNWLNDQLILQKQILSQMLKLGMTPVLPSFSGNVPSALRKIYPGANIT 290
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L NW +V D RWCCTYLL+ +DPLFI+IG AFI+QQ +EYG ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIDIGEAFIKQQPEEYGEITNIYNCDTFNENTPP 350
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEYISSLGAA+Y M G+ +AVWLMQGWLFS D FW+PPQMK LL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQMKVLLHSVPFGKMIV 410
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL+AEVKPIW+TS QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TM
Sbjct: 411 LDLYAEVKPIWNTSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ Y+ RRY + I+ AW +LYHTV
Sbjct: 471 VGMCMEGIEQNPVVYELISEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN-----YGKPVSKEAVLKSETSSYDH 321
YNCTDG D N D IV PD DPS E K+ + G +K VL + SS D
Sbjct: 531 YNCTDGIADHNTDFIVKLPDWDPSSSVQDESKHTDSYMISTGPYETKRRVLFQDKSS-DL 589
Query: 322 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
P HLWYST EVI+AL+LF+ +G+ELS S TYRYD++DLTRQ L+K AN++++ + A+
Sbjct: 590 PKAHLWYSTKEVIQALKLFLEAGDELSRSLTYRYDMVDLTRQVLSKLANQVYIEAVTAFV 649
Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
D + QLS +FLEL++D+D LLA D FLLG WLESAK+LA+N ++ KQYEWNARTQ
Sbjct: 650 KKDIGSLGQLSEKFLELIKDIDVLLASDDNFLLGTWLESAKKLARNGDERKQYEWNARTQ 709
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
+TMW+D+ S L DY NK WSGLL DYY PRA +YF M++SL F+++ W+RE
Sbjct: 710 VTMWYDSKDVNQSKLHDYANKLWSGLLEDYYLPRARLYFNEMLKSLRDKKKFKVEKWQRE 769
Query: 500 WIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKYL 534
WI +++ WQ + VYPV++ GDAL S+ L KY
Sbjct: 770 WIMMSHKWQQSSSEVYPVKAKGDALAISKHLLLKYF 805
>gi|242035709|ref|XP_002465249.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
gi|241919103|gb|EER92247.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
Length = 777
Score = 641 bits (1653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/541 (57%), Positives = 397/541 (73%), Gaps = 16/541 (2%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKK+L R+ ELGM PVLP+FSGNVPA +FPSA IT
Sbjct: 244 MGNLHGWGGPLSQNWLDQQLALQKKVLSRMIELGMVPVLPSFSGNVPAVFAKLFPSANIT 303
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG+W +V ++P+WCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 304 LLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQAFIRQQIKEYGDVTNIYNCDTFNENTPP 363
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
D P YISSLG+AIY M G+ +AVWLMQGWLF D FW+ PQMKALL+SVP+GK++V
Sbjct: 364 TDEPAYISSLGSAIYEAMSRGNKNAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 423
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYG+LDSI+ GP++ARTS N+TM
Sbjct: 424 LDLFADVKPIWKMSSQFYGVPYIWCMLHNFGGNIEMYGVLDSISSGPIDARTSYNSTMIG 483
Query: 240 INQYSVRRYGRSVPAIQDAWNVLYHTV-----YNCTDGATDKNRDVIVAFPDVDPSIISV 294
+ G + I+ V+Y + +N D N+D IV FPD+ PS IS
Sbjct: 484 V--------GMCMEGIEHN-PVVYELMSEMAFHNKKVEVEDHNKDYIVEFPDISPSSISS 534
Query: 295 TEGKYQNYGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 353
K + + SE S S HPHLWYST E I+ALELF+ +G+ S S TYRY
Sbjct: 535 QLSKRRGMSIMRNHRRFFLSEVSGSLPHPHLWYSTKEAIKALELFLDAGSTFSKSLTYRY 594
Query: 354 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
DL+DLTRQ L+K ANE++L+ + +YQ D++G+ +R+FLE++ D+D LLA D FLLG
Sbjct: 595 DLVDLTRQCLSKLANEVYLDAMSSYQKKDSNGLNSHTRKFLEIIMDIDTLLAADDNFLLG 654
Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
PWLESAK LA E++ +QYEWNARTQ+TMW+DNT+ E S L DY NK+WSGLL+ YY PR
Sbjct: 655 PWLESAKSLAITEKERQQYEWNARTQVTMWYDNTETEQSKLHDYANKFWSGLLKSYYLPR 714
Query: 474 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
A+ YF Y+ SL+ F+L++WR++WI +N+WQ+G+ VY V++ GDAL ++ LY KY
Sbjct: 715 ASKYFAYLTRSLQENQSFQLEEWRKDWISYSNEWQSGKEVYAVKATGDALAIARSLYRKY 774
Query: 534 L 534
L
Sbjct: 775 L 775
>gi|449436325|ref|XP_004135943.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
Length = 774
Score = 632 bits (1630), Expect = e-178, Method: Compositional matrix adjust.
Identities = 311/568 (54%), Positives = 382/568 (67%), Gaps = 57/568 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL ++WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPA L +FPSA IT
Sbjct: 229 MGNLHGWGGPLSKNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAGLVEIFPSANIT 288
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNW S+ +DP CCTYLL+ +DPLF++IG AFI QQ+KEYG ++IY+CDTF+ENTPP
Sbjct: 289 KLGNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTNIYSCDTFNENTPP 348
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ YISSLGA++Y M D DAVWLMQGWLF D FW+P QMKALL+SVP GK++V
Sbjct: 349 TNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSDFWKPDQMKALLHSVPFGKMIV 408
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFA+VKPIW +S QFYG PY+WCMLHNF GNIEMYGILD+I+ GPV+A SEN+TM
Sbjct: 409 LDLFADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVG 468
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ YS RYG++ + AWN+LYHT+
Sbjct: 469 VGMCMEGIEHNPVVYELMSEMAFRSKKVQVQEWLKTYSRCRYGKADHYVDAAWNILYHTI 528
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YNCTDG + N D IV PD DPS S + K KP PHLWY
Sbjct: 529 YNCTDGIANHNTDFIVKLPDWDPS--STFDLK-----KP----------------PHLWY 565
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
ST EVI AL+L + + L S TYRYDL+DLTRQ L K ANE +L + A++ +
Sbjct: 566 STQEVINALQLLVNVDDNLVHSATYRYDLVDLTRQVLGKLANEEYLKAVTAFRRQNVKAQ 625
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
S+RF++L+ D+D LLA + FLLG WLESAK+LA N + KQYEWNARTQ+TMW+DN
Sbjct: 626 NLHSKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLATNPAEMKQYEWNARTQVTMWYDN 685
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
T+ S L DY NKYWSGLL YY PRA YF Y+ +SL + F L+DWRREWI +N
Sbjct: 686 TKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWILFSNK 745
Query: 507 WQNGRNVYPVESNGDALITSQWLYNKYL 534
WQ +YPV++ G+A+ S+ LY KY
Sbjct: 746 WQAASELYPVKAEGNAVAISKALYEKYF 773
>gi|449489156|ref|XP_004158231.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Cucumis sativus]
Length = 567
Score = 623 bits (1606), Expect = e-176, Method: Compositional matrix adjust.
Identities = 307/565 (54%), Positives = 378/565 (66%), Gaps = 57/565 (10%)
Query: 4 LHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLG 63
L WGGPL ++WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPA L +FPSA IT+LG
Sbjct: 25 LKEWGGPLSKNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAGLVEIFPSANITKLG 84
Query: 64 NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDS 123
NW S+ +DP CCTYLL+ +DPLF++IG AFI QQ+KEYG ++IY+CDTF+ENTPP +
Sbjct: 85 NWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTNIYSCDTFNENTPPTND 144
Query: 124 PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDL 182
YISSLGA++Y M D DAVWLMQGWLF D FW+P QMKALL+SVP GK++VLDL
Sbjct: 145 TSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSDFWKPDQMKALLHSVPFGKMIVLDL 204
Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA---- 238
FA+VKPIW +S QFYG PY+WCMLHNF GNIEMYGILD+I+ GPV+A SEN+TM
Sbjct: 205 FADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGM 264
Query: 239 -----------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVYNC 269
W+ YS RYG++ + AWN+LYHT+YNC
Sbjct: 265 CMEGIEHNPVVYELMSEMAFRXQKVQVQEWLKTYSRCRYGKADHYVDAAWNILYHTIYNC 324
Query: 270 TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTS 329
TDG + N D IV PD DPS S + K KP PHLWYST
Sbjct: 325 TDGIANHNTDFIVKLPDWDPS--STFDLK-----KP----------------PHLWYSTQ 361
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
EVI AL+L + + L S TYRYDL+DLTRQ L K ANE +L + A++ +
Sbjct: 362 EVINALQLLVNVDDNLVHSATYRYDLVDLTRQVLGKLANEEYLKAVTAFRRQNVKAQNLH 421
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
S+RF++L+ D+D LLA + FLLG WLESAK+LA N + KQYEWNARTQ+TMW+DNT+
Sbjct: 422 SKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLATNPAEMKQYEWNARTQVTMWYDNTKV 481
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
S L DY NKYWSGLL YY PRA YF Y+ +SL + F L+DWRREWI +N WQ
Sbjct: 482 NQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWILFSNKWQA 541
Query: 510 GRNVYPVESNGDALITSQWLYNKYL 534
+YPV++ G+A+ S+ LY KY
Sbjct: 542 ASELYPVKAEGNAVAISKALYEKYF 566
>gi|414585093|tpg|DAA35664.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 721
Score = 623 bits (1606), Expect = e-175, Method: Compositional matrix adjust.
Identities = 295/469 (62%), Positives = 360/469 (76%), Gaps = 39/469 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYNCDTFDENTPP
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYNCDTFDENTPP 359
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+GK++VL
Sbjct: 360 LSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGKMIVL 419
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL+AEVKP+W S Q YGVPYIWCMLHNFA + EMYG+LD++A GP++AR S+N+TM
Sbjct: 420 DLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGV 479
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W+ Y RRYG+ V +QDAW +LY T+Y
Sbjct: 480 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLY 539
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHP 322
NCTDG DKNRDVIVAFPDV+P +I+ T G + N Y SK + K +S +Y+HP
Sbjct: 540 NCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHP 598
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
HLWY T+ VI ALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+
Sbjct: 599 HLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNN 658
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
+ V L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE Q
Sbjct: 659 MNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQ 707
>gi|4160292|emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum]
Length = 811
Score = 614 bits (1583), Expect = e-173, Method: Compositional matrix adjust.
Identities = 306/580 (52%), Positives = 394/580 (67%), Gaps = 46/580 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL Q+WL+ QL LQK+IL R+ ELGM PVLP+FSGNVPAAL+ +FPSA IT
Sbjct: 231 MGNLHAWGGPLSQNWLNIQLALQKQILSRMRELGMTPVLPSFSGNVPAALKKIFPSANIT 290
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCT+LL +DPLFIEIG AFI +Q++EYG + IYNCDTF+ENTPP
Sbjct: 291 RLGDWNTVNGDPRWCCTFLLAPSDPLFIEIGEAFIRKQIEEYGDITDIYNCDTFNENTPP 350
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWL-MQGWLFSYD-PFWRPPQMKALLNSVPLGKLV 178
D P YI Q + WL + WLF D +W+ PQM+ALL+SVP GK++
Sbjct: 351 TDDPTYIHLSALLCTKQCQKQITMRCWLNARVWLFYSDSKYWKSPQMEALLHSVPRGKMI 410
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM- 237
VLDLFA+VKPIW +S QFYG PYIWCMLHNF GNIEMYG+LD++A GP++ARTSEN+TM
Sbjct: 411 VLDLFADVKPIWKSSSQFYGTPYIWCMLHNFGGNIEMYGVLDAVASGPIDARTSENSTMV 470
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
W+ YS RRYG+ IQ AW++LYHT
Sbjct: 471 GVGMCMEGIEHNPVVYELMSEMAFREDNFQLQGWLKSYSHRRYGKVNDQIQAAWDILYHT 530
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPS-----IISVTEGKYQNYGKPVS-----KEAVLKSE 315
+YNCTDG D N+D IV FPD DPS IS T+ QN + ++ + + +
Sbjct: 531 IYNCTDGIADHNKDYIVEFPDWDPSGKTGTDISGTDSSSQNRMQKLAGFQWNRRFLFFEK 590
Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
+SS P LWYST +V +AL+LFI + +LS S TYRYDL+DL+RQ+L+K AN+++L+ I
Sbjct: 591 SSSLPKPRLWYSTEDVFQALQLFIDALKKLSGSLTYRYDLVDLSRQSLSKLANQVYLDAI 650
Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ-LAQNEEQEKQYEW 434
A++ DA + Q S +FL L++D+D LLA D FLLG WLE+ Q LA N +++KQYEW
Sbjct: 651 SAFRREDAKPLNQHSPKFLPLLQDIDRLLAADDNFLLGTWLENCPQNLAMNSDEKKQYEW 710
Query: 435 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
NARTQITMWFDNT+ S L DY NK+WSGLL YY PRA+IYF+ + +SL+ F+L+
Sbjct: 711 NARTQITMWFDNTKYNQSQLHDYANKFWSGLLEAYYLPRASIYFELLSKSLKEKVDFKLE 770
Query: 495 DWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
+WR+EWI +N WQ +YPV++ GDAL + L+ KY
Sbjct: 771 EWRKEWIAYSNKWQESTELYPVKAQGDALAIATALFEKYF 810
>gi|168060822|ref|XP_001782392.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666123|gb|EDQ52786.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 801
Score = 613 bits (1581), Expect = e-173, Method: Compositional matrix adjust.
Identities = 296/567 (52%), Positives = 376/567 (66%), Gaps = 50/567 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLPQ WLDQQL LQ KIL R+ ELGM PVLPAF+GNVPAA+ +PSA++T
Sbjct: 236 MGNLKRWGGPLPQKWLDQQLQLQIKILARMRELGMTPVLPAFAGNVPAAITKKYPSARVT 295
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V D R+CCT+LLD DPLF++IG+AFI QQ+KEYG T HIYNCDTF+EN PP
Sbjct: 296 RLGEWNTVNGDTRYCCTFLLDPKDPLFVDIGKAFILQQIKEYGGTQHIYNCDTFNENQPP 355
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P YIS+LG+ +Y M + D DA+WLMQ +YD FW+PPQMKALL+SVP+G++VVL
Sbjct: 356 TDDPSYISALGSIVYEAMSAADQDAIWLMQ----AYDKFWKPPQMKALLHSVPVGRMVVL 411
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
DLFA+VKP+WS S FYGVPYIWCMLHNF GN+EMYG LD +A P++A TS N+TM
Sbjct: 412 DLFADVKPMWSRSDHFYGVPYIWCMLHNFGGNVEMYGRLDVVATAPIQAVTSSNSTMVGV 471
Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
WI +Y+ RRYG + AW +L+ ++Y
Sbjct: 472 GMCMEGIEQNPVVYDLMAEMAFHNATVVVEDWIEEYARRRYGELTAGARIAWKMLHESIY 531
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-HLWY 326
NC+DG D N DVIV FPD+DP Q+ G+ + HP H+WY
Sbjct: 532 NCSDGIADHNGDVIVEFPDIDPKRSLFQIRPRQSLGQQI------------LGHPQHIWY 579
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
S + AL+ ++S + L S YRYD++DLTRQ L+K AN+L +++ +++ + +
Sbjct: 580 SPQDAAVALQYLLSSADALGLSKPYRYDVVDLTRQVLSKLANQLHSQVLDQFRMFNVEKM 639
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+S R LEL+ DMD LL + FLLG WLESAK LA ++E+ K YEWNARTQITMWFDN
Sbjct: 640 DNISSRLLELLSDMDDLLGASEEFLLGTWLESAKDLATSDEERKLYEWNARTQITMWFDN 699
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
T ++ S L DY NK WSGL RDYY PRA+IY KY+ +SL F ++WRREWI LTN+
Sbjct: 700 TLDKPSPLHDYANKMWSGLTRDYYLPRASIYIKYLKQSLHENTSFAFQEWRREWIALTNE 759
Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
WQ N+YP + GDAL + LY KY
Sbjct: 760 WQVASNLYPTVAKGDALEIATTLYEKY 786
>gi|302786446|ref|XP_002974994.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
gi|300157153|gb|EFJ23779.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
Length = 761
Score = 599 bits (1544), Expect = e-168, Method: Compositional matrix adjust.
Identities = 290/567 (51%), Positives = 378/567 (66%), Gaps = 37/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP+ WL+ QL+LQKKIL + LGM VLPAFSGNVP AL+ ++PSA IT
Sbjct: 192 MGNLHGWGGPLPEKWLELQLILQKKILHHMRSLGMIAVLPAFSGNVPRALKILYPSANIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W +V +P+WCCTYLL DPLFI+IG+AFIEQQ+KEYG T H+YNCDTF+EN PP
Sbjct: 252 RLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGKAFIEQQVKEYGSTQHVYNCDTFNENLPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
D P YIS+L A++Y M D A+WLMQGWLFS D FW+PPQMKALL++VP GK++V
Sbjct: 312 TDDPSYISALAASVYGAMIVADKQAIWLMQGWLFSSDAQFWKPPQMKALLHAVPFGKMIV 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFAEV+PIWS S FYGVPYIWCMLHNF GN EMYG LD ++ GPV+A+TS N+TM
Sbjct: 372 LDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGGNHEMYGRLDVVSSGPVDAKTSANSTMIG 431
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+N YS RRYG++VP +AW +L HT+
Sbjct: 432 VGMCMEGIEQNPVVYELMAEMAFRSTRNALKDWVNDYSTRRYGKAVPEALEAWQILSHTL 491
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YNC+DG D N DVIV FPD++ S ++ T +Y +L +S+ H LWY
Sbjct: 492 YNCSDGLQDHNTDVIVKFPDLNASSLT-TLSRYLAEEAGTQTRRLLTEGLTSFGH--LWY 548
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+E AL + + + LS TYRYDL+DLTRQ L K AN++ L + ++ D +
Sbjct: 549 RPTEAKVALSYMLNASSSLSNVATYRYDLVDLTRQVLMKLANQIHLQALVSFVKGDLEEL 608
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ + +++D + LL ++GFLLGPWLESAK+L N +++ YEWNARTQ+TMWFDN
Sbjct: 609 TKNCDILIGIIKDSELLLRSNNGFLLGPWLESAKKLGTNSDEKHLYEWNARTQVTMWFDN 668
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
T+ S L DY NK WSGL DYY PRA++Y K ++++L + F WR WI LTN
Sbjct: 669 TRSLPSALHDYANKMWSGLFEDYYLPRASLYTKLLVKALHDKEPFPYGSWRSSWILLTNT 728
Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
+QNG YP+E+ GD++ ++ L++KY
Sbjct: 729 FQNGTKNYPLEAAGDSIEIAKSLFSKY 755
>gi|302791289|ref|XP_002977411.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
gi|300154781|gb|EFJ21415.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
Length = 761
Score = 597 bits (1539), Expect = e-168, Method: Compositional matrix adjust.
Identities = 288/567 (50%), Positives = 376/567 (66%), Gaps = 37/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP+ WL+ QL+LQKKIL + LGM VLPAFSGNVP AL+ ++PSA IT
Sbjct: 192 MGNLHGWGGPLPEKWLELQLILQKKILHHMRSLGMIAVLPAFSGNVPRALKILYPSANIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W +V +P+WCCTYLL DPLFI+IG+AFIEQQ+KEYG T H+YNCDTF+EN PP
Sbjct: 252 RLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGKAFIEQQVKEYGSTQHVYNCDTFNENLPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
D P YIS+L A++Y M D A+WLMQGWLFS D FW+PPQMKALL++VP GK++V
Sbjct: 312 TDDPSYISALAASVYGAMIVADKQAIWLMQGWLFSSDAQFWKPPQMKALLHAVPFGKMIV 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFAEV+PIWS S FYGVPYIWCMLHNF GN EMYG LD ++ GPV+A+TS N+TM
Sbjct: 372 LDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGGNHEMYGRLDVVSSGPVDAKTSANSTMIG 431
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W++ YS RRYG++VP +AW +L HT+
Sbjct: 432 VGMCMEGIEQNPVVYELMAEMAFRSTRNALKDWVDDYSTRRYGKAVPEALEAWQILSHTL 491
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YNC+DG D N DVIV FPD++ S ++ G ++ + + TS HLWY
Sbjct: 492 YNCSDGLQDHNTDVIVKFPDLNASSLTTLSRYLAEEGGTQTRRLLTEGLTS---FGHLWY 548
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+E AL + + + LS TYRYDL+DLTRQ L K AN++ L + ++ D +
Sbjct: 549 RPTEAKVALSYMLNASSSLSNVATYRYDLVDLTRQVLMKLANQIHLQALVSFVKGDLEEL 608
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ + +++D + LL ++GFLLGPWLESAK+L N ++ YEWNARTQ+TMWFDN
Sbjct: 609 TKNCDILIGIIKDSELLLRSNNGFLLGPWLESAKKLGTNSDETNLYEWNARTQVTMWFDN 668
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
T+ S L DY NK WSGL DYY PRA++Y K ++++L + F WR WI LTN
Sbjct: 669 TRTLPSALHDYANKMWSGLFEDYYLPRASLYTKLLVKALHDKEPFPYDSWRSSWILLTNT 728
Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
+QNG YP+E+ GD++ ++ L++KY
Sbjct: 729 FQNGTKNYPLEAAGDSIEIAKSLFSKY 755
>gi|90399367|emb|CAJ86183.1| H0212B02.15 [Oryza sativa Indica Group]
gi|116311963|emb|CAJ86322.1| OSIGBa0113E10.5 [Oryza sativa Indica Group]
Length = 692
Score = 559 bits (1441), Expect = e-156, Method: Compositional matrix adjust.
Identities = 272/466 (58%), Positives = 335/466 (71%), Gaps = 44/466 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 261 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 320
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 321 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 380
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMK + G
Sbjct: 381 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKIGVGMSMEG----- 435
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWI 240
E PI +Y ++ +AF + W+
Sbjct: 436 ---IEQNPI-------------------------VYDLMSEMAF-----HHRQVDLQVWV 462
Query: 241 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY- 299
Y RRYG+S+ +QDAW +LY T+YNCTDG DKNRDVIVAFPDV+P +I T G Y
Sbjct: 463 ETYPTRRYGKSIVGLQDAWKILYQTLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYT 521
Query: 300 ---QNYGKPVSKEAV-LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 355
+ Y +SK + + + Y+HPHLWY T VIRALELF+ G+E+S SNT+RYDL
Sbjct: 522 SSSKTYSTKLSKNYIAVDASNDEYEHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDL 581
Query: 356 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 415
+DLTRQ LAKYAN++F+ IIE+Y+ N+ + V L + F++LV D+D LLA H+GFLLGPW
Sbjct: 582 VDLTRQTLAKYANQVFVKIIESYKANNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPW 641
Query: 416 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 461
LESAK LA+++EQE QYEWNARTQITMWFDNT+ +ASLLRDYG +
Sbjct: 642 LESAKGLARDKEQEMQYEWNARTQITMWFDNTKTKASLLRDYGEAH 687
>gi|326521470|dbj|BAK00311.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 223/421 (52%), Positives = 287/421 (68%), Gaps = 34/421 (8%)
Query: 149 MQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 207
+QGWLF D FW+ QMKALL+SVP+GK++VLDLFA+VKPIW TS QFYGVPYIWCMLH
Sbjct: 8 VQGWLFYSDAVFWKESQMKALLHSVPIGKMMVLDLFADVKPIWQTSSQFYGVPYIWCMLH 67
Query: 208 NFAGNIEMYGILDSIAFGPVEARTSENTTMA----------------------------- 238
NF GNIEMYG+LDSI+ GPV+ARTS N+TM
Sbjct: 68 NFGGNIEMYGVLDSISSGPVDARTSYNSTMVGVGMCMEGIEHNPVVYELMSEMAFRSQKV 127
Query: 239 ----WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ YS RRYG+S IQ AW +LYHT+YNCTDG D N+D IV FPD+ PS S
Sbjct: 128 KVEDWLKTYSHRRYGQSNVEIQKAWGILYHTIYNCTDGIADHNKDYIVEFPDMSPSSFSS 187
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
K L ++S PHLWYST E I++LELF+ +GN+LS S TYRYD
Sbjct: 188 QYSKRSISLARKHPRFFLSEVSASLPQPHLWYSTEEAIKSLELFLNAGNDLSKSLTYRYD 247
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
L+DLTRQ+L+K AN+++ + I +YQ D+ G+ ++ FLEL+ D+D LLA D FLLGP
Sbjct: 248 LVDLTRQSLSKLANKVYHDAISSYQKRDSSGLNFHTKEFLELIVDIDTLLASDDNFLLGP 307
Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
WLESAK LA E++ KQYEWNARTQ+TMW+D+T+ E S L DY NK+WSGLL+ YY PRA
Sbjct: 308 WLESAKSLAMTEDERKQYEWNARTQVTMWYDDTKTEQSKLHDYANKFWSGLLKSYYLPRA 367
Query: 475 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
+ YF + SL+ F+L++WRR+WI +N+WQ+G+ +YPV++ GD+L S+ L+ KY
Sbjct: 368 SKYFSRLSRSLQENRSFQLEEWRRDWISYSNEWQSGKELYPVKAIGDSLAISRSLFTKYF 427
Query: 535 Q 535
+
Sbjct: 428 R 428
>gi|449518399|ref|XP_004166229.1| PREDICTED: alpha-N-acetylglucosaminidase-like, partial [Cucumis
sativus]
Length = 336
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 219/324 (67%), Positives = 260/324 (80%), Gaps = 7/324 (2%)
Query: 212 NIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTD 271
N +Y ++ +AF + ++ W+ QYSVRRYG VP+IQDAW+VLYHTVYNCTD
Sbjct: 13 NPVVYDLMSEMAF-----QHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNCTD 67
Query: 272 GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEV 331
GA DKNRDVIVAFPDVDPS I V +G S +V + + +++D PHLWY TSEV
Sbjct: 68 GANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHLWYPTSEV 125
Query: 332 IRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSR 391
I AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF I++AYQL+D + LS+
Sbjct: 126 ISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQ 185
Query: 392 RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEA 451
FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQYEWNARTQITMWFDNT+EEA
Sbjct: 186 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEA 245
Query: 452 SLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGR 511
SLLRDYGNKYWSGLL DYY PRAAIY K++ ES E+G F L +WRREWIKLTNDWQ+ R
Sbjct: 246 SLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSR 305
Query: 512 NVYPVESNGDALITSQWLYNKYLQ 535
+YPVESNGDAL TS WLYNKYLQ
Sbjct: 306 KIYPVESNGDALDTSHWLYNKYLQ 329
>gi|414585094|tpg|DAA35665.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 1202
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 221/431 (51%), Positives = 281/431 (65%), Gaps = 58/431 (13%)
Query: 142 DSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPY 201
DS+ W L + DP + V +GK+ + + E + + Y
Sbjct: 308 DSNPRWCCTYLLDASDPLF-----------VEIGKMFIEEQIRE----YGRTSHIYN--- 349
Query: 202 IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA----------------------- 238
WCMLHNFA + EMYG+LD++A GP++AR S+N+TM
Sbjct: 350 -WCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGVGMSMEGIEQNPIVYDLMSEMA 408
Query: 239 ----------WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD 288
W+ Y RRYG+ V +QDAW +LY T+YNCTDG DKNRDVIVAFPDV+
Sbjct: 409 FHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLYNCTDGKNDKNRDVIVAFPDVE 468
Query: 289 PSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGN 343
P +I+ T G + N Y SK + K +S +Y+HPHLWY T+ VI ALELF+ G+
Sbjct: 469 PFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHPHLWYDTNAVIHALELFLQHGD 527
Query: 344 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 403
E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+ + V L + FL LV D+D L
Sbjct: 528 EVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNNMNQVTILCQHFLSLVNDLDTL 587
Query: 404 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 463
L+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWS
Sbjct: 588 LSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWS 647
Query: 464 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 523
GLL+DYYGPRAAIYFK+++ S+E+ F LK+WRREWI LTN+WQ+ R V+ + GD L
Sbjct: 648 GLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWISLTNNWQSDRKVFSTTATGDPL 707
Query: 524 ITSQWLYNKYL 534
SQ LY KYL
Sbjct: 708 NISQSLYTKYL 718
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 85/110 (77%), Positives = 101/110 (91%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYN 110
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYN
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYN 349
>gi|156399499|ref|XP_001638539.1| predicted protein [Nematostella vectensis]
gi|156225660|gb|EDO46476.1| predicted protein [Nematostella vectensis]
Length = 675
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 221/569 (38%), Positives = 317/569 (55%), Gaps = 75/569 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+HGWGGPLP +W +L LQ KIL + GM PVLP F+G+VPA L ++P A ++
Sbjct: 142 MGNMHGWGGPLPSTWYGMKLNLQHKILAAMRNFGMTPVLPGFAGHVPAGLLRLYPKANVS 201
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + S +CCTYLL+ +DPLF +IG AFI++Q EYG T+HIYN DTF+E P
Sbjct: 202 KLGDWGNFNST--YCCTYLLEPSDPLFQKIGTAFIKEQTAEYG-TNHIYNADTFNEMRPR 258
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P Y+ + +A+Y GM GD DAVWLMQGWLF + FW+P Q+KALL+ VP G ++VL
Sbjct: 259 SSDPTYLGAASSAVYRGMAGGDPDAVWLMQGWLFVDEGFWKPDQIKALLHGVPQGFMIVL 318
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
DL+AE PIWS ++ FYG P+IWCML NF GNI ++G + S++ GP +A S N+TM
Sbjct: 319 DLWAENSPIWSRTQSFYGTPFIWCMLLNFGGNIGLFGNIKSVSTGPPKAFQSFNSTMIGT 378
Query: 239 ----------------------------------WINQYSVRRYGRSVPAIQDAWNVLYH 264
WI Y++RRYG + PAI AW +L
Sbjct: 379 GLTMEGIEQNDMMFELMNEMGYRLEPLNPVDLDNWIKDYALRRYGGTNPAIIQAWRLLIR 438
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
+VY C D + V P +D + P+L
Sbjct: 439 SVYQCNGYCADHIHSIFVWKPSLD-------------------------------NKPNL 467
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY +V A + ++ E T+RYDL+D+TRQAL ++ ++I AY+ A
Sbjct: 468 WYDPEDVFNAWDELRSTAAEFMHVETFRYDLVDVTRQALHLRVIPIYNDLISAYKNRSAL 527
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
V R LE+ +D+D LL + FLLG WL SAK L + YE+NAR QIT+W
Sbjct: 528 NVIHFGSRLLEMFDDLDSLLQTNRNFLLGRWLNSAKALGTTPAEVALYEFNARNQITLWG 587
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ E DY NK WSGL++ YY PR ++ M+ ++ G+ + ++++ ++
Sbjct: 588 PRGEIE-----DYANKMWSGLVKAYYKPRWELFIDEMVSAIAQGEELDYEAFKKKLLEQE 642
Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
W +G+ YP + +GD+L +++L+NK+
Sbjct: 643 TAWTHGKEEYPDQPSGDSLAAAEFLHNKW 671
>gi|384247107|gb|EIE20595.1| hypothetical protein COCSUDRAFT_37819 [Coccomyxa subellipsoidea
C-169]
Length = 762
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 226/570 (39%), Positives = 318/570 (55%), Gaps = 64/570 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGPLPQS++D Q LQ+KI+ R+ ELGM+PV PAF+G VP AL P+A+I+
Sbjct: 199 MGNLRGYGGPLPQSYIDDQAELQRKIVRRMRELGMSPVFPAFAGFVPGALARERPAARIS 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTS-HIYNCDTFDENTP 119
+ NW S + R+CC +LLD +PLF EIG AF++ +EYG Y+ DTF+E TP
Sbjct: 259 RSDNWCSFPA--RYCCVHLLDPLEPLFQEIGSAFVKVLREEYGSDEVGFYSADTFNEMTP 316
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD--PFWRPPQMKALLNSVPLGKL 177
P P Y++S+ +AIY+ M + D A WLMQ WLF YD FW+PPQ++AL++ VP L
Sbjct: 317 PSSDPAYLTSVTSAIYNAMAAADPSARWLMQAWLF-YDNQKFWQPPQIQALVSGVPRDAL 375
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-------- 229
++LDL+AEV P+W ++K F+G P+I+CMLHNF GNIEMYG L+++A GP E
Sbjct: 376 IMLDLYAEVFPLWKSTKSFFGAPFIYCMLHNFGGNIEMYGALEAVARGPAEGQIDGVAGL 435
Query: 230 -------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ-DAWNVLY 263
R WI Y+ RRYG S P AW++L
Sbjct: 436 IGIGMCPEGIEQNPVVYELMSEWAFRRQPVEVEGWIEAYARRRYGNSTPPTALVAWDLLL 495
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
+VYN TDG TD +RD+ + P + P+ + + K PH
Sbjct: 496 RSVYNATDGHTDHSRDIPTSRPGLSPAEVGLWGLK-----------------------PH 532
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
LWY+ +V+ A L + S EL YRYDL+D+ RQ ++K A +++ + EAY +
Sbjct: 533 LWYNEQQVVDAWGLLLRSAGELQQVEGYRYDLVDVGRQVISKRATDIWKAVAEAYVDGRS 592
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
V + R L+L++D++ LLA + GFLLGP LE A E + + YEWN R Q+T+W
Sbjct: 593 IVVRREGARLLQLLDDLEELLATNRGFLLGPKLEEASSAGHTEAEARLYEWNLRKQLTVW 652
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+ S + DY N+ W+GL+ YY PR A++ + L G + + WR E +
Sbjct: 653 -GTSDTGGSEIEDYANREWAGLISSYYKPRWALWLLRLETDLAQGRRYDPEAWRMECLNF 711
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
T W R+ P+ GD SQ LY Y
Sbjct: 712 TLGWAYLRDQLPLHPQGDTGGVSQRLYEVY 741
>gi|390348210|ref|XP_785272.3| PREDICTED: alpha-N-acetylglucosaminidase [Strongylocentrotus
purpuratus]
Length = 793
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 213/569 (37%), Positives = 304/569 (53%), Gaps = 74/569 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLPQSW QL LQ +IL R+ +LGM PVLPAF+G+VP + VFP+A I+
Sbjct: 220 MGNIDGWGGPLPQSWHTNQLALQHQILKRMRDLGMIPVLPAFAGHVPXSFSKVFPNASIS 279
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG+W + P +CCT LLD DP+F ++G+AFI+ +E+ T HIY+ DTF+EN P
Sbjct: 280 NLGDW--GRFGPEYCCTSLLDPQDPMFKQVGKAFIDAMSEEFNGTDHIYSADTFNENKPK 337
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
Y+S+ +Y G+ GD VWLM GWLF FW P Q+KALL VP+G+++VL
Sbjct: 338 SRDSAYLSAASKGVYQGIIEGDPKGVWLMMGWLFQDTGFWGPTQIKALLQGVPIGRMIVL 397
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL+AE +P + T+ FYG P+IWCMLHNF GN +YG LD++ GP EAR +N+TM
Sbjct: 398 DLYAEARPFYKTTYSFYGQPFIWCMLHNFGGNTGLYGKLDAVNQGPFEARNYDNSTMIGM 457
Query: 238 ------------------------------AWINQYSVRRYG---RSVPAIQDAWNVLYH 264
WI QY+ RRY +AW +L
Sbjct: 458 GTTPEGIFQNYVMYNFLTDMTWRSGSTNVSKWIEQYAGRRYSNDPNKSEEATEAWVILKE 517
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
TVYN T D Y PV + + + + +
Sbjct: 518 TVYNNTGTLQD------------------------HQYAVPVRRPSNIMTSP-------V 546
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY ++V +A E + + +L S +RYDL+D+TR L A + ++ ++++ +A
Sbjct: 547 WYDYTKVAKAWEFLLEASTKLGTSPVFRYDLVDVTRNVLQDLAFDFQQKLMVSFRIRNAG 606
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
V L+ DMD + + H+ +LLG WLE AK LA N ++E YE+NA+ QIT+W
Sbjct: 607 AVGGNGTLLCNLILDMDNITSSHEDWLLGTWLEDAKSLATNNDEESLYEYNAKNQITIW- 665
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+EE + DY NK W GLLR YY R +Y +Y+ E ++S + +
Sbjct: 666 -GPKEE---ILDYANKQWGGLLRTYYHRRWQLYVQYLEECIQSHQPYDQNTFNVRSFVAE 721
Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
++W + + +P E GD + S+ LY KY
Sbjct: 722 SEWTHSKEKFPTEPVGDTMAISKALYVKY 750
>gi|14861378|gb|AAK73654.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
Length = 753
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 217/573 (37%), Positives = 316/573 (55%), Gaps = 84/573 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGW GPLP++W +QL +Q ++L R+ LGM VLPAF+G+VP + FP T
Sbjct: 212 MGNLHGWAGPLPRAWHLKQLYVQYRVLERMRSLGMITVLPAFAGHVPQGVLRAFPRVNAT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W D + CTYLLD DP+F IG F+++ +KE+G T HIY+ DTF+E P
Sbjct: 272 RLGGWSHF--DCTYSCTYLLDPEDPMFQVIGTLFLKELIKEFG-TDHIYSADTFNEMNPL 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S + +A++ M D AVWLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 329 SSDPAYLSRVSSAVFRSMTGADPKAVWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ ++ FYG P+IWCMLHNF GN ++G +++I GP AR N+TM
Sbjct: 389 LDLFAESRPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 448
Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
+W+ +Y+ RRYG A AW +L +V
Sbjct: 449 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVARYAERRYGAPNAAAASAWQLLLRSV 508
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YNCT + NR +V P L+ +T +WY
Sbjct: 509 YNCTGVCVNHNRSPLVRRPS-------------------------LRMDT------EVWY 537
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 380
+ S+V A L +++G EL +S T+ YDL D+TRQA + +E +L+I +A+Q L
Sbjct: 538 NKSDVYEAWRLLLSAGAELGSSPTFGYDLADVTRQAAQQLVSEYYLSIRQAFQSRSLPEL 597
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
A GV +L+ ++DGLL+ H FLLG WLESA+ +A ++ + +QYE NAR Q+
Sbjct: 598 LTAGGVL-----VYDLLPELDGLLSSHRLFLLGRWLESARAVATSDREAEQYELNARNQV 652
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
T+W N + DY NK GL+ DYYG R +++ ++ESL SG F + +
Sbjct: 653 TLWGPNGN-----ILDYANKQLGGLVLDYYGVRWSLFVSALVESLNSGSPFHQDQFNQAV 707
Query: 501 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
++ + + YP GD L S+ ++ KY
Sbjct: 708 FQVERGFIYNKKRYPTAPVGDTLEISKKIFLKY 740
>gi|14861380|gb|AAK73655.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
Length = 753
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 217/573 (37%), Positives = 316/573 (55%), Gaps = 84/573 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGW GPLP++W +QL +Q ++L R+ LGM VLPAF+G+VP + FP T
Sbjct: 212 MGNLHGWAGPLPRAWHLKQLYVQYRVLERMRSLGMITVLPAFAGHVPQGVLRAFPRVNAT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W D + CTYLLD DP+F IG F+++ +KE+G T HIY+ DTF+E P
Sbjct: 272 RLGGWSHF--DCTYSCTYLLDPEDPMFQVIGTLFLKELIKEFG-TDHIYSADTFNEMNPL 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S + +A++ M D AVWLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 329 SSDPAYLSRVSSAVFRSMTGADPKAVWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ ++ FYG P+IWCMLHNF GN ++G +++I GP AR N+TM
Sbjct: 389 LDLFAESRPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 448
Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
+W+ +Y+ RRYG A AW +L +V
Sbjct: 449 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVARYAERRYGAPNAAAASAWXLLLRSV 508
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YNCT + NR +V P L+ +T +WY
Sbjct: 509 YNCTGVCVNHNRSPLVRRPS-------------------------LRMDT------EVWY 537
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 380
+ S+V A L +++G EL +S T+ YDL D+TRQA + +E +L+I +A+Q L
Sbjct: 538 NKSDVYEAWRLLLSAGAELGSSPTFGYDLADVTRQAAQQLVSEYYLSIRQAFQSRSLPEL 597
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
A GV +L+ ++DGLL+ H FLLG WLESA+ +A ++ + +QYE NAR Q+
Sbjct: 598 LTAGGVL-----VYDLLPELDGLLSSHRLFLLGRWLESARAVATSDREAEQYELNARNQV 652
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
T+W N + DY NK GL+ DYYG R +++ ++ESL SG F + +
Sbjct: 653 TLWGPNGN-----ILDYANKQLGGLVLDYYGVRWSLFVSALVESLNSGSPFHQDQFNQAV 707
Query: 501 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
++ + + YP GD L S+ ++ KY
Sbjct: 708 FQVERGFIYNKKRYPTAPVGDTLEISKKIFLKY 740
>gi|375144105|ref|YP_005006546.1| alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
gi|361058151|gb|AEV97142.1| Alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
Length = 735
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 217/568 (38%), Positives = 310/568 (54%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLP SW+ LQ+KIL R ELGM PVLPAF+G+VP A + +P+AK+
Sbjct: 201 MGNLDAWGGPLPLSWMKSHKALQEKILQRERELGMKPVLPAFTGHVPPAFKKKYPNAKL- 259
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + +D TY+LD+ DPLF E+G+ F+++Q +G T H+Y+ DTF+EN PP
Sbjct: 260 KATNWTNGFAD-----TYILDSQDPLFAEMGKRFLQKQTSLFG-TDHLYSADTFNENEPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
D P ++S+L A IY GM+ D+ A W+MQGWLF D FW+ PQ++ALL +VP K+++
Sbjct: 314 SDDPAFLSALSARIYEGMKQADTAATWVMQGWLFYSDRKFWKAPQIEALLKAVPDNKMIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
LDL AE++P+W + FYG P+IW MLHNF GN+ ++G +D +A P E
Sbjct: 374 LDLAAEIEPVWKRTDAFYGKPWIWNMLHNFGGNVNLFGRMDGVATQPAETLNDKASGKLW 433
Query: 230 -------RTSENTTM-----------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
+N M AWI QY + RY + + DAW +L T
Sbjct: 434 GIGLTMEAIEQNPVMYELMTRHTWQTTPVDLDAWIPQYVLNRYRTNNTNLVDAWQILRKT 493
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYN GA + D SII+ G+P + + T PH
Sbjct: 494 VYN---GAVIR---------DGAESIIT---------GRPTFDSTTVWTRTKLNYAPH-- 530
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
E++ A +LF+ + + S+ ++YDL+D+TRQ LA YA L + A+ D+
Sbjct: 531 ----ELLPAWDLFVQAAGKGVNSDGFQYDLVDVTRQVLANYAAPLQKKWVTAFNAKDSAA 586
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
+ S+ FL+L+ DMD LLA F+LGPWL +A+ ++ YE NAR IT+W D
Sbjct: 587 FNKYSKAFLQLISDMDLLLASRKDFMLGPWLSAARSNGTTPAEKALYEQNARDLITLWGD 646
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
S L +Y N+ WSGLL D+Y PR +F + +SL +G LK +
Sbjct: 647 AN----SPLHEYSNRQWSGLLNDFYKPRWQQFFTLLQQSLRTGSTPDLKQFEENIRSWEW 702
Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKY 533
W N + YPV +G+++ +Q LY KY
Sbjct: 703 KWVNTQKAYPVVPSGNSVQVAQMLYKKY 730
>gi|73965663|ref|XP_548088.2| PREDICTED: alpha-N-acetylglucosaminidase [Canis lupus familiaris]
Length = 747
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 213/570 (37%), Positives = 312/570 (54%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAFSG+VP AL VFP IT
Sbjct: 205 MGNLHTWGGPLPHSWHLKQLYLQHRILDRMRSFGMIPVLPAFSGHVPKALTRVFPQINIT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + ++E+G T+HIY DTF+E PP
Sbjct: 265 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELIQEFG-TNHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y++S A++Y M + DSDAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 322 SSEPSYLASATASVYQAMITVDSDAVWLLQGWLFQHQPQFWGPAQVKAVLEAVPRGRLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 382 LDLFAESQPVYIQTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMLG 441
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW++ ++ RRYG + + AW +L +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVSSFAARRYGVAHRDTEAAWRLLLRS 501
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + + NR +V PS+ VT +
Sbjct: 502 VYNCSGEACSGHNRSPLVR----RPSLQMVTT---------------------------V 530
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + L++S T+RYDL+D+TRQA + + ++ AY +
Sbjct: 531 WYNRSDVFEAWRLLLTAAPTLASSPTFRYDLLDVTRQAAQELVSLYYVEARSAYLRKELV 590
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + + + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 591 PLLRAAGVLVYELLPALDKVLASDSRFLLGRWLEQARAAAVSEAEAHLYEQNSRYQLTLW 650
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + + +L
Sbjct: 651 ----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEMLVESLVQGIPFQQHQFDKNAFQL 705
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ G YP + +GD + ++ L+ KY
Sbjct: 706 EQTFIFGTQRYPSQPDGDTVDLAKKLFIKY 735
>gi|357458269|ref|XP_003599415.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488463|gb|AES69666.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 539
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 175/265 (66%), Positives = 203/265 (76%), Gaps = 27/265 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353
Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
EYG + IYNCDTF+EN+PP P YIS+LGAA+Y G+ GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413
Query: 155 SYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473
Query: 214 EMYGILDSIAFGPVEARTSENTTMA 238
EMYG+LD+IA GPV+AR SEN+TM
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMV 498
>gi|443691318|gb|ELT93213.1| hypothetical protein CAPTEDRAFT_144379, partial [Capitella teleta]
Length = 718
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 209/549 (38%), Positives = 305/549 (55%), Gaps = 75/549 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL +W QQ++LQ +IL R+ +LGM P LPAF+G+VPA + +FP K++
Sbjct: 188 MGNMRGWGGPLSTNWHHQQILLQHRILKRMRDLGMTPALPAFAGHVPANITRLFPRVKVS 247
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W S +CCT LLD DPLF EIG+AFI++ +E+G T H+YN DTF+E TP
Sbjct: 248 KLGDWGRFNST--YCCTTLLDVEDPLFKEIGKAFIDEYTREFG-TDHVYNTDTFNEMTPA 304
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P Y++ G A+YSGM S DS A+WLMQGWLF D FW+PPQ KALL SVP GK++VL
Sbjct: 305 SSDPSYLTKAGQAVYSGMVSSDSKAIWLMQGWLFLSD-FWKPPQAKALLTSVPQGKMLVL 363
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
DL++EV P + + +YG P+IWCMLHNF G + MYG ++S+ GP E R+ N+TM
Sbjct: 364 DLYSEVNPQYPRLQSYYGQPFIWCMLHNFGGTLPMYGAIESVNQGPFEGRSFVNSTMVGI 423
Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W ++Y+ RRY + AW + TVY
Sbjct: 424 GLTPEGINQNEVMYEFMMENSFRSQPVELTEWFDKYATRRYASRNANARAAWQIFKRTVY 483
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
NC+DG N+++ V P S + +WY
Sbjct: 484 NCSDGVKHHNKNIPVCRP-------------------------------SRKNKIDVWYD 512
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ + +L IA+ E+ S +RYDL+D++RQAL + + I+ +Y+ + +
Sbjct: 513 VEDFFKGWDLMIAASKEVD-SPLFRYDLVDVSRQALQVISITYYNQILTSYKQKNLTSLA 571
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
L L++DMD +LA FLLG W+ A + E++ YE+NAR Q+T+W
Sbjct: 572 SSGNDLLHLLDDMDTVLATDSHFLLGAWIAGAHRNGVTPEEKALYEFNARNQVTLW---- 627
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKLTND 506
+A++L DY NK W+GL+ DYY R ++ + +SLE+ F K ++++ + K +
Sbjct: 628 GPDANIL-DYANKQWAGLVADYYHERWELFIDELKKSLENKTSFDEKKFQKDVFEKAESP 686
Query: 507 WQNGRNVYP 515
+ NVYP
Sbjct: 687 FTYRTNVYP 695
>gi|149054264|gb|EDM06081.1| rCG33377, isoform CRA_d [Rattus norvegicus]
Length = 580
Score = 369 bits (948), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 209/570 (36%), Positives = 307/570 (53%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 43 MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 102
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLGNW + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 103 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 159
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 160 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 219
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++S + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TM
Sbjct: 220 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 279
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW++ ++ RRYG S P AW +L +
Sbjct: 280 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLRS 339
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 340 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 368
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + L+AS +RYDL+D+TRQA+ + + + A+ D
Sbjct: 369 WYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQAVQELVSSCYEEARTAFLNQDLD 428
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + +L+ +D LLA + FLLG WL+ A+++A +E + + YE N+R QIT+W
Sbjct: 429 LLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAREVAVSESEAQFYEQNSRYQITLW 488
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ DYY PR ++ + SL G F+ + + L
Sbjct: 489 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGIPFQQHQFEKSVFPL 543
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ N + YP++ GD + S+ ++ K+
Sbjct: 544 EQAFINNKKRYPIQPQGDTVDLSKKIFLKF 573
>gi|255533666|ref|YP_003094038.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346650|gb|ACU05976.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 735
Score = 369 bits (947), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 209/569 (36%), Positives = 310/569 (54%), Gaps = 75/569 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLP+S + LQKKIL R GM P+LPAF+G+VP A ++ FP AK+
Sbjct: 199 MGNIDGWGGPLPKSQMLAHEALQKKILERERSFGMTPILPAFTGHVPPAFKDKFPKAKLK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + S Y+LD D LF IG+ FIE+++K +G T H+Y DTF+ENTPP
Sbjct: 259 KT-NWTTFPS------VYILDPEDELFTTIGKRFIEEEVKTFG-TDHLYTADTFNENTPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
Y+S++ +Y M D +A W+MQGWLF + + FW+P Q+KALLN++P K++V
Sbjct: 311 TSDSLYLSNVSKKVYQSMALADPEATWIMQGWLFYHGEKFWKPTQIKALLNAIPNDKMIV 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
LDL++E P+W + +YG P+IW MLHNF GNI +YG +D +A G ++A+ + N+
Sbjct: 371 LDLWSENHPVWQRTAAYYGKPWIWNMLHNFGGNISLYGRMDEVASGAIKAKQAANSGNMV 430
Query: 236 ------------------------------TMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ YS +RYG + AW +LY T
Sbjct: 431 GIGLTPEAIEQNPVMYQLMLDNIWTDEPINVTAWLKNYSRQRYGAQNALAEQAWQILYKT 490
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VY T G P SI++ G+P +E++ P
Sbjct: 491 VY--TGG----------ILPGGPESILT---------GRPTM------AESTRSTRPKKN 523
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
Y +E+I A E + + +LS ++ ++YDL+D+TRQ L YA+ L +AYQ D
Sbjct: 524 YKPAELIPAWEALLKASQQLS-TDGFKYDLVDVTRQVLVNYADTLQRQFAQAYQGKDGKK 582
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
+LS FL +++D+D LLA FLLG WL AK++ E++K+YE NAR IT+W D
Sbjct: 583 FDRLSGDFLAVMDDVDYLLATRKDFLLGKWLNEAKRMGTTAEEKKRYERNARNLITLWAD 642
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
+ S L +Y + WSGL+ +Y PR +F Y + L+SG K + + +
Sbjct: 643 ----QNSSLNEYSCRQWSGLISSFYKPRWQQFFSYAKQQLKSGAKLDQKVFEEKMKRWEW 698
Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYL 534
DW N +V+ + +G+ + T++ LY KY+
Sbjct: 699 DWVNKNDVFTEQPSGNEIKTAESLYKKYI 727
>gi|109491871|ref|XP_001081442.1| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
gi|392351622|ref|XP_002727861.2| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
gi|149054262|gb|EDM06079.1| rCG33377, isoform CRA_b [Rattus norvegicus]
Length = 739
Score = 368 bits (944), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 209/570 (36%), Positives = 307/570 (53%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 202 MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLGNW + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 262 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 319 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++S + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TM
Sbjct: 379 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 438
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW++ ++ RRYG S P AW +L +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLRS 498
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 499 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 527
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + L+AS +RYDL+D+TRQA+ + + + A+ D
Sbjct: 528 WYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQAVQELVSSCYEEARTAFLNQDLD 587
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + +L+ +D LLA + FLLG WL+ A+++A +E + + YE N+R QIT+W
Sbjct: 588 LLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAREVAVSESEAQFYEQNSRYQITLW 647
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ DYY PR ++ + SL G F+ + + L
Sbjct: 648 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGIPFQQHQFEKSVFPL 702
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ N + YP++ GD + S+ ++ K+
Sbjct: 703 EQAFINNKKRYPIQPQGDTVDLSKKIFLKF 732
>gi|301773566|ref|XP_002922216.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Ailuropoda
melanoleuca]
Length = 634
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 206/570 (36%), Positives = 306/570 (53%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 95 MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 154
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 155 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 211
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A++Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 212 SSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQPEFWGPAQVTAVLGAVPRGRLLV 271
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F+G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 272 LDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMAG 331
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW++ + RRYG + + AW +L +
Sbjct: 332 TGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSSSAARRYGVTHKDTEAAWRLLLRS 391
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + + NR +V P + + +
Sbjct: 392 VYNCSGEACSGHNRSPLVRRPSLQMATA-------------------------------V 420
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + L+AS ++RYDL+D+TRQA + + + AY +
Sbjct: 421 WYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQAAQELVSLYYEEARAAYLNKELV 480
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + + R + EL+ +D +LA FLLG WLE A+ A +E + + YE N+R Q+T+W
Sbjct: 481 PLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQARAAAVSEAEARFYEQNSRYQLTLW 540
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + + +L
Sbjct: 541 ----GPEGNIL-DYANKQLAGLVADYYAPRWGLFMEMLVESLAQGIPFQQHQFDKNAFQL 595
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ YP + GD + ++ L+ KY
Sbjct: 596 EQAFVFSTQRYPSQPQGDTVDLAKKLFLKY 625
>gi|126307960|ref|XP_001366343.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
domestica]
Length = 741
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 199/570 (34%), Positives = 298/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +Q LQ +IL R+ GM PVLPAF+G++P A VFP A +T
Sbjct: 202 MGNLHTWGGPLPSSWDLKQSYLQYRILERMRSFGMKPVLPAFAGHIPKAFTRVFPQANVT 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + + C+YLL DPLF +G F+ + KE+G T HIY+ D F+E PP
Sbjct: 262 NLGMWGHFSCN--YSCSYLLAPEDPLFPVVGSLFLRELTKEFG-TDHIYSADIFNEMDPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ AA+Y M + D DAVWL QGWLF P FW+PPQMKA+L +VP G+ ++
Sbjct: 319 SSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNHPDFWKPPQMKAVLEAVPRGRFLI 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++S + FYG P+IWCMLHNF GN ++G+LD++ GP AR N+T+
Sbjct: 379 LDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLFGVLDAVNRGPSTARLFPNSTIVG 438
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ +RYG P + AW +L +
Sbjct: 439 TGIVPEGINQNEVVYALMAELGWRKDPFPDLGAWVAGFAAQRYGTPHPQAEAAWRLLLRS 498
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + T N +V P + +
Sbjct: 499 VYNCSWENCTGHNHSPLVKRPSLHLDF-------------------------------SV 527
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + +L+ S+ +RYDL+D+TRQ + + + + A++
Sbjct: 528 WYNRSDVFEAWRLLLEAAPQLATSSAFRYDLLDVTRQVAQELVSLYYGELKTAFEAGSMP 587
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + +L+ +D LL + FLLG WLE A+++A +E + YE NAR Q+T+W
Sbjct: 588 ALLSAGGLLVFDLLPSLDELLGTDERFLLGGWLEQAREMAVSEAEAWHYEQNARYQLTLW 647
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+ DY NK +GL+ YY PR ++ + +++SL G F + E L
Sbjct: 648 GPTGN-----ILDYANKQLAGLVAGYYAPRWKLFVEMLVKSLAEGTPFHQNQFENEAFLL 702
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ +GR +P + GD + ++ + KY
Sbjct: 703 GQAFVSGREKFPTQPQGDTVDLARKFFLKY 732
>gi|281344539|gb|EFB20123.1| hypothetical protein PANDA_011160 [Ailuropoda melanoleuca]
Length = 619
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 206/570 (36%), Positives = 306/570 (53%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 81 MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 140
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 141 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 197
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A++Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 198 SSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQPEFWGPAQVTAVLGAVPRGRLLV 257
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F+G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 258 LDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMAG 317
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW++ + RRYG + + AW +L +
Sbjct: 318 TGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSSSAARRYGVTHKDTEAAWRLLLRS 377
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + + NR +V P + + +
Sbjct: 378 VYNCSGEACSGHNRSPLVRRPSLQMATA-------------------------------V 406
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + L+AS ++RYDL+D+TRQA + + + AY +
Sbjct: 407 WYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQAAQELVSLYYEEARAAYLNKELV 466
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + + R + EL+ +D +LA FLLG WLE A+ A +E + + YE N+R Q+T+W
Sbjct: 467 PLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQARAAAVSEAEARFYEQNSRYQLTLW 526
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + + +L
Sbjct: 527 ----GPEGNIL-DYANKQLAGLVADYYAPRWGLFMEMLVESLAQGIPFQQHQFDKNAFQL 581
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ YP + GD + ++ L+ KY
Sbjct: 582 EQAFVFSTQRYPSQPQGDTVDLAKKLFLKY 611
>gi|373953359|ref|ZP_09613319.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
gi|373889959|gb|EHQ25856.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
Length = 733
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 209/570 (36%), Positives = 296/570 (51%), Gaps = 74/570 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGPLP W+ LQKKI+ R LGM PVLPAF+G+VPAA +N +P+AK+
Sbjct: 195 MGNMDSWGGPLPLRWMQTHFDLQKKIIARERALGMKPVLPAFTGHVPAAFKNKYPTAKL- 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + +D TY+LD+ DP+F IG+ F+++Q G T H+Y+ DTF+EN PP
Sbjct: 254 KTTNWKNGFAD-----TYILDSADPMFARIGQLFLQKQTALLG-TDHLYSADTFNENEPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
D PEY+ L +Y GM D+ AVW+MQGWLF D FW+P Q +ALL +VP K+++
Sbjct: 308 SDEPEYLGKLSERVYQGMHQADTAAVWVMQGWLFYSDRKFWKPEQTRALLKAVPDDKMII 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
LDL E++P+W ++ FYG P+IW ML+NF N ++G +DS A GP EA
Sbjct: 368 LDLATEIEPVWKRTEAFYGKPWIWNMLNNFGANTNLFGRMDSAAKGPAEAYHDPKSGQMK 427
Query: 230 ------------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
R W+ +Y + RYG+ Q AWN+L T
Sbjct: 428 GIGLTMEGIEQNPVLYDLLTDNTWRNQPINVDEWLPKYVLNRYGKPNAQAQKAWNILRKT 487
Query: 266 VYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
VY+ D +I A P D S S +
Sbjct: 488 VYSVLADRYIRDGAESIIQARPTTDSS--------------------------SRWARTT 521
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
L Y ++ A + I + +LS S+ +R+DL+DL+RQ LA YA L + A+Q DA
Sbjct: 522 LNYEPKALLPAWQAMIKASEDLSTSDGFRFDLVDLSRQVLANYAFTLQRRFVLAHQQKDA 581
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ S F+EL++DMD LLA FLLGPW+ A++ ++ YE NA+ IT+W
Sbjct: 582 AAFKKHSAEFIELIQDMDQLLATRKDFLLGPWVADARRCGATVSEKALYEMNAKDLITLW 641
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
D + L +Y + WSGLL D+Y PR YF+ + L F + + R+
Sbjct: 642 GD----KDCPLNEYACRQWSGLLNDFYKPRWQQYFEQINLDLTGKKPFDKEAFERKIKSW 697
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
W N R YPV+ GD ++ ++ LY KY
Sbjct: 698 EWQWVNARKDYPVKPQGDPVLEARKLYKKY 727
>gi|440903235|gb|ELR53922.1| Alpha-N-acetylglucosaminidase, partial [Bos grunniens mutus]
Length = 614
Score = 364 bits (935), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 213/574 (37%), Positives = 305/574 (53%), Gaps = 84/574 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 77 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 136
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+GNW + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 137 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 193
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 194 SSEPSYLAAATTAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 253
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TM
Sbjct: 254 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 313
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S + AW +L +
Sbjct: 314 TGMAPEGIGQNEVVYALMAELGWKKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 373
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNC S E + N+ P+ + L+ T+ +W
Sbjct: 374 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 403
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 379
Y+ S+V A L +A+ + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 404 YNRSDVFEAWRLLLAATSTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 463
Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
L A G+ EL+ +D +LA FLLG WLE A+Q A +E + YE N+R Q
Sbjct: 464 LTRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARQAAVSETEAHFYEQNSRYQ 518
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
+T+W E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + R
Sbjct: 519 LTLW----GPEGNIL-DYANKQLAGLMADYYAPRWRLFTETLVESLVQGVPFQQHQFDRN 573
Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+L + G YP + GD + + L+ KY
Sbjct: 574 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 607
>gi|348533253|ref|XP_003454120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oreochromis
niloticus]
Length = 845
Score = 364 bits (934), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 198/570 (34%), Positives = 315/570 (55%), Gaps = 77/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL + GPLPQSW QL LQ KIL R+ GM PVLPAFSGN+P + ++P A++T
Sbjct: 308 MANLFKFAGPLPQSWHVNQLYLQFKILERMRSFGMIPVLPAFSGNIPKGILRLYPEARVT 367
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W + C+ +LD DPLF IG ++ Q LK++G T HIY+ DTF+E TPP
Sbjct: 368 RLGPWSHFNCS--YSCSLVLDPQDPLFHHIGSLYLSQVLKQFG-TDHIYSTDTFNEMTPP 424
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ ++++ M + D AVWLMQGWLF D FW+P Q++ALL+ VPLG+++V
Sbjct: 425 SSDPAYLSAVSRSVFASMTAVDPQAVWLMQGWLFFSDAAFWKPAQIQALLHGVPLGRMIV 484
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFAE +PI+S ++ FYG P+IWCML NF GN ++G ++SI GP +A N+T+
Sbjct: 485 LDLFAETEPIFSYTESFYGQPFIWCMLQNFGGNSGLFGTVESINSGPFKALHFPNSTLVG 544
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W++ Y++RRYG + ++ AW +L+ ++
Sbjct: 545 IGMTPEGIEQNPVTYELMSELAWRKEPVNLAKWVSLYAIRRYGNTQESLTTAWRLLFASI 604
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG-KPVSKEAVLKSETSSYDHPHLW 325
YNCTD Y+N+ P+ + + T LW
Sbjct: 605 YNCTD-------------------------PHYRNHNHSPLVRRPSFQMNTG------LW 633
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
Y +++ +A +L + + L + T+RYDL+D+TR+ L + +I +A++ +
Sbjct: 634 YDPADLYKAWKLIMDAAPSLMSKETFRYDLVDVTREVLQVLTTSFYRDIADAFKKQNLSE 693
Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + +L+ +++ LL+ + FLLG WLE A+ LA ++++ + Y+ NAR QIT+W
Sbjct: 694 LLTAGGVLVYDLLPELNRLLSSNRNFLLGAWLERARSLAVDDKEAQLYDMNARNQITLWG 753
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ + + DY +K W GL+ DYY R ++ + ++E L SG F+ + + ++
Sbjct: 754 PSGE-----ILDYASKEWGGLMEDYYAQRWGLFVQTLVECLNSGQPFKQAAFNQAVFQIE 808
Query: 505 NDW-QNGRNVYPVESNGDALITSQWLYNKY 533
+ NGR YP + GD + ++ KY
Sbjct: 809 KGFIYNGRK-YPTKPQGDTYEIAYRIFLKY 837
>gi|410981277|ref|XP_003996997.1| PREDICTED: alpha-N-acetylglucosaminidase [Felis catus]
Length = 857
Score = 364 bits (934), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 208/570 (36%), Positives = 304/570 (53%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 318 MGNLHTWGGPLPPSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVT 377
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 378 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 434
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y++S A++Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 435 SSEPSYLASATASVYQAMVTVDPDAVWLLQGWLFQHQPQFWGPAQVSAVLGAVPRGRLLV 494
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 495 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMVG 554
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S + AW +L +
Sbjct: 555 TGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVTGFAARRYGVSHGNTEAAWRLLLRS 614
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + + NR +V P LK T+ +
Sbjct: 615 VYNCSGEACSGHNRSPLVRRPS-------------------------LKMTTT------V 643
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + L+ S T+RYDL+D+TRQA + + + AY +
Sbjct: 644 WYNRSDVFEAWRLLLTTTPSLATSPTFRYDLLDVTRQAAQELVSLYYGEARTAYLNKELV 703
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + + + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 704 PLLRAAGILVYELLPSLDKVLASDSRFLLGSWLEQARAAAVSEAEAHFYEQNSRYQLTLW 763
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + + +L
Sbjct: 764 ----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEMLVESLVRGVPFQQHQFDQNAFQL 818
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ YP + +GD + ++ L+ +Y
Sbjct: 819 EQTFVLSTQRYPSQPHGDTVDLAKKLFLRY 848
>gi|410930376|ref|XP_003978574.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Takifugu rubripes]
Length = 751
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 201/568 (35%), Positives = 314/568 (55%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLPQSW QL LQ KIL ++ GM PVLPAFSGN+P + +FP A++T
Sbjct: 214 MGNMFKFGGPLPQSWHVNQLYLQFKILAQMRSFGMIPVLPAFSGNIPKGILRLFPEARVT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W K + + C+Y+LD DPLF IG ++ Q +K++G T+HIYN DTF+E TPP
Sbjct: 274 RLEPW--SKFNCSFSCSYILDPRDPLFSRIGSLYLSQVVKQFG-TNHIYNTDTFNEMTPP 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ A+++ M + D AVWLMQGWLF D FW+P Q++ALLN VP+G+++V
Sbjct: 331 SSEPTYLSAVSRAVFASMTAVDPQAVWLMQGWLFLSDALFWKPAQIQALLNGVPVGRMIV 390
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFAE +P++S ++ FYG P+IWCMLHNF GN +G ++SI GP +A N+++
Sbjct: 391 LDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNGGFFGTVESINTGPFKALHFPNSSLVG 450
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W++ Y RRYG ++ AW +L+ +V
Sbjct: 451 IGMTPEGIEQNPVVYELMSELAWRKEPVNLLKWVSLYVTRRYGSMHESVSAAWKILFASV 510
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YNCT P Y+N+ + L S + + LWY
Sbjct: 511 YNCT-------------LP------------HYRNHN-----HSPLVRRPSFHMNSELWY 540
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+++ RA +L + + + T++YDL+D+TRQ + + +I++A+Q + +
Sbjct: 541 DPADLYRAWKLILEAAPSFMSKETFQYDLVDVTRQVMQVLTTSYYQDIVDAFQKHKMQEL 600
Query: 387 FQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
L +L+ +++ LL+ + FLLG WLE A+ LA +E + K Y+ NAR Q+T+W
Sbjct: 601 LTAGGVLLYDLLPELNRLLSSNHNFLLGTWLEQARSLALDEREAKLYDINARNQLTLWGP 660
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
+ + + DY NK W GL++DYY R ++ ++E L+SG F+ ++ + ++
Sbjct: 661 SGE-----ILDYANKQWGGLMQDYYAQRWGLFIHTLVECLDSGQPFKQDNFNKVVFQVEK 715
Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKY 533
+ R YP + GD + ++ KY
Sbjct: 716 GFIYNRRQYPTKPQGDTFEIAHRIFLKY 743
>gi|326679829|ref|XP_688608.3| PREDICTED: alpha-N-acetylglucosaminidase-like [Danio rerio]
Length = 757
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 205/569 (36%), Positives = 303/569 (53%), Gaps = 75/569 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLPQSW +QL LQ KIL R+ GM PVLPAFSG VP + +FP A +T
Sbjct: 214 MGNLFQWGGPLPQSWHVKQLYLQFKILDRMRSFGMIPVLPAFSGIVPEGITRLFPKANVT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W + C Y+LD DPLF IG F+ Q ++E+G T HIYN DTF+E P
Sbjct: 274 KLSPWSHFNCT--YSCAYVLDPRDPLFHRIGALFLTQVIEEFG-TDHIYNTDTFNEMPPA 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y++S+ AI++ M S D A+WLMQGWLF DP FW+ Q+KALL+ VPLG+++V
Sbjct: 331 SSDPTYLASISRAIFNTMTSVDPQAIWLMQGWLFISDPSFWKADQVKALLHGVPLGRMIV 390
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFAE P++S++ FYG P+IWCMLHNF GN ++G +DSI GP A N+T+
Sbjct: 391 LDLFAESMPVYSSTNSFYGQPFIWCMLHNFGGNSGLFGTVDSINSGPFNAVRFPNSTLVG 450
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W++ Y++RRYG + AW +L+ +V
Sbjct: 451 LGMTPEGIEQNPVIYELMSELAWRKDPVNLYKWVSLYALRRYGSMDENLALAWQLLFRSV 510
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK-PVSKEAVLKSETSSYDHPHLW 325
YNCT P KY+N+ + P+ L +T +W
Sbjct: 511 YNCT-------------LP------------KYKNHNRSPLVHRPSLHMQTD------IW 539
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
Y ++ RA +L + L T+RYDL+D+TRQAL E + +I A+Q
Sbjct: 540 YDPADFYRAWKLLFEAAPGLVTLETFRYDLVDVTRQALQLLTTEFYKDIKSAFQTQKLSD 599
Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + +L+ ++D +L+ ++ FLLG WL+ A+ +E + Y+ NAR QIT+W
Sbjct: 600 LLTAGGVLVYDLLPELDRILSSNEHFLLGAWLQQAQSQGVDEHEAHLYDINARNQITLWG 659
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ + + DY +K W+GL+ DYY R ++ ++E L+ G F+ + + ++
Sbjct: 660 PDGE-----ILDYASKEWAGLVEDYYLQRWGLFVNTLVECLDRGRPFKQDVFNQAVFQVE 714
Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD ++ ++ KY
Sbjct: 715 KGFVFNQRKYPTKPLGDTYDIARRIFLKY 743
>gi|350407422|ref|XP_003488083.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus impatiens]
Length = 770
Score = 361 bits (927), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 209/576 (36%), Positives = 301/576 (52%), Gaps = 75/576 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL SW ++ L LQ +IL R+ ELG+ PVLPAF+G+VP A +FP A +T
Sbjct: 218 MGNIRGFGGPLTSSWHERSLQLQHRILQRMRELGIIPVLPAFTGHVPRAFPRLFPEANVT 277
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W S ++CC YLL+ TDPLF +IG F+ +KE+G T HIYNCDTF+EN PP
Sbjct: 278 KSATWNSFSD--KYCCPYLLEPTDPLFHKIGDQFLRTYIKEFG-TDHIYNCDTFNENEPP 334
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M S D A+WLMQGWLF +D FW P++KA L SVPLG+L+V
Sbjct: 335 TSELKFLRNVGHSIFQTMLSVDPQAIWLMQGWLFVHDAVFWTEPRIKAFLTSVPLGRLIV 394
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL +E P++ K +YG P+IWCMLHNF G + M+G I E R E +TM
Sbjct: 395 LDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINRRVFEGRNMEGSTMIG 454
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W Y+ RRYG AW L TV
Sbjct: 455 TGLTPEGINQNYVIYELMNEMAYRQEPVNLDNWFEDYASRRYGAWNEYAVAAWKNLGSTV 514
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YN IS GKY +P A L WY
Sbjct: 515 YNFRG--------------------ISKIRGKYVITRRPSLNLARLT-----------WY 543
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ +F+ + + S YR+D++D+TRQAL A++++ ++E++ D
Sbjct: 544 DPEKFYSTWYIFLQARHGRKNSTLYRHDVVDITRQALQLKADKIYSVLVESFNQKDVTTF 603
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ R LEL +D++ +LA + FLLG WLE AK LA ++ + K YE+NAR QIT+W
Sbjct: 604 KLQAGRLLELFDDLEAILASSEDFLLGTWLEMAKNLATDDAESKLYEYNARNQITLWGPR 663
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW-RREWIKLTN 505
+ +RDY NK WSG++ DY+ PR AI+ + SL G + R + ++
Sbjct: 664 GE-----IRDYANKQWSGIVSDYFKPRWAIFLDGLTTSLTKGTSLNITRINERIFKEVEK 718
Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFD 541
+ R +YP + GD + + + +K+ Q + +FD
Sbjct: 719 PFTLSRKIYPTNATGDCIDIAMRILSKWYQPS-IFD 753
>gi|358419179|ref|XP_003584151.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bos taurus]
Length = 741
Score = 360 bits (925), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 213/574 (37%), Positives = 305/574 (53%), Gaps = 84/574 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+GNW + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TM
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 440
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S + AW +L +
Sbjct: 441 TGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 500
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNC S E + N+ P+ + L+ T+ +W
Sbjct: 501 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 530
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 379
Y+ S+V A L + + + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 531 YNRSDVFEAWRLLLTATSTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 590
Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
L A G+ EL+ +D +LA FLLG WLE A+Q A +E + YE N+R Q
Sbjct: 591 LTRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARQAAVSETEAHFYEQNSRYQ 645
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
+T+W E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + R
Sbjct: 646 LTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFTETLVESLVQGVPFQQHQFDRN 700
Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+L + G YP + GD + + L+ KY
Sbjct: 701 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 734
>gi|311267179|ref|XP_003131436.1| PREDICTED: alpha-N-acetylglucosaminidase [Sus scrofa]
Length = 744
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 210/575 (36%), Positives = 304/575 (52%), Gaps = 86/575 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 205 MGNLHTWSGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQISVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPSYLAAATAAVYQAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TMA
Sbjct: 382 LDLFAESQPVYVRTASFLGQPFIWCMLHNFGGNHGLFGALESVNQGPAAARLFPNSTMAG 441
Query: 239 ---------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHT 265
W+ ++ RRYG S + AW +L +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGTWVTSFAARRYGVSQGDAEAAWRLLLRS 501
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ +G T NR +V P + + +
Sbjct: 502 VYNCSGEGCTGHNRSPLVRRPSLQMATT-------------------------------V 530
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 378
WY+ S+V A L + + L++S +RYDL+D+TRQA+ + + + AY
Sbjct: 531 WYNQSDVFEAWRLLLKATPTLASSPAFRYDLVDITRQAVQELVSLYYEEARTAYLNKELV 590
Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
L A G+ EL+ +D +LA FLLG WLE A+ +A +E + YE N+R
Sbjct: 591 SLMRAGGILA-----YELLPALDKVLASDSHFLLGSWLEQARGVAVSEAEALFYEQNSRY 645
Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
Q+T+W E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + +
Sbjct: 646 QLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEMLVESLVQGIPFQQHQFDQ 700
Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+L + G YP + GD + ++ L+ KY
Sbjct: 701 NVFQLEQTFVLGTRRYPSQPQGDTVDLAKKLFLKY 735
>gi|340717403|ref|XP_003397173.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus terrestris]
Length = 770
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 207/570 (36%), Positives = 295/570 (51%), Gaps = 74/570 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL SW ++ L LQ KIL R+ ELG+ PVLPAF+G+VP A +FP A +T
Sbjct: 218 MGNIRGFGGPLTSSWHERSLQLQHKILQRMRELGIIPVLPAFTGHVPRAFPRLFPEANVT 277
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W S ++CC YLL+ TDPLF +IG F+ +KE+G T HIYNCDTF+EN PP
Sbjct: 278 KSATWNSFSD--KYCCPYLLEPTDPLFHKIGDQFLRTYIKEFG-TDHIYNCDTFNENEPP 334
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M S D A+WLMQGWLF +D FW P++K L SVPLG+L+V
Sbjct: 335 TSELKFLRNVGHSIFQTMLSVDPQAIWLMQGWLFVHDALFWTEPRIKTFLTSVPLGRLIV 394
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL +E P++ K +YG P+IWCMLHNF G + M+G I E R E +TM
Sbjct: 395 LDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINRRVFEGRNMEGSTMIG 454
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W Y+ RRYG AW L TV
Sbjct: 455 TGLTPEGINQNYVIYELMNEMAYRQEPVNLDNWFEDYASRRYGAWNEYAVAAWKNLGSTV 514
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YN IS GKY +P A L WY
Sbjct: 515 YNFRG--------------------ISKIRGKYVITRRPSLNLARLT-----------WY 543
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ +F+ + + S YR+D++D+TRQAL A++++ ++E++ D
Sbjct: 544 DPEKFYSTWYIFLQARHGRQNSTLYRHDVVDITRQALQLKADKIYSALVESFNQKDVTTF 603
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ R LEL +D++ +LA + FLLG WLE AK LA ++ + K YE+NAR QIT+W
Sbjct: 604 KLQADRLLELFDDLEAILASSEDFLLGTWLEMAKNLATDDAESKLYEYNARNQITLWGPR 663
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW-RREWIKLTN 505
+ +RDY NK WSG++ DY+ PR AI+ + SL G + R + ++
Sbjct: 664 GE-----IRDYANKQWSGIVSDYFKPRWAIFLDALTTSLTKGTSLNITRINERIFKEVEK 718
Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
+ R +YP GD + + + +K+ Q
Sbjct: 719 PFTLSRKIYPTNVTGDCIDIAMRILSKWHQ 748
>gi|148671928|gb|EDL03875.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
CRA_a [Mus musculus]
Length = 538
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 202/570 (35%), Positives = 302/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW Q+ LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 1 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 60
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 61 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 117
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 118 FSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 177
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT--- 236
LDLFAE P++ + F+G P+IWCMLHNF GN ++G L+ + GP AR N+T
Sbjct: 178 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 237
Query: 237 -------------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
MAW++ +++RRYG S P AW +L +
Sbjct: 238 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 297
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 298 VYNCSGEACSGHNRS-------------------------PLVKRPSLQMSTA------V 326
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + L+ S +RYDL+D+TRQA+ + + + AY +
Sbjct: 327 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 386
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + + +L+ +D LLA FLLG WL+ A++ A +E + + YE N+R QIT+W
Sbjct: 387 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQFYEQNSRYQITLW 446
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ DYY PR ++ + SL G F+ ++ + L
Sbjct: 447 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPFQQHEFEKNVFPL 501
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + S+ ++ KY
Sbjct: 502 EQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 531
>gi|254910995|ref|NP_038820.2| alpha-N-acetylglucosaminidase precursor [Mus musculus]
gi|20385160|gb|AAM21194.1|AF363242_1 N-acetyl-glucosaminidase [Mus musculus]
gi|3329361|gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus]
gi|33585908|gb|AAH55733.1| Alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB) [Mus
musculus]
gi|74211094|dbj|BAE37639.1| unnamed protein product [Mus musculus]
gi|74218052|dbj|BAE42009.1| unnamed protein product [Mus musculus]
gi|148671929|gb|EDL03876.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
CRA_b [Mus musculus]
Length = 739
Score = 358 bits (918), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 202/570 (35%), Positives = 302/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW Q+ LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 202 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 262 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 319 FSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT--- 236
LDLFAE P++ + F+G P+IWCMLHNF GN ++G L+ + GP AR N+T
Sbjct: 379 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 438
Query: 237 -------------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
MAW++ +++RRYG S P AW +L +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 498
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 499 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 527
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + L+ S +RYDL+D+TRQA+ + + + AY +
Sbjct: 528 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 587
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + + +L+ +D LLA FLLG WL+ A++ A +E + + YE N+R QIT+W
Sbjct: 588 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQFYEQNSRYQITLW 647
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ DYY PR ++ + SL G F+ ++ + L
Sbjct: 648 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPFQQHEFEKNVFPL 702
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + S+ ++ KY
Sbjct: 703 EQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732
>gi|2660688|gb|AAB88084.1| Naglu [Mus musculus]
Length = 739
Score = 358 bits (918), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 202/570 (35%), Positives = 302/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW Q+ LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 202 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 262 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 319 FSEPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT--- 236
LDLFAE P++ + F+G P+IWCMLHNF GN ++G L+ + GP AR N+T
Sbjct: 379 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 438
Query: 237 -------------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
MAW++ +++RRYG S P AW +L +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 498
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 499 VYNCSGEACSGHNRS-------------------------PLVKRPSLQMSTA------V 527
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + L+ S +RYDL+D+TRQA+ + + + AY +
Sbjct: 528 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 587
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + + +L+ +D LLA FLLG WL+ A++ A +E + + YE N+R QIT+W
Sbjct: 588 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQFYEQNSRYQITLW 647
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ DYY PR ++ + SL G F+ ++ + L
Sbjct: 648 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPFQQHEFEKNVFPL 702
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + S+ ++ KY
Sbjct: 703 EQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732
>gi|91080563|ref|XP_973259.1| PREDICTED: similar to alpha-N-acetyl glucosaminidase [Tribolium
castaneum]
Length = 747
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 197/572 (34%), Positives = 314/572 (54%), Gaps = 79/572 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W + LVLQK+IL R+ G+ PVLPAF+G++P A + ++P A ++
Sbjct: 201 MGNMRGFGGPLSPAWHSRSLVLQKQILQRMRAFGIIPVLPAFAGHLPRAFKTLYPDANMS 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + +CC Y LD T+PLF EIG+AF+ +Q+ E+G T H+YNCD+F+EN P
Sbjct: 261 KMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSEQISEFG-TDHMYNCDSFNENVPT 317
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
Y++++G +IY M D DAVW+MQGWLF++D F W + KA+L +VP GK++V
Sbjct: 318 SGDLTYLANVGKSIYKAMTDTDPDAVWVMQGWLFAHDFFYWTRNRAKAILTAVPKGKMIV 377
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDL +E P + Q++G PYIWCMLH+F G + M+G I P++AR EN+TM
Sbjct: 378 LDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMFGSSTVINEVPIKARHLENSTMIG 437
Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W +YS RRYG ++AW +L TV
Sbjct: 438 TGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKYSTRRYGFPDSDAENAWRILQRTV 497
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y D ++ GKY ++K LK + + WY
Sbjct: 498 Y--------------------DYQGLNRMRGKY-----AITKSPSLKIKIWT------WY 526
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
ST++++ A + + + L A++ Y +DL+D+TRQ L Y + + +++ YQ +D+
Sbjct: 527 STNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVLQVYGDLYYKEMVKNYQSHDSANF 586
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
S++FLE+++D+D +L+ + FLLGPWLE+AK+ A + +E Q+E+NAR QIT+W
Sbjct: 587 QANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAANDSAEEAQFEYNARNQITLWGPR 646
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
+ + DY NK W+G++ ++ PR ++ Y+ + + D + + ++
Sbjct: 647 GE-----IMDYANKQWAGVVSHFFAPRWYLFINYLNSTFDGAFNQTYID-AKMFKEVEEP 700
Query: 507 WQNGRNVYPVESNGDAL-----ITSQWLYNKY 533
+ R +PVE GDA+ I +W +Y
Sbjct: 701 FTFDRTEFPVEPIGDAVEIAWKIHKKWTSEEY 732
>gi|270005801|gb|EFA02249.1| hypothetical protein TcasGA2_TC007912 [Tribolium castaneum]
Length = 747
Score = 356 bits (913), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 196/572 (34%), Positives = 315/572 (55%), Gaps = 79/572 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W + LVLQK+IL R+ G+ PVLPAF+G++P A + ++P A ++
Sbjct: 201 MGNMRGFGGPLSPAWHSRSLVLQKQILQRMRAFGIIPVLPAFAGHLPRAFKTLYPDANMS 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + +CC Y LD T+PLF EIG+AF+ +Q+ E+G T H+YNCD+F+EN P
Sbjct: 261 KMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSEQISEFG-TDHMYNCDSFNENVPT 317
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVV 179
Y++++G +IY M D DAVWL+QGW+F D FW+ + ++++L SVPLGK++V
Sbjct: 318 SGDLTYLANVGKSIYKAMTDTDPDAVWLLQGWMFYNDNFWQDTERVRSILTSVPLGKMIV 377
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDL +E P + Q++G PYIWCMLH+F G + M+G I P++AR EN+TM
Sbjct: 378 LDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMFGSSTVINEVPIKARHLENSTMIG 437
Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W +YS RRYG ++AW +L TV
Sbjct: 438 TGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKYSTRRYGFPDSDAENAWRILQRTV 497
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y D ++ GKY ++K LK + + WY
Sbjct: 498 Y--------------------DYQGLNRMRGKY-----AITKSPSLKIKIWT------WY 526
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
ST++++ A + + + L A++ Y +DL+D+TRQ L Y + + +++ YQ +D+
Sbjct: 527 STNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVLQVYGDLYYKEMVKNYQSHDSANF 586
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
S++FLE+++D+D +L+ + FLLGPWLE+AK+ A + +E Q+E+NAR QIT+W
Sbjct: 587 QANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAANDSAEEAQFEYNARNQITLWGPR 646
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
+ + DY NK W+G++ ++ PR ++ Y+ + + D + + ++
Sbjct: 647 GE-----IMDYANKQWAGVVSHFFAPRWYLFINYLNSTFDGAFNQTYID-AKMFKEVEEP 700
Query: 507 WQNGRNVYPVESNGDAL-----ITSQWLYNKY 533
+ R +PVE GDA+ I +W +Y
Sbjct: 701 FTFDRTEFPVEPIGDAVEIAWKIHKKWTSEEY 732
>gi|449491231|ref|XP_004174728.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase,
partial [Taeniopygia guttata]
Length = 752
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 209/573 (36%), Positives = 312/573 (54%), Gaps = 84/573 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL W GPLP +W +QL LQ +I+ R+ LGM VLPAF+G+VP + VFP T
Sbjct: 211 MGNLRRWAGPLPPAWHFKQLYLQYRIVERMRSLGMTTVLPAFAGHVPQGILRVFPRVNAT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W D + C YLLD DP+F IG F+++ +KE+G T H+Y+ DTF+E TP
Sbjct: 271 RLGHWSHF--DCTYSCIYLLDPEDPMFQVIGTLFLKELIKEFG-TDHVYSADTFNEMTPL 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S + A++ M D A+WLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 328 SSDPAYLSRVSNAVFRSMTGADPKALWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 387
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE KP++ ++ FYG P+IWCMLHNF GN ++G +++I GP AR N+TM
Sbjct: 388 LDLFAESKPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 447
Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
+W+ +Y+ RRYG A AW +L +V
Sbjct: 448 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVTRYAERRYGAPNAAAASAWRLLLRSV 507
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YNCT + NR P+ + L+ +T LWY
Sbjct: 508 YNCTGVCVNHNRS-------------------------PLVRRPSLRMDT------ELWY 536
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 380
+ S+V A L +++G EL +S + YDL+D+TRQA + + +L+I +A+Q L
Sbjct: 537 NASDVFEAWRLLLSAGAELGSSPAFLYDLVDVTRQAAQQLVSHYYLSIRQAFQSHALPEL 596
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
A GV +L+ ++D LL+ H FLLG WL+SA+ +A ++++ +QYE NAR Q+
Sbjct: 597 LTAGGVL-----VYDLLPELDSLLSSHSLFLLGRWLQSARAVATSDQEAEQYELNARNQV 651
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
T+W + + DY N GL+ DYY R +++ ++ESL SG F + + +
Sbjct: 652 TLWGPSGN-----ILDYANXQLGGLVLDYYAVRWSLFVSVLVESLNSGRPFHQNQFNQVF 706
Query: 501 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
++ + + YP GD + S+ L+ KY
Sbjct: 707 FQVERGFIYNKKRYPAVPFGDTMEISRKLFLKY 739
>gi|1171229|gb|AAC50512.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1171231|gb|AAC50513.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1197840|gb|AAB06188.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1479981|gb|AAB36604.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|32450702|gb|AAH53991.1| N-acetylglucosaminidase, alpha- [Homo sapiens]
gi|119581237|gb|EAW60833.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
CRA_b [Homo sapiens]
Length = 743
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 200/573 (34%), Positives = 304/573 (53%), Gaps = 76/573 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ +YY PR ++ + +++S+ G F+ + + +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQL 704
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 536
+ + YP + GD + ++ ++ KY G
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPG 737
>gi|301626955|ref|XP_002942650.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Xenopus (Silurana)
tropicalis]
Length = 759
Score = 355 bits (910), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 200/573 (34%), Positives = 299/573 (52%), Gaps = 84/573 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+H WGGPL SW++++L LQ +I R+ LGM VLPAF+G++P + VFP ++
Sbjct: 212 MGNIHTWGGPLSISWMEKRLSLQLQITERMRSLGMITVLPAFAGHIPEGILRVFPKVTVS 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W + + C+YLLD DPLF IG F+ Q ++ +G T HIY+ DTF+E +P
Sbjct: 272 RLGGWSNFNCT--YSCSYLLDPEDPLFQWIGELFLSQMVQSFG-TDHIYSADTFNEMSPT 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ AI+ M D DA+WLMQGWLF +P FWRP Q KALL+ P+G+++V
Sbjct: 329 SSDPGYLSAVSGAIFKSMAKVDPDAIWLMQGWLFINNPSFWRPAQTKALLHGAPIGRIIV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFAE P++ T++ FYG P+IWCML+NF GN ++G ++ + GP +A N+TM
Sbjct: 389 LDLFAETVPVYLTTESFYGQPFIWCMLNNFGGNHGLFGNIEGVNRGPFDAAKFPNSTMVG 448
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
WI+ YS RRYG+S + AW +L +V
Sbjct: 449 TGLTPEGIEQNDMIYEFMNEIGWSSQPINLTKWISNYSDRRYGQSNTDARMAWQILLRSV 508
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YNCT + N +V P ++ + + Y
Sbjct: 509 YNCTQILHNHNHSPLVRRPSLNMNT-------------------------------DICY 537
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QL 380
+ +++ A + L S T+ YDL+D+TR+A+ + +E +L I EAY QL
Sbjct: 538 NKADIYEAWRFMHNASFALGKSATFLYDLVDITREAVQQLVSEYYLEIKEAYGKKSLQQL 597
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
A GV +L+ ++D LL+ GFLLG WL++AK +A + Y+ NAR QI
Sbjct: 598 MTAGGVL-----VYDLLPELDSLLSSQPGFLLGSWLKAAKSMASTPAEAALYDMNARNQI 652
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
T+W + DY NK + GL++DYY R ++ ++++SL G+ F + +
Sbjct: 653 TLWGPTGN-----ILDYANKQYGGLVQDYYTERWGLFVWFLVQSLNKGEHFNQDKFNKAV 707
Query: 501 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
L D+ Y GD L + +Y KY
Sbjct: 708 FVLEEDFVYNGKEYMASPTGDTLEIANKIYLKY 740
>gi|1479983|gb|AAB36605.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|119581236|gb|EAW60832.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
CRA_a [Homo sapiens]
Length = 639
Score = 355 bits (910), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 200/573 (34%), Positives = 304/573 (53%), Gaps = 76/573 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 100 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 159
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 160 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 216
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 217 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 276
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 277 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 336
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S P AW +L +
Sbjct: 337 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 396
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + NR +V P L+ TS +
Sbjct: 397 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 425
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 426 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 485
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 486 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 545
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ +YY PR ++ + +++S+ G F+ + + +L
Sbjct: 546 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQL 600
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 536
+ + YP + GD + ++ ++ KY G
Sbjct: 601 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPG 633
>gi|66346698|ref|NP_000254.2| alpha-N-acetylglucosaminidase precursor [Homo sapiens]
gi|317373322|sp|P54802.2|ANAG_HUMAN RecName: Full=Alpha-N-acetylglucosaminidase; AltName:
Full=N-acetyl-alpha-glucosaminidase; Short=NAG;
Contains: RecName: Full=Alpha-N-acetylglucosaminidase 82
kDa form; Contains: RecName:
Full=Alpha-N-acetylglucosaminidase 77 kDa form; Flags:
Precursor
Length = 743
Score = 354 bits (909), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 199/570 (34%), Positives = 303/570 (53%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ +YY PR ++ + +++S+ G F+ + + +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQL 704
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
>gi|355568706|gb|EHH24987.1| Alpha-N-acetylglucosaminidase, partial [Macaca mulatta]
Length = 711
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 201/570 (35%), Positives = 303/570 (53%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 172 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 231
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 232 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 288
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 289 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 348
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 349 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 408
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ +RYG S P AW +L +
Sbjct: 409 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 468
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + NR +V P L+ TS +
Sbjct: 469 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 497
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY +
Sbjct: 498 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 557
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + EL+ +D LLA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 558 SLLRAGGVLAYELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 617
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + + +L
Sbjct: 618 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 672
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + ++ ++ KY
Sbjct: 673 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 702
>gi|402900329|ref|XP_003913130.1| PREDICTED: alpha-N-acetylglucosaminidase [Papio anubis]
Length = 743
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 201/570 (35%), Positives = 302/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPEAARLFPNSTMVG 440
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ +RYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 500
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 529
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY +
Sbjct: 530 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 589
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + EL+ +D LLA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + + +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 704
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
>gi|348562747|ref|XP_003467170.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Cavia porcellus]
Length = 750
Score = 353 bits (906), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 199/569 (34%), Positives = 308/569 (54%), Gaps = 74/569 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP++W +QL LQ +IL R+ LGM PVLPAF+G+VP A+ VFP IT
Sbjct: 211 MGNLHGWGGPLPRTWHLKQLSLQHQILDRMRALGMTPVLPAFAGHVPKAIGRVFPQVNIT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + ++E+G T+HIY DTF+E PP
Sbjct: 271 QLGSWGHFNCS--YSCSFLLAPEDPLFPLIGGIFLRELIREFG-TNHIYGADTFNEMQPP 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A++ M + DSDAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 328 SSDPAYLAAATEAVFKAMVAVDSDAVWLLQGWLFQHQPEFWGPAQVGAVLGAVPQGRLLV 387
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 388 LDLFAESQPVYTRTASFRGQPFIWCMLHNFGGNHGLFGALEAVNRGPTAARLFPNSTMVG 447
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+++++ RRYG + P + AW +L +
Sbjct: 448 TGITPEGIGQNEVVYALMAELGWRKDPVPDLLAWVSRFAERRYGVAQPDAEAAWRLLLRS 507
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNC+ A + P+ + L+ T+ +W
Sbjct: 508 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTA------VW 537
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
Y+ S+V A L + + +L+ S T+RYDL+D+TRQAL + + + + AY + G
Sbjct: 538 YNRSDVFEAWRLLLKASPKLTTSPTFRYDLLDVTRQALQELVSLYYEEVRAAYLHQELAG 597
Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + +L+ +D +LA FLLG WL A+ A +E + + YE N+R Q+T+W
Sbjct: 598 LLRAGGVLAYQLLPALDEVLASDHHFLLGSWLAQARAAAASETEARLYEQNSRYQLTLW- 656
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
E ++L DY NK +GL+ YY PR ++ + + +SL F+ + ++ L
Sbjct: 657 ---GPEGNIL-DYANKQLAGLVAHYYAPRWQLFIESLADSLARAAPFQQHQFDKDVFLLE 712
Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ Y + GD + ++ ++ ++
Sbjct: 713 QAFVLSSRRYRSQPQGDTVDLARKVFLRF 741
>gi|114667172|ref|XP_523654.2| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Pan
troglodytes]
gi|410216584|gb|JAA05511.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410258938|gb|JAA17435.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410304442|gb|JAA30821.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410337929|gb|JAA37911.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
Length = 743
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 199/570 (34%), Positives = 302/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + + +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 704
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
>gi|355754184|gb|EHH58149.1| Alpha-N-acetylglucosaminidase, partial [Macaca fascicularis]
Length = 650
Score = 352 bits (904), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 201/570 (35%), Positives = 302/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 111 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 170
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 171 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 227
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 228 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 287
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 288 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 347
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ +RYG S P AW +L +
Sbjct: 348 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 407
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + NR +V P L+ TS +
Sbjct: 408 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 436
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY +
Sbjct: 437 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 496
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + EL+ +D LLA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 497 SLLRAGGVLAYELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 556
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + + +L
Sbjct: 557 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 611
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + ++ ++ KY
Sbjct: 612 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 641
>gi|426348060|ref|XP_004041658.1| PREDICTED: alpha-N-acetylglucosaminidase [Gorilla gorilla gorilla]
Length = 743
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 199/570 (34%), Positives = 302/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIEAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + + +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 704
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPQGDTVDLAKKIFLKY 734
>gi|397485721|ref|XP_003813989.1| PREDICTED: alpha-N-acetylglucosaminidase [Pan paniscus]
Length = 682
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 199/570 (34%), Positives = 302/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 143 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 202
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 203 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 259
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 260 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 319
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 320 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNEGPEAARLFPNSTMVG 379
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S P AW +L +
Sbjct: 380 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 439
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + NR +V P L+ TS +
Sbjct: 440 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 468
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 469 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 528
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 529 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 588
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + + +L
Sbjct: 589 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 643
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + ++ ++ KY
Sbjct: 644 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 673
>gi|432926094|ref|XP_004080826.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oryzias latipes]
Length = 882
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 203/574 (35%), Positives = 312/574 (54%), Gaps = 85/574 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N++ +GGPLPQSW QL LQ +IL R+ GM PVLPAFSGNVP + + P A +T
Sbjct: 345 MANMYKFGGPLPQSWHVNQLRLQFRILERMRAFGMIPVLPAFSGNVPKGILKLHPEANVT 404
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W + C+Y+LD DPLF++IG ++ Q +K++G T HIYN DTF+E TPP
Sbjct: 405 RLGPWAHFNCS--FSCSYVLDPRDPLFLQIGSLYLSQVVKQFG-TDHIYNTDTFNEMTPP 461
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ ++++ M + D A+WLMQGWLF D FW+PPQ++ALL+ VPLG+++V
Sbjct: 462 SSDPAYLSAISRSVFASMTAVDPKAIWLMQGWLFFSDAAFWKPPQIRALLHGVPLGRMIV 521
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFAE +P++S ++ FYG P+IWCMLHNF GN +G ++SI GP +A +N+TM
Sbjct: 522 LDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNNGFFGTVESINSGPFKALNFKNSTMVG 581
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W + Y+ RRYG ++ AW +L+ +V
Sbjct: 582 IGMTPEGIHQNPVIYELMSELAWRKESVNLTKWASLYAARRYGSMHESLSAAWKLLFSSV 641
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YNCT V + N+ P+ + T LWY
Sbjct: 642 YNCT-----------------------VPHYRNHNH-SPLVRRPSFNMNTG------LWY 671
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QL 380
++++ +LF+ + L + T+RYDL+D+TRQ L + +I +A+ +L
Sbjct: 672 DPADLLETWKLFMEAAPSLMSKETFRYDLVDVTRQVLQDLTTYFYQDIKDAFHSKKMPEL 731
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
+ GV +L +++ LL FLLG WLE A+ A +E + + Y+ NAR Q+
Sbjct: 732 LTSGGVL-----IYDLFPELNRLLNSDRNFLLGTWLEQAQSFALDEPEARLYDLNARNQL 786
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
T+W + + + DY NK W GL+ DYY R +++ + +++ L SG F+ + +
Sbjct: 787 TLWGPSGE-----ILDYANKEWGGLVEDYYAQRWSLFVQTLVDCLNSGLPFKQDAFNQAV 841
Query: 501 IKLTNDW-QNGRNVYPVESNGDALITSQWLYNKY 533
++ + NGR YP + GD + ++ KY
Sbjct: 842 FRVEKGFISNGRK-YPTKPQGDTYEIAHRIFLKY 874
>gi|354485058|ref|XP_003504701.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cricetulus griseus]
gi|344251941|gb|EGW08045.1| Alpha-N-acetylglucosaminidase [Cricetulus griseus]
Length = 740
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 204/575 (35%), Positives = 303/575 (52%), Gaps = 86/575 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 203 MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRAFGMIPVLPAFAGHVPKAITRVFPQVNVF 262
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E P
Sbjct: 263 QLGSWGHFNCS--YSCSFLLAPGDPVFPLIGSLFLRELIKEFG-TDHIYGADTFNEMQPI 319
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P ++++ AA+Y M S D DA+WL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 320 SSDPSFLTAATAAVYEAMISVDPDAIWLLQGWLFQHQPQFWGPAQVKAVLQAVPRGRLLV 379
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE P++ + FYG P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 380 LDLFAESHPVYMQTASFYGQPFIWCMLHNFGGNHGLFGALEAVNQGPRAARIFPNSTMVG 439
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
W+++++ RYG S P + AW +L +
Sbjct: 440 TGIAPEGIGQNEMVYALMAELGWRKDPVPDLEVWVSRFASHRYGMSHPDAEAAWRLLLRS 499
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC + NR +V P + + I +
Sbjct: 500 VYNCPGETYNGHNRSPLVKRPSLQINTI-------------------------------V 528
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ----- 379
WY+ S+V A L + + L+ S +RYDL+D+TRQ+L + + + A+
Sbjct: 529 WYNRSDVFEAWRLLLTAAPNLTTSKAFRYDLLDVTRQSLQELVSLFYEEARIAFMKEELD 588
Query: 380 -LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
L A G+ ++R+ L+ +D LLA FLLG WL A+ +A +E++ + YE N+
Sbjct: 589 LLLRAGGI--ITRK---LLPALDELLASDSRFLLGTWLNQARAMAVSEDEAQFYELNSLY 643
Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
Q+T+W E +++ DY NK +GL+ DYY PR ++ + + SL G FR ++ +
Sbjct: 644 QLTLW----GPEGNIM-DYANKQLAGLVADYYQPRWGLFMEALAHSLARGVPFRQHEFEK 698
Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
L + + YP GD + S+ L+ KY
Sbjct: 699 NVFPLELAFIINKKRYPSHPQGDTVDLSKKLFLKY 733
>gi|395532374|ref|XP_003768245.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Sarcophilus
harrisii]
Length = 726
Score = 352 bits (902), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 201/570 (35%), Positives = 299/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL SW +Q LQ +IL R+ GM PVLPAF+G+VP A VFP A +T
Sbjct: 187 MGNLHSWGGPLSSSWHRKQSSLQYQILERMRSFGMKPVLPAFAGHVPKAFTRVFPQAYVT 246
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + C+YLL DPLF +G F+ + +E+G T HIY+ DTF+E PP
Sbjct: 247 HLGMWGHFNCT--YSCSYLLAPEDPLFPVVGSLFLRELTQEFG-TDHIYSADTFNEMEPP 303
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW+PPQ+KA+L +VPLG+L+V
Sbjct: 304 SSEPAYLAAATAAVYEAMIAVDVDAVWLLQGWLFQHQPDFWKPPQVKAVLKAVPLGRLLV 363
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDL+AE KP++S + FYG P+IWCMLHNF GN ++G LD++ GP +A N+T
Sbjct: 364 LDLYAESKPVYSRTDSFYGQPFIWCMLHNFGGNHGLFGALDAVNRGPSDAWLFPNSTFVG 423
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ +RYG + AW +L +
Sbjct: 424 TGIVPEGINQNEVVYALMAELGWQKGPLPDLGAWVAGFAAQRYGTPHSHAEAAWKLLLQS 483
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ D T NR +V P + I +
Sbjct: 484 VYNCSGDLCTGHNRSPLVKRPSLHLDI-------------------------------SV 512
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + L++S +RYDL+D+TRQ + + + + A++
Sbjct: 513 WYNRSDVFEAWRLLLEAAPVLASSPAFRYDLLDVTRQVAQELVSLYYEELRTAFEAGAMP 572
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + +L+ +D LLA + FLLG WLE A+++A +E + QY+ NA Q+T+W
Sbjct: 573 ALLTAGGLLVFDLLPSLDELLASDERFLLGAWLEQAREMAVSEAEAWQYKQNALYQLTLW 632
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+ DY NK +GL+ YY PR ++ + +++SL G F + E + L
Sbjct: 633 GPTGN-----ILDYANKQLAGLVAGYYAPRWKLFVEMLVKSLAEGTPFHQNQFESEALLL 687
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
++ GR +P + GD + + + +Y
Sbjct: 688 GQNFVLGREKFPTQPQGDTVDLVKKFFLRY 717
>gi|297701096|ref|XP_002827555.1| PREDICTED: alpha-N-acetylglucosaminidase [Pongo abelii]
Length = 836
Score = 352 bits (902), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 199/570 (34%), Positives = 301/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 297 MGNLHSWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 356
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HI+ DTF+E PP
Sbjct: 357 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIFGADTFNEMQPP 413
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 414 SSEPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 473
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 474 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 533
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S P AW +L +
Sbjct: 534 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 593
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + NR +V P L+ TS +
Sbjct: 594 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 622
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 623 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 682
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 683 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 742
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + + +L
Sbjct: 743 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 797
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + ++ ++ KY
Sbjct: 798 EQAFVLSKQRYPSQPQGDTVDLAKKIFLKY 827
>gi|256422141|ref|YP_003122794.1| alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
gi|256037049|gb|ACU60593.1| Alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
Length = 728
Score = 351 bits (900), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 197/570 (34%), Positives = 300/570 (52%), Gaps = 77/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGPLPQ W D VLQ++IL +GM P+LPAF+G+VP A ++ +P+ +I
Sbjct: 195 MGNIDAWGGPLPQHWKDSHKVLQQQILAAERSMGMLPILPAFTGHVPPAFKDKYPN-EIV 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW D + Y+LD P+F +IG+ F+E Q K +G T H Y+ DTF+EN PP
Sbjct: 254 KPTNW-----DAGFPDVYILDPNSPMFDKIGKKFLEAQTKAFG-TDHFYSADTFNENVPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
++ ++ +Y+ M + D AVW+MQGW+F Y+ +W PQ++ALLN+VP ++V
Sbjct: 308 SSDSSFLDAMSRKVYASMAAADPKAVWVMQGWMFHYNASYWHQPQIRALLNAVPDDHMIV 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
LDL++E P W ++ +YG P+IW MLHNF GN M+G +D+ A P A
Sbjct: 368 LDLYSESHPEWRNTQAYYGKPWIWNMLHNFGGNTGMWGGMDAAAHDPATALHDPASGKMS 427
Query: 230 ------------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
R W+ Y+ +RYG A+ AW +LYHT
Sbjct: 428 GIGLTPEGIEQNPALYQLMIDNVWRDQPINVDTWLQSYAKQRYGAENEAVNKAWQILYHT 487
Query: 266 VY--NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
VY T+GA + +IVA P +D ++ E V
Sbjct: 488 VYIGGPTEGAPES---IIVARPTLD-----------------IAAERV---------KTK 518
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
L Y ++V+ A +LFI + +L + ++YDL+DLTRQ L YA+ L + AY+ D
Sbjct: 519 LEYDPAKVVPAWDLFINAAAQLKPTEGFKYDLVDLTRQVLGNYASPLQQRVATAYRNKDL 578
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
Q S +F+ L++DMD LL +GFLLG W+ A+ ++ YE+NA+ +T+W
Sbjct: 579 AAFKQYSTQFIGLLDDMDMLLGTQEGFLLGKWVSDARSNGITPAEQDLYEFNAKDLVTLW 638
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
D + S + +Y N+ W+GL++ +Y PR +F + SL+ G+ LK + +
Sbjct: 639 GD----KDSPVHEYSNRQWNGLIKGFYKPRWQQFFTLLESSLKKGETADLKAFEEQVKAF 694
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
W NG + Y V+ GDA+ + L+ KY
Sbjct: 695 EWKWANGHDKYAVKPQGDAVKAAVQLHKKY 724
>gi|395827009|ref|XP_003786703.1| PREDICTED: alpha-N-acetylglucosaminidase [Otolemur garnettii]
Length = 756
Score = 350 bits (899), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 201/569 (35%), Positives = 296/569 (52%), Gaps = 74/569 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWGGPLPFSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QL +W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 264 QLSSWGHFNCS--YSCSFLLAPGDPIFSLIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D DAVWL+QGWLF + P FW P Q+KA+L +VPLG+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMIAVDPDAVWLLQGWLFQHQPQFWGPTQIKAVLRAVPLGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++S + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 381 LDLFAESQPVYSRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNQGPKAARLFPNSTMVG 440
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S + AW +L +
Sbjct: 441 TGMAPEGINQNEVVYALMAELGWRKDPVPDLVAWVTSFADRRYGISHGDAEAAWRLLLRS 500
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNC+ A + P+ K L+ T+ +W
Sbjct: 501 VYNCSGEACSGHNH------------------------SPLVKRPSLQMNTT------VW 530
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
Y+ S+V A L + S L+AS +RYDL+D+TRQA+ + + + AY +
Sbjct: 531 YNRSDVFEAWRLLLTSAPTLAASPIFRYDLLDITRQAIQELVSLYYEKARTAYLNKELVP 590
Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + EL+ +D +LA + FLLG WL A+ +A +E + YE N+R Q+T+W
Sbjct: 591 LLRAGGLLAYELLPALDEVLASDNHFLLGSWLAQARAVAISEAEANFYEQNSRYQLTLW- 649
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ DY NK +GL+ DYY PR ++ + + L G F+ + + + L
Sbjct: 650 ----GPVGNILDYANKQLAGLVADYYAPRWQLFMQALGNCLAQGIPFQQRQFDKNVFPLE 705
Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ YP + G+ + ++ ++ KY
Sbjct: 706 QAFVLNSKRYPSQPQGNTMDLAKKIFLKY 734
>gi|295132875|ref|YP_003583551.1| hypothetical protein ZPR_1010 [Zunongwangia profunda SM-A87]
gi|294980890|gb|ADF51355.1| predicted protein [Zunongwangia profunda SM-A87]
Length = 750
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 207/578 (35%), Positives = 303/578 (52%), Gaps = 96/578 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGPLPQSW + LQKKIL R ELGM PVLPAF+G+VPA+ + FP A +
Sbjct: 206 MGNLDGWGGPLPQSWKESHRDLQKKILKRSRELGMKPVLPAFTGHVPASFKKFFPDADLK 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + D TY+LDA DPLF EIG+ F+E+Q + +G T H Y DTF+EN PP
Sbjct: 266 KT-NWGNDFGD-----TYILDAEDPLFAEIGKRFLEKQEEVFG-TDHFYTADTFNENEPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVV 179
D P+Y+ L I+ GM++ D +A W+MQGWLF S+ FW+ PQ+K LL++VP ++++
Sbjct: 319 SDDPKYLGELSEKIFEGMKAADPEATWVMQGWLFYSHKDFWKTPQIKGLLSTVPDDRMII 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
LDL E++P+W ++ FYG +IW MLHNF GNI M+G ++++A P A
Sbjct: 379 LDLATEIEPVWKQTEAFYGKQWIWNMLHNFGGNISMFGRIETVAEQPALALNDSTSGNLK 438
Query: 230 ------------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
R + +W+ Y+ RYG +I +AW++L T
Sbjct: 439 GIGLTMEAIEQNPVLYELMTDNTWRDTPIELKSWLKNYTRNRYGAVNDSILEAWDILVAT 498
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
YN T D +I A P ++ Y + + +
Sbjct: 499 AYNGT-TIRDGAESIIAARP------------TFEGYRR--------------WARTKMN 531
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
Y +++ A +LFI + + S+ + YDL+DL+RQ LA YA + + AY+ ND
Sbjct: 532 YDPLDLLPAWDLFIGARDRFKDSDGFAYDLVDLSRQVLANYALPVQQQMRIAYENNDKEA 591
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF- 444
+ S L L+ D+D LLA FLLGPW+ A+ E++ YE NAR IT+W
Sbjct: 592 FKKHSEELLTLISDLDRLLATRKDFLLGPWIADARSWGTTPEEKALYERNARDLITLWGG 651
Query: 445 -DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG--------FRLKD 495
DN L +Y + WSG+L D+Y PR ++ I +E+ G ++K+
Sbjct: 652 PDNP------LHEYSCRQWSGVLDDFYKPR----WQQFIADVEANWGDFDQEVFDEKIKE 701
Query: 496 WRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
W +W+ N YP + +GD+ ++ LY+KY
Sbjct: 702 WEWKWV-------NKEEAYPTQPSGDSYKVAKALYDKY 732
>gi|405964692|gb|EKC30145.1| Alpha-N-acetylglucosaminidase [Crassostrea gigas]
Length = 859
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 195/598 (32%), Positives = 309/598 (51%), Gaps = 70/598 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+HGWGGP+ Q+W+D QL+LQ KIL R+ GM PVLP F+G+VP A +P A ++
Sbjct: 217 MGNMHGWGGPITQNWIDDQLILQHKILERMRSFGMIPVLPGFAGHVPEATILRYPQANVS 276
Query: 61 QLGNWFSVKSDPRW--------------------CCTYLLDATDPLFIEIGRAFIEQQLK 100
+L +W W CC YLLD DPLF++I FI++
Sbjct: 277 RLTDWAGFNQSFCWHYPTANVSRLRDWGHFNKTYCCNYLLDFNDPLFMKIAVRFIKEMEN 336
Query: 101 EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 160
E+G H+Y+ DTF+E P +S EY++ G +Y ++ DS A+WLMQGWLF FW
Sbjct: 337 EFG-VDHVYSVDTFNEMRPRSNSTEYLALSGRTVYKSLKEADSKAIWLMQGWLFIDGGFW 395
Query: 161 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
+ PQ+KALL +VP G++++LDL++E+ PI++ ++ +YG P+IWCMLH+F G +E+YG L
Sbjct: 396 KQPQIKALLTAVPQGEMIILDLYSEIIPIYTQTESYYGQPFIWCMLHDFGGTMELYGALK 455
Query: 221 SIAFGPVEARTSENTTM---------------------------------AWINQYSVRR 247
I GP R N++M WI++Y + R
Sbjct: 456 LINEGPFNGRAFPNSSMVGLGMTPEGIFQNEVVYEFFTENVWRKAPRDISTWISKYVLNR 515
Query: 248 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE---GKYQN--- 301
YG++ I AW L ++VYN +D D + + I PD PS+ G Y N
Sbjct: 516 YGKTNKFIDLAWQYLKNSVYNNSDNLKDHDSNAI---PDHRPSLSPALHPDLGIYNNTDY 572
Query: 302 -YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 360
+ ++ + +WY+ ++ A ++ + +E S S+ + YD++D+TR
Sbjct: 573 LHDNSINIIVTTLPRMTPLIQQDVWYNPEDLYVAWDIMTLNLDEFSNSSLFMYDIVDVTR 632
Query: 361 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 420
+L + + + +++ A+ D H V + L L+ DMD +L FLLG W+++A
Sbjct: 633 NSLQILSIKYYTDLVYAFGRGDIHAVESHGNQLLGLLSDMDTVLGSDSHFLLGRWIKAAT 692
Query: 421 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 480
A + + ++NAR QIT+W + +RDY K WSGL++DYY PR I+ Y
Sbjct: 693 DNAMDMQDNWFLQFNARNQITLWGPRGE-----IRDYACKQWSGLIKDYYLPRWEIFVNY 747
Query: 481 MIESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGT 537
++ + + + + K+ + + YP E GD++ + L+ KY T
Sbjct: 748 TLDIMAHNKTYNATELDIMIYEKVEFPFSYRLDQYPTEPQGDSVAIVKSLHKKYRPDT 805
>gi|426238065|ref|XP_004012978.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 1 [Ovis aries]
Length = 748
Score = 348 bits (893), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 211/581 (36%), Positives = 301/581 (51%), Gaps = 84/581 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 211 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 271 QMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D AVWL+QGWLF P FW P Q+ A+L +VP G+L+V
Sbjct: 328 SSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQPEFWGPAQVAAVLGAVPRGRLLV 387
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+T+
Sbjct: 388 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPATARRFPNSTLVG 447
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S + AW +L +
Sbjct: 448 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 507
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNC S E + N+ P+ K L T+ +W
Sbjct: 508 VYNC-----------------------SGEECRGHNH-SPLVKRPSLHMVTT------VW 537
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 379
Y+ S+V A L + + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 538 YNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 597
Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
L A G+ EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q
Sbjct: 598 LMRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARLAAVSETEAHFYEQNSRYQ 652
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
+T+W E ++L DY NK +GL+ DYY PR ++ + + ESL G F+ + +
Sbjct: 653 LTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFAETLAESLVQGVPFQQHQFDKN 707
Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
+L + G YP + GD + + L+ KY G F
Sbjct: 708 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKYYPRAGSF 748
>gi|426238067|ref|XP_004012979.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Ovis aries]
Length = 739
Score = 348 bits (893), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 211/581 (36%), Positives = 301/581 (51%), Gaps = 84/581 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 202 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 262 QMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D AVWL+QGWLF P FW P Q+ A+L +VP G+L+V
Sbjct: 319 SSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQPEFWGPAQVAAVLGAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+T+
Sbjct: 379 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPATARRFPNSTLVG 438
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S + AW +L +
Sbjct: 439 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 498
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNC S E + N+ P+ K L T+ +W
Sbjct: 499 VYNC-----------------------SGEECRGHNH-SPLVKRPSLHMVTT------VW 528
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 379
Y+ S+V A L + + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 529 YNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 588
Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
L A G+ EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q
Sbjct: 589 LMRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARLAAVSETEAHFYEQNSRYQ 643
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
+T+W E ++L DY NK +GL+ DYY PR ++ + + ESL G F+ + +
Sbjct: 644 LTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFAETLAESLVQGVPFQQHQFDKN 698
Query: 500 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
+L + G YP + GD + + L+ KY G F
Sbjct: 699 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKYYPRAGSF 739
>gi|444714090|gb|ELW54978.1| Alpha-N-acetylglucosaminidase [Tupaia chinensis]
Length = 724
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 205/571 (35%), Positives = 303/571 (53%), Gaps = 76/571 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ ++L R+ GM PVLPAF G+VP A+ VFP +T
Sbjct: 174 MGNLHTWGGPLPHSWHLKQLYLQHRVLDRMRSFGMIPVLPAFPGHVPKAITRVFPQVNVT 233
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 234 QLGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRELTKEFG-TDHIYGADTFNELQPP 290
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AAIY+ M + D AVWL+QGW+F + P FW P Q+KA+L +VP G+L+V
Sbjct: 291 SSEPSYLAAATAAIYAAMTAVDPGAVWLLQGWIFQHQPDFWGPAQVKAVLEAVPRGRLLV 350
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F G P+IWCMLHNF GN +YG L+++ +GP AR N++M
Sbjct: 351 LDLFAETRPVYLYTASFLGQPFIWCMLHNFGGNHGLYGTLEAVNWGPKAARLFPNSSMVG 410
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ Y+ RRYG S+ + AW +L +
Sbjct: 411 TGMAPEGINQNEVVYALMAELGWRKDPVPDLAAWVTSYADRRYGVSLGDAEAAWRLLLRS 470
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + NR P+ K L+ T+ +
Sbjct: 471 VYNCSGQMCSGHNRS-------------------------PLVKRPSLQMNTT------V 499
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ S+V A L + + L+AS T+RYDL+D+TRQA+ + + + AY +
Sbjct: 500 WYNRSDVFEAWRLLLTAAPTLAASPTFRYDLLDVTRQAVQELVSLYYEEARTAYLNKELV 559
Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + + EL+ D+D LLA F+LG WLE A+ +A +E + + YE N+R Q+T+W
Sbjct: 560 SLLRAGGILVYELLPDLDNLLATDGRFMLGSWLEQARAVAVSETEAQFYEQNSRYQLTLW 619
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+ DY NK +GL+ DYY PR ++ + + SL G F+ + + +L
Sbjct: 620 GPTGN-----ILDYANKQLAGLVADYYAPRWQLFMEMLANSLTQGIPFQQHQFDQNAFQL 674
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
+ YP + GD + ++ ++ KY
Sbjct: 675 EQAFVLSVERYPSQPQGDTVELAKKIFLKYF 705
>gi|344285558|ref|XP_003414528.1| PREDICTED: alpha-N-acetylglucosaminidase [Loxodonta africana]
Length = 744
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 205/575 (35%), Positives = 297/575 (51%), Gaps = 86/575 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 205 MGNLHSWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAVTRVFPQVNVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DP+F IG F+ + E+G T HIY DTF+E PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRELTTEFG-TDHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G L+V
Sbjct: 322 SSEPSYLAAATAAVYEAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGHLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 382 LDLFAETQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGTLETVNQGPAAARLFPNSTMVG 441
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG + AW +L +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVPDLGAWVASFAARRYGGIHQDAETAWRLLLRS 501
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 502 VYNCSGESCSGHNRS-------------------------PLVKRPSLQMNTT------V 530
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 378
WY+ S+V A L +A+ L+AS +RYDL+D+TRQA + + + + AY
Sbjct: 531 WYNRSDVFEAWRLLLATTPALAASPAFRYDLLDVTRQAAQELVSFYYGEVRTAYLNKELV 590
Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
L A GV EL+ +D +LA FLLG WLE A+ A +E + +E N+R
Sbjct: 591 HLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARVAAVSEAEAHFFEQNSRY 645
Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
Q+T+W + DY NK +GL+ DYY PR ++ ++ESL F+ + +
Sbjct: 646 QLTLW-----GPVGNILDYANKQLAGLVSDYYTPRWQLFVGALVESLVQDVPFQQRQFDE 700
Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+L + YP + GD + ++ L+ KY
Sbjct: 701 NVFQLEQAFVLNTRRYPTQPKGDTVDLAKRLFLKY 735
>gi|156545487|ref|XP_001606979.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Nasonia vitripennis]
Length = 755
Score = 346 bits (887), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 201/570 (35%), Positives = 297/570 (52%), Gaps = 76/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N GWGGPL Q+W + + LQ I+ R+ ELG+ PVLPAF+G+VP VFP A +T
Sbjct: 221 MGNFRGWGGPLSQAWHNHTIQLQHSIVRRMRELGITPVLPAFAGHVPRDFIRVFPEANVT 280
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + ++CC Y LD TDPLF +GR F++ E+G T+HIYNCD+F+EN P
Sbjct: 281 KVVSWNGFED--QYCCPYSLDPTDPLFKTVGREFLKAYTDEFG-TNHIYNCDSFNENDPH 337
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+Y+S+ G AIYSGM D DA+WLMQGWLF + FW P++KA + SVP+GK+++
Sbjct: 338 TGDLDYLSNTGKAIYSGMTGADPDAIWLMQGWLFVHSEYFWTFPRVKAFVTSVPIGKMII 397
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL +E P + ++G P+IWCMLHNF G + M+G I G EART+ +TM
Sbjct: 398 LDLQSEQFPQYKRFHSYFGQPFIWCMLHNFGGTLGMFGSAGVINKGVFEARTTNGSTMIG 457
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W Y+VRRYG++ +I+ +W L +
Sbjct: 458 TGLTPEGINQNYVIYEFMNEMSYRKKPVVLDNWFENYAVRRYGQADESIRTSWQELGREL 517
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YN DG T G Y ++K L E P WY
Sbjct: 518 YN-YDGKTK-------------------IRGHY-----VITKRPSLNIE------PWYWY 546
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ F+ +GN + +++DL+D+TRQAL A+ ++ +I AY + +
Sbjct: 547 DLKTFLAVWNSFVHAGNGTMKNELFKHDLVDITRQALQITADFIYADIKAAYTQKNLTQL 606
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN--EEQEKQYEWNARTQITMWF 444
S L+L +D++ LA FLLG WLE AK +A + YE+NAR QIT+W
Sbjct: 607 QIASSHLLDLFDDLEKNLASSKDFLLGSWLEDAKAIAPEGATRDRENYEFNARNQITLWG 666
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKL 503
+ + DY NK WSG++ DY+ PR IY K + ES+ +R + ++
Sbjct: 667 PRGE-----IVDYANKQWSGVVADYFKPRWEIYLKELQESIRKQTAVPTAKLKRMIFNQV 721
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + +YP + GD+++ ++ LY K+
Sbjct: 722 ELPFSYSKKLYPTQPKGDSILIAKELYAKW 751
>gi|431890602|gb|ELK01481.1| Alpha-N-acetylglucosaminidase [Pteropus alecto]
Length = 740
Score = 345 bits (884), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 200/569 (35%), Positives = 302/569 (53%), Gaps = 74/569 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 201 MGNLHTWGGPLPFSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ +W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 261 QMDSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 317
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 318 SSEPSYLAAATAAVYQAMTTVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 377
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 378 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVG 437
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S + AW +L +
Sbjct: 438 TGMAPEGIDQNEVVYALMAELGWRKDPVTDLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 497
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNC S + + N+ P+ + L+ T+ +W
Sbjct: 498 VYNC-----------------------SGEDCRGHNH-SPLVRRPSLQMVTT------VW 527
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
Y+ S+V A + + + L+ S + Y+L+D+TRQA+ + + + + AY D
Sbjct: 528 YNQSDVFEAWRMLLTATPTLATSPLFSYELVDITRQAIQELVSLYYEEVRTAYLNKDLVT 587
Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+F+ + EL+ +D +LA FLLG WLE A+ A ++ + YE N+R Q+T+W
Sbjct: 588 LFRAAGILAYELLPSLDNILATDSHFLLGSWLEQARAAAVSKAEASFYEQNSRYQLTLW- 646
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
E ++L DY NK +GL+ +YY PR ++ + ++ESL G F+ + + +L
Sbjct: 647 ---GPEGNIL-DYANKQLAGLIANYYTPRWRLFMEMLVESLVQGIPFQQHQFDKNAFQLE 702
Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ YP + GD + ++ L+ KY
Sbjct: 703 QTFVFSTQRYPNQPQGDTVDLAKKLFLKY 731
>gi|351699889|gb|EHB02808.1| Alpha-N-acetylglucosaminidase, partial [Heterocephalus glaber]
Length = 652
Score = 345 bits (884), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 199/559 (35%), Positives = 297/559 (53%), Gaps = 74/559 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP +W +QL LQ ++L R+ LGM PVLPAF+G+VP A+ VFP +T
Sbjct: 114 MGNLHGWGGPLPHAWHLKQLYLQHRVLDRMRALGMTPVLPAFAGHVPKAVTRVFPQVNVT 173
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + +E+G T H Y DTF+E PP
Sbjct: 174 QLGSWGHFNCS--YSCSFLLAPGDPLFPLIGSLFLRELNREFG-TDHFYGADTFNEMQPP 230
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 231 SSEPAYLAAATAAVYEAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVGAVLGAVPQGRLLV 290
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+T+
Sbjct: 291 LDLFAENQPVYTRTASFGGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTVVG 350
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ +++ +RYG + P AW +L H+
Sbjct: 351 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLSAWVARFAEQRYGVAQPDAVLAWRLLLHS 410
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNC+ A + P+ + L+ T+ +W
Sbjct: 411 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------VW 440
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
Y+ S+V A L + + L+AS +RYDL+D+TRQ L + + + AY + G
Sbjct: 441 YNRSDVFEAWRLLLKATPNLTASPAFRYDLLDVTRQGLQELVSLYYEEARAAYMRQELEG 500
Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + +L+ +D +LA FLLG WLE A+ +A + + YE N+R Q+T+W
Sbjct: 501 LLRAGGVLAYKLLPALDEVLASDHRFLLGSWLEQARAVAVSSAEADLYEQNSRYQLTLW- 559
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
E ++L DY NK +GL+ DYY PR ++ + + SL G F+ + + + L
Sbjct: 560 ---GPEGNIL-DYANKQLAGLVADYYVPRWRLFVETLASSLARGVPFQQQQFNSDVFLLE 615
Query: 505 NDWQNGRNVYPVESNGDAL 523
+ R YP + GD +
Sbjct: 616 QAFVLSRKRYPSQPQGDTV 634
>gi|307192254|gb|EFN75548.1| Alpha-N-acetylglucosaminidase [Harpegnathos saltator]
Length = 741
Score = 344 bits (882), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 193/569 (33%), Positives = 297/569 (52%), Gaps = 79/569 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W ++ + LQ +IL R+ +LG+ PVLPAF+G+VP A +FP+A +T
Sbjct: 194 MGNIRGFGGPLSINWHERTVRLQHRILRRMRDLGIVPVLPAFAGHVPRAFARLFPNANMT 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W K + ++CC YLL+ TDPLF IG F+ + E+G T HIYNCDTF+EN P
Sbjct: 254 KIEPW--NKFEDKYCCPYLLEPTDPLFQTIGEKFLRMYINEFG-TDHIYNCDTFNENEPG 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
Y+S++G +++ M + D A+WLMQGWLF +D FW P++++ L SVP G+++V
Sbjct: 311 NSELAYLSNVGRSVFQAMSTVDPQAIWLMQGWLFVHDFIFWTEPRVRSFLTSVPTGRMLV 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDL +E P + K +YG P+IWCMLHNF G + M+G I E R +TM
Sbjct: 371 LDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRHMNGSTMVG 430
Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
AW Y+ RRYG AW L T+
Sbjct: 431 TGLTPEGINQNYVIYELMNEMAYRHEPVDLDAWFESYATRRYGAWNEYAVAAWKHLGRTI 490
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YN + VI P ++ S P +WY
Sbjct: 491 YNFVGIERIRGHYVITRRPSLNIS-------------------------------PWVWY 519
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ + +F+ + + YR+D++D+TRQAL A+ +++N+++ Y+ + G
Sbjct: 520 NREDFYHTWNVFLKARYGRGNNTLYRHDVVDITRQALQLMADNIYMNVVDCYKRKNITGF 579
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ L+L +D++ +LA FLLG WL AK +A +E++ + YE+NAR QIT+W N
Sbjct: 580 QSHAAALLDLFDDIEAILASGSNFLLGTWLAQAKDMAVDEKERQSYEYNARNQITLWGPN 639
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIKLTN 505
+ +RDY NK WSG++ DY+ PR A + K + +SL + + R ++++
Sbjct: 640 GE-----IRDYANKQWSGVVADYFKPRWAFFLKALEKSLVERTRLNMTEINDRMFLEVEQ 694
Query: 506 DWQNGRNVYPVESNGDAL-----ITSQWL 529
+ +YPV + GD L I S+WL
Sbjct: 695 AFTFSTKLYPVGTKGDTLDIAVKIISKWL 723
>gi|380030624|ref|XP_003698943.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Apis florea]
Length = 769
Score = 343 bits (881), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 196/584 (33%), Positives = 303/584 (51%), Gaps = 82/584 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W ++ + LQ +IL R+ LG+ PVLPAF+G+VP A +FP A +T
Sbjct: 216 MGNMRGFGGPLSSNWHEKSIRLQHRILERMRALGIIPVLPAFAGHVPRAFLRLFPKANVT 275
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + ++CC YLL+ DPLF +IG+ F++ ++E+G T H+YNCDTF+EN P
Sbjct: 276 KSAVWNNFSD--KYCCPYLLEPMDPLFKQIGQQFLKTYIEEFG-TDHVYNCDTFNENEPY 332
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M + DS A+WLMQGWLF +D FW P+ + L S+PLG+++V
Sbjct: 333 TSELKFLRNIGHSIFEAMSNVDSKAIWLMQGWLFYHDSVFWTEPRTRTFLTSIPLGRMIV 392
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL +E P + +YG P+IWCMLHNF G + M+G + I EAR +TM
Sbjct: 393 LDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGGTLGMFGSAEIINHRIFEARNMNGSTMVG 452
Query: 239 --------------------------------WINQYSVRRYG--RSVPAIQDAWNVLYH 264
W Y+ RRYG + AW +
Sbjct: 453 TGLTPEGINQNYVIYELMNEMAYRKRPVNLDKWFENYANRRYGDTKGNEHTVTAWKGFKN 512
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
TVYN +D + + I P+++ S P
Sbjct: 513 TVYNFSDTRRIRGKYAITIRPNLNFS-------------------------------PWR 541
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ I + + + + S YR+D++D+TRQAL A+E++ ++IE++ +
Sbjct: 542 WYNKDAFIHYWYMLLQARDLKRNSTLYRHDVVDVTRQALQLIADEIYTDLIESFNKKNID 601
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
Q ++ L L +D++ +LA + FLLG WL+ AK LA N+E+E YE+NAR QIT+W
Sbjct: 602 LFKQNAKLLLALFDDLEEILASSEDFLLGKWLKMAKDLATNDEEEILYEYNARNQITLW- 660
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK-L 503
+RDY NK WSG++ DY+ PR AI+ + SL +G + + +
Sbjct: 661 ----GPLGEIRDYANKQWSGIVADYFKPRWAIFLNELETSLTTGTRVNTTKMNEQIFENV 716
Query: 504 TNDWQNGRNVYPVESNGDAL-----ITSQWLYNKYLQGTGVFDH 542
+ R +YP ++ GD++ I S+W Y+ YL F H
Sbjct: 717 EEAFTFSRKIYPTKATGDSIDIAERILSEW-YDPYLPFHKTFRH 759
>gi|328778968|ref|XP_623833.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Apis mellifera]
Length = 752
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 196/572 (34%), Positives = 303/572 (52%), Gaps = 85/572 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W D+ + LQ +IL R+ LG+ PVLPAF+G+VP AL +FP A +T
Sbjct: 198 MGNMRGFGGPLNSNWHDKSIRLQHRILERMRALGIIPVLPAFAGHVPRALLKLFPKANVT 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + ++CC YLL+ TDPLF +IG+ F++ ++E+G T H+YNCDTF+EN P
Sbjct: 258 KSAVWNNFSD--KYCCPYLLEPTDPLFKQIGQQFLKTYIEEFG-TDHVYNCDTFNENEPY 314
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M S DS A+WLMQGWLF +D FW P+ + L SVPLG+++V
Sbjct: 315 TSELKFLRNIGHSIFEAMNSVDSKAIWLMQGWLFYHDSVFWTEPRTRTFLTSVPLGRMIV 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL +E P + +YG P+IWCMLHNF G + M+G + I EAR +TM
Sbjct: 375 LDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGGTLGMFGSAEIINHRVFEARNMNGSTMVG 434
Query: 239 --------------------------------WINQYSVRRYG--RSVPAIQDAWNVLYH 264
W ++ RRYG + AW +
Sbjct: 435 TGLTPEGINQNYVIYELMNEMAYRKKPVNLDKWFENFANRRYGDIKGNEHTVTAWKGFKN 494
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
TVYN +D + + VI P+++ P
Sbjct: 495 TVYNFSDTRRIRGKYVITIRPNLN-------------------------------FFPWR 523
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY+ I + + + + S YR+D++D+TRQAL A+E++ ++IE++ +
Sbjct: 524 WYNKDAFIYYWYVLLQARDLKRNSTLYRHDVVDVTRQALQLIADEIYTDLIESFNKKNID 583
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
Q ++ L L +D++ +LA + FLLG WL+ AK LA ++E+E YE+NAR QIT+W
Sbjct: 584 LFKQNAKLLLALFDDLEEILASSEDFLLGKWLKMAKDLATDDEEEILYEYNARNQITLW- 642
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRREWI 501
+RDY NK WSG++ DY+ PR AI+ + SL +G + R+ +R +
Sbjct: 643 ----GPLGEIRDYANKQWSGIVADYFKPRWAIFLNELETSLTTGTRVNTTRIN--KRIFE 696
Query: 502 KLTNDWQNGRNVYPVESNGDAL-----ITSQW 528
+ + R +YP ++ GD++ I S+W
Sbjct: 697 NVEKAFTFSRKIYPTKATGDSIDIAERILSEW 728
>gi|291406137|ref|XP_002719212.1| PREDICTED: alpha-N-acetylglucosaminidase [Oryctolagus cuniculus]
Length = 743
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 207/576 (35%), Positives = 305/576 (52%), Gaps = 86/576 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWAGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAVTRVFPHINVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DP+F IG F+ + +E+G T H+Y DTF+E PP
Sbjct: 264 QLGSWGHFNCS--YSCSFLLAPEDPMFPLIGSLFLRELTREFG-TDHVYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA++ M + D DAVWL+QGWLF + P FW P Q+KA+LN+VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVFEAMIAVDPDAVWLLQGWLFQHQPQFWGPSQVKAVLNAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 381 LDLFAENQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMVG 440
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG + P AW +L +
Sbjct: 441 TGIAPEGISQNEVVYALMAELGWRKEPVPDLEAWVTSFAGRRYGVAHPDAGAAWRLLLRS 500
Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
VYNC+ D NR +V P L+ T+ +
Sbjct: 501 VYNCSGDACRGHNRSPLVRRPS-------------------------LQLNTT------V 529
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 378
WY+ S+V A L + + L++S +RYDL+D+TRQA+ + + + AY
Sbjct: 530 WYNRSDVFEAWRLLLKATPTLASSPAFRYDLLDVTRQAVQELVSLYYEEARTAYLHKELA 589
Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
L A GV EL+ +D +LA FLLG WLE A+ A +E + + YE N+R
Sbjct: 590 TLLRAGGVLA-----YELLPALDRVLATDSRFLLGSWLEQARAAAASEAEAQLYEQNSRF 644
Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
Q+T+W E ++L DY NK +GL+ YY PR ++ + + +SL G F+ + + +
Sbjct: 645 QLTLW----GPEGNIL-DYANKQLAGLVAQYYSPRWQLFLEALADSLARGVPFQQRLFDK 699
Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
+L + YP + GD + +Q ++ KY
Sbjct: 700 LVFRLEQAFVLSSRRYPTQPQGDTVDLAQKIFLKYF 735
>gi|325103828|ref|YP_004273482.1| alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
gi|324972676|gb|ADY51660.1| Alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
Length = 738
Score = 342 bits (876), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 205/578 (35%), Positives = 298/578 (51%), Gaps = 88/578 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ WGGPLPQSW+D LQKKIL R ELGM PVLPAF+G+VP + FP AK+
Sbjct: 195 MNNMDAWGGPLPQSWIDSHKDLQKKILARQRELGMIPVLPAFTGHVPKSFVKKFPEAKVD 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
SV + Y+L+ DP+F +IG F+++Q +EYG T H Y+ D F+E PP
Sbjct: 255 ------SVNWQGNFPNIYMLNPNDPMFSKIGEQFLKEQTREYG-TDHYYSSDIFNELNPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY---DPFWRPPQMKALLNSVPLGKL 177
P+Y+ + +YS M+ D +VW+MQ WLF FW P +M+A L VP KL
Sbjct: 308 SSDPKYLYDISEKVYSSMKKVDPKSVWVMQAWLFVSAHGRKFWTPERMQAFLKPVPDDKL 367
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE---- 233
++LDL+ E +P W ++ +YG ++W MLHNF GNI ++G +IA P +
Sbjct: 368 IILDLYTENRPRWKNTEGYYGKKWVWNMLHNFGGNIGLFGKAQTIASEPARVLSDPMKGN 427
Query: 234 ----NTTM--------------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
TM W N+Y RRYG AW +L
Sbjct: 428 YSGIGLTMEGIEQNPFIYQLMLDHVWNNEPIELEKWTNKYITRRYGVLDNNAVKAWEILL 487
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
+TVY D N+D SI+S G+P ++ S +
Sbjct: 488 NTVY------KDNNKD-----QGAPESILS---------GRPTF------AQNSYWTWTD 521
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
L+Y E +RA + I S ++L S+ ++YD++D+TRQA+A YA L + Y D
Sbjct: 522 LYYDNREFVRAWDYLIKSADKLRNSDGFQYDIVDITRQAMANYATALQRQLAYTYYAGDV 581
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + SRRFLEL+ D+D LLA FLLG W++ AK+ A N+ + K YE+NA+ ++MW
Sbjct: 582 NTYEKESRRFLELLSDLDRLLATRKDFLLGIWIDDAKKWATNDAERKLYEFNAKDLVSMW 641
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR-------LKDW 496
+ + DY + WSGL+ +YY R I+F ++ L++ + + +KDW
Sbjct: 642 ----GHKDITINDYSARQWSGLVENYYKQRWKIFFDQSLQKLKNNEIWDQAEFEKYIKDW 697
Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
EW +W N R YP + GD + S+ +YNKY
Sbjct: 698 --EW-----NWVNRRETYPTNTKGDPVNVSKEMYNKYF 728
>gi|374385255|ref|ZP_09642763.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
12061]
gi|373226460|gb|EHP48786.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
12061]
Length = 736
Score = 338 bits (868), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 198/557 (35%), Positives = 300/557 (53%), Gaps = 52/557 (9%)
Query: 2 SNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ GW GPLP+SW++ LQKKIL R ELGM P+LPAF+G+VP + FP A++ Q
Sbjct: 199 GNIDGWCGPLPKSWMESHEELQKKILARERELGMTPILPAFTGHVPPTFKEHFPEARLRQ 258
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
+ NW + R+ TYLL+A DPLF IG F+E+Q++ +G T H+Y DTF+E PP
Sbjct: 259 V-NW-----EGRFDDTYLLEADDPLFQTIGNRFMEEQIRTFG-TDHLYGADTFNEMFPPS 311
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
+ Y+ + A+Y M + D +AVW+MQGWLF FW+P QMKA L +VP L+VL
Sbjct: 312 EDSTYLDGISKAVYQSMAAVDPEAVWVMQGWLFHDKRDFWKPAQMKAYLGAVPDEHLIVL 371
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP--VEARTSENTTMA 238
DL+ E PIW ++ FYG P+IWCMLHNF G ++G +A P V A ++ +
Sbjct: 372 DLWGEEFPIWDRTEAFYGKPWIWCMLHNFGGRNMLFGNALKLAEEPSRVLADPAKGQLLG 431
Query: 239 W------INQYSV------RRYGRSVPAIQDAWNVLY-HTVYNCTDGATDKNRDVIVAFP 285
I Q V R+ D W Y + Y C D A +K D++
Sbjct: 432 LGAVPEGIEQNPVIYSLLFSHIWRNTAVELDEWFETYLESRYGCRDEAVEKAWDIL---- 487
Query: 286 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 345
+ EG NY ++ + +++ + + Y+ EVI+A + + + + L
Sbjct: 488 ---RKTVYANEG---NYESAITARPTFEKH-NNWAYTDIPYNPVEVIKAWKYLLQAADRL 540
Query: 346 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLA 405
+ YRYDLI + +Q LA YA + E Y+ D + SR F+EL++DMD L+
Sbjct: 541 GENPCYRYDLILVGKQVLANYATIIQQKFGEDYRTKDLPAFTRNSREFMELIDDMDELMG 600
Query: 406 CHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 465
H+ FLLG WLE A+ + +++ YE NAR QIT+W ++L DY +K WSGL
Sbjct: 601 THEAFLLGKWLEDARSWGKTASEKQLYEKNARDQITLWGGKD----AVLHDYASKQWSGL 656
Query: 466 LRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVES 518
+ +Y R ++ + + +++G + R++ W EW+ NG+ YP
Sbjct: 657 FKGFYKGRWQLFIDEVYDCIKTGRKYDHTASDDRVRSWEWEWV-------NGQEKYPAVP 709
Query: 519 NGDALITSQWLYNKYLQ 535
GD ++ S+ ++ KY++
Sbjct: 710 QGDPVVVSERMFGKYIK 726
>gi|403304646|ref|XP_003942904.1| PREDICTED: alpha-N-acetylglucosaminidase [Saimiri boliviensis
boliviensis]
Length = 754
Score = 338 bits (866), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 199/569 (34%), Positives = 303/569 (53%), Gaps = 74/569 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP++W +QL LQ +IL R+ GM PVLPAFSG+VP A+ VFP +T
Sbjct: 214 MGNLHTWDGPLPRAWHIKQLYLQHRILDRMRSFGMIPVLPAFSGHVPRAINRVFPRVNVT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DP+F +G F+ + KE+G T HIY DTF+E PP
Sbjct: 274 QMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRELTKEFG-TDHIYGADTFNEMQPP 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D+DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 331 SSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLV 390
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 391 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPEAARLFPNSTMVG 450
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ +RYG S P AW +L +
Sbjct: 451 TGMAPEGINQNEVVYSLMAELSWRKDPVPDLAAWVTSFATQRYGVSHPDAGAAWRLLLRS 510
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNC+ A + P+ + L+ T+ +W
Sbjct: 511 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------VW 540
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
Y+ S+V A L +++ L+AS T+RYDL+D+TRQA+ + + AY + H
Sbjct: 541 YNRSDVFEAWRLLLSAAATLAASPTFRYDLLDVTRQAVQELVGLYYEEARSAYLSKELHS 600
Query: 386 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + EL+ +D +LA FLLG WLE A+ +A +E + YE ++R Q+T+W
Sbjct: 601 LLRAGGILAYELLPALDEVLASDSHFLLGSWLEQARAVAVSEAEADFYEQSSRYQLTLW- 659
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
E ++L DY NK +GL+ YY PR ++ + + S+ G F + + +L
Sbjct: 660 ---GPEGNIL-DYANKQLAGLVASYYTPRWRLFLEVLAASVAQGIPFPQHQFDKNVFQLE 715
Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + YP + GD + ++ ++ KY
Sbjct: 716 QAFVLSKQRYPSQPRGDTVDLAKKIFLKY 744
>gi|255553488|ref|XP_002517785.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
gi|223543057|gb|EEF44592.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
Length = 360
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 164/359 (45%), Positives = 232/359 (64%), Gaps = 39/359 (10%)
Query: 215 MYGILDSIAFGPVEARTSENTTMA---------------------------------WIN 241
MYGILDSI+ GP+EAR SEN+TM W+
Sbjct: 1 MYGILDSISTGPIEARVSENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSEKVQVLEWLK 60
Query: 242 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 301
YS RRYG++V ++ AW +LYHT+YNCTDG D N D IV FPD DPS+ S ++ Q+
Sbjct: 61 TYSRRRYGKAVHQVEAAWEILYHTIYNCTDGIADHNTDFIVKFPDWDPSVQSGSDTSQQD 120
Query: 302 -----YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 356
+ S+ + + S+ H+WYS +VI AL+LFI G+ L+ S TYRYDL+
Sbjct: 121 NKHIFLHRSGSRRFLFEGPNSTLPQAHIWYSIQKVINALQLFIDGGSHLTGSLTYRYDLV 180
Query: 357 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 416
DLTRQ L+K AN+++++ I A++ NDA + S++F++L++D+D LLA D FL+G WL
Sbjct: 181 DLTRQVLSKLANQVYVDAIIAFRSNDARALNLHSQKFIQLIKDIDVLLASDDNFLIGTWL 240
Query: 417 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 476
ESAK+LA N + +QYEWNARTQ+TMW+D T+ S L DY NK+WSGLL DYY PRA+
Sbjct: 241 ESAKELALNPSEMRQYEWNARTQVTMWYDTTKTNQSKLHDYANKFWSGLLEDYYLPRAST 300
Query: 477 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNG-DALITSQWLYNKYL 534
YF ++++SL+ + F+L++WR +WI +N+WQ G +YP++ +G DAL S+ LY+KY
Sbjct: 301 YFDHLVKSLKQNEKFKLQEWREKWIAFSNEWQAGTKLYPMKGSGDDALAISKALYDKYF 359
>gi|320162905|gb|EFW39804.1| lysosomal alpha-N-acetyl glucosaminidase [Capsaspora owczarzaki
ATCC 30864]
Length = 786
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 194/571 (33%), Positives = 297/571 (52%), Gaps = 71/571 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+ W+ +Q LQ IL R+ GM PVLP+F+G+VP+AL FP+A IT
Sbjct: 238 MGNIKGWGGPISLEWIYKQRNLQVLILQRMRTFGMTPVLPSFAGHVPSALAQHFPNANIT 297
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q +W + ++CC LDA+DPLF +IG F+ Q + YG T+H+YNCD F+E TP
Sbjct: 298 QSSDWNNFPD--QYCCVGFLDASDPLFTQIGAEFLRLQNETYG-TNHLYNCDQFNEMTPA 354
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
Y+ G A+Y M + D AVW+MQGWLF + +W +++ALL+ VP +++
Sbjct: 355 STDLGYLKQAGMAVYQSMTAYDPAAVWVMQGWLFFNEAAWWSNDRVQALLSGVPDDHMII 414
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLF++V P+W+ + +YG P+IW MLH+F GNI +YGIL SI GP A + TM
Sbjct: 415 LDLFSDVTPVWNRLESYYGKPFIWNMLHDFGGNIGLYGILPSINEGPFAALATPGNTMVG 474
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQD-AWNVLYHT 265
W++ + RRYG S PA+ A+ L +
Sbjct: 475 IGLTPEGINQNYILYEFMMENMWRSAPVNLPTWVDAFVGRRYGPSTPAVAKLAYQQLLQS 534
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNCT+G + ++ P V+ +S+ + + +L+
Sbjct: 535 VYNCTNGQYSVTKSLLEIRPAVN-----------------MSRNGFMPT--------NLY 569
Query: 326 YSTSEVIRALELFIA---SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
Y VI A++ +A S +L++ +RYD++D TRQ L+ A + N+ A
Sbjct: 570 YDPGHVILAVDHILAAAKSAPQLASVVPFRYDVVDFTRQMLSNLAIDFHSNLTLALTSKQ 629
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
AH V + + L+ D+D LL FLLGPWL +A+ ++N + E+NAR QIT+
Sbjct: 630 AHLVHLYGQGIVGLIADLDELLVSDAHFLLGPWLAAARSWSENTAAQDLLEFNARNQITL 689
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
W N + + DY +K W+GL+ YY PR ++ + + ES F + ++
Sbjct: 690 WGPNGE-----ITDYASKQWAGLMSSYYRPRWELFVSFASAAAESDLPFNDAAFNAAVLE 744
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ WQ+ + + V GD++ + L KY
Sbjct: 745 VEKAWQHSHHNFTVTPLGDSIAIATRLRAKY 775
>gi|332018247|gb|EGI58852.1| Alpha-N-acetylglucosaminidase [Acromyrmex echinatior]
Length = 686
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 190/571 (33%), Positives = 297/571 (52%), Gaps = 74/571 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W + + LQ +IL R+ +LG+ PVLPAF+G+VP A +FP+A +T
Sbjct: 152 MGNIRGFGGPLSSNWHNYTIRLQHQILQRMRDLGIVPVLPAFAGHVPRAFARLFPNANMT 211
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W K + ++CC YLL+ TDPLF IG F++ + E+G T HIYNCDTF+EN P
Sbjct: 212 KINPW--NKFEDKYCCPYLLEPTDPLFRTIGEKFLQMYIDEFG-TDHIYNCDTFNENEPG 268
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
Y+ ++G +I+S M + DS A+WLMQ WLF +D FW +++A L SVP+G+++V
Sbjct: 269 NTELIYLRNVGHSIFSAMNAVDSKAIWLMQAWLFVHDIMFWTKSRVRAFLTSVPIGRMLV 328
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL +E P + K +YG P+IWCMLHNF G + M+G I E R ++TM
Sbjct: 329 LDLQSEQFPQYDRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRNMNDSTMVG 388
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W Y+ RRYG AW L TV
Sbjct: 389 TGLTPEGINQNYVIYELMNEMAYRHVPVNLDNWFESYATRRYGAWNEYAVAAWQHLGRTV 448
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YN + VI P ++ S+ + WY
Sbjct: 449 YNFIGTQKIRGHYVITRRPSLNISLWT-------------------------------WY 477
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ +F+ + + YR+D++D+TRQAL A+++++ I++ Y+ +
Sbjct: 478 DRKDFYAMWNMFLKARYGRGNNTLYRHDVVDITRQALQLIADDIYMTILDCYKKKNITAF 537
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ LEL +D++ +LA + FLLG WL AK +A NEE+ + YE+NAR QIT+W N
Sbjct: 538 QSSANALLELFDDLESILASGNNFLLGTWLAQAKDIAVNEEERRSYEYNARNQITLWGPN 597
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIKLTN 505
+ +RDY NK WSG++ DY+ R ++ K + +SL + + R + ++
Sbjct: 598 GE-----IRDYANKQWSGVVADYFKLRWELFLKALEKSLIQRIEPNITEINDRIFHEVER 652
Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKYLQG 536
+ +YP+E+ GD + + + +K+ +G
Sbjct: 653 SFTFSTKLYPIETKGDTIDIAMKIISKWYKG 683
>gi|307168312|gb|EFN61518.1| Alpha-N-acetylglucosaminidase [Camponotus floridanus]
Length = 737
Score = 332 bits (851), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 186/555 (33%), Positives = 284/555 (51%), Gaps = 74/555 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W ++ + LQ +IL R+ LG+ PVLPAF+G+VP A +FP+A +T
Sbjct: 221 MGNIRGFGGPLSTNWHNRTIHLQHQILRRMRNLGIVPVLPAFAGHVPRAFARLFPNANMT 280
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + + ++CC YLL+ TDPLF IG F+ + E+G T HIYNCDTF+EN P
Sbjct: 281 KINPWNNFED--KYCCPYLLEPTDPLFQIIGEKFLRMYINEFG-TDHIYNCDTFNENEPG 337
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
Y+ ++ A+++ + + DS A+WLMQ WLF +D FW P++K+ L SVP+G++++
Sbjct: 338 STELIYLRNVSHAVFAAINAVDSKAIWLMQAWLFVHDFMFWTEPRVKSFLTSVPMGRMLI 397
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDL +E P + K +YG P+IWCMLHNF G + M+G I E R +TM
Sbjct: 398 LDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRNMNGSTMVG 457
Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
AW Y+ RRYG W L TV
Sbjct: 458 TGLTPEGINQNYVIYELMNEMAYRHEPVDLDAWFQNYATRRYGAWNEYAVTTWQYLGRTV 517
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YN + V+ P ++ S+ +WY
Sbjct: 518 YNFIGSQRIRGHYVVTRRPSLNISL-------------------------------WIWY 546
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ F+ + + S YR+D++D+TRQAL ++L+ I+++Y+ +
Sbjct: 547 NRKNFYSMWNTFLKARHGRRNSTLYRHDVVDITRQALQLMGDDLYTIILDSYKKRNITAF 606
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ LEL +D++ +LA FLLG WL AK +A NEE+ K YE+NA+ QIT+W N
Sbjct: 607 RSSANALLELFDDLESILASGSNFLLGTWLSQAKDVATNEEERKSYEYNAKNQITLWGPN 666
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKLTN 505
+ +RDY NK WSG++ DY+ PR ++ K + +SL F + + + + K+
Sbjct: 667 GE-----IRDYANKQWSGVMADYFKPRWELFLKALEKSLVENTKFNVTEINNKIFDKVER 721
Query: 506 DWQNGRNVYPVESNG 520
+ YPVE G
Sbjct: 722 PFTFSTKFYPVEPKG 736
>gi|196001339|ref|XP_002110537.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
gi|190586488|gb|EDV26541.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
Length = 757
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 193/567 (34%), Positives = 291/567 (51%), Gaps = 72/567 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ W GPLP W+++Q+ LQ KIL R+ + GM P+LPAF+GN+P AL ++P AKI
Sbjct: 217 MGNIQRWAGPLPHDWINKQITLQVKILDRMRKYGMLPILPAFNGNIPNALTKIYPKAKIV 276
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ WF R+ T LLD D LFI I + FIE+++K YG T H+Y+ D F+E P
Sbjct: 277 KSSPWFGFSK--RYGETALLDPRDKLFIVISKLFIEEEIKAYG-TDHLYSLDLFNEIDPQ 333
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
EY++++ + Y + S D+ AVW+MQGW+F D + W +++A L+ +P G++V+
Sbjct: 334 SKELEYLTAVSKSAYLALNSADTKAVWIMQGWMFYNDNYYWENKRIQAFLSPIPKGRIVI 393
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFAEV+P + S F+G P+IWCML+NF GN MYG ++I G + A +N+TM
Sbjct: 394 LDLFAEVEPQYHRSNSFFGHPFIWCMLNNFGGNAGMYGTFETITEGAISAYDMKNSTMIG 453
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ Y+ RRYG I AW L TV
Sbjct: 454 TGMAPEGIGNNYIMYDLMAEMGWRKIAVDVRDWVVVYTERRYGGLDENIIKAWLRLSETV 513
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YNC D R A P V PS+ + +WY
Sbjct: 514 YNCNDMRQYHCR----ALPAVRPSLKIAND---------------------------VWY 542
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
S ++ A E + + NE + T++YD++D+TRQAL + A ++ + + Y N+ +
Sbjct: 543 SADDIFFAWEHMLRANNEFISEETFQYDIVDVTRQALQELAFIMYKKVTQCYHDNNQETL 602
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+EL DMD LL + FLLG W+ A Q + N ++Q +NA QIT+W
Sbjct: 603 KTAGGELIELFTDMDTLLGTNSHFLLGRWVADALQHSNNISIKQQLRFNALNQITLW--- 659
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
S+L DY NK W+GL+ +Y R ++ K + +S+ + F + + K
Sbjct: 660 -GPSKSILHDYANKMWNGLVDKFYKKRWLMFIKALSDSISNNILFDQQKFNLAVQKFEAA 718
Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
W + N Y S+G ++ S+ L++KY
Sbjct: 719 WASENNTYATTSSGSSVTVSKQLFSKY 745
>gi|390463730|ref|XP_003733088.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Callithrix jacchus]
Length = 830
Score = 328 bits (842), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 199/563 (35%), Positives = 293/563 (52%), Gaps = 69/563 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W PLP +W +QL LQ IL R+ GM PVLP F G+VP A+ VFP +T
Sbjct: 297 MGNLHTWDAPLPHAWHIKQLYLQHWILDRMRSFGMVPVLPMFLGHVPKAITRVFPRVSVT 356
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DP+F +G F+ + KE+G T HIY DTF+E PP
Sbjct: 357 QMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRELTKEFG-TDHIYGADTFNEMQPP 413
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D+DAVWL+QGWLF Y P FW P Q++A+L S P G L+V
Sbjct: 414 SSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQYQPQFWGPAQVRAVLGSAPHGCLLV 473
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 474 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAMNRGPEAARLFPNSTMVG 533
Query: 239 ------WINQYSVR--------------------RYGRSVPAIQDAWNVLYHTVYNCTDG 272
I+Q V YG S P AW +L +VYNC+
Sbjct: 534 TGMAPEGISQNXVVYSLMAELSWXKDPVPDLVAWXYGVSHPDTGAAWRLLLRSVYNCSGE 593
Query: 273 ATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVI 332
A + P+ + L+ T+ +WY+ S+V
Sbjct: 594 ACRGHNH------------------------SPLVRRPSLQMNTT------IWYNQSDVF 623
Query: 333 RALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR 392
A L ++ L+AS T+RYDL+D+TRQ + + + + AY L+ G +
Sbjct: 624 EAWRLLFSAAATLAASPTFRYDLLDVTRQVVQELVSLYYEEARSAY-LSKELGSLLRAGG 682
Query: 393 FL--ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEE 450
L EL+ +D +LA FLLG WLE A+ +A +E + YE N+R Q+T+W E
Sbjct: 683 ILAYELLPALDEVLASDSHFLLGSWLEQARAVAVSEAEADFYEQNSRYQLTLW----GPE 738
Query: 451 ASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNG 510
++L DY NK +GL+ YY PR ++ + + S+ G F+ + + +L +
Sbjct: 739 GNIL-DYANKQLAGLVAHYYAPRRRLFLEALAASVAQGIPFQQHQFDKNVFQLEQAFVLS 797
Query: 511 RNVYPVESNGDALITSQWLYNKY 533
+ YP + GD + ++ ++ KY
Sbjct: 798 KQRYPSQPRGDTVDLAKKIFLKY 820
>gi|194216885|ref|XP_001917396.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Equus caballus]
Length = 744
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 203/575 (35%), Positives = 300/575 (52%), Gaps = 86/575 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPL +SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 205 MGNLHTWDGPLTRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 265 QLGSWGHFNCS--YSCSFLLAPEDPLFPVVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPAYLAAATAAVYQAMTAVDPDAVWLLQGWLFHHQRTFWGPAQVGAVLGAVPRGRLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 382 LDLFAESQPMYIRTASFQGQPFIWCMLHNFGGNQGLFGALEAVNRGPAAARLFPNSTMVG 441
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S + AW +L +
Sbjct: 442 TGMTPEGIGQNEVVYALMAELGWRKDPVADLEAWVTSFAARRYGVSHKDAETAWKLLLRS 501
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNC+ A + P+ K L+ T+ +W
Sbjct: 502 VYNCSAEAYSGHNQ------------------------SPLVKRPSLQMGTT------VW 531
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY-------ANELFLNIIEAY 378
Y+ S+V A L + + L++S + YDL+D+TRQA + A +LN E
Sbjct: 532 YNRSDVFEAWWLLLTAAPALASSPAFLYDLVDVTRQAAQELISLYYEEARTAYLN-KELV 590
Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
L A G+ EL+ +D +LA FLLG WL+ A+++A +E + YE N+R
Sbjct: 591 PLLRAGGILA-----YELLPALDKVLASDSRFLLGSWLKQAREMAVSEAEAHFYEQNSRY 645
Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
Q+T+W E ++L DY NK +GL+ DYY PR ++ + +++SL G F+ + + +
Sbjct: 646 QLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWQLFVEMLVQSLAQGVPFQQQQFDK 700
Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+L + YP + GD + ++ + KY
Sbjct: 701 NAFELEEAFVLSTRRYPSQPQGDTVDLAKKFFLKY 735
>gi|332260899|ref|XP_003279518.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Nomascus leucogenys]
Length = 736
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 186/551 (33%), Positives = 288/551 (52%), Gaps = 76/551 (13%)
Query: 20 LVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYL 79
L LQ ++L R+ +PVLPAF+G+VP A+ VFP +T++G+W + C++L
Sbjct: 216 LFLQHRVLDRMRSSAXDPVLPAFAGHVPEAVTRVFPRVNVTKMGSWGHFNCS--YSCSFL 273
Query: 80 LDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQ 139
L DP+F IG F+ + +KE+G T HIY DTF+E PP P Y+++ A+Y M
Sbjct: 274 LAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMI 332
Query: 140 SGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYG 198
+ D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ + F G
Sbjct: 333 AVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQG 392
Query: 199 VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--------------------- 237
P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 393 QPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMA 452
Query: 238 -------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVA 283
AW+ ++ +RYG S P AW +L +VYNC+ + NR +V
Sbjct: 453 ELGWRKDPVPDLAAWVTSFAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVR 512
Query: 284 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 343
P L+ TS +WY+ S+V A L + S
Sbjct: 513 RPS-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAP 541
Query: 344 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDG 402
L+AS +RYDL+DLTRQA+ + + + AY + + + EL+ +D
Sbjct: 542 SLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLRKELASLLRAGGVLAYELLPALDE 601
Query: 403 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 462
+LA FLLG WLE A+ A +E + YE N+R Q+T+W E ++L DY NK
Sbjct: 602 VLASDSRFLLGSWLELARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQL 656
Query: 463 SGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDA 522
+GL+ +YY PR ++ + + +S+ G F+ + + +L + + YP + GD
Sbjct: 657 AGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDT 716
Query: 523 LITSQWLYNKY 533
+ ++ ++ KY
Sbjct: 717 VDLAKKIFLKY 727
>gi|321472423|gb|EFX83393.1| hypothetical protein DAPPUDRAFT_301977 [Daphnia pulex]
Length = 799
Score = 322 bits (824), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 181/531 (34%), Positives = 285/531 (53%), Gaps = 76/531 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N WGGPL +W L+LQ KIL R+ GM PVLPAF+G+VP A++ V+P+A T
Sbjct: 212 MGNFRAWGGPLSDNWQQATLILQHKILERMRSFGMTPVLPAFAGHVPRAMERVYPNASYT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L +W + + ++CC L T+PLF EIG FI++ E+G + H+YNCD F+E P
Sbjct: 272 HLTSWLNFQD--QYCCPLFLQPTEPLFTEIGSRFIKEMALEFG-SDHVYNCDVFNEVRPT 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P ++SS+G A+++ M + D DA+WLMQGWLF D +W KALL SVP G++++
Sbjct: 329 QADPVFVSSVGTAVFNAMTTADPDAIWLMQGWLFKSDADYWTADLSKALLTSVPQGRMLI 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL AE+ P + FYG P+++C+LHNF G + + G + I+ ++AR N+TM
Sbjct: 389 LDLQAELDPQYIRLNSFYGQPFVFCLLHNFGGTLGLNGAIQIISQRVIDARNFPNSTMVG 448
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W ++Y+VRRYG + A+ AW L ++V
Sbjct: 449 TGLTMEGIDQNYVVYDKMLEMGWRDKVPNLNQWFDEYTVRRYGVNNTAVMSAWRFLQNSV 508
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YN + + + + V+V P + + P +WY
Sbjct: 509 YNDSSRRSFRGQYVLVTRPAL-------------------------------WQLPFVWY 537
Query: 327 STSEVIRALELFIA---SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
+ +VI A + I+ + LS ++ +R+D++DLTRQ++ + + L+ ++E Y ++
Sbjct: 538 NPHDVILAWDHLISGLMTEPLLSNASNFRHDMVDLTRQSMQEIFHLLYSQLLEVYLEKNS 597
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ ++ + ++L++D+D LL FLLG W+ AK E ++ QYEWNAR QIT+W
Sbjct: 598 TAIEGIAYKMIDLLQDLDELLQTGKKFLLGKWIADAKSWGTTEGEKLQYEWNARNQITLW 657
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
+ +RDY K W+G++ DYY PR ++ + M SL+ F K
Sbjct: 658 GPRGE-----IRDYAAKQWAGVVADYYKPRWEVFIREMQMSLDENRAFNKK 703
>gi|301106961|ref|XP_002902563.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
gi|262098437|gb|EEY56489.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
Length = 684
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 190/577 (32%), Positives = 296/577 (51%), Gaps = 74/577 (12%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ KIL R+ E GM P LPAF+G+VP ++ +FP+AK
Sbjct: 121 MGNLRGSWVKGPLPQAFIDSQYALQLKILNRMREFGMIPALPAFAGHVPEEMKALFPNAK 180
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T+ NW + +CC Y+LD +DPL+ +IG+ F+E+Q Y TS +Y CDT++E
Sbjct: 181 FTRSPNWGDFSDE--FCCVYMLDFSDPLYYDIGKTFLEEQRALYDYTSSLYQCDTYNEMD 238
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P P + + A+ M + D++AVWL+QGWLF P +W ++KA L+ V K+
Sbjct: 239 PDFTDPAKLQAASRAVIDSMTAADANAVWLIQGWLFENSPDYWTKNRVKAYLDGVSNEKM 298
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL++EV+P+WS ++G +++C+LHNF GN M G L ++ PV+A N TM
Sbjct: 299 IILDLYSEVRPVWSKMDNYFGKSWVYCVLHNFGGNTGMRGDLATLGTAPVQASRDSNGTM 358
Query: 238 ---------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYH 264
W+ ++ +RY + AW L
Sbjct: 359 IGVGLTMEGIYQNYVVYDLTLQMAWVDTPLDMDEWVPSFAAQRYHSQDVHTERAWGFLLQ 418
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
+VYN T G + +I P + K V + S T
Sbjct: 419 SVYNRTLGFGGVTKSLICLIP----------------HWKLVRDGFMPTSIT-------- 454
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ--LND 382
Y ++ RA + + +G+EL A +TYR+DL+D+TRQ L+ + +L++ E Y+
Sbjct: 455 -YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFLSDHFMAQYLHLKEMYEGKTQP 513
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ------EKQYEWNA 436
AH + + R L +E MD +LA ++ LLG W+ A+ LA+ E + YE+ A
Sbjct: 514 AHQLCAWTERMLLTIERMDEILATNEDSLLGNWIADARALAEESESIESSNLQDYYEYEA 573
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
R Q+T W DN E + DY K W+GL++ YY PR ++ + ++ G +
Sbjct: 574 RNQVTRWGDNNSET---IHDYAGKEWAGLVKGYYLPRWRMWLGEVCQAYTQGRTINKEVV 630
Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
++ I WQ YP + GDAL+ SQ +Y+++
Sbjct: 631 KKARIAFELKWQLSHEHYPTTTVGDALVVSQRIYDEF 667
>gi|255533286|ref|YP_003093658.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346270|gb|ACU05596.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 734
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 189/577 (32%), Positives = 296/577 (51%), Gaps = 92/577 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGP+ ++++ +Q LQKKIL R LGM P+LP+F+G+VP + ++ FP K+
Sbjct: 190 MGNLDAWGGPMSKNFMAKQEALQKKILARERALGMTPILPSFTGHVPPSFKDKFPDIKVN 249
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ + P Y+L+ P+F EIGR F+ + +G T H+Y+ DTF+E TP
Sbjct: 250 TQQ--WGINVSP----AYVLNPETPMFKEIGRKFLTALINTFG-TDHLYSADTFNEMTPV 302
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ Y++ + IY M + D+ AVW+MQGW+F P FW+P QMKAL ++VP KL+V
Sbjct: 303 SNDSTYLNGMAKKIYESMAAVDTQAVWIMQGWMFLDRPNFWQPTQMKALFSAVPQDKLIV 362
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMA 238
LDL +E+ P+WS + FYG +IWCMLHNF G + M+G + I P A + + M+
Sbjct: 363 LDLNSELNPVWSRTDAFYGEKWIWCMLHNFGGRLSMFGDMSRIGNDPAAALKNDQRGKMS 422
Query: 239 ---------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHT 265
W+ Y+ RRYG+ + AW VL +T
Sbjct: 423 GIGLTMEGIEQNPAIYSLMLEHIWNDKPIDLDNWLKGYAQRRYGKRNSNAEKAWEVLKNT 482
Query: 266 VYNCTD--GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
VY+ G +II+ G+P A + + T+
Sbjct: 483 VYSHQPWWGTN---------------TIIT---------GRPTFDAATVWTYTA------ 512
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
+ YS+ E+++A + + +EL +S+ ++YDL+D+TRQ LA YAN L + +Y+ D
Sbjct: 513 IPYSSKELMKAWSYLLTASDELKSSDGFQYDLVDVTRQVLANYANVLQQDFASSYKQKDM 572
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ S +FLEL++D+D LL FLLG W+ +AK L N ++K +E NAR IT+W
Sbjct: 573 ATFNKKSAQFLELIDDIDQLLGTRSDFLLGKWINNAKALGDNPAEKKLFERNARDLITLW 632
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDW 496
D + + +Y K W+G+++ +Y PR +F + +G +KDW
Sbjct: 633 LD----KDCNIHEYACKEWAGMMKGFYKPRWQQFFDEVRLQASAGKEIDQIKFENTIKDW 688
Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+W+ N Y + G+ + ++ LY KY
Sbjct: 689 EWKWV-------NANEAYTDKPTGNPVTVAKALYAKY 718
>gi|255533285|ref|YP_003093657.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346269|gb|ACU05595.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 749
Score = 317 bits (813), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 193/577 (33%), Positives = 283/577 (49%), Gaps = 93/577 (16%)
Query: 2 SNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G GPLP+SW++ LQKKIL R ELGM P+LPAFSG+VP + FP+A++ +
Sbjct: 207 GNIDGLNGPLPKSWMESHEQLQKKILARERELGMKPILPAFSGHVPPTFKARFPNARVDR 266
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
L NW + R+ TY+L DPLF +I F+ +Q K +G T H+Y DTF+E P
Sbjct: 267 L-NW-----EGRFADTYVLHPDDPLFQQIADKFMAEQDKAFGNTDHLYGADTFNEMYLPY 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
Y+ +G A+Y GM D +A+W+MQGW+F FW+P +K L+ VP L++L
Sbjct: 321 TDTAYVRKIGTAVYKGMAKADPEAIWVMQGWMFWDKRDFWKPEVVKNYLSGVPDDNLIML 380
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT----- 235
DLFA+ +PIW+ ++ F+G +IWCMLHNF G +YG L+ I P E N
Sbjct: 381 DLFADEQPIWTKTEAFWGKKWIWCMLHNFGGRNPLYGDLNYIGREPAEMVHDPNRGRLSG 440
Query: 236 -----------------------------TMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
+W+ Y+ RRYG+ P + AW +L+ TV
Sbjct: 441 IGLVPEGIEQNPVVYSLMLEHVWNDQVIDVKSWLVNYAQRRYGQRDPQTEKAWQILHQTV 500
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY--GKPVSKEAVLKSETSSYDHPHL 324
Y EG Y+ +P ++ ++ + D P
Sbjct: 501 Y--------------------------AKEGSYETIISARPTHEK---HADWTGTDLP-- 529
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
Y +++ A + + N ++ Y++DL+ + RQ LA YA L ++ +
Sbjct: 530 -YDGDKLVPAWTYLLNASNRFKNNDCYQFDLVTVGRQVLANYATVLQRLFARDFRNKNLT 588
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ FL L+ DMD L+ FLLG WL AK+ A NE + + YE NAR IT+W
Sbjct: 589 AYRAHTAEFLTLIADMDKLMGTRKDFLLGKWLNDAKKWATNESESRLYEKNARDLITLW- 647
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWR 497
++AS L +Y NK W+GL +YG R + +LE G F R+KDW
Sbjct: 648 --GGKDAS-LHEYANKQWAGLFNGFYGKRWQTFIAETSTALEQGKSFDQEAFETRMKDW- 703
Query: 498 REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
EW +W NGR Y + G+ + S L+ KY+
Sbjct: 704 -EW-----NWVNGREQYTDKPQGNPVTVSIQLHKKYI 734
>gi|157134500|ref|XP_001656341.1| alpha-n-acetylglucosaminidase [Aedes aegypti]
gi|108881379|gb|EAT45604.1| AAEL003150-PA [Aedes aegypti]
Length = 763
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 187/566 (33%), Positives = 296/566 (52%), Gaps = 80/566 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL ++++ LQ +++ + LGM LPAF+G++P ++P AK+T
Sbjct: 211 MGNIRGWGGPLTTNFINFSKKLQNQVIDEMRRLGMVLALPAFAGHLPVQFAQLYPEAKLT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + ++ LD DPLF EIG+ F+ + ++ YG ++HIY CD F+E P
Sbjct: 271 PVENWNGFPA--QYASPLFLDPIDPLFQEIGKRFLTKVIERYG-SNHIYFCDPFNEIQPR 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
S +Y+SS A IY M D AVWL+QGW+F +P+W ++A L +VPLG+++VL
Sbjct: 328 SFSAKYLSSASAGIYKAMNDVDPFAVWLLQGWMFVKNPYWSDVAIRAFLQAVPLGRMLVL 387
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL +E P + ++ ++G P+IWCML NF G + M G +D + + RT+++ TM
Sbjct: 388 DLQSEQFPQYDRTESYHGQPFIWCMLSNFGGTLGMLGSVDLVFQRIRDVRTNDSMTMIGT 447
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W Y+ RYG ++DAW++ +TVY
Sbjct: 448 GITPEGINQNYGLYEFALEMGWNPNIDNVEEWFRTYASVRYGTQDKRLKDAWSMFRYTVY 507
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ + + GKY +P K HP LWY+
Sbjct: 508 SFKEQ--------------------EMMRGKYTFNRRPSLKL-----------HPWLWYN 536
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ ++L + S S + +R D++DLTRQ L A+ L+LNI+EAY + + V
Sbjct: 537 ETLFNAGVQLLLESN---STNTLFRNDVVDLTRQFLQNTADRLYLNIMEAYNTKNPNSVK 593
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
LS F +L+EDMD LL FLLG WLESAK +A+ + ++YE+NAR QIT+W
Sbjct: 594 YLSILFQKLLEDMDRLLRTDQHFLLGRWLESAKAVAETSLERQKYEYNARNQITLWGPQG 653
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---RLKDWRREWIKLT 504
Q + DY NK W+G+++D++ PR ++ M + +E +++D + ++L
Sbjct: 654 Q-----IVDYANKQWAGMVQDFFLPRWKLFLTEMTKDVEQNRTLNEGKVRDKIFKMVELP 708
Query: 505 NDWQNGRNVYPVESNGDALITSQWLY 530
N R YP+ +GDAL+ ++ L+
Sbjct: 709 FCTSNKR--YPIRPDGDALLVARELF 732
>gi|449675146|ref|XP_002156234.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Hydra
magnipapillata]
Length = 646
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 183/567 (32%), Positives = 281/567 (49%), Gaps = 69/567 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL-QNVFPSAKI 59
M NL GWGGPL SW +QL LQ+ I+ R+ GM PVLP F G++P AL +FP++K
Sbjct: 89 MGNLEGWGGPLSSSWYSKQLQLQQNIISRMRSFGMIPVLPGFGGHIPKALVSRLFPTSKY 148
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
+L W K ++ T+LLD DPLF ++G AF+E Q + Y T H+YN D F+E P
Sbjct: 149 YKLKPW--NKFTGKYGGTFLLDPQDPLFKKVGAAFVEMQKQLYNGTDHVYNADIFNEMDP 206
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
P + +I++ +Y+ M + DSDAVWLMQGW+F W+P ++A L ++P GKL++
Sbjct: 207 PQLTSAFITNTSIGVYNAMLASDSDAVWLMQGWMF-LSSVWKPELVEAWLQAIPYGKLII 265
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
LDL +++ P++ + FYG P+IWCM+ NF G +YG L + G + AR + +
Sbjct: 266 LDLASDIYPLYDQTNAFYGHPFIWCMIENFGGTTRLYGQLTGVMKGVISARKTYKSFMIG 325
Query: 236 -------------------TMAWINQ----------YSVRRYGRSVPAIQDAWNVLYHTV 266
M W N+ Y RRYG + DAW +L T+
Sbjct: 326 TGMTPEGINQNDINFELMNEMGWRNEEFNISDWTLSYIKRRYGDYPKMVSDAWLILIDTI 385
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YNC DG + D + P + P + N PV H+WY
Sbjct: 386 YNCNDGRENGGYDGRI--PVMRPQL---------NAKLPV----------------HMWY 418
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
S ++ A +L + + + +T+R DL+ L Q L + + ++ Y V
Sbjct: 419 SIKDLYNAWKLMVKGSDYMPLIDTFRNDLVRLGTQVLEDLSIVFYTQMVSGYFNKSTLNV 478
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ + L+ DMD LLA LLG W++SA+ + + K E+NA+ QIT+W N
Sbjct: 479 EKYGSKITVLLTDMDRLLATDQYSLLGRWIQSARSMGDTLNETKLLEYNAKNQITLWGPN 538
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
+ +RDY NK W+GL+ +Y R ++ ++ +SL+ G + + + ++
Sbjct: 539 GE-----IRDYANKNWAGLVGSFYFERWNMFINFLSDSLKRGVPYDDSAFVSKLLQFEKK 593
Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
W N + + GDA S L Y
Sbjct: 594 WNNEIKEFSADPTGDAFGISHQLLRAY 620
>gi|328867411|gb|EGG15793.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
Length = 1501
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 191/587 (32%), Positives = 300/587 (51%), Gaps = 97/587 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++GWGGPL ++ Q LQ++IL R+ + GM PVLP F+G+VP A ++FP+A IT
Sbjct: 950 MGNVNGWGGPLDYDFIAGQHDLQQQILERMRQYGMKPVLPGFAGHVPRAFMSLFPTANIT 1009
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + TY LD +DPLF + + F++ Q YG T H Y+ D F+E TPP
Sbjct: 1010 QLGDWRAFNG------TYYLDPSDPLFANVSQTFVKVQTAIYG-TDHYYSFDPFNEITPP 1062
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
Y+ + +++Y+ + D AVW++Q W F D FW+PPQ+KA L VP+G L+V
Sbjct: 1063 SSDAGYLQNSSSSMYNALAYADPQAVWVLQAWFFISDAWFWQPPQVKAFLGGVPIGHLLV 1122
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
LD +AE P W+ + QF G +IWCMLHNF G MYG + I GP++A
Sbjct: 1123 LDTWAEESPAWTVTDQFNGHDWIWCMLHNFGGRTGMYGKIPRITAGPIDARKQSPGMKGT 1182
Query: 230 ----------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
RT+ WINQY+ RRYG VP + AWN L TVY
Sbjct: 1183 GLTPEAIEQNYIMYDLMSEMSWRTTAPNMTEWINQYTQRRYGVFVPELAQAWNSLASTVY 1242
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
N D + DKN +F + P + +++Y
Sbjct: 1243 NAPD-SIDKNPS---SFVGIRPELNMTN---------------------------NIYYD 1271
Query: 328 TSEVIRALELFIASGNE-LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+S + +A +L+++ +E + +++TY +D+ ++T QAL+ E + + +AY+
Sbjct: 1272 SSIIQKAWQLYLSVTDEYVLSTSTYSFDIAEITIQALSNLFIETEIAMYDAYKTGKGTEF 1331
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ-----LAQNEEQEK----------Q 431
+ + L ++ DMD + + L+G W +A+Q L++N+++++ Q
Sbjct: 1332 DEHAMNCLNIITDMDMIASTQQLLLVGTWTANARQWANYNLSRNKDEDRNTDKEQMTIEQ 1391
Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM------IESL 485
YE+NAR QIT+W S L DY WSGLL D+Y R +++ KY+ +
Sbjct: 1392 YEFNARNQITLW----GPSNSTLHDYAYHLWSGLLNDFYLARWSLFIKYLDSSLSSSSTN 1447
Query: 486 ESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
++G GF+ +++ + L W YP G+A S+++ N+
Sbjct: 1448 DAGTGFKNQEYINDIESLEESWNLQTYQYPTRPTGNAYQLSKFINNQ 1494
>gi|194759443|ref|XP_001961958.1| GF14678 [Drosophila ananassae]
gi|190615655|gb|EDV31179.1| GF14678 [Drosophila ananassae]
Length = 783
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 187/577 (32%), Positives = 298/577 (51%), Gaps = 87/577 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++IL LGM+ LPAF+G+VP AL + P+ T
Sbjct: 228 MGNIRGWAGPLKPEWRQFQLLLQQEILSAQRNLGMSVALPAFAGHVPRALSRLHPNTSFT 287
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R+CC ++ T+PLF +I F++ + YG ++HI+ CD F+E PP
Sbjct: 288 DVQRWNQFPD--RYCCGLFVEPTEPLFHQIATTFLQSVVTIYG-SNHIFFCDPFNELEPP 344
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAI++ M + D +A+WL+QGW+F +PFW P +A L +VP G+++VL
Sbjct: 345 VAKPEYMRSTAAAIHNSMTAVDPEAIWLLQGWMFVKNPFWTPDMAEAFLTAVPRGRILVL 404
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL +E P + + ++G P+IWCMLHNF G + M+G I G AR+ N+++
Sbjct: 405 DLQSEQFPQYELTHSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEAARSMPNSSIVGT 464
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
+W ++V RYG ++ AW +L ++VY
Sbjct: 465 GITPEGIGQNYVVYSLTLERGWSRNSIDLDSWFRHFTVTRYGVKDESLAKAWLLLKNSVY 524
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH-PHLWY 326
+ + G+Y +P S++H P WY
Sbjct: 525 SFHG--------------------LQKMRGQYVVTRRP------------SFNHDPFTWY 552
Query: 327 STSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
+ S+V+ A L +++ E + Y +DL+D+TRQ L A++L++N+ +++
Sbjct: 553 NASDVLEAWHLLLSARVIIPLEDDRYDVYEHDLVDITRQFLQITADQLYVNLKSSFRKRQ 612
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
LS R L+L +D++ +L+ FLLG WLE AKQ+A + E K +E+NAR QIT
Sbjct: 613 LPRFEFLSTRLLQLFDDLELILSSGRNFLLGNWLEQAKQVAPHPEDRKSFEFNARNQITA 672
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWR 497
W N Q + DY K WSGL++DYY PR +++F + +L S G F+ K +
Sbjct: 673 WGPNGQ-----ILDYACKQWSGLVKDYYKPRWSLFFDDVNVALHSQRPFNGSAFKQKVSQ 727
Query: 498 REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
R I+L + N ++YP + + S ++ +++
Sbjct: 728 R--IELP--FSNKTDIYPTDPVENVWFISHTIFERWM 760
>gi|195577611|ref|XP_002078662.1| GD22403 [Drosophila simulans]
gi|194190671|gb|EDX04247.1| GD22403 [Drosophila simulans]
Length = 778
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 187/569 (32%), Positives = 287/569 (50%), Gaps = 85/569 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ + LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPGWRRYQLLLQQEIITAQHNLGMSVALPAFAGHVPRALKRLHPESTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC ++ TD LF EI F++ + +YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTDNLFKEIASRFLQNIITKYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIDEARRLPNSSLVGT 459
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
+W +S RYG ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNFSHTRYGVKDERLEQAWLLLKNSVY 519
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ + G+Y V+ S P WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548
Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
S V+ A L + S + + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 549 ASAVLDAWHLLLTSRAIIPLEDDRYEIYEHDLVDITRQFLQISADQLYVNLRSAYRKRQV 608
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
LS + L+L +DM+ +LA FLLG WL+ AKQ A N +++ +E+NAR QIT W
Sbjct: 609 ARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGEQRNFEFNARNQITAW 668
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRR 498
+ Q + DY K WSGL+ DYY PR ++ + + +L + G F+LK
Sbjct: 669 GPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLEDVTVALHAGRPYNGTAFKLK--VS 721
Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQ 527
+ I+L + N +VYPV G+ + SQ
Sbjct: 722 QEIELP--FSNKADVYPVTPVGNTWLISQ 748
>gi|383856382|ref|XP_003703688.1| PREDICTED: alpha-N-acetylglucosaminidase [Megachile rotundata]
Length = 744
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 181/521 (34%), Positives = 278/521 (53%), Gaps = 77/521 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPL SW +Q + LQ KIL R+ LG+ PVLP+F+G+VP A +FP+A +T
Sbjct: 194 MGNIRAFGGPLYPSWHEQSINLQHKILERMRSLGIIPVLPSFAGHVPRAFPRLFPNANVT 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W + +CC YLL TDPLF +IG+ F++ ++E+G T HIYNCDTF+EN P
Sbjct: 254 KLAPWNNFPD--VYCCLYLLAPTDPLFQQIGQLFLKTYIEEFG-TDHIYNCDTFNENEPH 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G + + M + D DA+WLMQGWLF++D FW P+++A L SVP G+++V
Sbjct: 311 TSELKFLRNVGHSTFQAMNAVDPDAIWLMQGWLFTHDKLFWTEPRVEAFLTSVPRGRMIV 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL +E P + K ++G P+IWCMLHNF G + M+G I E R +N+TM
Sbjct: 371 LDLQSEQFPQYGRLKSYFGQPFIWCMLHNFGGTLGMFGSAQIINQRVFEGRNMKNSTMVG 430
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W Y+ RRYG AW L TV
Sbjct: 431 TGLTPEGINQNYVIYELMNEMAYRKEPVNLNKWFENYASRRYGVWNEYAVSAWQSLGRTV 490
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YN + + + VI P ++ S + WY
Sbjct: 491 YNFSGTRKIRGKYVISRRPSLNLSTWT-------------------------------WY 519
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ +F+ + + S YR+D++DLTRQ L A E++ +I+++ +
Sbjct: 520 DRDTLYNTWSVFLQARHGRRNSTLYRHDVVDLTRQVLQAKAEEIYPVLIDSFNKKNLTAF 579
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
S + L+L +D++ +LA FLLG WL++AK+LA N+E+ + Y+ NA+ QI++W
Sbjct: 580 KYHSDKLLDLFDDLELILASGKDFLLGKWLDAAKKLASNDEELRLYQVNAKYQISLWGPR 639
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
+ +RDY NK W+G++ DY+ PR +I+ +ESLE+
Sbjct: 640 GE-----IRDYANKQWAGVVADYFKPRWSIF----LESLEN 671
>gi|194863164|ref|XP_001970307.1| GG23441 [Drosophila erecta]
gi|190662174|gb|EDV59366.1| GG23441 [Drosophila erecta]
Length = 778
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 185/551 (33%), Positives = 288/551 (52%), Gaps = 49/551 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPEWRRYQLLLQQEIIAAQRNLGMSVALPAFAGHVPRALKRLHPGSTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC L+ TD LF EI F+++ + YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFLEPTDNLFNEIALIFLQKIITAYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY ++ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESIRRLDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWI 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 241 N-------------QYSVRRYGRSVPAIQDAWNVLY-HTVYNCTDGATDKNRDVIVAFPD 286
+++ R + P D+W + H Y D ++
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNRPLDLDSWFTSFSHARYGVKDERLEQ---------- 509
Query: 287 VDPSIISVTEGKYQNYG-KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 345
+ + + Y +G + + + V+ S P WY+ S V+ A L ++S +
Sbjct: 510 ---AWLQLKNSVYSFHGLQKMRGQYVVTRRPSFKQEPFTWYNASAVLDAWHLLLSSRAII 566
Query: 346 SASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMD 401
+ Y +DL+D+TRQ L A++L++N+ AY+ LS + L+L +DM+
Sbjct: 567 PLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAYKKRQVSRFEFLSSKLLKLFDDME 626
Query: 402 GLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKY 461
+LA FLLG WL+ AKQ A + +++ YE+NAR QIT W + Q + DY K
Sbjct: 627 LILASSRNFLLGNWLQQAKQAAPHPGEQRNYEFNARNQITAWGPDGQ-----ILDYACKQ 681
Query: 462 WSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPV 516
WSGL+ DYY PR ++ + + +L S G F+LK + I+L + N +VYPV
Sbjct: 682 WSGLVSDYYRPRWRLFLEDVTVALHSLRPFNGTAFKLK--VSQEIELP--FSNKVDVYPV 737
Query: 517 ESNGDALITSQ 527
G+ SQ
Sbjct: 738 TPVGNTWFISQ 748
>gi|21356587|ref|NP_652045.1| CG13397, isoform A [Drosophila melanogaster]
gi|442626853|ref|NP_001260251.1| CG13397, isoform B [Drosophila melanogaster]
gi|16185856|gb|AAL13967.1| LP03571p [Drosophila melanogaster]
gi|22945953|gb|AAF52672.2| CG13397, isoform A [Drosophila melanogaster]
gi|440213562|gb|AGB92787.1| CG13397, isoform B [Drosophila melanogaster]
Length = 778
Score = 305 bits (782), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 184/568 (32%), Positives = 286/568 (50%), Gaps = 83/568 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL +W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPAWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLNPESTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC ++ T+ LF EI F+ + +YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLHNIITKYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
+W +S RYG ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNFSHSRYGVKDERLEQAWLLLKNSVY 519
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ + G+Y V+ S P WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548
Query: 328 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
S V+ A L + E + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 549 ASAVLDAWHLLLTFRAIIPLEDNRYEIYEHDLVDITRQFLQISADQLYINLRSAYRKRQV 608
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
LS + L+L +DM+ +LA FLLG WL+ AKQ A N Q++ +E+NAR QIT W
Sbjct: 609 SRFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGQQRNFEFNARNQITAW 668
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+ Q + DY K WSGL+ DYY PR ++ + + +L +G F ++ +K+
Sbjct: 669 GPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLEDVTVALHAGRPFNGTAFK---LKV 720
Query: 504 TND----WQNGRNVYPVESNGDALITSQ 527
+++ + N +VYPV G+ + SQ
Sbjct: 721 SHEIELPFSNKDDVYPVTPVGNTWLISQ 748
>gi|198476648|ref|XP_001357424.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
gi|198137793|gb|EAL34493.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
Length = 767
Score = 305 bits (780), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 183/571 (32%), Positives = 292/571 (51%), Gaps = 77/571 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL + Q +LQ+ IL +LG++ LPAF+G++P A++ ++P+ T
Sbjct: 212 MGNIRGWGGPLKPEYQRLQELLQQHILRAQRDLGISVALPAFAGHLPTAMRRIYPNGNYT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S DP +CC +D DP+F + F+ + ++ YG ++HI+ CD F+E PP
Sbjct: 272 EVERWNSFP-DP-YCCGLFVDPLDPIFDLVAALFLRRVVQRYG-SNHIFFCDPFNELQPP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V P+Y+ S AAI++ M+S D +AVWL+QGW+F + FW M+A L +VP+G+L+VL
Sbjct: 329 VAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNIFWTDAMMEAFLTAVPIGRLIVL 388
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL +E P + + +YG P++WCMLHNF G + M+G D + G AR N+++
Sbjct: 389 DLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGSADLVNNGIEAARRMPNSSIVGV 448
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
+W ++ RYG +Q AW +L +VY
Sbjct: 449 GITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFARTRYGVDDEGLQQAWQLLRRSVY 508
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ K R G V++ L + P WY+
Sbjct: 509 SFR--GLQKMRG-----------------------GYTVTRRPALNLD------PFTWYN 537
Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
S+V+ A +L ++S + + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 538 ASDVLEAWKLLLSSRAIIPLEDDNYAIYEHDLVDITRQYLQISADQLYVNLKSAYRKRQV 597
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
L + L+L D++ +LA FLLG WL A++ A N + +E+NAR QIT W
Sbjct: 598 ARFEYLGSKLLQLFGDLERILASGSNFLLGTWLADAQRAAPNAADKPNFEFNARNQITAW 657
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIK 502
+ Q + DY K WSGL+ DYY PR A++ + +L S F ++ R +
Sbjct: 658 GPDGQ-----ILDYACKQWSGLVLDYYRPRWALFLDDVTLALHSNRTFNSTAFKLRVSQE 712
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + N +VYPVE G+ SQ +Y ++
Sbjct: 713 VELPFSNKSDVYPVEPMGNTWFISQNIYERW 743
>gi|195339231|ref|XP_002036223.1| GM12949 [Drosophila sechellia]
gi|194130103|gb|EDW52146.1| GM12949 [Drosophila sechellia]
Length = 778
Score = 304 bits (779), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 185/569 (32%), Positives = 286/569 (50%), Gaps = 85/569 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTAGWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLHPESTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC ++ T+ LF EI F++ + +YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLQNIITKYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D +A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPEAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W +S RYG ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDGWFTNFSHTRYGVKDERLEQAWLLLKNSVY 519
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ + G+Y V+ S P WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548
Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
S V+ A L + S + + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 549 ASAVLDAWHLLLTSRAIIPLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAYRKRQV 608
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
LS + L+L +DM+ +LA FLLG WL+ AKQ A N +++ +E+NAR QIT W
Sbjct: 609 ARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGEQRNFEFNARNQITAW 668
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRR 498
+ Q + DY K WSGL+ +YY PR ++ + + +L + G F+LK
Sbjct: 669 GPDGQ-----ILDYACKQWSGLVSNYYRPRWRLFLEDVTVALHAGRPYNGTAFKLK--VS 721
Query: 499 EWIKLTNDWQNGRNVYPVESNGDALITSQ 527
+ I+L + N +VYPV G+ + SQ
Sbjct: 722 QEIELP--FSNKIDVYPVTPVGNTWLISQ 748
>gi|195155652|ref|XP_002018715.1| GL25802 [Drosophila persimilis]
gi|194114868|gb|EDW36911.1| GL25802 [Drosophila persimilis]
Length = 767
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 182/571 (31%), Positives = 292/571 (51%), Gaps = 77/571 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL + Q +LQ+ IL +LG++ LPAF+G++P A++ ++P+ T
Sbjct: 212 MGNIRGWGGPLKPEYQRLQELLQQHILRAQRDLGISVALPAFAGHLPTAMRRIYPNGNYT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S DP +CC +D DP+F + F+ + ++ YG ++HI+ CD F+E PP
Sbjct: 272 EVERWNSF-PDP-YCCGLFVDPLDPIFDLVAALFLRRVVQRYG-SNHIFFCDPFNELQPP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V P+Y+ S AAI++ M+S D +AVWL+QGW+F + +W M+A L +VP+G+L+VL
Sbjct: 329 VAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNIYWTDAMMEAFLTAVPIGRLIVL 388
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL +E P + + +YG P++WCMLHNF G + M+G D + G AR N+++
Sbjct: 389 DLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGSADLVNNGIEAARRMPNSSIVGV 448
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
+W ++ RYG +Q AW +L +VY
Sbjct: 449 GITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFARTRYGVDDEGLQQAWQLLRRSVY 508
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ K R G V++ L + P WY+
Sbjct: 509 SFR--GLQKMRG-----------------------GYTVTRRPALNLD------PFTWYN 537
Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
S+V+ A +L ++S + + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 538 ASDVLEAWKLLLSSRAIIPLEDDKYAIYEHDLVDITRQYLQISADQLYVNLKSAYRKRQV 597
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
L + L+L D++ +LA FLLG WL A++ A N + +E+NAR QIT W
Sbjct: 598 ARFEYLGSKLLQLFGDLEHILASGSNFLLGTWLADAQRAAPNAADKPNFEFNARNQITAW 657
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIK 502
+ Q + DY K WSGL+ DYY PR A++ + +L S F ++ R +
Sbjct: 658 GPDGQ-----ILDYACKQWSGLVLDYYRPRWALFLDDVTLALHSNRTFNSTAFKLRVSQE 712
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ + N +VYPVE G+ SQ +Y ++
Sbjct: 713 VELPFSNKSDVYPVEPMGNTWFISQNIYERW 743
>gi|242011515|ref|XP_002426494.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
gi|212510620|gb|EEB13756.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
Length = 1345
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 182/521 (34%), Positives = 282/521 (54%), Gaps = 79/521 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ + L +WL QQL+LQ KIL R+ ELG+ PVLP+F G VP + ++ +P AK+
Sbjct: 836 MGNVRNFSYGLTNNWLQQQLLLQHKILNRLRELGITPVLPSFCGIVPRSFKDSYPFAKLL 895
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W D +CC YLLD+ DPLF + R F+++ + E+G T+HIYNCD F+EN P
Sbjct: 896 EMPKWNKFSRD--YCCPYLLDSNDPLFSVVSRVFLKEYINEFG-TNHIYNCDVFNENKPA 952
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRP-PQMKALLNSVPLGKLVV 179
+S +Y+S++ + IY M S D A WL+QGW+F DPFW ++KA +N+VP G++++
Sbjct: 953 SESLDYLSTISSTIYKAMSSVDPRATWLVQGWMF-IDPFWASLKRVKAFINAVPKGRMLI 1011
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL +++ P + + ++G P+IWC LHNF G + MYG L+ + G + R +N+TM
Sbjct: 1012 LDLQSDLTPQYKRLQSYFGQPFIWCTLHNFGGQLGMYGHLNRVNLGVFKGRKFKNSTMVG 1071
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
WI +Y++RRYG I DAW +L +T+
Sbjct: 1072 IGIAPEGIDQNYIMYDFTLDLALRTKPVDLDDWITKYALRRYGLIEKNILDAWLILKNTL 1131
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YN PD + + S Y + +L S + WY
Sbjct: 1132 YNYN--------------PDSNFRLTSSNVKMYTLVKGEHIAKNILTKFPSLRMNEFTWY 1177
Query: 327 STSEVIRALELF-IASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
+ S ++ E F IAS N + ++++ +++DLID+TRQ +
Sbjct: 1178 NRSIILDIFEKFQIASSNSILSTSSLFQHDLIDVTRQTI--------------------Q 1217
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ S FLEL+ ++D +L FLLG WLESAK +A N+ ++ YE+NAR QIT+W
Sbjct: 1218 IAIENSNMFLELLNELDMILNTGKKFLLGNWLESAKNMATNKLEKDNYEFNARNQITLWG 1277
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL 485
N + +RDY K W+G++ D+Y PR ++F+ + ES+
Sbjct: 1278 SNGE-----IRDYAAKQWAGMIHDFYKPRWKLFFQALNESI 1313
>gi|170060634|ref|XP_001865888.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
gi|167879069|gb|EDS42452.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
Length = 761
Score = 302 bits (773), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 188/571 (32%), Positives = 287/571 (50%), Gaps = 89/571 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL +S+ LQ K++ + GM LPAF+G++P + +FP AK+
Sbjct: 210 MGNIRGWGGPLKESFKTFASDLQAKVVQEMRRFGMILALPAFAGHLPVQFKTLFPQAKLN 269
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + ++ LD DPLF +IG F+ + + YG T HIY D F+E P
Sbjct: 270 PVEVWNGFPA--QYASPLFLDPVDPLFQKIGSKFVAKAIARYG-TDHIYFSDPFNEIQPR 326
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+S Y++S A IY M D AVWL+QGW+ +PFW +KA +VP G+++VL
Sbjct: 327 SESARYLASAAAGIYQAMVDVDPLAVWLLQGWMLVKNPFWSDRAIKAFFTAVPNGRMLVL 386
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
DL +E P + ++ +YG P+IWCML NF G + M G +D + E R++E+ TM
Sbjct: 387 DLQSEQFPQYVRTQSYYGQPFIWCMLSNFGGTLGMLGSVDLVFERIRETRSNESMTMIGT 446
Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W +Y++ RYG +QDAW++ TVY
Sbjct: 447 GITPEGINQNYGLYEFALEMGWNPDISDVDNWFTRYAMVRYGNDDKRLQDAWSIFRSTVY 506
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ + + GKY +P K P +WY+
Sbjct: 507 SFKG--------------------MEMMRGKYTFNRRPSLKL-----------QPWVWYN 535
Query: 328 TSEVIRALELFIA--SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
+ +EL +A NEL +++D++DLTRQ L A++L+L I++ Y L +A
Sbjct: 536 ETRFDEGVELILAVNGSNEL-----FKHDVVDLTRQFLQNTADKLYLTIMDTYTLKNAAA 590
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
S F EL++++D LLA + FLLG WLESAK LA + ++YE+NAR QIT+W
Sbjct: 591 FKHYSNLFKELLQNIDRLLATNTHFLLGRWLESAKSLATTSLERQKYEYNARNQITLWGP 650
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-----RLKDWRREW 500
Q + DY NK WSG+++D++ PR +++ + M +L + R K +R+
Sbjct: 651 QGQ-----IVDYANKQWSGVVQDFFLPRWSLFLQEMELALATNGTINETKVRDKIFRKVE 705
Query: 501 IKLTNDWQNGRNVYPVESNG-DALITSQWLY 530
+ D R YP E++G DAL ++ LY
Sbjct: 706 LPFNTD----RKKYPAEASGEDALELARELY 732
>gi|348681836|gb|EGZ21652.1| hypothetical protein PHYSODRAFT_247428 [Phytophthora sojae]
Length = 991
Score = 301 bits (771), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 183/581 (31%), Positives = 292/581 (50%), Gaps = 78/581 (13%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ +IL R+ E GM P LPAF+G+VP L+ P+A
Sbjct: 422 MGNLRGSWVKGPLPQAFIDNQHELQLRILQRMREFGMIPALPAFAGHVPEDLKLTLPNAN 481
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T+ NW + ++CC Y+++ TDPL+ EIG+AF+E+Q Y TS +Y CDT+ E
Sbjct: 482 FTRSPNWGNFTD--QYCCVYMIEPTDPLYREIGKAFLEEQRALYNYTSSLYQCDTYMEMA 539
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P + A+ GM + D +AVWLMQGW F DP +W P++KA L VP KL
Sbjct: 540 PEFTDLSELKGAARAVIDGMTAADPNAVWLMQGWPFVDDPHYWTRPRVKAYLEGVPTDKL 599
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LD ++E PIW+ ++G +I+ +LHNF GN M G L ++A PV+A+ N TM
Sbjct: 600 IILDFYSEAVPIWNKMDNYFGKNWIYSVLHNFGGNTGMRGDLPTLATAPVQAQRDGNGTM 659
Query: 238 A---------------------------------WINQYSVRRYGRSVPAIQDAWNVLYH 264
W+++Y+ RRY ++ AW+ L
Sbjct: 660 VGVGLTMEGIFQNYVVYDLTLQMAWEDSPLDVDEWVSKYASRRYHTQNEHVERAWSYLSR 719
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
+VYN T +A+ V S++ + Y + + +
Sbjct: 720 SVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR--------------FQPTLI 754
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF--LNIIEAYQLND 382
Y +++ A + + +G+EL +TYR+DL+D+T+Q L+ E + L +I + +
Sbjct: 755 KYDPKDIVLAWKELLLAGDELRNVDTYRHDLVDVTKQFLSNKLLEQYQHLKVIYSAKSAP 814
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN----------EEQEKQY 432
A+ V +L++ L + ++ +LA ++ FLLG W+ A LA + + ++ Y
Sbjct: 815 ANEVCELTKTMLTTINRLEEILATNEDFLLGNWVADALNLAGDLNIGGDSVTRTKLQEYY 874
Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
E+ AR Q+T W DN E + DY K W+GL++ YY PR ++ + +
Sbjct: 875 EYEARNQVTRWGDNNNEA---IHDYAGKEWAGLVKSYYLPRWTMWLTEVCSAYTDRREMD 931
Query: 493 LKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
K ++ I WQ YP + GD+ S+ +Y++Y
Sbjct: 932 EKGLKKRRIAFELKWQLSHEKYPTTTVGDSFSISKRIYSEY 972
>gi|428176410|gb|EKX45295.1| hypothetical protein GUITHDRAFT_51145, partial [Guillardia theta
CCMP2712]
Length = 680
Score = 301 bits (770), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 180/583 (30%), Positives = 297/583 (50%), Gaps = 72/583 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGG L QSW+DQQ LQ KIL R ELGM PVLPAF+G VP ++++FP AK T
Sbjct: 121 MINIKAWGGGLTQSWIDQQRDLQLKILARERELGMLPVLPAFAGGVPEGMKSLFPEAKFT 180
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ GNW CC ++D TDPLF++IG+ F+E+ YG ++HIY+CDTF+EN P
Sbjct: 181 RHGNWGGFAEQH--CCVMMVDPTDPLFLKIGKMFVEEVRAVYG-SNHIYSCDTFNENRPR 237
Query: 121 VDSP----EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG 175
+ +++S A++ M++ D DAVWLMQGWLF D FW+ ++ A L+ VP
Sbjct: 238 SEHGSVGLDFLSHSSRAVFESMRAADPDAVWLMQGWLFMNDARFWQKRELDAYLSGVPED 297
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVP-----YIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
++++LDLF +V P+W P ++W MLH+F GN MYG L I+ PV A+
Sbjct: 298 RMIILDLFTDVFPVWKRRDLQRPTPIEKRRWVWNMLHSFGGNSGMYGRLQVISKDPVVAK 357
Query: 231 TSENT--------------------------------TMAWINQYSVRRYG----RSVPA 254
T M+W+ +++ RR G R A
Sbjct: 358 KESQTMVGVGITTEGIEQNPVVYEMMAEMRWREQEVDVMSWVEKWADRRLGPEASRERKA 417
Query: 255 I-QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
+ ++AW L TVY+C + + ++ + P +D + + + + +EA+++
Sbjct: 418 LGEEAWRELASTVYSCPGTQMGQVKSMVESRPRLDLASGWIPNSDFMPIKRHYPEEALVR 477
Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
+ W ++ L + + S++ +D+ D+TRQ L+ LF
Sbjct: 478 A----------W------LKLLRATRGGADGYTCSSSASFDIADVTRQVLSDLFARLFQP 521
Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
+ Q A + L ++ DMD ++ LLG W+E A+ +++E+E+ E
Sbjct: 522 LSSFCQTRLAGSAAVRMQTLLGIISDMDKMVGTQPRMLLGKWIEDARAWGKSKEEEEVLE 581
Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRL 493
+NAR +T+W + + DY +K W GLL DYY R ++F+++ +++ F
Sbjct: 582 FNARNLVTLWGPRGE-----IADYASKQWQGLLSDYYMSRWKLFFEHLQQAIRGTRIFSQ 636
Query: 494 KDWRREWIKLTNDWQN-GRNVYPVESNGDALITSQWLYNKYLQ 535
+ +++E + WQ + +P G+A+ + L++KY++
Sbjct: 637 QRFQQELLVFEQQWQTRTSSSFPSSPEGNAVELAWQLHDKYIK 679
>gi|158300970|ref|XP_320760.4| AGAP011750-PA [Anopheles gambiae str. PEST]
gi|157013415|gb|EAA00039.4| AGAP011750-PA [Anopheles gambiae str. PEST]
Length = 770
Score = 301 bits (770), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 181/572 (31%), Positives = 292/572 (51%), Gaps = 82/572 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL S+ LQ +++ + LGM LPAF+G++P + ++P+
Sbjct: 213 MGNIRGWGGPLTPSFTQFAHTLQVRVVGEMRRLGMAVALPAFAGHLPVQFRTLYPNVSFA 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + P++ LD T+PLF IG F++ +K YG T H+Y D F+E P
Sbjct: 273 NVSVWNNFP--PQYASPLFLDPTEPLFAAIGSRFLQLAIKTYG-TDHVYFSDPFNEIDPT 329
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ S +Y+SS+ AIYS M D DA+WL+QGW+F +PFW +++ L++VPLG+++VL
Sbjct: 330 LPSGKYLSSVSEAIYSTMVQVDPDAIWLLQGWMFVKNPFWSDRAIRSFLSAVPLGRMLVL 389
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL +E P + + + G P+IWCML NF G + M G + ++ G E R + T+
Sbjct: 390 DLQSEQYPQYGRTASYAGQPFIWCMLSNFGGTLGMLGSVGNVFRGIRETRDNSTYTLLGT 449
Query: 238 ------------------------------AWINQYSVRRYGR-SVPAIQDAWNVLYHTV 266
W ++Y+V RYG S Q AWN+ TV
Sbjct: 450 GITPEGINQNYALYEFALEMGWNAELDSAEQWFSEYAVARYGNDSDERAQQAWNIFLRTV 509
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y AF ++ + GKY +P SK P WY
Sbjct: 510 Y---------------AFEGLE-----LMRGKYTFNRRPSSK-----------IRPWTWY 538
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ LEL ++ E S + +YDL+D TRQ L A+ L+L ++++++ D
Sbjct: 539 DVHTFNQGLELLLSFAEEASCNQLCQYDLVDATRQCLQHTADALYLTLMDSFKKRDLTSF 598
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
S FL+L+ D+D LL ++ FLLGPWLESAK A+ + +YE+NAR QIT+W
Sbjct: 599 RLHSSLFLQLLSDLDVLLRTNEHFLLGPWLESAKAHAETTLERHKYEYNARIQITLWGPQ 658
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKD--WRREWI 501
Q + DY NK W+G+++D++ PR ++ + ++L + + +++D +R +
Sbjct: 659 GQ-----IVDYANKQWAGMVQDFFLPRWRVFLGELDQALATNGTINDLKIRDKIFRTVEL 713
Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+D ++ Y + +GD + T++ LY ++
Sbjct: 714 PFVSDSKH----YATQPSGDTVRTARTLYERW 741
>gi|348681870|gb|EGZ21686.1| hypothetical protein PHYSODRAFT_495971 [Phytophthora sojae]
Length = 692
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 178/577 (30%), Positives = 286/577 (49%), Gaps = 74/577 (12%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ KIL R+ GM P LPAF+G+VP L+ ++P+AK
Sbjct: 121 MGNLRGSWVEGPLPQAFIDGQYELQLKILERMRGFGMVPALPAFAGHVPEELKTLYPNAK 180
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T+ NW + +CC Y+LD DPL+ EIG+ F+E+Q Y TS +Y CDT++E
Sbjct: 181 FTRSPNWGGFSDE--FCCVYMLDPQDPLYYEIGKTFLEEQRALYDYTSSLYQCDTYNEMD 238
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P P + + A+ M + D +AVWL+QGWLF P +W +++ L+ VP K+
Sbjct: 239 PDFTDPAKLQAASRAVIDSMTAADPNAVWLIQGWLFVNSPNYWTKERVQTYLDGVPNDKM 298
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL++EV+P+W+ ++G +I+C+LHNF GN M G L ++ PV A + + TM
Sbjct: 299 IILDLYSEVRPVWNKMDNYFGKSWIYCVLHNFGGNTGMRGDLPTLGTAPVLANRASSGTM 358
Query: 238 ---------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYH 264
W+ ++ +RY + AW L
Sbjct: 359 IGMGLTMEGIFQNYVVYDLTLQMAWVDAPLDMDEWVPSFAAQRYHSQDAHTERAWGFLLQ 418
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
+VYN T G + ++ P + ++ + + +
Sbjct: 419 SVYNRTLGYGGVTKSLVCLIPHWK-----------------LVRDGFMPTLIT------- 454
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-- 382
Y ++ RA + + +G+EL A +TYR+DL+D+TRQ L+ + +L++ + Y +
Sbjct: 455 -YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFLSDHFMAQYLHLEDMYAGKETP 513
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN------EEQEKQYEWNA 436
A + + R L +E +D +LA +D FLLG W+ A+ LA + YE+ A
Sbjct: 514 ADQLCAWTDRMLVTIEWLDEILATNDDFLLGNWVADARALADEVGAAEVTSLQDYYEYEA 573
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
R Q+T W DN E + DY K W+GL+ YY PR ++ + +S
Sbjct: 574 RNQVTRWGDNNSES---IHDYAGKEWAGLVSGYYLPRWRMWLTEVCQSYTQKRDVNEAAL 630
Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
++ + +WQ YP + GD L S+ +Y ++
Sbjct: 631 KKARVDFELNWQLSHERYPTTTTGDTLAVSKRIYEEF 667
>gi|195115262|ref|XP_002002183.1| GI17241 [Drosophila mojavensis]
gi|193912758|gb|EDW11625.1| GI17241 [Drosophila mojavensis]
Length = 773
Score = 294 bits (752), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 183/574 (31%), Positives = 290/574 (50%), Gaps = 83/574 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLP + Q +LQ++IL ELGM+ LPAFSG VP A++ VFP+A T
Sbjct: 212 MGNLRSWGGPLPPAHRQLQQLLQQRILAAQRELGMSVALPAFSGYVPTAMRRVFPNASFT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q W DP +CC ++ DPLF ++G F+ + ++ YG ++HIY D F+E P
Sbjct: 272 QSDRWNHF-PDP-YCCVLFVEPQDPLFQQVGAMFLRRVIQVYG-SNHIYFSDPFNEMMPR 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V P Y+ AIY+ MQ D+DAVWL+QGW+F +W ++A L +VP G+++ L
Sbjct: 329 VREPNYVRYTAKAIYNSMQVVDADAVWLIQGWMFLKSVYWTNDLIEAYLTAVPRGRILAL 388
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
DL +E P + + +YG P++WCML+NF GN+ ++G I G + AR+ N +M
Sbjct: 389 DLQSEQFPQYERTHSYYGQPFVWCMLNNFGGNLGLFGSAQLIPSGIIAARSMPNGSMVGV 448
Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W +++ RYG + + W +L +VY
Sbjct: 449 GITPEGIGQNYALFALTLEQAWSPDELQLEDWFEYFALTRYGVNDTRLSQVWQLLRESVY 508
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ +G+ + GK L S + +P +WY+
Sbjct: 509 SF--------------------------QGRERMRGK-----YTLNKRPSLHHYPWVWYN 537
Query: 328 TSEVIRALELFIASGNELSASNT----YRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
+ V A L + + + ++ Y +DL+D+TRQ L + ++N+ A +
Sbjct: 538 VTMVYEAWRLMLEAKETVPLNDNRRAIYEHDLVDITRQCLQLSFDRFYVNLKSACRHKQL 597
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ V L+ + LEL DM+ +LA + +LLG WLE+AK+LA +EEQ YE+NAR Q+T W
Sbjct: 598 NRVEYLAGKLLELFADMERILASGEHYLLGNWLEAAKRLAPSEEQRPIYEFNARNQLTSW 657
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
N Q + DY K WSGL+ DY+ PR ++ + +I++L++ F ++++ ++
Sbjct: 658 GPNYQ-----IPDYATKQWSGLMSDYFQPRWNMFLEAVIQALKTQTPFNYSEFKQ---RV 709
Query: 504 TND----WQNGRNVYPVESNGDALITSQWLYNKY 533
N+ + N YP G S +Y K+
Sbjct: 710 ENEIELPFSNHTKAYPTSPVGSTWNISHDIYEKW 743
>gi|198433857|ref|XP_002122480.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 880
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 174/455 (38%), Positives = 243/455 (53%), Gaps = 70/455 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP W+ QL+LQ +IL+R+ LGM PVLP F+G++P+A+ N++P A +
Sbjct: 211 MGNLHGWGGPLPSFWIKSQLILQHQILIRMRSLGMIPVLPGFAGHIPSAILNLYPKADVI 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QL +W + CTYLL DPLF IG FI++Q+ EY T+HIYN DTF+E TPP
Sbjct: 271 QLSHWSHFNCT--YSCTYLLQPHDPLFNTIGSMFIKEQMLEYNGTNHIYNADTFNEMTPP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S+ A+Y M D DAVWLMQGWLF ++P FW+ Q KALL VP GK++V
Sbjct: 329 SSDPGYLSNASRAVYDAMAVADPDAVWLMQGWLFHHEPTFWKTAQKKALLTGVPKGKMLV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLF+E P + ++G P++WCMLH+F GN+ YG ++++ P A TS N+TM
Sbjct: 389 LDLFSESYPQY-LPDWYFGQPFLWCMLHDFGGNMGFYGKINTVNTQPGIALTSVNSTMVG 447
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+ +Y++RRY S P WN+L +T+
Sbjct: 448 TGVTPEGINQNYMIYDFMLETGFTVHSVNVTNWLKEYTMRRYNTSSPEAIKTWNILGNTI 507
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL-- 324
YN D FP S+I G PV + + D+P L
Sbjct: 508 YN----------DTKPGFP--SKSLIR---------GSPVKRPTL--------DNPGLPY 538
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
WY S + A + F S N L T RYD +D+TRQ L L+ ++E +
Sbjct: 539 WYQYSSLALAWDNFSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDP 598
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 419
G +L + L+L++D D +L F +G W++ A
Sbjct: 599 G--KLGEQLLDLLDDFDKMLCSDAHFSMGKWIQDA 631
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 103/197 (52%), Gaps = 8/197 (4%)
Query: 338 FIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 397
F S N L T RYD +D+TRQ L L+ ++E + G +L + L+L+
Sbjct: 684 FSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDPG--KLGEQLLDLL 741
Query: 398 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 457
+D D +L F +G W++ AK L E++ YE+NAR Q+T+W N + + DY
Sbjct: 742 DDFDKMLCSDAHFSMGKWIQDAKILGTTAEEKDLYEYNARIQVTLWGPNGE-----ILDY 796
Query: 458 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKLTNDWQNGRNVYPV 516
+K+W L++ YY PR A++ Y+ + + F K + + + + + R+V+P
Sbjct: 797 ASKHWCSLVKHYYRPRWALFVSYLNHAYATKSKFDHKAFASDVFTNVEEPFTKDRSVFPS 856
Query: 517 ESNGDALITSQWLYNKY 533
+ G+A+ ++ +Y K+
Sbjct: 857 TATGNAIELAKDMYIKW 873
>gi|195473052|ref|XP_002088810.1| GE10991 [Drosophila yakuba]
gi|194174911|gb|EDW88522.1| GE10991 [Drosophila yakuba]
Length = 778
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 182/550 (33%), Positives = 288/550 (52%), Gaps = 47/550 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPQWRRYQLLLQQEIIAAQRNLGMSVALPAFAGHVPRALKRLNPDSTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W ++CC ++ + LF EI F+++ + YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--QYCCGLFVEPKENLFNEIALNFLQKIITIYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTSAAIYESMRRIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWI 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 241 N-------------QYSVRRYGRSVPAIQDAWNVLY-HTVYNCTDGATDKNRDVIVAFPD 286
+++ R + P D+W + HT Y D ++ A+
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNKPLDLDSWFTNFSHTRYGVKDERLEQ------AWLQ 513
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELS 346
+ S+ S G + G + V+ S P WY S V+ A L ++S +
Sbjct: 514 LKNSVYSF-RGLQKMRG-----QYVVTRRPSFNQEPFTWYDASAVLDAWHLLLSSRAIIP 567
Query: 347 ASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDG 402
+ Y +DL+D+TRQ L A++L++N+ A++ LS + L+L +DM+
Sbjct: 568 LEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAFRKRQVTRFEYLSTKLLKLFDDMEL 627
Query: 403 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 462
+LA FLLG WL+ AK+ A + ++ +E+NAR QIT W + Q + DY K W
Sbjct: 628 ILASSRNFLLGNWLQQAKRAAPSPGEQTNFEFNARNQITAWGPDGQ-----ILDYACKQW 682
Query: 463 SGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRREWIKLTNDWQNGRNVYPVE 517
SGL+ DYY PR ++ + + +L S G F+LK + I+L + + +VYPV
Sbjct: 683 SGLVSDYYRPRWRLFLEDVTVALHSRRPFNGTAFKLK--VSQEIELP--FSHKVDVYPVT 738
Query: 518 SNGDALITSQ 527
G+ + SQ
Sbjct: 739 PVGNTWLISQ 748
>gi|440800773|gb|ELR21808.1| AlphaN-acetylglucosaminidase (NAGLU) [Acanthamoeba castellanii str.
Neff]
Length = 800
Score = 291 bits (744), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 190/592 (32%), Positives = 287/592 (48%), Gaps = 105/592 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGPL +SW + Q LQKKI+ ++ + A L+ V+P A IT
Sbjct: 199 MGNVQSWGGPLTKSWREGQAELQKKIVQGVWN---EERAVSVRWARAAGLKRVYPHANIT 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W R +LLD DP+F +IG AFI+ Q + YG T HIYN DTF+E PP
Sbjct: 256 LSPTWAHFTDPYR---VWLLDPFDPIFQKIGTAFIDAQTRVYG-TDHIYNADTFNELDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P Y+++ A+Y GM + D A+WLMQGWLF +W ++KA L+ V +++L
Sbjct: 312 SADPTYLAAASNAVYQGMAAADPKALWLMQGWLFR-SVWWSNDRIKAYLSGVKNDNMLIL 370
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
DL+AEV PIWS ++ ++G P++WCMLH+F GN ++YG L IA PV+ART+ +TM
Sbjct: 371 DLYAEVDPIWSKTESYFGKPFVWCMLHDFGGNRDLYGNLTHIATAPVDARTAPGSTMVGT 430
Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W++ Y RYG P+ + AW +L+ + Y
Sbjct: 431 GLTMEAIEQNPVIYELMSEMGWRSAHVDVDDWLDHYVSFRYGADSPSAKKAWRLLHQSAY 490
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
P + SI + ++ S H YS
Sbjct: 491 QN---------------PVIMRSIYTFVPNRH----------------VSRNHH----YS 515
Query: 328 TSEVIRALELFIASGNEL----SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND- 382
++ A L + S EL + + YDL+D+TRQ L + LF +AY L D
Sbjct: 516 PDVLVEAWGLLLQSRLELPNPAQPNGPWEYDLVDVTRQVL----DNLFH---DAYGLLDG 568
Query: 383 AHGVFQLSRR------------FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
A+ + +RR ++++ D+D +LA + +LLG W E A+ A NEE+++
Sbjct: 569 AYDAYVATRRDPFNQVKTIGAALIQILSDIDTVLATNQNYLLGVWTERARSWATNEEEKR 628
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YE+NAR QIT+W N + + DY +K W+GL+ YY PR I+ Y+ +S+ G
Sbjct: 629 LYEFNARNQITLWGPNGE-----INDYASKEWAGLVGTYYRPRWQIFVAYLFDSIAKGTV 683
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH 542
+ + + W N N +P ++ G+ SQ LY +Y+ + H
Sbjct: 684 IDPNKYAADLLLWEQRWNNQTNAFPSQATGNVAEVSQALYARYVSAAELKQH 735
>gi|195398029|ref|XP_002057627.1| GJ18000 [Drosophila virilis]
gi|194141281|gb|EDW57700.1| GJ18000 [Drosophila virilis]
Length = 766
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 178/574 (31%), Positives = 294/574 (51%), Gaps = 83/574 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPLP + Q +LQ+ I+ ELGM+ LPAF+G+VP A++ VFP+A T
Sbjct: 214 MGNIRGWAGPLPPAHRRLQQLLQQLIVRAQRELGMSVALPAFAGHVPTAMRRVFPNANYT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W + ++CC ++ DPLF ++G F+ + ++ YG ++HIY D F+E PP
Sbjct: 274 PAERWNNFPD--QYCCDLFVEPHDPLFQQLGAMFLRRVIQVYG-SNHIYFSDPFNEMQPP 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P Y+ S AIY+ M+ D +AVWL+QGW+F D FW ++A L +VP G+++VL
Sbjct: 331 LAEPGYMRSTAKAIYNSMREVDGNAVWLLQGWMFLKDIFWTDELIEAFLTAVPRGRILVL 390
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
DL +E P + + +YG P++WCML+NF G + ++G I G AR N+++
Sbjct: 391 DLQSEQFPQYQRTHSYYGQPFVWCMLNNFGGTLGLFGSAQFIGSGIASARIMPNSSLVGV 450
Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W + +++ RYG + + AW +L VY
Sbjct: 451 GITPEGIGQNYAIFALTLEQGWSASELQLGDWFDHFALTRYGVNDTRLAQAWQLLRGGVY 510
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ + GKY L +P WY+
Sbjct: 511 SFHG--------------------LQRMRGKY-----------ALNRRPGLNLNPWTWYN 539
Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
S V A +L +AS + ++ Y +DL+D+TRQ L + +++++N+ AY+
Sbjct: 540 GSSVTDAWQLLLASREMVPLTDDRYAIYEHDLVDITRQFLQQSFDQIYVNLRSAYRKEQL 599
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + L+ + LEL++DM+ +LA +LLG WLE+AK+LA +++ YE+NAR Q+T W
Sbjct: 600 NRLEYLAGKLLELLDDMERILASGVHYLLGTWLEAAKKLAPSDKLRPLYEFNARNQLTSW 659
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
N Q + DY K WSGL+ DYY PR A++ + ++++ F D+++ ++
Sbjct: 660 GPNGQ-----ILDYATKQWSGLMCDYYQPRWAMFLDAVTRAMQTHRPFNATDFKQ---RV 711
Query: 504 TND----WQNGRNVYPVESNGDALITSQWLYNKY 533
N+ + N +YP + G+ + S +Y K+
Sbjct: 712 ANEIELPFSNLTKMYPTKPMGNTWLISNDIYIKW 745
>gi|330791218|ref|XP_003283691.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
gi|325086434|gb|EGC39824.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
Length = 712
Score = 288 bits (736), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 174/574 (30%), Positives = 284/574 (49%), Gaps = 83/574 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ WGGP+ WL++Q LQ +IL R+ GM PVLP F+G++P A+Q +FP+A ++
Sbjct: 170 MGNVNNWGGPITMDWLEKQRDLQIQILTRMRAYGMKPVLPGFAGHIPGAIQTLFPTANVS 229
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W T+ LD +DPLF +I + FI + + +G T H YN D F+E PP
Sbjct: 230 ILSTWCEFNG------TFYLDPSDPLFGKITQLFITELIGVFG-TDHYYNFDPFNELAPP 282
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSYDPFWRPPQMKALLNSVPLGKLVV 179
++ +Y+ M + D AVW++QGW + Y FW+ Q +A + VP+G +V
Sbjct: 283 SSDLGFLKQTSQQMYNNMLAADPKAVWVLQGWFIVDYPEFWQANQTQAWFSGVPIGGFIV 342
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL+++V P W+ ++ FYG ++WCMLHNF G MYG + IA P+ AR+ + M
Sbjct: 343 LDLWSDVAPAWNITEYFYGHYWLWCMLHNFGGRSGMYGRIPFIATNPIIARSLSDNMMGT 402
Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
WI QY+ RRYG+ +P + + W + TV+
Sbjct: 403 GLTPEAIEQNVVVYDLMSEMAWRSTAPDLEEWITQYTNRRYGKIMPEVVEVWMSMVDTVF 462
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
N T +N +F + PSI N+G +++Y
Sbjct: 463 NATAYWARRNMGAPESFIALRPSI---------NFGD------------------NVFYD 495
Query: 328 TSEVIRALELF-IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
S + A +F + + + + ++ T+++D+ ++T QAL+ + + + N+I++Y ++D
Sbjct: 496 PSVMFNAWHVFSLVNDSYVISTETFQFDISEITMQALSNFFMDTYFNLIKSYNVSDIESF 555
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA--QNEEQEKQ---------YEWN 435
+ S +E + MD + + LG W A+ A NE Q YE+N
Sbjct: 556 QRESITMMETISFMDLIASTQPELQLGVWTYRARLWAYPDNETPSLQNSSNSATLPYEFN 615
Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
AR Q+T+W S+L DY K W GL+ D+YGPR ++ K +++SLE+ F +
Sbjct: 616 ARNQLTLW----GPSDSVLHDYAFKLWGGLISDFYGPRWNLFLKTLLQSLENRIPFDANN 671
Query: 496 WRREWIKLTNDWQNGRNVYPVESNGDALITSQWL 529
+ L W +YP+ G TS+++
Sbjct: 672 FISNVQALEQQWVLESTIYPILPFGQGYNTSRYI 705
>gi|195050088|ref|XP_001992825.1| GH13491 [Drosophila grimshawi]
gi|193899884|gb|EDV98750.1| GH13491 [Drosophila grimshawi]
Length = 771
Score = 288 bits (736), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 183/574 (31%), Positives = 299/574 (52%), Gaps = 83/574 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLP + Q +LQ++I+ +LGM+ LPAF+G+VP L +FP+A T
Sbjct: 213 MGNIRGWGGPLPPAHRRLQQLLQQRIVQAQRDLGMSVALPAFAGHVPTGLPRIFPTANFT 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W DP +CC ++ +DPLF +G F+ + ++ YG ++HIY D F+E P
Sbjct: 273 SVERWNQFP-DP-YCCALFIEPSDPLFQLVGAQFLRRVIQIYG-SNHIYFSDPFNEMQPR 329
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISS AIY+ M+ D D VWL+QGW+F + +W ++A L +VP G+++VL
Sbjct: 330 IAEPGYISSTARAIYNSMRMVDKDPVWLLQGWMFLDNAYWSDELIEAFLTAVPRGRMLVL 389
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
DL +E P + + +YG P++WCML+NF G + M+G I G + AR+ N++M
Sbjct: 390 DLQSEQFPQYQRTFSYYGQPFVWCMLNNFGGTLGMFGSAHLINAGIMAARSMPNSSMVGV 449
Query: 239 -------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W +Q+++ RYG + + AW +L +VY
Sbjct: 450 GITPEGIGQNYALFALTLEQGWSGSKLELSDWFDQFTLTRYGVNDTDLILAWQLLRGSVY 509
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ + GKY L S P +WY+
Sbjct: 510 HFHG--------------------LQRMRGKY-----------ALNKRPSFNLKPWIWYN 538
Query: 328 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
S V+ A +L +A+ + + Y++DL+D+TRQ L + +++++N+ AY+ +
Sbjct: 539 ASSVVEAWQLLLAANQTIPVEDDRYALYKHDLVDITRQFLQQSFDQVYVNLKSAYRKSQL 598
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
L+ + LEL+ DM+ +LA + +LLG WLE+AK+LA + +Q YE+NAR Q+T W
Sbjct: 599 ARFEYLAAKLLELLADMERILASGEHYLLGNWLEAAKELAPSADQRHIYEFNARNQLTAW 658
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+ Q + DY K WSGL++DYY PR +++ + ++ S F +R+ ++
Sbjct: 659 GPSNQ-----ILDYATKQWSGLMQDYYTPRWSMFLDAVTLAMHSKRPFNATAFRQ---RV 710
Query: 504 TND----WQNGRNVYPVESNGDALITSQWLYNKY 533
N+ + N VYP E G + SQ +++K+
Sbjct: 711 ANEIELPFSNLTKVYPTEPVGSTWLISQEIHDKW 744
>gi|301107007|ref|XP_002902586.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
gi|262098460|gb|EEY56512.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
Length = 736
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 183/572 (31%), Positives = 281/572 (49%), Gaps = 87/572 (15%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ +IL R+ E GM P LPAF+G+VP L+ P+A
Sbjct: 196 MGNLRGSWVKGPLPQAFIDNQHELQLRILERMREFGMIPALPAFAGHVPEELKLRLPNAH 255
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
TQ NW + + CC ++++ TD L+ EIG+ F+++Q + Y TS +Y CDT+ E
Sbjct: 256 FTQSPNWGNFSEEH--CCVFMIEPTDALYREIGKNFLKEQRELYNYTSSLYQCDTYMEMA 313
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P + A+ GM + D +AVWLMQGW F DP FW P++KA L+ VP KL
Sbjct: 314 PEFTDLTELEGAARAVIDGMTAADPNAVWLMQGWPFVDDPHFWTKPRVKAYLDGVPTDKL 373
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LD ++E PIWS ++G +I+ +LHNF GN M G L ++A PV A + N TM
Sbjct: 374 IILDFYSESVPIWSKMDNYFGKSWIYSVLHNFGGNTGMRGDLLTLATAPVLANWAGNGTM 433
Query: 238 A---------------------------------WINQYSVRRYGRSVPAIQDAWNVLYH 264
WI QY+ +RY ++ AW+ L
Sbjct: 434 VGVGLTMEGIFQNYIVYDLTLQMAWVDNPLDVNTWIPQYAAQRYHTHNEHVEQAWSYLLR 493
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
+VYN T +A+ V S++ + Y + + ++K
Sbjct: 494 SVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR--FQPTLIK----------- 529
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA- 383
Y ++V+ A + + + NEL +TYR+DL+D+T+Q L+ E ++++ Y A
Sbjct: 530 -YDPNDVVLAWKELLLAENELRDVDTYRHDLVDVTKQFLSNKLLEQYIHLKGIYNAKKAS 588
Query: 384 -HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+ V L++ L +E ++ +LA ++ FLLG W+ AR Q+T
Sbjct: 589 PNEVCGLTKTMLTTMERLEEILATNEDFLLGNWI-------------------ARNQVTR 629
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
W DN E + DY K W+GL++ YY PR ++ + + K + + I
Sbjct: 630 WGDNNNEA---IHDYAGKEWAGLVKGYYIPRWTMWLSEVCNAYTDKREMNEKALKEKRIA 686
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
WQ G YP + GDA S+ YN+Y+
Sbjct: 687 FELKWQLGHESYPTTTVGDAFTISKRFYNEYI 718
>gi|291515668|emb|CBK64878.1| Alpha-N-acetylglucosaminidase (NAGLU) [Alistipes shahii WAL 8301]
Length = 713
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 187/566 (33%), Positives = 276/566 (48%), Gaps = 72/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W GPLP+ W+D QL LQ++IL R ELGM PVLPAF+G+VP L+ + P A+IT
Sbjct: 179 MSNIDRWQGPLPEEWIDGQLALQQRILARERELGMKPVLPAFAGHVPQELKRLHPDARIT 238
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W D R+ C++ LD DPLF I R F+ +Q + +G T HIY D F+E P
Sbjct: 239 RVSYWGGF--DDRYRCSF-LDPMDPLFAVIQREFLTEQTRLFG-TGHIYGADPFNEIDAP 294
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE ++ + IY M D +AVWL GWLF DP W ++A L +VP +L++
Sbjct: 295 TWDPETLAGMSRHIYESMAEVDPEAVWLQMGWLFYADPTHWTAENIRAFLGAVPQDRLLM 354
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG--------ILDSIAFGPVEAR- 230
LD F E IW +++F+G PY+WC L NF GN + G + D+ A G R
Sbjct: 355 LDYFCEFTEIWKQTEKFHGQPYLWCYLGNFGGNTMLSGNFHTVSARMEDAFAHGGDNLRG 414
Query: 231 --------------------TSENTTMA---WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
+ NT +A WI + + RR G PA + W L +VY
Sbjct: 415 VGSTLEGFGVNQFMYEFVLDKAWNTGIADDEWIARLADRRTGFRDPAARTGWRTLCDSVY 474
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T A + A P + EG + KP T+ Y P LW
Sbjct: 475 --TLPAQTGQSPLTNAHPAL--------EGNWHWTTKP----------TTGYRFPTLW-- 512
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
R E +A +E +TYR+D++++ RQ L Y AY +D +
Sbjct: 513 -----RVWEELLAVDSE---RDTYRFDVVNIGRQVLGDYFLIERDRFAAAYAQHDRKAMD 564
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+RR L+ D++ L ACH F L W+ +A+ + + YE NAR I++W D+
Sbjct: 565 AAARRMTGLLADINLLTACHPEFSLERWIAAARGFGSDNASKDYYETNARMLISVWGDSY 624
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
L DY ++ WSG++ YY PR ++ + ++E+ +G F + + RE W
Sbjct: 625 H-----LTDYASRTWSGMISTYYAPRWRLFIERVMEAARTGRMFDHEAFDREIRDFECRW 679
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
+ + GDA+ T++ L +KY
Sbjct: 680 ADASHPLTFPEAGDAVRTARELASKY 705
>gi|288927792|ref|ZP_06421639.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella sp. oral taxon 317 str. F0108]
gi|288330626|gb|EFC69210.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella sp. oral taxon 317 str. F0108]
Length = 734
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 182/570 (31%), Positives = 278/570 (48%), Gaps = 78/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W GPLP SW+ Q LQKKIL R +LGM PVLPAF+G+VP L+ +P AKIT
Sbjct: 198 MSNIDHWMGPLPMSWIKNQEKLQKKILRRTRDLGMKPVLPAFAGHVPEILKEKYPKAKIT 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W + R + LD D LF +I + +I++Q K YG T HIY D F+E PP
Sbjct: 258 PLSIWGDFEDQYR---CHFLDPFDSLFTDIQKTYIDEQTKLYG-TDHIYGVDPFNELAPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY+++ A IY +++ DS AVWL W+FSY W ++K+ + +VP K ++
Sbjct: 314 SWEPEYLANASAKIYDVLKNADSKAVWLQMTWMFSYQRKDWTDERIKSYITAVPDKKQIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSI------AFGPVEARTSE 233
LD +AE +W S+ +Y P+IWC L NF GN + G + + AF E+
Sbjct: 374 LDYYAERTEVWKFSESYYKQPFIWCYLGNFGGNTMIAGNIAEVDRRLNEAFANAESMVGV 433
Query: 234 NTTMA--------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
+T+ W Q++ RR G + + AW +L +Y
Sbjct: 434 GSTLEGFDVNPIMYDFVFEKVWHKDGISLHDWTVQWAQRRVGTTDENAEKAWKLLIDKIY 493
Query: 268 N----CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
CT+G R PS+ +N+ K
Sbjct: 494 VQYSLCTEGTLTNAR----------PSLTGHGNWTTKNWTK------------------- 524
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
Y+ +++ A L + S + Y+YD++++ RQ L Y L +AY+ D
Sbjct: 525 --YNNRDLLEAWGLLLRS--KAITKIAYKYDIVNIGRQVLGNYFTVLRDEFTQAYERKDI 580
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ L L+ D++ LL FLLGPWL +A+ + +N E+ + YE NAR IT W
Sbjct: 581 SALTIKGNEMLSLLNDLEALLYTSPSFLLGPWLTNAQNMGRNMEESRYYEKNARNIITNW 640
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+TQ A L DYGN+ W+GLL+ YY PR ++ + +I +++ F + + ++
Sbjct: 641 --STQGVA--LNDYGNRTWAGLLQGYYTPRWKMFIEEVISAVKQNKEFNNETFFKKVTDE 696
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
W + YP+++ GD+ + + Y+KY
Sbjct: 697 EWQWISKTENYPIQATGDSYLLANKFYHKY 726
>gi|423292430|ref|ZP_17271008.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
CL02T12C04]
gi|423294620|ref|ZP_17272747.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
CL03T12C18]
gi|392661665|gb|EIY55241.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
CL02T12C04]
gi|392675811|gb|EIY69252.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 284 bits (727), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 173/566 (30%), Positives = 263/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQNKLLL 361
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTT- 236
LD + E +W + +++G PY+WC L NF GN + G + V EN +
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421
Query: 237 -----------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
WI Q + RR G ++ AW +LY ++Y
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPDSVWIEQLADRRIGLRNQQMRRAWKLLYDSIY 481
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
P+ + G ++ LK + P + YS
Sbjct: 482 TA-------------------PAALG--------QGTLMNARPCLKGNGNWTTTPTVAYS 514
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ E+ + +G + TY YD++++ RQ L Y +L E Y +
Sbjct: 515 NETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVLGNYFGKLRDEFAETYSRKQLPLLK 572
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
Q +L+ D++ LL+ FLLG W+E A+ L +E + YE NART ++ W D
Sbjct: 573 QKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLGIDEASKNYYEENARTIVSTWGDKD 632
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
Q L DY N+ W GL+ YY PR ++ +I S+ + F + + + DW
Sbjct: 633 QS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEIDW 688
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
YP E G+A+ + L NKY
Sbjct: 689 VKSHERYPSEPVGNAVEIATLLMNKY 714
>gi|295086519|emb|CBK68042.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 727
Score = 284 bits (727), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 173/566 (30%), Positives = 263/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 361
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTT- 236
LD + E +W + +++G PY+WC L NF GN + G + V EN +
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421
Query: 237 -----------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
WI Q + RR G ++ AW +LY ++Y
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPDSVWIEQLADRRIGLRNQQMRRAWKLLYDSIY 481
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
P+ + G ++ LK + P + YS
Sbjct: 482 TV-------------------PAALG--------QGALMNARPCLKGNGNWTTTPTVAYS 514
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ E+ + +G ++ Y YD++++ RQ L Y +L EAY +
Sbjct: 515 NETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLLK 572
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
Q +L+ D+D LL+ FLLG W+E A+ L +E + YE NART ++ W D
Sbjct: 573 QKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGTDEVSKNYYEENARTIVSTWGDKD 632
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
Q L DY N+ W GL+ YY PR ++ +I S+ + F + + + DW
Sbjct: 633 QS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEIDW 688
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
YP E G+ + + L NKY
Sbjct: 689 VKSHERYPSEPVGNVVEIATLLMNKY 714
>gi|160883168|ref|ZP_02064171.1| hypothetical protein BACOVA_01137 [Bacteroides ovatus ATCC 8483]
gi|156111393|gb|EDO13138.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
Length = 737
Score = 284 bits (726), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 173/566 (30%), Positives = 263/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 196 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 256 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 312 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQNKLLL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTT- 236
LD + E +W + +++G PY+WC L NF GN + G + V EN +
Sbjct: 372 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 431
Query: 237 -----------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
WI Q + RR G ++ AW +LY ++Y
Sbjct: 432 LGSTLEGFDVNPFMYEYVFSKAWDCNLPDSVWIEQLADRRIGLRNQQMRRAWKLLYDSIY 491
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
P+ + G ++ LK + P + YS
Sbjct: 492 TA-------------------PAALG--------QGTLMNARPCLKGNGNWTTTPTVAYS 524
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ E+ + +G + TY YD++++ RQ L Y +L E Y +
Sbjct: 525 NETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVLGNYFGKLRDEFAETYSRKQLPLLK 582
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
Q +L+ D++ LL+ FLLG W+E A+ L +E + YE NART ++ W D
Sbjct: 583 QKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLGIDEASKNYYEENARTIVSTWGDKD 642
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
Q L DY N+ W GL+ YY PR ++ +I S+ + F + + + DW
Sbjct: 643 QS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEIDW 698
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
YP E G+A+ + L NKY
Sbjct: 699 VKSHERYPSEPVGNAVEIATLLMNKY 724
>gi|66801665|ref|XP_629757.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
gi|60463162|gb|EAL61355.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
Length = 798
Score = 282 bits (721), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 175/573 (30%), Positives = 282/573 (49%), Gaps = 81/573 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++GWGGP+ WL++Q LQ KIL R+ + GM PVLP F+G++P A+Q +FP A I+
Sbjct: 255 MGNVNGWGGPITLDWLEKQRDLQIKILERMRQYGMKPVLPGFAGHIPGAIQQLFPQANIS 314
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W + T+ L++TDPLF +I FI + + +G T H YN D F+E PP
Sbjct: 315 VLSTWCNFNG------TFYLESTDPLFAKITTMFIGELIDVFG-TDHFYNFDPFNELEPP 367
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ +Y+ ++Y + D AVW++QGW P FW+ Q +A + VP+G ++V
Sbjct: 368 SNDTDYLRQTSQSMYENVLLADPKAVWVLQGWFIVDAPEFWQAKQTEAWFSGVPIGGVLV 427
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
LDL+++V P W+T+ +YG ++WCMLHNF G MYG L I+ P+ A
Sbjct: 428 LDLWSDVIPGWTTTNYYYGHYWVWCMLHNFGGRSGMYGRLPWISSNPITARGLSPNMVGI 487
Query: 230 ----------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
R+ + W+ QY+ RRYG+ VP I D W L +TV+
Sbjct: 488 GLTPEAIEQNVVVYDMMSEMSWRSVQPNLTEWVTQYTHRRYGKLVPEIVDVWISLVNTVF 547
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
N T N +F + P + +G +S+ +P++ Y+
Sbjct: 548 NATAATARANMGAPESFIALRPQL---------TFGN------------NSFYNPNILYN 586
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
V ++ + ++ T+ +D+ + T Q+L+ Y + + +IEA+ +D +
Sbjct: 587 AWNVFSMVD-----DEYVISTETFEFDISEFTMQSLSNYFMDQYFLLIEAFNASDVQTLS 641
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA---------QNEEQEKQ--YEWNA 436
+S L+++ MD + + LG W A+ A QN YE+NA
Sbjct: 642 TISIELLDIINYMDEIASTQSSLQLGLWTYRARLWAYPTNDIPTLQNSSNSNTAPYEFNA 701
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
R +T+W S+L DY K WSGL+ D+Y PR ++ K +++S+E+ F + +
Sbjct: 702 RNVLTLW----GPSNSVLHDYAFKLWSGLVSDFYSPRWQLFLKSLVQSVENRKPFNKESF 757
Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWL 529
R L W + +YP G A TS+++
Sbjct: 758 NRMVENLEEQWVVQQTIYPTVPVGQAYNTSKYI 790
>gi|383114162|ref|ZP_09934927.1| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
gi|382948607|gb|EFS30964.2| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
Length = 727
Score = 281 bits (718), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 174/566 (30%), Positives = 265/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 361
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTT- 236
LD + E +W + +++G PY+WC L NF GN + G + V EN +
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421
Query: 237 -----------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
WI Q + RR G ++ AW +LY ++Y
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPDSVWIEQLADRRIGLRNQQMRRAWKLLYDSIY 481
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
++ +G N +P K + TS+ + YS
Sbjct: 482 TAP---------------------AALGQGTLMN-ARPCLKGNGNWTTTST-----VAYS 514
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ E+ + +G ++ Y YD++++ RQ L Y +L EAY +
Sbjct: 515 NETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLLK 572
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
Q +L+ D+D LL+ FLLG W+E A+ L + + YE NART ++ W D
Sbjct: 573 QKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGTDGASKNYYEENARTIVSTWGDKD 632
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
Q L DY N+ W GL+ YY PR ++ +I S+ + F + + + DW
Sbjct: 633 QS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEIDW 688
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
YP E G+A+ + L NKY
Sbjct: 689 VKSHERYPSEPVGNAVEIATLLMNKY 714
>gi|237719130|ref|ZP_04549611.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451509|gb|EEO57300.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 737
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 174/566 (30%), Positives = 265/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 196 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 256 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 312 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTT- 236
LD + E +W + +++G PY+WC L NF GN + G + V EN +
Sbjct: 372 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 431
Query: 237 -----------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
WI Q + RR G ++ AW +LY ++Y
Sbjct: 432 LGSTLEGFDVNPFMYEYVFSKAWDCNLPDSVWIEQLADRRIGLRNQQMRRAWKLLYDSIY 491
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
++ +G N +P K + TS+ + YS
Sbjct: 492 TAP---------------------AALGQGTLMN-ARPCLKGNGNWTTTST-----VAYS 524
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ E+ + +G ++ Y YD++++ RQ L Y +L EAY +
Sbjct: 525 NETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLLK 582
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
Q +L+ D+D LL+ FLLG W+E A+ L +E + YE NART ++ W D
Sbjct: 583 QKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGIDEASKNYYEENARTIVSTWGDKD 642
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
Q L DY N+ W GL+ YY PR ++ +I S+ + F + + + DW
Sbjct: 643 QS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEIDW 698
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
YP E +A+ + L NKY
Sbjct: 699 VKSHERYPSEPVSNAVEIATLLMNKY 724
>gi|340617022|ref|YP_004735475.1| alpha-N-acetylglucosaminidase [Zobellia galactanivorans]
gi|339731819|emb|CAZ95084.1| Alpha-N-acetylglucosaminidase, family GH89 [Zobellia
galactanivorans]
Length = 747
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 170/550 (30%), Positives = 271/550 (49%), Gaps = 58/550 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G GGPLPQ+W+ Q+ LQ KIL R+ LGM PVL AF+G+VP L+ ++P A I
Sbjct: 197 MGNIDGMGGPLPQNWITQRKELQVKILNRMRSLGMKPVLQAFTGHVPQVLKKLYPEANIF 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ +W V+ TY LD TD LF +IG AFI++Q + YG T H+Y+ D F E PP
Sbjct: 257 QIEDWAGVEG------TYFLDPTDELFQKIGTAFIKKQTELYG-TDHLYDADCFIEVDPP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM-KALLNSVPLGKLVV 179
P ++ + ++Y M+ DS A W++QGW F + + + +A L+ +P + +V
Sbjct: 310 SKDPAFLKQVSESVYKSMELADSKATWVLQGWFFFFKKDFWTKERGRAFLDGIPKNRAIV 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE------ 233
LDL+ E P W + FYG P+IW ++ N + M G L+ + EA TSE
Sbjct: 370 LDLYGEKNPTWDKTDAFYGQPWIWNVICNEDQKVNMSGDLEEMQRQFQEAYTSEIGNNLK 429
Query: 234 ----------------------------NTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
WI Y+ RYG P+++ AW +L +
Sbjct: 430 GIGVIPEGLGYNPIVQDFIFEKAWDPQKVNVQEWIEDYATIRYGTKSPSVKKAWQLLGES 489
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VY G T ++ P ++ EG ++ + +++ ++D
Sbjct: 490 VY----GRTRTMWSPLI----TTPRLMIFEEGSKEDIRHVRKDFKITETDPFAWD----- 536
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
+ ++ +A L + NEL TY +DL ++ R+ L ++ ++ AYQ D
Sbjct: 537 FDVYKLAKAAGLLLGEANELQDVETYNFDLTNVYRELLFSLTHKSINDVSVAYQEKDRQA 596
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
+ + ++ +L++D++ + ++ FLLG WLE AK E+++ YEWNART +T+W
Sbjct: 597 LDRSAKSLFKLMDDLEAITGANENFLLGKWLEDAKSWGSTPEEKEYYEWNARTIVTIWQP 656
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
+ LRDY K W+GL YY PR ++ ++ SL G F K + E ++
Sbjct: 657 YPE---GGLRDYAGKQWNGLFSGYYKPRWQLFVDHLRRSLTEGVDFDPKAYDAEVREMDY 713
Query: 506 DWQNGRNVYP 515
W +YP
Sbjct: 714 KWTRSHQIYP 723
>gi|390334740|ref|XP_003724005.1| PREDICTED: uncharacterized protein LOC100893810 [Strongylocentrotus
purpuratus]
Length = 1043
Score = 278 bits (711), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 164/471 (34%), Positives = 232/471 (49%), Gaps = 74/471 (15%)
Query: 100 KEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF 159
+E+ T HIYN DTF+EN P + Y+S+ +Y G+ GD VWLMQGWLF F
Sbjct: 573 EEFNGTDHIYNADTFNENQPRSNDSAYLSAASRGVYQGIVEGDPQGVWLMQGWLFQKTDF 632
Query: 160 WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL 219
W P Q+KALL+ VP+G+++VLDLFAE +PI++ ++ FYG P+IWCMLHNF GN +YG L
Sbjct: 633 WGPSQIKALLHGVPIGRMIVLDLFAEARPIYNATQSFYGQPFIWCMLHNFGGNTGLYGKL 692
Query: 220 DSIAFGPVEARTSENTTM---------------------------------AWINQYSVR 246
D++ P EAR ++TM WI +YS R
Sbjct: 693 DAVNKFPFEARQFNSSTMIGMGLTPEGILQNYVMYNFLTDMTWRSESMNVSKWIEEYSGR 752
Query: 247 RY----GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 302
RY G S A + AW +L TVYN T D A P V PS N
Sbjct: 753 RYSPESGHSEEAAK-AWAILQATVYNNTGIDKDHQH----AVPVVRPS----------NK 797
Query: 303 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 362
K V +WY +EV +A + + L S+ +RYDL+D+TR
Sbjct: 798 TKSV-----------------IWYDYTEVAKAWGFLLQASETLGTSSLFRYDLVDVTRNV 840
Query: 363 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 422
L A + + I+ ++ + + L+ DMD + + H +LLG WLE AK L
Sbjct: 841 LQDLAFDFYEQIMVSFHAKNITAIRGNGTLLCNLILDMDNITSSHQDWLLGTWLEDAKSL 900
Query: 423 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 482
A N ++E YE+NAR QIT+W + DY NK W GLLR YY R ++ +++
Sbjct: 901 ATNHKEESLYEYNARNQITVWGPRGEH-----LDYANKQWGGLLRSYYYNRWQLFVQFLD 955
Query: 483 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+E + + + +W N +P + GD + S+ LY+KY
Sbjct: 956 GCIELHVPYDQSKFDMRSFIMETEWTNSTEKFPTKPVGDTVSISRALYSKY 1006
>gi|440799253|gb|ELR20308.1| alpha-N-acetylglucosaminidase family protein [Acanthamoeba
castellanii str. Neff]
Length = 854
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 172/586 (29%), Positives = 274/586 (46%), Gaps = 100/586 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL +W Q LQKKI+ R GM P+LP F+G VP ++ ++P+A +T
Sbjct: 240 MGNIQGWGGPLDPAWRKAQAELQKKIVERQRMFGMLPILPGFAGFVPDGIKRIYPTANLT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ +W ++ Y L D L+ IGR I + E+G T HIYN DTF+E +PP
Sbjct: 300 KSADWAGFPH--QYTNVYFLSPLDSLYKTIGRMVIRRVTAEFG-TDHIYNADTFNEMSPP 356
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWR-PPQMKALLNSVPLGKLVV 179
P Y+++ A+Y GM + D A+W+MQGW F +D FW ++++ L+ V +++
Sbjct: 357 SADPTYLAAASRAVYEGMAAEDPQALWVMQGWSFVFDKFWEDKSRVRSYLSGVSDKDMLI 416
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDL ++ P WS + ++G ++WCMLHN G +YG L + P+ A + TM
Sbjct: 417 LDLASDNNPEWSKTDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSDPLLALATPGNTMLI 476
Query: 238 ---------------------------------------AWINQYSVRRYGRS--VPAIQ 256
W+ +Y+ RRYG + + ++
Sbjct: 477 CGTCEQVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAAGLSSVG 536
Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
+AW +L Y N+ VI D T G YG
Sbjct: 537 EAWELLREATY---------NQSVI------DYGWFGFTPGLGMGYGGVA---------- 571
Query: 317 SSYDHPHLWYSTSEVIRALELFIASG--NELSASNTYRYDLIDLTRQALAKYANELFLNI 374
+ ++ + AL LF+ S + + ++YD +DLTRQ LA +++
Sbjct: 572 ----------NAAKEVEALRLFLQSALTKGYAPNGPWQYDCVDLTRQVLANTFRDIYAQF 621
Query: 375 IEAYQLNDAHGVF------QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
AY AH + L L L+ D+D +LA + +LLG W++SA A +Q
Sbjct: 622 DAAYSAYAAHKTYTVDQLKSLGSALLTLIGDIDEILATNPNYLLGTWIQSALSWADTPDQ 681
Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
Y++NAR QIT+W + Q + DY K+W+ L+R YY PR ++ +++++ +G
Sbjct: 682 ALHYQFNARNQITLWGPDGQ-----ITDYATKHWADLVRSYYQPRWTLFITSVLQAVYAG 736
Query: 489 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
+++R E ++L W Y G+ L + L KYL
Sbjct: 737 -----REYRGELLQLEQKWNRENTTYATTPTGNTLQVAYKLAAKYL 777
>gi|392588150|gb|EIW77482.1| glycoside hydrolase family 89 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 761
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 179/583 (30%), Positives = 283/583 (48%), Gaps = 81/583 (13%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP W+D Q VL K+I+ R+ +LGM PVLPAF+G VP A+ N++P+A I
Sbjct: 208 NIQGSWGGDLPTQWIDDQFVLGKQIVQRMVDLGMTPVLPAFTGFVPPAMHNLYPNASIVN 267
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W P++ L+ DPLF ++ ++FI +Q +G SHIY D ++EN P
Sbjct: 268 GSAWNDFA--PQFTNDSFLEPFDPLFAQVQQSFISKQQAAFGNVSHIYTLDQYNENDPYS 325
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVP----LGK 176
P Y++++ AA +S +++ D DA WLMQGWL FS FW P +++A L VP
Sbjct: 326 GDPSYLTNISAATFSSLRAADPDATWLMQGWLFFSSADFWTPERVEAYLAGVPGDDDGSG 385
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDL++E +P W ++G +IWC LH++ GN+ G ++ P+ A S N +
Sbjct: 386 MLILDLYSEAQPQWQRLSSYFGKRWIWCELHDYGGNMGFEGNFANVTEAPLAALASPNVS 445
Query: 237 M-----------------------AW----INQ------YSVRRY-GRSVPAIQ-DAWNV 261
M AW IN+ ++ RRY +P +AW
Sbjct: 446 MVGVGLTPEGMEGNEIIYDVLLDQAWSSSPINKTEYAQAWATRRYPADELPECAIEAWQT 505
Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
L TVY+ TD + I+ +++VT H
Sbjct: 506 LAATVYSNTDPGSQATVKSILELEPALSGLVNVTG-----------------------HH 542
Query: 322 P-HLWYST-SEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
P H++Y T + ++ AL+ + +G+ L A YRYDL+DLTRQ L +L+ +++
Sbjct: 543 PTHVFYDTNTTIVPALQQLVQAGHSTPSLLAIPEYRYDLVDLTRQLLVNRFIDLYADLLA 602
Query: 377 AYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-E 433
Y + V + LELV D+D +L ++ F L W ++A+ A Y E
Sbjct: 603 VYNTTSASSASVSAAGQPMLELVADLDKVLMTNENFQLSRWTDAARSWANGNASYAAYLE 662
Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRL 493
+NAR QIT+W + + DY +K W GL+ DYYG R A++ +Y+ S +G +
Sbjct: 663 YNARNQITLWGPKGE-----INDYASKQWGGLVGDYYGKRWAMFIQYLEGSKSNGTVYNA 717
Query: 494 KDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 536
+ + W + V + ++ WL + + G
Sbjct: 718 TAIAGALLDIGEQWSQSTSA--VGARDTMVVGDTWLVVQEMAG 758
>gi|404487028|ref|ZP_11022215.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
YIT 11860]
gi|404335524|gb|EJZ61993.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
YIT 11860]
Length = 726
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 179/566 (31%), Positives = 260/566 (45%), Gaps = 76/566 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ GW GPLP WL+ Q LQKKIL R EL M PVLPAF+G+VPAAL+ + P A I
Sbjct: 196 MANIDGWNGPLPMQWLESQAELQKKILARERELNMTPVLPAFAGHVPAALKRIHPDANIQ 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R + L+ +PLF EI ++F+E+Q K +G T HIY D F+E PP
Sbjct: 256 YLGKWAGFGDSYR---CHFLNPEEPLFAEIQKSFLEEQEKMFG-TDHIYGVDPFNEVDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY++ + + +Y + + D DAVWL W+F +D W P++KALL VP KLV+
Sbjct: 312 SWEPEYLAQVSSDMYKSLAAADPDAVWLQMTWMFYHDRKLWTAPRVKALLTGVPSDKLVL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------ILDSIAFG------- 225
LD E +W ++++F+G PYIWC L NF GN + G LD+
Sbjct: 372 LDYHCENVELWKSTEKFHGQPYIWCYLGNFGGNTTLTGNVKESGDRLDNALINGGDNLKG 431
Query: 226 -------------PVE-----ARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
P E A T + W+ + + R G + ++AW +L+
Sbjct: 432 IGSTLEGLDINQFPYEYIFEKAWTIDVNGQDWVERLADRHVGAVSESAREAWQILFD--- 488
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
DV V P + NY K S E Y
Sbjct: 489 -----------DVFVQVPRTLGILPGYRPKLGDNYNKRTSNE----------------YD 521
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ ++R EL + + + + D+I RQ L Y ++ Y+ + G+
Sbjct: 522 NATLLRVWELLLEVPS--CDRDAFEIDVIMTGRQLLGNYFLDVKKEFDGFYKKRNVPGLK 579
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ + E++ D++ L + H+ L W+E A+ L +E + YE NAR IT W
Sbjct: 580 EKASEMREILSDLELLNSFHNRASLDKWIEDARSLGDTDELKNYYEKNARNLITTW---- 635
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
L DY ++ W+GLL DYY R IYF +I + E G + + +W
Sbjct: 636 ---GGSLNDYASRTWAGLLNDYYARRWEIYFDAVIGAAEKGIELDKDELKSRLATFEQEW 692
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
+E NG L TS+ L KY
Sbjct: 693 VESTTPVCIERNGTLLDTSRRLLEKY 718
>gi|313203962|ref|YP_004042619.1| alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
gi|312443278|gb|ADQ79634.1| Alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
Length = 738
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 189/571 (33%), Positives = 273/571 (47%), Gaps = 69/571 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ WGGPLP S+++ Q LQ+ IL R LGM P+L AF+G+VP L+ + PSAKIT
Sbjct: 196 MANMDKWGGPLPISYIEGQKKLQQHILQRSRALGMKPILSAFAGHVPEQLKTLRPSAKIT 255
Query: 61 QLG-NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
++ W + ++ TY LD TD LF EI + F+ Q K YG T H+Y+ D F+E TP
Sbjct: 256 RIEPGWGGMAAE---YTTYFLDPTDNLFGEIQKRFLTVQQKLYG-TDHLYSADPFNEITP 311
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLV 178
P P+Y++++G IY M D +A+W W F DP W P++ A++++VP GKL
Sbjct: 312 PSWEPDYLANVGKTIYETMSQVDKEAIWYQMSWTFYNDPTHWTRPRLSAMIHAVPQGKLF 371
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM---------------YG------ 217
LD E + + S FYG P+IWC L NF N + YG
Sbjct: 372 FLDYNCEEEEFFRKSDNFYGAPFIWCYLGNFGANTHLVAPLNKVVNRLGKLTYGSACVGV 431
Query: 218 --ILDSIAFGP--------VEARTSENTTM-AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
L+ I P + R E T I Y+ RR G A+ +AW +L V
Sbjct: 432 GSTLEGINVNPEIYETVLEMPWRADETVTADTLIRHYAERRAGARDKAVIEAWQLLRQHV 491
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
D A +V V P + +T + +P + Y
Sbjct: 492 --LVDTAVGIWNHCVVF--QVSP-VTDLTRAFWAT-------------------NPKIPY 527
Query: 327 STSEVIRAL-ELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
++ AL +F AS N ++ YR+D+++LTRQAL Y L+ ++EAY +
Sbjct: 528 RNVDLAIALNRMFQASANS-KKTDAYRFDVVNLTRQALGNYGTVLYHKMMEAYSRKNLID 586
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
+ S FL+L +++DGLLA FLLG WL A+ ++ YE NAR IT W
Sbjct: 587 FRKYSGEFLQLGQEIDGLLATRHEFLLGKWLADARSWGTTPAEKAYYERNAREIITTW-- 644
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
+ L DY N+ W+GLLR YY PR + + SL +G + K +
Sbjct: 645 --HKAGGGLTDYSNRQWNGLLRSYYLPRWKEFINRLDTSLSTGKDYDDKAFAAWCSAFEQ 702
Query: 506 DWQNG-RNVYPVESNGDALITSQWLYNKYLQ 535
W + + Y GDA+ + L+ KY Q
Sbjct: 703 HWVDSPSSAYSDTETGDAVKMAFELFGKYKQ 733
>gi|298385999|ref|ZP_06995556.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
1_1_14]
gi|298261227|gb|EFI04094.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
1_1_14]
Length = 715
Score = 268 bits (684), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 173/570 (30%), Positives = 272/570 (47%), Gaps = 87/570 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GGPLP W +Q+VLQKKIL R+ E GM PV F G VP+ L+ +P A++
Sbjct: 194 MGNLENIGGPLPDEWFKEQIVLQKKILARMREYGMKPVFQGFFGMVPSLLKEKYPEARLV 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S++ P +LD DPLF + + + + K YG+ + ++ D F E T
Sbjct: 254 EQGLWNSLQRPP------VLDPADPLFERMAKVWYAEYEKLYGK-ADLFGGDLFHEGGKT 306
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D ++ + + M+ + DA W++Q WL + P+ K LL + +
Sbjct: 307 GGID----VTDAARRVQTAMKRYNPDATWVIQAWLGN-------PK-KELLAGLDRKNTL 354
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART------- 231
++DL AE W K F G P++W + N+ GNI ++G LD+IA GPV+ +
Sbjct: 355 IVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGGNIGLHGRLDAIATGPVDGQKDSAASPS 414
Query: 232 ---------------------------SENTTM-AWINQYSVRRYGRSVPAIQDAWNVLY 263
SE+ + W+ +YSVRRYG +++AW + +
Sbjct: 415 MKGTSSTPEGIEVNPVVFDLLNEMRWRSEHLDLDVWLKEYSVRRYGVEDENLKEAWTIFH 474
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
T Y G + V A P + I+ S++
Sbjct: 475 RTAYGTYTGHRRPSESVFCAPPSLKRDKITA----------------------SAWSQCR 512
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
++Y + + LF+ S + L ++TY+YD +D RQ LA E + N+++AY+ D
Sbjct: 513 IFYDPELFAQGVGLFLQSADRLKQTSTYQYDAVDFVRQYLADLGRETYYNLVDAYRAKDT 572
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
S RFL+L++D + LL+ H+ F +G WL+ A+ ++ E + YE NAR I W
Sbjct: 573 KQFDYWSERFLQLIKDQNELLSTHERFFVGRWLDMARLKSKQPELQDLYEHNARMLIGTW 632
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E S +RDY +K W GLL+DYY PR Y Y+ +LE G + D +
Sbjct: 633 ----TETLSPVRDYAHKEWGGLLKDYYLPRWTNYIAYLKGTLE-GRSLTVPD----SFQA 683
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
W N N Y +E+ D + T++ +Y+KY
Sbjct: 684 EKAWVNAHNKYVLEAGVDPVQTAKRMYSKY 713
>gi|297273081|ref|XP_001095618.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Macaca mulatta]
Length = 691
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 172/517 (33%), Positives = 249/517 (48%), Gaps = 113/517 (21%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ + P A
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRTSCMPVAS 263
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ S+ P GR I + +
Sbjct: 264 LPA-----SLPPSPG-----------------GRKLIH----------------SINLMQ 285
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
PP +P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L
Sbjct: 286 PPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRL 345
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 346 LVLDLFAESQPVYTLTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTM 405
Query: 238 ----------------------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
AW+ ++ +RYG S P AW +L
Sbjct: 406 VGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLL 465
Query: 264 HTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
+VYNC+ + NR +V P L+ TS
Sbjct: 466 RSVYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS----- 495
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
+WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY +
Sbjct: 496 -VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKE 554
Query: 383 AHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
+ + EL+ +D LLA FLLG WLE A+ A +E + YE N+R Q+T
Sbjct: 555 LTSLLRAGGVLAYELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLT 614
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
+W E ++L DY NK +GL+ +YY PR ++
Sbjct: 615 LW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFL 646
>gi|119480815|ref|XP_001260436.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
181]
gi|119408590|gb|EAW18539.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
181]
Length = 748
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 165/524 (31%), Positives = 265/524 (50%), Gaps = 77/524 (14%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP SW+D Q LQKKI+ R+ ELGM PVLPAF+G VP A+ V P+A +
Sbjct: 197 NIQGSWGGDLPYSWIDSQFELQKKIVRRMVELGMTPVLPAFTGFVPRAISRVLPNATVVN 256
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W D R+ L+ DP F + R+FI++Q + YG +H+Y D ++EN P
Sbjct: 257 GSRWGGF--DERYTNDTFLEPFDPSFTRLQRSFIQKQQQAYGNITHVYTLDQYNENDPYS 314
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
+Y+ ++ + ++S D +AVWLMQGWLF S FW ++KA L+ V + + ++V
Sbjct: 315 GDLDYLRNVTRNTWLSLKSADPNAVWLMQGWLFYSNSDFWTDERVKAYLSGVEVDQDMLV 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLF+E +P W ++ +YG P+IWC LH++ GN+ +YG + ++ +A + ++ +
Sbjct: 375 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNVTVNATQALAASDSLVGF 434
Query: 238 --------------------AWINQ---------------YSVRRYGRSVP-AIQDAWNV 261
AW Q YS G +VP + AW++
Sbjct: 435 GLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHNWVKTRYSSGVRGSAVPEELHQAWDI 494
Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
L TVYN T+ + I ++ PSI G G DH
Sbjct: 495 LRTTVYNNTNLTSTAVSKSIF---ELQPSI----SGLLNRTG----------------DH 531
Query: 322 P-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEA 377
P + Y + +++A +L ++ ++ L + + YD++D+TRQ +A +++N++
Sbjct: 532 PTTVNYDPAALVQAWQLMDSAASKDRSLWSQPAFLYDMVDITRQVMANAFIPMYINLVST 591
Query: 378 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 437
YQ V ++L+ D+D +L+ +D F L W+ SA+ A+N+ + YE+NAR
Sbjct: 592 YQA--GASVSTDGSNLIQLLRDVDSVLSTNDNFRLSTWIRSARSWARNDTEADFYEYNAR 649
Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
QI +W + DY +K W GL+ YY PR + Y+
Sbjct: 650 NQIALW-----GPMGEINDYASKQWGGLVSAYYIPRWQTFLHYL 688
>gi|156121099|ref|NP_001095696.1| alpha-N-acetylglucosaminidase precursor [Bos taurus]
gi|151554244|gb|AAI48148.1| NAGLU protein [Bos taurus]
gi|296476361|tpg|DAA18476.1| TPA: alpha-N-acetylglucosaminidase [Bos taurus]
Length = 667
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 148/383 (38%), Positives = 205/383 (53%), Gaps = 68/383 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+GNW + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TM
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 440
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ ++ RRYG S + AW +L +
Sbjct: 441 TGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 500
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYNC S E + N+ P+ + L+ T+ +W
Sbjct: 501 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 530
Query: 326 YSTSEVIRALELFIASGNELSAS 348
Y+ S+V A L + + + L++S
Sbjct: 531 YNRSDVFEAWRLLLTATSTLASS 553
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 60/115 (52%), Gaps = 5/115 (4%)
Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
A A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR ++
Sbjct: 551 ASSPAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFT 605
Query: 479 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ ++ESL G F+ + R +L + G YP + GD + + L+ KY
Sbjct: 606 ETLVESLVQGVPFQQHQFDRNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 660
>gi|404406438|ref|ZP_10998022.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 726
Score = 265 bits (676), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 179/576 (31%), Positives = 271/576 (47%), Gaps = 82/576 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLPQSW+D QL LQ++I+ R ELG+ PV +F+G+VP AL+ +FP A I
Sbjct: 190 MANIDAWHGPLPQSWIDGQLELQRRIIARERELGIQPVFTSFTGHVPKALKTLFPDADIE 249
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W S + R +Y L+ +PLF I +A++++Q + +G +S +Y D F+E PP
Sbjct: 250 RLNPWTSFE---RPYNSYYLNPAEPLFNRIQQAYMQEQRRLFGESS-VYGVDPFNELDPP 305
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
PEY++ Y + D DAVWL W+F + W P ++KA L +VP GKL++
Sbjct: 306 NWDPEYLARAARLTYESITQFDKDAVWLQMAWVFYHKRRDWTPERLKAYLCAVPDGKLLM 365
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG---------------------- 217
LD + + +W +++ FYG P+IW L NF GN + G
Sbjct: 366 LDYYCDKVELWRSTESFYGQPFIWSYLGNFGGNTMLAGDVKDVSRKLDRAYAEAGRNFVG 425
Query: 218 ---ILDSIAFGPV-------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
L+ + P A T WI++ + R GR + AW +LY +Y
Sbjct: 426 IGCTLEGLDVNPFMYEYVLDRAWTQLYDDAGWIDRLADRHSGRIDVHYRQAWRILYDKIY 485
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
G NR V +P K S + PHL Y
Sbjct: 486 CAPSG----NRSAAVC-------------------ARPNMK------GRSKWSGPHLDYD 516
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+++R E + E +AS+ R+D +++ RQ L Y L I A + D V
Sbjct: 517 NRDLLRVWEQLTLARPERTASS--RFDCVNIPRQCLENYFGNLNERCIAACRGGDRETVA 574
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+LS R LEL++D+D L+A FLLG W+ A+++ ++ +E +AR +T W
Sbjct: 575 RLSARLLELLDDIDRLVAADAYFLLGKWIADARRMGATPAEKDYFERDARNILTTW---- 630
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---RLKDWRREWIKLT 504
L DY N+ WSGL+ DYY R ++ + E + L+D+ EW+
Sbjct: 631 GGRGYSLNDYANRTWSGLVSDYYKERWRRFYDRLQSDGEPDEDALLQELQDFEWEWVG-- 688
Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
+ GR + GDA + LY KY G F
Sbjct: 689 ---RKGR--FAERPRGDAFRLCRSLYTKYAAEIGRF 719
>gi|393783261|ref|ZP_10371436.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
CL02T12C01]
gi|392669540|gb|EIY63028.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
CL02T12C01]
Length = 724
Score = 264 bits (675), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 169/567 (29%), Positives = 267/567 (47%), Gaps = 79/567 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ GW GPLP WLD Q+ LQKKIL R EL M PVLPAF+G+VP AL+ +FP A I
Sbjct: 198 MANIDGWNGPLPMHWLDSQVELQKKILTRERELNMKPVLPAFAGHVPGALKRIFPEANIQ 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + R + L+ + LF I + +I++Q + +G T HIY D F+E PP
Sbjct: 258 NLGKWAGFAEEYR---CHFLNPEEALFATIQKQYIKEQTRLFG-TDHIYGVDPFNEVDPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY+S + A +Y + + D A W+ W+F +D W P++KA+L VP GK+V+
Sbjct: 314 SWEPEYLSKVSADMYHTLTAADPKAEWMQMTWMFYFDRKDWTAPRVKAMLTGVPQGKMVL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W T++ F+G PYIWC L NF GN + G
Sbjct: 374 LDYHCENVELWKTTEHFHGQPYIWCYLGNFGGNTTLTGNVKESGARLDNTLINGGSNFKG 433
Query: 219 -------LDSIAFGPVE-----ARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E A T +W+N + R G + +++AW++L++ V
Sbjct: 434 IGSTLEGLDVMQF-PYEYIFEKAWTLNTDDRSWLNALADRHTGVTSEPVREAWDILFNQV 492
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y + P++ P + KP ++ ++ Y
Sbjct: 493 YVQVP-------RTLAVLPNLRPVM-----------NKPNNRTSIN-------------Y 521
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ +++A + + + + + R D+I + RQ L Y + + Y+ D +
Sbjct: 522 PNTALLQAWQKLLQAPD--CNRDALRLDIITVGRQLLGNYFLTVKDDFDRMYEAKDLPAL 579
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ E++ D++ L A H L W+ A++ E + YE NAR IT W
Sbjct: 580 KARAAEMREILNDLERLNAFHSRCSLDKWISDARKYGNTPELKNYYEKNARNLITTW--- 636
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
L DY ++ W+GL++DYY R +Y ++ ++E+ F + E+ +
Sbjct: 637 ----GGRLNDYASRTWAGLIKDYYSKRWDMYLDAVVAAVENNREFDQEKLDGEFRLFEDS 692
Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
W + V GD LI +++L NKY
Sbjct: 693 WVSSTRPVEVTPEGDLLIYARFLLNKY 719
>gi|423287380|ref|ZP_17266231.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
CL02T12C04]
gi|392672495|gb|EIY65962.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
CL02T12C04]
Length = 726
Score = 264 bits (675), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 170/566 (30%), Positives = 268/566 (47%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP SWL Q LQK+I+ R LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 192 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D DPLF +I + ++E+Q K YG T HIY D F+E P
Sbjct: 252 QMSQWGGYDKKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++ I+ ++ DS A W+ W+F + W P++KA LNSVP KL++
Sbjct: 308 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
LD + + IW ++Q+YG PYIWC L NF GN + G +D ++
Sbjct: 368 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 427
Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
F +A + T W+ +++ R G I DAW LY +Y
Sbjct: 428 VGATLEGLDVNPFMYEFVLEKAWSHTITNADWMKNWALCRGGSKSSHIIDAWQQLYKKIY 487
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
I T G+ ++ +L+ S HP ++Y
Sbjct: 488 -----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYYD 520
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+ F+ + N S+ Y++D+I++ RQ L ++ + Y+ + G+
Sbjct: 521 NKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGMK 578
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ + + L D+D LL+C F +G W++ A+ +N ++++ YE NAR +T W
Sbjct: 579 EWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARCILTTW---- 634
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++A+ L DY N+ W GL YY R ++ +Y I+ + G K + + W
Sbjct: 635 GQKATQLNDYANRGWGGLTDSYYRKRWELFTQYAIDEMSHGKEIDEKSFYNLITEFEYQW 694
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
NVY S D + + LY KY
Sbjct: 695 TLQTNVYSESSGEDPIRIANLLYIKY 720
>gi|295087651|emb|CBK69174.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 703
Score = 264 bits (675), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 170/566 (30%), Positives = 268/566 (47%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP SWL Q LQK+I+ R LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 169 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 228
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D DPLF +I + ++E+Q K YG T HIY D F+E P
Sbjct: 229 QMSQWGGYDEKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 284
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++ I+ ++ DS A W+ W+F + W P++KA LNSVP KL++
Sbjct: 285 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 344
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
LD + + IW ++Q+YG PYIWC L NF GN + G +D ++
Sbjct: 345 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 404
Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
F +A + T W+ +++ R G I DAW LY +Y
Sbjct: 405 VGATLEGLDVNPFMYEFVLEKAWSHTITNADWMKNWALCRGGSKSSHIIDAWQQLYKKIY 464
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
I T G+ ++ +L+ S HP ++Y
Sbjct: 465 -----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYYD 497
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+ F+ + N S+ Y++D+I++ RQ L ++ + Y+ + G+
Sbjct: 498 NKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGMK 555
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ + + L D+D LL+C F +G W++ A+ +N ++++ YE NAR +T W
Sbjct: 556 EWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARCILTTW---- 611
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++A+ L DY N+ W GL YY R ++ +Y I+ + G K + + W
Sbjct: 612 GQKATQLNDYANRGWGGLTDSYYRKRWELFTQYAIDEMSHGKEIDEKSFYNLITEFEYQW 671
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
NVY S D + + LY KY
Sbjct: 672 TLQTNVYSESSGEDPIRIANLLYIKY 697
>gi|423213214|ref|ZP_17199743.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693674|gb|EIY86904.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
CL03T12C04]
Length = 726
Score = 264 bits (675), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 170/566 (30%), Positives = 268/566 (47%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP SWL Q LQK+I+ R LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 192 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D DPLF +I + ++E+Q K YG T HIY D F+E P
Sbjct: 252 QMSQWGGYDEKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++ I+ ++ DS A W+ W+F + W P++KA LNSVP KL++
Sbjct: 308 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
LD + + IW ++Q+YG PYIWC L NF GN + G +D ++
Sbjct: 368 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 427
Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
F +A + T W+ +++ R G I DAW LY +Y
Sbjct: 428 VGATLEGLDVNPFMYEFVLEKAWSHTITNADWMKNWALCRGGSKSSHIIDAWQQLYKKIY 487
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
I T G+ ++ +L+ S HP ++Y
Sbjct: 488 -----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYYD 520
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+ F+ + N S+ Y++D+I++ RQ L ++ + Y+ + G+
Sbjct: 521 NKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGMK 578
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ + + L D+D LL+C F +G W++ A+ +N ++++ YE NAR +T W
Sbjct: 579 EWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARCILTTW---- 634
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++A+ L DY N+ W GL YY R ++ +Y I+ + G K + + W
Sbjct: 635 GQKATQLNDYANRGWGGLTDSYYRKRWELFTQYAIDEMSHGKEIDEKSFYNLITEFEYQW 694
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
NVY S D + + LY KY
Sbjct: 695 TLQTNVYSESSGEDPIRIANLLYIKY 720
>gi|380692804|ref|ZP_09857663.1| putative alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 709
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 169/570 (29%), Positives = 267/570 (46%), Gaps = 87/570 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GGPLP W +Q VLQKKIL R+ E GM PV F G VP++L+ +P A +
Sbjct: 188 MGNLENIGGPLPDEWFKEQTVLQKKILARMREYGMKPVFQGFFGMVPSSLKEKYPEAHLV 247
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S++ P +LD DPLF ++ + + + K YG+ + ++ D F E T
Sbjct: 248 EQGLWNSLQRPP------VLDPADPLFEQMAKVWYTEYEKLYGK-ADLFGGDLFHEGGKT 300
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D ++ + + M+ + DA W++Q WL + P+ K LL + +
Sbjct: 301 GGID----VTDAARRVQTAMKQYNPDATWVIQAWLGN-------PK-KELLAGLDRKHTL 348
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA 238
++DL AE W K F G P++W + N+ NI ++G LD+IA GP++ R + +
Sbjct: 349 IVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGANIGLHGRLDAIATGPIDGRKDPEASPS 408
Query: 239 -----------------------------------WINQYSVRRYGRSVPAIQDAWNVLY 263
W+ +YSVRRYG ++ AW + +
Sbjct: 409 MKGTSSTPEGIEVNPVVFDLLNEMRWRSEYLDIDTWLKEYSVRRYGAEDENLKKAWIIFH 468
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
T Y G + V A P + I+ S++
Sbjct: 469 RTAYGTYSGHRRPSESVFCAPPSLKRDKITA----------------------SAWSQCR 506
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
++Y + + LF+ S + L ++TY+YD +D RQ LA E + N+++AY+ D
Sbjct: 507 IFYDPDLFAQGVGLFLQSADHLKQTSTYQYDAVDFVRQYLADLGREAYYNLVDAYRAKDT 566
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
S RFL+L++D + LL+ H F +G WL+ A+ ++ E + YE NAR I W
Sbjct: 567 KQFDYWSERFLQLIKDQNELLSTHKCFFVGRWLDMARSKSKQPELQDLYEHNARMLIGTW 626
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
E S +RDY +K W GLL+DYY PR Y Y+ +LE G + + ++
Sbjct: 627 ----TETLSPVRDYAHKEWGGLLKDYYLPRWTNYIAYLKGTLE-GRSLTVPN----SFQV 677
Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
W N N Y +E+ D + T++ +Y KY
Sbjct: 678 EKAWVNAHNKYVLETGVDPVETAKRMYRKY 707
>gi|71001188|ref|XP_755275.1| alpha-N-acetylglucosaminidase [Aspergillus fumigatus Af293]
gi|66852913|gb|EAL93237.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
Af293]
gi|159129357|gb|EDP54471.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
A1163]
Length = 756
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 161/520 (30%), Positives = 262/520 (50%), Gaps = 69/520 (13%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP SW+D Q LQKKI+ R+ ELGM PVLPAF+G VP A+ V P+A +
Sbjct: 205 NIQGSWGGELPYSWIDSQFELQKKIVRRMVELGMTPVLPAFTGFVPRAVSRVLPNATVVN 264
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + D R+ L+ DP F+ + R+FI++Q + YG +HIY D ++EN P
Sbjct: 265 GSRW--EEFDERYTSDTFLEPFDPSFMRLQRSFIKKQQQAYGNITHIYTLDQYNENAPYS 322
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
+Y+ ++ + ++S D +AVWLMQGWLF S FW ++KA L+ V + + ++V
Sbjct: 323 GDLDYLHNVTHNTWLSLKSADPNAVWLMQGWLFYSSSGFWTDERVKAYLSGVEVDQDMLV 382
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLF+E +P W ++ +YG P+IWC LH++ GN+ +YG + ++ +A + ++ +
Sbjct: 383 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNVTVNATQALAASDSLVGF 442
Query: 238 --------------------AWINQ---------------YSVRRYGRSVP-AIQDAWNV 261
AW Q YS G +VP + AW++
Sbjct: 443 GLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHNWAKTRYSSGVRGSAVPEELYQAWDI 502
Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
L T YN T+ + I ++ PSI + + T SYD
Sbjct: 503 LRITAYNNTNLTSTAVSKSIF---ELQPSISGLLNRTSHH------------PTTVSYDP 547
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
L + R ++ + L + + YD++D+TRQ ++ ++ N++ YQ
Sbjct: 548 AAL----VQAWRLMDSAASKAPSLWSQPAFLYDMVDITRQVMSNAFIPVYTNLVSTYQAG 603
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
+ V ++L+ D+D +L+ +D F L W++SA+ +N+ + YE+NAR Q+T
Sbjct: 604 GS--VSTDGSNLIQLLRDLDSVLSTNDNFRLSTWIQSARSWVRNDTEADFYEYNARNQVT 661
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
+W + + DY +K W GL+ YY PR + Y+
Sbjct: 662 LWGPKGE-----INDYASKQWGGLVSSYYIPRWQKFLNYL 696
>gi|237717696|ref|ZP_04548177.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229453015|gb|EEO58806.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 729
Score = 261 bits (667), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 167/568 (29%), Positives = 264/568 (46%), Gaps = 71/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 196 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+ L+ D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 256 RVSEWGGFADQYR--CTF-LNPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D +AVWL WLF D W P++K+ L SVP +L++
Sbjct: 312 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
LD F E IW + ++G PY+WC L NF GN + G ++ ++
Sbjct: 372 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 431
Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
F +A T W + + RR G+ P + AW +L + VY
Sbjct: 432 VGSTLEGIDLNQFMYEFVLDKAWNGGQTDKEWFFKLADRRIGKISPEARKAWEILANKVY 491
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
V P+ V +G N +P LK P + Y
Sbjct: 492 -------------------VQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEYQ 524
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+++ A L ++ + ++Y +DL+++ RQ L Y N + AY+ D +
Sbjct: 525 PKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIPMMK 582
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ E++ D+D L++CH F L W+ A+ + + + YE NAR+ IT+W D+
Sbjct: 583 NRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHDAASKNYYEMNARSLITIWGDSY 642
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
L DY N+ W+GL YY R + +IE+ E F +++ + N+W
Sbjct: 643 H-----LTDYANRSWAGLTNQYYSVRWDHFINEVIEAAEKKKNFDEEEFFNQSRMYENEW 697
Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQ 535
N N GD + ++ +Y KY +
Sbjct: 698 VNPSNRISYNEGGDGIKLARQIYKKYAK 725
>gi|299149196|ref|ZP_07042257.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298512863|gb|EFI36751.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 738
Score = 261 bits (667), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 167/568 (29%), Positives = 264/568 (46%), Gaps = 71/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 205 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+ L+ D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 265 RVSEWGGFADQYR--CTF-LNPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D +AVWL WLF D W P++K+ L SVP +L++
Sbjct: 321 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
LD F E IW + ++G PY+WC L NF GN + G ++ ++
Sbjct: 381 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 440
Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
F +A T W + + RR G+ P + AW +L + VY
Sbjct: 441 VGSTLEGIDLNQFMYEFVLDKAWNGGQTDKEWFFKLADRRIGKISPEARKAWEILANKVY 500
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
V P+ V +G N +P LK P + Y
Sbjct: 501 -------------------VQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEYQ 533
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+++ A L ++ + ++Y +DL+++ RQ L Y N + AY+ D +
Sbjct: 534 PKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIPMMK 591
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ E++ D+D L++CH F L W+ A+ + + + YE NAR+ IT+W D+
Sbjct: 592 NRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHDAASKNYYEMNARSLITIWGDSY 651
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
L DY N+ W+GL YY R + +IE+ E F +++ + N+W
Sbjct: 652 H-----LTDYANRSWAGLTNQYYSVRWDHFINEVIEAAEKKKNFDEEEFFNQSRMYENEW 706
Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQ 535
N N GD + ++ +Y KY +
Sbjct: 707 VNPSNRISYNEGGDGIKLARQIYKKYAK 734
>gi|357622373|gb|EHJ73879.1| putative alpha-N-acetyl glucosaminidase [Danaus plexippus]
Length = 780
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 163/534 (30%), Positives = 257/534 (48%), Gaps = 79/534 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+HGWGGPLPQSW D+Q +Q+ + +++LGM PV PAF+G+VP A + +FP+
Sbjct: 206 MGNVHGWGGPLPQSWHDRQKQIQEVVTDLMFKLGMIPVFPAFNGHVPKAFEKIFPNTTFH 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W K D +CC +D +P F I + F+ + G +SHIY D F+E
Sbjct: 266 PVETW--NKFDEDYCCNLFVDPREPDFKMISKMFMREITAGLG-SSHIYTADPFNEIKIQ 322
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
S + AI+S + D DAVWL+Q W+F ++P W ++ + L SVP G+++V
Sbjct: 323 PWSTSLVVETAKAIFSSISEYDKDAVWLVQNWMFVHNPLLWPLKRVNSFLTSVPNGRMLV 382
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDL +E P + + +YG P+IW MLHNF G + M+G +I E R EN+TM
Sbjct: 383 LDLQSEQWPQYDLYQMYYGQPFIWSMLHNFGGTLGMFGNTKTINKDVYEVRKRENSTMVG 442
Query: 239 ---------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHT 265
W++ Y+ RRYG + +I W L +
Sbjct: 443 IGLTPEGINQNYVIYDLMLESAWRKGPVPDLEEWVSDYAERRYGCNATSI--GWKYLLRS 500
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYN T ++ GKY V+ S P W
Sbjct: 501 VYNFTG--------------------LNRIRGKY-----------VMTRRPSFNIRPWAW 529
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
Y ++ AL+ F+ N +++ + +DL+D+TRQAL ++++N+ N +
Sbjct: 530 YKGHDLFEALKNFVYVQNPACSTSGFLHDLVDVTRQALQYKIEQIYMNLQNDRYSN--YM 587
Query: 386 VFQLS-RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
VF + F++ + DM +LA F + WL SA+ ++ + Y++NAR QIT+W
Sbjct: 588 VFNYTISSFIDAMTDMQNILATSSDFKITSWLSSARAISNLPLESSLYDFNARNQITLWG 647
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
N + + DY K W+ L + YY PR +I+ +++ + F K +R
Sbjct: 648 PNGE-----ISDYACKQWAELFKYYYIPRWSIFLSMALDAKTRNEPFDEKGAQR 696
>gi|449541595|gb|EMD32578.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
B]
Length = 752
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 166/550 (30%), Positives = 275/550 (50%), Gaps = 72/550 (13%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP W++ Q LQK+IL R+ ELGM PVLPAF+G VP A+ V +A I
Sbjct: 198 NIQGSWGGELPMQWVNDQFALQKQILARMTELGMTPVLPAFTGFVPRAMSTVHSNASIVN 257
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTPP 120
W + P L+ DPLF + ++FI +Q + YG SHIY D ++EN P
Sbjct: 258 GSQW-APGFPPSLTNVSFLEPFDPLFATLQKSFIAKQQEAYGANISHIYTLDQYNENNPF 316
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-KLV 178
+ Y+SS+ ++ +++ D DAVW++QGWL FS + FW +++A L VP ++
Sbjct: 317 SGNLSYLSSISEGTFTSLRAADPDAVWMLQGWLFFSSEAFWTNERIEAYLGGVPTNDSMI 376
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM- 237
VLDL++E +P W+ + ++G ++WC LH++ G I + G LD+I GP+ A S ++M
Sbjct: 377 VLDLYSEAQPQWNRTSSYFGKQWVWCELHDYGGTIGLEGNLDAITTGPIAALNSPGSSMK 436
Query: 238 ----------------------AW----------INQYSVRRY-GRSVP-AIQDAWNVLY 263
AW + + RRY +P A Q+AW +L
Sbjct: 437 GMGLTMEGQEGNEIVYDLLLDQAWSSSPINIASYVKGWVSRRYLVEPLPSAAQEAWRILS 496
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
TVYN D ++ I +++P + + V++ +L + YD
Sbjct: 497 TTVYNNQD---PNSQSTIKNIYELEPVLTGL-----------VNRTGILPT-VIPYD--- 538
Query: 324 LWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
+ S ++ AL+L + A LS + +D++D++RQ L+ + + +I+ Y
Sbjct: 539 ---TNSTIVPALQLLVKAKAQNAALSTVPEFVHDVVDVSRQLLSNRFIDAYTALIDTYNN 595
Query: 381 ND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNAR 437
+ + V + + + ++ +D LLA ++ FLL W+ A+ L+ +E Y E+NAR
Sbjct: 596 TNVTSDAVIRAGQPLMTILSQLDALLATNENFLLSSWIAQARNLSHGDESYAAYLEYNAR 655
Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR 497
QIT+W + + + DY +K W+GL+ YY R + Y+ + F +
Sbjct: 656 NQITLWGPDGE-----INDYASKAWAGLISTYYAARWQTFIDYLASTKRLARPFDTSAFS 710
Query: 498 REWIKLTNDW 507
+ I L +W
Sbjct: 711 NQMILLGQEW 720
>gi|333031143|ref|ZP_08459204.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
gi|332741740|gb|EGJ72222.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
Length = 723
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 176/567 (31%), Positives = 255/567 (44%), Gaps = 79/567 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP+ WLD Q LQK+IL R EL M PVLPAF+G+VP+ L+++FP A I
Sbjct: 197 MANIDSWNGPLPKEWLDHQSDLQKQILKRERELNMKPVLPAFAGHVPSELKHLFPEADIQ 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ DPLF +I R F+E+Q + +G T HIY D F+E PP
Sbjct: 257 HLGKWAGFADKYR--CNF-LNPNDPLFAKIQRLFLEEQTRLFG-TDHIYGVDPFNEVDPP 312
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
PEY+ + A +Y + D A WL WLF + W P+++ALL VP +L +
Sbjct: 313 SWEPEYLKKVAADMYRTLTDVDPKAKWLQMTWLFYHGKKKWTAPRIEALLTGVPQDELYL 372
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W T+ F+G PYIWC L NF GN + G
Sbjct: 373 LDYHCENVELWKTTDYFHGQPYIWCYLGNFGGNTTITGNVKESGQRLENTLINGGNNFKG 432
Query: 219 -------LDSIAFGPVE-----ARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E A T +W+ + R G+ A ++AW +L++ V
Sbjct: 433 IGSTLEGLDVMQF-PYEYIFDKAWTFNMDDNSWVENLADRHLGKKSEAYREAWKILFNDV 491
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y + + P+ P + KP + ++ + D +W
Sbjct: 492 YVQVPKS-------LGVLPNFRPEM-----------SKPNKRTV---NDYKNKDLVKVWA 530
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
EV + Y DLI + RQ L Y + + YQ D G+
Sbjct: 531 KLLEVKEC------------TRDAYIIDLITVGRQVLGNYFLVVKNEFDQMYQFKDLPGL 578
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ E++ D++ L A H+ L W+ A+ L E + YE NAR IT W
Sbjct: 579 ESRGAKLREILNDLENLTAFHNHCTLEKWISDARALGNTIELKDYYEKNARNLITTW--- 635
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
L DY ++ WSGL++DYY R +Y + E+L+ F + + L
Sbjct: 636 ----GGSLNDYASRTWSGLIKDYYAKRWNLYIDSVTEALKENKKFNQSELNEKLNILEEA 691
Query: 507 WQNGRNVYPVESNGDALITSQWLYNKY 533
W N GD L S++L++KY
Sbjct: 692 WVNKVETVTSYEQGDILELSKYLFDKY 718
>gi|281210062|gb|EFA84230.1| hypothetical protein PPL_03307 [Polysphondylium pallidum PN500]
Length = 744
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 172/565 (30%), Positives = 272/565 (48%), Gaps = 84/565 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGG LPQSW+ Q LQ KIL R+ E GM+PV P F+G+VP A + +PSA I
Sbjct: 214 MGNLDGWGGVLPQSWIKGQHELQIKILKRMSEYGMSPVFPGFAGHVPVAFKQFYPSANIV 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHI--YNCDTFDENT 118
+L +W + T L TDP++ + F + Q + YG + I ++ D F+E
Sbjct: 274 ELPSWHGFNA------TNHLLTTDPMYDIVADRFYQVQNEIYGAYAKIDYFSIDPFNELI 327
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
PP +S ++++ + I++ + + D+ W++Q W + FW Q+ + L VP+G+L+
Sbjct: 328 PPSNSSQFLNECSSRIFNAINRFNPDSTWVLQNWFLN-SAFWGDGQVASFLGGVPIGRLI 386
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA 238
VLDL++E+KP+W+ + + G +IW MLHNF G + G + IA P+EA+ S T +
Sbjct: 387 VLDLWSELKPLWNRTANYQGHKWIWNMLHNFGGRPTISGRMPIIANEPLEAKASSPTMVG 446
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W++ Y RRYG ++P ++ W +L +TV
Sbjct: 447 IGLTPEAIEQNVIVYDLMSEMGWRSRSFDLNLWVDAYVTRRYGVNLPNLKPVWKMLAYTV 506
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y P+ P+ NY ++K+ L + L+Y
Sbjct: 507 YFS---------------PNRSPA----------NY---IAKKPSLDFQLG------LYY 532
Query: 327 STSEVIRA-LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
+ ++ A EL + +S TYRYDL ++T QAL+ Y N + ++Y +D
Sbjct: 533 NPVVIVDAWRELLAVDSTIVRSSETYRYDLAEITLQALSNYFNGNLKQLYQSYYASDFQ- 591
Query: 386 VFQLSRRFLEL-VEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
FQ +R+ + MD + LG W A++ A + + + YE+NAR QIT+W
Sbjct: 592 TFQSARQNCSFALRAMDAVADTVQLLKLGKWTADARKWATDNNERELYEYNARNQITLWG 651
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
DY NK+WSGL+ DYY PR I+F+++ ++ F +
Sbjct: 652 WKDMGNP----DYANKWWSGLIADYYFPRWQIFFEHLEHAIFDKSKFNEHSLAVNTMLHE 707
Query: 505 NDWQNGRNVYP--VESNGDALITSQ 527
W N+YP V SN D S+
Sbjct: 708 ERWNKQTNIYPSDVNSNDDVHTVSK 732
>gi|449541596|gb|EMD32579.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
B]
Length = 754
Score = 258 bits (658), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 159/552 (28%), Positives = 276/552 (50%), Gaps = 77/552 (13%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP W++ Q LQK+IL R+ ELGM P LPAF+G VP A+ ++P+A I
Sbjct: 201 NIQGSWGGALPMQWVNDQFALQKQILTRMTELGMTPALPAFTGFVPRAMSTLYPNASIVN 260
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTPP 120
W + L+ DPLF + ++FI +Q + YG +HIY D ++EN P
Sbjct: 261 GSAWSGFPAS--LTNVSFLEPFDPLFSTLQKSFITKQQQAYGTNVTHIYTLDQYNENNPF 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-KLV 178
+ Y+SS+ A ++ +++ D DA+W++QGWL FS + FW +++A L VP ++
Sbjct: 319 SGNISYLSSVSAGTFASLRAADPDAIWMLQGWLFFSSETFWTDERIQAYLGGVPTNDSMI 378
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM- 237
VLDL++E +P W+ + ++G ++WC LH + GN+ + G L++I GP+ A +S+ ++M
Sbjct: 379 VLDLYSEAQPQWNRTSSYFGKQWVWCELHGYGGNMGLEGNLNAITAGPIAALSSQGSSMK 438
Query: 238 ----------------------AW----------INQYSVRRYG-RSVP-AIQDAWNVLY 263
AW + + RRY +P A Q+AW +L
Sbjct: 439 GMGLTMEGQEGNEIVYDVLLDQAWSSAPIDIASYVKSWVARRYTVEPLPSAAQEAWQILS 498
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
TVYN D ++ I + +++P++ + + HP
Sbjct: 499 TTVYNNQD---PNSQATIKSIYELEPTLTGLVN--------------------RTGHHPT 535
Query: 324 L--WYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
L + + + V+ AL+L + + + L+A + YD +D++RQ L+ + + +++ Y
Sbjct: 536 LIPYDTNTTVVPALQLLVKAKEQNAALAAIPEFVYDAVDVSRQLLSNRFIDAYTGLVDTY 595
Query: 379 QLNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWN 435
+A V + + + ++ +D LLA ++ FLL W+ A+ + +E Y E+N
Sbjct: 596 NNANATSDAVVRAGQPLMVILSQLDALLATNENFLLSSWIAQARNWSHGDESYAAYLEYN 655
Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
AR Q+T+W + + + DY +K W+GL+ YY R + Y+ + F
Sbjct: 656 ARNQVTLWGPDGE-----INDYASKAWAGLISTYYSSRWQTFVDYLASTKRLSRPFDSSA 710
Query: 496 WRREWIKLTNDW 507
+ + I L W
Sbjct: 711 FSSQMILLGQQW 722
>gi|393785791|ref|ZP_10373937.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
CL02T12C05]
gi|392661410|gb|EIY54996.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
CL02T12C05]
Length = 727
Score = 257 bits (657), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 170/566 (30%), Positives = 261/566 (46%), Gaps = 77/566 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ GW GPLP WLD Q+ LQKKIL R EL M PVLPAF+G+VPAAL+ ++P A I
Sbjct: 198 MANIDGWNGPLPMEWLDNQVELQKKILARERELNMKPVLPAFAGHVPAALKRIYPEANIQ 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R Y L+ +PLF I + F+++Q + +G T HIY D F+E PP
Sbjct: 258 HLGKWAGFADTYR---CYFLNPEEPLFATIQKHFLQEQTRLFG-TDHIYGVDPFNEVDPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY+S + + +Y + + D A W+ W+F +D W P++KALL VP K+ +
Sbjct: 314 SWEPEYLSQVSSDMYRTLTAADPKAEWMQMTWMFYHDRKDWTAPRIKALLTGVPQDKMFL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG--------ILDSIAFGPVEART 231
LD E +W ++ F+G PYIWC L NF GN + G + +++ G R
Sbjct: 374 LDYHCENVELWKNTEHFHGQPYIWCYLGNFGGNTTLTGNVKESGDRLDNALINGGSNLRG 433
Query: 232 SENT------------------------TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
+T AW+ + R G +++AW++L++ +Y
Sbjct: 434 IGSTLEGLDVMQFPYEYIFEKAWDLNLDNEAWLQNLADRHAGTVSQPVREAWDILFNQIY 493
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
V P T G NY +PV + ++ YS
Sbjct: 494 --------------VQVPK--------TLGVLPNY-RPVMNKPNRRTVID--------YS 522
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ +++A E + + + + R D+I + RQ L Y + + Y + D G+
Sbjct: 523 NATLLQAWEKLLQATD--CNRDALRLDIITVGRQLLGNYFLIVKDDFDRMYTVKDLPGLK 580
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ E++ D+D L A H L WL A+ L E + YE NAR IT W
Sbjct: 581 ARAAEMKEILNDLDRLNAFHSRCALDKWLADARALGTTPEVKDYYEKNARNLITTW---- 636
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
L DY ++ W+GL++DYY R +Y +I ++E F K + W
Sbjct: 637 ---GGSLNDYASRTWAGLIKDYYSKRWDMYMDAVISAVEGNREFDQKKLDESIKNFEDAW 693
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
+ + V G+ + +++L KY
Sbjct: 694 VDSTDPILVAPQGELMQYARFLLQKY 719
>gi|336417192|ref|ZP_08597519.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
3_8_47FAA]
gi|423297818|ref|ZP_17275878.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
CL03T12C18]
gi|335936512|gb|EGM98438.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
3_8_47FAA]
gi|392664455|gb|EIY57993.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 166/568 (29%), Positives = 261/568 (45%), Gaps = 71/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K +
Sbjct: 194 MCNLDGWQSPLPKEWLSSQAELQEQIVAREREFNMQPVLPAFAGHVPAALKRVYPNIKTS 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+L + D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 254 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D A+WL WLF D W P++K+ L SVP KL++
Sbjct: 310 SWDTDSLGMMAKHIYESVAAVDPKAIWLQMTWLFYADIKHWTTPRIKSYLRSVPQDKLIL 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
LD F E IW + ++G PY+WC L NF GN + G + ++
Sbjct: 370 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVKLVSERLADALKNGGSNLKG 429
Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
F +A S T W + + RR G+ P + AW +L VY
Sbjct: 430 VGSTLEGIDLNQFMYEFVLDKAWNSGQTDKEWFLKLADRRTGKVSPEARKAWEILADKVY 489
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ P+ V +G N +P LK P + Y
Sbjct: 490 -------------------IQPA--QVGQGTLTN-ARPC-----LKGNGHWTTKPTIEYQ 522
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+++ A L + + ++Y +DL+++ RQ L Y N + AY+ D +
Sbjct: 523 PKDLVEAWRLLLLVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIMMMK 580
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ E++ D+D L++CH F L W+ A+ + + + YE NAR+ IT+W D+
Sbjct: 581 NRGDKMREILADLDKLVSCHPTFSLNKWITDARDMGHDATSKNYYEMNARSLITIWGDSY 640
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
L DY N+ W+GL YY R + +I+++E F + + E N+W
Sbjct: 641 H-----LTDYANRSWAGLTNQYYSVRWDRFINEVIKAVEKKKAFDEEVFFNESRMYENEW 695
Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQ 535
N N GD + ++ +Y KY +
Sbjct: 696 VNPSNRINYNEGGDGIKLARQIYKKYAK 723
>gi|126307952|ref|XP_001365931.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
domestica]
Length = 481
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 120/238 (50%), Positives = 160/238 (67%), Gaps = 4/238 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +Q LQ +IL R+ GM PVLPAF+G++P A VFP A +T
Sbjct: 205 MGNLHTWGGPLPSSWDLKQSYLQYQILERMRSFGMKPVLPAFAGHIPKAFTRVFPQANVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L NW + C+YLL DPLF +G F+ + KE+G T HIY+ D F+E PP
Sbjct: 265 KLDNWIDFNCT--YSCSYLLAPEDPLFPVVGSLFLRELAKEFG-TDHIYSADIFNEMDPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ AA+Y M + D DAVWL QGWLF P FW+PPQMKA+L +VP G+ ++
Sbjct: 322 SSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNHPDFWKPPQMKAVLEAVPRGRFLI 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
LDLFAE +P++S + FYG P+IWCMLHNF GN ++G+LD++ GP AR N+T+
Sbjct: 382 LDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLFGVLDAVNRGPSTARLFPNSTI 439
>gi|195454475|ref|XP_002074254.1| GK18384 [Drosophila willistoni]
gi|194170339|gb|EDW85240.1| GK18384 [Drosophila willistoni]
Length = 743
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 154/508 (30%), Positives = 253/508 (49%), Gaps = 76/508 (14%)
Query: 32 ELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 91
ELG++ LPAF+G+VP AL+ +FP A T+ W + +CC ++ +PLF ++
Sbjct: 262 ELGISVALPAFAGHVPRALRRIFPQANFTETERWNRFPN--AYCCDLFVEPQEPLFRQLA 319
Query: 92 RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 151
F+ + + YG ++HI+ CD F+E PPV +++ S AAIY+ M+ D A+WL+QG
Sbjct: 320 TTFLRRVTQRYG-SNHIFFCDPFNELEPPVSQADFMRSTAAAIYASMREVDPKAIWLLQG 378
Query: 152 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 211
W+F + FW ++A L +VP G L+VLDL +E P + +K +YG P++WCMLHNF G
Sbjct: 379 WMFVKNIFWTDELIEAFLTAVPQGNLLVLDLQSEQFPQYQRTKSYYGQPFVWCMLHNFGG 438
Query: 212 NIEMYGILDSIAFGPVEARTSENTTM---------------------------------A 238
+ M G ++ + G AR N++M
Sbjct: 439 TLGMLGSVELVNSGMDLARQMPNSSMVGAGITPEGIGQNYVMYSFALERGWSDRKLDSAG 498
Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 298
W +++ RYG + AW +L +VY + + I P ++ S
Sbjct: 499 WFTHFALTRYGVQDERLNQAWQLLRTSVYTFHGLQKMRGKYTITRRPAINLS-------- 550
Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT----YRYD 354
P WY+ + V+ A +L +++ + + + Y++D
Sbjct: 551 -----------------------PFTWYNVTHVLEAWQLMLSARSIIPLDDNRYDIYQHD 587
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
L+D+TRQ L A++L++N+ +Y+ L + LEL++D++ +L FLLG
Sbjct: 588 LVDITRQYLQITADQLYVNLNSSYRKRQLARFVYLGNKLLELLDDLERILGSGSNFLLGT 647
Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
WLE+AK LA E + +E+NAR QIT W N + + DY K WSG++ DYY PR
Sbjct: 648 WLEAAKLLAPTVEDQSNFEFNARNQITTWGPNGE-----ILDYACKQWSGMISDYYRPRW 702
Query: 475 AIYFKYMIESLESGDGFRLKDWRREWIK 502
A + + +L+S F +++ +K
Sbjct: 703 ARFLDDVTLALQSNQPFNASAYKQHVLK 730
>gi|121698957|ref|XP_001267859.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
1]
gi|119396001|gb|EAW06433.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
1]
Length = 671
Score = 255 bits (652), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 157/485 (32%), Positives = 246/485 (50%), Gaps = 70/485 (14%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G W G LP SW+D Q LQKKI+ R+ ELGM PVLPAF+G VP A+ V P A +
Sbjct: 197 NIQGSWHGELPYSWIDAQFELQKKIVRRMVELGMTPVLPAFTGFVPRAITRVLPDATVVN 256
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W D ++ L+ DP F + R+FI +Q + YG +HIY D ++EN P
Sbjct: 257 GSRWSGF--DEKYTNDTFLEPFDPNFARLQRSFIHKQQQAYGNITHIYTLDQYNENDPYS 314
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
PEY+ ++ + ++S D DA+W+MQGWLF S FW ++ A L+ V + ++V
Sbjct: 315 GDPEYLRNVTHNTWQSLKSADPDAIWMMQGWLFYSNSDFWTDERVHAYLSGVETDEDMLV 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA- 238
LDLF+E +P W ++ +YG P+IWC LH++ GN+ +YG + +I +A ++ +
Sbjct: 375 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNITVNATDALAVSDSLVGY 434
Query: 239 -----------------------------------WIN-QYSVRRYGRSVPAIQDAWNVL 262
W+ +YS R + AW++L
Sbjct: 435 GLTMEGQEGNEIVYDLLLDQAWSSRPIDTDSYFHDWVKARYSTARRHNVPHELYQAWDIL 494
Query: 263 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
T YN T+ AT V SI + +P K L ++T HP
Sbjct: 495 RTTAYNNTNLATATA---------VSKSIFEL---------QP--KLTGLVNQTGH--HP 532
Query: 323 HLW-YSTSEVIRALELFIASGNELSA---SNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
+ Y S ++R+ +L +++ +E +A +RYD++D+TRQ +A ++LN+ Y
Sbjct: 533 TVVNYEASSLVRSWKLMVSAASESTALWSHPAFRYDMVDVTRQVMANAFIPMYLNVTSTY 592
Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
Q + Q + L+ D+D +L+ +D F L W+ESA+ A+N+ + YE+NAR
Sbjct: 593 Q--KGGPISQQGDSLIRLLRDLDAVLSTNDNFRLATWIESARTWARNDTEADFYEYNARN 650
Query: 439 QITMW 443
QIT+W
Sbjct: 651 QITLW 655
>gi|346324333|gb|EGX93930.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
Length = 751
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 160/546 (29%), Positives = 269/546 (49%), Gaps = 80/546 (14%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LPQ+W++ Q LQKKI+ R+ ELGM P+LPAF G VP + V+P+ + +
Sbjct: 203 NIQGSWGGDLPQAWIEDQFELQKKIVKRMIELGMTPILPAFPGFVPENITRVWPNVSLAE 262
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W R+ + DP F E+ +AF+ +Q + YG + + D F+EN P
Sbjct: 263 SPIWSGFSG--RFTADKYITPYDPHFAELQKAFLTKQNEAYGNVTSFWTLDQFNENKPAS 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK-LVV 179
+Y+ ++ + +++ D AVW+MQGWLF+ D +W ++K+ L+ VP+ + +++
Sbjct: 321 GELDYLKNVSHNTWQTLKAADPSAVWVMQGWLFASDKTYWIDDRVKSFLDGVPVNEDMLL 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---------- 229
LDLFAE P W ++ FYG P+IWC LH++ GN+ +YG ++++ VEA
Sbjct: 381 LDLFAESTPQWQRTESFYGKPWIWCQLHDYGGNMGLYGQIENVTKNAVEAVQTSKSIVGF 440
Query: 230 ----------------------RTSENTTMAWINQYSVRRYG---RSVPA-IQDAWNVLY 263
R T + + + RYG + +P + AW+ +
Sbjct: 441 GLSMEGQEGNEIMYDLLLDQAWRKEAIETDKYFSDWVTVRYGADHKEIPENLYTAWDKVR 500
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP- 322
TVYN TD + V + ++ PSI + + HP
Sbjct: 501 STVYNNTDSSVTA---VTKSIFELAPSISGLV--------------------NRTGHHPT 537
Query: 323 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
+ Y T +I A ++G++ L + YRYDL D TRQ LA + ++E Y+
Sbjct: 538 KITYDTKTLISAWNDMFSAGDQARWLFDNEAYRYDLTDWTRQVLANAFEATYNKLVEKYK 597
Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
N+ GV R +++ MD +L + F L W+++A++ E +E+NAR Q
Sbjct: 598 SNNTKGVKCAGDRLQAILQTMDQVLDTNPSFKLSTWIQAARK--SGGEAADFFEYNARNQ 655
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI-----ESLESGDGFRLK 494
+T+W N + + DY +K W+GL+ +YY R ++ Y++ E ++ +L+
Sbjct: 656 VTLWGPNGE-----IEDYASKQWAGLVGNYYAHRWQMFVDYLVATDPKEYDQNVFKKKLR 710
Query: 495 DWRREW 500
+W +W
Sbjct: 711 EWETQW 716
>gi|29348998|ref|NP_812501.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340905|gb|AAO78695.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 732
Score = 254 bits (649), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 171/566 (30%), Positives = 267/566 (47%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 196 MSNVDFWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D D L+ I R F+E+Q K YG T HIY D F+E P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
S ++++++ + IY + DS A WL W+F YD W P++++ L +VP KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDNKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--------------- 224
LD + + IW ++++YG PYIWC L NF GN + G L+ I F
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431
Query: 225 ----------GPV-------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
P+ +A TT WI +S+ R G I AW L+ +Y
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDYPVTTDQWITNWSMCRGGNQDANIIKAWRALHQKIY 491
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T+ AT ++ A P +T K N + Y + LW
Sbjct: 492 --TEHATCGQSVLMNARP-------RLTGTKSWNTNPGIH-----------YANNDLWQI 531
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+++A + ++ +R+D+I++ RQ L +E Y D G+
Sbjct: 532 WKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSEYRDQFTACYNRKDTTGMR 582
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ S R L+ D+D LL+C +G WL+ A+ ++ YE NAR +T+W
Sbjct: 583 EWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCGATVSEKDYYEENARCILTVW---- 638
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++ + L DY N+ W GL R +Y R + +I ++ F + ++ + +W
Sbjct: 639 GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIAAVSEDKPFDEDKFHQDITQFEYNW 698
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
++ +P+ S D + + L KY
Sbjct: 699 TLQKDSFPIVSEEDPIQIADSLILKY 724
>gi|282877910|ref|ZP_06286719.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
gi|281299911|gb|EFA92271.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
Length = 723
Score = 254 bits (649), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 174/543 (32%), Positives = 252/543 (46%), Gaps = 78/543 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP SWL+QQ LQ++IL+R L M PVLPAFSG+VPA L+ ++P A I
Sbjct: 199 MANIDKWNGPLPMSWLEQQKELQQRILLRERSLNMKPVLPAFSGHVPAKLKELYPQANIK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + R + L+ DPLF +I + ++E+Q +G T HIY D F+E PP
Sbjct: 259 YLGRWAGFSDNYR---CHFLNPEDPLFAKIQKMYLEEQKALFG-TDHIYGIDPFNEVDPP 314
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
PEY+ + IY + S D A W+ W+F ++ W P ++KALL V GK+ +
Sbjct: 315 SWKPEYLKEISHNIYRTVTSVDPGAEWMQMSWMFYHNKKQWTPKRIKALLTGVSRGKMSL 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W T+ FYG PYIWC L NF GN + G
Sbjct: 375 LDYHCENVELWKTTNNFYGQPYIWCYLGNFGGNTTITGNVKESGQRLNEALNKKNKNLIG 434
Query: 219 -------LDSIAFGPVE-----ARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD I F P E A T+ WI+ + R G S P ++ AW +L++ +
Sbjct: 435 IGSTLEGLDVIQF-PYEYILTQAWTATPADKEWIDNLADRHVGFSSPKLRQAWQILFNDI 493
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y T R + + P + P + GKYQ ++ E +W
Sbjct: 494 Y------TQIPRSLGI-LPALRPIL-----GKYQERRTEITYPTKRLEE--------VWK 533
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
S+V N Y+ DLI + RQ L +L L + Y D G+
Sbjct: 534 LMSDVSEC------------DRNEYQLDLIAVGRQVLGNKFLKLKLELDSCYVNKDLVGL 581
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ E++ D+D L A + +G W++ A+ N+ ++ YE NAR IT W
Sbjct: 582 QRTGNTMKEVLVDLDYLTAGNSRCSIGKWIDDARAYGNNDLEKAYYEKNARNLITTW--- 638
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
L DY N+ WSGL+R YY R ++Y + S+ SG F + + + +
Sbjct: 639 ----GGSLNDYANRTWSGLIRTYYVRRWSMYIDELTASVMSGKPFDQQQLDKAIGEFEQN 694
Query: 507 WQN 509
W N
Sbjct: 695 WVN 697
>gi|242077446|ref|XP_002448659.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
gi|241939842|gb|EES12987.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
Length = 252
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 122/185 (65%), Positives = 145/185 (78%)
Query: 353 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 412
YDL+DLTRQ LAKYAN++FL IIE+Y+ N + V L + FL LV D+D LL+ H+GFLL
Sbjct: 51 YDLVDLTRQVLAKYANDVFLKIIESYKSNKMNQVTILCKHFLNLVNDLDTLLSSHEGFLL 110
Query: 413 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 472
GPWLESAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGP
Sbjct: 111 GPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLRDYYGP 170
Query: 473 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
RAAIYFK+++ S+E F L++WRREWI LTN+WQ+ R V+ GD+L S LY K
Sbjct: 171 RAAIYFKHLLLSMEKNAPFALEEWRREWISLTNNWQSDRKVFSTTPTGDSLNISWSLYIK 230
Query: 533 YLQGT 537
YL T
Sbjct: 231 YLSNT 235
>gi|410100551|ref|ZP_11295511.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
CL02T12C30]
gi|409215586|gb|EKN08585.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
CL02T12C30]
Length = 739
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 183/592 (30%), Positives = 271/592 (45%), Gaps = 112/592 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW+D L ++IL R ELGM P+L +F+G VP L+ +P A+I
Sbjct: 194 MTNIETYGGPLPQSWIDSHQALGQQILERQRELGMTPILQSFTGFVPIKLKEKYPDARI- 252
Query: 61 QLGNWFSVKSDPRWC----CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
K RWC T LD DPLF E+G+AF+E+Q K YG T+HIY D F E
Sbjct: 253 --------KDKNRWCNAFTATVQLDPLDPLFKEMGQAFLEEQQKLYG-TNHIYAADPFHE 303
Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
P + Y+ ++G I+ D +AV MQ W +A+ + P +
Sbjct: 304 GAAPSNEKSYLEAVGKVIWEVASGFDPEAVIAMQTWSLR----------EAITRTFPQDR 353
Query: 177 LVVLDLFAEVKPIWSTSK--QFYGVPYIWCMLHNFAGNIEMYGIL--------------- 219
L++LDL W+ K F+ PY+ +LHN+ G + M G L
Sbjct: 354 LLLLDLGG-----WNVEKFNSFWNYPYVAGVLHNYGGRVYMGGNLALYAKNAHELKQSPK 408
Query: 220 -----------DSIAFGPVEARTSENTT--------MAWINQYSVRRYGRSVPAIQDAWN 260
++I PV S T WI Y+ RYG+ + W
Sbjct: 409 GGNIQGIGLFPEAIEHNPVVYELSTEITWMQDAPDLQKWITDYARARYGKLPAGAEQGWK 468
Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
VL TVY G V+ A P + ++ + +P
Sbjct: 469 VLLETVYGSKAGRLPSTESVMCARPALTIQKVAAN----GDLSRP--------------- 509
Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
YST + A++ F+ + N+L S+TYRYDL+D+ RQ L+ + L I EAY
Sbjct: 510 -----YSTVRLWDAVDHFLQASNDLKKSDTYRYDLVDVMRQCLSDLSLPLQKQITEAYLA 564
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
D + Q +FL L++D D LL FLLG W++ A+Q EE++ YEWNART +
Sbjct: 565 EDNEKLQQAGEQFLALIDDFDRLLGTRSTFLLGKWIKEARQWGTTEEEKALYEWNARTLV 624
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF---------------KYMIESL 485
T+W N ++ L +Y N+ W+GL++ YY PR + +Y+ +SL
Sbjct: 625 TVWGPN--HPSAHLFEYSNRQWAGLMKGYYKPRWEKFISYLKAQPKGEWRYDEQYIRKSL 682
Query: 486 ESGDGFRLKDWRREWIKLTN---DWQNGRNVYPVESNGDALITSQWLYNKYL 534
D+ + +LTN DW ++VY G+ + + LY K+L
Sbjct: 683 AGRPALDASDF---YTRLTNWEYDWAFNKDVYTDTPQGNEIEIVKELYAKWL 731
>gi|404406328|ref|ZP_10997912.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 738
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 158/507 (31%), Positives = 246/507 (48%), Gaps = 36/507 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W PLPQSWLD Q+ LQK+I+ R EL M PVLPAF+G+VPA L ++P AKI+
Sbjct: 206 MSNLDYWQSPLPQSWLDAQVELQKRIVARERELNMKPVLPAFAGHVPAELGEIYPEAKIS 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + R ++ LD DPLF I R F+ +Q +G T HIY D F+E PP
Sbjct: 266 RMSKWGGFEDRYR---SHFLDPLDPLFARIQREFLAEQTALFG-TDHIYGADPFNEVDPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PE+++ + IY M D +A WL WLF D W +++A + +VP K+++
Sbjct: 322 SWEPEFLARVSRTIYDTMTEADPEAEWLQMTWLFYLDRDKWHDDRIEAFVTAVPQDKMLL 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
LD + E +W + ++G PY WC L NF GN + G D ++ ++ +E
Sbjct: 382 LDYYCENTEVWRQTHSYHGQPYFWCYLGNFGGNTMLVGNFDEVS-KRIDGVLAEGGN--- 437
Query: 240 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNR------DVIVAFPDVD---- 288
++R G ++ + D+ +Y V+ D D +R D + + D
Sbjct: 438 ----NLRGLGSTLEGL-DSNPFMYDYVFERAWDFPVDDDRWFDALADRYLGYEDTGYRRA 492
Query: 289 ----PSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 344
+ +T KY + ++ L+ + + Y E+ I +G+
Sbjct: 493 WDALRKNVYITSSKY-GHCPLLNARPTLEGILTGTTDAEIKYDNDELFEVWAKMIDAGD- 550
Query: 345 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLL 404
S +TYRY L+++ RQ L L A + D + +L LEL D++ L
Sbjct: 551 -SGRDTYRYWLVNVGRQTLGNLFLPLRDGFTAACRAKDLARMKELRSEMLELAADLETLT 609
Query: 405 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 464
A H F + W++ ++ E+ YE N RT +T W D Q + DY N+ WSG
Sbjct: 610 AQHGAFSMQKWIDDSRSFGTTPEERDYYEVNGRTLLTTWGDRAQS----INDYANRTWSG 665
Query: 465 LLRDYYGPRAAIYFKYMIESLESGDGF 491
L+ DYY R ++ + ++E+G F
Sbjct: 666 LVADYYAERWRMFLDAAVGAVEAGRKF 692
>gi|298386708|ref|ZP_06996263.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
gi|298260382|gb|EFI03251.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
Length = 732
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 170/566 (30%), Positives = 267/566 (47%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 196 MSNVDYWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D D L+ I R F+E+Q K YG T HIY D F+E P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
S ++++++ + IY + DS A WL W+F YD W P++++ L +VP KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDNKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--------------- 224
LD + + IW ++++YG PYIWC L NF GN + G L+ I F
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431
Query: 225 ----------GPV-------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
P+ +A TT WI +S+ R G I AW L+ +Y
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDYSVTTDQWITNWSMCRGGNQDANIIKAWRALHQKIY 491
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T+ AT ++ A P +T K N + Y + LW
Sbjct: 492 --TEHATCGQSVLMNARP-------RLTGTKSWNTNPGIH-----------YANNDLWQI 531
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+++A + ++ +R+D+I++ RQ L ++ Y D G+
Sbjct: 532 WKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSKYRDQFTACYNRKDTTGMR 582
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ S R L+ D+D LL+C +G WL+ A+ ++ YE NAR +T+W
Sbjct: 583 EWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCGATVSEKDYYEENARCILTVW---- 638
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++ + L DY N+ W GL R +Y R + +I ++ F + ++ + +W
Sbjct: 639 GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIAAVSEDKPFDEDKFHQDITQFEYNW 698
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
++ +P+ S D + + L KY
Sbjct: 699 TLQKDSFPIVSEEDPIQIADSLILKY 724
>gi|400599317|gb|EJP67021.1| alpha-N-acetylglucosaminidase, putative [Beauveria bassiana ARSEF
2860]
Length = 753
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 161/546 (29%), Positives = 263/546 (48%), Gaps = 80/546 (14%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LPQ+W+D Q LQ+KI+ R+ ELGM P+LPAF G VP + V+P+ + +
Sbjct: 203 NIQGSWGGDLPQAWIDDQFALQRKIIKRMVELGMTPILPAFPGFVPENITRVWPNVSLAE 262
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W R+ + DP F E+ +AF+ +Q + YG + + D F+EN P
Sbjct: 263 SPTWSGFSG--RFTADKYITPYDPRFAELQKAFLTKQNEAYGNVTSFWTLDQFNENKPAS 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-LVV 179
Y+ ++ + ++ D AVW+MQGWLF+ D +W ++K+ L+ VP+ + +++
Sbjct: 321 GELGYLRNVSHNTWQTLKDADPSAVWVMQGWLFASDKAYWTDDRVKSFLDGVPVNEDMLL 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLFAE P W + FYG P+IWC LH + GN+ +YG ++++ VEA + +
Sbjct: 381 LDLFAESTPQWQRTDSFYGKPWIWCQLHGYGGNMGLYGQIENVTRNAVEAVQKSPSIVGL 440
Query: 238 --------------------AW----------INQYSVRRYG---RSVPA-IQDAWNVLY 263
AW + + RYG + +P + AW+ +
Sbjct: 441 GLSMEGQEGNEIMYNLLLDQAWSKEALETDKYFSDWVTVRYGADQKEIPKDLYTAWDKVR 500
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
TVYN TD + + A ++ T G G +K
Sbjct: 501 STVYNNTDSS-------VTAVAKSIFELVPSTSGLVNRTGHHATK--------------- 538
Query: 324 LWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
+ Y T +I A +G++ L + Y YDL D TRQ LA + ++E Y+
Sbjct: 539 ITYDTETLISAWNDMFNAGSQARWLFDNEAYSYDLTDWTRQVLANAFEATYNKLVEKYKS 598
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
N+ GV R ++ MD +L + F L W+++A++ + +E+NAR Q+
Sbjct: 599 NNIKGVKCAGSRLQAILRTMDQVLETNVHFRLSTWIQAARK--SGGDAADFFEYNARNQV 656
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD----GFR--LK 494
T+W N + + DY +K W+GL+ DYY R ++ Y++ + + GD F+ L
Sbjct: 657 TLWGPNGE-----IEDYASKQWAGLIGDYYAHRWQMFVDYLVAT-DPGDYDQMAFQKTLI 710
Query: 495 DWRREW 500
+W +EW
Sbjct: 711 EWEKEW 716
>gi|395331391|gb|EJF63772.1| alpha-N-acetylglucosaminidase [Dichomitus squalens LYAD-421 SS1]
Length = 750
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 172/556 (30%), Positives = 280/556 (50%), Gaps = 74/556 (13%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP +W+D Q LQK+IL R+ ELGM PVLP+F+G VP AL +++P+A I
Sbjct: 195 FGNIQGSWGGDLPVTWVDDQFQLQKQILQRMVELGMTPVLPSFTGFVPRALSSLYPNASI 254
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W + L+ DPLF I +FI +Q + YG SHIY D ++EN P
Sbjct: 255 VNGSQWEGFPT--ALTNDSFLEPFDPLFTTIQTSFISKQREAYGNVSHIYALDQYNENDP 312
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-KL 177
P Y++++ A ++ +++ D DAVWLMQGWL FS FW +++A L VP +
Sbjct: 313 FSGDPAYLANVTAGTFASLRAADPDAVWLMQGWLFFSSAAFWTNERIEAYLGGVPGNDSM 372
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL++E +P W+ + +YG ++WC LH + GNI M G LD++ P+ A + ++M
Sbjct: 373 IILDLYSEAQPQWNRTSSYYGKQWVWCELHGYGGNIGMEGDLDALTQNPIAALHAPGSSM 432
Query: 238 -----------------------AW----------INQYSVRRYG-RSVP-AIQDAWNVL 262
AW ++Q+ RRY R +P + DAW L
Sbjct: 433 KGVGLTMEGQEGNELVYDILLDQAWSSAPLNLSSYVDQWVARRYNVRRLPKSALDAWRTL 492
Query: 263 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
TVY+ D T + I + ++ P++ +T + T +
Sbjct: 493 ATTVYSNKDSGT---QAAIKSIYELAPALTGMT------------------NRTGHHPTA 531
Query: 323 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
+ + S V+ A + + + +E L+ + YD++D+TRQ L+ + + ++ Y
Sbjct: 532 IPYDTNSTVLVAAKALLEARSENPLLATIPEFAYDVVDVTRQLLSNRFIDHYNVLVATYN 591
Query: 380 LNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ---EKQYEW 434
N V + L L++D+D LLA ++ FLL W+ AK+ ++ + E+
Sbjct: 592 SNATAPRNVAAAAGPLLALLDDLDELLATNEHFLLSNWIADAKRWTHGADRAAYARLLEY 651
Query: 435 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
NAR QIT+W + + + DY +K W+GL+R YY PR + +Y+ ++ E+G +
Sbjct: 652 NARNQITLWGPDGE-----INDYASKAWAGLVRTYYKPRWEAFVEYLAQTKEAGAAYDAH 706
Query: 495 DWRREWIKLTNDWQNG 510
+ I + W NG
Sbjct: 707 VVSAKMIAIGQQWSNG 722
>gi|242809019|ref|XP_002485282.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
ATCC 10500]
gi|218715907|gb|EED15329.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
ATCC 10500]
Length = 755
Score = 251 bits (640), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 159/527 (30%), Positives = 265/527 (50%), Gaps = 75/527 (14%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G W G LP W+D Q LQKKI+ R+ ELGM P+LPAF G VP A+ V P A +
Sbjct: 203 NIQGSWSGSLPYDWVDSQFDLQKKIVKRMTELGMTPILPAFPGFVPRAITRVLPDADVIN 262
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + + + ++ TDP F EI ++FI +Q++ YG + Y D F+EN P
Sbjct: 263 GSAWEAFPT--MYTNDTFMEPTDPHFTEIQKSFIAKQIEAYGNVTTFYTLDQFNENNPSS 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPL-GKLVV 179
Y+ ++ + +++ DS+AVW+MQGWLF S FW +++A L V + L++
Sbjct: 321 GDLSYLRNVSQGTWKTLKAADSNAVWVMQGWLFTSNSAFWTNDRIEAYLGGVAVDSDLLI 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
LDL +E P W + +YG P+IWC +H++ GN+ YG + +I P+ A + ++
Sbjct: 381 LDLASESSPQWQRTNSYYGKPWIWCEIHDYGGNMGFYGQVMNITNNPIAALHNSSSLVGF 440
Query: 236 ----------------------------TMAWINQYSVRRYG--RSVP-AIQDAWNVLYH 264
T ++ + + RY RS+P ++ AW++L
Sbjct: 441 GLSMEGQEGNEIVYDLLLDQAWNAAPIDTESYFHDWVTARYAGSRSIPSSVYSAWDILRT 500
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-H 323
TVYN T+ A + A P +I T G G HP
Sbjct: 501 TVYNNTNLAAN-------AVPKAIFELIPSTTGLLNRTGH----------------HPTK 537
Query: 324 LWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
L Y+T+++++A LF S + L + + +DL+D++RQ LA ++ N+I Y
Sbjct: 538 LNYNTADMVQAWNLFYTSAFKEPSLWLNPAFEFDLVDMSRQVLANAFIPVYENLISTYNT 597
Query: 381 ND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNAR 437
++ + + + + +++ +D +LA + F L WL +A+ A ++ + + E+NAR
Sbjct: 598 SNPSSTKLQTIGAELIGILQALDTVLATNKNFKLSTWLSAARASAGSQHNIEDFLEYNAR 657
Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
QIT+W Q + DY +K W+GL+ YY PR ++ +Y+I++
Sbjct: 658 NQITLWGPTGQ-----ISDYASKSWAGLVSSYYIPRWKMFVEYLIDT 699
>gi|319900259|ref|YP_004159987.1| alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
gi|319415290|gb|ADV42401.1| Alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
Length = 718
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 178/581 (30%), Positives = 270/581 (46%), Gaps = 102/581 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P W Q LQK+IL R+ E G+ PVLP +SG +PA +
Sbjct: 187 MNNLEGWGGPNPDQWYSHQEQLQKRILKRMREYGIEPVLPGYSGMIPANAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG V +WC L +D F I R + ++ + YG+ ++ Y+ D F E
Sbjct: 239 KLG--LDVADPGKWCGYRRPAFLQPSDKNFRRIARLYYKEMTRLYGKANY-YSMDPFHEG 295
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G +I M+ + AVW+ Q W YD ++ ++P
Sbjct: 296 GNTKGVD----LDAAGKSIRDAMKEANPQAVWVAQAWGACPYD---------NMIKNLPE 342
Query: 175 GKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
G ++VLDL++E +P W KQ +G +I+CML NF GN+ +YG ++ +
Sbjct: 343 GDMIVLDLYSESRPQWGDPASAWYRKQGFGRHGWIYCMLLNFGGNVGLYGKMEHVIDEFY 402
Query: 223 -----AFGP------VEARTSENTTM------------------AWINQYSVRRYGRSVP 253
AFG + SEN + W+ Y RYG++ P
Sbjct: 403 KARESAFGGTLQGVGLTMEGSENNPVMYELLCELPWHGRRISKDQWLKSYLKARYGKTTP 462
Query: 254 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
+AW L +T+YN + +T + + V A P ++ YQ
Sbjct: 463 QTVEAWLKLSNTIYNSPNASTQQGTHESVFCARPSLEA---------YQ----------- 502
Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
SS+ +Y+ +++IRA I + E +N + YDLID+ RQA+A+ ++
Sbjct: 503 ----VSSWSEMKDYYAPADIIRAAGKMIEAAEEFRGNNNFEYDLIDVVRQAVAEKGRLVY 558
Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
++ AY+ D S RFLEL+E D LL F LG W A+ + + + Q+
Sbjct: 559 PIVVSAYKAADKQLFEAASARFLELIELQDKLLGTRREFRLGTWTNYARNMGETDAQKDL 618
Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
YEWNAR QIT W + T L DY +K W+GLLRD+Y R YF + +L +G+
Sbjct: 619 YEWNARVQITTWGNRTAANEGGLHDYAHKEWNGLLRDFYYMRWKAYFDELRSTL-NGNAP 677
Query: 492 RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
+ D + L +W N Y E GDA ++ +Y K
Sbjct: 678 KETD----FYTLEENWAGQHNPYSAEPEGDATDIAKEVYGK 714
>gi|383124408|ref|ZP_09945072.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
gi|251839096|gb|EES67180.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
Length = 732
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 170/566 (30%), Positives = 266/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 196 MSNVDYWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D D L+ I R F+E+Q K YG T HIY D F+E P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYQVIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
S ++++++ + IY + DS A WL W+F YD W P++++ L +VP KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDDKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF--------------- 224
LD + + IW ++++YG PYIWC L NF GN + G L+ I F
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431
Query: 225 ----------GPV-------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
P+ +A TT WI +S+ R G I AW L+ +Y
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDYPVTTDQWITNWSMCRGGDQDANIIKAWRALHQNIY 491
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T+ A ++ A P +T K N + Y + LW
Sbjct: 492 --TEYAICGQSVLMNARP-------RLTGTKSWNTNPGI-----------HYANNDLWQI 531
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+++A + ++ +R+D+I++ RQ L +E Y D G+
Sbjct: 532 WKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSEYRDQFTACYNRKDTTGMR 582
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ S R L+ D+D LL+C +G WL+ A+ ++ YE NAR +T+W
Sbjct: 583 EWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARDCGTTVSEKDYYEENARCILTVW---- 638
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++ + L DY N+ W GL R +Y R + +I ++ F + ++ + +W
Sbjct: 639 GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIGAVSKNKPFDEDKFHQDITQFEYNW 698
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
++ +P+ S D + + L KY
Sbjct: 699 TLQKDSFPIVSEEDPIQIADSLILKY 724
>gi|423346424|ref|ZP_17324112.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
CL03T12C32]
gi|409220242|gb|EKN13198.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
CL03T12C32]
Length = 718
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 175/578 (30%), Positives = 266/578 (46%), Gaps = 100/578 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQK+I+ R+ E G+ PV P +SG VP + ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQKRIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + L TDP F EI + ++ K YG+ + Y+ D F E
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKADY-YSMDPFHEGGSV 298
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V + + G AI M+ + AVW+ Q W + P ++ ++ G L+VL
Sbjct: 299 VGVD--LDAAGKAIMQAMKKNNPKAVWVAQAWQANPRP--------QMIGNLEAGDLIVL 348
Query: 181 DLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSI----------A 223
DLFAE +P W K +G +I+CML N+ GN+ ++G + +
Sbjct: 349 DLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNVGLHGKMKHVIDEFYKAKESP 408
Query: 224 FGP------VEARTSENTTMA------------------WINQYSVRRYGRSVPAIQDAW 259
FG + SEN + W+ +Y+V RYG+S P +QDAW
Sbjct: 409 FGKTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDKDQWLREYTVARYGKSNPTVQDAW 468
Query: 260 NVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 317
+L +++YNC D T + + V A P TE YQ S
Sbjct: 469 ILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ---------------VS 504
Query: 318 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 377
S+ +Y + VIRA + ++ +E +N + YDL+D+ RQA+A+ L ++EA
Sbjct: 505 SWSEMKDYYDPNNVIRAAAMMVSVADEFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVVEA 563
Query: 378 -YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
+ D S RFL L+ D LLA F +G W+ A+ L E+++ YEWNA
Sbjct: 564 AFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGSTPEEKELYEWNA 623
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
R QIT W + + LRDY ++ W+G+L+D+Y R +F Y RL D
Sbjct: 624 RVQITTWGNRLAADEGGLRDYAHREWNGILKDFYYMRWKTWFDYQT---------RLLDG 674
Query: 497 RR----EWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
R+ ++ + W NVY E GD + T + ++
Sbjct: 675 RKTAAIDFYAIEERWTKATNVYSSEPEGDCISTVKRIF 712
>gi|154489986|ref|ZP_02030247.1| hypothetical protein PARMER_00215 [Parabacteroides merdae ATCC
43184]
gi|423722990|ref|ZP_17697143.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
CL09T00C40]
gi|154089428|gb|EDN88472.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
43184]
gi|409241820|gb|EKN34587.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
CL09T00C40]
Length = 718
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 176/580 (30%), Positives = 267/580 (46%), Gaps = 104/580 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQK+I+ R+ E G+ PV P +SG VP + ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQKRIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + L TDP F EI + ++ K YG+ + Y+ D F E +
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKADY-YSMDPFHEGGSV 298
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + G AI M+ + AVW+ Q W + P ++ ++ G L+
Sbjct: 299 AGVD----LDAAGKAIMQAMKKNNPKAVWVAQAWQANPRP--------QMIGNLEAGDLI 346
Query: 179 VLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSI--------- 222
VLDLFAE +P W F +I+CML N+ GN+ ++G L +
Sbjct: 347 VLDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNVGLHGKLKHVIDEFYKAKE 406
Query: 223 -AFGP------VEARTSENTTMA------------------WINQYSVRRYGRSVPAIQD 257
FG + SEN + W+ +Y+V RYG+S P +QD
Sbjct: 407 SPFGKTLKGVGMTMEGSENNPVMFELLTELPWCPQRFDKDQWLREYTVARYGKSNPTVQD 466
Query: 258 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
AW +L +++YNC D T + + V A P TE YQ
Sbjct: 467 AWILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ--------------- 502
Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
SS+ +Y ++VIRA + ++ +E +N + YDL+D+ RQA+A+ L ++
Sbjct: 503 VSSWSEMKDYYDPNDVIRAAAMMVSVADEFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVV 561
Query: 376 E-AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEW 434
E A+ D S RFL L+ D LLA F +G W+ A+ L E+++ YEW
Sbjct: 562 EAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGGTPEEKELYEW 621
Query: 435 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
NAR QIT W + + LRDY ++ W+G+L+D+Y R +F Y RL
Sbjct: 622 NARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYYMRWKTWFDYQT---------RLL 672
Query: 495 DWRR----EWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
D R+ ++ + W NVY E GD + T + ++
Sbjct: 673 DGRKTAAIDFYAIEERWTKATNVYSSEPEGDCISTVKRIF 712
>gi|281200618|gb|EFA74836.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
Length = 469
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 166/519 (31%), Positives = 256/519 (49%), Gaps = 88/519 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ W G L W+ Q LQ +IL R+ + GM VLP F+G+VP AL++ +P+A IT
Sbjct: 1 MGNVNEWAGNLTLGWMVDQRDLQIQILTRMRQFGMQAVLPGFAGHVPEALKSHYPNANIT 60
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QL +W T + + F+ I Q YG T H YN D F+E PP
Sbjct: 61 QLSSW---------NMTVYIHQSPNTFMSI-------QQDLYG-TDHFYNFDPFNELEPP 103
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+ + ++++ + + D +W++QGWLF YD FW+PPQ++A L+ VP+GK++V
Sbjct: 104 SSDPAYLKNCSQSMFNNLIAVDPQGIWVLQGWLFVYDTEFWQPPQIEAFLSGVPIGKMIV 163
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SEN---- 234
LDL+A+V W + FYG +IWCMLHNF G MYG + I+ P+EAR+ S N
Sbjct: 164 LDLWADVDAGWKITNYFYGHNWIWCMLHNFGGRSGMYGKIPFISTNPIEARSLSPNMVGT 223
Query: 235 -----------------TTMA----------WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
+ MA W++QY RRYG+ + + D W L TV+
Sbjct: 224 GLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQYVTRRYGKYIEVLADTWYELVGTVF 283
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
NC+ VT+G VS L TS Y P +
Sbjct: 284 NCS----------------------IVTKGPVTIL---VSVRPQLNFTTSLYYDPIV--- 315
Query: 328 TSEVIRALELFIASGN-ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ +A F++ + + ++T+ +DL ++T QAL+ L + A+ LND++
Sbjct: 316 ---ISKAWSAFLSIDDLHVVNTSTFSFDLTEITTQALSNLFMTTELQMNAAF-LNDSYEE 371
Query: 387 FQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
F LS L +++D++ +++ + L+G W A+ L E + YE NAR QIT+W
Sbjct: 372 FSLLSDALLSIIQDINTIVSTQEMLLVGNWTARARALTPANETTELYEMNARNQITLW-- 429
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
S DY K W GL D+Y R ++ + + ++
Sbjct: 430 --GPPDSFDHDYAYKLWGGLTEDFYLARWTLFSQSIFKT 466
>gi|329963073|ref|ZP_08300853.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
gi|328529114|gb|EGF56044.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
Length = 717
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 177/583 (30%), Positives = 276/583 (47%), Gaps = 106/583 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW Q+ LQK+IL R+ E G+ PVLP +SG VP +
Sbjct: 187 MNNLEGWGGPNPDSWYTQREALQKQILKRMREYGIQPVLPGYSGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L TDP F EI + ++ + YG+ + Y+ D F E
Sbjct: 239 RLG--LNV-SDPGLWCGYPRPAFLQPTDPRFGEIADLYYKEMTRLYGK-ADFYSMDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+ VD +++ G AI+ M+ + AVW+ Q W + P+ K ++ ++P
Sbjct: 295 GGSIAGVD----LNAAGQAIWGAMKKVNPKAVWVAQAWQAN-------PRQK-MIENIPQ 342
Query: 175 GKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDLF+E +P W K+ +G +++CML N+ GN+ ++G + +
Sbjct: 343 GDLIVLDLFSESRPQWGDPASTWYRKEGFGKHDWLYCMLLNYGGNVGLHGKMRHVIDEFY 402
Query: 228 EARTS----------------ENTTMA------------------WINQYSVRRYGRSVP 253
+A+TS EN ++ W+ Y+ RYG++
Sbjct: 403 KAKTSPFGKTLKGVGMTMEGSENNSVMFELLCELPWRPAQFEKDEWLKNYTAARYGKADA 462
Query: 254 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
+Q AW +L +++YNC D T + + V A P +D YQ
Sbjct: 463 TVQQAWLLLSNSIYNCPDANTQQGTHESVFCARPGMD---------VYQ----------- 502
Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
SS+ +Y EVIRA + +++ + +N + YDL+D+ RQA+A+ ++
Sbjct: 503 ----VSSWSEMVKYYEPEEVIRAAGILLSAADRFKGNNNFEYDLVDIVRQAVAEKGRLVY 558
Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
+I+A + + S+RFL L+ D LLA F +G W+E A+ L +E++K
Sbjct: 559 PIMIDALKAGEKELFAAASQRFLNLILLQDRLLATRPEFKVGTWIEKARNLGTTQEEKKL 618
Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES--GD 489
YEWNAR QI W + T + LRDY +K W+G+LRD+Y R ++ L
Sbjct: 619 YEWNARVQIATWGNRTAADEGGLRDYAHKEWNGMLRDFYYHRWKLWIDAQTAQLNGAPAQ 678
Query: 490 GFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
GF W TND YP GD + ++ Y +
Sbjct: 679 GFDFYAIEEPWTLQTND-------YPSHPEGDVIEVARTAYKE 714
>gi|383122982|ref|ZP_09943669.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
gi|251841923|gb|EES70003.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
Length = 730
Score = 248 bits (634), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 167/558 (29%), Positives = 273/558 (48%), Gaps = 50/558 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP+SWL+QQ VLQK+IL R + M PVLPAFSG+VP L+ ++P AKI
Sbjct: 197 MSNVDYWQSPLPKSWLEQQEVLQKQILKRERDFNMTPVLPAFSGHVPKELKAIYPDAKIH 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S R ++ ++ D LF I + ++E+Q YG T HIY D F+E P
Sbjct: 257 EMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEEQTAIYG-TDHIYGIDPFNEVDSP 312
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ ++++ + IY + D++A WL W+F +D W P++++ L +VP KL++
Sbjct: 313 NWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHDQKKWTQPRIRSFLEAVPDDKLIL 372
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP----VEARTSENT 235
LD + + IW ++ +YG PY+WC L NF GN M G LD + VE +
Sbjct: 373 LDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMVGNLDDVDVKIEKLFVEGGENVYG 432
Query: 236 TMAWINQYSVRRY--------GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 287
A + + V + P D W C G D R ++ A+ +
Sbjct: 433 LGATLEGFDVNPFMYEFVFDQAWDYPLTTDQW---IQNWAKCRGGNQD--RHILKAWDSL 487
Query: 288 DPSIISVTEGKYQNYGKPVSKEA--VLKSETS-------SYDHPHLWYSTSEVIRALELF 338
I KY G+ V A +L S +Y++ LW +E+++A +
Sbjct: 488 HKKIYK----KYATAGQAVLMNARPMLVGTDSWNTYPDITYNNRDLWDIWTEMLKASHI- 542
Query: 339 IASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 397
+NT YR+D+I++ RQ L + + + Y D G+ + + + L+
Sbjct: 543 ---------NNTGYRFDVINVGRQVLGNLFSSFRDHFTQCYSEKDIDGMKKWADQMDSLL 593
Query: 398 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 457
D D LL+C F +G W++ A+ + E +++ YE NAR +T+W ++A+ L DY
Sbjct: 594 IDTDRLLSCETNFSIGKWIDDARSFGKTEAEKEYYEENARCILTVW----GQKATQLNDY 649
Query: 458 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 517
N+ W GL YY R + +I + SG F K + + +W + +P+
Sbjct: 650 ANRGWGGLTYSYYRERWKRFTTEVITASLSGQKFDEKQFYQSITDFEYEWTLSKEHHPII 709
Query: 518 SNGDALITSQWLYNKYLQ 535
S + ++ ++ L KY+Q
Sbjct: 710 SGENPILLAKTLSEKYMQ 727
>gi|29345848|ref|NP_809351.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337741|gb|AAO75545.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 730
Score = 248 bits (633), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 167/558 (29%), Positives = 273/558 (48%), Gaps = 50/558 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP+SWL+QQ VLQK+IL R + M PVLPAFSG+VP L+ ++P AKI
Sbjct: 197 MSNVDYWQSPLPKSWLEQQEVLQKQILKRERDFNMTPVLPAFSGHVPKELKAIYPDAKIH 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S R ++ ++ D LF I + ++E+Q YG T HIY D F+E P
Sbjct: 257 EMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEEQTAIYG-TDHIYGIDPFNEVDSP 312
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ ++++ + IY + D++A WL W+F +D W P++++ L +VP KL++
Sbjct: 313 NWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHDQKKWTQPRIRSFLEAVPDDKLIL 372
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP----VEARTSENT 235
LD + + IW ++ +YG PY+WC L NF GN M G LD + VE +
Sbjct: 373 LDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMVGNLDDVDVKIEKLFVEGGENVYG 432
Query: 236 TMAWINQYSVRRY--------GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 287
A + + V + P D W C G D R ++ A+ +
Sbjct: 433 LGATLEGFDVNPFMYEFVFDQAWDYPLTTDQW---IQNWAKCRGGNQD--RHILKAWDSL 487
Query: 288 DPSIISVTEGKYQNYGKPVSKEA--VLKSETS-------SYDHPHLWYSTSEVIRALELF 338
I KY G+ V A +L S +Y++ LW +E+++A +
Sbjct: 488 HKKIYK----KYATAGQAVLMNARPMLVGTDSWNTYPDITYNNRDLWDIWTEMLKASHI- 542
Query: 339 IASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 397
+NT YR+D+I++ RQ L + + + Y D G+ + + + L+
Sbjct: 543 ---------NNTGYRFDVINVGRQVLGNLFSSFRDHFTQCYSEKDIDGMKKWADQMDALL 593
Query: 398 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 457
D D LL+C F +G W++ A+ + E +++ YE NAR +T+W ++A+ L DY
Sbjct: 594 IDTDRLLSCETNFSIGKWIDDARSFGKTEAEKEYYEENARCILTVW----GQKATQLNDY 649
Query: 458 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 517
N+ W GL YY R + +I + SG F K + + +W + +P+
Sbjct: 650 ANRGWGGLTYSYYRERWKRFTTEVITASLSGQKFDEKQFYQSITDFEYEWTLSKEHHPII 709
Query: 518 SNGDALITSQWLYNKYLQ 535
S + ++ ++ L KY+Q
Sbjct: 710 SGENPILLAKTLSEKYMQ 727
>gi|429740222|ref|ZP_19273924.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
gi|429153947|gb|EKX96708.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
Length = 730
Score = 248 bits (632), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 167/568 (29%), Positives = 251/568 (44%), Gaps = 79/568 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP+ WL++Q LQK+IL R M PVLPAF+G+VPA L+ +FP A I
Sbjct: 198 MANIDRWNGPLPKEWLEEQRDLQKQILARERAFNMKPVLPAFAGHVPAELKRIFPDANIK 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W D ++ C + L+ +PLF +I + F+E+Q +G T HIY D F+E PP
Sbjct: 258 SLGKWGGF--DEQYLC-HFLNPGEPLFAKIQKLFLEEQTALFG-TDHIYGVDPFNEGEPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+ + +Y + + D A W+ GW+F YD W P ++KA L VP GK+ +
Sbjct: 314 SWEPAYLKEISKNMYGTLTAVDPKAEWMQMGWMFYYDKKVWTPKRVKAFLTGVPQGKMSL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W T+ FYG PYIWC L NF GN + G
Sbjct: 374 LDYHCENVELWKTNDGFYGQPYIWCYLGNFGGNTTLTGNVKETGKRLDAALKAARRNMLG 433
Query: 219 -------LDSIAFGPVEA------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
LD I F P E S+ WI++ + R G + P+++ AW +L+
Sbjct: 434 VGSTLEGLDVIQF-PYEYVFDKVWTHSDKGNQQWIDELADRHAGFTSPSVRKAWQILFDE 492
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
++ G Y S+ VL S
Sbjct: 493 IFVQVPGT----------------------------YSILPSRSPVLNDNHSERTEIKYP 524
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
E + +L L + N + DLI + RQ L + AY D
Sbjct: 525 AQRLEEVWSLLLDVPQ----CERNELQVDLIAVGRQVLGNKFLAVKSEFDAAYAAKDITL 580
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
+ Q + EL+ D+D L + + + W++ A+ L +N E + YE NAR IT+W
Sbjct: 581 LRQKAYEMEELLSDLDCLTSFNTRCTVNKWIDDARALGRNAEMKNYYERNARYLITLW-- 638
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
L DY ++ W GL+ YYG R +Y ++ S ++G F K + + +
Sbjct: 639 -----GGHLSDYASRAWGGLIGSYYGGRWRLYIHDILASAQTGKPFDQKAFDEKRSQFEQ 693
Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKY 533
W + + D L + +++KY
Sbjct: 694 TWVHSTTPITLPQRNDLLTFCKMMFSKY 721
>gi|237719043|ref|ZP_04549524.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451821|gb|EEO57612.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 713
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 176/577 (30%), Positives = 269/577 (46%), Gaps = 93/577 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 181 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 240
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 241 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 296
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 297 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 356
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W ++ F+ PYIWC L NF GN + G
Sbjct: 357 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 416
Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E + + WI + R G +++DAW L++ +
Sbjct: 417 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 475
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y V P T G Y +P + K ++ Y + L
Sbjct: 476 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 510
Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
EV R L NE S+ + +R DLI + RQ L Y ++ + + D
Sbjct: 511 --LEVWRKL-------NEASSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVETKDHQ 561
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 562 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 620
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
L DY ++ W+GL+ DYY R +Y I++ E G D +L+D +E
Sbjct: 621 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAAEKGVEVDQKQLEDELKEIE 674
Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 675 EGWVNATDREDTRKDVH---STTDGLLSFSTFLFSKY 708
>gi|374385779|ref|ZP_09643282.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
12061]
gi|373225481|gb|EHP47815.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
12061]
Length = 715
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 165/542 (30%), Positives = 242/542 (44%), Gaps = 94/542 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW ++Q+ LQ +IL R+ E G+ PV P ++G + P
Sbjct: 186 MNNLEGWGGPNPESWYERQMQLQHRILNRMREYGIEPVFPGYAG--------MLPHNASE 237
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG VK WC L +P F I + + K +G+ + Y D F E
Sbjct: 238 KLG--IEVKDPGLWCGYQRPAFLYPENPAFKRIAGLYYMEMEKRFGK-AKFYGMDPFHEG 294
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
N +D +++ ++ M++ + +AVW+MQ W + ++ ++ G
Sbjct: 295 GNVQGID----LAAAAQSVLQAMKTANPEAVWVMQAW--------QANPRHEMITALQPG 342
Query: 176 KLVVLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
+++LDL +E +P+W K F G +++CML NF GN+ MYG +D + G
Sbjct: 343 NVLILDLSSENRPMWGDKESVWYREKGFEGQDWLYCMLLNFGGNVGMYGRMDRVINGFYA 402
Query: 229 ARTSEN----------------------------------TTMAWINQYSVRRYGRSVPA 254
A N T W+ Y RYG+ P
Sbjct: 403 AVQHPNGASLRGVGKTMEGIENNPVMYELLLELPWRKIPFTKEEWLKGYVKARYGKDDPR 462
Query: 255 IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKS 314
+Q AW +L YNC V A P A S
Sbjct: 463 LQQAWQILGKAAYNCPVVQEGTTESVFCARP------------------------AEEIS 498
Query: 315 ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNI 374
SS+ L+Y+ E + LF+ + +N + YDL D+ RQALA N L I
Sbjct: 499 GASSWGTSELYYAPEESKKVAALFLEVSEQYKGNNNFEYDLTDIMRQALADKGNVLQKKI 558
Query: 375 IEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEW 434
EAY+L D LSR FL+L+ D LLA F LG WLE AK + EE+++ YEW
Sbjct: 559 TEAYRLKDETAFRNLSREFLQLILWQDTLLATRPEFRLGTWLERAKAKGETEEEKRLYEW 618
Query: 435 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
NAR QIT W + + LRDY ++ W+GLL+D+Y PR YF ++E +G+
Sbjct: 619 NARVQITTWGNRQAADKGGLRDYSHREWAGLLKDFYYPRWKAYFD-LLEKRLAGEETEDI 677
Query: 495 DW 496
DW
Sbjct: 678 DW 679
>gi|336412606|ref|ZP_08592959.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
3_8_47FAA]
gi|335942652|gb|EGN04494.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
3_8_47FAA]
Length = 727
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 175/577 (30%), Positives = 269/577 (46%), Gaps = 93/577 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W ++ F+ PYIWC L NF GN + G
Sbjct: 371 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E + + WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACAEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
L DY ++ W+GL+ DYY R +Y I++ E G D +L+D +E
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAAEKGVEVDQKQLEDELKEIE 688
Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|212537509|ref|XP_002148910.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
gi|210068652|gb|EEA22743.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
Length = 768
Score = 245 bits (625), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 159/552 (28%), Positives = 269/552 (48%), Gaps = 77/552 (13%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WG PLP +W+D Q LQKKI+ R+ ELGM P+LPAF G VP A+ V P A +
Sbjct: 212 NIQGSWGSPLPYAWVDSQFDLQKKIVKRMVELGMTPILPAFPGFVPRAITRVLPDADVIN 271
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + + + ++ TDP F EI ++FI +Q YG + Y D F+EN P
Sbjct: 272 GSAWEAFPA--MFTSDTFMEPTDPHFTEIQKSFISKQTAAYGNVTTFYTLDQFNENNPSS 329
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPL-GKLVV 179
Y+ S+ + +++ D AVW+MQGWL FS FW +++A L V + L+V
Sbjct: 330 GDLNYLRSVSHGTWQALKAADPSAVWVMQGWLFFSNSAFWTNDRVEAYLGGVTVDSDLLV 389
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDL +E +P W + ++G P+IWC +H++ GN+ YG + +I P+ A + ++
Sbjct: 390 LDLASESQPQWQRTNSYFGKPWIWCQIHDYGGNMGFYGQVMNITVNPIAALNNATASLVG 449
Query: 238 ---------------------AW----------INQYSVRRYG--RSVPA-IQDAWNVLY 263
AW + + RY +S+P + AW++L
Sbjct: 450 FGLSMEGQEGNEVVYDLLLDQAWSAKPIDTATYFHDWVTARYAGSKSIPTDVYSAWDMLR 509
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP- 322
+VYN T+ A++ A P +I T G G HP
Sbjct: 510 TSVYNNTNLASN-------AVPKAIFELIPSTTGLVNRTGH----------------HPT 546
Query: 323 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
L Y+ +++++A LF ++ + L + Y +DL+D++RQ LA ++ ++I A+
Sbjct: 547 TLNYNPADMVKAWSLFYSAAFKEPSLWLNPAYEFDLVDMSRQVLANAFIPVYHDLIAAWN 606
Query: 380 LNDAHGVF--QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 437
+ + + + +++ +D +L ++ F L W+ +A+ A + E E+NA
Sbjct: 607 TTNPSTIRIQIIGAELIGILQAIDTILDTNEHFKLSTWISAARTSAGEQSLEDFLEYNAL 666
Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR 497
QIT+W Q + DY +K W+GL+ YY PR ++ +Y++++ + + ++
Sbjct: 667 NQITLWGPTGQ-----ISDYASKSWAGLVSSYYIPRWKMFIEYLVDTKPA--QYNQTAFK 719
Query: 498 REWIKLTNDWQN 509
E +K WQN
Sbjct: 720 AELLKWELQWQN 731
>gi|325299497|ref|YP_004259414.1| alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
gi|324319050|gb|ADY36941.1| Alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
Length = 723
Score = 244 bits (623), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 175/572 (30%), Positives = 267/572 (46%), Gaps = 95/572 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PVLP +SG VP A Q + +
Sbjct: 190 MNNLEGWGGPNPDSWYTQQEALQKKILKRMREYGIEPVLPGYSGMVPHDAHQKLGLNVTE 249
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+L N F+ + L TD F EI + E+Q K +G+ ++ Y+ D F E N
Sbjct: 250 PELWNGFTRPA--------FLMPTDKRFAEIAALYYEEQEKLFGKANY-YSMDPFHELEN 300
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G A+ M+ + AVW++QGW + +P RP MK L N G L
Sbjct: 301 AGEVD----FDAAGKAVMDAMKQVNPKAVWVVQGW--TENP--RPEMMKNLKN----GDL 348
Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSI-------AF 224
++LDLF+E +P IW K + +++CML NF N+ ++G +D +
Sbjct: 349 LILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLLNNFYLTKN 408
Query: 225 GPVEA---------------------------RTSENTTMAWINQYSVRRYGRSVPAIQD 257
P+ A R + T +W+ +Y RYG I+
Sbjct: 409 NPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKITKESWLKEYLAARYGAKDEKIEQ 468
Query: 258 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
AW +L +YNC G + + + P ++ +S + K +NY P S EA
Sbjct: 469 AWMILADGIYNCPFGNNQQGPHESIFCGRPSMNNFQVS-SWSKMENYYDPTSTEA----- 522
Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
A L + + ++ +N + YDL+D+ RQALA ++ I
Sbjct: 523 ------------------AARLMLEAADKFRGNNNFEYDLVDIVRQALADRGRIVYNRAI 564
Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 435
++ D + S+ FL L+ D LLA F +G W+ A+ L E++ YEWN
Sbjct: 565 ADFKSFDKRSYARHSKEFLNLLLAQDRLLATRSEFRVGRWINQARSLGNTPEEKDLYEWN 624
Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
AR QIT W + + LRDY +K W+G+L+D+Y R A +++ + L DG ++D
Sbjct: 625 ARVQITTWGNRECADKGGLRDYAHKEWNGILKDFYYKRWAAWWEMLQGVL---DGGEMQD 681
Query: 496 WRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
+W + W N Y E+ GD + T++
Sbjct: 682 --IDWYAMEEPWTLQHNPYKAEAEGDCIETAR 711
>gi|326437768|gb|EGD83338.1| lysosomal alpha-N-acetyl glucosaminidase [Salpingoeca sp. ATCC
50818]
Length = 820
Score = 244 bits (623), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 169/581 (29%), Positives = 255/581 (43%), Gaps = 93/581 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL W PL + W Q LQ KIL R ELGM LP F+G+VP A++ +FP A +T
Sbjct: 215 MGNLKYWAAPLDKDWRTSQYNLQLKILSRARELGMVSALPGFAGHVPTAIKRIFPHANLT 274
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q W + S + LL TDPLF+++G F + +K +G T H++ DT++E P
Sbjct: 275 QTAGWANFNS--TYSDVSLLQPTDPLFLQLGTKFYKMLIKAFG-TDHVFQMDTYNEMQPS 331
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ ++ +Y M + D +AV+LMQGWLF ++ +W P +K L+ VP K+++L
Sbjct: 332 FTNMTLLAESNRVVYQAMANADPEAVYLMQGWLF-HESYWTPEHVKVYLSGVPDDKMIIL 390
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL E P++S + ++G +IW ML N+ G +YG I+ P+ TM
Sbjct: 391 DLNTEANPVFSLTSDYFGKLWIWNMLLNYGGRRGLYGNATDISTRPLLDLHRAQGTMDGI 450
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
WI Y+ RYG+ Q AW +L VY
Sbjct: 451 GITPEAIENNPVMFELMLEMGWHATPPDMHDWIAAYASSRYGKRESLTQSAWQLLLEHVY 510
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ D +R + PD +S +E + N
Sbjct: 511 D----QPDIDRFHMEMVPD-----LSSSESRNSN-------------------------- 535
Query: 328 TSEVIRALELFI--ASGNELSASNTYRYDLIDLTRQA-----------LAKYANELFLNI 374
T+ +++A L + A L + + YDL+D+ RQA L + E NI
Sbjct: 536 TTALVQAWRLLVTAAVNGSLPITGPFSYDLVDVGRQALLNLWSDVRGMLVAHVKEYNANI 595
Query: 375 IEAYQLNDAH--GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
+ +H + L L++ D+D LL +LLG WLESAK A N ++
Sbjct: 596 DSSPSTAASHVPAIKSLFTLLLDITSDLDRLLGTDVNYLLGVWLESAKATAANADERATR 655
Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
E+NAR QIT+W + + + DY K W GL+ DYY R + + +L S
Sbjct: 656 EFNARNQITLWGPDGE-----ITDYAAKQWQGLVSDYYVKRWEMMHDATLSALNSSTKID 710
Query: 493 LKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
++ +K W N YP D + S + KY
Sbjct: 711 TSA-PKDTLKFEQAWGNENKTYPTAPQADVVKVSAAMLQKY 750
>gi|340520426|gb|EGR50662.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
Length = 747
Score = 244 bits (623), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 156/518 (30%), Positives = 264/518 (50%), Gaps = 69/518 (13%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP+SW+D+Q LQ KIL R+ ELG+ P+LPAF G VP + VFP ++
Sbjct: 203 NIQGSWGGTLPRSWVDEQFSLQLKILKRMEELGITPILPAFPGFVPRNISRVFPDISLST 262
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + + ++ DP F ++ + FI +Q + YG ++ + D F+EN P
Sbjct: 263 SPIWSNFGT--TLSADIYINPFDPRFAQLQKLFINKQQELYGNVTNFWTLDQFNENRPLS 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-LVV 179
+Y+ ++ ++ +++ D +AVW+MQ WLFS D FW +++ALL VP+ + +++
Sbjct: 321 GDLDYLRNVSHNTWAALKAADPEAVWVMQAWLFSSDSSFWTNDRVEALLGGVPVNQDMLL 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENT--- 235
LDLFAE P W + FYG P+IWC LHN+ GN+ +YG ++++ ++A R S++
Sbjct: 381 LDLFAESAPQWQRTDSFYGKPWIWCELHNYGGNMGLYGQIENVTINSMDAVRNSDSIVGF 440
Query: 236 ----------------------------TMAWINQYSVRRYG-RSVPAIQDAWNVLYHTV 266
T + + + RYG ++V + W +L TV
Sbjct: 441 GLTMEGQEGNEIMYDLLLDQAWSPKPIDTDTYFHDWVSARYGAKNVKGLYKGWEMLRPTV 500
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
+N T+ + + I+ ++ PSI + G+ +G + + + E S
Sbjct: 501 FNNTNLTVNAVQKSIL---ELTPSISGLL-GRTGRHGTTIMYDPAVMVEAWS-------- 548
Query: 327 STSEVIRALELFIASGNELSASN--TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
ELF A +L+ N +Y+YDL+D TRQ L + + ++++AY + +
Sbjct: 549 ---------ELFKAGLQDLTLFNNPSYQYDLVDWTRQVLVNSFEDHYKDLVDAYNKSSSP 599
Query: 385 GVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
V + + + L++ +D +LA + F L PW++ A+ A + +E+NAR QIT+W
Sbjct: 600 TVIRTRGAKLVTLLKTLDAVLATNKNFQLTPWIDRAR--ASSPSSANFFEFNARNQITLW 657
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
Q E DY +K W+GL+ YY R + Y+
Sbjct: 658 GPQGQIE-----DYASKQWAGLVGTYYAERWQQFVDYL 690
>gi|393236266|gb|EJD43816.1| putative alpha-N-acetylglucosaminidase [Auricularia delicata
TFB-10046 SS5]
Length = 778
Score = 244 bits (623), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 161/560 (28%), Positives = 264/560 (47%), Gaps = 87/560 (15%)
Query: 8 GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
G LP W+D Q LQKKI+ R+ ELGM P LP+F+G VP A+ V P A + W S
Sbjct: 219 GSSLPMEWIDDQFELQKKIVRRMVELGMTPALPSFTGFVPRAISRVLPGASVVNGSRW-S 277
Query: 68 VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
D T+L + DP F + ++FIE+Q+ YG SH+Y D ++EN P + Y+
Sbjct: 278 GFPDALTRVTFL-EPFDPAFARLQKSFIEKQIAAYGPVSHVYTLDQYNENDPLKNDVGYL 336
Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KLVVLDLFAE 185
+ + + +++ D DA+WLMQGWLF S FW +++A L V +++LDLF+E
Sbjct: 337 RDVSRSTWQSLKAADPDAIWLMQGWLFYSNRGFWTNARVEAFLGGVEKNDDMLILDLFSE 396
Query: 186 VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---------- 235
+P W + +YG P+IWC LH++ GN+ +YG + +I VEA +
Sbjct: 397 SEPQWQRTNSYYGKPWIWCQLHDYGGNLGLYGQVMNITLNAVEALEKSPSLVGFGLTMEG 456
Query: 236 ----------------------TMAWINQYSVRRY------GRSVP-AIQDAWNVLYHTV 266
T ++ ++ RRY G +P AI +AW++L TV
Sbjct: 457 QEGNEIMYDLLLSQAWSRKPIDTASYFRSWATRRYNAGGIIGSLLPSAIYNAWDILRTTV 516
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YN T A++ V + ++ P++ + + + + + Y
Sbjct: 517 YNNTKLASNA---VTKSVFELRPALSGI-------------------ANRTGHHATTITY 554
Query: 327 STSEVIRALELFIASGNELSA----SNTYRYDLIDLTRQALAKYANELFLNIIEAYQ--- 379
T +++A +LF + A + Y +D +D RQ L+ + + +++ Y
Sbjct: 555 DTQALVKAYDLFDKAAIYTPALWFNNPAYEFDNVDFARQVLSNAFSTQYDDLVATYNEIS 614
Query: 380 --------LNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
L +A + R + ++ +D +L F L WL+ A+ A+ +E
Sbjct: 615 KPGGSGATLAEAAKIIHDKGERMMGVLASLDKVLRTSKHFTLKKWLQDARAWARGGHEE- 673
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
+E+NAR QIT+W Q + DYG+K W GL+ +YY R I+F Y+ + +G
Sbjct: 674 LFEYNARNQITLWGPTGQ-----INDYGSKAWGGLVSEYYAQRWRIFFTYLESVVAAGQP 728
Query: 491 FRLKDWRREWIKLTNDWQNG 510
F L +++ DWQ
Sbjct: 729 FNLTAVGNQFLAFQLDWQTA 748
>gi|294647264|ref|ZP_06724861.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|292637401|gb|EFF55822.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
Length = 733
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 168/566 (29%), Positives = 264/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 196 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 256 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 312 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F
Sbjct: 372 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 431
Query: 227 -----------------VEARTSENT--TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
V R +N+ WI ++ R G I AW LY +Y
Sbjct: 432 LGVTLEGLDVNPLMYEFVFERAWQNSMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKIY 491
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T A ++ A P ++ T Y Y + LW
Sbjct: 492 --TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWEI 531
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 532 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGTK 582
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+R +L+ D+D LL C F +G W++ A+ A NE+++K YE NAR +T+W
Sbjct: 583 VWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW---- 638
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +W
Sbjct: 639 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 698
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
+P+ S + + ++ L KY
Sbjct: 699 TLKNEDFPIISEENPISLAKELILKY 724
>gi|423259033|ref|ZP_17239956.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
CL07T00C01]
gi|423263996|ref|ZP_17242999.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
CL07T12C05]
gi|387776613|gb|EIK38713.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
CL07T00C01]
gi|392706262|gb|EIY99385.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
CL07T12C05]
Length = 718
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 171/577 (29%), Positives = 263/577 (45%), Gaps = 103/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
G N M W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKPP 679
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
++ ++ + W N Y E+ GD + ++
Sbjct: 680 AKI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|294807833|ref|ZP_06766618.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|294444952|gb|EFG13634.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
Length = 703
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 167/566 (29%), Positives = 263/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 166 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 225
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 226 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 281
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 282 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 341
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F
Sbjct: 342 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 401
Query: 227 -----------------VEARTSENT--TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
V R +N+ WI ++ R G I AW LY +Y
Sbjct: 402 LGVTLEGLDVNPLMYEFVFERAWQNSMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKIY 461
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T A ++ A P ++ E Y + LW
Sbjct: 462 --TSAALCGQAVLMNARPQLE------------------GVEGWNTLPGYDYKNIDLWEI 501
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 502 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGTK 552
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+R +L+ D+D LL C F +G W++ A+ A NE+++K YE NAR +T+W
Sbjct: 553 VWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW---- 608
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +W
Sbjct: 609 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 668
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
+P+ S + + ++ L KY
Sbjct: 669 TLKNEDFPIISEENPISLAKELILKY 694
>gi|345511813|ref|ZP_08791352.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|229443748|gb|EEO49539.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
Length = 720
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 168/566 (29%), Positives = 264/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 183 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 242
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 243 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 298
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 299 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 358
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F
Sbjct: 359 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 418
Query: 227 -----------------VEARTSENT--TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
V R +N+ WI ++ R G I AW LY +Y
Sbjct: 419 LGVTLEGLDVNPLMYEFVFERAWQNSMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKIY 478
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T A ++ A P ++ T Y Y + LW
Sbjct: 479 --TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWEI 518
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 519 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGTK 569
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+R +L+ D+D LL C F +G W++ A+ A NE+++K YE NAR +T+W
Sbjct: 570 VWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW---- 625
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +W
Sbjct: 626 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 685
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
+P+ S + + ++ L KY
Sbjct: 686 TLKNEDFPIISEENPISLAKELILKY 711
>gi|262406058|ref|ZP_06082608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294806855|ref|ZP_06765680.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345510563|ref|ZP_08790130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|262356933|gb|EEZ06023.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294445884|gb|EFG14526.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345454460|gb|EEO49066.2| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
Length = 727
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 175/577 (30%), Positives = 267/577 (46%), Gaps = 93/577 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W ++ F+ PYIWC L NF GN + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E + + WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
L DY ++ W+GL+ DYY R +Y I++ E G D +L+D +E
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYVNTFIKAAEEGVEVDQKQLEDELKEIE 688
Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|262407713|ref|ZP_06084261.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|262354521|gb|EEZ03613.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
Length = 735
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 167/566 (29%), Positives = 263/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 198 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 258 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 314 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F
Sbjct: 374 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 433
Query: 227 -----------------VEARTSENT--TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
V R +N+ WI ++ R G I AW LY +Y
Sbjct: 434 LGVTLEGLDVNPLMYEFVFERAWQNSMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKIY 493
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T A ++ A P ++ E Y + LW
Sbjct: 494 --TSAALCGQAVLMNARPQLE------------------GVEGWNTLPGYDYKNIDLWEI 533
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 534 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGTK 584
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+R +L+ D+D LL C F +G W++ A+ A NE+++K YE NAR +T+W
Sbjct: 585 VWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW---- 640
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +W
Sbjct: 641 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 700
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
+P+ S + + ++ L KY
Sbjct: 701 TLKNEDFPIISEENPISLAKELILKY 726
>gi|83775903|dbj|BAE66022.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 633
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 160/532 (30%), Positives = 257/532 (48%), Gaps = 80/532 (15%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 73 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 132
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L+ D F ++ ++ I +Q + +G +H+Y D F+E
Sbjct: 133 VVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISRQTRAFGNVTHVYALDQFNEIN 190
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW P ++ A L+ V
Sbjct: 191 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDPNRISAYLSGVERND 249
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA ++
Sbjct: 250 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEALNKSDS 309
Query: 236 --------------------------------TMAWINQYSVRRYGR--SVP-AIQDAWN 260
T A+ + RY SVP + AW+
Sbjct: 310 LVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQSWVRSRYSGNFSVPNELYTAWD 369
Query: 261 VLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEGKYQ-----NYGKPVSKEAVLKS 314
+L TVYN T+ T + + PD+ + V G Y NY V E
Sbjct: 370 LLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--GHYPTPTSINYDPMVLNEVWSLF 427
Query: 315 ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNI 374
++ P LW+S + Y YD++D+TRQ + ++ ++
Sbjct: 428 MNATRKEPSLWHSPA---------------------YEYDMVDITRQLMGNAFVNVYSDL 466
Query: 375 IEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E +
Sbjct: 467 ISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNENFSLTTWISSARDWGNTTETKDF 526
Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
+E+NAR QIT+W + + DY +K W+GL+ YY PR +I+ Y+ E
Sbjct: 527 FEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGE 573
>gi|53711968|ref|YP_097960.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis YCH46]
gi|52214833|dbj|BAD47426.1| alpha-N-acetylglucosaminidase precursor [Bacteroides fragilis
YCH46]
Length = 718
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 171/577 (29%), Positives = 262/577 (45%), Gaps = 103/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
G N M W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|313145188|ref|ZP_07807381.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
gi|313133955|gb|EFR51315.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
Length = 718
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 173/577 (29%), Positives = 262/577 (45%), Gaps = 103/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ + Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLGA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
G L++LDL +E +P W S K YG +++CML N+ GN+ ++G +D++
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 223 --AFGPVEARTSENTTMA----------------------------WINQYSVRRYGRSV 252
P T + MA W+ +Y RYG
Sbjct: 404 LAKADPHAGSTLKGVGMAPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y +VI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ AY+ D S++FL L+ D LL F +G W+E A+ L E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEKA 619
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF + + LE
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDSLSQKLEGKTP 679
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
++ ++ + W N Y E+ GD + T++
Sbjct: 680 EKI-----DFYAVEEPWAKATNPYSAEAEGDCIETAK 711
>gi|423248659|ref|ZP_17229675.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
CL03T00C08]
gi|423253608|ref|ZP_17234539.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
CL03T12C07]
gi|392655237|gb|EIY48880.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
CL03T12C07]
gi|392657600|gb|EIY51231.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
CL03T00C08]
Length = 718
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 171/577 (29%), Positives = 262/577 (45%), Gaps = 103/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
G N M W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|383115203|ref|ZP_09935961.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
gi|313695380|gb|EFS32215.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
Length = 727
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 173/574 (30%), Positives = 261/574 (45%), Gaps = 87/574 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W ++ F+ PYIWC L NF GN + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E + + WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNADDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFFDVKVEFDRMVEAKDYQ 575
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
L DY ++ W+GL+ DYY R +Y I++ E G K E ++
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAAEKGVEVDQKQLEDELKEIE 688
Query: 505 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 533
W N + V S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDVRKDIHSATDGLLSFSTFLFSKY 722
>gi|336386984|gb|EGO28130.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 738
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 158/536 (29%), Positives = 267/536 (49%), Gaps = 80/536 (14%)
Query: 7 WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW- 65
WGG LP+ W++ Q LQK+I+ R+ ELGM PVLP+F+G VP A+ ++P+A I W
Sbjct: 191 WGGDLPEQWINDQFALQKQIISRMVELGMTPVLPSFTGFVPRAMHTLYPNASIVNGSQWN 250
Query: 66 -FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
F+++ + L+ DPLF + +FI +Q+ YG SH+Y D ++EN+P
Sbjct: 251 GFTIQ----YTNDSFLEPFDPLFSTLQTSFISKQVAAYGNVSHVYTLDQYNENSPYSGDT 306
Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-KLVVLDL 182
Y++++ AA ++ +++ D AVWLMQGWLF D FW +++A L VP +++LDL
Sbjct: 307 SYLANVTAATFASLRAADPQAVWLMQGWLFYSDSTFWTTERVEAYLGGVPGNDSMIILDL 366
Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM----- 237
++E +P W ++G +IWC LH++ GN+ G +++ P++A + +M
Sbjct: 367 YSEAQPQWQRLNSYFGKQWIWCELHDYGGNMGFEGNFENVTTQPIKALATPGNSMVGMGL 426
Query: 238 ----------------------------AWINQYSVRRYGRSVP----AIQDAWNVLYHT 265
A+I+ ++ RRY +VP A +AW +L T
Sbjct: 427 TMEGQEGNEIIYDVLLDQAWSSTPLNRTAYISAWASRRY--NVPDLPTAALEAWEILGAT 484
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYN D T I+ ++ PSI L + T ++ +
Sbjct: 485 VYNNQDVTTQSTVKSIL---ELSPSITG------------------LVNRTGTHSTKLFY 523
Query: 326 YSTSEVIRALELFIASGNELSA-SNT--YRYDLIDLTRQALAKYANELFLNIIEAY--QL 380
+ + ++ AL+L + + E SA SN ++YD++D+TRQ LA +L+ ++I+ +
Sbjct: 524 DTNTTIVPALKLLLQARQEASALSNIPEFQYDVVDVTRQLLANRFIDLYTSLIDTFSSTS 583
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ--NEEQEKQYEWNART 438
+ + V L L++D+D +L FLL W+ +A+ N E+NAR
Sbjct: 584 SSSSAVSAAGAPLLALLQDLDSVLLTDTHFLLARWISAARNWTHGDNATYAAYLEYNARN 643
Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
Q+T+W + + DY +K W GL+ YY R + Y+ S E+ + +
Sbjct: 644 QVTLWGPRGE-----VNDYASKQWGGLVGTYYVQRWETFVGYLAGSKENATVYNVS 694
>gi|298480128|ref|ZP_06998327.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|336404356|ref|ZP_08585054.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
gi|298273937|gb|EFI15499.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|335943684|gb|EGN05523.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
Length = 727
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 172/574 (29%), Positives = 261/574 (45%), Gaps = 87/574 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W ++ F+ PYIWC L NF GN + G
Sbjct: 371 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E + + WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
L DY ++ W+GL+ DYY R +Y I+++ G K E ++
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAVGEGVEVDQKQLEDELKEIE 688
Query: 505 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 533
W N + V S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDVRKDVHSTTDGLLSFSTFLFSKY 722
>gi|391873368|gb|EIT82411.1| alpha-N-acetylglucosaminidase [Aspergillus oryzae 3.042]
Length = 633
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 159/535 (29%), Positives = 262/535 (48%), Gaps = 84/535 (15%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 73 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 132
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L D F ++ ++ I +Q++ +G +H+Y D F+E
Sbjct: 133 VVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISRQMRAFGNITHVYALDQFNEIN 190
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW ++ A L+ V
Sbjct: 191 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDSNRISAYLSGVERND 249
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA N+
Sbjct: 250 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEALNKSNS 309
Query: 236 --------------------------------TMAWINQYSVRRYGR--SVP-AIQDAWN 260
T A+ + RY R SVP + AW+
Sbjct: 310 LVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQSWVRSRYSRNFSVPNELYTAWD 369
Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS--KEAVLKSET-- 316
+L TVYN T+ T V + ++ P I + G+ +Y P S + ++ +E
Sbjct: 370 LLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GRVGHYPTPTSINYDPMVLNEVWS 425
Query: 317 ----SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
++ P LW++ + Y YD++D+TRQ + ++
Sbjct: 426 LFMNATRKEPSLWHNPA---------------------YEYDMVDITRQLMGNAFVNVYS 464
Query: 373 NIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
+I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E +
Sbjct: 465 VLITSWKSETENRTTKVTSQSERLLNLLSAIDKVLSCNENFSLATWISSARDWGNTTETK 524
Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
+E+NAR QIT+W + + DY +K W+GL+ YY PR +I+ Y+ E+
Sbjct: 525 DFFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGEN 574
>gi|336374066|gb|EGO02404.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 761
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 158/536 (29%), Positives = 267/536 (49%), Gaps = 80/536 (14%)
Query: 7 WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW- 65
WGG LP+ W++ Q LQK+I+ R+ ELGM PVLP+F+G VP A+ ++P+A I W
Sbjct: 214 WGGDLPEQWINDQFALQKQIISRMVELGMTPVLPSFTGFVPRAMHTLYPNASIVNGSQWN 273
Query: 66 -FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
F+++ + L+ DPLF + +FI +Q+ YG SH+Y D ++EN+P
Sbjct: 274 GFTIQ----YTNDSFLEPFDPLFSTLQTSFISKQVAAYGNVSHVYTLDQYNENSPYSGDT 329
Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-KLVVLDL 182
Y++++ AA ++ +++ D AVWLMQGWLF D FW +++A L VP +++LDL
Sbjct: 330 SYLANVTAATFASLRAADPQAVWLMQGWLFYSDSTFWTTERVEAYLGGVPGNDSMIILDL 389
Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM----- 237
++E +P W ++G +IWC LH++ GN+ G +++ P++A + +M
Sbjct: 390 YSEAQPQWQRLNSYFGKQWIWCELHDYGGNMGFEGNFENVTTQPIKALATPGNSMVGMGL 449
Query: 238 ----------------------------AWINQYSVRRYGRSVP----AIQDAWNVLYHT 265
A+I+ ++ RRY +VP A +AW +L T
Sbjct: 450 TMEGQEGNEIIYDVLLDQAWSSTPLNRTAYISAWASRRY--NVPDLPTAALEAWEILGAT 507
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYN D T I+ ++ PSI L + T ++ +
Sbjct: 508 VYNNQDVTTQSTVKSIL---ELSPSITG------------------LVNRTGTHSTKLFY 546
Query: 326 YSTSEVIRALELFIASGNELSA-SNT--YRYDLIDLTRQALAKYANELFLNIIEAY--QL 380
+ + ++ AL+L + + E SA SN ++YD++D+TRQ LA +L+ ++I+ +
Sbjct: 547 DTNTTIVPALKLLLQARQEASALSNIPEFQYDVVDVTRQLLANRFIDLYTSLIDTFSSTS 606
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ--NEEQEKQYEWNART 438
+ + V L L++D+D +L FLL W+ +A+ N E+NAR
Sbjct: 607 SSSSAVSAAGAPLLALLQDLDSVLLTDTHFLLARWISAARNWTHGDNATYAAYLEYNARN 666
Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
Q+T+W + + DY +K W GL+ YY R + Y+ S E+ + +
Sbjct: 667 QVTLWGPRGE-----VNDYASKQWGGLVGTYYVQRWETFVGYLAGSKENATVYNVS 717
>gi|260642393|ref|ZP_05415712.2| alpha-N-acetylglucosaminidase [Bacteroides finegoldii DSM 17565]
gi|260622285|gb|EEX45156.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides finegoldii DSM
17565]
Length = 735
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 168/566 (29%), Positives = 263/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 198 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I F+E+Q K YG T+HIY D F+E P
Sbjct: 258 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQHRFLEEQTKVYG-TNHIYGIDPFNEVDSP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +QS DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 314 NWNEEFLSNVSDKIYKSIQSVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F
Sbjct: 374 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 433
Query: 227 -----------------VEARTSENT--TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
V R EN+ WI ++ R G I AW LY +Y
Sbjct: 434 LGVTLEGLDVNPLMYEFVFERAWENSIPVHQWIANWAQCRGGNVDNHIIKAWKQLYEKIY 493
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T A ++ A P ++ T Y Y + LW
Sbjct: 494 --TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWEI 533
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 534 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFADCYRKKDLEGTK 584
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+R +L+ D+D LL C +G W++ A+ A NE+++K YE NAR +T+W
Sbjct: 585 VWGQRMDQLLLDVDRLLCCSPVLSIGKWIKDARDFAVNEQEQKYYEENARCILTVW---- 640
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +W
Sbjct: 641 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 700
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
+P+ S + + ++ L KY
Sbjct: 701 TLKNEDFPITSGENPISLAKELILKY 726
>gi|423293377|ref|ZP_17271504.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
CL03T12C18]
gi|392678320|gb|EIY71728.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 172/577 (29%), Positives = 267/577 (46%), Gaps = 93/577 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W ++ F+ PYIWC L NF GN + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E + + WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y + + T G Y +P + K ++ Y + L
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
L DY ++ W+GL+ DYY R +Y I+++ G D +L+D +E
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAVGEGVEVDQKQLEDELKEIE 688
Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|380697007|ref|ZP_09861866.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 703
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 169/566 (29%), Positives = 263/566 (46%), Gaps = 70/566 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 166 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 225
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 226 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 281
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 282 NWNEEFLSNVSDKIYKSIQDVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 341
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP------------- 226
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F
Sbjct: 342 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 401
Query: 227 -----------------VEARTSENTTMA--WINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
V R EN+ A WI ++ R G I AW LY +Y
Sbjct: 402 LGVTLEGLDVNPLMYEFVFERAWENSMPAHQWIANWAQCRGGNVDNHIVKAWKQLYEKIY 461
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T A ++ A P ++ T Y Y + LW
Sbjct: 462 --TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWEI 501
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
E+++A ++ + Y +D+I++ RQ L + + Y+
Sbjct: 502 WKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKKLEETK 552
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+R +L+ D+D LL C F +G W++ AK A NE+++K YE NAR +T+W
Sbjct: 553 VWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDAKDFAVNEQEQKYYEENARCILTVW---- 608
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +W
Sbjct: 609 GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYEW 668
Query: 508 QNGRNVYPVESNGDALITSQWLYNKY 533
+P+ S + + ++ L KY
Sbjct: 669 TLKNEDFPITSEENPISLAKELILKY 694
>gi|403416059|emb|CCM02759.1| predicted protein [Fibroporia radiculosa]
Length = 705
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 162/577 (28%), Positives = 279/577 (48%), Gaps = 74/577 (12%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G W G LP W++ Q LQ++I+ R+ ELGM PVLPAF+G VP A+ ++P+A I
Sbjct: 158 NIQGSWSGALPTQWINDQWALQQQIVQRMVELGMTPVLPAFTGFVPRAMSTLYPNASIVN 217
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTPP 120
W S + T L+ DPLF + ++FI +Q YG SH+Y D ++EN P
Sbjct: 218 GSQWEGFPSTLTY--TTFLEPFDPLFTTMQKSFISKQQAAYGANVSHVYTLDQYNENDPY 275
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-KLV 178
Y++++ A ++ +Q+ D +AVW+MQGWL F+ + FW ++ A L +VP ++
Sbjct: 276 SGDVGYLANISAGTFASLQAADPEAVWMMQGWLFFASEAFWTTERIAAFLGAVPSNDSMI 335
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT-- 236
+LDL++E P W + +YG +IWC LH+F GN+ G L + GP++A ++ ++
Sbjct: 336 ILDLYSEAAPQWQRTDSYYGKQWIWCELHDFGGNMGFEGNLPELVTGPIQALSNASSMRG 395
Query: 237 --------------------MAW----------INQYSVRRYG-RSVP-AIQDAWNVLYH 264
AW + + RRY + +P A Q+AW +L
Sbjct: 396 MGLTPEGQEGNEIVYDILLDQAWSSTSIDIASYVEAWVARRYTVQDLPSAAQEAWTILST 455
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
TVY+ +D P+ +I S+ E ++ + ++ + + +
Sbjct: 456 TVYSNSD-------------PNTQATIKSIFE---------LAPDLSGLTDRTGHHCTEI 493
Query: 325 WYSTS-EVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
Y T+ ++ AL+ + + E L + + YD++D+TRQ LA +++ ++ +
Sbjct: 494 PYDTNITIVPALQNLVQAATENPLLLSVPEFMYDVVDVTRQLLANRFIDVYNELVSTFYS 553
Query: 381 N--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNAR 437
A V + L ++ D+D LL +D FLL W+ A L+ N Y E+NAR
Sbjct: 554 TGVTAASVKNAGQPLLTILSDVDTLLWTNDNFLLSNWILGAINLSDNNGTYADYLEYNAR 613
Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR 497
QIT+W + + + DY +K W+G + YY R ++ Y+ + ++G + +
Sbjct: 614 NQITLWGPDGE-----INDYASKQWAGFVGTYYYDRWNMFITYLEDITQNGTAYNDTAIQ 668
Query: 498 REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
+ +W +GD + L K+L
Sbjct: 669 TVMLNFGKEWDTQTYSLSATVSGDTMSIVDSLIQKWL 705
>gi|317158657|ref|XP_001827155.2| alpha-N-acetylglucosaminidase [Aspergillus oryzae RIB40]
Length = 849
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 160/532 (30%), Positives = 257/532 (48%), Gaps = 80/532 (15%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 157 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 216
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L+ D F ++ ++ I +Q + +G +H+Y D F+E
Sbjct: 217 VVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISRQTRAFGNVTHVYALDQFNEIN 274
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW P ++ A L+ V
Sbjct: 275 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDPNRISAYLSGVERND 333
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA ++
Sbjct: 334 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEALNKSDS 393
Query: 236 --------------------------------TMAWINQYSVRRYGR--SVP-AIQDAWN 260
T A+ + RY SVP + AW+
Sbjct: 394 LVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQSWVRSRYSGNFSVPNELYTAWD 453
Query: 261 VLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEGKYQ-----NYGKPVSKEAVLKS 314
+L TVYN T+ T + + PD+ + V G Y NY V E
Sbjct: 454 LLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--GHYPTPTSINYDPMVLNEVWSLF 511
Query: 315 ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNI 374
++ P LW+S + Y YD++D+TRQ + ++ ++
Sbjct: 512 MNATRKEPSLWHSPA---------------------YEYDMVDITRQLMGNAFVNVYSDL 550
Query: 375 IEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E +
Sbjct: 551 ISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNENFSLTTWISSARDWGNTTETKDF 610
Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
+E+NAR QIT+W + + DY +K W+GL+ YY PR +I+ Y+ E
Sbjct: 611 FEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGE 657
>gi|423214204|ref|ZP_17200732.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693149|gb|EIY86384.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
CL03T12C04]
Length = 727
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 170/574 (29%), Positives = 260/574 (45%), Gaps = 87/574 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T H+Y D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHVYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W ++ F+ PYIWC L NF GN + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGERLENALINGGGNLKG 430
Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E + + WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDDKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y + + T G Y +P + K ++ Y + L
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNIEL-- 524
Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
L DY ++ W+GL+ DYY R +Y I++ E G K E ++
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAAEKGVEVDQKQLEDELKEIE 688
Query: 505 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 533
W N + V S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDVRKDIHSATDGLLSFSTFLFSKY 722
>gi|224025137|ref|ZP_03643503.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
18228]
gi|224018373|gb|EEF76371.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
18228]
Length = 718
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 177/581 (30%), Positives = 274/581 (47%), Gaps = 102/581 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW ++Q LQK+IL R+ E G+ PVLP +SG VP ++ +
Sbjct: 187 MNNLEGWGGPNPDSWYERQEELQKRILKRMREYGIEPVLPGYSGMVPHNAKDRL-GLNVA 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W PR L TDP F I + + + YG+ S+ Y+ D F E NT
Sbjct: 246 DPGRW---NGYPR---PAFLQPTDPQFERIAALYYREMTRLYGKVSY-YSMDPFHEGGNT 298
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + G AI+ M+ + A W++Q W +P PQM + ++P G +V
Sbjct: 299 SGVD----LEAAGKAIWKAMKQANPRAAWVVQAW--GANP---RPQM---IRNLPAGDMV 346
Query: 179 VLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYG-------------- 217
VLDLF+E +P W K+ +G +++CML N+ GN+ ++G
Sbjct: 347 VLDLFSESRPQWGDPASSWYRKEGFGQHDWLFCMLLNYGGNVGLHGKMAHLIEEFYKAKD 406
Query: 218 ------------ILDSIAFGPV-----------EARTSENTTMAWINQYSVRRYGRSVPA 254
++ I PV E R S++ W+ Y RYG+S
Sbjct: 407 SSFGKTLKGVGMTMEGIENNPVMYELLCELPWREQRFSKDE---WLEGYLKARYGKSDSQ 463
Query: 255 IQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
+ AW +L +T+YNC +T + + ++ A P +S
Sbjct: 464 VSQAWMLLSNTIYNCPAASTQQGTHESILCARPSWKAYQVSSW----------------- 506
Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
SE S Y Y ++VIRA + + + +N + YDL+D+ RQA+A+ ++
Sbjct: 507 -SEMSDY------YDPADVIRAAGMMVDAAERFRGNNNFEYDLVDIVRQAVAEKGRLMYR 559
Query: 373 NIIEAYQLNDAHGVFQLSR-RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
+++AY+ D +F+LS RFL L+ D LLA F +G WLESA+ L EE++
Sbjct: 560 VLVDAYKAGDRE-LFKLSSDRFLRLILMQDRLLATRSEFKVGRWLESARNLGSTEEEKDW 618
Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
YEWNAR QIT W + + L DY ++ W+GLLRD+Y R + ++S E G
Sbjct: 619 YEWNARVQITTWGNRVAADDGGLHDYAHREWNGLLRDFYYLRWKTWLDEQLKSFEGGQPK 678
Query: 492 RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
+ ++ L W N Y E+ G+ + + +Y +
Sbjct: 679 AI-----DFYALEEPWTLKHNSYASEAEGNPVDIACEIYRE 714
>gi|373461342|ref|ZP_09553084.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
gi|371952896|gb|EHO70729.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
Length = 731
Score = 242 bits (617), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 152/491 (30%), Positives = 254/491 (51%), Gaps = 24/491 (4%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++GW GPLPQSW+D Q LQ++IL R E GM PVLPAF+G+VP + + P A+IT
Sbjct: 194 MLNINGWQGPLPQSWIDGQADLQRRILQREREFGMRPVLPAFNGSVPLDYKRLHPEARIT 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G W R TY L TDP F ++ ++F+++Q + +G T H+Y D+F+E PP
Sbjct: 254 EVGQWGGFGQAYR---TYFLSPTDPRFGKLQKSFLDEQRRMFG-TDHLYCLDSFNEVQPP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
SP+ + L I++ + D +VW+ GWLF D W P ++A L+ +P + ++
Sbjct: 310 SWSPDTLCMLARHIHASLDKADPQSVWVQMGWLFYNDRKHWTPDVIRAYLSGIPKDRALL 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
LD + + +W ++ FYG PYI C+L NF GN + G + ++ ++A +++ MA
Sbjct: 370 LDYYIDHTELWRLTESFYGRPYIACVLGNFGGNTMLQGDVGKVS-SRLDAAIAQDGNMAG 428
Query: 240 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT-EGK 298
+ ++ +G + D + ++ ++C G TD RD + D S
Sbjct: 429 VGA-TMEGFGVN----PDFYAFVFDKAWDC--GTTD--RDWLCRMADRHVGFASAAGRTA 479
Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHL---WYSTSEVIRALELFIASGNELSASNTYRYDL 355
+Q + V +S T P + +T+ L ++ + S + YD+
Sbjct: 480 WQVLFDRIMPSYVNESGTVVCARPSFEARYLNTTYPAELLGVWKMLLDIDSDKREHLYDV 539
Query: 356 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 415
+++ RQ L + + AY + V +RR ++++D+D LLAC + F L W
Sbjct: 540 VNVGRQVLGDFFAFERDGLHRAYLSQRSDSVDYYARRMDKMLDDLDRLLACSEEFSLRKW 599
Query: 416 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 475
+E A+ ++ YE NART IT+W D+ Q L DY N+ W+GL+ YY R
Sbjct: 600 IEDARGFGATAAEKDYYERNARTLITVWGDSRQ-----LTDYANRTWAGLVSSYYKQRWH 654
Query: 476 IYFKYMIESLE 486
I+ ++ ++
Sbjct: 655 IFTAHVRRAVR 665
>gi|424666301|ref|ZP_18103337.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
616]
gi|404573840|gb|EKA78592.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
616]
Length = 718
Score = 242 bits (617), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 174/577 (30%), Positives = 261/577 (45%), Gaps = 103/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ + Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDS------ 221
G L++LDL +E +P W S K YG +++CML N+ GN+ ++G +D+
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 222 ---------------------IAFGPVEA--------RTSENTTMAWINQYSVRRYGRSV 252
I PV R T W+ +Y RYG
Sbjct: 404 LAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHEAVFCARP---------AEDVYQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y +VI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ AY+ D S++FL L+ D LL F +G W+E A+ L E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEKA 619
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF + + LE
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDSLSQKLEGKTP 679
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
++ ++ + W N Y E+ GD + T++
Sbjct: 680 EKI-----DFYAVEEPWAKATNPYSAEAEGDCIETAK 711
>gi|299144715|ref|ZP_07037783.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298515206|gb|EFI39087.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 727
Score = 242 bits (617), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 172/577 (29%), Positives = 268/577 (46%), Gaps = 93/577 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRLYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-IDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIVSDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W ++ F+ PYIWC L NF GN + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E + + WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y + + T G Y +P + K ++ Y + L
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILHDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
L DY ++ W+GL+RDYY R +Y I+++ G D +L+D +E
Sbjct: 635 ------GGSLNDYASRSWAGLIRDYYAKRWEVYINTFIKAVGEGVEVDQKQLEDELKEIE 688
Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|423299508|ref|ZP_17277533.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
CL09T03C10]
gi|408473317|gb|EKJ91839.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
CL09T03C10]
Length = 727
Score = 242 bits (617), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 164/571 (28%), Positives = 268/571 (46%), Gaps = 80/571 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP+ WL QQ LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 194 MSNVDYWQSPLPKDWLVQQEELQKRILAREREFNMTPVLPAFAGHVPAELKKIYPNAKIY 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I + F+E+Q K YG T HIY D F+E P
Sbjct: 254 TMSQWGGFDKQYR---SHFIDPMDSLYSVIQKRFLEEQTKIYG-TDHIYGIDPFNEVDSP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY + S D +A WL W+F Y W P ++K+ L +VP KL++
Sbjct: 310 DWNEEFLSNVSRKIYESLHSVDPEAQWLQMTWMFYYAKDKWTPSRIKSFLRAVPQDKLIL 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG---------------------- 217
LD + + IW ++ +YG PYIWC L NF GN + G
Sbjct: 370 LDYYCDHTEIWKKTEGYYGQPYIWCYLGNFGGNTMLAGNLNDTYEKIHQVLAEGGQNIHG 429
Query: 218 ---ILDSIAFGPV-------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
L++ P+ +A T WI ++ R G++ PA+ AW L+ +Y
Sbjct: 430 LGVTLEAFDVNPMMYEFVFEQAWEGAQPTDEWIATWAKCRGGQTCPAVLKAWKELHEKIY 489
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ PS+ +AVL + + W +
Sbjct: 490 -------------------IAPSLCG---------------QAVLMNARPQLEGVQGWNT 515
Query: 328 TSEV-IRALELFIASGNELSASNT----YRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
E +L++ G+ L + + +D++++ RQ L ++ Y+ D
Sbjct: 516 FPEYKYDNKDLWVIWGSLLQVGSIDKPGHAFDVVNVGRQVLGNLFSDYRAQFTACYKRKD 575
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
G + ++R L+ D+D LLAC F +G W++ A+ EE++K YE NAR +T+
Sbjct: 576 VKGAQEWAQRMDALLLDVDRLLACSPLFSMGKWIQDARDCGTTEEEKKYYEENARCILTI 635
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
W ++ + L DY N+ W+GL + +Y R + ++ ++++ F K + ++
Sbjct: 636 W----GQKDTQLNDYANRSWAGLTKGFYRERWKRFTDSVLTAMQANRSFDAKKFHKDITD 691
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+W + V S DA+ + L+NKY
Sbjct: 692 FEYEWTLQHETFSVSSGEDAVKVANELWNKY 722
>gi|160884062|ref|ZP_02065065.1| hypothetical protein BACOVA_02038 [Bacteroides ovatus ATCC 8483]
gi|423291477|ref|ZP_17270325.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
CL02T12C04]
gi|156110404|gb|EDO12149.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392663477|gb|EIY57027.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
CL02T12C04]
Length = 727
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 174/577 (30%), Positives = 266/577 (46%), Gaps = 93/577 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W ++ F+ PYIWC L NF GN + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGERLENALINGGGNLKG 430
Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E + + WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDDKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y V P T G Y +P K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNRNSEKRTSNVYSNVEL-- 524
Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKVEFDRMVEAKDHQ 575
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 499
L DY ++ W+GL+ DYY R +Y I+++ G D +L+D +E
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAVGEGVEVDQKQLEDELKEIE 688
Query: 500 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 533
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|423282107|ref|ZP_17260992.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
615]
gi|404582594|gb|EKA87288.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
615]
Length = 718
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 171/577 (29%), Positives = 261/577 (45%), Gaps = 103/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
G N M W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKAP 679
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|60680169|ref|YP_210313.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC 9343]
gi|375357012|ref|YP_005109784.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
gi|383116930|ref|ZP_09937677.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
gi|60491603|emb|CAH06355.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC
9343]
gi|251947777|gb|EES88059.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
gi|301161693|emb|CBW21233.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
Length = 718
Score = 241 bits (616), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 171/577 (29%), Positives = 261/577 (45%), Gaps = 103/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
G N M W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|423269418|ref|ZP_17248390.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
CL05T00C42]
gi|423273021|ref|ZP_17251968.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
CL05T12C13]
gi|392701212|gb|EIY94372.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
CL05T00C42]
gi|392708585|gb|EIZ01692.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
CL05T12C13]
Length = 718
Score = 241 bits (616), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 171/577 (29%), Positives = 261/577 (45%), Gaps = 103/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
G N M W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKAP 679
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSTEAEGDCIEVAK 711
>gi|423280158|ref|ZP_17259071.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
610]
gi|404584494|gb|EKA89159.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
610]
Length = 718
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 174/577 (30%), Positives = 261/577 (45%), Gaps = 103/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ + Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLGA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDS------ 221
G L++LDL +E +P W S K YG +++CML N+ GN+ ++G +D+
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 222 ---------------------IAFGPVEA--------RTSENTTMAWINQYSVRRYGRSV 252
I PV R T W+ +Y RYG
Sbjct: 404 LAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHESVFSARP---------AEDVYQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y +VI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ AY+ D S++FL L+ D LL F +G W+E A+ L E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEKA 619
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF + + LE
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDSLSQKLEGKTP 679
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
++ ++ + W N Y E+ GD + T++
Sbjct: 680 EKI-----DFYAVEEPWTKATNPYSAEAEGDCIETAK 711
>gi|218258436|ref|ZP_03474815.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
DSM 18315]
gi|423342591|ref|ZP_17320305.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
CL02T12C29]
gi|218225494|gb|EEC98144.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
DSM 18315]
gi|409217508|gb|EKN10484.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
CL02T12C29]
Length = 718
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 170/576 (29%), Positives = 262/576 (45%), Gaps = 96/576 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQ++I+ R+ E G+ PV P +SG VP + ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQQQIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + L TDP F EI + ++ K YG+ ++ Y+ D F E +
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKANY-YSMDPFHEGGSV 298
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + G AI M+ + AVW+ Q W + P ++ ++ G L+
Sbjct: 299 AGVD----LDAAGKAIMQAMKKNNPKAVWVAQAWQANPRP--------QMIGNLEAGDLI 346
Query: 179 VLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSI--------- 222
LDLFAE +P W F +I+CML N+ GNI ++G + +
Sbjct: 347 ALDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNIGLHGKMKHVIDEFYKAKE 406
Query: 223 -AFGP------VEARTSENTTMA------------------WINQYSVRRYGRSVPAIQD 257
FG + SEN + W+ Y+V RYG+S P +QD
Sbjct: 407 SPFGTTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDKDQWLKAYTVARYGKSNPVVQD 466
Query: 258 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
AW +L +++YNC D T + + V A P TE YQ
Sbjct: 467 AWILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ--------------- 502
Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
SS+ +Y ++VIRA + ++ ++ +N + YDL+D+ RQA+A+ L ++
Sbjct: 503 VSSWSEMKDYYDPNDVIRAAAMMVSVSDQFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVV 561
Query: 376 EA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEW 434
EA + D S RFL L+ D LLA F +G W+ A+ L E++ YEW
Sbjct: 562 EAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGNTSEEKDLYEW 621
Query: 435 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 494
NAR QIT W + + LRDY ++ W+G+L+D+Y R +F Y L+ +
Sbjct: 622 NARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYYMRWKTWFDYQTRLLDGKKTAAI- 680
Query: 495 DWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
++ + W N Y E GD + T Q ++
Sbjct: 681 ----DFYAIEEPWTKQTNPYSNEPEGDCIPTVQRIF 712
>gi|265765312|ref|ZP_06093587.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
gi|263254696|gb|EEZ26130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
Length = 718
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 171/577 (29%), Positives = 261/577 (45%), Gaps = 103/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
G N M W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 FYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKAP 679
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|261880010|ref|ZP_06006437.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
gi|270333326|gb|EFA44112.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
Length = 719
Score = 241 bits (614), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 174/572 (30%), Positives = 258/572 (45%), Gaps = 88/572 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP + +Q LQ+KIL R L M PVLPAFSG+VP ++ ++P + I
Sbjct: 195 MANIDKWKGPLPYHTVVEQRDLQQKILARERSLNMTPVLPAFSGHVPGQIKQLYPESNIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + SD C Y + DPLF +I R ++E+Q YG T HIY D F+E PP
Sbjct: 255 HLGRWAAF-SDQYRC--YFMSPQDPLFAKIQRMYLEEQRAIYG-TDHIYGIDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
P+Y+ + IY + D A WL WLF + W P ++KAL+ V GK+V+
Sbjct: 311 SWDPDYLFQISKGIYQTLAHVDPKAEWLQMSWLFYHKKKKWTPERVKALITGVETGKMVL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------ILDS-IAFG------ 225
LD F + IW + +FYG PYIWC L NF GN + G LDS + G
Sbjct: 371 LDYFCDRNEIWKMTDKFYGQPYIWCYLGNFGGNTTVAGNVKACGAKLDSTLTLGGKNLQG 430
Query: 226 -------------PVE--------ARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
P E +SEN WI+ + G + P+ + AW +LYH
Sbjct: 431 VGLTLEGFDVCQFPYEYILDKVWSGNSSENQ---WIDALADSHVGYASPSFRKAWQLLYH 487
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
V+ + G+ P P + S+ ++ + H+
Sbjct: 488 DVFVQSAGSNG-------ILPCYRPELNSL-----------------------NWHYTHV 517
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN---IIEAYQLN 381
Y ++I A +L + S + DLI RQ L NE + AY
Sbjct: 518 DYDRQKLIEAWKLMQHDAD--SKRTAAQLDLIHYGRQVL---GNEFLTHKQLFDSAYAHC 572
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
D G+ + ++ D+D L A H L W++ A+Q+A + YE NAR+ IT
Sbjct: 573 DLAGMMAQAASMRHIMLDIDTLTAYHPRCTLAGWIDGARQMAPDSVCADYYEDNARSLIT 632
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 501
W L DY K W+GL+ DYY R YF + I ++ + F + + +E
Sbjct: 633 TW-------GGKLNDYACKGWAGLMSDYYLTRWERYFAHAINAVRAHRKFDQQAYDKEIA 685
Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ W + R++ VE++ + + + KY
Sbjct: 686 RFELSWASHRDIPRVETHESLALYCKKIIQKY 717
>gi|281200617|gb|EFA74835.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
Length = 688
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 162/528 (30%), Positives = 248/528 (46%), Gaps = 72/528 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ W G L W+ Q LQ +IL R+ + GM VLP F+G+VP AL+ +P A IT
Sbjct: 213 MGNVNEWAGNLTLGWMADQRDLQIQILTRMRQFGMQAVLPGFAGHVPEALETHYPKANIT 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG W + TY L+ DPLF +I +AF+ Q + YG T H YN D F+E PP
Sbjct: 273 QLGGWGTFSG------TYYLNPDDPLFSKIAQAFVITQNQLYG-TDHFYNFDPFNELEPP 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
Y+ + ++++ + + D +W++QGW DP FW PPQ +A L+ VP+GK++V
Sbjct: 326 SSDLTYLKNCSQSMFNNLIAADPQGIWVLQGWFLVDDPEFWLPPQTEAFLSGVPIGKMIV 385
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
LDL+++V P W+++ +YG +IWCMLHNF G MYG + I+ P+EAR+ +
Sbjct: 386 LDLWSDVIPAWNSTNYYYGHNWIWCMLHNFGGRSGMYGKIPFISTNPIEARSLSPNMVG- 444
Query: 240 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 299
G + AI+ NV+ VY+ ++ PD+ + +Y
Sbjct: 445 --------TGLTPEAIEQ--NVI---VYDLMSEMAWRSTP-----PDLKEWVDQYVTRRY 486
Query: 300 QNYGKP---VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 356
+G P ++ ++ + T S+ DL
Sbjct: 487 GKHGLPFLSINDTSITNTSTFSF----------------------------------DLT 512
Query: 357 DLTRQALAK--YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
++T QAL NEL LN A+ N + S L +++D+ + + + L+G
Sbjct: 513 EITTQALINLFMTNELQLN--SAFLNNSLEEFNKYSEALLSIIQDVYTIASTQEMLLVGH 570
Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
W A+ L E YE NAR QIT+W S + DY K W GL D+Y R
Sbjct: 571 WTARARALTPANESTNLYEMNARNQITLW----GPTYSDVHDYAYKLWGGLTEDFYLARW 626
Query: 475 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDA 522
++ K + SL S F ++ + W YP G++
Sbjct: 627 TLFVKELQYSLTSSQPFNSTLFQTNCEAVEEVWNLQTYPYPTIPTGNS 674
>gi|391338146|ref|XP_003743422.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Metaseiulus
occidentalis]
Length = 665
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 151/502 (30%), Positives = 241/502 (48%), Gaps = 77/502 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGPLP SW QQ +LQK IL R+ + GM PV+P F+G VP A + + P+ +
Sbjct: 188 MGNLRGFGGPLPSSWQLQQQLLQKMILRRMRDFGMTPVVPGFNGFVPRAFERLHPAVSWS 247
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + + + L T+ F+ + +I YG + H+Y+ D F+E TP
Sbjct: 248 RASRWNNFPDE--YAMLTFLAPTESFFLNVSSLYITMYRSIYG-SDHLYSVDLFNEETPD 304
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ P ++ + + +Y + D +W+MQGWLF + +W ++KA L PLGK++V
Sbjct: 305 TNDPAALAEMSSNVYESIAKADPKGIWVMQGWLFVHGGDYWNHDRVKAFLGGPPLGKMIV 364
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR--------- 230
LDLF+E P + ++G P+IWCMLHN+ G ++G L+ I P+ R
Sbjct: 365 LDLFSEQSPQFPRFSNYFGQPFIWCMLHNYGGVSGLFGNLEWINSEPLNVRRSVPNMIGI 424
Query: 231 ------------------------TSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
+SEN ++ W+ Y RYG S P +++AW +L +V
Sbjct: 425 GIAPEGTGQNEVIYEFMAENSYRDSSENVSL-WLQNYVGARYGLSDPHLENAWELLRKSV 483
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y S+T +N+G +L P +WY
Sbjct: 484 Y-------------------------SLTSKSIENHGN-----YILTHRPKLNSTPLIWY 513
Query: 327 STSEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
+ S+VI A I EL + DL+D+ RQAL ++ +L ++ ++ N
Sbjct: 514 NGSDVIGAATELIRGATLHRELCHERLFHQDLVDVVRQALQVRVSDEYLQMMSHFKANSL 573
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ-NEEQEKQYEWNARTQITM 442
+ SRR L + +D +L+ FLLG WL +++ A + + + Q+E+NAR QIT
Sbjct: 574 IDFEEHSRRLLHCIRVLDKVLSTDPNFLLGSWLRDSRESAGLDRDLQDQFEFNARNQITR 633
Query: 443 WFDNTQEEASLLRDYGNKYWSG 464
W N + + DY +K W+G
Sbjct: 634 WGPNGE-----IVDYASKMWNG 650
>gi|393782608|ref|ZP_10370791.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
CL02T12C01]
gi|392672835|gb|EIY66301.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
CL02T12C01]
Length = 761
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 161/525 (30%), Positives = 240/525 (45%), Gaps = 84/525 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+S +D+ L KKI+ R ELGM P+ FSG VP L++ +P+A I
Sbjct: 199 MQNLQSYGGPLPKSVIDRHAALGKKIIARQLELGMQPIQQGFSGYVPRELKDKYPTANIN 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q +W K + LD TD LF +GR F+E+Q + +G +Y D F E+ PP
Sbjct: 259 QQRSWCGFKGAAQ------LDPTDSLFTRMGRVFLEEQARLFG-AHGVYAADPFHESVPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD+PEY+ ++G I+ + D + W MQ W +A++ +VP L++L
Sbjct: 312 VDTPEYLKAVGETIHRLFREFDPQSTWAMQSWSLR----------EAIVKAVPKEALLIL 361
Query: 181 DLFAEVKPIWSTSK-QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
DL STSK +F+G P + LHNF G I M+G L +A N +
Sbjct: 362 DLRGS-----STSKAEFWGYPTVVGNLHNFGGRINMHGDLALLASNQYSKAKRLNPAVCG 416
Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
AW+ QY+ RRYG PA Q AW +L
Sbjct: 417 SGLFMEAIEQNPVYYELAFEMPCHPDSIDLRAWLKQYATRRYGAFSPATQKAWMLLLEGP 476
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y T+K+ ++ A P +D V K YD P L
Sbjct: 477 YRQGTNGTEKS-SIVAARPALD-----------------VKKSGPNAGLEIPYD-PAL-- 515
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+IRA L + ++LSAS YR+DL+D+ RQ + + EA++ D
Sbjct: 516 ----IIRAQSLLLEDADKLSASRPYRFDLVDVQRQMMTNLGQLIHRKAAEAFRSKDREAF 571
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
S RFL ++ DMD LL + WL A+ + EE++ Q E +A + +T+W +
Sbjct: 572 TLHSGRFLGMLADMDTLLRTRSEYSFDRWLTEARSWGETEEEKNQMERDATSLVTIWGAD 631
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
+ DY + W+GL+ YY PR ++ + + L+ G +
Sbjct: 632 GDPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQQHLDEGTSY 673
>gi|295085509|emb|CBK67032.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 716
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 171/574 (29%), Positives = 260/574 (45%), Gaps = 87/574 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 184 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 243
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 244 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 299
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 300 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 359
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD E +W ++ F+ PYIWC L NF GN + G
Sbjct: 360 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 419
Query: 219 -------LDSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD + F P E + + WI + R G +++DAW L++ +
Sbjct: 420 IGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 478
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y V P T G Y +P + K ++ Y + L
Sbjct: 479 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 513
Query: 327 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 514 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 564
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 565 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 623
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
L DY ++ W+GL+ DYY R +Y I+++ K E ++
Sbjct: 624 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAVGEDVEVDQKQLEDELKEIE 677
Query: 505 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 533
W N + V S D L++ S +L++KY
Sbjct: 678 EGWVNATDRKDVRKDVHSTTDGLLSFSTFLFSKY 711
>gi|153808241|ref|ZP_01960909.1| hypothetical protein BACCAC_02529 [Bacteroides caccae ATCC 43185]
gi|423219048|ref|ZP_17205544.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
CL03T12C61]
gi|149129144|gb|EDM20360.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
gi|392625814|gb|EIY19870.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
CL03T12C61]
Length = 752
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 169/534 (31%), Positives = 246/534 (46%), Gaps = 87/534 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ ++L KKI+ R ELGM P+ FSG VP L++ +P AKI
Sbjct: 193 MQNLQSYGGPLPKSWIDKHIILAKKIIDRERELGMTPIQQGFSGYVPRELKDKYPEAKI- 251
Query: 61 QLGNWFSVKSDPRWC---CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF +GR F+E++ K YG T IY D F E+
Sbjct: 252 --------RLQPGWCGFKGAGQLDPTDALFATLGRDFLEEEKKLYG-TYGIYAADPFHES 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPV++PEY+S++G AIY ++ D A W MQ W R P +KA VP L
Sbjct: 303 APPVNTPEYLSAVGHAIYKLIKDFDPKAKWAMQAWSL------REPIVKA----VPQNDL 352
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------------------- 217
++LDL E K F+G P + LHNF G I M+G
Sbjct: 353 IILDLNGEK---IKGRKGFWGYPAVEGNLHNFGGRINMHGDLRLLASNQYMTALKQYPNV 409
Query: 218 -----ILDSIAFGPV--------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
+++I PV E W+ QY+ RRYG P+ Q A L
Sbjct: 410 CGSGLFMEAIEQNPVYYDLAFEMPLHKGEVAIEEWLKQYANRRYGAVSPSAQQAMICLLE 469
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
Y T+ R I+A P++ G G P
Sbjct: 470 GPYRPGTNGTE--RSSIIA---ARPALNVKKSGPNAGLGIP------------------- 505
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
YS VI+A L + ++L S YR+D+ID+ RQ + + EA+ D
Sbjct: 506 -YSPLLVIQAEGLLLKDADKLKNSEPYRFDVIDVQRQMMTNMGQVIHKRAAEAFLNRDKE 564
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
S+RFL+++ED+D LL F WL SA+ EE++ E++A + +T+W
Sbjct: 565 AFALHSKRFLQMLEDVDELLRTRPEFNFDRWLTSARSWGDTEEEKNLLEYDATSLVTIW- 623
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 498
+ S+ DY + W+GL++ YY PR ++ + E L++G + + R+
Sbjct: 624 -GADGDPSIF-DYSWREWTGLIKGYYLPRWTKFYAMLQEHLDNGTTYSEEGLRQ 675
>gi|452988463|gb|EME88218.1| glycoside hydrolase family 89 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 772
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 155/545 (28%), Positives = 263/545 (48%), Gaps = 85/545 (15%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LPQSW+D Q L KKI+ R+ ELGM PVLP F+G VP + ++P+A
Sbjct: 202 NIQGSWGGDLPQSWIDHQFELNKKIVARMVELGMTPVLPCFTGFVPTQISRLYPNASFVN 261
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W +++ + L+ DPLF + ++FI +Q++ YG S IY D ++EN P
Sbjct: 262 GSRWNGFQAE--YTNVTFLEPFDPLFTTLQKSFISKQIEAYGNVSSIYTLDQYNENDPFS 319
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
Y+ ++ + +++ D +A+W +QGWLF S FW +++A L V +++L
Sbjct: 320 GELAYLKNVTSNTIKSLKAADPEAIWFIQGWLFYSSADFWTDERVEAYLGGVANEDMLIL 379
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DLF+E +P W + ++G P+IWC LH++ GN ++G ++++ PV+A ++ +TM
Sbjct: 380 DLFSESQPQWQRTNSYFGKPWIWCQLHDYGGNQGLHGQVENVTINPVQALANKTSTMVGM 439
Query: 238 --------------------AW----------INQYSVRRY-GRSVPA-IQDAWNVLYHT 265
AW + + RY G +P+ + AW+V+ T
Sbjct: 440 GSTMEGQEGNEIIYDILLDQAWSKEPIDSDSYFHDWVTSRYAGSKLPSGLYTAWDVMRQT 499
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VYN TD + V + +++P+ + + + + VL S +
Sbjct: 500 VYNSTD--IEAAEAVTKSIFELEPNTTGLLNRRGHHSTLILYDPNVLVSAWN-------- 549
Query: 326 YSTSEVIRALELFIASGNELSA--SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
+L+ AS +++ Y++DL+D TRQ LA L+ + + +
Sbjct: 550 ----------DLYNASNDDIQLWDVKAYQFDLVDTTRQVLANAFYPLYTDFVHSAN-KSV 598
Query: 384 HGVF------QLSRRFLELVEDMDGLLACHDG--FLLGPWLESAKQLAQNEEQEKQ---- 431
G + + + + L++D+D +L F L W+ESA+ A E+
Sbjct: 599 QGTYSPTKAEEKGKEMIMLLKDLDSVLEASGNAHFKLSSWIESARLWAPAEDYADDKNTT 658
Query: 432 ------YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL 485
YE+ AR QIT+W N + + DY +K W+GL+R YY PR + + + S
Sbjct: 659 AKIADFYEYTARNQITLWGPNGE-----ISDYASKQWAGLIRSYYVPRWQRFVDFTLNST 713
Query: 486 ESGDG 490
S +G
Sbjct: 714 TSMNG 718
>gi|336408181|ref|ZP_08588675.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
gi|335939481|gb|EGN01355.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
Length = 718
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 170/577 (29%), Positives = 260/577 (45%), Gaps = 103/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ P+LP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPMLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ +
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIEDLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSI----- 222
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 223 -------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGRSV 252
G N M W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679
Query: 491 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|392566857|gb|EIW60032.1| alpha-N-acetylglucosaminidase [Trametes versicolor FP-101664 SS1]
Length = 747
Score = 239 bits (610), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 171/554 (30%), Positives = 273/554 (49%), Gaps = 73/554 (13%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP +W+D Q LQK++L R+ ELGM PV+P+F+G VP AL + P+A I
Sbjct: 193 FGNIQGSWGGELPTAWVDDQFALQKRLLPRMVELGMTPVMPSFTGFVPRALAALHPNASI 252
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGR-TSHIYNCDTFDENT 118
W + L+ DPLF + ++FI +Q YG SH+Y D ++EN
Sbjct: 253 VTGSQWSGFPTS--LTNDSFLEPFDPLFATLQQSFIAKQQAAYGADISHVYTLDQYNEND 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-K 176
P +Y+ ++ A ++ +++ D AVWLMQGWLF D FW ++ A L VP
Sbjct: 311 PFSGDLDYLRNVSAGTFASLRAADPAAVWLMQGWLFFSDAVFWTDDRVAAYLGGVPGNDS 370
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
++VLDL++E +P W+ + + G ++WC LH++ GNI M G LD + P+ A +S ++
Sbjct: 371 MIVLDLYSEAQPQWNRTASYSGKQWVWCELHDYGGNIGMEGNLDVLTHAPLTALSSPGSS 430
Query: 237 M-----------------------AW----------INQYSVRRYG-RSVP-AIQDAWNV 261
M AW ++ + RRY + +P A QDAW +
Sbjct: 431 MKGVGLTMEGQEGNEIVYGVLLDQAWSATSLNTSSYVSSWVSRRYPVKPLPKAAQDAWRI 490
Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
L TVYN D T + I ++ P++ +T + P S YD
Sbjct: 491 LSTTVYNNQDPNT---QATIKGIYELAPALTGMTNRIGHH---PTSIP---------YD- 534
Query: 322 PHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
+ + ++ AL+L + A LSA + YD++D+ RQ L+ L+ +I+ Y
Sbjct: 535 -----TDATMLSALKLLLEARAQHPTLSAVPEFVYDVVDVARQLLSNRFIGLYDTLIQTY 589
Query: 379 QLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWN 435
A V + L L+ D+D LL+ ++ FLL W+ A++ A Y E+N
Sbjct: 590 NSTSSTAQSVSAAGQPLLALLTDLDALLSTNEHFLLSSWIADARKWADGSASYGAYLEYN 649
Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
AR Q+T+W + + + DY +K W+GL+ YY PR A + Y+ E+ +G +
Sbjct: 650 ARNQVTLWGPDGE-----INDYASKAWAGLVGTYYKPRWAAFVDYLAETKGTGQAYNATA 704
Query: 496 WRREWIKLTNDWQN 509
+ + + +W N
Sbjct: 705 VKSTMLAIGQEWGN 718
>gi|379334158|gb|AFD03088.1| putative alpha-N-acetylglucosaminidase [uncultured bacterium 8]
Length = 726
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 171/561 (30%), Positives = 261/561 (46%), Gaps = 73/561 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+++L GW GPLPQSW+D+ L ++IL R LGM PVL FSG+VP L A+ T
Sbjct: 181 LASLDGWSGPLPQSWIDRHADLGRRILARERALGMRPVLQGFSGHVPQELIAER-GARST 239
Query: 61 QLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
L W F V +LD DPLF E G + +Q + +G T H+Y D F E TP
Sbjct: 240 TLPWWDFEVG---------MLDPRDPLFEEFGTTLLTEQTRLFG-TDHLYAADPFIETTP 289
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLV 178
PV P ++ + A++ M + D A W++Q W FSY +W P + A L+++P ++
Sbjct: 290 PVSDPADLAQVARAVHGVMTAVDDRATWVLQAWPFSYRSRYWTPERTGAFLDAIPDDGML 349
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART------- 231
+LDL+AE +P+W + + P++WCMLH+ G +YG LD IA G A+
Sbjct: 350 ILDLWAEHRPVWQRTDGYRKKPWVWCMLHSLGGRPGLYGKLDEIATGAARAQADARGGSL 409
Query: 232 ---------------------------SENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
S + AW+ ++ RYGR+ P + AW++L+
Sbjct: 410 SGIGASMEAFGGDPVLYELLADVAWQGSVDDVRAWLETWTRARYGRATPGLLRAWDLLHD 469
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
+VY ++G VIV P + EG ++ + V ++ S D P
Sbjct: 470 SVY-ASEGPGPPG-SVIVGRPTL--------EGDLRH------ELPVHLADPPSPDVP-- 511
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
+ L + E SA R DL D+T Q L A E +A DA
Sbjct: 512 ---PALAEAWALLADEATQEDSAGPLGR-DLCDVTAQVLTHVACERQWRAADAALARDAD 567
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
G + +R L+ +ED+D LLA L WL A+ A + YE +AR +T+W
Sbjct: 568 GFQRAARALLDTIEDLDTLLATRPEHRLDGWLADARGWATTPAEADLYETDARRLLTLWG 627
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
S L DY ++W+GL+ +Y PR +++++ +LE+G +R +++ +
Sbjct: 628 HTR----SKLHDYSGRHWAGLVGTFYLPRWRSWYEHIARALETGSPYRAEEFEASLLAQE 683
Query: 505 NDWQNGRNVYPVESNGDALIT 525
W RN G A T
Sbjct: 684 ERWVADRNGPTTPEAGTAGAT 704
>gi|238506383|ref|XP_002384393.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
NRRL3357]
gi|220689106|gb|EED45457.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
NRRL3357]
Length = 669
Score = 239 bits (609), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 157/534 (29%), Positives = 257/534 (48%), Gaps = 84/534 (15%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 109 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMRPVLPAFPGFVPPAIKRVRPHAT 168
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L D F ++ ++ I +Q++ +G +H+Y D F+E
Sbjct: 169 VVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISRQMRAFGNITHVYALDQFNEIN 226
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW ++ A L+ V
Sbjct: 227 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDSNRISAYLSGVERND 285
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA ++
Sbjct: 286 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEALNKSDS 345
Query: 236 --------------------------------TMAWINQYSVRRYGR--SVP-AIQDAWN 260
T A+ + RY SVP + AW+
Sbjct: 346 LVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQSWVRSRYSGNLSVPNELYTAWD 405
Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS--------KEAVL 312
+L TVYN T+ T V + ++ P I + G+ +Y P S E +
Sbjct: 406 LLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GRVGHYPTPTSINYDPMVLNEVLS 461
Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
++ P LW++ + Y YD++D+TRQ + ++
Sbjct: 462 LFMNATRKEPSLWHNPA---------------------YEYDMVDITRQLMGNAFVNVYS 500
Query: 373 NIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
+I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E +
Sbjct: 501 VLITSWKSETENRTTKVTSHSERLLNLLSAIDKVLSCNENFSLATWISSARDWGNTTETK 560
Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
+E+NAR QIT+W + + DY +K W+GL+ YY PR +I+ Y+ E
Sbjct: 561 DFFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGE 609
>gi|423241433|ref|ZP_17222546.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
CL03T12C01]
gi|392641326|gb|EIY35103.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
CL03T12C01]
Length = 754
Score = 238 bits (607), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 176/597 (29%), Positives = 268/597 (44%), Gaps = 112/597 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F +I F E+ K YG TS Y+
Sbjct: 252 EKKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTQIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E ++ G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHEAKSLPAGLDF-GKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414
Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
+D + P+ A R + T WI QY
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474
Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
RYG +I+ AW +L + +YNC G + SI G+
Sbjct: 475 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 516
Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572
Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632
Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 633 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692
Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
L+ + D + +W L W +N+Y + GD + ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNIYAASAEGDCIEVAK 746
>gi|265753065|ref|ZP_06088634.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263236251|gb|EEZ21746.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 750
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 177/597 (29%), Positives = 266/597 (44%), Gaps = 112/597 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 188 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 247
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 248 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 306
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 307 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 358
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 359 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 410
Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
+D + P+ A R + T WI QY
Sbjct: 411 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 470
Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
RYG +IQ AW +L + +YNC G + SI G+
Sbjct: 471 RARYGTDDESIQQAWQILTNGIYNCPAGNNQQG---------PHESIFC---------GR 512
Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 513 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 568
Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 569 DRARIVYNYAVADFKSFDKKNYNTHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 628
Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 629 TPEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 688
Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
L+ + D + +W L W +N Y + GD + ++
Sbjct: 689 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 742
>gi|212695333|ref|ZP_03303461.1| hypothetical protein BACDOR_04880 [Bacteroides dorei DSM 17855]
gi|212662112|gb|EEB22686.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
Length = 754
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 177/597 (29%), Positives = 265/597 (44%), Gaps = 112/597 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 252 EEKTAGDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414
Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
+D + P+ A R + T WI QY
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474
Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
RYG +IQ AW +L + +YNC G + SI G+
Sbjct: 475 RARYGTDDESIQQAWQILTNGIYNCPAGNNQQG---------PHESIFC---------GR 516
Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572
Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYNTHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632
Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 633 TPEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692
Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
L+ + D + +W L W +N Y + GD + ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 746
>gi|345513909|ref|ZP_08793424.1| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
gi|345456132|gb|EEO45798.2| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
Length = 754
Score = 236 bits (602), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 176/597 (29%), Positives = 266/597 (44%), Gaps = 112/597 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 252 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414
Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
+D + P+ A R + T WI QY
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474
Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
RYG +I+ AW +L + +YNC G + SI G+
Sbjct: 475 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 516
Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572
Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632
Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 633 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692
Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
L+ + D + +W L W +N Y + GD + ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 746
>gi|237711645|ref|ZP_04542126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
gi|229454340|gb|EEO60061.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
Length = 732
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 176/597 (29%), Positives = 266/597 (44%), Gaps = 112/597 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 170 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 229
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 230 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 288
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 289 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 340
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 341 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 392
Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
+D + P+ A R + T WI QY
Sbjct: 393 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 452
Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
RYG +I+ AW +L + +YNC G + SI G+
Sbjct: 453 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 494
Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 495 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 550
Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 551 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 610
Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 611 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 670
Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
L+ + D + +W L W +N Y + GD + ++
Sbjct: 671 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 724
>gi|404487206|ref|ZP_11022393.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
YIT 11860]
gi|404335702|gb|EJZ62171.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
YIT 11860]
Length = 731
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 171/578 (29%), Positives = 259/578 (44%), Gaps = 96/578 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ Q ++D+Q LQ+K+L R+ EL M PV F G VP +L+ FP A I
Sbjct: 196 MGNLEGFGGPVSQKFIDRQTDLQQKMLRRMRELDMAPVFQGFYGMVPNSLKEKFPEANIK 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W + + LD DPLF +I + E+Q K +G+ + + D F E +
Sbjct: 256 EQGEWQTYQRPA------FLDPNDPLFDKIADIYYEEQEKLFGKAVY-FAGDPFHEGGQS 308
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D + + I M+ +AVW++QGW R P M+ LL + G+ +
Sbjct: 309 EGID----VKAAAKKILKAMRRKTPEAVWIIQGWQ-------RNP-MRDLLEGLEHGEAI 356
Query: 179 VLDLFAEVKPIWSTSKQ--FYGVP------YIWCMLHNFAGNIEMYGILDSIAFGPVEA- 229
+LDL A +P W K FY +IWC L NF G ++G + S A G V A
Sbjct: 357 ILDLMACERPQWGGIKNSLFYKAEGHMHHDWIWCALPNFGGKTGLHGKMSSYASGVVFAK 416
Query: 230 ---------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ 256
R W+NQY+ RYG++ P
Sbjct: 417 NHPLGKNLCGIGTAPEGIGTIPVVYDMVYDMAWREDSIDIKDWVNQYTQYRYGKADPNCN 476
Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
AW +L T+Y C + I A P +
Sbjct: 477 RAWEILSKTIYECHNEIGGPVESYICARP------------------------SDTIKHA 512
Query: 317 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
SS+ ++Y +E++ A E +E + S TY+YDL+DLTRQ L YA L +
Sbjct: 513 SSWGTAEIFYDPAEIVTAWECMYNVRHEFAQSETYQYDLVDLTRQVLGDYAKYLHKQAVN 572
Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
A+ ND G S +FL L+ D D LL+ F +G W+ A+ A ++++++ NA
Sbjct: 573 AFYRNDLKGFQTYSSKFLVLIRDEDKLLSTRKEFNVGTWINQARNAACTPQEQERFVANA 632
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
+ QIT W ++ S L DY K WSGL+RD Y PR + Y + +L G+ + D
Sbjct: 633 KRQITTWTNHD----SKLHDYALKEWSGLMRDMYLPRWKAWVDYKL-ALLRGETAQEPD- 686
Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
+ ++ +W + Y S G+A+ + +Y KY
Sbjct: 687 ---YFQIEKNWVDSDTRYDSTSTGNAISAVEEIYKKYF 721
>gi|423230938|ref|ZP_17217342.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
CL02T00C15]
gi|423244649|ref|ZP_17225724.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
CL02T12C06]
gi|392630058|gb|EIY24060.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
CL02T00C15]
gi|392641498|gb|EIY35274.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
CL02T12C06]
Length = 754
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 176/597 (29%), Positives = 266/597 (44%), Gaps = 112/597 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 252 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414
Query: 219 LDSIAFG-------PVEA---------------------------RTSENTTMAWINQYS 244
+D + P+ A R + T WI QY
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474
Query: 245 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 304
RYG +I+ AW +L + +YNC G + SI G+
Sbjct: 475 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 516
Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572
Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632
Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 633 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692
Query: 485 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 527
L+ + D + +W L W +N Y + GD + ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 746
>gi|261199246|ref|XP_002626024.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
gi|239594232|gb|EEQ76813.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
Length = 752
Score = 235 bits (599), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 151/529 (28%), Positives = 257/529 (48%), Gaps = 73/529 (13%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
NL G WGG P W D Q LQKKIL R+ ELGM P+LPAF G VP A+ V P A+
Sbjct: 201 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 260
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + +P++ T L DP + + ++FI + ++ YG +H Y D F+E
Sbjct: 261 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 318
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNS-VPLGK 176
P PE++ + ++S D +A W+MQGWLF + +W +++A L++
Sbjct: 319 PSSGDPEFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 378
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDLFAE P+W +K F+G ++WC + F GN +YG + +I GP +A
Sbjct: 379 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAQAMAQHPNM 438
Query: 237 MA--------------------------------WINQYSVRRY---GRSVP-AIQDAWN 260
+ + + + RRY GR+VP + +AW
Sbjct: 439 VGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHGRTVPNELYEAWQ 498
Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
+L + YN T+ +V P + ++ + + +K +L E
Sbjct: 499 LLRLSAYNNTN---------LVDAPLLPHALFAASPSIN-------AKMPMLFIEG---- 538
Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
L Y +++++A L I L ++Y+YD++D+TRQ L+ + ++ Y+
Sbjct: 539 ---LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKVKYKG 593
Query: 381 NDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ-YEWNART 438
VF + + L +++ +D +L+ ++ F L W+ +A+ A ++ + +E NAR
Sbjct: 594 GAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDDSEAADFFEHNARN 653
Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
QIT+W E +L DY K W+GL+ YY PR ++ +Y+ ++ S
Sbjct: 654 QITIW----GSEVGVLDDYAQKQWAGLVSGYYTPRWRMFLEYLKDTPAS 698
>gi|392584963|gb|EIW74305.1| glycoside hydrolase family 89 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 772
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 156/534 (29%), Positives = 255/534 (47%), Gaps = 93/534 (17%)
Query: 8 GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
GG LPQ W+D QL LQK+I+ RI ELGM PVLPAF G VP A+ +FP+A I +
Sbjct: 221 GGKLPQEWMDAQLALQKQIVPRIVELGMTPVLPAFPGFVPPAMHTLFPNASIVNGSEYPG 280
Query: 68 VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
+ + ++ L DPL+ ++ +F+ +Q + G +H++ D ++EN+P Y+
Sbjct: 281 IPA--QYSNDSFLAPFDPLYAQLQSSFLAKQTEALGNVTHVWTIDQYNENSPYSGDLTYL 338
Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 186
+++ + ++ +++ D DA+WLMQGWLF D PFW ++ A L+ +P +++LDLF++V
Sbjct: 339 ANIANSTFASLRAHDPDAIWLMQGWLFFADEPFWTSDRVDAYLDQIPNDGMIILDLFSDV 398
Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--------- 237
P W + G ++WC +H+F GN+ + G + GPV+A S N++M
Sbjct: 399 YPQWQRLDSYRGKSWVWCEVHDFGGNMGLEGNFSVVTNGPVDALNSPNSSMKGVGLAMEG 458
Query: 238 ------------------------AWINQYSVRRY-------------GRSVPAIQ-DAW 259
A+ ++ RR+ S+PA +AW
Sbjct: 459 LEGNEIIYDVLLDQAWSAAPLDRDAYAKAWATRRFHLPTANSSTTTATNTSIPASAIEAW 518
Query: 260 NVLYHTVYNCTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
L TVY+ T+ GAT S+I + Y P S +
Sbjct: 519 QTLASTVYSSTNPNVWGATK--------------SLIELAPSLGGMYSAPSSTIIFYDTN 564
Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFL 372
TS ++ AL +A+G L A + +R D ID+ RQ LA + +
Sbjct: 565 TS-------------LVPALRGLVAAGTSAPALWALDEFRTDSIDVARQLLANRFADAYT 611
Query: 373 NIIEAYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA---QNEE 427
AY + + + + R +++++D+D LL H+ +LL + SA+ A +E
Sbjct: 612 ATTGAYNASGPGSAALNATAARMMQIIDDLDRLLMTHEPYLLSSRIASARAWAGDGGDEA 671
Query: 428 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
E+ AR+Q+T+W S+L DY +K W GL+ YY R + +YM
Sbjct: 672 YADYLEYEARSQVTLW----GPVPSVLNDYASKVWGGLVGTYYRQRWTAFVEYM 721
>gi|336371253|gb|EGN99592.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.3]
gi|336384013|gb|EGO25161.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 761
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 159/538 (29%), Positives = 260/538 (48%), Gaps = 80/538 (14%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP+ W++ Q VLQK+IL R+ ELGM PVLP+F+G VP A+ ++P+A I
Sbjct: 208 NIQGSWGGDLPEQWINDQFVLQKQILARMVELGMTPVLPSFTGFVPRAMHTLYPNASIVN 267
Query: 62 LGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W F+++ L+ DPLF + +F+ + YG SHIY D ++E P
Sbjct: 268 GSQWSTFTIQH----TNDSFLEPFDPLFSTLQTSFMTKYAAAYGNVSHIYTLDQYNEMMP 323
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KL 177
+ Y+SS+ +A ++ +++ D +AVW+MQGWLF Y FW +++A L VP +
Sbjct: 324 YSGNTSYLSSISSATFASLRATDPEAVWMMQGWLFYIYASFWTDERVEAYLGGVPGNDSM 383
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDLF+E P W ++G +IWC LH+F GN+ G +++ PV+A + TM
Sbjct: 384 IILDLFSEAYPQWQRLNSYFGKQWIWCELHDFGGNMGFEGNFENVTTQPVKALATPGNTM 443
Query: 238 -----------------------AW----------INQYSVRRYGRSVP----AIQDAWN 260
AW ++ ++ RRY +VP A +AW
Sbjct: 444 VGMGLTMEGQEGNEIMYDVLFDQAWSPTPINRTSYVSAWTSRRY--NVPNLPTAATEAWE 501
Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
+L TVYN D I ++++T + G P
Sbjct: 502 ILASTVYNNQDPLLQATIKSIFELEPAINGLVNLTVLQ----GIPTG------------- 544
Query: 321 HPHLWYST-SEVIRALELFIASGNELSASN---TYRYDLIDLTRQALAKYANELFLNIIE 376
L+Y T + ++ AL+ + + E SA + ++YD++ + RQ LA +L+ ++++
Sbjct: 545 ---LFYDTNTTIVPALQSLLQARQESSALDEVPEFQYDVVYIIRQLLANRFIDLYTSLVD 601
Query: 377 AYQLNDAHGVFQL--SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-E 433
Y + + L++D+D +L FLL W+ +A+ A + Y E
Sbjct: 602 TYNSTTSSSSDVSTAGAPLITLLKDVDSVLLTDTHFLLSNWISAARNWAHDNSTYAAYLE 661
Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
+NAR QIT+W + + DY +K W GL+ YY R + Y+ S +G +
Sbjct: 662 YNARNQITLWGPRGE-----VHDYASKQWGGLVGTYYVQRWEEFVSYLSGSKANGTAY 714
>gi|400595379|gb|EJP63180.1| alpha-N-acetylglucosaminidase [Beauveria bassiana ARSEF 2860]
Length = 761
Score = 234 bits (597), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 159/525 (30%), Positives = 250/525 (47%), Gaps = 56/525 (10%)
Query: 1 MSNLHG-WGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSA 57
N+HG WGG L W++QQ LQKKI+ R+ ELG+ PVLP F G VPAAL+ + P
Sbjct: 205 FGNIHGTWGGEGRLSAEWINQQFALQKKIVARMVELGITPVLPGFPGFVPAALKKLRPDV 264
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
I + W V + T L+ TD + E+ FI+ Q+KE+G +++Y D F+E
Sbjct: 265 NIAEAPVWVDVPRNN--TATAFLNPTDKTYAELQSLFIKNQIKEFGNVTNVYTVDQFNEI 322
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVP-LG 175
P +YI+ + ++ Y G+ + + A+WLMQGWLF S FW ++ A L P
Sbjct: 323 NPSSGDTKYITDVSSSTYKGITAANPAAIWLMQGWLFYSSQSFWTQQRVDAYLAGPPGQD 382
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDLF+E +P W ++ ++G P+IWC LH+F GN ++G + ++ ++N+
Sbjct: 383 DMIILDLFSESQPQWQRTRSYFGRPWIWCELHDFGGNQALHGKITNV---------TQNS 433
Query: 236 TMAWINQYSVRRYGRSVPA-----------IQDAW--------NVLYHTVYNCTDGATDK 276
A S+ YG + + AW N N A
Sbjct: 434 VQALKESGSIVGYGLTPEGYEGNEVVYDILLDQAWEGSPIDTANYFRAWARNRYSAAGII 493
Query: 277 NRDVIVAFPDV--------DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
DV A+ + D +I SV YQ + P K V + T Y P
Sbjct: 494 PEDVFTAWEQLRQHAYDVQDNAIPSVGVSVYQLF--PSLKGLV--NRTGHYPPPTALQYD 549
Query: 329 SEVIRAL-ELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
+V++ + LF S + L + D +D+TRQ L +++ +++ +Q
Sbjct: 550 PKVMKNIWHLFYNSTIDSPGLLQIPAFHLDFVDVTRQVLGNAFIDIYTDLVNQFQATANA 609
Query: 385 GVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
V Q L L +ED+D L ++ F WL SA+ Q+ +NAR+Q+T+W
Sbjct: 610 TVIQDLGNSMLSFIEDLDMALNTNEHFTFKKWLNSAESWGQSIGAPDAVAFNARSQVTVW 669
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
E+ L DY K WSG+++ YYG R I+ ++ + E G
Sbjct: 670 ----STESRALDDYAAKAWSGIVKSYYGERWRIFINSLVSAREQG 710
>gi|239615395|gb|EEQ92382.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ER-3]
Length = 829
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 151/530 (28%), Positives = 254/530 (47%), Gaps = 79/530 (14%)
Query: 3 NLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
NL G WGG P W D Q LQKKIL R+ ELGM P+LPAF G VP A+ V P A++
Sbjct: 223 NLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQVV 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W + +P++ T L DP + + ++FI + ++ YG +H Y D F+E P
Sbjct: 283 NASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMIPS 340
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNS-VPLGKLV 178
P+++ + ++S D +A W+MQGWLF + +W +++A L++ ++
Sbjct: 341 SGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRDML 400
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA 238
+LDLFAE P+W +K F+G ++WC + F GN +YG + +I GP EA +
Sbjct: 401 ILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAEAMAQHPNMVG 460
Query: 239 --------------------------------WINQYSVRRYG---RSVPA-IQDAWNVL 262
+ + + RRY R+VP+ + +AW +L
Sbjct: 461 VGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHERTVPSELYEAWQLL 520
Query: 263 YHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSETSSY 319
+ YN T+ A + A P ++ + + EG
Sbjct: 521 RLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG---------------------- 558
Query: 320 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
L Y +++++A L I L ++Y+YD++D+TRQ L+ + ++ Y+
Sbjct: 559 ----LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKVKYK 612
Query: 380 LNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ-YEWNAR 437
VF + + L +++ +D +L+ ++ F L W+ +A+ A +E + +E NAR
Sbjct: 613 GGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDESEAADFFEHNAR 672
Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
QIT+W E +L DY K W+GL+ YY PR ++ +Y+ ++ S
Sbjct: 673 NQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPRWRMFLEYLKDTPAS 718
>gi|393788286|ref|ZP_10376416.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
CL02T12C05]
gi|392655959|gb|EIY49600.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
CL02T12C05]
Length = 757
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 170/586 (29%), Positives = 264/586 (45%), Gaps = 103/586 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP++ +D+ L KKI+ R ELGM P+ FSG VP L+ +P+A I
Sbjct: 196 MQNLQSYGGPLPKTVIDKHAALGKKIISRQLELGMQPIQQGFSGYVPRELKEKYPTANIN 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q +W K + LD TD LF +GRAF+E+Q + +G +Y D F E+ PP
Sbjct: 256 QQRSWCGFKGAAQ------LDPTDSLFTRMGRAFLEEQARLFG-AHGVYAADPFHESAPP 308
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+D+PEY+ ++G I+ + D + W MQ W D ++ +VP L++L
Sbjct: 309 IDTPEYLKAVGERIHHLFRDFDPHSTWAMQSWSLRED----------IVKAVPKDALLIL 358
Query: 181 DLFAEVKPIWSTSKQ-FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
DL + STSK F+G + LHNF G I M+G L +A N +
Sbjct: 359 DLNGK-----STSKALFWGYSTVVGNLHNFGGRINMHGDLKLLASNQYSKAKRLNPAVCG 413
Query: 238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
AW+ QY+ RRYG PA Q+AW +L +
Sbjct: 414 SGLFMEAVEQNPVYYELAFEMPCHADSINLQAWLKQYATRRYGAFSPAAQEAWLLLLNGP 473
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y T+K+ ++ A P +D K A L+ + Y
Sbjct: 474 YRRGTNGTEKS-SIVAARPALDV--------------KKSGPNAALE----------IPY 508
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ VIRA L + ++LS S YR+D++D+ RQ + + EA++ D
Sbjct: 509 DPTLVIRAQSLLLKDIDKLSVSRPYRFDIVDVQRQLMTNLGQLIHRQAAEAFRKKDQCAF 568
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
S RFLE++ DMD LL + WL A+ +E++ E +A + +T+W +
Sbjct: 569 TLHSGRFLEMLADMDKLLRTRSEYSFDRWLTEARSWGDTDEEKNLMERDATSLVTIWGAD 628
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD----WRREWIK 502
+ DY + WSGL+ YY PR ++ + + L+ G + + RE +
Sbjct: 629 GDPR---IFDYSWREWSGLISGYYLPRWQKFYAMLQQHLDVGTSYEEAGLPLIYGREAFR 685
Query: 503 LTNDWQN-------------GRNVYPVESNGDALITSQWLYNKYLQ 535
ND+ N G+ P+ + GD +I + L++KYL+
Sbjct: 686 -ANDFYNGLAEWELAYVDTYGKARTPI-TEGDEIIMVKQLFDKYLK 729
>gi|345517325|ref|ZP_08796802.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
gi|345457718|gb|EET14396.2| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
Length = 754
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 175/603 (29%), Positives = 262/603 (43%), Gaps = 124/603 (20%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
M+NL GWGGP P SW QQ LQKKIL R+ E GM+PVLP +SG +P+
Sbjct: 192 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 251
Query: 49 ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
L N + + L W +L DP F +I F E+ K YG T
Sbjct: 252 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 304
Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
S Y+ D F E + + G AI M+ + AVW++QGW + +P RP M
Sbjct: 305 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 359
Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
KAL G L++LDLF+E +P+W G+P IW C+L NF GN
Sbjct: 360 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 408
Query: 213 IEMYGILDSIAFG-------PVEA---------------------------RTSENTTMA 238
+ ++G +D + P+ A R + T
Sbjct: 409 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 468
Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 298
WI QY RYG +I AW +L + +YNC G + SI
Sbjct: 469 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQG---------PHESIFC----- 514
Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 358
G+P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+
Sbjct: 515 ----GRP----SLNNFQASSWSKMCNYYDPTTTAEAARLMVSVAHKYRGNNNFEYDLVDI 566
Query: 359 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 418
TRQA+A A ++ + ++ D +R+FLEL+ D LL F +G W++
Sbjct: 567 TRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWIQQ 626
Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
A+ L E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y+
Sbjct: 627 ARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYW 686
Query: 479 KYMIESLE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 524
+ + + L+ + D + +W L W +N Y + GD +
Sbjct: 687 QVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIE 743
Query: 525 TSQ 527
++
Sbjct: 744 VAK 746
>gi|294777713|ref|ZP_06743164.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
gi|294448781|gb|EFG17330.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
Length = 752
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 175/603 (29%), Positives = 262/603 (43%), Gaps = 124/603 (20%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
M+NL GWGGP P SW QQ LQKKIL R+ E GM+PVLP +SG +P+
Sbjct: 190 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 249
Query: 49 ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
L N + + L W +L DP F +I F E+ K YG T
Sbjct: 250 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 302
Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
S Y+ D F E + + G AI M+ + AVW++QGW + +P RP M
Sbjct: 303 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 357
Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
KAL G L++LDLF+E +P+W G+P IW C+L NF GN
Sbjct: 358 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 406
Query: 213 IEMYGILDSIAFG-------PVEA---------------------------RTSENTTMA 238
+ ++G +D + P+ A R + T
Sbjct: 407 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 466
Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 298
WI QY RYG +I AW +L + +YNC G + SI
Sbjct: 467 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQG---------PHESIFC----- 512
Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 358
G+P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+
Sbjct: 513 ----GRP----SLNNFQASSWSKMCNYYDPTTTAEAARLMVSVAHKYRGNNNFEYDLVDI 564
Query: 359 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 418
TRQA+A A ++ + ++ D +R+FLEL+ D LL F +G W++
Sbjct: 565 TRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWIQQ 624
Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
A+ L E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y+
Sbjct: 625 ARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYW 684
Query: 479 KYMIESLE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 524
+ + + L+ + D + +W L W +N Y + GD +
Sbjct: 685 QVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIE 741
Query: 525 TSQ 527
++
Sbjct: 742 VAK 744
>gi|319640296|ref|ZP_07995021.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
gi|317388071|gb|EFV68925.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
Length = 752
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 175/603 (29%), Positives = 262/603 (43%), Gaps = 124/603 (20%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
M+NL GWGGP P SW QQ LQKKIL R+ E GM+PVLP +SG +P+
Sbjct: 190 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 249
Query: 49 ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
L N + + L W +L DP F +I F E+ K YG T
Sbjct: 250 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 302
Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
S Y+ D F E + + G AI M+ + AVW++QGW + +P RP M
Sbjct: 303 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 357
Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
KAL G L++LDLF+E +P+W G+P IW C+L NF GN
Sbjct: 358 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 406
Query: 213 IEMYGILDSIAFG-------PVEA---------------------------RTSENTTMA 238
+ ++G +D + P+ A R + T
Sbjct: 407 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 466
Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 298
WI QY RYG +I AW +L + +YNC G + SI
Sbjct: 467 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQG---------PHESIFC----- 512
Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 358
G+P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+
Sbjct: 513 ----GRP----SLNNFQASSWSKMCNYYDPTTTAEAARLMVSVAHKYRGNNNFEYDLVDI 564
Query: 359 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 418
TRQA+A A ++ + ++ D +R+FLEL+ D LL F +G W++
Sbjct: 565 TRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWIQQ 624
Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
A+ L E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y+
Sbjct: 625 ARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYW 684
Query: 479 KYMIESLE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 524
+ + + L+ + D + +W L W +N Y + GD +
Sbjct: 685 QVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIE 741
Query: 525 TSQ 527
++
Sbjct: 742 VAK 744
>gi|212541222|ref|XP_002150766.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
gi|210068065|gb|EEA22157.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
Length = 787
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 168/526 (31%), Positives = 257/526 (48%), Gaps = 76/526 (14%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
+ N+ G WG PLP W++ Q LQKKIL R+ ELG+ PVLP+F+G VP A+ V P+AK+
Sbjct: 203 LGNIQGFWGDPLPNEWIESQFELQKKILARMVELGITPVLPSFTGFVPRAITRVLPNAKV 262
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W S+ + C L+ D F + ++ I +Q YG SHIY D ++EN P
Sbjct: 263 VPGSRWNVFSSN--YTCDTFLEPFDDNFALLQKSTISKQQAYYGNISHIYALDQYNENNP 320
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-LV 178
+P+Y+ ++ +++ D DAVWLMQ WLF FW + A L+ V ++
Sbjct: 321 FSSNPDYLRNISRTTSQSLKAADPDAVWLMQSWLFLDATFWNNVTICAYLSGVENNSDML 380
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM- 237
+LDLFAE +P+W + +YG P+IWC +H++ GN+ +YG + +I A S + +
Sbjct: 381 ILDLFAESQPVWQLTDSYYGKPWIWCQVHDYGGNMGLYGQIMNITENATAALASSGSMVG 440
Query: 238 ---------------------AW------INQY-----SVRRYG-RSVP-AIQDAWNVLY 263
AW +QY +VR G + VP + DAW +L
Sbjct: 441 FGHTMESQEGNEIVYDLLLDQAWSETPINTSQYFEDWVTVRYAGTQHVPQQLFDAWEILR 500
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISV--TEGKYQNYGKPVSKEAVLKSETSSYDH 321
+ YN T+ A+ I+ +++PSI + EG + T +YD
Sbjct: 501 WSAYNNTNLASSSVPKSIL---ELEPSISGLLNREGHHPT--------------TINYD- 542
Query: 322 PHLWYSTSEVIRALEL-FIASGNELSA--SNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
P L V+ A L + A+ ELS + + YDLI LTRQ L + +I Y
Sbjct: 543 PEL------VVEAWALTYEAALLELSLWDNPAFNYDLIFLTRQVLVNAFIPRYELLISFY 596
Query: 379 QLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ-NEEQEKQYEWN 435
+ + R+ ++L++ +D +L ++ F L W+ A A N YE+N
Sbjct: 597 NNENYSVPAIVSAGRQLIDLLQSLDTVLGTNECFQLAQWINKAVSRAHGNTTLAAYYEYN 656
Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
AR QIT+W N + + DY +K W+GL+ YY PR I Y+
Sbjct: 657 ARNQITLWGPNGE-----ISDYASKQWAGLISSYYVPRWQILVDYL 697
>gi|327356744|gb|EGE85601.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ATCC 18188]
Length = 752
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 150/532 (28%), Positives = 254/532 (47%), Gaps = 79/532 (14%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
NL G WGG P W D Q LQKKIL R+ ELGM P+LPAF G VP A+ V P A+
Sbjct: 201 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 260
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + +P++ T L DP + + ++FI + ++ YG +H Y D F+E
Sbjct: 261 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 318
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNS-VPLGK 176
P P+++ + ++S D +A W+MQGWLF + +W +++A L++
Sbjct: 319 PSSGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 378
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDLFAE P+W +K F+G ++WC + F GN +YG + +I GP EA
Sbjct: 379 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAEAMAQHPNM 438
Query: 237 MA--------------------------------WINQYSVRRYG---RSVPA-IQDAWN 260
+ + + + RRY R+VP+ + +AW
Sbjct: 439 VGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHERTVPSELYEAWQ 498
Query: 261 VLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSETS 317
+L + YN T+ A + A P ++ + + EG
Sbjct: 499 LLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG-------------------- 538
Query: 318 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 377
L Y +++++A L I L ++Y+YD++D+TRQ L+ + ++
Sbjct: 539 ------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKVK 590
Query: 378 YQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ-YEWN 435
Y+ VF + + L +++ +D +L+ ++ F L W+ +A+ A ++ + +E N
Sbjct: 591 YKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDDSEAADFFEHN 650
Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
AR QIT+W E +L DY K W+GL+ YY PR ++ +Y+ ++ S
Sbjct: 651 ARNQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPRWRMFLEYLKDTPAS 698
>gi|323344412|ref|ZP_08084637.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
gi|323094539|gb|EFZ37115.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
Length = 730
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 167/568 (29%), Positives = 252/568 (44%), Gaps = 79/568 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP+ WL+ Q LQKKIL R M PVLPAF+G+VPA L+ +FP A I
Sbjct: 198 MANIDRWNGPLPKEWLNGQKELQKKILARERAFNMKPVLPAFAGHVPAELKRIFPDANIK 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + ++ C + L +PLF +I + ++E+Q +G T HIY D F+E PP
Sbjct: 258 SLGKWGGFEE--KYLC-HFLSPEEPLFSKIQKLYLEEQTALFG-TDHIYGVDPFNEVEPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
P Y+ + +Y + + D A W+ GW+FSYD W P +++A L VP GK+ +
Sbjct: 314 SWEPAYLRKVSKNMYGTLTAVDPKAEWMQMGWMFSYDNKHWTPDRVQAFLTGVPKGKMSL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI--------------------- 218
LD + E +W T+ FYG PYIWC L NF GN + G
Sbjct: 374 LDYYCENVELWKTTDGFYGQPYIWCYLGNFGGNTTLMGNVKESGRRLDNALANGQRNMLG 433
Query: 219 -------LDSIAFGPVEARTSENTTMA-----WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
LD I F P E ++ + A WI+ + R YG P+++ AW++L++ +
Sbjct: 434 AGSTLEGLDVIQF-PYEYLYNKLWSHAVADSRWIDDLADRHYGGVSPSVRKAWHILFNDI 492
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y + + +G N+ +P Y L
Sbjct: 493 Y---------------------VQVSASMQGVLTNF-RPALNNNYPHRTAIEYPAERL-- 528
Query: 327 STSEVIR-ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
EV R L++ NEL + D+I + RQ L + AY D
Sbjct: 529 --EEVWRLLLDVPRCDRNEL------QLDIIAVGRQVLGNRFAVVKTQFDSAYANKDIPR 580
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
+ + EL+ D+D L + + + W++ A++L +E + YE NAR IT W
Sbjct: 581 LKAKACEMEELLGDLDRLTSFNSRCSINRWIDDARKLGSTKELKDYYEKNARNLITTWGG 640
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
N + DY ++ W GL+ YY R +Y ++ + E+ F + + K
Sbjct: 641 N-------INDYASRTWGGLIGSYYAHRWRLYIDDILAAAEANKEFDQNAFNEKVSKFEQ 693
Query: 506 DWQNGRNVYPVESNGDALITSQWLYNKY 533
W V D L + L KY
Sbjct: 694 AWIISTEPITVPKRTDLLTFCRILIQKY 721
>gi|424665881|ref|ZP_18102917.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
616]
gi|404574134|gb|EKA78885.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
616]
Length = 732
Score = 233 bits (593), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 168/577 (29%), Positives = 257/577 (44%), Gaps = 96/577 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGP+ Q ++D+Q LQKK++ R+ E GM PVL F G VP ++ FP+A I
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTQLQKKMIDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + + L +DPLF ++ + F E+Q K +G+ S Y D F E N+
Sbjct: 258 DAGKWITYQRPA------FLVPSDPLFAKVAQIFYEEQEKLFGK-SRYYGGDPFHEGGNS 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
++ I+ + IY M++ + DA+W++QGW +P + ALL + G+ +
Sbjct: 311 EGIN----ITEAASDIYKAMKANNPDAIWVLQGW--GANPSY------ALLKGLKQGEAL 358
Query: 179 VLDLFAEVKPIW-----STSKQFYGV---PYIWCMLHNFAGNIEMYGILDSIAFGPVEA- 229
+LDL + +P W S S + G +IWC L NF G I MYG L S A G + A
Sbjct: 359 ILDLMSCARPQWGGDPSSQSHREDGYLDHNWIWCALPNFGGRIGMYGKLQSYATGVIRAE 418
Query: 230 ---------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ 256
RT +WI Y+ RYG +
Sbjct: 419 HHPKGKYVCGVGTTPEGIGTNPIDYDMVYDMAWRTDSIDVKSWIANYTTYRYGSPNNNAK 478
Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
A L +VYNC A A P + T
Sbjct: 479 AAMQQLSTSVYNCPWAADGPQESYFCARPSLKID------------------------RT 514
Query: 317 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
SS+ HL+Y V++ALE + + NEL +TYRYD++D+TRQ LA Y + I +
Sbjct: 515 SSWGTAHLYYQPINVLQALEHLLKAENELKEIDTYRYDVVDVTRQMLADYGKYIHKCIAD 574
Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
AY D + +FL+++ D D LL+ FLLG ++ A N +++ + NA
Sbjct: 575 AYYGKDTEKFDFYTSKFLQMISDQDLLLSTRKEFLLGKFIRQADACGSNPMEKRMFINNA 634
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
+ QIT W S L +Y +K W+G+L Y PR YF Y+ LE + +
Sbjct: 635 KRQITTW----ASVNSSLHEYAHKEWNGILGTLYAPRWKAYFDYLRTKLEGKNPKEI--- 687
Query: 497 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
++ + DW + + + ++ +Y+ Y
Sbjct: 688 --DFFTMETDWVESKKEFSAVPIKKEIEIAKTIYHNY 722
>gi|399028591|ref|ZP_10729778.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
gi|398073682|gb|EJL64846.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
Length = 727
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 161/549 (29%), Positives = 257/549 (46%), Gaps = 82/549 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLPQ W++++ +QKKIL R+ LGM+PV+PAFSG VP A P +KI+
Sbjct: 209 MGNINSLEGPLPQEWINKKENVQKKILQRMRALGMHPVVPAFSGYVPKAFAEKHPGSKIS 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W S + TYLLDA DPLF EIG+ FIE K YG+ + Y D F+E TPP
Sbjct: 269 ELKSW----SGGGFESTYLLDANDPLFKEIGKRFIEIYTKLYGQ-ADFYLADAFNEITPP 323
Query: 121 VDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V E +S G I+ + DA W+MQGWLF + FW KA L+ VP +
Sbjct: 324 VSKEHKYEELSDYGKTIFETINEASPDATWVMQGWLFGDNKEFWTKEATKAFLSKVPNDR 383
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD---------------- 220
+++ D + +W + FYG + + +HN+ G+ +YG L+
Sbjct: 384 MMIQDYANDRHKVWEKQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKNELTHLLGNSNKG 443
Query: 221 -SIAFGPVEARTSENTTM-----------------AWINQYSVRRYGR--SVPAIQDAWN 260
+ +G + + N+ + W+N+Y RYG+ S P Q AW
Sbjct: 444 NVVGYGVMPEGLNNNSIVYEYIYDLPWSQGKESVNDWLNKYLSARYGKNISTPVFQ-AWK 502
Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
+L +VY+ T D A+ + +TE K G P K+
Sbjct: 503 LLIESVYSTKYWETRWWDDRAGAYLFFKRPTLKITEFK----GNPGDKQ----------- 547
Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
++ +AL++ + ++ Y YDL+D++R + ++L + + AY+L
Sbjct: 548 ---------KLKQALDILKRESKSFNKNSLYFYDLLDMSRHYYSLCIDDLLIECVTAYEL 598
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
D +L ++ + D+D +L+ L WL+SA + E K Y NA+T I
Sbjct: 599 KDIKKADELFKKIEKQALDIDNMLSGQPLNSLNNWLKSASDYGSSPEVSKLYVKNAKTLI 658
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RL 493
T+W L DY ++ W G+ + +Y PR ++ + ES+ + F +
Sbjct: 659 TLWGGEGH-----LNDYASRSWRGMYKGFYWPRWKMFLQAQRESVVNNTSFDELKVRESI 713
Query: 494 KDWRREWIK 502
K W +W +
Sbjct: 714 KQWEIKWCQ 722
>gi|383120707|ref|ZP_09941431.1| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
gi|382984934|gb|EES68331.2| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
Length = 736
Score = 231 bits (589), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 157/526 (29%), Positives = 244/526 (46%), Gaps = 92/526 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 197 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 248
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ + Y+ D F E
Sbjct: 249 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 305
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I S M+ + +AVW+MQ W+ +A+++++ G
Sbjct: 306 GNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQA--------WQANPREAMVSTLDSG 353
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG--- 225
L+VLDL++E P +W K F +++CML NF GN+ ++G ++ + G
Sbjct: 354 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 413
Query: 226 ---PVEART-----------SENTTM-----------------AWINQYSVRRYGRSV-P 253
V +T N M AW+ Y RYG + P
Sbjct: 414 ACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWREERFAPDAWLQAYLKARYGNDLSP 473
Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
+ +AW L HTVYN + S++ G +Q+
Sbjct: 474 EVAEAWRALEHTVYNAPKNYQGEG---------TVESLLCARPGFHQD------------ 512
Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
TS++ + L+YS +A L ++ ++ +N + YDL+D+ RQ+LA N L
Sbjct: 513 -RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNNFEYDLVDVVRQSLADKGNVLLEE 571
Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
I ++Y D + S++FLEL+ D LL+ F + WL +A+ L EE++K YE
Sbjct: 572 ISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEFSVSSWLNAARSLGTTEEEKKLYE 631
Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 479
WNA IT+W D+ L DY ++ WSG+L+D Y R +F+
Sbjct: 632 WNASALITVWGDSIAANRGGLHDYSHREWSGILKDLYYQRWKTFFE 677
>gi|29349767|ref|NP_813270.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341678|gb|AAO79464.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 744
Score = 231 bits (589), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 157/526 (29%), Positives = 244/526 (46%), Gaps = 92/526 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 205 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 256
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ + Y+ D F E
Sbjct: 257 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 313
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I S M+ + +AVW+MQ W+ +A+++++ G
Sbjct: 314 GNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQA--------WQANPREAMVSTLDSG 361
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG--- 225
L+VLDL++E P +W K F +++CML NF GN+ ++G ++ + G
Sbjct: 362 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 421
Query: 226 ---PVEART-----------SENTTM-----------------AWINQYSVRRYGRSV-P 253
V +T N M AW+ Y RYG + P
Sbjct: 422 ACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWREERFAPDAWLQAYLKARYGNDLSP 481
Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
+ +AW L HTVYN + S++ G +Q+
Sbjct: 482 EVAEAWRALEHTVYNAPKNYQGEG---------TVESLLCARPGFHQD------------ 520
Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
TS++ + L+YS +A L ++ ++ +N + YDL+D+ RQ+LA N L
Sbjct: 521 -RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNNFEYDLVDVVRQSLADKGNVLLEE 579
Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
I ++Y D + S++FLEL+ D LL+ F + WL +A+ L EE++K YE
Sbjct: 580 ISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEFSVSSWLNAARSLGTTEEEKKLYE 639
Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 479
WNA IT+W D+ L DY ++ WSG+L+D Y R +F+
Sbjct: 640 WNASALITVWGDSIAANRGGLHDYSHREWSGILKDLYYQRWKTFFE 685
>gi|189465172|ref|ZP_03013957.1| hypothetical protein BACINT_01517 [Bacteroides intestinalis DSM
17393]
gi|189437446|gb|EDV06431.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides intestinalis DSM
17393]
Length = 723
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 168/587 (28%), Positives = 266/587 (45%), Gaps = 109/587 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 192 MNNLEGWGGPNPDSWYAQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 251
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+ L N F+ + L TD F EI + +Q K +G+ + Y+ D F E N
Sbjct: 252 SDLWNGFTRPA--------FLQPTDARFAEIADLYYREQEKLFGKADY-YSMDPFHEAEN 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI + M+ + A W++QGW + +P RP +K + N G L
Sbjct: 303 AASVD----FDAAGKAIMTAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 350
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSI-- 222
++LDLF+E +P+W G+P IW CML NF GN+ ++G +D +
Sbjct: 351 LILDLFSECRPMW-------GIPSIWKRDKGYEQHDWLFCMLLNFGGNVGLHGRMDQLLN 403
Query: 223 -----AFGPVEART---------SENTTM------------------AWINQYSVRRYGR 250
P+ SEN M W+ Y RYG
Sbjct: 404 NFYLTKNNPLATHLKGIGLTMEGSENNAMMFELMCELPWRPEKFTKEEWLKDYLFARYGV 463
Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 308
I+ AW +L +T+YNC G + + + P ++ + + + K +NY P
Sbjct: 464 RDEKIEQAWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVT 522
Query: 309 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 368
E A L + ++ +N + YDL+D+ RQ+L+
Sbjct: 523 E-----------------------EAARLMLEVADKYRGNNNFEYDLVDIVRQSLSDKGR 559
Query: 369 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
++ I ++ D + SR+FL+++ D LL F +G W+E A++L E+
Sbjct: 560 IVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARKLGTTPEE 619
Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
+ YEWNAR QIT W + + LRDY +K W+G+LRD+Y R A Y++ + + L+
Sbjct: 620 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 679
Query: 489 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
+L ++ + W +N Y S G+ + ++ + K +
Sbjct: 680 PEVKL-----DYYAMEEPWTLAKNPYGSTSEGNCVDVAKEAFEKVFE 721
>gi|423248233|ref|ZP_17229249.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
CL03T00C08]
gi|423253182|ref|ZP_17234113.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
CL03T12C07]
gi|392657082|gb|EIY50719.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
CL03T12C07]
gi|392660340|gb|EIY53954.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
CL03T00C08]
Length = 732
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 162/533 (30%), Positives = 239/533 (44%), Gaps = 91/533 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGP+ Q ++D+Q LQKK+L R+ E GM PVL F G VP ++ FP+A I
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTKLQKKMLDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + + L +DPLF ++ F E+Q K +G S Y D F E N+
Sbjct: 258 NAGKWITYQRPA------FLVPSDPLFAKVAEIFYEEQKKLFGE-SRYYGGDPFHEGGNS 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
++ I+ + IY M++ + +A+W++QGW S +P ALL + G+ +
Sbjct: 311 KGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--SGNP------SVALLKGLKHGEAL 358
Query: 179 VLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA- 229
VLDL A +P W F +IWC L NF G I MYG L S A G ++A
Sbjct: 359 VLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALPNFGGRIGMYGKLQSYATGVIKAE 418
Query: 230 ---------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ 256
RT +WI Y+ RYG +
Sbjct: 419 HHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDSIDIKSWIANYTTYRYGSENSNAK 478
Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
A L +VYNC A A P + +
Sbjct: 479 AAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYV------------------------ 514
Query: 317 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
SS+ HL+Y V++ALE + + EL +TYRYD++D+TRQ LA Y + I +
Sbjct: 515 SSWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRYDVVDITRQMLADYGKYIHKCISD 574
Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
AY+ + + +FL+++ D D LL+ FLLG ++ A N +++ + NA
Sbjct: 575 AYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLGEYIRQADTCGSNPTEKRMFINNA 634
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
+ QIT W S L +Y +K W+G+L Y PR +YF Y+ LE +
Sbjct: 635 KRQITSW----TSVNSSLHEYAHKEWNGILSTLYAPRWKVYFDYLHAKLEGKN 683
>gi|423269877|ref|ZP_17248849.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
CL05T00C42]
gi|423272668|ref|ZP_17251615.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
CL05T12C13]
gi|392700723|gb|EIY93885.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
CL05T00C42]
gi|392708745|gb|EIZ01850.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
CL05T12C13]
Length = 732
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 162/533 (30%), Positives = 239/533 (44%), Gaps = 91/533 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGP+ Q ++D+Q LQKK+L R+ E GM PVL F G VP ++ FP+A I
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTKLQKKMLDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + + L +DPLF ++ F E+Q K +G S Y D F E N+
Sbjct: 258 DAGKWITYQRPA------FLVPSDPLFAKVAEIFYEEQKKLFGE-SRYYGGDPFHEGGNS 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
++ I+ + IY M++ + +A+W++QGW S +P ALL + G+ +
Sbjct: 311 KGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--SGNP------SVALLKGLKHGEAL 358
Query: 179 VLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA- 229
VLDL A +P W F +IWC L NF G I MYG L S A G ++A
Sbjct: 359 VLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALPNFGGRIGMYGKLQSYATGVIKAE 418
Query: 230 ---------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ 256
RT +WI Y+ RYG +
Sbjct: 419 HHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDSIDIKSWIANYTTYRYGSENSNAK 478
Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
A L +VYNC A A P + +
Sbjct: 479 AAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYV------------------------ 514
Query: 317 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
SS+ HL+Y V++ALE + + EL +TYRYD++D+TRQ LA Y + I +
Sbjct: 515 SSWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRYDVVDITRQMLADYGKYIHKCISD 574
Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
AY+ + + +FL+++ D D LL+ FLLG ++ A N +++ + NA
Sbjct: 575 AYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLGEYIRQADTCGSNPTEKRMFINNA 634
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
+ QIT W S L +Y +K W+G+L Y PR +YF Y+ LE +
Sbjct: 635 KRQITSW----TSVNSSLHEYAHKEWNGILSTLYAPRWKVYFDYLHAKLEGKN 683
>gi|299140550|ref|ZP_07033688.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella oris C735]
gi|298577516|gb|EFI49384.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella oris C735]
Length = 741
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 157/526 (29%), Positives = 237/526 (45%), Gaps = 80/526 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW +QQ LQKKIL R++E GM PVLP F G +P + +T
Sbjct: 186 MNNLEGWGGPLPDSWYNQQEALQKKILKRMHEYGMQPVLPGFCGMMPHDAKAKL-GLNVT 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L TD F +I + + K YG+ ++ Y+ D F E T
Sbjct: 245 DGGIWNGYTRPAN------LSPTDAHFDKIADLYYAELTKLYGKANY-YSMDPFHE-TND 296
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
++ +Y S G + M+ + A W++QGW + +P RP +K + N G L+VL
Sbjct: 297 DETIDY-SKAGCKVMEAMKRVNPKATWVIQGW--TENP--RPQMIKNMKN----GDLLVL 347
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
DLF+E +P IW K + +++CML NF N+ ++G +D + + S
Sbjct: 348 DLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLLHNFYSTKQSSP 407
Query: 235 TTMA----------------------------------WINQYSVRRYGRSVPAIQDAWN 260
T WI Y RYG++ P I+ AW
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSELPWRTECKKEDWIKGYVKARYGKTSPEIERAWQ 467
Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
+L T+YNC G + SI G+P +KS + +
Sbjct: 468 LLSETIYNCPAGNNQQG---------PHESIFC---------GRPSLNNFQVKSWSKMRN 509
Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
+ Y + A +L ++ +N + YDL+D+ RQALA +L I Y
Sbjct: 510 Y----YDPQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALADQGRLQYLKTIADYNG 565
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
+ + RFLE++ D LL F LG W E+A++L ++++ YEWNAR QI
Sbjct: 566 FSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGTTQQEKDLYEWNARVQI 625
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
T W + + L DY +K W G+L+D+Y R I+ + + +E
Sbjct: 626 TTWGNRICADKGGLHDYAHKEWQGILKDFYYKRWKIFMDALAKQME 671
>gi|160887167|ref|ZP_02068170.1| hypothetical protein BACOVA_05183 [Bacteroides ovatus ATCC 8483]
gi|423295093|ref|ZP_17273220.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
CL03T12C18]
gi|156107578|gb|EDO09323.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392673999|gb|EIY67450.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
CL03T12C18]
Length = 711
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 156/526 (29%), Positives = 245/526 (46%), Gaps = 92/526 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ ++ Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ GA+I + M+ + AVW++Q W S P+ + ++ S+ G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPKAVWIIQAWQAS-------PR-EEMIASLNQG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E +P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSENTTM----------------------------------AWINQYSVRRYGRSV-P 253
A N M W+ Y RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472
Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
I +AW L HTVYN D + + S++ G + +
Sbjct: 473 EIMEAWRALEHTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511
Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
TS++ + L+Y+ +A LF + ++ +N + YDL+D+ RQ+ A N L
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570
Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
I ++Y D + +++FL+L+ D LL+ F + WL +A+ L EE+++ YE
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKRLYE 630
Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 479
WNA IT+W D+ L DY ++ WSGLL+D Y R +F+
Sbjct: 631 WNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLYYQRWKAFFE 676
>gi|340347658|ref|ZP_08670763.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
gi|433652542|ref|YP_007296396.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
3688]
gi|339608852|gb|EGQ13735.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
gi|433303075|gb|AGB28890.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
3688]
Length = 781
Score = 227 bits (579), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 168/540 (31%), Positives = 237/540 (43%), Gaps = 97/540 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW QQ LQK+IL R ELGM PVLP + G +P + +T
Sbjct: 207 MNNLEGWGGPLPDSWYRQQEALQKRILQRERELGMEPVLPGYCGMMPHDAKQKL-GLDVT 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + L ATDP F EI + +Q + YG+ SH Y+ D F E +
Sbjct: 266 PGGTWNG------YVRPANLSATDPRFDEIADLYYREQTRLYGK-SHYYSMDPFHETSDD 318
Query: 121 VDSPEYI--SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
V YI + G + + M+ + A W++QGW + P A+ + +P G L
Sbjct: 319 V----YIDYAQAGRKLMAAMKRENPKANWVIQGWTENPRP--------AMTDGLPAGSLT 366
Query: 179 VLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDS------IAFGP 226
VLDLF+E +P IW ++ + +++CML NF GN+ ++G +D +A P
Sbjct: 367 VLDLFSECRPMFGAPSIWKRAEGYGQHDWLFCMLENFGGNVGLHGRMDQLIGNFRLATSP 426
Query: 227 VEA---------------------------------RTSENTTMA--------WINQYSV 245
RT E A W+ Y
Sbjct: 427 QSPLQQARRHLRGIGFTMEGSENNPIMFELMSELPWRTDEVAQAADARTFRTEWVRGYVK 486
Query: 246 RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP 305
RYG P Q AW +L T+YNC G + SI G+P
Sbjct: 487 ARYGTDDPHAQQAWQLLAETIYNCPAGNNQQG---------PHESIFD---------GRP 528
Query: 306 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 365
+KS + ++ Y S + A L A+ + L +N Y YDL+D+ RQA+
Sbjct: 529 SLNNFQVKSWSKMRNY----YEPSATLEAARLMAAAADRLKGNNNYEYDLVDIVRQAIDD 584
Query: 366 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 425
A +++L+ I Y D + S RFL L+ D LL F LG W E+A+ L
Sbjct: 585 QARQVYLHAIADYNGFDRRAFSRDSARFLGLLLMQDRLLGTRREFRLGRWTEAARSLGTT 644
Query: 426 EEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL 485
++ YEWNAR QIT W + + LRDY +K W GLL D+Y R Y + +
Sbjct: 645 PAEKDLYEWNARVQITTWGNRACADQGGLRDYAHKEWQGLLADFYYMRWHTYLDALSRQM 704
>gi|449299394|gb|EMC95408.1| glycoside hydrolase family 89 protein [Baudoinia compniacensis UAMH
10762]
Length = 801
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 153/553 (27%), Positives = 254/553 (45%), Gaps = 96/553 (17%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP SW+ Q L K+I+ R+ ELGM PVLP F G VP + +P+A
Sbjct: 226 NIQGSWGGDLPMSWISSQFTLGKQIVARMVELGMTPVLPCFPGFVPMQIGRYYPNAMYIN 285
Query: 62 LGNWFSVKSDPRWCCTY-LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W PR L+ DPL+ + ++FI +Q YG S IY D ++EN P
Sbjct: 286 GSQW---NGFPRQNTNVSFLEPFDPLYTTLQKSFISKQTAAYGNVSSIYTLDQYNENNPY 342
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVV 179
Y+ ++ A + +++ D +AVW++QGWLF S FW ++A L V +++
Sbjct: 343 SADTTYLRNISAGTIAALKAADPNAVWMLQGWLFFSSATFWTDAAIRAYLGGVNNTDMII 402
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDLF+E +P W + +YG P+IWC LH++ GN+ +YG ++++ P++A + ++TM
Sbjct: 403 LDLFSETQPQWQRTNSYYGKPWIWCELHDYGGNMGLYGQVENVTINPIQALNNASSTMVG 462
Query: 238 ---------------------AW----------INQYSVRRYGRSV---PAIQDAWNVLY 263
AW + + RY + P + AW+ +
Sbjct: 463 MGLTMEGQEGNEIMYDILLDQAWSSTPLNNSLYFHDWVTSRYHGAASLPPGLYTAWDTMR 522
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP- 322
TVYN T +T + V + ++ P++ + + HP
Sbjct: 523 QTVYNNTQISTIQ--SVTKSIWELTPNVTGLLN--------------------RTGHHPT 560
Query: 323 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
+ Y+TS ++ A + F + + L S Y +DL D+TRQ +A L+ + + A
Sbjct: 561 TIQYNTSTLVGAWKQFYGAAAQEPTLWDSPGYLFDLTDVTRQVMANAFYPLYTSFVSASN 620
Query: 380 LNDAHGVFQ------LSRRFLELVEDMDGLLACH--DGFLLGPWLESAK----------- 420
+ A+ + ++ + L+ +D +LA F L W+ A+
Sbjct: 621 -HSANATYSPGNATIYGQQMVSLLSALDSMLAASPIPYFHLSTWIAEARSWSAPTATLPN 679
Query: 421 ---QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 477
L + + YE+NAR QIT+W Q + DY +K W+GL+ YY PR ++
Sbjct: 680 NATNLTSSSQTASFYEYNARNQITLWGPTGQ-----ISDYASKQWAGLISSYYVPRWQLF 734
Query: 478 FKYMIESLESGDG 490
Y + + +G
Sbjct: 735 VNYTLNGTTASNG 747
>gi|423226735|ref|ZP_17213200.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392627008|gb|EIY21049.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 718
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 165/587 (28%), Positives = 265/587 (45%), Gaps = 109/587 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 189 MNNLEGWGGPNPDSWYVQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 248
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+ L N F+ + L TD F EI + ++Q K +G+ + Y+ D F E N
Sbjct: 249 SDLWNGFTRPA--------FLQPTDVRFAEIADLYYQEQEKLFGKVDY-YSMDPFHEAEN 299
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI + M+ + A W++QGW + +P RP +K + N G L
Sbjct: 300 AASVD----FDAAGKAIMAAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 347
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSI-- 222
++LDLF+E +P+W G+P IW CML NF GN+ ++G +D +
Sbjct: 348 LILDLFSECRPMW-------GIPSIWKRDKGYEQHNWLFCMLLNFGGNVGLHGRMDQLLD 400
Query: 223 -----AFGPVEA---------------------------RTSENTTMAWINQYSVRRYGR 250
P+ R + T W+ Y RYG
Sbjct: 401 NFYLTKNNPLAVHLKGIGLTMEGAENNPMMFELMCELPWRPEKFTKEEWLKDYLFARYGV 460
Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 308
I+ AW +L +T+YNC G + + + P ++ + + + K +NY P
Sbjct: 461 RDEKIEKAWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVT 519
Query: 309 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 368
E A L + ++ +N + YDL+D+ RQ+L+
Sbjct: 520 E-----------------------EAARLMVEVADKYRGNNNFEYDLVDIVRQSLSDKGR 556
Query: 369 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
++ I ++ D + SR+FL+++ D LL F +G W+E A+ L E+
Sbjct: 557 IVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARNLGTTPEE 616
Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
+ YEWNAR QIT W + + LRDY +K W+G+LRD+Y R A Y++ + + L+
Sbjct: 617 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 676
Query: 489 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
+L ++ + W +N Y G + ++ ++ K ++
Sbjct: 677 PEVKL-----DYYAMEEPWTLAKNPYSSVPEGSCVDVAKEVFEKAMR 718
>gi|224537466|ref|ZP_03678005.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520904|gb|EEF90009.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
DSM 14838]
Length = 721
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 165/587 (28%), Positives = 265/587 (45%), Gaps = 109/587 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 192 MNNLEGWGGPNPDSWYVQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 251
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+ L N F+ + L TD F EI + ++Q K +G+ + Y+ D F E N
Sbjct: 252 SDLWNGFTRPA--------FLQPTDVRFAEIADLYYQEQEKLFGKVDY-YSMDPFHEAEN 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI + M+ + A W++QGW + +P RP +K + N G L
Sbjct: 303 AASVD----FDAAGKAIMAAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 350
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSI-- 222
++LDLF+E +P+W G+P IW CML NF GN+ ++G +D +
Sbjct: 351 LILDLFSECRPMW-------GIPSIWKRDKGYEQHNWLFCMLLNFGGNVGLHGRMDQLLD 403
Query: 223 -----AFGPVEA---------------------------RTSENTTMAWINQYSVRRYGR 250
P+ R + T W+ Y RYG
Sbjct: 404 NFYLTKNNPLAVHLKGIGLTMEGAENNPMMFELMCELPWRPEKFTKEEWLKDYLFARYGV 463
Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 308
I+ AW +L +T+YNC G + + + P ++ + + + K +NY P
Sbjct: 464 RDEKIEKAWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVT 522
Query: 309 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 368
E A L + ++ +N + YDL+D+ RQ+L+
Sbjct: 523 E-----------------------EAARLMVEVADKYRGNNNFEYDLVDIVRQSLSDKGR 559
Query: 369 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
++ I ++ D + SR+FL+++ D LL F +G W+E A+ L E+
Sbjct: 560 IVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARNLGTTPEE 619
Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
+ YEWNAR QIT W + + LRDY +K W+G+LRD+Y R A Y++ + + L+
Sbjct: 620 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 679
Query: 489 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
+L ++ + W +N Y G + ++ ++ K ++
Sbjct: 680 PEVKL-----DYYAMEEPWTLAKNPYSSVPEGSCVDVAKEVFEKAMR 721
>gi|299148671|ref|ZP_07041733.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
3_1_23]
gi|383114572|ref|ZP_09935334.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
gi|298513432|gb|EFI37319.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
3_1_23]
gi|313693722|gb|EFS30557.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
Length = 711
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/526 (29%), Positives = 243/526 (46%), Gaps = 92/526 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ ++ Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ GA+I + M+ + AVW++Q W+ + ++ S+ G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPKAVWIIQA--------WQANPREEMIASLNQG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E +P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSENTTM----------------------------------AWINQYSVRRYGRSV-P 253
A N M W+ Y RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472
Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
I +AW L HTVYN D + + S++ G + +
Sbjct: 473 EIMEAWRALEHTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511
Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
TS++ + L+Y+ +A LF + ++ +N + YDL+D+ RQ+ A N L
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570
Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
I ++Y D + +++FL+L+ D LL+ F + WL +A+ L EE+++ YE
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKRLYE 630
Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 479
WNA IT+W D+ L DY ++ WSGLL+D Y R +F+
Sbjct: 631 WNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLYYQRWKAFFE 676
>gi|380694112|ref|ZP_09858971.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 736
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 158/540 (29%), Positives = 245/540 (45%), Gaps = 106/540 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 199 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 250
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ + Y+ D F E
Sbjct: 251 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 307
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I M+ + +AVW+MQ W+ +A++N++ G
Sbjct: 308 GNTEGVD----LAKAGTSIMGAMKKANPEAVWVMQA--------WQANPREAMVNTLDSG 355
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E P +W K F +++CML NF GN+ ++G ++ + G
Sbjct: 356 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 415
Query: 229 ARTS----------------ENTTMA------------------WINQYSVRRYGRSV-P 253
A EN M W+ Y RYG + P
Sbjct: 416 ACAHINGKTLRGVGATPEGIENNPMMFELLYELPWREERFSPDIWLQGYLKARYGDDLSP 475
Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
+ +AW L HTVYN +NY + E++L
Sbjct: 476 EVTEAWRALEHTVYNAP-----------------------------KNYQGEGTVESLLC 506
Query: 314 S-------ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 366
+ TS++ + L+YS +A +L ++ + +N + YDL+D+ RQ+LA
Sbjct: 507 ARPGFHLDRTSTWGYAKLFYSPDSTAKAAQLLLSVADRYKGNNNFEYDLVDIVRQSLADK 566
Query: 367 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 426
AN L I ++Y D + +++FL L+ D LL+ F + WL +A+ L E
Sbjct: 567 ANVLLEEISQSYDRKDKDSFRKQTQQFLGLILSQDSLLSTRKEFSVSSWLSAARSLGTTE 626
Query: 427 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
E++K YEWNA IT+W D+ L DY ++ WSGLL+D Y R +F+ + L+
Sbjct: 627 EEKKLYEWNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLYYQRWNTFFEQKQQELD 686
>gi|322702923|gb|EFY94542.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
ARSEF 23]
Length = 589
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 161/538 (29%), Positives = 254/538 (47%), Gaps = 85/538 (15%)
Query: 7 WGG--PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGN 64
WGG L W+D Q LQKKI+ R+ ELG+ PVLPAF G VP A V P A T+
Sbjct: 28 WGGVGNLSSGWIDAQFELQKKIVARMVELGITPVLPAFPGFVPPAFSRVQPDANTTKAPR 87
Query: 65 WFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
W + D T+ L D + + +AFI +Q++ +G ++IY D F+E P + P
Sbjct: 88 WTGLP-DTNTRDTF-LSPLDTSYARLQQAFISKQIEAFGNVTNIYTLDQFNEMPPTSNEP 145
Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG--KLVVLDL 182
Y+S + Y + + + AVWL+QGWLF W ++ A L P G ++VLDL
Sbjct: 146 SYLSQVSTYTYKALTAANPAAVWLLQGWLFLNSGLWTEERVTAYLGG-PEGHNSMLVLDL 204
Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT----- 236
++E +P W +K ++G P+IWC LH+F GN+ MYG + I ++A RTS + +
Sbjct: 205 YSESRPQWQRTKGYFGRPWIWCQLHDFGGNMGMYGQISDITVQSMDALRTSPSLSGFGMT 264
Query: 237 ----------------MAWINQ----------YSVRRYG--RSVPAIQDAWNVLYHTVYN 268
AW Y VRRY ++ AW++L +Y+
Sbjct: 265 PEGYEGNEVVYQMLFDQAWTTTPIDTSGYFYGYVVRRYAGVSQTNSLFQAWDILRQNIYD 324
Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-HLWYS 327
+K+R V V G YQN + L + T ++ P ++Y
Sbjct: 325 ------NKDRQVPC-----------VGVGIYQN----APSLSGLVNRTGNWPPPTKVYYD 363
Query: 328 TSEVIRALELFIASGNELSA---SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
+ + +A L I + NE+ T++ D++D+TRQ ++ N ++ + ++ + +
Sbjct: 364 PATLKKAHSLLIQAANEIPQLWDIPTFQLDVVDVTRQVMSNAFNTMYTDYVQTFNSQLSR 423
Query: 385 GVFQLSRR---------------FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
+S R L+ + D+D +LA + F L WL++A+ A+
Sbjct: 424 QKSHISNRGGLQRRDDFATKGKQLLDFLTDLDRVLATNQHFRLDSWLDAAQYWAKQTGAN 483
Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
+NAR+QIT W E+ L DY K WSGL R YY R +I+ + ++L S
Sbjct: 484 DLIAFNARSQITTWI----WESEALNDYAVKEWSGLTRSYYRGRWSIFVDGLNKALAS 537
>gi|393784337|ref|ZP_10372502.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
CL02T12C01]
gi|392666113|gb|EIY59630.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
CL02T12C01]
Length = 728
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 151/522 (28%), Positives = 232/522 (44%), Gaps = 87/522 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ ++ +Q LQ+K+L R+ ELGM PV F G VP L+ +P A+I
Sbjct: 193 MGNLEGFGGPVSPEFIARQTDLQQKMLKRMRELGMKPVFQGFYGMVPNVLKKKYPDARIK 252
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W + + LD TDPLF + + E+Q K +G + + D F E
Sbjct: 253 EQGTWQTYQRPA------FLDPTDPLFDRVAAIYYEEQKKLFG-DAEFFGGDPFHEGG-- 303
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
++ I M+ + AVW++QGW ++P +K L++ + G+ ++L
Sbjct: 304 TSEGIHVKLAAQKILQAMRKVNPKAVWVLQGW--QHNP------VKDLMDGLNPGETIIL 355
Query: 181 DLFAEVKPIWS--TSKQFYGVP------YIWCMLHNFAGNIEMYGILDSIAFGPVEA--- 229
DL A +P W T+ F+ +IWC L NF G ++G + S A G V A
Sbjct: 356 DLMACERPQWGGVTTSMFHKPEGHQDHRWIWCALPNFGGKTGLHGKMSSYASGAVFAKEH 415
Query: 230 -------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQDA 258
RT W+ Y+ RYG A
Sbjct: 416 PMGRNICGIGTAPEGIGTVPVVYDMVYDMAWRTDSIQIPQWLTNYTYYRYGMEDTNCDKA 475
Query: 259 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 318
W +L TVY C + I A P A S+
Sbjct: 476 WKILSETVYECHNELGGPVESYICARP------------------------ADTIDHVST 511
Query: 319 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
+ + ++Y +++ A E S N + +TY YDL+D+TRQ L+ YA L ++EA+
Sbjct: 512 WGNARIFYEPVKMVEAWEFLYQSRNRFNHCDTYEYDLVDVTRQVLSDYAKYLHKEMVEAF 571
Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 438
+ +G + S FL++++D D LL+ F+LG WL A+ E+++++ NA+
Sbjct: 572 HQKNENGFMKYSTEFLDVIKDEDRLLSTRKEFMLGTWLTEAENAGCTPEEKRRFVTNAKR 631
Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 480
+T W D + L DY NK WSGLL D+Y PR Y Y
Sbjct: 632 LVTTWTDRDSD----LHDYANKEWSGLLSDFYLPRWEAYVTY 669
>gi|393788556|ref|ZP_10376683.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
CL02T12C05]
gi|392654236|gb|EIY47884.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
CL02T12C05]
Length = 732
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 170/552 (30%), Positives = 248/552 (44%), Gaps = 96/552 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ ++ +Q LQ+K+L R+ ELGM PV F G VP AL+ FP A+I
Sbjct: 196 MGNLEGFGGPVTPEFIARQTDLQQKMLKRMRELGMKPVFQGFYGMVPNALKEKFPDARIK 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + + LD TDPLF ++ + E+Q +G + + D F E
Sbjct: 256 DQGIWGTYQRPA------FLDPTDPLFDKLAAIYYEEQKNLFGE-AQFFGGDPFHEG--- 305
Query: 121 VDSPEYISSLGAA--IYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+ E I+ AA I M+ + AVW++QGW ++P +K L+ V G+ +
Sbjct: 306 -GTSEGINVKLAAQKILQAMRKVNPQAVWVLQGW--QHNP------VKELMEGVKPGETI 356
Query: 179 VLDLFAEVKPIWSTSK-QFYGVP-------YIWCMLHNFAGNIEMYGILDSIAFGPVEA- 229
+LDL A +P W K + P +IWC L NF G ++G + S A GPV A
Sbjct: 357 ILDLMACERPQWGGVKTSMFHKPEGHWNHQWIWCALPNFGGKTGLHGKMSSYASGPVFAK 416
Query: 230 ---------------------------------RTSENTTMAWINQYSVRRYGRSVPAIQ 256
RT W++ Y+ RYG
Sbjct: 417 HHPMGKNICGIGTAPEGIGTIPVVYDMVYDMAWRTDSIHIPQWLDNYTYYRYGTEDNNCN 476
Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 316
AW +L T+Y C + I A P S T +G V
Sbjct: 477 RAWKLLSETIYECHNELGGPVESYICARP-------SDTIQHVSTWGNAV---------- 519
Query: 317 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
++Y +V++A +L S + S+TY YDL D+TRQ L+ YA L ++
Sbjct: 520 -------MFYDPMKVVKAWDLLYQSRKRFNHSDTYEYDLTDVTRQVLSDYAKYLHERMVL 572
Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
A+Q D + S +FL +++D D LL+ F+LG WL A++ E+++++ NA
Sbjct: 573 AFQKKDKERFMEYSGKFLNIIKDEDRLLSTRKEFMLGTWLAEAEKAGGTPEEKRRFVTNA 632
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
+ IT W D + L DY NK WSGLL D+Y PR Y Y SL G D+
Sbjct: 633 KRLITTWTDTDSD----LHDYANKEWSGLLIDFYLPRWEAYVTYKT-SLLYGKKLPYPDY 687
Query: 497 RR---EWIKLTN 505
+ EW+ LTN
Sbjct: 688 SKMEQEWV-LTN 698
>gi|423223006|ref|ZP_17209475.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640582|gb|EIY34381.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 755
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 171/589 (29%), Positives = 260/589 (44%), Gaps = 109/589 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP+S +D+ ++L KKIL R ELGM P+ FSG VP LQ +P AKI+
Sbjct: 195 MQNIQSYGGPLPKSVIDKHVILGKKILARQLELGMQPIQQGFSGYVPRELQAKYPQAKIS 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W T LD TDPLF E+G AF+E+Q K +G + +Y D F E+ PP
Sbjct: 255 MKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDKLFG-SYGVYAADPFHESAPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+D+PEY++ +G I+ Q+ D+ A+W+MQ W D ++ +VP L++L
Sbjct: 308 IDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED----------IVKAVPKESLLIL 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG---ILDSIAFGPVEAR------- 230
DL + + +G P I LHNF G I M+G +L S + +AR
Sbjct: 358 DLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLALLASNQYQKAKARYPNVCGS 413
Query: 231 ------TSENTT-----------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
+N AW+ Y+ RRYG A AW L Y
Sbjct: 414 GLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERRYGAKSAAAGKAWMYLLEGPY 473
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
G R IVA P++ G G P Y
Sbjct: 474 R--QGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP--------------------YE 508
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
VIRA + ++L+ S YR+D++D+ RQ + + EA+ D
Sbjct: 509 PMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLGQLVHKKAAEAFASKDKAAFV 568
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
S RFLEL+ DMD LL + WL A+ + +E++ E +A + +T+W +
Sbjct: 569 LHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKEEKDLMERDATSLVTIWGADG 628
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---------------- 491
+ DY + W+GL+ YY PR ++ + L++G +
Sbjct: 629 DPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQGHLDAGTDYQEEGLSLAYGREDFRA 685
Query: 492 -----RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
RL +W ++ Q G+ PV ++GD L+ ++ L++KYL+
Sbjct: 686 NDFYNRLAEWELAYVD-----QTGKARTPV-THGDELVVTRRLFDKYLK 728
>gi|224537227|ref|ZP_03677766.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521150|gb|EEF90255.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
DSM 14838]
Length = 755
Score = 224 bits (572), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 171/589 (29%), Positives = 260/589 (44%), Gaps = 109/589 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP+S +D+ ++L KKIL R ELGM P+ FSG VP LQ +P AKI+
Sbjct: 195 MQNIQSYGGPLPKSVIDKHVILGKKILARQLELGMQPIQQGFSGYVPRELQAKYPQAKIS 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W T LD TDPLF E+G AF+E+Q K +G + +Y D F E+ PP
Sbjct: 255 MKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDKLFG-SYGVYAADPFHESAPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+D+PEY++ +G I+ Q+ D+ A+W+MQ W D ++ +VP L++L
Sbjct: 308 IDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED----------IVKAVPKESLLIL 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG---ILDSIAFGPVEAR------- 230
DL + + +G P I LHNF G I M+G +L S + +AR
Sbjct: 358 DLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLALLASNQYQKAKARYPNVCGS 413
Query: 231 ------TSENTT-----------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
+N AW+ Y+ RRYG A AW L Y
Sbjct: 414 GLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERRYGAKSAAAGKAWMYLLEGPY 473
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
G R IVA P++ G G P Y
Sbjct: 474 R--RGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP--------------------YE 508
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
VIRA + ++L+ S YR+D++D+ RQ + + EA+ D
Sbjct: 509 PMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLGQLVHKKAAEAFASKDKAAFA 568
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
S RFLEL+ DMD LL + WL A+ + +E++ E +A + +T+W +
Sbjct: 569 LHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKEEKDLMERDATSLVTIWGADG 628
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---------------- 491
+ DY + W+GL+ YY PR ++ + L++G +
Sbjct: 629 DPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQGHLDAGTDYQEEGLSLAYGREDFRA 685
Query: 492 -----RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
RL +W ++ Q G+ PV ++GD L+ ++ L++KYL+
Sbjct: 686 NDFYNRLAEWELAYVD-----QTGKARTPV-THGDELVVTRRLFDKYLK 728
>gi|393786624|ref|ZP_10374756.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
CL02T12C05]
gi|392657859|gb|EIY51489.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
CL02T12C05]
Length = 717
Score = 224 bits (572), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 171/581 (29%), Positives = 264/581 (45%), Gaps = 105/581 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P W QQ+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPDHWYTQQVSLQKKILKRMHEYGIEPVLPGYCGMVPHNAK--------A 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L D F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 239 KLG--LNV-SDPGVWCGYRRPAFLQPDDSRFEEISSLYYKELEKLYGKANY-YSMDPFHE 294
Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+D + ++G A+ M+ + AVW++Q W + P L+ ++ G
Sbjct: 295 GGS-IDGVN-LDAVGKAVMKAMKKANPKAVWVIQAWQANPRP--------ELIRNLETGD 344
Query: 177 LVVLDLFAEVKPIWST------SKQFYGVP-YIWCMLHNFAGNIEMYGILDSI------A 223
L++LDL +E +P W K YG +++CML N+ N+ ++G +D++ A
Sbjct: 345 LLILDLTSECRPQWGDPESEWYRKDGYGKHNWVYCMLLNYGANVGLHGKMDNVIDNYYLA 404
Query: 224 FGPVEARTS-----------EN------------------TTMAWINQYSVRRYGRSVPA 254
+ AR + EN T W+ Y RYG+ P
Sbjct: 405 KENLRARATLKGVGMTPEGIENNPVMYELLMELPWRPERFTKEDWLKGYVKARYGKDEPV 464
Query: 255 IQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
+Q AW L +++YN T + + V A P +D YQ
Sbjct: 465 LQLAWGKLANSIYNAPKELTQQGTHESVFCARPGLDV---------YQ------------ 503
Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
SS+ +Y EVI A L ++ + + + YDL+D+ RQA+A+ +
Sbjct: 504 ---VSSWSEMKDYYDPQEVIEAARLMVSVADRYRGNTNFEYDLVDVVRQAIAEKGRLMQK 560
Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
+ AY+ D S++FL L+ D LL F LG W+ SA+ L E++ Y
Sbjct: 561 AVTTAYRAGDKELFAMASQKFLNLILLQDQLLGTRTEFRLGRWINSARALGVTPEEKALY 620
Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
EWN R Q+T W + E LRDY +K W+GLL+D+Y R +YF + +E G+
Sbjct: 621 EWNTRVQVTTWGNRNAAERGGLRDYAHKEWNGLLKDFYYMRWKLYFDNLACKME-GETIP 679
Query: 493 LKDW---RREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
D+ W+K TN +Q E GD + T++ ++
Sbjct: 680 EIDFYAVEEAWVKRTNPYQ-------AEPEGDCVDTAKLIF 713
>gi|409042145|gb|EKM51629.1| glycoside hydrolase family 89 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 749
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 148/534 (27%), Positives = 256/534 (47%), Gaps = 72/534 (13%)
Query: 6 GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW 65
WGG LP W+ Q LQK+IL R+ ELGM P+LPAF+G VP+ + +P+A I W
Sbjct: 201 AWGGLLPMQWISDQQALQKQILPRMLELGMTPILPAFTGFVPSNMSAHYPNASIIDGSAW 260
Query: 66 FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPE 125
S L+ DPL+ ++ ++FI +Q + YG +H Y D ++EN P +
Sbjct: 261 SGFPS--TLTNVSFLEPFDPLYPQMQQSFITKQQEAYGNITHFYTLDQYNENNPFSGNDS 318
Query: 126 YISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-KLVVLDLF 183
Y+SS+ + + +++ D +A W+MQGWL FS + FW +++A L +++LDL+
Sbjct: 319 YLSSVSTSTIASLRAADPEATWVMQGWLFFSSETFWTNDRIEAYLGGAQGNDSMLILDLY 378
Query: 184 AEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA----- 238
+E +P W+ + ++G ++WC LH++ GN+ + G L +I GP+ A S ++M
Sbjct: 379 SEAQPQWNRTDSYFGKQWVWCELHDYGGNMGLEGNLAAITEGPIAALNSNGSSMVGMGLT 438
Query: 239 -----------------------------WINQYSVRRY-GRSVPA-IQDAWNVLYHTVY 267
W+ +++ RRY +++P +Q AW +L T+Y
Sbjct: 439 MEGMEIGNEIVYDILLDQAWSSTPLNVSDWVAKWAARRYLVKTLPTELQQAWTILSTTIY 498
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
N D P+ +I S+ E + G L + T + + +
Sbjct: 499 NNQD-------------PNSQATIKSILELEPATTG--------LVNVTGHHPTEIPYDT 537
Query: 328 TSEVIRALELFI-ASGNELSASNT--YRYDLIDLTRQALAKYANELFLNIIEAYQLND-- 382
+ ++ AL+LF+ AS ++ S + D+++L+RQ + +L+ ++I + +
Sbjct: 538 NTTILHALQLFVNASKSQPSLKQVPEFAVDILELSRQLMVNRFIDLYTDLINTWNSSSST 597
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNARTQIT 441
A V L L+ D+D LL ++ +L W+ AKQ A Y E+ AR Q T
Sbjct: 598 AQNVTTAGVPLLSLISDLDVLLYTNENYLFSTWIADAKQWAHGNVSYAAYLEYQARNQQT 657
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
+W + DY +K +GL+ +YY R + + E SG + +
Sbjct: 658 LWGPQGN-----INDYASKQTAGLVGEYYATRWQTFVVMLAEQKTSGQPYNATE 706
>gi|237708859|ref|ZP_04539340.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
gi|345513372|ref|ZP_08792893.1| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
gi|423228941|ref|ZP_17215347.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
CL02T00C15]
gi|423242228|ref|ZP_17223337.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
CL03T12C01]
gi|423247755|ref|ZP_17228803.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
CL02T12C06]
gi|229457285|gb|EEO63006.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
gi|345456211|gb|EEO47557.2| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
gi|392631297|gb|EIY25272.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
CL02T12C06]
gi|392635177|gb|EIY29082.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
CL02T00C15]
gi|392639514|gb|EIY33330.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
CL03T12C01]
Length = 717
Score = 224 bits (571), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 163/538 (30%), Positives = 252/538 (46%), Gaps = 100/538 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYIRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + ++ K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYKELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W + +++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--------QDNPRTSMIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ G
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALINGFY 402
Query: 228 EARTSEN-----------------------------------TTMAWINQYSVRRYGRSV 252
+A+T + T W+ Y RYG
Sbjct: 403 DAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
A+Q AW++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGTRKEFKVGTWIEAARSAGQTQEEK 617
Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+ L+
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 675
>gi|212693694|ref|ZP_03301822.1| hypothetical protein BACDOR_03214 [Bacteroides dorei DSM 17855]
gi|265755881|ref|ZP_06090348.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
gi|212663753|gb|EEB24327.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
gi|263233959|gb|EEZ19560.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
Length = 718
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 163/538 (30%), Positives = 252/538 (46%), Gaps = 100/538 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPESWYIRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + ++ K YG+T Y D F E
Sbjct: 240 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYKELTKLYGKTG-FYAIDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W + +++ +
Sbjct: 296 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--------QDNPRTSMIEHLEA 343
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ G
Sbjct: 344 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALINGFY 403
Query: 228 EARTSEN-----------------------------------TTMAWINQYSVRRYGRSV 252
+A+T + T W+ Y RYG
Sbjct: 404 DAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
A+Q AW++L + +YN + + V A P +D YQ
Sbjct: 464 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 560 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGTRKEFKVGTWIEAARSAGQTQEEK 618
Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+ L+
Sbjct: 619 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 676
>gi|410096483|ref|ZP_11291470.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
CL02T12C30]
gi|409226447|gb|EKN19356.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
CL02T12C30]
Length = 718
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 170/587 (28%), Positives = 273/587 (46%), Gaps = 108/587 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQ++I+ R+ E G+ PV P +SG VP +
Sbjct: 187 MNNLEGWGGPNPDSWYKQQITLQQRIVKRMREYGIEPVFPGYSGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L TDP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 239 KLG--LNV-SDPGLWCGYHRPAFLQPTDPRFQEIASLYYKELNKLYGK-ANFYSMDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+ VD + + G AI M+ + AVW+ Q W + ++ ++
Sbjct: 295 GGSVAGVD----LDAAGKAIMQAMKKNNPKAVWVAQAW--------QANPRSQMIENLKA 342
Query: 175 GKLVVLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSI----- 222
G ++VLDLF+E +P W + F +I+CML N+ GN+ ++G + +
Sbjct: 343 GDMIVLDLFSESRPQWGDPESTWHRKDGFGQHDWIYCMLLNYGGNVGLHGKMAHVIDEYY 402
Query: 223 -----AFGP------VEARTSENTTM------------------AWINQYSVRRYGRSVP 253
+FG + SEN + W+ Y+V RYG++ P
Sbjct: 403 KAKESSFGKTLCGVGMTMEGSENNPVMFELLTELPWRPVHFDKNEWLKNYTVARYGKANP 462
Query: 254 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
+Q+AW +L +++YNC T + + + A P P ++S
Sbjct: 463 TVQEAWILLSNSIYNCPPENTQQGTHESIFCARPSDHPYLVSSW---------------- 506
Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
SE S Y Y+ +VIRA + ++ ++ + +N + YDL+D+ RQA+A+ L
Sbjct: 507 --SEMSDY------YNPDDVIRAAAMMVSVADQFTGNNNFEYDLVDIVRQAIAE-KGRLV 557
Query: 372 LNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
++EA + D + RFL+L+ D LL F +G W+ + L E++
Sbjct: 558 EKVVEASFASGDKQLYNTAANRFLQLLLLQDELLGTRPEFKVGNWIARTRSLGNTPEEKD 617
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R +F Y E L+
Sbjct: 618 LYEWNARVQITTWGNRNAADKGGLRDYAHKEWNGILKDFYYMRWKTWFDYQNELLDGKKP 677
Query: 491 FRLKDWRRE--WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
+ + E W KLT+ Y E GD + T + ++ + +
Sbjct: 678 TAIDFYALEEPWTKLTDS-------YSSEPEGDCISTVKRIFAEVFE 717
>gi|373460171|ref|ZP_09551927.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
gi|371956556|gb|EHO74342.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
Length = 742
Score = 223 bits (569), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 155/529 (29%), Positives = 238/529 (44%), Gaps = 85/529 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW QQ LQKKIL R++E GM PVLP F G +P + +T
Sbjct: 186 MNNLEGWGGPLPDSWYKQQETLQKKILQRMHEYGMEPVLPGFCGMMPHDAKEKL-GLNVT 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L TD F I + + + YG+ ++ Y+ D F E+
Sbjct: 245 DGGKWNGYTRPAN------LSPTDSQFNRIADLYYAELTRLYGKANY-YSMDPFHESNDD 297
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D+ +Y G+ + M+ + A W++QGW + +P RP ++ + N G L++L
Sbjct: 298 -DALDY-GKAGSVMLEAMKRINPKATWVIQGW--TENP--RPRMIQDMKN----GDLLIL 347
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEART- 231
DLF+E +P +W K + +++CML NF N+ ++G +D + F + R+
Sbjct: 348 DLFSECRPMFGIPSVWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLIHNFYSTKKRSP 407
Query: 232 --------------SENTTMA------------------WINQYSVRRYGRSVPAIQDAW 259
SEN + W+ Y RYGR I+ AW
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSELPWRPEIFKKEDWVRGYVKARYGRKDETIERAW 467
Query: 260 NVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 317
+L T+YNC G + + V P ++ N+ +
Sbjct: 468 LLLAETIYNCPAGNNQQGPHESVFCGRPGLN------------NF------------QVK 503
Query: 318 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 377
S+ +Y + A L + + +N + YDLID+ RQALA +L I
Sbjct: 504 SWSKMRNYYDPQATLEAARLMASVSSRYKGNNNFEYDLIDICRQALADQGRLQYLKTIAD 563
Query: 378 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 437
Y + ++RFL+++ D LL F LG W E+A+ L + ++ YEWNAR
Sbjct: 564 YNGFSRAAFAKDAKRFLDMILLQDRLLGTRKEFRLGHWTEAARSLGTTQAEKDLYEWNAR 623
Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
QIT W + T + LRDY +K W G+L+D+Y R IY + + +E
Sbjct: 624 VQITTWGNRTCADNGGLRDYAHKEWQGILKDFYYKRWKIYMDALAKQME 672
>gi|427385205|ref|ZP_18881710.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
12058]
gi|425727373|gb|EKU90233.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
12058]
Length = 719
Score = 223 bits (569), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 162/586 (27%), Positives = 263/586 (44%), Gaps = 109/586 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ VLQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 190 MNNLEGWGGPNPDSWYAQQEVLQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 249
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+ L N F+ + L TD F EI + +Q K +G+ + Y+ D F E N
Sbjct: 250 SDLWNGFTRPA--------FLQPTDTRFAEIANLYYREQEKLFGKADY-YSMDPFHEAEN 300
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI M+ + A W++QGW + +P RP ++ + N G L
Sbjct: 301 AASVD----FDAAGKAIMQAMKKVNPKATWVVQGW--TENP--RPEMIENMKN----GDL 348
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSI-- 222
++LDLF+E +P+W G+P IW CML NF GN+ ++G +D +
Sbjct: 349 LILDLFSECRPMW-------GIPSIWKRDKGYEQHDWLFCMLLNFGGNVGLHGRMDQLLD 401
Query: 223 ---------------AFGPVEARTSENTTM-----------------AWINQYSVRRYGR 250
G + N M W+ Y RYG
Sbjct: 402 NFYQTKDNPLATHLKGIGLTMEGSENNPVMFELMCELPWRPEKFTKEEWLKDYLFARYGV 461
Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 308
I+ AW +L +++YNC G + + + P ++ + + + K +NY P
Sbjct: 462 KDEKIEKAWTLLANSIYNCPFGNNQQGPHESIFCGRPSMN-NFQASSWSKMKNYYDPTVT 520
Query: 309 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 368
E A L + ++ +N + YDL+D+ RQ+L+
Sbjct: 521 E-----------------------EAARLMLEVADKYRGNNNFEYDLVDIVRQSLSDKGR 557
Query: 369 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
++ I ++ D + S++FL+++ D LL F +G W+E A+ L E+
Sbjct: 558 IVYNQTIADFKSFDKRSFARDSQKFLDILLLQDRLLGTRSEFRVGRWIEQARNLGTTPEE 617
Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
+ YEWNAR QIT W + + LRDY +K W+G+LRD+Y R A Y++ + + L+
Sbjct: 618 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 677
Query: 489 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
+L ++ + W + Y G+++ ++ ++ K +
Sbjct: 678 PEVKL-----DYYAMEEPWTLAKTPYDSTPEGNSVDVAKEVFEKAM 718
>gi|156046298|ref|XP_001589681.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980]
gi|154693798|gb|EDN93536.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 795
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 164/556 (29%), Positives = 267/556 (48%), Gaps = 102/556 (18%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP SW+++Q +LQKKI+ R+ ELG+ PVLPAF+G VP+AL+ + P+A I
Sbjct: 207 FGNIQGSWGGTLPLSWIEEQHLLQKKIVKRMVELGITPVLPAFTGFVPSALRRIAPNANI 266
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G+W ++ T+L TDPLF + F+ Q + YG +HIY D ++EN P
Sbjct: 267 INGGDWGNIFPVEYSNDTFLY-PTDPLFTTLQHKFLSFQSEYYGNVTHIYTLDQYNENNP 325
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-L 177
Y+ ++ Y +QS D AVW++QGWLF S FW +++A + VP + +
Sbjct: 326 ASGDLSYLRNVSRGTYESLQSFDPCAVWMLQGWLFYSLSSFWTQDRIEAYIGGVPKNESM 385
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSEN-- 234
++LDLF+E P W + +YG P+IWC L ++ G + +YG + +I +EA R SEN
Sbjct: 386 LILDLFSESFPQWERTHYYYGKPWIWCQLRDYGGTLGLYGQIYNITNSLIEAFRESENMV 445
Query: 235 ---TTM------------------------------AWINQ-YSVRRYGRSVPA-IQDAW 259
TM +W+ + Y ++ + +P I +AW
Sbjct: 446 GVGNTMEGQGGNGLMYELLLDQAWNIDPIDTEDYFKSWVRKRYHIKGAKKRLPGEIYEAW 505
Query: 260 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 319
++L T YN T+ + V + ++ P+I +N+G+ ++S+
Sbjct: 506 DILRRTAYNNTNLTLADS--VPKSLHELQPNIT-------ENHGR--------LGQSSTI 548
Query: 320 DHPHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
D Y ++ RA EL S EL +++D++D+TRQ LA+ ++ +IE
Sbjct: 549 DL----YDPDDLFRAWELLYNASVSVPELWEDKGWKFDMVDITRQVLAERFKLEYVELIE 604
Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ--------------- 421
Y+ + + ++E +D +L+ F L W+ +A
Sbjct: 605 KYK--KGADISCDGDILIGILESLDDVLSASPHFRLDTWVNAAVSSAPLPASTNCSSTSI 662
Query: 422 -------------LAQNEEQEKQ-YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 467
L N +Q + +NA QIT+W Q + DY +K W GL+R
Sbjct: 663 NNSSLLFNSSTSILTSNLTPTQQFFAYNAINQITIWGPTGQ-----IDDYASKSWGGLVR 717
Query: 468 DYYGPRAAIYFKYMIE 483
YY PR ++ +Y+ E
Sbjct: 718 GYYLPRWKMFLEYIDE 733
>gi|423217398|ref|ZP_17203894.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
CL03T12C61]
gi|392628557|gb|EIY22583.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
CL03T12C61]
Length = 707
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 156/557 (28%), Positives = 251/557 (45%), Gaps = 95/557 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVSRMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L + D F + E+ K YG+ + Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I M+ + +AVW++Q W + P A+++ + G
Sbjct: 305 GNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAWQANPRP--------AMIDVLNAG 352
Query: 176 KLVVLDLFAEVKPIWSTS-------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
++VLDL++E +P W S K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DMLVLDLYSEKRPQWGDSDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 A----------------------------------RTSENTTMAWINQYSVRRYGRSV-P 253
A R + W+ Y RYG + P
Sbjct: 413 ACAHVNGKRMRGVGATPEGIENNPVMFELLYELPWRAERFSPDVWLQGYLKARYGGELSP 472
Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
+ +AW L HTVYN P P EG ++ ++
Sbjct: 473 EVMEAWRALEHTVYNA---------------PKNSPG-----EGTLESL--LCARPGFHL 510
Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
TS++ + L+YS +A +L ++ + N + YDL+D+ RQ+ A N L
Sbjct: 511 DRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGDNNFEYDLVDIVRQSNADKGNALLDE 570
Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
I ++Y D + +++FLEL+ D LL+ F + WL +A+ L + ++K YE
Sbjct: 571 ISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEFSVSSWLAAARSLGNTDAEKKLYE 630
Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE---SGDG 490
WNA IT+W D+ L DY ++ WSGLL+D Y R +F+ + LE SG+
Sbjct: 631 WNASALITVWGDSIASNQGGLHDYSHREWSGLLKDLYYLRWKTFFEQKQQELEGKASGEV 690
Query: 491 FRLKDWRREWIKLTNDW 507
+ W + +++
Sbjct: 691 INFYGMEKAWAEKVHEY 707
>gi|423722278|ref|ZP_17696454.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
CL09T00C40]
gi|409242419|gb|EKN35181.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
CL09T00C40]
Length = 752
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 169/584 (28%), Positives = 249/584 (42%), Gaps = 102/584 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL K+I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250
Query: 61 QLGNWFSVKSDPRWC---CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF IGR F+E++ K YG +Y D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPVD+PEY+ ++G AI+ D +++W MQ W R P +KA VP L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL------REPIVKA----VPKENL 351
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------------------- 217
++LDL +G P + LHNF G I ++G
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408
Query: 218 -----ILDSIAFGPV--------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
++SI PV E W+ +Y+ RRYG+ AW L
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
Y G R I+A P++ G G P
Sbjct: 469 GPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
YS V++A L + L S+ YR+D++D+ RQ ++ + EA++ D
Sbjct: 505 -YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMSNLGQAIHKQAAEAFRKKDKE 563
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
S RFLE++ D D LL F WL A+ N E++ +E +A +T+W
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEKDLFEKDATALVTVW- 622
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW----RREW 500
+ L+ DY + W+GL+ YY R ++ + + L++G + KD RE
Sbjct: 623 --GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDHLDAGTNYSEKDLPQTHGRES 680
Query: 501 IKLTN------DWQ-----NGRNVYPVESNGDALITSQWLYNKY 533
+ + DW+ V + GD + T+ LY KY
Sbjct: 681 FRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRLYKKY 724
>gi|423312588|ref|ZP_17290525.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
CL09T03C04]
gi|392688276|gb|EIY81565.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
CL09T03C04]
Length = 717
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 166/532 (31%), Positives = 250/532 (46%), Gaps = 100/532 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ G
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 402
Query: 228 EA-------RTSENTTMA----------------------------WINQYSVRRYGRSV 252
+A RT M W+ Y RYG
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
A+Q AW++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617
Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYL 669
>gi|237721435|ref|ZP_04551916.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|293370838|ref|ZP_06617383.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|229449231|gb|EEO55022.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|292634054|gb|EFF52598.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 711
Score = 222 bits (566), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 151/517 (29%), Positives = 240/517 (46%), Gaps = 92/517 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ ++ Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ GA+I + M+ + +AVW++Q W+ + ++ S+ G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPEAVWIIQA--------WQANPREEMIASLNQG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E +P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSENTTM----------------------------------AWINQYSVRRYGRSV-P 253
A N M W+ Y RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472
Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
I +AW L +TVYN D + + S++ G + +
Sbjct: 473 EIMEAWRALEYTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511
Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
TS++ + L+Y+ +A LF + ++ +N + YDL+D+ RQ+ A N L
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570
Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
I ++Y D + +++FL+L+ D LL+ F + WL +A+ L EE+++ YE
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKRLYE 630
Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
WNA IT+W D+ L DY ++ WSGLL+D Y
Sbjct: 631 WNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLY 667
>gi|395804724|ref|ZP_10483959.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
gi|395433112|gb|EJF99070.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
Length = 722
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 157/547 (28%), Positives = 255/547 (46%), Gaps = 79/547 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLPQ W ++ LQKKIL R+ L M+PV+PAFSG VP A P AKIT
Sbjct: 208 MGNINSLEGPLPQEWFVKKEALQKKILERMKALDMHPVVPAFSGYVPKAFAEKHPEAKIT 267
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W S + T+LLD+ DPLF +IG+ FIE K YG+ S+ Y D+F+E PP
Sbjct: 268 ELKSW----SGGGFASTFLLDSKDPLFKQIGKRFIEIYTKMYGK-SNFYLADSFNEIEPP 322
Query: 121 V---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V + E +S+ G+A+Y + AVW+MQGWLF + FW KA L+ VP K
Sbjct: 323 VSEHNKYEELSNYGSAVYETIDEAAPGAVWVMQGWLFGDNKEFWTKEATKAFLSKVPNEK 382
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD---------------- 220
++V D + +W + FYG + + +HN+ G+ +YG L+
Sbjct: 383 VMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKDELASLLKNPNRG 442
Query: 221 -SIAFGPVEARTSENTTM-----------------AWINQYSVRRYGRSVPAIQDAWNVL 262
+ +G + + N+ + W+ +Y RYG++ ++ AW +L
Sbjct: 443 NIVGYGAMPEGLNNNSIVYEYIYDLPWTKAEQPLNDWMAKYLNARYGQTSESVFHAWELL 502
Query: 263 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
+VYN T D A+ + +TE K G P K
Sbjct: 503 LKSVYNVKYWETRWWNDWAGAYLLFKRPTVKITEFK----GNPGDK-------------- 544
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
++ AL++ + + +N +YDLID++R + +E + I+AYQ +
Sbjct: 545 ------IKLKEALDILKKEAKKYNKNNLIQYDLIDVSRHYNSLSIDEELIECIKAYQEKN 598
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
QL ++ + V + D +++ L W++SA + E Y NA+T IT+
Sbjct: 599 IAKGDQLFKQIEKQVLETDKMMSGQPLNNLNQWVKSASDYGSSPEVSSLYAKNAKTLITL 658
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKD 495
W L DY ++ W G+ + +Y PR ++ + + ++ + F +K+
Sbjct: 659 WGGEGH-----LNDYASRSWKGMYKGFYWPRWKMFLEALKKAAVTNTSFDENKERESIKN 713
Query: 496 WRREWIK 502
W W +
Sbjct: 714 WEINWTE 720
>gi|154492110|ref|ZP_02031736.1| hypothetical protein PARMER_01741 [Parabacteroides merdae ATCC
43184]
gi|154087335|gb|EDN86380.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
43184]
Length = 752
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 165/584 (28%), Positives = 249/584 (42%), Gaps = 102/584 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL K+I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250
Query: 61 QLGNWFSVKSDPRWC---CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF IGR F+E++ K YG +Y D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPVD+PEY+ ++G AI+ D +++W MQ W ++++ +VP L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSLR----------ESIVKAVPKENL 351
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------------------- 217
++LDL +G P + LHNF G I ++G
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408
Query: 218 -----ILDSIAFGPV--------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
++SI PV E W+ +Y+ RRYG+ AW L
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
Y G R I+A P++ G G P
Sbjct: 469 GPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
YS V++A L + L S+ YR+D++D+ RQ ++ + EA++ D
Sbjct: 505 -YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMSNLGQAIHKQAAEAFRKKDKE 563
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
S RFLE++ D D LL F WL A+ N E++ +E +A +T+W
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEKDLFEKDATALVTVW- 622
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW----RREW 500
+ L+ DY + W+GL+ YY R ++ + + L++G + KD RE
Sbjct: 623 --GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDHLDAGTNYSEKDLPQTHGRES 680
Query: 501 IKLTN------DWQ-----NGRNVYPVESNGDALITSQWLYNKY 533
+ + DW+ V + GD + T+ LY KY
Sbjct: 681 FRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRLYKKY 724
>gi|346323119|gb|EGX92717.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
Length = 742
Score = 221 bits (564), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 151/526 (28%), Positives = 253/526 (48%), Gaps = 75/526 (14%)
Query: 10 PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVK 69
PL SW+DQQ LQK+I+ R+ +LG+ P+LPAF G VP A + P A + + W +
Sbjct: 195 PLSLSWIDQQFALQKRIVARMVQLGITPILPAFPGFVPDAFARLRPGADLVRAPAWGGLP 254
Query: 70 SDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS 129
+D L D + E+ R F+E Q++ YG +++Y D F+E P + +Y+S+
Sbjct: 255 ADSPNTRALFLSPLDDAYAELQRLFVEAQIEAYGNVTNVYAMDQFNEINPVSGATDYLSA 314
Query: 130 LGAAIYSGMQSGDSDAVWLMQGWLF--SYDPFWRPPQMKALLNSVP-LGKLVVLDLFAEV 186
+ Y+ + + + AVWLMQGWLF S FW +++A L +V+LDLF+E
Sbjct: 315 VSRRSYAALAAANPAAVWLMQGWLFYLSEGNFWTQERIEAYLRGPEDRAGMVILDLFSET 374
Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENT---------- 235
P W + + G P+IWC +H+F GN ++G + + P+EA R S++
Sbjct: 375 APQWQRTGSYAGRPWIWCQVHDFGGNQNLFGKITNTTVNPMEALRESDSMVGLGIATEAY 434
Query: 236 ---------------------TMAWINQYSVRRYG--RSVPA-IQDAWNVLYHTVYNCTD 271
T+++ + ++ RRY R +PA + AW +L TVY+
Sbjct: 435 EGNEVLYDLFFDQGWSATPIDTVSYFHDWTTRRYSGVRQLPASLYQAWELLRVTVYDY-- 492
Query: 272 GATDKNRDVI---VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
+ D+I V+ ++P++ + Y K L + ++ P +W
Sbjct: 493 ----RASDLIGVPVSVYQLEPNLTGL-------YNTTTGKPTALHYDPAAL--PPIW--- 536
Query: 329 SEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
LF+A+ L A +R DL+D+ RQ L+ L+ +++ A+
Sbjct: 537 -------RLFVAAAAAQPRLWAEPGFRLDLVDVMRQVLSNAFGRLYADLVAAFTGGAPPS 589
Query: 386 -VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ Q +R ++ D+D LLA F L WL +A+ ++ + + AR+Q+T+W
Sbjct: 590 EIAQRGQRMRAVLGDVDALLATQPHFSLRRWLNAARAWGESTGENAAIAYEARSQVTIWA 649
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
T LL DY K WSGL+ YY R I+ ++++ E+ G
Sbjct: 650 PGT-----LLNDYAAKAWSGLIATYYDERWRIFVDRLVDAAENHGG 690
>gi|198277542|ref|ZP_03210073.1| hypothetical protein BACPLE_03764 [Bacteroides plebeius DSM 17135]
gi|198270040|gb|EDY94310.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides plebeius DSM
17135]
Length = 722
Score = 221 bits (564), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 163/577 (28%), Positives = 260/577 (45%), Gaps = 95/577 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 191 MNNLEGWGGPNPDSWYTQQEALQKKILKRMREYGIEPVFPGYSGMVPHDANKKLGLNVTE 250
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
L N F+ + L TD F EI + ++ K +G+ ++ Y+ D F E +
Sbjct: 251 PALWNGFTRPA--------FLLPTDSRFNEIASLYYKELEKLFGKANY-YSMDPFHELED 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G A+ M++ + A W++QGW + +P RP +K L N G +
Sbjct: 302 AGSVD----FDAAGKAVLKAMKNVNPKATWVIQGW--TENP--RPEMIKNLNN----GDI 349
Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSI-------AF 224
++LDLF+E +P IW K + +++CM+ NF GN+ ++G +D +
Sbjct: 350 LILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMIENFGGNVGLHGRMDQLLNNFYLTKN 409
Query: 225 GPVEA---------------------------RTSENTTMAWINQYSVRRYGRSVPAIQD 257
P+ A R + T W+ Y RYG I
Sbjct: 410 NPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKFTKEEWLKDYLFARYGVRDEKITQ 469
Query: 258 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
AW++L +YNC G + + + P ++ + + + K QNY P S EA
Sbjct: 470 AWSILADGIYNCPFGNNQQGPHESIFCGRPGLN-NFQASSWSKMQNYYDPTSTEA----- 523
Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
A L + ++ +N + YDL+D+ RQ+L+ ++ I
Sbjct: 524 ------------------AARLMLEVADKYKGNNNFEYDLVDIVRQSLSDRGRIVYNQTI 565
Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 435
++ D S+ FL ++ D LL F +G W+E A+ L E++ YEWN
Sbjct: 566 ADFKSFDKKSFATHSQEFLNILLAQDRLLGTRSEFRVGRWIEQARNLGTTPEEKDLYEWN 625
Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 495
AR QIT W + LRDY +K W+GLL+D+Y R A Y++ + + L+ L
Sbjct: 626 ARVQITTWGNRVCANDGGLRDYAHKEWNGLLKDFYYKRWAAYWQTLQDVLDGKPMVEL-- 683
Query: 496 WRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 532
++ + W N Y + GD + ++ ++NK
Sbjct: 684 ---DYYAMEEPWTLAHNPYASQPEGDCVSVAKEVFNK 717
>gi|358378969|gb|EHK16650.1| glycoside hydrolase family 89 protein [Trichoderma virens Gv29-8]
Length = 748
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 149/520 (28%), Positives = 258/520 (49%), Gaps = 73/520 (14%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG +P+SW+D Q LQ KIL R+ ELG+ P+LPAF G VP + VFP ++
Sbjct: 203 NIQGSWGGSMPRSWVDSQFDLQLKILDRMEELGITPILPAFPGFVPRNISRVFPDISLST 262
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + ++ ++ DP F ++ + FI +Q + YG ++ + D F+EN P
Sbjct: 263 SPIWSNFGTEL--SADIYINPFDPRFAQLQKLFISKQQELYGNVTNFWTLDQFNENQPLS 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-LVV 179
Y+ ++ +S +++ D +AVW+MQ WLFS D FW ++++ L +P+ +++
Sbjct: 321 GDLGYLQNVSHNTWSALKAADPEAVWVMQAWLFSSDSAFWTNDRIESFLGGIPVNSDMLL 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
LDLFAE P W + FYG P+IWC LH++ GN+ +YG ++++ ++A + +
Sbjct: 381 LDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGLYGQIENVTINSMDAVRNSGSLVGF 440
Query: 236 ----------------------------TMAWINQYSVRRYG-RSVPAIQDAWNVLYHTV 266
T + + + RYG ++V ++ W +L TV
Sbjct: 441 GLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHDWVSTRYGTKNVKSLYTGWELLRPTV 500
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
+N T+ + + I+ ++ PS + G+ ++G ++ Y
Sbjct: 501 FNNTNLTMNAVQKSIL---ELVPSTTGLL-GRVGHHGTTIT------------------Y 538
Query: 327 STSEVIRA-LELFIASGNELS--ASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
+ + ++ A ELF A ++ + Y+YDL+D TRQ L L+ +++ AY +
Sbjct: 539 NPAVMVEAWTELFKAGLQDIKLFTNPAYQYDLVDWTRQVLVNSFEGLYKDLVAAYNSAAS 598
Query: 384 HGVFQLSR--RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
V + SR + + L+ +D +LA ++ F L PW+ A+ A + E+NAR QIT
Sbjct: 599 SSVIK-SRGAKLIALLRTLDAVLATNEHFQLTPWINEAR--ASSPSTADFLEYNARNQIT 655
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
+W E DY +K W+GL+ YY R + Y+
Sbjct: 656 LWGPQGNIE-----DYASKQWAGLVGTYYVERWQQFIDYL 690
>gi|62088640|dbj|BAD92767.1| huntingtin interacting protein-1-related [Homo sapiens]
Length = 449
Score = 221 bits (563), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 134/413 (32%), Positives = 207/413 (50%), Gaps = 73/413 (17%)
Query: 142 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 200
D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P
Sbjct: 18 DTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQP 77
Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM----------------------- 237
+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 78 FIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAEL 137
Query: 238 -----------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 285
AW+ ++ RRYG S P AW +L +VYNC+ + NR +V P
Sbjct: 138 GWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRP 197
Query: 286 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 345
L+ TS +WY+ S+V A L + S L
Sbjct: 198 S-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAPSL 226
Query: 346 SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLL 404
+ S +RYDL+DLTRQA+ + + + AY + + + EL+ +D +L
Sbjct: 227 ATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVL 286
Query: 405 ACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSG 464
A FLLG WLE A+ A +E + YE N+R Q+T+W E ++L DY NK +G
Sbjct: 287 ASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAG 341
Query: 465 LLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 517
L+ +YY PR ++ + +++S+ G F+ + + +L + + YP +
Sbjct: 342 LVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQ 394
>gi|410095990|ref|ZP_11290981.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
CL02T12C30]
gi|409227396|gb|EKN20294.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
CL02T12C30]
Length = 753
Score = 221 bits (562), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 173/590 (29%), Positives = 251/590 (42%), Gaps = 114/590 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D L KK++ R ELGM P+ FSG VP L+N +P AKI
Sbjct: 193 MQNLQSYGGPLPKSWIDSHAELGKKVINRQLELGMQPIQQGFSGYVPRELKNKYPDAKI- 251
Query: 61 QLGNWFSVKSDPRWC---CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF GR F+E++ K +G +Y D F E+
Sbjct: 252 --------QLQPSWCGFTGAAQLDPTDSLFSAFGRDFLEEEKKLFG-AHGVYAADPFHES 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PP+D+PEY+S++G +IY Q D A+W MQ W R P +KA VP L
Sbjct: 303 RPPIDTPEYLSAVGNSIYKLFQDFDPSAIWAMQAWSL------REPIVKA----VPKEHL 352
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------------------- 217
++LDL +T +G P + LHNF G I ++G
Sbjct: 353 LILDLNGGRSRQENTC---WGYPVVAGNLHNFGGRINLHGDLRLLASNQYAVAKQKSPNV 409
Query: 218 -----ILDSIAFGPV--------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
++SI PV E W+ Y+ RRYG + AW L
Sbjct: 410 CGSGLFMESIEQNPVYYDLAFEMPLHADEVDIEEWLGDYAERRYGAASENAHKAWLHLLE 469
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
Y G R I+A P++ G G P
Sbjct: 470 GPYR--PGTNGTERSSIIA---ARPALNVKKSGPNAGLGIP------------------- 505
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
YS VI+A L + ++L+AS YR+D++D+ RQ ++ + EA+ D
Sbjct: 506 -YSPLLVIQAQGLLLKDADKLNASTPYRFDVVDIQRQLMSNLGQAIHKKAAEAFVKKDKA 564
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
S RFLE++ D+D LL F WL A+ E++ E +A +T+W
Sbjct: 565 AFTLHSNRFLEMLRDVDVLLRTRPEFNFDKWLTDARSWGTTNEEKDLLEKDATALVTVW- 623
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG--------------DG 490
+ L+ DY + W+GL+ YY R ++ + E L+ G +
Sbjct: 624 --GADGDPLIFDYSWREWTGLIDSYYLKRWEKFYAMLQEHLDEGNEYSEKGLPMTHGREA 681
Query: 491 FR-------LKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
FR L DW E++ TN + P+ + GD + T+ +Y KY
Sbjct: 682 FRANDFYSELGDWELEFVSRTNKART-----PI-TQGDEIETALKMYKKY 725
>gi|294775488|ref|ZP_06741000.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
gi|294450633|gb|EFG19121.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
Length = 712
Score = 221 bits (562), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 166/538 (30%), Positives = 252/538 (46%), Gaps = 100/538 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 182 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 233
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 234 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 289
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 290 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 337
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ G
Sbjct: 338 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 397
Query: 228 EARTS-----------------EN------------------TTMAWINQYSVRRYGRSV 252
+A+ EN T W+ Y RYG
Sbjct: 398 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 457
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 458 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 498
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 499 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 553
Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 554 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 612
Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+ L+
Sbjct: 613 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 670
>gi|150004413|ref|YP_001299157.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149932837|gb|ABR39535.1| glycoside hydrolase family 89 [Bacteroides vulgatus ATCC 8482]
Length = 717
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 165/532 (31%), Positives = 249/532 (46%), Gaps = 100/532 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ G
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 402
Query: 228 EA-------RTSENTTMA----------------------------WINQYSVRRYGRSV 252
+A RT M W+ Y RYG
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617
Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYL 669
>gi|345519733|ref|ZP_08799147.1| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
gi|345457107|gb|EET15964.2| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
Length = 717
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 166/538 (30%), Positives = 251/538 (46%), Gaps = 100/538 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ G
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALIDGFY 402
Query: 228 EA-------RTSENTTMA----------------------------WINQYSVRRYGRSV 252
+A RT M W+ Y RYG
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617
Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+ L+
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 675
>gi|319643377|ref|ZP_07998003.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
gi|317385006|gb|EFV65959.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
Length = 718
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 166/538 (30%), Positives = 251/538 (46%), Gaps = 100/538 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 240 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 296 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 343
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ G
Sbjct: 344 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALIDGFY 403
Query: 228 EA-------RTSENTTMA----------------------------WINQYSVRRYGRSV 252
+A RT M W+ Y RYG
Sbjct: 404 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 463
Query: 253 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 464 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 504
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 559
Query: 371 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 429
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 560 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 618
Query: 430 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+ L+
Sbjct: 619 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 676
>gi|423212382|ref|ZP_17198911.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694828|gb|EIY88054.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
CL03T12C04]
Length = 705
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 155/527 (29%), Positives = 237/527 (44%), Gaps = 94/527 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ VLQKKI+ R+ ELG+ PV P ++G VP + +I
Sbjct: 196 MNNLEGWGGPNPDSWYRQQEVLQKKIVARMRELGIEPVFPGYAGMVPRNIGEKL-GYQIA 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W S PR L D F + E+ K YG+ ++ Y+ D F E NT
Sbjct: 255 DPGKWCSF---PR---PAFLSTEDEHFESFAAMYYEELEKLYGKANY-YSMDPFHEGGNT 307
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD ++ GA+I + M+ + AVW++Q W+ + +++S+ G ++
Sbjct: 308 EGVD----LAKTGASIMAAMKKANPKAVWVIQA--------WQANPREEMISSLNQGDML 355
Query: 179 VLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
VLDL++E P W K F +++CML NF N+ ++G +D + G +A
Sbjct: 356 VLDLYSERLPQWGDPDSKWYREKGFGKHDWLYCMLLNFGANVGLHGRMDLLVNGYYDACA 415
Query: 232 SENTTM----------------------------------AWINQYSVRRYGRSV-PAIQ 256
N W+ Y RYG+ V P +
Sbjct: 416 HANGKTLRGVGATPEGIENNPVMFELLYELPWREERFSPDEWLQGYLKARYGKDVSPEVM 475
Query: 257 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVL 312
+AW L HTVYN RD YQ G S +
Sbjct: 476 EAWRALEHTVYNAP-------RD-------------------YQGEGTVESLLCARPGFH 509
Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
TS++ + L+YS +A L + + SN + YDL+D+ RQ+ A N L
Sbjct: 510 LDRTSTWGYAKLFYSPDSTAKAARLLTSVAKQYEGSNNFEYDLVDIVRQSNADKGNVLLE 569
Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
+I ++Y D + +++FL+L+ D LL+ F + WL++A+ L + ++K Y
Sbjct: 570 DISQSYDRKDKENFRKQTQQFLDLIVSQDSLLSTRKEFSVSTWLDAARSLGTTDAEKKLY 629
Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 479
EWNA IT+W D+ L DY ++ WSG+L+D Y R +F+
Sbjct: 630 EWNASALITVWGDSIASNQGGLHDYSHREWSGILKDLYYQRWKAFFE 676
>gi|153807690|ref|ZP_01960358.1| hypothetical protein BACCAC_01972 [Bacteroides caccae ATCC 43185]
gi|149130052|gb|EDM21264.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
Length = 707
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 156/564 (27%), Positives = 252/564 (44%), Gaps = 109/564 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVSRMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L + D F + E+ K YG+ + Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I M+ + +AVW++Q W + P A+++ + G
Sbjct: 305 GNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAWQANPRP--------AMVDVLNAG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
++VLDL++E P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DMLVLDLYSERLPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSENTTM----------------------------------AWINQYSVRRYGRSV-P 253
A T N W+ Y RYG + P
Sbjct: 413 ACTHANGKTLRGVGTTPEGIENNPVMFELLYELPWRAERFSPDTWLQGYLKARYGGELSP 472
Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
+ +AW L HTVYN +NY + E++L
Sbjct: 473 EVMEAWRALEHTVYNAP-----------------------------KNYQGEGTVESLLC 503
Query: 314 S-------ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 366
+ TS++ + L+YS +A +L ++ + +N + YDL+D+ RQ+ A
Sbjct: 504 ARPGFHLDRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGNNNFEYDLVDIVRQSNADK 563
Query: 367 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 426
N L I ++Y D + +++FLEL+ D LL+ F + WL +A+ L +
Sbjct: 564 GNALLDEISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEFSVSSWLTAARSLGNTD 623
Query: 427 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
++K YEWNA IT+W D+ L DY ++ WSGLL+D Y R +F+ + LE
Sbjct: 624 AEKKLYEWNASALITVWGDSIASNQGGLHDYSHREWSGLLKDLYYLRWKTFFEQKQQELE 683
Query: 487 ---SGDGFRLKDWRREWIKLTNDW 507
SG+ + W + +++
Sbjct: 684 GKASGEVINFYGMEKAWAEKAHEY 707
>gi|358391826|gb|EHK41230.1| glycoside hydrolase family 89 protein [Trichoderma atroviride IMI
206040]
Length = 751
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 154/541 (28%), Positives = 262/541 (48%), Gaps = 72/541 (13%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G W G +P +W+D Q LQ +IL R+ ELG+ P+LPAF G VP + VFP ++
Sbjct: 204 NIQGSWNGNMPGNWVDDQFALQLQILDRMKELGITPILPAFPGFVPRNISRVFPGISLST 263
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + D TY ++ DP F ++ + FI +Q + YG + + D F+EN P
Sbjct: 264 SPLWENFAEDLS-ADTY-VNPFDPHFTQLQKLFIGKQQELYGNVTKFWTLDQFNENQPLS 321
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPL-GKLVV 179
Y+ ++ ++ ++S DA+W+MQ WLFS D FW ++A L + +++
Sbjct: 322 SDLGYLRNVSQNTWTALKSASPDAIWVMQAWLFSADSSFWTNDAIEAFLGGITEDSDMLL 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
LDLFAE P W + FYG P+IWC LH++ GN+ +YG ++++ ++A + ++
Sbjct: 382 LDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGLYGQIENVTINAMQAVRNSSSLVGF 441
Query: 236 ----------------------------TMAWINQYSVRRYG-RSVPAIQDAWNVLYHTV 266
T + + + RYG +V ++ W +L TV
Sbjct: 442 GLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHDWVSARYGTENVKSLYTGWELLRPTV 501
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
+N T+ + I+ ++ P+I + G+ +G + +YD +
Sbjct: 502 FNNTNLTVNAVPKSIL---ELTPNINGLL-GRVGRHGTTI-----------NYDPAVMVD 546
Query: 327 STSEVIRA-LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN-DAH 384
+ +E+ +A LE GN Y+YDL+D TRQ L + L+ +++ AY + +A
Sbjct: 547 AWTELFKAGLEDVKLFGNP-----AYQYDLVDWTRQVLVNSFDGLYKDLVTAYNSSANAA 601
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + L++ +D +LA ++ F L W+ +A+ A N E+NAR Q+T+W
Sbjct: 602 EIRSRGSKLTALLKTLDAVLATNENFQLATWIAAAR--ASNPSNTSFLEYNARNQVTLWG 659
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGF--RLKDWRRE 499
Q E DY +K W+GL+ DYY R + Y+ + S F +L+ W +
Sbjct: 660 PTGQIE-----DYASKQWAGLVGDYYLGRWQQFIDYLATTKHSSYNQTAFYHKLQAWEIQ 714
Query: 500 W 500
W
Sbjct: 715 W 715
>gi|374312699|ref|YP_005059129.1| alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
gi|358754709|gb|AEU38099.1| Alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
Length = 754
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 160/551 (29%), Positives = 245/551 (44%), Gaps = 96/551 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ + GPLPQ +++++ +LQ+++L R+ ELGM PV PAF+G VP + + P +
Sbjct: 221 MGNINHFAGPLPQHFMEEKRILQRQVLNRMRELGMKPVAPAFAGFVPQGFKRLHPEVETF 280
Query: 61 QLGNWF--SVKSDPRWCCTYLLD-ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
L W K+ PR T++L L+ +IG+ FIE+ EYG + Y DTF+E
Sbjct: 281 TL-LWLRKEFKTIPRSTRTFILHPGQQELYRQIGKKFIEEYKAEYGEVEY-YLADTFNEL 338
Query: 118 TPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
PV E + G ++ +Q+GD W+MQGWLF YD FW ++ALL +P
Sbjct: 339 EVPVREDHRYEDLERFGRTVFESIQAGDPKGTWVMQGWLFVYDSDFWNKESVEALLRGIP 398
Query: 174 LGKLVVLDLFAEVKPI---------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
+++++D ++ P W K F+G P+I M H F GN + G L +A
Sbjct: 399 NDRMLIIDYANDLAPSVQGKYLPGQWKLQKAFFGKPWINGMAHTFGGNNNIKGNLKLMAT 458
Query: 225 GPVEARTS----------------ENTTM------------------AWINQYSVRRYGR 250
P S EN + WI Y RYG
Sbjct: 459 EPSTVLASPERGNLVGWGMCPEGIENNEVVYELMTDAGWQSEAIDLATWIPAYCRSRYGD 518
Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
PA+Q AW +L + Y+ T + A P V P SV G P +
Sbjct: 519 CPPAMQQAWELLLKSAYSSHIWMT---KQAWQAEPSVHPIAASVDAG-------PTFQ-- 566
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
RA+ELF++ +L+ S YR DLI+ QA+ +E
Sbjct: 567 ----------------------RAVELFLSCAPQLAKSELYRNDLIEFVSQAVGGRVDEA 604
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
++A + R +E + +DGL+ L W+++ + A+ +++
Sbjct: 605 LALAVQAGDAKQDEDAVAHAARAVEWMRRIDGLMNLRPDRRLETWMQATRAYAKTDDEAT 664
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
Y+ NAR IT W L DY ++ WSGL+RDYY R +F ES +G
Sbjct: 665 FYDENARLLITTW------GWPELSDYASRVWSGLIRDYYAARWEAWF----ESRHTGRS 714
Query: 491 FRLKDWRREWI 501
F L W++ W+
Sbjct: 715 FSLDLWQQTWL 725
>gi|423345423|ref|ZP_17323112.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
CL03T12C32]
gi|409223209|gb|EKN16146.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
CL03T12C32]
Length = 752
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 168/584 (28%), Positives = 250/584 (42%), Gaps = 102/584 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL K+I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250
Query: 61 QLGNWFSVKSDPRWC---CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF IGR F+E++ K YG +Y D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPVD+PEY+ ++G AI+ D +++W MQ W R P +KA VP L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL------REPIVKA----VPKENL 351
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-------------------- 217
++LDL +G P + LHNF G I ++G
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408
Query: 218 -----ILDSIAFGPV--------EARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
++SI PV E W+ +Y+ RRYG+ AW L
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
Y T+ R I+A P++ G G P
Sbjct: 469 GPYRPGTNGTE--RSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
YS V++A L + L S+ YR+D++D+ RQ ++ + +A++ D
Sbjct: 505 -YSPLSVVQAEGLLLKDAARLEDSDPYRFDIVDIQRQLMSNLGQVIHKQAAKAFRKKDKE 563
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
S RFLE++ D D LL F WL A+ N E++ +E +A +T+W
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEKDLFEKDATALVTVW- 622
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW----RREW 500
+ L+ DY + W+GL+ YY R ++ + + L++G + KD RE
Sbjct: 623 --GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDHLDAGTNYSEKDLPQTHGRES 680
Query: 501 IKLTN------DWQ-----NGRNVYPVESNGDALITSQWLYNKY 533
+ + DW+ V + GD + T+ LY KY
Sbjct: 681 FRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRLYKKY 724
>gi|410097657|ref|ZP_11292638.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
CL02T12C30]
gi|409223747|gb|EKN16682.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
CL02T12C30]
Length = 740
Score = 218 bits (555), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 160/584 (27%), Positives = 262/584 (44%), Gaps = 99/584 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP+SW+D + L K+++ R ELGM P+ FSG VP + FP AKI
Sbjct: 195 MPNIESFGGPLPKSWIDSHIALGKQVVNRQLELGMTPIQQGFSGAVPRKMMEKFPEAKIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ +W+ + C LD DPLF E+G+ F+E++ K YG T +Y D F E+ PP
Sbjct: 255 KQPDWYGFEG----ICQ--LDPLDPLFTELGKTFLEEEQKLYG-TYGLYAADPFHESKPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD+PEY++++G++I+ M++ D DA+W+MQ W F D + + VP L+VL
Sbjct: 308 VDTPEYLNAVGSSIHKLMKTFDPDALWVMQAWSFRKD----------IASVVPKHDLLVL 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG----------------------- 217
L + F ++ LHNF G + ++G
Sbjct: 358 SLNGAL----GGEDHFCNHDFVVGNLHNFGGRVNLHGDLPLVSSNQFMKAKQKTPNVVGS 413
Query: 218 --ILDSIAFGPV------EARTSENTTM--AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
++SI PV E +++ W+N+Y+ RRYG A AW +L Y
Sbjct: 414 GLFMESIGQNPVFYELAFEMPVHQDSVKLEEWLNKYAERRYGAFSDAANKAWELLLAGPY 473
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ + +I A P VD G + P ++++++E
Sbjct: 474 RAGTNGVESS-SIICARPAVDVK----KSGPNAGFNIPYDPQSLIEAEVC---------- 518
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ +L S YR+D++D+ RQ ++ E+ EA++ D
Sbjct: 519 ----------LLQDAEQLKGSGPYRFDIVDVQRQIMSNLGQEIHKKAAEAFKKKDKEAFA 568
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
S RFLEL++D+D LL F WL A+ +E+ +E NA + +T+W
Sbjct: 569 LHSGRFLELLKDVDILLRTRTEFNFDQWLTDARAWGTTDEERNLFEKNASSLVTIWGGQV 628
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR---------- 497
DY + W+GL+ YY R ++ + L++G +R +D +
Sbjct: 629 DVRQF---DYSWREWTGLIEGYYLQRWKQFYDMLQGHLDNGTIYREEDAKMDLGRQAFRA 685
Query: 498 REWIKLTNDWQ------NGRNVYPVESNGDALITSQWLYNKYLQ 535
E+ DW+ G+ PV + GD + ++ + +KY Q
Sbjct: 686 NEFYDSLADWELAFVDRPGKARTPV-TEGDEVAVARRMLDKYKQ 728
>gi|261880159|ref|ZP_06006586.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
gi|270333130|gb|EFA43916.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
Length = 772
Score = 218 bits (555), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 155/543 (28%), Positives = 238/543 (43%), Gaps = 98/543 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP +W QQ LQK+IL R ELGM+PVLP + G +P + +T
Sbjct: 193 MNNLEGWGGPLPDAWYAQQEALQKRILKREKELGMSPVLPGYCGMMPHDAKAKL-GLDVT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L ATDP F I + + + YG+ + Y+ D F E+ P
Sbjct: 252 DGGTWNGYTRPAN------LSATDPKFDHIADLYYRELTRLYGKADY-YSMDPFHES--P 302
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D+ + G + + M+ + + W++QGW+ + P ++ ++P G +++L
Sbjct: 303 DDASVDYAEAGRKLLAAMKRANGKSNWVIQGWMENPRP--------QMIEALPEGDIIIL 354
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD------SIAFGPV- 227
DLF+E +P IW + + +++CML NF N+ ++G +D +A P
Sbjct: 355 DLFSECRPMFGAPSIWQRKEGYGRHNWLFCMLENFGANVGLHGRMDQLVHNFKLAASPST 414
Query: 228 ---EART-----------SENTTM---------------------------AWINQYSVR 246
AR SEN + W Y
Sbjct: 415 PYQNARKHLKGIGFTMEGSENNPIMFELMSELVWRANDLVSAERDRRDFKEGWTRNYVKA 474
Query: 247 RYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGK 304
RYG P IQ+AW +L ++YNC G + + + P +D + + K +NY
Sbjct: 475 RYGIDNPKIQEAWQLLIGSIYNCPVGNNQQGPHESIFNGRPSLDNFQVK-SWSKMRNY-- 531
Query: 305 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 364
Y + +RA +L + + +N + YDL+D+ RQA+
Sbjct: 532 ---------------------YDPNVTLRAAQLMTSVADRYRGNNNFEYDLVDIVRQAMD 570
Query: 365 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 424
A +L I Y+ D S RFL ++ D LL F LG +E A+ L+
Sbjct: 571 DQARLQYLRTIADYKGFDRTAFSADSARFLNMLLLQDKLLGTRQEFRLGTRIEQARSLST 630
Query: 425 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
E++ YEWNAR QIT W + T LRDY +K W GLLRD+Y R Y + +
Sbjct: 631 TLEEKNLYEWNARVQITTWGNRTCANEGGLRDYAHKEWQGLLRDFYFMRWHTYLDALSKQ 690
Query: 485 LES 487
+ +
Sbjct: 691 MTA 693
>gi|262406054|ref|ZP_06082604.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294648118|ref|ZP_06725661.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|294806859|ref|ZP_06765684.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345510559|ref|ZP_08790126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|229443271|gb|EEO49062.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|262356929|gb|EEZ06019.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|292636502|gb|EFF54977.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|294445888|gb|EFG14530.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
Length = 718
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 158/551 (28%), Positives = 263/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP A P +
Sbjct: 197 MGNLNKWDGPLSDTWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLTEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVVPDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+ + +EL +S YR DLI+ LA A + ++ N +
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYLAAKAENFYKQALKDDSENRVFAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVEFARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697
Query: 510 GRNVY--PVES 518
+ PV++
Sbjct: 698 STTPFDDPVKA 708
>gi|329962235|ref|ZP_08300241.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
gi|328530343|gb|EGF57220.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
Length = 726
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 152/517 (29%), Positives = 246/517 (47%), Gaps = 93/517 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP SW + ++ LQK+IL R+ E G++PVLP +SG +P + ++
Sbjct: 188 MNNLEGWGGPNTDSWYEDRIALQKRILKRMREYGIHPVLPGYSGMLPHNAKEKL-GVNVS 246
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
G W C Y L TD F EI + E+ + YG+ + Y+ D F E
Sbjct: 247 DPGTW----------CGYNRPAFLQPTDTRFGEIAALYYEEMNRLYGK-ADFYSMDPFHE 295
Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ + G AI+ M+ ++VW++Q W + P+ + ++ +VP G
Sbjct: 296 GGKVAGVN--LDAAGQAIWQAMKKNSRNSVWVVQAWGAN-------PRAQ-MIKNVPRGD 345
Query: 177 LVVLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSI------- 222
++VLDL++E +P W + F G +++CML N+ GN+ ++G + +
Sbjct: 346 MLVLDLYSESRPQWGEPESSWYRENGFDGHQWLYCMLLNYGGNVGLHGKMQHVIDAYYKA 405
Query: 223 ---AFG------PVEARTSENTTM------------------AWINQYSVRRYGRSVPAI 255
+FG + SEN + W+ Y RYG+ P +
Sbjct: 406 SRSSFGNTLKGVGMTMEGSENNPVMYELLCELPWRPSTFSKDEWLEGYIAARYGKCTPRL 465
Query: 256 QDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
++AW +L +++YNC +T + + + A P + + +A
Sbjct: 466 REAWVLLGNSIYNCPPRSTQQGTHESIFCARPSLK------------------AYQASSW 507
Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
SE S Y P +VIRA LF+ + ++ + YDL+D+TRQA+A+ ++
Sbjct: 508 SEMSDYYRPQ------DVIRAAGLFLEEAGQFKGNDNFEYDLVDITRQAVAEKGRLIYKV 561
Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 433
I +Y+ D + Q S RFLEL+ D LLA F +G W+E A+ L ++ E
Sbjct: 562 IQASYEAGDKPLLRQASDRFLELLLLQDRLLATRPEFKVGRWIEQARNLGHTPAEKDWLE 621
Query: 434 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
WNAR QIT W + T + LRDY +K W+GLL+D+Y
Sbjct: 622 WNARVQITTWGNRTASDRGGLRDYAHKEWNGLLKDFY 658
>gi|422873453|ref|ZP_16919938.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens F262]
gi|380305838|gb|EIA18115.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens F262]
Length = 2104
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 167/563 (29%), Positives = 260/563 (46%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GNLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604
Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
NT MAW IN Y RRYG++ I DAWN++ T
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY DA
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNRDAEKF 761
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900
>gi|423214208|ref|ZP_17200736.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693153|gb|EIY86388.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
CL03T12C04]
Length = 718
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 158/551 (28%), Positives = 263/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP A P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNAMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADM-----GWSSDSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVVPDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+ + +EL +S YR DLI+ +A A + ++ N +
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697
Query: 510 GRNVY--PVES 518
+ PVE+
Sbjct: 698 STTPFDDPVEA 708
>gi|429766730|ref|ZP_19298977.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
gi|429183354|gb|EKY24416.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
Length = 2284
Score = 216 bits (549), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 159/563 (28%), Positives = 256/563 (45%), Gaps = 83/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP +W + ++ L +++ R+ LG+ PVL +SG VP Q P A+I
Sbjct: 378 MQNMTSFGGPLPDNWFEDRVELGRQLHERMQTLGIKPVLQGYSGMVPLDFQKKNPDAQIL 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ D F E+ F E+Q + YG + Y D F E NT
Sbjct: 438 SQGGWCGFDR-PNMLKTYVNDGERDYFQEVADVFYEKQKEVYGDITDYYAVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+DS + + I M D DA+W++Q W + D ++ L N + +
Sbjct: 497 GGMDS----ARIYGTIQDKMIEHDEDAIWVIQHWQGNPDN----TKLSGLTNK---EQAL 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT- 236
+LDL +++ P + T +P++W MLHNF G + + G ++++A EA T+EN
Sbjct: 546 ILDLNSDLNPDY-TRFDNQDIPWVWNMLHNFGGRMGLDGQVETVATSITEALATTENMKG 604
Query: 237 ------------------------------MAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
W+N Y RRYG +AW +L T
Sbjct: 605 IGITPEALANSPIVYELMGDMIWTRDPINYREWVNNYIERRYGAVNEDAIEAWEILLETA 664
Query: 267 YNCTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
Y +D GA + SII+ ++ A + S++ H
Sbjct: 665 YKTSDYYYQGAAE--------------SIIN-------------ARPATSINSASTWGHS 697
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
+ Y E+ RA+ELFI+ +EL S+ + YD +D+T+Q LA A E ++ AY D
Sbjct: 698 KISYDKKELERAMELFISCYDELKDSDAFVYDFLDVTKQVLANSAQEYHKEMVAAYNSGD 757
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQI 440
A ++S FL+L+ + +L+ FL+G W+E ++ + + + + +E+NAR I
Sbjct: 758 AEKFERISEHFLDLIRLQERVLSTSPEFLVGTWIEQSRTMLADADDWTKDLFEFNARALI 817
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
T W D L+DY N+ W+GL D Y R ++ + LE+G DW +
Sbjct: 818 TTWGDYKN---GSLKDYSNRQWAGLTEDLYLKRWEMWIDGIRTELETGVTAPSIDWHKVE 874
Query: 501 IKLTNDWQNGRNVYPVESNGDAL 523
+ + + N YP E +G+ L
Sbjct: 875 YEWATEKTDESNAYPTEGSGEDL 897
>gi|326934230|ref|XP_003213195.1| PREDICTED: hypothetical protein LOC100549752 [Meleagris gallopavo]
Length = 650
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 143/218 (65%), Gaps = 7/218 (3%)
Query: 33 LGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 92
LGM VLPAF+G+VP + VFP T+LGNW D C YLL +P+F IG
Sbjct: 4 LGMTTVLPAFAGHVPPGVLRVFPRINATRLGNWSHF--DCTLSCAYLLSPEEPMFQVIGT 61
Query: 93 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 152
F+++ +KE+G T HIY+ DTF+E +P P Y++ + A++ M D +A WLMQGW
Sbjct: 62 LFLKELIKEFG-TDHIYSADTFNEMSPLSSDPAYLAGITNAVFRAMTGADPEAQWLMQGW 120
Query: 153 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 211
LF + P FW+PPQ++A+L +VPLG+++VLDLFAE KP++ ++ FYG P+IWCMLHNF G
Sbjct: 121 LFQHQPAFWQPPQVQAVLRAVPLGRMIVLDLFAESKPVYEWTESFYGQPFIWCMLHNFGG 180
Query: 212 NIEMYGILDSIAFGPVEARTSENTTMA---WINQYSVR 246
N ++G +++I GP AR N+TM I Y +R
Sbjct: 181 NHGLFGAVEAINRGPFVARRFPNSTMVLRELIEPYELR 218
>gi|160884066|ref|ZP_02065069.1| hypothetical protein BACOVA_02042 [Bacteroides ovatus ATCC 8483]
gi|423291473|ref|ZP_17270321.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
CL02T12C04]
gi|156110408|gb|EDO12153.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392663473|gb|EIY57023.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
CL02T12C04]
Length = 718
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 158/551 (28%), Positives = 263/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP A P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADM-----GWSSDSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMTIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+ + +EL +S YR DLI+ +A A + ++ N +
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWIN 697
Query: 510 GRNVY--PVES 518
+ PVE+
Sbjct: 698 STTPFDDPVEA 708
>gi|295085513|emb|CBK67036.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 718
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 158/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +WST + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWSTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+ + +EL +S YR DLI+ +A A + ++ N +
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697
Query: 510 GRNVY--PVES 518
+ PVE+
Sbjct: 698 STTPFDDPVEA 708
>gi|453081268|gb|EMF09317.1| glycoside hydrolase family 89 protein [Mycosphaerella populorum
SO2202]
Length = 784
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 152/532 (28%), Positives = 253/532 (47%), Gaps = 64/532 (12%)
Query: 8 GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
GG LPQSW+DQQ L + I+ R+ ELGM PVLP F+G VP + ++P+A W
Sbjct: 218 GGDLPQSWIDQQFELNQLIIARMIELGMTPVLPCFTGFVPTQISRLYPNASFVNGSQWNG 277
Query: 68 VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
++ ++ L+ DPLF + ++FI + YG S +Y D ++EN P + Y+
Sbjct: 278 FQA--QYTNVTFLEPFDPLFTTLQKSFISKLDAAYGNVSSVYTLDQYNENDPFSGNVTYL 335
Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEV 186
+ + +++ D +A+W +QGWLF S FW ++KA L V +++LDLF+E
Sbjct: 336 EDVASNTIKSLKAADPEAIWFIQGWLFYSAADFWDEERIKAYLGGVEDKDMLILDLFSES 395
Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVR 246
+P W + ++G P+IWC LH++ GN ++G ++++ P+ A +E +TM I
Sbjct: 396 QPQWQRTNSYFGKPWIWCQLHDYGGNQGLHGQVENVTMNPILALANETSTMVGIGLTMEG 455
Query: 247 RYGRSV---PAIQDAW-----------NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSII 292
+ G + + AW + + Y+C D +D+ +A+ + +I
Sbjct: 456 QEGNEIIYDILLDQAWTPEPIESAGYFDDWVTSRYHCDDAVAGLPQDLYIAWDMMRQTIY 515
Query: 293 SVTEGKYQNYGKPVSKEAVLKS------------ETSSYDHPHLWYSTSEVIRALELFIA 340
+ T+ + EAV KS + + + + Y ++ A + F +
Sbjct: 516 NNTDID--------TAEAVTKSIFELQPNTTGLLDRTGHHSTRILYDPEILVSAWKHFYS 567
Query: 341 SGNE---LSASNTYRYDLIDLTRQALAKYANEL---FLNIIE--AYQLNDAHGVFQLSRR 392
+ E L +YR+DL+D+TRQ LA L F+N+ + + Q R
Sbjct: 568 ASQETPQLWELESYRFDLVDITRQVLANAFYPLYGEFVNMTANSSLPSSSTASAEQTGAR 627
Query: 393 FLELVEDMDGLLACHDG--FLLGPWLESAKQLAQNEEQEKQ------------YEWNART 438
L L+ D+D +L F L W+ SA+ A E YE+NAR
Sbjct: 628 MLSLLLDLDSVLEASGNAHFSLESWIHSARLWAPTETNAADGDNMTAAAIADFYEYNARN 687
Query: 439 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
QIT+W + + DY +K W+GL++ YY PR + + + S S DG
Sbjct: 688 QITLWGPGGE-----ISDYASKQWAGLIKTYYVPRWERFVHFTLNSSTSADG 734
>gi|146300873|ref|YP_001195464.1| alpha-N-acetylglucosaminidase [Flavobacterium johnsoniae UW101]
gi|146155291|gb|ABQ06145.1| Candidate alpha-glycosidase; Glycoside hydrolase family 89
[Flavobacterium johnsoniae UW101]
Length = 723
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 157/548 (28%), Positives = 251/548 (45%), Gaps = 80/548 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLPQ W ++ LQKKIL R+ L M+PV+PAFSG VP A P AKIT
Sbjct: 208 MGNINSLEGPLPQEWFSKKEELQKKILERMRTLDMHPVVPAFSGYVPKAFAEKHPEAKIT 267
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W S + T+LLD+ DPLF +IG+ FIE K YG+ S+ Y D+F+E PP
Sbjct: 268 ELNSW----SGGGFESTFLLDSKDPLFKKIGKRFIEIYTKMYGK-SNFYLADSFNEIEPP 322
Query: 121 V---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V + E +++ G+AIY ++ AVW+MQGWLF + FW A L+ VP +
Sbjct: 323 VTEHNKYEELANYGSAIYETIEEAAPGAVWVMQGWLFGDNKNFWTKEATSAFLSKVPNDR 382
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL----------------- 219
L+V D + +W + FYG + + +HN+ G+ +YG L
Sbjct: 383 LMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKNELVSLLKNPHRG 442
Query: 220 DSIAFGPVEA-----------------RTSENTTMAWINQYSVRRY-GRSVPAIQDAWNV 261
+ + +G + E + W+ Y RY ++ ++ AW +
Sbjct: 443 NVVGYGAMPEGLNNNAIVYEFIYDLPWSKGEQSVKDWLTNYLNARYEQKTSDSVFKAWEL 502
Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
L +VY+ T D A+ ++TE K G P K+
Sbjct: 503 LLESVYSTKYWETRWWNDRAGAYLLFKRPTATITEFK----GNPGDKD------------ 546
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
++ AL++ A + N +YDLID +R + +E + ++AYQ
Sbjct: 547 --------KLKEALDILKAEAKKYDKKNFIQYDLIDASRHYYSLSIDEDLVECVKAYQQK 598
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
D QL ++ + V ++D ++ L W++SA + E K Y NA+T IT
Sbjct: 599 DITKGDQLFKKIEKQVLEIDKSMSGQPLNSLNYWVKSASEYGSTPEVSKLYVKNAKTLIT 658
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL-------ESGDGFRLK 494
+W L DY ++ W G+ + +Y PR ++ ++ E+ + +K
Sbjct: 659 LWGGEGH-----LNDYASRSWQGMYKGFYWPRWKMFLTAFKKTAVNNTPFDETKEREEIK 713
Query: 495 DWRREWIK 502
+W +W K
Sbjct: 714 NWEIKWTK 721
>gi|293371915|ref|ZP_06618319.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|292633161|gb|EFF51738.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 718
Score = 214 bits (545), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 157/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVISDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+ + +EL +S YR DLI+ +A A + ++ N +
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697
Query: 510 GRNVY--PVES 518
+ PVE+
Sbjct: 698 STTPFDDPVEA 708
>gi|336404352|ref|ZP_08585050.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
gi|335943680|gb|EGN05519.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
Length = 718
Score = 214 bits (545), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 157/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVISDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+ + +EL +S YR DLI+ +A A + ++ N +
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697
Query: 510 GRNVY--PVES 518
+ PVE+
Sbjct: 698 STTPFDDPVEA 708
>gi|299144719|ref|ZP_07037787.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298515210|gb|EFI39091.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 718
Score = 214 bits (545), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 157/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKADKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADM-----GWSSDSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVVPDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+ + +EL +S YR DLI+ +A A + ++ N +
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697
Query: 510 GRNVY--PVES 518
+ PVE+
Sbjct: 698 STTPFDDPVEA 708
>gi|422345314|ref|ZP_16426228.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
WAL-14572]
gi|373228039|gb|EHP50349.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
WAL-14572]
Length = 1842
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 165/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604
Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
NT MAW IN Y RRYG++ I DAWN++ T
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900
>gi|298480124|ref|ZP_06998323.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|298273933|gb|EFI15495.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
Length = 718
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 157/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADM-----GWSSDSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+ + +EL +S YR DLI+ +A A + ++ N +
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697
Query: 510 GRNVY--PVES 518
+ PVE+
Sbjct: 698 STTPFDDPVEA 708
>gi|237719039|ref|ZP_04549520.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451817|gb|EEO57608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 718
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 156/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADM-----GWSSDSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+++ +EL S YR DLI+ +A A + ++ N +
Sbjct: 535 DYLQAIRLYVSCADELKGSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDKLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697
Query: 510 GRNVY--PVES 518
+ PV++
Sbjct: 698 STTPFDDPVKA 708
>gi|168216263|ref|ZP_02641888.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens NCTC 8239]
gi|182381741|gb|EDT79220.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens NCTC 8239]
Length = 2104
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 165/563 (29%), Positives = 260/563 (46%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEEEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L G+ +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKGQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604
Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
NT MAW IN Y RRYG++ I +AWN++ T
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTA 664
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900
>gi|168207628|ref|ZP_02633633.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens E str. JGS1987]
gi|170661027|gb|EDT13710.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens E str. JGS1987]
Length = 2104
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 165/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQEEVFGEVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604
Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
NT MAW IN Y RRYG++ I DAWN++ T
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900
>gi|182624959|ref|ZP_02952737.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens D str. JGS1721]
gi|177909756|gb|EDT72174.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens D str. JGS1721]
Length = 2104
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 165/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604
Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
NT MAW IN Y RRYG++ I DAWN++ T
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900
>gi|423293381|ref|ZP_17271508.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
CL03T12C18]
gi|392678324|gb|EIY71732.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
CL03T12C18]
Length = 718
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 157/551 (28%), Positives = 262/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPITPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+ + +EL +S YR DLI+ +A A + ++ N +
Sbjct: 535 DYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWIN 697
Query: 510 GRNVY--PVES 518
+ PVE+
Sbjct: 698 STTPFDDPVEA 708
>gi|194695302|gb|ACF81735.1| unknown [Zea mays]
Length = 173
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 100/152 (65%), Positives = 119/152 (78%)
Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 448
L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITMWFDNT+
Sbjct: 7 LCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTE 66
Query: 449 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQ 508
+ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+ F LK+WRREWI LTN+WQ
Sbjct: 67 TKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWISLTNNWQ 126
Query: 509 NGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
+ R V+ + GD L SQ LY KYL +
Sbjct: 127 SDRKVFSTTATGDPLNISQSLYTKYLSNADLL 158
>gi|169346867|ref|ZP_02865815.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens C str. JGS1495]
gi|169296926|gb|EDS79050.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens C str. JGS1495]
Length = 2104
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 165/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604
Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
NT MAW IN Y RRYG++ I DAWN++ T
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKME 877
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900
>gi|18309848|ref|NP_561782.1| alpha-N-acetylglucosaminidase [Clostridium perfringens str. 13]
gi|18144526|dbj|BAB80572.1| probable alpha-N-acetylglucosaminidase [Clostridium perfringens
str. 13]
gi|288872041|dbj|BAI70446.1| alpha-N-acetylglucosaminidase [Clostridium perfringens]
Length = 2104
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 165/563 (29%), Positives = 258/563 (45%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F + F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604
Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
NT MAW IN Y RRYG++ I DAWN++ T
Sbjct: 605 IGITPEAINTNPLAHELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 821
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKME 877
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900
>gi|168212494|ref|ZP_02638119.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens CPE str. F4969]
gi|170716100|gb|EDT28282.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens CPE str. F4969]
Length = 2104
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 165/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604
Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
NT MAW IN Y RRYG++ I DAWN++ T
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDADDWTKDLFEFNARALVTTWG 821
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 822 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900
>gi|320106778|ref|YP_004182368.1| alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
gi|319925299|gb|ADV82374.1| Alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
Length = 754
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 165/551 (29%), Positives = 243/551 (44%), Gaps = 96/551 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLP+ +++Q+ VLQ+KIL R+ LGM PV PAFSG VP + + P A+
Sbjct: 220 MGNVNNIDGPLPEHFIEQKRVLQRKILDRMRSLGMRPVAPAFSGFVPQGFKRLHPKAETF 279
Query: 61 QLGNWF--SVKSDPRWCCTYLLDATDP-LFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
L W K+ PR T++L + L+ IG+ FIE+ EYG + Y DTF+E
Sbjct: 280 TL-LWLPEEFKTIPRSTRTFILHPGEQDLYRLIGKKFIEEYKAEYGEVQY-YLADTFNEL 337
Query: 118 TPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
PV E + G +Y G+ +GD + W+MQGWLF YD FW + ALL +P
Sbjct: 338 AVPVREEHRFEDLERFGRTVYEGILAGDPNGTWVMQGWLFVYDVAFWNSESVAALLRGIP 397
Query: 174 LGKLVVLDLFAEVKPI---------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
+++++D ++ P W T K F+G +I M H F GN + G L +A
Sbjct: 398 NDRMLIIDYANDLAPAVKGKYAPGQWKTQKAFFGKQWINGMAHTFGGNNNVKGNLKLMAS 457
Query: 225 GPVEARTS-----------------------ENTTMA-----------WINQYSVRRYGR 250
P TS E T A WI Y RYG
Sbjct: 458 EPASVLTSPERGNLVGWGMCPEGIETNEVVYELMTDAGWQREAIDLKQWIPAYCRSRYGA 517
Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
P + +AW +L + Y+ T + P + P+ SV G P +
Sbjct: 518 CPPVMLEAWTLLMQSAYSAHIWMTHQAWQTE---PSLAPAAASVDAG-------PTFR-- 565
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
RA+ LF++ EL YR DLI+L QA ++
Sbjct: 566 ----------------------RAVALFLSCAPELGQKELYRNDLIELVVQAAGGSVDQT 603
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
F ++A Q + + + L + MD LL L W+++A+ A+++++
Sbjct: 604 FSLAVQAGQSHQNEVATEYAAHALGWMGRMDALLNLRPDRRLETWMQAARSYAKSDDEAA 663
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
Y+ NAR IT W L DY ++ WSGL RDYY R +F SL +G
Sbjct: 664 YYDENARRLITTW------GWPELSDYASRAWSGLTRDYYASRWEAWFA----SLHAGRP 713
Query: 491 FRLKDWRREWI 501
F L W++ W+
Sbjct: 714 FSLDIWQQTWL 724
>gi|383115207|ref|ZP_09935965.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
gi|313695376|gb|EFS32211.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
Length = 718
Score = 212 bits (539), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 156/551 (28%), Positives = 261/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVVPDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+ + +EL S YR DLI+ +A A + ++ N +
Sbjct: 535 DYLQAIRLYASCADELKGSELYRNDLIEFVSYYVAAKAENFYKQALKDDSENRVLAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697
Query: 510 GRNVY--PVES 518
+ PV++
Sbjct: 698 STTPFDDPVKA 708
>gi|440792549|gb|ELR13759.1| peptidase, S8/S53 subfamily protein [Acanthamoeba castellanii str.
Neff]
Length = 981
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 146/531 (27%), Positives = 236/531 (44%), Gaps = 104/531 (19%)
Query: 50 LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIY 109
++ ++P+A +T+ +W ++ Y L D L+ IG I +E+G T HIY
Sbjct: 434 IKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG-TDHIY 490
Query: 110 NCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALL 169
N DTF+E +PP P Y+++ A+Y GM + D A+W+MQGW F +DPFW ++KA L
Sbjct: 491 NADTFNEMSPPSADPTYLAAASRAVYEGMATQDPQALWVMQGWSFVFDPFWTKDRIKAYL 550
Query: 170 NSVPLGKLVVLDLFAEVKPIWSTSKQF----YGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
+ V +++LDL ++ P W+ + QF +G ++WCMLHN G +YG L +
Sbjct: 551 SGVDNSDMLILDLASDNSPEWNKTGQFRDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSD 610
Query: 226 PVEARTSENTTMA---------------------------------WINQYSVRRYGRSV 252
P+ A + TM W+ +Y+ RRYG +
Sbjct: 611 PLIALATPGNTMVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAT 670
Query: 253 PA--IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
+ + +AW +L YN + +D + +G +
Sbjct: 671 GSSPVGEAWELLREATYNQS---------------GLDAGLFGFAPALGMGHGGTSN--- 712
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNE--LSASNTYRYDLIDLTRQALAKYAN 368
+T EV AL LF+ S + + ++YD +DLTRQ LA N
Sbjct: 713 ----------------ATKEV-EALRLFLQSAQTEGYAPNGPWQYDCVDLTRQVLANTFN 755
Query: 369 ELFLNIIEAY-----QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 423
+++ + AY +D L+ L ++ D+D LLA + +LLG W++ A A
Sbjct: 756 DVYSQLDAAYTSYATNKSDTLPFLPLAAELLGIISDLDRLLATNPNYLLGTWIKDAVSWA 815
Query: 424 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
EQ Y++NAR QIT+W + Q + DY K+W+GLL ++
Sbjct: 816 SIPEQALHYQFNARNQITLWGPDGQ-----ISDYATKHWAGLL---------------MK 855
Query: 484 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
++ +G F + E ++L W YP GD L + + KYL
Sbjct: 856 AVGAGVMFNSTAYGTELLQLEQKWNQENTTYPTTPTGDTLQVALRISQKYL 906
>gi|168209163|ref|ZP_02634788.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens B str. ATCC 3626]
gi|170712640|gb|EDT24822.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens B str. ATCC 3626]
Length = 2104
Score = 211 bits (538), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 165/563 (29%), Positives = 257/563 (45%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F + F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 604
Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
NT MAW IN Y RRYG++ I DAWN++ T
Sbjct: 605 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILDTA 664
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 665 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 701
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 702 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 761
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 762 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDADDWTKDLFEFNARALVTTWG 821
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 822 SRNNANGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFKME 877
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 878 YDWVNKKSDTDKLYPTEASNENL 900
>gi|110801838|ref|YP_698175.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens SM101]
gi|110682339|gb|ABG85709.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens SM101]
Length = 2095
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 166/563 (29%), Positives = 259/563 (46%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 369 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 428
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F + F E+Q + +G ++ Y D F E NT
Sbjct: 429 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 487
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 488 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 536
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE
Sbjct: 537 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 595
Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
NT MAW IN Y RRYG++ I +AWN++ T
Sbjct: 596 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNEEILEAWNIILDTA 655
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 656 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 692
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY DA
Sbjct: 693 DKSEFEKAIEIFSKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNRDAEKF 752
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 753 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDADDWTKDLFEFNARALVTTWG 812
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 813 SRNNADGGGLKDYSNRQWSGLTGDYYYARWEKWINGLQIELDGG----AKAPNIDWFKME 868
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 869 YDWVNKKSDTDKLYPTEASNENL 891
>gi|440731409|ref|ZP_20911430.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
gi|440373101|gb|ELQ09870.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
Length = 732
Score = 211 bits (537), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 165/591 (27%), Positives = 247/591 (41%), Gaps = 110/591 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ + PLPQ W++ + LQ++IL R+ LGM PVLPAFSG VP A P A+I
Sbjct: 168 MGNIEAYDAPLPQQWIEDKYALQQRILQRMRTLGMKPVLPAFSGYVPKAFAQAHPQARIY 227
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I + FI+ + YG+ ++ Y D F+E PP
Sbjct: 228 RMRAWEGFHE------TYWLDPADPLFTKIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 280
Query: 121 VDS----------------------PEY--------ISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + PE ++ G A+Y + + DAVW+MQ
Sbjct: 281 IAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 340
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 341 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 400
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENT-------TMAWINQ-- 242
+ G+ +YG L + FG +N+ T+AW Q
Sbjct: 401 YGGSNPVYGDLAFYRDDLRALLADKDKQQLVGFGAFPEGLHDNSVVYEYMYTLAWGGQQR 460
Query: 243 --------YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
Y+ RYG + PA++ AW+ L V + P
Sbjct: 461 SLQDWLGDYTRARYGHTSPALRAAWDDLQAAVLSTRYWT---------------PRWWRS 505
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + + E + D P L RAL+ +A E + + YRYD
Sbjct: 506 RAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLYRYD 556
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
L+D R + + AY+ D R V+ +DGL+ +L
Sbjct: 557 LVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDAAFARVQAAVQQLDGLVGGQQE-ILSS 615
Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
WL A+ A+ + Y +A+ QI++W L DY +K W G+ DYY PR
Sbjct: 616 WLGDAEGDAKTPQDAAYYRRDAKAQISVWGGEGN-----LGDYASKAWQGMYADYYLPRW 670
Query: 475 AIYFKYMIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVES 518
A+ + + + SG RL+ W R+W+ + PV +
Sbjct: 671 ALAMQALRAAAVSGGSVDEAALQQRLRVWERDWVACETPYTRRAPADPVAA 721
>gi|170292392|pdb|2VC9|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With 2-Acetamido-1,2-Dideoxynojirmycin
gi|170292393|pdb|2VCA|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With Beta-N-Acetyl-D-Glucosamine
gi|170292394|pdb|2VCB|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With Pugnac
gi|170292395|pdb|2VCC|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
Length = 891
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 161/563 (28%), Positives = 257/563 (45%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + A+
Sbjct: 344 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 403
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 404 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 462
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 463 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 511
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTT- 236
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+
Sbjct: 512 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 570
Query: 237 --------------------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
MAW IN Y RRYG++ I +AWN++ T
Sbjct: 571 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTA 630
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 631 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 667
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 668 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 727
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 728 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 787
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 788 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKME 843
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 844 YDWVNKKSDTDKLYPTEASNENL 866
>gi|154321596|ref|XP_001560113.1| hypothetical protein BC1G_00945 [Botryotinia fuckeliana B05.10]
Length = 701
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 150/506 (29%), Positives = 243/506 (48%), Gaps = 81/506 (16%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG +P +W++ Q +LQKKI+ R+ ELG+ PVLPAF+G VP L+ V P+A I
Sbjct: 185 FGNIQGSWGGTIPLAWIEDQHLLQKKIVQRMVELGITPVLPAFTGFVPRDLRRVAPNANI 244
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
+W ++ T+L DPLF + F+ Q + YG SHIY D F+EN P
Sbjct: 245 INGSDWGNLFPFEYSNDTFLY-PIDPLFKTLQHTFLSLQSEYYGNVSHIYTLDQFNENLP 303
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-L 177
P Y+ ++ Y +QS DS+A W++QGWLF + FW +++A L VP + +
Sbjct: 304 ASGDPLYLGNISRGTYDSLQSFDSNATWMLQGWLFYAASSFWTQDRVEAYLGGVPKNESM 363
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT 236
++LDLF+E P W + Q+YG P+IWC LH + G +YG + +I +EA R SE
Sbjct: 364 LILDLFSESFPEWENTHQYYGKPWIWCQLHGYGGTPGIYGQIYNITNSSIEAFRNSEKMV 423
Query: 237 MAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE 296
+ N +G + + ++ P+I
Sbjct: 424 ----------------------------GMGNTMEGQDGNGLILPKSLLEMQPNIT---- 451
Query: 297 GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI---ASGNELSASNTYRY 353
+N+G+ + +S T +P +++ RA L S EL N +++
Sbjct: 452 ---ENHGR------LGQSLTIDLFNP------ADLFRAWGLLYNASISVPELWYDNGWKF 496
Query: 354 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV-EDMDGLLACHDGFLL 412
D++D+TRQ L++ ++++IE Y A F+ + L ++ ++D +L+ F L
Sbjct: 497 DMVDVTRQVLSERFKLEYVDLIEKYT---AEIDFEATSENLSMILRELDDILSTSPHFRL 553
Query: 413 GPWLESAKQLAQNE-----------------EQEKQYEWNARTQITMWFDNTQEEASLLR 455
W+ +A + N + + + +NA QIT+W Q +
Sbjct: 554 DTWINAAIASSPNSSTYPIPSSDGSSELNITQTQHLFAYNAINQITIWGPTGQ-----IN 608
Query: 456 DYGNKYWSGLLRDYYGPRAAIYFKYM 481
DY +K W GL+R YY R I+ Y+
Sbjct: 609 DYASKSWGGLVRGYYLKRWEIFLDYI 634
>gi|322703040|gb|EFY94656.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
ARSEF 23]
Length = 774
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 151/541 (27%), Positives = 255/541 (47%), Gaps = 84/541 (15%)
Query: 5 HGWGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 62
WGG LP ++++QQ LQK+I+ R+ ELG+ PVLPAF G VP +++ V P+A +T
Sbjct: 209 RSWGGKGDLPLAFIEQQFELQKQIVTRMVELGITPVLPAFPGFVPESIKKVRPNANLTVS 268
Query: 63 GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 122
NWF+ D ++ LD D + E+ + F+ +Q+ +G +++Y D F+E +P
Sbjct: 269 PNWFAPAPD-KYTRDLFLDPLDDTYAELQKLFVTKQIDAFGNVTNVYTLDQFNELSPASG 327
Query: 123 SPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGK-LVVL 180
Y+ + Y+G+ + + AVWL+QGWL FS FW P++ A L V + ++VL
Sbjct: 328 DTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSSRNFWTQPRIDAYLGGVEDHQGMLVL 387
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT----- 235
DL++EV P W + + G P+IWC LH+F GN+ + G + ++ P++A +
Sbjct: 388 DLYSEVNPQWQRTNSYSGKPWIWCQLHDFGGNMALEGRVQTLTSAPIDALAQSKSLVGFG 447
Query: 236 ---------------------------TMAWINQYSVRRYG--RSVPA-IQDAWNVLYHT 265
T A+ + +RY S+P+ + AW +L
Sbjct: 448 LTPEAYEGNEVVYDILLDQAWSATPLDTQAYFASWVTKRYAGISSIPSELYRAWEILRTD 507
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VY+ T TD + V VA + P++ + + T + HP
Sbjct: 508 VYSNTR--TDIPQ-VPVATYQLRPALSGIA------------------NRTGHFPHPTAL 546
Query: 326 YSTSEVIRA-----LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
+ V++ LE G+ L ++ D +D++RQ L+ + L+ +++ AY+
Sbjct: 547 HYDPLVLQGVWKLMLEALTRQGS-LWKVPAFQLDFVDVSRQMLSNQFDVLYADLVNAYKC 605
Query: 381 NDAHG-------------VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 427
+ G V R L L+ +D L F L W+++A +
Sbjct: 606 STGAGGSRELRSNTPNCDVKAAGARLLFLLSTLDLTLLTSRHFALQSWVDAASAWGKAAG 665
Query: 428 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
E + +NAR+Q+T+W Q A+ L DY K W GL+ YY R +I+ ++ + S
Sbjct: 666 NEDLFTFNARSQVTVW----QVNATNLNDYAAKAWGGLVGSYYKGRWSIFVDALVAASSS 721
Query: 488 G 488
G
Sbjct: 722 G 722
>gi|336412611|ref|ZP_08592964.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
3_8_47FAA]
gi|335942657|gb|EGN04499.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
3_8_47FAA]
Length = 718
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 156/551 (28%), Positives = 261/551 (47%), Gaps = 72/551 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
V+A + N+ ++ +G + +++ V+Y + + + D I D
Sbjct: 432 VKALRA-------ANKGNLIGFGSAPEGLENN-EVVYELLADMGWSS-----DSI----D 474
Query: 287 VDPSIISVTEGKYQNYGKPVSKEAVLKSET---SSYDHPHLWYST--------------S 329
+D + E +Y Y + + L +T S Y +P + T
Sbjct: 475 LDDWMKIYCEARYGGYPDAMEEAWKLFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDLSD 534
Query: 330 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 389
+ ++A+ L+ + +EL S YR DLI+ +A A + ++ N +
Sbjct: 535 DYLQAIRLYASCADELKNSELYRNDLIEFVSYYVAAKAEIFYKQALKDDSENRVLAAQRN 594
Query: 390 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 449
++ ++L+ D+D LLA H + L W+E A+ +++ YE NA+ IT W
Sbjct: 595 LQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAYEANAKRLITSW------ 648
Query: 450 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 509
+ DY ++WSGL++DYY PR +YF + D ++++W +WI T+ W N
Sbjct: 649 -GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNKIREWEEQWI--TSPWSN 697
Query: 510 GRNVY--PVES 518
+ PV++
Sbjct: 698 STTPFDDPVKA 708
>gi|380512475|ref|ZP_09855882.1| N-acetylglucosaminidase [Xanthomonas sacchari NCPPB 4393]
Length = 785
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 158/574 (27%), Positives = 242/574 (42%), Gaps = 110/574 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W++ + LQ +IL R+ LGM PVLPAF+G VP A P A+I
Sbjct: 221 MGNIEGYDAPLPQQWIEDKHALQLRILQRMRALGMKPVLPAFAGYVPKAFAQAHPQARIY 280
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE---- 116
++ W TY LD DPLF +I + FI+ + YG+ ++ Y D F+E
Sbjct: 281 RMRAWEGFHE------TYWLDPADPLFAQIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 333
Query: 117 --------------------------NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
PPV + +++ G A+Y+ + + DAVW+MQ
Sbjct: 334 IAADGSDARLASYGDSTANTAKTKPPEVPPVQRDKRLAAYGRALYASIHRANPDAVWVMQ 393
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 394 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 453
Query: 209 FAGNIEMYGIL-----------------DSIAFG--PVEARTS---------------EN 234
+ G+ +YG L + FG P T+ +
Sbjct: 454 YGGSNPVYGDLAFYREDLRALLADKDKQQLVGFGAFPEGLHTTSVVYEYMYALAWGAQQR 513
Query: 235 TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W++ Y+ RYG + PA++ AW+ L +V + P
Sbjct: 514 PLQDWLDDYTRARYGHTSPALRAAWDDLQASVLSTRYWT---------------PRWWRS 558
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + + E + D P L RAL+ +A E + + YRYD
Sbjct: 559 RAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALQQLLALAPEYADAPLYRYD 609
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
L+D R + + AY+ D + R E V +D L+ L
Sbjct: 610 LVDFARHYATGRVDVQLQQAVAAYRRGDVAAGDAATARVREAVTQLDSLVGGQQD-TLSS 668
Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
WL++A A + Y +A+ Q+++W L DY +K W G+ DYY PR
Sbjct: 669 WLDAAAGYATTPQDAAYYRRDAKAQVSVWGGEGN-----LGDYASKAWQGMYADYYLPRW 723
Query: 475 AIYFKYMIESLESGDGF-------RLKDWRREWI 501
+ + + E+ +G RL+ W R+W+
Sbjct: 724 TLALQMLSEAAVAGGSVDEAQLQQRLRAWERDWV 757
>gi|294674521|ref|YP_003575137.1| alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
gi|294472030|gb|ADE81419.1| putative alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
Length = 754
Score = 209 bits (533), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 153/532 (28%), Positives = 237/532 (44%), Gaps = 87/532 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW +Q LQK+IL R+ +LGM+PVLP + G VP + +
Sbjct: 186 MNNLEGWGGPLPTSWYARQEKLQKQILARMKQLGMHPVLPGYCGMVPHDAKEKL-GLNVA 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
G W + L TD F EI + + K +G+ + Y+ D F E+
Sbjct: 245 DAGLWNGFQRPAN------LLPTDARFSEIATLYYNELTKLFGKADY-YSMDPFHESNDD 297
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
P +D + G A+ M+ + AVW++QGW + +P +A+++ + G L+
Sbjct: 298 PNID----YAKAGQAMMQAMKRVNPKAVWVIQGW--TENP------REAMVDDMKTGDLL 345
Query: 179 VLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD------------ 220
VLDLF+E +P IW + + +++C+L NF N+ ++G +D
Sbjct: 346 VLDLFSECRPMFGIPSIWKREQGYKQHQWLFCLLENFGANVGLHGRMDQLLDNFYMLQSS 405
Query: 221 ----------SIAF-------GPVEA--------RTSENTTMAWINQYSVRRYGRSVPAI 255
I F PV R + T W+ Y RYG AI
Sbjct: 406 KFQAQSSKLKGIGFTMEGSENNPVMFELMSELPWRPEKFTKEQWVKNYVKARYGVEDEAI 465
Query: 256 QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
+ AW L ++YNC G + SI G+P + +
Sbjct: 466 EKAWLTLAKSIYNCPAGNNQQG---------PHESIFC---------GRPT----LNNFQ 503
Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
SS+ +Y + +A +L + + +N + YDL+D+TRQALA A + I
Sbjct: 504 ASSWSKMKNYYDPAMTKKAAKLMNSVAEKYRGNNNFEYDLVDITRQALADQARLQYQKTI 563
Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 435
Y+ + + RFL+++ D LL F +G W + A E++K YEWN
Sbjct: 564 ADYKAFSRKQFDRDAERFLKMLLLQDKLLGTRTEFRVGHWTQDAVNAGNTAEEKKLYEWN 623
Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 487
AR QIT W + + LRDY +K W GLL+D+Y R YF + +++
Sbjct: 624 ARVQITTWGNRYCADTGGLRDYAHKEWQGLLKDFYYVRWKSYFDALAAQMKA 675
>gi|383280354|pdb|4A4A|A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In
Complex With Its Substrate Glcnac-Alpha-1,4-Galactose
Length = 914
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 160/563 (28%), Positives = 257/563 (45%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + A+
Sbjct: 367 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 426
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F + NT
Sbjct: 427 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHQGGNT 485
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 486 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 534
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTT- 236
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+
Sbjct: 535 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 593
Query: 237 --------------------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
MAW IN Y RRYG++ I +AWN++ T
Sbjct: 594 IGITPQAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTA 653
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 654 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 690
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 691 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 750
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 751 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 810
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 811 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKME 866
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 867 YDWVNKKSDTDKLYPTEASNENL 889
>gi|224027030|ref|ZP_03645396.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
18228]
gi|224020266|gb|EEF78264.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
18228]
Length = 837
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 160/552 (28%), Positives = 233/552 (42%), Gaps = 106/552 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI- 59
M N+ G GPL W QL LQ KIL R+ LGM P+ P F G +P A + ++P I
Sbjct: 193 MGNVSGIDGPLNPDWHAGQLALQHKILDRMRALGMKPICPGFPGFIPEAFKRIYPDLHIV 252
Query: 60 -TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T G F +++ T+PLF +I AFI++ KE+G+ + Y D+F+E
Sbjct: 253 ETHWGGAFH---------NWMISPTEPLFAKISEAFIKEWEKEFGKCDY-YLVDSFNEMD 302
Query: 119 PPVDSP------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNS 171
P E +S G +YS ++ + DAVW+MQGW+F Y W + AL++
Sbjct: 303 IPFPEKGNPARYEMAASYGEKVYSSIKRANKDAVWVMQGWMFGYQRHIWDYETLGALVSR 362
Query: 172 VPLGKLVVLDL-------FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
VP K+++LDL F + W K FY +++ ++ N G M G+LD A
Sbjct: 363 VPDDKMLLLDLAVDYNRHFWHSEVNWEYYKGFYNKQWVYSVIPNMGGKTGMTGVLDFYAN 422
Query: 225 GPVEARTSENT----------------------------------TMAWINQYSVRRYGR 250
G +EA +S N W+ QYS+ RYG+
Sbjct: 423 GHLEALSSSNRGNLVAHGLAPEGIENNEVLYELVTDAGWSDHRMDVRDWLKQYSINRYGK 482
Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
+ + AW+ L +VY F D G +N +S
Sbjct: 483 APAQLMKAWDYLLKSVYG--------------TFTDHPRFNWQFRPGLVKNGSINIS--- 525
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
+ + LE F+A+ EL S Y DL ++T L A L
Sbjct: 526 ------------------DDYFKGLESFVAASEELKDSPYYLTDLCEMTAHYLGSKAEIL 567
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
I + Y L D L RF + MD +L+ H L W+ A + A+ E Q K
Sbjct: 568 TRQIDQEYLLGDTLQAHFLQSRFETFMLGMDRILSQHPTLRLDRWVSFASKAARTEAQRK 627
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
QYE NAR +T+W + DY + WSGL+ YY R Y+K +SG
Sbjct: 628 QYEMNARRIVTVW-------GPPVDDYSARMWSGLVGSYYLGRWKEYYK----GRDSGKS 676
Query: 491 FRLKDWRREWIK 502
L W R+W++
Sbjct: 677 ADLSSWERKWVE 688
>gi|374384144|ref|ZP_09641670.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
12061]
gi|373228751|gb|EHP51054.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
12061]
Length = 835
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 152/549 (27%), Positives = 248/549 (45%), Gaps = 103/549 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GP+P W Q+ LQ KIL R+ LGM P+ PAF+G VP AL+ ++P KI
Sbjct: 192 MGNISQIDGPMPVEWHSDQVELQHKILKRMKLLGMKPICPAFAGFVPLALKRLYPDVKII 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
+ W + ++L + LF IG+ FIE+ KE+G+ + Y D+F+E
Sbjct: 252 ET-TWAGFHN-------WMLSPEEELFTRIGQLFIEEWEKEFGK-NDFYLADSFNEMDVP 302
Query: 119 -PPVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
PP+ + E L G +Y G+++G+ DAVW+MQGW+F Y W ++AL++ VP
Sbjct: 303 FPPIGTKERYDMLAFYGEQVYKGIKAGNPDAVWVMQGWMFGYQRDIWDYETLQALVSKVP 362
Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K+++LDL A+ K +W K F+ +++ ++ N G GIL A G
Sbjct: 363 DDKMMLLDLAADYNKNVWGNGMNWEFYKGFFNKLWVYSVIPNMGGKTGATGILSFYANGH 422
Query: 227 VEA----------------------------------RTSENTTMAWINQYSVRRYGRSV 252
+EA +SE W+ YS+ RYG++
Sbjct: 423 LEALNSPNRGRLFGFGMAPEGTENNEVVYEMICDAGWSSSEIDVKQWLKDYSLCRYGKTC 482
Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
P + + W L +VY F D + + G+ S + +
Sbjct: 483 PEMDEVWEGLCKSVYG--------------TFTDHPRFLWQLRPGR--------SGKGTV 520
Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
++++ Y RA+E +++ S ++ D +++T L L
Sbjct: 521 NTDSNFY-------------RAVEKMAECAPKMTESPLFKADFLEMTAFYLGGKMEALAS 567
Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
I ++Y + ++ ++F EL E +D LL H + L W++ A++ E+ + Y
Sbjct: 568 AIGKSYLYGNTADALKMQQQFEELGEGLDSLLESHPVYRLQRWIDFARKHGDTEKLKDYY 627
Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
E NAR +T+W + DY K WSGL+RDYY PR YF+ E+G +
Sbjct: 628 EMNARRIVTIW-------GPPVSDYACKLWSGLIRDYYLPRWREYFR----CKETGSKYD 676
Query: 493 LKDWRREWI 501
L W +W+
Sbjct: 677 LASWESDWV 685
>gi|110800516|ref|YP_695309.1| alpha-N-acetylglucosaminidase [Clostridium perfringens ATCC 13124]
gi|110675163|gb|ABG84150.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens ATCC 13124]
Length = 2095
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 163/563 (28%), Positives = 258/563 (45%), Gaps = 80/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + A+
Sbjct: 369 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 428
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 429 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 487
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 488 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 536
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE
Sbjct: 537 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 595
Query: 234 --------NTT---------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266
NT MAW IN Y RRYG++ I +AWN++ T
Sbjct: 596 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTA 655
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y K D + SII+ G +G +KS S++ H + Y
Sbjct: 656 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 692
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 693 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 752
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 753 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 812
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 813 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKME 868
Query: 505 NDWQNGRN----VYPVESNGDAL 523
DW N ++ +YP E++ + L
Sbjct: 869 YDWVNKKSDTDKLYPTEASNENL 891
>gi|404487024|ref|ZP_11022211.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
YIT 11860]
gi|404335520|gb|EJZ61989.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
YIT 11860]
Length = 722
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 163/570 (28%), Positives = 244/570 (42%), Gaps = 110/570 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+GW GPL W +Q+ LQ KIL R+ ELGM+P+ PAF+G VP A P +
Sbjct: 200 MGNLNGWDGPLTNGWQKEQIKLQHKILNRMRELGMDPIAPAFAGFVPTAFAERHPEIQFK 259
Query: 61 QLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
L W F K + Y+L P F EIG+ FIE+ KE+G+ ++ Y D+F+E
Sbjct: 260 HL-EWGGFDEKYN-----AYVLPPETPYFKEIGKLFIEEWEKEFGKNTY-YLSDSFNEMK 312
Query: 119 PPV------DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNS 171
PV + ++ G +IY + +G+ DAVW+ QGW F Y FW ++ALL+
Sbjct: 313 LPVAEGDDDGKHKLLAQYGESIYHSIAAGNPDAVWVTQGWTFGYQHDFWDKASLQALLSR 372
Query: 172 VPLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
VP K++++DL + + W FYG +I+ + NF G M G L A
Sbjct: 373 VPDDKMIIIDLGNDYPKWVWGTEQTWKNHDGFYGKKWIFSYVPNFGGKTPMTGDLQMYAT 432
Query: 225 GPVEARTSENTTM----------------------------------AWINQYSVRRYGR 250
EA S N +W+ Y RYG
Sbjct: 433 SSAEALHSANAGNLVGFGSAPEGLENNEVVYELLADMGWTADSIDLDSWLPVYCKARYGG 492
Query: 251 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 310
A+ AW T Y+ +V PD + +SK
Sbjct: 493 CPAAMDSAWQRFKETAYSSLYSYPRFTWQTVV--PDT----------------RRISKLD 534
Query: 311 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 370
V S ++ +ELF++ + L +S Y D I+ LA A++
Sbjct: 535 VSDS----------------FLQGVELFLSCADSLESSPLYVNDAIEYASYYLAAKADDC 578
Query: 371 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 430
+ ++ L + Q R +E++ D+D LLA H + L W++ A+ + + ++
Sbjct: 579 YKRALKEDSLGNRVAAMQQLDRSVEILLDVDKLLASHPLYRLEEWVDMARDWGKTDLEKD 638
Query: 431 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 490
YE NA+ IT W DY ++WSGL++DYY PR +YF S + D
Sbjct: 639 AYEANAKRLITTW-------GGFQEDYAARFWSGLIKDYYIPRMKLYF-----SEQRAD- 685
Query: 491 FRLKDWRREWIKLTNDWQNGRNVY--PVES 518
L W WIK W N + P++S
Sbjct: 686 --LDRWEENWIKAP--WHNTSTSFEDPLQS 711
>gi|187734575|ref|YP_001876687.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
gi|187424627|gb|ACD03906.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
Length = 848
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 157/536 (29%), Positives = 255/536 (47%), Gaps = 69/536 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GPLP SW +Q+ LQ +IL R+ LGM P+ PAFSG VP + ++P AK+
Sbjct: 202 MGNIVNHDGPLPASWHKEQIALQHRILHRMKSLGMTPICPAFSGFVPRGILRLYPEAKLH 261
Query: 61 QLG--NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+LG W P+ + L +PLF++IGR ++++ KE+G+ ++ + D+F+E
Sbjct: 262 RLGWGGW------PQKNHAHFLSPEEPLFLKIGRLYMQEWQKEFGKNTY-FLADSFNEME 314
Query: 119 PPVDSP------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNS 171
P + +SSLG IY + S + DAVW+MQGW+F Y W +KALL+
Sbjct: 315 LPENKGGVEARNNMLSSLGEQIYRSISSTNPDAVWVMQGWMFGYQRNIWNADTLKALLSK 374
Query: 172 VPLGKLVVLDLFAEVKPI-------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
VP K+++LDL A+ W K F+ P+++ ++ N G M G++D A
Sbjct: 375 VPDDKMLLLDLAADYNKTFWRNGMNWDVFKGFFNKPWVYSVVPNMGGKCAMTGVMDFYAN 434
Query: 225 GPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 284
G +EA S + + + G + I++ +V+Y + TD A +NR
Sbjct: 435 GHLEALNSSS-------RGRLSGMGMAPEGIENN-DVIYELI---TDAAW-RNRQ----- 477
Query: 285 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY----DHPHLWY------------ST 328
+V+ + + +Y NY + KEA ++Y DHP + ++
Sbjct: 478 ENVEQYLENYCRARYGNYPDSM-KEAWNLFRRTAYSNLKDHPRFNWQMKPGTRGCSVNTS 536
Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
+ ++ L LF+ + L S +R D +++ L NE EA D +
Sbjct: 537 EDFLKGLSLFVNT-RGLEQSPLFRQDAVEMAVHYLGIRMNEAIRAAQEALDEQDQENAEK 595
Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 448
F + D LL H + L W+ A+ + E++ +YE NAR +T W
Sbjct: 596 CMAYFRKYALLADSLLEGHPTWRLSRWISFARSHGTSPEEKNKYEQNARRLVTRW----- 650
Query: 449 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ DY K WSGL+RDYY PR +++ I+S S + W +W++ T
Sbjct: 651 --GPPVDDYAAKIWSGLIRDYYLPR----WEHFIQSRLSEKNPDMGAWEEKWVRST 700
>gi|340514474|gb|EGR44736.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
Length = 762
Score = 207 bits (528), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 146/525 (27%), Positives = 249/525 (47%), Gaps = 74/525 (14%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
NL G W LP W+D Q LQKKI+ R+ ELG+ P+LPAF G VP A V P A++
Sbjct: 209 NLQGSWSSSLPFEWVDDQFALQKKIVKRMVELGITPILPAFPGFVPRAAPRVLPDARLLH 268
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + LD DPLF ++ R+FI +Q + YG ++ Y D F+E PP
Sbjct: 269 SIQWAGFPE--IFTEDTFLDPVDPLFAQMQRSFITKQKQAYGNVTNFYTLDQFNEMIPPS 326
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPL-GKLVV 179
Y+ ++ + + ++S D +A+W+ Q WLF+ + FW +++A L V +++
Sbjct: 327 GDVAYLRNVSSNTWKALKSADPNAIWVFQAWLFAQNTTFWTNERIEAYLGGVTADSDMLI 386
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT---- 235
LD+++E P W ++ +YG P+IWC L N+ I +YG + ++ P+ A +
Sbjct: 387 LDIWSESMPQWQRAQSYYGKPWIWCELQNYGATINLYGQIQNVTNSPILALQESTSLSGF 446
Query: 236 ----------------------------TMAWINQYSVRRYG--RSVPAIQDAWNVLYHT 265
T A+ + ++ RY + I DAW + T
Sbjct: 447 GLSMEGQQNNEIVYDLLLAQAWSSEPLDTEAYFHNWASARYSSDQRPGFIHDAWETVRTT 506
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VY+ T+ + P V SII + V + + + ++ + L
Sbjct: 507 VYDNTN---------LTLMPSVPKSIIEL-----------VPRTSNM-ADITGILGTKLP 545
Query: 326 YSTSEVIRALELFIASG---NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
Y + ++ A + +G L ++ Y+YDL+D TRQ LA ++ NI++ Y ++
Sbjct: 546 YDPAVMVSAWKQLYHAGLQDTSLFNNSAYQYDLVDWTRQVLANAFIPIYKNIVDIYYNSN 605
Query: 383 AHGVFQLSR------RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
++ R + +L+ +D +L+ + F L WL +A+ A + +E+ A
Sbjct: 606 QTAGSRIQRLKAQGQQVTKLLLSLDLVLSSNRNFRLSTWLSAARSSAPSPAYVDSFEYEA 665
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 481
R QIT+W + Q L DY +K WSGL++ Y+ R ++ +Y+
Sbjct: 666 RNQITLWGPSGQ-----LIDYASKAWSGLMKTYHLKRWQMFVEYL 705
>gi|322699924|gb|EFY91682.1| alpha-N-acetylglucosaminidase, putative [Metarhizium acridum CQMa
102]
Length = 775
Score = 207 bits (528), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 156/565 (27%), Positives = 260/565 (46%), Gaps = 84/565 (14%)
Query: 5 HGWGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 62
WGG LP ++++ Q LQKKI+ R+ ELG+ PVLPAF G VP +++ V P +T
Sbjct: 211 RSWGGKGDLPLAFIELQFELQKKIVARMVELGITPVLPAFPGFVPESIKKVRPDVNLTVS 270
Query: 63 GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 122
NWF+ D ++ LD D + E+ R F+ +Q+ +G ++IY D F+E +P
Sbjct: 271 PNWFAPAPD-KYTRDLFLDPLDDTYAELQRLFVSKQMDAFGNVTNIYTLDQFNELSPASG 329
Query: 123 SPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGK-LVVL 180
Y+ + Y+G+ + + AVWL+QGWL FS FW P++ A L V + ++VL
Sbjct: 330 DTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSSRRFWTQPRIDAYLGGVEDDQGMLVL 389
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT----- 235
DL++E P W + + G P+IWC LH+F GN+ + G + ++ P++A +
Sbjct: 390 DLYSEANPQWQRTNSYSGKPWIWCQLHDFGGNMALEGRVQTLTSAPIDALAQSESLVGFG 449
Query: 236 ---------------------------TMAWINQYSVRRYG--RSVPA-IQDAWNVLYHT 265
T + + +RY S+P+ + AW +L
Sbjct: 450 LTPEAYEGNEVVYDILLDQAWSATPLDTQTYFASWVTKRYAGVSSIPSELYRAWEMLRTD 509
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
VY+ T TD + V VA + P++ + + T + HP
Sbjct: 510 VYSNTR--TDIPQ-VPVATYQLRPALSGIA------------------NRTGHFPHPTAL 548
Query: 326 YSTSEVIR-----ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
+ V++ LE G+ L ++ D +D++RQ L+ + L+ +++ AY+
Sbjct: 549 HYDPLVLQEAWKLMLEAMTRQGS-LWKVPAFQLDFVDVSRQMLSNQFDVLYADLVNAYKC 607
Query: 381 NDAHGVFQL------------SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 428
+ A G +L R L L+ +D L F L W+++A +
Sbjct: 608 SAAGGSRELRSSAPSCDVEAAGARLLSLLSTLDLTLLTSRHFTLQSWVDAAGSWGKAAGN 667
Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
E + +NAR+Q+T+W Q +A+ L DY K W GL+ YY R +I+ ++ + SG
Sbjct: 668 EDLFTFNARSQVTVW----QVDATNLNDYAAKAWGGLVGSYYKGRWSIFVDALVAASNSG 723
Query: 489 DGFRLKDWRREWIKLTNDWQNGRNV 513
RE +WQ G +
Sbjct: 724 -SLDEGALTRELEAFEAEWQAGEHA 747
>gi|429740221|ref|ZP_19273923.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
gi|429153946|gb|EKX96707.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
Length = 721
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 160/551 (29%), Positives = 241/551 (43%), Gaps = 103/551 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W QQ+ LQ KIL R+ LGM+P+ PAF+G VP + P ++
Sbjct: 197 MGNLNTWNGPLSANWHSQQIALQHKILERMRLLGMHPITPAFAGFVPEGFVKLHPEVRVK 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
F + Y+L P F++IG+ FIE+ KE+ + ++ Y D+F+E P
Sbjct: 257 H----FEWGGFDKSLNAYMLPPDSPYFLQIGKLFIEEWEKEFSKNTY-YLSDSFNEMELP 311
Query: 121 VDSPE-------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSV 172
V SP+ +S G AIY + +G+ +AVW+ QGW F Y FW ++ALL V
Sbjct: 312 V-SPDDTDGKHRLLSKYGEAIYQSIVAGNPNAVWITQGWTFGYQHRFWDKESLQALLERV 370
Query: 173 PLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDS---- 221
P KL+++DL + + W T K FYG +I + NF G + G L+
Sbjct: 371 PNDKLIIVDLANDYPKWVWKTEQTWKTHKGFYGKRWILSYVPNFGGKTLLTGDLNLYASC 430
Query: 222 -------------IAFGPVEARTSENTT-------MAWINQ----------YSVRRYGRS 251
I FG N M W NQ Y RYG
Sbjct: 431 SAEALAHPDKGRLIGFGSAPEGLENNEVVYELLADMGWQNQPIDLDHWLIEYCRSRYGSC 490
Query: 252 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
A+Q AW L +VY+ S+ S +Q V + +
Sbjct: 491 PNAMQKAWKGLCRSVYS---------------------SLYSYPRFTWQT----VIPDTL 525
Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
KS+ YD ++ RA+E F+ +L S YR D + Q + A+ L+
Sbjct: 526 RKSK---YDFNDTYF------RAVEDFLLCAPQLKDSPLYRSDALLFAAQYIGAKADNLY 576
Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
++A + + QL + ++L+ D LLA H L W+++A+ A ++ Q
Sbjct: 577 RKALQAKAVGNRARAKQLVDKVIQLLLQADKLLASHPTDRLSRWVDAARTAAATPQERMQ 636
Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
YE +A+ IT W + +DY +YWSGL++ YY PR +YF
Sbjct: 637 YEMDAKRLITSW-------GGIQQDYAARYWSGLIKTYYVPRIKLYF-------AGSKKK 682
Query: 492 RLKDWRREWIK 502
L +W W+K
Sbjct: 683 ELNNWEENWLK 693
>gi|294648124|ref|ZP_06725667.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|292636508|gb|EFF54983.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
Length = 499
Score = 205 bits (521), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 156/543 (28%), Positives = 244/543 (44%), Gaps = 93/543 (17%)
Query: 35 MNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAF 94
M PVLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F
Sbjct: 1 MKPVLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLF 57
Query: 95 IEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
+++Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F
Sbjct: 58 LDEQKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMF 116
Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
+D W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN
Sbjct: 117 YFDKDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNT 176
Query: 214 EMYGI----------------------------LDSIAFGPVEARTSENTTM-----AWI 240
+ G LD + F P E + + WI
Sbjct: 177 TLTGNVKESGARLENALINGGGNLKGIGSTLEGLDVMQF-PYEYILEKAWNLNVDDNKWI 235
Query: 241 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 300
+ R G ++DAW L++ +Y V P T G
Sbjct: 236 ECLADRHVGCVSQPVRDAWKRLFNDIY--------------VQVP--------RTLGTLP 273
Query: 301 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDL 358
Y +P + K ++ Y + L EV R L NE + + +R DLI +
Sbjct: 274 GY-RPALNKNSEKRTSNVYSNVELL----EVWRKL-------NEAPSDRRDAFRLDLITV 321
Query: 359 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 418
RQ L Y ++ + + D + + E++ D+D L A H L W++
Sbjct: 322 GRQVLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDD 381
Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
A+++ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 382 ARKMGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYV 434
Query: 479 KYMIESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLY 530
I++ E G D +L+D +E W+ T+ ++V+ S D L++ S +L+
Sbjct: 435 NTFIKAAEEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLF 491
Query: 531 NKY 533
+KY
Sbjct: 492 SKY 494
>gi|393783265|ref|ZP_10371440.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
CL02T12C01]
gi|392669544|gb|EIY63032.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
CL02T12C01]
Length = 723
Score = 204 bits (520), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 155/554 (27%), Positives = 248/554 (44%), Gaps = 70/554 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL W Q+ LQ +I+ R+ ELGM P+ PAF+G VP A P K
Sbjct: 199 MGNLNTWDGPLSDEWQKSQIELQHQIINRMRELGMQPIAPAFAGFVPMAFAEKHPDIKFK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W Y+L P F EIG+ F+++ KE+G+ ++ Y D+F+E P
Sbjct: 259 HL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKEWEKEFGKNTY-YLSDSFNEMELP 313
Query: 121 VDSPEY------ISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
V + ++ G +IY + +G+ DA+W+ QGW F Y FW ++ALL+ VP
Sbjct: 314 VAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQGWTFGYQHDFWDKASLQALLSHVP 373
Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + + W FYG +I+ + NF G + G L A
Sbjct: 374 DDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIFSYVPNFGGKTPLTGDLQMYATSS 433
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNC--TDGATDKNRDVIVAF 284
EA + + ++ +G + +++ V+Y + + TD A D + +
Sbjct: 434 AEALKAPS-------HGNLIGFGSAPEGLENN-EVVYELLADMGWTDQAIDPEQWM---- 481
Query: 285 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST--------------SE 330
PS + G Y K + + +S Y +P + T +
Sbjct: 482 ----PSYCTARYGAYPESMKNAWELFRKTAYSSLYSYPRFTWQTVIPDQRRISKIDVSDD 537
Query: 331 VIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLS 390
+ +ELF+AS + L+ S Y D I+ +A A++L+ + +Q
Sbjct: 538 FLHGIELFLASADSLNRSKLYVNDAIEFASYYIAAQADKLYKQALTEDTAGKPVAAYQHL 597
Query: 391 RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEE 450
+ ++L+ ++D LLA H + L W+E A+ ++ YE NA+ IT W
Sbjct: 598 NQAIDLLLNVDKLLASHPLYRLEEWVELARNSGTTPAEKDAYEANAKRLITTW------- 650
Query: 451 ASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNG 510
DY ++WSGL++DYY PR +YF S + GD L +W EWI+ W N
Sbjct: 651 GGFQEDYAARFWSGLIKDYYIPRLKLYF-----SKQRGD---LDNWEEEWIR--TPWHN- 699
Query: 511 RNVYPVESNGDALI 524
P E D I
Sbjct: 700 -TTTPFEKPLDMAI 712
>gi|224026593|ref|ZP_03644959.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
18228]
gi|224019829|gb|EEF77827.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
18228]
Length = 635
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 144/476 (30%), Positives = 211/476 (44%), Gaps = 78/476 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL ++I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 196 MQNLQSYGGPLPKSWIDKHVVLGQQIIRRELELGMKPIQQGFSGYVPRELKEKYPEAKIQ 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+W K + LD TD LF IGR F+E++ K +G +Y D F E+ PP
Sbjct: 256 PQPSWCGFKGAAQ------LDPTDSLFQVIGRDFLEEEKKLFG-AHGVYAADPFHESRPP 308
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD+PEY+S++G +I++ Q D ++W MQ W R P +KA VP L++L
Sbjct: 309 VDTPEYLSAVGRSIHTLFQEFDPYSLWAMQAWSL------REPIVKA----VPEEHLLIL 358
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
DL + +G P + LHNF G I M+G L +A EA S + +
Sbjct: 359 DLNGSK---CTQRNACWGYPVVAGNLHNFGGRINMHGDLPLLAGNQYEAAVSLSPNVCGS 415
Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
W+ +Y++RRYG A +L Y
Sbjct: 416 GLFMEGIEQNPLYYELAFEMPLQKGKVELDGWLKEYALRRYGSKWENTHKALLLLLEGPY 475
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T+ + +I A P++ G G P YS
Sbjct: 476 RPGTNGTELS-SIIAA----RPALHVKKSGPNAGLGIP--------------------YS 510
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+I A + L S YR+D++DL RQ + + +A++ D G
Sbjct: 511 PWLLIEAQAFMLKDAGILKTSEAYRFDIMDLQRQIMTNLGQAIHKEAAKAFEAGDEKGFE 570
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
SRR+LEL+ D+D LL F WL A+ EE++ Q+E NA +T+W
Sbjct: 571 LHSRRYLELLTDVDTLLRTRPEFNFDRWLADARSWGDTEEEKNQFERNATALVTIW 626
>gi|282881077|ref|ZP_06289764.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
5C-B1]
gi|281304881|gb|EFA96954.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
5C-B1]
Length = 688
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 146/522 (27%), Positives = 231/522 (44%), Gaps = 79/522 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ Q +D + LQ KIL R+ +LG+ PV+ F G VP+ L + +P A +
Sbjct: 164 MGNLEGWGGPMSQQMIDDRYKLQIKILRRMRQLGIEPVVQGFPGIVPSFLHDKYPKACVV 223
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + +L LF + +A+ + + YG + D F E N
Sbjct: 224 SQGKWNGFQRPS------ILLPQSQLFYCMAKAYYDNMKRYYGTDLRYFGGDLFHEGGNA 277
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD +SS + + M S DA W++QGW + P ALL + ++
Sbjct: 278 KGVD----LSSTASKVQKCMLSHFPDAKWVLQGWNGNPSP--------ALLAGLDKKHVL 325
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN---- 234
+++L E+ W S +F P+IW +++F G +M G L + P A +
Sbjct: 326 LINLAGEIDASWKQSDEFGQTPWIWGSVNHFGGKTDMGGQLPVLVEQPHRALAASQHGRL 385
Query: 235 --------------------TTMAW----------INQYSVRRYGRSVPAIQDAWNVLYH 264
AW + QY RYG + AW +L
Sbjct: 386 KGLGILPEGIHTNPVVYDLALQTAWSDTVPSVDHLLRQYIWYRYGTWNDDLYRAWQLLAS 445
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
+VY EG Y++ ++ ++ S S++ +
Sbjct: 446 SVYG---------------------EFEVKGEGTYESVF--CARPSLHVSSVSTWGPKKM 482
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
Y ++++AL LF + S TY YDL+DL RQ +A A ++ ++ AY D+
Sbjct: 483 QYQPEKLLQALVLFRKAAVHFKGSETYEYDLVDLARQVMANNARNVYNQVVHAYNEKDSL 542
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + S FL L++ D LL+ + FLLG WL++A+Q +NE+ ++Q NART I+ W
Sbjct: 543 ALNRYSSTFLHLIDLQDSLLSTNKFFLLGKWLQAARQYGENEQDQRQALVNARTLISYW- 601
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
+ + L DY NK W+GLL+ YY PR +F + L
Sbjct: 602 -GPDDATTRLHDYANKEWAGLLKQYYAPRWRAFFAMLAGQLR 642
>gi|210611122|ref|ZP_03288736.1| hypothetical protein CLONEX_00926, partial [Clostridium nexile DSM
1787]
gi|210152109|gb|EEA83116.1| hypothetical protein CLONEX_00926 [Clostridium nexile DSM 1787]
Length = 1662
Score = 202 bits (514), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 156/577 (27%), Positives = 253/577 (43%), Gaps = 102/577 (17%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW ++ L +K + + +LGM PVL +SG VP + + PSA++
Sbjct: 685 MANLSGFGGPVHDSWFTERTELARKNQLIMRKLGMQPVLQGYSGMVPVDITDKDPSAQVI 744
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S + +L F + + F + Q + YG S Y D F E NT
Sbjct: 745 KQGTWCSFQRPS------MLKTDSETFDKYAQLFYKVQKEVYGDVSDYYATDPFHEGGNT 798
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-- 176
+ SP I+ + + M D + +W++Q W + ALL + +
Sbjct: 799 GGM-SPTVIAE---KVLANMMEADENGIWIIQSW--------QGNPSTALLQGLDAARDH 846
Query: 177 LVVLDLFAEVKPIWSTSK-----------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
+VLDL+AE P W+ + +F P+++CML+NF G + ++G +++ G
Sbjct: 847 ALVLDLYAEKTPHWNETDPGSYGGAEGGGEFLNTPWVYCMLNNFGGRLGLHGHIENFVNG 906
Query: 226 PVEARTSENTTMA-------------------------------------WINQYSVRRY 248
+A N W Y+ RRY
Sbjct: 907 VAQAAAQANHMAGIGITPEASVNNPVLYDLFFETIWSDDGENLSAINLDEWFKDYTTRRY 966
Query: 249 GRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKPV 306
G + +A +L TVYN + V+ A P +D G +G V
Sbjct: 967 GAESQSAYEAMQILNDTVYNPEMNMKGQGAPESVVNARPGLDI-------GAASTWGNAV 1019
Query: 307 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 366
+ Y +E+ +A L + ++L S Y+YDL ++ Q L+
Sbjct: 1020 -----------------IDYDKAELEKAAALLLKDYDKLKDSAGYQYDLANVLEQVLSNT 1062
Query: 367 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 426
A E + +A++ DA ++S FLE++ ++ + + F+LG WLESAK LA+N
Sbjct: 1063 AQEYQKKMADAFREGDAEKFEKMSNSFLEIITKVEEVTGTQEEFMLGTWLESAKALAKNA 1122
Query: 427 EQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
+ ++ YE NAR IT W Q + L DY N+ WSGL DYY PR + K++ E
Sbjct: 1123 DDFTKELYELNARGLITTWGSIEQANSGGLIDYSNRQWSGLTSDYYKPR---WEKWIAER 1179
Query: 485 LESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNG 520
+ G K++ +W ++ W N YP ++NG
Sbjct: 1180 KKELAGEESKNYSAADWFEMEWAWARSNNEYPTKANG 1216
>gi|160914140|ref|ZP_02076362.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
gi|158433951|gb|EDP12240.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
Length = 2150
Score = 202 bits (514), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 141/514 (27%), Positives = 242/514 (47%), Gaps = 81/514 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ +GGPLP W +Q++ L +K+ R+ G++PV+ F G VP + A +T
Sbjct: 387 MQNLYSFGGPLPDDWFEQRVELGRKMHDRMQAFGIDPVIQGFCGQVPMSFVEKNEGAVLT 446
Query: 61 QLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
+ W S + P TYL F ++ + F E+Q +G S Y D F
Sbjct: 447 PIDEWPSF-TRPAMIKTYLSQEEIAAGKKDYFKDVAKTFYEKQKNVFGDVSDYYASDPFH 505
Query: 116 E--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
E NT +D ++++ + M ++DA+W+MQ W + D ++ L V
Sbjct: 506 EGGNTQGLD----VTNIFKTVQEEMLKSNADAIWVMQQWQGNLD----HAKLSGL---VK 554
Query: 174 LGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS- 232
+ + LDL +++ P S+ + G+ +IWCMLHNF G + + G ++ IA P A ++
Sbjct: 555 PEQALALDLQSDMNP--SSVMENEGISWIWCMLHNFGGRMGLDGEVEVIAKEPAIAASNN 612
Query: 233 -------------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNV 261
EN+ + AW+++Y+ RR G S ++Q+AW++
Sbjct: 613 QYMKGIGITPEALENSPIVYEMLFDMTWSKDPIDYQAWVDKYATRRAGGSSDSLQEAWDM 672
Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
L T Y +D + + ++I+ G N+ S S++ H
Sbjct: 673 LLETAY----------KDKGIYYQGAGETVINARPG--TNF-----------SSASTWGH 709
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
++ Y E+ + L L I + + +AS YRYDL D+ Q L A E +++A
Sbjct: 710 SNILYDKEELDKVLSLLIENYDAFAASEAYRYDLADVAEQVLCNAAIEYHALMVQALNNK 769
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
D+ ++S FLEL++ D +L + F+LG W+ A+++ N + + +E+NAR
Sbjct: 770 DSAEFKRISTHFLELIDLSDRILGSSEEFMLGTWIHDAREMLDNADDWTKDLFEFNARAV 829
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
+T W E + L+DY N+ W+GL +Y R
Sbjct: 830 VTTW---GGERSGSLKDYSNRKWAGLTSSFYKER 860
>gi|126347839|emb|CAJ89559.1| putative alpha-N-acetylglucosaminidase [Streptomyces ambofaciens
ATCC 23877]
Length = 740
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 144/556 (25%), Positives = 234/556 (42%), Gaps = 77/556 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGP+ + L+ + L ++I R+ +LGM PVLP + G VP P +
Sbjct: 213 MQNMSGFGGPVSERLLEDRADLGRRIADRLRQLGMTPVLPGYYGTVPPGFTERNPVGPVV 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W + P W LD +F + AF Q + +G TS +Y D E P
Sbjct: 273 PQGDWVGFER-PDW-----LDPRSAVFPRVAAAFYRHQRELFG-TSTMYKMDLLHEGGRP 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P + A+ +Q+ AVW + GW + P + +++++ +L+++
Sbjct: 326 GNVP--VRDAAQAVMKALQTARPGAVWTLIGWQNN-------PSTQ-IIDAIDKRRLLIV 375
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILDSI 222
D ++ ++G PY + + NF G+ M L I
Sbjct: 376 DGLSDRYDGLDREATWHGAPYAFGTIPNFGGHTTMGANTAVWAERFDQWRTKAGSALAGI 435
Query: 223 AFGP---------------VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
A+ P + RT W +Y+ RRYG + P AW +L Y
Sbjct: 436 AYMPEGTGGNPVAYELFTELAWRTEPVDQRKWFAEYAQRRYGGADPHAASAWELLRSGPY 495
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ G +++D + ++ + + +S+ + Y
Sbjct: 496 STPSGTWSESQDSLF-----------------------TARPRLTATNAASWSPGAMRYD 532
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
V RAL + L A++ YR+DL+D+ RQ LA + L I AY D
Sbjct: 533 PGTVRRALTELVRVAPALRATDAYRFDLVDVARQVLANRSRTLLPQIKAAYDAEDLPRFR 592
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ + + +D LLA FLLGPWLE AK + E + E++AR+ +T W +
Sbjct: 593 ARAAEWKNCLSLLDRLLATDARFLLGPWLEDAKSWGRTEAERAAAEFDARSILTTWGHRS 652
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
+A LRDY N+ WSGL+ D+Y R Y + +L +G D W L +DW
Sbjct: 653 GSDAGGLRDYANREWSGLVSDFYAMRWTKYLDSLDTALVTGRPPVAID----WFALEDDW 708
Query: 508 QNGRNVYPVESNGDAL 523
R+ YPV +GD +
Sbjct: 709 NRQRDGYPVRPSGDPV 724
>gi|365104185|ref|ZP_09333846.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
4_7_47CFAA]
gi|363644798|gb|EHL84079.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
4_7_47CFAA]
Length = 1049
Score = 202 bits (513), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 150/555 (27%), Positives = 242/555 (43%), Gaps = 74/555 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 361 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 420
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 421 DQGEWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 478
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D DAVW++Q W + A LN + ++L
Sbjct: 479 ADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW--------QENPTDAFLNGLKKDHALIL 528
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------FG 225
DL+A+ KP + +F P+IW MLH F G + G+ + +A G
Sbjct: 529 DLYADNKPNHAMRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLAQEIPQSLAESKKMKGVG 588
Query: 226 PVEARTSEN-----------------TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
N ++ A+I+ + RYG P I+ AW+++ T Y+
Sbjct: 589 VTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAYH 648
Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
D+ R + SII G +G V++ + YD
Sbjct: 649 RR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------- 684
Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
+E + L L+++ + A+ Y++DL+D+TRQ LA + E + +A+ D Q
Sbjct: 685 AEFEKILPLYLSVYDHFKANPAYQHDLVDITRQVLANASYEYYRAFEDAWIAKDYSAFNQ 744
Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFDN 446
LS +FL L++ D +L+ F+LG W+ SA+ + ++ Q+E+NAR +T W
Sbjct: 745 LSGKFLRLIKLQDQVLSTRPEFMLGTWINSARTMLDGMDDWTRDQFEFNARAMVTTWGTE 804
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
+A LRDY N+ W GL D+Y R A + + + + +G + + W L
Sbjct: 805 QAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALKSAAATGQ--KQDAIKVNWFPLEYR 861
Query: 507 WQN-GRNVYPVESNG 520
W N N YP + +G
Sbjct: 862 WVNQSGNGYPTQPSG 876
>gi|296115989|ref|ZP_06834611.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
23769]
gi|295977458|gb|EFG84214.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
23769]
Length = 758
Score = 201 bits (512), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 150/579 (25%), Positives = 252/579 (43%), Gaps = 91/579 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGP+P+ ++++ V ++I+ R+ ELGM PVLP F G VP FP A +
Sbjct: 222 MANMCCYGGPVPRELIEKRAVSAQQIIGRMRELGMRPVLPGFYGMVPDDFGKRFPQAHVI 281
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD DP+F ++ + ++Q K +G + +Y+ F E P
Sbjct: 282 GQGEWNRFRR-PAW-----LDPRDPMFAKVAAIYYDEQKKLFG-DAPVYDIQPFQEGGTP 334
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ G I + + A+W++ W D +L V +L ++
Sbjct: 335 GDVP--LADAGQGIQKALDTAHPGAMWMLMAWYEEPD--------ARMLAGVDRKRLFIV 384
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG---PVEARTSENT-- 235
DL + + F G P+++ L +F G + G S +G P RT +N
Sbjct: 385 DLEQNTRVRENRDADFQGAPFLYGGLWDFGGRTSLGG--SSYDYGVRLPGLWRTQKNMIG 442
Query: 236 -----------------------------TMAWINQYSVRRYGRSVPA-IQDAWNVLYHT 265
T W Y+ RRYG+ + AW++L H+
Sbjct: 443 TAVFPEGMDNNPYIFDLFTEAAWRRDGVDTTQWTRDYADRRYGQPGDVHARKAWDLLLHS 502
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP-VSKEAVLKSETSSYDHPHL 324
+ S Q++G+ + +++ ++ S H
Sbjct: 503 AF-------------------------SYRATGIQDFGEASAAPDSLFNAQPSLDTHSAA 537
Query: 325 W-------YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 377
W Y V A+ + + + A+ YRYDL+D+TRQA+A A + I +A
Sbjct: 538 WNGMKVLPYDPHLVEAAMAELLQASDATRATEAYRYDLVDVTRQAVANQARAMLPQIGDA 597
Query: 378 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 437
+ D + L+ R+LEL++ D LLA + F +G WL + + + Q K +++AR
Sbjct: 598 FAARDRAKLHALTTRWLELMDRQDSLLATNTFFRVGTWLSWPQAWSDDPAQRKLMDYDAR 657
Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR 497
+T W T + LRDY NK W+GL +DYY R ++F + SL +G R D
Sbjct: 658 VILTNWGGRTASQVGHLRDYANKDWAGLTKDYYRVRWQLFFDSLETSLATGRPPREID-- 715
Query: 498 REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 536
W K+ +W + VY GD+ ++ +++ QG
Sbjct: 716 --WYKVGEEWCHNGRVYSPTPEGDSYTVARDIHDYLTQG 752
>gi|315500594|ref|YP_004089396.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
gi|315418606|gb|ADU15245.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
Length = 765
Score = 201 bits (511), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 154/572 (26%), Positives = 231/572 (40%), Gaps = 103/572 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ P+PQ+W+ ++ LQ +IL R+ ELGM P+LPAF G VP A P A+I
Sbjct: 202 MGNIEGYLAPVPQAWIQKKHKLQSRILGRMKELGMTPILPAFGGYVPKAFAQKHPQARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W TY LD DPLF +I FI + YG + Y D+F+E PP
Sbjct: 262 PMRPWEGFHE------TYWLDPADPLFAKIAARFIALYTETYGEGRY-YLADSFNEMLPP 314
Query: 121 VD-----------------------------SPEYISSLGAAIYSGMQSGDSDAVWLMQG 151
+ E +++ G AIY ++ DAVW MQG
Sbjct: 315 ISHDGSDVKNAKYGDSTANTKETETVVDPAVKAERLAAYGKAIYDSIRQARPDAVWTMQG 374
Query: 152 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNF 209
WLF D FW P + A L VP KL++LD+ + P +W +S F G P+I+ +HN+
Sbjct: 375 WLFGADKHFWTPDAIGAFLRDVPQDKLMILDIGNDRYPGVWQSSNAFQGKPWIYGYVHNY 434
Query: 210 AGNIEMYGIL-----------------DSIAFGPVEARTSEN-----------------T 235
+ +YG L D FG N T
Sbjct: 435 GASNPVYGDLGFYRDDIRGLLARKDTGDLKGFGLFPEGLHNNSIVYEYAYDLAWGQANQT 494
Query: 236 TMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVT 295
W+ Y RYG+ PA+ AW+ ++ + P
Sbjct: 495 VTEWLTTYLKSRYGQVTPALILAWSTYVEAAFSTRYWS---------------PRWWRSK 539
Query: 296 EGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 355
G Y +P + + HP ++ RA++ + S S YR+D+
Sbjct: 540 AGAYLLCKRPTADMVEFEG------HPG---DRKKLRRAIDALL-SLKGFGGSALYRHDV 589
Query: 356 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 415
ID R +++ ++ + ++AY+ D L + LV +D L+ L W
Sbjct: 590 IDAVRHLVSEEIDDRLIAAMKAYKSGDVKTGDGLREEVIALVTQVDTLMGAQPD-TLASW 648
Query: 416 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 475
++ A E++ Y NA+ Q+T+W L DY +K W GL +D+Y PR
Sbjct: 649 IDEASAYGDTSEEKAYYVMNAKAQVTVWGGKGN-----LNDYASKAWQGLYKDFYLPRWM 703
Query: 476 IYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
+ S G F K + RE I W
Sbjct: 704 KLLAALRASASGGAPFDQKTFTRELIDWEQAW 735
>gi|393785795|ref|ZP_10373941.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
CL02T12C05]
gi|392661414|gb|EIY55000.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
CL02T12C05]
Length = 724
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 149/549 (27%), Positives = 234/549 (42%), Gaps = 102/549 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL W + Q+ LQ +I+ R+ ELGM+P+ PAF+G VP A P K
Sbjct: 199 MGNLNTWDGPLSDEWQEGQIQLQHQIINRMRELGMSPIAPAFAGFVPMAFAEKHPDIKFK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W Y+L P F EIG+ F+++ KE+G+ ++ Y D+F+E P
Sbjct: 259 HL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKEWEKEFGKNTY-YLSDSFNEMELP 313
Query: 121 VDSPEY------ISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
V + ++ G +IY + +G+ DA+W+ QGW F Y FW ++ALL+ VP
Sbjct: 314 VAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQGWTFGYQHSFWDKASLQALLSHVP 373
Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + + W FYG +I+ + NF G M G L A
Sbjct: 374 DDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIFSYVPNFGGKTPMTGDLQMYASSS 433
Query: 227 VEARTSEN------------------------TTMAWINQ----------YSVRRYGRSV 252
EA SE+ M W +Q Y + RYG
Sbjct: 434 AEALQSESHGNLIGFGSAPEGLENNEVVYELLADMGWTDQAIDLDKWMPSYCMARYGAYP 493
Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
++DAW++ T Y+ ++ PD K +
Sbjct: 494 ETMKDAWDLFRKTAYSSLYSYPRFTWQTVI--PD---------------------KRRIS 530
Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
K + S + + +ELF+ S + L S Y D I+ +A A++L+
Sbjct: 531 KIDVSD-----------DFLHGVELFLNSADSLKNSKLYVNDAIEFASYYIAAKADKLYG 579
Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
+ + + Q + ++++ ++D LLA H + L W+ A+ ++ Y
Sbjct: 580 KALAEDTVGRSAVAQQYLNQTIDMLLNVDKLLASHPLYRLEEWVNFARNSGTTPAEKDAY 639
Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
E NA+ IT W DY ++WSGL++DYY PR IYF SL+
Sbjct: 640 EINAKRLITTW-------GGFQEDYAARFWSGLIKDYYIPRLKIYFSKQRGSLD------ 686
Query: 493 LKDWRREWI 501
+W EWI
Sbjct: 687 --NWEEEWI 693
>gi|291086028|ref|ZP_06354661.2| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
ATCC 29220]
gi|291069185|gb|EFE07294.1| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
ATCC 29220]
Length = 1014
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 150/555 (27%), Positives = 241/555 (43%), Gaps = 74/555 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 326 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 385
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 386 DQGDWVGFVRPPM-LRTYVKQGAD-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 443
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D DAVW++Q W+ A LN + ++L
Sbjct: 444 ADLD--MVKVAQTVQNKMLEHDKDAVWIIQN--------WQENPTDAFLNGLKKDHALIL 493
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------FG 225
DL+A+ KP + +F P+IW MLH F G + G+ + +A G
Sbjct: 494 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLAQEIPQSLAESKYMKGVG 553
Query: 226 PVEARTSEN-----------------TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
N ++ A+I+ + RYG P I+ AW+++ T Y+
Sbjct: 554 VTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAYH 613
Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
D+ R + SII G +G V++ + YD
Sbjct: 614 RR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------- 649
Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D Q
Sbjct: 650 AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFNQ 709
Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFDN 446
LS +FL L++ D +L F+LG WL SA+ + ++ Q+E+NAR +T W
Sbjct: 710 LSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLDGMDDWTRDQFEFNARAMVTTWGIE 769
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
+A LRDY N+ W GL D+Y R A + + + + +G + + W L
Sbjct: 770 QAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALKNAAATGQ--KQDAIKVNWFPLEYR 826
Query: 507 WQNGR-NVYPVESNG 520
W N N YP + +G
Sbjct: 827 WVNQTGNGYPTQPSG 841
>gi|225875033|ref|YP_002756492.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
51196]
gi|225793771|gb|ACO33861.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
51196]
Length = 800
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 154/558 (27%), Positives = 243/558 (43%), Gaps = 78/558 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL + P+ +S LD+++ ++I+ R+ ELG+ PV P + G VP P A +
Sbjct: 241 MGNLCCFDEPISRSLLDRRIRSAQQIIRRLRELGITPVFPGYFGMVPEDFARRHPGAHVI 300
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
GNW + P W LD DPLF + +F + Q + +G +S IY+ + F E
Sbjct: 301 PQGNWNGFRR-PAW-----LDPRDPLFAAVAASFYKHQQELFGDSS-IYDIELFQEGGSA 353
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P +SS AI + A+W+ W +P +ALL++V L+V+
Sbjct: 354 ADVP--VSSAAKAIQKALLRAHPQAMWMTLAW--QNNP------SRALLSAVDRSHLLVV 403
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA----------------- 223
D+ P + + F G Y++ L +F G + L A
Sbjct: 404 DIDQGRTPHENRERDFMGAAYLFGGLWDFGGRTTLGANLYDYAVRLPRMGLRAGSTMKGT 463
Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
F + RTS W +Y+ RRYG P + AW +L T Y
Sbjct: 464 ALFSEGLDNNPAAFDLFTEMAWRTSPVDLRTWSREYARRRYGMDDPHTRRAWRILMETAY 523
Query: 268 NC-TDGATDKN-RDVIV-AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
DG ++ RD + D PS+ +V+ SS+ L
Sbjct: 524 GTRADGVSNHGERDAPPESLFDAQPSLDAVS--------------------ASSWSPDRL 563
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
Y + AL + + + TY+YDL+D+ RQ LA ++ + I +AY
Sbjct: 564 RYDPKKFEAALTELLQAPPGMREMPTYQYDLVDVARQTLANWSRKTLPEIKDAYDHRHEA 623
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
L +++L ++ D LLA + F++GPWL + A ++++ +++AR+ +T W
Sbjct: 624 RFETLEKQWLCMMMLQDKLLATNTSFMVGPWLNAVSPWAATATEQRRLDYDARSILTTWG 683
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
+ T EA LRDYGNK W+GL RDYY R IYF + SL++G D W
Sbjct: 684 NRTASEAG-LRDYGNKDWAGLTRDYYYRRWQIYFNDLDRSLKTGTPPHPID----WFAFG 738
Query: 505 NDWQNGRNVYPVESNGDA 522
W + Y ++ GD+
Sbjct: 739 EKWNRAQTHYATQARGDS 756
>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 1105
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 153/542 (28%), Positives = 229/542 (42%), Gaps = 103/542 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W++ + LQ++IL R+ LGM PVLPAF+G VP A P A+I
Sbjct: 164 MGNIEGYDAPLPQQWIEDKHALQQRILQRMRALGMKPVLPAFAGYVPKAFAQAHPQARIY 223
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I + FI+ + YG+ ++ Y D F+E PP
Sbjct: 224 RMRAWEGFHE------TYWLDPADPLFAKIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 276
Query: 121 VDS----------------------PEY--------ISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + PE ++ G A+Y + + DAVW+MQ
Sbjct: 277 IAADGSDARLASYGDSTANTANTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 336
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 337 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 396
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
+ G+ +YG L + FG N+ +
Sbjct: 397 YGGSNPVYGDLAFYRDDLRALLADKDKQQLVGFGAFPEGLHTNSVVYEYMYALAWGGQQR 456
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ Y+ RYG S PA++ AW+ L +V + P
Sbjct: 457 SLQDWLGDYTRARYGHSSPALRAAWDDLQASVLSTR---------------YWTPRWWRS 501
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + + E + D P L RAL+ +A E + + YRYD
Sbjct: 502 RAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLYRYD 552
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
L+D R + + AY+ D R V +DGL+ L
Sbjct: 553 LVDFARHYATGRVDTQLQQALAAYKRGDVAAGDAAFARVQAAVRQLDGLVGGQQE-TLSS 611
Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
WL++A+ A+ + Y +A+ Q+++W L DY +K W G+ DYY PR
Sbjct: 612 WLDAAEGDAKTPQDAAYYRRDAKAQVSVWGGEGN-----LGDYASKAWQGMYADYYLPRW 666
Query: 475 AI 476
A+
Sbjct: 667 AL 668
>gi|433678127|ref|ZP_20510026.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430816763|emb|CCP40478.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 691
Score = 198 bits (504), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 160/592 (27%), Positives = 241/592 (40%), Gaps = 112/592 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PL Q W++ + LQ++IL R+ LGM PVLPAF G VP A P A+I
Sbjct: 127 MGNIEGYDAPLQQQWIEDKHALQQRILQRMRTLGMKPVLPAFVGYVPKAFAQAHPQARIY 186
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I FI+ + YG+ ++ Y D F+E PP
Sbjct: 187 RMRAWEGFHE------TYWLDPADPLFAKIALRFIQLYDRTYGKGTY-YLADAFNEMLPP 239
Query: 121 VDS----------------------PEY--------ISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + PE ++ G A+Y + + DAVW+MQ
Sbjct: 240 IAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 299
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 300 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 359
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
+ G+ +YG L + FG +N+ +
Sbjct: 360 YGGSNPVYGDLAFYRDDLRALLADKDKQQLVGFGAFPEGLHDNSVVYEYMYALAWGGQQR 419
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ Y RYG + PA++ AW+ L V + P
Sbjct: 420 SLQDWLGDYIRARYGHTSPALRAAWDDLQAAVLSTRYWT---------------PRWWRS 464
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + + E + D P L RAL+ +A E + + YRYD
Sbjct: 465 RAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLYRYD 515
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
L+D R + + AY+ D R V+ +DGL+ L
Sbjct: 516 LVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDAAFARVQAAVQQLDGLVGGQQE-TLSS 574
Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
WL A+ A+ + Y +A+ Q+++W L DY +K W G+ DYY PR
Sbjct: 575 WLGDAEGDAKTPQDAAYYRRDAKAQVSVWGGEGN-----LGDYASKAWQGMYADYYLPRW 629
Query: 475 AIYFKYMIESLESGDGF--------RLKDWRREWIKLTNDWQNGRNVYPVES 518
A+ + + + G G RL+ W +W+K + PV +
Sbjct: 630 ALAMQ-ALRAAAVGSGSVDEAALQQRLRAWELDWVKRETPYTRQAPADPVAA 680
>gi|210631701|ref|ZP_03296968.1| hypothetical protein COLSTE_00853, partial [Collinsella stercoris
DSM 13279]
gi|210159960|gb|EEA90931.1| F5/8 type C domain protein, partial [Collinsella stercoris DSM
13279]
Length = 1906
Score = 198 bits (504), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 154/535 (28%), Positives = 241/535 (45%), Gaps = 73/535 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP SW +Q++ L ++I R+ G++PV+ F G VP Q P++
Sbjct: 354 MQNLYSVGGPLPDSWFEQRVELARRIHDRMQTYGIDPVIQGFGGQVPTDFQQKNPNSVAA 413
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G+W S + P TYL DA + F ++G F E Q + +G+ SH Y D F
Sbjct: 414 SSGSW-SGFARPYMIKTYLTDADRAAGKEDYFQKVGTTFYEAQERIFGKVSHFYAVDPFH 472
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E V I + + M D AVW+MQ W + D L
Sbjct: 473 EGG-TVPQGFNIVDIYRTVQQKMLDYDPQAVWVMQQWQWGIDE-------NKLSGLAKKE 524
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
+ +VLDL ++++ ++ + VP++W MLHNF G + M G+ + +A +A S
Sbjct: 525 QSLVLDLQSDLRS-QASPMENQQVPWVWNMLHNFGGRMGMDGVPEVLAIKIPQAYNSNRY 583
Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
+N+ + AW Y RRYG + IQ+AW++L
Sbjct: 584 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRAWTRSYIERRYGGTDAKIQEAWDILL 643
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
T Y DG + SI++ +P + S++ H
Sbjct: 644 DTAYKHVDGEY---------YQGASESIMNA---------RPSDNKI---GSASTWGHSD 682
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
+ Y E RA +LFI S + S +RYD +D+ RQ LA E +AY+ DA
Sbjct: 683 IDYDKKEFERAAQLFIESYDTYKDSEAFRYDFVDVMRQVLANAFQEYQPLAGDAYKQRDA 742
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQIT 441
L+ + LE+++ D +L+ F+LG W+E+A+ L ++ + +E NAR+ IT
Sbjct: 743 ERFELLANQMLEMLDAQDRMLSTSSDFMLGTWIENARTLLEDADDWTADLFELNARSLIT 802
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
W ++ SL+ DY N+ WSGL YY PR + ++LE G + +W
Sbjct: 803 TW--GLEKNGSLI-DYSNRQWSGLTGSYYKPRWESWANARKKALEDGGSAQDLNW 854
>gi|432896403|ref|ZP_20107613.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
gi|433031274|ref|ZP_20219108.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
gi|431432398|gb|ELH14169.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
gi|431538475|gb|ELI14460.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
Length = 1049
Score = 198 bits (503), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 148/555 (26%), Positives = 240/555 (43%), Gaps = 74/555 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 361 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 420
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 421 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 478
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D +AVW++Q W + LN + ++L
Sbjct: 479 ADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW--------QENPTDDFLNDLKKDHALIL 528
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------FG 225
DL+A+ KP + +F P+IW MLH F G + G+ + +A G
Sbjct: 529 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMQEVLAQEIPQSLAESKYMKGVG 588
Query: 226 PVEARTSEN-----------------TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
N ++ A+I+ + RYG P I+ AW+++ T Y+
Sbjct: 589 VTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAYH 648
Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
D+ R + SII G +G V++ + YD
Sbjct: 649 RR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------- 684
Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D Q
Sbjct: 685 AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFNQ 744
Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFDN 446
LS +FL L++ D +L F+LG WL SA+ + ++ Q+E+NAR +T W
Sbjct: 745 LSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLDGMDDWTRDQFEFNARAMVTTWGTE 804
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
+A LRDY N+ W GL D+Y R A + + + + +G + + W L
Sbjct: 805 QAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQTLKSAAATGQ--KQDAIKVHWFPLEYR 861
Query: 507 WQNGR-NVYPVESNG 520
W N N YP + +G
Sbjct: 862 WVNQTGNGYPTQPSG 876
>gi|355706271|gb|AES02588.1| N-acetylglucosaminidase [Mustela putorius furo]
Length = 333
Score = 198 bits (503), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 129/372 (34%), Positives = 185/372 (49%), Gaps = 83/372 (22%)
Query: 142 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 200
D DAVWL+QGWLF + P FW P Q++A+L +VP G+L++LDLFAE +P++ + F+G P
Sbjct: 3 DPDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLILDLFAESQPVYLRTASFHGQP 62
Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM----------------------- 237
+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 63 FIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVGTGMAPEGIGQNEVVYALMAEL 122
Query: 238 -----------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 285
AW+ ++ RRYG + AW +L +VYNC+ + T NR +V
Sbjct: 123 GWRKDPVADLEAWVTSFAARRYGVDSKETEVAWRLLLGSVYNCSGEACTGHNRSPLVR-- 180
Query: 286 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 345
PS+ VT +WY+ S V A L +A+ L
Sbjct: 181 --RPSLQMVTT---------------------------VWYNRSAVFEAWRLLLAAAPTL 211
Query: 346 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 399
+ S T+RYDL+D+TRQA + + + AY L A G+ EL+
Sbjct: 212 AKSPTFRYDLLDVTRQAAQELVSLYYTEARTAYLNKELVPLMRAAGIL-----VYELLPA 266
Query: 400 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 459
+DG+LA FLLG WLE A+ +A +E + YE N R Q+T+W E ++L DY N
Sbjct: 267 LDGVLASDSRFLLGTWLEQARAVAVSETDARFYEQNGRYQLTLW----GPEGNIL-DYAN 321
Query: 460 KYWSGLLRDYYG 471
K +GL+ YY
Sbjct: 322 KQLAGLVAGYYA 333
>gi|62318937|dbj|BAD94027.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
Length = 182
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 94/181 (51%), Positives = 128/181 (70%), Gaps = 1/181 (0%)
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
++DLTRQ L+K AN+++ + A+ D + QLS +FLEL++DMD LLA D LLG
Sbjct: 1 MVDLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGT 60
Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
WLESAK+LA+N ++ KQYEWNARTQ+TMW+D+ S L DY NK+WSGLL DYY PRA
Sbjct: 61 WLESAKKLAKNGDERKQYEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRA 120
Query: 475 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKY 533
+YF M++SL F+++ WRREWI +++ WQ + VYPV++ GDAL S+ L +KY
Sbjct: 121 RLYFNEMLKSLRDKKIFKVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKY 180
Query: 534 L 534
Sbjct: 181 F 181
>gi|331660873|ref|ZP_08361805.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
TA206]
gi|422369309|ref|ZP_16449711.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
gi|315298924|gb|EFU58178.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
gi|331051915|gb|EGI23954.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
TA206]
Length = 1052
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 148/555 (26%), Positives = 240/555 (43%), Gaps = 74/555 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 364 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 423
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 424 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 481
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D +AVW++Q W + LN + ++L
Sbjct: 482 ADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW--------QENPTDDFLNGLKKDHALIL 531
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------FG 225
DL+A+ KP + +F P+IW MLH F G + G+ + +A G
Sbjct: 532 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMQEVLAQEIPQSLAESKYMKGVG 591
Query: 226 PVEARTSEN-----------------TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
N ++ A+I+ + RYG P I+ AW+++ T Y+
Sbjct: 592 VTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAYH 651
Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
D+ R + SII G +G V++ + YD
Sbjct: 652 RR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------- 687
Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D Q
Sbjct: 688 AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFNQ 747
Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFDN 446
LS +FL L++ D +L F+LG WL SA+ + ++ Q+E+NAR +T W
Sbjct: 748 LSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLDGMDDWTRDQFEFNARAMVTTWGTE 807
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
+A LRDY N+ W GL D+Y R A + + + + +G + + W L
Sbjct: 808 QAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQTLKSAAATGQ--KQDAIKVHWFPLEYR 864
Query: 507 WQNGR-NVYPVESNG 520
W N N YP + +G
Sbjct: 865 WVNQTGNGYPTQPSG 879
>gi|161505009|ref|YP_001572121.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160866356|gb|ABX22979.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 1014
Score = 197 bits (500), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 146/555 (26%), Positives = 242/555 (43%), Gaps = 74/555 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 326 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 385
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 386 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFYEGGNR 443
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D DAVW++Q W+ A LN + ++L
Sbjct: 444 ADL--NMVKVAQTVQNKMLEHDKDAVWIIQN--------WQENPTDAFLNGLKKDHALIL 493
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------FG 225
DL+A+ KP + +F P+IW MLH F G + G+ + +A G
Sbjct: 494 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLAQEIPQSLAESKYMKGVG 553
Query: 226 PVEARTSEN-----------------TTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
N ++ A+I+ + RYG P I+ AW+++ T Y+
Sbjct: 554 VTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHNWLTSRYGAQSPEIEQAWDIMVKTAYH 613
Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
D+ R + SII G +G + +Y + + Y
Sbjct: 614 RR---KDRQR--------AEDSIIDAKPG----FG---------VTRACTYYNALIDYDK 649
Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D Q
Sbjct: 650 AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAQDYSAFNQ 709
Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFDN 446
LS +FL L++ D +L+ F+LG W+ +++ + ++ Q+E+NAR +T W
Sbjct: 710 LSGKFLRLIKLQDKVLSTRPEFMLGNWINNSRTMLDGMDDWTRDQFEFNARAMVTTWGTE 769
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
+A LRDY N+ W GL D+Y R A + + + + +G + + W L
Sbjct: 770 QAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALKTAAATGQ--KQDAIKVSWFPLEYR 826
Query: 507 WQNGR-NVYPVESNG 520
W N N YP + +G
Sbjct: 827 WVNQTGNGYPTQPSG 841
>gi|421734750|ref|ZP_16173809.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
gi|407077324|gb|EKE50171.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
Length = 1919
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 152/537 (28%), Positives = 240/537 (44%), Gaps = 77/537 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 362 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 421
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 422 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 480
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 481 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 532
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
+ +VLDL ++++ +++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 533 QTLVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 591
Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
+N+ + +W +Y+ RRYG + I+ AW++L
Sbjct: 592 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDILL 651
Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
T Y TDG + +I A P D +I S S++ H
Sbjct: 652 DTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 688
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
+ Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 689 SDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 748
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 749 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 808
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 809 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 862
>gi|313140918|ref|ZP_07803111.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
bifidum NCIMB 41171]
gi|313133428|gb|EFR51045.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
bifidum NCIMB 41171]
Length = 2005
Score = 196 bits (497), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 152/537 (28%), Positives = 240/537 (44%), Gaps = 77/537 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 448 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 507
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 508 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQESVFGKVSNYYAVDPFH 566
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 567 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 618
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
+ +VLDL ++++ +++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 619 QALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 677
Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
+N+ + +W +Y+ RRYG + I+ AW++L
Sbjct: 678 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDILL 737
Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
T Y TDG + +I A P D +I S S++ H
Sbjct: 738 DTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 774
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
+ Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 775 SDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 834
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 835 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 894
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 895 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 948
>gi|311064845|ref|YP_003971571.1| beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
gi|310867165|gb|ADP36534.1| Beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
Length = 1923
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 152/537 (28%), Positives = 240/537 (44%), Gaps = 77/537 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 366 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 425
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 426 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 484
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 485 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 536
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
+ +VLDL ++++ +++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 537 QALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 595
Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
+N+ + +W +Y+ RRYG + I+ AW++L
Sbjct: 596 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDILL 655
Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
T Y TDG + +I A P D +I S S++ H
Sbjct: 656 DTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 692
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
+ Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 693 SDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 752
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 753 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 812
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 813 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 866
>gi|118370728|ref|XP_001018564.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila]
gi|89300331|gb|EAR98319.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila
SB210]
Length = 879
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 166/551 (30%), Positives = 254/551 (46%), Gaps = 93/551 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ Q+++D Q LQKKIL R+ LGM P+L F G VP +L+ FP +KI
Sbjct: 218 MGNLEGYGGPVTQAYIDGQYNLQKKILKRMRNLGMQPILQGFYGMVPNSLKAKFPLSKIY 277
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--T 118
+W + LDA D LF I F + K YGR + Y D F E
Sbjct: 278 GDQSWLGFRRPA------FLDANDELFSNIANIFYSESEKLYGR-AKFYGGDPFHEGAIV 330
Query: 119 PPVDSPEYISSLGAAIYSGMQ----SGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
P ++ ++S +IY MQ D W++Q W + P Q LL +
Sbjct: 331 PGLN----LTSQAQSIYRAMQYTDNPKDEKVKWILQSWQEN------PSQQ--LLQGLQN 378
Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
+ ++LDL AE + W T+ F G ++W L NF I YG+++ P A + +N
Sbjct: 379 DECIILDLMAEARSKWQTND-FSGHDFLWTSLPNFGLRIGQYGMIEQYVSQPPLAYSIKN 437
Query: 235 TTM-----------------------AWI--NQ----------------YSVRRYG-RSV 252
+TM AWI NQ + RYG ++
Sbjct: 438 STMKGIGSIPEGILTNVLDYEILFDKAWIQPNQDTNLTPRQQVLQYLGDFIRYRYGEQNN 497
Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA-- 310
+ AW++L +++YN T+ + V++A P +S Y + EA
Sbjct: 498 KNLFSAWSLLTNSIYNSTNPWDGPSESVMLARPASYIDKVSSWGTSYIYWNTTNVLEAWK 557
Query: 311 ----VLKSETSSYDHPHLWYSTSEVIRAL-------ELFIA-SGNELSA--SNTYRYDLI 356
+K + HL E+ + L E F+ S NE +T+ YDL+
Sbjct: 558 LFTNYVKEKKQKNRSQHL-QKLEEINKKLGRSDDDMEAFVEISQNEERNIFKDTFLYDLV 616
Query: 357 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 416
D+ RQ LA Y+ L+ ++ A+ D S++FLEL++D D LL+ F+LG +L
Sbjct: 617 DVARQNLASYSYLLYNKVMLAFNQTDTIKFALYSQQFLELIKDQDQLLSSRKEFMLGYYL 676
Query: 417 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 476
ES +L +++++ + + QIT+W D S L DY NK W+G+L+D+Y PR +
Sbjct: 677 ESVSKLGTTDQEKQNFIEQIKRQITVWSDF----PSDLHDYANKEWNGILKDFYLPRWEL 732
Query: 477 YFK----YMIE 483
YFK Y++E
Sbjct: 733 YFKSLQSYIVE 743
>gi|289667570|ref|ZP_06488645.1| N-acetylglucosaminidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 798
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 162/575 (28%), Positives = 243/575 (42%), Gaps = 113/575 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQ++IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQQQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG T Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFMELYTQAYG-TGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYG------------ILDS-----IAFGPVEARTSENTTM-------------- 237
+ + +YG + DS FG N+ +
Sbjct: 434 YGASNPLYGDFAFYRQDLQALLADSEKRNLRGFGIFPEGLHSNSVVYEYLYALAWEGPQQ 493
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ QY RYGRS A+ AW+ L +Y + P +
Sbjct: 494 PWSQWLTQYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + ++K + D L RA++ + + + YRYD
Sbjct: 539 RAGAYLLFKRPTAD--IVKFDDRPGDPQRL-------RRAIDALLQQAERYADAPLYRYD 589
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
LI+ R L+ A+ +++AY D A G QL+ R +LV+ +D L+ L
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDVQLA-RITQLVQGLDALVGGQHE-TLA 647
Query: 414 PWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 472
W A A N+ + Y NAR Q+++W + L DY +K W G+ D+Y
Sbjct: 648 DWTGQAAAAAGNDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQ 702
Query: 473 RAAIYFKYMIESLESGDGF-------RLKDWRREW 500
R + + ++G F +L W R W
Sbjct: 703 RWTRFLSAYRAARKAGTPFEAAAVDQQLATWERHW 737
>gi|390937398|ref|YP_006394957.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
gi|389891011|gb|AFL05078.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
Length = 1957
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 152/537 (28%), Positives = 239/537 (44%), Gaps = 77/537 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 400 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 459
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 460 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 518
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 519 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 570
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
+ +VLDL ++++ ++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 571 QALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 629
Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
+N+ + +W +Y+ RRYG + I+ AW++L
Sbjct: 630 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDILL 689
Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
T Y TDG + +I A P D +I S S++ H
Sbjct: 690 DTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 726
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
+ Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 727 SDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 786
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 787 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 846
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 847 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 900
>gi|169351448|ref|ZP_02868386.1| hypothetical protein CLOSPI_02228 [Clostridium spiroforme DSM 1552]
gi|169291670|gb|EDS73803.1| LPXTG-motif cell wall anchor domain protein [Clostridium spiroforme
DSM 1552]
Length = 1990
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 152/562 (27%), Positives = 253/562 (45%), Gaps = 77/562 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GG LP +W ++++ L +K+ R+ G+ PVL FSG VP ++ + +
Sbjct: 360 MQNMTSYGGKLPNNWFEERVELARKMHDRMQTYGITPVLSGFSGQVPTNFKDKYQDVQYV 419
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W + P TY+ + F ++ F + Q +G ++IY D F E
Sbjct: 420 AQGSWCGYER-PDMLRTYVDNGGTDYFSQMADVFYKAQRDIFGDVTNIYAVDPFHEGGKI 478
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK--LV 178
D + + + M D DA+WL+Q W S P ++ + L K ++
Sbjct: 479 GDMN--YTKVYETVQKKMMENDEDAIWLIQEWSGSIAS--NPSKL------INLDKEHVI 528
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSE---- 233
VLDLF+EV P +++ + P+IW MLHNF G + + + ++ P + SE
Sbjct: 529 VLDLFSEVSP-RNSALEAADTPWIWNMLHNFGGRMGLDANPEKVSQNIPNTYQNSEHMVG 587
Query: 234 ---------NTTMA------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
N+ MA W Y+ R YG + I++ WN+L T
Sbjct: 588 IGMTPEAIENSPMAYELLWDMTWTKDPIDFRQWCQDYAKRIYGGTNEDIEEVWNILLDTG 647
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
YN D + S+I+ +P + + SS+ H + Y
Sbjct: 648 YNRKDNY----------YQGAPESVIN---------ARPTTN----FTSASSWGHSTINY 684
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
E+ RA+ L + +E S + YDL D+TRQ ++ A E ++ AYQ +
Sbjct: 685 DKEELERAVYLMAKNYDEFKDSPAFIYDLSDITRQLISNSAQEYHKAMVNAYQAGNLSEF 744
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444
LS +FLE++ D +L+ + FL+G W+E A+ + ++ + + +E+NAR IT W
Sbjct: 745 EVLSDKFLEMILLQDQILSTNSDFLVGKWIEQARTMIEDSDDWTKDLFEFNARDLITTWG 804
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW---RREWI 501
LRDY N+ W+GL +DYY PR + + E++++ DW EW
Sbjct: 805 GLKNANGGGLRDYSNRQWAGLTKDYYYPRWQKWINDVKEAMKNNTAVPSTDWFLMEWEWA 864
Query: 502 KLTNDWQNGRNVYPVESNGDAL 523
L +D N Y +E++ AL
Sbjct: 865 NLKSDEGNE---YSIEASNLAL 883
>gi|288927801|ref|ZP_06421648.1| putative alpha-N-acetylglucosaminidase
(N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
oral taxon 317 str. F0108]
gi|288330635|gb|EFC69219.1| putative alpha-N-acetylglucosaminidase
(N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
oral taxon 317 str. F0108]
Length = 723
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 152/550 (27%), Positives = 238/550 (43%), Gaps = 102/550 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++G GPL W Q+ LQ KIL R+ +L M+P+ P F+G VP AL+ ++P+A I
Sbjct: 198 MGNIYGIDGPLSNQWHQDQIALQHKILDRMRKLDMHPICPGFAGFVPEALKELYPTADI- 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
Q W + Y+L DPLF +IG FI++ KE+GR Y D+F+E
Sbjct: 257 QYTTWEKAFHN------YILSPADPLFHKIGVMFIQEWEKEFGRCD-FYLIDSFNEMDIP 309
Query: 119 -PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
PP D P E+++ G +Y ++ + A W+MQGW+F Y P W + AL++ VP
Sbjct: 310 FPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQGWMFGYQPEIWDYKTLNALVSQVP 369
Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K+++LDL A+ K +W T K F G +I+ ++ N G + G LD A G
Sbjct: 370 DNKMIMLDLAADYNKFLWKTPFNWDFYKGFCGKQWIYSVIPNMGGKSALTGALDFYAKGH 429
Query: 227 VEARTSENT----------------------------------TMAWINQYSVRRYGRSV 252
+EA S+N W+ Y+ RYG
Sbjct: 430 LEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGWAKQGVELRPWLRNYTYSRYGCYP 489
Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
++ WN + +VY G ++++ + +
Sbjct: 490 IGMEQYWNEMIQSVY-----------------------------GSFKSHPRFNWQFRPG 520
Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
K + S D + +Y E++ + L GN+L + D ++ L L
Sbjct: 521 KEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL-----FEADFKEMAANYLGGKVEILVR 574
Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
I +AY+ D QL RF L+ MD +L H + W++ A+ + + Y
Sbjct: 575 QIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPTKDMQKWIDYARARGVSYNKADCY 634
Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
E NAR +T+W + DY + W+GL+RDYY PR YF SG F
Sbjct: 635 ESNARRIVTVW-------GPPIDDYSARIWAGLIRDYYLPRWKHYF----NQKRSGKPFD 683
Query: 493 LKDWRREWIK 502
W ++++
Sbjct: 684 FSTWELDFVE 693
>gi|293402122|ref|ZP_06646261.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
bacterium 5_2_54FAA]
gi|291304514|gb|EFE45764.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
bacterium 5_2_54FAA]
Length = 2295
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 144/535 (26%), Positives = 247/535 (46%), Gaps = 88/535 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ +GGPLP +W +Q+ L +K+ R+ G++PV+ FSG VP P+A IT
Sbjct: 398 MQNLYSYGGPLPDNWFEQRTELARKMHDRMQTYGISPVVQGFSGQVPDNFDKKQPTALIT 457
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
++ +W + P Y+ ++ + L+ ++ + F + Q +G ++ Y D F
Sbjct: 458 EMKDWVGY-TRPSIIQPYITESDAAKGKENLYPQVAKDFYDAQKNVFGNVTNYYATDPFH 516
Query: 116 ENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
E P +D E + + M + AVW+MQ W + D L V
Sbjct: 517 EGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQQWQGNLDA-------TKLSGLVK 565
Query: 174 LGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+ + LDL ++ P + S+ P++WCMLHNF G + M G L ++A P A
Sbjct: 566 PSQALALDLQTDLNPQNGVMENSE----TPWLWCMLHNFGGRMGMDGNLPNVAKNPAIAM 621
Query: 231 TS--------------ENTTMA------------------WINQYSVRRYGRSVPAIQDA 258
EN+ +A WI +Y+ RR G + +Q+A
Sbjct: 622 NESKYMKGIGITPEALENSPVAYELLFDMTWTKDPIDEDAWIAKYAQRRAGGTSEKLQEA 681
Query: 259 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 318
W +L T Y GA + ++ +II+ T +S S+
Sbjct: 682 WKILNETAY----GAKQE------SYQGAAETIINAT------------PRDSFRS-AST 718
Query: 319 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
+ H ++ Y E +AL+L I + ++ AS YRYDL D+ Q L A E +++A
Sbjct: 719 WGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYDLADVANQVLCNVAIEYHSLMVKAK 778
Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNA 436
++A + S++FLE+++ D +L + F++G W+ A+ + + + + +E+NA
Sbjct: 779 NESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGNWINDARNMMSDGDDWTKDLFEFNA 838
Query: 437 RTQITMWFDNTQEEASL--LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
R +T W + E +SL L DY N+ W+GL +D+YG R ++ + L+ D
Sbjct: 839 RAMVTTW---SGERSSLNNLNDYSNRKWNGLTKDFYGKRWKVWIENRQAELDGKD 890
>gi|333031147|ref|ZP_08459208.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
gi|332741744|gb|EGJ72226.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
Length = 721
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 149/568 (26%), Positives = 244/568 (42%), Gaps = 106/568 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL W Q+ LQ KIL R+ EL M P+ PAF+G VP A P
Sbjct: 199 MGNLNKWDGPLSDEWHTSQIELQHKILDRMRELEMKPIAPAFAGFVPMAFAEKHPDINFK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W DP + Y+L P F EIG+ FIE+ E+G ++ Y D+F+E P
Sbjct: 259 HM-RWGGF--DPEYNA-YVLPPDSPFFEEIGKLFIEEWENEFGSNTY-YLSDSFNEMELP 313
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + + G +IY + +G+ +A+W+ QGW F Y FW ++ALL++VP
Sbjct: 314 IDKDDTEGKYRLLRQYGESIYKSISAGNPEAIWVTQGWTFGYQHSFWDTTSLQALLSNVP 373
Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + + W FYG +I+ + NF G M G + A
Sbjct: 374 NEKMIIIDLGNDYPKWVWNTEQTWKVQNGFYGKGWIFSYVPNFGGKTTMTGDMQMYATSS 433
Query: 227 VEAR---------------------------------TSENTTM-AWINQYSVRRYGRSV 252
EA TSE+ + W+ Y + RYG
Sbjct: 434 AEALASPNKGNLIGFGSAPEGLENNEVIYELLADMGWTSESINLDEWMQSYCLSRYGGYP 493
Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
+Q AW + TVY+ + ++P Y E L
Sbjct: 494 ENVQKAWELFRKTVYSN-----------LYSYP---------------RYTWQTVVEDTL 527
Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
+ + ++ E + +ELF+++ NEL S Y DLI+ + A A++++
Sbjct: 528 RINKIN--------TSDEFLIGVELFVSAVNELKDSELYVNDLIEFSSFYAAAKADKIYK 579
Query: 373 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 432
+ ++ + L + ++++ +D LLA H + L W++ A+ ++ +
Sbjct: 580 EALILFERGNKKEARSLLNQSIQILLKVDKLLASHPIYRLEEWVKYARNSGSTVAEKDAF 639
Query: 433 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 492
E NA+ IT W + DY ++WSGL++DYY PR + F S +
Sbjct: 640 EANAKRLITTW-------GGIQDDYAARFWSGLIKDYYIPRMELNF--------SSERNS 684
Query: 493 LKDWRREWIKLTNDWQNGRNVY--PVES 518
L+ W W+ + W N + P+E+
Sbjct: 685 LRQWEENWV--STPWNNPTQPFDNPIEA 710
>gi|373451393|ref|ZP_09543318.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
3_1_31]
gi|371968665|gb|EHO86120.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
3_1_31]
Length = 2190
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 144/535 (26%), Positives = 250/535 (46%), Gaps = 88/535 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ +GGPLP +W +Q+ L +K+ R+ G++PV+ FSG VP P+A IT
Sbjct: 390 MQNLYSYGGPLPDNWFEQRTELARKMHDRMQTYGISPVVQGFSGQVPDNFDKKQPTALIT 449
Query: 61 QLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
++ +W + P Y+ + + L+ ++ + F + Q +G ++ Y D F
Sbjct: 450 EMKDWVGY-TRPSIIQPYITENDAAKGKENLYPQVAKDFYDAQKNVFGNVTNYYATDPFH 508
Query: 116 ENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
E P +D E + + M + AVW+MQ W + D ++ LL
Sbjct: 509 EGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQQWQGNLDA----TKLSGLLKP-- 558
Query: 174 LGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+ + LDL ++ P + S+ P++WCMLHNF G + M G L ++A P A
Sbjct: 559 -SQALALDLQTDLNPQNGVMENSE----TPWLWCMLHNFGGRMGMDGNLPNVAKNPAIAM 613
Query: 231 TS--------------ENTTMA------------------WINQYSVRRYGRSVPAIQDA 258
EN+ +A WI +Y+ RR G + +Q+A
Sbjct: 614 NESKYMKGIGITPEALENSPVAYELLFDMTWTKDPIDEDAWIAKYAQRRAGGTSEKLQEA 673
Query: 259 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 318
W +L T Y GA + ++ +II+ T +++ + S+
Sbjct: 674 WKILNETAY----GAKQE------SYQGAAETIINAT-----------PRDSFRSA--ST 710
Query: 319 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
+ H ++ Y E +AL+L I + ++ AS YRYDL D+ Q L A E +++A
Sbjct: 711 WGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYDLADVADQVLCNVAIEYHSLMVKAK 770
Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNA 436
++A + S++FLE+++ D +L + F++G W+ A+ + + + + +E+NA
Sbjct: 771 NESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGNWINDARNMMSDGDDWTKDLFEFNA 830
Query: 437 RTQITMWFDNTQEEASL--LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
R +T W + E +SL L DY N+ W+GL +D+YG R I+ + L+ D
Sbjct: 831 RAMVTTW---SGERSSLNNLNDYSNRKWNGLTKDFYGKRWKIWIENRQAELDGKD 882
>gi|289663931|ref|ZP_06485512.1| N-acetylglucosaminidase [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 798
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 162/575 (28%), Positives = 243/575 (42%), Gaps = 113/575 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQ++IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQQQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG T Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFMELYTQAYG-TGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYG------------ILDS-----IAFGPVEARTSENTTM-------------- 237
+ + +YG + DS FG N+ +
Sbjct: 434 YGASNPLYGDFAFYRQDLQALLADSEKRNLRGFGIFPEGLHSNSVVYEYLYALAWEGPQQ 493
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ QY RYGRS A+ AW+ L +Y + P +
Sbjct: 494 PWSQWLMQYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + ++K + D L RA++ + + + YRYD
Sbjct: 539 RAGAYLLFKRPTAD--IVKFDDRPGDPQRL-------RRAIDALLQQAERYADAPLYRYD 589
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
LI+ R L+ A+ +++AY D A G QL+ R +LV+ +D L+ L
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDVQLA-RITQLVQGLDALVGGQHE-TLA 647
Query: 414 PWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 472
W A A N+ + Y NAR Q+++W + L DY +K W G+ D+Y
Sbjct: 648 DWTGQAAAAAGNDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQ 702
Query: 473 RAAIYFKYMIESLESGDGF-------RLKDWRREW 500
R + + ++G F +L W R W
Sbjct: 703 RWTRFLSAYRAARKAGTPFDAAAVDQQLATWERHW 737
>gi|410634789|ref|ZP_11345419.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
gi|410145665|dbj|GAC22286.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
Length = 750
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 158/587 (26%), Positives = 268/587 (45%), Gaps = 108/587 (18%)
Query: 1 MSNLHGWGG--PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL W G LP+S+ D+Q+ L KIL R+ LGM P++ AF+G VP A +FP A+
Sbjct: 211 MGNLITWDGGDKLPESYFDEQIALNHKILKRLRSLGMTPIVHAFAGFVPPATSELFPEAQ 270
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE-N 117
I +L +W P YLL +PLF++IG+ +IE+ KE+G+ + Y D+F+E +
Sbjct: 271 IRRL-SWGG--GLPESTYGYLLSPENPLFVKIGKMYIEEWQKEFGKNEY-YLADSFNEMD 326
Query: 118 TPPVDS-PEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP------FWRPPQMKA 167
PP D+ E ++ L G +Y +++ + DA W+MQGW F Y FW P ++ A
Sbjct: 327 VPPADTEAELLTELAGYGDRVYQSIKAANPDATWVMQGWTFPYHKDENRQLFWTPERLHA 386
Query: 168 LLNSVPLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
L++ VP KL++LDL E + P W F+ +I+ + N G + G D
Sbjct: 387 LVSKVPDDKLLILDLANEYNKLWWKIDPSWKMYSGFFNKKWIYSFIPNMGGKTPLNGRFD 446
Query: 221 SIAFGPVEARTSEN------------------------TTMAW------INQ----YSVR 246
A P++A ++ T MAW ++Q Y+++
Sbjct: 447 IYAELPIDALNYKDKGNLIGFGFAPEGIENNEMIYELLTDMAWQRKAIDVDQWQAKYAMQ 506
Query: 247 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 306
RYG +++ A++ L N+ + +F VD I Y+N P
Sbjct: 507 RYGAYPGSLEKAFSYL--------------NKSALGSF--VDHPIHRFQLRPYRN---PE 547
Query: 307 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 366
E DH + + + + I+A LF+ + +L + Y++D +++T L+
Sbjct: 548 GVE----------DHATV-HESEDFIKATGLFLQASEQLKDNKLYQHDAMEITTLFLSLV 596
Query: 367 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 426
+ L + + V + + ++ MD LLA H L W++ A+
Sbjct: 597 TDNLLTKFLAKDVEQRDYSVLDEA---ISVMHTMDKLLAEHPNHQLVTWVDYARTWGSTT 653
Query: 427 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
++ YE NA+ +T W + + DY + WSGL+ +YY PR Y ++++
Sbjct: 654 AEKDYYESNAKRLLTTWGGDP------VNDYAGRVWSGLIGNYYAPRWQSYH----DAVK 703
Query: 487 SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ F ++ W W+ ++N Y D + +Q +Y KY
Sbjct: 704 NNQTFDVRQWEENWV--MTPYKNTSTAYQ-----DPVRVAQAMYFKY 743
>gi|281424178|ref|ZP_06255091.1| N-acetylglucosaminidase [Prevotella oris F0302]
gi|281401447|gb|EFB32278.1| N-acetylglucosaminidase [Prevotella oris F0302]
Length = 723
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 153/534 (28%), Positives = 246/534 (46%), Gaps = 70/534 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++G GPL W Q+ LQ KIL R+ +L M+P+ P F+G VP AL+ ++P+A I
Sbjct: 198 MGNIYGIDGPLSNQWHQDQIALQHKILDRMRKLDMHPICPGFAGFVPEALKELYPTADI- 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
Q W + Y+L DPLF +IG FI++ KE+GR Y D+F+E
Sbjct: 257 QYTTWEKAFHN------YILSPADPLFHKIGVMFIQEWEKEFGRCD-FYLIDSFNEMDIP 309
Query: 119 -PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
PP D P E+++ G +Y ++ + A W+MQGW+F Y P W + AL++ VP
Sbjct: 310 FPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQGWMFGYQPEIWDYKTLNALVSQVP 369
Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K+++LDL + K +W T K F G +I+ ++ N G + G LD A G
Sbjct: 370 DNKMIMLDLAVDYNKFLWKTPFNWDFYKGFCGKQWIYSVIPNMGGKSALTGALDFYAKGH 429
Query: 227 VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 286
+EA S+N + +G + I++ V+Y + C G + +
Sbjct: 430 LEALNSQNRG-------KLIGFGFAPEGIENN-EVVYELL--CDAGWAKQG-------VE 472
Query: 287 VDPSIISVTEGKYQNYGKPVSKE----AVLKSETSSY-DHPHL-W--------YSTSEV- 331
+ P + + T +Y Y P+ E +L+S S+ HP W Y + ++
Sbjct: 473 LRPWLRNYTYSRYGCY--PIGMEQYWNEMLQSVYGSFKSHPRFNWQFRPGKEKYGSVDLD 530
Query: 332 ---IRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
A+E+ +++ + + D ++ L L I +AY+ D Q
Sbjct: 531 NHFYHAVEIMAGMLSQMKGNKLFEADFKEMAANYLGGKVEILVRQIDKAYESQDTINANQ 590
Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 448
L RF L+ MD +L H + W++ A+ + + YE NAR +T+W
Sbjct: 591 LETRFYRLMTGMDLVLQGHPTKDMQKWIDYARARGVSYNKADCYESNARRIVTVW----- 645
Query: 449 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
+ DY + W+GL+RDYY PR YF SG F W ++++
Sbjct: 646 --GPPIDDYSARIWAGLIRDYYLPRWKHYF----NQKRSGKPFDFSTWELDFVE 693
>gi|310287970|ref|YP_003939229.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
gi|309251907|gb|ADO53655.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
Length = 1923
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 151/537 (28%), Positives = 238/537 (44%), Gaps = 77/537 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 366 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 425
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 426 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGDTFYKAQESVFGKVSNYYAVDPFH 484
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 485 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 536
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
+ +VLDL ++++ ++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 537 QALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 595
Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
+N+ + +W +Y+ RRYG + I+ AW++L
Sbjct: 596 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDILL 655
Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
T Y DG + +I A P D +I S S++ H
Sbjct: 656 DTAYKHMDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 692
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
+ Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 693 SDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 752
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 753 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 812
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 813 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 866
>gi|345014586|ref|YP_004816940.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
gi|344040935|gb|AEM86660.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
Length = 1044
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 148/564 (26%), Positives = 240/564 (42%), Gaps = 78/564 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGP+ + LD++ L ++I R+ ELGM PV P + G VP + P A+
Sbjct: 216 LQNMSEYGGPVSTALLDKRTELGRRIADRLRELGMRPVFPGYFGTVPDGFADRNPEARTV 275
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W ++ P W LD F ++ AF Q + +G + ++ D E P
Sbjct: 276 PQGDWNGLRR-PDW-----LDPRTESFRKVAAAFYRHQRELFGE-AGLFKMDLLHEGGDP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +++ A+W++ GW +P + LL++V +++V+
Sbjct: 329 GDVP--VPDAARAVETALRTARPGAIWVILGW--QENP------RRDLLDAVDHDRMLVV 378
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI------------------LDSI 222
D +++ + K + VPY + + NF G + L
Sbjct: 379 DGLSDLDTVTDREKDWGAVPYAFGTIPNFGGRTTIGAKTHMWTKRFTVWRDKPGSKLVGT 438
Query: 223 AFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
A+ P A R W Y+ RYG ++A+ L T Y
Sbjct: 439 AYMPEAAERDPAAFELFSELAWREEAVDRAEWFRSYAEMRYGGRDAKAREAFAALRDTAY 498
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ + V A P S+T NY + T ++D
Sbjct: 499 EISSKDGRPHDSVFAARP-------SLTARSGTNYA----------THTPAFDPAGF--- 538
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+V A L + +G L S+ YR+DL D+ RQALA + +L + +AY D
Sbjct: 539 --DVAFAALLGVRAG--LRDSDAYRHDLTDIARQALANRSWQLIPQLQDAYDRKDRTAFR 594
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
L+R +L+L+ D + H FLLGPWLE AK++A +E+ + E ART IT W D
Sbjct: 595 TLARLWLKLMRLSDDMTGAHRRFLLGPWLEDAKRMASGDEESARLERAARTLITTWADRA 654
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
+ L +Y N+ WSGL+ D++ P+ Y + ++L R D W + W
Sbjct: 655 TADGGKLANYANRDWSGLIADFHLPQWQSYLDELEDALAENRPPRAFD----WFAVEEPW 710
Query: 508 QNGRNVYPVESNGDALITSQWLYN 531
R YPV DA T+Q +Y
Sbjct: 711 TRERTSYPVRPTTDAHRTAQRVYE 734
>gi|384417770|ref|YP_005627130.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353460684|gb|AEQ94963.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 798
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 154/574 (26%), Positives = 239/574 (41%), Gaps = 111/574 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQQWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
V + +Y +++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAAYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
+ + +YG + + FG N+ +
Sbjct: 434 YGASNPLYGDVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYLYALAWEGPQH 493
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ +Y RYGRS A+ AW L +Y + P +
Sbjct: 494 PWSQWLARYLRARYGRSDAALLSAWTDLEAGIYQTRYWS---------------PRWWNT 538
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + D P + RA++ + + + + YRYD
Sbjct: 539 HAGAYLLFKRPTADIVNFD------DRPG---DPQRLRRAIDALLQQADRYADAPLYRYD 589
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
LI+ R L+ A+ +++AY D A G QL+R +LV+ +D L+ L
Sbjct: 590 LIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDALVGGQHETLAA 648
Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
++A + + Y NAR Q+++W + L DY +K W G+ D+Y R
Sbjct: 649 WTGQAAAAAGNDARLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 703
Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
+ + ++G F +L W R+W
Sbjct: 704 WTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 737
>gi|260910505|ref|ZP_05917173.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
F0295]
gi|260635347|gb|EEX53369.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
F0295]
Length = 1566
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 150/559 (26%), Positives = 254/559 (45%), Gaps = 83/559 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ +S + +L Q+++L R+ +LG+ PV+ F G VP + FP A+I
Sbjct: 199 MGNLEGWGGPMSESLIALRLQQQRQMLQRMRQLGIQPVVQGFPGIVPTFFKERFPQARII 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--T 118
+ G W S + LL D +F ++ A+ + K +G D F E T
Sbjct: 259 EQGKWGSFQRP-----AVLLPNNDGVFEKVAEAYYQSLTKLFGTDFEFLGGDLFHEGGIT 313
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + S+ A + M A W++QGW + +P PQ+ +L+ +
Sbjct: 314 TGVD----VGSVAAQVQRQMLRFFPRAKWVLQGW--NKNP---SPQLLRVLDK---RHTL 361
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA--------- 229
+++L E+ W +S +F G P++W +++F G +M G L I P A
Sbjct: 362 LVNLSGEIAASWESSDEFGGTPWLWGSVNHFGGKTDMGGQLPVIVTEPHRALALTVDSVM 421
Query: 230 -------------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYH 264
T+ + + QY RYG P + AW ++
Sbjct: 422 QGIGILPEGIGTNPVVYDLALKTAWHTATPDVDSMLVQYLGYRYGEVHPDLLAAWRIMLK 481
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
+VY EG +++ ++ ++ + S++ +
Sbjct: 482 SVYG---------------------EFAIKGEGTFESVF--CARPSLRVTSVSTWGPKQM 518
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
Y +++ RAL LF+ + +L S TY+YDL+DL RQ+LA YA + ++++AY+ +A
Sbjct: 519 QYQPADLYRALGLFLKAAPKLRDSETYQYDLVDLARQSLANYARTAYADVVKAYEAKNAE 578
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ Q ++RF L+ D LL + FLLG WL+ A Q A NE + NA+T I+ W
Sbjct: 579 QLQQATQRFERLIVLQDSLLLTNRHFLLGNWLQQATQYAPNEADRQLCLHNAQTLISYW- 637
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
E + + DY NK W+G+L YY PR +F+ + S+ +G+ + ++ ++
Sbjct: 638 -GPDEPTTKVHDYANKEWAGMLSTYYLPRWQAFFRVLQASINTGNPPAI-----DFFEME 691
Query: 505 NDWQNGRNVYPVESNGDAL 523
W N + GDA+
Sbjct: 692 KRWANTPQPINTKPQGDAV 710
>gi|403512485|ref|YP_006644123.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
[Nocardiopsis alba ATCC BAA-2165]
gi|402798758|gb|AFR06168.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
[Nocardiopsis alba ATCC BAA-2165]
Length = 718
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 139/509 (27%), Positives = 222/509 (43%), Gaps = 71/509 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL + GP+P+SW++ L +++L R LGM PVLP F+G+VP +L ++
Sbjct: 160 MGNLDHFAGPMPRSWIEGHRELGRRVLERQRALGMTPVLPGFTGHVPPSLAPGRTGSRTW 219
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q T++L TDPL+ + +E Q KE T H Y D F E P
Sbjct: 220 Q------------GLVTHVLVPTDPLYTTLCAEIVETQ-KELFDTDHQYAIDPFIEMIPV 266
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P + + A G+ D AVW +Q W FSY FW P +++A L+++P L +
Sbjct: 267 DSDPGFPGLVARATIEGLTRADPRAVWFLQTWPFSYQSDFWSPERVEAFLDAIPDDHLHL 326
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI----EMYGILDSIAFGPVEARTSENT 235
LDL+AE P WS F G P+ WC L NF G ++ G D I A E
Sbjct: 327 LDLWAEYDPQWSRFHAFGGTPWTWCALLNFGGRTDPMADLQGAADRIGAAKDSAHPPEGI 386
Query: 236 TMA----------------------------WINQYSVRRYGRSV-PAIQDAWNVLYHTV 266
++ W+ + +RYG PA+ + W L TV
Sbjct: 387 GLSMEATRNNPAFFELVVDQAWTRTGRVEEEWLPDFVAQRYGPGHDPALLEGWRGLLRTV 446
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
+ + FP+ ++++ + + + + + L++E ++ +WY
Sbjct: 447 LGASG---------VRIFPEQFNGVLTL-----RPHYRHLEDSSALRAEVTAL----VWY 488
Query: 327 STSEVIRALELFIASG--NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
+++ A E +A + L+ +DL+D+ L++ A+ +L ++E +
Sbjct: 489 PWPDLLAAWERLVAGAETDPLAVEGPLGHDLVDVAMAVLSRVADHRYLEMVEHLDHHPEL 548
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
L RFLE+ +D+D LL + W A A E + NAR +T+W
Sbjct: 549 PEGDLE-RFLEVFDDLDALLETRPEYRYRTWEAKATSWATGTEDHRVLTDNARRILTVW- 606
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
T + L DY + WSGL+ YY PR
Sbjct: 607 --TTLDDPRLDDYAGRLWSGLVGGYYRPR 633
>gi|375146756|ref|YP_005009197.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
koreensis GR20-10]
gi|361060802|gb|AEV99793.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
koreensis GR20-10]
Length = 1147
Score = 191 bits (485), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 146/552 (26%), Positives = 228/552 (41%), Gaps = 80/552 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+PQS +D + +L +K++ R+ LG+ PV+P F G VP N A++
Sbjct: 191 MGNIEGWGGPMPQSQIDSRKILVQKMIARMQALGIEPVMPGFYGMVPHNF-NTKSKARVI 249
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
GNW + + +LD TD F + F E+ K YGR ++ D F E
Sbjct: 250 TQGNWGA------FIRPAILDPTDTAFDRVAGIFYEETKKLYGRNIRFFSGDPFHEGG-- 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ + + GA I MQ A+W++QG W+ K LL L++
Sbjct: 302 ITNGVNLGKAGANIQKAMQQYFPGAIWVLQG--------WQDNPKKELLAETDKSALLIQ 353
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFA------GNIEMY-GILDSIAFGPVEA---- 229
+LF E W T + G P+IWC ++NF G +E Y G + A GP
Sbjct: 354 ELFGENTNNWETRNGYEGTPFIWCCVNNFGERPGLNGKLERYAGEVYRAATGPFREYMKG 413
Query: 230 -----------------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
T WIN Y RYG++ I AW + T+
Sbjct: 414 VGIMPEGINNNPASYDLVLELGWHNQPVETGKWINDYVKARYGKANDQIATAWTLFLQTI 473
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y+ +++ A P A+ SS+ Y
Sbjct: 474 YSNPGYQEGPPENILCARP------------------------ALQVKSVSSWGKLKKGY 509
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
T+ + ++ F A+ S TY+ DLI+ TRQ L+ A+ +F +++ AY+ +
Sbjct: 510 DTALFEKGVQAFAAAAPLFGNSETYKIDLINFTRQVLSNRADTVFASLVTAYKEENTVAF 569
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ FL L + LL H + L + + A + + K NA IT W +N
Sbjct: 570 NAAAEAFLSLHALTNELLNSHSYYRLTSYQQQALRSGNTPIERKNNLHNAMMLITYWGEN 629
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD---WRREWIKL 503
++E L +Y K W G++ +Y R +YF Y+ +L +G D W REW+
Sbjct: 630 NRQE-DYLHEYAYKEWGGMMTTFYQQRWKLYFDYLRNNL-AGKSVTPPDFFAWEREWVTQ 687
Query: 504 TNDWQNGRNVYP 515
++ YP
Sbjct: 688 NEQVKSEVQPYP 699
>gi|408676293|ref|YP_006876120.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
gi|328880622|emb|CCA53861.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
Length = 855
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 146/556 (26%), Positives = 229/556 (41%), Gaps = 81/556 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL G+ GP+ + ++ + L +I + LGM PVLP + G VP P A
Sbjct: 327 LQNLSGFAGPVSEQLIEARAALGARIARHLRSLGMTPVLPGYFGTVPPDFTARNPGAHTV 386
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W P W LD T P+F + + Q + +G S ++ D E P
Sbjct: 387 PQGRWVGF-GRPDW-----LDPTGPVFARLAAVYYRHQRQRFG-DSDMFKMDLLHEGGAP 439
Query: 121 --VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD +S+ A+ +++ A W+M GW + P ALL+ V +L+
Sbjct: 440 GTVD----VSAAAGAVQRALEAARPGATWVMLGWQLNPTP--------ALLHGVDRRRLL 487
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILD 220
++D ++ ++ G PY + + NF G+ + L
Sbjct: 488 IVDGLSDRYDELDRETRWGGTPYAFGTIPNFGGHTSIGANTGAWVSRFHAWLAKPDSALR 547
Query: 221 SIAFGPVEARTSEN-----TTMAW----INQ------YSVRRYGRSVPAIQDAWNVLYHT 265
IA+ P T+ T +AW I+Q Y+ RRYG + AW L
Sbjct: 548 GIAYLPEATGTNPVAFGLFTELAWQPGPIDQQRWFAGYAARRYGGADRHAAAAWEALRLG 607
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
Y+ G+ + +D + A PS+ + T ++ +
Sbjct: 608 PYSMRTGSWSEPQDSLFA---ARPSLTASTAARWSPKA--------------------MR 644
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
Y + V RAL + L S+ YR+D++D+ RQAL A L I AY+ D
Sbjct: 645 YDAATVERALAELLRVAPRLRTSDAYRFDVVDVARQALTNRARVLLPRIRAAYEARDLDA 704
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
L R + E + L+ FL+GPWL +A+ + + + E++AR+ +T W D
Sbjct: 705 FRALVREWGAAEELLGRLVGSDRRFLVGPWLAAARSWGADPAERDRLEYDARSILTTWAD 764
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
E+ L DY N+ WSGL+RD Y PR A YF + +L +G D W +
Sbjct: 765 RVPSESGGLHDYANREWSGLVRDVYAPRWAAYFASLDRALVNGTEPVAID----WFARDD 820
Query: 506 DWQNGRNVYPVESNGD 521
W G YP +GD
Sbjct: 821 AWARGHRSYPTLPSGD 836
>gi|32564213|ref|NP_496948.2| Protein K09E4.4 [Caenorhabditis elegans]
gi|25814792|emb|CAB70170.2| Protein K09E4.4 [Caenorhabditis elegans]
Length = 715
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 136/522 (26%), Positives = 227/522 (43%), Gaps = 81/522 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GG L + + L K+I+ R+ ELG+ P+LP F+G VP L+ +FP++K
Sbjct: 210 MGNLKAYGGGLSDAQMLNDHNLAKRIIDRLLELGITPILPTFAGFVPDHLETLFPASKFN 269
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
+L W + S+ C + DPLF +IG F+ Q K +G +++Y+ D F+E P
Sbjct: 270 RLPRWNNFTSET--SCMLSVSPFDPLFQKIGSTFLRHQKKMFGGDVTNMYSADPFNEILP 327
Query: 120 PVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ +++ AI + + D + VW++Q W F+YD W +K+ L+++P+G
Sbjct: 328 SESAKFDAKFVKQTAQAIMNSCKKVDKNCVWVLQSWSFTYDQ-WPAWAIKSFLSAIPVGN 386
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
L++LDL+AEV P W + F G ++WC+LHNF G+ E+ G L I G A +
Sbjct: 387 LLILDLYAEVVPAWQMTSSFQGHHFVWCLLHNFGGSRELRGNLQKIDKGYQLALMKAGSN 446
Query: 237 MA---------------------------------WINQYSVRRYGRSVPAIQDAWNVLY 263
+ W+ YS RY Q W +L
Sbjct: 447 LVGAGLSMEAIDQNYVVYQFMIDRMWSPEPLPLNNWLKAYSESRYSADFKVAQKFWTLLA 506
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
T YN + V + Y +P +
Sbjct: 507 GTFYNQPEKWGTPRFSVFL-------------------YHRPGFGRKI-----------E 536
Query: 324 LWYSTSEVI-RALELFIASGNELSASNTYRYDLIDLTRQALA-KYANELFLNIIEAYQLN 381
W+ E R EL A + L +R DL D+ R+ + NE L++ EA+ +
Sbjct: 537 YWFPVEETFSRFRELLPALVHVLGEHPLFREDLNDVMREMTQFEMGNEAALSMSEAFLME 596
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
D V +E+ + ++ + + W+E+AK +A E+ + + A +T
Sbjct: 597 DKQQVGASCEMLMEMFQKLES----YSNRDVRQWIENAKSIAPTSEERQVFPVTAGDILT 652
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
+W Q DY ++ W+GL+ YYG R + +++E
Sbjct: 653 VWGPTGQN-----LDYAHREWAGLMSGYYGRRWQYFCDWILE 689
>gi|421736727|ref|ZP_16175487.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
IPLA 20015]
gi|407295984|gb|EKF15606.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
IPLA 20015]
Length = 1044
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 150/537 (27%), Positives = 236/537 (43%), Gaps = 77/537 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q + L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 338 MQNLYSVGGPLPAAWFEQCVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 397
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++ F + Q +G+ S+ Y D F
Sbjct: 398 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVCDTFYKAQENVFGKVSNYYAVDPFH 456
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 457 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 508
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS--- 232
+ +VLDL ++++ ++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 509 QTLVLDLQSDLRSQ-ASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 567
Query: 233 -----------ENTTM------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
+N+ + +W +Y+ RRYG + I+ W++L
Sbjct: 568 MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKVWDILL 627
Query: 264 HTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
T Y TDG + +I A P D +I S S++ H
Sbjct: 628 DTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWGH 664
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
+ Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 665 SDIDYDKRQFEKAAALFEQAYDSYKNSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKSG 724
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQ 439
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 725 DLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARAL 784
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 785 VTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 838
>gi|341892319|gb|EGT48254.1| hypothetical protein CAEBREN_28412 [Caenorhabditis brenneri]
Length = 713
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 140/503 (27%), Positives = 233/503 (46%), Gaps = 43/503 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GG L + + L L K+I+ R+ ELG+ P+LP F+G VP L+ +FPS+K T
Sbjct: 208 MGNLKAYGGGLSDAQMLNDLNLAKRIINRLLELGITPILPTFAGFVPDQLEKLFPSSKFT 267
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
+L W + S+ C + DPLF +IG F+ Q K +G +++Y+ D F+E P
Sbjct: 268 RLPCWNNFTSET--SCLLSVSPFDPLFQKIGSLFLRHQKKMFGGDITNLYSADPFNEILP 325
Query: 120 PVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ +++ AI + + D + +W++Q W F+YD W +K+ L++VP+G
Sbjct: 326 SDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWSFTYDE-WPSWAIKSFLSAVPIGN 384
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT- 235
L++LDL++EV P W ++ F+G YIWCMLH+F G+ E+ G L + G A +
Sbjct: 385 LLILDLYSEVVPAWQSTSSFHGHNYIWCMLHSFGGSRELRGNLQKVDKGYQLALMKGGSN 444
Query: 236 ------TMAWIN------QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVA 283
TM I+ Q+ V R S P + W Y D ++A
Sbjct: 445 LIGAGLTMEAIDQNYVIYQFMVDRMWSSEPLPLNTWIKSYSESRYSADFKVSHKFWTLLA 504
Query: 284 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIAS-G 342
F + + + +G P L + W+ E L+ I S
Sbjct: 505 FSFYN---------QPEKWGNP-RFSVFLYHRPAFGKKIEYWFPVEETFGHLQSLIPSLI 554
Query: 343 NELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDM 400
+ L ++ DL D+ R A+ ++ N+ L + EA+ + D + + DM
Sbjct: 555 HVLGDHPLFKEDLNDVMR-AITQFEVGNDAALTLTEAFLMEDKQQIGSTCENLM----DM 609
Query: 401 DGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNK 460
L + + W+E +K +A E+ + + A +T+W Q DY ++
Sbjct: 610 FLKLESYSNRDMKHWIEDSKSIAATSEERQVFPATAADILTVWGPEGQN-----LDYAHR 664
Query: 461 YWSGLLRDYYGPRAAIYFKYMIE 483
W GLL YYG R + +++E
Sbjct: 665 EWEGLLSGYYGRRWQYFCDWILE 687
>gi|147860882|emb|CAN83148.1| hypothetical protein VITISV_031934 [Vitis vinifera]
Length = 562
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 86/100 (86%), Positives = 94/100 (94%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 100
+LGNWF+V +PRWCCTYLLDATDPLFIEIGRAFI+QQLK
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGRAFIQQQLK 334
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 82/92 (89%), Positives = 89/92 (96%)
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
DTFDENTPPVD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+S
Sbjct: 429 DTFDENTPPVDDPEYISSLGAAIFKGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHS 488
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW 203
VP+G+LVVLDLFAEVKPIW TS+QFYGVPYIW
Sbjct: 489 VPMGRLVVLDLFAEVKPIWITSEQFYGVPYIW 520
>gi|390989490|ref|ZP_10259787.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372555759|emb|CCF66762.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 798
Score = 189 bits (481), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 153/574 (26%), Positives = 232/574 (40%), Gaps = 111/574 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
+ + +YG + FG N+ +
Sbjct: 434 YGASNPLYGDFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEGPQQ 493
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ Y RYGRS A+ AW+ L +Y + P +
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + D P + RA++ + N + + YRYD
Sbjct: 539 RAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLYRYD 589
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
LI+ R L+ A+ +++AY D A G QL+R +LV +D L+ L
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGGQHETLAD 648
Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
++A + + Y NAR Q+++W + L DY +K W G+ D+Y R
Sbjct: 649 WTGQAAAATGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 703
Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
+ + ++G F +L W R+W
Sbjct: 704 WTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737
>gi|197302378|ref|ZP_03167435.1| hypothetical protein RUMLAC_01107 [Ruminococcus lactaris ATCC 29176]
gi|197298557|gb|EDY33100.1| F5/8 type C domain protein [Ruminococcus lactaris ATCC 29176]
Length = 1655
Score = 189 bits (480), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 156/580 (26%), Positives = 249/580 (42%), Gaps = 108/580 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ +W ++ L +K + + +LGM PVL +SG VP + + PSA++
Sbjct: 680 MANLSGYGGPVHDTWFTERTELARKNQLIMRKLGMQPVLQGYSGMVPVDITSKDPSAEVI 739
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S + +L F + F + Q + YG ++H Y D F E NT
Sbjct: 740 KQGTWCSFQRPS------MLRTDSESFTKYAALFYKVQKEVYGDSAHYYATDPFHEGGNT 793
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-- 176
+DS + + + + M + D A W++Q W+ ALL + +
Sbjct: 794 GGMDS----AVISQKVLASMMTADPHATWVIQS--------WQGNPTTALLQGLGDNRDH 841
Query: 177 LVVLDLFAEVKPIWSTSK-----------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
+VLDL+AE P W+ + +F P+++CML+NF G + ++G +D+ G
Sbjct: 842 ALVLDLYAEKTPHWNETNPGYYGGAEGGGEFLNTPWVYCMLNNFGGRLGLHGHIDNYVEG 901
Query: 226 PVEARTSENTTMA--------------------------------------WINQYSVRR 247
V A + + MA W Y RR
Sbjct: 902 IVNA-SKQAEHMAGIGITPEASVNNPVLYDLFFETIWADDGNNLQKINLDEWFKNYVTRR 960
Query: 248 YGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKP 305
YG + A +L+ TVYN + V+ A P +D G +G
Sbjct: 961 YGADSDSAYQAMEILHDTVYNPAYNMKGQGAPESVVNARPGLDI-------GAASTWGNA 1013
Query: 306 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 365
V + Y ++ +A EL +A ++L S Y+YDL ++ Q L+
Sbjct: 1014 V-----------------VDYDKKKLEKAAELLLADYDKLKNSAGYQYDLANVLEQVLSN 1056
Query: 366 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 425
A E + A++ DA LS +FL +++ ++ + FL+G W+ AK+LA+N
Sbjct: 1057 TAQEYQKKMAAAFRSGDAEEFSTLSDKFLSIIDMVEKVTGTQKEFLVGTWINGAKKLAKN 1116
Query: 426 EEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
+ ++ YE NAR+ IT W Q + L DY N+ W+GL DYY R + +
Sbjct: 1117 SDDFTKELYELNARSLITTWGSYDQAISGGLIDYSNRQWAGLTNDYYKMRWEKWITERKK 1176
Query: 484 SL--ESGDGFRLKDW-RREWIKLTNDWQNGRNVYPVESNG 520
L ES + +DW EW W G N Y NG
Sbjct: 1177 ELAGESYTNYSAQDWFEMEWA-----WARGTNKYSGTPNG 1211
>gi|294812279|ref|ZP_06770922.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
gi|294324878|gb|EFG06521.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
Length = 1086
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 148/564 (26%), Positives = 237/564 (42%), Gaps = 87/564 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGP+ + LD+++ L ++I+ R+ LGM PV+P + G VP P A++
Sbjct: 257 LQNMSEYGGPVSPALLDRRIELGQRIVTRMRRLGMRPVVPGYFGTVPDGFVARNPGARVI 316
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD P+F EI A+ Q + +G H + D E P
Sbjct: 317 PQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRHQEELFGEIDH-FKMDLLHEGGTP 369
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +++ A W++ GW + P ALL+++ K++++
Sbjct: 370 GDVP--VPDAARAVETALRAARPAATWVILGWQSNPRP--------ALLDAIDTSKVLIV 419
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD------------------SI 222
D +++ + ++ G PY + + NF G + D
Sbjct: 420 DGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPNSALVGT 479
Query: 223 AFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
A+ P A R + AW Y+ RYG PA ++A+ L T Y
Sbjct: 480 AYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYAQFRYGAKDPAAEEAFAALAGTAY 539
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T T R + F + + ++ S +++D
Sbjct: 540 QLT---TTDGRPIDSLF---------------------LRRPSMSSSVATAFDQAAFDRG 575
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ ++R E EL S+ YRYDL DL RQALA + L L + AY D
Sbjct: 576 FAALLRVNE-------ELRGSDAYRYDLTDLARQALALRSRTLQLALRAAYATKDVTAFR 628
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
++ +L L+ D + CH FLLGPWLE AK+ A + E+ + E AR IT W D
Sbjct: 629 GVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFATSTEEAVELERTARVLITTWGD-- 686
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
+ A L +Y N+ W GL+ D + P+ YF + +L G + D W W
Sbjct: 687 RAAAVELSNYANRDWQGLIGDVHVPQWEQYFTEVATALAEGRAPKAID----WYPGEETW 742
Query: 508 QNGRNVYPVESNGDALITSQWLYN 531
R YPV GD +Q +++
Sbjct: 743 TKDRRPYPVRPTGDVHKVAQRVHD 766
>gi|326440885|ref|ZP_08215619.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
Length = 1038
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 148/564 (26%), Positives = 237/564 (42%), Gaps = 87/564 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGP+ + LD+++ L ++I+ R+ LGM PV+P + G VP P A++
Sbjct: 209 LQNMSEYGGPVSPALLDRRIELGQRIVTRMRRLGMRPVVPGYFGTVPDGFVARNPGARVI 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD P+F EI A+ Q + +G H + D E P
Sbjct: 269 PQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRHQEELFGEIDH-FKMDLLHEGGTP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +++ A W++ GW + P ALL+++ K++++
Sbjct: 322 GDVP--VPDAARAVETALRAARPAATWVILGWQSNPRP--------ALLDAIDTSKVLIV 371
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD------------------SI 222
D +++ + ++ G PY + + NF G + D
Sbjct: 372 DGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPNSALVGT 431
Query: 223 AFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
A+ P A R + AW Y+ RYG PA ++A+ L T Y
Sbjct: 432 AYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYAQFRYGAKDPAAEEAFAALAGTAY 491
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T T R + F + + ++ S +++D
Sbjct: 492 QLT---TTDGRPIDSLF---------------------LRRPSMSSSVATAFDQAAFDRG 527
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ ++R E EL S+ YRYDL DL RQALA + L L + AY D
Sbjct: 528 FAALLRVNE-------ELRGSDAYRYDLTDLARQALALRSRTLQLALRAAYATKDVTAFR 580
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
++ +L L+ D + CH FLLGPWLE AK+ A + E+ + E AR IT W D
Sbjct: 581 GVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFATSTEEAVELERTARVLITTWGD-- 638
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
+ A L +Y N+ W GL+ D + P+ YF + +L G + D W W
Sbjct: 639 RAAAVELSNYANRDWQGLIGDVHVPQWEQYFTEVATALAEGRAPKAID----WYPGEETW 694
Query: 508 QNGRNVYPVESNGDALITSQWLYN 531
R YPV GD +Q +++
Sbjct: 695 TKDRRPYPVRPTGDVHKVAQRVHD 718
>gi|268533054|ref|XP_002631655.1| Hypothetical protein CBG20846 [Caenorhabditis briggsae]
Length = 712
Score = 188 bits (478), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 131/523 (25%), Positives = 234/523 (44%), Gaps = 83/523 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GG L + + L K+I+ R+ ELG+ P+LP FSG VP L+ +FP++K
Sbjct: 207 MGNLKGYGGGLSDAQMLNDFNLAKRIINRLLELGITPILPTFSGFVPDRLEKLFPTSKFN 266
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
+L W + S+ C + DPLF +IG +F+ Q K G +++Y+ D F+E P
Sbjct: 267 RLPCWNNFTSET--SCLLSVSPFDPLFQKIGSSFLRHQKKMLGGDITNLYSADPFNEVLP 324
Query: 120 PVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ +++ AI + + D + +W++Q W F+YD W +K+ L++VP+G+
Sbjct: 325 SDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWSFTYDQ-WPNWAIKSFLSAVPIGQ 383
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDL++EV P W + F+G ++WCMLHNF G+ E+ G + + G A +
Sbjct: 384 MLILDLYSEVVPAWQMTSSFHGHNFVWCMLHNFGGSRELRGNVQKVDKGYQLALMKAGSN 443
Query: 237 M---------------------------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
+ +W+ YS RY W +L
Sbjct: 444 LVGAGLSMEAIDQNYMMYQFMIDRMWTQEPIPLNSWLKSYSESRYSADFKVAHKFWTILA 503
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
+ YN + + V + Y +P + +
Sbjct: 504 GSFYNQPEKWGNPRFSVFL-------------------YHRPAFGKKI-----------E 533
Query: 324 LWYSTSEVIRALE-LFIASGNELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQL 380
W+ E LE L ++ + L ++ DL D+ R A+ ++ NE L++ EA+ +
Sbjct: 534 YWFPVEETFTHLESLVLSLLHILGDHPLFKEDLNDVMR-AITQFEIGNEAALSLTEAFLM 592
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
D + + + + ++ + + W+E AK +A E+ + + +A +
Sbjct: 593 EDKQQIGTTCENLMGMFQKLEP----YSNRDVRDWIEDAKSIAPTTEEREVFPISASDIL 648
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
T+W Q DY ++ W+GLL YYG R + +++E
Sbjct: 649 TVWGPTGQN-----LDYAHREWAGLLSGYYGRRWQYFCDWILE 686
>gi|331092442|ref|ZP_08341267.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330401285|gb|EGG80874.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 1598
Score = 187 bits (476), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 144/574 (25%), Positives = 248/574 (43%), Gaps = 111/574 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + LGM PVL +SG VP ++ SA++
Sbjct: 680 MANLSGFGGPIHDSWFEERTELARKNQLSMRRLGMQPVLQGYSGMVPTNIREKDSSAEVI 739
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--- 117
+ G W S + +L F + + F + Q + YG ++H Y D F E
Sbjct: 740 EQGTWCSFRRPD------MLKTDSASFDKYAKLFYQAQKEVYGESAHYYATDPFHEGGDT 793
Query: 118 ---TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
P V + + M D D +W++Q W+ ALL +
Sbjct: 794 GGLNPTV--------IAGKVLDAMLEADKDGIWIIQS--------WQGNPTTALLKGLEG 837
Query: 175 GK--LVVLDLFAEVKPIWSTSK-------QFYGVPYIWCMLHNFAGNIEMYGILDSIA-- 223
K +VLDL+AE P W+ + +F P+++CML+NF G + ++G LD++A
Sbjct: 838 RKEHALVLDLYAEKTPHWNETNPNEYGGGEFNDTPWVFCMLNNFGGRLGLHGHLDNLAKN 897
Query: 224 -------------FGPVEARTSENTTM-----------------------AWINQYSVRR 247
G + N + W+ Y+ RR
Sbjct: 898 IPAALNSAKHMEGIGITPEASVNNPLLYDFLFETVWTDNAKEKLPVIDLDKWLKDYAKRR 957
Query: 248 YGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 302
YG+ + +A ++ TVY GA + V+ A P +D S +
Sbjct: 958 YGKESQSAYEALLIMKDTVYKAELNMKGQGAPES---VVNARPALDIGAAST-------W 1007
Query: 303 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 362
G V + Y +++ +A EL + ++L S+ Y YDL + +Q
Sbjct: 1008 GNAV-----------------ISYDKAKLEKAAELLLKDYDKLKDSDGYMYDLATMLQQV 1050
Query: 363 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 422
L+ A E + A++ N+ + +FL +++ M+ + + +LLG W+E AK L
Sbjct: 1051 LSNSAQEYQRKMANAFKENNKEEFNTYADKFLSIIDSMEKVTSTSKYYLLGTWVEQAKAL 1110
Query: 423 AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 480
A+N + + YE+NA+ +T W Q E L+DY N+ WSGLL+D+Y R + +
Sbjct: 1111 AKNADDFTKDLYEFNAKALVTTWGSINQAEGGGLKDYSNRQWSGLLKDFYKVRWQKWIQA 1170
Query: 481 MIESLESGDGFRLK--DWRREWIKLTNDWQNGRN 512
+ L+ + +W +W++ ++ N N
Sbjct: 1171 RNDELDGKQPENINWFEWEWKWVRENTEYTNTPN 1204
>gi|374990497|ref|YP_004965992.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
gi|297161149|gb|ADI10861.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
Length = 1001
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 148/564 (26%), Positives = 232/564 (41%), Gaps = 78/564 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ G+GGP+ L +++ L +KI R+ ELGM PV P + G VP + P A+
Sbjct: 174 LQNMSGYGGPVSPELLAKRIALGQKIAERLRELGMRPVYPGYFGTVPDGFVDRNPGARTV 233
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + + P W LD F ++ AF Q + +G ++ D E
Sbjct: 234 PQGTWNGL-ARPDW-----LDPRTESFGQVAAAFYRHQQELFGECD-LFKMDLLHEGGAA 286
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ A+ + +Q+ A W++ GW + + LL++V ++V+
Sbjct: 287 GDVP--VADAARAVETALQTARPGATWVILGWQAN--------PRRELLDAVNHDHMLVV 336
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI----------------------EMYGI 218
D +++ I + + VPY + + NF G ++ G
Sbjct: 337 DGLSDLDSIGDREQDWGSVPYAFGTIPNFGGRTTIGAKTHIWARRFTQWRDKPGSKLVGT 396
Query: 219 L--------DSIAFGPVEARTSENTTM---AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
D AF NT + W Y+ R G +DA+ L T Y
Sbjct: 397 AYMAEAVGRDPAAFELFSELAWRNTAVDRDEWFRTYADVRLGGRDERARDAYAALRDTAY 456
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
T + V A PD VT NY + +
Sbjct: 457 QITSSDGRPHDSVFSARPD-------VTARSGTNYATRIPA-----------------FD 492
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
++ AL + L S+ YR+DL D+ RQALA + L ++ +AY+ D
Sbjct: 493 LADFDPALAALLDVRPSLRDSDAYRHDLTDIARQALADRSWTLIPHLHDAYERKDLEAFR 552
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
L+R +L+L+ D + H GFLLGPWLE AK+LA +E + E ART IT W D
Sbjct: 553 TLARLWLKLMRLSDDMTGAHRGFLLGPWLEDAKRLASDEAEAAHLEHLARTLITTWADRV 612
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
+ L +Y N+ W+GL+ D++ P+ Y + ++L G R D W + W
Sbjct: 613 TADTGKLANYANRDWNGLIGDFHLPQWQSYLDELEDALAEGREPRDFD----WFAVEEPW 668
Query: 508 QNGRNVYPVESNGDALITSQWLYN 531
R YPV DA T + +Y
Sbjct: 669 TRERKSYPVRPTTDAHRTGRRVYE 692
>gi|282877909|ref|ZP_06286718.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
gi|281299910|gb|EFA92270.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
Length = 717
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 148/550 (26%), Positives = 232/550 (42%), Gaps = 104/550 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W QQ+ LQ KI+ R+ ELGM+P+ PAF+G VP A P
Sbjct: 195 MGNLNQWDGPLSDAWHKQQITLQHKIISRMRELGMHPIAPAFAGFVPKAFAKKHPEINFK 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W Y+L F ++G+ FIE+ +E+G ++ Y D+F+E P
Sbjct: 255 HL-RWGGFADS---LNAYVLPPESSYFKQLGKLFIEEWEREFGENTY-YLSDSFNEMKLP 309
Query: 121 V------DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
V + ++ G AIY + +G+ A+W+ QGW F Y FW + ALL+ VP
Sbjct: 310 VNPNDEEEKCRLLAEYGKAIYQSINAGNPHAIWVTQGWTFGYQHDFWNRKSLSALLSQVP 369
Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFA------GNIEMYGILD 220
+++++DL + + W FYG +I+ + NF G++EMY
Sbjct: 370 NDRMIIIDLGNDYPKWVWHTEQTWKRHNGFYGKQWIFSYVPNFGGKTLLTGDLEMYATDA 429
Query: 221 SIAFGPVE---------------------------ARTSENTTM-AWINQYSVRRYGRSV 252
S+A A T + + WI Y + RYG+
Sbjct: 430 SLALSAANKGNLVGIGSAPEGLENNEVVYELLSDAAWTDKGINLDEWIANYCMARYGKYP 489
Query: 253 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD-PSIISVTEGKYQNYGKPVSKEAV 311
++ AWN +VY+ + ++P ++I T K
Sbjct: 490 DKMKAAWNGFRKSVYSS-----------LYSYPRFTWQTVIPDTRRK------------- 525
Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
S +D ++ +A+E F++ +EL + Y+ D I Q L A+ +
Sbjct: 526 -----SRHDLNETYF------KAVEDFLSCADELGGAKFYQDDAILFAAQYLGAKADIYY 574
Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
N + LN + + +EL+ D +LA H L W+ A+ +++ Q
Sbjct: 575 ENALRYGSLNKHVEANKQLSKAIELLLFADKILASHPTDRLDVWIAKARSQGHTPQEKNQ 634
Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 491
YE NA+ IT W + + DY + WSGL++DYY PR IYF S
Sbjct: 635 YEANAKRLITTWGGHQE-------DYAARCWSGLIKDYYIPRIQIYF--------SNQRK 679
Query: 492 RLKDWRREWI 501
L W WI
Sbjct: 680 MLDQWEENWI 689
>gi|21241480|ref|NP_641062.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
gi|21106823|gb|AAM35598.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
Length = 798
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 153/574 (26%), Positives = 232/574 (40%), Gaps = 111/574 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASRAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
+ + +YG + FG N+ +
Sbjct: 434 YGASNPLYGDFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEGPQQ 493
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ Y RYGRS A+ AW+ L +Y + P +
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + D P + RA++ + N + + YRYD
Sbjct: 539 RAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLYRYD 589
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
LI+ R L+ A+ +++AY D A G QL+R +LV +D L+ L
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALVGGQHETLAD 648
Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
++A + + Y NAR Q+++W + L DY +K W G+ D+Y R
Sbjct: 649 WTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 703
Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
+ + ++G F +L W R+W
Sbjct: 704 WTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737
>gi|317501265|ref|ZP_07959469.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
8_1_57FAA]
gi|316897332|gb|EFV19399.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
8_1_57FAA]
Length = 1847
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 142/533 (26%), Positives = 235/533 (44%), Gaps = 103/533 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + +LGM PVL +SG VP + + +A++
Sbjct: 658 MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 717
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W S + +L T F + + F + Q + YG S+ Y D F E
Sbjct: 718 EQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 769
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
+ S + + + M + D DAVW++Q W + ALLN +
Sbjct: 770 ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 821
Query: 176 KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA-- 223
++LDL+AE P + ++F P+++CML+NF G + ++G LD++A
Sbjct: 822 HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLANN 881
Query: 224 -------------FGPVEARTSENTTM-----------------------AWINQYSVRR 247
G + N + W++ Y+ RR
Sbjct: 882 IPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATRR 941
Query: 248 YGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 302
YG + AW++L TVY + GA + V+ A P++ T G +
Sbjct: 942 YGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAASTW 991
Query: 303 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 362
G V + Y ++ A L +A ++L S Y+YDL ++ +Q
Sbjct: 992 GNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQV 1034
Query: 363 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 422
L+ A E + A+ D S +F+ ++EDM+ + + FLLG W+E AK L
Sbjct: 1035 LSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKAL 1094
Query: 423 AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
A N + ++ YE+NA+ +T W Q E L+DY N+ WSGL+ D+Y R
Sbjct: 1095 ANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKDYSNRQWSGLIGDFYKAR 1147
>gi|418515337|ref|ZP_13081518.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|410708056|gb|EKQ66505.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 782
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 154/574 (26%), Positives = 232/574 (40%), Gaps = 111/574 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 185 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 245 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 297
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 298 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 357
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 358 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 417
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
+ + +YG + FG N+ +
Sbjct: 418 YGASNPLYGDFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEGPQQ 477
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ Y RYGRS A+ AW+ L +Y + P +
Sbjct: 478 SWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 522
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + A D P + RA++ + N + + YRYD
Sbjct: 523 RAGAYLLFKRPTADIADFD------DRPG---DPQRLRRAIDALLQQANRYADAPLYRYD 573
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
LI+ R L+ A+ +++AY D A G QL+R +LV +D L+ L
Sbjct: 574 LIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALVGGQYETLAD 632
Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
++A + + Y NAR Q+++W + L DY +K W G+ D+Y R
Sbjct: 633 WTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 687
Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
+ + +G F +L W R+W
Sbjct: 688 WTRFLSAYRAARMAGTPFDAVAMDHQLATWERQW 721
>gi|418520969|ref|ZP_13087015.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410702945|gb|EKQ61442.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
Length = 798
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 154/574 (26%), Positives = 232/574 (40%), Gaps = 111/574 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
+ + +YG + FG N+ +
Sbjct: 434 YGASNPLYGDFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEGPQQ 493
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ Y RYGRS A+ AW+ L +Y + P +
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + A D P + RA++ + N + + YRYD
Sbjct: 539 RAGAYLLFKRPTADIADFD------DRPG---DPQRLRRAIDALLQQANRYADAPLYRYD 589
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
LI+ R L+ A+ +++AY D A G QL+R +LV +D L+ L
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALVGGQYETLAD 648
Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
++A + + Y NAR Q+++W + L DY +K W G+ D+Y R
Sbjct: 649 WTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 703
Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
+ + +G F +L W R+W
Sbjct: 704 WTRFLSAYRAARMAGTPFDAVAMDHQLATWERQW 737
>gi|153814573|ref|ZP_01967241.1| hypothetical protein RUMTOR_00787 [Ruminococcus torques ATCC 27756]
gi|331089988|ref|ZP_08338878.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
3_1_46FAA]
gi|145848067|gb|EDK24985.1| F5/8 type C domain protein [Ruminococcus torques ATCC 27756]
gi|330402902|gb|EGG82468.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 1863
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 142/533 (26%), Positives = 235/533 (44%), Gaps = 103/533 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + +LGM PVL +SG VP + + +A++
Sbjct: 674 MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 733
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W S + +L T F + + F + Q + YG S+ Y D F E
Sbjct: 734 EQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 785
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
+ S + + + M + D DAVW++Q W + ALLN +
Sbjct: 786 ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 837
Query: 176 KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA-- 223
++LDL+AE P + ++F P+++CML+NF G + ++G LD++A
Sbjct: 838 HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLANN 897
Query: 224 -------------FGPVEARTSENTTM-----------------------AWINQYSVRR 247
G + N + W++ Y+ RR
Sbjct: 898 IPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATRR 957
Query: 248 YGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 302
YG + AW++L TVY + GA + V+ A P++ T G +
Sbjct: 958 YGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAASTW 1007
Query: 303 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 362
G V + Y ++ A L +A ++L S Y+YDL ++ +Q
Sbjct: 1008 GNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQV 1050
Query: 363 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 422
L+ A E + A+ D S +F+ ++EDM+ + + FLLG W+E AK L
Sbjct: 1051 LSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKAL 1110
Query: 423 AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
A N + ++ YE+NA+ +T W Q E L+DY N+ WSGL+ D+Y R
Sbjct: 1111 ANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKDYSNRQWSGLIGDFYKAR 1163
>gi|336439030|ref|ZP_08618649.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
1_1_57FAA]
gi|336017072|gb|EGN46842.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 1863
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 142/533 (26%), Positives = 235/533 (44%), Gaps = 103/533 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + +LGM PVL +SG VP + + +A++
Sbjct: 674 MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 733
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W S + +L T F + + F + Q + YG S+ Y D F E
Sbjct: 734 EQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 785
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
+ S + + + M + D DAVW++Q W + ALLN +
Sbjct: 786 ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 837
Query: 176 KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA-- 223
++LDL+AE P + ++F P+++CML+NF G + ++G LD++A
Sbjct: 838 HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLANN 897
Query: 224 -------------FGPVEARTSENTTM-----------------------AWINQYSVRR 247
G + N + W++ Y+ RR
Sbjct: 898 IPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATRR 957
Query: 248 YGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 302
YG + AW++L TVY + GA + V+ A P++ T G +
Sbjct: 958 YGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAASTW 1007
Query: 303 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 362
G V + Y ++ A L +A ++L S Y+YDL ++ +Q
Sbjct: 1008 GNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQV 1050
Query: 363 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 422
L+ A E + A+ D S +F+ ++EDM+ + + FLLG W+E AK L
Sbjct: 1051 LSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKAL 1110
Query: 423 AQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
A N + ++ YE+NA+ +T W Q E L+DY N+ WSGL+ D+Y R
Sbjct: 1111 ANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKDYSNRQWSGLIGDFYKAR 1163
>gi|381169859|ref|ZP_09879021.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689629|emb|CCG35508.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 798
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 153/574 (26%), Positives = 232/574 (40%), Gaps = 111/574 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
+ + +YG + FG N+ +
Sbjct: 434 YGASNPLYGDFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWESPQQ 493
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ Y RYGRS A+ AW+ L +Y + P +
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRWWNK 538
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + D P + RA++ + N + + YRYD
Sbjct: 539 RAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLYRYD 589
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
LI+ R L+ A+ +++AY D A G QL+R +LV +D L+ L
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALIGGQYETLAD 648
Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
++A + + Y NAR Q+++W + L DY +K W G+ D+Y R
Sbjct: 649 WTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYLQR 703
Query: 474 AAIYFKYMIESLESGDGF-------RLKDWRREW 500
+ + ++G F +L W R+W
Sbjct: 704 WTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737
>gi|294667089|ref|ZP_06732314.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 10535]
gi|292603099|gb|EFF46525.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 10535]
Length = 798
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 158/576 (27%), Positives = 236/576 (40%), Gaps = 115/576 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWTDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
+ + +YG + FG N+ +
Sbjct: 434 YGASNPLYGDFAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVIYAYLYALAWEGPQQ 493
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ Y RYGRS A+ AW L +Y + P +
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQTRYWS---------------PRWWNK 538
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + D P + RA++ + N + + YRYD
Sbjct: 539 RAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLQQANRYADAPLYRYD 589
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLA-CHDGFLL 412
LI+ R L+ A+ +++AY D A G QL+R +LV +D L+ HD L
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGDQHD--TL 646
Query: 413 GPWL-ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 471
W ++A + + Y NAR Q+++W + L DY +K W G+ D+Y
Sbjct: 647 ADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYL 701
Query: 472 PRAAIYFKYMIESLESGDGF-------RLKDWRREW 500
R + + ++G F +L W R+W
Sbjct: 702 QRWTRFLSAYRAARKAGTPFDAVTVDHQLAAWERQW 737
>gi|374985456|ref|YP_004960951.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
gi|297156108|gb|ADI05820.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
Length = 1039
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 156/560 (27%), Positives = 233/560 (41%), Gaps = 88/560 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ G+GGP+ + +DQ+ L KKI+ R+ ELGM PVLP + G VP P A +
Sbjct: 214 LQNMSGFGGPVSKHLIDQRAALAKKIINRVRELGMTPVLPGYYGTVPDDFLAKNPGASLV 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + K P W LD LF E+ AF Q + YG +S +Y D E P
Sbjct: 274 AQGTWGAFKR-PDW-----LDPRTDLFAEVAAAFYRHQRERYGDSS-MYKMDLLHEGGNP 326
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +Q + AVW + G W+ + +L +V ++V+
Sbjct: 327 GDVP--VGEAAKAVEAALQKAHAGAVWAILG--------WQTNPSREILGAVDKSMMLVV 376
Query: 181 DLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGN----------IEMY--------GILDS 221
D ++ + + G PY + + NF G+ +E Y L
Sbjct: 377 DGLSDRYTTVIDRESDWDGTPYAFGSIWNFGGHTPIGANAPDWVEQYPKWRDKTGSALTG 436
Query: 222 IAFGPVEARTSENTTMA----------------WINQYSVRRYGRSVPAIQDAWNVLYHT 265
IA P E + MA W Y+V RYG P AW + T
Sbjct: 437 IAMMP-EGADNNPAAMALFTDLAWTPGAIGLDDWFASYAVSRYGGEDPHAVAAWKAIRDT 495
Query: 266 VYNCTDGATDKNRDVIVAFPD----VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
YN + D PD PS+ +K A E YD
Sbjct: 496 AYNMS------RADAWSEAPDGLFGARPSL-------------GANKAAAWGPEADRYDT 536
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
+ +E+++ +A G L S+ Y YDL D+ RQ L+ + L I AY+
Sbjct: 537 TAFDAALTELLQ-----VAPG--LRDSSAYAYDLADVARQVLSNRSRVLLPQIKTAYEAG 589
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
D +L++ +L ++ MD +LA LLG WL A+ ++ Q E++AR+ IT
Sbjct: 590 DRGRFDRLTKTWLSWMKLMDKVLATSGQHLLGRWLADARSWGATRAEKDQLEYDARSIIT 649
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 501
W E L DY N+ WSGLL Y R YF + +L +G + +W
Sbjct: 650 TWGGRASSEEG-LHDYANREWSGLLGGLYHLRWKTYFDELSTALAAG----RQPAGIDWF 704
Query: 502 KLTNDWQNGRNVYPVESNGD 521
L + W + YPV ++GD
Sbjct: 705 ALEDHWARRHDSYPVRTSGD 724
>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
Length = 1552
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 139/525 (26%), Positives = 242/525 (46%), Gaps = 78/525 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ ++ ++ + LQ+K+L R+ LG+ PV+ F G VP+ + FP+A++
Sbjct: 196 MGNLEGWGGPMSEALIEARYQLQRKMLQRMQALGIQPVVQGFPGLVPSFFKERFPAAQLV 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P LL + LF ++ +A+ E ++ YGR D F E NT
Sbjct: 256 LQGRWGHFNRPP-----MLLPSDKDLFQQVAKAYYESLIRCYGRDFKFLGGDLFHEGGNT 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD +++ + S A W++QGW + P LL+ + ++
Sbjct: 311 KGVDVAATAAAVQQTMLRYFPS----AKWVLQGWNNNPSP--------TLLSKLDKQHVL 358
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTM 237
+++L E+ W +S +F G P++W +++F G +M G L + P A + ++N M
Sbjct: 359 LINLSGEIAASWESSNEFGGTPWLWGSVNHFGGKTDMGGQLPVLVAEPHRAFSQTKNGVM 418
Query: 238 -----------------------AW----------INQYSVRRYGRSVPAIQDAWNVLYH 264
AW + Y RYG ++ AW++L H
Sbjct: 419 QGIGILPEGINSNPVVYDLALKTAWYTTTPDLDRLLRDYIAYRYGHVDESLVQAWHILSH 478
Query: 265 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
+VY EG +++ ++ + + S++ +
Sbjct: 479 SVYG---------------------EFKIKGEGTFESIF--CARPGLHVTSVSTWGPKQM 515
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
Y+ ++ +AL LF ++ S TY+YDL+DL RQ +A +A +++ ++AY+ DA
Sbjct: 516 QYNPKDLEKALGLFRRVADQYKGSATYQYDLVDLARQVMANHARDVYAAAMQAYRNKDAA 575
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
+ + + F+ L++ D LL FLLG WL A ++Q NA+ IT W
Sbjct: 576 LLHEKGQEFMHLLQLQDRLLQTDTHFLLGNWLAQAANYGVTAADKQQALHNAKMLITYWG 635
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
++ A+ + DY NK W+GLL+ YY PR +F + +S+ +G+
Sbjct: 636 PDS--AATRVHDYANKEWAGLLKSYYEPRWQKFFYALYQSVNTGE 678
>gi|406693970|gb|EKC97309.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
asahii CBS 8904]
Length = 781
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 149/574 (25%), Positives = 244/574 (42%), Gaps = 74/574 (12%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+HG W G WL+ Q LQK+IL R E GM PVLP F G VP L N
Sbjct: 238 NIHGNWHGTTTWQWLEGQHNLQKQILARQREFGMTPVLPGFCGFVPPELHNYIGGPDFKT 297
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W S ++ + +D + + AF+ +Q + YG TS Y D F E+ P
Sbjct: 298 YPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRKQKELYGFTSDYYMVDLFTESKPTS 355
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVL 180
P Y+ + A+ + + +A W+MQGW+F DP W KA L+ L+VL
Sbjct: 356 TDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVNDPKSWTETASKAFLDGAG-ESLLVL 414
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN------ 234
DL AE P W K F+G ++WC L N+ N +YG LD ++A+ +
Sbjct: 415 DLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLYGALDKWNHDIMDAKANGGRLSGMG 474
Query: 235 ----------------TTMAWINQ----------YSVRRY-GRSVPAIQDAWNVLYHTVY 267
T W +Q + RRY G+++ Q AW +L ++VY
Sbjct: 475 IVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNWVKRRYRGQNLDLAQKAWELLDNSVY 534
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ A ++ D+ P++ + L T +Y + Y
Sbjct: 535 KSNNTALKCTTRSLI---DLRPAV------------------SGLIGTTGNYLATAITYE 573
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+V+ AL+ + S + + + YDL+D+ RQ A ++ +I A+ ++
Sbjct: 574 PRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQVFVNAAIPIYQAMINAWNGSNKADTE 632
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ R + L+ D+D L+A F L W+ A+ AQ+ + E+ AR Q+ +W T
Sbjct: 633 KYGRELVGLINDIDRLMATSRHFRLESWVGDARNWAQDAGAKDDMEFQARNQLILWGAAT 692
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI------ 501
L R Y K+W G++ + Y + ++ ++++ K W +
Sbjct: 693 FAPWPLDR-YAAKHWHGIMSEVYAKGWELLYQNLLKT-------EPKAWNKTAFASELME 744
Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
K+ W+N ++ GD++ + L KY Q
Sbjct: 745 KVEKPWENVKSGGVQGPQGDSVAVIRELREKYKQ 778
>gi|401885538|gb|EJT49648.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
asahii CBS 2479]
Length = 781
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 149/574 (25%), Positives = 241/574 (41%), Gaps = 74/574 (12%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+HG W G WL+ Q LQK+IL R E GM PVLP F G VP L N
Sbjct: 238 NIHGNWHGTTTWQWLEGQHNLQKQILARQREFGMTPVLPGFCGFVPPELHNYIGGPDFKT 297
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W S ++ + +D + + AF+ +Q + YG TS Y D F E+ P
Sbjct: 298 YPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRKQKELYGFTSDYYMVDLFTESKPTS 355
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVL 180
P Y+ + A+ + + +A W+MQGW+F DP W KA L+ L+VL
Sbjct: 356 TDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVNDPKSWTETASKAFLDGAG-ESLLVL 414
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDS---------------IAFG 225
DL AE P W K F+G ++WC L N+ N +YG LD G
Sbjct: 415 DLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLYGALDKWNHDIMDAKANGGRLSGMG 474
Query: 226 PVEARTSENTTM-----------------AWINQYSVRRY-GRSVPAIQDAWNVLYHTVY 267
V + N + W + RRY G+++ Q AW +L ++VY
Sbjct: 475 IVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNWVKRRYRGQNLDLAQKAWELLDNSVY 534
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ A ++ D+ P++ + L T +Y + Y
Sbjct: 535 KSNNTALKCTTRSLI---DLRPAV------------------SGLIGTTGNYLATAITYE 573
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+V+ AL+ + S + + + YDL+D+ RQ A ++ +I A+ ++
Sbjct: 574 PRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQVFVNAAIPIYQAMINAWNGSNKADTE 632
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ R + L+ D+D L+A F L W+ A+ AQ+ + E+ AR Q+ +W T
Sbjct: 633 KYGRELVGLINDIDRLMATSRHFRLESWVGDARNWAQDAGAKDDMEFQARNQLILWGPAT 692
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI------ 501
L R Y K+W G++ + Y + ++ ++++ K W +
Sbjct: 693 FAPWPLDR-YAAKHWHGIMSEVYAKGWELLYQNLLKT-------EPKAWNKTAFASELME 744
Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
K+ W+N ++ GD++ + L KY Q
Sbjct: 745 KVEKPWENVKSGGVQGPQGDSVAVIRELREKYKQ 778
>gi|423219557|ref|ZP_17206053.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
CL03T12C61]
gi|392624762|gb|EIY18840.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
CL03T12C61]
Length = 715
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 143/540 (26%), Positives = 222/540 (41%), Gaps = 81/540 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+PQS +D + L +K+L R+ LG+ P++P F G VP+ L+N A I
Sbjct: 195 MGNIEGWGGPMPQSQIDSRKKLVQKMLKRMKSLGIEPLMPGFYGMVPSNLKNK-SKAHII 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + +LD DP F + F ++ + YG ++ D F E
Sbjct: 254 PQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDETRRLYGSDIRFFSGDPFHEGG-- 305
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ G AI MQ ++W++QGW + P LL + ++V
Sbjct: 306 ATDGVALGDAGRAIQKTMQKHFPGSIWVLQGWQDNPKP--------GLLEKLDKRYVLVQ 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
+LF E W T K + G P+IW + NF + G L A A SE
Sbjct: 358 ELFGENTNNWETRKGYEGTPFIWATVTNFGERPGINGKLQRFADEVYRASNSEYAKYMKG 417
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
WI Y RYGR I+ AW ++ ++
Sbjct: 418 VGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESYVTARYGRITDEIRTAWKMMLKSI 477
Query: 267 YNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
Y+ G + +++ A P ++ +S + Y + + K+A
Sbjct: 478 YSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKYDRDLYKKAAF------------- 524
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
LF + E + TYR DLI RQ +A A+ +F ++I AYQ
Sbjct: 525 -----------LFAKAMPEFNEVRTYRIDLIHFLRQVIANEADSVFYDMITAYQEKKVEK 573
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
Q +FL +++ + LLA F L W + AK ++K N IT W +
Sbjct: 574 FEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDAGNTAAEKKNNFHNLMMLITYWGE 633
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD---WRREWIK 502
+ E + L DY K W+G++ YY R +YF Y+ +L G+ + D W REW++
Sbjct: 634 HVTSEDN-LHDYAYKEWAGMMNTYYKERWLVYFDYL-RALLRGEEAKAPDYFHWEREWVE 691
>gi|153806010|ref|ZP_01958678.1| hypothetical protein BACCAC_00255 [Bacteroides caccae ATCC 43185]
gi|149130687|gb|EDM21893.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
Length = 715
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 143/540 (26%), Positives = 222/540 (41%), Gaps = 81/540 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+PQS +D + L +K+L R+ LG+ P++P F G VP+ L+N A I
Sbjct: 195 MGNIEGWGGPMPQSQIDSRKKLVQKMLKRMKSLGIEPLMPGFYGMVPSNLKNK-SKAHII 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + +LD DP F + F ++ + YG ++ D F E
Sbjct: 254 PQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDETRRLYGSDIRFFSGDPFHEGG-- 305
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ G AI MQ ++W++QGW + P LL + ++V
Sbjct: 306 ATDGVALGDAGRAIQKTMQKHFPGSIWVLQGWQDNPKP--------GLLEKLDKRYVLVQ 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
+LF E W T K + G P+IW + NF + G L A A SE
Sbjct: 358 ELFGENTNNWETRKGYEGTPFIWATVTNFGERPGINGKLQRFADEVYRASNSEYAKYMKG 417
Query: 239 --------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTV 266
WI Y RYGR I+ AW ++ ++
Sbjct: 418 VGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESYVTARYGRITDEIRTAWKMMLKSI 477
Query: 267 YNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
Y+ G + +++ A P ++ +S + Y + + K+A
Sbjct: 478 YSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKYDRDLYKKAAF------------- 524
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
LF + E + TYR DLI RQ +A A+ +F ++I AYQ
Sbjct: 525 -----------LFAKAMPEFNEVRTYRIDLIHFLRQVIANEADSVFYDMITAYQEKKVEK 573
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
Q +FL +++ + LLA F L W + AK ++K N IT W +
Sbjct: 574 FEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDAGNTAAEKKNNFHNLMMLITYWGE 633
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD---WRREWIK 502
+ E + L DY K W+G++ YY R +YF Y+ +L G+ + D W REW++
Sbjct: 634 HVTSEDN-LHDYAYKEWAGMMNTYYKERWLVYFDYL-RALLRGEEAKAPDYFHWEREWVE 691
>gi|302526099|ref|ZP_07278441.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
gi|302434994|gb|EFL06810.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
Length = 860
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 140/561 (24%), Positives = 232/561 (41%), Gaps = 85/561 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL------QNVF 54
+ N+ + GP+ LD ++ + KK++ R+ +LGM PVLP + G VP +
Sbjct: 200 LQNMASFTGPVSPQLLDARVAMAKKVITRLKDLGMTPVLPGYFGTVPRGFADKSKKADAS 259
Query: 55 PSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTF 114
A++ G W P W LD + ++ AF + Q +G TS +Y D
Sbjct: 260 SDARVIGQGTWVGFDR-PDW-----LDPRTSSYRKVAAAFYQAQHDLFGDTS-MYKMDLL 312
Query: 115 DENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
E D P + + + +Q+ A W++ GW + PP +A++++V
Sbjct: 313 HEGGKSGDVP--VGDAARGVMTALQTARPGATWVLLGWQNN------PP--RAIVDAVDK 362
Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG----------------- 217
KL V+D ++ Q+ PY + ++NF G+ +
Sbjct: 363 SKLFVVDGLSDRYGQRDPDSQWNNTPYAFGTIYNFGGHTTIGANTGVWTQRFPQWRTKQG 422
Query: 218 -ILDSIAFGPVEARTSENT---------------TMAWINQYSVRRYGRSVPAIQDAWNV 261
L IA+ P T+ AW Y+ RRYG AW++
Sbjct: 423 SALTGIAYLPEGTGTNPAAFELFTELAWRQTPIHQAAWFADYASRRYGGPDTRAATAWDL 482
Query: 262 LYHTVYNC-TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
L T Y+ G ++ + A P++D + +++
Sbjct: 483 LRQTAYSMPASGWSEAQDSLYAARPNLD------------------------AATAATWS 518
Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
L Y + +AL+ + L ++ YR+DL+D+ RQAL + L I AY
Sbjct: 519 PASLRYQQATFGKALDELLNVDPALRGTDAYRFDLVDVARQALTNTSRTLLPQIKTAYTN 578
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
D L+ R++ + +D LLA FLLGPWLE+AK A + ++ + E++AR+ I
Sbjct: 579 RDRTQFTTLTSRWMSNMTLLDKLLATDSRFLLGPWLEAAKSWAGTDTEQARLEYDARSLI 638
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
T W + L DY N+ WSGL+ D+Y R YF + ++ +G + +W
Sbjct: 639 TTWGPRAGSDDGRLHDYANREWSGLVSDFYAKRWKQYFDSLNTAMNTGG----QPASIDW 694
Query: 501 IKLTNDWQNGRNVYPVESNGD 521
+ W RN YP GD
Sbjct: 695 FAAEDGWAKQRNPYPTTPAGD 715
>gi|329851961|ref|ZP_08266642.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
biprosthecum C19]
gi|328839810|gb|EGF89383.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
biprosthecum C19]
Length = 731
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 149/568 (26%), Positives = 232/568 (40%), Gaps = 109/568 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLP SW+ ++ LQK+IL + ELGM P+LPAF+G VP A P A+I
Sbjct: 182 MGNIEGYQAPLPLSWIVKKRELQKRILGAMRELGMEPILPAFAGYVPKAFAESHPQARIY 241
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ F++ + YG+ Y D F+E PP
Sbjct: 242 RMRAWEGFHE------TYWLDPADPLFAKLAGRFLDLYDQTYGK-GRFYLADAFNEMLPP 294
Query: 121 V-DSP------------------------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 155
V D P E +++ G ++ ++S DAVW+MQGWLF
Sbjct: 295 VGDGPVEGGYGDSTANKEAVAEVDPAVKAERLAAYGQRLHDSIRSARPDAVWVMQGWLFG 354
Query: 156 YDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW + A L +VP L+VLD+ + P + T++ F+G +I+ +HN+ +
Sbjct: 355 ADQGFWTGDAIAAFLRNVPDDGLMVLDIGNDRYPKVRQTAQAFHGKGWIYGYVHNYGASN 414
Query: 214 EMYGILD-------SIAFGPVEAR---------------------------TSENTTMAW 239
+YG L +I P R + W
Sbjct: 415 PIYGDLGFYRRDMAAITSDPARGRLQGFGVFPEGLDSNSIVYAYLYDLAWNGGTKSLSDW 474
Query: 240 INQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY 299
+ Y+ RYG S P + AW + VY R A+ I++ + +
Sbjct: 475 LAGYTRARYGISSPEVVTAWLDIVKGVYGTRYWTPRWWRSTAGAYLLCKRPDIAMADFEG 534
Query: 300 QNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLT 359
+ + + + +D P L RYD+I+ T
Sbjct: 535 APGDRAALRAGLARLAAIRHDSPLL---------------------------RYDVIEFT 567
Query: 360 RQALAKYANELFLNIIEAYQLNDAHGVFQLS---RRFLELVEDMDGLLACHDGFLLGPWL 416
R + + + L + AY+ D + + RR ++D+ G CH L W+
Sbjct: 568 RHLASLHLDNLIRTALVAYRDGDVAAGDRSATEVRRVTIAIDDLMGAQPCH----LAGWI 623
Query: 417 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 476
E A+ ++ YE NAR Q+T+W L DY +K W GL RD+Y PR +
Sbjct: 624 EQARAYGDTATEKPYYERNARAQVTVWGGKGN-----LHDYASKAWQGLYRDFYLPRWEM 678
Query: 477 YFKYMIESL--ESGDGFRLKDWRREWIK 502
F + + RL W W++
Sbjct: 679 LFAALRTGTYNPAATTERLIAWENAWVE 706
>gi|294627661|ref|ZP_06706243.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 11122]
gi|292598013|gb|EFF42168.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 11122]
Length = 798
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 156/576 (27%), Positives = 238/576 (41%), Gaps = 115/576 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ LPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRASLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
V + +Y ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSVANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
+ + +YG + FG N+ +
Sbjct: 434 YGASNPLYGDFAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVIYEYLYALAWEGPQQ 493
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ Y RYGRS A+ AW L +Y + P +
Sbjct: 494 SWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQTRYWS---------------PRWWNK 538
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
G Y + +P + ++ + D L RA++ + N + + YRYD
Sbjct: 539 RAGAYLLFKRPTAD--IVDFDDCPGDPQRL-------RRAIDALLQQANRYADAPLYRYD 589
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLAC-HDGFLL 412
LI+ R L+ A+ +++AY D A G QL+R +LV +D L+ HD L
Sbjct: 590 LIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGGQHD--TL 646
Query: 413 GPWL-ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 471
W ++A + + Y NAR Q+++W + L DY +K W G+ D+Y
Sbjct: 647 ADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFYL 701
Query: 472 PRAAIYFKYMIESLESGDGF-------RLKDWRREW 500
R + + ++G F +L W R+W
Sbjct: 702 QRWTRFLSAYRAARKAGTPFDAVAVDHQLAAWERQW 737
>gi|373461651|ref|ZP_09553390.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
gi|371951955|gb|EHO69797.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
Length = 713
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 141/509 (27%), Positives = 211/509 (41%), Gaps = 86/509 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ Q ++D Q L ++IL R+ LG+ PVL F G V ++++ +P+A +
Sbjct: 194 MGNLEGWGGPVSQDFIDAQSRLGRRILDRMATLGIQPVLQGFYGMVSRSIRDRYPNAVMP 253
Query: 61 QLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
Q G W F + D +L T+ LF EI + + K YG H + D F E
Sbjct: 254 Q-GMWGFFERPD-------ILKPTEKLFDEIADTYYREIKKHYGTGFHYFGGDLFHEGGQ 305
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
++ G A+ MQ + W++QGW + +P LL + K++V
Sbjct: 306 --TGTLNVADCGLAVQQAMQRNFPGSTWVLQGWSGNPNPL--------LLTKLDREKVLV 355
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
+DLF E W+ +K + G P++WC++ NF MYG L IA + R S+
Sbjct: 356 VDLFGENDEAWNRTKAYQGTPFLWCIVSNFGEQCGMYGKLQRIALQIDKVRKSDYKAYLK 415
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ Y RYG I AW + T
Sbjct: 416 GVGIMPEGINNNPVVYDMVLHAPLTDRKINVEAWLKSYITYRYGSYNADIYAAWLIFLQT 475
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVLKSETSSYDH 321
+Y SV E YG P S + V ++TSS+
Sbjct: 476 IY------------------------ASVPE----KYGLPESVFCARPGVKVTQTSSWGV 507
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
+Y + LF+ + S TY YD+ DL RQ + N ++ ++I A
Sbjct: 508 RARYYDMDFFKEGVRLFLKAKTSFEDSETYAYDMFDLLRQVQSDKGNRVYDDMIAAIDAK 567
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
+ + Q S RFL + D LLA GF L WL A + + NA+ Q+T
Sbjct: 568 NPNRFEQTSDRFLHELLRQDTLLAQSKGFTLERWLGQASRFGKTVYDRDLALKNAKMQLT 627
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
W + + + DY K W+G+LR Y
Sbjct: 628 FWGPDWNPTTT-VHDYAAKEWAGMLRTLY 655
>gi|325922205|ref|ZP_08183992.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
19865]
gi|325547324|gb|EGD18391.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
19865]
Length = 807
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 150/579 (25%), Positives = 234/579 (40%), Gaps = 117/579 (20%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQ +IL R+ ELGM PVLPAF+G VP A P+A+I
Sbjct: 202 MGNIEGYRAPLPQQWIDSKRVLQTQILTRMRELGMQPVLPAFAGYVPKAFAQAHPNARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 262 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 314
Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 315 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPQATWVMQ 374
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 375 GWLFGADREFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 434
Query: 209 FAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM-------------- 237
+ + +YG + FG N+ +
Sbjct: 435 YGASNPLYGDFAFYRQDLQALLADPDKRNLRGFGVFPEGLHSNSVVYEYLYALAWEGPQQ 494
Query: 238 ---AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 294
W+ QY+ RYG S A+ AW+ D+D I
Sbjct: 495 SWSQWLTQYTRARYGHSDAALLQAWS-------------------------DLDAGIYQT 529
Query: 295 TEGKYQNYGKPVSKEAVLKSETSSY----DHPHLWYSTSEVIRALELFIASGNELSASNT 350
+ + K + K T+ D P + RA++ + + + +
Sbjct: 530 RYWSLRWWNKRAGAYLLFKRPTADIVGFDDRPG---DPQRLRRAIDALLQQADRYADAPL 586
Query: 351 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 410
YRYDLI+ R L+ +A+ +++AY D L R LV+ +D L+
Sbjct: 587 YRYDLIEDARHYLSLHADRQLQAVVQAYGTGDFARGDALLARTTRLVQGLDALVGGQHET 646
Query: 411 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
L ++A + + Y NAR Q+++W + L DY +K W G+ ++Y
Sbjct: 647 LADWTDQAAAAAGDDAALRRVYVGNARAQVSVWGGDGN-----LADYASKAWQGMYAEFY 701
Query: 471 GPRAAIYFKYMIESLESGDGF-------RLKDWRREWIK 502
R + + ++G F +L W R+W +
Sbjct: 702 LQRWTRFLSAYRAARKAGTPFDEAAFNKQLAAWERQWAE 740
>gi|372221472|ref|ZP_09499893.1| alpha-N-acetylglucosaminidase [Mesoflavibacter zeaxanthinifaciens
S86]
Length = 712
Score = 181 bits (460), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 95/224 (42%), Positives = 131/224 (58%), Gaps = 9/224 (4%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++G GPLPQ W+ ++ LQKKIL ++ +LGM PV+PAFSG +PAAL FP+AKI+
Sbjct: 200 MGNINGHAGPLPQEWITKKAKLQKKILSKMRDLGMKPVVPAFSGYIPAALAEKFPNAKIS 259
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W D TYLLD DPLF EIG+ FIE +EYG+ + Y D+F+E TPP
Sbjct: 260 ELNGWSGGGFD----STYLLDPKDPLFKEIGKRFIELYNQEYGKAEY-YLADSFNEVTPP 314
Query: 121 VDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V + + L G IY + A W+MQGWLF +D FW + A L+ VP K
Sbjct: 315 VSTENKLDELAAYGQVIYETLNEAAPGATWVMQGWLFGHDAYFWEKDAVIAFLSKVPNDK 374
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
L++ D + +W FYG + + +HN+ G+ +YG D
Sbjct: 375 LIIQDFGNDRYKVWEKQDAFYGKQWTYGYVHNYGGSNPIYGDFD 418
>gi|404403947|ref|ZP_10995531.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 828
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 157/528 (29%), Positives = 244/528 (46%), Gaps = 62/528 (11%)
Query: 1 MSNLHGW-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
M N+ G G P PQ W + Q+ LQ +I+ R+ LGM PV F+G VP A++ + P +
Sbjct: 184 MGNMSGLDGAPTPQ-WHEAQIALQHRIIDRMEALGMTPVYQGFAGFVPPAMKRIHPETTL 242
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE-NT 118
T+ W K+ ++L DPLF EIG AF+ +E+G+ + Y D+F+E +
Sbjct: 243 TET-KWSGFKN-------WMLSPLDPLFSEIGTAFVRAWEEEFGKGKY-YLIDSFNEMDV 293
Query: 119 P--PVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSV 172
P P SPE ++L G IY + + DAVW+MQGW+F Y W P ++ALL
Sbjct: 294 PFGPKGSPERAATLRHYGETIYRSLAEANPDAVWVMQGWMFGYQRNSWDPESVRALLEGA 353
Query: 173 PLGKLVVLDLFAEVKP-IWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
P G++++LDL + IW + K F+G +I+ + NF G + G L+ A G
Sbjct: 354 PDGRMMILDLAVDFNNFIWRSEKSWNHLQGFFGREWIYSTVPNFGGRTALIGNLEFYANG 413
Query: 226 PVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFP 285
+EA +S N+ + YG S ++ ++Y + G +D D+
Sbjct: 414 HLEALSSP-------NRGRLTGYGTSPEGVESN-EIVYEII--AAAGWSDDRIDLKKFLH 463
Query: 286 DVDPS--------IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSE-VIRALE 336
D + I G Q+ + A + + Y H +E A+E
Sbjct: 464 DYSAARYGGCPEGIDRFWSGMLQSSYNECTNNARYRWQLRPYSHRMPTMGINENYYTAIE 523
Query: 337 LFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG----VFQLSRR 392
F+A EL + YR D I LA A+ L +EA D +G + + R
Sbjct: 524 QFLACAGELGGNELYRTDAIQYAALYLASKADML----LEAANWADLYGAREEAYDCAMR 579
Query: 393 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 452
EL+ D D LLA H L W A++ EE+++++ +R I++W +
Sbjct: 580 IEELLLDADRLLASHPLLRLDRWSGMARKAGCTEEEKERFVGESRRLISVWGGPS----- 634
Query: 453 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
L DY + WSG++RDYY PR Y +E+ G F + W +W
Sbjct: 635 -LSDYSARVWSGVIRDYYVPRLNKY----LEAKTDGTVFDFRTWDEQW 677
>gi|383643231|ref|ZP_09955637.1| N-acetylglucosaminidase [Sphingomonas elodea ATCC 31461]
Length = 778
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 143/594 (24%), Positives = 230/594 (38%), Gaps = 115/594 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PL +W++++ VLQ++IL R+ LGM P+LPAFSG VP A P AKI
Sbjct: 209 MGNIAGYRAPLSANWIEKKRVLQRQILARMRSLGMKPILPAFSGYVPEAFAKAHPEAKIY 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W TY LD +DPLF + F++ YG Y D F+E PP
Sbjct: 269 QMRQWEGFPG------TYWLDPSDPLFARLAARFLQLYTATYG-PGEYYLADAFNEMVPP 321
Query: 121 VDS-------------------------PEYI-----SSLGAAIYSGMQSGDSDAVWLMQ 150
+ P+ + ++ G +Y + + +A W+MQ
Sbjct: 322 IAEDGSDARAATYGDAIANTAATRAAALPKEVRDARLAAYGERLYRSITAAAPNATWVMQ 381
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L+ VP ++++LD+ + P IW+ ++ FYG + + +HN
Sbjct: 382 GWLFGADKAFWTPDAIAAFLSKVPDERMLILDIGNDRYPGIWNATRAFYGKGWAYGYVHN 441
Query: 209 FAGNIEMYGILD---------------------------------------SIAFGPVEA 229
+ G+ +YG L +A+G ++A
Sbjct: 442 YGGSNPVYGDLAFYRSDITAALANPGHGRMRGFGLFPEGLHSNGIAYAYAYDLAWGEIDA 501
Query: 230 RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 289
AWI Y+ RYG++ PA+ AW+ Y P
Sbjct: 502 TGKARPLDAWIGDYTRARYGKTSPALVAAWDKAIAGAYTTR---------------YWTP 546
Query: 290 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 349
G Y + P A D+P + + +E +A +
Sbjct: 547 RWWHEQAGGYLFFKFPSLDGA---------DYPAAPGDPAALRAGIEALLAQAPQHGGEP 597
Query: 350 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 409
Y YD++DL R + ++ + AY+ D + + L +D LA +
Sbjct: 598 LYTYDVVDLVRHYASVQLDDRLKTAVAAYKAGDLAAGDRATAAAERLARHIDA-LAGNQQ 656
Query: 410 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 469
LG WL A ++ + A+ +T+W L DY ++ W GL Y
Sbjct: 657 ETLGSWLADAAAYGDTPAEKAAFVEQAKAVVTVWGGTGH-----LSDYASRAWQGLYAGY 711
Query: 470 YGPRAAIYFKYMIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPV 516
Y PR + + + F ++ W+ W+K W R P+
Sbjct: 712 YWPRWQRFLAAQRAAAAAHTPFDAKATSDAIRTWQAAWLKDGRMWPRQRPAAPL 765
>gi|261880009|ref|ZP_06006436.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
gi|270333325|gb|EFA44111.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
Length = 722
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 143/534 (26%), Positives = 228/534 (42%), Gaps = 70/534 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ KI+ R+ LGM+P+ PAF+G VP P ++
Sbjct: 194 MGNLNSWNGPLTDAWQQGQITLQHKIIDRMRALGMHPIAPAFAGFVPEQFVEAHPGLQVK 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W D R Y+L P F +IGR F+E+ KE+G+ + Y D+F+E P
Sbjct: 254 KL-TWGGF--DDR-LNAYVLSPESPYFKQIGRLFVEEWEKEFGKNT-FYQSDSFNEMEIP 308
Query: 121 VDSPEYISS------LGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
V+ + I G IY + + DAVW+ QGW F Y W ++ALL VP
Sbjct: 309 VEPGDSIGKWKLLEQYGDVIYRSIAEANPDAVWVTQGWTFGYQHKMWDSKSLQALLRHVP 368
Query: 174 LGKLVVLDL-------FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + + W +YG +++ + NF G G + A
Sbjct: 369 DDKMLIIDLANDYPKWIWKTQQTWKVQHGYYGKQWVFSYVPNFGGKTLPTGDMQMYASAS 428
Query: 227 VEA-RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFP 285
EA SE M +G + I++ +V+Y + + G TDK
Sbjct: 429 AEALHHSERGNMV--------GFGSAPEGIENN-DVIYELLADM--GWTDKA-------V 470
Query: 286 DVDPSIISVTEGKYQNYGKPVSK--EAVLKSETSS-YDHPHLWYST-------------- 328
D+D I E +Y Y + K + +L+S S Y +P + T
Sbjct: 471 DLDLWIKDYCEARYGGYPSDMQKAWQCMLRSVYGSLYSYPRFTWQTVTPDSRRVSTHALN 530
Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
+ + F+ +L +S YR D I L L A+ + ++
Sbjct: 531 DTFLSGVAHFLRCARQLGSSPLYRSDAISLASLYLGTKADRHYTKALDLKASGKQQAASA 590
Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 448
+ ++L+ D LLA H L W++ A+ ++ +YE +A+ IT+W
Sbjct: 591 ELHQTIDLLTKADRLLASHPTHRLDRWIQFARNHGITTAEKNRYESDAKRLITIW----- 645
Query: 449 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
DY ++W+GL+ YY PR YF + + L W +W+K
Sbjct: 646 --GGFQEDYAARFWNGLIAHYYIPRIRYYFDHGRPA--------LMQWEEQWVK 689
>gi|302546018|ref|ZP_07298360.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
[Streptomyces hygroscopicus ATCC 53653]
gi|302463636|gb|EFL26729.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
[Streptomyces himastatinicus ATCC 53653]
Length = 679
Score = 179 bits (453), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 147/550 (26%), Positives = 226/550 (41%), Gaps = 82/550 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGP+ + L+++ L ++I R+ ELGM PV P + G VP + P A
Sbjct: 177 MQNMSEWGGPVSTALLEKRTDLGRRIADRLRELGMRPVFPGYFGTVPDGFADRNPGAHTV 236
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G+W ++ P W LD F E+ AF Q +G ++ D E N
Sbjct: 237 PQGDWNGLRR-PDW-----LDPRTDAFHEVAAAFYRHQHDLFG-ACDLFKMDLLHEGGNA 289
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
V P+ A+ +Q+ A+W++ GW + + LL++V ++
Sbjct: 290 GDVSVPDAAR----AVEKALQTSRPGAIWVILGWQSN--------PRRDLLDAVDHDHML 337
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI----------EMYGI--------LD 220
V+D +++ I K + VPY + + NF G E + + L
Sbjct: 338 VVDGLSDLDTITDREKDWGSVPYAFGTIPNFGGRTTIGAKTHMWTERFTVWRDKPGSKLV 397
Query: 221 SIAFGP--VEA-------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
A+ P VE R + AW Y+ RYG ++A+ L T
Sbjct: 398 GTAYMPEAVERDPAAYELFSELAWRDTAVDRDAWFRDYADVRYGARDAKAREAFAALRDT 457
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
Y + + V A P S+T NY + T ++D
Sbjct: 458 AYQISSKDGRPHDSVFAARP-------SLTARSGTNYA----------THTPAFD----- 495
Query: 326 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 385
+ AL + L S+ YRYDL D RQALA + +L + +AY D
Sbjct: 496 --PARFDAALAALLGVRAGLRDSDAYRYDLADTARQALANRSWQLIGQLADAYARKDLDT 553
Query: 386 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 445
LSR +L+L+ D + H LLGPWLE AK++A E+ Q E+ AR IT W D
Sbjct: 554 FRALSRLWLKLMRLSDDITGTHRLLLLGPWLEDAKRMASGAEESAQLEFAARALITTWAD 613
Query: 446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
+ L +Y N+ W+GL+ D++ P+ Y + ++L G R D W +
Sbjct: 614 RGAADPGKLANYANRDWNGLIGDFHVPQWQTYLDELEDALAEGRAPRTFD----WYTVEE 669
Query: 506 DWQNGRNVYP 515
W R YP
Sbjct: 670 PWTRERKSYP 679
>gi|293369246|ref|ZP_06615836.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|292635671|gb|EFF54173.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 521
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 102/300 (34%), Positives = 151/300 (50%), Gaps = 37/300 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 196 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+ L+ D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 256 RVSEWGGFADQYR--CTF-LNPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D +AVWL WLF D W P++K+ L SVP +L++
Sbjct: 312 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA---------------- 223
LD F E IW + ++G PY+WC L NF GN + G ++ ++
Sbjct: 372 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 431
Query: 224 ----------------FGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
F +A T W + + RR G+ P + AW +L + VY
Sbjct: 432 VGSTLEGIDLNQFMYEFVLDKAWNGGQTDKEWFFKLADRRIGKISPEARKAWEILANKVY 491
>gi|295690503|ref|YP_003594196.1| alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
gi|295432406|gb|ADG11578.1| Alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
Length = 770
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 153/589 (25%), Positives = 233/589 (39%), Gaps = 111/589 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLP +W+D++ LQ +IL R+ LGM P+LPAF G VP A P A+I
Sbjct: 202 MGNIEGYRAPLPTNWIDKKKDLQVQILGRMRSLGMTPILPAFGGYVPKAFAQKNPKARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I F+ + YG T Y D+F+E PP
Sbjct: 262 RMRPWEGFHE------TYWLDPADPLFAKIAGRFLALYTQTYG-TGTYYLADSFNEMLPP 314
Query: 121 VD-------------------------------SPEYISSLGAAIYSGMQSGDSDAVWLM 149
++ + +++ G AIY ++ DAVW+M
Sbjct: 315 INADGADARDAAYGDGAANTAATKTKVEVDPALKAQRLAAYGKAIYDSIRQARPDAVWVM 374
Query: 150 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 207
QGWLF D FW P + A L+ VP KL++LD+ + P +W +K F G P+I+ +H
Sbjct: 375 QGWLFGADSHFWDPTAISAYLSLVPDDKLMILDIGNDRYPAVWKNAKAFGGKPWIYGYVH 434
Query: 208 NFAGNIEMYGILD-------SIAFGPVEAR---------------------------TSE 233
N+ G+ +YG LD +IA P +
Sbjct: 435 NYGGSNPVYGDLDYYRRDIPAIAANPEAGKLAGFGMFPEGLHNNSIVYDAVYDLAWGAGR 494
Query: 234 NTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 293
+ AW++ Y+ RYG++ P + A L Y+ + P
Sbjct: 495 ESLSAWLSTYARARYGKTSPELDAALGQLVEAAYSTRYWS---------------PRWWK 539
Query: 294 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRAL-ELFIASGNELSASNTYR 352
G Y + +P + HP ++AL L A NE +
Sbjct: 540 SKAGAYLFFKRPTATIGEFPP------HPGDRAKLEAAVKALTALAPAYANE----PLFV 589
Query: 353 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 412
DL D TR ++L + AY+ D Q L +D LL L
Sbjct: 590 LDLTDATRHLATMKIDDLLQAAVAAYRRGDVASGDQARVEIAALALSIDKLLGVQPE-TL 648
Query: 413 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 472
W++ A+ Y NA+ Q+T+W L DY +K W GL R +Y P
Sbjct: 649 ATWIDDARAYGDTPADAAAYVANAKAQVTVWGGEGN-----LNDYASKAWQGLYRGFYLP 703
Query: 473 RAAIYFKYM----IESLESGDGFRLK-DWRREWIKLTNDWQNGRNVYPV 516
R +++ + + + R W R W+ ++ + PV
Sbjct: 704 RWSMFLDALKAAGTGTFDEPAAVRASIAWERAWVDAEVAYRREKPADPV 752
>gi|187735714|ref|YP_001877826.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
gi|187425766|gb|ACD05045.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
Length = 852
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 141/576 (24%), Positives = 244/576 (42%), Gaps = 85/576 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ-NV-FPSAK 58
M NL G GGPL Q +++ + ++I+ R+ +LGM PVL + G VP+ Q NV K
Sbjct: 198 MGNLEGHGGPLSQQQINKMAQMGRRIVSRMEQLGMTPVLQGYVGFVPSDFQENVRIDGLK 257
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ G W + + +++D T F ++ + + K YG ++ D F E
Sbjct: 258 LIPQGEWVNFRR------PWVVDPTCEAFPKLAADWYKALRKVYGIPGKMFGGDLFHEGG 311
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
D ++ + MQ A W++Q W +P + LL+ + + +
Sbjct: 312 RKGDID--VTQAAQEVQKAMQKASPGAFWVIQAW--GGNP------TRELLSGLDPERAL 361
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG--------------------- 217
VL L ++ + F G+P++WC L NF GN MYG
Sbjct: 362 VLQLTKDMANGGKNLRTFNGIPWVWCELANFGGNTGMYGGVPLLSRLGSELSGYKDKGLV 421
Query: 218 ----ILDSIAFGPVEAR-------TSENTTM-AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
+ + + P+ T E+ ++ W+ +Y+ +RYG + A+ A VL +
Sbjct: 422 GMGTLSEGLETNPLHYALFSDRLWTREDISVREWLGKYARQRYGFAPKAVVKALEVLSFS 481
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 325
+YN +I A P + + S++ +
Sbjct: 482 IYNPVRSQEGCTESIICARPSWNV------------------------RKASTWSSGERY 517
Query: 326 YSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
Y ++++A ++ + N+ L T+RYDL+D+ RQALA A + A+ D
Sbjct: 518 YHLGDIVKAARGYLKAANDQPNLVKKETFRYDLVDVVRQALADAAFYQLQQVRSAFDSGD 577
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+ +RFL L+ DMD LLA FLLG W + A + +++ + +A+ IT
Sbjct: 578 LAAYRKQVKRFLSLISDMDALLATDSQFLLGTWQKRALDWGDSRQEKALMDKSAKMLITT 637
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
W D L DY N+ W+GL+ D+Y PR +F++ ++ L +G R K
Sbjct: 638 WIDQVPRS---LNDYSNRQWAGLVSDFYLPRWKNFFEFQMDVL-TGKKTRDAAHAAFMDK 693
Query: 503 LTND---WQNGRNVYPVESNGDALITSQWLYNKYLQ 535
+ D + +Y V+ GD L + + N + +
Sbjct: 694 MVRDELAFAGNGKIYSVKPAGDTLAVANRVMNTHRE 729
>gi|229818803|ref|YP_002880329.1| alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
gi|229564716|gb|ACQ78567.1| Alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
Length = 751
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 162/562 (28%), Positives = 252/562 (44%), Gaps = 66/562 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M + +GGPLP SW +++ L ++IL R ELGM VLPAF G+VP L
Sbjct: 194 MGSTSSFGGPLPDSWFERRAELGRRILERQRELGMRAVLPAFGGHVPDGLGA-------- 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ + T LL D F + F QQ + +G T H+Y D F E+ PP
Sbjct: 246 ------GARTHWQGFSTALLGPDDDAFAVVAAEFARQQRELFG-TDHLYAADPFIESVPP 298
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PE +++ AA Y+GM + D +A W+MQ W F Y FW ++ A+ ++VP +L++
Sbjct: 299 SGEPEDLAAFAAATYAGMSAADPEATWVMQAWPFHYHRRFWTAERIAAVTDAVPRDRLLL 358
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPV---EARTSEN 234
LDL+AE P+W + ++WC +HNF G ++G L +A G V ART
Sbjct: 359 LDLWAEHAPVWDDGRGIAEHQWLWCAVHNFGGRFSVHGDLHGLARDLGGVLDDGARTGGF 418
Query: 235 TTM----------------------------AWINQYSVRRYG-----RSVPAIQDAWNV 261
T + AW+ ++ +RYG + A+ AW +
Sbjct: 419 TGVGMAMEALENNPVFYELLTDLVWERPDVDAWVGRFVDQRYGFADGTAARDAVHGAWAI 478
Query: 262 LYHTVYNCTDGATDKNRDVIVAFPD--VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 319
L T+Y G T ++A P V P G++ + PV A + +E
Sbjct: 479 LLRTLYGP--GMTRSIPSPVIARPADVVAPFHTQRLAGEFLDPDAPVIVSANIDAEAD-- 534
Query: 320 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 379
P + E+ RA L + A +DL DL +A+ I+ A +
Sbjct: 535 --PRVEGDLPEIARAAALLREAAGSSDAGGPLAHDLADLLTHVVAQRTRAPIRAIVAAAR 592
Query: 380 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 439
DA V + D+D + A LLG WL +A++ A ++ + + +AR Q
Sbjct: 593 AGDADAVRANGALLAAAIADLDAVAATQPDRLLGTWLAAAQRWADDDGERRVLLRDARRQ 652
Query: 440 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 499
+T+W E+ S L DY ++WSGLL +Y PR ++ ++ E+ ESG ++ RR
Sbjct: 653 LTVW----GEQTSGLHDYSGRHWSGLLGGFYAPRWQLWVDWLAEAAESGSEPDPQELRRA 708
Query: 500 WIKLTNDWQNGRNVYPVESNGD 521
+ L W P + GD
Sbjct: 709 VVALEESWVARDETGPTDPAGD 730
>gi|308480701|ref|XP_003102557.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
gi|308261289|gb|EFP05242.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
Length = 718
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 128/526 (24%), Positives = 228/526 (43%), Gaps = 86/526 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN---VFPSA 57
M NL +GG L + + L K+I+ R+ ELG+ P+LP F+G VP L+ +FP++
Sbjct: 210 MGNLKAYGGGLSDAQMLNDFNLAKRIINRLLELGIVPILPTFAGFVPDQLEKDFRLFPTS 269
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDE 116
K +L W + S+ C + DPLF +IG F+ Q K G +++Y+ D F+E
Sbjct: 270 KFNRLPCWNNFTSET--SCLLSVSPFDPLFQKIGSTFLRHQKKMLGGDITNLYSADPFNE 327
Query: 117 NTPPVDSPEYISSL----GAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSV 172
P DS ++ +S +I + + D + +W++Q W F+YD W +K+ L++V
Sbjct: 328 -ILPSDSSKFDASFMKQTAQSIMNSCRKVDKNCIWVLQSWSFTYDQ-WPNWAIKSFLSAV 385
Query: 173 PLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS 232
P+G L++LDL++EV P W + F+G ++WC+LHNF G+ E+ G L + G A
Sbjct: 386 PIGNLLILDLYSEVVPAWQMTSSFHGHNFVWCLLHNFGGSRELRGNLQKVDKGYQLALMK 445
Query: 233 ENTTMA---------------------------------WINQYSVRRYGRSVPAIQDAW 259
+ + W+ YS RY W
Sbjct: 446 AGSNLVGAGLSMEAIDQNYVVYQFMIDRMWSQEPIPLNNWLKSYSESRYSADFKVSHKFW 505
Query: 260 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 319
+L + Y+ + + V + Y +P + +
Sbjct: 506 TILAGSFYSQPEKWGNPRFSVFL-------------------YHRPAFAKKI-------- 538
Query: 320 DHPHLWYSTSEVIRALELFIAS-GNELSASNTYRYDLIDLTRQALA-KYANELFLNIIEA 377
W+ E L+ + S + L ++ DL D+ R + + NE L++ EA
Sbjct: 539 ---EYWFPVEETFNHLQSLMPSLMHVLGDHPLFKEDLNDVMRAVIQFEIGNEAALSLTEA 595
Query: 378 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 437
+ + D + +++ + ++ + W+E +K +A E+ + + A
Sbjct: 596 FLMEDKQQIGASCENLMDMFQKLES----YSNRDFKEWIEDSKSIAPTSEERQVFPVTAS 651
Query: 438 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 483
+T+W Q DY ++ W+GLL YYG R + +++E
Sbjct: 652 DILTVWGPTGQN-----LDYAHREWAGLLSGYYGRRWQYFCDWILE 692
>gi|453051703|gb|EME99203.1| alpha-N-acetylglucosaminidase [Streptomyces mobaraensis NBRC 13819
= DSM 40847]
Length = 763
Score = 174 bits (442), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 140/564 (24%), Positives = 233/564 (41%), Gaps = 83/564 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGP+ ++ LD++ L ++I R+ ELG+ PVLP ++G VP A+
Sbjct: 239 MQNMSAFGGPVSRALLDRRTALAQRITRRLRELGITPVLPGYAGTVPPDFTRRNKGARTV 298
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P W LD F + R + Q + YG S +Y D E P
Sbjct: 299 PQGDWAGFPR-PDW-----LDPRTAHFARVARTYYRVQRELYG-ASSMYKIDLLHEGGTP 351
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P + + A+ +++ DA W + GW + + +L++V K++VL
Sbjct: 352 --GPVPVGAAAKAVEKALRAAHPDATWAILGWQTN--------PRREILDAVDRSKMLVL 401
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILDS 221
D + P + K + G PY + + NF G+ M L
Sbjct: 402 DGIPDHYPRVTDREKDWGGTPYAFGTIWNFGGHTAMGANTQDWVSLFHRWRTKKGSALRG 461
Query: 222 IAFGPVEARTSENTTMA----------------WINQYSVRRYGRSVPAIQDAWNVLYHT 265
IA P EA + +A W ++ V+RYG + P + AW+VL T
Sbjct: 462 IALMP-EAADNNPAALALFSDLAWTEGRLDLKDWFARWPVQRYGAADPNARRAWDVLRRT 520
Query: 266 VYNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
Y T DG ++ + A PD+ AV + +++
Sbjct: 521 AYGTTRADGWSEAADGLFGARPDL----------------------AV--NRAAAWSPRQ 556
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
L Y + AL +A L S+ YR DL D+ RQ ++ + L I AY D
Sbjct: 557 LRYDAAAFDEALPALLAVAPALRGSSAYRCDLTDVARQCVSNRSRLLLPRIKAAYDAGDR 616
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
L+R++L+ + ++ +A + LLG W+ A+ + + E +A + +T+W
Sbjct: 617 TRFRTLTRQWLDWMTLLEETVATSERHLLGRWIAEARAWGGTAAERDRLEHDAVSLLTVW 676
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+ L DY N+ W+GL+ Y R YF + +L + R K +W L
Sbjct: 677 GPRASADGGKLHDYANREWAGLVGGLYRLRWKTYFTELEAALTA----RRKPKPIDWYAL 732
Query: 504 TNDWQNGRNVYPVESNGDALITSQ 527
+ W R YP + +GD + ++
Sbjct: 733 EDRWTRKRPAYPAKPSGDIVAVAR 756
>gi|291302495|ref|YP_003513773.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
gi|290571715|gb|ADD44680.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
Length = 696
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 151/574 (26%), Positives = 246/574 (42%), Gaps = 95/574 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M + G+GG + + ++++ L ++I R+ ELG+ PVLP F+G VP ++ +A I
Sbjct: 178 MGCMCGFGG-VSRRLVEERAELGRRITDRMRELGIEPVLPGFAGLVPG---DIGDTAAIP 233
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
Q G WF P W T T + E+ F +Q + G T D E +
Sbjct: 234 Q-GQWFGFDR-PAWLPT-----TTRAYAEVAEVFYAKQTERLGAT-RAQAVDLLHEGGTS 285
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + AA M+ D +W++Q W W P + L +
Sbjct: 286 GGVDLADATRGIAAA----MERAHDDYLWVLQAW-------WDNPLPEVLAAT----DSD 330
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM- 237
L L W +K ++G P+ L NF G ++G L IA P + +++
Sbjct: 331 HLLLLDLTGEGWRKTKGWHGKPWARGSLTNFGGRTVLFGGLPEIAELPSLKDDPKASSLV 390
Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
AW+ +Y RYG++ P AW+ L T
Sbjct: 391 GTALVEEAWQVNPVVWSLFTQTSWADGDIDLNAWVPEYVAARYGKAHPRAVRAWHGLLAT 450
Query: 266 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH-L 324
Y DG ++ A P +D ++ +S + PH L
Sbjct: 451 AYRSMDGRPGGAESLLCAMPSLD-------------------------ADRASMNGPHSL 485
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
Y + A +A+ L ++T+R+DL+D+TRQ ++ A L + AY + +
Sbjct: 486 PYPAEALEVAWRDLLAAREALGGADTFRFDLVDVTRQVISNRARPLLPLLRTAYAMKELD 545
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
LS F++L E +D +LA + FL+G WL A+ LA +E++ E++ART IT W
Sbjct: 546 RFIALSHSFIDLFELLDPVLATREEFLVGRWLADARALAADEDEADALEFDARTIITTWG 605
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
D+ + A+L+ DY N W+GL+ DYY PR Y K + L G D+ +
Sbjct: 606 DSPESSATLI-DYANHEWAGLIADYYRPRWEKYLKSLETELREGKPAEPIDFYAD----A 660
Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNK--YLQG 536
W + YP E +GDA+ + + +++ Y +G
Sbjct: 661 AAWARSHDTYPTEPSGDAVSSCRAVHHALPYFEG 694
>gi|418473272|ref|ZP_13042874.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
coelicoflavus ZG0656]
gi|371546106|gb|EHN74664.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
coelicoflavus ZG0656]
Length = 716
Score = 171 bits (434), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 136/548 (24%), Positives = 224/548 (40%), Gaps = 77/548 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ GP+ + ++Q+ L ++I R+ ELGM PVLP + G VP P +
Sbjct: 213 MQNMSGFAGPVSERLIEQRAALGRRIANRLRELGMTPVLPGYYGTVPPDFTARNPGGTVV 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD +F + +F Q + +G S +Y D E P
Sbjct: 273 PQGQWVGFER-PDW-----LDPRTGVFSRVAASFYRHQRELFG-DSTMYKMDLLHEGGRP 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P + A+ + +Q+ AVW + GW + P + ++++V +L+++
Sbjct: 326 GNVP--VGDAARAVMNALQTARPGAVWTLIGWQNN-------PSTQ-IIDAVDKSRLLIV 375
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI------------------EMYGILDSI 222
D ++ ++G PY + + NF G+ E L I
Sbjct: 376 DGLSDRYDGLDRETAWHGAPYAFGTIPNFGGHTTVGANTAVWAERFDRWRTEPGSALAGI 435
Query: 223 AFGP---------------VEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
A+ P + RT W Y+ RRYGR P AW +L Y
Sbjct: 436 AYLPEGTGGNPVAYELFTELAWRTEPVDHSGWFAAYAERRYGRPDPHAARAWELLRTGPY 495
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ G + +D + ++ + + +S+ + Y
Sbjct: 496 SMPSGTWSEAQDSLF-----------------------TARPRLTATSAASWSPGAMRYD 532
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
V AL + L ++ YR+DL+D+ RQALA + L I AY D
Sbjct: 533 PDTVRAALAELLKVAPALRTTDAYRFDLVDVARQALANRSRSLLPEIKAAYDAGDLSRFR 592
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ + + ++ +D LLA FLLGPWL A+ + ++ E++AR+ +T W +
Sbjct: 593 AGAAEWKDDLDLLDRLLATDSRFLLGPWLADARSWGRTAAEKDAAEFDARSLLTTWGHRS 652
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
+A LRDY N+ WSGL+ D+Y R Y + +L +G D W L N W
Sbjct: 653 GSDAGGLRDYANREWSGLVSDFYAMRWTTYLDSLDTALVTGRPPAAID----WFSLENAW 708
Query: 508 QNGRNVYP 515
+ YP
Sbjct: 709 NQRHDDYP 716
>gi|257067709|ref|YP_003153964.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
4810]
gi|256558527|gb|ACU84374.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
4810]
Length = 768
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 145/583 (24%), Positives = 235/583 (40%), Gaps = 91/583 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M H G L L+ + L ++I R ELGM VLP F G +PA L + ++
Sbjct: 201 MGITHDLGAALTDEALEARAELGRRIAERERELGMTVVLPGFGGQLPAELVG---TERMI 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W + + P DPLF E + Q + G T H Y D + E+ PP
Sbjct: 258 DWQGWHNALAAP----------GDPLFAEAAASLHRHQRQLLG-TDHHYAVDPYIESLPP 306
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
SP+ ++ AI++ M+ D AVW++QGW F Y +W ++ +LL+ VP +L++
Sbjct: 307 TTSPQQLAEHAEAIFTAMRDADPQAVWILQGWPFHYRAAYWTEERVHSLLSRVPEDRLIL 366
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSI----------------- 222
LDL+ E P+W + YG ++WC+ H F G ++G L ++
Sbjct: 367 LDLWGEHAPMWHRTAAMYGRRWLWCLAHTFGGRFGLFGDLAALDDDLRGLRTAAEAGTRG 426
Query: 223 ---AFGPVEARTSENTTM----------------AWINQYSVRRYGRSVPAIQDAWNVLY 263
FG +N + W+ ++ +RRYG + P +Q AW V+
Sbjct: 427 RLEGFGITSEALDDNAVVYELATRALWSPMPPRERWLEEHIIRRYGTAAPEVQQAWQVIA 486
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
HT+Y G T ++A P G P + + + D P
Sbjct: 487 HTLYGP--GRTRSTPSPLIARP--------------WTRGLPFASQRLAGEALPDADGPP 530
Query: 324 LWYSTSE--------------VIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 369
+E +R+L + SG + DL L A+ A
Sbjct: 531 SANIDAENDAEMLGALAPLAHAVRSLLPVLRSGEH---RDALARDLAQLAIHVGAQSARA 587
Query: 370 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ- 428
I+ A D + + L+ +D + A L+G W+ A+ A +E+
Sbjct: 588 PLRAIVAAAAEADGERLRAEASTLEALLRAVDAVAATRPDMLVGRWIADARAGAGTDERL 647
Query: 429 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES- 487
E +AR+ I++W TQ+ S L DY ++WSG L D + R + ++ + E
Sbjct: 648 ADALERDARSLISVW--GTQD--SGLHDYSARHWSGSLTDLHLARWRAWTDWLARTAEEP 703
Query: 488 GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD-ALITSQWL 529
L+ + + DW++ YP G+ A SQ L
Sbjct: 704 STPPDLEQLHAQIRGIEEDWRDSTAPYPTTPRGEPAAAISQLL 746
>gi|16124795|ref|NP_419359.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
gi|221233511|ref|YP_002515947.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
gi|13421729|gb|AAK22527.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
gi|220962683|gb|ACL94039.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
Length = 770
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 138/540 (25%), Positives = 217/540 (40%), Gaps = 104/540 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLP +W+D++ LQ KIL R+ LGM P+LPAF G VP A P A+I
Sbjct: 202 MGNIEGYKAPLPTAWIDKKKDLQVKILGRMRSLGMTPILPAFGGYVPKAFAEKNPKARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I F+ + +G ++ Y D+F+E PP
Sbjct: 262 RMRPWEGFHE------TYWLDPADPLFAKIAARFLALYTETFGAGTY-YLADSFNEMLPP 314
Query: 121 VD-------------------------------SPEYISSLGAAIYSGMQSGDSDAVWLM 149
++ + +++ G AIY ++ DAVW+M
Sbjct: 315 INADGADARDAAYGDGTANTAVTKTKVEVDPALKAQRLAAYGKAIYDSIRQTRPDAVWVM 374
Query: 150 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 207
QGWLF D FW P + A L+ VP KL++LD+ + P +W +K F G P+I+ +H
Sbjct: 375 QGWLFGADSHFWDPAAISAYLSLVPDDKLMILDIGNDRYPNVWKNAKAFGGKPWIYGYVH 434
Query: 208 NFAGNIEMYGILD-------SIAFGPVEARTS---------------------------E 233
N+ G+ +YG L +IA P + + +
Sbjct: 435 NYGGSNPVYGDLGFYRQDIPAIAANPDAGKLAGFGMFPEGLHNNSIVYEAVYDLAWSEGQ 494
Query: 234 NTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 293
+ W+ +Y+ RYG++ PA+ A L ++ + P
Sbjct: 495 ASPATWLTRYARARYGKTSPALDAALGQLVEAAFSTRYWS---------------PRWWK 539
Query: 294 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 353
G Y + +P + D P +++ A++ A +
Sbjct: 540 SKAGAYLFFKRPTATVG---------DFPQHPGDRAKLEAAVKALTALAPTYGQEPLFVL 590
Query: 354 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 413
DL D TR ++L + AY+ D L +D LL L
Sbjct: 591 DLTDATRHLATMKIDDLLQVAVAAYRRGDTAAGDAARVEIEALALSIDKLLGVQPD-TLA 649
Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
W++ A+ Y NA+ Q+T+W L DY +K W GL + +Y PR
Sbjct: 650 TWIDEARAYGDTPADAAAYVANAKAQVTIWGGEGN-----LNDYASKAWQGLYKSFYLPR 704
>gi|329934959|ref|ZP_08285000.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
gi|329305781|gb|EGG49637.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
Length = 1017
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 136/562 (24%), Positives = 223/562 (39%), Gaps = 81/562 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPS-AKI 59
+ NL+G+GGPL + ++ L ++I R+ LGM PVLP + G+VP + A +
Sbjct: 191 LQNLYGYGGPLSAELIARRAALGRRIADRLRALGMRPVLPGYYGHVPKDFADRRGGDAHV 250
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W P W LD F E+ +F Q +G + D E
Sbjct: 251 VPQGTWHGFDR-PSW-----LDPRTDAFAEVAASFYRHQEDVFGPAGD-FKMDLLHEGGT 303
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P ++ G + +++ A W++ GW + + LL++V ++++
Sbjct: 304 AGDVPVPDAARG--VEKALRAARPGATWVILGWEAN--------PLPELLDAVDKKRMLI 353
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILD 220
+D ++ + + + G PY + + NF G + L
Sbjct: 354 VDGVSDRYTSVTDREEDWGGTPYAFGTIPNFGGRTTIGARTHIWREKFFAWRDKPGSALA 413
Query: 221 SIAFGPVEAR--------------TSENTTMA-WINQYSVRRYGRSVPAIQDAWNVLYHT 265
A+ P A T E A W Y+ RYG + AW L+ T
Sbjct: 414 GTAYLPEAADRDPAAFELFSELAWTDEPVDRARWFTGYADFRYGGRDAGARRAWRALHDT 473
Query: 266 VYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
Y + +D + + A PD+ + + Y L
Sbjct: 474 AYQQHANERSDPHDSLFCARPDL------------------------AATRAARYAPAAL 509
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
Y + AL +A YRYDL+D+ RQALA + + + A+ DA
Sbjct: 510 TYDPARFDAALSGLLAVAAHRRGGAAYRYDLVDVARQALAHRSRQYLPQLKAAFDREDAA 569
Query: 385 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 444
L+ ++L L+ + + H FLLGPW+E A+++A N + ++E A+ +T+W
Sbjct: 570 TFKALATQWLTLMRLSEDITGTHPAFLLGPWIEDARRMATNPRERAEFERTAKALVTVWG 629
Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504
D +A L +YGN+ W GLL D+Y PR + ++L +G D W
Sbjct: 630 DRATSDAGNLHEYGNREWHGLLSDFYLPRWQKWLDACEDALATGTAPAAVD----WFAFE 685
Query: 505 NDWQNGRNVYPVESNGDALITS 526
W R YP+ GDA T+
Sbjct: 686 EPWTRERKDYPLRPVGDAYRTA 707
>gi|333023613|ref|ZP_08451677.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
gi|332743465|gb|EGJ73906.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
Length = 741
Score = 168 bits (426), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 147/560 (26%), Positives = 241/560 (43%), Gaps = 90/560 (16%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ ++Q+ L +I+ R+ ELGM+PVLP + G VPA + P AK
Sbjct: 215 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 274
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD LF E+ AF E Q + YGR + +Y D E
Sbjct: 275 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P ++ G + +++ DAVW++ GW + PP K ++ + ++V+
Sbjct: 328 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 377
Query: 181 D----LFAEVKPIWSTSKQFYGVPYIWCMLHNFAGN----------IEMY--------GI 218
D F+EV + G PY + + NF G+ +++Y
Sbjct: 378 DGLSDRFSEVN---DRESDWQGTPYAFGSIWNFGGHTALGANARDWVDLYPRWRDRSGSR 434
Query: 219 LDSIAFGPVEARTSENTTMA------------------WINQYSVRRYGRSVPAIQDAWN 260
L IA P A +N A W +Y+ RYG S + AW+
Sbjct: 435 LSGIALMPEAA---DNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWD 491
Query: 261 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 320
+L T Y D+ + P++ +V+ GK+ L+ +S++
Sbjct: 492 ILRTTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFE 541
Query: 321 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 380
AL+ +A EL S TYR DL+D+ RQALA + L + AYQ
Sbjct: 542 P------------ALDELLAVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYQA 589
Query: 381 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 440
+ +L RR++ L++ ++ L+A + LLG W+ESA+ + ++ Q +++A + +
Sbjct: 590 KNQAEFARLGRRWIALMDLLEQLVATDENHLLGRWVESARAWGGSAREKSQLQYDALSLL 649
Query: 441 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 500
T W +A LRDY N+ WSGL+ Y R + Y + +L+ G K +W
Sbjct: 650 TTWGTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELSAALKEG----RKPVAVDW 704
Query: 501 IKLTNDWQNGRNVYPVESNG 520
L + W + G
Sbjct: 705 FALEDRWTRNPGALATQPRG 724
>gi|318078904|ref|ZP_07986236.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actF]
Length = 719
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 140/533 (26%), Positives = 235/533 (44%), Gaps = 80/533 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ ++Q+ L +I+ R+ ELGM+PVLP + G VPA + P AK
Sbjct: 193 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 252
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD LF E+ AF E Q + YGR + +Y D E
Sbjct: 253 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 305
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P ++ G + +++ DAVW++ GW + PP K ++ + ++V+
Sbjct: 306 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 355
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGN----------IEMY--------GILDS 221
D ++ P + + G PY + + NF G+ +++Y L
Sbjct: 356 DGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALGANTRDWVDLYPRWRDRSGSRLSG 415
Query: 222 IAFGPVEARTSENTTMA------------------WINQYSVRRYGRSVPAIQDAWNVLY 263
IA P A +N A W +Y+ RYG S + AW++L
Sbjct: 416 IALMPEAA---DNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILR 472
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
T Y D+ + P++ +V+ GK+ L+ +S++
Sbjct: 473 TTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP-- 520
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
AL+ ++ EL S TYR DL+D+ RQALA + L + AY+ +
Sbjct: 521 ----------ALDELLSVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQ 570
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+L RR++ L++ ++ L+A + LLG W+ESA+ + ++ Q +++A + +T W
Sbjct: 571 AEFARLGRRWIALIDLLEQLVATDENHLLGRWVESARAWGGSAREKNQLQYDALSLLTTW 630
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
+A LRDY N+ WSGL+ Y R + Y + +L+ G DW
Sbjct: 631 GTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELSAALKEGRKPVAVDW 682
>gi|429201402|ref|ZP_19192867.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
91-03]
gi|428663010|gb|EKX62401.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
91-03]
Length = 1042
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 136/559 (24%), Positives = 222/559 (39%), Gaps = 83/559 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
+ NL G+GGPL +D++ L ++I R+ ELGM+PVLP + G+VP ++ A +
Sbjct: 212 LQNLSGYGGPLSPELIDRRAALGRRIADRLRELGMSPVLPGYYGHVPKEFVERNGGDAHV 271
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W + P W LD F ++ +F Q +G +H + D E
Sbjct: 272 VPQGVWHGFER-PDW-----LDPRTDSFAKVAASFYGHQEDVFGEAAH-FKMDLLHEGGT 324
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P + + +Q A W++ GW +P + LL+++ ++++
Sbjct: 325 AGDVP--VPGAAQGVERALQKARPGATWVILGW--QENP------LPELLDAIDKSRMLI 374
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILD 220
+D ++ + + + G PY + + NF G + L
Sbjct: 375 VDGVSDRYTSVTDRERDWGGTPYCFGTIPNFGGRTTIGARAHIWNEKFFAWRDKANSALA 434
Query: 221 SIAFGPVEARTSENTTM----------------AWINQYSVRRYGRSVPAIQDAWNVLYH 264
AF P EA + AW + Y+ RYG + + AW L+
Sbjct: 435 GTAFMP-EATDRDPAAFELFSELAWTPTKIDRAAWFSAYADYRYGARDDSARRAWRALHD 493
Query: 265 TVYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
T Y +D + + A PD+ + Y
Sbjct: 494 TAYQQRAVERSDPHDSLFCARPDL------------------------AADRAAEYAPRA 529
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
L Y AL + L S Y+YD++D+ RQALA + + + AYQ D
Sbjct: 530 LTYDPGRFDAALAGLLGVAGGLRGSAAYKYDVVDVARQALAHRSRQYLPQLRAAYQRKDL 589
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
LS +L L+ D + + FLLGPW+ A+ LA N+ + ++E A+ IT+W
Sbjct: 590 ATFRALSTLWLRLMRLSDEVTGANSAFLLGPWVNDARLLATNDAERAEFERTAKVLITVW 649
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+A L +YGN+ W GL+ D+Y PR + + ++L +G D W
Sbjct: 650 GGRATSDAGDLHEYGNREWHGLMADFYVPRWEKWLDTLEDALATGTAPAAVD----WFAF 705
Query: 504 TNDWQNGRNVYPVESNGDA 522
W R Y + GDA
Sbjct: 706 EEPWTRERKDYALRPVGDA 724
>gi|318057780|ref|ZP_07976503.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actG]
Length = 741
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 143/557 (25%), Positives = 241/557 (43%), Gaps = 84/557 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ ++Q+ L +I+ R+ ELGM+PVLP + G VPA + P AK
Sbjct: 215 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 274
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD LF E+ AF E Q + YGR + +Y D E
Sbjct: 275 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P ++ G + +++ DAVW++ GW + PP K ++ + ++V+
Sbjct: 328 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 377
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGN----------IEMY--------GILDS 221
D ++ P + + G PY + + NF G+ +++Y L
Sbjct: 378 DGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALGANTRDWVDLYPRWRDRSGSRLSG 437
Query: 222 IAFGPVEARTSENTTMA------------------WINQYSVRRYGRSVPAIQDAWNVLY 263
IA P A +N A W +Y+ RYG S + AW++L
Sbjct: 438 IALMPEAA---DNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILR 494
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
T Y D+ + P++ +V+ GK+ L+ +S++
Sbjct: 495 TTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP-- 542
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
AL+ ++ EL S TYR DL+D+ RQALA + L + AY+ +
Sbjct: 543 ----------ALDELLSVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQ 592
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+L RR++ L++ ++ L+A + LLG W+ESA+ + ++ Q +++A + +T W
Sbjct: 593 AEFARLGRRWIALIDLLEQLVATDENHLLGRWVESARAWGGSAREKNQLQYDALSLLTTW 652
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+A LRDY N+ WSGL+ Y R + Y + +L+ G K +W L
Sbjct: 653 GTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELSAALKEG----RKPVAVDWFAL 707
Query: 504 TNDWQNGRNVYPVESNG 520
+ W + G
Sbjct: 708 EDRWTRNPGTLATQPRG 724
>gi|429198382|ref|ZP_19190217.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
gi|428665917|gb|EKX65105.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
Length = 747
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 139/555 (25%), Positives = 233/555 (41%), Gaps = 77/555 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ + LD + L ++I+ R+ ELGM PV P + G VP P A+
Sbjct: 220 LQNLSSFPSPVSRQLLDARAALGRRIVGRLRELGMTPVFPGYFGTVPPGFAERNPGARTV 279
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W + P W LD F + AF Q + +G S +Y D E P
Sbjct: 280 PQGDWMGF-ARPDW-----LDPRTNEFKRVAAAFYRAQDELFGGPSTLYKMDLLHEGGDP 333
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ + +++ DA W++ GW + PP +A++++V +++V+
Sbjct: 334 GDVP--VADAAKGVERALRAAHPDATWVILGWQHN------PP--RAIVDAVDKKRMLVV 383
Query: 181 DLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNI----------EMY--------GILDS 221
D ++ P + +G PY + + NF G+ E+Y L
Sbjct: 384 DGLSDRFPTVIDREADWGDTPYAFGSIWNFGGHTALGANTPVWAELYEKWRTKDGSKLRG 443
Query: 222 IAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
IA P A R E W ++++ RYG P + AW++L T
Sbjct: 444 IALMPEAADNNPAAFALFSELAWRKDELDLKTWFSEWAHARYGARDPHAEAAWDILRRTA 503
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y G T +R +EG +G S+ A+ + + L Y
Sbjct: 504 Y----GTTRADR---------------WSEGADGLFG---SRPALNTVRAARWSPKQLRY 541
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+E AL ++ L +S+ YR DL+D+ RQ L+ + L I AY D
Sbjct: 542 DAAEFEPALGELLSVRPGLRSSSAYRRDLLDVARQTLSNRSRVLLPRIRGAYDARDTARF 601
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+L+ +L L++ +D LLA LLG W+ A+ ++ + ++ ++ + +T+W
Sbjct: 602 DELTGTWLSLMDLLDRLLATDSAHLLGRWVADARAWGASDAERERLAYDNLSLLTVWGTR 661
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
+A LRDY N+ W+GL+ Y R + YF+ + +L G + D W L +
Sbjct: 662 KGADAG-LRDYANREWAGLVGGLYRLRWSTYFEELRAALREGRTPKKID----WFALEDR 716
Query: 507 WQNGRNVYPVESNGD 521
W E GD
Sbjct: 717 WTRAPGRLATEPTGD 731
>gi|291301158|ref|YP_003512436.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
gi|290570378|gb|ADD43343.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
Length = 734
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 136/554 (24%), Positives = 229/554 (41%), Gaps = 79/554 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL + GP+ Q LD + L ++I R+ ELG+ PVLP + G +P A+
Sbjct: 209 MQNLSAFPGPISQHLLDSRAELARRIRTRMAELGIRPVLPGYFGTIPGGFAKRNQQARTV 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W+ S P W LD T F ++ +F Q + G + +Y D E P
Sbjct: 269 PQGVWYGF-SRPDW-----LDPTGNEFAKVAASFYRHQAQLLGE-ADMYKMDLMHEGGDP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P ++ G A+ +Q A W+M GW R +L + +++++
Sbjct: 322 GGIPIPDAAKGVAL--ALQRARPGATWVMLGW--------RKNPRTDILTDIDTSRVLIV 371
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA-- 238
D ++ + G PY + + NF G+ + A + RT+ ++ ++
Sbjct: 372 DGISDRFDDLDREHTWPGTPYAFGTIPNFGGHTTIGANAKVWAKRFGQWRTAPDSAVSGI 431
Query: 239 ------------------------------WINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
W Y+ RRYG + + AW+ L + Y
Sbjct: 432 AWMPEGAGRDPAAFELFAELAWRDSIDLGEWFADYADRRYGGADDNARTAWDALRRSAYA 491
Query: 269 CTDGATDKNRDVIV-AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
G + D + A P +D VT Y + L Y
Sbjct: 492 MPSGRWAEAADGLFGARPGLD-----VTHADY-------------------FSPEFLRYD 527
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ +AL + L ++ YR+DL+D+ RQ+L EL + A+ +
Sbjct: 528 AAVFAQALPALLDVDKSLH-NDAYRFDLVDVARQSLVNAGRELLPRVKSAFVNQNKKQFD 586
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ +R +L+ + +D LL FLLGPWLE+A++ A+ ++ K E++ART +++W +
Sbjct: 587 KHTRTWLDWMRLLDRLLETDRRFLLGPWLEAARRSARTADEAKDLEYDARTIVSVWGHRS 646
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
+ L DY N+ +GL+ D Y R YF + ESL+SG + D W L ++W
Sbjct: 647 GSDEGRLHDYANRELAGLVSDLYAMRWRRYFDSLAESLDSGQAPQHID----WFALEHEW 702
Query: 508 QNGRNVYPVESNGD 521
+ + + E GD
Sbjct: 703 ASKTDDHATEPKGD 716
>gi|365876979|ref|ZP_09416485.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
gi|442587289|ref|ZP_21006107.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
anophelis R26]
gi|365755253|gb|EHM97186.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
gi|442562959|gb|ELR80176.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
anophelis R26]
Length = 712
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 140/551 (25%), Positives = 227/551 (41%), Gaps = 87/551 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ + QQ LQKKIL R+ ELG+ PVL F G VP L+N AK+
Sbjct: 193 MGNLEGWGGPVSMDMMKQQAELQKKILKRMKELGIEPVLQGFYGMVPHDLKNKISEAKVI 252
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W P +LD T LF +I + + YG H + + F E T
Sbjct: 253 EQGKWAGEFQRPG-----ILDPTTKLFSKIADTYYTEMKNLYGEDIHYFGGEPFHEGGKT 307
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D + S I + MQ ++ W++QG W+ LL + +
Sbjct: 308 NGLDLKNVVES----IQTSMQKSYPNSTWVLQG--------WQQNPSDGLLAGLKKENTL 355
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL------------------- 219
+++LF E W K + G +IW + NF +YG L
Sbjct: 356 IIELFGENTANWEKRKGYGGTSFIWSNVSNFGEKNGLYGKLQRFIDEVFRAKESIYGANL 415
Query: 220 -------DSIAFGPVE-------ARTSENTTM-AWINQYSVRRYGRSVPAIQDAWNVLYH 264
+ I PV A SE + W+ +Y+ RYG+ + AW
Sbjct: 416 KGIGIIPEGIFNNPVAYDLMLDIAWYSEKPILDQWLTEYTKYRYGKENQDVIQAWKEFAQ 475
Query: 265 TVYNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
T+Y+ D + + + A P ++ + +S + +NY + KEAV
Sbjct: 476 TIYSSPDVYQEGPSESIYCARPSLNVNPVSSWGTRKRNYDQSRFKEAV------------ 523
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
++F+ + + S TY+ D D RQ A + ++ +I+A
Sbjct: 524 ------------KVFVKADTDFKDSETYQTDKTDFLRQVWANKGDVVYDELIKAIHEKKT 571
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
+ + +FLE++ + LL + F L L+ A+ + + +NA++Q+T W
Sbjct: 572 TKIQKSGHQFLEMISIQNMLLGNNRYFTLNRLLKEAEHFGEKLPDAQNVMFNAKSQLTYW 631
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+ + LRDY +K W+GLL Y R +K IE +SG + + +
Sbjct: 632 GPDNNPKTD-LRDYAHKEWNGLLSSLYYNR----WKVFIEQAQSG----IITAPEVFYNM 682
Query: 504 TNDWQNGRNVY 514
+W G+N+Y
Sbjct: 683 EVEWSKGKNMY 693
>gi|290956360|ref|YP_003487542.1| alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
gi|260645886|emb|CBG68977.1| putative alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
Length = 732
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 146/556 (26%), Positives = 229/556 (41%), Gaps = 80/556 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL G+ P+ + LD + VL ++I R ELGM PV P + G VPA P A+
Sbjct: 206 LQNLSGFPSPVSRQLLDARAVLGRRIADRARELGMIPVFPGYFGTVPAGFAERVPGARTV 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD F + AF Q + +G S +Y D E P
Sbjct: 266 PQGRWMGF-ARPDW-----LDPRTDEFARVAAAFYRTQDEMFG-PSALYKMDLLHEGGDP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ + +Q A W+M GW + PP +A++++V ++V+
Sbjct: 319 GDVP--VADAAKGVERALQRAHPGATWVMLGWQHN------PP--RAIVDAVDKQHMLVV 368
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILDS 221
D ++ P + + G PY + + NF G+ + L
Sbjct: 369 DGLSDRFPTVTDREADWGGTPYAFGSIWNFGGHTALGANTPDWAALYEKWRTKDGSTLHG 428
Query: 222 IAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
IA P A R E W +++ RYG P + AW++L T
Sbjct: 429 IALMPEAADNNPAAFALFSELAWREGELDLETWFAEWAHARYGARDPHAEAAWDILRRTA 488
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y T S +EG +G S+ A+ + L Y
Sbjct: 489 YGTT-------------------RADSWSEGADGLFG---SRPALTAVRAGRWSPKQLRY 526
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
+ ++ AL + EL AS+ YR DL+D+ RQAL+ + + + AY DA +
Sbjct: 527 NAADFEPALGEMLKVRPELRASSAYRRDLLDVARQALSNRSRVMLPQLKAAYDAKDAARL 586
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+ SR +L L++ +D L+A LLG W+ A+ A + + ++A + +T+W
Sbjct: 587 AKGSRDWLSLMDLLDELVATDSRHLLGRWVADARSWAVGSTERTELAYDALSLLTVW--G 644
Query: 447 TQEEASL-LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
T+E A LRDY N+ W+GL+ Y R A YF+ + +L G + D W L +
Sbjct: 645 TREGADAGLRDYANREWAGLVGGLYRLRWATYFEELRAALAEGRAPKKID----WFALED 700
Query: 506 DWQNGRNVYPVESNGD 521
W E GD
Sbjct: 701 RWARNPGTLATEPAGD 716
>gi|440695019|ref|ZP_20877582.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
gi|440282912|gb|ELP70302.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
Length = 1050
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 147/569 (25%), Positives = 237/569 (41%), Gaps = 85/569 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL--QNVFPSAK 58
+ N+ G+GGP+ + ++++ L KI R+ ELGM PVLP + G VP +N +A
Sbjct: 216 LQNMSGFGGPVSRRLIEKRADLAAKITERVRELGMTPVLPGYFGTVPDEFVARNGGDAAV 275
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ Q G+W + K P W LD F E+ AF + Q + +G S +Y D E
Sbjct: 276 VPQ-GDWGAFKR-PDW-----LDPRTTAFGEVAAAFYQAQSERFG-DSTMYKMDLLHEGG 327
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
P D P + A+ + ++ AVW + GW + P + +L++V ++
Sbjct: 328 NPGDVP--VGRAAQAVEAALRKAHPGAVWAILGWQNN-------PSGE-ILDAVDKSRMF 377
Query: 179 VLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGN----------IEMY--------GIL 219
V+D ++ + + G PY + + NF G+ +E Y L
Sbjct: 378 VVDGLSDRYTTVTDRESDWGGTPYAFGSIWNFGGHTPMGANAPDWVEQYPKWRDKEDSAL 437
Query: 220 DSIAFGPVEARTSENTTMA---------------WINQYSVRRYGRSVPAIQDAWNVLYH 264
IA P A + W Y+V RYG P AW ++
Sbjct: 438 AGIAAMPEAADNNHAALALLTDLAWTPGTIDLDDWFASYAVSRYGAEDPHALAAWKIIGD 497
Query: 265 TVYNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 322
T Y + DG ++ + A P + +K A E YD
Sbjct: 498 TAYGMSRADGWSEAPDGLFGARPSLG-----------------ANKAAAWGPEADRYD-- 538
Query: 323 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 382
T+ AL + L ++ YRYDL D+ RQ L+ + L I AY D
Sbjct: 539 -----TTAFDLALTELLQVAPALRGNSAYRYDLADVARQVLSNRSRMLLPQIRAAYDTAD 593
Query: 383 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+L+ +L+ + MD +LA LLG WL A+ ++ Q E++AR+ IT
Sbjct: 594 RVRFDELTGVWLDWMRLMDKVLATSGQHLLGRWLADARSWGATRGEKDQLEYDARSIITT 653
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
W E L DY N+ WSGL+ Y R +YF+ + +L + D W
Sbjct: 654 WGGRASSEEG-LHDYANREWSGLVGGLYLTRWTLYFRELSRALRQNRPPKTVD----WFT 708
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYN 531
L +DW + + +P +++GD ++ ++N
Sbjct: 709 LEDDWAHRHDSHPTKTSGDVHKLARRVHN 737
>gi|398786493|ref|ZP_10549210.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
gi|396993639|gb|EJJ04702.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
Length = 1048
Score = 161 bits (407), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 132/565 (23%), Positives = 225/565 (39%), Gaps = 80/565 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ G+GGP + ++ L ++I R+ ELGM+PVLP + G VP P A+
Sbjct: 206 LQNMSGYGGPTSSELIAKRAELGQRITGRLRELGMHPVLPGYFGTVPGGFAARNPGARTV 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + + P W LD +F + AF Q G H + D E P
Sbjct: 266 PQGTWSGL-ARPDW-----LDPRTEVFAKTAAAFYRHQEHLLGPADH-FKMDLLHEGGDP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ +++ A W++ GW + + LL++V +++++
Sbjct: 319 GDVP--VPDAARAVEKALRTARPGATWVILGWQNN--------PRRDLLDAVDHDRMLIV 368
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI------------------LDSI 222
D ++++ + + + GVPY + + NF G + L
Sbjct: 369 DGLSDLETVTDRERDWGGVPYAFGSIPNFGGRTTIGAKTHVWAERFPAWRDKPGSRLAGT 428
Query: 223 AFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
A+ P A R AW + Y+ RYG + A+ L + Y
Sbjct: 429 AYMPEAAERDPAAFELFSELAWRERPVDRAAWFDGYADLRYGARDKGARAAFAALGTSAY 488
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ + V A PD+ V + T ++D
Sbjct: 489 EISSKDGRPHDSVFAARPDLA-----------------ARSGTVYATHTPAFD------- 524
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
+ A + L S+ YR DL D RQALA + +L + +AY+ D
Sbjct: 525 PAAFDTAFAALLTVRPALRGSDAYRRDLTDTARQALANRSWQLIGQLQDAYRRKDRATFR 584
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
LS +L L+ + + H FLLGPWL A+ +A E+E + E +AR +T W D
Sbjct: 585 ALSGLWLHLMRLSEDVTGAHRQFLLGPWLTDARAMASGPEEEARLEHSARALLTTWADRP 644
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW-RREWIKLTND 506
+ L +Y N+ W GL+ + + P+ Y + ++L + + DW RE
Sbjct: 645 TADGGSLANYANRDWHGLIGEVHLPQWQAYLGELADALAADRPPKPFDWYARE-----EP 699
Query: 507 WQNGRNVYPVESNGDALITSQWLYN 531
W + R P+ +A T++ +++
Sbjct: 700 WTHERTTPPLHPTTEAYRTARRVHD 724
>gi|29828556|ref|NP_823190.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
gi|29605660|dbj|BAC69725.1| putative alpha-N-acetylglucosaminidase [Streptomyces avermitilis
MA-4680]
Length = 728
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 133/555 (23%), Positives = 222/555 (40%), Gaps = 77/555 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ Q LD + L ++I R+ ELGM PV P + G VP + A
Sbjct: 201 LQNLSAFPDPVSQQLLDARAALGRRIANRLRELGMTPVFPGYFGTVPPGFADRNAGAHTV 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD F + AF Q + +G S Y D E P
Sbjct: 261 PQGTWMGF-ARPDW-----LDPRTEHFTRVAAAFYRIQDEMFGGASTRYKMDLLHEGGSP 314
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + + +++ AVW++ GW + PP +A++++V +++V+
Sbjct: 315 GDVP--VGDAAKGVERALRAAHPGAVWVILGWQHN------PP--RAIVDAVDKDRMLVV 364
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILDS 221
D + P + ++G PY + + NF G+ + L
Sbjct: 365 DGLCDRFPKVTDREADWHGTPYAFGSIWNFGGHTTLGANTPDWASLYERWRTRPGSTLRG 424
Query: 222 IAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
+A P A R + AW +++ RYG P + AW++L T
Sbjct: 425 VALLPEAADNNPAAFALFSELAWREGDLDLRAWFARWARSRYGGRDPHAEAAWDILRRTA 484
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y T S +EG +G ++ ++ ++ +S+ L Y
Sbjct: 485 YGTT-------------------RADSWSEGADGLFG---ARPSLAATKAASWSPKRLRY 522
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
E AL + L S+ YR DL+D+ RQAL+ + L I AY+ D
Sbjct: 523 RPEEFEPALGELLKVRPGLRGSSAYRRDLLDVARQALSNRSRVLLPQIRTAYEAKDTARF 582
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
+L+ +L L++ ++ LLA LLG W+ A+ + + + ++A + +T+W
Sbjct: 583 DRLTGVWLALMDLLEALLATDSRHLLGRWVADARAWGASAAERDRLAYDALSLLTVWGTR 642
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
+A LRDY N+ W+GL+ Y R + YF + + G + D W L +
Sbjct: 643 AGADAG-LRDYANREWAGLVGGLYRLRWSTYFAELRSASREGRTPKKTD----WFALEDR 697
Query: 507 WQNGRNVYPVESNGD 521
W GD
Sbjct: 698 WTRNPGGLATRPTGD 712
>gi|456388164|gb|EMF53654.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
25435]
Length = 732
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 141/556 (25%), Positives = 228/556 (41%), Gaps = 80/556 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ + LD + VL ++I R+ ELGM PV P + G VPA P A+
Sbjct: 206 LQNLSAFPSPVSRQLLDARAVLGRRIADRVRELGMTPVFPGYFGTVPAGFAERVPGARTV 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD F + AF Q + +G +S +Y D E P
Sbjct: 266 PQGEWMGF-ARPDW-----LDPRTDDFARVAAAFYRVQEEMFGPSS-LYKMDLLHEGGDP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ + ++ A W++ GW + PP +A++++V ++V+
Sbjct: 319 GDVP--VADAAKGVERALRRSRPGATWVILGWQHN------PP--RAIVDAVDKQHMLVV 368
Query: 181 DLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMYG------------------ILDS 221
D ++ P + + +G PY + + NF G+ + L
Sbjct: 369 DGLSDRFPTVTDREADWGDTPYAFGSIWNFGGHTALGANTPDWAALYEKWRTKDGSRLHG 428
Query: 222 IAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
IA P A R E W +++ RYG P + AW++L T
Sbjct: 429 IALMPEAADNNPAAFALFSELAWREGELDLKTWFAEWAHARYGGRDPHAEAAWDILRRTA 488
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y T S +EG +G S+ A+ + L Y
Sbjct: 489 YGTT-------------------RADSWSEGADGLFG---SRPALNAVRAGRWSPKQLRY 526
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
++ AL + EL AS+ YR DL+D+ RQAL+ + + I AY DA +
Sbjct: 527 DAADFEPALGEMLRVRPELRASSAYRRDLLDVARQALSNRSRVMLPQIKAAYDAKDATRL 586
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
SR +L L++ +D L+A LLG W+ A+ + + ++ + +T+W
Sbjct: 587 AAASRDWLSLMDLLDELVATDSRHLLGRWVADARSWGAGAAERTELGYDNLSLLTVW--G 644
Query: 447 TQEEASL-LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 505
T+E A LRDY N+ W+GL+ Y R + YF+ + +L +G + D W L +
Sbjct: 645 TREGADAGLRDYANREWAGLVGGLYRLRWSTYFEELRAALAAGRAPKKID----WFALED 700
Query: 506 DWQNGRNVYPVESNGD 521
W E GD
Sbjct: 701 RWARNPGPLATEPTGD 716
>gi|297194750|ref|ZP_06912148.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
25486]
gi|297152431|gb|EFH31740.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
25486]
Length = 816
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 139/541 (25%), Positives = 230/541 (42%), Gaps = 68/541 (12%)
Query: 16 LDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPS--AKITQLGNWFSVKSDPR 73
++++ L ++I R+ ELGM+PVLP + G VP P A++ G W P
Sbjct: 6 IERRTELGRRITDRLRELGMHPVLPGYFGTVPDDFPGHNPGSDARVIPQGTWGGGMRRPD 65
Query: 74 WCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAA 133
W LD F ++ AF Q + +G SH + D E D P + A
Sbjct: 66 W-----LDPRTQAFSDVAAAFYRHQGELFGDVSH-FKMDLLHEGGTAGDVP--VPDAARA 117
Query: 134 IYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS 193
+ + +Q+ A W++ GW + P +L+S+ +++++D +++ +
Sbjct: 118 VETSLQTARPGATWVILGWQSNPRPV--------MLDSIDTSRVLIVDGLSDLDTVTDRE 169
Query: 194 KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSV- 252
+ G PY + G I +G +I G R +E T AW ++ G +
Sbjct: 170 ADWGGAPYAF-------GTIPNFGGRTTI--GANTDRWTEKFT-AWRDKPGSALVGTAYM 219
Query: 253 -------PAI---------------QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPS 290
PA ++AW Y + G D + A
Sbjct: 220 PEAAERDPAALELFSELAWREEKIDREAWFAEYAQIRY---GGVDHSAREAFAALAATAY 276
Query: 291 IISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 350
++ T+G+ Y S+ L + + P + RA +A L S+
Sbjct: 277 KLTSTDGR--PYDSLFSRRPSLTTAIGTAFDP------AGFDRAFAALLAVRAPLRDSDA 328
Query: 351 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 410
YR+DL D+ RQALA + L L + AY+ D +S +L+++ D + CH F
Sbjct: 329 YRHDLTDVARQALANRSRTLQLALRAAYRNKDVATFRAVSALWLKVMRLSDTMAGCHRQF 388
Query: 411 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 470
LLGPWLE AK+LA + E+ Q E ART IT W D + A+ L +Y N+ W GL+ D +
Sbjct: 389 LLGPWLEDAKRLATSPEEAVQLERTARTLITTWAD--RPTANSLSNYANRDWQGLMADVH 446
Query: 471 GPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 530
P+ + +++ +G + DW + W R+ YPV GDA T+ ++
Sbjct: 447 VPQWEAFLTEQADAMAAGRAPKSFDWYPQ----EEAWTQERHTYPVRPTGDAYSTALRVF 502
Query: 531 N 531
+
Sbjct: 503 D 503
>gi|29832531|ref|NP_827165.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
gi|29609651|dbj|BAC73700.1| putative alpha-N-acetylglucosaminidase, secreted [Streptomyces
avermitilis MA-4680]
Length = 1038
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 133/563 (23%), Positives = 223/563 (39%), Gaps = 83/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
+ NL G+GGPL + ++ L ++I R+ LGM PVLP + G+VP ++ A +
Sbjct: 212 LQNLSGYGGPLSPELIAERAGLGRRICDRLRALGMAPVLPGYYGHVPKGFVERNGGDAHV 271
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W + P W LD F + ++F Q +G+ +H + D E
Sbjct: 272 VPQGIWHGFER-PDW-----LDPRTASFAAVAKSFYRHQKDVFGKAAH-FKMDLLHEGGT 324
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P + + +Q+ A W++ GW +P + ALL+++ K+++
Sbjct: 325 AGDVP--VPGAARGVEKALQAAHPGATWVILGW--EANP------LPALLDAIDKKKMLI 374
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILD 220
+D ++ + K + G PY + + NF G + L
Sbjct: 375 VDGVSDRYTSVTDREKDWGGTPYAFGTIPNFGGRTTIGARAHLWNEKFFAWRDKAGSALA 434
Query: 221 SIAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHT 265
A+ P A + AW + Y+ RYG + Q AW L+ T
Sbjct: 435 GTAYLPEAADRDPAAFELFSELAWSAGKIDRAAWFSSYADFRYGGRDASAQKAWRALHDT 494
Query: 266 VYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
Y +D + + A PD+ + + Y L
Sbjct: 495 AYQQHAVERSDAHDSLFCARPDL------------------------AANRAAEYAPRAL 530
Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
Y AL + L S Y YDL+D+ RQALA + + +L ++ A
Sbjct: 531 TYDPGRFDAALSGLLGVAGGLRGSAAYTYDLVDVARQALAHRSRQ-YLPLLRAAYARKDA 589
Query: 385 GVF-QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
F L+ +L L+ D + H FLLGPW+ A+ LA + + ++E A+ +T+W
Sbjct: 590 AAFTSLATLWLRLMGLSDEVTGTHPAFLLGPWINDARLLATDAGERAEFERTAKVLLTVW 649
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+A L +Y + W+GL+ D+Y PR + + ++L +G D W +
Sbjct: 650 GGRATSDAGDLHEYAGREWNGLMADFYLPRWKKWLDALADALATGTPPAAVD----WFAV 705
Query: 504 TNDWQNGRNVYPVESNGDALITS 526
W R YP+ GD T+
Sbjct: 706 EEPWTRERKDYPLRPVGDPYRTA 728
>gi|409097333|ref|ZP_11217357.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Pedobacter agri
PB92]
Length = 724
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 137/557 (24%), Positives = 219/557 (39%), Gaps = 83/557 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN---VFPSA 57
M NL GWGG + Q +QKK+L R+ EL ++P+L F G VP L A
Sbjct: 195 MGNLEGWGGTNSLQLMQLQSNIQKKVLSRMKELEIDPILQGFYGMVPHDLNKKVAALKDA 254
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+I GNW + + +L T+ F + + + K YG + + F E
Sbjct: 255 QIIDQGNWVFTE----FIRPAILAPTNDKFNTVADVYYSELKKLYGSDIKFFGGEPFHEG 310
Query: 118 TPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
VD I+++ ++ MQ ++ W++QGW + ALL +
Sbjct: 311 GKKGGVD----ITAVAKSVQDVMQKNFPNSTWVLQGW--------QNNPADALLAGLKKE 358
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL---------------- 219
++++LF E W K + G +IW + NF +YG L
Sbjct: 359 NTLIIELFGENTSNWEQRKGYGGTNFIWSNVSNFGEKNGLYGRLQRFLDEVYRIKQSPYK 418
Query: 220 ----------DSIAFGPVEA--------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNV 261
+ I PV R + WI Y+ RYG + DAW V
Sbjct: 419 DYLKGVGIIPEGINNNPVAYDLMLDIAWRNEKPPLDKWITDYTTYRYGSYNKDVADAWKV 478
Query: 262 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 321
TVY+ +K + V P +E Y ++ ++ + SS+
Sbjct: 479 FTETVYSSP--VNEKGKIVYQEGP---------SESIY------CARPSLKVNPVSSWGT 521
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
Y T +A+ LFI + + S TY+ D D RQ +A ++ + +I A Q
Sbjct: 522 RKRNYDTKLFKQAVALFIKAETQFKNSETYQTDKTDFLRQVMADKGDQAYDELINAIQAK 581
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
D + + + FL ++ D LL + F L WL A L + K +NA+ QIT
Sbjct: 582 DKNAIKEKGNHFLTMILQQDSLLNNNHFFTLNRWLNQAVALGKGLPDAKNILFNAKAQIT 641
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI-ESLESGDGFRLKDWRREW 500
W + + + LRDY +K W GLL Y R ++ + + + S F
Sbjct: 642 FWGPDNNPKTT-LRDYAHKEWGGLLSSLYYNRWKLFIDDALNDKITSASTF--------- 691
Query: 501 IKLTNDWQNGRNVYPVE 517
+ W N+YP++
Sbjct: 692 YDMEVKWSKDSNLYPIK 708
>gi|329940646|ref|ZP_08289927.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
gi|329300707|gb|EGG44604.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
Length = 798
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 133/530 (25%), Positives = 210/530 (39%), Gaps = 83/530 (15%)
Query: 10 PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVK 69
P+ LD + VL +++ R+ ELGM PVLP + G VP A+ G W
Sbjct: 227 PVSTQLLDARAVLGRRLADRLRELGMVPVLPGYFGTVPPGFAARNRGARTVPQGTWMGFD 286
Query: 70 SDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS 129
P W LD LF + AF Q + +G ++H Y D E D P +
Sbjct: 287 R-PDW-----LDPRTDLFARVAAAFYRVQGELFGASTH-YKMDLLHEGGTAGDVP--VGE 337
Query: 130 LGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG---------KLVVL 180
+ ++ DAVW++ GW + PP +A+L++V G +L+V+
Sbjct: 338 AAKGVERALRRARPDAVWVLLGWRHN------PP--RAILDAVASGGPDGAAGRERLLVV 389
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILDS 221
D ++ P + + GVPY + + NF G+ + L
Sbjct: 390 DGLSDRFPTVTDREADWGGVPYAFGSIWNFGGHTTLGANTPDWARLYEAWRTKEGSALRG 449
Query: 222 IAFGPVEA---------------RTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTV 266
IA P A E AW +++ RYG + AW+VL T
Sbjct: 450 IALLPEAADNNPAAFALFSELPWHEGELDLKAWFARWARSRYGAYDAHAEAAWDVLRRTA 509
Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326
Y T S +EG +G ++ ++ +S+ L Y
Sbjct: 510 YGTT-------------------RADSWSEGADGLFG---ARPSLTARRAASWSPKELRY 547
Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386
E RAL+ + L S+ YR DL+D+ RQ L+ + L I A D
Sbjct: 548 DAHEFERALDELLKVRPGLRESSAYRRDLLDVARQCLSNRSRALLPRIARACAARDVKAF 607
Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 446
S +L L++ ++ L+ LLG W A+ +E + + +++A + +T+W
Sbjct: 608 DAASGDWLSLMDLLERLVGTDARHLLGRWTAQARAWGADEAERDRLQYDALSLLTVWGTR 667
Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
EA LRDY N+ W+GL+ Y R + YF + +L G DW
Sbjct: 668 QGAEAG-LRDYANREWAGLVGGLYRLRWSTYFTELRAALTEGRAPAAVDW 716
>gi|456390168|gb|EMF55563.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
25435]
Length = 1042
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 135/563 (23%), Positives = 227/563 (40%), Gaps = 83/563 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
+ NL G+GGPL + ++ L ++I R+ ELGM+PVLP + G+VP ++ A +
Sbjct: 212 LQNLSGYGGPLSPQLIARRAGLGRRITDRLRELGMSPVLPGYYGHVPKQFVERNGGDAHV 271
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W + P W LD F + +F +G +H + D E
Sbjct: 272 VPQGLWHGFER-PDW-----LDPRTDSFARVAASFYGHVRDVFGAAAH-FKMDLLHEGGT 324
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P + + + DA+W++ GW +P + LL+++ ++++
Sbjct: 325 AGDVP--VPDAARGVERALHKAHPDAIWVILGW--QENP------LPELLDAIDRSRMLI 374
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG------------------ILD 220
+D ++ + + + G PY + + NF G + L
Sbjct: 375 VDGVSDRYASVTDRERDWGGTPYCFGTIPNFGGRTTIGARAHLWTDKFFAWRDKPDSALV 434
Query: 221 SIAFGPVEARTSENTTM----------------AWINQYSVRRYGRSVPAIQDAWNVLYH 264
A+ P EA + AW + Y+ RYG A + AW L+
Sbjct: 435 GTAYMP-EATDRDPAAFELFSELAWTPGKIDRAAWFSAYADFRYGGRDDAARAAWRALHE 493
Query: 265 TVYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 323
T Y +D + + A PD+ + A T +YD
Sbjct: 494 TAYQQRAVERSDPHDSLFCARPDL-----------------AADRAAEYAPRTLTYDPGR 536
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
+ A L +A G + + YRYD++DL RQALA + + + A++ D
Sbjct: 537 F-----DAAFAGLLDVAGGRRRNPA--YRYDVVDLARQALAHRSRQYLPQLRAAHRRKDL 589
Query: 384 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
LS +L L+ D + FLLGPW+ A+ LA ++ + ++E A+ IT+W
Sbjct: 590 TTFRALSTLWLRLMRLSDEVTGTDGAFLLGPWVNDARLLATDDAERAEFERTAKVLITVW 649
Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
+ L +YGN+ W+GL+ D+Y PR + + ++L +G D W
Sbjct: 650 GGRATSDTGDLHEYGNREWNGLMADFYVPRWQKWLDALEDALATGTAPAAVD----WFAF 705
Query: 504 TNDWQNGRNVYPVESNGDALITS 526
W R YP+ GDA T+
Sbjct: 706 EEPWTRERKDYPLRPVGDAYRTA 728
>gi|380804373|gb|AFE74062.1| alpha-N-acetylglucosaminidase precursor, partial [Macaca mulatta]
Length = 265
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 92/145 (63%), Gaps = 3/145 (2%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 124 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 183
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 184 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 240
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDA 145
+P Y+++ A+Y M + D++A
Sbjct: 241 SSAPSYLAAATTAVYEAMIAVDTEA 265
>gi|339238239|ref|XP_003380674.1| GDP-L-fucose synthetase [Trichinella spiralis]
gi|316976398|gb|EFV59699.1| GDP-L-fucose synthetase [Trichinella spiralis]
Length = 1203
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 110/449 (24%), Positives = 190/449 (42%), Gaps = 54/449 (12%)
Query: 89 EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 148
+G + + L+ Y H Y+ D F+E P ++ ++ AIY+ M S D +VW+
Sbjct: 801 HVGNEVVWKSLENYFGLFHAYSADPFNEMVPNTFDVMFLRNVSFAIYNVMLSVDPKSVWV 860
Query: 149 MQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 207
+Q W+F W + K L +VP G ++V+DL+AE P++ FY P+IWCMLH
Sbjct: 861 LQSWMFLSSERWLENENAKHFLTAVPTGSILVVDLYAEEYPLYEKFSGFYNQPFIWCMLH 920
Query: 208 NFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
NF G +YG L I + T N M G S+ I + V+Y
Sbjct: 921 NFGGVQGLYGNLARINQKLADVSTVSNINMVGT--------GLSMEGIDQNY-VVYQMAL 971
Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
+ ++ D+ + + + +T+ Y +G
Sbjct: 972 DRFWSPNNQKVDLAAWYIYIHLG-VGITKSIYTAWGA----------------------- 1007
Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
F+ S + Y DL++LT+ AL +L+ + +Y
Sbjct: 1008 ----------FLQSSRTCQENEIYINDLVELTKHALMLTGAKLYEKLQASYIRKCGQEFL 1057
Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
+ + +++ D++ + H +L W+E A+ + Q Q E N R Q+T+W
Sbjct: 1058 ENAAAVEQVLSDLEWISKTHSRSMLSKWIEIARANGKTAAQSDQLEENLRMQVTIW--GP 1115
Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM---IESLESGDGFRLKDWRREWIKLT 504
Q E + DY K W+ L +YY PR ++F ++ I LE+ + L I+L
Sbjct: 1116 QGE---IVDYARKQWAALFSEYYLPRWRLFFAHLYADILQLETFNQTLLNSRLFHEIELP 1172
Query: 505 NDWQNGRNVYPVESNGDALITSQWLYNKY 533
Q N+ + G+ ++ S+ LY++Y
Sbjct: 1173 FALQKIPNI--DQPTGNTVVVSKILYSRY 1199
>gi|212722968|ref|NP_001131519.1| uncharacterized protein LOC100192858 [Zea mays]
gi|194691748|gb|ACF79958.1| unknown [Zea mays]
Length = 114
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 61/99 (61%), Positives = 75/99 (75%)
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 501
MWFDNT+ +ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+ F LK+WRREWI
Sbjct: 1 MWFDNTETKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWI 60
Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 540
LTN+WQ+ R V+ + GD L SQ LY KYL +
Sbjct: 61 SLTNNWQSDRKVFSTTATGDPLNISQSLYTKYLSNADLL 99
>gi|328867426|gb|EGG15808.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
Length = 992
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 150/321 (46%), Gaps = 43/321 (13%)
Query: 212 NIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTD 271
N MY +++ +A+ RT+ WINQY+ RRYG VP + AWN+L TV+N T
Sbjct: 232 NYMMYDLMNEMAW-----RTTAPNMTEWINQYTQRRYGVFVPELAQAWNLLIPTVFNATL 286
Query: 272 GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEV 331
G YG P S + + D L+Y S V
Sbjct: 287 GY----------------------------YGPPSSFVGMRPQLNMTND---LYYDPSVV 315
Query: 332 IRALELFIASGNE-LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLS 390
+A +L++ +E + ++ T+ +D+ ++T QAL+ + + + +AY N + + +
Sbjct: 316 QQAWQLYLGVTDEYVLSTATFSFDVSEITLQALSNLFMDTQMAMYDAYLTNQSTVFEERA 375
Query: 391 RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE--QEKQYEWNARTQITMWFDNTQ 448
L ++ DMD + A L+G W +A+Q A N + +E+NAR QIT+W
Sbjct: 376 TSCLNIITDMDTIAATQQMLLVGTWTANARQWALNTSSGETAPFEFNARNQITLW----G 431
Query: 449 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQ 508
S L DY WSGLL D+Y R A++ KYM SL + F D+ + L W
Sbjct: 432 PPNSSLHDYAYHLWSGLLNDFYFARWALFIKYMDTSLSTNTTFNNTDYTNDIESLEESWN 491
Query: 509 NGRNVYPVESNGDALITSQWL 529
N YP G+A + S+++
Sbjct: 492 NQNYQYPTLPTGNAYLLSKFI 512
>gi|402824586|ref|ZP_10873940.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
gi|402261896|gb|EJU11905.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
Length = 486
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 123/252 (48%), Gaps = 39/252 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+ PL W++++ LQ +IL R+ LGM PVLPAF+G VP A P A+I
Sbjct: 223 MGNLAGYRAPLSSGWIEKKHQLQLRILARMRALGMKPVLPAFAGYVPEAFAKAHPKARIY 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD +DPLF ++ F+ + YG + Y D F+E PP
Sbjct: 283 KMRAWEGFPP------TYWLDPSDPLFTQLAARFVTLYNRTYGEGEY-YLADAFNEMIPP 335
Query: 121 V-------DSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + EY +++ G +Y + + A W+MQ
Sbjct: 336 IAEDGSDAAAAEYGDSIANTAATRAAALPPAVRDARLAAYGERLYGSITAAAPKATWVMQ 395
Query: 151 GWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D +R P+ + A L+ VP ++++LD+ + P IW + F G + + +HN
Sbjct: 396 GWLFGADKAFRTPEAIAAFLSRVPDDRMLILDIGNDRYPGIWQKTDAFDGKAWTYGYVHN 455
Query: 209 FAGNIEMYGILD 220
+ G+ +YG L+
Sbjct: 456 YGGSNPVYGDLE 467
>gi|84625358|ref|YP_452730.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|84369298|dbj|BAE70456.1| putative N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
MAFF 311018]
Length = 590
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 124/521 (23%), Positives = 202/521 (38%), Gaps = 111/521 (21%)
Query: 54 FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 113
P A+I ++ W TY LD DPLF ++ R F+E + YG Y D
Sbjct: 46 LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 98
Query: 114 FDENTPPVD-------SPEY-----------------------ISSLGAAIYSGMQSGDS 143
F+E PPV + +Y +++ G A+Y + +
Sbjct: 99 FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 158
Query: 144 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 201
A W+MQGWLF D FW+P + A L VP +L+VLD+ + P W S+ F +
Sbjct: 159 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 218
Query: 202 IWCMLHNFAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM------- 237
I+ +HN+ + +YG + + FG N+ +
Sbjct: 219 IYGYVHNYGASNPLYGDVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYLYAL 278
Query: 238 ----------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 287
W+ QY RYGRS A+ AW L +Y +
Sbjct: 279 AWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTRYWS-------------- 324
Query: 288 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 347
P + G Y + +P + D P + A++ + + +
Sbjct: 325 -PRWWNTHAGAYLLFKRPTADIVNFD------DRPG---DPQRLRSAIDALLQQADRYAD 374
Query: 348 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLAC 406
+ YRYDLI+ R L+ A+ +++AY D A G QL+R +LV+ +D L+
Sbjct: 375 APLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDALVGG 433
Query: 407 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 466
L ++A + + + Y NAR Q+++W + L DY +K W G+
Sbjct: 434 QHETLAAWTGQAAAAVGNDARLLRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMY 488
Query: 467 RDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 500
D+Y R + + ++G F +L W R+W
Sbjct: 489 ADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 529
>gi|293371910|ref|ZP_06618314.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292633156|gb|EFF51733.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 411
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 114/449 (25%), Positives = 188/449 (41%), Gaps = 89/449 (19%)
Query: 129 SLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVK 187
+ + +Y+ + + D A W+ W+F +D W +MKALL VP K+++LD E
Sbjct: 3 KIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMILLDYHCENV 62
Query: 188 PIWSTSKQFYGVPYIWCMLHNFAGNIEMYGI----------------------------L 219
+W ++ F+ PYIWC L NF GN + G L
Sbjct: 63 ELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKGIGSTLEGL 122
Query: 220 DSIAFGPVEARTSENTTM-----AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGAT 274
D + F P E + + WI + R G +++DAW L++ +Y
Sbjct: 123 DVMQF-PYEYILEKAWNLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDIY------- 174
Query: 275 DKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA 334
V P T G Y +P + K ++ Y + L EV R
Sbjct: 175 -------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVELL----EVWRK 214
Query: 335 LELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR 392
L NE + + +R DLI + RQ L Y ++ + + D + +
Sbjct: 215 L-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQALKACGEK 267
Query: 393 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 452
E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 268 MKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW-------GG 320
Query: 453 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE----WIKLTN 505
L DY ++ W+GL+ DYY R +Y I+ + G D +L+D +E W+ T+
Sbjct: 321 SLNDYASRSWAGLISDYYAKRWEVYINTFIKVVGEGVEVDQKQLEDELKEIEEGWVNATD 380
Query: 506 DWQNGRNVYPVESNGDALIT-SQWLYNKY 533
++V+ S D L++ S +L++KY
Sbjct: 381 RKDTRKDVH---STTDGLLSFSTFLFSKY 406
>gi|58583545|ref|YP_202561.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
gi|58428139|gb|AAW77176.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
Length = 753
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 122/521 (23%), Positives = 200/521 (38%), Gaps = 111/521 (21%)
Query: 54 FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 113
P A+I ++ W TY LD DPLF ++ R F+E + YG Y D
Sbjct: 209 LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 261
Query: 114 FDENTPPV------------------------------DSPEYISSLGAAIYSGMQSGDS 143
F+E PPV +++ G A+Y + +
Sbjct: 262 FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 321
Query: 144 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 201
A W+MQGWLF D FW+P + A L VP +L+VLD+ + P W S+ F +
Sbjct: 322 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 381
Query: 202 IWCMLHNFAGNIEMYGIL-----------------DSIAFGPVEARTSENTTM------- 237
I+ +HN+ + +YG + + FG N+ +
Sbjct: 382 IYGYVHNYGASNPLYGDVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYLYAL 441
Query: 238 ----------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 287
W+ QY RYGRS A+ AW L +Y
Sbjct: 442 AWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTR---------------YW 486
Query: 288 DPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSA 347
P + G Y + +P + ++ + D + A++ + + +
Sbjct: 487 SPRWWNTHAGAYLLFKRPTAD--IVNFDDRPGD-------PQRLRSAIDALLQQADRYAD 537
Query: 348 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLAC 406
+ YRYDLI+ R L+ A+ +++AY D A G QL+ R +LV+ +D L+
Sbjct: 538 APLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLA-RTTQLVQGLDALVGG 596
Query: 407 HDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLL 466
L ++A + + + Y NAR Q+++W + L DY +K W G+
Sbjct: 597 QHETLAAWTGQAAAAVGNDARLLRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMY 651
Query: 467 RDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 500
D+Y R + + ++G F +L W R+W
Sbjct: 652 ADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 692
>gi|47212645|emb|CAF95026.1| unnamed protein product [Tetraodon nigroviridis]
Length = 121
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 56/121 (46%), Positives = 83/121 (68%), Gaps = 3/121 (2%)
Query: 20 LVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYL 79
L LQ KIL ++ GM PVLPAFSGNVP + ++P A++T+LG W K + + C+Y+
Sbjct: 4 LSLQFKILEQMRSFGMTPVLPAFSGNVPKGILRLYPEARVTRLGPW--SKFNCSFSCSYI 61
Query: 80 LDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQ 139
LD DPLF+ IG ++ Q +K++G T+HIYN DTF+E TPP P Y+S++ A+++ M
Sbjct: 62 LDPRDPLFLRIGSLYLAQVVKQFG-TNHIYNTDTFNEMTPPSSEPNYLSAVSRAVFAAMT 120
Query: 140 S 140
+
Sbjct: 121 A 121
>gi|281423204|ref|ZP_06254117.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
gi|281402540|gb|EFB33371.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
Length = 291
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 112/240 (46%), Gaps = 22/240 (9%)
Query: 247 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 306
RYG++ P I+ AW +L T+YNC G + SI G+P
Sbjct: 4 RYGKTSPEIERAWQLLSETIYNCPAGNNQQG---------PHESIFC---------GRP- 44
Query: 307 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 366
++ + S+ +Y + A +L ++ +N + YDL+D+ RQALA
Sbjct: 45 ---SLNNFQVKSWSKMRNYYDLQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALADQ 101
Query: 367 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 426
+L I Y + + RFLE++ D LL F LG W E+A++L +
Sbjct: 102 GRLQYLKTIADYNGFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGTTQ 161
Query: 427 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 486
+++ YEWNAR QIT W + + L DY +K W G+L+D+Y R I+ + + +E
Sbjct: 162 QEKDLYEWNARVQITTWGNRMCADKGGLHDYAHKEWQGILKDFYYKRWKIFMDALAKQME 221
>gi|169351438|ref|ZP_02868376.1| hypothetical protein CLOSPI_02218 [Clostridium spiroforme DSM 1552]
gi|169291660|gb|EDS73793.1| F5/8 type C domain protein [Clostridium spiroforme DSM 1552]
Length = 1762
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 141/564 (25%), Positives = 227/564 (40%), Gaps = 111/564 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GGP+ W+ +L + ++ LGM VL ++G VP + + +I
Sbjct: 816 MDNMEVIGGPVSDEWVKGRLEMARENQRWKNSLGMQTVLQGYAGMVPNNFTD-YQDVEIL 874
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP- 119
+ GNW V PR ++ L+ + + F E Q +G+TS+ Y D F E
Sbjct: 875 EQGNWCGV---PR---PDMIRTDGELYDQYAKLFYEAQEWAFGKTSNYYAVDPFHEGGKR 928
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKAL--LNSVPLGKL 177
P D + + + + + + D +AVW++Q W W P L + +
Sbjct: 929 PSDLTDDV--ISREVLNSLLEYDQEAVWMVQAW-------WSNPTNDLLKGMGDDREDHV 979
Query: 178 VVLDLFA------------EVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG----ILDS 221
++LDL E S +F ++WCML N+ GN M G I++
Sbjct: 980 IILDLNGLNDAYDSYWDKTEYNGTVLESDEFNSTSWVWCMLENYGGNPSMDGRPKEIINR 1039
Query: 222 IAFGPVEAR-------TSENT-----------TMA----------WINQYSVRRYGRSVP 253
I +A SE T MA W+++Y +RRYG
Sbjct: 1040 INKASTQAEHMKGIGFISEATYDNPMIYELLLDMAWQQDTIDLDDWLDEYVLRRYGDYSE 1099
Query: 254 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
+ +AW++L TVY+ + TD ++A DPS++ YG P
Sbjct: 1100 SAGEAWDILLKTVYSRSGKTTD-----VIARS--DPSLVQ--------YGLP-------- 1136
Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
Y+ SE+ ALEL ++LSAS YRYDL ++ RQ + YA +
Sbjct: 1137 ------------YTASELEEALELLYKDYDKLSASEAYRYDLTEIMRQVVNNYAVVRLGD 1184
Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK-QY 432
+ AY + L ++L ++ ++ + L+G W+ A A++ + Y
Sbjct: 1185 LKTAYDAKEIDNFKSLKEQYLNAIDLLNEVCGTQQDLLIGEWVGRAVDWAKDTNSDDFAY 1244
Query: 433 E---WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 489
+ NA+T IT+W +T L Y + + G++ D Y Y E LE G
Sbjct: 1245 DSMIINAKTLITVWAPSTT-----LGTYAYRNYEGMINDIYKVIWQAYLDQSEEILEFGS 1299
Query: 490 G----FRLKDWRREWIKLTNDWQN 509
D +WI D QN
Sbjct: 1300 AKTNLVNYHDLCMDWIYADWDLQN 1323
>gi|281423203|ref|ZP_06254116.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
gi|281402539|gb|EFB33370.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
Length = 450
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 78/226 (34%), Positives = 113/226 (50%), Gaps = 24/226 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW +QQ LQKKIL R++E GM PVLP F G +P + +T
Sbjct: 186 MNNLEGWGGPLPDSWYNQQEALQKKILKRMHEYGMQPVLPGFCGMMPHDAKAKL-GLNVT 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L TD +I + + YG+ ++ Y+ D F E
Sbjct: 245 DGGIWNGYTRPAN------LSPTDAHSDKIADLYYAELTNLYGKANY-YSMDPFHETND- 296
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D S G + M+ + +A W++QGW + +P RP +K + N G L+VL
Sbjct: 297 -DEAIDYSKAGRKVMEAMKRVNPNATWVIQGW--TENP--RPQMIKNMKN----GDLLVL 347
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
DLF+E +P IW K + +++CML NF N+ ++G +D
Sbjct: 348 DLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMD 393
>gi|255079272|ref|XP_002503216.1| GH family 89 protein [Micromonas sp. RCC299]
gi|226518482|gb|ACO64474.1| GH family 89 protein [Micromonas sp. RCC299]
Length = 1260
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/179 (37%), Positives = 96/179 (53%), Gaps = 7/179 (3%)
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
K +LG + K D + LD +D LF +G AF +Q ++++G T H+Y DTF E
Sbjct: 396 KDAELGKY--AKKDDSVRSVHFLDPSDALFQSLGAAFTKQLVEDFG-TDHLYLADTFREI 452
Query: 118 TPPVD--SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL 174
P D S ++ +GAA + M+S D A W+ Q F +P FW + ALL SV +
Sbjct: 453 RDPNDDFSETHVVRVGAATLAAMRSADPRATWVFQSDAFRRNPRFWNEGRRGALLRSVDI 512
Query: 175 GKLVVLDLFAEVKPIW-STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS 232
G ++VLD AE P + F G P++WC+ HN GN+ M G L +IA GP A S
Sbjct: 513 GDMLVLDSAAETDPYYLREPVHFAGQPFVWCVKHNHGGNLGMRGRLSAIATGPAAAMDS 571
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 65/220 (29%), Positives = 98/220 (44%), Gaps = 34/220 (15%)
Query: 256 QDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 313
++AW +L TVY D++ RD P + +S Y + K V
Sbjct: 727 REAWEILGKTVYGARAKGEDEDHVRDACSWQPSLRADELSP---DYFDAAKVVD------ 777
Query: 314 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 373
Y+ +I A A+G A YD++D+ RQ LA+ +N L
Sbjct: 778 ------------YAFKPLIDAAPTLRANG----AGTRVDYDIVDVGRQLLARQSNVLATQ 821
Query: 374 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA---QNEEQEK 430
I ++ N+A + LEL++DMD LL H GFLLG ++ESAK A E E
Sbjct: 822 IRDSLNSNNASEAKMYGTQMLELLDDMDALLRSHKGFLLGNYIESAKSWAGKRNKESDEA 881
Query: 431 QYEWNARTQITMWFDNTQEEASLL----RDYGNKYWSGLL 466
E +AR+ I+ + + + + L DY N+ WSG+L
Sbjct: 882 NLERSARSLISGFGPSGSKLGAPLGHPMHDYSNRQWSGML 921
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 38/61 (62%)
Query: 7 WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWF 66
W G P+ WL +Q LQ+ + + + GM PVLP F+G+VP A+ FP AK+ ++ NW
Sbjct: 217 WTGGRPKKWLKRQWDLQRDAVKLMRDFGMTPVLPGFNGHVPPAIARRFPEAKLRRVENWL 276
Query: 67 S 67
+
Sbjct: 277 T 277
>gi|293402299|ref|ZP_06646437.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
5_2_54FAA]
gi|291304406|gb|EFE45657.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
5_2_54FAA]
Length = 2330
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 122/532 (22%), Positives = 213/532 (40%), Gaps = 97/532 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGP+P ++ ++ L + LGMN VL ++G VP P+ +T
Sbjct: 670 MQNIETFGGPIPDQYVVDRVELARTTQRWKNSLGMNTVLQGYAGMVPTNFNEFQPNVPLT 729
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+W + + P T P + E + F E Q YG TS Y D + E
Sbjct: 730 AQKSWGGL-ARPSMIPT-----DSPYYDEYAKLFYEAQEYIYGATSDYYAVDPYHEGGT- 782
Query: 121 VDSPEYIS--SLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSV---PLG 175
PE +S ++ + + + D DAVW++Q W+ LLN +
Sbjct: 783 --RPEGLSDETVAREVLNSLLDYDKDAVWVVQA--------WQSNPTDGLLNGMGEYREN 832
Query: 176 KLVVLDLFAEVKPIWSTSK----QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
++++DL PI S +K +F G + W +L F GN M G + ++ ++
Sbjct: 833 HVLIVDLIK--YPIKSWTKYNKSEFKGTSWAWGLLGGFGGNPTMNGEMQTM-VNDIQTAK 889
Query: 232 SENTTMA---------------------------------WINQYSVRRYGRSVPAIQDA 258
E T MA W+N+Y RRYG + ++A
Sbjct: 890 KERTHMAGLGIISEAQYDNPVLYDLIFDLAWVDDDFSLDQWLNKYIERRYGGTSDNAKEA 949
Query: 259 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 318
W ++ + YN V F + + Q+YGK
Sbjct: 950 WKIMKNANYNHG-----------VRFT---AQVYGMKGKSPQDYGK-------------- 981
Query: 319 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 378
++ Y ++ A L I ++ S YRYDL ++ RQ ++ Y+ + N+I+A
Sbjct: 982 ---QNISYGADKLETAFRLLIEDYDKFKDSECYRYDLTEIMRQMVSNYSTLTYNNVIDAR 1038
Query: 379 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNA 436
+ + + +FL+ + ++ + L G W+ A+ A + + + +E NA
Sbjct: 1039 EDKNIEKFKEEKAKFLKSFDVLNDIQETQVDQLAGEWIGKAQDRAADYDDFAKDAFEMNA 1098
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 488
+ IT W ++ A L+DY + + G+ D Y Y + +LE+G
Sbjct: 1099 KALITSWA--SRSSAGGLKDYAWRNYQGMFIDLYKQNWIDYLDQVEANLENG 1148
>gi|260821254|ref|XP_002605948.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
gi|229291285|gb|EEN61958.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
Length = 673
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 83/289 (28%), Positives = 122/289 (42%), Gaps = 66/289 (22%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLPQSW QL LQ KIL R+ N + L +++ +
Sbjct: 368 MGNIRGWGGPLPQSWHQNQLELQHKILARMR-------------NFDSTLMHLY----LD 410
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G ++ C T + +C E
Sbjct: 411 YSGGDLKTRTVAHTCWTLRI-----------------------------HCFLTLEECLL 441
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P Y+S GAA+Y+GM +GD A+WLMQGWLF FW+P Q KALL SVP G +
Sbjct: 442 LSEPNYLSKAGAAVYAGMLAGDPQAIWLMQGWLFQARDFWQPAQTKALLQSVPEGPFL-- 499
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCML--HNFAGNIEMYGILDSIAFGPVEARTSENTTMA 238
++++ G + L N MY +++ +A+ P + +N
Sbjct: 500 ------------ARKYLGSTMVGTGLTPEGIDQNYIMYELMNEVAWMPQPFQILDN---- 543
Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDV 287
W + Y+ RYG W +L +VY+C +G D V+V PD+
Sbjct: 544 WASDYAWSRYGVKNSNASLGWQILLKSVYDCENGFKDHCDSVVVHRPDL 592
>gi|386386798|ref|ZP_10071901.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
gi|385665738|gb|EIF89378.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
Length = 1033
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 67/193 (34%), Positives = 99/193 (51%), Gaps = 6/193 (3%)
Query: 334 ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRF 393
A +A+G EL S+TYR+DL +L RQALA + L + A D +S +
Sbjct: 531 AFAALLAAGPELRDSDTYRHDLTELARQALANRSRTLQFALRAARASKDVAAFRGVSALW 590
Query: 394 LELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASL 453
L+L+ D + CH FLLGPWLE AK+LA + + + E AR IT W D + A+
Sbjct: 591 LKLMRLADTMAGCHRSFLLGPWLEDAKRLATSPAEAVELERTARALITTWAD--RPAANA 648
Query: 454 LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNV 513
L +Y N+ W+GL+ D + P+ + + ++LE+G + DW + W R V
Sbjct: 649 LSNYANRDWNGLIADVHVPQWDAFLTEVADALEAGRAPKSFDWYPQ----EEAWTKDRRV 704
Query: 514 YPVESNGDALITS 526
YP GD T+
Sbjct: 705 YPSAPTGDPYATA 717
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 102/215 (47%), Gaps = 17/215 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGPL ++ LD + L +KI R+ ELGM PVLP + G VP + P A++
Sbjct: 213 LQNMSEYGGPLSKTLLDARAELGRKITARLRELGMRPVLPGYFGTVPDGFADRNPGARVV 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W ++ P W LD +F ++ AF Q K +G ++ D E
Sbjct: 273 AQGLWNGLRR-PDW-----LDPRTTVFPKVAAAFYRHQTKLFG-ACDLFKMDLLHEGGNA 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ +++ +AVW++ GW + +ALL++V +++++
Sbjct: 326 GDVP--VPDAARAVEKALRTARPNAVWVILGWQSN--------PRRALLDAVDKRRMLIV 375
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEM 215
D +++ ++ G PY + + NF G +
Sbjct: 376 DGLSDLDTTGDRESEWGGTPYAFGTIPNFGGRTTL 410
>gi|297723521|ref|NP_001174124.1| Os04g0650900 [Oryza sativa Japonica Group]
gi|255675839|dbj|BAH92852.1| Os04g0650900, partial [Oryza sativa Japonica Group]
Length = 128
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 49/64 (76%), Positives = 56/64 (87%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 64 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 123
Query: 61 QLGN 64
LGN
Sbjct: 124 HLGN 127
>gi|302522684|ref|ZP_07275026.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
gi|302431579|gb|EFL03395.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
Length = 355
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 72/258 (27%), Positives = 122/258 (47%), Gaps = 23/258 (8%)
Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 298
W +Y+ RYG S + AW++L TVY D+ + P++ +V+ GK
Sbjct: 84 WFREYARVRYGGSDAHAEAAWDILRTTVYGTRR--DDRWSEPADGLFGARPALDAVSAGK 141
Query: 299 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 358
+ L+ +S++ AL+ ++ EL S TYR DL+D+
Sbjct: 142 WS--------PKALRYPAASFEP------------ALDELLSVRAELRDSATYRRDLLDV 181
Query: 359 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 418
RQALA + L + AY+ + +L RR++ L++ ++ L+A + LLG W+ES
Sbjct: 182 ARQALANRSRTLLPRLAAAYKAKNQAEFARLGRRWIALIDLLEQLVATDENHLLGRWVES 241
Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
A+ + ++ Q +++A + +T W +A LRDY N+ WSGL+ Y R Y
Sbjct: 242 ARAWGGSAREKSQLQYDALSLLTTWGTRQGADAG-LRDYANREWSGLVGGLYRLRWGTYI 300
Query: 479 KYMIESLESGDGFRLKDW 496
+ +L+ G DW
Sbjct: 301 DELSAALKEGRKPVAVDW 318
>gi|293369245|ref|ZP_06615835.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
gi|292635670|gb|EFF54172.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
Length = 221
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 102/214 (47%), Gaps = 7/214 (3%)
Query: 322 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 381
P + Y +++ A L ++ + ++Y +DL+++ RQ L Y N + AY+
Sbjct: 11 PTIEYQPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAG 68
Query: 382 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 441
D + + E++ D+D L++CH F L W+ A+ + + + YE NAR+ IT
Sbjct: 69 DIPMMKNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHDAASKNYYEMNARSLIT 128
Query: 442 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 501
+W D+ L DY N+ W+GL YY R + +IE+ E F +++ +
Sbjct: 129 IWGDSYH-----LTDYANRSWAGLTNQYYSVRWDHFINEVIEAAEKKKNFDEEEFFNQSR 183
Query: 502 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 535
N+W N N GD + ++ +Y KY +
Sbjct: 184 MYENEWVNPSNRISYNEGGDGIKLARQIYKKYAK 217
>gi|293371911|ref|ZP_06618315.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
gi|292633157|gb|EFF51734.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
Length = 289
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 3/98 (3%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 184 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 243
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 98
LG W R C + L+ D LF +I + F+++Q
Sbjct: 244 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQ 278
>gi|321458423|gb|EFX69492.1| hypothetical protein DAPPUDRAFT_35389 [Daphnia pulex]
Length = 132
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 78/137 (56%), Gaps = 5/137 (3%)
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 414
++DLTRQ++ + + L+ ++E Y ++ + ++ + + L++D+D L+ FLLG
Sbjct: 1 MVDLTRQSMQEIFHLLYSKLLEVYLEKNSTAIEGIAYKMINLLQDLDELIQTGKTFLLGK 60
Query: 415 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 474
W+ AK E ++ QYEWNAR QIT+W + +RDY K W+G++ DYY P
Sbjct: 61 WIADAKSWGTTEGEKLQYEWNARNQITLWGPRGE-----IRDYAAKKWAGVVADYYKPHW 115
Query: 475 AIYFKYMIESLESGDGF 491
++ + M SL+ F
Sbjct: 116 EVFIREMQMSLDENRAF 132
>gi|342731751|ref|YP_004770590.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-Japan]
gi|342329206|dbj|BAK55848.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Japan]
Length = 898
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 115/527 (21%), Positives = 209/527 (39%), Gaps = 105/527 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 352 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 408
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 409 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 461
Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 462 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 509
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 510 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 569
Query: 228 EAR-----------TSENTTM--------------------AWINQYSVRRYGRSVPAIQ 256
+A+ TSE ++ +Y RYG+S +
Sbjct: 570 DAKYNSDYLSGIANTSEGVGFNNFIDELSTELIFSDEVNMDEFVKRYLKNRYGKSDRDLL 629
Query: 257 DAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
A+N+L TVYN TD GA++ VI A P ++ +
Sbjct: 630 VAFNILLDTVYNPVTDIYHEGASES---VINARPSLEIN--------------------- 665
Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E +
Sbjct: 666 ---SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYY 722
Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
I E Y + + +S++FL L+ +L+ +D L + L ++ +
Sbjct: 723 QIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFKDT 782
Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
++N + +T W+D E LRDY N D+Y +Y+
Sbjct: 783 LKYNKKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 822
>gi|417965571|ref|ZP_12607078.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-4]
gi|380336329|gb|EIA26351.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-4]
Length = 685
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 111/523 (21%), Positives = 208/523 (39%), Gaps = 97/523 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 146 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 202
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 203 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 255
Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 256 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 303
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 304 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 363
Query: 228 EAR-----------TSENTTM--------------------AWINQYSVRRYGRSVPAIQ 256
+A+ TSE ++ +Y RYG+S +
Sbjct: 364 DAKYNSDYLSGIANTSEGVGFNNFIDELSTELIFSDEVNMDEFVKRYLKNRYGKSDRDLL 423
Query: 257 DAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
A+N+L TVYN TD + S+I+ ++ ++ +
Sbjct: 424 VAFNILLDTVYNPVTD----------IYHEGASESVIN-------------ARPSLGINS 460
Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E + I
Sbjct: 461 ASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQIIQ 520
Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 435
E Y + + +S++FL L+ +L+ +D L + L ++ + ++N
Sbjct: 521 EYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFKDTLKYN 580
Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
+ +T W+D E LRDY N D+Y +Y+
Sbjct: 581 KKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 616
>gi|417967717|ref|ZP_12608785.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-co]
gi|380340884|gb|EIA29424.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-co]
Length = 741
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 111/523 (21%), Positives = 208/523 (39%), Gaps = 97/523 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 195 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 252 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 304
Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 305 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 352
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 353 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 412
Query: 228 EAR-----------TSENTTM--------------------AWINQYSVRRYGRSVPAIQ 256
+A+ TSE ++ +Y RYG+S +
Sbjct: 413 DAKYNSDYLSGIANTSEGVGFNNFIDELSTELIFSDEVNMDEFVKRYLKNRYGKSDRDLL 472
Query: 257 DAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 315
A+N+L TVYN TD + S+I+ ++ ++ +
Sbjct: 473 VAFNILLDTVYNPVTD----------IYHEGASESVIN-------------ARPSLGINS 509
Query: 316 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 375
S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E + I
Sbjct: 510 ASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQIIQ 569
Query: 376 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 435
E Y + + +S++FL L+ +L+ +D L + L ++ + ++N
Sbjct: 570 EYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFKDTLKYN 629
Query: 436 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
+ +T W+D E LRDY N D+Y +Y+
Sbjct: 630 KKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 665
>gi|384455191|ref|YP_005667784.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Yit]
gi|418016862|ref|ZP_12656425.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|418371995|ref|ZP_12964091.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-SU]
gi|345505596|gb|EGX27892.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|346983532|dbj|BAK79208.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Yit]
gi|380342872|gb|EIA31299.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-SU]
Length = 898
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 115/527 (21%), Positives = 208/527 (39%), Gaps = 105/527 (19%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 352 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 408
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 409 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 461
Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 462 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 509
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 510 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 569
Query: 228 EAR-----------TSENTTM--------------------AWINQYSVRRYGRSVPAIQ 256
+A+ TSE ++ +Y RYG+S +
Sbjct: 570 DAKYNSDYLSGIANTSEGVGFNNFIDELSTELIFSDEVNMDEFVKRYLKNRYGKSDRDLL 629
Query: 257 DAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 311
A+N+L TVYN TD GA++ VI A P + +
Sbjct: 630 VAFNILLDTVYNPVTDIYHEGASES---VINARPSLGIN--------------------- 665
Query: 312 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 371
S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E +
Sbjct: 666 ---SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYY 722
Query: 372 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
I E Y + + +S++FL L+ +L+ +D L + L ++ +
Sbjct: 723 QIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFKDT 782
Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
++N + +T W+D E LRDY N D+Y +Y+
Sbjct: 783 LKYNKKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 822
>gi|322792283|gb|EFZ16267.1| hypothetical protein SINV_02225 [Solenopsis invicta]
Length = 87
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/92 (45%), Positives = 61/92 (66%), Gaps = 5/92 (5%)
Query: 393 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 452
LEL +D++ +LA FLLG WL AK++A NEE+ + YE+NAR QIT+W N +
Sbjct: 1 LLELFDDLESILASGSNFLLGTWLTQAKEMADNEEERRSYEYNARNQITLWGPNGE---- 56
Query: 453 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 484
+RDY NK WSG++ DY+ PR ++ K + +S
Sbjct: 57 -IRDYANKQWSGVVADYFKPRWELFLKALEKS 87
>gi|315131339|emb|CBM69278.1| venom protein Ci-120 [Chelonus inanitus]
Length = 165
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 79/131 (60%), Gaps = 7/131 (5%)
Query: 357 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPW 415
D+TRQ+L A ++L + +++ D VF+ L +L D++ +L+ + FL+G W
Sbjct: 1 DVTRQSLQLIAEHVYLKLQQSFHQKDL-AVFKAHANLLMQLFSDLESILSTNKHFLVGKW 59
Query: 416 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 475
+++A+ L N +++K YE NAR QIT+W N + +RDY NK W+G++ Y+G R +
Sbjct: 60 IKNARSLGTNVQEQKLYELNARNQITLWGPNGE-----IRDYANKQWAGVMSQYFGARWS 114
Query: 476 IYFKYMIESLE 486
+Y + +LE
Sbjct: 115 LYLSVLEFALE 125
>gi|347541919|ref|YP_004856555.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-rat-Yit]
gi|346984954|dbj|BAK80629.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-rat-Yit]
Length = 912
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 116/544 (21%), Positives = 213/544 (39%), Gaps = 100/544 (18%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E G+ P+ F G P +
Sbjct: 364 MGNISSIGGELTPKWFEDRAKLSIDIQTRMIEFGIEPIHQMFIGYFPYKEN---SGVNVI 420
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ W +K R LD + I + ++Q + +G + + + D F E N
Sbjct: 421 RGSYWSKIKGPDR------LDFNNNDVEFISSVYYKKQKELFGESKY-FAGDLFHEGNNL 473
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
D E + + + + + +++W++Q W S P + + N + +
Sbjct: 474 YGYDPVELSNKVLKLL---IDNNGENSIWIIQSWSHS-------PSSETIEN-LNRNNTL 522
Query: 179 VLDLFAEVKPIWSTSKQFYGVPY----------IWCMLHNFAGNIEMYGILDSIAFGPVE 228
+LDL +++ W +F + + I+ +L+NF G +YG + +
Sbjct: 523 ILDLHSQLNTRWKGISKFNNMSWKDREFDRSNWIFGVLNNFGGRSGLYGHTRHLLNQFYD 582
Query: 229 ARTSENTTMA-------------------------------WINQYSVRRYGRSVPAIQD 257
A+ + N ++++Y RYG+S +
Sbjct: 583 AKYNSNYLKGVAHTSEGIGFNNFIDELVTEIIFSDKLDIDEFVSRYLRNRYGKSDNDLLK 642
Query: 258 AWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 312
A+N+L TVYN +GA++ VI A P +D +
Sbjct: 643 AFNILLDTVYNPVINIYHEGASES---VINARPSLD-----------------------V 676
Query: 313 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 372
KS S + H Y++ ++ AL ++ + NE S Y DLID+ + + +NE +
Sbjct: 677 KS-ASKWGSIHKNYNSEKLEEALRIYFSKYNEFKDSKGYMTDLIDIASEVIINLSNEYYK 735
Query: 373 NIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 431
N+ + Y N F+L S+RFL ++ +L ++ L ++ L ++ E
Sbjct: 736 NLQDYYN-NGEIEFFKLNSQRFLNMILLQANILYYNERKSLQKLIDKLDDLNYDDYFEDT 794
Query: 432 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG--D 489
N +T +T W+D E LRDY N + ++ Y R +F + E+ +G D
Sbjct: 795 LIINKKTILTTWYDKQVSEDDGLRDYANTDFYDIVGTLYYNRWKRFFDNIQENAVNGFYD 854
Query: 490 GFRL 493
+R
Sbjct: 855 DYRF 858
>gi|440799252|gb|ELR20307.1| AlphaN-acetylglucosaminidase, putative, partial [Acanthamoeba
castellanii str. Neff]
Length = 389
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 50/132 (37%), Positives = 68/132 (51%), Gaps = 19/132 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA----------- 49
M N+ GWGGPL +W Q LQKKI+ R GM PVLPAF+G VP A
Sbjct: 261 MGNIQGWGGPLDLAWRLAQAELQKKIVERQRMFGMLPVLPAFAGFVPEASVKFTLGRGGG 320
Query: 50 -----LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGR 104
++ ++P+A +T+ +W ++ Y L D L+ IG I +E+G
Sbjct: 321 CGEQGIKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG- 377
Query: 105 TSHIYNCDTFDE 116
T HIYN DTF+E
Sbjct: 378 TDHIYNADTFNE 389
>gi|326435733|gb|EGD81303.1| alpha-N-acetylglucosaminidase [Salpingoeca sp. ATCC 50818]
Length = 696
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 61/116 (52%), Gaps = 18/116 (15%)
Query: 6 GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW 65
G GGPLP W QQ LQ+ I+ R ELG+ +LPAF GNVPAAL ++P A I+ W
Sbjct: 246 GVGGPLPSQWYKQQWELQRAIVQRQTELGIGSLLPAFQGNVPAALAQLYPHANISN--GW 303
Query: 66 FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT-FDENTPP 120
LD DPLF I +++ + ++G T H Y D FD +T P
Sbjct: 304 --------------LDGLDPLFATIADLTMQELIADFGAT-HFYQADGFFDHSTGP 344
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 57/116 (49%), Gaps = 7/116 (6%)
Query: 134 IYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST 192
+Y+ M D A+W+ QGW+ D M ++VP G+LV+LD+ AE IW+
Sbjct: 500 VYTTMTKRDPHAIWVYQGWIWLDLDNAQGFSFMSGFTSAVPRGRLVILDMEAEFDEIWAW 559
Query: 193 SKQFYGVPYIWCMLHNFAGNIEMYGILDSI------AFGPVEARTSENTTMAWINQ 242
S+ F+ +IW + NF GN MYG + + F +A TM I+Q
Sbjct: 560 SQSFFNTTFIWAAMDNFGGNNGMYGDIQLVFDRTRRVFAQSDAVVGVGITMEGIDQ 615
>gi|84625359|ref|YP_452731.1| hypothetical protein XOO_3702 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84369299|dbj|BAE70457.1| truncated N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
MAFF 311018]
Length = 369
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 33/62 (53%), Positives = 43/62 (69%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQQWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QL 62
++
Sbjct: 261 RM 262
>gi|323456608|gb|EGB12475.1| hypothetical protein AURANDRAFT_20306 [Aureococcus anophagefferens]
Length = 243
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/64 (56%), Positives = 47/64 (73%), Gaps = 1/64 (1%)
Query: 5 HGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT-QLG 63
HG GGPLP+++ D QL L K+IL R+ LG+ PVLP+F GNVP AL+++FP A IT Q
Sbjct: 120 HGVGGPLPRTFADAQLALAKRILARMRGLGIVPVLPSFQGNVPPALKDLFPEANITVQAP 179
Query: 64 NWFS 67
+W S
Sbjct: 180 HWTS 183
>gi|390353486|ref|XP_003728120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Strongylocentrotus
purpuratus]
Length = 385
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 32/50 (64%), Positives = 38/50 (76%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL 50
M N+ GWGGP+PQSW QL LQ KIL R+ ELGM PVLPAF+G+VP +
Sbjct: 218 MGNIDGWGGPIPQSWHTNQLALQHKILKRMRELGMIPVLPAFAGHVPKSF 267
>gi|358381741|gb|EHK19415.1| hypothetical protein TRIVIDRAFT_224650 [Trichoderma virens Gv29-8]
Length = 217
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 76/158 (48%), Gaps = 14/158 (8%)
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL-G 175
TPP Y+ + + + ++S D +A+W+ Q WLF+ + FW +++ + +
Sbjct: 2 TPPSGELNYLRNASSNTWKALKSADPEAIWVFQAWLFAQNTTFWTNDRIEVYPGGITIDS 61
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LD++ E W ++ +Y P+IWC L N+ I MYG + ++ P+ A
Sbjct: 62 DMLILDIWLESMSQWQCAQSYYSKPWIWCELQNYGATINMYGQIQNLTKSPILALQESQ- 120
Query: 236 TMAWINQYSVRRYGRSVPAIQD---AWNVLYHTVYNCT 270
S+ G S+ A Q +++L +NCT
Sbjct: 121 --------SLVGLGLSMEAQQSNEIVFDLLLSQAWNCT 150
>gi|224135741|ref|XP_002322149.1| predicted protein [Populus trichocarpa]
gi|222869145|gb|EEF06276.1| predicted protein [Populus trichocarpa]
Length = 173
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 33/60 (55%), Gaps = 2/60 (3%)
Query: 477 YFKYMIESLESGD--GFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 534
++ + I++ E G F +K + K + R + PVESNG+AL S+WL+ KYL
Sbjct: 97 HYTFQIQNTEFGKWPRFPVKGLEKRMDKASKQLAESRKIIPVESNGNALNISRWLFYKYL 156
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.432
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,295,306,757
Number of Sequences: 23463169
Number of extensions: 401806658
Number of successful extensions: 854199
Number of sequences better than 100.0: 514
Number of HSP's better than 100.0 without gapping: 500
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 850859
Number of HSP's gapped (non-prelim): 1141
length of query: 542
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 394
effective length of database: 8,886,646,355
effective search space: 3501338663870
effective search space used: 3501338663870
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)