BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008173
(575 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225457148|ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
Length = 813
Score = 995 bits (2572), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/578 (81%), Positives = 518/578 (89%), Gaps = 4/578 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWF+V +PRWCCTYLLDATDPLFIEIG+AFI+QQLKEYGRT HIYNCDTFDENTPP
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPP 354
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 355 VDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVL 414
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYGILD++A GPVEARTSEN+TMVGV
Sbjct: 415 DLFAEVKPIWITSEQFYGVPYIWCMLHNFAGNIEMYGILDAVASGPVEARTSENSTMVGV 474
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNPVVYDLMSEMAFQH KVDVK WI YS RRYG+SVP IQDAWN+LYHTVY
Sbjct: 475 GMSMEGIEQNPVVYDLMSEMAFQHSKVDVKVWIALYSTRRYGKSVPEIQDAWNILYHTVY 534
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTE----GKYQNYGKPVSKEAVLKSETSSYDHPH 356
NCTDG+ DKNRDVIVAFPD+DPS I + G Y YGK VS+ VLK T+S++ PH
Sbjct: 535 NCTDGSYDKNRDVIVAFPDIDPSFIPTPKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPH 594
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
LWYSTSEV AL LFIASG +L SNTYRYDL+DLTRQALAKYAN+LFL +IEAYQLND
Sbjct: 595 LWYSTSEVKDALGLFIASGGQLLGSNTYRYDLVDLTRQALAKYANQLFLEVIEAYQLNDV 654
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
G S++FLELVEDMD LLACHDGFLLGPWLESAKQLAQ+E+QE Q+EWNARTQITMW
Sbjct: 655 RGAACHSQKFLELVEDMDTLLACHDGFLLGPWLESAKQLAQDEQQEIQFEWNARTQITMW 714
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
FDNT++EASLLRDYGNKYWSGLLRDYYGPRAAIYFKY++ESLE+G+ F LKDWRREWIKL
Sbjct: 715 FDNTEDEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYLLESLETGNEFALKDWRREWIKL 774
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFD 574
TNDWQN RN YPV S+G+A+ TS+ LYNKYLQ ++D
Sbjct: 775 TNDWQNSRNAYPVRSSGNAIDTSRRLYNKYLQDPEIYD 812
>gi|255540793|ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
gi|223550576|gb|EEF52063.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
Length = 809
Score = 995 bits (2572), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/576 (81%), Positives = 522/576 (90%), Gaps = 1/576 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGG LPQSW QQL+LQKKIL R+YELGMNPVLPAFSGNVPAAL+N+FPSAKI
Sbjct: 234 MGNLHRWGGSLPQSWFFQQLILQKKILARMYELGMNPVLPAFSGNVPAALRNIFPSAKIA 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWFSVKSD RWCCTYLLDATDPLFIEIGRAFIEQQL+EYG TSHIYNCDTFDENTPP
Sbjct: 294 RLGNWFSVKSDLRWCCTYLLDATDPLFIEIGRAFIEQQLEEYGSTSHIYNCDTFDENTPP 353
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD P+YIS+LGAA++ GMQSGD+DAVWLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 354 VDDPKYISALGAAVFKGMQSGDNDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGRLVVL 413
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DLFAEVKPIW++S QFYGVPYIWCMLHNFAGN+EMYGILDSIA GPVEARTSEN+TMVGV
Sbjct: 414 DLFAEVKPIWTSSYQFYGVPYIWCMLHNFAGNVEMYGILDSIASGPVEARTSENSTMVGV 473
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNPVVYDLMSEMAFQH+KVDVKAWIN YS RRYGRSVP+IQDAW++LYHTVY
Sbjct: 474 GMSMEGIEQNPVVYDLMSEMAFQHKKVDVKAWINLYSTRRYGRSVPSIQDAWDILYHTVY 533
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
NCTDGA DKNRDVIVAFPDV+P SV++ ++ GKPVS+ AVLK + SYDHPHLWYS
Sbjct: 534 NCTDGAYDKNRDVIVAFPDVNPFYFSVSQKRHHLNGKPVSRRAVLKENSDSYDHPHLWYS 593
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
TSEV+ ALELFI SG ELS S+TY YDL+DLTRQALAKY NELFL IIE+YQ ND +GV
Sbjct: 594 TSEVLHALELFITSGEELSGSSTYSYDLVDLTRQALAKYGNELFLKIIESYQANDGNGVA 653
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
S++FL+LVEDMD LL CH+GFLLGPWLESAKQLAQ++EQEKQ+EWNARTQITMWFDNT
Sbjct: 654 SRSQKFLDLVEDMDTLLGCHEGFLLGPWLESAKQLAQDQEQEKQFEWNARTQITMWFDNT 713
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
++EASLL DYGNKYWSGLL+DYYGPRAAIYFKY+I+SLE+G F LKDWRREWIKLTN+W
Sbjct: 714 EDEASLLHDYGNKYWSGLLQDYYGPRAAIYFKYLIKSLENGKVFPLKDWRREWIKLTNEW 773
Query: 541 QNGRNVYPVESNGDALITSQWLYNKYLQGTG-VFDH 575
Q RN +PV+SNG+ALI S+WLY+KYL+ +DH
Sbjct: 774 QRSRNKFPVKSNGNALIISKWLYDKYLRNPDTTYDH 809
>gi|224121634|ref|XP_002318632.1| predicted protein [Populus trichocarpa]
gi|222859305|gb|EEE96852.1| predicted protein [Populus trichocarpa]
Length = 812
Score = 981 bits (2537), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 456/575 (79%), Positives = 515/575 (89%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NLH WGGPLPQSW DQQLVLQKKIL R+YELGM PVLPAFSGNVPAAL+N+FPSAKIT
Sbjct: 238 MANLHRWGGPLPQSWFDQQLVLQKKILARMYELGMTPVLPAFSGNVPAALRNIFPSAKIT 297
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWFSV+SD RWCCTYLLDATDPLFIEIGRAFIEQQL EYG TSHIYNCDTFDENTPP
Sbjct: 298 RLGNWFSVRSDVRWCCTYLLDATDPLFIEIGRAFIEQQLTEYGSTSHIYNCDTFDENTPP 357
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLG +I+ GMQSGDS+AVWLMQGWLFSYDPFWRPPQ KALL+SVP+G+LVVL
Sbjct: 358 VDDPEYISSLGGSIFEGMQSGDSNAVWLMQGWLFSYDPFWRPPQTKALLHSVPIGRLVVL 417
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DLFAEVKPIW+TS+QFYGVPYIWCMLHNFAGN+EMYG LDS+A GPVEARTSEN+TMVGV
Sbjct: 418 DLFAEVKPIWNTSEQFYGVPYIWCMLHNFAGNLEMYGYLDSVASGPVEARTSENSTMVGV 477
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNPVVYDLMSEMAFQ KVDVK WI+ YS RRYGRSVP IQ+AWN+LYHTVY
Sbjct: 478 GMSMEGIEQNPVVYDLMSEMAFQKNKVDVKEWIDLYSARRYGRSVPTIQNAWNILYHTVY 537
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
NCTDGA DKNRDVIVAFPDV+P+++S+ +G++ K VS+ A L T SY+HPHLWYS
Sbjct: 538 NCTDGAYDKNRDVIVAFPDVNPNLVSMLQGRHHTDVKLVSRRAALIKNTDSYEHPHLWYS 597
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
T+EV+RALELFIA G+ELS S+TY YDL+DLTRQ LAKYANELFL +IEAY+L D+HGV
Sbjct: 598 TTEVVRALELFIAGGDELSGSSTYSYDLVDLTRQVLAKYANELFLKVIEAYRLKDSHGVA 657
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
S+ FL+LVED+D LLACH+GFLLGPWLESAKQLAQ+EEQ+ Q+EWNARTQITMW+DNT
Sbjct: 658 HQSQMFLDLVEDIDTLLACHEGFLLGPWLESAKQLAQDEEQQIQFEWNARTQITMWYDNT 717
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
+ EASLLRDYGNKYWSGLL+DYYGPRAAIYF ++ +SLE+G GF+LK WRREWIKLTN W
Sbjct: 718 EVEASLLRDYGNKYWSGLLKDYYGPRAAIYFNFLTQSLENGHGFQLKAWRREWIKLTNKW 777
Query: 541 QNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH 575
Q R ++PVESNG+AL S+WLY+KYL +DH
Sbjct: 778 QKSRKIFPVESNGNALNISRWLYHKYLGNPDTYDH 812
>gi|356519003|ref|XP_003528164.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
Length = 812
Score = 975 bits (2520), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/567 (80%), Positives = 512/567 (90%), Gaps = 1/567 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSW DQQL+LQKKIL R++ELGM PVLPAFSGNVPAAL+++FPSAKIT
Sbjct: 233 MGNLHGWGGPLPQSWFDQQLILQKKILARMFELGMTPVLPAFSGNVPAALKHIFPSAKIT 292
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWFSVK+D +WCCTYLLDATD LF+EIG+AFIE+QL+EYGRTSHIYNCDTFDENTPP
Sbjct: 293 RLGNWFSVKNDLKWCCTYLLDATDSLFVEIGKAFIEKQLQEYGRTSHIYNCDTFDENTPP 352
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLGAA + GMQSGD DAVWLMQGWLFSYDPFWRPPQMKALL+SVP+GKLVVL
Sbjct: 353 VDDPEYISSLGAATFKGMQSGDDDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGKLVVL 412
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYGILD+IA GP++ARTS N+TMVGV
Sbjct: 413 DLFAEVKPIWVTSEQFYGVPYIWCMLHNFAGNIEMYGILDAIASGPIDARTSNNSTMVGV 472
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAFQH+KVDVKAW++ YS RRYG+++P IQ+ WNVLYHT+Y
Sbjct: 473 GMSMEGIEQNPIVYDLMSEMAFQHKKVDVKAWVDMYSTRRYGQTLPLIQEGWNVLYHTIY 532
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
NCTDGA DKNRDVIVAFPDVDPS+ISV + + KP S ++K T S+D PHLWY
Sbjct: 533 NCTDGAYDKNRDVIVAFPDVDPSLISVQHEQSHHNDKPYSG-TIIKEITDSFDRPHLWYP 591
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
TSEVI ALELFI SG+ELS NTYRYDL+DLTRQ LAKYANELF +IEAYQ +D HG+
Sbjct: 592 TSEVIYALELFITSGDELSRCNTYRYDLVDLTRQVLAKYANELFFKVIEAYQSHDIHGMT 651
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
LS+RFL+LVED+D LLACHDGFLLGPWLESAKQLA NEEQE+Q+EWNARTQITMWFDN+
Sbjct: 652 LLSQRFLDLVEDLDTLLACHDGFLLGPWLESAKQLALNEEQERQFEWNARTQITMWFDNS 711
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
EEASLLRDYGNKYW+GLL DYYGPRAAIYFKY+ ESLESG+ F+L+ WRREWIKLTN+W
Sbjct: 712 DEEASLLRDYGNKYWNGLLHDYYGPRAAIYFKYLRESLESGEDFKLRGWRREWIKLTNEW 771
Query: 541 QNGRNVYPVESNGDALITSQWLYNKYL 567
Q RN++PVES+GDAL TS+WL+NKYL
Sbjct: 772 QKRRNIFPVESSGDALNTSRWLFNKYL 798
>gi|297733843|emb|CBI15090.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 958 bits (2477), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/611 (75%), Positives = 511/611 (83%), Gaps = 37/611 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWF+V +PRWCCTYLLDATDPLFIEIG+AFI+QQLKEYGRT HIYNCDTFDENTPP
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPP 354
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 355 VDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVL 414
Query: 181 DLFAEVKPIWSTSKQFYGVPYIW--------------------------------CMLHN 208
DLFAEVKPIW TS+QFYGVPYIW CMLHN
Sbjct: 415 DLFAEVKPIWITSEQFYGVPYIWKVTKSGRQQSLKFTNEKCCSFFRSHSPDSEVLCMLHN 474
Query: 209 FAGNIEMYGILDSIAFGPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 267
FAGNIEMYGILD++A GP+ R + +VGVGMSMEGIEQNPVVYDLMSEMAFQH KV
Sbjct: 475 FAGNIEMYGILDAVASGPILLRAKYAESAVVGVGMSMEGIEQNPVVYDLMSEMAFQHSKV 534
Query: 268 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 327
DVK WI YS RRYG+SVP IQDAWN+LYHTVYNCTDG+ DKNRDVIVAFPD+DPS I
Sbjct: 535 DVKVWIALYSTRRYGKSVPEIQDAWNILYHTVYNCTDGSYDKNRDVIVAFPDIDPSFIPT 594
Query: 328 TE----GKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT 383
+ G Y YGK VS+ VLK T+S++ PHLWYSTSEV AL LFIASG +L SNT
Sbjct: 595 PKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPHLWYSTSEVKDALGLFIASGGQLLGSNT 654
Query: 384 YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGF 443
YRYDL+DLTRQALAKYAN+LFL +IEAYQLND G S++FLELVEDMD LLACHDGF
Sbjct: 655 YRYDLVDLTRQALAKYANQLFLEVIEAYQLNDVRGAACHSQKFLELVEDMDTLLACHDGF 714
Query: 444 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 503
LLGPWLESAKQLAQ+E+QE Q+EWNARTQITMWFDNT++EASLLRDYGNKYWSGLLRDYY
Sbjct: 715 LLGPWLESAKQLAQDEQQEIQFEWNARTQITMWFDNTEDEASLLRDYGNKYWSGLLRDYY 774
Query: 504 GPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 563
GPRAAIYFKY++ESLE+G+ F LKDWRREWIKLTNDWQN RN YPV S+G+A+ TS+ LY
Sbjct: 775 GPRAAIYFKYLLESLETGNEFALKDWRREWIKLTNDWQNSRNAYPVRSSGNAIDTSRRLY 834
Query: 564 NKYLQGTGVFD 574
NKYLQ ++D
Sbjct: 835 NKYLQDPEIYD 845
>gi|449441031|ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
Length = 808
Score = 934 bits (2414), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/568 (76%), Positives = 500/568 (88%), Gaps = 2/568 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLPQSW DQQL+LQKK++ R++ELGM PVLPAFSGN+PAA + ++P+AKIT
Sbjct: 236 MGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKIT 295
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNWF+V SDPRWCCTYLLDA DPLF+EIG+AFIEQQ KEYGRTSH+YNCDTFDENTPP
Sbjct: 296 RLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPP 355
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD EYISSLG+AI+ GMQ+GDS+AVWLMQGW+FSYDPFWRP QMKALL+SVPLG+LVVL
Sbjct: 356 VDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVL 415
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKPIW +S+QFYG+PYIWCMLHNFAGN+EMYGILDSIA GP+EAR+S +TMVGV
Sbjct: 416 DLYAEVKPIWISSEQFYGIPYIWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGV 475
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNPVVYDLMSEMAFQH KVDVK W+ QYSVRRYG VP+IQDAW+VLYHTVY
Sbjct: 476 GMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVY 535
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
NCTDGA DKNRDVIVAFPDVDPS I V +G S +V + + +++D PHLWY
Sbjct: 536 NCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHLWYP 593
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF I++AYQL+D +
Sbjct: 594 TSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMA 653
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQYEWNARTQITMWFDNT
Sbjct: 654 SLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNT 713
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
+EEASLLRDYGNKYWSGLL DYY PRAAIY K++ ES E+G F L +WRREWIKLTNDW
Sbjct: 714 EEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDW 773
Query: 541 QNGRNVYPVESNGDALITSQWLYNKYLQ 568
Q+ R +YPVESNGDAL TS WLYNKYLQ
Sbjct: 774 QSSRKIYPVESNGDALDTSHWLYNKYLQ 801
>gi|326515664|dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 829
Score = 855 bits (2210), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/573 (68%), Positives = 479/573 (83%), Gaps = 6/573 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y GM+PVLPAFSGN+PAAL+ FPSAK+T
Sbjct: 239 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVT 298
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 299 HLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 358
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + MQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 359 LSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 418
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKP+W S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TMVGV
Sbjct: 419 DLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 478
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEM F H +VD+K W+ Y RRYG+SV +QDAW +L+ T+Y
Sbjct: 479 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLY 538
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAVLK-SETSSYDHP 355
NCTDG DKNRDVIVAFPDV+PS+I T G Y +NY +S+ V+K + +Y+ P
Sbjct: 539 NCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSKNYSTMLSENYVMKDAPNDAYEQP 597
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
H+WY T VI ALELF+ SG+E+S S+T+RYDL+DLTRQALAKYAN++FL II+ Y+ N+
Sbjct: 598 HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+ V L RFL LV+D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 658 VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQYEWNARTQITM 717
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFK++I SL+ + F L++WRREWI
Sbjct: 718 WFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFKHLISSLKKKEPFALEEWRREWIS 777
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
LTN+WQ+ R V+ + GDAL S+ L+ KYL+
Sbjct: 778 LTNNWQSDRKVFATTATGDALNISRALFTKYLR 810
>gi|222629680|gb|EEE61812.1| hypothetical protein OsJ_16433 [Oryza sativa Japonica Group]
Length = 1129
Score = 855 bits (2209), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/573 (68%), Positives = 473/573 (82%), Gaps = 6/573 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 537 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 596
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 597 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 656
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 657 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 716
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKPIW S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TMVGV
Sbjct: 717 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMVGV 776
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+S+ +QDAW +LY T+Y
Sbjct: 777 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSMVGLQDAWKILYQTLY 836
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 355
NCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y+HP
Sbjct: 837 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 895
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 896 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 955
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+ V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 956 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQITM 1015
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFKY+I S+E + F L++WRREWI
Sbjct: 1016 WFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIYFKYLILSMEKKEPFALEEWRREWIS 1075
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
LTN+WQ+ V+P + GDAL S+ LY KYL
Sbjct: 1076 LTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 1108
>gi|326519955|dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 829
Score = 854 bits (2206), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/573 (68%), Positives = 478/573 (83%), Gaps = 6/573 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y GM+PVLPAFSGN+PAAL+ FPSAK+T
Sbjct: 239 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGNIPAALKLKFPSAKVT 298
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPL++EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 299 HLGNWFTVDSNPRWCCTYLLDASDPLYVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 358
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + MQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 359 LSDPNYISSLGAATFRAMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 418
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKP W S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TMVGV
Sbjct: 419 DLYAEVKPAWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 478
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEM F H +VD+K W+ Y RRYG+SV +QDAW +L+ T+Y
Sbjct: 479 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLKVWVETYPTRRYGKSVVGLQDAWRILHQTLY 538
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAVLK-SETSSYDHP 355
NCTDG DKNRDVIVAFPDV+PS+I T G Y +NY +S+ V+K + +Y+ P
Sbjct: 539 NCTDGKNDKNRDVIVAFPDVEPSVIQ-TPGLYARTSKNYSTMLSENYVMKDAPNDAYEQP 597
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
H+WY T VI ALELF+ SG+E+S S+T+RYDL+DLTRQALAKYAN++FL II+ Y+ N+
Sbjct: 598 HIWYDTIAVIHALELFLESGDEVSDSSTFRYDLVDLTRQALAKYANQIFLKIIQGYKSNN 657
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+ V L RFL LV+D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 658 VNQVTTLCERFLNLVKDLDMLLASHEGFLLGPWLESAKGLARSQEQEIQYEWNARTQITM 717
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFK++I SL+ + F L++WRREWI
Sbjct: 718 WFDNTETKASLLRDYANKYWSGLLRDYYGPRAAIYFKHLISSLKKKEPFALEEWRREWIS 777
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
LTN+WQ+ R V+ + GDAL S+ L+ KYL+
Sbjct: 778 LTNNWQSDRKVFATTATGDALNISRALFTKYLR 810
>gi|218195716|gb|EEC78143.1| hypothetical protein OsI_17702 [Oryza sativa Indica Group]
Length = 829
Score = 850 bits (2196), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/573 (68%), Positives = 473/573 (82%), Gaps = 6/573 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 237 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 296
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 297 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 356
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 357 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 416
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKPIW S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TM+GV
Sbjct: 417 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMIGV 476
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+S+ +QDAW +LY T+Y
Sbjct: 477 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSIVGLQDAWKILYQTLY 536
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 355
NCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y+HP
Sbjct: 537 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 595
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 596 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 655
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+ V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 656 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQITM 715
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFKY+I S+E + F L++WRREWI
Sbjct: 716 WFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIYFKYLILSMEKKEPFALEEWRREWIS 775
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
LTN+WQ+ V+P + GDAL S+ LY KYL
Sbjct: 776 LTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 808
>gi|414585092|tpg|DAA35663.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 831
Score = 850 bits (2196), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/578 (67%), Positives = 476/578 (82%), Gaps = 6/578 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYNCDTFDENTPP
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYNCDTFDENTPP 359
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+GK++VL
Sbjct: 360 LSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGKMIVL 419
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKP+W S Q YGVPYIWCMLHNFA + EMYG+LD++A GP++AR S+N+TMVGV
Sbjct: 420 DLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGV 479
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+ V +QDAW +LY T+Y
Sbjct: 480 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLY 539
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHP 355
NCTDG DKNRDVIVAFPDV+P +I+ T G + N Y SK + K +S +Y+HP
Sbjct: 540 NCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHP 598
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T+ VI ALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+
Sbjct: 599 HLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNN 658
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+ V L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITM
Sbjct: 659 MNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITM 718
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
WFDNT+ +ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+ F LK+WRREWI
Sbjct: 719 WFDNTETKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWIS 778
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 573
LTN+WQ+ R V+ + GD L SQ LY KYL +
Sbjct: 779 LTNNWQSDRKVFSTTATGDPLNISQSLYTKYLSNADLL 816
>gi|38345908|emb|CAE04506.2| OSJNBb0059K02.16 [Oryza sativa Japonica Group]
Length = 829
Score = 850 bits (2196), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/573 (68%), Positives = 473/573 (82%), Gaps = 6/573 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 237 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 296
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 297 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 356
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMKALL+SVP+G+++VL
Sbjct: 357 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKALLHSVPVGRMIVL 416
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKPIW S QFYGVPYIWCMLHNFA + EMYG+LD +A GP++AR S N+TMVGV
Sbjct: 417 DLYAEVKPIWINSDQFYGVPYIWCMLHNFAADFEMYGVLDMVASGPIDARLSANSTMVGV 476
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+S+ +QDAW +LY T+Y
Sbjct: 477 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSMVGLQDAWKILYQTLY 536
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSYDHP 355
NCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y+HP
Sbjct: 537 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEYEHP 595
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+ N+
Sbjct: 596 HLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYKSNN 655
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+ V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 656 VNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQITM 715
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
WFDNT+ +ASLLRDY NKYWSGLLRDYYGPRAAIYFKY+I S+E + F L++WRREWI
Sbjct: 716 WFDNTKTKASLLRDYANKYWSGLLRDYYGPRAAIYFKYLILSMEKKEPFALEEWRREWIS 775
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
LTN+WQ+ V+P + GDAL S+ LY KYL
Sbjct: 776 LTNNWQSDWKVFPTTATGDALNISRTLYKKYLH 808
>gi|357166414|ref|XP_003580702.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
distachyon]
Length = 829
Score = 845 bits (2184), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/573 (67%), Positives = 479/573 (83%), Gaps = 6/573 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QL LQKKIL R+Y GM+PVLPAFSG++PAAL++ FPSAK+T
Sbjct: 238 MANMHGWGGPLPQTWLDDQLTLQKKILSRMYAFGMSPVLPAFSGSIPAALKSKFPSAKVT 297
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSH+YNCDTFDENTPP
Sbjct: 298 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGRTSHVYNCDTFDENTPP 357
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD DA+WLMQGWLF+YDPFW PPQMKALL+SVP+G+++VL
Sbjct: 358 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGRMIVL 417
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKP+W S QFYGVPYIWCMLHNFA + EMYG+LD++A GP++AR SEN+TMVGV
Sbjct: 418 DLYAEVKPVWINSDQFYGVPYIWCMLHNFAADFEMYGVLDAVASGPIDARLSENSTMVGV 477
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEM F H +VD++ W+ Y RRYG+S+ +QDAW +L+ T+Y
Sbjct: 478 GMSMEGIEQNPIVYDLMSEMVFHHRQVDLQVWVETYPTRRYGKSIVELQDAWRILHQTLY 537
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVLKSETS-SYDHP 355
NCTDG DKNRDVIVAFPDV+P +I T G + + K S K ++K E++ +Y+ P
Sbjct: 538 NCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLHTSASKMFSTMSAKSYLVKDESNDAYEQP 596
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T+ VIRAL+LF+ G+E+S S+T+RYDL+DLTRQALAKYAN++F II++Y+ N+
Sbjct: 597 HLWYDTNVVIRALQLFLQYGDEVSDSSTFRYDLVDLTRQALAKYANQIFAKIIQSYKSNN 656
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+ V LS FL+LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQITM
Sbjct: 657 MNQVTTLSECFLDLVNDLDMLLASHEGFLLGPWLESAKGLARDQEQEIQYEWNARTQITM 716
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
WFDNT+ +ASLLRDY NKYWSGLL DYYGPRAAIYFKY+I SLE + F L++WRREWI
Sbjct: 717 WFDNTETKASLLRDYANKYWSGLLGDYYGPRAAIYFKYLILSLEKKEPFALEEWRREWIS 776
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
LTN+WQ+ R V+ + GDAL ++ LY KYL+
Sbjct: 777 LTNNWQSDRKVFATAATGDALNIARSLYMKYLR 809
>gi|218192858|gb|EEC75285.1| hypothetical protein OsI_11626 [Oryza sativa Indica Group]
Length = 812
Score = 761 bits (1966), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/570 (61%), Positives = 442/570 (77%), Gaps = 2/570 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVP+ + +FPSA IT
Sbjct: 243 MGNLHGWGGPLSQNWLDQQLTLQKKILSRMIELGMVPVLPSFSGNVPSVFKKLFPSANIT 302
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 303 KLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQAFIRQQMKEYGDITNIYNCDTFNENTPP 362
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLG+AIY M G+ DAVWLMQGWLF D FW+ PQMKALL+SVP GK++V
Sbjct: 363 TNEPAYISSLGSAIYEAMSRGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPTGKMIV 422
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYGILDSIA GP++ARTS N+TMVG
Sbjct: 423 LDLFADVKPIWQMSSQFYGVPYIWCMLHNFGGNIEMYGILDSIASGPIDARTSHNSTMVG 482
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF+ +KV+V+ W+ YS RRYG+S ++ AW +LYHT+
Sbjct: 483 VGMCMEGIEHNPVVYELMSEMAFRSQKVEVEDWLKIYSYRRYGQSNVEVEKAWGILYHTI 542
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 358
YNCTDG D N+D IV FPD+ P+ S K + + + SE S S HPHLW
Sbjct: 543 YNCTDGIADHNKDYIVQFPDISPNSFSSDVSKRKAISEVKKHRRFVLSEVSASLPHPHLW 602
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
YST E I+ALELF+ +GN+LS S TYRYDL+DLTRQ+L+K ANE++L+ + AY+ D++G
Sbjct: 603 YSTKEAIKALELFLNAGNDLSKSLTYRYDLVDLTRQSLSKLANEVYLDAMNAYRKKDSNG 662
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+ +++FLEL+ D+D LLA D FLLGPWLE AK LA+ E + KQYEWNARTQ+TMW+D
Sbjct: 663 LNFYTKKFLELIVDIDTLLASDDNFLLGPWLEDAKSLARTENERKQYEWNARTQVTMWYD 722
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
NT+ E S L DY NK+WSGLL+ YY PRA+ YF + + L+ F+L++WR++WI +N
Sbjct: 723 NTKTEQSKLHDYANKFWSGLLKSYYLPRASKYFSRLTKGLQENQSFQLEEWRKDWIAYSN 782
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
+WQ+G+ +Y V++ GDAL S L+ KY +
Sbjct: 783 EWQSGKELYAVKATGDALAISSSLFKKYFR 812
>gi|413955691|gb|AFW88340.1| hypothetical protein ZEAMMB73_315381 [Zea mays]
Length = 814
Score = 758 bits (1958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/570 (61%), Positives = 440/570 (77%), Gaps = 2/570 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVPA +FPSA IT
Sbjct: 244 MGNLHGWGGPLSQNWLDQQLALQKKILSRMIELGMVPVLPSFSGNVPAIFAKLFPSANIT 303
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V ++P+WCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 304 RLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQAFIRQQIKEYGDVTNIYNCDTFNENTPP 363
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
D P YISSLG+AIY M G+ +AVWLMQGWLF D FW+ PQMKALL+SVP+GK++V
Sbjct: 364 TDEPAYISSLGSAIYEAMSRGNKNAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 423
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYGILDSI+ GP++ARTS N+TM+G
Sbjct: 424 LDLFADVKPIWKVSSQFYGVPYIWCMLHNFGGNIEMYGILDSISSGPIDARTSYNSTMIG 483
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF ++KV+V+ W+ YS RRYG++ I+ AW LYHT+
Sbjct: 484 VGMCMEGIEHNPVVYELMSEMAFHNKKVEVEDWLKTYSCRRYGQANADIEKAWRYLYHTI 543
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS-YDHPHLW 358
YNCTDG D N+D IV FPD+ PS ++ K + + SE S PHLW
Sbjct: 544 YNCTDGIADHNKDYIVEFPDISPSSVTYQVSKRRGMSITRNHRRFFLSEVSGILPQPHLW 603
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
YST E ++ALELF+ +G+ S S TYRYDL+DLTRQ L+K ANE++L+ I YQ D+HG
Sbjct: 604 YSTKEAVKALELFLDAGSTFSESLTYRYDLVDLTRQCLSKLANEVYLDAISLYQKKDSHG 663
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+ +R+FLE++ D+D LLA D FLLGPWLESAK LA E++ +QYEWNARTQ+TMW+D
Sbjct: 664 LNAHARKFLEIIVDIDTLLAADDNFLLGPWLESAKSLAITEKERQQYEWNARTQVTMWYD 723
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
NT+ E S L DY NK+WSGLL+ YY PRA+ YF Y+ SL+ F+L++WR++WI +N
Sbjct: 724 NTETEQSKLHDYANKFWSGLLKSYYLPRASKYFAYLTRSLQENRSFQLEEWRKDWISYSN 783
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
+WQ+G+ VY V++ GDAL ++ LY KYL+
Sbjct: 784 EWQSGKEVYAVKATGDALAIARSLYRKYLR 813
>gi|222624949|gb|EEE59081.1| hypothetical protein OsJ_10898 [Oryza sativa Japonica Group]
Length = 812
Score = 756 bits (1953), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/569 (61%), Positives = 439/569 (77%), Gaps = 2/569 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKKIL R+ ELGM PVLP+FSGNVP+ + +FPSA IT
Sbjct: 243 MGNLHGWGGPLSQNWLDQQLTLQKKILSRMIELGMVPVLPSFSGNVPSVFKKLFPSANIT 302
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 303 KLGDWNTVDGDPRWCCTYLLDPSDALFIDVGQAFIRQQMKEYGDITNIYNCDTFNENTPP 362
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLG+AIY M G+ DAVWLMQGWLF D FW+ PQMKALL+SVP GK++V
Sbjct: 363 TNEPAYISSLGSAIYEAMSRGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPTGKMIV 422
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYGILDSIA GP++ARTS N+TMVG
Sbjct: 423 LDLFADVKPIWQMSSQFYGVPYIWCMLHNFGGNIEMYGILDSIASGPIDARTSHNSTMVG 482
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF+ +KV+V+ W+ YS RRYG+S ++ AW +LYHT+
Sbjct: 483 VGMCMEGIEHNPVVYELMSEMAFRSQKVEVEDWLKIYSYRRYGQSNVEVEKAWGILYHTI 542
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 358
YNCTDG D N D IV FPD+ P+ S K + + + SE S S HPHLW
Sbjct: 543 YNCTDGIADHNNDYIVEFPDISPNSFSSDVSKRKAISEVKKHRRFVLSEVSASLPHPHLW 602
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
YST E I+ALELF+ +GN+LS S TYRYDL+DLTRQ+L+K ANE++L+ + AY+ D++G
Sbjct: 603 YSTKEAIKALELFLNAGNDLSKSLTYRYDLVDLTRQSLSKLANEVYLDAMNAYRKKDSNG 662
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+ +++FLEL+ D+D LLA D FLLGPWLE AK LA+ E + KQYEWNARTQ+TMW+D
Sbjct: 663 LNFYTKKFLELIVDIDTLLASDDNFLLGPWLEDAKSLARTENERKQYEWNARTQVTMWYD 722
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
NT+ E S L DY NK+WSGLL+ YY PRA+ YF + + L+ F+L++W ++WI +N
Sbjct: 723 NTKTEQSKLHDYANKFWSGLLKSYYLPRASKYFSRLTKGLQENQSFQLEEWTKDWIAYSN 782
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKYL 567
+WQ+G+ +Y V++ GDAL S L+ KY
Sbjct: 783 EWQSGKELYAVKATGDALAISSSLFKKYF 811
>gi|356534602|ref|XP_003535842.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max]
Length = 807
Score = 748 bits (1931), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/573 (61%), Positives = 429/573 (74%), Gaps = 6/573 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 230 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 289
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCTYLLD +DPLF+EIG AFI +Q+KEYG + IYNCDTF+EN+PP
Sbjct: 290 RLGDWNTVDGDPRWCCTYLLDPSDPLFVEIGEAFIRKQIKEYGDVTDIYNCDTFNENSPP 349
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ PEYIS+LGAA+Y G+ GD DAVWLMQGWLF D FW+PPQMKALL+SVP GK++V
Sbjct: 350 TNDPEYISNLGAAVYKGISKGDKDAVWLMQGWLFYSDSSFWKPPQMKALLHSVPFGKMIV 409
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S N+TMVG
Sbjct: 410 LDLFADVKPIWKNSFQFYGTPYIWCMLHNFGGNIEMYGTLDSISSGPVDARVSANSTMVG 469
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNP+VY+LMSEMAF+ +KV V WI Y RRYG+ + ++ AW +LYHT+
Sbjct: 470 VGMCMEGIEQNPIVYELMSEMAFRDKKVKVSEWIKSYCHRRYGKVIHQVESAWEILYHTI 529
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN---YGKPVSKEAVLKSET-SSYDHP 355
YNCTDG D N D IV FPD +PS SVT G N Y P L ET S
Sbjct: 530 YNCTDGIADHNHDFIVMFPDWNPSTNSVT-GTSNNQKIYLLPPGNRRYLFQETLSDMPQA 588
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY + +VI+AL+LF+A G L+ S TYRYDL+DLTRQ L+K AN+++ + +YQ +
Sbjct: 589 HLWYPSDDVIKALQLFLAGGKNLAGSLTYRYDLVDLTRQVLSKLANQVYHKAVTSYQKKN 648
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+ S +FL+L++D+D LLA D FLLG WLESAK+LA N + KQYEWNARTQ+TM
Sbjct: 649 IEALQFHSNKFLQLIKDIDVLLASDDNFLLGTWLESAKKLAVNPSEIKQYEWNARTQVTM 708
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
WFD + S L DY NK+WSGLL YY PRA+ YF ++ ESL D F+L +WR++WI
Sbjct: 709 WFDTNETTQSKLHDYANKFWSGLLESYYLPRASTYFSHLTESLRQNDKFKLIEWRKQWIS 768
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
+N WQ G +YPV++ GDAL SQ LY KY Q
Sbjct: 769 QSNKWQEGNELYPVKAKGDALTISQALYEKYFQ 801
>gi|357112065|ref|XP_003557830.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Brachypodium
distachyon]
Length = 809
Score = 747 bits (1928), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/571 (61%), Positives = 447/571 (78%), Gaps = 6/571 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL Q+WLD QL LQKKIL R+ ELGM PVLP+FSGNVP A + +FPSA IT
Sbjct: 240 MGNLHAWGGPLSQNWLDGQLALQKKILSRMTELGMVPVLPSFSGNVPVAFKKLFPSANIT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V DPRWCCTY+LD +D LFI++G AFI QQ+KEYG + IYNCDTF+ENTPP
Sbjct: 300 RLGEWNTVDGDPRWCCTYILDPSDALFIDVGHAFIRQQIKEYGDITSIYNCDTFNENTPP 359
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLG+AIY M SG+ DAVWLMQGWLF D FW+ PQMKALL+SVP+GK++V
Sbjct: 360 TNEPAYISSLGSAIYEAMSSGNKDAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 419
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKP+W S QFYGVPYIWCMLHNF GNIEMYGILDSI+ GP++ARTS +TMVG
Sbjct: 420 LDLFADVKPVWKMSSQFYGVPYIWCMLHNFGGNIEMYGILDSISSGPIDARTSYGSTMVG 479
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM+MEGIE NPVV++LMSEM+F+ +KV+V+ W+ YS RRYG+S I+ AW VLYHT+
Sbjct: 480 VGMTMEGIEHNPVVFELMSEMSFRSQKVEVEDWLKSYSYRRYGQSNVKIEKAWGVLYHTI 539
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA---VLKSETSSYDHPH 356
YNCTDG D NRD IV FPD+ PS S K + G P+ ++ L +++ HPH
Sbjct: 540 YNCTDGIADHNRDYIVEFPDMSPSSFSSHFSKQR--GMPIVRKHPRFFLSEVSANLPHPH 597
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
LWYST+E ++ALELF+ +GN+LS S T+RYDL+DLTRQ+L+K AN+++L+ +++Y+ ++
Sbjct: 598 LWYSTNEAVKALELFLNAGNDLSKSLTFRYDLVDLTRQSLSKLANKVYLDAMDSYKNKNS 657
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
G+ +++FLEL+ D+D LLA D FLLGPWLESAK LA +EE+ KQYEWNARTQ+TMW
Sbjct: 658 SGLNFHTKKFLELIVDIDILLASDDNFLLGPWLESAKSLAMSEEERKQYEWNARTQVTMW 717
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+DNT+ E S L DY NK+WSGLL++YY PRA+ YF + SL+ F+L++WRR+WI
Sbjct: 718 YDNTKTEQSHLHDYANKFWSGLLKNYYLPRASKYFTGLSRSLQENRSFQLEEWRRDWISY 777
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
+N+WQ+G +YPV++ GDAL S+ L+ KYL
Sbjct: 778 SNEWQSGEELYPVKAKGDALAISKSLFRKYL 808
>gi|297736304|emb|CBI24942.3| unnamed protein product [Vitis vinifera]
Length = 868
Score = 747 bits (1928), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/575 (61%), Positives = 436/575 (75%), Gaps = 9/575 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLD+QLVLQK+IL R+ ELGM PVLP+FSGNVP AL+ +FPSA IT
Sbjct: 294 MGNLHGWGGPLSQNWLDEQLVLQKQILCRMLELGMTPVLPSFSGNVPEALKKIFPSANIT 353
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI QQ+KEYG + IYNCDTF+EN+PP
Sbjct: 354 RLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQQIKEYGDVTDIYNCDTFNENSPP 413
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLGAAIY M GD D+VWLMQGWLF D FW+PPQMKALL+SVP GK+VV
Sbjct: 414 TNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSDSGFWKPPQMKALLHSVPFGKMVV 473
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMYGILD+++ GPV+AR S+N+TMVG
Sbjct: 474 LDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMYGILDAVSSGPVDARISKNSTMVG 533
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPV Y+LMSEMAF+ EKV + W+ YS RRYG++V ++ AW +LY T+
Sbjct: 534 VGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTYSYRRYGKAVHHVEAAWEILYRTI 593
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
YNCTDG D N D +V FPD DPS+ IS + Q + +L ETSS D
Sbjct: 594 YNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHIIQKILTQTGRRKILFQETSS-DL 652
Query: 355 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
P HLWYST EV+ AL LF+ +GNELS S+TYRYDL+DLTRQ L+K N+++L+ + A++
Sbjct: 653 PQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLVDLTRQVLSKLGNQVYLDAVIAFR 712
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
DA S++F++LV+D+D LLA D FLLG WLESAK+LA N + +QYEWNARTQ
Sbjct: 713 QKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWLESAKKLAVNPREMEQYEWNARTQ 772
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 532
+TMWF T+ S L DY NK+WSGLL +YY PRA++YF Y+ ++L F+L++WRRE
Sbjct: 773 LTMWFYVTKTNQSKLHDYANKFWSGLLENYYLPRASMYFSYLAKALTENKNFKLEEWRRE 832
Query: 533 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
WI +N WQ G+ +YPV + GD L S+ LY KY
Sbjct: 833 WISYSNKWQAGKELYPVRAKGDTLAISRALYEKYF 867
>gi|225450036|ref|XP_002273084.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera]
Length = 803
Score = 746 bits (1925), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/575 (61%), Positives = 436/575 (75%), Gaps = 9/575 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLD+QLVLQK+IL R+ ELGM PVLP+FSGNVP AL+ +FPSA IT
Sbjct: 229 MGNLHGWGGPLSQNWLDEQLVLQKQILCRMLELGMTPVLPSFSGNVPEALKKIFPSANIT 288
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V ++ RWCCTYLLDA+DPLFI+IG+AFI QQ+KEYG + IYNCDTF+EN+PP
Sbjct: 289 RLGEWNTVDNNTRWCCTYLLDASDPLFIQIGKAFIRQQIKEYGDVTDIYNCDTFNENSPP 348
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ P YISSLGAAIY M GD D+VWLMQGWLF D FW+PPQMKALL+SVP GK+VV
Sbjct: 349 TNDPAYISSLGAAIYKAMSQGDKDSVWLMQGWLFYSDSGFWKPPQMKALLHSVPFGKMVV 408
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+ KPIW TS QFYG PYIWCMLHNF GNIEMYGILD+++ GPV+AR S+N+TMVG
Sbjct: 409 LDLFADAKPIWRTSSQFYGTPYIWCMLHNFGGNIEMYGILDAVSSGPVDARISKNSTMVG 468
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPV Y+LMSEMAF+ EKV + W+ YS RRYG++V ++ AW +LY T+
Sbjct: 469 VGMCMEGIEQNPVAYELMSEMAFRSEKVQLVEWLKTYSYRRYGKAVHHVEAAWEILYRTI 528
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSI-----ISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
YNCTDG D N D +V FPD DPS+ IS + Q + +L ETSS D
Sbjct: 529 YNCTDGIADHNTDFMVNFPDWDPSLNPSSDISKEQHIIQKILTQTGRRKILFQETSS-DL 587
Query: 355 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
P HLWYST EV+ AL LF+ +GNELS S+TYRYDL+DLTRQ L+K N+++L+ + A++
Sbjct: 588 PQSHLWYSTHEVVNALRLFLDAGNELSKSSTYRYDLVDLTRQVLSKLGNQVYLDAVIAFR 647
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
DA S++F++LV+D+D LLA D FLLG WLESAK+LA N + +QYEWNARTQ
Sbjct: 648 QKDAKNFHLHSQKFVQLVKDIDTLLASDDNFLLGTWLESAKKLAVNPREMEQYEWNARTQ 707
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 532
+TMWF T+ S L DY NK+WSGLL +YY PRA++YF Y+ ++L F+L++WRRE
Sbjct: 708 LTMWFYVTKTNQSKLHDYANKFWSGLLENYYLPRASMYFSYLAKALTENKNFKLEEWRRE 767
Query: 533 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
WI +N WQ G+ +YPV + GD L S+ LY KY
Sbjct: 768 WISYSNKWQAGKELYPVRAKGDTLAISRALYEKYF 802
>gi|224106113|ref|XP_002314048.1| predicted protein [Populus trichocarpa]
gi|222850456|gb|EEE88003.1| predicted protein [Populus trichocarpa]
Length = 806
Score = 731 bits (1887), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/573 (59%), Positives = 430/573 (75%), Gaps = 6/573 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPAAL+ +FPSA IT
Sbjct: 233 MGNLHGWGGPLSQNWLDQQLCLQKQILSRMLELGMTPVLPSFSGNVPAALKKIFPSANIT 292
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V +PRWCCTYLL+ +DPLF+EIG AFI QQ+KEYG + IYNCDTF+EN+PP
Sbjct: 293 RLGDWNTVDKNPRWCCTYLLNPSDPLFVEIGEAFIRQQVKEYGDVTDIYNCDTFNENSPP 352
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
P YISSLGAA+Y M GD DAVWLMQGWLF D FW+PPQM+ALL+SVP GK++V
Sbjct: 353 TSDPAYISSLGAAVYKAMSRGDKDAVWLMQGWLFYSDSAFWKPPQMQALLHSVPFGKMIV 412
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE KPIW S QFYG PY+WC+LHNF GNIEMYGILD+I+ GPV+AR EN+TMVG
Sbjct: 413 LDLFAEAKPIWKNSSQFYGTPYVWCLLHNFGGNIEMYGILDAISSGPVDARIIENSTMVG 472
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF+ K V W+ YS RRYG++V + AW++LYHT+
Sbjct: 473 VGMCMEGIEHNPVVYELMSEMAFRSGKPQVLEWLKTYSRRRYGKAVRQVVAAWDILYHTI 532
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV-----SKEAVLKSETSSYDH 354
YNCTDG D N D IV FPD DPS+ S + Q+ + + ++ + + +S +
Sbjct: 533 YNCTDGIADHNTDFIVKFPDWDPSLHSGSNISEQDNMRILLTSSGTRRFLFQETSSDFPE 592
Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
HLWYST EVI+AL LF+ +GN+L+ S TYRYDL+DLTRQ L+K AN+++ + + A++
Sbjct: 593 AHLWYSTQEVIQALWLFLDAGNDLAGSPTYRYDLVDLTRQVLSKLANQVYRDAMIAFRRK 652
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 474
DA + ++FL++++D+D LLA D FLLG WLESAK+LA + K YEWNARTQ+T
Sbjct: 653 DARALNLHGQKFLQIIKDIDVLLASDDNFLLGTWLESAKKLAVDPNDMKLYEWNARTQVT 712
Query: 475 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 534
MW+D T+ S L DY NK+WSGLL DYY PRA+ YF ++++SLE F+L +WR+EWI
Sbjct: 713 MWYDTTKTNQSQLHDYANKFWSGLLEDYYLPRASTYFGHLMKSLEENKNFKLTEWRKEWI 772
Query: 535 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
+N WQ +YPV++ GDAL ++ LY KY
Sbjct: 773 AFSNKWQADTKIYPVKAKGDALAIAKALYRKYF 805
>gi|357458267|ref|XP_003599414.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488462|gb|AES69665.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 832
Score = 725 bits (1872), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/599 (57%), Positives = 428/599 (71%), Gaps = 33/599 (5%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353
Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
EYG + IYNCDTF+EN+PP P YIS+LGAA+Y G+ GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413
Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473
Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 273
EMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE NP+VY+LMSEMAF+ EKV + W+
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKINEWL 533
Query: 274 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 333
YS RRYG+++ + AW +LYHT+YN TDG D N D IV PD DPS +V G
Sbjct: 534 KSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHNHDYIVMLPDWDPS-AAVKSGMSN 592
Query: 334 NYGK-----PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDL 388
+ K P ++ + + + HLWY +VI+AL+LF+A G L S TYRYDL
Sbjct: 593 HQKKIYFLPPGNRRYLFQQTPAGMPQAHLWYPPEDVIKALQLFLAGGKNLKGSLTYRYDL 652
Query: 389 IDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPW 448
+DLTRQ L+K+AN++++ I ++Q + + S FLEL++D+D LLA D FLLG W
Sbjct: 653 VDLTRQVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELIKDIDLLLASDDNFLLGTW 712
Query: 449 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 508
L+SAK+LA N + KQYEWNARTQ+TMWFD + S L DY NK+WSG+L +YY PRA+
Sbjct: 713 LQSAKKLAVNPSELKQYEWNARTQVTMWFDTNETTQSKLHDYANKFWSGILENYYLPRAS 772
Query: 509 IYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
YF ++ ESL+ + F L +WR+EWI ++N WQ G +YPV++ GDAL SQ LY KY
Sbjct: 773 TYFSHLSESLKQNEKFNLTEWRKEWIPMSNKWQEGSELYPVKAKGDALTISQALYKKYF 831
>gi|357458271|ref|XP_003599416.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488464|gb|AES69667.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 807
Score = 720 bits (1858), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/594 (57%), Positives = 420/594 (70%), Gaps = 48/594 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353
Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
EYG + IYNCDTF+EN+PP P YIS+LGAA+Y G+ GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413
Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473
Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 273
EMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE NP+VY+LMSEMAF+ EKV + W+
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKINEWL 533
Query: 274 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 333
YS RRYG+++ + AW +LYHT+YN TDG D N D IV PD DPS + G Q
Sbjct: 534 KSYSHRRYGKAIHEVDAAWEILYHTIYNSTDGIADHNHDYIVMLPDWDPSAAVKSAGMPQ 593
Query: 334 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 393
HLWY +VI+AL+LF+A G L S TYRYDL+DLTR
Sbjct: 594 ---------------------AHLWYPPEDVIKALQLFLAGGKNLKGSLTYRYDLVDLTR 632
Query: 394 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
Q L+K+AN++++ I ++Q + + S FLEL++D+D LLA D FLLG WL+SAK
Sbjct: 633 QVLSKFANQVYIKAITSFQKKNIDALQLNSHMFLELIKDIDLLLASDDNFLLGTWLQSAK 692
Query: 454 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 513
+LA N + KQYEWNARTQ+TMWFD + S L DY NK+WSG+L +YY PRA+ YF +
Sbjct: 693 KLAVNPSELKQYEWNARTQVTMWFDTNETTQSKLHDYANKFWSGILENYYLPRASTYFSH 752
Query: 514 MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
+ ESL+ + F L +WR+EWI ++N WQ G +YPV++ GDAL SQ LY KY
Sbjct: 753 LSESLKQNEKFNLTEWRKEWIPMSNKWQEGSELYPVKAKGDALTISQALYKKYF 806
>gi|15240689|ref|NP_196873.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|9758035|dbj|BAB08696.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|19423948|gb|AAL87291.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|21436231|gb|AAM51254.1| putative alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
gi|332004545|gb|AED91928.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
Length = 806
Score = 719 bits (1855), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/576 (59%), Positives = 430/576 (74%), Gaps = 10/576 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL ++WLD QL+LQK+IL R+ + GM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMTPVLPSFSGNVPSALRKIYPEANIT 290
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L NW +V D RWCCTYLL+ +DPLFIEIG AFI+QQ +EYG ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIKQQTEEYGEITNIYNCDTFNENTPP 350
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEYISSLGAA+Y M G+ +AVWLMQGWLFS D FW+PPQ+KALL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQLKALLHSVPFGKMIV 410
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+AEVKPIW+ S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TMVG
Sbjct: 411 LDLYAEVKPIWNKSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPVVY+L SEMAF+ EKVDV+ W+ Y+ RRY + I+ AW +LYHTV
Sbjct: 471 VGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS-------KEAVLKSETSSY 352
YNCTDG D N D IV PD DPS SV + Q +S + + + +T+
Sbjct: 531 YNCTDGIADHNTDFIVKLPDWDPS-SSVQDDLKQKDSYMISTGPYETKRRVLFQDKTADL 589
Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
HLWYST EVI+AL+LF+ +G++LS S TYRYD++DLTRQ L+K AN+++ + A+
Sbjct: 590 PKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYTEAVTAFV 649
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
D + QLS +FLEL++DMD LLA D LLG WLESAK+LA+N ++ KQYEWNARTQ
Sbjct: 650 KKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQYEWNARTQ 709
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 532
+TMW+D+ S L DY NK+WSGLL DYY PRA +YF M++SL F+++ WRRE
Sbjct: 710 VTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRARLYFNEMLKSLRDKKIFKVEKWRRE 769
Query: 533 WIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKYL 567
WI +++ WQ + VYPV++ GDAL S+ L +KY
Sbjct: 770 WIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKYF 805
>gi|297807393|ref|XP_002871580.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
lyrata]
gi|297317417|gb|EFH47839.1| alpha-N-acetylglucosaminidase family [Arabidopsis lyrata subsp.
lyrata]
Length = 806
Score = 718 bits (1854), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/576 (59%), Positives = 434/576 (75%), Gaps = 10/576 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL ++WL+ QL+LQK+IL ++ +LGM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHTWGGPLSKNWLNDQLILQKQILSQMLKLGMTPVLPSFSGNVPSALRKIYPGANIT 290
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L NW +V D RWCCTYLL+ +DPLFI+IG AFI+QQ +EYG ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIDIGEAFIKQQPEEYGEITNIYNCDTFNENTPP 350
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEYISSLGAA+Y M G+ +AVWLMQGWLFS D FW+PPQMK LL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQMKVLLHSVPFGKMIV 410
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+AEVKPIW+TS QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TMVG
Sbjct: 411 LDLYAEVKPIWNTSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPVVY+L+SEMAF+ EKVDV+ W+ Y+ RRY + I+ AW +LYHTV
Sbjct: 471 VGMCMEGIEQNPVVYELISEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN-----YGKPVSKEAVLKSETSSYDH 354
YNCTDG D N D IV PD DPS E K+ + G +K VL + SS D
Sbjct: 531 YNCTDGIADHNTDFIVKLPDWDPSSSVQDESKHTDSYMISTGPYETKRRVLFQDKSS-DL 589
Query: 355 P--HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
P HLWYST EVI+AL+LF+ +G+ELS S TYRYD++DLTRQ L+K AN++++ + A+
Sbjct: 590 PKAHLWYSTKEVIQALKLFLEAGDELSRSLTYRYDMVDLTRQVLSKLANQVYIEAVTAFV 649
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
D + QLS +FLEL++D+D LLA D FLLG WLESAK+LA+N ++ KQYEWNARTQ
Sbjct: 650 KKDIGSLGQLSEKFLELIKDIDVLLASDDNFLLGTWLESAKKLARNGDERKQYEWNARTQ 709
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 532
+TMW+D+ S L DY NK WSGLL DYY PRA +YF M++SL F+++ W+RE
Sbjct: 710 VTMWYDSKDVNQSKLHDYANKLWSGLLEDYYLPRARLYFNEMLKSLRDKKKFKVEKWQRE 769
Query: 533 WIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKYL 567
WI +++ WQ + VYPV++ GDAL S+ L KY
Sbjct: 770 WIMMSHKWQQSSSEVYPVKAKGDALAISKHLLLKYF 805
>gi|449436325|ref|XP_004135943.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus]
Length = 774
Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/568 (59%), Positives = 411/568 (72%), Gaps = 24/568 (4%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL ++WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPA L +FPSA IT
Sbjct: 229 MGNLHGWGGPLSKNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAGLVEIFPSANIT 288
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNW S+ +DP CCTYLL+ +DPLF++IG AFI QQ+KEYG ++IY+CDTF+ENTPP
Sbjct: 289 KLGNWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTNIYSCDTFNENTPP 348
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ YISSLGA++Y M D DAVWLMQGWLF D FW+P QMKALL+SVP GK++V
Sbjct: 349 TNDTSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSDFWKPDQMKALLHSVPFGKMIV 408
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW +S QFYG PY+WCMLHNF GNIEMYGILD+I+ GPV+A SEN+TMVG
Sbjct: 409 LDLFADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVG 468
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF+ +KV V+ W+ YS RYG++ + AWN+LYHT+
Sbjct: 469 VGMCMEGIEHNPVVYELMSEMAFRSKKVQVQEWLKTYSRCRYGKADHYVDAAWNILYHTI 528
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCTDG + N D IV PD DPS S + K KP PHLWY
Sbjct: 529 YNCTDGIANHNTDFIVKLPDWDPS--STFDLK-----KP----------------PHLWY 565
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
ST EVI AL+L + + L S TYRYDL+DLTRQ L K ANE +L + A++ +
Sbjct: 566 STQEVINALQLLVNVDDNLVHSATYRYDLVDLTRQVLGKLANEEYLKAVTAFRRQNVKAQ 625
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
S+RF++L+ D+D LLA + FLLG WLESAK+LA N + KQYEWNARTQ+TMW+DN
Sbjct: 626 NLHSKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLATNPAEMKQYEWNARTQVTMWYDN 685
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
T+ S L DY NKYWSGLL YY PRA YF Y+ +SL + F L+DWRREWI +N
Sbjct: 686 TKVNQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWILFSNK 745
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKYL 567
WQ +YPV++ G+A+ S+ LY KY
Sbjct: 746 WQAASELYPVKAEGNAVAISKALYEKYF 773
>gi|414585093|tpg|DAA35664.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 721
Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/469 (68%), Positives = 391/469 (83%), Gaps = 6/469 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYNCDTFDENTPP
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYNCDTFDENTPP 359
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISSLGAA + GMQSGD+DA+WLMQGWLF+YDPFW PPQMKALL+SVP+GK++VL
Sbjct: 360 LSDPNYISSLGAATFRGMQSGDNDAIWLMQGWLFTYDPFWEPPQMKALLHSVPVGKMIVL 419
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEVKP+W S Q YGVPYIWCMLHNFA + EMYG+LD++A GP++AR S+N+TMVGV
Sbjct: 420 DLYAEVKPVWINSDQLYGVPYIWCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGV 479
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+ V +QDAW +LY T+Y
Sbjct: 480 GMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLY 539
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHP 355
NCTDG DKNRDVIVAFPDV+P +I+ T G + N Y SK + K +S +Y+HP
Sbjct: 540 NCTDGKNDKNRDVIVAFPDVEPFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHP 598
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
HLWY T+ VI ALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+
Sbjct: 599 HLWYDTNAVIHALELFLQHGDEVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNN 658
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ V L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE Q
Sbjct: 659 MNQVTILCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQ 707
>gi|449489156|ref|XP_004158231.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Cucumis sativus]
Length = 567
Score = 692 bits (1786), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/565 (58%), Positives = 407/565 (72%), Gaps = 24/565 (4%)
Query: 4 LHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLG 63
L WGGPL ++WLDQQL LQK+IL R+ ELGM PVLP+FSGNVPA L +FPSA IT+LG
Sbjct: 25 LKEWGGPLSKNWLDQQLALQKQILSRMRELGMTPVLPSFSGNVPAGLVEIFPSANITKLG 84
Query: 64 NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDS 123
NW S+ +DP CCTYLL+ +DPLF++IG AFI QQ+KEYG ++IY+CDTF+ENTPP +
Sbjct: 85 NWNSIDADPSTCCTYLLNPSDPLFVKIGEAFIRQQIKEYGDVTNIYSCDTFNENTPPTND 144
Query: 124 PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDL 182
YISSLGA++Y M D DAVWLMQGWLF D FW+P QMKALL+SVP GK++VLDL
Sbjct: 145 TSYISSLGASVYKAMVKADKDAVWLMQGWLFYSDSDFWKPDQMKALLHSVPFGKMIVLDL 204
Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGM 242
FA+VKPIW +S QFYG PY+WCMLHNF GNIEMYGILD+I+ GPV+A SEN+TMVGVGM
Sbjct: 205 FADVKPIWKSSSQFYGTPYVWCMLHNFGGNIEMYGILDAISSGPVDALASENSTMVGVGM 264
Query: 243 SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNC 302
MEGIE NPVVY+LMSEMAF+ +KV V+ W+ YS RYG++ + AWN+LYHT+YNC
Sbjct: 265 CMEGIEHNPVVYELMSEMAFRXQKVQVQEWLKTYSRCRYGKADHYVDAAWNILYHTIYNC 324
Query: 303 TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTS 362
TDG + N D IV PD DPS S + K KP PHLWYST
Sbjct: 325 TDGIANHNTDFIVKLPDWDPS--STFDLK-----KP----------------PHLWYSTQ 361
Query: 363 EVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQL 422
EVI AL+L + + L S TYRYDL+DLTRQ L K ANE +L + A++ +
Sbjct: 362 EVINALQLLVNVDDNLVHSATYRYDLVDLTRQVLGKLANEEYLKAVTAFRRQNVKAQNLH 421
Query: 423 SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQE 482
S+RF++L+ D+D LLA + FLLG WLESAK+LA N + KQYEWNARTQ+TMW+DNT+
Sbjct: 422 SKRFIQLIRDIDKLLASNSNFLLGTWLESAKKLATNPAEMKQYEWNARTQVTMWYDNTKV 481
Query: 483 EASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQN 542
S L DY NKYWSGLL YY PRA YF Y+ +SL + F L+DWRREWI +N WQ
Sbjct: 482 NQSKLHDYANKYWSGLLEGYYLPRALTYFYYLSKSLRKNESFHLEDWRREWILFSNKWQA 541
Query: 543 GRNVYPVESNGDALITSQWLYNKYL 567
+YPV++ G+A+ S+ LY KY
Sbjct: 542 ASELYPVKAEGNAVAISKALYEKYF 566
>gi|242035709|ref|XP_002465249.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
gi|241919103|gb|EER92247.1| hypothetical protein SORBIDRAFT_01g034960 [Sorghum bicolor]
Length = 777
Score = 687 bits (1773), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/569 (58%), Positives = 416/569 (73%), Gaps = 39/569 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQL LQKK+L R+ ELGM PVLP+FSGNVPA +FPSA IT
Sbjct: 244 MGNLHGWGGPLSQNWLDQQLALQKKVLSRMIELGMVPVLPSFSGNVPAVFAKLFPSANIT 303
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG+W +V ++P+WCCTYLLD +D LFI++G+AFI QQ+KEYG ++IYNCDTF+ENTPP
Sbjct: 304 LLGDWNTVDANPKWCCTYLLDPSDSLFIDVGQAFIRQQIKEYGDVTNIYNCDTFNENTPP 363
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
D P YISSLG+AIY M G+ +AVWLMQGWLF D FW+ PQMKALL+SVP+GK++V
Sbjct: 364 TDEPAYISSLGSAIYEAMSRGNKNAVWLMQGWLFYSDAAFWKEPQMKALLHSVPIGKMIV 423
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFA+VKPIW S QFYGVPYIWCMLHNF GNIEMYG+LDSI+ GP++ARTS N+TM+G
Sbjct: 424 LDLFADVKPIWKMSSQFYGVPYIWCMLHNFGGNIEMYGVLDSISSGPIDARTSYNSTMIG 483
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIE NPVVY+LMSEMAF ++KV+V+
Sbjct: 484 VGMCMEGIEHNPVVYELMSEMAFHNKKVEVE----------------------------- 514
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYDHPHLW 358
D N+D IV FPD+ PS IS K + + SE S S HPHLW
Sbjct: 515 --------DHNKDYIVEFPDISPSSISSQLSKRRGMSIMRNHRRFFLSEVSGSLPHPHLW 566
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
YST E I+ALELF+ +G+ S S TYRYDL+DLTRQ L+K ANE++L+ + +YQ D++G
Sbjct: 567 YSTKEAIKALELFLDAGSTFSKSLTYRYDLVDLTRQCLSKLANEVYLDAMSSYQKKDSNG 626
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+ +R+FLE++ D+D LLA D FLLGPWLESAK LA E++ +QYEWNARTQ+TMW+D
Sbjct: 627 LNSHTRKFLEIIMDIDTLLAADDNFLLGPWLESAKSLAITEKERQQYEWNARTQVTMWYD 686
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
NT+ E S L DY NK+WSGLL+ YY PRA+ YF Y+ SL+ F+L++WR++WI +N
Sbjct: 687 NTETEQSKLHDYANKFWSGLLKSYYLPRASKYFAYLTRSLQENQSFQLEEWRKDWISYSN 746
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKYL 567
+WQ+G+ VY V++ GDAL ++ LY KYL
Sbjct: 747 EWQSGKEVYAVKATGDALAIARSLYRKYL 775
>gi|168060822|ref|XP_001782392.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666123|gb|EDQ52786.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 801
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/567 (56%), Positives = 404/567 (71%), Gaps = 17/567 (2%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLPQ WLDQQL LQ KIL R+ ELGM PVLPAF+GNVPAA+ +PSA++T
Sbjct: 236 MGNLKRWGGPLPQKWLDQQLQLQIKILARMRELGMTPVLPAFAGNVPAAITKKYPSARVT 295
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W +V D R+CCT+LLD DPLF++IG+AFI QQ+KEYG T HIYNCDTF+EN PP
Sbjct: 296 RLGEWNTVNGDTRYCCTFLLDPKDPLFVDIGKAFILQQIKEYGGTQHIYNCDTFNENQPP 355
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P YIS+LG+ +Y M + D DA+WLMQ +YD FW+PPQMKALL+SVP+G++VVL
Sbjct: 356 TDDPSYISALGSIVYEAMSAADQDAIWLMQ----AYDKFWKPPQMKALLHSVPVGRMVVL 411
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DLFA+VKP+WS S FYGVPYIWCMLHNF GN+EMYG LD +A P++A TS N+TMVGV
Sbjct: 412 DLFADVKPMWSRSDHFYGVPYIWCMLHNFGGNVEMYGRLDVVATAPIQAVTSSNSTMVGV 471
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
GM MEGIEQNPVVYDLM+EMAF + V V+ WI +Y+ RRYG + AW +L+ ++Y
Sbjct: 472 GMCMEGIEQNPVVYDLMAEMAFHNATVVVEDWIEEYARRRYGELTAGARIAWKMLHESIY 531
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-HLWY 359
NC+DG D N DVIV FPD+DP Q+ G+ + HP H+WY
Sbjct: 532 NCSDGIADHNGDVIVEFPDIDPKRSLFQIRPRQSLGQQI------------LGHPQHIWY 579
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + AL+ ++S + L S YRYD++DLTRQ L+K AN+L +++ +++ + +
Sbjct: 580 SPQDAAVALQYLLSSADALGLSKPYRYDVVDLTRQVLSKLANQLHSQVLDQFRMFNVEKM 639
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+S R LEL+ DMD LL + FLLG WLESAK LA ++E+ K YEWNARTQITMWFDN
Sbjct: 640 DNISSRLLELLSDMDDLLGASEEFLLGTWLESAKDLATSDEERKLYEWNARTQITMWFDN 699
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
T ++ S L DY NK WSGL RDYY PRA+IY KY+ +SL F ++WRREWI LTN+
Sbjct: 700 TLDKPSPLHDYANKMWSGLTRDYYLPRASIYIKYLKQSLHENTSFAFQEWRREWIALTNE 759
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
WQ N+YP + GDAL + LY KY
Sbjct: 760 WQVASNLYPTVAKGDALEIATTLYEKY 786
>gi|4160292|emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum]
Length = 811
Score = 679 bits (1751), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/580 (56%), Positives = 421/580 (72%), Gaps = 13/580 (2%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL Q+WL+ QL LQK+IL R+ ELGM PVLP+FSGNVPAAL+ +FPSA IT
Sbjct: 231 MGNLHAWGGPLSQNWLNIQLALQKQILSRMRELGMTPVLPSFSGNVPAALKKIFPSANIT 290
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W +V DPRWCCT+LL +DPLFIEIG AFI +Q++EYG + IYNCDTF+ENTPP
Sbjct: 291 RLGDWNTVNGDPRWCCTFLLAPSDPLFIEIGEAFIRKQIEEYGDITDIYNCDTFNENTPP 350
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWL-MQGWLFSYD-PFWRPPQMKALLNSVPLGKLV 178
D P YI Q + WL + WLF D +W+ PQM+ALL+SVP GK++
Sbjct: 351 TDDPTYIHLSALLCTKQCQKQITMRCWLNARVWLFYSDSKYWKSPQMEALLHSVPRGKMI 410
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VLDLFA+VKPIW +S QFYG PYIWCMLHNF GNIEMYG+LD++A GP++ARTSEN+TMV
Sbjct: 411 VLDLFADVKPIWKSSSQFYGTPYIWCMLHNFGGNIEMYGVLDAVASGPIDARTSENSTMV 470
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GVGM MEGIE NPVVY+LMSEMAF+ + ++ W+ YS RRYG+ IQ AW++LYHT
Sbjct: 471 GVGMCMEGIEHNPVVYELMSEMAFREDNFQLQGWLKSYSHRRYGKVNDQIQAAWDILYHT 530
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPS-----IISVTEGKYQNYGKPVS-----KEAVLKSE 348
+YNCTDG D N+D IV FPD DPS IS T+ QN + ++ + + +
Sbjct: 531 IYNCTDGIADHNKDYIVEFPDWDPSGKTGTDISGTDSSSQNRMQKLAGFQWNRRFLFFEK 590
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
+SS P LWYST +V +AL+LFI + +LS S TYRYDL+DL+RQ+L+K AN+++L+ I
Sbjct: 591 SSSLPKPRLWYSTEDVFQALQLFIDALKKLSGSLTYRYDLVDLSRQSLSKLANQVYLDAI 650
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ-LAQNEEQEKQYEW 467
A++ DA + Q S +FL L++D+D LLA D FLLG WLE+ Q LA N +++KQYEW
Sbjct: 651 SAFRREDAKPLNQHSPKFLPLLQDIDRLLAADDNFLLGTWLENCPQNLAMNSDEKKQYEW 710
Query: 468 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 527
NARTQITMWFDNT+ S L DY NK+WSGLL YY PRA+IYF+ + +SL+ F+L+
Sbjct: 711 NARTQITMWFDNTKYNQSQLHDYANKFWSGLLEAYYLPRASIYFELLSKSLKEKVDFKLE 770
Query: 528 DWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
+WR+EWI +N WQ +YPV++ GDAL + L+ KY
Sbjct: 771 EWRKEWIAYSNKWQESTELYPVKAQGDALAIATALFEKYF 810
>gi|302786446|ref|XP_002974994.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
gi|300157153|gb|EFJ23779.1| hypothetical protein SELMODRAFT_102402 [Selaginella moellendorffii]
Length = 761
Score = 664 bits (1712), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/567 (55%), Positives = 406/567 (71%), Gaps = 4/567 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP+ WL+ QL+LQKKIL + LGM VLPAFSGNVP AL+ ++PSA IT
Sbjct: 192 MGNLHGWGGPLPEKWLELQLILQKKILHHMRSLGMIAVLPAFSGNVPRALKILYPSANIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W +V +P+WCCTYLL DPLFI+IG+AFIEQQ+KEYG T H+YNCDTF+EN PP
Sbjct: 252 RLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGKAFIEQQVKEYGSTQHVYNCDTFNENLPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
D P YIS+L A++Y M D A+WLMQGWLFS D FW+PPQMKALL++VP GK++V
Sbjct: 312 TDDPSYISALAASVYGAMIVADKQAIWLMQGWLFSSDAQFWKPPQMKALLHAVPFGKMIV 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAEV+PIWS S FYGVPYIWCMLHNF GN EMYG LD ++ GPV+A+TS N+TM+G
Sbjct: 372 LDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGGNHEMYGRLDVVSSGPVDAKTSANSTMIG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPVVY+LM+EMAF+ + +K W+N YS RRYG++VP +AW +L HT+
Sbjct: 432 VGMCMEGIEQNPVVYELMAEMAFRSTRNALKDWVNDYSTRRYGKAVPEALEAWQILSHTL 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNC+DG D N DVIV FPD++ S ++ T +Y +L +S+ HLWY
Sbjct: 492 YNCSDGLQDHNTDVIVKFPDLNASSLT-TLSRYLAEEAGTQTRRLLTEGLTSFG--HLWY 548
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+E AL + + + LS TYRYDL+DLTRQ L K AN++ L + ++ D +
Sbjct: 549 RPTEAKVALSYMLNASSSLSNVATYRYDLVDLTRQVLMKLANQIHLQALVSFVKGDLEEL 608
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ + +++D + LL ++GFLLGPWLESAK+L N +++ YEWNARTQ+TMWFDN
Sbjct: 609 TKNCDILIGIIKDSELLLRSNNGFLLGPWLESAKKLGTNSDEKHLYEWNARTQVTMWFDN 668
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
T+ S L DY NK WSGL DYY PRA++Y K ++++L + F WR WI LTN
Sbjct: 669 TRSLPSALHDYANKMWSGLFEDYYLPRASLYTKLLVKALHDKEPFPYGSWRSSWILLTNT 728
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
+QNG YP+E+ GD++ ++ L++KY
Sbjct: 729 FQNGTKNYPLEAAGDSIEIAKSLFSKY 755
>gi|302791289|ref|XP_002977411.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
gi|300154781|gb|EFJ21415.1| hypothetical protein SELMODRAFT_107285 [Selaginella moellendorffii]
Length = 761
Score = 662 bits (1707), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/567 (54%), Positives = 404/567 (71%), Gaps = 4/567 (0%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP+ WL+ QL+LQKKIL + LGM VLPAFSGNVP AL+ ++PSA IT
Sbjct: 192 MGNLHGWGGPLPEKWLELQLILQKKILHHMRSLGMIAVLPAFSGNVPRALKILYPSANIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W +V +P+WCCTYLL DPLFI+IG+AFIEQQ+KEYG T H+YNCDTF+EN PP
Sbjct: 252 RLPDWNTVDGNPQWCCTYLLQPMDPLFIQIGKAFIEQQVKEYGSTQHVYNCDTFNENLPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
D P YIS+L A++Y M D A+WLMQGWLFS D FW+PPQMKALL++VP GK++V
Sbjct: 312 TDDPSYISALAASVYGAMIVADKQAIWLMQGWLFSSDAQFWKPPQMKALLHAVPFGKMIV 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAEV+PIWS S FYGVPYIWCMLHNF GN EMYG LD ++ GPV+A+TS N+TM+G
Sbjct: 372 LDLFAEVRPIWSKSSHFYGVPYIWCMLHNFGGNHEMYGRLDVVSSGPVDAKTSANSTMIG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPVVY+LM+EMAF+ + +K W++ YS RRYG++VP +AW +L HT+
Sbjct: 432 VGMCMEGIEQNPVVYELMAEMAFRSTRNALKDWVDDYSTRRYGKAVPEALEAWQILSHTL 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNC+DG D N DVIV FPD++ S ++ G ++ + + TS HLWY
Sbjct: 492 YNCSDGLQDHNTDVIVKFPDLNASSLTTLSRYLAEEGGTQTRRLLTEGLTS---FGHLWY 548
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+E AL + + + LS TYRYDL+DLTRQ L K AN++ L + ++ D +
Sbjct: 549 RPTEAKVALSYMLNASSSLSNVATYRYDLVDLTRQVLMKLANQIHLQALVSFVKGDLEEL 608
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ + +++D + LL ++GFLLGPWLESAK+L N ++ YEWNARTQ+TMWFDN
Sbjct: 609 TKNCDILIGIIKDSELLLRSNNGFLLGPWLESAKKLGTNSDETNLYEWNARTQVTMWFDN 668
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
T+ S L DY NK WSGL DYY PRA++Y K ++++L + F WR WI LTN
Sbjct: 669 TRTLPSALHDYANKMWSGLFEDYYLPRASLYTKLLVKALHDKEPFPYDSWRSSWILLTNT 728
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
+QNG YP+E+ GD++ ++ L++KY
Sbjct: 729 FQNGTKNYPLEAAGDSIEIAKSLFSKY 755
>gi|326521470|dbj|BAK00311.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 520 bits (1339), Expect = e-145, Method: Compositional matrix adjust.
Identities = 248/421 (58%), Positives = 316/421 (75%), Gaps = 1/421 (0%)
Query: 149 MQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 207
+QGWLF D FW+ QMKALL+SVP+GK++VLDLFA+VKPIW TS QFYGVPYIWCMLH
Sbjct: 8 VQGWLFYSDAVFWKESQMKALLHSVPIGKMMVLDLFADVKPIWQTSSQFYGVPYIWCMLH 67
Query: 208 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 267
NF GNIEMYG+LDSI+ GPV+ARTS N+TMVGVGM MEGIE NPVVY+LMSEMAF+ +KV
Sbjct: 68 NFGGNIEMYGVLDSISSGPVDARTSYNSTMVGVGMCMEGIEHNPVVYELMSEMAFRSQKV 127
Query: 268 DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 327
V+ W+ YS RRYG+S IQ AW +LYHT+YNCTDG D N+D IV FPD+ PS S
Sbjct: 128 KVEDWLKTYSHRRYGQSNVEIQKAWGILYHTIYNCTDGIADHNKDYIVEFPDMSPSSFSS 187
Query: 328 TEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 387
K L ++S PHLWYST E I++LELF+ +GN+LS S TYRYD
Sbjct: 188 QYSKRSISLARKHPRFFLSEVSASLPQPHLWYSTEEAIKSLELFLNAGNDLSKSLTYRYD 247
Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
L+DLTRQ+L+K AN+++ + I +YQ D+ G+ ++ FLEL+ D+D LLA D FLLGP
Sbjct: 248 LVDLTRQSLSKLANKVYHDAISSYQKRDSSGLNFHTKEFLELIVDIDTLLASDDNFLLGP 307
Query: 448 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 507
WLESAK LA E++ KQYEWNARTQ+TMW+D+T+ E S L DY NK+WSGLL+ YY PRA
Sbjct: 308 WLESAKSLAMTEDERKQYEWNARTQVTMWYDDTKTEQSKLHDYANKFWSGLLKSYYLPRA 367
Query: 508 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
+ YF + SL+ F+L++WRR+WI +N+WQ+G+ +YPV++ GD+L S+ L+ KY
Sbjct: 368 SKYFSRLSRSLQENRSFQLEEWRRDWISYSNEWQSGKELYPVKAIGDSLAISRSLFTKYF 427
Query: 568 Q 568
+
Sbjct: 428 R 428
>gi|449518399|ref|XP_004166229.1| PREDICTED: alpha-N-acetylglucosaminidase-like, partial [Cucumis
sativus]
Length = 336
Score = 510 bits (1314), Expect = e-142, Method: Compositional matrix adjust.
Identities = 247/331 (74%), Positives = 281/331 (84%), Gaps = 2/331 (0%)
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VGVGMSMEGIEQNPVVYDLMSEMAFQH KVDVK W+ QYSVRRYG VP+IQDAW+VLYH
Sbjct: 1 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 60
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
TVYNCTDGA DKNRDVIVAFPDVDPS I V +G S +V + + +++D PHL
Sbjct: 61 TVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDS--SVDRLQDATFDRPHL 118
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY TSEVI AL+LFIA G++LS+SNTYRYDL+DLTRQALAKY+NELF I++AYQL+D
Sbjct: 119 WYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQ 178
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ LS+ FLELV D+D LLACH+GFLLGPWL+SAKQLA++EE+EKQYEWNARTQITMWF
Sbjct: 179 TMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWF 238
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
DNT+EEASLLRDYGNKYWSGLL DYY PRAAIY K++ ES E+G F L +WRREWIKLT
Sbjct: 239 DNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLT 298
Query: 538 NDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
NDWQ+ R +YPVESNGDAL TS WLYNKYLQ
Sbjct: 299 NDWQSSRKIYPVESNGDALDTSHWLYNKYLQ 329
>gi|414585094|tpg|DAA35665.1| TPA: hypothetical protein ZEAMMB73_337226 [Zea mays]
Length = 1202
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 248/431 (57%), Positives = 312/431 (72%), Gaps = 25/431 (5%)
Query: 142 DSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPY 201
DS+ W L + DP + V +GK+ + + E + + Y
Sbjct: 308 DSNPRWCCTYLLDASDPLF-----------VEIGKMFIEEQIRE----YGRTSHIYN--- 349
Query: 202 IWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMA 261
WCMLHNFA + EMYG+LD++A GP++AR S+N+TMVGVGMSMEGIEQNP+VYDLMSEMA
Sbjct: 350 -WCMLHNFAADFEMYGVLDALASGPIDARLSDNSTMVGVGMSMEGIEQNPIVYDLMSEMA 408
Query: 262 FQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD 321
F H +VD++ W+ Y RRYG+ V +QDAW +LY T+YNCTDG DKNRDVIVAFPDV+
Sbjct: 409 FHHRQVDLQVWVKTYPTRRYGKPVKGLQDAWWILYRTLYNCTDGKNDKNRDVIVAFPDVE 468
Query: 322 PSIISVTEGKYQN----YGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGN 376
P +I+ T G + N Y SK + K +S +Y+HPHLWY T+ VI ALELF+ G+
Sbjct: 469 PFVIA-TPGLHVNTRQMYSTVPSKNYIRKDVSSDAYEHPHLWYDTNAVIHALELFLQHGD 527
Query: 377 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 436
E+S SNT+RYDL+DLTRQ LAKYAN++FL IIE+Y+ N+ + V L + FL LV D+D L
Sbjct: 528 EVSDSNTFRYDLVDLTRQVLAKYANDVFLKIIESYKSNNMNQVTILCQHFLSLVNDLDTL 587
Query: 437 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 496
L+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWS
Sbjct: 588 LSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWS 647
Query: 497 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 556
GLL+DYYGPRAAIYFK+++ S+E+ F LK+WRREWI LTN+WQ+ R V+ + GD L
Sbjct: 648 GLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWISLTNNWQSDRKVFSTTATGDPL 707
Query: 557 ITSQWLYNKYL 567
SQ LY KYL
Sbjct: 708 NISQSLYTKYL 718
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 85/110 (77%), Positives = 101/110 (91%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQ+WLD QLVLQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 240 MANMHGWGGPLPQTWLDDQLVLQKKILSRMYSFGMFPVLPAFSGNIPAALKSKFPSAKVT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYN 110
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYGRTSHIYN
Sbjct: 300 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKMFIEEQIREYGRTSHIYN 349
>gi|156399499|ref|XP_001638539.1| predicted protein [Nematostella vectensis]
gi|156225660|gb|EDO46476.1| predicted protein [Nematostella vectensis]
Length = 675
Score = 473 bits (1218), Expect = e-131, Method: Compositional matrix adjust.
Identities = 237/569 (41%), Positives = 344/569 (60%), Gaps = 42/569 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+HGWGGPLP +W +L LQ KIL + GM PVLP F+G+VPA L ++P A ++
Sbjct: 142 MGNMHGWGGPLPSTWYGMKLNLQHKILAAMRNFGMTPVLPGFAGHVPAGLLRLYPKANVS 201
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + S +CCTYLL+ +DPLF +IG AFI++Q EYG T+HIYN DTF+E P
Sbjct: 202 KLGDWGNFNST--YCCTYLLEPSDPLFQKIGTAFIKEQTAEYG-TNHIYNADTFNEMRPR 258
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P Y+ + +A+Y GM GD DAVWLMQGWLF + FW+P Q+KALL+ VP G ++VL
Sbjct: 259 SSDPTYLGAASSAVYRGMAGGDPDAVWLMQGWLFVDEGFWKPDQIKALLHGVPQGFMIVL 318
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AE PIWS ++ FYG P+IWCML NF GNI ++G + S++ GP +A S N+TM+G
Sbjct: 319 DLWAENSPIWSRTQSFYGTPFIWCMLLNFGGNIGLFGNIKSVSTGPPKAFQSFNSTMIGT 378
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHE---KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G++MEGIEQN ++++LM+EM ++ E VD+ WI Y++RRYG + PAI AW +L
Sbjct: 379 GLTMEGIEQNDMMFELMNEMGYRLEPLNPVDLDNWIKDYALRRYGGTNPAIIQAWRLLIR 438
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VY C D + V P +D + P+L
Sbjct: 439 SVYQCNGYCADHIHSIFVWKPSLD-------------------------------NKPNL 467
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY +V A + ++ E T+RYDL+D+TRQAL ++ ++I AY+ A
Sbjct: 468 WYDPEDVFNAWDELRSTAAEFMHVETFRYDLVDVTRQALHLRVIPIYNDLISAYKNRSAL 527
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
V R LE+ +D+D LL + FLLG WL SAK L + YE+NAR QIT+W
Sbjct: 528 NVIHFGSRLLEMFDDLDSLLQTNRNFLLGRWLNSAKALGTTPAEVALYEFNARNQITLWG 587
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
+ E DY NK WSGL++ YY PR ++ M+ ++ G+ + ++++ ++
Sbjct: 588 PRGEIE-----DYANKMWSGLVKAYYKPRWELFIDEMVSAIAQGEELDYEAFKKKLLEQE 642
Query: 538 NDWQNGRNVYPVESNGDALITSQWLYNKY 566
W +G+ YP + +GD+L +++L+NK+
Sbjct: 643 TAWTHGKEEYPDQPSGDSLAAAEFLHNKW 671
>gi|384247107|gb|EIE20595.1| hypothetical protein COCSUDRAFT_37819 [Coccomyxa subellipsoidea
C-169]
Length = 762
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 246/570 (43%), Positives = 347/570 (60%), Gaps = 31/570 (5%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGPLPQS++D Q LQ+KI+ R+ ELGM+PV PAF+G VP AL P+A+I+
Sbjct: 199 MGNLRGYGGPLPQSYIDDQAELQRKIVRRMRELGMSPVFPAFAGFVPGALARERPAARIS 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTS-HIYNCDTFDENTP 119
+ NW S + R+CC +LLD +PLF EIG AF++ +EYG Y+ DTF+E TP
Sbjct: 259 RSDNWCSFPA--RYCCVHLLDPLEPLFQEIGSAFVKVLREEYGSDEVGFYSADTFNEMTP 316
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD--PFWRPPQMKALLNSVPLGKL 177
P P Y++S+ +AIY+ M + D A WLMQ WLF YD FW+PPQ++AL++ VP L
Sbjct: 317 PSSDPAYLTSVTSAIYNAMAAADPSARWLMQAWLF-YDNQKFWQPPQIQALVSGVPRDAL 375
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL+AEV P+W ++K F+G P+I+CMLHNF GNIEMYG L+++A GP E + +
Sbjct: 376 IMLDLYAEVFPLWKSTKSFFGAPFIYCMLHNFGGNIEMYGALEAVARGPAEGQIDGVAGL 435
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ-DAWNVLY 296
+G+GM EGIEQNPVVY+LMSE AF+ + V+V+ WI Y+ RRYG S P AW++L
Sbjct: 436 IGIGMCPEGIEQNPVVYELMSEWAFRRQPVEVEGWIEAYARRRYGNSTPPTALVAWDLLL 495
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
+VYN TDG TD +RD+ + P + P+ + + K PH
Sbjct: 496 RSVYNATDGHTDHSRDIPTSRPGLSPAEVGLWGLK-----------------------PH 532
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
LWY+ +V+ A L + S EL YRYDL+D+ RQ ++K A +++ + EAY +
Sbjct: 533 LWYNEQQVVDAWGLLLRSAGELQQVEGYRYDLVDVGRQVISKRATDIWKAVAEAYVDGRS 592
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
V + R L+L++D++ LLA + GFLLGP LE A E + + YEWN R Q+T+W
Sbjct: 593 IVVRREGARLLQLLDDLEELLATNRGFLLGPKLEEASSAGHTEAEARLYEWNLRKQLTVW 652
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+ S + DY N+ W+GL+ YY PR A++ + L G + + WR E +
Sbjct: 653 -GTSDTGGSEIEDYANREWAGLISSYYKPRWALWLLRLETDLAQGRRYDPEAWRMECLNF 711
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
T W R+ P+ GD SQ LY Y
Sbjct: 712 TLGWAYLRDQLPLHPQGDTGGVSQRLYEVY 741
>gi|14861378|gb|AAK73654.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
Length = 753
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 233/573 (40%), Positives = 341/573 (59%), Gaps = 51/573 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGW GPLP++W +QL +Q ++L R+ LGM VLPAF+G+VP + FP T
Sbjct: 212 MGNLHGWAGPLPRAWHLKQLYVQYRVLERMRSLGMITVLPAFAGHVPQGVLRAFPRVNAT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W D + CTYLLD DP+F IG F+++ +KE+G T HIY+ DTF+E P
Sbjct: 272 RLGGWSHF--DCTYSCTYLLDPEDPMFQVIGTLFLKELIKEFG-TDHIYSADTFNEMNPL 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S + +A++ M D AVWLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 329 SSDPAYLSRVSSAVFRSMTGADPKAVWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ ++ FYG P+IWCMLHNF GN ++G +++I GP AR N+TMVG
Sbjct: 389 LDLFAESRPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 448
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G+ EGIEQN +VY+LM+E+ ++ E +D+ +W+ +Y+ RRYG A AW +L +V
Sbjct: 449 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVARYAERRYGAPNAAAASAWQLLLRSV 508
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT + NR +V P L+ +T +WY
Sbjct: 509 YNCTGVCVNHNRSPLVRRPS-------------------------LRMDT------EVWY 537
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 413
+ S+V A L +++G EL +S T+ YDL D+TRQA + +E +L+I +A+Q L
Sbjct: 538 NKSDVYEAWRLLLSAGAELGSSPTFGYDLADVTRQAAQQLVSEYYLSIRQAFQSRSLPEL 597
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
A GV +L+ ++DGLL+ H FLLG WLESA+ +A ++ + +QYE NAR Q+
Sbjct: 598 LTAGGVL-----VYDLLPELDGLLSSHRLFLLGRWLESARAVATSDREAEQYELNARNQV 652
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 533
T+W N + DY NK GL+ DYYG R +++ ++ESL SG F + +
Sbjct: 653 TLWGPNGN-----ILDYANKQLGGLVLDYYGVRWSLFVSALVESLNSGSPFHQDQFNQAV 707
Query: 534 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
++ + + YP GD L S+ ++ KY
Sbjct: 708 FQVERGFIYNKKRYPTAPVGDTLEISKKIFLKY 740
>gi|390348210|ref|XP_785272.3| PREDICTED: alpha-N-acetylglucosaminidase [Strongylocentrotus
purpuratus]
Length = 793
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 224/569 (39%), Positives = 326/569 (57%), Gaps = 41/569 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLPQSW QL LQ +IL R+ +LGM PVLPAF+G+VP + VFP+A I+
Sbjct: 220 MGNIDGWGGPLPQSWHTNQLALQHQILKRMRDLGMIPVLPAFAGHVPXSFSKVFPNASIS 279
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG+W + P +CCT LLD DP+F ++G+AFI+ +E+ T HIY+ DTF+EN P
Sbjct: 280 NLGDW--GRFGPEYCCTSLLDPQDPMFKQVGKAFIDAMSEEFNGTDHIYSADTFNENKPK 337
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
Y+S+ +Y G+ GD VWLM GWLF FW P Q+KALL VP+G+++VL
Sbjct: 338 SRDSAYLSAASKGVYQGIIEGDPKGVWLMMGWLFQDTGFWGPTQIKALLQGVPIGRMIVL 397
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AE +P + T+ FYG P+IWCMLHNF GN +YG LD++ GP EAR +N+TM+G+
Sbjct: 398 DLYAEARPFYKTTYSFYGQPFIWCMLHNFGGNTGLYGKLDAVNQGPFEARNYDNSTMIGM 457
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPAIQDAWNVLYH 297
G + EGI QN V+Y+ +++M ++ +V WI QY+ RRY +AW +L
Sbjct: 458 GTTPEGIFQNYVMYNFLTDMTWRSGSTNVSKWIEQYAGRRYSNDPNKSEEATEAWVILKE 517
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
TVYN T D Y PV + + + + +
Sbjct: 518 TVYNNTGTLQD------------------------HQYAVPVRRPSNIMTSP-------V 546
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY ++V +A E + + +L S +RYDL+D+TR L A + ++ ++++ +A
Sbjct: 547 WYDYTKVAKAWEFLLEASTKLGTSPVFRYDLVDVTRNVLQDLAFDFQQKLMVSFRIRNAG 606
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
V L+ DMD + + H+ +LLG WLE AK LA N ++E YE+NA+ QIT+W
Sbjct: 607 AVGGNGTLLCNLILDMDNITSSHEDWLLGTWLEDAKSLATNNDEESLYEYNAKNQITIW- 665
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
+EE + DY NK W GLLR YY R +Y +Y+ E ++S + +
Sbjct: 666 -GPKEE---ILDYANKQWGGLLRTYYHRRWQLYVQYLEECIQSHQPYDQNTFNVRSFVAE 721
Query: 538 NDWQNGRNVYPVESNGDALITSQWLYNKY 566
++W + + +P E GD + S+ LY KY
Sbjct: 722 SEWTHSKEKFPTEPVGDTMAISKALYVKY 750
>gi|14861380|gb|AAK73655.1| lysosomal alpha-N-acetyl glucosaminidase [Dromaius novaehollandiae]
Length = 753
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 233/573 (40%), Positives = 341/573 (59%), Gaps = 51/573 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGW GPLP++W +QL +Q ++L R+ LGM VLPAF+G+VP + FP T
Sbjct: 212 MGNLHGWAGPLPRAWHLKQLYVQYRVLERMRSLGMITVLPAFAGHVPQGVLRAFPRVNAT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W D + CTYLLD DP+F IG F+++ +KE+G T HIY+ DTF+E P
Sbjct: 272 RLGGWSHF--DCTYSCTYLLDPEDPMFQVIGTLFLKELIKEFG-TDHIYSADTFNEMNPL 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S + +A++ M D AVWLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 329 SSDPAYLSRVSSAVFRSMTGADPKAVWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ ++ FYG P+IWCMLHNF GN ++G +++I GP AR N+TMVG
Sbjct: 389 LDLFAESRPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 448
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G+ EGIEQN +VY+LM+E+ ++ E +D+ +W+ +Y+ RRYG A AW +L +V
Sbjct: 449 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVARYAERRYGAPNAAAASAWXLLLRSV 508
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT + NR +V P L+ +T +WY
Sbjct: 509 YNCTGVCVNHNRSPLVRRPS-------------------------LRMDT------EVWY 537
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 413
+ S+V A L +++G EL +S T+ YDL D+TRQA + +E +L+I +A+Q L
Sbjct: 538 NKSDVYEAWRLLLSAGAELGSSPTFGYDLADVTRQAAQQLVSEYYLSIRQAFQSRSLPEL 597
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
A GV +L+ ++DGLL+ H FLLG WLESA+ +A ++ + +QYE NAR Q+
Sbjct: 598 LTAGGVL-----VYDLLPELDGLLSSHRLFLLGRWLESARAVATSDREAEQYELNARNQV 652
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 533
T+W N + DY NK GL+ DYYG R +++ ++ESL SG F + +
Sbjct: 653 TLWGPNGN-----ILDYANKQLGGLVLDYYGVRWSLFVSALVESLNSGSPFHQDQFNQAV 707
Query: 534 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
++ + + YP GD L S+ ++ KY
Sbjct: 708 FQVERGFIYNKKRYPTAPVGDTLEISKKIFLKY 740
>gi|375144105|ref|YP_005006546.1| alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
gi|361058151|gb|AEV97142.1| Alpha-N-acetylglucosaminidase [Niastella koreensis GR20-10]
Length = 735
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 231/568 (40%), Positives = 334/568 (58%), Gaps = 40/568 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLP SW+ LQ+KIL R ELGM PVLPAF+G+VP A + +P+AK+
Sbjct: 201 MGNLDAWGGPLPLSWMKSHKALQEKILQRERELGMKPVLPAFTGHVPPAFKKKYPNAKL- 259
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + +D TY+LD+ DPLF E+G+ F+++Q +G T H+Y+ DTF+EN PP
Sbjct: 260 KATNWTNGFAD-----TYILDSQDPLFAEMGKRFLQKQTSLFG-TDHLYSADTFNENEPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
D P ++S+L A IY GM+ D+ A W+MQGWLF D FW+ PQ++ALL +VP K+++
Sbjct: 314 SDDPAFLSALSARIYEGMKQADTAATWVMQGWLFYSDRKFWKAPQIEALLKAVPDNKMIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMV 238
LDL AE++P+W + FYG P+IW MLHNF GN+ ++G +D +A P E + + +
Sbjct: 374 LDLAAEIEPVWKRTDAFYGKPWIWNMLHNFGGNVNLFGRMDGVATQPAETLNDKASGKLW 433
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ME IEQNPV+Y+LM+ +Q VD+ AWI QY + RY + + DAW +L T
Sbjct: 434 GIGLTMEAIEQNPVMYELMTRHTWQTTPVDLDAWIPQYVLNRYRTNNTNLVDAWQILRKT 493
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYN GA + D SII+ G+P + + T PH
Sbjct: 494 VYN---GAVIR---------DGAESIIT---------GRPTFDSTTVWTRTKLNYAPH-- 530
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
E++ A +LF+ + + S+ ++YDL+D+TRQ LA YA L + A+ D+
Sbjct: 531 ----ELLPAWDLFVQAAGKGVNSDGFQYDLVDVTRQVLANYAAPLQKKWVTAFNAKDSAA 586
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+ S+ FL+L+ DMD LLA F+LGPWL +A+ ++ YE NAR IT+W D
Sbjct: 587 FNKYSKAFLQLISDMDLLLASRKDFMLGPWLSAARSNGTTPAEKALYEQNARDLITLWGD 646
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
S L +Y N+ WSGLL D+Y PR +F + +SL +G LK +
Sbjct: 647 AN----SPLHEYSNRQWSGLLNDFYKPRWQQFFTLLQQSLRTGSTPDLKQFEENIRSWEW 702
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKY 566
W N + YPV +G+++ +Q LY KY
Sbjct: 703 KWVNTQKAYPVVPSGNSVQVAQMLYKKY 730
>gi|357458269|ref|XP_003599415.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
gi|355488463|gb|AES69666.1| Alpha-N-acetylglucosaminidase [Medicago truncatula]
Length = 539
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 199/296 (67%), Positives = 231/296 (78%), Gaps = 27/296 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPL Q+WLDQQLVLQK+I+ R+ ELGM PVLP+FSGNVPAAL +FPSAKIT
Sbjct: 234 MGNLHGWGGPLSQNWLDQQLVLQKQIISRMLELGMTPVLPSFSGNVPAALTKIFPSAKIT 293
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK-------------------- 100
+LG+W +V +DPRWCCTYLLD +DPLF+EIG AFI +Q+K
Sbjct: 294 RLGDWNTVDADPRWCCTYLLDPSDPLFVEIGEAFIRKQIKATETIHQESEDLGSLIIMDR 353
Query: 101 ------EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
EYG + IYNCDTF+EN+PP P YIS+LGAA+Y G+ GD DAVWLMQGWLF
Sbjct: 354 AVRLDDEYGDVTDIYNCDTFNENSPPTSDPAYISTLGAAVYQGISKGDKDAVWLMQGWLF 413
Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW+PPQMKALL SVP GK++VLDLFA+VKPIW TS QFYG PYIWCMLHNF GNI
Sbjct: 414 YSDSSFWKPPQMKALLQSVPSGKMIVLDLFADVKPIWKTSFQFYGTPYIWCMLHNFGGNI 473
Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV 269
EMYG+LD+IA GPV+AR SEN+TMVGVGM MEGIE NP+VY+LMSEMAF+ EKV +
Sbjct: 474 EMYGVLDAIASGPVDARVSENSTMVGVGMCMEGIEHNPIVYELMSEMAFRDEKVKI 529
>gi|348533253|ref|XP_003454120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oreochromis
niloticus]
Length = 845
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 218/570 (38%), Positives = 343/570 (60%), Gaps = 44/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL + GPLPQSW QL LQ KIL R+ GM PVLPAFSGN+P + ++P A++T
Sbjct: 308 MANLFKFAGPLPQSWHVNQLYLQFKILERMRSFGMIPVLPAFSGNIPKGILRLYPEARVT 367
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W + C+ +LD DPLF IG ++ Q LK++G T HIY+ DTF+E TPP
Sbjct: 368 RLGPWSHFNCS--YSCSLVLDPQDPLFHHIGSLYLSQVLKQFG-TDHIYSTDTFNEMTPP 424
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ ++++ M + D AVWLMQGWLF D FW+P Q++ALL+ VPLG+++V
Sbjct: 425 SSDPAYLSAVSRSVFASMTAVDPQAVWLMQGWLFFSDAAFWKPAQIQALLHGVPLGRMIV 484
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +PI+S ++ FYG P+IWCML NF GN ++G ++SI GP +A N+T+VG
Sbjct: 485 LDLFAETEPIFSYTESFYGQPFIWCMLQNFGGNSGLFGTVESINSGPFKALHFPNSTLVG 544
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+GM+ EGIEQNPV Y+LMSE+A++ E V++ W++ Y++RRYG + ++ AW +L+ ++
Sbjct: 545 IGMTPEGIEQNPVTYELMSELAWRKEPVNLAKWVSLYAIRRYGNTQESLTTAWRLLFASI 604
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYG-KPVSKEAVLKSETSSYDHPHLW 358
YNCTD Y+N+ P+ + + T LW
Sbjct: 605 YNCTDP-------------------------HYRNHNHSPLVRRPSFQMNTG------LW 633
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y +++ +A +L + + L + T+RYDL+D+TR+ L + +I +A++ +
Sbjct: 634 YDPADLYKAWKLIMDAAPSLMSKETFRYDLVDVTREVLQVLTTSFYRDIADAFKKQNLSE 693
Query: 419 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + +L+ +++ LL+ + FLLG WLE A+ LA ++++ + Y+ NAR QIT+W
Sbjct: 694 LLTAGGVLVYDLLPELNRLLSSNRNFLLGAWLERARSLAVDDKEAQLYDMNARNQITLWG 753
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
+ + + DY +K W GL+ DYY R ++ + ++E L SG F+ + + ++
Sbjct: 754 PSGE-----ILDYASKEWGGLMEDYYAQRWGLFVQTLVECLNSGQPFKQAAFNQAVFQIE 808
Query: 538 NDW-QNGRNVYPVESNGDALITSQWLYNKY 566
+ NGR YP + GD + ++ KY
Sbjct: 809 KGFIYNGRK-YPTKPQGDTYEIAYRIFLKY 837
>gi|410930376|ref|XP_003978574.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Takifugu rubripes]
Length = 751
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 222/568 (39%), Positives = 343/568 (60%), Gaps = 40/568 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLPQSW QL LQ KIL ++ GM PVLPAFSGN+P + +FP A++T
Sbjct: 214 MGNMFKFGGPLPQSWHVNQLYLQFKILAQMRSFGMIPVLPAFSGNIPKGILRLFPEARVT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W K + + C+Y+LD DPLF IG ++ Q +K++G T+HIYN DTF+E TPP
Sbjct: 274 RLEPW--SKFNCSFSCSYILDPRDPLFSRIGSLYLSQVVKQFG-TNHIYNTDTFNEMTPP 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ A+++ M + D AVWLMQGWLF D FW+P Q++ALLN VP+G+++V
Sbjct: 331 SSEPTYLSAVSRAVFASMTAVDPQAVWLMQGWLFLSDALFWKPAQIQALLNGVPVGRMIV 390
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S ++ FYG P+IWCMLHNF GN +G ++SI GP +A N+++VG
Sbjct: 391 LDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNGGFFGTVESINTGPFKALHFPNSSLVG 450
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+GM+ EGIEQNPVVY+LMSE+A++ E V++ W++ Y RRYG ++ AW +L+ +V
Sbjct: 451 IGMTPEGIEQNPVVYELMSELAWRKEPVNLLKWVSLYVTRRYGSMHESVSAAWKILFASV 510
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT P Y+N+ + L S + + LWY
Sbjct: 511 YNCT-------------LP------------HYRNH-----NHSPLVRRPSFHMNSELWY 540
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+++ RA +L + + + T++YDL+D+TRQ + + +I++A+Q + +
Sbjct: 541 DPADLYRAWKLILEAAPSFMSKETFQYDLVDVTRQVMQVLTTSYYQDIVDAFQKHKMQEL 600
Query: 420 FQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
L +L+ +++ LL+ + FLLG WLE A+ LA +E + K Y+ NAR Q+T+W
Sbjct: 601 LTAGGVLLYDLLPELNRLLSSNHNFLLGTWLEQARSLALDEREAKLYDINARNQLTLWGP 660
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
+ + + DY NK W GL++DYY R ++ ++E L+SG F+ ++ + ++
Sbjct: 661 SGE-----ILDYANKQWGGLMQDYYAQRWGLFIHTLVECLDSGQPFKQDNFNKVVFQVEK 715
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKY 566
+ R YP + GD + ++ KY
Sbjct: 716 GFIYNRRQYPTKPQGDTFEIAHRIFLKY 743
>gi|443691318|gb|ELT93213.1| hypothetical protein CAPTEDRAFT_144379, partial [Capitella teleta]
Length = 718
Score = 420 bits (1080), Expect = e-114, Method: Compositional matrix adjust.
Identities = 223/549 (40%), Positives = 329/549 (59%), Gaps = 42/549 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL +W QQ++LQ +IL R+ +LGM P LPAF+G+VPA + +FP K++
Sbjct: 188 MGNMRGWGGPLSTNWHHQQILLQHRILKRMRDLGMTPALPAFAGHVPANITRLFPRVKVS 247
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W S +CCT LLD DPLF EIG+AFI++ +E+G T H+YN DTF+E TP
Sbjct: 248 KLGDWGRFNST--YCCTTLLDVEDPLFKEIGKAFIDEYTREFG-TDHVYNTDTFNEMTPA 304
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P Y++ G A+YSGM S DS A+WLMQGWLF D FW+PPQ KALL SVP GK++VL
Sbjct: 305 SSDPSYLTKAGQAVYSGMVSSDSKAIWLMQGWLFLSD-FWKPPQAKALLTSVPQGKMLVL 363
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL++EV P + + +YG P+IWCMLHNF G + MYG ++S+ GP E R+ N+TMVG+
Sbjct: 364 DLYSEVNPQYPRLQSYYGQPFIWCMLHNFGGTLPMYGAIESVNQGPFEGRSFVNSTMVGI 423
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y+ M E +F+ + V++ W ++Y+ RRY + AW + TVY
Sbjct: 424 GLTPEGINQNEVMYEFMMENSFRSQPVELTEWFDKYATRRYASRNANARAAWQIFKRTVY 483
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
NC+DG N+++ V P S + +WY
Sbjct: 484 NCSDGVKHHNKNIPVCRP-------------------------------SRKNKIDVWYD 512
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+ + +L IA+ E+ S +RYDL+D++RQAL + + I+ +Y+ + +
Sbjct: 513 VEDFFKGWDLMIAASKEVD-SPLFRYDLVDVSRQALQVISITYYNQILTSYKQKNLTSLA 571
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
L L++DMD +LA FLLG W+ A + E++ YE+NAR Q+T+W
Sbjct: 572 SSGNDLLHLLDDMDTVLATDSHFLLGAWIAGAHRNGVTPEEKALYEFNARNQVTLW---- 627
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKLTND 539
+A++L DY NK W+GL+ DYY R ++ + +SLE+ F K ++++ + K +
Sbjct: 628 GPDANIL-DYANKQWAGLVADYYHERWELFIDELKKSLENKTSFDEKKFQKDVFEKAESP 686
Query: 540 WQNGRNVYP 548
+ NVYP
Sbjct: 687 FTYRTNVYP 695
>gi|73965663|ref|XP_548088.2| PREDICTED: alpha-N-acetylglucosaminidase [Canis lupus familiaris]
Length = 747
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 229/570 (40%), Positives = 337/570 (59%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAFSG+VP AL VFP IT
Sbjct: 205 MGNLHTWGGPLPHSWHLKQLYLQHRILDRMRSFGMIPVLPAFSGHVPKALTRVFPQINIT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + ++E+G T+HIY DTF+E PP
Sbjct: 265 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELIQEFG-TNHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y++S A++Y M + DSDAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 322 SSEPSYLASATASVYQAMITVDSDAVWLLQGWLFQHQPQFWGPAQVKAVLEAVPRGRLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM+G
Sbjct: 382 LDLFAESQPVYIQTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMLG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D++AW++ ++ RRYG + + AW +L +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVSSFAARRYGVAHRDTEAAWRLLLRS 501
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR +V PS+ VT +
Sbjct: 502 VYNCSGEACSGHNRSPLVR----RPSLQMVTT---------------------------V 530
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L++S T+RYDL+D+TRQA + + ++ AY +
Sbjct: 531 WYNRSDVFEAWRLLLTAAPTLASSPTFRYDLLDVTRQAAQELVSLYYVEARSAYLRKELV 590
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + + + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 591 PLLRAAGVLVYELLPALDKVLASDSRFLLGRWLEQARAAAVSEAEAHLYEQNSRYQLTLW 650
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + + +L
Sbjct: 651 ----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEMLVESLVQGIPFQQHQFDKNAFQL 705
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ G YP + +GD + ++ L+ KY
Sbjct: 706 EQTFIFGTQRYPSQPDGDTVDLAKKLFIKY 735
>gi|326679829|ref|XP_688608.3| PREDICTED: alpha-N-acetylglucosaminidase-like [Danio rerio]
Length = 757
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 224/569 (39%), Positives = 332/569 (58%), Gaps = 42/569 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLPQSW +QL LQ KIL R+ GM PVLPAFSG VP + +FP A +T
Sbjct: 214 MGNLFQWGGPLPQSWHVKQLYLQFKILDRMRSFGMIPVLPAFSGIVPEGITRLFPKANVT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W + C Y+LD DPLF IG F+ Q ++E+G T HIYN DTF+E P
Sbjct: 274 KLSPWSHFNCT--YSCAYVLDPRDPLFHRIGALFLTQVIEEFG-TDHIYNTDTFNEMPPA 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y++S+ AI++ M S D A+WLMQGWLF DP FW+ Q+KALL+ VPLG+++V
Sbjct: 331 SSDPTYLASISRAIFNTMTSVDPQAIWLMQGWLFISDPSFWKADQVKALLHGVPLGRMIV 390
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++S++ FYG P+IWCMLHNF GN ++G +DSI GP A N+T+VG
Sbjct: 391 LDLFAESMPVYSSTNSFYGQPFIWCMLHNFGGNSGLFGTVDSINSGPFNAVRFPNSTLVG 450
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+GM+ EGIEQNPV+Y+LMSE+A++ + V++ W++ Y++RRYG + AW +L+ +V
Sbjct: 451 LGMTPEGIEQNPVIYELMSELAWRKDPVNLYKWVSLYALRRYGSMDENLALAWQLLFRSV 510
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK-PVSKEAVLKSETSSYDHPHLW 358
YNCT P KY+N+ + P+ L +T +W
Sbjct: 511 YNCT-------------LP------------KYKNHNRSPLVHRPSLHMQTD------IW 539
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y ++ RA +L + L T+RYDL+D+TRQAL E + +I A+Q
Sbjct: 540 YDPADFYRAWKLLFEAAPGLVTLETFRYDLVDVTRQALQLLTTEFYKDIKSAFQTQKLSD 599
Query: 419 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + +L+ ++D +L+ ++ FLLG WL+ A+ +E + Y+ NAR QIT+W
Sbjct: 600 LLTAGGVLVYDLLPELDRILSSNEHFLLGAWLQQAQSQGVDEHEAHLYDINARNQITLWG 659
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
+ + + DY +K W+GL+ DYY R ++ ++E L+ G F+ + + ++
Sbjct: 660 PDGE-----ILDYASKEWAGLVEDYYLQRWGLFVNTLVECLDRGRPFKQDVFNQAVFQVE 714
Query: 538 NDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + YP + GD ++ ++ KY
Sbjct: 715 KGFVFNQRKYPTKPLGDTYDIARRIFLKY 743
>gi|373953359|ref|ZP_09613319.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
gi|373889959|gb|EHQ25856.1| alpha-N-acetylglucosaminidase [Mucilaginibacter paludis DSM 18603]
Length = 733
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 224/570 (39%), Positives = 326/570 (57%), Gaps = 41/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGPLP W+ LQKKI+ R LGM PVLPAF+G+VPAA +N +P+AK+
Sbjct: 195 MGNMDSWGGPLPLRWMQTHFDLQKKIIARERALGMKPVLPAFTGHVPAAFKNKYPTAKL- 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + +D TY+LD+ DP+F IG+ F+++Q G T H+Y+ DTF+EN PP
Sbjct: 254 KTTNWKNGFAD-----TYILDSADPMFARIGQLFLQKQTALLG-TDHLYSADTFNENEPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
D PEY+ L +Y GM D+ AVW+MQGWLF D FW+P Q +ALL +VP K+++
Sbjct: 308 SDEPEYLGKLSERVYQGMHQADTAAVWVMQGWLFYSDRKFWKPEQTRALLKAVPDDKMII 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
LDL E++P+W ++ FYG P+IW ML+NF N ++G +DS A GP EA ++ M
Sbjct: 368 LDLATEIEPVWKRTEAFYGKPWIWNMLNNFGANTNLFGRMDSAAKGPAEAYHDPKSGQMK 427
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++MEGIEQNPV+YDL+++ ++++ ++V W+ +Y + RYG+ Q AWN+L T
Sbjct: 428 GIGLTMEGIEQNPVLYDLLTDNTWRNQPINVDEWLPKYVLNRYGKPNAQAQKAWNILRKT 487
Query: 299 VYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
VY+ D +I A P D S S +
Sbjct: 488 VYSVLADRYIRDGAESIIQARPTTDSS--------------------------SRWARTT 521
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
L Y ++ A + I + +LS S+ +R+DL+DL+RQ LA YA L + A+Q DA
Sbjct: 522 LNYEPKALLPAWQAMIKASEDLSTSDGFRFDLVDLSRQVLANYAFTLQRRFVLAHQQKDA 581
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ S F+EL++DMD LLA FLLGPW+ A++ ++ YE NA+ IT+W
Sbjct: 582 AAFKKHSAEFIELIQDMDQLLATRKDFLLGPWVADARRCGATVSEKALYEMNAKDLITLW 641
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
D + L +Y + WSGLL D+Y PR YF+ + L F + + R+
Sbjct: 642 GD----KDCPLNEYACRQWSGLLNDFYKPRWQQYFEQINLDLTGKKPFDKEAFERKIKSW 697
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
W N R YPV+ GD ++ ++ LY KY
Sbjct: 698 EWQWVNARKDYPVKPQGDPVLEARKLYKKY 727
>gi|255533666|ref|YP_003094038.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346650|gb|ACU05976.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 735
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 225/569 (39%), Positives = 334/569 (58%), Gaps = 42/569 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLP+S + LQKKIL R GM P+LPAF+G+VP A ++ FP AK+
Sbjct: 199 MGNIDGWGGPLPKSQMLAHEALQKKILERERSFGMTPILPAFTGHVPPAFKDKFPKAKLK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + S Y+LD D LF IG+ FIE+++K +G T H+Y DTF+ENTPP
Sbjct: 259 KT-NWTTFPS------VYILDPEDELFTTIGKRFIEEEVKTFG-TDHLYTADTFNENTPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
Y+S++ +Y M D +A W+MQGWLF + + FW+P Q+KALLN++P K++V
Sbjct: 311 TSDSLYLSNVSKKVYQSMALADPEATWIMQGWLFYHGEKFWKPTQIKALLNAIPNDKMIV 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMV 238
LDL++E P+W + +YG P+IW MLHNF GNI +YG +D +A G ++A+ + N+ MV
Sbjct: 371 LDLWSENHPVWQRTAAYYGKPWIWNMLHNFGGNISLYGRMDEVASGAIKAKQAANSGNMV 430
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ E IEQNPV+Y LM + + E ++V AW+ YS +RYG + AW +LY T
Sbjct: 431 GIGLTPEAIEQNPVMYQLMLDNIWTDEPINVTAWLKNYSRQRYGAQNALAEQAWQILYKT 490
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VY T G P SI++ G+P +E++ P
Sbjct: 491 VY--TGG----------ILPGGPESILT---------GRPTM------AESTRSTRPKKN 523
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y +E+I A E + + +LS ++ ++YDL+D+TRQ L YA+ L +AYQ D
Sbjct: 524 YKPAELIPAWEALLKASQQLS-TDGFKYDLVDVTRQVLVNYADTLQRQFAQAYQGKDGKK 582
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+LS FL +++D+D LLA FLLG WL AK++ E++K+YE NAR IT+W D
Sbjct: 583 FDRLSGDFLAVMDDVDYLLATRKDFLLGKWLNEAKRMGTTAEEKKRYERNARNLITLWAD 642
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
+ S L +Y + WSGL+ +Y PR +F Y + L+SG K + + +
Sbjct: 643 ----QNSSLNEYSCRQWSGLISSFYKPRWQQFFSYAKQQLKSGAKLDQKVFEEKMKRWEW 698
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKYL 567
DW N +V+ + +G+ + T++ LY KY+
Sbjct: 699 DWVNKNDVFTEQPSGNEIKTAESLYKKYI 727
>gi|350407422|ref|XP_003488083.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus impatiens]
Length = 770
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 225/576 (39%), Positives = 327/576 (56%), Gaps = 42/576 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL SW ++ L LQ +IL R+ ELG+ PVLPAF+G+VP A +FP A +T
Sbjct: 218 MGNIRGFGGPLTSSWHERSLQLQHRILQRMRELGIIPVLPAFTGHVPRAFPRLFPEANVT 277
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W S ++CC YLL+ TDPLF +IG F+ +KE+G T HIYNCDTF+EN PP
Sbjct: 278 KSATWNSFSD--KYCCPYLLEPTDPLFHKIGDQFLRTYIKEFG-TDHIYNCDTFNENEPP 334
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M S D A+WLMQGWLF +D FW P++KA L SVPLG+L+V
Sbjct: 335 TSELKFLRNVGHSIFQTMLSVDPQAIWLMQGWLFVHDAVFWTEPRIKAFLTSVPLGRLIV 394
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P++ K +YG P+IWCMLHNF G + M+G I E R E +TM+G
Sbjct: 395 LDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINRRVFEGRNMEGSTMIG 454
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++ E V++ W Y+ RRYG AW L TV
Sbjct: 455 TGLTPEGINQNYVIYELMNEMAYRQEPVNLDNWFEDYASRRYGAWNEYAVAAWKNLGSTV 514
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN IS GKY +P A L WY
Sbjct: 515 YNFRG--------------------ISKIRGKYVITRRPSLNLARLT-----------WY 543
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +F+ + + S YR+D++D+TRQAL A++++ ++E++ D
Sbjct: 544 DPEKFYSTWYIFLQARHGRKNSTLYRHDVVDITRQALQLKADKIYSVLVESFNQKDVTTF 603
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ R LEL +D++ +LA + FLLG WLE AK LA ++ + K YE+NAR QIT+W
Sbjct: 604 KLQAGRLLELFDDLEAILASSEDFLLGTWLEMAKNLATDDAESKLYEYNARNQITLWGPR 663
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW-RREWIKLTN 538
+ +RDY NK WSG++ DY+ PR AI+ + SL G + R + ++
Sbjct: 664 GE-----IRDYANKQWSGIVSDYFKPRWAIFLDGLTTSLTKGTSLNITRINERIFKEVEK 718
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFD 574
+ R +YP + GD + + + +K+ Q + +FD
Sbjct: 719 PFTLSRKIYPTNATGDCIDIAMRILSKWYQPS-IFD 753
>gi|149054264|gb|EDM06081.1| rCG33377, isoform CRA_d [Rattus norvegicus]
Length = 580
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 225/570 (39%), Positives = 331/570 (58%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 43 MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 102
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLGNW + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 103 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 159
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 160 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 219
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TMVG
Sbjct: 220 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 279
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW++ ++ RRYG S P AW +L +
Sbjct: 280 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLRS 339
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 340 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 368
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+AS +RYDL+D+TRQA+ + + + A+ D
Sbjct: 369 WYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQAVQELVSSCYEEARTAFLNQDLD 428
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + +L+ +D LLA + FLLG WL+ A+++A +E + + YE N+R QIT+W
Sbjct: 429 LLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAREVAVSESEAQFYEQNSRYQITLW 488
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ DYY PR ++ + SL G F+ + + L
Sbjct: 489 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGIPFQQHQFEKSVFPL 543
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ N + YP++ GD + S+ ++ K+
Sbjct: 544 EQAFINNKKRYPIQPQGDTVDLSKKIFLKF 573
>gi|109491871|ref|XP_001081442.1| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
gi|392351622|ref|XP_002727861.2| PREDICTED: alpha-N-acetylglucosaminidase [Rattus norvegicus]
gi|149054262|gb|EDM06079.1| rCG33377, isoform CRA_b [Rattus norvegicus]
Length = 739
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 225/570 (39%), Positives = 331/570 (58%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 202 MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLGNW + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 262 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 319 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TMVG
Sbjct: 379 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 438
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW++ ++ RRYG S P AW +L +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLRS 498
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 499 VYNCSGEACSGHNR-------------------------SPLVKRPSLQMSTA------V 527
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+AS +RYDL+D+TRQA+ + + + A+ D
Sbjct: 528 WYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQAVQELVSSCYEEARTAFLNQDLD 587
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + +L+ +D LLA + FLLG WL+ A+++A +E + + YE N+R QIT+W
Sbjct: 588 LLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAREVAVSESEAQFYEQNSRYQITLW 647
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ DYY PR ++ + SL G F+ + + L
Sbjct: 648 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGIPFQQHQFEKSVFPL 702
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ N + YP++ GD + S+ ++ K+
Sbjct: 703 EQAFINNKKRYPIQPQGDTVDLSKKIFLKF 732
>gi|340717403|ref|XP_003397173.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bombus terrestris]
Length = 770
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 223/570 (39%), Positives = 321/570 (56%), Gaps = 41/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL SW ++ L LQ KIL R+ ELG+ PVLPAF+G+VP A +FP A +T
Sbjct: 218 MGNIRGFGGPLTSSWHERSLQLQHKILQRMRELGIIPVLPAFTGHVPRAFPRLFPEANVT 277
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W S ++CC YLL+ TDPLF +IG F+ +KE+G T HIYNCDTF+EN PP
Sbjct: 278 KSATWNSFSD--KYCCPYLLEPTDPLFHKIGDQFLRTYIKEFG-TDHIYNCDTFNENEPP 334
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M S D A+WLMQGWLF +D FW P++K L SVPLG+L+V
Sbjct: 335 TSELKFLRNVGHSIFQTMLSVDPQAIWLMQGWLFVHDALFWTEPRIKTFLTSVPLGRLIV 394
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P++ K +YG P+IWCMLHNF G + M+G I E R E +TM+G
Sbjct: 395 LDLQSEQFPLYGKLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINRRVFEGRNMEGSTMIG 454
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++ E V++ W Y+ RRYG AW L TV
Sbjct: 455 TGLTPEGINQNYVIYELMNEMAYRQEPVNLDNWFEDYASRRYGAWNEYAVAAWKNLGSTV 514
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN IS GKY +P A L WY
Sbjct: 515 YNFRG--------------------ISKIRGKYVITRRPSLNLARLT-----------WY 543
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +F+ + + S YR+D++D+TRQAL A++++ ++E++ D
Sbjct: 544 DPEKFYSTWYIFLQARHGRQNSTLYRHDVVDITRQALQLKADKIYSALVESFNQKDVTTF 603
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ R LEL +D++ +LA + FLLG WLE AK LA ++ + K YE+NAR QIT+W
Sbjct: 604 KLQADRLLELFDDLEAILASSEDFLLGTWLEMAKNLATDDAESKLYEYNARNQITLWGPR 663
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW-RREWIKLTN 538
+ +RDY NK WSG++ DY+ PR AI+ + SL G + R + ++
Sbjct: 664 GE-----IRDYANKQWSGIVSDYFKPRWAIFLDALTTSLTKGTSLNITRINERIFKEVEK 718
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
+ R +YP GD + + + +K+ Q
Sbjct: 719 PFTLSRKIYPTNVTGDCIDIAMRILSKWHQ 748
>gi|281344539|gb|EFB20123.1| hypothetical protein PANDA_011160 [Ailuropoda melanoleuca]
Length = 619
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 221/570 (38%), Positives = 330/570 (57%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 81 MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 140
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 141 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 197
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A++Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 198 SSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQPEFWGPAQVTAVLGAVPRGRLLV 257
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F+G P+IWCMLHNF GN ++G L+++ GP AR N+TM G
Sbjct: 258 LDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMAG 317
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN +VY LM+E+ ++ + V D++AW++ + RRYG + + AW +L +
Sbjct: 318 TGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSSSAARRYGVTHKDTEAAWRLLLRS 377
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR +V P + + +
Sbjct: 378 VYNCSGEACSGHNRSPLVRRPSLQMATA-------------------------------V 406
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+AS ++RYDL+D+TRQA + + + AY +
Sbjct: 407 WYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQAAQELVSLYYEEARAAYLNKELV 466
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + + R + EL+ +D +LA FLLG WLE A+ A +E + + YE N+R Q+T+W
Sbjct: 467 PLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQARAAAVSEAEARFYEQNSRYQLTLW 526
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + + +L
Sbjct: 527 ----GPEGNIL-DYANKQLAGLVADYYAPRWGLFMEMLVESLAQGIPFQQHQFDKNAFQL 581
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ YP + GD + ++ L+ KY
Sbjct: 582 EQAFVFSTQRYPSQPQGDTVDLAKKLFLKY 611
>gi|301773566|ref|XP_002922216.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Ailuropoda
melanoleuca]
Length = 634
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 221/570 (38%), Positives = 330/570 (57%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 95 MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 154
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 155 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 211
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A++Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 212 SSEPSYLAAATASVYQAMITVDPDAVWLLQGWLFQHQPEFWGPAQVTAVLGAVPRGRLLV 271
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F+G P+IWCMLHNF GN ++G L+++ GP AR N+TM G
Sbjct: 272 LDLFAESQPVYIRTASFHGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMAG 331
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN +VY LM+E+ ++ + V D++AW++ + RRYG + + AW +L +
Sbjct: 332 TGMAPEGIGQNEMVYALMAELGWRKDPVADLEAWVSSSAARRYGVTHKDTEAAWRLLLRS 391
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR +V P + + +
Sbjct: 392 VYNCSGEACSGHNRSPLVRRPSLQMATA-------------------------------V 420
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+AS ++RYDL+D+TRQA + + + AY +
Sbjct: 421 WYNRSDVFEAWRLLLTAAPTLAASPSFRYDLLDVTRQAAQELVSLYYEEARAAYLNKELV 480
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + + R + EL+ +D +LA FLLG WLE A+ A +E + + YE N+R Q+T+W
Sbjct: 481 PLLRAAGRLVYELLPALDKVLASDRRFLLGSWLEQARAAAVSEAEARFYEQNSRYQLTLW 540
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + + +L
Sbjct: 541 ----GPEGNIL-DYANKQLAGLVADYYAPRWGLFMEMLVESLAQGIPFQQHQFDKNAFQL 595
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ YP + GD + ++ L+ KY
Sbjct: 596 EQAFVFSTQRYPSQPQGDTVDLAKKLFLKY 625
>gi|410981277|ref|XP_003996997.1| PREDICTED: alpha-N-acetylglucosaminidase [Felis catus]
Length = 857
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 225/570 (39%), Positives = 329/570 (57%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 318 MGNLHTWGGPLPPSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVT 377
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 378 QLGSWGHFNCS--YSCSFLLAPEDPLFPIIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 434
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y++S A++Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 435 SSEPSYLASATASVYQAMVTVDPDAVWLLQGWLFQHQPQFWGPAQVSAVLGAVPRGRLLV 494
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 495 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMVG 554
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D++AW+ ++ RRYG S + AW +L +
Sbjct: 555 TGMAPEGIGQNEVVYALMAELGWRKDPVADLEAWVTGFAARRYGVSHGNTEAAWRLLLRS 614
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR +V P LK T+ +
Sbjct: 615 VYNCSGEACSGHNRSPLVRRPS-------------------------LKMTTT------V 643
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+ S T+RYDL+D+TRQA + + + AY +
Sbjct: 644 WYNRSDVFEAWRLLLTTTPSLATSPTFRYDLLDVTRQAAQELVSLYYGEARTAYLNKELV 703
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + + + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 704 PLLRAAGILVYELLPSLDKVLASDSRFLLGSWLEQARAAAVSEAEAHFYEQNSRYQLTLW 763
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + + +L
Sbjct: 764 ----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEMLVESLVRGVPFQQHQFDQNAFQL 818
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ YP + +GD + ++ L+ +Y
Sbjct: 819 EQTFVLSTQRYPSQPHGDTVDLAKKLFLRY 848
>gi|440903235|gb|ELR53922.1| Alpha-N-acetylglucosaminidase, partial [Bos grunniens mutus]
Length = 614
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 230/574 (40%), Positives = 329/574 (57%), Gaps = 51/574 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 77 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 136
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+GNW + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 137 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 193
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 194 SSEPSYLAAATTAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 253
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TMVG
Sbjct: 254 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 313
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 314 TGMAPEGIGQNEVVYALMAELGWKKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 373
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC S E + N+ P+ + L+ T+ +W
Sbjct: 374 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 403
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
Y+ S+V A L +A+ + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 404 YNRSDVFEAWRLLLAATSTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 463
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
L A G+ EL+ +D +LA FLLG WLE A+Q A +E + YE N+R Q
Sbjct: 464 LTRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARQAAVSETEAHFYEQNSRYQ 518
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 532
+T+W E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + R
Sbjct: 519 LTLW----GPEGNIL-DYANKQLAGLMADYYAPRWRLFTETLVESLVQGVPFQQHQFDRN 573
Query: 533 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+L + G YP + GD + + L+ KY
Sbjct: 574 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 607
>gi|432926094|ref|XP_004080826.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Oryzias latipes]
Length = 882
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 222/574 (38%), Positives = 340/574 (59%), Gaps = 52/574 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N++ +GGPLPQSW QL LQ +IL R+ GM PVLPAFSGNVP + + P A +T
Sbjct: 345 MANMYKFGGPLPQSWHVNQLRLQFRILERMRAFGMIPVLPAFSGNVPKGILKLHPEANVT 404
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W + C+Y+LD DPLF++IG ++ Q +K++G T HIYN DTF+E TPP
Sbjct: 405 RLGPWAHFNCS--FSCSYVLDPRDPLFLQIGSLYLSQVVKQFG-TDHIYNTDTFNEMTPP 461
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ ++++ M + D A+WLMQGWLF D FW+PPQ++ALL+ VPLG+++V
Sbjct: 462 SSDPAYLSAISRSVFASMTAVDPKAIWLMQGWLFFSDAAFWKPPQIRALLHGVPLGRMIV 521
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S ++ FYG P+IWCMLHNF GN +G ++SI GP +A +N+TMVG
Sbjct: 522 LDLFAETEPVFSYTESFYGQPFIWCMLHNFGGNNGFFGTVESINSGPFKALNFKNSTMVG 581
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+GM+ EGI QNPV+Y+LMSE+A++ E V++ W + Y+ RRYG ++ AW +L+ +V
Sbjct: 582 IGMTPEGIHQNPVIYELMSELAWRKESVNLTKWASLYAARRYGSMHESLSAAWKLLFSSV 641
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT V + N+ P+ + T LWY
Sbjct: 642 YNCT-----------------------VPHYRNHNH-SPLVRRPSFNMNTG------LWY 671
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QL 413
++++ +LF+ + L + T+RYDL+D+TRQ L + +I +A+ +L
Sbjct: 672 DPADLLETWKLFMEAAPSLMSKETFRYDLVDVTRQVLQDLTTYFYQDIKDAFHSKKMPEL 731
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
+ GV +L +++ LL FLLG WLE A+ A +E + + Y+ NAR Q+
Sbjct: 732 LTSGGVL-----IYDLFPELNRLLNSDRNFLLGTWLEQAQSFALDEPEARLYDLNARNQL 786
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 533
T+W + + + DY NK W GL+ DYY R +++ + +++ L SG F+ + +
Sbjct: 787 TLWGPSGE-----ILDYANKEWGGLVEDYYAQRWSLFVQTLVDCLNSGLPFKQDAFNQAV 841
Query: 534 IKLTNDW-QNGRNVYPVESNGDALITSQWLYNKY 566
++ + NGR YP + GD + ++ KY
Sbjct: 842 FRVEKGFISNGRK-YPTKPQGDTYEIAHRIFLKY 874
>gi|126307960|ref|XP_001366343.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
domestica]
Length = 741
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 214/570 (37%), Positives = 320/570 (56%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +Q LQ +IL R+ GM PVLPAF+G++P A VFP A +T
Sbjct: 202 MGNLHTWGGPLPSSWDLKQSYLQYRILERMRSFGMKPVLPAFAGHIPKAFTRVFPQANVT 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + + C+YLL DPLF +G F+ + KE+G T HIY+ D F+E PP
Sbjct: 262 NLGMWGHFSCN--YSCSYLLAPEDPLFPVVGSLFLRELTKEFG-TDHIYSADIFNEMDPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ AA+Y M + D DAVWL QGWLF P FW+PPQMKA+L +VP G+ ++
Sbjct: 319 SSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNHPDFWKPPQMKAVLEAVPRGRFLI 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S + FYG P+IWCMLHNF GN ++G+LD++ GP AR N+T+VG
Sbjct: 379 LDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLFGVLDAVNRGPSTARLFPNSTIVG 438
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+ EGI QN VVY LM+E+ ++ + D+ AW+ ++ +RYG P + AW +L +
Sbjct: 439 TGIVPEGINQNEVVYALMAELGWRKDPFPDLGAWVAGFAAQRYGTPHPQAEAAWRLLLRS 498
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + T N +V P + +
Sbjct: 499 VYNCSWENCTGHNHSPLVKRPSLHLDF-------------------------------SV 527
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + +L+ S+ +RYDL+D+TRQ + + + + A++
Sbjct: 528 WYNRSDVFEAWRLLLEAAPQLATSSAFRYDLLDVTRQVAQELVSLYYGELKTAFEAGSMP 587
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + +L+ +D LL + FLLG WLE A+++A +E + YE NAR Q+T+W
Sbjct: 588 ALLSAGGLLVFDLLPSLDELLGTDERFLLGGWLEQAREMAVSEAEAWHYEQNARYQLTLW 647
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+ DY NK +GL+ YY PR ++ + +++SL G F + E L
Sbjct: 648 GPTGN-----ILDYANKQLAGLVAGYYAPRWKLFVEMLVKSLAEGTPFHQNQFENEAFLL 702
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ +GR +P + GD + ++ + KY
Sbjct: 703 GQAFVSGREKFPTQPQGDTVDLARKFFLKY 732
>gi|358419179|ref|XP_003584151.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Bos taurus]
Length = 741
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 231/574 (40%), Positives = 329/574 (57%), Gaps = 51/574 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+GNW + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ +Q + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 441 TGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 500
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC S E + N+ P+ + L+ T+ +W
Sbjct: 501 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 530
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
Y+ S+V A L + + + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 531 YNRSDVFEAWRLLLTATSTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 590
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
L A G+ EL+ +D +LA FLLG WLE A+Q A +E + YE N+R Q
Sbjct: 591 LTRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARQAAVSETEAHFYEQNSRYQ 645
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 532
+T+W E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + R
Sbjct: 646 LTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFTETLVESLVQGVPFQQHQFDRN 700
Query: 533 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+L + G YP + GD + + L+ KY
Sbjct: 701 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 734
>gi|449491231|ref|XP_004174728.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase,
partial [Taeniopygia guttata]
Length = 752
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 225/573 (39%), Positives = 337/573 (58%), Gaps = 51/573 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL W GPLP +W +QL LQ +I+ R+ LGM VLPAF+G+VP + VFP T
Sbjct: 211 MGNLRRWAGPLPPAWHFKQLYLQYRIVERMRSLGMTTVLPAFAGHVPQGILRVFPRVNAT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W D + C YLLD DP+F IG F+++ +KE+G T H+Y+ DTF+E TP
Sbjct: 271 RLGHWSHF--DCTYSCIYLLDPEDPMFQVIGTLFLKELIKEFG-TDHVYSADTFNEMTPL 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S + A++ M D A+WLMQGWLF + P FW+P Q++ALL+ VPLG+++V
Sbjct: 328 SSDPAYLSRVSNAVFRSMTGADPKALWLMQGWLFQHQPDFWQPAQVRALLHGVPLGRMIV 387
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE KP++ ++ FYG P+IWCMLHNF GN ++G +++I GP AR N+TMVG
Sbjct: 388 LDLFAESKPVYQWTESFYGQPFIWCMLHNFGGNHGLFGTVEAINHGPFAARRFPNSTMVG 447
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G+ EGIEQN +VY+LM+E+ ++ E +D+ +W+ +Y+ RRYG A AW +L +V
Sbjct: 448 TGLVPEGIEQNDMVYELMNELGWRQEPLDLPSWVTRYAERRYGAPNAAAASAWRLLLRSV 507
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT + NR P+ + L+ +T LWY
Sbjct: 508 YNCTGVCVNHNRS-------------------------PLVRRPSLRMDT------ELWY 536
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ------L 413
+ S+V A L +++G EL +S + YDL+D+TRQA + + +L+I +A+Q L
Sbjct: 537 NASDVFEAWRLLLSAGAELGSSPAFLYDLVDVTRQAAQQLVSHYYLSIRQAFQSHALPEL 596
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
A GV +L+ ++D LL+ H FLLG WL+SA+ +A ++++ +QYE NAR Q+
Sbjct: 597 LTAGGVL-----VYDLLPELDSLLSSHSLFLLGRWLQSARAVATSDQEAEQYELNARNQV 651
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 533
T+W + + DY N GL+ DYY R +++ ++ESL SG F + + +
Sbjct: 652 TLWGPSGN-----ILDYANXQLGGLVLDYYAVRWSLFVSVLVESLNSGRPFHQNQFNQVF 706
Query: 534 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
++ + + YP GD + S+ L+ KY
Sbjct: 707 FQVERGFIYNKKRYPAVPFGDTMEISRKLFLKY 739
>gi|148671928|gb|EDL03875.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
CRA_a [Mus musculus]
Length = 538
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 218/570 (38%), Positives = 326/570 (57%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW Q+ LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 1 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 60
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 61 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 117
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 118 FSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 177
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++ + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TMVG
Sbjct: 178 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 237
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW++ +++RRYG S P AW +L +
Sbjct: 238 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 297
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 298 VYNCSGEACSGHNRS-------------------------PLVKRPSLQMSTA------V 326
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+ S +RYDL+D+TRQA+ + + + AY +
Sbjct: 327 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 386
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + + +L+ +D LLA FLLG WL+ A++ A +E + + YE N+R QIT+W
Sbjct: 387 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQFYEQNSRYQITLW 446
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ DYY PR ++ + SL G F+ ++ + L
Sbjct: 447 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPFQQHEFEKNVFPL 501
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + YP + GD + S+ ++ KY
Sbjct: 502 EQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 531
>gi|270005801|gb|EFA02249.1| hypothetical protein TcasGA2_TC007912 [Tribolium castaneum]
Length = 747
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 210/572 (36%), Positives = 339/572 (59%), Gaps = 46/572 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W + LVLQK+IL R+ G+ PVLPAF+G++P A + ++P A ++
Sbjct: 201 MGNMRGFGGPLSPAWHSRSLVLQKQILQRMRAFGIIPVLPAFAGHLPRAFKTLYPDANMS 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + +CC Y LD T+PLF EIG+AF+ +Q+ E+G T H+YNCD+F+EN P
Sbjct: 261 KMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSEQISEFG-TDHMYNCDSFNENVPT 317
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVV 179
Y++++G +IY M D DAVWL+QGW+F D FW+ + ++++L SVPLGK++V
Sbjct: 318 SGDLTYLANVGKSIYKAMTDTDPDAVWLLQGWMFYNDNFWQDTERVRSILTSVPLGKMIV 377
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + Q++G PYIWCMLH+F G + M+G I P++AR EN+TM+G
Sbjct: 378 LDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMFGSSTVINEVPIKARHLENSTMIG 437
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+E A++ V++ W +YS RRYG ++AW +L TV
Sbjct: 438 TGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKYSTRRYGFPDSDAENAWRILQRTV 497
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y D ++ GKY ++K LK + + WY
Sbjct: 498 Y--------------------DYQGLNRMRGKY-----AITKSPSLKIKIWT------WY 526
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
ST++++ A + + + L A++ Y +DL+D+TRQ L Y + + +++ YQ +D+
Sbjct: 527 STNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVLQVYGDLYYKEMVKNYQSHDSANF 586
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
S++FLE+++D+D +L+ + FLLGPWLE+AK+ A + +E Q+E+NAR QIT+W
Sbjct: 587 QANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAANDSAEEAQFEYNARNQITLWGPR 646
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
+ + DY NK W+G++ ++ PR ++ Y+ + + D + + ++
Sbjct: 647 GE-----IMDYANKQWAGVVSHFFAPRWYLFINYLNSTFDGAFNQTYID-AKMFKEVEEP 700
Query: 540 WQNGRNVYPVESNGDAL-----ITSQWLYNKY 566
+ R +PVE GDA+ I +W +Y
Sbjct: 701 FTFDRTEFPVEPIGDAVEIAWKIHKKWTSEEY 732
>gi|91080563|ref|XP_973259.1| PREDICTED: similar to alpha-N-acetyl glucosaminidase [Tribolium
castaneum]
Length = 747
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 211/572 (36%), Positives = 338/572 (59%), Gaps = 46/572 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W + LVLQK+IL R+ G+ PVLPAF+G++P A + ++P A ++
Sbjct: 201 MGNMRGFGGPLSPAWHSRSLVLQKQILQRMRAFGIIPVLPAFAGHLPRAFKTLYPDANMS 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + +CC Y LD T+PLF EIG+AF+ +Q+ E+G T H+YNCD+F+EN P
Sbjct: 261 KMAPWNGF--NDTYCCPYFLDPTEPLFNEIGKAFLSEQISEFG-TDHMYNCDSFNENVPT 317
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
Y++++G +IY M D DAVW+MQGWLF++D F W + KA+L +VP GK++V
Sbjct: 318 SGDLTYLANVGKSIYKAMTDTDPDAVWVMQGWLFAHDFFYWTRNRAKAILTAVPKGKMIV 377
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + Q++G PYIWCMLH+F G + M+G I P++AR EN+TM+G
Sbjct: 378 LDLQSEQFPQYERLNQYFGQPYIWCMLHDFGGTLGMFGSSTVINEVPIKARHLENSTMIG 437
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+E A++ V++ W +YS RRYG ++AW +L TV
Sbjct: 438 TGLTPEGINQNYVIYELMTETAWRQAPVNLTEWFEKYSTRRYGFPDSDAENAWRILQRTV 497
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y D ++ GKY ++K LK + + WY
Sbjct: 498 Y--------------------DYQGLNRMRGKY-----AITKSPSLKIKIWT------WY 526
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
ST++++ A + + + L A++ Y +DL+D+TRQ L Y + + +++ YQ +D+
Sbjct: 527 STNDLLEAWTSLLEASDNLGANSGYLHDLVDVTRQVLQVYGDLYYKEMVKNYQSHDSANF 586
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
S++FLE+++D+D +L+ + FLLGPWLE+AK+ A + +E Q+E+NAR QIT+W
Sbjct: 587 QANSKKFLEILDDLDEILSTNSAFLLGPWLEAAKKAANDSAEEAQFEYNARNQITLWGPR 646
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
+ + DY NK W+G++ ++ PR ++ Y+ + + D + + ++
Sbjct: 647 GE-----IMDYANKQWAGVVSHFFAPRWYLFINYLNSTFDGAFNQTYID-AKMFKEVEEP 700
Query: 540 WQNGRNVYPVESNGDAL-----ITSQWLYNKY 566
+ R +PVE GDA+ I +W +Y
Sbjct: 701 FTFDRTEFPVEPIGDAVEIAWKIHKKWTSEEY 732
>gi|1171229|gb|AAC50512.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1171231|gb|AAC50513.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1197840|gb|AAB06188.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|1479981|gb|AAB36604.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|32450702|gb|AAH53991.1| N-acetylglucosaminidase, alpha- [Homo sapiens]
gi|119581237|gb|EAW60833.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
CRA_b [Homo sapiens]
Length = 743
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 217/573 (37%), Positives = 328/573 (57%), Gaps = 43/573 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ +YY PR ++ + +++S+ G F+ + + +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQL 704
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 569
+ + YP + GD + ++ ++ KY G
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPG 737
>gi|255553488|ref|XP_002517785.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
gi|223543057|gb|EEF44592.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis]
Length = 360
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 190/359 (52%), Positives = 260/359 (72%), Gaps = 6/359 (1%)
Query: 215 MYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWIN 274
MYGILDSI+ GP+EAR SEN+TMVGVGM MEGIE NPVVY+LMSEMAF+ EKV V W+
Sbjct: 1 MYGILDSISTGPIEARVSENSTMVGVGMCMEGIEHNPVVYELMSEMAFRSEKVQVLEWLK 60
Query: 275 QYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
YS RRYG++V ++ AW +LYHT+YNCTDG D N D IV FPD DPS+ S ++ Q+
Sbjct: 61 TYSRRRYGKAVHQVEAAWEILYHTIYNCTDGIADHNTDFIVKFPDWDPSVQSGSDTSQQD 120
Query: 335 -----YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI 389
+ S+ + + S+ H+WYS +VI AL+LFI G+ L+ S TYRYDL+
Sbjct: 121 NKHIFLHRSGSRRFLFEGPNSTLPQAHIWYSIQKVINALQLFIDGGSHLTGSLTYRYDLV 180
Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
DLTRQ L+K AN+++++ I A++ NDA + S++F++L++D+D LLA D FL+G WL
Sbjct: 181 DLTRQVLSKLANQVYVDAIIAFRSNDARALNLHSQKFIQLIKDIDVLLASDDNFLIGTWL 240
Query: 450 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 509
ESAK+LA N + +QYEWNARTQ+TMW+D T+ S L DY NK+WSGLL DYY PRA+
Sbjct: 241 ESAKELALNPSEMRQYEWNARTQVTMWYDTTKTNQSKLHDYANKFWSGLLEDYYLPRAST 300
Query: 510 YFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNG-DALITSQWLYNKYL 567
YF ++++SL+ + F+L++WR +WI +N+WQ G +YP++ +G DAL S+ LY+KY
Sbjct: 301 YFDHLVKSLKQNEKFKLQEWREKWIAFSNEWQAGTKLYPMKGSGDDALAISKALYDKYF 359
>gi|311267179|ref|XP_003131436.1| PREDICTED: alpha-N-acetylglucosaminidase [Sus scrofa]
Length = 744
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 225/575 (39%), Positives = 326/575 (56%), Gaps = 53/575 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 205 MGNLHTWSGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQISVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPSYLAAATAAVYQAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TM G
Sbjct: 382 LDLFAESQPVYVRTASFLGQPFIWCMLHNFGGNHGLFGALESVNQGPAAARLFPNSTMAG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ W+ ++ RRYG S + AW +L +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGTWVTSFAARRYGVSQGDAEAAWRLLLRS 501
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ +G T NR +V P + + +
Sbjct: 502 VYNCSGEGCTGHNRSPLVRRPSLQMATT-------------------------------V 530
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + + L++S +RYDL+D+TRQA+ + + + AY
Sbjct: 531 WYNQSDVFEAWRLLLKATPTLASSPAFRYDLVDITRQAVQELVSLYYEEARTAYLNKELV 590
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
L A G+ EL+ +D +LA FLLG WLE A+ +A +E + YE N+R
Sbjct: 591 SLMRAGGILA-----YELLPALDKVLASDSHFLLGSWLEQARGVAVSEAEALFYEQNSRY 645
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 531
Q+T+W E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + +
Sbjct: 646 QLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEMLVESLVQGIPFQQHQFDQ 700
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+L + G YP + GD + ++ L+ KY
Sbjct: 701 NVFQLEQTFVLGTRRYPSQPQGDTVDLAKKLFLKY 735
>gi|1479983|gb|AAB36605.1| alpha-N-acetylglucosaminidase [Homo sapiens]
gi|119581236|gb|EAW60832.1| N-acetylglucosaminidase, alpha- (Sanfilippo disease IIIB), isoform
CRA_a [Homo sapiens]
Length = 639
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 217/573 (37%), Positives = 328/573 (57%), Gaps = 43/573 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 100 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 159
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 160 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 216
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 217 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 276
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 277 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 336
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 337 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 396
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 397 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 425
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 426 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 485
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 486 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 545
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ +YY PR ++ + +++S+ G F+ + + +L
Sbjct: 546 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQL 600
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 569
+ + YP + GD + ++ ++ KY G
Sbjct: 601 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPG 633
>gi|2660688|gb|AAB88084.1| Naglu [Mus musculus]
Length = 739
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 218/570 (38%), Positives = 326/570 (57%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW Q+ LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 202 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 262 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 319 FSEPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++ + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TMVG
Sbjct: 379 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 438
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW++ +++RRYG S P AW +L +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 498
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 499 VYNCSGEACSGHNRS-------------------------PLVKRPSLQMSTA------V 527
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+ S +RYDL+D+TRQA+ + + + AY +
Sbjct: 528 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 587
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + + +L+ +D LLA FLLG WL+ A++ A +E + + YE N+R QIT+W
Sbjct: 588 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQFYEQNSRYQITLW 647
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ DYY PR ++ + SL G F+ ++ + L
Sbjct: 648 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPFQQHEFEKNVFPL 702
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + YP + GD + S+ ++ KY
Sbjct: 703 EQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732
>gi|254910995|ref|NP_038820.2| alpha-N-acetylglucosaminidase precursor [Mus musculus]
gi|20385160|gb|AAM21194.1|AF363242_1 N-acetyl-glucosaminidase [Mus musculus]
gi|3329361|gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus]
gi|33585908|gb|AAH55733.1| Alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB) [Mus
musculus]
gi|74211094|dbj|BAE37639.1| unnamed protein product [Mus musculus]
gi|74218052|dbj|BAE42009.1| unnamed protein product [Mus musculus]
gi|148671929|gb|EDL03876.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB), isoform
CRA_b [Mus musculus]
Length = 739
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 218/570 (38%), Positives = 326/570 (57%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW Q+ LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 202 MGNLHTWDGPLPRSWHLSQVYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVI 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 262 KLGSWGHFNCS--YSCSFLLAPGDPMFPLIGNLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 319 FSDPSYLAATTAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIRAVLEAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++ + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TMVG
Sbjct: 379 LDLFAESHPVYMHTASFHGQPFIWCMLHNFGGNHGLFGALEDVNRGPQAARLFPNSTMVG 438
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW++ +++RRYG S P AW +L +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLMAWVSSFAIRRYGVSQPDAVAAWKLLLRS 498
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 499 VYNCSGEACSGHNRS-------------------------PLVKRPSLQMSTA------V 527
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+ S +RYDL+D+TRQA+ + + + AY +
Sbjct: 528 WYNRSDVFEAWRLLLTAAPNLTTSPAFRYDLLDVTRQAVQELVSLCYEEARTAYLKQELD 587
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + + +L+ +D LLA FLLG WL+ A++ A +E + + YE N+R QIT+W
Sbjct: 588 LLLRAGGLLVYKLLPTLDELLASSSHFLLGTWLDQARKAAVSEAEAQFYEQNSRYQITLW 647
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ DYY PR ++ + SL G F+ ++ + L
Sbjct: 648 ----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPFQQHEFEKNVFPL 702
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + YP + GD + S+ ++ KY
Sbjct: 703 EQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732
>gi|66346698|ref|NP_000254.2| alpha-N-acetylglucosaminidase precursor [Homo sapiens]
gi|317373322|sp|P54802.2|ANAG_HUMAN RecName: Full=Alpha-N-acetylglucosaminidase; AltName:
Full=N-acetyl-alpha-glucosaminidase; Short=NAG;
Contains: RecName: Full=Alpha-N-acetylglucosaminidase 82
kDa form; Contains: RecName:
Full=Alpha-N-acetylglucosaminidase 77 kDa form; Flags:
Precursor
Length = 743
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 216/570 (37%), Positives = 327/570 (57%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ +YY PR ++ + +++S+ G F+ + + +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQL 704
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + YP + GD + ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
>gi|355568706|gb|EHH24987.1| Alpha-N-acetylglucosaminidase, partial [Macaca mulatta]
Length = 711
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 222/575 (38%), Positives = 328/575 (57%), Gaps = 53/575 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 172 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 231
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 232 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 288
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 289 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 348
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 349 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 408
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ +RYG S P AW +L +
Sbjct: 409 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 468
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 469 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 497
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY
Sbjct: 498 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 557
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
L A GV EL+ +D LLA FLLG WLE A+ A +E + YE N+R
Sbjct: 558 SLLRAGGVLAY-----ELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRY 612
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 531
Q+T+W E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + +
Sbjct: 613 QLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDK 667
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+L + + YP + GD + ++ ++ KY
Sbjct: 668 NVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 702
>gi|307192254|gb|EFN75548.1| Alpha-N-acetylglucosaminidase [Harpegnathos saltator]
Length = 741
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 212/569 (37%), Positives = 324/569 (56%), Gaps = 46/569 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W ++ + LQ +IL R+ +LG+ PVLPAF+G+VP A +FP+A +T
Sbjct: 194 MGNIRGFGGPLSINWHERTVRLQHRILRRMRDLGIVPVLPAFAGHVPRAFARLFPNANMT 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W K + ++CC YLL+ TDPLF IG F+ + E+G T HIYNCDTF+EN P
Sbjct: 254 KIEPW--NKFEDKYCCPYLLEPTDPLFQTIGEKFLRMYINEFG-TDHIYNCDTFNENEPG 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
Y+S++G +++ M + D A+WLMQGWLF +D FW P++++ L SVP G+++V
Sbjct: 311 NSELAYLSNVGRSVFQAMSTVDPQAIWLMQGWLFVHDFIFWTEPRVRSFLTSVPTGRMLV 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + K +YG P+IWCMLHNF G + M+G I E R +TMVG
Sbjct: 371 LDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRHMNGSTMVG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++HE VD+ AW Y+ RRYG AW L T+
Sbjct: 431 TGLTPEGINQNYVIYELMNEMAYRHEPVDLDAWFESYATRRYGAWNEYAVAAWKHLGRTI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN + VI P ++ S P +WY
Sbjct: 491 YNFVGIERIRGHYVITRRPSLNIS-------------------------------PWVWY 519
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ + +F+ + + YR+D++D+TRQAL A+ +++N+++ Y+ + G
Sbjct: 520 NREDFYHTWNVFLKARYGRGNNTLYRHDVVDITRQALQLMADNIYMNVVDCYKRKNITGF 579
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ L+L +D++ +LA FLLG WL AK +A +E++ + YE+NAR QIT+W N
Sbjct: 580 QSHAAALLDLFDDIEAILASGSNFLLGTWLAQAKDMAVDEKERQSYEYNARNQITLWGPN 639
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIKLTN 538
+ +RDY NK WSG++ DY+ PR A + K + +SL + + R ++++
Sbjct: 640 GE-----IRDYANKQWSGVVADYFKPRWAFFLKALEKSLVERTRLNMTEINDRMFLEVEQ 694
Query: 539 DWQNGRNVYPVESNGDAL-----ITSQWL 562
+ +YPV + GD L I S+WL
Sbjct: 695 AFTFSTKLYPVGTKGDTLDIAVKIISKWL 723
>gi|402900329|ref|XP_003913130.1| PREDICTED: alpha-N-acetylglucosaminidase [Papio anubis]
Length = 743
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 222/575 (38%), Positives = 327/575 (56%), Gaps = 53/575 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPEAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ +RYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY
Sbjct: 530 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 589
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
L A GV EL+ +D LLA FLLG WLE A+ A +E + YE N+R
Sbjct: 590 SLLRAGGVLAY-----ELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRY 644
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 531
Q+T+W E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + +
Sbjct: 645 QLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDK 699
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+L + + YP + GD + ++ ++ KY
Sbjct: 700 NVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
>gi|114667172|ref|XP_523654.2| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Pan
troglodytes]
gi|410216584|gb|JAA05511.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410258938|gb|JAA17435.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410304442|gb|JAA30821.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
gi|410337929|gb|JAA37911.1| N-acetylglucosaminidase, alpha [Pan troglodytes]
Length = 743
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 216/570 (37%), Positives = 326/570 (57%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + + +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 704
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + YP + GD + ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
>gi|355754184|gb|EHH58149.1| Alpha-N-acetylglucosaminidase, partial [Macaca fascicularis]
Length = 650
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 218/570 (38%), Positives = 326/570 (57%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 111 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 170
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 171 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 227
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 228 SSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 287
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 288 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 347
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ +RYG S P AW +L +
Sbjct: 348 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLLRS 407
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 408 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 436
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY +
Sbjct: 437 WYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELT 496
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + EL+ +D LLA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 497 SLLRAGGVLAYELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 556
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + + +L
Sbjct: 557 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 611
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + YP + GD + ++ ++ KY
Sbjct: 612 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 641
>gi|301626955|ref|XP_002942650.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Xenopus (Silurana)
tropicalis]
Length = 759
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 212/573 (36%), Positives = 323/573 (56%), Gaps = 51/573 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+H WGGPL SW++++L LQ +I R+ LGM VLPAF+G++P + VFP ++
Sbjct: 212 MGNIHTWGGPLSISWMEKRLSLQLQITERMRSLGMITVLPAFAGHIPEGILRVFPKVTVS 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LG W + + C+YLLD DPLF IG F+ Q ++ +G T HIY+ DTF+E +P
Sbjct: 272 RLGGWSNFNCT--YSCSYLLDPEDPLFQWIGELFLSQMVQSFG-TDHIYSADTFNEMSPT 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S++ AI+ M D DA+WLMQGWLF +P FWRP Q KALL+ P+G+++V
Sbjct: 329 SSDPGYLSAVSGAIFKSMAKVDPDAIWLMQGWLFINNPSFWRPAQTKALLHGAPIGRIIV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++ T++ FYG P+IWCML+NF GN ++G ++ + GP +A N+TMVG
Sbjct: 389 LDLFAETVPVYLTTESFYGQPFIWCMLNNFGGNHGLFGNIEGVNRGPFDAAKFPNSTMVG 448
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGIEQN ++Y+ M+E+ + + +++ WI+ YS RRYG+S + AW +L +V
Sbjct: 449 TGLTPEGIEQNDMIYEFMNEIGWSSQPINLTKWISNYSDRRYGQSNTDARMAWQILLRSV 508
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNCT + N +V P ++ + + Y
Sbjct: 509 YNCTQILHNHNHSPLVRRPSLNMNT-------------------------------DICY 537
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QL 413
+ +++ A + L S T+ YDL+D+TR+A+ + +E +L I EAY QL
Sbjct: 538 NKADIYEAWRFMHNASFALGKSATFLYDLVDITREAVQQLVSEYYLEIKEAYGKKSLQQL 597
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
A GV +L+ ++D LL+ GFLLG WL++AK +A + Y+ NAR QI
Sbjct: 598 MTAGGVL-----VYDLLPELDSLLSSQPGFLLGSWLKAAKSMASTPAEAALYDMNARNQI 652
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 533
T+W + DY NK + GL++DYY R ++ ++++SL G+ F + +
Sbjct: 653 TLWGPTGN-----ILDYANKQYGGLVQDYYTERWGLFVWFLVQSLNKGEHFNQDKFNKAV 707
Query: 534 IKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
L D+ Y GD L + +Y KY
Sbjct: 708 FVLEEDFVYNGKEYMASPTGDTLEIANKIYLKY 740
>gi|397485721|ref|XP_003813989.1| PREDICTED: alpha-N-acetylglucosaminidase [Pan paniscus]
Length = 682
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 216/570 (37%), Positives = 326/570 (57%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 143 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 202
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 203 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 259
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 260 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 319
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 320 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNEGPEAARLFPNSTMVG 379
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 380 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 439
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 440 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 468
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 469 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 528
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 529 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 588
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + + +L
Sbjct: 589 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 643
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + YP + GD + ++ ++ KY
Sbjct: 644 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 673
>gi|426348060|ref|XP_004041658.1| PREDICTED: alpha-N-acetylglucosaminidase [Gorilla gorilla gorilla]
Length = 743
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 216/570 (37%), Positives = 326/570 (57%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIEAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + + +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQL 704
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + YP + GD + ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPQGDTVDLAKKIFLKY 734
>gi|297701096|ref|XP_002827555.1| PREDICTED: alpha-N-acetylglucosaminidase [Pongo abelii]
Length = 836
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 220/575 (38%), Positives = 326/575 (56%), Gaps = 53/575 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 297 MGNLHSWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 356
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HI+ DTF+E PP
Sbjct: 357 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIFGADTFNEMQPP 413
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 414 SSEPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLV 473
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 474 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 533
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 534 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 593
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR +V P L+ TS +
Sbjct: 594 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------V 622
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY
Sbjct: 623 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 682
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
L A GV EL+ +D +LA FLLG WLE A+ A +E + YE N+R
Sbjct: 683 SLLRAGGVLAY-----ELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRY 737
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 531
Q+T+W E ++L DY NK +GL+ +YY PR ++ + + +S+ G F+ + +
Sbjct: 738 QLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDK 792
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+L + + YP + GD + ++ ++ KY
Sbjct: 793 NVFQLEQAFVLSKQRYPSQPQGDTVDLAKKIFLKY 827
>gi|348562747|ref|XP_003467170.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Cavia porcellus]
Length = 750
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 215/569 (37%), Positives = 332/569 (58%), Gaps = 41/569 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP++W +QL LQ +IL R+ LGM PVLPAF+G+VP A+ VFP IT
Sbjct: 211 MGNLHGWGGPLPRTWHLKQLSLQHQILDRMRALGMTPVLPAFAGHVPKAIGRVFPQVNIT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + ++E+G T+HIY DTF+E PP
Sbjct: 271 QLGSWGHFNCS--YSCSFLLAPEDPLFPLIGGIFLRELIREFG-TNHIYGADTFNEMQPP 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A++ M + DSDAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 328 SSDPAYLAAATEAVFKAMVAVDSDAVWLLQGWLFQHQPEFWGPAQVGAVLGAVPQGRLLV 387
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 388 LDLFAESQPVYTRTASFRGQPFIWCMLHNFGGNHGLFGALEAVNRGPTAARLFPNSTMVG 447
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW+++++ RRYG + P + AW +L +
Sbjct: 448 TGITPEGIGQNEVVYALMAELGWRKDPVPDLLAWVSRFAERRYGVAQPDAEAAWRLLLRS 507
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ + L+ T+ +W
Sbjct: 508 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTA------VW 537
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y+ S+V A L + + +L+ S T+RYDL+D+TRQAL + + + + AY + G
Sbjct: 538 YNRSDVFEAWRLLLKASPKLTTSPTFRYDLLDVTRQALQELVSLYYEEVRAAYLHQELAG 597
Query: 419 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + +L+ +D +LA FLLG WL A+ A +E + + YE N+R Q+T+W
Sbjct: 598 LLRAGGVLAYQLLPALDEVLASDHHFLLGSWLAQARAAAASETEARLYEQNSRYQLTLW- 656
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
E ++L DY NK +GL+ YY PR ++ + + +SL F+ + ++ L
Sbjct: 657 ---GPEGNIL-DYANKQLAGLVAHYYAPRWQLFIESLADSLARAAPFQQHQFDKDVFLLE 712
Query: 538 NDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ Y + GD + ++ ++ ++
Sbjct: 713 QAFVLSSRRYRSQPQGDTVDLARKVFLRF 741
>gi|295132875|ref|YP_003583551.1| hypothetical protein ZPR_1010 [Zunongwangia profunda SM-A87]
gi|294980890|gb|ADF51355.1| predicted protein [Zunongwangia profunda SM-A87]
Length = 750
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 221/578 (38%), Positives = 330/578 (57%), Gaps = 63/578 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGPLPQSW + LQKKIL R ELGM PVLPAF+G+VPA+ + FP A +
Sbjct: 206 MGNLDGWGGPLPQSWKESHRDLQKKILKRSRELGMKPVLPAFTGHVPASFKKFFPDADLK 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + D TY+LDA DPLF EIG+ F+E+Q + +G T H Y DTF+EN PP
Sbjct: 266 KT-NWGNDFGD-----TYILDAEDPLFAEIGKRFLEKQEEVFG-TDHFYTADTFNENEPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVV 179
D P+Y+ L I+ GM++ D +A W+MQGWLF S+ FW+ PQ+K LL++VP ++++
Sbjct: 319 SDDPKYLGELSEKIFEGMKAADPEATWVMQGWLFYSHKDFWKTPQIKGLLSTVPDDRMII 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
LDL E++P+W ++ FYG +IW MLHNF GNI M+G ++++A P A S + +
Sbjct: 379 LDLATEIEPVWKQTEAFYGKQWIWNMLHNFGGNISMFGRIETVAEQPALALNDSTSGNLK 438
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ME IEQNPV+Y+LM++ ++ +++K+W+ Y+ RYG +I +AW++L T
Sbjct: 439 GIGLTMEAIEQNPVLYELMTDNTWRDTPIELKSWLKNYTRNRYGAVNDSILEAWDILVAT 498
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
YN T D +I A P ++ Y + + +
Sbjct: 499 AYNGT-TIRDGAESIIAARP------------TFEGYRR--------------WARTKMN 531
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y +++ A +LFI + + S+ + YDL+DL+RQ LA YA + + AY+ ND
Sbjct: 532 YDPLDLLPAWDLFIGARDRFKDSDGFAYDLVDLSRQVLANYALPVQQQMRIAYENNDKEA 591
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF- 477
+ S L L+ D+D LLA FLLGPW+ A+ E++ YE NAR IT+W
Sbjct: 592 FKKHSEELLTLISDLDRLLATRKDFLLGPWIADARSWGTTPEEKALYERNARDLITLWGG 651
Query: 478 -DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG--------FRLKD 528
DN L +Y + WSG+L D+Y PR ++ I +E+ G ++K+
Sbjct: 652 PDNP------LHEYSCRQWSGVLDDFYKPR----WQQFIADVEANWGDFDQEVFDEKIKE 701
Query: 529 WRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
W +W+ N YP + +GD+ ++ LY+KY
Sbjct: 702 WEWKWV-------NKEEAYPTQPSGDSYKVAKALYDKY 732
>gi|395827009|ref|XP_003786703.1| PREDICTED: alpha-N-acetylglucosaminidase [Otolemur garnettii]
Length = 756
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 218/569 (38%), Positives = 320/569 (56%), Gaps = 41/569 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWGGPLPFSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QL +W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 264 QLSSWGHFNCS--YSCSFLLAPGDPIFSLIGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D DAVWL+QGWLF + P FW P Q+KA+L +VPLG+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMIAVDPDAVWLLQGWLFQHQPQFWGPTQIKAVLRAVPLGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYSRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNQGPKAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 441 TGMAPEGINQNEVVYALMAELGWRKDPVPDLVAWVTSFADRRYGISHGDAEAAWRLLLRS 500
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ K L+ T+ +W
Sbjct: 501 VYNCSGEACSGHNH------------------------SPLVKRPSLQMNTT------VW 530
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y+ S+V A L + S L+AS +RYDL+D+TRQA+ + + + AY +
Sbjct: 531 YNRSDVFEAWRLLLTSAPTLAASPIFRYDLLDITRQAIQELVSLYYEKARTAYLNKELVP 590
Query: 419 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + EL+ +D +LA + FLLG WL A+ +A +E + YE N+R Q+T+W
Sbjct: 591 LLRAGGLLAYELLPALDEVLASDNHFLLGSWLAQARAVAISEAEANFYEQNSRYQLTLW- 649
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
+ DY NK +GL+ DYY PR ++ + + L G F+ + + + L
Sbjct: 650 ----GPVGNILDYANKQLAGLVADYYAPRWQLFMQALGNCLAQGIPFQQRQFDKNVFPLE 705
Query: 538 NDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ YP + G+ + ++ ++ KY
Sbjct: 706 QAFVLNSKRYPSQPQGNTMDLAKKIFLKY 734
>gi|354485058|ref|XP_003504701.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cricetulus griseus]
gi|344251941|gb|EGW08045.1| Alpha-N-acetylglucosaminidase [Cricetulus griseus]
Length = 740
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 219/575 (38%), Positives = 328/575 (57%), Gaps = 53/575 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 203 MGNLHTWGGPLPRSWHLKQLYLQHRILDRMRAFGMIPVLPAFAGHVPKAITRVFPQVNVF 262
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E P
Sbjct: 263 QLGSWGHFNCS--YSCSFLLAPGDPVFPLIGSLFLRELIKEFG-TDHIYGADTFNEMQPI 319
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P ++++ AA+Y M S D DA+WL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 320 SSDPSFLTAATAAVYEAMISVDPDAIWLLQGWLFQHQPQFWGPAQVKAVLQAVPRGRLLV 379
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P++ + FYG P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 380 LDLFAESHPVYMQTASFYGQPFIWCMLHNFGGNHGLFGALEAVNQGPRAARIFPNSTMVG 439
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN +VY LM+E+ ++ + V D++ W+++++ RYG S P + AW +L +
Sbjct: 440 TGIAPEGIGQNEMVYALMAELGWRKDPVPDLEVWVSRFASHRYGMSHPDAEAAWRLLLRS 499
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC + NR +V P + + I +
Sbjct: 500 VYNCPGETYNGHNRSPLVKRPSLQINTI-------------------------------V 528
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ----- 412
WY+ S+V A L + + L+ S +RYDL+D+TRQ+L + + + A+
Sbjct: 529 WYNRSDVFEAWRLLLTAAPNLTTSKAFRYDLLDVTRQSLQELVSLFYEEARIAFMKEELD 588
Query: 413 -LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
L A G+ ++R+ L+ +D LLA FLLG WL A+ +A +E++ + YE N+
Sbjct: 589 LLLRAGGI--ITRK---LLPALDELLASDSRFLLGTWLNQARAMAVSEDEAQFYELNSLY 643
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 531
Q+T+W E +++ DY NK +GL+ DYY PR ++ + + SL G FR ++ +
Sbjct: 644 QLTLW----GPEGNIM-DYANKQLAGLVADYYQPRWGLFMEALAHSLARGVPFRQHEFEK 698
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
L + + YP GD + S+ L+ KY
Sbjct: 699 NVFPLELAFIINKKRYPSHPQGDTVDLSKKLFLKY 733
>gi|380030624|ref|XP_003698943.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase-like
[Apis florea]
Length = 769
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 212/584 (36%), Positives = 328/584 (56%), Gaps = 49/584 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W ++ + LQ +IL R+ LG+ PVLPAF+G+VP A +FP A +T
Sbjct: 216 MGNMRGFGGPLSSNWHEKSIRLQHRILERMRALGIIPVLPAFAGHVPRAFLRLFPKANVT 275
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + ++CC YLL+ DPLF +IG+ F++ ++E+G T H+YNCDTF+EN P
Sbjct: 276 KSAVWNNFSD--KYCCPYLLEPMDPLFKQIGQQFLKTYIEEFG-TDHVYNCDTFNENEPY 332
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M + DS A+WLMQGWLF +D FW P+ + L S+PLG+++V
Sbjct: 333 TSELKFLRNIGHSIFEAMSNVDSKAIWLMQGWLFYHDSVFWTEPRTRTFLTSIPLGRMIV 392
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + +YG P+IWCMLHNF G + M+G + I EAR +TMVG
Sbjct: 393 LDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGGTLGMFGSAEIINHRIFEARNMNGSTMVG 452
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPAIQDAWNVLYH 297
G++ EGI QN V+Y+LM+EMA++ V++ W Y+ RRYG + AW +
Sbjct: 453 TGLTPEGINQNYVIYELMNEMAYRKRPVNLDKWFENYANRRYGDTKGNEHTVTAWKGFKN 512
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
TVYN +D + + I P+++ S P
Sbjct: 513 TVYNFSDTRRIRGKYAITIRPNLNFS-------------------------------PWR 541
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ I + + + + S YR+D++D+TRQAL A+E++ ++IE++ +
Sbjct: 542 WYNKDAFIHYWYMLLQARDLKRNSTLYRHDVVDVTRQALQLIADEIYTDLIESFNKKNID 601
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
Q ++ L L +D++ +LA + FLLG WL+ AK LA N+E+E YE+NAR QIT+W
Sbjct: 602 LFKQNAKLLLALFDDLEEILASSEDFLLGKWLKMAKDLATNDEEEILYEYNARNQITLW- 660
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK-L 536
+RDY NK WSG++ DY+ PR AI+ + SL +G + + +
Sbjct: 661 ----GPLGEIRDYANKQWSGIVADYFKPRWAIFLNELETSLTTGTRVNTTKMNEQIFENV 716
Query: 537 TNDWQNGRNVYPVESNGDAL-----ITSQWLYNKYLQGTGVFDH 575
+ R +YP ++ GD++ I S+W Y+ YL F H
Sbjct: 717 EEAFTFSRKIYPTKATGDSIDIAERILSEW-YDPYLPFHKTFRH 759
>gi|395532374|ref|XP_003768245.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Sarcophilus
harrisii]
Length = 726
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 217/570 (38%), Positives = 321/570 (56%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL SW +Q LQ +IL R+ GM PVLPAF+G+VP A VFP A +T
Sbjct: 187 MGNLHSWGGPLSSSWHRKQSSLQYQILERMRSFGMKPVLPAFAGHVPKAFTRVFPQAYVT 246
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + C+YLL DPLF +G F+ + +E+G T HIY+ DTF+E PP
Sbjct: 247 HLGMWGHFNCT--YSCSYLLAPEDPLFPVVGSLFLRELTQEFG-TDHIYSADTFNEMEPP 303
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW+PPQ+KA+L +VPLG+L+V
Sbjct: 304 SSEPAYLAAATAAVYEAMIAVDVDAVWLLQGWLFQHQPDFWKPPQVKAVLKAVPLGRLLV 363
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+AE KP++S + FYG P+IWCMLHNF GN ++G LD++ GP +A N+T VG
Sbjct: 364 LDLYAESKPVYSRTDSFYGQPFIWCMLHNFGGNHGLFGALDAVNRGPSDAWLFPNSTFVG 423
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+ EGI QN VVY LM+E+ +Q + D+ AW+ ++ +RYG + AW +L +
Sbjct: 424 TGIVPEGINQNEVVYALMAELGWQKGPLPDLGAWVAGFAAQRYGTPHSHAEAAWKLLLQS 483
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ D T NR +V P + I +
Sbjct: 484 VYNCSGDLCTGHNRSPLVKRPSLHLDI-------------------------------SV 512
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L++S +RYDL+D+TRQ + + + + A++
Sbjct: 513 WYNRSDVFEAWRLLLEAAPVLASSPAFRYDLLDVTRQVAQELVSLYYEELRTAFEAGAMP 572
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + +L+ +D LLA + FLLG WLE A+++A +E + QY+ NA Q+T+W
Sbjct: 573 ALLTAGGLLVFDLLPSLDELLASDERFLLGAWLEQAREMAVSEAEAWQYKQNALYQLTLW 632
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+ DY NK +GL+ YY PR ++ + +++SL G F + E + L
Sbjct: 633 GPTGN-----ILDYANKQLAGLVAGYYAPRWKLFVEMLVKSLAEGTPFHQNQFESEALLL 687
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
++ GR +P + GD + + + +Y
Sbjct: 688 GQNFVLGREKFPTQPQGDTVDLVKKFFLRY 717
>gi|444714090|gb|ELW54978.1| Alpha-N-acetylglucosaminidase [Tupaia chinensis]
Length = 724
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 222/571 (38%), Positives = 327/571 (57%), Gaps = 43/571 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ ++L R+ GM PVLPAF G+VP A+ VFP +T
Sbjct: 174 MGNLHTWGGPLPHSWHLKQLYLQHRVLDRMRSFGMIPVLPAFPGHVPKAITRVFPQVNVT 233
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DP+F IG F+ + KE+G T HIY DTF+E PP
Sbjct: 234 QLGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRELTKEFG-TDHIYGADTFNELQPP 290
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AAIY+ M + D AVWL+QGW+F + P FW P Q+KA+L +VP G+L+V
Sbjct: 291 SSEPSYLAAATAAIYAAMTAVDPGAVWLLQGWIFQHQPDFWGPAQVKAVLEAVPRGRLLV 350
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN +YG L+++ +GP AR N++MVG
Sbjct: 351 LDLFAETRPVYLYTASFLGQPFIWCMLHNFGGNHGLYGTLEAVNWGPKAARLFPNSSMVG 410
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ Y+ RRYG S+ + AW +L +
Sbjct: 411 TGMAPEGINQNEVVYALMAELGWRKDPVPDLAAWVTSYADRRYGVSLGDAEAAWRLLLRS 470
Query: 299 VYNCTDG-ATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + NR P+ K L+ T+ +
Sbjct: 471 VYNCSGQMCSGHNRS-------------------------PLVKRPSLQMNTT------V 499
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ S+V A L + + L+AS T+RYDL+D+TRQA+ + + + AY +
Sbjct: 500 WYNRSDVFEAWRLLLTAAPTLAASPTFRYDLLDVTRQAVQELVSLYYEEARTAYLNKELV 559
Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + + EL+ D+D LLA F+LG WLE A+ +A +E + + YE N+R Q+T+W
Sbjct: 560 SLLRAGGILVYELLPDLDNLLATDGRFMLGSWLEQARAVAVSETEAQFYEQNSRYQLTLW 619
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+ DY NK +GL+ DYY PR ++ + + SL G F+ + + +L
Sbjct: 620 GPTGN-----ILDYANKQLAGLVADYYAPRWQLFMEMLANSLTQGIPFQQHQFDQNAFQL 674
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
+ YP + GD + ++ ++ KY
Sbjct: 675 EQAFVLSVERYPSQPQGDTVELAKKIFLKYF 705
>gi|426238065|ref|XP_004012978.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 1 [Ovis aries]
Length = 748
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 228/581 (39%), Positives = 325/581 (55%), Gaps = 51/581 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 211 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 271 QMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D AVWL+QGWLF P FW P Q+ A+L +VP G+L+V
Sbjct: 328 SSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQPEFWGPAQVAAVLGAVPRGRLLV 387
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+T+VG
Sbjct: 388 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPATARRFPNSTLVG 447
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 448 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 507
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC S E + N+ P+ K L T+ +W
Sbjct: 508 VYNC-----------------------SGEECRGHNH-SPLVKRPSLHMVTT------VW 537
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
Y+ S+V A L + + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 538 YNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 597
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
L A G+ EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q
Sbjct: 598 LMRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARLAAVSETEAHFYEQNSRYQ 652
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 532
+T+W E ++L DY NK +GL+ DYY PR ++ + + ESL G F+ + +
Sbjct: 653 LTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFAETLAESLVQGVPFQQHQFDKN 707
Query: 533 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 573
+L + G YP + GD + + L+ KY G F
Sbjct: 708 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKYYPRAGSF 748
>gi|256422141|ref|YP_003122794.1| alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
gi|256037049|gb|ACU60593.1| Alpha-N-acetylglucosaminidase [Chitinophaga pinensis DSM 2588]
Length = 728
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 210/570 (36%), Positives = 324/570 (56%), Gaps = 44/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGPLPQ W D VLQ++IL +GM P+LPAF+G+VP A ++ +P+ +I
Sbjct: 195 MGNIDAWGGPLPQHWKDSHKVLQQQILAAERSMGMLPILPAFTGHVPPAFKDKYPN-EIV 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW D + Y+LD P+F +IG+ F+E Q K +G T H Y+ DTF+EN PP
Sbjct: 254 KPTNW-----DAGFPDVYILDPNSPMFDKIGKKFLEAQTKAFG-TDHFYSADTFNENVPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
++ ++ +Y+ M + D AVW+MQGW+F Y+ +W PQ++ALLN+VP ++V
Sbjct: 308 SSDSSFLDAMSRKVYASMAAADPKAVWVMQGWMFHYNASYWHQPQIRALLNAVPDDHMIV 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
LDL++E P W ++ +YG P+IW MLHNF GN M+G +D+ A P A + M
Sbjct: 368 LDLYSESHPEWRNTQAYYGKPWIWNMLHNFGGNTGMWGGMDAAAHDPATALHDPASGKMS 427
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ EGIEQNP +Y LM + ++ + ++V W+ Y+ +RYG A+ AW +LYHT
Sbjct: 428 GIGLTPEGIEQNPALYQLMIDNVWRDQPINVDTWLQSYAKQRYGAENEAVNKAWQILYHT 487
Query: 299 VY--NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
VY T+GA + +IVA P +D ++ E V
Sbjct: 488 VYIGGPTEGAPES---IIVARPTLD-----------------IAAERV---------KTK 518
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
L Y ++V+ A +LFI + +L + ++YDL+DLTRQ L YA+ L + AY+ D
Sbjct: 519 LEYDPAKVVPAWDLFINAAAQLKPTEGFKYDLVDLTRQVLGNYASPLQQRVATAYRNKDL 578
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
Q S +F+ L++DMD LL +GFLLG W+ A+ ++ YE+NA+ +T+W
Sbjct: 579 AAFKQYSTQFIGLLDDMDMLLGTQEGFLLGKWVSDARSNGITPAEQDLYEFNAKDLVTLW 638
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
D + S + +Y N+ W+GL++ +Y PR +F + SL+ G+ LK + +
Sbjct: 639 GD----KDSPVHEYSNRQWNGLIKGFYKPRWQQFFTLLESSLKKGETADLKAFEEQVKAF 694
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
W NG + Y V+ GDA+ + L+ KY
Sbjct: 695 EWKWANGHDKYAVKPQGDAVKAAVQLHKKY 724
>gi|426238067|ref|XP_004012979.1| PREDICTED: alpha-N-acetylglucosaminidase isoform 2 [Ovis aries]
Length = 739
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 228/581 (39%), Positives = 325/581 (55%), Gaps = 51/581 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 202 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 262 QMGSWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D AVWL+QGWLF P FW P Q+ A+L +VP G+L+V
Sbjct: 319 SSEPSYLAAATAAVYQAMTAVDPGAVWLLQGWLFQNQPEFWGPAQVAAVLGAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+T+VG
Sbjct: 379 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPATARRFPNSTLVG 438
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 439 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 498
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC S E + N+ P+ K L T+ +W
Sbjct: 499 VYNC-----------------------SGEECRGHNH-SPLVKRPSLHMVTT------VW 528
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------Q 412
Y+ S+V A L + + L++S +RYDL+D+TRQA+ + + + + AY
Sbjct: 529 YNRSDVFEAWRLLLTATPTLASSPAFRYDLVDVTRQAVQELVSLYYEEMRTAYLKKELVP 588
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
L A G+ EL+ +D +LA FLLG WLE A+ A +E + YE N+R Q
Sbjct: 589 LMRAGGILA-----YELLPALDQVLASDCHFLLGSWLEQARLAAVSETEAHFYEQNSRYQ 643
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 532
+T+W E ++L DY NK +GL+ DYY PR ++ + + ESL G F+ + +
Sbjct: 644 LTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFAETLAESLVQGVPFQQHQFDKN 698
Query: 533 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 573
+L + G YP + GD + + L+ KY G F
Sbjct: 699 AFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKYYPRAGSF 739
>gi|328778968|ref|XP_623833.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Apis mellifera]
Length = 752
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 212/572 (37%), Positives = 329/572 (57%), Gaps = 52/572 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W D+ + LQ +IL R+ LG+ PVLPAF+G+VP AL +FP A +T
Sbjct: 198 MGNMRGFGGPLNSNWHDKSIRLQHRILERMRALGIIPVLPAFAGHVPRALLKLFPKANVT 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + ++CC YLL+ TDPLF +IG+ F++ ++E+G T H+YNCDTF+EN P
Sbjct: 258 KSAVWNNFSD--KYCCPYLLEPTDPLFKQIGQQFLKTYIEEFG-TDHVYNCDTFNENEPY 314
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G +I+ M S DS A+WLMQGWLF +D FW P+ + L SVPLG+++V
Sbjct: 315 TSELKFLRNIGHSIFEAMNSVDSKAIWLMQGWLFYHDSVFWTEPRTRTFLTSVPLGRMIV 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + +YG P+IWCMLHNF G + M+G + I EAR +TMVG
Sbjct: 375 LDLQSEQFPQYKRLNSYYGQPFIWCMLHNFGGTLGMFGSAEIINHRVFEARNMNGSTMVG 434
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPAIQDAWNVLYH 297
G++ EGI QN V+Y+LM+EMA++ + V++ W ++ RRYG + AW +
Sbjct: 435 TGLTPEGINQNYVIYELMNEMAYRKKPVNLDKWFENFANRRYGDIKGNEHTVTAWKGFKN 494
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
TVYN +D + + VI P+++ P
Sbjct: 495 TVYNFSDTRRIRGKYVITIRPNLN-------------------------------FFPWR 523
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY+ I + + + + S YR+D++D+TRQAL A+E++ ++IE++ +
Sbjct: 524 WYNKDAFIYYWYVLLQARDLKRNSTLYRHDVVDVTRQALQLIADEIYTDLIESFNKKNID 583
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
Q ++ L L +D++ +LA + FLLG WL+ AK LA ++E+E YE+NAR QIT+W
Sbjct: 584 LFKQNAKLLLALFDDLEEILASSEDFLLGKWLKMAKDLATDDEEEILYEYNARNQITLW- 642
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRREWI 534
+RDY NK WSG++ DY+ PR AI+ + SL +G + R+ +R +
Sbjct: 643 ----GPLGEIRDYANKQWSGIVADYFKPRWAIFLNELETSLTTGTRVNTTRIN--KRIFE 696
Query: 535 KLTNDWQNGRNVYPVESNGDAL-----ITSQW 561
+ + R +YP ++ GD++ I S+W
Sbjct: 697 NVEKAFTFSRKIYPTKATGDSIDIAERILSEW 728
>gi|344285558|ref|XP_003414528.1| PREDICTED: alpha-N-acetylglucosaminidase [Loxodonta africana]
Length = 744
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 222/575 (38%), Positives = 321/575 (55%), Gaps = 53/575 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 205 MGNLHSWGGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAVTRVFPQVNVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DP+F IG F+ + E+G T HIY DTF+E PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPGDPMFPIIGSLFLRELTTEFG-TDHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G L+V
Sbjct: 322 SSEPSYLAAATAAVYEAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGHLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 382 LDLFAETQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGTLETVNQGPAAARLFPNSTMVG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG + AW +L +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVPDLGAWVASFAARRYGGIHQDAETAWRLLLRS 501
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ + + NR P+ K L+ T+ +
Sbjct: 502 VYNCSGESCSGHNRS-------------------------PLVKRPSLQMNTT------V 530
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L +A+ L+AS +RYDL+D+TRQA + + + + AY
Sbjct: 531 WYNRSDVFEAWRLLLATTPALAASPAFRYDLLDVTRQAAQELVSFYYGEVRTAYLNKELV 590
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
L A GV EL+ +D +LA FLLG WLE A+ A +E + +E N+R
Sbjct: 591 HLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARVAAVSEAEAHFFEQNSRY 645
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 531
Q+T+W + DY NK +GL+ DYY PR ++ ++ESL F+ + +
Sbjct: 646 QLTLW-----GPVGNILDYANKQLAGLVSDYYTPRWQLFVGALVESLVQDVPFQQRQFDE 700
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+L + YP + GD + ++ L+ KY
Sbjct: 701 NVFQLEQAFVLNTRRYPTQPKGDTVDLAKRLFLKY 735
>gi|405964692|gb|EKC30145.1| Alpha-N-acetylglucosaminidase [Crassostrea gigas]
Length = 859
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 209/598 (34%), Positives = 330/598 (55%), Gaps = 37/598 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+HGWGGP+ Q+W+D QL+LQ KIL R+ GM PVLP F+G+VP A +P A ++
Sbjct: 217 MGNMHGWGGPITQNWIDDQLILQHKILERMRSFGMIPVLPGFAGHVPEATILRYPQANVS 276
Query: 61 QLGNWFSVKSDPRW--------------------CCTYLLDATDPLFIEIGRAFIEQQLK 100
+L +W W CC YLLD DPLF++I FI++
Sbjct: 277 RLTDWAGFNQSFCWHYPTANVSRLRDWGHFNKTYCCNYLLDFNDPLFMKIAVRFIKEMEN 336
Query: 101 EYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW 160
E+G H+Y+ DTF+E P +S EY++ G +Y ++ DS A+WLMQGWLF FW
Sbjct: 337 EFG-VDHVYSVDTFNEMRPRSNSTEYLALSGRTVYKSLKEADSKAIWLMQGWLFIDGGFW 395
Query: 161 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
+ PQ+KALL +VP G++++LDL++E+ PI++ ++ +YG P+IWCMLH+F G +E+YG L
Sbjct: 396 KQPQIKALLTAVPQGEMIILDLYSEIIPIYTQTESYYGQPFIWCMLHDFGGTMELYGALK 455
Query: 221 SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRR 280
I GP R N++MVG+GM+ EGI QN VVY+ +E ++ D+ WI++Y + R
Sbjct: 456 LINEGPFNGRAFPNSSMVGLGMTPEGIFQNEVVYEFFTENVWRKAPRDISTWISKYVLNR 515
Query: 281 YGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTE---GKYQN--- 334
YG++ I AW L ++VYN +D D + + I PD PS+ G Y N
Sbjct: 516 YGKTNKFIDLAWQYLKNSVYNNSDNLKDHDSNAI---PDHRPSLSPALHPDLGIYNNTDY 572
Query: 335 -YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR 393
+ ++ + +WY+ ++ A ++ + +E S S+ + YD++D+TR
Sbjct: 573 LHDNSINIIVTTLPRMTPLIQQDVWYNPEDLYVAWDIMTLNLDEFSNSSLFMYDIVDVTR 632
Query: 394 QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK 453
+L + + + +++ A+ D H V + L L+ DMD +L FLLG W+++A
Sbjct: 633 NSLQILSIKYYTDLVYAFGRGDIHAVESHGNQLLGLLSDMDTVLGSDSHFLLGRWIKAAT 692
Query: 454 QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 513
A + + ++NAR QIT+W + +RDY K WSGL++DYY PR I+ Y
Sbjct: 693 DNAMDMQDNWFLQFNARNQITLWGPRGE-----IRDYACKQWSGLIKDYYLPRWEIFVNY 747
Query: 514 MIESLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGT 570
++ + + + + K+ + + YP E GD++ + L+ KY T
Sbjct: 748 TLDIMAHNKTYNATELDIMIYEKVEFPFSYRLDQYPTEPQGDSVAIVKSLHKKYRPDT 805
>gi|156545487|ref|XP_001606979.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Nasonia vitripennis]
Length = 755
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 214/570 (37%), Positives = 321/570 (56%), Gaps = 43/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N GWGGPL Q+W + + LQ I+ R+ ELG+ PVLPAF+G+VP VFP A +T
Sbjct: 221 MGNFRGWGGPLSQAWHNHTIQLQHSIVRRMRELGITPVLPAFAGHVPRDFIRVFPEANVT 280
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + ++CC Y LD TDPLF +GR F++ E+G T+HIYNCD+F+EN P
Sbjct: 281 KVVSWNGFED--QYCCPYSLDPTDPLFKTVGREFLKAYTDEFG-TNHIYNCDSFNENDPH 337
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+Y+S+ G AIYSGM D DA+WLMQGWLF + FW P++KA + SVP+GK+++
Sbjct: 338 TGDLDYLSNTGKAIYSGMTGADPDAIWLMQGWLFVHSEYFWTFPRVKAFVTSVPIGKMII 397
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + ++G P+IWCMLHNF G + M+G I G EART+ +TM+G
Sbjct: 398 LDLQSEQFPQYKRFHSYFGQPFIWCMLHNFGGTLGMFGSAGVINKGVFEARTTNGSTMIG 457
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+ M+EM+++ + V + W Y+VRRYG++ +I+ +W L +
Sbjct: 458 TGLTPEGINQNYVIYEFMNEMSYRKKPVVLDNWFENYAVRRYGQADESIRTSWQELGREL 517
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN DG T G Y ++K L E P WY
Sbjct: 518 YN-YDGKTK-------------------IRGHYV-----ITKRPSLNIE------PWYWY 546
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ F+ +GN + +++DL+D+TRQAL A+ ++ +I AY + +
Sbjct: 547 DLKTFLAVWNSFVHAGNGTMKNELFKHDLVDITRQALQITADFIYADIKAAYTQKNLTQL 606
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN--EEQEKQYEWNARTQITMWF 477
S L+L +D++ LA FLLG WLE AK +A + YE+NAR QIT+W
Sbjct: 607 QIASSHLLDLFDDLEKNLASSKDFLLGSWLEDAKAIAPEGATRDRENYEFNARNQITLWG 666
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKL 536
+ + DY NK WSG++ DY+ PR IY K + ES+ +R + ++
Sbjct: 667 PRGE-----IVDYANKQWSGVVADYFKPRWEIYLKELQESIRKQTAVPTAKLKRMIFNQV 721
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + +YP + GD+++ ++ LY K+
Sbjct: 722 ELPFSYSKKLYPTQPKGDSILIAKELYAKW 751
>gi|431890602|gb|ELK01481.1| Alpha-N-acetylglucosaminidase [Pteropus alecto]
Length = 740
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 217/569 (38%), Positives = 327/569 (57%), Gaps = 41/569 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 201 MGNLHTWGGPLPFSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ +W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 261 QMDSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 317
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 318 SSEPSYLAAATAAVYQAMTTVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 377
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 378 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVG 437
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI+QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 438 TGMAPEGIDQNEVVYALMAELGWRKDPVTDLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 497
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC S + + N+ P+ + L+ T+ +W
Sbjct: 498 VYNC-----------------------SGEDCRGHNH-SPLVRRPSLQMVTT------VW 527
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y+ S+V A + + + L+ S + Y+L+D+TRQA+ + + + + AY D
Sbjct: 528 YNQSDVFEAWRMLLTATPTLATSPLFSYELVDITRQAIQELVSLYYEEVRTAYLNKDLVT 587
Query: 419 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+F+ + EL+ +D +LA FLLG WLE A+ A ++ + YE N+R Q+T+W
Sbjct: 588 LFRAAGILAYELLPSLDNILATDSHFLLGSWLEQARAAAVSKAEASFYEQNSRYQLTLW- 646
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
E ++L DY NK +GL+ +YY PR ++ + ++ESL G F+ + + +L
Sbjct: 647 ---GPEGNIL-DYANKQLAGLIANYYTPRWRLFMEMLVESLVQGIPFQQHQFDKNAFQLE 702
Query: 538 NDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ YP + GD + ++ L+ KY
Sbjct: 703 QTFVFSTQRYPNQPQGDTVDLAKKLFLKY 731
>gi|291406137|ref|XP_002719212.1| PREDICTED: alpha-N-acetylglucosaminidase [Oryctolagus cuniculus]
Length = 743
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 224/576 (38%), Positives = 330/576 (57%), Gaps = 53/576 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWAGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAVTRVFPHINVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DP+F IG F+ + +E+G T H+Y DTF+E PP
Sbjct: 264 QLGSWGHFNCS--YSCSFLLAPEDPMFPLIGSLFLRELTREFG-TDHVYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA++ M + D DAVWL+QGWLF + P FW P Q+KA+LN+VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVFEAMIAVDPDAVWLLQGWLFQHQPQFWGPSQVKAVLNAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAENQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ E V D++AW+ ++ RRYG + P AW +L +
Sbjct: 441 TGIAPEGISQNEVVYALMAELGWRKEPVPDLEAWVTSFAGRRYGVAHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ D NR +V P L+ T+ +
Sbjct: 501 VYNCSGDACRGHNRSPLVRRPS-------------------------LQLNTT------V 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
WY+ S+V A L + + L++S +RYDL+D+TRQA+ + + + AY
Sbjct: 530 WYNRSDVFEAWRLLLKATPTLASSPAFRYDLLDVTRQAVQELVSLYYEEARTAYLHKELA 589
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
L A GV EL+ +D +LA FLLG WLE A+ A +E + + YE N+R
Sbjct: 590 TLLRAGGVLA-----YELLPALDRVLATDSRFLLGSWLEQARAAAASEAEAQLYEQNSRF 644
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 531
Q+T+W E ++L DY NK +GL+ YY PR ++ + + +SL G F+ + + +
Sbjct: 645 QLTLW----GPEGNIL-DYANKQLAGLVAQYYSPRWQLFLEALADSLARGVPFQQRLFDK 699
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
+L + YP + GD + +Q ++ KY
Sbjct: 700 LVFRLEQAFVLSSRRYPTQPQGDTVDLAQKIFLKYF 735
>gi|351699889|gb|EHB02808.1| Alpha-N-acetylglucosaminidase, partial [Heterocephalus glaber]
Length = 652
Score = 390 bits (1002), Expect = e-105, Method: Compositional matrix adjust.
Identities = 215/559 (38%), Positives = 321/559 (57%), Gaps = 41/559 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP +W +QL LQ ++L R+ LGM PVLPAF+G+VP A+ VFP +T
Sbjct: 114 MGNLHGWGGPLPHAWHLKQLYLQHRVLDRMRALGMTPVLPAFAGHVPKAVTRVFPQVNVT 173
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF IG F+ + +E+G T H Y DTF+E PP
Sbjct: 174 QLGSWGHFNCS--YSCSFLLAPGDPLFPLIGSLFLRELNREFG-TDHFYGADTFNEMQPP 230
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 231 SSEPAYLAAATAAVYEAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVGAVLGAVPQGRLLV 290
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+T+VG
Sbjct: 291 LDLFAENQPVYTRTASFGGQPFIWCMLHNFGGNHGLFGALEAVNRGPAAARLFPNSTVVG 350
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW+ +++ +RYG + P AW +L H+
Sbjct: 351 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLSAWVARFAEQRYGVAQPDAVLAWRLLLHS 410
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ + L+ T+ +W
Sbjct: 411 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------VW 440
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y+ S+V A L + + L+AS +RYDL+D+TRQ L + + + AY + G
Sbjct: 441 YNRSDVFEAWRLLLKATPNLTASPAFRYDLLDVTRQGLQELVSLYYEEARAAYMRQELEG 500
Query: 419 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + +L+ +D +LA FLLG WLE A+ +A + + YE N+R Q+T+W
Sbjct: 501 LLRAGGVLAYKLLPALDEVLASDHRFLLGSWLEQARAVAVSSAEADLYEQNSRYQLTLW- 559
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
E ++L DY NK +GL+ DYY PR ++ + + SL G F+ + + + L
Sbjct: 560 ---GPEGNIL-DYANKQLAGLVADYYVPRWRLFVETLASSLARGVPFQQQQFNSDVFLLE 615
Query: 538 NDWQNGRNVYPVESNGDAL 556
+ R YP + GD +
Sbjct: 616 QAFVLSRKRYPSQPQGDTV 634
>gi|307168312|gb|EFN61518.1| Alpha-N-acetylglucosaminidase [Camponotus floridanus]
Length = 737
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 205/555 (36%), Positives = 311/555 (56%), Gaps = 41/555 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W ++ + LQ +IL R+ LG+ PVLPAF+G+VP A +FP+A +T
Sbjct: 221 MGNIRGFGGPLSTNWHNRTIHLQHQILRRMRNLGIVPVLPAFAGHVPRAFARLFPNANMT 280
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + + ++CC YLL+ TDPLF IG F+ + E+G T HIYNCDTF+EN P
Sbjct: 281 KINPWNNFED--KYCCPYLLEPTDPLFQIIGEKFLRMYINEFG-TDHIYNCDTFNENEPG 337
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
Y+ ++ A+++ + + DS A+WLMQ WLF +D FW P++K+ L SVP+G++++
Sbjct: 338 STELIYLRNVSHAVFAAINAVDSKAIWLMQAWLFVHDFMFWTEPRVKSFLTSVPMGRMLI 397
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + K +YG P+IWCMLHNF G + M+G I E R +TMVG
Sbjct: 398 LDLQSEQFPQYGRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRNMNGSTMVG 457
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++HE VD+ AW Y+ RRYG W L TV
Sbjct: 458 TGLTPEGINQNYVIYELMNEMAYRHEPVDLDAWFQNYATRRYGAWNEYAVTTWQYLGRTV 517
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN + V+ P ++ S+ +WY
Sbjct: 518 YNFIGSQRIRGHYVVTRRPSLNISL-------------------------------WIWY 546
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ F+ + + S YR+D++D+TRQAL ++L+ I+++Y+ +
Sbjct: 547 NRKNFYSMWNTFLKARHGRRNSTLYRHDVVDITRQALQLMGDDLYTIILDSYKKRNITAF 606
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ LEL +D++ +LA FLLG WL AK +A NEE+ K YE+NA+ QIT+W N
Sbjct: 607 RSSANALLELFDDLESILASGSNFLLGTWLSQAKDVATNEEERKSYEYNAKNQITLWGPN 666
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKLTN 538
+ +RDY NK WSG++ DY+ PR ++ K + +SL F + + + + K+
Sbjct: 667 GE-----IRDYANKQWSGVMADYFKPRWELFLKALEKSLVENTKFNVTEINNKIFDKVER 721
Query: 539 DWQNGRNVYPVESNG 553
+ YPVE G
Sbjct: 722 PFTFSTKFYPVEPKG 736
>gi|325103828|ref|YP_004273482.1| alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
gi|324972676|gb|ADY51660.1| Alpha-N-acetylglucosaminidase [Pedobacter saltans DSM 12145]
Length = 738
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 217/578 (37%), Positives = 321/578 (55%), Gaps = 55/578 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ WGGPLPQSW+D LQKKIL R ELGM PVLPAF+G+VP + FP AK+
Sbjct: 195 MNNMDAWGGPLPQSWIDSHKDLQKKILARQRELGMIPVLPAFTGHVPKSFVKKFPEAKVD 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
SV + Y+L+ DP+F +IG F+++Q +EYG T H Y+ D F+E PP
Sbjct: 255 ------SVNWQGNFPNIYMLNPNDPMFSKIGEQFLKEQTREYG-TDHYYSSDIFNELNPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY---DPFWRPPQMKALLNSVPLGKL 177
P+Y+ + +YS M+ D +VW+MQ WLF FW P +M+A L VP KL
Sbjct: 308 SSDPKYLYDISEKVYSSMKKVDPKSVWVMQAWLFVSAHGRKFWTPERMQAFLKPVPDDKL 367
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTT 236
++LDL+ E +P W ++ +YG ++W MLHNF GNI ++G +IA P +
Sbjct: 368 IILDLYTENRPRWKNTEGYYGKKWVWNMLHNFGGNIGLFGKAQTIASEPARVLSDPMKGN 427
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
G+G++MEGIEQNP +Y LM + + +E ++++ W N+Y RRYG AW +L
Sbjct: 428 YSGIGLTMEGIEQNPFIYQLMLDHVWNNEPIELEKWTNKYITRRYGVLDNNAVKAWEILL 487
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
+TVY D N+D SI+S G+P ++ S +
Sbjct: 488 NTVY------KDNNKD-----QGAPESILS---------GRPTF------AQNSYWTWTD 521
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
L+Y E +RA + I S ++L S+ ++YD++D+TRQA+A YA L + Y D
Sbjct: 522 LYYDNREFVRAWDYLIKSADKLRNSDGFQYDIVDITRQAMANYATALQRQLAYTYYAGDV 581
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + SRRFLEL+ D+D LLA FLLG W++ AK+ A N+ + K YE+NA+ ++MW
Sbjct: 582 NTYEKESRRFLELLSDLDRLLATRKDFLLGIWIDDAKKWATNDAERKLYEFNAKDLVSMW 641
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR-------LKDW 529
+ + DY + WSGL+ +YY R I+F ++ L++ + + +KDW
Sbjct: 642 ----GHKDITINDYSARQWSGLVENYYKQRWKIFFDQSLQKLKNNEIWDQAEFEKYIKDW 697
Query: 530 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
EW +W N R YP + GD + S+ +YNKY
Sbjct: 698 --EW-----NWVNRRETYPTNTKGDPVNVSKEMYNKYF 728
>gi|332018247|gb|EGI58852.1| Alpha-N-acetylglucosaminidase [Acromyrmex echinatior]
Length = 686
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 207/571 (36%), Positives = 323/571 (56%), Gaps = 41/571 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPL +W + + LQ +IL R+ +LG+ PVLPAF+G+VP A +FP+A +T
Sbjct: 152 MGNIRGFGGPLSSNWHNYTIRLQHQILQRMRDLGIVPVLPAFAGHVPRAFARLFPNANMT 211
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W K + ++CC YLL+ TDPLF IG F++ + E+G T HIYNCDTF+EN P
Sbjct: 212 KINPW--NKFEDKYCCPYLLEPTDPLFRTIGEKFLQMYIDEFG-TDHIYNCDTFNENEPG 268
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
Y+ ++G +I+S M + DS A+WLMQ WLF +D FW +++A L SVP+G+++V
Sbjct: 269 NTELIYLRNVGHSIFSAMNAVDSKAIWLMQAWLFVHDIMFWTKSRVRAFLTSVPIGRMLV 328
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + K +YG P+IWCMLHNF G + M+G I E R ++TMVG
Sbjct: 329 LDLQSEQFPQYDRLKSYYGQPFIWCMLHNFGGTLGMFGSAQIINQRTFEGRNMNDSTMVG 388
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++H V++ W Y+ RRYG AW L TV
Sbjct: 389 TGLTPEGINQNYVIYELMNEMAYRHVPVNLDNWFESYATRRYGAWNEYAVAAWQHLGRTV 448
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN + VI P ++ S+ + WY
Sbjct: 449 YNFIGTQKIRGHYVITRRPSLNISLWT-------------------------------WY 477
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +F+ + + YR+D++D+TRQAL A+++++ I++ Y+ +
Sbjct: 478 DRKDFYAMWNMFLKARYGRGNNTLYRHDVVDITRQALQLIADDIYMTILDCYKKKNITAF 537
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ LEL +D++ +LA + FLLG WL AK +A NEE+ + YE+NAR QIT+W N
Sbjct: 538 QSSANALLELFDDLESILASGNNFLLGTWLAQAKDIAVNEEERRSYEYNARNQITLWGPN 597
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIKLTN 538
+ +RDY NK WSG++ DY+ R ++ K + +SL + + R + ++
Sbjct: 598 GE-----IRDYANKQWSGVVADYFKLRWELFLKALEKSLIQRIEPNITEINDRIFHEVER 652
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKYLQG 569
+ +YP+E+ GD + + + +K+ +G
Sbjct: 653 SFTFSTKLYPIETKGDTIDIAMKIISKWYKG 683
>gi|403304646|ref|XP_003942904.1| PREDICTED: alpha-N-acetylglucosaminidase [Saimiri boliviensis
boliviensis]
Length = 754
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 216/569 (37%), Positives = 328/569 (57%), Gaps = 41/569 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP++W +QL LQ +IL R+ GM PVLPAFSG+VP A+ VFP +T
Sbjct: 214 MGNLHTWDGPLPRAWHIKQLYLQHRILDRMRSFGMIPVLPAFSGHVPRAINRVFPRVNVT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DP+F +G F+ + KE+G T HIY DTF+E PP
Sbjct: 274 QMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRELTKEFG-TDHIYGADTFNEMQPP 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D+DAVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 331 SSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLV 390
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 391 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNRGPEAARLFPNSTMVG 450
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E++++ + V D+ AW+ ++ +RYG S P AW +L +
Sbjct: 451 TGMAPEGINQNEVVYSLMAELSWRKDPVPDLAAWVTSFATQRYGVSHPDAGAAWRLLLRS 510
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ + L+ T+ +W
Sbjct: 511 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------VW 540
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y+ S+V A L +++ L+AS T+RYDL+D+TRQA+ + + AY + H
Sbjct: 541 YNRSDVFEAWRLLLSAAATLAASPTFRYDLLDVTRQAVQELVGLYYEEARSAYLSKELHS 600
Query: 419 VFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + EL+ +D +LA FLLG WLE A+ +A +E + YE ++R Q+T+W
Sbjct: 601 LLRAGGILAYELLPALDEVLASDSHFLLGSWLEQARAVAVSEAEADFYEQSSRYQLTLW- 659
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
E ++L DY NK +GL+ YY PR ++ + + S+ G F + + +L
Sbjct: 660 ---GPEGNIL-DYANKQLAGLVASYYTPRWRLFLEVLAASVAQGIPFPQHQFDKNVFQLE 715
Query: 538 NDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + YP + GD + ++ ++ KY
Sbjct: 716 QAFVLSKQRYPSQPRGDTVDLAKKIFLKY 744
>gi|196001339|ref|XP_002110537.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
gi|190586488|gb|EDV26541.1| hypothetical protein TRIADDRAFT_54660 [Trichoplax adhaerens]
Length = 757
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 209/567 (36%), Positives = 315/567 (55%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ W GPLP W+++Q+ LQ KIL R+ + GM P+LPAF+GN+P AL ++P AKI
Sbjct: 217 MGNIQRWAGPLPHDWINKQITLQVKILDRMRKYGMLPILPAFNGNIPNALTKIYPKAKIV 276
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ WF R+ T LLD D LFI I + FIE+++K YG T H+Y+ D F+E P
Sbjct: 277 KSSPWFGFSK--RYGETALLDPRDKLFIVISKLFIEEEIKAYG-TDHLYSLDLFNEIDPQ 333
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
EY++++ + Y + S D+ AVW+MQGW+F D + W +++A L+ +P G++V+
Sbjct: 334 SKELEYLTAVSKSAYLALNSADTKAVWIMQGWMFYNDNYYWENKRIQAFLSPIPKGRIVI 393
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAEV+P + S F+G P+IWCML+NF GN MYG ++I G + A +N+TM+G
Sbjct: 394 LDLFAEVEPQYHRSNSFFGHPFIWCMLNNFGGNAGMYGTFETITEGAISAYDMKNSTMIG 453
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
GM+ EGI N ++YDLM+EM ++ VDV+ W+ Y+ RRYG I AW L TV
Sbjct: 454 TGMAPEGIGNNYIMYDLMAEMGWRKIAVDVRDWVVVYTERRYGGLDENIIKAWLRLSETV 513
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNC D R A P V PS+ + +WY
Sbjct: 514 YNCNDMRQYHCR----ALPAVRPSLKIAND---------------------------VWY 542
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S ++ A E + + NE + T++YD++D+TRQAL + A ++ + + Y N+ +
Sbjct: 543 SADDIFFAWEHMLRANNEFISEETFQYDIVDVTRQALQELAFIMYKKVTQCYHDNNQETL 602
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+EL DMD LL + FLLG W+ A Q + N ++Q +NA QIT+W
Sbjct: 603 KTAGGELIELFTDMDTLLGTNSHFLLGRWVADALQHSNNISIKQQLRFNALNQITLW--- 659
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
S+L DY NK W+GL+ +Y R ++ K + +S+ + F + + K
Sbjct: 660 -GPSKSILHDYANKMWNGLVDKFYKKRWLMFIKALSDSISNNILFDQQKFNLAVQKFEAA 718
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W + N Y S+G ++ S+ L++KY
Sbjct: 719 WASENNTYATTSSGSSVTVSKQLFSKY 745
>gi|320162905|gb|EFW39804.1| lysosomal alpha-N-acetyl glucosaminidase [Capsaspora owczarzaki
ATCC 30864]
Length = 786
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 206/571 (36%), Positives = 319/571 (55%), Gaps = 38/571 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+ W+ +Q LQ IL R+ GM PVLP+F+G+VP+AL FP+A IT
Sbjct: 238 MGNIKGWGGPISLEWIYKQRNLQVLILQRMRTFGMTPVLPSFAGHVPSALAQHFPNANIT 297
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q +W + ++CC LDA+DPLF +IG F+ Q + YG T+H+YNCD F+E TP
Sbjct: 298 QSSDWNNFPD--QYCCVGFLDASDPLFTQIGAEFLRLQNETYG-TNHLYNCDQFNEMTPA 354
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
Y+ G A+Y M + D AVW+MQGWLF + +W +++ALL+ VP +++
Sbjct: 355 STDLGYLKQAGMAVYQSMTAYDPAAVWVMQGWLFFNEAAWWSNDRVQALLSGVPDDHMII 414
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF++V P+W+ + +YG P+IW MLH+F GNI +YGIL SI GP A + TMVG
Sbjct: 415 LDLFSDVTPVWNRLESYYGKPFIWNMLHDFGGNIGLYGILPSINEGPFAALATPGNTMVG 474
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD-AWNVLYHT 298
+G++ EGI QN ++Y+ M E ++ V++ W++ + RRYG S PA+ A+ L +
Sbjct: 475 IGLTPEGINQNYILYEFMMENMWRSAPVNLPTWVDAFVGRRYGPSTPAVAKLAYQQLLQS 534
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNCT+G + ++ P V+ +S+ + + +L+
Sbjct: 535 VYNCTNGQYSVTKSLLEIRPAVN-----------------MSRNGFMPT--------NLY 569
Query: 359 YSTSEVIRALELFIA---SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
Y VI A++ +A S +L++ +RYD++D TRQ L+ A + N+ A
Sbjct: 570 YDPGHVILAVDHILAAAKSAPQLASVVPFRYDVVDFTRQMLSNLAIDFHSNLTLALTSKQ 629
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
AH V + + L+ D+D LL FLLGPWL +A+ ++N + E+NAR QIT+
Sbjct: 630 AHLVHLYGQGIVGLIADLDELLVSDAHFLLGPWLAAARSWSENTAAQDLLEFNARNQITL 689
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W N + + DY +K W+GL+ YY PR ++ + + ES F + ++
Sbjct: 690 WGPNGE-----ITDYASKQWAGLMSSYYRPRWELFVSFASAAAESDLPFNDAAFNAAVLE 744
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ WQ+ + + V GD++ + L KY
Sbjct: 745 VEKAWQHSHHNFTVTPLGDSIAIATRLRAKY 775
>gi|374385255|ref|ZP_09642763.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
12061]
gi|373226460|gb|EHP48786.1| hypothetical protein HMPREF9449_01149 [Odoribacter laneus YIT
12061]
Length = 736
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 208/576 (36%), Positives = 316/576 (54%), Gaps = 57/576 (9%)
Query: 2 SNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ GW GPLP+SW++ LQKKIL R ELGM P+LPAF+G+VP + FP A++ Q
Sbjct: 199 GNIDGWCGPLPKSWMESHEELQKKILARERELGMTPILPAFTGHVPPTFKEHFPEARLRQ 258
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
+ NW + R+ TYLL+A DPLF IG F+E+Q++ +G T H+Y DTF+E PP
Sbjct: 259 V-NW-----EGRFDDTYLLEADDPLFQTIGNRFMEEQIRTFG-TDHLYGADTFNEMFPPS 311
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
+ Y+ + A+Y M + D +AVW+MQGWLF FW+P QMKA L +VP L+VL
Sbjct: 312 EDSTYLDGISKAVYQSMAAVDPEAVWVMQGWLFHDKRDFWKPAQMKAYLGAVPDEHLIVL 371
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTMVG 239
DL+ E PIW ++ FYG P+IWCMLHNF G ++G +A P ++G
Sbjct: 372 DLWGEEFPIWDRTEAFYGKPWIWCMLHNFGGRNMLFGNALKLAEEPSRVLADPAKGQLLG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G EGIEQNPV+Y L+ +++ V++ W Y RYG A++ AW++L TV
Sbjct: 432 LGAVPEGIEQNPVIYSLLFSHIWRNTAVELDEWFETYLESRYGCRDEAVEKAWDILRKTV 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y EG NY ++ + +++ + + Y
Sbjct: 492 Y--------------------------ANEG---NYESAITARPTFEKH-NNWAYTDIPY 521
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ EVI+A + + + + L + YRYDLI + +Q LA YA + E Y+ D
Sbjct: 522 NPVEVIKAWKYLLQAADRLGENPCYRYDLILVGKQVLANYATIIQQKFGEDYRTKDLPAF 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ SR F+EL++DMD L+ H+ FLLG WLE A+ + +++ YE NAR QIT+W
Sbjct: 582 TRNSREFMELIDDMDELMGTHEAFLLGKWLEDARSWGKTASEKQLYEKNARDQITLW--- 638
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRRE 532
+ ++L DY +K WSGL + +Y R ++ + + +++G + R++ W E
Sbjct: 639 -GGKDAVLHDYASKQWSGLFKGFYKGRWQLFIDEVYDCIKTGRKYDHTASDDRVRSWEWE 697
Query: 533 WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
W+ NG+ YP GD ++ S+ ++ KY++
Sbjct: 698 WV-------NGQEKYPAVPQGDPVVVSERMFGKYIK 726
>gi|390463730|ref|XP_003733088.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Callithrix jacchus]
Length = 830
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 215/570 (37%), Positives = 315/570 (55%), Gaps = 50/570 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W PLP +W +QL LQ IL R+ GM PVLP F G+VP A+ VFP +T
Sbjct: 297 MGNLHTWDAPLPHAWHIKQLYLQHWILDRMRSFGMVPVLPMFLGHVPKAITRVFPRVSVT 356
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DP+F +G F+ + KE+G T HIY DTF+E PP
Sbjct: 357 QMGSWGHFNCS--YSCSFLLAPEDPIFPILGSLFLRELTKEFG-TDHIYGADTFNEMQPP 413
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D+DAVWL+QGWLF Y P FW P Q++A+L S P G L+V
Sbjct: 414 SSEPSYLAAATAAVYEAMIAVDTDAVWLLQGWLFQYQPQFWGPAQVRAVLGSAPHGCLLV 473
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 474 LDLFAESQPVYIRTASFQGQPFIWCMLHNFGGNHGLFGALEAMNRGPEAARLFPNSTMVG 533
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+++ + V D+ AW YG S P AW +L +
Sbjct: 534 TGMAPEGISQNXVVYSLMAELSWXKDPVPDLVAWX-------YGVSHPDTGAAWRLLLRS 586
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ + L+ T+ +W
Sbjct: 587 VYNCSGEACRGHNH------------------------SPLVRRPSLQMNTT------IW 616
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y+ S+V A L ++ L+AS T+RYDL+D+TRQ + + + + AY L+ G
Sbjct: 617 YNQSDVFEAWRLLFSAAATLAASPTFRYDLLDVTRQVVQELVSLYYEEARSAY-LSKELG 675
Query: 419 VFQLSRRFL--ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ L EL+ +D +LA FLLG WLE A+ +A +E + YE N+R Q+T+W
Sbjct: 676 SLLRAGGILAYELLPALDEVLASDSHFLLGSWLEQARAVAVSEAEADFYEQNSRYQLTLW 735
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E ++L DY NK +GL+ YY PR ++ + + S+ G F+ + + +L
Sbjct: 736 ----GPEGNIL-DYANKQLAGLVAHYYAPRRRLFLEALAASVAQGIPFQQHQFDKNVFQL 790
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + YP + GD + ++ ++ KY
Sbjct: 791 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 820
>gi|194216885|ref|XP_001917396.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Equus caballus]
Length = 744
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 220/575 (38%), Positives = 325/575 (56%), Gaps = 53/575 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPL +SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 205 MGNLHTWDGPLTRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKAITRVFPQVNVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 265 QLGSWGHFNCS--YSCSFLLAPEDPLFPVVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPAYLAAATAAVYQAMTAVDPDAVWLLQGWLFHHQRTFWGPAQVGAVLGAVPRGRLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 382 LDLFAESQPMYIRTASFQGQPFIWCMLHNFGGNQGLFGALEAVNRGPAAARLFPNSTMVG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D++AW+ ++ RRYG S + AW +L +
Sbjct: 442 TGMTPEGIGQNEVVYALMAELGWRKDPVADLEAWVTSFAARRYGVSHKDAETAWKLLLRS 501
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC+ A + P+ K L+ T+ +W
Sbjct: 502 VYNCSAEAYSGHNQ------------------------SPLVKRPSLQMGTT------VW 531
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY-------ANELFLNIIEAY 411
Y+ S+V A L + + L++S + YDL+D+TRQA + A +LN E
Sbjct: 532 YNRSDVFEAWWLLLTAAPALASSPAFLYDLVDVTRQAAQELISLYYEEARTAYLNK-ELV 590
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
L A G+ EL+ +D +LA FLLG WL+ A+++A +E + YE N+R
Sbjct: 591 PLLRAGGILAY-----ELLPALDKVLASDSRFLLGSWLKQAREMAVSEAEAHFYEQNSRY 645
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 531
Q+T+W E ++L DY NK +GL+ DYY PR ++ + +++SL G F+ + + +
Sbjct: 646 QLTLW----GPEGNIL-DYANKQLAGLVADYYTPRWQLFVEMLVQSLAQGVPFQQQQFDK 700
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+L + YP + GD + ++ + KY
Sbjct: 701 NAFELEEAFVLSTRRYPSQPQGDTVDLAKKFFLKY 735
>gi|332260899|ref|XP_003279518.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylglucosaminidase
[Nomascus leucogenys]
Length = 736
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 203/551 (36%), Positives = 312/551 (56%), Gaps = 43/551 (7%)
Query: 20 LVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYL 79
L LQ ++L R+ +PVLPAF+G+VP A+ VFP +T++G+W + C++L
Sbjct: 216 LFLQHRVLDRMRSSAXDPVLPAFAGHVPEAVTRVFPRVNVTKMGSWGHFNCS--YSCSFL 273
Query: 80 LDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQ 139
L DP+F IG F+ + +KE+G T HIY DTF+E PP P Y+++ A+Y M
Sbjct: 274 LAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMI 332
Query: 140 SGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYG 198
+ D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L+VLDLFAE +P+++ + F G
Sbjct: 333 AVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRLLVLDLFAESQPVYTRTASFQG 392
Query: 199 VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMS 258
P+IWCMLHNF GN ++G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+
Sbjct: 393 QPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMA 452
Query: 259 EMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVA 316
E+ ++ + V D+ AW+ ++ +RYG S P AW +L +VYNC+ + NR +V
Sbjct: 453 ELGWRKDPVPDLAAWVTSFAAQRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVR 512
Query: 317 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 376
P L+ TS +WY+ S+V A L + S
Sbjct: 513 RPS-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAP 541
Query: 377 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDG 435
L+AS +RYDL+DLTRQA+ + + + AY + + + EL+ +D
Sbjct: 542 SLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLRKELASLLRAGGVLAYELLPALDE 601
Query: 436 LLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYW 495
+LA FLLG WLE A+ A +E + YE N+R Q+T+W E ++L DY NK
Sbjct: 602 VLASDSRFLLGSWLELARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQL 656
Query: 496 SGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDA 555
+GL+ +YY PR ++ + + +S+ G F+ + + +L + + YP + GD
Sbjct: 657 AGLVANYYTPRWRLFLEALADSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDT 716
Query: 556 LITSQWLYNKY 566
+ ++ ++ KY
Sbjct: 717 VDLAKKIFLKY 727
>gi|255533286|ref|YP_003093658.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346270|gb|ACU05596.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 734
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 200/575 (34%), Positives = 313/575 (54%), Gaps = 55/575 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGP+ ++++ +Q LQKKIL R LGM P+LP+F+G+VP + ++ FP K+
Sbjct: 190 MGNLDAWGGPMSKNFMAKQEALQKKILARERALGMTPILPSFTGHVPPSFKDKFPDIKVN 249
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ + P Y+L+ P+F EIGR F+ + +G T H+Y+ DTF+E TP
Sbjct: 250 TQQ--WGINVSP----AYVLNPETPMFKEIGRKFLTALINTFG-TDHLYSADTFNEMTPV 302
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ Y++ + IY M + D+ AVW+MQGW+F P FW+P QMKAL ++VP KL+V
Sbjct: 303 SNDSTYLNGMAKKIYESMAAVDTQAVWIMQGWMFLDRPNFWQPTQMKALFSAVPQDKLIV 362
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
LDL +E+ P+WS + FYG +IWCMLHNF G + M+G + I P A + + M
Sbjct: 363 LDLNSELNPVWSRTDAFYGEKWIWCMLHNFGGRLSMFGDMSRIGNDPAAALKNDQRGKMS 422
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++MEGIEQNP +Y LM E + + +D+ W+ Y+ RRYG+ + AW VL +T
Sbjct: 423 GIGLTMEGIEQNPAIYSLMLEHIWNDKPIDLDNWLKGYAQRRYGKRNSNAEKAWEVLKNT 482
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VY+ +I P D + + + + +
Sbjct: 483 VYSHQPWWG--TNTIITGRPTFDAATV--------------------------WTYTAIP 514
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
YS+ E+++A + + +EL +S+ ++YDL+D+TRQ LA YAN L + +Y+ D
Sbjct: 515 YSSKELMKAWSYLLTASDELKSSDGFQYDLVDVTRQVLANYANVLQQDFASSYKQKDMAT 574
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+ S +FLEL++D+D LL FLLG W+ +AK L N ++K +E NAR IT+W D
Sbjct: 575 FNKKSAQFLELIDDIDQLLGTRSDFLLGKWINNAKALGDNPAEKKLFERNARDLITLWLD 634
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRR 531
+ + +Y K W+G+++ +Y PR +F + +G +KDW
Sbjct: 635 ----KDCNIHEYACKEWAGMMKGFYKPRWQQFFDEVRLQASAGKEIDQIKFENTIKDWEW 690
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+W+ N Y + G+ + ++ LY KY
Sbjct: 691 KWV-------NANEAYTDKPTGNPVTVAKALYAKY 718
>gi|321472423|gb|EFX83393.1| hypothetical protein DAPPUDRAFT_301977 [Daphnia pulex]
Length = 799
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 197/531 (37%), Positives = 309/531 (58%), Gaps = 43/531 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N WGGPL +W L+LQ KIL R+ GM PVLPAF+G+VP A++ V+P+A T
Sbjct: 212 MGNFRAWGGPLSDNWQQATLILQHKILERMRSFGMTPVLPAFAGHVPRAMERVYPNASYT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L +W + + ++CC L T+PLF EIG FI++ E+G + H+YNCD F+E P
Sbjct: 272 HLTSWLNFQD--QYCCPLFLQPTEPLFTEIGSRFIKEMALEFG-SDHVYNCDVFNEVRPT 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P ++SS+G A+++ M + D DA+WLMQGWLF D +W KALL SVP G++++
Sbjct: 329 QADPVFVSSVGTAVFNAMTTADPDAIWLMQGWLFKSDADYWTADLSKALLTSVPQGRMLI 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL AE+ P + FYG P+++C+LHNF G + + G + I+ ++AR N+TMVG
Sbjct: 389 LDLQAELDPQYIRLNSFYGQPFVFCLLHNFGGTLGLNGAIQIISQRVIDARNFPNSTMVG 448
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++MEGI+QN VVYD M EM ++ + ++ W ++Y+VRRYG + A+ AW L ++V
Sbjct: 449 TGLTMEGIDQNYVVYDKMLEMGWRDKVPNLNQWFDEYTVRRYGVNNTAVMSAWRFLQNSV 508
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN + + + + V+V P + + P +WY
Sbjct: 509 YNDSSRRSFRGQYVLVTRPAL-------------------------------WQLPFVWY 537
Query: 360 STSEVIRALELFIA---SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
+ +VI A + I+ + LS ++ +R+D++DLTRQ++ + + L+ ++E Y ++
Sbjct: 538 NPHDVILAWDHLISGLMTEPLLSNASNFRHDMVDLTRQSMQEIFHLLYSQLLEVYLEKNS 597
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ ++ + ++L++D+D LL FLLG W+ AK E ++ QYEWNAR QIT+W
Sbjct: 598 TAIEGIAYKMIDLLQDLDELLQTGKKFLLGKWIADAKSWGTTEGEKLQYEWNARNQITLW 657
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 527
+ +RDY K W+G++ DYY PR ++ + M SL+ F K
Sbjct: 658 GPRGE-----IRDYAAKQWAGVVADYYKPRWEVFIREMQMSLDENRAFNKK 703
>gi|301106961|ref|XP_002902563.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
gi|262098437|gb|EEY56489.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
Length = 684
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 207/577 (35%), Positives = 320/577 (55%), Gaps = 41/577 (7%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ KIL R+ E GM P LPAF+G+VP ++ +FP+AK
Sbjct: 121 MGNLRGSWVKGPLPQAFIDSQYALQLKILNRMREFGMIPALPAFAGHVPEEMKALFPNAK 180
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T+ NW + +CC Y+LD +DPL+ +IG+ F+E+Q Y TS +Y CDT++E
Sbjct: 181 FTRSPNWGDFSDE--FCCVYMLDFSDPLYYDIGKTFLEEQRALYDYTSSLYQCDTYNEMD 238
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P P + + A+ M + D++AVWL+QGWLF P +W ++KA L+ V K+
Sbjct: 239 PDFTDPAKLQAASRAVIDSMTAADANAVWLIQGWLFENSPDYWTKNRVKAYLDGVSNEKM 298
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL++EV+P+WS ++G +++C+LHNF GN M G L ++ PV+A N TM
Sbjct: 299 IILDLYSEVRPVWSKMDNYFGKSWVYCVLHNFGGNTGMRGDLATLGTAPVQASRDSNGTM 358
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
+GVG++MEGI QN VVYDL +MA+ +D+ W+ ++ +RY + AW L
Sbjct: 359 IGVGLTMEGIYQNYVVYDLTLQMAWVDTPLDMDEWVPSFAAQRYHSQDVHTERAWGFLLQ 418
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VYN T G + +I P + K V + S T
Sbjct: 419 SVYNRTLGFGGVTKSLICLIP----------------HWKLVRDGFMPTSIT-------- 454
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ--LND 415
Y ++ RA + + +G+EL A +TYR+DL+D+TRQ L+ + +L++ E Y+
Sbjct: 455 -YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFLSDHFMAQYLHLKEMYEGKTQP 513
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ------EKQYEWNA 469
AH + + R L +E MD +LA ++ LLG W+ A+ LA+ E + YE+ A
Sbjct: 514 AHQLCAWTERMLLTIERMDEILATNEDSLLGNWIADARALAEESESIESSNLQDYYEYEA 573
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
R Q+T W DN E + DY K W+GL++ YY PR ++ + ++ G +
Sbjct: 574 RNQVTRWGDNNSET---IHDYAGKEWAGLVKGYYLPRWRMWLGEVCQAYTQGRTINKEVV 630
Query: 530 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
++ I WQ YP + GDAL+ SQ +Y+++
Sbjct: 631 KKARIAFELKWQLSHEHYPTTTVGDALVVSQRIYDEF 667
>gi|255533285|ref|YP_003093657.1| alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
gi|255346269|gb|ACU05595.1| Alpha-N-acetylglucosaminidase [Pedobacter heparinus DSM 2366]
Length = 749
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 211/577 (36%), Positives = 307/577 (53%), Gaps = 60/577 (10%)
Query: 2 SNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G GPLP+SW++ LQKKIL R ELGM P+LPAFSG+VP + FP+A++ +
Sbjct: 207 GNIDGLNGPLPKSWMESHEQLQKKILARERELGMKPILPAFSGHVPPTFKARFPNARVDR 266
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
L NW + R+ TY+L DPLF +I F+ +Q K +G T H+Y DTF+E P
Sbjct: 267 L-NW-----EGRFADTYVLHPDDPLFQQIADKFMAEQDKAFGNTDHLYGADTFNEMYLPY 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
Y+ +G A+Y GM D +A+W+MQGW+F FW+P +K L+ VP L++L
Sbjct: 321 TDTAYVRKIGTAVYKGMAKADPEAIWVMQGWMFWDKRDFWKPEVVKNYLSGVPDDNLIML 380
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT-TMVG 239
DLFA+ +PIW+ ++ F+G +IWCMLHNF G +YG L+ I P E N + G
Sbjct: 381 DLFADEQPIWTKTEAFWGKKWIWCMLHNFGGRNPLYGDLNYIGREPAEMVHDPNRGRLSG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+ EGIEQNPVVY LM E + + +DVK+W+ Y+ RRYG+ P + AW +L+ TV
Sbjct: 441 IGLVPEGIEQNPVVYSLMLEHVWNDQVIDVKSWLVNYAQRRYGQRDPQTEKAWQILHQTV 500
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY--GKPVSKEAVLKSETSSYDHPHL 357
Y EG Y+ +P ++ ++ + D P
Sbjct: 501 Y--------------------------AKEGSYETIISARPTHEK---HADWTGTDLP-- 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y +++ A + + N ++ Y++DL+ + RQ LA YA L ++ +
Sbjct: 530 -YDGDKLVPAWTYLLNASNRFKNNDCYQFDLVTVGRQVLANYATVLQRLFARDFRNKNLT 588
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ FL L+ DMD L+ FLLG WL AK+ A NE + + YE NAR IT+W
Sbjct: 589 AYRAHTAEFLTLIADMDKLMGTRKDFLLGKWLNDAKKWATNESESRLYEKNARDLITLW- 647
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWR 530
++AS L +Y NK W+GL +YG R + +LE G F R+KDW
Sbjct: 648 --GGKDAS-LHEYANKQWAGLFNGFYGKRWQTFIAETSTALEQGKSFDQEAFETRMKDW- 703
Query: 531 REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
EW +W NGR Y + G+ + S L+ KY+
Sbjct: 704 -EW-----NWVNGREQYTDKPQGNPVTVSIQLHKKYI 734
>gi|449675146|ref|XP_002156234.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Hydra
magnipapillata]
Length = 646
Score = 365 bits (936), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 195/567 (34%), Positives = 304/567 (53%), Gaps = 36/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL-QNVFPSAKI 59
M NL GWGGPL SW +QL LQ+ I+ R+ GM PVLP F G++P AL +FP++K
Sbjct: 89 MGNLEGWGGPLSSSWYSKQLQLQQNIISRMRSFGMIPVLPGFGGHIPKALVSRLFPTSKY 148
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
+L W K ++ T+LLD DPLF ++G AF+E Q + Y T H+YN D F+E P
Sbjct: 149 YKLKPW--NKFTGKYGGTFLLDPQDPLFKKVGAAFVEMQKQLYNGTDHVYNADIFNEMDP 206
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
P + +I++ +Y+ M + DSDAVWLMQGW+F W+P ++A L ++P GKL++
Sbjct: 207 PQLTSAFITNTSIGVYNAMLASDSDAVWLMQGWMF-LSSVWKPELVEAWLQAIPYGKLII 265
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +++ P++ + FYG P+IWCM+ NF G +YG L + G + AR + + M+G
Sbjct: 266 LDLASDIYPLYDQTNAFYGHPFIWCMIENFGGTTRLYGQLTGVMKGVISARKTYKSFMIG 325
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
GM+ EGI QN + ++LM+EM +++E+ ++ W Y RRYG + DAW +L T+
Sbjct: 326 TGMTPEGINQNDINFELMNEMGWRNEEFNISDWTLSYIKRRYGDYPKMVSDAWLILIDTI 385
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YNC DG + D + P + P + N PV H+WY
Sbjct: 386 YNCNDGRENGGYDGRI--PVMRPQL---------NAKLPV----------------HMWY 418
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S ++ A +L + + + +T+R DL+ L Q L + + ++ Y V
Sbjct: 419 SIKDLYNAWKLMVKGSDYMPLIDTFRNDLVRLGTQVLEDLSIVFYTQMVSGYFNKSTLNV 478
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ + L+ DMD LLA LLG W++SA+ + + K E+NA+ QIT+W N
Sbjct: 479 EKYGSKITVLLTDMDRLLATDQYSLLGRWIQSARSMGDTLNETKLLEYNAKNQITLWGPN 538
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
+ +RDY NK W+GL+ +Y R ++ ++ +SL+ G + + + ++
Sbjct: 539 GE-----IRDYANKNWAGLVGSFYFERWNMFINFLSDSLKRGVPYDDSAFVSKLLQFEKK 593
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W N + + GDA S L Y
Sbjct: 594 WNNEIKEFSADPTGDAFGISHQLLRAY 620
>gi|383856382|ref|XP_003703688.1| PREDICTED: alpha-N-acetylglucosaminidase [Megachile rotundata]
Length = 744
Score = 361 bits (926), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 198/521 (38%), Positives = 304/521 (58%), Gaps = 44/521 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPL SW +Q + LQ KIL R+ LG+ PVLP+F+G+VP A +FP+A +T
Sbjct: 194 MGNIRAFGGPLYPSWHEQSINLQHKILERMRSLGIIPVLPSFAGHVPRAFPRLFPNANVT 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W + +CC YLL TDPLF +IG+ F++ ++E+G T HIYNCDTF+EN P
Sbjct: 254 KLAPWNNFPD--VYCCLYLLAPTDPLFQQIGQLFLKTYIEEFG-TDHIYNCDTFNENEPH 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++G + + M + D DA+WLMQGWLF++D FW P+++A L SVP G+++V
Sbjct: 311 TSELKFLRNVGHSTFQAMNAVDPDAIWLMQGWLFTHDKLFWTEPRVEAFLTSVPRGRMIV 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + K ++G P+IWCMLHNF G + M+G I E R +N+TMVG
Sbjct: 371 LDLQSEQFPQYGRLKSYFGQPFIWCMLHNFGGTLGMFGSAQIINQRVFEGRNMKNSTMVG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN V+Y+LM+EMA++ E V++ W Y+ RRYG AW L TV
Sbjct: 431 TGLTPEGINQNYVIYELMNEMAYRKEPVNLNKWFENYASRRYGVWNEYAVSAWQSLGRTV 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN + + + VI P ++ S + WY
Sbjct: 491 YNFSGTRKIRGKYVISRRPSLNLSTWT-------------------------------WY 519
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +F+ + + S YR+D++DLTRQ L A E++ +I+++ +
Sbjct: 520 DRDTLYNTWSVFLQARHGRRNSTLYRHDVVDLTRQVLQAKAEEIYPVLIDSFNKKNLTAF 579
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
S + L+L +D++ +LA FLLG WL++AK+LA N+E+ + Y+ NA+ QI++W
Sbjct: 580 KYHSDKLLDLFDDLELILASGKDFLLGKWLDAAKKLASNDEELRLYQVNAKYQISLWGPR 639
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
+ +RDY NK W+G++ DY+ PR +I+ +ESLE+
Sbjct: 640 GE-----IRDYANKQWAGVVADYFKPRWSIF----LESLEN 671
>gi|428176410|gb|EKX45295.1| hypothetical protein GUITHDRAFT_51145, partial [Guillardia theta
CCMP2712]
Length = 680
Score = 358 bits (920), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 201/584 (34%), Positives = 328/584 (56%), Gaps = 41/584 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGG L QSW+DQQ LQ KIL R ELGM PVLPAF+G VP ++++FP AK T
Sbjct: 121 MINIKAWGGGLTQSWIDQQRDLQLKILARERELGMLPVLPAFAGGVPEGMKSLFPEAKFT 180
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ GNW CC ++D TDPLF++IG+ F+E+ YG ++HIY+CDTF+EN P
Sbjct: 181 RHGNWGGFAEQH--CCVMMVDPTDPLFLKIGKMFVEEVRAVYG-SNHIYSCDTFNENRPR 237
Query: 121 VDSP----EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG 175
+ +++S A++ M++ D DAVWLMQGWLF D FW+ ++ A L+ VP
Sbjct: 238 SEHGSVGLDFLSHSSRAVFESMRAADPDAVWLMQGWLFMNDARFWQKRELDAYLSGVPED 297
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVP-----YIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
++++LDLF +V P+W P ++W MLH+F GN MYG L I+ PV A+
Sbjct: 298 RMIILDLFTDVFPVWKRRDLQRPTPIEKRRWVWNMLHSFGGNSGMYGRLQVISKDPVVAK 357
Query: 231 TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG----RSVP 286
E+ TMVGVG++ EGIEQNPVVY++M+EM ++ ++VDV +W+ +++ RR G R
Sbjct: 358 -KESQTMVGVGITTEGIEQNPVVYEMMAEMRWREQEVDVMSWVEKWADRRLGPEASRERK 416
Query: 287 AI-QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+ ++AW L TVY+C + + ++ + P +D + + + + +EA++
Sbjct: 417 ALGEEAWRELASTVYSCPGTQMGQVKSMVESRPRLDLASGWIPNSDFMPIKRHYPEEALV 476
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
++ W ++ L + + S++ +D+ D+TRQ L+ LF
Sbjct: 477 RA----------W------LKLLRATRGGADGYTCSSSASFDIADVTRQVLSDLFARLFQ 520
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
+ Q A + L ++ DMD ++ LLG W+E A+ +++E+E+
Sbjct: 521 PLSSFCQTRLAGSAAVRMQTLLGIISDMDKMVGTQPRMLLGKWIEDARAWGKSKEEEEVL 580
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E+NAR +T+W + + DY +K W GLL DYY R ++F+++ +++ F
Sbjct: 581 EFNARNLVTLWGPRGE-----IADYASKQWQGLLSDYYMSRWKLFFEHLQQAIRGTRIFS 635
Query: 526 LKDWRREWIKLTNDWQN-GRNVYPVESNGDALITSQWLYNKYLQ 568
+ +++E + WQ + +P G+A+ + L++KY++
Sbjct: 636 QQRFQQELLVFEQQWQTRTSSSFPSSPEGNAVELAWQLHDKYIK 679
>gi|194759443|ref|XP_001961958.1| GF14678 [Drosophila ananassae]
gi|190615655|gb|EDV31179.1| GF14678 [Drosophila ananassae]
Length = 783
Score = 356 bits (913), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 201/577 (34%), Positives = 317/577 (54%), Gaps = 54/577 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++IL LGM+ LPAF+G+VP AL + P+ T
Sbjct: 228 MGNIRGWAGPLKPEWRQFQLLLQQEILSAQRNLGMSVALPAFAGHVPRALSRLHPNTSFT 287
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R+CC ++ T+PLF +I F++ + YG ++HI+ CD F+E PP
Sbjct: 288 DVQRWNQFPD--RYCCGLFVEPTEPLFHQIATTFLQSVVTIYG-SNHIFFCDPFNELEPP 344
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAI++ M + D +A+WL+QGW+F +PFW P +A L +VP G+++VL
Sbjct: 345 VAKPEYMRSTAAAIHNSMTAVDPEAIWLLQGWMFVKNPFWTPDMAEAFLTAVPRGRILVL 404
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + ++G P+IWCMLHNF G + M+G I G AR+ N+++VG
Sbjct: 405 DLQSEQFPQYELTHSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEAARSMPNSSIVGT 464
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN VVY L E + +D+ +W ++V RYG ++ AW +L ++VY
Sbjct: 465 GITPEGIGQNYVVYSLTLERGWSRNSIDLDSWFRHFTVTRYGVKDESLAKAWLLLKNSVY 524
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH-PHLWY 359
+ + G+Y +P S++H P WY
Sbjct: 525 SFHG--------------------LQKMRGQYVVTRRP------------SFNHDPFTWY 552
Query: 360 STSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
+ S+V+ A L +++ E + Y +DL+D+TRQ L A++L++N+ +++
Sbjct: 553 NASDVLEAWHLLLSARVIIPLEDDRYDVYEHDLVDITRQFLQITADQLYVNLKSSFRKRQ 612
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
LS R L+L +D++ +L+ FLLG WLE AKQ+A + E K +E+NAR QIT
Sbjct: 613 LPRFEFLSTRLLQLFDDLELILSSGRNFLLGNWLEQAKQVAPHPEDRKSFEFNARNQITA 672
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWR 530
W N Q + DY K WSGL++DYY PR +++F + +L S G F+ K +
Sbjct: 673 WGPNGQ-----ILDYACKQWSGLVKDYYKPRWSLFFDDVNVALHSQRPFNGSAFKQKVSQ 727
Query: 531 REWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
R I+L + N ++YP + + S ++ +++
Sbjct: 728 R--IELP--FSNKTDIYPTDPVENVWFISHTIFERWM 760
>gi|328867411|gb|EGG15793.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
Length = 1501
Score = 356 bits (913), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 205/588 (34%), Positives = 324/588 (55%), Gaps = 66/588 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++GWGGPL ++ Q LQ++IL R+ + GM PVLP F+G+VP A ++FP+A IT
Sbjct: 950 MGNVNGWGGPLDYDFIAGQHDLQQQILERMRQYGMKPVLPGFAGHVPRAFMSLFPTANIT 1009
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG+W + TY LD +DPLF + + F++ Q YG T H Y+ D F+E TPP
Sbjct: 1010 QLGDWRAFNG------TYYLDPSDPLFANVSQTFVKVQTAIYG-TDHYYSFDPFNEITPP 1062
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
Y+ + +++Y+ + D AVW++Q W F D FW+PPQ+KA L VP+G L+V
Sbjct: 1063 SSDAGYLQNSSSSMYNALAYADPQAVWVLQAWFFISDAWFWQPPQVKAFLGGVPIGHLLV 1122
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD +AE P W+ + QF G +IWCMLHNF G MYG + I GP++AR ++ M G
Sbjct: 1123 LDTWAEESPAWTVTDQFNGHDWIWCMLHNFGGRTGMYGKIPRITAGPIDAR-KQSPGMKG 1181
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ E IEQN ++YDLMSEM+++ ++ WINQY+ RRYG VP + AWN L TV
Sbjct: 1182 TGLTPEAIEQNYIMYDLMSEMSWRTTAPNMTEWINQYTQRRYGVFVPELAQAWNSLASTV 1241
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN D + DKN +F + P + +++Y
Sbjct: 1242 YNAPD-SIDKNPS---SFVGIRPELNMTN---------------------------NIYY 1270
Query: 360 STSEVIRALELFIASGNE-LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+S + +A +L+++ +E + +++TY +D+ ++T QAL+ E + + +AY+
Sbjct: 1271 DSSIIQKAWQLYLSVTDEYVLSTSTYSFDIAEITIQALSNLFIETEIAMYDAYKTGKGTE 1330
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ-----LAQNEEQEK---------- 463
+ + L ++ DMD + + L+G W +A+Q L++N+++++
Sbjct: 1331 FDEHAMNCLNIITDMDMIASTQQLLLVGTWTANARQWANYNLSRNKDEDRNTDKEQMTIE 1390
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM------IES 517
QYE+NAR QIT+W S L DY WSGLL D+Y R +++ KY+ +
Sbjct: 1391 QYEFNARNQITLW----GPSNSTLHDYAYHLWSGLLNDFYLARWSLFIKYLDSSLSSSST 1446
Query: 518 LESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 565
++G GF+ +++ + L W YP G+A S+++ N+
Sbjct: 1447 NDAGTGFKNQEYINDIESLEESWNLQTYQYPTRPTGNAYQLSKFINNQ 1494
>gi|348681836|gb|EGZ21652.1| hypothetical protein PHYSODRAFT_247428 [Phytophthora sojae]
Length = 991
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 202/581 (34%), Positives = 317/581 (54%), Gaps = 45/581 (7%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ +IL R+ E GM P LPAF+G+VP L+ P+A
Sbjct: 422 MGNLRGSWVKGPLPQAFIDNQHELQLRILQRMREFGMIPALPAFAGHVPEDLKLTLPNAN 481
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T+ NW + ++CC Y+++ TDPL+ EIG+AF+E+Q Y TS +Y CDT+ E
Sbjct: 482 FTRSPNWGNFTD--QYCCVYMIEPTDPLYREIGKAFLEEQRALYNYTSSLYQCDTYMEMA 539
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P + A+ GM + D +AVWLMQGW F DP +W P++KA L VP KL
Sbjct: 540 PEFTDLSELKGAARAVIDGMTAADPNAVWLMQGWPFVDDPHYWTRPRVKAYLEGVPTDKL 599
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LD ++E PIW+ ++G +I+ +LHNF GN M G L ++A PV+A+ N TM
Sbjct: 600 IILDFYSEAVPIWNKMDNYFGKNWIYSVLHNFGGNTGMRGDLPTLATAPVQAQRDGNGTM 659
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VGVG++MEGI QN VVYDL +MA++ +DV W+++Y+ RRY ++ AW+ L
Sbjct: 660 VGVGLTMEGIFQNYVVYDLTLQMAWEDSPLDVDEWVSKYASRRYHTQNEHVERAWSYLSR 719
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VYN T +A+ V S++ + Y + + +
Sbjct: 720 SVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR--------------FQPTLI 754
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF--LNIIEAYQLND 415
Y +++ A + + +G+EL +TYR+DL+D+T+Q L+ E + L +I + +
Sbjct: 755 KYDPKDIVLAWKELLLAGDELRNVDTYRHDLVDVTKQFLSNKLLEQYQHLKVIYSAKSAP 814
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN----------EEQEKQY 465
A+ V +L++ L + ++ +LA ++ FLLG W+ A LA + + ++ Y
Sbjct: 815 ANEVCELTKTMLTTINRLEEILATNEDFLLGNWVADALNLAGDLNIGGDSVTRTKLQEYY 874
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E+ AR Q+T W DN E + DY K W+GL++ YY PR ++ + +
Sbjct: 875 EYEARNQVTRWGDNNNEA---IHDYAGKEWAGLVKSYYLPRWTMWLTEVCSAYTDRREMD 931
Query: 526 LKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
K ++ I WQ YP + GD+ S+ +Y++Y
Sbjct: 932 EKGLKKRRIAFELKWQLSHEKYPTTTVGDSFSISKRIYSEY 972
>gi|157134500|ref|XP_001656341.1| alpha-n-acetylglucosaminidase [Aedes aegypti]
gi|108881379|gb|EAT45604.1| AAEL003150-PA [Aedes aegypti]
Length = 763
Score = 353 bits (906), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 198/566 (34%), Positives = 315/566 (55%), Gaps = 47/566 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL ++++ LQ +++ + LGM LPAF+G++P ++P AK+T
Sbjct: 211 MGNIRGWGGPLTTNFINFSKKLQNQVIDEMRRLGMVLALPAFAGHLPVQFAQLYPEAKLT 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ NW + ++ LD DPLF EIG+ F+ + ++ YG ++HIY CD F+E P
Sbjct: 271 PVENWNGFPA--QYASPLFLDPIDPLFQEIGKRFLTKVIERYG-SNHIYFCDPFNEIQPR 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
S +Y+SS A IY M D AVWL+QGW+F +P+W ++A L +VPLG+++VL
Sbjct: 328 SFSAKYLSSASAGIYKAMNDVDPFAVWLLQGWMFVKNPYWSDVAIRAFLQAVPLGRMLVL 387
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCML NF G + M G +D + + RT+++ TM+G
Sbjct: 388 DLQSEQFPQYDRTESYHGQPFIWCMLSNFGGTLGMLGSVDLVFQRIRDVRTNDSMTMIGT 447
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN +Y+ EM + +V+ W Y+ RYG ++DAW++ +TVY
Sbjct: 448 GITPEGINQNYGLYEFALEMGWNPNIDNVEEWFRTYASVRYGTQDKRLKDAWSMFRYTVY 507
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + + GKY +P K HP LWY+
Sbjct: 508 SFKEQ--------------------EMMRGKYTFNRRPSLKL-----------HPWLWYN 536
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+ ++L + S S + +R D++DLTRQ L A+ L+LNI+EAY + + V
Sbjct: 537 ETLFNAGVQLLLESN---STNTLFRNDVVDLTRQFLQNTADRLYLNIMEAYNTKNPNSVK 593
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
LS F +L+EDMD LL FLLG WLESAK +A+ + ++YE+NAR QIT+W
Sbjct: 594 YLSILFQKLLEDMDRLLRTDQHFLLGRWLESAKAVAETSLERQKYEYNARNQITLWGPQG 653
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---RLKDWRREWIKLT 537
Q + DY NK W+G+++D++ PR ++ M + +E +++D + ++L
Sbjct: 654 Q-----IVDYANKQWAGMVQDFFLPRWKLFLTEMTKDVEQNRTLNEGKVRDKIFKMVELP 708
Query: 538 NDWQNGRNVYPVESNGDALITSQWLY 563
N R YP+ +GDAL+ ++ L+
Sbjct: 709 FCTSNKR--YPIRPDGDALLVARELF 732
>gi|440800773|gb|ELR21808.1| AlphaN-acetylglucosaminidase (NAGLU) [Acanthamoeba castellanii str.
Neff]
Length = 800
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 210/592 (35%), Positives = 313/592 (52%), Gaps = 72/592 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGPL +SW + Q LQKKI+ ++ + A L+ V+P A IT
Sbjct: 199 MGNVQSWGGPLTKSWREGQAELQKKIVQGVWN---EERAVSVRWARAAGLKRVYPHANIT 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W R +LLD DP+F +IG AFI+ Q + YG T HIYN DTF+E PP
Sbjct: 256 LSPTWAHFTDPYR---VWLLDPFDPIFQKIGTAFIDAQTRVYG-TDHIYNADTFNELDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P Y+++ A+Y GM + D A+WLMQGWLF +W ++KA L+ V +++L
Sbjct: 312 SADPTYLAAASNAVYQGMAAADPKALWLMQGWLFR-SVWWSNDRIKAYLSGVKNDNMLIL 370
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+AEV PIWS ++ ++G P++WCMLH+F GN ++YG L IA PV+ART+ +TMVG
Sbjct: 371 DLYAEVDPIWSKTESYFGKPFVWCMLHDFGGNRDLYGNLTHIATAPVDARTAPGSTMVGT 430
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ME IEQNPV+Y+LMSEM ++ VDV W++ Y RYG P+ + AW +L+ + Y
Sbjct: 431 GLTMEAIEQNPVIYELMSEMGWRSAHVDVDDWLDHYVSFRYGADSPSAKKAWRLLHQSAY 490
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
P + SI + ++ S H YS
Sbjct: 491 QN---------------PVIMRSIYTFVPNRH----------------VSRNHH----YS 515
Query: 361 TSEVIRALELFIASGNEL----SASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND- 415
++ A L + S EL + + YDL+D+TRQ L + LF +AY L D
Sbjct: 516 PDVLVEAWGLLLQSRLELPNPAQPNGPWEYDLVDVTRQVL----DNLFH---DAYGLLDG 568
Query: 416 AHGVFQLSRR------------FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
A+ + +RR ++++ D+D +LA + +LLG W E A+ A NEE+++
Sbjct: 569 AYDAYVATRRDPFNQVKTIGAALIQILSDIDTVLATNQNYLLGVWTERARSWATNEEEKR 628
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YE+NAR QIT+W N + + DY +K W+GL+ YY PR I+ Y+ +S+ G
Sbjct: 629 LYEFNARNQITLWGPNGE-----INDYASKEWAGLVGTYYRPRWQIFVAYLFDSIAKGTV 683
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH 575
+ + + W N N +P ++ G+ SQ LY +Y+ + H
Sbjct: 684 IDPNKYAADLLLWEQRWNNQTNAFPSQATGNVAEVSQALYARYVSAAELKQH 735
>gi|242011515|ref|XP_002426494.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
gi|212510620|gb|EEB13756.1| alpha-N-acetylglucosaminidase, putative [Pediculus humanus corporis]
Length = 1345
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 195/521 (37%), Positives = 306/521 (58%), Gaps = 46/521 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ + L +WL QQL+LQ KIL R+ ELG+ PVLP+F G VP + ++ +P AK+
Sbjct: 836 MGNVRNFSYGLTNNWLQQQLLLQHKILNRLRELGITPVLPSFCGIVPRSFKDSYPFAKLL 895
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W D +CC YLLD+ DPLF + R F+++ + E+G T+HIYNCD F+EN P
Sbjct: 896 EMPKWNKFSRD--YCCPYLLDSNDPLFSVVSRVFLKEYINEFG-TNHIYNCDVFNENKPA 952
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRP-PQMKALLNSVPLGKLVV 179
+S +Y+S++ + IY M S D A WL+QGW+F DPFW ++KA +N+VP G++++
Sbjct: 953 SESLDYLSTISSTIYKAMSSVDPRATWLVQGWMF-IDPFWASLKRVKAFINAVPKGRMLI 1011
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +++ P + + ++G P+IWC LHNF G + MYG L+ + G + R +N+TMVG
Sbjct: 1012 LDLQSDLTPQYKRLQSYFGQPFIWCTLHNFGGQLGMYGHLNRVNLGVFKGRKFKNSTMVG 1071
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G++ EGI+QN ++YD ++A + + VD+ WI +Y++RRYG I DAW +L +T+
Sbjct: 1072 IGIAPEGIDQNYIMYDFTLDLALRTKPVDLDDWITKYALRRYGLIEKNILDAWLILKNTL 1131
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
YN PD + + S Y + +L S + WY
Sbjct: 1132 YNYN--------------PDSNFRLTSSNVKMYTLVKGEHIAKNILTKFPSLRMNEFTWY 1177
Query: 360 STSEVIRALELF-IASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
+ S ++ E F IAS N + ++++ +++DLID+TRQ +
Sbjct: 1178 NRSIILDIFEKFQIASSNSILSTSSLFQHDLIDVTRQTI--------------------Q 1217
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ S FLEL+ ++D +L FLLG WLESAK +A N+ ++ YE+NAR QIT+W
Sbjct: 1218 IAIENSNMFLELLNELDMILNTGKKFLLGNWLESAKNMATNKLEKDNYEFNARNQITLWG 1277
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL 518
N + +RDY K W+G++ D+Y PR ++F+ + ES+
Sbjct: 1278 SNGE-----IRDYAAKQWAGMIHDFYKPRWKLFFQALNESI 1313
>gi|348681870|gb|EGZ21686.1| hypothetical protein PHYSODRAFT_495971 [Phytophthora sojae]
Length = 692
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 194/577 (33%), Positives = 310/577 (53%), Gaps = 41/577 (7%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ KIL R+ GM P LPAF+G+VP L+ ++P+AK
Sbjct: 121 MGNLRGSWVEGPLPQAFIDGQYELQLKILERMRGFGMVPALPAFAGHVPEELKTLYPNAK 180
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T+ NW + +CC Y+LD DPL+ EIG+ F+E+Q Y TS +Y CDT++E
Sbjct: 181 FTRSPNWGGFSDE--FCCVYMLDPQDPLYYEIGKTFLEEQRALYDYTSSLYQCDTYNEMD 238
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P P + + A+ M + D +AVWL+QGWLF P +W +++ L+ VP K+
Sbjct: 239 PDFTDPAKLQAASRAVIDSMTAADPNAVWLIQGWLFVNSPNYWTKERVQTYLDGVPNDKM 298
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL++EV+P+W+ ++G +I+C+LHNF GN M G L ++ PV A + + TM
Sbjct: 299 IILDLYSEVRPVWNKMDNYFGKSWIYCVLHNFGGNTGMRGDLPTLGTAPVLANRASSGTM 358
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
+G+G++MEGI QN VVYDL +MA+ +D+ W+ ++ +RY + AW L
Sbjct: 359 IGMGLTMEGIFQNYVVYDLTLQMAWVDAPLDMDEWVPSFAAQRYHSQDAHTERAWGFLLQ 418
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VYN T G + ++ P + ++ + + +
Sbjct: 419 SVYNRTLGYGGVTKSLVCLIPHWK-----------------LVRDGFMPTLIT------- 454
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-- 415
Y ++ RA + + +G+EL A +TYR+DL+D+TRQ L+ + +L++ + Y +
Sbjct: 455 -YDPMDITRAWKELLLAGSELHAVDTYRHDLVDVTRQFLSDHFMAQYLHLEDMYAGKETP 513
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN------EEQEKQYEWNA 469
A + + R L +E +D +LA +D FLLG W+ A+ LA + YE+ A
Sbjct: 514 ADQLCAWTDRMLVTIEWLDEILATNDDFLLGNWVADARALADEVGAAEVTSLQDYYEYEA 573
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
R Q+T W DN E + DY K W+GL+ YY PR ++ + +S
Sbjct: 574 RNQVTRWGDNNSES---IHDYAGKEWAGLVSGYYLPRWRMWLTEVCQSYTQKRDVNEAAL 630
Query: 530 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
++ + +WQ YP + GD L S+ +Y ++
Sbjct: 631 KKARVDFELNWQLSHERYPTTTTGDTLAVSKRIYEEF 667
>gi|195577611|ref|XP_002078662.1| GD22403 [Drosophila simulans]
gi|194190671|gb|EDX04247.1| GD22403 [Drosophila simulans]
Length = 778
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 199/569 (34%), Positives = 306/569 (53%), Gaps = 52/569 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ + LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPGWRRYQLLLQQEIITAQHNLGMSVALPAFAGHVPRALKRLHPESTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC ++ TD LF EI F++ + +YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTDNLFKEIASRFLQNIITKYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++VG
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIDEARRLPNSSLVGT 459
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y E + + +D+ +W +S RYG ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNFSHTRYGVKDERLEQAWLLLKNSVY 519
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + G+Y V+ S P WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A L + S + + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 549 ASAVLDAWHLLLTSRAIIPLEDDRYEIYEHDLVDITRQFLQISADQLYVNLRSAYRKRQV 608
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
LS + L+L +DM+ +LA FLLG WL+ AKQ A N +++ +E+NAR QIT W
Sbjct: 609 ARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGEQRNFEFNARNQITAW 668
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRR 531
+ Q + DY K WSGL+ DYY PR ++ + + +L + G F+LK
Sbjct: 669 GPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLEDVTVALHAGRPYNGTAFKLK--VS 721
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+ I+L + N +VYPV G+ + SQ
Sbjct: 722 QEIELP--FSNKADVYPVTPVGNTWLISQ 748
>gi|198476648|ref|XP_001357424.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
gi|198137793|gb|EAL34493.2| GA12255 [Drosophila pseudoobscura pseudoobscura]
Length = 767
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 197/571 (34%), Positives = 313/571 (54%), Gaps = 44/571 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL + Q +LQ+ IL +LG++ LPAF+G++P A++ ++P+ T
Sbjct: 212 MGNIRGWGGPLKPEYQRLQELLQQHILRAQRDLGISVALPAFAGHLPTAMRRIYPNGNYT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S DP +CC +D DP+F + F+ + ++ YG ++HI+ CD F+E PP
Sbjct: 272 EVERWNSFP-DP-YCCGLFVDPLDPIFDLVAALFLRRVVQRYG-SNHIFFCDPFNELQPP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V P+Y+ S AAI++ M+S D +AVWL+QGW+F + FW M+A L +VP+G+L+VL
Sbjct: 329 VAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNIFWTDAMMEAFLTAVPIGRLIVL 388
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + +YG P++WCMLHNF G + M+G D + G AR N+++VGV
Sbjct: 389 DLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGSADLVNNGIEAARRMPNSSIVGV 448
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y L+ E + +D+ +W ++ RYG +Q AW +L +VY
Sbjct: 449 GITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFARTRYGVDDEGLQQAWQLLRRSVY 508
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ K R G V++ L + P WY+
Sbjct: 509 SFR--GLQKMRG-----------------------GYTVTRRPALNLD------PFTWYN 537
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S+V+ A +L ++S + + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 538 ASDVLEAWKLLLSSRAIIPLEDDNYAIYEHDLVDITRQYLQISADQLYVNLKSAYRKRQV 597
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
L + L+L D++ +LA FLLG WL A++ A N + +E+NAR QIT W
Sbjct: 598 ARFEYLGSKLLQLFGDLERILASGSNFLLGTWLADAQRAAPNAADKPNFEFNARNQITAW 657
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIK 535
+ Q + DY K WSGL+ DYY PR A++ + +L S F ++ R +
Sbjct: 658 GPDGQ-----ILDYACKQWSGLVLDYYRPRWALFLDDVTLALHSNRTFNSTAFKLRVSQE 712
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + N +VYPVE G+ SQ +Y ++
Sbjct: 713 VELPFSNKSDVYPVEPMGNTWFISQNIYERW 743
>gi|195155652|ref|XP_002018715.1| GL25802 [Drosophila persimilis]
gi|194114868|gb|EDW36911.1| GL25802 [Drosophila persimilis]
Length = 767
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 196/571 (34%), Positives = 313/571 (54%), Gaps = 44/571 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL + Q +LQ+ IL +LG++ LPAF+G++P A++ ++P+ T
Sbjct: 212 MGNIRGWGGPLKPEYQRLQELLQQHILRAQRDLGISVALPAFAGHLPTAMRRIYPNGNYT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S DP +CC +D DP+F + F+ + ++ YG ++HI+ CD F+E PP
Sbjct: 272 EVERWNSF-PDP-YCCGLFVDPLDPIFDLVAALFLRRVVQRYG-SNHIFFCDPFNELQPP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V P+Y+ S AAI++ M+S D +AVWL+QGW+F + +W M+A L +VP+G+L+VL
Sbjct: 329 VAEPDYMRSTAAAIHNSMRSVDPEAVWLLQGWMFVKNIYWTDAMMEAFLTAVPIGRLIVL 388
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + +YG P++WCMLHNF G + M+G D + G AR N+++VGV
Sbjct: 389 DLQSEQFPQYQRTDSYYGQPFVWCMLHNFGGTLGMFGSADLVNNGIEAARRMPNSSIVGV 448
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y L+ E + +D+ +W ++ RYG +Q AW +L +VY
Sbjct: 449 GITPEGIGQNYVMYSLVLERGWSELPLDLDSWFKHFARTRYGVDDEGLQQAWQLLRRSVY 508
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ K R G V++ L + P WY+
Sbjct: 509 SFR--GLQKMRG-----------------------GYTVTRRPALNLD------PFTWYN 537
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S+V+ A +L ++S + + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 538 ASDVLEAWKLLLSSRAIIPLEDDKYAIYEHDLVDITRQYLQISADQLYVNLKSAYRKRQV 597
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
L + L+L D++ +LA FLLG WL A++ A N + +E+NAR QIT W
Sbjct: 598 ARFEYLGSKLLQLFGDLEHILASGSNFLLGTWLADAQRAAPNAADKPNFEFNARNQITAW 657
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR-REWIK 535
+ Q + DY K WSGL+ DYY PR A++ + +L S F ++ R +
Sbjct: 658 GPDGQ-----ILDYACKQWSGLVLDYYRPRWALFLDDVTLALHSNRTFNSTAFKLRVSQE 712
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ + N +VYPVE G+ SQ +Y ++
Sbjct: 713 VELPFSNKSDVYPVEPMGNTWFISQNIYERW 743
>gi|21356587|ref|NP_652045.1| CG13397, isoform A [Drosophila melanogaster]
gi|442626853|ref|NP_001260251.1| CG13397, isoform B [Drosophila melanogaster]
gi|16185856|gb|AAL13967.1| LP03571p [Drosophila melanogaster]
gi|22945953|gb|AAF52672.2| CG13397, isoform A [Drosophila melanogaster]
gi|440213562|gb|AGB92787.1| CG13397, isoform B [Drosophila melanogaster]
Length = 778
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 196/568 (34%), Positives = 305/568 (53%), Gaps = 50/568 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL +W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPAWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLNPESTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC ++ T+ LF EI F+ + +YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLHNIITKYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++VG
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y E + + +D+ +W +S RYG ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNFSHSRYGVKDERLEQAWLLLKNSVY 519
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + G+Y V+ S P WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548
Query: 361 TSEVIRALELFIASGN----ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A L + E + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 549 ASAVLDAWHLLLTFRAIIPLEDNRYEIYEHDLVDITRQFLQISADQLYINLRSAYRKRQV 608
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
LS + L+L +DM+ +LA FLLG WL+ AKQ A N Q++ +E+NAR QIT W
Sbjct: 609 SRFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGQQRNFEFNARNQITAW 668
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+ Q + DY K WSGL+ DYY PR ++ + + +L +G F ++ +K+
Sbjct: 669 GPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLEDVTVALHAGRPFNGTAFK---LKV 720
Query: 537 TND----WQNGRNVYPVESNGDALITSQ 560
+++ + N +VYPV G+ + SQ
Sbjct: 721 SHEIELPFSNKDDVYPVTPVGNTWLISQ 748
>gi|194863164|ref|XP_001970307.1| GG23441 [Drosophila erecta]
gi|190662174|gb|EDV59366.1| GG23441 [Drosophila erecta]
Length = 778
Score = 345 bits (885), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 200/569 (35%), Positives = 305/569 (53%), Gaps = 52/569 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPEWRRYQLLLQQEIIAAQRNLGMSVALPAFAGHVPRALKRLHPGSTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC L+ TD LF EI F+++ + YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFLEPTDNLFNEIALIFLQKIITAYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY ++ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESIRRLDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++VG
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y E + + +D+ +W +S RYG ++ AW L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNRPLDLDSWFTSFSHARYGVKDERLEQAWLQLKNSVY 519
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + G+Y +P K+ P WY+
Sbjct: 520 SFHG--------------------LQKMRGQYVVTRRPSFKQ-----------EPFTWYN 548
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A L ++S + + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 549 ASAVLDAWHLLLSSRAIIPLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAYKKRQV 608
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
LS + L+L +DM+ +LA FLLG WL+ AKQ A + +++ YE+NAR QIT W
Sbjct: 609 SRFEFLSSKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPHPGEQRNYEFNARNQITAW 668
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRR 531
+ Q + DY K WSGL+ DYY PR ++ + + +L S G F+LK
Sbjct: 669 GPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLEDVTVALHSLRPFNGTAFKLK--VS 721
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+ I+L + N +VYPV G+ SQ
Sbjct: 722 QEIELP--FSNKVDVYPVTPVGNTWFISQ 748
>gi|90399367|emb|CAJ86183.1| H0212B02.15 [Oryza sativa Indica Group]
gi|116311963|emb|CAJ86322.1| OSIGBa0113E10.5 [Oryza sativa Indica Group]
Length = 692
Score = 344 bits (883), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 163/262 (62%), Positives = 205/262 (78%), Gaps = 6/262 (2%)
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
+GVGMSMEGIEQNP+VYDLMSEMAF H +VD++ W+ Y RRYG+S+ +QDAW +LY
Sbjct: 427 IGVGMSMEGIEQNPIVYDLMSEMAFHHRQVDLQVWVETYPTRRYGKSIVGLQDAWKILYQ 486
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKY----QNYGKPVSKEAV-LKSETSSY 352
T+YNCTDG DKNRDVIVAFPDV+P +I T G Y + Y +SK + + + Y
Sbjct: 487 TLYNCTDGKNDKNRDVIVAFPDVEPFVIQ-TPGLYTSSSKTYSTKLSKNYIAVDASNDEY 545
Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+HPHLWY T VIRALELF+ G+E+S SNT+RYDL+DLTRQ LAKYAN++F+ IIE+Y+
Sbjct: 546 EHPHLWYDTDAVIRALELFLRYGDEVSDSNTFRYDLVDLTRQTLAKYANQVFVKIIESYK 605
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
N+ + V L + F++LV D+D LLA H+GFLLGPWLESAK LA+++EQE QYEWNARTQ
Sbjct: 606 ANNVNQVSNLCQHFIDLVNDLDTLLASHEGFLLGPWLESAKGLARDKEQEMQYEWNARTQ 665
Query: 473 ITMWFDNTQEEASLLRDYGNKY 494
ITMWFDNT+ +ASLLRDYG +
Sbjct: 666 ITMWFDNTKTKASLLRDYGEAH 687
Score = 295 bits (756), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 133/189 (70%), Positives = 157/189 (83%), Gaps = 3/189 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 261 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 320
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LGNWF+V S+PRWCCTYLLDA+DPLF+EIG+ FIE+Q++EYG TSH+Y+CDTFDENTPP
Sbjct: 321 HLGNWFTVDSNPRWCCTYLLDASDPLFVEIGKLFIEEQIREYGGTSHVYSCDTFDENTPP 380
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG---KL 177
+ P YISSLGAA + GMQSGD DA+WLMQGWLFSYDPFW PPQMK + G
Sbjct: 381 LSDPNYISSLGAATFRGMQSGDDDAIWLMQGWLFSYDPFWEPPQMKIGVGMSMEGIEQNP 440
Query: 178 VVLDLFAEV 186
+V DL +E+
Sbjct: 441 IVYDLMSEM 449
>gi|195339231|ref|XP_002036223.1| GM12949 [Drosophila sechellia]
gi|194130103|gb|EDW52146.1| GM12949 [Drosophila sechellia]
Length = 778
Score = 344 bits (882), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 197/569 (34%), Positives = 305/569 (53%), Gaps = 52/569 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTAGWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLHPESTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC ++ T+ LF EI F++ + +YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLQNIITKYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D +A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPEAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++VG
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y E + + +D+ W +S RYG ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDGWFTNFSHTRYGVKDERLEQAWLLLKNSVY 519
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + G+Y V+ S P WY+
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYN 548
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A L + S + + Y +DL+D+TRQ L A++L++N+ AY+
Sbjct: 549 ASAVLDAWHLLLTSRAIIPLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAYRKRQV 608
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
LS + L+L +DM+ +LA FLLG WL+ AKQ A N +++ +E+NAR QIT W
Sbjct: 609 ARFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNTGEQRNFEFNARNQITAW 668
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRR 531
+ Q + DY K WSGL+ +YY PR ++ + + +L + G F+LK
Sbjct: 669 GPDGQ-----ILDYACKQWSGLVSNYYRPRWRLFLEDVTVALHAGRPYNGTAFKLK--VS 721
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+ I+L + N +VYPV G+ + SQ
Sbjct: 722 QEIELP--FSNKIDVYPVTPVGNTWLISQ 748
>gi|170060634|ref|XP_001865888.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
gi|167879069|gb|EDS42452.1| alpha-N-acetyl glucosaminidase [Culex quinquefasciatus]
Length = 761
Score = 340 bits (873), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 200/571 (35%), Positives = 306/571 (53%), Gaps = 56/571 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL +S+ LQ K++ + GM LPAF+G++P + +FP AK+
Sbjct: 210 MGNIRGWGGPLKESFKTFASDLQAKVVQEMRRFGMILALPAFAGHLPVQFKTLFPQAKLN 269
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + ++ LD DPLF +IG F+ + + YG T HIY D F+E P
Sbjct: 270 PVEVWNGFPA--QYASPLFLDPVDPLFQKIGSKFVAKAIARYG-TDHIYFSDPFNEIQPR 326
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+S Y++S A IY M D AVWL+QGW+ +PFW +KA +VP G+++VL
Sbjct: 327 SESARYLASAAAGIYQAMVDVDPLAVWLLQGWMLVKNPFWSDRAIKAFFTAVPNGRMLVL 386
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ +YG P+IWCML NF G + M G +D + E R++E+ TM+G
Sbjct: 387 DLQSEQFPQYVRTQSYYGQPFIWCMLSNFGGTLGMLGSVDLVFERIRETRSNESMTMIGT 446
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN +Y+ EM + + DV W +Y++ RYG +QDAW++ TVY
Sbjct: 447 GITPEGINQNYGLYEFALEMGWNPDISDVDNWFTRYAMVRYGNDDKRLQDAWSIFRSTVY 506
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + + GKY +P K P +WY+
Sbjct: 507 SFKG--------------------MEMMRGKYTFNRRPSLKL-----------QPWVWYN 535
Query: 361 TSEVIRALELFIA--SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+ +EL +A NEL +++D++DLTRQ L A++L+L I++ Y L +A
Sbjct: 536 ETRFDEGVELILAVNGSNEL-----FKHDVVDLTRQFLQNTADKLYLTIMDTYTLKNAAA 590
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
S F EL++++D LLA + FLLG WLESAK LA + ++YE+NAR QIT+W
Sbjct: 591 FKHYSNLFKELLQNIDRLLATNTHFLLGRWLESAKSLATTSLERQKYEYNARNQITLWGP 650
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-----RLKDWRREW 533
Q + DY NK WSG+++D++ PR +++ + M +L + R K +R+
Sbjct: 651 QGQ-----IVDYANKQWSGVVQDFFLPRWSLFLQEMELALATNGTINETKVRDKIFRKVE 705
Query: 534 IKLTNDWQNGRNVYPVESNG-DALITSQWLY 563
+ D R YP E++G DAL ++ LY
Sbjct: 706 LPFNTD----RKKYPAEASGEDALELARELY 732
>gi|198433857|ref|XP_002122480.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 880
Score = 339 bits (870), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 189/455 (41%), Positives = 263/455 (57%), Gaps = 37/455 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLP W+ QL+LQ +IL+R+ LGM PVLP F+G++P+A+ N++P A +
Sbjct: 211 MGNLHGWGGPLPSFWIKSQLILQHQILIRMRSLGMIPVLPGFAGHIPSAILNLYPKADVI 270
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QL +W + CTYLL DPLF IG FI++Q+ EY T+HIYN DTF+E TPP
Sbjct: 271 QLSHWSHFNCT--YSCTYLLQPHDPLFNTIGSMFIKEQMLEYNGTNHIYNADTFNEMTPP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+S+ A+Y M D DAVWLMQGWLF ++P FW+ Q KALL VP GK++V
Sbjct: 329 SSDPGYLSNASRAVYDAMAVADPDAVWLMQGWLFHHEPTFWKTAQKKALLTGVPKGKMLV 388
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E P + ++G P++WCMLH+F GN+ YG ++++ P A TS N+TMVG
Sbjct: 389 LDLFSESYPQY-LPDWYFGQPFLWCMLHDFGGNMGFYGKINTVNTQPGIALTSVNSTMVG 447
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI QN ++YD M E F V+V W+ +Y++RRY S P WN+L +T+
Sbjct: 448 TGVTPEGINQNYMIYDFMLETGFTVHSVNVTNWLKEYTMRRYNTSSPEAIKTWNILGNTI 507
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL-- 357
YN D FP S+I G PV + + D+P L
Sbjct: 508 YN----------DTKPGFP--SKSLIR---------GSPVKRPTL--------DNPGLPY 538
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
WY S + A + F S N L T RYD +D+TRQ L L+ ++E +
Sbjct: 539 WYQYSSLALAWDNFSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDP 598
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESA 452
G +L + L+L++D D +L F +G W++ A
Sbjct: 599 G--KLGEQLLDLLDDFDKMLCSDAHFSMGKWIQDA 631
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 103/197 (52%), Gaps = 8/197 (4%)
Query: 371 FIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELV 430
F S N L T RYD +D+TRQ L L+ ++E + G +L + L+L+
Sbjct: 684 FSQSLNTLKDLETVRYDAVDITRQMLQAVHRLLYYAMVEEFLWKRDPG--KLGEQLLDLL 741
Query: 431 EDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDY 490
+D D +L F +G W++ AK L E++ YE+NAR Q+T+W N + + DY
Sbjct: 742 DDFDKMLCSDAHFSMGKWIQDAKILGTTAEEKDLYEYNARIQVTLWGPNGE-----ILDY 796
Query: 491 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE-WIKLTNDWQNGRNVYPV 549
+K+W L++ YY PR A++ Y+ + + F K + + + + + R+V+P
Sbjct: 797 ASKHWCSLVKHYYRPRWALFVSYLNHAYATKSKFDHKAFASDVFTNVEEPFTKDRSVFPS 856
Query: 550 ESNGDALITSQWLYNKY 566
+ G+A+ ++ +Y K+
Sbjct: 857 TATGNAIELAKDMYIKW 873
>gi|158300970|ref|XP_320760.4| AGAP011750-PA [Anopheles gambiae str. PEST]
gi|157013415|gb|EAA00039.4| AGAP011750-PA [Anopheles gambiae str. PEST]
Length = 770
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 192/572 (33%), Positives = 310/572 (54%), Gaps = 49/572 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL S+ LQ +++ + LGM LPAF+G++P + ++P+
Sbjct: 213 MGNIRGWGGPLTPSFTQFAHTLQVRVVGEMRRLGMAVALPAFAGHLPVQFRTLYPNVSFA 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + P++ LD T+PLF IG F++ +K YG T H+Y D F+E P
Sbjct: 273 NVSVWNNFP--PQYASPLFLDPTEPLFAAIGSRFLQLAIKTYG-TDHVYFSDPFNEIDPT 329
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ S +Y+SS+ AIYS M D DA+WL+QGW+F +PFW +++ L++VPLG+++VL
Sbjct: 330 LPSGKYLSSVSEAIYSTMVQVDPDAIWLLQGWMFVKNPFWSDRAIRSFLSAVPLGRMLVL 389
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + + G P+IWCML NF G + M G + ++ G E R + T++G
Sbjct: 390 DLQSEQYPQYGRTASYAGQPFIWCMLSNFGGTLGMLGSVGNVFRGIRETRDNSTYTLLGT 449
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR-SVPAIQDAWNVLYHTV 299
G++ EGI QN +Y+ EM + E + W ++Y+V RYG S Q AWN+ TV
Sbjct: 450 GITPEGINQNYALYEFALEMGWNAELDSAEQWFSEYAVARYGNDSDERAQQAWNIFLRTV 509
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y AF ++ + GKY +P SK P WY
Sbjct: 510 Y---------------AFEGLE-----LMRGKYTFNRRPSSK-----------IRPWTWY 538
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ LEL ++ E S + +YDL+D TRQ L A+ L+L ++++++ D
Sbjct: 539 DVHTFNQGLELLLSFAEEASCNQLCQYDLVDATRQCLQHTADALYLTLMDSFKKRDLTSF 598
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
S FL+L+ D+D LL ++ FLLGPWLESAK A+ + +YE+NAR QIT+W
Sbjct: 599 RLHSSLFLQLLSDLDVLLRTNEHFLLGPWLESAKAHAETTLERHKYEYNARIQITLWGPQ 658
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKD--WRREWI 534
Q + DY NK W+G+++D++ PR ++ + ++L + + +++D +R +
Sbjct: 659 GQ-----IVDYANKQWAGMVQDFFLPRWRVFLGELDQALATNGTINDLKIRDKIFRTVEL 713
Query: 535 KLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+D ++ Y + +GD + T++ LY ++
Sbjct: 714 PFVSDSKH----YATQPSGDTVRTARTLYERW 741
>gi|301107007|ref|XP_002902586.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
gi|262098460|gb|EEY56512.1| alpha-N-acetylglucosaminidase (NAGLU), putative [Phytophthora
infestans T30-4]
Length = 736
Score = 338 bits (867), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 201/572 (35%), Positives = 305/572 (53%), Gaps = 54/572 (9%)
Query: 1 MSNLHG-W-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL G W GPLPQ+++D Q LQ +IL R+ E GM P LPAF+G+VP L+ P+A
Sbjct: 196 MGNLRGSWVKGPLPQAFIDNQHELQLRILERMREFGMIPALPAFAGHVPEELKLRLPNAH 255
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
TQ NW + + CC ++++ TD L+ EIG+ F+++Q + Y TS +Y CDT+ E
Sbjct: 256 FTQSPNWGNFSEEH--CCVFMIEPTDALYREIGKNFLKEQRELYNYTSSLYQCDTYMEMA 313
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
P + A+ GM + D +AVWLMQGW F DP FW P++KA L+ VP KL
Sbjct: 314 PEFTDLTELEGAARAVIDGMTAADPNAVWLMQGWPFVDDPHFWTKPRVKAYLDGVPTDKL 373
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LD ++E PIWS ++G +I+ +LHNF GN M G L ++A PV A + N TM
Sbjct: 374 IILDFYSESVPIWSKMDNYFGKSWIYSVLHNFGGNTGMRGDLLTLATAPVLANWAGNGTM 433
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VGVG++MEGI QN +VYDL +MA+ +DV WI QY+ +RY ++ AW+ L
Sbjct: 434 VGVGLTMEGIFQNYIVYDLTLQMAWVDNPLDVNTWIPQYAAQRYHTHNEHVEQAWSYLLR 493
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VYN T +A+ V S++ + Y + + ++K
Sbjct: 494 SVYNRT-----------LAYGGVTKSLVCLIPHWRLLYDR--FQPTLIK----------- 529
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA- 416
Y ++V+ A + + + NEL +TYR+DL+D+T+Q L+ E ++++ Y A
Sbjct: 530 -YDPNDVVLAWKELLLAENELRDVDTYRHDLVDVTKQFLSNKLLEQYIHLKGIYNAKKAS 588
Query: 417 -HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+ V L++ L +E ++ +LA ++ FLLG W+ AR Q+T
Sbjct: 589 PNEVCGLTKTMLTTMERLEEILATNEDFLLGNWI-------------------ARNQVTR 629
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W DN E + DY K W+GL++ YY PR ++ + + K + + I
Sbjct: 630 WGDNNNEA---IHDYAGKEWAGLVKGYYIPRWTMWLSEVCNAYTDKREMNEKALKEKRIA 686
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
WQ G YP + GDA S+ YN+Y+
Sbjct: 687 FELKWQLGHESYPTTTVGDAFTISKRFYNEYI 718
>gi|330791218|ref|XP_003283691.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
gi|325086434|gb|EGC39824.1| hypothetical protein DICPUDRAFT_26247 [Dictyostelium purpureum]
Length = 712
Score = 335 bits (859), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 193/575 (33%), Positives = 309/575 (53%), Gaps = 52/575 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ WGGP+ WL++Q LQ +IL R+ GM PVLP F+G++P A+Q +FP+A ++
Sbjct: 170 MGNVNNWGGPITMDWLEKQRDLQIQILTRMRAYGMKPVLPGFAGHIPGAIQTLFPTANVS 229
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W T+ LD +DPLF +I + FI + + +G T H YN D F+E PP
Sbjct: 230 ILSTWCEFNG------TFYLDPSDPLFGKITQLFITELIGVFG-TDHYYNFDPFNELAPP 282
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSYDPFWRPPQMKALLNSVPLGKLVV 179
++ +Y+ M + D AVW++QGW + Y FW+ Q +A + VP+G +V
Sbjct: 283 SSDLGFLKQTSQQMYNNMLAADPKAVWVLQGWFIVDYPEFWQANQTQAWFSGVPIGGFIV 342
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+++V P W+ ++ FYG ++WCMLHNF G MYG + IA P+ AR S + M+G
Sbjct: 343 LDLWSDVAPAWNITEYFYGHYWLWCMLHNFGGRSGMYGRIPFIATNPIIAR-SLSDNMMG 401
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ E IEQN VVYDLMSEMA++ D++ WI QY+ RRYG+ +P + + W + TV
Sbjct: 402 TGLTPEAIEQNVVVYDLMSEMAWRSTAPDLEEWITQYTNRRYGKIMPEVVEVWMSMVDTV 461
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
+N T +N +F + PSI N+G +++Y
Sbjct: 462 FNATAYWARRNMGAPESFIALRPSI---------NFGD------------------NVFY 494
Query: 360 STSEVIRALELF-IASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
S + A +F + + + + ++ T+++D+ ++T QAL+ + + + N+I++Y ++D
Sbjct: 495 DPSVMFNAWHVFSLVNDSYVISTETFQFDISEITMQALSNFFMDTYFNLIKSYNVSDIES 554
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA--QNEEQEKQ---------YEW 467
+ S +E + MD + + LG W A+ A NE Q YE+
Sbjct: 555 FQRESITMMETISFMDLIASTQPELQLGVWTYRARLWAYPDNETPSLQNSSNSATLPYEF 614
Query: 468 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 527
NAR Q+T+W S+L DY K W GL+ D+YGPR ++ K +++SLE+ F
Sbjct: 615 NARNQLTLW----GPSDSVLHDYAFKLWGGLISDFYGPRWNLFLKTLLQSLENRIPFDAN 670
Query: 528 DWRREWIKLTNDWQNGRNVYPVESNGDALITSQWL 562
++ L W +YP+ G TS+++
Sbjct: 671 NFISNVQALEQQWVLESTIYPILPFGQGYNTSRYI 705
>gi|195115262|ref|XP_002002183.1| GI17241 [Drosophila mojavensis]
gi|193912758|gb|EDW11625.1| GI17241 [Drosophila mojavensis]
Length = 773
Score = 332 bits (852), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 195/574 (33%), Positives = 312/574 (54%), Gaps = 50/574 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL WGGPLP + Q +LQ++IL ELGM+ LPAFSG VP A++ VFP+A T
Sbjct: 212 MGNLRSWGGPLPPAHRQLQQLLQQRILAAQRELGMSVALPAFSGYVPTAMRRVFPNASFT 271
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q W DP +CC ++ DPLF ++G F+ + ++ YG ++HIY D F+E P
Sbjct: 272 QSDRWNHF-PDP-YCCVLFVEPQDPLFQQVGAMFLRRVIQVYG-SNHIYFSDPFNEMMPR 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V P Y+ AIY+ MQ D+DAVWL+QGW+F +W ++A L +VP G+++ L
Sbjct: 329 VREPNYVRYTAKAIYNSMQVVDADAVWLIQGWMFLKSVYWTNDLIEAYLTAVPRGRILAL 388
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + +YG P++WCML+NF GN+ ++G I G + AR+ N +MVGV
Sbjct: 389 DLQSEQFPQYERTHSYYGQPFVWCMLNNFGGNLGLFGSAQLIPSGIIAARSMPNGSMVGV 448
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN ++ L E A+ +++ ++ W +++ RYG + + W +L +VY
Sbjct: 449 GITPEGIGQNYALFALTLEQAWSPDELQLEDWFEYFALTRYGVNDTRLSQVWQLLRESVY 508
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ +G+ + GK L S + +P +WY+
Sbjct: 509 SF--------------------------QGRERMRGK-----YTLNKRPSLHHYPWVWYN 537
Query: 361 TSEVIRALELFIASGNELSASNT----YRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
+ V A L + + + ++ Y +DL+D+TRQ L + ++N+ A +
Sbjct: 538 VTMVYEAWRLMLEAKETVPLNDNRRAIYEHDLVDITRQCLQLSFDRFYVNLKSACRHKQL 597
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ V L+ + LEL DM+ +LA + +LLG WLE+AK+LA +EEQ YE+NAR Q+T W
Sbjct: 598 NRVEYLAGKLLELFADMERILASGEHYLLGNWLEAAKRLAPSEEQRPIYEFNARNQLTSW 657
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
N Q + DY K WSGL+ DY+ PR ++ + +I++L++ F ++++ ++
Sbjct: 658 GPNYQ-----IPDYATKQWSGLMSDYFQPRWNMFLEAVIQALKTQTPFNYSEFKQ---RV 709
Query: 537 TND----WQNGRNVYPVESNGDALITSQWLYNKY 566
N+ + N YP G S +Y K+
Sbjct: 710 ENEIELPFSNHTKAYPTSPVGSTWNISHDIYEKW 743
>gi|440799253|gb|ELR20308.1| alpha-N-acetylglucosaminidase family protein [Acanthamoeba
castellanii str. Neff]
Length = 854
Score = 332 bits (851), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 194/586 (33%), Positives = 301/586 (51%), Gaps = 67/586 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPL +W Q LQKKI+ R GM P+LP F+G VP ++ ++P+A +T
Sbjct: 240 MGNIQGWGGPLDPAWRKAQAELQKKIVERQRMFGMLPILPGFAGFVPDGIKRIYPTANLT 299
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ +W ++ Y L D L+ IGR I + E+G T HIYN DTF+E +PP
Sbjct: 300 KSADWAGFPH--QYTNVYFLSPLDSLYKTIGRMVIRRVTAEFG-TDHIYNADTFNEMSPP 356
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWR-PPQMKALLNSVPLGKLVV 179
P Y+++ A+Y GM + D A+W+MQGW F +D FW ++++ L+ V +++
Sbjct: 357 SADPTYLAAASRAVYEGMAAEDPQALWVMQGWSFVFDKFWEDKSRVRSYLSGVSDKDMLI 416
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
LDL ++ P WS + ++G ++WCMLHN G +YG L + P+ A + TM
Sbjct: 417 LDLASDNNPEWSKTDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSDPLLALATPGNTMLI 476
Query: 238 ------VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS--VPAIQ 289
VGVGM+ME IEQNPVVY+LMSEM ++ E D+ W+ +Y+ RRYG + + ++
Sbjct: 477 CGTCEQVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAAGLSSVG 536
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
+AW +L Y N+ VI D T G YG
Sbjct: 537 EAWELLREATY---------NQSVI------DYGWFGFTPGLGMGYGGVA---------- 571
Query: 350 SSYDHPHLWYSTSEVIRALELFIASG--NELSASNTYRYDLIDLTRQALAKYANELFLNI 407
+ ++ + AL LF+ S + + ++YD +DLTRQ LA +++
Sbjct: 572 ----------NAAKEVEALRLFLQSALTKGYAPNGPWQYDCVDLTRQVLANTFRDIYAQF 621
Query: 408 IEAYQLNDAHGVF------QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
AY AH + L L L+ D+D +LA + +LLG W++SA A +Q
Sbjct: 622 DAAYSAYAAHKTYTVDQLKSLGSALLTLIGDIDEILATNPNYLLGTWIQSALSWADTPDQ 681
Query: 462 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 521
Y++NAR QIT+W + Q + DY K+W+ L+R YY PR ++ +++++ +G
Sbjct: 682 ALHYQFNARNQITLWGPDGQ-----ITDYATKHWADLVRSYYQPRWTLFITSVLQAVYAG 736
Query: 522 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
+++R E ++L W Y G+ L + L KYL
Sbjct: 737 -----REYRGELLQLEQKWNRENTTYATTPTGNTLQVAYKLAAKYL 777
>gi|195473052|ref|XP_002088810.1| GE10991 [Drosophila yakuba]
gi|194174911|gb|EDW88522.1| GE10991 [Drosophila yakuba]
Length = 778
Score = 331 bits (848), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 193/569 (33%), Positives = 303/569 (53%), Gaps = 52/569 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPQWRRYQLLLQQEIIAAQRNLGMSVALPAFAGHVPRALKRLNPDSTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W ++CC ++ + LF EI F+++ + YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--QYCCGLFVEPKENLFNEIALNFLQKIITIYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTSAAIYESMRRIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++VG
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y E + ++ +D+ +W +S RYG ++ AW L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNKPLDLDSWFTNFSHTRYGVKDERLEQAWLQLKNSVY 519
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + G+Y V+ S P WY
Sbjct: 520 SFRG--------------------LQKMRGQY-----------VVTRRPSFNQEPFTWYD 548
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A L ++S + + Y +DL+D+TRQ L A++L++N+ A++
Sbjct: 549 ASAVLDAWHLLLSSRAIIPLEDDRYEMYEHDLVDITRQFLQISADQLYVNLRSAFRKRQV 608
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
LS + L+L +DM+ +LA FLLG WL+ AK+ A + ++ +E+NAR QIT W
Sbjct: 609 TRFEYLSTKLLKLFDDMELILASSRNFLLGNWLQQAKRAAPSPGEQTNFEFNARNQITAW 668
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES-----GDGFRLKDWRR 531
+ Q + DY K WSGL+ DYY PR ++ + + +L S G F+LK
Sbjct: 669 GPDGQ-----ILDYACKQWSGLVSDYYRPRWRLFLEDVTVALHSRRPFNGTAFKLK--VS 721
Query: 532 EWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+ I+L + + +VYPV G+ + SQ
Sbjct: 722 QEIELP--FSHKVDVYPVTPVGNTWLISQ 748
>gi|66801665|ref|XP_629757.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
gi|60463162|gb|EAL61355.1| hypothetical protein DDB_G0291998 [Dictyostelium discoideum AX4]
Length = 798
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 192/574 (33%), Positives = 308/574 (53%), Gaps = 50/574 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++GWGGP+ WL++Q LQ KIL R+ + GM PVLP F+G++P A+Q +FP A I+
Sbjct: 255 MGNVNGWGGPITLDWLEKQRDLQIKILERMRQYGMKPVLPGFAGHIPGAIQQLFPQANIS 314
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W + T+ L++TDPLF +I FI + + +G T H YN D F+E PP
Sbjct: 315 VLSTWCNFNG------TFYLESTDPLFAKITTMFIGELIDVFG-TDHFYNFDPFNELEPP 367
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ +Y+ ++Y + D AVW++QGW P FW+ Q +A + VP+G ++V
Sbjct: 368 SNDTDYLRQTSQSMYENVLLADPKAVWVLQGWFIVDAPEFWQAKQTEAWFSGVPIGGVLV 427
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+++V P W+T+ +YG ++WCMLHNF G MYG L I+ P+ AR + MVG
Sbjct: 428 LDLWSDVIPGWTTTNYYYGHYWVWCMLHNFGGRSGMYGRLPWISSNPITAR-GLSPNMVG 486
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G++ E IEQN VVYD+MSEM+++ + ++ W+ QY+ RRYG+ VP I D W L +TV
Sbjct: 487 IGLTPEAIEQNVVVYDMMSEMSWRSVQPNLTEWVTQYTHRRYGKLVPEIVDVWISLVNTV 546
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
+N T N +F + P + +G +S+ +P++ Y
Sbjct: 547 FNATAATARANMGAPESFIALRPQL---------TFGN------------NSFYNPNILY 585
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ V ++ + ++ T+ +D+ + T Q+L+ Y + + +IEA+ +D +
Sbjct: 586 NAWNVFSMVD-----DEYVISTETFEFDISEFTMQSLSNYFMDQYFLLIEAFNASDVQTL 640
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA---------QNEEQEKQ--YEWN 468
+S L+++ MD + + LG W A+ A QN YE+N
Sbjct: 641 STISIELLDIINYMDEIASTQSSLQLGLWTYRARLWAYPTNDIPTLQNSSNSNTAPYEFN 700
Query: 469 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 528
AR +T+W S+L DY K WSGL+ D+Y PR ++ K +++S+E+ F +
Sbjct: 701 ARNVLTLW----GPSNSVLHDYAFKLWSGLVSDFYSPRWQLFLKSLVQSVENRKPFNKES 756
Query: 529 WRREWIKLTNDWQNGRNVYPVESNGDALITSQWL 562
+ R L W + +YP G A TS+++
Sbjct: 757 FNRMVENLEEQWVVQQTIYPTVPVGQAYNTSKYI 790
>gi|195050088|ref|XP_001992825.1| GH13491 [Drosophila grimshawi]
gi|193899884|gb|EDV98750.1| GH13491 [Drosophila grimshawi]
Length = 771
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 195/574 (33%), Positives = 319/574 (55%), Gaps = 50/574 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLP + Q +LQ++I+ +LGM+ LPAF+G+VP L +FP+A T
Sbjct: 213 MGNIRGWGGPLPPAHRRLQQLLQQRIVQAQRDLGMSVALPAFAGHVPTGLPRIFPTANFT 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W DP +CC ++ +DPLF +G F+ + ++ YG ++HIY D F+E P
Sbjct: 273 SVERWNQFP-DP-YCCALFIEPSDPLFQLVGAQFLRRVIQIYG-SNHIYFSDPFNEMQPR 329
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P YISS AIY+ M+ D D VWL+QGW+F + +W ++A L +VP G+++VL
Sbjct: 330 IAEPGYISSTARAIYNSMRMVDKDPVWLLQGWMFLDNAYWSDELIEAFLTAVPRGRMLVL 389
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + +YG P++WCML+NF G + M+G I G + AR+ N++MVGV
Sbjct: 390 DLQSEQFPQYQRTFSYYGQPFVWCMLNNFGGTLGMFGSAHLINAGIMAARSMPNSSMVGV 449
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN ++ L E + K+++ W +Q+++ RYG + + AW +L +VY
Sbjct: 450 GITPEGIGQNYALFALTLEQGWSGSKLELSDWFDQFTLTRYGVNDTDLILAWQLLRGSVY 509
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + GKY L S P +WY+
Sbjct: 510 HFHG--------------------LQRMRGKY-----------ALNKRPSFNLKPWIWYN 538
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V+ A +L +A+ + + Y++DL+D+TRQ L + +++++N+ AY+ +
Sbjct: 539 ASSVVEAWQLLLAANQTIPVEDDRYALYKHDLVDITRQFLQQSFDQVYVNLKSAYRKSQL 598
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
L+ + LEL+ DM+ +LA + +LLG WLE+AK+LA + +Q YE+NAR Q+T W
Sbjct: 599 ARFEYLAAKLLELLADMERILASGEHYLLGNWLEAAKELAPSADQRHIYEFNARNQLTAW 658
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+ Q + DY K WSGL++DYY PR +++ + ++ S F +R+ ++
Sbjct: 659 GPSNQ-----ILDYATKQWSGLMQDYYTPRWSMFLDAVTLAMHSKRPFNATAFRQ---RV 710
Query: 537 TND----WQNGRNVYPVESNGDALITSQWLYNKY 566
N+ + N VYP E G + SQ +++K+
Sbjct: 711 ANEIELPFSNLTKVYPTEPVGSTWLISQEIHDKW 744
>gi|195398029|ref|XP_002057627.1| GJ18000 [Drosophila virilis]
gi|194141281|gb|EDW57700.1| GJ18000 [Drosophila virilis]
Length = 766
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 189/574 (32%), Positives = 313/574 (54%), Gaps = 50/574 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPLP + Q +LQ+ I+ ELGM+ LPAF+G+VP A++ VFP+A T
Sbjct: 214 MGNIRGWAGPLPPAHRRLQQLLQQLIVRAQRELGMSVALPAFAGHVPTAMRRVFPNANYT 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W + ++CC ++ DPLF ++G F+ + ++ YG ++HIY D F+E PP
Sbjct: 274 PAERWNNFPD--QYCCDLFVEPHDPLFQQLGAMFLRRVIQVYG-SNHIYFSDPFNEMQPP 330
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P Y+ S AIY+ M+ D +AVWL+QGW+F D FW ++A L +VP G+++VL
Sbjct: 331 LAEPGYMRSTAKAIYNSMREVDGNAVWLLQGWMFLKDIFWTDELIEAFLTAVPRGRILVL 390
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + + +YG P++WCML+NF G + ++G I G AR N+++VGV
Sbjct: 391 DLQSEQFPQYQRTHSYYGQPFVWCMLNNFGGTLGLFGSAQFIGSGIASARIMPNSSLVGV 450
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN ++ L E + ++ + W + +++ RYG + + AW +L VY
Sbjct: 451 GITPEGIGQNYAIFALTLEQGWSASELQLGDWFDHFALTRYGVNDTRLAQAWQLLRGGVY 510
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + GKY L +P WY+
Sbjct: 511 SFHG--------------------LQRMRGKY-----------ALNRRPGLNLNPWTWYN 539
Query: 361 TSEVIRALELFIASGNELSASN----TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
S V A +L +AS + ++ Y +DL+D+TRQ L + +++++N+ AY+
Sbjct: 540 GSSVTDAWQLLLASREMVPLTDDRYAIYEHDLVDITRQFLQQSFDQIYVNLRSAYRKEQL 599
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + L+ + LEL++DM+ +LA +LLG WLE+AK+LA +++ YE+NAR Q+T W
Sbjct: 600 NRLEYLAGKLLELLDDMERILASGVHYLLGTWLEAAKKLAPSDKLRPLYEFNARNQLTSW 659
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
N Q + DY K WSGL+ DYY PR A++ + ++++ F D+++ ++
Sbjct: 660 GPNGQ-----ILDYATKQWSGLMCDYYQPRWAMFLDAVTRAMQTHRPFNATDFKQ---RV 711
Query: 537 TND----WQNGRNVYPVESNGDALITSQWLYNKY 566
N+ + N +YP + G+ + S +Y K+
Sbjct: 712 ANEIELPFSNLTKMYPTKPMGNTWLISNDIYIKW 745
>gi|390334740|ref|XP_003724005.1| PREDICTED: uncharacterized protein LOC100893810 [Strongylocentrotus
purpuratus]
Length = 1043
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 176/471 (37%), Positives = 257/471 (54%), Gaps = 41/471 (8%)
Query: 100 KEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF 159
+E+ T HIYN DTF+EN P + Y+S+ +Y G+ GD VWLMQGWLF F
Sbjct: 573 EEFNGTDHIYNADTFNENQPRSNDSAYLSAASRGVYQGIVEGDPQGVWLMQGWLFQKTDF 632
Query: 160 WRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL 219
W P Q+KALL+ VP+G+++VLDLFAE +PI++ ++ FYG P+IWCMLHNF GN +YG L
Sbjct: 633 WGPSQIKALLHGVPIGRMIVLDLFAEARPIYNATQSFYGQPFIWCMLHNFGGNTGLYGKL 692
Query: 220 DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 279
D++ P EAR ++TM+G+G++ EGI QN V+Y+ +++M ++ E ++V WI +YS R
Sbjct: 693 DAVNKFPFEARQFNSSTMIGMGLTPEGILQNYVMYNFLTDMTWRSESMNVSKWIEEYSGR 752
Query: 280 RY----GRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNY 335
RY G S A + AW +L TVYN T D A P V PS N
Sbjct: 753 RYSPESGHSEEAAK-AWAILQATVYNNTGIDKDHQH----AVPVVRPS----------NK 797
Query: 336 GKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQA 395
K V +WY +EV +A + + L S+ +RYDL+D+TR
Sbjct: 798 TKSV-----------------IWYDYTEVAKAWGFLLQASETLGTSSLFRYDLVDVTRNV 840
Query: 396 LAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL 455
L A + + I+ ++ + + L+ DMD + + H +LLG WLE AK L
Sbjct: 841 LQDLAFDFYEQIMVSFHAKNITAIRGNGTLLCNLILDMDNITSSHQDWLLGTWLEDAKSL 900
Query: 456 AQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI 515
A N ++E YE+NAR QIT+W + DY NK W GLLR YY R ++ +++
Sbjct: 901 ATNHKEESLYEYNARNQITVWGPRGEH-----LDYANKQWGGLLRSYYYNRWQLFVQFLD 955
Query: 516 ESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+E + + + +W N +P + GD + S+ LY+KY
Sbjct: 956 GCIELHVPYDQSKFDMRSFIMETEWTNSTEKFPTKPVGDTVSISRALYSKY 1006
>gi|288927792|ref|ZP_06421639.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella sp. oral taxon 317 str. F0108]
gi|288330626|gb|EFC69210.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella sp. oral taxon 317 str. F0108]
Length = 734
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 193/571 (33%), Positives = 298/571 (52%), Gaps = 47/571 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W GPLP SW+ Q LQKKIL R +LGM PVLPAF+G+VP L+ +P AKIT
Sbjct: 198 MSNIDHWMGPLPMSWIKNQEKLQKKILRRTRDLGMKPVLPAFAGHVPEILKEKYPKAKIT 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W + R C + LD D LF +I + +I++Q K YG T HIY D F+E PP
Sbjct: 258 PLSIWGDFEDQYR-C--HFLDPFDSLFTDIQKTYIDEQTKLYG-TDHIYGVDPFNELAPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY+++ A IY +++ DS AVWL W+FSY W ++K+ + +VP K ++
Sbjct: 314 SWEPEYLANASAKIYDVLKNADSKAVWLQMTWMFSYQRKDWTDERIKSYITAVPDKKQIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD +AE +W S+ +Y P+IWC L NF GN + G + + EA + +MVG
Sbjct: 374 LDYYAERTEVWKFSESYYKQPFIWCYLGNFGGNTMIAGNIAEVDRRLNEAFANAE-SMVG 432
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG + NP++YD + E + + + + W Q++ RR G + + AW +L +
Sbjct: 433 VGSTLEGFDVNPIMYDFVFEKVWHKDGISLHDWTVQWAQRRVGTTDENAEKAWKLLIDKI 492
Query: 300 YN----CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
Y CT+G R PS+ +N+ K
Sbjct: 493 YVQYSLCTEGTLTNAR----------PSLTGHGNWTTKNWTK------------------ 524
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
Y+ +++ A L + S + Y+YD++++ RQ L Y L +AY+ D
Sbjct: 525 ---YNNRDLLEAWGLLLRS--KAITKIAYKYDIVNIGRQVLGNYFTVLRDEFTQAYERKD 579
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+ L L+ D++ LL FLLGPWL +A+ + +N E+ + YE NAR IT
Sbjct: 580 ISALTIKGNEMLSLLNDLEALLYTSPSFLLGPWLTNAQNMGRNMEESRYYEKNARNIITN 639
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W +TQ A L DYGN+ W+GLL+ YY PR ++ + +I +++ F + + ++
Sbjct: 640 W--STQGVA--LNDYGNRTWAGLLQGYYTPRWKMFIEEVISAVKQNKEFNNETFFKKVTD 695
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
W + YP+++ GD+ + + Y+KY
Sbjct: 696 EEWQWISKTENYPIQATGDSYLLANKFYHKY 726
>gi|340617022|ref|YP_004735475.1| alpha-N-acetylglucosaminidase [Zobellia galactanivorans]
gi|339731819|emb|CAZ95084.1| Alpha-N-acetylglucosaminidase, family GH89 [Zobellia
galactanivorans]
Length = 747
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 183/550 (33%), Positives = 294/550 (53%), Gaps = 25/550 (4%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G GGPLPQ+W+ Q+ LQ KIL R+ LGM PVL AF+G+VP L+ ++P A I
Sbjct: 197 MGNIDGMGGPLPQNWITQRKELQVKILNRMRSLGMKPVLQAFTGHVPQVLKKLYPEANIF 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ +W V+ TY LD TD LF +IG AFI++Q + YG T H+Y+ D F E PP
Sbjct: 257 QIEDWAGVEG------TYFLDPTDELFQKIGTAFIKKQTELYG-TDHLYDADCFIEVDPP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM-KALLNSVPLGKLVV 179
P ++ + ++Y M+ DS A W++QGW F + + + +A L+ +P + +V
Sbjct: 310 SKDPAFLKQVSESVYKSMELADSKATWVLQGWFFFFKKDFWTKERGRAFLDGIPKNRAIV 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMV 238
LDL+ E P W + FYG P+IW ++ N + M G L+ + EA TSE +
Sbjct: 370 LDLYGEKNPTWDKTDAFYGQPWIWNVICNEDQKVNMSGDLEEMQRQFQEAYTSEIGNNLK 429
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G+ EG+ NP+V D + E A+ +KV+V+ WI Y+ RYG P+++ AW +L +
Sbjct: 430 GIGVIPEGLGYNPIVQDFIFEKAWDPQKVNVQEWIEDYATIRYGTKSPSVKKAWQLLGES 489
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VY G T ++ P ++ EG ++ + +++ ++D
Sbjct: 490 VY----GRTRTMWSPLI----TTPRLMIFEEGSKEDIRHVRKDFKITETDPFAWD----- 536
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+ ++ +A L + NEL TY +DL ++ R+ L ++ ++ AYQ D
Sbjct: 537 FDVYKLAKAAGLLLGEANELQDVETYNFDLTNVYRELLFSLTHKSINDVSVAYQEKDRQA 596
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+ + ++ +L++D++ + ++ FLLG WLE AK E+++ YEWNART +T+W
Sbjct: 597 LDRSAKSLFKLMDDLEAITGANENFLLGKWLEDAKSWGSTPEEKEYYEWNARTIVTIWQP 656
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
+ LRDY K W+GL YY PR ++ ++ SL G F K + E ++
Sbjct: 657 YPE---GGLRDYAGKQWNGLFSGYYKPRWQLFVDHLRRSLTEGVDFDPKAYDAEVREMDY 713
Query: 539 DWQNGRNVYP 548
W +YP
Sbjct: 714 KWTRSHQIYP 723
>gi|297273081|ref|XP_001095618.2| PREDICTED: alpha-N-acetylglucosaminidase-like [Macaca mulatta]
Length = 691
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 189/517 (36%), Positives = 273/517 (52%), Gaps = 80/517 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ + P A
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRTSCMPVAS 263
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ S+ P GR I + +
Sbjct: 264 LPA-----SLPPSPG-----------------GRKLIH----------------SINLMQ 285
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKL 177
PP +P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q+ A+L +VP G+L
Sbjct: 286 PPSSAPSYLAAATTAVYEAMIAVDTEAVWLLQGWLFQHQPQFWGPAQIGAVLGAVPRGRL 345
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
+VLDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 346 LVLDLFAESQPVYTLTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTM 405
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
VG GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ +RYG S P AW +L
Sbjct: 406 VGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTNFAAQRYGVSHPDAGAAWRLLL 465
Query: 297 HTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
+VYNC+ + NR +V P L+ TS
Sbjct: 466 RSVYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS----- 495
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
+WY+ S V A L + S L+AS +RYDL+DLTRQA+ + + + AY +
Sbjct: 496 -VWYNRSSVFEAWRLLLTSAPSLAASPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKE 554
Query: 416 AHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 474
+ + EL+ +D LLA FLLG WLE A+ A +E + YE N+R Q+T
Sbjct: 555 LTSLLRAGGVLAYELLPALDELLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLT 614
Query: 475 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 511
+W E ++L DY NK +GL+ +YY PR ++
Sbjct: 615 LW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFL 646
>gi|298385999|ref|ZP_06995556.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
1_1_14]
gi|298261227|gb|EFI04094.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
1_1_14]
Length = 715
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 187/570 (32%), Positives = 296/570 (51%), Gaps = 54/570 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GGPLP W +Q+VLQKKIL R+ E GM PV F G VP+ L+ +P A++
Sbjct: 194 MGNLENIGGPLPDEWFKEQIVLQKKILARMREYGMKPVFQGFFGMVPSLLKEKYPEARLV 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S++ P +LD DPLF + + + + K YG+ + ++ D F E T
Sbjct: 254 EQGLWNSLQRPP------VLDPADPLFERMAKVWYAEYEKLYGK-ADLFGGDLFHEGGKT 306
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D ++ + + M+ + DA W++Q WL + P+ K LL + +
Sbjct: 307 GGID----VTDAARRVQTAMKRYNPDATWVIQAWLGN-------PK-KELLAGLDRKNTL 354
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR--TSENTT 236
++DL AE W K F G P++W + N+ GNI ++G LD+IA GPV+ + ++ + +
Sbjct: 355 IVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGGNIGLHGRLDAIATGPVDGQKDSAASPS 414
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
M G + EGIE NPVV+DL++EM ++ E +D+ W+ +YSVRRYG +++AW + +
Sbjct: 415 MKGTSSTPEGIEVNPVVFDLLNEMRWRSEHLDLDVWLKEYSVRRYGVEDENLKEAWTIFH 474
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
T Y G + V A P + I+ S++
Sbjct: 475 RTAYGTYTGHRRPSESVFCAPPSLKRDKITA----------------------SAWSQCR 512
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
++Y + + LF+ S + L ++TY+YD +D RQ LA E + N+++AY+ D
Sbjct: 513 IFYDPELFAQGVGLFLQSADRLKQTSTYQYDAVDFVRQYLADLGRETYYNLVDAYRAKDT 572
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
S RFL+L++D + LL+ H+ F +G WL+ A+ ++ E + YE NAR I W
Sbjct: 573 KQFDYWSERFLQLIKDQNELLSTHERFFVGRWLDMARLKSKQPELQDLYEHNARMLIGTW 632
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E S +RDY +K W GLL+DYY PR Y Y+ +LE G + D +
Sbjct: 633 ----TETLSPVRDYAHKEWGGLLKDYYLPRWTNYIAYLKGTLE-GRSLTVPD----SFQA 683
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
W N N Y +E+ D + T++ +Y+KY
Sbjct: 684 EKAWVNAHNKYVLEAGVDPVQTAKRMYSKY 713
>gi|156121099|ref|NP_001095696.1| alpha-N-acetylglucosaminidase precursor [Bos taurus]
gi|151554244|gb|AAI48148.1| NAGLU protein [Bos taurus]
gi|296476361|tpg|DAA18476.1| TPA: alpha-N-acetylglucosaminidase [Bos taurus]
Length = 667
Score = 314 bits (804), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 166/383 (43%), Positives = 229/383 (59%), Gaps = 35/383 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+GNW + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ +Q + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 441 TGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 500
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYNC S E + N+ P+ + L+ T+ +W
Sbjct: 501 VYNC-----------------------SGEECRGHNH-SPLVRRPSLQMVTT------VW 530
Query: 359 YSTSEVIRALELFIASGNELSAS 381
Y+ S+V A L + + + L++S
Sbjct: 531 YNRSDVFEAWRLLLTATSTLASS 553
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 60/115 (52%), Gaps = 5/115 (4%)
Query: 452 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 511
A A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR ++
Sbjct: 551 ASSPAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFT 605
Query: 512 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ ++ESL G F+ + R +L + G YP + GD + + L+ KY
Sbjct: 606 ETLVESLVQGVPFQQHQFDRNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 660
>gi|392588150|gb|EIW77482.1| glycoside hydrolase family 89 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 761
Score = 311 bits (798), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 181/535 (33%), Positives = 287/535 (53%), Gaps = 46/535 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP W+D Q VL K+I+ R+ +LGM PVLPAF+G VP A+ N++P+A I
Sbjct: 208 NIQGSWGGDLPTQWIDDQFVLGKQIVQRMVDLGMTPVLPAFTGFVPPAMHNLYPNASIVN 267
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W P++ L+ DPLF ++ ++FI +Q +G SHIY D ++EN P
Sbjct: 268 GSAWNDFA--PQFTNDSFLEPFDPLFAQVQQSFISKQQAAFGNVSHIYTLDQYNENDPYS 325
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPL----GK 176
P Y++++ AA +S +++ D DA WLMQGWLF S FW P +++A L VP
Sbjct: 326 GDPSYLTNISAATFSSLRAADPDATWLMQGWLFFSSADFWTPERVEAYLAGVPGDDDGSG 385
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDL++E +P W ++G +IWC LH++ GN+ G ++ P+ A S N +
Sbjct: 386 MLILDLYSEAQPQWQRLSSYFGKRWIWCELHDYGGNMGFEGNFANVTEAPLAALASPNVS 445
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQ-DAWNV 294
MVGVG++ EG+E N ++YD++ + A+ ++ + ++ RRY +P +AW
Sbjct: 446 MVGVGLTPEGMEGNEIIYDVLLDQAWSSSPINKTEYAQAWATRRYPADELPECAIEAWQT 505
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L TVY+ TD + I+ +++VT H
Sbjct: 506 LAATVYSNTDPGSQATVKSILELEPALSGLVNVTG-----------------------HH 542
Query: 355 P-HLWYST-SEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
P H++Y T + ++ AL+ + +G+ L A YRYDL+DLTRQ L +L+ +++
Sbjct: 543 PTHVFYDTNTTIVPALQQLVQAGHSTPSLLAIPEYRYDLVDLTRQLLVNRFIDLYADLLA 602
Query: 410 AYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-E 466
Y + V + LELV D+D +L ++ F L W ++A+ A Y E
Sbjct: 603 VYNTTSASSASVSAAGQPMLELVADLDKVLMTNENFQLSRWTDAARSWANGNASYAAYLE 662
Query: 467 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 521
+NAR QIT+W + + DY +K W GL+ DYYG R A++ +Y+ S +G
Sbjct: 663 YNARNQITLWGPKGE-----INDYASKQWGGLVGDYYGKRWAMFIQYLEGSKSNG 712
>gi|295086519|emb|CBK68042.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 727
Score = 311 bits (798), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 180/567 (31%), Positives = 278/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 361
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + +++G PY+WC L NF GN + G + T+ G
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + A+ D WI Q + RR G ++ AW +LY ++
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 480
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y P+ + G ++ LK + P + Y
Sbjct: 481 YTV-------------------PAALG--------QGALMNARPCLKGNGNWTTTPTVAY 513
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + E+ + +G ++ Y YD++++ RQ L Y +L EAY +
Sbjct: 514 SNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLL 571
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
Q +L+ D+D LL+ FLLG W+E A+ L +E + YE NART ++ W D
Sbjct: 572 KQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGTDEVSKNYYEENARTIVSTWGDK 631
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
Q L DY N+ W GL+ YY PR ++ +I S+ + F + + + D
Sbjct: 632 DQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEID 687
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W YP E G+ + + L NKY
Sbjct: 688 WVKSHERYPSEPVGNVVEIATLLMNKY 714
>gi|160883168|ref|ZP_02064171.1| hypothetical protein BACOVA_01137 [Bacteroides ovatus ATCC 8483]
gi|156111393|gb|EDO13138.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
Length = 737
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 180/567 (31%), Positives = 278/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 196 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 256 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 312 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQNKLLL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + +++G PY+WC L NF GN + G + T+ G
Sbjct: 372 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + A+ D WI Q + RR G ++ AW +LY ++
Sbjct: 432 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y P+ + G ++ LK + P + Y
Sbjct: 491 YTA-------------------PAALG--------QGTLMNARPCLKGNGNWTTTPTVAY 523
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + E+ + +G + TY YD++++ RQ L Y +L E Y +
Sbjct: 524 SNETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVLGNYFGKLRDEFAETYSRKQLPLL 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
Q +L+ D++ LL+ FLLG W+E A+ L +E + YE NART ++ W D
Sbjct: 582 KQKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLGIDEASKNYYEENARTIVSTWGDK 641
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
Q L DY N+ W GL+ YY PR ++ +I S+ + F + + + D
Sbjct: 642 DQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEID 697
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W YP E G+A+ + L NKY
Sbjct: 698 WVKSHERYPSEPVGNAVEIATLLMNKY 724
>gi|423292430|ref|ZP_17271008.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
CL02T12C04]
gi|423294620|ref|ZP_17272747.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
CL03T12C18]
gi|392661665|gb|EIY55241.1| hypothetical protein HMPREF1069_06051 [Bacteroides ovatus
CL02T12C04]
gi|392675811|gb|EIY69252.1| hypothetical protein HMPREF1070_01412 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 180/567 (31%), Positives = 278/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQNKLLL 361
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + +++G PY+WC L NF GN + G + T+ G
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + A+ D WI Q + RR G ++ AW +LY ++
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 480
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y P+ + G ++ LK + P + Y
Sbjct: 481 YTA-------------------PAALG--------QGTLMNARPCLKGNGNWTTTPTVAY 513
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + E+ + +G + TY YD++++ RQ L Y +L E Y +
Sbjct: 514 SNETLFEVWEMLLKAGEHRHS--TYEYDVVNIGRQVLGNYFGKLRDEFAETYSRKQLPLL 571
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
Q +L+ D++ LL+ FLLG W+E A+ L +E + YE NART ++ W D
Sbjct: 572 KQKGAEMKQLLRDVNTLLSTQSSFLLGKWIEDARSLGIDEASKNYYEENARTIVSTWGDK 631
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
Q L DY N+ W GL+ YY PR ++ +I S+ + F + + + D
Sbjct: 632 DQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEID 687
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W YP E G+A+ + L NKY
Sbjct: 688 WVKSHERYPSEPVGNAVEIATLLMNKY 714
>gi|291515668|emb|CBK64878.1| Alpha-N-acetylglucosaminidase (NAGLU) [Alistipes shahii WAL 8301]
Length = 713
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 190/567 (33%), Positives = 287/567 (50%), Gaps = 41/567 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W GPLP+ W+D QL LQ++IL R ELGM PVLPAF+G+VP L+ + P A+IT
Sbjct: 179 MSNIDRWQGPLPEEWIDGQLALQQRILARERELGMKPVLPAFAGHVPQELKRLHPDARIT 238
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W D R+ C++ LD DPLF I R F+ +Q + +G T HIY D F+E P
Sbjct: 239 RVSYWGGF--DDRYRCSF-LDPMDPLFAVIQREFLTEQTRLFG-TGHIYGADPFNEIDAP 294
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE ++ + IY M D +AVWL GWLF DP W ++A L +VP +L++
Sbjct: 295 TWDPETLAGMSRHIYESMAEVDPEAVWLQMGWLFYADPTHWTAENIRAFLGAVPQDRLLM 354
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F E IW +++F+G PY+WC L NF GN + G +++ +A + G
Sbjct: 355 LDYFCEFTEIWKQTEKFHGQPYLWCYLGNFGGNTMLSGNFHTVSARMEDAFAHGGDNLRG 414
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG N +Y+ + + A+ D WI + + RR G PA + W L +V
Sbjct: 415 VGSTLEGFGVNQFMYEFVLDKAWNTGIAD-DEWIARLADRRTGFRDPAARTGWRTLCDSV 473
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A + A P + EG + KP T+ Y P LW
Sbjct: 474 Y--TLPAQTGQSPLTNAHPAL--------EGNWHWTTKP----------TTGYRFPTLW- 512
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
R E +A +E +TYR+D++++ RQ L Y AY +D +
Sbjct: 513 ------RVWEELLAVDSE---RDTYRFDVVNIGRQVLGDYFLIERDRFAAAYAQHDRKAM 563
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+RR L+ D++ L ACH F L W+ +A+ + + YE NAR I++W D+
Sbjct: 564 DAAARRMTGLLADINLLTACHPEFSLERWIAAARGFGSDNASKDYYETNARMLISVWGDS 623
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
L DY ++ WSG++ YY PR ++ + ++E+ +G F + + RE
Sbjct: 624 YH-----LTDYASRTWSGMISTYYAPRWRLFIERVMEAARTGRMFDHEAFDREIRDFECR 678
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W + + GDA+ T++ L +KY
Sbjct: 679 WADASHPLTFPEAGDAVRTARELASKY 705
>gi|380692804|ref|ZP_09857663.1| putative alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 709
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 185/570 (32%), Positives = 291/570 (51%), Gaps = 54/570 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GGPLP W +Q VLQKKIL R+ E GM PV F G VP++L+ +P A +
Sbjct: 188 MGNLENIGGPLPDEWFKEQTVLQKKILARMREYGMKPVFQGFFGMVPSSLKEKYPEAHLV 247
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S++ P +LD DPLF ++ + + + K YG+ + ++ D F E T
Sbjct: 248 EQGLWNSLQRPP------VLDPADPLFEQMAKVWYTEYEKLYGK-ADLFGGDLFHEGGKT 300
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D ++ + + M+ + DA W++Q WL + P+ K LL + +
Sbjct: 301 GGID----VTDAARRVQTAMKQYNPDATWVIQAWLGN-------PK-KELLAGLDRKHTL 348
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART--SENTT 236
++DL AE W K F G P++W + N+ NI ++G LD+IA GP++ R + +
Sbjct: 349 IVDLAAEFWDNWRKRKGFDGFPWLWSHISNYGANIGLHGRLDAIATGPIDGRKDPEASPS 408
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
M G + EGIE NPVV+DL++EM ++ E +D+ W+ +YSVRRYG ++ AW + +
Sbjct: 409 MKGTSSTPEGIEVNPVVFDLLNEMRWRSEYLDIDTWLKEYSVRRYGAEDENLKKAWIIFH 468
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
T Y G + V A P + I+ S++
Sbjct: 469 RTAYGTYSGHRRPSESVFCAPPSLKRDKITA----------------------SAWSQCR 506
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
++Y + + LF+ S + L ++TY+YD +D RQ LA E + N+++AY+ D
Sbjct: 507 IFYDPDLFAQGVGLFLQSADHLKQTSTYQYDAVDFVRQYLADLGREAYYNLVDAYRAKDT 566
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
S RFL+L++D + LL+ H F +G WL+ A+ ++ E + YE NAR I W
Sbjct: 567 KQFDYWSERFLQLIKDQNELLSTHKCFFVGRWLDMARSKSKQPELQDLYEHNARMLIGTW 626
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
E S +RDY +K W GLL+DYY PR Y Y+ +LE G + + ++
Sbjct: 627 ----TETLSPVRDYAHKEWGGLLKDYYLPRWTNYIAYLKGTLE-GRSLTVPN----SFQV 677
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
W N N Y +E+ D + T++ +Y KY
Sbjct: 678 EKAWVNAHNKYVLETGVDPVETAKRMYRKY 707
>gi|281210062|gb|EFA84230.1| hypothetical protein PPL_03307 [Polysphondylium pallidum PN500]
Length = 744
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 190/566 (33%), Positives = 297/566 (52%), Gaps = 53/566 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGG LPQSW+ Q LQ KIL R+ E GM+PV P F+G+VP A + +PSA I
Sbjct: 214 MGNLDGWGGVLPQSWIKGQHELQIKILKRMSEYGMSPVFPGFAGHVPVAFKQFYPSANIV 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHI--YNCDTFDENT 118
+L +W + T L TDP++ + F + Q + YG + I ++ D F+E
Sbjct: 274 ELPSWHGFNA------TNHLLTTDPMYDIVADRFYQVQNEIYGAYAKIDYFSIDPFNELI 327
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
PP +S ++++ + I++ + + D+ W++Q W + FW Q+ + L VP+G+L+
Sbjct: 328 PPSNSSQFLNECSSRIFNAINRFNPDSTWVLQNWFLN-SAFWGDGQVASFLGGVPIGRLI 386
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VLDL++E+KP+W+ + + G +IW MLHNF G + G + IA P+EA+ S + TMV
Sbjct: 387 VLDLWSELKPLWNRTANYQGHKWIWNMLHNFGGRPTISGRMPIIANEPLEAKAS-SPTMV 445
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ E IEQN +VYDLMSEM ++ D+ W++ Y RRYG ++P ++ W +L +T
Sbjct: 446 GIGLTPEAIEQNVIVYDLMSEMGWRSRSFDLNLWVDAYVTRRYGVNLPNLKPVWKMLAYT 505
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VY P+ P+ NY ++K+ L + Y +P +
Sbjct: 506 VYFS---------------PNRSPA----------NY---IAKKPSLDFQLGLYYNPVV- 536
Query: 359 YSTSEVIRA-LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
++ A EL + +S TYRYDL ++T QAL+ Y N + ++Y +D
Sbjct: 537 -----IVDAWRELLAVDSTIVRSSETYRYDLAEITLQALSNYFNGNLKQLYQSYYASDFQ 591
Query: 418 GVFQLSRRFLEL-VEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
FQ +R+ + MD + LG W A++ A + + + YE+NAR QIT+W
Sbjct: 592 -TFQSARQNCSFALRAMDAVADTVQLLKLGKWTADARKWATDNNERELYEYNARNQITLW 650
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
DY NK+WSGL+ DYY PR I+F+++ ++ F +
Sbjct: 651 GWKDMGNP----DYANKWWSGLIADYYFPRWQIFFEHLEHAIFDKSKFNEHSLAVNTMLH 706
Query: 537 TNDWQNGRNVYP--VESNGDALITSQ 560
W N+YP V SN D S+
Sbjct: 707 EERWNKQTNIYPSDVNSNDDVHTVSK 732
>gi|237719130|ref|ZP_04549611.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451509|gb|EEO57300.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 737
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 181/567 (31%), Positives = 280/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 196 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 256 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 312 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + +++G PY+WC L NF GN + G + T+ G
Sbjct: 372 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + A+ D WI Q + RR G ++ AW +LY ++
Sbjct: 432 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y ++ +G N +P K + TS+ + Y
Sbjct: 491 YTAP---------------------AALGQGTLMN-ARPCLKGNGNWTTTST-----VAY 523
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + E+ + +G ++ Y YD++++ RQ L Y +L EAY +
Sbjct: 524 SNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLL 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
Q +L+ D+D LL+ FLLG W+E A+ L +E + YE NART ++ W D
Sbjct: 582 KQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGIDEASKNYYEENARTIVSTWGDK 641
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
Q L DY N+ W GL+ YY PR ++ +I S+ + F + + + D
Sbjct: 642 DQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEID 697
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W YP E +A+ + L NKY
Sbjct: 698 WVKSHERYPSEPVSNAVEIATLLMNKY 724
>gi|383114162|ref|ZP_09934927.1| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
gi|382948607|gb|EFS30964.2| hypothetical protein BSGG_1664 [Bacteroides sp. D2]
Length = 727
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 181/567 (31%), Positives = 280/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W GPLP+ WLD Q LQK+I+ R + M P+LPAF+G+VP+ L+ ++P AKI+
Sbjct: 186 MSNLDYWQGPLPKEWLDTQEALQKQIVARERQFNMRPILPAFAGHVPSELKRIYPEAKIS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ +W + R ++ LD DPLF I + F+E+Q K +G T HIY D F+E PP
Sbjct: 246 RMSSWGGFEDKYR---SHFLDPLDPLFATIQKEFLEEQTKLFG-TDHIYGADPFNEVAPP 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PE++++ IY M D DA WL WLF D W +++A L +VP KL++
Sbjct: 302 SWEPEFLANCSKHIYQSMTHVDPDATWLQMTWLFYIDRHLWTNERVEAFLKAVPQDKLLL 361
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + +++G PY+WC L NF GN + G + T+ G
Sbjct: 362 LDYYCENTEVWKQTDRYFGQPYLWCYLGNFGGNTMLAGNTKEVGKRIENVYTNGGENFSG 421
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + A+ D WI Q + RR G ++ AW +LY ++
Sbjct: 422 LGSTLEGFDVNPFMYEYVFSKAWDCNLPD-SVWIEQLADRRIGLRNQQMRRAWKLLYDSI 480
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y ++ +G N +P K + TS+ + Y
Sbjct: 481 YTAP---------------------AALGQGTLMN-ARPCLKGNGNWTTTST-----VAY 513
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + E+ + +G ++ Y YD++++ RQ L Y +L EAY +
Sbjct: 514 SNETLFEVWEMLLKAGEHRHSA--YEYDVVNIGRQVLGNYFGKLRDEFAEAYSRKQLPLL 571
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
Q +L+ D+D LL+ FLLG W+E A+ L + + YE NART ++ W D
Sbjct: 572 KQKGAEMKQLLRDVDTLLSTQSSFLLGKWIEDARSLGTDGASKNYYEENARTIVSTWGDK 631
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
Q L DY N+ W GL+ YY PR ++ +I S+ + F + + + D
Sbjct: 632 DQS----LNDYANRTWGGLVSGYYAPRWEMFIDEVIRSVSNKQPFNADAFHQRVTQFEID 687
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W YP E G+A+ + L NKY
Sbjct: 688 WVKSHERYPSEPVGNAVEIATLLMNKY 714
>gi|357622373|gb|EHJ73879.1| putative alpha-N-acetyl glucosaminidase [Danaus plexippus]
Length = 780
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 180/534 (33%), Positives = 282/534 (52%), Gaps = 46/534 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+HGWGGPLPQSW D+Q +Q+ + +++LGM PV PAF+G+VP A + +FP+
Sbjct: 206 MGNVHGWGGPLPQSWHDRQKQIQEVVTDLMFKLGMIPVFPAFNGHVPKAFEKIFPNTTFH 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W K D +CC +D +P F I + F+ + G +SHIY D F+E
Sbjct: 266 PVETW--NKFDEDYCCNLFVDPREPDFKMISKMFMREITAGLG-SSHIYTADPFNEIKIQ 322
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
S + AI+S + D DAVWL+Q W+F ++P W ++ + L SVP G+++V
Sbjct: 323 PWSTSLVVETAKAIFSSISEYDKDAVWLVQNWMFVHNPLLWPLKRVNSFLTSVPNGRMLV 382
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P + + +YG P+IW MLHNF G + M+G +I E R EN+TMVG
Sbjct: 383 LDLQSEQWPQYDLYQMYYGQPFIWSMLHNFGGTLGMFGNTKTINKDVYEVRKRENSTMVG 442
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
+G++ EGI QN V+YDLM E A++ V D++ W++ Y+ RRYG + +I W L +
Sbjct: 443 IGLTPEGINQNYVIYDLMLESAWRKGPVPDLEEWVSDYAERRYGCNATSI--GWKYLLRS 500
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYN T ++ GKY V+ S P W
Sbjct: 501 VYNFTG--------------------LNRIRGKY-----------VMTRRPSFNIRPWAW 529
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y ++ AL+ F+ N +++ + +DL+D+TRQAL ++++N+ N +
Sbjct: 530 YKGHDLFEALKNFVYVQNPACSTSGFLHDLVDVTRQALQYKIEQIYMNLQNDRYSN--YM 587
Query: 419 VFQLS-RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
VF + F++ + DM +LA F + WL SA+ ++ + Y++NAR QIT+W
Sbjct: 588 VFNYTISSFIDAMTDMQNILATSSDFKITSWLSSARAISNLPLESSLYDFNARNQITLWG 647
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 531
N + + DY K W+ L + YY PR +I+ +++ + F K +R
Sbjct: 648 PNGE-----ISDYACKQWAELFKYYYIPRWSIFLSMALDAKTRNEPFDEKGAQR 696
>gi|119480815|ref|XP_001260436.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
181]
gi|119408590|gb|EAW18539.1| alpha-N-acetylglucosaminidase, putative [Neosartorya fischeri NRRL
181]
Length = 748
Score = 305 bits (780), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 175/525 (33%), Positives = 287/525 (54%), Gaps = 46/525 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP SW+D Q LQKKI+ R+ ELGM PVLPAF+G VP A+ V P+A +
Sbjct: 197 NIQGSWGGDLPYSWIDSQFELQKKIVRRMVELGMTPVLPAFTGFVPRAISRVLPNATVVN 256
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W D R+ L+ DP F + R+FI++Q + YG +H+Y D ++EN P
Sbjct: 257 GSRWGGF--DERYTNDTFLEPFDPSFTRLQRSFIQKQQQAYGNITHVYTLDQYNENDPYS 314
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
+Y+ ++ + ++S D +AVWLMQGWLF S FW ++KA L+ V + + ++V
Sbjct: 315 GDLDYLRNVTRNTWLSLKSADPNAVWLMQGWLFYSNSDFWTDERVKAYLSGVEVDQDMLV 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E +P W ++ +YG P+IWC LH++ GN+ +YG + ++ +A + + ++VG
Sbjct: 375 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNVTVNATQALAASD-SLVG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-----GRSVP-AIQDAWN 293
G++MEG E N ++YDL+ + A+ + +D + + + RY G +VP + AW+
Sbjct: 434 FGLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHNWVKTRYSSGVRGSAVPEELHQAWD 493
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
+L TVYN T+ + I ++ PSI G G D
Sbjct: 494 ILRTTVYNNTNLTSTAVSKSIF---ELQPSI----SGLLNRTG----------------D 530
Query: 354 HP-HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
HP + Y + +++A +L ++ ++ L + + YD++D+TRQ +A +++N++
Sbjct: 531 HPTTVNYDPAALVQAWQLMDSAASKDRSLWSQPAFLYDMVDITRQVMANAFIPMYINLVS 590
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 469
YQ V ++L+ D+D +L+ +D F L W+ SA+ A+N+ + YE+NA
Sbjct: 591 TYQA--GASVSTDGSNLIQLLRDVDSVLSTNDNFRLSTWIRSARSWARNDTEADFYEYNA 648
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 514
R QI +W + DY +K W GL+ YY PR + Y+
Sbjct: 649 RNQIALW-----GPMGEINDYASKQWGGLVSAYYIPRWQTFLHYL 688
>gi|449541595|gb|EMD32578.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
B]
Length = 752
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 176/550 (32%), Positives = 297/550 (54%), Gaps = 39/550 (7%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP W++ Q LQK+IL R+ ELGM PVLPAF+G VP A+ V +A I
Sbjct: 198 NIQGSWGGELPMQWVNDQFALQKQILARMTELGMTPVLPAFTGFVPRAMSTVHSNASIVN 257
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTPP 120
W + P L+ DPLF + ++FI +Q + YG SHIY D ++EN P
Sbjct: 258 GSQW-APGFPPSLTNVSFLEPFDPLFATLQKSFIAKQQEAYGANISHIYTLDQYNENNPF 316
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KLV 178
+ Y+SS+ ++ +++ D DAVW++QGWLF S + FW +++A L VP ++
Sbjct: 317 SGNLSYLSSISEGTFTSLRAADPDAVWMLQGWLFFSSEAFWTNERIEAYLGGVPTNDSMI 376
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VLDL++E +P W+ + ++G ++WC LH++ G I + G LD+I GP+ A S ++M
Sbjct: 377 VLDLYSEAQPQWNRTSSYFGKQWVWCELHDYGGTIGLEGNLDAITTGPIAALNSPGSSMK 436
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVP-AIQDAWNVLY 296
G+G++MEG E N +VYDL+ + A+ +++ +++ + RRY +P A Q+AW +L
Sbjct: 437 GMGLTMEGQEGNEIVYDLLLDQAWSSSPINIASYVKGWVSRRYLVEPLPSAAQEAWRILS 496
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
TVYN D ++ I +++P + + V++ +L + YD
Sbjct: 497 TTVYNNQD---PNSQSTIKNIYELEPVLTGL-----------VNRTGILPT-VIPYD--- 538
Query: 357 LWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
+ S ++ AL+L + A LS + +D++D++RQ L+ + + +I+ Y
Sbjct: 539 ---TNSTIVPALQLLVKAKAQNAALSTVPEFVHDVVDVSRQLLSNRFIDAYTALIDTYNN 595
Query: 414 ND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNAR 470
+ + V + + + ++ +D LLA ++ FLL W+ A+ L+ +E Y E+NAR
Sbjct: 596 TNVTSDAVIRAGQPLMTILSQLDALLATNENFLLSSWIAQARNLSHGDESYAAYLEYNAR 655
Query: 471 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR 530
QIT+W + + + DY +K W+GL+ YY R + Y+ + F +
Sbjct: 656 NQITLWGPDGE-----INDYASKAWAGLISTYYAARWQTFIDYLASTKRLARPFDTSAFS 710
Query: 531 REWIKLTNDW 540
+ I L +W
Sbjct: 711 NQMILLGQEW 720
>gi|71001188|ref|XP_755275.1| alpha-N-acetylglucosaminidase [Aspergillus fumigatus Af293]
gi|66852913|gb|EAL93237.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
Af293]
gi|159129357|gb|EDP54471.1| alpha-N-acetylglucosaminidase, putative [Aspergillus fumigatus
A1163]
Length = 756
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 171/521 (32%), Positives = 285/521 (54%), Gaps = 38/521 (7%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP SW+D Q LQKKI+ R+ ELGM PVLPAF+G VP A+ V P+A +
Sbjct: 205 NIQGSWGGELPYSWIDSQFELQKKIVRRMVELGMTPVLPAFTGFVPRAVSRVLPNATVVN 264
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + D R+ L+ DP F+ + R+FI++Q + YG +HIY D ++EN P
Sbjct: 265 GSRW--EEFDERYTSDTFLEPFDPSFMRLQRSFIKKQQQAYGNITHIYTLDQYNENAPYS 322
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
+Y+ ++ + ++S D +AVWLMQGWLF S FW ++KA L+ V + + ++V
Sbjct: 323 GDLDYLHNVTHNTWLSLKSADPNAVWLMQGWLFYSSSGFWTDERVKAYLSGVEVDQDMLV 382
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E +P W ++ +YG P+IWC LH++ GN+ +YG + ++ +A + + ++VG
Sbjct: 383 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNVTVNATQALAASD-SLVG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-----GRSVP-AIQDAWN 293
G++MEG E N ++YDL+ + A+ + +D + + ++ RY G +VP + AW+
Sbjct: 442 FGLTMEGQEGNEIMYDLLLDQAWSRQPIDTDHYFHNWAKTRYSSGVRGSAVPEELYQAWD 501
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
+L T YN T+ + I ++ PSI + + T SYD
Sbjct: 502 ILRITAYNNTNLTSTAVSKSIF---ELQPSISGLLNRTSHH------------PTTVSYD 546
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
L + R ++ + L + + YD++D+TRQ ++ ++ N++ YQ
Sbjct: 547 PAAL----VQAWRLMDSAASKAPSLWSQPAFLYDMVDITRQVMSNAFIPVYTNLVSTYQA 602
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
+ V ++L+ D+D +L+ +D F L W++SA+ +N+ + YE+NAR Q+
Sbjct: 603 GGS--VSTDGSNLIQLLRDLDSVLSTNDNFRLSTWIQSARSWVRNDTEADFYEYNARNQV 660
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 514
T+W + + DY +K W GL+ YY PR + Y+
Sbjct: 661 TLWGPKGE-----INDYASKQWGGLVSSYYIPRWQKFLNYL 696
>gi|449541596|gb|EMD32579.1| glycoside hydrolase family 89 protein [Ceriporiopsis subvermispora
B]
Length = 754
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 169/552 (30%), Positives = 298/552 (53%), Gaps = 44/552 (7%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP W++ Q LQK+IL R+ ELGM P LPAF+G VP A+ ++P+A I
Sbjct: 201 NIQGSWGGALPMQWVNDQFALQKQILTRMTELGMTPALPAFTGFVPRAMSTLYPNASIVN 260
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTPP 120
W + L+ DPLF + ++FI +Q + YG +HIY D ++EN P
Sbjct: 261 GSAWSGFPAS--LTNVSFLEPFDPLFSTLQKSFITKQQQAYGTNVTHIYTLDQYNENNPF 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KLV 178
+ Y+SS+ A ++ +++ D DA+W++QGWLF S + FW +++A L VP ++
Sbjct: 319 SGNISYLSSVSAGTFASLRAADPDAIWMLQGWLFFSSETFWTDERIQAYLGGVPTNDSMI 378
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VLDL++E +P W+ + ++G ++WC LH + GN+ + G L++I GP+ A +S+ ++M
Sbjct: 379 VLDLYSEAQPQWNRTSSYFGKQWVWCELHGYGGNMGLEGNLNAITAGPIAALSSQGSSMK 438
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNVLY 296
G+G++MEG E N +VYD++ + A+ +D+ +++ + RRY +P A Q+AW +L
Sbjct: 439 GMGLTMEGQEGNEIVYDVLLDQAWSSAPIDIASYVKSWVARRYTVEPLPSAAQEAWQILS 498
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
TVYN D ++ I + +++P++ + + HP
Sbjct: 499 TTVYNNQD---PNSQATIKSIYELEPTLTGLVN--------------------RTGHHPT 535
Query: 357 L--WYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 411
L + + + V+ AL+L + + + L+A + YD +D++RQ L+ + + +++ Y
Sbjct: 536 LIPYDTNTTVVPALQLLVKAKEQNAALAAIPEFVYDAVDVSRQLLSNRFIDAYTGLVDTY 595
Query: 412 QLNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWN 468
+A V + + + ++ +D LLA ++ FLL W+ A+ + +E Y E+N
Sbjct: 596 NNANATSDAVVRAGQPLMVILSQLDALLATNENFLLSSWIAQARNWSHGDESYAAYLEYN 655
Query: 469 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 528
AR Q+T+W + + + DY +K W+GL+ YY R + Y+ + F
Sbjct: 656 ARNQVTLWGPDGE-----INDYASKAWAGLISTYYSSRWQTFVDYLASTKRLSRPFDSSA 710
Query: 529 WRREWIKLTNDW 540
+ + I L W
Sbjct: 711 FSSQMILLGQQW 722
>gi|281200618|gb|EFA74836.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
Length = 469
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 183/520 (35%), Positives = 278/520 (53%), Gaps = 57/520 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ W G L W+ Q LQ +IL R+ + GM VLP F+G+VP AL++ +P+A IT
Sbjct: 1 MGNVNEWAGNLTLGWMVDQRDLQIQILTRMRQFGMQAVLPGFAGHVPEALKSHYPNANIT 60
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QL +W T + + F+ I Q YG T H YN D F+E PP
Sbjct: 61 QLSSW---------NMTVYIHQSPNTFMSI-------QQDLYG-TDHFYNFDPFNELEPP 103
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+ + ++++ + + D +W++QGWLF YD FW+PPQ++A L+ VP+GK++V
Sbjct: 104 SSDPAYLKNCSQSMFNNLIAVDPQGIWVLQGWLFVYDTEFWQPPQIEAFLSGVPIGKMIV 163
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+A+V W + FYG +IWCMLHNF G MYG + I+ P+EAR S + MVG
Sbjct: 164 LDLWADVDAGWKITNYFYGHNWIWCMLHNFGGRSGMYGKIPFISTNPIEAR-SLSPNMVG 222
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ E IEQN +VYDLMSEMA++ D+K W++QY RRYG+ + + D W L TV
Sbjct: 223 TGLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQYVTRRYGKYIEVLADTWYELVGTV 282
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
+NC+ VT+G VS L TS Y P +
Sbjct: 283 FNCS----------------------IVTKGPVTIL---VSVRPQLNFTTSLYYDPIV-- 315
Query: 360 STSEVIRALELFIASGN-ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+ +A F++ + + ++T+ +DL ++T QAL+ L + A+ LND++
Sbjct: 316 ----ISKAWSAFLSIDDLHVVNTSTFSFDLTEITTQALSNLFMTTELQMNAAF-LNDSYE 370
Query: 419 VFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
F LS L +++D++ +++ + L+G W A+ L E + YE NAR QIT+W
Sbjct: 371 EFSLLSDALLSIIQDINTIVSTQEMLLVGNWTARARALTPANETTELYEMNARNQITLW- 429
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 517
S DY K W GL D+Y R ++ + + ++
Sbjct: 430 ---GPPDSFDHDYAYKLWGGLTEDFYLARWTLFSQSIFKT 466
>gi|313203962|ref|YP_004042619.1| alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
gi|312443278|gb|ADQ79634.1| Alpha-N-acetylglucosaminidase [Paludibacter propionicigenes WB4]
Length = 738
Score = 302 bits (773), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 196/573 (34%), Positives = 290/573 (50%), Gaps = 40/573 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ WGGPLP S+++ Q LQ+ IL R LGM P+L AF+G+VP L+ + PSAKIT
Sbjct: 196 MANMDKWGGPLPISYIEGQKKLQQHILQRSRALGMKPILSAFAGHVPEQLKTLRPSAKIT 255
Query: 61 QLG-NWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
++ W + ++ TY LD TD LF EI + F+ Q K YG T H+Y+ D F+E TP
Sbjct: 256 RIEPGWGGMAAE---YTTYFLDPTDNLFGEIQKRFLTVQQKLYG-TDHLYSADPFNEITP 311
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLV 178
P P+Y++++G IY M D +A+W W F DP W P++ A++++VP GKL
Sbjct: 312 PSWEPDYLANVGKTIYETMSQVDKEAIWYQMSWTFYNDPTHWTRPRLSAMIHAVPQGKLF 371
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
LD E + + S FYG P+IWC L NF N + L+ + + + + V
Sbjct: 372 FLDYNCEEEEFFRKSDNFYGAPFIWCYLGNFGANTHLVAPLNKVV--NRLGKLTYGSACV 429
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQ-HEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
GVG ++EGI NP +Y+ + EM ++ E V I Y+ RR G A+ +AW +L
Sbjct: 430 GVGSTLEGINVNPEIYETVLEMPWRADETVTADTLIRHYAERRAGARDKAVIEAWQLLRQ 489
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
V D A +V V P + +T + +P +
Sbjct: 490 HV--LVDTAVGIWNHCVVF--QVSP-VTDLTRAFWAT-------------------NPKI 525
Query: 358 WYSTSEVIRAL-ELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
Y ++ AL +F AS N ++ YR+D+++LTRQAL Y L+ ++EAY +
Sbjct: 526 PYRNVDLAIALNRMFQASANS-KKTDAYRFDVVNLTRQALGNYGTVLYHKMMEAYSRKNL 584
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ S FL+L +++DGLLA FLLG WL A+ ++ YE NAR IT W
Sbjct: 585 IDFRKYSGEFLQLGQEIDGLLATRHEFLLGKWLADARSWGTTPAEKAYYERNAREIITTW 644
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+ L DY N+ W+GLLR YY PR + + SL +G + K +
Sbjct: 645 ----HKAGGGLTDYSNRQWNGLLRSYYLPRWKEFINRLDTSLSTGKDYDDKAFAAWCSAF 700
Query: 537 TNDWQNG-RNVYPVESNGDALITSQWLYNKYLQ 568
W + + Y GDA+ + L+ KY Q
Sbjct: 701 EQHWVDSPSSAYSDTETGDAVKMAFELFGKYKQ 733
>gi|404406438|ref|ZP_10998022.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 726
Score = 297 bits (761), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 186/577 (32%), Positives = 290/577 (50%), Gaps = 51/577 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLPQSW+D QL LQ++I+ R ELG+ PV +F+G+VP AL+ +FP A I
Sbjct: 190 MANIDAWHGPLPQSWIDGQLELQRRIIARERELGIQPVFTSFTGHVPKALKTLFPDADIE 249
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W S + R +Y L+ +PLF I +A++++Q + +G +S +Y D F+E PP
Sbjct: 250 RLNPWTSFE---RPYNSYYLNPAEPLFNRIQQAYMQEQRRLFGESS-VYGVDPFNELDPP 305
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
PEY++ Y + D DAVWL W+F + W P ++KA L +VP GKL++
Sbjct: 306 NWDPEYLARAARLTYESITQFDKDAVWLQMAWVFYHKRRDWTPERLKAYLCAVPDGKLLM 365
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + +W +++ FYG P+IW L NF GN + G + ++ A VG
Sbjct: 366 LDYYCDKVELWRSTESFYGQPFIWSYLGNFGGNTMLAGDVKDVSRKLDRAYAEAGRNFVG 425
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ NP +Y+ + + A+ + D WI++ + R GR + AW +LY +
Sbjct: 426 IGCTLEGLDVNPFMYEYVLDRAWT-QLYDDAGWIDRLADRHSGRIDVHYRQAWRILYDKI 484
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y G NR V +P K S + PHL Y
Sbjct: 485 YCAPSG----NRSAAVC-------------------ARPNMK------GRSKWSGPHLDY 515
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+++R E + E +AS+ R+D +++ RQ L Y L I A + D V
Sbjct: 516 DNRDLLRVWEQLTLARPERTASS--RFDCVNIPRQCLENYFGNLNERCIAACRGGDRETV 573
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+LS R LEL++D+D L+A FLLG W+ A+++ ++ +E +AR +T W
Sbjct: 574 ARLSARLLELLDDIDRLVAADAYFLLGKWIADARRMGATPAEKDYFERDARNILTTW--- 630
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---RLKDWRREWIKL 536
L DY N+ WSGL+ DYY R ++ + E + L+D+ EW+
Sbjct: 631 -GGRGYSLNDYANRTWSGLVSDYYKERWRRFYDRLQSDGEPDEDALLQELQDFEWEWVG- 688
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 573
+ GR + GDA + LY KY G F
Sbjct: 689 ----RKGR--FAERPRGDAFRLCRSLYTKYAAEIGRF 719
>gi|346324333|gb|EGX93930.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
Length = 751
Score = 297 bits (760), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 172/547 (31%), Positives = 293/547 (53%), Gaps = 49/547 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LPQ+W++ Q LQKKI+ R+ ELGM P+LPAF G VP + V+P+ + +
Sbjct: 203 NIQGSWGGDLPQAWIEDQFELQKKIVKRMIELGMTPILPAFPGFVPENITRVWPNVSLAE 262
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W R+ + DP F E+ +AF+ +Q + YG + + D F+EN P
Sbjct: 263 SPIWSGFSG--RFTADKYITPYDPHFAELQKAFLTKQNEAYGNVTSFWTLDQFNENKPAS 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK-LVV 179
+Y+ ++ + +++ D AVW+MQGWLF+ D +W ++K+ L+ VP+ + +++
Sbjct: 321 GELDYLKNVSHNTWQTLKAADPSAVWVMQGWLFASDKTYWIDDRVKSFLDGVPVNEDMLL 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P W ++ FYG P+IWC LH++ GN+ +YG ++++ VEA + ++VG
Sbjct: 381 LDLFAESTPQWQRTESFYGKPWIWCQLHDYGGNMGLYGQIENVTKNAVEA-VQTSKSIVG 439
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAWNVL 295
G+SMEG E N ++YDL+ + A++ E ++ + + + RYG + +P + AW+ +
Sbjct: 440 FGLSMEGQEGNEIMYDLLLDQAWRKEAIETDKYFSDWVTVRYGADHKEIPENLYTAWDKV 499
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
TVYN TD + V + ++ PSI + + HP
Sbjct: 500 RSTVYNNTDSSVTA---VTKSIFELAPSISGLV--------------------NRTGHHP 536
Query: 356 -HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 411
+ Y T +I A ++G++ L + YRYDL D TRQ LA + ++E Y
Sbjct: 537 TKITYDTKTLISAWNDMFSAGDQARWLFDNEAYRYDLTDWTRQVLANAFEATYNKLVEKY 596
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
+ N+ GV R +++ MD +L + F L W+++A++ E +E+NAR
Sbjct: 597 KSNNTKGVKCAGDRLQAILQTMDQVLDTNPSFKLSTWIQAARK--SGGEAADFFEYNARN 654
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI-----ESLESGDGFRL 526
Q+T+W N + + DY +K W+GL+ +YY R ++ Y++ E ++ +L
Sbjct: 655 QVTLWGPNGE-----IEDYASKQWAGLVGNYYAHRWQMFVDYLVATDPKEYDQNVFKKKL 709
Query: 527 KDWRREW 533
++W +W
Sbjct: 710 REWETQW 716
>gi|195454475|ref|XP_002074254.1| GK18384 [Drosophila willistoni]
gi|194170339|gb|EDW85240.1| GK18384 [Drosophila willistoni]
Length = 743
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 171/508 (33%), Positives = 273/508 (53%), Gaps = 43/508 (8%)
Query: 32 ELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIG 91
ELG++ LPAF+G+VP AL+ +FP A T+ W + +CC ++ +PLF ++
Sbjct: 262 ELGISVALPAFAGHVPRALRRIFPQANFTETERWNRFPN--AYCCDLFVEPQEPLFRQLA 319
Query: 92 RAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQG 151
F+ + + YG ++HI+ CD F+E PPV +++ S AAIY+ M+ D A+WL+QG
Sbjct: 320 TTFLRRVTQRYG-SNHIFFCDPFNELEPPVSQADFMRSTAAAIYASMREVDPKAIWLLQG 378
Query: 152 WLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 211
W+F + FW ++A L +VP G L+VLDL +E P + +K +YG P++WCMLHNF G
Sbjct: 379 WMFVKNIFWTDELIEAFLTAVPQGNLLVLDLQSEQFPQYQRTKSYYGQPFVWCMLHNFGG 438
Query: 212 NIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
+ M G ++ + G AR N++MVG G++ EGI QN V+Y E + K+D
Sbjct: 439 TLGMLGSVELVNSGMDLARQMPNSSMVGAGITPEGIGQNYVMYSFALERGWSDRKLDSAG 498
Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 331
W +++ RYG + AW +L +VY T K R GK
Sbjct: 499 WFTHFALTRYGVQDERLNQAWQLLRTSVY--TFHGLQKMR------------------GK 538
Query: 332 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT----YRYD 387
Y +P + P WY+ + V+ A +L +++ + + + Y++D
Sbjct: 539 YTITRRPAINLS-----------PFTWYNVTHVLEAWQLMLSARSIIPLDDNRYDIYQHD 587
Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
L+D+TRQ L A++L++N+ +Y+ L + LEL++D++ +L FLLG
Sbjct: 588 LVDITRQYLQITADQLYVNLNSSYRKRQLARFVYLGNKLLELLDDLERILGSGSNFLLGT 647
Query: 448 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 507
WLE+AK LA E + +E+NAR QIT W N + + DY K WSG++ DYY PR
Sbjct: 648 WLEAAKLLAPTVEDQSNFEFNARNQITTWGPNGE-----ILDYACKQWSGMISDYYRPRW 702
Query: 508 AIYFKYMIESLESGDGFRLKDWRREWIK 535
A + + +L+S F +++ +K
Sbjct: 703 ARFLDDVTLALQSNQPFNASAYKQHVLK 730
>gi|295087651|emb|CBK69174.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 703
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 178/567 (31%), Positives = 284/567 (50%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP SWL Q LQK+I+ R LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 169 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 228
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D DPLF +I + ++E+Q K YG T HIY D F+E P
Sbjct: 229 QMSQWGGYDEKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 284
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++ I+ ++ DS A W+ W+F + W P++KA LNSVP KL++
Sbjct: 285 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 344
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G +D ++ + G
Sbjct: 345 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 404
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG++ NP +Y+ + E A+ H + W+ +++ R G I DAW LY +
Sbjct: 405 VGATLEGLDVNPFMYEFVLEKAWSHTITNAD-WMKNWALCRGGSKSSHIIDAWQQLYKKI 463
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y I T G+ ++ +L+ S HP ++Y
Sbjct: 464 Y-----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYY 496
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+ F+ + N S+ Y++D+I++ RQ L ++ + Y+ + G+
Sbjct: 497 DNKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGM 554
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ + + L D+D LL+C F +G W++ A+ +N ++++ YE NAR +T W
Sbjct: 555 KEWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARCILTTW--- 611
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++A+ L DY N+ W GL YY R ++ +Y I+ + G K + +
Sbjct: 612 -GQKATQLNDYANRGWGGLTDSYYRKRWELFTQYAIDEMSHGKEIDEKSFYNLITEFEYQ 670
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W NVY S D + + LY KY
Sbjct: 671 WTLQTNVYSESSGEDPIRIANLLYIKY 697
>gi|423346424|ref|ZP_17324112.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
CL03T12C32]
gi|409220242|gb|EKN13198.1| hypothetical protein HMPREF1060_01784 [Parabacteroides merdae
CL03T12C32]
Length = 718
Score = 296 bits (757), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 187/578 (32%), Positives = 290/578 (50%), Gaps = 67/578 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQK+I+ R+ E G+ PV P +SG VP + ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQKRIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + L TDP F EI + ++ K YG+ + Y+ D F E
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKADY-YSMDPFHEGGSV 298
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V + + G AI M+ + AVW+ Q W + P ++ ++ G L+VL
Sbjct: 299 VGVD--LDAAGKAIMQAMKKNNPKAVWVAQAWQANPRP--------QMIGNLEAGDLIVL 348
Query: 181 DLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE 233
DLFAE +P W K +G +I+CML N+ GN+ ++G + + +A+ S
Sbjct: 349 DLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNVGLHGKMKHVIDEFYKAKESP 408
Query: 234 -NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
T+ GVGM+MEG E NPV+++L++E+ ++ ++ D W+ +Y+V RYG+S P +QDAW
Sbjct: 409 FGKTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDKDQWLREYTVARYGKSNPTVQDAW 468
Query: 293 NVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
+L +++YNC D T + + V A P TE YQ S
Sbjct: 469 ILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ---------------VS 504
Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
S+ +Y + VIRA + ++ +E +N + YDL+D+ RQA+A+ L ++EA
Sbjct: 505 SWSEMKDYYDPNNVIRAAAMMVSVADEFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVVEA 563
Query: 411 -YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 469
+ D S RFL L+ D LLA F +G W+ A+ L E+++ YEWNA
Sbjct: 564 AFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGSTPEEKELYEWNA 623
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
R QIT W + + LRDY ++ W+G+L+D+Y R +F Y RL D
Sbjct: 624 RVQITTWGNRLAADEGGLRDYAHREWNGILKDFYYMRWKTWFDYQT---------RLLDG 674
Query: 530 RR----EWIKLTNDWQNGRNVYPVESNGDALITSQWLY 563
R+ ++ + W NVY E GD + T + ++
Sbjct: 675 RKTAAIDFYAIEERWTKATNVYSSEPEGDCISTVKRIF 712
>gi|423287380|ref|ZP_17266231.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
CL02T12C04]
gi|392672495|gb|EIY65962.1| hypothetical protein HMPREF1069_01274 [Bacteroides ovatus
CL02T12C04]
Length = 726
Score = 296 bits (757), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 178/567 (31%), Positives = 284/567 (50%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP SWL Q LQK+I+ R LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 192 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D DPLF +I + ++E+Q K YG T HIY D F+E P
Sbjct: 252 QMSQWGGYDKKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++ I+ ++ DS A W+ W+F + W P++KA LNSVP KL++
Sbjct: 308 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G +D ++ + G
Sbjct: 368 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 427
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG++ NP +Y+ + E A+ H + W+ +++ R G I DAW LY +
Sbjct: 428 VGATLEGLDVNPFMYEFVLEKAWSHTITNAD-WMKNWALCRGGSKSSHIIDAWQQLYKKI 486
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y I T G+ ++ +L+ S HP ++Y
Sbjct: 487 Y-----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYY 519
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+ F+ + N S+ Y++D+I++ RQ L ++ + Y+ + G+
Sbjct: 520 DNKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGM 577
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ + + L D+D LL+C F +G W++ A+ +N ++++ YE NAR +T W
Sbjct: 578 KEWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARCILTTW--- 634
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++A+ L DY N+ W GL YY R ++ +Y I+ + G K + +
Sbjct: 635 -GQKATQLNDYANRGWGGLTDSYYRKRWELFTQYAIDEMSHGKEIDEKSFYNLITEFEYQ 693
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W NVY S D + + LY KY
Sbjct: 694 WTLQTNVYSESSGEDPIRIANLLYIKY 720
>gi|423213214|ref|ZP_17199743.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693674|gb|EIY86904.1| hypothetical protein HMPREF1074_01275 [Bacteroides xylanisolvens
CL03T12C04]
Length = 726
Score = 296 bits (757), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 178/567 (31%), Positives = 284/567 (50%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP SWL Q LQK+I+ R LGM PVLPAFSG+VPA L+ ++P A IT
Sbjct: 192 MSNVDYWQSPLPLSWLKNQRKLQKQIVDRERLLGMTPVLPAFSGHVPAELKRLYPDAAIT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D DPLF +I + ++E+Q K YG T HIY D F+E P
Sbjct: 252 QMSQWGGYDEKYR---SHFIDPMDPLFGKIQKRYLEKQTKLYG-TDHIYGIDPFNEVDSP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+++ ++ I+ ++ DS A W+ W+F + W P++KA LNSVP KL++
Sbjct: 308 NWDEDFLRTVSDKIFHSIEQVDSLAHWIQMTWMFYHSKDKWSQPRIKAFLNSVPDDKLIL 367
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G +D ++ + G
Sbjct: 368 LDYYCDSVEIWRETQQYYGKPYIWCYLGNFGGNSMLAGHVDDVSAKLNRLFVEGGKNISG 427
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG++ NP +Y+ + E A+ H + W+ +++ R G I DAW LY +
Sbjct: 428 VGATLEGLDVNPFMYEFVLEKAWSHTITNAD-WMKNWALCRGGSKSSHIIDAWQQLYKKI 486
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y I T G+ ++ +L+ S HP ++Y
Sbjct: 487 Y-----------------------IHHATAGQ----AVLMNARPMLEGTDSWNTHPDIYY 519
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+ F+ + N S+ Y++D+I++ RQ L ++ + Y+ + G+
Sbjct: 520 DNKELWHIWGKFLEAKN--VDSSGYKFDVINIGRQVLGNLFSDFRDSFTACYRQKNIEGM 577
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ + + L D+D LL+C F +G W++ A+ +N ++++ YE NAR +T W
Sbjct: 578 KEWAEKMNTLFTDVDRLLSCESSFSIGKWIKDARDWGKNLKEKEYYEQNARCILTTW--- 634
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++A+ L DY N+ W GL YY R ++ +Y I+ + G K + +
Sbjct: 635 -GQKATQLNDYANRGWGGLTDSYYRKRWELFTQYAIDEMSHGKEIDEKSFYNLITEFEYQ 693
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W NVY S D + + LY KY
Sbjct: 694 WTLQTNVYSESSGEDPIRIANLLYIKY 720
>gi|121698957|ref|XP_001267859.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
1]
gi|119396001|gb|EAW06433.1| alpha-N-acetylglucosaminidase, putative [Aspergillus clavatus NRRL
1]
Length = 671
Score = 295 bits (756), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 171/486 (35%), Positives = 270/486 (55%), Gaps = 39/486 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G W G LP SW+D Q LQKKI+ R+ ELGM PVLPAF+G VP A+ V P A +
Sbjct: 197 NIQGSWHGELPYSWIDAQFELQKKIVRRMVELGMTPVLPAFTGFVPRAITRVLPDATVVN 256
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W D ++ L+ DP F + R+FI +Q + YG +HIY D ++EN P
Sbjct: 257 GSRWSGF--DEKYTNDTFLEPFDPNFARLQRSFIHKQQQAYGNITHIYTLDQYNENDPYS 314
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-LVV 179
PEY+ ++ + ++S D DA+W+MQGWLF S FW ++ A L+ V + ++V
Sbjct: 315 GDPEYLRNVTHNTWQSLKSADPDAIWMMQGWLFYSNSDFWTDERVHAYLSGVETDEDMLV 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E +P W ++ +YG P+IWC LH++ GN+ +YG + +I +A ++ +VG
Sbjct: 375 LDLFSESQPQWQRTQSYYGKPWIWCQLHDYGGNMGLYGQVMNITVNATDALAVSDS-LVG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG----RSVP-AIQDAWNV 294
G++MEG E N +VYDL+ + A+ +D ++ + + RY +VP + AW++
Sbjct: 434 YGLTMEGQEGNEIVYDLLLDQAWSSRPIDTDSYFHDWVKARYSTARRHNVPHELYQAWDI 493
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L T YN T+ AT V SI + +P K L ++T H
Sbjct: 494 LRTTAYNNTNLATATA---------VSKSIFEL---------QP--KLTGLVNQTGH--H 531
Query: 355 PHLW-YSTSEVIRALELFIASGNELSA---SNTYRYDLIDLTRQALAKYANELFLNIIEA 410
P + Y S ++R+ +L +++ +E +A +RYD++D+TRQ +A ++LN+
Sbjct: 532 PTVVNYEASSLVRSWKLMVSAASESTALWSHPAFRYDMVDVTRQVMANAFIPMYLNVTST 591
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 470
YQ + Q + L+ D+D +L+ +D F L W+ESA+ A+N+ + YE+NAR
Sbjct: 592 YQ--KGGPISQQGDSLIRLLRDLDAVLSTNDNFRLATWIESARTWARNDTEADFYEYNAR 649
Query: 471 TQITMW 476
QIT+W
Sbjct: 650 NQITLW 655
>gi|404487028|ref|ZP_11022215.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
YIT 11860]
gi|404335524|gb|EJZ61993.1| hypothetical protein HMPREF9448_02671 [Barnesiella intestinihominis
YIT 11860]
Length = 726
Score = 295 bits (755), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 183/567 (32%), Positives = 276/567 (48%), Gaps = 45/567 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ GW GPLP WL+ Q LQKKIL R EL M PVLPAF+G+VPAAL+ + P A I
Sbjct: 196 MANIDGWNGPLPMQWLESQAELQKKILARERELNMTPVLPAFAGHVPAALKRIHPDANIQ 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R + L+ +PLF EI ++F+E+Q K +G T HIY D F+E PP
Sbjct: 256 YLGKWAGFGDSYR---CHFLNPEEPLFAEIQKSFLEEQEKMFG-TDHIYGVDPFNEVDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY++ + + +Y + + D DAVWL W+F +D W P++KALL VP KLV+
Sbjct: 312 SWEPEYLAQVSSDMYKSLAAADPDAVWLQMTWMFYHDRKLWTAPRVKALLTGVPSDKLVL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++++F+G PYIWC L NF GN + G + A + + G
Sbjct: 372 LDYHCENVELWKSTEKFHGQPYIWCYLGNFGGNTTLTGNVKESGDRLDNALINGGDNLKG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ N Y+ + E A+ + V+ + W+ + + R G + ++AW +L+
Sbjct: 432 IGSTLEGLDINQFPYEYIFEKAWTID-VNGQDWVERLADRHVGAVSESAREAWQILFD-- 488
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
DV V P + NY K S E Y
Sbjct: 489 ------------DVFVQVPRTLGILPGYRPKLGDNYNKRTSNE----------------Y 520
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ ++R EL + + + + D+I RQ L Y ++ Y+ + G+
Sbjct: 521 DNATLLRVWELLLEVPS--CDRDAFEIDVIMTGRQLLGNYFLDVKKEFDGFYKKRNVPGL 578
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ + E++ D++ L + H+ L W+E A+ L +E + YE NAR IT W
Sbjct: 579 KEKASEMREILSDLELLNSFHNRASLDKWIEDARSLGDTDELKNYYEKNARNLITTW--- 635
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
L DY ++ W+GLL DYY R IYF +I + E G + + +
Sbjct: 636 ----GGSLNDYASRTWAGLLNDYYARRWEIYFDAVIGAAEKGIELDKDELKSRLATFEQE 691
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W +E NG L TS+ L KY
Sbjct: 692 WVESTTPVCIERNGTLLDTSRRLLEKY 718
>gi|242809019|ref|XP_002485282.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
ATCC 10500]
gi|218715907|gb|EED15329.1| alpha-N-acetylglucosaminidase, putative [Talaromyces stipitatus
ATCC 10500]
Length = 755
Score = 295 bits (755), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 173/528 (32%), Positives = 287/528 (54%), Gaps = 44/528 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G W G LP W+D Q LQKKI+ R+ ELGM P+LPAF G VP A+ V P A +
Sbjct: 203 NIQGSWSGSLPYDWVDSQFDLQKKIVKRMTELGMTPILPAFPGFVPRAITRVLPDADVIN 262
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + + + ++ TDP F EI ++FI +Q++ YG + Y D F+EN P
Sbjct: 263 GSAWEAFPT--MYTNDTFMEPTDPHFTEIQKSFIAKQIEAYGNVTTFYTLDQFNENNPSS 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPL-GKLVV 179
Y+ ++ + +++ DS+AVW+MQGWLF S FW +++A L V + L++
Sbjct: 321 GDLSYLRNVSQGTWKTLKAADSNAVWVMQGWLFTSNSAFWTNDRIEAYLGGVAVDSDLLI 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E P W + +YG P+IWC +H++ GN+ YG + +I P+ A ++++VG
Sbjct: 381 LDLASESSPQWQRTNSYYGKPWIWCEIHDYGGNMGFYGQVMNITNNPIAA-LHNSSSLVG 439
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVP-AIQDAWNVLY 296
G+SMEG E N +VYDL+ + A+ +D +++ + + RY RS+P ++ AW++L
Sbjct: 440 FGLSMEGQEGNEIVYDLLLDQAWNAAPIDTESYFHDWVTARYAGSRSIPSSVYSAWDILR 499
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP- 355
TVYN T+ A + A P +I T G G HP
Sbjct: 500 TTVYNNTNLAAN-------AVPKAIFELIPSTTGLLNRTGH----------------HPT 536
Query: 356 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
L Y+T+++++A LF S + L + + +DL+D++RQ LA ++ N+I Y
Sbjct: 537 KLNYNTADMVQAWNLFYTSAFKEPSLWLNPAFEFDLVDMSRQVLANAFIPVYENLISTYN 596
Query: 413 LND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNA 469
++ + + + + +++ +D +LA + F L WL +A+ A ++ + + E+NA
Sbjct: 597 TSNPSSTKLQTIGAELIGILQALDTVLATNKNFKLSTWLSAARASAGSQHNIEDFLEYNA 656
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 517
R QIT+W Q + DY +K W+GL+ YY PR ++ +Y+I++
Sbjct: 657 RNQITLWGPTGQ-----ISDYASKSWAGLVSSYYIPRWKMFVEYLIDT 699
>gi|400599317|gb|EJP67021.1| alpha-N-acetylglucosaminidase, putative [Beauveria bassiana ARSEF
2860]
Length = 753
Score = 294 bits (753), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 172/547 (31%), Positives = 287/547 (52%), Gaps = 49/547 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LPQ+W+D Q LQ+KI+ R+ ELGM P+LPAF G VP + V+P+ + +
Sbjct: 203 NIQGSWGGDLPQAWIDDQFALQRKIIKRMVELGMTPILPAFPGFVPENITRVWPNVSLAE 262
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W R+ + DP F E+ +AF+ +Q + YG + + D F+EN P
Sbjct: 263 SPTWSGFSG--RFTADKYITPYDPRFAELQKAFLTKQNEAYGNVTSFWTLDQFNENKPAS 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-LVV 179
Y+ ++ + ++ D AVW+MQGWLF+ D +W ++K+ L+ VP+ + +++
Sbjct: 321 GELGYLRNVSHNTWQTLKDADPSAVWVMQGWLFASDKAYWTDDRVKSFLDGVPVNEDMLL 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P W + FYG P+IWC LH + GN+ +YG ++++ VEA ++ ++VG
Sbjct: 381 LDLFAESTPQWQRTDSFYGKPWIWCQLHGYGGNMGLYGQIENVTRNAVEA-VQKSPSIVG 439
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAWNVL 295
+G+SMEG E N ++Y+L+ + A+ E ++ + + + RYG + +P + AW+ +
Sbjct: 440 LGLSMEGQEGNEIMYNLLLDQAWSKEALETDKYFSDWVTVRYGADQKEIPKDLYTAWDKV 499
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
TVYN TD + + A ++ T G G +K
Sbjct: 500 RSTVYNNTDSS-------VTAVAKSIFELVPSTSGLVNRTGHHATK-------------- 538
Query: 356 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+ Y T +I A +G++ L + Y YDL D TRQ LA + ++E Y+
Sbjct: 539 -ITYDTETLISAWNDMFNAGSQARWLFDNEAYSYDLTDWTRQVLANAFEATYNKLVEKYK 597
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
N+ GV R ++ MD +L + F L W+++A++ + +E+NAR Q
Sbjct: 598 SNNIKGVKCAGSRLQAILRTMDQVLETNVHFRLSTWIQAARK--SGGDAADFFEYNARNQ 655
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD----GFR--L 526
+T+W N + + DY +K W+GL+ DYY R ++ Y++ + + GD F+ L
Sbjct: 656 VTLWGPNGE-----IEDYASKQWAGLIGDYYAHRWQMFVDYLVAT-DPGDYDQMAFQKTL 709
Query: 527 KDWRREW 533
+W +EW
Sbjct: 710 IEWEKEW 716
>gi|154489986|ref|ZP_02030247.1| hypothetical protein PARMER_00215 [Parabacteroides merdae ATCC
43184]
gi|423722990|ref|ZP_17697143.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
CL09T00C40]
gi|154089428|gb|EDN88472.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
43184]
gi|409241820|gb|EKN34587.1| hypothetical protein HMPREF1078_01203 [Parabacteroides merdae
CL09T00C40]
Length = 718
Score = 294 bits (753), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 189/580 (32%), Positives = 292/580 (50%), Gaps = 71/580 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQK+I+ R+ E G+ PV P +SG VP + ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQKRIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + L TDP F EI + ++ K YG+ + Y+ D F E +
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKADY-YSMDPFHEGGSV 298
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + G AI M+ + AVW+ Q W + P ++ ++ G L+
Sbjct: 299 AGVD----LDAAGKAIMQAMKKNNPKAVWVAQAWQANPRP--------QMIGNLEAGDLI 346
Query: 179 VLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
VLDLFAE +P W K +G +I+CML N+ GN+ ++G L + +A+
Sbjct: 347 VLDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNVGLHGKLKHVIDEFYKAKE 406
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
S T+ GVGM+MEG E NPV+++L++E+ + ++ D W+ +Y+V RYG+S P +QD
Sbjct: 407 SPFGKTLKGVGMTMEGSENNPVMFELLTELPWCPQRFDKDQWLREYTVARYGKSNPTVQD 466
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +++YNC D T + + V A P TE YQ
Sbjct: 467 AWILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ--------------- 502
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
SS+ +Y ++VIRA + ++ +E +N + YDL+D+ RQA+A+ L ++
Sbjct: 503 VSSWSEMKDYYDPNDVIRAAAMMVSVADEFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVV 561
Query: 409 EA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEW 467
EA + D S RFL L+ D LLA F +G W+ A+ L E+++ YEW
Sbjct: 562 EAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGGTPEEKELYEW 621
Query: 468 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 527
NAR QIT W + + LRDY ++ W+G+L+D+Y R +F Y RL
Sbjct: 622 NARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYYMRWKTWFDYQT---------RLL 672
Query: 528 DWRR----EWIKLTNDWQNGRNVYPVESNGDALITSQWLY 563
D R+ ++ + W NVY E GD + T + ++
Sbjct: 673 DGRKTAAIDFYAIEERWTKATNVYSSEPEGDCISTVKRIF 712
>gi|299149196|ref|ZP_07042257.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298512863|gb|EFI36751.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 738
Score = 294 bits (753), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 176/569 (30%), Positives = 285/569 (50%), Gaps = 40/569 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 205 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+L + D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 265 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D +AVWL WLF D W P++K+ L SVP +L++
Sbjct: 321 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F E IW + ++G PY+WC L NF GN + G ++ ++ +A + + + G
Sbjct: 381 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EGI+ N +Y+ + + A+ + D K W + + RR G+ P + AW +L + V
Sbjct: 441 VGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKLADRRIGKISPEARKAWEILANKV 499
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P+ V +G N +P LK P + Y
Sbjct: 500 Y-------------------VQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEY 532
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+++ A L ++ + ++Y +DL+++ RQ L Y N + AY+ D +
Sbjct: 533 QPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIPMM 590
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ E++ D+D L++CH F L W+ A+ + + + YE NAR+ IT+W D+
Sbjct: 591 KNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHDAASKNYYEMNARSLITIWGDS 650
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
L DY N+ W+GL YY R + +IE+ E F +++ + N+
Sbjct: 651 YH-----LTDYANRSWAGLTNQYYSVRWDHFINEVIEAAEKKKNFDEEEFFNQSRMYENE 705
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKYLQ 568
W N N GD + ++ +Y KY +
Sbjct: 706 WVNPSNRISYNEGGDGIKLARQIYKKYAK 734
>gi|319900259|ref|YP_004159987.1| alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
gi|319415290|gb|ADV42401.1| Alpha-N-acetylglucosaminidase [Bacteroides helcogenes P 36-108]
Length = 718
Score = 294 bits (753), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 189/581 (32%), Positives = 290/581 (49%), Gaps = 69/581 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P W Q LQK+IL R+ E G+ PVLP +SG +PA +
Sbjct: 187 MNNLEGWGGPNPDQWYSHQEQLQKRILKRMREYGIEPVLPGYSGMIPANAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG V +WC L +D F I R + ++ + YG+ ++ Y+ D F E
Sbjct: 239 KLG--LDVADPGKWCGYRRPAFLQPSDKNFRRIARLYYKEMTRLYGKANY-YSMDPFHEG 295
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW-LFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G +I M+ + AVW+ Q W YD ++ ++P
Sbjct: 296 GNTKGVD----LDAAGKSIRDAMKEANPQAVWVAQAWGACPYD---------NMIKNLPE 342
Query: 175 GKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G ++VLDL++E +P W KQ +G +I+CML NF GN+ +YG ++ +
Sbjct: 343 GDMIVLDLYSESRPQWGDPASAWYRKQGFGRHGWIYCMLLNFGGNVGLYGKMEHVIDEFY 402
Query: 228 EARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP 286
+AR S T+ GVG++MEG E NPV+Y+L+ E+ + ++ W+ Y RYG++ P
Sbjct: 403 KARESAFGGTLQGVGLTMEGSENNPVMYELLCELPWHGRRISKDQWLKSYLKARYGKTTP 462
Query: 287 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+AW L +T+YN + +T + + V A P ++ YQ
Sbjct: 463 QTVEAWLKLSNTIYNSPNASTQQGTHESVFCARPSLEA---------YQ----------- 502
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
SS+ +Y+ +++IRA I + E +N + YDLID+ RQA+A+ ++
Sbjct: 503 ----VSSWSEMKDYYAPADIIRAAGKMIEAAEEFRGNNNFEYDLIDVVRQAVAEKGRLVY 558
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
++ AY+ D S RFLEL+E D LL F LG W A+ + + + Q+
Sbjct: 559 PIVVSAYKAADKQLFEAASARFLELIELQDKLLGTRREFRLGTWTNYARNMGETDAQKDL 618
Query: 465 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 524
YEWNAR QIT W + T L DY +K W+GLLRD+Y R YF + +L +G+
Sbjct: 619 YEWNARVQITTWGNRTAANEGGLHDYAHKEWNGLLRDFYYMRWKAYFDELRSTL-NGNAP 677
Query: 525 RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 565
+ D + L +W N Y E GDA ++ +Y K
Sbjct: 678 KETD----FYTLEENWAGQHNPYSAEPEGDATDIAKEVYGK 714
>gi|237717696|ref|ZP_04548177.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229453015|gb|EEO58806.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 729
Score = 294 bits (752), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 176/569 (30%), Positives = 285/569 (50%), Gaps = 40/569 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 196 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+L + D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 256 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D +AVWL WLF D W P++K+ L SVP +L++
Sbjct: 312 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F E IW + ++G PY+WC L NF GN + G ++ ++ +A + + + G
Sbjct: 372 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EGI+ N +Y+ + + A+ + D K W + + RR G+ P + AW +L + V
Sbjct: 432 VGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKLADRRIGKISPEARKAWEILANKV 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P+ V +G N +P LK P + Y
Sbjct: 491 Y-------------------VQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEY 523
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+++ A L ++ + ++Y +DL+++ RQ L Y N + AY+ D +
Sbjct: 524 QPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIPMM 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ E++ D+D L++CH F L W+ A+ + + + YE NAR+ IT+W D+
Sbjct: 582 KNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHDAASKNYYEMNARSLITIWGDS 641
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
L DY N+ W+GL YY R + +IE+ E F +++ + N+
Sbjct: 642 YH-----LTDYANRSWAGLTNQYYSVRWDHFINEVIEAAEKKKNFDEEEFFNQSRMYENE 696
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKYLQ 568
W N N GD + ++ +Y KY +
Sbjct: 697 WVNPSNRISYNEGGDGIKLARQIYKKYAK 725
>gi|395331391|gb|EJF63772.1| alpha-N-acetylglucosaminidase [Dichomitus squalens LYAD-421 SS1]
Length = 750
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 182/556 (32%), Positives = 302/556 (54%), Gaps = 41/556 (7%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP +W+D Q LQK+IL R+ ELGM PVLP+F+G VP AL +++P+A I
Sbjct: 195 FGNIQGSWGGDLPVTWVDDQFQLQKQILQRMVELGMTPVLPSFTGFVPRALSSLYPNASI 254
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W + L+ DPLF I +FI +Q + YG SHIY D ++EN P
Sbjct: 255 VNGSQWEGFPT--ALTNDSFLEPFDPLFTTIQTSFISKQREAYGNVSHIYALDQYNENDP 312
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KL 177
P Y++++ A ++ +++ D DAVWLMQGWLF S FW +++A L VP +
Sbjct: 313 FSGDPAYLANVTAGTFASLRAADPDAVWLMQGWLFFSSAAFWTNERIEAYLGGVPGNDSM 372
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL++E +P W+ + +YG ++WC LH + GNI M G LD++ P+ A + ++M
Sbjct: 373 IILDLYSEAQPQWNRTSSYYGKQWVWCELHGYGGNIGMEGDLDALTQNPIAALHAPGSSM 432
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNVL 295
GVG++MEG E N +VYD++ + A+ +++ ++++Q+ RRY R +P + DAW L
Sbjct: 433 KGVGLTMEGQEGNELVYDILLDQAWSSAPLNLSSYVDQWVARRYNVRRLPKSALDAWRTL 492
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
TVY+ D T + I + ++ P++ +T + T +
Sbjct: 493 ATTVYSNKDSGT---QAAIKSIYELAPALTGMT------------------NRTGHHPTA 531
Query: 356 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+ + S V+ A + + + +E L+ + YD++D+TRQ L+ + + ++ Y
Sbjct: 532 IPYDTNSTVLVAAKALLEARSENPLLATIPEFAYDVVDVTRQLLSNRFIDHYNVLVATYN 591
Query: 413 LNDA--HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ---EKQYEW 467
N V + L L++D+D LLA ++ FLL W+ AK+ ++ + E+
Sbjct: 592 SNATAPRNVAAAAGPLLALLDDLDELLATNEHFLLSNWIADAKRWTHGADRAAYARLLEY 651
Query: 468 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 527
NAR QIT+W + + + DY +K W+GL+R YY PR + +Y+ ++ E+G +
Sbjct: 652 NARNQITLWGPDGE-----INDYASKAWAGLVRTYYKPRWEAFVEYLAQTKEAGAAYDAH 706
Query: 528 DWRREWIKLTNDWQNG 543
+ I + W NG
Sbjct: 707 VVSAKMIAIGQQWSNG 722
>gi|212537509|ref|XP_002148910.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
gi|210068652|gb|EEA22743.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
Length = 768
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 173/552 (31%), Positives = 290/552 (52%), Gaps = 44/552 (7%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WG PLP +W+D Q LQKKI+ R+ ELGM P+LPAF G VP A+ V P A +
Sbjct: 212 NIQGSWGSPLPYAWVDSQFDLQKKIVKRMVELGMTPILPAFPGFVPRAITRVLPDADVIN 271
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + + + ++ TDP F EI ++FI +Q YG + Y D F+EN P
Sbjct: 272 GSAWEAFPA--MFTSDTFMEPTDPHFTEIQKSFISKQTAAYGNVTTFYTLDQFNENNPSS 329
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPL-GKLVV 179
Y+ S+ + +++ D AVW+MQGWLF S FW +++A L V + L+V
Sbjct: 330 GDLNYLRSVSHGTWQALKAADPSAVWVMQGWLFFSNSAFWTNDRVEAYLGGVTVDSDLLV 389
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL +E +P W + ++G P+IWC +H++ GN+ YG + +I P+ A + ++VG
Sbjct: 390 LDLASESQPQWQRTNSYFGKPWIWCQIHDYGGNMGFYGQVMNITVNPIAALNNATASLVG 449
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNVLY 296
G+SMEG E N VVYDL+ + A+ + +D + + + RY +S+P + AW++L
Sbjct: 450 FGLSMEGQEGNEVVYDLLLDQAWSAKPIDTATYFHDWVTARYAGSKSIPTDVYSAWDMLR 509
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP- 355
+VYN T+ A++ A P +I T G G HP
Sbjct: 510 TSVYNNTNLASN-------AVPKAIFELIPSTTGLVNRTGH----------------HPT 546
Query: 356 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
L Y+ +++++A LF ++ + L + Y +DL+D++RQ LA ++ ++I A+
Sbjct: 547 TLNYNPADMVKAWSLFYSAAFKEPSLWLNPAYEFDLVDMSRQVLANAFIPVYHDLIAAWN 606
Query: 413 LNDAHGVF--QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 470
+ + + + +++ +D +L ++ F L W+ +A+ A + E E+NA
Sbjct: 607 TTNPSTIRIQIIGAELIGILQAIDTILDTNEHFKLSTWISAARTSAGEQSLEDFLEYNAL 666
Query: 471 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR 530
QIT+W Q + DY +K W+GL+ YY PR ++ +Y++++ + + ++
Sbjct: 667 NQITLWGPTGQ-----ISDYASKSWAGLVSSYYIPRWKMFIEYLVDTKPA--QYNQTAFK 719
Query: 531 REWIKLTNDWQN 542
E +K WQN
Sbjct: 720 AELLKWELQWQN 731
>gi|410100551|ref|ZP_11295511.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
CL02T12C30]
gi|409215586|gb|EKN08585.1| hypothetical protein HMPREF1076_04689 [Parabacteroides goldsteinii
CL02T12C30]
Length = 739
Score = 292 bits (747), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 194/592 (32%), Positives = 292/592 (49%), Gaps = 79/592 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW+D L ++IL R ELGM P+L +F+G VP L+ +P A+I
Sbjct: 194 MTNIETYGGPLPQSWIDSHQALGQQILERQRELGMTPILQSFTGFVPIKLKEKYPDARI- 252
Query: 61 QLGNWFSVKSDPRWC----CTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
K RWC T LD DPLF E+G+AF+E+Q K YG T+HIY D F E
Sbjct: 253 --------KDKNRWCNAFTATVQLDPLDPLFKEMGQAFLEEQQKLYG-TNHIYAADPFHE 303
Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
P + Y+ ++G I+ D +AV MQ W +A+ + P +
Sbjct: 304 GAAPSNEKSYLEAVGKVIWEVASGFDPEAVIAMQTWSLR----------EAITRTFPQDR 353
Query: 177 LVVLDLFAEVKPIWSTSK--QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS-E 233
L++LDL W+ K F+ PY+ +LHN+ G + M G L A E + S +
Sbjct: 354 LLLLDLGG-----WNVEKFNSFWNYPYVAGVLHNYGGRVYMGGNLALYAKNAHELKQSPK 408
Query: 234 NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWN 293
+ G+G+ E IE NPVVY+L +E+ + + D++ WI Y+ RYG+ + W
Sbjct: 409 GGNIQGIGLFPEAIEHNPVVYELSTEITWMQDAPDLQKWITDYARARYGKLPAGAEQGWK 468
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
VL TVY G V+ A P + ++ + +P
Sbjct: 469 VLLETVYGSKAGRLPSTESVMCARPALTIQKVAAN----GDLSRP--------------- 509
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
YST + A++ F+ + N+L S+TYRYDL+D+ RQ L+ + L I EAY
Sbjct: 510 -----YSTVRLWDAVDHFLQASNDLKKSDTYRYDLVDVMRQCLSDLSLPLQKQITEAYLA 564
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
D + Q +FL L++D D LL FLLG W++ A+Q EE++ YEWNART +
Sbjct: 565 EDNEKLQQAGEQFLALIDDFDRLLGTRSTFLLGKWIKEARQWGTTEEEKALYEWNARTLV 624
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF---------------KYMIESL 518
T+W N ++ L +Y N+ W+GL++ YY PR + +Y+ +SL
Sbjct: 625 TVWGPN--HPSAHLFEYSNRQWAGLMKGYYKPRWEKFISYLKAQPKGEWRYDEQYIRKSL 682
Query: 519 ESGDGFRLKDWRREWIKLTN---DWQNGRNVYPVESNGDALITSQWLYNKYL 567
D+ + +LTN DW ++VY G+ + + LY K+L
Sbjct: 683 AGRPALDASDF---YTRLTNWEYDWAFNKDVYTDTPQGNEIEIVKELYAKWL 731
>gi|224025137|ref|ZP_03643503.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
18228]
gi|224018373|gb|EEF76371.1| hypothetical protein BACCOPRO_01871 [Bacteroides coprophilus DSM
18228]
Length = 718
Score = 290 bits (743), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 189/578 (32%), Positives = 296/578 (51%), Gaps = 63/578 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW ++Q LQK+IL R+ E G+ PVLP +SG VP ++ +
Sbjct: 187 MNNLEGWGGPNPDSWYERQEELQKRILKRMREYGIEPVLPGYSGMVPHNAKDRL-GLNVA 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W PR L TDP F I + + + YG+ S+ Y+ D F E NT
Sbjct: 246 DPGRW---NGYPR---PAFLQPTDPQFERIAALYYREMTRLYGKVSY-YSMDPFHEGGNT 298
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + G AI+ M+ + A W++Q W +P PQM + ++P G +V
Sbjct: 299 SGVD----LEAAGKAIWKAMKQANPRAAWVVQAW--GANP---RPQM---IRNLPAGDMV 346
Query: 179 VLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
VLDLF+E +P W K+ +G +++CML N+ GN+ ++G + + +A+
Sbjct: 347 VLDLFSESRPQWGDPASSWYRKEGFGQHDWLFCMLLNYGGNVGLHGKMAHLIEEFYKAKD 406
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
S T+ GVGM+MEGIE NPV+Y+L+ E+ ++ ++ W+ Y RYG+S +
Sbjct: 407 SSFGKTLKGVGMTMEGIENNPVMYELLCELPWREQRFSKDEWLEGYLKARYGKSDSQVSQ 466
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +T+YNC +T + + ++ A P +S SE
Sbjct: 467 AWMLLSNTIYNCPAASTQQGTHESILCARPSWKAYQVSSW------------------SE 508
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
S Y Y ++VIRA + + + +N + YDL+D+ RQA+A+ ++ ++
Sbjct: 509 MSDY------YDPADVIRAAGMMVDAAERFRGNNNFEYDLVDIVRQAVAEKGRLMYRVLV 562
Query: 409 EAYQLNDAHGVFQLSR-RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEW 467
+AY+ D +F+LS RFL L+ D LLA F +G WLESA+ L EE++ YEW
Sbjct: 563 DAYKAGDRE-LFKLSSDRFLRLILMQDRLLATRSEFKVGRWLESARNLGSTEEEKDWYEW 621
Query: 468 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 527
NAR QIT W + + L DY ++ W+GLLRD+Y R + ++S E G +
Sbjct: 622 NARVQITTWGNRVAADDGGLHDYAHREWNGLLRDFYYLRWKTWLDEQLKSFEGGQPKAI- 680
Query: 528 DWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 565
++ L W N Y E+ G+ + + +Y +
Sbjct: 681 ----DFYALEEPWTLKHNSYASEAEGNPVDIACEIYRE 714
>gi|423259033|ref|ZP_17239956.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
CL07T00C01]
gi|423263996|ref|ZP_17242999.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
CL07T12C05]
gi|387776613|gb|EIK38713.1| hypothetical protein HMPREF1055_02233 [Bacteroides fragilis
CL07T00C01]
gi|392706262|gb|EIY99385.1| hypothetical protein HMPREF1056_00686 [Bacteroides fragilis
CL07T12C05]
Length = 718
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 185/577 (32%), Positives = 290/577 (50%), Gaps = 70/577 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKPP 679
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
++ ++ + W N Y E+ GD + ++
Sbjct: 680 AKI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|218258436|ref|ZP_03474815.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
DSM 18315]
gi|423342591|ref|ZP_17320305.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
CL02T12C29]
gi|218225494|gb|EEC98144.1| hypothetical protein PRABACTJOHN_00470 [Parabacteroides johnsonii
DSM 18315]
gi|409217508|gb|EKN10484.1| hypothetical protein HMPREF1077_01735 [Parabacteroides johnsonii
CL02T12C29]
Length = 718
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 183/576 (31%), Positives = 287/576 (49%), Gaps = 63/576 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQ++I+ R+ E G+ PV P +SG VP + ++
Sbjct: 187 MNNLEGWGGPNPDSWYKQQIALQQQIVKRMREYGIEPVFPGYSGMVPHNAKEKL-GLNVS 245
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + L TDP F EI + ++ K YG+ ++ Y+ D F E +
Sbjct: 246 DPGLWNGYRRPA------FLQPTDPRFEEIASLYYKEMNKLYGKANY-YSMDPFHEGGSV 298
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + G AI M+ + AVW+ Q W + P ++ ++ G L+
Sbjct: 299 AGVD----LDAAGKAIMQAMKKNNPKAVWVAQAWQANPRP--------QMIGNLEAGDLI 346
Query: 179 VLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
LDLFAE +P W F +I+CML N+ GNI ++G + + +A+
Sbjct: 347 ALDLFAESRPQWGDPASTWYRKDGFGQHDWIYCMLLNYGGNIGLHGKMKHVIDEFYKAKE 406
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
S TT+ GVGM+MEG E NPV+++L++E+ ++ ++ D W+ Y+V RYG+S P +QD
Sbjct: 407 SPFGTTLKGVGMTMEGSENNPVMFELLTELPWRPQRFDKDQWLKAYTVARYGKSNPVVQD 466
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +++YNC D T + + V A P TE YQ
Sbjct: 467 AWILLSNSIYNCPDANTQQGTHESVFCARP---------TEHPYQ--------------- 502
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
SS+ +Y ++VIRA + ++ ++ +N + YDL+D+ RQA+A+ L ++
Sbjct: 503 VSSWSEMKDYYDPNDVIRAAAMMVSVSDQFKGNNNFEYDLVDIVRQAIAE-KGRLTEKVV 561
Query: 409 EA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEW 467
EA + D S RFL L+ D LLA F +G W+ A+ L E++ YEW
Sbjct: 562 EAAFAAGDKKLYKDASDRFLRLILLQDELLATRPEFKVGTWIARARSLGNTSEEKDLYEW 621
Query: 468 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 527
NAR QIT W + + LRDY ++ W+G+L+D+Y R +F Y L+ +
Sbjct: 622 NARVQITTWGNRLAADEGGLRDYAHREWNGILKDFYYMRWKTWFDYQTRLLDGKKTAAI- 680
Query: 528 DWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 563
++ + W N Y E GD + T Q ++
Sbjct: 681 ----DFYAIEEPWTKQTNPYSNEPEGDCIPTVQRIF 712
>gi|393782608|ref|ZP_10370791.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
CL02T12C01]
gi|392672835|gb|EIY66301.1| hypothetical protein HMPREF1071_01659 [Bacteroides salyersiae
CL02T12C01]
Length = 761
Score = 289 bits (739), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 176/525 (33%), Positives = 261/525 (49%), Gaps = 51/525 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+S +D+ L KKI+ R ELGM P+ FSG VP L++ +P+A I
Sbjct: 199 MQNLQSYGGPLPKSVIDRHAALGKKIIARQLELGMQPIQQGFSGYVPRELKDKYPTANIN 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q +W K + LD TD LF +GR F+E+Q + +G +Y D F E+ PP
Sbjct: 259 QQRSWCGFKGAAQ------LDPTDSLFTRMGRVFLEEQARLFG-AHGVYAADPFHESVPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD+PEY+ ++G I+ + D + W MQ W +A++ +VP L++L
Sbjct: 312 VDTPEYLKAVGETIHRLFREFDPQSTWAMQSWSLR----------EAIVKAVPKEALLIL 361
Query: 181 DLFAEVKPIWSTSK-QFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
DL STSK +F+G P + LHNF G I M+G L +A N + G
Sbjct: 362 DLRGS-----STSKAEFWGYPTVVGNLHNFGGRINMHGDLALLASNQYSKAKRLNPAVCG 416
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G+ ME IEQNPV Y+L EM + +D++AW+ QY+ RRYG PA Q AW +L
Sbjct: 417 SGLFMEAIEQNPVYYELAFEMPCHPDSIDLRAWLKQYATRRYGAFSPATQKAWMLLLEGP 476
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T+K+ ++ A P +D V K YD P L
Sbjct: 477 YRQGTNGTEKS-SIVAARPALD-----------------VKKSGPNAGLEIPYD-PAL-- 515
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+IRA L + ++LSAS YR+DL+D+ RQ + + EA++ D
Sbjct: 516 ----IIRAQSLLLEDADKLSASRPYRFDLVDVQRQMMTNLGQLIHRKAAEAFRSKDREAF 571
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
S RFL ++ DMD LL + WL A+ + EE++ Q E +A + +T+W +
Sbjct: 572 TLHSGRFLGMLADMDTLLRTRSEYSFDRWLTEARSWGETEEEKNQMERDATSLVTIWGAD 631
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 524
+ DY + W+GL+ YY PR ++ + + L+ G +
Sbjct: 632 GDPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQQHLDEGTSY 673
>gi|53711968|ref|YP_097960.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis YCH46]
gi|52214833|dbj|BAD47426.1| alpha-N-acetylglucosaminidase precursor [Bacteroides fragilis
YCH46]
Length = 718
Score = 289 bits (739), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 185/577 (32%), Positives = 289/577 (50%), Gaps = 70/577 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|329963073|ref|ZP_08300853.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
gi|328529114|gb|EGF56044.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
Length = 717
Score = 289 bits (739), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 188/583 (32%), Positives = 296/583 (50%), Gaps = 73/583 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW Q+ LQK+IL R+ E G+ PVLP +SG VP +
Sbjct: 187 MNNLEGWGGPNPDSWYTQREALQKQILKRMREYGIQPVLPGYSGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L TDP F EI + ++ + YG+ + Y+ D F E
Sbjct: 239 RLG--LNV-SDPGLWCGYPRPAFLQPTDPRFGEIADLYYKEMTRLYGK-ADFYSMDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+ VD +++ G AI+ M+ + AVW+ Q W + P+ K ++ ++P
Sbjct: 295 GGSIAGVD----LNAAGQAIWGAMKKVNPKAVWVAQAWQAN-------PRQK-MIENIPQ 342
Query: 175 GKLVVLDLFAEVKPIWST------SKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDLF+E +P W K+ +G +++CML N+ GN+ ++G + +
Sbjct: 343 GDLIVLDLFSESRPQWGDPASTWYRKEGFGKHDWLYCMLLNYGGNVGLHGKMRHVIDEFY 402
Query: 228 EARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP 286
+A+TS T+ GVGM+MEG E N V+++L+ E+ ++ + + W+ Y+ RYG++
Sbjct: 403 KAKTSPFGKTLKGVGMTMEGSENNSVMFELLCELPWRPAQFEKDEWLKNYTAARYGKADA 462
Query: 287 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+Q AW +L +++YNC D T + + V A P +D YQ
Sbjct: 463 TVQQAWLLLSNSIYNCPDANTQQGTHESVFCARPGMD---------VYQ----------- 502
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
SS+ +Y EVIRA + +++ + +N + YDL+D+ RQA+A+ ++
Sbjct: 503 ----VSSWSEMVKYYEPEEVIRAAGILLSAADRFKGNNNFEYDLVDIVRQAVAEKGRLVY 558
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+I+A + + S+RFL L+ D LLA F +G W+E A+ L +E++K
Sbjct: 559 PIMIDALKAGEKELFAAASQRFLNLILLQDRLLATRPEFKVGTWIEKARNLGTTQEEKKL 618
Query: 465 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES--GD 522
YEWNAR QI W + T + LRDY +K W+G+LRD+Y R ++ L
Sbjct: 619 YEWNARVQIATWGNRTAADEGGLRDYAHKEWNGMLRDFYYHRWKLWIDAQTAQLNGAPAQ 678
Query: 523 GFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 565
GF W TND YP GD + ++ Y +
Sbjct: 679 GFDFYAIEEPWTLQTND-------YPSHPEGDVIEVARTAYKE 714
>gi|424666301|ref|ZP_18103337.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
616]
gi|404573840|gb|EKA78592.1| hypothetical protein HMPREF1205_02176 [Bacteroides fragilis HMW
616]
Length = 718
Score = 288 bits (738), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 187/577 (32%), Positives = 287/577 (49%), Gaps = 70/577 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ + Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +++CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A +T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+ + W+ +Y RYG
Sbjct: 404 LAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHEAVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y +VI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S++FL L+ D LL F +G W+E A+ L E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEKA 619
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF + + LE
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDSLSQKLEGKTP 679
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
++ ++ + W N Y E+ GD + T++
Sbjct: 680 EKI-----DFYAVEEPWAKATNPYSAEAEGDCIETAK 711
>gi|423248659|ref|ZP_17229675.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
CL03T00C08]
gi|423253608|ref|ZP_17234539.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
CL03T12C07]
gi|392655237|gb|EIY48880.1| hypothetical protein HMPREF1067_01183 [Bacteroides fragilis
CL03T12C07]
gi|392657600|gb|EIY51231.1| hypothetical protein HMPREF1066_00685 [Bacteroides fragilis
CL03T00C08]
Length = 718
Score = 288 bits (738), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 185/577 (32%), Positives = 289/577 (50%), Gaps = 70/577 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMHEYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|326437768|gb|EGD83338.1| lysosomal alpha-N-acetyl glucosaminidase [Salpingoeca sp. ATCC
50818]
Length = 820
Score = 288 bits (738), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 182/581 (31%), Positives = 276/581 (47%), Gaps = 60/581 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL W PL + W Q LQ KIL R ELGM LP F+G+VP A++ +FP A +T
Sbjct: 215 MGNLKYWAAPLDKDWRTSQYNLQLKILSRARELGMVSALPGFAGHVPTAIKRIFPHANLT 274
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q W + S + LL TDPLF+++G F + +K +G T H++ DT++E P
Sbjct: 275 QTAGWANFNS--TYSDVSLLQPTDPLFLQLGTKFYKMLIKAFG-TDHVFQMDTYNEMQPS 331
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ ++ +Y M + D +AV+LMQGWLF ++ +W P +K L+ VP K+++L
Sbjct: 332 FTNMTLLAESNRVVYQAMANADPEAVYLMQGWLF-HESYWTPEHVKVYLSGVPDDKMIIL 390
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL E P++S + ++G +IW ML N+ G +YG I+ P+ TM G+
Sbjct: 391 DLNTEANPVFSLTSDYFGKLWIWNMLLNYGGRRGLYGNATDISTRPLLDLHRAQGTMDGI 450
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E IE NPV+++LM EM + D+ WI Y+ RYG+ Q AW +L VY
Sbjct: 451 GITPEAIENNPVMFELMLEMGWHATPPDMHDWIAAYASSRYGKRESLTQSAWQLLLEHVY 510
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D +R + PD +S +E + N
Sbjct: 511 D----QPDIDRFHMEMVPD-----LSSSESRNSN-------------------------- 535
Query: 361 TSEVIRALELFI--ASGNELSASNTYRYDLIDLTRQA-----------LAKYANELFLNI 407
T+ +++A L + A L + + YDL+D+ RQA L + E NI
Sbjct: 536 TTALVQAWRLLVTAAVNGSLPITGPFSYDLVDVGRQALLNLWSDVRGMLVAHVKEYNANI 595
Query: 408 IEAYQLNDAH--GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
+ +H + L L++ D+D LL +LLG WLESAK A N ++
Sbjct: 596 DSSPSTAASHVPAIKSLFTLLLDITSDLDRLLGTDVNYLLGVWLESAKATAANADERATR 655
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E+NAR QIT+W + + + DY K W GL+ DYY R + + +L S
Sbjct: 656 EFNARNQITLWGPDGE-----ITDYAAKQWQGLVSDYYVKRWEMMHDATLSALNSSTKID 710
Query: 526 LKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
++ +K W N YP D + S + KY
Sbjct: 711 TSA-PKDTLKFEQAWGNENKTYPTAPQADVVKVSAAMLQKY 750
>gi|313145188|ref|ZP_07807381.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
gi|313133955|gb|EFR51315.1| glycoside hydrolase family 89 [Bacteroides fragilis 3_1_12]
Length = 718
Score = 288 bits (737), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 187/577 (32%), Positives = 287/577 (49%), Gaps = 70/577 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ + Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLGA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +++CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A +T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+ + W+ +Y RYG
Sbjct: 404 LAKADPHAGSTLKGVGMAPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y +VI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S++FL L+ D LL F +G W+E A+ L E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEKA 619
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF + + LE
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDSLSQKLEGKTP 679
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
++ ++ + W N Y E+ GD + T++
Sbjct: 680 EKI-----DFYAVEEPWAKATNPYSAEAEGDCIETAK 711
>gi|336417192|ref|ZP_08597519.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
3_8_47FAA]
gi|423297818|ref|ZP_17275878.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
CL03T12C18]
gi|335936512|gb|EGM98438.1| hypothetical protein HMPREF1017_04627 [Bacteroides ovatus
3_8_47FAA]
gi|392664455|gb|EIY57993.1| hypothetical protein HMPREF1070_04543 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 288 bits (736), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 174/569 (30%), Positives = 281/569 (49%), Gaps = 40/569 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K +
Sbjct: 194 MCNLDGWQSPLPKEWLSSQAELQEQIVAREREFNMQPVLPAFAGHVPAALKRVYPNIKTS 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+L + D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 254 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D A+WL WLF D W P++K+ L SVP KL++
Sbjct: 310 SWDTDSLGMMAKHIYESVAAVDPKAIWLQMTWLFYADIKHWTTPRIKSYLRSVPQDKLIL 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F E IW + ++G PY+WC L NF GN + G + ++ +A + + + G
Sbjct: 370 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVKLVSERLADALKNGGSNLKG 429
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EGI+ N +Y+ + + A+ + D K W + + RR G+ P + AW +L V
Sbjct: 430 VGSTLEGIDLNQFMYEFVLDKAWNSGQTD-KEWFLKLADRRTGKVSPEARKAWEILADKV 488
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + P+ V +G N +P LK P + Y
Sbjct: 489 Y-------------------IQPA--QVGQGTLTN-ARP-----CLKGNGHWTTKPTIEY 521
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+++ A L + + ++Y +DL+++ RQ L Y N + AY+ D +
Sbjct: 522 QPKDLVEAWRLLLLVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAGDIMMM 579
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ E++ D+D L++CH F L W+ A+ + + + YE NAR+ IT+W D+
Sbjct: 580 KNRGDKMREILADLDKLVSCHPTFSLNKWITDARDMGHDATSKNYYEMNARSLITIWGDS 639
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
L DY N+ W+GL YY R + +I+++E F + + E N+
Sbjct: 640 YH-----LTDYANRSWAGLTNQYYSVRWDRFINEVIKAVEKKKAFDEEVFFNESRMYENE 694
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKYLQ 568
W N N GD + ++ +Y KY +
Sbjct: 695 WVNPSNRINYNEGGDGIKLARQIYKKYAK 723
>gi|423280158|ref|ZP_17259071.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
610]
gi|404584494|gb|EKA89159.1| hypothetical protein HMPREF1203_03288 [Bacteroides fragilis HMW
610]
Length = 718
Score = 288 bits (736), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 187/577 (32%), Positives = 287/577 (49%), Gaps = 70/577 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTQQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ + Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ADFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLGA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +++CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWVYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A +T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+ + W+ +Y RYG
Sbjct: 404 LAKADPHAGSTLKGVGMTPEGIENNPVMYELVMELPWRAERFTKEEWLKEYVKARYGADD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTKLANSIYNSPKNLTQQGTHESVFSARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y +VI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQDVIEAARLMVSVADRYKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S++FL L+ D LL F +G W+E A+ L E++
Sbjct: 560 QKAVTAAYRSGDKELFGMASQKFLNLILLQDQLLGTRPEFRVGKWIEEARALGGTSEEKA 619
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF + + LE
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDSLSQKLEGKTP 679
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
++ ++ + W N Y E+ GD + T++
Sbjct: 680 EKI-----DFYAVEEPWTKATNPYSAEAEGDCIETAK 711
>gi|60680169|ref|YP_210313.1| alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC 9343]
gi|375357012|ref|YP_005109784.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
gi|383116930|ref|ZP_09937677.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
gi|60491603|emb|CAH06355.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis NCTC
9343]
gi|251947777|gb|EES88059.1| hypothetical protein BSHG_0978 [Bacteroides sp. 3_2_5]
gi|301161693|emb|CBW21233.1| putative alpha-N-acetylglucosaminidase [Bacteroides fragilis 638R]
Length = 718
Score = 288 bits (736), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 185/577 (32%), Positives = 288/577 (49%), Gaps = 70/577 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|423269418|ref|ZP_17248390.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
CL05T00C42]
gi|423273021|ref|ZP_17251968.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
CL05T12C13]
gi|392701212|gb|EIY94372.1| hypothetical protein HMPREF1079_01472 [Bacteroides fragilis
CL05T00C42]
gi|392708585|gb|EIZ01692.1| hypothetical protein HMPREF1080_00621 [Bacteroides fragilis
CL05T12C13]
Length = 718
Score = 287 bits (735), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 185/577 (32%), Positives = 288/577 (49%), Gaps = 70/577 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKAP 679
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSTEAEGDCIEVAK 711
>gi|423282107|ref|ZP_17260992.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
615]
gi|404582594|gb|EKA87288.1| hypothetical protein HMPREF1204_00530 [Bacteroides fragilis HMW
615]
Length = 718
Score = 287 bits (735), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 185/577 (32%), Positives = 288/577 (49%), Gaps = 70/577 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKAP 679
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|374385779|ref|ZP_09643282.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
12061]
gi|373225481|gb|EHP47815.1| hypothetical protein HMPREF9449_01668 [Odoribacter laneus YIT
12061]
Length = 715
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 179/542 (33%), Positives = 268/542 (49%), Gaps = 61/542 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW ++Q+ LQ +IL R+ E G+ PV P ++G + P
Sbjct: 186 MNNLEGWGGPNPESWYERQMQLQHRILNRMREYGIEPVFPGYAG--------MLPHNASE 237
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG VK WC L +P F I + + K +G+ + Y D F E
Sbjct: 238 KLG--IEVKDPGLWCGYQRPAFLYPENPAFKRIAGLYYMEMEKRFGK-AKFYGMDPFHEG 294
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
N +D +++ ++ M++ + +AVW+MQ W + P+ + ++ ++ G
Sbjct: 295 GNVQGID----LAAAAQSVLQAMKTANPEAVWVMQAWQAN-------PRHE-MITALQPG 342
Query: 176 KLVVLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
+++LDL +E +P+W K F G +++CML NF GN+ MYG +D + G
Sbjct: 343 NVLILDLSSENRPMWGDKESVWYREKGFEGQDWLYCMLLNFGGNVGMYGRMDRVINGFYA 402
Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
A N ++ GVG +MEGIE NPV+Y+L+ E+ ++ + W+ Y RYG+ P
Sbjct: 403 AVQHPNGASLRGVGKTMEGIENNPVMYELLLELPWRKIPFTKEEWLKGYVKARYGKDDPR 462
Query: 288 IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKS 347
+Q AW +L YNC V A P A S
Sbjct: 463 LQQAWQILGKAAYNCPVVQEGTTESVFCARP------------------------AEEIS 498
Query: 348 ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNI 407
SS+ L+Y+ E + LF+ + +N + YDL D+ RQALA N L I
Sbjct: 499 GASSWGTSELYYAPEESKKVAALFLEVSEQYKGNNNFEYDLTDIMRQALADKGNVLQKKI 558
Query: 408 IEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEW 467
EAY+L D LSR FL+L+ D LLA F LG WLE AK + EE+++ YEW
Sbjct: 559 TEAYRLKDETAFRNLSREFLQLILWQDTLLATRPEFRLGTWLERAKAKGETEEEKRLYEW 618
Query: 468 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 527
NAR QIT W + + LRDY ++ W+GLL+D+Y PR YF ++E +G+
Sbjct: 619 NARVQITTWGNRQAADKGGLRDYSHREWAGLLKDFYYPRWKAYFD-LLEKRLAGEETEDI 677
Query: 528 DW 529
DW
Sbjct: 678 DW 679
>gi|265765312|ref|ZP_06093587.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
gi|263254696|gb|EEZ26130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_16]
Length = 718
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 185/577 (32%), Positives = 288/577 (49%), Gaps = 70/577 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ ++
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIENLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 FYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRMEGKAP 679
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|336408181|ref|ZP_08588675.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
gi|335939481|gb|EGN01355.1| hypothetical protein HMPREF1018_00690 [Bacteroides sp. 2_1_56FAA]
Length = 718
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 184/577 (31%), Positives = 287/577 (49%), Gaps = 70/577 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW +Q+ LQKKIL R+ E G+ P+LP + G VP +
Sbjct: 188 MNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPMLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L +DP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 240 KLG--LNV-SDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGK-ANFYSMDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
NT VD + + G A+ M+ + AVW+ Q W + P ++ +
Sbjct: 296 GGNTAGVD----LDAAGKAVMKAMKKANPKAVWVAQAWQANPRP--------KMIEDLKA 343
Query: 175 GKLVVLDLFAEVKPIWSTS------KQFYGV-PYIWCMLHNFAGNIEMYGILDSIA--FG 225
G L++LDL +E +P W S K YG +I+CML N+ GN+ ++G +D++ F
Sbjct: 344 GDLLILDLTSECRPQWGDSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ ++ + W+ +Y RYG
Sbjct: 404 LAKADPHASATLKGVGMTPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDD 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P +Q AW L +++YN T + + V A P E YQ
Sbjct: 464 PVVQAAWTNLANSIYNSPKNLTQQGTHESVFCARP---------AEDVYQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y EVI A L ++ + +N + YDL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNFEYDLVDIVRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ AY+ D S +FL+L+ D LL F +G W+E A+ L E+++
Sbjct: 560 QKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFRVGKWIEEARALGDTPEEKE 619
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+GLL+D+Y R +YF ++ + +E
Sbjct: 620 LYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYYMRWKLYFDFLSQRIEGKTP 679
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+ ++ + W N Y E+ GD + ++
Sbjct: 680 AEI-----DFYAIEEPWTKAANPYSAEAEGDCIEVAK 711
>gi|393236266|gb|EJD43816.1| putative alpha-N-acetylglucosaminidase [Auricularia delicata
TFB-10046 SS5]
Length = 778
Score = 286 bits (731), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 173/561 (30%), Positives = 287/561 (51%), Gaps = 56/561 (9%)
Query: 8 GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
G LP W+D Q LQKKI+ R+ ELGM P LP+F+G VP A+ V P A + W S
Sbjct: 219 GSSLPMEWIDDQFELQKKIVRRMVELGMTPALPSFTGFVPRAISRVLPGASVVNGSRW-S 277
Query: 68 VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
D T+L + DP F + ++FIE+Q+ YG SH+Y D ++EN P + Y+
Sbjct: 278 GFPDALTRVTFL-EPFDPAFARLQKSFIEKQIAAYGPVSHVYTLDQYNENDPLKNDVGYL 336
Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KLVVLDLFAE 185
+ + + +++ D DA+WLMQGWLF S FW +++A L V +++LDLF+E
Sbjct: 337 RDVSRSTWQSLKAADPDAIWLMQGWLFYSNRGFWTNARVEAFLGGVEKNDDMLILDLFSE 396
Query: 186 VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSME 245
+P W + +YG P+IWC LH++ GN+ +YG + +I VEA ++ ++VG G++ME
Sbjct: 397 SEPQWQRTNSYYGKPWIWCQLHDYGGNLGLYGQVMNITLNAVEA-LEKSPSLVGFGLTME 455
Query: 246 GIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY------GRSVP-AIQDAWNVLYHT 298
G E N ++YDL+ A+ + +D ++ ++ RRY G +P AI +AW++L T
Sbjct: 456 GQEGNEIMYDLLLSQAWSRKPIDTASYFRSWATRRYNAGGIIGSLLPSAIYNAWDILRTT 515
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYN T A++ V + ++ P++ + + + + +
Sbjct: 516 VYNNTKLASNA---VTKSVFELRPALSGI-------------------ANRTGHHATTIT 553
Query: 359 YSTSEVIRALELFIASGNELSA----SNTYRYDLIDLTRQALAKYANELFLNIIEAYQ-- 412
Y T +++A +LF + A + Y +D +D RQ L+ + + +++ Y
Sbjct: 554 YDTQALVKAYDLFDKAAIYTPALWFNNPAYEFDNVDFARQVLSNAFSTQYDDLVATYNEI 613
Query: 413 ---------LNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
L +A + R + ++ +D +L F L WL+ A+ A+ +E
Sbjct: 614 SKPGGSGATLAEAAKIIHDKGERMMGVLASLDKVLRTSKHFTLKKWLQDARAWARGGHEE 673
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 522
+E+NAR QIT+W Q + DYG+K W GL+ +YY R I+F Y+ + +G
Sbjct: 674 -LFEYNARNQITLWGPTGQ-----INDYGSKAWGGLVSEYYAQRWRIFFTYLESVVAAGQ 727
Query: 523 GFRLKDWRREWIKLTNDWQNG 543
F L +++ DWQ
Sbjct: 728 PFNLTAVGNQFLAFQLDWQTA 748
>gi|83775903|dbj|BAE66022.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 633
Score = 286 bits (731), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 173/533 (32%), Positives = 282/533 (52%), Gaps = 49/533 (9%)
Query: 1 MSNLHG-WGG-PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 73 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 132
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L+ D F ++ ++ I +Q + +G +H+Y D F+E
Sbjct: 133 VVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISRQTRAFGNVTHVYALDQFNEIN 190
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW P ++ A L+ V
Sbjct: 191 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDPNRISAYLSGVERND 249
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA +++
Sbjct: 250 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEA-LNKSD 308
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVP-AIQDAW 292
++VG G++MEG E N +VYDL+ + A+ + +D +A+ + RY SVP + AW
Sbjct: 309 SLVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQSWVRSRYSGNFSVPNELYTAW 368
Query: 293 NVLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEGKYQ-----NYGKPVSKEAVLK 346
++L TVYN T+ T + + PD+ + V G Y NY V E
Sbjct: 369 DLLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--GHYPTPTSINYDPMVLNEVWSL 426
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
++ P LW+S + Y YD++D+TRQ + ++ +
Sbjct: 427 FMNATRKEPSLWHSPA---------------------YEYDMVDITRQLMGNAFVNVYSD 465
Query: 407 IIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E +
Sbjct: 466 LISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNENFSLTTWISSARDWGNTTETKD 525
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 516
+E+NAR QIT+W + + DY +K W+GL+ YY PR +I+ Y+ E
Sbjct: 526 FFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGE 573
>gi|452988463|gb|EME88218.1| glycoside hydrolase family 89 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 772
Score = 285 bits (730), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 166/545 (30%), Positives = 285/545 (52%), Gaps = 52/545 (9%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LPQSW+D Q L KKI+ R+ ELGM PVLP F+G VP + ++P+A
Sbjct: 202 NIQGSWGGDLPQSWIDHQFELNKKIVARMVELGMTPVLPCFTGFVPTQISRLYPNASFVN 261
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W +++ + L+ DPLF + ++FI +Q++ YG S IY D ++EN P
Sbjct: 262 GSRWNGFQAE--YTNVTFLEPFDPLFTTLQKSFISKQIEAYGNVSSIYTLDQYNENDPFS 319
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVL 180
Y+ ++ + +++ D +A+W +QGWLF S FW +++A L V +++L
Sbjct: 320 GELAYLKNVTSNTIKSLKAADPEAIWFIQGWLFYSSADFWTDERVEAYLGGVANEDMLIL 379
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DLF+E +P W + ++G P+IWC LH++ GN ++G ++++ PV+A ++ +TMVG+
Sbjct: 380 DLFSESQPQWQRTNSYFGKPWIWCQLHDYGGNQGLHGQVENVTINPVQALANKTSTMVGM 439
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPA-IQDAWNVLYHT 298
G +MEG E N ++YD++ + A+ E +D ++ + + RY G +P+ + AW+V+ T
Sbjct: 440 GSTMEGQEGNEIIYDILLDQAWSKEPIDSDSYFHDWVTSRYAGSKLPSGLYTAWDVMRQT 499
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYN TD + V + +++P+ + + + + VL S +
Sbjct: 500 VYNSTD--IEAAEAVTKSIFELEPNTTGLLNRRGHHSTLILYDPNVLVSAWN-------- 549
Query: 359 YSTSEVIRALELFIASGNELSA--SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
+L+ AS +++ Y++DL+D TRQ LA L+ + + +
Sbjct: 550 ----------DLYNASNDDIQLWDVKAYQFDLVDTTRQVLANAFYPLYTDFVHSAN-KSV 598
Query: 417 HGVF------QLSRRFLELVEDMDGLLACHDG--FLLGPWLESAKQLAQNEEQEKQ---- 464
G + + + + L++D+D +L F L W+ESA+ A E+
Sbjct: 599 QGTYSPTKAEEKGKEMIMLLKDLDSVLEASGNAHFKLSSWIESARLWAPAEDYADDKNTT 658
Query: 465 ------YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL 518
YE+ AR QIT+W N + + DY +K W+GL+R YY PR + + + S
Sbjct: 659 AKIADFYEYTARNQITLWGPNGE-----ISDYASKQWAGLIRSYYVPRWQRFVDFTLNST 713
Query: 519 ESGDG 523
S +G
Sbjct: 714 TSMNG 718
>gi|325299497|ref|YP_004259414.1| alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
gi|324319050|gb|ADY36941.1| Alpha-N-acetylglucosaminidase [Bacteroides salanitronis DSM 18170]
Length = 723
Score = 285 bits (729), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 185/572 (32%), Positives = 289/572 (50%), Gaps = 62/572 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PVLP +SG VP A Q + +
Sbjct: 190 MNNLEGWGGPNPDSWYTQQEALQKKILKRMREYGIEPVLPGYSGMVPHDAHQKLGLNVTE 249
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+L N F+ + L TD F EI + E+Q K +G+ ++ Y+ D F E N
Sbjct: 250 PELWNGFTRPA--------FLMPTDKRFAEIAALYYEEQEKLFGKANY-YSMDPFHELEN 300
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G A+ M+ + AVW++QGW + +P RP MK L N G L
Sbjct: 301 AGEVD----FDAAGKAVMDAMKQVNPKAVWVVQGW--TENP--RPEMMKNLKN----GDL 348
Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
++LDLF+E +P IW K + +++CML NF N+ ++G +D + +
Sbjct: 349 LILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLLNNFYLTKN 408
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+ + G+G++MEG E NPV+++LM E+ ++ EK+ ++W+ +Y RYG I+
Sbjct: 409 NPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKITKESWLKEYLAARYGAKDEKIEQ 468
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW +L +YNC G + + + P ++ +S + K +NY P S EA
Sbjct: 469 AWMILADGIYNCPFGNNQQGPHESIFCGRPSMNNFQVS-SWSKMENYYDPTSTEA----- 522
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
A L + + ++ +N + YDL+D+ RQALA ++ I
Sbjct: 523 ------------------AARLMLEAADKFRGNNNFEYDLVDIVRQALADRGRIVYNRAI 564
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 468
++ D + S+ FL L+ D LLA F +G W+ A+ L E++ YEWN
Sbjct: 565 ADFKSFDKRSYARHSKEFLNLLLAQDRLLATRSEFRVGRWINQARSLGNTPEEKDLYEWN 624
Query: 469 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 528
AR QIT W + + LRDY +K W+G+L+D+Y R A +++ + L DG ++D
Sbjct: 625 ARVQITTWGNRECADKGGLRDYAHKEWNGILKDFYYKRWAAWWEMLQGVL---DGGEMQD 681
Query: 529 WRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
+W + W N Y E+ GD + T++
Sbjct: 682 --IDWYAMEEPWTLQHNPYKAEAEGDCIETAR 711
>gi|336386984|gb|EGO28130.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 738
Score = 285 bits (728), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 169/536 (31%), Positives = 289/536 (53%), Gaps = 47/536 (8%)
Query: 7 WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW- 65
WGG LP+ W++ Q LQK+I+ R+ ELGM PVLP+F+G VP A+ ++P+A I W
Sbjct: 191 WGGDLPEQWINDQFALQKQIISRMVELGMTPVLPSFTGFVPRAMHTLYPNASIVNGSQWN 250
Query: 66 -FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
F+++ + L+ DPLF + +FI +Q+ YG SH+Y D ++EN+P
Sbjct: 251 GFTIQ----YTNDSFLEPFDPLFSTLQTSFISKQVAAYGNVSHVYTLDQYNENSPYSGDT 306
Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-KLVVLDL 182
Y++++ AA ++ +++ D AVWLMQGWLF D FW +++A L VP +++LDL
Sbjct: 307 SYLANVTAATFASLRAADPQAVWLMQGWLFYSDSTFWTTERVEAYLGGVPGNDSMIILDL 366
Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGM 242
++E +P W ++G +IWC LH++ GN+ G +++ P++A + +MVG+G+
Sbjct: 367 YSEAQPQWQRLNSYFGKQWIWCELHDYGGNMGFEGNFENVTTQPIKALATPGNSMVGMGL 426
Query: 243 SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP----AIQDAWNVLYHT 298
+MEG E N ++YD++ + A+ ++ A+I+ ++ RRY +VP A +AW +L T
Sbjct: 427 TMEGQEGNEIIYDVLLDQAWSSTPLNRTAYISAWASRRY--NVPDLPTAALEAWEILGAT 484
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYN D T I+ ++ PSI L + T ++ +
Sbjct: 485 VYNNQDVTTQSTVKSIL---ELSPSITG------------------LVNRTGTHSTKLFY 523
Query: 359 YSTSEVIRALELFIASGNELSA-SNT--YRYDLIDLTRQALAKYANELFLNIIEAY--QL 413
+ + ++ AL+L + + E SA SN ++YD++D+TRQ LA +L+ ++I+ +
Sbjct: 524 DTNTTIVPALKLLLQARQEASALSNIPEFQYDVVDVTRQLLANRFIDLYTSLIDTFSSTS 583
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ--NEEQEKQYEWNART 471
+ + V L L++D+D +L FLL W+ +A+ N E+NAR
Sbjct: 584 SSSSAVSAAGAPLLALLQDLDSVLLTDTHFLLARWISAARNWTHGDNATYAAYLEYNARN 643
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 527
Q+T+W + + DY +K W GL+ YY R + Y+ S E+ + +
Sbjct: 644 QVTLWGPRGE-----VNDYASKQWGGLVGTYYVQRWETFVGYLAGSKENATVYNVS 694
>gi|317158657|ref|XP_001827155.2| alpha-N-acetylglucosaminidase [Aspergillus oryzae RIB40]
Length = 849
Score = 285 bits (728), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 173/533 (32%), Positives = 282/533 (52%), Gaps = 49/533 (9%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 157 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 216
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L+ D F ++ ++ I +Q + +G +H+Y D F+E
Sbjct: 217 VVNGSQWSGFQK--KFTEVSFLNPLDETFAQLQKSVISRQTRAFGNVTHVYALDQFNEIN 274
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW P ++ A L+ V
Sbjct: 275 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDPNRISAYLSGVERND 333
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA +++
Sbjct: 334 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEA-LNKSD 392
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVP-AIQDAW 292
++VG G++MEG E N +VYDL+ + A+ + +D +A+ + RY SVP + AW
Sbjct: 393 SLVGFGLTMEGQEGNEIVYDLLLDQAWSAKPIDTRAYFQSWVRSRYSGNFSVPNELYTAW 452
Query: 293 NVLYHTVYNCTDGAT-DKNRDVIVAFPDVDPSIISVTEGKYQ-----NYGKPVSKEAVLK 346
++L TVYN T+ T + + PD+ + V G Y NY V E
Sbjct: 453 DLLRKTVYNNTNLTTYSLTKSIFEISPDIAGLVGRV--GHYPTPTSINYDPMVLNEVWSL 510
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
++ P LW+S + Y YD++D+TRQ + ++ +
Sbjct: 511 FMNATRKEPSLWHSPA---------------------YEYDMVDITRQLMGNAFVNVYSD 549
Query: 407 IIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E +
Sbjct: 550 LISSWKSETENRTTNVTSQSERLLNLLSAIDKVLSCNENFSLTTWISSARDWGNTTETKD 609
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 516
+E+NAR QIT+W + + DY +K W+GL+ YY PR +I+ Y+ E
Sbjct: 610 FFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGE 657
>gi|336374066|gb|EGO02404.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 761
Score = 285 bits (728), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 169/536 (31%), Positives = 289/536 (53%), Gaps = 47/536 (8%)
Query: 7 WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW- 65
WGG LP+ W++ Q LQK+I+ R+ ELGM PVLP+F+G VP A+ ++P+A I W
Sbjct: 214 WGGDLPEQWINDQFALQKQIISRMVELGMTPVLPSFTGFVPRAMHTLYPNASIVNGSQWN 273
Query: 66 -FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
F+++ + L+ DPLF + +FI +Q+ YG SH+Y D ++EN+P
Sbjct: 274 GFTIQ----YTNDSFLEPFDPLFSTLQTSFISKQVAAYGNVSHVYTLDQYNENSPYSGDT 329
Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-KLVVLDL 182
Y++++ AA ++ +++ D AVWLMQGWLF D FW +++A L VP +++LDL
Sbjct: 330 SYLANVTAATFASLRAADPQAVWLMQGWLFYSDSTFWTTERVEAYLGGVPGNDSMIILDL 389
Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGM 242
++E +P W ++G +IWC LH++ GN+ G +++ P++A + +MVG+G+
Sbjct: 390 YSEAQPQWQRLNSYFGKQWIWCELHDYGGNMGFEGNFENVTTQPIKALATPGNSMVGMGL 449
Query: 243 SMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP----AIQDAWNVLYHT 298
+MEG E N ++YD++ + A+ ++ A+I+ ++ RRY +VP A +AW +L T
Sbjct: 450 TMEGQEGNEIIYDVLLDQAWSSTPLNRTAYISAWASRRY--NVPDLPTAALEAWEILGAT 507
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
VYN D T I+ ++ PSI L + T ++ +
Sbjct: 508 VYNNQDVTTQSTVKSIL---ELSPSITG------------------LVNRTGTHSTKLFY 546
Query: 359 YSTSEVIRALELFIASGNELSA-SNT--YRYDLIDLTRQALAKYANELFLNIIEAY--QL 413
+ + ++ AL+L + + E SA SN ++YD++D+TRQ LA +L+ ++I+ +
Sbjct: 547 DTNTTIVPALKLLLQARQEASALSNIPEFQYDVVDVTRQLLANRFIDLYTSLIDTFSSTS 606
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ--NEEQEKQYEWNART 471
+ + V L L++D+D +L FLL W+ +A+ N E+NAR
Sbjct: 607 SSSSAVSAAGAPLLALLQDLDSVLLTDTHFLLARWISAARNWTHGDNATYAAYLEYNARN 666
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLK 527
Q+T+W + + DY +K W GL+ YY R + Y+ S E+ + +
Sbjct: 667 QVTLWGPRGE-----VNDYASKQWGGLVGTYYVQRWETFVGYLAGSKENATVYNVS 717
>gi|393783261|ref|ZP_10371436.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
CL02T12C01]
gi|392669540|gb|EIY63028.1| hypothetical protein HMPREF1071_02304 [Bacteroides salyersiae
CL02T12C01]
Length = 724
Score = 285 bits (728), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 170/567 (29%), Positives = 279/567 (49%), Gaps = 46/567 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ GW GPLP WLD Q+ LQKKIL R EL M PVLPAF+G+VP AL+ +FP A I
Sbjct: 198 MANIDGWNGPLPMHWLDSQVELQKKILTRERELNMKPVLPAFAGHVPGALKRIFPEANIQ 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + R + L+ + LF I + +I++Q + +G T HIY D F+E PP
Sbjct: 258 NLGKWAGFAEEYR---CHFLNPEEALFATIQKQYIKEQTRLFG-TDHIYGVDPFNEVDPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY+S + A +Y + + D A W+ W+F +D W P++KA+L VP GK+V+
Sbjct: 314 SWEPEYLSKVSADMYHTLTAADPKAEWMQMTWMFYFDRKDWTAPRVKAMLTGVPQGKMVL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W T++ F+G PYIWC L NF GN + G + + + G
Sbjct: 374 LDYHCENVELWKTTEHFHGQPYIWCYLGNFGGNTTLTGNVKESGARLDNTLINGGSNFKG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ D ++W+N + R G + +++AW++L++ V
Sbjct: 434 IGSTLEGLDVMQFPYEYIFEKAWTL-NTDDRSWLNALADRHTGVTSEPVREAWDILFNQV 492
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + P++ P + KP ++ ++ Y
Sbjct: 493 YVQVP-------RTLAVLPNLRPVM-----------NKPNNRTSIN-------------Y 521
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +++A + + + + + R D+I + RQ L Y + + Y+ D +
Sbjct: 522 PNTALLQAWQKLLQAPD--CNRDALRLDIITVGRQLLGNYFLTVKDDFDRMYEAKDLPAL 579
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ E++ D++ L A H L W+ A++ E + YE NAR IT W
Sbjct: 580 KARAAEMREILNDLERLNAFHSRCSLDKWISDARKYGNTPELKNYYEKNARNLITTW--- 636
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
L DY ++ W+GL++DYY R +Y ++ ++E+ F + E+ +
Sbjct: 637 ----GGRLNDYASRTWAGLIKDYYSKRWDMYLDAVVAAVENNREFDQEKLDGEFRLFEDS 692
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W + V GD LI +++L NKY
Sbjct: 693 WVSSTRPVEVTPEGDLLIYARFLLNKY 719
>gi|391873368|gb|EIT82411.1| alpha-N-acetylglucosaminidase [Aspergillus oryzae 3.042]
Length = 633
Score = 285 bits (728), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 172/536 (32%), Positives = 284/536 (52%), Gaps = 53/536 (9%)
Query: 1 MSNLHG-WGG-PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 73 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMTPVLPAFPGFVPPAIKRVRPHAT 132
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L D F ++ ++ I +Q++ +G +H+Y D F+E
Sbjct: 133 VVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISRQMRAFGNITHVYALDQFNEIN 190
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW ++ A L+ V
Sbjct: 191 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDSNRISAYLSGVERND 249
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA N
Sbjct: 250 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEALNKSN- 308
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVP-AIQDAW 292
++VG G++MEG E N +VYDL+ + A+ +D +A+ + RY R SVP + AW
Sbjct: 309 SLVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQSWVRSRYSRNFSVPNELYTAW 368
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS--KEAVLKSET- 349
++L TVYN T+ T V + ++ P I + G+ +Y P S + ++ +E
Sbjct: 369 DLLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GRVGHYPTPTSINYDPMVLNEVW 424
Query: 350 -----SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
++ P LW++ + Y YD++D+TRQ + ++
Sbjct: 425 SLFMNATRKEPSLWHNPA---------------------YEYDMVDITRQLMGNAFVNVY 463
Query: 405 LNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E
Sbjct: 464 SVLITSWKSETENRTTKVTSQSERLLNLLSAIDKVLSCNENFSLATWISSARDWGNTTET 523
Query: 462 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 517
+ +E+NAR QIT+W + + DY +K W+GL+ YY PR +I+ Y+ E+
Sbjct: 524 KDFFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGEN 574
>gi|340520426|gb|EGR50662.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
Length = 747
Score = 284 bits (727), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 168/520 (32%), Positives = 286/520 (55%), Gaps = 40/520 (7%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP+SW+D+Q LQ KIL R+ ELG+ P+LPAF G VP + VFP ++
Sbjct: 203 NIQGSWGGTLPRSWVDEQFSLQLKILKRMEELGITPILPAFPGFVPRNISRVFPDISLST 262
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + + ++ DP F ++ + FI +Q + YG ++ + D F+EN P
Sbjct: 263 SPIWSNFGTTL--SADIYINPFDPRFAQLQKLFINKQQELYGNVTNFWTLDQFNENRPLS 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-LVV 179
+Y+ ++ ++ +++ D +AVW+MQ WLFS D FW +++ALL VP+ + +++
Sbjct: 321 GDLDYLRNVSHNTWAALKAADPEAVWVMQAWLFSSDSSFWTNDRVEALLGGVPVNQDMLL 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMV 238
LDLFAE P W + FYG P+IWC LHN+ GN+ +YG ++++ ++A R S+ ++V
Sbjct: 381 LDLFAESAPQWQRTDSFYGKPWIWCELHNYGGNMGLYGQIENVTINSMDAVRNSD--SIV 438
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVPAIQDAWNVLYH 297
G G++MEG E N ++YDL+ + A+ + +D + + + RYG ++V + W +L
Sbjct: 439 GFGLTMEGQEGNEIMYDLLLDQAWSPKPIDTDTYFHDWVSARYGAKNVKGLYKGWEMLRP 498
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
TV+N T+ + + I+ ++ PSI + G+ +G + + + E S
Sbjct: 499 TVFNNTNLTVNAVQKSIL---ELTPSISGLL-GRTGRHGTTIMYDPAVMVEAWS------ 548
Query: 358 WYSTSEVIRALELFIASGNELSASN--TYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
ELF A +L+ N +Y+YDL+D TRQ L + + ++++AY +
Sbjct: 549 -----------ELFKAGLQDLTLFNNPSYQYDLVDWTRQVLVNSFEDHYKDLVDAYNKSS 597
Query: 416 AHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 474
+ V + + + L++ +D +LA + F L PW++ A+ A + +E+NAR QIT
Sbjct: 598 SPTVIRTRGAKLVTLLKTLDAVLATNKNFQLTPWIDRAR--ASSPSSANFFEFNARNQIT 655
Query: 475 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 514
+W Q E DY +K W+GL+ YY R + Y+
Sbjct: 656 LWGPQGQIE-----DYASKQWAGLVGTYYAERWQQFVDYL 690
>gi|153808241|ref|ZP_01960909.1| hypothetical protein BACCAC_02529 [Bacteroides caccae ATCC 43185]
gi|423219048|ref|ZP_17205544.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
CL03T12C61]
gi|149129144|gb|EDM20360.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
gi|392625814|gb|EIY19870.1| hypothetical protein HMPREF1061_02317 [Bacteroides caccae
CL03T12C61]
Length = 752
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 183/534 (34%), Positives = 264/534 (49%), Gaps = 54/534 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ ++L KKI+ R ELGM P+ FSG VP L++ +P AKI
Sbjct: 193 MQNLQSYGGPLPKSWIDKHIILAKKIIDRERELGMTPIQQGFSGYVPRELKDKYPEAKI- 251
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF +GR F+E++ K YG T IY D F E+
Sbjct: 252 --------RLQPGWCGFKGAGQLDPTDALFATLGRDFLEEEKKLYG-TYGIYAADPFHES 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPV++PEY+S++G AIY ++ D A W MQ W R P +KA VP L
Sbjct: 303 APPVNTPEYLSAVGHAIYKLIKDFDPKAKWAMQAWSL------REPIVKA----VPQNDL 352
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL E K F+G P + LHNF G I M+G L +A + +
Sbjct: 353 IILDLNGEK---IKGRKGFWGYPAVEGNLHNFGGRINMHGDLRLLASNQYMTALKQYPNV 409
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G G+ ME IEQNPV YDL EM +V ++ W+ QY+ RRYG P+ Q A L
Sbjct: 410 CGSGLFMEAIEQNPVYYDLAFEMPLHKGEVAIEEWLKQYANRRYGAVSPSAQQAMICLLE 469
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y T+ R I+A P++ G G P
Sbjct: 470 GPYRPGTNGTE--RSSIIA---ARPALNVKKSGPNAGLGIP------------------- 505
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YS VI+A L + ++L S YR+D+ID+ RQ + + EA+ D
Sbjct: 506 -YSPLLVIQAEGLLLKDADKLKNSEPYRFDVIDVQRQMMTNMGQVIHKRAAEAFLNRDKE 564
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
S+RFL+++ED+D LL F WL SA+ EE++ E++A + +T+W
Sbjct: 565 AFALHSKRFLQMLEDVDELLRTRPEFNFDRWLTSARSWGDTEEEKNLLEYDATSLVTIW- 623
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRR 531
+ S+ DY + W+GL++ YY PR ++ + E L++G + + R+
Sbjct: 624 -GADGDPSIF-DYSWREWTGLIKGYYLPRWTKFYAMLQEHLDNGTTYSEEGLRQ 675
>gi|404406328|ref|ZP_10997912.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 738
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 169/525 (32%), Positives = 252/525 (48%), Gaps = 39/525 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSNL W PLPQSWLD Q+ LQK+I+ R EL M PVLPAF+G+VPA L ++P AKI+
Sbjct: 206 MSNLDYWQSPLPQSWLDAQVELQKRIVARERELNMKPVLPAFAGHVPAELGEIYPEAKIS 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W + R ++ LD DPLF I R F+ +Q +G T HIY D F+E PP
Sbjct: 266 RMSKWGGFEDRYR---SHFLDPLDPLFARIQREFLAEQTALFG-TDHIYGADPFNEVDPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PE+++ + IY M D +A WL WLF D W +++A + +VP K+++
Sbjct: 322 SWEPEFLARVSRTIYDTMTEADPEAEWLQMTWLFYLDRDKWHDDRIEAFVTAVPQDKMLL 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W + ++G PY WC L NF GN + G D ++ + G
Sbjct: 382 LDYYCENTEVWRQTHSYHGQPYFWCYLGNFGGNTMLVGNFDEVSKRIDGVLAEGGNNLRG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ NP +YD + E A+ VD W + + R G + AW+ L V
Sbjct: 442 LGSTLEGLDSNPFMYDYVFERAWDF-PVDDDRWFDALADRYLGYEDTGYRRAWDALRKNV 500
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y I S G + E +L T + + Y
Sbjct: 501 Y-----------------------ITSSKYGHCPLLNARPTLEGILTGTTDA----EIKY 533
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+ I +G+ S +TYRY L+++ RQ L L A + D +
Sbjct: 534 DNDELFEVWAKMIDAGD--SGRDTYRYWLVNVGRQTLGNLFLPLRDGFTAACRAKDLARM 591
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+L LEL D++ L A H F + W++ ++ E+ YE N RT +T W D
Sbjct: 592 KELRSEMLELAADLETLTAQHGAFSMQKWIDDSRSFGTTPEERDYYEVNGRTLLTTWGDR 651
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 524
Q + DY N+ WSGL+ DYY R ++ + ++E+G F
Sbjct: 652 AQS----INDYANRTWSGLVADYYAERWRMFLDAAVGAVEAGRKF 692
>gi|29348998|ref|NP_812501.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340905|gb|AAO78695.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 732
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 176/567 (31%), Positives = 282/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 196 MSNVDFWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D D L+ I R F+E+Q K YG T HIY D F+E P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
S ++++++ + IY + DS A WL W+F YD W P++++ L +VP KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDNKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++++YG PYIWC L NF GN + G L+ I F + G
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP++Y+ + + A+ + V WI +S+ R G I AW L+ +
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDY-PVTTDQWITNWSMCRGGNQDANIIKAWRALHQKI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T+ AT ++ A P +T K N + Y + LW
Sbjct: 491 Y--TEHATCGQSVLMNARP-------RLTGTKSWNTNPGI-----------HYANNDLWQ 530
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A + ++ +R+D+I++ RQ L +E Y D G+
Sbjct: 531 IWKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSEYRDQFTACYNRKDTTGM 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ S R L+ D+D LL+C +G WL+ A+ ++ YE NAR +T+W
Sbjct: 582 REWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCGATVSEKDYYEENARCILTVW--- 638
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++ + L DY N+ W GL R +Y R + +I ++ F + ++ + +
Sbjct: 639 -GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIAAVSEDKPFDEDKFHQDITQFEYN 697
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W ++ +P+ S D + + L KY
Sbjct: 698 WTLQKDSFPIVSEEDPIQIADSLILKY 724
>gi|298386708|ref|ZP_06996263.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
gi|298260382|gb|EFI03251.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 1_1_14]
Length = 732
Score = 282 bits (722), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 175/567 (30%), Positives = 282/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 196 MSNVDYWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D D L+ I R F+E+Q K YG T HIY D F+E P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYSIIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
S ++++++ + IY + DS A WL W+F YD W P++++ L +VP KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDNKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++++YG PYIWC L NF GN + G L+ I F + G
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP++Y+ + + A+ + V WI +S+ R G I AW L+ +
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDYS-VTTDQWITNWSMCRGGNQDANIIKAWRALHQKI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T+ AT ++ A P +T K N + Y + LW
Sbjct: 491 Y--TEHATCGQSVLMNARP-------RLTGTKSWNTNPGI-----------HYANNDLWQ 530
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A + ++ +R+D+I++ RQ L ++ Y D G+
Sbjct: 531 IWKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSKYRDQFTACYNRKDTTGM 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ S R L+ D+D LL+C +G WL+ A+ ++ YE NAR +T+W
Sbjct: 582 REWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARNCGATVSEKDYYEENARCILTVW--- 638
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++ + L DY N+ W GL R +Y R + +I ++ F + ++ + +
Sbjct: 639 -GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIAAVSEDKPFDEDKFHQDITQFEYN 697
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W ++ +P+ S D + + L KY
Sbjct: 698 WTLQKDSFPIVSEEDPIQIADSLILKY 724
>gi|403416059|emb|CCM02759.1| predicted protein [Fibroporia radiculosa]
Length = 705
Score = 282 bits (722), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 173/578 (29%), Positives = 300/578 (51%), Gaps = 43/578 (7%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G W G LP W++ Q LQ++I+ R+ ELGM PVLPAF+G VP A+ ++P+A I
Sbjct: 158 NIQGSWSGALPTQWINDQWALQQQIVQRMVELGMTPVLPAFTGFVPRAMSTLYPNASIVN 217
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTPP 120
W S + T L+ DPLF + ++FI +Q YG SH+Y D ++EN P
Sbjct: 218 GSQWEGFPSTLTY--TTFLEPFDPLFTTMQKSFISKQQAAYGANVSHVYTLDQYNENDPY 275
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLG-KLV 178
Y++++ A ++ +Q+ D +AVW+MQGWL F+ + FW ++ A L +VP ++
Sbjct: 276 SGDVGYLANISAGTFASLQAADPEAVWMMQGWLFFASEAFWTTERIAAFLGAVPSNDSMI 335
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
+LDL++E P W + +YG +IWC LH+F GN+ G L + GP++A S ++M
Sbjct: 336 ILDLYSEAAPQWQRTDSYYGKQWIWCELHDFGGNMGFEGNLPELVTGPIQAL-SNASSMR 394
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNVLY 296
G+G++ EG E N +VYD++ + A+ +D+ +++ + RRY + +P A Q+AW +L
Sbjct: 395 GMGLTPEGQEGNEIVYDILLDQAWSSTSIDIASYVEAWVARRYTVQDLPSAAQEAWTILS 454
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
TVY+ +D P+ +I S+ E ++ + ++ + +
Sbjct: 455 TTVYSNSD-------------PNTQATIKSIFE---------LAPDLSGLTDRTGHHCTE 492
Query: 357 LWYSTS-EVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+ Y T+ ++ AL+ + + E L + + YD++D+TRQ LA +++ ++ +
Sbjct: 493 IPYDTNITIVPALQNLVQAATENPLLLSVPEFMYDVVDVTRQLLANRFIDVYNELVSTFY 552
Query: 413 LN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNA 469
A V + L ++ D+D LL +D FLL W+ A L+ N Y E+NA
Sbjct: 553 STGVTAASVKNAGQPLLTILSDVDTLLWTNDNFLLSNWILGAINLSDNNGTYADYLEYNA 612
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
R QIT+W + + + DY +K W+G + YY R ++ Y+ + ++G +
Sbjct: 613 RNQITLWGPDGE-----INDYASKQWAGFVGTYYYDRWNMFITYLEDITQNGTAYNDTAI 667
Query: 530 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
+ + +W +GD + L K+L
Sbjct: 668 QTVMLNFGKEWDTQTYSLSATVSGDTMSIVDSLIQKWL 705
>gi|393785791|ref|ZP_10373937.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
CL02T12C05]
gi|392661410|gb|EIY54996.1| hypothetical protein HMPREF1068_00217 [Bacteroides nordii
CL02T12C05]
Length = 727
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 176/567 (31%), Positives = 276/567 (48%), Gaps = 46/567 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ GW GPLP WLD Q+ LQKKIL R EL M PVLPAF+G+VPAAL+ ++P A I
Sbjct: 198 MANIDGWNGPLPMEWLDNQVELQKKILARERELNMKPVLPAFAGHVPAALKRIYPEANIQ 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R Y L+ +PLF I + F+++Q + +G T HIY D F+E PP
Sbjct: 258 HLGKWAGFADTYR---CYFLNPEEPLFATIQKHFLQEQTRLFG-TDHIYGVDPFNEVDPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PEY+S + + +Y + + D A W+ W+F +D W P++KALL VP K+ +
Sbjct: 314 SWEPEYLSQVSSDMYRTLTAADPKAEWMQMTWMFYHDRKDWTAPRIKALLTGVPQDKMFL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+G PYIWC L NF GN + G + A + + + G
Sbjct: 374 LDYHCENVELWKNTEHFHGQPYIWCYLGNFGGNTTLTGNVKESGDRLDNALINGGSNLRG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ +D +AW+ + R G +++AW++L++ +
Sbjct: 434 IGSTLEGLDVMQFPYEYIFEKAWDL-NLDNEAWLQNLADRHAGTVSQPVREAWDILFNQI 492
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G NY +PV + ++ Y
Sbjct: 493 Y--------------VQVPK--------TLGVLPNY-RPVMNKPNRRTVID--------Y 521
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S + +++A E + + + + R D+I + RQ L Y + + Y + D G+
Sbjct: 522 SNATLLQAWEKLLQATD--CNRDALRLDIITVGRQLLGNYFLIVKDDFDRMYTVKDLPGL 579
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ E++ D+D L A H L WL A+ L E + YE NAR IT W
Sbjct: 580 KARAAEMKEILNDLDRLNAFHSRCALDKWLADARALGTTPEVKDYYEKNARNLITTW--- 636
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
L DY ++ W+GL++DYY R +Y +I ++E F K +
Sbjct: 637 ----GGSLNDYASRTWAGLIKDYYSKRWDMYMDAVISAVEGNREFDQKKLDESIKNFEDA 692
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W + + V G+ + +++L KY
Sbjct: 693 WVDSTDPILVAPQGELMQYARFLLQKY 719
>gi|404487206|ref|ZP_11022393.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
YIT 11860]
gi|404335702|gb|EJZ62171.1| hypothetical protein HMPREF9448_02854 [Barnesiella intestinihominis
YIT 11860]
Length = 731
Score = 281 bits (719), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 184/578 (31%), Positives = 284/578 (49%), Gaps = 63/578 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ Q ++D+Q LQ+K+L R+ EL M PV F G VP +L+ FP A I
Sbjct: 196 MGNLEGFGGPVSQKFIDRQTDLQQKMLRRMRELDMAPVFQGFYGMVPNSLKEKFPEANIK 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W + + LD DPLF +I + E+Q K +G+ + + D F E +
Sbjct: 256 EQGEWQTYQRPA------FLDPNDPLFDKIADIYYEEQEKLFGKAVY-FAGDPFHEGGQS 308
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D + + I M+ +AVW++QGW R P M+ LL + G+ +
Sbjct: 309 EGID----VKAAAKKILKAMRRKTPEAVWIIQGWQ-------RNP-MRDLLEGLEHGEAI 356
Query: 179 VLDLFAEVKPIWSTSKQ--FYGVP------YIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+LDL A +P W K FY +IWC L NF G ++G + S A G V A+
Sbjct: 357 ILDLMACERPQWGGIKNSLFYKAEGHMHHDWIWCALPNFGGKTGLHGKMSSYASGVVFAK 416
Query: 231 TSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
+ G+G + EGI PVVYD++ +MA++ + +D+K W+NQY+ RYG++ P
Sbjct: 417 NHPLGKNLCGIGTAPEGIGTIPVVYDMVYDMAWREDSIDIKDWVNQYTQYRYGKADPNCN 476
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
AW +L T+Y C + I A P +
Sbjct: 477 RAWEILSKTIYECHNEIGGPVESYICARP------------------------SDTIKHA 512
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
SS+ ++Y +E++ A E +E + S TY+YDL+DLTRQ L YA L +
Sbjct: 513 SSWGTAEIFYDPAEIVTAWECMYNVRHEFAQSETYQYDLVDLTRQVLGDYAKYLHKQAVN 572
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 469
A+ ND G S +FL L+ D D LL+ F +G W+ A+ A ++++++ NA
Sbjct: 573 AFYRNDLKGFQTYSSKFLVLIRDEDKLLSTRKEFNVGTWINQARNAACTPQEQERFVANA 632
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
+ QIT W ++ S L DY K WSGL+RD Y PR + Y + +L G+ + D
Sbjct: 633 KRQITTWTNHD----SKLHDYALKEWSGLMRDMYLPRWKAWVDYKL-ALLRGETAQEPD- 686
Query: 530 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
+ ++ +W + Y S G+A+ + +Y KY
Sbjct: 687 ---YFQIEKNWVDSDTRYDSTSTGNAISAVEEIYKKYF 721
>gi|238506383|ref|XP_002384393.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
NRRL3357]
gi|220689106|gb|EED45457.1| alpha-N-acetylglucosaminidase, putative [Aspergillus flavus
NRRL3357]
Length = 669
Score = 281 bits (719), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 170/535 (31%), Positives = 281/535 (52%), Gaps = 53/535 (9%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
+ N+ G WGG + +W++ Q LQKKI+ RI ELGM PVLPAF G VP A++ V P A
Sbjct: 109 LGNIQGSWGGHGVSIAWIEAQFELQKKIVSRIVELGMRPVLPAFPGFVPPAIKRVRPHAT 168
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + ++ L D F ++ ++ I +Q++ +G +H+Y D F+E
Sbjct: 169 VVNGSQWSGFQK--KFTEVSFLSPLDRTFADLQKSVISRQMRAFGNITHVYALDQFNEIN 226
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP--FWRPPQMKALLNSVPLGK 176
P Y+ +L + +++ + AVW+MQGWLF YD FW ++ A L+ V
Sbjct: 227 PASGELGYLRNLSLHTWQSLKAVNPAAVWMMQGWLF-YDKKDFWDSNRISAYLSGVERND 285
Query: 177 -LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDL++E KP W ++ ++G P+IWC LH+F GN+ MYG + +I P+EA +++
Sbjct: 286 DMLILDLYSESKPQWQRTESYFGKPWIWCQLHDFGGNMGMYGQIMNITSDPIEA-LNKSD 344
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVP-AIQDAW 292
++VG G++MEG E N +VYDL+ + A+ +D +A+ + RY SVP + AW
Sbjct: 345 SLVGFGLTMEGQEGNEIVYDLLLDQAWSATPIDTRAYFQSWVRSRYSGNLSVPNELYTAW 404
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS--------KEAV 344
++L TVYN T+ T V + ++ P I + G+ +Y P S E +
Sbjct: 405 DLLRKTVYNNTNLTT---YSVTKSIFEISPDIAGLV-GRVGHYPTPTSINYDPMVLNEVL 460
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
++ P LW++ + Y YD++D+TRQ + ++
Sbjct: 461 SLFMNATRKEPSLWHNPA---------------------YEYDMVDITRQLMGNAFVNVY 499
Query: 405 LNIIEAYQL---NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+I +++ N V S R L L+ +D +L+C++ F L W+ SA+ E
Sbjct: 500 SVLITSWKSETENRTTKVTSHSERLLNLLSAIDKVLSCNENFSLATWISSARDWGNTTET 559
Query: 462 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 516
+ +E+NAR QIT+W + + DY +K W+GL+ YY PR +I+ Y+ E
Sbjct: 560 KDFFEYNARNQITLWGPTGE-----ISDYASKAWAGLISSYYKPRWSIFVDYLGE 609
>gi|379334158|gb|AFD03088.1| putative alpha-N-acetylglucosaminidase [uncultured bacterium 8]
Length = 726
Score = 281 bits (719), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 183/561 (32%), Positives = 285/561 (50%), Gaps = 40/561 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+++L GW GPLPQSW+D+ L ++IL R LGM PVL FSG+VP L A+ T
Sbjct: 181 LASLDGWSGPLPQSWIDRHADLGRRILARERALGMRPVLQGFSGHVPQELIAER-GARST 239
Query: 61 QLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
L W F V +LD DPLF E G + +Q + +G T H+Y D F E TP
Sbjct: 240 TLPWWDFEVG---------MLDPRDPLFEEFGTTLLTEQTRLFG-TDHLYAADPFIETTP 289
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLV 178
PV P ++ + A++ M + D A W++Q W FSY +W P + A L+++P ++
Sbjct: 290 PVSDPADLAQVARAVHGVMTAVDDRATWVLQAWPFSYRSRYWTPERTGAFLDAIPDDGML 349
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTM 237
+LDL+AE +P+W + + P++WCMLH+ G +YG LD IA G A+ + ++
Sbjct: 350 ILDLWAEHRPVWQRTDGYRKKPWVWCMLHSLGGRPGLYGKLDEIATGAARAQADARGGSL 409
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G SME +PV+Y+L++++A+Q DV+AW+ ++ RYGR+ P + AW++L+
Sbjct: 410 SGIGASMEAFGGDPVLYELLADVAWQGSVDDVRAWLETWTRARYGRATPGLLRAWDLLHD 469
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VY ++G VIV P + EG ++ + V ++ S D P
Sbjct: 470 SVY-ASEGPGPPG-SVIVGRPTL--------EGDLRH------ELPVHLADPPSPDVP-- 511
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
+ L + E SA R DL D+T Q L A E +A DA
Sbjct: 512 ---PALAEAWALLADEATQEDSAGPLGR-DLCDVTAQVLTHVACERQWRAADAALARDAD 567
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
G + +R L+ +ED+D LLA L WL A+ A + YE +AR +T+W
Sbjct: 568 GFQRAARALLDTIEDLDTLLATRPEHRLDGWLADARGWATTPAEADLYETDARRLLTLWG 627
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
S L DY ++W+GL+ +Y PR +++++ +LE+G +R +++ +
Sbjct: 628 HTR----SKLHDYSGRHWAGLVGTFYLPRWRSWYEHIARALETGSPYRAEEFEASLLAQE 683
Query: 538 NDWQNGRNVYPVESNGDALIT 558
W RN G A T
Sbjct: 684 ERWVADRNGPTTPEAGTAGAT 704
>gi|282877910|ref|ZP_06286719.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
gi|281299911|gb|EFA92271.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
Length = 723
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 177/543 (32%), Positives = 266/543 (48%), Gaps = 45/543 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP SWL+QQ LQ++IL+R L M PVLPAFSG+VPA L+ ++P A I
Sbjct: 199 MANIDKWNGPLPMSWLEQQKELQQRILLRERSLNMKPVLPAFSGHVPAKLKELYPQANIK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + R + L+ DPLF +I + ++E+Q +G T HIY D F+E PP
Sbjct: 259 YLGRWAGFSDNYR---CHFLNPEDPLFAKIQKMYLEEQKALFG-TDHIYGIDPFNEVDPP 314
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
PEY+ + IY + S D A W+ W+F ++ W P ++KALL V GK+ +
Sbjct: 315 SWKPEYLKEISHNIYRTVTSVDPGAEWMQMSWMFYHNKKQWTPKRIKALLTGVSRGKMSL 374
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W T+ FYG PYIWC L NF GN + G + EA +N ++G
Sbjct: 375 LDYHCENVELWKTTNNFYGQPYIWCYLGNFGGNTTITGNVKESGQRLNEALNKKNKNLIG 434
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + A+ D K WI+ + R G S P ++ AW +L++ +
Sbjct: 435 IGSTLEGLDVIQFPYEYILTQAWTATPAD-KEWIDNLADRHVGFSSPKLRQAWQILFNDI 493
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T R + + P + P + GKYQ ++ E +W
Sbjct: 494 Y------TQIPRSLGI-LPALRPIL-----GKYQERRTEITYPTKRLEE--------VWK 533
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
S+V N Y+ DLI + RQ L +L L + Y D G+
Sbjct: 534 LMSDVSEC------------DRNEYQLDLIAVGRQVLGNKFLKLKLELDSCYVNKDLVGL 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ E++ D+D L A + +G W++ A+ N+ ++ YE NAR IT W
Sbjct: 582 QRTGNTMKEVLVDLDYLTAGNSRCSIGKWIDDARAYGNNDLEKAYYEKNARNLITTW--- 638
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
L DY N+ WSGL+R YY R ++Y + S+ SG F + + + +
Sbjct: 639 ----GGSLNDYANRTWSGLIRTYYVRRWSMYIDELTASVMSGKPFDQQQLDKAIGEFEQN 694
Query: 540 WQN 542
W N
Sbjct: 695 WVN 697
>gi|423241433|ref|ZP_17222546.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
CL03T12C01]
gi|392641326|gb|EIY35103.1| hypothetical protein HMPREF1065_03169 [Bacteroides dorei
CL03T12C01]
Length = 754
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 187/597 (31%), Positives = 290/597 (48%), Gaps = 79/597 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F +I F E+ K YG TS Y+
Sbjct: 252 EKKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTQIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E ++ G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHEAKSLPAGLDF-GKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414
Query: 219 LDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 277
+D + + + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474
Query: 278 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 337
RYG +I+ AW +L + +YNC G + SI G+
Sbjct: 475 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 516
Query: 338 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 397
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572
Query: 398 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632
Query: 458 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 517
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 633 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692
Query: 518 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
L+ + D + +W L W +N+Y + GD + ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNIYAASAEGDCIEVAK 746
>gi|383124408|ref|ZP_09945072.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
gi|251839096|gb|EES67180.1| hypothetical protein BSIG_3565 [Bacteroides sp. 1_1_6]
Length = 732
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 175/567 (30%), Positives = 281/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 196 MSNVDYWQSPLPQSWLKDQEELQKRILEREREFDMTPVLPAFAGHVPAELKTIYPNAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W R ++ +D D L+ I R F+E+Q K YG T HIY D F+E P
Sbjct: 256 QMSQWGGFDEKYR---SHFIDPMDSLYQVIQRRFLEEQTKVYG-TDHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
S ++++++ + IY + DS A WL W+F YD W P++++ L +VP KL++
Sbjct: 312 DWSEDFLANVSSKIYESIHQVDSAAQWLQMTWMFFYDKKKWTQPRIRSFLKAVPDDKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++++YG PYIWC L NF GN + G L+ I F + G
Sbjct: 372 LDYYCDHTEIWRNTEKYYGNPYIWCYLGNFGGNTMIAGNLNDIDFKIKRLFKEGGDNVYG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP++Y+ + + A+ + V WI +S+ R G I AW L+ +
Sbjct: 432 LGATLEGFDVNPLMYEFVFDQAWDY-PVTTDQWITNWSMCRGGDQDANIIKAWRALHQNI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T+ A ++ A P +T K N + Y + LW
Sbjct: 491 Y--TEYAICGQSVLMNARP-------RLTGTKSWNTNPGIH-----------YANNDLWQ 530
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A + ++ +R+D+I++ RQ L +E Y D G+
Sbjct: 531 IWKELLKARNI---------NNSDFRFDVINIGRQVLGNLFSEYRDQFTACYNRKDTTGM 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ S R L+ D+D LL+C +G WL+ A+ ++ YE NAR +T+W
Sbjct: 582 REWSTRMDNLLLDVDRLLSCDATLSIGKWLQDARDCGTTVSEKDYYEENARCILTVW--- 638
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++ + L DY N+ W GL R +Y R + +I ++ F + ++ + +
Sbjct: 639 -GQQDTQLNDYANRGWGGLTRSFYRERWKRFTDGVIGAVSKNKPFDEDKFHQDITQFEYN 697
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W ++ +P+ S D + + L KY
Sbjct: 698 WTLQKDSFPIVSEEDPIQIADSLILKY 724
>gi|393788286|ref|ZP_10376416.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
CL02T12C05]
gi|392655959|gb|EIY49600.1| hypothetical protein HMPREF1068_02696 [Bacteroides nordii
CL02T12C05]
Length = 757
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 183/586 (31%), Positives = 285/586 (48%), Gaps = 70/586 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP++ +D+ L KKI+ R ELGM P+ FSG VP L+ +P+A I
Sbjct: 196 MQNLQSYGGPLPKTVIDKHAALGKKIISRQLELGMQPIQQGFSGYVPRELKEKYPTANIN 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q +W K + LD TD LF +GRAF+E+Q + +G +Y D F E+ PP
Sbjct: 256 QQRSWCGFKGAAQ------LDPTDSLFTRMGRAFLEEQARLFG-AHGVYAADPFHESAPP 308
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+D+PEY+ ++G I+ + D + W MQ W D ++ +VP L++L
Sbjct: 309 IDTPEYLKAVGERIHHLFRDFDPHSTWAMQSWSLRED----------IVKAVPKDALLIL 358
Query: 181 DLFAEVKPIWSTSKQ-FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
DL + STSK F+G + LHNF G I M+G L +A N + G
Sbjct: 359 DLNGK-----STSKALFWGYSTVVGNLHNFGGRINMHGDLKLLASNQYSKAKRLNPAVCG 413
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G+ ME +EQNPV Y+L EM + ++++AW+ QY+ RRYG PA Q+AW +L +
Sbjct: 414 SGLFMEAVEQNPVYYELAFEMPCHADSINLQAWLKQYATRRYGAFSPAAQEAWLLLLNGP 473
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T+K+ ++ A P +D K A L+ + Y
Sbjct: 474 YRRGTNGTEKS-SIVAARPALDV--------------KKSGPNAALE----------IPY 508
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ VIRA L + ++LS S YR+D++D+ RQ + + EA++ D
Sbjct: 509 DPTLVIRAQSLLLKDIDKLSVSRPYRFDIVDVQRQLMTNLGQLIHRQAAEAFRKKDQCAF 568
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
S RFLE++ DMD LL + WL A+ +E++ E +A + +T+W +
Sbjct: 569 TLHSGRFLEMLADMDKLLRTRSEYSFDRWLTEARSWGDTDEEKNLMERDATSLVTIWGAD 628
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD----WRREWIK 535
+ DY + WSGL+ YY PR ++ + + L+ G + + RE +
Sbjct: 629 GDPR---IFDYSWREWSGLISGYYLPRWQKFYAMLQQHLDVGTSYEEAGLPLIYGREAFR 685
Query: 536 LTNDWQN-------------GRNVYPVESNGDALITSQWLYNKYLQ 568
ND+ N G+ P+ + GD +I + L++KYL+
Sbjct: 686 -ANDFYNGLAEWELAYVDTYGKARTPI-TEGDEIIMVKQLFDKYLK 729
>gi|392566857|gb|EIW60032.1| alpha-N-acetylglucosaminidase [Trametes versicolor FP-101664 SS1]
Length = 747
Score = 279 bits (714), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 180/554 (32%), Positives = 293/554 (52%), Gaps = 40/554 (7%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP +W+D Q LQK++L R+ ELGM PV+P+F+G VP AL + P+A I
Sbjct: 193 FGNIQGSWGGELPTAWVDDQFALQKRLLPRMVELGMTPVMPSFTGFVPRALAALHPNASI 252
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGR-TSHIYNCDTFDENT 118
W + L+ DPLF + ++FI +Q YG SH+Y D ++EN
Sbjct: 253 VTGSQWSGFPTS--LTNDSFLEPFDPLFATLQQSFIAKQQAAYGADISHVYTLDQYNEND 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLG-K 176
P +Y+ ++ A ++ +++ D AVWLMQGWLF D FW ++ A L VP
Sbjct: 311 PFSGDLDYLRNVSAGTFASLRAADPAAVWLMQGWLFFSDAVFWTDDRVAAYLGGVPGNDS 370
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
++VLDL++E +P W+ + + G ++WC LH++ GNI M G LD + P+ A +S ++
Sbjct: 371 MIVLDLYSEAQPQWNRTASYSGKQWVWCELHDYGGNIGMEGNLDVLTHAPLTALSSPGSS 430
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVP-AIQDAWNV 294
M GVG++MEG E N +VY ++ + A+ ++ ++++ + RRY + +P A QDAW +
Sbjct: 431 MKGVGLTMEGQEGNEIVYGVLLDQAWSATSLNTSSYVSSWVSRRYPVKPLPKAAQDAWRI 490
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L TVYN D T + I ++ P++ +T + P S YD
Sbjct: 491 LSTTVYNNQDPNT---QATIKGIYELAPALTGMTNRIGHH---PTSIP---------YD- 534
Query: 355 PHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 411
+ + ++ AL+L + A LSA + YD++D+ RQ L+ L+ +I+ Y
Sbjct: 535 -----TDATMLSALKLLLEARAQHPTLSAVPEFVYDVVDVARQLLSNRFIGLYDTLIQTY 589
Query: 412 QLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWN 468
A V + L L+ D+D LL+ ++ FLL W+ A++ A Y E+N
Sbjct: 590 NSTSSTAQSVSAAGQPLLALLTDLDALLSTNEHFLLSSWIADARKWADGSASYGAYLEYN 649
Query: 469 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 528
AR Q+T+W + + + DY +K W+GL+ YY PR A + Y+ E+ +G +
Sbjct: 650 ARNQVTLWGPDGE-----INDYASKAWAGLVGTYYKPRWAAFVDYLAETKGTGQAYNATA 704
Query: 529 WRREWIKLTNDWQN 542
+ + + +W N
Sbjct: 705 VKSTMLAIGQEWGN 718
>gi|392584963|gb|EIW74305.1| glycoside hydrolase family 89 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 772
Score = 279 bits (713), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 168/534 (31%), Positives = 277/534 (51%), Gaps = 60/534 (11%)
Query: 8 GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
GG LPQ W+D QL LQK+I+ RI ELGM PVLPAF G VP A+ +FP+A I +
Sbjct: 221 GGKLPQEWMDAQLALQKQIVPRIVELGMTPVLPAFPGFVPPAMHTLFPNASIVNGSEYPG 280
Query: 68 VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
+ + ++ L DPL+ ++ +F+ +Q + G +H++ D ++EN+P Y+
Sbjct: 281 IPA--QYSNDSFLAPFDPLYAQLQSSFLAKQTEALGNVTHVWTIDQYNENSPYSGDLTYL 338
Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEV 186
+++ + ++ +++ D DA+WLMQGWLF D PFW ++ A L+ +P +++LDLF++V
Sbjct: 339 ANIANSTFASLRAHDPDAIWLMQGWLFFADEPFWTSDRVDAYLDQIPNDGMIILDLFSDV 398
Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEG 246
P W + G ++WC +H+F GN+ + G + GPV+A S N++M GVG++MEG
Sbjct: 399 YPQWQRLDSYRGKSWVWCEVHDFGGNMGLEGNFSVVTNGPVDALNSPNSSMKGVGLAMEG 458
Query: 247 IEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-------------GRSVPAIQ-DAW 292
+E N ++YD++ + A+ +D A+ ++ RR+ S+PA +AW
Sbjct: 459 LEGNEIIYDVLLDQAWSAAPLDRDAYAKAWATRRFHLPTANSSTTTATNTSIPASAIEAW 518
Query: 293 NVLYHTVYNCTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
L TVY+ T+ GAT S+I + Y P S +
Sbjct: 519 QTLASTVYSSTNPNVWGATK--------------SLIELAPSLGGMYSAPSSTIIFYDTN 564
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFL 405
TS ++ AL +A+G L A + +R D ID+ RQ LA + +
Sbjct: 565 TS-------------LVPALRGLVAAGTSAPALWALDEFRTDSIDVARQLLANRFADAYT 611
Query: 406 NIIEAYQLN--DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA---QNEE 460
AY + + + + R +++++D+D LL H+ +LL + SA+ A +E
Sbjct: 612 ATTGAYNASGPGSAALNATAARMMQIIDDLDRLLMTHEPYLLSSRIASARAWAGDGGDEA 671
Query: 461 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 514
E+ AR+Q+T+W S+L DY +K W GL+ YY R + +YM
Sbjct: 672 YADYLEYEARSQVTLW----GPVPSVLNDYASKVWGGLVGTYYRQRWTAFVEYM 721
>gi|265753065|ref|ZP_06088634.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263236251|gb|EEZ21746.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 750
Score = 279 bits (713), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 188/597 (31%), Positives = 288/597 (48%), Gaps = 79/597 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 188 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 247
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 248 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 306
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 307 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 358
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 359 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 410
Query: 219 LDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 277
+D + + + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY
Sbjct: 411 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 470
Query: 278 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 337
RYG +IQ AW +L + +YNC G + SI G+
Sbjct: 471 RARYGTDDESIQQAWQILTNGIYNCPAGNNQQG---------PHESIFC---------GR 512
Query: 338 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 397
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 513 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 568
Query: 398 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 569 DRARIVYNYAVADFKSFDKKNYNTHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 628
Query: 458 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 517
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 629 TPEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 688
Query: 518 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
L+ + D + +W L W +N Y + GD + ++
Sbjct: 689 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 742
>gi|212695333|ref|ZP_03303461.1| hypothetical protein BACDOR_04880 [Bacteroides dorei DSM 17855]
gi|212662112|gb|EEB22686.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
Length = 754
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 188/597 (31%), Positives = 287/597 (48%), Gaps = 79/597 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 252 EEKTAGDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414
Query: 219 LDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 277
+D + + + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474
Query: 278 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 337
RYG +IQ AW +L + +YNC G + SI G+
Sbjct: 475 RARYGTDDESIQQAWQILTNGIYNCPAGNNQQG---------PHESIFC---------GR 516
Query: 338 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 397
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572
Query: 398 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYNTHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632
Query: 458 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 517
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 633 TPEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692
Query: 518 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
L+ + D + +W L W +N Y + GD + ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 746
>gi|294807833|ref|ZP_06766618.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|294444952|gb|EFG13634.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
Length = 703
Score = 278 bits (712), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 175/567 (30%), Positives = 283/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 166 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 225
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 226 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 281
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 282 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 341
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 342 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 401
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+Q+ + V WI ++ R G I AW LY +
Sbjct: 402 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 460
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ E Y + LW
Sbjct: 461 Y--TSAALCGQAVLMNARPQLE------------------GVEGWNTLPGYDYKNIDLWE 500
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 501 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 551
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+R +L+ D+D LL C F +G W++ A+ A NE+++K YE NAR +T+W
Sbjct: 552 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW--- 608
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +
Sbjct: 609 -GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYE 667
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W +P+ S + + ++ L KY
Sbjct: 668 WTLKNEDFPIISEENPISLAKELILKY 694
>gi|333031143|ref|ZP_08459204.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
gi|332741740|gb|EGJ72222.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
Length = 723
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 178/567 (31%), Positives = 268/567 (47%), Gaps = 46/567 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP+ WLD Q LQK+IL R EL M PVLPAF+G+VP+ L+++FP A I
Sbjct: 197 MANIDSWNGPLPKEWLDHQSDLQKQILKRERELNMKPVLPAFAGHVPSELKHLFPEADIQ 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + DPLF +I R F+E+Q + +G T HIY D F+E PP
Sbjct: 257 HLGKWAGFADKYR--CNFL-NPNDPLFAKIQRLFLEEQTRLFG-TDHIYGVDPFNEVDPP 312
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
PEY+ + A +Y + D A WL WLF + W P+++ALL VP +L +
Sbjct: 313 SWEPEYLKKVAADMYRTLTDVDPKAKWLQMTWLFYHGKKKWTAPRIEALLTGVPQDELYL 372
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W T+ F+G PYIWC L NF GN + G + + G
Sbjct: 373 LDYHCENVELWKTTDYFHGQPYIWCYLGNFGGNTTITGNVKESGQRLENTLINGGNNFKG 432
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + + A+ +D +W+ + R G+ A ++AW +L++ V
Sbjct: 433 IGSTLEGLDVMQFPYEYIFDKAWTFN-MDDNSWVENLADRHLGKKSEAYREAWKILFNDV 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + + P+ P + KP +K V ++ + D +W
Sbjct: 492 YVQVPKS-------LGVLPNFRPEM-----------SKP-NKRTV--NDYKNKDLVKVWA 530
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
EV + Y DLI + RQ L Y + + YQ D G+
Sbjct: 531 KLLEVKEC------------TRDAYIIDLITVGRQVLGNYFLVVKNEFDQMYQFKDLPGL 578
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ E++ D++ L A H+ L W+ A+ L E + YE NAR IT W
Sbjct: 579 ESRGAKLREILNDLENLTAFHNHCTLEKWISDARALGNTIELKDYYEKNARNLITTW--- 635
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
L DY ++ WSGL++DYY R +Y + E+L+ F + + L
Sbjct: 636 ----GGSLNDYASRTWSGLIKDYYAKRWNLYIDSVTEALKENKKFNQSELNEKLNILEEA 691
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W N GD L S++L++KY
Sbjct: 692 WVNKVETVTSYEQGDILELSKYLFDKY 718
>gi|345511813|ref|ZP_08791352.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|229443748|gb|EEO49539.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
Length = 720
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 176/567 (31%), Positives = 284/567 (50%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 183 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 242
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 243 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 298
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 299 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 358
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 359 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 418
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+Q+ + V WI ++ R G I AW LY +
Sbjct: 419 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 477
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ T Y Y + LW
Sbjct: 478 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 517
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 518 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 568
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+R +L+ D+D LL C F +G W++ A+ A NE+++K YE NAR +T+W
Sbjct: 569 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW--- 625
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +
Sbjct: 626 -GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYE 684
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W +P+ S + + ++ L KY
Sbjct: 685 WTLKNEDFPIISEENPISLAKELILKY 711
>gi|237711645|ref|ZP_04542126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
gi|229454340|gb|EEO60061.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 9_1_42FAA]
Length = 732
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 187/597 (31%), Positives = 288/597 (48%), Gaps = 79/597 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 170 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 229
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 230 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 288
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 289 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 340
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 341 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 392
Query: 219 LDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 277
+D + + + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY
Sbjct: 393 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 452
Query: 278 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 337
RYG +I+ AW +L + +YNC G + SI G+
Sbjct: 453 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 494
Query: 338 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 397
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 495 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 550
Query: 398 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 551 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 610
Query: 458 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 517
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 611 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 670
Query: 518 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
L+ + D + +W L W +N Y + GD + ++
Sbjct: 671 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 724
>gi|345513909|ref|ZP_08793424.1| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
gi|345456132|gb|EEO45798.2| alpha-N-acetylglucosaminidase [Bacteroides dorei 5_1_36/D4]
Length = 754
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 187/597 (31%), Positives = 288/597 (48%), Gaps = 79/597 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 252 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414
Query: 219 LDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 277
+D + + + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474
Query: 278 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 337
RYG +I+ AW +L + +YNC G + SI G+
Sbjct: 475 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 516
Query: 338 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 397
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572
Query: 398 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632
Query: 458 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 517
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 633 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692
Query: 518 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
L+ + D + +W L W +N Y + GD + ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 746
>gi|294647264|ref|ZP_06724861.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|292637401|gb|EFF55822.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
Length = 733
Score = 278 bits (711), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 175/567 (30%), Positives = 283/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 196 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 256 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 312 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 372 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+Q+ + V WI ++ R G I AW LY +
Sbjct: 432 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 490
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ E Y + LW
Sbjct: 491 Y--TSAALCGQAVLMNARPQLE------------------GVEGWNTLPGYDYKNIDLWE 530
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 531 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+R +L+ D+D LL C F +G W++ A+ A NE+++K YE NAR +T+W
Sbjct: 582 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW--- 638
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +
Sbjct: 639 -GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYE 697
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W +P+ S + + ++ L KY
Sbjct: 698 WTLKNEDFPIISEENPISLAKELILKY 724
>gi|423230938|ref|ZP_17217342.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
CL02T00C15]
gi|423244649|ref|ZP_17225724.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
CL02T12C06]
gi|392630058|gb|EIY24060.1| hypothetical protein HMPREF1063_03162 [Bacteroides dorei
CL02T00C15]
gi|392641498|gb|EIY35274.1| hypothetical protein HMPREF1064_01930 [Bacteroides dorei
CL02T12C06]
Length = 754
Score = 278 bits (711), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 187/597 (31%), Positives = 288/597 (48%), Gaps = 79/597 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ--NVFPSAK 58
M+NL GWGGP P SW +QQ LQKKIL R+ E GM+PVLP +SG +P+ L S K
Sbjct: 192 MNNLEGWGGPNPDSWYEQQEALQKKILQRMKEWGMHPVLPGYSGMIPSKLDLGKRIDSGK 251
Query: 59 ITQLGNWFSVKSDPRWCCTY-------LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNC 111
+ + S +S + +L DP F I F E+ K YG TS Y+
Sbjct: 252 EEKTASDTSSESAQSTLNKWNGFDRPGILLPDDPKFTRIANLFYEETEKLYG-TSDYYSI 310
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
D F E + + G AI M+ + AVW++QGW + +P RP MKAL
Sbjct: 311 DPFHE-AKNLPAELDFGKAGRAIMDAMKKANPKAVWVVQGW--TENP--RPEMMKAL--- 362
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGI 218
G L++LDLF+E +P+W G+P IW C+L NF GN+ ++G
Sbjct: 363 -NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGNVGLHGR 414
Query: 219 LDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYS 277
+D + + + + G+G++MEGIE NPV+++LM E+ ++ EK + WI QY
Sbjct: 415 MDQLLHNFYLTKNNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEEWIKQYI 474
Query: 278 VRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGK 337
RYG +I+ AW +L + +YNC G + SI G+
Sbjct: 475 RARYGTDDESIRQAWQILANGIYNCPAGNNQQG---------PHESIFC---------GR 516
Query: 338 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 397
P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+TRQA+A
Sbjct: 517 P----SLNNFQASSWSKMCNYYDPTTTTEAARLMVSVADKYRGNNNFEYDLVDITRQAIA 572
Query: 398 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
A ++ + ++ D +R+FLEL+ D LL F +G W++ A+ L
Sbjct: 573 DRARIVYNYAVADFKSFDKKNYATHTRQFLELLMMQDKLLGTRKEFKVGNWIQQARNLGI 632
Query: 458 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 517
E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y++ + +
Sbjct: 633 TSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYWQVLQDQ 692
Query: 518 LE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQ 560
L+ + D + +W L W +N Y + GD + ++
Sbjct: 693 LDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIEVAK 746
>gi|262407713|ref|ZP_06084261.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|262354521|gb|EEZ03613.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
Length = 735
Score = 278 bits (710), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 175/567 (30%), Positives = 283/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 198 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 258 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 314 NWNEEFLSNVSDKIYKSIQGVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 374 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+Q+ + V WI ++ R G I AW LY +
Sbjct: 434 LGVTLEGLDVNPLMYEFVFERAWQN-SMPVHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 492
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ E Y + LW
Sbjct: 493 Y--TSAALCGQAVLMNARPQLE------------------GVEGWNTLPGYDYKNIDLWE 532
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 533 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKDLEGT 583
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+R +L+ D+D LL C F +G W++ A+ A NE+++K YE NAR +T+W
Sbjct: 584 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDARDFAVNEQEQKYYEENARCILTVW--- 640
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +
Sbjct: 641 -GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYE 699
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W +P+ S + + ++ L KY
Sbjct: 700 WTLKNEDFPIISEENPISLAKELILKY 726
>gi|391338146|ref|XP_003743422.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Metaseiulus
occidentalis]
Length = 665
Score = 278 bits (710), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 161/502 (32%), Positives = 259/502 (51%), Gaps = 44/502 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGPLP SW QQ +LQK IL R+ + GM PV+P F+G VP A + + P+ +
Sbjct: 188 MGNLRGFGGPLPSSWQLQQQLLQKMILRRMRDFGMTPVVPGFNGFVPRAFERLHPAVSWS 247
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + + + L T+ F+ + +I YG + H+Y+ D F+E TP
Sbjct: 248 RASRWNNFPDE--YAMLTFLAPTESFFLNVSSLYITMYRSIYG-SDHLYSVDLFNEETPD 304
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ P ++ + + +Y + D +W+MQGWLF + +W ++KA L PLGK++V
Sbjct: 305 TNDPAALAEMSSNVYESIAKADPKGIWVMQGWLFVHGGDYWNHDRVKAFLGGPPLGKMIV 364
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E P + ++G P+IWCMLHN+ G ++G L+ I P+ R S M+G
Sbjct: 365 LDLFSEQSPQFPRFSNYFGQPFIWCMLHNYGGVSGLFGNLEWINSEPLNVRRSV-PNMIG 423
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G++ EG QN V+Y+ M+E +++ +V W+ Y RYG S P +++AW +L +V
Sbjct: 424 IGIAPEGTGQNEVIYEFMAENSYRDSSENVSLWLQNYVGARYGLSDPHLENAWELLRKSV 483
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y S+T +N+G +L P +WY
Sbjct: 484 Y-------------------------SLTSKSIENHGN-----YILTHRPKLNSTPLIWY 513
Query: 360 STSEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
+ S+VI A I EL + DL+D+ RQAL ++ +L ++ ++ N
Sbjct: 514 NGSDVIGAATELIRGATLHRELCHERLFHQDLVDVVRQALQVRVSDEYLQMMSHFKANSL 573
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ-NEEQEKQYEWNARTQITM 475
+ SRR L + +D +L+ FLLG WL +++ A + + + Q+E+NAR QIT
Sbjct: 574 IDFEEHSRRLLHCIRVLDKVLSTDPNFLLGSWLRDSRESAGLDRDLQDQFEFNARNQITR 633
Query: 476 WFDNTQEEASLLRDYGNKYWSG 497
W N + + DY +K W+G
Sbjct: 634 WGPNGE-----IVDYASKMWNG 650
>gi|336371253|gb|EGN99592.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.3]
gi|336384013|gb|EGO25161.1| glycoside hydrolase family 89 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 761
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 168/539 (31%), Positives = 282/539 (52%), Gaps = 47/539 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP+ W++ Q VLQK+IL R+ ELGM PVLP+F+G VP A+ ++P+A I
Sbjct: 208 NIQGSWGGDLPEQWINDQFVLQKQILARMVELGMTPVLPSFTGFVPRAMHTLYPNASIVN 267
Query: 62 LGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W F+++ L+ DPLF + +F+ + YG SHIY D ++E P
Sbjct: 268 GSQWSTFTIQH----TNDSFLEPFDPLFSTLQTSFMTKYAAAYGNVSHIYTLDQYNEMMP 323
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS-YDPFWRPPQMKALLNSVPLG-KL 177
+ Y+SS+ +A ++ +++ D +AVW+MQGWLF Y FW +++A L VP +
Sbjct: 324 YSGNTSYLSSISSATFASLRATDPEAVWMMQGWLFYIYASFWTDERVEAYLGGVPGNDSM 383
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDLF+E P W ++G +IWC LH+F GN+ G +++ PV+A + TM
Sbjct: 384 IILDLFSEAYPQWQRLNSYFGKQWIWCELHDFGGNMGFEGNFENVTTQPVKALATPGNTM 443
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP----AIQDAWN 293
VG+G++MEG E N ++YD++ + A+ ++ ++++ ++ RRY +VP A +AW
Sbjct: 444 VGMGLTMEGQEGNEIMYDVLFDQAWSPTPINRTSYVSAWTSRRY--NVPNLPTAATEAWE 501
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
+L TVYN D I ++++T + G P
Sbjct: 502 ILASTVYNNQDPLLQATIKSIFELEPAINGLVNLTVLQ----GIPTG------------- 544
Query: 354 HPHLWYST-SEVIRALELFIASGNELSASNT---YRYDLIDLTRQALAKYANELFLNIIE 409
L+Y T + ++ AL+ + + E SA + ++YD++ + RQ LA +L+ ++++
Sbjct: 545 ---LFYDTNTTIVPALQSLLQARQESSALDEVPEFQYDVVYIIRQLLANRFIDLYTSLVD 601
Query: 410 AYQLNDAHGVFQL--SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-E 466
Y + + L++D+D +L FLL W+ +A+ A + Y E
Sbjct: 602 TYNSTTSSSSDVSTAGAPLITLLKDVDSVLLTDTHFLLSNWISAARNWAHDNSTYAAYLE 661
Query: 467 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
+NAR QIT+W + + DY +K W GL+ YY R + Y+ S +G +
Sbjct: 662 YNARNQITLWGPRGE-----VHDYASKQWGGLVGTYYVQRWEEFVSYLSGSKANGTAYN 715
>gi|319640296|ref|ZP_07995021.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
gi|317388071|gb|EFV68925.1| hypothetical protein HMPREF9011_00618 [Bacteroides sp. 3_1_40A]
Length = 752
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 186/603 (30%), Positives = 284/603 (47%), Gaps = 91/603 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
M+NL GWGGP P SW QQ LQKKIL R+ E GM+PVLP +SG +P+
Sbjct: 190 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 249
Query: 49 ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
L N + + L W +L DP F +I F E+ K YG T
Sbjct: 250 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 302
Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
S Y+ D F E + + G AI M+ + AVW++QGW + +P RP M
Sbjct: 303 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 357
Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
KAL G L++LDLF+E +P+W G+P IW C+L NF GN
Sbjct: 358 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 406
Query: 213 IEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
+ ++G +D + + + + G+G++MEGIE NPV+++LM E+ ++ EK +
Sbjct: 407 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 466
Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 331
WI QY RYG +I AW +L + +YNC G + SI
Sbjct: 467 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQG---------PHESIFC----- 512
Query: 332 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 391
G+P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+
Sbjct: 513 ----GRP----SLNNFQASSWSKMCNYYDPTTTAEAARLMVSVAHKYRGNNNFEYDLVDI 564
Query: 392 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 451
TRQA+A A ++ + ++ D +R+FLEL+ D LL F +G W++
Sbjct: 565 TRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWIQQ 624
Query: 452 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 511
A+ L E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y+
Sbjct: 625 ARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYW 684
Query: 512 KYMIESLE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 557
+ + + L+ + D + +W L W +N Y + GD +
Sbjct: 685 QVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIE 741
Query: 558 TSQ 560
++
Sbjct: 742 VAK 744
>gi|345517325|ref|ZP_08796802.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
gi|345457718|gb|EET14396.2| alpha-N-acetylglucosaminidase [Bacteroides sp. 4_3_47FAA]
Length = 754
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 186/603 (30%), Positives = 284/603 (47%), Gaps = 91/603 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
M+NL GWGGP P SW QQ LQKKIL R+ E GM+PVLP +SG +P+
Sbjct: 192 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 251
Query: 49 ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
L N + + L W +L DP F +I F E+ K YG T
Sbjct: 252 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 304
Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
S Y+ D F E + + G AI M+ + AVW++QGW + +P RP M
Sbjct: 305 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 359
Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
KAL G L++LDLF+E +P+W G+P IW C+L NF GN
Sbjct: 360 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 408
Query: 213 IEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
+ ++G +D + + + + G+G++MEGIE NPV+++LM E+ ++ EK +
Sbjct: 409 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 468
Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 331
WI QY RYG +I AW +L + +YNC G + SI
Sbjct: 469 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQG---------PHESIFC----- 514
Query: 332 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 391
G+P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+
Sbjct: 515 ----GRP----SLNNFQASSWSKMCNYYDPTTTAEAARLMVSVAHKYRGNNNFEYDLVDI 566
Query: 392 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 451
TRQA+A A ++ + ++ D +R+FLEL+ D LL F +G W++
Sbjct: 567 TRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWIQQ 626
Query: 452 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 511
A+ L E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y+
Sbjct: 627 ARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYW 686
Query: 512 KYMIESLE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 557
+ + + L+ + D + +W L W +N Y + GD +
Sbjct: 687 QVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIE 743
Query: 558 TSQ 560
++
Sbjct: 744 VAK 746
>gi|294777713|ref|ZP_06743164.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
gi|294448781|gb|EFG17330.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
Length = 752
Score = 275 bits (704), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 186/603 (30%), Positives = 284/603 (47%), Gaps = 91/603 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA------------ 48
M+NL GWGGP P SW QQ LQKKIL R+ E GM+PVLP +SG +P+
Sbjct: 190 MNNLEGWGGPNPDSWYKQQEDLQKKILKRMKEWGMHPVLPGYSGMIPSKLDLGKRIDGGK 249
Query: 49 ---ALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRT 105
L N + + L W +L DP F +I F E+ K YG T
Sbjct: 250 EEKTLSNTSSESAQSTLNKWNGFDRPG------ILLPDDPKFTQIASLFYEETEKLYG-T 302
Query: 106 SHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQM 165
S Y+ D F E + + G AI M+ + AVW++QGW + +P RP M
Sbjct: 303 SDYYSIDPFHE-AKSLPARLDFGKAGKAIMDAMKKANPKAVWVVQGW--TENP--RPEMM 357
Query: 166 KALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGN 212
KAL G L++LDLF+E +P+W G+P IW C+L NF GN
Sbjct: 358 KAL----NPGDLLILDLFSECRPMW-------GIPSIWKRDKGYEEHNWLFCLLENFGGN 406
Query: 213 IEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
+ ++G +D + + + + G+G++MEGIE NPV+++LM E+ ++ EK +
Sbjct: 407 VGLHGRMDQLLHNFYLTKDNPLAAQLKGIGLTMEGIENNPVMFELMCELPWRAEKFTKEE 466
Query: 272 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGK 331
WI QY RYG +I AW +L + +YNC G + SI
Sbjct: 467 WIKQYIRARYGTDDESIWQAWQILANGIYNCPAGNNQQG---------PHESIFC----- 512
Query: 332 YQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDL 391
G+P ++ + SS+ +Y + A L ++ ++ +N + YDL+D+
Sbjct: 513 ----GRP----SLNNFQASSWSKMCNYYDPTTTAEAARLMVSVAHKYRGNNNFEYDLVDI 564
Query: 392 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 451
TRQA+A A ++ + ++ D +R+FLEL+ D LL F +G W++
Sbjct: 565 TRQAIADRARIVYNYAVADFKSFDKKSYATHTRQFLELLIMQDKLLGTRKEFKVGNWIQQ 624
Query: 452 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 511
A+ L E++ YEWNAR QIT W + + LRDY +K W+GLLRD+Y R Y+
Sbjct: 625 ARNLGSTSEEKDLYEWNARVQITTWGNRYCADIGKLRDYAHKEWNGLLRDFYYKRWEKYW 684
Query: 512 KYMIESLE--------------SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 557
+ + + L+ + D + +W L W +N Y + GD +
Sbjct: 685 QVLQDQLDGKLPVLPVGNSSTPTADNPAMT---IDWYALEEPWTLAKNTYAASAEGDCIE 741
Query: 558 TSQ 560
++
Sbjct: 742 VAK 744
>gi|424665881|ref|ZP_18102917.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
616]
gi|404574134|gb|EKA78885.1| hypothetical protein HMPREF1205_01756 [Bacteroides fragilis HMW
616]
Length = 732
Score = 275 bits (703), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 182/577 (31%), Positives = 280/577 (48%), Gaps = 63/577 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGP+ Q ++D+Q LQKK++ R+ E GM PVL F G VP ++ FP+A I
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTQLQKKMIDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + + L +DPLF ++ + F E+Q K +G+ S Y D F E N+
Sbjct: 258 DAGKWITYQRPA------FLVPSDPLFAKVAQIFYEEQEKLFGK-SRYYGGDPFHEGGNS 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
++ I+ + IY M++ + DA+W++QGW +P + ALL + G+ +
Sbjct: 311 EGIN----ITEAASDIYKAMKANNPDAIWVLQGW--GANPSY------ALLKGLKQGEAL 358
Query: 179 VLDLFAEVKPIW-----STSKQFYGV---PYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+LDL + +P W S S + G +IWC L NF G I MYG L S A G + A
Sbjct: 359 ILDLMSCARPQWGGDPSSQSHREDGYLDHNWIWCALPNFGGRIGMYGKLQSYATGVIRAE 418
Query: 231 TSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
V GVG + EGI NP+ YD++ +MA++ + +DVK+WI Y+ RYG +
Sbjct: 419 HHPKGKYVCGVGTTPEGIGTNPIDYDMVYDMAWRTDSIDVKSWIANYTTYRYGSPNNNAK 478
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
A L +VYNC A A P + T
Sbjct: 479 AAMQQLSTSVYNCPWAADGPQESYFCARPSLKID------------------------RT 514
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
SS+ HL+Y V++ALE + + NEL +TYRYD++D+TRQ LA Y + I +
Sbjct: 515 SSWGTAHLYYQPINVLQALEHLLKAENELKEIDTYRYDVVDVTRQMLADYGKYIHKCIAD 574
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 469
AY D + +FL+++ D D LL+ FLLG ++ A N +++ + NA
Sbjct: 575 AYYGKDTEKFDFYTSKFLQMISDQDLLLSTRKEFLLGKFIRQADACGSNPMEKRMFINNA 634
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
+ QIT W S L +Y +K W+G+L Y PR YF Y+ LE + +
Sbjct: 635 KRQITTW----ASVNSSLHEYAHKEWNGILGTLYAPRWKAYFDYLRTKLEGKNPKEI--- 687
Query: 530 RREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
++ + DW + + + ++ +Y+ Y
Sbjct: 688 --DFFTMETDWVESKKEFSAVPIKKEIEIAKTIYHNY 722
>gi|260642393|ref|ZP_05415712.2| alpha-N-acetylglucosaminidase [Bacteroides finegoldii DSM 17565]
gi|260622285|gb|EEX45156.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides finegoldii DSM
17565]
Length = 735
Score = 275 bits (703), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 173/567 (30%), Positives = 282/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 198 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I F+E+Q K YG T+HIY D F+E P
Sbjct: 258 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQHRFLEEQTKVYG-TNHIYGIDPFNEVDSP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +QS DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 314 NWNEEFLSNVSDKIYKSIQSVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 374 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+++ + V WI ++ R G I AW LY +
Sbjct: 434 LGVTLEGLDVNPLMYEFVFERAWEN-SIPVHQWIANWAQCRGGNVDNHIIKAWKQLYEKI 492
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ E Y + LW
Sbjct: 493 Y--TSAALCGQAVLMNARPQLE------------------GVEGWNTLPGYDYKNIDLWE 532
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+ D G
Sbjct: 533 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFADCYRKKDLEGT 583
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+R +L+ D+D LL C +G W++ A+ A NE+++K YE NAR +T+W
Sbjct: 584 KVWGQRMDQLLLDVDRLLCCSPVLSIGKWIKDARDFAVNEQEQKYYEENARCILTVW--- 640
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +
Sbjct: 641 -GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYE 699
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W +P+ S + + ++ L KY
Sbjct: 700 WTLKNEDFPITSGENPISLAKELILKY 726
>gi|440792549|gb|ELR13759.1| peptidase, S8/S53 subfamily protein [Acanthamoeba castellanii str.
Neff]
Length = 981
Score = 274 bits (701), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 168/531 (31%), Positives = 263/531 (49%), Gaps = 71/531 (13%)
Query: 50 LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIY 109
++ ++P+A +T+ +W ++ Y L D L+ IG I +E+G T HIY
Sbjct: 434 IKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG-TDHIY 490
Query: 110 NCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALL 169
N DTF+E +PP P Y+++ A+Y GM + D A+W+MQGW F +DPFW ++KA L
Sbjct: 491 NADTFNEMSPPSADPTYLAAASRAVYEGMATQDPQALWVMQGWSFVFDPFWTKDRIKAYL 550
Query: 170 NSVPLGKLVVLDLFAEVKPIWSTSKQF----YGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
+ V +++LDL ++ P W+ + QF +G ++WCMLHN G +YG L +
Sbjct: 551 SGVDNSDMLILDLASDNSPEWNKTGQFRDSYFGKEFVWCMLHNGGGVRGLYGNLTQYSSD 610
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
P+ A + TMVGVGM+ME IEQNPVVY+LMSEM ++ E D+ W+ +Y+ RRYG +
Sbjct: 611 PLIALATPGNTMVGVGMTMEAIEQNPVVYELMSEMGWRSEAFDIVEWVQRYAERRYGLAT 670
Query: 286 PA--IQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
+ + +AW +L YN + +D + +G +
Sbjct: 671 GSSPVGEAWELLREATYNQS---------------GLDAGLFGFAPALGMGHGGTSN--- 712
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNE--LSASNTYRYDLIDLTRQALAKYAN 401
+T EV AL LF+ S + + ++YD +DLTRQ LA N
Sbjct: 713 ----------------ATKEV-EALRLFLQSAQTEGYAPNGPWQYDCVDLTRQVLANTFN 755
Query: 402 ELFLNIIEAY-----QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLA 456
+++ + AY +D L+ L ++ D+D LLA + +LLG W++ A A
Sbjct: 756 DVYSQLDAAYTSYATNKSDTLPFLPLAAELLGIISDLDRLLATNPNYLLGTWIKDAVSWA 815
Query: 457 QNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 516
EQ Y++NAR QIT+W + Q + DY K+W+GLL ++
Sbjct: 816 SIPEQALHYQFNARNQITLWGPDGQ-----ISDYATKHWAGLL---------------MK 855
Query: 517 SLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
++ +G F + E ++L W YP GD L + + KYL
Sbjct: 856 AVGAGVMFNSTAYGTELLQLEQKWNQENTTYPTTPTGDTLQVALRISQKYL 906
>gi|383122982|ref|ZP_09943669.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
gi|251841923|gb|EES70003.1| hypothetical protein BSIG_0276 [Bacteroides sp. 1_1_6]
Length = 730
Score = 274 bits (701), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 169/570 (29%), Positives = 288/570 (50%), Gaps = 41/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP+SWL+QQ VLQK+IL R + M PVLPAFSG+VP L+ ++P AKI
Sbjct: 197 MSNVDYWQSPLPKSWLEQQEVLQKQILKRERDFNMTPVLPAFSGHVPKELKAIYPDAKIH 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S R ++ ++ D LF I + ++E+Q YG T HIY D F+E P
Sbjct: 257 EMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEEQTAIYG-TDHIYGIDPFNEVDSP 312
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ ++++ + IY + D++A WL W+F +D W P++++ L +VP KL++
Sbjct: 313 NWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHDQKKWTQPRIRSFLEAVPDDKLIL 372
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++ +YG PY+WC L NF GN M G LD + + + G
Sbjct: 373 LDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMVGNLDDVDVKIEKLFVEGGENVYG 432
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + + A+ + + WI ++ R G I AW+ L+ +
Sbjct: 433 LGATLEGFDVNPFMYEFVFDQAWDY-PLTTDQWIQNWAKCRGGNQDRHILKAWDSLHKKI 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y AT ++ A P + V ++ +Y++ LW
Sbjct: 492 YK--KYATAGQAVLMNARPML------------------VGTDSWNTYPDITYNNRDLWD 531
Query: 360 STSEVIRALELFIASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+E+++A + +NT YR+D+I++ RQ L + + + Y D G
Sbjct: 532 IWTEMLKASHI----------NNTGYRFDVINVGRQVLGNLFSSFRDHFTQCYSEKDIDG 581
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+ + + + L+ D D LL+C F +G W++ A+ + E +++ YE NAR +T+W
Sbjct: 582 MKKWADQMDSLLIDTDRLLSCETNFSIGKWIDDARSFGKTEAEKEYYEENARCILTVW-- 639
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
++A+ L DY N+ W GL YY R + +I + SG F K + +
Sbjct: 640 --GQKATQLNDYANRGWGGLTYSYYRERWKRFTTEVITASLSGQKFDEKQFYQSITDFEY 697
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
+W + +P+ S + ++ ++ L KY+Q
Sbjct: 698 EWTLSKEHHPIISGENPILLAKTLSEKYMQ 727
>gi|29345848|ref|NP_809351.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337741|gb|AAO75545.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 730
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 169/570 (29%), Positives = 288/570 (50%), Gaps = 41/570 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP+SWL+QQ VLQK+IL R + M PVLPAFSG+VP L+ ++P AKI
Sbjct: 197 MSNVDYWQSPLPKSWLEQQEVLQKQILKRERDFNMTPVLPAFSGHVPKELKAIYPDAKIH 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W S R ++ ++ D LF I + ++E+Q YG T HIY D F+E P
Sbjct: 257 EMSQWGGYDSKYR---SHFIEPMDSLFNIIQKMYLEEQTAIYG-TDHIYGIDPFNEVDSP 312
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ ++++ + IY + D++A WL W+F +D W P++++ L +VP KL++
Sbjct: 313 NWNEDFLAKVSKKIYESIYQVDAEAKWLQMTWMFYHDQKKWTQPRIRSFLEAVPDDKLIL 372
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++ +YG PY+WC L NF GN M G LD + + + G
Sbjct: 373 LDYYCDSTEIWRNTEMYYGKPYMWCYLGNFGGNSMMVGNLDDVDVKIEKLFVEGGENVYG 432
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG + NP +Y+ + + A+ + + WI ++ R G I AW+ L+ +
Sbjct: 433 LGATLEGFDVNPFMYEFVFDQAWDY-PLTTDQWIQNWAKCRGGNQDRHILKAWDSLHKKI 491
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y AT ++ A P + V ++ +Y++ LW
Sbjct: 492 YK--KYATAGQAVLMNARPML------------------VGTDSWNTYPDITYNNRDLWD 531
Query: 360 STSEVIRALELFIASGNELSASNT-YRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+E+++A + +NT YR+D+I++ RQ L + + + Y D G
Sbjct: 532 IWTEMLKASHI----------NNTGYRFDVINVGRQVLGNLFSSFRDHFTQCYSEKDIDG 581
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+ + + + L+ D D LL+C F +G W++ A+ + E +++ YE NAR +T+W
Sbjct: 582 MKKWADQMDALLIDTDRLLSCETNFSIGKWIDDARSFGKTEAEKEYYEENARCILTVW-- 639
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
++A+ L DY N+ W GL YY R + +I + SG F K + +
Sbjct: 640 --GQKATQLNDYANRGWGGLTYSYYRERWKRFTTEVITASLSGQKFDEKQFYQSITDFEY 697
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
+W + +P+ S + ++ ++ L KY+Q
Sbjct: 698 EWTLSKEHHPIISGENPILLAKTLSEKYMQ 727
>gi|29349767|ref|NP_813270.1| alpha-N-acetylglucosaminidase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341678|gb|AAO79464.1| alpha-N-acetylglucosaminidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 744
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 169/526 (32%), Positives = 265/526 (50%), Gaps = 59/526 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 205 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 256
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ + Y+ D F E
Sbjct: 257 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 313
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I S M+ + +AVW+MQ W + +A+++++ G
Sbjct: 314 GNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQAW--------QANPREAMVSTLDSG 361
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E P +W K F +++CML NF GN+ ++G ++ + G
Sbjct: 362 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 421
Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N T+ GVG + EGIE NPV+++L+ E+ ++ E+ AW+ Y RYG + P
Sbjct: 422 ACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWREERFAPDAWLQAYLKARYGNDLSP 481
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ +AW L HTVYN + S++ G +Q+
Sbjct: 482 EVAEAWRALEHTVYNAPKNYQGEG---------TVESLLCARPGFHQD------------ 520
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+YS +A L ++ ++ +N + YDL+D+ RQ+LA N L
Sbjct: 521 -RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNNFEYDLVDVVRQSLADKGNVLLEE 579
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 466
I ++Y D + S++FLEL+ D LL+ F + WL +A+ L EE++K YE
Sbjct: 580 ISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEFSVSSWLNAARSLGTTEEEKKLYE 639
Query: 467 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 512
WNA IT+W D+ L DY ++ WSG+L+D Y R +F+
Sbjct: 640 WNASALITVWGDSIAANRGGLHDYSHREWSGILKDLYYQRWKTFFE 685
>gi|400595379|gb|EJP63180.1| alpha-N-acetylglucosaminidase [Beauveria bassiana ARSEF 2860]
Length = 761
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 167/530 (31%), Positives = 274/530 (51%), Gaps = 33/530 (6%)
Query: 1 MSNLHG-WGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSA 57
N+HG WGG L W++QQ LQKKI+ R+ ELG+ PVLP F G VPAAL+ + P
Sbjct: 205 FGNIHGTWGGEGRLSAEWINQQFALQKKIVARMVELGITPVLPGFPGFVPAALKKLRPDV 264
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
I + W V + T L+ TD + E+ FI+ Q+KE+G +++Y D F+E
Sbjct: 265 NIAEAPVWVDVPRNN--TATAFLNPTDKTYAELQSLFIKNQIKEFGNVTNVYTVDQFNEI 322
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVP-LG 175
P +YI+ + ++ Y G+ + + A+WLMQGWLF S FW ++ A L P
Sbjct: 323 NPSSGDTKYITDVSSSTYKGITAANPAAIWLMQGWLFYSSQSFWTQQRVDAYLAGPPGQD 382
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LDLF+E +P W ++ ++G P+IWC LH+F GN ++G + ++ V+A E+
Sbjct: 383 DMIILDLFSESQPQWQRTRSYFGRPWIWCELHDFGGNQALHGKITNVTQNSVQA-LKESG 441
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD---AW 292
++VG G++ EG E N VVYD++ + A++ +D + ++ RY + +D AW
Sbjct: 442 SIVGYGLTPEGYEGNEVVYDILLDQAWEGSPIDTANYFRAWARNRYSAAGIIPEDVFTAW 501
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
L Y+ D A V V+ + PS+ + + +Y P + + K + +
Sbjct: 502 EQLRQHAYDVQDNAIPS---VGVSVYQLFPSLKGLVN-RTGHYPPPTALQYDPKVMKNIW 557
Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
HL+Y+++ I S L + D +D+TRQ L +++ +++ +Q
Sbjct: 558 ---HLFYNST---------IDSPGLLQIP-AFHLDFVDVTRQVLGNAFIDIYTDLVNQFQ 604
Query: 413 LNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
V Q L L +ED+D L ++ F WL SA+ Q+ +NAR+
Sbjct: 605 ATANATVIQDLGNSMLSFIEDLDMALNTNEHFTFKKWLNSAESWGQSIGAPDAVAFNARS 664
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 521
Q+T+W E+ L DY K WSG+++ YYG R I+ ++ + E G
Sbjct: 665 QVTVW----STESRALDDYAAKAWSGIVKSYYGERWRIFINSLVSAREQG 710
>gi|429740222|ref|ZP_19273924.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
gi|429153947|gb|EKX96708.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
Length = 730
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 169/567 (29%), Positives = 264/567 (46%), Gaps = 44/567 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP+ WL++Q LQK+IL R M PVLPAF+G+VPA L+ +FP A I
Sbjct: 198 MANIDRWNGPLPKEWLEEQRDLQKQILARERAFNMKPVLPAFAGHVPAELKRIFPDANIK 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W D ++ C + L+ +PLF +I + F+E+Q +G T HIY D F+E PP
Sbjct: 258 SLGKWGGF--DEQYLC-HFLNPGEPLFAKIQKLFLEEQTALFG-TDHIYGVDPFNEGEPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
P Y+ + +Y + + D A W+ GW+F YD W P ++KA L VP GK+ +
Sbjct: 314 SWEPAYLKEISKNMYGTLTAVDPKAEWMQMGWMFYYDKKVWTPKRVKAFLTGVPQGKMSL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W T+ FYG PYIWC L NF GN + G + A + M+G
Sbjct: 374 LDYHCENVELWKTNDGFYGQPYIWCYLGNFGGNTTLTGNVKETGKRLDAALKAARRNMLG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EG++ Y+ + + + H + WI++ + R G + P+++ AW +L+ +
Sbjct: 434 VGSTLEGLDVIQFPYEYVFDKVWTHSDKGNQQWIDELADRHAGFTSPSVRKAWQILFDEI 493
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
+ G Y S+ VL S
Sbjct: 494 FVQVPGT----------------------------YSILPSRSPVLNDNHSERTEIKYPA 525
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E + +L L + N + DLI + RQ L + AY D +
Sbjct: 526 QRLEEVWSLLLDVPQ----CERNELQVDLIAVGRQVLGNKFLAVKSEFDAAYAAKDITLL 581
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
Q + EL+ D+D L + + + W++ A+ L +N E + YE NAR IT+W
Sbjct: 582 RQKAYEMEELLSDLDCLTSFNTRCTVNKWIDDARALGRNAEMKNYYERNARYLITLW--- 638
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
L DY ++ W GL+ YYG R +Y ++ S ++G F K + + +
Sbjct: 639 ----GGHLSDYASRAWGGLIGSYYGGRWRLYIHDILASAQTGKPFDQKAFDEKRSQFEQT 694
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W + + D L + +++KY
Sbjct: 695 WVHSTTPITLPQRNDLLTFCKMMFSKY 721
>gi|383120707|ref|ZP_09941431.1| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
gi|382984934|gb|EES68331.2| hypothetical protein BSIG_2292 [Bacteroides sp. 1_1_6]
Length = 736
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 169/526 (32%), Positives = 265/526 (50%), Gaps = 59/526 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 197 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 248
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ + Y+ D F E
Sbjct: 249 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 305
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I S M+ + +AVW+MQ W + +A+++++ G
Sbjct: 306 GNTEGVD----LAKAGTSIMSAMKKANPEAVWVMQAW--------QANPREAMVSTLDSG 353
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E P +W K F +++CML NF GN+ ++G ++ + G
Sbjct: 354 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 413
Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N T+ GVG + EGIE NPV+++L+ E+ ++ E+ AW+ Y RYG + P
Sbjct: 414 ACAHVNGKTLRGVGATPEGIENNPVMFELLYELPWREERFAPDAWLQAYLKARYGNDLSP 473
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ +AW L HTVYN + S++ G +Q+
Sbjct: 474 EVAEAWRALEHTVYNAPKNYQGEG---------TVESLLCARPGFHQD------------ 512
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+YS +A L ++ ++ +N + YDL+D+ RQ+LA N L
Sbjct: 513 -RTSTWGYAKLFYSPDSTAKAARLLLSVADQYKGNNNFEYDLVDVVRQSLADKGNVLLEE 571
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 466
I ++Y D + S++FLEL+ D LL+ F + WL +A+ L EE++K YE
Sbjct: 572 ISQSYDRKDKDSFGKQSQQFLELILAQDSLLSTRKEFSVSSWLNAARSLGTTEEEKKLYE 631
Query: 467 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 512
WNA IT+W D+ L DY ++ WSG+L+D Y R +F+
Sbjct: 632 WNASALITVWGDSIAANRGGLHDYSHREWSGILKDLYYQRWKTFFE 677
>gi|237719043|ref|ZP_04549524.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451821|gb|EEO57612.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 713
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 181/577 (31%), Positives = 283/577 (49%), Gaps = 60/577 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 181 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 240
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 241 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 296
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 297 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 356
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 357 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 416
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G +++DAW L++ +
Sbjct: 417 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 475
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 476 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 510
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE S+ + +R DLI + RQ L Y ++ + + D
Sbjct: 511 --LEVWRKL-------NEASSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVETKDHQ 561
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 562 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 620
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 532
L DY ++ W+GL+ DYY R +Y I++ E G D +L+D +E
Sbjct: 621 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAAEKGVEVDQKQLEDELKEIE 674
Query: 533 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 566
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 675 EGWVNATDREDTRKDVH---STTDGLLSFSTFLFSKY 708
>gi|126307952|ref|XP_001365931.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Monodelphis
domestica]
Length = 481
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 128/253 (50%), Positives = 170/253 (67%), Gaps = 4/253 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPLP SW +Q LQ +IL R+ GM PVLPAF+G++P A VFP A +T
Sbjct: 205 MGNLHTWGGPLPSSWDLKQSYLQYQILERMRSFGMKPVLPAFAGHIPKAFTRVFPQANVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L NW + C+YLL DPLF +G F+ + KE+G T HIY+ D F+E PP
Sbjct: 265 KLDNWIDFNCT--YSCSYLLAPEDPLFPVVGSLFLRELAKEFG-TDHIYSADIFNEMDPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+P Y+++ AA+Y M + D DAVWL QGWLF P FW+PPQMKA+L +VP G+ ++
Sbjct: 322 SSNPAYLAATTAAVYEAMVAVDVDAVWLFQGWLFQNHPDFWKPPQMKAVLEAVPRGRFLI 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S + FYG P+IWCMLHNF GN ++G+LD++ GP AR N+T+VG
Sbjct: 382 LDLFAESQPVYSRTNSFYGQPFIWCMLHNFGGNHGLFGVLDAVNRGPSTARLFPNSTIVG 441
Query: 240 VGMSMEGIEQNPV 252
G+ EGI QN +
Sbjct: 442 TGIVPEGINQNEI 454
>gi|380697007|ref|ZP_09861866.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 703
Score = 272 bits (695), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 173/567 (30%), Positives = 281/567 (49%), Gaps = 39/567 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLPQSWL Q LQK IL R M P+LPAF+G+VPA L+ ++P AKI
Sbjct: 166 MSNVDYWQSPLPQSWLADQEKLQKLILERERAFDMTPILPAFAGHVPAELKELYPEAKIY 225
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I R F+E+Q K YG T+HIY D F+E P
Sbjct: 226 TMSQWGGYDEKYR---SHFIDPMDSLYSVIQRRFLEEQTKVYG-TNHIYGIDPFNEVDSP 281
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY +Q DS A WL W+F + W P++K+ LN+VP KL++
Sbjct: 282 NWNEEFLSNVSDKIYKSIQDVDSAAQWLQMTWMFYHAKEKWTQPRIKSFLNAVPQDKLIL 341
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++Q+YG PYIWC L NF GN + G L+ + F + G
Sbjct: 342 LDYYCDYTEIWRDTEQYYGKPYIWCYLGNFGGNTFLAGDLNDVDFKIDRLFKEGGDNVYG 401
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++EG++ NP++Y+ + E A+++ + WI ++ R G I AW LY +
Sbjct: 402 LGVTLEGLDVNPLMYEFVFERAWEN-SMPAHQWIANWAQCRGGNVDNHIVKAWKQLYEKI 460
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T A ++ A P ++ T Y Y + LW
Sbjct: 461 Y--TSAALCGQAVLMNARPQLEGVEGWNTLPGY------------------DYKNIDLWE 500
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E+++A ++ + Y +D+I++ RQ L + + Y+
Sbjct: 501 IWKELLKAEGVY---------HSEYHFDVINVGRQVLGNLFADYRDKFTDCYRKKKLEET 551
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+R +L+ D+D LL C F +G W++ AK A NE+++K YE NAR +T+W
Sbjct: 552 KVWGQRMDQLLLDVDRLLCCSPVFSIGKWIKDAKDFAVNEQEQKYYEENARCILTVW--- 608
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
++ + L DY N+ W GL R +Y R + + +I ++ F + + ++ + +
Sbjct: 609 -GQKDTQLNDYANRGWGGLTRTFYRERWKRFTEEVIAAMTRHKNFDEEKFHQDITQFEYE 667
Query: 540 WQNGRNVYPVESNGDALITSQWLYNKY 566
W +P+ S + + ++ L KY
Sbjct: 668 WTLKNEDFPITSEENPISLAKELILKY 694
>gi|423248233|ref|ZP_17229249.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
CL03T00C08]
gi|423253182|ref|ZP_17234113.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
CL03T12C07]
gi|392657082|gb|EIY50719.1| hypothetical protein HMPREF1067_00757 [Bacteroides fragilis
CL03T12C07]
gi|392660340|gb|EIY53954.1| hypothetical protein HMPREF1066_00259 [Bacteroides fragilis
CL03T00C08]
Length = 732
Score = 271 bits (694), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 174/533 (32%), Positives = 262/533 (49%), Gaps = 58/533 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGP+ Q ++D+Q LQKK+L R+ E GM PVL F G VP ++ FP+A I
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTKLQKKMLDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + + L +DPLF ++ F E+Q K +G S Y D F E N+
Sbjct: 258 NAGKWITYQRPA------FLVPSDPLFAKVAEIFYEEQKKLFGE-SRYYGGDPFHEGGNS 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
++ I+ + IY M++ + +A+W++QGW S +P ALL + G+ +
Sbjct: 311 KGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--SGNP------SVALLKGLKHGEAL 358
Query: 179 VLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
VLDL A +P W F +IWC L NF G I MYG L S A G ++A
Sbjct: 359 VLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALPNFGGRIGMYGKLQSYATGVIKAE 418
Query: 231 TSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
V G+G + EGI NP+ YD++ +MA++ + +D+K+WI Y+ RYG +
Sbjct: 419 HHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDSIDIKSWIANYTTYRYGSENSNAK 478
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
A L +VYNC A A P + +S
Sbjct: 479 AAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYVS----------------------- 515
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
S+ HL+Y V++ALE + + EL +TYRYD++D+TRQ LA Y + I +
Sbjct: 516 -SWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRYDVVDITRQMLADYGKYIHKCISD 574
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 469
AY+ + + +FL+++ D D LL+ FLLG ++ A N +++ + NA
Sbjct: 575 AYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLGEYIRQADTCGSNPTEKRMFINNA 634
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 522
+ QIT W S L +Y +K W+G+L Y PR +YF Y+ LE +
Sbjct: 635 KRQITSW----TSVNSSLHEYAHKEWNGILSTLYAPRWKVYFDYLHAKLEGKN 683
>gi|423269877|ref|ZP_17248849.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
CL05T00C42]
gi|423272668|ref|ZP_17251615.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
CL05T12C13]
gi|392700723|gb|EIY93885.1| hypothetical protein HMPREF1079_01931 [Bacteroides fragilis
CL05T00C42]
gi|392708745|gb|EIZ01850.1| hypothetical protein HMPREF1080_00268 [Bacteroides fragilis
CL05T12C13]
Length = 732
Score = 271 bits (693), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 174/533 (32%), Positives = 262/533 (49%), Gaps = 58/533 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGP+ Q ++D+Q LQKK+L R+ E GM PVL F G VP ++ FP+A I
Sbjct: 198 MGNLEKFGGPVSQQFIDRQTKLQKKMLDRMREYGMEPVLQGFYGMVPNSMITKFPNADIR 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + + L +DPLF ++ F E+Q K +G S Y D F E N+
Sbjct: 258 DAGKWITYQRPA------FLVPSDPLFAKVAEIFYEEQKKLFGE-SRYYGGDPFHEGGNS 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
++ I+ + IY M++ + +A+W++QGW S +P ALL + G+ +
Sbjct: 311 KGIN----ITEAASNIYKAMKTNNPNAIWVLQGW--SGNP------SVALLKGLKHGEAL 358
Query: 179 VLDLFAEVKPIWSTSKQ--------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
VLDL A +P W F +IWC L NF G I MYG L S A G ++A
Sbjct: 359 VLDLMACARPQWGGEPSSSFHREDGFLDHNWIWCALPNFGGRIGMYGKLQSYATGVIKAE 418
Query: 231 TSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
V G+G + EGI NP+ YD++ +MA++ + +D+K+WI Y+ RYG +
Sbjct: 419 HHPKGKYVCGIGTTPEGIGTNPINYDMVYDMAWRTDSIDIKSWIANYTTYRYGSENSNAK 478
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
A L +VYNC A A P + +S
Sbjct: 479 AAMLQLSTSVYNCPWAADGPQESYFCARPSLKIDYVS----------------------- 515
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
S+ HL+Y V++ALE + + EL +TYRYD++D+TRQ LA Y + I +
Sbjct: 516 -SWGTAHLYYQPINVLQALEHLLKAEKELGYIDTYRYDVVDITRQMLADYGKYIHKCISD 574
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 469
AY+ + + +FL+++ D D LL+ FLLG ++ A N +++ + NA
Sbjct: 575 AYKEKNIKKFDLYTSKFLQMILDQDLLLSTRKEFLLGEYIRQADTCGSNPTEKRMFINNA 634
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 522
+ QIT W S L +Y +K W+G+L Y PR +YF Y+ LE +
Sbjct: 635 KRQITSW----TSVNSSLHEYAHKEWNGILSTLYAPRWKVYFDYLHAKLEGKN 683
>gi|423722278|ref|ZP_17696454.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
CL09T00C40]
gi|409242419|gb|EKN35181.1| hypothetical protein HMPREF1078_00517 [Parabacteroides merdae
CL09T00C40]
Length = 752
Score = 271 bits (692), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 183/584 (31%), Positives = 270/584 (46%), Gaps = 69/584 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL K+I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF IGR F+E++ K YG +Y D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPVD+PEY+ ++G AI+ D +++W MQ W R P +KA VP L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL------REPIVKA----VPKENL 351
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL +G P + LHNF G I ++G L +A +N +
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G G+ ME IEQNPV YDL EM ++V+++ W+ +Y+ RRYG+ AW L
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y G R I+A P++ G G P
Sbjct: 469 GPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YS V++A L + L S+ YR+D++D+ RQ ++ + EA++ D
Sbjct: 505 -YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMSNLGQAIHKQAAEAFRKKDKE 563
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
S RFLE++ D D LL F WL A+ N E++ +E +A +T+W
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEKDLFEKDATALVTVW- 622
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW----RREW 533
+ L+ DY + W+GL+ YY R ++ + + L++G + KD RE
Sbjct: 623 --GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDHLDAGTNYSEKDLPQTHGRES 680
Query: 534 IKLTN------DWQ-----NGRNVYPVESNGDALITSQWLYNKY 566
+ + DW+ V + GD + T+ LY KY
Sbjct: 681 FRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRLYKKY 724
>gi|336412606|ref|ZP_08592959.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
3_8_47FAA]
gi|335942652|gb|EGN04494.1| hypothetical protein HMPREF1017_00067 [Bacteroides ovatus
3_8_47FAA]
Length = 727
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 180/577 (31%), Positives = 283/577 (49%), Gaps = 60/577 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACAEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 532
L DY ++ W+GL+ DYY R +Y I++ E G D +L+D +E
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAAEKGVEVDQKQLEDELKEIE 688
Query: 533 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 566
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|62088640|dbj|BAD92767.1| huntingtin interacting protein-1-related [Homo sapiens]
Length = 449
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 155/418 (37%), Positives = 232/418 (55%), Gaps = 50/418 (11%)
Query: 142 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 200
D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+VLDLFAE +P+++ + F G P
Sbjct: 18 DTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQP 77
Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 260
+IWCMLHNF GN ++G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+
Sbjct: 78 FIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAEL 137
Query: 261 AFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 318
++ + V D+ AW+ ++ RRYG S P AW +L +VYNC+ + NR +V P
Sbjct: 138 GWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRP 197
Query: 319 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 378
L+ TS +WY+ S+V A L + S L
Sbjct: 198 S-------------------------LQMNTS------IWYNRSDVFEAWRLLLTSAPSL 226
Query: 379 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 432
+ S +RYDL+DLTRQA+ + + + AY L A GV EL+
Sbjct: 227 ATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELASLLRAGGVLAY-----ELLPA 281
Query: 433 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 492
+D +LA FLLG WLE A+ A +E + YE N+R Q+T+W E ++L DY N
Sbjct: 282 LDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYAN 336
Query: 493 KYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 550
K +GL+ +YY PR ++ + +++S+ G F+ + + +L + + YP +
Sbjct: 337 KQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQ 394
>gi|154492110|ref|ZP_02031736.1| hypothetical protein PARMER_01741 [Parabacteroides merdae ATCC
43184]
gi|154087335|gb|EDN86380.1| Alpha-N-acetylglucosaminidase (NAGLU) [Parabacteroides merdae ATCC
43184]
Length = 752
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 179/584 (30%), Positives = 270/584 (46%), Gaps = 69/584 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL K+I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF IGR F+E++ K YG +Y D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPVD+PEY+ ++G AI+ D +++W MQ W ++++ +VP L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSLR----------ESIVKAVPKENL 351
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL +G P + LHNF G I ++G L +A +N +
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G G+ ME IEQNPV YDL EM ++V+++ W+ +Y+ RRYG+ AW L
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y G R I+A P++ G G P
Sbjct: 469 GPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YS V++A L + L S+ YR+D++D+ RQ ++ + EA++ D
Sbjct: 505 -YSPLSVVQAEGLLLKDAGRLKGSDPYRFDIVDIQRQLMSNLGQAIHKQAAEAFRKKDKE 563
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
S RFLE++ D D LL F WL A+ N E++ +E +A +T+W
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEKDLFEKDATALVTVW- 622
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW----RREW 533
+ L+ DY + W+GL+ YY R ++ + + L++G + KD RE
Sbjct: 623 --GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDHLDAGTNYSEKDLPQTHGRES 680
Query: 534 IKLTN------DWQ-----NGRNVYPVESNGDALITSQWLYNKY 566
+ + DW+ V + GD + T+ LY KY
Sbjct: 681 FRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRLYKKY 724
>gi|212541222|ref|XP_002150766.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
gi|210068065|gb|EEA22157.1| alpha-N-acetylglucosaminidase, putative [Talaromyces marneffei ATCC
18224]
Length = 787
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 176/527 (33%), Positives = 271/527 (51%), Gaps = 45/527 (8%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
+ N+ G WG PLP W++ Q LQKKIL R+ ELG+ PVLP+F+G VP A+ V P+AK+
Sbjct: 203 LGNIQGFWGDPLPNEWIESQFELQKKILARMVELGITPVLPSFTGFVPRAITRVLPNAKV 262
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
W S+ + C L+ D F + ++ I +Q YG SHIY D ++EN P
Sbjct: 263 VPGSRWNVFSSN--YTCDTFLEPFDDNFALLQKSTISKQQAYYGNISHIYALDQYNENNP 320
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-LV 178
+P+Y+ ++ +++ D DAVWLMQ WLF FW + A L+ V ++
Sbjct: 321 FSSNPDYLRNISRTTSQSLKAADPDAVWLMQSWLFLDATFWNNVTICAYLSGVENNSDML 380
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
+LDLFAE +P+W + +YG P+IWC +H++ GN+ +YG + +I A S +MV
Sbjct: 381 ILDLFAESQPVWQLTDSYYGKPWIWCQVHDYGGNMGLYGQIMNITENATAALASSG-SMV 439
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVP-AIQDAWNVL 295
G G +ME E N +VYDL+ + A+ ++ + + RY + VP + DAW +L
Sbjct: 440 GFGHTMESQEGNEIVYDLLLDQAWSETPINTSQYFEDWVTVRYAGTQHVPQQLFDAWEIL 499
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISV--TEGKYQNYGKPVSKEAVLKSETSSYD 353
+ YN T+ A+ I+ +++PSI + EG + T +YD
Sbjct: 500 RWSAYNNTNLASSSVPKSIL---ELEPSISGLLNREGHHPT--------------TINYD 542
Query: 354 HPHLWYSTSEVIRALEL-FIASGNELSA--SNTYRYDLIDLTRQALAKYANELFLNIIEA 410
P L V+ A L + A+ ELS + + YDLI LTRQ L + +I
Sbjct: 543 -PEL------VVEAWALTYEAALLELSLWDNPAFNYDLIFLTRQVLVNAFIPRYELLISF 595
Query: 411 YQLND--AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ-NEEQEKQYEW 467
Y + + R+ ++L++ +D +L ++ F L W+ A A N YE+
Sbjct: 596 YNNENYSVPAIVSAGRQLIDLLQSLDTVLGTNECFQLAQWINKAVSRAHGNTTLAAYYEY 655
Query: 468 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 514
NAR QIT+W N + + DY +K W+GL+ YY PR I Y+
Sbjct: 656 NARNQITLWGPNGE-----ISDYASKQWAGLISSYYVPRWQILVDYL 697
>gi|299140550|ref|ZP_07033688.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella oris C735]
gi|298577516|gb|EFI49384.1| alpha-N-acetylglucosaminidase (N-acetyl-alpha-glucosaminidase)
(NAG) [Prevotella oris C735]
Length = 741
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 171/527 (32%), Positives = 261/527 (49%), Gaps = 49/527 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW +QQ LQKKIL R++E GM PVLP F G +P + +T
Sbjct: 186 MNNLEGWGGPLPDSWYNQQEALQKKILKRMHEYGMQPVLPGFCGMMPHDAKAKL-GLNVT 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L TD F +I + + K YG+ ++ Y+ D F E T
Sbjct: 245 DGGIWNGYTRPAN------LSPTDAHFDKIADLYYAELTKLYGKANY-YSMDPFHE-TND 296
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
++ +Y S G + M+ + A W++QGW + +P RP +K + N G L+VL
Sbjct: 297 DETIDY-SKAGCKVMEAMKRVNPKATWVIQGW--TENP--RPQMIKNMKN----GDLLVL 347
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
DLF+E +P IW K + +++CML NF N+ ++G +D + + S
Sbjct: 348 DLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLLHNFYSTKQSSP 407
Query: 235 TT--MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
T + G+G +MEG E NPV+++LMSE+ ++ E + WI Y RYG++ P I+ AW
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSELPWRTE-CKKEDWIKGYVKARYGKTSPEIERAW 466
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
+L T+YNC G + SI G+P +KS +
Sbjct: 467 QLLSETIYNCPAGNNQQG---------PHESIFC---------GRPSLNNFQVKSWSKMR 508
Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
++ Y + A +L ++ +N + YDL+D+ RQALA +L I Y
Sbjct: 509 NY----YDPQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALADQGRLQYLKTIADYN 564
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
+ + RFLE++ D LL F LG W E+A++L ++++ YEWNAR Q
Sbjct: 565 GFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGTTQQEKDLYEWNARVQ 624
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 519
IT W + + L DY +K W G+L+D+Y R I+ + + +E
Sbjct: 625 ITTWGNRICADKGGLHDYAHKEWQGILKDFYYKRWKIFMDALAKQME 671
>gi|410095990|ref|ZP_11290981.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
CL02T12C30]
gi|409227396|gb|EKN20294.1| hypothetical protein HMPREF1076_00159 [Parabacteroides goldsteinii
CL02T12C30]
Length = 753
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 187/590 (31%), Positives = 272/590 (46%), Gaps = 81/590 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D L KK++ R ELGM P+ FSG VP L+N +P AKI
Sbjct: 193 MQNLQSYGGPLPKSWIDSHAELGKKVINRQLELGMQPIQQGFSGYVPRELKNKYPDAKI- 251
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF GR F+E++ K +G +Y D F E+
Sbjct: 252 --------QLQPSWCGFTGAAQLDPTDSLFSAFGRDFLEEEKKLFG-AHGVYAADPFHES 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PP+D+PEY+S++G +IY Q D A+W MQ W R P +KA VP L
Sbjct: 303 RPPIDTPEYLSAVGNSIYKLFQDFDPSAIWAMQAWSL------REPIVKA----VPKEHL 352
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL +T +G P + LHNF G I ++G L +A ++ +
Sbjct: 353 LILDLNGGRSRQENTC---WGYPVVAGNLHNFGGRINLHGDLRLLASNQYAVAKQKSPNV 409
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G G+ ME IEQNPV YDL EM ++VD++ W+ Y+ RRYG + AW L
Sbjct: 410 CGSGLFMESIEQNPVYYDLAFEMPLHADEVDIEEWLGDYAERRYGAASENAHKAWLHLLE 469
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y G R I+A P++ G G P
Sbjct: 470 GPYR--PGTNGTERSSIIA---ARPALNVKKSGPNAGLGIP------------------- 505
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YS VI+A L + ++L+AS YR+D++D+ RQ ++ + EA+ D
Sbjct: 506 -YSPLLVIQAQGLLLKDADKLNASTPYRFDVVDIQRQLMSNLGQAIHKKAAEAFVKKDKA 564
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
S RFLE++ D+D LL F WL A+ E++ E +A +T+W
Sbjct: 565 AFTLHSNRFLEMLRDVDVLLRTRPEFNFDKWLTDARSWGTTNEEKDLLEKDATALVTVW- 623
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG--------------DG 523
+ L+ DY + W+GL+ YY R ++ + E L+ G +
Sbjct: 624 --GADGDPLIFDYSWREWTGLIDSYYLKRWEKFYAMLQEHLDEGNEYSEKGLPMTHGREA 681
Query: 524 FR-------LKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
FR L DW E++ TN + P+ + GD + T+ +Y KY
Sbjct: 682 FRANDFYSELGDWELEFVSRTNKART-----PI-TQGDEIETALKMYKKY 725
>gi|160887167|ref|ZP_02068170.1| hypothetical protein BACOVA_05183 [Bacteroides ovatus ATCC 8483]
gi|423295093|ref|ZP_17273220.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
CL03T12C18]
gi|156107578|gb|EDO09323.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392673999|gb|EIY67450.1| hypothetical protein HMPREF1070_01885 [Bacteroides ovatus
CL03T12C18]
Length = 711
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 169/526 (32%), Positives = 268/526 (50%), Gaps = 59/526 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ ++ Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ GA+I + M+ + AVW++Q W S P+ + ++ S+ G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPKAVWIIQAWQAS-------PR-EEMIASLNQG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E +P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N M+ GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
I +AW L HTVYN D + + S++ G + +
Sbjct: 473 EIMEAWRALEHTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+Y+ +A LF + ++ +N + YDL+D+ RQ+ A N L
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 466
I ++Y D + +++FL+L+ D LL+ F + WL +A+ L EE+++ YE
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKRLYE 630
Query: 467 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 512
WNA IT+W D+ L DY ++ WSGLL+D Y R +F+
Sbjct: 631 WNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLYYQRWKAFFE 676
>gi|262406058|ref|ZP_06082608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294806855|ref|ZP_06765680.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345510563|ref|ZP_08790130.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|262356933|gb|EEZ06023.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294445884|gb|EFG14526.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345454460|gb|EEO49066.2| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
Length = 727
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 180/577 (31%), Positives = 281/577 (48%), Gaps = 60/577 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 532
L DY ++ W+GL+ DYY R +Y I++ E G D +L+D +E
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYVNTFIKAAEEGVEVDQKQLEDELKEIE 688
Query: 533 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 566
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|423223006|ref|ZP_17209475.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640582|gb|EIY34381.1| hypothetical protein HMPREF1062_01661 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 755
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 182/589 (30%), Positives = 277/589 (47%), Gaps = 76/589 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP+S +D+ ++L KKIL R ELGM P+ FSG VP LQ +P AKI+
Sbjct: 195 MQNIQSYGGPLPKSVIDKHVILGKKILARQLELGMQPIQQGFSGYVPRELQAKYPQAKIS 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W T LD TDPLF E+G AF+E+Q K +G + +Y D F E+ PP
Sbjct: 255 MKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDKLFG-SYGVYAADPFHESAPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+D+PEY++ +G I+ Q+ D+ A+W+MQ W D ++ +VP L++L
Sbjct: 308 IDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED----------IVKAVPKESLLIL 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL + + +G P I LHNF G I M+G L +A + + + G
Sbjct: 358 DLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLALLASNQYQKAKARYPNVCGS 413
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G+ ME IEQNPV Y+L EM + + ++AW+ Y+ RRYG A AW L Y
Sbjct: 414 GLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERRYGAKSAAAGKAWMYLLEGPY 473
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
G R IVA P++ G G P Y
Sbjct: 474 R--QGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP--------------------YE 508
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
VIRA + ++L+ S YR+D++D+ RQ + + EA+ D
Sbjct: 509 PMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLGQLVHKKAAEAFASKDKAAFV 568
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
S RFLEL+ DMD LL + WL A+ + +E++ E +A + +T+W +
Sbjct: 569 LHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKEEKDLMERDATSLVTIWGADG 628
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---------------- 524
+ DY + W+GL+ YY PR ++ + L++G +
Sbjct: 629 DPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQGHLDAGTDYQEEGLSLAYGREDFRA 685
Query: 525 -----RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
RL +W ++ Q G+ PV ++GD L+ ++ L++KYL+
Sbjct: 686 NDFYNRLAEWELAYVD-----QTGKARTPV-THGDELVVTRRLFDKYLK 728
>gi|373461342|ref|ZP_09553084.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
gi|371952896|gb|EHO70729.1| hypothetical protein HMPREF9944_01348 [Prevotella maculosa OT 289]
Length = 731
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 164/522 (31%), Positives = 263/522 (50%), Gaps = 53/522 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++GW GPLPQSW+D Q LQ++IL R E GM PVLPAF+G+VP + + P A+IT
Sbjct: 194 MLNINGWQGPLPQSWIDGQADLQRRILQREREFGMRPVLPAFNGSVPLDYKRLHPEARIT 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G W R TY L TDP F ++ ++F+++Q + +G T H+Y D+F+E PP
Sbjct: 254 EVGQWGGFGQAYR---TYFLSPTDPRFGKLQKSFLDEQRRMFG-TDHLYCLDSFNEVQPP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
SP+ + L I++ + D +VW+ GWLF D W P ++A L+ +P + ++
Sbjct: 310 SWSPDTLCMLARHIHASLDKADPQSVWVQMGWLFYNDRKHWTPDVIRAYLSGIPKDRALL 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + +W ++ FYG PYI C+L NF GN + G + ++ ++A +++ M G
Sbjct: 370 LDYYIDHTELWRLTESFYGRPYIACVLGNFGGNTMLQGDVGKVS-SRLDAAIAQDGNMAG 428
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG +MEG NP Y + + A+ D + W+ + + R G + A + AW VL+ +
Sbjct: 429 VGATMEGFGVNPDFYAFVFDKAWDCGTTD-RDWLCRMADRHVGFASAAGRTAWQVLFDRI 487
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKS--ETSSYDHPHL 357
+++ V+ A P E +Y N P V K + S HL
Sbjct: 488 ---MPSYVNESGTVVCARPSF--------EARYLNTTYPAELLGVWKMLLDIDSDKREHL 536
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YD++++ RQ L + + AY +
Sbjct: 537 ----------------------------YDVVNVGRQVLGDFFAFERDGLHRAYLSQRSD 568
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
V +RR ++++D+D LLAC + F L W+E A+ ++ YE NART IT+W
Sbjct: 569 SVDYYARRMDKMLDDLDRLLACSEEFSLRKWIEDARGFGATAAEKDYYERNARTLITVWG 628
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 519
D+ Q L DY N+ W+GL+ YY R I+ ++ ++
Sbjct: 629 DSRQ-----LTDYANRTWAGLVSSYYKQRWHIFTAHVRRAVR 665
>gi|340347658|ref|ZP_08670763.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
gi|433652542|ref|YP_007296396.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
3688]
gi|339608852|gb|EGQ13735.1| alpha-N-acetylglucosaminidase [Prevotella dentalis DSM 3688]
gi|433303075|gb|AGB28890.1| Alpha-N-acetylglucosaminidase (NAGLU) [Prevotella dentalis DSM
3688]
Length = 781
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 178/540 (32%), Positives = 259/540 (47%), Gaps = 64/540 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW QQ LQK+IL R ELGM PVLP + G +P + +T
Sbjct: 207 MNNLEGWGGPLPDSWYRQQEALQKRILQRERELGMEPVLPGYCGMMPHDAKQKL-GLDVT 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + L ATDP F EI + +Q + YG+ SH Y+ D F E +
Sbjct: 266 PGGTWNG------YVRPANLSATDPRFDEIADLYYREQTRLYGK-SHYYSMDPFHETSDD 318
Query: 121 VDSPEYI--SSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
V YI + G + + M+ + A W++QGW + P A+ + +P G L
Sbjct: 319 V----YIDYAQAGRKLMAAMKRENPKANWVIQGWTENPRP--------AMTDGLPAGSLT 366
Query: 179 VLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSI------AFGP 226
VLDLF+E +P IW ++ + +++CML NF GN+ ++G +D + A P
Sbjct: 367 VLDLFSECRPMFGAPSIWKRAEGYGQHDWLFCMLENFGGNVGLHGRMDQLIGNFRLATSP 426
Query: 227 VEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA--------WINQYSV 278
+ G+G +MEG E NP++++LMSE+ ++ ++V A W+ Y
Sbjct: 427 QSPLQQARRHLRGIGFTMEGSENNPIMFELMSELPWRTDEVAQAADARTFRTEWVRGYVK 486
Query: 279 RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP 338
RYG P Q AW +L T+YNC G + SI G+P
Sbjct: 487 ARYGTDDPHAQQAWQLLAETIYNCPAGNNQQG---------PHESIFD---------GRP 528
Query: 339 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
+KS + ++ Y S + A L A+ + L +N Y YDL+D+ RQA+
Sbjct: 529 SLNNFQVKSWSKMRNY----YEPSATLEAARLMAAAADRLKGNNNYEYDLVDIVRQAIDD 584
Query: 399 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
A +++L+ I Y D + S RFL L+ D LL F LG W E+A+ L
Sbjct: 585 QARQVYLHAIADYNGFDRRAFSRDSARFLGLLLMQDRLLGTRREFRLGRWTEAARSLGTT 644
Query: 459 EEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL 518
++ YEWNAR QIT W + + LRDY +K W GLL D+Y R Y + +
Sbjct: 645 PAEKDLYEWNARVQITTWGNRACADQGGLRDYAHKEWQGLLADFYYMRWHTYLDALSRQM 704
>gi|374312699|ref|YP_005059129.1| alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
gi|358754709|gb|AEU38099.1| Alpha-N-acetylglucosaminidase [Granulicella mallensis MP5ACTX8]
Length = 754
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 176/551 (31%), Positives = 267/551 (48%), Gaps = 63/551 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ + GPLPQ +++++ +LQ+++L R+ ELGM PV PAF+G VP + + P +
Sbjct: 221 MGNINHFAGPLPQHFMEEKRILQRQVLNRMRELGMKPVAPAFAGFVPQGFKRLHPEVETF 280
Query: 61 QLGNWF--SVKSDPRWCCTYLLD-ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
L W K+ PR T++L L+ +IG+ FIE+ EYG + Y DTF+E
Sbjct: 281 TL-LWLRKEFKTIPRSTRTFILHPGQQELYRQIGKKFIEEYKAEYGEVEY-YLADTFNEL 338
Query: 118 TPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
PV E + G ++ +Q+GD W+MQGWLF YD FW ++ALL +P
Sbjct: 339 EVPVREDHRYEDLERFGRTVFESIQAGDPKGTWVMQGWLFVYDSDFWNKESVEALLRGIP 398
Query: 174 LGKLVVLDLFAEVKPI---------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
+++++D ++ P W K F+G P+I M H F GN + G L +A
Sbjct: 399 NDRMLIIDYANDLAPSVQGKYLPGQWKLQKAFFGKPWINGMAHTFGGNNNIKGNLKLMAT 458
Query: 225 GPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
P S E +VG GM EGIE N VVY+LM++ +Q E +D+ WI Y RYG
Sbjct: 459 EPSTVLASPERGNLVGWGMCPEGIENNEVVYELMTDAGWQSEAIDLATWIPAYCRSRYGD 518
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
PA+Q AW +L + Y+ T + A P V P SV G P +
Sbjct: 519 CPPAMQQAWELLLKSAYSSHIWMT---KQAWQAEPSVHPIAASVDAG-------PTFQ-- 566
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
RA+ELF++ +L+ S YR DLI+ QA+ +E
Sbjct: 567 ----------------------RAVELFLSCAPQLAKSELYRNDLIEFVSQAVGGRVDEA 604
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
++A + R +E + +DGL+ L W+++ + A+ +++
Sbjct: 605 LALAVQAGDAKQDEDAVAHAARAVEWMRRIDGLMNLRPDRRLETWMQATRAYAKTDDEAT 664
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
Y+ NAR IT W L DY ++ WSGL+RDYY R +F ES +G
Sbjct: 665 FYDENARLLITTW------GWPELSDYASRVWSGLIRDYYAARWEAWF----ESRHTGRS 714
Query: 524 FRLKDWRREWI 534
F L W++ W+
Sbjct: 715 FSLDLWQQTWL 725
>gi|299148671|ref|ZP_07041733.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
3_1_23]
gi|383114572|ref|ZP_09935334.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
gi|298513432|gb|EFI37319.1| alpha-N-acetylglucosaminidase family protein [Bacteroides sp.
3_1_23]
gi|313693722|gb|EFS30557.1| hypothetical protein BSGG_1257 [Bacteroides sp. D2]
Length = 711
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 167/526 (31%), Positives = 266/526 (50%), Gaps = 59/526 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ ++ Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ GA+I + M+ + AVW++Q W+ + ++ S+ G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPKAVWIIQA--------WQANPREEMIASLNQG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E +P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N M+ GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
I +AW L HTVYN D + + S++ G + +
Sbjct: 473 EIMEAWRALEHTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+Y+ +A LF + ++ +N + YDL+D+ RQ+ A N L
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 466
I ++Y D + +++FL+L+ D LL+ F + WL +A+ L EE+++ YE
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKRLYE 630
Query: 467 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 512
WNA IT+W D+ L DY ++ WSGLL+D Y R +F+
Sbjct: 631 WNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLYYQRWKAFFE 676
>gi|261199246|ref|XP_002626024.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
gi|239594232|gb|EEQ76813.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis SLH14081]
Length = 752
Score = 268 bits (686), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 162/530 (30%), Positives = 278/530 (52%), Gaps = 42/530 (7%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
NL G WGG P W D Q LQKKIL R+ ELGM P+LPAF G VP A+ V P A+
Sbjct: 201 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 260
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + +P++ T L DP + + ++FI + ++ YG +H Y D F+E
Sbjct: 261 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 318
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFS-YDPFWRPPQMKALLNS-VPLGK 176
P PE++ + ++S D +A W+MQGWLF + +W +++A L++
Sbjct: 319 PSSGDPEFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 378
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDLFAE P+W +K F+G ++WC + F GN +YG + +I GP +A +++
Sbjct: 379 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAQA-MAQHPN 437
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY---GRSVP-AIQDAW 292
MVGVG + EG N +V+ L+ + + +D + + + + RRY GR+VP + +AW
Sbjct: 438 MVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHGRTVPNELYEAW 497
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
+L + YN T+ +V P + ++ + + +K +L E
Sbjct: 498 QLLRLSAYNNTN---------LVDAPLLPHALFAASPSIN-------AKMPMLFIEG--- 538
Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
L Y +++++A L I L ++Y+YD++D+TRQ L+ + ++ Y+
Sbjct: 539 ----LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKVKYK 592
Query: 413 LNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ-YEWNAR 470
VF + + L +++ +D +L+ ++ F L W+ +A+ A ++ + +E NAR
Sbjct: 593 GGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDDSEAADFFEHNAR 652
Query: 471 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
QIT+W E +L DY K W+GL+ YY PR ++ +Y+ ++ S
Sbjct: 653 NQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPRWRMFLEYLKDTPAS 698
>gi|449299394|gb|EMC95408.1| glycoside hydrolase family 89 protein [Baudoinia compniacensis UAMH
10762]
Length = 801
Score = 268 bits (686), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 162/553 (29%), Positives = 275/553 (49%), Gaps = 63/553 (11%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG LP SW+ Q L K+I+ R+ ELGM PVLP F G VP + +P+A
Sbjct: 226 NIQGSWGGDLPMSWISSQFTLGKQIVARMVELGMTPVLPCFPGFVPMQIGRYYPNAMYIN 285
Query: 62 LGNWFSVKSDPRWCCTY-LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W PR L+ DPL+ + ++FI +Q YG S IY D ++EN P
Sbjct: 286 GSQW---NGFPRQNTNVSFLEPFDPLYTTLQKSFISKQTAAYGNVSSIYTLDQYNENNPY 342
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVV 179
Y+ ++ A + +++ D +AVW++QGWLF S FW ++A L V +++
Sbjct: 343 SADTTYLRNISAGTIAALKAADPNAVWMLQGWLFFSSATFWTDAAIRAYLGGVNNTDMII 402
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLF+E +P W + +YG P+IWC LH++ GN+ +YG ++++ P++A + ++TMVG
Sbjct: 403 LDLFSETQPQWQRTNSYYGKPWIWCELHDYGGNMGLYGQVENVTINPIQALNNASSTMVG 462
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV---PAIQDAWNVLY 296
+G++MEG E N ++YD++ + A+ ++ + + + RY + P + AW+ +
Sbjct: 463 MGLTMEGQEGNEIMYDILLDQAWSSTPLNNSLYFHDWVTSRYHGAASLPPGLYTAWDTMR 522
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP- 355
TVYN T +T + V + ++ P++ + + HP
Sbjct: 523 QTVYNNTQISTIQ--SVTKSIWELTPNVTGLLN--------------------RTGHHPT 560
Query: 356 HLWYSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+ Y+TS ++ A + F + + L S Y +DL D+TRQ +A L+ + + A
Sbjct: 561 TIQYNTSTLVGAWKQFYGAAAQEPTLWDSPGYLFDLTDVTRQVMANAFYPLYTSFVSASN 620
Query: 413 LNDAHGVFQ------LSRRFLELVEDMDGLLACH--DGFLLGPWLESAK----------- 453
+ A+ + ++ + L+ +D +LA F L W+ A+
Sbjct: 621 -HSANATYSPGNATIYGQQMVSLLSALDSMLAASPIPYFHLSTWIAEARSWSAPTATLPN 679
Query: 454 ---QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIY 510
L + + YE+NAR QIT+W Q + DY +K W+GL+ YY PR ++
Sbjct: 680 NATNLTSSSQTASFYEYNARNQITLWGPTGQ-----ISDYASKQWAGLISSYYVPRWQLF 734
Query: 511 FKYMIESLESGDG 523
Y + + +G
Sbjct: 735 VNYTLNGTTASNG 747
>gi|212693694|ref|ZP_03301822.1| hypothetical protein BACDOR_03214 [Bacteroides dorei DSM 17855]
gi|265755881|ref|ZP_06090348.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
gi|212663753|gb|EEB24327.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides dorei DSM 17855]
gi|263233959|gb|EEZ19560.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_33FAA]
Length = 718
Score = 268 bits (685), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 177/538 (32%), Positives = 275/538 (51%), Gaps = 67/538 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPESWYIRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + ++ K YG+T Y D F E
Sbjct: 240 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYKELTKLYGKTG-FYAIDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W + +++ +
Sbjct: 296 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--------QDNPRTSMIEHLEA 343
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ G
Sbjct: 344 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALINGFY 403
Query: 228 EARTSENT--TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A+T + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 404 DAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q AW++L + +YN + + V A P +D YQ
Sbjct: 464 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 560 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGTRKEFKVGTWIEAARSAGQTQEEK 618
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+ L+
Sbjct: 619 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 676
>gi|237708859|ref|ZP_04539340.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
gi|345513372|ref|ZP_08792893.1| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
gi|423228941|ref|ZP_17215347.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
CL02T00C15]
gi|423242228|ref|ZP_17223337.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
CL03T12C01]
gi|423247755|ref|ZP_17228803.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
CL02T12C06]
gi|229457285|gb|EEO63006.1| glycoside hydrolase family 89 protein [Bacteroides sp. 9_1_42FAA]
gi|345456211|gb|EEO47557.2| glycoside hydrolase family 89 protein [Bacteroides dorei 5_1_36/D4]
gi|392631297|gb|EIY25272.1| hypothetical protein HMPREF1064_05009 [Bacteroides dorei
CL02T12C06]
gi|392635177|gb|EIY29082.1| hypothetical protein HMPREF1063_01167 [Bacteroides dorei
CL02T00C15]
gi|392639514|gb|EIY33330.1| hypothetical protein HMPREF1065_03960 [Bacteroides dorei
CL03T12C01]
Length = 717
Score = 268 bits (685), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 177/538 (32%), Positives = 275/538 (51%), Gaps = 67/538 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYIRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + ++ K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYKELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W + +++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--------QDNPRTSMIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ G
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALINGFY 402
Query: 228 EARTSENT--TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A+T + T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 403 DAKTDNHAGKTLCGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q AW++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDHLLGTRKEFKVGTWIEAARSAGQTQEEK 617
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+ L+
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 675
>gi|373460171|ref|ZP_09551927.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
gi|371956556|gb|EHO74342.1| hypothetical protein HMPREF9944_00191 [Prevotella maculosa OT 289]
Length = 742
Score = 268 bits (685), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 167/529 (31%), Positives = 258/529 (48%), Gaps = 52/529 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW QQ LQKKIL R++E GM PVLP F G +P + +T
Sbjct: 186 MNNLEGWGGPLPDSWYKQQETLQKKILQRMHEYGMEPVLPGFCGMMPHDAKEKL-GLNVT 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L TD F I + + + YG+ ++ Y+ D F E+
Sbjct: 245 DGGKWNGYTRPAN------LSPTDSQFNRIADLYYAELTRLYGKANY-YSMDPFHESNDD 297
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D+ +Y G+ + M+ + A W++QGW + +P RP ++ + N G L++L
Sbjct: 298 -DALDY-GKAGSVMLEAMKRINPKATWVIQGW--TENP--RPRMIQDMKN----GDLLIL 347
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTS 232
DLF+E +P +W K + +++CML NF N+ ++G +D + F + R+
Sbjct: 348 DLFSECRPMFGIPSVWKREKGYEQHDWLFCMLENFGANVGLHGRMDQLIHNFYSTKKRSP 407
Query: 233 ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
+ G+G +MEG E NPV+++LMSE+ ++ E + W+ Y RYGR I+ AW
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSELPWRPEIFKKEDWVRGYVKARYGRKDETIERAW 467
Query: 293 NVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
+L T+YNC G + + V P ++ + + K +NY
Sbjct: 468 LLLAETIYNCPAGNNQQGPHESVFCGRPGLNNFQVK-SWSKMRNY--------------- 511
Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
Y + A L + + +N + YDLID+ RQALA +L I
Sbjct: 512 --------YDPQATLEAARLMASVSSRYKGNNNFEYDLIDICRQALADQGRLQYLKTIAD 563
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 470
Y + ++RFL+++ D LL F LG W E+A+ L + ++ YEWNAR
Sbjct: 564 YNGFSRAAFAKDAKRFLDMILLQDRLLGTRKEFRLGHWTEAARSLGTTQAEKDLYEWNAR 623
Query: 471 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 519
QIT W + T + LRDY +K W G+L+D+Y R IY + + +E
Sbjct: 624 VQITTWGNRTCADNGGLRDYAHKEWQGILKDFYYKRWKIYMDALAKQME 672
>gi|410096483|ref|ZP_11291470.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
CL02T12C30]
gi|409226447|gb|EKN19356.1| hypothetical protein HMPREF1076_00648 [Parabacteroides goldsteinii
CL02T12C30]
Length = 718
Score = 268 bits (684), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 182/587 (31%), Positives = 294/587 (50%), Gaps = 75/587 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ+ LQ++I+ R+ E G+ PV P +SG VP +
Sbjct: 187 MNNLEGWGGPNPDSWYKQQITLQQRIVKRMREYGIEPVFPGYSGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L TDP F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 239 KLG--LNV-SDPGLWCGYHRPAFLQPTDPRFQEIASLYYKELNKLYGK-ANFYSMDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+ VD + + G AI M+ + AVW+ Q W + ++ ++
Sbjct: 295 GGSVAGVD----LDAAGKAIMQAMKKNNPKAVWVAQAW--------QANPRSQMIENLKA 342
Query: 175 GKLVVLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
G ++VLDLF+E +P W + F +I+CML N+ GN+ ++G + +
Sbjct: 343 GDMIVLDLFSESRPQWGDPESTWHRKDGFGQHDWIYCMLLNYGGNVGLHGKMAHVIDEYY 402
Query: 228 EARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVP 286
+A+ S T+ GVGM+MEG E NPV+++L++E+ ++ D W+ Y+V RYG++ P
Sbjct: 403 KAKESSFGKTLCGVGMTMEGSENNPVMFELLTELPWRPVHFDKNEWLKNYTVARYGKANP 462
Query: 287 AIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
+Q+AW +L +++YNC T + + + A P P ++S
Sbjct: 463 TVQEAWILLSNSIYNCPPENTQQGTHESIFCARPSDHPYLVSSW---------------- 506
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
SE S Y Y+ +VIRA + ++ ++ + +N + YDL+D+ RQA+A+ L
Sbjct: 507 --SEMSDY------YNPDDVIRAAAMMVSVADQFTGNNNFEYDLVDIVRQAIAE-KGRLV 557
Query: 405 LNIIEA-YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
++EA + D + RFL+L+ D LL F +G W+ + L E++
Sbjct: 558 EKVVEASFASGDKQLYNTAANRFLQLLLLQDELLGTRPEFKVGNWIARTRSLGNTPEEKD 617
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R +F Y E L+
Sbjct: 618 LYEWNARVQITTWGNRNAADKGGLRDYAHKEWNGILKDFYYMRWKTWFDYQNELLDGKKP 677
Query: 524 FRLKDWRRE--WIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
+ + E W KLT+ Y E GD + T + ++ + +
Sbjct: 678 TAIDFYALEEPWTKLTDS-------YSSEPEGDCISTVKRIFAEVFE 717
>gi|224537227|ref|ZP_03677766.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521150|gb|EEF90255.1| hypothetical protein BACCELL_02104 [Bacteroides cellulosilyticus
DSM 14838]
Length = 755
Score = 268 bits (684), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 182/589 (30%), Positives = 277/589 (47%), Gaps = 76/589 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP+S +D+ ++L KKIL R ELGM P+ FSG VP LQ +P AKI+
Sbjct: 195 MQNIQSYGGPLPKSVIDKHVILGKKILARQLELGMQPIQQGFSGYVPRELQAKYPQAKIS 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W T LD TDPLF E+G AF+E+Q K +G + +Y D F E+ PP
Sbjct: 255 MKRKWCGFDG------TAQLDPTDPLFHEMGLAFLEEQDKLFG-SYGVYAADPFHESAPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+D+PEY++ +G I+ Q+ D+ A+W+MQ W D ++ +VP L++L
Sbjct: 308 IDTPEYLTGVGQTIHKLFQTFDAGALWVMQAWSMRED----------IVKAVPKESLLIL 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL + + +G P I LHNF G I M+G L +A + + + G
Sbjct: 358 DLNGSK----TAANGGWGYPVIAGNLHNFGGRINMHGDLALLASNQYQKAKARYPNVCGS 413
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G+ ME IEQNPV Y+L EM + + ++AW+ Y+ RRYG A AW L Y
Sbjct: 414 GLFMEAIEQNPVYYELAFEMPNHADSIPLQAWLAAYAERRYGAKSAAAGKAWMYLLEGPY 473
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
G R IVA P++ G G P Y
Sbjct: 474 R--RGTNGTERSSIVA---ARPALNVKKSGPNAGLGIP--------------------YE 508
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
VIRA + ++L+ S YR+D++D+ RQ + + EA+ D
Sbjct: 509 PMLVIRAQSQLLKDADKLAFSKPYRFDIVDVQRQMMTNLGQLVHKKAAEAFASKDKAAFA 568
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
S RFLEL+ DMD LL + WL A+ + +E++ E +A + +T+W +
Sbjct: 569 LHSGRFLELLRDMDELLYTRSEYSFDRWLTEARSWGETKEEKDLMERDATSLVTIWGADG 628
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF---------------- 524
+ DY + W+GL+ YY PR ++ + L++G +
Sbjct: 629 DPR---IFDYSWREWAGLINGYYLPRWQKFYTMLQGHLDAGTDYQEEGLSLAYGREDFRA 685
Query: 525 -----RLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
RL +W ++ Q G+ PV ++GD L+ ++ L++KYL+
Sbjct: 686 NDFYNRLAEWELAYVD-----QTGKARTPV-THGDELVVTRRLFDKYLK 728
>gi|189465172|ref|ZP_03013957.1| hypothetical protein BACINT_01517 [Bacteroides intestinalis DSM
17393]
gi|189437446|gb|EDV06431.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides intestinalis DSM
17393]
Length = 723
Score = 268 bits (684), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 176/587 (29%), Positives = 287/587 (48%), Gaps = 76/587 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 192 MNNLEGWGGPNPDSWYAQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 251
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+ L N F+ + L TD F EI + +Q K +G+ + Y+ D F E N
Sbjct: 252 SDLWNGFTRPA--------FLQPTDARFAEIADLYYREQEKLFGKADY-YSMDPFHEAEN 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI + M+ + A W++QGW + +P RP +K + N G L
Sbjct: 303 AASVD----FDAAGKAIMTAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 350
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSIAF 224
++LDLF+E +P+W G+P IW CML NF GN+ ++G +D +
Sbjct: 351 LILDLFSECRPMW-------GIPSIWKRDKGYEQHDWLFCMLLNFGGNVGLHGRMDQLLN 403
Query: 225 GPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
+ + T + G+G++MEG E N ++++LM E+ ++ EK + W+ Y RYG
Sbjct: 404 NFYLTKNNPLATHLKGIGLTMEGSENNAMMFELMCELPWRPEKFTKEEWLKDYLFARYGV 463
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 341
I+ AW +L +T+YNC G + + + P ++ + + + K +NY P
Sbjct: 464 RDEKIEQAWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVT 522
Query: 342 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 401
E A L + ++ +N + YDL+D+ RQ+L+
Sbjct: 523 E-----------------------EAARLMLEVADKYRGNNNFEYDLVDIVRQSLSDKGR 559
Query: 402 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
++ I ++ D + SR+FL+++ D LL F +G W+E A++L E+
Sbjct: 560 IVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARKLGTTPEE 619
Query: 462 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 521
+ YEWNAR QIT W + + LRDY +K W+G+LRD+Y R A Y++ + + L+
Sbjct: 620 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 679
Query: 522 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
+L ++ + W +N Y S G+ + ++ + K +
Sbjct: 680 PEVKL-----DYYAMEEPWTLAKNPYGSTSEGNCVDVAKEAFEKVFE 721
>gi|239615395|gb|EEQ92382.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ER-3]
Length = 829
Score = 268 bits (684), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 162/531 (30%), Positives = 275/531 (51%), Gaps = 48/531 (9%)
Query: 3 NLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
NL G WGG P W D Q LQKKIL R+ ELGM P+LPAF G VP A+ V P A++
Sbjct: 223 NLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQVV 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W + +P++ T L DP + + ++FI + ++ YG +H Y D F+E P
Sbjct: 283 NASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMIPS 340
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNS-VPLGKLV 178
P+++ + ++S D +A W+MQGWLF + +W +++A L++ ++
Sbjct: 341 SGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRDML 400
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
+LDLFAE P+W +K F+G ++WC + F GN +YG + +I GP EA +++ MV
Sbjct: 401 ILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAEA-MAQHPNMV 459
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAWNV 294
GVG + EG N +V+ L+ + + +D + + + + RRY R+VP+ + +AW +
Sbjct: 460 GVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHERTVPSELYEAWQL 519
Query: 295 LYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSETSS 351
L + YN T+ A + A P ++ + + EG
Sbjct: 520 LRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG--------------------- 558
Query: 352 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 411
L Y +++++A L I L ++Y+YD++D+TRQ L+ + ++ Y
Sbjct: 559 -----LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKVKY 611
Query: 412 QLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ-YEWNA 469
+ VF + + L +++ +D +L+ ++ F L W+ +A+ A +E + +E NA
Sbjct: 612 KGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDESEAADFFEHNA 671
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
R QIT+W E +L DY K W+GL+ YY PR ++ +Y+ ++ S
Sbjct: 672 RNQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPRWRMFLEYLKDTPAS 718
>gi|423345423|ref|ZP_17323112.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
CL03T12C32]
gi|409223209|gb|EKN16146.1| hypothetical protein HMPREF1060_00784 [Parabacteroides merdae
CL03T12C32]
Length = 752
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 182/584 (31%), Positives = 270/584 (46%), Gaps = 69/584 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL K+I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 192 MQNLQSYGGPLPKSWIDKHIVLGKQIIDRELELGMQPIQQGFSGYVPRELKEKYPDAKI- 250
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+ P WC LD TD LF IGR F+E++ K YG +Y D F E+
Sbjct: 251 --------QLQPSWCGFTGAAQLDPTDSLFTVIGRDFLEEEKKLYG-AHGVYAADPFHES 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
PPVD+PEY+ ++G AI+ D +++W MQ W R P +KA VP L
Sbjct: 302 QPPVDTPEYLRAVGNAIHKLFNDFDPNSIWAMQAWSL------REPIVKA----VPKENL 351
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
++LDL +G P + LHNF G I ++G L +A +N +
Sbjct: 352 LILDLNGAKS---QQENACWGYPLVAGNLHNFGGRINLHGDLRLLASNQYVNAVKKNPNV 408
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G G+ ME IEQNPV YDL EM ++V+++ W+ +Y+ RRYG+ AW L
Sbjct: 409 CGSGLFMESIEQNPVYYDLAFEMPLHKDEVNIEEWLCRYADRRYGKPSENAHQAWLHLLE 468
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y G R I+A P++ G G P
Sbjct: 469 GPYR--PGTNGTERSSIIA---ARPAVNVKKSGPNAGLGIP------------------- 504
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
YS V++A L + L S+ YR+D++D+ RQ ++ + +A++ D
Sbjct: 505 -YSPLSVVQAEGLLLKDAARLEDSDPYRFDIVDIQRQLMSNLGQVIHKQAAKAFRKKDKE 563
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
S RFLE++ D D LL F WL A+ N E++ +E +A +T+W
Sbjct: 564 AFALHSNRFLEMLRDADELLRTRPEFNFDKWLTQARSWGDNSEEKDLFEKDATALVTVW- 622
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR----REW 533
+ L+ DY + W+GL+ YY R ++ + + L++G + KD RE
Sbjct: 623 --GADGDPLIFDYSWREWTGLIDGYYLKRWEKFYAMLQDHLDAGTNYSEKDLPQTHGRES 680
Query: 534 IKLTN------DWQ-----NGRNVYPVESNGDALITSQWLYNKY 566
+ + DW+ V + GD + T+ LY KY
Sbjct: 681 FRANDFYSTLGDWELQFVSTPDKVRTPITQGDEVETATRLYKKY 724
>gi|380694112|ref|ZP_09858971.1| alpha-N-acetylglucosaminidase [Bacteroides faecis MAJ27]
Length = 736
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 169/540 (31%), Positives = 267/540 (49%), Gaps = 73/540 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 199 MNNLEGWGGPNPDSWYRQQEALQKKIIARMRELGIEPVFPGYAGMVPRNIGE-------- 250
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ + Y+ D F E
Sbjct: 251 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 307
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I M+ + +AVW+MQ W+ +A++N++ G
Sbjct: 308 GNTEGVD----LAKAGTSIMGAMKKANPEAVWVMQA--------WQANPREAMVNTLDSG 355
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E P +W K F +++CML NF GN+ ++G ++ + G
Sbjct: 356 DLLVLDLYSEKLPQWGDPESMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMEQLVNGYYN 415
Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N T+ GVG + EGIE NP++++L+ E+ ++ E+ W+ Y RYG + P
Sbjct: 416 ACAHINGKTLRGVGATPEGIENNPMMFELLYELPWREERFSPDIWLQGYLKARYGDDLSP 475
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ +AW L HTVYN +NY + E++L
Sbjct: 476 EVTEAWRALEHTVYNAP-----------------------------KNYQGEGTVESLLC 506
Query: 347 S-------ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
+ TS++ + L+YS +A +L ++ + +N + YDL+D+ RQ+LA
Sbjct: 507 ARPGFHLDRTSTWGYAKLFYSPDSTAKAAQLLLSVADRYKGNNNFEYDLVDIVRQSLADK 566
Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
AN L I ++Y D + +++FL L+ D LL+ F + WL +A+ L E
Sbjct: 567 ANVLLEEISQSYDRKDKDSFRKQTQQFLGLILSQDSLLSTRKEFSVSSWLSAARSLGTTE 626
Query: 460 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 519
E++K YEWNA IT+W D+ L DY ++ WSGLL+D Y R +F+ + L+
Sbjct: 627 EEKKLYEWNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLYYQRWNTFFEQKQQELD 686
>gi|393786624|ref|ZP_10374756.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
CL02T12C05]
gi|392657859|gb|EIY51489.1| hypothetical protein HMPREF1068_01036 [Bacteroides nordii
CL02T12C05]
Length = 717
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 181/581 (31%), Positives = 284/581 (48%), Gaps = 72/581 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P W QQ+ LQKKIL R++E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPDHWYTQQVSLQKKILKRMHEYGIEPVLPGYCGMVPHNAK--------A 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V SDP C Y L D F EI + ++ K YG+ ++ Y+ D F E
Sbjct: 239 KLG--LNV-SDPGVWCGYRRPAFLQPDDSRFEEISSLYYKELEKLYGKANY-YSMDPFHE 294
Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+D + ++G A+ M+ + AVW++Q W + P L+ ++ G
Sbjct: 295 GGS-IDGVN-LDAVGKAVMKAMKKANPKAVWVIQAWQANPRP--------ELIRNLETGD 344
Query: 177 LVVLDLFAEVKPIWST------SKQFYGVP-YIWCMLHNFAGNIEMYGILDSIA--FGPV 227
L++LDL +E +P W K YG +++CML N+ N+ ++G +D++ +
Sbjct: 345 LLILDLTSECRPQWGDPESEWYRKDGYGKHNWVYCMLLNYGANVGLHGKMDNVIDNYYLA 404
Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
+ T+ GVGM+ EGIE NPV+Y+L+ E+ ++ E+ + W+ Y RYG+ P
Sbjct: 405 KENLRARATLKGVGMTPEGIENNPVMYELLMELPWRPERFTKEDWLKGYVKARYGKDEPV 464
Query: 288 IQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
+Q AW L +++YN T + + V A P +D YQ
Sbjct: 465 LQLAWGKLANSIYNAPKELTQQGTHESVFCARPGLDV---------YQ------------ 503
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
SS+ +Y EVI A L ++ + + + YDL+D+ RQA+A+ +
Sbjct: 504 ---VSSWSEMKDYYDPQEVIEAARLMVSVADRYRGNTNFEYDLVDVVRQAIAEKGRLMQK 560
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
+ AY+ D S++FL L+ D LL F LG W+ SA+ L E++ Y
Sbjct: 561 AVTTAYRAGDKELFAMASQKFLNLILLQDQLLGTRTEFRLGRWINSARALGVTPEEKALY 620
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
EWN R Q+T W + E LRDY +K W+GLL+D+Y R +YF + +E G+
Sbjct: 621 EWNTRVQVTTWGNRNAAERGGLRDYAHKEWNGLLKDFYYMRWKLYFDNLACKME-GETIP 679
Query: 526 LKDW---RREWIKLTNDWQNGRNVYPVESNGDALITSQWLY 563
D+ W+K TN +Q E GD + T++ ++
Sbjct: 680 EIDFYAVEEAWVKRTNPYQ-------AEPEGDCVDTAKLIF 713
>gi|423293377|ref|ZP_17271504.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
CL03T12C18]
gi|392678320|gb|EIY71728.1| hypothetical protein HMPREF1070_00169 [Bacteroides ovatus
CL03T12C18]
Length = 727
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 177/577 (30%), Positives = 281/577 (48%), Gaps = 60/577 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + + T G Y +P + K ++ Y + L
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 532
L DY ++ W+GL+ DYY R +Y I+++ G D +L+D +E
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAVGEGVEVDQKQLEDELKEIE 688
Query: 533 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 566
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|298480128|ref|ZP_06998327.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|336404356|ref|ZP_08585054.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
gi|298273937|gb|EFI15499.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|335943684|gb|EGN05523.1| hypothetical protein HMPREF0127_02367 [Bacteroides sp. 1_1_30]
Length = 727
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 177/574 (30%), Positives = 275/574 (47%), Gaps = 54/574 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
L DY ++ W+GL+ DYY R +Y I+++ G K E ++
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAVGEGVEVDQKQLEDELKEIE 688
Query: 538 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 566
W N + V S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDVRKDVHSTTDGLLSFSTFLFSKY 722
>gi|409042145|gb|EKM51629.1| glycoside hydrolase family 89 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 749
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 161/536 (30%), Positives = 278/536 (51%), Gaps = 43/536 (8%)
Query: 6 GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW 65
WGG LP W+ Q LQK+IL R+ ELGM P+LPAF+G VP+ + +P+A I W
Sbjct: 201 AWGGLLPMQWISDQQALQKQILPRMLELGMTPILPAFTGFVPSNMSAHYPNASIIDGSAW 260
Query: 66 FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPE 125
S L+ DPL+ ++ ++FI +Q + YG +H Y D ++EN P +
Sbjct: 261 SGFPS--TLTNVSFLEPFDPLYPQMQQSFITKQQEAYGNITHFYTLDQYNENNPFSGNDS 318
Query: 126 YISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLG-KLVVLDLF 183
Y+SS+ + + +++ D +A W+MQGWLF S + FW +++A L +++LDL+
Sbjct: 319 YLSSVSTSTIASLRAADPEATWVMQGWLFFSSETFWTNDRIEAYLGGAQGNDSMLILDLY 378
Query: 184 AEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMS 243
+E +P W+ + ++G ++WC LH++ GN+ + G L +I GP+ A S ++MVG+G++
Sbjct: 379 SEAQPQWNRTDSYFGKQWVWCELHDYGGNMGLEGNLAAITEGPIAALNSNGSSMVGMGLT 438
Query: 244 MEGIE-QNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPA-IQDAWNVLYHTVY 300
MEG+E N +VYD++ + A+ ++V W+ +++ RRY +++P +Q AW +L T+Y
Sbjct: 439 MEGMEIGNEIVYDILLDQAWSSTPLNVSDWVAKWAARRYLVKTLPTELQQAWTILSTTIY 498
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-HLWY 359
N D + I+ +++VT HP + Y
Sbjct: 499 NNQDPNSQATIKSILELEPATTGLVNVTG-----------------------HHPTEIPY 535
Query: 360 ST-SEVIRALELFI-ASGNELSASNT--YRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
T + ++ AL+LF+ AS ++ S + D+++L+RQ + +L+ ++I + +
Sbjct: 536 DTNTTILHALQLFVNASKSQPSLKQVPEFAVDILELSRQLMVNRFIDLYTDLINTWNSSS 595
Query: 416 --AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY-EWNARTQ 472
A V L L+ D+D LL ++ +L W+ AKQ A Y E+ AR Q
Sbjct: 596 STAQNVTTAGVPLLSLISDLDVLLYTNENYLFSTWIADAKQWAHGNVSYAAYLEYQARNQ 655
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 528
T+W + DY +K +GL+ +YY R + + E SG + +
Sbjct: 656 QTLWGPQGN-----INDYASKQTAGLVGEYYATRWQTFVVMLAEQKTSGQPYNATE 706
>gi|423299508|ref|ZP_17277533.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
CL09T03C10]
gi|408473317|gb|EKJ91839.1| hypothetical protein HMPREF1057_00674 [Bacteroides finegoldii
CL09T03C10]
Length = 727
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 169/572 (29%), Positives = 284/572 (49%), Gaps = 49/572 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
MSN+ W PLP+ WL QQ LQK+IL R E M PVLPAF+G+VPA L+ ++P+AKI
Sbjct: 194 MSNVDYWQSPLPKDWLVQQEELQKRILAREREFNMTPVLPAFAGHVPAELKKIYPNAKIY 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W R ++ +D D L+ I + F+E+Q K YG T HIY D F+E P
Sbjct: 254 TMSQWGGFDKQYR---SHFIDPMDSLYSVIQKRFLEEQTKIYG-TDHIYGIDPFNEVDSP 309
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSY-DPFWRPPQMKALLNSVPLGKLVV 179
+ E++S++ IY + S D +A WL W+F Y W P ++K+ L +VP KL++
Sbjct: 310 DWNEEFLSNVSRKIYESLHSVDPEAQWLQMTWMFYYAKDKWTPSRIKSFLRAVPQDKLIL 369
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + + IW ++ +YG PYIWC L NF GN + G L+ + + G
Sbjct: 370 LDYYCDHTEIWKKTEGYYGQPYIWCYLGNFGGNTMLAGNLNDTYEKIHQVLAEGGQNIHG 429
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G+++E + NP++Y+ + E A++ + WI ++ R G++ PA+ AW L+ +
Sbjct: 430 LGVTLEAFDVNPMMYEFVFEQAWEGAQ-PTDEWIATWAKCRGGQTCPAVLKAWKELHEKI 488
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + PS+ +AVL + + W
Sbjct: 489 Y-------------------IAPSLCG---------------QAVLMNARPQLEGVQGWN 514
Query: 360 STSEV-IRALELFIASGNELSASNT----YRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
+ E +L++ G+ L + + +D++++ RQ L ++ Y+
Sbjct: 515 TFPEYKYDNKDLWVIWGSLLQVGSIDKPGHAFDVVNVGRQVLGNLFSDYRAQFTACYKRK 574
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 474
D G + ++R L+ D+D LLAC F +G W++ A+ EE++K YE NAR +T
Sbjct: 575 DVKGAQEWAQRMDALLLDVDRLLACSPLFSMGKWIQDARDCGTTEEEKKYYEENARCILT 634
Query: 475 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 534
+W ++ + L DY N+ W+GL + +Y R + ++ ++++ F K + ++
Sbjct: 635 IW----GQKDTQLNDYANRSWAGLTKGFYRERWKRFTDSVLTAMQANRSFDAKKFHKDIT 690
Query: 535 KLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+W + V S DA+ + L+NKY
Sbjct: 691 DFEYEWTLQHETFSVSSGEDAVKVANELWNKY 722
>gi|299144715|ref|ZP_07037783.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298515206|gb|EFI39087.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 727
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 177/577 (30%), Positives = 282/577 (48%), Gaps = 60/577 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRLYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-IDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIVSDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + + T G Y +P + K ++ Y + L
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILHDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 532
L DY ++ W+GL+RDYY R +Y I+++ G D +L+D +E
Sbjct: 635 ------GGSLNDYASRSWAGLIRDYYAKRWEVYINTFIKAVGEGVEVDQKQLEDELKEIE 688
Query: 533 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 566
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|224537466|ref|ZP_03678005.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520904|gb|EEF90009.1| hypothetical protein BACCELL_02345 [Bacteroides cellulosilyticus
DSM 14838]
Length = 721
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 175/587 (29%), Positives = 287/587 (48%), Gaps = 76/587 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 192 MNNLEGWGGPNPDSWYVQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 251
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+ L N F+ + L TD F EI + ++Q K +G+ + Y+ D F E N
Sbjct: 252 SDLWNGFTRPA--------FLQPTDVRFAEIADLYYQEQEKLFGKVDY-YSMDPFHEAEN 302
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI + M+ + A W++QGW + +P RP +K + N G L
Sbjct: 303 AASVD----FDAAGKAIMAAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 350
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSIAF 224
++LDLF+E +P+W G+P IW CML NF GN+ ++G +D +
Sbjct: 351 LILDLFSECRPMW-------GIPSIWKRDKGYEQHNWLFCMLLNFGGNVGLHGRMDQLLD 403
Query: 225 GPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
+ + + G+G++MEG E NP++++LM E+ ++ EK + W+ Y RYG
Sbjct: 404 NFYLTKNNPLAVHLKGIGLTMEGAENNPMMFELMCELPWRPEKFTKEEWLKDYLFARYGV 463
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 341
I+ AW +L +T+YNC G + + + P ++ + + + K +NY P
Sbjct: 464 RDEKIEKAWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVT 522
Query: 342 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 401
E A L + ++ +N + YDL+D+ RQ+L+
Sbjct: 523 E-----------------------EAARLMVEVADKYRGNNNFEYDLVDIVRQSLSDKGR 559
Query: 402 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
++ I ++ D + SR+FL+++ D LL F +G W+E A+ L E+
Sbjct: 560 IVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARNLGTTPEE 619
Query: 462 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 521
+ YEWNAR QIT W + + LRDY +K W+G+LRD+Y R A Y++ + + L+
Sbjct: 620 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 679
Query: 522 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
+L ++ + W +N Y G + ++ ++ K ++
Sbjct: 680 PEVKL-----DYYAMEEPWTLAKNPYSSVPEGSCVDVAKEVFEKAMR 721
>gi|423226735|ref|ZP_17213200.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392627008|gb|EIY21049.1| hypothetical protein HMPREF1062_05386 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 718
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 175/587 (29%), Positives = 287/587 (48%), Gaps = 76/587 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 189 MNNLEGWGGPNPDSWYVQQEALQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 248
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+ L N F+ + L TD F EI + ++Q K +G+ + Y+ D F E N
Sbjct: 249 SDLWNGFTRPA--------FLQPTDVRFAEIADLYYQEQEKLFGKVDY-YSMDPFHEAEN 299
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI + M+ + A W++QGW + +P RP +K + N G L
Sbjct: 300 AASVD----FDAAGKAIMAAMKKVNPKATWVVQGW--TENP--RPEMIKNMQN----GDL 347
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSIAF 224
++LDLF+E +P+W G+P IW CML NF GN+ ++G +D +
Sbjct: 348 LILDLFSECRPMW-------GIPSIWKRDKGYEQHNWLFCMLLNFGGNVGLHGRMDQLLD 400
Query: 225 GPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
+ + + G+G++MEG E NP++++LM E+ ++ EK + W+ Y RYG
Sbjct: 401 NFYLTKNNPLAVHLKGIGLTMEGAENNPMMFELMCELPWRPEKFTKEEWLKDYLFARYGV 460
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 341
I+ AW +L +T+YNC G + + + P ++ + + + K +NY P
Sbjct: 461 RDEKIEKAWTLLANTIYNCPFGNNQQGPHESIFCGRPSLN-NFQASSWSKMKNYYDPTVT 519
Query: 342 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 401
E A L + ++ +N + YDL+D+ RQ+L+
Sbjct: 520 E-----------------------EAARLMVEVADKYRGNNNFEYDLVDIVRQSLSDKGR 556
Query: 402 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
++ I ++ D + SR+FL+++ D LL F +G W+E A+ L E+
Sbjct: 557 IVYNRTIADFKSFDKRSFARDSRKFLDILLLQDKLLGTRSEFRVGRWIEQARNLGTTPEE 616
Query: 462 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 521
+ YEWNAR QIT W + + LRDY +K W+G+LRD+Y R A Y++ + + L+
Sbjct: 617 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 676
Query: 522 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
+L ++ + W +N Y G + ++ ++ K ++
Sbjct: 677 PEVKL-----DYYAMEEPWTLAKNPYSSVPEGSCVDVAKEVFEKAMR 718
>gi|327356744|gb|EGE85601.1| alpha-N-acetylglucosaminidase [Ajellomyces dermatitidis ATCC 18188]
Length = 752
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 161/533 (30%), Positives = 275/533 (51%), Gaps = 48/533 (9%)
Query: 1 MSNLHG-WGGP-LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
NL G WGG P W D Q LQKKIL R+ ELGM P+LPAF G VP A+ V P A+
Sbjct: 201 FGNLQGSWGGGNTPFKWYDAQFELQKKILARMSELGMTPILPAFPGYVPRAVTRVLPDAQ 260
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ W + +P++ T L DP + + ++FI + ++ YG +H Y D F+E
Sbjct: 261 VVNASQWAEI--NPKYTNTTFLQPFDPHTVRLQKSFISKSIEAYGNVTHFYTLDQFNEMI 318
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNS-VPLGK 176
P P+++ + ++S D +A W+MQGWLF + +W +++A L++
Sbjct: 319 PSSGDPKFLRKVSETTMEAIKSVDPEATWVMQGWLFYIFADYWTTERIEAYLSAGKKFRD 378
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDLFAE P+W +K F+G ++WC + F GN +YG + +I GP EA +++
Sbjct: 379 MLILDLFAESFPVWKKTKGFFGKAFVWCQVQEFGGNHGLYGHVANITEGPAEA-MAQHPN 437
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG---RSVPA-IQDAW 292
MVGVG + EG N +V+ L+ + + +D + + + + RRY R+VP+ + +AW
Sbjct: 438 MVGVGNAGEGQSGNEIVFSLLLDQGWSKTALDPEQYFHDWVTRRYSSHERTVPSELYEAW 497
Query: 293 NVLYHTVYNCTD--GATDKNRDVIVAFPDVDPSI-ISVTEGKYQNYGKPVSKEAVLKSET 349
+L + YN T+ A + A P ++ + + EG
Sbjct: 498 QLLRLSAYNNTNLVDAPLLPHALFAASPSINAKMPMLFIEG------------------- 538
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
L Y +++++A L I L ++Y+YD++D+TRQ L+ + ++
Sbjct: 539 -------LLYDPADMLKAWGLMIKGA--LFGDSSYQYDIVDVTRQVLSDAFTLVLQDLKV 589
Query: 410 AYQLNDAHGVFQ-LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ-YEW 467
Y+ VF + + L +++ +D +L+ ++ F L W+ +A+ A ++ + +E
Sbjct: 590 KYKGGAPASVFMPIGDKLLIILKALDAVLSMNENFWLSSWISAARASAGDDSEAADFFEH 649
Query: 468 NARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
NAR QIT+W E +L DY K W+GL+ YY PR ++ +Y+ ++ S
Sbjct: 650 NARNQITIW----GSEVGVLDDYAQKQWAGLVSGYYTPRWRMFLEYLKDTPAS 698
>gi|383115203|ref|ZP_09935961.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
gi|313695380|gb|EFS32215.1| hypothetical protein BSGG_2915 [Bacteroides sp. D2]
Length = 727
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 178/574 (31%), Positives = 274/574 (47%), Gaps = 54/574 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ D K WI + R G +++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAWNLNADDNK-WIECLADRHVGCVSQSVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFFDVKVEFDRMVEAKDYQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
L DY ++ W+GL+ DYY R +Y I++ E G K E ++
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAAEKGVEVDQKQLEDELKEIE 688
Query: 538 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 566
W N + V S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDVRKDIHSATDGLLSFSTFLFSKY 722
>gi|160884062|ref|ZP_02065065.1| hypothetical protein BACOVA_02038 [Bacteroides ovatus ATCC 8483]
gi|423291477|ref|ZP_17270325.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
CL02T12C04]
gi|156110404|gb|EDO12149.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392663477|gb|EIY57027.1| hypothetical protein HMPREF1069_05368 [Bacteroides ovatus
CL02T12C04]
Length = 727
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 179/577 (31%), Positives = 280/577 (48%), Gaps = 60/577 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGERLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDDKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P K ++ Y + L
Sbjct: 490 Y--------------VQVPR--------TLGTLPGY-RPALNRNSEKRTSNVYSNVEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKVEFDRMVEAKDHQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE-- 532
L DY ++ W+GL+ DYY R +Y I+++ G D +L+D +E
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYINTFIKAVGEGVEVDQKQLEDELKEIE 688
Query: 533 --WIKLTNDWQNGRNVYPVESNGDALIT-SQWLYNKY 566
W+ T+ ++V+ S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDTRKDVH---STTDGLLSFSTFLFSKY 722
>gi|427385205|ref|ZP_18881710.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
12058]
gi|425727373|gb|EKU90233.1| hypothetical protein HMPREF9447_02743 [Bacteroides oleiciplenus YIT
12058]
Length = 719
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 174/586 (29%), Positives = 288/586 (49%), Gaps = 76/586 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ VLQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 190 MNNLEGWGGPNPDSWYAQQEVLQKKILKRMREYGIKPVFPGYSGMVPHDADEKLGLNLTK 249
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--N 117
+ L N F+ + L TD F EI + +Q K +G+ + Y+ D F E N
Sbjct: 250 SDLWNGFTRPA--------FLQPTDTRFAEIANLYYREQEKLFGKADY-YSMDPFHEAEN 300
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G AI M+ + A W++QGW + +P RP ++ + N G L
Sbjct: 301 AASVD----FDAAGKAIMQAMKKVNPKATWVVQGW--TENP--RPEMIENMKN----GDL 348
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIW-------------CMLHNFAGNIEMYGILDSIAF 224
++LDLF+E +P+W G+P IW CML NF GN+ ++G +D +
Sbjct: 349 LILDLFSECRPMW-------GIPSIWKRDKGYEQHDWLFCMLLNFGGNVGLHGRMDQLLD 401
Query: 225 GPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
+ + + T + G+G++MEG E NPV+++LM E+ ++ EK + W+ Y RYG
Sbjct: 402 NFYQTKDNPLATHLKGIGLTMEGSENNPVMFELMCELPWRPEKFTKEEWLKDYLFARYGV 461
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSK 341
I+ AW +L +++YNC G + + + P ++ + + + K +NY P
Sbjct: 462 KDEKIEKAWTLLANSIYNCPFGNNQQGPHESIFCGRPSMN-NFQASSWSKMKNYYDPTVT 520
Query: 342 EAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 401
E A L + ++ +N + YDL+D+ RQ+L+
Sbjct: 521 E-----------------------EAARLMLEVADKYRGNNNFEYDLVDIVRQSLSDKGR 557
Query: 402 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
++ I ++ D + S++FL+++ D LL F +G W+E A+ L E+
Sbjct: 558 IVYNQTIADFKSFDKRSFARDSQKFLDILLLQDRLLGTRSEFRVGRWIEQARNLGTTPEE 617
Query: 462 EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 521
+ YEWNAR QIT W + + LRDY +K W+G+LRD+Y R A Y++ + + L+
Sbjct: 618 KDLYEWNARVQITTWGNRVCADDGGLRDYAHKEWNGILRDFYYKRWAAYWQTLQDQLDGK 677
Query: 522 DGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
+L ++ + W + Y G+++ ++ ++ K +
Sbjct: 678 PEVKL-----DYYAMEEPWTLAKTPYDSTPEGNSVDVAKEVFEKAM 718
>gi|423214204|ref|ZP_17200732.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693149|gb|EIY86384.1| hypothetical protein HMPREF1074_02264 [Bacteroides xylanisolvens
CL03T12C04]
Length = 727
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 175/574 (30%), Positives = 274/574 (47%), Gaps = 54/574 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 195 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C + L+ D LF +I + F+++Q K +G T H+Y D F+E PP
Sbjct: 255 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQKKLFG-TDHVYGLDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 311 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 371 LDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGERLENALINGGGNLKG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G ++DAW L++ +
Sbjct: 431 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDDKWIECLADRHVGCVSQPVRDAWKRLFNDI 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + + T G Y +P + K ++ Y + L
Sbjct: 490 Y----------------------AQVPRTLGTLPGY-RPALNKNSEKRTSNVYSNIEL-- 524
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 525 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQ 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 576 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 634
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
L DY ++ W+GL+ DYY R +Y I++ E G K E ++
Sbjct: 635 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAAEKGVEVDQKQLEDELKEIE 688
Query: 538 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 566
W N + V S D L++ S +L++KY
Sbjct: 689 EGWVNATDRKDVRKDIHSATDGLLSFSTFLFSKY 722
>gi|423312588|ref|ZP_17290525.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
CL09T03C04]
gi|392688276|gb|EIY81565.1| hypothetical protein HMPREF1058_01137 [Bacteroides vulgatus
CL09T03C04]
Length = 717
Score = 265 bits (678), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 178/532 (33%), Positives = 271/532 (50%), Gaps = 67/532 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ F
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 402
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q AW++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQAWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 514
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYL 669
>gi|320106778|ref|YP_004182368.1| alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
gi|319925299|gb|ADV82374.1| Alpha-N-acetylglucosaminidase [Terriglobus saanensis SP1PR4]
Length = 754
Score = 265 bits (678), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 181/551 (32%), Positives = 266/551 (48%), Gaps = 63/551 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLP+ +++Q+ VLQ+KIL R+ LGM PV PAFSG VP + + P A+
Sbjct: 220 MGNVNNIDGPLPEHFIEQKRVLQRKILDRMRSLGMRPVAPAFSGFVPQGFKRLHPKAETF 279
Query: 61 QLGNWF--SVKSDPRWCCTYLLDATDP-LFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
L W K+ PR T++L + L+ IG+ FIE+ EYG + Y DTF+E
Sbjct: 280 TL-LWLPEEFKTIPRSTRTFILHPGEQDLYRLIGKKFIEEYKAEYGEVQY-YLADTFNEL 337
Query: 118 TPPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
PV E + G +Y G+ +GD + W+MQGWLF YD FW + ALL +P
Sbjct: 338 AVPVREEHRFEDLERFGRTVYEGILAGDPNGTWVMQGWLFVYDVAFWNSESVAALLRGIP 397
Query: 174 LGKLVVLDLFAEVKPI---------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
+++++D ++ P W T K F+G +I M H F GN + G L +A
Sbjct: 398 NDRMLIIDYANDLAPAVKGKYAPGQWKTQKAFFGKQWINGMAHTFGGNNNVKGNLKLMAS 457
Query: 225 GPVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
P TS E +VG GM EGIE N VVY+LM++ +Q E +D+K WI Y RYG
Sbjct: 458 EPASVLTSPERGNLVGWGMCPEGIETNEVVYELMTDAGWQREAIDLKQWIPAYCRSRYGA 517
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
P + +AW +L + Y+ T + P + P+ SV G P +
Sbjct: 518 CPPVMLEAWTLLMQSAYSAHIWMTHQAWQTE---PSLAPAAASVDAG-------PTFR-- 565
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
RA+ LF++ EL YR DLI+L QA ++
Sbjct: 566 ----------------------RAVALFLSCAPELGQKELYRNDLIELVVQAAGGSVDQT 603
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
F ++A Q + + + L + MD LL L W+++A+ A+++++
Sbjct: 604 FSLAVQAGQSHQNEVATEYAAHALGWMGRMDALLNLRPDRRLETWMQAARSYAKSDDEAA 663
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
Y+ NAR IT W L DY ++ WSGL RDYY R +F SL +G
Sbjct: 664 YYDENARRLITTW------GWPELSDYASRAWSGLTRDYYASRWEAWFA----SLHAGRP 713
Query: 524 FRLKDWRREWI 534
F L W++ W+
Sbjct: 714 FSLDIWQQTWL 724
>gi|295085509|emb|CBK67032.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 716
Score = 265 bits (676), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 176/574 (30%), Positives = 274/574 (47%), Gaps = 54/574 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 184 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 243
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W R C +L + D LF +I + F+++Q K +G T HIY D F+E PP
Sbjct: 244 HLGKWAGFADAYR--CNFL-NPNDALFAKIQKLFLDEQKKLFG-TDHIYGLDPFNEVDPP 299
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEY+ + + +Y+ + + D A W+ W+F +D W +MKALL VP K+++
Sbjct: 300 SFEPEYLRKIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMIL 359
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD E +W ++ F+ PYIWC L NF GN + G + A + + G
Sbjct: 360 LDYHCENVELWKRTEHFHNQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKG 419
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G ++EG++ Y+ + E A+ + VD WI + R G +++DAW L++ +
Sbjct: 420 IGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDI 478
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y V P T G Y +P + K ++ Y + L
Sbjct: 479 Y--------------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL-- 513
Query: 360 STSEVIRALELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
EV R L NE + + +R DLI + RQ L Y ++ + + D
Sbjct: 514 --LEVWRKL-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDHQ 564
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 565 ALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW- 623
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
L DY ++ W+GL+ DYY R +Y I+++ K E ++
Sbjct: 624 ------GGSLNDYASRSWAGLISDYYAKRWEVYIDTFIKAVGEDVEVDQKQLEDELKEIE 677
Query: 538 NDWQNGRNVYPVE----SNGDALIT-SQWLYNKY 566
W N + V S D L++ S +L++KY
Sbjct: 678 EGWVNATDRKDVRKDVHSTTDGLLSFSTFLFSKY 711
>gi|393784337|ref|ZP_10372502.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
CL02T12C01]
gi|392666113|gb|EIY59630.1| hypothetical protein HMPREF1071_03370 [Bacteroides salyersiae
CL02T12C01]
Length = 728
Score = 264 bits (675), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 161/522 (30%), Positives = 254/522 (48%), Gaps = 54/522 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ ++ +Q LQ+K+L R+ ELGM PV F G VP L+ +P A+I
Sbjct: 193 MGNLEGFGGPVSPEFIARQTDLQQKMLKRMRELGMKPVFQGFYGMVPNVLKKKYPDARIK 252
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W + + LD TDPLF + + E+Q K +G + + D F E
Sbjct: 253 EQGTWQTYQRPA------FLDPTDPLFDRVAAIYYEEQKKLFG-DAEFFGGDPFHEGG-- 303
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
++ I M+ + AVW++QGW ++P +K L++ + G+ ++L
Sbjct: 304 TSEGIHVKLAAQKILQAMRKVNPKAVWVLQGW--QHNP------VKDLMDGLNPGETIIL 355
Query: 181 DLFAEVKPIWS--TSKQFYGVP------YIWCMLHNFAGNIEMYGILDSIAFGPVEARTS 232
DL A +P W T+ F+ +IWC L NF G ++G + S A G V A+
Sbjct: 356 DLMACERPQWGGVTTSMFHKPEGHQDHRWIWCALPNFGGKTGLHGKMSSYASGAVFAKEH 415
Query: 233 E-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDA 291
+ G+G + EGI PVVYD++ +MA++ + + + W+ Y+ RYG A
Sbjct: 416 PMGRNICGIGTAPEGIGTVPVVYDMVYDMAWRTDSIQIPQWLTNYTYYRYGMEDTNCDKA 475
Query: 292 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 351
W +L TVY C + I A P A S+
Sbjct: 476 WKILSETVYECHNELGGPVESYICARP------------------------ADTIDHVST 511
Query: 352 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 411
+ + ++Y +++ A E S N + +TY YDL+D+TRQ L+ YA L ++EA+
Sbjct: 512 WGNARIFYEPVKMVEAWEFLYQSRNRFNHCDTYEYDLVDVTRQVLSDYAKYLHKEMVEAF 571
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
+ +G + S FL++++D D LL+ F+LG WL A+ E+++++ NA+
Sbjct: 572 HQKNENGFMKYSTEFLDVIKDEDRLLSTRKEFMLGTWLTEAENAGCTPEEKRRFVTNAKR 631
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY 513
+T W D + L DY NK WSGLL D+Y PR Y Y
Sbjct: 632 LVTTWTDRDSD----LHDYANKEWSGLLSDFYLPRWEAYVTY 669
>gi|237721435|ref|ZP_04551916.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|293370838|ref|ZP_06617383.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|229449231|gb|EEO55022.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|292634054|gb|EFF52598.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 711
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 164/517 (31%), Positives = 263/517 (50%), Gaps = 59/517 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVARMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L D F + E+ K YG+ ++ Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSTEDEHFDSFAAMYYEELEKLYGKANY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ GA+I + M+ + +AVW++Q W+ + ++ S+ G
Sbjct: 305 GNTEGVD----LAKTGASIMAAMKKANPEAVWIIQA--------WQANPREEMIASLNQG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
L+VLDL++E +P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DLLVLDLYSEKRPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSENTTMV-GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N M+ GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYGR V P
Sbjct: 413 ACAHTNGKMLHGVGATPEGIENNPVMFELLYELPWREERFSSDEWLQTYLKARYGREVSP 472
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
I +AW L +TVYN D + + S++ G + +
Sbjct: 473 EIMEAWRALEYTVYNA---PKDYQGEGTIE------SLLCARPGFHLD------------ 511
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+Y+ +A LF + ++ +N + YDL+D+ RQ+ A N L
Sbjct: 512 -RTSTWGYSKLFYAPDSTAKAARLFTSVADQYKGNNNFEYDLVDIVRQSNADKGNVLLEE 570
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 466
I ++Y D + +++FL+L+ D LL+ F + WL +A+ L EE+++ YE
Sbjct: 571 ISQSYDRKDKEDFRKQTQQFLDLILAQDRLLSTRKEFSVSSWLNAARSLGTTEEEKRLYE 630
Query: 467 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 503
WNA IT+W D+ L DY ++ WSGLL+D Y
Sbjct: 631 WNASALITVWGDSIAANQGGLHDYSHREWSGLLKDLY 667
>gi|393788556|ref|ZP_10376683.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
CL02T12C05]
gi|392654236|gb|EIY47884.1| hypothetical protein HMPREF1068_02963 [Bacteroides nordii
CL02T12C05]
Length = 732
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 180/552 (32%), Positives = 270/552 (48%), Gaps = 63/552 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ ++ +Q LQ+K+L R+ ELGM PV F G VP AL+ FP A+I
Sbjct: 196 MGNLEGFGGPVTPEFIARQTDLQQKMLKRMRELGMKPVFQGFYGMVPNALKEKFPDARIK 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + + LD TDPLF ++ + E+Q +G + + D F E
Sbjct: 256 DQGIWGTYQRPA------FLDPTDPLFDKLAAIYYEEQKNLFGE-AQFFGGDPFHEG--- 305
Query: 121 VDSPEYISSLGAA--IYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+ E I+ AA I M+ + AVW++QGW ++P +K L+ V G+ +
Sbjct: 306 -GTSEGINVKLAAQKILQAMRKVNPQAVWVLQGW--QHNP------VKELMEGVKPGETI 356
Query: 179 VLDLFAEVKPIWSTSK-QFYGVP-------YIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+LDL A +P W K + P +IWC L NF G ++G + S A GPV A+
Sbjct: 357 ILDLMACERPQWGGVKTSMFHKPEGHWNHQWIWCALPNFGGKTGLHGKMSSYASGPVFAK 416
Query: 231 TSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQ 289
+ G+G + EGI PVVYD++ +MA++ + + + W++ Y+ RYG
Sbjct: 417 HHPMGKNICGIGTAPEGIGTIPVVYDMVYDMAWRTDSIHIPQWLDNYTYYRYGTEDNNCN 476
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSET 349
AW +L T+Y C + I A P S T +G V
Sbjct: 477 RAWKLLSETIYECHNELGGPVESYICARP-------SDTIQHVSTWGNAV---------- 519
Query: 350 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 409
++Y +V++A +L S + S+TY YDL D+TRQ L+ YA L ++
Sbjct: 520 -------MFYDPMKVVKAWDLLYQSRKRFNHSDTYEYDLTDVTRQVLSDYAKYLHERMVL 572
Query: 410 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 469
A+Q D + S +FL +++D D LL+ F+LG WL A++ E+++++ NA
Sbjct: 573 AFQKKDKERFMEYSGKFLNIIKDEDRLLSTRKEFMLGTWLAEAEKAGGTPEEKRRFVTNA 632
Query: 470 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
+ IT W D + L DY NK WSGLL D+Y PR Y Y SL G D+
Sbjct: 633 KRLITTWTDTDSD----LHDYANKEWSGLLIDFYLPRWEAYVTYKT-SLLYGKKLPYPDY 687
Query: 530 RR---EWIKLTN 538
+ EW+ LTN
Sbjct: 688 SKMEQEWV-LTN 698
>gi|294775488|ref|ZP_06741000.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
gi|294450633|gb|EFG19121.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides vulgatus PC510]
Length = 712
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 178/538 (33%), Positives = 272/538 (50%), Gaps = 67/538 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 182 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 233
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 234 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 289
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 290 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 337
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ F
Sbjct: 338 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 397
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 398 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 457
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 458 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 498
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 499 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 553
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 554 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 612
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+ L+
Sbjct: 613 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 670
>gi|150004413|ref|YP_001299157.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149932837|gb|ABR39535.1| glycoside hydrolase family 89 [Bacteroides vulgatus ATCC 8482]
Length = 717
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 178/538 (33%), Positives = 272/538 (50%), Gaps = 67/538 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ F
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHEWVYCMLLNFGGNIGLHGKMDALIDGFY 402
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+ L+
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 675
>gi|423217398|ref|ZP_17203894.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
CL03T12C61]
gi|392628557|gb|EIY22583.1| hypothetical protein HMPREF1061_00667 [Bacteroides caccae
CL03T12C61]
Length = 707
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 167/533 (31%), Positives = 264/533 (49%), Gaps = 59/533 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVSRMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L + D F + E+ K YG+ + Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I M+ + +AVW++Q W + P A+++ + G
Sbjct: 305 GNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAWQANPRP--------AMIDVLNAG 352
Query: 176 KLVVLDLFAEVKPIWSTS-------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
++VLDL++E +P W S K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DMLVLDLYSEKRPQWGDSDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A N M GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYG + P
Sbjct: 413 ACAHVNGKRMRGVGATPEGIENNPVMFELLYELPWRAERFSPDVWLQGYLKARYGGELSP 472
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ +AW L HTVYN P P EG ++ ++
Sbjct: 473 EVMEAWRALEHTVYNA---------------PKNSPG-----EGTLESL--LCARPGFHL 510
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
TS++ + L+YS +A +L ++ + N + YDL+D+ RQ+ A N L
Sbjct: 511 DRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGDNNFEYDLVDIVRQSNADKGNALLDE 570
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 466
I ++Y D + +++FLEL+ D LL+ F + WL +A+ L + ++K YE
Sbjct: 571 ISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEFSVSSWLAAARSLGNTDAEKKLYE 630
Query: 467 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 519
WNA IT+W D+ L DY ++ WSGLL+D Y R +F+ + LE
Sbjct: 631 WNASALITVWGDSIASNQGGLHDYSHREWSGLLKDLYYLRWKTFFEQKQQELE 683
>gi|423212382|ref|ZP_17198911.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694828|gb|EIY88054.1| hypothetical protein HMPREF1074_00443 [Bacteroides xylanisolvens
CL03T12C04]
Length = 705
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 169/527 (32%), Positives = 261/527 (49%), Gaps = 61/527 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ VLQKKI+ R+ ELG+ PV P ++G VP + +I
Sbjct: 196 MNNLEGWGGPNPDSWYRQQEVLQKKIVARMRELGIEPVFPGYAGMVPRNIGEKL-GYQIA 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W S PR L D F + E+ K YG+ ++ Y+ D F E NT
Sbjct: 255 DPGKWCSF---PR---PAFLSTEDEHFESFAAMYYEELEKLYGKANY-YSMDPFHEGGNT 307
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD ++ GA+I + M+ + AVW++Q W+ + +++S+ G ++
Sbjct: 308 EGVD----LAKTGASIMAAMKKANPKAVWVIQA--------WQANPREEMISSLNQGDML 355
Query: 179 VLDLFAEVKPIWS-------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
VLDL++E P W K F +++CML NF N+ ++G +D + G +A
Sbjct: 356 VLDLYSERLPQWGDPDSKWYREKGFGKHDWLYCMLLNFGANVGLHGRMDLLVNGYYDACA 415
Query: 232 SEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-PAIQ 289
N T+ GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYG+ V P +
Sbjct: 416 HANGKTLRGVGATPEGIENNPVMFELLYELPWREERFSPDEWLQGYLKARYGKDVSPEVM 475
Query: 290 DAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVS----KEAVL 345
+AW L HTVYN RD YQ G S +
Sbjct: 476 EAWRALEHTVYNAP-------RD-------------------YQGEGTVESLLCARPGFH 509
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
TS++ + L+YS +A L + + SN + YDL+D+ RQ+ A N L
Sbjct: 510 LDRTSTWGYAKLFYSPDSTAKAARLLTSVAKQYEGSNNFEYDLVDIVRQSNADKGNVLLE 569
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
+I ++Y D + +++FL+L+ D LL+ F + WL++A+ L + ++K Y
Sbjct: 570 DISQSYDRKDKENFRKQTQQFLDLIVSQDSLLSTRKEFSVSTWLDAARSLGTTDAEKKLY 629
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 512
EWNA IT+W D+ L DY ++ WSG+L+D Y R +F+
Sbjct: 630 EWNASALITVWGDSIASNQGGLHDYSHREWSGILKDLYYQRWKAFFE 676
>gi|345519733|ref|ZP_08799147.1| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
gi|345457107|gb|EET15964.2| glycoside hydrolase family 89 [Bacteroides sp. 4_3_47FAA]
Length = 717
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 178/538 (33%), Positives = 272/538 (50%), Gaps = 67/538 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 187 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 238
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 239 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 294
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 295 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 342
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ F
Sbjct: 343 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALIDGFY 402
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 403 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 462
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 463 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 503
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 504 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 558
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 559 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 617
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+ L+
Sbjct: 618 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 675
>gi|319643377|ref|ZP_07998003.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
gi|317385006|gb|EFV65959.1| glycoside hydrolase family 89 [Bacteroides sp. 3_1_40A]
Length = 718
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 178/538 (33%), Positives = 272/538 (50%), Gaps = 67/538 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P+SW +Q LQKKI+ R+ E G+ PVLP + G VP +
Sbjct: 188 MNNLEGWGGPNPESWYTRQEKLQKKIVKRMREYGIEPVLPGYCGMVPHNAKE-------- 239
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
+LG +V +DP + C+Y L D F EI + + K YG+T Y D F E
Sbjct: 240 KLG--LNV-ADPGFWCSYHRPAFLQPEDERFEEISALYYRELTKLYGKTG-FYAIDPFHE 295
Query: 117 --NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
+T V+ + + G AI M+ + DAVW+ Q W +P R P ++ +
Sbjct: 296 GGSTQGVN----LDAAGKAIMKAMKKTNPDAVWVAQAW--QDNP--RTP----MIEHLEA 343
Query: 175 GKLVVLDLFAEVKPIWS------TSKQFYGV-PYIWCMLHNFAGNIEMYGILDSI--AFG 225
G L+VLDL +E +P W K YG +++CML NF GNI ++G +D++ F
Sbjct: 344 GDLLVLDLHSECRPQWGDPASEWCRKGGYGQHGWVYCMLLNFGGNIGLHGKMDALIDGFY 403
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+A T+ GVGM+ EGIE NPV+Y+L+ E+ ++ + W+ Y RYG
Sbjct: 404 DAKADVHAGRTLRGVGMTPEGIENNPVMYELVMELPWREHRFTRDEWLKGYVYARYGVED 463
Query: 286 PAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+Q W++L + +YN + + V A P +D YQ
Sbjct: 464 EALQQVWDLLGNGIYNSPKEKIQQGTHESVFCARPGLDV---------YQ---------- 504
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
SS+ +Y+ +VI A L ++ ++ +N + +DL+D+ RQALA+ +
Sbjct: 505 -----VSSWSEMKEYYNPQDVIEAARLMVSVADKYQGNNNFEFDLVDVLRQALAEKGRLM 559
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ A++ D VF+L S+ FL L+ D LL F +G W+E+A+ Q +E++
Sbjct: 560 QKVVTAAFRAGDKQ-VFELASQHFLHLILLQDQLLGTRKEFKVGTWIEAARSAGQTQEEK 618
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
YEWNAR QIT W + + LRDY +K W+G+L+D+Y R YF Y+ L+
Sbjct: 619 ALYEWNARVQITTWGNRVAADQGGLRDYAHKEWNGILKDFYFMRWKAYFDYLACVLDG 676
>gi|153807690|ref|ZP_01960358.1| hypothetical protein BACCAC_01972 [Bacteroides caccae ATCC 43185]
gi|149130052|gb|EDM21264.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
Length = 707
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 167/540 (30%), Positives = 267/540 (49%), Gaps = 73/540 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP P SW QQ LQKKI+ R+ ELG+ PV P ++G VP +
Sbjct: 196 MNNLEGWGGPNPDSWYQQQEALQKKIVSRMRELGIEPVFPGYAGMVPRNIGE-------- 247
Query: 61 QLGNWFSVKSDPRWCC---TYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE- 116
+LG + + +WC L + D F + E+ K YG+ + Y+ D F E
Sbjct: 248 KLG--YQIADPGKWCGFPRPAFLSSEDEHFDSFAAMYYEELEKLYGKAKY-YSMDPFHEG 304
Query: 117 -NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
NT VD ++ G +I M+ + +AVW++Q W + P A+++ + G
Sbjct: 305 GNTEGVD----LAKAGTSIMKAMKKANPEAVWVIQAWQANPRP--------AMVDVLNAG 352
Query: 176 KLVVLDLFAEVKP-------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE 228
++VLDL++E P +W K F +++CML NF GN+ ++G ++ + G +
Sbjct: 353 DMLVLDLYSERLPQWGDPDSMWYREKGFGKHDWLYCMLLNFGGNVGLHGRMNQLVNGYYD 412
Query: 229 ARTSEN-TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-P 286
A T N T+ GVG + EGIE NPV+++L+ E+ ++ E+ W+ Y RYG + P
Sbjct: 413 ACTHANGKTLRGVGTTPEGIENNPVMFELLYELPWRAERFSPDTWLQGYLKARYGGELSP 472
Query: 287 AIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ +AW L HTVYN +NY + E++L
Sbjct: 473 EVMEAWRALEHTVYNAP-----------------------------KNYQGEGTVESLLC 503
Query: 347 S-------ETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
+ TS++ + L+YS +A +L ++ + +N + YDL+D+ RQ+ A
Sbjct: 504 ARPGFHLDRTSTWGYSKLFYSPDSTSKAADLMLSVAEQYKGNNNFEYDLVDIVRQSNADK 563
Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
N L I ++Y D + +++FLEL+ D LL+ F + WL +A+ L +
Sbjct: 564 GNALLDEISQSYDRKDKENFRKQTQQFLELILSQDSLLSTRKEFSVSSWLTAARSLGNTD 623
Query: 460 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 519
++K YEWNA IT+W D+ L DY ++ WSGLL+D Y R +F+ + LE
Sbjct: 624 AEKKLYEWNASALITVWGDSIASNQGGLHDYSHREWSGLLKDLYYLRWKTFFEQKQQELE 683
>gi|358391826|gb|EHK41230.1| glycoside hydrolase family 89 protein [Trichoderma atroviride IMI
206040]
Length = 751
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 166/542 (30%), Positives = 285/542 (52%), Gaps = 41/542 (7%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G W G +P +W+D Q LQ +IL R+ ELG+ P+LPAF G VP + VFP ++
Sbjct: 204 NIQGSWNGNMPGNWVDDQFALQLQILDRMKELGITPILPAFPGFVPRNISRVFPGISLST 263
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + D TY ++ DP F ++ + FI +Q + YG + + D F+EN P
Sbjct: 264 SPLWENFAEDLS-ADTY-VNPFDPHFTQLQKLFIGKQQELYGNVTKFWTLDQFNENQPLS 321
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPL-GKLVV 179
Y+ ++ ++ ++S DA+W+MQ WLFS D FW ++A L + +++
Sbjct: 322 SDLGYLRNVSQNTWTALKSASPDAIWVMQAWLFSADSSFWTNDAIEAFLGGITEDSDMLL 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P W + FYG P+IWC LH++ GN+ +YG ++++ ++A ++++VG
Sbjct: 382 LDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGLYGQIENVTINAMQA-VRNSSSLVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVPAIQDAWNVLYHT 298
G++MEG E N ++YDL+ + A+ + +D + + + + RYG +V ++ W +L T
Sbjct: 441 FGLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHDWVSARYGTENVKSLYTGWELLRPT 500
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
V+N T+ + I+ ++ P+I + G+ +G + +YD +
Sbjct: 501 VFNNTNLTVNAVPKSIL---ELTPNINGLL-GRVGRHGTTI-----------NYDPAVMV 545
Query: 359 YSTSEVIRA-LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN-DA 416
+ +E+ +A LE GN Y+YDL+D TRQ L + L+ +++ AY + +A
Sbjct: 546 DAWTELFKAGLEDVKLFGNP-----AYQYDLVDWTRQVLVNSFDGLYKDLVTAYNSSANA 600
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + L++ +D +LA ++ F L W+ +A+ A N E+NAR Q+T+W
Sbjct: 601 AEIRSRGSKLTALLKTLDAVLATNENFQLATWIAAAR--ASNPSNTSFLEYNARNQVTLW 658
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGF--RLKDWRR 531
Q E DY +K W+GL+ DYY R + Y+ + S F +L+ W
Sbjct: 659 GPTGQIE-----DYASKQWAGLVGDYYLGRWQQFIDYLATTKHSSYNQTAFYHKLQAWEI 713
Query: 532 EW 533
+W
Sbjct: 714 QW 715
>gi|358378969|gb|EHK16650.1| glycoside hydrolase family 89 protein [Trichoderma virens Gv29-8]
Length = 748
Score = 262 bits (670), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 167/545 (30%), Positives = 289/545 (53%), Gaps = 47/545 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+ G WGG +P+SW+D Q LQ KIL R+ ELG+ P+LPAF G VP + VFP ++
Sbjct: 203 NIQGSWGGSMPRSWVDSQFDLQLKILDRMEELGITPILPAFPGFVPRNISRVFPDISLST 262
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + ++ ++ DP F ++ + FI +Q + YG ++ + D F+EN P
Sbjct: 263 SPIWSNFGTEL--SADIYINPFDPRFAQLQKLFISKQQELYGNVTNFWTLDQFNENQPLS 320
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGK-LVV 179
Y+ ++ +S +++ D +AVW+MQ WLFS D FW ++++ L +P+ +++
Sbjct: 321 GDLGYLQNVSHNTWSALKAADPEAVWVMQAWLFSSDSAFWTNDRIESFLGGIPVNSDMLL 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE P W + FYG P+IWC LH++ GN+ +YG ++++ ++A + ++VG
Sbjct: 381 LDLFAESAPQWLRTNSFYGKPWIWCELHDYGGNMGLYGQIENVTINSMDA-VRNSGSLVG 439
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-RSVPAIQDAWNVLYHT 298
G++MEG E N ++YDL+ + A+ + +D + + + + RYG ++V ++ W +L T
Sbjct: 440 FGLTMEGQEGNEIMYDLLLDQAWSPKPIDTETYFHDWVSTRYGTKNVKSLYTGWELLRPT 499
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
V+N T+ + + I+ ++ PS + G+ ++G ++
Sbjct: 500 VFNNTNLTMNAVQKSIL---ELVPSTTGLL-GRVGHHGTTIT------------------ 537
Query: 359 YSTSEVIRA-LELFIASGNE--LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
Y+ + ++ A ELF A + L + Y+YDL+D TRQ L L+ +++ AY
Sbjct: 538 YNPAVMVEAWTELFKAGLQDIKLFTNPAYQYDLVDWTRQVLVNSFEGLYKDLVAAYNSAA 597
Query: 416 AHGVFQLSR--RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
+ V + SR + + L+ +D +LA ++ F L PW+ A+ A + E+NAR QI
Sbjct: 598 SSSVIK-SRGAKLIALLRTLDAVLATNEHFQLTPWINEAR--ASSPSTADFLEYNARNQI 654
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGF--RLKD 528
T+W E DY +K W+GL+ YY R + Y+ + S F +L
Sbjct: 655 TLWGPQGNIE-----DYASKQWAGLVGTYYVERWQQFIDYLATTKPSNYNQTAFHQKLLA 709
Query: 529 WRREW 533
W +W
Sbjct: 710 WETQW 714
>gi|322702923|gb|EFY94542.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
ARSEF 23]
Length = 589
Score = 261 bits (668), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 171/540 (31%), Positives = 271/540 (50%), Gaps = 56/540 (10%)
Query: 7 WGG--PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGN 64
WGG L W+D Q LQKKI+ R+ ELG+ PVLPAF G VP A V P A T+
Sbjct: 28 WGGVGNLSSGWIDAQFELQKKIVARMVELGITPVLPAFPGFVPPAFSRVQPDANTTKAPR 87
Query: 65 WFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSP 124
W + D T+ L D + + +AFI +Q++ +G ++IY D F+E P + P
Sbjct: 88 WTGLP-DTNTRDTF-LSPLDTSYARLQQAFISKQIEAFGNVTNIYTLDQFNEMPPTSNEP 145
Query: 125 EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG--KLVVLDL 182
Y+S + Y + + + AVWL+QGWLF W ++ A L P G ++VLDL
Sbjct: 146 SYLSQVSTYTYKALTAANPAAVWLLQGWLFLNSGLWTEERVTAYLGG-PEGHNSMLVLDL 204
Query: 183 FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTMVGVG 241
++E +P W +K ++G P+IWC LH+F GN+ MYG + I ++A RTS ++ G G
Sbjct: 205 YSESRPQWQRTKGYFGRPWIWCQLHDFGGNMGMYGQISDITVQSMDALRTSP--SLSGFG 262
Query: 242 MSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPAIQDAWNVLYHTV 299
M+ EG E N VVY ++ + A+ +D + Y VRRY ++ AW++L +
Sbjct: 263 MTPEGYEGNEVVYQMLFDQAWTTTPIDTSGYFYGYVVRRYAGVSQTNSLFQAWDILRQNI 322
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP-HLW 358
Y+ +K+R V V G YQN + L + T ++ P ++
Sbjct: 323 YD------NKDRQVPC-----------VGVGIYQN----APSLSGLVNRTGNWPPPTKVY 361
Query: 359 YSTSEVIRALELFIASGNELSA---SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
Y + + +A L I + NE+ T++ D++D+TRQ ++ N ++ + ++ +
Sbjct: 362 YDPATLKKAHSLLIQAANEIPQLWDIPTFQLDVVDVTRQVMSNAFNTMYTDYVQTFNSQL 421
Query: 416 AHGVFQLSRR---------------FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+ +S R L+ + D+D +LA + F L WL++A+ A+
Sbjct: 422 SRQKSHISNRGGLQRRDDFATKGKQLLDFLTDLDRVLATNQHFRLDSWLDAAQYWAKQTG 481
Query: 461 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
+NAR+QIT W E+ L DY K WSGL R YY R +I+ + ++L S
Sbjct: 482 ANDLIAFNARSQITTWI----WESEALNDYAVKEWSGLTRSYYRGRWSIFVDGLNKALAS 537
>gi|329962235|ref|ZP_08300241.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
gi|328530343|gb|EGF57220.1| Alpha-N-acetylglucosaminidase [Bacteroides fluxus YIT 12057]
Length = 726
Score = 261 bits (667), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 165/517 (31%), Positives = 266/517 (51%), Gaps = 60/517 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGP SW + ++ LQK+IL R+ E G++PVLP +SG +P + ++
Sbjct: 188 MNNLEGWGGPNTDSWYEDRIALQKRILKRMREYGIHPVLPGYSGMLPHNAKEKL-GVNVS 246
Query: 61 QLGNWFSVKSDPRWCCTY----LLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE 116
G W C Y L TD F EI + E+ + YG+ + Y+ D F E
Sbjct: 247 DPGTW----------CGYNRPAFLQPTDTRFGEIAALYYEEMNRLYGK-ADFYSMDPFHE 295
Query: 117 NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ + G AI+ M+ ++VW++Q W + P+ + ++ +VP G
Sbjct: 296 GGKVAGVN--LDAAGQAIWQAMKKNSRNSVWVVQAWGAN-------PRAQ-MIKNVPRGD 345
Query: 177 LVVLDLFAEVKPIWSTSKQ-------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA 229
++VLDL++E +P W + F G +++CML N+ GN+ ++G + + +A
Sbjct: 346 MLVLDLYSESRPQWGEPESSWYRENGFDGHQWLYCMLLNYGGNVGLHGKMQHVIDAYYKA 405
Query: 230 -RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAI 288
R+S T+ GVGM+MEG E NPV+Y+L+ E+ ++ W+ Y RYG+ P +
Sbjct: 406 SRSSFGNTLKGVGMTMEGSENNPVMYELLCELPWRPSTFSKDEWLEGYIAARYGKCTPRL 465
Query: 289 QDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
++AW +L +++YNC +T + + + A P + + +A
Sbjct: 466 REAWVLLGNSIYNCPPRSTQQGTHESIFCARPSLK------------------AYQASSW 507
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
SE S Y P +VIRA LF+ + ++ + YDL+D+TRQA+A+ ++
Sbjct: 508 SEMSDYYRPQ------DVIRAAGLFLEEAGQFKGNDNFEYDLVDITRQAVAEKGRLIYKV 561
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 466
I +Y+ D + Q S RFLEL+ D LLA F +G W+E A+ L ++ E
Sbjct: 562 IQASYEAGDKPLLRQASDRFLELLLLQDRLLATRPEFKVGRWIEQARNLGHTPAEKDWLE 621
Query: 467 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 503
WNAR QIT W + T + LRDY +K W+GLL+D+Y
Sbjct: 622 WNARVQITTWGNRTASDRGGLRDYAHKEWNGLLKDFY 658
>gi|198277542|ref|ZP_03210073.1| hypothetical protein BACPLE_03764 [Bacteroides plebeius DSM 17135]
gi|198270040|gb|EDY94310.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides plebeius DSM
17135]
Length = 722
Score = 261 bits (666), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 173/577 (29%), Positives = 281/577 (48%), Gaps = 62/577 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPA-ALQNVFPSAKI 59
M+NL GWGGP P SW QQ LQKKIL R+ E G+ PV P +SG VP A + + +
Sbjct: 191 MNNLEGWGGPNPDSWYTQQEALQKKILKRMREYGIEPVFPGYSGMVPHDANKKLGLNVTE 250
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD--EN 117
L N F+ + L TD F EI + ++ K +G+ ++ Y+ D F E+
Sbjct: 251 PALWNGFTRPA--------FLLPTDSRFNEIASLYYKELEKLFGKANY-YSMDPFHELED 301
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
VD + G A+ M++ + A W++QGW + +P RP +K L N G +
Sbjct: 302 AGSVD----FDAAGKAVLKAMKNVNPKATWVIQGW--TENP--RPEMIKNLNN----GDI 349
Query: 178 VVLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART 231
++LDLF+E +P IW K + +++CM+ NF GN+ ++G +D + +
Sbjct: 350 LILDLFSECRPMWGIPSIWKREKGYEQHDWLFCMIENFGGNVGLHGRMDQLLNNFYLTKN 409
Query: 232 SE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+ + G+G++MEG E NPV+++LM E+ ++ EK + W+ Y RYG I
Sbjct: 410 NPLAAHLKGIGLTMEGSENNPVMFELMCELPWRPEKFTKEEWLKDYLFARYGVRDEKITQ 469
Query: 291 AWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
AW++L +YNC G + + + P ++ + + + K QNY P S EA
Sbjct: 470 AWSILADGIYNCPFGNNQQGPHESIFCGRPGLN-NFQASSWSKMQNYYDPTSTEA----- 523
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
A L + ++ +N + YDL+D+ RQ+L+ ++ I
Sbjct: 524 ------------------AARLMLEVADKYKGNNNFEYDLVDIVRQSLSDRGRIVYNQTI 565
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 468
++ D S+ FL ++ D LL F +G W+E A+ L E++ YEWN
Sbjct: 566 ADFKSFDKKSFATHSQEFLNILLAQDRLLGTRSEFRVGRWIEQARNLGTTPEEKDLYEWN 625
Query: 469 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 528
AR QIT W + LRDY +K W+GLL+D+Y R A Y++ + + L+ L
Sbjct: 626 ARVQITTWGNRVCANDGGLRDYAHKEWNGLLKDFYYKRWAAYWQTLQDVLDGKPMVEL-- 683
Query: 529 WRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 565
++ + W N Y + GD + ++ ++NK
Sbjct: 684 ---DYYAMEEPWTLAHNPYASQPEGDCVSVAKEVFNK 717
>gi|261880010|ref|ZP_06006437.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
gi|270333326|gb|EFA44112.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
Length = 719
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 172/570 (30%), Positives = 266/570 (46%), Gaps = 51/570 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP + +Q LQ+KIL R L M PVLPAFSG+VP ++ ++P + I
Sbjct: 195 MANIDKWKGPLPYHTVVEQRDLQQKILARERSLNMTPVLPAFSGHVPGQIKQLYPESNIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + SD C Y + DPLF +I R ++E+Q YG T HIY D F+E PP
Sbjct: 255 HLGRWAAF-SDQYRC--YFMSPQDPLFAKIQRMYLEEQRAIYG-TDHIYGIDPFNEVDPP 310
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPF-WRPPQMKALLNSVPLGKLVV 179
P+Y+ + IY + D A WL WLF + W P ++KAL+ V GK+V+
Sbjct: 311 SWDPDYLFQISKGIYQTLAHVDPKAEWLQMSWLFYHKKKKWTPERVKALITGVETGKMVL 370
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F + IW + +FYG PYIWC L NF GN + G + + T + G
Sbjct: 371 LDYFCDRNEIWKMTDKFYGQPYIWCYLGNFGGNTTVAGNVKACGAKLDSTLTLGGKNLQG 430
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG+++EG + Y+ + + + + WI+ + G + P+ + AW +LYH V
Sbjct: 431 VGLTLEGFDVCQFPYEYILDKVWSGNSSE-NQWIDALADSHVGYASPSFRKAWQLLYHDV 489
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
+ + G+ P P + S+ ++ + H+ Y
Sbjct: 490 FVQSAGSNG-------ILPCYRPELNSL-----------------------NWHYTHVDY 519
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN---IIEAYQLNDA 416
++I A +L + S + DLI RQ L NE + AY D
Sbjct: 520 DRQKLIEAWKLMQHDAD--SKRTAAQLDLIHYGRQVL---GNEFLTHKQLFDSAYAHCDL 574
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
G+ + ++ D+D L A H L W++ A+Q+A + YE NAR+ IT W
Sbjct: 575 AGMMAQAASMRHIMLDIDTLTAYHPRCTLAGWIDGARQMAPDSVCADYYEDNARSLITTW 634
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
L DY K W+GL+ DYY R YF + I ++ + F + + +E +
Sbjct: 635 -------GGKLNDYACKGWAGLMSDYYLTRWERYFAHAINAVRAHRKFDQQAYDKEIARF 687
Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
W + R++ VE++ + + + KY
Sbjct: 688 ELSWASHRDIPRVETHESLALYCKKIIQKY 717
>gi|399028591|ref|ZP_10729778.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
gi|398073682|gb|EJL64846.1| Alpha-N-acetylglucosaminidase (NAGLU) [Flavobacterium sp. CF136]
Length = 727
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 170/549 (30%), Positives = 270/549 (49%), Gaps = 49/549 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLPQ W++++ +QKKIL R+ LGM+PV+PAFSG VP A P +KI+
Sbjct: 209 MGNINSLEGPLPQEWINKKENVQKKILQRMRALGMHPVVPAFSGYVPKAFAEKHPGSKIS 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W S + TYLLDA DPLF EIG+ FIE K YG+ + Y D F+E TPP
Sbjct: 269 ELKSW----SGGGFESTYLLDANDPLFKEIGKRFIEIYTKLYGQ-ADFYLADAFNEITPP 323
Query: 121 VDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V E +S G I+ + DA W+MQGWLF + FW KA L+ VP +
Sbjct: 324 VSKEHKYEELSDYGKTIFETINEASPDATWVMQGWLFGDNKEFWTKEATKAFLSKVPNDR 383
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT- 235
+++ D + +W + FYG + + +HN+ G+ +YG L+ + N
Sbjct: 384 MMIQDYANDRHKVWEKQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKNELTHLLGNSNKG 443
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR--SVPAIQDAWN 293
+VG G+ EG+ N +VY+ + ++ + K V W+N+Y RYG+ S P Q AW
Sbjct: 444 NVVGYGVMPEGLNNNSIVYEYIYDLPWSQGKESVNDWLNKYLSARYGKNISTPVFQ-AWK 502
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
+L +VY+ T D A+ + +TE K G P K+
Sbjct: 503 LLIESVYSTKYWETRWWDDRAGAYLFFKRPTLKITEFK----GNPGDKQ----------- 547
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
++ +AL++ + ++ Y YDL+D++R + ++L + + AY+L
Sbjct: 548 ---------KLKQALDILKRESKSFNKNSLYFYDLLDMSRHYYSLCIDDLLIECVTAYEL 598
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
D +L ++ + D+D +L+ L WL+SA + E K Y NA+T I
Sbjct: 599 KDIKKADELFKKIEKQALDIDNMLSGQPLNSLNNWLKSASDYGSSPEVSKLYVKNAKTLI 658
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RL 526
T+W L DY ++ W G+ + +Y PR ++ + ES+ + F +
Sbjct: 659 TLWGGEGH-----LNDYASRSWRGMYKGFYWPRWKMFLQAQRESVVNNTSFDELKVRESI 713
Query: 527 KDWRREWIK 535
K W +W +
Sbjct: 714 KQWEIKWCQ 722
>gi|262406054|ref|ZP_06082604.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|294648118|ref|ZP_06725661.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|294806859|ref|ZP_06765684.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
gi|345510559|ref|ZP_08790126.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|229443271|gb|EEO49062.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D1]
gi|262356929|gb|EEZ06019.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_1_22]
gi|292636502|gb|EFF54977.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|294445888|gb|EFG14530.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides xylanisolvens SD
CC 1b]
Length = 718
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 167/568 (29%), Positives = 279/568 (49%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP A P +
Sbjct: 197 MGNLNKWDGPLSDTWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLTEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ LA A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYLAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVFAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVEFARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PV++
Sbjct: 683 IREWEEQWI--TSPWSNSTTPFDDPVKA 708
>gi|261880159|ref|ZP_06006586.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
gi|270333130|gb|EFA43916.1| alpha-N-acetylglucosaminidase [Prevotella bergensis DSM 17361]
Length = 772
Score = 259 bits (662), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 164/543 (30%), Positives = 259/543 (47%), Gaps = 65/543 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP +W QQ LQK+IL R ELGM+PVLP + G +P + +T
Sbjct: 193 MNNLEGWGGPLPDAWYAQQEALQKRILKREKELGMSPVLPGYCGMMPHDAKAKL-GLDVT 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L ATDP F I + + + YG+ + Y+ D F E+ P
Sbjct: 252 DGGTWNGYTRPAN------LSATDPKFDHIADLYYRELTRLYGKADY-YSMDPFHES--P 302
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D+ + G + + M+ + + W++QGW+ + P ++ ++P G +++L
Sbjct: 303 DDASVDYAEAGRKLLAAMKRANGKSNWVIQGWMENPRP--------QMIEALPEGDIIIL 354
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD------SIAFGPVE 228
DLF+E +P IW + + +++CML NF N+ ++G +D +A P
Sbjct: 355 DLFSECRPMFGAPSIWQRKEGYGRHNWLFCMLENFGANVGLHGRMDQLVHNFKLAASPST 414
Query: 229 ARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF--------QHEKVDVK-AWINQYSVR 279
+ + G+G +MEG E NP++++LMSE+ + + ++ D K W Y
Sbjct: 415 PYQNARKHLKGIGFTMEGSENNPIMFELMSELVWRANDLVSAERDRRDFKEGWTRNYVKA 474
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCTDGATDK--NRDVIVAFPDVDPSIISVTEGKYQNYGK 337
RYG P IQ+AW +L ++YNC G + + + P +D + + K +NY
Sbjct: 475 RYGIDNPKIQEAWQLLIGSIYNCPVGNNQQGPHESIFNGRPSLDNFQVK-SWSKMRNY-- 531
Query: 338 PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALA 397
Y + +RA +L + + +N + YDL+D+ RQA+
Sbjct: 532 ---------------------YDPNVTLRAAQLMTSVADRYRGNNNFEYDLVDIVRQAMD 570
Query: 398 KYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQ 457
A +L I Y+ D S RFL ++ D LL F LG +E A+ L+
Sbjct: 571 DQARLQYLRTIADYKGFDRTAFSADSARFLNMLLLQDKLLGTRQEFRLGTRIEQARSLST 630
Query: 458 NEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 517
E++ YEWNAR QIT W + T LRDY +K W GLLRD+Y R Y + +
Sbjct: 631 TLEEKNLYEWNARVQITTWGNRTCANEGGLRDYAHKEWQGLLRDFYFMRWHTYLDALSKQ 690
Query: 518 LES 520
+ +
Sbjct: 691 MTA 693
>gi|410097657|ref|ZP_11292638.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
CL02T12C30]
gi|409223747|gb|EKN16682.1| hypothetical protein HMPREF1076_01816 [Parabacteroides goldsteinii
CL02T12C30]
Length = 740
Score = 259 bits (662), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 173/585 (29%), Positives = 280/585 (47%), Gaps = 68/585 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP+SW+D + L K+++ R ELGM P+ FSG VP + FP AKI
Sbjct: 195 MPNIESFGGPLPKSWIDSHIALGKQVVNRQLELGMTPIQQGFSGAVPRKMMEKFPEAKIQ 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ +W+ + C LD DPLF E+G+ F+E++ K YG T +Y D F E+ PP
Sbjct: 255 KQPDWYGFEG----ICQ--LDPLDPLFTELGKTFLEEEQKLYG-TYGLYAADPFHESKPP 307
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD+PEY++++G++I+ M++ D DA+W+MQ W F D + + VP L+VL
Sbjct: 308 VDTPEYLNAVGSSIHKLMKTFDPDALWVMQAWSFRKD----------IASVVPKHDLLVL 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
L + F ++ LHNF G + ++G L ++ + +VG
Sbjct: 358 SLNGAL----GGEDHFCNHDFVVGNLHNFGGRVNLHGDLPLVSSNQFMKAKQKTPNVVGS 413
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G+ ME I QNPV Y+L EM + V ++ W+N+Y+ RRYG A AW +L Y
Sbjct: 414 GLFMESIGQNPVFYELAFEMPVHQDSVKLEEWLNKYAERRYGAFSDAANKAWELLLAGPY 473
Query: 301 NC-TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
T+G ++ +I A P VD G + P ++++++E
Sbjct: 474 RAGTNGV--ESSSIICARPAVDVK----KSGPNAGFNIPYDPQSLIEAEVC--------- 518
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +L S YR+D++D+ RQ ++ E+ EA++ D
Sbjct: 519 -----------LLQDAEQLKGSGPYRFDIVDVQRQIMSNLGQEIHKKAAEAFKKKDKEAF 567
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
S RFLEL++D+D LL F WL A+ +E+ +E NA + +T+W
Sbjct: 568 ALHSGRFLELLKDVDILLRTRTEFNFDQWLTDARAWGTTDEERNLFEKNASSLVTIWGGQ 627
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR--------- 530
DY + W+GL+ YY R ++ + L++G +R +D +
Sbjct: 628 VDVRQF---DYSWREWTGLIEGYYLQRWKQFYDMLQGHLDNGTIYREEDAKMDLGRQAFR 684
Query: 531 -REWIKLTNDWQ------NGRNVYPVESNGDALITSQWLYNKYLQ 568
E+ DW+ G+ PV + GD + ++ + +KY Q
Sbjct: 685 ANEFYDSLADWELAFVDRPGKARTPV-TEGDEVAVARRMLDKYKQ 728
>gi|423214208|ref|ZP_17200736.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693153|gb|EIY86388.1| hypothetical protein HMPREF1074_02268 [Bacteroides xylanisolvens
CL03T12C04]
Length = 718
Score = 258 bits (660), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 167/568 (29%), Positives = 279/568 (49%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP A P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNAMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PVE+
Sbjct: 683 IREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|422873453|ref|ZP_16919938.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens F262]
gi|380305838|gb|EIA18115.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens F262]
Length = 2104
Score = 258 bits (660), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 173/565 (30%), Positives = 280/565 (49%), Gaps = 51/565 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GNLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY DA
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNRDAE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTT 819
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W + L+DY N+ WSGL DYY R + + L+ G K +W K
Sbjct: 820 WGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFK 875
Query: 536 LTNDWQNGRN----VYPVESNGDAL 556
+ DW N ++ +YP E++ + L
Sbjct: 876 MEYDWVNKKSDTDKLYPTEASNENL 900
>gi|156046298|ref|XP_001589681.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980]
gi|154693798|gb|EDN93536.1| hypothetical protein SS1G_09403 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 795
Score = 258 bits (659), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 176/558 (31%), Positives = 288/558 (51%), Gaps = 73/558 (13%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG LP SW+++Q +LQKKI+ R+ ELG+ PVLPAF+G VP+AL+ + P+A I
Sbjct: 207 FGNIQGSWGGTLPLSWIEEQHLLQKKIVKRMVELGITPVLPAFTGFVPSALRRIAPNANI 266
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G+W ++ T+L TDPLF + F+ Q + YG +HIY D ++EN P
Sbjct: 267 INGGDWGNIFPVEYSNDTFLY-PTDPLFTTLQHKFLSFQSEYYGNVTHIYTLDQYNENNP 325
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-L 177
Y+ ++ Y +QS D AVW++QGWLF S FW +++A + VP + +
Sbjct: 326 ASGDLSYLRNVSRGTYESLQSFDPCAVWMLQGWLFYSLSSFWTQDRIEAYIGGVPKNESM 385
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT 236
++LDLF+E P W + +YG P+IWC L ++ G + +YG + +I +EA R SEN
Sbjct: 386 LILDLFSESFPQWERTHYYYGKPWIWCQLRDYGGTLGLYGQIYNITNSLIEAFRESEN-- 443
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDV----KAWI-NQYSVRRYGRSVPA-IQD 290
MVGVG +MEG N ++Y+L+ + A+ + +D K+W+ +Y ++ + +P I +
Sbjct: 444 MVGVGNTMEGQGGNGLMYELLLDQAWNIDPIDTEDYFKSWVRKRYHIKGAKKRLPGEIYE 503
Query: 291 AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
AW++L T YN T+ + V + ++ P+I +N+G+ ++S
Sbjct: 504 AWDILRRTAYNNTNLTLADS--VPKSLHELQPNIT-------ENHGR--------LGQSS 546
Query: 351 SYDHPHLWYSTSEVIRALELFI---ASGNELSASNTYRYDLIDLTRQALAKYANELFLNI 407
+ D Y ++ RA EL S EL +++D++D+TRQ LA+ ++ +
Sbjct: 547 TIDL----YDPDDLFRAWELLYNASVSVPELWEDKGWKFDMVDITRQVLAERFKLEYVEL 602
Query: 408 IEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ------------- 454
IE Y+ + + ++E +D +L+ F L W+ +A
Sbjct: 603 IEKYK--KGADISCDGDILIGILESLDDVLSASPHFRLDTWVNAAVSSAPLPASTNCSST 660
Query: 455 ---------------LAQNEEQEKQ-YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGL 498
L N +Q + +NA QIT+W Q + DY +K W GL
Sbjct: 661 SINNSSLLFNSSTSILTSNLTPTQQFFAYNAINQITIWGPTGQ-----IDDYASKSWGGL 715
Query: 499 LRDYYGPRAAIYFKYMIE 516
+R YY PR ++ +Y+ E
Sbjct: 716 VRGYYLPRWKMFLEYIDE 733
>gi|346323119|gb|EGX92717.1| alpha-N-acetylglucosaminidase, putative [Cordyceps militaris CM01]
Length = 742
Score = 258 bits (658), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 161/527 (30%), Positives = 269/527 (51%), Gaps = 44/527 (8%)
Query: 10 PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVK 69
PL SW+DQQ LQK+I+ R+ +LG+ P+LPAF G VP A + P A + + W +
Sbjct: 195 PLSLSWIDQQFALQKRIVARMVQLGITPILPAFPGFVPDAFARLRPGADLVRAPAWGGLP 254
Query: 70 SDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS 129
+D L D + E+ R F+E Q++ YG +++Y D F+E P + +Y+S+
Sbjct: 255 ADSPNTRALFLSPLDDAYAELQRLFVEAQIEAYGNVTNVYAMDQFNEINPVSGATDYLSA 314
Query: 130 LGAAIYSGMQSGDSDAVWLMQGWLF--SYDPFWRPPQMKALLNSVP-LGKLVVLDLFAEV 186
+ Y+ + + + AVWLMQGWLF S FW +++A L +V+LDLF+E
Sbjct: 315 VSRRSYAALAAANPAAVWLMQGWLFYLSEGNFWTQERIEAYLRGPEDRAGMVILDLFSET 374
Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEG 246
P W + + G P+IWC +H+F GN ++G + + P+EA E+ +MVG+G++ E
Sbjct: 375 APQWQRTGSYAGRPWIWCQVHDFGGNQNLFGKITNTTVNPMEA-LRESDSMVGLGIATEA 433
Query: 247 IEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNVLYHTVYNCT 303
E N V+YDL + + +D ++ + ++ RRY R +PA + AW +L TVY+
Sbjct: 434 YEGNEVLYDLFFDQGWSATPIDTVSYFHDWTTRRYSGVRQLPASLYQAWELLRVTVYDY- 492
Query: 304 DGATDKNRDVI---VAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+I V+ ++P++ + Y K L + ++ P +W
Sbjct: 493 -----RASDLIGVPVSVYQLEPNLTGL-------YNTTTGKPTALHYDPAAL--PPIW-- 536
Query: 361 TSEVIRALELFIASGN---ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
LF+A+ L A +R DL+D+ RQ L+ L+ +++ A+
Sbjct: 537 --------RLFVAAAAAQPRLWAEPGFRLDLVDVMRQVLSNAFGRLYADLVAAFTGGAPP 588
Query: 418 G-VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ Q +R ++ D+D LLA F L WL +A+ ++ + + AR+Q+T+W
Sbjct: 589 SEIAQRGQRMRAVLGDVDALLATQPHFSLRRWLNAARAWGESTGENAAIAYEARSQVTIW 648
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
T LL DY K WSGL+ YY R I+ ++++ E+ G
Sbjct: 649 APGT-----LLNDYAAKAWSGLIATYYDERWRIFVDRLVDAAENHGG 690
>gi|453081268|gb|EMF09317.1| glycoside hydrolase family 89 protein [Mycosphaerella populorum
SO2202]
Length = 784
Score = 258 bits (658), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 164/542 (30%), Positives = 275/542 (50%), Gaps = 51/542 (9%)
Query: 8 GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFS 67
GG LPQSW+DQQ L + I+ R+ ELGM PVLP F+G VP + ++P+A W
Sbjct: 218 GGDLPQSWIDQQFELNQLIIARMIELGMTPVLPCFTGFVPTQISRLYPNASFVNGSQWNG 277
Query: 68 VKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYI 127
++ ++ L+ DPLF + ++FI + YG S +Y D ++EN P + Y+
Sbjct: 278 FQA--QYTNVTFLEPFDPLFTTLQKSFISKLDAAYGNVSSVYTLDQYNENDPFSGNVTYL 335
Query: 128 SSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEV 186
+ + +++ D +A+W +QGWLF S FW ++KA L V +++LDLF+E
Sbjct: 336 EDVASNTIKSLKAADPEAIWFIQGWLFYSAADFWDEERIKAYLGGVEDKDMLILDLFSES 395
Query: 187 KPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEG 246
+P W + ++G P+IWC LH++ GN ++G ++++ P+ A +E +TMVG+G++MEG
Sbjct: 396 QPQWQRTNSYFGKPWIWCQLHDYGGNQGLHGQVENVTMNPILALANETSTMVGIGLTMEG 455
Query: 247 IEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY--GRSVPAI-QD---AWNVLYHTVY 300
E N ++YD++ + A+ E ++ + + + RY +V + QD AW+++ T+Y
Sbjct: 456 QEGNEIIYDILLDQAWTPEPIESAGYFDDWVTSRYHCDDAVAGLPQDLYIAWDMMRQTIY 515
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
N TD D V + ++ P+ + + + + + +L S H + +
Sbjct: 516 NNTD--IDTAEAVTKSIFELQPNTTGLLDRTGHHSTRILYDPEILVSA-----WKHFYSA 568
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL---FLNIIE--AYQLND 415
+ E + EL +YR+DL+D+TRQ LA L F+N+ + +
Sbjct: 569 SQETPQLWEL-----------ESYRFDLVDITRQVLANAFYPLYGEFVNMTANSSLPSSS 617
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDG--FLLGPWLESAKQLAQNEEQEKQ--------- 464
Q R L L+ D+D +L F L W+ SA+ A E
Sbjct: 618 TASAEQTGARMLSLLLDLDSVLEASGNAHFSLESWIHSARLWAPTETNAADGDNMTAAAI 677
Query: 465 ---YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 521
YE+NAR QIT+W + + DY +K W+GL++ YY PR + + + S S
Sbjct: 678 ADFYEYNARNQITLWGPGGE-----ISDYASKQWAGLIKTYYVPRWERFVHFTLNSSTSA 732
Query: 522 DG 523
DG
Sbjct: 733 DG 734
>gi|295085513|emb|CBK67036.1| Alpha-N-acetylglucosaminidase (NAGLU). [Bacteroides xylanisolvens
XB1A]
Length = 718
Score = 258 bits (658), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 167/568 (29%), Positives = 278/568 (48%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +WST + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWSTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PVE+
Sbjct: 683 IREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|160884066|ref|ZP_02065069.1| hypothetical protein BACOVA_02042 [Bacteroides ovatus ATCC 8483]
gi|423291473|ref|ZP_17270321.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
CL02T12C04]
gi|156110408|gb|EDO12153.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus ATCC
8483]
gi|392663473|gb|EIY57023.1| hypothetical protein HMPREF1069_05364 [Bacteroides ovatus
CL02T12C04]
Length = 718
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 167/568 (29%), Positives = 279/568 (49%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP A P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEAFAQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMTIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PVE+
Sbjct: 683 IREWEEQWI--TSPWINSTTPFDDPVEA 708
>gi|323344412|ref|ZP_08084637.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
gi|323094539|gb|EFZ37115.1| alpha-N-acetylglucosaminidase [Prevotella oralis ATCC 33269]
Length = 730
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 169/568 (29%), Positives = 262/568 (46%), Gaps = 46/568 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP+ WL+ Q LQKKIL R M PVLPAF+G+VPA L+ +FP A I
Sbjct: 198 MANIDRWNGPLPKEWLNGQKELQKKILARERAFNMKPVLPAFAGHVPAELKRIFPDANIK 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
LG W + ++ C + L +PLF +I + ++E+Q +G T HIY D F+E PP
Sbjct: 258 SLGKWGGFEE--KYLC-HFLSPEEPLFSKIQKLYLEEQTALFG-TDHIYGVDPFNEVEPP 313
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
P Y+ + +Y + + D A W+ GW+FSYD W P +++A L VP GK+ +
Sbjct: 314 SWEPAYLRKVSKNMYGTLTAVDPKAEWMQMGWMFSYDNKHWTPDRVQAFLTGVPKGKMSL 373
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD + E +W T+ FYG PYIWC L NF GN + G + A + M+G
Sbjct: 374 LDYYCENVELWKTTDGFYGQPYIWCYLGNFGGNTTLMGNVKESGRRLDNALANGQRNMLG 433
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G ++EG++ Y+ + + H D + WI+ + R YG P+++ AW++L++ +
Sbjct: 434 AGSTLEGLDVIQFPYEYLYNKLWSHAVADSR-WIDDLADRHYGGVSPSVRKAWHILFNDI 492
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + + +G N+ +P Y L
Sbjct: 493 Y---------------------VQVSASMQGVLTNF-RPALNNNYPHRTAIEYPAERL-- 528
Query: 360 STSEVIR-ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
EV R L++ NEL + D+I + RQ L + AY D
Sbjct: 529 --EEVWRLLLDVPRCDRNEL------QLDIIAVGRQVLGNRFAVVKTQFDSAYANKDIPR 580
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+ + EL+ D+D L + + + W++ A++L +E + YE NAR IT W
Sbjct: 581 LKAKACEMEELLGDLDRLTSFNSRCSINRWIDDARKLGSTKELKDYYEKNARNLITTWGG 640
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
N + DY ++ W GL+ YY R +Y ++ + E+ F + + K
Sbjct: 641 N-------INDYASRTWGGLIGSYYAHRWRLYIDDILAAAEANKEFDQNAFNEKVSKFEQ 693
Query: 539 DWQNGRNVYPVESNGDALITSQWLYNKY 566
W V D L + L KY
Sbjct: 694 AWIISTEPITVPKRTDLLTFCRILIQKY 721
>gi|293371915|ref|ZP_06618319.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|292633161|gb|EFF51738.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 718
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 169/568 (29%), Positives = 277/568 (48%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ S+ S +Q +S + +
Sbjct: 492 DAMEEAWKLFRKTAYS---------------------SLYSYPRFTWQTV---ISDQRRI 527
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
S D+ ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 528 SKIDLSDDY----------LQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PVE+
Sbjct: 683 IREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|336404352|ref|ZP_08585050.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
gi|335943680|gb|EGN05519.1| hypothetical protein HMPREF0127_02363 [Bacteroides sp. 1_1_30]
Length = 718
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 169/568 (29%), Positives = 277/568 (48%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ S+ S +Q +S + +
Sbjct: 492 DAMEEAWKLFRKTAYS---------------------SLYSYPRFTWQTV---ISDQRRI 527
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
S D+ ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 528 SKIDLSDDY----------LQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PVE+
Sbjct: 683 IREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|299144719|ref|ZP_07037787.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
gi|298515210|gb|EFI39091.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 3_1_23]
Length = 718
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 166/568 (29%), Positives = 278/568 (48%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKADKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PVE+
Sbjct: 683 IREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|298480124|ref|ZP_06998323.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
gi|298273933|gb|EFI15495.1| alpha-N-acetylglucosaminidase [Bacteroides sp. D22]
Length = 718
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 166/568 (29%), Positives = 278/568 (48%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAGYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PVE+
Sbjct: 683 IREWEEQWI--TSPWSNSTTPFDDPVEA 708
>gi|237719039|ref|ZP_04549520.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
gi|229451817|gb|EEO57608.1| alpha-N-acetylglucosaminidase [Bacteroides sp. 2_2_4]
Length = 718
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 165/568 (29%), Positives = 278/568 (48%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+++ +EL S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYVSCADELKGSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDKLLASHPLYRLEEWVELARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PV++
Sbjct: 683 IREWEEQWI--TSPWSNSTTPFDDPVKA 708
>gi|168216263|ref|ZP_02641888.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens NCTC 8239]
gi|182381741|gb|EDT79220.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens NCTC 8239]
Length = 2104
Score = 256 bits (653), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 171/565 (30%), Positives = 280/565 (49%), Gaps = 51/565 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEEEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L G+ +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKGQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I +AWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTT 819
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W + L+DY N+ WSGL DYY R + + L+ G K +W K
Sbjct: 820 WGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFK 875
Query: 536 LTNDWQNGRN----VYPVESNGDAL 556
+ DW N ++ +YP E++ + L
Sbjct: 876 MEYDWVNKKSDTDKLYPTEASNENL 900
>gi|422345314|ref|ZP_16426228.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
WAL-14572]
gi|373228039|gb|EHP50349.1| hypothetical protein HMPREF9476_00301 [Clostridium perfringens
WAL-14572]
Length = 1842
Score = 255 bits (652), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 171/565 (30%), Positives = 279/565 (49%), Gaps = 51/565 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTT 819
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W + L+DY N+ WSGL DYY R + + L+ G K +W K
Sbjct: 820 WGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFK 875
Query: 536 LTNDWQNGRN----VYPVESNGDAL 556
+ DW N ++ +YP E++ + L
Sbjct: 876 MEYDWVNKKSDTDKLYPTEASNENL 900
>gi|423293381|ref|ZP_17271508.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
CL03T12C18]
gi|392678324|gb|EIY71732.1| hypothetical protein HMPREF1070_00173 [Bacteroides ovatus
CL03T12C18]
Length = 718
Score = 255 bits (652), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 166/568 (29%), Positives = 278/568 (48%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPITPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSIAAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL +S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKSSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PVE+
Sbjct: 683 IREWEEQWI--TSPWINSTTPFDDPVEA 708
>gi|281200617|gb|EFA74835.1| alpha-N-acetylglucosaminidase [Polysphondylium pallidum PN500]
Length = 688
Score = 255 bits (652), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 126/284 (44%), Positives = 180/284 (63%), Gaps = 9/284 (3%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ W G L W+ Q LQ +IL R+ + GM VLP F+G+VP AL+ +P A IT
Sbjct: 213 MGNVNEWAGNLTLGWMADQRDLQIQILTRMRQFGMQAVLPGFAGHVPEALETHYPKANIT 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLG W + TY L+ DPLF +I +AF+ Q + YG T H YN D F+E PP
Sbjct: 273 QLGGWGTFSG------TYYLNPDDPLFSKIAQAFVITQNQLYG-TDHFYNFDPFNELEPP 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
Y+ + ++++ + + D +W++QGW DP FW PPQ +A L+ VP+GK++V
Sbjct: 326 SSDLTYLKNCSQSMFNNLIAADPQGIWVLQGWFLVDDPEFWLPPQTEAFLSGVPIGKMIV 385
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+++V P W+++ +YG +IWCMLHNF G MYG + I+ P+EAR S + MVG
Sbjct: 386 LDLWSDVIPAWNSTNYYYGHNWIWCMLHNFGGRSGMYGKIPFISTNPIEAR-SLSPNMVG 444
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
G++ E IEQN +VYDLMSEMA++ D+K W++QY RRYG+
Sbjct: 445 TGLTPEAIEQNVIVYDLMSEMAWRSTPPDLKEWVDQYVTRRYGK 488
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 83/192 (43%), Gaps = 8/192 (4%)
Query: 366 RALELFIASGNELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQLNDAHGVFQLS 423
L + ++ ++T+ +DL ++T QAL NEL LN A+ N + S
Sbjct: 489 HGLPFLSINDTSITNTSTFSFDLTEITTQALINLFMTNELQLN--SAFLNNSLEEFNKYS 546
Query: 424 RRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEE 483
L +++D+ + + + L+G W A+ L E YE NAR QIT+W
Sbjct: 547 EALLSIIQDVYTIASTQEMLLVGHWTARARALTPANESTNLYEMNARNQITLW----GPT 602
Query: 484 ASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNG 543
S + DY K W GL D+Y R ++ K + SL S F ++ + W
Sbjct: 603 YSDVHDYAYKLWGGLTEDFYLARWTLFVKELQYSLTSSQPFNSTLFQTNCEAVEEVWNLQ 662
Query: 544 RNVYPVESNGDA 555
YP G++
Sbjct: 663 TYPYPTIPTGNS 674
>gi|168207628|ref|ZP_02633633.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens E str. JGS1987]
gi|170661027|gb|EDT13710.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens E str. JGS1987]
Length = 2104
Score = 255 bits (652), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 171/565 (30%), Positives = 279/565 (49%), Gaps = 51/565 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQEEVFGEVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTT 819
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W + L+DY N+ WSGL DYY R + + L+ G K +W K
Sbjct: 820 WGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFK 875
Query: 536 LTNDWQNGRN----VYPVESNGDAL 556
+ DW N ++ +YP E++ + L
Sbjct: 876 MEYDWVNKKSDTDKLYPTEASNENL 900
>gi|169346867|ref|ZP_02865815.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens C str. JGS1495]
gi|169296926|gb|EDS79050.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens C str. JGS1495]
Length = 2104
Score = 255 bits (651), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 171/565 (30%), Positives = 279/565 (49%), Gaps = 51/565 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTT 819
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W + L+DY N+ WSGL DYY R + + L+ G K +W K
Sbjct: 820 WGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFK 875
Query: 536 LTNDWQNGRN----VYPVESNGDAL 556
+ DW N ++ +YP E++ + L
Sbjct: 876 MEYDWVNKKSDTDKLYPTEASNENL 900
>gi|168212494|ref|ZP_02638119.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens CPE str. F4969]
gi|170716100|gb|EDT28282.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens CPE str. F4969]
Length = 2104
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 171/565 (30%), Positives = 279/565 (49%), Gaps = 51/565 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDADDWTKDLFEFNARALVTT 819
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W + L+DY N+ WSGL DYY R + + L+ G K +W K
Sbjct: 820 WGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFK 875
Query: 536 LTNDWQNGRN----VYPVESNGDAL 556
+ DW N ++ +YP E++ + L
Sbjct: 876 MEYDWVNKKSDTDKLYPTEASNENL 900
>gi|383115207|ref|ZP_09935965.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
gi|313695376|gb|EFS32211.1| hypothetical protein BSGG_2911 [Bacteroides sp. D2]
Length = 718
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 165/568 (29%), Positives = 277/568 (48%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVVPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKGSELYRNDLIEFVSYYVAAKAENFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PV++
Sbjct: 683 IREWEEQWI--TSPWSNSTTPFDDPVKA 708
>gi|242077446|ref|XP_002448659.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
gi|241939842|gb|EES12987.1| hypothetical protein SORBIDRAFT_06g030930 [Sorghum bicolor]
Length = 252
Score = 254 bits (650), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 122/185 (65%), Positives = 145/185 (78%)
Query: 386 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 445
YDL+DLTRQ LAKYAN++FL IIE+Y+ N + V L + FL LV D+D LL+ H+GFLL
Sbjct: 51 YDLVDLTRQVLAKYANDVFLKIIESYKSNKMNQVTILCKHFLNLVNDLDTLLSSHEGFLL 110
Query: 446 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 505
GPWLESAK LA+N EQE QYEWNARTQITMWFDNT+ +ASLLRDY NKYWSGLLRDYYGP
Sbjct: 111 GPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTETKASLLRDYANKYWSGLLRDYYGP 170
Query: 506 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNK 565
RAAIYFK+++ S+E F L++WRREWI LTN+WQ+ R V+ GD+L S LY K
Sbjct: 171 RAAIYFKHLLLSMEKNAPFALEEWRREWISLTNNWQSDRKVFSTTPTGDSLNISWSLYIK 230
Query: 566 YLQGT 570
YL T
Sbjct: 231 YLSNT 235
>gi|182624959|ref|ZP_02952737.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens D str. JGS1721]
gi|177909756|gb|EDT72174.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens D str. JGS1721]
Length = 2104
Score = 254 bits (650), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 171/565 (30%), Positives = 279/565 (49%), Gaps = 51/565 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTT 819
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W + L+DY N+ WSGL DYY R + + L+ G K +W K
Sbjct: 820 WGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFK 875
Query: 536 LTNDWQNGRN----VYPVESNGDAL 556
+ DW N ++ +YP E++ + L
Sbjct: 876 MEYDWVNKKSDTDKLYPTEASNENL 900
>gi|294674521|ref|YP_003575137.1| alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
gi|294472030|gb|ADE81419.1| putative alpha-N-acetylglucosaminidase [Prevotella ruminicola 23]
Length = 754
Score = 254 bits (649), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 163/532 (30%), Positives = 261/532 (49%), Gaps = 54/532 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW +Q LQK+IL R+ +LGM+PVLP + G VP + +
Sbjct: 186 MNNLEGWGGPLPTSWYARQEKLQKQILARMKQLGMHPVLPGYCGMVPHDAKEKL-GLNVA 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
G W + L TD F EI + + K +G+ + Y+ D F E+
Sbjct: 245 DAGLWNGFQRPAN------LLPTDARFSEIATLYYNELTKLFGKADY-YSMDPFHESNDD 297
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
P +D + G A+ M+ + AVW++QGW + +P +A+++ + G L+
Sbjct: 298 PNID----YAKAGQAMMQAMKRVNPKAVWVIQGW--TENP------REAMVDDMKTGDLL 345
Query: 179 VLDLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYG----ILDSIAFGPVE 228
VLDLF+E +P IW + + +++C+L NF N+ ++G +LD+
Sbjct: 346 VLDLFSECRPMFGIPSIWKREQGYKQHQWLFCLLENFGANVGLHGRMDQLLDNFYMLQSS 405
Query: 229 ARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAI 288
++++ + G+G +MEG E NPV+++LMSE+ ++ EK + W+ Y RYG AI
Sbjct: 406 KFQAQSSKLKGIGFTMEGSENNPVMFELMSELPWRPEKFTKEQWVKNYVKARYGVEDEAI 465
Query: 289 QDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSE 348
+ AW L ++YNC G + SI G+P + +
Sbjct: 466 EKAWLTLAKSIYNCPAGNNQQG---------PHESIFC---------GRPT----LNNFQ 503
Query: 349 TSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
SS+ +Y + +A +L + + +N + YDL+D+TRQALA A + I
Sbjct: 504 ASSWSKMKNYYDPAMTKKAAKLMNSVAEKYRGNNNFEYDLVDITRQALADQARLQYQKTI 563
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 468
Y+ + + RFL+++ D LL F +G W + A E++K YEWN
Sbjct: 564 ADYKAFSRKQFDRDAERFLKMLLLQDKLLGTRTEFRVGHWTQDAVNAGNTAEEKKLYEWN 623
Query: 469 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
AR QIT W + + LRDY +K W GLL+D+Y R YF + +++
Sbjct: 624 ARVQITTWGNRYCADTGGLRDYAHKEWQGLLKDFYYVRWKSYFDALAAQMKA 675
>gi|224026593|ref|ZP_03644959.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
18228]
gi|224019829|gb|EEF77827.1| hypothetical protein BACCOPRO_03350 [Bacteroides coprophilus DSM
18228]
Length = 635
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 161/476 (33%), Positives = 233/476 (48%), Gaps = 45/476 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GGPLP+SW+D+ +VL ++I+ R ELGM P+ FSG VP L+ +P AKI
Sbjct: 196 MQNLQSYGGPLPKSWIDKHVVLGQQIIRRELELGMKPIQQGFSGYVPRELKEKYPEAKIQ 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+W K + LD TD LF IGR F+E++ K +G +Y D F E+ PP
Sbjct: 256 PQPSWCGFKGAAQ------LDPTDSLFQVIGRDFLEEEKKLFG-AHGVYAADPFHESRPP 308
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
VD+PEY+S++G +I++ Q D ++W MQ W R P +KA VP L++L
Sbjct: 309 VDTPEYLSAVGRSIHTLFQEFDPYSLWAMQAWSL------REPIVKA----VPEEHLLIL 358
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL + +G P + LHNF G I M+G L +A EA S + + G
Sbjct: 359 DLNGSK---CTQRNACWGYPVVAGNLHNFGGRINMHGDLPLLAGNQYEAAVSLSPNVCGS 415
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G+ MEGIEQNP+ Y+L EM Q KV++ W+ +Y++RRYG A +L Y
Sbjct: 416 GLFMEGIEQNPLYYELAFEMPLQKGKVELDGWLKEYALRRYGSKWENTHKALLLLLEGPY 475
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
T+ + +I A P++ G G P YS
Sbjct: 476 RPGTNGTELS-SIIAA----RPALHVKKSGPNAGLGIP--------------------YS 510
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+I A + L S YR+D++DL RQ + + +A++ D G
Sbjct: 511 PWLLIEAQAFMLKDAGILKTSEAYRFDIMDLQRQIMTNLGQAIHKEAAKAFEAGDEKGFE 570
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
SRR+LEL+ D+D LL F WL A+ EE++ Q+E NA +T+W
Sbjct: 571 LHSRRYLELLTDVDTLLRTRPEFNFDRWLADARSWGDTEEEKNQFERNATALVTIW 626
>gi|429766730|ref|ZP_19298977.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
gi|429183354|gb|EKY24416.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
Length = 2284
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 167/561 (29%), Positives = 277/561 (49%), Gaps = 46/561 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGPLP +W + ++ L +++ R+ LG+ PVL +SG VP Q P A+I
Sbjct: 378 MQNMTSFGGPLPDNWFEDRVELGRQLHERMQTLGIKPVLQGYSGMVPLDFQKKNPDAQIL 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ D F E+ F E+Q + YG + Y D F E NT
Sbjct: 438 SQGGWCGFDR-PNMLKTYVNDGERDYFQEVADVFYEKQKEVYGDITDYYAVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+DS + + I M D DA+W++Q W + D ++ L N + +
Sbjct: 497 GGMDS----ARIYGTIQDKMIEHDEDAIWVIQHWQGNPDN----TKLSGLTNK---EQAL 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTM 237
+LDL +++ P + T +P++W MLHNF G + + G ++++A EA T+EN M
Sbjct: 546 ILDLNSDLNPDY-TRFDNQDIPWVWNMLHNFGGRMGLDGQVETVATSITEALATTEN--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G++ E + +P+VY+LM +M + + ++ + W+N Y RRYG +AW +L
Sbjct: 603 KGIGITPEALANSPIVYELMGDMIWTRDPINYREWVNNYIERRYGAVNEDAIEAWEILLE 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y +D + SII+ ++ A + S++ H +
Sbjct: 663 TAYKTSD----------YYYQGAAESIIN-------------ARPATSINSASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y E+ RA+ELFI+ +EL S+ + YD +D+T+Q LA A E ++ AY DA
Sbjct: 700 SYDKKELERAMELFISCYDELKDSDAFVYDFLDVTKQVLANSAQEYHKEMVAAYNSGDAE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
++S FL+L+ + +L+ FL+G W+E ++ + + + + +E+NAR IT
Sbjct: 760 KFERISEHFLDLIRLQERVLSTSPEFLVGTWIEQSRTMLADADDWTKDLFEFNARALITT 819
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W D L+DY N+ W+GL D Y R ++ + LE+G DW + +
Sbjct: 820 WGDYKN---GSLKDYSNRQWAGLTEDLYLKRWEMWIDGIRTELETGVTAPSIDWHKVEYE 876
Query: 536 LTNDWQNGRNVYPVESNGDAL 556
+ + N YP E +G+ L
Sbjct: 877 WATEKTDESNAYPTEGSGEDL 897
>gi|110801838|ref|YP_698175.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens SM101]
gi|110682339|gb|ABG85709.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens SM101]
Length = 2095
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 172/565 (30%), Positives = 279/565 (49%), Gaps = 51/565 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 369 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 428
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F + F E+Q + +G ++ Y D F E NT
Sbjct: 429 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 487
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 488 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 536
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 537 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 593
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I +AWN++
Sbjct: 594 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNEEILEAWNIILD 653
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 654 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 690
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY DA
Sbjct: 691 VYDKSEFEKAIEIFSKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNRDAE 750
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T
Sbjct: 751 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDADDWTKDLFEFNARALVTT 810
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W + L+DY N+ WSGL DYY R + + L+ G K +W K
Sbjct: 811 WGSRNNADGGGLKDYSNRQWSGLTGDYYYARWEKWINGLQIELDGG----AKAPNIDWFK 866
Query: 536 LTNDWQNGRN----VYPVESNGDAL 556
+ DW N ++ +YP E++ + L
Sbjct: 867 MEYDWVNKKSDTDKLYPTEASNENL 891
>gi|340514474|gb|EGR44736.1| glycoside hydrolase family 89 [Trichoderma reesei QM6a]
Length = 762
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 161/526 (30%), Positives = 272/526 (51%), Gaps = 43/526 (8%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
NL G W LP W+D Q LQKKI+ R+ ELG+ P+LPAF G VP A V P A++
Sbjct: 209 NLQGSWSSSLPFEWVDDQFALQKKIVKRMVELGITPILPAFPGFVPRAAPRVLPDARLLH 268
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W + LD DPLF ++ R+FI +Q + YG ++ Y D F+E PP
Sbjct: 269 SIQWAGFPE--IFTEDTFLDPVDPLFAQMQRSFITKQKQAYGNVTNFYTLDQFNEMIPPS 326
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPL-GKLVV 179
Y+ ++ + + ++S D +A+W+ Q WLF+ + FW +++A L V +++
Sbjct: 327 GDVAYLRNVSSNTWKALKSADPNAIWVFQAWLFAQNTTFWTNERIEAYLGGVTADSDMLI 386
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD+++E P W ++ +YG P+IWC L N+ I +YG + ++ P+ A E+T++ G
Sbjct: 387 LDIWSESMPQWQRAQSYYGKPWIWCELQNYGATINLYGQIQNVTNSPILA-LQESTSLSG 445
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA--IQDAWNVLYH 297
G+SMEG + N +VYDL+ A+ E +D +A+ + ++ RY I DAW +
Sbjct: 446 FGLSMEGQQNNEIVYDLLLAQAWSSEPLDTEAYFHNWASARYSSDQRPGFIHDAWETVRT 505
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
TVY+ T+ + P V SII + V + + + ++ + L
Sbjct: 506 TVYDNTN---------LTLMPSVPKSIIEL-----------VPRTSNM-ADITGILGTKL 544
Query: 358 WYSTSEVIRALELFIASG---NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
Y + ++ A + +G L ++ Y+YDL+D TRQ LA ++ NI++ Y +
Sbjct: 545 PYDPAVMVSAWKQLYHAGLQDTSLFNNSAYQYDLVDWTRQVLANAFIPIYKNIVDIYYNS 604
Query: 415 DAHGVFQLSR------RFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 468
+ ++ R + +L+ +D +L+ + F L WL +A+ A + +E+
Sbjct: 605 NQTAGSRIQRLKAQGQQVTKLLLSLDLVLSSNRNFRLSTWLSAARSSAPSPAYVDSFEYE 664
Query: 469 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM 514
AR QIT+W + Q L DY +K WSGL++ Y+ R ++ +Y+
Sbjct: 665 ARNQITLWGPSGQ-----LIDYASKAWSGLMKTYHLKRWQMFVEYL 705
>gi|168209163|ref|ZP_02634788.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens B str. ATCC 3626]
gi|170712640|gb|EDT24822.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens B str. ATCC 3626]
Length = 2104
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 171/565 (30%), Positives = 277/565 (49%), Gaps = 51/565 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F + F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDADDWTKDLFEFNARALVTT 819
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W L+DY N+ WSGL DYY R + + L+ G K +W K
Sbjct: 820 WGSRNNANGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQTELDGG----AKAPNIDWFK 875
Query: 536 LTNDWQNGRN----VYPVESNGDAL 556
+ DW N ++ +YP E++ + L
Sbjct: 876 MEYDWVNKKSDTDKLYPTEASNENL 900
>gi|18309848|ref|NP_561782.1| alpha-N-acetylglucosaminidase [Clostridium perfringens str. 13]
gi|18144526|dbj|BAB80572.1| probable alpha-N-acetylglucosaminidase [Clostridium perfringens
str. 13]
gi|288872041|dbj|BAI70446.1| alpha-N-acetylglucosaminidase [Clostridium perfringens]
Length = 2104
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 170/565 (30%), Positives = 278/565 (49%), Gaps = 51/565 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + P A+
Sbjct: 378 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNPEAQTI 437
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F + F E+Q + +G ++ Y D F E NT
Sbjct: 438 SQGGWCGFDR-PDMLKTYVNEGEVDYFQNVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 496
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 497 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 545
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 546 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 602
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ ++L+ +MA+ ++++ + W Y RRYG++ I DAWN++
Sbjct: 603 VGIGITPEAINTNPLAHELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILDAWNIILD 662
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 663 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 699
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 700 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 759
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T
Sbjct: 760 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTT 819
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W + L+DY N+ WSGL DYY R + + L+ G K +W K
Sbjct: 820 WGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFK 875
Query: 536 LTNDWQNGRN----VYPVESNGDAL 556
+ DW N ++ +YP E++ + L
Sbjct: 876 MEYDWVNKKSDTDKLYPTEASNENL 900
>gi|170292392|pdb|2VC9|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With 2-Acetamido-1,2-Dideoxynojirmycin
gi|170292393|pdb|2VCA|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With Beta-N-Acetyl-D-Glucosamine
gi|170292394|pdb|2VCB|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
In Complex With Pugnac
gi|170292395|pdb|2VCC|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens
Length = 891
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 166/564 (29%), Positives = 277/564 (49%), Gaps = 49/564 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + A+
Sbjct: 344 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 403
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 404 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 462
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 463 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 511
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A + + + MV
Sbjct: 512 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLA-TEIPKALANSEHMV 569
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I +AWN++ T
Sbjct: 570 GIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDT 629
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y K D + SII+ G +G +KS S++ H +
Sbjct: 630 AYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIV 666
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 667 YDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEK 726
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMW 476
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 727 FKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTW 786
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 787 GSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKM 842
Query: 537 TNDWQNGRN----VYPVESNGDAL 556
DW N ++ +YP E++ + L
Sbjct: 843 EYDWVNKKSDTDKLYPTEASNENL 866
>gi|336412611|ref|ZP_08592964.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
3_8_47FAA]
gi|335942657|gb|EGN04499.1| hypothetical protein HMPREF1017_00072 [Bacteroides ovatus
3_8_47FAA]
Length = 718
Score = 253 bits (645), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 165/568 (29%), Positives = 277/568 (48%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ +IL R+ ELGM P+ PAF+G VP P +
Sbjct: 197 MGNLNKWDGPLSDAWQQNQIALQHQILTRMRELGMQPIAPAFAGFVPEGFVQKHPDTQFR 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W + Y+L P F EIG+ F+E+ KE+G ++ Y D+F+E P
Sbjct: 257 HM-RWGGFDEE---YNAYVLPPDSPFFEEIGKLFVEEWEKEFGENTY-YLSDSFNEMELP 311
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + ++ G IY + +G+ DAVW+ QGW F Y FW +KALL++VP
Sbjct: 312 IDKEDKEAKYKLLAEYGETIYKSITAGNPDAVWVTQGWTFGYQHSFWDKESLKALLSNVP 371
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K +W+T + FYG +I+ + NF G M G LD A
Sbjct: 372 DDKMIIIDLGNDYPKWVWNTEQTWKVHDGFYGKKWIFSYVPNFGGKNTMTGDLDMYASSS 431
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
V+A R + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y RYG
Sbjct: 432 VKALRAANKGNLIGFGSAPEGLENNEVVYELLADMGWSSDSIDLDDWMKIYCEARYGGYP 491
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
A+++AW + T Y+ + ++P + + + +SK +
Sbjct: 492 DAMEEAWKLFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKIDL- 532
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++A+ L+ + +EL S YR DLI+ +A A +
Sbjct: 533 ---------------SDDYLQAIRLYASCADELKNSELYRNDLIEFVSYYVAAKAEIFYK 577
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ N + ++ ++L+ D+D LLA H + L W+E A+ +++ Y
Sbjct: 578 QALKDDSENRVLAAQRNLQQTVDLLMDVDRLLASHPLYRLEEWVELARNSGTTLQEKDAY 637
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR +YF + D +
Sbjct: 638 EANAKRLITSW-------GGIQEDYAARFWSGLIKDYYIPRIQLYF--------TKDRNK 682
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
+++W +WI T+ W N + PV++
Sbjct: 683 IREWEEQWI--TSPWSNSTTPFDDPVKA 708
>gi|404487024|ref|ZP_11022211.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
YIT 11860]
gi|404335520|gb|EJZ61989.1| hypothetical protein HMPREF9448_02667 [Barnesiella intestinihominis
YIT 11860]
Length = 722
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 178/570 (31%), Positives = 271/570 (47%), Gaps = 77/570 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+GW GPL W +Q+ LQ KIL R+ ELGM+P+ PAF+G VP A P +
Sbjct: 200 MGNLNGWDGPLTNGWQKEQIKLQHKILNRMRELGMDPIAPAFAGFVPTAFAERHPEIQFK 259
Query: 61 QLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
L W F K + Y+L P F EIG+ FIE+ KE+G+ ++ Y D+F+E
Sbjct: 260 HL-EWGGFDEKYN-----AYVLPPETPYFKEIGKLFIEEWEKEFGKNTY-YLSDSFNEMK 312
Query: 119 PPV------DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNS 171
PV + ++ G +IY + +G+ DAVW+ QGW F Y FW ++ALL+
Sbjct: 313 LPVAEGDDDGKHKLLAQYGESIYHSIAAGNPDAVWVTQGWTFGYQHDFWDKASLQALLSR 372
Query: 172 VPLGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
VP K++++DL + K +W T + FYG +I+ + NF G M G L A
Sbjct: 373 VPDDKMIIIDLGNDYPKWVWGTEQTWKNHDGFYGKKWIFSYVPNFGGKTPMTGDLQMYAT 432
Query: 225 GPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
EA S N +VG G + EG+E N VVY+L+++M + + +D+ +W+ Y RYG
Sbjct: 433 SSAEALHSANAGNLVGFGSAPEGLENNEVVYELLADMGWTADSIDLDSWLPVYCKARYGG 492
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+ AW T Y+ +V PD + +SK
Sbjct: 493 CPAAMDSAWQRFKETAYSSLYSYPRFTWQTVV--PDT----------------RRISKLD 534
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
V S ++ +ELF++ + L +S Y D I+ LA A++
Sbjct: 535 VSDS----------------FLQGVELFLSCADSLESSPLYVNDAIEYASYYLAAKADDC 578
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ ++ L + Q R +E++ D+D LLA H + L W++ A+ + + ++
Sbjct: 579 YKRALKEDSLGNRVAAMQQLDRSVEILLDVDKLLASHPLYRLEEWVDMARDWGKTDLEKD 638
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YE NA+ IT W DY ++WSGL++DYY PR +YF S + D
Sbjct: 639 AYEANAKRLITTW-------GGFQEDYAARFWSGLIKDYYIPRMKLYF-----SEQRAD- 685
Query: 524 FRLKDWRREWIKLTNDWQNGRNVY--PVES 551
L W WIK W N + P++S
Sbjct: 686 --LDRWEENWIKAP--WHNTSTSFEDPLQS 711
>gi|429740221|ref|ZP_19273923.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
gi|429153946|gb|EKX96707.1| Alpha-N-acetylglucosaminidase [Prevotella saccharolytica F0055]
Length = 721
Score = 251 bits (640), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 169/551 (30%), Positives = 266/551 (48%), Gaps = 70/551 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W QQ+ LQ KIL R+ LGM+P+ PAF+G VP + P ++
Sbjct: 197 MGNLNTWNGPLSANWHSQQIALQHKILERMRLLGMHPITPAFAGFVPEGFVKLHPEVRVK 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
F + Y+L P F++IG+ FIE+ KE+ + ++ Y D+F+E P
Sbjct: 257 H----FEWGGFDKSLNAYMLPPDSPYFLQIGKLFIEEWEKEFSKNTY-YLSDSFNEMELP 311
Query: 121 VDSPE-------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSV 172
V SP+ +S G AIY + +G+ +AVW+ QGW F Y FW ++ALL V
Sbjct: 312 V-SPDDTDGKHRLLSKYGEAIYQSIVAGNPNAVWITQGWTFGYQHRFWDKESLQALLERV 370
Query: 173 PLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
P KL+++DL + + W T K FYG +I + NF G + G L+ A
Sbjct: 371 PNDKLIIVDLANDYPKWVWKTEQTWKTHKGFYGKRWILSYVPNFGGKTLLTGDLNLYASC 430
Query: 226 PVEARTS-ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
EA + ++G G + EG+E N VVY+L+++M +Q++ +D+ W+ +Y RYG
Sbjct: 431 SAEALAHPDKGRLIGFGSAPEGLENNEVVYELLADMGWQNQPIDLDHWLIEYCRSRYGSC 490
Query: 285 VPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAV 344
A+Q AW L +VY+ S+ S +Q V + +
Sbjct: 491 PNAMQKAWKGLCRSVYS---------------------SLYSYPRFTWQT----VIPDTL 525
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
KS+ YD ++ RA+E F+ +L S YR D + Q + A+ L+
Sbjct: 526 RKSK---YDFNDTYF------RAVEDFLLCAPQLKDSPLYRSDALLFAAQYIGAKADNLY 576
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
++A + + QL + ++L+ D LLA H L W+++A+ A ++ Q
Sbjct: 577 RKALQAKAVGNRARAKQLVDKVIQLLLQADKLLASHPTDRLSRWVDAARTAAATPQERMQ 636
Query: 465 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 524
YE +A+ IT W + +DY +YWSGL++ YY PR +YF
Sbjct: 637 YEMDAKRLITSW-------GGIQQDYAARYWSGLIKTYYVPRIKLYF-------AGSKKK 682
Query: 525 RLKDWRREWIK 535
L +W W+K
Sbjct: 683 ELNNWEENWLK 693
>gi|110800516|ref|YP_695309.1| alpha-N-acetylglucosaminidase [Clostridium perfringens ATCC 13124]
gi|110675163|gb|ABG84150.1| alpha-N-acetylglucosaminidase family protein [Clostridium
perfringens ATCC 13124]
Length = 2095
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 169/565 (29%), Positives = 278/565 (49%), Gaps = 51/565 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + A+
Sbjct: 369 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 428
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT
Sbjct: 429 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 487
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 488 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 536
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ M
Sbjct: 537 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEH--M 593
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I +AWN++
Sbjct: 594 VGIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILD 653
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T Y K D + SII+ G +G +KS S++ H +
Sbjct: 654 TAYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKI 690
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 691 VYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGE 750
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T
Sbjct: 751 KFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTT 810
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W + L+DY N+ WSGL DYY R + + L+ G K +W K
Sbjct: 811 WGSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFK 866
Query: 536 LTNDWQNGRN----VYPVESNGDAL 556
+ DW N ++ +YP E++ + L
Sbjct: 867 MEYDWVNKKSDTDKLYPTEASNENL 891
>gi|383280354|pdb|4A4A|A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In
Complex With Its Substrate Glcnac-Alpha-1,4-Galactose
Length = 914
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 164/564 (29%), Positives = 277/564 (49%), Gaps = 49/564 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + A+
Sbjct: 367 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 426
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P TY+ + F ++ F E+Q + +G ++ Y D F + NT
Sbjct: 427 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHQGGNT 485
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D+ + I + M D+DAVW++Q W + P L + +
Sbjct: 486 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 534
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A + + + MV
Sbjct: 535 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLA-TEIPKALANSEHMV 592
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G++ + I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I +AWN++ T
Sbjct: 593 GIGITPQAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDT 652
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y K D + SII+ G +G +KS S++ H +
Sbjct: 653 AYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIV 689
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY +
Sbjct: 690 YDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEK 749
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMW 476
+S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W
Sbjct: 750 FKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTW 809
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+ L+DY N+ WSGL DYY R + + L+ G K +W K+
Sbjct: 810 GSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKM 865
Query: 537 TNDWQNGRN----VYPVESNGDAL 556
DW N ++ +YP E++ + L
Sbjct: 866 EYDWVNKKSDTDKLYPTEASNENL 889
>gi|224027030|ref|ZP_03645396.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
18228]
gi|224020266|gb|EEF78264.1| hypothetical protein BACCOPRO_03789 [Bacteroides coprophilus DSM
18228]
Length = 837
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 172/552 (31%), Positives = 257/552 (46%), Gaps = 73/552 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI- 59
M N+ G GPL W QL LQ KIL R+ LGM P+ P F G +P A + ++P I
Sbjct: 193 MGNVSGIDGPLNPDWHAGQLALQHKILDRMRALGMKPICPGFPGFIPEAFKRIYPDLHIV 252
Query: 60 -TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
T G F +++ T+PLF +I AFI++ KE+G+ + Y D+F+E
Sbjct: 253 ETHWGGAFH---------NWMISPTEPLFAKISEAFIKEWEKEFGKCDY-YLVDSFNEMD 302
Query: 119 PPVDSP------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNS 171
P E +S G +YS ++ + DAVW+MQGW+F Y W + AL++
Sbjct: 303 IPFPEKGNPARYEMAASYGEKVYSSIKRANKDAVWVMQGWMFGYQRHIWDYETLGALVSR 362
Query: 172 VPLGKLVVLDL-------FAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
VP K+++LDL F + W K FY +++ ++ N G M G+LD A
Sbjct: 363 VPDDKMLLLDLAVDYNRHFWHSEVNWEYYKGFYNKQWVYSVIPNMGGKTGMTGVLDFYAN 422
Query: 225 GPVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
G +EA +S N +V G++ EGIE N V+Y+L+++ + ++DV+ W+ QYS+ RYG+
Sbjct: 423 GHLEALSSSNRGNLVAHGLAPEGIENNEVLYELVTDAGWSDHRMDVRDWLKQYSINRYGK 482
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
+ + AW+ L +VY F D G +N +S
Sbjct: 483 APAQLMKAWDYLLKSVYG--------------TFTDHPRFNWQFRPGLVKNGSINIS--- 525
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
+ + LE F+A+ EL S Y DL ++T L A L
Sbjct: 526 ------------------DDYFKGLESFVAASEELKDSPYYLTDLCEMTAHYLGSKAEIL 567
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
I + Y L D L RF + MD +L+ H L W+ A + A+ E Q K
Sbjct: 568 TRQIDQEYLLGDTLQAHFLQSRFETFMLGMDRILSQHPTLRLDRWVSFASKAARTEAQRK 627
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
QYE NAR +T+W + DY + WSGL+ YY R Y+K +SG
Sbjct: 628 QYEMNARRIVTVW-------GPPVDDYSARMWSGLVGSYYLGRWKEYYK----GRDSGKS 676
Query: 524 FRLKDWRREWIK 535
L W R+W++
Sbjct: 677 ADLSSWERKWVE 688
>gi|374384144|ref|ZP_09641670.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
12061]
gi|373228751|gb|EHP51054.1| hypothetical protein HMPREF9449_00056 [Odoribacter laneus YIT
12061]
Length = 835
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 165/549 (30%), Positives = 269/549 (48%), Gaps = 70/549 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GP+P W Q+ LQ KIL R+ LGM P+ PAF+G VP AL+ ++P KI
Sbjct: 192 MGNISQIDGPMPVEWHSDQVELQHKILKRMKLLGMKPICPAFAGFVPLALKRLYPDVKII 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
+ W + ++L + LF IG+ FIE+ KE+G+ + Y D+F+E
Sbjct: 252 ET-TWAGFHN-------WMLSPEEELFTRIGQLFIEEWEKEFGK-NDFYLADSFNEMDVP 302
Query: 119 -PPVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
PP+ + E L G +Y G+++G+ DAVW+MQGW+F Y W ++AL++ VP
Sbjct: 303 FPPIGTKERYDMLAFYGEQVYKGIKAGNPDAVWVMQGWMFGYQRDIWDYETLQALVSKVP 362
Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K+++LDL A+ K +W K F+ +++ ++ N G GIL A G
Sbjct: 363 DDKMMLLDLAADYNKNVWGNGMNWEFYKGFFNKLWVYSVIPNMGGKTGATGILSFYANGH 422
Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+EA S N + G GM+ EG E N VVY+++ + + ++DVK W+ YS+ RYG++
Sbjct: 423 LEALNSPNRGRLFGFGMAPEGTENNEVVYEMICDAGWSSSEIDVKQWLKDYSLCRYGKTC 482
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
P + + W L +VY F D + + G+ S + +
Sbjct: 483 PEMDEVWEGLCKSVYG--------------TFTDHPRFLWQLRPGR--------SGKGTV 520
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
++++ Y RA+E +++ S ++ D +++T L L
Sbjct: 521 NTDSNFY-------------RAVEKMAECAPKMTESPLFKADFLEMTAFYLGGKMEALAS 567
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
I ++Y + ++ ++F EL E +D LL H + L W++ A++ E+ + Y
Sbjct: 568 AIGKSYLYGNTADALKMQQQFEELGEGLDSLLESHPVYRLQRWIDFARKHGDTEKLKDYY 627
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NAR +T+W + DY K WSGL+RDYY PR YF+ E+G +
Sbjct: 628 EMNARRIVTIW-------GPPVSDYACKLWSGLIRDYYLPRWREYFR----CKETGSKYD 676
Query: 526 LKDWRREWI 534
L W +W+
Sbjct: 677 LASWESDWV 685
>gi|322703040|gb|EFY94656.1| alpha-N-acetylglucosaminidase, putative [Metarhizium anisopliae
ARSEF 23]
Length = 774
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 162/542 (29%), Positives = 278/542 (51%), Gaps = 53/542 (9%)
Query: 5 HGWGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 62
WGG LP ++++QQ LQK+I+ R+ ELG+ PVLPAF G VP +++ V P+A +T
Sbjct: 209 RSWGGKGDLPLAFIEQQFELQKQIVTRMVELGITPVLPAFPGFVPESIKKVRPNANLTVS 268
Query: 63 GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 122
NWF+ D ++ LD D + E+ + F+ +Q+ +G +++Y D F+E +P
Sbjct: 269 PNWFAPAPD-KYTRDLFLDPLDDTYAELQKLFVTKQIDAFGNVTNVYTLDQFNELSPASG 327
Query: 123 SPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGK-LVVL 180
Y+ + Y+G+ + + AVWL+QGWL FS FW P++ A L V + ++VL
Sbjct: 328 DTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSSRNFWTQPRIDAYLGGVEDHQGMLVL 387
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL++EV P W + + G P+IWC LH+F GN+ + G + ++ P++A +++ ++VG
Sbjct: 388 DLYSEVNPQWQRTNSYSGKPWIWCQLHDFGGNMALEGRVQTLTSAPIDA-LAQSKSLVGF 446
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNVLYH 297
G++ E E N VVYD++ + A+ +D +A+ + +RY S+P+ + AW +L
Sbjct: 447 GLTPEAYEGNEVVYDILLDQAWSATPLDTQAYFASWVTKRYAGISSIPSELYRAWEILRT 506
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VY+ T TD + V VA + P++ + + T + HP
Sbjct: 507 DVYSNTR--TDIPQ-VPVATYQLRPALSGIA------------------NRTGHFPHPTA 545
Query: 358 WYSTSEVIRA-----LELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+ V++ LE G+ L ++ D +D++RQ L+ + L+ +++ AY+
Sbjct: 546 LHYDPLVLQGVWKLMLEALTRQGS-LWKVPAFQLDFVDVSRQMLSNQFDVLYADLVNAYK 604
Query: 413 LNDAHG-------------VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
+ G V R L L+ +D L F L W+++A +
Sbjct: 605 CSTGAGGSRELRSNTPNCDVKAAGARLLFLLSTLDLTLLTSRHFALQSWVDAASAWGKAA 664
Query: 460 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 519
E + +NAR+Q+T+W Q A+ L DY K W GL+ YY R +I+ ++ +
Sbjct: 665 GNEDLFTFNARSQVTVW----QVNATNLNDYAAKAWGGLVGSYYKGRWSIFVDALVAASS 720
Query: 520 SG 521
SG
Sbjct: 721 SG 722
>gi|187734575|ref|YP_001876687.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
gi|187424627|gb|ACD03906.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
Length = 848
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 165/555 (29%), Positives = 268/555 (48%), Gaps = 74/555 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GPLP SW +Q+ LQ +IL R+ LGM P+ PAFSG VP + ++P AK+
Sbjct: 202 MGNIVNHDGPLPASWHKEQIALQHRILHRMKSLGMTPICPAFSGFVPRGILRLYPEAKLH 261
Query: 61 QL--GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+L G W P+ + L +PLF++IGR ++++ KE+G+ ++ + D+F+E
Sbjct: 262 RLGWGGW------PQKNHAHFLSPEEPLFLKIGRLYMQEWQKEFGKNTY-FLADSFNEME 314
Query: 119 PPVDSP------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNS 171
P + +SSLG IY + S + DAVW+MQGW+F Y W +KALL+
Sbjct: 315 LPENKGGVEARNNMLSSLGEQIYRSISSTNPDAVWVMQGWMFGYQRNIWNADTLKALLSK 374
Query: 172 VPLGKLVVLDLFAEVKPI-------WSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
VP K+++LDL A+ W K F+ P+++ ++ N G M G++D A
Sbjct: 375 VPDDKMLLLDLAADYNKTFWRNGMNWDVFKGFFNKPWVYSVVPNMGGKCAMTGVMDFYAN 434
Query: 225 GPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
G +EA +S + G+GM+ EGIE N V+Y+L+++ A+++ + +V+ ++ Y RYG
Sbjct: 435 GHLEALNSSSRGRLSGMGMAPEGIENNDVIYELITDAAWRNRQENVEQYLENYCRARYGN 494
Query: 284 SVPAIQDAWNVLYHTVY-NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
++++AWN+ T Y N D P + + T G N
Sbjct: 495 YPDSMKEAWNLFRRTAYSNLKD------------HPRFNWQMKPGTRGCSVN-------- 534
Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
++ + ++ L LF+ + L S +R D +++ L NE
Sbjct: 535 -----------------TSEDFLKGLSLFVNT-RGLEQSPLFRQDAVEMAVHYLGIRMNE 576
Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
EA D + F + D LL H + L W+ A+ + E++
Sbjct: 577 AIRAAQEALDEQDQENAEKCMAYFRKYALLADSLLEGHPTWRLSRWISFARSHGTSPEEK 636
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 522
+YE NAR +T W + DY K WSGL+RDYY PR +++ I+S S
Sbjct: 637 NKYEQNARRLVTRW-------GPPVDDYAAKIWSGLIRDYYLPR----WEHFIQSRLSEK 685
Query: 523 GFRLKDWRREWIKLT 537
+ W +W++ T
Sbjct: 686 NPDMGAWEEKWVRST 700
>gi|355706271|gb|AES02588.1| N-acetylglucosaminidase [Mustela putorius furo]
Length = 333
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 146/372 (39%), Positives = 210/372 (56%), Gaps = 50/372 (13%)
Query: 142 DSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 200
D DAVWL+QGWLF + P FW P Q++A+L +VP G+L++LDLFAE +P++ + F+G P
Sbjct: 3 DPDAVWLLQGWLFQHQPQFWGPAQVRAVLGAVPRGRLLILDLFAESQPVYLRTASFHGQP 62
Query: 201 YIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEM 260
+IWCMLHNF GN ++G L+++ GP AR N+TMVG GM+ EGI QN VVY LM+E+
Sbjct: 63 FIWCMLHNFGGNHGLFGALEAVNQGPAAARLFPNSTMVGTGMAPEGIGQNEVVYALMAEL 122
Query: 261 AFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFP 318
++ + V D++AW+ ++ RRYG + AW +L +VYNC+ + T NR +V
Sbjct: 123 GWRKDPVADLEAWVTSFAARRYGVDSKETEVAWRLLLGSVYNCSGEACTGHNRSPLVR-- 180
Query: 319 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNEL 378
PS+ VT +WY+ S V A L +A+ L
Sbjct: 181 --RPSLQMVTT---------------------------VWYNRSAVFEAWRLLLAAAPTL 211
Query: 379 SASNTYRYDLIDLTRQALAKYANELFLNIIEAY------QLNDAHGVFQLSRRFLELVED 432
+ S T+RYDL+D+TRQA + + + AY L A G+ EL+
Sbjct: 212 AKSPTFRYDLLDVTRQAAQELVSLYYTEARTAYLNKELVPLMRAAGIL-----VYELLPA 266
Query: 433 MDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGN 492
+DG+LA FLLG WLE A+ +A +E + YE N R Q+T+W E ++L DY N
Sbjct: 267 LDGVLASDSRFLLGTWLEQARAVAVSETDARFYEQNGRYQLTLW----GPEGNIL-DYAN 321
Query: 493 KYWSGLLRDYYG 504
K +GL+ YY
Sbjct: 322 KQLAGLVAGYYA 333
>gi|393783265|ref|ZP_10371440.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
CL02T12C01]
gi|392669544|gb|EIY63032.1| hypothetical protein HMPREF1071_02308 [Bacteroides salyersiae
CL02T12C01]
Length = 723
Score = 245 bits (625), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 166/574 (28%), Positives = 269/574 (46%), Gaps = 77/574 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL W Q+ LQ +I+ R+ ELGM P+ PAF+G VP A P K
Sbjct: 199 MGNLNTWDGPLSDEWQKSQIELQHQIINRMRELGMQPIAPAFAGFVPMAFAEKHPDIKFK 258
Query: 61 QLGNW--FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
L W F K + Y+L P F EIG+ F+++ KE+G+ ++ Y D+F+E
Sbjct: 259 HL-KWGGFDDKFN-----AYVLPPDSPFFEEIGKRFVKEWEKEFGKNTY-YLSDSFNEME 311
Query: 119 PPVDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNS 171
PV + ++ G +IY + +G+ DA+W+ QGW F Y FW ++ALL+
Sbjct: 312 LPVAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQGWTFGYQHDFWDKASLQALLSH 371
Query: 172 VPLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAF 224
VP K++++DL + + W FYG +I+ + NF G + G L A
Sbjct: 372 VPDDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIFSYVPNFGGKTPLTGDLQMYAT 431
Query: 225 GPVEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGR 283
EA + + ++G G + EG+E N VVY+L+++M + + +D + W+ Y RYG
Sbjct: 432 SSAEALKAPSHGNLIGFGSAPEGLENNEVVYELLADMGWTDQAIDPEQWMPSYCTARYGA 491
Query: 284 SVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
++++AW + T Y+ + ++P + + + +SK
Sbjct: 492 YPESMKNAWELFRKTAYSS-----------LYSYPRFTWQTVIPDQRR-------ISKID 533
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
V + + + +ELF+AS + L+ S Y D I+ +A A++L
Sbjct: 534 V----------------SDDFLHGIELFLASADSLNRSKLYVNDAIEFASYYIAAQADKL 577
Query: 404 FLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK 463
+ + +Q + ++L+ ++D LLA H + L W+E A+ ++
Sbjct: 578 YKQALTEDTAGKPVAAYQHLNQAIDLLLNVDKLLASHPLYRLEEWVELARNSGTTPAEKD 637
Query: 464 QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDG 523
YE NA+ IT W DY ++WSGL++DYY PR +YF S + GD
Sbjct: 638 AYEANAKRLITTW-------GGFQEDYAARFWSGLIKDYYIPRLKLYF-----SKQRGD- 684
Query: 524 FRLKDWRREWIKLTNDWQNGRNVYPVESNGDALI 557
L +W EWI+ W N P E D I
Sbjct: 685 --LDNWEEEWIR--TPWHN--TTTPFEKPLDMAI 712
>gi|395804724|ref|ZP_10483959.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
gi|395433112|gb|EJF99070.1| alpha-N-acetylglucosaminidase [Flavobacterium sp. F52]
Length = 722
Score = 245 bits (625), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 164/547 (29%), Positives = 267/547 (48%), Gaps = 46/547 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLPQ W ++ LQKKIL R+ L M+PV+PAFSG VP A P AKIT
Sbjct: 208 MGNINSLEGPLPQEWFVKKEALQKKILERMKALDMHPVVPAFSGYVPKAFAEKHPEAKIT 267
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W S + T+LLD+ DPLF +IG+ FIE K YG+ S+ Y D+F+E PP
Sbjct: 268 ELKSW----SGGGFASTFLLDSKDPLFKQIGKRFIEIYTKMYGK-SNFYLADSFNEIEPP 322
Query: 121 V---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V + E +S+ G+A+Y + AVW+MQGWLF + FW KA L+ VP K
Sbjct: 323 VSEHNKYEELSNYGSAVYETIDEAAPGAVWVMQGWLFGDNKEFWTKEATKAFLSKVPNEK 382
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT- 235
++V D + +W + FYG + + +HN+ G+ +YG L+ + N
Sbjct: 383 VMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKDELASLLKNPNRG 442
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
+VG G EG+ N +VY+ + ++ + + + W+ +Y RYG++ ++ AW +L
Sbjct: 443 NIVGYGAMPEGLNNNSIVYEYIYDLPWTKAEQPLNDWMAKYLNARYGQTSESVFHAWELL 502
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
+VYN T D A+ + +TE K G P K
Sbjct: 503 LKSVYNVKYWETRWWNDWAGAYLLFKRPTVKITEFK----GNPGDK-------------- 544
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
++ AL++ + + +N +YDLID++R + +E + I+AYQ +
Sbjct: 545 ------IKLKEALDILKKEAKKYNKNNLIQYDLIDVSRHYNSLSIDEELIECIKAYQEKN 598
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
QL ++ + V + D +++ L W++SA + E Y NA+T IT+
Sbjct: 599 IAKGDQLFKQIEKQVLETDKMMSGQPLNNLNQWVKSASDYGSSPEVSSLYAKNAKTLITL 658
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKD 528
W L DY ++ W G+ + +Y PR ++ + + ++ + F +K+
Sbjct: 659 WGGEGH-----LNDYASRSWKGMYKGFYWPRWKMFLEALKKAAVTNTSFDENKERESIKN 713
Query: 529 WRREWIK 535
W W +
Sbjct: 714 WEINWTE 720
>gi|322699924|gb|EFY91682.1| alpha-N-acetylglucosaminidase, putative [Metarhizium acridum CQMa
102]
Length = 775
Score = 244 bits (624), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 167/566 (29%), Positives = 283/566 (50%), Gaps = 53/566 (9%)
Query: 5 HGWGGP--LPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQL 62
WGG LP ++++ Q LQKKI+ R+ ELG+ PVLPAF G VP +++ V P +T
Sbjct: 211 RSWGGKGDLPLAFIELQFELQKKIVARMVELGITPVLPAFPGFVPESIKKVRPDVNLTVS 270
Query: 63 GNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVD 122
NWF+ D ++ LD D + E+ R F+ +Q+ +G ++IY D F+E +P
Sbjct: 271 PNWFAPAPD-KYTRDLFLDPLDDTYAELQRLFVSKQMDAFGNVTNIYTLDQFNELSPASG 329
Query: 123 SPEYISSLGAAIYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGK-LVVL 180
Y+ + Y+G+ + + AVWL+QGWL FS FW P++ A L V + ++VL
Sbjct: 330 DTAYLRGIARNTYAGLTAANPAAVWLLQGWLFFSSRRFWTQPRIDAYLGGVEDDQGMLVL 389
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL++E P W + + G P+IWC LH+F GN+ + G + ++ P++A +++ ++VG
Sbjct: 390 DLYSEANPQWQRTNSYSGKPWIWCQLHDFGGNMALEGRVQTLTSAPIDA-LAQSESLVGF 448
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG--RSVPA-IQDAWNVLYH 297
G++ E E N VVYD++ + A+ +D + + + +RY S+P+ + AW +L
Sbjct: 449 GLTPEAYEGNEVVYDILLDQAWSATPLDTQTYFASWVTKRYAGVSSIPSELYRAWEMLRT 508
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VY+ T TD + V VA + P++ + + T + HP
Sbjct: 509 DVYSNTR--TDIPQ-VPVATYQLRPALSGIA------------------NRTGHFPHPTA 547
Query: 358 WYSTSEVIR-----ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+ V++ LE G+ L ++ D +D++RQ L+ + L+ +++ AY+
Sbjct: 548 LHYDPLVLQEAWKLMLEAMTRQGS-LWKVPAFQLDFVDVSRQMLSNQFDVLYADLVNAYK 606
Query: 413 LNDAHGVFQL------------SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
+ A G +L R L L+ +D L F L W+++A +
Sbjct: 607 CSAAGGSRELRSSAPSCDVEAAGARLLSLLSTLDLTLLTSRHFTLQSWVDAAGSWGKAAG 666
Query: 461 QEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
E + +NAR+Q+T+W Q +A+ L DY K W GL+ YY R +I+ ++ + S
Sbjct: 667 NEDLFTFNARSQVTVW----QVDATNLNDYAAKAWGGLVGSYYKGRWSIFVDALVAASNS 722
Query: 521 GDGFRLKDWRREWIKLTNDWQNGRNV 546
G RE +WQ G +
Sbjct: 723 G-SLDEGALTRELEAFEAEWQAGEHA 747
>gi|146300873|ref|YP_001195464.1| alpha-N-acetylglucosaminidase [Flavobacterium johnsoniae UW101]
gi|146155291|gb|ABQ06145.1| Candidate alpha-glycosidase; Glycoside hydrolase family 89
[Flavobacterium johnsoniae UW101]
Length = 723
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 166/548 (30%), Positives = 266/548 (48%), Gaps = 47/548 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++ GPLPQ W ++ LQKKIL R+ L M+PV+PAFSG VP A P AKIT
Sbjct: 208 MGNINSLEGPLPQEWFSKKEELQKKILERMRTLDMHPVVPAFSGYVPKAFAEKHPEAKIT 267
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L +W S + T+LLD+ DPLF +IG+ FIE K YG+ S+ Y D+F+E PP
Sbjct: 268 ELNSW----SGGGFESTFLLDSKDPLFKKIGKRFIEIYTKMYGK-SNFYLADSFNEIEPP 322
Query: 121 V---DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V + E +++ G+AIY ++ AVW+MQGWLF + FW A L+ VP +
Sbjct: 323 VTEHNKYEELANYGSAIYETIEEAAPGAVWVMQGWLFGDNKNFWTKEATSAFLSKVPNDR 382
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVE-ARTSENT 235
L+V D + +W + FYG + + +HN+ G+ +YG L+ V +
Sbjct: 383 LMVQDYANDRYKVWENQEAFYGKQWTYGYVHNYGGSNPVYGDLNFYKNELVSLLKNPHRG 442
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQDAWNV 294
+VG G EG+ N +VY+ + ++ + + VK W+ Y RY ++ ++ AW +
Sbjct: 443 NVVGYGAMPEGLNNNAIVYEFIYDLPWSKGEQSVKDWLTNYLNARYEQKTSDSVFKAWEL 502
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L +VY+ T D A+ ++TE K G P K+
Sbjct: 503 LLESVYSTKYWETRWWNDRAGAYLLFKRPTATITEFK----GNPGDKD------------ 546
Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
++ AL++ A + N +YDLID +R + +E + ++AYQ
Sbjct: 547 --------KLKEALDILKAEAKKYDKKNFIQYDLIDASRHYYSLSIDEDLVECVKAYQQK 598
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 474
D QL ++ + V ++D ++ L W++SA + E K Y NA+T IT
Sbjct: 599 DITKGDQLFKKIEKQVLEIDKSMSGQPLNSLNYWVKSASEYGSTPEVSKLYVKNAKTLIT 658
Query: 475 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESL-------ESGDGFRLK 527
+W L DY ++ W G+ + +Y PR ++ ++ E+ + +K
Sbjct: 659 LWGGEGH-----LNDYASRSWQGMYKGFYWPRWKMFLTAFKKTAVNNTPFDETKEREEIK 713
Query: 528 DWRREWIK 535
+W +W K
Sbjct: 714 NWEIKWTK 721
>gi|225875033|ref|YP_002756492.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
51196]
gi|225793771|gb|ACO33861.1| alpha-N-acetylglucosaminidase [Acidobacterium capsulatum ATCC
51196]
Length = 800
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 164/558 (29%), Positives = 262/558 (46%), Gaps = 45/558 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL + P+ +S LD+++ ++I+ R+ ELG+ PV P + G VP P A +
Sbjct: 241 MGNLCCFDEPISRSLLDRRIRSAQQIIRRLRELGITPVFPGYFGMVPEDFARRHPGAHVI 300
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
GNW + P W LD DPLF + +F + Q + +G +S IY+ + F E
Sbjct: 301 PQGNWNGFRR-PAW-----LDPRDPLFAAVAASFYKHQQELFGDSS-IYDIELFQEGGSA 353
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P +SS AI + A+W+ W +P +ALL++V L+V+
Sbjct: 354 ADVP--VSSAAKAIQKALLRAHPQAMWMTLAW--QNNP------SRALLSAVDRSHLLVV 403
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D+ P + + F G Y++ L +F G + L A +TM G
Sbjct: 404 DIDQGRTPHENRERDFMGAAYLFGGLWDFGGRTTLGANLYDYAVRLPRMGLRAGSTMKGT 463
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
+ EG++ NP +DL +EMA++ VD++ W +Y+ RRYG P + AW +L T Y
Sbjct: 464 ALFSEGLDNNPAAFDLFTEMAWRTSPVDLRTWSREYARRRYGMDDPHTRRAWRILMETAY 523
Query: 301 NC-TDGATDKN-RDVIV-AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
DG ++ RD + D PS+ +V+ SS+ L
Sbjct: 524 GTRADGVSNHGERDAPPESLFDAQPSLDAVS--------------------ASSWSPDRL 563
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y + AL + + + TY+YDL+D+ RQ LA ++ + I +AY
Sbjct: 564 RYDPKKFEAALTELLQAPPGMREMPTYQYDLVDVARQTLANWSRKTLPEIKDAYDHRHEA 623
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
L +++L ++ D LLA + F++GPWL + A ++++ +++AR+ +T W
Sbjct: 624 RFETLEKQWLCMMMLQDKLLATNTSFMVGPWLNAVSPWAATATEQRRLDYDARSILTTWG 683
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
+ T EA LRDYGNK W+GL RDYY R IYF + SL++G D W
Sbjct: 684 NRTASEAG-LRDYGNKDWAGLTRDYYYRRWQIYFNDLDRSLKTGTPPHPID----WFAFG 738
Query: 538 NDWQNGRNVYPVESNGDA 555
W + Y ++ GD+
Sbjct: 739 EKWNRAQTHYATQARGDS 756
>gi|160914140|ref|ZP_02076362.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
gi|158433951|gb|EDP12240.1| hypothetical protein EUBDOL_00149 [Eubacterium dolichum DSM 3991]
Length = 2150
Score = 242 bits (618), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 151/515 (29%), Positives = 262/515 (50%), Gaps = 50/515 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ +GGPLP W +Q++ L +K+ R+ G++PV+ F G VP + A +T
Sbjct: 387 MQNLYSFGGPLPDDWFEQRVELGRKMHDRMQAFGIDPVIQGFCGQVPMSFVEKNEGAVLT 446
Query: 61 QLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
+ W S + P TYL F ++ + F E+Q +G S Y D F
Sbjct: 447 PIDEWPSF-TRPAMIKTYLSQEEIAAGKKDYFKDVAKTFYEKQKNVFGDVSDYYASDPFH 505
Query: 116 E--NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
E NT +D ++++ + M ++DA+W+MQ W + D ++ L V
Sbjct: 506 EGGNTQGLD----VTNIFKTVQEEMLKSNADAIWVMQQWQGNLDH----AKLSGL---VK 554
Query: 174 LGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE 233
+ + LDL +++ P S+ + G+ +IWCMLHNF G + + G ++ IA P A S
Sbjct: 555 PEQALALDLQSDMNP--SSVMENEGISWIWCMLHNFGGRMGLDGEVEVIAKEPAIA-ASN 611
Query: 234 NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWN 293
N M G+G++ E +E +P+VY+++ +M + + +D +AW+++Y+ RR G S ++Q+AW+
Sbjct: 612 NQYMKGIGITPEALENSPIVYEMLFDMTWSKDPIDYQAWVDKYATRRAGGSSDSLQEAWD 671
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
+L T Y +D + + ++I+ G N+ S S++
Sbjct: 672 MLLETAY----------KDKGIYYQGAGETVINARPG--TNF-----------SSASTWG 708
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H ++ Y E+ + L L I + + +AS YRYDL D+ Q L A E +++A
Sbjct: 709 HSNILYDKEELDKVLSLLIENYDAFAASEAYRYDLADVAEQVLCNAAIEYHALMVQALNN 768
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNART 471
D+ ++S FLEL++ D +L + F+LG W+ A+++ N + + +E+NAR
Sbjct: 769 KDSAEFKRISTHFLELIDLSDRILGSSEEFMLGTWIHDAREMLDNADDWTKDLFEFNARA 828
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 506
+T W E + L+DY N+ W+GL +Y R
Sbjct: 829 VVTTW---GGERSGSLKDYSNRKWAGLTSSFYKER 860
>gi|210631701|ref|ZP_03296968.1| hypothetical protein COLSTE_00853, partial [Collinsella stercoris
DSM 13279]
gi|210159960|gb|EEA90931.1| F5/8 type C domain protein, partial [Collinsella stercoris DSM
13279]
Length = 1906
Score = 242 bits (618), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 166/536 (30%), Positives = 263/536 (49%), Gaps = 42/536 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP SW +Q++ L ++I R+ G++PV+ F G VP Q P++
Sbjct: 354 MQNLYSVGGPLPDSWFEQRVELARRIHDRMQTYGIDPVIQGFGGQVPTDFQQKNPNSVAA 413
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G+W S + P TYL DA + F ++G F E Q + +G+ SH Y D F
Sbjct: 414 SSGSW-SGFARPYMIKTYLTDADRAAGKEDYFQKVGTTFYEAQERIFGKVSHFYAVDPFH 472
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E V I + + M D AVW+MQ W + D L
Sbjct: 473 EGGT-VPQGFNIVDIYRTVQQKMLDYDPQAVWVMQQWQWGIDE-------NKLSGLAKKE 524
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ ++ + VP++W MLHNF G + M G+ + +A +A S N
Sbjct: 525 QSLVLDLQSDLRS-QASPMENQQVPWVWNMLHNFGGRMGMDGVPEVLAIKIPQAYNS-NR 582
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD +AW Y RRYG + IQ+AW++L
Sbjct: 583 YMRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRAWTRSYIERRYGGTDAKIQEAWDIL 642
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
T Y DG + SI++ +P + S++ H
Sbjct: 643 LDTAYKHVDGEY---------YQGASESIMNA---------RPSDNKI---GSASTWGHS 681
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
+ Y E RA +LFI S + S +RYD +D+ RQ LA E +AY+ D
Sbjct: 682 DIDYDKKEFERAAQLFIESYDTYKDSEAFRYDFVDVMRQVLANAFQEYQPLAGDAYKQRD 741
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQI 473
A L+ + LE+++ D +L+ F+LG W+E+A+ L ++ + +E NAR+ I
Sbjct: 742 AERFELLANQMLEMLDAQDRMLSTSSDFMLGTWIENARTLLEDADDWTADLFELNARSLI 801
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
T W ++ SL+ DY N+ WSGL YY PR + ++LE G + +W
Sbjct: 802 TTW--GLEKNGSLI-DYSNRQWSGLTGSYYKPRWESWANARKKALEDGGSAQDLNW 854
>gi|393785795|ref|ZP_10373941.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
CL02T12C05]
gi|392661414|gb|EIY55000.1| hypothetical protein HMPREF1068_00221 [Bacteroides nordii
CL02T12C05]
Length = 724
Score = 241 bits (616), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 159/549 (28%), Positives = 256/549 (46%), Gaps = 69/549 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL W + Q+ LQ +I+ R+ ELGM+P+ PAF+G VP A P K
Sbjct: 199 MGNLNTWDGPLSDEWQEGQIQLQHQIINRMRELGMSPIAPAFAGFVPMAFAEKHPDIKFK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W Y+L P F EIG+ F+++ KE+G+ ++ Y D+F+E P
Sbjct: 259 HL-KWGGFDDK---FNAYVLPPDSPFFEEIGKRFVKEWEKEFGKNTY-YLSDSFNEMELP 313
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
V + ++ G +IY + +G+ DA+W+ QGW F Y FW ++ALL+ VP
Sbjct: 314 VAKDDVEGKHKLLAQYGESIYRSITAGNPDAIWVTQGWTFGYQHSFWDKASLQALLSHVP 373
Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + + W FYG +I+ + NF G M G L A
Sbjct: 374 DDKMIIIDLGNDYPKWVWGTEQTWKVHDGFYGKKWIFSYVPNFGGKTPMTGDLQMYASSS 433
Query: 227 VEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
EA SE + ++G G + EG+E N VVY+L+++M + + +D+ W+ Y + RYG
Sbjct: 434 AEALQSESHGNLIGFGSAPEGLENNEVVYELLADMGWTDQAIDLDKWMPSYCMARYGAYP 493
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
++DAW++ T Y+ ++ PD K +
Sbjct: 494 ETMKDAWDLFRKTAYSSLYSYPRFTWQTVI--PD---------------------KRRIS 530
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
K + S + + +ELF+ S + L S Y D I+ +A A++L+
Sbjct: 531 KIDVS-----------DDFLHGVELFLNSADSLKNSKLYVNDAIEFASYYIAAKADKLYG 579
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
+ + + Q + ++++ ++D LLA H + L W+ A+ ++ Y
Sbjct: 580 KALAEDTVGRSAVAQQYLNQTIDMLLNVDKLLASHPLYRLEEWVNFARNSGTTPAEKDAY 639
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W DY ++WSGL++DYY PR IYF SL+
Sbjct: 640 EINAKRLITTW-------GGFQEDYAARFWSGLIKDYYIPRLKIYFSKQRGSLD------ 686
Query: 526 LKDWRREWI 534
+W EWI
Sbjct: 687 --NWEEEWI 693
>gi|126347839|emb|CAJ89559.1| putative alpha-N-acetylglucosaminidase [Streptomyces ambofaciens
ATCC 23877]
Length = 740
Score = 241 bits (614), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 154/556 (27%), Positives = 252/556 (45%), Gaps = 44/556 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+GGP+ + L+ + L ++I R+ +LGM PVLP + G VP P +
Sbjct: 213 MQNMSGFGGPVSERLLEDRADLGRRIADRLRQLGMTPVLPGYYGTVPPGFTERNPVGPVV 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W + P W LD +F + AF Q + +G TS +Y D E P
Sbjct: 273 PQGDWVGFER-PDW-----LDPRSAVFPRVAAAFYRHQRELFG-TSTMYKMDLLHEGGRP 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P + A+ +Q+ AVW + GW + P + +++++ +L+++
Sbjct: 326 GNVP--VRDAAQAVMKALQTARPGAVWTLIGWQNN-------PSTQ-IIDAIDKRRLLIV 375
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D ++ ++G PY + + NF G+ M A + RT + + G+
Sbjct: 376 DGLSDRYDGLDREATWHGAPYAFGTIPNFGGHTTMGANTAVWAERFDQWRTKAGSALAGI 435
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
EG NPV Y+L +E+A++ E VD + W +Y+ RRYG + P AW +L Y
Sbjct: 436 AYMPEGTGGNPVAYELFTELAWRTEPVDQRKWFAEYAQRRYGGADPHAASAWELLRSGPY 495
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ G +++D + ++ + + +S+ + Y
Sbjct: 496 STPSGTWSESQDSLF-----------------------TARPRLTATNAASWSPGAMRYD 532
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
V RAL + L A++ YR+DL+D+ RQ LA + L I AY D
Sbjct: 533 PGTVRRALTELVRVAPALRATDAYRFDLVDVARQVLANRSRTLLPQIKAAYDAEDLPRFR 592
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
+ + + +D LLA FLLGPWLE AK + E + E++AR+ +T W +
Sbjct: 593 ARAAEWKNCLSLLDRLLATDARFLLGPWLEDAKSWGRTEAERAAAEFDARSILTTWGHRS 652
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
+A LRDY N+ WSGL+ D+Y R Y + +L +G D W L +DW
Sbjct: 653 GSDAGGLRDYANREWSGLVSDFYAMRWTKYLDSLDTALVTGRPPVAID----WFALEDDW 708
Query: 541 QNGRNVYPVESNGDAL 556
R+ YPV +GD +
Sbjct: 709 NRQRDGYPVRPSGDPV 724
>gi|282881077|ref|ZP_06289764.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
5C-B1]
gi|281304881|gb|EFA96954.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella timonensis CRIS
5C-B1]
Length = 688
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 159/522 (30%), Positives = 250/522 (47%), Gaps = 46/522 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ Q +D + LQ KIL R+ +LG+ PV+ F G VP+ L + +P A +
Sbjct: 164 MGNLEGWGGPMSQQMIDDRYKLQIKILRRMRQLGIEPVVQGFPGIVPSFLHDKYPKACVV 223
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W + +L LF + +A+ + + YG + D F E N
Sbjct: 224 SQGKWNGFQRPS------ILLPQSQLFYCMAKAYYDNMKRYYGTDLRYFGGDLFHEGGNA 277
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD +SS + + M S DA W++QGW + P ALL + ++
Sbjct: 278 KGVD----LSSTASKVQKCMLSHFPDAKWVLQGWNGNPSP--------ALLAGLDKKHVL 325
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTM 237
+++L E+ W S +F P+IW +++F G +M G L + P A S++ +
Sbjct: 326 LINLAGEIDASWKQSDEFGQTPWIWGSVNHFGGKTDMGGQLPVLVEQPHRALAASQHGRL 385
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G+ EGI NPVVYDL + A+ V + QY RYG + AW +L
Sbjct: 386 KGLGILPEGIHTNPVVYDLALQTAWSDTVPSVDHLLRQYIWYRYGTWNDDLYRAWQLLAS 445
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VY EG Y++ ++ ++ S S++ +
Sbjct: 446 SVYG---------------------EFEVKGEGTYESVF--CARPSLHVSSVSTWGPKKM 482
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y ++++AL LF + S TY YDL+DL RQ +A A ++ ++ AY D+
Sbjct: 483 QYQPEKLLQALVLFRKAAVHFKGSETYEYDLVDLARQVMANNARNVYNQVVHAYNEKDSL 542
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + S FL L++ D LL+ + FLLG WL++A+Q +NE+ ++Q NART I+ W
Sbjct: 543 ALNRYSSTFLHLIDLQDSLLSTNKFFLLGKWLQAARQYGENEQDQRQALVNARTLISYW- 601
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 519
+ + L DY NK W+GLL+ YY PR +F + L
Sbjct: 602 -GPDDATTRLHDYANKEWAGLLKQYYAPRWRAFFAMLAGQLR 642
>gi|296115989|ref|ZP_06834611.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
23769]
gi|295977458|gb|EFG84214.1| alpha-N-acetylglucosaminidase [Gluconacetobacter hansenii ATCC
23769]
Length = 758
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 161/581 (27%), Positives = 273/581 (46%), Gaps = 62/581 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGP+P+ ++++ V ++I+ R+ ELGM PVLP F G VP FP A +
Sbjct: 222 MANMCCYGGPVPRELIEKRAVSAQQIIGRMRELGMRPVLPGFYGMVPDDFGKRFPQAHVI 281
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD DP+F ++ + ++Q K +G + +Y+ F E P
Sbjct: 282 GQGEWNRFRR-PAW-----LDPRDPMFAKVAAIYYDEQKKLFG-DAPVYDIQPFQEGGTP 334
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ G I + + A+W++ W D +L V +L ++
Sbjct: 335 GDVP--LADAGQGIQKALDTAHPGAMWMLMAWYEEPD--------ARMLAGVDRKRLFIV 384
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG---PVEARTSENTTM 237
DL + + F G P+++ L +F G + G S +G P RT +N M
Sbjct: 385 DLEQNTRVRENRDADFQGAPFLYGGLWDFGGRTSLGG--SSYDYGVRLPGLWRTQKN--M 440
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA-IQDAWNVLY 296
+G + EG++ NP ++DL +E A++ + VD W Y+ RRYG+ + AW++L
Sbjct: 441 IGTAVFPEGMDNNPYIFDLFTEAAWRRDGVDTTQWTRDYADRRYGQPGDVHARKAWDLLL 500
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP-VSKEAVLKSETSSYDHP 355
H+ + S Q++G+ + +++ ++ S H
Sbjct: 501 HSAF-------------------------SYRATGIQDFGEASAAPDSLFNAQPSLDTHS 535
Query: 356 HLW-------YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNII 408
W Y V A+ + + + A+ YRYDL+D+TRQA+A A + I
Sbjct: 536 AAWNGMKVLPYDPHLVEAAMAELLQASDATRATEAYRYDLVDVTRQAVANQARAMLPQIG 595
Query: 409 EAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWN 468
+A+ D + L+ R+LEL++ D LLA + F +G WL + + + Q K +++
Sbjct: 596 DAFAARDRAKLHALTTRWLELMDRQDSLLATNTFFRVGTWLSWPQAWSDDPAQRKLMDYD 655
Query: 469 ARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD 528
AR +T W T + LRDY NK W+GL +DYY R ++F + SL +G R D
Sbjct: 656 ARVILTNWGGRTASQVGHLRDYANKDWAGLTKDYYRVRWQLFFDSLETSLATGRPPREID 715
Query: 529 WRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG 569
W K+ +W + VY GD+ ++ +++ QG
Sbjct: 716 ----WYKVGEEWCHNGRVYSPTPEGDSYTVARDIHDYLTQG 752
>gi|365104185|ref|ZP_09333846.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
4_7_47CFAA]
gi|363644798|gb|EHL84079.1| hypothetical protein HMPREF9428_02927 [Citrobacter freundii
4_7_47CFAA]
Length = 1049
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 159/556 (28%), Positives = 264/556 (47%), Gaps = 43/556 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 361 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 420
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 421 DQGEWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 478
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D DAVW++Q W + A LN + ++L
Sbjct: 479 ADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW--------QENPTDAFLNGLKKDHALIL 528
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+A+ KP + +F P+IW MLH F G + G+ + +A + +E+ M GV
Sbjct: 529 DLYADNKPNHAMRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLA-QEIPQSLAESKKMKGV 587
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E + NP++Y+++ +MA++ + A+I+ + RYG P I+ AW+++ T Y
Sbjct: 588 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 647
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+ R + SII G +G V++ + YD
Sbjct: 648 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 684
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+E + L L+++ + A+ Y++DL+D+TRQ LA + E + +A+ D
Sbjct: 685 -AEFEKILPLYLSVYDHFKANPAYQHDLVDITRQVLANASYEYYRAFEDAWIAKDYSAFN 743
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFD 478
QLS +FL L++ D +L+ F+LG W+ SA+ + ++ Q+E+NAR +T W
Sbjct: 744 QLSGKFLRLIKLQDQVLSTRPEFMLGTWINSARTMLDGMDDWTRDQFEFNARAMVTTWGT 803
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
+A LRDY N+ W GL D+Y R A + + + + +G + + W L
Sbjct: 804 EQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALKSAAATGQ--KQDAIKVNWFPLEY 860
Query: 539 DWQN-GRNVYPVESNG 553
W N N YP + +G
Sbjct: 861 RWVNQSGNGYPTQPSG 876
>gi|440731409|ref|ZP_20911430.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
gi|440373101|gb|ELQ09870.1| N-acetylglucosaminidase, partial [Xanthomonas translucens DAR61454]
Length = 732
Score = 239 bits (610), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 172/594 (28%), Positives = 263/594 (44%), Gaps = 83/594 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ + PLPQ W++ + LQ++IL R+ LGM PVLPAFSG VP A P A+I
Sbjct: 168 MGNIEAYDAPLPQQWIEDKYALQQRILQRMRTLGMKPVLPAFSGYVPKAFAQAHPQARIY 227
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I + FI+ + YG+ ++ Y D F+E PP
Sbjct: 228 RMRAWEGFHE------TYWLDPADPLFTKIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 280
Query: 121 VDS----------------------PEY--------ISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + PE ++ G A+Y + + DAVW+MQ
Sbjct: 281 IAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 340
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 341 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 400
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ G+ +YG +AF + R + +VG G EG+ N VVY+ M +A+
Sbjct: 401 YGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFGAFPEGLHDNSVVYEYMYTLAWGG 457
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
++ ++ W+ Y+ RYG + PA++ AW+ L V + P
Sbjct: 458 QQRSLQDWLGDYTRARYGHTSPALRAAWDDLQAAVLSTRYWT---------------PRW 502
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
G Y + +P + + E + D P L RAL+ +A E + + Y
Sbjct: 503 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLY 553
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
RYDL+D R + + AY+ D R V+ +DGL+ +
Sbjct: 554 RYDLVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDAAFARVQAAVQQLDGLVGGQQE-I 612
Query: 445 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 504
L WL A+ A+ + Y +A+ QI++W L DY +K W G+ DYY
Sbjct: 613 LSSWLGDAEGDAKTPQDAAYYRRDAKAQISVWGGEGN-----LGDYASKAWQGMYADYYL 667
Query: 505 PRAAIYFKYMIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPVES 551
PR A+ + + + SG RL+ W R+W+ + PV +
Sbjct: 668 PRWALAMQALRAAAVSGGSVDEAALQQRLRVWERDWVACETPYTRRAPADPVAA 721
>gi|291086028|ref|ZP_06354661.2| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
ATCC 29220]
gi|291069185|gb|EFE07294.1| alpha-N-acetylglucosaminidase family protein [Citrobacter youngae
ATCC 29220]
Length = 1014
Score = 239 bits (609), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 159/556 (28%), Positives = 263/556 (47%), Gaps = 43/556 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 326 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 385
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 386 DQGDWVGFVRPPM-LRTYVKQGAD-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 443
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D DAVW++Q W + A LN + ++L
Sbjct: 444 ADLD--MVKVAQTVQNKMLEHDKDAVWIIQNW--------QENPTDAFLNGLKKDHALIL 493
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+A+ KP + +F P+IW MLH F G + G+ + +A + +E+ M GV
Sbjct: 494 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLA-QEIPQSLAESKYMKGV 552
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E + NP++Y+++ +MA++ + A+I+ + RYG P I+ AW+++ T Y
Sbjct: 553 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 612
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+ R + SII G +G V++ + YD
Sbjct: 613 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 649
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D
Sbjct: 650 -AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFN 708
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFD 478
QLS +FL L++ D +L F+LG WL SA+ + ++ Q+E+NAR +T W
Sbjct: 709 QLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLDGMDDWTRDQFEFNARAMVTTWGI 768
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
+A LRDY N+ W GL D+Y R A + + + + +G + + W L
Sbjct: 769 EQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALKNAAATGQ--KQDAIKVNWFPLEY 825
Query: 539 DWQNGR-NVYPVESNG 553
W N N YP + +G
Sbjct: 826 RWVNQTGNGYPTQPSG 841
>gi|380512475|ref|ZP_09855882.1| N-acetylglucosaminidase [Xanthomonas sacchari NCPPB 4393]
Length = 785
Score = 239 bits (609), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 166/577 (28%), Positives = 259/577 (44%), Gaps = 83/577 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W++ + LQ +IL R+ LGM PVLPAF+G VP A P A+I
Sbjct: 221 MGNIEGYDAPLPQQWIEDKHALQLRILQRMRALGMKPVLPAFAGYVPKAFAQAHPQARIY 280
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE---- 116
++ W TY LD DPLF +I + FI+ + YG+ ++ Y D F+E
Sbjct: 281 RMRAWEGFHE------TYWLDPADPLFAQIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 333
Query: 117 --------------------------NTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
PPV + +++ G A+Y+ + + DAVW+MQ
Sbjct: 334 IAADGSDARLASYGDSTANTAKTKPPEVPPVQRDKRLAAYGRALYASIHRANPDAVWVMQ 393
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 394 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 453
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ G+ +YG +AF + R + +VG G EG+ VVY+ M +A+
Sbjct: 454 YGGSNPVYG---DLAFYREDLRALLADKDKQQLVGFGAFPEGLHTTSVVYEYMYALAWGA 510
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
++ ++ W++ Y+ RYG + PA++ AW+ L +V + P
Sbjct: 511 QQRPLQDWLDDYTRARYGHTSPALRAAWDDLQASVLSTRYWT---------------PRW 555
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
G Y + +P + + E + D P L RAL+ +A E + + Y
Sbjct: 556 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALQQLLALAPEYADAPLY 606
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
RYDL+D R + + AY+ D + R E V +D L+
Sbjct: 607 RYDLVDFARHYATGRVDVQLQQAVAAYRRGDVAAGDAATARVREAVTQLDSLVGGQQD-T 665
Query: 445 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 504
L WL++A A + Y +A+ Q+++W L DY +K W G+ DYY
Sbjct: 666 LSSWLDAAAGYATTPQDAAYYRRDAKAQVSVWGGEGN-----LGDYASKAWQGMYADYYL 720
Query: 505 PRAAIYFKYMIESLESGDGF-------RLKDWRREWI 534
PR + + + E+ +G RL+ W R+W+
Sbjct: 721 PRWTLALQMLSEAAVAGGSVDEAQLQQRLRAWERDWV 757
>gi|421734750|ref|ZP_16173809.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
gi|407077324|gb|EKE50171.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum LMG 13195]
Length = 1919
Score = 238 bits (607), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 163/538 (30%), Positives = 261/538 (48%), Gaps = 46/538 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 362 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 421
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 422 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 480
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 481 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 532
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ +++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 533 QTLVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 591
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ AW++L
Sbjct: 592 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 650
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y TDG + +I A P D +I S S++
Sbjct: 651 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 687
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 688 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 747
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNART 471
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 748 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARA 807
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 808 LVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 862
>gi|410634789|ref|ZP_11345419.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
gi|410145665|dbj|GAC22286.1| alpha-N-acetylglucosaminidase [Glaciecola arctica BSs20135]
Length = 750
Score = 238 bits (607), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 168/587 (28%), Positives = 288/587 (49%), Gaps = 75/587 (12%)
Query: 1 MSNLHGWGG--PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAK 58
M NL W G LP+S+ D+Q+ L KIL R+ LGM P++ AF+G VP A +FP A+
Sbjct: 211 MGNLITWDGGDKLPESYFDEQIALNHKILKRLRSLGMTPIVHAFAGFVPPATSELFPEAQ 270
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE-N 117
I +L +W P YLL +PLF++IG+ +IE+ KE+G+ + Y D+F+E +
Sbjct: 271 IRRL-SWGG--GLPESTYGYLLSPENPLFVKIGKMYIEEWQKEFGKNEY-YLADSFNEMD 326
Query: 118 TPPVDS-PEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP------FWRPPQMKA 167
PP D+ E ++ L G +Y +++ + DA W+MQGW F Y FW P ++ A
Sbjct: 327 VPPADTEAELLTELAGYGDRVYQSIKAANPDATWVMQGWTFPYHKDENRQLFWTPERLHA 386
Query: 168 LLNSVPLGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILD 220
L++ VP KL++LDL E + P W F+ +I+ + N G + G D
Sbjct: 387 LVSKVPDDKLLILDLANEYNKLWWKIDPSWKMYSGFFNKKWIYSFIPNMGGKTPLNGRFD 446
Query: 221 SIAFGPVEART-SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVR 279
A P++A + ++G G + EGIE N ++Y+L+++MA+Q + +DV W +Y+++
Sbjct: 447 IYAELPIDALNYKDKGNLIGFGFAPEGIENNEMIYELLTDMAWQRKAIDVDQWQAKYAMQ 506
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 339
RYG +++ A++ L N+ + +F VD I Y+N P
Sbjct: 507 RYGAYPGSLEKAFSYL--------------NKSALGSF--VDHPIHRFQLRPYRN---PE 547
Query: 340 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
E DH + + + + I+A LF+ + +L + Y++D +++T L+
Sbjct: 548 GVE----------DHATV-HESEDFIKATGLFLQASEQLKDNKLYQHDAMEITTLFLSLV 596
Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
+ L + + V + + ++ MD LLA H L W++ A+
Sbjct: 597 TDNLLTKFLAKDVEQRDYSVLDEA---ISVMHTMDKLLAEHPNHQLVTWVDYARTWGSTT 653
Query: 460 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 519
++ YE NA+ +T W + + DY + WSGL+ +YY PR Y ++++
Sbjct: 654 AEKDYYESNAKRLLTTWGGDP------VNDYAGRVWSGLIGNYYAPRWQSYH----DAVK 703
Query: 520 SGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 566
+ F ++ W W+ ++N Y D + +Q +Y KY
Sbjct: 704 NNQTFDVRQWEENWV--MTPYKNTSTAY-----QDPVRVAQAMYFKY 743
>gi|311064845|ref|YP_003971571.1| beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
gi|310867165|gb|ADP36534.1| Beta-N-hexosaminidase [Bifidobacterium bifidum PRL2010]
Length = 1923
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 163/538 (30%), Positives = 261/538 (48%), Gaps = 46/538 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 366 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 425
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 426 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 484
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 485 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 536
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ +++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 537 QALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 595
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ AW++L
Sbjct: 596 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 654
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y TDG + +I A P D +I S S++
Sbjct: 655 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 691
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 692 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 751
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNART 471
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 752 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARA 811
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 812 LVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 866
>gi|313140918|ref|ZP_07803111.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
bifidum NCIMB 41171]
gi|313133428|gb|EFR51045.1| alpha-N-acetylglucosaminidase family protein [Bifidobacterium
bifidum NCIMB 41171]
Length = 2005
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 163/538 (30%), Positives = 261/538 (48%), Gaps = 46/538 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 448 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 507
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 508 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQESVFGKVSNYYAVDPFH 566
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 567 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 618
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ +++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 619 QALVLDLQSDLRS-QASAMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 677
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ AW++L
Sbjct: 678 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 736
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y TDG + +I A P D +I S S++
Sbjct: 737 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 773
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 774 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 833
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNART 471
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 834 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARA 893
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 894 LVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 948
>gi|333031147|ref|ZP_08459208.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
gi|332741744|gb|EGJ72226.1| Alpha-N-acetylglucosaminidase [Bacteroides coprosuis DSM 18011]
Length = 721
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 159/568 (27%), Positives = 265/568 (46%), Gaps = 73/568 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL W Q+ LQ KIL R+ EL M P+ PAF+G VP A P
Sbjct: 199 MGNLNKWDGPLSDEWHTSQIELQHKILDRMRELEMKPIAPAFAGFVPMAFAEKHPDINFK 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W DP + Y+L P F EIG+ FIE+ E+G ++ Y D+F+E P
Sbjct: 259 HM-RWGGF--DPEYNA-YVLPPDSPFFEEIGKLFIEEWENEFGSNTY-YLSDSFNEMELP 313
Query: 121 VDSPE------YISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
+D + + G +IY + +G+ +A+W+ QGW F Y FW ++ALL++VP
Sbjct: 314 IDKDDTEGKYRLLRQYGESIYKSISAGNPEAIWVTQGWTFGYQHSFWDTTSLQALLSNVP 373
Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + + W FYG +I+ + NF G M G + A
Sbjct: 374 NEKMIIIDLGNDYPKWVWNTEQTWKVQNGFYGKGWIFSYVPNFGGKTTMTGDMQMYATSS 433
Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
EA S N ++G G + EG+E N V+Y+L+++M + E +++ W+ Y + RYG
Sbjct: 434 AEALASPNKGNLIGFGSAPEGLENNEVIYELLADMGWTSESINLDEWMQSYCLSRYGGYP 493
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
+Q AW + TVY+ + ++P Y E L
Sbjct: 494 ENVQKAWELFRKTVYSN-----------LYSYP---------------RYTWQTVVEDTL 527
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ + ++ E + +ELF+++ NEL S Y DLI+ + A A++++
Sbjct: 528 RINKIN--------TSDEFLIGVELFVSAVNELKDSELYVNDLIEFSSFYAAAKADKIYK 579
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
+ ++ + L + ++++ +D LLA H + L W++ A+ ++ +
Sbjct: 580 EALILFERGNKKEARSLLNQSIQILLKVDKLLASHPIYRLEEWVKYARNSGSTVAEKDAF 639
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NA+ IT W + DY ++WSGL++DYY PR + F S +
Sbjct: 640 EANAKRLITTW-------GGIQDDYAARFWSGLIKDYYIPRMELNF--------SSERNS 684
Query: 526 LKDWRREWIKLTNDWQNGRNVY--PVES 551
L+ W W+ + W N + P+E+
Sbjct: 685 LRQWEENWV--STPWNNPTQPFDNPIEA 710
>gi|390937398|ref|YP_006394957.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
gi|389891011|gb|AFL05078.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum BGN4]
Length = 1957
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 163/538 (30%), Positives = 260/538 (48%), Gaps = 46/538 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 400 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 459
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 460 SSGTWSGFDR-PYMIKTYLTDADKAAGKEDYFQKVGDTFYKAQENVFGKVSNYYAVDPFH 518
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 519 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 570
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ ++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 571 QALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 629
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ AW++L
Sbjct: 630 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 688
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y TDG + +I A P D +I S S++
Sbjct: 689 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 725
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 726 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 785
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNART 471
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 786 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARA 845
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 846 LVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 900
>gi|432896403|ref|ZP_20107613.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
gi|433031274|ref|ZP_20219108.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
gi|431432398|gb|ELH14169.1| hypothetical protein A13U_00343 [Escherichia coli KTE192]
gi|431538475|gb|ELI14460.1| hypothetical protein WIA_04388 [Escherichia coli KTE109]
Length = 1049
Score = 235 bits (600), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 157/556 (28%), Positives = 262/556 (47%), Gaps = 43/556 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 361 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 420
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 421 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 478
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D +AVW++Q W + LN + ++L
Sbjct: 479 ADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW--------QENPTDDFLNDLKKDHALIL 528
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+A+ KP + +F P+IW MLH F G + G+ + +A + +E+ M GV
Sbjct: 529 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMQEVLA-QEIPQSLAESKYMKGV 587
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E + NP++Y+++ +MA++ + A+I+ + RYG P I+ AW+++ T Y
Sbjct: 588 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 647
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+ R + SII G +G V++ + YD
Sbjct: 648 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 684
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D
Sbjct: 685 -AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFN 743
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFD 478
QLS +FL L++ D +L F+LG WL SA+ + ++ Q+E+NAR +T W
Sbjct: 744 QLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLDGMDDWTRDQFEFNARAMVTTWGT 803
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
+A LRDY N+ W GL D+Y R A + + + + +G + + W L
Sbjct: 804 EQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQTLKSAAATGQ--KQDAIKVHWFPLEY 860
Query: 539 DWQNGR-NVYPVESNG 553
W N N YP + +G
Sbjct: 861 RWVNQTGNGYPTQPSG 876
>gi|331660873|ref|ZP_08361805.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
TA206]
gi|422369309|ref|ZP_16449711.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
gi|315298924|gb|EFU58178.1| f5/8 type C domain protein [Escherichia coli MS 16-3]
gi|331051915|gb|EGI23954.1| alpha-N-acetylglucosaminidase family protein [Escherichia coli
TA206]
Length = 1052
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 157/556 (28%), Positives = 262/556 (47%), Gaps = 43/556 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 364 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 423
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 424 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFHEGGNR 481
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D +AVW++Q W + LN + ++L
Sbjct: 482 ADLD--MVKVAQTVQNKMLEHDKNAVWIIQNW--------QENPTDDFLNGLKKDHALIL 531
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+A+ KP + +F P+IW MLH F G + G+ + +A + +E+ M GV
Sbjct: 532 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMQEVLA-QEIPQSLAESKYMKGV 590
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E + NP++Y+++ +MA++ + A+I+ + RYG P I+ AW+++ T Y
Sbjct: 591 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHSWLTSRYGAQSPEIEQAWDIMVKTAY 650
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+ R + SII G +G V++ + YD
Sbjct: 651 HRR---KDRQR--------AEDSIIDAKPG----FG--VTRACTYYTALIDYDK------ 687
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D
Sbjct: 688 -AEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAKDYSAFN 746
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFD 478
QLS +FL L++ D +L F+LG WL SA+ + ++ Q+E+NAR +T W
Sbjct: 747 QLSGKFLRLIKLQDQVLGTRPEFMLGTWLNSARTMLDGMDDWTRDQFEFNARAMVTTWGT 806
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
+A LRDY N+ W GL D+Y R A + + + + +G + + W L
Sbjct: 807 EQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQTLKSAAATGQ--KQDAIKVHWFPLEY 863
Query: 539 DWQNGR-NVYPVESNG 553
W N N YP + +G
Sbjct: 864 RWVNQTGNGYPTQPSG 879
>gi|161505009|ref|YP_001572121.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160866356|gb|ABX22979.1| hypothetical protein SARI_03139 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 1014
Score = 234 bits (598), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 155/556 (27%), Positives = 264/556 (47%), Gaps = 43/556 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ +GGPLPQSW Q+ L +KI R+ G+ PV P F+G VP P A++
Sbjct: 326 MANMQSFGGPLPQSWFAQRTELARKIHDRMEVYGITPVFPGFAGQVPDTFAAKNPQAQVI 385
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P TY+ D F ++ + + +G SH Y D F E
Sbjct: 386 DQGDWVGFVRPPM-LRTYVKQGED-YFSKVADVYYQTLKTTFGDISHYYAVDPFYEGGNR 443
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D + + + + M D DAVW++Q W+ A LN + ++L
Sbjct: 444 ADL--NMVKVAQTVQNKMLEHDKDAVWIIQN--------WQENPTDAFLNGLKKDHALIL 493
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL+A+ KP + +F P+IW MLH F G + G+ + +A + +E+ M GV
Sbjct: 494 DLYADNKPNHAIRHEFSNTPWIWNMLHAFGGRMGFSGMPEVLA-QEIPQSLAESKYMKGV 552
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ E + NP++Y+++ +MA++ + A+I+ + RYG P I+ AW+++ T Y
Sbjct: 553 GVTAESLGTNPMLYEMLYDMAWEKSPISSTAYIHNWLTSRYGAQSPEIEQAWDIMVKTAY 612
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ D+ R + SII G +G + +Y + + Y
Sbjct: 613 HRR---KDRQR--------AEDSIIDAKPG----FG---------VTRACTYYNALIDYD 648
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+E + L L+++ + + Y++DL+D+TRQ LA + E + +A+ D
Sbjct: 649 KAEFEKILPLYLSVYDRFKDNPAYQHDLVDITRQVLANASYEYYRAFEDAWMAQDYSAFN 708
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQL--AQNEEQEKQYEWNARTQITMWFD 478
QLS +FL L++ D +L+ F+LG W+ +++ + ++ Q+E+NAR +T W
Sbjct: 709 QLSGKFLRLIKLQDKVLSTRPEFMLGNWINNSRTMLDGMDDWTRDQFEFNARAMVTTWGT 768
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
+A LRDY N+ W GL D+Y R A + + + + +G + + W L
Sbjct: 769 EQAADAG-LRDYSNRQWQGLTGDFYYQRWATWIQALKTAAATGQ--KQDAIKVSWFPLEY 825
Query: 539 DWQNGR-NVYPVESNG 553
W N N YP + +G
Sbjct: 826 RWVNQTGNGYPTQPSG 841
>gi|288927801|ref|ZP_06421648.1| putative alpha-N-acetylglucosaminidase
(N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
oral taxon 317 str. F0108]
gi|288330635|gb|EFC69219.1| putative alpha-N-acetylglucosaminidase
(N-acetyl-alpha-glucosaminidase) (NAG) [Prevotella sp.
oral taxon 317 str. F0108]
Length = 723
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 164/550 (29%), Positives = 261/550 (47%), Gaps = 69/550 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++G GPL W Q+ LQ KIL R+ +L M+P+ P F+G VP AL+ ++P+A I
Sbjct: 198 MGNIYGIDGPLSNQWHQDQIALQHKILDRMRKLDMHPICPGFAGFVPEALKELYPTADI- 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
Q W + Y+L DPLF +IG FI++ KE+GR Y D+F+E
Sbjct: 257 QYTTWEKAFHN------YILSPADPLFHKIGVMFIQEWEKEFGRCD-FYLIDSFNEMDIP 309
Query: 119 -PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
PP D P E+++ G +Y ++ + A W+MQGW+F Y P W + AL++ VP
Sbjct: 310 FPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQGWMFGYQPEIWDYKTLNALVSQVP 369
Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K+++LDL A+ K +W T K F G +I+ ++ N G + G LD A G
Sbjct: 370 DNKMIMLDLAADYNKFLWKTPFNWDFYKGFCGKQWIYSVIPNMGGKSALTGALDFYAKGH 429
Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+EA S+N ++G G + EGIE N VVY+L+ + + + V+++ W+ Y+ RYG
Sbjct: 430 LEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGWAKQGVELRPWLRNYTYSRYGCYP 489
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
++ WN + +VY G ++++ + +
Sbjct: 490 IGMEQYWNEMIQSVY-----------------------------GSFKSHPRFNWQFRPG 520
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
K + S D + +Y E++ + L GN+L + D ++ L L
Sbjct: 521 KEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL-----FEADFKEMAANYLGGKVEILVR 574
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
I +AY+ D QL RF L+ MD +L H + W++ A+ + + Y
Sbjct: 575 QIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPTKDMQKWIDYARARGVSYNKADCY 634
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NAR +T+W + DY + W+GL+RDYY PR YF SG F
Sbjct: 635 ESNARRIVTVW-------GPPIDDYSARIWAGLIRDYYLPRWKHYF----NQKRSGKPFD 683
Query: 526 LKDWRREWIK 535
W ++++
Sbjct: 684 FSTWELDFVE 693
>gi|310287970|ref|YP_003939229.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
gi|309251907|gb|ADO53655.1| alpha-N-acetylglucosaminidase [Bifidobacterium bifidum S17]
Length = 1923
Score = 234 bits (597), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 162/538 (30%), Positives = 259/538 (48%), Gaps = 46/538 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q++ L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 366 MQNLYSVGGPLPAAWFEQRVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 425
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++G F + Q +G+ S+ Y D F
Sbjct: 426 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVGDTFYKAQESVFGKVSNYYAVDPFH 484
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 485 EGGMVPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 536
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ ++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 537 QALVLDLQSDLRS-QASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 595
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ AW++L
Sbjct: 596 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKAWDIL 654
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y DG + +I A P D +I S S++
Sbjct: 655 LDTAYKHMDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 691
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 692 HSDIDYDKRQFEKAAALFEQAYDSYKDSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 751
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNART 471
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 752 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARA 811
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 812 LVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 866
>gi|169351448|ref|ZP_02868386.1| hypothetical protein CLOSPI_02228 [Clostridium spiroforme DSM 1552]
gi|169291670|gb|EDS73803.1| LPXTG-motif cell wall anchor domain protein [Clostridium spiroforme
DSM 1552]
Length = 1990
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 162/564 (28%), Positives = 274/564 (48%), Gaps = 48/564 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GG LP +W ++++ L +K+ R+ G+ PVL FSG VP ++ + +
Sbjct: 360 MQNMTSYGGKLPNNWFEERVELARKMHDRMQTYGITPVLSGFSGQVPTNFKDKYQDVQYV 419
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W + P TY+ + F ++ F + Q +G ++IY D F E
Sbjct: 420 AQGSWCGYER-PDMLRTYVDNGGTDYFSQMADVFYKAQRDIFGDVTNIYAVDPFHEGGKI 478
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK--LV 178
D + + + M D DA+WL+Q W S P ++ + L K ++
Sbjct: 479 GDMN--YTKVYETVQKKMMENDEDAIWLIQEWSGSIAS--NPSKL------INLDKEHVI 528
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTTM 237
VLDLF+EV P +++ + P+IW MLHNF G + + + ++ P + SE+ M
Sbjct: 529 VLDLFSEVSP-RNSALEAADTPWIWNMLHNFGGRMGLDANPEKVSQNIPNTYQNSEH--M 585
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
VG+GM+ E IE +P+ Y+L+ +M + + +D + W Y+ R YG + I++ WN+L
Sbjct: 586 VGIGMTPEAIENSPMAYELLWDMTWTKDPIDFRQWCQDYAKRIYGGTNEDIEEVWNILLD 645
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
T YN D + S+I+ +P + + SS+ H +
Sbjct: 646 TGYNRKDN----------YYQGAPESVIN---------ARPTTN----FTSASSWGHSTI 682
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y E+ RA+ L + +E S + YDL D+TRQ ++ A E ++ AYQ +
Sbjct: 683 NYDKEELERAVYLMAKNYDEFKDSPAFIYDLSDITRQLISNSAQEYHKAMVNAYQAGNLS 742
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITM 475
LS +FLE++ D +L+ + FL+G W+E A+ + ++ + + +E+NAR IT
Sbjct: 743 EFEVLSDKFLEMILLQDQILSTNSDFLVGKWIEQARTMIEDSDDWTKDLFEFNARDLITT 802
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW---RRE 532
W LRDY N+ W+GL +DYY PR + + E++++ DW E
Sbjct: 803 WGGLKNANGGGLRDYSNRQWAGLTKDYYYPRWQKWINDVKEAMKNNTAVPSTDWFLMEWE 862
Query: 533 WIKLTNDWQNGRNVYPVESNGDAL 556
W L +D N Y +E++ AL
Sbjct: 863 WANLKSDEGNE---YSIEASNLAL 883
>gi|210611122|ref|ZP_03288736.1| hypothetical protein CLONEX_00926, partial [Clostridium nexile DSM
1787]
gi|210152109|gb|EEA83116.1| hypothetical protein CLONEX_00926 [Clostridium nexile DSM 1787]
Length = 1662
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 167/578 (28%), Positives = 273/578 (47%), Gaps = 71/578 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW ++ L +K + + +LGM PVL +SG VP + + PSA++
Sbjct: 685 MANLSGFGGPVHDSWFTERTELARKNQLIMRKLGMQPVLQGYSGMVPVDITDKDPSAQVI 744
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S + +L F + + F + Q + YG S Y D F E NT
Sbjct: 745 KQGTWCSFQRPS------MLKTDSETFDKYAQLFYKVQKEVYGDVSDYYATDPFHEGGNT 798
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-- 176
+ SP I+ + + M D + +W++Q W+ ALL + +
Sbjct: 799 GGM-SPTVIAE---KVLANMMEADENGIWIIQS--------WQGNPSTALLQGLDAARDH 846
Query: 177 LVVLDLFAEVKPIWSTSK-----------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
+VLDL+AE P W+ + +F P+++CML+NF G + ++G +++ G
Sbjct: 847 ALVLDLYAEKTPHWNETDPGSYGGAEGGGEFLNTPWVYCMLNNFGGRLGLHGHIENFVNG 906
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE-----KVDVKAWINQYSVRR 280
+A N M G+G++ E NPV+YDL E + + +++ W Y+ RR
Sbjct: 907 VAQAAAQAN-HMAGIGITPEASVNNPVLYDLFFETIWSDDGENLSAINLDEWFKDYTTRR 965
Query: 281 YGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKP 338
YG + +A +L TVYN + V+ A P +D G +G
Sbjct: 966 YGAESQSAYEAMQILNDTVYNPEMNMKGQGAPESVVNARPGLDI-------GAASTWGNA 1018
Query: 339 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
V + Y +E+ +A L + ++L S Y+YDL ++ Q L+
Sbjct: 1019 V-----------------IDYDKAELEKAAALLLKDYDKLKDSAGYQYDLANVLEQVLSN 1061
Query: 399 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
A E + +A++ DA ++S FLE++ ++ + + F+LG WLESAK LA+N
Sbjct: 1062 TAQEYQKKMADAFREGDAEKFEKMSNSFLEIITKVEEVTGTQEEFMLGTWLESAKALAKN 1121
Query: 459 EEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 516
+ ++ YE NAR IT W Q + L DY N+ WSGL DYY PR + K++ E
Sbjct: 1122 ADDFTKELYELNARGLITTWGSIEQANSGGLIDYSNRQWSGLTSDYYKPR---WEKWIAE 1178
Query: 517 SLESGDGFRLKDWR-REWIKLTNDWQNGRNVYPVESNG 553
+ G K++ +W ++ W N YP ++NG
Sbjct: 1179 RKKELAGEESKNYSAADWFEMEWAWARSNNEYPTKANG 1216
>gi|281424178|ref|ZP_06255091.1| N-acetylglucosaminidase [Prevotella oris F0302]
gi|281401447|gb|EFB32278.1| N-acetylglucosaminidase [Prevotella oris F0302]
Length = 723
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 163/550 (29%), Positives = 260/550 (47%), Gaps = 69/550 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++G GPL W Q+ LQ KIL R+ +L M+P+ P F+G VP AL+ ++P+A I
Sbjct: 198 MGNIYGIDGPLSNQWHQDQIALQHKILDRMRKLDMHPICPGFAGFVPEALKELYPTADI- 256
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT-- 118
Q W + Y+L DPLF +IG FI++ KE+GR Y D+F+E
Sbjct: 257 QYTTWEKAFHN------YILSPADPLFHKIGVMFIQEWEKEFGRCD-FYLIDSFNEMDIP 309
Query: 119 -PPVDSP---EYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVP 173
PP D P E+++ G +Y ++ + A W+MQGW+F Y P W + AL++ VP
Sbjct: 310 FPPKDDPKRYEFMADFGKKVYQCIKEANPSATWVMQGWMFGYQPEIWDYKTLNALVSQVP 369
Query: 174 LGKLVVLDLFAEV-KPIWSTS------KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K+++LDL + K +W T K F G +I+ ++ N G + G LD A G
Sbjct: 370 DNKMIMLDLAVDYNKFLWKTPFNWDFYKGFCGKQWIYSVIPNMGGKSALTGALDFYAKGH 429
Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+EA S+N ++G G + EGIE N VVY+L+ + + + V+++ W+ Y+ RYG
Sbjct: 430 LEALNSQNRGKLIGFGFAPEGIENNEVVYELLCDAGWAKQGVELRPWLRNYTYSRYGCYP 489
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
++ WN + +VY G ++++ + +
Sbjct: 490 IGMEQYWNEMLQSVY-----------------------------GSFKSHPRFNWQFRPG 520
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
K + S D + +Y E++ + L GN+L + D ++ L L
Sbjct: 521 KEKYGSVDLDNHFYHAVEIMAGM-LSQMKGNKL-----FEADFKEMAANYLGGKVEILVR 574
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
I +AY+ D QL RF L+ MD +L H + W++ A+ + + Y
Sbjct: 575 QIDKAYESQDTINANQLETRFYRLMTGMDLVLQGHPTKDMQKWIDYARARGVSYNKADCY 634
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E NAR +T+W + DY + W+GL+RDYY PR YF SG F
Sbjct: 635 ESNARRIVTVW-------GPPIDDYSARIWAGLIRDYYLPRWKHYF----NQKRSGKPFD 683
Query: 526 LKDWRREWIK 535
W ++++
Sbjct: 684 FSTWELDFVE 693
>gi|260910505|ref|ZP_05917173.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
F0295]
gi|260635347|gb|EEX53369.1| periplasmic beta-glucosidase [Prevotella sp. oral taxon 472 str.
F0295]
Length = 1566
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 165/559 (29%), Positives = 275/559 (49%), Gaps = 50/559 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ +S + +L Q+++L R+ +LG+ PV+ F G VP + FP A+I
Sbjct: 199 MGNLEGWGGPMSESLIALRLQQQRQMLQRMRQLGIQPVVQGFPGIVPTFFKERFPQARII 258
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--T 118
+ G W S + LL D +F ++ A+ + K +G D F E T
Sbjct: 259 EQGKWGSFQRP-----AVLLPNNDGVFEKVAEAYYQSLTKLFGTDFEFLGGDLFHEGGIT 313
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + S+ A + M A W++QGW + +P PQ+ +L+ +
Sbjct: 314 TGVD----VGSVAAQVQRQMLRFFPRAKWVLQGW--NKNP---SPQLLRVLDK---RHTL 361
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTM 237
+++L E+ W +S +F G P++W +++F G +M G L I P A + ++ M
Sbjct: 362 LVNLSGEIAASWESSDEFGGTPWLWGSVNHFGGKTDMGGQLPVIVTEPHRALALTVDSVM 421
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G+ EGI NPVVYDL + A+ DV + + QY RYG P + AW ++
Sbjct: 422 QGIGILPEGIGTNPVVYDLALKTAWHTATPDVDSMLVQYLGYRYGEVHPDLLAAWRIMLK 481
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VY EG +++ ++ ++ + S++ +
Sbjct: 482 SVYG---------------------EFAIKGEGTFESVF--CARPSLRVTSVSTWGPKQM 518
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y +++ RAL LF+ + +L S TY+YDL+DL RQ+LA YA + ++++AY+ +A
Sbjct: 519 QYQPADLYRALGLFLKAAPKLRDSETYQYDLVDLARQSLANYARTAYADVVKAYEAKNAE 578
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ Q ++RF L+ D LL + FLLG WL+ A Q A NE + NA+T I+ W
Sbjct: 579 QLQQATQRFERLIVLQDSLLLTNRHFLLGNWLQQATQYAPNEADRQLCLHNAQTLISYW- 637
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
E + + DY NK W+G+L YY PR +F+ + S+ +G+ + ++ ++
Sbjct: 638 -GPDEPTTKVHDYANKEWAGMLSTYYLPRWQAFFRVLQASINTGNPPAI-----DFFEME 691
Query: 538 NDWQNGRNVYPVESNGDAL 556
W N + GDA+
Sbjct: 692 KRWANTPQPINTKPQGDAV 710
>gi|373451393|ref|ZP_09543318.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
3_1_31]
gi|371968665|gb|EHO86120.1| hypothetical protein HMPREF0984_00360, partial [Eubacterium sp.
3_1_31]
Length = 2190
Score = 233 bits (593), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 154/536 (28%), Positives = 271/536 (50%), Gaps = 57/536 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ +GGPLP +W +Q+ L +K+ R+ G++PV+ FSG VP P+A IT
Sbjct: 390 MQNLYSYGGPLPDNWFEQRTELARKMHDRMQTYGISPVVQGFSGQVPDNFDKKQPTALIT 449
Query: 61 QLGNWFSVKSDPRWCCTYLLD-----ATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
++ +W + P Y+ + + L+ ++ + F + Q +G ++ Y D F
Sbjct: 450 EMKDWVGY-TRPSIIQPYITENDAAKGKENLYPQVAKDFYDAQKNVFGNVTNYYATDPFH 508
Query: 116 ENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
E P +D E + + M + AVW+MQ W + D ++ LL
Sbjct: 509 EGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQQWQGNLDA----TKLSGLLKP-- 558
Query: 174 LGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+ + LDL ++ P + S+ P++WCMLHNF G + M G L ++A P A
Sbjct: 559 -SQALALDLQTDLNPQNGVMENSE----TPWLWCMLHNFGGRMGMDGNLPNVAKNPAIA- 612
Query: 231 TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+E+ M G+G++ E +E +PV Y+L+ +M + + +D AWI +Y+ RR G + +Q+
Sbjct: 613 MNESKYMKGIGITPEALENSPVAYELLFDMTWTKDPIDEDAWIAKYAQRRAGGTSEKLQE 672
Query: 291 AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
AW +L T Y GA + ++ +II+ T +++ + S
Sbjct: 673 AWKILNETAY----GAKQE------SYQGAAETIINAT-----------PRDSFRSA--S 709
Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
++ H ++ Y E +AL+L I + ++ AS YRYDL D+ Q L A E +++A
Sbjct: 710 TWGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYDLADVADQVLCNVAIEYHSLMVKA 769
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWN 468
++A + S++FLE+++ D +L + F++G W+ A+ + + + + +E+N
Sbjct: 770 KNESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGNWINDARNMMSDGDDWTKDLFEFN 829
Query: 469 ARTQITMWFDNTQEEASL--LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 522
AR +T W + E +SL L DY N+ W+GL +D+YG R I+ + L+ D
Sbjct: 830 ARAMVTTW---SGERSSLNNLNDYSNRKWNGLTKDFYGKRWKIWIENRQAELDGKD 882
>gi|293402122|ref|ZP_06646261.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
bacterium 5_2_54FAA]
gi|291304514|gb|EFE45764.1| alpha-N-acetylglucosaminidase family protein [Erysipelotrichaceae
bacterium 5_2_54FAA]
Length = 2295
Score = 233 bits (593), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 153/536 (28%), Positives = 270/536 (50%), Gaps = 57/536 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ +GGPLP +W +Q+ L +K+ R+ G++PV+ FSG VP P+A IT
Sbjct: 398 MQNLYSYGGPLPDNWFEQRTELARKMHDRMQTYGISPVVQGFSGQVPDNFDKKQPTALIT 457
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
++ +W + P Y+ ++ + L+ ++ + F + Q +G ++ Y D F
Sbjct: 458 EMKDWVGY-TRPSIIQPYITESDAAKGKENLYPQVAKDFYDAQKNVFGNVTNYYATDPFH 516
Query: 116 ENTPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP 173
E P +D E + + M + AVW+MQ W + D L V
Sbjct: 517 EGGNPSGLDFAETFKQ----VQTEMLKANEKAVWVMQQWQGNLDA-------TKLSGLVK 565
Query: 174 LGKLVVLDLFAEVKP---IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR 230
+ + LDL ++ P + S+ P++WCMLHNF G + M G L ++A P A
Sbjct: 566 PSQALALDLQTDLNPQNGVMENSE----TPWLWCMLHNFGGRMGMDGNLPNVAKNPAIA- 620
Query: 231 TSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQD 290
+E+ M G+G++ E +E +PV Y+L+ +M + + +D AWI +Y+ RR G + +Q+
Sbjct: 621 MNESKYMKGIGITPEALENSPVAYELLFDMTWTKDPIDEDAWIAKYAQRRAGGTSEKLQE 680
Query: 291 AWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
AW +L T Y GA + ++ +II+ T +++ + S
Sbjct: 681 AWKILNETAY----GAKQE------SYQGAAETIINAT-----------PRDSFRSA--S 717
Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
++ H ++ Y E +AL+L I + ++ AS YRYDL D+ Q L A E +++A
Sbjct: 718 TWGHSNITYDKKEFEKALQLLIDNYDDFKASPAYRYDLADVANQVLCNVAIEYHSLMVKA 777
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWN 468
++A + S++FLE+++ D +L + F++G W+ A+ + + + + +E+N
Sbjct: 778 KNESNADDFRKYSKKFLEIIDLSDEILGSSEEFMVGNWINDARNMMSDGDDWTKDLFEFN 837
Query: 469 ARTQITMWFDNTQEEASL--LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 522
AR +T W + E +SL L DY N+ W+GL +D+YG R ++ + L+ D
Sbjct: 838 ARAMVTTW---SGERSSLNNLNDYSNRKWNGLTKDFYGKRWKVWIENRQAELDGKD 890
>gi|32564213|ref|NP_496948.2| Protein K09E4.4 [Caenorhabditis elegans]
gi|25814792|emb|CAB70170.2| Protein K09E4.4 [Caenorhabditis elegans]
Length = 715
Score = 232 bits (592), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 150/522 (28%), Positives = 247/522 (47%), Gaps = 48/522 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GG L + + L K+I+ R+ ELG+ P+LP F+G VP L+ +FP++K
Sbjct: 210 MGNLKAYGGGLSDAQMLNDHNLAKRIIDRLLELGITPILPTFAGFVPDHLETLFPASKFN 269
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
+L W + S+ C + DPLF +IG F+ Q K +G +++Y+ D F+E P
Sbjct: 270 RLPRWNNFTSET--SCMLSVSPFDPLFQKIGSTFLRHQKKMFGGDVTNMYSADPFNEILP 327
Query: 120 PVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ +++ AI + + D + VW++Q W F+YD W +K+ L+++P+G
Sbjct: 328 SESAKFDAKFVKQTAQAIMNSCKKVDKNCVWVLQSWSFTYDQ-WPAWAIKSFLSAIPVGN 386
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
L++LDL+AEV P W + F G ++WC+LHNF G+ E+ G L I G A +
Sbjct: 387 LLILDLYAEVVPAWQMTSSFQGHHFVWCLLHNFGGSRELRGNLQKIDKGYQLALMKAGSN 446
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
+VG G+SME I+QN VVY M + + E + + W+ YS RY Q W +L
Sbjct: 447 LVGAGLSMEAIDQNYVVYQFMIDRMWSPEPLPLNNWLKAYSESRYSADFKVAQKFWTLLA 506
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
T YN + V + Y +P +
Sbjct: 507 GTFYNQPEKWGTPRFSVFL-------------------YHRPGFGRKI-----------E 536
Query: 357 LWYSTSEVI-RALELFIASGNELSASNTYRYDLIDLTRQALA-KYANELFLNIIEAYQLN 414
W+ E R EL A + L +R DL D+ R+ + NE L++ EA+ +
Sbjct: 537 YWFPVEETFSRFRELLPALVHVLGEHPLFREDLNDVMREMTQFEMGNEAALSMSEAFLME 596
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 474
D V +E+ + ++ + + W+E+AK +A E+ + + A +T
Sbjct: 597 DKQQVGASCEMLMEMFQKLES----YSNRDVRQWIENAKSIAPTSEERQVFPVTAGDILT 652
Query: 475 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 516
+W Q DY ++ W+GL+ YYG R + +++E
Sbjct: 653 VWGPTGQN-----LDYAHREWAGLMSGYYGRRWQYFCDWILE 689
>gi|421736727|ref|ZP_16175487.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
IPLA 20015]
gi|407295984|gb|EKF15606.1| alpha-N-acetylglucosaminidase, partial [Bifidobacterium bifidum
IPLA 20015]
Length = 1044
Score = 231 bits (590), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 161/538 (29%), Positives = 257/538 (47%), Gaps = 46/538 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ GGPLP +W +Q + L ++I R+ G+ PV+ F G VPA Q P++
Sbjct: 338 MQNLYSVGGPLPAAWFEQCVELGRRIHDRMQAYGITPVIQGFGGQVPADFQEKNPTSVAA 397
Query: 61 QLGNWFSVKSDPRWCCTYLLDA-----TDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFD 115
G W P TYL DA + F ++ F + Q +G+ S+ Y D F
Sbjct: 398 SSGTWSGFDR-PYMIKTYLTDADKTAGKEDYFQKVCDTFYKAQENVFGKVSNYYAVDPFH 456
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
E D + I + + M D AVW+MQ W + D ++ L + G
Sbjct: 457 EGGTIPDGFD-IVDIYRTVQRKMLDHDPAAVWVMQQWQWGIDE----TKLSGLADK---G 508
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+ +VLDL ++++ ++ + GVP++W MLHNF G + + G+ + I+ +A S
Sbjct: 509 QTLVLDLQSDLRSQ-ASPMENQGVPWVWNMLHNFGGRMGLDGVPEVISQDITKAYNSSGY 567
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
M G+G++ E I+ +P+VY+L+ +M ++ + VD ++W +Y+ RRYG + I+ W++L
Sbjct: 568 -MRGIGITPEAIDNSPIVYELLFDMTWEQDPVDYRSWTQEYAERRYGGTDGTIEKVWDIL 626
Query: 296 YHTVYNCTDGA--TDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
T Y TDG + +I A P D +I S S++
Sbjct: 627 LDTAYKHTDGEYYQGASESIINARPS-DNTIGSA----------------------STWG 663
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
H + Y + +A LF + + S +RYD +D+ RQ LA E +AY+
Sbjct: 664 HSDIDYDKRQFEKAAALFEQAYDSYKNSAGFRYDYVDVMRQVLANSFQEYQPLAGQAYKS 723
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNART 471
D LS R L++++ D LL+ D FL+G W++ A+ + + +E NAR
Sbjct: 724 GDLETFRTLSSRMLDIIKAQDKLLSSSDDFLVGAWIDDARTMLDGADDWTADLFELNARA 783
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
+T W N + SL+ DY N+ W+GL DYY R Y + LE G F DW
Sbjct: 784 LVTTWGLN--KNGSLI-DYSNRQWAGLTGDYYYRRWKTYVDNRLNKLEHGTDFTDPDW 838
>gi|341892319|gb|EGT48254.1| hypothetical protein CAEBREN_28412 [Caenorhabditis brenneri]
Length = 713
Score = 231 bits (589), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 148/523 (28%), Positives = 249/523 (47%), Gaps = 50/523 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL +GG L + + L L K+I+ R+ ELG+ P+LP F+G VP L+ +FPS+K T
Sbjct: 208 MGNLKAYGGGLSDAQMLNDLNLAKRIINRLLELGITPILPTFAGFVPDQLEKLFPSSKFT 267
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
+L W + S+ C + DPLF +IG F+ Q K +G +++Y+ D F+E P
Sbjct: 268 RLPCWNNFTSET--SCLLSVSPFDPLFQKIGSLFLRHQKKMFGGDITNLYSADPFNEILP 325
Query: 120 PVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ +++ AI + + D + +W++Q W F+YD W +K+ L++VP+G
Sbjct: 326 SDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWSFTYDE-WPSWAIKSFLSAVPIGN 384
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
L++LDL++EV P W ++ F+G YIWCMLH+F G+ E+ G L + G A +
Sbjct: 385 LLILDLYSEVVPAWQSTSSFHGHNYIWCMLHSFGGSRELRGNLQKVDKGYQLALMKGGSN 444
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
++G G++ME I+QN V+Y M + + E + + WI YS RY W +L
Sbjct: 445 LIGAGLTMEAIDQNYVIYQFMVDRMWSSEPLPLNTWIKSYSESRYSADFKVSHKFWTLLA 504
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
+ YN + + V + Y +P + +
Sbjct: 505 FSFYNQPEKWGNPRFSVFL-------------------YHRPAFGKKI-----------E 534
Query: 357 LWYSTSEVIRALELFIAS-GNELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQL 413
W+ E L+ I S + L ++ DL D+ R A+ ++ N+ L + EA+ +
Sbjct: 535 YWFPVEETFGHLQSLIPSLIHVLGDHPLFKEDLNDVMR-AITQFEVGNDAALTLTEAFLM 593
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
D + + DM L + + W+E +K +A E+ + + A +
Sbjct: 594 EDKQQIGSTCENLM----DMFLKLESYSNRDMKHWIEDSKSIAATSEERQVFPATAADIL 649
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 516
T+W Q DY ++ W GLL YYG R + +++E
Sbjct: 650 TVWGPEGQN-----LDYAHREWEGLLSGYYGRRWQYFCDWILE 687
>gi|268533054|ref|XP_002631655.1| Hypothetical protein CBG20846 [Caenorhabditis briggsae]
Length = 712
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 143/523 (27%), Positives = 254/523 (48%), Gaps = 50/523 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GG L + + L K+I+ R+ ELG+ P+LP FSG VP L+ +FP++K
Sbjct: 207 MGNLKGYGGGLSDAQMLNDFNLAKRIINRLLELGITPILPTFSGFVPDRLEKLFPTSKFN 266
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDENTP 119
+L W + S+ C + DPLF +IG +F+ Q K G +++Y+ D F+E P
Sbjct: 267 RLPCWNNFTSET--SCLLSVSPFDPLFQKIGSSFLRHQKKMLGGDITNLYSADPFNEVLP 324
Query: 120 PVDS---PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK 176
+ +++ AI + + D + +W++Q W F+YD W +K+ L++VP+G+
Sbjct: 325 SDSAKFDAKFVKQTAQAIMNSCRKVDKNCIWVLQSWSFTYDQ-WPNWAIKSFLSAVPIGQ 383
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT 236
+++LDL++EV P W + F+G ++WCMLHNF G+ E+ G + + G A +
Sbjct: 384 MLILDLYSEVVPAWQMTSSFHGHNFVWCMLHNFGGSRELRGNVQKVDKGYQLALMKAGSN 443
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
+VG G+SME I+QN ++Y M + + E + + +W+ YS RY W +L
Sbjct: 444 LVGAGLSMEAIDQNYMMYQFMIDRMWTQEPIPLNSWLKSYSESRYSADFKVAHKFWTILA 503
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
+ YN + + V + Y +P + +
Sbjct: 504 GSFYNQPEKWGNPRFSVFL-------------------YHRPAFGKKI-----------E 533
Query: 357 LWYSTSEVIRALE-LFIASGNELSASNTYRYDLIDLTRQALAKY--ANELFLNIIEAYQL 413
W+ E LE L ++ + L ++ DL D+ R A+ ++ NE L++ EA+ +
Sbjct: 534 YWFPVEETFTHLESLVLSLLHILGDHPLFKEDLNDVMR-AITQFEIGNEAALSLTEAFLM 592
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
D + + + + ++ + + W+E AK +A E+ + + +A +
Sbjct: 593 EDKQQIGTTCENLMGMFQKLEP----YSNRDVRDWIEDAKSIAPTTEEREVFPISASDIL 648
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 516
T+W Q DY ++ W+GLL YYG R + +++E
Sbjct: 649 TVWGPTGQN-----LDYAHREWAGLLSGYYGRRWQYFCDWILE 686
>gi|423219557|ref|ZP_17206053.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
CL03T12C61]
gi|392624762|gb|EIY18840.1| hypothetical protein HMPREF1061_02826 [Bacteroides caccae
CL03T12C61]
Length = 715
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 159/540 (29%), Positives = 245/540 (45%), Gaps = 48/540 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+PQS +D + L +K+L R+ LG+ P++P F G VP+ L+N A I
Sbjct: 195 MGNIEGWGGPMPQSQIDSRKKLVQKMLKRMKSLGIEPLMPGFYGMVPSNLKNK-SKAHII 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + +LD DP F + F ++ + YG ++ D F E
Sbjct: 254 PQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDETRRLYGSDIRFFSGDPFHEGG-- 305
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ G AI MQ ++W++QGW + P LL + ++V
Sbjct: 306 ATDGVALGDAGRAIQKTMQKHFPGSIWVLQGWQDNPKP--------GLLEKLDKRYVLVQ 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT-MVG 239
+LF E W T K + G P+IW + NF + G L A A SE M G
Sbjct: 358 ELFGENTNNWETRKGYEGTPFIWATVTNFGERPGINGKLQRFADEVYRASNSEYAKYMKG 417
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG+ EGI NPV Y+L+ E+ + ++VDV WI Y RYGR I+ AW ++ ++
Sbjct: 418 VGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESYVTARYGRITDEIRTAWKMMLKSI 477
Query: 300 YNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y+ G + +++ A P ++ +S + Y + + K+A
Sbjct: 478 YSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKYDRDLYKKAAF------------- 524
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
LF + E + TYR DLI RQ +A A+ +F ++I AYQ
Sbjct: 525 -----------LFAKAMPEFNEVRTYRIDLIHFLRQVIANEADSVFYDMITAYQEKKVEK 573
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
Q +FL +++ + LLA F L W + AK ++K N IT W +
Sbjct: 574 FEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDAGNTAAEKKNNFHNLMMLITYWGE 633
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD---WRREWIK 535
+ E + L DY K W+G++ YY R +YF Y+ +L G+ + D W REW++
Sbjct: 634 HVTSEDN-LHDYAYKEWAGMMNTYYKERWLVYFDYL-RALLRGEEAKAPDYFHWEREWVE 691
>gi|153806010|ref|ZP_01958678.1| hypothetical protein BACCAC_00255 [Bacteroides caccae ATCC 43185]
gi|149130687|gb|EDM21893.1| Alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides caccae ATCC
43185]
Length = 715
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 159/540 (29%), Positives = 245/540 (45%), Gaps = 48/540 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+PQS +D + L +K+L R+ LG+ P++P F G VP+ L+N A I
Sbjct: 195 MGNIEGWGGPMPQSQIDSRKKLVQKMLKRMKSLGIEPLMPGFYGMVPSNLKNK-SKAHII 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + +LD DP F + F ++ + YG ++ D F E
Sbjct: 254 PQGTWGAFTRPD------ILDPMDPEFDRVAAIFYDETRRLYGSDIRFFSGDPFHEGG-- 305
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ G AI MQ ++W++QGW + P LL + ++V
Sbjct: 306 ATDGVALGDAGRAIQKTMQKHFPGSIWVLQGWQDNPKP--------GLLEKLDKRYVLVQ 357
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTT-MVG 239
+LF E W T K + G P+IW + NF + G L A A SE M G
Sbjct: 358 ELFGENTNNWETRKGYEGTPFIWATVTNFGERPGINGKLQRFADEVYRASNSEYAKYMKG 417
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG+ EGI NPV Y+L+ E+ + ++VDV WI Y RYGR I+ AW ++ ++
Sbjct: 418 VGILPEGINNNPVTYELLLELVWHKDRVDVDQWIESYVTARYGRITDEIRTAWKMMLKSI 477
Query: 300 YNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y+ G + +++ A P ++ +S + Y + + K+A
Sbjct: 478 YSSEVGYQEGPPENILCARPALELKSVSSWGRLAKKYDRDLYKKAAF------------- 524
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
LF + E + TYR DLI RQ +A A+ +F ++I AYQ
Sbjct: 525 -----------LFAKAMPEFNEVRTYRIDLIHFLRQVIANEADSVFYDMITAYQEKKVEK 573
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
Q +FL +++ + LLA F L W + AK ++K N IT W +
Sbjct: 574 FEQEVSKFLMMIDTENELLAQDPFFRLSTWQQQAKDAGNTAAEKKNNFHNLMMLITYWGE 633
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD---WRREWIK 535
+ E + L DY K W+G++ YY R +YF Y+ +L G+ + D W REW++
Sbjct: 634 HVTSEDN-LHDYAYKEWAGMMNTYYKERWLVYFDYL-RALLRGEEAKAPDYFHWEREWVE 691
>gi|294648124|ref|ZP_06725667.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
gi|292636508|gb|EFF54983.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CC 2a]
Length = 499
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 161/543 (29%), Positives = 258/543 (47%), Gaps = 60/543 (11%)
Query: 35 MNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAF 94
M PVLPAF+G+VPA L+ ++P A I LG W R C +L + D LF +I + F
Sbjct: 1 MKPVLPAFAGHVPADLKRIYPEADIQHLGKWAGFADAYR--CNFL-NPNDALFAKIQKLF 57
Query: 95 IEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF 154
+++Q K +G T HIY D F+E PP PEY+ + + +Y+ + + D A W+ W+F
Sbjct: 58 LDEQKKLFG-TDHIYGLDPFNEVDPPSFEPEYLRKIASDMYATLTAADPKAQWMQMTWMF 116
Query: 155 SYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI 213
+D W +MKALL VP K+++LD E +W ++ F+ PYIWC L NF GN
Sbjct: 117 YFDKDKWTSERMKALLTGVPQNKMILLDYHCENVELWKRTEHFHDQPYIWCYLGNFGGNT 176
Query: 214 EMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWI 273
+ G + A + + G+G ++EG++ Y+ + E A+ + VD WI
Sbjct: 177 TLTGNVKESGARLENALINGGGNLKGIGSTLEGLDVMQFPYEYILEKAW-NLNVDDNKWI 235
Query: 274 NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ 333
+ R G ++DAW L++ +Y V P T G
Sbjct: 236 ECLADRHVGCVSQPVRDAWKRLFNDIY--------------VQVPR--------TLGTLP 273
Query: 334 NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS--NTYRYDLIDL 391
Y +P + K ++ Y + L EV R L NE + + +R DLI +
Sbjct: 274 GY-RPALNKNSEKRTSNVYSNVELL----EVWRKL-------NEAPSDRRDAFRLDLITV 321
Query: 392 TRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLES 451
RQ L Y ++ + + D + + E++ D+D L A H L W++
Sbjct: 322 GRQVLGNYFLDVKMEFDRMVEAKDHQALKACGEKMKEILNDLDKLNAFHPYCSLDKWIDD 381
Query: 452 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 511
A+++ + + + YE NAR IT W L DY ++ W+GL+ DYY R +Y
Sbjct: 382 ARKMGDSPQLKDYYEKNARNLITTW-------GGSLNDYASRSWAGLISDYYAKRWEVYV 434
Query: 512 KYMIESLESG---DGFRLKDWRRE----WIKLTNDWQNGRNVYPVESNGDALIT-SQWLY 563
I++ E G D +L+D +E W+ T+ ++V+ S D L++ S +L+
Sbjct: 435 NTFIKAAEEGVEVDQKQLEDELKEIEEGWVNATDRKDTRKDVH---STTDGLLSFSTFLF 491
Query: 564 NKY 566
+KY
Sbjct: 492 SKY 494
>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 1105
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 163/545 (29%), Positives = 248/545 (45%), Gaps = 76/545 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W++ + LQ++IL R+ LGM PVLPAF+G VP A P A+I
Sbjct: 164 MGNIEGYDAPLPQQWIEDKHALQQRILQRMRALGMKPVLPAFAGYVPKAFAQAHPQARIY 223
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I + FI+ + YG+ ++ Y D F+E PP
Sbjct: 224 RMRAWEGFHE------TYWLDPADPLFAKIAQRFIQLYDRTYGKGTY-YLADAFNEMLPP 276
Query: 121 VDS----------------------PEY--------ISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + PE ++ G A+Y + + DAVW+MQ
Sbjct: 277 IAADGSDARLASYGDSTANTANTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 336
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 337 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 396
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ G+ +YG +AF + R + +VG G EG+ N VVY+ M +A+
Sbjct: 397 YGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFGAFPEGLHTNSVVYEYMYALAWGG 453
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
++ ++ W+ Y+ RYG S PA++ AW+ L +V + P
Sbjct: 454 QQRSLQDWLGDYTRARYGHSSPALRAAWDDLQASVLSTR---------------YWTPRW 498
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
G Y + +P + + E + D P L RAL+ +A E + + Y
Sbjct: 499 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLY 549
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
RYDL+D R + + AY+ D R V +DGL+
Sbjct: 550 RYDLVDFARHYATGRVDTQLQQALAAYKRGDVAAGDAAFARVQAAVRQLDGLVGGQQE-T 608
Query: 445 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 504
L WL++A+ A+ + Y +A+ Q+++W L DY +K W G+ DYY
Sbjct: 609 LSSWLDAAEGDAKTPQDAAYYRRDAKAQVSVWGGEGN-----LGDYASKAWQGMYADYYL 663
Query: 505 PRAAI 509
PR A+
Sbjct: 664 PRWAL 668
>gi|375146756|ref|YP_005009197.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
koreensis GR20-10]
gi|361060802|gb|AEV99793.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Niastella
koreensis GR20-10]
Length = 1147
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 157/552 (28%), Positives = 246/552 (44%), Gaps = 47/552 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGP+PQS +D + +L +K++ R+ LG+ PV+P F G VP N A++
Sbjct: 191 MGNIEGWGGPMPQSQIDSRKILVQKMIARMQALGIEPVMPGFYGMVPHNF-NTKSKARVI 249
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
GNW + + +LD TD F + F E+ K YGR ++ D F E
Sbjct: 250 TQGNWGA------FIRPAILDPTDTAFDRVAGIFYEETKKLYGRNIRFFSGDPFHEGG-- 301
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ + + GA I MQ A+W++QGW + K LL L++
Sbjct: 302 ITNGVNLGKAGANIQKAMQQYFPGAIWVLQGW--------QDNPKKELLAETDKSALLIQ 353
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMVG 239
+LF E W T + G P+IWC ++NF + G L+ A A T M G
Sbjct: 354 ELFGENTNNWETRNGYEGTPFIWCCVNNFGERPGLNGKLERYAGEVYRAATGPFREYMKG 413
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG+ EGI NP YDL+ E+ + ++ V+ WIN Y RYG++ I AW + T+
Sbjct: 414 VGIMPEGINNNPASYDLVLELGWHNQPVETGKWINDYVKARYGKANDQIATAWTLFLQTI 473
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y+ +++ A P A+ SS+ Y
Sbjct: 474 YSNPGYQEGPPENILCARP------------------------ALQVKSVSSWGKLKKGY 509
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
T+ + ++ F A+ S TY+ DLI+ TRQ L+ A+ +F +++ AY+ +
Sbjct: 510 DTALFEKGVQAFAAAAPLFGNSETYKIDLINFTRQVLSNRADTVFASLVTAYKEENTVAF 569
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ FL L + LL H + L + + A + + K NA IT W +N
Sbjct: 570 NAAAEAFLSLHALTNELLNSHSYYRLTSYQQQALRSGNTPIERKNNLHNAMMLITYWGEN 629
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKD---WRREWIKL 536
++E L +Y K W G++ +Y R +YF Y+ +L +G D W REW+
Sbjct: 630 NRQE-DYLHEYAYKEWGGMMTTFYQQRWKLYFDYLRNNL-AGKSVTPPDFFAWEREWVTQ 687
Query: 537 TNDWQNGRNVYP 548
++ YP
Sbjct: 688 NEQVKSEVQPYP 699
>gi|282877909|ref|ZP_06286718.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
gi|281299910|gb|EFA92270.1| alpha-N-acetylglucosaminidase (NAGLU) [Prevotella buccalis ATCC
35310]
Length = 717
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 159/550 (28%), Positives = 254/550 (46%), Gaps = 71/550 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W QQ+ LQ KI+ R+ ELGM+P+ PAF+G VP A P
Sbjct: 195 MGNLNQWDGPLSDAWHKQQITLQHKIISRMRELGMHPIAPAFAGFVPKAFAKKHPEINFK 254
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W Y+L F ++G+ FIE+ +E+G ++ Y D+F+E P
Sbjct: 255 HL-RWGGFADS---LNAYVLPPESSYFKQLGKLFIEEWEREFGENTY-YLSDSFNEMKLP 309
Query: 121 V------DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
V + ++ G AIY + +G+ A+W+ QGW F Y FW + ALL+ VP
Sbjct: 310 VNPNDEEEKCRLLAEYGKAIYQSINAGNPHAIWVTQGWTFGYQHDFWNRKSLSALLSQVP 369
Query: 174 LGKLVVLDLFAE-------VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
+++++DL + + W FYG +I+ + NF G + G L+ A
Sbjct: 370 NDRMIIIDLGNDYPKWVWHTEQTWKRHNGFYGKQWIFSYVPNFGGKTLLTGDLEMYATDA 429
Query: 227 VEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
A ++ N +VG+G + EG+E N VVY+L+S+ A+ + +++ WI Y + RYG+
Sbjct: 430 SLALSAANKGNLVGIGSAPEGLENNEVVYELLSDAAWTDKGINLDEWIANYCMARYGKYP 489
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVD-PSIISVTEGKYQNYGKPVSKEAV 344
++ AWN +VY+ + ++P ++I T K
Sbjct: 490 DKMKAAWNGFRKSVYSS-----------LYSYPRFTWQTVIPDTRRK------------- 525
Query: 345 LKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELF 404
S +D ++ +A+E F++ +EL + Y+ D I Q L A+ +
Sbjct: 526 -----SRHDLNETYF------KAVEDFLSCADELGGAKFYQDDAILFAAQYLGAKADIYY 574
Query: 405 LNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
N + LN + + +EL+ D +LA H L W+ A+ +++ Q
Sbjct: 575 ENALRYGSLNKHVEANKQLSKAIELLLFADKILASHPTDRLDVWIAKARSQGHTPQEKNQ 634
Query: 465 YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF 524
YE NA+ IT W + + DY + WSGL++DYY PR IYF S
Sbjct: 635 YEANAKRLITTWGGHQE-------DYAARCWSGLIKDYYIPRIQIYF--------SNQRK 679
Query: 525 RLKDWRREWI 534
L W WI
Sbjct: 680 MLDQWEENWI 689
>gi|433678127|ref|ZP_20510026.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430816763|emb|CCP40478.1| alpha-N-acetylglucosaminidase [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 691
Score = 228 bits (580), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 170/595 (28%), Positives = 259/595 (43%), Gaps = 85/595 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PL Q W++ + LQ++IL R+ LGM PVLPAF G VP A P A+I
Sbjct: 127 MGNIEGYDAPLQQQWIEDKHALQQRILQRMRTLGMKPVLPAFVGYVPKAFAQAHPQARIY 186
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I FI+ + YG+ ++ Y D F+E PP
Sbjct: 187 RMRAWEGFHE------TYWLDPADPLFAKIALRFIQLYDRTYGKGTY-YLADAFNEMLPP 239
Query: 121 VDS----------------------PEY--------ISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + PE ++ G A+Y + + DAVW+MQ
Sbjct: 240 IAADGSDARLASYGDSTANTAKTAPPEVSPAQRDKRLADYGRALYESIHRANPDAVWVMQ 299
Query: 151 GWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L VP KL+VLD+ + P W S F G +I+ +HN
Sbjct: 300 GWLFGADRHFWTPQAIAAFLREVPNDKLLVLDIGNDRYPGTWKLSDAFDGKQWIYGYVHN 359
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ G+ +YG +AF + R + +VG G EG+ N VVY+ M +A+
Sbjct: 360 YGGSNPVYG---DLAFYRDDLRALLADKDKQQLVGFGAFPEGLHDNSVVYEYMYALAWGG 416
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
++ ++ W+ Y RYG + PA++ AW+ L V + P
Sbjct: 417 QQRSLQDWLGDYIRARYGHTSPALRAAWDDLQAAVLSTRYWT---------------PRW 461
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
G Y + +P + + E + D P L RAL+ +A E + + Y
Sbjct: 462 WRSRAGAYLLFKRPTLD--IGEFEGAPGDPPRL-------RRALDQLLALAPEYADAPLY 512
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 444
RYDL+D R + + AY+ D R V+ +DGL+
Sbjct: 513 RYDLVDFARHYATGRVDAQLQQAVAAYRRGDVAAGDAAFARVQAAVQQLDGLVGGQQE-T 571
Query: 445 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 504
L WL A+ A+ + Y +A+ Q+++W L DY +K W G+ DYY
Sbjct: 572 LSSWLGDAEGDAKTPQDAAYYRRDAKAQVSVWGGEGN-----LGDYASKAWQGMYADYYL 626
Query: 505 PRAAIYFKYMIESLESGDGF--------RLKDWRREWIKLTNDWQNGRNVYPVES 551
PR A+ + + + G G RL+ W +W+K + PV +
Sbjct: 627 PRWALAMQ-ALRAAAVGSGSVDEAALQQRLRAWELDWVKRETPYTRQAPADPVAA 680
>gi|345014586|ref|YP_004816940.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
gi|344040935|gb|AEM86660.1| alpha-N-acetylglucosaminidase [Streptomyces violaceusniger Tu 4113]
Length = 1044
Score = 228 bits (580), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 156/563 (27%), Positives = 256/563 (45%), Gaps = 45/563 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGP+ + LD++ L ++I R+ ELGM PV P + G VP + P A+
Sbjct: 216 LQNMSEYGGPVSTALLDKRTELGRRIADRLRELGMRPVFPGYFGTVPDGFADRNPEARTV 275
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W ++ P W LD F ++ AF Q + +G + ++ D E P
Sbjct: 276 PQGDWNGLRR-PDW-----LDPRTESFRKVAAAFYRHQRELFGE-AGLFKMDLLHEGGDP 328
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +++ A+W++ GW +P + LL++V +++V+
Sbjct: 329 GDVP--VPDAARAVETALRTARPGAIWVILGW--QENP------RRDLLDAVDHDRMLVV 378
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D +++ + K + VPY + + NF G + R + +VG
Sbjct: 379 DGLSDLDTVTDREKDWGAVPYAFGTIPNFGGRTTIGAKTHMWTKRFTVWRDKPGSKLVGT 438
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E E++P ++L SE+A++ E VD W Y+ RYG ++A+ L T Y
Sbjct: 439 AYMPEAAERDPAAFELFSELAWREEAVDRAEWFRSYAEMRYGGRDAKAREAFAALRDTAY 498
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + V A P S+T NY + T ++D
Sbjct: 499 EISSKDGRPHDSVFAARP-------SLTARSGTNYA----------THTPAFDPAGF--- 538
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+V A L + +G L S+ YR+DL D+ RQALA + +L + +AY D
Sbjct: 539 --DVAFAALLGVRAG--LRDSDAYRHDLTDIARQALANRSWQLIPQLQDAYDRKDRTAFR 594
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
L+R +L+L+ D + H FLLGPWLE AK++A +E+ + E ART IT W D
Sbjct: 595 TLARLWLKLMRLSDDMTGAHRRFLLGPWLEDAKRMASGDEESARLERAARTLITTWADRA 654
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
+ L +Y N+ WSGL+ D++ P+ Y + ++L R D W + W
Sbjct: 655 TADGGKLANYANRDWSGLIADFHLPQWQSYLDELEDALAENRPPRAFD----WFAVEEPW 710
Query: 541 QNGRNVYPVESNGDALITSQWLY 563
R YPV DA T+Q +Y
Sbjct: 711 TRERTSYPVRPTTDAHRTAQRVY 733
>gi|294812279|ref|ZP_06770922.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
gi|294324878|gb|EFG06521.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
Length = 1086
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 156/564 (27%), Positives = 254/564 (45%), Gaps = 54/564 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGP+ + LD+++ L ++I+ R+ LGM PV+P + G VP P A++
Sbjct: 257 LQNMSEYGGPVSPALLDRRIELGQRIVTRMRRLGMRPVVPGYFGTVPDGFVARNPGARVI 316
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD P+F EI A+ Q + +G H + D E P
Sbjct: 317 PQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRHQEELFGEIDH-FKMDLLHEGGTP 369
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +++ A W++ GW + P ALL+++ K++++
Sbjct: 370 GDVP--VPDAARAVETALRAARPAATWVILGWQSNPRP--------ALLDAIDTSKVLIV 419
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D +++ + ++ G PY + + NF G + D R N+ +VG
Sbjct: 420 DGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPNSALVGT 479
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E +++P +L +E+A++ EK+D AW Y+ RYG PA ++A+ L T Y
Sbjct: 480 AYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYAQFRYGAKDPAAEEAFAALAGTAY 539
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
T T R + F + + ++ S +++D
Sbjct: 540 QLT---TTDGRPIDSLF---------------------LRRPSMSSSVATAFDQAAFDRG 575
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+ ++R E EL S+ YRYDL DL RQALA + L L + AY D
Sbjct: 576 FAALLRVNE-------ELRGSDAYRYDLTDLARQALALRSRTLQLALRAAYATKDVTAFR 628
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
++ +L L+ D + CH FLLGPWLE AK+ A + E+ + E AR IT W D
Sbjct: 629 GVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFATSTEEAVELERTARVLITTWGD-- 686
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
+ A L +Y N+ W GL+ D + P+ YF + +L G + D W W
Sbjct: 687 RAAAVELSNYANRDWQGLIGDVHVPQWEQYFTEVATALAEGRAPKAID----WYPGEETW 742
Query: 541 QNGRNVYPVESNGDALITSQWLYN 564
R YPV GD +Q +++
Sbjct: 743 TKDRRPYPVRPTGDVHKVAQRVHD 766
>gi|326440885|ref|ZP_08215619.1| alpha-N-acetylglucosaminidase [Streptomyces clavuligerus ATCC
27064]
Length = 1038
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 156/564 (27%), Positives = 254/564 (45%), Gaps = 54/564 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGP+ + LD+++ L ++I+ R+ LGM PV+P + G VP P A++
Sbjct: 209 LQNMSEYGGPVSPALLDRRIELGQRIVTRMRRLGMRPVVPGYFGTVPDGFVARNPGARVI 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD P+F EI A+ Q + +G H + D E P
Sbjct: 269 PQGVWNGLPR-PDW-----LDPRTPVFAEIAAAYYRHQEELFGEIDH-FKMDLLHEGGTP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +++ A W++ GW + P ALL+++ K++++
Sbjct: 322 GDVP--VPDAARAVETALRAARPAATWVILGWQSNPRP--------ALLDAIDTSKVLIV 371
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D +++ + ++ G PY + + NF G + D R N+ +VG
Sbjct: 372 DGLSDLDTVRDREAEWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPNSALVGT 431
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E +++P +L +E+A++ EK+D AW Y+ RYG PA ++A+ L T Y
Sbjct: 432 AYMPEAADRDPAALELFTELAWRREKIDRSAWFAGYAQFRYGAKDPAAEEAFAALAGTAY 491
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
T T R + F + + ++ S +++D
Sbjct: 492 QLT---TTDGRPIDSLF---------------------LRRPSMSSSVATAFDQAAFDRG 527
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+ ++R E EL S+ YRYDL DL RQALA + L L + AY D
Sbjct: 528 FAALLRVNE-------ELRGSDAYRYDLTDLARQALALRSRTLQLALRAAYATKDVTAFR 580
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
++ +L L+ D + CH FLLGPWLE AK+ A + E+ + E AR IT W D
Sbjct: 581 GVAALWLRLMRLADTVAGCHKAFLLGPWLEEAKRFATSTEEAVELERTARVLITTWGD-- 638
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
+ A L +Y N+ W GL+ D + P+ YF + +L G + D W W
Sbjct: 639 RAAAVELSNYANRDWQGLIGDVHVPQWEQYFTEVATALAEGRAPKAID----WYPGEETW 694
Query: 541 QNGRNVYPVESNGDALITSQWLYN 564
R YPV GD +Q +++
Sbjct: 695 TKDRRPYPVRPTGDVHKVAQRVHD 718
>gi|373461651|ref|ZP_09553390.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
gi|371951955|gb|EHO69797.1| hypothetical protein HMPREF9944_01654 [Prevotella maculosa OT 289]
Length = 713
Score = 224 bits (572), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 154/509 (30%), Positives = 232/509 (45%), Gaps = 53/509 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ Q ++D Q L ++IL R+ LG+ PVL F G V ++++ +P+A +
Sbjct: 194 MGNLEGWGGPVSQDFIDAQSRLGRRILDRMATLGIQPVLQGFYGMVSRSIRDRYPNAVMP 253
Query: 61 QLGNW-FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
Q G W F + D +L T+ LF EI + + K YG H + D F E
Sbjct: 254 Q-GMWGFFERPD-------ILKPTEKLFDEIADTYYREIKKHYGTGFHYFGGDLFHEGGQ 305
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
++ G A+ MQ + W++QGW + +P LL + K++V
Sbjct: 306 --TGTLNVADCGLAVQQAMQRNFPGSTWVLQGWSGNPNPL--------LLTKLDREKVLV 355
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-NTTMV 238
+DLF E W+ +K + G P++WC++ NF MYG L IA + R S+ +
Sbjct: 356 VDLFGENDEAWNRTKAYQGTPFLWCIVSNFGEQCGMYGKLQRIALQIDKVRKSDYKAYLK 415
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GVG+ EGI NPVVYD++ K++V+AW+ Y RYG I AW + T
Sbjct: 416 GVGIMPEGINNNPVVYDMVLHAPLTDRKINVEAWLKSYITYRYGSYNADIYAAWLIFLQT 475
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKP----VSKEAVLKSETSSYDH 354
+Y SV E YG P ++ V ++TSS+
Sbjct: 476 IY------------------------ASVPE----KYGLPESVFCARPGVKVTQTSSWGV 507
Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
+Y + LF+ + S TY YD+ DL RQ + N ++ ++I A
Sbjct: 508 RARYYDMDFFKEGVRLFLKAKTSFEDSETYAYDMFDLLRQVQSDKGNRVYDDMIAAIDAK 567
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 474
+ + Q S RFL + D LLA GF L WL A + + NA+ Q+T
Sbjct: 568 NPNRFEQTSDRFLHELLRQDTLLAQSKGFTLERWLGQASRFGKTVYDRDLALKNAKMQLT 627
Query: 475 MWFDNTQEEASLLRDYGNKYWSGLLRDYY 503
W + + + DY K W+G+LR Y
Sbjct: 628 FWGPDWNPTTT-VHDYAAKEWAGMLRTLY 655
>gi|315500594|ref|YP_004089396.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
gi|315418606|gb|ADU15245.1| Alpha-N-acetylglucosaminidase [Asticcacaulis excentricus CB 48]
Length = 765
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 161/575 (28%), Positives = 247/575 (42%), Gaps = 76/575 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ P+PQ+W+ ++ LQ +IL R+ ELGM P+LPAF G VP A P A+I
Sbjct: 202 MGNIEGYLAPVPQAWIQKKHKLQSRILGRMKELGMTPILPAFGGYVPKAFAQKHPQARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ W TY LD DPLF +I FI + YG + Y D+F+E PP
Sbjct: 262 PMRPWEGFHE------TYWLDPADPLFAKIAARFIALYTETYGEGRY-YLADSFNEMLPP 314
Query: 121 VD-----------------------------SPEYISSLGAAIYSGMQSGDSDAVWLMQG 151
+ E +++ G AIY ++ DAVW MQG
Sbjct: 315 ISHDGSDVKNAKYGDSTANTKETETVVDPAVKAERLAAYGKAIYDSIRQARPDAVWTMQG 374
Query: 152 WLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNF 209
WLF D FW P + A L VP KL++LD+ + P +W +S F G P+I+ +HN+
Sbjct: 375 WLFGADKHFWTPDAIGAFLRDVPQDKLMILDIGNDRYPGVWQSSNAFQGKPWIYGYVHNY 434
Query: 210 AGNIEMYGIL----DSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHE 265
+ +YG L D I T + + G G+ EG+ N +VY+ ++A+
Sbjct: 435 GASNPVYGDLGFYRDDIRGLLARKDTGD---LKGFGLFPEGLHNNSIVYEYAYDLAWGQA 491
Query: 266 KVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSII 325
V W+ Y RYG+ PA+ AW+ ++ + P
Sbjct: 492 NQTVTEWLTTYLKSRYGQVTPALILAWSTYVEAAFSTRYWS---------------PRWW 536
Query: 326 SVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 385
G Y +P + + HP ++ RA++ + S S YR
Sbjct: 537 RSKAGAYLLCKRPTADMVEFEG------HPG---DRKKLRRAIDALL-SLKGFGGSALYR 586
Query: 386 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 445
+D+ID R +++ ++ + ++AY+ D L + LV +D L+ L
Sbjct: 587 HDVIDAVRHLVSEEIDDRLIAAMKAYKSGDVKTGDGLREEVIALVTQVDTLMGAQPD-TL 645
Query: 446 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 505
W++ A E++ Y NA+ Q+T+W L DY +K W GL +D+Y P
Sbjct: 646 ASWIDEASAYGDTSEEKAYYVMNAKAQVTVWGGKGN-----LNDYASKAWQGLYKDFYLP 700
Query: 506 RAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
R + S G F K + RE I W
Sbjct: 701 RWMKLLAALRASASGGAPFDQKTFTRELIDWEQAW 735
>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
Length = 1552
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 151/525 (28%), Positives = 259/525 (49%), Gaps = 45/525 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ ++ ++ + LQ+K+L R+ LG+ PV+ F G VP+ + FP+A++
Sbjct: 196 MGNLEGWGGPMSEALIEARYQLQRKMLQRMQALGIQPVVQGFPGLVPSFFKERFPAAQLV 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W P LL + LF ++ +A+ E ++ YGR D F E NT
Sbjct: 256 LQGRWGHFNRPP-----MLLPSDKDLFQQVAKAYYESLIRCYGRDFKFLGGDLFHEGGNT 310
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD +++ + S A W++QGW + P LL+ + ++
Sbjct: 311 KGVDVAATAAAVQQTMLRYFPS----AKWVLQGWNNNPSP--------TLLSKLDKQHVL 358
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEART-SENTTM 237
+++L E+ W +S +F G P++W +++F G +M G L + P A + ++N M
Sbjct: 359 LINLSGEIAASWESSNEFGGTPWLWGSVNHFGGKTDMGGQLPVLVAEPHRAFSQTKNGVM 418
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G+ EGI NPVVYDL + A+ D+ + Y RYG ++ AW++L H
Sbjct: 419 QGIGILPEGINSNPVVYDLALKTAWYTTTPDLDRLLRDYIAYRYGHVDESLVQAWHILSH 478
Query: 298 TVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
+VY EG +++ ++ + + S++ +
Sbjct: 479 SVYG---------------------EFKIKGEGTFESIF--CARPGLHVTSVSTWGPKQM 515
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y+ ++ +AL LF ++ S TY+YDL+DL RQ +A +A +++ ++AY+ DA
Sbjct: 516 QYNPKDLEKALGLFRRVADQYKGSATYQYDLVDLARQVMANHARDVYAAAMQAYRNKDAA 575
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
+ + + F+ L++ D LL FLLG WL A ++Q NA+ IT W
Sbjct: 576 LLHEKGQEFMHLLQLQDRLLQTDTHFLLGNWLAQAANYGVTAADKQQALHNAKMLITYWG 635
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGD 522
++ A+ + DY NK W+GLL+ YY PR +F + +S+ +G+
Sbjct: 636 PDS--AATRVHDYANKEWAGLLKSYYEPRWQKFFYALYQSVNTGE 678
>gi|374990497|ref|YP_004965992.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
gi|297161149|gb|ADI10861.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
Length = 1001
Score = 223 bits (569), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 154/563 (27%), Positives = 248/563 (44%), Gaps = 45/563 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ G+GGP+ L +++ L +KI R+ ELGM PV P + G VP + P A+
Sbjct: 174 LQNMSGYGGPVSPELLAKRIALGQKIAERLRELGMRPVYPGYFGTVPDGFVDRNPGARTV 233
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + + P W LD F ++ AF Q + +G ++ D E
Sbjct: 234 PQGTWNGL-ARPDW-----LDPRTESFGQVAAAFYRHQQELFGECD-LFKMDLLHEGGAA 286
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ A+ + +Q+ A W++ GW + + LL++V ++V+
Sbjct: 287 GDVP--VADAARAVETALQTARPGATWVILGWQAN--------PRRELLDAVNHDHMLVV 336
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D +++ I + + VPY + + NF G + A + R + +VG
Sbjct: 337 DGLSDLDSIGDREQDWGSVPYAFGTIPNFGGRTTIGAKTHIWARRFTQWRDKPGSKLVGT 396
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E + ++P ++L SE+A+++ VD W Y+ R G +DA+ L T Y
Sbjct: 397 AYMAEAVGRDPAAFELFSELAWRNTAVDRDEWFRTYADVRLGGRDERARDAYAALRDTAY 456
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
T + V A PDV T NY + +
Sbjct: 457 QITSSDGRPHDSVFSARPDV-------TARSGTNYATRIPA-----------------FD 492
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
++ AL + L S+ YR+DL D+ RQALA + L ++ +AY+ D
Sbjct: 493 LADFDPALAALLDVRPSLRDSDAYRHDLTDIARQALADRSWTLIPHLHDAYERKDLEAFR 552
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
L+R +L+L+ D + H GFLLGPWLE AK+LA +E + E ART IT W D
Sbjct: 553 TLARLWLKLMRLSDDMTGAHRGFLLGPWLEDAKRLASDEAEAAHLEHLARTLITTWADRV 612
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
+ L +Y N+ W+GL+ D++ P+ Y + ++L G R D W + W
Sbjct: 613 TADTGKLANYANRDWNGLIGDFHLPQWQSYLDELEDALAEGREPRDFD----WFAVEEPW 668
Query: 541 QNGRNVYPVESNGDALITSQWLY 563
R YPV DA T + +Y
Sbjct: 669 TRERKSYPVRPTTDAHRTGRRVY 691
>gi|197302378|ref|ZP_03167435.1| hypothetical protein RUMLAC_01107 [Ruminococcus lactaris ATCC 29176]
gi|197298557|gb|EDY33100.1| F5/8 type C domain protein [Ruminococcus lactaris ATCC 29176]
Length = 1655
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 166/580 (28%), Positives = 268/580 (46%), Gaps = 75/580 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ +W ++ L +K + + +LGM PVL +SG VP + + PSA++
Sbjct: 680 MANLSGYGGPVHDTWFTERTELARKNQLIMRKLGMQPVLQGYSGMVPVDITSKDPSAEVI 739
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W S + +L F + F + Q + YG ++H Y D F E NT
Sbjct: 740 KQGTWCSFQRPS------MLRTDSESFTKYAALFYKVQKEVYGDSAHYYATDPFHEGGNT 793
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGK-- 176
+DS + + + + M + D A W++Q W + ALL + +
Sbjct: 794 GGMDS----AVISQKVLASMMTADPHATWVIQSW--------QGNPTTALLQGLGDNRDH 841
Query: 177 LVVLDLFAEVKPIWSTSK-----------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
+VLDL+AE P W+ + +F P+++CML+NF G + ++G +D+ G
Sbjct: 842 ALVLDLYAEKTPHWNETNPGYYGGAEGGGEFLNTPWVYCMLNNFGGRLGLHGHIDNYVEG 901
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH-----EKVDVKAWINQYSVRR 280
V A + + M G+G++ E NPV+YDL E + +K+++ W Y RR
Sbjct: 902 IVNA-SKQAEHMAGIGITPEASVNNPVLYDLFFETIWADDGNNLQKINLDEWFKNYVTRR 960
Query: 281 YGRSVPAIQDAWNVLYHTVYNCTDGATDKN--RDVIVAFPDVDPSIISVTEGKYQNYGKP 338
YG + A +L+ TVYN + V+ A P +D G +G
Sbjct: 961 YGADSDSAYQAMEILHDTVYNPAYNMKGQGAPESVVNARPGLDI-------GAASTWGNA 1013
Query: 339 VSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK 398
V + Y ++ +A EL +A ++L S Y+YDL ++ Q L+
Sbjct: 1014 V-----------------VDYDKKKLEKAAELLLADYDKLKNSAGYQYDLANVLEQVLSN 1056
Query: 399 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN 458
A E + A++ DA LS +FL +++ ++ + FL+G W+ AK+LA+N
Sbjct: 1057 TAQEYQKKMAAAFRSGDAEEFSTLSDKFLSIIDMVEKVTGTQKEFLVGTWINGAKKLAKN 1116
Query: 459 EEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 516
+ ++ YE NAR+ IT W Q + L DY N+ W+GL DYY R + +
Sbjct: 1117 SDDFTKELYELNARSLITTWGSYDQAISGGLIDYSNRQWAGLTNDYYKMRWEKWITERKK 1176
Query: 517 SL--ESGDGFRLKDW-RREWIKLTNDWQNGRNVYPVESNG 553
L ES + +DW EW W G N Y NG
Sbjct: 1177 ELAGESYTNYSAQDWFEMEWA-----WARGTNKYSGTPNG 1211
>gi|289667570|ref|ZP_06488645.1| N-acetylglucosaminidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 798
Score = 222 bits (566), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 170/578 (29%), Positives = 259/578 (44%), Gaps = 86/578 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQ++IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQQQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG T Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFMELYTQAYG-TGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + SE + G G+ EG+ N VVY+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADSEKRNLRGFGIFPEGLHSNSVVYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ QY RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQPWSQWLTQYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + ++K + D L RA++ + + + Y
Sbjct: 536 WNKRAGAYLLFKRPTAD--IVKFDDRPGDPQRL-------RRAIDALLQQAERYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGF 443
RYDLI+ R L+ A+ +++AY D A G QL+ R +LV+ +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDVQLA-RITQLVQGLDALVGGQHE- 644
Query: 444 LLGPWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 502
L W A A N+ + Y NAR Q+++W + L DY +K W G+ D+
Sbjct: 645 TLADWTGQAAAAAGNDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADF 699
Query: 503 YGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
Y R + + ++G F +L W R W
Sbjct: 700 YLQRWTRFLSAYRAARKAGTPFEAAAVDQQLATWERHW 737
>gi|289663931|ref|ZP_06485512.1| N-acetylglucosaminidase [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 798
Score = 221 bits (563), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 170/578 (29%), Positives = 259/578 (44%), Gaps = 86/578 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQ++IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQQQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG T Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFMELYTQAYG-TGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQPQAIAAFLGKVPDARLLVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + SE + G G+ EG+ N VVY+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADSEKRNLRGFGIFPEGLHSNSVVYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ QY RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQPWSQWLMQYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + ++K + D L RA++ + + + Y
Sbjct: 536 WNKRAGAYLLFKRPTAD--IVKFDDRPGDPQRL-------RRAIDALLQQAERYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGF 443
RYDLI+ R L+ A+ +++AY D A G QL+ R +LV+ +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDVQLA-RITQLVQGLDALVGGQHE- 644
Query: 444 LLGPWLESAKQLAQNEEQ-EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 502
L W A A N+ + Y NAR Q+++W + L DY +K W G+ D+
Sbjct: 645 TLADWTGQAAAAAGNDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADF 699
Query: 503 YGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
Y R + + ++G F +L W R W
Sbjct: 700 YLQRWTRFLSAYRAARKAGTPFDAAAVDQQLATWERHW 737
>gi|317501265|ref|ZP_07959469.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
8_1_57FAA]
gi|316897332|gb|EFV19399.1| hypothetical protein HMPREF1026_01412 [Lachnospiraceae bacterium
8_1_57FAA]
Length = 1847
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 153/534 (28%), Positives = 255/534 (47%), Gaps = 72/534 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + +LGM PVL +SG VP + + +A++
Sbjct: 658 MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 717
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W S + +L T F + + F + Q + YG S+ Y D F E
Sbjct: 718 EQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 769
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
+ S + + + M + D DAVW++Q W + ALLN +
Sbjct: 770 ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 821
Query: 176 KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
++LDL+AE P + ++F P+++CML+NF G + ++G LD++A
Sbjct: 822 HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLA-N 880
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH------EKVDVKAWINQYSVR 279
+ +E + G+G++ E NPV+YD + E +Q E +D+ W++ Y+ R
Sbjct: 881 NIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATR 940
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
RYG + AW++L TVY + GA + V+ A P++ T G
Sbjct: 941 RYGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAAST 990
Query: 335 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 394
+G V + Y ++ A L +A ++L S Y+YDL ++ +Q
Sbjct: 991 WGNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQ 1033
Query: 395 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
L+ A E + A+ D S +F+ ++EDM+ + + FLLG W+E AK
Sbjct: 1034 VLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKA 1093
Query: 455 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 506
LA N + ++ YE+NA+ +T W Q E L+DY N+ WSGL+ D+Y R
Sbjct: 1094 LANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKDYSNRQWSGLIGDFYKAR 1147
>gi|153814573|ref|ZP_01967241.1| hypothetical protein RUMTOR_00787 [Ruminococcus torques ATCC 27756]
gi|331089988|ref|ZP_08338878.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
3_1_46FAA]
gi|145848067|gb|EDK24985.1| F5/8 type C domain protein [Ruminococcus torques ATCC 27756]
gi|330402902|gb|EGG82468.1| hypothetical protein HMPREF1025_02461 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 1863
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 153/534 (28%), Positives = 255/534 (47%), Gaps = 72/534 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + +LGM PVL +SG VP + + +A++
Sbjct: 674 MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 733
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W S + +L T F + + F + Q + YG S+ Y D F E
Sbjct: 734 EQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 785
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
+ S + + + M + D DAVW++Q W + ALLN +
Sbjct: 786 ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 837
Query: 176 KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
++LDL+AE P + ++F P+++CML+NF G + ++G LD++A
Sbjct: 838 HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLA-N 896
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH------EKVDVKAWINQYSVR 279
+ +E + G+G++ E NPV+YD + E +Q E +D+ W++ Y+ R
Sbjct: 897 NIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATR 956
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
RYG + AW++L TVY + GA + V+ A P++ T G
Sbjct: 957 RYGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAAST 1006
Query: 335 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 394
+G V + Y ++ A L +A ++L S Y+YDL ++ +Q
Sbjct: 1007 WGNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQ 1049
Query: 395 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
L+ A E + A+ D S +F+ ++EDM+ + + FLLG W+E AK
Sbjct: 1050 VLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKA 1109
Query: 455 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 506
LA N + ++ YE+NA+ +T W Q E L+DY N+ WSGL+ D+Y R
Sbjct: 1110 LANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKDYSNRQWSGLIGDFYKAR 1163
>gi|336439030|ref|ZP_08618649.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
1_1_57FAA]
gi|336017072|gb|EGN46842.1| hypothetical protein HMPREF0990_01043 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 1863
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 153/534 (28%), Positives = 255/534 (47%), Gaps = 72/534 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + +LGM PVL +SG VP + + +A++
Sbjct: 674 MANLSGFGGPVHDSWFEERTELARKNQLIMRKLGMQPVLQGYSGMVPTNIHDYDKNAEVI 733
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+ G W S + +L T F + + F + Q + YG S+ Y D F E
Sbjct: 734 EQGEWCSFQR------PTMLKTTSSTFEKYAKKFYQCQKEVYGDVSNYYATDPFHEG--G 785
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVP-----LG 175
+ S + + + M + D DAVW++Q W + ALLN +
Sbjct: 786 ITGGMNASDISEKVLTEMITADKDAVWIIQSW--------QGNPTTALLNGLDRVEKGTD 837
Query: 176 KLVVLDLFAEVKP----------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
++LDL+AE P + ++F P+++CML+NF G + ++G LD++A
Sbjct: 838 HALILDLYAEKDPHYDEGRPGAEAYGDEEEFDKTPWLFCMLNNFGGRLGLHGHLDNLA-N 896
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH------EKVDVKAWINQYSVR 279
+ +E + G+G++ E NPV+YD + E +Q E +D+ W++ Y+ R
Sbjct: 897 NIPKVFNETKYIAGIGITPEASVNNPVLYDFLFETIWQDDASQKMEVIDLDTWLDDYATR 956
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
RYG + AW++L TVY + GA + V+ A P++ T G
Sbjct: 957 RYGAESESANQAWDILKETVYKASLNGLGQGAPES---VVNARPNL-------TIGAAST 1006
Query: 335 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 394
+G V + Y ++ A L +A ++L S Y+YDL ++ +Q
Sbjct: 1007 WGNAV-----------------ISYEKGDLEEAAALLLADYDKLKDSAGYQYDLANVLQQ 1049
Query: 395 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
L+ A E + A+ D S +F+ ++EDM+ + + FLLG W+E AK
Sbjct: 1050 VLSNSAQEYQKGMSAAFSAKDLDSFKTYSEKFMSVIEDMEKVTGTSEYFLLGRWVEQAKA 1109
Query: 455 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 506
LA N + ++ YE+NA+ +T W Q E L+DY N+ WSGL+ D+Y R
Sbjct: 1110 LANNADDFTKELYEFNAKALVTTWGSKNQAEKGGLKDYSNRQWSGLIGDFYKAR 1163
>gi|261880009|ref|ZP_06006436.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
gi|270333325|gb|EFA44111.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
Length = 722
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 156/550 (28%), Positives = 243/550 (44%), Gaps = 69/550 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL+ W GPL +W Q+ LQ KI+ R+ LGM+P+ PAF+G VP P ++
Sbjct: 194 MGNLNSWNGPLTDAWQQGQITLQHKIIDRMRALGMHPIAPAFAGFVPEQFVEAHPGLQVK 253
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W D R Y+L P F +IGR F+E+ KE+G+ + Y D+F+E P
Sbjct: 254 KL-TWGGF--DDR-LNAYVLSPESPYFKQIGRLFVEEWEKEFGKNT-FYQSDSFNEMEIP 308
Query: 121 VDSPEYISS------LGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVP 173
V+ + I G IY + + DAVW+ QGW F Y W ++ALL VP
Sbjct: 309 VEPGDSIGKWKLLEQYGDVIYRSIAEANPDAVWVTQGWTFGYQHKMWDSKSLQALLRHVP 368
Query: 174 LGKLVVLDLFAEV-KPIWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFGP 226
K++++DL + K IW T + +YG +++ + NF G G + A
Sbjct: 369 DDKMLIIDLANDYPKWIWKTQQTWKVQHGYYGKQWVFSYVPNFGGKTLPTGDMQMYASAS 428
Query: 227 VEA-RTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
EA SE MVG G + EGIE N V+Y+L+++M + + VD+ WI Y RYG
Sbjct: 429 AEALHHSERGNMVGFGSAPEGIENNDVIYELLADMGWTDKAVDLDLWIKDYCEARYGGYP 488
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
+Q AW + +VY + ++P ++ + VS A+
Sbjct: 489 SDMQKAWQCMLRSVYGS-----------LYSYPRFTWQTVTPDSRR-------VSTHALN 530
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
+ S H F+ +L +S YR D I L L A+ +
Sbjct: 531 DTFLSGVAH----------------FLRCARQLGSSPLYRSDAISLASLYLGTKADRHYT 574
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQY 465
++ + ++L+ D LLA H L W++ A+ ++ +Y
Sbjct: 575 KALDLKASGKQQAASAELHQTIDLLTKADRLLASHPTHRLDRWIQFARNHGITTAEKNRY 634
Query: 466 EWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFR 525
E +A+ IT+W DY ++W+GL+ YY PR YF + +
Sbjct: 635 ESDAKRLITIW-------GGFQEDYAARFWNGLIAHYYIPRIRYYFDHGRPA-------- 679
Query: 526 LKDWRREWIK 535
L W +W+K
Sbjct: 680 LMQWEEQWVK 689
>gi|118370728|ref|XP_001018564.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila]
gi|89300331|gb|EAR98319.1| alpha-N-acetylglucosaminidase precursor [Tetrahymena thermophila
SB210]
Length = 879
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 171/551 (31%), Positives = 268/551 (48%), Gaps = 60/551 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+GGP+ Q+++D Q LQKKIL R+ LGM P+L F G VP +L+ FP +KI
Sbjct: 218 MGNLEGYGGPVTQAYIDGQYNLQKKILKRMRNLGMQPILQGFYGMVPNSLKAKFPLSKIY 277
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--T 118
+W + LDA D LF I F + K YGR + Y D F E
Sbjct: 278 GDQSWLGFRRPA------FLDANDELFSNIANIFYSESEKLYGR-AKFYGGDPFHEGAIV 330
Query: 119 PPVDSPEYISSLGAAIYSGMQ----SGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
P ++ ++S +IY MQ D W++Q W + P Q LL +
Sbjct: 331 PGLN----LTSQAQSIYRAMQYTDNPKDEKVKWILQSWQEN------PSQQ--LLQGLQN 378
Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
+ ++LDL AE + W T+ F G ++W L NF I YG+++ P A + +N
Sbjct: 379 DECIILDLMAEARSKWQTND-FSGHDFLWTSLPNFGLRIGQYGMIEQYVSQPPLAYSIKN 437
Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD--------VKAWINQYSVRRYG-RSV 285
+TM G+G EGI N + Y+++ + A+ D V ++ + RYG ++
Sbjct: 438 STMKGIGSIPEGILTNVLDYEILFDKAWIQPNQDTNLTPRQQVLQYLGDFIRYRYGEQNN 497
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA-- 343
+ AW++L +++YN T+ + V++A P +S Y + EA
Sbjct: 498 KNLFSAWSLLTNSIYNSTNPWDGPSESVMLARPASYIDKVSSWGTSYIYWNTTNVLEAWK 557
Query: 344 ----VLKSETSSYDHPHLWYSTSEVIRAL-------ELFIA-SGNELSA--SNTYRYDLI 389
+K + HL E+ + L E F+ S NE +T+ YDL+
Sbjct: 558 LFTNYVKEKKQKNRSQHL-QKLEEINKKLGRSDDDMEAFVEISQNEERNIFKDTFLYDLV 616
Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWL 449
D+ RQ LA Y+ L+ ++ A+ D S++FLEL++D D LL+ F+LG +L
Sbjct: 617 DVARQNLASYSYLLYNKVMLAFNQTDTIKFALYSQQFLELIKDQDQLLSSRKEFMLGYYL 676
Query: 450 ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAI 509
ES +L +++++ + + QIT+W D S L DY NK W+G+L+D+Y PR +
Sbjct: 677 ESVSKLGTTDQEKQNFIEQIKRQITVWSD----FPSDLHDYANKEWNGILKDFYLPRWEL 732
Query: 510 YFK----YMIE 516
YFK Y++E
Sbjct: 733 YFKSLQSYIVE 743
>gi|404403947|ref|ZP_10995531.1| alpha-N-acetylglucosaminidase [Alistipes sp. JC136]
Length = 828
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 169/554 (30%), Positives = 264/554 (47%), Gaps = 81/554 (14%)
Query: 1 MSNLHGW-GGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
M N+ G G P PQ W + Q+ LQ +I+ R+ LGM PV F+G VP A++ + P +
Sbjct: 184 MGNMSGLDGAPTPQ-WHEAQIALQHRIIDRMEALGMTPVYQGFAGFVPPAMKRIHPETTL 242
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE-NT 118
T+ W K+ ++L DPLF EIG AF+ +E+G+ + Y D+F+E +
Sbjct: 243 TET-KWSGFKN-------WMLSPLDPLFSEIGTAFVRAWEEEFGKGKY-YLIDSFNEMDV 293
Query: 119 P--PVDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSV 172
P P SPE ++L G IY + + DAVW+MQGW+F Y W P ++ALL
Sbjct: 294 PFGPKGSPERAATLRHYGETIYRSLAEANPDAVWVMQGWMFGYQRNSWDPESVRALLEGA 353
Query: 173 PLGKLVVLDLFAEVKP-IWSTSKQ------FYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
P G++++LDL + IW + K F+G +I+ + NF G + G L+ A G
Sbjct: 354 PDGRMMILDLAVDFNNFIWRSEKSWNHLQGFFGREWIYSTVPNFGGRTALIGNLEFYANG 413
Query: 226 PVEARTSENT-TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS 284
+EA +S N + G G S EG+E N +VY++++ + +++D+K +++ YS RYG
Sbjct: 414 HLEALSSPNRGRLTGYGTSPEGVESNEIVYEIIAAAGWSDDRIDLKKFLHDYSAARYGGC 473
Query: 285 VPAIQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
I W+ + + YN CT+ A +Y+ +P S
Sbjct: 474 PEGIDRFWSGMLQSSYNECTNNA------------------------RYRWQLRPYSHRM 509
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
+Y A+E F+A EL + YR D I LA A+ L
Sbjct: 510 PTMGINENY------------YTAIEQFLACAGELGGNELYRTDAIQYAALYLASKADML 557
Query: 404 FLNIIEAYQLNDAHG----VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
+EA D +G + + R EL+ D D LLA H L W A++ E
Sbjct: 558 ----LEAANWADLYGAREEAYDCAMRIEELLLDADRLLASHPLLRLDRWSGMARKAGCTE 613
Query: 460 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 519
E+++++ +R I++W + L DY + WSG++RDYY PR Y +E+
Sbjct: 614 EEKERFVGESRRLISVWGGPS------LSDYSARVWSGVIRDYYVPRLNKY----LEAKT 663
Query: 520 SGDGFRLKDWRREW 533
G F + W +W
Sbjct: 664 DGTVFDFRTWDEQW 677
>gi|308480701|ref|XP_003102557.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
gi|308261289|gb|EFP05242.1| hypothetical protein CRE_04113 [Caenorhabditis remanei]
Length = 718
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 142/526 (26%), Positives = 248/526 (47%), Gaps = 53/526 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN---VFPSA 57
M NL +GG L + + L K+I+ R+ ELG+ P+LP F+G VP L+ +FP++
Sbjct: 210 MGNLKAYGGGLSDAQMLNDFNLAKRIINRLLELGIVPILPTFAGFVPDQLEKDFRLFPTS 269
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYG-RTSHIYNCDTFDE 116
K +L W + S+ C + DPLF +IG F+ Q K G +++Y+ D F+E
Sbjct: 270 KFNRLPCWNNFTSET--SCLLSVSPFDPLFQKIGSTFLRHQKKMLGGDITNLYSADPFNE 327
Query: 117 NTPPVDSPEYISSL----GAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSV 172
P DS ++ +S +I + + D + +W++Q W F+YD W +K+ L++V
Sbjct: 328 -ILPSDSSKFDASFMKQTAQSIMNSCRKVDKNCIWVLQSWSFTYDQ-WPNWAIKSFLSAV 385
Query: 173 PLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS 232
P+G L++LDL++EV P W + F+G ++WC+LHNF G+ E+ G L + G A
Sbjct: 386 PIGNLLILDLYSEVVPAWQMTSSFHGHNFVWCLLHNFGGSRELRGNLQKVDKGYQLALMK 445
Query: 233 ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAW 292
+ +VG G+SME I+QN VVY M + + E + + W+ YS RY W
Sbjct: 446 AGSNLVGAGLSMEAIDQNYVVYQFMIDRMWSQEPIPLNNWLKSYSESRYSADFKVSHKFW 505
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
+L + Y+ + + V + Y +P + +
Sbjct: 506 TILAGSFYSQPEKWGNPRFSVFL-------------------YHRPAFAKKI-------- 538
Query: 353 DHPHLWYSTSEVIRALELFIAS-GNELSASNTYRYDLIDLTRQALA-KYANELFLNIIEA 410
W+ E L+ + S + L ++ DL D+ R + + NE L++ EA
Sbjct: 539 ---EYWFPVEETFNHLQSLMPSLMHVLGDHPLFKEDLNDVMRAVIQFEIGNEAALSLTEA 595
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 470
+ + D + +++ + ++ + W+E +K +A E+ + + A
Sbjct: 596 FLMEDKQQIGASCENLMDMFQKLES----YSNRDFKEWIEDSKSIAPTSEERQVFPVTAS 651
Query: 471 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIE 516
+T+W Q DY ++ W+GLL YYG R + +++E
Sbjct: 652 DILTVWGPTGQN-----LDYAHREWAGLLSGYYGRRWQYFCDWILE 692
>gi|331092442|ref|ZP_08341267.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330401285|gb|EGG80874.1| hypothetical protein HMPREF9477_01910 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 1598
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 155/575 (26%), Positives = 268/575 (46%), Gaps = 80/575 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL G+GGP+ SW +++ L +K + + LGM PVL +SG VP ++ SA++
Sbjct: 680 MANLSGFGGPIHDSWFEERTELARKNQLSMRRLGMQPVLQGYSGMVPTNIREKDSSAEVI 739
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN--- 117
+ G W S + +L F + + F + Q + YG ++H Y D F E
Sbjct: 740 EQGTWCSFRRPD------MLKTDSASFDKYAKLFYQAQKEVYGESAHYYATDPFHEGGDT 793
Query: 118 ---TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
P V + + M D D +W++Q W+ ALL +
Sbjct: 794 GGLNPTV--------IAGKVLDAMLEADKDGIWIIQS--------WQGNPTTALLKGLEG 837
Query: 175 GK--LVVLDLFAEVKPIWSTSK-------QFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
K +VLDL+AE P W+ + +F P+++CML+NF G + ++G LD++A
Sbjct: 838 RKEHALVLDLYAEKTPHWNETNPNEYGGGEFNDTPWVFCMLNNFGGRLGLHGHLDNLAKN 897
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQ---HEK---VDVKAWINQYSVR 279
+ A + M G+G++ E NP++YD + E + EK +D+ W+ Y+ R
Sbjct: 898 -IPAALNSAKHMEGIGITPEASVNNPLLYDFLFETVWTDNAKEKLPVIDLDKWLKDYAKR 956
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQN 334
RYG+ + +A ++ TVY GA + V+ A P +D S
Sbjct: 957 RYGKESQSAYEALLIMKDTVYKAELNMKGQGAPES---VVNARPALDIGAAST------- 1006
Query: 335 YGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQ 394
+G V + Y +++ +A EL + ++L S+ Y YDL + +Q
Sbjct: 1007 WGNAV-----------------ISYDKAKLEKAAELLLKDYDKLKDSDGYMYDLATMLQQ 1049
Query: 395 ALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ 454
L+ A E + A++ N+ + +FL +++ M+ + + +LLG W+E AK
Sbjct: 1050 VLSNSAQEYQRKMANAFKENNKEEFNTYADKFLSIIDSMEKVTSTSKYYLLGTWVEQAKA 1109
Query: 455 LAQNEEQ--EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFK 512
LA+N + + YE+NA+ +T W Q E L+DY N+ WSGLL+D+Y R + +
Sbjct: 1110 LAKNADDFTKDLYEFNAKALVTTWGSINQAEGGGLKDYSNRQWSGLLKDFYKVRWQKWIQ 1169
Query: 513 YMIESLESGDGFRLK--DWRREWIKLTNDWQNGRN 545
+ L+ + +W +W++ ++ N N
Sbjct: 1170 ARNDELDGKQPENINWFEWEWKWVRENTEYTNTPN 1204
>gi|401885538|gb|EJT49648.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
asahii CBS 2479]
Length = 781
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 156/575 (27%), Positives = 262/575 (45%), Gaps = 43/575 (7%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+HG W G WL+ Q LQK+IL R E GM PVLP F G VP L N
Sbjct: 238 NIHGNWHGTTTWQWLEGQHNLQKQILARQREFGMTPVLPGFCGFVPPELHNYIGGPDFKT 297
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W S ++ + +D + + AF+ +Q + YG TS Y D F E+ P
Sbjct: 298 YPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRKQKELYGFTSDYYMVDLFTESKPTS 355
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVL 180
P Y+ + A+ + + +A W+MQGW+F DP W KA L+ L+VL
Sbjct: 356 TDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVNDPKSWTETASKAFLDGAG-ESLLVL 414
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL AE P W K F+G ++WC L N+ N +YG LD ++A+ + + G+
Sbjct: 415 DLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLYGALDKWNHDIMDAK-ANGGRLSGM 473
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQDAWNVLYHTV 299
G+ EGI N +++L ++ + + +D+K W + RRY G+++ Q AW +L ++V
Sbjct: 474 GIVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNWVKRRYRGQNLDLAQKAWELLDNSV 533
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + A ++ D+ P++ + L T +Y + Y
Sbjct: 534 YKSNNTALKCTTRSLI---DLRPAV------------------SGLIGTTGNYLATAITY 572
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+V+ AL+ + S + + + YDL+D+ RQ A ++ +I A+ ++
Sbjct: 573 EPRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQVFVNAAIPIYQAMINAWNGSNKADT 631
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ R + L+ D+D L+A F L W+ A+ AQ+ + E+ AR Q+ +W
Sbjct: 632 EKYGRELVGLINDIDRLMATSRHFRLESWVGDARNWAQDAGAKDDMEFQARNQLILWGPA 691
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI----- 534
T L R Y K+W G++ + Y + ++ ++++ K W +
Sbjct: 692 TFAPWPLDR-YAAKHWHGIMSEVYAKGWELLYQNLLKT-------EPKAWNKTAFASELM 743
Query: 535 -KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
K+ W+N ++ GD++ + L KY Q
Sbjct: 744 EKVEKPWENVKSGGVQGPQGDSVAVIRELREKYKQ 778
>gi|406693970|gb|EKC97309.1| alpha-N-acetylglucosaminidase, putative [Trichosporon asahii var.
asahii CBS 8904]
Length = 781
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 156/575 (27%), Positives = 262/575 (45%), Gaps = 43/575 (7%)
Query: 3 NLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQ 61
N+HG W G WL+ Q LQK+IL R E GM PVLP F G VP L N
Sbjct: 238 NIHGNWHGTTTWQWLEGQHNLQKQILARQREFGMTPVLPGFCGFVPPELHNYIGGPDFKT 297
Query: 62 LGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPV 121
W S ++ + +D + + AF+ +Q + YG TS Y D F E+ P
Sbjct: 298 YPTWMSFPAE--YTKVRAIDPEWDTWNVVQSAFLRKQKELYGFTSDYYMVDLFTESKPTS 355
Query: 122 DSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVL 180
P Y+ + A+ + + +A W+MQGW+F DP W KA L+ L+VL
Sbjct: 356 TDPTYLKGIATAVRESIHAVAPNATWIMQGWIFVNDPKSWTETASKAFLDGAG-ESLLVL 414
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL AE P W K F+G ++WC L N+ N +YG LD ++A+ + + G+
Sbjct: 415 DLAAESYPQWKRLKNFFGRRWLWCTLINYGQNDGLYGALDKWNHDIMDAK-ANGGRLSGM 473
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY-GRSVPAIQDAWNVLYHTV 299
G+ EGI N +++L ++ + + +D+K W + RRY G+++ Q AW +L ++V
Sbjct: 474 GIVPEGINNNEHLFELATDQGWSSQAIDLKQWTQNWVKRRYRGQNLDLAQKAWELLDNSV 533
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y + A ++ D+ P++ + L T +Y + Y
Sbjct: 534 YKSNNTALKCTTRSLI---DLRPAV------------------SGLIGTTGNYLATAITY 572
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+V+ AL+ + S + + + YDL+D+ RQ A ++ +I A+ ++
Sbjct: 573 EPRDVVAALDNLLQSWSG-AGGQQFDYDLVDVARQVFVNAAIPIYQAMINAWNGSNKADT 631
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ R + L+ D+D L+A F L W+ A+ AQ+ + E+ AR Q+ +W
Sbjct: 632 EKYGRELVGLINDIDRLMATSRHFRLESWVGDARNWAQDAGAKDDMEFQARNQLILWGAA 691
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI----- 534
T L R Y K+W G++ + Y + ++ ++++ K W +
Sbjct: 692 TFAPWPLDR-YAAKHWHGIMSEVYAKGWELLYQNLLKT-------EPKAWNKTAFASELM 743
Query: 535 -KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
K+ W+N ++ GD++ + L KY Q
Sbjct: 744 EKVEKPWENVKSGGVQGPQGDSVAVIRELREKYKQ 778
>gi|403512485|ref|YP_006644123.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
[Nocardiopsis alba ATCC BAA-2165]
gi|402798758|gb|AFR06168.1| alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain protein
[Nocardiopsis alba ATCC BAA-2165]
Length = 718
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 148/514 (28%), Positives = 237/514 (46%), Gaps = 48/514 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL + GP+P+SW++ L +++L R LGM PVLP F+G+VP +L ++
Sbjct: 160 MGNLDHFAGPMPRSWIEGHRELGRRVLERQRALGMTPVLPGFTGHVPPSLAPGRTGSRTW 219
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q T++L TDPL+ + +E Q KE T H Y D F E P
Sbjct: 220 QG------------LVTHVLVPTDPLYTTLCAEIVETQ-KELFDTDHQYAIDPFIEMIPV 266
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P + + A G+ D AVW +Q W FSY FW P +++A L+++P L +
Sbjct: 267 DSDPGFPGLVARATIEGLTRADPRAVWFLQTWPFSYQSDFWSPERVEAFLDAIPDDHLHL 326
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNI----EMYGILDSIAFGPVEARTSENT 235
LDL+AE P WS F G P+ WC L NF G ++ G D I A E
Sbjct: 327 LDLWAEYDPQWSRFHAFGGTPWTWCALLNFGGRTDPMADLQGAADRIGAAKDSAHPPE-- 384
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV-PAIQDAWNV 294
G+G+SME NP ++L+ + A+ + W+ + +RYG PA+ + W
Sbjct: 385 ---GIGLSMEATRNNPAFFELVVDQAWTRTGRVEEEWLPDFVAQRYGPGHDPALLEGWRG 441
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
L TV + + FP+ ++++ + + + + + L++E ++
Sbjct: 442 LLRTVLGASG---------VRIFPEQFNGVLTL-----RPHYRHLEDSSALRAEVTAL-- 485
Query: 355 PHLWYSTSEVIRALELFIASG--NELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
+WY +++ A E +A + L+ +DL+D+ L++ A+ +L ++E
Sbjct: 486 --VWYPWPDLLAAWERLVAGAETDPLAVEGPLGHDLVDVAMAVLSRVADHRYLEMVEHLD 543
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
+ L RFLE+ +D+D LL + W A A E + NAR
Sbjct: 544 HHPELPEGDLE-RFLEVFDDLDALLETRPEYRYRTWEAKATSWATGTEDHRVLTDNARRI 602
Query: 473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 506
+T+W T + L DY + WSGL+ YY PR
Sbjct: 603 LTVW---TTLDDPRLDDYAGRLWSGLVGGYYRPR 633
>gi|229818803|ref|YP_002880329.1| alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
gi|229564716|gb|ACQ78567.1| Alpha-N-acetylglucosaminidase [Beutenbergia cavernae DSM 12333]
Length = 751
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 177/564 (31%), Positives = 274/564 (48%), Gaps = 37/564 (6%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M + +GGPLP SW +++ L ++IL R ELGM VLPAF G+VP L A+
Sbjct: 194 MGSTSSFGGPLPDSWFERRAELGRRILERQRELGMRAVLPAFGGHVPDGLGA---GARTH 250
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G FS T LL D F + F QQ + +G T H+Y D F E+ PP
Sbjct: 251 WQG--FS---------TALLGPDDDAFAVVAAEFARQQRELFG-TDHLYAADPFIESVPP 298
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
PE +++ AA Y+GM + D +A W+MQ W F Y FW ++ A+ ++VP +L++
Sbjct: 299 SGEPEDLAAFAAATYAGMSAADPEATWVMQAWPFHYHRRFWTAERIAAVTDAVPRDRLLL 358
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIA--FGPVEARTSENTTM 237
LDL+AE P+W + ++WC +HNF G ++G L +A G V +
Sbjct: 359 LDLWAEHAPVWDDGRGIAEHQWLWCAVHNFGGRFSVHGDLHGLARDLGGVLDDGARTGGF 418
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYG-----RSVPAIQDAW 292
GVGM+ME +E NPV Y+L++++ + E+ DV AW+ ++ +RYG + A+ AW
Sbjct: 419 TGVGMAMEALENNPVFYELLTDLVW--ERPDVDAWVGRFVDQRYGFADGTAARDAVHGAW 476
Query: 293 NVLYHTVYNCTDGATDKNRDVIVAFPD--VDPSIISVTEGKYQNYGKPVSKEAVLKSETS 350
+L T+Y G T ++A P V P G++ + PV A + +E
Sbjct: 477 AILLRTLYGP--GMTRSIPSPVIARPADVVAPFHTQRLAGEFLDPDAPVIVSANIDAEAD 534
Query: 351 SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEA 410
P + E+ RA L + A +DL DL +A+ I+ A
Sbjct: 535 ----PRVEGDLPEIARAAALLREAAGSSDAGGPLAHDLADLLTHVVAQRTRAPIRAIVAA 590
Query: 411 YQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNAR 470
+ DA V + D+D + A LLG WL +A++ A ++ + + +AR
Sbjct: 591 ARAGDADAVRANGALLAAAIADLDAVAATQPDRLLGTWLAAAQRWADDDGERRVLLRDAR 650
Query: 471 TQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWR 530
Q+T+W E+ S L DY ++WSGLL +Y PR ++ ++ E+ ESG ++ R
Sbjct: 651 RQLTVW----GEQTSGLHDYSGRHWSGLLGGFYAPRWQLWVDWLAEAAESGSEPDPQELR 706
Query: 531 REWIKLTNDWQNGRNVYPVESNGD 554
R + L W P + GD
Sbjct: 707 RAVVALEESWVARDETGPTDPAGD 730
>gi|384417770|ref|YP_005627130.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353460684|gb|AEQ94963.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 798
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 163/577 (28%), Positives = 258/577 (44%), Gaps = 84/577 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQQWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
V + +Y +++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAAYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG +AF ++A ++ + G G+ EG+ N VVY+ + +A++
Sbjct: 434 YGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ +Y RYGRS A+ AW L +Y + P
Sbjct: 491 PQHPWSQWLARYLRARYGRSDAALLSAWTDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + D P + RA++ + + + + Y
Sbjct: 536 WNTHAGAYLLFKRPTADIVNFD------DRPG---DPQRLRRAIDALLQQADRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGF 443
RYDLI+ R L+ A+ +++AY D A G QL+R +LV+ +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDALVGGQHET 645
Query: 444 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 503
L ++A + + Y NAR Q+++W + L DY +K W G+ D+Y
Sbjct: 646 LAAWTGQAAAAAGNDARLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFY 700
Query: 504 GPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
R + + ++G F +L W R+W
Sbjct: 701 LQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 737
>gi|326934230|ref|XP_003213195.1| PREDICTED: hypothetical protein LOC100549752 [Meleagris gallopavo]
Length = 650
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 100/207 (48%), Positives = 140/207 (67%), Gaps = 4/207 (1%)
Query: 33 LGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGR 92
LGM VLPAF+G+VP + VFP T+LGNW D C YLL +P+F IG
Sbjct: 4 LGMTTVLPAFAGHVPPGVLRVFPRINATRLGNWSHF--DCTLSCAYLLSPEEPMFQVIGT 61
Query: 93 AFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGW 152
F+++ +KE+G T HIY+ DTF+E +P P Y++ + A++ M D +A WLMQGW
Sbjct: 62 LFLKELIKEFG-TDHIYSADTFNEMSPLSSDPAYLAGITNAVFRAMTGADPEAQWLMQGW 120
Query: 153 LFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAG 211
LF + P FW+PPQ++A+L +VPLG+++VLDLFAE KP++ ++ FYG P+IWCMLHNF G
Sbjct: 121 LFQHQPAFWQPPQVQAVLRAVPLGRMIVLDLFAESKPVYEWTESFYGQPFIWCMLHNFGG 180
Query: 212 NIEMYGILDSIAFGPVEARTSENTTMV 238
N ++G +++I GP AR N+TMV
Sbjct: 181 NHGLFGAVEAINRGPFVARRFPNSTMV 207
>gi|386386798|ref|ZP_10071901.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
gi|385665738|gb|EIF89378.1| alpha-N-acetylglucosaminidase [Streptomyces tsukubaensis NRRL18488]
Length = 1033
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 153/559 (27%), Positives = 254/559 (45%), Gaps = 54/559 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ +GGPL ++ LD + L +KI R+ ELGM PVLP + G VP + P A++
Sbjct: 213 LQNMSEYGGPLSKTLLDARAELGRKITARLRELGMRPVLPGYFGTVPDGFADRNPGARVV 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W ++ P W LD +F ++ AF Q K +G ++ D E
Sbjct: 273 AQGLWNGLRR-PDW-----LDPRTTVFPKVAAAFYRHQTKLFG-ACDLFKMDLLHEGGNA 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ +++ +AVW++ GW + +ALL++V +++++
Sbjct: 326 GDVP--VPDAARAVEKALRTARPNAVWVILGWQSN--------PRRALLDAVDKRRMLIV 375
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D +++ ++ G PY + + NF G + D R + +VG
Sbjct: 376 DGLSDLDTTGDRESEWGGTPYAFGTIPNFGGRTTLGANTDRWTDRFTVWRDRPGSALVGT 435
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E E++P ++L SE+A++ E++D +AW +Y+ RYG + A+ L T Y
Sbjct: 436 AYMPEAAERDPAAFELFSELAWRRERIDREAWFTEYAQIRYGSDDASAAAAFGALAATAY 495
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
++ T+G+ Y + L S + P + +
Sbjct: 496 R-----------------------LASTDGR--PYDSHFLRRPSLTSSIGTAFDPAGFDT 530
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
A +A+G EL S+TYR+DL +L RQALA + L + A D
Sbjct: 531 ------AFAALLAAGPELRDSDTYRHDLTELARQALANRSRTLQFALRAARASKDVAAFR 584
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
+S +L+L+ D + CH FLLGPWLE AK+LA + + + E AR IT W D
Sbjct: 585 GVSALWLKLMRLADTMAGCHRSFLLGPWLEDAKRLATSPAEAVELERTARALITTWAD-- 642
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
+ A+ L +Y N+ W+GL+ D + P+ + + ++LE+G + DW + W
Sbjct: 643 RPAANALSNYANRDWNGLIADVHVPQWDAFLTEVADALEAGRAPKSFDWYPQ----EEAW 698
Query: 541 QNGRNVYPVESNGDALITS 559
R VYP GD T+
Sbjct: 699 TKDRRVYPSAPTGDPYATA 717
>gi|408676293|ref|YP_006876120.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
gi|328880622|emb|CCA53861.1| Alpha-N-acetylglucosaminidase [Streptomyces venezuelae ATCC 10712]
Length = 855
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 147/556 (26%), Positives = 237/556 (42%), Gaps = 48/556 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL G+ GP+ + ++ + L +I + LGM PVLP + G VP P A
Sbjct: 327 LQNLSGFAGPVSEQLIEARAALGARIARHLRSLGMTPVLPGYFGTVPPDFTARNPGAHTV 386
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W P W LD T P+F + + Q + +G S ++ D E P
Sbjct: 387 PQGRWVGF-GRPDW-----LDPTGPVFARLAAVYYRHQRQRFG-DSDMFKMDLLHEGGAP 439
Query: 121 --VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD +S+ A+ +++ A W+M GW + P ALL+ V +L+
Sbjct: 440 GTVD----VSAAAGAVQRALEAARPGATWVMLGWQLNPTP--------ALLHGVDRRRLL 487
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
++D ++ ++ G PY + + NF G+ + + ++ +
Sbjct: 488 IVDGLSDRYDELDRETRWGGTPYAFGTIPNFGGHTSIGANTGAWVSRFHAWLAKPDSALR 547
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+ E NPV + L +E+A+Q +D + W Y+ RRYG + AW L
Sbjct: 548 GIAYLPEATGTNPVAFGLFTELAWQPGPIDQQRWFAGYAARRYGGADRHAAAAWEALRLG 607
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y+ G+ + +D + A PS+ + T ++ +
Sbjct: 608 PYSMRTGSWSEPQDSLFA---ARPSLTASTAARWSPKA--------------------MR 644
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
Y + V RAL + L S+ YR+D++D+ RQAL A L I AY+ D
Sbjct: 645 YDAATVERALAELLRVAPRLRTSDAYRFDVVDVARQALTNRARVLLPRIRAAYEARDLDA 704
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
L R + E + L+ FL+GPWL +A+ + + + E++AR+ +T W D
Sbjct: 705 FRALVREWGAAEELLGRLVGSDRRFLVGPWLAAARSWGADPAERDRLEYDARSILTTWAD 764
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
E+ L DY N+ WSGL+RD Y PR A YF + +L +G D W +
Sbjct: 765 RVPSESGGLHDYANREWSGLVRDVYAPRWAAYFASLDRALVNGTEPVAID----WFARDD 820
Query: 539 DWQNGRNVYPVESNGD 554
W G YP +GD
Sbjct: 821 AWARGHRSYPTLPSGD 836
>gi|187735714|ref|YP_001877826.1| alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
gi|187425766|gb|ACD05045.1| Alpha-N-acetylglucosaminidase [Akkermansia muciniphila ATCC
BAA-835]
Length = 852
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 152/576 (26%), Positives = 261/576 (45%), Gaps = 52/576 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQ-NV-FPSAK 58
M NL G GGPL Q +++ + ++I+ R+ +LGM PVL + G VP+ Q NV K
Sbjct: 198 MGNLEGHGGPLSQQQINKMAQMGRRIVSRMEQLGMTPVLQGYVGFVPSDFQENVRIDGLK 257
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ G W + + +++D T F ++ + + K YG ++ D F E
Sbjct: 258 LIPQGEWVNFRR------PWVVDPTCEAFPKLAADWYKALRKVYGIPGKMFGGDLFHEGG 311
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
D ++ + MQ A W++Q W +P + LL+ + + +
Sbjct: 312 RKGDID--VTQAAQEVQKAMQKASPGAFWVIQAW--GGNP------TRELLSGLDPERAL 361
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
VL L ++ + F G+P++WC L NF GN MYG + ++ E ++ +V
Sbjct: 362 VLQLTKDMANGGKNLRTFNGIPWVWCELANFGGNTGMYGGVPLLSRLGSELSGYKDKGLV 421
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+G EG+E NP+ Y L S+ + E + V+ W+ +Y+ +RYG + A+ A VL +
Sbjct: 422 GMGTLSEGLETNPLHYALFSDRLWTREDISVREWLGKYARQRYGFAPKAVVKALEVLSFS 481
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
+YN +I A P + + S++ +
Sbjct: 482 IYNPVRSQEGCTESIICARPSWN------------------------VRKASTWSSGERY 517
Query: 359 YSTSEVIRALELFIASGNE---LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
Y ++++A ++ + N+ L T+RYDL+D+ RQALA A + A+ D
Sbjct: 518 YHLGDIVKAARGYLKAANDQPNLVKKETFRYDLVDVVRQALADAAFYQLQQVRSAFDSGD 577
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+ +RFL L+ DMD LLA FLLG W + A + +++ + +A+ IT
Sbjct: 578 LAAYRKQVKRFLSLISDMDALLATDSQFLLGTWQKRALDWGDSRQEKALMDKSAKMLITT 637
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W D L DY N+ W+GL+ D+Y PR +F++ ++ L +G R K
Sbjct: 638 WIDQVPRS---LNDYSNRQWAGLVSDFYLPRWKNFFEFQMDVL-TGKKTRDAAHAAFMDK 693
Query: 536 LTND---WQNGRNVYPVESNGDALITSQWLYNKYLQ 568
+ D + +Y V+ GD L + + N + +
Sbjct: 694 MVRDELAFAGNGKIYSVKPAGDTLAVANRVMNTHRE 729
>gi|302526099|ref|ZP_07278441.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
gi|302434994|gb|EFL06810.1| alpha-N-acetylglucosaminidase [Streptomyces sp. AA4]
Length = 860
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 146/561 (26%), Positives = 246/561 (43%), Gaps = 52/561 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN------VF 54
+ N+ + GP+ LD ++ + KK++ R+ +LGM PVLP + G VP +
Sbjct: 200 LQNMASFTGPVSPQLLDARVAMAKKVITRLKDLGMTPVLPGYFGTVPRGFADKSKKADAS 259
Query: 55 PSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTF 114
A++ G W P W LD + ++ AF + Q +G TS +Y D
Sbjct: 260 SDARVIGQGTWVGFDR-PDW-----LDPRTSSYRKVAAAFYQAQHDLFGDTS-MYKMDLL 312
Query: 115 DENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPL 174
E D P + + + +Q+ A W++ GW + PP +A++++V
Sbjct: 313 HEGGKSGDVP--VGDAARGVMTALQTARPGATWVLLGWQNN------PP--RAIVDAVDK 362
Query: 175 GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
KL V+D ++ Q+ PY + ++NF G+ + + RT +
Sbjct: 363 SKLFVVDGLSDRYGQRDPDSQWNNTPYAFGTIYNFGGHTTIGANTGVWTQRFPQWRTKQG 422
Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNV 294
+ + G+ EG NP ++L +E+A++ + AW Y+ RRYG AW++
Sbjct: 423 SALTGIAYLPEGTGTNPAAFELFTELAWRQTPIHQAAWFADYASRRYGGPDTRAATAWDL 482
Query: 295 LYHTVYNC-TDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
L T Y+ G ++ + A P++D + +++
Sbjct: 483 LRQTAYSMPASGWSEAQDSLYAARPNLD------------------------AATAATWS 518
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
L Y + +AL+ + L ++ YR+DL+D+ RQAL + L I AY
Sbjct: 519 PASLRYQQATFGKALDELLNVDPALRGTDAYRFDLVDVARQALTNTSRTLLPQIKTAYTN 578
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQI 473
D L+ R++ + +D LLA FLLGPWLE+AK A + ++ + E++AR+ I
Sbjct: 579 RDRTQFTTLTSRWMSNMTLLDKLLATDSRFLLGPWLEAAKSWAGTDTEQARLEYDARSLI 638
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 533
T W + L DY N+ WSGL+ D+Y R YF + ++ +G D W
Sbjct: 639 TTWGPRAGSDDGRLHDYANREWSGLVSDFYAKRWKQYFDSLNTAMNTGGQPASID----W 694
Query: 534 IKLTNDWQNGRNVYPVESNGD 554
+ W RN YP GD
Sbjct: 695 FAAEDGWAKQRNPYPTTPAGD 715
>gi|302546018|ref|ZP_07298360.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
[Streptomyces hygroscopicus ATCC 53653]
gi|302463636|gb|EFL26729.1| LOW QUALITY PROTEIN: putative alpha-N-acetylglucosaminidase
[Streptomyces himastatinicus ATCC 53653]
Length = 679
Score = 216 bits (549), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 153/550 (27%), Positives = 238/550 (43%), Gaps = 49/550 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ WGGP+ + L+++ L ++I R+ ELGM PV P + G VP + P A
Sbjct: 177 MQNMSEWGGPVSTALLEKRTDLGRRIADRLRELGMRPVFPGYFGTVPDGFADRNPGAHTV 236
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G+W ++ P W LD F E+ AF Q +G ++ D E N
Sbjct: 237 PQGDWNGLRR-PDW-----LDPRTDAFHEVAAAFYRHQHDLFG-ACDLFKMDLLHEGGNA 289
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
V P+ A+ +Q+ A+W++ GW + + LL++V ++
Sbjct: 290 GDVSVPDAAR----AVEKALQTSRPGAIWVILGWQSN--------PRRDLLDAVDHDHML 337
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
V+D +++ I K + VPY + + NF G + R + +V
Sbjct: 338 VVDGLSDLDTITDREKDWGSVPYAFGTIPNFGGRTTIGAKTHMWTERFTVWRDKPGSKLV 397
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G E +E++P Y+L SE+A++ VD AW Y+ RYG ++A+ L T
Sbjct: 398 GTAYMPEAVERDPAAYELFSELAWRDTAVDRDAWFRDYADVRYGARDAKAREAFAALRDT 457
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y + + V A P S+T NY + T ++D
Sbjct: 458 AYQISSKDGRPHDSVFAARP-------SLTARSGTNYA----------THTPAFD----- 495
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
+ AL + L S+ YRYDL D RQALA + +L + +AY D
Sbjct: 496 --PARFDAALAALLGVRAGLRDSDAYRYDLADTARQALANRSWQLIGQLADAYARKDLDT 553
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
LSR +L+L+ D + H LLGPWLE AK++A E+ Q E+ AR IT W D
Sbjct: 554 FRALSRLWLKLMRLSDDITGTHRLLLLGPWLEDAKRMASGAEESAQLEFAARALITTWAD 613
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
+ L +Y N+ W+GL+ D++ P+ Y + ++L G R D W +
Sbjct: 614 RGAADPGKLANYANRDWNGLIGDFHVPQWQTYLDELEDALAEGRAPRTFD----WYTVEE 669
Query: 539 DWQNGRNVYP 548
W R YP
Sbjct: 670 PWTRERKSYP 679
>gi|390989490|ref|ZP_10259787.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372555759|emb|CCF66762.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 798
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 162/577 (28%), Positives = 251/577 (43%), Gaps = 84/577 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + Y S+ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N V+Y+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + D P + RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGF 443
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGGQHET 645
Query: 444 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 503
L ++A + + Y NAR Q+++W + L DY +K W G+ D+Y
Sbjct: 646 LADWTGQAAAATGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFY 700
Query: 504 GPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
R + + ++G F +L W R+W
Sbjct: 701 LQRWTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737
>gi|194695302|gb|ACF81735.1| unknown [Zea mays]
Length = 173
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 100/152 (65%), Positives = 119/152 (78%)
Query: 422 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQ 481
L + FL LV D+D LL+ H+GFLLGPWLESAK LA+N EQE QYEWNARTQITMWFDNT+
Sbjct: 7 LCQHFLSLVNDLDTLLSSHEGFLLGPWLESAKGLARNSEQEIQYEWNARTQITMWFDNTE 66
Query: 482 EEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQ 541
+ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+ F LK+WRREWI LTN+WQ
Sbjct: 67 TKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWISLTNNWQ 126
Query: 542 NGRNVYPVESNGDALITSQWLYNKYLQGTGVF 573
+ R V+ + GD L SQ LY KYL +
Sbjct: 127 SDRKVFSTTATGDPLNISQSLYTKYLSNADLL 158
>gi|21241480|ref|NP_641062.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
gi|21106823|gb|AAM35598.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. citri str. 306]
Length = 798
Score = 213 bits (542), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 160/577 (27%), Positives = 249/577 (43%), Gaps = 84/577 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASRAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N V+Y+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + D P + RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGF 443
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALVGGQHET 645
Query: 444 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 503
L ++A + + Y NAR Q+++W + L DY +K W G+ D+Y
Sbjct: 646 LADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFY 700
Query: 504 GPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
R + + ++G F +L W R+W
Sbjct: 701 LQRWTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737
>gi|381169859|ref|ZP_09879021.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689629|emb|CCG35508.1| alpha-N-acetylglucosaminidase family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 798
Score = 212 bits (539), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 160/577 (27%), Positives = 249/577 (43%), Gaps = 84/577 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N V+Y+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWES 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + D P + RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLRQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGF 443
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALIGGQYET 645
Query: 444 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 503
L ++A + + Y NAR Q+++W + L DY +K W G+ D+Y
Sbjct: 646 LADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFY 700
Query: 504 GPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
R + + ++G F +L W R+W
Sbjct: 701 LQRWTRFLSAYRAARKAGTPFDAVAVDHQLATWERQW 737
>gi|418515337|ref|ZP_13081518.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|410708056|gb|EKQ66505.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 782
Score = 211 bits (538), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 161/577 (27%), Positives = 249/577 (43%), Gaps = 84/577 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 185 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 245 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 297
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 298 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 357
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 358 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 417
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N V+Y+ + +A++
Sbjct: 418 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 474
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW+ L +Y + P
Sbjct: 475 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 519
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + A D P + RA++ + N + + Y
Sbjct: 520 WNKRAGAYLLFKRPTADIADFD------DRPG---DPQRLRRAIDALLQQANRYADAPLY 570
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGF 443
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+
Sbjct: 571 RYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALVGGQYET 629
Query: 444 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 503
L ++A + + Y NAR Q+++W + L DY +K W G+ D+Y
Sbjct: 630 LADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFY 684
Query: 504 GPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
R + + +G F +L W R+W
Sbjct: 685 LQRWTRFLSAYRAARMAGTPFDAVAMDHQLATWERQW 721
>gi|418520969|ref|ZP_13087015.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410702945|gb|EKQ61442.1| N-acetylglucosaminidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
Length = 798
Score = 211 bits (538), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 161/577 (27%), Positives = 249/577 (43%), Gaps = 84/577 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKLARRFLELYAQTYG-AGEFYLADAFNEMLPP 313
Query: 121 V------------------------------DSPEYISSLGAAIYSGMQSGDSDAVWLMQ 150
V ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAARYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADRQFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N V+Y+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRHDLQALLADPDKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW+ L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLSAWSDLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + A D P + RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTADIADFD------DRPG---DPQRLRRAIDALLQQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLACHDGF 443
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYDAGDFARGDAQLART-TQLVRGLDALVGGQYET 645
Query: 444 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 503
L ++A + + Y NAR Q+++W + L DY +K W G+ D+Y
Sbjct: 646 LADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYADFY 700
Query: 504 GPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
R + + +G F +L W R+W
Sbjct: 701 LQRWTRFLSAYRAARMAGTPFDAVAMDHQLATWERQW 737
>gi|293369246|ref|ZP_06615836.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
gi|292635671|gb|EFF54173.1| alpha-N-acetylglucosaminidase (NAGLU) [Bacteroides ovatus SD CMC
3f]
Length = 521
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 111/301 (36%), Positives = 172/301 (57%), Gaps = 6/301 (1%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GW PLP+ WL Q LQ++I+ R E M PVLPAF+G+VPAAL+ V+P+ K T
Sbjct: 196 MCNLDGWQSPLPKEWLSSQAALQEQIVAREREFNMRPVLPAFAGHVPAALKRVYPNIKTT 255
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R CT+L + D L+ I + ++ +Q + YG T+HIY D F+E PP
Sbjct: 256 RVSEWGGFADQYR--CTFL-NPMDSLYAIIQKEYLTEQTRLYG-TNHIYGIDPFNEIDPP 311
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVV 179
+ + + IY + + D +AVWL WLF D W P++K+ L SVP +L++
Sbjct: 312 SWDADSLGMMAKHIYESVAAVDPEAVWLQMTWLFYADIKHWTTPRIKSYLRSVPQDRLIL 371
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LD F E IW + ++G PY+WC L NF GN + G ++ ++ +A + + + G
Sbjct: 372 LDYFCEYTEIWKQTDSYFGQPYLWCYLGNFGGNSFLSGPVNLVSERLADALKNGGSNLKG 431
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VG ++EGI+ N +Y+ + + A+ + D K W + + RR G+ P + AW +L + V
Sbjct: 432 VGSTLEGIDLNQFMYEFVLDKAWNGGQTD-KEWFFKLADRRIGKISPEARKAWEILANKV 490
Query: 300 Y 300
Y
Sbjct: 491 Y 491
>gi|374985456|ref|YP_004960951.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
gi|297156108|gb|ADI05820.1| alpha-N-acetylglucosaminidase [Streptomyces bingchenggensis BCW-1]
Length = 1039
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 157/559 (28%), Positives = 244/559 (43%), Gaps = 53/559 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ G+GGP+ + +DQ+ L KKI+ R+ ELGM PVLP + G VP P A +
Sbjct: 214 LQNMSGFGGPVSKHLIDQRAALAKKIINRVRELGMTPVLPGYYGTVPDDFLAKNPGASLV 273
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + K P W LD LF E+ AF Q + YG +S +Y D E P
Sbjct: 274 AQGTWGAFKR-PDW-----LDPRTDLFAEVAAAFYRHQRERYGDSS-MYKMDLLHEGGNP 326
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ + +Q + AVW + GW +P + +L +V ++V+
Sbjct: 327 GDVP--VGEAAKAVEAALQKAHAGAVWAILGW--QTNP------SREILGAVDKSMMLVV 376
Query: 181 DLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D ++ + + G PY + + NF G+ + + R + + G
Sbjct: 377 DGLSDRYTTVIDRESDWDGTPYAFGSIWNFGGHTPIGANAPDWVEQYPKWRDKTGSALTG 436
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ M EG + NP L +++A+ + + W Y+V RYG P AW + T
Sbjct: 437 IAMMPEGADNNPAAMALFTDLAWTPGAIGLDDWFASYAVSRYGGEDPHAVAAWKAIRDTA 496
Query: 300 YNCTDGATDKNRDVIVAFPD----VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
YN + D PD PS+ +K A E YD
Sbjct: 497 YNMS------RADAWSEAPDGLFGARPSL-------------GANKAAAWGPEADRYDTT 537
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
+ +E+++ +A G L S+ Y YDL D+ RQ L+ + L I AY+ D
Sbjct: 538 AFDAALTELLQ-----VAPG--LRDSSAYAYDLADVARQVLSNRSRVLLPQIKTAYEAGD 590
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+L++ +L ++ MD +LA LLG WL A+ ++ Q E++AR+ IT
Sbjct: 591 RGRFDRLTKTWLSWMKLMDKVLATSGQHLLGRWLADARSWGATRAEKDQLEYDARSIITT 650
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W E L DY N+ WSGLL Y R YF + +L +G + +W
Sbjct: 651 WGGRASSEEG-LHDYANREWSGLLGGLYHLRWKTYFDELSTALAAG----RQPAGIDWFA 705
Query: 536 LTNDWQNGRNVYPVESNGD 554
L + W + YPV ++GD
Sbjct: 706 LEDHWARRHDSYPVRTSGD 724
>gi|295690503|ref|YP_003594196.1| alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
gi|295432406|gb|ADG11578.1| Alpha-N-acetylglucosaminidase [Caulobacter segnis ATCC 21756]
Length = 770
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 163/589 (27%), Positives = 249/589 (42%), Gaps = 78/589 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLP +W+D++ LQ +IL R+ LGM P+LPAF G VP A P A+I
Sbjct: 202 MGNIEGYRAPLPTNWIDKKKDLQVQILGRMRSLGMTPILPAFGGYVPKAFAQKNPKARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I F+ + YG T Y D+F+E PP
Sbjct: 262 RMRPWEGFHE------TYWLDPADPLFAKIAGRFLALYTQTYG-TGTYYLADSFNEMLPP 314
Query: 121 VD-------------------------------SPEYISSLGAAIYSGMQSGDSDAVWLM 149
++ + +++ G AIY ++ DAVW+M
Sbjct: 315 INADGADARDAAYGDGAANTAATKTKVEVDPALKAQRLAAYGKAIYDSIRQARPDAVWVM 374
Query: 150 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 207
QGWLF D FW P + A L+ VP KL++LD+ + P +W +K F G P+I+ +H
Sbjct: 375 QGWLFGADSHFWDPTAISAYLSLVPDDKLMILDIGNDRYPAVWKNAKAFGGKPWIYGYVH 434
Query: 208 NFAGNIEMYGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 266
N+ G+ +YG LD P A E + G GM EG+ N +VYD + ++A+ +
Sbjct: 435 NYGGSNPVYGDLDYYRRDIPAIAANPEAGKLAGFGMFPEGLHNNSIVYDAVYDLAWGAGR 494
Query: 267 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 326
+ AW++ Y+ RYG++ P + A L Y+ + P
Sbjct: 495 ESLSAWLSTYARARYGKTSPELDAALGQLVEAAYSTRYWS---------------PRWWK 539
Query: 327 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRAL-ELFIASGNELSASNTYR 385
G Y + +P + HP ++AL L A NE +
Sbjct: 540 SKAGAYLFFKRPTATIGEFPP------HPGDRAKLEAAVKALTALAPAYANE----PLFV 589
Query: 386 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLL 445
DL D TR ++L + AY+ D Q L +D LL L
Sbjct: 590 LDLTDATRHLATMKIDDLLQAAVAAYRRGDVASGDQARVEIAALALSIDKLLGVQPE-TL 648
Query: 446 GPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGP 505
W++ A+ Y NA+ Q+T+W L DY +K W GL R +Y P
Sbjct: 649 ATWIDDARAYGDTPADAAAYVANAKAQVTVWGGEGN-----LNDYASKAWQGLYRGFYLP 703
Query: 506 RAAIYFKYM----IESLESGDGFRLK-DWRREWIKLTNDWQNGRNVYPV 549
R +++ + + + R W R W+ ++ + PV
Sbjct: 704 RWSMFLDALKAAGTGTFDEPAAVRASIAWERAWVDAEVAYRREKPADPV 752
>gi|294667089|ref|ZP_06732314.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 10535]
gi|292603099|gb|EFF46525.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 10535]
Length = 798
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 166/579 (28%), Positives = 254/579 (43%), Gaps = 88/579 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQHWTDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF ++A ++ + G G+ EG+ N V+Y + +A++
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVIYAYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + D P + RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTADIVDFD------DRPG---DPQRLRRAIDALLQQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLA-CHDG 442
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+ HD
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGDQHD- 644
Query: 443 FLLGPWL-ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRD 501
L W ++A + + Y NAR Q+++W + L DY +K W G+ D
Sbjct: 645 -TLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYAD 698
Query: 502 YYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
+Y R + + ++G F +L W R+W
Sbjct: 699 FYLQRWTRFLSAYRAARKAGTPFDAVTVDHQLAAWERQW 737
>gi|329851961|ref|ZP_08266642.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
biprosthecum C19]
gi|328839810|gb|EGF89383.1| alpha-N-acetylglucosaminidase NAGLU family protein [Asticcacaulis
biprosthecum C19]
Length = 731
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 157/574 (27%), Positives = 249/574 (43%), Gaps = 88/574 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLP SW+ ++ LQK+IL + ELGM P+LPAF+G VP A P A+I
Sbjct: 182 MGNIEGYQAPLPLSWIVKKRELQKRILGAMRELGMEPILPAFAGYVPKAFAESHPQARIY 241
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ F++ + YG+ Y D F+E PP
Sbjct: 242 RMRAWEGFHE------TYWLDPADPLFAKLAGRFLDLYDQTYGK-GRFYLADAFNEMLPP 294
Query: 121 V-DSP------------------------EYISSLGAAIYSGMQSGDSDAVWLMQGWLFS 155
V D P E +++ G ++ ++S DAVW+MQGWLF
Sbjct: 295 VGDGPVEGGYGDSTANKEAVAEVDPAVKAERLAAYGQRLHDSIRSARPDAVWVMQGWLFG 354
Query: 156 YDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNI 213
D FW + A L +VP L+VLD+ + P + T++ F+G +I+ +HN+ +
Sbjct: 355 ADQGFWTGDAIAAFLRNVPDDGLMVLDIGNDRYPKVRQTAQAFHGKGWIYGYVHNYGASN 414
Query: 214 EMYGILD-------SIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 266
+YG L +I P R + G G+ EG++ N +VY + ++A+
Sbjct: 415 PIYGDLGFYRRDMAAITSDPARGR------LQGFGVFPEGLDSNSIVYAYLYDLAWNGGT 468
Query: 267 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 326
+ W+ Y+ RYG S P + AW + VY R A+ I+
Sbjct: 469 KSLSDWLAGYTRARYGISSPEVVTAWLDIVKGVYGTRYWTPRWWRSTAGAYLLCKRPDIA 528
Query: 327 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 386
+ + + + + + + +D P L RY
Sbjct: 529 MADFEGAPGDRAALRAGLARLAAIRHDSPLL---------------------------RY 561
Query: 387 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLS---RRFLELVEDMDGLLACHDGF 443
D+I+ TR + + + L + AY+ D + + RR ++D+ G CH
Sbjct: 562 DVIEFTRHLASLHLDNLIRTALVAYRDGDVAAGDRSATEVRRVTIAIDDLMGAQPCH--- 618
Query: 444 LLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYY 503
L W+E A+ ++ YE NAR Q+T+W L DY +K W GL RD+Y
Sbjct: 619 -LAGWIEQARAYGDTATEKPYYERNARAQVTVWGGKGN-----LHDYASKAWQGLYRDFY 672
Query: 504 GPRAAIYFKYMIESL--ESGDGFRLKDWRREWIK 535
PR + F + + RL W W++
Sbjct: 673 LPRWEMLFAALRTGTYNPAATTERLIAWENAWVE 706
>gi|418473272|ref|ZP_13042874.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
coelicoflavus ZG0656]
gi|371546106|gb|EHN74664.1| putative alpha-N-acetylglucosaminidase, partial [Streptomyces
coelicoflavus ZG0656]
Length = 716
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 145/548 (26%), Positives = 240/548 (43%), Gaps = 44/548 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ GP+ + ++Q+ L ++I R+ ELGM PVLP + G VP P +
Sbjct: 213 MQNMSGFAGPVSERLIEQRAALGRRIANRLRELGMTPVLPGYYGTVPPDFTARNPGGTVV 272
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD +F + +F Q + +G S +Y D E P
Sbjct: 273 PQGQWVGFER-PDW-----LDPRTGVFSRVAASFYRHQRELFG-DSTMYKMDLLHEGGRP 325
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P + A+ + +Q+ AVW + GW + P + ++++V +L+++
Sbjct: 326 GNVP--VGDAARAVMNALQTARPGAVWTLIGWQNN-------PSTQ-IIDAVDKSRLLIV 375
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D ++ ++G PY + + NF G+ + A RT + + G+
Sbjct: 376 DGLSDRYDGLDRETAWHGAPYAFGTIPNFGGHTTVGANTAVWAERFDRWRTEPGSALAGI 435
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
EG NPV Y+L +E+A++ E VD W Y+ RRYGR P AW +L Y
Sbjct: 436 AYLPEGTGGNPVAYELFTELAWRTEPVDHSGWFAAYAERRYGRPDPHAARAWELLRTGPY 495
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ G + +D + ++ + + +S+ + Y
Sbjct: 496 SMPSGTWSEAQDSLF-----------------------TARPRLTATSAASWSPGAMRYD 532
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
V AL + L ++ YR+DL+D+ RQALA + L I AY D
Sbjct: 533 PDTVRAALAELLKVAPALRTTDAYRFDLVDVARQALANRSRSLLPEIKAAYDAGDLSRFR 592
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
+ + + ++ +D LLA FLLGPWL A+ + ++ E++AR+ +T W +
Sbjct: 593 AGAAEWKDDLDLLDRLLATDSRFLLGPWLADARSWGRTAAEKDAAEFDARSLLTTWGHRS 652
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 540
+A LRDY N+ WSGL+ D+Y R Y + +L +G D W L N W
Sbjct: 653 GSDAGGLRDYANREWSGLVSDFYAMRWTTYLDSLDTALVTGRPPAAID----WFSLENAW 708
Query: 541 QNGRNVYP 548
+ YP
Sbjct: 709 NQRHDDYP 716
>gi|372221472|ref|ZP_09499893.1| alpha-N-acetylglucosaminidase [Mesoflavibacter zeaxanthinifaciens
S86]
Length = 712
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 163/550 (29%), Positives = 244/550 (44%), Gaps = 57/550 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++G GPLPQ W+ ++ LQKKIL ++ +LGM PV+PAFSG +PAAL FP+AKI+
Sbjct: 200 MGNINGHAGPLPQEWITKKAKLQKKILSKMRDLGMKPVVPAFSGYIPAALAEKFPNAKIS 259
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L W D TYLLD DPLF EIG+ FIE +EYG+ + Y D+F+E TPP
Sbjct: 260 ELNGWSGGGFD----STYLLDPKDPLFKEIGKRFIELYNQEYGKAEY-YLADSFNEVTPP 314
Query: 121 VDSPEYISSL---GAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGK 176
V + + L G IY + A W+MQGWLF +D FW + A L+ VP K
Sbjct: 315 VSTENKLDELAAYGQVIYETLNEAAPGATWVMQGWLFGHDAYFWEKDAVIAFLSKVPNDK 374
Query: 177 LVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA----RTS 232
L++ D + +W FYG + + +HN+ G+ +YG D F E
Sbjct: 375 LIIQDFGNDRYKVWEKQDAFYGKQWTYGYVHNYGGSNPIYGDFD---FYKEEINYLLEHD 431
Query: 233 ENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRS-VPAIQDA 291
++T ++G G+ EG+ QN +VY+ + ++ + K+ VK W+ RYG+ A
Sbjct: 432 KSTKVLGYGVMPEGLHQNSMVYEYLYDLPWD-SKIPVKDWLKTNIKARYGKDFTKETLTA 490
Query: 292 WNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSS 351
W L VY+ P + G Y + +P + K
Sbjct: 491 WIKLDSAVYSTKYWT---------------PRWWNDQAGAYLLFKQPSKEITAFKG---- 531
Query: 352 YDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY 411
HP T+ + + N+ + + D I R L+ + L AY
Sbjct: 532 --HP-----TNLKLLEEANLLLEKNK-ENNPLIQEDFIAHKRHELSLKIDTLLQQATYAY 583
Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNART 471
ND L +F L++ + LL L W++ A E + Y+ NAR
Sbjct: 584 INNDFEKGDSLQLQFHTLIDSTEQLLENSKLDRLDYWVQEATNYGDTPETKAFYKKNARL 643
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGF------- 524
I W L +Y ++ W + Y R IY + + E G
Sbjct: 644 LINQWGG-----VGNLNNYASRAWKDQYQLLYKTRWDIYLGSLRVNSELGGELNQERIEQ 698
Query: 525 RLKDWRREWI 534
+K+W W+
Sbjct: 699 NIKEWDELWL 708
>gi|294627661|ref|ZP_06706243.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 11122]
gi|292598013|gb|EFF42168.1| N-acetylglucosaminidase [Xanthomonas fuscans subsp. aurantifolii
str. ICPB 11122]
Length = 798
Score = 208 bits (529), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 164/579 (28%), Positives = 257/579 (44%), Gaps = 88/579 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ LPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRASLPQHWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPRAFAQAHPHARIY 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 261 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 313
Query: 121 -------VDSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
V + +Y ++ G A+Y + + A W+MQ
Sbjct: 314 VADDGSDVAAAKYGDSVANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPKATWVMQ 373
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+ + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 374 GWLFGADREFWQAQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 433
Query: 209 FAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF ++A ++ + G G+ EG+ N V+Y+ + +A++
Sbjct: 434 YGASNPLYG---DFAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVIYEYLYALAWEG 490
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ Y RYGRS A+ AW L +Y + P
Sbjct: 491 PQQSWSQWLTHYLRARYGRSDAALLGAWADLEAGIYQTRYWS---------------PRW 535
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTY 384
+ G Y + +P + ++ + D L RA++ + N + + Y
Sbjct: 536 WNKRAGAYLLFKRPTAD--IVDFDDCPGDPQRL-------RRAIDALLQQANRYADAPLY 586
Query: 385 RYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGLLAC-HDG 442
RYDLI+ R L+ A+ +++AY D A G QL+R +LV +D L+ HD
Sbjct: 587 RYDLIEDARHYLSLQADRQLQAVVQAYNAGDFARGDAQLART-TQLVRGLDALVGGQHD- 644
Query: 443 FLLGPWL-ESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRD 501
L W ++A + + Y NAR Q+++W + L DY +K W G+ D
Sbjct: 645 -TLADWTGQAAAAAGHDAGLRRAYVGNARAQVSVWGGDGN-----LADYASKAWQGMYAD 698
Query: 502 YYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
+Y R + + ++G F +L W R+W
Sbjct: 699 FYLQRWTRFLSAYRAARKAGTPFDAVAVDHQLAAWERQW 737
>gi|325922205|ref|ZP_08183992.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
19865]
gi|325547324|gb|EGD18391.1| Alpha-N-acetylglucosaminidase (NAGLU) [Xanthomonas gardneri ATCC
19865]
Length = 807
Score = 207 bits (528), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 158/582 (27%), Positives = 251/582 (43%), Gaps = 90/582 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQ +IL R+ ELGM PVLPAF+G VP A P+A+I
Sbjct: 202 MGNIEGYRAPLPQQWIDSKRVLQTQILTRMRELGMQPVLPAFAGYVPKAFAQAHPNARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF ++ R F+E + YG Y D F+E PP
Sbjct: 262 RMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQTYG-AGEFYLADAFNEMLPP 314
Query: 121 VD-------SPEYISSL-----------------------GAAIYSGMQSGDSDAVWLMQ 150
V + +Y S+ G A+Y + + A W+MQ
Sbjct: 315 VADDGSDVAAAKYGDSIANSDAARAKAVPPAQRDARLAEYGQALYRSIAQVNPQATWVMQ 374
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW+P + A L VP +L+VLD+ + P W S+ F +I+ +HN
Sbjct: 375 GWLFGADREFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQWIYGYVHN 434
Query: 209 FAGNIEMYGILDSIAFGPVEART----SENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQH 264
+ + +YG AF + + + + G G+ EG+ N VVY+ + +A++
Sbjct: 435 YGASNPLYG---DFAFYRQDLQALLADPDKRNLRGFGVFPEGLHSNSVVYEYLYALAWEG 491
Query: 265 EKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSI 324
+ W+ QY+ RYG S A+ AW+ D+D I
Sbjct: 492 PQQSWSQWLTQYTRARYGHSDAALLQAWS-------------------------DLDAGI 526
Query: 325 ISVTEGKYQNYGKPVSKEAVLKSETSSY----DHPHLWYSTSEVIRALELFIASGNELSA 380
+ + K + K T+ D P + RA++ + + +
Sbjct: 527 YQTRYWSLRWWNKRAGAYLLFKRPTADIVGFDDRPG---DPQRLRRAIDALLQQADRYAD 583
Query: 381 SNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACH 440
+ YRYDLI+ R L+ +A+ +++AY D L R LV+ +D L+
Sbjct: 584 APLYRYDLIEDARHYLSLHADRQLQAVVQAYGTGDFARGDALLARTTRLVQGLDALVGGQ 643
Query: 441 DGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLR 500
L ++A + + Y NAR Q+++W + L DY +K W G+
Sbjct: 644 HETLADWTDQAAAAAGDDAALRRVYVGNARAQVSVWGGDGN-----LADYASKAWQGMYA 698
Query: 501 DYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREWIK 535
++Y R + + ++G F +L W R+W +
Sbjct: 699 EFYLQRWTRFLSAYRAARKAGTPFDEAAFNKQLAAWERQWAE 740
>gi|291302495|ref|YP_003513773.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
gi|290571715|gb|ADD44680.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
Length = 696
Score = 207 bits (528), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 160/574 (27%), Positives = 264/574 (45%), Gaps = 62/574 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M + G+GG + + ++++ L ++I R+ ELG+ PVLP F+G VP + + +A I
Sbjct: 178 MGCMCGFGG-VSRRLVEERAELGRRITDRMRELGIEPVLPGFAGLVPGDIGD---TAAIP 233
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
Q G WF P W T T + E+ F +Q + G T D E +
Sbjct: 234 Q-GQWFGFDR-PAWLPT-----TTRAYAEVAEVFYAKQTERLGAT-RAQAVDLLHEGGTS 285
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
VD + + AA M+ D +W++Q W W P + L +
Sbjct: 286 GGVDLADATRGIAAA----MERAHDDYLWVLQAW-------WDNPLPEVLAAT----DSD 330
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
L L W +K ++G P+ L NF G ++G L IA P + +++V
Sbjct: 331 HLLLLDLTGEGWRKTKGWHGKPWARGSLTNFGGRTVLFGGLPEIAELPSLKDDPKASSLV 390
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G + E + NPVV+ L ++ ++ +D+ AW+ +Y RYG++ P AW+ L T
Sbjct: 391 GTALVEEAWQVNPVVWSLFTQTSWADGDIDLNAWVPEYVAARYGKAHPRAVRAWHGLLAT 450
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH-L 357
Y DG ++ A P +D ++ +S + PH L
Sbjct: 451 AYRSMDGRPGGAESLLCAMPSLD-------------------------ADRASMNGPHSL 485
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y + A +A+ L ++T+R+DL+D+TRQ ++ A L + AY + +
Sbjct: 486 PYPAEALEVAWRDLLAAREALGGADTFRFDLVDVTRQVISNRARPLLPLLRTAYAMKELD 545
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
LS F++L E +D +LA + FL+G WL A+ LA +E++ E++ART IT W
Sbjct: 546 RFIALSHSFIDLFELLDPVLATREEFLVGRWLADARALAADEDEADALEFDARTIITTWG 605
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
D+ + A+L+ DY N W+GL+ DYY PR Y K + L G D+ +
Sbjct: 606 DSPESSATLI-DYANHEWAGLIADYYRPRWEKYLKSLETELREGKPAEPIDFYAD----A 660
Query: 538 NDWQNGRNVYPVESNGDALITSQWLYNK--YLQG 569
W + YP E +GDA+ + + +++ Y +G
Sbjct: 661 AAWARSHDTYPTEPSGDAVSSCRAVHHALPYFEG 694
>gi|453051703|gb|EME99203.1| alpha-N-acetylglucosaminidase [Streptomyces mobaraensis NBRC 13819
= DSM 40847]
Length = 763
Score = 207 bits (526), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 144/563 (25%), Positives = 247/563 (43%), Gaps = 48/563 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGP+ ++ LD++ L ++I R+ ELG+ PVLP ++G VP A+
Sbjct: 239 MQNMSAFGGPVSRALLDRRTALAQRITRRLRELGITPVLPGYAGTVPPDFTRRNKGARTV 298
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W P W LD F + R + Q + YG S +Y D E P
Sbjct: 299 PQGDWAGFPR-PDW-----LDPRTAHFARVARTYYRVQRELYG-ASSMYKIDLLHEGGTP 351
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P + + A+ +++ DA W + GW + + +L++V K++VL
Sbjct: 352 --GPVPVGAAAKAVEKALRAAHPDATWAILGWQTN--------PRREILDAVDRSKMLVL 401
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D + P + K + G PY + + NF G+ M RT + + + G
Sbjct: 402 DGIPDHYPRVTDREKDWGGTPYAFGTIWNFGGHTAMGANTQDWVSLFHRWRTKKGSALRG 461
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ + E + NP L S++A+ ++D+K W ++ V+RYG + P + AW+VL T
Sbjct: 462 IALMPEAADNNPAALALFSDLAWTEGRLDLKDWFARWPVQRYGAADPNARRAWDVLRRTA 521
Query: 300 YNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y T DG ++ + A PD+ AV + +++ L
Sbjct: 522 YGTTRADGWSEAADGLFGARPDL----------------------AV--NRAAAWSPRQL 557
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y + AL +A L S+ YR DL D+ RQ ++ + L I AY D
Sbjct: 558 RYDAAAFDEALPALLAVAPALRGSSAYRCDLTDVARQCVSNRSRLLLPRIKAAYDAGDRT 617
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
L+R++L+ + ++ +A + LLG W+ A+ + + E +A + +T+W
Sbjct: 618 RFRTLTRQWLDWMTLLEETVATSERHLLGRWIAEARAWGGTAAERDRLEHDAVSLLTVWG 677
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
+ L DY N+ W+GL+ Y R YF + +L + R K +W L
Sbjct: 678 PRASADGGKLHDYANREWAGLVGGLYRLRWKTYFTELEAALTA----RRKPKPIDWYALE 733
Query: 538 NDWQNGRNVYPVESNGDALITSQ 560
+ W R YP + +GD + ++
Sbjct: 734 DRWTRKRPAYPAKPSGDIVAVAR 756
>gi|365876979|ref|ZP_09416485.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
gi|442587289|ref|ZP_21006107.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
anophelis R26]
gi|365755253|gb|EHM97186.1| alpha-N-acetylglucosaminidase [Elizabethkingia anophelis Ag1]
gi|442562959|gb|ELR80176.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Elizabethkingia
anophelis R26]
Length = 712
Score = 205 bits (521), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 151/551 (27%), Positives = 244/551 (44%), Gaps = 54/551 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL GWGGP+ + QQ LQKKIL R+ ELG+ PVL F G VP L+N AK+
Sbjct: 193 MGNLEGWGGPVSMDMMKQQAELQKKILKRMKELGIEPVLQGFYGMVPHDLKNKISEAKVI 252
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ G W P +LD T LF +I + + YG H + + F E T
Sbjct: 253 EQGKWAGEFQRPG-----ILDPTTKLFSKIADTYYTEMKNLYGEDIHYFGGEPFHEGGKT 307
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
+D + S I + MQ ++ W++QG W+ LL + +
Sbjct: 308 NGLDLKNVVES----IQTSMQKSYPNSTWVLQG--------WQQNPSDGLLAGLKKENTL 355
Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTS-ENTTM 237
+++LF E W K + G +IW + NF +YG L A+ S +
Sbjct: 356 IIELFGENTANWEKRKGYGGTSFIWSNVSNFGEKNGLYGKLQRFIDEVFRAKESIYGANL 415
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+G+ EGI NPV YDLM ++A+ EK + W+ +Y+ RYG+ + AW
Sbjct: 416 KGIGIIPEGIFNNPVAYDLMLDIAWYSEKPILDQWLTEYTKYRYGKENQDVIQAWKEFAQ 475
Query: 298 TVYNCTDGATD-KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
T+Y+ D + + + A P ++ + +S + +NY + KEAV
Sbjct: 476 TIYSSPDVYQEGPSESIYCARPSLNVNPVSSWGTRKRNYDQSRFKEAV------------ 523
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
++F+ + + S TY+ D D RQ A + ++ +I+A
Sbjct: 524 ------------KVFVKADTDFKDSETYQTDKTDFLRQVWANKGDVVYDELIKAIHEKKT 571
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
+ + +FLE++ + LL + F L L+ A+ + + +NA++Q+T W
Sbjct: 572 TKIQKSGHQFLEMISIQNMLLGNNRYFTLNRLLKEAEHFGEKLPDAQNVMFNAKSQLTYW 631
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+ + LRDY +K W+GLL Y R +K IE +SG + + +
Sbjct: 632 GPDNNPKTD-LRDYAHKEWNGLLSSLYYNR----WKVFIEQAQSG----IITAPEVFYNM 682
Query: 537 TNDWQNGRNVY 547
+W G+N+Y
Sbjct: 683 EVEWSKGKNMY 693
>gi|429198382|ref|ZP_19190217.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
gi|428665917|gb|EKX65105.1| alpha-N-acetylglucosaminidase (NAGLU) [Streptomyces ipomoeae 91-03]
Length = 747
Score = 202 bits (513), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 143/555 (25%), Positives = 252/555 (45%), Gaps = 44/555 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ + LD + L ++I+ R+ ELGM PV P + G VP P A+
Sbjct: 220 LQNLSSFPSPVSRQLLDARAALGRRIVGRLRELGMTPVFPGYFGTVPPGFAERNPGARTV 279
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G+W + P W LD F + AF Q + +G S +Y D E P
Sbjct: 280 PQGDWMGF-ARPDW-----LDPRTNEFKRVAAAFYRAQDELFGGPSTLYKMDLLHEGGDP 333
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ + +++ DA W++ GW + PP +A++++V +++V+
Sbjct: 334 GDVP--VADAAKGVERALRAAHPDATWVILGWQHN------PP--RAIVDAVDKKRMLVV 383
Query: 181 DLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D ++ P + +G PY + + NF G+ + A + RT + + + G
Sbjct: 384 DGLSDRFPTVIDREADWGDTPYAFGSIWNFGGHTALGANTPVWAELYEKWRTKDGSKLRG 443
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ + E + NP + L SE+A++ +++D+K W ++++ RYG P + AW++L T
Sbjct: 444 IALMPEAADNNPAAFALFSELAWRKDELDLKTWFSEWAHARYGARDPHAEAAWDILRRTA 503
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y G T +R +EG +G S+ A+ + + L Y
Sbjct: 504 Y----GTTRADR---------------WSEGADGLFG---SRPALNTVRAARWSPKQLRY 541
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+E AL ++ L +S+ YR DL+D+ RQ L+ + L I AY D
Sbjct: 542 DAAEFEPALGELLSVRPGLRSSSAYRRDLLDVARQTLSNRSRVLLPRIRGAYDARDTARF 601
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+L+ +L L++ +D LLA LLG W+ A+ ++ + ++ ++ + +T+W
Sbjct: 602 DELTGTWLSLMDLLDRLLATDSAHLLGRWVADARAWGASDAERERLAYDNLSLLTVWGTR 661
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
+A LRDY N+ W+GL+ Y R + YF+ + +L G + D W L +
Sbjct: 662 KGADAG-LRDYANREWAGLVGGLYRLRWSTYFEELRAALREGRTPKKID----WFALEDR 716
Query: 540 WQNGRNVYPVESNGD 554
W E GD
Sbjct: 717 WTRAPGRLATEPTGD 731
>gi|290956360|ref|YP_003487542.1| alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
gi|260645886|emb|CBG68977.1| putative alpha-N-acetylglucosaminidase [Streptomyces scabiei 87.22]
Length = 732
Score = 201 bits (511), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 152/556 (27%), Positives = 250/556 (44%), Gaps = 47/556 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL G+ P+ + LD + VL ++I R ELGM PV P + G VPA P A+
Sbjct: 206 LQNLSGFPSPVSRQLLDARAVLGRRIADRARELGMIPVFPGYFGTVPAGFAERVPGARTV 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD F + AF Q + +G S +Y D E P
Sbjct: 266 PQGRWMGF-ARPDW-----LDPRTDEFARVAAAFYRTQDEMFG-PSALYKMDLLHEGGDP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ + +Q A W+M GW + PP +A++++V ++V+
Sbjct: 319 GDVP--VADAAKGVERALQRAHPGATWVMLGWQHN------PP--RAIVDAVDKQHMLVV 368
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D ++ P + + G PY + + NF G+ + A + RT + +T+ G
Sbjct: 369 DGLSDRFPTVTDREADWGGTPYAFGSIWNFGGHTALGANTPDWAALYEKWRTKDGSTLHG 428
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ + E + NP + L SE+A++ ++D++ W +++ RYG P + AW++L T
Sbjct: 429 IALMPEAADNNPAAFALFSELAWREGELDLETWFAEWAHARYGARDPHAEAAWDILRRTA 488
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T S +EG +G S+ A+ + L Y
Sbjct: 489 YGTT-------------------RADSWSEGADGLFG---SRPALTAVRAGRWSPKQLRY 526
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ ++ AL + EL AS+ YR DL+D+ RQAL+ + + + AY DA +
Sbjct: 527 NAADFEPALGEMLKVRPELRASSAYRRDLLDVARQALSNRSRVMLPQLKAAYDAKDAARL 586
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ SR +L L++ +D L+A LLG W+ A+ A + + ++A + +T+W
Sbjct: 587 AKGSRDWLSLMDLLDELVATDSRHLLGRWVADARSWAVGSTERTELAYDALSLLTVW--G 644
Query: 480 TQEEASL-LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
T+E A LRDY N+ W+GL+ Y R A YF+ + +L G + D W L +
Sbjct: 645 TREGADAGLRDYANREWAGLVGGLYRLRWATYFEELRAALAEGRAPKKID----WFALED 700
Query: 539 DWQNGRNVYPVESNGD 554
W E GD
Sbjct: 701 RWARNPGTLATEPAGD 716
>gi|329934959|ref|ZP_08285000.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
gi|329305781|gb|EGG49637.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
Length = 1017
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 143/563 (25%), Positives = 238/563 (42%), Gaps = 50/563 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPS-AKI 59
+ NL+G+GGPL + ++ L ++I R+ LGM PVLP + G+VP + A +
Sbjct: 191 LQNLYGYGGPLSAELIARRAALGRRIADRLRALGMRPVLPGYYGHVPKDFADRRGGDAHV 250
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W P W LD F E+ +F Q +G + D E
Sbjct: 251 VPQGTWHGFDR-PSW-----LDPRTDAFAEVAASFYRHQEDVFGPAGD-FKMDLLHEGGT 303
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P ++ G + +++ A W++ GW + + LL++V ++++
Sbjct: 304 AGDVPVPDAARG--VEKALRAARPGATWVILGWEAN--------PLPELLDAVDKKRMLI 353
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTM 237
+D ++ + + + G PY + + NF G + G I A R + +
Sbjct: 354 VDGVSDRYTSVTDREEDWGGTPYAFGTIPNFGGRTTI-GARTHIWREKFFAWRDKPGSAL 412
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G E +++P ++L SE+A+ E VD W Y+ RYG + AW L+
Sbjct: 413 AGTAYLPEAADRDPAAFELFSELAWTDEPVDRARWFTGYADFRYGGRDAGARRAWRALHD 472
Query: 298 TVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
T Y + +D + + A PD+ + + Y
Sbjct: 473 TAYQQHANERSDPHDSLFCARPDL------------------------AATRAARYAPAA 508
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
L Y + AL +A YRYDL+D+ RQALA + + + A+ DA
Sbjct: 509 LTYDPARFDAALSGLLAVAAHRRGGAAYRYDLVDVARQALAHRSRQYLPQLKAAFDREDA 568
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
L+ ++L L+ + + H FLLGPW+E A+++A N + ++E A+ +T+W
Sbjct: 569 ATFKALATQWLTLMRLSEDITGTHPAFLLGPWIEDARRMATNPRERAEFERTAKALVTVW 628
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
D +A L +YGN+ W GLL D+Y PR + ++L +G D W
Sbjct: 629 GDRATSDAGNLHEYGNREWHGLLSDFYLPRWQKWLDACEDALATGTAPAAVD----WFAF 684
Query: 537 TNDWQNGRNVYPVESNGDALITS 559
W R YP+ GDA T+
Sbjct: 685 EEPWTRERKDYPLRPVGDAYRTA 707
>gi|429201402|ref|ZP_19192867.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
91-03]
gi|428663010|gb|EKX62401.1| Tat pathway signal sequence domain protein [Streptomyces ipomoeae
91-03]
Length = 1042
Score = 199 bits (506), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 144/559 (25%), Positives = 239/559 (42%), Gaps = 50/559 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
+ NL G+GGPL +D++ L ++I R+ ELGM+PVLP + G+VP ++ A +
Sbjct: 212 LQNLSGYGGPLSPELIDRRAALGRRIADRLRELGMSPVLPGYYGHVPKEFVERNGGDAHV 271
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W + P W LD F ++ +F Q +G +H + D E
Sbjct: 272 VPQGVWHGFER-PDW-----LDPRTDSFAKVAASFYGHQEDVFGEAAH-FKMDLLHEGGT 324
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P + + +Q A W++ GW +P + LL+++ ++++
Sbjct: 325 AGDVP--VPGAAQGVERALQKARPGATWVILGW--QENP------LPELLDAIDKSRMLI 374
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTTM 237
+D ++ + + + G PY + + NF G + G I A R N+ +
Sbjct: 375 VDGVSDRYTSVTDRERDWGGTPYCFGTIPNFGGRTTI-GARAHIWNEKFFAWRDKANSAL 433
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G E +++P ++L SE+A+ K+D AW + Y+ RYG + + AW L+
Sbjct: 434 AGTAFMPEATDRDPAAFELFSELAWTPTKIDRAAWFSAYADYRYGARDDSARRAWRALHD 493
Query: 298 TVYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
T Y +D + + A PD+ + Y
Sbjct: 494 TAYQQRAVERSDPHDSLFCARPDL------------------------AADRAAEYAPRA 529
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
L Y AL + L S Y+YD++D+ RQALA + + + AYQ D
Sbjct: 530 LTYDPGRFDAALAGLLGVAGGLRGSAAYKYDVVDVARQALAHRSRQYLPQLRAAYQRKDL 589
Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
LS +L L+ D + + FLLGPW+ A+ LA N+ + ++E A+ IT+W
Sbjct: 590 ATFRALSTLWLRLMRLSDEVTGANSAFLLGPWVNDARLLATNDAERAEFERTAKVLITVW 649
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+A L +YGN+ W GL+ D+Y PR + + ++L +G D W
Sbjct: 650 GGRATSDAGDLHEYGNREWHGLMADFYVPRWEKWLDTLEDALATGTAPAAVD----WFAF 705
Query: 537 TNDWQNGRNVYPVESNGDA 555
W R Y + GDA
Sbjct: 706 EEPWTRERKDYALRPVGDA 724
>gi|16124795|ref|NP_419359.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
gi|221233511|ref|YP_002515947.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
gi|13421729|gb|AAK22527.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus CB15]
gi|220962683|gb|ACL94039.1| alpha-N-acetylglucosaminidase [Caulobacter crescentus NA1000]
Length = 770
Score = 199 bits (505), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 146/540 (27%), Positives = 230/540 (42%), Gaps = 71/540 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLP +W+D++ LQ KIL R+ LGM P+LPAF G VP A P A+I
Sbjct: 202 MGNIEGYKAPLPTAWIDKKKDLQVKILGRMRSLGMTPILPAFGGYVPKAFAEKNPKARIY 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD DPLF +I F+ + +G ++ Y D+F+E PP
Sbjct: 262 RMRPWEGFHE------TYWLDPADPLFAKIAARFLALYTETFGAGTY-YLADSFNEMLPP 314
Query: 121 VD-------------------------------SPEYISSLGAAIYSGMQSGDSDAVWLM 149
++ + +++ G AIY ++ DAVW+M
Sbjct: 315 INADGADARDAAYGDGTANTAVTKTKVEVDPALKAQRLAAYGKAIYDSIRQTRPDAVWVM 374
Query: 150 QGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLH 207
QGWLF D FW P + A L+ VP KL++LD+ + P +W +K F G P+I+ +H
Sbjct: 375 QGWLFGADSHFWDPAAISAYLSLVPDDKLMILDIGNDRYPNVWKNAKAFGGKPWIYGYVH 434
Query: 208 NFAGNIEMYGILDSIAFG-PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEK 266
N+ G+ +YG L P A + + G GM EG+ N +VY+ + ++A+ +
Sbjct: 435 NYGGSNPVYGDLGFYRQDIPAIAANPDAGKLAGFGMFPEGLHNNSIVYEAVYDLAWSEGQ 494
Query: 267 VDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIIS 326
W+ +Y+ RYG++ PA+ A L ++ + P
Sbjct: 495 ASPATWLTRYARARYGKTSPALDAALGQLVEAAFSTRYWS---------------PRWWK 539
Query: 327 VTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRY 386
G Y + +P + D P +++ A++ A +
Sbjct: 540 SKAGAYLFFKRPTATVG---------DFPQHPGDRAKLEAAVKALTALAPTYGQEPLFVL 590
Query: 387 DLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLG 446
DL D TR ++L + AY+ D L +D LL L
Sbjct: 591 DLTDATRHLATMKIDDLLQVAVAAYRRGDTAAGDAARVEIEALALSIDKLLGVQPD-TLA 649
Query: 447 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 506
W++ A+ Y NA+ Q+T+W L DY +K W GL + +Y PR
Sbjct: 650 TWIDEARAYGDTPADAAAYVANAKAQVTIWGGEGN-----LNDYASKAWQGLYKSFYLPR 704
>gi|291301158|ref|YP_003512436.1| alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
gi|290570378|gb|ADD43343.1| Alpha-N-acetylglucosaminidase [Stackebrandtia nassauensis DSM
44728]
Length = 734
Score = 198 bits (504), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 144/555 (25%), Positives = 248/555 (44%), Gaps = 48/555 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL + GP+ Q LD + L ++I R+ ELG+ PVLP + G +P A+
Sbjct: 209 MQNLSAFPGPISQHLLDSRAELARRIRTRMAELGIRPVLPGYFGTIPGGFAKRNQQARTV 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W+ S P W LD T F ++ +F Q + G + +Y D E P
Sbjct: 269 PQGVWYGF-SRPDW-----LDPTGNEFAKVAASFYRHQAQLLGE-ADMYKMDLMHEGGDP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
P ++ G A+ +Q A W+M GW R +L + +++++
Sbjct: 322 GGIPIPDAAKGVAL--ALQRARPGATWVMLGW--------RKNPRTDILTDIDTSRVLIV 371
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D ++ + G PY + + NF G+ + A + RT+ ++ + G+
Sbjct: 372 DGISDRFDDLDREHTWPGTPYAFGTIPNFGGHTTIGANAKVWAKRFGQWRTAPDSAVSGI 431
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
EG ++P ++L +E+A++ + +D+ W Y+ RRYG + + AW+ L + Y
Sbjct: 432 AWMPEGAGRDPAAFELFAELAWR-DSIDLGEWFADYADRRYGGADDNARTAWDALRRSAY 490
Query: 301 NCTDGATDKNRDVIV-AFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
G + D + A P +D VT Y + L Y
Sbjct: 491 AMPSGRWAEAADGLFGARPGLD-----VTHADY-------------------FSPEFLRY 526
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
+ +AL + L ++ YR+DL+D+ RQ+L EL + A+ +
Sbjct: 527 DAAVFAQALPALLDVDKSLH-NDAYRFDLVDVARQSLVNAGRELLPRVKSAFVNQNKKQF 585
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+ +R +L+ + +D LL FLLGPWLE+A++ A+ ++ K E++ART +++W
Sbjct: 586 DKHTRTWLDWMRLLDRLLETDRRFLLGPWLEAARRSARTADEAKDLEYDARTIVSVWGHR 645
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
+ + L DY N+ +GL+ D Y R YF + ESL+SG + D W L ++
Sbjct: 646 SGSDEGRLHDYANRELAGLVSDLYAMRWRRYFDSLAESLDSGQAPQHID----WFALEHE 701
Query: 540 WQNGRNVYPVESNGD 554
W + + + E GD
Sbjct: 702 WASKTDDHATEPKGD 716
>gi|297194750|ref|ZP_06912148.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
25486]
gi|297152431|gb|EFH31740.1| alpha-N-acetylglucosaminidase [Streptomyces pristinaespiralis ATCC
25486]
Length = 816
Score = 198 bits (503), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 148/551 (26%), Positives = 245/551 (44%), Gaps = 55/551 (9%)
Query: 16 LDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPS--AKITQLGNWFSVKSDPR 73
++++ L ++I R+ ELGM+PVLP + G VP P A++ G W P
Sbjct: 6 IERRTELGRRITDRLRELGMHPVLPGYFGTVPDDFPGHNPGSDARVIPQGTWGGGMRRPD 65
Query: 74 WCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAA 133
W LD F ++ AF Q + +G SH + D E D P + A
Sbjct: 66 W-----LDPRTQAFSDVAAAFYRHQGELFGDVSH-FKMDLLHEGGTAGDVP--VPDAARA 117
Query: 134 IYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTS 193
+ + +Q+ A W++ GW + P +L+S+ +++++D +++ +
Sbjct: 118 VETSLQTARPGATWVILGWQSNPRPV--------MLDSIDTSRVLIVDGLSDLDTVTDRE 169
Query: 194 KQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVV 253
+ G PY + + NF G + D R + +VG E E++P
Sbjct: 170 ADWGGAPYAFGTIPNFGGRTTIGANTDRWTEKFTAWRDKPGSALVGTAYMPEAAERDPAA 229
Query: 254 YDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDV 313
+L SE+A++ EK+D +AW +Y+ RYG + ++A+ L T Y T
Sbjct: 230 LELFSELAWREEKIDREAWFAEYAQIRYGGVDHSAREAFAALAATAYKLTS--------- 280
Query: 314 IVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIA 373
T+G+ Y S+ L + + P + RA +A
Sbjct: 281 --------------TDGR--PYDSLFSRRPSLTTAIGTAFDP------AGFDRAFAALLA 318
Query: 374 SGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDM 433
L S+ YR+DL D+ RQALA + L L + AY+ D +S +L+++
Sbjct: 319 VRAPLRDSDAYRHDLTDVARQALANRSRTLQLALRAAYRNKDVATFRAVSALWLKVMRLS 378
Query: 434 DGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNK 493
D + CH FLLGPWLE AK+LA + E+ Q E ART IT W D + A+ L +Y N+
Sbjct: 379 DTMAGCHRQFLLGPWLEDAKRLATSPEEAVQLERTARTLITTWAD--RPTANSLSNYANR 436
Query: 494 YWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNG 553
W GL+ D + P+ + +++ +G + DW + W R+ YPV G
Sbjct: 437 DWQGLMADVHVPQWEAFLTEQADAMAAGRAPKSFDWYPQ----EEAWTQERHTYPVRPTG 492
Query: 554 DALITSQWLYN 564
DA T+ +++
Sbjct: 493 DAYSTALRVFD 503
>gi|62318937|dbj|BAD94027.1| alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
Length = 182
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 94/181 (51%), Positives = 128/181 (70%), Gaps = 1/181 (0%)
Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
++DLTRQ L+K AN+++ + A+ D + QLS +FLEL++DMD LLA D LLG
Sbjct: 1 MVDLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGT 60
Query: 448 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 507
WLESAK+LA+N ++ KQYEWNARTQ+TMW+D+ S L DY NK+WSGLL DYY PRA
Sbjct: 61 WLESAKKLAKNGDERKQYEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRA 120
Query: 508 AIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKY 566
+YF M++SL F+++ WRREWI +++ WQ + VYPV++ GDAL S+ L +KY
Sbjct: 121 RLYFNEMLKSLRDKKIFKVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKY 180
Query: 567 L 567
Sbjct: 181 F 181
>gi|29828556|ref|NP_823190.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
gi|29605660|dbj|BAC69725.1| putative alpha-N-acetylglucosaminidase [Streptomyces avermitilis
MA-4680]
Length = 728
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 142/555 (25%), Positives = 240/555 (43%), Gaps = 44/555 (7%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ Q LD + L ++I R+ ELGM PV P + G VP + A
Sbjct: 201 LQNLSAFPDPVSQQLLDARAALGRRIANRLRELGMTPVFPGYFGTVPPGFADRNAGAHTV 260
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD F + AF Q + +G S Y D E P
Sbjct: 261 PQGTWMGF-ARPDW-----LDPRTEHFTRVAAAFYRIQDEMFGGASTRYKMDLLHEGGSP 314
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + + +++ AVW++ GW + PP +A++++V +++V+
Sbjct: 315 GDVP--VGDAAKGVERALRAAHPGAVWVILGWQHN------PP--RAIVDAVDKDRMLVV 364
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D + P + ++G PY + + NF G+ + A RT +T+ G
Sbjct: 365 DGLCDRFPKVTDREADWHGTPYAFGSIWNFGGHTTLGANTPDWASLYERWRTRPGSTLRG 424
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
V + E + NP + L SE+A++ +D++AW +++ RYG P + AW++L T
Sbjct: 425 VALLPEAADNNPAAFALFSELAWREGDLDLRAWFARWARSRYGGRDPHAEAAWDILRRTA 484
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T S +EG +G ++ ++ ++ +S+ L Y
Sbjct: 485 YGTT-------------------RADSWSEGADGLFG---ARPSLAATKAASWSPKRLRY 522
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E AL + L S+ YR DL+D+ RQAL+ + L I AY+ D
Sbjct: 523 RPEEFEPALGELLKVRPGLRGSSAYRRDLLDVARQALSNRSRVLLPQIRTAYEAKDTARF 582
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
+L+ +L L++ ++ LLA LLG W+ A+ + + + ++A + +T+W
Sbjct: 583 DRLTGVWLALMDLLEALLATDSRHLLGRWVADARAWGASAAERDRLAYDALSLLTVWGTR 642
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 539
+A LRDY N+ W+GL+ Y R + YF + + G + D W L +
Sbjct: 643 AGADAG-LRDYANREWAGLVGGLYRLRWSTYFAELRSASREGRTPKKTD----WFALEDR 697
Query: 540 WQNGRNVYPVESNGD 554
W GD
Sbjct: 698 WTRNPGGLATRPTGD 712
>gi|333023613|ref|ZP_08451677.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
gi|332743465|gb|EGJ73906.1| putative alpha-N-acetylglucosaminidase [Streptomyces sp. Tu6071]
Length = 741
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 150/558 (26%), Positives = 253/558 (45%), Gaps = 53/558 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ ++Q+ L +I+ R+ ELGM+PVLP + G VPA + P AK
Sbjct: 215 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 274
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD LF E+ AF E Q + YGR + +Y D E
Sbjct: 275 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P ++ G + +++ DAVW++ GW + PP K ++ + ++V+
Sbjct: 328 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 377
Query: 181 D----LFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYG-ILDSIAFGPVEARTSENT 235
D F+EV + G PY + + NF G+ + D + P R +
Sbjct: 378 DGLSDRFSEVN---DRESDWQGTPYAFGSIWNFGGHTALGANARDWVDLYP-RWRDRSGS 433
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
+ G+ + E + NP ++L +E+ + VD+ W +Y+ RYG S + AW++L
Sbjct: 434 RLSGIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDIL 493
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
T Y D+ + P++ +V+ GK+ L+ +S++
Sbjct: 494 RTTAYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP- 542
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
AL+ +A EL S TYR DL+D+ RQALA + L + AYQ +
Sbjct: 543 -----------ALDELLAVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYQAKN 591
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+L RR++ L++ ++ L+A + LLG W+ESA+ + ++ Q +++A + +T
Sbjct: 592 QAEFARLGRRWIALMDLLEQLVATDENHLLGRWVESARAWGGSAREKSQLQYDALSLLTT 651
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W +A LRDY N+ WSGL+ Y R + Y + +L+ G K +W
Sbjct: 652 WGTRQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELSAALKEG----RKPVAVDWFA 706
Query: 536 LTNDWQNGRNVYPVESNG 553
L + W + G
Sbjct: 707 LEDRWTRNPGALATQPRG 724
>gi|318057780|ref|ZP_07976503.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actG]
Length = 741
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 146/555 (26%), Positives = 253/555 (45%), Gaps = 47/555 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ ++Q+ L +I+ R+ ELGM+PVLP + G VPA + P AK
Sbjct: 215 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 274
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD LF E+ AF E Q + YGR + +Y D E
Sbjct: 275 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 327
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P ++ G + +++ DAVW++ GW + PP K ++ + ++V+
Sbjct: 328 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 377
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENTTMV 238
D ++ P + + G PY + + NF G+ + D + P R + +
Sbjct: 378 DGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALGANTRDWVDLYP-RWRDRSGSRLS 436
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+ + E + NP ++L +E+ + VD+ W +Y+ RYG S + AW++L T
Sbjct: 437 GIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILRTT 496
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y D+ + P++ +V+ GK+ L+ +S++
Sbjct: 497 AYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP---- 542
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
AL+ ++ EL S TYR DL+D+ RQALA + L + AY+ +
Sbjct: 543 --------ALDELLSVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQAE 594
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+L RR++ L++ ++ L+A + LLG W+ESA+ + ++ Q +++A + +T W
Sbjct: 595 FARLGRRWIALIDLLEQLVATDENHLLGRWVESARAWGGSAREKNQLQYDALSLLTTWGT 654
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
+A LRDY N+ WSGL+ Y R + Y + +L+ G K +W L +
Sbjct: 655 RQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELSAALKEGR----KPVAVDWFALED 709
Query: 539 DWQNGRNVYPVESNG 553
W + G
Sbjct: 710 RWTRNPGTLATQPRG 724
>gi|318078904|ref|ZP_07986236.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SA3_actF]
Length = 719
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 146/555 (26%), Positives = 253/555 (45%), Gaps = 47/555 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ ++Q+ L +I+ R+ ELGM+PVLP + G VPA + P AK
Sbjct: 193 LQNLSSFPEPVTARLIEQRAALGARIVGRLRELGMSPVLPGYFGTVPAGFADRNPGAKTV 252
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD LF E+ AF E Q + YGR + +Y D E
Sbjct: 253 PQGKWMGF-ARPDW-----LDPRTDLFAEVAAAFYEIQEELYGRGT-LYKMDLLHEGGSA 305
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P ++ G + +++ DAVW++ GW + PP K ++ + ++V+
Sbjct: 306 GNVPVGDATRG--VQRALRAARPDAVWVILGWQKN------PP--KEVVAAADREAMLVV 355
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENTTMV 238
D ++ P + + G PY + + NF G+ + D + P R + +
Sbjct: 356 DGLSDRFPEVNDRESDWQGTPYAFGSIWNFGGHTALGANTRDWVDLYP-RWRDRSGSRLS 414
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G+ + E + NP ++L +E+ + VD+ W +Y+ RYG S + AW++L T
Sbjct: 415 GIALMPEAADNNPAAFELFAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILRTT 474
Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358
Y D+ + P++ +V+ GK+ L+ +S++
Sbjct: 475 AYGTRR--DDRWSEPADGLFGARPALDAVSAGKW--------SPKALRYPAASFEP---- 520
Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418
AL+ ++ EL S TYR DL+D+ RQALA + L + AY+ +
Sbjct: 521 --------ALDELLSVRAELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQAE 572
Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD 478
+L RR++ L++ ++ L+A + LLG W+ESA+ + ++ Q +++A + +T W
Sbjct: 573 FARLGRRWIALIDLLEQLVATDENHLLGRWVESARAWGGSAREKNQLQYDALSLLTTWGT 632
Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
+A LRDY N+ WSGL+ Y R + Y + +L+ G K +W L +
Sbjct: 633 RQGADAG-LRDYANREWSGLVGGLYRLRWSTYIDELSAALKEG----RKPVAVDWFALED 687
Query: 539 DWQNGRNVYPVESNG 553
W + G
Sbjct: 688 RWTRNPGTLATQPRG 702
>gi|383643231|ref|ZP_09955637.1| N-acetylglucosaminidase [Sphingomonas elodea ATCC 31461]
Length = 778
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 150/594 (25%), Positives = 241/594 (40%), Gaps = 82/594 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PL +W++++ VLQ++IL R+ LGM P+LPAFSG VP A P AKI
Sbjct: 209 MGNIAGYRAPLSANWIEKKRVLQRQILARMRSLGMKPILPAFSGYVPEAFAKAHPEAKIY 268
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+ W TY LD +DPLF + F++ YG Y D F+E PP
Sbjct: 269 QMRQWEGFPG------TYWLDPSDPLFARLAARFLQLYTATYG-PGEYYLADAFNEMVPP 321
Query: 121 VDS-------------------------PEYI-----SSLGAAIYSGMQSGDSDAVWLMQ 150
+ P+ + ++ G +Y + + +A W+MQ
Sbjct: 322 IAEDGSDARAATYGDAIANTAATRAAALPKEVRDARLAAYGERLYRSITAAAPNATWVMQ 381
Query: 151 GWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D FW P + A L+ VP ++++LD+ + P IW+ ++ FYG + + +HN
Sbjct: 382 GWLFGADKAFWTPDAIAAFLSKVPDERMLILDIGNDRYPGIWNATRAFYGKGWAYGYVHN 441
Query: 209 FAGNIEMYGILDSIAFGPVEARTSE-NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKV 267
+ G+ +YG L A + + M G G+ EG+ N + Y ++A+
Sbjct: 442 YGGSNPVYGDLAFYRSDITAALANPGHGRMRGFGLFPEGLHSNGIAYAYAYDLAWGEIDA 501
Query: 268 DVK-----AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDP 322
K AWI Y+ RYG++ PA+ AW+ Y P
Sbjct: 502 TGKARPLDAWIGDYTRARYGKTSPALVAAWDKAIAGAYTTR---------------YWTP 546
Query: 323 SIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASN 382
G Y + P A D+P + + +E +A +
Sbjct: 547 RWWHEQAGGYLFFKFPSLDGA---------DYPAAPGDPAALRAGIEALLAQAPQHGGEP 597
Query: 383 TYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDG 442
Y YD++DL R + ++ + AY+ D + + L +D LA +
Sbjct: 598 LYTYDVVDLVRHYASVQLDDRLKTAVAAYKAGDLAAGDRATAAAERLARHIDA-LAGNQQ 656
Query: 443 FLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDY 502
LG WL A ++ + A+ +T+W L DY ++ W GL Y
Sbjct: 657 ETLGSWLADAAAYGDTPAEKAAFVEQAKAVVTVWGGTGH-----LSDYASRAWQGLYAGY 711
Query: 503 YGPRAAIYFKYMIESLESGDGF-------RLKDWRREWIKLTNDWQNGRNVYPV 549
Y PR + + + F ++ W+ W+K W R P+
Sbjct: 712 YWPRWQRFLAAQRAAAAAHTPFDAKATSDAIRTWQAAWLKDGRMWPRQRPAAPL 765
>gi|456388164|gb|EMF53654.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
25435]
Length = 732
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 147/556 (26%), Positives = 248/556 (44%), Gaps = 47/556 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ NL + P+ + LD + VL ++I R+ ELGM PV P + G VPA P A+
Sbjct: 206 LQNLSAFPSPVSRQLLDARAVLGRRIADRVRELGMTPVFPGYFGTVPAGFAERVPGARTV 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + P W LD F + AF Q + +G +S +Y D E P
Sbjct: 266 PQGEWMGF-ARPDW-----LDPRTDDFARVAAAFYRVQEEMFGPSS-LYKMDLLHEGGDP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P ++ + ++ A W++ GW + PP +A++++V ++V+
Sbjct: 319 GDVP--VADAAKGVERALRRSRPGATWVILGWQHN------PP--RAIVDAVDKQHMLVV 368
Query: 181 DLFAEVKPIWSTSKQFYG-VPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D ++ P + + +G PY + + NF G+ + A + RT + + + G
Sbjct: 369 DGLSDRFPTVTDREADWGDTPYAFGSIWNFGGHTALGANTPDWAALYEKWRTKDGSRLHG 428
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ + E + NP + L SE+A++ ++D+K W +++ RYG P + AW++L T
Sbjct: 429 IALMPEAADNNPAAFALFSELAWREGELDLKTWFAEWAHARYGGRDPHAEAAWDILRRTA 488
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T S +EG +G S+ A+ + L Y
Sbjct: 489 YGTT-------------------RADSWSEGADGLFG---SRPALNAVRAGRWSPKQLRY 526
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
++ AL + EL AS+ YR DL+D+ RQAL+ + + I AY DA +
Sbjct: 527 DAADFEPALGEMLRVRPELRASSAYRRDLLDVARQALSNRSRVMLPQIKAAYDAKDATRL 586
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
SR +L L++ +D L+A LLG W+ A+ + + ++ + +T+W
Sbjct: 587 AAASRDWLSLMDLLDELVATDSRHLLGRWVADARSWGAGAAERTELGYDNLSLLTVW--G 644
Query: 480 TQEEASL-LRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
T+E A LRDY N+ W+GL+ Y R + YF+ + +L +G + D W L +
Sbjct: 645 TREGADAGLRDYANREWAGLVGGLYRLRWSTYFEELRAALAAGRAPKKID----WFALED 700
Query: 539 DWQNGRNVYPVESNGD 554
W E GD
Sbjct: 701 RWARNPGPLATEPTGD 716
>gi|398786493|ref|ZP_10549210.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
gi|396993639|gb|EJJ04702.1| alpha-N-acetylglucosaminidase [Streptomyces auratus AGR0001]
Length = 1048
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 139/565 (24%), Positives = 240/565 (42%), Gaps = 47/565 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
+ N+ G+GGP + ++ L ++I R+ ELGM+PVLP + G VP P A+
Sbjct: 206 LQNMSGYGGPTSSELIAKRAELGQRITGRLRELGMHPVLPGYFGTVPGGFAARNPGARTV 265
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W + + P W LD +F + AF Q G H + D E P
Sbjct: 266 PQGTWSGL-ARPDW-----LDPRTEVFAKTAAAFYRHQEHLLGPADH-FKMDLLHEGGDP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D P + A+ +++ A W++ GW + + LL++V +++++
Sbjct: 319 GDVP--VPDAARAVEKALRTARPGATWVILGWQNN--------PRRDLLDAVDHDRMLIV 368
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
D ++++ + + + GVPY + + NF G + A R + + G
Sbjct: 369 DGLSDLETVTDRERDWGGVPYAFGSIPNFGGRTTIGAKTHVWAERFPAWRDKPGSRLAGT 428
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
E E++P ++L SE+A++ VD AW + Y+ RYG + A+ L + Y
Sbjct: 429 AYMPEAAERDPAAFELFSELAWRERPVDRAAWFDGYADLRYGARDKGARAAFAALGTSAY 488
Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
+ + V A PD+ V + T ++D
Sbjct: 489 EISSKDGRPHDSVFAARPDLA-----------------ARSGTVYATHTPAFD------- 524
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
+ A + L S+ YR DL D RQALA + +L + +AY+ D
Sbjct: 525 PAAFDTAFAALLTVRPALRGSDAYRRDLTDTARQALANRSWQLIGQLQDAYRRKDRATFR 584
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
LS +L L+ + + H FLLGPWL A+ +A E+E + E +AR +T W D
Sbjct: 585 ALSGLWLHLMRLSEDVTGAHRQFLLGPWLTDARAMASGPEEEARLEHSARALLTTWADRP 644
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW-RREWIKLTND 539
+ L +Y N+ W GL+ + + P+ Y + ++L + + DW RE
Sbjct: 645 TADGGSLANYANRDWHGLIGEVHLPQWQAYLGELADALAADRPPKPFDWYARE-----EP 699
Query: 540 WQNGRNVYPVESNGDALITSQWLYN 564
W + R P+ +A T++ +++
Sbjct: 700 WTHERTTPPLHPTTEAYRTARRVHD 724
>gi|154321596|ref|XP_001560113.1| hypothetical protein BC1G_00945 [Botryotinia fuckeliana B05.10]
Length = 701
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 104/257 (40%), Positives = 153/257 (59%), Gaps = 7/257 (2%)
Query: 1 MSNLHG-WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKI 59
N+ G WGG +P +W++ Q +LQKKI+ R+ ELG+ PVLPAF+G VP L+ V P+A I
Sbjct: 185 FGNIQGSWGGTIPLAWIEDQHLLQKKIVQRMVELGITPVLPAFTGFVPRDLRRVAPNANI 244
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
+W ++ T+L DPLF + F+ Q + YG SHIY D F+EN P
Sbjct: 245 INGSDWGNLFPFEYSNDTFLY-PIDPLFKTLQHTFLSLQSEYYGNVSHIYTLDQFNENLP 303
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLF-SYDPFWRPPQMKALLNSVPLGK-L 177
P Y+ ++ Y +QS DS+A W++QGWLF + FW +++A L VP + +
Sbjct: 304 ASGDPLYLGNISRGTYDSLQSFDSNATWMLQGWLFYAASSFWTQDRVEAYLGGVPKNESM 363
Query: 178 VVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA-RTSENTT 236
++LDLF+E P W + Q+YG P+IWC LH + G +YG + +I +EA R SE
Sbjct: 364 LILDLFSESFPEWENTHQYYGKPWIWCQLHGYGGTPGIYGQIYNITNSSIEAFRNSEK-- 421
Query: 237 MVGVGMSMEGIEQNPVV 253
MVG+G +MEG + N ++
Sbjct: 422 MVGMGNTMEGQDGNGLI 438
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 111/250 (44%), Gaps = 46/250 (18%)
Query: 295 LYHTVYNCTDGATD--KNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSY 352
+Y +YN T+ + + +N + +V + EG+ N G + K + +
Sbjct: 401 IYGQIYNITNSSIEAFRNSEKMVGMGNT-------MEGQDGN-GLILPKSLLEMQPNITE 452
Query: 353 DHPHLWYS-TSEVIRALELFIASGNELSAS---------NTYRYDLIDLTRQALAKYANE 402
+H L S T ++ +LF A G +AS N +++D++D+TRQ L++
Sbjct: 453 NHGRLGQSLTIDLFNPADLFRAWGLLYNASISVPELWYDNGWKFDMVDVTRQVLSERFKL 512
Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELV-EDMDGLLACHDGFLLGPWLESAKQLAQNE-- 459
++++IE Y A F+ + L ++ ++D +L+ F L W+ +A + N
Sbjct: 513 EYVDLIEKYT---AEIDFEATSENLSMILRELDDILSTSPHFRLDTWINAAIASSPNSST 569
Query: 460 ---------------EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 504
+ + + +NA QIT+W Q + DY +K W GL+R YY
Sbjct: 570 YPIPSSDGSSELNITQTQHLFAYNAINQITIWGPTGQ-----INDYASKSWGGLVRGYYL 624
Query: 505 PRAAIYFKYM 514
R I+ Y+
Sbjct: 625 KRWEIFLDYI 634
>gi|409097333|ref|ZP_11217357.1| Alpha-N-acetylglucosaminidase, Alpha-L-fucosidase [Pedobacter agri
PB92]
Length = 724
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 150/557 (26%), Positives = 239/557 (42%), Gaps = 50/557 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQN---VFPSA 57
M NL GWGG + Q +QKK+L R+ EL ++P+L F G VP L A
Sbjct: 195 MGNLEGWGGTNSLQLMQLQSNIQKKVLSRMKELEIDPILQGFYGMVPHDLNKKVAALKDA 254
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
+I GNW + + +L T+ F + + + K YG + + F E
Sbjct: 255 QIIDQGNWVFTE----FIRPAILAPTNDKFNTVADVYYSELKKLYGSDIKFFGGEPFHEG 310
Query: 118 TPP--VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG 175
VD I+++ ++ MQ ++ W++QGW + ALL +
Sbjct: 311 GKKGGVD----ITAVAKSVQDVMQKNFPNSTWVLQGW--------QNNPADALLAGLKKE 358
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE-N 234
++++LF E W K + G +IW + NF +YG L + S
Sbjct: 359 NTLIIELFGENTSNWEQRKGYGGTNFIWSNVSNFGEKNGLYGRLQRFLDEVYRIKQSPYK 418
Query: 235 TTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNV 294
+ GVG+ EGI NPV YDLM ++A+++EK + WI Y+ RYG + DAW V
Sbjct: 419 DYLKGVGIIPEGINNNPVAYDLMLDIAWRNEKPPLDKWITDYTTYRYGSYNKDVADAWKV 478
Query: 295 LYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDH 354
TVY+ +K + V P +E Y ++ ++ + SS+
Sbjct: 479 FTETVYSSP--VNEKGKIVYQEGP---------SESIY------CARPSLKVNPVSSWGT 521
Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
Y T +A+ LFI + + S TY+ D D RQ +A ++ + +I A Q
Sbjct: 522 RKRNYDTKLFKQAVALFIKAETQFKNSETYQTDKTDFLRQVMADKGDQAYDELINAIQAK 581
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 474
D + + + FL ++ D LL + F L WL A L + K +NA+ QIT
Sbjct: 582 DKNAIKEKGNHFLTMILQQDSLLNNNHFFTLNRWLNQAVALGKGLPDAKNILFNAKAQIT 641
Query: 475 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMI-ESLESGDGFRLKDWRREW 533
W + + + LRDY +K W GLL Y R ++ + + + S F
Sbjct: 642 FWGPDNNPKTT-LRDYAHKEWGGLLSSLYYNRWKLFIDDALNDKITSASTF--------- 691
Query: 534 IKLTNDWQNGRNVYPVE 550
+ W N+YP++
Sbjct: 692 YDMEVKWSKDSNLYPIK 708
>gi|257067709|ref|YP_003153964.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
4810]
gi|256558527|gb|ACU84374.1| Alpha-N-acetylglucosaminidase (NAGLU) [Brachybacterium faecium DSM
4810]
Length = 768
Score = 191 bits (486), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 153/584 (26%), Positives = 248/584 (42%), Gaps = 60/584 (10%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M H G L L+ + L ++I R ELGM VLP F G +PA L + ++
Sbjct: 201 MGITHDLGAALTDEALEARAELGRRIAERERELGMTVVLPGFGGQLPAELVG---TERMI 257
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
W + + P DPLF E + Q + G T H Y D + E+ PP
Sbjct: 258 DWQGWHNALAAP----------GDPLFAEAAASLHRHQRQLLG-TDHHYAVDPYIESLPP 306
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
SP+ ++ AI++ M+ D AVW++QGW F Y +W ++ +LL+ VP +L++
Sbjct: 307 TTSPQQLAEHAEAIFTAMRDADPQAVWILQGWPFHYRAAYWTEERVHSLLSRVPEDRLIL 366
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGIL----DSIAFGPVEARTSENT 235
LDL+ E P+W + YG ++WC+ H F G ++G L D + A
Sbjct: 367 LDLWGEHAPMWHRTAAMYGRRWLWCLAHTFGGRFGLFGDLAALDDDLRGLRTAAEAGTRG 426
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVL 295
+ G G++ E ++ N VVY+L + A + W+ ++ +RRYG + P +Q AW V+
Sbjct: 427 RLEGFGITSEALDDNAVVYELATR-ALWSPMPPRERWLEEHIIRRYGTAAPEVQQAWQVI 485
Query: 296 YHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
HT+Y G T ++A P G P + + + D P
Sbjct: 486 AHTLYGP--GRTRSTPSPLIARP--------------WTRGLPFASQRLAGEALPDADGP 529
Query: 356 HLWYSTSE--------------VIRALELFIASGNELSASNTYRYDLIDLTRQALAKYAN 401
+E +R+L + SG + DL L A+ A
Sbjct: 530 PSANIDAENDAEMLGALAPLAHAVRSLLPVLRSGEH---RDALARDLAQLAIHVGAQSAR 586
Query: 402 ELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
I+ A D + + L+ +D + A L+G W+ A+ A +E+
Sbjct: 587 APLRAIVAAAAEADGERLRAEASTLEALLRAVDAVAATRPDMLVGRWIADARAGAGTDER 646
Query: 462 -EKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLES 520
E +AR+ I++W TQ+ S L DY ++WSG L D + R + ++ + E
Sbjct: 647 LADALERDARSLISVW--GTQD--SGLHDYSARHWSGSLTDLHLARWRAWTDWLARTAEE 702
Query: 521 -GDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGD-ALITSQWL 562
L+ + + DW++ YP G+ A SQ L
Sbjct: 703 PSTPPDLEQLHAQIRGIEEDWRDSTAPYPTTPRGEPAAAISQLL 746
>gi|147860882|emb|CAN83148.1| hypothetical protein VITISV_031934 [Vitis vinifera]
Length = 562
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 86/100 (86%), Positives = 94/100 (94%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLK 100
+LGNWF+V +PRWCCTYLLDATDPLFIEIGRAFI+QQLK
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGRAFIQQQLK 334
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 82/92 (89%), Positives = 89/92 (96%)
Query: 112 DTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNS 171
DTFDENTPPVD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+S
Sbjct: 429 DTFDENTPPVDDPEYISSLGAAIFKGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHS 488
Query: 172 VPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIW 203
VP+G+LVVLDLFAEVKPIW TS+QFYGVPYIW
Sbjct: 489 VPMGRLVVLDLFAEVKPIWITSEQFYGVPYIW 520
>gi|329940646|ref|ZP_08289927.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
gi|329300707|gb|EGG44604.1| alpha-N-acetylglucosaminidase [Streptomyces griseoaurantiacus M045]
Length = 798
Score = 186 bits (471), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 140/530 (26%), Positives = 228/530 (43%), Gaps = 50/530 (9%)
Query: 10 PLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVK 69
P+ LD + VL +++ R+ ELGM PVLP + G VP A+ G W
Sbjct: 227 PVSTQLLDARAVLGRRLADRLRELGMVPVLPGYFGTVPPGFAARNRGARTVPQGTWMGFD 286
Query: 70 SDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISS 129
P W LD LF + AF Q + +G ++H Y D E D P +
Sbjct: 287 R-PDW-----LDPRTDLFARVAAAFYRVQGELFGASTH-YKMDLLHEGGTAGDVP--VGE 337
Query: 130 LGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLG---------KLVVL 180
+ ++ DAVW++ GW + PP +A+L++V G +L+V+
Sbjct: 338 AAKGVERALRRARPDAVWVLLGWRHN------PP--RAILDAVASGGPDGAAGRERLLVV 389
Query: 181 DLFAEVKP-IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
D ++ P + + GVPY + + NF G+ + A RT E + + G
Sbjct: 390 DGLSDRFPTVTDREADWGGVPYAFGSIWNFGGHTTLGANTPDWARLYEAWRTKEGSALRG 449
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+ + E + NP + L SE+ + ++D+KAW +++ RYG + AW+VL T
Sbjct: 450 IALLPEAADNNPAAFALFSELPWHEGELDLKAWFARWARSRYGAYDAHAEAAWDVLRRTA 509
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 359
Y T S +EG +G ++ ++ +S+ L Y
Sbjct: 510 YGTT-------------------RADSWSEGADGLFG---ARPSLTARRAASWSPKELRY 547
Query: 360 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 419
E RAL+ + L S+ YR DL+D+ RQ L+ + L I A D
Sbjct: 548 DAHEFERALDELLKVRPGLRESSAYRRDLLDVARQCLSNRSRALLPRIARACAARDVKAF 607
Query: 420 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDN 479
S +L L++ ++ L+ LLG W A+ +E + + +++A + +T+W
Sbjct: 608 DAASGDWLSLMDLLERLVGTDARHLLGRWTAQARAWGADEAERDRLQYDALSLLTVWGTR 667
Query: 480 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
EA LRDY N+ W+GL+ Y R + YF + +L G DW
Sbjct: 668 QGAEAG-LRDYANREWAGLVGGLYRLRWSTYFTELRAALTEGRAPAAVDW 716
>gi|29832531|ref|NP_827165.1| alpha-N-acetylglucosaminidase [Streptomyces avermitilis MA-4680]
gi|29609651|dbj|BAC73700.1| putative alpha-N-acetylglucosaminidase, secreted [Streptomyces
avermitilis MA-4680]
Length = 1038
Score = 186 bits (471), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 139/563 (24%), Positives = 237/563 (42%), Gaps = 50/563 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
+ NL G+GGPL + ++ L ++I R+ LGM PVLP + G+VP ++ A +
Sbjct: 212 LQNLSGYGGPLSPELIAERAGLGRRICDRLRALGMAPVLPGYYGHVPKGFVERNGGDAHV 271
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W + P W LD F + ++F Q +G+ +H + D E
Sbjct: 272 VPQGIWHGFER-PDW-----LDPRTASFAAVAKSFYRHQKDVFGKAAH-FKMDLLHEGGT 324
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P + + +Q+ A W++ GW +P + ALL+++ K+++
Sbjct: 325 AGDVP--VPGAARGVEKALQAAHPGATWVILGW--EANP------LPALLDAIDKKKMLI 374
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
+D ++ + K + G PY + + NF G + R + +
Sbjct: 375 VDGVSDRYTSVTDREKDWGGTPYAFGTIPNFGGRTTIGARAHLWNEKFFAWRDKAGSALA 434
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G E +++P ++L SE+A+ K+D AW + Y+ RYG + Q AW L+ T
Sbjct: 435 GTAYLPEAADRDPAAFELFSELAWSAGKIDRAAWFSSYADFRYGGRDASAQKAWRALHDT 494
Query: 299 VYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y +D + + A PD+ + + Y L
Sbjct: 495 AYQQHAVERSDAHDSLFCARPDL------------------------AANRAAEYAPRAL 530
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
Y AL + L S Y YDL+D+ RQALA + + +L ++ A
Sbjct: 531 TYDPGRFDAALSGLLGVAGGLRGSAAYTYDLVDVARQALAHRSRQ-YLPLLRAAYARKDA 589
Query: 418 GVF-QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
F L+ +L L+ D + H FLLGPW+ A+ LA + + ++E A+ +T+W
Sbjct: 590 AAFTSLATLWLRLMGLSDEVTGTHPAFLLGPWINDARLLATDAGERAEFERTAKVLLTVW 649
Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
+A L +Y + W+GL+ D+Y PR + + ++L +G D W +
Sbjct: 650 GGRATSDAGDLHEYAGREWNGLMADFYLPRWKKWLDALADALATGTPPAAVD----WFAV 705
Query: 537 TNDWQNGRNVYPVESNGDALITS 559
W R YP+ GD T+
Sbjct: 706 EEPWTRERKDYPLRPVGDPYRTA 728
>gi|440695019|ref|ZP_20877582.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
gi|440282912|gb|ELP70302.1| Tat pathway signal sequence domain protein [Streptomyces
turgidiscabies Car8]
Length = 1050
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 149/569 (26%), Positives = 250/569 (43%), Gaps = 52/569 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL--QNVFPSAK 58
+ N+ G+GGP+ + ++++ L KI R+ ELGM PVLP + G VP +N +A
Sbjct: 216 LQNMSGFGGPVSRRLIEKRADLAAKITERVRELGMTPVLPGYFGTVPDEFVARNGGDAAV 275
Query: 59 ITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT 118
+ Q G+W + K P W LD F E+ AF + Q + +G S +Y D E
Sbjct: 276 VPQ-GDWGAFKR-PDW-----LDPRTTAFGEVAAAFYQAQSERFG-DSTMYKMDLLHEGG 327
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
P D P + A+ + ++ AVW + GW + P + +L++V ++
Sbjct: 328 NPGDVP--VGRAAQAVEAALRKAHPGAVWAILGWQNN-------PSGE-ILDAVDKSRMF 377
Query: 179 VLDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
V+D ++ + + G PY + + NF G+ M + R E++ +
Sbjct: 378 VVDGLSDRYTTVTDRESDWGGTPYAFGSIWNFGGHTPMGANAPDWVEQYPKWRDKEDSAL 437
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYH 297
G+ E + N L++++A+ +D+ W Y+V RYG P AW ++
Sbjct: 438 AGIAAMPEAADNNHAALALLTDLAWTPGTIDLDDWFASYAVSRYGAEDPHALAAWKIIGD 497
Query: 298 TVYNCT--DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHP 355
T Y + DG ++ + A P + +K A E YD
Sbjct: 498 TAYGMSRADGWSEAPDGLFGARPSLG-----------------ANKAAAWGPEADRYD-- 538
Query: 356 HLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
T+ AL + L ++ YRYDL D+ RQ L+ + L I AY D
Sbjct: 539 -----TTAFDLALTELLQVAPALRGNSAYRYDLADVARQVLSNRSRMLLPQIRAAYDTAD 593
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 475
+L+ +L+ + MD +LA LLG WL A+ ++ Q E++AR+ IT
Sbjct: 594 RVRFDELTGVWLDWMRLMDKVLATSGQHLLGRWLADARSWGATRGEKDQLEYDARSIITT 653
Query: 476 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 535
W E L DY N+ WSGL+ Y R +YF+ + +L + D W
Sbjct: 654 WGGRASSEEG-LHDYANREWSGLVGGLYLTRWTLYFRELSRALRQNRPPKTVD----WFT 708
Query: 536 LTNDWQNGRNVYPVESNGDALITSQWLYN 564
L +DW + + +P +++GD ++ ++N
Sbjct: 709 LEDDWAHRHDSHPTKTSGDVHKLARRVHN 737
>gi|456390168|gb|EMF55563.1| alpha-N-acetylglucosaminidase [Streptomyces bottropensis ATCC
25435]
Length = 1042
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 141/562 (25%), Positives = 242/562 (43%), Gaps = 48/562 (8%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA-LQNVFPSAKI 59
+ NL G+GGPL + ++ L ++I R+ ELGM+PVLP + G+VP ++ A +
Sbjct: 212 LQNLSGYGGPLSPQLIARRAGLGRRITDRLRELGMSPVLPGYYGHVPKQFVERNGGDAHV 271
Query: 60 TQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP 119
G W + P W LD F + +F +G +H + D E
Sbjct: 272 VPQGLWHGFER-PDW-----LDPRTDSFARVAASFYGHVRDVFGAAAH-FKMDLLHEGGT 324
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVV 179
D P + + + DA+W++ GW +P + LL+++ ++++
Sbjct: 325 AGDVP--VPDAARGVERALHKAHPDAIWVILGW--QENP------LPELLDAIDRSRMLI 374
Query: 180 LDLFAE-VKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238
+D ++ + + + G PY + + NF G + R ++ +V
Sbjct: 375 VDGVSDRYASVTDRERDWGGTPYCFGTIPNFGGRTTIGARAHLWTDKFFAWRDKPDSALV 434
Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G E +++P ++L SE+A+ K+D AW + Y+ RYG A + AW L+ T
Sbjct: 435 GTAYMPEATDRDPAAFELFSELAWTPGKIDRAAWFSAYADFRYGGRDDAARAAWRALHET 494
Query: 299 VYNCTD-GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
Y +D + + A PD+ + A T +YD
Sbjct: 495 AYQQRAVERSDPHDSLFCARPDL-----------------AADRAAEYAPRTLTYDPGRF 537
Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
+ A L +A G + + YRYD++DL RQALA + + + A++ D
Sbjct: 538 -----DAAFAGLLDVAGGRRRNPA--YRYDVVDLARQALAHRSRQYLPQLRAAHRRKDLT 590
Query: 418 GVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWF 477
LS +L L+ D + FLLGPW+ A+ LA ++ + ++E A+ IT+W
Sbjct: 591 TFRALSTLWLRLMRLSDEVTGTDGAFLLGPWVNDARLLATDDAERAEFERTAKVLITVWG 650
Query: 478 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 537
+ L +YGN+ W+GL+ D+Y PR + + ++L +G D W
Sbjct: 651 GRATSDTGDLHEYGNREWNGLMADFYVPRWQKWLDALEDALATGTAPAAVD----WFAFE 706
Query: 538 NDWQNGRNVYPVESNGDALITS 559
W R YP+ GDA T+
Sbjct: 707 EPWTRERKDYPLRPVGDAYRTA 728
>gi|255079272|ref|XP_002503216.1| GH family 89 protein [Micromonas sp. RCC299]
gi|226518482|gb|ACO64474.1| GH family 89 protein [Micromonas sp. RCC299]
Length = 1260
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 157/554 (28%), Positives = 231/554 (41%), Gaps = 140/554 (25%)
Query: 58 KITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDEN 117
K +LG + K D + LD +D LF +G AF +Q ++++G T H+Y DTF E
Sbjct: 396 KDAELGKY--AKKDDSVRSVHFLDPSDALFQSLGAAFTKQLVEDFG-TDHLYLADTFREI 452
Query: 118 TPPVD--SPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL 174
P D S ++ +GAA + M+S D A W+ Q F +P FW + ALL SV +
Sbjct: 453 RDPNDDFSETHVVRVGAATLAAMRSADPRATWVFQSDAFRRNPRFWNEGRRGALLRSVDI 512
Query: 175 GKLVVLDLFAEVKPIW-STSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEA---- 229
G ++VLD AE P + F G P++WC+ HN GN+ M G L +IA GP A
Sbjct: 513 GDMLVLDSAAETDPYYLREPVHFAGQPFVWCVKHNHGGNLGMRGRLSAIATGPAAAMDSL 572
Query: 230 -----------------------------RTSENTT----------MVGVGMSMEGIEQN 250
R S T +VG G++ EG+EQN
Sbjct: 573 ASRRDGERGTTHGRGTRVGSSRRMLADNKRVSREATHGSRKVGKSQLVGFGITAEGVEQN 632
Query: 251 PVVYDLMSEMAFQHEKVDVKAWINQ--------YSVRRYGRSV----------------- 285
PVVY+L + + + VDV +++ YSVR+ +
Sbjct: 633 PVVYELAALTSQSEKGVDVDWFLSDYSRRRYGGYSVRQPAPTTLPVGTGQGAFLAGFIVG 692
Query: 286 ------------------PA-------------IQDAWNVLYHTVYNCTDGATDKN--RD 312
PA ++AW +L TVY D++ RD
Sbjct: 693 NNPIAGSPGYLGPGEWYDPAKHGEMGKEEAYDRAREAWEILGKTVYGARAKGEDEDHVRD 752
Query: 313 VIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFI 372
P + +S Y + K V Y+ +I A
Sbjct: 753 ACSWQPSLRADELSP---DYFDAAKVVD------------------YAFKPLIDAAPTLR 791
Query: 373 ASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVED 432
A+G A YD++D+ RQ LA+ +N L I ++ N+A + LEL++D
Sbjct: 792 ANG----AGTRVDYDIVDVGRQLLARQSNVLATQIRDSLNSNNASEAKMYGTQMLELLDD 847
Query: 433 MDGLLACHDGFLLGPWLESAKQLA---QNEEQEKQYEWNARTQITMWFDNTQEEASLL-- 487
MD LL H GFLLG ++ESAK A E E E +AR+ I+ + + + + L
Sbjct: 848 MDALLRSHKGFLLGNYIESAKSWAGKRNKESDEANLERSARSLISGFGPSGSKLGAPLGH 907
Query: 488 --RDYGNKYWSGLL 499
DY N+ WSG+L
Sbjct: 908 PMHDYSNRQWSGML 921
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 38/61 (62%)
Query: 7 WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWF 66
W G P+ WL +Q LQ+ + + + GM PVLP F+G+VP A+ FP AK+ ++ NW
Sbjct: 217 WTGGRPKKWLKRQWDLQRDAVKLMRDFGMTPVLPGFNGHVPPAIARRFPEAKLRRVENWL 276
Query: 67 S 67
+
Sbjct: 277 T 277
>gi|169351438|ref|ZP_02868376.1| hypothetical protein CLOSPI_02218 [Clostridium spiroforme DSM 1552]
gi|169291660|gb|EDS73793.1| F5/8 type C domain protein [Clostridium spiroforme DSM 1552]
Length = 1762
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 146/565 (25%), Positives = 243/565 (43%), Gaps = 80/565 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GGP+ W+ +L + ++ LGM VL ++G VP + + +I
Sbjct: 816 MDNMEVIGGPVSDEWVKGRLEMARENQRWKNSLGMQTVLQGYAGMVPNNFTD-YQDVEIL 874
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTP- 119
+ GNW V PR ++ L+ + + F E Q +G+TS+ Y D F E
Sbjct: 875 EQGNWCGV---PR---PDMIRTDGELYDQYAKLFYEAQEWAFGKTSNYYAVDPFHEGGKR 928
Query: 120 PVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKAL--LNSVPLGKL 177
P D + + + + + + D +AVW++Q W W P L + +
Sbjct: 929 PSDLTDDV--ISREVLNSLLEYDQEAVWMVQAW-------WSNPTNDLLKGMGDDREDHV 979
Query: 178 VVLDLFA------------EVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG 225
++LDL E S +F ++WCML N+ GN M G I
Sbjct: 980 IILDLNGLNDAYDSYWDKTEYNGTVLESDEFNSTSWVWCMLENYGGNPSMDGRPKEI-IN 1038
Query: 226 PVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSV 285
+ +++ M G+G E NP++Y+L+ +MA+Q + +D+ W+++Y +RRYG
Sbjct: 1039 RINKASTQAEHMKGIGFISEATYDNPMIYELLLDMAWQQDTIDLDDWLDEYVLRRYGDYS 1098
Query: 286 PAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVL 345
+ +AW++L TVY+ + TD ++A DPS++ YG P
Sbjct: 1099 ESAGEAWDILLKTVYSRSGKTTD-----VIARS--DPSLVQ--------YGLP------- 1136
Query: 346 KSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFL 405
Y+ SE+ ALEL ++LSAS YRYDL ++ RQ + YA
Sbjct: 1137 -------------YTASELEEALELLYKDYDKLSASEAYRYDLTEIMRQVVNNYAVVRLG 1183
Query: 406 NIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEK-Q 464
++ AY + L ++L ++ ++ + L+G W+ A A++ +
Sbjct: 1184 DLKTAYDAKEIDNFKSLKEQYLNAIDLLNEVCGTQQDLLIGEWVGRAVDWAKDTNSDDFA 1243
Query: 465 YE---WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 521
Y+ NA+T IT+W +T L Y + + G++ D Y Y E LE G
Sbjct: 1244 YDSMIINAKTLITVWAPSTT-----LGTYAYRNYEGMINDIYKVIWQAYLDQSEEILEFG 1298
Query: 522 DG----FRLKDWRREWIKLTNDWQN 542
D +WI D QN
Sbjct: 1299 SAKTNLVNYHDLCMDWIYADWDLQN 1323
>gi|328867426|gb|EGG15808.1| alpha-N-acetylglucosaminidase [Dictyostelium fasciculatum]
Length = 992
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 104/329 (31%), Positives = 162/329 (49%), Gaps = 38/329 (11%)
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLY 296
M G G++ E IEQN ++YDLM+EMA++ ++ WINQY+ RRYG VP + AWN+L
Sbjct: 219 MKGTGLTPEAIEQNYMMYDLMNEMAWRTTAPNMTEWINQYTQRRYGVFVPELAQAWNLLI 278
Query: 297 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPH 356
TV+N T G YG P S + + D
Sbjct: 279 PTVFNATLGY----------------------------YGPPSSFVGMRPQLNMTND--- 307
Query: 357 LWYSTSEVIRALELFIASGNE-LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND 415
L+Y S V +A +L++ +E + ++ T+ +D+ ++T QAL+ + + + +AY N
Sbjct: 308 LYYDPSVVQQAWQLYLGVTDEYVLSTATFSFDVSEITLQALSNLFMDTQMAMYDAYLTNQ 367
Query: 416 AHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE--QEKQYEWNARTQI 473
+ + + L ++ DMD + A L+G W +A+Q A N + +E+NAR QI
Sbjct: 368 STVFEERATSCLNIITDMDTIAATQQMLLVGTWTANARQWALNTSSGETAPFEFNARNQI 427
Query: 474 TMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW 533
T+W S L DY WSGLL D+Y R A++ KYM SL + F D+ +
Sbjct: 428 TLW----GPPNSSLHDYAYHLWSGLLNDFYFARWALFIKYMDTSLSTNTTFNNTDYTNDI 483
Query: 534 IKLTNDWQNGRNVYPVESNGDALITSQWL 562
L W N YP G+A + S+++
Sbjct: 484 ESLEESWNNQNYQYPTLPTGNAYLLSKFI 512
>gi|84625358|ref|YP_452730.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|84369298|dbj|BAE70456.1| putative N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
MAFF 311018]
Length = 590
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 133/524 (25%), Positives = 221/524 (42%), Gaps = 84/524 (16%)
Query: 54 FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 113
P A+I ++ W TY LD DPLF ++ R F+E + YG Y D
Sbjct: 46 LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 98
Query: 114 FDENTPPVD-------SPEY-----------------------ISSLGAAIYSGMQSGDS 143
F+E PPV + +Y +++ G A+Y + +
Sbjct: 99 FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 158
Query: 144 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 201
A W+MQGWLF D FW+P + A L VP +L+VLD+ + P W S+ F +
Sbjct: 159 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 218
Query: 202 IWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLM 257
I+ +HN+ + +YG +AF ++A ++ + G G+ EG+ N VVY+ +
Sbjct: 219 IYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYL 275
Query: 258 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 317
+A++ + W+ QY RYGRS A+ AW L +Y +
Sbjct: 276 YALAWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTRYWS----------- 324
Query: 318 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 377
P + G Y + +P + D P + A++ + +
Sbjct: 325 ----PRWWNTHAGAYLLFKRPTADIVNFD------DRPG---DPQRLRSAIDALLQQADR 371
Query: 378 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGL 436
+ + YRYDLI+ R L+ A+ +++AY D A G QL+R +LV+ +D L
Sbjct: 372 YADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLART-TQLVQGLDAL 430
Query: 437 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 496
+ L ++A + + + Y NAR Q+++W + L DY +K W
Sbjct: 431 VGGQHETLAAWTGQAAAAVGNDARLLRAYVGNARAQVSVWGGDGN-----LADYASKAWQ 485
Query: 497 GLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
G+ D+Y R + + ++G F +L W R+W
Sbjct: 486 GMYADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 529
>gi|293371910|ref|ZP_06618314.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292633156|gb|EFF51733.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 411
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 119/449 (26%), Positives = 202/449 (44%), Gaps = 56/449 (12%)
Query: 129 SLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVVLDLFAEVK 187
+ + +Y+ + + D A W+ W+F +D W +MKALL VP K+++LD E
Sbjct: 3 KIASDMYATLTAADPKAQWMQMTWMFYFDKDKWTSERMKALLTGVPQNKMILLDYHCENV 62
Query: 188 PIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGI 247
+W ++ F+ PYIWC L NF GN + G + A + + G+G ++EG+
Sbjct: 63 ELWKRTEHFHDQPYIWCYLGNFGGNTTLTGNVKESGARLENALINGGGNLKGIGSTLEGL 122
Query: 248 EQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGAT 307
+ Y+ + E A+ + VD WI + R G +++DAW L++ +Y
Sbjct: 123 DVMQFPYEYILEKAW-NLNVDDNKWIECLADRHVGCVSQSVRDAWKRLFNDIY------- 174
Query: 308 DKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRA 367
V P T G Y +P + K ++ Y + L EV R
Sbjct: 175 -------VQVPR--------TLGTLPGY-RPALNKNSEKRTSNVYSNVEL----LEVWRK 214
Query: 368 LELFIASGNELSAS--NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRR 425
L NE + + +R DLI + RQ L Y ++ + + D + +
Sbjct: 215 L-------NEAPSDRRDAFRLDLITVGRQVLGNYFLDVKMEFDRMVEAKDYQALKACGEK 267
Query: 426 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 485
E++ D+D L A H L W++ A+++ + + + YE NAR IT W
Sbjct: 268 MKEILNDLDKLNAFHPYCSLDKWIDDARKMGDSPQLKDYYEKNARNLITTW-------GG 320
Query: 486 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG---DGFRLKDWRRE----WIKLTN 538
L DY ++ W+GL+ DYY R +Y I+ + G D +L+D +E W+ T+
Sbjct: 321 SLNDYASRSWAGLISDYYAKRWEVYINTFIKVVGEGVEVDQKQLEDELKEIEEGWVNATD 380
Query: 539 DWQNGRNVYPVESNGDALIT-SQWLYNKY 566
++V+ S D L++ S +L++KY
Sbjct: 381 RKDTRKDVH---STTDGLLSFSTFLFSKY 406
>gi|58583545|ref|YP_202561.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
gi|58428139|gb|AAW77176.1| N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae KACC 10331]
Length = 753
Score = 151 bits (381), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 132/524 (25%), Positives = 222/524 (42%), Gaps = 84/524 (16%)
Query: 54 FPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT 113
P A+I ++ W TY LD DPLF ++ R F+E + YG Y D
Sbjct: 209 LPHARIYRMRAWEGFHE------TYWLDPRDPLFAKVARRFLELYTQAYG-AGEFYLADA 261
Query: 114 FDENTPPVD-------SPEY-----------------------ISSLGAAIYSGMQSGDS 143
F+E PPV + +Y +++ G A+Y + +
Sbjct: 262 FNEMLPPVADDGSDVAAAKYGDSIANFDAARAKAVPPAQRDARLAAYGQALYRSIAQVNP 321
Query: 144 DAVWLMQGWLFSYD-PFWRPPQMKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPY 201
A W+MQGWLF D FW+P + A L VP +L+VLD+ + P W S+ F +
Sbjct: 322 KATWVMQGWLFGADCAFWQPQAIAAFLGKVPDARLMVLDIGNDRYPGTWKASQAFDNKQW 381
Query: 202 IWCMLHNFAGNIEMYGILDSIAF--GPVEARTSE--NTTMVGVGMSMEGIEQNPVVYDLM 257
I+ +HN+ + +YG +AF ++A ++ + G G+ EG+ N VVY+ +
Sbjct: 382 IYGYVHNYGASNPLYG---DVAFYRQDLQALLADPGKRNLRGFGVFPEGLHSNSVVYEYL 438
Query: 258 SEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAF 317
+A++ + W+ QY RYGRS A+ AW L +Y
Sbjct: 439 YALAWEGPQHPWSQWLAQYLRARYGRSDAALLSAWTDLGAGIYQTR-------------- 484
Query: 318 PDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 377
P + G Y + +P + ++ + D + A++ + +
Sbjct: 485 -YWSPRWWNTHAGAYLLFKRPTAD--IVNFDDRPGD-------PQRLRSAIDALLQQADR 534
Query: 378 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLND-AHGVFQLSRRFLELVEDMDGL 436
+ + YRYDLI+ R L+ A+ +++AY D A G QL+ R +LV+ +D L
Sbjct: 535 YADAPLYRYDLIEDARHYLSLQADRQLQTVVQAYNAGDFARGDAQLA-RTTQLVQGLDAL 593
Query: 437 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 496
+ L ++A + + + Y NAR Q+++W + L DY +K W
Sbjct: 594 VGGQHETLAAWTGQAAAAVGNDARLLRAYVGNARAQVSVWGGDGN-----LADYASKAWQ 648
Query: 497 GLLRDYYGPRAAIYFKYMIESLESGDGF-------RLKDWRREW 533
G+ D+Y R + + ++G F +L W R+W
Sbjct: 649 GMYADFYLQRWTRFLSAYRAARKAGTPFDAQTVDQQLATWERQW 692
>gi|293402299|ref|ZP_06646437.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
5_2_54FAA]
gi|291304406|gb|EFE45657.1| putative alpha-N-acetylglucosaminidase [Erysipelotrichaceae bacterium
5_2_54FAA]
Length = 2330
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 128/530 (24%), Positives = 228/530 (43%), Gaps = 60/530 (11%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ +GGP+P ++ ++ L + LGMN VL ++G VP P+ +T
Sbjct: 670 MQNIETFGGPIPDQYVVDRVELARTTQRWKNSLGMNTVLQGYAGMVPTNFNEFQPNVPLT 729
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+W + + P T P + E + F E Q YG TS Y D + E
Sbjct: 730 AQKSWGGL-ARPSMIPT-----DSPYYDEYAKLFYEAQEYIYGATSDYYAVDPYHEGGT- 782
Query: 121 VDSPEYIS--SLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSV---PLG 175
PE +S ++ + + + D DAVW++Q W + LLN +
Sbjct: 783 --RPEGLSDETVAREVLNSLLDYDKDAVWVVQAW--------QSNPTDGLLNGMGEYREN 832
Query: 176 KLVVLDLFAEVKPIWS--TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSE 233
++++DL W+ +F G + W +L F GN M G + ++ ++ E
Sbjct: 833 HVLIVDLIKYPIKSWTKYNKSEFKGTSWAWGLLGGFGGNPTMNGEMQTM-VNDIQTAKKE 891
Query: 234 NTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWN 293
T M G+G+ E NPV+YDL+ ++A+ + + W+N+Y RRYG + ++AW
Sbjct: 892 RTHMAGLGIISEAQYDNPVLYDLIFDLAWVDDDFSLDQWLNKYIERRYGGTSDNAKEAWK 951
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYD 353
++ + YN V F + + Q+YGK
Sbjct: 952 IMKNANYNHG-----------VRFT---AQVYGMKGKSPQDYGK---------------- 981
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQL 413
++ Y ++ A L I ++ S YRYDL ++ RQ ++ Y+ + N+I+A +
Sbjct: 982 -QNISYGADKLETAFRLLIEDYDKFKDSECYRYDLTEIMRQMVSNYSTLTYNNVIDARED 1040
Query: 414 NDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNART 471
+ + +FL+ + ++ + L G W+ A+ A + + + +E NA+
Sbjct: 1041 KNIEKFKEEKAKFLKSFDVLNDIQETQVDQLAGEWIGKAQDRAADYDDFAKDAFEMNAKA 1100
Query: 472 QITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG 521
IT W ++ A L+DY + + G+ D Y Y + +LE+G
Sbjct: 1101 LITSWA--SRSSAGGLKDYAWRNYQGMFIDLYKQNWIDYLDQVEANLENG 1148
>gi|281423203|ref|ZP_06254116.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
gi|281402539|gb|EFB33370.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
Length = 450
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 93/267 (34%), Positives = 136/267 (50%), Gaps = 26/267 (9%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+NL GWGGPLP SW +QQ LQKKIL R++E GM PVLP F G +P + +T
Sbjct: 186 MNNLEGWGGPLPDSWYNQQEALQKKILKRMHEYGMQPVLPGFCGMMPHDAKAKL-GLNVT 244
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G W L TD +I + + YG+ ++ Y+ D F E
Sbjct: 245 DGGIWNGYTRPAN------LSPTDAHSDKIADLYYAELTNLYGKANY-YSMDPFHETND- 296
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
D S G + M+ + +A W++QGW + +P RP +K + N G L+VL
Sbjct: 297 -DEAIDYSKAGRKVMEAMKRVNPNATWVIQGW--TENP--RPQMIKNMKN----GDLLVL 347
Query: 181 DLFAEVKP------IWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSEN 234
DLF+E +P IW K + +++CML NF N+ ++G +D + + S
Sbjct: 348 DLFSECRPMFGIPSIWKREKGYEQHDWLFCMLENFGANVGLHGRMDLLLHNFYSTKQSSP 407
Query: 235 TT--MVGVGMSMEGIEQNPVVYDLMSE 259
T + G+G +MEG E NPV+++LMSE
Sbjct: 408 NTQHLKGIGFTMEGSENNPVMFELMSE 434
>gi|380804373|gb|AFE74062.1| alpha-N-acetylglucosaminidase precursor, partial [Macaca mulatta]
Length = 265
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 92/145 (63%), Gaps = 3/145 (2%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L R+ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 124 MGNLHTWDGPLPPSWHIKQLYLQHRVLDRMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 183
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 184 KMGSWGHFNCS--YSCSFLLAPEDPMFPVIGSLFLRELVKEFG-TDHIYGADTFNEMQPP 240
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDA 145
+P Y+++ A+Y M + D++A
Sbjct: 241 SSAPSYLAAATTAVYEAMIAVDTEA 265
>gi|212722968|ref|NP_001131519.1| uncharacterized protein LOC100192858 [Zea mays]
gi|194691748|gb|ACF79958.1| unknown [Zea mays]
Length = 114
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 61/99 (61%), Positives = 75/99 (75%)
Query: 475 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 534
MWFDNT+ +ASLLRDY NKYWSGLL+DYYGPRAAIYFK+++ S+E+ F LK+WRREWI
Sbjct: 1 MWFDNTETKASLLRDYANKYWSGLLQDYYGPRAAIYFKHLLLSMENNAPFALKEWRREWI 60
Query: 535 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF 573
LTN+WQ+ R V+ + GD L SQ LY KYL +
Sbjct: 61 SLTNNWQSDRKVFSTTATGDPLNISQSLYTKYLSNADLL 99
>gi|260821254|ref|XP_002605948.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
gi|229291285|gb|EEN61958.1| hypothetical protein BRAFLDRAFT_132235 [Branchiostoma floridae]
Length = 673
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 93/321 (28%), Positives = 131/321 (40%), Gaps = 97/321 (30%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GWGGPLPQSW QL LQ KIL R+ N + L +++ +
Sbjct: 368 MGNIRGWGGPLPQSWHQNQLELQHKILARMR-------------NFDSTLMHLY----LD 410
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
G ++ C T + +C E
Sbjct: 411 YSGGDLKTRTVAHTCWTLRI-----------------------------HCFLTLEECLL 441
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
+ P Y+S GAA+Y+GM +GD A+WLMQGWLF FW+P Q KALL SVP G
Sbjct: 442 LSEPNYLSKAGAAVYAGMLAGDPQAIWLMQGWLFQARDFWQPAQTKALLQSVPEG----- 496
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
P AR +TMVG
Sbjct: 497 ---------------------------------------------PFLARKYLGSTMVGT 511
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDV-KAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G++ EGI+QN ++Y+LM+E+A+ + + W + Y+ RYG W +L +V
Sbjct: 512 GLTPEGIDQNYIMYELMNEVAWMPQPFQILDNWASDYAWSRYGVKNSNASLGWQILLKSV 571
Query: 300 YNCTDGATDKNRDVIVAFPDV 320
Y+C +G D V+V PD+
Sbjct: 572 YDCENGFKDHCDSVVVHRPDL 592
>gi|302522684|ref|ZP_07275026.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
gi|302431579|gb|EFL03395.1| alpha-N-acetylglucosaminidase [Streptomyces sp. SPB78]
Length = 355
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 89/333 (26%), Positives = 155/333 (46%), Gaps = 25/333 (7%)
Query: 198 GVPYIWCMLHNFAGNIEM-YGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDL 256
G PY + + NF G+ + D + P R + + G+ + E + NP ++L
Sbjct: 10 GTPYAFGSIWNFGGHTALGANTRDWVDLYP-RWRDRSGSRLSGIALMPEAADNNPAAFEL 68
Query: 257 MSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVA 316
+E+ + VD+ W +Y+ RYG S + AW++L TVY D+ +
Sbjct: 69 FAELPWTEGPVDLTDWFREYARVRYGGSDAHAEAAWDILRTTVYGTRR--DDRWSEPADG 126
Query: 317 FPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGN 376
P++ +V+ GK+ L+ +S++ AL+ ++
Sbjct: 127 LFGARPALDAVSAGKWS--------PKALRYPAASFEP------------ALDELLSVRA 166
Query: 377 ELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGL 436
EL S TYR DL+D+ RQALA + L + AY+ + +L RR++ L++ ++ L
Sbjct: 167 ELRDSATYRRDLLDVARQALANRSRTLLPRLAAAYKAKNQAEFARLGRRWIALIDLLEQL 226
Query: 437 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 496
+A + LLG W+ESA+ + ++ Q +++A + +T W +A LRDY N+ WS
Sbjct: 227 VATDENHLLGRWVESARAWGGSAREKSQLQYDALSLLTTWGTRQGADAG-LRDYANREWS 285
Query: 497 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 529
GL+ Y R Y + +L+ G DW
Sbjct: 286 GLVGGLYRLRWGTYIDELSAALKEGRKPVAVDW 318
>gi|339238239|ref|XP_003380674.1| GDP-L-fucose synthetase [Trichinella spiralis]
gi|316976398|gb|EFV59699.1| GDP-L-fucose synthetase [Trichinella spiralis]
Length = 1203
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 75/220 (34%), Positives = 113/220 (51%), Gaps = 7/220 (3%)
Query: 89 EIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWL 148
+G + + L+ Y H Y+ D F+E P ++ ++ AIY+ M S D +VW+
Sbjct: 801 HVGNEVVWKSLENYFGLFHAYSADPFNEMVPNTFDVMFLRNVSFAIYNVMLSVDPKSVWV 860
Query: 149 MQGWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLH 207
+Q W+F W + K L +VP G ++V+DL+AE P++ FY P+IWCMLH
Sbjct: 861 LQSWMFLSSERWLENENAKHFLTAVPTGSILVVDLYAEEYPLYEKFSGFYNQPFIWCMLH 920
Query: 208 NFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAF--QHE 265
NF G +YG L I + T N MVG G+SMEGI+QN VVY + + + ++
Sbjct: 921 NFGGVQGLYGNLARINQKLADVSTVSNINMVGTGLSMEGIDQNYVVYQMALDRFWSPNNQ 980
Query: 266 KVDVKAWINQYSVRRYGRSV-PAIQDAWNVLYHTVYNCTD 304
KVD+ AW Y G + +I AW + C +
Sbjct: 981 KVDLAAW---YIYIHLGVGITKSIYTAWGAFLQSSRTCQE 1017
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 94/209 (44%), Gaps = 10/209 (4%)
Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
T + A F+ S + Y DL++LT+ AL +L+ + +Y
Sbjct: 998 TKSIYTAWGAFLQSSRTCQENEIYINDLVELTKHALMLTGAKLYEKLQASYIRKCGQEFL 1057
Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 480
+ + +++ D++ + H +L W+E A+ + Q Q E N R Q+T+W
Sbjct: 1058 ENAAAVEQVLSDLEWISKTHSRSMLSKWIEIARANGKTAAQSDQLEENLRMQVTIW--GP 1115
Query: 481 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYM---IESLESGDGFRLKDWRREWIKLT 537
Q E + DY K W+ L +YY PR ++F ++ I LE+ + L I+L
Sbjct: 1116 QGE---IVDYARKQWAALFSEYYLPRWRLFFAHLYADILQLETFNQTLLNSRLFHEIELP 1172
Query: 538 NDWQNGRNVYPVESNGDALITSQWLYNKY 566
Q N+ + G+ ++ S+ LY++Y
Sbjct: 1173 FALQKIPNI--DQPTGNTVVVSKILYSRY 1199
>gi|402824586|ref|ZP_10873940.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
gi|402261896|gb|EJU11905.1| N-acetylglucosaminidase, partial [Sphingomonas sp. LH128]
Length = 486
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 123/252 (48%), Gaps = 39/252 (15%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NL G+ PL W++++ LQ +IL R+ LGM PVLPAF+G VP A P A+I
Sbjct: 223 MGNLAGYRAPLSSGWIEKKHQLQLRILARMRALGMKPVLPAFAGYVPEAFAKAHPKARIY 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W TY LD +DPLF ++ F+ + YG + Y D F+E PP
Sbjct: 283 KMRAWEGFPP------TYWLDPSDPLFTQLAARFVTLYNRTYGEGEY-YLADAFNEMIPP 335
Query: 121 V-------DSPEY-----------------------ISSLGAAIYSGMQSGDSDAVWLMQ 150
+ + EY +++ G +Y + + A W+MQ
Sbjct: 336 IAEDGSDAAAAEYGDSIANTAATRAAALPPAVRDARLAAYGERLYGSITAAAPKATWVMQ 395
Query: 151 GWLFSYDPFWRPPQ-MKALLNSVPLGKLVVLDLFAEVKP-IWSTSKQFYGVPYIWCMLHN 208
GWLF D +R P+ + A L+ VP ++++LD+ + P IW + F G + + +HN
Sbjct: 396 GWLFGADKAFRTPEAIAAFLSRVPDDRMLILDIGNDRYPGIWQKTDAFDGKAWTYGYVHN 455
Query: 209 FAGNIEMYGILD 220
+ G+ +YG L+
Sbjct: 456 YGGSNPVYGDLE 467
>gi|47212645|emb|CAF95026.1| unnamed protein product [Tetraodon nigroviridis]
Length = 121
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 56/121 (46%), Positives = 83/121 (68%), Gaps = 3/121 (2%)
Query: 20 LVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYL 79
L LQ KIL ++ GM PVLPAFSGNVP + ++P A++T+LG W K + + C+Y+
Sbjct: 4 LSLQFKILEQMRSFGMTPVLPAFSGNVPKGILRLYPEARVTRLGPW--SKFNCSFSCSYI 61
Query: 80 LDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQ 139
LD DPLF+ IG ++ Q +K++G T+HIYN DTF+E TPP P Y+S++ A+++ M
Sbjct: 62 LDPRDPLFLRIGSLYLAQVVKQFG-TNHIYNTDTFNEMTPPSSEPNYLSAVSRAVFAAMT 120
Query: 140 S 140
+
Sbjct: 121 A 121
>gi|281423204|ref|ZP_06254117.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
gi|281402540|gb|EFB33371.1| alpha-N-acetylglucosaminidase [Prevotella oris F0302]
Length = 291
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 112/240 (46%), Gaps = 22/240 (9%)
Query: 280 RYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV 339
RYG++ P I+ AW +L T+YNC G + SI G+P
Sbjct: 4 RYGKTSPEIERAWQLLSETIYNCPAGNNQQG---------PHESIFC---------GRP- 44
Query: 340 SKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKY 399
++ + S+ +Y + A +L ++ +N + YDL+D+ RQALA
Sbjct: 45 ---SLNNFQVKSWSKMRNYYDLQATLEAAQLMTGIADQYKGNNNFEYDLVDICRQALADQ 101
Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
+L I Y + + RFLE++ D LL F LG W E+A++L +
Sbjct: 102 GRLQYLKTIADYNGFSRKAFAKDAHRFLEMILLQDKLLGTRTEFRLGHWTEAARKLGTTQ 161
Query: 460 EQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLE 519
+++ YEWNAR QIT W + + L DY +K W G+L+D+Y R I+ + + +E
Sbjct: 162 QEKDLYEWNARVQITTWGNRMCADKGGLHDYAHKEWQGILKDFYYKRWKIFMDALAKQME 221
>gi|347541919|ref|YP_004856555.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-rat-Yit]
gi|346984954|dbj|BAK80629.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-rat-Yit]
Length = 912
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 127/546 (23%), Positives = 234/546 (42%), Gaps = 71/546 (13%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E G+ P+ F G P +
Sbjct: 364 MGNISSIGGELTPKWFEDRAKLSIDIQTRMIEFGIEPIHQMFIGYFPYKEN---SGVNVI 420
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
+ W +K R LD + I + ++Q + +G + + + D F E N
Sbjct: 421 RGSYWSKIKGPDR------LDFNNNDVEFISSVYYKKQKELFGESKY-FAGDLFHEGNNL 473
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178
D E + + + + + +++W++Q W S P + + N + +
Sbjct: 474 YGYDPVELSNKVLKLL---IDNNGENSIWIIQSWSHS-------PSSETIEN-LNRNNTL 522
Query: 179 VLDLFAEVKPIWSTSKQFYGVPY----------IWCMLHNFAGNIEMYGILDSIAFGPVE 228
+LDL +++ W +F + + I+ +L+NF G +YG + +
Sbjct: 523 ILDLHSQLNTRWKGISKFNNMSWKDREFDRSNWIFGVLNNFGGRSGLYGHTRHLLNQFYD 582
Query: 229 ARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAI 288
A+ + N + GV + EGI N + +L++E+ F +K+D+ ++++Y RYG+S +
Sbjct: 583 AKYNSNY-LKGVAHTSEGIGFNNFIDELVTEIIFS-DKLDIDEFVSRYLRNRYGKSDNDL 640
Query: 289 QDAWNVLYHTVYNCT-----DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEA 343
A+N+L TVYN +GA++ VI A P +D
Sbjct: 641 LKAFNILLDTVYNPVINIYHEGASES---VINARPSLD---------------------- 675
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANEL 403
+KS S + H Y++ ++ AL ++ + NE S Y DLID+ + + +NE
Sbjct: 676 -VKS-ASKWGSIHKNYNSEKLEEALRIYFSKYNEFKDSKGYMTDLIDIASEVIINLSNEY 733
Query: 404 FLNIIEAYQLNDAHGVFQL-SRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ N+ + Y N F+L S+RFL ++ +L ++ L ++ L ++ E
Sbjct: 734 YKNLQDYYN-NGEIEFFKLNSQRFLNMILLQANILYYNERKSLQKLIDKLDDLNYDDYFE 792
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESG- 521
N +T +T W+D E LRDY N + ++ Y R +F + E+ +G
Sbjct: 793 DTLIINKKTILTTWYDKQVSEDDGLRDYANTDFYDIVGTLYYNRWKRFFDNIQENAVNGF 852
Query: 522 -DGFRL 526
D +R
Sbjct: 853 YDDYRF 858
>gi|342731751|ref|YP_004770590.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-Japan]
gi|342329206|dbj|BAK55848.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Japan]
Length = 898
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 121/529 (22%), Positives = 227/529 (42%), Gaps = 76/529 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 352 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 408
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 409 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 461
Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 462 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 509
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 510 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 569
Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
+A+ + + + G+ + EG+ N + +L +E+ F E V++ ++ +Y RYG+S
Sbjct: 570 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 627
Query: 288 IQDAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
+ A+N+L TVYN TD GA++ VI A P ++ +
Sbjct: 628 LLVAFNILLDTVYNPVTDIYHEGASES---VINARPSLEIN------------------- 665
Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E
Sbjct: 666 -----SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASE 720
Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ I E Y + + +S++FL L+ +L+ +D L + L ++ +
Sbjct: 721 YYQIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFK 780
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 511
++N + +T W+D E LRDY N D+Y +Y+
Sbjct: 781 DTLKYNKKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 822
>gi|417967717|ref|ZP_12608785.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-co]
gi|380340884|gb|EIA29424.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-co]
Length = 741
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 117/525 (22%), Positives = 226/525 (43%), Gaps = 68/525 (12%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 195 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 251
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 252 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 304
Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 305 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 352
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 353 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 412
Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
+A+ + + + G+ + EG+ N + +L +E+ F E V++ ++ +Y RYG+S
Sbjct: 413 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 470
Query: 288 IQDAWNVLYHTVYN-CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLK 346
+ A+N+L TVYN TD + S+I+ ++ ++
Sbjct: 471 LLVAFNILLDTVYNPVTD----------IYHEGASESVIN-------------ARPSLGI 507
Query: 347 SETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLN 406
+ S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E +
Sbjct: 508 NSASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASEYYQI 567
Query: 407 IIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYE 466
I E Y + + +S++FL L+ +L+ +D L + L ++ + +
Sbjct: 568 IQEYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFKDTLK 627
Query: 467 WNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 511
+N + +T W+D E LRDY N D+Y +Y+
Sbjct: 628 YNKKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 665
>gi|417965571|ref|ZP_12607078.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-4]
gi|380336329|gb|EIA26351.1| Alpha-N-acetylglucosaminidase, partial [Candidatus Arthromitus sp.
SFB-4]
Length = 685
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 121/529 (22%), Positives = 226/529 (42%), Gaps = 76/529 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 146 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 202
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 203 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 255
Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 256 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 303
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 304 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 363
Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
+A+ + + + G+ + EG+ N + +L +E+ F E V++ ++ +Y RYG+S
Sbjct: 364 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 421
Query: 288 IQDAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
+ A+N+L TVYN TD GA++ VI A P + +
Sbjct: 422 LLVAFNILLDTVYNPVTDIYHEGASES---VINARPSLGIN------------------- 459
Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E
Sbjct: 460 -----SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASE 514
Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ I E Y + + +S++FL L+ +L+ +D L + L ++ +
Sbjct: 515 YYQIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFK 574
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 511
++N + +T W+D E LRDY N D+Y +Y+
Sbjct: 575 DTLKYNKKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 616
>gi|384455191|ref|YP_005667784.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Yit]
gi|418016862|ref|ZP_12656425.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|418371995|ref|ZP_12964091.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-SU]
gi|345505596|gb|EGX27892.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-NYU]
gi|346983532|dbj|BAK79208.1| alpha-N-acetylglucosaminidase family protein [Candidatus
Arthromitus sp. SFB-mouse-Yit]
gi|380342872|gb|EIA31299.1| alpha-N-acetylglucosaminidase [Candidatus Arthromitus sp.
SFB-mouse-SU]
Length = 898
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 121/529 (22%), Positives = 226/529 (42%), Gaps = 76/529 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GG L W + + L I R+ E+G+ P+ F G P +
Sbjct: 352 MGNISAVGGELTPKWFEDRAKLSIDIQKRMLEVGIEPIHQMFIGYFPYKEN---SGVNVI 408
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118
G W +K R LD + I + E+Q + G++ + + D F E N
Sbjct: 409 NGGYWSKIKGPDR------LDFNNNNVEFISSVYYEKQRELLGKSKY-FAGDLFHEGANL 461
Query: 119 PPVDSPEYISSLGAAIYSGMQSGD-SDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKL 177
D+ E L + S +++ D+VW++Q W P +++ N + +
Sbjct: 462 YGYDAGE----LSNRVLSLLKNNTGEDSVWIIQSWA-------HNPSSESIEN-LNKDNI 509
Query: 178 VVLDLFAEVKPIWS----------TSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPV 227
++LDL +++ W +K+F +I+ +L+NF G +YG + +
Sbjct: 510 LILDLHSQLNTRWKGISKFNYMSWDNKEFDNSNWIFGILNNFGGRNGLYGHSNHLLRQFY 569
Query: 228 EARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPA 287
+A+ + + + G+ + EG+ N + +L +E+ F E V++ ++ +Y RYG+S
Sbjct: 570 DAKYNSDY-LSGIANTSEGVGFNNFIDELSTELIFSDE-VNMDEFVKRYLKNRYGKSDRD 627
Query: 288 IQDAWNVLYHTVYN-CTD----GATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKE 342
+ A+N+L TVYN TD GA++ VI A P + +
Sbjct: 628 LLVAFNILLDTVYNPVTDIYHEGASES---VINARPSLGIN------------------- 665
Query: 343 AVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANE 402
S + H Y + ++ R +E++I+ +E + Y DLID+ + + A+E
Sbjct: 666 -----SASKWGTIHKNYDSRKLERVIEIYISKYDEFKDNEGYIIDLIDIASEVIINLASE 720
Query: 403 LFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQE 462
+ I E Y + + +S++FL L+ +L+ +D L + L ++ +
Sbjct: 721 YYQIIQEYYNNGNIKYLQLISKKFLNLILLQANILSYNDKKSLQKIINKLDALDYDDYFK 780
Query: 463 KQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 511
++N + +T W+D E LRDY N D+Y +Y+
Sbjct: 781 DTLKYNKKMILTTWYDKLVSEDGGLRDYANT-------DFYDIVGTLYY 822
>gi|297723521|ref|NP_001174124.1| Os04g0650900 [Oryza sativa Japonica Group]
gi|255675839|dbj|BAH92852.1| Os04g0650900, partial [Oryza sativa Japonica Group]
Length = 128
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 49/64 (76%), Positives = 56/64 (87%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+HGWGGPLPQSWLD QL LQKKIL R+Y GM PVLPAFSGN+PAAL++ FPSAK+T
Sbjct: 64 MANMHGWGGPLPQSWLDDQLALQKKILSRMYAFGMFPVLPAFSGNIPAALRSKFPSAKVT 123
Query: 61 QLGN 64
LGN
Sbjct: 124 HLGN 127
>gi|358381741|gb|EHK19415.1| hypothetical protein TRIVIDRAFT_224650 [Trichoderma virens Gv29-8]
Length = 217
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 110/214 (51%), Gaps = 11/214 (5%)
Query: 118 TPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPL-G 175
TPP Y+ + + + ++S D +A+W+ Q WLF+ + FW +++ + +
Sbjct: 2 TPPSGELNYLRNASSNTWKALKSADPEAIWVFQAWLFAQNTTFWTNDRIEVYPGGITIDS 61
Query: 176 KLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT 235
+++LD++ E W ++ +Y P+IWC L N+ I MYG + ++ P+ A E+
Sbjct: 62 DMLILDIWLESMSQWQCAQSYYSKPWIWCELQNYGATINMYGQIQNLTKSPILA-LQESQ 120
Query: 236 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRY--GRSVPAIQDAWN 293
++VG+G+SME + N +V+DL+ A+ +D + ++ RY + +I AW
Sbjct: 121 SLVGLGLSMEAQQSNEIVFDLLLSQAWNCTPIDTNIYFKSWAAARYLSSKRPASIYTAWE 180
Query: 294 VLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISV 327
+ TVY+ T N +++ + P S I V
Sbjct: 181 AVRATVYDNT------NLNMMSSVPKSRSSEIKV 208
>gi|293369245|ref|ZP_06615835.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
gi|292635670|gb|EFF54172.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
Length = 221
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 102/214 (47%), Gaps = 7/214 (3%)
Query: 355 PHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLN 414
P + Y +++ A L ++ + ++Y +DL+++ RQ L Y N + AY+
Sbjct: 11 PTIEYQPKDLVEAWRLLLSVKD--CQRDSYEFDLVNIGRQVLGNYFNVVRDEFTLAYEAG 68
Query: 415 DAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQIT 474
D + + E++ D+D L++CH F L W+ A+ + + + YE NAR+ IT
Sbjct: 69 DIPMMKNRGNKMREILADLDKLVSCHPTFSLHKWITDARDMGHDAASKNYYEMNARSLIT 128
Query: 475 MWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWI 534
+W D+ L DY N+ W+GL YY R + +IE+ E F +++ +
Sbjct: 129 IWGDSYH-----LTDYANRSWAGLTNQYYSVRWDHFINEVIEAAEKKKNFDEEEFFNQSR 183
Query: 535 KLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQ 568
N+W N N GD + ++ +Y KY +
Sbjct: 184 MYENEWVNPSNRISYNEGGDGIKLARQIYKKYAK 217
>gi|293371911|ref|ZP_06618315.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
gi|292633157|gb|EFF51734.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
Length = 289
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 3/98 (3%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M+N+ W GPLP WL+ Q+ LQKKIL R EL M PVLPAF+G+VPA L+ ++P A I
Sbjct: 184 MANIDRWNGPLPMEWLEHQVSLQKKILARERELNMKPVLPAFAGHVPADLKRIYPEADIQ 243
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQ 98
LG W R C + L+ D LF +I + F+++Q
Sbjct: 244 HLGKWAGFADAYR--CNF-LNPNDALFAKIQKLFLDEQ 278
>gi|321458423|gb|EFX69492.1| hypothetical protein DAPPUDRAFT_35389 [Daphnia pulex]
Length = 132
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 78/137 (56%), Gaps = 5/137 (3%)
Query: 388 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGP 447
++DLTRQ++ + + L+ ++E Y ++ + ++ + + L++D+D L+ FLLG
Sbjct: 1 MVDLTRQSMQEIFHLLYSKLLEVYLEKNSTAIEGIAYKMINLLQDLDELIQTGKTFLLGK 60
Query: 448 WLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRA 507
W+ AK E ++ QYEWNAR QIT+W + +RDY K W+G++ DYY P
Sbjct: 61 WIADAKSWGTTEGEKLQYEWNARNQITLWGPRGE-----IRDYAAKKWAGVVADYYKPHW 115
Query: 508 AIYFKYMIESLESGDGF 524
++ + M SL+ F
Sbjct: 116 EVFIREMQMSLDENRAF 132
>gi|322792283|gb|EFZ16267.1| hypothetical protein SINV_02225 [Solenopsis invicta]
Length = 87
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/92 (45%), Positives = 61/92 (66%), Gaps = 5/92 (5%)
Query: 426 FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEAS 485
LEL +D++ +LA FLLG WL AK++A NEE+ + YE+NAR QIT+W N +
Sbjct: 1 LLELFDDLESILASGSNFLLGTWLTQAKEMADNEEERRSYEYNARNQITLWGPNGE---- 56
Query: 486 LLRDYGNKYWSGLLRDYYGPRAAIYFKYMIES 517
+RDY NK WSG++ DY+ PR ++ K + +S
Sbjct: 57 -IRDYANKQWSGVVADYFKPRWELFLKALEKS 87
>gi|326435733|gb|EGD81303.1| alpha-N-acetylglucosaminidase [Salpingoeca sp. ATCC 50818]
Length = 696
Score = 89.0 bits (219), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 71/126 (56%), Gaps = 2/126 (1%)
Query: 134 IYSGMQSGDSDAVWLMQGWL-FSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWST 192
+Y+ M D A+W+ QGW+ D M ++VP G+LV+LD+ AE IW+
Sbjct: 500 VYTTMTKRDPHAIWVYQGWIWLDLDNAQGFSFMSGFTSAVPRGRLVILDMEAEFDEIWAW 559
Query: 193 SKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPV 252
S+ F+ +IW + NF GN MYG + + F +++ +VGVG++MEGI+QNP
Sbjct: 560 SQSFFNTTFIWAAMDNFGGNNGMYGDIQ-LVFDRTRRVFAQSDAVVGVGITMEGIDQNPA 618
Query: 253 VYDLMS 258
Y ++
Sbjct: 619 YYQAIA 624
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 61/116 (52%), Gaps = 18/116 (15%)
Query: 6 GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNW 65
G GGPLP W QQ LQ+ I+ R ELG+ +LPAF GNVPAAL ++P A I+ W
Sbjct: 246 GVGGPLPSQWYKQQWELQRAIVQRQTELGIGSLLPAFQGNVPAALAQLYPHANISN--GW 303
Query: 66 FSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDT-FDENTPP 120
LD DPLF I +++ + ++G T H Y D FD +T P
Sbjct: 304 --------------LDGLDPLFATIADLTMQELIADFGAT-HFYQADGFFDHSTGP 344
>gi|315131339|emb|CBM69278.1| venom protein Ci-120 [Chelonus inanitus]
Length = 165
Score = 88.6 bits (218), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 79/131 (60%), Gaps = 7/131 (5%)
Query: 390 DLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPW 448
D+TRQ+L A ++L + +++ D VF+ L +L D++ +L+ + FL+G W
Sbjct: 1 DVTRQSLQLIAEHVYLKLQQSFHQKDL-AVFKAHANLLMQLFSDLESILSTNKHFLVGKW 59
Query: 449 LESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAA 508
+++A+ L N +++K YE NAR QIT+W N + +RDY NK W+G++ Y+G R +
Sbjct: 60 IKNARSLGTNVQEQKLYELNARNQITLWGPNGE-----IRDYANKQWAGVMSQYFGARWS 114
Query: 509 IYFKYMIESLE 519
+Y + +LE
Sbjct: 115 LYLSVLEFALE 125
>gi|440799252|gb|ELR20307.1| AlphaN-acetylglucosaminidase, putative, partial [Acanthamoeba
castellanii str. Neff]
Length = 389
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 50/132 (37%), Positives = 68/132 (51%), Gaps = 19/132 (14%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAA----------- 49
M N+ GWGGPL +W Q LQKKI+ R GM PVLPAF+G VP A
Sbjct: 261 MGNIQGWGGPLDLAWRLAQAELQKKIVERQRMFGMLPVLPAFAGFVPEASVKFTLGRGGG 320
Query: 50 -----LQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGR 104
++ ++P+A +T+ +W ++ Y L D L+ IG I +E+G
Sbjct: 321 CGEQGIKRIYPTANLTKSADWAGFPH--QYTNVYFLSPLDSLYKTIGSKVIRLVEEEFG- 377
Query: 105 TSHIYNCDTFDE 116
T HIYN DTF+E
Sbjct: 378 TDHIYNADTFNE 389
>gi|84625359|ref|YP_452731.1| hypothetical protein XOO_3702 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84369299|dbj|BAE70457.1| truncated N-acetylglucosaminidase [Xanthomonas oryzae pv. oryzae
MAFF 311018]
Length = 369
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 33/62 (53%), Positives = 43/62 (69%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ G+ PLPQ W+D + VLQK+IL R+ ELGM PVLPAF+G VP A P A+I
Sbjct: 201 MGNIEGYRAPLPQQWIDSKRVLQKQILTRMRELGMQPVLPAFAGYVPKAFAQAHPHARIY 260
Query: 61 QL 62
++
Sbjct: 261 RM 262
>gi|323456608|gb|EGB12475.1| hypothetical protein AURANDRAFT_20306 [Aureococcus anophagefferens]
Length = 243
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/64 (56%), Positives = 47/64 (73%), Gaps = 1/64 (1%)
Query: 5 HGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT-QLG 63
HG GGPLP+++ D QL L K+IL R+ LG+ PVLP+F GNVP AL+++FP A IT Q
Sbjct: 120 HGVGGPLPRTFADAQLALAKRILARMRGLGIVPVLPSFQGNVPPALKDLFPEANITVQAP 179
Query: 64 NWFS 67
+W S
Sbjct: 180 HWTS 183
>gi|390353486|ref|XP_003728120.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Strongylocentrotus
purpuratus]
Length = 385
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 32/50 (64%), Positives = 38/50 (76%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAAL 50
M N+ GWGGP+PQSW QL LQ KIL R+ ELGM PVLPAF+G+VP +
Sbjct: 218 MGNIDGWGGPIPQSWHTNQLALQHKILKRMRELGMIPVLPAFAGHVPKSF 267
>gi|224135741|ref|XP_002322149.1| predicted protein [Populus trichocarpa]
gi|222869145|gb|EEF06276.1| predicted protein [Populus trichocarpa]
Length = 173
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/34 (76%), Positives = 29/34 (85%)
Query: 238 VGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKA 271
VGVGM M+GI+QNPVV DLMS+MAF H KVDVK
Sbjct: 30 VGVGMPMDGIKQNPVVSDLMSKMAFHHNKVDVKG 63
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 33/60 (55%), Gaps = 2/60 (3%)
Query: 510 YFKYMIESLESGD--GFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYL 567
++ + I++ E G F +K + K + R + PVESNG+AL S+WL+ KYL
Sbjct: 97 HYTFQIQNTEFGKWPRFPVKGLEKRMDKASKQLAESRKIIPVESNGNALNISRWLFYKYL 156
>gi|147798252|emb|CAN69797.1| hypothetical protein VITISV_036335 [Vitis vinifera]
Length = 273
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 31/47 (65%), Gaps = 3/47 (6%)
Query: 237 MVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVD---VKAWINQYSVRR 280
MVGVG+ MEGIEQNPVVY+ M EMAF E V + + N + RR
Sbjct: 112 MVGVGVCMEGIEQNPVVYESMFEMAFHSENVQLVVISSTCNTMARRR 158
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.428
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,777,280,274
Number of Sequences: 23463169
Number of extensions: 422024461
Number of successful extensions: 911059
Number of sequences better than 100.0: 515
Number of HSP's better than 100.0 without gapping: 503
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 907731
Number of HSP's gapped (non-prelim): 818
length of query: 575
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 427
effective length of database: 8,886,646,355
effective search space: 3794597993585
effective search space used: 3794597993585
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 80 (35.4 bits)